Identifying disease polymorphisms from case-control genetic association data.
Park, L
2010-12-01
In case-control association studies, it is typical to observe several associated polymorphisms in a gene region. Often the most significantly associated polymorphism is considered to be the disease polymorphism; however, it is not clear whether it is the disease polymorphism or there is more than one disease polymorphism in the gene region. Currently, there is no method that can handle these problems based on the linkage disequilibrium (LD) relationship between polymorphisms. To distinguish real disease polymorphisms from markers in LD, a method that can detect disease polymorphisms in a gene region has been developed. Relying on the LD between polymorphisms in controls, the proposed method utilizes model-based likelihood ratio tests to find disease polymorphisms. This method shows reliable Type I and Type II error rates when sample sizes are large enough, and works better with re-sequenced data. Applying this method to fine mapping using re-sequencing or dense genotyping data would provide important information regarding the genetic architecture of complex traits.
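The abstract above leans on the linkage disequilibrium (LD) between polymorphisms without defining it. As a rough illustration only (not the author's likelihood ratio test), the standard pairwise LD measures D, D' and r² can be computed from two-locus haplotype counts. The function name and argument layout below are assumptions made for this sketch.

```python
def ld_stats(n_AB, n_Ab, n_aB, n_ab):
    """Return (D, D_prime, r2) from counts of the four two-locus haplotypes.

    Locus 1 has alleles A/a and locus 2 has alleles B/b (hypothetical labels).
    """
    n = n_AB + n_Ab + n_aB + n_ab
    p_AB = n_AB / n
    p_A = (n_AB + n_Ab) / n   # frequency of allele A at locus 1
    p_B = (n_AB + n_aB) / n   # frequency of allele B at locus 2

    d = p_AB - p_A * p_B      # raw disequilibrium coefficient
    if d >= 0:
        d_max = min(p_A * (1 - p_B), (1 - p_A) * p_B)
    else:
        d_max = min(p_A * p_B, (1 - p_A) * (1 - p_B))
    d_prime = abs(d) / d_max if d_max > 0 else 0.0

    denom = p_A * (1 - p_A) * p_B * (1 - p_B)
    r2 = d * d / denom if denom > 0 else 0.0
    return d, d_prime, r2


# Two perfectly correlated loci versus two independent loci (made-up counts).
perfect = ld_stats(50, 0, 0, 50)
independent = ld_stats(25, 25, 25, 25)
```

A marker in strong LD with a causal variant (r² near 1) will show nearly the same association signal as the causal variant itself, which is the ambiguity the abstract describes.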
Genomic DNA sequence and cytosine methylation changes of adult rice leaves after seeds space flight
NASA Astrophysics Data System (ADS)
Shi, Jinming
In this study, cytosine methylation changes at CCGG sites and genomic DNA sequence changes in adult rice leaves after seed space flight were detected by the methylation-sensitive amplification polymorphism (MSAP) and amplified fragment length polymorphism (AFLP) techniques, respectively. Rice seeds were planted in the trial field after 4 days of space flight on the Shenzhou-6 spaceship of China. Adult leaves of space-treated rice, including 8 plants chosen randomly and 2 plants with phenotypic mutations, were used for AFLP and MSAP analysis. Polymorphisms in both DNA sequence and cytosine methylation were detected. For MSAP analysis, the average polymorphic frequencies of the on-ground controls, space-treated plants and mutants were 1.3%, 3.1% and 11%, respectively. For AFLP analysis, the average polymorphic frequencies were 1.4%, 2.9% and 8%, respectively. In total, 27 and 22 polymorphic fragments were cloned and sequenced from the MSAP and AFLP analyses, respectively. Nine of the 27 fragments from MSAP analysis showed homology to coding sequence. Of the 22 polymorphic fragments from AFLP analysis, none showed homology to mRNA sequence and eight showed homology to repeat regions or retrotransposon sequences. These results suggest that although both genomic DNA sequence and cytosine methylation status can be affected by space flight, the genomic regions homologous to the fragments recovered by the two analyses differed.
Templated sequence insertion polymorphisms in the human genome
NASA Astrophysics Data System (ADS)
Onozawa, Masahiro; Aplan, Peter
2016-11-01
Templated Sequence Insertion Polymorphism (TSIP) is a recently described form of polymorphism recognized in the human genome, in which a sequence that is templated from a distant genomic region is inserted into the genome, seemingly at random. TSIPs can be grouped into two classes based on nucleotide sequence features at the insertion junctions; Class 1 TSIPs show features of insertions that are mediated via the LINE-1 ORF2 protein, including 1) target-site duplication (TSD), 2) polyadenylation 10-30 nucleotides downstream of a “cryptic” polyadenylation signal, and 3) preference for insertion at a 5’-TTTT/A-3’ sequence. In contrast, class 2 TSIPs show features consistent with repair of a DNA double-strand break via insertion of a DNA “patch” that is derived from a distant genomic region. Survey of a large number of normal human volunteers demonstrates that most individuals have 25-30 TSIPs, and that these TSIPs track with specific geographic regions. Similar to other forms of human polymorphism, we suspect that these TSIPs may be important for the generation of human diversity and genetic diseases.
Markers and mapping revisited: finding your gene.
Jones, Neil; Ougham, Helen; Thomas, Howard; Pasakinskiene, Izolda
2009-01-01
This paper is an update of our earlier review (Jones et al., 1997, Markers and mapping: we are all geneticists now. New Phytologist 137: 165-177), which dealt with the genetics of mapping, in terms of recombination as the basis of the procedure, and covered some of the first generation of markers, including restriction fragment length polymorphisms (RFLPs), random amplified polymorphic DNA (RAPDs), simple sequence repeats (SSRs) and quantitative trait loci (QTLs). In the intervening decade there have been numerous developments in marker science with many new systems becoming available, which are herein described: cleavage amplification polymorphism (CAP), sequence-specific amplification polymorphism (S-SAP), inter-simple sequence repeat (ISSR), sequence tagged site (STS), sequence characterized amplification region (SCAR), selective amplification of microsatellite polymorphic loci (SAMPL), single nucleotide polymorphism (SNP), expressed sequence tag (EST), sequence-related amplified polymorphism (SRAP), target region amplification polymorphism (TRAP), microarrays, diversity arrays technology (DArT), single-strand conformation polymorphism (SSCP), denaturing gradient gel electrophoresis (DGGE), temperature gradient gel electrophoresis (TGGE) and methylation-sensitive PCR. In addition there has been an explosion of knowledge and databases in the area of genomics and bioinformatics. The number of flowering plant ESTs is c. 19 million and counting, with all the opportunity that this provides for gene-hunting, while the survey of bioinformatics and computer resources points to a rapid growth point for future activities in unravelling and applying the burst of new information on plant genomes. A case study is presented on tracking down a specific gene (stay-green (SGR), a post-transcriptional senescence regulator) using the full suite of mapping tools and comparative mapping resources. 
We end with a brief speculation on how genome analysis may progress into the future of this highly dynamic arena of plant science.
USDA-ARS's Scientific Manuscript database
Single-nucleotide Polymorphism (SNP) markers are by far the most common form of DNA polymorphism in a genome. The objectives of this study were to discover SNPs in common bean comparing sequences from coding and non-coding regions obtained from Genbank and genomic DNA and to compare sequencing resu...
Prion gene haplotypes of U.S. cattle
Clawson, Michael L; Heaton, Michael P; Keele, John W; Smith, Timothy PL; Harhay, Gregory P; Laegreid, William W
2006-01-01
Background Bovine spongiform encephalopathy (BSE) is a fatal neurological disorder characterized by abnormal deposits of a protease-resistant isoform of the prion protein. Characterizing linkage disequilibrium (LD) and haplotype networks within the bovine prion gene (PRNP) is important for 1) testing rare or common PRNP variation for an association with BSE and 2) interpreting any association of PRNP alleles with BSE susceptibility. The objective of this study was to identify polymorphisms and haplotypes within PRNP from the promoter region through the 3'UTR in a diverse sample of U.S. cattle genomes. Results A 25.2-kb genomic region containing PRNP was sequenced from 192 diverse U.S. beef and dairy cattle. Sequence analyses identified 388 total polymorphisms, of which 287 have not previously been reported. The polymorphism alleles define PRNP by regions of high and low LD. High LD is present between alleles in the promoter region through exon 2 (6.7 kb). PRNP alleles within the majority of intron 2, the entire coding sequence and the untranslated region of exon 3 are in low LD (18.0 kb). Two haplotype networks, one representing the region of high LD and the other the region of low LD yielded nineteen different combinations that represent haplotypes spanning PRNP. The haplotype combinations are tagged by 19 polymorphisms (htSNPS) which characterize variation within and across PRNP. Conclusion The number of polymorphisms in the prion gene region of U.S. cattle is nearly four times greater than previously described. These polymorphisms define PRNP haplotypes that may influence BSE susceptibility in cattle. PMID:17092337
Simple sequence repeats in Escherichia coli: abundance, distribution, composition, and polymorphism.
Gur-Arie, R; Cohen, C J; Eitan, Y; Shelef, L; Hallerman, E M; Kashi, Y
2000-01-01
Computer-based genome-wide screening of the DNA sequence of Escherichia coli strain K12 revealed tens of thousands of tandem simple sequence repeat (SSR) tracts, with motifs ranging from 1 to 6 nucleotides. SSRs were well distributed throughout the genome. Mononucleotide SSRs were over-represented in noncoding regions and under-represented in open reading frames (ORFs). Nucleotide composition of mono- and dinucleotide SSRs, both in ORFs and in noncoding regions, differed from that of the genomic region in which they occurred, with 93% of all mononucleotide SSRs proving to be of A or T. Computer-based analysis of the fine position of every SSR locus in the noncoding portion of the genome relative to downstream ORFs showed SSRs located in areas that could affect gene regulation. DNA sequences at 14 arbitrarily chosen SSR tracts were compared among E. coli strains. Polymorphisms of SSR copy number were observed at four of seven mononucleotide SSR tracts screened, with all polymorphisms occurring in noncoding regions. SSR polymorphism could prove important as a genome-wide source of variation, both for practical applications (including rapid detection, strain identification, and detection of loci affecting key phenotypes) and for evolutionary adaptation of microbes.
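The genome-wide screen described above can be approximated with a regular expression that looks for a short motif followed by tandem copies of itself. This is a minimal sketch, not the authors' software: the function name, the uppercase A/C/G/T alphabet and the `min_repeats` threshold are assumptions, since the abstract does not state the cut-offs used.

```python
import re


def find_ssrs(seq, max_motif=6, min_repeats=3):
    """Return a list of (start, motif, copies) for tandem repeat tracts.

    The lazy quantifier makes the regex prefer the shortest repeating motif,
    so "AAAAAA" is reported as six copies of "A" rather than three of "AA".
    """
    pattern = re.compile(
        r"([ACGT]{1,%d}?)\1{%d,}" % (max_motif, min_repeats - 1)
    )
    tracts = []
    for match in pattern.finditer(seq):
        motif = match.group(1)
        copies = len(match.group(0)) // len(motif)
        tracts.append((match.start(), motif, copies))
    return tracts


# Toy sequences (hypothetical, not taken from E. coli K12).
dinucleotide_hits = find_ssrs("GGCATATATATCC")
mononucleotide_hits = find_ssrs("CGAAAAAGT", min_repeats=5)
```

Comparing the `copies` value at the same locus across strains is one simple way to flag the copy-number polymorphisms the abstract reports.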
Chen, Y. C.; Eisner, J. D.; Kattar, M. M.; Rassoulian-Barrett, S. L.; LaFe, K.; Yarfitz, S. L.; Limaye, A. P.; Cookson, B. T.
2000-01-01
Identification of medically relevant yeasts can be time-consuming and inaccurate with current methods. We evaluated PCR-based detection of sequence polymorphisms in the internal transcribed spacer 2 (ITS2) region of the rRNA genes as a means of fungal identification. Clinical isolates (401), reference strains (6), and type strains (27), representing 34 species of yeasts were examined. The length of PCR-amplified ITS2 region DNA was determined with single-base precision in less than 30 min by using automated capillary electrophoresis. Unique, species-specific PCR products ranging from 237 to 429 bp were obtained from 92% of the clinical isolates. The remaining 8%, divided into groups with ITS2 regions which differed by ≤2 bp in mean length, all contained species-specific DNA sequences easily distinguishable by restriction enzyme analysis. These data, and the specificity of length polymorphisms for identifying yeasts, were confirmed by DNA sequence analysis of the ITS2 region from 93 isolates. Phenotypic and ITS2-based identification was concordant for 427 of 434 yeast isolates examined using sequence identity of ≥99%. Seven clinical isolates contained ITS2 sequences that did not agree with their phenotypic identification, and ITS2-based phylogenetic analyses indicate the possibility of new or clinically unusual species in the Rhodotorula and Candida genera. This work establishes an initial database, validated with over 400 clinical isolates, of ITS2 length and sequence polymorphisms for 34 species of yeasts. We conclude that size and restriction analysis of PCR-amplified ITS2 region DNA is a rapid and reliable method to identify clinically significant yeasts, including potentially new or emerging pathogenic species. PMID:10834993
Kumar, Pankaj; Chaitanya, Pasumarthy S; Nagarajaram, Hampapathalu A
2011-01-01
PSSRdb (Polymorphic Simple Sequence Repeats database) (http://www.cdfd.org.in/PSSRdb/) is a relational database of polymorphic simple sequence repeats (PSSRs) extracted from 85 different species of prokaryotes. Simple sequence repeats (SSRs) are tandem repeats of nucleotide motifs 1-6 bp in size and are highly polymorphic. SSR mutations in and around coding regions affect transcription and translation of genes. Such changes underpin the phase variation and antigenic variation seen in some bacteria. Although SSR-mediated phase variation and antigenic variation have been well studied in some bacteria, many other prokaryotic species have yet to be investigated for SSR-mediated adaptive and other evolutionary advantages. As part of our ongoing studies on SSR polymorphism in prokaryotes, we compared the genome sequences of the various strains and isolates available for 85 different species of prokaryotes, extracted a number of SSRs showing length variation, and created a relational database called PSSRdb. This database gives useful information such as the location of PSSRs in genomes, length variation across genomes, and the regions harboring PSSRs. The information provided in this database is very useful for further research and analysis of SSRs in prokaryotes.
2010-01-01
Background The cultivated olive (Olea europaea L.) is the most agriculturally important species of the Oleaceae family. Although many studies have been performed on plastid polymorphisms to evaluate taxonomy, phylogeny and phylogeography of Olea subspecies, only a few polymorphic regions discriminating among the agronomically and economically important olive cultivars have been identified. The objective of this study was to sequence the entire plastome of olive and analyze many potential polymorphic regions to develop new inter-cultivar genetic markers. Results The complete plastid genome of the olive cultivar Frantoio was determined by direct sequence analysis using universal and novel PCR primers designed to amplify all overlapping regions. The chloroplast genome of the olive has an organisation and gene order that is conserved among numerous Angiosperm species and does not contain any of the inversions, gene duplications, insertions, inverted repeat expansions and gene/intron losses that have been found in the chloroplast genomes of the genera Jasminum and Menodora, from the same family as Olea. The annotated sequence was used to evaluate the content of coding genes, the extent and distribution of repeated and long dispersed sequences, and the nucleotide composition pattern. These analyses provided essential information for structural, functional and comparative genomic studies in olive plastids. Furthermore, the alignment of the olive plastome sequence to those of other varieties and species identified 30 new organellar polymorphisms within the cultivated olive. Conclusions In addition to identifying mutations that may play a functional role in modifying the metabolism and adaptation of olive cultivars, the new chloroplast markers represent a valuable tool to assess the level of olive intercultivar plastome variation for use in population genetic analysis, phylogenesis, cultivar characterisation and DNA food tracking. PMID:20868482
Simple Sequence Repeats in Escherichia coli: Abundance, Distribution, Composition, and Polymorphism
Gur-Arie, Riva; Cohen, Cyril J.; Eitan, Yuval; Shelef, Leora; Hallerman, Eric M.; Kashi, Yechezkel
2000-01-01
Computer-based genome-wide screening of the DNA sequence of Escherichia coli strain K12 revealed tens of thousands of tandem simple sequence repeat (SSR) tracts, with motifs ranging from 1 to 6 nucleotides. SSRs were well distributed throughout the genome. Mononucleotide SSRs were over-represented in noncoding regions and under-represented in open reading frames (ORFs). Nucleotide composition of mono- and dinucleotide SSRs, both in ORFs and in noncoding regions, differed from that of the genomic region in which they occurred, with 93% of all mononucleotide SSRs proving to be of A or T. Computer-based analysis of the fine position of every SSR locus in the noncoding portion of the genome relative to downstream ORFs showed SSRs located in areas that could affect gene regulation. DNA sequences at 14 arbitrarily chosen SSR tracts were compared among E. coli strains. Polymorphisms of SSR copy number were observed at four of seven mononucleotide SSR tracts screened, with all polymorphisms occurring in noncoding regions. SSR polymorphism could prove important as a genome-wide source of variation, both for practical applications (including rapid detection, strain identification, and detection of loci affecting key phenotypes) and for evolutionary adaptation of microbes.[The sequence data described in this paper have been submitted to the GenBank data library under accession numbers AF209020–209030 and AF209508–209518.] PMID:10645951
Molecular basis of length polymorphism in the human zeta-globin gene complex.
Goodbourn, S E; Higgs, D R; Clegg, J B; Weatherall, D J
1983-01-01
The length polymorphism between the human zeta-globin gene and its pseudogene is caused by an allele-specific variation in the copy number of a tandemly repeating 36-base-pair sequence. This sequence is related to a tandemly repeated 14-base-pair sequence in the 5' flanking region of the human insulin gene, which is known to cause length polymorphism, and to a repetitive sequence in intervening sequence (IVS) 1 of the pseudo-zeta-globin gene. Evidence is presented that the latter is also of variable length, probably because of differences in the copy number of the tandem repeat. The homology between the three length polymorphisms may be an indication of the presence of a more widespread group of related sequences in the human genome, which might be useful for generalized linkage studies. PMID:6308667
Martins, Ademir Jesus; Lins, Rachel Mazzei Moura de Andrade; Linss, Jutta Gerlinde Birgitt; Peixoto, Alexandre Afranio; Valle, Denise
2009-07-01
The nature of pyrethroid resistance in Brazilian populations of Aedes aegypti was investigated. Quantification of enzymes related to metabolic resistance in two distinct populations, located in the Northeast and Southeast regions, revealed increases in Glutathione-S-transferase (GST) and Esterase levels. Additionally, polymorphism was found in the IIS6 region of the Ae. aegypti voltage-gated sodium channel (AaNa(V)), the pyrethroid target site. Sequences were classified into two haplotype groups, A and B, according to the size of the intron in that region. Rockefeller, a susceptible control lineage, contains only B sequences. In field populations, some A sequences present a substitution at site 1011 (Ile/Met). When resistant and susceptible individuals were compared, the frequencies of both A (with the Met mutation) and B sequences were slightly increased in resistant specimens. The involvement of the AaNa(V) polymorphism in pyrethroid resistance and the metabolic mechanisms that lead to potential cross-resistance between organophosphates and pyrethroids are discussed.
Norman, Paul J.; Norberg, Steven J.; Guethlein, Lisbeth A.; Nemat-Gorgani, Neda; Royce, Thomas; Wroblewski, Emily E.; Dunn, Tamsen; Mann, Tobias; Alicata, Claudia; Hollenbach, Jill A.; Chang, Weihua; Shults Won, Melissa; Gunderson, Kevin L.; Abi-Rached, Laurent; Ronaghi, Mostafa; Parham, Peter
2017-01-01
The most polymorphic part of the human genome, the MHC, encodes over 160 proteins of diverse function. Half of them, including the HLA class I and II genes, are directly involved in immune responses. Consequently, the MHC region strongly associates with numerous diseases and clinical therapies. Notoriously, the MHC region has been intractable to high-throughput analysis at complete sequence resolution, and current reference haplotypes are inadequate for large-scale studies. To address these challenges, we developed a method that specifically captures and sequences the 4.8-Mbp MHC region from genomic DNA. For 95 MHC homozygous cell lines we assembled, de novo, a set of high-fidelity contigs and a sequence scaffold, representing a mean 98% of the target region. Included are six alternative MHC reference sequences of the human genome that we completed and refined. Characterization of the sequence and structural diversity of the MHC region shows the approach accurately determines the sequences of the highly polymorphic HLA class I and HLA class II genes and the complex structural diversity of complement factor C4A/C4B. It has also uncovered extensive and unexpected diversity in other MHC genes; an example is MUC22, which encodes a lung mucin and exhibits more coding sequence alleles than any HLA class I or II gene studied here. More than 60% of the coding sequence alleles analyzed were previously uncharacterized. We have created a substantial database of robust reference MHC haplotype sequences that will enable future population scale studies of this complicated and clinically important region of the human genome. PMID:28360230
Analysis for complete genomic sequence of HLA-B and HLA-C alleles in the Chinese Han population.
Zhu, F; He, Y; Zhang, W; He, J; He, J; Xu, X; Lv, H; Yan, L
2011-08-01
In the present study, we determined the complete genomic sequences and analysed the intron polymorphism of a subset of HLA-B and HLA-C alleles in the Chinese Han population. DNA fragments of over 3.0 kb from the HLA-B and HLA-C loci, each spanning from part of the 5' untranslated region to the 3' noncoding region, were amplified by polymerase chain reaction, and the amplified products were then sequenced. Full-length nucleotide sequences of 14 HLA-B alleles and 10 HLA-C alleles were obtained and have been submitted to GenBank and the IMGT/HLA database. Two novel alleles, HLA-B*52:01:01:02 and HLA-B*59:01:01:02, were identified, and the complete genomic sequence of HLA-B*52:01:01:01 is reported for the first time. In total, 157 and 167 polymorphic positions were found in the full-length genomic sequences of the HLA-B and HLA-C loci, respectively. Our results suggest that many single nucleotide polymorphisms exist in the exon and intron regions, and the data can provide useful information for understanding the evolution of HLA-B and HLA-C alleles. © 2011 Blackwell Publishing Ltd.
Pramanik, Sreemanta; Li, Honghua
2002-01-01
Direct polymerase chain reaction (PCR) detection of insertion/deletion (indel) polymorphisms requires sample homozygosity. For the indel polymorphisms that have the deletion allele with a relatively low frequency in the autosomal regions, direct PCR detection becomes difficult or impossible. The present study is, to our knowledge, the first designed to directly detect indel polymorphisms in a human autosomal region (i.e., the immunoglobulin VH region), through use of single haploid sperm cells as subjects. Unique marker sequences (n=32), spaced at ∼5-kb intervals, were selected near the 3′ end of the VH region. A two-round multiplex PCR protocol was used to amplify these sequences from single sperm samples from nine unrelated healthy donors. The parental haplotypes of the donors were determined by examining the presence or absence of these markers. Seven clustered markers in 6 of the 18 haplotypes were missing and likely represented a 35–40-kb indel polymorphism. The genotypes of the donors, with respect to this polymorphism, perfectly matched the expectation under Hardy-Weinberg equilibrium. Three VH gene segments, of which two are functional, are affected by this polymorphism. According to these results, >10% of individuals in the human population may not have these gene segments in their genome, and ∼44% may have only one copy of these gene segments. The biological impact of this polymorphism would be very interesting to study. The approach used in the present study could be applied to understand the physical structure and diversity of all other autosomal regions. PMID:12442231
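The population estimates quoted above follow from Hardy-Weinberg proportions. As a worked check (a sketch, assuming the deletion-allele frequency is estimated directly from the 6 of 18 haplotypes that lacked the markers; the function name is hypothetical):

```python
def hardy_weinberg(q):
    """Expected genotype frequencies for an allele of frequency q under HWE."""
    p = 1.0 - q
    return {
        "no_deletion": p * p,          # two intact copies
        "heterozygous": 2 * p * q,     # one copy of the gene segments
        "homozygous_deletion": q * q,  # gene segments absent
    }


# 6 of the 18 parental haplotypes carried the deletion, so q = 1/3.
expected = hardy_weinberg(6 / 18)
```

With q = 1/3, the homozygous-deletion class is 1/9 (about 11%) and the heterozygous class is 4/9 (about 44%), consistent with the ">10%" and "∼44%" figures in the abstract.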
DOE Office of Scientific and Technical Information (OSTI.GOV)
Cuomo, Christina A.; Guldener, Ulrich; Xu, Jin Rong
2007-09-07
We sequenced and annotated the genome of the filamentous fungus Fusarium graminearum, a major pathogen of cultivated cereals. Very few repetitive sequences were detected, and the process of repeat-induced point mutation, in which duplicated sequences are subject to extensive mutation, may partially account for the reduced repeat content and apparent low number of paralogous (ancestrally duplicated) genes. A second strain of F. graminearum contained more than 10,000 single-nucleotide polymorphisms, which were frequently located near telomeres and within other discrete chromosomal segments. Many highly polymorphic regions contained sets of genes implicated in plant-fungus interactions and were unusually divergent, with higher rates of recombination. These regions of genome innovation may result from selection due to interactions of F. graminearum with its plant hosts.
Cloning of polymorphisms (COP): enrichment of polymorphic sequences from complex genomes
Li, Jingfeng; Wang, Fuli; Zabarovska, Veronika; Wahlestedt, Claes; Zabarovsky, Eugene R.
2000-01-01
Here we describe a new procedure (cloning of polymorphisms, COP) for enrichment of single nucleotide polymorphisms (SNPs) that represent restriction fragment length polymorphisms (RFLPs). COP would be applicable to the isolation of SNPs from particular regions of the genome, e.g. CpG islands, chromosomal bands, YACs or PAC contigs. A combination of digestion with restriction enzymes, treatment with uracil-DNA glycosylase and mung bean nuclease, PCR amplification and purification with streptavidin magnetic beads was used to isolate polymorphic sequences from the genomes of two human samples. After only two cycles of enrichment, 80% of the isolated clones were found to contain RFLPs. A simple method for the PCR detection of these polymorphisms was also developed. PMID:10606669
Molecular characterization of the canine mitochondrial DNA control region for forensic applications.
Eichmann, Cordula; Parson, Walther
2007-09-01
The canine mitochondrial DNA (mtDNA) control region of 133 dogs living in the area around Innsbruck, Austria was sequenced. A total of 40 polymorphic sites were observed in the first hypervariable segment and 15 in the second, which resulted in the differentiation of 40 distinct haplotypes. We observed five nucleotide positions that were highly polymorphic within different haplogroups, and they represent good candidates for mtDNA screening. We found five point heteroplasmic positions; all located in HVS-I and a polythymine region in HVS-II, the latter often being associated with length heteroplasmy. In contrast to human mtDNA, the canine control region contains a hypervariable 10 nucleotide repeat region, which is located between the two hypervariable regions. In our population sample, we observed eight different repeat types, which we characterized by direct sequencing and fragment length analysis. The discrimination power of the canine mtDNA control region was 0.93, not taking the polymorphic repeat region into consideration.
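The abstract does not say how the discrimination power of 0.93 was calculated. One common definition is the probability that two randomly drawn individuals carry different haplotypes, 1 − Σp², sketched below. The function name and the haplotype counts are hypothetical and are not the study's data.

```python
def discrimination_power(haplotype_counts):
    """Return 1 - sum(p_i^2), where p_i is the frequency of haplotype i.

    This is one common definition; some authors apply a small-sample
    correction factor of n / (n - 1), which is omitted here.
    """
    total = sum(haplotype_counts)
    return 1.0 - sum((count / total) ** 2 for count in haplotype_counts)


# Hypothetical sample: one haplotype seen twice and two seen once each.
toy_power = discrimination_power([2, 1, 1])
```

The value approaches 1 when many haplotypes occur at similarly low frequencies and falls when a few haplotypes dominate the sample.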
BIALLELIC POLYMORPHISM IN THE INTRON REGION OF B-TUBULIN GENE OF CRYPTOSPORIDIUM PARASITES
Nucleotide sequencing of polymerase chain reaction-amplified intron region of the Cryptosporidium parvum B-tubulin gene in 26 human and 15 animal isolates revealed distinct genetic polymorphism between the human and bovine genotypes. The separation of 2 genotypes of C. parvum is...
Hernández-Martínez, Miguel Ángel; Escalante, Ananías A.; Arévalo-Herrera, Myriam; Herrera, Sócrates
2011-01-01
Circumsporozoite (CS) protein is a malaria antigen involved in sporozoite invasion of hepatocytes, and thus considered to have good vaccine potential. We evaluated the polymorphism of the Plasmodium vivax CS gene in 24 parasite isolates collected from malaria-endemic areas of Colombia. We sequenced 27 alleles, most of which (25/27) corresponded to the VK247 genotype and the remainder to the VK210 type. All VK247 alleles presented a mutation (Gly → Asn) at position 28 in the N-terminal region, whereas the C-terminal presented three insertions: the ANKKAGDAG, which is common in all VK247 isolates; 12 alleles presented the insertion GAGGQAAGGNAANKKAGDAG; and 5 alleles presented the insertion GGNAGGNA. Both repeat regions were polymorphic in gene sequence and size. Sequences coding for B-, T-CD4+, and T-CD8+ cell epitopes were found to be conserved. This study confirms the high polymorphism of the repeat domain and the highly conserved nature of the flanking regions. PMID:21292878
Zhao, Ying; Tsang, Chi-Ching; Xiao, Meng; Cheng, Jingwei; Xu, Yingchun; Lau, Susanna K P; Woo, Patrick C Y
2015-10-22
Internal transcribed spacer region (ITS) sequencing is the most extensively used technology for accurate molecular identification of fungal pathogens in clinical microbiology laboratories. Intra-genomic ITS sequence heterogeneity, which makes fungal identification based on direct sequencing of PCR products difficult, has rarely been reported in pathogenic fungi. During the process of performing ITS sequencing on 71 yeast strains isolated from various clinical specimens, direct sequencing of the PCR products showed ambiguous sequences in six of them. After cloning the PCR products into plasmids for sequencing, interpretable sequencing electropherograms could be obtained. For each of the six isolates, 10-49 clones were selected for sequencing and two to seven intra-genomic ITS copies were detected. The identities of these six isolates were confirmed to be Candida glabrata (n=2), Pichia (Candida) norvegensis (n=2), Candida tropicalis (n=1) and Saccharomyces cerevisiae (n=1). Multiple sequence alignment revealed that one to four intra-genomic ITS polymorphic sites were present in the six isolates, and all these polymorphic sites were located in the ITS1 and/or ITS2 regions. We report and describe the first evidence of intra-genomic ITS sequence heterogeneity in four different pathogenic yeasts, which occurred exclusively in the ITS1 and ITS2 spacer regions for the six isolates in this study.
Zhao, Ying; Tsang, Chi-Ching; Xiao, Meng; Cheng, Jingwei; Xu, Yingchun; Lau, Susanna K. P.; Woo, Patrick C. Y.
2015-01-01
Internal transcribed spacer region (ITS) sequencing is the most extensively used technology for accurate molecular identification of fungal pathogens in clinical microbiology laboratories. Intra-genomic ITS sequence heterogeneity, which makes fungal identification based on direct sequencing of PCR products difficult, has rarely been reported in pathogenic fungi. During the process of performing ITS sequencing on 71 yeast strains isolated from various clinical specimens, direct sequencing of the PCR products showed ambiguous sequences in six of them. After cloning the PCR products into plasmids for sequencing, interpretable sequencing electropherograms could be obtained. For each of the six isolates, 10–49 clones were selected for sequencing and two to seven intra-genomic ITS copies were detected. The identities of these six isolates were confirmed to be Candida glabrata (n = 2), Pichia (Candida) norvegensis (n = 2), Candida tropicalis (n = 1) and Saccharomyces cerevisiae (n = 1). Multiple sequence alignment revealed that one to four intra-genomic ITS polymorphic sites were present in the six isolates, and all these polymorphic sites were located in the ITS1 and/or ITS2 regions. We report and describe the first evidence of intra-genomic ITS sequence heterogeneity in four different pathogenic yeasts, which occurred exclusively in the ITS1 and ITS2 spacer regions for the six isolates in this study. PMID:26506340
Ezquerra, M; Campdelacreu, J; Munoz, E; Oliva, R; Tolosa, E
2004-01-01
Objectives: To search for genetic changes in the 3'untranslated region (3'UTR) of tau and adjacent sequence LOC147077, and in the coding region of STH in PSP patients. Methods: The study included 57 PSP patients and 83 healthy controls. The genetic analysis of each region was performed through sequencing. The Q7R polymorphism was studied through restriction enzyme and electrophoresis analysis. Results: No mutations were found in the regions analysed. The QQ genotype of the STH polymorphism was over-represented in participants with PSP (91.5%) compared with control subjects (47%) (p⩽0.00001). This genotype co-segregated with the H1/H1 haplotype in our PSP cases. Conclusions: Our results do not support a major role for the tau 3'UTR in PSP genetics. The QQ genotype of STH confers susceptibility for PSP and is in linkage disequilibrium with the H1/H1 haplotype. PMID:14707330
Long interspersed repeated DNA (LINE) causes polymorphism at the rat insulin 1 locus.
Lakshmikumaran, M S; D'Ambrosio, E; Laimins, L A; Lin, D T; Furano, A V
1985-09-01
The insulin 1, but not the insulin 2, locus is polymorphic (i.e., exhibits allelic variation) in rats. Restriction enzyme analysis and hybridization studies showed that the polymorphic region is 2.2 kilobases upstream of the insulin 1 coding region and is due to the presence or absence of an approximately 2.7-kilobase repeated DNA element. DNA sequence determination showed that this DNA element is a member of a long interspersed repeated DNA family (LINE) that is highly repeated (greater than 50,000 copies) and highly transcribed in the rat. Although the presence or absence of LINE sequences at the insulin 1 locus occurs in both the homozygous and heterozygous states, LINE-containing insulin 1 alleles are more prevalent in the rat population than are alleles without LINEs. Restriction enzyme analysis of the LINE-containing alleles indicated that at least two versions of the LINE sequence may be present at the insulin 1 locus in different rats. Either repeated transposition of LINE sequences or gene conversion between the resident insulin 1 LINE and other sequences in the genome are possible explanations for this.
2013-01-01
Background: APOAI, a member of the APOAI/CIII/IV/V gene cluster on chromosome 11q23-24, encodes a major protein component of HDL that has been associated with serum lipid levels. The aim of this study was to determine the genetic association of polymorphisms in the APOAI promoter region with plasma lipid levels in a cohort of healthy Kuwaiti volunteers. Methods: A 435 bp region of the APOAI promoter was analyzed by re-sequencing in 549 Kuwaiti samples. DNA was extracted from blood taken from 549 healthy Kuwaiti volunteers who had fasted for the previous 12 h. Univariate and multivariate analysis was used to determine allele association with serum lipid levels. Results: The target sequence included a partial segment of the promoter region, 5’UTR and exon 1 located between nucleotides −141 to +294 upstream of the APOAI gene on chromosome 11. No novel single nucleotide polymorphisms (SNPs) were observed. The sequences obtained were deposited with the NCBI GenBank with accession number [GenBank: JX438706]. The allelic frequencies for the three SNPs were as follows: APOAI rs670G = 0.807; rs5069C = 0.964; rs1799837G = 0.997 and found to be in HWE. A significant association (p < 0.05) was observed for the APOAI rs670 polymorphism with increased serum LDL-C. Multivariate analysis showed that APOAI rs670 was an independent predictive factor when controlling for age, sex and BMI for both LDL-C (OR: 1.66, p = 0.014) and TC (OR: 1.77, p = 0.006) levels. Conclusion: This study is the first to report sequence analysis of the APOAI promoter in an Arab population. The unexpected positive association found between the APOAI rs670 polymorphism and increased levels of LDL-C and TC may be due to linkage disequilibrium with other polymorphisms in candidate and neighboring genes known to be associated with lipid metabolism and transport. PMID:24028463
Al-Bustan, Suzanne A; Al-Serri, Ahmad E; Annice, Babitha G; Alnaqeeb, Majed A; Ebrahim, Ghada A
2013-09-12
APOAI, a member of the APOAI/CIII/IV/V gene cluster on chromosome 11q23-24, encodes a major protein component of HDL that has been associated with serum lipid levels. The aim of this study was to determine the genetic association of polymorphisms in the APOAI promoter region with plasma lipid levels in a cohort of healthy Kuwaiti volunteers. A 435 bp region of the APOAI promoter was analyzed by re-sequencing in 549 Kuwaiti samples. DNA was extracted from blood taken from 549 healthy Kuwaiti volunteers who had fasted for the previous 12 h. Univariate and multivariate analysis was used to determine allele association with serum lipid levels. The target sequence included a partial segment of the promoter region, 5'UTR and exon 1 located between nucleotides -141 to +294 upstream of the APOAI gene on chromosome 11. No novel single nucleotide polymorphisms (SNPs) were observed. The sequences obtained were deposited with the NCBI GenBank with accession number [GenBank: JX438706]. The allelic frequencies for the three SNPs were as follows: APOAI rs670G = 0.807; rs5069C = 0.964; rs1799837G = 0.997 and found to be in HWE. A significant association (p < 0.05) was observed for the APOAI rs670 polymorphism with increased serum LDL-C. Multivariate analysis showed that APOAI rs670 was an independent predictive factor when controlling for age, sex and BMI for both LDL-C (OR: 1.66, p = 0.014) and TC (OR: 1.77, p = 0.006) levels. This study is the first to report sequence analysis of the APOAI promoter in an Arab population. The unexpected positive association found between the APOAI rs670 polymorphism and increased levels of LDL-C and TC may be due to linkage disequilibrium with other polymorphisms in candidate and neighboring genes known to be associated with lipid metabolism and transport.
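The statement that the three SNPs were "found to be in HWE" can be illustrated with a short calculation. The sketch below is not the authors' analysis: it only shows the genotype frequencies expected under Hardy-Weinberg equilibrium for a given allele frequency (the rs670G value of 0.807 comes from the abstract), plus a chi-square statistic for observed genotype counts. The function names are hypothetical, and any genotype counts passed to the second function would be the reader's own data, since the abstract reports none.

```python
def hardy_weinberg_expected(p):
    """Expected (AA, Aa, aa) genotype frequencies for allele frequency p of allele A."""
    if not 0.0 <= p <= 1.0:
        raise ValueError("allele frequency must lie in [0, 1]")
    q = 1.0 - p
    return p * p, 2.0 * p * q, q * q


def hwe_chi_square(n_aa, n_ab, n_bb):
    """Chi-square statistic comparing observed genotype counts with HWE expectations.

    The allele frequency is estimated from the counts themselves, so the
    statistic has one degree of freedom for a biallelic SNP.
    """
    n = n_aa + n_ab + n_bb
    if n == 0:
        raise ValueError("at least one genotype must be observed")
    p = (2 * n_aa + n_ab) / (2 * n)
    expected = [f * n for f in hardy_weinberg_expected(p)]
    observed = [n_aa, n_ab, n_bb]
    # Skip classes with zero expectation to avoid division by zero.
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected) if e > 0)


if __name__ == "__main__":
    # rs670 G allele frequency as reported in the abstract.
    gg, ga, aa = hardy_weinberg_expected(0.807)
    print(f"GG={gg:.3f} GA={ga:.3f} AA={aa:.3f}")
```

A statistic below 3.841 (the 5% critical value for one degree of freedom) would be consistent with equilibrium at that significance level.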
Kong, Fanrong; Tong, Zhongsheng; Chen, Xiaoyou; Sorrell, Tania; Wang, Bin; Wu, Qixuan; Ellis, David; Chen, Sharon
2008-01-01
DNA sequencing analyses have demonstrated relatively limited polymorphisms within the fungal internal transcribed spacer (ITS) regions among Trichophyton spp. We sequenced the ITS region (ITS1, 5.8S, and ITS2) for 42 dermatophytes belonging to seven species (Trichophyton rubrum, T. mentagrophytes, T. soudanense, T. tonsurans, Epidermophyton floccosum, Microsporum canis, and M. gypseum) and developed a novel padlock probe and rolling-circle amplification (RCA)-based method for identification of single nucleotide polymorphisms (SNPs) that could be exploited to differentiate between Trichophyton spp. Sequencing results demonstrated intraspecies genetic variation for T. tonsurans, T. mentagrophytes, and T. soudanense but not T. rubrum. Signature sets of SNPs between T. rubrum and T. soudanense (4-bp difference) and T. violaceum and T. soudanense (3-bp difference) were identified. The RCA assay correctly identified five Trichophyton species. Although the use of two “group-specific” probes targeting both the ITS1 and the ITS2 regions were required to identify T. soudanense, the other species were identified by single ITS1- or ITS2-targeted species-specific probes. There was good agreement between ITS sequencing and the RCA assay. Despite limited genetic variation between Trichophyton spp., the sensitive, specific RCA-based SNP detection assay showed potential as a simple, reproducible method for the rapid (2-h) identification of Trichophyton spp. PMID:18234865
Guimarães, Lilian O; Wunderlich, Gerhard; Alves, João M P; Bueno, Marina G; Röhe, Fabio; Catão-Dias, José L; Neves, Amanda; Malafronte, Rosely S; Curado, Izilda; Domingues, Wilson; Kirchgatter, Karin
2015-11-16
The merozoite surface protein 1 (MSP1) gene encodes the major surface antigen of invasive forms of the Plasmodium erythrocytic stages and is considered a candidate vaccine antigen against malaria. Due to its polymorphisms, MSP1 is also useful for strain discrimination and constitutes a good genetic marker. Sequence diversity in MSP1 has been analyzed in field isolates of three human parasites: P. falciparum, P. vivax, and P. ovale. However, the extent of variation in another human parasite, P. malariae, remains unknown. This parasite shows widespread, uneven distribution in tropical and subtropical regions throughout South America, Asia, and Africa. Interestingly, it is genetically indistinguishable from P. brasilianum, a parasite known to infect New World monkeys in Central and South America. Specific fragments (1 to 5) covering 60 % of the MSP1 gene (mainly the putatively polymorphic regions) were amplified by PCR in isolates of P. malariae and P. brasilianum from different geographic origins and hosts. Sequencing of the PCR-amplified products or cloned PCR fragments was performed and the sequences were used to construct a phylogenetic tree by the maximum likelihood method. Data were computed to give insights into the evolutionary and phylogenetic relationships of these parasites. Except for fragment 4, the sequences from all fragments were previously unpublished. The most polymorphic gene region was fragment 2, and in samples where this region lacks polymorphism, all other regions are also identical. The low variability of the P. malariae msp1 sequences of these isolates and the identification of the same haplotype in those collected many years apart at different locations is compatible with a low transmission rate. We also found greater diversity among P. brasilianum isolates compared with P. malariae ones. Lastly, the sequences were segregated according to their geographic origins and hosts, showing a strong genetic and geographic structure.
Our data show that there is a low level of sequence diversity and a possible absence of allelic dimorphism of MSP1 in these parasites as opposed to other Plasmodium species. P. brasilianum strains apparently show greater divergence in comparison to P. malariae, thus P. malariae could derive from P. brasilianum, as it has been proposed.
Kühne, Annett; Kaiser, Rolf; Schirmer, Markus; Heider, Ulrike; Muhlke, Sabine; Niere, Wiebke; Overbeck, Tobias; Hohloch, Karin; Trümper, Lorenz; Sezer, Orhan; Brockmöller, Jürgen
2007-07-01
Melphalan is widely used in the treatment of multiple myeloma. Pharmacokinetics of this alkylating drug shows high inter-individual variability. As melphalan is a phenylalanine derivative, the pharmacokinetic variability may be determined by genetic polymorphisms in the L-type amino acid transporters LAT1 (SLC7A5) and LAT2 (SLC7A8). Pharmacokinetics were analysed in 64 patients after first administration of intravenous melphalan. Severity of side effects was documented according to WHO criteria. Genomic DNA was analysed for polymorphisms in LAT1 and LAT2 by sequencing of the entire coding region, intron-exon boundaries and 2 kb upstream promoter region. Selected polymorphisms in the common heavy chain of both transporters, the protein 4F2hc (SLC3A2), were analysed by single nucleotide primer extension. Melphalan pharmacokinetics was highly variable with up to 6.2-fold differences in total clearance. A total of 44 polymorphisms were identified in LAT1 and 21 polymorphisms in LAT2. From all variants, only five were in the coding region and only one heterozygous non-synonymous polymorphism (Ala94Thr) was found in LAT2. Numerous polymorphisms were found in the LAT1 and LAT2 5'-flanking regions but did not correlate with expression of the respective genes. No significant correlations could be observed between the polymorphisms in 4F2hc, LAT1, and LAT2 with melphalan pharmacokinetics or with melphalan side effects. The study confirmed that these transporter genes are highly conserved, particularly in the coding sequences. Genetic variation in 4F2hc, LAT1, and LAT2 does not appear to be a major cause of inter-individual variability in pharmacokinetics and of adverse reactions to melphalan.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Tsugu, H.; Horowitz, R.; Gibson, N.
1994-12-01
Sera from approximately 30% of patients with systemic lupus erythematosus (SLE) contain high titers of autoantibodies that bind to the 52-kDa Ro/SSA protein. We previously detected polymorphisms in the 52-kDa Ro/SSA gene (SSA1) with restriction enzymes, one of which is strongly associated with the presence of SLE (P < 0.0005) in African Americans. A higher disease frequency and more severe forms of the disease are commonly noted among these female patients. To determine the location and nature of this polymorphism, we obtained two clones that span 8.5 kb of the 52-kDa Ro/SSA locus including its upstream regulatory region. Six exons were identified, and their nucleotide sequences plus adjacent noncoding regions were determined. No differences were found between these exons and the coding region of one of the reported cDNAs. The disease-associated polymorphic site suggested by a restriction enzyme map and confirmed by DNA amplification and nucleotide sequencing was present upstream of exon 1. This polymorphism may be a genetic marker for a disease-related variation in the coding region for the protein or in the upstream regulatory region of this gene. Although this RFLP is present in Japanese, it is not associated with lupus in this race. 41 refs., 4 figs., 2 tabs.
HomSI: a homozygous stretch identifier from next-generation sequencing data.
Görmez, Zeliha; Bakir-Gungor, Burcu; Sagiroglu, Mahmut Samil
2014-02-01
In consanguineous families, as a result of inheriting the same genomic segments through both parents, the individuals have stretches of their genomes that are homozygous. This situation leads to the prevalence of recessive diseases among the members of these families. Homozygosity mapping is based on this observation, and in consanguineous families, several recessive disease genes have been discovered with the help of this technique. Researchers typically use single nucleotide polymorphism arrays to determine the homozygous regions and then search for the disease gene by sequencing the genes within these candidate disease loci. Recently, the advent of next-generation sequencing has enabled the concurrent identification of homozygous regions and the detection of mutations relevant for diagnosis, using data from a single sequencing experiment. In this respect, we have developed a novel tool that identifies homozygous regions using deep sequence data. Using *.vcf (variant call format) files as input, our program identifies the majority of homozygous regions found by microarray single nucleotide polymorphism genotype data. HomSI software is freely available at www.igbam.bilgem.tubitak.gov.tr/softwares/HomSI, with an online manual.
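The idea of locating homozygous stretches from variant calls can be sketched in a few lines. This is not HomSI's algorithm, which the abstract does not describe; it is a minimal illustration that assumes genotype calls have already been parsed from a VCF file into (position, genotype) pairs for a single chromosome, sorted by position. The function name and the `min_snps` threshold are hypothetical.

```python
def homozygous_stretches(calls, min_snps=3):
    """Return (start, end, n_snps) for runs of consecutive homozygous calls.

    calls    -- iterable of (position, genotype) sorted by position, with
                genotypes in VCF GT notation such as "0/0", "0/1" or "1|1"
    min_snps -- shortest run (in number of calls) that is reported
    """
    stretches = []
    run = []  # positions of the current uninterrupted homozygous run

    def flush():
        # Record the current run if it is long enough, then reset it.
        if len(run) >= min_snps:
            stretches.append((run[0], run[-1], len(run)))
        run.clear()

    for pos, gt in calls:
        alleles = gt.replace("|", "/").split("/")
        is_hom = (
            len(alleles) == 2
            and alleles[0] == alleles[1]
            and alleles[0] != "."  # missing calls break a run
        )
        if is_hom:
            run.append(pos)
        else:
            flush()
    flush()
    return stretches


if __name__ == "__main__":
    example = [(100, "1/1"), (200, "0/0"), (300, "1/1"),
               (400, "0/1"), (500, "1/1"), (600, "1|1"), (700, "./.")]
    print(homozygous_stretches(example))
```

A practical tool would also need to tolerate occasional genotyping errors inside a run and apply a minimum physical length, both of which are left out here for brevity.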
DOE Office of Scientific and Technical Information (OSTI.GOV)
Stenger, Drake C., E-mail: drake.stenger@ars.usda.
Population structure of Homalodisca coagulata Virus-1 (HoCV-1) among and within field-collected insects sampled from a single point in space and time was examined. Polymorphism in complete consensus sequences among single-insect isolates was dominated by synonymous substitutions. The mutant spectrum of the C2 helicase region within each single-insect isolate was unique and dominated by nonsynonymous singletons. Bootstrapping was used to correct the within-isolate nonsynonymous:synonymous arithmetic ratio (N:S) for RT-PCR error, yielding an N:S value ~one log-unit greater than that of consensus sequences. Probability of all possible single-base substitutions for the C2 region predicted N:S values within 95% confidence limits of the corrected within-isolate N:S when the only constraint imposed was viral polymerase error bias for transitions over transversions. These results indicate that bottlenecks coupled with strong negative/purifying selection drive consensus sequences toward neutral sequence space, and that most polymorphism within single-insect isolates is composed of newly-minted mutations sampled prior to selection. -- Highlights: •Sampling protocol minimized differential selection/history among isolates. •Polymorphism among consensus sequences dominated by negative/purifying selection. •Within-isolate N:S ratio corrected for RT-PCR error by bootstrapping. •Within-isolate mutant spectrum dominated by new mutations yet to undergo selection.
Australian wild rice reveals pre-domestication origin of polymorphism deserts in rice genome.
Krishnan S, Gopala; Waters, Daniel L E; Henry, Robert J
2014-01-01
Rice is a major source of human food with a predominantly Asian production base. Domestication involved selection of traits that are desirable for agriculture and to human consumers. Wild relatives of crop plants are a source of useful variation which is of immense value for crop improvement. Australian wild rices have been isolated from the impacts of domestication in Asia and represent a source of novel diversity for global rice improvement. Oryza rufipogon is a perennial wild progenitor of cultivated rice. Oryza meridionalis is a related annual species in Australia. We have examined the sequence of the genomes of AA genome wild rices from Australia that are close relatives of cultivated rice through whole genome re-sequencing. Assembly of the resequencing data to the O. sativa ssp. japonica cv. Nipponbare shows that Australian wild rices possess 2.5 times more single nucleotide polymorphisms than in the Asian wild rice and cultivated O. sativa ssp. indica. Analysis of the genome of domesticated rice reveals regions of low diversity that show very little variation (polymorphism deserts). Both the perennial and annual wild rice from Australia show a high degree of conservation of sequence with that found in cultivated rice in the same 4.58 Mbp region on chromosome 5, which suggests that some of the 'polymorphism deserts' in this and other parts of the rice genome may have originated prior to domestication due to natural selection. Analysis of genes in the 'polymorphism deserts' indicates that this selection may have been due to biotic or abiotic stress in the environment of early rice relatives. Despite having closely related sequences in these genome regions, the Australian wild populations represent an invaluable source of diversity supporting rice food security.
Sun, Zichen; Stack, Colin; Šlapeta, Jan
2012-05-25
In order to investigate the genetic variation between Tritrichomonas foetus from bovine and feline origins, cysteine protease 8 (CP8) coding sequence was selected as the polymorphic DNA marker. Direct sequencing of CP8 coding sequence of T. foetus from four feline isolates and two bovine isolates with polymerase chain reaction successfully revealed conserved nucleotide polymorphisms between feline and bovine isolates. These results provide useful information for CP8-based molecular differentiation of T. foetus genotypes. Copyright © 2011 Elsevier B.V. All rights reserved.
Kayesh, E; Zhang, Y Y; Liu, G S; Bilkish, N; Sun, X; Leng, X P; Fang, J G
2013-09-23
The objectives of this investigation were to develop and validate the expressed sequence tag (EST)-simple sequence repeat (SSR) markers from large EST sequences, and to study the segregation and distribution of SSRs within two grapevine parental lines. In total, 94 F₁ lines crossed between "Early Rose" and "Red Globe" were studied. Approximately 2100 EST-SSR sequences of Vitis vinifera L. were searched for SSRs and analyzed for the design of polymerase chain reaction (PCR) primers amplifying the SSR-rich regions. Trinucleotide repeats were found to be the most abundant, followed by other nucleotide repeats. A total of 182 SSR primer pairs were first developed for the study on the parental polymorphism. Among the 182 SSR primers, 142 primer pairs (78%) could amplify the anticipated PCR products, among which only 52 primer pairs (36.62%) showed polymorphism between the two parents. These polymorphic bands were further surveyed among the 94 F₁ lines, and the results showed that a total of 162 bands were amplified, and 98 of them were polymorphic in both parents (60.86% polymorphism), with an average of 1.88 polymorphic DNA bands for each primer pair. After testing with the chi-square test, 33 of the clearly amplified polymorphic bands followed a 3:1 ratio, and 37 followed a 1:1 ratio. The rest showed distorted segregation ratios.
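The chi-square test of segregation ratios mentioned above can be sketched as follows. The abstract gives only the number of F₁ lines (94), not the per-band counts, so the counts in the usage example are hypothetical; the function names are also illustrative and are not taken from the study. The critical value 3.841 is the standard 5% threshold for one degree of freedom.

```python
CRITICAL_DF1_ALPHA05 = 3.841  # chi-square critical value, 1 d.f., alpha = 0.05


def segregation_chi_square(present, absent, ratio):
    """Chi-square goodness-of-fit statistic for a two-class segregation ratio.

    present, absent -- observed numbers of progeny with and without the band
    ratio           -- expected ratio as a pair, e.g. (3, 1) or (1, 1)
    """
    total = present + absent
    if total <= 0 or ratio[0] <= 0 or ratio[1] <= 0:
        raise ValueError("counts and ratio parts must be positive")
    exp_present = total * ratio[0] / (ratio[0] + ratio[1])
    exp_absent = total - exp_present
    return ((present - exp_present) ** 2 / exp_present
            + (absent - exp_absent) ** 2 / exp_absent)


def fits_ratio(present, absent, ratio):
    """True if the observed counts do not deviate significantly from the ratio."""
    return segregation_chi_square(present, absent, ratio) < CRITICAL_DF1_ALPHA05


if __name__ == "__main__":
    # Hypothetical band scored in 94 F1 lines: present in 70, absent in 24.
    print(fits_ratio(70, 24, (3, 1)))  # compatible with 3:1
    print(fits_ratio(70, 24, (1, 1)))  # distorted relative to 1:1
```

A band that fits neither ratio would fall into the "distorted segregation" class described in the abstract.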
Feltus, F A; Singh, H P; Lohithaswa, H C; Schulze, S R; Silva, T D; Paterson, A H
2006-04-01
Completed genome sequences provide templates for the design of genome analysis tools in orphan species lacking sequence information. To demonstrate this principle, we designed 384 PCR primer pairs to conserved exonic regions flanking introns, using Sorghum/Pennisetum expressed sequence tag alignments to the Oryza genome. Conserved-intron scanning primers (CISPs) amplified single-copy loci at 37% to 80% success rates in taxa that sample much of the approximately 50-million years of Poaceae divergence. While the conserved nature of exons fostered cross-taxon amplification, the lesser evolutionary constraints on introns enhanced single-nucleotide polymorphism detection. For example, in eight rice (Oryza sativa) genotypes, polymorphism averaged 12.1 per kb in introns but only 3.6 per kb in exons. Curiously, among 124 CISPs evaluated across Oryza, Sorghum, Pennisetum, Cynodon, Eragrostis, Zea, Triticum, and Hordeum, 23 (18.5%) seemed to be subject to rigid intron size constraints that were independent of per-nucleotide DNA sequence variation. Furthermore, we identified 487 conserved-noncoding sequence motifs in 129 CISP loci. A large CISP set (6,062 primer pairs, amplifying introns from 1,676 genes) designed using an automated pipeline showed generally higher abundance in recombinogenic than in nonrecombinogenic regions of the rice genome, thus providing relatively even distribution along genetic maps. CISPs are an effective means to explore poorly characterized genomes for both DNA polymorphism and noncoding sequence conservation on a genome-wide or candidate gene basis, and also provide anchor points for comparative genomics across a diverse range of species.
Feltus, F.A.; Singh, H.P.; Lohithaswa, H.C.; Schulze, S.R.; Silva, T.D.; Paterson, A.H.
2006-01-01
Completed genome sequences provide templates for the design of genome analysis tools in orphan species lacking sequence information. To demonstrate this principle, we designed 384 PCR primer pairs to conserved exonic regions flanking introns, using Sorghum/Pennisetum expressed sequence tag alignments to the Oryza genome. Conserved-intron scanning primers (CISPs) amplified single-copy loci at 37% to 80% success rates in taxa that sample much of the approximately 50-million years of Poaceae divergence. While the conserved nature of exons fostered cross-taxon amplification, the lesser evolutionary constraints on introns enhanced single-nucleotide polymorphism detection. For example, in eight rice (Oryza sativa) genotypes, polymorphism averaged 12.1 per kb in introns but only 3.6 per kb in exons. Curiously, among 124 CISPs evaluated across Oryza, Sorghum, Pennisetum, Cynodon, Eragrostis, Zea, Triticum, and Hordeum, 23 (18.5%) seemed to be subject to rigid intron size constraints that were independent of per-nucleotide DNA sequence variation. Furthermore, we identified 487 conserved-noncoding sequence motifs in 129 CISP loci. A large CISP set (6,062 primer pairs, amplifying introns from 1,676 genes) designed using an automated pipeline showed generally higher abundance in recombinogenic than in nonrecombinogenic regions of the rice genome, thus providing relatively even distribution along genetic maps. CISPs are an effective means to explore poorly characterized genomes for both DNA polymorphism and noncoding sequence conservation on a genome-wide or candidate gene basis, and also provide anchor points for comparative genomics across a diverse range of species. PMID:16607031
Long interspersed repeated DNA (LINE) causes polymorphism at the rat insulin 1 locus.
Lakshmikumaran, M S; D'Ambrosio, E; Laimins, L A; Lin, D T; Furano, A V
1985-01-01
The insulin 1, but not the insulin 2, locus is polymorphic (i.e., exhibits allelic variation) in rats. Restriction enzyme analysis and hybridization studies showed that the polymorphic region is 2.2 kilobases upstream of the insulin 1 coding region and is due to the presence or absence of an approximately 2.7-kilobase repeated DNA element. DNA sequence determination showed that this DNA element is a member of a long interspersed repeated DNA family (LINE) that is highly repeated (greater than 50,000 copies) and highly transcribed in the rat. Although the presence or absence of LINE sequences at the insulin 1 locus occurs in both the homozygous and heterozygous states, LINE-containing insulin 1 alleles are more prevalent in the rat population than are alleles without LINEs. Restriction enzyme analysis of the LINE-containing alleles indicated that at least two versions of the LINE sequence may be present at the insulin 1 locus in different rats. Either repeated transposition of LINE sequences or gene conversion between the resident insulin 1 LINE and other sequences in the genome are possible explanations for this. PMID:3016521
Hutcheson, Kelly A; Paluru, Prasuna C; Bernstein, Steven L; Koh, Jamie; Rappaport, Eric F; Leach, Richard A; Young, Terri L
2005-07-14
Retinopathy of prematurity (ROP) is a leading cause of visual loss in the pediatric population. Mutations in the Norrie disease gene (NDP) are associated with heritable retinal vascular disorders, and have been found in a small subset of patients with severe retinopathy of prematurity. Varying rates of progression to threshold disease in different races may have a genetic basis, as recent studies suggest that the incidence of NDP mutations may vary in different groups. African Americans, for example, are less likely to develop severe degrees of ROP. We screened a large cohort of ethnically diverse patients for mutations in the entire NDP. A total of 143 subjects of different ethnic backgrounds were enrolled in the study. Fifty-four patients had severe ROP (Stage 3 or worse). Of these, 38 were threshold in at least one eye (with a mean gestational age of 26.1 weeks and mean birth weight of 788.4 g). There were 36 patients with mild or no ROP, 31 parents with no history of retinal disease or prematurity, and 22 wild type (normal) controls. There were 70 African American subjects, 55 Caucasians, and 18 of other races. Severe ROP was noted in 29 African American subjects, 17 Caucasians, and 8 of other races. Seven polymerase chain reaction primer pairs spanning the NDP were optimized for denaturing high performance liquid chromatography and direct sequencing. Three primer pairs covered the coding region, and the remaining four spanned the 3' and 5' untranslated regions (UTR). Six of 54 (11%) infants with severe ROP had polymorphisms in the NDP. Five of the infants were African American, and one was Caucasian. Two parents were heterozygous for the same polymorphism as their child. One parent-child pair had a single base pair (bp) insertion in the 3' UTR region. Another parent-child pair had two mutations: a 14 bp deletion in the 5' UTR region of exon 1 and a single nucleotide polymorphism in the 5' UTR region of exon 2. No coding region sequence changes were found. 
No polymorphisms were observed in infants with mild or no ROP, or in the wild type controls. Of the six sequence alterations found, five were novel nucleotide changes: One in the 5' UTR region of exon 2, and four in the 3' UTR region of exon 3. The extent of NDP polymorphisms in this large, racially diverse group of infants is moderate. NDP polymorphisms may play a role in the pathogenesis of ROP, but do not appear to be a major causative factor.
Lijavetzky, Diego; Cabezas, José Antonio; Ibáñez, Ana; Rodríguez, Virginia; Martínez-Zapater, José M
2007-01-01
Background: Single-nucleotide polymorphisms (SNPs) are the most abundant type of DNA sequence polymorphisms. Their higher availability and stability when compared to simple sequence repeats (SSRs) provide enhanced possibilities for genetic and breeding applications such as cultivar identification, construction of genetic maps, the assessment of genetic diversity, the detection of genotype/phenotype associations, or marker-assisted breeding. In addition, the efficiency of these activities can be improved thanks to the ease with which SNP genotyping can be automated. Expressed sequence tags (EST) sequencing projects in grapevine are allowing for the in silico detection of multiple putative sequence polymorphisms within and among a reduced number of cultivars. In parallel, the sequence of the grapevine cultivar Pinot Noir is also providing thousands of polymorphisms present in this highly heterozygous genome. Still the general application of those SNPs requires further validation since their use could be restricted to those specific genotypes. Results: In order to develop a large SNP set of wide application in grapevine we followed a systematic re-sequencing approach in a group of 11 grape genotypes corresponding to ancient unrelated cultivars as well as wild plants. Using this approach, we have sequenced 230 gene fragments, which represents the analysis of over 1 Mb of grape DNA sequence. This analysis has allowed the discovery of 1573 SNPs with an average of one SNP every 64 bp (one SNP every 47 bp in non-coding regions and every 69 bp in coding regions). Nucleotide diversity in grape (π = 0.0051) was found to be similar to values observed in highly polymorphic plant species such as maize. The average number of haplotypes per gene sequence was estimated as six, with three haplotypes representing over 83% of the analyzed sequences.
Short-range linkage disequilibrium (LD) studies within the analyzed sequences indicate the existence of a rapid decay of LD within the selected grapevine genotypes. To validate the use of the detected polymorphisms in genetic mapping, cultivar identification and genetic diversity studies we have used the SNPlex™ genotyping technology in a sample of grapevine genotypes and segregating progenies. Conclusion: These results provide accurate values for nucleotide diversity in coding sequences and a first estimate of short-range LD in grapevine. Using SNPlex™ genotyping we have shown the application of a set of discovered SNPs as molecular markers for cultivar identification, linkage mapping and genetic diversity studies. Thus, the combination of a highly efficient re-sequencing approach and the SNPlex™ high throughput genotyping technology provides a powerful tool for grapevine genetic analysis. PMID:18021442
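Nucleotide diversity (π), reported above as 0.0051 for grape, is commonly defined as the average number of pairwise differences per site among a set of aligned sequences. The sketch below illustrates that definition only; it is not the pipeline used in the study, the function name is hypothetical, and the toy sequences in the usage example are invented. It assumes equal-length, gap-free alignments.

```python
from itertools import combinations


def nucleotide_diversity(seqs):
    """Average pairwise differences per site among aligned sequences.

    seqs -- list of at least two equal-length strings (aligned, no gaps)
    """
    n = len(seqs)
    if n < 2:
        raise ValueError("need at least two sequences")
    length = len(seqs[0])
    if length == 0 or any(len(s) != length for s in seqs):
        raise ValueError("sequences must be non-empty and of equal length")
    # Count mismatching sites over every unordered pair of sequences.
    total_diff = sum(
        sum(a != b for a, b in zip(s1, s2))
        for s1, s2 in combinations(seqs, 2)
    )
    n_pairs = n * (n - 1) // 2
    return total_diff / (n_pairs * length)


if __name__ == "__main__":
    toy = ["ACGTACGTAC", "ACGTACGTAC", "ACGTTCGTAC"]
    print(round(nucleotide_diversity(toy), 4))
```

Real data would additionally require handling of alignment gaps, missing bases and heterozygous calls, which this sketch ignores.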
Larsen, Charles E.; Alford, Dennis R.; Trautwein, Michael R.; Jalloh, Yanoh K.; Tarnacki, Jennifer L.; Kunnenkeri, Sushruta K.; Fici, Dolores A.; Yunis, Edmond J.; Awdeh, Zuheir L.; Alper, Chester A.
2014-01-01
We resequenced and phased 27 kb of DNA within 580 kb of the MHC class II region in 158 population chromosomes, most of which were conserved extended haplotypes (CEHs) of European descent or contained their centromeric fragments. We determined the single nucleotide polymorphism and deletion-insertion polymorphism alleles of the dominant sequences from HLA-DQA2 to DAXX for these CEHs. Nine of 13 CEHs remained sufficiently intact to possess a dominant sequence extending at least to DAXX, 230 kb centromeric to HLA-DPB1. We identified the regions centromeric to HLA-DQB1 within which single instances of eight “common” European MHC haplotypes previously sequenced by the MHC Haplotype Project (MHP) were representative of those dominant CEH sequences. Only two MHP haplotypes had a dominant CEH sequence throughout the centromeric and extended class II region and one MHP haplotype did not represent a known European CEH anywhere in the region. We identified the centromeric recombination transition points of other MHP sequences from CEH representation to non-representation. Several CEH pairs or groups shared sequence identity in small blocks but had significantly different (although still conserved for each separate CEH) sequences in surrounding regions. These patterns partly explain strong calculated linkage disequilibrium over only short (tens to hundreds of kilobases) distances in the context of a finite number of observed megabase-length CEHs comprising half a population's haplotypes. Our results provide a clearer picture of European CEH class II allelic structure and population haplotype architecture, improved regional CEH markers, and raise questions concerning regional recombination hotspots. PMID:25299700
Kim, Kyunghee; Lee, Sang-Choon; Lee, Junki; Lee, Hyun Oh; Joh, Ho Jun; Kim, Nam-Hoon; Park, Hyun-Seung; Yang, Tae-Jin
2015-01-01
We report complete sequences of chloroplast (cp) genome and 45S nuclear ribosomal DNA (45S nrDNA) for 11 Panax ginseng cultivars. We have obtained complete sequences of cp and 45S nrDNA, the representative barcoding target sequences for cytoplasm and nuclear genome, respectively, based on low coverage NGS sequence of each cultivar. The cp genome sizes ranged from 156,241 to 156,425 bp and the major size variation was derived from differences in copy number of tandem repeats in the ycf1 gene and in the intergenic regions of rps16-trnUUG and rpl32-trnUAG. The complete 45S nrDNA unit sequences were 11,091 bp, representing a consensus single transcriptional unit with an intergenic spacer region. Comparative analysis of these sequences as well as those previously reported for three Chinese accessions identified very rare but unique polymorphism in the cp genome within P. ginseng cultivars. There were 12 intra-species polymorphisms (six SNPs and six InDels) among 14 cultivars. We also identified five SNPs from 45S nrDNA of 11 Korean ginseng cultivars. From the 17 unique informative polymorphic sites, we developed six reliable markers for analysis of ginseng diversity and cultivar authentication. PMID:26061692
Polymorphism of BMP4 gene in Indian goat breeds differing in prolificacy.
Sharma, Rekha; Ahlawat, Sonika; Maitra, A; Roy, Manoranjan; Mandakmale, S; Tantia, M S
2013-12-10
Bone morphogenetic proteins (BMPs) are members of the TGF-β (transforming growth factor-beta) superfamily, of which BMP4 is the most important due to its crucial role in follicular growth and differentiation, cumulus expansion and ovulation. Reproduction is a crucial trait in goat breeding and based on the important role of BMP4 gene in reproduction it was considered as a possible candidate gene for the prolificacy of goats. The objective of the present study was to detect polymorphism in intronic, exonic and 3' un-translated regions of BMP4 gene in Indian goats. Nine different goat breeds (Barbari, Beetal, Black Bengal, Malabari, Jakhrana (Twinning>40%), Osmanabadi, Sangamneri (Twinning 20-30%), Sirohi and Ganjam (Twinning<10%)) differing in prolificacy and geographic distribution were employed for polymorphism scanning. Cattle sequence (AC_000167.1) was used to design primers for the amplification of a targeted region followed by direct DNA sequencing to identify the genetic variations. Single nucleotide polymorphisms (SNPs) were not detected in exon 3, the intronic region and the 3' flanking region. A SNP (G1534A) was identified in exon 2. It was a non-synonymous mutation resulting in an arginine to lysine change in a corresponding protein sequence. G to A transition at the 1534 locus revealed two genotypes GG and GA in the nine investigated goat breeds. The GG genotype was predominant with a genotype frequency of 0.98. The GA genotype was present in the Black Bengal as well as Jakhrana breed with a genotype frequency of 0.02. A microsatellite was identified in the 3' flanking region, only 20 nucleotides downstream from the termination site of the coding region, as a short sequence with more than nineteen continuous and repeated CA dinucleotides. Since the gene is highly evolutionarily conserved, identification of a non-synonymous SNP (G1534A) in the coding region gains further importance. 
To our knowledge, this is the first report of a mutation in the coding region of the caprine BMP4 gene. Whether reproductive traits in goats are associated with the BMP4 polymorphism remains to be established by association studies in more populations. © 2013 Elsevier B.V. All rights reserved.
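The genotype frequencies reported above (GG 0.98, GA 0.02) can be converted into allele frequencies by simple allele counting. The following is a minimal sketch of that arithmetic; the function name `allele_frequencies` and the two-character genotype encoding are assumptions made for illustration and are not taken from the study.

```python
def allele_frequencies(genotype_freqs):
    """Convert diploid genotype frequencies to allele frequencies.

    genotype_freqs maps a two-character genotype string (e.g. "GA")
    to its frequency. This encoding is a hypothetical convention.
    """
    total = sum(genotype_freqs.values())
    result = {}
    for genotype, freq in genotype_freqs.items():
        # Each diploid genotype contributes half of its frequency per allele copy.
        for allele in genotype:
            result[allele] = result.get(allele, 0.0) + freq / (2.0 * total)
    return result


# Genotype frequencies as reported in the abstract (no AA genotype observed).
freqs = allele_frequencies({"GG": 0.98, "GA": 0.02})
print(freqs)
```

With the reported values, the G allele comes out near 0.99 and the A allele near 0.01, since every A copy is carried by a heterozygote.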
Barroso, G.; Blesa, S.; Labarere, J.
1995-01-01
We used restriction fragment length polymorphisms to examine mitochondrial genome rearrangements in 36 wild strains of the cultivated basidiomycete Agrocybe aegerita, collected from widely distributed locations in Europe. We identified two polymorphic regions within the mitochondrial DNA which varied independently: one carrying the Cox II coding sequence and the other carrying the Cox I, ATP6, and ATP8 coding sequences. Two types of mutations were responsible for the restriction fragment length polymorphisms that we observed and, accordingly, were involved in the A. aegerita mitochondrial genome evolution: (i) point mutations, which resulted in strain-specific mitochondrial markers, and (ii) length mutations due to genome rearrangements, such as deletions, insertions, or duplications. Within each polymorphic region, the length differences defined only two mitochondrial types, suggesting that these length mutations were not randomly generated but resulted from a precise rearrangement mechanism. For each of the two polymorphic regions, the two molecular types were distributed among the 36 strains without obvious correlation with their geographic origin. On the basis of these two polymorphisms, it is possible to define four mitochondrial haplotypes. The four mitochondrial haplotypes could be the result of intermolecular recombination between allelic forms present in the population long enough to reach linkage equilibrium. All of the 36 dikaryotic strains contained only a single mitochondrial type, confirming the previously described mitochondrial sorting out after cytoplasmic mixing in basidiomycetes. PMID:16534984
Detection and correction of false segmental duplications caused by genome mis-assembly
2010-01-01
Diploid genomes with divergent chromosomes present special problems for assembly software as two copies of especially polymorphic regions may be mistakenly constructed, creating the appearance of a recent segmental duplication. We developed a method for identifying such false duplications and applied it to four vertebrate genomes. For each genome, we corrected mis-assemblies, improved estimates of the amount of duplicated sequence, and recovered polymorphisms between the sequenced chromosomes. PMID:20219098
Sequence variations of the bovine prion protein gene (PRNP) in native Korean Hanwoo cattle
Choi, Sangho
2012-01-01
Bovine spongiform encephalopathy (BSE) is one of the fatal neurodegenerative diseases known as transmissible spongiform encephalopathies (TSEs) caused by infectious prion proteins. Genetic variations correlated with susceptibility or resistance to TSE in humans and sheep have not been reported for bovine strains including those from Holstein, Jersey, and Japanese Black cattle. Here, we investigated bovine prion protein gene (PRNP) variations in Hanwoo cattle [Bos (B.) taurus coreanae], a native breed in Korea. We identified mutations and polymorphisms in the coding region of PRNP, determined their frequency, and evaluated their significance. We identified four synonymous polymorphisms and two non-synonymous mutations in PRNP, but found no novel polymorphisms. The sequence and number of octapeptide repeats were completely conserved, and the haplotype frequency of the coding region was similar to that of other B. taurus strains. When we examined the 23-bp and 12-bp insertion/deletion (indel) polymorphisms in the non-coding region of PRNP, Hanwoo cattle had a lower deletion allele and 23-bp del/12-bp del haplotype frequency than healthy and BSE-affected animals of other strains. Thus, Hanwoo are seemingly less susceptible to BSE than other strains due to the 23-bp and 12-bp indel polymorphisms. PMID:22705734
Hayden, M R; Hewitt, J; Wasmuth, J J; Kastelein, J J; Langlois, S; Conneally, M; Haines, J; Smith, B; Hilbert, C; Allard, D
1988-01-01
A polymorphic marker (D4S62) that is genetically closely linked to D4S10 and is in the region of the gene for Huntington disease is described. A four-allele polymorphism is detected when HincII-digested DNA is hybridized with D4S62. D4S62 maps, by Southern blot analysis using somatic-cell hybrids, to 4p16.1 closer to the centromere than does D4S10. The use of the polymorphisms detected by D4S62 increases the informativeness of markers close to the gene for Huntington disease and will be useful for preclinical diagnosis. D4S62 detects transcripts of approximately 6,000 nucleotides in rat, mouse, and monkey liver and brain. This represents the first demonstration of conserved expressed sequences close to the gene for Huntington disease. PMID:2892395
Plasmodium vivax rhomboid-like protease 1 gene diversity in Thailand.
Mataradchakul, Touchchapol; Uthaipibull, Chairat; Nosten, Francois; Vega-Rodriguez, Joel; Jacobs-Lorena, Marcelo; Lek-Uthai, Usa
2017-10-01
Plasmodium vivax infection remains a major public health problem, especially along the Thailand border regions. We examined the genetic diversity of this parasite by analyzing single-nucleotide polymorphisms (SNPs) of the P. vivax rhomboid-like protease 1 gene (Pvrom1) in parasites collected from western (Tak province, Thai-Myanmar border) and eastern (Chanthaburi province, Thai-Cambodia border) regions. Data were collected by a cross-sectional survey, consisting of 47 and 45 P. vivax-infected filter paper-spotted blood samples from the western and eastern regions of Thailand, respectively, from September 2013 to May 2014. Extracted DNA was examined for the presence of P. vivax using Plasmodium species-specific nested PCR. The Pvrom1 gene was PCR amplified and sequenced, and SNP diversity was analyzed using the F-STAT, DnaSP, MEGA and LIAN programs. Comparison of sequences of the 92 Pvrom1 831-base open reading frames with that of a reference sequence (GenBank acc. no. XM001615211) revealed 17 samples with a total of 8 polymorphic sites, consisting of singleton (exon 3, nt 645) and parsimony informative (exon 1, nt 22 and 39; exon 3, nt 336, 537 and 656; and exon 4, nt 719 and 748) sites, which resulted in six different deduced Pvrom1 variants. The ratio of non-synonymous to synonymous substitutions estimated by the DnaSP program was 1.65, indicating positive selection, but the Z-tests of selection showed no significant deviations from neutrality for Pvrom1 samples from the western region of Thailand. In addition, the McDonald-Kreitman (MK) test was not significant, and Fst values did not differ between the two regions or for the regions combined. Interestingly, Pvrom1 exon 2 was the most conserved of the four exons. The relatively high degree of Pvrom1 polymorphism suggests that the protein is important for parasite survival in the face of changes in both insect vector and human populations. 
These polymorphisms could serve as a sensitive marker for studying plasmodial genetic diversity. The significance of Pvrom1 conserved exon 2 sequence remains to be investigated. Copyright © 2017 Mahidol University. Published by Elsevier Inc. All rights reserved.
Landscape of Insertion Polymorphisms in the Human Genome
Onozawa, Masahiro; Goldberg, Liat; Aplan, Peter D.
2015-01-01
Nucleotide substitutions, small (<50 bp) insertions or deletions (indels), and large (>50 bp) deletions are well-known causes of genetic variation within the human genome. We recently reported a previously unrecognized form of polymorphic insertions, termed templated sequence insertion polymorphism (TSIP), in which the inserted sequence was templated from a distant genomic region, and was inserted in the genome through reverse transcription of an RNA intermediate. TSIPs can be grouped into two classes based on nucleotide sequence features at the insertion junctions; class 1 TSIPs show target site duplication, polyadenylation, and preference for insertion at a 5′-TTTT/A-3′ sequence, suggesting a LINE-1 based insertion mechanism, whereas class 2 TSIPs show features consistent with repair of a DNA double strand break by nonhomologous end joining. To gain a more complete picture of TSIPs throughout the human population, we evaluated whole-genome sequence from 52 individuals, and identified 171 TSIPs. Most individuals had 25–30 TSIPs, and common (present in >20% of individuals) TSIPs were found in individuals throughout the world, whereas rare TSIPs tended to cluster in specific geographic regions. The number of rare TSIPs was greater than the number of common TSIPs, suggesting that TSIP generation is an ongoing process. Intriguingly, mitochondrial sequences were a frequent template for class 2 insertions, used more commonly than any nuclear chromosome. Similar to single nucleotide polymorphisms and indels, we suspect that these TSIPs may be important for the generation of human diversity and genetic diseases, and can be useful in tracking historical migration of populations. PMID:25745018
Shao, Chengchen; Zhang, Yaqi; Zhou, Yueqin; Zhu, Wei; Xu, Hongmei; Liu, Zhiping; Tang, Qiqun; Shen, Yiwen; Xie, Jianhui
2015-01-01
Aim To systematically select and evaluate short tandem repeats (STRs) on chromosome 14 and obtain new STR loci as expanded genotyping markers for forensic application. Methods STRs on chromosome 14 were filtered from the Tandem Repeats Database and further selected based on their positions on the chromosome, repeat patterns of the core sequences, sequence homology of the flanking regions, and suitability of flanking regions for primer design. The STR locus with the highest heterozygosity and polymorphism information content (PIC) was selected for further analysis of genetic polymorphism, forensic parameters, and the core sequence. Results Among 26 STR loci selected as candidates, D14S739 had the highest heterozygosity (0.8691) and PIC (0.8432), and showed no deviation from the Hardy-Weinberg equilibrium. Fourteen alleles were observed, ranging in size from 21 to 34 tetranucleotide units in the core region of (GATA)9-18 (GACA)7-12 GACG (GACA)2 GATA. Paternity testing showed no mutations. Conclusion D14S739 is a highly informative STR locus and could be a suitable genetic marker for forensic applications in the Han Chinese population. PMID:26526885
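Heterozygosity and PIC, the two statistics used above to rank candidate loci, are both functions of allele frequencies. The sketch below assumes expected heterozygosity (1 minus the sum of squared allele frequencies) and the PIC definition of Botstein et al. (1980); the abstract does not state which estimators the authors used, and the example frequencies are hypothetical, not data from the study.

```python
from itertools import combinations


def expected_heterozygosity(freqs):
    """Expected heterozygosity: 1 - sum of squared allele frequencies."""
    return 1.0 - sum(p * p for p in freqs)


def polymorphism_information_content(freqs):
    """PIC as defined by Botstein et al. (1980).

    PIC = 1 - sum(p_i^2) - sum over i<j of 2 * p_i^2 * p_j^2
    """
    homozygosity = sum(p * p for p in freqs)
    cross_term = sum(2.0 * (p * p) * (q * q) for p, q in combinations(freqs, 2))
    return 1.0 - homozygosity - cross_term


# Hypothetical example: a biallelic locus with equally frequent alleles.
example = [0.5, 0.5]
print(expected_heterozygosity(example))           # 0.5
print(polymorphism_information_content(example))  # 0.375
```

A locus with many alleles at similar frequencies pushes both statistics toward 1, which is why a multi-allelic STR can reach values such as those reported for D14S739 while a biallelic marker cannot exceed a PIC of 0.375.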
A new polymorphic and multicopy MHC gene family related to nonmammalian class I
DOE Office of Scientific and Technical Information (OSTI.GOV)
Leelayuwat, C.; Degli-Esposti, M.A.; Abraham, L.J.
1994-12-31
The authors have used genomic analysis to characterize a region of the central major histocompatibility complex (MHC) spanning approximately 300 kilobases (kb) between TNF and HLA-B. This region has been suggested to carry genetic factors relevant to the development of autoimmune diseases such as myasthenia gravis (MG) and insulin dependent diabetes mellitus (IDDM). Genomic sequence was analyzed for coding potential, using two neural network programs, GRAIL and GeneParser. A genomic probe, JAB, containing putative coding sequences (PERB11) located 60 kb centromeric of HLA-B, was used for northern analysis of human tissues. Multiple transcripts were detected. Southern analysis of genomic DNA and overlapping YAC clones, covering the region from BAT1 to HLA-F, indicated that there are at least five copies of PERB11, four of which are located within this region of the MHC. The partial cDNA sequence of PERB11 was obtained from poly-A RNA derived from skeletal muscle. The putative amino acid sequence of PERB11 shares approximately 30% identity to MHC class I molecules from various species, including reptiles, chickens, and frogs, as well as to other MHC class I-like molecules, such as the IgG FcR of the mouse and rat and the human Zn-α2-glycoprotein. From direct comparison of amino acid sequences, it is concluded that PERB11 is a distinct molecule more closely related to nonmammalian than known mammalian MHC class I molecules. Genomic sequence analysis of PERB11 from five MHC ancestral haplotypes (AH) indicated that the gene is polymorphic at both DNA and protein level. The results suggest that the authors have identified a novel polymorphic gene family with multiple copies within the MHC. 48 refs., 10 figs., 2 tabs.
Indel Group in Genomes (IGG) Molecular Genetic Markers
Burkart-Waco, Diana; Kuppu, Sundaram; Britt, Anne; Chetelat, Roger
2016-01-01
Genetic markers are essential when developing or working with genetically variable populations. Indel Group in Genomes (IGG) markers are primer pairs that amplify single-locus sequences that differ in size for two or more alleles. They are attractive for their ease of use for rapid genotyping and their codominant nature. Here, we describe a heuristic algorithm that uses a k-mer-based approach to search two or more genome sequences to locate polymorphic regions suitable for designing candidate IGG marker primers. As input to the IGG pipeline software, the user provides genome sequences and the desired amplicon sizes and size differences. Primer sequences flanking polymorphic insertions/deletions are produced as output. IGG marker files for three sets of genomes, Solanum lycopersicum/Solanum pennellii, Arabidopsis (Arabidopsis thaliana) Columbia-0/Landsberg erecta-0 accessions, and S. lycopersicum/S. pennellii/Solanum tuberosum (three-way polymorphic) are included. PMID:27436831
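The general idea of using k-mers to locate size-polymorphic regions between genomes can be illustrated with a toy sketch. The code below is not the IGG pipeline and does not design primers; it assumes a simplified scheme in which k-mers that occur exactly once in each of two sequences act as anchors, and a pair of neighbouring collinear anchors whose spacing differs between the sequences flags a candidate indel. All function names, parameters, and test sequences are hypothetical.

```python
from collections import Counter


def unique_kmer_positions(seq, k):
    """Map each k-mer that occurs exactly once in seq to its start position."""
    counts = Counter()
    positions = {}
    for i in range(len(seq) - k + 1):
        kmer = seq[i:i + k]
        counts[kmer] += 1
        positions[kmer] = i
    return {kmer: pos for kmer, pos in positions.items() if counts[kmer] == 1}


def candidate_indels(seq_a, seq_b, k=8, min_diff=1):
    """Return (start_a, end_a, start_b, end_b, size_diff) for anchor pairs
    whose spacing differs between the two sequences.

    Anchors are k-mers unique in both sequences; only anchors that stay
    collinear (increasing position in both sequences) are considered.
    """
    pos_a = unique_kmer_positions(seq_a, k)
    pos_b = unique_kmer_positions(seq_b, k)
    anchors = sorted((pos_a[km], pos_b[km]) for km in pos_a.keys() & pos_b.keys())

    hits = []
    prev = None
    for a, b in anchors:
        if prev is not None and b <= prev[1]:
            continue  # skip anchors that break collinearity
        if prev is not None:
            size_diff = (b - prev[1]) - (a - prev[0])
            if abs(size_diff) >= min_diff:
                hits.append((prev[0], a, prev[1], b, size_diff))
        prev = (a, b)
    return hits


# Toy example: seq_b carries a 6-base insertion relative to seq_a.
seq_a = "ACGTTGCAGGATCCTA"
seq_b = "ACGTTGCA" + "TTTTTT" + "GGATCCTA"
print(candidate_indels(seq_a, seq_b, k=4))  # [(4, 8, 4, 14, 6)]
```

In a marker-design setting, the flanking anchors would then be the natural place to look for primer sites, and `min_diff` would correspond to the user-specified minimum amplicon size difference mentioned in the abstract.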
Alam, Nuhu; Kim, Jeong Hwa; Shim, Mi Ja; Lee, U Youn
2010-01-01
This study evaluated the optimal vegetative growth conditions and molecular phylogenetic relationships of eleven strains of Agrocybe cylindracea collected from different ecological regions of Korea, China and Taiwan. The optimal temperature and pH for mycelial growth were 25℃ and pH 6, respectively. Potato dextrose agar and Hennerberg were the favorable media for vegetative growth, whereas glucose tryptone was unfavorable. Dextrin, maltose, and fructose were the most effective carbon sources. The most suitable nitrogen sources were arginine and glycine, whereas methionine, alanine, histidine, and urea were least effective for the mycelial propagation of A. cylindracea. The internal transcribed spacer (ITS) regions of rDNA were amplified using PCR. The sequence of ITS2 was more variable than that of ITS1, while the 5.8S sequences were identical. The reciprocal homologies of the ITS sequences ranged from 98 to 100%. The strains were also analyzed by random amplification of polymorphic DNA (RAPD) using 20 arbitrary primers. Fifteen primers efficiently amplified the genomic DNA. The average number of polymorphic bands observed per primer was 3.8. The numbers of amplified bands varied based on the primers and strains, with polymorphic fragments ranging from 0.1 to 2.9 kb. The results of RAPD analysis were similar to the ITS region sequences. The results revealed that RAPD and ITS techniques were well suited for detecting the genetic diversity of all A. cylindracea strains tested. PMID:23956633
Adachi, Noboru; Umetsu, Kazuo; Shojo, Hideki
2014-01-01
Mitochondrial DNA (mtDNA) is widely used for DNA analysis of highly degraded samples because of its polymorphic nature and high number of copies in a cell. However, as endogenous mtDNA in deteriorated samples is scarce and highly fragmented, it is not easy to obtain reliable data. In the current study, we report the risks of direct sequencing of mtDNA in highly degraded material, and suggest a strategy to ensure the quality of sequencing data. Direct sequencing data of the hypervariable segment (HVS) 1 obtained using primer sets that generate an amplicon of 407 bp (long-primer sets) differed from results obtained using newly designed primer sets that produce an amplicon of 120-139 bp (mini-primer sets). The data aligned with the results of mini-primer sets analysis in an amplicon length-dependent manner; the shorter the amplicon, the more evident the endogenous sequence became. Coding region analysis using multiplex amplified product-length polymorphisms revealed the incongruence of single nucleotide polymorphisms between the coding region and HVS 1 caused by contamination with exogenous mtDNA. Although the sequencing data obtained using long-primer sets turned out to be erroneous, they were unambiguous and reproducible. These findings suggest that PCR primers that produce amplicons shorter than those currently recognized should be used for mtDNA analysis in highly degraded samples. Haplogroup motif analysis of the coding region and HVS should also be performed to improve the reliability of forensic mtDNA data. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.
Identification of SNP and SSR Markers in Finger Millet Using Next Generation Sequencing Technologies
Gimode, Davis; Odeny, Damaris A.; de Villiers, Etienne P.; Wanyonyi, Solomon; Dida, Mathews M.; Mneney, Emmarold E.; Muchugi, Alice; Machuka, Jesse; de Villiers, Santie M.
2016-01-01
Finger millet is an important cereal crop in eastern Africa and southern India with excellent grain storage quality and unique ability to thrive in extreme environmental conditions. Since negligible attention has been paid to improving this crop to date, the current study used Next Generation Sequencing (NGS) technologies to develop both Simple Sequence Repeat (SSR) and Single Nucleotide Polymorphism (SNP) markers. Genomic DNA from cultivated finger millet genotypes KNE755 and KNE796 was sequenced using both Roche 454 and Illumina technologies. Non-organelle sequencing reads were assembled into 207 Mbp representing approximately 13% of the finger millet genome. We identified 10,327 SSRs and 23,285 non-homeologous SNPs and tested 101 of each for polymorphism across a diverse set of wild and cultivated finger millet germplasm. For the 49 polymorphic SSRs, the mean polymorphism information content (PIC) was 0.42, ranging from 0.16 to 0.77. We also validated 92 SNP markers, 80 of which were polymorphic with a mean PIC of 0.29 across 30 wild and 59 cultivated accessions. Seventy-six of the 80 SNPs were polymorphic across 30 wild germplasm with a mean PIC of 0.30 while only 22 of the SNP markers showed polymorphism among the 59 cultivated accessions with an average PIC value of 0.15. Genetic diversity analysis using the polymorphic SNP markers revealed two major clusters; one of wild and another of cultivated accessions. Detailed STRUCTURE analysis confirmed this grouping pattern and further revealed 2 sub-populations within wild E. coracana subsp. africana. Both STRUCTURE and genetic diversity analysis assisted with the correct identification of the new germplasm collections. These polymorphic SSR and SNP markers are a significant addition to the existing 82 published SSRs, especially with regard to the previously reported low polymorphism levels in finger millet. 
Our results also reveal an unexploited finger millet genetic resource that can be included in the regional breeding programs in order to efficiently optimize productivity. PMID:27454301
Ferreira, Keila Adriana Magalhães; Fajardo, Emanuella Francisco; Baptista, Rodrigo P; Macedo, Andrea Mara; Lages-Silva, Eliane; Ramírez, Luis Eduardo; Pedrosa, André Luiz
2014-06-01
Trypanosoma cruzi and Trypanosoma rangeli are kinetoplastid parasites which are able to infect humans in Central and South America. Misdiagnosis between these trypanosomes can be avoided by targeting barcoding sequences or genes of each organism. This work aims to analyze the feasibility of using species-specific markers for identification of intraspecific polymorphisms and as targets for diagnostic methods by PCR. Accordingly, primers which are able to specifically detect T. cruzi or T. rangeli genomic DNA were characterized. The use of intergenic regions, generally divergent in the trypanosomatids, and the serine carboxypeptidase gene was successful. Using T. rangeli genomic sequences for the identification of group-specific polymorphisms and a polymorphic AT(n) dinucleotide repeat permitted the classification of the strains into two groups, which are entirely coincident with T. rangeli main lineages, KP1 (+) and KP1 (-), previously determined by kinetoplast DNA (kDNA) characterization. The sequences analyzed total 622 bp (382 bp represent a hypothetical protein sequence, and 240 bp represent an anonymous sequence), and of these, 581 (93.3%) are conserved sites and 41 bp (6.7%) are polymorphic, with 9 transitions (21.9%), 2 transversions (4.9%), and 30 (73.2%) insertion/deletion events. Taken together, the species-specific markers analyzed may be useful for the development of new strategies for the accurate diagnosis of infections. Furthermore, the identification of T. rangeli polymorphisms has a direct impact on the understanding of the population structure of this parasite.
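The breakdown of polymorphic sites into transitions, transversions, and insertion/deletion events follows standard definitions: a transition swaps one purine for the other (A and G) or one pyrimidine for the other (C and T), and any other base substitution is a transversion. The sketch below is an illustration of that classification only; the function names and the gap character `-` for indels are assumptions, and the example calls do not reproduce the study's alignment.

```python
from collections import Counter

PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}


def classify_variant(ref, alt):
    """Classify a single aligned column as transition, transversion, or indel.

    A gap character "-" in either position is treated as an indel
    (a hypothetical convention for this sketch).
    """
    ref, alt = ref.upper(), alt.upper()
    if ref == alt:
        raise ValueError("ref and alt are identical; not a polymorphism")
    if "-" in (ref, alt):
        return "indel"
    if ref not in PURINES | PYRIMIDINES or alt not in PURINES | PYRIMIDINES:
        raise ValueError("expected bases A, C, G, T or a gap")
    if {ref, alt} <= PURINES or {ref, alt} <= PYRIMIDINES:
        return "transition"
    return "transversion"


def tally_variants(pairs):
    """Count variant classes over an iterable of (ref, alt) pairs."""
    return Counter(classify_variant(ref, alt) for ref, alt in pairs)


# Hypothetical variant list, for illustration only.
print(tally_variants([("A", "G"), ("C", "T"), ("A", "C"), ("T", "-")]))
```

The class counts reported in the abstract (9, 2, and 30) sum to the 41 polymorphic positions, so a tally of this kind over the aligned sequences would be the natural source for the percentages quoted.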
Ruggiero, Maria Valeria; Procaccini, Gabriele
2004-01-01
Halophila stipulacea is a dioecious marine angiosperm, widely distributed along the western coasts of the Indian Ocean and the Red Sea. This species is thought to be a Lessepsian immigrant that entered the Mediterranean Sea from the Red Sea after the opening of the Suez Canal (1869). Previous studies have revealed both high phenotypic and genetic variability in Halophila stipulacea populations from the western Mediterranean basin. In order to test the hypothesis of a Lessepsian introduction, we compare genetic polymorphism between putative native (Red Sea) and introduced (Mediterranean) populations through rDNA ITS region (ITS1-5.8S-ITS2) sequence analysis. A high degree of intraindividual variability of ITS sequences was found. Most of the intragenomic polymorphism was due to pseudogenic sequences, present in almost all individuals. Features of ITS functional sequences and pseudogenes are described. Possible causes for the lack of homogenization of ITS paralogues within individuals are discussed.
NASA Astrophysics Data System (ADS)
Flot, J.-F.; Licuanan, W. Y.; Nakano, Y.; Payri, C.; Cruaud, C.; Tillier, S.
2008-12-01
The taxonomy of corals of the genus Seriatopora has not previously been studied using molecular sequence markers. As a first step toward a re-evaluation of species boundaries in this genus, mitochondrial sequence variability was analyzed in 51 samples collected from Okinawa, New Caledonia, and the Philippines. Four clusters of sequences were detected that showed little concordance with species currently recognized on a morphological basis. The most likely explanation is that the skeletal characters used for species identification are highly variable (polymorphic or phenotypically plastic); alternative explanations include introgression/hybridization, or deep coalescence and the retention of ancestral mitochondrial polymorphisms. In all individuals sequenced, two copies of trnW were found on either side of the atp8 gene near the putative D-loop, a novel mitochondrial gene arrangement that may have arisen from a duplication of the trnW-atp8 region followed by a deletion of one atp8.
Characterization of a highly polymorphic region 5′ to JH in the human immunoglobulin heavy chain
Silva, Alcino J.; Johnson, John P.; White, Raymond L.
1987-01-01
A cloned DNA segment 1.25 kilobases (kb) upstream from the joining segments of the human heavy chain immunoglobulin gene revealed extensive polymorphic variation at this locus, and the polymorphic pattern was stably transmitted to the next generation. Genomic restriction analysis showed that the polymorphism was caused by insertions/deletions within an MspI/BamHI fragment. Sequencing of one allele, 848 base pairs (bp) long, revealed eleven 50-base-pair tandem repeats. A second allele, 648 bp long, was cloned from a human genomic cosmid library, sequenced, and found to contain four fewer repeats than the first allele. A survey of 186 chromosomes from unrelated individuals of primarily northern European descent revealed at least six alleles. PMID:2884636
Genetic characterization of the UCS and Kex1 loci of Pneumocystis jirovecii.
Esteves, F; Tavares, A; Costa, M C; Gaspar, J; Antunes, F; Matos, O
2009-02-01
Nucleotide variation in the Pneumocystis jirovecii upstream conserved sequence (UCS) and kexin-like serine protease (Kex1) loci was studied in pulmonary specimens from Portuguese HIV-positive patients. DNA was extracted and used for specific molecular sequence analysis. The number of UCS tandem repeats detected in 13 successfully sequenced isolates ranged from three (9 isolates, 69%) to four (4 isolates, 31%). A novel tandem repeat pattern and two novel polymorphisms were detected in the UCS region. For the Kex1 gene, the wild-type (24 isolates, 86%) was the most frequent sequence detected among the 28 sequenced isolates. Nevertheless, a nonsynonymous (1 isolate, 3%) and three synonymous (3 isolates, 11%) polymorphisms were detected and are described here for the first time.
Jaramillo-Correa, J P; Bousquet, J; Beaulieu, J; Isabel, N; Perron, M; Bouillé, M
2003-05-01
Primers previously developed to amplify specific non-coding regions of the mitochondrial genome in Angiosperms, and new primers for additional non-coding mtDNA regions, were tested for their ability to direct DNA amplification in 12 conifer taxa and to detect sequence-tagged-site (STS) polymorphisms within and among eight species in Picea. Out of 12 primer pairs, nine were successful at amplifying mtDNA in most of the taxa surveyed. In conifers, indels and substitutions were observed for several loci, which made it possible to distinguish between families, genera and, in some cases, between species within genera. In Picea, interspecific polymorphism was detected for four loci, while intraspecific variation was observed for three of the mtDNA regions studied. One of these (SSU rRNA V1 region) exhibited indel polymorphisms, and the two others (nad1 intron b/c and nad5 intron1) revealed restriction differences after digestion with Sau3AI (PCR-RFLP). A fourth locus, the nad4L-orf25 intergenic region, showed a multibanding pattern for most of the spruce species, suggesting a possible gene duplication. Maternal inheritance, expected for mtDNA in conifers, was observed for all polymorphic markers except the intergenic region nad4L-orf25. Pooling of the variation observed with the remaining three markers resulted in two to six different mtDNA haplotypes within the different species of Picea. Evidence for intra-genomic recombination was observed in at least two taxa. Thus, these mitotypes are likely to be more informative than single-locus haplotypes. They should be particularly useful for the study of biogeography and the dynamics of hybrid zones.
Molecular characterization of the vitamin D receptor (VDR) gene in Holstein cows.
Ali, Mayar O; El-Adl, Mohamed A; Ibrahim, Hussam M M; Elseedy, Youssef Y; Rizk, Mohamed A; El-Khodery, Sabry A
2018-06-01
Vitamin D plays a vital role in calcium homeostasis, growth, and immunoregulation. Because little is known about the vitamin D receptor (VDR) gene in cattle, the aim of the present investigation was to present the molecular characterization of exons 5 and 6 of the VDR gene in Holstein cows. DNA extraction, genomic sequencing, phylogenetic analysis, synteny mapping and single nucleotide gene polymorphism analysis of the VDR gene were performed to assess blood samples collected from 50 clinically healthy Holstein cows. The results revealed the presence of a 450-base pair (bp) nucleotide sequence that resembled exons 5 and 6 with intron 5 enclosed between these exons. Sequence alignment and phylogenetic analysis revealed a close relationship between the sequenced VDR region and that found in Hereford cattle. A close association between this region and the corresponding region in small ruminants was also documented. Moreover, a single nucleotide polymorphism (SNP) that caused the replacement of a glutamate with an arginine in the deduced amino acid sequence was detected at position 7 of exon 5. In conclusion, Holstein and Hereford cattle differ with respect to exon 5 of the VDR gene. Phylogenetic analysis of the VDR gene based on nucleotide sequence produced different results from prior analyses based on amino acid sequence. Copyright © 2018 Elsevier Ltd. All rights reserved.
King, Timothy L.; Eackles, Michael S.; Reshetnikov, Andrey N.
2015-01-01
Human-mediated translocations and subsequent large-scale colonization by the invasive fish rotan (Perccottus glenii Dybowski, 1877; Perciformes, Odontobutidae), also known as Amur or Chinese sleeper, have resulted in dramatic transformations of small lentic ecosystems. However, no detailed genetic information exists on population structure, levels of effective movement, or relatedness among geographic populations of P. glenii within the European part of the range. We used massively parallel genomic DNA shotgun sequencing on the semiconductor-based Ion Torrent Personal Genome Machine (PGM) sequencing platform to identify nuclear microsatellite and mitochondrial DNA sequences in P. glenii from European Russia. Here we describe the characterization of nine nuclear microsatellite loci, ascertain levels of allelic diversity, heterozygosity, and demographic status of P. glenii collected from Ilev, Russia, one of several initial introduction points in European Russia. In addition, we mapped sequence reads to the complete P. glenii mitochondrial DNA sequence to identify polymorphic regions. Nuclear microsatellite markers developed for P. glenii yielded sufficient genetic diversity to: (1) produce unique multilocus genotypes; (2) elucidate structure among geographic populations; and (3) provide unique perspectives for analysis of population sizes and historical demographics. Among 4.9 million filtered P. glenii Ion Torrent PGM sequence reads, 11,304 mapped to the mitochondrial genome (NC_020350). This resulted in 100% coverage of this genome to a mean coverage depth of 102X. A total of 130 variable sites were observed between the publicly available genome from China and the studied composite mitochondrial genome. Among these, 82 were diagnostic and monomorphic between the mitochondrial genomes and distributed among 15 genome regions. The polymorphic sites (N = 48) were distributed among 11 mitochondrial genome regions. Our results also indicate that sequence reads generated from two three-hour runs on the Ion Torrent PGM can generate a sufficient number of nuclear and mitochondrial markers to improve understanding of the evolutionary and ecological dynamics of non-model and, in particular, invasive species.
Association of α-, β-, and γ-Synuclein With Diffuse Lewy Body Disease
Nishioka, Kenya; Wider, Christian; Vilariño-Güell, Carles; Soto-Ortolaza, Alexandra I.; Lincoln, Sarah J.; Kachergus, Jennifer M.; Jasinska-Myga, Barbara; Ross, Owen A.; Rajput, Alex; Robinson, Christopher A.; Ferman, Tanis J.; Wszolek, Zbigniew K.; Dickson, Dennis W.; Farrer, Matthew J.
2016-01-01
Objective To determine the association of the genes that encode α-, β-, and γ-synuclein (SNCA, SNCB, and SNCG, respectively) with diffuse Lewy body disease (DLBD). Design Case-control study. Subjects A total of 172 patients with DLBD consistent with a clinical diagnosis of Parkinson disease dementia/dementia with Lewy bodies and 350 clinically and 97 pathologically normal controls. Interventions Sequencing of SNCA, SNCB, and SNCG and genotyping of single-nucleotide polymorphisms performed on an Applied Biosystems capillary sequencer and a Sequenom MassArray pLEX platform, respectively. Associations were determined using χ2 or Fisher exact tests. Results Initial sequencing studies of the coding regions of each gene in 89 patients with DLBD did not detect any pathogenic substitutions. Nevertheless, genotyping of known polymorphic variability in sequence-conserved regions detected several single-nucleotide polymorphisms in the SNCA and SNCG genes that were significantly associated with disease (P=.05 to <.001). Significant association was also observed for 3 single-nucleotide polymorphisms located in SNCB when comparing DLBD cases and pathologically confirmed normal controls (P=.03-.01); however, this association was not significant for the clinical controls alone or the combined clinical and pathological controls (P>.05). After correction for multiple testing, only 1 single-nucleotide polymorphism in SNCG (rs3750823) remained significant in all of the analyses (P=.05-.009). Conclusion These findings suggest that variants in all 3 members of the synuclein gene family, particularly SNCA and SNCG, affect the risk of developing DLBD and warrant further investigation in larger, pathologically defined data sets as well as clinically diagnosed Parkinson disease/dementia with Lewy bodies case-control series. PMID:20697047
Potier, M; Dutriaux, A; Orti, R; Groet, J; Gibelin, N; Karadima, G; Lutfalla, G; Lynn, A; Van Broeckhoven, C; Chakravarti, A; Petersen, M; Nizetic, D; Delabar, J; Rossier, J
1998-08-01
Physical mapping across a duplication can be a tour de force if the region is larger than the size of a bacterial clone. This was the case of the 170- to 275-kb duplication present on the long arm of chromosome 21 in normal human at 21q11.1 (proximal region) and at 21q22.1 (distal region), which we described previously. We have constructed sequence-ready contigs of the two copies of the duplication of which all the clones are genuine representatives of one copy or the other. This required the identification of four duplicon polymorphisms that are copy-specific and nonallelic variations in the sequence of the STSs. Thirteen STSs were mapped inside the duplicated region and 5 outside but close to the boundaries. Among these STSs 10 were end clones from YACs, PACs, or cosmids, and the average interval between two markers in the duplicated region was 16 kb. Eight PACs and cosmids showing minimal overlaps were selected in both copies of the duplication. Comparative sequence analysis along the duplication showed three single-basepair changes between the two copies over 659 bp sequenced (4 STSs), suggesting that the duplication is recent (less than 4 mya). Two CpG islands were located in the duplication, but no genes were identified after a 36-kb cosmid from the proximal copy of the duplication was sequenced. The homology of this chromosome 21 duplicated region with the pericentromeric regions of chromosomes 13, 2, and 18 suggests that the mechanism involved is probably similar to pericentromeric-directed mechanisms described in interchromosomal duplications. Copyright 1998 Academic Press.
Itoh, Takehiko
2018-01-01
Batesian mimicry protects animals from predators when mimics resemble distasteful models. The female-limited Batesian mimicry in Papilio butterflies is controlled by a supergene locus switching mimetic and nonmimetic forms. In Papilio polytes, recent studies revealed that a highly diversified region (HDR) containing doublesex (dsx-HDR) constitutes the supergene with dimorphic alleles and is likely maintained by a chromosomal inversion. In the closely related Papilio memnon, which exhibits a similar mimicry polymorphism, we performed whole-genome sequence analyses in 11 butterflies, which revealed a nearly identical dsx-HDR containing three genes (dsx, Nach-like, and UXT) with dimorphic sequences strictly associated with the mimetic/nonmimetic phenotypes. In addition, expression of these genes, except that of Nach-like in female hind wings, showed differences correlated with phenotype. The dimorphic dsx-HDR in P. memnon is maintained without a chromosomal inversion, suggesting that a separate mechanism causes and maintains allelic divergence in these genes. More abundant accumulation of transposable elements and repetitive sequences in the dsx-HDR than in other genomic regions may contribute to the suppression of chromosomal recombination. Gene trees for Dsx, Nach-like, and UXT indicated that mimetic alleles evolved independently in the two Papilio species. These results suggest that the genomic region involving the above three genes has repeatedly diverged so that two allelic sequences of this region function as developmental switches for mimicry polymorphism in the two Papilio species. The supergene structures revealed here suggest that independent evolutionary processes with different genetic mechanisms have led to parallel evolution of similar female-limited polymorphisms underlying Batesian mimicry in Papilio butterflies. PMID:29675466
Weitemier, Kevin; Straub, Shannon C K; Fishbein, Mark; Liston, Aaron
2015-01-01
Despite knowledge that concerted evolution of high-copy loci is often imperfect, studies that investigate the extent of intragenomic polymorphisms and comparisons across a large number of species are rarely made. We present a bioinformatic pipeline for characterizing polymorphisms within an individual among copies of a high-copy locus. Results are presented for nuclear ribosomal DNA (nrDNA) across the milkweed genus, Asclepias. The 18S-26S portion of the nrDNA cistron of Asclepias syriaca served as a reference for assembly of the region from 124 samples representing 90 species of Asclepias. Reads were mapped back to each individual's consensus and at each position reads differing from the consensus were tallied using a custom perl script. Low frequency polymorphisms existed in all individuals (mean = 5.8%). Most nrDNA positions (91%) were polymorphic in at least one individual, with polymorphic sites being less frequent in subunit regions and loops. Highly polymorphic sites existed in each individual, with highest abundance in the "noncoding" ITS regions. Phylogenetic signal was present in the distribution of intragenomic polymorphisms across the genus. Intragenomic polymorphisms in nrDNA are common in Asclepias, being found at higher frequency than any other study to date. The high and variable frequency of polymorphisms across species highlights concerns that phylogenetic applications of nrDNA may be error-prone. The new analytical approach provided here is applicable to other taxa and other high-copy regions characterized by low coverage genome sequencing (genome skimming).
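The tallying step described above (counting, at each position, the reads that differ from an individual's consensus) can be sketched in a few lines. This is only an illustrative Python sketch, not the authors' Perl script: the function name `tally_polymorphisms` and the input layout (ungapped reads given as a 0-based start position plus sequence) are assumptions made for the example.

```python
def tally_polymorphisms(consensus, aligned_reads):
    """Return, per consensus position, the fraction of covering reads
    whose base differs from the consensus.

    aligned_reads: iterable of (start, sequence) pairs, where start is the
    0-based position of the read's first base (hypothetical, ungapped layout).
    """
    depth = [0] * len(consensus)   # reads covering each position
    diffs = [0] * len(consensus)   # reads disagreeing with the consensus
    for start, seq in aligned_reads:
        for offset, base in enumerate(seq):
            pos = start + offset
            if pos >= len(consensus):
                break              # ignore bases running past the reference
            depth[pos] += 1
            if base != consensus[pos]:
                diffs[pos] += 1
    # Positions without coverage are reported as 0.0
    return [d / n if n else 0.0 for d, n in zip(diffs, depth)]


if __name__ == "__main__":
    reads = [(0, "ACGT"), (0, "ACCT"), (1, "CGT"), (2, "CT")]
    print(tally_polymorphisms("ACGT", reads))
```

A real pipeline would read alignments from a BAM/SAM file and handle indels and base qualities; those details are left out here.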
Grossen, Christine; Keller, Lukas; Biebach, Iris; Croll, Daniel
2014-01-01
The major histocompatibility complex (MHC) is a crucial component of the vertebrate immune system and shows extremely high levels of genetic polymorphism. The extraordinary genetic variation is thought to be ancient polymorphisms maintained by balancing selection. However, introgression from related species was recently proposed as an additional mechanism. Here we provide evidence for introgression at the MHC in Alpine ibex (Capra ibex ibex). At a usually very polymorphic MHC exon involved in pathogen recognition (DRB exon 2), Alpine ibex carried only two alleles. We found that one of these DRB alleles is identical to a DRB allele of domestic goats (Capra aegagrus hircus). We sequenced 2489 bp of the coding and non-coding regions of the DRB gene and found that Alpine ibex homozygous for the goat-type DRB exon 2 allele showed nearly identical sequences (99.8%) to a breed of domestic goats. Using Sanger and RAD sequencing, microsatellite and SNP chip data, we show that the chromosomal region containing the goat-type DRB allele has a signature of recent introgression in Alpine ibex. A region of approximately 750 kb including the DRB locus showed high rates of heterozygosity in individuals carrying one copy of the goat-type DRB allele. These individuals shared SNP alleles both with domestic goats and other Alpine ibex. In a survey of four Alpine ibex populations, we found that the region surrounding the DRB allele shows strong linkage disequilibria, strong sequence clustering and low diversity among haplotypes carrying the goat-type allele. Introgression at the MHC is likely adaptive and introgression critically increased MHC DRB diversity in the genetically impoverished Alpine ibex. Our finding contradicts the long-standing view that genetic variability at the MHC is solely a consequence of ancient trans-species polymorphism. Introgression is likely an underappreciated source of genetic diversity at the MHC and other loci under balancing selection. PMID:24945814
Iehisa, Julio Cesar Masaru; Ohno, Ryoko; Kimura, Tatsuro; Enoki, Hiroyuki; Nishimura, Satoru; Okamoto, Yuki; Nasuda, Shuhei; Takumi, Shigeo
2014-01-01
The large genome and allohexaploidy of common wheat have complicated construction of a high-density genetic map. Although improvements in the throughput of next-generation sequencing (NGS) technologies have made it possible to obtain a large amount of genotyping data for an entire mapping population by direct sequencing, including hexaploid wheat, a significant number of missing data points are often apparent due to the low coverage of sequencing. In the present study, a microarray-based polymorphism detection system was developed using NGS data obtained from complexity-reduced genomic DNA of two common wheat cultivars, Chinese Spring (CS) and Mironovskaya 808. After design and selection of polymorphic probes, 13,056 new markers were added to the linkage map of a recombinant inbred mapping population between CS and Mironovskaya 808. On average, 2.49 missing data points per marker were observed in the 201 recombinant inbred lines, with a maximum of 42. Around 40% of the new markers were derived from genic regions and 11% from repetitive regions. The low number of retroelements indicated that the new polymorphic markers were mainly derived from the less repetitive region of the wheat genome. Around 25% of the mapped sequences were useful for alignment with the physical map of barley. Quantitative trait locus (QTL) analyses of 14 agronomically important traits related to flowering, spikes, and seeds demonstrated that the new high-density map showed improved QTL detection, resolution, and accuracy over the original simple sequence repeat map. PMID:24972598
Iehisa, Julio Cesar Masaru; Ohno, Ryoko; Kimura, Tatsuro; Enoki, Hiroyuki; Nishimura, Satoru; Okamoto, Yuki; Nasuda, Shuhei; Takumi, Shigeo
2014-10-01
The large genome and allohexaploidy of common wheat have complicated construction of a high-density genetic map. Although improvements in the throughput of next-generation sequencing (NGS) technologies have made it possible to obtain a large amount of genotyping data for an entire mapping population by direct sequencing, including hexaploid wheat, a significant number of missing data points are often apparent due to the low coverage of sequencing. In the present study, a microarray-based polymorphism detection system was developed using NGS data obtained from complexity-reduced genomic DNA of two common wheat cultivars, Chinese Spring (CS) and Mironovskaya 808. After design and selection of polymorphic probes, 13,056 new markers were added to the linkage map of a recombinant inbred mapping population between CS and Mironovskaya 808. On average, 2.49 missing data points per marker were observed in the 201 recombinant inbred lines, with a maximum of 42. Around 40% of the new markers were derived from genic regions and 11% from repetitive regions. The low number of retroelements indicated that the new polymorphic markers were mainly derived from the less repetitive region of the wheat genome. Around 25% of the mapped sequences were useful for alignment with the physical map of barley. Quantitative trait locus (QTL) analyses of 14 agronomically important traits related to flowering, spikes, and seeds demonstrated that the new high-density map showed improved QTL detection, resolution, and accuracy over the original simple sequence repeat map. © The Author 2014. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
2012-01-01
Background Plasmodium vivax Duffy binding protein (PvDBP) plays an essential role in erythrocyte invasion and is a potential asexual blood stage vaccine candidate antigen against P. vivax. The polymorphic nature of PvDBP, particularly of the amino-terminal cysteine-rich region (PvDBPII), represents a major impediment to the successful design of a protective vaccine against vivax malaria. In this study, the genetic polymorphism and natural selection at PvDBPII among Myanmar P. vivax isolates were analysed. Methods Fifty-four P. vivax infected blood samples collected from patients in Myanmar were used. The region flanking PvDBPII was amplified by PCR, cloned into Escherichia coli, and sequenced. The polymorphic characters and natural selection of the region were analysed using the DnaSP and MEGA4 programs. Results Thirty-two point mutations (28 non-synonymous and four synonymous mutations) were identified in PvDBPII among the Myanmar P. vivax isolates. Sequence analyses revealed that 12 different PvDBPII haplotypes were identified in Myanmar P. vivax isolates and that the region has evolved under positive natural selection. High selective pressure preferentially acted on regions identified as B- and T-cell epitopes of PvDBPII. Recombination may also have played a role in the resulting genetic diversity of PvDBPII. Conclusions PvDBPII of Myanmar P. vivax isolates displays a high level of genetic polymorphism and is under selective pressure. Myanmar P. vivax isolates share distinct types of PvDBPII alleles that are different from those of other geographical areas. These results will be useful for understanding the nature of the P. vivax population in Myanmar and for development of a PvDBPII-based vaccine. PMID:22380592
Orlovskis, Zigmunds; Canale, Maria Cristina; Haryono, Mindia; Lopes, João Roberto Spotti
2017-01-01
Background and Aims Maize bushy stunt phytoplasma (MBSP) is a bacterial pathogen of maize (Zea mays L.) across Latin America. MBSP belongs to the 16SrI-B sub-group within the genus ‘Candidatus Phytoplasma’. MBSP and its insect vector Dalbulus maidis (Hemiptera: Cicadellidae) are restricted to maize; both are thought to have coevolved with maize during its domestication from a teosinte-like ancestor. MBSP-infected maize plants show a diversity of symptoms, and it is likely that MBSP is under strong selection for increased virulence and insect transmission on maize hybrids that are widely grown in Brazil. This study investigated whether the differences in genome sequences of MBSP isolates from two maize-growing regions in South-east Brazil explain variations in symptom severity of the MBSP isolates on various maize genotypes. Methods MBSP isolates were collected from maize production fields in Guaíra and Piracicaba in South-east Brazil for infection assays. One representative isolate was chosen for de novo whole-genome assembly and for the alignment of sequence reads from the genomes of other phytoplasma isolates to detect polymorphisms. Statistical methods were applied to investigate the correlation between variations in disease symptoms of infected maize plants and MBSP sequence polymorphisms. Key Results MBSP isolates contributed consistently to organ proliferation symptoms and maize genotype to leaf necrosis, reddening and yellowing of infected maize plants. The symptom differences are associated with polymorphisms in a phase-variable lipoprotein, which is a candidate effector, and an ATP-dependent lipoprotein ABC export protein, whereas no polymorphisms were observed in other candidate effector genes. Lipoproteins and ABC export proteins activate host defence responses, regulate pathogen attachment to host cells and activate effector secretion systems in other pathogens. Conclusions Polymorphisms in two putative virulence genes among MBSP isolates from maize-growing regions in South-east Brazil are associated with variations in organ proliferation symptoms of MBSP-infected maize plants. PMID:28069632
Mitochondrial sequence analysis for forensic identification using pyrosequencing technology.
Andréasson, H; Asp, A; Alderborn, A; Gyllensten, U; Allen, M
2002-01-01
Over recent years, requests for mtDNA analysis in the field of forensic medicine have notably increased, and the results of such analyses have proved to be very useful in forensic cases where nuclear DNA analysis cannot be performed. Traditionally, mtDNA has been analyzed by DNA sequencing of the two hypervariable regions, HVI and HVII, in the D-loop. DNA sequence analysis using the conventional Sanger sequencing is very robust but time consuming and labor intensive. By contrast, mtDNA analysis based on the pyrosequencing technology provides fast and accurate results from the human mtDNA present in many types of evidence materials in forensic casework. The assay has been developed to determine polymorphic sites in the mitochondrial D-loop as well as the coding region to further increase the discrimination power of mtDNA analysis. The pyrosequencing technology for analysis of mtDNA polymorphisms has been tested with regard to sensitivity, reproducibility, and success rate when applied to control samples and actual casework materials. The results show that the method is very accurate and sensitive; the results are easily interpreted and provide a high success rate on casework samples. The panel of pyrosequencing reactions for the mtDNA polymorphisms were chosen to result in an optimal discrimination power in relation to the number of bases determined.
Analysis of MHC class I genes across horse MHC haplotypes
Tallmadge, Rebecca L.; Campbell, Julie A.; Miller, Donald C.; Antczak, Douglas F.
2010-01-01
The genomic sequences of 15 horse Major Histocompatibility Complex (MHC) class I genes and a collection of MHC class I homozygous horses of five different haplotypes were used to investigate the genomic structure and polymorphism of the equine MHC. A combination of conserved and locus-specific primers was used to amplify horse MHC class I genes with classical and non-classical characteristics. Multiple clones from each haplotype identified three to five classical sequences per homozygous animal, and two to three non-classical sequences. Phylogenetic analysis was applied to these sequences and groups were identified which appear to be allelic series, but some sequences were left ungrouped. Sequences determined from MHC class I heterozygous horses and previously described MHC class I sequences were then added, representing a total of ten horse MHC haplotypes. These results were consistent with those obtained from the MHC homozygous horses alone, and 30 classical sequences were assigned to four previously confirmed loci and three new provisional loci. The non-classical genes had few alleles and the classical genes had higher levels of allelic polymorphism. Alleles for two classical loci with the expected pattern of polymorphism were found in the majority of haplotypes tested, but alleles at two other commonly detected loci had more variation outside of the hypervariable region than within. Our data indicate that the equine Major Histocompatibility Complex is characterized by variation in the complement of class I genes expressed in different haplotypes in addition to the expected allelic polymorphism within loci. PMID:20099063
DOE Office of Scientific and Technical Information (OSTI.GOV)
Jackson, P.J.; Walthers, E.A.; Richmond, K.L.
1997-04-01
PCR analysis of 198 Bacillus anthracis isolates revealed a variable region of DNA sequence differing in length among the isolates. Five polymorphisms differed by the presence of two to six copies of the 12-bp tandem repeat 5'-CAATATCAACAA-3'. This variable-number tandem repeat (VNTR) region is located within a larger sequence containing one complete open reading frame that encodes a putative 30-kDa protein. Length variation did not change the reading frame of the encoded protein and only changed the copy number of a 4-amino-acid sequence (QYQQ) from 2 to 6. The structure of the VNTR region suggests that these multiple repeats are generated by recombination or polymerase slippage. Protein structures predicted from the reverse-translated DNA sequence suggest that any structural changes in the encoded protein are confined to the region encoded by the VNTR sequence. Copy number differences in the VNTR region were used to define five different B. anthracis alleles. Characterization of 198 isolates revealed allele frequencies of 6.1, 17.7, 59.6, 5.6, and 11.1% sequentially from shorter to longer alleles. The high degree of polymorphism in the VNTR region provides a criterion for assigning isolates to five allelic categories. There is a correlation between categories and geographic distribution. Such molecular markers can be used to monitor the epidemiology of anthrax outbreaks in domestic and native herbivore populations. 22 refs., 4 figs., 3 tabs.
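As a rough illustration of why copy-number changes in this VNTR leave the reading frame intact, the Python sketch below translates the 12-bp repeat and counts tandem copies in a sequence. The function names and the flanking bases in the usage example are hypothetical, and the codon table is deliberately partial (only the two codons the repeat needs); the abstract reports allele assignment from PCR fragment length, not from code like this.

```python
import re

REPEAT_UNIT = "CAATATCAACAA"           # 12-bp repeat quoted in the abstract
PARTIAL_CODONS = {"CAA": "Q", "TAT": "Y"}  # partial table, enough for this repeat


def translate(dna):
    """Translate in frame 0; codons outside the partial table become 'X'."""
    codons = (dna[i:i + 3] for i in range(0, len(dna) - len(dna) % 3, 3))
    return "".join(PARTIAL_CODONS.get(codon, "X") for codon in codons)


def count_tandem_copies(sequence, unit=REPEAT_UNIT):
    """Number of copies in the longest uninterrupted run of `unit`."""
    runs = re.findall(f"(?:{re.escape(unit)})+", sequence)
    return max((len(run) // len(unit) for run in runs), default=0)


if __name__ == "__main__":
    print(translate(REPEAT_UNIT))                # one repeat unit as amino acids
    amplicon = "GGG" + REPEAT_UNIT * 4 + "TTT"   # made-up flanks, 4 copies
    print(count_tandem_copies(amplicon))
```

Because the unit is 12 bp (a multiple of three), each added copy inserts exactly four codons, which is consistent with the statement that length variation does not shift the reading frame.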
Boronnikova, S V; Kalendar', R N
2010-01-01
Species-specific LTR retrotransposons were cloned for the first time in five rare relict species of medicinal plants from the Perm region. Sequences of LTR retrotransposons were used for PCR analysis based on amplification of repeated sequences from LTR or other sites of retrotransposons (IRAP). Genetic diversity was studied in six populations of the rare relict plant species Adonis vernalis L. by means of the IRAP method; 125 polymorphic IRAP markers were analyzed. Parameters for DNA polymorphism and genetic diversity of A. vernalis populations were determined.
The murine Cd48 gene: allelic polymorphism in the IgV-like region.
Cabrero, J G; Freeman, G J; Reiser, H
1998-12-01
The murine CD48 molecule is a member of the immunoglobulin superfamily which regulates the activation of T lymphocytes. Prior cloning experiments using mRNA from two different mouse strains had yielded discrepant sequences within the IgV-like domain of murine CD48. To resolve this issue, we have directly sequenced genomic DNA of 10 laboratory strains and two inbred strains of wild origin. The results of our analysis reveal an allelic polymorphism within the IgV-like domain of murine CD48.
Polymorphisms in the phosducin (PDC) gene on chromosome 1q25-32
DOE Office of Scientific and Technical Information (OSTI.GOV)
Humphries, P.; Mansergh, F.C.; Farrar, G.J.
1994-09-01
Phosducin (33 kDa protein or MEKA) is a principal water-soluble phosphoprotein in the rod and cone photoreceptor cells and pinealocytes. This protein modulates the phototransduction cascade by binding to the beta and gamma subunit complexes of transducin. The PDC gene has been mapped to 1q25-32, the region of linkage of two hereditary retinal degenerative disorders: autosomal dominant juvenile-onset open-angle glaucoma and one form of autosomal recessive RP. Using previously published sequence data, PCR primers were designed to amplify the coding and 5' flanking regions of the PDC gene. Direct sequencing revealed three polymorphisms in the 5' flanking region, two of which were in regions highly homologous between humans and mice. Analysis of the polymorphisms was then extended to larger population samples using SSCPE and denaturing gel analysis. The first polymorphism, PDC1, resulted from an insertion of a G residue at position -653/4. Allele frequencies were determined to be 0.51 (insG) and 0.49 (normal), giving a PIC value of 0.50. A deletion of a T residue at position -488 was the basis of the PDC2 polymorphism, with allele frequencies of 0.88 (normal) and 0.12 (delT) and a PIC value of 0.21. Interestingly, the allele with an inserted G residue in PDC1 always segregated with the deleted T allele in PDC2. The third polymorphism, PDC3, was caused by a T or G residue at position -1083. Allele frequencies of 0.26 (G residue) and 0.74 (T residue) were determined from an analysis of 80 individuals, with an overall PIC value of 0.39. The identification of these three polymorphisms in the PDC gene will be useful for future genetic linkage studies of chromosome 1q in inherited retinopathies.
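Marker informativeness figures like those quoted above are computed directly from allele frequencies. The Python sketch below shows two standard measures, expected heterozygosity (H = 1 - Σp²) and the polymorphism information content as defined by Botstein et al. (PIC = H - Σ 2p_i²p_j² over i < j). The function names are illustrative, and the abstract does not say which formula the authors used; for the quoted frequencies H comes out at roughly 0.50, 0.21 and 0.38, and the Botstein PIC is somewhat lower.

```python
def expected_heterozygosity(freqs):
    """H = 1 - sum of squared allele frequencies."""
    return 1.0 - sum(p * p for p in freqs)


def pic(freqs):
    """Botstein-style PIC: H minus the sum over allele pairs of 2 * p_i^2 * p_j^2."""
    correction = sum(
        2.0 * freqs[i] ** 2 * freqs[j] ** 2
        for i in range(len(freqs))
        for j in range(i + 1, len(freqs))
    )
    return expected_heterozygosity(freqs) - correction


if __name__ == "__main__":
    # Allele frequencies as quoted in the abstract
    markers = {"PDC1": [0.51, 0.49], "PDC2": [0.88, 0.12], "PDC3": [0.74, 0.26]}
    for name, freqs in markers.items():
        print(name, round(expected_heterozygosity(freqs), 3), round(pic(freqs), 3))
```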
Schoeman, Elizna M; Lopez, Genghis H; McGowan, Eunike C; Millard, Glenda M; O'Brien, Helen; Roulis, Eileen V; Liew, Yew-Wah; Martin, Jacqueline R; McGrath, Kelli A; Powley, Tanya; Flower, Robert L; Hyland, Catherine A
2017-04-01
Blood group single nucleotide polymorphism genotyping probes for a limited range of polymorphisms. This study investigated whether massively parallel sequencing (also known as next-generation sequencing), with a targeted exome strategy, provides an extended blood group genotype and the extent to which massively parallel sequencing correctly genotypes in homologous gene systems, such as RH and MNS. Donor samples (n = 28) that were extensively phenotyped and genotyped using single nucleotide polymorphism typing, were analyzed using the TruSight One Sequencing Panel and MiSeq platform. Genes for 28 protein-based blood group systems, GATA1, and KLF1 were analyzed. Copy number variation analysis was used to characterize complex structural variants in the GYPC and RH systems. The average sequencing depth per target region was 66.2 ± 39.8. Each sample harbored on average 43 ± 9 variants, of which 10 ± 3 were used for genotyping. For the 28 samples, massively parallel sequencing variant sequences correctly matched expected sequences based on single nucleotide polymorphism genotyping data. Copy number variation analysis defined the Rh C/c alleles and complex RHD hybrids. Hybrid RHD*D-CE-D variants were correctly identified, but copy number variation analysis did not confidently distinguish between D and CE exon deletion versus rearrangement. The targeted exome sequencing strategy employed extended the range of blood group genotypes detected compared with single nucleotide polymorphism typing. This single-test format included detection of complex MNS hybrid cases and, with copy number variation analysis, defined RH hybrid genes along with the RHCE*C allele hitherto difficult to resolve by variant detection. The approach is economical compared with whole-genome sequencing and is suitable for a red blood cell reference laboratory setting. © 2017 AABB.
Yi, Dong-Keun; Lee, Hae-Lim; Sun, Byung-Yun; Chung, Mi Yoon; Kim, Ki-Joong
2012-05-01
This study reports the complete chloroplast (cp) DNA sequence of Eleutherococcus senticosus (GenBank: JN 637765), an endangered endemic species. The genome is 156,768 bp in length, and contains a pair of inverted repeat (IR) regions of 25,930 bp each, a large single copy (LSC) region of 86,755 bp and a small single copy (SSC) region of 18,153 bp. The structural organization, gene and intron contents, gene order, AT content, codon usage, and transcription units of the E. senticosus chloroplast genome are similar to that of typical land plant cp DNA. We aligned and analyzed the sequences of 86 coding genes, 19 introns and 113 intergenic spacers (IGS) in three different taxonomic hierarchies; Eleutherococcus vs. Panax, Eleutherococcus vs. Daucus, and Eleutherococcus vs. Nicotiana. The distribution of indels, the number of polymorphic sites and nucleotide diversity indicate that positional constraint is more important than functional constraint for the evolution of cp genome sequences in Asterids. For example, the intron sequences in the LSC region exhibited base substitution rates 5-11-times higher than that of the IR regions, while the intron sequences in the SSC region evolved 7-14-times faster than those in the IR region. Furthermore, the Ka/Ks ratio of the gene coding sequences supports a stronger evolutionary constraint in the IR region than in the LSC or SSC regions. Therefore, our data suggest that selective sweeps by base collection mechanisms more frequently eliminate polymorphisms in the IR region than in other regions. Chloroplast genome regions that have high levels of base substitutions also show higher incidences of indels. Thirty-five simple sequence repeat (SSR) loci were identified in the Eleutherococcus chloroplast genome. Of these, 27 are homopolymers, while six are di-polymers and two are tri-polymers. 
In addition to the SSR loci, we also identified 18 medium size repeat units ranging from 22 to 79 bp, 11 of which are distributed in the IGS or intron regions. These medium size repeats may contribute to developing a cp genome-specific gene introduction vector because the region may use for specific recombination sites.
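The region sizes reported for this chloroplast genome can be cross-checked with simple arithmetic, since a quadripartite plastome consists of one LSC, one SSC and two IR copies. The short Python sketch below does that check and also sums the SSR classes; the function name is illustrative and the numbers are taken from the abstract.

```python
def quadripartite_length(lsc_bp, ssc_bp, ir_bp):
    """Total plastome length: LSC + SSC + two copies of the inverted repeat."""
    return lsc_bp + ssc_bp + 2 * ir_bp


if __name__ == "__main__":
    total = quadripartite_length(lsc_bp=86_755, ssc_bp=18_153, ir_bp=25_930)
    print(total)  # compare with the reported genome length of 156,768 bp

    # SSR loci by motif length as listed in the abstract
    ssr_classes = {"mono": 27, "di": 6, "tri": 2}
    print(sum(ssr_classes.values()))  # compare with the reported 35 SSR loci
```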
Sahana, G; Guldbrandtsen, B; Thomsen, B; Holm, L-E; Panitz, F; Brøndum, R F; Bendixen, C; Lund, M S
2014-11-01
Mastitis is a mammary disease that frequently affects dairy cattle. Despite considerable research on the development of effective prevention and treatment strategies, mastitis continues to be a significant issue in bovine veterinary medicine. To identify major genes that affect mastitis in dairy cattle, 6 chromosomal regions on Bos taurus autosome (BTA) 6, 13, 16, 19, and 20 were selected from a genome scan for 9 mastitis phenotypes using imputed high-density single nucleotide polymorphism arrays. Association analyses using sequence-level variants for the 6 targeted regions were carried out to map causal variants using whole-genome sequence data from 3 breeds. The quantitative trait loci (QTL) discovery population comprised 4,992 progeny-tested Holstein bulls, and QTL were confirmed in 4,442 Nordic Red and 1,126 Jersey cattle. The targeted regions were imputed to the sequence level. The highest association signal for clinical mastitis was observed on BTA 6 at 88.97 Mb in Holstein cattle and was confirmed in Nordic Red cattle. The peak association region on BTA 6 contained 2 genes: vitamin D-binding protein precursor (GC) and neuropeptide FF receptor 2 (NPFFR2), which, based on known biological functions, are good candidates for affecting mastitis. However, strong linkage disequilibrium in this region prevented conclusive determination of the causal gene. A different QTL on BTA 6 located at 88.32 Mb in Holstein cattle affected mastitis. In addition, QTL on BTA 13 and 19 were confirmed to segregate in Nordic Red cattle and QTL on BTA 16 and 20 were confirmed in Jersey cattle. Although several candidate genes were identified in these targeted regions, it was not possible to identify a gene or polymorphism as the causal factor for any of these regions. Copyright © 2014 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
Mokhtar, Morad M; Adawy, Sami S; El-Assal, Salah El-Din S; Hussein, Ebtissam H A
2016-01-01
The present investigation used bioinformatics tools to identify and characterize simple sequence repeats (SSRs) within the third version of the date palm genome and to develop a new SSR primer database. In addition, single nucleotide polymorphisms (SNPs) located within the SSR flanking regions were identified. Moreover, the pathways for the sequences assigned by SSR primers, their biological functions, and gene interactions were determined. A total of 172,075 SSR motifs were identified in the date palm genome sequence, with a frequency of 450.97 SSRs per Mb. Of these, 130,014 SSRs (75.6%) were located within intergenic regions, with a frequency of 499 SSRs per Mb, while only 42,061 SSRs (24.4%) were located within genic regions, with a frequency of 347.5 SSRs per Mb. A total of 111,403 SSR primer pairs were designed, representing 291.9 SSR primers per Mb. Of the 111,403, only 31,380 SSR primers were in genic regions, while 80,023 primers were in intergenic regions. A total of 250,507 SNPs were identified in 84,172 SSR flanking regions, which represent 75.55% of the total SSR flanking regions. Out of 12,274 genes, only 463 genes comprising 896 SSR primers were mapped onto 111 pathways using the KEGG database. The most abundant enzymes were identified in the pathway related to the biosynthesis of antibiotics. We tested 1031 SSR primers using both publicly available date palm genome sequences as templates in in silico PCR reactions. For in vitro validation, 31 SSR primers among those used in the in silico PCR were synthesized and tested for their ability to detect polymorphism among six Egyptian date palm cultivars. All tested primers successfully amplified products, but only 18 primers detected polymorphic amplicons among the studied date palm cultivars.
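The abstract does not say which tool or thresholds were used to call SSRs, so the following is only a hedged sketch of how perfect SSRs and their per-Mb density could be computed. The minimum repeat counts in `DEFAULT_MIN_REPEATS`, the function names, and the toy sequence are all assumptions made for illustration.

```python
import re

# Assumed minimum repeat counts per motif length (hypothetical thresholds).
DEFAULT_MIN_REPEATS = {1: 10, 2: 6, 3: 5, 4: 5, 5: 5, 6: 5}


def find_ssrs(seq, min_repeats=None):
    """Return sorted (start, motif, repeat_count) tuples for perfect SSRs."""
    if min_repeats is None:
        min_repeats = DEFAULT_MIN_REPEATS
    seq = seq.upper()
    hits = []
    for size, reps in sorted(min_repeats.items()):
        # One motif copy followed by at least (reps - 1) further copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (size, reps - 1))
        for match in pattern.finditer(seq):
            motif = match.group(1)
            # Skip motifs that are repeats of a shorter unit (e.g. "AA", "ATAT").
            if any(motif == motif[:k] * (size // k)
                   for k in range(1, size) if size % k == 0):
                continue
            hits.append((match.start(), motif, len(match.group(0)) // size))
    return sorted(hits)


def ssr_density_per_mb(ssr_count, sequence_length_bp):
    """SSRs per megabase for a sequence of the given length."""
    return ssr_count / (sequence_length_bp / 1_000_000)


# Hypothetical toy sequence: a 12-nt poly-A run and an (AG)7 repeat.
toy_seq = "GG" + "A" * 12 + "CT" + "AG" * 7 + "TTC"
toy_hits = find_ssrs(toy_seq)
print(toy_hits)
```

Applied to the reported totals, the same density formula implies that 172,075 SSRs at 450.97 SSRs per Mb correspond to roughly 382 Mb of analysed sequence.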
Molecular identification of Mango, Mangifera indica L.var. totupura
Jagarlamudi, Sankar; G, Rosaiah; Kurapati, Ravi Kumar; Pinnamaneni, Rajasekhar
2011-01-01
Mango (Mangifera indica), belonging to the family Anacardiaceae, is a fruit that grows in tropical regions and is considered the king of fruits. The present work was undertaken to identify a tool for identifying the mango species at the molecular level. The chloroplast trnL-F region was amplified from extracted total genomic DNA using the polymerase chain reaction (PCR) and sequenced. The sequence of the dominant DGGE band confirmed that the tested leaves were Mangifera indica (100% similarity to the ITS sequences of Mangifera indica). This sequence was deposited in NCBI with the accession no. GQ927757. Abbreviations: AFLP - Amplified fragment length polymorphism, cpDNA - Chloroplast DNA, DGGE - Denaturing gradient gel electrophoresis, DNA - Deoxyribonucleic acid, EDTA - Ethylenediamine tetraacetic acid, HCl - Hydrochloric acid, ISSR - Inter simple sequence repeats, ITS - Internal transcribed spacer, MATAB - Methyl Ammonium Bromide, Na2SO3 - Sodium sulphite, NaCl - Sodium chloride, NCBI - National Center for Biotechnology Information, PCR - Polymerase chain reaction, PEG - Polyethylene glycol, RAPD - Randomly amplified polymorphic DNA, trnL-F - Transfer RNA genes start codon- termination codon. PMID:21423885
Botti, Sara; Giuffra, Elisabetta
2010-08-23
DNA barcodes are a global standard for species identification and have countless applications in the medical, forensic and alimentary fields, but few barcoding methods work efficiently in samples in which DNA is degraded, e.g. foods and archival specimens. This limits the choice of target regions harbouring a sufficient number of diagnostic polymorphisms. The method described here uses existing PCR and sequencing methodologies to detect mitochondrial DNA polymorphisms in complex matrices such as foods. The reported application allowed discrimination among 17 fish species of the Scombridae family with high commercial interest, such as mackerels, bonitos and tunas, which are often present in processed seafood. The approach can be easily upgraded with the release of new genetic diversity information to increase the range of detected species. Cocktails of primers are designed for PCR using publicly available sequences of the target region. They are composed of a fixed 5' region and of variable 3' cocktail portions that allow amplification of any member of a group of species of interest. The population of short amplicons is directly sequenced and indexed using primers containing a longer 5' region and the non-polymorphic part of the cocktail portion. A 226 bp region of CytB was selected as target after collection and screening of 148 online sequences; 85 SNPs were found, of which 75 were present in at least two sequences. Primers were also designed for two shorter sub-fragments that could be amplified from highly degraded samples. The test was used on 103 samples of seafood (canned tuna and scomber, tuna salad, tuna sauce) and could successfully detect the presence of different or additional species that were not identified on the labelling of canned tuna, tuna salad and sauce samples. The described method is largely independent of the degree of degradation of the DNA source and can thus be applied to processed seafood. 
Moreover, the method is highly flexible: publicly available sequence information on mitochondrial genomes is rapidly increasing for most species, facilitating the choice of target sequences and the improvement of resolution of the test. This is particularly important for discrimination of marine and aquaculture species for which genome information is still limited.
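The screening step in this abstract (85 SNPs in 148 sequences, 75 of them seen in at least two sequences) can be illustrated with a small sketch. It assumes an ungapped alignment and reads "present in at least two sequences" as the second most frequent base occurring in two or more sequences; that reading, the function name `classify_snps`, and the toy data are assumptions and not the authors' procedure.

```python
from collections import Counter


def classify_snps(aligned):
    """Return (all_snp_positions, shared_snp_positions) for an alignment.

    A position is a SNP if at least two different bases are observed.
    It is 'shared' if the second most frequent base occurs in >= 2 sequences
    (an assumed interpretation of the wording in the abstract).
    """
    length = len(aligned[0])
    if any(len(s) != length for s in aligned):
        raise ValueError("sequences must be aligned to equal length")

    all_sites = []
    shared_sites = []
    for pos in range(length):
        # Ignore ambiguity codes and gaps when counting alleles.
        counts = Counter(s[pos] for s in aligned if s[pos] in "ACGT")
        if len(counts) < 2:
            continue
        all_sites.append(pos)
        second_most_common_count = counts.most_common()[1][1]
        if second_most_common_count >= 2:
            shared_sites.append(pos)
    return all_sites, shared_sites


# Hypothetical toy alignment: positions 1 and 5 are shared SNPs,
# position 3 is a singleton seen in one sequence only.
toy = ["ACGTAC", "ACGTAT", "ATGTAT", "ATGAAC"]
snps, shared = classify_snps(toy)
print(snps, shared)
```

Singleton variants are kept in the first list but excluded from the second, which is one plausible way to separate likely sequencing errors from diagnostic polymorphisms.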
Fritz, David T; Jiang, Shan; Xu, Junwang; Rogers, Melissa B
2006-07-01
The bone morphogenetic protein (BMP)2 gene has been genetically linked to osteoporosis and osteoarthritis. We have shown that the 3'-untranslated regions (UTR) of BMP2 genes from mammals to fishes are extraordinarily conserved. This indicates that the BMP2 3'-UTR is under stringent selective pressure. We present evidence that the conserved region is a strong posttranscriptional regulator of BMP2 expression. Polymorphisms in cis-regulatory elements have been proven to influence susceptibility to a growing number of diseases. A common single nucleotide polymorphism (SNP) disrupts a putative posttranscriptional regulatory motif, an AU-rich element, within the BMP2 3'-UTR. The affinity of specific proteins for the rs15705 SNP sequence differs from their affinity for the normal human sequence. More importantly, the in vitro decay rate of RNAs with the SNP is higher than that of RNAs with the normal sequence. Such changes in mRNA:protein interactions may influence the posttranscriptional mechanisms that control BMP2 gene expression. The consequent alterations in BMP2 protein levels may influence the development or physiology of bone or other BMP2-influenced tissues.
A complex dominance hierarchy is controlled by polymorphism of small RNAs and their targets.
Yasuda, Shinsuke; Wada, Yuko; Kakizaki, Tomohiro; Tarutani, Yoshiaki; Miura-Uno, Eiko; Murase, Kohji; Fujii, Sota; Hioki, Tomoya; Shimoda, Taiki; Takada, Yoshinobu; Shiba, Hiroshi; Takasaki-Yasuda, Takeshi; Suzuki, Go; Watanabe, Masao; Takayama, Seiji
2016-12-22
In diploid organisms, phenotypic traits are often biased by effects known as Mendelian dominant-recessive interactions between inherited alleles. Phenotypic expression of SP11 alleles, which encode the male determinants of self-incompatibility in Brassica rapa, is governed by a complex dominance hierarchy [1-3]. Here, we show that a single polymorphic 24 nucleotide small RNA, named SP11 methylation inducer 2 (Smi2), controls the linear dominance hierarchy of the four SP11 alleles (S44 > S60 > S40 > S29). In all dominant-recessive interactions, small RNA variants derived from the linked region of dominant SP11 alleles exhibited high sequence similarity to the promoter regions of recessive SP11 alleles and acted in trans to epigenetically silence their expression. Together with our previous study [4], we propose a new model: sequence similarity between polymorphic small RNAs and their target regulates mono-allelic gene expression, which explains the entire five-phased linear dominance hierarchy of the SP11 phenotypic expression in Brassica.
Contrasting Histories of Three Gene Regions Associated with In(3l)payne of Drosophila Melanogaster
Hasson, E.; Eanes, W. F.
1996-01-01
In the present report, we studied nucleotide variation in three gene regions of Drosophila melanogaster, spanning >5 kb and showing different degrees of association with the cosmopolitan inversion In(3-L)Payne. The analysis of sequence variation in the regions surrounding the breakpoints and the heat shock 83 (Hsp83) gene locus, located close to the distal breakpoint, revealed the absence of shared polymorphisms and the presence of a number of fixed differences between arrangements, indicating absence of genetic exchange. In contrast, for the esterase-6 gene region, located in the center of the inversion, we observed the presence of shared polymorphisms between arrangements suggesting genetic exchange. In the regions close to the breakpoints, the common St arrangement is 10 times more polymorphic than inverted chromosomes. We propose that the lack of recombination between arrangements in these regions coupled with genetic hitchhiking is the best explanation for the low heterozygosity observed in inverted lines. Using the data for the breakpoints, we estimate that this inversion polymorphism is around 0.36 million yr old. Although it is widely accepted that inversions are examples of balanced polymorphisms, none of the current neutrality tests including our Monte Carlo simulations showed significant departure from neutral expectations. PMID:8978045
Herrnstadt, Corinna; Elson, Joanna L; Fahy, Eoin; Preston, Gwen; Turnbull, Douglass M; Anderson, Christen; Ghosh, Soumitra S; Olefsky, Jerrold M; Beal, M Flint; Davis, Robert E; Howell, Neil
2002-05-01
The evolution of the human mitochondrial genome is characterized by the emergence of ethnically distinct lineages or haplogroups. Nine European, seven Asian (including Native American), and three African mitochondrial DNA (mtDNA) haplogroups have been identified previously on the basis of the presence or absence of a relatively small number of restriction-enzyme recognition sites or on the basis of nucleotide sequences of the D-loop region. We have used reduced-median-network approaches to analyze 560 complete European, Asian, and African mtDNA coding-region sequences from unrelated individuals to develop a more complete understanding of sequence diversity both within and between haplogroups. A total of 497 haplogroup-associated polymorphisms were identified, 323 (65%) of which were associated with one haplogroup and 174 (35%) of which were associated with two or more haplogroups. Approximately one-half of these polymorphisms are reported for the first time here. Our results confirm and substantially extend the phylogenetic relationships among mitochondrial genomes described elsewhere from the major human ethnic groups. Another important result is that there were numerous instances both of parallel mutations at the same site and of reversion (i.e., homoplasy). It is likely that homoplasy in the coding region will confound evolutionary analysis of small sequence sets. By a linkage-disequilibrium approach, additional evidence for the absence of human mtDNA recombination is presented here.
Straub, Shannon C.K.; Fishbein, Mark; Liston, Aaron
2015-01-01
Despite knowledge that concerted evolution of high-copy loci is often imperfect, studies that investigate the extent of intragenomic polymorphisms and comparisons across a large number of species are rarely made. We present a bioinformatic pipeline for characterizing polymorphisms within an individual among copies of a high-copy locus. Results are presented for nuclear ribosomal DNA (nrDNA) across the milkweed genus, Asclepias. The 18S-26S portion of the nrDNA cistron of Asclepias syriaca served as a reference for assembly of the region from 124 samples representing 90 species of Asclepias. Reads were mapped back to each individual’s consensus and at each position reads differing from the consensus were tallied using a custom perl script. Low frequency polymorphisms existed in all individuals (mean = 5.8%). Most nrDNA positions (91%) were polymorphic in at least one individual, with polymorphic sites being less frequent in subunit regions and loops. Highly polymorphic sites existed in each individual, with highest abundance in the “noncoding” ITS regions. Phylogenetic signal was present in the distribution of intragenomic polymorphisms across the genus. Intragenomic polymorphisms in nrDNA are common in Asclepias, being found at higher frequency than any other study to date. The high and variable frequency of polymorphisms across species highlights concerns that phylogenetic applications of nrDNA may be error-prone. The new analytical approach provided here is applicable to other taxa and other high-copy regions characterized by low coverage genome sequencing (genome skimming). PMID:25653903
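The abstract describes mapping reads back to each individual's consensus and tallying, at each position, the reads that differ (done in the study with a custom Perl script that is not shown). The following is a minimal Python sketch of that tallying idea only. It assumes reads arrive as ungapped `(start, sequence)` pairs with non-negative 0-based starts, and the 2% reporting threshold is a hypothetical value rather than the one used by the authors.

```python
def tally_differences(consensus, mapped_reads):
    """Per-position fraction of reads that differ from the consensus.

    mapped_reads: iterable of (start, read_sequence) with 0-based,
    non-negative starts and no indels (an assumed simplification).
    """
    depth = [0] * len(consensus)
    differing = [0] * len(consensus)
    for start, read in mapped_reads:
        for offset, base in enumerate(read):
            pos = start + offset
            if pos >= len(consensus):
                break  # read overhangs the end of the reference
            depth[pos] += 1
            if base != consensus[pos]:
                differing[pos] += 1
    return [d / n if n else 0.0 for d, n in zip(differing, depth)]


def polymorphic_positions(frequencies, threshold=0.02):
    """Positions whose non-consensus read fraction reaches the threshold."""
    return [i for i, f in enumerate(frequencies) if f >= threshold]


# Hypothetical toy data: one read carries a T at position 2 (consensus G).
toy_consensus = "ACGTAC"
toy_reads = [(0, "ACGT"), (1, "CTTA"), (2, "GTAC"), (0, "ACG")]
freqs = tally_differences(toy_consensus, toy_reads)
print(freqs)
print(polymorphic_positions(freqs))
```

A real pipeline would read alignments from a SAM/BAM file and handle indels and base qualities; those steps are deliberately left out of this sketch.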
Patil, Tejas Suresh; Tamboli, Asif Shabodin; Patil, Swapnil Mahadeo; Bhosale, Amrut Ravindra; Govindwar, Sanjay Prabhu; Muley, Dipak Vishwanathrao
2016-01-01
The genera Nemacheilus, Nemachilichthys and Schistura belong to the family Nemacheilidae of the order Cypriniformes. The present investigation was undertaken to assess genetic diversity and phylogenetic relationships and to develop a molecular tool for taxonomic identification. For this purpose, four different types of molecular markers were used: 29 random amplified polymorphic DNA (RAPD) markers, 25 inter-simple sequence repeat (ISSR) markers, and 10 amplified fragment length polymorphism (AFLP) marker sets were screened, and the mitochondrial COI gene was sequenced. This study added COI barcodes for the identification of Nemacheilus anguilla, Nemachilichthys rueppelli and Schistura denisoni. RAPD showed higher polymorphism (100%) than ISSR (93.75-100%) and AFLP (93.86-98.96%). The polymorphic information content (PIC), heterozygosity, multiplex ratio, and gene diversity were highest for AFLP primers, whereas the major allele frequency was highest for RAPD (0.5556) and lowest for AFLP (0.1667). The COI region of all individuals was successfully amplified and sequenced, which gave a 100% species resolution. Copyright © 2016 Académie des sciences. Published by Elsevier SAS. All rights reserved.
Dushyanth, K; Bhattacharya, T K; Shukla, R; Chatterjee, R N; Sitaramamma, T; Paswan, C; Guru Vishnu, P
2016-10-01
Myostatin is a member of TGF-β super family and is directly involved in regulation of body growth through limiting muscular growth. A study was carried out in three chicken lines to identify the polymorphism in the coding region of the myostatin gene through SSCP and DNA sequencing. A total of 12 haplotypes were observed in myostatin coding region of chicken. Significant associations between haplogroups with body weight at day 1, 14, 28, and 42 days, and carcass traits at 42 days were observed across the lines. It is concluded that the coding region of myostatin gene was polymorphic, with varied levels of expression among lines and had significant effects on growth traits. The expression of MSTN gene varied during embryonic and post hatch development stage.
Ramu, P; Kassahun, B; Senthilvel, S; Ashok Kumar, C; Jayashree, B; Folkertsma, R T; Reddy, L Ananda; Kuruvinashetti, M S; Haussmann, B I G; Hash, C T
2009-11-01
The sequencing and detailed comparative functional analysis of genomes of a number of select botanical models open new doors into comparative genomics among the angiosperms, with potential benefits for improvement of many orphan crops that feed large populations. In this study, a set of simple sequence repeat (SSR) markers was developed by mining the expressed sequence tag (EST) database of sorghum. Among the SSR-containing sequences, only those sharing considerable homology with rice genomic sequences across the lengths of the 12 rice chromosomes were selected. Thus, 600 SSR-containing sorghum EST sequences (50 homologous sequences on each of the 12 rice chromosomes) were selected, with the intention of providing coverage for corresponding homologous regions of the sorghum genome. Primer pairs were designed and polymorphism detection ability was assessed using parental pairs of two existing sorghum mapping populations. About 28% of these new markers detected polymorphism in this 4-entry panel. A subset of 55 polymorphic EST-derived SSR markers were mapped onto the existing skeleton map of a recombinant inbred population derived from cross N13 x E 36-1, which is segregating for Striga resistance and the stay-green component of terminal drought tolerance. These new EST-derived SSR markers mapped across all 10 sorghum linkage groups, mostly to regions expected based on prior knowledge of rice-sorghum synteny. The ESTs from which these markers were derived were then mapped in silico onto the aligned sorghum genome sequence, and 88% of the best hits corresponded to linkage-based positions. This study demonstrates the utility of comparative genomic information in targeted development of markers to fill gaps in linkage maps of related crop species for which sufficient genomic tools are not available.
Nowacka-Woszuk, Joanna; Switonski, Marek
2009-01-01
The sex determination process is under the control of several genes of which two (SRY and SOX9), encoding transcription factors, play a crucial role. It is well-known that mutations at these genes may cause the development of an intersexual phenotype. The aim of this study was to conduct a comparative analysis of the coding sequence and 5'-flanking regions of both genes in four species of the family Canidae (the dog, red fox, arctic fox and Chinese raccoon dog). Similarity of the coding sequence of the SOX9 gene among the studied species was higher (99.7-99.9%) than in the case of the SRY gene (96.7-97.3%). Only single nucleotide changes were found in the compared coding sequences, whereas in the 5'-flanking region of both genes nucleotide substitutions, as well as insertions and deletions were observed. None of the changes detected in the 5'-flanking region occurred within the potential consensus sequences for transcription factors. No polymorphism was found for either of these genes in any of the analyzed species.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Toye, P.G.; Metzelaar, M.J.; Wijngaard, P.L.J.
1995-08-01
Theileria parva, a tick-transmitted protozoan parasite related to Plasmodium spp., causes the disease East Coast fever, an acute and usually fatal lymphoproliferative disorder of cattle in Africa. Previous studies using sera from cattle that have survived infection identified a polymorphic immunodominant molecule (PIM) that is expressed by both the infective sporozoite stage of the parasite and the intracellular schizont. Here we show that mAb specific for the PIM Ag can inhibit sporozoite invasion of lymphocytes in vitro. A cDNA clone encoding the PIM Ag of the T. parva (Muguga) stock was obtained by using these mAb in a novel eukaryotic expression cloning system that allows isolation of cDNA encoding cytoplasmic or surface Ags. To establish the molecular basis of the polymorphism of PIM, the cDNA of the PIM Ag from a buffalo-derived T. parva stock was isolated and its sequence was compared with that of the cattle-derived Muguga PIM. The two cDNAs showed considerable identity in both the 5' and 3' regions, but there was substantial sequence divergence in the central regions. Several types of repeated sequences were identified in the variant regions. In the Muguga form of the molecule, there were five tandem repeats of the tetrapeptide, QPEP, that were shown, by transfection of a deleted version of the PIM gene, not to react with several anti-PIM mAbs. By isolating and sequencing the genomic version of the gene, we identified two small introns in the 3' region of the gene. Finally, we showed that polyclonal rat Abs against recombinant PIM neutralize sporozoite infectivity in vitro, suggesting that the PIM Ag should be evaluated for its capacity to immunize cattle against East Coast fever.
Ait-Arkoub, Zaïna; Voujon, Delphine; Deback, Claire; Abrao, Emiliana P.; Agut, Henri; Boutolleau, David
2013-01-01
The complete 154-kbp linear double-stranded genomic DNA sequence of herpes simplex virus 2 (HSV-2), consisting of two extended regions of unique sequences bounded by a pair of inverted repeat elements, was published in 1998 and since then has been widely employed in a wide range of studies. Throughout the HSV-2 genome are scattered 150 microsatellites (also referred to as short tandem repeats) of 1- to 6-nucleotide motifs, mainly distributed in noncoding regions. Microsatellites are considered reliable markers for genetic mapping to differentiate herpesvirus strains, as shown for cytomegalovirus and HSV-1. The aim of this work was to characterize 12 polymorphic microsatellites within the HSV-2 genome by use of 3 multiplex PCR assays in combination with length polymorphism analysis for the rapid genetic differentiation of 56 HSV-2 clinical isolates and 2 HSV-2 laboratory strains (gHSV-2 and MS). This new system was applied to a specific new HSV-2 variant recently identified in HIV-1-infected patients originating from West Africa. Our results confirm that microsatellite polymorphism analysis is an accurate tool for studying the epidemiology of HSV-2 infections. PMID:23966512
Kawaguchi, Fuki; Kigoshi, Hiroto; Nakajima, Ayaka; Matsumoto, Yuta; Uemoto, Yoshinobu; Fukushima, Moriyuki; Yoshida, Emi; Iwamoto, Eiji; Akiyama, Takayuki; Kohama, Namiko; Kobayashi, Eiji; Honda, Takeshi; Oyama, Kenji; Mannen, Hideyuki; Sasazaki, Shinji
2018-05-17
Fatty acid composition is an important indicator of beef quality. The objective of this study was to search the potential candidate region for fatty acid composition. We performed pool-based genome-wide association studies (GWAS) for oleic acid percentage (C18:1) in a Japanese Black cattle population from the Hyogo prefecture. GWAS analysis revealed two novel candidate regions on BTA9 and BTA14. The most significant single nucleotide polymorphisms (SNPs) in each region were genotyped in a population (n = 899) to verify their effect on C18:1. Statistical analysis revealed that both SNPs were significantly associated with C18:1 (p = .0080 and .0003), validating the quantitative trait loci (QTLs) detected in GWAS. We subsequently selected VNN1 and LYPLA1 genes as candidate genes from each region on BTA9 and BTA14, respectively. We sequenced full-length coding sequence (CDS) of these genes in eight individuals and identified a nonsynonymous SNP T66M on VNN1 gene as a putative candidate polymorphism. The polymorphism was also significantly associated with C18:1, but the p value (p = .0162) was higher than the most significant SNP on BTA9, suggesting that it would not be responsible for the QTL. Although further investigation will be needed to determine the responsible gene and polymorphism, our findings would contribute to development of selective markers for fatty acid composition in the Japanese Black cattle of Hyogo. © 2018 Japanese Society of Animal Science.
von Kohn, Christopher; Kiełkowska, Agnieszka; Havey, Michael J
2013-12-01
Male-sterile (S) cytoplasm of onion is an alien cytoplasm introgressed into onion in antiquity and is widely used for hybrid seed production. Owing to the biennial generation time of onion, classical crossing takes at least 4 years to classify cytoplasms as S or normal (N) male-fertile. Molecular markers in the organellar DNAs that distinguish N and S cytoplasms are useful to reduce the time required to classify onion cytoplasms. In this research, we completed next-generation sequencing of the chloroplast DNAs of N- and S-cytoplasmic onions; we assembled and annotated the genomes in addition to identifying polymorphisms that distinguish these cytoplasms. The sizes (153 538 and 153 355 base pairs) and GC contents (36.8%) were very similar for the chloroplast DNAs of N and S cytoplasms, respectively, as expected given their close phylogenetic relationship. The size difference was primarily due to small indels in intergenic regions and a deletion in the accD gene of N-cytoplasmic onion. The structures of the onion chloroplast DNAs were similar to those of most land plants with large and small single copy regions separated by inverted repeats. Twenty-eight single nucleotide polymorphisms, two polymorphic restriction-enzyme sites, and one indel distributed across 20 chloroplast genes in the large and small single copy regions were selected and validated using diverse onion populations previously classified as N or S cytoplasmic using restriction fragment length polymorphisms. Although cytoplasmic male sterility is likely associated with the mitochondrial DNA, maternal transmission of the mitochondrial and chloroplast DNAs allows for polymorphisms in either genome to be useful for classifying onion cytoplasms to aid the development of hybrid onion cultivars.
Jaratlerdsiri, Weerachai; Isberg, Sally R.; Higgins, Damien P.; Miles, Lee G.; Gongora, Jaime
2014-01-01
Major Histocompatibility Complex (MHC) class II genes encode for molecules that aid in the presentation of antigens to helper T cells. MHC characterisation within and between major vertebrate taxa has shed light on the evolutionary mechanisms shaping the diversity within this genomic region, though little characterisation has been performed within the Order Crocodylia. Here we investigate the extent and effect of selective pressures and trans-species polymorphism on MHC class II α and β evolution among 20 extant species of Crocodylia. Selection detection analyses showed that diversifying selection influenced MHC class II β diversity, whilst diversity within MHC class II α is the result of strong purifying selection. Comparison of translated sequences between species revealed the presence of twelve trans-species polymorphisms, some of which appear to be specific to the genera Crocodylus and Caiman. Phylogenetic reconstruction clustered MHC class II α sequences into two major clades representing the families Crocodilidae and Alligatoridae. However, no further subdivision within these clades was evident and, based on the observation that most MHC class II α sequences shared the same trans-species polymorphisms, it is possible that they correspond to the same gene lineage across species. In contrast, phylogenetic analyses of MHC class II β sequences showed a mixture of subclades containing sequences from Crocodilidae and/or Alligatoridae, illustrating orthologous relationships among those genes. Interestingly, two of the subclades containing sequences from both Crocodilidae and Alligatoridae shared specific trans-species polymorphisms, suggesting that they may belong to ancient lineages pre-dating the divergence of these two families from the common ancestor 85–90 million years ago. The results presented herein provide an immunogenetic resource that may be used to further assess MHC diversity and functionality in Crocodylia. PMID:24503938
Genome skimming identifies polymorphism in tern populations and species
2012-01-01
Background: Terns (Charadriiformes: Sterninae) are a lineage of cosmopolitan shorebirds with a disputed evolutionary history that comprises several species of conservation concern. As a non-model system in genetics, previous study has left most of the nuclear genome unexplored, and population-level studies are limited to only 15% of the world's species of terns and noddies. Screening of polymorphic nuclear sequence markers is needed to enhance genetic resolution because of supposed low mitochondrial mutation rate, documentation of nuclear insertion of hypervariable mitochondrial regions, and limited success of microsatellite enrichment in terns. Here, we investigated the phylogenetic and population genetic utility for terns and relatives of a variety of nuclear markers previously developed for other birds and spanning the nuclear genome. Markers displaying a variety of mutation rates from both the nuclear and mitochondrial genome were tested and prioritized according to optimal cross-species amplification and extent of genetic polymorphism between (1) the main tern clades and (2) individual Royal Terns (Thalasseus maxima) breeding on the US East Coast. Results: Results from this genome skimming effort yielded four new nuclear sequence-based markers for tern phylogenetics and 11 intra-specific polymorphic markers. Further, comparison between the two genomes indicated a phylogenetic conflict at the base of terns, involving the inclusion (mitochondrial) or exclusion (nuclear) of the Angel Tern (Gygis alba). Although limited mitochondrial variation was confirmed, both nuclear markers and a short tandem repeat in the mitochondrial control region indicated the presence of considerable genetic variation in Royal Terns at a regional scale. Conclusions: These data document the value of intronic markers to the study of terns and allies. 
We expect that these and additional markers attained through next-generation sequencing methods will accurately map the genetic origin and species history of this group of birds. PMID:22333071
DOE Office of Scientific and Technical Information (OSTI.GOV)
Lin, Biaoyang; Nasir, J.; Kalchman, M.A.
1995-02-10
We have previously cloned and characterized the murine homologue of the Huntington disease (HD) gene and shown that it maps to mouse chromosome 5 within a region of conserved synteny with human chromosome 4p16.3. Here we present a detailed comparison of the sequence of the putative promoter and the organization of the 5' genomic region of the murine (Hdh) and human HD genes encompassing the first five exons. We show that in this region these two genes share identical exon boundaries, but have different-size introns. Two dinucleotide (CT) and one trinucleotide intronic polymorphism in Hdh and an intronic CA polymorphism in the HD gene were identified. Comparison of 940-bp sequence 5' to the putative translation start site reveals a highly conserved region (78.8% nucleotide identity) between Hdh and the HD gene from nucleotide -56 to -206 (of Hdh). Neither Hdh nor the HD gene have typical TATA or CCAAT elements, but both show one putative AP2 binding site and numerous potential Sp1 binding sites. The high sequence identity between Hdh and the HD gene for approximately 200 bp 5' to the putative translation start site indicates that these sequences may play a role in regulating expression of the Huntington disease gene. 30 refs., 4 figs., 2 tabs.
WANG, ZHANG-YANG; HONG, WEI-LONG; ZHU, ZHE-HUI; CHEN, YUN-HAO; YE, WEN-LE; CHU, GUANG-YU; LI, JIA-LIN; CHEN, BI-CHENG; XIA, PENG
2015-01-01
BK polyomavirus (BKV) is an important pathogen for kidney transplant recipients, as it is frequently re-activated, leading to nephropathy. The aim of this study was to investigate the phylogenetic reconstruction and polymorphism of the VP2 gene in BKV isolated from Chinese kidney transplant recipients. Phylogenetic analysis was carried out in the VP2 region from 135 BKV-positive samples and 28 reference strains retrieved from GenBank. The unweighted pair-group method with arithmetic mean (UPGMA) grouped all strains into subtypes, but failed to subdivide strains into subgroups. Among the plasma and urine samples, all plasma (23/23) and 82 urine samples (82/95) were identified to contain subtype I; the other 10 urine samples contained subtype IV. An 86-bp fragment was identified as a highly conserved sequence. Following alignment with 36 published BKV sequences from China, 92 sites of polymorphism were identified, including 11 single nucleotide polymorphisms (SNPs) prevalent in Chinese individuals and 30 SNPs that were specific to the two predominant subtypes I and IV. The limitations of the VP2 gene segment in subgrouping were confirmed by phylogenetic analysis. The conserved sequence and polymorphism identified in this study may be helpful in the detection and genotyping of BKV. PMID:26640547
Molecular analysis of the glucocerebrosidase gene locus
DOE Office of Scientific and Technical Information (OSTI.GOV)
Winfield, S.L.; Martin, B.M.; Fandino, A.
1994-09-01
Gaucher disease is due to a deficiency in the activity of the lysosomal enzyme glucocerebrosidase. Both the functional gene for this enzyme and a pseudogene are located in close proximity on chromosome 1q21. Analysis of the mutations present in patient samples has suggested interaction between the functional gene and the pseudogene in the origin of mutant genotypes. To investigate the involvement of regions flanking the functional gene and pseudogene in the origin of mutations found in Gaucher disease, a YAC clone containing DNA from this locus has been subcloned and characterized. The original YAC containing approximately 360 kb was truncated with the use of fragmentation plasmids to about 85 kb. A lambda library derived from this YAC was screened to obtain clones containing glucocerebrosidase sequences. PCR amplification was used to identify subclones containing 5′, central, or 3′ sequences of the functional gene or of the pseudogene. Clones spanning the entire distance from the last exon of the functional gene to intron 1 of the pseudogene, the 5′ end of the functional gene and 16 kb of 5′ flanking region and approximately 15 kb of 3′ flanking region of the pseudogene were sequenced. Sequence data from 48 kb of intergenic and flanking regions of the glucocerebrosidase gene and its pseudogene has been generated. A large number of Alu sequences and several simple repeats have been found. Two of these repeats exhibit fragment length polymorphism. There is almost 100% homology between the 3′ flanking regions of the functional gene and the pseudogene, extending to about 4 kb past the termination codons. A much lower degree of homology is observed in the 5′ flanking region. Patient samples are currently being screened for polymorphisms in these flanking regions.
Jiang, Yong-Hui; Wauki, Kekio; Liu, Qian; Bressler, Jan; Pan, Yanzhen; Kashork, Catherine D; Shaffer, Lisa G; Beaudet, Arthur L
2008-01-28
Prader-Willi syndrome (PWS) is a neurobehavioral disorder characterized by neonatal hypotonia, childhood obesity, dysmorphic features, hypogonadism, mental retardation, and behavioral problems. Although PWS is most often caused by a paternal interstitial deletion of a 6-Mb region of chromosome 15q11-q13, the identity of the exact protein coding or noncoding RNAs whose deficiency produces the PWS phenotype is uncertain. There are also reports describing a PWS-like phenotype in a subset of patients with full mutations in the FMR1 (fragile X mental retardation 1) gene. Taking advantage of the human genome sequence, we have performed extensive sequence analysis and molecular studies for the PWS candidate region. We have characterized transcripts for the first time for two UCSC Genome Browser predicted protein-coding genes, GOLGA8E (golgin subfamily a, 8E) and WHDC1L1 (WAS protein homology region containing 1-like 1) and have further characterized two previously reported genes, CYFIP1 and NIPA2; all four genes are in the region close to the proximal/centromeric deletion breakpoint (BP1). GOLGA8E belongs to the golgin subfamily of coiled-coil proteins associated with the Golgi apparatus. Six out of 16 golgin subfamily proteins in the human genome have been mapped in the chromosome 15q11-q13 and 15q24-q26 regions. We have also identified more than 38 copies of GOLGA8E-like sequence in the 15q11-q14 and 15q23-q26 regions which supports the presence of a GOLGA8E-associated low copy repeat (LCR). Analysis of the 15q11-q13 region by PFGE also revealed a polymorphic region between BP1 and BP2. WHDC1L1 is a novel gene with similarity to mouse Whdc1 (WAS protein homology region 2 domain containing 1) and human JMY protein (junction-mediating and regulatory protein). Expression analysis of cultured human cells and brain tissues from PWS patients indicates that CYFIP1 and NIPA2 are biallelically expressed. 
However, we were not able to determine the allele-specific expression pattern for GOLGA8E and WHDC1L1 because these two genes have highly related sequences that might also be expressed. We have presented an updated version of a sequence-based physical map for a complex chromosomal region, and we raise the possibility of polymorphism in the genomic orientation of the BP1 to BP2 region. The identification of two new proteins GOLGA8E and WHDC1L1 encoded by genes in the 15q11-q13 region may extend our understanding of the molecular basis of PWS. In terms of copy number variation and gene organization, this is one of the most polymorphic regions of the human genome, and perhaps the single most polymorphic region of this type.
Rios, Jonathan J; Perelygin, Andrey A; Long, Maureen T; Lear, Teri L; Zharkikh, Andrey A; Brinton, Margo A; Adelson, David L
2007-01-01
Background The mammalian OAS/RNASEL pathway plays an important role in antiviral host defense. A premature stop-codon within the murine Oas1b gene results in the increased susceptibility of mice to a number of flaviviruses, including West Nile virus (WNV). Mutations in either the OAS1 or RNASEL genes may also modulate the outcome of WNV-induced disease or other viral infections in horses. Polymorphisms in the human OAS gene cluster have been previously utilized for case-control analysis of virus-induced disease in humans. No polymorphisms have yet been identified in either the equine OAS1 or RNASEL genes for use in similar case-control studies. Results Genomic sequence for equine OAS1 was obtained from a contig assembly generated from a shotgun subclone library of CHORI-241 BAC 100I10. Specific amplification of regions of the OAS1 gene from 13 horses of various breeds identified 33 single nucleotide polymorphisms (SNPs) and two microsatellites. RNASEL cDNA sequences were determined for 8 mammals and utilized in a phylogenetic analysis. The chromosomal location of the RNASEL gene was assigned by FISH to ECA5p17-p16 using two selected CHORI-241 BAC clones. The horse genomic RNASEL sequence was assembled. Specific amplification of regions of the RNASEL gene from 13 horses identified 31 SNPs. Conclusion In this report, two dinucleotide microsatellites and 64 single nucleotide polymorphisms within the equine OAS1 and RNASEL genes were identified. These polymorphisms are the first to be reported for these genes and will facilitate future case-control studies of horse susceptibility to infectious diseases. PMID:17822564
Dallas, J F
1988-09-01
A human minisatellite DNA probe detects several restriction fragment length polymorphisms in cultivars of Asian and African rice. Certain fragments appear to be inherited in a Mendelian fashion and may represent unlinked loci. The hybridization patterns appear to be cultivar-specific and largely unchanged after the regeneration of plants from tissue culture. The results suggest that these regions of the rice genome may be used to generate cultivar-specific DNA fingerprints. The demonstration of similarity between a human minisatellite sequence and polymorphic regions in the rice genome suggests that such regions also occur in the genomes of many other plant species.
Contrasting Patterns of rDNA Homogenization within the Zygosaccharomyces rouxii Species Complex
Chand Dakal, Tikam; Giudici, Paolo; Solieri, Lisa
2016-01-01
Arrays of repetitive ribosomal DNA (rDNA) sequences are generally expected to evolve as a coherent family, where repeats within such a family are more similar to each other than to orthologs in related species. The continuous homogenization of repeats within individual genomes is a recombination process termed concerted evolution. Here, we investigated the extent and the direction of concerted evolution in 43 yeast strains of the Zygosaccharomyces rouxii species complex (Z. rouxii, Z. sapae, Z. mellis), by analyzing two portions of the 35S rDNA cistron, namely the D1/D2 domains at the 5’ end of the 26S rRNA gene and the segment including the internal transcribed spacers (ITS) 1 and 2 (ITS regions). We demonstrate that intra-genomic rDNA sequence variation is unusually frequent in this clade and that rDNA arrays in single genomes consist of an intermixing of Z. rouxii, Z. sapae and Z. mellis-like sequences, putatively evolved by reticulate evolutionary events that involved repeated hybridization between lineages. The levels and distribution of sequence polymorphisms vary across rDNA repeats in different individuals, reflecting four patterns of rDNA evolution: I) rDNA repeats that are homogeneous within a genome but are chimeras derived from two parental lineages via recombination: Z. rouxii in the ITS region and Z. sapae in the D1/D2 region; II) intra-genomic rDNA repeats that retain polymorphisms only in ITS regions; III) rDNA repeats that vary only in their D1/D2 domains; IV) heterogeneous rDNA arrays that have both polymorphic ITS and D1/D2 regions. We argue that an ongoing process of homogenization following allodiplodization or incomplete lineage sorting gave rise to divergent evolutionary trajectories in different strains, depending upon temporal, structural and functional constraints. We discuss the consequences of these findings for Zygosaccharomyces species delineation and, more generally, for yeast barcoding. PMID:27501051
Huang, Jie; Li, Yu-Zhi; Du, Lian-Ming; Yang, Bo; Shen, Fu-Jun; Zhang, He-Min; Zhang, Zhi-He; Zhang, Xiu-Yue; Yue, Bi-Song
2015-02-07
The giant panda (Ailuropoda melanoleuca) is a critically endangered species endemic to China. Microsatellites are among the most popular molecular markers and have proven effective for estimating population size, testing paternity, and assessing genetic diversity in this critically endangered species. The availability of the complete giant panda genome sequence provided the opportunity to carry out genome-wide scans for all types of microsatellite markers, which opens the way for the analysis and development of microsatellites in giant panda. By in silico mining of the whole genome sequence, we identified microsatellites in the giant panda genome and analyzed their frequency and distribution in different genomic regions. Based on our search criteria, a repertoire of 855,058 SSRs was detected, with mono-nucleotides being the most abundant. SSRs were found in all genomic regions and were more abundant in non-coding regions than in coding regions. A total of 160 primer pairs were designed to screen for polymorphic microsatellites using the selected tetranucleotide microsatellite sequences. Fifty-one novel polymorphic tetranucleotide microsatellite loci were discovered by genotyping blood DNA from 22 captive giant pandas. Finally, a total of 15 markers, which showed good polymorphism, stability, and repeatability in faecal samples, were used to establish a novel microsatellite marker system for giant panda. A genotyping database for Chengdu captive giant pandas (n = 57) was also set up using this standardized system. In addition, a universal individual identification method was established and genetic diversity was analysed as applications of this marker system. The microsatellite abundance and diversity were characterized in the giant panda genome. A total of 154,677 tetranucleotide microsatellites were identified, and 15 of them were found to be polymorphic and stable loci. 
The individual identification method and the genetic diversity analysis method in this study provided adequate material for the future study of giant panda.
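As a rough illustration of the in silico microsatellite mining described in the abstract above, the following minimal Python sketch scans a DNA string for perfect tandem repeats of 1 to 6 bp motifs. The abstract does not state the search criteria that were used, so the minimum repeat counts in `MIN_REPEATS`, the function name `find_ssrs`, and the restriction to perfect repeats are all assumptions made for this example, not the authors' method.

```python
import re

# Hypothetical minimum repeat counts per motif length (1-6 bp); the abstract
# does not give the actual search criteria.
MIN_REPEATS = {1: 12, 2: 6, 3: 5, 4: 4, 5: 4, 6: 4}

def find_ssrs(seq, min_repeats=MIN_REPEATS):
    """Return sorted (start, motif, repeat_count) tuples for perfect tandem repeats."""
    seq = seq.upper()
    hits = []
    for motif_len, n in sorted(min_repeats.items()):
        # ([ACGT]{k}) captures a motif; \1{n-1,} demands at least n copies in total.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (motif_len, n - 1))
        for m in pattern.finditer(seq):
            motif = m.group(1)
            # Skip motifs that are repeats of a shorter unit (e.g. "AA" or "ATAT"),
            # so each locus is reported only under its shortest motif.
            if any(motif == motif[:k] * (motif_len // k)
                   for k in range(1, motif_len) if motif_len % k == 0):
                continue
            hits.append((m.start(), motif, len(m.group(0)) // motif_len))
    return sorted(hits)

# Toy sequence: a (GATA)6 tetranucleotide repeat and an (A)12 mononucleotide run.
example = "GG" + "GATA" * 6 + "CC" + "A" * 12 + "TT"
print(find_ssrs(example))
```

A genome-scale scan would normally stream chromosome sequences from a FASTA file and also handle imperfect or compound repeats, which this sketch ignores.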
Huang, Zhen; Peng, Gary; Liu, Xunjia; Deora, Abhinandan; Falk, Kevin C.; Gossen, Bruce D.; McDonald, Mary R.; Yu, Fengqun
2017-01-01
Clubroot, caused by Plasmodiophora brassicae, is an important disease of canola (Brassica napus) in western Canada and worldwide. In this study, a clubroot resistance gene (Rcr2) was identified and fine mapped in Chinese cabbage cv. “Jazz” using single-nucleotide polymorphism (SNP) markers identified from bulked segregant RNA sequencing (BSR-Seq), and molecular markers were developed for use in marker-assisted selection. In total, 203.9 million raw reads were generated from one pooled resistant (R) and one pooled susceptible (S) sample, and >173,000 polymorphic SNP sites were identified between the R and S samples. One significant peak was observed between 22 and 26 Mb of chromosome A03, which had been predicted by BSR-Seq to contain the causal gene Rcr2. There were 490 polymorphic SNP sites identified in the region. A segregating population consisting of 675 plants was analyzed with 15 SNP sites in the region using the Kompetitive Allele Specific PCR method, and Rcr2 was fine mapped between two SNP markers, SNP_A03_32 and SNP_A03_67, located 0.1 and 0.3 cM from Rcr2, respectively. Five SNP markers co-segregated with Rcr2 in this region. Variants were identified in 14 of 36 genes annotated in the Rcr2 target region. The numbers of polymorphic variants differed among the genes. Four genes encode TIR-NBS-LRR proteins, and two of them, Bra019410 and Bra019413, had high numbers of polymorphic variants and so are the most likely candidates for Rcr2. PMID:28894454
Feuk, Lars; MacDonald, Jeffrey R; Tang, Terence; Carson, Andrew R; Li, Martin; Rao, Girish; Khaja, Razi; Scherer, Stephen W
2005-10-01
With a draft genome-sequence assembly for the chimpanzee available, it is now possible to perform genome-wide analyses to identify, at a submicroscopic level, structural rearrangements that have occurred between chimpanzees and humans. The goal of this study was to investigate chromosomal regions that are inverted between the chimpanzee and human genomes. Using the net alignments for the builds of the human and chimpanzee genome assemblies, we identified a total of 1,576 putative regions of inverted orientation, covering more than 154 mega-bases of DNA. The DNA segments are distributed throughout the genome and range from 23 base pairs to 62 mega-bases in length. For the 66 inversions more than 25 kilobases (kb) in length, 75% were flanked on one or both sides by (often unrelated) segmental duplications. Using PCR and fluorescence in situ hybridization we experimentally validated 23 of 27 (85%) semi-randomly chosen regions; the largest novel inversion confirmed was 4.3 mega-bases at human Chromosome 7p14. Gorilla was used as an out-group to assign ancestral status to the variants. All experimentally validated inversion regions were then assayed against a panel of human samples and three of the 23 (13%) regions were found to be polymorphic in the human genome. These polymorphic inversions include 730 kb (at 7p22), 13 kb (at 7q11), and 1 kb (at 16q24) fragments with a 5%, 30%, and 48% minor allele frequency, respectively. Our results suggest that inversions are an important source of variation in primate genome evolution. The finding of at least three novel inversion polymorphisms in humans indicates this type of structural variation may be a more common feature of our genome than previously realized.
Hybridization capture reveals evolution and conservation across the entire Koala retrovirus genome.
Tsangaras, Kyriakos; Siracusa, Matthew C; Nikolaidis, Nikolas; Ishida, Yasuko; Cui, Pin; Vielgrader, Hanna; Helgen, Kristofer M; Roca, Alfred L; Greenwood, Alex D
2014-01-01
The koala retrovirus (KoRV) is the only retrovirus known to be in the midst of invading the germ line of its host species. Hybridization capture and next generation sequencing were used on modern and museum DNA samples of koala (Phascolarctos cinereus) to examine ca. 130 years of evolution across the full KoRV genome. Overall, the entire proviral genome appeared to be conserved across time in sequence, protein structure and transcriptional binding sites. A total of 138 polymorphisms were detected, of which 72 were found in more than one individual. At every polymorphic site in the museum koalas, one of the character states matched that of modern KoRV. Among non-synonymous polymorphisms, radical substitutions involving large physiochemical differences between amino acids were elevated in env, potentially reflecting anti-viral immune pressure or avoidance of receptor interference. Polymorphisms were not detected within two functional regions believed to affect infectivity. Host sequences flanking proviral integration sites were also captured; with few proviral loci shared among koalas. Recently described variants of KoRV, designated KoRV-B and KoRV-J, were not detected in museum samples, suggesting that these variants may be of recent origin. PMID:24752422
Human immunoglobulin allotypes
Lefranc, Marie-Paule
2009-01-01
More than twenty recombinant monoclonal antibodies are approved as therapeutics. Almost all of these are based on the whole IgG isotype format, but vary in the origin of the variable regions between mouse (chimeric), humanized mouse and fully human sequences; all of those with whole IgG format employ human constant region sequences. Currently, the opposing merits of the four IgG subclasses are considered with respect to the in vivo biological activities considered to be appropriate to the disease indication being treated. Human heavy chain genes also exhibit extensive structural polymorphism(s) and, being closely linked, are inherited as a haplotype. Polymorphisms (allotypes) within the IgG isotype were originally discovered and described using serological reagents derived from humans; demonstrating that allotypic variants can be immunogenic and provoke antibody responses as a result of allo-immunization. The serologically defined allotypes differ widely within and between population groups; therefore, a mAb of a given allotype will, inevitably, be delivered to a cohort of patients homozygous for the alternative allotype. This publication reviews the serologically defined human IgG allotypes and considers the potential for allotype differences to contribute to or potentiate immunogenicity. PMID:20073133
Whole genome re-sequencing of date palms yields insights into diversification of a fruit tree crop.
Hazzouri, Khaled M; Flowers, Jonathan M; Visser, Hendrik J; Khierallah, Hussam S M; Rosas, Ulises; Pham, Gina M; Meyer, Rachel S; Johansen, Caryn K; Fresquez, Zoë A; Masmoudi, Khaled; Haider, Nadia; El Kadri, Nabila; Idaghdour, Youssef; Malek, Joel A; Thirkhill, Deborah; Markhand, Ghulam S; Krueger, Robert R; Zaid, Abdelouahhab; Purugganan, Michael D
2015-11-09
Date palms (Phoenix dactylifera) are the most significant perennial crop in arid regions of the Middle East and North Africa. Here, we present a comprehensive catalogue of approximately seven million single nucleotide polymorphisms in date palms based on whole genome re-sequencing of a collection of 62 cultivars. Population structure analysis indicates a major genetic divide between North Africa and the Middle East/South Asian date palms, with evidence of admixture in cultivars from Egypt and Sudan. Genome-wide scans for selection suggest at least 56 genomic regions associated with selective sweeps that may underlie geographic adaptation. We report candidate mutations for trait variation, including nonsense polymorphisms and presence/absence variation in gene content in pathways for key agronomic traits. We also identify a copia-like retrotransposon insertion polymorphism in the R2R3 myb-like orthologue of the oil palm virescens gene associated with fruit colour variation. This analysis documents patterns of post-domestication diversification and provides a genomic resource for this economically important perennial tree crop. PMID:26549859
DOE Office of Scientific and Technical Information (OSTI.GOV)
Fennelly, J.; Laval, S.; Wright, E.
1996-04-01
We have identified a genomic locus (DXYH1) that is polymorphic and hypervariable within the CBA/H colony. Using a panel of C57BL/6 x Mus spretus backcross offspring, it was mapped to the distal end of the X chromosome. Pseudoautosomal inheritance was demonstrated through three generations of CBA/H x CBA/H and CBA/H x C57BL/6 crosses and confirmed through linkage to the Sxr locus in X/Y Sxr x 3H1 crosses. Meiotic recombination frequencies place DXYH1 approximately 28% into the pseudoautosomal region from the boundary. The de novo generation of CBA/H variant DXYH1 restriction fragment length polymorphisms during spermatogenesis is suggestive of the germline instability associated with hypermutable human minisatellites. The absence of DXY1-related sequences in Mus spretus provides DNA sequence evidence to support the observed failure of X-Y pairing during meiosis and consequent hybrid infertility in C57BL/6 x Mus spretus male F1 offspring. 19 refs., 4 figs.
Analysis of genetic diversity in pigeon pea germplasm using retrotransposon-based molecular markers.
Maneesha; Upadhyaya, Kailash C
2017-09-01
Pigeon pea (Cajanus cajan), an important legume crop, is predominantly cultivated in tropical and subtropical regions of Asia and Africa. It is normally considered to have a low degree of genetic diversity, an impediment to undertaking crop improvement programmes. We have analysed genetic polymorphism of domesticated pigeon pea germplasm (47 accessions) from across the world using previously characterized panzee retrotransposon-based molecular markers. It was conjectured that since retrotransposons are interspersed throughout the genome, retroelement-based markers would be able to uncover polymorphism possibly inherent in the diversity of retroelement sequences. Two PCR-based techniques, sequence-specific amplified polymorphism (SSAP) and retrotransposon microsatellite amplified polymorphism (REMAP), were utilized for the analyses. We show that a considerable degree of polymorphism could be detected using these techniques. Three primer combinations in SSAP generated 297 amplified products across 47 accessions, with an average of 99 amplicons per assay. The degree of polymorphism varied from 84 to 95%. In the REMAP assays, the number of amplicons was much lower, but up to 73% polymorphism could be detected. On the basis of similarity coefficients, dendrograms were constructed. The results demonstrate that retrotransposon-based markers could serve as a better alternative for the assessment of genetic diversity in crops with an apparently narrow genetic base.
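The band-scoring arithmetic behind figures such as "84 to 95% polymorphism" and the similarity coefficients used for dendrograms can be sketched as follows. This is only an illustrative Python example: the abstract does not say which similarity coefficient was used, so the Jaccard coefficient is an assumption, and the accession names and 0/1 band scores are made up.

```python
def percent_polymorphic(bands):
    """bands maps accession -> list of 0/1 scores, one entry per band position."""
    columns = list(zip(*bands.values()))
    # A band is polymorphic if it is present in some accessions and absent in others.
    polymorphic = sum(1 for col in columns if 0 < sum(col) < len(col))
    return 100.0 * polymorphic / len(columns)

def jaccard(a, b):
    """Jaccard similarity: bands shared by both / bands present in either accession."""
    shared = sum(1 for x, y in zip(a, b) if x and y)
    either = sum(1 for x, y in zip(a, b) if x or y)
    return shared / either if either else 1.0

# Hypothetical presence/absence scores for three accessions at five band positions.
scores = {
    "acc1": [1, 1, 0, 1, 1],
    "acc2": [1, 0, 1, 1, 1],
    "acc3": [1, 1, 1, 0, 1],
}
print(percent_polymorphic(scores))             # 3 of 5 bands vary among accessions
print(jaccard(scores["acc1"], scores["acc2"]))  # 3 shared bands out of 5 present
```

A matrix of pairwise coefficients computed this way is the usual input to a clustering method such as UPGMA when building a dendrogram.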
Robakowska-Hyzorek, Dagmara; Oprzadek, Jolanta; Zelazowska, Beata; Olbromski, Rafał; Zwierzchowski, Lech
2010-06-01
Myogenic factor 5 (Myf5), a product of the Myf5 gene, belongs to the MRF family of basic helix-loop-helix transcription factors that regulate myogenesis. Their roles in muscle growth and development make their genes candidates for molecular markers of meat production in livestock, but nucleotide sequence polymorphism has not been thoroughly studied in MRF genes. We detected four single nucleotide polymorphisms (SNPs) within exon 1 of the Myf5 gene, encoding the NH-terminal transactivation domain of the Myf5 protein. Three of these mutations change the amino acid sequence. The distribution of these SNPs was highly skewed in cattle populations; most of the mutations were found in only a few or even single individuals. Of the nine SNPs found in the promoter region of Myf5, one (transversion g.-723G-->T) was represented by all three genotypes distributed in the cattle populations studied. This polymorphism showed an influence on Myf5 gene expression in the longissimus dorsi muscle and was associated with sirloin weight and fat weight in sirloin in carcasses of Holstein-Friesian cattle.
Brikun, I; Suziedelis, K; Berg, D E
1994-01-01
Derivatives of Escherichia coli K-12 of known ancestry were characterized by random amplified polymorphic DNA (RAPD) fingerprinting to better understand genome evolution in this family of closely related strains. This sensitive method entails PCR amplification with arbitrary primers at low stringency and yields arrays of anonymous DNA fragments that are strain specific. Among 150 fragments scored, eight were polymorphic in that they were produced from some but not all strains. Seven polymorphic bands were chromosomal, and one was from the F-factor plasmid. Five of the six mapped polymorphic chromosomal bands came from just 7% of the genome, a 340-kb segment that includes the terminus of replication. Two of these were from the cryptic Rac prophage, and the inability to amplify them from strains was attributable to deletion (excision) or to rearrangement of Rac. Two other terminus-region segments that resulted in polymorphic bands appeared to have sustained point mutations that affected the ability to amplify them. Control experiments showed that RAPD bands from the 340-kb terminus-region segment and also from two plasmids (P1 and F) were represented in approximate proportion to their size. Optimization experiments showed that the concentration of thermostable polymerase strongly affected the arrays of RAPD products obtained. Comparison of RAPD polymorphisms and positions of strains exhibiting them in the pedigree suggests that many sequence changes occurred in these historic E. coli strains during their storage. We propose that the clustering of such mutations near the terminus reflects errors during completion of chromosome replication, possibly during slow growth in the stab cultures that were often used to store E. coli strains in the early years of bacterial genetics. PMID:8132463
Béraud-Colomb, E; Roubin, R; Martin, J; Maroc, N; Gardeisen, A; Trabuchet, G; Goosséns, M
1995-12-01
Analyzing the nuclear DNA from ancient human bones is an essential step to the understanding of genetic diversity in current populations, provided that such systematic studies are experimentally feasible. This article reports the successful extraction and amplification of nuclear DNA from the beta-globin region from 5 of 10 bone specimens up to 12,000 years old. These have been typed for beta-globin frameworks by sequencing through two variable positions and for a polymorphic (AT)x(T)y microsatellite 500 bp upstream of the beta-globin gene. These specimens of human remains are somewhat older than those analyzed in previous nuclear gene sequencing reports and considerably older than those used to study high-copy-number human mtDNA. These results show that the systematic study of nuclear DNA polymorphisms of ancient populations is feasible.
Screening and Characterization of RAPD Markers in Viscerotropic Leishmania Parasites
Mkada–Driss, Imen; Talbi, Chiraz; Guerbouj, Souheila; Driss, Mehdi; Elamine, Elwaleed M.; Cupolillo, Elisa; Mukhtar, Moawia M.; Guizani, Ikram
2014-01-01
Visceral leishmaniasis (VL) is mainly due to the Leishmania donovani complex. VL is endemic in many countries worldwide including East Africa and the Mediterranean region where the epidemiology is complex. The taxonomy of these pathogens is controversial, but there is a correlation between their genetic diversity and geographical origin. Despite the steady increase in genome knowledge, RAPD is still a useful approach to identify and characterize novel DNA markers. Our aim was to identify and characterize polymorphic DNA markers in VL Leishmania parasites from diverse geographic regions using RAPD in order to constitute a pool of PCR targets having the potential to differentiate among the VL parasites. 100 different oligonucleotide decamers having arbitrary DNA sequences were screened for reproducible amplification and a selection of 28 was used to amplify DNA from 12 L. donovani, L. archibaldi and L. infantum strains having diverse origins. A total of 155 bands were amplified of which 60.65% appeared polymorphic. 7 out of 28 primers provided monomorphic patterns. Phenetic analysis allowed clustering the parasites according to their geographical origin. Differentially amplified bands were selected; among them, 22 RAPD products were successfully cloned and sequenced. Bioinformatic analysis allowed mapping of the markers and analysis of sequences and priming sites. This study was complemented with Southern blotting to confirm assignment of markers to the kDNA. The bioinformatic analysis identified 16 nuclear and 3 minicircle markers. Analysis of these markers highlighted polymorphisms at RAPD priming sites with mainly 5′ end transversions, and the presence of inter- and intra-taxonomic complex sequence and microsatellite variations; a bias in transitions over transversions and indels between the different sequences compared is observed, which is however less marked between L. infantum and L. donovani. 
The study delivers a pool of well-documented polymorphic DNA markers, to develop molecular diagnostics assays to characterize and differentiate VL causing agents. PMID:25313833
USDA-ARS's Scientific Manuscript database
The aneupolyploidy genome of sugarcane (Saccharum hybrids spp.) and lack of a classical genetic linkage map make genetics research most difficult for sugarcane. Whole genome sequencing and genetic characterization of sugarcane and related taxa are far behind other crops. In this study, universal PCR...
Gray, Stanton B; Howard, Timothy D; Langefeld, Carl D; Hawkins, Gregory A; Diallo, Abdoulaye F; Wagner, Janice D
2009-01-01
Tumor necrosis factor is a cytokine that plays critical roles in inflammation, the innate immune response, and a variety of other physiologic and pathophysiologic processes. In addition, TNF has recently been shown to mediate an intersection of chronic, low-grade inflammation and concurrent metabolic dysregulation associated with obesity and its comorbidities. As part of an ongoing initiative to further characterize vervet monkeys originating from St Kitts as an animal model of obesity and inflammation, we sequenced and genotyped the human ortholog vervet TNF gene and approximately 1 kb of the flanking 3′ and 5′ regions from 265 monkeys in a closed, pedigreed colony. This process revealed a total of 11 single-nucleotide polymorphisms (SNPs) and a single 4-bp insertion–deletion, with minor allele frequencies of 0.08 to 0.39. Many of these polymorphisms were in strong or complete linkage disequilibrium with each other, and all but 1 were contained within a single haplotype block, comprising 5 haplotypes with frequencies of 0.075 to 0.298. Using sequences from humans, chimpanzees, vervets, baboons, and rhesus macaques, phylogenetic shadowing of the TNF promoter region revealed that vervet SNPs, like the SNPs in related species, were clustered nonrandomly and nonuniformly around conserved transcription factor binding sites. These data, combined with previously defined heritable phenotypes, permit future association analyses in this nonhuman primate model and have great potential to help dissect the genetic and nongenetic contributions to complex diseases like obesity. More broadly, the sequence data and comparative analyses reported herein facilitate study of the evolution of regulatory sequences of inflammatory and immune-related genes. PMID:20034434
Silva, C; Garcia-Mas, J; Sánchez, A M; Arús, P; Oliveira, M M
2005-03-01
Blooming time is one of the most important agronomic traits in almond. Biochemical and molecular events underlying flowering regulation must be understood before methods to stimulate late flowering can be developed. Attempts to elucidate the genetic control of this process have led to the identification of a major gene (Lb) and quantitative trait loci (QTLs) linked to observed phenotypic differences, but although this gene and these QTLs have been placed on the Prunus reference genetic map, their sequences and specific functions remain unknown. The aim of our investigation was to associate these loci with known genes using a candidate gene approach. Two almond cDNAs and eight Prunus expressed sequence tags were selected as candidate genes (CGs) since their sequences were highly identical to those of flowering regulatory genes characterized in other species. The CGs were amplified from both parental lines of the mapping population using specific primers. Sequence comparison revealed DNA polymorphisms between the parental lines, mainly of the single nucleotide type. Polymorphisms were used to develop co-dominant cleaved amplified polymorphic sequence markers or length polymorphisms based on insertion/deletion events for mapping the candidate genes on the Prunus reference map. Ten candidate genes were assigned to six linkage groups in the Prunus genome. The positions of two of these were compatible with the regions where two QTLs for blooming time were detected. One additional candidate was localized close to the position of the Evergrowing gene, which determines a non-deciduous behaviour in peach.
Bertolini, Francesca; Scimone, Concetta; Geraci, Claudia; Schiavo, Giuseppina; Utzeri, Valerio Joe; Chiofalo, Vincenzo; Fontanesi, Luca
2015-01-01
Few studies have investigated the donkey (Equus asinus) at the whole genome level so far. Here, we sequenced the genome of two male donkeys using a next generation semiconductor based sequencing platform (the Ion Proton sequencer) and compared the obtained sequence information with the available donkey draft genome (and the Illumina reads from which it originated) and with the EquCab2.0 assembly of the horse genome. Moreover, the Ion Torrent Personal Genome Analyzer was used to sequence reduced representation libraries (RRL) obtained from a DNA pool including donkeys of different breeds (Grigio Siciliano, Ragusano and Martina Franca). The number of next generation sequencing reads aligned with the EquCab2.0 horse genome was larger than the number aligned with the draft donkey genome. This was due to the larger N50 for contigs and scaffolds of the horse genome. Nucleotide divergence between E. caballus and E. asinus was estimated to be ~0.52-0.57%. Regions with low nucleotide divergence were identified in several autosomal chromosomes and in the whole chromosome X. These regions might be evolutionarily important in equids. Comparing Y-chromosome regions, we identified variants that could be useful to track donkey paternal lineages. Moreover, about 4.8 million single nucleotide polymorphisms (SNPs) in the donkey genome were identified and annotated by combining sequencing data from Ion Proton (whole genome sequencing) and Ion Torrent (RRL) runs with Illumina reads. A higher density of SNPs was present in regions homologous to horse chromosome 12, in which several studies reported a high frequency of copy number variants. The SNPs we identified constitute a first resource useful to describe variability at the population genomic level in E. asinus and to establish monitoring systems for the conservation of donkey genetic resources. PMID:26151450
Papaceit, Montserrat; Segarra, Carmen; Aguadé, Montserrat
2013-01-01
Drosophila subobscura is a Palearctic species of the obscura group with a rich chromosomal polymorphism. To further our understanding of the origin of inversions and of how they regain variation, we have identified and sequenced the two breakpoints of a polymorphic inversion of D. subobscura--inversion 3 of the O chromosome--in a population sample. The breakpoints could be identified as two rather short fragments (∼300 bp and 60 bp long) with no similarity to any known transposable element family or repetitive sequence. The presence of the ∼300-bp fragment at the two breakpoints of inverted chromosomes implies its duplication, an indication of the inversion origin via staggered double-strand breaks. Present results and previous findings support that the mode of origin of inversions is neither related to the inversion age nor species-group specific. The breakpoint regions do not consistently exhibit the lower level of variation within and stronger genetic differentiation between arrangements than more internal regions that would be expected, even in moderately small inversions, if gene conversion were greatly restricted at inversion breakpoints. Comparison of the proximal breakpoint region in species of the obscura group shows that this breakpoint lies in a small high-turnover fragment within a long collinear region (∼300 kb). © 2012 The Author(s). Evolution © 2012 The Society for the Study of Evolution.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Villard, E.; Soubrier, F.; Tiret, L.
1996-06-01
Plasma angiotensin I-converting enzyme (ACE) levels are highly genetically determined. A previous segregation-linkage analysis suggested the existence of a functional mutation located within or close to the ACE locus, in almost complete linkage disequilibrium (LD) with the ACE insertion/deletion (I/D) polymorphism and accounting for half the ACE variance. In order to identify the functional variant at the molecular level, we compared ACE gene sequences between four subjects selected for having contrasted ACE levels and I/D genotypes. We identified 10 new polymorphisms, among which 8 were genotyped in 95 healthy nuclear families, in addition to the I/D polymorphism. These polymorphisms could be divided into two groups: five polymorphisms in the 5′ region and three in the coding sequence and the 3′ UTR. Within each group, polymorphisms were in nearly complete association, whereas polymorphisms from the two groups were in strong negative LD. After adjustment for the I/D polymorphism, all polymorphisms of the 5′ group remained significantly associated with ACE levels, which suggests the existence of two quantitative trait loci (QTL) acting additively on ACE levels. Segregation-linkage analyses including one or two ACE-linked QTLs in LD with two ACE markers were performed to test this hypothesis. The two QTLs and the two markers were assumed to be in complete LD. Results supported the existence of two ACE-linked QTLs, which would explain 38% and 49% of the ACE variance in parents and offspring, respectively. One of these QTLs might be the I/D polymorphism itself or the newly characterized 4656(CT)2/3 polymorphism. The second QTL would have a frequency of approximately 0.20, which is incompatible with any of the yet-identified polymorphisms. More extensive sequencing and extended analyses in larger samples and in other populations will be necessary to characterize definitely the functional variants. 30 refs., 1 fig., 6 tabs.
2014-01-01
Background Central core disease is a congenital myopathy, characterized by presence of central core-like areas in muscle fibers. Patients have mild or moderate weakness, hypotonia and motor developmental delay. The disease is caused by mutations in the human ryanodine receptor gene (RYR1), which encodes a calcium-release channel. Since the RYR1 gene is huge, containing 106 exons, mutation screening has been limited to three ‘hot spots’, with particular attention to the C-terminal region. Recent next- generation sequencing methods are now identifying multiple numbers of variants in patients, in which interpretation and phenotype prevision is difficult. Case presentation In a Brazilian Caucasian family, clinical, histopathological and molecular analysis identified a new case of central core disease in a 48-year female. Sanger sequencing of the C-terminal region of the RYR1 gene identified two different missense mutations: c.14256 A > C polymorphism in exon 98 and c.14693 T > C in exon 102, which have already been described as pathogenic. Trans-position of the 2 mutations was confirmed because patient’s daughter, mother and sister carried only the exon 98’s mutation, a synonymous variant that was subsequently found in the frequency of 013–0,05 of alleles. Further next generation sequencing study of the whole RYR1 gene in the patient revealed the presence of additional 5 common silent polymorphisms in homozygosis and 8 polymorphisms in heterozygosis. Conclusions Considering that patient’s relatives showed no pathologic phenotype, and the phenotype presented by the patient is within the range observed in other central core disease patients with the same mutation, it was concluded that the c.14256 A > C polymorphism alone is not responsible for disease, and the associated additional silent polymorphisms are not acting as modifiers of the primary pathogenic mutation in the affected patient. 
The case described above illustrates the present reality where new methods for wide genome screening are becoming more accessible and able to identify a great variety of mutations and polymorphisms of unknown function in patients and their families. PMID:25084811
Ocan, Moses; Bwanga, Freddie; Okeng, Alfred; Katabazi, Fred; Kigozi, Edgar; Kyobe, Samuel; Ogwal-Okeng, Jasper; Obua, Celestino
2016-08-19
In the absence of an effective vaccine, malaria treatment and eradication is still a challenge in most endemic areas globally. This is especially the case with the current reported emergence of resistance to artemisinin agents in Southeast Asia. This study therefore explored the prevalence of K13-propeller gene polymorphisms among Plasmodium falciparum parasites in northern Uganda. Adult patients (≥18 years) presenting to the out-patient departments of Lira and Gulu regional referral hospitals in northern Uganda were randomly recruited. Laboratory investigation for presence of plasmodium infection among patients was done using a Plasmodium falciparum exclusive rapid diagnostic test, histidine rich protein-2 (HRP2) (Pf). Finger prick capillary blood from patients with a positive malaria test was spotted on Whatman no. 903 filter paper. The parasite DNA was extracted using the chelex resin method and sequenced for mutations in the K13-propeller gene using Sanger sequencing. PCR DNA sequence products were analyzed using DnaSP 5.10.01 software; data were further processed in an Excel 2007 spreadsheet. A total of 60 parasite DNA samples were sequenced. Polymorphisms in the K13-propeller gene were detected in four (4) of the 60 parasite DNA samples sequenced. A non-synonymous polymorphism at codon 533 previously detected in Cambodia was found in the parasite DNA samples analyzed. Polymorphisms at codon 522 (non-synonymous) and codon 509 (synonymous) were also found in the samples analyzed. The study found evidence of positive selection in the Plasmodium falciparum population in northern Uganda (Tajima's D = -1.83205; Fu and Li's D = -1.82458). A polymorphism in the K13-propeller gene previously reported in Cambodia has been found in the Ugandan Plasmodium falciparum parasites. There is a need for continuous surveillance for artemisinin resistance gene markers in the country.
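For readers unfamiliar with the Tajima's D statistic quoted in this abstract, the following is a minimal sketch of the standard calculation from an alignment. It assumes gap-free sequences of equal length; the function name `tajimas_d` and the toy alignments are hypothetical illustrations, not the study's data or the DnaSP implementation.

```python
from itertools import combinations
from math import sqrt


def tajimas_d(seqs):
    """Tajima's D for equal-length, gap-free aligned sequences (illustrative)."""
    n = len(seqs)
    if n < 4:
        raise ValueError("need at least four sequences")
    # S: number of segregating (polymorphic) alignment columns
    s = sum(1 for column in zip(*seqs) if len(set(column)) > 1)
    if s == 0:
        raise ValueError("no segregating sites; D is undefined")
    # pi: mean number of pairwise differences between sequences
    pairs = list(combinations(seqs, 2))
    pi = sum(sum(a != b for a, b in zip(x, y)) for x, y in pairs) / len(pairs)
    # Constants from Tajima's variance formula
    a1 = sum(1 / i for i in range(1, n))
    a2 = sum(1 / i ** 2 for i in range(1, n))
    b1 = (n + 1) / (3 * (n - 1))
    b2 = 2 * (n ** 2 + n + 3) / (9 * n * (n - 1))
    c1 = b1 - 1 / a1
    c2 = b2 - (n + 2) / (a1 * n) + a2 / a1 ** 2
    e1 = c1 / a1
    e2 = c2 / (a1 ** 2 + a2)
    return (pi - s / a1) / sqrt(e1 * s + e2 * s * (s - 1))


# Hypothetical toy alignments: only singletons vs. evenly split variants
rare_variants = ["TAAA", "ATAA", "AATA", "AAAT", "AAAA", "AAAA"]
balanced_variants = ["AAAA"] * 3 + ["TTTT"] * 3
print(tajimas_d(rare_variants), tajimas_d(balanced_variants))
```

An excess of rare variants, as in the first toy alignment, pushes D below zero, which is the direction of the value reported above; variants at intermediate frequency push it above zero.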
DOE Office of Scientific and Technical Information (OSTI.GOV)
Tay, J.S.H.; Liu, Y.; Low, P.S.
A length polymorphism at the 5′ untranslated region of exon 1 and an RFLP (Dde I) in intron 5 (nt 160) of the ATIII gene were amplified by polymerase chain reaction with primers of published sequences. DNA fragments were size-fractionated by agarose gel electrophoresis (3% NuSieve and 1% Seakem GTG) and photographed over a UV transilluminator. A strong linkage disequilibrium was observed between these two polymorphisms of the ATIII gene in the Chinese (χ² = 63.7; Δ 0.42, P < 0.001). The estimated frequencies of the three haplotypes were found to be 0.37 for SD+, 0.40 for LD+ and 0.23 for LD-.
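The haplotype frequencies in this abstract are enough to work through the usual pairwise linkage disequilibrium measures. The sketch below assumes that S/L denote the two length alleles and D+/D- the presence or absence of the Dde I site, and that the unreported SD- haplotype has frequency 0 (the three reported frequencies sum to 1.00); the function name `ld_from_haplotypes` is hypothetical.

```python
from math import sqrt


def ld_from_haplotypes(p_ab, p_aB, p_Ab, p_AB):
    """Return (D, D', r) for two biallelic loci from four haplotype frequencies.

    Arguments are the frequencies of haplotypes a-b, a-B, A-b and A-B,
    where a/A are the alleles at locus 1 and b/B the alleles at locus 2.
    """
    total = p_ab + p_aB + p_Ab + p_AB
    if abs(total - 1.0) > 1e-6:
        raise ValueError("haplotype frequencies must sum to 1")
    p_a = p_ab + p_aB          # allele a frequency at locus 1
    p_b = p_ab + p_Ab          # allele b frequency at locus 2
    d = p_ab - p_a * p_b       # raw disequilibrium coefficient
    if d >= 0:
        d_max = min(p_a * (1 - p_b), (1 - p_a) * p_b)
    else:
        d_max = min(p_a * p_b, (1 - p_a) * (1 - p_b))
    d_prime = d / d_max if d_max > 0 else 0.0
    r = d / sqrt(p_a * (1 - p_a) * p_b * (1 - p_b))
    return d, d_prime, r


# Locus 1: S (a) vs L (A); locus 2: D+ (b) vs D- (B). SD- assumed absent.
d, d_prime, r = ld_from_haplotypes(p_ab=0.37, p_aB=0.0, p_Ab=0.40, p_AB=0.23)
print(round(d, 4), round(d_prime, 2), round(r, 2))
```

Under these assumptions the correlation coefficient r comes out near 0.42, in line with the Δ value reported in the abstract, while D' is at its maximum because one of the four possible haplotypes is missing.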
Judith D. Toms; Lori S. Eggert; Wayne J. Arendt; John Faaborg
2012-01-01
While testing genetic sexing techniques in Ovenbirds (Seiurus aurocapilla), we found a genetic polymorphism in the ATP5A1 gene in 38% of individuals. The Z' allele included changes in both intronic and exonic portions of the sequenced region, but there was no evidence that this changed the resulting ATP synthase product. Males that had one or more copies of...
A polymorphism in the bovine gamma-S-crystallin gene revealed by allele-specific amplification.
Kemp, S J; Maillard, J C; Teale, A J
1993-04-01
A polymorphism was detected in the 3' untranslated region of the bovine gamma-S-crystallin gene by direct sequencing of polymerase chain reaction (PCR) products from genomic DNA of an N'Dama bull and a Boran cow. A set of three PCR primers was designed to detect this difference and thus give allele-specific amplification. The two allele-specific primers differ in length by 20 nucleotides so that the allelic products may be distinguished by simple agarose gel electrophoresis following a single PCR reaction. This provides a simple and rapid assay for this polymorphism.
Costantino, Angela; Spada, Enea; Equestre, Michele; Bruni, Roberto; Tritarelli, Elena; Coppola, Nicola; Sagnelli, Caterina; Sagnelli, Evangelista; Ciccaglione, Anna Rita
2015-11-14
The detection of baseline resistance mutations to new direct-acting antivirals (DAAs) in HCV chronically infected treatment-naïve patients could be important for their management and outcome prevision. In this study, we investigated the presence of mutations, which have been previously reported to be associated with resistance to DAAs in HCV polymerase (NS5B) and HCV protease (NS3) regions, in sera of treatment-naïve patients. HCV RNA from 152 naïve patients (84 % Italian and 16 % immigrants from various countries) infected with different HCV genotypes (21,1a; 21, 1b; 2, 2a; 60, 2c; 22, 3a; 25, 4d and 1, 4k) was evaluated for sequence analysis. Amplification and sequencing of fragments in the NS5B (nt 8256-8640) and NS3 (nt 3420-3960) regions of HCV genome were carried out for 152 and 28 patients, respectively. The polymorphism C316N/H in NS5B region, associated with resistance to sofosbuvir, was detected in 9 of the 21 (43 %) analysed sequences from genotype 1b-infected patients. Naturally occurring mutations V36L, and M175L in the NS3 protease region were observed in 100 % of patients infected with subtype 2c and 4. A relevant proportion of treatment naïve genotype 1b infected patients evaluated in this study harboured N316 polymorphism and might poorly respond to sofosbuvir treatment. As sofosbuvir has been approved for treatment of HCV chronic infection in USA and Europe including Italy, pre-treatment testing for N316 polymorphism on genotype 1b naïve patients should be considered for this drug.
Jin, Tianbo; Yang, Hua; Zhang, Jiayi; Yunus, Zulfiya; Sun, Qiang; Geng, Tingting; Chen, Chao; Yang, Jie
2015-01-01
Purpose: Genetic polymorphisms in CYP3A4 can change its activity to a certain degree, thus leading to differences among populations in drug efficacy or adverse drug reactions. Methods: To validate the genetic polymorphisms in CYP3A4 in the Uygur Chinese population, we sequenced and screened for genetic variants in the 5′UTR, promoter, exons, introns, and 3′UTR of the whole CYP3A4 gene in 100 unrelated, healthy individuals. Results: Twenty-one genetic polymorphisms were detected in CYP3A4, nine of which were novel. We detected CYP3A4*8, a putative poor-metabolizer allele, at a frequency of 0.5% in the Uygur population. Tfsitescan revealed that transcription factor density varied among the promoter regions, some of which were key regions for transcription factor binding. Conclusion: Our results provide basic information about CYP3A4 alleles in the Uygur population and suggest that the enzymatic activities of CYP3A4 may differ among the diverse ethnic populations of China. PMID:26261601
DOE Office of Scientific and Technical Information (OSTI.GOV)
Davies, K.E.; Morrison, K.E.; Daniels, R.I.
1994-09-01
We previously reported that the 400 kb interval flanked by the polymorphic loci D5S435 and D5S557 contains blocks of a chromosome 5 specific repeat. This interval also defines the SMA candidate region by genetic analysis of recombinant families. A YAC contig of 2-3 Mb encompassing this area has been constructed and a 5.5 kb conserved fragment, isolated from a YAC end clone within the above interval, was used to obtain cDNAs from both fetal and adult brain libraries. We describe the identification of cDNAs with stretches of high DNA sequence homology to exons of β glucuronidase on human chromosome 7. The cDNAs map both to the candidate region and to an area of 5p using FISH and deletion hybrid analysis. Hybridization to bacteriophage and cosmid clones from the YACs localizes the β glucuronidase related sequences within the 400 kb region of the YAC contig. The cDNAs show a polymorphic pattern on hybridization to genomic BamH1 fragments in the size range of 10-250 kb. Further analysis with YAC fragmentation vectors is being used to determine how these β glucuronidase related cDNAs are distributed within 5q13. Dinucleotide repeats within the region are being investigated to determine linkage disequilibrium with the disease locus.
Mercenaro, Luca; Nieddu, Giovanni; Porceddu, Andrea; Pezzotti, Mario; Camiolo, Salvatore
2017-01-01
The genetic diversity among grapevine (Vitis vinifera L.) cultivars that underlies differences in agronomic performance and wine quality reflects the accumulation of single nucleotide polymorphisms (SNPs) and small indels as well as larger genomic variations. A combination of high throughput sequencing and mapping against the grapevine reference genome allows the creation of comprehensive sequence variation maps. We used next generation sequencing and bioinformatics to generate an inventory of SNPs and small indels in four widely cultivated Sardinian grape cultivars (Bovale sardo, Cannonau, Carignano and Vermentino). More than 3,200,000 SNPs were identified with high statistical confidence. Some of the SNPs caused the appearance of premature stop codons and thus identified putative pseudogenes. The analysis of SNP distribution along chromosomes led to the identification of large genomic regions with uninterrupted series of homozygous SNPs. We used a digital comparative genomic hybridization approach to identify 6526 genomic regions with significant differences in copy number among the four cultivars compared to the reference sequence, including 81 regions shared between all four cultivars and 4953 specific to single cultivars (representing 1.2 and 75.9% of total copy number variation, respectively). Reads mapping at a distance that was not compatible with the insert size were used to identify a dataset of putative large deletions with cultivar Cannonau revealing the highest number. The analysis of genes mapping to these regions provided a list of candidates that may explain some of the phenotypic differences among the Bovale sardo, Cannonau, Carignano and Vermentino cultivars. PMID:28775732
Ginther, C; Corach, D; Penacino, G A; Rey, J A; Carnese, F R; Hutz, M H; Anderson, A; Just, J; Salzano, F M; King, M C
1993-01-01
DNA samples from 60 Mapuche Indians, representing 39 maternal lineages, were genetically characterized for (1) nucleotide sequences of the mtDNA control region; (2) presence or absence of a nine base duplication in mtDNA region V; (3) HLA loci DRB1 and DQA1; (4) variation at three nuclear genes with short tandem repeats; and (5) variation at the polymorphic marker D2S44. The genetic profile of the Mapuche population was compared to other Amerinds and to worldwide populations. Two highly polymorphic portions of the mtDNA control region, comprising 650 nucleotides, were amplified by the polymerase chain reaction (PCR) and directly sequenced. The 39 maternal lineages were defined by two or three generation families identified by the Mapuches. These 39 lineages included 19 different mtDNA sequences that could be grouped into four classes. The same classes of sequences appear in other Amerinds from North, Central, and South American populations separated by thousands of miles, suggesting that the origin of the mtDNA patterns predates the migration to the Americas. The mtDNA sequence similarity between Amerind populations suggests that the migration throughout the Americas occurred rapidly relative to the mtDNA mutation rate. HLA DRB1 alleles 1602 and 1402 were frequent among the Mapuches. These alleles also occur at high frequency among other Amerinds in North and South America, but not among Spanish, Chinese or African-American populations. The high frequency of these alleles throughout the Americas, and their specificity to the Americas, supports the hypothesis that Mapuches and other Amerind groups are closely related.(ABSTRACT TRUNCATED AT 250 WORDS)
Scar markers in a longleaf pine x slash pine F1 family
C. Weng; Thomas L. Kubisiak; M. Stine
1998-01-01
Sequence characterized amplified region (SCAR) markers were derived from random amplified polymorphic DNAs (RAPDs) that segregate in a longleaf pine x slash pine F1 family. Nine RAPD fragments, five from longleaf pine and four from slash pine, were cloned and end sequenced. A total of 13 SCAR primer pairs, with lengths between 17 and 24...
Polymorphic human somatostatin gene is located on chromosome 3.
Naylor, S L; Sakaguchi, A Y; Shen, L P; Bell, G I; Rutter, W J; Shows, T B
1983-01-01
Somatostatin is a 14-amino-acid neuropeptide and hormone that inhibits the secretion of several peptide hormones. The human gene for somatostatin SST has been cloned, and the sequence has been determined. This clone was used as a probe in chromosome mapping studies to detect the human somatostatin sequence in human-rodent hybrids. Southern blot analysis of 41 hybrids, including some containing translocations of human chromosomes, placed SST in the q21→qter region of chromosome 3. Human DNAs from unrelated individuals were screened for restriction fragment polymorphisms detectable by the somatostatin gene probe. Two polymorphisms were found: (i) an EcoRI variant located at the 3' end of the gene, found in Caucasian, U.S. Black, and Asian populations with a frequency of approximately 0.10 and (ii) a BamHI variant in the intron, which occurs in Caucasians at a frequency of 0.13. PMID:6133281
NASA Astrophysics Data System (ADS)
Benito, S.; Ferrer, A.; Benabou, S.; Aviñó, A.; Eritja, R.; Gargallo, R.
2018-05-01
Guanine-rich sequences may fold into highly ordered structures known as G-quadruplexes. Apart from the monomeric G-quadruplex, these sequences may form multimeric structures that are not usually considered when studying interaction with ligands. This work studies the interaction of a ligand, crystal violet, with three guanine-rich DNA sequences with the capacity to form multimeric structures. These sequences correspond to short stretches found near the promoter regions of c-kit and SMARCA4 genes. Instrumental techniques (circular dichroism, molecular fluorescence, size-exclusion chromatography and electrospray ionization mass spectrometry) and multivariate data analysis were used for this purpose. The polymorphism of G-quadruplexes was characterized prior to the interaction studies. The ligand was shown to interact preferentially with the monomeric G-quadruplex; the binding stoichiometry was 1:1 and the binding constant was on the order of 10⁵ M⁻¹ for all three sequences. The results highlight the importance of DNA treatment prior to interaction studies.
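To give a feel for what a 1:1 stoichiometry with an association constant near 10^5 per molar implies, the sketch below solves the mass-action equilibrium exactly for the complex concentration. The concentrations used are hypothetical, chosen only for illustration, and the function name `complex_concentration` is not taken from the study.

```python
from math import sqrt


def complex_concentration(dna_total, ligand_total, k_assoc):
    """Equilibrium concentration (M) of a 1:1 DNA-ligand complex.

    Solves K = [C] / (([DNA]t - [C]) * ([L]t - [C])) for [C], taking the
    physically meaningful (smaller) root of the resulting quadratic.
    """
    kd = 1.0 / k_assoc
    s = dna_total + ligand_total + kd
    return (s - sqrt(s * s - 4.0 * dna_total * ligand_total)) / 2.0


# Hypothetical conditions: 10 uM DNA, 10 uM ligand, K = 1e5 M^-1
dna, ligand, k = 1e-5, 1e-5, 1e5
bound = complex_concentration(dna, ligand, k)
print(f"fraction of DNA bound: {bound / dna:.3f}")
```

With equimolar DNA and ligand at a concentration equal to the dissociation constant, well under half of the DNA is bound, which illustrates why binding of this strength is typically characterized by titration over a range of ligand concentrations.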
Suyama, Yoshihisa; Matsuki, Yu
2015-01-01
Restriction-enzyme (RE)-based next-generation sequencing methods have revolutionized marker-assisted genetic studies; however, the use of REs has limited their widespread adoption, especially in field samples with low-quality DNA and/or small quantities of DNA. Here, we developed a PCR-based procedure to construct reduced representation libraries without RE digestion steps, representing de novo single-nucleotide polymorphism discovery, and its genotyping using next-generation sequencing. Using multiplexed inter-simple sequence repeat (ISSR) primers, thousands of genome-wide regions were amplified effectively from a wide variety of genomes, without prior genetic information. We demonstrated: 1) Mendelian gametic segregation of the discovered variants; 2) reproducibility of genotyping by checking its applicability for individual identification; and 3) applicability in a wide variety of species by checking standard population genetic analysis. This approach, called multiplexed ISSR genotyping by sequencing, should be applicable to many marker-assisted genetic studies with a wide range of DNA qualities and quantities. PMID:26593239
Search for methylation-sensitive amplification polymorphisms in mutant figs.
Rodrigues, M G F; Martins, A B G; Bertoni, B W; Figueira, A; Giuliatti, S
2013-07-08
Fig (Ficus carica) breeding programs that use conventional approaches to develop new cultivars are rare, owing to limited genetic variability and the difficulty in obtaining plants via gamete fusion. Cytosine methylation in plants leads to gene repression, thereby affecting transcription without changing the DNA sequence. Previous studies using random amplification of polymorphic DNA and amplified fragment length polymorphism markers revealed no polymorphisms among select fig mutants that originated from gamma-irradiated buds. Therefore, we conducted methylation-sensitive amplified polymorphism analysis to verify the existence of variability due to epigenetic DNA methylation among these mutant selections compared to the main cultivar 'Roxo-de-Valinhos'. Samples of genomic DNA were double-digested with either HpaII (methylation sensitive) or MspI (methylation insensitive) and with EcoRI. Fourteen primer combinations were tested, and on average, non-methylated CCGG, symmetrically methylated CmCGG, and hemimethylated hmCCGG sites accounted for 87.9, 10.1, and 2.0%, respectively. MSAP analysis was effective in detecting differentially methylated sites in the genomic DNA of fig mutants, and methylation may be responsible for the phenotypic variation between treatments. Further analyses such as polymorphic DNA sequencing are necessary to validate these differences, standardize the regions of methylation, and analyze reads using bioinformatic tools.
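The site percentages in this abstract come from comparing band presence in the EcoRI/HpaII and EcoRI/MspI digests. The sketch below uses a commonly applied MSAP scoring scheme, which is an assumption here since the abstract does not spell out its scoring rules; the function names and the example band profiles are hypothetical.

```python
from collections import Counter


def classify_site(hpaii_band, mspi_band):
    """Classify one CCGG site from band presence in the two digests.

    Assumed conventional scoring (not taken from the abstract):
      band in both digests     -> non-methylated CCGG
      band with MspI only      -> internal cytosine methylated (CmCGG)
      band with HpaII only     -> hemimethylated (hmCCGG)
      band in neither digest   -> uninformative
    """
    if hpaii_band and mspi_band:
        return "non-methylated"
    if mspi_band:
        return "internal-C methylated"
    if hpaii_band:
        return "hemimethylated"
    return "uninformative"


def methylation_summary(profiles):
    """Percentage of each class among informative sites."""
    counts = Counter(classify_site(h, m) for h, m in profiles)
    counts.pop("uninformative", None)
    informative = sum(counts.values())
    return {name: 100.0 * k / informative for name, k in counts.items()}


# Hypothetical (HpaII, MspI) band scores for eleven fragments
example = [(1, 1)] * 8 + [(0, 1), (1, 0), (0, 0)]
print(methylation_summary(example))
```

Sites with no band in either digest are left out of the denominator in this sketch, because absence of a band cannot be distinguished from absence of the fragment itself.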
Doorduin, Leonie; Gravendeel, Barbara; Lammers, Youri; Ariyurek, Yavuz; Chin-A-Woeng, Thomas; Vrieling, Klaas
2011-01-01
Invasive individuals from the pest species Jacobaea vulgaris show different allocation patterns in defence and growth compared with native individuals. To examine if these changes are caused by fast evolution, it is necessary to identify native source populations and compare these with invasive populations. For this purpose, we need intraspecific polymorphic markers. We therefore sequenced the complete chloroplast genomes of 12 native and 5 invasive individuals of J. vulgaris with next generation sequencing and discovered single-nucleotide polymorphisms (SNPs) and microsatellites. This is the first study in which the chloroplast genome of that many individuals within a single species was sequenced. Thirty-two SNPs and 34 microsatellite regions were found. No differences between the inverted repeats were found in any individual. Furthermore, being the first chloroplast genome sequenced in the Senecioneae clade, we compared it with four other members of the Asteraceae family to identify new regions for phylogenetic inference within this clade and also within the Asteraceae family. Five markers (ndhC-trnV, ndhC-atpE, rps18-rpl20, clpP and psbM-trnD) contained parsimony-informative characters higher than 2%. Finally, we compared two procedures of preparing chloroplast DNA for next generation sequencing. PMID:21444340
Abraham, Paul E; Wang, Xiaojing; Ranjan, Priya; Nookaew, Intawat; Zhang, Bing; Tuskan, Gerald A; Hettich, Robert L
2015-12-04
Next-generation sequencing has transformed the ability to link genotypes to phenotypes and facilitates the dissection of genetic contribution to complex traits. However, it is challenging to link genetic variants with the perturbed functional effects on proteins encoded by such genes. Here we show how RNA sequencing can be exploited to construct genotype-specific protein sequence databases to assess natural variation in proteins, providing information about the molecular toolbox driving cellular processes. For this study, we used two natural genotypes selected from a recent genome-wide association study of Populus trichocarpa, an obligate outcrosser with tremendous phenotypic variation across the natural population. This strategy allowed us to comprehensively catalogue proteins containing single amino acid polymorphisms (SAAPs), as well as insertions and deletions. We profiled the frequency of 128 types of naturally occurring amino acid substitutions, including both expected (neutral) and unexpected (non-neutral) SAAPs, with a subset occurring in regions of the genome having strong polymorphism patterns consistent with recent positive and/or divergent selection. By zeroing in on the molecular signatures of these important regions that might have previously been uncharacterized, we now provide a high-resolution molecular inventory that should improve accessibility and subsequent identification of natural protein variants in future genotype-to-phenotype studies.
Schiavo, Giuseppina; Hoffmann, Orsolya Ivett; Ribani, Anisa; Utzeri, Valerio Joe; Ghionda, Marco Ciro; Bertolini, Francesca; Geraci, Claudia; Bovo, Samuele; Fontanesi, Luca
2017-10-01
Nuclear DNA sequences of mitochondrial origin (numts) are derived by insertion of mitochondrial DNA (mtDNA) into the nuclear genome. In this study, we provide, for the first time, a genome picture of numts inserted in the pig nuclear genome. The Sus scrofa reference nuclear genome (Sscrofa10.2) was aligned with circularized and consensus mtDNA sequences using LAST software. A total of 430 numt sequences that may represent 246 different numt integration events (57 numt regions determined by at least two numt sequences and 189 singletons) were identified, covering about 0.0078% of the nuclear genome. Numt integration events were correlated (0.99) to the chromosome length. The longest numt sequence (about 11 kbp) was located on SSC2. Six numts were sequenced and PCR amplified in pigs of European commercial and local pig breeds, of the Chinese Meishan breed and in European wild boars. Three of them were polymorphic for the presence or absence of the insertion. Surprisingly, the estimated age of insertion of two of the three polymorphic numts was older than the speciation time of Sus scrofa, suggesting that these polymorphic sites originated from interspecies admixture that contributed to shaping the pig genome. © The Author 2017. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
Harpke, Doerte; Peterson, Angela
2008-05-01
The internal transcribed spacer (ITS) region (ITS1, 5.8S rDNA, ITS2) represents the most widely applied nuclear marker in eukaryotic phylogenetics. Although this region has been assumed to evolve in concert, the number of investigations revealing high degrees of intra-individual polymorphism connected with the presence of pseudogenes has risen. The 5.8S rDNA is the most important diagnostic marker for functionality of the ITS region. In Mammillaria, intra-individual 5.8S rDNA polymorphisms of up to 36% and up to nine different types have been found. Twenty-eight of 30 cloned genomic Mammillaria sequences were identified as putative pseudogenes. For the identification of pseudogenic ITS regions, in addition to formal tests based on substitution rates, we attempted to focus on functional features of the 5.8S rDNA (5.8S motif, secondary structure). The importance of functional data for the identification of pseudogenes is outlined and discussed. The identification of pseudogenes is essential, because they may cause erroneous phylogenies and taxonomic problems.
Ayesh, Basim M
2017-01-01
Molecular markers are credible for the discrimination of genotypes and estimation of the extent of genetic diversity and relatedness in a set of genotypes. Inter-simple sequence repeat (ISSR) markers rapidly reveal highly polymorphic fingerprints and have been used frequently to determine the genetic diversity among date palm cultivars. This chapter describes the application of ISSR markers for genotyping of date palm cultivars. The application involves extraction of genomic DNA from the target cultivars with reliable quality and quantity. Subsequently the extracted DNA serves as a template for amplification of genomic regions flanked by inverted simple sequence repeats using a single primer. The similarity of each pair of samples is measured by calculating the number of mono- and polymorphic bands revealed by gel electrophoresis. Matrices constructed for similarity and genetic distance are used to build a phylogenetic tree and cluster analysis, to determine the molecular relatedness of cultivars. In the protocol described, 3 out of 9 tested primers consistently amplified 31 loci in 6 date palm cultivars, of which 28 loci were polymorphic.
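The pairwise similarity step described above can be sketched from a binary band matrix. The abstract does not name the similarity coefficient, so the Jaccard coefficient used here is an assumption, and the cultivar labels and band profiles are hypothetical.

```python
def jaccard_similarity(profile_a, profile_b):
    """Shared bands divided by bands present in at least one sample."""
    shared = sum(1 for a, b in zip(profile_a, profile_b) if a and b)
    either = sum(1 for a, b in zip(profile_a, profile_b) if a or b)
    # Two profiles without any band are treated as identical (assumption).
    return shared / either if either else 1.0


def similarity_matrix(profiles):
    """Nested dict of pairwise similarities, keyed by sample name."""
    names = list(profiles)
    return {
        p: {q: jaccard_similarity(profiles[p], profiles[q]) for q in names}
        for p in names
    }


# Hypothetical band scores (1 = band present) at four ISSR loci.
toy_profiles = {
    "cultivar_A": [1, 1, 0, 1],
    "cultivar_B": [1, 0, 0, 1],
    "cultivar_C": [0, 1, 1, 0],
}
matrix = similarity_matrix(toy_profiles)
print(matrix["cultivar_A"]["cultivar_B"])
```

A genetic distance matrix for tree building could then be taken as one minus each similarity, which is a common but not universal choice.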
DOE Office of Scientific and Technical Information (OSTI.GOV)
Oksenberg, J.R.; Cavalli-Sforza, L.L.; Steinman, L.
1989-02-01
Polymorphic markers in genes encoding the α chain of the human T-cell receptor (TcR) have been detected by Southern blot analysis in Pss I digests. Polymorphic bands were observed at 6.3 and 2.0 kilobases (kb) with frequencies of 0.30 and 0.44, respectively, in the general population. Using the polymerase chain reaction (PCR) method, the authors amplified selected sequences derived from the full-length TcR α cDNA probe. These PCR products were used as specific probes to demonstrate that the 6.3-kb polymorphic fragment hybridizes to the variable (V)-region probe and the 2.0-kb fragment hybridizes to the constant (C)-region probe. Segregation of the polymorphic bands was analyzed in family studies. To look for associations between these markers and autoimmune diseases, the authors have studied the restriction fragment length polymorphism distribution of the Pss I markers in patients with multiple sclerosis, myasthenia gravis, and Graves disease. Significant differences in the frequency of the polymorphic Vα and Cα markers were identified between patients and healthy individuals.
Polukonova, N V; Karmokov, M Kh; Shaternikov, A N
2015-02-01
The karyotype of Camptochironomus pallidivittatus Edwards, 1929 (Diptera, Chironomidae) from five populations of the Lower Volga region and Central Caucasus (the northern macroslope) has been studied. In populations of C. pallidivittatus from the Central Caucasus, 11 banding sequences (BS) were found; one sequence, pal B10, was new to the species. In the Saratov population, 11 BS were also found, three of which were new to the species: pal A3, pal B11, and pal B12. The banding sequences detected for the first time have not yet been found in other parts of the range of this species and may be endemic to these regions. In the studied populations of C. pallidivittatus, banding sequences were found that were nonstandard but fixed in the karyotype. This is indicative of some degree of chromosomal divergence. These banding sequences include pal A2.2 in arm A and pal B10.10 in arm B in the Central Caucasus region, as well as pal B2.2 and pal G2.2 in the Lower Volga region. Arms A, B, D, and G in the Central Caucasian populations and A, B, and D in the Saratov oblast were polymorphic. The composition of heterozygous sequences between populations from different regions coincided only in arm D (pal D1.2). In arms A and B, the set of heterozygous BS was different: pal A1.2 and pal B1.10 sequences were found in the Central Caucasian populations, and pal A1.3 and pal B11.12 were found in Saratov oblast. The number of genotypic combinations of C. pallidivittatus was higher in the Central Caucasus region, whereas the number of zygotic combinations was higher in the Saratov population. The percentage of heterozygous larvae in the Central Caucasian populations varied from 20 to 80, whereas all individuals in the Saratov population had heterozygous inversions. Zygotic combinations of larvae in all the studied populations were different.
High levels of diversity characterize mandrill (Mandrillus sphinx) Mhc-DRB sequences.
Abbott, Kristin M; Wickings, E Jean; Knapp, Leslie A
2006-08-01
The major histocompatibility complex (MHC) is highly polymorphic in most primate species studied thus far. The rhesus macaque (Macaca mulatta) has been studied extensively and the Mhc-DRB region demonstrates variability similar to humans. The extent of MHC diversity is relatively unknown for other Old World monkeys (OWM), especially among genera other than Macaca. A molecular survey of the Mhc-DRB region in mandrills (Mandrillus sphinx) revealed extensive variability, suggesting that other OWMs may also possess high levels of Mhc-DRB polymorphism. In the present study, 33 Mhc-DRB loci were identified from only 13 animals. Eleven were wild-born and presumed to be unrelated and two were captive-born twins. Two to seven different sequences were identified for each individual, suggesting that some mandrills may have as many as four Mhc-DRB loci on a single haplotype. From these sequences, representatives of at least six Mhc-DRB loci or lineages were identified. As observed in other primates, some new lineages may have arisen through the process of gene conversion. These findings indicate that mandrills have Mhc-DRB diversity not unlike rhesus macaques and humans.
Jain, Varsha; Patel, Brijesh; Umar, Farhat Paul; Ajithakumar, H. M.; Gurjar, Suraj K.; Gupta, I. D.; Verma, Archana
2017-01-01
Aim: This study was conducted with the objective of identifying single nucleotide polymorphisms (SNPs) in the protein phosphatase 1 regulatory subunit 11 (PPP1R11) gene in Murrah bulls. Materials and Methods: Genomic DNA was isolated by the phenol–chloroform extraction method from the frozen semen samples of 65 Murrah bulls maintained at Artificial Breeding Research Centre, ICAR-National Dairy Research Institute, Karnal. The quality and concentration of DNA were checked by spectrophotometer reading and agarose gel electrophoresis. The target region of the PPP1R11 gene was amplified using four sets of primers designed based on the Bos taurus reference sequence. The amplified products were sequenced and aligned using Clustal Omega for identification of SNPs. Animals were genotyped by polymerase chain reaction-restriction fragment length polymorphism (PCR-RFLP) using EcoNI restriction enzyme. Results: The sequences in the NCBI accession number NW_005785016.1 for Bubalus bubalis were compared and aligned with the edited sequences of Murrah bulls with Clustal Omega software. A total of 10 SNPs were found, of which 1 was in the 5’UTR, 3 in intron 1, and 6 in intron 2. PCR-RFLP using restriction enzyme EcoNI revealed only the AA genotype, indicating monomorphism in the PPP1R11 gene of all Murrah animals included in the study. Conclusion: A total of 10 SNPs were found. PCR-RFLP revealed only the AA genotype, indicating monomorphism in the PPP1R11 gene of all Murrah animals included in the study, due to which association analysis with conception rate was not feasible. PMID:28344410
Isozyme, ISSR and RAPD profiling of genotypes in marvel grass (Dichanthium annulatum).
Saxena, Raghvendra; Chandra, Amaresh
2010-11-01
Genetic analysis of 30 accessions of marvel grass (Dichanthium annulatum Forsk.), a tropical range grass collected from grasslands and open fields of drier regions, was carried out with the objectives of identifying unique materials that could be used in developing the core germplasm for such regions as well as to explore gene(s) for drought tolerance. Five inter-simple sequence repeat (ISSR) primers [(CA)4, (AGAC), (GACA)4], 27 random amplified polymorphic DNA (RAPD) primers and four enzyme systems were employed in the present study. In total, ISSR yielded 61 (52 polymorphic), RAPD 269 (253 polymorphic) and isozymes 55 (44 polymorphic) bands. The average polymorphic information content (PIC) and marker index (MI) across all polymorphic bands of the 3 marker systems ranged from 0.419 to 0.480 and 4.34 to 5.25, respectively. Dendrogram analysis revealed three main clusters with all three markers. Four enzymes, namely esterase (EST), polyphenoloxidase (PPO), peroxidase (PRX) and superoxide dismutase (SOD), revealed 55 alleles from a total of 16 enzyme-coding loci. Of these, 14 loci and 44 alleles were polymorphic. The mean number of alleles per locus was 3.43. Mean heterozygosity observed among the polymorphic loci ranged from 0.406 (SOD) to 0.836 (EST) and accession wise from 0.679 (1G3108) to 0.743 (IGKMD-10). Though a few accessions from one agro-climatic region intermixed with those of another, accessions largely grouped by their regions of collection. Bootstrap analysis at 1000 iterations also showed large numbers of nodes (11 to 17) having strong clustering (> 50 bootstrap values) in all three marker systems. The accessions of the arid and drier regions forming one cluster are assigned as a distinct core collection of Dichanthium and can be targeted for isolation of gene(s) for drought tolerance. Variations in isozyme allele numbers and high PIC (0.48) and MI (4.98) as observed with ISSR markers indicated their usefulness for germplasm characterization.
Sacco, James; Ruplin, Andrew; Skonieczny, Paul; Ohman, Michael
2017-01-01
In humans, reduced activity of the enzyme monoamine oxidase type A (MAOA) due to genetic polymorphisms within the MAOA gene leads to increased brain neurotransmitter levels associated with aggression. In order to study MAOA genetic diversity in dogs, we designed a preliminary study whose objectives were to identify novel alleles in functionally important regions of the canine MAOA gene, and to investigate whether the frequencies of these polymorphisms varied between five broad breed groups (ancient, herding, mastiff, modern European, and mountain). Fifty dogs representing these five breed groups were sequenced. A total of eleven polymorphisms were found. Seven were single nucleotide polymorphisms (SNPs; two exonic, two intronic and three in the promoter), while four were repeat intronic variations. The most polymorphic loci were repeat regions in introns 1, 2 (7 alleles) and 10 (3 alleles), while the exonic and the promoter regions were highly conserved. Comparison of the allele frequencies of certain microsatellite polymorphisms among the breed groups indicated a decreasing or increasing trend in the number of repeats at different microsatellite loci, as well as the highest genetic diversity for the ancient breeds and the lowest for the most recent mountain breeds, perhaps attributable to canine domestication and recent breed formation. While a specific promoter SNP (-212A > G) is rare in the dog, it is the major allele in wolves. Replacement of this ancestral allele in domestic dogs may lead to the deletion of heat shock factor binding sites on the MAOA promoter. Dogs exhibit significant variation in certain intronic regions of the MAOA gene, while the coding and promoter regions are well-conserved. Distinct genetic differences were observed between breed groups. Further studies are now required to establish whether such polymorphisms are associated in any way with MAOA level and canine behaviour including aggression.
Genetic diversity of Plasmodium vivax revealed by the merozoite surface protein-1 icb5-6 fragment.
Ruan, Wei; Zhang, Ling-Ling; Feng, Yan; Zhang, Xuan; Chen, Hua-Liang; Lu, Qiao-Yi; Yao, Li-Nong; Hu, Wei
2017-06-05
Plasmodium vivax remains a potential cause of morbidity and mortality for people living in its endemic areas. Understanding the genetic diversity of P. vivax from different regions is valuable for studying population dynamics and tracing the origins of parasites. The PvMSP-1 gene is highly polymorphic and has been used as a marker in many P. vivax population studies. The aim of this study was to investigate the genetic diversity of the PvMSP-1 gene icb5-6 fragment and to provide more genetic polymorphism data for further studies on P. vivax population structure and tracking of the origin of clinical cases. Nested PCR and sequencing of the PvMSP-1 icb5-6 marker were performed to obtain the nucleotide sequences of 95 P. vivax isolates collected from Zhejiang province, China. To investigate the genetic diversity of PvMSP-1, the 95 nucleotide sequences of the PvMSP-1 icb5-6 fragment were genotyped and analyzed using DnaSP v5 and MEGA software. The 95 P. vivax isolates collected from Zhejiang province were either indigenous cases or imported cases from different regions around the world. A total of 95 sequences ranging from 390 to 460 bp were obtained. The 95 sequences were genotyped into four allele-types (Sal I, Belem, R-III and R-IV) and 17 unique haplotypes. R-III and Sal I were the predominant allele-types. The haplotype diversity (Hd) and nucleotide diversity (Pi) were estimated to be 0.729 and 0.062, indicating that the PvMSP-1 icb5-6 fragment had a high level of polymorphism due to frequent recombination processes and single nucleotide polymorphism. The values of dN/dS and Tajima's D both suggested neutral selection for the PvMSP-1 icb5-6 fragment. In addition, a rare recombinant style of R-IV type was identified. This study presented high genetic diversity in the PvMSP-1 marker among P. vivax strains from around the world. The genetic data is valuable for expanding the polymorphism information on P. vivax, which could be helpful for further study on population dynamics and tracking the origin of P. vivax.
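The two summary statistics reported above, Hd and Pi, have standard definitions (Nei's haplotype diversity and the mean number of pairwise differences per site). The sketch below is a minimal illustration on hypothetical toy sequences; it assumes an equal-length, gap-free alignment, whereas DnaSP applies its own handling of gaps and missing data.

```python
from collections import Counter
from itertools import combinations


def haplotype_diversity(seqs):
    """Nei's Hd = n / (n - 1) * (1 - sum of squared haplotype frequencies)."""
    n = len(seqs)
    if n < 2:
        raise ValueError("at least two sequences are required")
    freqs = [count / n for count in Counter(seqs).values()]
    return n / (n - 1) * (1.0 - sum(f * f for f in freqs))


def nucleotide_diversity(seqs):
    """Pi = mean pairwise differences per site (gap-free alignment assumed)."""
    length = len(seqs[0])
    if len(seqs) < 2 or any(len(s) != length for s in seqs):
        raise ValueError("need two or more aligned sequences of equal length")
    pairs = list(combinations(seqs, 2))
    diffs = sum(sum(a != b for a, b in zip(s, t)) for s, t in pairs)
    return diffs / (len(pairs) * length)


# Hypothetical toy alignment: four sequences, three distinct haplotypes.
toy = ["ACGT", "ACGT", "ACGA", "TCGA"]
print(round(haplotype_diversity(toy), 4), round(nucleotide_diversity(toy), 4))
```

Because the real icb5-6 sequences range from 390 to 460 bp, they contain length variation that this sketch does not model.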
Giovannetti, Elisa; Ugrasena, Dewa G; Supriyadi, Eddy; Vroling, Laura; Azzarello, Antonino; de Lange, Desiree; Peters, Godefridus J; Veerman, Anjo J P; Cloos, Jacqueline
2008-01-01
Genetic variations in the polymorphic tandem repeat sequence of the enhancer region of the thymidylate synthase promoter (TSER), as well as in methylenetetrahydrofolate reductase (MTHFR) C677T polymorphism, influence methotrexate sensitivity. We studied these polymorphisms in children with acute lymphoblastic leukaemia (ALL) and in subjects without malignancy in Indonesia and Holland. The frequencies of TT and CT genotypes were two-fold higher in Dutch children. The TSER 3R/3R repeat was three-fold more frequent in the Indonesian children, while the 2R/2R repeat was only 1% compared to 21% in the Dutch children. No differences of these polymorphisms were found between ALL cells and normal blood cells, indicating an ethnic rather than leukemic origin. These results may have implications for treatment of Indonesian children with ALL.
Heatley, Susan L.; Pietra, Gabriella; Lin, Jie; Widjaja, Jacqueline M. L.; Harpur, Christopher M.; Lester, Sue; Rossjohn, Jamie; Szer, Jeff; Schwarer, Anthony; Bradstock, Kenneth; Bardy, Peter G.; Mingari, Maria Cristina; Moretta, Lorenzo; Sullivan, Lucy C.; Brooks, Andrew G.
2013-01-01
Natural killer (NK) cell recognition of the nonclassical human leukocyte antigen (HLA) molecule HLA-E is dependent on the presentation of a nonamer peptide derived from the leader sequence of other HLA molecules to CD94-NKG2 receptors. However, human cytomegalovirus can manipulate this central innate interaction through the provision of a “mimic” of the HLA-encoded peptide derived from the immunomodulatory glycoprotein UL40. Here, we analyzed UL40 sequences isolated from 32 hematopoietic stem cell transplantation recipients experiencing cytomegalovirus reactivation. The UL40 protein showed a “polymorphic hot spot” within the region that encodes the HLA leader sequence mimic. Although all sequences that were identical to those encoded within HLA-I genes permitted the interaction between HLA-E and CD94-NKG2 receptors, other UL40 polymorphisms reduced the affinity of the interaction between HLA-E and CD94-NKG2 receptors. Furthermore, functional studies using NK cell clones expressing either the inhibitory receptor CD94-NKG2A or the activating receptor CD94-NKG2C identified UL40-encoded peptides that were capable of inhibiting target cell lysis via interaction with CD94-NKG2A, yet had little capacity to activate NK cells through CD94-NKG2C. The data suggest that UL40 polymorphisms may aid evasion of NK cell immunosurveillance by modulating the affinity of the interaction with CD94-NKG2 receptors. PMID:23335510
Heatley, Susan L; Pietra, Gabriella; Lin, Jie; Widjaja, Jacqueline M L; Harpur, Christopher M; Lester, Sue; Rossjohn, Jamie; Szer, Jeff; Schwarer, Anthony; Bradstock, Kenneth; Bardy, Peter G; Mingari, Maria Cristina; Moretta, Lorenzo; Sullivan, Lucy C; Brooks, Andrew G
2013-03-22
Natural killer (NK) cell recognition of the nonclassical human leukocyte antigen (HLA) molecule HLA-E is dependent on the presentation of a nonamer peptide derived from the leader sequence of other HLA molecules to CD94-NKG2 receptors. However, human cytomegalovirus can manipulate this central innate interaction through the provision of a "mimic" of the HLA-encoded peptide derived from the immunomodulatory glycoprotein UL40. Here, we analyzed UL40 sequences isolated from 32 hematopoietic stem cell transplantation recipients experiencing cytomegalovirus reactivation. The UL40 protein showed a "polymorphic hot spot" within the region that encodes the HLA leader sequence mimic. Although all sequences that were identical to those encoded within HLA-I genes permitted the interaction between HLA-E and CD94-NKG2 receptors, other UL40 polymorphisms reduced the affinity of the interaction between HLA-E and CD94-NKG2 receptors. Furthermore, functional studies using NK cell clones expressing either the inhibitory receptor CD94-NKG2A or the activating receptor CD94-NKG2C identified UL40-encoded peptides that were capable of inhibiting target cell lysis via interaction with CD94-NKG2A, yet had little capacity to activate NK cells through CD94-NKG2C. The data suggest that UL40 polymorphisms may aid evasion of NK cell immunosurveillance by modulating the affinity of the interaction with CD94-NKG2 receptors.
Multiple origins of resistance-conferring mutations in Plasmodium vivax dihydrofolate reductase
Hawkins, Vivian N; Auliff, Alyson; Prajapati, Surendra Kumar; Rungsihirunrat, Kanchana; Hapuarachchi, Hapuarachchige C; Maestre, Amanda; O'Neil, Michael T; Cheng, Qin; Joshi, Hema; Na-Bangchang, Kesara; Sibley, Carol Hopkins
2008-01-01
Background: In order to maximize the useful therapeutic life of antimalarial drugs, it is crucial to understand the mechanisms by which parasites resistant to antimalarial drugs are selected and spread in natural populations. Recent work has demonstrated that pyrimethamine-resistance conferring mutations in Plasmodium falciparum dihydrofolate reductase (dhfr) have arisen rarely de novo, but spread widely in Asia and Africa. The origin and spread of mutations in Plasmodium vivax dhfr were assessed by constructing haplotypes based on sequencing dhfr and its flanking regions. Methods: The P. vivax dhfr coding region, 792 bp upstream and 683 bp downstream were amplified and sequenced from 137 contemporary patient isolates from Colombia, India, Indonesia, Papua New Guinea, Sri Lanka, Thailand, and Vanuatu. A repeat motif located 2.6 kb upstream of dhfr was also sequenced from 75 of 137 patient isolates, and mutational relationships among the haplotypes were visualized using the programme Network. Results: Synonymous and non-synonymous single nucleotide polymorphisms (SNPs) within the dhfr coding region were identified, as was the well-documented in-frame insertion/deletion (indel). SNPs were also identified upstream and downstream of dhfr, with an indel and a highly polymorphic repeat region identified upstream of dhfr. The regions flanking dhfr were highly variable. The double mutant (58R/117N) dhfr allele has evolved from several origins, because the 58R is encoded by at least 3 different codons. The triple (58R/61M/117T) and quadruple (57L/61M/117T/173F, 57I/58R/61M/117T and 57L/58R/61M/117T) mutant alleles had at least three independent origins in Thailand, Indonesia, and Papua New Guinea/Vanuatu. Conclusion: It was found that the P. vivax dhfr coding region and its flanking intergenic regions are highly polymorphic and that mutations in P. vivax dhfr that confer antifolate resistance have arisen several times in the Asian region. 
This contrasts sharply with the selective sweep of rare antifolate resistant alleles observed in the P. falciparum populations in Asia and Africa. The finding of multiple origins of resistance-conferring mutations has important implications for drug policy. PMID:18442404
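The inference that 58R arose more than once rests on counting the distinct codons that encode arginine at that position. The sketch below illustrates that counting step only; the function names and the two-codon toy alleles are hypothetical, and the six arginine codons are those of the standard genetic code.

```python
# The six codons that encode arginine (R) in the standard genetic code.
ARG_CODONS = frozenset({"CGT", "CGC", "CGA", "CGG", "AGA", "AGG"})


def codons_at(coding_seqs, position):
    """Set of codons observed at a 1-based amino acid position."""
    start = 3 * (position - 1)
    return {seq[start:start + 3].upper() for seq in coding_seqs}


def distinct_arg_codons(coding_seqs, position):
    """Distinct arginine codons observed at the given position."""
    return codons_at(coding_seqs, position) & ARG_CODONS


# Hypothetical toy alleles, two codons long; position 2 stands in for
# the residue of interest. These are not real dhfr sequences.
toy_alleles = ["ATGAGC", "ATGAGA", "ATGAGG", "ATGCGC", "ATGAGA"]
print(sorted(distinct_arg_codons(toy_alleles, 2)))
```

Each distinct arginine codon at the site is consistent with a separate mutational event, which is the reasoning the abstract applies to the 58R/117N allele; flanking haplotypes are still needed to confirm independent origins.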
Multiple origins of resistance-conferring mutations in Plasmodium vivax dihydrofolate reductase.
Hawkins, Vivian N; Auliff, Alyson; Prajapati, Surendra Kumar; Rungsihirunrat, Kanchana; Hapuarachchi, Hapuarachchige C; Maestre, Amanda; O'Neil, Michael T; Cheng, Qin; Joshi, Hema; Na-Bangchang, Kesara; Sibley, Carol Hopkins
2008-04-28
In order to maximize the useful therapeutic life of antimalarial drugs, it is crucial to understand the mechanisms by which parasites resistant to antimalarial drugs are selected and spread in natural populations. Recent work has demonstrated that pyrimethamine-resistance conferring mutations in Plasmodium falciparum dihydrofolate reductase (dhfr) have arisen rarely de novo, but spread widely in Asia and Africa. The origin and spread of mutations in Plasmodium vivax dhfr were assessed by constructing haplotypes based on sequencing dhfr and its flanking regions. The P. vivax dhfr coding region, 792 bp upstream and 683 bp downstream were amplified and sequenced from 137 contemporary patient isolates from Colombia, India, Indonesia, Papua New Guinea, Sri Lanka, Thailand, and Vanuatu. A repeat motif located 2.6 kb upstream of dhfr was also sequenced from 75 of 137 patient isolates, and mutational relationships among the haplotypes were visualized using the programme Network. Synonymous and non-synonymous single nucleotide polymorphisms (SNPs) within the dhfr coding region were identified, as was the well-documented in-frame insertion/deletion (indel). SNPs were also identified upstream and downstream of dhfr, with an indel and a highly polymorphic repeat region identified upstream of dhfr. The regions flanking dhfr were highly variable. The double mutant (58R/117N) dhfr allele has evolved from several origins, because the 58R is encoded by at least 3 different codons. The triple (58R/61M/117T) and quadruple (57L/61M/117T/173F, 57I/58R/61M/117T and 57L/58R/61M/117T) mutant alleles had at least three independent origins in Thailand, Indonesia, and Papua New Guinea/Vanuatu. It was found that the P. vivax dhfr coding region and its flanking intergenic regions are highly polymorphic and that mutations in P. vivax dhfr that confer antifolate resistance have arisen several times in the Asian region. 
This contrasts sharply with the selective sweep of rare antifolate resistant alleles observed in the P. falciparum populations in Asia and Africa. The finding of multiple origins of resistance-conferring mutations has important implications for drug policy.
Puerma, Eva; Orengo, Dorcas J; Salguero, David; Papaceit, Montserrat; Segarra, Carmen; Aguadé, Montserrat
2014-09-01
Inversions are an integral part of structural variation within species, and they play a leading role in genome reorganization across species. Work at both the cytological and genome sequence levels has revealed heterogeneity in the distribution of inversion breakpoints, with some regions being recurrently used. Breakpoint reuse at the molecular level has mostly been assessed for fixed inversions through genome sequence comparison, and therefore rather broadly. Here, we have identified and sequenced the breakpoints of two polymorphic inversions (E1 and E2, which share a breakpoint) in the extant Est and E1 + 2 chromosomal arrangements of Drosophila subobscura. The breakpoints are two medium-sized repeated motifs that mediated the inversions by two different mechanisms: E1 via staggered breaks and subsequent repair and E2 via repeat-mediated ectopic recombination. The fine delimitation of the shared breakpoint revealed its strict reuse at the molecular level regardless of which was the intermediate arrangement. The occurrence of other rearrangements in the most proximal and distal extended breakpoint regions reveals the broad reuse of these regions. This differential degree of fragility might be related to their sharing the presence outside the inverted region of snoRNA-encoding genes. © The Author 2014. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Oh, Chang Seok; Lee, Soong Deok; Kim, Yi-Suk; Shin, Dong Hoon
2015-01-01
A previous study showed that East Asian mtDNA haplogroups, especially those of Koreans, could be successfully assigned by the coupled use of analyses on coding region SNP markers and control region mutation motifs. In this study, we tested whether the same triple multiplex analysis for coding region SNPs could also be applied to ancient samples from East Asia as a complement to sequence analysis of the mtDNA control region. From the study of Joseon skeleton samples, we found that the mtDNA haplogroup determined by coding region SNP markers falls within the same haplogroup that sequence analysis of the control region assigns. Considering that control region mtDNA sequencing of ancient samples in previous studies produced a considerable number of errors, coding region SNP analysis can be used as a good complement to conventional haplogroup determination, especially for archaeological human bone samples buried underground over long periods. PMID:26345190
Dutta, Sutapa; Kumawat, Giriraj; Singh, Bikram P; Gupta, Deepak K; Singh, Sangeeta; Dogra, Vivek; Gaikwad, Kishor; Sharma, Tilak R; Raje, Ranjeet S; Bandhopadhya, Tapas K; Datta, Subhojit; Singh, Mahendra N; Bashasab, Fakrudin; Kulwal, Pawan; Wanjari, K B; K Varshney, Rajeev; Cook, Douglas R; Singh, Nagendra K
2011-01-20
Pigeonpea [Cajanus cajan (L.) Millspaugh], one of the most important food legumes of semi-arid tropical and subtropical regions, has limited genomic resources, particularly expressed sequence based (genic) markers. We report a comprehensive set of validated genic simple sequence repeat (SSR) markers using deep transcriptome sequencing, and its application in genetic diversity analysis and mapping. In this study, 43,324 transcriptome shotgun assembly unigene contigs were assembled from 1.696 million 454 GS-FLX sequence reads of separate pooled cDNA libraries prepared from leaf, root, stem and immature seed of two pigeonpea varieties, Asha and UPAS 120. A total of 3,771 genic-SSR loci, excluding homopolymeric and compound repeats, were identified; of which 2,877 PCR primer pairs were designed for marker development. Dinucleotide was the most common repeat motif with a frequency of 60.41%, followed by tri- (34.52%), hexa- (2.62%), tetra- (1.67%) and pentanucleotide (0.76%) repeat motifs. Primers were synthesized and tested for 772 of these loci with repeat lengths of ≥ 18 bp. Of these, 550 markers were validated for consistent amplification in eight diverse pigeonpea varieties; 71 were found to be polymorphic on agarose gel electrophoresis. Genetic diversity analysis was done on 22 pigeonpea varieties and eight wild species using 20 highly polymorphic genic-SSR markers. The number of alleles at these loci ranged from 4-10 and the polymorphism information content values ranged from 0.46 to 0.72. Neighbor-joining dendrogram showed distinct separation of the different groups of pigeonpea cultivars and wild species. Deep transcriptome sequencing of the two parental lines helped in silico identification of polymorphic genic-SSR loci to facilitate the rapid development of an intra-species reference genetic map, a subset of which was validated for expected allelic segregation in the reference mapping population. 
We developed 550 validated genic-SSR markers in pigeonpea using deep transcriptome sequencing. From these, 20 highly polymorphic markers were used to evaluate the genetic relationship among species of the genus Cajanus. A comprehensive set of genic-SSR markers was developed as an important genomic resource for diversity analysis and genetic mapping in pigeonpea.
2011-01-01
Background: Pigeonpea [Cajanus cajan (L.) Millspaugh], one of the most important food legumes of semi-arid tropical and subtropical regions, has limited genomic resources, particularly expressed sequence based (genic) markers. We report a comprehensive set of validated genic simple sequence repeat (SSR) markers using deep transcriptome sequencing, and its application in genetic diversity analysis and mapping. Results: In this study, 43,324 transcriptome shotgun assembly unigene contigs were assembled from 1.696 million 454 GS-FLX sequence reads of separate pooled cDNA libraries prepared from leaf, root, stem and immature seed of two pigeonpea varieties, Asha and UPAS 120. A total of 3,771 genic-SSR loci, excluding homopolymeric and compound repeats, were identified; of which 2,877 PCR primer pairs were designed for marker development. Dinucleotide was the most common repeat motif with a frequency of 60.41%, followed by tri- (34.52%), hexa- (2.62%), tetra- (1.67%) and pentanucleotide (0.76%) repeat motifs. Primers were synthesized and tested for 772 of these loci with repeat lengths of ≥18 bp. Of these, 550 markers were validated for consistent amplification in eight diverse pigeonpea varieties; 71 were found to be polymorphic on agarose gel electrophoresis. Genetic diversity analysis was done on 22 pigeonpea varieties and eight wild species using 20 highly polymorphic genic-SSR markers. The number of alleles at these loci ranged from 4-10 and the polymorphism information content values ranged from 0.46 to 0.72. Neighbor-joining dendrogram showed distinct separation of the different groups of pigeonpea cultivars and wild species. Deep transcriptome sequencing of the two parental lines helped in silico identification of polymorphic genic-SSR loci to facilitate the rapid development of an intra-species reference genetic map, a subset of which was validated for expected allelic segregation in the reference mapping population.
Conclusion: We developed 550 validated genic-SSR markers in pigeonpea using deep transcriptome sequencing. From these, 20 highly polymorphic markers were used to evaluate the genetic relationship among species of the genus Cajanus. A comprehensive set of genic-SSR markers was developed as an important genomic resource for diversity analysis and genetic mapping in pigeonpea. PMID:21251263
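The polymorphism information content (PIC) values quoted in the abstract above can be illustrated with a minimal sketch. It assumes the widely used Botstein-style formula, PIC = 1 - Σ p_i² - Σ_{i<j} 2 p_i² p_j², which the abstract does not state explicitly; the function name `pic` and the example allele frequencies are hypothetical and are not taken from the study.

```python
from itertools import combinations


def pic(allele_freqs):
    """Polymorphism information content for one locus.

    Assumes the Botstein-style formula:
        PIC = 1 - sum(p_i^2) - sum_{i<j} 2 * p_i^2 * p_j^2
    `allele_freqs` is a sequence of allele frequencies that sum to 1.
    """
    if abs(sum(allele_freqs) - 1.0) > 1e-6:
        raise ValueError("allele frequencies must sum to 1")
    # Probability that two randomly drawn alleles are identical.
    homozygosity = sum(p * p for p in allele_freqs)
    # Correction for matings in which parental origin cannot be resolved.
    correction = sum(
        2 * (p * p) * (q * q) for p, q in combinations(allele_freqs, 2)
    )
    return 1.0 - homozygosity - correction


if __name__ == "__main__":
    # Hypothetical locus with four equally frequent alleles.
    print(round(pic([0.25, 0.25, 0.25, 0.25]), 4))
```

Under this formula, a locus with four equally frequent alleles scores about 0.70, which sits inside the 0.46 to 0.72 range reported for loci with 4 to 10 alleles; the match is illustrative only.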
Identification of common, unique and polymorphic microsatellites among 73 cyanobacterial genomes.
Kabra, Ritika; Kapil, Aditi; Attarwala, Kherunnisa; Rai, Piyush Kant; Shanker, Asheesh
2016-04-01
Microsatellites, also known as Simple Sequence Repeats, are short tandem repeats of 1-6 nucleotides. These repeats are found in coding as well as non-coding regions of both prokaryotic and eukaryotic genomes and play a significant role in the study of gene regulation, genetic mapping, DNA fingerprinting and evolutionary studies. The availability of 73 complete genome sequences of cyanobacteria enabled us to mine and statistically analyze microsatellites in these genomes. The cyanobacterial microsatellites identified through bioinformatics analysis were stored in a user-friendly database named CyanoSat, which is an efficient data representation and query system designed using ASP.net. The information in CyanoSat comprises perfect, imperfect and compound microsatellites found in coding, non-coding and coding-non-coding regions. Moreover, it contains PCR primers with 200-nucleotide-long flanking regions. The mined cyanobacterial microsatellites can be freely accessed at www.compubio.in/CyanoSat/home.aspx. In addition, 82 polymorphic, 13,866 unique and 2390 common microsatellites were detected. These microsatellites will be useful in strain identification and genetic diversity studies of cyanobacteria.
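As a rough illustration of what mining perfect microsatellites involves, the sketch below scans a DNA string for tandem repeats of 1-6 nucleotide motifs using a regular expression with a backreference. It is not the pipeline behind CyanoSat; the function name `find_ssrs`, the `min_repeats` threshold and the example sequence are assumptions made for this example, and imperfect or compound repeats are not handled.

```python
import re


def find_ssrs(sequence, min_repeats=4, max_motif_len=6):
    """Return (start, motif, repeat_count) for perfect tandem repeats.

    The motif is matched lazily so that the shortest repeating unit is
    reported (e.g. 'AC' rather than 'ACAC'). Positions are 0-based.
    """
    sequence = sequence.upper()
    pattern = re.compile(
        r"([ACGT]{1,%d}?)\1{%d,}" % (max_motif_len, min_repeats - 1)
    )
    hits = []
    for match in pattern.finditer(sequence):
        motif = match.group(1)
        repeat_count = len(match.group(0)) // len(motif)
        hits.append((match.start(), motif, repeat_count))
    return hits


if __name__ == "__main__":
    # Hypothetical sequence containing one (AC)5 dinucleotide repeat.
    for start, motif, count in find_ssrs("TTACACACACACGG"):
        print(start, motif, count)
```

In practice the minimum repeat count is usually set per motif length (longer runs required for mononucleotides than for hexanucleotides); a single threshold is used here only to keep the sketch short.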
spa typing for epidemiological surveillance of Staphylococcus aureus.
Hallin, Marie; Friedrich, Alexander W; Struelens, Marc J
2009-01-01
The spa typing method is based on sequencing of the polymorphic X region of the protein A gene (spa), present in all strains of Staphylococcus aureus. The X region is constituted of a variable number of 24-bp repeats flanked by well-conserved regions. This single-locus sequence-based typing method combines a number of technical advantages, such as rapidity, reproducibility, and portability. Moreover, due to its repeat structure, the spa locus simultaneously indexes micro- and macrovariations, enabling the use of spa typing in both local and global epidemiological studies. These studies are facilitated by the establishment of standardized spa type nomenclature and Internet shared databases.
Korchagin, V I; Badaeva, T N; Tokarskaya, O N; Martirosyan, I A; Darevsky, I S; Ryskov, A P
2007-05-01
Populations of parthenogenetic lizards of the genus Darevskia consist of genetically identical animals, and represent a unique model for studying the molecular mechanisms underlying the variability and evolution of hypervariable DNA repeats. As unisexual lineages, parthenogenetic lizards are characterized by some level of genetic diversity at microsatellite loci. We cloned and sequenced a number of (GATA)n microsatellite loci of Darevskia unisexualis. PCR products from these loci were also sequenced and the degree of intraspecific polymorphism was assessed. Among the five (GATA)n loci analysed, two (Du215 and Du281) were polymorphic. Cross-species analysis of Du215 and Du281 indicates that the priming sites at the D. unisexualis loci are conserved in the bisexual parental species, D. raddei and D. valentini. Sequencing the PCR products amplified from Du215 and Du281 and from monomorphic Du323 showed that allelic differences at the polymorphic loci are caused by microsatellite mutations and by point mutations in the flanking regions. The haplotypes identified among the allelic variants of Du281 and among its orthologues in the parental species provide new evidence of the cross-species origin of D. unisexualis. To our knowledge, these data are the first to characterize the nucleotide sequences of allelic variants at microsatellite loci within parthenogenetic vertebrate animals.
Diekmann, Kerstin; Hodkinson, Trevor R; Wolfe, Kenneth H; van den Bekerom, Rob; Dix, Philip J; Barth, Susanne
2009-06-01
Lolium perenne L. (perennial ryegrass) is globally one of the most important forage and grassland crops. We sequenced the chloroplast (cp) genome of Lolium perenne cultivar Cashel. The L. perenne cp genome is 135 282 bp with a typical quadripartite structure. It contains genes for 76 unique proteins, 30 tRNAs and four rRNAs. As in other grasses, the genes accD, ycf1 and ycf2 are absent. The genome is of average size within its subfamily Pooideae and of medium size within the Poaceae. Genome size differences are mainly due to length variations in non-coding regions. However, considerable length differences of 1-27 codons in comparison of L. perenne to other Poaceae and 1-68 codons among all Poaceae were also detected. Within the cp genome of this outcrossing cultivar, 10 insertion/deletion polymorphisms and 40 single nucleotide polymorphisms were detected. Two of the polymorphisms involve tiny inversions within hairpin structures. By comparing the genome sequence with RT-PCR products of transcripts for 33 genes, 31 mRNA editing sites were identified, five of them unique to Lolium. The cp genome sequence of L. perenne is available under Accession number AM777385 at the European Molecular Biology Laboratory, National Center for Biotechnology Information and DNA DataBank of Japan.
Transposon Variants and Their Effects on Gene Expression in Arabidopsis
Wang, Xi; Weigel, Detlef; Smith, Lisa M.
2013-01-01
Transposable elements (TEs) make up the majority of many plant genomes. Their transcription and transposition are controlled through siRNAs and epigenetic marks including DNA methylation. To dissect the interplay of siRNA-mediated regulation and TE evolution, and to examine how TE differences affect nearby gene expression, we investigated genome-wide differences in TEs, siRNAs, and gene expression among three Arabidopsis thaliana accessions. Both TE sequence polymorphisms and presence of linked TEs are positively correlated with intraspecific variation in gene expression. The expression of genes within 2 kb of conserved TEs is more stable than that of genes next to variant TEs harboring sequence polymorphisms. Polymorphism levels of TEs and closely linked adjacent genes are positively correlated as well. We also investigated the distribution of 24-nt-long siRNAs, which mediate TE repression. TEs targeted by uniquely mapping siRNAs are on average farther from coding genes, apparently because they more strongly suppress expression of adjacent genes. Furthermore, siRNAs, and especially uniquely mapping siRNAs, are enriched in TE regions missing in other accessions. Thus, targeting by uniquely mapping siRNAs appears to promote sequence deletions in TEs. Overall, our work indicates that siRNA targeting of TEs may influence removal of sequences from the genome and hence evolution of gene expression in plants. PMID:23408902
Association of Amine-Receptor DNA Sequence Variants with Associative Learning in the Honeybee.
Lagisz, Malgorzata; Mercer, Alison R; de Mouzon, Charlotte; Santos, Luana L S; Nakagawa, Shinichi
2016-03-01
Octopamine- and dopamine-based neuromodulatory systems play a critical role in learning and learning-related behaviour in insects. To further our understanding of these systems and resulting phenotypes, we quantified DNA sequence variations at six loci coding for octopamine and dopamine receptors and their association with aversive and appetitive learning traits in a population of honeybees. We identified 79 polymorphic sequence markers (mostly SNPs and a few insertions/deletions) located within or close to six candidate genes. Intriguingly, we found that levels of sequence variation in the protein-coding regions studied were low, indicating that sequence variation in the coding regions of receptor genes critical to learning and memory is strongly selected against. Non-coding and upstream regions of the same genes, however, were less conserved and sequence variations in these regions were weakly associated with between-individual differences in learning-related traits. While these associations do not directly imply a specific molecular mechanism, they suggest that the cross-talk between dopamine and octopamine signalling pathways may influence olfactory learning and memory in the honeybee.
A conserved post-transcriptional BMP2 switch in lung cells.
Jiang, Shan; Fritz, David T; Rogers, Melissa B
2010-05-15
An ultra-conserved sequence in the bone morphogenetic protein 2 (BMP2) 3' untranslated region (UTR) markedly represses BMP2 expression in non-transformed lung cells. In contrast, the ultra-conserved sequence stimulates BMP2 expression in transformed lung cells. The ultra-conserved sequence functions as a post-transcriptional cis-regulatory switch. A common single-nucleotide polymorphism (SNP, rs15705, +A1123C), which has been shown to influence human morphology, disrupts a conserved element within the ultra-conserved sequence and altered reporter gene activity in non-transformed lung cells. This polymorphism changed the affinity of the BMP2 RNA for several proteins including nucleolin, which has an increased affinity for the C allele. Elevated BMP2 synthesis is associated with increased malignancy in mouse models of lung cancer and poor lung cancer patient prognosis. Understanding the cis- and trans-regulatory factors that control BMP2 synthesis is relevant to the initiation or progression of pathologies associated with abnormal BMP2 levels. (c) 2010 Wiley-Liss, Inc.
Genome-wide DNA polymorphisms in two cultivars of mei (Prunus mume Sieb. et Zucc.).
Sun, Lidan; Zhang, Qixiang; Xu, Zongda; Yang, Weiru; Guo, Yu; Lu, Jiuxing; Pan, Huitang; Cheng, Tangren; Cai, Ming
2013-10-06
Mei (Prunus mume Sieb. et Zucc.) is a famous ornamental plant and fruit crop grown in East Asian countries. Limited genetic resources, especially molecular markers, have hindered the progress of mei breeding projects. Here, we performed low-depth whole-genome sequencing of Prunus mume 'Fenban' and Prunus mume 'Kouzi Yudie' to identify high-quality polymorphic markers between the two cultivars on a large scale. A total of 1464.1 Mb and 1422.1 Mb of 'Fenban' and 'Kouzi Yudie' sequencing data were uniquely mapped to the mei reference genome with about 6-fold coverage, respectively. We detected a large number of putative polymorphic markers from the 196.9 Mb of sequencing data shared by the two cultivars, which together contained 200,627 SNPs, 4,900 InDels, and 7,063 SSRs. Among these markers, 38,773 SNPs, 174 InDels, and 418 SSRs were distributed in the 22.4 Mb CDS region, and 63.0% of these marker-containing CDS sequences were assigned to GO terms. Subsequently, 670 selected SNPs were validated using Agilent's SureSelect solution-phase hybridization assay. A subset of 599 SNPs was used to assess the genetic similarity of a panel of mei germplasm samples and a plum (P. salicina) cultivar, producing a set of informative diversity data. We also analyzed the frequency and distribution of detected InDels and SSRs in the mei genome and validated their usefulness as DNA markers. These markers were successfully amplified in the cultivars and in their segregating progeny. A large set of high-quality polymorphic SNPs, InDels, and SSRs were identified in parallel between 'Fenban' and 'Kouzi Yudie' using low-depth whole-genome sequencing. The study presents extensive data on these polymorphic markers, which can be useful for constructing high-resolution genetic maps, performing genome-wide association studies, and designing genomic selection strategies in mei.
Analysis of mutational changes at the HLA locus in single human sperm.
Huang, M M; Erlich, H A; Goodman, M F; Arnheim, N
1995-01-01
Using a simple and efficient single sperm PCR and direct sequencing method, we screened for HLA-DPB1 gene mutations that may give rise to new alleles at this highly polymorphic locus. More than 800 single sperm were studied from a heterozygous individual whose two alleles carried 16 nucleotide sequence differences clustered in six polymorphic regions. A potential microgene conversion event was detected. Unrepaired heteroduplex DNA similar to that which gives rise to postmeiotic segregation events in yeast was observed in three cases. Control experiments also revealed unusual sperm from DPB1 homozygous individuals. The data may help explain allelic diversity in the MHC and suggest that a possible source of human mosaicism may be incomplete DNA mismatch repair during gametogenesis.
Sawada, Akihisa; Croom-Carter, Deborah; Kondo, Osamu; Yasui, Masahiro; Koyama-Sato, Maho; Inoue, Masami; Kawa, Keisei; Rickinson, Alan B; Tierney, Rosemary J
2011-05-01
Polymorphisms in Epstein-Barr virus (EBV) latent genes can identify virus strains from different human populations and individual strains within a population. An Asian EBV signature has been defined almost exclusively from Chinese viruses, with little information from other Asian countries. Here we sequenced polymorphic regions of the EBNA1, 2, 3A, 3B, 3C and LMP1 genes of 31 Japanese strains from control donors and EBV-associated T/NK-cell lymphoproliferative disease (T/NK-LPD) patients. Though identical to Chinese strains in their dominant EBNA1 and LMP1 alleles, Japanese viruses were subtly different at other loci. Thus, while Chinese viruses mainly fall into two families with strongly linked 'Wu' or 'Li' alleles at EBNA2 and EBNA3A/B/C, Japanese viruses all have the consensus Wu EBNA2 allele but fall into two families at EBNA3A/B/C. One family has variant Li-like sequences at EBNA3A and 3B and the consensus Li sequence at EBNA3C; the other family has variant Wu-like sequences at EBNA3A, variants of a low frequency Chinese allele 'Sp' at EBNA3B and a consensus Sp sequence at EBNA3C. Thus, EBNA3A/B/C allelotypes clearly distinguish Japanese from Chinese strains. Interestingly, most Japanese viruses also lack those immune-escape mutations in the HLA-A11 epitope-encoding region of EBNA3B that are so characteristic of viruses from the highly A11-positive Chinese population. Control donor-derived and T/NK-LPD-derived strains were similarly distributed across allelotypes and, by using allelic polymorphisms to track virus strains in patients pre- and post-haematopoietic stem-cell transplant, we show that a single strain can induce both T/NK-LPD and B-cell-lymphoproliferative disease in the same patient.
USDA-ARS's Scientific Manuscript database
The highly effective Hessian fly-resistance gene, H33, was introgressed from durum wheat into common wheat and genetically mapped to chromosome 3AS, in previous research. However, H33 located to a region that is well-known to be devoid of molecular markers, with the closest flanking simple sequence ...
Lin, You-Yu; Hsieh, Chia-Hung; Chen, Jiun-Hong; Lu, Xuemei; Kao, Jia-Horng; Chen, Pei-Jer; Chen, Ding-Shinn; Wang, Hurng-Yi
2017-04-26
The accuracy of metagenomic assembly is usually compromised by high levels of polymorphism, because divergent reads from the same genomic region are recognized as different loci when sequenced and assembled together. A viral quasispecies is a group of abundant and diversified genetically related viruses found in a single carrier. Current mainstream assembly methods, such as Velvet and SOAPdenovo, were not originally intended for the assembly of such metagenomic data, and new methods are therefore needed to provide accurate and informative assembly results for metagenomic data. In this study, we present a hybrid method for assembling highly polymorphic data combining the partial de novo-reference assembly (PDR) strategy and the BLAST-based assembly pipeline (BBAP). The PDR strategy generates in situ reference sequences through de novo assembly of a randomly extracted partial data set, which is subsequently used for the reference assembly of the full data set. BBAP employs a greedy algorithm to assemble polymorphic reads. We used 12 hepatitis B virus quasispecies NGS data sets from a previous study to assess and compare the performance of both PDR and BBAP. Analyses suggest that the high polymorphism of a full metagenomic data set leads to fragmented de novo assembly results, whereas the biased or limited representation of external reference sequences incorporated fewer reads into the assembly, with lower assembly accuracy and variation sensitivity. In comparison, the PDR-generated in situ reference sequence incorporated more reads into the final PDR assembly of the full metagenomic data set, with greater accuracy and higher variation sensitivity. BBAP assembly results also suggest higher assembly efficiency and accuracy compared to other assembly methods. Additionally, BBAP assembly recovered HBV structural variants that were not observed amongst assembly results of other methods. Together, PDR/BBAP assembly results were significantly better than other compared methods.
Both PDR and BBAP independently increased the assembly efficiency and accuracy of highly polymorphic data, and assembly performances were further improved when used together. BBAP also provides nucleotide frequency information. Together, PDR and BBAP provide powerful tools for metagenomic data studies.
Vembar, Shruthi Sridhar; Seetin, Matthew; Lambert, Christine; Nattestad, Maria; Schatz, Michael C.; Baybayan, Primo; Scherf, Artur; Smith, Melissa Laird
2016-01-01
The application of next-generation sequencing to estimate genetic diversity of Plasmodium falciparum, the most lethal malaria parasite, has proved challenging due to the skewed AT-richness [∼80.6% (A + T)] of its genome and the lack of technology to assemble highly polymorphic subtelomeric regions that contain clonally variant, multigene virulence families (e.g., var and rifin). To address this, we performed amplification-free, single molecule, real-time sequencing of P. falciparum genomic DNA and generated reads of average length 12 kb, with 50% of the reads between 15.5 and 50 kb in length. Next, using the Hierarchical Genome Assembly Process, we assembled the P. falciparum genome de novo and successfully compiled all 14 nuclear chromosomes telomere-to-telomere. We also accurately resolved centromeres [∼90–99% (A + T)] and subtelomeric regions and identified large insertions and duplications that add extra var and rifin genes to the genome, along with smaller structural variants such as homopolymer tract expansions. Overall, we show that amplification-free, long-read sequencing combined with de novo assembly overcomes major challenges inherent to studying the P. falciparum genome. Indeed, this technology may not only identify the polymorphic and repetitive subtelomeric sequences of parasite populations from endemic areas but may also evaluate structural variation linked to virulence, drug resistance and disease transmission. PMID:27345719
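The (A + T) percentages quoted above are simple base-composition statistics. A minimal sketch of how such a figure could be computed for any sequence is given below; the function name `at_fraction` and the toy sequence are assumptions for illustration, and ambiguous bases such as N are simply excluded from the denominator.

```python
def at_fraction(sequence):
    """Fraction of A and T among the unambiguous bases of a DNA sequence."""
    sequence = sequence.upper()
    at = sum(sequence.count(base) for base in "AT")
    gc = sum(sequence.count(base) for base in "GC")
    if at + gc == 0:
        raise ValueError("sequence contains no A, C, G or T bases")
    return at / (at + gc)


if __name__ == "__main__":
    # Toy sequence, not real P. falciparum data.
    print(f"{100 * at_fraction('ATATATATGC'):.1f}% (A + T)")
```

Applied in sliding windows along a chromosome, the same calculation would highlight extremely AT-rich stretches such as the centromeres described in the abstract.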
Development of Genomic Simple Sequence Repeats (SSR) by Enrichment Libraries in Date Palm.
Al-Faifi, Sulieman A; Migdadi, Hussein M; Algamdi, Salem S; Khan, Mohammad Altaf; Al-Obeed, Rashid S; Ammar, Megahed H; Jakse, Jerenj
2017-01-01
Development of highly informative markers such as simple sequence repeats (SSR) for cultivar identification and germplasm characterization and management is essential for date palm genetic studies. The present study documents the development of SSR markers and assesses genetic relationships of commonly grown date palm (Phoenix dactylifera L.) cultivars in different geographical regions of Saudi Arabia. A total of 93 novel simple sequence repeat (SSR) markers were screened for their ability to detect polymorphism in date palm. Around 71% of genomic SSRs are dinucleotide, 25% trinucleotide, 3% tetranucleotide, and 1% pentanucleotide motifs and show 100% polymorphism. The Unweighted Pair Group Method with Arithmetic Mean (UPGMA) cluster analysis illustrates that cultivars tend to group according to their class of maturity, region of cultivation, and fruit color. Analysis of molecular variance (AMOVA) reveals genetic variation among and within cultivars of 27% and 73%, respectively, according to the geographical distribution of the cultivars. The developed microsatellite markers are of additional value to date palm characterization, as tools that can be used by researchers in population genetics, cultivar identification, and genetic resource exploration and management. The cultivars tested exhibited a significant amount of genetic diversity and could be suitable for successful breeding programs. Genomic sequences generated from this study are available at the National Center for Biotechnology Information (NCBI), Sequence Read Archive (accession number LIBGSS_039019).
Kehie, Mechuselie; Kumaria, Suman; Devi, Khumuckcham Sangeeta; Tandon, Pramod
2016-02-01
Sequences of the Internal Transcribed Spacer (ITS1-5.8S-ITS2) of nuclear ribosomal DNAs were explored to study the genetic diversity and molecular evolution of Naga King Chili. Our study indicated the occurrence of nucleotide polymorphism and haplotypic diversity in the ITS regions. The present study demonstrated that the variability of ITS1 with respect to nucleotide diversity and sequence polymorphism exceeded that of ITS2. Sequence analysis of the 5.8S gene revealed a highly conserved region in all the accessions of Naga King Chili. However, a distinct 13 bp deletion in the 5.8S gene is strongly phylogenetically informative for this species, as it discriminated Naga King Chili from the rest of the Capsicum sp. Neutrality test results implied neutral variation, and the population seems to be evolving at drift-mutation equilibrium, free from directed selection pressure. Furthermore, mismatch analysis showed a multimodal curve, indicating demographic equilibrium. Phylogenetic relationships revealed by Median Joining Network (MJN) analysis denoted a clear discrimination of Naga King Chili from its closest sister species (Capsicum chinense and Capsicum frutescens). The absence of a star-like network of haplotypes suggested an ancient population expansion of this chili.
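Nucleotide diversity, the statistic used above to compare ITS1 with ITS2, is commonly defined as the average number of pairwise differences per site among aligned sequences. The sketch below implements that textbook definition; it is not the software used in the study, the function name `nucleotide_diversity` and the toy alignment are hypothetical, and gaps are treated as ordinary characters for simplicity.

```python
from itertools import combinations


def nucleotide_diversity(aligned_seqs):
    """Average pairwise differences per site (pi) for an alignment.

    `aligned_seqs` is a list of equal-length strings. Gaps are compared
    like any other character, which is a simplifying assumption.
    """
    if len(aligned_seqs) < 2:
        raise ValueError("need at least two sequences")
    length = len(aligned_seqs[0])
    if length == 0 or any(len(s) != length for s in aligned_seqs):
        raise ValueError("sequences must be non-empty and equally long")
    pairs = list(combinations(aligned_seqs, 2))
    total_diffs = sum(
        sum(1 for a, b in zip(s1, s2) if a != b) for s1, s2 in pairs
    )
    return total_diffs / len(pairs) / length


if __name__ == "__main__":
    # Toy alignment of three sequences, four sites each.
    print(round(nucleotide_diversity(["ACGT", "ACGA", "ACGT"]), 4))
```

Computing the statistic separately for the ITS1 and ITS2 columns of an alignment would give the kind of comparison the abstract reports.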
Chung, H Y; Choi, Y C; Park, H N
2015-05-18
We investigated the phylogenetic relationships between pig breeds, compared the genetic similarity between humans and pigs, and provided basic genetic information on Korean native pigs (KNPs), using genetic variants of the swine leukocyte antigen 3 (SLA-3) gene. Primers were based on sequences from GenBank (accession Nos. AF464010 and AF464009). Polymerase chain reaction analysis amplified approximately 1727 bp of segments, which contained 1086 bp of coding regions and 641 bp of the 3'- and 5'-untranslated regions. Bacterial artificial chromosome clones of miniature pigs were used for sequencing the SLA-3 genomic region, which was 3114 bp in total length, including the coding (1086 bp) and non-coding (2028 bp) regions. Sequence analysis detected 53 single nucleotide polymorphisms (SNPs), based on a minor allele frequency greater than 0.01, which is low compared with other pig breeds, and the results suggest that there is low genetic variability in KNPs. Comparative analysis revealed that humans possess approximately three times more genetic variation than do pigs. Approximately 71% of SNPs in exons 2 and 3 were detected in KNPs, and exon 5 in humans is a highly polymorphic region. Newly identified sequences of SLA-3 using KNPs were submitted to GenBank (accession No. DQ992512-18). Cluster analysis revealed that KNPs were grouped according to three major alleles: SLA-3*0502 (DQ992518), SLA-3*0302 (DQ992513 and DQ992516), and SLA-3*0303 (DQ992512, DQ992514, DQ992515, and DQ992517). Alignments revealed that humans have a relatively close genetic relationship with pigs and chimpanzees. The information provided by this study may be useful in KNP management.
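The SNP count above depends on a minor allele frequency (MAF) threshold of 0.01. A minimal sketch of such a filter follows, assuming MAF is taken as the frequency of the second most common allele at a site; the function names, the threshold argument and the example data are illustrative and are not drawn from the SLA-3 data set.

```python
from collections import Counter


def minor_allele_frequency(alleles):
    """Frequency of the second most common allele at one site.

    `alleles` is an iterable of observed allele calls (one per sampled
    chromosome). Monomorphic sites return 0.0.
    """
    counts = Counter(alleles)
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no allele calls supplied")
    ranked = sorted(counts.values(), reverse=True)
    return ranked[1] / total if len(ranked) > 1 else 0.0


def passes_maf_filter(alleles, threshold=0.01):
    """True when the site would be kept as a SNP under the threshold."""
    return minor_allele_frequency(alleles) > threshold


if __name__ == "__main__":
    # Hypothetical site: 98 chromosomes carry A and 2 carry G.
    site = ["A"] * 98 + ["G"] * 2
    print(minor_allele_frequency(site), passes_maf_filter(site))
```

With small sample sizes a 0.01 threshold is met by any site where the minor allele is seen at least once in fewer than 100 chromosomes, so the sample size matters when comparing SNP counts between breeds.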
Küpper, Clemens; Burke, Terry; Lank, David B.
2015-01-01
Sequence variation in the melanocortin-1 receptor (MC1R) gene explains color morph variation in several species of birds and mammals. Ruffs (Philomachus pugnax) exhibit major dark/light color differences in melanin-based male breeding plumage which is closely associated with alternative reproductive behavior. A previous study identified a microsatellite marker (Ppu020) near the MC1R locus associated with the presence/absence of ornamental plumage. We investigated whether coding sequence variation in the MC1R gene explains major dark/light plumage color variation and/or the presence/absence of ornamental plumage in ruffs. Among 821bp of the MC1R coding region from 44 male ruffs we found 3 single nucleotide polymorphisms, representing 1 nonsynonymous and 2 synonymous amino acid substitutions. None were associated with major dark/light color differences or the presence/absence of ornamental plumage. At all amino acid sites known to be functionally important in other avian species with dark/light plumage color variation, ruffs were either monomorphic or the shared polymorphism did not coincide with color morph. Neither ornamental plumage color differences nor the presence/absence of ornamental plumage in ruffs are likely to be caused entirely by amino acid variation within the coding regions of the MC1R locus. Regulatory elements and structural variation at other loci may be involved in melanin expression and contribute to the extreme plumage polymorphism observed in this species. PMID:25534935
NASA Astrophysics Data System (ADS)
Kooistra, Wiebe H. C. F.; de Boer, M. Karin; Vrieling, Engel G.; Connell, Laurie B.; Gieskes, Winfried W. C.
2001-12-01
The flagellate micro-alga Fibrocapsa japonica can form harmful algal blooms along all temperate coastal regions of the world. The species was first observed in coastal waters of Japan and the western US in the 1970s; it has been reported regularly worldwide since. To unravel whether this apparent range expansion can be tracked, we assessed genetic variation among nuclear ribosomal DNA ITS sequences, obtained from sixteen global strains collected over the course of three decades. Ten sequence positions showed polymorphism across the strains. Nine out of these revealed ambiguities in several or most sequences sampled. The oldest strain collected (LB-2161) was the only one without such intra-individual polymorphism. In the others, the proportion of ambiguities at variable sites increased with more recent collection date. The pattern does not result from loss of variation due to sexual reproduction and random drift in culture because sister cultures CS-332 and NIES-136 showed virtually the same ITS-pattern after seven years of separation. Neither are the patterns explained by recent range expansion of a single genotype, because in that case one would expect lowest genetic diversity in the recently invaded North Sea; instead, polymorphism is highest there. Recent ballast-water-mediated mixing of formerly isolated populations and subsequent ongoing sexual reproduction among them can explain the increase in ambiguities. The species' capacity to form harmful blooms may well have been enhanced through increased genetic diversity of regional populations.
Stam, L. F.; Laurie, C. C.
1996-01-01
A molecular mapping experiment shows that a major gene effect on a quantitative trait, the level of alcohol dehydrogenase expression in Drosophila melanogaster, is due to multiple polymorphisms within the Adh gene. These polymorphisms are located in an intron, the coding sequence, and the 3' untranslated region. Because of nonrandom associations among polymorphisms at different sites, the individual effects combine (in some cases epistatically) to produce "superalleles" with large effect. These results have implications for the interpretation of major gene effects detected by quantitative trait locus mapping methods. They show that large effects due to a single locus may be due to multiple associated polymorphisms (or sequential fixations in isolated populations) rather than individual mutations of large effect. PMID:8978044
Amino acid and structural variability of Yersinia pestis LcrV protein
DOE Office of Scientific and Technical Information (OSTI.GOV)
Anisimov, A P; Dentovskaya, S V; Panfertsev, E A
2009-11-09
The LcrV protein is a multifunctional virulence factor and protective antigen of the plague bacterium which is generally conserved between the epidemic strains of Yersinia pestis. They investigated the diversity in the LcrV sequences among non-epidemic Y. pestis strains which have a limited virulence in selected animal models and for humans. Sequencing of lcrV genes from ten Y. pestis strains belonging to different phylogenetic groups (subspecies) showed that the LcrV proteins possess four major variable hotspots at positions 18, 72, 273, and 324-326. These major variations, together with other minor substitutions in amino acid sequences, allowed them to classify the LcrV alleles into five sequence types (A-E). They observed that the strains of different Y. pestis subspecies can have the same type of LcrV, and different types of LcrV can exist within the same natural plague focus. The LcrV polymorphisms were structurally analyzed by comparing the modeled structures of LcrV from all available strains. All changes except one occurred either in flexible regions or on the surface of the protein, but local chemical properties (i.e. those of a hydrophobic, hydrophilic, amphipathic, or charged nature) were conserved across all of the strains. Polymorphisms in flexible and surface regions are likely subject to less selective pressure, and have a limited impact on the structure. In contrast, the substitution of tryptophan at position 113 with either glutamic acid or glycine likely has a serious influence on the regional structure of the protein, and these mutations might have an effect on the function of LcrV. The polymorphisms at positions 18, 72 and 273 were accountable for differences in oligomerization of LcrV. The importance of the latter property in emergence of epidemic strains of Y. pestis during evolution of this pathogen will need to be further investigated.
Reed, K M; Dorschner, M O; Todd, T N; Phillips, R B
1998-09-01
Sequence variation in the control region (D-loop) of the mitochondrial DNA (mtDNA) was examined to assess the genetic distinctiveness of the shortjaw cisco (Coregonus zenithicus). Individuals from within the Great Lakes Basin as well as inland lakes outside the basin were sampled. DNA fragments containing the entire D-loop were amplified by PCR from specimens of C. zenithicus and the related species C. artedi, C. hoyi, C. kiyi, and C. clupeaformis. DNA sequence analysis revealed high similarity within and among species and shared polymorphism for length variants. Based on this analysis, the shortjaw cisco is not genetically distinct from other cisco species.
Analysis of the mitochondrial genome of cheetahs (Acinonyx jubatus) with neurodegenerative disease.
Burger, Pamela A; Steinborn, Ralf; Walzer, Christian; Petit, Thierry; Mueller, Mathias; Schwarzenberger, Franz
2004-08-18
The complete mitochondrial genome of Acinonyx jubatus was sequenced and mitochondrial DNA (mtDNA) regions were screened for polymorphisms as candidates for the cause of a neurodegenerative demyelinating disease affecting captive cheetahs. The mtDNA reference sequences were established on the basis of the complete sequences of two diseased and two nondiseased animals as well as partial sequences of 26 further individuals. The A. jubatus mitochondrial genome is 17,047-bp long and shows a high sequence similarity (91%) to the domestic cat. Based on single nucleotide polymorphisms (SNPs) in the control region (CR) and pedigree information, the 18 myelopathic and 12 non-myelopathic cheetahs included in this study were classified into haplotypes I, II and III. In view of the phenotypic comparability of the neurodegenerative disease observed in cheetahs and human mtDNA-associated diseases, specific coding regions including the tRNAs leucine UUR, lysine, serine UCN, and partial complex I and V sequences were screened. We identified a heteroplasmic and a homoplasmic SNP at codon 507 in the subunit 5 (MTND5) of complex I. The heteroplasmic haplotype I-specific valine to methionine substitution represents a nonconservative amino acid change and was found in 11 myelopathic and eight non-myelopathic cheetahs with levels ranging from 29% to 79%. The homoplasmic conservative amino acid substitution valine to alanine was identified in two myelopathic animals of haplotype II. In addition, a synonymous SNP in the codon 76 of the MTND4L gene was found in the single haplotype III animal. The amino acid exchanges in the MTND5 gene were not associated with the occurrence of neurodegenerative disease in captive cheetahs.
Robarts, Daniel W. H.; Wolfe, Andrea D.
2014-01-01
In the past few decades, many investigations in the field of plant biology have employed selectively neutral, multilocus, dominant markers such as inter-simple sequence repeat (ISSR), random-amplified polymorphic DNA (RAPD), and amplified fragment length polymorphism (AFLP) to address hypotheses at lower taxonomic levels. More recently, sequence-related amplified polymorphism (SRAP) markers have been developed, which are used to amplify coding regions of DNA with primers targeting open reading frames. These markers have proven to be robust and highly variable, on par with AFLP, and are attained through a significantly less technically demanding process. SRAP markers have been used primarily for agronomic and horticultural purposes, developing quantitative trait loci in advanced hybrids and assessing genetic diversity of large germplasm collections. Here, we suggest that SRAP markers should be employed for research addressing hypotheses in plant systematics, biogeography, conservation, ecology, and beyond. We provide an overview of the SRAP literature to date, review descriptive statistics of SRAP markers in a subset of 171 publications, and present relevant case studies to demonstrate the applicability of SRAP markers to the diverse field of plant biology. Results of these selected works indicate that SRAP markers have the potential to enhance the current suite of molecular tools in a diversity of fields by providing an easy-to-use, highly variable marker with inherent biological significance. PMID:25202637
Nadjar-Boger, Elisabeth; Funkenstein, Bruria
2011-02-01
Myostatin (MSTN) is a member of the transforming growth factor-β superfamily that functions as a negative regulator of skeletal muscle development and growth in mammals. Fish express at least two genes for MSTN: MSTN-1 and MSTN-2. To date, MSTN-2 promoters have been cloned only from salmonids and zebrafish. Here we describe the cloning and sequence analysis of the MSTN-2 gene and its 5' flanking region in the marine fish Sparus aurata (saMSTN-2). We demonstrate the existence of three alleles of the promoter and three alleles of the first intron. Sequence comparison of the promoter region in the three alleles revealed that although the sequences of the first 1050 bp upstream of the translation start site are almost identical in the three alleles, a substantial sequence divergence is seen further upstream. Careful sequence analysis of the region upstream of the first 1050 bp in the three alleles identified several elements that appear to be repeated in some or all sequences, at different positions. This suggests that the promoter region of saMSTN-2 has been subjected to various chromosomal rearrangements during the course of evolution, reflecting either insertion or deletion events. Screening of several genomic DNA collections indicated differences in allele frequency, with allele 'b' being the most abundant, followed by allele 'c', whereas allele 'a' is relatively rare. Sequence analysis of the saMSTN-2 gene also revealed polymorphism in the first intron, identifying three alleles. The length difference in alleles '1R' and '2R' of the first intron is due to the presence of one or two copies of a repeated block of approximately 150 bp, located at the 5' end of the first intron. The third allele, '4R', has an additional insertion of 323 bp located 116 bp upstream of the 3' end of the first intron. Analysis of several DNA collections showed that the '2R' allele is the most common, followed by the '4R' allele, whereas the '1R' allele is relatively rare.
Progeny analysis of a full-sib family showed a Mendelian mode of inheritance of the two genetic loci. No clear association was found between the two genetic markers and growth rate. These results show for the first time a substantial degree of polymorphism in both the promoter and first intron of MSTN-2 gene in a perciform fish species which points to chromosomal rearrangements that took place during evolution.
Toll like receptor 2 and 4 polymorphisms in malaria endemic populations of India.
Bali, Prerna; Pradhan, Sabyasachi; Sharma, Divya; Adak, Tridibes
2013-02-01
Toll like receptors (TLRs) play a pivotal role in recognizing the invading malaria parasite Plasmodium, thus genetic makeup of the exposed population can be of utmost importance for its predisposition to malaria. In this study 264 malaria patients from seven different eco epidemiological regions of India were genotyped for TLR2 and TLR4 polymorphisms using DNA sequencing methods. No variation was observed at residue positions 677 and 753 in TLR2 whereas residue positions 299 and 399 in TLR4 were highly polymorphic. The GC haplotype (Asp299Gly/Thr399Thr) was observed at the highest frequency in populations of East Singhbhum, Vizianagaram and North Goa and absent in Kolkata, Dakshin Kannada and Nicobar district. All polymorphisms were in Hardy Weinberg equilibrium. Populations of Kolkata, Nicobar district, Sundergarh and Dakshin Kannada were observed to be closely related. TLR2 polymorphism was absent in the Indian population and an overall heterogeneous pattern of TLR4 polymorphism can be attributed to genetic drift. However it can be inferred that GC haplotype is under the process of natural selection in the Indian population and one of the factors contributing to its selection could be predominance of Plasmodium falciparum in these regions. Copyright © 2012 American Society for Histocompatibility and Immunogenetics. Published by Elsevier Inc. All rights reserved.
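The statement above that all polymorphisms were in Hardy-Weinberg equilibrium is typically checked with a chi-square goodness-of-fit test on genotype counts. The abstract does not say which test the authors used, so the following is only a minimal sketch under that assumption; the function name and the genotype counts are hypothetical and are not data from the study.

```python
def hardy_weinberg_chi_square(n_aa, n_ab, n_bb):
    """Chi-square goodness-of-fit statistic for a biallelic locus.

    n_aa, n_ab, n_bb are observed genotype counts (hypothetical inputs).
    Returns (frequency of allele A, expected counts, chi-square statistic).
    """
    n = n_aa + n_ab + n_bb
    if n == 0:
        raise ValueError("at least one genotyped individual is required")
    # Allele frequency estimated by gene counting.
    p = (2 * n_aa + n_ab) / (2 * n)
    q = 1.0 - p
    expected = (p * p * n, 2 * p * q * n, q * q * n)
    observed = (n_aa, n_ab, n_bb)
    # Skip classes with zero expectation to avoid division by zero.
    chi2 = sum((o - e) ** 2 / e for o, e in zip(observed, expected) if e > 0)
    return p, expected, chi2


# Toy counts, not taken from the paper.
p, expected, chi2 = hardy_weinberg_chi_square(30, 40, 30)
# With one degree of freedom, a statistic above 3.841 rejects
# equilibrium at the 5% level.
print(p, expected, chi2, chi2 > 3.841)
```

For small samples or rare alleles an exact test is usually preferred over the chi-square approximation.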
Population-Level Immune-Mediated Adaptation in HIV-1 Polymerase during the North American Epidemic
Kinloch, Natalie N.; MacMillan, Daniel R.; Le, Anh Q.; Cotton, Laura A.; Bangsberg, David R.; Buchbinder, Susan; Carrington, Mary; Fuchs, Jonathan; Harrigan, P. Richard; Koblin, Beryl; Kushel, Margot; Markowitz, Martin; Mayer, Kenneth; Milloy, M. J.; Schechter, Martin T.; Wagner, Theresa; Walker, Bruce D.; Carlson, Jonathan M.; Poon, Art F. Y.
2015-01-01
ABSTRACT Human leukocyte antigen (HLA) class I-associated polymorphisms in HIV-1 that persist upon transmission to HLA-mismatched hosts may spread in the population as the epidemic progresses. Transmission of HIV-1 sequences containing such adaptations may undermine cellular immune responses to the incoming virus in future hosts. Building upon previous work, we investigated the extent of HLA-associated polymorphism accumulation in HIV-1 polymerase (Pol) through comparative analysis of linked HIV-1/HLA class I genotypes sampled during historic (1979 to 1989; n = 338) and modern (2001 to 2011; n = 278) eras from across North America (Vancouver, BC, Canada; Boston, MA; New York, NY; and San Francisco, CA). Phylogenies inferred from historic and modern HIV-1 Pol sequences were star-like in shape, with an inferred most recent common ancestor (epidemic founder virus) sequence nearly identical to the modern North American subtype B consensus sequence. Nevertheless, modern HIV-1 Pol sequences exhibited roughly 2-fold-higher patristic (tip-to-tip) genetic distances than historic sequences, with HLA pressures likely driving ongoing diversification. Moreover, the frequencies of published HLA-associated polymorphisms in individuals lacking the selecting HLA class I allele were on average ∼2.5-fold higher in the modern than in the historic era, supporting their spread in circulation, though some remained stable in frequency during this time. Notably, polymorphisms restricted by protective HLA alleles appear to be spreading to a greater relative extent than others, though these increases are generally of modest absolute magnitude. However, despite evidence of polymorphism spread, North American hosts generally remain at relatively low risk of acquiring an HIV-1 polymerase sequence substantially preadapted to their HLA profiles, even in the present era.
IMPORTANCE HLA class I-restricted cytotoxic T-lymphocyte (CTL) escape mutations in HIV-1 that persist upon transmission may accumulate in circulation over time, potentially undermining host antiviral immunity to the transmitted viral strain. We studied >600 experimentally collected HIV-1 polymerase sequences linked to host HLA information dating back to 1979, along with phylogenetically reconstructed HIV-1 sequences dating back to the virus' introduction into North America. Overall, our results support the gradual spread of many—though not all—HIV-1 polymerase immune escape mutations in circulation over time. This is consistent with recent observations from other global regions, though the extent of polymorphism accumulation in North America appears to be lower than in populations with high seroprevalence, older epidemics, and/or limited HLA diversity. Importantly, the risk of acquiring an HIV-1 polymerase sequence at transmission that is substantially preadapted to one's HLA profile remains relatively low in North America, even in the present era. PMID:26559841
Bishoyi, Ashok Kumar; Kavane, Aarti; Sharma, Anjali; Geetha, K A
2017-02-01
Cymbopogon is an important member of the grass family Poaceae, cultivated for essential oils that have great medicinal and industrial value. Taxonomic identification of Cymbopogon species is determined mainly by morphological markers, odour of essential oils and concentration of bioactive compounds present in the oil matrices, which are highly influenced by environment. Authenticated molecular marker based taxonomical identification is also lacking in the genus; hence an effort was made to evaluate potential DNA barcode loci in six commercially important Cymbopogon species for their individual discrimination and authentication at the species level. Four widely used DNA barcoding regions viz., ITS 1 & ITS 2 spacers, matK, psbA-trnH and rbcL were taken for the study. Gene sequences of the same or related genera of the concerned loci were mined from the NCBI domain, and primers were designed and validated for barcode loci amplification. Out of the four loci studied, sequences from matK and ITS spacer loci revealed 0.46% and 5.64% nucleotide sequence diversity, respectively, whereas the other two loci, i.e., psbA-trnH and rbcL, showed 100% sequence homology. The newly developed primers can be used for barcode loci amplification in the genus Cymbopogon. The identified Single Nucleotide Polymorphisms from the studied sequences may be used as barcodes for the six Cymbopogon species. The information generated can also be utilized for barcode development of the genus by including more Cymbopogon species in future.
2009-01-01
In order to identify new markers around the glaucoma locus GLC1B as a tool to refine its critical region at 2p11.2-2q11.2, we searched the critical region sequence obtained from the UCSC database for tetranucleotide (GATA)n and (GTCT)n repeats of at least 10 units in length. Three out of four potential microsatellite loci were found to be polymorphic, heterozygosity ranging from 64.56% to 79.59%. The identified markers are useful not only for GLC1B locus but also for the study of other disease loci at 2p11.2-2q11.2, a region with scarcity of microsatellite markers. PMID:21637444
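The search described above, for (GATA)n and (GTCT)n runs of at least 10 units, can be expressed as a regular-expression scan. This is a hedged sketch rather than the authors' procedure: the function name, the toy sequence, and the choice to scan only the given strand are all assumptions made for illustration.

```python
import re


def find_tetranucleotide_repeats(sequence, motifs=("GATA", "GTCT"), min_units=10):
    """Return (motif, start, units) for each perfect tandem run.

    A run qualifies when it has at least `min_units` uninterrupted copies
    of the motif. Only the given strand is scanned (an assumption); pass
    the reverse complement separately if both strands matter.
    """
    seq = sequence.upper()
    hits = []
    for motif in motifs:
        # e.g. (?:GATA){10,} matches ten or more consecutive copies.
        pattern = re.compile(f"(?:{motif}){{{min_units},}}")
        for match in pattern.finditer(seq):
            units = (match.end() - match.start()) // len(motif)
            hits.append((motif, match.start(), units))
    return sorted(hits, key=lambda hit: hit[1])


# Toy sequence (hypothetical): a 12-unit GATA run, a 9-unit GTCT run that
# is too short to qualify, and a 10-unit GTCT run.
toy = "ACGT" + "GATA" * 12 + "CCC" + "GTCT" * 9 + "TT" + "GTCT" * 10
for motif, start, units in find_tetranucleotide_repeats(toy):
    print(motif, start, units)
```

Candidate loci found this way would still need flanking primers and genotyping in a population sample before heterozygosity could be estimated.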
Structure and polymorphism of the mouse myelin/oligodendrocyte glycoprotein gene
DOE Office of Scientific and Technical Information (OSTI.GOV)
Daubas, P.; Pham-Dinh, D.; Dautigny, A.
1994-09-01
The authors have isolated and characterized genomic clones containing the mouse myelin/oligodendrocyte glycoprotein (MOG) gene. It spans a region of 12.5 kb and consists of eight exons. Its exon-intron structure differs from that of classical MHC-class I genes, with which it is linked in the mouse genome. Nucleotide sequencing of the 5' flanking region reveals that it contains several putative protein-binding sites, some of them in common with other myelin gene promoters. One intragenic polymorphism has been identified: it consists of a GA repeat, defining at least three alleles in mouse inbred strains, and is easily detectable using the polymerase chain reaction method.
Baetu, Irina; Burns, Nicholas R; Urry, Kristi; Barbante, Girolamo Giovanni; Pitcher, Julia B
2015-11-01
Performing sequences of movements is a ubiquitous skill that involves dopamine transmission. However, it is unclear which components of the dopamine system contribute to which aspects of motor sequence learning. Here we used a genetic approach to investigate the relationship between different components of the dopamine system and specific aspects of sequence learning in humans. In particular, we investigated variations in genes that code for the catechol-O-methyltransferase (COMT) enzyme, the dopamine transporter (DAT) and dopamine D1 and D2 receptors (DRD1 and DRD2). COMT and the DAT regulate dopamine availability in the prefrontal cortex and the striatum, respectively, two key regions recruited during learning, whereas dopamine D1 and D2 receptors are thought to be involved in long-term potentiation and depression, respectively. We show that polymorphisms in the COMT, DRD1 and DRD2 genes differentially affect behavioral performance on a sequence learning task in 161 Caucasian participants. The DRD1 polymorphism predicted the ability to learn new sequences, the DRD2 polymorphism predicted the ability to perform a previously learnt sequence after performing interfering random movements, whereas the COMT polymorphism predicted the ability to switch flexibly between two sequences. We used computer simulations to explore potential mechanisms underlying these effects, which revealed that the DRD1 and DRD2 effects are possibly related to neuroplasticity. Our prediction-error algorithm estimated faster rates of connection strengthening in genotype groups with presumably higher D1 receptor densities, and faster rates of connection weakening in genotype groups with presumably higher D2 receptor densities. Consistent with current dopamine theories, these simulations suggest that D1-mediated neuroplasticity contributes to learning to select appropriate actions, whereas D2-mediated neuroplasticity is involved in learning to inhibit incorrect action plans. 
However, the learning algorithm did not account for the COMT effect, suggesting that prefrontal dopamine availability might affect sequence switching via other, non-learning, mechanisms. These findings provide insight into the function of the dopamine system, which is relevant to the development of treatments for disorders such as Parkinson's disease. Our results suggest that treatments targeting dopamine D1 receptors may improve learning of novel sequences, whereas those targeting dopamine D2 receptors may improve the ability to initiate previously learned sequences of movements. Copyright © 2015 The Authors. Published by Elsevier Inc. All rights reserved.
Jiao, J; Wu, J; Lv, Z; Sun, C; Gao, L; Yan, X; Cui, L; Tang, Z; Yan, B; Jia, Y
2015-11-26
This study aimed to investigate cytosine methylation profiles in different tobacco (Nicotiana tabacum) cultivars grown in China. Methylation-sensitive amplified polymorphism was used to analyze genome-wide global methylation profiles in four tobacco cultivars (Yunyan 85, NC89, K326, and Yunyan 87). Amplicons with methylated C motifs were cloned by reamplified polymerase chain reaction, sequenced, and analyzed. The results show that geographical location had a greater effect on methylation patterns in the tobacco genome than did sampling time. Analysis of the CG dinucleotide distribution in methylation-sensitive polymorphic restriction fragments suggested that a CpG dinucleotide cluster-enriched area is a possible site of cytosine methylation in the tobacco genome. The sequence alignments of the Nia1 gene (that encodes nitrate reductase) in Yunyan 87 in different regions indicate that a C-T transition might be responsible for the tobacco phenotype. T-C nucleotide replacement might also be responsible for the tobacco phenotype and may be influenced by geographical location.
Mitochondrial DNA polymorphism in a maternal lineage of Holstein cows.
Hauswirth, W W; Laipis, P J
1982-01-01
Two mitochondrial genotypes are shown to exist within one Holstein cow maternal lineage. They were detected by the appearance of an extra Hae III recognition site in one genotype. The nucleotide sequence of this region has been determined and the genotypes are distinguished by an adenine/guanine base transition which creates the new Hae III site. This point mutation occurs within an open reading frame at the third position of a glycine codon and therefore does not alter the amino acid sequence. The present pattern of genotypes within the lineage demands that multiple shifts between genotypes must have occurred within the past 20 years with the most rapid shift taking place in no more than 4 years and indicates that mitochondrial DNA polymorphism can occur between maternally related mammals. The process that gave rise to different genotypes in one lineage is clearly of fundamental importance in understanding intraspecific mitochondrial polymorphism and evolution in mammals. Several potential mechanisms for rapid mitochondrial DNA variation are discussed in light of these results. PMID:6289312
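The mechanism in this abstract, an A/G transition at the third position of a glycine codon that creates a Hae III site (recognition sequence GGCC) without changing the encoded amino acid, can be illustrated on a toy sequence. The 12-base sequence, the reading frame, and the helper names below are hypothetical; the actual bovine mitochondrial sequence is not given in the abstract.

```python
HAE_III_SITE = "GGCC"
# All four GGN codons encode glycine, so a third-position change is silent.
GLYCINE_CODONS = {"GGA", "GGC", "GGG", "GGT"}


def find_sites(seq, site=HAE_III_SITE):
    """Return 0-based start positions of every occurrence of `site`."""
    width = len(site)
    return [i for i in range(len(seq) - width + 1) if seq[i:i + width] == site]


def substitute(seq, position, new_base):
    """Return a copy of `seq` with the base at `position` replaced."""
    return seq[:position] + new_base + seq[position + 1:]


def codon_at(seq, position, frame_start=0):
    """Return the codon covering `position` in the frame starting at `frame_start`."""
    start = position - (position - frame_start) % 3
    return seq[start:start + 3]


# Hypothetical reading frame: ATG GGA CCT TAA
original = "ATGGGACCTTAA"
variant = substitute(original, 5, "G")  # A -> G at the third codon position

print(codon_at(original, 5), codon_at(variant, 5))  # both glycine codons
print(find_sites(original), find_sites(variant))    # site appears only in the variant
```

Because the substitution is silent at the protein level, the restriction pattern is the only readily observable difference between the two genotypes in this toy case.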
Novel human CRYGD rare variant in a Brazilian family with congenital cataract
Giordano, Gabriel Gorgone; Tavares, Anderson; da Silva, Márcio José; de Vasconcellos, José Paulo Cabral; Arieta, Carlos Eduardo Leite; de Melo, Mônica Barbosa
2011-01-01
Purpose To describe a novel polymorphism in the γD-crystallin (CRYGD) gene in a Brazilian family with congenital cataract. Methods A Brazilian four-generation family was analyzed. The proband had bilateral lamellar cataract and the phenotypes were classified by slit lamp examination. Genomic DNA was extracted from peripheral blood and coding regions and intron/exon boundaries of the αA-crystallin (CRYAA), γC-crystallin (CRYGC), and CRYGD genes were amplified by polymerase chain reaction and directly sequenced. Results Sequencing of the coding regions of CRYGD showed the presence of a heterozygous A→G transition at c.401 position, which results in the substitution of a tyrosine to a cysteine (Y134C). The polymorphism was identified in three individuals, two affected and one unaffected. Conclusions A novel rare variant in CRYGD (Y134C) was detected in a Brazilian family with congenital cataract. Because there is no segregation between the substitution and the phenotypes in this family, other genetic alterations are likely to be present. PMID:21866214
Miller, John J; Eackles, Michael S.; Stauffer, Jay R; King, Timothy L.
2015-01-01
We characterized variation within the mitochondrial genomes of the invasive silver carp (Hypophthalmichthys molitrix) and bighead carp (H. nobilis) from the Mississippi River drainage by mapping our Next-Generation sequences to their publicly available genomes. Variant detection resulted in 338 single-nucleotide polymorphisms for H. molitrix and 39 for H. nobilis. The much greater genetic variation in H. molitrix mitochondria relative to H. nobilis may be indicative of a greater North American female effective population size of the former. When variation was quantified by gene, many tRNA loci appear to have little or no variability based on our results whereas protein-coding regions were more frequently polymorphic. These results provide biologists with additional regions of DNA to be used as markers to study the invasion dynamics of these species.
Polymorphism in the Eruption Sequence of Primary Dentition: A Cross-sectional Study
Bhojraj, Nandlal; Narayanappa
2017-01-01
Introduction Primary teeth have shown wide variations in their eruption time among different populations. Population-specific eruption ages are provided as means with standard deviations or as median ages with their percentile ranges. These alone are insufficient for prediction of tooth eruption sequence because they provide no information on the frequency of sequence variation within pairs of teeth. Norms of polymorphic variation in the eruption sequence can be more useful. Aim This study aims at providing norms for the sequence polymorphism in primary teeth among the children of the Mysore population. Materials and Methods A cross-sectional study was designed with 1392 children, recruited from December 2015 to June 2016 by simple random sampling. Each tooth was recorded as present or absent. Across all possible intra-quadrant tooth pairs, cases of present-present, absent-absent, present-absent and absent-present were counted and computed as percentages. Results Sequence polymorphisms were more common in 82-84 pairs of teeth. Significant polymorphic reverse sequence was observed in 52-54 (9%), 82-84 (35%) in males and 82-84 (18%) in females. There was no polymorphism in the maxillary arch in females. Conclusion The present study provides baseline data values for sequence variation in primary teeth eruption. To the best of the investigators' knowledge, there are no previous studies describing the sequence polymorphism in primary teeth in an Indian population. The results of this study help in assessment of eruption sequence problems in paediatric dentistry and in evaluation and prediction of tooth eruption sequence in the individual child. PMID:28658912
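The tabulation described in the Materials and Methods, counting present-present, present-absent, absent-present and absent-absent cases for a tooth pair and converting them to percentages, can be sketched as follows. The data layout (one set of erupted tooth numbers per child), the function name, and the toy records are assumptions for illustration, not the study's data or software.

```python
from collections import Counter

STATES = ("present-present", "present-absent", "absent-present", "absent-absent")


def pair_state_percentages(records, tooth_a, tooth_b):
    """Percentage of children in each eruption state for one tooth pair.

    `records` is an iterable of sets, each holding the tooth numbers
    recorded as erupted in one child (hypothetical layout).
    """
    counts = Counter()
    for erupted in records:
        state_a = "present" if tooth_a in erupted else "absent"
        state_b = "present" if tooth_b in erupted else "absent"
        counts[f"{state_a}-{state_b}"] += 1
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no records supplied")
    return {state: 100.0 * counts[state] / total for state in STATES}


# Five toy children; tooth numbers follow the 82-84 pair named in the abstract.
toy_records = [{81, 82, 84}, {81, 82}, {81, 84}, {81}, {81, 82, 84}]
print(pair_state_percentages(toy_records, 82, 84))
```

Under the assumption that tooth 82 normally erupts before tooth 84, the absent-present percentage is the one that would correspond to a reverse sequence.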
Boldogköi, Zsolt
2004-09-01
Population genetics, the mathematical theory of modern evolutionary biology, defines evolution as the alteration over time of the frequency of distinct gene variants (alleles) differing in fitness. The major problem with this view is that in gene and protein sequences we can find little evidence concerning the molecular basis of phenotypic variance, especially those that would confer adaptive benefit to the bearers. Some novel data, however, suggest that a large amount of genetic variation exists in the regulatory region of genes within populations. In addition, comparison of homologous DNA sequences of various species shows that evolution appears to depend more strongly on gene expression than on the genes themselves. Furthermore, it has been demonstrated in several systems that genes form functional networks, whose products exhibit interrelated expression profiles. Finally, it has been found that regulatory circuits of development behave as evolutionary units. These data demonstrate that our view of evolution calls for a new synthesis. In this article I propose a novel concept, termed the selfish gene network hypothesis, which is based on an overall consideration of the above findings. The major statements of this hypothesis are as follows. (1) Instead of individual genes, gene networks (GNs) are responsible for the determination of traits and behaviors. (2) The primary source of microevolution is the intraspecific polymorphism in GNs and not the allelic variation in either the coding or the regulatory sequences of individual genes. (3) GN polymorphism is generated by the variation in the regulatory regions of the component genes and not by the variance in their coding sequences. (4) Evolution proceeds through continuous restructuring of the composition of GNs rather than fixing of specific alleles or GN variants.
Skoglund, Pontus; Höglund, Jacob
2010-04-23
Population variation in the degree of seasonal polymorphism is rare in birds, and the genetic basis of this phenomenon remains largely undescribed. Both sexes of Scandinavian and Scottish Willow grouse (Lagopus lagopus) display marked differences in their winter phenotypes, with Scottish grouse retaining a pigmented plumage year-round and Scandinavian Willow grouse molting to a white morph during winter. A widely studied pathway implicated in vertebrate pigmentation is the melanin system, for which functional variation has been characterised in many taxa. We sequenced coding regions from four genes involved in melanin pigmentation (DCT, MC1R, TYR and TYRP1), and an additional control involved in the melanocortin pathway (AGRP), to investigate the genetic basis of winter plumage in Lagopus. Despite the well documented role of the melanin system in animal coloration, we found no plumage-associated polymorphism or evidence for selection in a total of approximately 2.6 kb analysed sequence. Our results indicate that the genetic basis of alternating between pigmented and unpigmented seasonal phenotypes is more likely explained by regulatory changes controlling the expression of these or other loci in the physiological pathway leading to pigmentation.
Ben Ali, Sina-Elisabeth; Madi, Zita Erika; Hochegger, Rupert; Quist, David; Prewein, Bernhard; Haslberger, Alexander G; Brandes, Christian
2014-10-31
Genetic mutations must be avoided during the production and use of seeds. In the European Union (EU), Directive 2001/18/EC requires any DNA construct introduced via transformation to be stable. Establishing genetic stability is critical for the approval of genetically modified organisms (GMOs). In this study, genetic stability of two GMOs was examined using high resolution melting (HRM) analysis and real-time polymerase chain reaction (PCR) employing Scorpion primers for amplification. The genetic variability of the transgenic insert and that of the flanking regions in a single oilseed rape variety (GT73) and a stacked maize (MON88017×MON810) was studied. The GT73 and the 5' region of MON810 showed no instabilities in the examined regions. However, two out of 100 analyzed samples carried a heterozygous point mutation in the 3' region of MON810 in the stacked variety. These results were verified by direct sequencing of the amplified PCR products as well as by sequencing of cloned PCR fragments. The occurrence of the mutation suggests that the 5' region is more suitable than the 3' region for the quantification of MON810. The identification of the single nucleotide polymorphism (SNP) in a stacked event is in contrast to the results of earlier studies of the same MON810 region in a single event where no DNA polymorphism was found.
Porcine MYF6 gene: sequence, homology analysis, and variation in the promoter region.
Wyszyńska-Koko, J; Kurył, J
2004-01-01
The MYF6 gene encodes a bHLH transcription factor belonging to the MyoD family. Its expression accompanies the processes of differentiation and maturation of myotubes during embryogenesis and continues at a relatively high level after birth, affecting the muscle phenotype. The porcine MYF6 gene was amplified, sequenced, and compared with MYF6 gene sequences of other species. The amino acid sequence was deduced and an interspecies homology analysis was performed. Myf-6 protein shows high conservation among species, with 99 and 97% identity when comparing pig with cow and human, respectively, and 93% when comparing pig with mouse and rat. A single nucleotide polymorphism (SNP) was revealed within the promoter region; it proved to be a T --> C transition recognized by the MspI restriction enzyme.
Blaiotta, Giuseppe; Pepe, Olimpia; Mauriello, Gianluigi; Villani, Francesco; Andolfi, Rosamaria; Moschetti, Giancarlo
2002-12-01
The intergenic spacer region (ISR) between the 16S and 23S rRNA genes was tested as a tool for differentiating lactococci commonly isolated in a dairy environment. Seventeen reference strains, representing 11 different species belonging to the genera Lactococcus, Streptococcus, Lactobacillus, Enterococcus and Leuconostoc, and 127 wild streptococcal strains isolated during the whole fermentation process of "Fior di Latte" cheese were analyzed. After 16S-23S rDNA ISR amplification by PCR, species- or genus-specific patterns were obtained for most of the reference strains tested. Moreover, results obtained after nucleotide analysis show that the 16S-23S rDNA ISR sequences vary greatly, in size and sequence, among Lactococcus garvieae, Lactococcus raffinolactis, Lactococcus lactis as well as other streptococci from dairy environments. Because of the high degree of inter-specific polymorphism observed, the 16S-23S rDNA ISR can be considered a good potential target for selecting species-specific molecular assays, such as PCR primers or probes, for a rapid and extremely reliable differentiation of dairy lactococcal isolates.
Gao, J; He, Q; Hua, D; Mao, Y; Li, Y; Shen, L
2013-08-01
Capecitabine-containing chemotherapy is widely used in clinical practice. We investigated the association of the thymidylate synthase (TS), methylenetetrahydrofolate reductase (MTHFR), and dihydropyrimidine dehydrogenase (DPD) polymorphisms with the clinical outcome of Chinese advanced gastric cancer patients receiving first-line capecitabine plus paclitaxel. Blood samples were collected prior to treatment from 125 patients with advanced gastric cancer and the TS (two or three repeats of a 28 bp sequence in the 5'-untranslated region and a 6 bp insertion or deletion in the 3'-untranslated region), MTHFR (C677T) and DPD (IVS14+1G > A) polymorphisms were determined using PCR amplification and Sanger sequencing. The median age of the 125 patients (42 female, 83 male) was 58 years (range, 23-76); the response rate, median progression-free survival and overall survival (OS) were 43.2%, 5.2 months and 11.0 months, respectively. The median OS in patients with the TS ins6/ins6 genotype (6.8 months) was significantly shorter than that in patients with the ins6/del6 (11.0 months, P = 0.016) and del6/del6 (11.5 months, P = 0.039) genotypes. Cox multivariate analysis also showed that the TS ins6/ins6 genotype was an independent predictor of poor OS (P = 0.001, HR = 3.182). No significant associations were found between the TS 5'-UTR or MTHFR polymorphisms and clinical outcome, and no IVS14+1G > A polymorphism of DPD was found in this study. This is the first report that the TS 3'-UTR ins6/ins6 genotype may predict poor survival of advanced gastric cancer patients treated with capecitabine plus paclitaxel; this finding should be further verified in a large multicenter study.
Uncovering Local Trends in Genetic Effects of Multiple Phenotypes via Functional Linear Models.
Vsevolozhskaya, Olga A; Zaykin, Dmitri V; Barondess, David A; Tong, Xiaoren; Jadhav, Sneha; Lu, Qing
2016-04-01
Recent technological advances equipped researchers with capabilities that go beyond traditional genotyping of loci known to be polymorphic in a general population. Genetic sequences of study participants can now be assessed directly. This capability removed technology-driven bias toward scoring predominantly common polymorphisms and let researchers reveal a wealth of rare and sample-specific variants. Although the relative contributions of rare and common polymorphisms to trait variation are being debated, researchers are faced with the need for new statistical tools for simultaneous evaluation of all variants within a region. Several research groups demonstrated flexibility and good statistical power of the functional linear model approach. In this work we extend previous developments to allow inclusion of multiple traits and adjustment for additional covariates. Our functional approach is unique in that it provides a nuanced depiction of effects and interactions for the variables in the model by representing them as curves varying over a genetic region. We demonstrate flexibility and competitive power of our approach by contrasting its performance with commonly used statistical tools and illustrate its potential for discovery and characterization of genetic architecture of complex traits using sequencing data from the Dallas Heart Study. Published 2016. This article is a U.S. Government work and is in the public domain in the USA.
Development of genetic markers in abalone through construction of a SNP database.
Kang, J-H; Appleyard, S A; Elliott, N G; Jee, Y-J; Lee, J B; Kang, S W; Baek, M K; Han, Y S; Choi, T-J; Lee, Y S
2011-06-01
In the absence of a reference genome, single-nucleotide polymorphism (SNP) discovery in a group of abalone species was undertaken by random sequence assembly. A web-based interface was constructed, and 11 932 DNA sequences from the genus Haliotis were assembled, with 1321 contigs built. Of these, 118 contigs that consisted of at least ten annotation groups were selected. A total of 1577 putative SNPs were identified from the 118 contigs, with SNPs in several HSP70 gene contigs confirmed by PCR amplification of an 809-bp DNA fragment. SNPs in the HSP70 gene were compared across eight abalone species. A total of 129 polymorphic sites, including heterozygote sites within and among species, were observed. Phylogenetic analysis of the partial HSP70 gene region showed separation of the tested abalone into two groups, one reflecting the southern hemisphere species and the other the northern hemisphere species. Interestingly, Haliotis iris from New Zealand showed a closer relationship to species distributed in the northern Pacific region. Although HSP genes are known to be highly conserved among taxa, the validation of polymorphic SNPs from HSP70 in this mollusc demonstrates the applicability of cross-species SNP markers in abalone and the first step towards universal nuclear markers in Haliotis. © 2010 NFRDI, Animal Genetics © 2010 Stichting International Foundation for Animal Genetics.
Polymorphism and genetic mapping of the human oxytocin receptor gene on chromosome 3
DOE Office of Scientific and Technical Information (OSTI.GOV)
Michelini, S.; Urbanek, M.; Goldman, D.
1995-06-19
Centrally administered oxytocin has been reported to facilitate affiliative and social behaviors, in functional harmony with its well-known peripheral effects on uterine contraction and milk ejection. The biological effects of oxytocin could be perturbed by mutations occurring in the sequence of the oxytocin receptor gene, and it would be of interest to establish the position of this gene on the human linkage map. Therefore we identified a polymorphism at the human oxytocin receptor gene. A portion of the 3' untranslated region containing a 30 bp CA repeat was amplified by polymerase chain reaction (PCR), revealing a polymorphism with two alleles occurring with frequencies of 0.77 and 0.23 in a sample of Caucasian CEPH parents (n = 70). The CA repeat polymorphism we detected was used to map the human oxytocin receptor to chromosome 3p25-3p26, in a region which contains several important genes, including loci for Von Hippel-Lindau disease (VHL) and renal cell carcinoma. 53 refs., 2 figs., 1 tab.
A survey of copy number variation in the porcine genome detected from whole-genome sequence
USDA-ARS's Scientific Manuscript database
An important challenge to post-genomic biology is relating observed phenotypic variation to the underlying genotypic variation. Genome-wide association studies (GWAS) have made thousands of connections between single nucleotide polymorphisms (SNPs) and phenotypes, implicating regions of the genome t...
Trichoderma asperellum reconsidered: two cryptic species
USDA-ARS's Scientific Manuscript database
Analysis of a world-wide collection of strains of Trichoderma asperellum using multilocus genealogies of four genomic regions (tef1, rbp2, act, ITS1, 2, 5.8s), sequence polymorphism-derived (SPD) markers, matrix-assisted laser desorption/ionisation–time of flight mass spectrometry (MALDI-TOF MS) of ...
Alam, Nuhu; Shim, Mi Ja; Lee, Min Woong; Shin, Pyeong Gyun; Yoo, Young Bok; Lee, Tae Soo
2009-09-01
The molecular phylogeny of nine different commercially cultivated strains of Pleurotus nebrodensis was studied based on their internal transcribed spacer (ITS) region and RAPD. Sequencing of the ITS region of the selected strains revealed that the total length ranged from 592 to 614 bp. The size of the ITS1 and ITS2 regions varied among the strains from 219 to 228 bp and 211 to 229 bp, respectively. The sequence of ITS2 was more variable than that of ITS1, and the 5.8S sequences were identical. A phylogenetic tree of the ITS region sequences indicated that the selected strains were classified into five clusters. The reciprocal homologies of the ITS region sequences ranged from 99 to 100%. The strains were also analyzed by RAPD with 20 arbitrary primers. Twelve primers efficiently amplified the genomic DNA. The sizes of the polymorphic fragments obtained were in the range of 200 to 2000 bp. RAPD and ITS analysis techniques were able to detect genetic variation among the tested strains. Experimental results suggested that the IUM-1381, IUM-3914, IUM-1495 and AY-581431 strains were genetically very similar. Therefore, all IUM and NCBI gene bank strains of P. nebrodensis were genetically the same, with some variations.
Guo, Hongfang; Raza, Sayed Haidar Abbas; Schreurs, Nicola M; Khan, Rajwali; Wei, Dawei; Wang, Li; Zhang, Song; Zhang, Le; Wu, Sen; Ullah, Irfan; Hosseini, Seyed Mahdi; Zan, Linsen
2018-06-08
Krüppel-like factor 3 (KLF3), a member of the Krüppel-like factor (KLF) family, plays an important role in adipogenesis and lipid metabolism. The aim of this study was to investigate whether KLF3 could be used as a candidate gene in the breeding of cattle. The expression pattern of the bovine KLF3 gene revealed that it was highly expressed in abdominal fat and perirenal fat. Using DNA sequencing, three single nucleotide polymorphisms (SNPs) within the promoter region of the KLF3 gene were identified in 448 Qinchuan cattle; they are located in the recognition sequences of 11 transcription factors, and the four haplotypes represent four potentially different compositions of polymorphic potential cis-acting elements. Association analysis results indicated that individuals with the Hap7/7 diplotype showed higher (P < 0.05) intramuscular fat content (IFC) than those with H7/8. In addition, the H7 haplotype had much higher (P < 0.05) transcriptional activity than the H8 haplotype, consistent with the association analysis. We speculated that polymorphisms in transcription factor binding sites of the KLF3 promoter region affected transcriptional activity of KLF3, which subsequently influences intramuscular fat content in Qinchuan cattle, and that the KLF3 gene could be used as a molecular marker for fat deposition traits in early marker-assisted selection (MAS) in Qinchuan cattle breeding in the future. Copyright © 2017. Published by Elsevier B.V.
Feres, Juliana Massimino; Monteiro, Mariza; Zucchi, Maria I; Pinheiro, José B; Mestriner, Moacyr A; Alzate-Marin, Ana Lilia
2012-04-01
We developed and characterized nuclear microsatellite markers for Anadenanthera colubrina, a tropical tree species widely distributed in South America. Leaf samples of mature A. colubrina trees, popularly called "angico," were collected from an area that is greatly impacted by agricultural practices in the region of Ribeirão Preto in São Paulo State in southeastern Brazil. Twenty simple sequence repeat (SSR) markers were developed, 14 of which had polymorphic loci. A total of 96 alleles were detected with an average of 6.86 alleles per polymorphic locus. The expected heterozygosity, calculated at polymorphic loci, ranged from 0.18 to 0.83. Finally, we demonstrated that 18 loci were cross-amplified in A. peregrina. A total of 14 polymorphic markers suggest a high potential for genetic diversity, gene flow, and mating system analyses in A. colubrina.
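The expected heterozygosity reported per locus above is conventionally computed from allele frequencies as one minus the sum of squared frequencies. The abstract does not say which estimator the authors used, so the following is only a minimal sketch under that assumption; the function name, the `unbiased` flag, and the example counts are hypothetical and are not data from the study.

```python
def expected_heterozygosity(allele_counts, unbiased=False):
    """Expected heterozygosity (gene diversity) at one locus.

    allele_counts: number of sampled gene copies carrying each allele.
    unbiased: if True, apply the n / (n - 1) small-sample correction,
              where n is the total number of sampled gene copies.
    """
    n = sum(allele_counts)
    if n < 2:
        raise ValueError("need at least two sampled gene copies")
    # He = 1 - sum of squared allele frequencies
    he = 1.0 - sum((c / n) ** 2 for c in allele_counts)
    if unbiased:
        he *= n / (n - 1)
    return he


if __name__ == "__main__":
    # Hypothetical locus with three alleles observed 10, 6 and 4 times.
    print(round(expected_heterozygosity([10, 6, 4]), 3))
```

A locus with many alleles at similar frequencies gives a value near the upper end of the reported 0.18 to 0.83 range, while a locus dominated by one allele gives a value near the lower end.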
Vieira, Leila do Nascimento; Dos Anjos, Karina Goulart; Faoro, Helisson; Fraga, Hugo Pacheco de Freitas; Greco, Thiago Machado; Pedrosa, Fábio de Oliveira; de Souza, Emanuel Maltempi; Rogalski, Marcelo; de Souza, Robson Francisco; Guerra, Miguel Pedro
2016-05-01
Complete plastome sequencing is an efficient option for increasing phylogenetic resolution and for evolutionary studies, and may greatly facilitate the use of plastid DNA markers in plant population genetic studies. Merostachys and Guadua stand out as the most common bamboos indigenous to Brazil and those with the highest potential for utilization. Here, we sequenced the complete plastomes of the Brazilian Guadua chacoensis and Merostachys sp. to perform full plastome phylogeny and characterize the occurrence, type, and distribution of SSRs using 20 Bambuseae species. The determined plastome sequences of Merostachys sp. and G. chacoensis are 136,334 and 135,403 bp in size, respectively, with an identical gene content and typical quadripartite structure consisting of a pair of IRs separated by the LSC and SSC regions. The Maximum Likelihood and Bayesian Inference analyses produced phylogenomic trees identical in topology. These trees supported monophyly of the Paleotropical and Neotropical bamboo clades. The Neotropical bamboos segregated into three well-supported lineages, Chusqueinae, Guaduinae, and Arthrostylidiinae, with the last two forming a well-supported sister relationship. Paleotropical bamboos segregated into two well-supported lineages, Hickeliinae and Bambusinae + Melocanninae. We identified 141.8 cpSSRs in Bambuseae plastomes and a lower value (38.15) for plastome coding sequences. Among them, we identified 16 polymorphic SSR loci, with the number of alleles varying from 3 to 10. These 16 polymorphic cpSSR loci in the Bambuseae plastome can be assessed for the intraspecific level of polymorphism, leading to innovative, highly sensitive phylogeographic and population genetics studies for this tribe.
Pelle, Roger; Mwacharo, Joram M.; Njahira, Moses N.; Marcellino, Wani L.; Kiara, Henry; Malak, Agol K.; EL Hussein, Abdel Rahim M.; Bishop, Richard; Skilton, Robert A.
2017-01-01
East Coast fever (ECF), caused by Theileria parva infection, is a frequently fatal disease of cattle in eastern, central and southern Africa, and an emerging disease in South Sudan. Immunization using the infection and treatment method (ITM) is increasingly being used for control in countries affected by ECF, but not yet in South Sudan. It has been reported that CD8+ T-cell lymphocytes specific for parasitized cells play a central role in the immunity induced by ITM and a number of T. parva antigens recognized by parasite-specific CD8+ T-cells have been identified. In this study we determined the sequence diversity among two of these antigens, Tp1 and Tp2, which are under evaluation as candidates for inclusion in a sub-unit vaccine. T. parva samples (n = 81) obtained from cattle in four geographical regions of South Sudan were studied for sequence polymorphism in partial sequences of the Tp1 and Tp2 genes. Eight positions (1.97%) in Tp1 and 78 positions (15.48%) in Tp2 were shown to be polymorphic, giving rise to four and 14 antigen variants in Tp1 and Tp2, respectively. The overall nucleotide diversity in the Tp1 and Tp2 genes was π = 1.65% and π = 4.76%, respectively. The parasites were sampled from regions approximately 300 km apart, but there was limited evidence for genetic differentiation between populations. Analyses of the sequences revealed limited numbers of amino acid polymorphisms both overall and in residues within the mapped CD8+ T-cell epitopes. Although novel epitopes were identified in the samples from South Sudan, a large number of the samples harboured several epitopes in both antigens that were similar to those in the T. parva Muguga reference stock, which is a key component in the widely used live vaccine cocktail. PMID:28231338
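The two summary statistics quoted above, the proportion of polymorphic positions and the nucleotide diversity π, can both be read off a set of aligned sequences. The sketch below assumes the usual definition of π as the mean number of pairwise differences per site; the abstract does not describe the software or any gap handling the authors used, and the function names and toy alignment are hypothetical.

```python
from itertools import combinations


def polymorphic_fraction(seqs):
    """Fraction of alignment columns that contain more than one state."""
    length = len(seqs[0])
    if any(len(s) != length for s in seqs):
        raise ValueError("sequences must be aligned to equal length")
    variable = sum(1 for column in zip(*seqs) if len(set(column)) > 1)
    return variable / length


def nucleotide_diversity(seqs):
    """Mean pairwise differences per site (pi) over all sequence pairs."""
    if len(seqs) < 2:
        raise ValueError("need at least two sequences")
    length = len(seqs[0])
    if any(len(s) != length for s in seqs):
        raise ValueError("sequences must be aligned to equal length")
    pairs = list(combinations(seqs, 2))
    total_diffs = sum(
        sum(a != b for a, b in zip(s1, s2)) for s1, s2 in pairs
    )
    return total_diffs / (len(pairs) * length)


if __name__ == "__main__":
    # Toy alignment for illustration only, not Tp1 or Tp2 data.
    toy = ["ACGT", "ACGA", "ACGT"]
    print(polymorphic_fraction(toy), nucleotide_diversity(toy))
```

Because π weights each variable site by how often sequences actually differ there, it can be much smaller than the polymorphic fraction when most variants are rare, which is the pattern in the Tp2 figures above (15.48% polymorphic positions against π = 4.76%).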
Yang, Xiumin; Sugita, Takashi; Takashima, Masako; Hiruma, Masataro; Li, Ruoyu; Sudo, Hajime; Ogawa, Hideoki; Ikeda, Shigaku
2009-04-01
Trichophyton rubrum is the most common pathogen causing dermatophytosis worldwide. Recent genetic investigations showed that the microorganism originated in Africa and then spread to Europe and North America via Asia. We investigated the intraspecific diversity of T. rubrum isolated from two closely located Asian countries, Japan and China. A total of 150 clinical isolates of T. rubrum obtained from Japanese and Chinese patients were analyzed by randomly amplified polymorphic DNA (RAPD) and DNA sequence analysis of the non-transcribed spacer (NTS) region in the rRNA gene. RAPD analysis divided the 150 strains into two major clusters, A and B. Of the Japanese isolates, 30% belonged to cluster A and 70% belonged to cluster B, whereas 91% of the Chinese isolates were in cluster A. The NTS region of the rRNA gene was divided into four major groups (I-IV) based on DNA sequencing. The majority of Japanese isolates were type IV (51%), and the majority of Chinese isolates were type III (75%). These results suggest that although Japan and China are neighboring countries, the origins of T. rubrum isolates from these countries may not be identical. These findings provide information useful for tracing the global transmission routes of T. rubrum.
Phylogenetic Network for European mtDNA
Finnilä, Saara; Lehtonen, Mervi S.; Majamaa, Kari
2001-01-01
The sequence in the first hypervariable segment (HVS-I) of the control region has been used as a source of evolutionary information in most phylogenetic analyses of mtDNA. Population genetic inference would benefit from a better understanding of the variation in the mtDNA coding region, but, thus far, complete mtDNA sequences have been rare. We determined the nucleotide sequence in the coding region of mtDNA from 121 Finns, by conformation-sensitive gel electrophoresis and subsequent sequencing and by direct sequencing of the D loop. Furthermore, 71 sequences from our previous reports were included, so that the samples represented all the mtDNA haplogroups present in the Finnish population. We found a total of 297 variable sites in the coding region, which allowed the compilation of unambiguous phylogenetic networks. The D loop harbored 104 variable sites, and, in most cases, these could be localized within the coding-region networks, without discrepancies. Interestingly, many homoplasies were detected in the coding region. Nucleotide variation in the rRNA and tRNA genes was 6%, and that in the third nucleotide positions of structural genes amounted to 22% of that in the HVS-I. The complete networks enabled the relationships between the mtDNA haplogroups to be analyzed. Phylogenetic networks based on the entire coding-region sequence in mtDNA provide a rich source for further population genetic studies, and complete sequences make it easier to differentiate between disease-causing mutations and rare polymorphisms. PMID:11349229
Miller, Marcia M.; Taylor, Robert L.
2016-01-01
Nearly all genes presently mapped to chicken chromosome 16 (GGA 16) have either a demonstrated role in immune responses or are considered to serve in immunity by reason of sequence homology with immune system genes defined in other species. The genes are best described in regional units. Among these, the best known is the polymorphic major histocompatibility complex-B (MHC-B) region containing genes for classical peptide antigen presentation. Near MHC-B is a small region containing two CD1 genes, which encode molecules known to bind lipid antigens and which will likely be found in chickens to present lipids to specialized T cells, as occurs with CD1 molecules in other species. Another region is the MHC-Y region, separated from MHC-B by an intervening region of tandem repeats. Like MHC-B, MHC-Y is polymorphic. It contains specialized class I and class II genes and c-type lectin-like genes. Yet another region, separated from MHC-Y by the single nucleolar organizing region (NOR) in the chicken genome, contains olfactory receptor genes and scavenger receptor genes, which are also thought to contribute to immunity. The structure, distribution, linkages and patterns of polymorphism in these regions suggest that GGA 16 evolves as a microchromosome devoted to immune defense. Many GGA 16 genes are polymorphic and polygenic. At the moment most disease associations are at the haplotype level. Roles of individual MHC genes in disease resistance are documented in only a very few instances. Provided suitable experimental stocks persist, the availability of increasingly detailed maps of GGA 16 genes combined with new means for detecting genetic variability will lead to investigations defining the contributions of individual loci and more applications for immunogenetics in breeding healthy poultry. PMID:26740135
Daware, Anurag; Das, Sweta; Srivastava, Rishi; Badoni, Saurabh; Singh, Ashok K.; Agarwal, Pinky; Parida, Swarup K.; Tyagi, Akhilesh K.
2016-01-01
Development and use of genome-wide informative simple sequence repeat (SSR) markers and novel integrated genomic strategies are vital to drive genomics-assisted breeding applications and for efficient dissection of quantitative trait loci (QTLs) underlying complex traits in rice. The present study developed 6244 genome-wide informative SSR markers exhibiting in silico fragment length polymorphism based on repeat-unit variations among genomic sequences of 11 indica, japonica, aus, and wild rice accessions. These markers were mapped on diverse coding and non-coding sequence components of known cloned/candidate genes annotated from 12 chromosomes and revealed a much higher amplification (97%) and polymorphic potential (88%) along with wider genetic/functional diversity level (16–74% with a mean 53%) especially among accessions belonging to indica cultivar group, suggesting their utility in large-scale genomics-assisted breeding applications in rice. A high-density 3791 SSR markers-anchored genetic linkage map (IR 64 × Sonasal) spanning 2060 cM total map-length with an average inter-marker distance of 0.54 cM was generated. This reference genetic map identified six major genomic regions harboring robust QTLs (31% combined phenotypic variation explained with a 5.7–8.7 LOD) governing grain weight on six rice chromosomes. One strong grain weight major QTL region (OsqGW5.1) was narrowed-down by integrating traditional QTL mapping with high-resolution QTL region-specific integrated SSR and single nucleotide polymorphism markers-based QTL-seq analysis and differential expression profiling. This led us to delineate two natural allelic variants in two known cis-regulatory elements (RAV1AAT and CARGCW8GAT) of glycosyl hydrolase and serine carboxypeptidase genes exhibiting pronounced seed-specific differential regulation in low (Sonasal) and high (IR 64) grain weight mapping parental accessions. 
Our genome-wide SSR marker resource (polymorphic within/between diverse cultivar groups) and integrated genomic strategy can efficiently scan functionally relevant potential molecular tags (markers, candidate genes and alleles) regulating complex agronomic traits (grain weight) and expedite marker-assisted genetic enhancement in rice. PMID:27833617
2013-01-01
Background The origins and dispersal of Plasmodium vivax to its current worldwide distribution remains controversial. Although progress on P. vivax genetics and genomics has been achieved worldwide, information concerning New World parasites remains fragmented and largely incomplete. More information on the genetic diversity in Latin America (LA) is needed to better explain current patterns of parasite dispersion and evolution. Methods Plasmodium vivax circumsporozoite protein gene polymorphism was investigated using polymerase chain reaction amplification and restriction fragment length polymorphism (PCR-RFLP), and Sanger sequencing in isolates from the Pacific Ocean coast of Mexico, Nicaragua, and Peru. In conjunction with worldwide sequences retrieved from the Genbank, mismatch distribution analysis of central repeat region (CRR), frequency estimation of unique repeat types and phylogenetic analysis of the 3′ terminal region, were performed to obtain an integrative view of the genetic relationships between regional and worldwide isolates. Results Four RFLP subtypes, vk210a, b, c and d were identified in Southern Mexico and three subtypes vk210a, e and f in Nicaragua. The nucleotide sequences showed that Mexican vk210a and all Nicaraguan isolates were similar to other American parasites. In contrast, vk210b, c and d were less frequent, had a domain ANKKAEDA in their carboxyl end and clustered with Asian isolates. All vk247 isolates from Mexico and Peru had identical RFLP pattern. Their nucleotide sequences showed two copies of GGQAAGGNAANKKAGDAGA at the carboxyl end. Differences in mismatch distribution parameters of the CRR separate vk247 from most vk210 isolates. While vk247 isolates display a homogeneous pattern with no geographical clustering, vk210 isolates display a heterogeneous geographically clustered pattern which clearly separates LA from non-American isolates, except vk210b, c and d from Southern Mexico. 
Conclusions The presence of vk210a in Mexico and vk210e, f and g in Nicaragua is consistent with other previously reported LA isolates and reflects their circulation throughout the continent. The vk210b, c and d are novel genotypes in LA. Their genetic relationships and the low variability within these vk210 and/or within the vk247 parasites in Southern Mexico suggest their recent introduction and/or recent expansion to this region. The global analysis of P. vivax csp suggests that this parasite was introduced to the region, and likely to LA, by different independent events. PMID:23855807
Guirao-Rico, Sara; Sánchez-Gracia, Alejandro; Charlesworth, Deborah
2017-03-01
DNA sequence diversity in genes in the partially sex-linked pseudoautosomal region (PAR) of the sex chromosomes of the plant Silene latifolia is higher than expected from within-species diversity of other genes. This could be the footprint of sexually antagonistic (SA) alleles that are maintained by balancing selection in a PAR gene (or genes) and affect polymorphism in linked genome regions. SA selection is predicted to occur during sex chromosome evolution, but it is important to test whether the unexpectedly high sequence polymorphism could be explained without it, purely by the combined effects of partial linkage with the sex-determining region and the population's demographic history, including possible introgression from Silene dioica. To test this, we applied approximate Bayesian computation-based model choice to autosomal sequence diversity data, to find the most plausible scenario for the recent history of S. latifolia and then to estimate the posterior density of the most relevant parameters. We then used these densities to simulate variation to be expected at PAR genes. We conclude that an excess of variants at high frequencies at PAR genes should arise in S. latifolia populations only for genes with strong associations with fully sex-linked genes, which requires closer linkage with the fully sex-linked region than that estimated for the PAR genes where apparent deviations from neutrality were observed. These results support the need to invoke selection to explain the S. latifolia PAR gene diversity, and encourage further work to test the possibility of balancing selection due to sexual antagonism. © 2016 John Wiley & Sons Ltd.
Reed, Kent M.; Dorschner, Michael O.; Todd, Thomas N.; Phillips, Ruth B.
1998-01-01
Sequence variation in the control region (D-loop) of the mitochondrial DNA (mtDNA) was examined to assess the genetic distinctiveness of the shortjaw cisco (Coregonus zenithicus). Individuals from within the Great Lakes Basin as well as inland lakes outside the basin were sampled. DNA fragments containing the entire D-loop were amplified by PCR from specimens of C. zenithicus and the related species C. artedi, C. hoyi, C. kiyi, and C. clupeaformis. DNA sequence analysis revealed high similarity within and among species and shared polymorphism for length variants. Based on this analysis, the shortjaw cisco is not genetically distinct from other cisco species.
Development and validation of new SSR markers from expressed regions in the garlic genome
USDA-ARS's Scientific Manuscript database
Only a limited number of simple sequence repeat (SSR) markers are available for the genome of garlic (Allium sativum L.), although SSR markers have become one of the most preferred marker systems because they are typically co-dominant, reproducible, transferable across species and highly polymorphic. In this ...
2010-01-01
Background Unlike in tomato, little is known about the genetic and molecular control of fleshy fruit development of perennial fruit trees like grapevine (Vitis vinifera L.). Here we present the study of the sequence polymorphism in a 1 Mb grapevine genome region at the top of chromosome 18 carrying the fleshless berry mutation (flb) in order, first to identify SNP markers closely linked to the gene and second to search for possible signatures of domestication. Results In total, 62 regions (17 SSR, 3 SNP, 1 CAPS and 41 re-sequenced gene fragments) were scanned for polymorphism along a 3.4 Mb interval (85,127-3,506,060 bp) at the top of chromosome 18, in both V. vinifera cv. Chardonnay and a genotype carrying the flb mutation, V. vinifera cv. Ugni Blanc mutant. A nearly complete homozygosity in Ugni Blanc (wild and mutant forms) and an expected high level of heterozygosity in Chardonnay were revealed. Experiments using qPCR and BAC FISH confirmed the observed homozygosity. Under the assumption that flb could be one of the genes involved in the domestication syndrome of grapevine, we sequenced 69 gene fragments, spread over the flb region, representing 48,874 bp in a highly diverse set of cultivated and wild V. vinifera genotypes, to identify possible signatures of domestication in the cultivated V. vinifera compartment. We identified eight gene fragments presenting a significant deviation from neutrality of the Tajima's D parameter in the cultivated pool. One of these also showed higher nucleotide diversity in the wild compartments than in the cultivated compartments. In addition, SNPs significantly associated with berry weight variation were identified in the flb region. Conclusions We observed the occurrence of a large homozygous region in a non-repetitive region of the otherwise highly heterozygous grapevine genome and propose a hypothesis for its formation. 
We demonstrated the feasibility of applying BAC FISH to the very small grapevine chromosomes and provided a specific probe for the identification of chromosome 18 on a cytogenetic map. We found genes showing putative signatures of selection and SNPs significantly associated with berry weight variation in the flb region. In addition, we provided to the community 554 SNPs at the top of chromosome 18 for the development of a genotyping chip for future fine mapping of the flb gene in an F2 population when available. PMID:21176183
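The neutrality test mentioned in the grapevine abstract above, Tajima's D, compares the mean number of pairwise differences with the number of segregating sites scaled by sample size. The following is a minimal sketch of the standard formula applied to an ungapped alignment; it is not the authors' pipeline, it does not compute significance, and the function name and toy sequences are hypothetical.

```python
from itertools import combinations
from math import sqrt


def tajimas_d(seqs):
    """Tajima's D for a list of aligned, equal-length sequences."""
    n = len(seqs)
    if n < 4:
        raise ValueError("need at least four sequences")
    if any(len(s) != len(seqs[0]) for s in seqs):
        raise ValueError("sequences must be aligned to equal length")
    # S: number of segregating (variable) sites
    s = sum(1 for column in zip(*seqs) if len(set(column)) > 1)
    if s == 0:
        raise ValueError("no segregating sites; D is undefined")
    # k: mean number of pairwise differences (not per site)
    pairs = list(combinations(seqs, 2))
    k = sum(sum(a != b for a, b in zip(x, y)) for x, y in pairs) / len(pairs)
    # Constants that depend only on the sample size n
    a1 = sum(1.0 / i for i in range(1, n))
    a2 = sum(1.0 / i ** 2 for i in range(1, n))
    b1 = (n + 1) / (3.0 * (n - 1))
    b2 = 2.0 * (n * n + n + 3) / (9.0 * n * (n - 1))
    c1 = b1 - 1.0 / a1
    c2 = b2 - (n + 2) / (a1 * n) + a2 / a1 ** 2
    e1 = c1 / a1
    e2 = c2 / (a1 ** 2 + a2)
    return (k - s / a1) / sqrt(e1 * s + e2 * s * (s - 1))


if __name__ == "__main__":
    # Toy alignment in which every variant is a singleton.
    print(tajimas_d(["AAAA", "AAAT", "AATA", "ATAA"]))
```

An excess of rare variants, as in the toy alignment, pushes D below zero, whereas variants at intermediate frequency push it above zero; a negative D in a cultivated pool is one pattern that can accompany a selective sweep, although demographic changes can produce it as well.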
Genetic variation of the porcine NR5A1 is associated with meat color.
Görres, Andreas; Ponsuksili, Siriluck; Wimmers, Klaus; Muráni, Eduard
2016-02-01
Because of the central role of Steroidogenic factor 1 in the regulation of the development and function of steroidogenic tissues, including the adrenal gland, we chose the encoding gene NR5A1 as a candidate for stress response, meat quality and carcass composition in the domestic pig. To identify polymorphisms of the porcine NR5A1 we comparatively sequenced the coding, untranslated and regulatory regions in four commercial pig lines. Single nucleotide polymorphisms could be found in the 3' UTR and in an intronic enhancer, whereas no polymorphisms were detected in the proximal promoter and coding region. A subset of the detected polymorphisms was genotyped in Piétrain x (German Large White x German Landrace) and German Landrace pigs. For the same animals, carcass composition traits, meat quality characteristics and parameters of adrenal function were recorded. Associations with meat color were found for two of the discovered SNPs in Piétrain x (German Large White x German Landrace) and German Landrace pigs but no connections to parameters of adrenal function could be established. We conclude that NR5A1 variations influence meat color in a hypothalamus-pituitary-adrenal axis independent manner and that further regulatory regions need to be analyzed for genetic variations to understand the discovered effects.
Iacoboni, Paola A; Hasenauer, Flavia C; Caffaro, M Eugenia; Gaido, Analia; Rossetto, Cristina; Neumann, Roberto D; Salatin, Antonio; Bertoni, Emiliano; Poli, Mario A; Rossetti, Carlos A
2014-08-15
Goats are susceptible to brucellosis and the detection of Brucella-infected animals is carried out by serological tests. In other ruminant species, polymorphisms in microsatellites (Ms) of the 3' untranslated region (3'UTR) of the solute carrier family 11 member A1 (SLC11A1) gene were associated with resistance to Brucella abortus infection. Goats present two polymorphic Ms at the 3'UTR end of the SLC11A1 gene, called regions A and B. Here, we evaluated whether polymorphisms in regions A and/or B are associated with Brucella infection in goats. Serum (for the detection of Brucella-specific antibodies) and hair samples (for DNA isolation and structure analysis of the SLC11A1 gene) were randomly collected from 229 adult native goats from the northwest of Argentina. Serological status was evaluated by buffer plate antigen test (BPAT) complemented by the fluorescent polarization assay (FPA), and the genotype of the 3'UTR of the SLC11A1 gene was determined by capillary electrophoresis and confirmed by sequence analysis. Polymorphisms in regions A and B of the 3'UTR of the SLC11A1 gene were found to be significantly associated with protection against Brucella infection. Specifically, the association study indicates a statistically significant association of the A15 allele and the B7/B7 genotype with the absence of Brucella-specific antibodies (p=0.0003 and 0.0088, respectively). These data open a promising opportunity for limiting goat brucellosis through selective breeding of animals based on genetic markers associated with natural resistance to B. melitensis infection. Copyright © 2014 Elsevier B.V. All rights reserved.
Development of Pineapple Microsatellite Markers and Germplasm Genetic Diversity Analysis
Tong, Helin; Chen, You; Wang, Jingyi; Chen, Yeyuan; Sun, Guangming; He, Junhu; Wu, Yaoting
2013-01-01
Two methods were used to develop pineapple microsatellite markers. Genomic library-based SSR development: using a selectively amplified microsatellite assay, 86 sequences were generated from a pineapple genomic library. 91 (96.8%) of the 94 Simple Sequence Repeat (SSR) loci were dinucleotide repeats (39 AC/GT repeats and 52 GA/TC repeats, accounting for 42.9% and 57.1%, resp.), and the other three were mononucleotide repeats. Thirty-six pairs of SSR primers were designed; 24 of them generated clear bands of expected sizes, and 13 of them showed polymorphism. EST-based SSR development: 5659 pineapple EST sequences obtained from NCBI were analyzed; among 1397 nonredundant EST sequences, 843 were found to contain 1110 SSR loci (217 of them contained more than one SSR locus). The frequency of SSRs in pineapple EST sequences is 1 SSR per 3.73 kb, and 44 types were found. Mononucleotide, dinucleotide, and trinucleotide repeats dominate, accounting for 95.6% in total. AG/CT and AGC/GCT were the dominant types of dinucleotide and trinucleotide repeats, accounting for 83.5% and 24.1%, respectively. Thirty pairs of primers were designed, one for each of 30 randomly selected sequences; 26 of them generated clear and reproducible bands, and 22 of them showed polymorphism. Eighteen polymorphic primer pairs obtained by one or the other of the two methods were selected for germplasm genetic diversity analysis of 48 breeds of pineapple; similarity coefficients of these breeds were between 0.59 and 1.00, and the breeds can be divided into four groups accordingly. Amplification products of five SSR markers were extracted and sequenced; the corresponding repeat loci were found, and locus mutations are mainly in the copy number of repeats and base mutations in the flanking region. PMID:24024187
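The EST-based screen described in the pineapple abstract amounts to scanning sequences for short tandem repeats. As a rough illustration only, a perfect-repeat scan can be sketched with a regular expression; the function name `find_ssrs`, the threshold of five copies, and the restriction to units of one to three bases are assumptions, since the abstract does not state the search criteria that were used.

```python
import re


def find_ssrs(seq, min_repeats=5, max_unit=3):
    """Find perfect tandem repeats (hypothetical helper, assumed thresholds).

    Returns a list of (start, unit, copies) tuples, where start is the
    0-based position of the repeat tract, unit is the repeated motif and
    copies is the number of consecutive copies.
    """
    # The lazy quantifier tries the shortest unit first, so a run of "A"
    # is reported as a mononucleotide repeat rather than as "AA" or "AAA".
    pattern = re.compile(
        r"([ACGT]{1,%d}?)\1{%d,}" % (max_unit, min_repeats - 1)
    )
    hits = []
    for m in pattern.finditer(seq.upper()):
        unit = m.group(1)
        hits.append((m.start(), unit, len(m.group(0)) // len(unit)))
    return hits
```

Dividing the total sequence length by the number of hits gives a density in the same form as the "1 SSR per 3.73 kb" figure quoted above, although the value obtained depends strongly on the chosen thresholds.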
Boutte, Julien; Aliaga, Benoît; Lima, Oscar; Ferreira de Carvalho, Julie; Ainouche, Abdelkader; Macas, Jiri; Rousseau-Gueutin, Mathieu; Coriton, Olivier; Ainouche, Malika; Salmon, Armel
2015-01-01
Gene and whole-genome duplications are widespread in plant nuclear genomes, resulting in sequence heterogeneity. Identification of duplicated genes may be particularly challenging in highly redundant genomes, especially when there are no diploid parents as a reference. Here, we developed a pipeline to detect the different copies in the ribosomal RNA gene family in the hexaploid grass Spartina maritima from next-generation sequencing (Roche-454) reads. The heterogeneity of the different domains of the highly repeated 45S unit was explored by identifying single nucleotide polymorphisms (SNPs) and assembling reads based on shared polymorphisms. SNPs were validated using comparisons with Illumina sequence data sets and by cloning and Sanger (re)sequencing. Using this approach, 29 validated polymorphisms and 11 validated haplotypes were reported (out of 34 and 20, respectively, that were initially predicted by our program). The rDNA domains of S. maritima have similar lengths as those found in other Poaceae, apart from the 5′-ETS, which is approximately two-times longer in S. maritima. Sequence homogeneity was encountered in coding regions and both internal transcribed spacers (ITS), whereas high intragenomic variability was detected in the intergenic spacer (IGS) and the external transcribed spacer (ETS). Molecular cytogenetic analysis by fluorescent in situ hybridization (FISH) revealed the presence of one pair of 45S rDNA signals on the chromosomes of S. maritima instead of three expected pairs for a hexaploid genome, indicating loss of duplicated homeologous loci through the diploidization process. The procedure developed here may be used at any ploidy level and using different sequencing technologies. PMID:26530424
Lorenzetti, Mario Alejandro; Gantuz, Magdalena; Altcheh, Jaime; De Matteo, Elena; Chabay, Paola Andrea; Preciado, María Victoria
2012-03-01
The ubiquitous Epstein-Barr virus (EBV) is related to the development of lymphoma and is also the etiological agent for infectious mononucleosis (IM). Sequence variations in the gene encoding LMP1 have been deeply studied in different pathologies and geographic regions. Controversial results propose the existence of tumor-related variants, while others argued in favor of a geographical distribution of these variants. Reports assessing EBV variants in IM were performed in adult patients who displayed multiple variant infections. In the present study, LMP1 variants in 15 pediatric patients with IM and 20 pediatric patients with EBV-associated lymphomas from Argentina were analyzed as representatives of benign and malignant infections in children, respectively. A 3-month follow-up study of LMP1 variants in peripheral blood cells and in oral secretions of patients with IM was performed. Moreover, an integrated linkage analysis was performed with variants of EBNA1 and the promoter region of BZLF1. Similar sequence polymorphisms were detected in both pathological conditions, IM and lymphoma, but these differ from those previously described in healthy donors from Argentina and Brazil. The results suggest that certain LMP1 polymorphisms, namely, the 30-bp deletion and high copy number of the 33-bp repeats, are associated with EBV-related pathologies, either benign or malignant, instead of just being tumor related. Additionally, this is the first study to describe the Alaskan variant in EBV-related lymphomas that previously was restricted to nasopharyngeal carcinomas from North America. PMID:22205789
Lin, Chung-Jian; Huang, Chi-Chung; Huang, Chao-Ching; Chiang, Yu-Chung; Chiang, Tzen-Yuh
2012-01-01
Background Pinus massoniana, an ecologically and economically important conifer, is widespread across central and southern mainland China and Taiwan. In this study, we tested the central–marginal paradigm that predicts that the marginal populations tend to be less polymorphic than the central ones in their genetic composition, and examined a founders' effect in the island population. Methodology/Principal Findings We examined the phylogeography and population structuring of the P. massoniana based on nucleotide sequences of cpDNA atpB-rbcL intergenic spacer, intron regions of the AdhC2 locus, and microsatellite fingerprints. SAMOVA analysis of nucleotide sequences indicated that most genetic variants resided among geographical regions. High levels of genetic diversity in the marginal populations in the south region, a pattern seemingly contradicting the central–marginal paradigm, and the fixation of private haplotypes in most populations indicate that multiple refugia may have existed over the glacial maxima. STRUCTURE analyses on microsatellites revealed that genetic structure of mainland populations was mediated with recent genetic exchanges mostly via pollen flow, and that the genetic composition in east region was intermixed between south and west regions, a pattern likely shaped by gene introgression and maintenance of ancestral polymorphisms. As expected, the small island population in Taiwan was genetically differentiated from mainland populations. Conclusions/Significance The marginal populations in south region possessed divergent gene pools, suggesting that the past glaciations might have low impacts on these populations at low latitudes. Estimates of ancestral population sizes interestingly reflect a recent expansion in mainland from a rather smaller population, a pattern that seemingly agrees with the pollen record. PMID:22952747
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ozaki, N.; Lappalainen, J.; Linnoila, M.
Serotonin 1D (5-HT1D) receptors are 5-HT release-regulating autoreceptors in the human brain. Abnormalities in brain 5-HT function have been hypothesized in the pathophysiology of various psychiatric disorders, including obsessive-compulsive disorder, autism, mood disorders, eating disorders, impulsive violent behavior, and alcoholism. Thus, mutations occurring in 5-HT autoreceptors may cause or increase the vulnerability to any of these conditions. 5-HT1Dα and 5-HT1Dβ subtypes have been previously localized to chromosomes 1p36.3-p34.3 and 6q13, respectively, using rodent-human hybrids and in situ localization. In this communication, we report the detection of a 5-HT1Dα receptor gene polymorphism by single strand conformation polymorphism (SSCP) analysis of the coding sequence. The polymorphism was used for fine scale linkage mapping of 5-HT1Dα on chromosome 1. This polymorphism should also be useful for linkage studies in populations and in families. Our analysis also demonstrates that functionally significant coding sequence variants of the 5-HT1Dα are probably not abundant either among alcoholics or in the general population. 14 refs., 1 fig., 1 tab.
Santini, A C; Magalhães, J T; Cascardo, J C M; Corrêa, R X
2016-04-28
Chromobacterium violaceum is a free-living Gram-negative bacillus usually found in the water and soil in tropical regions, which causes infections in humans. Chromobacteriosis is characterized by rapid dissemination and high mortality. The aim of this study was to detect the genetic variability among C. violaceum type strain ATCC 12472, seven isolates from the environment, and one isolate from a pulmonary secretion of a chromobacteriosis patient from Ilhéus, Bahia. The molecular characterization of all samples was performed by polymerase chain reaction (PCR) sequencing and 16S rDNA analysis. Primers specific for two ATCC 12472 pathogenicity genes, hilA and yscD, as well as random amplified polymorphic DNA (RAPD), were used for PCR amplification and comparative sequencing of the products. For a more specific approach, the PCR products of 16S rDNA were digested with restriction enzymes. Seven of the samples, including type strain ATCC 12472, were amplified by the hilA primers; these were subsequently sequenced. Gene yscD was amplified only in type strain ATCC 12472. MspI and AluI digestion revealed 16S rDNA polymorphisms. These data allowed the generation of a dendrogram for each analysis. The isolates of C. violaceum have variability in random genomic regions, as demonstrated by RAPD. These isolates also have variability in pathogenicity genes, as demonstrated by sequencing and restriction enzyme digestion.
Paris, Margot; Marcombe, Sebastien; Coissac, Eric; Corbel, Vincent; David, Jean-Philippe; Després, Laurence
2013-01-01
Mosquito control is often the main method used to reduce mosquito-transmitted diseases. In order to investigate the genetic basis of resistance to the bio-insecticide Bacillus thuringiensis subsp. israelensis (Bti), we used information on polymorphism obtained from cDNA tag sequences from pooled larvae of laboratory Bti-resistant and susceptible Aedes aegypti mosquito strains to identify and analyse 1520 single nucleotide polymorphisms (SNPs). Of the 372 SNPs tested, 99.2% were validated using DNA Illumina GoldenGate® array, with a strong correlation between the allelic frequencies inferred from the pooled and individual data (r = 0.85). A total of 11 genomic regions and five candidate genes were detected using a genome scan approach. One of these candidate genes showed significant departures from neutrality in the resistant strain at sequence level. Six natural populations from Martinique Island were sequenced for the 372 tested SNPs with a high transferability (87%), and association mapping analyses detected 14 loci associated with Bti resistance, including one located in a putative receptor for Cry11 toxins. Three of these loci were also significantly differentiated between the laboratory strains, suggesting that most of the genes associated with resistance might differ between the two environments. It also suggests that common selected regions might harbour key genes for Bti resistance. PMID:24187584
Wheat CBF gene family: identification of polymorphisms in the CBF coding sequence.
Mohseni, Sara; Che, Hua; Djillali, Zakia; Dumont, Estelle; Nankeu, Joseph; Danyluk, Jean
2012-12-01
Expression of cold-regulated genes needed for protection against freezing stress is mediated, in part, by the CBF transcription factor family. Previous studies with temperate cereals suggested that the CBF gene family in wheat was large, and that CBF genes were at the base of an important low temperature tolerance trait. Therefore, the goal of our study was to identify the CBF repertoire in the freezing-tolerant hexaploid wheat cultivar Norstar, and then to examine if the coding region of CBF genes in two spring cultivars contain polymorphisms that could affect the protein sequence and structure. Our analyses reveal that hexaploid wheat contains a complex CBF family consisting of at least 65 CBF genes of which 60 are known to be expressed in the cultivar Norstar. They represent 27 paralogous genes with 1-3 homeologous copies for the A, B, and D genomes. The cultivar Norstar contains two pseudogenes and at least 24 additional proteins having sequences and (or) structures that deviate from the consensus in the conserved AP2 DNA-binding and (or) C-terminal activation-domains. This suggests that in cultivars such as Norstar, low temperature tolerance may be increased through breeding of additional optimal alleles. The examination of the CBF repertoire present in the two spring cultivars, Chinese Spring and Manitou, reveals that they have additional polymorphisms affecting conserved positions in these domains. Understanding the effects of these polymorphisms will provide additional information for the selection of optimum CBF alleles in Triticeae breeding programs.
Polymorphism at codon 36 of the p53 gene.
Felix, C A; Brown, D L; Mitsudomi, T; Ikagaki, N; Wong, A; Wasserman, R; Womer, R B; Biegel, J A
1994-01-01
A polymorphism at codon 36 in exon 4 of the p53 gene was identified by single strand conformation polymorphism (SSCP) analysis and direct sequencing of genomic DNA PCR products. The polymorphic allele, present in the heterozygous state in genomic DNAs of four of 100 individuals (4%), changes the codon 36 CCG to CCA, eliminates a FinI restriction site and creates a BccI site. Including this polymorphism there are four known polymorphisms in the p53 coding sequence.
Macé, Matthias; Crouau-Roy, Brigitte
2008-01-01
Background The early radiation of the Cetartiodactyla is complex, and unambiguous molecular characters are needed to clarify the positions of hippopotamuses, camels and pigs relative to the remaining taxa (Cetacea and Ruminantia). There is also a need for informative genealogic markers for Y-chromosome population genetics as well as a sexing method applicable to all species from this group. We therefore studied the sequence variation of a partial sequence of the evolutionarily conserved amelogenin gene to assess its potential use in each of these fields. Results and discussion We report a large interstitial insertion in the Y amelogenin locus in most of the Cetartiodactyla lineages (cetaceans and ruminants). This sex-linked size polymorphism is the result of a 460–465 bp inserted element in intron 4 of the amelogenin gene of ruminants and cetaceans. Therefore, this polymorphism can easily be used in a sexing assay for these species. When taking into account this shared character in addition to nucleotide sequence, gene genealogy follows sex-chromosome divergence in Cetartiodactyla, whereas it is more congruent with zoological history when ignoring these characters. This could be related to a loss of homology between chromosomal copies given the old age of the insertion. The 1 kbp Amel-Y amplified fragment is also characterized by high nucleotide diversity (64 polymorphic sites spanning over 1 kbp in seven haplotypes), which is greater than for other Y-chromosome sequence markers studied so far but less than the mitochondrial control region. Conclusion The gender-dependent polymorphism we have identified is relevant not only for phylogenetic inference within the Cetartiodactyla but also for Y-chromosome based population genetics and gender determination in cetaceans and ruminants. One single protocol can therefore be used for studies in population and evolutionary genetics, reproductive biotechnologies, and forensic science. PMID:18925953
Tange, N; Jong-Young, L; Mikawa, N; Hirono, I; Aoki, T
1997-12-01
A cDNA clone of rainbow trout (Oncorhynchus mykiss) transferrin was obtained from a liver cDNA library. The 2537-bp cDNA sequence contained an open reading frame encoding 691 amino acids and the 5' and 3' noncoding regions. The amino acid sequences at the iron-binding sites and the two N-linked glycosylation sites, and the cysteine residues were consistent with known, conserved vertebrate transferrin cDNA sequences. Single N-linked glycosylation sites existed on the N- and C-lobe. The deduced amino acid sequence of the rainbow trout transferrin cDNA had 92.9% identities with transferrin of coho salmon (Oncorhynchus kisutch); 85%, Atlantic salmon (Salmo salar); 67.3%, medaka (Oryzias latipes); 61.3% Atlantic cod (Gadus morhua); and 59.7%, Japanese flounder (Paralichthys olivaceus). The long and accurate polymerase chain reaction (LA-PCR) was used to amplify approximately 6.5 kb of the transferrin gene from rainbow trout genomic DNA. Restriction fragment length polymorphisms (RFLPs) of the LA-PCR products revealed three digestion patterns in 22 samples.
Zhu, X Q; Gasser, R B
1998-06-01
In this study, we assessed single-strand conformation polymorphism (SSCP)-based approaches for their capacity to fingerprint sequence variation in ribosomal DNA (rDNA) of ascaridoid nematodes of veterinary and/or human health significance. The second internal transcribed spacer region (ITS-2) of rDNA was utilised as the target region because it is known to provide species-specific markers for this group of parasites. ITS-2 was amplified by PCR from genomic DNA derived from individual parasites and subjected to analysis. Direct SSCP analysis of amplicons from seven taxa (Toxocara vitulorum, Toxocara cati, Toxocara canis, Toxascaris leonina, Baylisascaris procyonis, Ascaris suum and Parascaris equorum) showed that the single-strand (ss) ITS-2 patterns produced allowed their unequivocal identification to species. While no variation in SSCP patterns was detected in the ITS-2 within four species for which multiple samples were available, the method allowed the direct display of four distinct sequence types of ITS-2 among individual worms of T. cati. Comparison of SSCP/sequencing with the methods of dideoxy fingerprinting (ddF) and restriction endonuclease fingerprinting (REF) revealed that also ddF allowed the definition of the four sequence types, whereas REF displayed three of four. The findings indicate the usefulness of the SSCP-based approaches for the identification of ascaridoid nematodes to species, the direct display of sequence variation in rDNA and the detection of population variation. The ability to fingerprint microheterogeneity in ITS-2 rDNA using such approaches also has implications for studying fundamental aspects relating to mutational change in rDNA.
Wilson, Kitchener D; Shen, Peidong; Fung, Eula; Karakikes, Ioannis; Zhang, Angela; InanlooRahatloo, Kolsoum; Odegaard, Justin; Sallam, Karim; Davis, Ronald W; Lui, George K; Ashley, Euan A; Scharfe, Curt; Wu, Joseph C
2015-09-11
Thousands of mutations across >50 genes have been implicated in inherited cardiomyopathies. However, options for sequencing this rapidly evolving gene set are limited because many sequencing services and off-the-shelf kits suffer from slow turnaround, inefficient capture of genomic DNA, and high cost. Furthermore, customization of these assays to cover emerging targets that suit individual needs is often expensive and time consuming. We sought to develop a custom high throughput, clinical-grade next-generation sequencing assay for detecting cardiac disease gene mutations with improved accuracy, flexibility, turnaround, and cost. We used double-stranded probes (complementary long padlock probes), an inexpensive and customizable capture technology, to efficiently capture and amplify the entire coding region and flanking intronic and regulatory sequences of 88 genes and 40 microRNAs associated with inherited cardiomyopathies, congenital heart disease, and cardiac development. Multiplexing 11 samples per sequencing run resulted in a mean base pair coverage of 420, of which 97% had >20× coverage and >99% were concordant with known heterozygous single nucleotide polymorphisms. The assay correctly detected germline variants in 24 individuals and revealed several polymorphic regions in miR-499. Total run time was 3 days at an approximate cost of $100 per sample. Accurate, high-throughput detection of mutations across numerous cardiac genes is achievable with complementary long padlock probe technology. Moreover, this format allows facile insertion of additional probes as more cardiomyopathy and congenital heart disease genes are discovered, giving researchers a powerful new tool for DNA mutation detection and discovery. © 2015 American Heart Association, Inc.
Wang, Pei; Lu, Yanli; Zheng, Mingmin; Rong, Tingzhao; Tang, Qilin
2011-01-01
Genetic relationship of a newly discovered teosinte from Nicaragua, Zea nicaraguensis with waterlogging tolerance, was determined based on randomly amplified polymorphic DNA (RAPD) markers and the internal transcribed spacer (ITS) sequences of nuclear ribosomal DNA using 14 accessions from Zea species. RAPD analysis showed that a total of 5,303 fragments were produced by 136 random decamer primers, of which 84.86% bands were polymorphic. RAPD-based UPGMA analysis demonstrated that the genus Zea can be divided into section Luxuriantes including Zea diploperennis, Zea luxurians, Zea perennis and Zea nicaraguensis, and section Zea including Zea mays ssp. mexicana, Zea mays ssp. parviglumis, Zea mays ssp. huehuetenangensis and Zea mays ssp. mays. ITS sequence analysis showed the lengths of the entire ITS region of the 14 taxa in Zea varied from 597 to 605 bp. The average GC content was 67.8%. In addition to the insertion/deletions, 78 variable sites were recorded in the total ITS region with 47 in ITS1, 5 in 5.8S, and 26 in ITS2. Sequences of these taxa were analyzed with neighbor-joining (NJ) and maximum parsimony (MP) methods to construct the phylogenetic trees, selecting Tripsacum dactyloides L. as the outgroup. The phylogenetic relationships of Zea species inferred from the ITS sequences are highly concordant with the RAPD evidence that resolved two major subgenus clades. Both RAPD and ITS sequence analyses indicate that Zea nicaraguensis is more closely related to Zea luxurians than the other teosintes and cultivated maize, which should be regarded as a section Luxuriantes species. PMID:21525982
Mobile elements reveal small population size in the ancient ancestors of Homo sapiens.
Huff, Chad D; Xing, Jinchuan; Rogers, Alan R; Witherspoon, David; Jorde, Lynn B
2010-02-02
The genealogies of different genetic loci vary in depth. The deeper the genealogy, the greater the chance that it will include a rare event, such as the insertion of a mobile element. Therefore, the genealogy of a region that contains a mobile element is on average older than that of the rest of the genome. In a simple demographic model, the expected time to most recent common ancestor (TMRCA) is doubled if a rare insertion is present. We test this expectation by examining single nucleotide polymorphisms around polymorphic Alu insertions from two completely sequenced human genomes. The estimated TMRCA for regions containing a polymorphic insertion is two times larger than the genomic average (P << 10^-30), as predicted. Because genealogies that contain polymorphic mobile elements are old, they are shaped largely by the forces of ancient population history and are insensitive to recent demographic events, such as bottlenecks and expansions. Remarkably, the information in just two human DNA sequences provides substantial information about ancient human population size. By comparing the likelihood of various demographic models, we estimate that the effective population size of human ancestors living before 1.2 million years ago was 18,500, and we can reject all models where the ancient effective population size was larger than 26,000. This result implies an unusually small population for a species spread across the entire Old World, particularly in light of the effective population sizes of chimpanzees (21,000) and gorillas (25,000), which each inhabit only one part of a single continent.
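The doubling of the expected TMRCA stated in the abstract above can be illustrated with a small simulation. The sketch below assumes the simplest possible setting, which may differ from the model the authors used: two lineages in a constant-size population, so that the TMRCA is exponentially distributed with mean 1 in coalescent units, and insertions that fall on the two branches at a small rate `mu` per unit of branch length. The function name `simulate_tmrca` and all parameter values are illustrative assumptions.

```python
import math
import random


def simulate_tmrca(n_draws=500_000, mu=0.01, seed=1):
    """Compare mean TMRCA overall and at loci that carry a rare insertion.

    Returns (mean_all, mean_with_insertion) in coalescent time units.
    """
    rng = random.Random(seed)
    total_all = 0.0
    total_ins = 0.0
    n_ins = 0
    for _ in range(n_draws):
        # TMRCA of two lineages under the assumed constant-size model.
        t = rng.expovariate(1.0)
        total_all += t
        # Two branches of length t: the chance of at least one insertion
        # grows with t, so deep genealogies are over-represented.
        if rng.random() < 1.0 - math.exp(-2.0 * mu * t):
            total_ins += t
            n_ins += 1
    return total_all / n_draws, total_ins / n_ins
```

Because the chance of carrying an insertion is roughly proportional to t when `mu` is small, the conditional distribution of the TMRCA is the size-biased version of the exponential, whose mean is twice the unconditional mean. In this sketch the ratio of the two returned values should therefore be close to 2.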
Tashakori, Mahnaz; Kuhls, Katrin; Al-Jawabreh, Amer; Mauricio, Isabel L; Schönian, Gabriele; Farajnia, Safar; Alimohammadian, Mohammad Hossein
2006-04-01
Protozoan parasites of Leishmania major are the causative agents of cutaneous leishmaniasis in different parts of Iran. We applied PCR-based methods to analyze L. major parasites isolated from patients with active lesions from different geographic areas in Iran in order to understand DNA polymorphisms within L. major species. Twenty-four isolates were identified as L. major by RFLP analysis of the ribosomal internal transcribed spacer 1 (ITS1) amplicons. These isolates were further studied by single-strand conformation polymorphism (SSCP) analysis and sequencing of ITS1 and ITS2. Data obtained from SSCP analysis of the ITS1 and ITS2 loci revealed three and four different patterns among all studied samples, respectively. Sequencing of ITS1 and ITS2 confirmed the results of SSCP analysis and showed the potential of the PCR-SSCP method for assessing genetic heterogeneity within L. major. Different patterns in ITS1 were due to substitution of one nucleotide, whereas in ITS2 the changes were defined by variation in the number of repeats in two polymorphic microsatellites. In total five genotypic groups LmA, LmB, LmC, LmD and LmE were identified among L. major isolates. The most frequent genotype, LmA, was detected in isolates collected from different endemic areas of cutaneous leishmaniasis in Iran. Genotypes LmC, LmD and LmE were found only in the new focus of CL in Damghan (Semnan province) and LmB was identified exclusively among isolates of Kashan focus (Isfahan province). The distribution of genetic polymorphisms suggests the existence of distinct endemic regions of L. major in Iran.
Yu, Qichao; Zhang, Wei; Zhang, Xiaolong; Zeng, Yongli; Wang, Yeming; Wang, Yanhui; Xu, Liqin; Huang, Xiaoyun; Li, Nannan; Zhou, Xinlan; Lu, Jie; Guo, Xiaosen; Li, Guibo; Hou, Yong; Liu, Shiping; Li, Bo
2017-09-01
Active retrotransposons play important roles during evolution and continue to shape our genomes today, especially in genetic polymorphisms underlying a diverse set of diseases. However, studies of human retrotransposon insertion polymorphisms (RIPs) based on whole-genome deep sequencing at the population level have not been sufficiently undertaken, despite the obvious need for a thorough characterization of RIPs in the general population. Herein, we present a novel and efficient computational tool called Specific Insertions Detector (SID) for the detection of non-reference RIPs. We demonstrate that SID is suitable for high-depth whole-genome sequencing data using paired-end reads obtained from simulated and real datasets. We construct a comprehensive RIP database using a large population of 90 Han Chinese individuals with a mean depth of 68× per individual. In total, we identify 9342 recent RIPs, and 8433 of these RIPs are novel compared with dbRIP, including 5826 Alu, 2169 long interspersed nuclear element 1 (L1), 383 SVA, and 55 long terminal repeats. Among the 9342 RIPs, 4828 were located in gene regions and 5 were located in protein-coding regions. We demonstrate that RIPs can, in principle, be an informative resource for population evolution and phylogenetic analyses. Taking the demographic effects into account, we identify weak negative selection on SVA and L1 but approximately neutral selection for Alu elements based on the frequency spectrum of RIPs. SID is a powerful open-source program for the detection of non-reference RIPs. We built a non-reference RIP dataset that greatly enhanced the diversity of RIPs detected in the general population, and it should be invaluable to researchers interested in many aspects of human evolution, genetics, and disease. As a proof of concept, we demonstrate that the RIPs can be used as biomarkers in a similar way to single nucleotide polymorphisms. © The Authors 2017. Published by Oxford University Press.
Pajuelo, Mónica J; Eguiluz, María; Dahlstrom, Eric; Requena, David; Guzmán, Frank; Ramirez, Manuel; Sheen, Patricia; Frace, Michael; Sammons, Scott; Cama, Vitaliano; Anzick, Sarah; Bruno, Dan; Mahanty, Siddhartha; Wilkins, Patricia; Nash, Theodore; Gonzalez, Armando; García, Héctor H; Gilman, Robert H; Porcella, Steve; Zimic, Mirko
2015-12-01
Infections with Taenia solium are the most common cause of adult acquired seizures worldwide, and are the leading cause of epilepsy in developing countries. A better understanding of the genetic diversity of T. solium will improve parasite diagnostics and transmission pathways in endemic areas thereby facilitating the design of future control measures and interventions. Microsatellite markers are useful genome features, which enable strain typing and identification in complex pathogen genomes. Here we describe microsatellite identification and characterization in T. solium, providing information that will assist in global efforts to control this important pathogen. For genome sequencing, T. solium cysts and proglottids were collected from Huancayo and Puno in Peru, respectively. Using next generation sequencing (NGS) and de novo assembly, we assembled two draft genomes and one hybrid genome. Microsatellite sequences were identified and 36 of them were selected for further analysis. Twenty T. solium isolates were collected from Tumbes in the northern region, and twenty from Puno in the southern region of Peru. The size-polymorphism of the selected microsatellites was determined with multi-capillary electrophoresis. We analyzed the association between microsatellite polymorphism and the geographic origin of the samples. The predicted size of the hybrid (proglottid genome combined with cyst genome) T. solium genome was 111 MB with a GC content of 42.54%. A total of 7,979 contigs (>1,000 nt) were obtained. We identified 9,129 microsatellites in the Puno-proglottid genome and 9,936 in the Huancayo-cyst genome, with 5 or more repeats, ranging from mono- to hexa-nucleotide. Seven microsatellites were polymorphic and 29 were monomorphic within the analyzed isolates. T. solium tapeworms were classified into two genetic groups that correlated with the North/South geographic origin of the parasites. The availability of draft genomes for T. 
solium represents a significant step towards understanding the biology of the parasite. We report here a set of T. solium polymorphic microsatellite markers that appear promising for genetic epidemiology studies.
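The microsatellite criterion stated in the abstract (mono- to hexa-nucleotide units with 5 or more repeats) can be illustrated with a minimal sketch. The function name, the regular-expression approach, and the toy sequence are assumptions made for illustration; the abstract does not say which software the authors used.

```python
import re

def find_microsatellites(seq, min_repeats=5, max_unit=6):
    """Return sorted (start, unit, repeat_count) tuples for perfect tandem repeats.

    Hypothetical helper: only perfect repeats are found, and units that are
    themselves built from a shorter motif (e.g. "AA", "CACA") are skipped.
    """
    seq = seq.upper()
    hits = []
    for unit_len in range(1, max_unit + 1):
        # ([ACGT]{k}) captures one unit; \1{n-1,} requires n or more tandem copies in total
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (unit_len, min_repeats - 1))
        for m in pattern.finditer(seq):
            unit = m.group(1)
            is_compound = any(
                unit == unit[:k] * (unit_len // k)
                for k in range(1, unit_len)
                if unit_len % k == 0
            )
            if is_compound:
                continue
            hits.append((m.start(), unit, len(m.group(0)) // unit_len))
    return sorted(hits)

# Toy sequence (not from the T. solium genome): a (CA)6 run and an (A)7 run
toy = "GG" + "CA" * 6 + "TT" + "A" * 7 + "GC"
print(find_microsatellites(toy))
```

Real SSR-mining tools usually also handle imperfect repeats and apply unit-specific minimum lengths, which this sketch leaves out.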
Kim, S-J; Young, L J; Gonen, D; Veenstra-VanderWeele, J; Courchesne, R; Courchesne, E; Lord, C; Leventhal, B L; Cook, E H; Insel, T R
2002-01-01
Impairment in social reciprocity is a central component of autism. In preclinical studies, arginine vasopressin (AVP) has been shown to increase a range of social behaviors, including affiliation and attachment, via the V(1a) receptor (AVPR1A) in the brain. Both the behavioral effects of AVP and the neural distribution of the V1a receptor vary greatly across mammalian species. This difference in regional receptor expression as well as differences in social behavior may result from a highly variable repetitive sequence in the 5' flanking region of the V1a gene (AVPR1A). Given this comparative evidence for a role in inter-species variation in social behavior, we explored whether within our own species, variation in the human AVPR1A may contribute to individual variations in social behavior, with autism representing an extreme form of social impairment. We genotyped two microsatellite polymorphisms from the 5' flanking region of AVPR1A for 115 autism trios and found nominally significant transmission disequilibrium between autism and one of the microsatellite markers by Multiallelic Transmission/Disequilibrium test (MTDT) that was not significant after Bonferroni correction. We also screened approximately 2 kb of the 5' flanking region and the coding region and identified 10 single nucleotide polymorphisms.
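The distinction between nominal significance and significance after Bonferroni correction can be sketched as follows. The p-values are hypothetical placeholders, not the values reported in the study, and the function name is an assumption.

```python
def bonferroni(p_values):
    """Bonferroni-adjusted p-values: each p is multiplied by the number of tests, capped at 1."""
    m = len(p_values)
    return [min(1.0, p * m) for p in p_values]

# Hypothetical p-values for two markers; NOT taken from the abstract
raw = [0.03, 0.40]
adjusted = bonferroni(raw)
alpha = 0.05
for p, q in zip(raw, adjusted):
    print(p, q, "nominal" if p < alpha else "-", "corrected" if q < alpha else "-")
```

With these placeholder values the first marker is significant at the nominal level but not after correction, which is the pattern the abstract describes.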
Genotyping and Bioforensics of Ricinus communis
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hinckley, Aubree Christine
The castor bean plant (Ricinus communis) is a member of the family Euphorbiaceae. In spite of its common name, the castor plant is not a true bean (i.e., leguminous plants belonging to the family, Fabaceae). Ricinus communis is native to tropical Africa, but because the plant was recognized for its production of oil with many desirable properties, it has been introduced and cultivated in warm temperate regions throughout the world (Armstrong 1999 and Brown 2005). Castor bean plants have also been valued by gardeners as an ornamental plant and, historically, as a natural rodenticide. Today, escaped plants grow like weeds throughout much of the southwestern United States, and castor seeds are even widely available to the public for order through the Internet. In this study, multiple loci of chloroplast noncoding sequence data and a few nuclear noncoding regions were examined to identify DNA polymorphisms present among representatives from a geographically diverse panel of Ricinus communis cultivated varieties. The primary objectives for this research were (1) to successfully cultivate castor plants and extract sufficient yields of high quality DNA from an assortment of castor cultivated varieties, (2) to use PCR and sequencing to screen available universal oligos against a small panel of castor cultivars, (3) to identify DNA polymorphisms within the amplified regions, and (4) to evaluate these DNA polymorphisms as appropriate candidates for assay development (see Figure 1). Additional goals were to design, test and optimize assays targeting any DNA polymorphisms that were discovered and to rapidly screen many castor cultivars to determine the amount of diversity present at that particular locus. Ultimately, the goal of this study was to construct a phylogeographic tree representing the genetic relationships present among Ricinus communis cultivars from diverse geographic regions. 
These research objectives were designed to test the hypothesis that cultivated varieties of Ricinus communis from various geographic regions can be distinguished from one another based on differences present at the genetic level. In addition, the present study sought to determine the amount of diversity present among Ricinus communis cultivars.
Mallick, Pijush; Chattaraj, Shruti; Sikdar, Samir Ranjan
2017-10-01
The 12 pfls somatic hybrids and the 2 parents, Pleurotus florida and Lentinus squarrosulus, were characterized by ISSR and sequencing of rRNA-ITS genes. Five ISSR primers were used and amplified a total of 54 reproducible fragments with 98.14% polymorphism among all the pfls hybrid populations and parental strains. UPGMA-based clustering produced a dendrogram with three major groups among the parents and pfls hybrids. The parents P. florida and L. squarrosulus showed different degrees of genetic distance from all the hybrid lines and were closest to hybrids pfls 1m and pfls 1h, respectively. ITS1(F) and ITS4(R) amplified the rRNA-ITS gene with 611-867 bp sequence length. Nucleotide polymorphisms were found in the ITS1, ITS2 and 5.8S rRNA regions, with different numbers of bases. Based on the rRNA-ITS sequence, UPGMA clustering exhibited three distinct groups comprising L. squarrosulus and pfls 1p, pfls 1m and pfls 1s, and pfls 1e and P. florida.
Ben Ali, Sina-Elisabeth; Madi, Zita Erika; Hochegger, Rupert; Quist, David; Prewein, Bernhard; Haslberger, Alexander G.; Brandes, Christian
2014-01-01
Genetic mutations must be avoided during the production and use of seeds. In the European Union (EU), Directive 2001/18/EC requires any DNA construct introduced via transformation to be stable. Establishing genetic stability is critical for the approval of genetically modified organisms (GMOs). In this study, genetic stability of two GMOs was examined using high resolution melting (HRM) analysis and real-time polymerase chain reaction (PCR) employing Scorpion primers for amplification. The genetic variability of the transgenic insert and that of the flanking regions in a single oilseed rape variety (GT73) and a stacked maize (MON88017 × MON810) was studied. The GT73 and the 5' region of MON810 showed no instabilities in the examined regions. However, two out of 100 analyzed samples carried a heterozygous point mutation in the 3' region of MON810 in the stacked variety. These results were verified by direct sequencing of the amplified PCR products as well as by sequencing of cloned PCR fragments. The occurrence of the mutation suggests that the 5' region is more suitable than the 3' region for the quantification of MON810. The identification of the single nucleotide polymorphism (SNP) in a stacked event is in contrast to the results of earlier studies of the same MON810 region in a single event, where no DNA polymorphism was found. PMID:25365178
Domier, L L; Latorre, I J; Steinlage, T A; McCoppin, N; Hartman, G L
2003-10-01
The variability of North American and Asian strains and isolates of Soybean mosaic virus was investigated. First, polymerase chain reaction (PCR) products representing the coat protein (CP)-coding regions of 38 SMVs were analyzed for restriction fragment length polymorphisms (RFLP). Second, the nucleotide and predicted amino acid sequence variability of the P1-coding region of 18 SMVs and the helper component/protease (HC/Pro) and CP-coding regions of 25 SMVs were assessed. The CP nucleotide and predicted amino acid sequences were the most similar and predicted phylogenetic relationships similar to those obtained from RFLP analysis. Neither RFLP nor sequence analyses of the CP-coding regions grouped the SMVs by geographical origin. The P1 and HC/Pro sequences were more variable and separated the North American and Asian SMV isolates into two groups similar to previously reported differences in pathogenic diversity of the two sets of SMV isolates. The P1 region was the most informative of the three regions analyzed. To assess the biological relevance of the sequence differences in the HC/Pro and CP coding regions, the transmissibility of 14 SMV isolates by Aphis glycines was tested. All field isolates of SMV were transmitted efficiently by A. glycines, but the laboratory isolates analyzed were transmitted poorly. The amino acid sequences from most, but not all, of the poorly transmitted isolates contained mutations in the aphid transmission-associated DAG and/or KLSC amino acid sequence motifs of CP and HC/Pro, respectively.
Wang, Hongtao; Li, Guisheng; Kwon, Woo-Saeng; Yang, Deok-Chun
2016-06-04
Panax ginseng is one of the most valuable medicinal plants in the Orient. The low level of genetic variation has limited the application of molecular markers for cultivar authentication and marker-assisted selection in cultivated ginseng. To exploit DNA polymorphism within ginseng cultivars, ginseng expressed sequence tags (ESTs) were searched against the potential intron polymorphism (PIP) database to predict the positions of introns. Intron-flanking primers were then designed in conserved exon regions and used to amplify across the more variable introns. Sequencing results showed that single nucleotide polymorphisms (SNPs), as well as indels, were detected in four EST-derived introns, and SNP markers specific to "Gopoong" and "K-1" were first reported in this study. Based on cultivar-specific SNP sites, allele-specific polymerase chain reaction (PCR) was conducted and proved to be effective for the authentication of ginseng cultivars. Additionally, the combination of a simple NaOH-Tris DNA isolation method and real-time allele-specific PCR assay enabled the high throughput selection of cultivars from ginseng fields. The established real-time allele-specific PCR assay should be applied to molecular authentication and marker assisted selection of P. ginseng cultivars, and the EST intron-targeting strategy will provide a potential approach for marker development in species without whole genomic DNA sequence information.
Que, Ting-zhi; Zhao, Shu-min; Li, Cheng-tao
2010-08-01
Strategies for determining half siblings who share the same mother were investigated through the detection of autosomal and X-chromosomal STR (X-STR) loci and polymorphisms in the hypervariable (HV) region of mitochondrial DNA (mtDNA). Genomic DNA was extracted from blood stain samples of 3 full siblings and one dubious half sibling sharing the same mother with them. Fifteen autosomal STR loci were genotyped with the Sinofiler kit, and 19 X-STR loci were genotyped with the Mentype Argus X-8 kit and a 16-plex in-house system. Polymorphisms of mtDNA HV-I and HV-II were also detected by sequencing. A full sibling relationship between the dubious half sibling and each of the 3 full siblings was excluded based on the results of autosomal STR genotyping and calculation of the full sibling index (FSI) and half sibling index (HSI). Sequencing of mtDNA HV-I and HV-II showed that all 4 samples came from the same maternal line. X-STR genotyping results determined that the dubious half sibling shared the same mother with the 3 full siblings. Combining three different genotyping technologies, autosomal STR, X-STR and sequencing of mtDNA HV-I and HV-II, is a reliable way to determine half siblings who share the same mother.
Li, Hong; Sun, Gui-Rong; Tian, Ya-Dong; Han, Rui-Li; Li, Guo-Xi; Kang, Xiang-Tao
2013-05-01
In the present study, a total of 860 chickens from a Gushi-Anka F2 resource population were used to evaluate the genetic effect of the gga-miR-1614-3p gene. A novel, silent, single nucleotide polymorphism (SNP, +5 C>T) was detected in the gga-miR-1614-3p gene seed region through AvaII polymerase chain reaction-restriction fragment length polymorphism (PCR-RFLP) and sequencing of PCR products. Associations between the SNP and chicken growth, meat quality and carcass traits were tested by association analysis. The results showed that the SNP was significantly associated with breast muscle shear force, leg muscle water loss rate, wing weight, liver weight and heart weight (p<0.05), and highly significantly associated with abdominal fat weight (p<0.01). According to predictions from the M-fold program, the variation altered the secondary structure and the free energy of gga-miR-1614.
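PCR-RFLP genotyping rests on a SNP creating or destroying a restriction site, so the two alleles give different fragment patterns after digestion. The sketch below assumes the standard AvaII recognition sequence GGWCC (W = A or T, cut after the first G). The two amplicons are invented toy sequences, not the gga-miR-1614 locus, and the abstract does not state which allele is cut.

```python
import re

# AvaII recognises GGWCC, where W is A or T, and cuts G^GWCC
AVAII_SITE = re.compile(r"GG[AT]CC")

def site_positions(amplicon):
    """0-based start positions of every AvaII recognition site."""
    return [m.start() for m in AVAII_SITE.finditer(amplicon.upper())]

def fragment_lengths(amplicon):
    """Fragment sizes expected after complete AvaII digestion of a linear amplicon."""
    cuts = [p + 1 for p in site_positions(amplicon)]  # cut falls after the first G
    edges = [0] + cuts + [len(amplicon)]
    return [b - a for a, b in zip(edges, edges[1:])]

# Hypothetical amplicons differing by a single C>T change inside the site
allele_c = "ATGCA" + "GGACC" + "TTGAC"
allele_t = "ATGCA" + "GGATC" + "TTGAC"
print(fragment_lengths(allele_c))  # site present: two fragments
print(fragment_lengths(allele_t))  # site lost: one uncut fragment
```

A heterozygote would show the union of both patterns on a gel, which is what makes the assay usable for scoring all three genotypes.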
Zhang, Xiuqing; Xu, Zhangyang; Pei, Haisheng; Chen, Zhou; Tan, Xiaoyan; Hu, Jing; Yang, Bin; Sun, Junshe
2017-01-01
Ganoderma lucidum is a typical polypore fungus used for traditional Chinese medical purposes. The taxonomic delimitation of Ganoderma lucidum is still debated. In this study, we sequenced seven internal transcribed spacer (ITS) sequences of Ganoderma lucidum strains and annotated the ITS1 and ITS2 regions. Phylogenetic analysis of ITS1 differentiated the strains into three geographic groups. Groups 1–3 originated from Europe, tropical Asia, and eastern Asia, respectively. ITS2, in contrast, could only differentiate the strains into two groups, in which Group 2, originating from tropical Asia, clustered with Groups 1 and 3, originating from Europe and eastern Asia. Determination of the secondary structures of the ITS1 sequences showed that these three groups had similar structures, with a conserved central core and differing helices. Compared with Group 2, Groups 1 and 3 of the ITS2 sequences shared similar structures, differing in helix 4. Large-scale evaluation of both ITS1 and ITS2 showed that the majority of subgroups in the same group shared similar structures. Further Weblogo analysis of ITS1 sequences revealed two main variable regions located in helix 2, in which C/T or A/G substitutions frequently occurred, and ITS1 exhibited more nucleotide variation than ITS2. ITS1 multi-alignment of seven spawn strains and culture tests indicated that a single-nucleotide polymorphism (SNP) site at position 180 correlated with strain antagonism. The HZ, TK and 203 fusion strains of Ganoderma lucidum had a T at position 180, whereas other strains exhibiting antagonism, including DB, RB, JQ, and YS, had a C. Taken together, compared with the ITS2 region, the ITS1 region could differentiate Ganoderma lucidum into three geographic origins based on phylogenetic analysis and secondary structure prediction. In addition, a SNP in ITS1 could delineate Ganoderma lucidum strains at the intraspecific level. 
These findings will be implemented to improve species quality control in the Ganoderma industry. PMID:28056060
van Endert, P M; Lopez, M T; Patel, S D; Monaco, J J; McDevitt, H O
1992-01-01
Recently, two subunits of a large cytosolic protease and two putative peptide transporter proteins were found to be encoded by genes within the class II region of the major histocompatibility complex (MHC). These genes have been suggested to be involved in the processing of antigenic proteins for presentation by MHC class I molecules. Because of the high degree of polymorphism in MHC genes, and previous evidence for both functional and polypeptide sequence polymorphism in the proteins encoded by the antigen-processing genes, we tested DNA from 27 consanguineous human cell lines for genomic polymorphism by restriction fragment length polymorphism (RFLP) analysis. These studies demonstrate a strong linkage disequilibrium between TAP1 and LMP2 RFLPs. Moreover, RFLPs, as well as a polymorphic stop codon in the telomeric TAP2 gene, appear to be in linkage disequilibrium with HLA-DR alleles and RFLPs in the HLA-DO gene. A high rate of recombination, however, seems to occur in the center of the complex, between the TAP1 and TAP2 genes. PMID:1360671
Pang, Erli; Wu, Xiaomei; Lin, Kui
2016-06-01
Protein evolution plays an important role in the evolution of each genome. Because proteins are functional molecules, their parts or sites are generally under different selective constraints, particularly from purifying selection. Most previous studies on protein evolution considered individual proteins in their entirety or compared protein-coding sequences with non-coding sequences. Less attention has been paid to the evolution of different parts within each protein of a given genome. To this end, based on PfamA annotation of all human proteins, each protein sequence can be split into two parts: domains and unassigned regions. Using this rationale, single nucleotide polymorphisms (SNPs) in protein-coding sequences from the 1000 Genomes Project were mapped according to two classifications: SNPs occurring within protein domains and those within unassigned regions. With these classifications, we found that the density of synonymous SNPs within domains is significantly greater than that of synonymous SNPs within unassigned regions, whereas the density of non-synonymous SNPs shows the opposite pattern. We also found signatures of purifying selection on both the domains and the unassigned regions. Furthermore, the selective strength on domains is significantly greater than that on unassigned regions. In addition, among all of the human protein sequences, there are 117 PfamA domains in which no SNPs are found. Our results highlight an important aspect of protein domains and may contribute to our understanding of protein evolution.
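The comparison of SNP density between domains and unassigned regions can be sketched as a count of SNPs per unit length in each class of interval. The coordinates and SNP positions below are invented for illustration, and the function name and half-open interval convention are assumptions; they are not the Pfam or 1000 Genomes data used in the study.

```python
def snp_density_per_kb(snp_positions, regions):
    """SNPs per 1000 sites within a set of non-overlapping half-open (start, end) intervals."""
    total_len = sum(end - start for start, end in regions)
    hits = sum(
        any(start <= pos < end for start, end in regions)
        for pos in snp_positions
    )
    return 1000.0 * hits / total_len

# Hypothetical coding sequence of 400 sites: two domains and one unassigned stretch
domains = [(0, 100), (300, 400)]
unassigned = [(100, 300)]
snps = [10, 50, 150, 350]  # hypothetical SNP coordinates

print(snp_density_per_kb(snps, domains))
print(snp_density_per_kb(snps, unassigned))
```

In practice the same calculation would be run separately for synonymous and non-synonymous SNPs, since the abstract reports opposite patterns for the two classes.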
USDA-ARS's Scientific Manuscript database
Polymorphic genetic markers were identified and characterized using a partial genomic library of Heliothis virescens enriched for simple sequence repeats (SSR) and nucleotide sequences of expressed sequence tags (EST). Nucleotide sequences of 192 clones from the partial genomic library yielded 147 u...
HTRA1 promoter polymorphism predisposes Japanese to age-related macular degeneration.
Yoshida, Tsunehiko; DeWan, Andrew; Zhang, Hong; Sakamoto, Ryosuke; Okamoto, Haru; Minami, Masayoshi; Obazawa, Minoru; Mizota, Atsushi; Tanaka, Minoru; Saito, Yoshihiro; Takagi, Ikue; Hoh, Josephine; Iwata, Takeshi
2007-04-04
The aim was to study, in a Japanese cohort, the effect of candidate single nucleotide polymorphisms (SNPs) on chromosome 10q26 that were recently shown to be associated with wet age-related macular degeneration (AMD) in Chinese and Caucasian cohorts. Using genomic DNA isolated from peripheral blood of wet AMD cases and age-matched controls, we genotyped two SNPs, rs10490924 and rs11200638, on chromosome 10q26, 6.6 kb and 512 bp upstream of the HTRA1 gene, respectively, using temperature gradient capillary electrophoresis (TGCE) and direct sequencing. Association tests were performed for individual SNPs and jointly with the complement factor H (CFH) Y402H SNP. The two SNPs, rs10490924 and rs11200638, are in complete linkage disequilibrium (D'=1). Previous sequence comparisons among seventeen species revealed that the genomic region containing rs11200638 was highly conserved while the region surrounding rs10490924 was not. The allelic association test for rs11200638 yielded a p-value <10^-11. SNP rs11200638 conferred disease risk in an autosomal recessive fashion: the odds ratio, adjusted for SNP CFH 402, was 10.1 (95% CI 4.36, 23.06) for those carrying two copies of the risk allele, whereas it was indistinguishable from unity for those carrying only one risk allele. The HTRA1 promoter polymorphism, rs11200638, is a strong candidate with a functional consequence that predisposes Japanese to develop neovascular AMD.
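Two quantities in this abstract, the odds ratio with its 95% confidence interval and the linkage disequilibrium measure D', can be sketched from first principles. The counts and haplotype frequencies below are hypothetical, and the odds ratio here is unadjusted (Woolf's log method), whereas the published estimate was adjusted for CFH genotype, so the output is not expected to reproduce the reported 10.1.

```python
import math

def odds_ratio_ci(a, b, c, d, z=1.96):
    """Unadjusted odds ratio and Woolf confidence interval from a 2x2 table.

    a, b = cases with / without the risk genotype
    c, d = controls with / without the risk genotype
    """
    odds_ratio = (a * d) / (b * c)
    se_log = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    lower = math.exp(math.log(odds_ratio) - z * se_log)
    upper = math.exp(math.log(odds_ratio) + z * se_log)
    return odds_ratio, lower, upper

def d_prime(p_ab, p_a, p_b):
    """Lewontin's D' from the AB haplotype frequency and the two allele frequencies."""
    d = p_ab - p_a * p_b
    if d == 0:
        return 0.0
    if d > 0:
        d_max = min(p_a * (1 - p_b), (1 - p_a) * p_b)
    else:
        d_max = min(p_a * p_b, (1 - p_a) * (1 - p_b))
    return d / d_max

# Hypothetical counts and frequencies, for illustration only
print(odds_ratio_ci(40, 60, 10, 90))
print(d_prime(0.30, 0.30, 0.50))
```

A D' of 1 means that at least one of the four possible haplotypes is absent, which is consistent with the abstract's description of complete linkage disequilibrium between the two SNPs.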
Modliszewski, Jennifer L; Thomas, David T; Fan, Chuanzhu; Crawford, Daniel J; Depamphilis, Claude W; Xiang, Qiu-Yun Jenny
2006-03-01
Knowledge regarding the origin and maintenance of hybrid zones is critical for understanding the evolutionary outcomes of natural hybridization. To evaluate the contribution of historical contact vs. long-distance gene flow in the formation of a broad hybrid zone in central and northern Georgia that involves Aesculus pavia, A. sylvatica, and A. flava, three cpDNA regions (matK, trnD-trnT, and trnH-trnK) were analyzed. The maternal inheritance of cpDNA in Aesculus was confirmed via sequencing of matK from progeny of controlled crosses. Restriction site analyses identified 21 unique haplotypes among 248 individuals representing 29 populations from parental species and hybrids. Haplotypes were sequenced for all cpDNA regions. Restriction site and sequence data were subjected to phylogeographic and population genetic analyses. Considerable cpDNA variation was detected in the hybrid zone, as well as ancestral cpDNA polymorphism; furthermore, the distribution of haplotypes indicates limited interpopulation gene flow via seeds. The genealogy and structure of genetic variation further support the historical presence of A. pavia in the Piedmont, although the species is at present locally extinct there. In conjunction with previous allozyme studies, the cpDNA data suggest that the hybrid zone originated through historical local gene flow, yet is maintained by periodic long-distance pollen dispersal.
Komínková, Eva; Dreiseitl, Antonín; Malečková, Eva; Doležel, Jaroslav
2016-01-01
Population surveys of Blumeria graminis f. sp. hordei (Bgh), a causal agent of more than 50% of barley fungal infections in the Czech Republic, have traditionally been based on virulence tests, at times supplemented with non-specific restriction fragment length polymorphism or random amplified polymorphic DNA markers. A genomic sequence of Bgh, which has recently become available, enables identification of potential markers suitable for population genetics studies. Two major strategies relying on transposable elements and microsatellites were employed in this work to develop a set of repeat junction markers, simple sequence repeat markers and single nucleotide polymorphism markers. The resolving power of the new panel of markers, comprising 33 polymorphisms, was demonstrated by a phylogenetic analysis of 158 Bgh isolates. A core set of 97 Czech isolates was compared to a set of 50 Australian isolates against the background of 11 diverse isolates collected throughout the world. 73.2% of Czech isolates were found to be genetically unique. The extreme diversity of this collection was in strong contrast with the uniformity of the Australian one. This work paves the way for studies of population structure and dynamics based on genetic variability among different Bgh isolates originating from geographically limited regions. PMID:27875588
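One plausible reading of "genetically unique" is that an isolate's multilocus genotype across the marker panel is shared with no other isolate in the set. The sketch below counts isolates under that assumption; the genotype tuples are invented and much shorter than the 33-marker panel described above.

```python
from collections import Counter

def fraction_unique(genotypes):
    """Fraction of isolates whose multilocus genotype occurs exactly once in the set."""
    counts = Counter(genotypes)
    unique = sum(1 for g in genotypes if counts[g] == 1)
    return unique / len(genotypes)

# Hypothetical isolates scored at three binary markers (tuples are hashable)
isolates = [
    (0, 1, 1),
    (0, 1, 1),  # same genotype as the previous isolate, so neither is unique
    (1, 0, 1),
    (1, 1, 1),
]
print(fraction_unique(isolates))
```

For reference, 71 unique isolates out of 97 would give 73.2%, matching the reported figure, although the abstract does not state the underlying count.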
Vasudevan, Kumar; Vera Cruz, Casiana M.; Gruissem, Wilhelm; Bhullar, Navreet K.
2016-01-01
Rice blast is caused by Magnaporthe oryzae, which is the most destructive fungal pathogen affecting rice growing regions worldwide. The rice blast resistance gene Pib confers broad-spectrum resistance against Southeast Asian M. oryzae races. We investigated the allelic diversity of Pib in rice germplasm originating from 12 major rice growing countries. Twenty-five new Pib alleles were identified that have unique single nucleotide polymorphisms (SNPs), insertions and/or deletions, in addition to the polymorphic nucleotides that are shared between the different alleles. These partially or completely shared polymorphic nucleotides indicate frequent sequence exchange events between the Pib alleles. In some of the new Pib alleles, nucleotide diversity is high in the LRR domain, whereas, in others it is distributed among the NB-ARC and LRR domains. Most of the polymorphic amino acids in LRR and NB-ARC2 domains are predicted as solvent-exposed. Several of the alleles and the unique SNPs are country specific, suggesting a diversifying selection of alleles in various geographical locations in response to the locally prevalent M. oryzae population. Together, the new Pib alleles are an important genetic resource for rice blast resistance breeding programs and provide new information on rice-M. oryzae interactions at the molecular level. PMID:27446145
Jorge, Paulo H; Mastrochirico-Filho, Vito A; Hata, Milene E; Mendes, Natália J; Ariede, Raquel B; de Freitas, Milena Vieira; Vera, Manuel; Porto-Foresti, Fábio; Hashimoto, Diogo T
2018-01-01
The pirapitinga, Piaractus brachypomus (Characiformes, Serrasalmidae), is a fish from the Amazon basin and is considered to be one of the main native species used in aquaculture production in South America. The objectives of this study were: (1) to perform liver transcriptome sequencing of pirapitinga through NGS and then validate a set of microsatellite markers for this species; and (2) to use polymorphic microsatellites for analysis of genetic variability in farmed stocks. The transcriptome sequencing was carried out through the Roche/454 technology, which resulted in 3,696 non-redundant contigs. Of this total, 2,568 contigs had similarity in the non-redundant (nr) protein database (Genbank) and 2,075 sequences were characterized in the categories of Gene Ontology (GO). After the validation process of 30 microsatellite loci, eight markers showed polymorphism. The analysis of these polymorphic markers in farmed stocks revealed that fish farms from North Brazil had a higher genetic diversity than fish farms from Southeast Brazil. AMOVA demonstrated that the highest proportion of variation was presented within the populations. However, when comparing different groups (1: Wild; 2: North fish farms; 3: Southeast fish farms), a considerable variation between the groups was observed. The FST values showed the occurrence of genetic structure among the broodstocks from different regions of Brazil. The transcriptome sequencing in pirapitinga provided important genetic resources for biological studies in this non-model species, and microsatellite data can be used as the framework for the genetic management of breeding stocks in Brazil, which might provide a basis for a genetic pre-breeding programme.
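The fixation index used to detect genetic structure among stocks can be illustrated with the textbook heterozygosity-based definition, FST = (HT - HS) / HT, at a single multiallelic locus. The allele frequencies are hypothetical and equal population sizes are assumed; the abstract does not say which estimator or software the authors used.

```python
def fst_single_locus(pop_freqs):
    """F_ST = (H_T - H_S) / H_T for one locus, assuming equally weighted populations.

    pop_freqs: list of allele-frequency lists, one per population,
    each listing the same alleles in the same order and summing to 1.
    """
    n_pops = len(pop_freqs)
    n_alleles = len(pop_freqs[0])
    # Mean expected heterozygosity within populations
    h_s = sum(1 - sum(p * p for p in freqs) for freqs in pop_freqs) / n_pops
    # Expected heterozygosity of the pooled population
    mean_freqs = [sum(freqs[k] for freqs in pop_freqs) / n_pops for k in range(n_alleles)]
    h_t = 1 - sum(p * p for p in mean_freqs)
    return (h_t - h_s) / h_t

# Hypothetical allele frequencies for two stocks at one biallelic locus
print(fst_single_locus([[0.2, 0.8], [0.8, 0.2]]))
print(fst_single_locus([[0.5, 0.5], [0.5, 0.5]]))
```

Values near 0 indicate little differentiation between stocks, and values approaching 1 indicate that stocks are fixed for different alleles.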
Vishnevskaia, M S; Pavlov, A V; Dziubenko, E A; Dziubenko, N I; Potokina, E K
2014-04-01
Based on legume genome synteny, the nucleotide sequence of the Srlk gene, whose key role in the response to salt stress was demonstrated for the model species Medicago truncatula, was identified in the major forage and green manure crop alfalfa (Medicago sativa). In twelve alfalfa samples originating from regions with contrasting growing conditions, 19 SNPs were revealed in the Srlk gene. For two nonsynonymous SNPs, molecular markers were designed that could be further used to analyze the association between Srlk gene nucleotide polymorphism and the variability in salt stress tolerance among alfalfa cultivars.
Chwialkowska, Karolina; Korotko, Urszula; Kosinska, Joanna; Szarejko, Iwona; Kwasniewski, Miroslaw
2017-01-01
Epigenetic mechanisms, including histone modifications and DNA methylation, mutually regulate chromatin structure, maintain genome integrity, and affect gene expression and transposon mobility. Variations in DNA methylation within plant populations, as well as methylation in response to internal and external factors, are of increasing interest, especially in the crop research field. Methylation Sensitive Amplification Polymorphism (MSAP) is one of the most commonly used methods for assessing DNA methylation changes in plants. This method involves gel-based visualization of PCR fragments from selectively amplified DNA that are cleaved using methylation-sensitive restriction enzymes. In this study, we developed and validated a new method based on the conventional MSAP approach called Methylation Sensitive Amplification Polymorphism Sequencing (MSAP-Seq). We improved the MSAP-based approach by replacing the conventional separation of amplicons on polyacrylamide gels with direct, high-throughput sequencing using Next Generation Sequencing (NGS) and automated data analysis. MSAP-Seq allows for global sequence-based identification of changes in DNA methylation. This technique was validated in Hordeum vulgare. However, MSAP-Seq can be straightforwardly implemented in different plant species, including crops with large, complex and highly repetitive genomes. The incorporation of high-throughput sequencing into MSAP-Seq enables parallel and direct analysis of DNA methylation in hundreds of thousands of sites across the genome. MSAP-Seq provides direct genomic localization of changes and enables quantitative evaluation. We have shown that the MSAP-Seq method specifically targets gene-containing regions and that a single analysis can cover three-quarters of all genes in large genomes. Moreover, MSAP-Seq's simplicity, cost effectiveness, and high-multiplexing capability make this method highly affordable. 
Therefore, MSAP-Seq can be used for DNA methylation analysis in crop plants with large and complex genomes. PMID:29250096
Davey, Sue; Navarrete, Cristina; Brown, Colin
2017-06-01
Twenty-nine human platelet antigen systems have been described to date, but the majority of current genotyping methods are restricted to the identification of those most commonly associated with alloantibody production in a clinical context. This can result in a protracted investigation if causative human platelet antigens are rare or novel. A targeted next-generation sequencing approach was designed to detect all known human platelet antigens with the additional capability of identifying novel mutations in the encoding genes. A targeted enrichment, high-sensitivity HaloPlex assay was designed to sequence all exons and flanking regions of the six genes known to encode human platelet antigens. Indexed DNA libraries were prepared from 47 previously human platelet antigen-genotyped samples and subsequently combined into one of three pools for sequencing on an Illumina MiSeq platform. The generated FASTQ files were aligned and scrutinized for each human platelet antigen polymorphism using SureCall data analysis software. Forty-six samples were successfully genotyped for human platelet antigens 1 through 29bw, with an average per base coverage depth of 1144. Concordance with historical human platelet antigen genotypes was 100%. A putative novel mutation in Exon 10 of the integrin β-3 (ITGB3) gene from an unsolved case of fetal neonatal alloimmune thrombocytopenia was also detected. A next-generation sequencing-based method that can accurately define all known human platelet antigen polymorphisms was developed. With the ability to sequence up to 96 samples simultaneously, our HaloPlex design could be used for high-throughput human platelet antigen genotyping. This method is also applicable for investigating fetal neonatal alloimmune thrombocytopenia when rare or novel human platelet antigens are suspected. © 2017 AABB.
The proteolytic processing site of the precursor of lysyl oxidase.
Cronshaw, A D; Fothergill-Gilmore, L A; Hulmes, D J
1995-01-01
The precise cleavage site of the N-terminal propeptide region of the precursor of lysyl oxidase has not yet been established, due to N-terminal blocking of the mature protein. Using a combination of peptide fragmentation, amino acid sequencing, time-of-flight m.s. and partial chemical unblocking procedures, it is shown that the mature form of lysyl oxidase begins at residue Asp-169 of the precursor protein (numbered according to the human sequence). The cleavage site is 28 residues to the C-terminal side of the site previously suggested on the basis of apparent molecular mass by SDS/PAGE, with the consequence that the two putative N-linked glycosylation sites and the position of the Arg/Gln sequence polymorphism are now all in the precursor region. PMID:7864821
King, Lanikea B.; Walum, Hasse; Inoue, Kiyoshi; Eyrich, Nicholas W.; Young, Larry J.
2015-01-01
Background: Oxytocin (OXT) modulates several aspects of social behavior. Intranasal OXT is a leading candidate for treating social deficits in autism spectrum disorder (ASD), and common genetic variants in the human oxytocin receptor (OXTR) are associated with emotion recognition, relationship quality and ASD. Animal models have revealed that individual differences in Oxtr expression in the brain drive social behavior variation. Our understanding of how genetic variation contributes to brain OXTR expression is very limited. Methods: We investigated Oxtr expression in monogamous prairie voles, which have a well characterized OXT system. We quantified brain region-specific levels of Oxtr mRNA and OXTR protein with established neuroanatomical methods. We used pyrosequencing to investigate allelic imbalance of Oxtr mRNA, a molecular signature of polymorphic genetic regulatory elements. We performed next-generation sequencing to discover variants in and near the Oxtr gene. We investigated social attachment using the partner preference test. Results: Our allelic imbalance data demonstrate that genetic variants contribute to individual differences in Oxtr expression, but only in particular brain regions, including the nucleus accumbens (NAcc), where OXTR signaling facilitates social attachment. Next-generation sequencing identified one polymorphism in the Oxtr intron, near a putative cis-regulatory element, explaining 74% of the variance in striatal Oxtr expression specifically. Males homozygous for the high expressing allele display enhanced social attachment. Discussion: Taken together, these findings provide convincing evidence for robust genetic influence on Oxtr expression and provide novel insights into how non-coding polymorphisms in the OXTR might influence individual differences in human social cognition and behavior. PMID:26893121
Feltus, F. Alex; Wan, Jun; Schulze, Stefan R.; Estill, James C.; Jiang, Ning; Paterson, Andrew H.
2004-01-01
Dense coverage of the rice genome with polymorphic DNA markers is an invaluable tool for DNA marker-assisted breeding, positional cloning, and a wide range of evolutionary studies. We have aligned drafts of two rice subspecies, indica and japonica, and analyzed levels and patterns of genetic diversity. After filtering multiple copy and low quality sequence, 408,898 candidate DNA polymorphisms (SNPs/INDELs) were discerned between the two subspecies. These filters have the consequence that our data set includes only a subset of the available SNPs (in particular excluding large numbers of SNPs that may occur between repetitive DNA alleles) but increase the likelihood that this subset is useful: Direct sequencing suggests that 79.8% ± 7.5% of the in silico SNPs are real. The SNP sample in our database is not randomly distributed across the genome. In fact, 566 rice genomic regions had unusually high (328 contigs/48.6 Mb/13.6% of genome) or low (237 contigs/64.7 Mb/18.1% of genome) polymorphism rates. Many SNP-poor regions were substantially longer than most SNP-rich regions, covering up to 4 Mb, and possibly reflecting introgression between the respective gene pools that may have occurred hundreds of years ago. Although 46.2% ± 8.3% of the SNPs differentiate other pairs of japonica and indica genotypes, SNP rates in rice were not predictive of evolutionary rates for corresponding genes in another grass species, sorghum. The data set is freely available at http://www.plantgenome.uga.edu/snp. PMID:15342564
Masny, Aleksander; Jagiełło, Agata; Płucienniczak, Grażyna; Golab, Elzbieta
2012-09-01
Ribo HRM, a single-tube PCR and high resolution melting (HRM) assay for detection of polymorphisms in the large subunit ribosomal DNA expansion segment V, was developed on a Trichinella model. Four Trichinella species: T. spiralis (isolates ISS3 and ISS160), T. nativa (isolates ISS10 and ISS70), T. britovi (isolates ISS2 and ISS392) and T. pseudospiralis (isolates ISS13 and ISS1348) were genotyped. Cloned allelic variants of the expansion segment V were used as standards to prepare reference HRM curves characteristic for single sequences and mixtures of several cloned sequences imitating allelic composition detected in Trichinella isolates. Using the primer pair Tsr1 and Trich1bi, it was possible to amplify a fragment of the ESV and detect PCR products obtained from the genomic DNA of pools of larvae belonging to the four investigated species: T. pseudospiralis, T. spiralis, T. britovi and T. nativa, in a single tube Real-Time PCR reaction. Differences in the shape of the HRM curves of Trichinella isolates suggested the presence of differences between examined isolates of T. nativa, T. britovi and T. pseudospiralis species. No differences were observed between T. spiralis isolates. The presence of polymorphisms within the amplified ESV sequence fragment of T. nativa, T. britovi and T. pseudospiralis was confirmed by sequencing of the cloned PCR products. Novel sequences were discovered and deposited in GenBank (GenBank IDs: JN971020-JN971027, JN120902.1, JN120903.1, JN120904.1, JN120906.1, JN120905.1). Screening the ESV region of Trichinella for polymorphism is possible using the genotyping assay Ribo HRM at the current state of its development. The Ribo HRM assay could be useful in phylogenetic studies of the Trichinella genus. Copyright © 2012 Elsevier B.V. All rights reserved.
Kang, Tae Hwa; Han, Sang Hoon; Park, Sun Jae
2015-01-01
We developed microsatellite markers for genetic structural analyses of Dorcus hopei, a stag beetle species, using next generation sequencing and polymerase chain reaction (PCR)-based genotyping for regional populations. A total of 407,070,351 base pairs of genomic DNA containing >4000 microsatellite loci except AT repeats were sequenced. From 76 loci selected for primer design, 27 were polymorphic. Of these 27 markers, 10 were tested on three regional populations: two Chinese (Shichuan and Guangxi) and one Korean (Wanju). Three markers were excluded due to inconsistent amplification, genotyping errors, and deviation from Hardy-Weinberg equilibrium (HWE). By multi-locus genotyping, the allele number, observed heterozygosity and polymorphism information content of seven microsatellite loci were 2‒10, 0.1333‒1.0000, and 0.1228‒0.8509, respectively. In an analysis of the genetic differentiation among regional populations including one Japanese population and one cross-breeding population, the individual colored bar-plots showed that both Chinese populations were closer to each other than to the Far East Asian populations. In Far East Asian populations, Wanju and Nirasaki populations could not be distinguished from each other because the frequency of genetic contents was very similar in some individuals of the two populations. Moreover, the cross-breeding population contained all patterns of genetic contents shown in Chinese, Korean, and Japanese populations, compared with the genetic content frequency of each regional population. As a result, we examined whether the cross-breeding population might be a hybrid population, and might contain a possibility of interbreeding with Chinese populations in parental generations. Therefore, these markers will be useful for analyses of genetic diversity in populations, genetic relationships between regional populations, genetic structure analyses, and origin tests. PMID:26370965
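The per-locus summary statistics named in this abstract (allele number, observed heterozygosity, polymorphism information content) can be illustrated with a minimal Python sketch. It assumes the standard textbook definitions, with PIC = 1 − Σ p_i² − Σ_{i<j} 2 p_i² p_j²; the abstract does not state which software or formula the authors used, and the genotypes below are hypothetical allele sizes, not data from the study.

```python
from collections import Counter
from itertools import combinations


def allele_frequencies(genotypes):
    """Return {allele: frequency} for a list of diploid genotypes (a, b)."""
    counts = Counter(allele for genotype in genotypes for allele in genotype)
    total = sum(counts.values())
    return {allele: n / total for allele, n in counts.items()}


def observed_heterozygosity(genotypes):
    """Fraction of individuals carrying two different alleles."""
    return sum(a != b for a, b in genotypes) / len(genotypes)


def expected_heterozygosity(freqs):
    """Expected heterozygosity under Hardy-Weinberg proportions: 1 - sum(p_i^2)."""
    return 1 - sum(p * p for p in freqs.values())


def pic(freqs):
    """Polymorphism information content (assumed standard definition)."""
    p = list(freqs.values())
    cross_term = sum(2 * x * x * y * y for x, y in combinations(p, 2))
    return 1 - sum(x * x for x in p) - cross_term


# Hypothetical microsatellite genotypes (allele sizes in bp) at one locus.
genotypes = [(150, 150), (150, 154), (154, 158), (150, 158)]
freqs = allele_frequencies(genotypes)
print("alleles:", len(freqs))
print("Ho:", observed_heterozygosity(genotypes))
print("He:", expected_heterozygosity(freqs))
print("PIC:", pic(freqs))
```

Comparing observed with expected heterozygosity per locus is also the usual starting point for an HWE check, which is one of the marker exclusion criteria the abstract mentions.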
Microfluidic droplet enrichment for targeted sequencing
Eastburn, Dennis J.; Huang, Yong; Pellegrino, Maurizio; Sciambi, Adam; Ptáček, Louis J.; Abate, Adam R.
2015-01-01
Targeted sequence enrichment enables better identification of genetic variation by providing increased sequencing coverage for genomic regions of interest. Here, we report the development of a new target enrichment technology that is highly differentiated from other approaches currently in use. Our method, MESA (Microfluidic droplet Enrichment for Sequence Analysis), isolates genomic DNA fragments in microfluidic droplets and performs TaqMan PCR reactions to identify droplets containing a desired target sequence. The TaqMan positive droplets are subsequently recovered via dielectrophoretic sorting, and the TaqMan amplicons are removed enzymatically prior to sequencing. We demonstrated the utility of this approach by generating an average 31.6-fold sequence enrichment across 250 kb of targeted genomic DNA from five unique genomic loci. Significantly, this enrichment enabled a more comprehensive identification of genetic polymorphisms within the targeted loci. MESA requires low amounts of input DNA, minimal prior locus sequence information and enriches the target region without PCR bias or artifacts. These features make it well suited for the study of genetic variation in a number of research and diagnostic applications. PMID:25873629
Antonini, S R; N'Diaye, N; Baldacchino, V; Hamet, P; Tremblay, J; Lacroix, A
2004-07-01
Gastric inhibitory polypeptide (GIP)-dependent Cushing's syndrome (CS) results from the ectopic expression of non-mutated GIP receptor (hGIPR) in the adrenal cortex. We evaluated whether mutations or polymorphisms in the regulatory region of the GIPR gene could lead to this aberrant expression. We studied 9.0 kb upstream and 1.3 kb downstream of the GIPR gene putative promoter (pProm) by sequencing leukocyte DNA from controls and from adrenal tissues of GIP- and non-GIP-dependent CS patients. The putative proximal promoter region (800 bp) and the first exon and intron of the hGIPR gene were sequenced on adrenal DNA from nine GIP-dependent CS patients, as well as on leukocyte DNA of nine normal controls. Three variations in this region were found in all patients and controls; at position -4/-5, an insertion of a T was seen in four out of nine patients and in five out of nine controls. Transient transfection studies conducted in rat GC and mouse Y1 cells showed that the TT allele confers a 40% loss of promoter activity. The analysis of the 8-kb distal pProm region revealed eight distal single nucleotide polymorphisms (SNPs) without probable association with the disease, since frequencies in patients and controls were very similar. In conclusion, mutations or SNPs in the regulatory region of the GIPR gene are unlikely to underlie GIP-dependent CS. Copyright 2004 Elsevier Ltd.
Windsor, Aaron J.; Schranz, M. Eric; Formanová, Nataša; Gebauer-Jung, Steffi; Bishop, John G.; Schnabelrauch, Domenica; Kroymann, Juergen; Mitchell-Olds, Thomas
2006-01-01
Comparative genomics provides insight into the evolutionary dynamics that shape discrete sequences as well as whole genomes. To advance comparative genomics within the Brassicaceae, we have end sequenced 23,136 medium-sized insert clones from Boechera stricta, a wild relative of Arabidopsis (Arabidopsis thaliana). A significant proportion of these sequences, 18,797, are nonredundant and display highly significant similarity (BLASTn e-value ≤ 10^−30) to low copy number Arabidopsis genomic regions, including more than 9,000 annotated coding sequences. We have used this dataset to identify orthologous gene pairs in the two species and to perform a global comparison of DNA regions 5′ to annotated coding regions. On average, the 500 nucleotides upstream to coding sequences display 71.4% identity between the two species. In a similar analysis, 61.4% identity was observed between 5′ noncoding sequences of Brassica oleracea and Arabidopsis, indicating that regulatory regions are not as diverged among these lineages as previously anticipated. By mapping the B. stricta end sequences onto the Arabidopsis genome, we have identified nearly 2,000 conserved blocks of microsynteny (bracketing 26% of the Arabidopsis genome). A comparison of fully sequenced B. stricta inserts to their homologous Arabidopsis genomic regions indicates that indel polymorphisms >5 kb contribute substantially to the genome size difference observed between the two species. Further, we demonstrate that microsynteny inferred from end-sequence data can be applied to the rapid identification and cloning of genomic regions of interest from nonmodel species. These results suggest that among diploid relatives of Arabidopsis, small- to medium-scale shotgun sequencing approaches can provide rapid and cost-effective benefits to evolutionary and/or functional comparative genomic frameworks. PMID:16607030
Staes, Nicky; Stevens, Jeroen M. G.; Helsen, Philippe; Hillyer, Mia; Korody, Marisa; Eens, Marcel
2014-01-01
Recent literature has revealed the importance of variation in neuropeptide receptor gene sequences in the regulation of behavioral phenotypic variation. Here we focus on polymorphisms in the oxytocin receptor gene (OXTR) and vasopressin receptor gene 1a (Avpr1a) in chimpanzees and bonobos. In humans, a single nucleotide polymorphism (SNP) in the third intron of OXTR (rs53576 SNP (A/G)) is linked with social behavior, with the risk allele (A) carriers showing reduced levels of empathy and prosociality. Bonobos and chimpanzees differ in these same traits, therefore we hypothesized that these differences might be reflected in variation at the rs53576 position. We sequenced a 320 bp region surrounding rs53576 but found no indications of this SNP in the genus Pan. However, we identified previously unreported SNP variation in the chimpanzee OXTR sequence that differs from both humans and bonobos. Humans and bonobos have previously been shown to have a more similar 5′ promoter region of Avpr1a when compared to chimpanzees, who are polymorphic for the deletion of ∼360 bp in this region (+/− DupB) which includes a microsatellite (RS3). RS3 has been linked with variation in levels of social bonding, potentially explaining part of the interspecies behavioral differences found in bonobos, chimpanzees and humans. To date, results for bonobos have been based on small sample sizes. Our results confirmed that there is no DupB deletion in bonobos with a sample size comprising approximately 90% of the captive founder population, whereas in chimpanzees the deletion of DupB had the highest frequency. Because of the higher frequency of DupB alleles in our bonobo population, we suggest that the presence of this microsatellite may partly reflect documented differences in levels of sociability found in bonobos and chimpanzees. PMID:25405348
Espin‐Garcia, Osvaldo; Craiu, Radu V.
2017-01-01
We evaluate two‐phase designs to follow up findings from a genome‐wide association study (GWAS) when the cost of regional sequencing in the entire cohort is prohibitive. We develop novel expectation‐maximization‐based inference under a semiparametric maximum likelihood formulation tailored for post‐GWAS inference. A GWAS‐SNP (where SNP is single nucleotide polymorphism) serves as a surrogate covariate in inferring association between a sequence variant and a normally distributed quantitative trait (QT). We assess test validity and quantify efficiency and power of joint QT‐SNP‐dependent sampling and analysis under alternative sample allocations by simulations. Joint allocation balanced on SNP genotype and extreme‐QT strata yields significant power improvements compared to marginal QT‐ or SNP‐based allocations. We illustrate the proposed method and evaluate the sensitivity of sample allocation to sampling variation using data from a sequencing study of systolic blood pressure. PMID:29239496
Sequence analysis reveals genomic factors affecting EST-SSR primer performance and polymorphism
USDA-ARS's Scientific Manuscript database
Search for simple sequence repeat (SSR) motifs and design of flanking primers in expressed sequence tag (EST) sequences can be easily done at a large scale using bioinformatics programs. However, failed amplification and/or detection, along with lack of polymorphism, is often seen among randomly sel...
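The first step this abstract refers to, searching sequences for SSR motifs, can be sketched in a few lines of Python. This is an illustrative regular-expression scan for perfect di-, tri- and tetranucleotide repeats, not the bioinformatics program used in the study; the function names and the minimum repeat counts are assumptions chosen for the example.

```python
import re

# Assumed minimum repeat counts per motif length (hypothetical thresholds).
DEFAULT_MIN_REPEATS = {2: 6, 3: 5, 4: 4}


def is_primitive(motif):
    """True if the motif is not itself a repetition of a shorter unit."""
    n = len(motif)
    return not any(
        n % k == 0 and motif == motif[:k] * (n // k) for k in range(1, n)
    )


def find_ssrs(sequence, min_repeats=None):
    """Return a list of (start, motif, repeat_count) for perfect SSRs.

    start is a 0-based offset into the sequence.
    """
    if min_repeats is None:
        min_repeats = DEFAULT_MIN_REPEATS
    sequence = sequence.upper()
    hits = []
    for motif_len, min_n in sorted(min_repeats.items()):
        # Group 2 is the motif; group 1 is the whole run of tandem copies.
        pattern = re.compile(
            r"(([ACGT]{%d})\2{%d,})" % (motif_len, min_n - 1)
        )
        for match in pattern.finditer(sequence):
            motif = match.group(2)
            if is_primitive(motif):
                run_length = len(match.group(1))
                hits.append((match.start(), motif, run_length // motif_len))
    return hits


# Hypothetical EST fragment containing (AG)7 and (CAT)5.
est = "TTGAC" + "AG" * 7 + "CTG" + "CAT" * 5 + "GG"
for start, motif, count in find_ssrs(est):
    print(start, motif, count)
```

Flanking primers would then be designed on the sequence either side of each reported run; that step depends on a primer design tool and is outside this sketch.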
Archer, Simon N; Carpen, Jayshan D; Gibson, Mark; Lim, Gim Hui; Johnston, Jonathan D; Skene, Debra J; von Schantz, Malcolm
2010-05-01
The aim was to screen the PER3 promoter for polymorphisms and investigate the phenotypic associations of these polymorphisms with diurnal preference and delayed sleep phase disorder/syndrome (DSPD/DSPS), as well as their effects on reporter gene expression. Interspecific comparison was used to define the approximate extent of the PER3 promoter as the region between the transcriptional start site and nucleotide position -874. This region was screened in DNA pools using PCR and direct sequencing, which was also used to screen DNA from individual participants. The different promoter alleles were cloned into a luciferase expression vector and a deletion library created. Promoter activation was measured by chemiluminescence. DNA samples were obtained from volunteers with defined diurnal preference (3 × 80, selected from a pool of 1,590), and DSPD patients (n = 23). We verified three single nucleotide polymorphisms (G -320T, C -319A, G -294A), and found a novel variable number tandem repeat (VNTR) polymorphism (-318 1/2 VNTR). The -320T and -319A alleles occurred more frequently in DSPD compared to morning (P = 0.042 for each) or evening types (P = 0.006 and 0.033). The allele combination TA2G was more prevalent in DSPD compared to morning (P = 0.033) or evening types (P = 0.002). Luciferase expression driven by the TA2G combination was greater than for the more common GC2A (P < 0.05) and the rarer TA1G (P < 0.001) combinations. Deletion reporter constructs identified two enhancer regions (-703 to -605, and -283 to -80). Polymorphisms in the PER3 promoter could affect its expression, leading to potential differences in the observed functions of PER3.
[Clustered regularly interspaced short palindromic repeats (CRISPR) site in Bacillus anthracis].
Gao, Zhiqi; Wang, Dongshu; Feng, Erling; Wang, Bingxiang; Hui, Yiming; Han, Shaobo; Jiao, Lei; Liu, Xiankai; Wang, Hengliang
2014-11-04
To investigate the polymorphism of clustered regularly interspaced short palindromic repeats (CRISPR) in Bacillus anthracis and its application to molecular typing of B. anthracis. We downloaded the whole genome sequences of 6 B. anthracis strains and extracted the CRISPR sites. We designed primers for the CRISPR sites, amplified the CRISPR fragments in 193 B. anthracis strains by PCR and sequenced these fragments. In order to reveal the polymorphism of CRISPR in B. anthracis, we aligned all the extracted sequences and sequencing results by local BLAST. At the same time, we also analyzed the CRISPR sites in B. cereus and B. thuringiensis. We did not find any polymorphism of CRISPR in B. anthracis. The molecular typing approach based on CRISPR polymorphism is not suitable for B. anthracis, but it can be used to distinguish B. anthracis from B. cereus and B. thuringiensis.
Polymorphism and methylation of the MC4R gene in obese and non-obese dogs.
Mankowska, Monika; Nowacka-Woszuk, Joanna; Graczyk, Aneta; Ciazynska, Paulina; Stachowiak, Monika; Switonski, Marek
2017-08-01
The dog is considered to be a useful biomedical model for human diseases and disorders, including obesity. One of the numerous genes associated with human polygenic obesity is MC4R, encoding the melanocortin 4 receptor. The aim of our study was to analyze polymorphisms and methylation of the canine MC4R in relation to adiposity. Altogether 270 dogs representing four breeds predisposed to obesity: Labrador Retriever (n = 187), Golden Retriever (n = 38), Beagle (n = 28) and Cocker Spaniel (n = 17), were studied. The dogs were classified into three groups: lean, overweight and obese, according to the 5-point Body Condition Score (BCS) scale. In the cohort of Labradors, complete phenotypic data (age, sex, neutering status, body weight and BCS) were collected for 127 dogs. The entire coding sequence as well as 5' and 3'-flanking regions of the studied gene were sequenced and six polymorphic sites were reported. Genotype frequencies differed considerably between breeds and Labrador Retrievers appeared to be the least polymorphic. Moreover, distribution of some polymorphic variants differed significantly (P < 0.05) between small cohorts with diverse BCS in Golden Retrievers (c.777T>C, c.868C>T and c.*33C>G) and Beagles (c.-435T>C and c.637G>T). In contrast, in Labradors no association between the studied polymorphisms and BCS or body weight was observed. Methylation analysis, using bisulfite DNA conversion followed by Sanger sequencing, was carried out for 12 dogs with BCS = 3 and 12 dogs with BCS = 5. Two intragenic CpG islands, containing 19 cytosines, were analyzed and the methylation profile did not differ significantly between lean and obese animals. We conclude that an association of the MC4R gene polymorphism with dog obesity or body weight is unlikely, in spite of the fact that some associations were found in small cohorts of Beagles and Golden Retrievers. The methylation level of this gene is also not related to dog adiposity.
Comparative analysis of myostatin gene and promoter sequences of Qinchuan and Red Angus cattle.
He, Y L; Wu, Y H; Quan, F S; Liu, Y G; Zhang, Y
2013-09-04
To better understand the function of the myostatin gene and its promoter region in bovine, we amplified and sequenced the myostatin gene and promoter from the blood of Qinchuan and Red Angus cattle by using polymerase chain reaction. The sequences of Qinchuan and Red Angus cattle were compared with those of other cattle breeds available in GenBank. Exon splice sites were confirmed by mRNA sequencing. Compared to the published sequence (GenBank accession No. AF320998), 69 single nucleotide polymorphisms (SNPs) were identified in the Qinchuan myostatin gene, only one of which was an insertion mutation in Qinchuan cattle. There was a 16-bp insertion in the first 705-bp intron in 3 Qinchuan cattle. A total of 7 SNPs were identified in exon 3, in which the mutation occurred in the third base of the codon and was synonymous. On comparing the Qinchuan myostatin gene sequence to that of Red Angus cattle, a total of 50 SNPs were identified in the first and third exons. In addition, 18 SNPs were identified in the Qinchuan cattle promoter region compared with those of other cattle breeds (GenBank accession No. AF348479), but only 14 SNPs when compared to the Red Angus cattle myostatin promoter region.
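The kind of pairwise comparison reported here (counting substitutions and insertion/deletion differences against a reference sequence) can be sketched as follows. This is a minimal Python illustration that assumes the two sequences have already been aligned to equal length with "-" as the gap character; the function name and the toy sequences are hypothetical and are not taken from the GenBank records cited in the abstract.

```python
def compare_aligned(reference, query, gap="-"):
    """Compare two pre-aligned sequences of equal length.

    Returns (snps, indel_columns), where snps is a list of
    (1-based alignment position, reference base, query base) and
    indel_columns counts alignment columns with a gap in either sequence.
    """
    if len(reference) != len(query):
        raise ValueError("sequences must be aligned to the same length")
    snps = []
    indel_columns = 0
    for position, (ref_base, query_base) in enumerate(
        zip(reference.upper(), query.upper()), start=1
    ):
        if ref_base == gap or query_base == gap:
            indel_columns += 1
        elif ref_base != query_base:
            snps.append((position, ref_base, query_base))
    return snps, indel_columns


# Toy alignment: two substitutions and a one-column insertion in the query.
reference = "ATGCA-GTTA"
query = "ATGTAAGTCA"
snps, indel_columns = compare_aligned(reference, query)
print(snps)
print(indel_columns)
```

Positions here are alignment columns; mapping them back to coordinates in the reference record would require skipping gap columns in the reference.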
KWASNIEWSKI, WOJCIECH; GOZDZICKA-JOZEFIAK, ANNA; WOLUN-CHOLEWA, MARIA; POLAK, GRZEGORZ; SIEROCINSKA-SAWA, JADWIGA; KWASNIEWSKA, ANNA; KOTARSKI, JAN
2016-01-01
Endometrial carcinoma (EC) is the most common type of gynecological malignancy. Studies have demonstrated that the insulin growth factor (IGF) pathway is implicated in the development of endometrial tumors and that the serum levels of IGF-1 are affected by estrogen. Most EC cells with high microsatellite instability (MSI-H) accumulate mutations at a microsatellite sequence in the IGF-1 gene. The present study investigated the CA repeat polymorphism in the P1 promoter region of the IGF-1 gene among Caucasian females with endometrial hyperplasia, EC and healthy control subjects, whose blood serum and surgical tissue specimens were analyzed. Differences or correlations between the analyzed parameters [serum levels of IGF-1 and IGF binding protein (IGFBP)-1 and IGFBP-3 as well as estrogens among the polymorphisms] were verified using the χ2, Mann-Whitney U, Kruskal-Wallis or Spearman's rank correlation tests. A PCR amplification and DNA sequencing analysis was used for identification of (CA)n repeats in the P1 region of IGF-1. ELISA was used to determine the blood serum levels of IGF-1, IGFBP-1, IGFBP-3 and estrogens. Furthermore, IGF-1 was assessed in endometrial tissues by immunohistochemical analysis. The present study indicated no statistically significant differences between serum levels of IGF-1, IGFBP-1, IGFBP-3 and estrone, estriol and estradiol in the control and study groups. A significant correlation was identified between the IGF-1 levels and estrone levels in the MSI-H polymorphism (r=−0.41, P=0.012) as well as a highly negative correlation between IGF-1 levels and the estradiol levels in the MSI-H polymorphism (r=−0.6, P=0.002). Genotypes without the 19 CA allele were predominantly found in EC. 
Furthermore, statistical analysis indicated that the number of IGF-1-expressing cells was significantly elevated in MSI-H type 18-20 (P=0.0072), MSI-L type 19-20 (P=0.025) and microsatellite-stable MSS type 19-19 (P=0.024) compared with those in the MSI-H 20-20 genotype. The present study suggests that polymorphisms in the IGF-1 promoter are likely associated with the development of EC in Caucasian females. In the present study, polymorphisms of the IGF-1 promoter may have been introduced during the genesis of EC and contributed to it by leading to aberrant expression of IGF-1. PMID:27121258
Fingerprinting of HLA class I genes for improved selection of unrelated bone marrow donors.
Martinelli, G; Farabegoli, P; Buzzi, M; Panzica, G; Zaccaria, A; Bandini, G; Calori, E; Testoni, N; Rosti, G; Conte, R; Remiddi, C; Salvucci, M; De Vivo, A; Tura, S
1996-02-01
The degree of matching of HLA genes between the selected donor and recipient is an important aspect of the selection of unrelated donors for allogeneic bone marrow transplantation (UBMT). The most sensitive methods currently used are serological typing of HLA class I genes, mixed lymphocyte culture (MLC), IEF and molecular genotyping of HLA class II genes by direct sequencing of PCR products. Serological typing of class I antigens (A, B and C) fails to detect minor differences demonstrated by direct sequencing of DNA polymorphic regions. Molecular genotyping of HLA class I genes by DNA analysis is costly and work-intensive. To improve compatibility between donor and recipient, we have set up a new rapid and non-radioisotopic application of the 'fingerprinting PCR' technique for the analysis of the polymorphic second exon of the HLA class I A, B and C genes. This technique is based on the formation of specific patterns (PCR fingerprints) of homoduplexes and heteroduplexes between heterologous amplified DNA sequences. After an electrophoretic run on non-denaturing polyacrylamide gel, different HLA class I types give allele-specific banding patterns. HLA class I matching is performed, after the gel has been soaked in ethidium bromide or silver-stained, by visual comparison of patients' fingerprints with those of donors. Identity can be confirmed by mixing donor and recipient DNAs in an amplification cross-match. To assess the technique, 10 normal samples, 22 related allogeneic bone marrow transplanted pairs and 10 unrelated HLA-A and HLA-B serologically matched patient-donor pairs were analysed for HLA class I polymorphic regions. In all the related pairs and in 1/10 unrelated pairs, matched donor-recipient patterns were identified. This new application of PCR fingerprinting may confirm the HLA class I serological selection of unrelated marrow donors.
Kaushik, Mahima; Kukreti, Shrikant
2015-01-01
Our previous work on structural polymorphism shown at a single nucleotide polymorphism (SNP) (A → G) site located on HS4 region of locus control region (LCR) of β-globin gene has established a hairpin → duplex equilibrium corresponding to A → B like DNA transition (Kaushik M, Kukreti, R., Grover, D., Brahmachari, S.K. and Kukreti S. Nucleic Acids Res. 2003; Kaushik M, Kukreti S. Nucleic Acids Res. 2006). The G-allele of A → G SNP has been shown to be significantly associated with the occurrence of β-thalassemia. Considering the significance of this 11-nt long quasi-palindromic sequence [5'-TGGGG(G/A)CCCCA; HP(G/A)11] of β-globin gene LCR, we further explored the differential behavior of the same DNA sequence with its RNA counterpart, using various biophysical and biochemical techniques. In contrast to its DNA counterpart, which exhibits an A → B structural transition and an equilibrium between duplex and hairpin forms, the studied RNA oligonucleotide sequence [5'-UGGGG(G/A)CCCCA; RHP(G/A)11] existed only in duplex form (A-conformation) and did not form a hairpin. The single residue difference from A to G led to the unusual thermal stability of the RNA structure formed by the studied sequence. Since naturally occurring mutations and various SNP sites may stabilize or destabilize the local DNA/RNA secondary structures, these structural transitions may affect gene expression by changing the protein-DNA recognition patterns.
Kumar, Akash; Dougherty, Max; Findlay, Gregory M; Geisheker, Madeleine; Klein, Jason; Lazar, John; Machkovech, Heather; Resnick, Jesse; Resnick, Rebecca; Salter, Alexander I; Talebi-Liasi, Faezeh; Arakawa, Christopher; Baudin, Jacob; Bogaard, Andrew; Salesky, Rebecca; Zhou, Qian; Smith, Kelly; Clark, John I; Shendure, Jay; Horwitz, Marshall S
2014-01-01
Even in cases where there is no obvious family history of disease, genome sequencing may contribute to clinical diagnosis and management. Clinical application of the genome has not yet become routine, however, in part because physicians are still learning how best to utilize such information. As an educational research exercise performed in conjunction with our medical school human anatomy course, we explored the potential utility of determining the whole genome sequence of a patient who had died following a clinical diagnosis of idiopathic pulmonary fibrosis (IPF). Medical students performed dissection and whole genome sequencing of the cadaver. Gross and microscopic findings were more consistent with the fibrosing variant of nonspecific interstitial pneumonia (NSIP), as opposed to IPF per se. Variants in genes causing Mendelian disorders predisposing to IPF were not detected. However, whole genome sequencing identified several common variants associated with IPF, including a single nucleotide polymorphism (SNP), rs35705950, located in the promoter region of the gene encoding mucin glycoprotein MUC5B. The MUC5B promoter polymorphism was recently found to markedly elevate risk for IPF, though a particular association with NSIP has not been previously reported, nor has its contribution to disease risk previously been evaluated in the genome-wide context of all genetic variants. We did not identify additional predicted functional variants in a region of linkage disequilibrium (LD) adjacent to MUC5B, nor did we discover other likely risk-contributing variants elsewhere in the genome. Whole genome sequencing thus corroborates the association of rs35705950 with MUC5B dysregulation and interstitial lung disease. This novel exercise additionally served a unique mission in bridging clinical and basic science education.
Chelomina, Galina N; Rozhkovan, Konstantin V; Voronova, Anastasia N; Burundukova, Olga L; Muzarok, Tamara I; Zhuravlev, Yuri N
2016-04-01
Wild ginseng, Panax ginseng Meyer, is an endangered species of medicinal plants. In the present study, we analyzed variations within the ribosomal DNA (rDNA) cluster to gain insight into the genetic diversity of the Oriental ginseng, P. ginseng, under artificial cultivation. The roots of wild P. ginseng plants were sampled from a nonprotected natural population of the Russian Far East. The slides were prepared from leaf tissues using the squash technique for cytogenetic analysis. The 18S rDNA sequences were cloned and sequenced. The distribution of nucleotide diversity, recombination events, and interspecific phylogenies for the total 18S rDNA sequence data set was also examined. In mesophyll cells, mononucleolar nuclei were estimated to be dominant (75.7%), while the remaining nuclei contained two to four nucleoli. Among the analyzed 18S rDNA clones, 20% were identical to the 18S rDNA sequence of P. ginseng from Japan, and other clones differed in one to six substitutions. The nucleotide polymorphism was more pronounced at positions 440-640 bp, and was distributed in variable regions, expansion segments, and conserved elements of the core structure. The phylogenetic analysis confirmed conspecificity of ginseng plants cultivated in different regions, with two fixed mutations between P. ginseng and other species. This study identified evidence of intragenomic nucleotide polymorphism in the 18S rDNA sequences of P. ginseng. These data suggest that, in cultivated plants, the observed genome instability may influence the synthesis of biologically active compounds, which are widely used in traditional medicine.
Chelomina, Galina N.; Rozhkovan, Konstantin V.; Voronova, Anastasia N.; Burundukova, Olga L.; Muzarok, Tamara I.; Zhuravlev, Yuri N.
2015-01-01
Background Wild ginseng, Panax ginseng Meyer, is an endangered species of medicinal plants. In the present study, we analyzed variations within the ribosomal DNA (rDNA) cluster to gain insight into the genetic diversity of the Oriental ginseng, P. ginseng, under artificial cultivation. Methods The roots of wild P. ginseng plants were sampled from a nonprotected natural population of the Russian Far East. The slides were prepared from leaf tissues using the squash technique for cytogenetic analysis. The 18S rDNA sequences were cloned and sequenced. The distribution of nucleotide diversity, recombination events, and interspecific phylogenies for the total 18S rDNA sequence data set was also examined. Results In mesophyll cells, mononucleolar nuclei were estimated to be dominant (75.7%), while the remaining nuclei contained two to four nucleoli. Among the analyzed 18S rDNA clones, 20% were identical to the 18S rDNA sequence of P. ginseng from Japan, and other clones differed in one to six substitutions. The nucleotide polymorphism was more pronounced at positions 440–640 bp, and was distributed in variable regions, expansion segments, and conserved elements of the core structure. The phylogenetic analysis confirmed conspecificity of ginseng plants cultivated in different regions, with two fixed mutations between P. ginseng and other species. Conclusion This study identified evidence of intragenomic nucleotide polymorphism in the 18S rDNA sequences of P. ginseng. These data suggest that, in cultivated plants, the observed genome instability may influence the synthesis of biologically active compounds, which are widely used in traditional medicine. PMID:27158239
Development of a set of SNP markers present in expressed genes of the apple.
Chagné, David; Gasic, Ksenija; Crowhurst, Ross N; Han, Yuepeng; Bassett, Heather C; Bowatte, Deepa R; Lawrence, Timothy J; Rikkerink, Erik H A; Gardiner, Susan E; Korban, Schuyler S
2008-11-01
Molecular markers associated with gene coding regions are useful tools for bridging functional and structural genomics. Due to their high abundance in plant genomes, single nucleotide polymorphisms (SNPs) are present within virtually all genomic regions, including most coding sequences. The objective of this study was to develop a set of SNPs for the apple by taking advantage of the wealth of genomics resources available for the apple, including a large collection of expressed sequence tags (ESTs). Using bioinformatics tools, a search for SNPs within an EST database of approximately 350,000 sequences developed from a variety of apple accessions was conducted. This resulted in the identification of a total of 71,482 putative SNPs. As the apple genome is reported to be an ancient polyploid, attempts were made to verify whether those SNPs detected in silico were attributable either to allelic polymorphisms or to gene duplication or paralogous or homeologous sequence variations. To this end, a set of 464 PCR primer pairs was designed, amplification was performed on two subsets of plants, and the PCR products were sequenced. The SNPs retrieved from these sequences were then mapped onto apple genetic maps, including a newly constructed map of a Royal Gala x A689-24 cross and a Malling 9 x Robusta 5 map, using a bin mapping strategy. The SNP genotyping was performed using the high-resolution melting (HRM) technique. A total of 93 new markers containing 210 coding SNPs were successfully mapped. This new set of SNP markers for the apple offers new opportunities for understanding the genetic control of important horticultural traits using quantitative trait loci (QTL) or linkage disequilibrium analysis. These also serve as useful markers for aligning physical and genetic maps, and as potential transferable markers across the Rosaceae family.
Chen, Jie; Moinard, Magalie; Xu, Jianping; Wang, Shouxian; Foulongne-Oriol, Marie; Zhao, Ruilin; Hyde, Kevin D.; Callac, Philippe
2016-01-01
The internal transcribed spacer (ITS) region of the nuclear ribosomal RNA gene cluster is widely used in fungal taxonomy and phylogeographic studies. The medicinal and edible mushroom Agaricus subrufescens has a worldwide distribution with a high level of polymorphism in the ITS region. A previous analysis suggested notable ITS sequence heterogeneity within the wild French isolate CA487. The objective of this study was to investigate the pattern and potential mechanism of ITS sequence heterogeneity within this strain. Using PCR, cloning, and sequencing, we identified three types of ITS sequences, A, B, and C with a balanced distribution, which differed from each other at 13 polymorphic positions. The phylogenetic comparisons with samples from different continents revealed that the type C sequence was similar to those found in Oceanian and Asian specimens of A. subrufescens while types A and B sequences were close to those found in the Americas or in Europe. We further investigated the inheritance of these three ITS sequence types by analyzing their distribution among single-spore isolates from CA487. In this analysis, three co-dominant markers were used firstly to distinguish the homokaryotic offspring from the heterokaryotic offspring. The homokaryotic offspring were then analyzed for their ITS types. Our genetic analyses revealed that types A and B were two alleles segregating at one locus ITSI, while type C was not allelic with types A and B but was located at another unlinked locus ITSII. Furthermore, type C was present in only one of the two constitutive haploid nuclei (n) of the heterokaryotic (n+n) parent CA487. These data suggest that there was a relatively recent introduction of the type C sequence and a duplication of the ITS locus in this strain. Whether other genes were also transferred and duplicated and their impacts on genome structure and stability remain to be investigated. PMID:27228131
Novel polymorphisms of the APOA2 gene and its promoter region affect body traits in cattle.
Zhou, Yang; Li, Caixia; Cai, Hanfang; Xu, Yao; Lan, Xianyong; Lei, Chuzhao; Chen, Hong
2013-12-01
Apolipoprotein A-II (APOA2) is one of the major constituents of high-density lipoprotein and plays a critical role in lipid metabolism and obesity. However, similar research for the bovine APOA2 gene is lacking. In this study, polymorphisms of the bovine APOA2 gene and its promoter region were detected in 1021 cows from four breeds by sequencing and PCR-RFLP methods. In total, we detected six novel mutations: one in the promoter region, two in the exons and three in the introns. Four polymorphisms within the APOA2 gene were analyzed. Alleles A, T, T and G at the four loci were predominant in the four breeds, whether the loci were analyzed separately or in combination, which suggests that cows carrying those alleles may be better adapted to the steppe environment. The association analysis indicated that three SVs in Nangyang cows, two SVs in Qinchun cows and nine haplotypes in Nangyang cows were significantly associated with body traits (P<0.05 or P<0.01). The results of this study suggest that the bovine APOA2 gene may be a strong candidate gene for body traits in cattle breeding programs. © 2013.
Primary structure of Lep d I, the main Lepidoglyphus destructor allergen.
Varela, J; Ventas, P; Carreira, J; Barbas, J A; Gimenez-Gallego, G; Polo, F
1994-10-01
The most relevant allergen of the storage mite Lepidoglyphus destructor (Lep d I) has been characterized. Lep d I is a monomer protein of 13273 Da. The primary structure of Lep d I was determined by N-terminal Edman degradation and partially confirmed by cDNA sequencing. Sequence polymorphism was observed at six positions, with non-conservative substitutions in three of them. No potential N-glycosylation site was revealed by peptide sequencing. The 125-residue sequence of Lep d I shows approximately 40% identity (including the six cysteines) with the overlapping regions of group II allergens from the genus Dermatophagoides, which, however, do not share common allergenic epitopes with Lep d I.
Genetic Variation of Bordetella pertussis in Austria.
Wagner, Birgit; Melzer, Helen; Freymüller, Georg; Stumvoll, Sabine; Rendi-Wagner, Pamela; Paulke-Korinek, Maria; Repa, Andreas; Mooi, Frits R; Kollaritsch, Herwig; Mittermayer, Helmut; Kessler, Harald H; Stanek, Gerold; Steinborn, Ralf; Duchêne, Michael; Wiedermann, Ursula
2015-01-01
In Austria, vaccination coverage against Bordetella pertussis infections during infancy is estimated at around 90%. Within the last years, however, the number of pertussis cases has increased steadily, not only in children but also in adolescents and adults, indicating both insufficient herd immunity and vaccine coverage. Waning immunity in the host and/or adaptation of the bacterium to the immunised hosts could contribute to the observed re-emergence of pertussis. In this study we therefore addressed the genetic variability in B. pertussis strains from several Austrian cities. Between the years 2002 and 2008, 110 samples were collected from Vienna (n = 32), Linz (n = 63) and Graz (n = 15) by nasopharyngeal swabs. DNA was extracted from the swabs, and bacterial sequence polymorphisms were examined by MLVA (multiple-locus variable number of tandem repeat analysis) (n = 77), by PCR amplification and conventional Sanger sequencing of the polymorphic regions of the prn (pertactin) gene (n = 110), and by amplification refractory mutation system quantitative PCR (ARMS-qPCR) (n = 110) to directly address polymorphisms in the genes encoding two pertussis toxin subunits (ptxA and ptxB), a fimbrial adhesin (fimD), tracheal colonisation factor (tcfA), and the virulence sensor protein (bvgS). Finally, the ptxP promoter region was screened by ARMS-qPCR for the presence of the ptxP3 allele, which has been associated with elevated production of pertussis toxin. The MLVA analysis revealed the highest level of polymorphisms with an absence of MLVA Type 29, which is found outside Austria. Only Prn subtypes Prn1/7, Prn2 and Prn3 were found with a predominance of the non-vaccine type Prn2. The analysis of the ptxA, ptxB, fimD, tcfA and bvgS polymorphisms showed a genotype mixed between the vaccine strain Tohama I and a clinical isolate from 2006 (L517). The major part of the samples (93%) displayed the ptxP3 allele. The consequences for the vaccination strategy are discussed.
Henning, Frederico; Renz, Adina Josepha; Fukamachi, Shoji; Meyer, Axel
2010-05-01
Natural populations of the Midas cichlid species in several different crater lakes in Nicaragua exhibit a conspicuous color polymorphism. Most individuals are dark and the remainder have a gold coloration. The color morphs mate assortatively and sympatric population differentiation has been shown based on neutral molecular data. We investigated the color polymorphism using segregation analysis and a candidate gene approach. The segregation patterns observed in a mapping cross between a gold and a dark individual were consistent with a single dominant gene as a cause of the gold phenotype. This suggests that a simple genetic architecture underlies some of the speciation events in the Midas cichlids. We compared the expression levels of several candidate color genes (Mc1r, Ednrb1, Slc45a2, and Tfap1a) between the color morphs. Mc1r was found to be upregulated in the gold morph. Given its widespread association with color evolution and its role in melanin synthesis, the Mc1r locus was further investigated using sequences derived from a genomic library. Comparative analysis revealed conserved synteny in relation to the majority of teleosts and highlighted several previously unidentified conserved non-coding elements (CNEs) in the upstream and downstream regions in the vicinity of Mc1r. The identification of the CNE regions allowed the comparison of sequences from gold and dark specimens of natural populations. No polymorphisms were found between gold and dark specimens in the population sample, and Mc1r showed no linkage to the gold phenotype in the mapping cross, demonstrating that it is not causally related to the color polymorphism in the Midas cichlid.
Bimolata, Waikhom; Kumar, Anirudh; Sundaram, Raman Meenakshi; Laha, Gouri Shankar; Qureshi, Insaf Ahmed; Reddy, Gajjala Ashok; Ghazi, Irfan Ahmad
2013-08-01
Xa27 is one of the important R-genes, effective against bacterial blight disease of rice caused by Xanthomonas oryzae pv. oryzae (Xoo). Using natural population of Oryza, we analyzed the sequence variation in the functionally important domains of Xa27 across the Oryza species. DNA sequences of Xa27 alleles from 27 rice accessions revealed higher nucleotide diversity among the reported R-genes of rice. Sequence polymorphism analysis revealed synonymous and non-synonymous mutations in addition to a number of InDels in non-coding regions of the gene. High sequence variation was observed in the promoter region including the 5'UTR, with π = 0.00916 and θw = 0.01785. Comparative analysis of the identified Xa27 alleles with that of IRBB27 and IR24 indicated the operation of both positive selection (Ka/Ks > 1) and neutral selection (Ka/Ks ≈ 0). The genetic distances of alleles of the gene from Oryza nivara were nearer to IRBB27 as compared to IR24. We also found the presence of conserved and null UPT (upregulated by transcriptional activator) box in the isolated alleles. Considerable amino acid polymorphism was localized in the trans-membrane domain for which the functional significance is yet to be elucidated. However, the absence of functional UPT box in all the alleles except IRBB27 suggests the maintenance of single resistant allele throughout the natural population.
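The two diversity statistics quoted in this abstract, nucleotide diversity (π) and Watterson's estimator (θw), can be illustrated with a minimal sketch. It assumes a gap-free alignment of equal-length sequences and reports both values per site; the function names and the toy alignment are hypothetical and are not taken from the study, which does not describe its software in the abstract.

```python
from itertools import combinations

def segregating_sites(seqs):
    """Count alignment columns that contain more than one distinct base."""
    return sum(1 for column in zip(*seqs) if len(set(column)) > 1)

def nucleotide_diversity(seqs):
    """Pi: mean number of pairwise differences, divided by alignment length."""
    length = len(seqs[0])
    pairs = list(combinations(seqs, 2))
    total_diffs = sum(sum(a != b for a, b in zip(s1, s2)) for s1, s2 in pairs)
    return total_diffs / (len(pairs) * length)

def watterson_theta(seqs):
    """Theta_w per site: S / (a_n * L), with a_n = sum of 1/i for i in 1..n-1."""
    n, length = len(seqs), len(seqs[0])
    a_n = sum(1.0 / i for i in range(1, n))
    return segregating_sites(seqs) / (a_n * length)

# Toy alignment (hypothetical data, for illustration only).
toy = ["AAAA", "AAAT", "AATT"]
print(nucleotide_diversity(toy), watterson_theta(toy))
```

When θw exceeds π, as in the values reported above, the sample contains an excess of low-frequency variants relative to the neutral expectation; real analyses would also need to handle gaps and missing data, which this sketch ignores.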
DOE Office of Scientific and Technical Information (OSTI.GOV)
Labbe, Jessy L; Murat, Claude; Morin, Emmanuelle
It is becoming clear that simple sequence repeats (SSRs) play a significant role in fungal genome organization, and they are a large source of genetic markers for population genetics and meiotic maps. We identified SSRs in the Laccaria bicolor genome by in silico survey and analyzed their distribution in the different genomic regions. We also compared the abundance and distribution of SSRs in L. bicolor with those of the following fungal genomes: Phanerochaete chrysosporium, Coprinopsis cinerea, Ustilago maydis, Cryptococcus neoformans, Aspergillus nidulans, Magnaporthe grisea, Neurospora crassa and Saccharomyces cerevisiae. Using the MISA computer program, we detected 277,062 SSRs in the L. bicolor genome representing 8% of the assembled genomic sequence. Among the analyzed basidiomycetes, L. bicolor exhibited the highest SSR density although no correlation between relative abundance and the genome sizes was observed. In most genomes the short motifs (mono- to trinucleotides) were more abundant than the longer repeated SSRs. Generally, in each organism, the occurrence, relative abundance, and relative density of SSRs decreased as the repeat unit increased. Furthermore, each organism had its own common and longest SSRs. In the L. bicolor genome, most of the SSRs were located in intergenic regions (73.3%) and the highest SSR density was observed in transposable elements (TEs; 6,706 SSRs/Mb). However, 81% of the protein-coding genes contained SSRs in their exons, suggesting that SSR polymorphism may alter gene phenotypes. Within a L. bicolor offspring, sequence polymorphism of 78 SSRs was mainly detected in non-TE intergenic regions. Unlike previously developed microsatellite markers, these new ones are spread throughout the genome; these markers could have immediate applications in population genetics.
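The kind of in silico SSR survey described above can be sketched with a simple regular-expression scan. This is not the MISA program itself; it is a minimal illustration, and the minimum repeat counts in `MIN_REPEATS` are assumed values chosen for the example, not the thresholds used in the study.

```python
import re

# Hypothetical minimum repeat counts per motif length (illustrative only).
MIN_REPEATS = {1: 10, 2: 6, 3: 5, 4: 5, 5: 5, 6: 5}

def find_ssrs(seq, min_repeats=MIN_REPEATS):
    """Return sorted (start, motif, repeat_count) tuples for perfect SSRs."""
    seq = seq.upper()
    hits = []
    for k, min_rep in sorted(min_repeats.items()):
        # A k-mer followed by at least (min_rep - 1) further copies of itself.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (k, min_rep - 1))
        for match in pattern.finditer(seq):
            motif = match.group(1)
            # Skip motifs that are repeats of a shorter unit (e.g. "AA", "ATAT"),
            # since those runs belong to the shorter motif class.
            if any(motif == motif[:d] * (k // d) for d in range(1, k) if k % d == 0):
                continue
            hits.append((match.start(), motif, len(match.group(0)) // k))
    return sorted(hits)

# Toy sequence (hypothetical) with one di-, one mono- and one trinucleotide SSR.
toy = "GGC" + "AT" * 7 + "CCG" + "A" * 12 + "TTC" + "CAG" * 5 + "T"
print(find_ssrs(toy))
```

SSR density per megabase, as reported for transposable elements above, would then follow from dividing the number of hits in a region by the region length in Mb.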
Sakthivelkumar, S; Ramaraj, P; Veeramani, V; Janarthanan, S
2015-09-01
The present study aimed to determine whether genetic variability exists among populations of Culex quinquefasciatus, information that would be a valuable tool in the management of mosquito control programmes. Populations of Cx. quinquefasciatus collected at different locations in Tamil Nadu were analyzed for genetic variation based on nucleotide sequences of the 28S rDNA D2 region. Despite high nucleotide sequence similarity, a high degree of genetic polymorphism was detected in the predicted secondary structures of the D2 region of 28S rDNA. The findings based on secondary structure using rDNA sequences suggested the existence of a complex genotypic diversity of Cx. quinquefasciatus population collected at different locations of Tamil Nadu, India. This complexity in genetic diversity within a single mosquito population collected at different locations is considered important because it may influence the vector potential of these mosquitoes.
Muangkram, Yuttamol; Amano, Akira; Wajjwalku, Worawidh; Pinyopummintr, Tanu; Thongtip, Nikorn; Kaolim, Nongnid; Sukmak, Manakorn; Kamolnorranath, Sumate; Siriaroonrat, Boripat; Tipkantha, Wanlaya; Maikaew, Umaporn; Thomas, Warisara; Polsrila, Kanda; Dongsaard, Kwanreaun; Sanannu, Saowaphang; Wattananorrasate, Anuwat
2017-07-01
The Asian tapir (Tapirus indicus) has been classified as Endangered on the IUCN Red List of Threatened Species (2008). Genetic diversity data provide important information for the management of captive breeding and conservation of this species. We analyzed mitochondrial control region (CR) sequences from 37 captive Asian tapirs in Thailand. Multiple alignments of the full-length CR sequences (1268 bp) comprised three domains, as described in other mammal species. Analysis of 16 parsimony-informative variable sites revealed 11 haplotypes. Furthermore, the phylogenetic analysis using a median-joining network clearly showed three clades that correlated with our earlier cytochrome b gene study in this endangered species. The repetitive motif is located between the first and second conserved sequence blocks, similar to the Brazilian tapir. The most polymorphic site was located in the extended termination associated sequences domain. The results could inform future genetic management, both in captivity and in the wild, aimed at maintaining stable populations.
USDA-ARS's Scientific Manuscript database
Mexican papita viroid (MPVd), a pospiviroid, causes serious disease outbreaks on greenhouse tomato in North America. Two dominant genotypes (MPV-S and MPV-M), sharing 93.8% sequence identity, incited strikingly different symptom expression (severe and mild) on tomato ‘Rutgers’. To determine genetic ...
Fliegner, R A; Holloway, S A; Lester, S; McLure, C A; Dawkins, R L
2008-08-01
The class II region of the major histocompatibility complex was evaluated in 25 greyhounds by sequence-based typing and the genomic matching technique (GMT). Two new DLA-DRB1 alleles were identified. Twenty-four dogs carried the DLA-DRB1*01201/DQA1*00401/DQB1*01303/DQB1*01701 haplotype, which carries two DQB1 alleles. One haplotype was identified from which DQB1 and DQA1 appeared to be deleted. The GMT enabled detection of DQB1 copy number, discrimination of the different class II haplotypes and the identification of new, possibly biologically relevant polymorphisms.
A candidate gene for choanal atresia in alpaca.
Reed, Kent M; Bauer, Miranda M; Mendoza, Kristelle M; Armién, Aníbal G
2010-03-01
Choanal atresia (CA) is a common nasal craniofacial malformation in New World domestic camelids (alpaca and llama). CA results from abnormal development of the nasal passages and is especially debilitating to newborn crias. CA in camelids shares many of the clinical manifestations of a similar condition in humans (CHARGE syndrome). Herein we report on the regulatory gene CHD7 of alpaca, whose homologue in humans is most frequently associated with CHARGE. Sequence of the CHD7 coding region was obtained from a non-affected cria. The complete coding region was 9003 bp, corresponding to a translated amino acid sequence of 3000 aa. Additional genomic sequences corresponding to a significant portion of the CHD7 gene were identified and assembled from the 2x alpaca whole genome sequence, providing confirmatory sequence for much of the CHD7 coding region. The alpaca CHD7 mRNA sequence was 97.9% similar to the human sequence, with the greatest sequence difference being an insertion in exon 38 that results in a polyalanine repeat (A12). Polymorphism in this repeat was tested for association with CA in alpaca by cloning and sequencing the repeat from both affected and non-affected individuals. Variation in length of the poly-A repeat was not associated with CA. Complete sequencing of the CHD7 gene will be necessary to determine whether other mutations in CHD7 are the cause of CA in camelids.
Bozdoğan, Sevcan Tuğ; Kuran, Gökhan; Yüregir, Özge Özalp; Aslan, Hüseyin; Haytoğlu, Süheyl; Ayaz, Akif; Arıkan, Osman Kürşat
2015-08-01
To date, studies in all populations have shown that mutations in the gene encoding gap junction protein beta 2 (GJB2) play an important role in non-syndromic autosomal recessive congenital hearing loss. The aim of this study was to evaluate the GJB2 gene of patients with hearing loss in our region using the deoxyribonucleic acid (DNA) sequencing method and to demonstrate the region-specific distribution of mutations and polymorphisms. Patients who had bilateral severe sensorineural non-syndromic hearing loss identified by audiologic evaluation were included. Peripheral blood samples were collected and the GJB2 gene exon 1 and exon 2 regions were amplified by polymerase chain reaction (PCR). The PCR products obtained were sequenced by the DNA sequence analysis method (SeqFinder Sequencing System; ABI 3130; Foster City, CA, USA) and analyzed using the SeqScape software. Of the 77 patients, 16 had a homozygous or heterozygous mutation. The 35delG mutation, which is known as the most frequent mutation of the GJB2 gene, was also the most frequently seen mutation, at a ratio of 5.5%, in patients with hearing loss in our region; this was followed by the V27I mutation. As this is the first study conducted by sequence analysis in our region, it is worth presenting because it shows the distribution of mutations.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kalchman, M.; Lin, B.; Nasir, J.
1994-09-01
The mouse homologue of the Huntington disease gene (Hdh) has recently been cloned and mapped to a region of synteny with the human, on mouse chromosome 5. The two genes share a high degree of both coding (90% amino acid) and nucleotide (86.2%) identity. We have subsequently performed a detailed comparison of the genomic organization of the 5′ region of the two genes encompassing the promoter region and first five exons of both the human and mouse genes. The comparative sequence analysis of the promoter region between HD and Hdh reveals two highly conserved regions. One region (-56 to -118; +1 is the ATG start codon) shared 84% nucleotide identity and another region (-130 to -206) had 81% nucleotide identity. Nine putative Sp1 sites appear in the human promoter region contrasted with only 3 in a similar region in the mouse. Furthermore, 17 and 20 base pair direct repeats present in the HD 5′ region are absent in the similar Hdh region. Although both the mouse and human intron/exon boundaries conform to the GT/AG rule, the intron sizes between HD and Hdh are markedly different. The first four introns in Hdh are 15, 7, 5 and 0.5 kb compared to sizes of 10, 15, 7 and 0.5 kb, respectively. Comparison between the mouse and human intronic sequences immediately adjacent to the first five exons (excluding exon 1) reveals only about 46 to 50% identity within the first 60 bp of intronic sequence. Furthermore, we have identified novel polymorphic di-, tri- and tetra-nucleotide repeats in Hdh introns of various mouse strains that are not present in the human. For example, polymorphic CT repeats are present in introns 2 and 4 of Hdh and a novel mouse 56 AAG trinucleotide repeat (interrupted by an AAGG) is also located within intron 2. This information concerning the promoter and genomic organization of both HD and Hdh is critical for designing appropriate gene targeting vectors for studying the normal function of the HD and Hdh genes in model systems.
Gene-Based Single Nucleotide Polymorphism Markers for Genetic and Association Mapping in Common Bean
2012-01-01
Background In common bean, expressed sequence tags (ESTs) are an underestimated source of gene-based markers such as insertion-deletions (Indels) or single-nucleotide polymorphisms (SNPs). However, due to the nature of these conserved sequences, detection of markers is difficult and portrays low levels of polymorphism. Therefore, development of intron-spanning EST-SNP markers can be a valuable resource for genetic experiments such as genetic mapping and association studies. Results In this study, a total of 313 new gene-based markers were developed at target genes. Intronic variation was deeply explored in order to capture more polymorphism. Introns were putatively identified after comparing the common bean ESTs with the soybean genome, and the primers were designed over intron-flanking regions. The intronic regions were evaluated for parental polymorphisms using the single strand conformational polymorphism (SSCP) technique and Sequenom MassARRAY system. A total of 53 new marker loci were placed on an integrated molecular map in the DOR364 × G19833 recombinant inbred line (RIL) population. The new linkage map was used to build a consensus map, merging the linkage maps of the BAT93 × JALO EEP558 and DOR364 × BAT477 populations. A total of 1,060 markers were mapped, with a total map length of 2,041 cM across 11 linkage groups. As a second application of the generated resource, a diversity panel with 93 genotypes was evaluated with 173 SNP markers using the MassARRAY-platform and KASPar technology. These results were coupled with previous SSR evaluations and drought tolerance assays carried out on the same individuals. This agglomerative dataset was examined, in order to discover marker-trait associations, using general linear model (GLM) and mixed linear model (MLM). Some significant associations with yield components were identified, and were consistent with previous findings. 
Conclusions In short, this study illustrates the power of intron-based markers for linkage and association mapping in common bean. The utility of these markers is discussed in relation to the usefulness of microsatellites, the molecular markers par excellence in this crop. PMID:22734675
Rindi, Laura; Lari, Nicoletta; Garzelli, Carlo
2012-10-01
The Euro-American lineage of the Mycobacterium tuberculosis complex consists of 10 sublineages, each defined by a deletion of a large genomic region (RD, region of difference); by spoligotyping, which probes the polymorphism of the Direct Repeat (DR) locus, the Euro-American strains are classified into 5 lineages (T, Haarlem, LAM, S and X) and 34 sublineages, but the relationships between the RD-defined sublineages and the spoligotype groupings are largely unclear. By testing a global sample of 158 Euro-American strains, mutually exclusive deletions of RD115, RD122, RD174, RD182, RD183, RD193, RD219, RD726 or RD761 were found in 122 strains; deletion of RD724, typical of strains from Central Africa, was not found. The RD-defined sublineages, tested for katG463/gyrA95 polymorphism, belonged to Principal Genotypic Group (PGG) 2, with the exception of the RD219 sublineage, which belonged to PGG3; the 36 strains with no deletion were of either PGG2 or 3. Based on these polymorphisms, a phylogenetic reconstruction of the Euro-American lineage, which integrates the previously reported phylogeny, is proposed. Although certain deletions were found to be associated with certain spoligotype lineages (i.e., deletion RD115 with T and LAM, RD174 with LAM, RD182 with Haarlem, RD219 with T), our analysis indicates a general lack of concordance between RD-defined sublineages and spoligotype groupings. Moreover, of the 42 spoligotypes detected among the study strains, sixteen were shared by strains belonging to different RD sublineages. IS6110-RFLP analysis of strains sharing spoligotypes confirmed a poor genetic relatedness between strains of different RD sublineages. These findings provide evidence for the occurrence of a high degree of homoplasy in the DR locus leading to convergent evolution to identical spoligotypes. 
The incongruence between Large Sequence Polymorphism and spoligotype polymorphism argues against the use of spoligotyping for establishing phylogenetic relationships within the Euro-American lineage. Copyright © 2012 Elsevier B.V. All rights reserved.
A statistical method for the detection of variants from next-generation resequencing of DNA pools.
Bansal, Vikas
2010-06-15
Next-generation sequencing technologies have enabled the sequencing of several human genomes in their entirety. However, the routine resequencing of complete genomes remains infeasible. The massive capacity of next-generation sequencers can be harnessed for sequencing specific genomic regions in hundreds to thousands of individuals. Sequencing-based association studies are currently limited by the low level of multiplexing offered by sequencing platforms. Pooled sequencing represents a cost-effective approach for studying rare variants in large populations. To utilize the power of DNA pooling, it is important to accurately identify sequence variants from pooled sequencing data. Detection of rare variants from pooled sequencing represents a different challenge than detection of variants from individual sequencing. We describe a novel statistical approach, CRISP [Comprehensive Read analysis for Identification of Single Nucleotide Polymorphisms (SNPs) from Pooled sequencing] that is able to identify both rare and common variants by using two approaches: (i) comparing the distribution of allele counts across multiple pools using contingency tables and (ii) evaluating the probability of observing multiple non-reference base calls due to sequencing errors alone. Information about the distribution of reads between the forward and reverse strands and the size of the pools is also incorporated within this framework to filter out false variants. Validation of CRISP on two separate pooled sequencing datasets generated using the Illumina Genome Analyzer demonstrates that it can detect 80-85% of SNPs identified using individual sequencing while achieving a low false discovery rate (3-5%). Comparison with previous methods for pooled SNP detection demonstrates the significantly lower false positive and false negative rates for CRISP. Implementation of this method is available at http://polymorphism.scripps.edu/~vbansal/software/CRISP/.
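The second CRISP criterion described above (the probability of observing multiple non-reference base calls due to sequencing errors alone) can be illustrated with a simple binomial tail. This is only a minimal sketch, not the CRISP implementation: it assumes a single uniform per-base error rate and independent reads, and the function names `log_binom_pmf` and `prob_at_least_k_errors` are hypothetical.

```python
from math import exp, lgamma, log


def log_binom_pmf(k: int, n: int, p: float) -> float:
    """Log of the Binomial(n, p) probability mass at k (requires 0 < p < 1)."""
    log_choose = lgamma(n + 1) - lgamma(k + 1) - lgamma(n - k + 1)
    return log_choose + k * log(p) + (n - k) * log(1.0 - p)


def prob_at_least_k_errors(n_reads: int, k_alt: int, error_rate: float) -> float:
    """P(X >= k_alt) for X ~ Binomial(n_reads, error_rate).

    Assumption (not from the abstract): every read carries the same,
    independent chance `error_rate` of a non-reference call by error.
    """
    if k_alt <= 0:
        return 1.0
    if k_alt > n_reads:
        return 0.0
    # Sum the upper tail directly so very small probabilities keep precision.
    tail = sum(exp(log_binom_pmf(i, n_reads, error_rate))
               for i in range(k_alt, n_reads + 1))
    return min(1.0, tail)


if __name__ == "__main__":
    # Hypothetical pool: 200 reads at a site, 8 non-reference calls, 1% error.
    print(prob_at_least_k_errors(200, 8, 0.01))
```

A small tail probability suggests the non-reference calls are unlikely to be errors alone; the contingency-table comparison across pools and the strand and pool-size filters mentioned in the abstract are not modelled here.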
Genovar: a detection and visualization tool for genomic variants.
Jung, Kwang Su; Moon, Sanghoon; Kim, Young Jin; Kim, Bong-Jo; Park, Kiejung
2012-05-08
Along with single nucleotide polymorphisms (SNPs), copy number variation (CNV) is considered an important source of genetic variation associated with disease susceptibility. Despite the importance of CNV, the tools currently available for its analysis often produce false positive results due to limitations such as low resolution of array platforms, platform specificity, and the type of CNV. To resolve this problem, spurious signals must be separated from true signals by visual inspection. None of the previously reported CNV analysis tools support this function and the simultaneous visualization of comparative genomic hybridization arrays (aCGH) and sequence alignment. The purpose of the present study was to develop a useful program for the efficient detection and visualization of CNV regions that enables the manual exclusion of erroneous signals. A JAVA-based stand-alone program called Genovar was developed. To ascertain whether a detected CNV region is a novel variant, Genovar compares the detected CNV regions with previously reported CNV regions using the Database of Genomic Variants (DGV, http://projects.tcag.ca/variation) and the Single Nucleotide Polymorphism Database (dbSNP). The current version of Genovar is capable of visualizing genomic data from sources such as the aCGH data file and sequence alignment format files. Genovar is freely accessible and provides a user-friendly graphic user interface (GUI) to facilitate the detection of CNV regions. The program also provides comprehensive information to help in the elimination of spurious signals by visual inspection, making Genovar a valuable tool for reducing false positive CNV results. http://genovar.sourceforge.net/.
Molecular characterization of a Toxocara variant from cats in Kuala Lumpur, Malaysia.
Zhu, X Q; Jacobs, D E; Chilton, N B; Sani, R A; Cheng, N A; Gasser, R B
1998-08-01
The ascaridoid nematode of cats from Kuala Lumpur, Malaysia, previously identified morphologically as Toxocara canis, was characterized using a molecular approach. The nuclear ribosomal DNA (rDNA) region spanning the first internal transcribed spacer (ITS-1), the 5.8S gene and the second internal transcribed spacer (ITS-2) was amplified and sequenced. The sequences for the parasite from Malaysian cats were compared with those for T. canis and T. cati. The sequence data showed that this taxon was genetically more similar to T. cati than to T. canis in the ITS-1, 5.8S and ITS-2. Differences in the ITS-1 and ITS-2 sequences between the taxa (9.4-26.1%) were markedly higher than variation between samples within T. canis and T. cati (0-2.9%). The sequence data demonstrate that the parasite from Malaysian cats is neither T. canis nor T. cati and indicate that it is a distinct species. Based on these data, PCR-linked restriction fragment length polymorphism (RFLP) and single-strand conformation polymorphism (SSCP) methods were employed for the unequivocal differentiation of the Toxocara variant from T. canis and T. cati. These methods should provide valuable tools for studying the life-cycle, transmission pattern(s) and zoonotic potential of this parasite.
Zhang, Peng; Zhu, Yuqiang; Wang, Lili; Chen, Liping; Zhou, Shengjun
2015-12-14
Powdery mildew (PM) is the most common fungal disease of cucumber and other cucurbit crops. Breeding PM-resistant materials is an effective way to defend against this disease, and recent developments in genetics and genomics have shown that studying resistance genes is essential for breeding highly PM-resistant plants. With the ever-increasing throughput of next-generation sequencing (NGS), specific length amplified fragment sequencing (SLAF-seq), a high-resolution strategy for large-scale de novo SNP discovery, is gradually being applied to functional gene mining. Here we combined bulked segregant analysis (BSA) with SLAF-seq to identify candidate genes associated with PM resistance (PMR) in cucumber. A segregating population comprising 251 F2 individuals was developed using H136 (female parent) as the susceptible parent and BK2 (male parent) as the resistance donor. After the PMR test, total genomic DNA was prepared from each plant. Systemic genomic analysis of the GC content, repeat sequences, etc. was carried out with the prediction software SLAF_Predict to establish conditions that ensure the uniformity and density of the molecular markers. After the samples were gel purified, SLAFs were generated at Biomarker Technologies Corporation in Beijing. Based on the SLAF tags and the PMR test results, the hot regions were annotated. A total of 73,100 high-quality SLAF tags with an average depth of 99.11× were sequenced. Among these, 5,355 polymorphic tags were identified with a polymorphism rate of 7.34 %, including 7.09 % SNPs and other polymorphism types. Finally, 140 associated SLAFs were identified, and two main hot regions were detected on chromosomes 1 and 6, which contained five genes involved in defense response, toxin metabolism, cell stress response, and injury response in cucumber. 
Associated markers identified by super-BSA in this study could not only speed up the study of the PMR genes, but also provide a feasible solution for marker-assisted breeding of PMR cucumber. Moreover, this approach could also be extended to any other species with a reference genome.
Albrechtsen, A; Grarup, N; Li, Y; Sparsø, T; Tian, G; Cao, H; Jiang, T; Kim, S Y; Korneliussen, T; Li, Q; Nie, C; Wu, R; Skotte, L; Morris, A P; Ladenvall, C; Cauchi, S; Stančáková, A; Andersen, G; Astrup, A; Banasik, K; Bennett, A J; Bolund, L; Charpentier, G; Chen, Y; Dekker, J M; Doney, A S F; Dorkhan, M; Forsen, T; Frayling, T M; Groves, C J; Gui, Y; Hallmans, G; Hattersley, A T; He, K; Hitman, G A; Holmkvist, J; Huang, S; Jiang, H; Jin, X; Justesen, J M; Kristiansen, K; Kuusisto, J; Lajer, M; Lantieri, O; Li, W; Liang, H; Liao, Q; Liu, X; Ma, T; Ma, X; Manijak, M P; Marre, M; Mokrosiński, J; Morris, A D; Mu, B; Nielsen, A A; Nijpels, G; Nilsson, P; Palmer, C N A; Rayner, N W; Renström, F; Ribel-Madsen, R; Robertson, N; Rolandsson, O; Rossing, P; Schwartz, T W; Slagboom, P E; Sterner, M; Tang, M; Tarnow, L; Tuomi, T; van't Riet, E; van Leeuwen, N; Varga, T V; Vestmar, M A; Walker, M; Wang, B; Wang, Y; Wu, H; Xi, F; Yengo, L; Yu, C; Zhang, X; Zhang, J; Zhang, Q; Zhang, W; Zheng, H; Zhou, Y; Altshuler, D; 't Hart, L M; Franks, P W; Balkau, B; Froguel, P; McCarthy, M I; Laakso, M; Groop, L; Christensen, C; Brandslund, I; Lauritzen, T; Witte, D R; Linneberg, A; Jørgensen, T; Hansen, T; Wang, J; Nielsen, R; Pedersen, O
2013-02-01
Human complex metabolic traits are in part regulated by genetic determinants. Here we applied exome sequencing to identify novel associations of coding polymorphisms at minor allele frequencies (MAFs) >1% with common metabolic phenotypes. The study comprised three stages. We performed medium-depth (8×) whole exome sequencing in 1,000 cases with type 2 diabetes, BMI >27.5 kg/m² and hypertension and in 1,000 controls (stage 1). We selected 16,192 polymorphisms nominally associated (p < 0.05) with case-control status, from four selected annotation categories or from loci reported to associate with metabolic traits. These variants were genotyped in 15,989 Danes to search for association with 12 metabolic phenotypes (stage 2). In stage 3, polymorphisms showing potential associations were genotyped in a further 63,896 Europeans. Exome sequencing identified 70,182 polymorphisms with MAF >1%. In stage 2 we identified 51 potential associations with one or more of eight metabolic phenotypes covered by 45 unique polymorphisms. In meta-analyses of stage 2 and stage 3 results, we demonstrated robust associations for coding polymorphisms in CD300LG (fasting HDL-cholesterol: MAF 3.5%, p = 8.5 × 10⁻¹⁴), COBLL1 (type 2 diabetes: MAF 12.5%, OR 0.88, p = 1.2 × 10⁻¹¹) and MACF1 (type 2 diabetes: MAF 23.4%, OR 1.10, p = 8.2 × 10⁻¹⁰). We applied exome sequencing as a basis for finding genetic determinants of metabolic traits and show the existence of low-frequency and common coding polymorphisms with impact on common metabolic traits. Based on our study, coding polymorphisms with MAF above 1% do not seem to have particularly high effect sizes on the measured metabolic traits.
Phylogenetic analysis of mtDNA lineages in South American mummies.
Monsalve, M V; Cardenas, F; Guhl, F; Delaney, A D; Devine, D V
1996-07-01
Some studies of mtDNA propose that contemporary Amerindians have descended from four haplotype groups, each defined by specific sets of polymorphisms. One recent study also found evidence of other potential founder haplotypes. We wanted to determine whether the four haplotypes in modern populations were also present in ancient South American aboriginals. We subjected mtDNA from Colombian mummies (470 to 1849 AD) to PCR amplification and restriction endonuclease analysis. The mtDNA D-loop region was surveyed for sequence variation by restriction analysis and a segment of this region was sequenced for each mummy to characterize the haplotypes. Our mummies exhibited three of the four major characteristic haplotypes of Amerindian populations defined by four markers. With sequence data obtained in the ancient samples and published data on contemporary Amerindians it was possible to infer the origin of these six mummies.
McCutchen-Maloney, Sandra L.
2002-01-01
DNA mutation binding proteins alone and as chimeric proteins with nucleases are used with solid supports to detect DNA sequence variations, DNA mutations and single nucleotide polymorphisms. The solid supports may be flow cytometry beads, DNA chips, glass slides or DNA dipsticks. DNA molecules are coupled to solid supports to form DNA-support complexes. Labeled DNA is used with unlabeled DNA mutation binding proteins such as TthMutS to detect DNA sequence variations, DNA mutations and single nucleotide length polymorphisms by binding, which gives an increase in signal. Unlabeled DNA is utilized with labeled chimeras to detect DNA sequence variations, DNA mutations and single nucleotide length polymorphisms by nuclease activity of the chimera, which gives a decrease in signal.
Yebra, Gonzalo; Holguín, Africa
2008-01-01
Bevirimat (PA-457) is the first candidate of a new family of antiretroviral drugs, the maturation inhibitors. Its action is based on disruption of the protease cleavage of the Gag precursor region. Six resistance mutations have been described and analysed in virus from both treatment-naive and protease inhibitor (PI)-experienced patients, but only in the subtype B of HIV type-1 (HIV-1) virus. Thus, genotypic resistance in non-B subtypes still requires analysis. HIV-1 sequences of different subtypes (54 B, 81 non-B and recombinants) were analysed for the presence of resistance mutations to bevirimat, located within the capsid (CA) protein and spacer peptide 1 (SP1) cleavage site. No resistance mutations were found, although polymorphisms appeared in some CA-SP1 residues. The C-terminal CA protein and the N-terminal SP1 presented high conservation, whereas C-terminal SP1 was highly variable in sequence and length, with unknown influence on resistance acquisition. The results of the present study confirm an absolute conservation of the residues involved in bevirimat in vitro resistance in a large panel of HIV-1 subtypes and recombinants from both treatment-naive and PI-experienced patients. Treatment alone seemed to increase the polymorphism count in CRF02_AG recombinant sequences; however, the influence of natural polymorphisms needs to be explored.
Wang, Hongtao; Li, Guisheng; Kwon, Woo-Saeng; Yang, Deok-Chun
2016-01-01
Panax ginseng is one of the most valuable medicinal plants in the Orient. The low level of genetic variation has limited the application of molecular markers for cultivar authentication and marker-assisted selection in cultivated ginseng. To exploit DNA polymorphism within ginseng cultivars, ginseng expressed sequence tags (ESTs) were searched against the potential intron polymorphism (PIP) database to predict the positions of introns. Intron-flanking primers were then designed in conserved exon regions and used to amplify across the more variable introns. Sequencing results showed that single nucleotide polymorphisms (SNPs), as well as indels, were detected in four EST-derived introns, and SNP markers specific to “Gopoong” and “K-1” were first reported in this study. Based on cultivar-specific SNP sites, allele-specific polymerase chain reaction (PCR) was conducted and proved to be effective for the authentication of ginseng cultivars. Additionally, the combination of a simple NaOH-Tris DNA isolation method and real-time allele-specific PCR assay enabled the high throughput selection of cultivars from ginseng fields. The established real-time allele-specific PCR assay should be applied to molecular authentication and marker assisted selection of P. ginseng cultivars, and the EST intron-targeting strategy will provide a potential approach for marker development in species without whole genomic DNA sequence information. PMID:27271615
Croxford, Adam E; Rogers, Tom; Caligari, Peter D S; Wilkinson, Michael J
2008-01-01
* The provision of sequence-tagged site (STS) anchor points allows meaningful comparisons between mapping studies but can be a time-consuming process for nonmodel species or orphan crops. * Here, the first use of high-resolution melt analysis (HRM) to generate STS markers for use in linkage mapping is described. This strategy is rapid and low-cost, and circumvents the need for labelled primers or amplicon fractionation. * Using white lupin (Lupinus albus, x = 25) as a case study, HRM analysis was applied to identify 91 polymorphic markers from expressed sequence tag (EST)-derived and genomic libraries. Of these, 77 generated STS anchor points in the first fully resolved linkage map of the species. The map also included 230 amplified fragment length polymorphisms (AFLP) loci, spanned 1916 cM (84.2% coverage) and divided into the expected 25 linkage groups. * Quantitative trait loci (QTL) analyses performed on the population revealed genomic regions associated with several traits, including the agronomically important time to flowering (tf), alkaloid synthesis and stem height (Ph). Use of HRM-STS markers also allowed us to make direct comparisons between our map and that of the related crop, Lupinus angustifolius, based on the conversion of RFLP, microsatellite and single nucleotide polymorphism (SNP) markers into HRM markers.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Micklos, David A.
2006-10-30
This project achieved its goal of implementing a nationwide training program to introduce high school biology teachers to the key uses and societal implications of human DNA polymorphisms. The 2.5-day workshop introduced high school biology faculty to a laboratory-based unit on human DNA polymorphisms, which provides a uniquely personal perspective on the science and Ethical, Legal and Social Implications (ELSI) of the Human Genome Project. As proposed, 12 workshops were conducted at venues across the United States. The workshops were attended by 256 high school faculty, exceeding proposed attendance of 240 by 7%. Each workshop mixed theoretical, laboratory, and computer work with practical and ethical implications. Program participants learned simplified lab techniques for amplifying three types of chromosomal polymorphisms: an Alu insertion (PV92), a VNTR (pMCT118/D1S80), and single nucleotide polymorphisms (SNPs) in the mitochondrial control region. These polymorphisms illustrate the use of DNA variations in disease diagnosis, forensic biology, and identity testing - and provide a starting point for discussing the uses and potential abuses of genetic technology. Participants also learned how to use their Alu and mitochondrial data as an entrée to human population genetics and evolution. Our work to simplify lab techniques for amplifying human DNA polymorphisms in educational settings culminated with the release in 1998 of three Advanced Technology (AT) PCR kits by Carolina Biological Supply Company, the nation's oldest educational science supplier. The kits use a simple 30-minute method to isolate template DNA from hair sheaths or buccal cells and streamlined PCR chemistry based on Pharmacia Ready-To-Go Beads, which incorporate Taq polymerase, deoxynucleotide triphosphates, and buffer in a freeze-dried pellet. These kits have greatly simplified teacher implementation of human PCR labs, and their use is growing at a rapid pace. 
Sales of human polymorphism kits by Carolina Biological rose from 700 units in 1999 to 1,132 in 2000, a 62% increase. Competing kits using the Alu system, and based substantially on our earlier work, are also marketed by Biorad and Edvotek. In parallel with the lab experiments, we developed a suite of database/statistical applications and easy-to-use interfaces that allow students to use their own DNA data to explore human population genetics and to test theories of human evolution. Database searches and statistical analyses are launched from a centralized workspace. Workshop participants were introduced to these and other resources available at the DNALC WWW site (http://vector.cshl.org/bioserver/): 1) Allele Server tests Hardy-Weinberg equilibrium and statistically compares PV92 data from world populations. 2) Sequence Server uses DNA sequence data to search Genbank using BLASTN, compare sequences using CLUSTALW, and create phylogenetic trees using PHYLIP. 3) Simulation Server uses a Monte Carlo generator to model the long-term effects of drift, selection, and population bottlenecks. By targeting motivated and innovative biology faculty, we believe that this project offered a cost-effective means to bring high school biology education up-to-the-minute with genomic biology. The workshop reached a target audience of highly professional faculty who have already implemented hands-on labs in molecular genetics and many of whom offer laboratory electives in biotechnology. Many attend professional meetings, develop curriculum, collaborate with scientists, teach faculty workshops, and manage equipment-sharing programs. These individuals are life-long learners, anxious for deeper insight and additional training to further extend their leadership. This contention was supported by data from a mail survey, conducted in February-March 2000 and 2001, of 256 faculty who participated in workshops conducted during the current term of DOE support. 
Seventy percent of participants responded, providing direct reports on how their teaching behavior had changed since taking the DOE workshop. About nine of ten respondents said they had provided new classroom materials and first-hand accounts of DNA typing, sequencing, or PCR. Three-fourths had introduced new units on human molecular genetics. Most strikingly, half had students use PCR to amplify their own insertion polymorphisms (PV92), and better than one-fourth amplified a VNTR polymorphism and the mitochondrial control region. One in five had mitochondrial DNA sequenced by the DNALC Sequencing Service. A majority (58%) used online materials at the DNALC WWW site, and 28% analyzed student polymorphism data with Bioservers at the DNALC site. A majority (58%) assisted other faculty with student labs on polymorphisms, reaching an additional 786 teachers.
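The Hardy-Weinberg equilibrium test that the Allele Server is said to run on PV92 data can be sketched as a one-degree-of-freedom chi-square test on genotype counts. This is an illustrative sketch only, not the DNALC implementation; the function name `hwe_chi_square` and the example class counts are hypothetical.

```python
from math import erfc, sqrt


def hwe_chi_square(n_plus_plus: int, n_plus_minus: int, n_minus_minus: int):
    """Chi-square test of Hardy-Weinberg proportions at a biallelic locus.

    Arguments are observed genotype counts, e.g. for an Alu insertion:
    +/+ (insertion homozygote), +/- (heterozygote), -/- (no insertion).
    Returns (chi_square, p_value) with 1 degree of freedom.
    """
    n = n_plus_plus + n_plus_minus + n_minus_minus
    if n == 0:
        raise ValueError("at least one genotype is required")
    # Allele frequencies estimated from the sample itself.
    p = (2 * n_plus_plus + n_plus_minus) / (2 * n)
    q = 1.0 - p
    observed = (n_plus_plus, n_plus_minus, n_minus_minus)
    expected = (p * p * n, 2 * p * q * n, q * q * n)
    # Skip classes with zero expectation (monomorphic samples).
    chi2 = sum((o - e) ** 2 / e for o, e in zip(observed, expected) if e > 0)
    # Upper tail of chi-square with 1 df equals erfc(sqrt(x / 2)).
    p_value = erfc(sqrt(chi2 / 2.0))
    return chi2, p_value


if __name__ == "__main__":
    # Hypothetical class data: 12 +/+, 30 +/-, 18 -/- students.
    print(hwe_chi_square(12, 30, 18))
```

With classroom-sized samples the chi-square approximation is rough, so an exact test may be preferable when expected counts are small.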
Liang, Xing-huan; Qin, Ying-fen; Ma, Yan; Xie, Xin-rong; Xie, Kai-qing; Luo, Zuo-jie
2006-06-01
To investigate the relationship between the polymorphic (AT)n repeats in the 3' untranslated region of exon 4 of the CTLA4 gene [CTLA4(AT)n] and Graves' disease (GD) in the Zhuang nationality population of Guangxi province. The studied groups comprised 48 patients with GD and 44 normal controls. Amplification of target DNA was carried out by polymerase chain reaction (PCR). The amplified products were run by 8% polyacrylamide gel electrophoresis, and then followed by 0.1% silver staining. Some of the amplified products were sequenced directly. Nineteen alleles of CTLA4 gene microsatellite polymorphism were found in Guangxi Zhuang nationality individuals. The 106 bp long allele was apparently increased in patients with GD of Zhuang nationality but not in healthy controls (P < 0.05). CTLA4 gene microsatellite polymorphism is strongly associated with Graves' disease in the Zhuang nationality population of Guangxi province. CTLA4(AT)n 106 bp may be the susceptible gene in GD patients of Zhuang nationality in Guangxi; 19 alleles of CTLA4 gene microsatellite polymorphism were found in Guangxi Zhuang nationality individuals.
A High Proportion of Chromosome 21 Promoter Polymorphisms Influence Transcriptional Activity
Buckland, Paul R.; Coleman, Sharol L.; Hoogendoorn, Bastiaan; Guy, Carol; Smith, S. Kaye; O’Donovan, Michael C.
2004-01-01
We have sought to obtain an unbiased estimate of the proportion of polymorphisms in promoters of human genes that have functional effects. We carried out polymorphism discovery on a randomly selected group of 51 gene promoters mapping to human chromosome 21 and successfully analyzed the effect on transcription of 38 of the sequence variants. To achieve this, a total of 53 different haplotypes from 20 promoters were cloned into a modified pGL3 luciferase reporter gene vector and were tested for their abilities to promote transcription in HEK293t and JEG-3 cells. Up to seven (18%) of the 38 tested variants altered transcription by 1.5-fold, confirming that a surprisingly high proportion of promoter region polymorphisms are likely to be functionally important. The functional variants were distributed across the promoters of CRYAA, IFNAR1, KCNJ15, NCAM2, IGSF5, and B3GALT5. Three of the genes (NCAM2, IFNAR1, and CRYAA) have been previously associated with human phenotypes and the polymorphisms we describe here may therefore play a role in those phenotypes. PMID:15200235
Pattaradilokrat, Sittiporn; Trakoolsoontorn, Chawinya; Simpalipan, Phumin; Warrit, Natapot; Kaewthamasorn, Morakot; Harnyuttanakorn, Pongchai
2018-01-22
The glutamate-rich protein (GLURP) of the malaria parasite Plasmodium falciparum is a key surface antigen that serves as a component of a clinical vaccine. Moreover, the GLURP gene is also employed routinely as a genetic marker for malarial genotyping in epidemiological studies. While extensive size polymorphisms in GLURP are well recorded, the extent of the sequence diversity of this gene is rarely investigated. The present study aimed to explore the genetic diversity of GLURP in natural populations of P. falciparum. The polymorphic C-terminal repetitive R2 region of GLURP sequences from 65 P. falciparum isolates in Thailand was generated and combined with the data from 103 worldwide isolates to generate a GLURP database. The collection comprised 168 alleles, encoding 105 unique GLURP subtypes, characterized by 18 types of amino acid repeat units (AAU). Of these, 28 GLURP subtypes, formed by 10 AAU types, were detected in P. falciparum in Thailand. Among them, 19 GLURP subtypes and 2 AAU types are described for the first time in the Thai parasite population. The AAU sequences were highly conserved, which is likely due to negative selection. Standard Fst analysis revealed the shared distributions of GLURP types among the P. falciparum populations, providing evidence of gene flow among the different demographic populations. Sequence diversity causing size variations in GLURP in Thai P. falciparum populations was detected; it was caused by non-synonymous substitutions in repeat units and by insertions/deletions of aspartic acid or glutamic acid codons between repeat units. The P. falciparum population structure based on GLURP showed promising implications for the development of GLURP-based vaccines and for monitoring vaccine efficacy.
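The abstract does not say which Fst estimator was used. As a rough illustration of the idea, the sketch below computes a heterozygosity-based Fst, (H_T - H_S) / H_T, from subtype frequencies in each population. Equal weighting of populations, the function names and the example frequencies are all assumptions made for illustration.

```python
def expected_heterozygosity(freqs: dict) -> float:
    """Probability that two random draws carry different alleles."""
    return 1.0 - sum(f * f for f in freqs.values())


def fst(populations: list) -> float:
    """Heterozygosity-based Fst across populations.

    `populations` is a non-empty list of dicts mapping allele (or subtype)
    name to its frequency in that population; each dict should sum to 1.
    Populations are weighted equally (an assumption of this sketch).
    """
    n = len(populations)
    alleles = set().union(*populations)
    # Pooled frequencies: simple average over populations.
    mean_freqs = {a: sum(pop.get(a, 0.0) for pop in populations) / n
                  for a in alleles}
    h_s = sum(expected_heterozygosity(pop) for pop in populations) / n
    h_t = expected_heterozygosity(mean_freqs)
    if h_t == 0:
        return 0.0  # no variation at all, so no differentiation to measure
    return (h_t - h_s) / h_t


if __name__ == "__main__":
    # Hypothetical subtype frequencies in two parasite populations.
    pop_a = {"type1": 0.6, "type2": 0.4}
    pop_b = {"type1": 0.5, "type2": 0.3, "type3": 0.2}
    print(fst([pop_a, pop_b]))
```

Values near 0 indicate that populations share similar frequency distributions, which is consistent with gene flow; values near 1 indicate strong differentiation.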
Chou, A; Burke, J
1999-05-01
DNA sequence clustering has become a valuable method in support of gene discovery and gene expression analysis. Our interest lies in leveraging the sequence diversity within clusters of expressed sequence tags (ESTs) to model gene structure for the study of gene variants that arise from, among other things, alternative mRNA splicing, polymorphism, and divergence after gene duplication, fusion, and translocation events. In previous work, CRAW was developed to discover gene variants from assembled clusters of ESTs. Most importantly, novel gene features (the differing units between gene variants, for example alternative exons, polymorphisms, transposable elements, etc.) that are specialized to tissue, disease, population, or developmental states can be identified when these tools collate DNA source information with gene variant discrimination. While the goal is complete automation of novel feature and gene variant detection, current methods are far from perfect and hence the development of effective tools for visualization and exploratory data analysis is of paramount importance in the process of sifting through candidate genes and validating targets. We present CRAWview, a Java-based visualization extension to CRAW. Features that vary between gene forms are displayed using an automatically generated color-coded index. The reporting format of CRAWview gives a brief, high-level summary report to display overlap and divergence within clusters of sequences as well as the ability to 'drill down' and see detailed information concerning regions of interest. Additionally, the alignment viewing and editing capabilities of CRAWview make it possible to interactively correct frame-shifts and otherwise edit cluster assemblies. We have implemented CRAWview as a Java application across Windows NT/95 and UNIX platforms. A beta version of CRAWview will be freely available to academic users from Pangea Systems (http://www.pangeasystems.com).
USDA-ARS's Scientific Manuscript database
Single-nucleotide polymorphisms (SNPs) are highly abundant markers, which are broadly distributed in animal genomes. For rainbow trout, SNP discovery has been done through sequencing of restriction-site associated DNA (RAD) libraries, reduced representation libraries (RRL), RNA sequencing, and whole...
Esmaeili, Rezvan; Abdoli, Nasrin; Yadegari, Fatemeh; Neishaboury, Mohamadreza; Farahmand, Leila; Kaviani, Ahmad; Majidzadeh-A, Keivan
2018-01-01
CD44, encoded by a single gene, is a cell surface transmembrane glycoprotein. Exon 2 is one of the exons important for binding of the CD44 protein to hyaluronan. Experimental evidence shows that hyaluronan-CD44 interaction intensifies the proliferation, migration, and invasion of breast cancer cells. Therefore, the current study aimed at investigating the association between specific polymorphisms in exon 2 and its flanking region of CD44 with predisposition to breast cancer. In the current study, 175 Iranian female patients with breast cancer and 175 age-matched healthy controls were recruited from the biobank of the Breast Cancer Research Center, Tehran, Iran. Single nucleotide polymorphisms of CD44 exon 2 and its flanking regions were analyzed via polymerase chain reaction and gene sequencing techniques. Associations between the observed variation and breast cancer risk and clinico-pathological characteristics were studied. Subsequently, bioinformatics analysis was conducted to predict potential exonic splicing enhancer (ESE) motifs changed as the result of a mutation. A unique polymorphism of the gene encoding CD44 was identified 14 nucleotides upstream of exon 2 (A37692→G) by the sequencing method. The A > G polymorphism exhibited a significant association with higher grades of breast cancer, although no significant relation was found between this polymorphism and breast cancer risk. Finally, computational analysis revealed that the intronic mutation generated a new consensus-binding motif for the splicing factor, SC35, within intron 1. The current study results indicated that the A > G polymorphism was associated with breast cancer development; in addition, in silico analysis with ESE finder prediction software showed that the change created a new SC35 binding site.
2010-01-01
Background FAE1 (fatty acid elongase1) is the key gene in the control of erucic acid synthesis in seeds of Brassica species. Because oil with low erucic acid (LEA) content is essential for human health and the available LEA resources are insufficient, new LEA genetic resources are being sought for Brassica breeding. EcoTILLING, a powerful genotyping method, can readily be used to identify polymorphisms in Brassica. Results Seven B. rapa, nine B. oleracea and 101 B. napus accessions were collected for identification of FAE1 polymorphisms. Three polymorphisms were detected in the two FAE1 paralogues of B. napus using EcoTILLING and were found to be strongly associated with differences in the erucic acid contents of seeds. In genomic FAE1 sequences obtained from seven B. rapa accessions, one SNP in the coding region was deduced to cause loss of gene function. Molecular evolution analysis of FAE1 homologues showed that the relationship between the Brassica A and C genomes is closer than that between the A/C genomes and the Arabidopsis genome. Alignment of the coding sequences of these FAE1 homologues indicated that 18 SNPs differed between the A and C genomes and could be used as genome-specific markers in Brassica. Conclusion This study showed the applicability of EcoTILLING for detecting gene polymorphisms in Brassica. The association between B. napus FAE1 polymorphisms and the erucic acid contents of seeds may provide useful guidance for LEA breeding. The discovery of the LEA resource in B. rapa can be exploited in Brassica cultivation. PMID:20594317
Wang, Nian; Shi, Lei; Tian, Fang; Ning, Huicai; Wu, Xiaoming; Long, Yan; Meng, Jinling
2010-07-01
FAE1 (fatty acid elongase1) is the key gene in the control of erucic acid synthesis in seeds of Brassica species. Because oil with low erucic acid (LEA) content is essential for human health and the available LEA resources are insufficient, new LEA genetic resources are being sought for Brassica breeding. EcoTILLING, a powerful genotyping method, can readily be used to identify polymorphisms in Brassica. Seven B. rapa, nine B. oleracea and 101 B. napus accessions were collected for identification of FAE1 polymorphisms. Three polymorphisms were detected in the two FAE1 paralogues of B. napus using EcoTILLING and were found to be strongly associated with differences in the erucic acid contents of seeds. In genomic FAE1 sequences obtained from seven B. rapa accessions, one SNP in the coding region was deduced to cause loss of gene function. Molecular evolution analysis of FAE1 homologues showed that the relationship between the Brassica A and C genomes is closer than that between the A/C genomes and the Arabidopsis genome. Alignment of the coding sequences of these FAE1 homologues indicated that 18 SNPs differed between the A and C genomes and could be used as genome-specific markers in Brassica. This study showed the applicability of EcoTILLING for detecting gene polymorphisms in Brassica. The association between B. napus FAE1 polymorphisms and the erucic acid contents of seeds may provide useful guidance for LEA breeding. The discovery of the LEA resource in B. rapa can be exploited in Brassica cultivation.
Rachman, C N; Kabadjova, P; Prévost, H; Dousset, X
2003-01-01
The restriction fragment length polymorphism (RFLP) method was used to differentiate Lactobacillus species having closely related identities in the 16S-23S rDNA intergenic spacer region (ISR). Species-specific primers for Lact. farciminis and Lact. alimentarius were designed and allowed rapid identification of these species. The 16S-23S rDNA spacer region was amplified by primers tAla and 23S/p10, then digested by HinfI and TaqI enzymes and analysed by electrophoresis. Digestion by HinfI was not sufficient to differentiate Lact. sakei, Lact. curvatus, Lact. farciminis, Lact. alimentarius, Lact. plantarum and Lact. paraplantarum. In contrast, digestion carried out by TaqI revealed five different patterns allowing these species to be distinguished, except for Lact. plantarum from Lact. paraplantarum. The 16S-23S rDNA spacer region of Lact. farciminis and Lact. alimentarius were amplified and then cloned into vector pCR(R)2.1 and sequenced. The DNA sequences obtained were analysed and species-specific primers were designed from these sequences. The specificity of these primers was positively demonstrated as no response was obtained for 14 other species tested. The species-specific primers for Lact. farciminis and Lact. alimentarius were shown to be useful for identifying these species among other lactobacilli. The RFLP profile obtained upon digestion with HinfI and TaqI enzymes can be used to discriminate Lact. farciminis, Lact. alimentarius, Lact. sakei, Lact. curvatus and Lact. plantarum. In this paper, we have established the first species-specific primer for PCR identification of Lact. farciminis and Lact. alimentarius. Both species-specific primer and RFLP, could be used as tools for rapid identification of lactobacilli up to species level.
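The digestion step described in the abstract above can be mimicked in silico. The following is a minimal sketch, not the authors' procedure: it assumes the standard recognition sites HinfI (G^ANTC) and TaqI (T^CGA), considers only the top strand of a linear amplicon, and uses a made-up toy sequence rather than any real 16S-23S spacer. The function name `digest` is hypothetical.

```python
import re

# Recognition sites with the top-strand cut offset inside the site.
# HinfI cuts G^ANTC and TaqI cuts T^CGA, so both cut after the first base.
# Lookahead patterns are used so that overlapping sites are all found.
ENZYMES = {
    "HinfI": (re.compile(r"(?=GA[ACGT]TC)"), 1),
    "TaqI": (re.compile(r"(?=TCGA)"), 1),
}


def digest(seq, enzyme):
    """Return fragment lengths (5' to 3') after digesting a linear sequence."""
    pattern, offset = ENZYMES[enzyme]
    seq = seq.upper()
    cuts = [m.start() + offset for m in pattern.finditer(seq)]
    bounds = [0] + cuts + [len(seq)]
    return [end - start for start, end in zip(bounds, bounds[1:])]


# Toy amplicon (hypothetical, for illustration only).
toy_spacer = "AAAAGATTCCCCCTCGAGG"
print("HinfI:", digest(toy_spacer, "HinfI"))  # [5, 14]
print("TaqI:", digest(toy_spacer, "TaqI"))    # [14, 5]
```

On a gel only the set of fragment sizes is visible, so two species are distinguishable by an enzyme only when their sorted fragment lists differ. In this toy case both enzymes yield fragments of 5 and 14 bases.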
Vacca, G M; Dettori, M L; Balia, F; Luridiana, S; Mura, M C; Carcangiu, V; Pazzola, M
2013-09-01
The purpose was to analyze the growth hormone GH1/GH2-N and GH2-Z gene copies and to assess their possible association with milk traits in Sarda sheep. Two hundred multiparous lactating ewes were monitored. The two gene copies were amplified separately and each was used as template for a nested PCR, to investigate single strand conformation polymorphism (SSCP) of the 5'UTR, exon-1, exon-5 and 3'UTR DNA regions. SSCP analysis revealed marked differences in the number of polymorphic patterns between the two genes. Sequencing revealed five nucleotide changes at the GH1/GH2-N gene. Five nucleotide changes occurred at the GH2-Z gene: one was located in exon-5 (c.556G > A) and resulted in a putative amino acid substitution G186S. All the nucleotide changes were copy-specific, except c.*30delT, which was common to both GH1/GH2-N and GH2-Z. Variability in the promoter regions of each gene might have consequences on the expression level, due to the involvement in potential transcription factor binding sites. Both gene copies influenced milk yield. A correlation with milk protein and casein content was also evidenced. These results may have implications that make them useful for future breeding strategies in dairy sheep breeding.
Jiang, Rui ; Yang, Hua ; Zhou, Linqi ; Kuo, C.-C. Jay ; Sun, Fengzhu ; Chen, Ting
2007-01-01
The increasing demand for the identification of genetic variation responsible for common diseases has translated into a need for sophisticated methods for effectively prioritizing mutations occurring in disease-associated genetic regions. In this article, we prioritize candidate nonsynonymous single-nucleotide polymorphisms (nsSNPs) through a bioinformatics approach that takes advantages of a set of improved numeric features derived from protein-sequence information and a new statistical learning model called “multiple selection rule voting” (MSRV). The sequence-based features can maximize the scope of applications of our approach, and the MSRV model can capture subtle characteristics of individual mutations. Systematic validation of the approach demonstrates that this approach is capable of prioritizing causal mutations for both simple monogenic diseases and complex polygenic diseases. Further studies of familial Alzheimer diseases and diabetes show that the approach can enrich mutations underlying these polygenic diseases among the top of candidate mutations. Application of this approach to unclassified mutations suggests that there are 10 suspicious mutations likely to cause diseases, and there is strong support for this in the literature. PMID:17668383
Wei, Guang-hui; Zhao, Bo; Wang, Zhen-jun
2008-09-01
To compare the sensitivity and specificity of single-stranded conformation polymorphism (SSCP) and denaturing high-performance liquid chromatography (DHPLC) in screening for hMSH2 and hMLH1 gene mutations in the diagnosis of hereditary non-polyposis colorectal cancer (HNPCC). Seven Chinese HNPCC kindreds were collected. PCR-SSCP and DHPLC were used to screen the coding regions of the hMSH2 and hMLH1 genes, and the abnormal profiles were sequenced on a 377 DNA sequencer. Seven gene sequence variations of hMSH2 or hMLH1 were found. Among them, 4 variations were detected by DHPLC but not by SSCP. The sensitivities of SSCP and DHPLC were 51.6% and 100%, respectively, and the specificities were 66.6% and 93.3%, respectively. DHPLC has better sensitivity and specificity than SSCP in screening for hMSH2 and hMLH1 gene mutations. DHPLC is an ideal method in the diagnosis of HNPCC.
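For readers comparing the two screening methods, sensitivity and specificity follow from a two-by-two table of screening calls against sequencing results. The sketch below is illustrative only: the abstract does not report the underlying counts, so the numbers used here are hypothetical and are not the study's data.

```python
def sensitivity(true_pos, false_neg):
    """Fraction of sequence-confirmed variants that the screen flagged."""
    return true_pos / (true_pos + false_neg)


def specificity(true_neg, false_pos):
    """Fraction of variant-free fragments that the screen called normal."""
    return true_neg / (true_neg + false_pos)


# Hypothetical counts for one screening method (not from the study).
tp, fn, tn, fp = 8, 2, 9, 1
print(f"sensitivity = {sensitivity(tp, fn):.1%}")
print(f"specificity = {specificity(tn, fp):.1%}")
```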
2009-01-01
Background Expressed sequence tags (ESTs) are an important source of gene-based markers such as those based on insertion-deletions (Indels) or single-nucleotide polymorphisms (SNPs). Several gel-based methods have been reported for the detection of sequence variants; however, they have not been widely exploited in common bean, an important legume crop of the developing world. The objectives of this project were to develop and map EST-based markers using analysis of single strand conformation polymorphisms (SSCPs), to create a transcript map for common bean and to compare synteny of the common bean map with sequenced chromosomes of other legumes. Results A set of 418 EST-based amplicons were evaluated for parental polymorphisms using the SSCP technique and 26% of these presented a clear conformational or size polymorphism between Andean and Mesoamerican genotypes. The amplicon-based markers were then used for genetic mapping with segregation analysis performed in the DOR364 × G19833 recombinant inbred line (RIL) population. A total of 118 new marker loci were placed into an integrated molecular map for common bean consisting of 288 markers. Of these, 218 were used for synteny analysis and 186 presented homology with segments of the soybean genome with an e-value lower than 7 × 10^-12. The synteny analysis with soybean showed a mosaic pattern of syntenic blocks with most segments of any one common bean linkage group associated with two soybean chromosomes. The analysis with Medicago truncatula and Lotus japonicus presented fewer syntenic regions, consistent with the more distant phylogenetic relationship between the galegoid and phaseoloid legumes. Conclusion The SSCP technique is a useful and inexpensive alternative to other SNP or Indel detection techniques for saturating the common bean genetic map with functional markers that may be useful in marker assisted selection. 
In addition, the genetic markers based on ESTs allowed the construction of a transcript map and given their high conservation between species allowed synteny comparisons to be made to sequenced genomes. This synteny analysis may support positional cloning of target genes in common bean through the use of genomic information from these other legumes. PMID:20030833
DOE Office of Scientific and Technical Information (OSTI.GOV)
Lappalainen, J.; Dean, M.; Virkkunen, M.
1995-04-24
Abnormal brain serotonin function may be characteristic of several neuropsychiatric disorders. Thus, it is important to identify polymorphic genes and screen for functional variants at loci coding for genes that control normal serotonin functions. 5-HT1Dβ is a terminal serotonin autoreceptor which may play a role in regulating serotonin synthesis and release. Using an SSCP technique we screened for 5-HT1Dβ coding sequence variants in psychiatrically interviewed populations, which included controls, alcoholics, and alcoholic arsonists and alcoholic violent offenders with low CSF concentrations of the main serotonin metabolite 5-HIAA. A common polymorphism was identified in the 5-HT1Dβ gene with allele frequencies of 0.72 and 0.28. The SSCP variant was caused by a silent G to C substitution at nucleotide 861 of the coding region. This polymorphism could also be detected as a HincII RFLP of amplified DNA. DNAs from informative CEPH families were typed for the HincII RFLP and analyzed with respect to 20 linked markers on chromosome 6. Multipoint analysis placed the 5-HT1Dβ receptor gene between markers D6S286 and D6S275. A maximum two-point lod score of 10.90 was obtained to D6S26, which had been previously localized on 6q14-15. Chromosomal aberrations involving this region have been previously shown to cause retinal anomalies, developmental delay, and abnormal brain development. This region also contains the gene for North Carolina-type macular dystrophy. 34 refs., 3 figs., 1 tab.
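The allele frequencies reported above (0.72 and 0.28) can be converted into expected genotype proportions under Hardy-Weinberg equilibrium. That equilibrium is an assumption made here for illustration; the abstract does not state whether the sampled populations satisfy it. The function name is hypothetical.

```python
def hardy_weinberg(p):
    """Expected genotype frequencies for a biallelic locus under
    Hardy-Weinberg equilibrium, given major-allele frequency p."""
    q = 1.0 - p
    return p * p, 2.0 * p * q, q * q


# Allele frequencies from the abstract; equilibrium is assumed, not reported.
major_hom, het, minor_hom = hardy_weinberg(0.72)
print(f"major homozygote: {major_hom:.4f}")  # 0.5184
print(f"heterozygote:     {het:.4f}")        # 0.4032
print(f"minor homozygote: {minor_hom:.4f}")  # 0.0784
```

Under this assumption roughly 40% of individuals would be heterozygous, which is what makes a marker of this kind informative for family-based linkage typing.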
Devaney, Joseph M; Tosi, Laura L; Fritz, David T; Gordish-Dressman, Heather A; Jiang, Shan; Orkunoglu-Suer, Funda E; Gordon, Andrew H; Harmon, Brennan T; Thompson, Paul D; Clarkson, Priscilla M; Angelopoulos, Theodore J; Gordon, Paul M; Moyna, Niall M; Pescatello, Linda S; Visich, Paul S; Zoeller, Robert F; Brandoli, Cinzia; Hoffman, Eric P; Rogers, Melissa B
2009-08-15
A classic morphogen, bone morphogenetic protein 2 (BMP2) regulates the differentiation of pluripotent mesenchymal cells. High BMP2 levels promote osteogenesis or chondrogenesis and low levels promote adipogenesis. BMP2 inhibits myogenesis. Thus, BMP2 synthesis is tightly controlled. Several hundred nucleotides within the 3' untranslated regions of BMP2 genes are conserved from mammals to fishes indicating that the region is under stringent selective pressure. Our analyses indicate that this region controls BMP2 synthesis by post-transcriptional mechanisms. A common A to C single nucleotide polymorphism (SNP) in the BMP2 gene (rs15705, +A1123C) disrupts a putative post-transcriptional regulatory motif within the human ultra-conserved sequence. In vitro studies indicate that RNAs bearing the A or C alleles have different protein binding characteristics in extracts from mesenchymal cells. Reporter genes with the C allele of the ultra-conserved sequence were differentially expressed in mesenchymal cells. Finally, we analyzed MRI data from the upper arm of 517 healthy individuals aged 18-41 years. Individuals with the C/C genotype were associated with lower baseline subcutaneous fat volumes (P = 0.0030) and an increased gain in skeletal muscle volume (P = 0.0060) following resistance training in a cohort of young males. The rs15705 SNP explained 2-4% of inter-individual variability in the measured parameters. The rs15705 variant is one of the first genetic markers that may be exploited to facilitate early diagnosis, treatment, and/or prevention of diseases associated with poor fitness. Furthermore, understanding the mechanisms by which regulatory polymorphisms influence BMP2 synthesis will reveal novel pharmaceutical targets for these disabling conditions. (c) 2009 Wiley-Liss, Inc.
Devaney, Joseph M.; Tosi, Laura L.; Fritz, David T.; Gordish-Dressman, Heather A.; Jiang, Shan; Orkunoglu-Suer, Funda E.; Gordon, Andrew H.; Harmon, Brennan T.; Thompson, Paul D.; Clarkson, Priscilla M.; Angelopoulos, Theodore J.; Gordon, Paul M.; Moyna, Niall M.; Pescatello, Linda S.; Visich, Paul S.; Zoeller, Robert F.; Brandoli, Cinzia; Hoffman, Eric P.; Rogers, Melissa B.
2014-01-01
A classic morphogen, bone morphogenetic protein 2 (BMP2) regulates the differentiation of pluripotent mesenchymal cells. High BMP2 levels promote osteogenesis or chondrogenesis and low levels promote adipogenesis. BMP2 inhibits myogenesis. Thus, BMP2 synthesis is tightly controlled. Several hundred nucleotides within the 3′ untranslated regions of BMP2 genes are conserved from mammals to fishes indicating that the region is under stringent selective pressure. Our analyses indicate that this region controls BMP2 synthesis by post-transcriptional mechanisms. A common A to C single nucleotide polymorphism (SNP) in the BMP2 gene (rs15705, +A1123C) disrupts a putative post-transcriptional regulatory motif within the human ultra-conserved sequence. In vitro studies indicate that RNAs bearing the A or C alleles have different protein binding characteristics in extracts from mesenchymal cells. Reporter genes with the C allele of the ultra-conserved sequence were differentially expressed in mesenchymal cells. Finally, we analyzed MRI data from the upper arm of 517 healthy individuals aged 18–41 years. Individuals with the C/C genotype were associated with lower baseline subcutaneous fat volumes (P = 0.0030) and an increased gain in skeletal muscle volume (P = 0.0060) following resistance training in a cohort of young males. The rs15705 SNP explained 2–4% of inter-individual variability in the measured parameters. The rs15705 variant is one of the first genetic markers that may be exploited to facilitate early diagnosis, treatment, and/or prevention of diseases associated with poor fitness. Furthermore, understanding the mechanisms by which regulatory polymorphisms influence BMP2 synthesis will reveal novel pharmaceutical targets for these disabling conditions. PMID:19492344
Carvalho, S; Caldeira, R L; Simpson, A J; Vidigal, T H
2001-01-01
Freshwater snails belonging to the genus Biomphalaria are intermediate hosts of the trematode Schistosoma mansoni in the Neotropical region and Africa. In Brazil, one subspecies and ten species of Biomphalaria have been identified: B. glabrata, B. tenagophila, B. straminea, B. occidentalis, B. peregrina, B. kuhniana, B. schrammi, B. amazonica, B. oligoza, B. intermedia and B.t. guaibensis. However, only the first three species are found naturally infected with S. mansoni. The classical identification of these planorbids is based on comparison of morphological characteristics of the shell and male and female reproductive organs, which is greatly complicated by the extensive intra-specific variation. Several molecular techniques have been used in studies on the identification, genetic structure as well as phylogenetic relationships between these groups of organisms. Using the randomly amplified polymorphic DNAs (RAPD) analysis we demonstrated that B. glabrata exhibits a remarkable degree of intra-specific polymorphism. Thus, the genetics of the snail host may be more important to the epidemiology of schistosomiasis than those of the parasite itself. Using the simple sequence repeat anchored polymerase chain reaction (SSR-PCR) in intra-populational and intra-specific studies we have demonstrated that snails belonging to the B. straminea complex (B. straminea, B. kuhniana and B. intermedia) clearly presented higher heterogeneity. Using the low stringency polymerase chain reaction (LS-PCR) technique we were able to separate B. glabrata from B. tenagophila and B. tenagophila from B. occidentalis. To separate all Brazilian Biomphalaria species we used the restriction fragment length polymorphism (PCR-RFLP) of the internal transcribed spacer region (ITS) of the DNA gene. The method also proved to be efficient for the specific identification of DNA extracted from snail eggs. 
Recently we have sequenced the ITS2 region for phylogenetic studies of all Biomphalaria snails from Brazil.
Emerman, Amy B; Bowman, Sarah K; Barry, Andrew; Henig, Noa; Patel, Kruti M; Gardner, Andrew F; Hendrickson, Cynthia L
2017-07-05
Next-generation sequencing (NGS) is a powerful tool for genomic studies, translational research, and clinical diagnostics that enables the detection of single nucleotide polymorphisms, insertions and deletions, copy number variations, and other genetic variations. Target enrichment technologies improve the efficiency of NGS by only sequencing regions of interest, which reduces sequencing costs while increasing coverage of the selected targets. Here we present NEBNext Direct®, a hybridization-based, target-enrichment approach that addresses many of the shortcomings of traditional target-enrichment methods. This approach features a simple, 7-hr workflow that uses enzymatic removal of off-target sequences to achieve a high specificity for regions of interest. Additionally, unique molecular identifiers are incorporated for the identification and filtering of PCR duplicates. The same protocol can be used across a wide range of input amounts, input types, and panel sizes, enabling NEBNext Direct to be broadly applicable across a wide variety of research and diagnostic needs. © 2017 by John Wiley & Sons, Inc.
Castro-Prieto, Aines; Wachter, Bettina; Melzheimer, Joerg; Thalwitzer, Susanne; Sommer, Simone
2011-01-01
The genes of the major histocompatibility complex (MHC) are a key component of the mammalian immune system and have become important molecular markers for fitness-related genetic variation in wildlife populations. Currently, no information about the MHC sequence variation and constitution in African leopards exists. In this study, we isolated and characterized genetic variation at the adaptively most important region of MHC class I and MHC class II-DRB genes in 25 free-ranging African leopards from Namibia and investigated the mechanisms that generate and maintain MHC polymorphism in the species. Using single-stranded conformation polymorphism analysis and direct sequencing, we detected 6 MHC class I and 6 MHC class II-DRB sequences, which likely correspond to at least 3 MHC class I and 3 MHC class II-DRB loci. Amino acid sequence variation in both MHC classes was higher or similar in comparison to other reported felids. We found signatures of positive selection shaping the diversity of MHC class I and MHC class II-DRB loci during the evolutionary history of the species. A comparison of MHC class I and MHC class II-DRB sequences of the leopard to those of other felids revealed a trans-species mode of evolution. In addition, the evolutionary relationships of MHC class II-DRB sequences between African and Asian leopard subspecies are discussed.
Assessment of genome origins and genetic diversity in the genus Eleusine with DNA markers.
Salimath, S S; de Oliveira, A C; Godwin, I D; Bennetzen, J L
1995-08-01
Finger millet (Eleusine coracana), an allotetraploid cereal, is widely cultivated in the arid and semiarid regions of the world. Three DNA marker techniques, restriction fragment length polymorphism (RFLP), randomly amplified polymorphic DNA (RAPD), and inter simple sequence repeat amplification (ISSR), were employed to analyze 22 accessions belonging to 5 species of Eleusine. An 8-probe/3-enzyme RFLP combination, 18 RAPD primers, and 6 ISSR primers, respectively, revealed 14, 10, and 26% polymorphism in 17 accessions of E. coracana from Africa and Asia. These results indicated a very low level of DNA sequence variability in the finger millets but did allow each line to be distinguished. The different Eleusine species could be easily identified by DNA marker technology and the 16% intraspecific polymorphism exhibited by the two analyzed accessions of E. floccifolia suggested a much higher level of diversity in this species than in E. coracana. Between species, E. coracana and E. indica shared the most markers, while E. indica and E. tristachya shared a considerable number of markers, indicating that these three species form a close genetic assemblage within the Eleusine. Eleusine floccifolia and E. compressa were found to be the most divergent among the species examined. Comparison of RFLP, RAPD, and ISSR technologies, in terms of the quantity and quality of data output, indicated that ISSRs are particularly promising for the analysis of plant genome diversity.
Wang, Haibin; Jiang, Jiafu; Chen, Sumei; Qi, Xiangyu; Fang, Weimin; Guan, Zhiyong; Teng, Nianjun; Liao, Yuan; Chen, Fadi
2014-01-01
The Asteraceae family is at the forefront of evolution due to frequent hybridization. Hybridization is associated with the induction of widespread genetic and epigenetic changes and has played an important role in the evolution of many plant taxa. We attempted the intergeneric cross Chrysanthemum morifolium × Leucanthemum paludosum. To obtain hybrids from this cross, we had to resort to ovule rescue. DNA profiling of the amphihaploid and amphidiploid was carried out using amplified fragment length polymorphism, sequence-related amplified polymorphism, start codon targeted polymorphism, and methylation-sensitive amplification polymorphism (MSAP). Hybridization induced rapid changes at the genetic and the epigenetic levels. The genetic changes mainly involved loss of parental fragments and gain of novel fragments, and some eliminated sequences were possibly from the noncoding region of L. paludosum. The MSAP analysis indicated that the level of DNA methylation was lower in the amphiploid (∼45%) than in the parental lines (51.5-50.6%), whereas it increased after amphidiploid formation. Events associated with intergeneric genomic shock were a feature of the C. morifolium × L. paludosum hybrid, given that the genetic relationship between the parental species is relatively distant. Our results provide genetic and epigenetic evidence for understanding genomic shock in wide crosses between species in Asteraceae and suggest a need to expand our current evolutionary framework to encompass a genetic/epigenetic dimension when seeking to understand wide crosses.
The use of genetic markers in the molecular epidemiology of histoplasmosis: a systematic review.
Damasceno, L S; Leitão, T M J S; Taylor, M L; Muniz, M M; Zancopé-Oliveira, R M
2016-01-01
Histoplasmosis is a systemic mycosis caused by Histoplasma capsulatum, a dimorphic fungal pathogen that can infect both humans and animals. This disease has worldwide distribution and affects mainly immunocompromised individuals. In the environment, H. capsulatum grows as mold but undergoes a morphologic transition to the yeast morphotype under special conditions. Molecular techniques are important tools to conduct epidemiologic investigations for fungal detection, identification of infection sources, and determination of different fungal genotypes associated with a particular disease symptom. In this study, we performed a systematic review in the PubMed database to improve the understanding of the molecular epidemiology of histoplasmosis. This search was restricted to English and Spanish articles. We included a combination of specific keywords: molecular typing [OR] genetic diversity [OR] polymorphism [AND] H. capsulatum; molecular epidemiology [AND] histoplasmosis; and molecular epidemiology [AND] Histoplasma. In addition, we used the specific terms: histoplasmosis [AND] outbreaks. Non-English or non-Spanish articles, dead links, and duplicate results were excluded from the review. The results show that the main methods used for molecular typing of H. capsulatum were: restriction fragment length polymorphism, random amplified polymorphic DNA, microsatellite polymorphism, sequencing of the internal transcribed spacer region, and multilocus sequence typing. Different genetic profiles were identified among H. capsulatum isolates, which can be grouped according to their source, geographical origin, and clinical manifestations.
Yang, Q L; Huang, X Y; Kong, J J; Zhao, S G; Liu, L X; Gun, S B
2016-08-19
Piglet diarrhea is one of the primary factors that affects the benefits of the swine industry. Recent studies have shown that exon 2 of the swine leukocyte antigen-DQA gene is associated with piglet resistance to diarrhea; however, the contributions of additional exon coding regions of this gene remain unclear. Here, we detected and sequenced variants in the exon 3 region and examined their associations with diarrhea infection in 425 suckling piglets using the polymerase chain reaction-single-strand conformational polymorphism and sequencing analysis. The results revealed that exon 3 of the swine leukocyte antigen-DQA gene is highly polymorphic and pivotal to both diarrhea susceptibility and resistance in piglets. We identified 14 genotypes (AA, AB, BB, BC, CC, EE, EF, BE, BF, CF, DD, DH, GG, and GF) and eight alleles (A-H) that were generated by 14 nucleotide variants, eight of which were novel, and three nucleotide deletions. Statistical analyses revealed that the genotypes AB and EF were associated with resistance to diarrheal disease (P < 0.05), and the genotype DD may contribute to diarrhea susceptibility but was unique to Large White pigs (P > 0.05). These results elucidate the genetic and immunological background to piglet diarrhea, and provide useful information for resistance breeding programs.
Boudon, Sylvain; Manceau, Charles; Nottéghem, Jean-Loup
2005-09-01
Xanthomonas arboricola pv. pruni, the causal agent of bacterial spot on stone fruit, was found in 1995 in several orchards in southeastern France. We studied population genetics of this emerging pathogen in comparison with populations from the United States, where the disease was first described, and from Italy, where the disease has occurred since 1920. Four housekeeping genes (atpD, dnaK, efp, and glnA) and the intergenic transcribed spacer region were sequenced, for a total of 3.9 kb of sequence, and fluorescent amplified fragment length polymorphism (FAFLP) analysis was performed. A collection of 64 X. arboricola pv. pruni strains, including 23 strains from France, was analyzed. The X. arboricola pv. pruni population had a low diversity because no sequence polymorphisms were observed. Population diversity revealed by FAFLP was lower for the West European population than for the American population. The same bacterial genotype was detected from five countries on three continents, a geographic distribution that can be explained by human-aided migration of bacteria. Our data support the hypothesis that the pathogen originated in the United States and subsequently has been disseminated to other stone-fruit-growing regions of the world. In France, emergence of this disease was due to a recent introduction of the most prevalent genotype of the bacterium found worldwide.
Regulatory polymorphisms modulate the expression of HLA class II molecules and promote autoimmunity
Raj, Prithvi; Rai, Ekta; Song, Ran; Khan, Shaheen; Wakeland, Benjamin E; Viswanathan, Kasthuribai; Arana, Carlos; Liang, Chaoying; Zhang, Bo; Dozmorov, Igor; Carr-Johnson, Ferdicia; Mitrovic, Mitja; Wiley, Graham B; Kelly, Jennifer A; Lauwerys, Bernard R; Olsen, Nancy J; Cotsapas, Chris; Garcia, Christine K; Wise, Carol A; Harley, John B; Nath, Swapan K; James, Judith A; Jacob, Chaim O; Tsao, Betty P; Pasare, Chandrashekhar; Karp, David R; Li, Quan Zhen; Gaffney, Patrick M; Wakeland, Edward K
2016-01-01
Targeted sequencing of sixteen SLE risk loci among 1349 Caucasian cases and controls produced a comprehensive dataset of the variations causing susceptibility to systemic lupus erythematosus (SLE). Two independent disease association signals in the HLA-D region identified two regulatory regions containing 3562 polymorphisms that modified thirty-seven transcription factor binding sites. These extensive functional variations are a new and potent facet of HLA polymorphism. Variations modifying the consensus binding motifs of IRF4 and CTCF in the XL9 regulatory complex modified the transcription of HLA-DRB1, HLA-DQA1 and HLA-DQB1 in a chromosome-specific manner, resulting in a 2.5-fold increase in the surface expression of HLA-DR and DQ molecules on dendritic cells with SLE risk genotypes, which increases to over 4-fold after stimulation. Similar analyses of fifteen other SLE risk loci identified 1206 functional variants tightly linked with disease-associated SNPs and demonstrated that common disease alleles contain multiple causal variants modulating multiple immune system genes. DOI: http://dx.doi.org/10.7554/eLife.12089.001 PMID:26880555
Hansen, Tina V A; Thamsborg, Stig M; Olsen, Annette; Prichard, Roger K; Nejsum, Peter
2013-08-12
The whipworm Trichuris trichiura has been estimated to infect 604 - 795 million people worldwide. The current control strategy against trichuriasis using the benzimidazoles (BZs) albendazole (400 mg) or mebendazole (500 mg) as single-dose treatment is not satisfactory. The occurrence of single nucleotide polymorphisms (SNPs) in codons 167, 198 or 200 of the beta-tubulin gene has been reported to convey BZ-resistance in intestinal nematodes of veterinary importance. It was hypothesised that the low susceptibility of T. trichiura to BZ could be due to a natural occurrence of such SNPs. The aim of this study was to investigate whether these SNPs were present in the beta-tubulin gene of Trichuris spp. from humans and baboons. As a secondary objective, the degree of identity between T. trichiura from humans and Trichuris spp. from baboons was evaluated based on the beta-tubulin gene and the internal transcribed spacer 2 region (ITS2). Nucleotide sequences of the beta-tubulin gene were generated by PCR using degenerate primers, specific primers and DNA from worms and eggs of T. trichiura and worms of Trichuris spp. from baboons. The ITS2 region was amplified using adult Trichuris spp. from baboons. PCR products were sequenced and analysed. The beta-tubulin fragments were studied for SNPs in codons 167, 198 or 200 and the ITS2 amplicons were compared with GenBank records of T. trichiura. No SNPs in codons 167, 198 or 200 were identified in any of the analysed Trichuris spp. from humans and baboons. Based on the ITS2 region, the similarity between Trichuris spp. from baboons and GenBank records of T. trichiura was found to be 98 - 99%. Single nucleotide polymorphisms in codon 167, 198 and 200, known to confer BZ-resistance in other nematodes, were absent in the studied material. This study does not provide data that could explain previous reports of poor BZ treatment efficacy in terms of polymorphism in these codons of beta-tubulin. 
Based on a fragment of the beta-tubulin gene and the ITS2 region sequenced, it was found that T. trichiura from humans and Trichuris spp. isolated from baboons are closely related and may be the same species.
2013-01-01
Background The whipworm Trichuris trichiura has been estimated to infect 604 – 795 million people worldwide. The current control strategy against trichuriasis using the benzimidazoles (BZs) albendazole (400 mg) or mebendazole (500 mg) as single-dose treatment is not satisfactory. The occurrence of single nucleotide polymorphisms (SNPs) in codons 167, 198 or 200 of the beta-tubulin gene has been reported to convey BZ-resistance in intestinal nematodes of veterinary importance. It was hypothesised that the low susceptibility of T. trichiura to BZ could be due to a natural occurrence of such SNPs. The aim of this study was to investigate whether these SNPs were present in the beta-tubulin gene of Trichuris spp. from humans and baboons. As a secondary objective, the degree of identity between T. trichiura from humans and Trichuris spp. from baboons was evaluated based on the beta-tubulin gene and the internal transcribed spacer 2 region (ITS2). Methods Nucleotide sequences of the beta-tubulin gene were generated by PCR using degenerate primers, specific primers and DNA from worms and eggs of T. trichiura and worms of Trichuris spp. from baboons. The ITS2 region was amplified using adult Trichuris spp. from baboons. PCR products were sequenced and analysed. The beta-tubulin fragments were studied for SNPs in codons 167, 198 or 200 and the ITS2 amplicons were compared with GenBank records of T. trichiura. Results No SNPs in codons 167, 198 or 200 were identified in any of the analysed Trichuris spp. from humans and baboons. Based on the ITS2 region, the similarity between Trichuris spp. from baboons and GenBank records of T. trichiura was found to be 98 – 99%. Conclusions Single nucleotide polymorphisms in codon 167, 198 and 200, known to confer BZ-resistance in other nematodes, were absent in the studied material. This study does not provide data that could explain previous reports of poor BZ treatment efficacy in terms of polymorphism in these codons of beta-tubulin. 
Based on a fragment of the beta-tubulin gene and the ITS2 region sequenced, it was found that T. trichiura from humans and Trichuris spp. isolated from baboons are closely related and may be the same species. PMID:23938038
Choi, Hong-Kyu; Kim, Dongjin; Uhm, Taesik; Limpens, Eric; Lim, Hyunju; Mun, Jeong-Hwan; Kalo, Peter; Penmetsa, R Varma; Seres, Andrea; Kulikova, Olga; Roe, Bruce A; Bisseling, Ton; Kiss, Gyorgy B; Cook, Douglas R
2004-01-01
A core genetic map of the legume Medicago truncatula has been established by analyzing the segregation of 288 sequence-characterized genetic markers in an F(2) population composed of 93 individuals. These molecular markers correspond to 141 ESTs, 80 BAC end sequence tags, and 67 resistance gene analogs, covering 513 cM. In the case of EST-based markers we used an intron-targeted marker strategy with primers designed to anneal in conserved exon regions and to amplify across intron regions. Polymorphisms were significantly more frequent in intron vs. exon regions, thus providing an efficient mechanism to map transcribed genes. Genetic and cytogenetic analysis produced eight well-resolved linkage groups, which have been previously correlated with eight chromosomes by means of FISH with mapped BAC clones. We anticipated that mapping of conserved coding regions would have utility for comparative mapping among legumes; thus 60 of the EST-based primer pairs were designed to amplify orthologous sequences across a range of legume species. As an initial test of this strategy, we used primers designed against M. truncatula exon sequences to rapidly map genes in M. sativa. The resulting comparative map, which includes 68 bridging markers, indicates that the two Medicago genomes are highly similar and establishes the basis for a Medicago composite map. PMID:15082563
Depaulis, F; Brazier, L; Veuille, M
1999-01-01
The hitchhiking model of population genetics predicts that an allele favored by Darwinian selection can replace haplotypes from the same locus previously established at a neutral mutation-drift equilibrium. This process, known as "selective sweep," was studied by comparing molecular variation between the polymorphic In(2L)t inversion and the standard chromosome. Sequence variation was recorded at the Suppressor of Hairless (Su[H]) gene in an African population of Drosophila melanogaster. We found 47 nucleotide polymorphisms among 20 sequences of 1.2 kb. Neutrality tests were nonsignificant at the nucleotide level. However, these sites were strongly associated, because 290 out of 741 observed pairwise combinations between them were in significant linkage disequilibrium. We found only seven haplotypes, two occurring in the 9 In(2L)t chromosomes, and five in the 11 standard chromosomes, with no shared haplotype. Two haplotypes, one in each chromosome arrangement, made up two-thirds of the sample. This low haplotype diversity departed from neutrality in a haplotype test. This pattern supports a selective sweep hypothesis for the Su(H) chromosome region. PMID:10388820
Retter, Ida; Chevillard, Christophe; Scharfe, Maren; Conrad, Ansgar; Hafner, Martin; Im, Tschong-Hun; Ludewig, Monika; Nordsiek, Gabriele; Severitt, Simone; Thies, Stephanie; Mauhar, America; Blöcker, Helmut; Müller, Werner; Riblet, Roy
2009-01-01
Although the entire mouse genome has been sequenced, there remain challenges concerning the elucidation of particular complex and polymorphic genomic loci. In the murine Igh locus, different haplotypes exist in different inbred mouse strains. For example, the Ighb haplotype sequence of the Mouse Genome Project strain C57BL/6 differs considerably from the Igha haplotype of BALB/c, which has been widely used in the analyses of Ab responses. We have sequenced and annotated the 3′ half of the Igha locus of 129S1/SvImJ, covering the CH region and approximately half of the VH region. This sequence comprises 128 VH genes, of which 49 are judged to be functional. The comparison of the Igha sequence with the homologous Ighb region from C57BL/6 revealed two major expansions in the germline repertoire of Igha. In addition, we found smaller haplotype-specific differences like the duplication of five VH genes in the Igha locus. We generated a VH allele table by comparing the individual VH genes of both haplotypes. Surprisingly, the number and position of DH genes in the 129S1 strain differs not only from the sequence of C57BL/6 but also from the map published for BALB/c. Taken together, the contiguous genomic sequence of the 3′ part of the Igha locus allows a detailed view of the recent evolution of this highly dynamic locus in the mouse. PMID:17675503
Identification of true EST alignments for recognising transcribed regions.
Ma, Chuang; Wang, Jia; Li, Lun; Duan, Mo-Jie; Zhou, Yan-Hong
2011-01-01
Transcribed regions can be determined by aligning Expressed Sequence Tags (ESTs) with genome sequences. The kernel of this strategy is to effectively distinguish true EST alignments from spurious ones. In this study, three measures (Direction Check, Identity Check and Terminal Check) were introduced to more effectively eliminate spurious EST alignments. On the basis of these introduced measures and other widely used measures, a computational tool, named ESTCleanser, has been developed to identify true EST alignments for obtaining reliable transcribed regions. The performance of ESTCleanser has been evaluated on the well-annotated human ENCyclopedia of DNA Elements (ENCODE) regions using human ESTs in the dbEST database. The evaluation results show that the accuracy of ESTCleanser at exon and intron levels is markedly higher than that of UCSC-spliced EST alignments. This work should be helpful for EST-based research on finding new genes, complementing genome annotation, and recognising alternative splicing events and Single Nucleotide Polymorphisms (SNPs).
Camargo, U; Toledo, R A; Cintra, J R; Nunes, D P T; Acayaba de Toledo, R; Brandão de Mattos, C C; Mattos, L C
2012-05-07
Genes located outside the HLA region (6p21) have been considered as candidates for susceptibility to ankylosing spondylitis. We tested the hypothesis that the G22A polymorphism of the adenosine deaminase gene (ADA; 20q13.11) is associated with ankylosing spondylitis in 166 Brazilian subjects genotyped for the HLA-B*27 gene (47 patients and 119 controls matched for gender, age and geographic origin). The HLA-B*27 gene and the G22A ADA polymorphism were identified by PCR with sequence-specific oligonucleotide probes and PCR-RFLP, respectively. There were no significant differences in frequencies of ADA genotypes [odds ratio (OR) = 1.200, 95% confidence interval (CI) = 0.3102-4.643, P > 0.8] and ADA*01 and ADA*02 alleles (OR = 1.192, 95%CI = 0.3155-4.505, P > 0.8) in patients versus controls. We conclude that the G22A polymorphism is not associated with ankylosing spondylitis.
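The odds ratios and confidence intervals quoted above are the usual summary of a 2x2 case-control table. A minimal sketch of that calculation follows, assuming the common Woolf (log-scale) interval; the abstract does not say which interval method was used, and the counts in the example are hypothetical, not the study's data.

```python
import math

def odds_ratio_ci(case_exposed, case_unexposed, ctrl_exposed, ctrl_unexposed, z=1.96):
    """Odds ratio with a Woolf (log-scale) confidence interval for a 2x2 table.

    Assumption: all four cell counts are non-zero. z=1.96 gives a 95% interval.
    """
    odds_ratio = (case_exposed * ctrl_unexposed) / (case_unexposed * ctrl_exposed)
    # Standard error of ln(OR) is the root of the summed reciprocal cell counts.
    se = math.sqrt(1 / case_exposed + 1 / case_unexposed
                   + 1 / ctrl_exposed + 1 / ctrl_unexposed)
    log_or = math.log(odds_ratio)
    return odds_ratio, math.exp(log_or - z * se), math.exp(log_or + z * se)

# Hypothetical counts for illustration only (NOT taken from the study).
or_value, ci_low, ci_high = odds_ratio_ci(10, 20, 5, 40)
print(f"OR = {or_value:.3f}, 95% CI = {ci_low:.3f}-{ci_high:.3f}")
```

An interval that spans 1, such as the 0.3102-4.643 reported in the abstract, is consistent with no detectable association.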
DOE Office of Scientific and Technical Information (OSTI.GOV)
Low, P.S.; Liu, Y.; Saha, N.
A length polymorphism at the 5′ untranslated region of the ATIII gene has been described as having been detected by polymerase chain reaction (PCR) with a frequency of 0.75 for the short allele (S) in the Caucasian population. This length polymorphism of the ATIII gene has been studied in 251 Chinese healthy subjects. Genomic DNA was amplified by PCR with primers of published sequences. Fragments of the amplified DNA were separated by agarose gel electrophoresis (3% NuSieve and 1% Seakem GTG) and photographed on a UV transilluminator. The frequency of the short allele (S) was found to be significantly lower (0.37) than that in the Caucasians (0.75). The distribution of genotypes of this polymorphism of the ATIII gene was at Hardy-Weinberg equilibrium. The large difference of allelic frequencies in the Mongoloid and Caucasian populations makes it a useful marker for population studies.
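The Hardy-Weinberg statement above can be illustrated with a short sketch. The allele frequency (0.37) and sample size (251) come from the abstract; the genotype labels SS/SL/LL, the function names, and the observed counts passed to the chi-square helper are assumptions for illustration, since the abstract does not report genotype counts.

```python
def hwe_expected_counts(freq_s, n_subjects):
    """Expected genotype counts under Hardy-Weinberg for a two-allele locus."""
    freq_l = 1.0 - freq_s
    return {
        "SS": freq_s ** 2 * n_subjects,
        "SL": 2 * freq_s * freq_l * n_subjects,
        "LL": freq_l ** 2 * n_subjects,
    }

def hwe_chi_square(n_ss, n_sl, n_ll):
    """Chi-square statistic comparing observed genotype counts with HWE expectations."""
    n = n_ss + n_sl + n_ll
    freq_s = (2 * n_ss + n_sl) / (2 * n)  # allele frequency estimated from the data
    expected = hwe_expected_counts(freq_s, n)
    observed = {"SS": n_ss, "SL": n_sl, "LL": n_ll}
    return sum((observed[g] - expected[g]) ** 2 / expected[g] for g in observed)

# Expected counts for the reported S frequency of 0.37 in 251 subjects.
expected = hwe_expected_counts(0.37, 251)
# Hypothetical observed counts that match HWE exactly, giving a statistic of 0.
perfect_fit = hwe_chi_square(25, 50, 25)
```

The statistic would be compared against a chi-square distribution with one degree of freedom; small values indicate no departure from equilibrium.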
Venugopal, Priyanka; Lavu, Vamsi; RangaRao, Suresh; Venkatesan, Vettriselvi
2017-04-01
Periodontitis is an inflammatory disease caused by bacterial triggering of the host immune-inflammatory response, which in turn is regulated by microRNAs (miRNA). Polymorphisms in the miRNA pathways affect the expression of several target genes such as tumor necrosis factor-α and interleukins, which are associated with progression of disease. The objective of this study was to identify the association between the MiR-146a single nucleotide polymorphisms (SNPs) (rs2910164, rs57095329, and rs73318382), the MiR-196a2 (rs11614913) SNP and chronic periodontitis. Genotyping was performed for the MiR-146a (rs2910164, rs57095329, and rs73318382) and the MiR-196a2 (rs11614913) polymorphisms in 180 healthy controls and 190 cases of chronic periodontitis by the direct Sanger sequencing technique. The strength of the association between the polymorphisms and chronic periodontitis was evaluated using logistic regression analysis. Haplotype and linkage analyses among the polymorphisms were performed. Multifactorial dimensionality reduction was performed to determine epistatic interaction among the polymorphisms. The MiR-196a2 polymorphism revealed a significant inverse association with chronic periodontitis. Haplotype analysis of MiR-146a and MiR-196a2 polymorphisms revealed 13 different combinations, of which 5 were found to have an inverse association with chronic periodontitis. The present study has demonstrated a significant inverse association of MiR-196a2 polymorphism with chronic periodontitis.
de Souza, Gustavo A.; Arntzen, Magnus Ø.; Fortuin, Suereta; Schürch, Anita C.; Målen, Hiwa; McEvoy, Christopher R. E.; van Soolingen, Dick; Thiede, Bernd; Warren, Robin M.; Wiker, Harald G.
2011-01-01
Precise annotation of genes or open reading frames is still a difficult task that results in divergence even for data generated from the same genomic sequence. This has an impact in further proteomic studies, and also compromises the characterization of clinical isolates with many specific genetic variations that may not be represented in the selected database. We recently developed software called multistrain mass spectrometry prokaryotic database builder (MSMSpdbb) that can merge protein databases from several sources and be applied on any prokaryotic organism, in a proteomic-friendly approach. We generated a database for the Mycobacterium tuberculosis complex (using three strains of Mycobacterium bovis and five of M. tuberculosis), and analyzed data collected from two laboratory strains and two clinical isolates of M. tuberculosis. We identified 2561 proteins, of which 24 were present in M. tuberculosis H37Rv samples, but not annotated in the M. tuberculosis H37Rv genome. We were also able to identify 280 nonsynonymous single amino acid polymorphisms and confirm 367 translational start sites. As a proof of concept we applied the database to whole-genome DNA sequencing data of one of the clinical isolates, which allowed the validation of 116 predicted single amino acid polymorphisms and the annotation of 131 N-terminal start sites. Moreover we identified regions not present in the original M. tuberculosis H37Rv sequence, indicating strain divergence or errors in the reference sequence. In conclusion, we demonstrated the potential of using a merged database to better characterize laboratory or clinical bacterial strains. PMID:21030493
Polymorphism at Expressed DQ and DR Loci in Five Common Equine MHC Haplotypes
Miller, Donald; Tallmadge, Rebecca L.; Binns, Matthew; Zhu, Baoli; Mohamoud, Yasmin Ali; Ahmed, Ayeda; Brooks, Samantha A.; Antczak, Douglas F.
2016-01-01
The polymorphism of Major Histocompatibility Complex (MHC) class II DQ and DR genes in five common Equine Leukocyte Antigen (ELA) haplotypes was determined through sequencing of mRNA transcripts isolated from lymphocytes of eight ELA homozygous horses. Ten expressed MHC class II genes were detected in horses of the ELA-A3 haplotype carried by the donor horses of the equine Bacterial Artificial Chromosome (BAC) library and the reference genome sequence: four DR genes and six DQ genes. The other four ELA haplotypes contained at least eight expressed polymorphic MHC class II loci. Next Generation Sequencing (NGS) of genomic DNA of these four MHC haplotypes revealed stop codons in the DQA3 gene in the ELA-A2, ELA-A5, and ELA-A9 haplotypes. Few NGS reads were obtained for the other MHC class II genes that were not amplified in these horses. The amino acid sequences across haplotypes contained locus-specific residues, and the locus clusters produced by phylogenetic analysis were well supported. The MHC class II alleles within the five tested haplotypes were largely non-overlapping between haplotypes. The complement of equine MHC class II DQ and DR genes appears to be well conserved between haplotypes, in contrast to the recently described variation in class I gene loci between equine MHC haplotypes. The identification of allelic series of equine MHC class II loci will aid comparative studies of mammalian MHC conservation and evolution and may also help to interpret associations between the equine MHC class II region and diseases of the horse. PMID:27889800
Grindflek, Eli; Hansen, Marianne H S; Lien, Sigbjørn; van Son, Maren
2018-05-29
Umbilical hernia is one of the most prevalent congenital defects in pigs, causing economic losses and substantial animal welfare problems. Identification and implementation of genomic regions controlling umbilical hernia in breeding is of great interest to reduce incidences of hernia in commercial pig production. The aim of this study was to identify such regions and possibly identify causative variation affecting umbilical hernia in pigs. A case/control material consisting of 739 Norwegian Landrace pigs was collected and applied in a GWAS study with a genome-wide distributed panel of 60 K SNPs. Additionally, candidate genes were sequenced to detect additional polymorphisms that were used for single SNP and haplotype association analyses in 453 of the pigs. The GWAS in this report detected a highly significant region affecting umbilical hernia around 50 Mb on SSC14 (P < 0.0001) explaining up to 8.6% of the phenotypic variance of the trait. The region is rather broad and includes 62 significant SNPs in high linkage disequilibrium with each other. Targeted sequencing of candidate genes within the region revealed polymorphisms within the Leukemia inhibitory factor (LIF) and Oncostatin M (OSM) that were significantly associated with umbilical hernia (P < 0.001). A highly significant QTL for umbilical hernia in Norwegian Landrace pigs was detected around 50 Mb on SSC14. Resequencing of candidate genes within the region revealed SNPs within LIF and OSM highly associated with the trait. However, because of extended LD within the region, studies in other populations and functional studies are needed to determine whether these variants are causal or not. Even without this knowledge, SNPs within the region can be used as genetic markers to reduce incidences of umbilical hernia in Norwegian Landrace pigs.
NASA Astrophysics Data System (ADS)
Liu, Siwei; Li, Qi; Yu, Hong; Kong, Lingfeng
2017-02-01
Glycogen is important not only for the energy supply of oysters, but also for human consumption. High glycogen content can improve the stress survival of oysters. A key enzyme in glycogenesis is glycogen synthase, which is encoded by the glycogen synthase gene GYS. In this study, the relationship between single nucleotide polymorphisms (SNPs) in coding regions of Crassostrea gigas GYS (Cg-GYS) and individual glycogen content was investigated with 321 individuals from five full-sib families. Single-strand conformation polymorphism (SSCP) procedure was combined with sequencing to confirm individual SNP genotypes of Cg-GYS. Least-square analysis of variance was performed to assess the relationship of variation in glycogen content of C. gigas with single SNP genotype and SNP haplotype. As a consequence, six SNPs were found in coding regions to be significantly associated with glycogen content (P < 0.01), from which we constructed four main haplotypes due to linkage disequilibrium. Furthermore, the most effective haplotype H2 (GAGGAT) had an extremely significant relationship with high glycogen content (P < 0.0001). These findings revealed the potential influence of Cg-GYS polymorphism on the glycogen content and provided molecular biological information for the selective breeding of good quality traits of C. gigas.
Anisimov, Andrey P; Panfertsev, Evgeniy A; Svetoch, Tat'yana E; Dentovskaya, Svetlana V
2007-01-01
Sequencing of lcrV genes and comparison of the deduced amino acid sequences from ten Y. pestis strains belonging mostly to the group of atypical rhamnose-positive isolates (non-pestis subspecies or pestoides group) showed that the LcrV proteins analyzed could be classified into five sequence types. This classification was based on major amino acid polymorphisms among LcrV proteins in the four "hot points" of the protein sequences. Some additional minor polymorphisms were found throughout these sequence types. The "hot points" corresponded to amino acids 18 (Lys --> Asn), 72 (Lys --> Arg), 273 (Cys --> Ser), and 324-326 (Ser-Gly-Lys --> Arg) in the LcrV sequence of the reference Y. pestis strain CO92. One possible explanation for polymorphism in amino acid sequences of LcrV among different strains is that strain-specific variation resulted from adaptation of the plague pathogen to different rodent and lagomorph hosts.
Devold, M; Falk, K; Dale, B; Krossøy, B; Biering, E; Aspehaug, V; Nilsen, F; Nylund, A
2001-11-08
Infectious salmon anemia (ISA) is caused by a virus that probably belongs to the Orthomyxoviridae and was first recorded in Norway in 1984. The disease has since spread along the Norwegian coast and has later been found in Canada, Scotland, the Faroe Islands, Chile, and the USA. This study presents sequence variation of the hemagglutinin gene from 37 ISA virus isolates, viz. one isolate from Scotland, one from Canada and 35 from Norway. The hemagglutinin gene contains a highly polymorphic region (HPR), which together with the rest of the gene sequence provides a good tool for studies of epizootics. The gene shows temporal and geographical sequence variation, where certain areas are dominated by distinct groups of isolates. Evidence of transmission of ISA virus isolates within and between regions is given. It is suggested that the hemagglutinin gene from different isolates may recombine. Possible recombination sites are found within the HPR and in the 5'-end flanking region close to the HPR.
Damodar R. Kethidi; David B. Roden; Tim R. Ladd; Peter J. Krell; Arthur Ratnakaran; Qili Feng
2003-01-01
DNA markers were identified for the molecular detection of the Asian long-horned beetle (ALB), Anoplophora glabripennis (Mot.), based on sequence characterized amplified regions (SCARs) derived from random amplified polymorphic DNA (RAPD) fragments. A 2,740-bp DNA fragment that was present only in ALB and not in other Cerambycids was identified after...
Sun, Jiajie; Gao, Yuan; Liu, Dong; Ma, Wei; Xue, Jing; Zhang, Chunlei; Lan, Xianyong; Lei, Chuzhao; Chen, Hong
2012-06-01
The insulin-induced gene 1 (INSIG1) gene encodes a protein that blocks proteolytic activation of sterol regulatory element binding proteins, which are transcription factors that activate genes that regulate cholesterol, fatty acid, and glucose metabolism. However, similar research for the bovine INSIG1 gene is lacking. Therefore, in this study, polymorphisms of the bovine INSIG1 gene were detected in 643 individuals from four cattle breeds by DNA pooling, forced PCR-RFLP, PCR-SSCP, and DNA sequencing methods. Only 10 novel SNPs were identified, which included four mutations in the coding region and the others in the introns. In Nanyang individuals, seven common haplotypes were identified based on four coding region SNPs. The haplotype GACT, with a frequency of 75.4%, was the most prevalent haplotype, and the SNPs formed two linkage disequilibrium blocks with strong multi-allelic D' (D' = 1). Additionally, association analysis between mutations of the bovine INSIG1 gene and growth traits in Nanyang cattle at 6, 12, 18, and 24 months old was performed, and the results indicated that the polymorphisms were not significantly associated with body mass.
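For readers unfamiliar with the D' statistic mentioned above, the following is a minimal sketch of the pairwise, bi-allelic form (Lewontin's normalised disequilibrium). It is not the multi-allelic variant the abstract refers to, and the haplotype and allele frequencies in the example are hypothetical.

```python
def d_prime(p_ab, p_a, p_b):
    """Lewontin's D' for two bi-allelic loci.

    p_ab: frequency of the haplotype carrying allele A and allele B
    p_a, p_b: frequencies of allele A and allele B
    """
    d = p_ab - p_a * p_b  # raw disequilibrium coefficient
    # Normalise by the largest |D| possible given the allele frequencies.
    if d >= 0:
        d_max = min(p_a * (1 - p_b), (1 - p_a) * p_b)
    else:
        d_max = min(p_a * p_b, (1 - p_a) * (1 - p_b))
    if d_max == 0:
        return 0.0  # a monomorphic locus carries no LD information
    return abs(d) / d_max

# Hypothetical frequencies: every A-bearing haplotype also carries B.
complete_ld = d_prime(0.3, 0.3, 0.6)
# Hypothetical frequencies: alleles combine at random.
no_ld = d_prime(0.25, 0.5, 0.5)
```

A value of D' = 1, as reported for the two blocks, means that at least one of the four possible two-locus haplotypes is absent from the sample.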
DOE Office of Scientific and Technical Information (OSTI.GOV)
Cuppens, H.; Marynen, P.; Cassiman, J.J.
1993-12-01
The authors have previously shown that about 85% of the mutations in 194 Belgian cystic fibrosis alleles could be detected by a reverse dot-blot assay. In the present study, 50 Belgian chromosomes were analyzed for mutations in the cystic fibrosis transmembrane conductance regulator gene by means of direct solid phase automatic sequencing of PCR products of individual exons. Twenty-six disease mutations and 14 polymorphisms were found. Twelve of these mutations and 3 polymorphisms were not described before. With the exception of one mutant allele carrying two mutations, these mutations were the only mutations found in the complete coding region and their exon/intron boundaries. The total sensitivity of mutant CF alleles that could be identified was 98.5%. Given the heterogeneity of these mutations, most of them very rare, CFTR mutation screening still remains rather complex in the population, and population screening, whether desirable or not, does not appear to be technically feasible with the methods currently available. 24 refs., 1 fig., 2 tabs.
Bashir, K M I; Awan, F S; Khan, I A; Khan, A I; Usman, M
2014-05-30
Roses (Rosa indica) belong to one of the most crucial groups of plants in the floriculture industry. Rosa species have special fragrances of interest to the perfume and pharmaceutical industries. The genetic diversity of plants based on morphological characteristics is difficult to measure under natural conditions due to the influence of environmental factors, which is why a reliable fingerprinting method was developed to overcome this problem. The development of molecular markers will enable the identification of Rosa species. In the present study, randomly amplified polymorphic DNA (RAPD) analysis was done on four Rosa species, Rosa gruss-an-teplitz (Surkha), Rosa bourboniana, Rosa centifolia, and Rosa damascena. A polymorphic RAPD fragment of 391 bp was detected in R. bourboniana, which was cloned, purified, sequenced, and used to design a pair of species-specific sequence-characterized amplified region (SCAR) primers (forward and reverse). These SCAR primers were used to amplify the specific regions of the rose genome. These PCR amplifications with specific primers are less sensitive to reaction conditions, and due to their high reproducibility, these species-specific SCAR primers can be used for marker-assisted selection and identification of Rosa species.
Goettel, Wolfgang; Xia, Eric; Upchurch, Robert; Wang, Ming-Li; Chen, Pengyin; An, Yong-Qiang Charles
2014-04-23
Variation in seed oil composition and content among soybean varieties is largely attributed to differences in transcript sequences and/or transcript accumulation of oil production related genes in seeds. Discovery and analysis of sequence and expression variations in these genes will accelerate soybean oil quality improvement. In an effort to identify these variations, we sequenced the transcriptomes of soybean seeds from nine lines varying in oil composition and/or total oil content. Our results showed that 69,338 distinct transcripts from 32,885 annotated genes were expressed in seeds. A total of 8,037 transcript expression polymorphisms and 50,485 transcript sequence polymorphisms (48,792 SNPs and 1,693 small Indels) were identified among the lines. Effects of the transcript polymorphisms on their encoded protein sequences and functions were predicted. The studies also provided independent evidence that the lack of FAD2-1A gene activity and a non-synonymous SNP in the coding sequence of FAB2C caused elevated oleic acid and stearic acid levels in soybean lines M23 and FAM94-41, respectively. As a proof-of-concept, we developed an integrated RNA-seq and bioinformatics approach to identify and functionally annotate transcript polymorphisms, and demonstrated its high effectiveness for discovery of genetic and transcript variations that result in altered oil quality traits. The collection of transcript polymorphisms coupled with their predicted functional effects will be a valuable asset for further discovery of genes, gene variants, and functional markers to improve soybean oil quality.
Pajuelo, Mónica J.; Eguiluz, María; Dahlstrom, Eric; Requena, David; Guzmán, Frank; Ramirez, Manuel; Sheen, Patricia; Frace, Michael; Sammons, Scott; Cama, Vitaliano; Anzick, Sarah; Bruno, Dan; Mahanty, Siddhartha; Wilkins, Patricia; Nash, Theodore; Gonzalez, Armando; García, Héctor H.; Gilman, Robert H.; Porcella, Steve; Zimic, Mirko
2015-01-01
Background Infections with Taenia solium are the most common cause of adult acquired seizures worldwide, and are the leading cause of epilepsy in developing countries. A better understanding of the genetic diversity of T. solium will improve parasite diagnostics and transmission pathways in endemic areas thereby facilitating the design of future control measures and interventions. Microsatellite markers are useful genome features, which enable strain typing and identification in complex pathogen genomes. Here we describe microsatellite identification and characterization in T. solium, providing information that will assist in global efforts to control this important pathogen. Methods For genome sequencing, T. solium cysts and proglottids were collected from Huancayo and Puno in Peru, respectively. Using next generation sequencing (NGS) and de novo assembly, we assembled two draft genomes and one hybrid genome. Microsatellite sequences were identified and 36 of them were selected for further analysis. Twenty T. solium isolates were collected from Tumbes in the northern region, and twenty from Puno in the southern region of Peru. The size-polymorphism of the selected microsatellites was determined with multi-capillary electrophoresis. We analyzed the association between microsatellite polymorphism and the geographic origin of the samples. Results The predicted size of the hybrid (proglottid genome combined with cyst genome) T. solium genome was 111 MB with a GC content of 42.54%. A total of 7,979 contigs (>1,000 nt) were obtained. We identified 9,129 microsatellites in the Puno-proglottid genome and 9,936 in the Huancayo-cyst genome, with 5 or more repeats, ranging from mono- to hexa-nucleotide. Seven microsatellites were polymorphic and 29 were monomorphic within the analyzed isolates. T. solium tapeworms were classified into two genetic groups that correlated with the North/South geographic origin of the parasites. 
Conclusions/Significance The availability of draft genomes for T. solium represents a significant step towards understanding the biology of the parasite. We report here a set of T. solium polymorphic microsatellite markers that appear promising for genetic epidemiology studies. PMID:26697878
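The microsatellite criteria stated in the Results (motifs from mono- to hexa-nucleotide, five or more repeats) can be sketched with a regular expression. This is an illustrative scan only, not the pipeline the authors used; the function name and the example sequence are hypothetical.

```python
import re

# A motif of 1-6 bases (shortest first) followed by at least four more copies,
# i.e. five or more tandem repeats in total.
SSR_PATTERN = re.compile(r"([ACGT]{1,6}?)\1{4,}")

def find_microsatellites(sequence):
    """Return (start, motif, repeat_count) for each perfect tandem repeat found."""
    hits = []
    for match in SSR_PATTERN.finditer(sequence.upper()):
        motif = match.group(1)
        repeats = len(match.group(0)) // len(motif)
        hits.append((match.start(), motif, repeats))
    return hits

# Hypothetical sequence containing one (AC)5 repeat starting at index 3.
example_hits = find_microsatellites("GGT" + "AC" * 5 + "TT")
```

Only perfect repeats are reported here; interrupted or compound microsatellites would need a more tolerant matcher.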
SNPServer: a real-time SNP discovery tool.
Savage, David; Batley, Jacqueline; Erwin, Tim; Logan, Erica; Love, Christopher G; Lim, Geraldine A C; Mongin, Emmanuel; Barker, Gary; Spangenberg, German C; Edwards, David
2005-07-01
SNPServer is a real-time flexible tool for the discovery of SNPs (single nucleotide polymorphisms) within DNA sequence data. The program uses BLAST, to identify related sequences, and CAP3, to cluster and align these sequences. The alignments are parsed to the SNP discovery software autoSNP, a program that detects SNPs and insertion/deletion polymorphisms (indels). Alternatively, lists of related sequences or pre-assembled sequences may be entered for SNP discovery. SNPServer and autoSNP use redundancy to differentiate between candidate SNPs and sequence errors. For each candidate SNP, two measures of confidence are calculated, the redundancy of the polymorphism at a SNP locus and the co-segregation of the candidate SNP with other SNPs in the alignment. SNPServer is available at http://hornbill.cspp.latrobe.edu.au/snpdiscovery.html.
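The redundancy idea described above (a variant seen in several aligned sequences is more likely a real SNP than a sequencing error) can be sketched as a column scan over an alignment. This is a simplified illustration under stated assumptions, not autoSNP's actual algorithm: the function name, the `min_minor_count` threshold and the toy alignment are all hypothetical, and the co-segregation score is not modelled.

```python
from collections import Counter

def candidate_snps(aligned_seqs, min_minor_count=2):
    """Scan equal-length aligned sequences and report redundancy-supported SNPs.

    A column is reported only when its second most common base occurs in at
    least `min_minor_count` sequences; singletons are treated as likely errors.
    """
    snps = []
    for pos, column in enumerate(zip(*aligned_seqs)):
        # Ignore gaps and ambiguity codes when counting bases.
        counts = Counter(base for base in column if base in "ACGT")
        if len(counts) < 2:
            continue
        (major, _), (minor, minor_count) = counts.most_common(2)
        if minor_count >= min_minor_count:
            snps.append((pos, major, minor, minor_count))
    return snps

# Toy alignment: column 1 has T in two reads (kept), column 4 has a lone C (dropped).
toy_alignment = ["ACGTA", "ACGTA", "ACGTA", "ATGTA", "ATGTC"]
toy_snps = candidate_snps(toy_alignment)
```

Raising `min_minor_count` trades sensitivity for fewer false positives, which is the same trade-off a redundancy score exposes.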
Villard, Pierre; Malausa, Thibaut
2013-07-01
SP-Designer is an open-source program providing a user-friendly tool for the design of specific PCR primer pairs from a DNA sequence alignment containing sequences from various taxa. SP-Designer selects PCR primer pairs for the amplification of DNA from a target species on the basis of several criteria: (i) primer specificity, as assessed by interspecific sequence polymorphism in the annealing regions, (ii) the biochemical characteristics of the primers and (iii) the intended PCR conditions. SP-Designer generates tables, detailing the primer pair and PCR characteristics, and a FASTA file locating the primer sequences in the original sequence alignment. SP-Designer is Windows-compatible and freely available from http://www2.sophia.inra.fr/urih/sophia_mart/sp_designer/info_sp_designer.php. © 2013 John Wiley & Sons Ltd.
Global Genomic Diversity of Oryza sativa Varieties Revealed by Comparative Physical Mapping
Wang, Xiaoming; Kudrna, David A.; Pan, Yonglong; Wang, Hao; Liu, Lin; Lin, Haiyan; Zhang, Jianwei; Song, Xiang; Goicoechea, Jose Luis; Wing, Rod A.; Zhang, Qifa; Luo, Meizhong
2014-01-01
Bacterial artificial chromosome (BAC) physical maps embedding a large number of BAC end sequences (BESs) were generated for Oryza sativa ssp. indica varieties Minghui 63 (MH63) and Zhenshan 97 (ZS97) and were compared with the genome sequences of O. sativa ssp. japonica cv. Nipponbare and O. sativa ssp. indica cv. 93-11. The comparisons exhibited substantial diversities in terms of large structural variations and small substitutions and indels. Genome-wide BAC-sized and contig-sized structural variations were detected, and the shared variations were analyzed. In the expansion regions of the Nipponbare reference sequence, in comparison to the MH63 and ZS97 physical maps, as well as to the previously constructed 93-11 physical map, the amounts and types of the repeat contents, and the outputs of gene ontology analysis, were significantly different from those of the whole genome. Using the physical maps of four wild Oryza species from OMAP (http://www.omap.org) as a control, we detected many conserved and divergent regions related to the evolution process of O. sativa. Between the BESs of MH63 and ZS97 and the two reference sequences, a total of 1532 polymorphic simple sequence repeats (SSRs), 71,383 SNPs, 1767 multiple nucleotide polymorphisms, 6340 insertions, and 9137 deletions were identified. This study provides independent whole-genome resources for intra- and intersubspecies comparisons and functional genomics studies in O. sativa. Both the comparative physical maps and the GBrowse, which integrated the QTL and molecular markers from GRAMENE (http://www.gramene.org) with our physical maps and analysis results, are open to the public through our Web site (http://gresource.hzau.edu.cn/resource/resource.html). PMID:24424778
Variath, Murali Tottekkad; Joshi, Gopal; Bali, Sapinder; Agarwal, Manu; Kumar, Amar; Jagannath, Arun; Goel, Shailendra
2015-01-01
Background Safflower (Carthamus tinctorius L.), an Asteraceae member, yields high quality edible oil rich in unsaturated fatty acids and is resilient to dry conditions. The crop holds tremendous potential for improvement through concerted molecular breeding programs due to the availability of significant genetic and phenotypic diversity. Genomic resources that could facilitate such breeding programs remain largely underdeveloped in the crop. The present study was initiated to develop a large set of novel microsatellite markers for safflower using next generation sequencing. Principal Findings Low throughput genome sequencing of safflower was performed using Illumina paired end technology providing ~3.5X coverage of the genome. Analysis of sequencing data allowed identification of 23,067 regions harboring perfect microsatellite loci. The safflower genome was found to be rich in dinucleotide repeats followed by tri-, tetra-, penta- and hexa-nucleotides. Primer pairs were designed for 5,716 novel microsatellite sequences with repeat length ≥ 20 bases and optimal flanking regions. A subset of 325 microsatellite loci was tested for amplification, of which 294 loci produced robust amplification. The validated primers were used for assessment of 23 safflower accessions belonging to diverse agro-climatic zones of the world leading to identification of 93 polymorphic primers (31.6%). The numbers of observed alleles at each locus ranged from two to four and mean polymorphism information content was found to be 0.3075. The polymorphic primers were tested for cross-species transferability on nine wild relatives of cultivated safflower. All primers except one showed amplification in at least two wild species while 25 primers amplified across all the nine species. The UPGMA dendrogram clustered C. tinctorius accessions and wild species separately into two major groups. The proposed progenitor species of safflower, C. oxyacantha and C. palaestinus were genetically closer to cultivated safflower and formed a distinct cluster. The cluster analysis also distinguished diploid and tetraploid wild species of safflower. Conclusion Next generation sequencing of safflower genome generated a large set of microsatellite markers. The novel markers developed in this study will add to the existing repertoire of markers and can be used for diversity analysis, synteny studies, construction of linkage maps and marker-assisted selection. PMID:26287743
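The polymorphism information content (PIC) reported in the safflower abstract above can be illustrated with a short sketch. The abstract does not say which PIC definition was used; the code below assumes the widely used Botstein et al. form, PIC = 1 − Σ p_i² − Σ_{i<j} 2 p_i² p_j², and the function name and example allele frequencies are hypothetical. It also rechecks the reported share of polymorphic primers (93 of 294).

```python
from itertools import combinations


def polymorphism_information_content(allele_freqs):
    """PIC for one locus, assuming the Botstein et al. definition.

    allele_freqs: allele frequencies at the locus; they must sum to 1.
    """
    if abs(sum(allele_freqs) - 1.0) > 1e-6:
        raise ValueError("allele frequencies must sum to 1")
    # Sum of squared frequencies (expected homozygosity).
    homozygosity = sum(p * p for p in allele_freqs)
    # Correction term summed over every unordered pair of distinct alleles.
    pair_term = sum(2 * (p * p) * (q * q) for p, q in combinations(allele_freqs, 2))
    return 1.0 - homozygosity - pair_term


# Hypothetical locus with two equally frequent alleles.
example_pic = polymorphism_information_content([0.5, 0.5])

# Share of validated primers that were polymorphic, from the abstract's counts.
polymorphic_percent = round(93 / 294 * 100, 1)
```

With two to four alleles per locus, as reported, this definition is bounded well below 1, so a mean near 0.3 is plausible under the assumed formula.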
Yamazaki, Tomohiro; Matsuo, Junji; Takahashi, Satoshi; Kumagai, Shouta; Shimoda, Tomoko; Abe, Kiyotaka; Minami, Kunihiro; Yamaguchi, Hiroyuki
2015-12-01
Although sexually transmitted disease due to Chlamydia trachomatis occurs similarly in both men and women, the female urogenital tract differs from that of males anatomically and physiologically, possibly leading to specific polymorphisms of the bacterial surface molecules. In the present study, we therefore characterized polymorphic features in a high-definition phylogenetic marker, polymorphic outer membrane protein (Pmp) F, of C. trachomatis strains isolated from male urogenital tracts in Japan (Category: Japan-males, n = 12), and compared them with those of strains isolated from female cervical ducts in Japan (Category: Japan-females, n = 11), female cervical ducts in another country (Category: Ref-females, n = 12) or homosexual male rectums in another country (Category: Ref-males, n = 7), using general bioinformatics analysis tools with MAFFT software. Phylogenetic reconstruction of the PmpF amino acid sequences showed three distinct clusters (four when an outgroup was included) and revealed that the Japan-males strains were confined to clusters 1 and 2. Meanwhile, the phylogenetic distance values of the PmpF passenger domain without the hinge region, but not of the full-length sequence, showed that the Japan-males strains were more stable and displayed less diversity than the other categories, a finding supported by the sequence conservation features. Thus, the PmpF passenger domain is a useful phylogenetic marker, and the phylogenetic features indicate that C. trachomatis strains isolated from male urogenital tracts in Japan may be unique, suggesting an adaptation to selective pressure, such as the presence or absence of microbial flora, and possibly a connection to sexual differentiation. Copyright © 2015 Japanese Society of Chemotherapy and The Japanese Association for Infectious Diseases. Published by Elsevier Ltd. All rights reserved.
Genomic profiling of plastid DNA variation in the Mediterranean olive tree
2011-01-01
Background Characterisation of plastid genome (or cpDNA) polymorphisms is commonly used for phylogeographic, population genetic and forensic analyses in plants, but detecting cpDNA variation is sometimes challenging, limiting the applications of such an approach. In the present study, we screened cpDNA polymorphism in the olive tree (Olea europaea L.) by sequencing the complete plastid genome of trees with a distinct cpDNA lineage. Our objective was to develop new markers for a rapid genomic profiling (by Multiplex PCRs) of cpDNA haplotypes in the Mediterranean olive tree. Results Eight complete cpDNA genomes of Olea were sequenced de novo. The nucleotide divergence between olive cpDNA lineages was low, not exceeding 0.07%. Based on these sequences, markers were developed for studying two single nucleotide substitutions and length polymorphism of 62 regions (with variable microsatellite motifs or other indels). They were then used to genotype the cpDNA variation in cultivated and wild Mediterranean olive trees (315 individuals). Forty polymorphic loci were detected in this sample, allowing the distinction of 22 haplotypes belonging to the three Mediterranean cpDNA lineages known as E1, E2 and E3. The discriminating power of cpDNA variation was particularly low for the cultivated olive tree, with one predominating haplotype, but more diversity was detected in wild populations. Conclusions We propose a method for a rapid characterisation of the Mediterranean olive germplasm. The low variation in the cultivated olive tree indicated that the utility of cpDNA variation for forensic analyses is limited to rare haplotypes. In contrast, the high cpDNA variation in wild populations demonstrated that our markers may be useful for phylogeographic and population genetic studies in O. europaea. PMID:21569271
Zhou, Nannan; Hernandez, Dennis; Ueland, Joseph; Yang, Xiaoyan; Yu, Fei; Sims, Karen; Yin, Philip D; McPhee, Fiona
2016-01-15
Daclatasvir is an NS5A inhibitor approved for treatment of infection due to hepatitis C virus (HCV) genotypes (GTs) 1-4. To support daclatasvir use in HCV genotype 4 infection, we examined a diverse genotype 4-infected population for HCV genotype 4 subtype prevalence, NS5A polymorphisms at residues associated with daclatasvir resistance (positions 28, 30, 31, or 93), and their effects on daclatasvir activity in vitro and clinically. We performed phylogenetic analysis of genotype 4 NS5A sequences from 186 clinical trial patients and 43 sequences from the European HCV database, and susceptibility analyses of NS5A polymorphisms and patient-derived NS5A sequences by using genotype 4 NS5A hybrid genotype 2a replicons. The clinical trial patients represented 14 genotype 4 subtypes; most prevalent were genotype 4a (55%) and genotype 4d (27%). Daclatasvir 50% effective concentrations for 10 patient-derived NS5A sequences representing diverse phylogenetic clusters were ≤0.080 nM. Most baseline sequences had ≥1 NS5A polymorphism at residues associated with daclatasvir resistance; however, only 3 patients (1.6%) had polymorphisms conferring ≥1000-fold daclatasvir resistance in vitro. Among 46 patients enrolled in daclatasvir trials, all 20 with baseline resistance polymorphisms achieved a sustained virologic response. Circulating genotype 4 subtypes are genetically diverse. Polymorphisms conferring high-level daclatasvir resistance in vitro are uncommon before therapy, and clinical data suggest that genotype 4 subtype and baseline polymorphisms have minimal impact on responses to daclatasvir-containing regimens. © The Author 2015. Published by Oxford University Press for the Infectious Diseases Society of America.
First complete genome sequence of infectious laryngotracheitis virus
2011-01-01
Background Infectious laryngotracheitis virus (ILTV) is an alphaherpesvirus that causes acute respiratory disease in chickens worldwide. To date, only one complete genomic sequence of ILTV has been reported. This sequence was generated by concatenating partial sequences from six different ILTV strains. Thus, the full genomic sequence of a single (individual) strain of ILTV has not been determined previously. This study aimed to use high throughput sequencing technology to determine the complete genomic sequence of a live attenuated vaccine strain of ILTV. Results The complete genomic sequence of the Serva vaccine strain of ILTV was determined, annotated and compared to the concatenated ILTV reference sequence. The genome size of the Serva strain was 152,628 bp, with a G + C content of 48%. A total of 80 predicted open reading frames were identified. The Serva strain had 96.5% DNA sequence identity with the concatenated ILTV sequence. Notably, the concatenated ILTV sequence was found to lack four large regions of sequence, including 528 bp and 594 bp of sequence in the UL29 and UL36 genes, respectively, and two copies of a 1,563 bp sequence in the repeat regions. Considerable differences in the size of the predicted translation products of 4 other genes (UL54, UL30, UL37 and UL38) were also identified. More than 530 single-nucleotide polymorphisms (SNPs) were identified. Most SNPs were located within three genomic regions, corresponding to sequence from the SA-2 ILTV vaccine strain in the concatenated ILTV sequence. Conclusions This is the first complete genomic sequence of an individual ILTV strain. This sequence will facilitate future comparative genomic studies of ILTV by providing an appropriate reference sequence for the sequence analysis of other ILTV strains. PMID:21501528
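The two summary statistics quoted in the ILTV abstract above (G + C content and percent sequence identity) can be sketched as follows. This is only an illustration under stated assumptions: the function names are hypothetical, the toy sequences are not ILTV data, and identity is computed over pre-aligned sequences with gap columns counted as mismatches, which may differ from the authors' procedure.

```python
def gc_content(sequence):
    """Fraction of G and C among the unambiguous bases (A, C, G, T)."""
    seq = sequence.upper()
    called = sum(seq.count(base) for base in "ACGT")
    if called == 0:
        raise ValueError("sequence contains no A/C/G/T bases")
    return (seq.count("G") + seq.count("C")) / called


def percent_identity(aligned_a, aligned_b):
    """Percent of alignment columns with identical bases.

    Assumes both strings come from the same pairwise alignment; a column
    containing a gap ('-') is counted as a mismatch (an assumption).
    """
    if len(aligned_a) != len(aligned_b) or not aligned_a:
        raise ValueError("aligned sequences must be non-empty and equal in length")
    matches = sum(
        1 for a, b in zip(aligned_a.upper(), aligned_b.upper()) if a == b and a != "-"
    )
    return 100.0 * matches / len(aligned_a)


# Toy inputs, not taken from the study.
toy_gc = gc_content("GGCCAATT")
toy_identity = percent_identity("ACGT", "ACGA")
```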
PeanutDB: an integrated bioinformatics web portal for Arachis hypogaea transcriptomics
2012-01-01
Background The peanut (Arachis hypogaea) is an important crop cultivated worldwide for oil production and food sources. Its complex genetic architecture (e.g., the large and tetraploid genome possibly due to unique cross of wild diploid relatives and subsequent chromosome duplication: 2n = 4x = 40, AABB, 2800 Mb) presents a major challenge for its genome sequencing and makes it a less-studied crop. Without a doubt, transcriptome sequencing is the most effective way to harness the genome structure and gene expression dynamics of this non-model species that has a limited genomic resource. Description With the development of next generation sequencing technologies such as 454 pyro-sequencing and Illumina sequencing by synthesis, the transcriptomics data of peanut is rapidly accumulated in both the public databases and private sectors. Integrating 187,636 Sanger reads (103,685,419 bases), 1,165,168 Roche 454 reads (333,862,593 bases) and 57,135,995 Illumina reads (4,073,740,115 bases), we generated the first release of our peanut transcriptome assembly that contains 32,619 contigs. We provided EC, KEGG and GO functional annotations to these contigs and detected SSRs, SNPs and other genetic polymorphisms for each contig. Based on both open-source and our in-house tools, PeanutDB presents many seamlessly integrated web interfaces that allow users to search, filter, navigate and visualize easily the whole transcript assembly, its annotations and detected polymorphisms and simple sequence repeats. For each contig, sequence alignment is presented in both bird’s-eye view and nucleotide level resolution, with colorfully highlighted regions of mismatches, indels and repeats that facilitate close examination of assembly quality, genetic polymorphisms, sequence repeats and/or sequencing errors. 
Conclusion As a public genomic database that integrates peanut transcriptome data from different sources, PeanutDB (http://bioinfolab.muohio.edu/txid3818v1) provides the Peanut research community with an easy-to-use web portal that will definitely facilitate genomics research and molecular breeding in this less-studied crop. PMID:22712730
Rogan, P K; Schneider, T D
1995-01-01
Predicting the effects of nucleotide substitutions in human splice sites has been based on analysis of consensus sequences. We used a graphic representation of sequence conservation and base frequency, the sequence logo, to demonstrate that a change in a splice acceptor of hMSH2 (a gene associated with familial nonpolyposis colon cancer) probably does not reduce splicing efficiency. This confirms a population genetic study that suggested that this substitution is a genetic polymorphism. The information theory-based sequence logo is quantitative and more sensitive than the corresponding splice acceptor consensus sequence for detection of true mutations. Information analysis may potentially be used to distinguish polymorphisms from mutations in other types of transcriptional, translational, or protein-coding motifs.
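The information measure behind a sequence logo, as described in the abstract above, can be sketched per alignment column as 2 bits minus the Shannon entropy of the observed base frequencies. The sketch below is a simplified illustration: it omits the small-sample correction used in the full information-theory method, and the function names and aligned sites are hypothetical, not the hMSH2 acceptor data.

```python
import math
from collections import Counter


def column_information_bits(column):
    """Information content (bits) of one alignment column of DNA bases.

    Computed as log2(4) minus the Shannon entropy of the base frequencies;
    no small-sample correction is applied (a simplifying assumption).
    """
    counts = Counter(column.upper())
    n = sum(counts[base] for base in "ACGT")
    if n == 0:
        raise ValueError("column contains no A/C/G/T bases")
    entropy = 0.0
    for base in "ACGT":
        freq = counts[base] / n
        if freq > 0:
            entropy -= freq * math.log2(freq)
    return 2.0 - entropy


def logo_information(aligned_sites):
    """Per-position information for a list of equal-length aligned sites."""
    return [column_information_bits("".join(col)) for col in zip(*aligned_sites)]


# Hypothetical sites: position 1 is fully conserved, position 2 is uniform.
profile = logo_information(["AG", "AC", "AT", "AA"])
```

A fully conserved position carries 2 bits and a uniformly variable one carries 0 bits, which is why a substitution at a low-information position is less likely to disrupt the site than one at a highly conserved position.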
Sulaiman, Irshad M; Torres, Patricia; Simpson, Steven; Kerdahi, Khalil; Ortega, Ynes
2013-04-01
We have described the development of a 2-step nested PCR protocol based on the characterization of the 70-kDa heat shock protein (HSP70) gene for rapid detection of the human-pathogenic Cyclospora cayetanensis parasite. We tested and validated these newly designed primer sets by PCR amplification followed by nucleotide sequencing of PCR-amplified HSP70 fragments belonging to 16 human C. cayetanensis isolates from 3 different endemic regions that include Nepal, Mexico, and Peru. No genetic polymorphism was observed among the isolates at the characterized regions of the HSP70 locus. This newly developed HSP70 gene-based nested PCR protocol provides another useful genetic marker for the rapid detection of C. cayetanensis in the future.
Gorkhali, Neena Amatya; Jiang, Lin; Shrestha, Bhola Shankar; He, Xiao-Hong; Junzhao, Qian; Han, Jian-Lin; Ma, Yue-Hui
2016-07-01
Heteroplasmy due to length polymorphism of tandem repeats in mtDNA within individuals has rarely been studied in domestic animals. In the present study, we identified intra-individual length variation in the control region of mtDNA in Nepalese sheep by molecular cloning and sequencing techniques. We observed one to four tandem repeats of a 75-bp nucleotide sequence in the mtDNA control region in 45% of the Nepalese sheep sampled, in contrast to the Chinese sheep, indicating that the heteroplasmy is specific to Nepalese sheep. The high rate of heteroplasmy in Nepalese sheep could result from mtDNA mutation and independent segregation at the intra-individual level, or from strand slippage and mispairing during replication.
Doritchamou, Justin; Sabbagh, Audrey; Jespersen, Jakob S.; Renard, Emmanuelle; Salanti, Ali; Nielsen, Morten A.; Deloron, Philippe; Tuikue Ndam, Nicaise
2015-01-01
The VAR2CSA protein of Plasmodium falciparum is transported to and expressed on the infected erythrocyte surface where it plays a key role in placental malaria (PM). It is the current leading candidate for a vaccine to prevent PM. However, the antigenic polymorphism integral to VAR2CSA poses a challenge for vaccine development. Based on detailed analysis of polymorphisms in the sequence of its ligand-binding N-terminal region, currently the main focus for vaccine development, we assessed var2csa from parasite isolates infecting pregnant women. The results reveal for the first time the presence of a major dimorphic region in the functionally critical N-terminal ID1 domain. Parasite isolates expressing VAR2CSA with particular motifs present within this domain are associated with gravidity- and parasite density-related effects. These observations are of particular interest in guiding efforts with respect to optimization of the VAR2CSA-based vaccines currently under development. PMID:26393516
Galindo-González, Leonardo; Mhiri, Corinne; Grandbastien, Marie-Angèle; Deyholos, Michael K
2016-12-07
Initial characterization of the flax genome showed that Ty1-copia retrotransposons are abundant, with several members being recently inserted, and in close association with genes. Recent insertions indicate a potential for ongoing transpositional activity that can create genomic diversity among accessions, cultivars or varieties. The polymorphisms generated constitute a good source of molecular markers that may be associated with phenotype if the insertions alter gene activity. Flax, where accessions are bred mainly for seed nutritional properties or for fibers, constitutes a good model for studying the relationship of transpositional activity with diversification and breeding. In this study, we estimated copy number and used a type of transposon display known as Sequence-Specific Amplification Polymorphisms (SSAPs), to characterize six families of Ty1-copia elements across 14 flax accessions. Polymorphic insertion sites were sequenced to find insertions that could potentially alter gene expression, and a preliminary test was performed with selected genes bearing transposable element (TE) insertions. Quantification of six families of Ty1-copia elements indicated different abundances among TE families and between flax accessions, which suggested diverse transpositional histories. SSAPs showed a high level of polymorphism in most of the evaluated retrotransposon families, with a trend towards higher levels of polymorphism in low-copy number families. Ty1-copia insertion polymorphisms among cultivars allowed a general distinction between oil and fiber types, and between spring and winter types, demonstrating their utility in diversity studies. Characterization of polymorphic insertions revealed an overwhelming association with genes, with insertions disrupting exons, introns or within 1 kb of coding regions. 
A preliminary test on the potential transcriptional disruption by TEs of four selected genes evaluated in three different tissues, showed one case of significant impact of the insertion on gene expression. We demonstrated that specific Ty1-copia families have been active since breeding commenced in flax. The retrotransposon-derived polymorphism can be used to separate flax types, and the close association of many insertions with genes defines a good source of potential mutations that could be associated with phenotypic changes, resulting in diversification processes.
Bigi, María Mercedes; Lopez, Beatriz; Blanco, Federico Carlos; Sasiain, María Del Carmen; De la Barrera, Silvia; Marti, Marcelo A; Sosa, Ezequiel Jorge; Fernández Do Porto, Darío Augusto; Ritacco, Viviana; Bigi, Fabiana; Soria, Marcelo Abel
2017-03-01
Globally, about 4.5% of new tuberculosis (TB) cases are multi-drug-resistant (MDR), i.e. resistant to the two most powerful first-line anti-TB drugs. Indeed, 480,000 people developed MDR-TB in 2015 and 190,000 people died because of MDR-TB. The MDR Mycobacterium tuberculosis M family, which belongs to the Haarlem lineage, is highly prosperous in Argentina and capable of building up further drug resistance without impairing its ability to spread. In this study, we sequenced the whole genomes of a highly prosperous M-family strain (Mp) and its contemporary variant, strain 410, which produced only one recorded tuberculosis case in the last two decades. Previous reports have demonstrated that Mp induced dysfunctional CD8+ cytotoxic T cell activity, suggesting that this strain has the ability to evade the immune response against M. tuberculosis. Comparative analysis of Mp and 410 genomes revealed non-synonymous polymorphisms in eleven genes and five intergenic regions with polymorphisms between both strains. Some of these genes and promoter regions are involved in the metabolism of cell wall components and others in drug resistance; in addition, a SNP in Rv1861, a gene encoding a putative transglycosylase, produces a truncated protein in Mp. The mutation in Rv3787c, a putative S-adenosyl-l-methionine-dependent methyltransferase, is conserved in all of the other prosperous M strains analysed here and absent in non-prosperous M strains. Remarkably, three polymorphic promoter regions displayed differential transcriptional activity between Mp and 410. We speculate that the observed mutations/polymorphisms are associated with the reported higher capacity of Mp for modulating the host's immune response. Copyright © 2017 Elsevier Ltd. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
David A. Micklos
2006-10-30
This project achieved its goal of implementing a nationwide training program to introduce high school biology teachers to the key uses and societal implications of human DNA polymorphisms. The 2.5-day workshop introduced high school biology faculty to a laboratory-based unit on human DNA polymorphisms – which provides a uniquely personal perspective on the science and Ethical, Legal and Social Implications (ELSI) of the Human Genome Project. As proposed, 12 workshops were conducted at venues across the United States. The workshops were attended by 256 high school faculty, exceeding proposed attendance of 240 by 7%. Each workshop mixed theoretical, laboratory, and computer work with practical and ethical implications. Program participants learned simplified lab techniques for amplifying three types of chromosomal polymorphisms: an Alu insertion (PV92), a VNTR (pMCT118/D1S80), and single nucleotide polymorphisms (SNPs) in the mitochondrial control region. These polymorphisms illustrate the use of DNA variations in disease diagnosis, forensic biology, and identity testing - and provide a starting point for discussing the uses and potential abuses of genetic technology. Participants also learned how to use their Alu and mitochondrial data as an entrée to human population genetics and evolution. Our work to simplify lab techniques for amplifying human DNA polymorphisms in educational settings culminated with the release in 1998 of three Advanced Technology (AT) PCR kits by Carolina Biological Supply Company, the nation’s oldest educational science supplier. The kits use a simple 30-minute method to isolate template DNA from hair sheaths or buccal cells and streamlined PCR chemistry based on Pharmacia Ready-To-Go Beads, which incorporate Taq polymerase, deoxynucleotide triphosphates, and buffer in a freeze-dried pellet. These kits have greatly simplified teacher implementation of human PCR labs, and their use is growing at a rapid pace. 
Sales of human polymorphism kits by Carolina Biological rose from 700 units in 1999 to 1,132 in 2000 – a 62% increase. Competing kits using the Alu system, and based substantially on our earlier work, are also marketed by Biorad and Edvotek. In parallel with the lab experiments, we developed a suite of database/statistical applications and easy-to-use interfaces that allow students to use their own DNA data to explore human population genetics and to test theories of human evolution. Database searches and statistical analyses are launched from a centralized workspace. Workshop participants were introduced to these and other resources available at the DNALC WWW site (http://vector.cshl.org/bioserver/): 1) Allele Server tests Hardy-Weinberg equilibrium and statistically compares PV92 data from world populations. 2) Sequence Server uses DNA sequence data to search Genbank using BLASTN, compare sequences using CLUSTALW, and create phylogenetic trees using PHYLIP. 3) Simulation Server uses a Monte Carlo generator to model the long-term effects of drift, selection, and population bottlenecks. By targeting motivated and innovative biology faculty, we believe that this project offered a cost-effective means to bring high school biology education up-to-the-minute with genomic biology. The workshop reached a target audience of highly professional faculty who have already implemented hands-on labs in molecular genetics and many of whom offer laboratory electives in biotechnology. Many attend professional meetings, develop curriculum, collaborate with scientists, teach faculty workshops, and manage equipment-sharing programs. These individuals are life-long learners, anxious for deeper insight and additional training to further extend their leadership. This contention was supported by data from a mail survey, conducted in February-March 2000 and 2001, of 256 faculty who participated in workshops conducted during the current term of DOE support. 
Seventy percent of participants responded, providing direct reports on how their teaching behavior had changed since taking the DOE workshop. About nine of ten respondents said they had provided new classroom materials and first-hand accounts of DNA typing, sequencing, or PCR. Three-fourths had introduced new units on human molecular genetics. Most strikingly, half had students use PCR to amplify their own insertion polymorphisms (PV92), and better than one-fourth amplified a VNTR polymorphism and the mitochondrial control region. One in five had mitochondrial DNA sequenced by the DNALC Sequencing Service. A majority (58%) used online materials at the DNALC WWW site, and 28% analyzed student polymorphism data with Bioservers at the DNALC site. A majority (58%) assisted other faculty with student labs on polymorphisms, reaching an additional 786 teachers.
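The Hardy-Weinberg test that the report above attributes to the Allele Server can be sketched as a chi-square goodness-of-fit test on genotype counts for a two-allele locus such as the PV92 Alu insertion (+/+, +/−, −/−). This is a minimal illustration, not the DNALC implementation; the function name and the class counts are hypothetical.

```python
def hardy_weinberg_chi_square(n_plus_plus, n_plus_minus, n_minus_minus):
    """Chi-square statistic (1 degree of freedom) for Hardy-Weinberg proportions.

    Arguments are observed counts of the three genotypes at a biallelic locus.
    """
    n = n_plus_plus + n_plus_minus + n_minus_minus
    if n == 0:
        raise ValueError("at least one genotype must be observed")
    # Allele frequency of '+' estimated from the genotype counts.
    p = (2 * n_plus_plus + n_plus_minus) / (2 * n)
    q = 1.0 - p
    expected = (p * p * n, 2 * p * q * n, q * q * n)
    observed = (n_plus_plus, n_plus_minus, n_minus_minus)
    # Skip classes with zero expectation (fixed allele) to avoid dividing by zero.
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected) if e > 0)


# Hypothetical class data: 30 +/+, 40 +/-, 30 -/- students.
class_statistic = hardy_weinberg_chi_square(30, 40, 30)
# With 1 degree of freedom, a statistic above 3.841 rejects equilibrium at the 5% level.
deviates_at_5_percent = class_statistic > 3.841
```

A classroom sample is small, so the chi-square approximation is rough; an exact test would be a reasonable alternative for such data.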
Shakhssalim, Nasser; Houshmand, Massoud; Kamalidehghan, Behnam; Faraji, Abolfazl; Sarhangnejad, Reza; Dadgar, Sepideh; Mobaraki, Maryam; Rosli, Rozita; Sanati, Mohammad Hossein
2013-12-05
Bladder cancer is a relatively common and potentially life-threatening neoplasm that ranks ninth in terms of worldwide cancer incidence. The aim of this study was to determine deletions and sequence variations in the mitochondrial displacement loop (D-loop) region from the blood specimens and tumoral tissues of patients with bladder cancer, compared to adjacent non-tumoral tissues. The DNA from blood, tumoral tissues and adjacent non-tumoral tissues of twenty-six patients with bladder cancer and DNA from blood of 504 healthy controls from different ethnicities were investigated to determine sequence variation in the mitochondrial D-loop region using multiplex polymerase chain reaction (PCR), DNA sequencing and southern blotting analysis. From a total of 110 variations, 48 were reported as new mutations. No deletions were detected in tumoral tissues, adjacent non-tumoral tissues and blood samples from patients. Although the polymorphisms at loci 16189, 16261 and 16311 were not significantly correlated with bladder cancer, the C16069T variation was significantly present in patient samples compared to control samples (p < 0.05). Interestingly, there was no significant difference (p > 0.05) of C variations, including C7TC6, C8TC6, C9TC6 and C10TC6, in D310 mitochondrial DNA between patients and control samples. Our study suggests that 16069 mitochondrial DNA D-Loop mutations may play a significant role in the etiology of bladder cancer and facilitate the definition of carcinogenesis-related mutations in human cancer.
Espin-Garcia, Osvaldo; Craiu, Radu V; Bull, Shelley B
2018-02-01
We evaluate two-phase designs to follow up findings from genome-wide association studies (GWAS) when the cost of regional sequencing in the entire cohort is prohibitive. We develop novel expectation-maximization-based inference under a semiparametric maximum likelihood formulation tailored for post-GWAS inference. A GWAS-SNP (where SNP is single nucleotide polymorphism) serves as a surrogate covariate in inferring association between a sequence variant and a normally distributed quantitative trait (QT). We assess test validity and quantify efficiency and power of joint QT-SNP-dependent sampling and analysis under alternative sample allocations by simulations. Joint allocation balanced on SNP genotype and extreme-QT strata yields significant power improvements compared to marginal QT- or SNP-based allocations. We illustrate the proposed method and evaluate the sensitivity of sample allocation to sampling variation using data from a sequencing study of systolic blood pressure. © 2017 The Authors. Genetic Epidemiology Published by Wiley Periodicals, Inc.
Deep whole-genome sequencing of 90 Han Chinese genomes.
Lan, Tianming; Lin, Haoxiang; Zhu, Wenjuan; Laurent, Tellier Christian Asker Melchior; Yang, Mengcheng; Liu, Xin; Wang, Jun; Wang, Jian; Yang, Huanming; Xu, Xun; Guo, Xiaosen
2017-09-01
Next-generation sequencing provides a high-resolution insight into human genetic information. However, the focus of previous studies has primarily been on low-coverage data due to the high cost of sequencing. Although the 1000 Genomes Project and the Haplotype Reference Consortium have both provided powerful reference panels for imputation, low-frequency and novel variants remain difficult to discover and call with accuracy on the basis of low-coverage data. Deep sequencing provides an optimal solution for the problem of these low-frequency and novel variants. Although whole-exome sequencing is also a viable choice for exome regions, it cannot account for noncoding regions, sometimes resulting in the absence of important, causal variants. For Han Chinese populations, the majority of variants have been discovered based upon low-coverage data from the 1000 Genomes Project. However, high-coverage, whole-genome sequencing data are limited for any population, and a large amount of low-frequency, population-specific variants remain uncharacterized. We have performed whole-genome sequencing at a high depth (∼×80) of 90 unrelated individuals of Chinese ancestry, collected from the 1000 Genomes Project samples, including 45 Northern Han Chinese and 45 Southern Han Chinese samples. Eighty-three of these 90 have been sequenced by the 1000 Genomes Project. We have identified 12 568 804 single nucleotide polymorphisms, 2 074 210 short InDels, and 26 142 structural variations from these 90 samples. Compared to the Han Chinese data from the 1000 Genomes Project, we have found 7 000 629 novel variants with low frequency (defined as minor allele frequency < 5%), including 5 813 503 single nucleotide polymorphisms, 1 169 199 InDels, and 17 927 structural variants. Using deep sequencing data, we have built a greatly expanded spectrum of genetic variation for the Han Chinese genome. 
Compared to the 1000 Genomes Project, these Han Chinese deep sequencing data enhance the characterization of a large number of low-frequency, novel variants. This will be a valuable resource for promoting Chinese genetics research and medical development. Additionally, it will provide a valuable supplement to the 1000 Genomes Project, as well as to other human genome projects. © The Authors 2017. Published by Oxford University Press.
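The low-frequency criterion quoted in the abstract above (minor allele frequency < 5%) and the reported breakdown of novel variants can be checked with a small sketch. The function names and the example allele counts are hypothetical; only the 5% threshold, the cohort size of 90 diploid individuals (180 chromosomes) and the three variant totals come from the abstract.

```python
def minor_allele_frequency(alt_allele_count, total_alleles):
    """Frequency of the rarer allele at a biallelic site."""
    if total_alleles <= 0 or not 0 <= alt_allele_count <= total_alleles:
        raise ValueError("counts must satisfy 0 <= alt <= total and total > 0")
    alt_freq = alt_allele_count / total_alleles
    return min(alt_freq, 1.0 - alt_freq)


def is_low_frequency(alt_allele_count, total_alleles, threshold=0.05):
    """True when the minor allele frequency is below the threshold (5% by default)."""
    return minor_allele_frequency(alt_allele_count, total_alleles) < threshold


# 90 diploid individuals give 180 sampled chromosomes.
chromosomes = 2 * 90
rare_example = is_low_frequency(2, chromosomes)      # MAF about 1.1%
common_example = is_low_frequency(18, chromosomes)   # MAF 10%

# The reported novel SNPs, InDels and structural variants sum to the stated total.
novel_total = 5_813_503 + 1_169_199 + 17_927
```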
Cowan, Graeme J. M.; Creasey, Alison M.; Dhanasarnsombut, Kelwalin; Thomas, Alan W.; Remarque, Edmond J.; Cavanagh, David R.
2011-01-01
Polymorphic parasite antigens are known targets of protective immunity to malaria, but this antigenic variation poses challenges to vaccine development. A synthetic MSP-1 Block 2 construct, based on all polymorphic variants found in natural Plasmodium falciparum isolates, has been designed, combined with the relatively conserved Block 1 sequence of MSP-1 and expressed in E. coli. The MSP-1 Hybrid antigen has been produced with high yield by fed-batch fermentation and purified without the aid of affinity tags, resulting in a pure and extremely thermostable antigen preparation. MSP-1 hybrid is immunogenic in experimental animals using adjuvants suitable for human use, eliciting antibodies against epitopes from all three Block 2 serotypes. Human serum antibodies from Africans naturally exposed to malaria reacted to the MSP-1 hybrid as strongly as, or better than, the same sera reacted to individual MSP-1 Block 2 antigens, and these antibody responses showed clear associations with reduced incidence of malaria episodes. The MSP-1 hybrid is designed to induce a protective antibody response to the highly polymorphic Block 2 region of MSP-1, enhancing the repertoire of MSP-1 Block 2 antibody responses found among immune and semi-immune individuals in malaria endemic areas. The target population for such a vaccine is young children and vulnerable adults, to accelerate the acquisition of a full range of malaria protective antibodies against this polymorphic parasite antigen. PMID:22073118
Schwelm, Arne; Berney, Cédric; Dixelius, Christina; Bass, David; Neuhauser, Sigrid
2016-01-01
Clubroot disease caused by Plasmodiophora brassicae is one of the most important diseases of cultivated brassicas. P. brassicae occurs in pathotypes which differ in their aggressiveness towards their Brassica host plants. To date no DNA-based method to distinguish these pathotypes has been described. In 2011 a polymorphism within the 28S rDNA of P. brassicae was reported which could potentially allow pathotypes to be distinguished without the need for time-consuming bioassays. However, isolates of P. brassicae from around the world analysed in this study do not show polymorphism in their LSU rDNA sequences. The previously described polymorphism was most likely derived from soil-inhabiting Cercozoa, more specifically Neoheteromita-like glissomonads. Here we correct the LSU rDNA sequence of P. brassicae. By using FISH we demonstrate that our newly generated sequence belongs to the causal agent of clubroot disease. PMID:27750174
Janova, Eva; Matiasovic, Jan; Vahala, Jiri; Vodicka, Roman; Van Dyk, Enette; Horin, Petr
2009-07-01
The major histocompatibility complex genes coding for antigen binding and presenting molecules are the most polymorphic genes in the vertebrate genome. We studied the DRA and DQA gene polymorphism of the family Equidae. In addition to 11 previously reported DRA and 24 DQA alleles, six new DRA sequences and 13 new DQA alleles were identified in the genus Equus. Phylogenetic analysis of both DRA and DQA sequences provided evidence for trans-species polymorphism in the family Equidae. The phylogenetic trees differed from species relationships defined by standard taxonomy of Equidae and from trees based on mitochondrial or neutral gene sequence data. Analysis of selection showed differences between the less variable DRA and more variable DQA genes. DRA alleles were more often shared by more species. The DQA sequences analysed showed strong amongst-species positive selection; the selected amino acid positions mostly corresponded to selected positions in rodent and human DQA genes.
USDA-ARS's Scientific Manuscript database
The PCR-based Escherichia coli O157 (O157) strain typing system, Polymorphic Amplified Typing Sequences (PATS), targets insertions-deletions (Indels) and single nucleotide polymorphisms (SNPs) at the XbaI and AvrII(BlnI) restriction enzyme sites, respectively, besides amplifying four known virulenc...
Wang, Baosheng; Khalili Mahani, Marjan; Ng, Wei Lun; Kusumi, Junko; Phi, Hai Hong; Inomata, Nobuyuki; Wang, Xiao-Ru; Szmidt, Alfred E
2014-01-01
Pinus krempfii Lecomte is a morphologically and ecologically unique pine, endemic to Vietnam. It is regarded as vulnerable species with distribution limited to just two provinces: Khanh Hoa and Lam Dong. Although a few phylogenetic studies have included this species, almost nothing is known about its genetic features. In particular, there are no studies addressing the levels and patterns of genetic variation in natural populations of P. krempfii. In this study, we sampled 57 individuals from six natural populations of P. krempfii and analyzed their sequence variation in ten nuclear gene regions (approximately 9 kb) and 14 mitochondrial (mt) DNA regions (approximately 10 kb). We also analyzed variation at seven chloroplast (cp) microsatellite (SSR) loci. We found very low haplotype and nucleotide diversity at nuclear loci compared with other pine species. Furthermore, all investigated populations were monomorphic across all mitochondrial DNA (mtDNA) regions included in our study, which are polymorphic in other pine species. Population differentiation at nuclear loci was low (5.2%) but significant. However, structure analysis of nuclear loci did not detect genetically differentiated groups of populations. Approximate Bayesian computation (ABC) using nuclear sequence data and mismatch distribution analysis for cpSSR loci suggested recent expansion of the species. The implications of these findings for the management and conservation of P. krempfii genetic resources were discussed. PMID:25360263
Glaser-Schmitt, Amanda; Duchen, Pablo; Parsch, John
2016-01-01
Insertions and deletions (indels) are a major source of genetic variation within species and may result in functional changes to coding or regulatory sequences. In this study we report that an indel polymorphism in the 3’ untranslated region (UTR) of the metallothionein gene MtnA is associated with gene expression variation in natural populations of Drosophila melanogaster. A derived allele of MtnA with a 49-bp deletion in the 3' UTR segregates at high frequency in populations outside of sub-Saharan Africa. The frequency of the deletion increases with latitude across multiple continents and approaches 100% in northern Europe. Flies with the deletion have more than 4-fold higher MtnA expression than flies with the ancestral sequence. Using reporter gene constructs in transgenic flies, we show that the 3' UTR deletion significantly contributes to the observed expression difference. Population genetic analyses uncovered signatures of a selective sweep in the MtnA region within populations from northern Europe. We also find that the 3’ UTR deletion is associated with increased oxidative stress tolerance. These results suggest that the 3' UTR deletion has been a target of selection for its ability to confer increased levels of MtnA expression in northern European populations, likely due to a local adaptive advantage of increased oxidative stress tolerance. PMID:27120580
Fukunaga, Kenji; Ichitani, Katsuyuki; Taura, Satoru; Sato, Muneharu; Kawase, Makoto
2005-02-01
We determined the sequence of ribosomal DNA (rDNA) intergenic spacer (IGS) of foxtail millet isolated in our previous study, and identified subrepeats in the polymorphic region. We also developed a PCR-based method for identifying rDNA types based on sequence information and assessed 153 accessions of foxtail millet. Results were congruent with our previous works. This study provides new findings regarding the geographical distribution of rDNA variants. This new method facilitates analyses of numerous foxtail millet accessions. It is helpful for typing of foxtail millet germplasms and elucidating the evolution of this millet.
Xu, Feng-Ling; Ding, Mei; Yao, Jun; Shi, Zhang-Sen; Wu, Xue; Zhang, Jing-Jing; Pang, Hao; Xing, Jia-Xin; Xuan, Jin-Feng; Wang, Bao-Jie
2017-01-01
To determine whether mitochondrial DNA (mtDNA) variations are associated with schizophrenia, 313 patients with schizophrenia and 326 unaffected participants of the northern Chinese Han population were included in a prospective study. Single-nucleotide polymorphisms (SNPs) including C5178A, A10398G, G13708A, and C13928G were analyzed by polymerase chain reaction-restriction fragment length polymorphism (PCR-RFLP). Hypervariable regions I and II (HVSI and HVSII) were analyzed by sequencing. The results showed that the 4 SNPs and 11 haplotypes, composed of the 4 SNPs, did not differ significantly between patient and control groups. No significant association between haplogroups and the risk of schizophrenia was ascertained after Bonferroni correction. In conclusion, there was no evidence of an association between mtDNA (the 4 SNPs and the control region) and schizophrenia in the northern Chinese Han population.
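The kind of comparison described above (allele counts in patients versus controls, followed by Bonferroni correction across the SNPs tested) can be sketched as follows. This is only an illustration: the abstract does not state which test statistic the authors used, so the Pearson chi-square test here is an assumption, and all counts and p-values in the example are hypothetical.

```python
import math

def chi2_2x2(a, b, c, d):
    """Pearson chi-square (1 df, no continuity correction) for the table [[a, b], [c, d]].

    Rows could be patients/controls and columns the two alleles of one SNP.
    """
    n = a + b + c + d
    return n * (a * d - b * c) ** 2 / ((a + b) * (c + d) * (a + c) * (b + d))

def p_value_1df(chi2):
    """Upper-tail p-value of a chi-square statistic with one degree of freedom."""
    return math.erfc(math.sqrt(chi2 / 2.0))

def bonferroni_significant(p_values, alpha=0.05):
    """Flag each test as significant only if p < alpha / (number of tests)."""
    threshold = alpha / len(p_values)
    return [p < threshold for p in p_values]

# Hypothetical allele counts for one SNP (not data from the study).
stat = chi2_2x2(10, 20, 20, 10)
p = p_value_1df(stat)

# Hypothetical p-values for four SNPs; the Bonferroni threshold is 0.05 / 4 = 0.0125.
flags = bonferroni_significant([0.01, 0.04, 0.2, 0.5])
```

With four tests, a nominal p-value of 0.04 no longer counts as significant, which is the effect a Bonferroni correction is meant to have.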
2014-01-01
Background Variation in seed oil composition and content among soybean varieties is largely attributed to differences in transcript sequences and/or transcript accumulation of oil production related genes in seeds. Discovery and analysis of sequence and expression variations in these genes will accelerate soybean oil quality improvement. Results In an effort to identify these variations, we sequenced the transcriptomes of soybean seeds from nine lines varying in oil composition and/or total oil content. Our results showed that 69,338 distinct transcripts from 32,885 annotated genes were expressed in seeds. A total of 8,037 transcript expression polymorphisms and 50,485 transcript sequence polymorphisms (48,792 SNPs and 1,693 small Indels) were identified among the lines. Effects of the transcript polymorphisms on their encoded protein sequences and functions were predicted. The studies also provided independent evidence that the lack of FAD2-1A gene activity and a non-synonymous SNP in the coding sequence of FAB2C caused elevated oleic acid and stearic acid levels in soybean lines M23 and FAM94-41, respectively. Conclusions As a proof-of-concept, we developed an integrated RNA-seq and bioinformatics approach to identify and functionally annotate transcript polymorphisms, and demonstrated its high effectiveness for discovery of genetic and transcript variations that result in altered oil quality traits. The collection of transcript polymorphisms coupled with their predicted functional effects will be a valuable asset for further discovery of genes, gene variants, and functional markers to improve soybean oil quality. PMID:24755115
Krawczyk, Paweł; Kucharczyk, Tomasz; Kowalski, Dariusz M; Powrózek, Tomasz; Ramlau, Rodryg; Kalinka-Warzocha, Ewa; Winiarczyk, Kinga; Knetki-Wróblewska, Magdalena; Wojas-Krawczyk, Kamila; Kałakucka, Katarzyna; Dyszkiewicz, Wojciech; Krzakowski, Maciej; Milanowski, Janusz
2014-12-01
We present a retrospective analysis of up to five polymorphisms in the TS, MTHFR and ERCC1 genes as molecular predictive markers for homogeneous Caucasian, non-squamous NSCLC patients treated with pemetrexed and platinum front-line chemotherapy. The following polymorphisms in DNA isolated from 115 patients were analyzed: a variable number of 28-bp tandem repeats in the 5'-UTR region of the TS gene; a single nucleotide polymorphism (SNP) within the second tandem repeat of the TS gene (G>C); a 6-bp deletion in the 3'-UTR region of TS (1494del6); the 677C>T SNP in MTHFR; and the 19007C>T SNP in ERCC1. The results of the molecular examinations were correlated with disease control rate, progression-free survival (PFS) and overall survival. The polymorphic tandem repeat sequence (2R, 3R) in the enhancer region of the TS gene and the G>C SNP within the second repeat of the 3R allele seem to be important for the effectiveness of platinum and pemetrexed in first-line chemotherapy. A non-significant shortening of PFS was observed in 3R/3R homozygotes as compared to 2R/2R and 2R/3R genotypes, while PFS was significantly shorter in patients carrying both the 3R allele and the G nucleotide. The combined analysis of TS VNTR and MTHFR 677C>T SNP revealed shortening of PFS in carriers of both the 3R allele in TS and two C alleles in MTHFR. The strongest factors increasing the risk of progression were poor performance status (PS), weight loss, anemia and the simultaneous presence of the 3R allele and the G nucleotide in the second repeat of the 3R allele in TS. Moreover, lack of second-line chemotherapy, weight loss, poor performance status and the above-mentioned TS genotype increased the risk of early mortality. The examined polymorphisms should be considered as molecular predictive factors for pemetrexed- and platinum-based front-line chemotherapy in non-squamous NSCLC patients.
Herrmann, Luise; Haase, Ilka; Blauhut, Maike; Barz, Nadine; Fischer, Markus
2014-12-17
Two cocoa types, Arriba and CCN-51, are being cultivated in Ecuador. With regard to the unique aroma, Arriba is considered a fine cocoa type, while CCN-51 is a bulk cocoa because of its weaker aroma. Because it is being assumed that Arriba is mixed with CCN-51, there is an interest in the analytical differentiation of the two types. Two methods to identify CCN-51 adulterations in Arriba cocoa were developed on the basis of differences in the chloroplast DNA. On the one hand, a different repeat of the sequence TAAAG in the inverted repeat region results in a different length of amplicons for the two cocoa types, which can be detected by agarose gel electrophoresis, capillary gel electrophoresis, and denaturing high-performance liquid chromatography. On the other hand, single nucleotide polymorphisms (SNPs) between the CCN-51 and Arriba sequences represent restriction sites, which can be used for restriction fragment length polymorphism analysis. A semi-quantitative analysis based on these SNPs is feasible. A method for an exact quantitation based on these results is not realizable. These sequence variations were confirmed for a comprehensive cultivar collection of Arriba and CCN-51, for both bean and leaf samples.
NcoI and TaqI RFLPs for human M creatine kinase (CKM)
DOE Office of Scientific and Technical Information (OSTI.GOV)
Perryman, M.B.; Hejtmancik, J.F.; Ashizawa, Tetsuo
1988-09-12
Probe pHMCKUT contains a 135 bp cDNA fragment inserted into pGEM 3. The probe corresponds to nucleotides 1,201 to 1,336 located in the 3′ untranslated region of human M creatine kinase. The probe is specific for human M creatine kinase and does not hybridize to human B creatine kinase sequences. NcoI identifies a two-allele polymorphism of a band at either 2.5 kb or 3.6 kb. TaqI identifies a two-allele polymorphism at either 3.8 kb or 4.5 kb. Human M creatine kinase has been localized to chromosome 19q. Autosomal co-dominant inheritance was shown in six informative Caucasian families.
Genetic diversity and relationship analysis of Gossypium arboreum accessions.
Liu, F; Zhou, Z L; Wang, C Y; Wang, Y H; Cai, X Y; Wang, X X; Zhang, Z S; Wang, K B
2015-11-19
Simple sequence repeat techniques were used to identify the genetic diversity of 101 Gossypium arboreum accessions collected from India, Vietnam, and the southwest of China (Guizhou, Guangxi, and Yunnan provinces). Twenty-six pairs of SSR primers produced a total of 103 polymorphic loci with an average of 3.96 polymorphic loci per primer. The average of the effective number of alleles, Nei's gene diversity, and Shannon's information index were 0.59, 0.2835, and 0.4361, respectively. The diversity varied among different geographic regions. The result of principal component analysis was consistent with that of unweighted pair group method with arithmetic mean clustering analysis. The 101 G. arboreum accessions were clustered into 2 groups.
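The three summary statistics named in this abstract can be computed per locus from allele frequencies. The sketch below uses the standard textbook definitions; the abstract does not say which software or scoring scheme the authors used, so treat this as an illustration, and note that the example frequencies are hypothetical.

```python
import math

def effective_alleles(freqs):
    """Effective number of alleles: Ne = 1 / sum(p_i^2)."""
    return 1.0 / sum(p * p for p in freqs)

def nei_gene_diversity(freqs):
    """Nei's gene diversity at one locus: H = 1 - sum(p_i^2)."""
    return 1.0 - sum(p * p for p in freqs)

def shannon_index(freqs):
    """Shannon's information index: I = -sum(p_i * ln p_i), skipping zero frequencies."""
    return -sum(p * math.log(p) for p in freqs if p > 0)

# Hypothetical allele frequencies at a single SSR locus (not from the study).
example = [0.5, 0.3, 0.2]
ne = effective_alleles(example)
h = nei_gene_diversity(example)
i = shannon_index(example)
```

Study-level values such as those reported above would normally be averages of these per-locus quantities over all loci.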
The sheep growth hormone gene polymorphism and its effects on milk traits.
Dettori, Maria Luisa; Pazzola, Michele; Pira, Emanuela; Paschino, Pietro; Vacca, Giuseppe Massimo
2015-05-01
Growth hormone (GH) is encoded by the GH gene, which may be single copy or duplicate in sheep. The two copies of the sheep GH gene (GH1/GH2-N and GH2-Z) were entirely sequenced in 106 ewes of the Sarda breed, in order to highlight sequence polymorphisms and investigate possible associations between genetic variants and milk traits. Milk traits included milk yield and fat, protein, casein and lactose percentage. We identified 75 nucleotide changes. Transcription factor binding site prediction revealed two sequences potentially recognised by the pituitary-specific transcription factor POU1F1 at the GH1/GH2-N gene, which were lost at the promoter of GH2-Z; this might explain the different tissues of expression of GH1/GH2-N (pituitary) and GH2-Z (placenta). Significant differences in milk traits were observed among genotypes at polymorphic loci only for the GH2-Z gene. Sheep with the homozygous genotype ss748770547 CC had a higher fat percentage (P < 0.01) than TT. SNP ss748770547 was part of a potential transcription factor binding site for C/EBP alpha (CCAAT/Enhancer Binding Protein), which is involved in the regulation of adipogenesis and adipoblast differentiation. SNP ss748770547, located in the GH2-Z gene 5' flanking region, may be a causal mutation affecting milk fat content. These findings might contribute to the knowledge of the sheep GH locus and might be useful in selection processes in sheep.
Development of a Multiplex Single Base Extension Assay for Mitochondrial DNA Haplogroup Typing
Nelson, Tahnee M.; Just, Rebecca S.; Loreille, Odile; Schanfield, Moses S.; Podini, Daniele
2007-01-01
Aim To provide a screening tool to reduce time and sample consumption when attempting mtDNA haplogroup typing. Methods A single base primer extension assay was developed to enable typing, in a single reaction, of twelve mtDNA haplogroup specific polymorphisms. For validation purposes a total of 147 samples were tested including 73 samples successfully haplogroup typed using mtDNA control region (CR) sequence data, 21 samples inconclusively haplogroup typed by CR data, 20 samples previously haplogroup typed using restriction fragment length polymorphism (RFLP) analysis, and 31 samples of known ancestral origin without previous haplogroup typing. Additionally, two highly degraded human bones embalmed and buried in the early 1950s were analyzed using the single nucleotide polymorphisms (SNP) multiplex. Results When the SNP multiplex was used to type the 96 previously CR sequenced specimens, an increase in haplogroup or macrohaplogroup assignment relative to conventional CR sequence analysis was observed. The single base extension assay was also successfully used to assign a haplogroup to decades-old, embalmed skeletal remains dating to World War II. Conclusion The SNP multiplex was successfully used to obtain haplogroup status of highly degraded human bones, and demonstrated the ability to eliminate possible contributors. The SNP multiplex provides a low-cost, high throughput method for typing of mtDNA haplogroups A, B, C, D, E, F, G, H, L1/L2, L3, M, and N that could be useful for screening purposes for human identification efforts and anthropological studies. PMID:17696300
Liu, Shi; Gao, Peng; Zhu, Qianglong; Luan, Feishi; Davis, Angela R.; Wang, Xiaolu
2016-01-01
Cleaved amplified polymorphic sequence (CAPS) markers are useful tools for detecting single nucleotide polymorphisms (SNPs). This study detected and converted SNP sites into CAPS markers based on high-throughput re-sequencing data in watermelon, for linkage map construction and quantitative trait locus (QTL) analysis. Two inbred lines, Cream of Saskatchewan (COS) and LSW-177, had been re-sequenced and were analyzed with a custom Perl script for CAPS marker development. 88.7% and 78.5% of the assembled sequences of the two parental materials could be mapped to the reference watermelon genome, respectively. Comparative analysis of the assembled genome data provided 225,693 and 19,268 SNPs and indels between the two materials. 532 pairs of CAPS markers were designed with 16 restriction enzymes, among which 271 pairs of primers gave distinct bands of the expected length and polymorphic bands, via PCR and enzyme digestion, with a polymorphic rate of 50.94%. Using the new CAPS markers, an initial CAPS-based genetic linkage map was constructed with the F2 population, spanning 1836.51 cM with 11 linkage groups and 301 markers. 12 QTLs were detected related to fruit flesh color, length, width, shape index, and brix content. These new CAPS markers will be a valuable resource for breeding programs and genetic studies of watermelon. PMID:27162496
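The core idea of converting a SNP into a CAPS marker is that one allele contains a restriction enzyme recognition site and the other does not, so digestion of the PCR product gives different band patterns. A minimal sketch of that check is given below; it is not the authors' Perl script, the two allele sequences are hypothetical, and it handles only exact (non-degenerate) recognition sites in alleles of equal length. The last line simply reproduces the reported polymorphic rate from 271 of 532 primer pairs.

```python
def cut_sites(seq, site):
    """Return 0-based start positions of every occurrence of a recognition site."""
    seq, site = seq.upper(), site.upper()
    return [i for i in range(len(seq) - len(site) + 1) if seq[i:i + len(site)] == site]

def is_caps_candidate(allele_a, allele_b, site):
    """True when the two alleles differ in where the enzyme would recognise its site."""
    return cut_sites(allele_a, site) != cut_sites(allele_b, site)

# Hypothetical alleles differing by one SNP (C>T) inside an EcoRI site (GAATTC).
parent1 = "ATGCGAATTCTTAGC"
parent2 = "ATGCGAATTTTTAGC"

candidate = is_caps_candidate(parent1, parent2, "GAATTC")

# Polymorphic rate reported in the abstract: 271 of 532 primer pairs.
polymorphic_rate = round(271 / 532 * 100, 2)
```

In practice each candidate would still need primer design around the site and confirmation by PCR and digestion, as the abstract describes.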
Isolation and characterization of major histocompatibility complex class II B genes in cranes.
Kohyama, Tetsuo I; Akiyama, Takuya; Nishida, Chizuko; Takami, Kazutoshi; Onuma, Manabu; Momose, Kunikazu; Masuda, Ryuichi
2015-11-01
In this study, we isolated and characterized the major histocompatibility complex (MHC) class II B genes in cranes. Genomic sequences spanning exons 1 to 4 were amplified and determined in 13 crane species and three other species closely related to cranes. In all, 55 unique sequences were identified, and at least two polymorphic MHC class II B loci were found in most species. An analysis of sequence polymorphisms showed the signature of positive selection and recombination. A phylogenetic reconstruction based on exon 2 sequences indicated that trans-species polymorphism has persisted for at least 10 million years, whereas phylogenetic analyses of the sequences flanking exon 2 revealed a pattern of concerted evolution. These results suggest that both balancing selection and recombination play important roles in the crane MHC evolution.
Zhou, Jie; Kherani, Femida; Bardakjian, Tanya M.; Katowitz, James; Hughes, Nkecha; Schimmenti, Lisa A.; Schneider, Adele
2008-01-01
Purpose Mutations in the SOX2 and CHX10 genes have been reported in patients with anophthalmia and/or microphthalmia. In this study, we evaluated 34 anophthalmic/microphthalmic patient DNA samples (two sets of siblings included) for mutations and sequence variants in SOX2 and CHX10. Methods Conformational sensitive gel electrophoresis (CSGE) was used for the initial SOX2 and CHX10 screening of 34 affected individuals (two sets of siblings), five unaffected family members, and 80 healthy controls. Patient samples containing heteroduplexes were selected for sequence analysis. Base pair changes in SOX2 and CHX10 were confirmed by sequencing bidirectionally in patient samples. Results Two novel heterozygous mutations and two sequence variants (one known) in SOX2 were identified in this cohort. Mutation c.310 G>T (p. Glu104X), found in one patient, was in the region encoding the high mobility group (HMG) DNA-binding domain and resulted in a change from glutamic acid to a stop codon. The second mutation, noted in two affected siblings, was a single nucleotide deletion c.549delC (p. Pro184ArgfsX19) in the region encoding the activation domain, resulting in a frameshift and premature termination of the coding sequence. The shortened protein products may result in the loss of function. In addition, a novel nucleotide substitution c.*557G>A was identified in the 3′-untranslated region in one patient. The relationship between the nucleotide change and the protein function is indeterminate. A known single nucleotide polymorphism (c. *469 C>A, SNP rs11915160) was also detected in 2 of the 34 patients. Screening of CHX10 identified two synonymous sequence variants, c.471 C>T (p.Ser157Ser, rs35435463) and c.579 G>A (p. Gln193Gln, novel SNP), and one non-synonymous sequence variant, c.871 G>A (p. Asp291Asn, novel SNP). The non-synonymous polymorphism was also present in healthy controls, suggesting non-causality. 
Conclusions These results support the role of SOX2 in ocular development. Loss of SOX2 function results in severe eye malformation. CHX10 was not implicated with microphthalmia/anophthalmia in our patient cohort. PMID:18385794
Cho, Soochin; Huang, Zachary Y; Green, Daniel R; Smith, Deborah R; Zhang, Jianzhi
2006-11-01
The mechanism of sex determination varies substantively among evolutionary lineages. One important mode of genetic sex determination is haplodiploidy, which is used by approximately 20% of all animal species, including >200,000 species of the entire insect order Hymenoptera. In the honey bee Apis mellifera, a hymenopteran model organism, females are heterozygous at the csd (complementary sex determination) locus, whereas males are hemizygous (from unfertilized eggs). Fertilized homozygotes develop into sterile males that are eaten before maturity. Because homozygotes have zero fitness and because common alleles are more likely than rare ones to form homozygotes, csd should be subject to strong overdominant selection and negative frequency-dependent selection. Under these selective forces, together known as balancing selection, csd is expected to exhibit a high degree of intraspecific polymorphism, with long-lived alleles that may be even older than the species. Here we sequence the csd genes as well as randomly selected neutral genomic regions from individuals of three closely related species, A. mellifera, Apis cerana, and Apis dorsata. The polymorphic level is approximately seven times higher in csd than in the neutral regions. Gene genealogies reveal trans-species polymorphisms at csd but not at any neutral regions. Consistent with the prediction of rare-allele advantage, nonsynonymous mutations are found to be positively selected in csd only in early stages after their appearances. Surprisingly, three different hypervariable repetitive regions in csd are present in the three species, suggesting variable mechanisms underlying allelic specificities. Our results provide a definitive demonstration of balancing selection acting at the honey bee csd gene, offer insights into the molecular determinants of csd allelic specificities, and help avoid homozygosity in bee breeding.
Aarabi, Mahmoud; San Gabriel, Maria C.; Chan, Donovan; Behan, Nathalie A.; Caron, Maxime; Pastinen, Tomi; Bourque, Guillaume; MacFarlane, Amanda J.; Zini, Armand; Trasler, Jacquetta
2015-01-01
Dietary folate is a major source of methyl groups required for DNA methylation, an epigenetic modification that is actively maintained and remodeled during spermatogenesis. While high-dose folic acid supplementation (up to 10 times the daily recommended dose) has been shown to improve sperm parameters in infertile men, the effects of supplementation on the sperm epigenome are unknown. To assess the impact of 6 months of high-dose folic acid supplementation on the sperm epigenome, we studied 30 men with idiopathic infertility. Blood folate concentrations increased significantly after supplementation with no significant improvements in sperm parameters. Methylation levels of the differentially methylated regions of several imprinted loci (H19, DLK1/GTL2, MEST, SNRPN, PLAGL1, KCNQ1OT1) were normal both before and after supplementation. Reduced representation bisulfite sequencing (RRBS) revealed a significant global loss of methylation across different regions of the sperm genome. The most marked loss of DNA methylation was found in sperm from patients homozygous for the methylenetetrahydrofolate reductase (MTHFR) C677T polymorphism, a common polymorphism in a key enzyme required for folate metabolism. RRBS analysis also showed that most of the differentially methylated tiles were located in DNA repeats, low CpG-density and intergenic regions. Ingenuity Pathway Analysis revealed that methylation of promoter regions was altered in several genes involved in cancer and neurobehavioral disorders including CBFA2T3, PTPN6, COL18A1, ALDH2, UBE4B, ERBB2, GABRB3, CNTNAP4 and NIPA1. Our data reveal alterations of the human sperm epigenome associated with high-dose folic acid supplementation, effects that were exacerbated by a common polymorphism in MTHFR. PMID:26307085
DOE Office of Scientific and Technical Information (OSTI.GOV)
Murat, Claude; Riccioni, C; Belfiori, B
The level of genetic diversity and genetic structure in the Perigord black truffle (Tuber melanosporum Vittad.) has been debated for several years, mainly due to the lack of appropriate genetic markers. Microsatellites or simple sequence repeats (SSRs) are important for the genome organisation, phenotypic diversity and are one of the most popular molecular markers. In this study, we surveyed the T. melanosporum genome (1) to characterise its SSR pattern; (2) to compare it with SSR patterns found in 48 other fungal and three oomycetes genomes and (3) to identify new polymorphic SSR markers for population genetics. The T. melanosporum genome is rich in SSRs with 22,425 SSRs with mono-nucleotides being the most frequent motifs. SSRs were found in all genomic regions although they are more frequent in non-coding regions (introns and intergenic regions). Sixty out of 135 PCR-amplified mono-, di-, tri-, tetra-, penta-, and hexanucleotides were polymorphic (44%) within black truffle populations and 27 were randomly selected and analysed on 139 T. melanosporum isolates from France, Italy and Spain. The number of alleles varied from 2 to 18 and the expected heterozygosity from 0.124 to 0.815. One hundred and thirty-two different multilocus genotypes out of the 139 T. melanosporum isolates were identified and the genotypic diversity was high (0.999). Polymorphic SSRs were found in UTR regulatory regions of fruiting bodies and ectomycorrhiza regulated genes, suggesting that they may play a role in phenotypic variation. In conclusion, SSRs developed in this study were highly polymorphic and our results showed that T. melanosporum is a species with an important genetic diversity, which is in agreement with its recently uncovered heterothallic mating system.
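A genome survey for mono- to hexanucleotide SSRs of the kind described above can be approximated with regular expressions. The sketch below is not the tool used in the study: the minimum repeat counts are illustrative assumptions, only perfect repeats are reported, and the test sequence is made up.

```python
import re

# Minimum repeat counts per motif length; these thresholds are illustrative assumptions.
MIN_REPEATS = {1: 10, 2: 6, 3: 5, 4: 4, 5: 4, 6: 4}

def is_primitive(motif):
    """True if the motif is not itself a repetition of a shorter unit (e.g. 'ATAT')."""
    n = len(motif)
    return not any(n % k == 0 and motif[:k] * (n // k) == motif for k in range(1, n))

def find_ssrs(seq):
    """Return sorted (start, motif, repeat_count) tuples for perfect SSRs in a DNA string."""
    seq = seq.upper()
    hits = []
    for size, min_rep in MIN_REPEATS.items():
        # One captured motif of the given size followed by at least min_rep - 1 copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (size, min_rep - 1))
        for m in pattern.finditer(seq):
            motif = m.group(1)
            if is_primitive(motif):
                hits.append((m.start(), motif, len(m.group(0)) // size))
    return sorted(hits)

# Hypothetical sequence containing an (AT)7 dinucleotide and an (A)12 mononucleotide repeat.
demo = "GGC" + "AT" * 7 + "CCG" + "A" * 12 + "TTG"
ssrs = find_ssrs(demo)
```

The `is_primitive` filter keeps a run such as twelve A bases from also being reported as a dinucleotide (AA)6 repeat.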
Jelokhani-Niaraki, Saber; Tahmoorespur, Mojtaba; Bitaraf-Sani, Morteza
2015-01-01
Very little is known about LHR and FSHR genes of domestic dromedary camels. The main objective of this study was to determine and analyze partial genomic regions of FSHR and LHR genes in dromedary camels for the first time. To this end, a total of 50 DNA samples belonging to dromedary camels raised in Iran were sent for sequencing (25 samples of each gene). We compared the nucleotide sequences of Camelus dromedarius with corresponding sequences of previously published FSHR and LHR genes in bactrian camels and other species. According to the data, the same nucleotide variation was identified in both regions of the two camel species. The alignment of deduced protein sequences of the two different species revealed an amino acid variation at the FSHR region. No evidence of amino acid variation was observed, however, in LHR sequences. Phylogenetic analysis indicated that both camel species had a close relationship and clustered together in a separate branch. This was further confirmed by genetic distance values illustrating significant sequence identity between Camelus dromedarius and Camelus bactrianus. Interestingly, sequence comparisons revealed heterozygote patterns in FSHR sequences isolated from dromedary camels of Iran. In comparison to other species, this camel contains three amino acid substitutions at 5, 67, and 105 positions in the FSHR coding region. These positions are found exclusively in camels and can be considered as species specific. The results of our study can be used for hormone functionality research (FSHR and LHR) as well as reproduction-linked polymorphisms and breeding programs. PMID:27844002
Gender Identification in Date Palm Using Molecular Markers.
Awan, Faisal Saeed; Maryam; Jaskani, Muhammad J; Sadia, Bushra
2017-01-01
Breeding of date palm is complicated because of its long life cycle and heterozygous nature. Sexual propagation of date palm does not produce true-to-type plants. Sex of date palms cannot be identified until the first flowering stage. Molecular markers such as random amplified polymorphic DNA (RAPD), sequence-characterized amplified regions (SCAR), and simple sequence repeats (SSR) have successfully been used to identify the sex-linked loci in the plant genome and to isolate the corresponding genes. This chapter highlights the use of three molecular markers including RAPD, SCAR, and SSR to identify the gender of date palm seedlings.
2013-01-01
Background Although banana (Musa sp.) is an important edible crop, contributing towards poverty alleviation and food security, limited transcriptome datasets are available for use in accelerated molecular-based breeding in this genus. 454 GS-FLX Titanium technology was employed to determine the sequence of gene transcripts in genotypes of Musa acuminata ssp. burmannicoides Calcutta 4 and M. acuminata subgroup Cavendish cv. Grande Naine, contrasting in resistance to the fungal pathogen Mycosphaerella musicola, causal organism of Sigatoka leaf spot disease. To enrich for transcripts under biotic stress responses, full length-enriched cDNA libraries were prepared from whole plant leaf materials, both uninfected and artificially challenged with pathogen conidiospores. Results The study generated 846,762 high quality sequence reads, with an average length of 334 bp and totalling 283 Mbp. De novo assembly generated 36,384 and 35,269 unigene sequences for M. acuminata Calcutta 4 and Cavendish Grande Naine, respectively. A total of 64.4% of the unigenes were annotated through Basic Local Alignment Search Tool (BLAST) similarity analyses against public databases. Assembled sequences were functionally mapped to Gene Ontology (GO) terms, with unigene functions covering a diverse range of molecular functions, biological processes and cellular components. Genes from a number of defense-related pathways were observed in transcripts from each cDNA library. Over 99% of contig unigenes mapped to exon regions in the reference M. acuminata DH Pahang whole genome sequence. A total of 4068 genic-SSR loci were identified in Calcutta 4 and 4095 in Cavendish Grande Naine. A subset of 95 potential defense-related gene-derived simple sequence repeat (SSR) loci were validated for specific amplification and polymorphism across M. acuminata accessions. 
Fourteen loci were polymorphic, with alleles per polymorphic locus ranging from 3 to 8 and polymorphism information content ranging from 0.34 to 0.82. Conclusions A large set of unigenes were characterized in this study for both M. acuminata Calcutta 4 and Cavendish Grande Naine, increasing the number of public domain Musa ESTs. This transcriptome is an invaluable resource for furthering our understanding of biological processes elicited during biotic stresses in Musa. Gene-based markers will facilitate molecular breeding strategies, forming the basis of genetic linkage mapping and analysis of quantitative trait loci. PMID:23379821
Fine-scale population structure and the era of next-generation sequencing.
Henn, Brenna M; Gravel, Simon; Moreno-Estrada, Andres; Acevedo-Acevedo, Suehelay; Bustamante, Carlos D
2010-10-15
Fine-scale population structure characterizes most continents and is especially pronounced in non-cosmopolitan populations. Roughly half of the world's population remains non-cosmopolitan and even populations within cities often assort along ethnic and linguistic categories. Barriers to random mating can be ecologically extreme, such as the Sahara Desert, or cultural, such as the Indian caste system. In either case, subpopulations accumulate genetic differences if the barrier is maintained over multiple generations. Genome-wide polymorphism data, initially with only a few hundred autosomal microsatellites, have clearly established differences in allele frequency not only among continental regions, but also within continents and within countries. We review recent evidence from the analysis of genome-wide polymorphism data for genetic boundaries delineating human population structure and the main demographic and genomic processes shaping variation, and discuss the implications of population structure for the distribution and discovery of disease-causing genetic variants, in the light of the imminent availability of sequencing data for a multitude of diverse human genomes.
E, G X; Na, R S; Zhao, Y J; Chen, L P; Qiu, X Y; Huang, Y F
2015-04-10
Cathelicidins are a major family of antimicrobial peptides (AMPs), an important component of the innate immune system, playing a critical role in host defense and disease resistance in virtually all living species. Polymorphism and functional studies on cathelicidin of Tianzhu white yak contribute to understanding the specific innate immune mechanism in animals living at high altitudes in comparison to cattle and domesticated white yak. Thirty-six individuals of Tianzhu white yak, originating from the area of three ecotypes (Gansu in China), were investigated. The total length of the aligned yak cathelicidin 6 (CATHL-6) sequences was 1923 bp, including six single nucleotide polymorphisms and one indel. Ten haplotypes were identified, and phylogenetic analyses resolved those 10 haplotypes in two clusters. The results indicate that the white yak originated from two domestication sites. In addition, the lack of a significant pairwise difference between sequences (Tajima's D = 0.92865, P > 0.10) in the CATHL-6 region indicates the absence of population size expansion in the current white yak population.
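As an illustration of the statistic quoted above, the following is a minimal sketch of Tajima's D computed with the standard Tajima (1989) formula. It is not the authors' pipeline: the function name `tajimas_d` and the toy alignments are assumptions made for this example, and the code assumes equal-length aligned sequences without gaps or missing data.

```python
from itertools import combinations
from math import sqrt


def tajimas_d(seqs):
    """Tajima's D for equal-length, gap-free aligned sequences (hypothetical helper)."""
    n = len(seqs)
    length = len(seqs[0])
    # S: number of segregating (polymorphic) alignment columns
    seg_sites = sum(1 for i in range(length) if len({s[i] for s in seqs}) > 1)
    if n < 4 or seg_sites == 0:
        raise ValueError("need at least 4 sequences and 1 segregating site")

    # pi: mean number of pairwise differences between sequences
    pairs = list(combinations(seqs, 2))
    pi = sum(sum(a != b for a, b in zip(x, y)) for x, y in pairs) / len(pairs)

    # Constants from Tajima (1989)
    a1 = sum(1.0 / i for i in range(1, n))
    a2 = sum(1.0 / i ** 2 for i in range(1, n))
    b1 = (n + 1) / (3.0 * (n - 1))
    b2 = 2.0 * (n * n + n + 3) / (9.0 * n * (n - 1))
    c1 = b1 - 1.0 / a1
    c2 = b2 - (n + 2) / (a1 * n) + a2 / a1 ** 2
    e1 = c1 / a1
    e2 = c2 / (a1 ** 2 + a2)

    # D contrasts pi with Watterson's estimate S / a1, scaled by its standard deviation
    variance = e1 * seg_sites + e2 * seg_sites * (seg_sites - 1)
    return (pi - seg_sites / a1) / sqrt(variance)


# Toy alignments (made up): intermediate-frequency variants versus singletons
balanced = ["AA", "AA", "TT", "TT"]
singleton = ["AA", "AA", "AA", "TT"]
print(tajimas_d(balanced), tajimas_d(singleton))
```

Variants at intermediate frequency push D above zero and an excess of rare variants pushes it below zero; a value that does not differ significantly from zero, as reported in the abstract, gives no evidence for a departure from neutral expectations such as a recent expansion.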
Lewers, Kim S; Saski, Chris A; Cuthbertson, Brandon J; Henry, David C; Staton, Meg E; Main, Dorrie S; Dhanaraj, Anik L; Rowland, Lisa J; Tomkins, Jeff P
2008-01-01
Background The recent development of novel repeat-fruiting types of blackberry (Rubus L.) cultivars, combined with a long history of morphological marker-assisted selection for thornlessness by blackberry breeders, has given rise to increased interest in using molecular markers to facilitate blackberry breeding. Yet no genetic maps, molecular markers, or even sequences exist specifically for cultivated blackberry. The purpose of this study is to begin development of these tools by generating and annotating the first blackberry expressed sequence tag (EST) library, designing primers from the ESTs to amplify regions containing simple sequence repeats (SSR), and testing the usefulness of a subset of the EST-SSRs with two blackberry cultivars. Results A cDNA library of 18,432 clones was generated from expanding leaf tissue of the cultivar Merton Thornless, a progenitor of many thornless commercial cultivars. Among the most abundantly expressed of the 3,000 genes annotated were those involved with energy, cell structure, and defense. From individual sequences containing SSRs, 673 primer pairs were designed. Of a randomly chosen set of 33 primer pairs tested with two blackberry cultivars, 10 detected an average of 1.9 polymorphic PCR products. Conclusion This rate predicts that this library may yield as many as 940 SSR primer pairs detecting 1,786 polymorphisms. This may be sufficient to generate a genetic map that can be used to associate molecular markers with phenotypic traits, making possible molecular marker-assisted breeding to complement existing morphological marker-assisted breeding in blackberry. PMID:18570660
Extensive sequence-influenced DNA methylation polymorphism in the human genome
2010-01-01
Background Epigenetic polymorphisms are a potential source of human diversity, but their frequency and relationship to genetic polymorphisms are unclear. DNA methylation, an epigenetic mark that is a covalent modification of the DNA itself, plays an important role in the regulation of gene expression. Most studies of DNA methylation in mammalian cells have focused on CpG methylation present in CpG islands (areas of concentrated CpGs often found near promoters), but there are also interesting patterns of CpG methylation found outside of CpG islands. Results We compared DNA methylation patterns on both alleles between many pairs (and larger groups) of related and unrelated individuals. Direct observation and simulation experiments revealed that around 10% of common single nucleotide polymorphisms (SNPs) reside in regions with differences in the propensity for local DNA methylation between the two alleles. We further showed that for the most common form of SNP, a polymorphism at a CpG dinucleotide, the presence of the CpG at the SNP positively affected local DNA methylation in cis. Conclusions Taken together with the known effect of DNA methylation on mutation rate, our results suggest an interesting interdependence between genetics and epigenetics underlying diversity in the human genome. PMID:20497546
Zhan, Xiaoli; Gao, Jianbin; Huangfu, Yifan; Fu, Changzhen; Zan, Linsen
2013-12-01
The objective of this research was to detect bovine Dickkopf 2 (DKK2) gene polymorphisms and analyze their associations with body measurement traits (BMT) and meat quality traits (MQT). Blood samples were taken from a total of 541 Qinchuan cattle aged from 18 to 24 months. Polymerase chain reaction-single strand conformation polymorphism (PCR-SSCP) was employed to identify DKK2 single nucleotide polymorphisms (SNPs) and to explore their possible association with BMT and MQT. Sequence analysis of the DKK2 gene revealed 2 SNPs (C29T and A169C) in the 5' untranslated region (5'UTR) of exon 1. C29T and A164T SNPs are both synonymous mutations, which showed 2 genotypes each, namely (CC, CT) and (AA, AC), respectively. Association analysis of the polymorphisms with body measurement and meat quality traits at the two loci showed that there were significant effects on CT, BL, RL, PBW, BFT, LMA, and IFC. These results suggest that the DKK2 gene might have potential effects on BMT and MQT in the Qinchuan cattle population and could be used for marker-assisted selection.
MCT1, MCT4 and CD147 gene polymorphisms in healthy horses and horses with myopathy.
Mykkänen, A K; Koho, N M; Reeben, M; McGowan, C M; Pösö, A R
2011-12-01
Polymorphisms in human lactate transporter proteins (monocarboxylate transporters; MCTs), especially the MCT1 isoform, can affect lactate transport activity and cause signs of exercise-induced myopathy. Muscles express MCT1, MCT4 and CD147, an ancillary protein, indispensable for the activity of MCT1 and MCT4. We sequenced the coding sequence (cDNA) of horse MCT4 for the first time and examined polymorphisms in the cDNA of MCT1, MCT4 and CD147 of 16 healthy horses. To study whether signs of myopathy are linked to the polymorphisms, biopsy samples were taken from 26 horses with exercise-induced recurrent myopathy. Two polymorphisms that cause a change in amino acid sequence were found in MCT1 (Val(432)Ile and Lys(457)Gln) and one in CD147 (Met(125)Val). All polymorphisms in MCT4 were silent. Mutations in MCT1 or CD147 in equine muscle were not associated with myopathy. In the future, a functional study design is needed to evaluate the physiological role of the polymorphisms found. Copyright © 2010 Elsevier Ltd. All rights reserved.
The diploid genome sequence of an Asian individual
Wang, Jun; Wang, Wei; Li, Ruiqiang; Li, Yingrui; Tian, Geng; Goodman, Laurie; Fan, Wei; Zhang, Junqing; Li, Jun; Zhang, Juanbin; Guo, Yiran; Feng, Binxiao; Li, Heng; Lu, Yao; Fang, Xiaodong; Liang, Huiqing; Du, Zhenglin; Li, Dong; Zhao, Yiqing; Hu, Yujie; Yang, Zhenzhen; Zheng, Hancheng; Hellmann, Ines; Inouye, Michael; Pool, John; Yi, Xin; Zhao, Jing; Duan, Jinjie; Zhou, Yan; Qin, Junjie; Ma, Lijia; Li, Guoqing; Yang, Zhentao; Zhang, Guojie; Yang, Bin; Yu, Chang; Liang, Fang; Li, Wenjie; Li, Shaochuan; Li, Dawei; Ni, Peixiang; Ruan, Jue; Li, Qibin; Zhu, Hongmei; Liu, Dongyuan; Lu, Zhike; Li, Ning; Guo, Guangwu; Zhang, Jianguo; Ye, Jia; Fang, Lin; Hao, Qin; Chen, Quan; Liang, Yu; Su, Yeyang; san, A.; Ping, Cuo; Yang, Shuang; Chen, Fang; Li, Li; Zhou, Ke; Zheng, Hongkun; Ren, Yuanyuan; Yang, Ling; Gao, Yang; Yang, Guohua; Li, Zhuo; Feng, Xiaoli; Kristiansen, Karsten; Wong, Gane Ka-Shu; Nielsen, Rasmus; Durbin, Richard; Bolund, Lars; Zhang, Xiuqing; Li, Songgang; Yang, Huanming; Wang, Jian
2009-01-01
Here we present the first diploid genome sequence of an Asian individual. The genome was sequenced to 36-fold average coverage using massively parallel sequencing technology. We aligned the short reads onto the NCBI human reference genome to 99.97% coverage, and guided by the reference genome, we used uniquely mapped reads to assemble a high-quality consensus sequence for 92% of the Asian individual's genome. We identified approximately 3 million single-nucleotide polymorphisms (SNPs) inside this region, of which 13.6% were not in the dbSNP database. Genotyping analysis showed that SNP identification had high accuracy and consistency, indicating the high sequence quality of this assembly. We also carried out heterozygote phasing and haplotype prediction against HapMap CHB and JPT haplotypes (Chinese and Japanese, respectively), sequence comparison with the two available individual genomes (J. D. Watson and J. C. Venter), and structural variation identification. These variations were considered for their potential biological impact. Our sequence data and analyses demonstrate the potential usefulness of next-generation sequencing technologies for personal genomics. PMID:18987735
Wang, Haibin; Jiang, Jiafu; Chen, Sumei; Qi, Xiangyu; Fang, Weimin; Guan, Zhiyong; Teng, Nianjun; Liao, Yuan; Chen, Fadi
2014-01-01
The Asteraceae family is at the forefront of evolution owing to frequent hybridization. Hybridization is associated with the induction of widespread genetic and epigenetic changes and has played an important role in the evolution of many plant taxa. We attempted the intergeneric cross Chrysanthemum morifolium × Leucanthemum paludosum. To obtain a successful cross, we had to resort to ovule rescue. DNA profiling of the amphihaploid and amphidiploid was investigated using amplified fragment length polymorphism, sequence-related amplified polymorphism, start codon targeted polymorphism, and methylation-sensitive amplification polymorphism (MSAP). Hybridization induced rapid changes at the genetic and the epigenetic levels. The genetic changes mainly involved the loss of parental fragments and the gain of novel fragments, and some eliminated sequences possibly came from the noncoding region of L. paludosum. The MSAP analysis indicated that the level of DNA methylation was lower in the amphiploid (∼45%) than in the parental lines (51.5–50.6%), whereas it increased after amphidiploid formation. Events associated with intergeneric genomic shock were a feature of the C. morifolium × L. paludosum hybrid, given that the genetic relationship between the parental species is relatively distant. Our results provide genetic and epigenetic evidence for understanding genomic shock in wide crosses between species in Asteraceae and suggest a need to expand our current evolutionary framework to encompass a genetic/epigenetic dimension when seeking to understand wide crosses. PMID:24407856
Roy, Neha Samir; Park, Kyong-Cheul; Lee, Sung-Il; Im, Min-Ji; Ramekar, Rahul Vasudeo; Kim, Nam-Soo
2018-02-01
Molecular marker technologies have proven to be an important breakthrough for genetic studies, construction of linkage maps and population genetics analysis. Transposable elements (TEs) constitute major fractions of repetitive sequences in plants and offer a wide range of possible areas to be explored as molecular markers. Sequence characterized amplified region (SCAR) marker development provides us with a simple and time-saving alternative approach for marker development. We employed the CACTA-TD to develop SCARs, integrated them into a linkage map, and used them for population structure and genetic diversity analysis of a corn inbred population. A total of 108 dominant SCAR markers were designed, of which 32 were successfully integrated into the linkage map of a maize RIL population; the remaining markers were added to a physical map as references to check their distribution across all chromosomes. Moreover, 76 polymorphic SCARs were used for diversity analysis of corn accessions used in a Korean corn breeding program. The overall average polymorphic information content (PIC) was 0.34, expected heterozygosity was 0.324 and Shannon's information index was 0.491, with a percentage of polymorphism of 98.67%. Further analysis by association with desirable traits may also provide some accurate trait-specific tagged SCAR markers. TE-linked SCARs can provide an added level of polymorphism as well as improved discriminating ability and therefore can be useful in further breeding programs to develop high-yielding germplasm.
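The three diversity measures reported above can be sketched from per-locus allele frequencies. The abstract does not state which estimators or software were used, so the formulas below (Nei's expected heterozygosity, the Botstein et al. PIC, and the Shannon index with natural logarithms) are assumptions chosen as the most common textbook forms, and the function names and example frequencies are hypothetical.

```python
from math import log


def expected_heterozygosity(freqs):
    """He = 1 - sum(p_i^2) for one locus."""
    return 1.0 - sum(p * p for p in freqs)


def pic(freqs):
    """Botstein-style PIC = He - sum over i<j of 2 * p_i^2 * p_j^2."""
    cross = sum(
        2.0 * freqs[i] ** 2 * freqs[j] ** 2
        for i in range(len(freqs))
        for j in range(i + 1, len(freqs))
    )
    return expected_heterozygosity(freqs) - cross


def shannon_index(freqs):
    """I = -sum(p_i * ln p_i), skipping alleles with zero frequency."""
    return -sum(p * log(p) for p in freqs if p > 0)


# Example: one biallelic marker with made-up allele frequencies
locus = [0.7, 0.3]
print(expected_heterozygosity(locus), pic(locus), shannon_index(locus))
```

For a biallelic locus these measures peak at equal allele frequencies (He = 0.5, PIC = 0.375, I = ln 2), which gives a scale for reading study-wide averages such as those quoted in the abstract.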
Simultaneous Differentiation and Typing of Entamoeba histolytica and Entamoeba dispar
Zaki, Mehreen; Meelu, Parool; Sun, Wei; Clark, C. Graham
2002-01-01
Sequences corresponding to some of the polymorphic loci previously reported from Entamoeba histolytica have been detected in Entamoeba dispar. Comparison of nucleotide sequences of two loci between E. dispar strain SAW760 and E. histolytica strain HM-1:IMSS revealed significant differences in both repeat and flanking regions. The tandem repeat units varied not only in sequence but also in number and arrangement between the two species at both the loci. Using the sequences obtained, primer pairs aimed at amplifying species-specific products were designed and tested on a variety of E. histolytica and E. dispar samples. Amplification results were in complete agreement with the original species classification in all cases, and the PCR products displayed discernible size and pattern variations among the isolates. PMID:11923344
DNApod: DNA polymorphism annotation database from next-generation sequence read archives
Mochizuki, Takako; Tanizawa, Yasuhiro; Fujisawa, Takatomo; Ohta, Tazro; Nikoh, Naruo; Shimizu, Tokurou; Toyoda, Atsushi; Fujiyama, Asao; Kurata, Nori; Nagasaki, Hideki; Kaminuma, Eli; Nakamura, Yasukazu
2017-01-01
With the rapid advances in next-generation sequencing (NGS), datasets for DNA polymorphisms among various species and strains have been produced, stored, and distributed. However, reliability varies among these datasets because the experimental and analytical conditions used differ among assays. Furthermore, such datasets have been frequently distributed from the websites of individual sequencing projects. It is desirable to integrate DNA polymorphism data into one database featuring uniform quality control that is distributed from a single platform at a single place. DNA polymorphism annotation database (DNApod; http://tga.nig.ac.jp/dnapod/) is an integrated database that stores genome-wide DNA polymorphism datasets acquired under uniform analytical conditions, and this includes uniformity in the quality of the raw data, the reference genome version, and evaluation algorithms. DNApod genotypic data are re-analyzed whole-genome shotgun datasets extracted from sequence read archives, and DNApod distributes genome-wide DNA polymorphism datasets and known-gene annotations for each DNA polymorphism. This new database was developed for storing genome-wide DNA polymorphism datasets of plants, with crops being the first priority. Here, we describe our analyzed data for 679, 404, and 66 strains of rice, maize, and sorghum, respectively. The analytical methods are available as a DNApod workflow in an NGS annotation system of the DNA Data Bank of Japan and a virtual machine image. Furthermore, DNApod provides tables of links of identifiers between DNApod genotypic data and public phenotypic data. To advance the sharing of organism knowledge, DNApod offers basic and ubiquitous functions for multiple alignment and phylogenetic tree construction by using orthologous gene information. PMID:28234924
Fullerton, Stephanie M; Clark, Andrew G; Weiss, Kenneth M; Taylor, Scott L; Stengård, Jari H; Salomaa, Veikko; Boerwinkle, Eric; Nickerson, Deborah A
2002-07-01
A 3.3-kb region, encompassing the APOA2 gene and 2 kb of 5' and 3' flanking DNA, was re-sequenced in a "core" sample of 24 individuals, sampled without regard to health, from each of three populations: African-Americans from Jackson (Miss., USA), Europeans from North Karelia (Finland), and non-Hispanic European-Americans from Rochester (Minn., USA). Fifteen variable sites were identified (14 SNPs and one multi-allelic microsatellite, all silent), and these sites segregated as 18 sequence haplotypes (or nine, if SNPs only are considered). The haplotype distribution in the core African-American sample was unusual, with a deficit of particular haplotypes compared with those found in the other two samples, and a significantly (P<0.05) low level of nucleotide diversity relative to patterns of polymorphism and divergence at other human loci. Six of the 14 SNPs, whose variation captured the haplotype structure of the core data, were then genotyped by oligonucleotide ligation assay in an additional 2183 individuals from the same three populations (n=843, n=452, and n=888, respectively). All six sites varied in each of the larger "epidemiological" samples, and together, they defined 19 SNP haplotypes, seven with relative frequencies greater than 1% in the total sample; all of these common haplotypes had been identified earlier in the core re-sequencing survey. Here also, the African-American sample showed significantly lower SNP heterozygosity and haplotype diversity than the other two samples. The deficit of polymorphism is consistent with a population-specific non-neutral increase in the relative frequency of several haplotypes in Jackson.
Mutational screening of FGFR1, CER1, and CDON in a large cohort of trigonocephalic patients.
Jehee, Fernanda Sarquis; Alonso, Luis G; Cavalcanti, Denise P; Kim, Chong; Wall, Steven A; Mulliken, John B; Sun, Miao; Jabs, Ethylin Wang; Boyadjiev, Simeon A; Wilkie, Andrew O M; Passos-Bueno, Maria Rita
2006-03-01
To screen the known craniosynostosis-related gene FGFR1 (exon 7) and two newly identified potential candidates, CER1 and CDON, in patients with syndromic and nonsyndromic metopic craniosynostosis to determine whether they might be causative genes. Using single-strand conformational polymorphisms (SSCPs), denaturing high-performance liquid chromatography, and/or direct sequencing, we analyzed a total of 81 patients for FGFR1 (exon 7), 70 for CER1, and 44 for CDON. Patients were ascertained in the Centro de Estudos do Genoma Humano in São Paulo, Brazil (n = 39), the Craniofacial Unit, Oxford, U.K. (n = 23), and the Johns Hopkins University, Baltimore, Maryland (n = 31). Clinical inclusion criteria included a triangular head and/or forehead, with or without a metopic ridge, and radiographic documentation of metopic synostosis. Both syndromic and nonsyndromic patients were studied. No sequence alterations were found for FGFR1 (exon 7). Different patterns of SSCP migration for CER1, compatible with the segregation of single nucleotide polymorphisms reported in the region, were identified. Seventeen sequence alterations were detected in the coding region of CDON, seven of which are new, but segregation analysis in parents and homology studies did not indicate a pathological role. FGFR1 (exon 7), CER1, and CDON are not related to trigonocephaly in our sample and should not be considered as causative genes for metopic synostosis. Screening of FGFR1 (exon 7) for diagnostic purposes should not be performed in trigonocephalic patients.
Swine Leukocyte Antigen Diversity in Canadian Specific Pathogen-Free Yorkshire and Landrace Pigs
Gao, Caixia; Quan, Jinqiang; Jiang, Xinjie; Li, Changwen; Lu, Xiaoye; Chen, Hongyan
2017-01-01
The highly polymorphic swine major histocompatibility complex (MHC), termed swine leukocyte antigen (SLA), is associated with different levels of immunologic responses to infectious diseases, vaccines, and transplantation. Pig breeds with known SLA haplotypes are important genetic resources for biomedical research. Canadian Yorkshire and Landrace pigs represent the current specific pathogen-free (SPF) breeding stock maintained in the isolation environment at the Harbin Veterinary Research Institute, Chinese Academy of Agricultural Sciences. In this study, we identified 61 alleles at five polymorphic SLA loci (SLA-1, SLA-2, SLA-3, DRB1, and DQB1) representing 17 class I haplotypes and 11 class II haplotypes using reverse transcription-polymerase chain reaction (RT-PCR) sequence-based typing and PCR-sequence specific primers methods in 367 Canadian SPF Yorkshire and Landrace pigs. The official designation of the alleles has been assigned by the SLA Nomenclature Committee of the International Society for Animal Genetics and released in updated Immuno Polymorphism Database-MHC SLA sequence database [Release 2.0.0.3 (2016-11-03)]. The submissions confirmed some unassigned alleles and standardized nomenclatures of many previously unconfirmed alleles in the GenBank database. Three class I haplotypes, Hp-37.0, 63.0, and 73.0, appeared to be novel and have not previously been reported in other pig populations. One crossover within the class I region and two between class I and class II regions were observed, resulting in three new recombinant haplotypes. The presence of the duplicated SLA-1 locus was confirmed in three class I haplotypes Hp-28.0, Hp-35.0, and Hp-63.0. Furthermore, we also analyzed the functional diversities of 19 identified frequent SLA class I molecules in this study and confirmed the existence of four supertypes using the MHCcluster method. 
These results will be useful for studying the adaptive immune response and immunological phenotypic differences in pigs, screening potential T-cell epitopes, and developing more effective vaccines. PMID:28360911
Classification of European mtDNAs from an Analysis of Three European Populations
Torroni, A.; Huoponen, K.; Francalacci, P.; Petrozzi, M.; Morelli, L.; Scozzari, R.; Obinu, D.; Savontaus, M. L.; Wallace, D. C.
1996-01-01
Mitochondrial DNA (mtDNA) sequence variation was examined in Finns, Swedes and Tuscans by PCR amplification and restriction analysis. About 99% of the mtDNAs were subsumed within 10 mtDNA haplogroups (H, I, J, K, M, T, U, V, W, and X), suggesting that the identified haplogroups could encompass virtually all European mtDNAs. Because both hypervariable segments of the mtDNA control region were previously sequenced in the Tuscan samples, the mtDNA haplogroups and control region sequences could be compared. Using a combination of haplogroup-specific restriction site changes and control region nucleotide substitutions, the distribution of the haplogroups was surveyed through the published restriction site polymorphism and control region sequence data of Caucasoids. This supported the conclusion that most haplogroups observed in Europe are Caucasoid-specific, and that at least some of them occur at varying frequencies in different Caucasoid populations. The classification of almost all European mtDNA variation in a number of well-defined haplogroups could provide additional insights about the origin and relationships of Caucasoid populations and the process of human colonization of Europe, and is valuable for the definition of the role played by mtDNA backgrounds in the expression of pathological mtDNA mutations. PMID:8978068
Mitochondrial DNA variation of indigenous goats in Narok and Isiolo counties of Kenya.
Kibegwa, F M; Githui, K E; Jung'a, J O; Badamana, M S; Nyamu, M N
2016-06-01
Phylogenetic relationships among and genetic variability within 60 goats from two different indigenous breeds in Narok and Isiolo counties in Kenya and 22 published goat samples were analysed using mitochondrial control region sequences. The results showed that there were 54 polymorphic sites in a 481-bp sequence and 29 haplotypes were determined. The mean haplotype diversity and nucleotide diversity were 0.981 ± 0.006 and 0.019 ± 0.001, respectively. The phylogenetic analysis in combination with goat haplogroup reference sequences from GenBank showed that all goat sequences were clustered into two haplogroups (A and G), of which haplogroup A was the commonest in the two populations. A very high percentage (99.90%) of the genetic variation was distributed within the regions, and a smaller percentage (0.10%) was distributed among regions, as revealed by the analysis of molecular variance (AMOVA). These AMOVA results showed that the divergence between regions was not statistically significant. We concluded that the high levels of intrapopulation diversity in Isiolo and Narok goats and the weak phylogeographic structuring suggested that there existed strong gene flow among goat populations, probably caused by extensive transportation of goats in history. © 2015 Blackwell Verlag GmbH.
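Haplotype diversity and nucleotide diversity, the two summary statistics reported above, can be sketched as follows. This is a minimal illustration using Nei's standard definitions, not the software used in the study; the function names and the toy alignment are assumptions, and the code assumes equal-length aligned sequences without gaps.

```python
from collections import Counter
from itertools import combinations


def haplotype_diversity(seqs):
    """Nei's h = n / (n - 1) * (1 - sum(x_i^2)), with x_i the haplotype frequencies."""
    n = len(seqs)
    counts = Counter(seqs)  # identical sequences count as one haplotype
    return n / (n - 1) * (1.0 - sum((c / n) ** 2 for c in counts.values()))


def nucleotide_diversity(seqs):
    """pi = mean pairwise differences per site over all sequence pairs."""
    pairs = list(combinations(seqs, 2))
    diffs = sum(sum(a != b for a, b in zip(x, y)) for x, y in pairs)
    return diffs / (len(pairs) * len(seqs[0]))


# Toy alignment (made up): four sequences, three distinct haplotypes
toy = ["ACGT", "ACGT", "ACGA", "TCGA"]
print(haplotype_diversity(toy))   # about 0.833
print(nucleotide_diversity(toy))  # about 0.292
```

A haplotype diversity near 1 combined with a low per-site nucleotide diversity, as in the abstract, means that almost every sampled animal carries a distinct haplotype while those haplotypes differ at only a few positions.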
Intrinsic challenges in ancient microbiome reconstruction using 16S rRNA gene amplification
Ziesemer, Kirsten A.; Mann, Allison E.; Sankaranarayanan, Krithivasan; Schroeder, Hannes; Ozga, Andrew T.; Brandt, Bernd W.; Zaura, Egija; Waters-Rist, Andrea; Hoogland, Menno; Salazar-García, Domingo C.; Aldenderfer, Mark; Speller, Camilla; Hendy, Jessica; Weston, Darlene A.; MacDonald, Sandy J.; Thomas, Gavin H.; Collins, Matthew J.; Lewis, Cecil M.; Hofman, Corinne; Warinner, Christina
2015-01-01
To date, characterization of ancient oral (dental calculus) and gut (coprolite) microbiota has been primarily accomplished through a metataxonomic approach involving targeted amplification of one or more variable regions in the 16S rRNA gene. Specifically, the V3 region (E. coli 341–534) of this gene has been suggested as an excellent candidate for ancient DNA amplification and microbial community reconstruction. However, in practice this metataxonomic approach often produces highly skewed taxonomic frequency data. In this study, we use non-targeted (shotgun metagenomics) sequencing methods to better understand skewed microbial profiles observed in four ancient dental calculus specimens previously analyzed by amplicon sequencing. Through comparisons of microbial taxonomic counts from paired amplicon (V3 U341F/534R) and shotgun sequencing datasets, we demonstrate that extensive length polymorphisms in the V3 region are a consistent and major cause of differential amplification leading to taxonomic bias in ancient microbiome reconstructions based on amplicon sequencing. We conclude that systematic amplification bias confounds attempts to accurately reconstruct microbiome taxonomic profiles from 16S rRNA V3 amplicon data generated using universal primers. Because in silico analysis indicates that alternative 16S rRNA hypervariable regions will present similar challenges, we advocate for the use of a shotgun metagenomics approach in ancient microbiome reconstructions. PMID:26563586
Liu, Dewu; Zhang, Yushan; Du, Yinjun; Yang, Guanfu; Zhang, Xiquan
2007-06-01
The growth-correlated genes that are part of the neuroendocrine growth axis play crucial roles in the regulation of growth and development in pigs. The identification of genetic polymorphisms in these genes will enable scientists to evaluate the biological relevance of such polymorphisms and to gain a better understanding of quantitative traits like growth. In the present study, seven pairs of primers were designed to obtain unknown sequences of growth-correlated genes, and another 25 pairs of primers were designed to identify single nucleotide polymorphisms (SNPs) using denaturing high-performance liquid chromatography (DHPLC) technology in four pig breeds (Duroc, Landrace, Lantang and Wuzhishan) that differ significantly in growth and development characteristics. A total of 101 polymorphisms were discovered in 10,707 base pairs (bp) from six genes: ghrelin (GHRL), leptin (LEP), insulin-like growth factor II (IGF-II), insulin-like growth factor binding protein 2 (IGFBP-2), insulin-like growth factor binding protein 3 (IGFBP-3), and somatostatin (SS). The observed average distances between SNPs in the 5'UTR, coding regions, introns and 3'UTR were 134, 521, 81 and 92 bp, respectively. Four SNPs were found in the coding regions of IGF-II, IGFBP-2 and LEP. Two synonymous mutations were found in the IGF-II and LEP genes, and two non-synonymous mutations were found in the IGFBP-2 and LEP genes. Seven other mutations were also observed. Thirty-two PCR-RFLP markers were found among the 101 polymorphisms of the six genes. The SNPs discovered in this study would provide suitable markers for association studies of candidate genes with growth-related traits in pigs.
Simmons, Sheri L; Dibartolo, Genevieve; Denef, Vincent J; Goltsman, Daniela S Aliaga; Thelen, Michael P; Banfield, Jillian F
2008-07-22
Deeply sampled community genomic (metagenomic) datasets enable comprehensive analysis of heterogeneity in natural microbial populations. In this study, we used sequence data obtained from the dominant member of a low-diversity natural chemoautotrophic microbial community to determine how coexisting closely related individuals differ from each other in terms of gene sequence and gene content, and to uncover evidence of evolutionary processes that occur over short timescales. DNA sequence obtained from an acid mine drainage biofilm was reconstructed, taking into account the effects of strain variation, to generate a nearly complete genome tiling path for a Leptospirillum group II species closely related to L. ferriphilum (sampling depth approximately 20x). The population is dominated by one sequence type, yet we detected evidence for relatively abundant variants (>99.5% sequence identity to the dominant type) at multiple loci, and a few rare variants. Blocks of other Leptospirillum group II types (approximately 94% sequence identity) have recombined into one or more variants. Variant blocks of both types are more numerous near the origin of replication. Heterogeneity in genetic potential within the population arises from localized variation in gene content, typically focused in integrated plasmid/phage-like regions. Some laterally transferred gene blocks encode physiologically important genes, including quorum-sensing genes of the LuxIR system. Overall, results suggest inter- and intrapopulation genetic exchange involving distinct parental genome types and implicate gain and loss of phage and plasmid genes in recent evolution of this Leptospirillum group II population. Population genetic analyses of single nucleotide polymorphisms indicate variation between closely related strains is not maintained by positive selection, suggesting that these regions do not represent adaptive differences between strains.
Thus, the most likely explanation for the observed patterns of polymorphism is divergence of ancestral strains due to geographic isolation, followed by mixing and subsequent recombination.
Denef, Vincent J; Goltsman, Daniela S. Aliaga; Thelen, Michael P; Banfield, Jillian F
2008-01-01
Deeply sampled community genomic (metagenomic) datasets enable comprehensive analysis of heterogeneity in natural microbial populations. In this study, we used sequence data obtained from the dominant member of a low-diversity natural chemoautotrophic microbial community to determine how coexisting closely related individuals differ from each other in terms of gene sequence and gene content, and to uncover evidence of evolutionary processes that occur over short timescales. DNA sequence obtained from an acid mine drainage biofilm was reconstructed, taking into account the effects of strain variation, to generate a nearly complete genome tiling path for a Leptospirillum group II species closely related to L. ferriphilum (sampling depth ∼20×). The population is dominated by one sequence type, yet we detected evidence for relatively abundant variants (>99.5% sequence identity to the dominant type) at multiple loci, and a few rare variants. Blocks of other Leptospirillum group II types (∼94% sequence identity) have recombined into one or more variants. Variant blocks of both types are more numerous near the origin of replication. Heterogeneity in genetic potential within the population arises from localized variation in gene content, typically focused in integrated plasmid/phage-like regions. Some laterally transferred gene blocks encode physiologically important genes, including quorum-sensing genes of the LuxIR system. Overall, results suggest inter- and intrapopulation genetic exchange involving distinct parental genome types and implicate gain and loss of phage and plasmid genes in recent evolution of this Leptospirillum group II population. Population genetic analyses of single nucleotide polymorphisms indicate variation between closely related strains is not maintained by positive selection, suggesting that these regions do not represent adaptive differences between strains. 
Thus, the most likely explanation for the observed patterns of polymorphism is divergence of ancestral strains due to geographic isolation, followed by mixing and subsequent recombination. PMID:18651792
Talukder, Zahirul I; Hulke, Brent S; Qi, Lili; Scheffler, Brian E; Pegadaraju, Venkatramana; McPhee, Kevin; Gulya, Thomas J
2014-01-01
Functional markers for Sclerotinia basal stalk rot resistance in sunflower were obtained using gene-level information from the model species Arabidopsis thaliana. Sclerotinia stalk rot, caused by Sclerotinia sclerotiorum, is one of the most destructive diseases of sunflower (Helianthus annuus L.) worldwide. Markers for genes controlling resistance to S. sclerotiorum will enable efficient marker-assisted selection (MAS). We sequenced eight candidate genes homologous to Arabidopsis thaliana defense genes known to be associated with Sclerotinia disease resistance in a sunflower association mapping population evaluated for Sclerotinia stalk rot resistance. The total candidate gene sequence regions covered a concatenated length of 3,791 bp per individual. A total of 187 polymorphic sites were detected for all candidate gene sequences, 149 of which were single nucleotide polymorphisms (SNPs) and 38 were insertions/deletions. Eight SNPs in the coding regions led to changes in amino acid codons. Linkage disequilibrium throughout the candidate gene regions decayed on average to r² = 0.2 over intervals of 120 bp, but extended up to 350 bp at r² = 0.1. A general linear model with modification to account for population structure was found to be the best-fitting model for this population and was used for association mapping. Both HaCOI1-1 and HaCOI1-2 were found to be strongly associated with Sclerotinia stalk rot resistance and explained 7.4% of phenotypic variation in this population. These SNP markers associated with Sclerotinia stalk rot resistance can potentially be applied to the selection of favorable genotypes, which will significantly improve the efficiency of MAS during the development of stalk rot resistant cultivars.
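The r² measure of linkage disequilibrium quoted above can be sketched for a pair of biallelic sites as follows. This is an illustration under stated assumptions: phased haplotypes with 0/1 allele coding, the standard definition r² = D² / (pA(1-pA)pB(1-pB)), and hypothetical names and data. The abstract does not describe how r² was estimated from the sunflower sequences.

```python
def ld_r2(haplotypes):
    """r^2 between two biallelic sites.

    haplotypes: list of (allele_site1, allele_site2) pairs coded 0/1,
    one pair per phased haplotype (an assumption of this sketch).
    """
    n = len(haplotypes)
    p_a = sum(h[0] for h in haplotypes) / n          # frequency of allele 1 at site 1
    p_b = sum(h[1] for h in haplotypes) / n          # frequency of allele 1 at site 2
    p_ab = sum(1 for h in haplotypes if h[0] == 1 and h[1] == 1) / n
    d = p_ab - p_a * p_b                             # disequilibrium coefficient D
    denom = p_a * (1 - p_a) * p_b * (1 - p_b)
    if denom == 0:
        raise ValueError("r^2 is undefined when a site is monomorphic")
    return d * d / denom


# Hypothetical haplotype samples (illustration only).
complete_ld = [(1, 1), (1, 1), (0, 0), (0, 0)]
no_ld = [(1, 1), (1, 0), (0, 1), (0, 0)]
print(ld_r2(complete_ld), ld_r2(no_ld))
```

An LD decay curve like the one summarized in the abstract would come from computing this value for many site pairs and relating it to the distance between them.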
Tsuda, K; Kikkawa, Y; Yonekawa, H; Tanabe, Y
1997-08-01
To test the hypothesis that the domestic dogs are derived from several different ancestral gray wolf populations, we compared the sequence of the displacement (D)-loop region of the mitochondrial DNA (mtDNA) from 24 breeds of domestic dog (34 individual dogs) and 3 subspecies of gray wolf (Canis lupus lupus, C.l. pallipes and C.l. chanco; 19 individuals). The intraspecific sequence variations within domestic dogs (0.00-3.19%) and within wolves (0.00-2.88%) were comparable to the interspecific variations between domestic dogs and wolves (0.30-3.35%). A repetitive sequence with repeat units (TACACGTA/GCG) that causes the size variation in the D-loop region was also found in both dogs and wolves. However, no nucleotide substitutions or repetitive arrays were specific for domestic dogs or for wolves. These results showed that there is a close genetic relationship between dogs and wolves. Two major clades appeared in the phylogenetic trees constructed by neighbor-joining and by the maximum parsimony method; one clade containing Chinese wolf (C.l. chanco) showed extensive variations while the other showed only slight variation. This showed that there were two major genetic components both in domestic dogs and in wolves. However, neither clades nor haplotypes specific for any dog breed were observed, whereas subspecies-specific clades were found in Asiatic wolves. These results suggested that the extant breeds of domestic dogs have maintained a large degree of mtDNA polymorphisms introduced from their ancestral wolf populations, and that extensive interbreedings had occurred among multiple matriarchal origins.
Yong, Rita Y Y; Gan, Linda S H; Chang, Yuet Meng; Yap, Eric P H
2007-11-01
Amelogenin paralogs on Chromosome X (AMELX) and Y (AMELY) are commonly used sexing markers. Interstitial deletion of Yp involving the AMELY locus has previously been reported. In the combined Singapore and Malaysia populations, the frequency of the AMELY null allele is 2.7% and 0.6% in the Indian and Malay ethnic groups, respectively. It is absent among 541 Chinese screened. The null allele in this study belongs to 3 Y haplogroups: J2e1 (85.7%), F* (9.5%) and D* (4.8%). Low and high-resolution STS mapping, followed by sequence analysis of the breakpoint junction, confirmed a large deletion of 3 to 3.7 Mb located at the Yp11.2 region. Both breakpoints were located in TSPY repeat arrays, suggesting a non-allelic homologous recombination (NAHR) mechanism of deletion. All regional null samples shared identical breakpoint sequences according to their haplogroup affiliation, providing molecular evidence of a common ancestral origin for each haplogroup and indicating that at least 3 independent deletion events occurred in history. The estimated ages based on Y-SNP and STR analysis were approximately 13.5 ± 3.1 kyears and approximately 0.9 ± 0.9 kyears for the J2e1 and F* mutations, respectively. A novel polymorphism G > A at the Y-GATA-H4 locus in complete linkage disequilibrium with J2e1 null mutations is a more recent event. This work re-emphasizes the need to include other sexing markers for gender determination in certain regional populations. The frequency difference among global populations suggests it constitutes another structural variation locus of human chromosome Y. The breakpoint sequences contribute to a better understanding of the NAHR mechanism and DNA rearrangements due to higher order genomic architecture.
Genotyping-by-sequencing enables linkage mapping in three octoploid cultivated strawberry families
Salinas, Natalia; Tennessen, Jacob A.; Zurn, Jason D.; Sargent, Daniel James; Hancock, James; Bassil, Nahla V.
2017-01-01
Genotyping-by-sequencing (GBS) was used to survey genome-wide single-nucleotide polymorphisms (SNPs) in three biparental strawberry (Fragaria × ananassa) populations with the goal of evaluating this technique in a species with a complex octoploid genome. GBS sequence data were aligned to the F. vesca ‘Fvb’ reference genome in order to call SNPs. Numbers of polymorphic SNPs per population ranged from 1,163 to 3,190. Linkage maps consisting of 30–65 linkage groups were produced from the SNP sets derived from each parent. The linkage groups covered 99% of the Fvb reference genome, with three to seven linkage groups from a given parent aligned to any particular chromosome. A phylogenetic analysis performed using the POLiMAPS pipeline revealed linkage groups that were most similar to ancestral species F. vesca for each chromosome. Linkage groups that were most similar to a second ancestral species, F. iinumae, were only resolved for Fvb 4. The quantity of missing data and heterogeneity in genome coverage inherent in GBS complicated the analysis, but POLiMAPS resolved F. × ananassa chromosomal regions derived from diploid ancestor F. vesca. PMID:28875078
Meadows, J R S; Kijas, J W
2009-02-01
The male-specific region of the ovine Y chromosome (MSY) remains poorly characterized, yet sequence variants from this region have the potential to reveal the wild progenitor of domestic sheep or examples of domestic and wild paternal introgression. The 5' promoter region of the sex-determining gene SRY was re-sequenced using a subset of wild sheep including bighorn (Ovis canadensis), thinhorn (Ovis dalli spp.), urial (Ovis vignei), argali (Ovis ammon), mouflon (Ovis musimon) and domestic sheep (Ovis aries). Seven novel SNPs (oY2-oY8) were revealed; these were polymorphic between but not within species. Re-sequencing and fragment analysis were applied to the MSY microsatellite SRYM18. It contains a complex compound repeat structure, and sequencing of three novel size fragments revealed that a pentanucleotide element remained fixed, whilst a dinucleotide element displayed variability within species. Comparison of the sequence between species revealed that urial and argali sheep grouped more closely to the mouflon and domestic breeds than the pachyceriforms (bighorn and thinhorn). SNP and microsatellite data were combined to define six previously undetected haplotypes. Analysis revealed the mouflon as the only species to share a haplotype with domestic sheep, consistent with its status as a feral domesticate that has undergone male-mediated exchange with domestic animals. A comparison of the remaining wild species and domestic sheep revealed that O. aries is free from signatures of wild sheep introgression.
Li, Jian; Chen, Jiangtao; Xie, Dongde; Eyi, Urbano Monsuy; Matesa, Rocio Apicante; Ondo Obono, Maximo Miko; Ehapo, Carlos Sala; Yang, Liye; Yang, Huitian; Lin, Min
2016-01-01
Objective: With the emergence and geographic expansion of antimalarial resistance worldwide, molecular markers are an essential tool for surveillance of resistant Plasmodium parasites. Recently, single-nucleotide polymorphisms (SNPs) in the PF3D7_1343700 kelch propeller (K13-propeller) domain have been shown to be associated with artemisinin (ART) resistance in vivo and in vitro. This study aims to investigate the ART resistance-associated polymorphisms of K13-propeller and PfATPase6 genes in Plasmodium falciparum isolates from Bioko Island, Equatorial Guinea (EG). Methods: A total of 172 samples were collected from falciparum malaria patients on Bioko Island between 2013 and 2014. The polymorphisms of K13-propeller and PfATPase6 genes were analyzed by nested PCR and sequencing. Results: Sequences of K13-propeller and PfATPase6 were obtained from 90.74% (98/108) and 91.45% (139/152) of samples, respectively. Of the K13-propeller sequences, 2.04% (2/98) had the non-synonymous A578S mutation, but none of the mutations associated with ART resistance in Southeast Asia were found. For PfATPase6, mutations were found at positions N569K and A630S with prevalences of 7.91% (11/139) and 1.44% (2/139), respectively. In addition, one sample with a mixed type at position I723V was discovered (0.72%, 1/139). Conclusions: This study offers initial insight into K13-propeller and PfATPase6 polymorphisms on Bioko Island, EG. It suggests no widespread ART resistance or tolerance in the region, and might be helpful for developing and updating guidance for the use of ART-based combination therapies (ACTs). PMID:27054064
Li, Jian; Chen, Jiangtao; Xie, Dongde; Eyi, Urbano Monsuy; Matesa, Rocio Apicante; Ondo Obono, Maximo Miko; Ehapo, Carlos Sala; Yang, Liye; Yang, Huitian; Lin, Min
2016-04-01
With the emergence and geographic expansion of antimalarial resistance worldwide, molecular markers are an essential tool for surveillance of resistant Plasmodium parasites. Recently, single-nucleotide polymorphisms (SNPs) in the PF3D7_1343700 kelch propeller (K13-propeller) domain have been shown to be associated with artemisinin (ART) resistance in vivo and in vitro. This study aims to investigate the ART resistance-associated polymorphisms of K13-propeller and PfATPase6 genes in Plasmodium falciparum isolates from Bioko Island, Equatorial Guinea (EG). A total of 172 samples were collected from falciparum malaria patients on Bioko Island between 2013 and 2014. The polymorphisms of K13-propeller and PfATPase6 genes were analyzed by nested PCR and sequencing. Sequences of K13-propeller and PfATPase6 were obtained from 90.74% (98/108) and 91.45% (139/152) of samples, respectively. Of the K13-propeller sequences, 2.04% (2/98) had the non-synonymous A578S mutation, but none of the mutations associated with ART resistance in Southeast Asia were found. For PfATPase6, mutations were found at positions N569K and A630S with prevalences of 7.91% (11/139) and 1.44% (2/139), respectively. In addition, one sample with a mixed type at position I723V was discovered (0.72%, 1/139). This study offers initial insight into K13-propeller and PfATPase6 polymorphisms on Bioko Island, EG. It suggests no widespread ART resistance or tolerance in the region, and might be helpful for developing and updating guidance for the use of ART-based combination therapies (ACTs).
Saxena, Raghvendra; Chandra, Amaresh
2011-11-01
Transferability of sequence-tagged-site (STS) markers was assessed for studying genetic relationships among accessions of marvel grass (Dichanthium annulatum Forsk.). In total, 17 STS primers of Stylosanthes origin were tested for their reactivity with thirty accessions of Dichanthium annulatum. Of these, 14 (82.4%) reacted and a total of 106 bands (84 polymorphic) were scored. The number of bands generated by individual primer pairs ranged from 4 to 11 with an average of 7.57 bands, whereas polymorphic bands ranged from 4 to 9 with an average of 6.0 bands, corresponding to an average polymorphism of 80.1%. Polymorphic information content (PIC) ranged from 0.222 to 0.499 and marker index (MI) from 1.33 to 4.49. A dendrogram was generated from the Dice coefficient of genetic similarity using the unweighted pair group method with arithmetic mean (UPGMA) algorithm. Further, clustering through the sequential agglomerative hierarchical and nested (SAHN) method resulted in three main clusters comprising all accessions except IGBANG-D-2. Though a few accessions from one agro-climatic region intermixed with those of another, accessions largely grouped with their regions of collection. Bootstrap analysis with 1000 replicates also showed a large number of nodes (11 to 17) having strong clustering (> 50). Thus, the results demonstrate the utility of STS markers of Stylosanthes in studying the genetic relationships among accessions of Dichanthium.
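The Dice coefficient used above to compare accessions can be sketched for two band profiles scored as presence (1) or absence (0). The formula 2a / (2a + b + c), where a counts shared bands and b and c count bands unique to each profile, is the standard definition. The function name and the two profiles are hypothetical and are not taken from the study.

```python
def dice_similarity(profile_a, profile_b):
    """Dice similarity between two 0/1 band profiles of equal length."""
    if len(profile_a) != len(profile_b):
        raise ValueError("profiles must score the same set of bands")
    shared = sum(1 for x, y in zip(profile_a, profile_b) if x == 1 and y == 1)
    total = sum(profile_a) + sum(profile_b)  # equals 2a + b + c
    if total == 0:
        raise ValueError("no bands present in either profile")
    return 2 * shared / total


# Hypothetical band scores for two accessions (illustration only).
acc1 = [1, 1, 0, 1, 0]
acc2 = [1, 0, 0, 1, 1]
print(dice_similarity(acc1, acc2))
```

A full analysis would compute this for every pair of accessions and pass the resulting matrix to a clustering method such as UPGMA.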
Rathinasabapathi, Pasupathi; Purushothaman, Natarajan; Parani, Madasamy
2016-05-01
Although the rice genome was sequenced in 2002, efforts to resequence the large number of available accessions, landraces, traditional cultivars, and improved varieties of this important food crop are limited. We have initiated resequencing of the traditional cultivars from India. Kavuni is an important traditional rice cultivar from South India that commands a premium price for its nutritional and therapeutic properties. Whole-genome sequencing of Kavuni using the Illumina platform and SNP analysis against the Nipponbare reference genome identified 1 150 711 SNPs, of which 377 381 SNPs were located in the genic regions. Non-synonymous SNPs (62 708) were distributed in 19 251 genes, and their number varied between 1 and 115 per gene. Large-effect DNA polymorphisms (7769) were present in 3475 genes. Pathway mapping of these polymorphisms revealed the involvement of genes related to carbohydrate metabolism, translation, protein-folding, and cell death. Analysis of the starch biosynthesis related genes revealed that the granule-bound starch synthase I gene had T/G SNPs at the first intron/exon junction and a two-nucleotide combination, which were reported to favour high amylose content and low glycemic index. The present study provided a valuable genomics resource to study the rice varieties with nutritional and medicinal properties.
Systematic analysis of protein identity between Zika virus and other arthropod-borne viruses.
Chang, Hsiao-Han; Huber, Roland G; Bond, Peter J; Grad, Yonatan H; Camerini, David; Maurer-Stroh, Sebastian; Lipsitch, Marc
2017-07-01
We aimed to analyse the proportions of protein identity between Zika virus and dengue, Japanese encephalitis, yellow fever, West Nile and chikungunya viruses as well as polymorphism between different Zika virus strains. We used published protein sequences for the Zika virus and obtained protein sequences for the other viruses from the National Center for Biotechnology Information (NCBI) protein database or the NCBI virus variation resource. We used BLASTP to find regions of identity between viruses. We quantified the identity between the Zika virus and each of the other viruses, as well as within-Zika virus polymorphism for all amino acid k-mers across the proteome, with k ranging from 6 to 100. We assessed accessibility of protein fragments by calculating the solvent accessible surface area for the envelope and nonstructural-1 (NS1) proteins. In total, we identified 294 Zika virus protein fragments with both low proportion of identity with other viruses and low levels of polymorphisms among Zika virus strains. The list includes protein fragments from all Zika virus proteins, except NS3. NS4A has the highest number (190 k-mers) of protein fragments on the list. We provide a candidate list of protein fragments that could be used when developing a sensitive and specific serological test to detect previous Zika virus infections.
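The idea of scanning amino acid k-mers for sharing between two proteins can be sketched as below. This is a deliberate simplification: it counts exact k-mer matches, whereas the study reports using BLASTP to find regions of identity. The function names and the two short peptide strings are hypothetical and are not real viral sequences.

```python
def kmers(seq, k):
    """Set of all length-k substrings of seq (empty if seq is shorter than k)."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def shared_kmer_fraction(query, other, k):
    """Fraction of distinct k-mers in `query` that also occur exactly in `other`."""
    query_kmers = kmers(query, k)
    if not query_kmers:
        raise ValueError("query is shorter than k")
    return len(query_kmers & kmers(other, k)) / len(query_kmers)


# Hypothetical peptide fragments differing at one position (illustration only).
query_fragment = "MKNPKKKSGG"
other_fragment = "MKNPKRKSGG"
for k in (3, 6):
    print(k, shared_kmer_fraction(query_fragment, other_fragment, k))
```

In this toy case the fraction drops as k grows, because a single substitution breaks every longer k-mer that spans it; fragments with a low shared fraction are the kind of candidates the abstract describes.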
Characterization of the COL2A1 VNTR polymorphism
DOE Office of Scientific and Technical Information (OSTI.GOV)
Berg, E.S.; Olaisen, B.
1993-05-01
The variable number of tandem repeat (VNTR) region 3′ to the collagen type II gene (COL2A1) was amplified in vitro by the polymerase chain reaction. Subsequent high-resolution gel electrophoresis showed that the five earlier reported alleles could be further subtyped. A total of 17 allelic variants with a heterozygosity of 73.0% were found in 202 unrelated Norwegians. DNA sequencing of 19 COL2A1 alleles has been performed. The internal organization of the VNTR was common for all alleles, as previously shown for a few alleles. Moreover, the polymorphism in the COL2A1 locus is mainly due to variation in the numbers of copies of two repeat units, containing 34 and 31 bp, respectively, and/or to small deletions in either of the two units. DNA sequencing of alleles with the same electrophoretic size revealed no heterogeneity such as an alternating order of the different units, a feature that might have been expected to be the result of unequal crossing-over events. The observed ordered structure of the VNTR and the possibility of single-stranded DNA from the cores in the VNTR forming hairpins and loops suggest that the COL2A1 polymorphism may have evolved mainly by replication slippage mechanisms. 23 refs., 2 figs., 3 tabs.
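A heterozygosity figure such as the one above is often summarized as expected heterozygosity, H = 1 - Σ p_i², computed from allele frequencies. The sketch below assumes that definition; the abstract does not state whether the 73.0% is an observed or an expected value. The function name and allele counts are hypothetical.

```python
def expected_heterozygosity(allele_counts):
    """Expected heterozygosity H = 1 - sum(p_i^2) from a mapping allele -> count."""
    total = sum(allele_counts.values())
    if total == 0:
        raise ValueError("no alleles counted")
    return 1.0 - sum((count / total) ** 2 for count in allele_counts.values())


# Hypothetical allele counts at a multi-allelic locus (illustration only).
toy_counts = {"A1": 50, "A2": 30, "A3": 20}
print(expected_heterozygosity(toy_counts))
```

With many alleles at moderate frequencies, as at a VNTR locus, the sum of squared frequencies is small and H approaches 1.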
Erturk, Elif; Cecener, Gulsah; Polatkan, Volkan; Gokgoz, Sehsuvar; Egeli, Unal; Tunca, Berrin; Tezcan, Gulcin; Demirdogen, Elif; Ak, Secil; Tasdelen, Ismet
2014-01-01
Although genetic markers identifying women at an increased risk of developing breast cancer exist, the majority of inherited risk factors remain elusive. Mutations in the BRCA1/BRCA2 genes confer a substantial increase in breast cancer risk, yet routine clinical genetic screening is limited to the coding regions and intron-exon boundaries, precluding the identification of mutations in noncoding and untranslated regions. Because 3' untranslated region (3'UTR) polymorphisms disrupting microRNA (miRNA) binding can be functional and can act as genetic markers of cancer risk, we aimed to determine genetic variation in the 3'UTR of BRCA1/BRCA2 in familial and early-onset breast cancer patients with and without mutations in the coding regions of BRCA1/BRCA2 and to identify specific 3'UTR variants that may be risk factors for cancer development. The 3'UTRs of the BRCA1 and BRCA2 genes were screened by heteroduplex analysis and DNA sequencing in 100 patients (from 46 BRCA1/2 families and 54 non-BRCA1/2 families) and 47 geographically matched controls. Two polymorphisms were identified. SNPs c.*1287C>T (rs12516) (BRCA1) and c.*105A>C (rs15869) (BRCA2) were identified in 27% and 24% of patients, respectively. These 2 variants were also identified in controls with no family history of cancer (23.4% and 23.4%, respectively). When variations in the 3'UTR of the BRCA1/2 genes were compared with BRCA1/2 mutational status in patients, there was a statistically significant relationship between the BRCA1 gene polymorphism c.*1287C>T (rs12516) and BRCA1 mutations (p=0.035) by Fisher's Exact Test. SNP c.*1287C>T (rs12516) of the BRCA1 gene may have potential use as a genetic marker of an increased risk of developing breast cancer and likely represents a non-coding sequence variation in BRCA1 that impacts BRCA1 function and leads to increased early-onset and/or familial breast cancer risk in the Turkish population.
Genomic Variation in Natural Populations of Drosophila melanogaster
Langley, Charles H.; Stevens, Kristian; Cardeno, Charis; Lee, Yuh Chwen G.; Schrider, Daniel R.; Pool, John E.; Langley, Sasha A.; Suarez, Charlyn; Corbett-Detig, Russell B.; Kolaczkowski, Bryan; Fang, Shu; Nista, Phillip M.; Holloway, Alisha K.; Kern, Andrew D.; Dewey, Colin N.; Song, Yun S.; Hahn, Matthew W.; Begun, David J.
2012-01-01
This report of independent genome sequences of two natural populations of Drosophila melanogaster (37 from North America and 6 from Africa) provides unique insight into forces shaping genomic polymorphism and divergence. Evidence of interactions between natural selection and genetic linkage is abundant not only in centromere- and telomere-proximal regions, but also throughout the euchromatic arms. Linkage disequilibrium, which decays within 1 kbp, exhibits a strong bias toward coupling of the more frequent alleles and provides a high-resolution map of recombination rate. The juxtaposition of population genetics statistics in small genomic windows with gene structures and chromatin states yields a rich, high-resolution annotation, including the following: (1) 5′- and 3′-UTRs are enriched for regions of reduced polymorphism relative to lineage-specific divergence; (2) exons overlap with windows of excess relative polymorphism; (3) epigenetic marks associated with active transcription initiation sites overlap with regions of reduced relative polymorphism and relatively reduced estimates of the rate of recombination; (4) the rate of adaptive nonsynonymous fixation increases with the rate of crossing over per base pair; and (5) both duplications and deletions are enriched near origins of replication and their density correlates negatively with the rate of crossing over. Available demographic models of X and autosome descent cannot account for the increased divergence on the X and loss of diversity associated with the out-of-Africa migration. Comparison of the variation among these genomes to variation among genomes from D. simulans suggests that many targets of directional selection are shared between these species. PMID:22673804
Wone, Bernard W M; Yim, Won C; Schutz, Heidi; Meek, Thomas H; Garland, Theodore
2018-04-04
Mitochondrial haplotypes have been associated with human and rodent phenotypes, including nonshivering thermogenesis capacity, learning capability, and disease risk. Although the mammalian mitochondrial D-loop is highly polymorphic, D-loops in laboratory mice are identical, and variation occurs elsewhere mainly between nucleotides 9820 and 9830. Part of this region codes for the tRNA-Arg gene and is associated with mitochondrial densities and number of mtDNA copies. We hypothesized that the capacity for high levels of voluntary wheel-running behavior would be associated with mitochondrial haplotype. Here, we analyzed the mtDNA polymorphic region in mice from each of four replicate lines selectively bred for 54 generations for high voluntary wheel running (HR) and from four control lines (Control) randomly bred for 54 generations. Sequencing the polymorphic region revealed a variable number of adenine repeats. Single nucleotide polymorphisms (SNPs) varied from 2 to 3 adenine insertions, resulting in three haplotypes. We found significant genetic differentiations between the HR and Control groups (F_ST = 0.779, p ≤ 0.0001), as well as among the replicate lines of mice within groups (F_SC = 0.757, p ≤ 0.0001). Haplotypes, however, were not strongly associated with voluntary wheel running (revolutions run per day), nor with either body mass or litter size. This system provides a useful experimental model to dissect the physiological processes linking mitochondrial, genomic SNPs, epigenetics, or nuclear-mitochondrial cross-talk to exercise activity. Copyright © 2018. Published by Elsevier B.V.
Chen, Zhengshuai; Li, Jingjie; Chen, Peng; Wang, Fengjiao; Zhang, Ning; Yang, Min; Jin, Tianbo; Chen, Chao
2016-09-01
1. Detection of CYP3A5 variant alleles, and knowledge about their allelic frequency in Uyghur ethnic groups, is important to establish the clinical relevance of screening for these polymorphisms to optimize pharmacotherapy. 2. We used DNA sequencing to investigate the promoter, exons and surrounding introns, and 3'-untranslated region of the CYP3A5 gene in 96 unrelated healthy Uyghur individuals. We also used SIFT and PolyPhen-2 to predict the protein function of the novel non-synonymous mutation in CYP3A5 coding regions. 3. We found 24 different CYP3A5 polymorphisms in the Uyghur population, three of which were novel: the synonymous mutation 43C > T in exon 1 and two mutations, 32120C > G and 32245T > C, in the 3'-untranslated region. The allele frequencies of CYP3A5*1 and *3 were 64.58% and 35.42%, respectively, while no subjects with CYP3A5*6 were identified. Other identified genotypes included the heterozygous genotypes 1A/3A (59.38%) and 1A/3E (11.46%), which lead to decreased enzyme activity. In addition, haplotype "TTAGGT" was the most prevalent, with a frequency of 0.781. 4. Our data provide new information regarding CYP3A5 genetic polymorphisms in Uyghur individuals, which may help to improve individualization of drug therapy and offer a preliminary basis for more rational use of drugs.
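The allele frequencies quoted in this abstract are simple proportions of allele copies. The sketch below shows that arithmetic under stated assumptions: the counts of 124 and 68 copies are not reported in the abstract; they are back-calculated here on the assumption that all 96 individuals were typed at a diploid locus (192 chromosomes), and they happen to reproduce the reported percentages. The function name is hypothetical.

```python
def allele_frequencies(allele_counts):
    """Convert a mapping of allele -> number of copies into frequencies."""
    total = sum(allele_counts.values())
    if total == 0:
        raise ValueError("no allele copies counted")
    return {allele: count / total for allele, count in allele_counts.items()}


# Assumed counts (not from the source): 96 diploid individuals = 192 copies.
assumed_counts = {"CYP3A5*1": 124, "CYP3A5*3": 68}
for allele, freq in allele_frequencies(assumed_counts).items():
    print(f"{allele}: {freq * 100:.2f}%")
```

With these assumed counts, the proportions round to 64.58% and 35.42%.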
Valderrama-Aguirre, Augusto; Zúñiga-Soto, Evelin; Mariño-Ramírez, Leonardo; Moreno, Luz Ángela; Escalante, Ananías A.; Arévalo-Herrera, Myriam; Herrera, Sócrates
2011-01-01
Merozoite surface protein 1 (MSP-1) is a polymorphic malaria protein with functional domains involved in parasite erythrocyte interaction. Plasmodium vivax MSP-1 has a fragment (Pv200L) that has been identified as a potential subunit vaccine because it is highly immunogenic and induces partial protection against infectious parasite challenge in vaccinated monkeys. To determine the extent of genetic polymorphism and its effect on the translated protein, we sequenced the Pv200L coding region from isolates of 26 P. vivax-infected patients in a malaria-endemic area of Colombia. The extent of nucleotide diversity (π) in these isolates (0.061 ± 0.004) was significantly lower (P ≤ 0.001) than that observed in Thai and Brazilian isolates; 0.083 ± 0.006 and 0.090 ± 0.006, respectively. We found two new alleles and several previously unidentified dimorphic substitutions and significant size polymorphism. The presence of highly conserved blocks in this fragment has important implications for the development of Pv200L as a subunit vaccine candidate. PMID:21292880
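Nucleotide diversity (π) is commonly defined as the average number of pairwise differences per site among a set of aligned sequences. The following is a minimal sketch of that definition, assuming equal-length, pre-aligned sequences and treating any mismatching character (including a gap) as a difference. It is not the software or the exact estimator used in the study, and the toy sequences are invented for illustration.

```python
from itertools import combinations


def nucleotide_diversity(seqs):
    """Average pairwise differences per site for aligned sequences."""
    if len(seqs) < 2:
        raise ValueError("need at least two sequences")
    length = len(seqs[0])
    if length == 0 or any(len(s) != length for s in seqs):
        raise ValueError("sequences must be aligned to the same non-zero length")
    total_diff = 0
    n_pairs = 0
    for a, b in combinations(seqs, 2):
        # Count mismatching positions for this pair of sequences.
        total_diff += sum(x != y for x, y in zip(a, b))
        n_pairs += 1
    return total_diff / (n_pairs * length)


# Toy alignment (hypothetical): three sequences of four sites.
toy = ["ACGT", "ACGA", "ACGT"]
print(nucleotide_diversity(toy))
```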
Casillas, Rosario; Tabernero, David; Gregori, Josep; Belmonte, Irene; Cortese, Maria Francesca; González, Carolina; Riveiro-Barciela, Mar; López, Rosa Maria; Quer, Josep; Esteban, Rafael; Buti, Maria; Rodríguez-Frías, Francisco
2018-01-01
AIM To determine the variability/conservation of the domain of hepatitis B virus (HBV) preS1 region that interacts with sodium-taurocholate cotransporting polypeptide (hereafter, NTCP-interacting domain) and the prevalence of the rs2296651 polymorphism (S267F, NTCP variant) in a Spanish population. METHODS Serum samples from 246 individuals were included and divided into 3 groups: patients with chronic HBV infection (CHB) (n = 41, 73% Caucasians), patients with resolved HBV infection (n = 100, 100% Caucasians) and an HBV-uninfected control group (n = 105, 100% Caucasians). Variability/conservation of the amino acid (aa) sequences of the NTCP-interacting domain, (aa 2-48 in viral genotype D) and a highly conserved preS1 domain associated with virion morphogenesis (aa 92-103 in viral genotype D) were analyzed by next-generation sequencing and compared in 18 CHB patients with viremia > 4 log IU/mL. The rs2296651 polymorphism was determined in all individuals in all 3 groups using an in-house real-time PCR melting curve analysis. RESULTS The HBV preS1 NTCP-interacting domain showed a high degree of conservation among the examined viral genomes especially between aa 9 and 21 (in the genotype D consensus sequence). As compared with the virion morphogenesis domain, the NTCP-interacting domain had a smaller proportion of HBV genotype-unrelated changes comprising > 1% of the quasispecies (25.5% vs 31.8%), but a larger proportion of genotype-associated viral polymorphisms (34% vs 27.3%), according to consensus sequences from GenBank patterns of HBV genotypes A to H. Variation/conservation in both domains depended on viral genotype, with genotype C being the most highly conserved and genotype E the most variable (limited finding, only 2 genotype E included). Of note, proline residues were highly conserved in both domains, and serine residues showed changes only to threonine or tyrosine in the virion morphogenesis domain. 
The rs2296651 polymorphism was not detected in any participant. CONCLUSION In our CHB population, the NTCP-interacting domain was highly conserved, particularly the proline residues and essential amino acids related with the NTCP interaction, and the prevalence of rs2296651 was low/null. PMID:29456407
Schwelm, Arne; Berney, Cédric; Dixelius, Christina; Bass, David; Neuhauser, Sigrid
2016-12-01
Clubroot disease caused by Plasmodiophora brassicae is one of the most important diseases of cultivated brassicas. P. brassicae occurs in pathotypes which differ in their aggressiveness towards their Brassica host plants. To date, no DNA-based method to distinguish these pathotypes has been described. In 2011, polymorphism within the 28S rDNA of P. brassicae was reported that could potentially allow pathotypes to be distinguished without the need for time-consuming bioassays. However, isolates of P. brassicae from around the world analysed in this study do not show polymorphism in their LSU rDNA sequences. The previously described polymorphism most likely derived from soil-inhabiting Cercozoa, more specifically Neoheteromita-like glissomonads. Here we correct the LSU rDNA sequence of P. brassicae. By using FISH we demonstrate that our newly generated sequence belongs to the causal agent of clubroot disease. Copyright © 2016 The Authors. Published by Elsevier GmbH. All rights reserved.
Typing of artiodactyl MHC-DRB genes with the help of intronic simple repeated DNA sequences.
Schwaiger, F W; Buitkamp, J; Weyers, E; Epplen, J T
1993-02-01
An efficient oligonucleotide typing method for the highly polymorphic MHC-DRB genes is described for artiodactyls like cattle, sheep and goat. By means of the polymerase chain reaction, the second exon of MHC-DRB is amplified as well as part of the adjacent intron containing a mixed simple repeat sequence. Using this primer combination we were able to amplify the MHC-DRB exons 2 and adjacent introns from all of the investigated 10 species of the family of Bovidae and giraffes. Therefore, the DRB genes of novel artiodactyl species can also be readily studied. Oligonucleotide probes specific for the polymorphisms of ungulate DRB genes are used with which sequences differing in at least one single base can be distinguished. Exonic polymorphism was found to be correlated with the allele lengths and the patterns of the repeat structures. Hence oligonucleotide probes specific for different simple repeats and polymorphic positions serve also for typing across species barriers. The strict correlation of sequence length and exonic polymorphism permits a preselection of specific oligonucleotides for hybridization. Thus more than 20 alleles can already be differentiated from each of the three species.
[Divergence of paralogous growth-hormone-encoding genes and their promoters in Salmonidae].
Kamenskaya, D N; Pankova, M V; Atopkin, D M; Brykov, V A
2017-01-01
In many fish species, including salmonids, growth hormone is encoded by two duplicated paralogous genes, gh1 and gh2. Both genes were already in place at the time of divergence of species in this group. A comparison of the entire sequence of these genes in salmonids has shown that their conserved regions are associated with exons, while their most variable regions correspond to introns. Introns C and D include putative regulatory elements (sites Pit-1, CRE, and ERE) that are also conserved. In chars, the degree of polymorphism of the gh2 gene is 2-3 times as large as that of the gh1 gene. However, a comparison across all Salmonidae species does not extend this observation to other species. In both of these genes in chars, the promoters are conserved mainly because they correspond to putative regulatory sequences (TATA box, binding sites for the pituitary transcription factor Pit-1 (F1-F4), CRE, GRE and RAR/RXR elements). The promoter of the gh2 gene has a greater degree of polymorphism than the gh1 gene promoter in all investigated species of salmonids. The observed differences in the rates of accumulation of changes in the growth-hormone-encoding paralogs could be explained by differences in the intensity of selection.
Identification of a polymorphic collagen-like protein in the crustacean bacteria Pasteuria ramosa.
Mouton, Laurence; Traunecker, Emmanuel; McElroy, Kerensa; Du Pasquier, Louis; Ebert, Dieter
2009-12-01
Pasteuria ramosa is a spore-forming bacterium that infects Daphnia species. Previous results demonstrated a high specificity of host clone/parasite genotype interactions. Surface proteins of bacteria often play an important role in attachment to host cells prior to infection. We analyzed surface proteins of P. ramosa spores by two-dimensional gel electrophoresis. For the first time, we prove that two isolates selected for their differences in infectivity reveal few but clear-cut differences in protein patterns. Using internal sequencing and LC/MS/MS, we identified a collagen-like protein named Pcl1a (Pasteuria collagen-like protein 1a). This protein, reconstructed with the help of Pasteuria genome sequences, contains three domains: a 75-amino-acid amino-terminal domain with a potential transmembrane helix domain, a central collagen-like region (CLR) containing Gly-Xaa-Yaa (GXY) repeats, and a 7-amino-acid carboxy-terminal domain. The CLR region is polymorphic among the two isolates with amino-acid substitutions and a variable number of GXY triplets. Collagen-like proteins are rare in prokaryotes, although they have been described in several pathogenic bacteria, including Bacillus cereus, Bacillus anthracis and Bacillus thuringiensis, closely related to Pasteuria species, in which they could be involved in the adherence of bacteria to host cells.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Pichon, L.; Carn, G.; Bouric, P.
1996-03-01
Positional cloning strategies for the hemochromatosis gene have previously concentrated on a target area restricted to a maximum genomic expanse of 400 kb around the HLA-A and HLA-F loci. Recently, the candidate region has been extended to 2-3 Mb on the distal side of the MHC. In this study, 10 coding sequences [hemochromatosis candidate genes (HCG) I to X] were isolated by cDNA selection using YACs covering the HLA-A/HLA-F subregion. Two of these (HCG II and HCG IV) belong to multigene families, as well as other sequences already described in this region, i.e., P5, pMC 6.7, and HLA class I. Fingerprinting of the four YACs overlapping the region was performed and allowed partial localization of the different multigene family sequences on each YAC without defining their exact positions. Fingerprinting on cosmids isolated from the ICRF chromosome 6-specific cosmid library allowed more precise localization of the redundant sequences in all of the multigene families and revealed their apparent organization in clusters. Further examination of these intertwined sequences demonstrated that this structural organization resulted from a succession of complex phenomena, including duplications and contractions. This study presents a precise description of the structural organization of the HLA-A/HLA-F region and a determination of the sequences involved in the megabase size polymorphism observed among the A3, A24, and A31 haplotypes. 29 refs., 2 figs., 2 tabs.
NASA Astrophysics Data System (ADS)
Shi, Jinming; Lu, Weihong; Sun, Yeqing
2014-04-01
After space flight or low dose heavy ion radiation treatment, rice seeds were cultured on the ground. Leaves of the mature plants were obtained for examination of genomic/epigenomic mutations by using amplified fragment length polymorphism (AFLP) and methylation sensitive amplification polymorphism (MSAP) methods, respectively. The mutation sites were identified by fragment recovery and sequencing. The heritability of the mutations was detected in the next generation. Results showed that both space flight and low dose heavy ion radiation can induce significant alterations in the rice genome and epigenome (P < 0.05). For both genetic and epigenetic assays, while there was no significant difference in mutation rates and their ability to be inherited by the next generation, the sites of mutations differed between the space flight and radiation treated groups. More than 50% of the mutation sites were shared by the two radiation treated groups, irradiated with different LET values and doses, while only about 20% of the mutation sites were shared by the space flight group and the radiation treated groups. Moreover, in the space flight group, we found that DNA methylation changes were more prone to occur on CNG sequences than on CG sequences. Sequencing results showed that mutations induced by both space flight and heavy ion radiation were widely spread across the rice genome, including coding regions and repeated regions. Our study described and compared the characteristics of genomic/epigenomic mutations induced by space flight and low dose heavy ion radiation. Our data revealed the mechanisms of application of space environment for mutagenesis and crop breeding. Furthermore, this work implies that the nature of mutations induced under space flight conditions may involve factors beyond ion radiation.
Brütting, Christine; Emmer, Alexander; Kornhuber, Malte; Staege, Martin S
2016-08-01
Although multiple sclerosis (MS) is one of the most common central nervous system diseases in young adults, little is known about its etiology. Several human endogenous retroviruses (ERVs) are considered to play a role in MS. To find potential initiators of MS, we asked which ERVs can be identified in the vicinity of MS-associated genetic markers. We analysed the chromosomal regions surrounding 58 single nucleotide polymorphisms (SNPs) that are associated with MS, identified in one of the last major genome-wide association studies. We scanned these regions for putative endogenous retrovirus sequences with large open reading frames (ORFs). We observed that more retrovirus-related putative ORFs exist in the relatively close vicinity of SNP marker indices in multiple sclerosis compared to control SNPs. We found very high homologies to HERV-K, HCML-ARV, XMRV, Galidia ERV, HERV-H/env62 and XMRV-like mouse endogenous retrovirus mERV-XL. The associated genes (CYP27B1, CD6, CD58, MPV17L2, IL12RB1, CXCR5, PTGER4, TAGAP, TYK2, ICAM3, CD86, GALC, GPR65 as well as the HLA DRB1*1501) are mainly involved in the immune system, but also in vitamin D regulation. The most frequently detected ERV sequences are related to the multiple sclerosis-associated retrovirus, the human immunodeficiency virus 1, HERV-K, and the Simian foamy virus. Our data show that there is a relation between MS-associated SNPs and the number of retroviral elements compared to control. Our data identify new ERV sequences that have not been associated with MS so far.
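Scanning a genomic region for large open reading frames, as described in this abstract, can be sketched roughly as follows. This is a generic six-frame ORF scan, assuming ATG start codons, the standard stop codons, and a hypothetical minimum length threshold; the abstract does not specify the tool or parameters actually used, so none of this should be read as the authors' pipeline.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Reverse complement of an uppercase DNA string (A, C, G, T only)."""
    return seq.translate(COMPLEMENT)[::-1]


def find_orfs(seq, min_codons=100):
    """Return ORFs as (strand, frame, start, end) tuples.

    start/end are 0-based, end-exclusive coordinates on the scanned strand
    (for "-" they refer to the reverse-complemented sequence). Only ORFs
    that begin with ATG and end at an in-frame stop codon are reported;
    min_codons counts codons before the stop and is an assumed threshold.
    """
    seq = seq.upper()
    orfs = []
    for strand, s in (("+", seq), ("-", reverse_complement(seq))):
        for frame in range(3):
            start = None
            for i in range(frame, len(s) - 2, 3):
                codon = s[i:i + 3]
                if start is None and codon == "ATG":
                    start = i  # first in-frame start codon opens the ORF
                elif start is not None and codon in STOP_CODONS:
                    if (i - start) // 3 >= min_codons:
                        orfs.append((strand, frame, start, i + 3))
                    start = None  # look for the next ORF in this frame
    return orfs


# Toy sequence (hypothetical) with one short ORF: ATG AAA TTT TAG.
print(find_orfs("CCATGAAATTTTAGCC", min_codons=3))
```

In practice, candidate ORFs from such a scan would still need to be compared against retroviral reference sequences to judge homology, a step not covered by this sketch.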
Couldrey, C; Keehan, M; Johnson, T; Tiplady, K; Winkelman, A; Littlejohn, M D; Scott, A; Kemper, K E; Hayes, B; Davis, S R; Spelman, R J
2017-07-01
Single nucleotide polymorphisms have been the DNA variant of choice for genomic prediction, largely because of the ease of single nucleotide polymorphism genotype collection. In contrast, structural variants (SV), which include copy number variants (CNV), translocations, insertions, and inversions, have eluded easy detection and characterization, particularly in nonhuman species. However, evidence increasingly shows that SV not only contribute a substantial proportion of genetic variation but also have significant influence on phenotypes. Here we present the discovery of CNV in a prominent New Zealand dairy bull using long-read PacBio (Pacific Biosciences, Menlo Park, CA) sequencing technology and the Sniffles SV discovery tool (version 0.0.1; https://github.com/fritzsedlazeck/Sniffles). The CNV identified from long reads were compared with CNV discovered in the same bull from Illumina sequencing using CNVnator (read depth-based tool; Illumina Inc., San Diego, CA) as a means of validation. Subsequently, further validation was undertaken using whole-genome Illumina sequencing of 556 cattle representing the wider New Zealand dairy cattle population. Very limited overlap was observed in CNV discovered from the 2 sequencing platforms, in part because of the differences in size of CNV detected. Only a few CNV were therefore able to be validated using this approach. However, the ability to use CNVnator to genotype the 557 cattle for copy number across all regions identified as putative CNV allowed a genome-wide assessment of transmission level of copy number based on pedigree. The more highly transmissible a putative CNV region was observed to be, the more likely the distribution of copy number was multimodal across the 557 sequenced animals. Furthermore, visual assessment of highly transmissible CNV regions provided evidence supporting the presence of CNV across the sequenced animals. 
This transmission-based approach was able to confirm a subset of CNV that segregates in the New Zealand dairy cattle population. Genome-wide identification and validation of CNV is an important step toward their inclusion in genomic selection strategies. The Authors. Published by the Federation of Animal Science Societies and Elsevier Inc. on behalf of the American Dairy Science Association®. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/3.0/).
[Comparative analysis of variable regions in the genomes of variola virus].
Babkin, I V; Nepomniashchikh, T S; Maksiutov, R A; Gutorov, V V; Babkina, I N; Shchelkunov, S N
2008-01-01
Nucleotide sequences of two extended segments of the terminal variable regions in the variola virus genome were determined. The size of the left segment was 13.5 kbp and that of the right, 10.5 kbp. In total, over 540 kbp were sequenced for 22 variola virus strains. The phylogenetic analysis conducted and the data published earlier allowed us to find the interrelations between 70 variola virus isolates, the character of their clustering, and the degree of intergroup and intragroup variations of the clusters of variola virus strains. The most polymorphic loci of the genome segments studied were determined. It was demonstrated that these loci are localized either to noncoding genome regions or to the regions of destroyed open reading frames characteristic of the ancestor virus. These loci are promising for development of a strategy for genotyping variola virus strains. Analysis of recombination using various methods demonstrated that, with a single exception, no statistically significant recombination events were detectable in the genomes of the variola virus strains studied.
Billaut-Laden, Ingrid; Allorge, Delphine; Crunelle-Thibaut, Aurélie; Rat, Emmanuel; Cauffiez, Christelle; Chevalier, Dany; Houdret, Nicole; Lo-Guidice, Jean-Marc; Broly, Franck
2006-08-01
Rhodanese or thiosulfate sulfurtransferase (TST) is a mitochondrial matrix enzyme that plays roles in cyanide detoxification, the formation of iron-sulfur proteins and the modification of sulfur-containing enzymes. The transsulfuration reaction catalyzed by TST is also involved in H2S detoxification. To date, no polymorphism of the human TST gene had been reported. We developed a screening strategy based on a PCR-SSCP method to search for mutations in the 3 exons of TST and their proximal flanking regions. This strategy has been applied to DNA samples from 50 unrelated French individuals of Caucasian origin. Eleven polymorphisms consisting of seven nucleotide substitutions in non-coding regions, two silent mutations and two missense mutations were characterized. The functional consequences of the identified mutations were assessed in vivo by measurement of erythrocyte TST activity and/or in vitro using heterologous expression in Saccharomyces cerevisiae or transient transfection assay in HT29 and Caco-2 cell lines. The P285A variant appears to encode a protein with a 50% decrease of in vitro intrinsic clearance compared to the wild-type enzyme. Additionally, the six polymorphisms located upstream of the ATG initiation codon are responsible for a significant decrease (ranging from 40% to 73%) in promoter activity of a reporter gene compared to the corresponding wild-type sequence. This work constitutes the first report of the existence of a functional genetic polymorphism affecting TST activity and should be of great help in investigating certain disorders in which impairment of CN- or H2S detoxification has been suggested to be involved.
Guard, Jean; Sanchez-Ingunza, Roxana; Morales, Cesar; Stewart, Tod; Liljebjelke, Karen; Kessel, JoAnn; Ingram, Kim; Jones, Deana; Jackson, Charlene; Fedorka-Cray, Paula; Frye, Jonathan; Gast, Richard; Hinton, Arthur
2012-01-01
Two DNA-based methods were compared for the ability to assign serotype to 139 isolates of Salmonella enterica ssp. I. Intergenic sequence ribotyping (ISR) evaluated single nucleotide polymorphisms occurring in a 5S ribosomal gene region and flanking sequences bordering the gene dkgB. A DNA microarray hybridization method that assessed the presence and the absence of sets of genes was the second method. Serotype was assigned for 128 (92.1%) of submissions by the two DNA methods. ISR detected mixtures of serotypes within single colonies and it cost substantially less than Kauffmann–White serotyping and DNA microarray hybridization. Decreasing the cost of serotyping S. enterica while maintaining reliability may encourage routine testing and research. PMID:22998607
Haider, Nadia
2017-01-01
Investigation of genetic variation and phylogenetic relationships among date palm (Phoenix dactylifera L.) cultivars is useful for their conservation and genetic improvement. Various molecular markers such as restriction fragment length polymorphisms (RFLPs), simple sequence repeat (SSR), representational difference analysis (RDA), and amplified fragment length polymorphism (AFLP) have been developed to molecularly characterize date palm cultivars. PCR-based markers random amplified polymorphic DNA (RAPD) and inter-simple sequence repeat (ISSR) are powerful tools to determine the relatedness of date palm cultivars that are difficult to distinguish morphologically. In this chapter, the principles, materials, and methods of RAPD and ISSR techniques are presented. Analysis of data generated from these two techniques and the use of these data to reveal phylogenetic relationships among date palm cultivars are also discussed.
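RAPD and ISSR are dominant markers that are typically scored as band presence (1) or absence (0), and relatedness is often summarized with a similarity coefficient before clustering. The sketch below computes a Jaccard similarity matrix from such binary profiles. The cultivar labels and band scores are invented for illustration, and the choice of the Jaccard coefficient is an assumption, since the abstract does not say which coefficient the chapter uses.

```python
def jaccard_similarity(bands_a, bands_b):
    """Jaccard coefficient between two 0/1 band profiles of equal length."""
    if len(bands_a) != len(bands_b):
        raise ValueError("band profiles must have the same length")
    shared = sum(1 for x, y in zip(bands_a, bands_b) if x == 1 and y == 1)
    either = sum(1 for x, y in zip(bands_a, bands_b) if x == 1 or y == 1)
    # Two profiles with no bands at all are treated as having zero similarity.
    return shared / either if either else 0.0


def similarity_matrix(profiles):
    """Pairwise Jaccard similarities for a dict of name -> band profile."""
    names = sorted(profiles)
    return {
        (a, b): jaccard_similarity(profiles[a], profiles[b])
        for a in names
        for b in names
    }


# Hypothetical band scores for three cultivars across five marker bands.
profiles = {
    "cultivar_A": [1, 1, 0, 1, 0],
    "cultivar_B": [1, 0, 0, 1, 1],
    "cultivar_C": [0, 0, 1, 0, 1],
}
for pair, value in similarity_matrix(profiles).items():
    print(pair, round(value, 3))
```

A matrix like this (or one minus its values, as distances) is the usual input to a clustering method such as UPGMA when building a dendrogram of cultivars.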
Paul-Samojedny, Monika; Kowalczyk, Malgorzata; Suchanek, Renata; Owczarek, Aleksander; Fila-Danilow, Anna; Szczygiel, Aleksandra; Kowalski, Jan
2010-09-01
Schizophrenia is a multifactorial disease with changes in the immunological system. Such changes are the result of cytokine-level disturbances connected with cytokine gene polymorphisms. However, research on cytokine gene polymorphisms in schizophrenia has been surprisingly limited and ambiguous. The aim of the study was to identify whether polymorphisms of interleukin (IL)-6 and IL-10 are risk factors for the development of paranoid schizophrenia in a case-control study. IL-6 (-174G/C; rs1800795) and IL-10 (-1082G/A; rs1800896) promoter polymorphisms in patients with paranoid schizophrenia and healthy individuals were genotyped using the polymerase chain reaction-restriction fragment length polymorphism method. Differences in IL-6 and IL-10 promoter haplotypes may play an important role in determining the transcription level for IL-6 and IL-10 genes in schizophrenic patients. The presence of allele C at position -174 of the IL-6 promoter sequence may correlate with increasing risk of paranoid schizophrenia in the Polish population, but research on a broadened group of people is needed. The presence of allele G at position -1082 of the IL-10 promoter sequence correlates with increasing risk of paranoid schizophrenia in the Polish population. The coexistence of genotype GG at position -1082 of the IL-10 promoter sequence and genotype GC at position -174 of the IL-6 promoter sequence correlates with increasing risk of paranoid schizophrenia in the Polish population.
SNP discovery by high-throughput sequencing in soybean
2010-01-01
Background With the advance of new massively parallel genotyping technologies, quantitative trait loci (QTL) fine mapping and map-based cloning become more achievable in identifying genes for important and complex traits. Development of high-density genetic markers in the QTL regions of specific mapping populations is essential for fine-mapping and map-based cloning of economically important genes. Single nucleotide polymorphisms (SNPs) are the most abundant form of genetic variation existing between any diverse genotypes that are usually used for QTL mapping studies. The massively parallel sequencing technologies (Roche GS/454, Illumina GA/Solexa, and ABI/SOLiD) have been widely applied to identify genome-wide sequence variations. However, it still remains unclear whether sequence data at a low sequencing depth are enough to detect the variations existing in any QTL regions of interest in a crop genome, and how to prepare sequencing samples for a complex genome such as soybean. Therefore, with the aims of identifying SNP markers in a cost-effective way for fine-mapping several QTL regions, and testing the validation rate of the putative SNPs predicted with Solexa short sequence reads at a low sequencing depth, we evaluated a pooled DNA fragment reduced representation library and SNP detection methods applied to short read sequences generated by Solexa high-throughput sequencing technology. Results A total of 39,022 putative SNPs were identified by the Illumina/Solexa sequencing system using a reduced representation DNA library of two parental lines of a mapping population. The validation rates of these putative SNPs predicted with low and high stringency were 72% and 85%, respectively. One hundred sixty-four SNP markers resulted from the validation of putative SNPs and have been selectively chosen to target a known QTL, thereby increasing the marker density of the targeted region to one marker per 42 kbp. 
Conclusions We have demonstrated how to quickly identify large numbers of SNPs for fine mapping of QTL regions by applying massively parallel sequencing combined with genome complexity reduction techniques. This SNP discovery approach is more efficient for targeting multiple QTL regions in a same genetic population, which can be applied to other crops. PMID:20701770
Natural Killer Cell Receptor Genes in the Family Equidae: Not only Ly49
Futas, Jan; Horin, Petr
2013-01-01
Natural killer (NK) cells have important functions in immunity. NK recognition in mammals can be mediated through killer cell immunoglobulin-like receptors (KIR) and/or killer cell lectin-like Ly49 receptors. Genes encoding highly variable NK cell receptors (NKR) represent rapidly evolving genomic regions. No single conservative model of NKR genes was observed in mammals. Single-copy low polymorphic NKR genes present in one mammalian species may expand into highly polymorphic multigene families in other species. In contrast to other non-rodent mammals, multiple Ly49-like genes appear to exist in the horse, while no functional KIR genes were observed in this species. In this study, Ly49 and KIR were sought and their evolution was characterized in the entire family Equidae. Genomic sequences retrieved showed the presence of at least five highly conserved polymorphic Ly49 genes in horses, asses and zebras. These findings confirmed that the expansion of Ly49 occurred in the entire family. Several KIR-like sequences were also identified in the genome of Equids. Besides a previously identified non-functional KIR-Immunoglobulin-like transcript fusion gene (KIR-ILTA) and two putative pseudogenes, a KIR3DL-like sequence was analyzed. In contrast to previous observations made in the horse, the KIR3DL sequence, genomic organization and mRNA expression suggest that all Equids might produce a functional KIR receptor protein molecule with a single non-mutated immune tyrosine-based inhibition motif (ITIM) domain. No evidence for positive selection in the KIR3DL gene was found. Phylogenetic analysis including rhinoceros and tapir genomic DNA and deduced amino acid KIR-related sequences showed differences between families and even between species within the order Perissodactyla. 
The results suggest that the order Perissodactyla and its family Equidae with expanded Ly49 genes and with a potentially functional KIR gene may represent an interesting model for evolutionary biology of NKR genes. PMID:23724088
Hong, Yanbin; Pandey, Manish K; Liu, Ying; Chen, Xiaoping; Liu, Hong; Varshney, Rajeev K; Liang, Xuanqiang; Huang, Shangzhi
2015-01-01
The cultivated peanut (Arachis hypogaea L.) is an allotetraploid (AABB) species derived from the A-genome (Arachis duranensis) and B-genome (Arachis ipaensis) progenitors. The presence of two versions of a DNA sequence based on the two progenitor genomes poses a serious technical and analytical problem during single nucleotide polymorphism (SNP) marker identification and analysis. In this context, we have analyzed 200 amplicons derived from expressed sequence tags (ESTs) and genome survey sequences (GSS) to identify SNPs in a panel of genotypes consisting of 12 cultivated peanut varieties and two diploid progenitors representing the ancestral genomes. A total of 18 EST-SNPs and 44 genomic-SNPs were identified in 12 peanut varieties by aligning the sequence of A. hypogaea with diploid progenitors. The average frequency of sequence polymorphism was higher for genomic-SNPs than for EST-SNPs, with one genomic-SNP every 1011 bp as compared to one EST-SNP every 2557 bp. In order to estimate the potential and further applicability of these identified SNPs, 96 peanut varieties were genotyped using the high resolution melting (HRM) method. Polymorphism information content (PIC) values for EST-SNPs ranged between 0.021 and 0.413 with a mean of 0.172 in the set of peanut varieties, while those for genomic-SNPs ranged between 0.080 and 0.478 with a mean of 0.249. A total of 33 SNPs were used for polymorphism detection among the parents and 10 selected lines from mapping population Y13Zh (Zhenzhuhei × Yueyou13). Of these 33 SNPs, nine showed polymorphism in the mapping population Y13Zh, and seven were successfully mapped into five linkage groups. Our results showed that SNPs can be identified in allotetraploid peanut with high accuracy through amplicon sequencing and HRM assay. The identified SNPs were very informative and can be used for different genetic and breeding applications in peanut.
Jha, Ruchira Menka; Koleck, Theresa A; Puccio, Ava M; Okonkwo, David O; Park, Seo-Young; Zusman, Benjamin E; Clark, Robert S B; Shutter, Lori A; Wallisch, Jessica S; Empey, Philip E; Kochanek, Patrick M; Conley, Yvette P
2018-04-19
ABCC8 encodes sulfonylurea receptor 1, a key regulatory protein of cerebral oedema in many neurological disorders including traumatic brain injury (TBI). Sulfonylurea-receptor-1 inhibition has been promising in ameliorating cerebral oedema in clinical trials. We evaluated whether ABCC8 tag single-nucleotide polymorphisms predicted oedema and outcome in TBI. DNA was extracted from 485 prospectively enrolled patients with severe TBI. 410 were analysed after quality control. ABCC8 tag single-nucleotide polymorphisms (SNPs) were identified (HapMap, r² > 0.8, minor-allele frequency > 0.20) and sequenced (iPlex-Gold, MassArray). Outcomes included radiographic oedema, intracranial pressure (ICP) and 3-month Glasgow Outcome Scale (GOS) score. Proxy SNPs, spatial modelling, amino acid topology and functional predictions were determined using established software programs. Wild-type rs7105832 and rs2237982 alleles and genotypes were associated with lower average ICP (β=-2.91, p=0.001; β=-2.28, p=0.003) and decreased radiographic oedema (OR 0.42, p=0.012; OR 0.52, p=0.017). Wild-type rs2237982 also increased favourable 3-month GOS (OR 2.45, p=0.006); this was partially mediated by oedema (p=0.03). Different polymorphisms predicted 3-month outcome: variant rs11024286 increased (OR 1.84, p=0.006) and wild-type rs4148622 decreased (OR 0.40, p=0.01) the odds of favourable outcome. Significant tag and concordant proxy SNPs regionally span introns/exons 2-15 of the 39-exon gene. This study identifies four ABCC8 tag SNPs associated with cerebral oedema and/or outcome in TBI, tagging a region including 33 polymorphisms. In polymorphisms predictive of oedema, variant alleles/genotypes confer increased risk. Different variant polymorphisms were associated with favourable outcome, potentially suggesting distinct mechanisms. 
Significant polymorphisms spatially clustered flanking exons encoding the sulfonylurea receptor site and transmembrane domain 0/loop 0 (juxtaposing the channel pore/binding site). This, if validated, may help build a foundation for developing future strategies that may guide individualised care, treatment response, prognosis and patient selection for clinical trials.
Reference genome sequence of the model plant Setaria
DOE Office of Scientific and Technical Information (OSTI.GOV)
Bennetzen, Jeffrey L; Schmutz, Jeremy; Wang, Hao
We generated a high-quality reference genome sequence for foxtail millet (Setaria italica). The ~400-Mb assembly covers ~80% of the genome and >95% of the gene space. The assembly was anchored to a 992-locus genetic map and was annotated by comparison with >1.3 million expressed sequence tag reads. We produced more than 580 million RNA-Seq reads to facilitate expression analyses. We also sequenced Setaria viridis, the ancestral wild relative of S. italica, and identified regions of differential single-nucleotide polymorphism density, distribution of transposable elements, small RNA content, chromosomal rearrangement and segregation distortion. The genus Setaria includes natural and cultivated species that demonstrate a wide capacity for adaptation. The genetic basis of this adaptation was investigated by comparing five sequenced grass genomes. We also used the diploid Setaria genome to evaluate the ongoing genome assembly of a related polyploid, switchgrass (Panicum virgatum).
Reference genome sequence of the model plant Setaria.
Bennetzen, Jeffrey L; Schmutz, Jeremy; Wang, Hao; Percifield, Ryan; Hawkins, Jennifer; Pontaroli, Ana C; Estep, Matt; Feng, Liang; Vaughn, Justin N; Grimwood, Jane; Jenkins, Jerry; Barry, Kerrie; Lindquist, Erika; Hellsten, Uffe; Deshpande, Shweta; Wang, Xuewen; Wu, Xiaomei; Mitros, Therese; Triplett, Jimmy; Yang, Xiaohan; Ye, Chu-Yu; Mauro-Herrera, Margarita; Wang, Lin; Li, Pinghua; Sharma, Manoj; Sharma, Rita; Ronald, Pamela C; Panaud, Olivier; Kellogg, Elizabeth A; Brutnell, Thomas P; Doust, Andrew N; Tuskan, Gerald A; Rokhsar, Daniel; Devos, Katrien M
2012-05-13
We generated a high-quality reference genome sequence for foxtail millet (Setaria italica). The ∼400-Mb assembly covers ∼80% of the genome and >95% of the gene space. The assembly was anchored to a 992-locus genetic map and was annotated by comparison with >1.3 million expressed sequence tag reads. We produced more than 580 million RNA-Seq reads to facilitate expression analyses. We also sequenced Setaria viridis, the ancestral wild relative of S. italica, and identified regions of differential single-nucleotide polymorphism density, distribution of transposable elements, small RNA content, chromosomal rearrangement and segregation distortion. The genus Setaria includes natural and cultivated species that demonstrate a wide capacity for adaptation. The genetic basis of this adaptation was investigated by comparing five sequenced grass genomes. We also used the diploid Setaria genome to evaluate the ongoing genome assembly of a related polyploid, switchgrass (Panicum virgatum).
Holtz, Yan; Ardisson, Morgane; Ranwez, Vincent; Besnard, Alban; Leroy, Philippe; Poux, Gérard; Roumet, Pierre; Viader, Véronique; Santoni, Sylvain; David, Jacques
2016-01-01
Targeted sequence capture is a promising technology which helps reduce costs for sequencing and genotyping numerous genomic regions in large sets of individuals. Bait sequences are designed to capture specific alleles previously discovered in parents or reference populations. We studied a set of 135 RILs originating from a cross between an emmer cultivar (Dic2) and a recent durum elite cultivar (Silur). Six thousand sequence baits were designed to target Dic2 vs. Silur polymorphisms discovered in a previous RNAseq study. These baits were exposed to genomic DNA of the RIL population. Eighty percent of the targeted SNPs were recovered, 65% of which were of high quality and coverage. The final high density genetic map consisted of more than 3,000 markers, whose genetic and physical mapping were consistent with those obtained with large arrays. PMID:27171472
Wustman, Brandon A; Morse, Daniel E; Evans, John Spencer
2004-08-05
The AP7 and AP24 proteins represent a class of mineral-interaction polypeptides that are found in the aragonite-containing nacre layer of mollusk shell (H. rufescens). These proteins have been shown to preferentially interfere with calcium carbonate mineral growth in vitro. It is believed that both proteins play an important role in aragonite polymorph selection in the mollusk shell. Previously, we demonstrated the 1-30 amino acid (AA) N-terminal sequences of AP7 and AP24 represent mineral interaction/modification domains in both proteins, as evidenced by their ability to frustrate calcium carbonate crystal growth at step edge regions. In this present report, using free N-terminal, C(alpha)-amide "capped" synthetic polypeptides representing the 1-30 AA regions of AP7 (AP7-1 polypeptide) and AP24 (AP24-1 polypeptide) and NMR spectroscopy, we confirm that both N-terminal sequences possess putative Ca (II) interaction polyanionic sequence regions (2 x -DD- in AP7-1, -DDDED- in AP24-1) that are random coil-like in structure. However, with regard to the remaining sequences regions, each polypeptide features unique structural differences. AP7-1 possesses an extended beta-strand or polyproline type II-like structure within the A11-M10, S12-V13, and S28-I27 sequence regions, with the remaining sequence regions adopting a random-coil-like structure, a trait common to other polyelectrolyte mineral-associated polypeptide sequences. Conversely, AP24-1 possesses random coil-like structure within A1-S9 and Q14-N16 sequence regions, and evidence for turn-like, bend, or loop conformation within the G10-N13, Q17-N24, and M29-F30 sequence regions, similar to the structures identified within the putative elastomeric proteins Lustrin A and sea urchin spicule matrix proteins. The similarities and differences in AP7 and AP24 N-terminal domain structure are discussed with regard to joint AP7-AP24 protein modification of calcium carbonate growth. Copyright 2004 Wiley Periodicals, Inc.
NASA Astrophysics Data System (ADS)
Nardo, Luca; Tosi, Giovanna; Bondani, Maria; Accolla, Roberto; Andreoni, Alessandra
2012-06-01
By tens-of-picosecond resolved fluorescence detection we study Förster resonance energy transfer between a donor and a black-hole-quencher bound at the 5'- and 3'-positions of an oligonucleotide probe matching the highly polymorphic region between codons 51 and 58 of the human leukocyte antigen DQB1 0201 allele, conferring susceptibility to type-1 diabetes. The probe is annealed with non-amplified genomic DNAs carrying either the 0201 sequence or other DQB1 allelic variants. We detect the longest-lived donor fluorescence in the case of hybridization with the 0201 allele and definitely faster and distinct decays for the other allelic variants, some of which are single-nucleotide polymorphic.
Choi, Hyung Jin; Cho, Young Min; Moon, Min Kyong; Choi, Hye Hun; Shin, Hyoung Doo; Jang, Hak Chul; Kim, Seong Yeon; Lee, Hong Kyu; Park, Kyong Soo
2006-11-01
Ghrelin is known to play a role in glucose metabolism and in beta-cell function. There are controversies regarding the role of ghrelin polymorphisms in diabetes and diabetes-related phenotypes. The objective of this study was to examine polymorphisms of the ghrelin gene in a Korean cohort and investigate associations between them and susceptibility to type 2 diabetes and its related phenotypes. The ghrelin gene was sequenced to identify polymorphisms in 24 DNA samples. Common variants were then genotyped in 760 type 2 diabetic patients and 641 nondiabetic subjects. Genetic associations with diabetes-related phenotypes were also analyzed. Nine polymorphisms were identified, and four common polymorphisms [g.-1500C>G, g.-1062G > C, g.-994C > T, g.+408C > A (Leu72Met)] were genotyped in a larger study. The genotype distributions of these four common polymorphisms in type 2 diabetes patients were similar to those of normal nondiabetic controls. However, these four common polymorphisms were variably associated with several diabetes-related phenotypes, such as high-density lipoprotein (HDL) cholesterol, fasting plasma glucose, and homeostasis model assessment of insulin resistance. In particular, subjects harboring g.-1062C were associated with a lower serum HDL cholesterol level after adjusting for other variables (P = 0.0004 or 0.01 after Bonferroni correction for 24 tests). The aforementioned four common polymorphisms in the ghrelin gene were not found to be significantly associated with susceptibility to type 2 diabetes mellitus in the Korean population. However, the common polymorphism g.-1062G > C in the promoter region of the ghrelin gene was found to be significantly associated with serum HDL cholesterol levels.
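The Bonferroni-corrected value quoted above can be reproduced with a short sketch. It assumes the standard adjustment (raw P value multiplied by the number of tests, capped at 1); the function name is hypothetical, and the abstract does not describe the computation beyond naming the correction and the 24 tests.

```python
def bonferroni_adjust(p_value, n_tests):
    """Return the Bonferroni-adjusted P value, capped at 1.0."""
    return min(1.0, p_value * n_tests)


# Raw P value and number of tests as stated in the abstract.
adjusted = bonferroni_adjust(0.0004, 24)
print(round(adjusted, 4))  # 0.0096, which rounds to the reported 0.01
```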
Hollox, E J; Atia, T; Cross, G; Parkin, T; Armour, J A L
2002-11-01
Subtelomeric regions of the human genome are gene rich, with a high level of sequence polymorphism. A number of clinical conditions, including learning disability, have been attributed to subtelomeric deletions or duplications, but screening for deletion in these regions using conventional cytogenetic methods and fluorescence in situ hybridisation (FISH) is laborious. Here we report that a new method, multiplex amplifiable probe hybridisation (MAPH), can be used to screen for copy number at subtelomeric regions. We have constructed a set of MAPH probes with each subtelomeric region represented at least once, so that one gel lane can assay copy number at all chromosome ends in one person. Each probe has been sequenced and, where possible, its position relative to the telomere determined by comparison with mapped clones. The sensitivity of the probes has been characterised on a series of cytogenetically verified positive controls and 83 normal controls were used to assess the frequency of polymorphic copy number with no apparent phenotypic effect. We have also used MAPH to test a cohort of 37 people selected from males referred for fragile X syndrome testing and found six changes that were confirmed by dosage PCR. MAPH can be used to screen subtelomeric regions of chromosomes for deletions and duplications before confirmation by FISH or dosage PCR. The high throughput nature of this technique allows it to be used for large scale screening of subtelomeric copy number, before confirmation by FISH. In practice, the availability of a rapid and efficient screen may allow subtelomeric analysis to be applied to a wider selection of patients than is currently possible using FISH alone.
Hollox, E; Atia, T; Cross, G; Parkin, T; Armour, J
2002-01-01
Background: Subtelomeric regions of the human genome are gene rich, with a high level of sequence polymorphism. A number of clinical conditions, including learning disability, have been attributed to subtelomeric deletions or duplications, but screening for deletion in these regions using conventional cytogenetic methods and fluorescence in situ hybridisation (FISH) is laborious. Here we report that a new method, multiplex amplifiable probe hybridisation (MAPH), can be used to screen for copy number at subtelomeric regions. Methods: We have constructed a set of MAPH probes with each subtelomeric region represented at least once, so that one gel lane can assay copy number at all chromosome ends in one person. Each probe has been sequenced and, where possible, its position relative to the telomere determined by comparison with mapped clones. Results: The sensitivity of the probes has been characterised on a series of cytogenetically verified positive controls and 83 normal controls were used to assess the frequency of polymorphic copy number with no apparent phenotypic effect. We have also used MAPH to test a cohort of 37 people selected from males referred for fragile X syndrome testing and found six changes that were confirmed by dosage PCR. Conclusions: MAPH can be used to screen subtelomeric regions of chromosomes for deletions and duplications before confirmation by FISH or dosage PCR. The high throughput nature of this technique allows it to be used for large scale screening of subtelomeric copy number, before confirmation by FISH. In practice, the availability of a rapid and efficient screen may allow subtelomeric analysis to be applied to a wider selection of patients than is currently possible using FISH alone. PMID:12414816
Wang, Shiyan; Zhang, Mingdong; Zeng, Zhirong; Tian, Linwei; Wu, Kaichun; Chu, Jianhong; Fan, Daiming; Hu, Pinjin; Sung, Joseph J Y; Yu, Jun
2011-04-25
Nuclear factor-kappa B inhibitor alpha (IκBα) polymorphisms have been found to be associated with inflammatory diseases. However, the association between IκBα polymorphisms and gastric cancer is still unknown. We aimed to investigate the association between IκBα polymorphisms and gastric cancer risk in a large population-based case-control study among southern Chinese. A population-based case-control study was conducted between 1999 and 2006 in Guangdong Province, China. A total of 1010 gastric cancer patients and 1500 healthy controls were enrolled in this study. IκBα polymorphisms were identified by sequencing of the IκBα gene ranging from the 2kb promoter region to the 3.5kb genomic region. Polymorphisms in IκBα were analyzed by TaqMan SNP genotyping assay. The rs17103265 deletion homozygote (-/-) had significantly increased gastric cancer risk (OR=2.11, 95% CI=1.17-3.83, P=0.01), compared with the rs17103265 T homozygote (TT). The rs17103265 (-/-) genotype was significantly associated with increased risk of intestinal-type gastric cancer (OR=2.21, 95% CI=1.19-4.08, P=0.01), but not with the diffuse or mixed type of gastric cancer. rs17103265 (-/-) was associated with poorly differentiated gastric cancer (OR=2.05, 95% CI=1.07-3.94, P=0.03), but not with moderately or well differentiated gastric cancer. A significant decrease in luciferase activity was observed for the vector containing the rs17103265 deletion allele as compared with the vector containing the rs17103265 T allele (P<0.0001). The rs17103265 polymorphism was not associated with the prognosis of gastric cancer patients. The IκBα rs17103265 deletion homozygote is a novel genetic risk factor for gastric carcinogenesis, especially for the development of certain subtypes of gastric cancer in the southern Chinese population. Copyright © 2011 Elsevier Inc. All rights reserved.
Diekmann, Kerstin; Hodkinson, Trevor R.; Barth, Susanne
2012-01-01
Background and Aims: Lolium perenne (perennial ryegrass) is the most important forage grass species of temperate regions. We have previously released the chloroplast genome sequence of L. perenne ‘Cashel’. Here nine chloroplast microsatellite markers are published, which were designed based on knowledge about genetically variable regions within the L. perenne chloroplast genome. These markers were successfully used for characterizing the genetic diversity in Lolium and different grass species. Methods: Chloroplast genomes of 14 Poaceae taxa were screened for mononucleotide microsatellite repeat regions and primers designed for their amplification from nine loci. The potential of these markers to assess genetic diversity was evaluated on a set of 16 Irish and 15 European L. perenne ecotypes, nine L. perenne cultivars, other Lolium taxa and other grass species. Key Results: All analysed Poaceae chloroplast genomes contained more than 200 mononucleotide repeats (chloroplast simple sequence repeats, cpSSRs) of at least 7 bp in length, concentrated mainly in the large single copy region of the genome. Nucleotide composition varied considerably among subfamilies (with Pooideae biased towards poly A repeats). The nine new markers distinguish L. perenne from all non-Lolium taxa. TeaCpSSR28 was able to distinguish between all Lolium species and Lolium multiflorum due to an elongation of an A8 mononucleotide repeat in L. multiflorum. TeaCpSSR31 detected a considerable degree of microsatellite length variation and single nucleotide polymorphism. TeaCpSSR27 revealed variation within some L. perenne accessions due to a 44-bp indel and was hence readily detected by simple agarose gel electrophoresis. Smaller insertion/deletion events or single nucleotide polymorphisms detected by these new markers could be visualized by polyacrylamide gel electrophoresis or DNA sequencing, respectively. 
Conclusions: The new markers are a valuable tool for plant breeding companies, seed testing agencies and the wider scientific community due to their ability to monitor genetic diversity within breeding pools, to trace maternal inheritance and to distinguish closely related species. PMID:22419761
Diekmann, Kerstin; Hodkinson, Trevor R; Barth, Susanne
2012-11-01
Lolium perenne (perennial ryegrass) is the most important forage grass species of temperate regions. We have previously released the chloroplast genome sequence of L. perenne 'Cashel'. Here nine chloroplast microsatellite markers are published, which were designed based on knowledge about genetically variable regions within the L. perenne chloroplast genome. These markers were successfully used for characterizing the genetic diversity in Lolium and different grass species. Chloroplast genomes of 14 Poaceae taxa were screened for mononucleotide microsatellite repeat regions and primers designed for their amplification from nine loci. The potential of these markers to assess genetic diversity was evaluated on a set of 16 Irish and 15 European L. perenne ecotypes, nine L. perenne cultivars, other Lolium taxa and other grass species. All analysed Poaceae chloroplast genomes contained more than 200 mononucleotide repeats (chloroplast simple sequence repeats, cpSSRs) of at least 7 bp in length, concentrated mainly in the large single copy region of the genome. Nucleotide composition varied considerably among subfamilies (with Pooideae biased towards poly A repeats). The nine new markers distinguish L. perenne from all non-Lolium taxa. TeaCpSSR28 was able to distinguish between all Lolium species and Lolium multiflorum due to an elongation of an A(8) mononucleotide repeat in L. multiflorum. TeaCpSSR31 detected a considerable degree of microsatellite length variation and single nucleotide polymorphism. TeaCpSSR27 revealed variation within some L. perenne accessions due to a 44-bp indel and was hence readily detected by simple agarose gel electrophoresis. Smaller insertion/deletion events or single nucleotide polymorphisms detected by these new markers could be visualized by polyacrylamide gel electrophoresis or DNA sequencing, respectively. 
The new markers are a valuable tool for plant breeding companies, seed testing agencies and the wider scientific community due to their ability to monitor genetic diversity within breeding pools, to trace maternal inheritance and to distinguish closely related species.
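The screening step described above (finding mononucleotide repeats of at least 7 bp) can be sketched with a regular expression. This is only an illustration under stated assumptions: the function name and the toy sequence are hypothetical, and the abstract does not say which software the authors actually used.

```python
import re


def find_mononucleotide_repeats(sequence, min_length=7):
    """Return (start, base, length) for each run of one base >= min_length."""
    # ([ACGT]) captures one base; \1{n,} requires at least n further copies.
    pattern = re.compile(r"([ACGT])\1{%d,}" % (min_length - 1))
    return [
        (match.start(), match.group(1), len(match.group(0)))
        for match in pattern.finditer(sequence.upper())
    ]


# Toy sequence (not real chloroplast data): an A8 run and a T7 run.
toy = "GATTAC" + "A" * 8 + "GGC" + "T" * 7 + "CCA"
print(find_mononucleotide_repeats(toy))  # [(6, 'A', 8), (17, 'T', 7)]
```

Start positions are 0-based. Runs shorter than the threshold, such as the TT in the toy sequence, are ignored.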
DOE Office of Scientific and Technical Information (OSTI.GOV)
White, J.K.; Shaw, M.A.; Barton, C.H.
1994-11-15
Recent interest has focused on the region of conserved synteny between mouse chromosome 1 and human 2q33-q37, particularly over the region encoding the murine macrophage resistance gene Ity/Lsh/Bcg (candidate Nramp) and members of the Il8r interleukin-8 (IL8) receptor gene cluster. In this paper, identification of a restriction fragment length polymorphism in the IL8RB gene in 35 pedigrees previously typed for markers in the 2q33-q37 interval provided evidence (lod scores > 3) for linkage between IL8RB and the 2q34-q35 markers FN1, TNP1, VIL1, and DES. Physical mapping, using yeast artificial chromosomes isolated with VIL1, confirmed that IL8RA, IL8RB and the IL8RB pseudogene map within the NRAMP-VIL1 interval, with the physical distance (155 kb) from 5′ LSH to 3′ VIL1 representing ~3-fold that observed in the mouse. Partial sequencing of NRAMP confirmed the presence of the N-terminal proline/serine-rich putative SH3 binding domain in exon 2 of the human gene. Further analysis of Brazilian leprosy and visceral leishmaniasis pedigrees identified a rare second allele varying in a 9-nucleotide repeat motif of the exon 2 sequence but segregating independently of the disease phenotype. 38 refs., 4 figs., 3 tabs.
Payen, Thibaut; Murat, Claude; Gigant, Anaïs; Morin, Emmanuelle; De Mita, Stéphane; Martin, Francis
2015-09-01
The Périgord black truffle (Tuber melanosporum Vittad.), considered a gastronomic delicacy worldwide, is an ectomycorrhizal filamentous fungus that is ecologically important in Mediterranean French, Italian and Spanish woodlands. In this study, we developed a novel resource of single nucleotide polymorphisms (SNPs) for T. melanosporum using Illumina high-throughput resequencing. The genome from six T. melanosporum geographical accessions was sequenced to a depth of approximately 20×. These geographical accessions were selected from different populations within the northern and southern regions of the geographical species distribution. Approximately 80% of the reads for each of the six resequenced geographical accessions mapped against the reference T. melanosporum genome assembly, estimating the core genome size of this organism to be approximately 110 Mbp. A total of 442 326 SNPs corresponding to 3540 SNPs/Mbps were identified as being included in all seven genomes. The SNPs occurred more frequently in repeated sequences (85%), although 4501 SNPs were also identified in the coding regions of 2587 genes. Using the ratio of nonsynonymous mutations per nonsynonymous site (pN) to synonymous mutations per synonymous site (pS) and Tajima's D index scanning the whole genome, we were able to identify genomic regions and genes potentially subjected to positive or purifying selection. The SNPs identified represent a valuable resource for future population genetics and genomics studies. © 2015 John Wiley & Sons Ltd.
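The SNP count and SNP density quoted above are linked by simple arithmetic, sketched below. The abstract does not state which sequence length served as the denominator, so the calculation only shows what length the two reported figures imply; the function names are hypothetical.

```python
def implied_length_mbp(snp_count, snps_per_mbp):
    """Sequence length (Mbp) implied by a SNP count and a density."""
    return snp_count / snps_per_mbp


def snp_density_per_mbp(snp_count, length_mbp):
    """SNPs per Mbp for a given SNP count and sequence length."""
    return snp_count / length_mbp


# Figures as reported in the abstract: 442 326 SNPs at 3540 SNPs/Mbp.
print(round(implied_length_mbp(442_326, 3540), 1))  # 125.0
# Density that the ~110 Mbp core genome estimate would give instead.
print(round(snp_density_per_mbp(442_326, 110)))     # 4021
```

The reported density therefore corresponds to a denominator of roughly 125 Mbp rather than the approximately 110 Mbp core genome estimate. Whether that reflects the full reference assembly length is an inference, not something the abstract states.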
USDA-ARS's Scientific Manuscript database
Longan (Dimocarpus longan Lour.) is an important tropical fruit tree crop. Accurate varietal identification is essential for germplasm management and breeding. Using longan transcriptome sequences from public databases, we developed single nucleotide polymorphism (SNP) markers; validated 60 SNPs in...
USDA-ARS's Scientific Manuscript database
Single-nucleotide polymorphisms (SNPs) are highly abundant markers, which are broadly distributed in animal genomes. For rainbow trout, SNP discovery has been done through sequencing of restriction-site associated DNA (RAD) libraries, reduced representation libraries (RRL), RNA sequencing, and whole...
Yang, Xiao; Richardson, Patricia A.; Hong, Chuanxue
2014-01-01
A novel Phytophthora species was frequently recovered from irrigation reservoirs at several ornamental plant production facilities in eastern Virginia. Initial sequencing of the internal transcribed spacer (ITS) region of this species generated unreadable sequences due to continual polymorphic positions. Cloning and sequencing the ITS region as well as sequencing the mitochondrially encoded cytochrome c oxidase 1 and beta-tubulin genes revealed that it is a hybrid between P. taxon PgChlamydo as its paternal parent and an unknown species genetically close to P. mississippiae as its maternal parent. This hybrid has some diagnostic morphological features of P. taxon PgChlamydo and P. mississippiae. It produces catenulate hyphal swellings, characteristic of P. mississippiae, and chlamydospores, typical of P. taxon PgChlamydo. It also produces both ornamented and relatively smooth-walled oogonia. Ornamented oogonia are another important diagnostic character of P. mississippiae. The relatively smooth-walled oogonia may be indicative of oogonial character of P. taxon PgChlamydo. The new hybrid is described here as Phytophthora ×stagnum. PMID:25072374
MHC class II B diversity in blue tits: a preliminary study.
Aguilar, Juan Rivero-de; Schut, Elske; Merino, Santiago; Martínez, Javier; Komdeur, Jan; Westerdahl, Helena
2013-07-01
In this study, we partly characterize major histocompatibility complex (MHC) class II B in the blue tit (Cyanistes caeruleus). A total of 22 individuals from three different European locations (Spain, The Netherlands, and Sweden) were screened for MHC allelic diversity. The MHC genes were investigated using both PCR-based methods and unamplified genomic DNA with restriction fragment length polymorphism (RFLP) and Southern blots. A total of 13 different exon 2 sequences were obtained independently from DNA and/or RNA, thus confirming gene transcription and likely functionality of the genes. Nine out of 13 alleles were found in more than one country, and two alleles appeared in all countries. Positive selection was detected in the region coding for the peptide binding region (PBR). A maximum of three alleles per individual was detected by sequencing and the RFLP pattern consisted of 4-7 fragments, indicating a minimum number of 2-4 loci per individual. A phylogenetic analysis demonstrated that the blue tit sequences are divergent compared to sequences from other passerines, resembling a different MHC lineage than those possessed by most passerines studied to date.
MHC class II B diversity in blue tits: a preliminary study
Aguilar, Juan Rivero-de; Schut, Elske; Merino, Santiago; Martínez, Javier; Komdeur, Jan; Westerdahl, Helena
2013-01-01
In this study, we partly characterize major histocompatibility complex (MHC) class II B in the blue tit (Cyanistes caeruleus). A total of 22 individuals from three different European locations (Spain, The Netherlands, and Sweden) were screened for MHC allelic diversity. The MHC genes were investigated using both PCR-based methods and unamplified genomic DNA with restriction fragment length polymorphism (RFLP) and Southern blots. A total of 13 different exon 2 sequences were obtained independently from DNA and/or RNA, thus confirming gene transcription and likely functionality of the genes. Nine out of 13 alleles were found in more than one country, and two alleles appeared in all countries. Positive selection was detected in the region coding for the peptide binding region (PBR). A maximum of three alleles per individual was detected by sequencing and the RFLP pattern consisted of 4–7 fragments, indicating a minimum number of 2–4 loci per individual. A phylogenetic analysis demonstrated that the blue tit sequences are divergent compared to sequences from other passerines, resembling a different MHC lineage than those possessed by most passerines studied to date. PMID:23919136
BS-virus-finder: virus integration calling using bisulfite sequencing data.
Gao, Shengjie; Hu, Xuesong; Xu, Fengping; Gao, Changduo; Xiong, Kai; Zhao, Xiao; Chen, Haixiao; Zhao, Shancen; Wang, Mengyao; Fu, Dongke; Zhao, Xiaohui; Bai, Jie; Mao, Likai; Li, Bo; Wu, Song; Wang, Jian; Li, Shengbin; Yang, Huangming; Bolund, Lars; Pedersen, Christian N S
2018-01-01
DNA methylation plays a key role in the regulation of gene expression and carcinogenesis. Bisulfite sequencing studies mainly focus on calling single nucleotide polymorphisms, identifying differentially methylated regions, and finding allele-specific DNA methylation. Until now, only a few software tools have focused on virus integration using bisulfite sequencing data. We have developed a new and easy-to-use software tool, named BS-virus-finder (BSVF, RRID:SCR_015727), to detect viral integration breakpoints in whole human genomes. The tool is hosted at https://github.com/BGI-SZ/BSVF. BS-virus-finder demonstrates high sensitivity and specificity. It is useful in epigenetic studies and for revealing the relationship between viral integration and DNA methylation. BS-virus-finder is the first software tool to detect virus integration loci by using bisulfite sequencing data. © The Authors 2017. Published by Oxford University Press.
Li, Xiaobai; Jin, Feng; Jin, Liang; Jackson, Aaron; Huang, Cheng; Li, Kehu; Shu, Xiaoli
2014-12-05
Cymbidium is a genus of 68 species in the orchid family, with extremely high ornamental value. Marker-assisted selection has proven to be an effective strategy in accelerating plant breeding for many plant species. Analysis of the genetic background of cymbidiums by molecular markers can be of great value in assisting parental selection and breeding strategy design; however, in plants such as cymbidiums, limited genomic resources exist. In order to obtain efficient markers, we deep sequenced the C. ensifolium transcriptome to identify simple sequence repeats derived from gene regions (genic-SSR). A total of 7,936 genic-SSR markers were identified. A total of 80 genic-SSRs were selected, and primers were designed according to their flanking sequences. Of the 80 genic-SSR primer sets, 62 were amplified in C. ensifolium successfully, and 55 showed polymorphism when cross-tested among 9 Cymbidium species comprising 59 accessions. Unigenes containing the 62 genic-SSRs were searched against the Non-redundant (Nr), Gene Ontology (GO), eukaryotic orthologous groups (KOGs) and Kyoto Encyclopedia of Genes and Genomes (KEGG) databases. The search resulted in 53 matching Nr sequences, of which 39 had GO terms, 18 were assigned to KOGs, and 15 were annotated with KEGG. Genetic diversity and population structure were analyzed based on 55 polymorphic genic-SSR data among 59 accessions. The genetic distance averaged 0.3911, ranging from 0.016 to 0.618. The polymorphism information content (PIC) of the 55 polymorphic markers averaged 0.407, ranging from 0.033 to 0.863. A model-based clustering analysis revealed that five genetic groups existed in the collection. Accessions from the same species were typically grouped together; however, C. goeringii accessions did not always form a separate cluster, suggesting that C. goeringii accessions were polyphyletic.
The genic-SSRs identified in this study constitute a set of markers that can be applied across multiple Cymbidium species and used for the evaluation of genetic relationships as well as qualitative and quantitative trait mapping studies. Genic-SSRs coupled with the functional annotations provided by the unigenes will aid in mapping candidate genes of specific function.
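The PIC values quoted in the abstract above can be illustrated with a short sketch. The abstract does not say which estimator was used; the code below assumes the commonly cited formula of Botstein et al., PIC = 1 - sum(p_i^2) - sum over i<j of 2 * p_i^2 * p_j^2, and the function name and allele frequencies are purely illustrative, not data from the study.

```python
from itertools import combinations

def pic(allele_freqs):
    """Polymorphism information content for one marker.

    Assumes the Botstein-style formula; allele_freqs must sum to 1.
    """
    if abs(sum(allele_freqs) - 1.0) > 1e-9:
        raise ValueError("allele frequencies must sum to 1")
    # Probability that two random alleles are identical.
    homozygosity = sum(p * p for p in allele_freqs)
    # Correction for uninformative matings between identical heterozygotes.
    cross = sum(2 * (p * p) * (q * q) for p, q in combinations(allele_freqs, 2))
    return 1.0 - homozygosity - cross

if __name__ == "__main__":
    # Hypothetical two-allele marker with equal frequencies.
    print(pic([0.5, 0.5]))
    # Hypothetical marker with one common and several rare alleles.
    print(pic([0.7, 0.1, 0.1, 0.1]))
```

A monomorphic marker gives a PIC of 0, and markers with many alleles at similar frequencies approach 1, which is consistent with the wide range reported for the 55 markers.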
Kobayashi, Masaaki; Nagasaki, Hideki; Garcia, Virginie; Just, Daniel; Bres, Cécile; Mauxion, Jean-Philippe; Le Paslier, Marie-Christine; Brunel, Dominique; Suda, Kunihiro; Minakuchi, Yohei; Toyoda, Atsushi; Fujiyama, Asao; Toyoshima, Hiromi; Suzuki, Takayuki; Igarashi, Kaori; Rothan, Christophe; Kaminuma, Eli; Nakamura, Yasukazu; Yano, Kentaro; Aoki, Koh
2014-02-01
Tomato (Solanum lycopersicum) is regarded as a model plant of the Solanaceae family. The genome sequencing of the tomato cultivar 'Heinz 1706' was recently completed. To accelerate the progress of tomato genomics studies, systematic bioresources, such as mutagenized lines and full-length cDNA libraries, have been established for the cultivar 'Micro-Tom'. However, these resources cannot be utilized to their full potential without the completion of the genome sequencing of 'Micro-Tom'. We undertook the genome sequencing of 'Micro-Tom' and here report the identification of single nucleotide polymorphisms (SNPs) and insertion/deletions (indels) between 'Micro-Tom' and 'Heinz 1706'. The analysis demonstrated the presence of 1.23 million SNPs and 0.19 million indels between the two cultivars. The density of SNPs and indels was high in chromosomes 2, 5 and 11, but was low in chromosomes 6, 8 and 10. Three known mutations of 'Micro-Tom' were localized on chromosomal regions where the density of SNPs and indels was low, which was consistent with the fact that these mutations were relatively new and introgressed into 'Micro-Tom' during the breeding of this cultivar. We also report SNP analysis for two 'Micro-Tom' varieties that have been maintained independently in Japan and France, both of which have served as standard lines for 'Micro-Tom' mutant collections. Approximately 28,000 SNPs were identified between these two 'Micro-Tom' lines. These results provide high-resolution DNA polymorphic information on 'Micro-Tom' and represent a valuable contribution to the 'Micro-Tom'-based genomics resources.
Vogt, Julia; Wernstedt, Annekatrin; Ripperger, Tim; Pabst, Brigitte; Zschocke, Johannes; Kratz, Christian; Wimmer, Katharina
2016-01-01
Biallelic PMS2 mutations are responsible for more than half of all cases of constitutional mismatch repair deficiency (CMMRD), a recessively inherited childhood cancer predisposition syndrome. The mismatch repair gene PMS2 is partly embedded within one copy of an inverted 100-kb low-copy repeat (LCR) on 7p22.1. In an individual with CMMRD syndrome, PMS2 was found to be homozygously inactivated by a complex chromosomal rearrangement, which separates the 5′-part from the 3′-part of the gene. The rearrangement involves sequences of the inverted 100-kb LCR and a human endogenous retrovirus element and may be associated with an inversion that is indistinguishable from the known inversion polymorphism affecting the ~0.7-Mb sequence intervening the LCR. Its formation is best explained by a replication-based mechanism (RBM) such as fork stalling and template switching/microhomology-mediated break-induced replication (FoSTeS/MMBIR). This finding supports the hypothesis that the inverted LCR not only facilitates the formation of the non-allelic homologous recombination-mediated inversion polymorphism but also promotes the occurrence of more complex rearrangements that can likewise be associated with a large inversion but are mediated by an RBM. This further suggests that more complex rearrangements might be hidden among the inversion polymorphisms on 7p22.1. Furthermore, as the locus is embedded in a common fragile site (CFS) region, this rearrangement also supports the recently raised hypothesis that CFS sequence motifs may facilitate replication-based rearrangement mechanisms. PMID:27329736
Yuan, Youhua; Wen, Yan; You, Yuangang; Xing, Yan; Li, Huanying; Weng, Xiaoman; Wu, Nan; Liu, Shuang; Zhang, Shanshan; Zhang, Wenhong; Zhang, Ying
2015-01-01
Leprosy continues to be prevalent in some mountainous regions of China, and genotypes of leprosy strains endemic to the country are not known. Mycobacterium lepromatosis is a new species that was discovered in Mexico in 2008, and it remains unclear whether this species exists in China. Here, we conducted PCR-restriction fragment length polymorphism (RFLP) analysis to classify genotypes of 85 DNA samples collected from patients from 18 different provinces. All 171 DNA samples from skin biopsies of leprosy patients were tested for the presence of Mycobacterium leprae and Mycobacterium lepromatosis by amplifying the 16S rRNA gene using nested PCR, followed by DNA sequencing. The new species M. lepromatosis was not found among the 171 specimens from leprosy patients in 22 provinces in China. However, we found three SNP genotypes among 85 leprosy patients. A mutation at C251T in the 16S rRNA gene was found in 76% of the strains. We also found that the strains that showed the 16S rRNA C251T mutation belonged to SNP type 3, whereas strains without the point mutation belonged to SNP type 1. The SNP type 3 leprosy strains were observed in patients from both the inner and coastal regions of China, but the SNP type 1 strains were found only in the coastal region. This indicated that the SNP type 3 leprosy strains were more prevalent than the SNP type 1 strains in China. In addition, the 16S rRNA gene sequence mutation at C251T also indicated a difference in the geographical distribution of the strains. To our knowledge, this is the first report of a new polymorphism in the 16S rRNA gene in M. leprae in China. Our findings shed light on the prevalent genotypes and provide insights into leprosy transmission that are important for leprosy control in China.
Genetic linkage map and QTL identification for adventitious rooting traits in red gum eucalypts.
Sumathi, Murugan; Bachpai, Vijaya Kumar Waman; Mayavel, A; Dasgupta, Modhumita Ghosh; Nagarajan, Binai; Rajasugunasekar, D; Sivakumar, Veerasamy; Yasodha, Ramasamy
2018-05-01
The eucalypt species Eucalyptus tereticornis and Eucalyptus camaldulensis show tolerance to drought and salinity conditions, respectively, and are widely cultivated in arid and semiarid regions of tropical countries. In this study, a genetic linkage map was developed for the interspecific cross E. tereticornis × E. camaldulensis using a pseudo-testcross strategy with simple sequence repeat (SSR), intersimple sequence repeat (ISSR), and sequence-related amplified polymorphism (SRAP) markers. The consensus genetic map comprised a total of 283 markers (84 SSRs, 94 ISSRs, and 105 SRAP markers) on 11 linkage groups spanning a genetic distance of 1163.4 cM. Blasting the SSR sequences against E. grandis sequences allowed an alignment of 64%, and the average ratio of genetic-to-physical distance was 1.7 Mbp/cM, which strengthens the evidence that a high degree of synteny and colinearity exists among eucalypt genomes. Blast searches also revealed that 37% of SSRs had homologies with genes, which could potentially be used in a variety of downstream applications, including candidate gene polymorphism. Quantitative trait loci (QTL) analysis for adventitious rooting traits revealed six QTL for rooting percent and root length on five chromosomes with interval and composite interval mapping. All the QTL explained 12.0-14.7% of the phenotypic variance, showing the involvement of major-effect QTL in adventitious rooting traits. Increasing the density of markers would facilitate the detection of more small-effect QTL and help identify the genes involved in the rooting process.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Abraham, Paul E.; Wang, Xiaojing; Ranjan, Priya; ...
2015-10-20
The availability of next-generation sequencing technologies has rapidly transformed our ability to link genotypes to phenotypes, and as such, promises to facilitate the dissection of genetic contribution to complex traits. Although discoveries of genetic associations will further our understanding of biology, once candidate variants have been identified, investigators are faced with the challenge of characterizing the functional effects on proteins encoded by such genes. Here we show how next-generation RNA sequencing data can be exploited to construct genotype-specific protein sequence databases, which provide a clearer picture of the molecular toolbox underlying cellular and organismal processes and their variation in a natural population. For this study, we used two individual genotypes (DENA-17-3 and VNDL-27-4) from a recent genome wide association (GWA) study of Populus trichocarpa, an obligate outcrosser that exhibits tremendous phenotypic variation across the natural population. This strategy allowed us to comprehensively catalogue proteins containing single amino acid polymorphisms (SAAPs) and insertions and deletions (INDELS). Based on large-scale identification of SAAPs, we profiled the frequency of 128 types of naturally occurring amino acid substitutions, with a subset of SAAPs occurring in regions of the genome having strong polymorphism patterns consistent with recent positive and/or divergent selection. In addition, we were able to explore the diploid landscape of Populus at the proteome-level, allowing the characterization of heterozygous variants.
Identification and expression analysis of cDNA encoding insulin-like growth factor 2 in horses
KIKUCHI, Kohta; SASAKI, Keisuke; AKIZAWA, Hiroki; TSUKAHARA, Hayato; BAI, Hanako; TAKAHASHI, Masashi; NAMBO, Yasuo; HATA, Hiroshi; KAWAHARA, Manabu
2017-01-01
Insulin-like growth factor 2 (IGF2) is responsible for a broad range of physiological processes during fetal development and adulthood, but genomic analyses of IGF2 containing the 5ʹ- and 3ʹ-untranslated regions (UTRs) in equines have been limited. In this study, we characterized the IGF2 mRNA containing the UTRs, and determined its expression pattern in the fetal tissues of horses. The complete equine IGF2 mRNA sequence harboring another exon approximately 2.8 kb upstream from the canonical transcription start site was identified as a new transcript variant. As this upstream exon did not contain the start codon, the amino acid sequence was identical to the canonical variant. Analysis of the deduced amino acid sequence revealed that the protein possessed two major domains, IlGF and IGF2_C, and analysis of IGF2 sequence polymorphism in fetal tissues of Hokkaido native horse and Thoroughbreds revealed a single nucleotide polymorphism (T to C transition) at position 398 in Thoroughbreds, which caused an amino acid substitution at position 133 in the IGF2 sequence. Furthermore, the expression pattern of the IGF2 mRNA in the fetal tissues of horses was determined for the first time, and was found to be consistent with those of other species. Taken together, these results suggested that the transcriptional and translational products of the IGF2 gene have conserved functions in the fetal development of mammals, including horses. PMID:29151450
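The link between nucleotide position 398 and amino acid position 133 in the abstract above follows from codon arithmetic. The sketch below assumes that position 398 is counted from the first base of the start codon (1-based coding-sequence coordinates), which the abstract does not state explicitly; the function name is hypothetical.

```python
def codon_position(nt_pos):
    """Map a 1-based coding-sequence nucleotide position to
    (1-based amino acid index, 1-based base within the codon).

    Assumes nt_pos is counted from the A of the ATG start codon.
    """
    if nt_pos < 1:
        raise ValueError("nucleotide position must be >= 1")
    aa_index = (nt_pos - 1) // 3 + 1      # which codon the base falls in
    base_in_codon = (nt_pos - 1) % 3 + 1  # 1, 2 or 3
    return aa_index, base_in_codon

if __name__ == "__main__":
    aa_index, base_in_codon = codon_position(398)
    print(f"nucleotide 398 -> codon {aa_index}, base {base_in_codon}")
```

Under that numbering assumption, nucleotide 398 is the second base of codon 133, which agrees with the reported substitution at amino acid position 133.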
Cloutier, Sylvie; Miranda, Evelyn; Ward, Kerry; Radovanovic, Natasa; Reimer, Elsa; Walichnowski, Andrzej; Datla, Raju; Rowland, Gordon; Duguid, Scott; Ragupathy, Raja
2012-08-01
Flax is an important oilseed crop in North America and is mostly grown as a fibre crop in Europe. As a self-pollinated diploid with a small estimated genome size of ~370 Mb, flax is well suited for fast progress in genomics. In the last few years, important genetic resources have been developed for this crop. Here, we describe the assessment and comparative analyses of 1,506 putative simple sequence repeats (SSRs) of which, 1,164 were derived from BAC-end sequences (BESs) and 342 from expressed sequence tags (ESTs). The SSRs were assessed on a panel of 16 flax accessions with 673 (58 %) and 145 (42 %) primer pairs being polymorphic in the BESs and ESTs, respectively. With 818 novel polymorphic SSR primer pairs reported in this study, the repertoire of available SSRs in flax has more than doubled from the combined total of 508 of all previous reports. Among nucleotide motifs, trinucleotides were the most abundant irrespective of the class, but dinucleotides were the most polymorphic. SSR length was also positively correlated with polymorphism. Two dinucleotide (AT/TA and AG/GA) and two trinucleotide (AAT/ATA/TAA and GAA/AGA/AAG) motifs and their iterations, different from those reported in many other crops, accounted for more than half of all the SSRs and were also more polymorphic (63.4 %) than the rest of the markers (42.7 %). This improved resource promises to be useful in genetic, quantitative trait loci (QTL) and association mapping as well as for anchoring the physical/genetic map with the whole genome shotgun reference sequence of flax.
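The percentages in the flax abstract above can be checked with simple arithmetic. The sketch below uses only the counts given in the abstract; the helper name is an illustrative assumption.

```python
def polymorphic_rate(polymorphic, tested):
    """Percentage of tested primer pairs that were polymorphic."""
    if tested <= 0:
        raise ValueError("tested must be positive")
    return 100.0 * polymorphic / tested

# Counts taken from the abstract.
BES_TESTED, BES_POLY = 1164, 673
EST_TESTED, EST_POLY = 342, 145
PREVIOUS_SSRS = 508

if __name__ == "__main__":
    print(f"BES-derived: {polymorphic_rate(BES_POLY, BES_TESTED):.1f} %")
    print(f"EST-derived: {polymorphic_rate(EST_POLY, EST_TESTED):.1f} %")
    new_total = BES_POLY + EST_POLY
    print(f"new polymorphic SSRs: {new_total}")
    # "More than doubled": previous plus new exceeds twice the previous total.
    print(PREVIOUS_SSRS + new_total > 2 * PREVIOUS_SSRS)
```

The rounded rates match the 58 % and 42 % quoted for BES- and EST-derived SSRs, and the two polymorphic counts sum to the 818 new primer pairs.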
Wettstein, P J; States, J S
1986-01-01
The extent of polymorphism and the rate of divergence of class I and class II sequences mapping to the mammalian major histocompatibility complex (MHC) have been the subject of experimentation and speculation. To provide further insight into the evolution of the MHC we have initiated the analysis of two geographically isolated subspecies of tassel-eared squirrels. In the preceding communication we described the number and polymorphism of TSLA class I and class II sequences in Kaibab squirrels (S. aberti kaibabensis), which live north of the Grand Canyon. In this report we present a parallel analysis of Abert squirrels (S. aberti aberti), which live south of the Grand Canyon in northern Arizona. Genomic DNA from 12 Abert squirrels was digested with restriction enzymes, electrophoresed, blotted, and hybridized with DR alpha, DR beta, DQ alpha, DQ beta, and HLA-B7 probes. The results of these hybridizations were remarkably similar to those obtained in Kaibab squirrels. The majority of class I and class II bands were identical in size and number, suggesting that Abert and Kaibab squirrels have not significantly diverged in the TSLA complex despite their geographical separation. Relative polymorphism of class II sequences was similar to that observed with Kaibab squirrels: beta sequences exhibited higher polymorphism than alpha sequences. As in Kaibab squirrels, a number of alpha and beta sequences were apparently carried on the same fragments. In comparison to class II beta sequences, there was limited polymorphism in class I sequences, although a diverse number of class I genotypes were observed. Attempts to identify segregating TSLA haplotypes were futile in that the only families of sequences with concordant distributions were DQ alpha and DQ beta. 
These observations and those obtained with Kaibab squirrels suggest that the present-day TSLA haplotypes of both subspecies are derived from a limited number of common, progenitor haplotypes through repeated intra-TSLA recombination.
Metzger, Julia; Tonda, Raul; Beltran, Sergi; Agueda, Lídia; Gut, Marta; Distl, Ottmar
2014-07-04
Domestication has shaped the horse and led to a group of many different types. Some have been under strong human selection while others developed in close relationship with nature. The aim of our study was to perform next generation sequencing of breed and non-breed horses to provide an insight into genetic influences on selective forces. Whole genome sequencing of five horses of four different populations revealed 10,193,421 single nucleotide polymorphisms (SNPs) and 1,361,948 insertion/deletion polymorphisms (indels). In comparison to horse variant databases and previous reports, we were able to identify 3,394,883 novel SNPs and 868,525 novel indels. We analyzed the distribution of individual variants and found significant enrichment of private mutations in coding regions of genes involved in primary metabolic processes, anatomical structures, morphogenesis and cellular components in non-breed horses. In contrast, breed horses showed enrichment of private mutations in genes affecting cell communication, lipid metabolic process, neurological system process, muscle contraction, ion transport, and developmental processes of the nervous system and ectoderm. Our next generation sequencing data constitute an important first step for the characterization of non-breed in comparison to breed horses and provide a large number of novel variants for future analyses. Functional annotations suggest specific variants that could play a role in the characterization of breed or non-breed horses.
The structure of the human interferon alpha/beta receptor gene.
Lutfalla, G; Gardiner, K; Proudhon, D; Vielh, E; Uzé, G
1992-02-05
Using the cDNA coding for the human interferon alpha/beta receptor (IFNAR), the IFNAR gene has been physically mapped relative to the other loci of the chromosome 21q22.1 region. 32,906 base pairs covering the IFNAR gene have been cloned and sequenced. Primer extension and solution hybridization-ribonuclease protection have been used to determine that the transcription of the gene is initiated in a broad region of 20 base pairs. Some aspects of the polymorphism of the gene, including noncoding sequences, have been analyzed; some are allelic differences in the coding sequence that induce amino acid variations in the resulting protein. The exon structures of the IFNAR gene and of the available genes for the receptors of the cytokine/growth hormone/prolactin/interferon receptor family have been compared with the predictions for the secondary structure of those receptors. From this analysis, we postulate a common origin and propose a hypothesis for the divergence from the immunoglobulin superfamily.
Pinto, J V C; Crispim, B A; Vasconcelos, A A; Geelen, D; Grisolia, A B; Vieira, M C
2016-01-08
Schinus terebinthifolius Raddi is a perennial native to the Atlantic forest. It has high ecological plasticity and is used in traditional medicine. Based on promising reports concerning its bioactivity, it was included as a species of great interest for distribution through the National Health System. A number of agronomic studies to guide its crop production are therefore underway. This study examined diversity and phylogenetic relationships among native S. terebinthifolius populations from different Brazilian ecosystems: Cerrado; sandbanks; dense rainforest; and deciduous forest. The intergenic regions rpl20-5'rps12, trnH-psbA, and trnS-trnG were sequenced from cpDNA and aligned using BLASTn. There were few fragments for comparison in GenBank, and so only the trnS-trnG region was informative. There were variations among and within populations, with intravarietal polymorphisms and three distinct haplotypes (HpSM, HpDDO, HpNE), since populations from NE (sandbanks and rainforest) clustered together. Sequences from HpSM, HpNE, and HpDDO returned greater similarity to haplotypes A (AY928398.1), B (AY928399.1), and C (AY928400.1), respectively. A network, built by median-joining among native haplotypes and 10 available on GenBank, revealed HpSM as the origin of all other haplogroups. HpDDO showed the most mutations and was closely related to haplogroups from Argentina. While this could indicate hybridization, we believe that the polymorphisms resulted from adaptation to events such as deforestation, fire, rising temperature, and seasonal drought during the transition from Atlantic forest to Cerrado. While more detailed phylogeographical studies are needed, these results indicate eligible groups for distinct climates as an important step for pre-breeding programs before field propagation.