sequence repeat loci: Topics by Science.gov

Sample records for sequence repeat loci

[Mutation Analysis of 19 STR Loci in 20 723 Cases of Paternity Testing].

PubMed

Bi, J; Chang, J J; Li, M X; Yu, C Y

2017-06-01

To observe and analyze the confirmed cases of paternity testing, and to explore the mutation rules of STR loci. The mutant STR loci were screened from 20 723 confirmed cases of paternity testing by Goldeneye 20A system．The mutation rates, and the sources, fragment length, steps and increased or decreased repeat sequences of mutant alleles were counted for the analysis of the characteristics of mutation-related factors. A total of 548 mutations were found on 19 STR loci, and 557 mutation events were observed. The loci mutation rate was 0.07‰-2.23‰. The ratio of paternal to maternal mutant events was 3.06:1. One step mutation was the main mutation, and the number of the increased repeat sequences was almost the same as the decreased repeat sequences. The repeat sequences were more likely to decrease in two steps mutation and above. Mutation mainly occurred in the medium allele, and the number of the increased repeat sequences was almost the same as the decreased repeat sequences. In long allele mutations, the decreased repeat sequences were significantly more than the increased repeat sequences. The number of the increased repeat sequences was almost the same as the decreased repeat sequences in paternal mutation, while the decreased repeat sequences were more than the increased in maternal mutation. There are significant differences in the mutation rate of each locus. When one or two loci do not conform to the genetic law, other detection system should be added, and PI value should be calculated combined with the information of the mutate STR loci in order to further clarify the identification opinions. Copyright© by the Editorial Department of Journal of Forensic Medicine
Population-scale whole genome sequencing identifies 271 highly polymorphic short tandem repeats from Japanese population.

PubMed

Hirata, Satoshi; Kojima, Kaname; Misawa, Kazuharu; Gervais, Olivier; Kawai, Yosuke; Nagasaki, Masao

2018-05-01

Forensic DNA typing is widely used to identify missing persons and plays a central role in forensic profiling. DNA typing usually uses capillary electrophoresis fragment analysis of PCR amplification products to detect the length of short tandem repeat (STR) markers. Here, we analyzed whole genome data from 1,070 Japanese individuals generated using massively parallel short-read sequencing of 162 paired-end bases. We have analyzed 843,473 STR loci with two to six basepair repeat units and cataloged highly polymorphic STR loci in the Japanese population. To evaluate the performance of the cataloged STR loci, we compared 23 STR loci, widely used in forensic DNA typing, with capillary electrophoresis based STR genotyping results in the Japanese population. Seventeen loci had high correlations and high call rates. The other six loci had low call rates or low correlations due to either the limitations of short-read sequencing technology, the bioinformatics tool used, or the complexity of repeat patterns. With these analyses, we have also purified the suitable 218 STR loci with four basepair repeat units and 53 loci with five basepair repeat units both for short read sequencing and PCR based technologies, which would be candidates to the actual forensic DNA typing in Japanese population.
Phylogeny and strain typing of Escherichia coli, inferred from variation at mononucleotide repeat loci.

PubMed

Diamant, Eran; Palti, Yniv; Gur-Arie, Riva; Cohen, Helit; Hallerman, Eric M; Kashi, Yechezkel

2004-04-01

Multilocus sequencing of housekeeping genes has been used previously for bacterial strain typing and for inferring evolutionary relationships among strains of Escherichia coli. In this study, we used shorter intergenic sequences that contained simple sequence repeats (SSRs) of repeating mononucleotide motifs (mononucleotide repeats [MNRs]) to infer the phylogeny of pathogenic and commensal E. coli strains. Seven noncoding loci (four MNRs and three non-SSRs) were sequenced in 27 strains, including enterohemorrhagic (six isolates of O157:H7), enteropathogenic, enterotoxigenic, B, and K-12 strains. The four MNRs were also sequenced in 20 representative strains of the E. coli reference (ECOR) collection. Sequence polymorphism was significantly higher at the MNR loci, including the flanking sequences, indicating a higher mutation rate in the sequences flanking the MNR tracts. The four MNR loci were amplifiable by PCR in the standard ECOR A, B1, and D groups, but only one (yaiN) in the B2 group was amplified, which is consistent with previous studies that suggested that B2 is the most ancient group. High sequence compatibility was found between the four MNR loci, indicating that they are in the same clonal frame. The phylogenetic trees that were constructed from the sequence data were in good agreement with those of previous studies that used multilocus enzyme electrophoresis. The results demonstrate that MNR loci are useful for inferring phylogenetic relationships and provide much higher sequence variation than housekeeping genes. Therefore, the use of MNR loci for multilocus sequence typing should prove efficient for clinical diagnostics, epidemiology, and evolutionary study of bacteria.
Phylogeny and Strain Typing of Escherichia coli, Inferred from Variation at Mononucleotide Repeat Loci

PubMed Central

Diamant, Eran; Palti, Yniv; Gur-Arie, Riva; Cohen, Helit; Hallerman, Eric M.; Kashi, Yechezkel

2004-01-01

Multilocus sequencing of housekeeping genes has been used previously for bacterial strain typing and for inferring evolutionary relationships among strains of Escherichia coli. In this study, we used shorter intergenic sequences that contained simple sequence repeats (SSRs) of repeating mononucleotide motifs (mononucleotide repeats [MNRs]) to infer the phylogeny of pathogenic and commensal E. coli strains. Seven noncoding loci (four MNRs and three non-SSRs) were sequenced in 27 strains, including enterohemorrhagic (six isolates of O157:H7), enteropathogenic, enterotoxigenic, B, and K-12 strains. The four MNRs were also sequenced in 20 representative strains of the E. coli reference (ECOR) collection. Sequence polymorphism was significantly higher at the MNR loci, including the flanking sequences, indicating a higher mutation rate in the sequences flanking the MNR tracts. The four MNR loci were amplifiable by PCR in the standard ECOR A, B1, and D groups, but only one (yaiN) in the B2 group was amplified, which is consistent with previous studies that suggested that B2 is the most ancient group. High sequence compatibility was found between the four MNR loci, indicating that they are in the same clonal frame. The phylogenetic trees that were constructed from the sequence data were in good agreement with those of previous studies that used multilocus enzyme electrophoresis. The results demonstrate that MNR loci are useful for inferring phylogenetic relationships and provide much higher sequence variation than housekeeping genes. Therefore, the use of MNR loci for multilocus sequence typing should prove efficient for clinical diagnostics, epidemiology, and evolutionary study of bacteria. PMID:15066845
SSR allelic variation in almond (Prunus dulcis Mill.).

PubMed

Xie, Hua; Sui, Yi; Chang, Feng-Qi; Xu, Yong; Ma, Rong-Cai

2006-01-01

Sixteen SSR markers including eight EST-SSR and eight genomic SSRs were used for genetic diversity analysis of 23 Chinese and 15 international almond cultivars. EST- and genomic SSR markers previously reported in species of Prunus, mainly peach, proved to be useful for almond genetic analysis. DNA sequences of 117 alleles of six of the 16 SSR loci were analysed to reveal sequence variation among the 38 almond accessions. For the four SSR loci with AG/CT repeats, no insertions or deletions were observed in the flanking regions of the 98 alleles sequenced. Allelic size variation of these loci resulted exclusively from differences in the structures of repeat motifs, which involved interruptions or occurrences of new motif repeats in addition to varying number of AG/CT repeats. Some alleles had a high number of uninterrupted repeat motifs, indicating that SSR mutational patterns differ among alleles at a given SSR locus within the almond species. Allelic homoplasy was observed in the SSR loci because of base substitutions, interruptions or compound repeat motifs. Substitutions in the repeat regions were found at two SSR loci, suggesting that point mutations operate on SSRs and hinder the further SSR expansion by introducing repeat interruptions to stabilize SSR loci. Furthermore, it was shown that some potential point mutations in the flanking regions are linked with new SSR repeat motif variation in almond and peach.
Survey and Analysis of Microsatellites in the Silkworm, Bombyx mori

PubMed Central

Prasad, M. Dharma; Muthulakshmi, M.; Madhu, M.; Archak, Sunil; Mita, K.; Nagaraju, J.

2005-01-01

We studied microsatellite frequency and distribution in 21.76-Mb random genomic sequences, 0.67-Mb BAC sequences from the Z chromosome, and 6.3-Mb EST sequences of Bombyx mori. We mined microsatellites of ≥15 bases of mononucleotide repeats and ≥5 repeat units of other classes of repeats. We estimated that microsatellites account for 0.31% of the genome of B. mori. Microsatellite tracts of A, AT, and ATT were the most abundant whereas their number drastically decreased as the length of the repeat motif increased. In general, tri- and hexanucleotide repeats were overrepresented in the transcribed sequences except TAA, GTA, and TGA, which were in excess in genomic sequences. The Z chromosome sequences contained shorter repeat types than the rest of the chromosomes in addition to a higher abundance of AT-rich repeats. Our results showed that base composition of the flanking sequence has an influence on the origin and evolution of microsatellites. Transitions/transversions were high in microsatellites of ESTs, whereas the genomic sequence had an equal number of substitutions and indels. The average heterozygosity value for 23 polymorphic microsatellite loci surveyed in 13 diverse silkmoth strains having 2–14 alleles was 0.54. Only 36 (18.2%) of 198 microsatellite loci were polymorphic between the two divergent silkworm populations and 10 (5%) loci revealed null alleles. The microsatellite map generated using these polymorphic markers resulted in 8 linkage groups. B. mori microsatellite loci were the most conserved in its immediate ancestor, B. mandarina, followed by the wild saturniid silkmoth, Antheraea assama. PMID:15371363
Variable-Number Tandem Repeats That Are Useful in Genotyping Isolates of Salmonella enterica subsp. enterica Serovars Typhimurium and Newport▿

PubMed Central

Witonski, D. ; Stefanova, R.; Ranganathan, A.; Schutze, G. E.; Eisenach, K. D.; Cave, M. D.

2006-01-01

The genome of Salmonella enterica subsp. enterica serovar Typhimurium strain LT2 was analyzed for direct repeats, and 54 sequences containing variable-number tandem repeat loci were identified. Ten primer pairs that anneal upstream and downstream of each selected locus were designed and used to amplify PCR targets in isolates of S. enterica serovars Typhimurium and Newport. Four of the 10 loci did not show polymorphism in the length of products. Six loci were selected for analysis. Isolates of S. enterica serovars Typhimurium and Newport that were related to specific outbreaks and showed identical pulsed-field gel electrophoresis patterns were indistinguishable by the length of the six variable-number tandem repeats. Isolates that differed in their pulsed-field gel electrophoresis patterns showed polymorphism in variable-number tandem repeat profiles. Length of the products was confirmed by DNA sequence analysis. Only 2 of the 10 loci contained exact integers of the direct repeat. Eight loci contained partial copies. The partial copies were maintained at the ends of the variable-number tandem repeat loci in all isolates. In spite of having partial copies that were maintained in all isolates, the number of direct repeats at a locus was polymorphic. Six variable-number tandem repeat loci were useful in distinguishing isolates of S. enterica serovars Typhimurium and Newport that had different pulsed-field gel electrophoresis patterns and in identifying outbreak-associated cases that shared a common pulsed-field gel pattern. PMID:16943354
Variation in the genomic locations and sequence conservation of STAR elements among staphylococcal species provides insight into DNA repeat evolution

PubMed Central

2012-01-01

Background Staphylococcus aureus Repeat (STAR) elements are a type of interspersed intergenic direct repeat. In this study the conservation and variation in these elements was explored by bioinformatic analyses of published staphylococcal genome sequences and through sequencing of specific STAR element loci from a large set of S. aureus isolates. Results Using bioinformatic analyses, we found that the STAR elements were located in different genomic loci within each staphylococcal species. There was no correlation between the number of STAR elements in each genome and the evolutionary relatedness of staphylococcal species, however higher levels of repeats were observed in both S. aureus and S. lugdunensis compared to other staphylococcal species. Unexpectedly, sequencing of the internal spacer sequences of individual repeat elements from multiple isolates showed conservation at the sequence level within deep evolutionary lineages of S. aureus. Whilst individual STAR element loci were demonstrated to expand and contract, the sequences associated with each locus were stable and distinct from one another. Conclusions The high degree of lineage and locus-specific conservation of these intergenic repeat regions suggests that STAR elements are maintained due to selective or molecular forces with some of these elements having an important role in cell physiology. The high prevalence in two of the more virulent staphylococcal species is indicative of a potential role for STAR elements in pathogenesis. PMID:23020678
Development of novel simple sequence repeat markers in bitter gourd (Momordica charantia L.) through enriched genomic libraries and their utilization in analysis of genetic diversity and cross-species transferability.

PubMed

Saxena, Swati; Singh, Archana; Archak, Sunil; Behera, Tushar K; John, Joseph K; Meshram, Sudhir U; Gaikwad, Ambika B

2015-01-01

Microsatellite or simple sequence repeat (SSR) markers are the preferred markers for genetic analyses of crop plants. The availability of a limited number of such markers in bitter gourd (Momordica charantia L.) necessitates the development and characterization of more SSR markers. These were developed from genomic libraries enriched for three dinucleotide, five trinucleotide, and two tetranucleotide core repeat motifs. Employing the strategy of polymerase chain reaction-based screening, the number of clones to be sequenced was reduced by 81 % and 93.7 % of the sequenced clones contained in microsatellite repeats. Unique primer-pairs were designed for 160 microsatellite loci, and amplicons of expected length were obtained for 151 loci (94.4 %). Evaluation of diversity in 54 bitter gourd accessions at 51 loci indicated that 20 % of the loci were polymorphic with the polymorphic information content values ranging from 0.13 to 0.77. Fifteen Indian varieties were clearly distinguished indicative of the usefulness of the developed markers. Markers at 40 loci (78.4 %) were transferable to six species, viz. Momordica cymbalaria, Momordica subangulata subsp. renigera, Momordica balsamina, Momordica dioca, Momordica cochinchinesis, and Momordica sahyadrica. The microsatellite markers reported will be useful in various genetic and molecular genetic studies in bitter gourd, a cucurbit of immense nutritive, medicinal, and economic importance.
Mining and validation of pyrosequenced simple sequence repeats (SSRs) from American cranberry (Vaccinium macrocarpon Ait.).

PubMed

Zhu, H; Senalik, D; McCown, B H; Zeldin, E L; Speers, J; Hyman, J; Bassil, N; Hummer, K; Simon, P W; Zalapa, J E

2012-01-01

The American cranberry (Vaccinium macrocarpon Ait.) is a major commercial fruit crop in North America, but limited genetic resources have been developed for the species. Furthermore, the paucity of codominant DNA markers has hampered the advance of genetic research in cranberry and the Ericaceae family in general. Therefore, we used Roche 454 sequencing technology to perform low-coverage whole genome shotgun sequencing of the cranberry cultivar 'HyRed'. After de novo assembly, the obtained sequence covered 266.3 Mb of the estimated 540-590 Mb in cranberry genome. A total of 107,244 SSR loci were detected with an overall density across the genome of 403 SSR/Mb. The AG repeat was the most frequent motif in cranberry accounting for 35% of all SSRs and together with AAG and AAAT accounted for 46% of all loci discovered. To validate the SSR loci, we designed 96 primer-pairs using contig sequence data containing perfect SSR repeats, and studied the genetic diversity of 25 cranberry genotypes. We identified 48 polymorphic SSR loci with 2-15 alleles per locus for a total of 323 alleles in the 25 cranberry genotypes. Genetic clustering by principal coordinates and genetic structure analyzes confirmed the heterogeneous nature of cranberries. The parentage composition of several hybrid cultivars was evident from the structure analyzes. Whole genome shotgun 454 sequencing was a cost-effective and efficient way to identify numerous SSR repeats in the cranberry sequence for marker development.
Development of Pineapple Microsatellite Markers and Germplasm Genetic Diversity Analysis

PubMed Central

Tong, Helin; Chen, You; Wang, Jingyi; Chen, Yeyuan; Sun, Guangming; He, Junhu; Wu, Yaoting

2013-01-01

Two methods were used to develop pineapple microsatellite markers. Genomic library-based SSR development: using selectively amplified microsatellite assay, 86 sequences were generated from pineapple genomic library. 91 (96.8%) of the 94 Simple Sequence Repeat (SSR) loci were dinucleotide repeats (39 AC/GT repeats and 52 GA/TC repeats, accounting for 42.9% and 57.1%, resp.), and the other three were mononucleotide repeats. Thirty-six pairs of SSR primers were designed; 24 of them generated clear bands of expected sizes, and 13 of them showed polymorphism. EST-based SSR development: 5659 pineapple EST sequences obtained from NCBI were analyzed; among 1397 nonredundant EST sequences, 843 were found containing 1110 SSR loci (217 of them contained more than one SSR locus). Frequency of SSRs in pineapple EST sequences is 1SSR/3.73 kb, and 44 types were found. Mononucleotide, dinucleotide, and trinucleotide repeats dominate, accounting for 95.6% in total. AG/CT and AGC/GCT were the dominant type of dinucleotide and trinucleotide repeats, accounting for 83.5% and 24.1%, respectively. Thirty pairs of primers were designed for each of randomly selected 30 sequences; 26 of them generated clear and reproducible bands, and 22 of them showed polymorphism. Eighteen pairs of primers obtained by the one or the other of the two methods above that showed polymorphism were selected to carry out germplasm genetic diversity analysis for 48 breeds of pineapple; similarity coefficients of these breeds were between 0.59 and 1.00, and they can be divided into four groups accordingly. Amplification products of five SSR markers were extracted and sequenced, corresponding repeat loci were found and locus mutations are mainly in copy number of repeats and base mutations in the flanking region. PMID:24024187
Typing Clostridium difficile strains based on tandem repeat sequences

PubMed Central

2009-01-01

Background Genotyping of epidemic Clostridium difficile strains is necessary to track their emergence and spread. Portability of genotyping data is desirable to facilitate inter-laboratory comparisons and epidemiological studies. Results This report presents results from a systematic screen for variation in repetitive DNA in the genome of C. difficile. We describe two tandem repeat loci, designated 'TR6' and 'TR10', which display extensive sequence variation that may be useful for sequence-based strain typing. Based on an investigation of 154 C. difficile isolates comprising 75 ribotypes, tandem repeat sequencing demonstrated excellent concordance with widely used PCR ribotyping and equal discriminatory power. Moreover, tandem repeat sequences enabled the reconstruction of the isolates' largely clonal population structure and evolutionary history. Conclusion We conclude that sequence analysis of the two repetitive loci introduced here may be highly useful for routine typing of C. difficile. Tandem repeat sequence typing resolves phylogenetic diversity to a level equivalent to PCR ribotypes. DNA sequences may be stored in databases accessible over the internet, obviating the need for the exchange of reference strains. PMID:19133124
Simultaneous Differentiation and Typing of Entamoeba histolytica and Entamoeba dispar

PubMed Central

Zaki, Mehreen; Meelu, Parool; Sun, Wei; Clark, C. Graham

2002-01-01

Sequences corresponding to some of the polymorphic loci previously reported from Entamoeba histolytica have been detected in Entamoeba dispar. Comparison of nucleotide sequences of two loci between E. dispar strain SAW760 and E. histolytica strain HM-1:IMSS revealed significant differences in both repeat and flanking regions. The tandem repeat units varied not only in sequence but also in number and arrangement between the two species at both the loci. Using the sequences obtained, primer pairs aimed at amplifying species-specific products were designed and tested on a variety of E. histolytica and E. dispar samples. Amplification results were in complete agreement with the original species classification in all cases, and the PCR products displayed discernible size and pattern variations among the isolates. PMID:11923344
Bioinformatic mining of EST-SSR loci in the Pacific oyster, Crassostrea gigas.

PubMed

Wang, Y; Ren, R; Yu, Z

2008-06-01

A set of expressed sequence tag-simple sequence repeat (EST-SSR) markers of the Pacific oyster, Crassostrea gigas, was developed through bioinformatic mining of the GenBank public database. As of June 30, 2007, a total of 5132 EST sequences from GenBank were downloaded and screened for di-, tri- and tetra-nucleotide repeats, with criteria set at a minimum of 5, 4 and 4 repeats for the three categories of SSRs respectively. Seventeen polymorphic microsatellite markers were characterized. Allele numbers ranged from 3 to 10, and the observed and expected heterozygosity values varied from 0.125 to 0.770 and from 0.113 to 0.732 respectively. Eleven loci were at Hardy-Weinberg equilibrium (HWE); the other six loci showed significant departure from HWE (P < 0.01), suggesting possible presence of null alleles. Pairwise check of linkage disequilibrium (LD) indicated that 11 of 136 pairs of loci showed significant LD (P < 0.01), likely due to HWE present in single markers. Cross-species amplification was examined for five other Crassostrea species and reasonable results were obtained, promising usefulness of these markers in oyster genetics.
Cas6 is an endoribonuclease that generates guide RNAs for invader defense in prokaryotes

DOE Office of Scientific and Technical Information (OSTI.GOV)

Carte, Jason; Wang, Ruiying; Li, Hong

An RNA-based gene silencing pathway that protects bacteria and archaea from viruses and other genome invaders is hypothesized to arise from guide RNAs encoded by CRISPR loci and proteins encoded by the cas genes. CRISPR loci contain multiple short invader-derived sequences separated by short repeats. The presence of virus-specific sequences within CRISPR loci of prokaryotic genomes confers resistance against corresponding viruses. The CRISPR loci are transcribed as long RNAs that must be processed to smaller guide RNAs. Here we identified Pyrococcus furiosus Cas6 as a novel endoribonuclease that cleaves CRISPR RNAs within the repeat sequences to release individual invader targetingmore » RNAs. Cas6 interacts with a specific sequence motif in the 5{prime} region of the CRISPR repeat element and cleaves at a defined site within the 3{prime} region of the repeat. The 1.8 angstrom crystal structure of the enzyme reveals two ferredoxin-like folds that are also found in other RNA-binding proteins. The predicted active site of the enzyme is similar to that of tRNA splicing endonucleases, and concordantly, Cas6 activity is metal-independent. cas6 is one of the most widely distributed CRISPR-associated genes. Our findings indicate that Cas6 functions in the generation of CRISPR-derived guide RNAs in numerous bacteria and archaea.« less
Isolation and characterization of microsatellite loci in the intertidal sponge Halichondria panicea

USGS Publications Warehouse

Knowlton, Anne L.; Pierson, Barbara J.; Talbot, S.L.; Highsmith, Ray C.

2003-01-01

GA- and CA-enriched genomic libraries were constructed for the intertidal sponge Halichondria panicea. Unique repeat motifs identified varied from the expected simple dinucleotide repeats to more complex repeat units. All sequences tended to be highly repetitive but did not necessarily contain the targeted motifs. Seven microsatellite loci were evaluated on sponges from the clone source population. All seven were polymorphic with 5.43 ± 0.92 mean number of alleles. Six of the seven loci that could be resolved had mean heterozygosities of 0.14–0.68. The loci identified here will be useful for population studies.
CRISPR Detection From Short Reads Using Partial Overlap Graphs.

PubMed

Ben-Bassat, Ilan; Chor, Benny

2016-06-01

Clustered regularly interspaced short palindromic repeats (CRISPR) are structured regions in bacterial and archaeal genomes, which are part of an adaptive immune system against phages. CRISPRs are important for many microbial studies and are playing an essential role in current gene editing techniques. As such, they attract substantial research interest. The exponential growth in the amount of bacterial sequence data in recent years enables the exploration of CRISPR loci in more and more species. Most of the automated tools that detect CRISPR loci rely on fully assembled genomes. However, many assemblers do not handle repetitive regions successfully. The first tool to work directly on raw sequence data is Crass, which requires reads that are long enough to contain two copies of the same repeat. We present a method to identify CRISPR repeats from raw sequence data of short reads. The algorithm is based on an observation differentiating CRISPR repeats from other types of repeats, and it involves a series of partial constructions of the overlap graph. This enables us to avoid many of the difficulties that assemblers face, as we merely aim to identify the repeats that belong to CRISPR loci. A preliminary implementation of the algorithm shows good results and detects CRISPR repeats in cases where other existing tools fail to do so.
Molecular identification and characterization of clustered regularly interspaced short palindromic repeats (CRISPRs) in a urease-positive thermophilic Campylobacter sp. (UPTC).

PubMed

Tasaki, E; Hirayama, J; Tazumi, A; Hayashi, K; Hara, Y; Ueno, H; Moore, J E; Millar, B C; Matsuda, M

2012-02-01

Novel clustered regularly-interspaced short palindromic repeats (CRISPRs) locus [7,500 base pairs (bp) in length] occurred in the urease-positive thermophilic Campylobacter (UPTC) Japanese isolate, CF89-12. The 7,500 bp gene loci consisted of the 5'-methylaminomethyl-2-thiouridylate methyltransferase gene, putative (P) CRISPR associated (p-Cas), putative open reading frames, Cas1 and Cas2, leader sequence region (146 bp), 12 CRISPRs consensus sequence repeats (each 36 bp) separated by a non-repetitive unique spacer region of similar length (26-31 bp) and the phosphatidyl glycerophosphatase A gene. When the CRISPRs loci in the UPTC CF89-12 and five C. jejuni isolates were compared with one another, these six isolates contained p-Cas, Cas1 and Cas2 within the loci. Four to 12 CRISPRs consensus sequence repeats separated by a non-repetitive unique spacer region occurred in six isolates and the nucleotide sequences of those repeats gave approximately 92-100% similarity with each other. However, no sequence similarity occurred in the unique spacer regions among these isolates. The putative σ(70) transcriptional promoter and the hypothetical ρ-independent terminator structures for the CRISPRs and Cas were detected. No in vivo transcription of p-Cas, Cas1 and Cas2 was confirmed in the UPTC cells.
A set of plastid loci for use in multiplex fragment length genotyping for intraspecific variation in Pinus (Pinaceae)1

PubMed Central

Wofford, Austin M.; Finch, Kristen; Bigott, Adam; Willyard, Ann

2014-01-01

• Premise of the study: Recently released Pinus plastome sequences support characterization of 15 plastid simple sequence repeat (cpSSR) loci originally published for P. contorta and P. thunbergii. This allows selection of loci for single-tube PCR multiplexed genotyping in any subsection of the genus. • Methods: Unique placement of primers and primer conservation across the genus were investigated, and a set of six loci were selected for single-tube multiplexing. We compared interspecific variation between cpSSRs and nucleotide sequences of ycf1 and tested intraspecific variation for cpSSRs using 911 samples in the P. ponderosa species complex. • Results: The cpSSR loci contain mononucleotide and complex repeats with additional length variation in flanking regions. They are not located in hypervariable regions, and most primers are conserved across the genus. A single PCR per sample multiplexed for six loci yielded 45 alleles in 911 samples. • Discussion: The protocol allows efficient genotyping of many samples. The cpSSR loci are too variable for Pinus phylogenies but are useful for the study of genetic structure within and among populations. The multiplex method could easily be extended to other plant groups by choosing primers for cpSSR loci in a plastome alignment for the target group. PMID:25202625
Genetic characterization of the UCS and Kex1 loci of Pneumocystis jirovecii.

PubMed

Esteves, F; Tavares, A; Costa, M C; Gaspar, J; Antunes, F; Matos, O

2009-02-01

Nucleotide variation in the Pneumocystis jirovecii upstream conserved sequence (UCS) and kexin-like serine protease (Kex1) loci was studied in pulmonary specimens from Portuguese HIV-positive patients. DNA was extracted and used for specific molecular sequence analysis. The number of UCS tandem repeats detected in 13 successfully sequenced isolates ranged from three (9 isolates, 69%) to four (4 isolates, 31%). A novel tandem repeat pattern and two novel polymorphisms were detected in the UCS region. For the Kex1 gene, the wild-type (24 isolates, 86%) was the most frequent sequence detected among the 28 sequenced isolates. Nevertheless, a nonsynonymous (1 isolate, 3%) and three synonymous (3 isolates, 11%) polymorphisms were detected and are described here for the first time.

Taxonomy of the Rhizopogon vinicolor species complex based on analysis of ITS sequences and microsatellite loci.

Treesearch

Annette M. Kretzer; Daniel L. Luoma; Randy Molina; Joseph W. Spatafora

2003-01-01

We are re-addressing species concepts in the Rhizopogon vinicolor species complex (Boletales, Basidiomycota) using sequence data from the interna transcribed spacer (ITS) region of the nuclear ribosomal repeat, as well as genoLypic data from five microsatellite loci. The R. vinicolor species complex by our definition includes,...
Repetitive DNA loci and their modulation by the non-canonical nucleic acid structures R-loops and G-quadruplexes

PubMed Central

Hall, Amanda C.; Ostrowski, Lauren A.; Mekhail, Karim

2017-01-01

ABSTRACT Cells have evolved intricate mechanisms to maintain genome stability despite allowing mutational changes to drive evolutionary adaptation. Repetitive DNA sequences, which represent the bulk of most genomes, are a major threat to genome stability often driving chromosome rearrangements and disease. The major source of repetitive DNA sequences and thus the most vulnerable constituents of the genome are the rDNA (rDNA) repeats, telomeres, and transposable elements. Maintaining the stability of these loci is critical to overall cellular fitness and lifespan. Therefore, cells have evolved mechanisms to regulate rDNA copy number, telomere length and transposon activity, as well as DNA repair at these loci. In addition, non-canonical structure-forming DNA motifs can also modulate the function of these repetitive DNA loci by impacting their transcription, replication, and stability. Here, we discuss key mechanisms that maintain rDNA repeats, telomeres, and transposons in yeast and human before highlighting emerging roles for non-canonical DNA structures at these repetitive loci. PMID:28406751
Development of a massively parallel sequencing assay for investigating sequence polymorphisms of 15 short tandem repeats in a Chinese Northern Han population.

PubMed

Zhang, Qing-Xia; Yang, Meng; Pan, Ya-Jiao; Zhao, Jing; Qu, Bao-Wang; Cheng, Feng; Yang, Ya-Ran; Jiao, Zhang-Ping; Liu, Li; Yan, Jiang-Wei

2018-05-17

Massively parallel sequencing (MPS) has been used in forensic genetics in recent years owing to several advantages, e.g. MPS can provide precise descriptions of the repeat allele structure and variation in the repeat-flanking regions, increasing the discriminating power among loci and individuals. However, it cannot be fully utilized unless sufficient population data are available for all loci. Thus, there is a pressing need to perform population studies providing a basis for the introduction of MPS into forensic practice. Here, we constructed a multiplex PCR system with fusion primers for one-directional PCR for MPS of 15 commonly used forensic autosomal STRs and amelogenin. Samples from 554 unrelated Chinese Northern Han individuals were typed using this MPS assay. In total, 313 alleles obtained by MPS for all 15 STRs were observed, and the corresponding allele frequencies ranged between 0.0009 and 0.5162. Of all 15 loci, the number of alleles identified for 12 loci increased compared to capillary electrophoresis approaches, and for the following six loci more than double the number of alleles was found: D2S1338, D5S818, D21S11, D13S317, vWA, and D3S1358. Forensic parameters were calculated based on length and sequence-based alleles. D21S11 showed the highest heterozygosity (0.8791), discrimination power (0.9865), and paternity exclusion probability in trios (0.7529). The cumulative match probability for MPS was approximately 2.3157 × 10 -20 . © 2018 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Differentiation of “Candidatus Liberibacter asiaticus” Isolates by Variable-Number Tandem-Repeat Analysis ▿

PubMed Central

Katoh, Hiroshi; Subandiyah, Siti; Tomimura, Kenta; Okuda, Mitsuru; Su, Hong-Ji; Iwanami, Toru

2011-01-01

Four highly polymorphic simple sequence repeat (SSR) loci were selected and used to differentiate 84 Japanese isolates of “Candidatus Liberibacter asiaticus.” The Nei's measure of genetic diversity values for these four SSRs ranged from 0.60 to 0.86. The four SSR loci were also highly polymorphic in four isolates from Taiwan and 12 isolates from Indonesia. PMID:21239554
Simple sequence repeat marker loci discovery using SSR primer.

PubMed

Robinson, Andrew J; Love, Christopher G; Batley, Jacqueline; Barker, Gary; Edwards, David

2004-06-12

Simple sequence repeats (SSRs) have become important molecular markers for a broad range of applications, such as genome mapping and characterization, phenotype mapping, marker assisted selection of crop plants and a range of molecular ecology and diversity studies. With the increase in the availability of DNA sequence information, an automated process to identify and design PCR primers for amplification of SSR loci would be a useful tool in plant breeding programs. We report an application that integrates SPUTNIK, an SSR repeat finder, with Primer3, a PCR primer design program, into one pipeline tool, SSR Primer. On submission of multiple FASTA formatted sequences, the script screens each sequence for SSRs using SPUTNIK. The results are parsed to Primer3 for locus-specific primer design. The script makes use of a Web-based interface, enabling remote use. This program has been written in PERL and is freely available for non-commercial users by request from the authors. The Web-based version may be accessed at http://hornbill.cspp.latrobe.edu.au/
Development of genic-SSR markers by deep transcriptome sequencing in pigeonpea [Cajanus cajan (L.) Millspaugh].

PubMed

Dutta, Sutapa; Kumawat, Giriraj; Singh, Bikram P; Gupta, Deepak K; Singh, Sangeeta; Dogra, Vivek; Gaikwad, Kishor; Sharma, Tilak R; Raje, Ranjeet S; Bandhopadhya, Tapas K; Datta, Subhojit; Singh, Mahendra N; Bashasab, Fakrudin; Kulwal, Pawan; Wanjari, K B; K Varshney, Rajeev; Cook, Douglas R; Singh, Nagendra K

2011-01-20

Pigeonpea [Cajanus cajan (L.) Millspaugh], one of the most important food legumes of semi-arid tropical and subtropical regions, has limited genomic resources, particularly expressed sequence based (genic) markers. We report a comprehensive set of validated genic simple sequence repeat (SSR) markers using deep transcriptome sequencing, and its application in genetic diversity analysis and mapping. In this study, 43,324 transcriptome shotgun assembly unigene contigs were assembled from 1.696 million 454 GS-FLX sequence reads of separate pooled cDNA libraries prepared from leaf, root, stem and immature seed of two pigeonpea varieties, Asha and UPAS 120. A total of 3,771 genic-SSR loci, excluding homopolymeric and compound repeats, were identified; of which 2,877 PCR primer pairs were designed for marker development. Dinucleotide was the most common repeat motif with a frequency of 60.41%, followed by tri- (34.52%), hexa- (2.62%), tetra- (1.67%) and pentanucleotide (0.76%) repeat motifs. Primers were synthesized and tested for 772 of these loci with repeat lengths of ≥ 18 bp. Of these, 550 markers were validated for consistent amplification in eight diverse pigeonpea varieties; 71 were found to be polymorphic on agarose gel electrophoresis. Genetic diversity analysis was done on 22 pigeonpea varieties and eight wild species using 20 highly polymorphic genic-SSR markers. The number of alleles at these loci ranged from 4-10 and the polymorphism information content values ranged from 0.46 to 0.72. Neighbor-joining dendrogram showed distinct separation of the different groups of pigeonpea cultivars and wild species. Deep transcriptome sequencing of the two parental lines helped in silico identification of polymorphic genic-SSR loci to facilitate the rapid development of an intra-species reference genetic map, a subset of which was validated for expected allelic segregation in the reference mapping population. We developed 550 validated genic-SSR markers in pigeonpea using deep transcriptome sequencing. From these, 20 highly polymorphic markers were used to evaluate the genetic relationship among species of the genus Cajanus. A comprehensive set of genic-SSR markers was developed as an important genomic resource for diversity analysis and genetic mapping in pigeonpea.
Development of genic-SSR markers by deep transcriptome sequencing in pigeonpea [Cajanus cajan (L.) Millspaugh

PubMed Central

2011-01-01

Background Pigeonpea [Cajanus cajan (L.) Millspaugh], one of the most important food legumes of semi-arid tropical and subtropical regions, has limited genomic resources, particularly expressed sequence based (genic) markers. We report a comprehensive set of validated genic simple sequence repeat (SSR) markers using deep transcriptome sequencing, and its application in genetic diversity analysis and mapping. Results In this study, 43,324 transcriptome shotgun assembly unigene contigs were assembled from 1.696 million 454 GS-FLX sequence reads of separate pooled cDNA libraries prepared from leaf, root, stem and immature seed of two pigeonpea varieties, Asha and UPAS 120. A total of 3,771 genic-SSR loci, excluding homopolymeric and compound repeats, were identified; of which 2,877 PCR primer pairs were designed for marker development. Dinucleotide was the most common repeat motif with a frequency of 60.41%, followed by tri- (34.52%), hexa- (2.62%), tetra- (1.67%) and pentanucleotide (0.76%) repeat motifs. Primers were synthesized and tested for 772 of these loci with repeat lengths of ≥18 bp. Of these, 550 markers were validated for consistent amplification in eight diverse pigeonpea varieties; 71 were found to be polymorphic on agarose gel electrophoresis. Genetic diversity analysis was done on 22 pigeonpea varieties and eight wild species using 20 highly polymorphic genic-SSR markers. The number of alleles at these loci ranged from 4-10 and the polymorphism information content values ranged from 0.46 to 0.72. Neighbor-joining dendrogram showed distinct separation of the different groups of pigeonpea cultivars and wild species. Deep transcriptome sequencing of the two parental lines helped in silico identification of polymorphic genic-SSR loci to facilitate the rapid development of an intra-species reference genetic map, a subset of which was validated for expected allelic segregation in the reference mapping population. Conclusion We developed 550 validated genic-SSR markers in pigeonpea using deep transcriptome sequencing. From these, 20 highly polymorphic markers were used to evaluate the genetic relationship among species of the genus Cajanus. A comprehensive set of genic-SSR markers was developed as an important genomic resource for diversity analysis and genetic mapping in pigeonpea. PMID:21251263
Complete Chloroplast Genome Sequences of Important Oilseed Crop Sesamum indicum L

PubMed Central

Yi, Dong-Keun; Kim, Ki-Joong

2012-01-01

Sesamum indicum is an important crop plant species for yielding oil. The complete chloroplast (cp) genome of S. indicum (GenBank acc no. JN637766) is 153,324 bp in length, and has a pair of inverted repeat (IR) regions consisting of 25,141 bp each. The lengths of the large single copy (LSC) and the small single copy (SSC) regions are 85,170 bp and 17,872 bp, respectively. Comparative cp DNA sequence analyses of S. indicum with other cp genomes reveal that the genome structure, gene order, gene and intron contents, AT contents, codon usage, and transcription units are similar to the typical angiosperm cp genomes. Nucleotide diversity of the IR region between Sesamum and three other cp genomes is much lower than that of the LSC and SSC regions in both the coding region and noncoding region. As a summary, the regional constraints strongly affect the sequence evolution of the cp genomes, while the functional constraints weakly affect the sequence evolution of cp genomes. Five short inversions associated with short palindromic sequences that form step-loop structures were observed in the chloroplast genome of S. indicum. Twenty-eight different simple sequence repeat loci have been detected in the chloroplast genome of S. indicum. Almost all of the SSR loci were composed of A or T, so this may also contribute to the A-T richness of the cp genome of S. indicum. Seven large repeated loci in the chloroplast genome of S. indicum were also identified and these loci are useful to developing S. indicum-specific cp genome vectors. The complete cp DNA sequences of S. indicum reported in this paper are prerequisite to modifying this important oilseed crop by cp genetic engineering techniques. PMID:22606240
Conservation of human chromosome 13 polymorphic microsatellite (CA){sub n} repeats in chimpanzees

DOE Office of Scientific and Technical Information (OSTI.GOV)

Deka, R.; Shriver, M.D.; Yu, L.M.

Tandemly repeated (dC-dA){sub n} {center_dot} (dG-dT){sub n} sequences occur abundantly and are found in most eukaryotic genomes. To investigate the level of conservation of these repeat sequences in nonhuman primates, the authors have analyzed seven human chromosome 13 dinucleotide (CA){sub n} repeat loci in chimpanzees by DNA amplification using primers designed for analysis of human loci. Comparable levels of polymorphism at these loci in the two species, revealed by the number of alleles, heterozygosity, and allele sizes, suggest that the (CA){sub n} repeat arrays and their genomic locations are highly conserved. Even though the proportion of shared alleles between themore » two species varies enormously and the modal alleles are not the same, allelic lengths at each locus in the chimpanzees are detected within the bounds of the allele size range observed in humans. A similar observation has been noted in a limited number of gorillas and orangutans. Using a new measure of genetic distance that takes into account the size of alleles, they have compared the genetic distance between humans and chimpanzees. The genetic distance between these two species was found to be ninefold smaller than expected assuming there is no selection or mutational bias toward retention of (CA){sub n} repeat arrays. These findings suggest a functional significance for these microsatellite loci. 34 refs., 1 fig., 2 tabs.« less
Genome-Wide Stochastic Adaptive DNA Amplification at Direct and Inverted DNA Repeats in the Parasite Leishmania

PubMed Central

Plourde, Marie; Gingras, Hélène; Roy, Gaétan; Lapointe, Andréanne; Leprohon, Philippe; Papadopoulou, Barbara; Corbeil, Jacques; Ouellette, Marc

2014-01-01

Gene amplification of specific loci has been described in all kingdoms of life. In the protozoan parasite Leishmania, the product of amplification is usually part of extrachromosomal circular or linear amplicons that are formed at the level of direct or inverted repeated sequences. A bioinformatics screen revealed that repeated sequences are widely distributed in the Leishmania genome and the repeats are chromosome-specific, conserved among species, and generally present in low copy number. Using sensitive PCR assays, we provide evidence that the Leishmania genome is continuously being rearranged at the level of these repeated sequences, which serve as a functional platform for constitutive and stochastic amplification (and deletion) of genomic segments in the population. This process is adaptive as the copy number of advantageous extrachromosomal circular or linear elements increases upon selective pressure and is reversible when selection is removed. We also provide mechanistic insights on the formation of circular and linear amplicons through RAD51 recombinase-dependent and -independent mechanisms, respectively. The whole genome of Leishmania is thus stochastically rearranged at the level of repeated sequences, and the selection of parasite subpopulations with changes in the copy number of specific loci is used as a strategy to respond to a changing environment. PMID:24844805
Isolation and characterization of novel microsatellite markers from the sika deer (Cervus nippon) genome.

PubMed

Li, Y M; Bai, C Y; Niu, W P; Yu, H; Yang, R J; Yan, S Q; Zhang, J Y; Zhang, M J; Zhao, Z H

2015-09-28

Microsatellite markers are widely and evenly distributed, and are highly polymorphic. Rapid and convenient detection through automated analysis means that microsatellite markers are widely used in the construction of plant and animal genetic maps, in quantitative trait loci localization, marker-assisted selection, identification of genetic relationships, and genetic diversity and phylogenetic tree construction. However, few microsatellite markers remain to be isolated. We used streptavidin magnetic beads to affinity-capture and construct a (CA)n microsatellite DNA-enriched library from sika deer. We selected sequences containing more than six repeats to design primers. Clear bands were selected, which were amplified using non-specific primers following PCR amplification to screen polymorphisms in a group of 65 unrelated sika deer. The positive clone rate reached 82.9% by constructing the enriched library, and we then selected positive clones for sequencing. There were 395 sequences with CA repeats, and the CA repeat number was 4-105. We selected sequences containing more than six repeats to design primers, of which 297 pairs were designed. We next selected clear bands and used non-specific primers to amplify following PCR amplification. In total, 245 pairs of primers were screened. We then selected 50 pairs of primers to randomly screen for polymorphisms. We detected 47 polymorphic and 3 monomorphic loci in 65 unrelated sika deer. These newly isolated and characterized microsatellite loci can be used to construct genetic maps and for lineage testing in deer. In addition, they can be used for comparative genomics between Cervidae species.
[Analysis on genetic polymorphism of 5 STR loci selected from X chromosome].

PubMed

Liu, Qi-ji; Gong, Yao-qin; Zhang, Xi-yu; Gao, Gui-min; Li, Jiang-xia; Guo, Yi-shou

2005-02-01

To select short tandem repeats(STR) from X chromosome. STR is a universal genetic marker that has changeable polymorphism and stable heredity in human genome. It is a specific DNA segment composed of 2-6 base pairs as its core sequence. It is an ideal DNA marker used in linkage analysis and gene mapping. In this study, 8 short tandem repeats were selected from two genomic clones on X chromosome by using BCM Search Launcher. Primers amplifying the STR loci were designed by using Primer 3.0 according to the unique sequence flanking the STRs. Polymorphisms of the short tandem repeats in Chinese population were evaluated by PCR amplification and PAGE. Five of these STRs were polymorphic. Chi-square test indicated that the distribution of genotypes agreed with Hardy-Weinberg equilibrium (P>0.05). Five polymorphic short tandem repeats have been identified on chromosome X and will be useful for linkage analysis and gene mapping.
[Comparative genomics and evolutionary analysis of CRISPR loci in acetic acid bacteria].

PubMed

Xia, Kai; Liang, Xin-le; Li, Yu-dong

2015-12-01

The clustered regularly interspaced short palindromic repeat (CRISPR) is a widespread adaptive immunity system that exists in most archaea and many bacteria against foreign DNA, such as phages, viruses and plasmids. In general, CRISPR system consists of direct repeat, leader, spacer and CRISPR-associated sequences. Acetic acid bacteria (AAB) play an important role in industrial fermentation of vinegar and bioelectrochemistry. To investigate the polymorphism and evolution pattern of CRISPR loci in acetic acid bacteria, bioinformatic analyses were performed on 48 species from three main genera (Acetobacter, Gluconacetobacter and Gluconobacter) with whole genome sequences available from the NCBI database. The results showed that the CRISPR system existed in 32 species of the 48 strains studied. Most of the CRISPR-Cas system in AAB belonged to type I CRISPR-Cas system (subtype E and C), but type II CRISPR-Cas system which contain cas9 gene was only found in the genus Acetobacter and Gluconacetobacter. The repeat sequences of some CRISPR were highly conserved among species from different genera, and the leader sequences of some CRISPR possessed conservative motif, which was associated with regulated promoters. Moreover, phylogenetic analysis of cas1 demonstrated that they were suitable for classification of species. The conservation of cas1 genes was associated with that of repeat sequences among different strains, suggesting they were subjected to similar functional constraints. Moreover, the number of spacer was positively correlated with the number of prophages and insertion sequences, indicating the acetic acid bacteria were continually invaded by new foreign DNA. The comparative analysis of CRISR loci in acetic acid bacteria provided the basis for investigating the molecular mechanism of different acetic acid tolerance and genome stability in acetic acid bacteria.
Genetic analysis of children of atomic bomb survivors.

PubMed Central

Satoh, C; Takahashi, N; Asakawa, J; Kodaira, M; Kuick, R; Hanash, S M; Neel, J V

1996-01-01

Studies are under way for the detection of potential genetic effects of atomic bomb radiation at the DNA level in the children of survivors. In a pilot study, we have examined six minisatellites and five microsatellites in DNA derived from 100 families including 124 children. We detected a total of 28 mutations in three minisatellite loci. The mean mutation rates per locus per gamete in the six minisatellite loci were 1.5% for 65 exposed gametes for which mean parental gonadal dose was 1.9 Sv and 2.0% for 183 unexposed gametes. We detected four mutations in two tetranucleotide repeat sequences but no mutations in three trinucleotide repeat sequences. The mean mutation rate per locus per gamete was o% for the exposed gametes and 0.5% for the unexposed gametes in the five microsatellite loci. No significant differences in the mutation rates between the exposed and the unexposed gametes were detected in these repetitive sequences. Additional loci are being analyzed to increase the power of our study to observe a significant difference in the mutation rates at the 0.05 level of significance. Images Figure 1. Figure 2. Figure 2. Figure 2. Figure 2. Figure 2. Figure 2. PMID:8781374
Optimization of sequence alignment for simple sequence repeat regions.

PubMed

Jighly, Abdulqader; Hamwieh, Aladdin; Ogbonnaya, Francis C

2011-07-20

Microsatellites, or simple sequence repeats (SSRs), are tandemly repeated DNA sequences, including tandem copies of specific sequences no longer than six bases, that are distributed in the genome. SSR has been used as a molecular marker because it is easy to detect and is used in a range of applications, including genetic diversity, genome mapping, and marker assisted selection. It is also very mutable because of slipping in the DNA polymerase during DNA replication. This unique mutation increases the insertion/deletion (INDELs) mutation frequency to a high ratio - more than other types of molecular markers such as single nucleotide polymorphism (SNPs).SNPs are more frequent than INDELs. Therefore, all designed algorithms for sequence alignment fit the vast majority of the genomic sequence without considering microsatellite regions, as unique sequences that require special consideration. The old algorithm is limited in its application because there are many overlaps between different repeat units which result in false evolutionary relationships. To overcome the limitation of the aligning algorithm when dealing with SSR loci, a new algorithm was developed using PERL script with a Tk graphical interface. This program is based on aligning sequences after determining the repeated units first, and the last SSR nucleotides positions. This results in a shifting process according to the inserted repeated unit type.When studying the phylogenic relations before and after applying the new algorithm, many differences in the trees were obtained by increasing the SSR length and complexity. However, less distance between different linage had been observed after applying the new algorithm. The new algorithm produces better estimates for aligning SSR loci because it reflects more reliable evolutionary relations between different linages. It reduces overlapping during SSR alignment, which results in a more realistic phylogenic relationship.
Evolutional dynamics of 45S and 5S ribosomal DNA in ancient allohexaploid Atropa belladonna.

PubMed

Volkov, Roman A; Panchuk, Irina I; Borisjuk, Nikolai V; Hosiawa-Baranska, Marta; Maluszynska, Jolanta; Hemleben, Vera

2017-01-23

Polyploid hybrids represent a rich natural resource to study molecular evolution of plant genes and genomes. Here, we applied a combination of karyological and molecular methods to investigate chromosomal structure, molecular organization and evolution of ribosomal DNA (rDNA) in nightshade, Atropa belladonna (fam. Solanaceae), one of the oldest known allohexaploids among flowering plants. Because of their abundance and specific molecular organization (evolutionarily conserved coding regions linked to variable intergenic spacers, IGS), 45S and 5S rDNA are widely used in plant taxonomic and evolutionary studies. Molecular cloning and nucleotide sequencing of A. belladonna 45S rDNA repeats revealed a general structure characteristic of other Solanaceae species, and a very high sequence similarity of two length variants, with the only difference in number of short IGS subrepeats. These results combined with the detection of three pairs of 45S rDNA loci on separate chromosomes, presumably inherited from both tetraploid and diploid ancestor species, example intensive sequence homogenization that led to substitution/elimination of rDNA repeats of one parent. Chromosome silver-staining revealed that only four out of six 45S rDNA sites are frequently transcriptionally active, demonstrating nucleolar dominance. For 5S rDNA, three size variants of repeats were detected, with the major class represented by repeats containing all functional IGS elements required for transcription, the intermediate size repeats containing partially deleted IGS sequences, and the short 5S repeats containing severe defects both in the IGS and coding sequences. While shorter variants demonstrate increased rate of based substitution, probably in their transition into pseudogenes, the functional 5S rDNA variants are nearly identical at the sequence level, pointing to their origin from a single parental species. Localization of the 5S rDNA genes on two chromosome pairs further supports uniparental inheritance from the tetraploid progenitor. The obtained molecular, cytogenetic and phylogenetic data demonstrate complex evolutionary dynamics of rDNA loci in allohexaploid species of Atropa belladonna. The high level of sequence unification revealed in 45S and 5S rDNA loci of this ancient hybrid species have been seemingly achieved by different molecular mechanisms.
Molecular characterization of allelic variants of (GATA)n microsatellite loci in parthenogenetic lizards Darevskia unisexualis (Lacertidae).

PubMed

Korchagin, V I; Badaeva, T N; Tokarskaya, O N; Martirosyan, I A; Darevsky, I S; Ryskov, A P

2007-05-01

Populations of parthenogenetic lizards of the genus Darevskia consist of genetically identical animals, and represent a unique model for studying the molecular mechanisms underlying the variability and evolution of hypervariable DNA repeats. As unisexual lineages, parthenogenetic lizards are characterized by some level of genetic diversity at microsatellite loci. We cloned and sequenced a number of (GATA)n microsatellite loci of Darevskia unisexualis. PCR products from these loci were also sequenced and the degree of intraspecific polymorphism was assessed. Among the five (GATA)n loci analysed, two (Du215 and Du281) were polymorphic. Cross-species analysis of Du215 and Du281 indicate that the priming sites at the D. unisexualis loci are conserved in the bisexual parental species, D. raddei and D. valentini. Sequencing the PCR products amplified from Du215 and Du281 and from monomorphic Du323 showed that allelic differences at the polymorphic loci are caused by microsatellite mutations and by point mutations in the flanking regions. The haplotypes identified among the allelic variants of Du281 and among its orthologues in the parental species provide new evidence of the cross-species origin of D. unisexualis. To our knowledge, these data are the first to characterize the nucleotide sequences of allelic variants at microsatellite loci within parthenogenetic vertebrate animals.
Characterization of genic microsatellite markers derived from expressed sequence tags in Pacific abalone ( Haliotis discus hannai)

NASA Astrophysics Data System (ADS)

Li, Qi; Shu, Jing; Zhao, Cui; Liu, Shikai; Kong, Lingfeng; Zheng, Xiaodong

2010-01-01

Simple sequence repeat (SSR) markers were developed from the expressed sequence tags (ESTs) of Pacific abalone ( Haliotis discus hannai). Repeat motifs were found in 4.95% of the ESTs at a frequency of one repeat every 10.04 kb of EST sequences, after redundancy elimination. Seventeen polymorphic EST-SSRs were developed. The number of alleles per locus varied from 2-17, with an average of 6.8 alleles per locus. The expected and observed heterozygosities ranged from 0.159 to 0.928 and from 0.132 to 0.922, respectively. Twelve of the 17 loci (70.6%) were successfully amplified in H. diversicolor. Seventeen loci segregated in three families, with three showing the presence of null alleles (17.6%). The adequate level of variability and low frequency of null alleles observed in H. discus hannai, together with the high rate of transportability across Haliotis species, make this set of EST-SSR markers an important tool for comparative mapping, marker-assisted selection, and evolutionary studies, not only in the Pacific abalone, but also in related species.
Comparison of taxon-specific versus general locus sets for targeted sequence capture in plant phylogenomics.

PubMed

Chau, John H; Rahfeldt, Wolfgang A; Olmstead, Richard G

2018-03-01

Targeted sequence capture can be used to efficiently gather sequence data for large numbers of loci, such as single-copy nuclear loci. Most published studies in plants have used taxon-specific locus sets developed individually for a clade using multiple genomic and transcriptomic resources. General locus sets can also be developed from loci that have been identified as single-copy and have orthologs in large clades of plants. We identify and compare a taxon-specific locus set and three general locus sets (conserved ortholog set [COSII], shared single-copy nuclear [APVO SSC] genes, and pentatricopeptide repeat [PPR] genes) for targeted sequence capture in Buddleja (Scrophulariaceae) and outgroups. We evaluate their performance in terms of assembly success, sequence variability, and resolution and support of inferred phylogenetic trees. The taxon-specific locus set had the most target loci. Assembly success was high for all locus sets in Buddleja samples. For outgroups, general locus sets had greater assembly success. Taxon-specific and PPR loci had the highest average variability. The taxon-specific data set produced the best-supported tree, but all data sets showed improved resolution over previous non-sequence capture data sets. General locus sets can be a useful source of sequence capture targets, especially if multiple genomic resources are not available for a taxon.
Cytogenetic Analysis of Populus trichocarpa - Ribosomal DNA, Telomere Repeat Sequence, and Marker-selected BACs

Treesearch

M.N. lslam-Faridi; C.D. Nelson; S.P. DiFazio; L.E. Gunter; G.A. Tuskan

2009-01-01

The 185-285 rDNA and 55 rDNA loci in Populus trichocarpa were localized using fluorescent in situ hybridization (FISH). Two 185-285 rDNA sites and one 55 rDNA site were identified and located at the ends of 3 different chromosomes. FISH signals from the Arabidopsis-type telomere repeat sequence were observed at the distal ends of each chromosome. Six BAC clones...

Short-Sequence DNA Repeats in Prokaryotic Genomes

PubMed Central

van Belkum, Alex; Scherer, Stewart; van Alphen, Loek; Verbrugh, Henri

1998-01-01

Short-sequence DNA repeat (SSR) loci can be identified in all eukaryotic and many prokaryotic genomes. These loci harbor short or long stretches of repeated nucleotide sequence motifs. DNA sequence motifs in a single locus can be identical and/or heterogeneous. SSRs are encountered in many different branches of the prokaryote kingdom. They are found in genes encoding products as diverse as microbial surface components recognizing adhesive matrix molecules and specific bacterial virulence factors such as lipopolysaccharide-modifying enzymes or adhesins. SSRs enable genetic and consequently phenotypic flexibility. SSRs function at various levels of gene expression regulation. Variations in the number of repeat units per locus or changes in the nature of the individual repeat sequences may result from recombination processes or polymerase inadequacy such as slipped-strand mispairing (SSM), either alone or in combination with DNA repair deficiencies. These rather complex phenomena can occur with relative ease, with SSM approaching a frequency of 10−4 per bacterial cell division and allowing high-frequency genetic switching. Bacteria use this random strategy to adapt their genetic repertoire in response to selective environmental pressure. SSR-mediated variation has important implications for bacterial pathogenesis and evolutionary fitness. Molecular analysis of changes in SSRs allows epidemiological studies on the spread of pathogenic bacteria. The occurrence, evolution and function of SSRs, and the molecular methods used to analyze them are discussed in the context of responsiveness to environmental factors, bacterial pathogenicity, epidemiology, and the availability of full-genome sequences for increasing numbers of microorganisms, especially those that are medically relevant. PMID:9618442
Rapid microsatellite identification from Illumina paired-end genomic sequencing in two birds and a snake

USGS Publications Warehouse

Castoe, Todd A.; Poole, Alexander W.; de Koning, A. P. Jason; Jones, Kenneth L.; Tomback, Diana F.; Oyler-McCance, Sara J.; Fike, Jennifer A.; Lance, Stacey L.; Streicher, Jeffrey W.; Smith, Eric N.; Pollock, David D.

2012-01-01

Identification of microsatellites, or simple sequence repeats (SSRs), can be a time-consuming and costly investment requiring enrichment, cloning, and sequencing of candidate loci. Recently, however, high throughput sequencing (with or without prior enrichment for specific SSR loci) has been utilized to identify SSR loci. The direct "Seq-to-SSR" approach has an advantage over enrichment-based strategies in that it does not require a priori selection of particular motifs, or prior knowledge of genomic SSR content. It has been more expensive per SSR locus recovered, however, particularly for genomes with few SSR loci, such as bird genomes. The longer but relatively more expensive 454 reads have been preferred over less expensive Illumina reads. Here, we use Illumina paired-end sequence data to identify potentially amplifiable SSR loci (PALs) from a snake (the Burmese python, Python molurus bivittatus), and directly compare these results to those from 454 data. We also compare the python results to results from Illumina sequencing of two bird genomes (Gunnison Sage-grouse, Centrocercus minimus, and Clark's Nutcracker, Nucifraga columbiana), which have considerably fewer SSRs than the python. We show that direct Illumina Seq-to-SSR can identify and characterize thousands of potentially amplifiable SSR loci for as little as $10 per sample – a fraction of the cost of 454 sequencing. Given that Illumina Seq-to-SSR is effective, inexpensive, and reliable even for species such as birds that have few SSR loci, it seems that there are now few situations for which prior hybridization is justifiable.
Rapid microsatellite identification from illumina paired-end genomic sequencing in two birds and a snake

USGS Publications Warehouse

Castoe, T.A.; Poole, A.W.; de Koning, A. P. J.; Jones, K.L.; Tomback, D.F.; Oyler-McCance, S.J.; Fike, J.A.; Lance, S.L.; Streicher, J.W.; Smith, E.N.; Pollock, D.D.

2012-01-01

Identification of microsatellites, or simple sequence repeats (SSRs), can be a time-consuming and costly investment requiring enrichment, cloning, and sequencing of candidate loci. Recently, however, high throughput sequencing (with or without prior enrichment for specific SSR loci) has been utilized to identify SSR loci. The direct "Seq-to-SSR" approach has an advantage over enrichment-based strategies in that it does not require a priori selection of particular motifs, or prior knowledge of genomic SSR content. It has been more expensive per SSR locus recovered, however, particularly for genomes with few SSR loci, such as bird genomes. The longer but relatively more expensive 454 reads have been preferred over less expensive Illumina reads. Here, we use Illumina paired-end sequence data to identify potentially amplifiable SSR loci (PALs) from a snake (the Burmese python, Python molurus bivittatus), and directly compare these results to those from 454 data. We also compare the python results to results from Illumina sequencing of two bird genomes (Gunnison Sage-grouse, Centrocercus minimus, and Clark's Nutcracker, Nucifraga columbiana), which have considerably fewer SSRs than the python. We show that direct Illumina Seq-to-SSR can identify and characterize thousands of potentially amplifiable SSR loci for as little as $10 per sample - a fraction of the cost of 454 sequencing. Given that Illumina Seq-to-SSR is effective, inexpensive, and reliable even for species such as birds that have few SSR loci, it seems that there are now few situations for which prior hybridization is justifiable. ?? 2012 Castoe et al.
Rapid microsatellite identification from Illumina paired-end genomic sequencing in two birds and a snake.

PubMed

Castoe, Todd A; Poole, Alexander W; de Koning, A P Jason; Jones, Kenneth L; Tomback, Diana F; Oyler-McCance, Sara J; Fike, Jennifer A; Lance, Stacey L; Streicher, Jeffrey W; Smith, Eric N; Pollock, David D

2012-01-01

Identification of microsatellites, or simple sequence repeats (SSRs), can be a time-consuming and costly investment requiring enrichment, cloning, and sequencing of candidate loci. Recently, however, high throughput sequencing (with or without prior enrichment for specific SSR loci) has been utilized to identify SSR loci. The direct "Seq-to-SSR" approach has an advantage over enrichment-based strategies in that it does not require a priori selection of particular motifs, or prior knowledge of genomic SSR content. It has been more expensive per SSR locus recovered, however, particularly for genomes with few SSR loci, such as bird genomes. The longer but relatively more expensive 454 reads have been preferred over less expensive Illumina reads. Here, we use Illumina paired-end sequence data to identify potentially amplifiable SSR loci (PALs) from a snake (the Burmese python, Python molurus bivittatus), and directly compare these results to those from 454 data. We also compare the python results to results from Illumina sequencing of two bird genomes (Gunnison Sage-grouse, Centrocercus minimus, and Clark's Nutcracker, Nucifraga columbiana), which have considerably fewer SSRs than the python. We show that direct Illumina Seq-to-SSR can identify and characterize thousands of potentially amplifiable SSR loci for as little as $10 per sample--a fraction of the cost of 454 sequencing. Given that Illumina Seq-to-SSR is effective, inexpensive, and reliable even for species such as birds that have few SSR loci, it seems that there are now few situations for which prior hybridization is justifiable.
Microsatellite DNA as shared genetic markers among conifer species

Treesearch

C.S. Echt; G.G. Vendramin; C. D. Nelson; Paula E. Marquardt

1999-01-01

Polymerase chain reaction (PCR) primer pairs for 21 simple sequence repeat (SSR) loci in Pinus strobus L, and 6 in Pinus radiata D. Don were evaluated to determine whether SSR marker amplification could be achieved in 1O other conifer species. Eighty percent of SSR primer pairs for (AC) loci that were polymorphic in P. ...
Microsatellite DNA as shared genetic markers among conifer species

Treesearch

Craig S. Echt; G.G. Vendramin; C.D. Nelson; P. Marquardt

1999-01-01

Polymerase chain reaction (PCR) primer pairs for 21 simple sequence repeat (SSR) loci in Pinus strobus L. and 6 in Pinus radiata D. Don. were evaluated to determine whether SSR marker amplification could be achieved in 10 other conifer species. Eighty percent of SSR primer pairs for (AC)n loci that were polymorphic in P. ...
Detection and analysis of CRISPRs of Shigella.

PubMed

Guo, Xiangjiao; Wang, Yingfang; Duan, Guangcai; Xue, Zerun; Wang, Linlin; Wang, Pengfei; Qiu, Shaofu; Xi, Yuanlin; Yang, Haiyan

2015-01-01

The recently discovered CRISPRs (Clustered regularly interspaced short palindromic repeats) and Cas (CRISPR-associated) proteins are a novel genetic barrier that limits horizontal gene transfer in prokaryotes and the CRISPR loci provide a historical view of the exposure of prokaryotes to a variety of foreign genetic elements. The aim of study was to investigate the occurrence and distribution of the CRISPRs in Shigella. A collection of 61 strains of Shigella were screened for the existence of CRISPRs. Three CRISPR loci were identified among 61 shigella strains. CRISPR1/cas loci are detected in 49 strains of shigella. Yet, IS elements were detected in cas gene in some strains. In the remaining 12 Shigella flexneri strains, the CRISPR1/cas locus is deleted and only a cas3' pseudo gene and a repeat sequence are present. The presence of CRISPR2 is frequently accompanied by the emergence of CRISPR1. CRISPR3 loci were present in almost all strains (52/61). The length of CRISPR arrays varied from 1 to 9 spacers. Sequence analysis of the CRISPR arrays revealed that few spacers had matches in the GenBank databases. However, one spacer in CRISPR3 loci matches the cognate cas3 genes and no cas gene was present around CRISPR3 region. Analysis of CRISPR sequences show that CRISPR have little change which makes CRISPR poor genotyping markers. The present study is the first attempt to determine and analyze CRISPRs of shigella isolated from clinical patients.
A Simple Sequence Repeat- and Single-Nucleotide Polymorphism-Based Genetic Linkage Map of the Brown Planthopper, Nilaparvata lugens

PubMed Central

Jairin, Jirapong; Kobayashi, Tetsuya; Yamagata, Yoshiyuki; Sanada-Morimura, Sachiyo; Mori, Kazuki; Tashiro, Kosuke; Kuhara, Satoru; Kuwazaki, Seigo; Urio, Masahiro; Suetsugu, Yoshitaka; Yamamoto, Kimiko; Matsumura, Masaya; Yasui, Hideshi

2013-01-01

In this study, we developed the first genetic linkage map for the major rice insect pest, the brown planthopper (BPH, Nilaparvata lugens). The linkage map was constructed by integrating linkage data from two backcross populations derived from three inbred BPH strains. The consensus map consists of 474 simple sequence repeats, 43 single-nucleotide polymorphisms, and 1 sequence-tagged site, for a total of 518 markers at 472 unique positions in 17 linkage groups. The linkage groups cover 1093.9 cM, with an average distance of 2.3 cM between loci. The average number of marker loci per linkage group was 27.8. The sex-linkage group was identified by exploiting X-linked and Y-specific markers. Our linkage map and the newly developed markers used to create it constitute an essential resource and a useful framework for future genetic analyses in BPH. PMID:23204257
Haplogroup-specific deviation from the stepwise mutation model at the microsatellite loci DYS388 and DYS392.

PubMed

Nebel, A; Filon, D; Hohoff, C; Faerman, M; Brinkmann, B; Oppenheim, A

2001-01-01

Deviation from the stepwise mutation model (SMM) at specific human microsatellite loci has implications for population genetic and forensic investigations. In the present study, data on six Y chromosome-specific microsatellites were pooled for 455 paternally unrelated males from six Middle Eastern populations. All chromosomes were assigned to three haplogroups defined by six binary polymorphisms. Two of the microsatellite loci tested, DYS388 and DYS392, displayed marked haplogroup-specific differences in their allele variability. A bimodal distribution of short and long alleles was observed for DYS388 in haplogroup 1 and for DYS392 in haplogroups 1 and 2. Further investigation showed that the short/long alleles segregated almost completely between genealogically distinct haplogroups defined by additional binary markers. Thus, these two loci have a discriminatory power similar to a binary polymorphism. DYS388 was characterised by an extremely low mutation rate in haplogroups 2 and 3, as was DYS392 in haplogroup 3. Sequence analysis of the repeat regions at the two loci revealed no irregularities, indicating that the triplet expansion in these loci is not controlled by sequence variation at the repeat level. A high frequency of long DYS388 alleles has, so far, been found only in populations originating in the Middle East, suggesting that this microsatellite is useful as a region-specific marker.
No evidence of radiation effect on mutation rates at hypervariable minisatellite loci in the germ cells of atomic bomb survivors.

PubMed

Kodaira, Mieko; Izumi, Shizue; Takahashi, Norio; Nakamura, Nori

2004-10-01

Human minisatellites consist of tandem arrays of short repeat sequences, and some are highly polymorphic in numbers of repeats among individuals. Since these loci mutate much more frequently than coding sequences, they make attractive markers for screening populations for genetic effects of mutagenic agents. Here we report the results of our analysis of mutations at eight hypervariable minisatellite loci in the offspring (61 from exposed families in 60 of which only one parent was exposed, and 58 from unexposed parents) of atomic bomb survivors with mean doses of >1 Sv. We found 44 mutations in paternal alleles and eight mutations in maternal alleles with no indication that the high doses of acutely applied radiation had caused significant genetic effects. Our finding contrasts with those of some other studies in which much lower radiation doses, applied chronically, caused significantly increased mutation rates. Possible reasons for this discrepancy are discussed.
Short interspersed elements (SINEs) are a major source of canine genomic diversity.

PubMed

Wang, Wei; Kirkness, Ewen F

2005-12-01

SINEs are retrotransposons that have enjoyed remarkable reproductive success during the course of mammalian evolution, and have played a major role in shaping mammalian genomes. Previously, an analysis of survey-sequence data from an individual dog (a poodle) indicated that canine genomes harbor a high frequency of alleles that differ only by the absence or presence of a SINEC_Cf repeat. Comparison of this survey-sequence data with a draft genome sequence of a distinct dog (a boxer) has confirmed this prediction, and revealed the chromosomal coordinates for >10,000 loci that are bimorphic for SINEC_Cf insertions. Analysis of SINE insertion sites from the genomes of nine additional dogs indicates that 3%-5% are absent from either the poodle or boxer genome sequences--suggesting that an additional 10,000 bimorphic loci could be readily identified in the general dog population. We describe a methodology that can be used to identify these loci, and could be adapted to exploit these bimorphic loci for genotyping purposes. Approximately half of all annotated canine genes contain SINEC_Cf repeats, and these elements are occasionally transcribed. When transcribed in the antisense orientation, they provide splice acceptor sites that can result in incorporation of novel exons. The high frequency of bimorphic SINE insertions in the dog population is predicted to provide numerous examples of allele-specific transcription patterns that will be valuable for the study of differential gene expression among multiple dog breeds.
GENETIC DIVERSITY OF TYPHA LATIFOLIA (TYPHACEAE) AND THE IMPACT OF POLLUTANTS EXAMINED WITH TANDEM-REPETITIVE DNA PROBES

EPA Science Inventory

Genetic diversity at variable-number-tandem-repeat (VNTR) loci was examined in the common cattail, Typha latifolia (Typhaceae), using three synthetic DNA probes composed of tandemly repeated "core" sequences (GACA, GATA, and GCAC). The principal objectives of this investigation w...
Length and repeat-sequence variation in 58 STRs and 94 SNPs in two Spanish populations.

PubMed

Casals, Ferran; Anglada, Roger; Bonet, Núria; Rasal, Raquel; van der Gaag, Kristiaan J; Hoogenboom, Jerry; Solé-Morata, Neus; Comas, David; Calafell, Francesc

2017-09-01

We have genotyped the 58 STRs (27 autosomal, 24 Y-STRs and 7 X-STRs) and 94 autosomal SNPs in Illumina ForenSeq™ Primer Mix A in 88 Spanish Roma (Gypsy) samples and 143 Catalans. Since this platform is based in massive parallel sequencing, we have used simple R scripts to uncover the sequence variation in the repeat region. Thus, we have found, across 58 STRs, 541 length-based alleles, which, after considering repeat-sequence variation, became 804 different alleles. All loci in both populations were in Hardy-Weinberg equilibrium. F ST between both populations was 0.0178 for autosomal SNPs, 0.0146 for autosomal STRs, 0.0101 for X-STRs and 0.1866 for Y-STRs. Combined a priori statistics showed quite large; for instance, pooling all the autosomal loci, the a priori probabilities of discriminating a suspect become 1-(2.3×10 -70 ) and 1-(5.9×10 -73 ), for Roma and Catalans respectively, and the chances of excluding a false father in a trio are 1-(2.6×10 -20 ) and 1-(2.0×10 -21 ). Copyright © 2017 Elsevier B.V. All rights reserved.
Genotyping and Molecular Identification of Date Palm Cultivars Using Inter-Simple Sequence Repeat (ISSR) Markers.

PubMed

Ayesh, Basim M

2017-01-01

Molecular markers are credible for the discrimination of genotypes and estimation of the extent of genetic diversity and relatedness in a set of genotypes. Inter-simple sequence repeat (ISSR) markers rapidly reveal high polymorphic fingerprints and have been used frequently to determine the genetic diversity among date palm cultivars. This chapter describes the application of ISSR markers for genotyping of date palm cultivars. The application involves extraction of genomic DNA from the target cultivars with reliable quality and quantity. Subsequently the extracted DNA serves as a template for amplification of genomic regions flanked by inverted simple sequence repeats using a single primer. The similarity of each pair of samples is measured by calculating the number of mono- and polymorphic bands revealed by gel electrophoresis. Matrices constructed for similarity and genetic distance are used to build a phylogenetic tree and cluster analysis, to determine the molecular relatedness of cultivars. The protocol describes 3 out of 9 tested primers consistently amplified 31 loci in 6 date palm cultivars, with 28 polymorphic loci.
Chicken microsatellite markers isolated from libraries enriched for simple tandem repeats.

PubMed

Gibbs, M; Dawson, D A; McCamley, C; Wardle, A F; Armour, J A; Burke, T

1997-12-01

The total number of microsatellite loci is considered to be at least 10-fold lower in avian species than in mammalian species. Therefore, efficient large-scale cloning of chicken microsatellites, as required for the construction of a high-resolution linkage map, is facilitated by the construction of libraries using an enrichment strategy. In this study, a plasmid library enriched for tandem repeats was constructed from chicken genomic DNA by hybridization selection. Using this technique the proportion of recombinant clones that cross-hybridized to probes containing simple tandem repeats was raised to 16%, compared with < 0.1% in a non-enriched library. Primers were designed from 121 different sequences. Polymerase chain reaction (PCR) analysis of two chicken reference pedigrees enabled 72 loci to be localized within the collaborative chicken genetic map, and at least 30 of the remaining loci have been shown to be informative in these or other crosses.
New development and validation of 50 SSR markers in breadfruit (Artocarpus altilis, Moraceae) by next-generation sequencing.

PubMed

De Bellis, Fabien; Malapa, Roger; Kagy, Valérie; Lebegin, Stéphane; Billot, Claire; Labouisse, Jean-Pierre

2016-08-01

Using next-generation sequencing technology, new microsatellite loci were characterized in Artocarpus altilis (Moraceae) and two congeners to increase the number of available markers for genotyping breadfruit cultivars. A total of 47,607 simple sequence repeat loci were obtained by sequencing a library of breadfruit genomic DNA with an Illumina MiSeq system. Among them, 50 single-locus markers were selected and assessed using 41 samples (39 A. altilis, one A. camansi, and one A. heterophyllus). All loci were polymorphic in A. altilis, 44 in A. camansi, and 21 in A. heterophyllus. The number of alleles per locus ranged from two to 19. The new markers will be useful for assessing the identity and genetic diversity of breadfruit cultivars on a small geographical scale, gaining a better understanding of farmer management practices, and will help to optimize breadfruit genebank management.
Preliminary Genomic Characterization of Ten Hardwood Tree Species from Multiplexed Low Coverage Whole Genome Sequencing

PubMed Central

Staton, Margaret; Best, Teodora; Khodwekar, Sudhir; Owusu, Sandra; Xu, Tao; Xu, Yi; Jennings, Tara; Cronn, Richard; Arumuganathan, A. Kathiravetpilla; Coggeshall, Mark; Gailing, Oliver; Liang, Haiying; Romero-Severson, Jeanne; Schlarbaum, Scott; Carlson, John E.

2015-01-01

Forest health issues are on the rise in the United States, resulting from introduction of alien pests and diseases, coupled with abiotic stresses related to climate change. Increasingly, forest scientists are finding genetic/genomic resources valuable in addressing forest health issues. For a set of ten ecologically and economically important native hardwood tree species representing a broad phylogenetic spectrum, we used low coverage whole genome sequencing from multiplex Illumina paired ends to economically profile their genomic content. For six species, the genome content was further analyzed by flow cytometry in order to determine the nuclear genome size. Sequencing yielded a depth of 0.8X to 7.5X, from which in silico analysis yielded preliminary estimates of gene and repetitive sequence content in the genome for each species. Thousands of genomic SSRs were identified, with a clear predisposition toward dinucleotide repeats and AT-rich repeat motifs. Flanking primers were designed for SSR loci for all ten species, ranging from 891 loci in sugar maple to 18,167 in redbay. In summary, we have demonstrated that useful preliminary genome information including repeat content, gene content and useful SSR markers can be obtained at low cost and time input from a single lane of Illumina multiplex sequence. PMID:26698853
Short-read, high-throughput sequencing technology for STR genotyping

PubMed Central

Bornman, Daniel M.; Hester, Mark E.; Schuetter, Jared M.; Kasoji, Manjula D.; Minard-Smith, Angela; Barden, Curt A.; Nelson, Scott C.; Godbold, Gene D.; Baker, Christine H.; Yang, Boyu; Walther, Jacquelyn E.; Tornes, Ivan E.; Yan, Pearlly S.; Rodriguez, Benjamin; Bundschuh, Ralf; Dickens, Michael L.; Young, Brian A.; Faith, Seth A.

2013-01-01

DNA-based methods for human identification principally rely upon genotyping of short tandem repeat (STR) loci. Electrophoretic-based techniques for variable-length classification of STRs are universally utilized, but are limited in that they have relatively low throughput and do not yield nucleotide sequence information. High-throughput sequencing technology may provide a more powerful instrument for human identification, but is not currently validated for forensic casework. Here, we present a systematic method to perform high-throughput genotyping analysis of the Combined DNA Index System (CODIS) STR loci using short-read (150 bp) massively parallel sequencing technology. Open source reference alignment tools were optimized to evaluate PCR-amplified STR loci using a custom designed STR genome reference. Evaluation of this approach demonstrated that the 13 CODIS STR loci and amelogenin (AMEL) locus could be accurately called from individual and mixture samples. Sensitivity analysis showed that as few as 18,500 reads, aligned to an in silico referenced genome, were required to genotype an individual (>99% confidence) for the CODIS loci. The power of this technology was further demonstrated by identification of variant alleles containing single nucleotide polymorphisms (SNPs) and the development of quantitative measurements (reads) for resolving mixed samples. PMID:25621315
Bacterial CRISPR Regions: General Features and their Potential for Epidemiological Molecular Typing Studies.

PubMed

Karimi, Zahra; Ahmadi, Ali; Najafi, Ali; Ranjbar, Reza

2018-01-01

CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) loci as novel and applicable regions in prokaryotic genomes have gained great attraction in the post genomics era. These unique regions are diverse in number and sequence composition in different pathogenic bacteria and thereby can be a suitable candidate for molecular epidemiology and genotyping studies. Results:Furthermore, the arrayed structure of CRISPR loci (several unique repeats spaced with the variable sequence) and associated cas genes act as an active prokaryotic immune system against viral replication and conjugative elements. This property can be used as a tool for RNA editing in bioengineering studies. The aim of this review was to survey some details about the history, nature, and potential applications of CRISPR arrays in both genetic engineering and bacterial genotyping studies.
Dinucleotide repeat polymorphisms in waterfowl (family Anatidae): Characterization of a sex-linked (Z-specific) and 14 autosomal loci

USGS Publications Warehouse

Buchholz, W.G.; Pearce, J.M.; Pierson, B.J.; Scribner, K.T.

1998-01-01

Canada goose (Branta Canadensis) and harlequin duck (Histrionicus histrionicus) DNAs were digested with Sau3AI, and size selected (300-700 bp) fragments were ligated into BamHI-digested pBluscriptII KS+. The enrichment protocol of Ostrander et al.1 was followed. The resulting libraries were screened using a [ƴ-32P]ATP end-labelled (CA)20 oligonucleotides as a hybridization probe. Positive clones were sequenced using cycle-sequencing protocols (Epicentre Technologies, Madison, WI) and primers flanking the inserts. PCR primers were designed to amplify the repeat and yield amplification products of ≈100-200 bp. DNA samples were screened for variation at these loci using [ƴ-32P]ATP end-labelled primers. The products were resolved using 6% denaturing polyacrylamide gels and autoradiography.

A maize map standard with sequenced core markers, grass genome reference points and 932 expressed sequence tagged sites (ESTs) in a 1736-locus map.

PubMed Central

Davis, G L; McMullen, M D; Baysdorfer, C; Musket, T; Grant, D; Staebell, M; Xu, G; Polacco, M; Koster, L; Melia-Hancock, S; Houchins, K; Chao, S; Coe, E H

1999-01-01

We have constructed a 1736-locus maize genome map containing1156 loci probed by cDNAs, 545 probed by random genomic clones, 16 by simple sequence repeats (SSRs), 14 by isozymes, and 5 by anonymous clones. Sequence information is available for 56% of the loci with 66% of the sequenced loci assigned functions. A total of 596 new ESTs were mapped from a B73 library of 5-wk-old shoots. The map contains 237 loci probed by barley, oat, wheat, rice, or tripsacum clones, which serve as grass genome reference points in comparisons between maize and other grass maps. Ninety core markers selected for low copy number, high polymorphism, and even spacing along the chromosome delineate the 100 bins on the map. The average bin size is 17 cM. Use of bin assignments enables comparison among different maize mapping populations and experiments including those involving cytogenetic stocks, mutants, or quantitative trait loci. Integration of nonmaize markers in the map extends the resources available for gene discovery beyond the boundaries of maize mapping information into the expanse of map, sequence, and phenotype information from other grass species. This map provides a foundation for numerous basic and applied investigations including studies of gene organization, gene and genome evolution, targeted cloning, and dissection of complex traits. PMID:10388831
[Comparative analysis of clustered regularly interspaced short palindromic repeats (CRISPRs) loci in the genomes of halophilic archaea].

PubMed

Zhang, Fan; Zhang, Bing; Xiang, Hua; Hu, Songnian

2009-11-01

Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR) is a widespread system that provides acquired resistance against phages in bacteria and archaea. Here we aim to genome-widely analyze the CRISPR in extreme halophilic archaea, of which the whole genome sequences are available at present time. We used bioinformatics methods including alignment, conservation analysis, GC content and RNA structure prediction to analyze the CRISPR structures of 7 haloarchaeal genomes. We identified the CRISPR structures in 5 halophilic archaea and revealed a conserved palindromic motif in the flanking regions of these CRISPR structures. In addition, we found that the repeat sequences of large CRISPR structures in halophilic archaea were greatly conserved, and two types of predicted RNA secondary structures derived from the repeat sequences were likely determined by the fourth base of the repeat sequence. Our results support the proposal that the leader sequence may function as recognition site by having palindromic structures in flanking regions, and the stem-loop secondary structure formed by repeat sequences may function in mediating the interaction between foreign genetic elements and CAS-encoded proteins.
RNA-seq analysis and de novo transcriptome assembly of Jerusalem artichoke (Helianthus tuberosus Linne).

PubMed

Jung, Won Yong; Lee, Sang Sook; Kim, Chul Wook; Kim, Hyun-Soon; Min, Sung Ran; Moon, Jae Sun; Kwon, Suk-Yoon; Jeon, Jae-Heung; Cho, Hye Sun

2014-01-01

Jerusalem artichoke (Helianthus tuberosus L.) has long been cultivated as a vegetable and as a source of fructans (inulin) for pharmaceutical applications in diabetes and obesity prevention. However, transcriptomic and genomic data for Jerusalem artichoke remain scarce. In this study, Illumina RNA sequencing (RNA-Seq) was performed on samples from Jerusalem artichoke leaves, roots, stems and two different tuber tissues (early and late tuber development). Data were used for de novo assembly and characterization of the transcriptome. In total 206,215,632 paired-end reads were generated. These were assembled into 66,322 loci with 272,548 transcripts. Loci were annotated by querying against the NCBI non-redundant, Phytozome and UniProt databases, and 40,215 loci were homologous to existing database sequences. Gene Ontology terms were assigned to 19,848 loci, 15,434 loci were matched to 25 Clusters of Eukaryotic Orthologous Groups classifications, and 11,844 loci were classified into 142 Kyoto Encyclopedia of Genes and Genomes pathways. The assembled loci also contained 10,778 potential simple sequence repeats. The newly assembled transcriptome was used to identify loci with tissue-specific differential expression patterns. In total, 670 loci exhibited tissue-specific expression, and a subset of these were confirmed using RT-PCR and qRT-PCR. Gene expression related to inulin biosynthesis in tuber tissue was also investigated. Exsiting genetic and genomic data for H. tuberosus are scarce. The sequence resources developed in this study will enable the analysis of thousands of transcripts and will thus accelerate marker-assisted breeding studies and studies of inulin biosynthesis in Jerusalem artichoke.
Novel microsatellite markers for the oriental fruit moth Grapholita molesta (Lepidoptera: Tortricidae) and effects of null alleles on population genetics analyses.

PubMed

Song, W; Cao, L-J; Wang, Y-Z; Li, B-Y; Wei, S-J

2017-06-01

The oriental fruit moth (OFM) Grapholita molesta (Lepidoptera: Tortricidae) is an important economic pest of stone and pome fruits worldwide. We sequenced the OFM genome using next-generation sequencing and characterized the microsatellite distribution. In total, 56,674 microsatellites were identified, with 11,584 loci suitable for primer design. Twenty-seven polymorphic microsatellites, including 24 loci with trinucleotide repeat and three with pentanucleotide repeat, were validated in 95 individuals from four natural populations. The allele numbers ranged from 4 to 40, with an average value of 13.7 per locus. A high frequency of null alleles was observed in most loci developed for the OFM. Three marker panels, all of the loci, nine loci with the lowest null allele frequencies, and nine loci with the highest null allele frequencies, were established for population genetics analyses. The null allele influenced estimations of genetic diversity parameters but not the OFM's genetic structure. Both a STRUCTURE analysis and a discriminant analysis of principal components, using the three marker panels, divided the four natural populations into three groups. However, more individuals were incorrectly assigned by the STRUCTURE analysis when the marker panel with the highest null allele frequency was used compared with the other two panels. Our study provides empirical research on the effects of null alleles on population genetics analyses. The microsatellites developed will be valuable markers for genetic studies of the OFM.
New Multilocus Variable-Number Tandem-Repeat Analysis Tool for Surveillance and Local Epidemiology of Bacterial Leaf Blight and Bacterial Leaf Streak of Rice Caused by Xanthomonas oryzae

PubMed Central

Poulin, L.; Grygiel, P.; Magne, M.; Rodriguez-R, L. M.; Forero Serna, N.; Zhao, S.; El Rafii, M.; Dao, S.; Tekete, C.; Wonni, I.; Koita, O.; Pruvost, O.; Verdier, V.; Vernière, C.

2014-01-01

Multilocus variable-number tandem-repeat analysis (MLVA) is efficient for routine typing and for investigating the genetic structures of natural microbial populations. Two distinct pathovars of Xanthomonas oryzae can cause significant crop losses in tropical and temperate rice-growing countries. Bacterial leaf streak is caused by X. oryzae pv. oryzicola, and bacterial leaf blight is caused by X. oryzae pv. oryzae. For the latter, two genetic lineages have been described in the literature. We developed a universal MLVA typing tool both for the identification of the three X. oryzae genetic lineages and for epidemiological analyses. Sixteen candidate variable-number tandem-repeat (VNTR) loci were selected according to their presence and polymorphism in 10 draft or complete genome sequences of the three X. oryzae lineages and by VNTR sequencing of a subset of loci of interest in 20 strains per lineage. The MLVA-16 scheme was then applied to 338 strains of X. oryzae representing different pathovars and geographical locations. Linkage disequilibrium between MLVA loci was calculated by index association on different scales, and the 16 loci showed linear Mantel correlation with MLSA data on 56 X. oryzae strains, suggesting that they provide a good phylogenetic signal. Furthermore, analyses of sets of strains for different lineages indicated the possibility of using the scheme for deeper epidemiological investigation on small spatial scales. PMID:25398857
Genetic diversity and gene differentiation among ten species of Zingiberaceae from Eastern India.

PubMed

Mohanty, Sujata; Panda, Manoj Kumar; Acharya, Laxmikanta; Nayak, Sanghamitra

2014-08-01

In the present study, genetic fingerprints of ten species of Zingiberaceae from eastern India were developed using PCR-based markers. 19 RAPD (Rapid Amplified polymorphic DNA), 8 ISSR (Inter Simple Sequence Repeats) and 8 SSR (Simple Sequence Repeats) primers were used to elucidate genetic diversity important for utilization, management and conservation. These primers produced 789 loci, out of which 773 loci were polymorphic (including 220 unique loci) and 16 monomorphic loci. Highest number of bands amplified (263) in Curcuma caesia whereas lowest (209) in Zingiber cassumunar. Though all the markers discriminated the species effectively, analysis of combined data of all markers resulted in better distinction of individual species. Highest number of loci was amplified with SSR primers with resolving power in a range of 17.4-39. Dendrogram based on three molecular data using unweighted pair group method with arithmetic mean classified all the species into two clusters. Mantle matrix correspondence test revealed high matrix correlation in all the cases. Correlation values for RAPD, ISSR and SSR were 0.797, 0.84 and 0.8, respectively, with combined data. In both the genera wild and cultivated species were completely separated from each other at genomic level. It also revealed distinct genetic identity between species of Curcuma and Zingiber. High genetic diversity documented in the present study provides a baseline data for optimization of conservation and breeding programme of the studied zingiberacious species.
Bacterial CRISPR Regions: General Features and their Potential for Epidemiological Molecular Typing Studies

PubMed Central

Karimi, Zahra; Ahmadi, Ali; Najafi, Ali; Ranjbar, Reza

2018-01-01

Introduction: CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) loci as novel and applicable regions in prokaryotic genomes have gained great attraction in the post genomics era. Methods: These unique regions are diverse in number and sequence composition in different pathogenic bacteria and thereby can be a suitable candidate for molecular epidemiology and genotyping studies. Results:Furthermore, the arrayed structure of CRISPR loci (several unique repeats spaced with the variable sequence) and associated cas genes act as an active prokaryotic immune system against viral replication and conjugative elements. This property can be used as a tool for RNA editing in bioengineering studies. Conclusion: The aim of this review was to survey some details about the history, nature, and potential applications of CRISPR arrays in both genetic engineering and bacterial genotyping studies. PMID:29755603
Using Next Generation RAD Sequencing to Isolate Multispecies Microsatellites for Pilosocereus (Cactaceae).

PubMed

Bonatelli, Isabel A S; Carstens, Bryan C; Moraes, Evandro M

2015-01-01

Microsatellite markers (also known as SSRs, Simple Sequence Repeats) are widely used in plant science and are among the most informative molecular markers for population genetic investigations, but the development of such markers presents substantial challenges. In this report, we discuss how next generation sequencing can replace the cloning, Sanger sequencing, identification of polymorphic loci, and testing cross-amplification that were previously required to develop microsatellites. We report the development of a large set of microsatellite markers for five species of the Neotropical cactus genus Pilosocereus using a restriction-site-associated DNA sequencing (RAD-seq) on a Roche 454 platform. We identified an average of 165 microsatellites per individual, with the absolute numbers across individuals proportional to the sequence reads obtained per individual. Frequency distribution of the repeat units was similar in the five species, with shorter motifs such as di- and trinucleotide being the most abundant repeats. In addition, we provide 72 microsatellites that could be potentially amplified in the sampled species and 22 polymorphic microsatellites validated in two populations of the species Pilosocereus machrisii. Although low coverage sequencing among individuals was observed for most of the loci, which we suggest to be more related to the nature of the microsatellite markers and the possible bias inserted by the restriction enzymes than to the genome size, our work demonstrates that an NGS approach is an efficient method to isolate multispecies microsatellites even in non-model organisms.
Using Next Generation RAD Sequencing to Isolate Multispecies Microsatellites for Pilosocereus (Cactaceae)

PubMed Central

Bonatelli, Isabel A. S.; Carstens, Bryan C.; Moraes, Evandro M.

2015-01-01

Microsatellite markers (also known as SSRs, Simple Sequence Repeats) are widely used in plant science and are among the most informative molecular markers for population genetic investigations, but the development of such markers presents substantial challenges. In this report, we discuss how next generation sequencing can replace the cloning, Sanger sequencing, identification of polymorphic loci, and testing cross-amplification that were previously required to develop microsatellites. We report the development of a large set of microsatellite markers for five species of the Neotropical cactus genus Pilosocereus using a restriction-site-associated DNA sequencing (RAD-seq) on a Roche 454 platform. We identified an average of 165 microsatellites per individual, with the absolute numbers across individuals proportional to the sequence reads obtained per individual. Frequency distribution of the repeat units was similar in the five species, with shorter motifs such as di- and trinucleotide being the most abundant repeats. In addition, we provide 72 microsatellites that could be potentially amplified in the sampled species and 22 polymorphic microsatellites validated in two populations of the species Pilosocereus machrisii. Although low coverage sequencing among individuals was observed for most of the loci, which we suggest to be more related to the nature of the microsatellite markers and the possible bias inserted by the restriction enzymes than to the genome size, our work demonstrates that an NGS approach is an efficient method to isolate multispecies microsatellites even in non-model organisms. PMID:26561396
Discrimination of candidate subgenome-specific loci by linkage map construction with an S1 population of octoploid strawberry (Fragaria × ananassa).

PubMed

Nagano, Soichiro; Shirasawa, Kenta; Hirakawa, Hideki; Maeda, Fumi; Ishikawa, Masami; Isobe, Sachiko N

2017-05-12

The strawberry, Fragaria × ananassa, is an allo-octoploid (2n = 8x = 56) and outcrossing species. Although it is the most widely consumed berry crop in the world, its complex genome structure has hindered its genetic and genomic analysis, and thus discrimination of subgenome-specific loci among the homoeologous chromosomes is needed. In the present study, we identified candidate subgenome-specific single nucleotide polymorphism (SNP) and simple sequence repeat (SSR) loci, and constructed a linkage map using an S 1 mapping population of the cultivar 'Reikou' with an IStraw90 Axiom® SNP array and previously published SSR markers. The 'Reikou' linkage map consisted of 11,574 loci (11,002 SNPs and 572 SSR loci) spanning 2816.5 cM of 31 linkage groups. The 11,574 loci were located on 4738 unique positions (bin) on the linkage map. Of the mapped loci, 8999 (8588 SNPs and 411 SSR loci) showed a 1:2:1 segregation ratio of AA:AB:BB allele, which suggested the possibility of deriving loci from candidate subgenome-specific sequences. In addition, 2575 loci (2414 SNPs and 161 SSR loci) showed a 3:1 segregation of AB:BB allele, indicating they were derived from homoeologous genomic sequences. Comparative analysis of the homoeologous linkage groups revealed differences in genome structure among the subgenomes. Our results suggest that candidate subgenome-specific loci are randomly located across the genomes, and that there are small- to large-scale structural variations among the subgenomes. The mapped SNPs and SSR loci on the linkage map are expected to be seed points for the construction of pseudomolecules in the octoploid strawberry.
Isolation and characterization of microsatellite markers in Fraser fir (Abies fraseri)

Treesearch

S.A. Josserand; K.M. Potter; G. Johnson; J.A. Bowen; J. Frampton; C.D. Nelson

2006-01-01

We describe the isolation and characterization of 14 microsatellite loci from Fraser fir (Abies fraseri). These markers originated from cloned inserts enriched for DNA sequences containing tandem di- and tri-nucleotide repeats. In total, 36 clones were selected, sequenced and evaluated. Polymerase chain reaction (PCR) primers for 14 of these...
Human minisatellite alleles detectable only after PCR amplification.

PubMed

Armour, J A; Crosier, M; Jeffreys, A J

1992-01-01

We present evidence that a proportion of alleles at two human minisatellite loci is undetected by standard Southern blot hybridization. In each case the missing allele(s) can be identified after PCR amplification and correspond to tandem arrays too short to detect by hybridization. At one locus, there is only one undetected allele (population frequency 0.3), which contains just three repeat units. At the second locus, there are at least five undetected alleles (total population frequency 0.9) containing 60-120 repeats; they are not detected because these tandem repeats give very poor signals when used as a probe in standard Southern blot hybridization, and also cross-hybridize with other sequences in the genome. Under these circumstances only signals from the longest tandemly repeated alleles are detectable above the nonspecific background. The structures of these loci have been compared in human and primate DNA, and at one locus the short human allele containing three repeat units is shown to be an intermediate state in the expansion of a monomeric precursor allele in primates to high copy number in the longer human arrays. We discuss the implications of such loci for studies of human populations, minisatellite isolation by cloning, and the evolution of highly variable tandem arrays.
Identification and characterization of the highly polymorphic locus D14S739 in the Han Chinese population

PubMed Central

Shao, Chengchen; Zhang, Yaqi; Zhou, Yueqin; Zhu, Wei; Xu, Hongmei; Liu, Zhiping; Tang, Qiqun; Shen, Yiwen; Xie, Jianhui

2015-01-01

Aim To systemically select and evaluate short tandem repeats (STRs) on the chromosome 14 and obtain new STR loci as expanded genotyping markers for forensic application. Methods STRs on the chromosome 14 were filtered from Tandem Repeats Database and further selected based on their positions on the chromosome, repeat patterns of the core sequences, sequence homology of the flanking regions, and suitability of flanking regions in primer design. The STR locus with the highest heterozygosity and polymorphism information content (PIC) was selected for further analysis of genetic polymorphism, forensic parameters, and the core sequence. Results Among 26 STR loci selected as candidates, D14S739 had the highest heterozygosity (0.8691) and PIC (0.8432), and showed no deviation from the Hardy-Weinberg equilibrium. 14 alleles were observed, ranging in size from 21 to 34 tetranucleotide units in the core region of (GATA)9-18 (GACA)7-12 GACG (GACA)2 GATA. Paternity testing showed no mutations. Conclusion D14S739 is a highly informative STR locus and could be a suitable genetic marker for forensic applications in the Han Chinese population. PMID:26526885
Seven New Complete Plastome Sequences Reveal Rampant Independent Loss of the ndh Gene Family across Orchids and Associated Instability of the Inverted Repeat/Small Single-Copy Region Boundaries.

PubMed

Kim, Hyoung Tae; Kim, Jung Sung; Moore, Michael J; Neubig, Kurt M; Williams, Norris H; Whitten, W Mark; Kim, Joo-Hwan

2015-01-01

Earlier research has revealed that the ndh loci have been pseudogenized, truncated, or deleted from most orchid plastomes sequenced to date, including in all available plastomes of the two most species-rich subfamilies, Orchidoideae and Epidendroideae. This study sought to resolve deeper-level phylogenetic relationships among major orchid groups and to refine the history of gene loss in the ndh loci across orchids. The complete plastomes of seven orchids, Oncidium sphacelatum (Epidendroideae), Masdevallia coccinea (Epidendroideae), Sobralia callosa (Epidendroideae), Sobralia aff. bouchei (Epidendroideae), Elleanthus sodiroi (Epidendroideae), Paphiopedilum armeniacum (Cypripedioideae), and Phragmipedium longifolium (Cypripedioideae) were sequenced and analyzed in conjunction with all other available orchid and monocot plastomes. Most ndh loci were found to be pseudogenized or lost in Oncidium, Paphiopedilum and Phragmipedium, but surprisingly, all ndh loci were found to retain full, intact reading frames in Sobralia, Elleanthus and Masdevallia. Character mapping suggests that the ndh genes were present in the common ancestor of orchids but have experienced independent, significant losses at least eight times across four subfamilies. In addition, ndhF gene loss was correlated with shifts in the position of the junction of the inverted repeat (IR) and small single-copy (SSC) regions. The Orchidaceae have unprecedented levels of homoplasy in ndh gene presence/absence, which may be correlated in part with the unusual life history of orchids. These results also suggest that ndhF plays a role in IR/SSC junction stability.
New development and validation of 50 SSR markers in breadfruit (Artocarpus altilis, Moraceae) by next-generation sequencing1

PubMed Central

De Bellis, Fabien; Malapa, Roger; Kagy, Valérie; Lebegin, Stéphane; Billot, Claire; Labouisse, Jean-Pierre

2016-01-01

Premise of the study: Using next-generation sequencing technology, new microsatellite loci were characterized in Artocarpus altilis (Moraceae) and two congeners to increase the number of available markers for genotyping breadfruit cultivars. Methods and Results: A total of 47,607 simple sequence repeat loci were obtained by sequencing a library of breadfruit genomic DNA with an Illumina MiSeq system. Among them, 50 single-locus markers were selected and assessed using 41 samples (39 A. altilis, one A. camansi, and one A. heterophyllus). All loci were polymorphic in A. altilis, 44 in A. camansi, and 21 in A. heterophyllus. The number of alleles per locus ranged from two to 19. Conclusions: The new markers will be useful for assessing the identity and genetic diversity of breadfruit cultivars on a small geographical scale, gaining a better understanding of farmer management practices, and will help to optimize breadfruit genebank management. PMID:27610273
Construction of a genetic linkage map and analysis of quantitative trait loci associated with the agronomically important traits of Pleurotus eryngii

Treesearch

Chak Han Im; Young-Hoon Park; Kenneth E. Hammel; Bokyung Park; Soon Wook Kwon; Hojin Ryu; Jae-San Ryu

2016-01-01

Breeding new strains with improved traits is a long-standing goal of mushroom breeders that can be expedited by marker-assisted selection (MAS). We constructed a genetic linkage map of Pleurotus eryngii based on segregation analysis of markers in postmeiotic monokaryons from KNR2312. In total, 256 loci comprising 226 simple sequence-repeat (SSR) markers, 2 mating-type...
Rapid development of microsatellite markers for the endangered fish Schizothorax biddulphi (Günther) using next generation sequencing and cross-species amplification.

PubMed

Luo, Wei; Nie, Zhulan; Zhan, Fanbin; Wei, Jie; Wang, Weimin; Gao, Zexia

2012-11-14

Tarim schizothoracin (Schizothorax biddulphi) is an endemic fish species native to the Tarim River system of Xinjiang and has been classified as an extremely endangered freshwater fish species in China. Here, we used a next generation sequencing platform (ion torrent PGM™) to obtain a large number of microsatellites for S. biddulphi, for the first time. A total of 40577 contigs were assembled, which contained 1379 SSRs. In these SSRs, the number of dinucleotide repeats were the most frequent (77.08%) and AC repeats were the most frequently occurring microsatellite, followed by AG, AAT and AT. Fifty loci were randomly selected for primer development; of these, 38 loci were successfully amplified and 29 loci were polymorphic across panels of 30 individuals. The H(o) ranged from 0.15 to 0.83, and H(e) ranged from 0.15 to 0.85, with 3.5 alleles per locus on average. Cross-species utility indicated that 20 of these markers were successfully amplified in a related, also an endangered fish species, S. irregularis. This study suggests that PGM™ sequencing is a rapid and cost-effective tool for developing microsatellite markers for non-model species and the developed microsatellite markers in this study would be useful in Schizothorax genetic analysis.
Conserved DNA motifs in the type II-A CRISPR leader region.

PubMed

Van Orden, Mason J; Klein, Peter; Babu, Kesavan; Najar, Fares Z; Rajan, Rakhi

2017-01-01

The Clustered Regularly Interspaced Short Palindromic Repeats associated (CRISPR-Cas) systems consist of RNA-protein complexes that provide bacteria and archaea with sequence-specific immunity against bacteriophages, plasmids, and other mobile genetic elements. Bacteria and archaea become immune to phage or plasmid infections by inserting short pieces of the intruder DNA (spacer) site-specifically into the leader-repeat junction in a process called adaptation. Previous studies have shown that parts of the leader region, especially the 3' end of the leader, are indispensable for adaptation. However, a comprehensive analysis of leader ends remains absent. Here, we have analyzed the leader, repeat, and Cas proteins from 167 type II-A CRISPR loci. Our results indicate two distinct conserved DNA motifs at the 3' leader end: ATTTGAG (noted previously in the CRISPR1 locus of Streptococcus thermophilus DGCC7710) and a newly defined CTRCGAG, associated with the CRISPR3 locus of S. thermophilus DGCC7710. A third group with a very short CG DNA conservation at the 3' leader end is observed mostly in lactobacilli. Analysis of the repeats and Cas proteins revealed clustering of these CRISPR components that mirrors the leader motif clustering, in agreement with the coevolution of CRISPR-Cas components. Based on our analysis of the type II-A CRISPR loci, we implicate leader end sequences that could confer site-specificity for the adaptation-machinery in the different subsets of type II-A CRISPR loci.
Conserved DNA motifs in the type II-A CRISPR leader region

PubMed Central

Babu, Kesavan; Najar, Fares Z.

2017-01-01

The Clustered Regularly Interspaced Short Palindromic Repeats associated (CRISPR-Cas) systems consist of RNA-protein complexes that provide bacteria and archaea with sequence-specific immunity against bacteriophages, plasmids, and other mobile genetic elements. Bacteria and archaea become immune to phage or plasmid infections by inserting short pieces of the intruder DNA (spacer) site-specifically into the leader-repeat junction in a process called adaptation. Previous studies have shown that parts of the leader region, especially the 3′ end of the leader, are indispensable for adaptation. However, a comprehensive analysis of leader ends remains absent. Here, we have analyzed the leader, repeat, and Cas proteins from 167 type II-A CRISPR loci. Our results indicate two distinct conserved DNA motifs at the 3′ leader end: ATTTGAG (noted previously in the CRISPR1 locus of Streptococcus thermophilus DGCC7710) and a newly defined CTRCGAG, associated with the CRISPR3 locus of S. thermophilus DGCC7710. A third group with a very short CG DNA conservation at the 3′ leader end is observed mostly in lactobacilli. Analysis of the repeats and Cas proteins revealed clustering of these CRISPR components that mirrors the leader motif clustering, in agreement with the coevolution of CRISPR-Cas components. Based on our analysis of the type II-A CRISPR loci, we implicate leader end sequences that could confer site-specificity for the adaptation-machinery in the different subsets of type II-A CRISPR loci. PMID:28392985
Multiple-Locus Variable-Number Tandem-Repeat Analysis in Genotyping Yersinia enterocolitica Strains from Human and Porcine Origins

PubMed Central

Laukkanen-Ninios, R.; Ortiz Martínez, P.; Siitonen, A.; Fredriksson-Ahomaa, M.; Korkeala, H.

2013-01-01

Sporadic and epidemiologically linked Yersinia enterocolitica strains (n = 379) isolated from fecal samples from human patients, tonsil or fecal samples from pigs collected at slaughterhouses, and pork samples collected at meat stores were genotyped using multiple-locus variable-number tandem-repeat analysis (MLVA) with six loci, i.e., V2A, V4, V5, V6, V7, and V9. In total, 312 different MLVA types were found. Similar types were detected (i) in fecal samples collected from human patients over 2 to 3 consecutive years, (ii) in samples from humans and pigs, and (iii) in samples from pigs that originated from the same farms. Among porcine strains, we found farm-specific MLVA profiles. Variations in the numbers of tandem repeats from one to four for variable-number tandem-repeat (VNTR) loci V2A, V5, V6, and V7 were observed within a farm. MLVA was applicable for serotypes O:3, O:5,27, and O:9 and appeared to be a highly discriminating tool for distinguishing sporadic and outbreak-related strains. With long-term use, interpretation of the results became more challenging due to variations in more-discriminating loci, as was observed for strains originating from pig farms. Additionally, we encountered unexpectedly short V2A VNTR fragments and sequenced them. According to the sequencing results, updated guidelines for interpreting V2A VNTR results were prepared. PMID:23637293

Analysis of simple sequence repeat (SSR) structure and sequence within Epichloë endophyte genomes reveals impacts on gene structure and insights into ancestral hybridization events.

PubMed

Clayton, William; Eaton, Carla Jane; Dupont, Pierre-Yves; Gillanders, Tim; Cameron, Nick; Saikia, Sanjay; Scott, Barry

2017-01-01

Epichloë grass endophytes comprise a group of filamentous fungi of both sexual and asexual species. Known for the beneficial characteristics they endow upon their grass hosts, the identification of these endophyte species has been of great interest agronomically and scientifically. The use of simple sequence repeat loci and the variation in repeat elements has been used to rapidly identify endophyte species and strains, however, little is known of how the structure of repeat elements changes between species and strains, and where these repeat elements are located in the fungal genome. We report on an in-depth analysis of the structure and genomic location of the simple sequence repeat locus B10, commonly used for Epichloë endophyte species identification. The B10 repeat was found to be located within an exon of a putative bZIP transcription factor, suggesting possible impacts on polypeptide sequence and thus protein function. Analysis of this repeat in the asexual endophyte hybrid Epichloë uncinata revealed that the structure of B10 alleles reflects the ancestral species that hybridized to give rise to this species. Understanding the structure and sequence of these simple sequence repeats provides a useful set of tools for readily distinguishing strains and for gaining insights into the ancestral species that have undergone hybridization events.
Outlier Loci and Selection Signatures of Simple Sequence Repeats (SSRs) in Flax (Linum usitatissimum L.).

PubMed

Soto-Cerda, Braulio J; Cloutier, Sylvie

2013-01-01

Genomic microsatellites (gSSRs) and expressed sequence tag-derived SSRs (EST-SSRs) have gained wide application for elucidating genetic diversity and population structure in plants. Both marker systems are assumed to be selectively neutral when making demographic inferences, but this assumption is rarely tested. In this study, three neutrality tests were assessed for identifying outlier loci among 150 SSRs (85 gSSRs and 65 EST-SSRs) that likely influence estimates of population structure in three differentiated flax sub-populations ( F ST = 0.19). Moreover, the utility of gSSRs, EST-SSRs, and the combined sets of SSRs was also evaluated in assessing genetic diversity and population structure in flax. Six outlier loci were identified by at least two neutrality tests showing footprints of balancing selection. After removing the outlier loci, the STRUCTURE analysis and the dendrogram topology of EST-SSRs improved. Conversely, gSSRs and combined SSRs results did not change significantly, possibly as a consequence of the higher number of neutral loci assessed. Taken together, the genetic structure analyses established the superiority of gSSRs to determine the genetic relationships among flax accessions, although the combined SSRs produced the best results. Genetic diversity parameters did not differ statistically ( P > 0.05) between gSSRs and EST-SSRs, an observation partially explained by the similar number of repeat motifs. Our study provides new insights into the ability of gSSRs and EST-SSRs to measure genetic diversity and structure in flax and confirms the importance of testing for the occurrence of outlier loci to properly assess natural and breeding populations, particularly in studies considering only few loci.
Microsatellite markers for the yam bean Pachyrhizus (Fabaceae).

PubMed

Delêtre, Marc; Soengas, Beatriz; Utge, José; Lambourdière, Josie; Sørensen, Marten

2013-07-01

Microsatellite loci were developed for the understudied root crop yam bean (Pachyrhizus spp.) to investigate intraspecific diversity and interspecific relationships within the genus Pachyrhizus. • Seventeen nuclear simple sequence repeat (SSR) markers with perfect di- and trinucleotide repeats were developed from 454 pyrosequencing of SSR-enriched genomic libraries. Loci were characterized in P. ahipa and wild and cultivated populations of four closely related species. All loci successfully cross-amplified and showed high levels of polymorphism, with number of alleles ranging from three to 12 and expected heterozygosity ranging from 0.095 to 0.831 across the genus. • By enabling rapid assessment of genetic diversity in three native neotropical crops, P. ahipa, P. erosus, and P. tuberosus, and two wild relatives, P. ferrugineus and P. panamensis, these markers will allow exploration of the genetic diversity and evolutionary history of the genus Pachyrhizus.
Effect of repeat copy number on variable-number tandem repeat mutations in Escherichia coli O157:H7.

PubMed

Vogler, Amy J; Keys, Christine; Nemoto, Yoshimi; Colman, Rebecca E; Jay, Zack; Keim, Paul

2006-06-01

Variable-number tandem repeat (VNTR) loci have shown a remarkable ability to discriminate among isolates of the recently emerged clonal pathogen Escherichia coli O157:H7, making them a very useful molecular epidemiological tool. However, little is known about the rates at which these sequences mutate, the factors that affect mutation rates, or the mechanisms by which mutations occur at these loci. Here, we measure mutation rates for 28 VNTR loci and investigate the effects of repeat copy number and mismatch repair on mutation rate using in vitro-generated populations for 10 E. coli O157:H7 strains. We find single-locus rates as high as 7.0 x 10(-4) mutations/generation and a combined 28-locus rate of 6.4 x 10(-4) mutations/generation. We observed single- and multirepeat mutations that were consistent with a slipped-strand mispairing mutation model, as well as a smaller number of large repeat copy number mutations that were consistent with recombination-mediated events. Repeat copy number within an array was strongly correlated with mutation rate both at the most mutable locus, O157-10 (r2= 0.565, P = 0.0196), and across all mutating loci. The combined locus model was significant whether locus O157-10 was included (r2= 0.833, P < 0.0001) or excluded (r2= 0.452, P < 0.0001) from the analysis. Deficient mismatch repair did not affect mutation rate at any of the 28 VNTRs with repeat unit sizes of >5 bp, although a poly(G) homomeric tract was destabilized in the mutS strain. Finally, we describe a general model for VNTR mutations that encompasses insertions and deletions, single- and multiple-repeat mutations, and their relative frequencies based upon our empirical mutation rate data.
Seven New Complete Plastome Sequences Reveal Rampant Independent Loss of the ndh Gene Family across Orchids and Associated Instability of the Inverted Repeat/Small Single-Copy Region Boundaries

PubMed Central

Moore, Michael J.; Neubig, Kurt M.; Williams, Norris H.; Whitten, W. Mark; Kim, Joo-Hwan

2015-01-01

Earlier research has revealed that the ndh loci have been pseudogenized, truncated, or deleted from most orchid plastomes sequenced to date, including in all available plastomes of the two most species-rich subfamilies, Orchidoideae and Epidendroideae. This study sought to resolve deeper-level phylogenetic relationships among major orchid groups and to refine the history of gene loss in the ndh loci across orchids. The complete plastomes of seven orchids, Oncidium sphacelatum (Epidendroideae), Masdevallia coccinea (Epidendroideae), Sobralia callosa (Epidendroideae), Sobralia aff. bouchei (Epidendroideae), Elleanthus sodiroi (Epidendroideae), Paphiopedilum armeniacum (Cypripedioideae), and Phragmipedium longifolium (Cypripedioideae) were sequenced and analyzed in conjunction with all other available orchid and monocot plastomes. Most ndh loci were found to be pseudogenized or lost in Oncidium, Paphiopedilum and Phragmipedium, but surprisingly, all ndh loci were found to retain full, intact reading frames in Sobralia, Elleanthus and Masdevallia. Character mapping suggests that the ndh genes were present in the common ancestor of orchids but have experienced independent, significant losses at least eight times across four subfamilies. In addition, ndhF gene loss was correlated with shifts in the position of the junction of the inverted repeat (IR) and small single-copy (SSC) regions. The Orchidaceae have unprecedented levels of homoplasy in ndh gene presence/absence, which may be correlated in part with the unusual life history of orchids. These results also suggest that ndhF plays a role in IR/SSC junction stability. PMID:26558895
Microsatellite markers for the yam bean Pachyrhizus (Fabaceae)1

PubMed Central

Delêtre, Marc; Soengas, Beatriz; Utge, José; Lambourdière, Josie; Sørensen, Marten

2013-01-01

• Premise of the study: Microsatellite loci were developed for the understudied root crop yam bean (Pachyrhizus spp.) to investigate intraspecific diversity and interspecific relationships within the genus Pachyrhizus. • Methods and Results: Seventeen nuclear simple sequence repeat (SSR) markers with perfect di- and trinucleotide repeats were developed from 454 pyrosequencing of SSR-enriched genomic libraries. Loci were characterized in P. ahipa and wild and cultivated populations of four closely related species. All loci successfully cross-amplified and showed high levels of polymorphism, with number of alleles ranging from three to 12 and expected heterozygosity ranging from 0.095 to 0.831 across the genus. • Conclusions: By enabling rapid assessment of genetic diversity in three native neotropical crops, P. ahipa, P. erosus, and P. tuberosus, and two wild relatives, P. ferrugineus and P. panamensis, these markers will allow exploration of the genetic diversity and evolutionary history of the genus Pachyrhizus. PMID:25202568
Geographic patterns of genetic variation in native pecans

USDA-ARS?s Scientific Manuscript database

A structured collection of eighty seedling pecan trees [Carya illinoinensis (Wangenh.) K. Koch] representing nineteen putatively native pecan populations across the species range were evaluated at three plastid and 14 nuclear microsatellite (simple sequence repeat, SSR) loci. Data were analyzed usi...
Chloroplast microsatellite markers for Artocarpus (Moraceae) developed from transcriptome sequences1

PubMed Central

Gardner, Elliot M.; Laricchia, Kristen M.; Murphy, Matthew; Ragone, Diane; Scheffler, Brian E.; Simpson, Sheron; Williams, Evelyn W.; Zerega, Nyree J. C.

2015-01-01

Premise of the study: Chloroplast microsatellite loci were characterized from transcriptomes of Artocarpus altilis (breadfruit) and A. camansi (breadnut). They were tested in A. odoratissimus (terap) and A. altilis and evaluated in silico for two congeners. Methods and Results: Fifteen simple sequence repeats (SSRs) were identified in chloroplast sequences from four Artocarpus transcriptome assemblies. The markers were evaluated using capillary electrophoresis in A. odoratissimus (105 accessions) and A. altilis (73). They were also evaluated in silico in A. altilis (10), A. camansi (6), and A. altilis × A. mariannensis (7) transcriptomes. All loci were polymorphic in at least one species, with all 15 polymorphic in A. camansi. Per species, average alleles per locus ranged between 2.2 and 2.5. Three loci had evidence of fragment-length homoplasy. Conclusions: These markers will complement existing nuclear markers by enabling confident identification of maternal and clone lines, which are often important in vegetatively propagated crops such as breadfruit. PMID:26421253
[Polymorphic loci and polymorphism analysis of short tandem repeats within XNP gene].

PubMed

Liu, Qi-Ji; Gong, Yao-Qin; Guo, Chen-Hong; Chen, Bing-Xi; Li, Jiang-Xia; Guo, Yi-Shou

2002-01-01

To select polymorphic short tandem repeat markers within X-linked nuclear protein (XNP) gene, genomic clones which contain XNP gene were recognized by homologous analysis with XNP cDNA. By comparing the cDNA with genomic DNA, non-exonic sequences were identified, and short tandem repeats were selected from non-exonic sequences by using BCM search Launcher. Polymorphisms of the short tandem repeats in Chinese population were evaluated by PCR amplification and PAGE. Five short tandem repeats were identified from XNP gene, two of which were polymorphic. Four and 11 alleles were observed in Chinese population for XNPSTR1 and XNPSTR4, respectively. Heterozygosities were 47% for XNPSTR1 and 70% for XNPSTR4. XNPSTR1 and XNPSTR4 localized within 3' end and intron 10, respectively. Two polymorphic short tandem repeats have been identified within XNP gene and will be useful for linkage analysis and gene diagnosis of XNP gene.
NGS-based likelihood ratio for identifying contributors in two- and three-person DNA mixtures.

PubMed

Chan Mun Wei, Joshua; Zhao, Zicheng; Li, Shuai Cheng; Ng, Yen Kaow

2018-06-01

DNA fingerprinting, also known as DNA profiling, serves as a standard procedure in forensics to identify a person by the short tandem repeat (STR) loci in their DNA. By comparing the STR loci between DNA samples, practitioners can calculate a probability of match to identity the contributors of a DNA mixture. Most existing methods are based on 13 core STR loci which were identified by the Federal Bureau of Investigation (FBI). Analyses based on these loci of DNA mixture for forensic purposes are highly variable in procedures, and suffer from subjectivity as well as bias in complex mixture interpretation. With the emergence of next-generation sequencing (NGS) technologies, the sequencing of billions of DNA molecules can be parallelized, thus greatly increasing throughput and reducing the associated costs. This allows the creation of new techniques that incorporate more loci to enable complex mixture interpretation. In this paper, we propose a computation for likelihood ratio that uses NGS (next generation sequencing) data for DNA testing on mixed samples. We have applied the method to 4480 simulated DNA mixtures, which consist of various mixture proportions of 8 unrelated whole-genome sequencing data. The results confirm the feasibility of utilizing NGS data in DNA mixture interpretations. We observed an average likelihood ratio as high as 285,978 for two-person mixtures. Using our method, all 224 identity tests for two-person mixtures and three-person mixtures were correctly identified. Copyright © 2018 Elsevier Ltd. All rights reserved.
High levels of heterozygosity found for 15 SSR loci in Solanum chacoense

USDA-ARS?s Scientific Manuscript database

Genetic variation is a necessary prerequisite for improving domesticated plants through breeding; without it, breeding progress would be impossible. Genetic variation can be readily ascertained with co-dominant DNA markers, such as simple sequence repeats (SSRs). Twenty-four SSR markers specifically...
Development and characterization of EST-SSR markers for Begonia luzhaiensis (Begoniaceae)1

PubMed Central

Tseng, Yu-Hsin; Huang, Han-Yau; Xu, Wei-Bin; Yang, Hsun-An; Liu, Yan; Peng, Ching-I; Chung, Kuo-Fang

2017-01-01

Premise of the study: Microsatellite primers were developed for Begonia luzhaiensis (Begoniaceae) to assess genetic diversity and population genetic structure. Methods and Results: Based on the transcriptome data of B. luzhaiensis, 60 primer pairs were selected for initial validation, of which 16 yielded polymorphic microsatellite loci in 57 individuals. The number of alleles observed for these 16 loci ranged from one to nine. The observed and expected heterozygosity ranged from 0.000 to 1.000 and from 0.000 to 0.804 with averages of 0.370 and 0.404, respectively. Five loci could be successfully amplified in B. leprosa. Conclusions: The expressed sequence tag–simple sequence repeat markers are the first specifically developed for B. luzhaiensis and the first developed in Begonia sect. Coelocentrum. These markers will be useful for future studies of the genetic structure and phylogeography of B. luzhaiensis. PMID:28529834
Effect of Repeat Copy Number on Variable-Number Tandem Repeat Mutations in Escherichia coli O157:H7

PubMed Central

Vogler, Amy J.; Keys, Christine; Nemoto, Yoshimi; Colman, Rebecca E.; Jay, Zack; Keim, Paul

2006-01-01

Variable-number tandem repeat (VNTR) loci have shown a remarkable ability to discriminate among isolates of the recently emerged clonal pathogen Escherichia coli O157:H7, making them a very useful molecular epidemiological tool. However, little is known about the rates at which these sequences mutate, the factors that affect mutation rates, or the mechanisms by which mutations occur at these loci. Here, we measure mutation rates for 28 VNTR loci and investigate the effects of repeat copy number and mismatch repair on mutation rate using in vitro-generated populations for 10 E. coli O157:H7 strains. We find single-locus rates as high as 7.0 × 10−4 mutations/generation and a combined 28-locus rate of 6.4 × 10−4 mutations/generation. We observed single- and multirepeat mutations that were consistent with a slipped-strand mispairing mutation model, as well as a smaller number of large repeat copy number mutations that were consistent with recombination-mediated events. Repeat copy number within an array was strongly correlated with mutation rate both at the most mutable locus, O157-10 (r2 = 0.565, P = 0.0196), and across all mutating loci. The combined locus model was significant whether locus O157-10 was included (r2 = 0.833, P < 0.0001) or excluded (r2 = 0.452, P < 0.0001) from the analysis. Deficient mismatch repair did not affect mutation rate at any of the 28 VNTRs with repeat unit sizes of >5 bp, although a poly(G) homomeric tract was destabilized in the mutS strain. Finally, we describe a general model for VNTR mutations that encompasses insertions and deletions, single- and multiple-repeat mutations, and their relative frequencies based upon our empirical mutation rate data. PMID:16740932
Intricate interactions between the bloom-forming cyanobacterium Microcystis aeruginosa and foreign genetic elements, revealed by diversified clustered regularly interspaced short palindromic repeat (CRISPR) signatures.

PubMed

Kuno, Sotaro; Yoshida, Takashi; Kaneko, Takakazu; Sako, Yoshihiko

2012-08-01

Clustered regularly interspaced short palindromic repeats (CRISPR) confer sequence-dependent, adaptive resistance in prokaryotes against viruses and plasmids via incorporation of short sequences, called spacers, derived from foreign genetic elements. CRISPR loci are thus considered to provide records of past infections. To describe the host-parasite (i.e., cyanophages and plasmids) interactions involving the bloom-forming freshwater cyanobacterium Microcystis aeruginosa, we investigated CRISPR in four M. aeruginosa strains and in two previously sequenced genomes. The number of spacers in each locus was larger than the average among prokaryotes. All spacers were strain specific, except for a string of 11 spacers shared in two closely related strains, suggesting diversification of the loci. Using CRISPR repeat-based PCR, 24 CRISPR genotypes were identified in a natural cyanobacterial community. Among 995 unique spacers obtained, only 10 sequences showed similarity to M. aeruginosa phage Ma-LMM01. Of these, six spacers showed only silent or conservative nucleotide mutations compared to Ma-LMM01 sequences, suggesting a strategy by the cyanophage to avert CRISPR immunity dependent on nucleotide identity. These results imply that host-phage interactions can be divided into M. aeruginosa-cyanophage combinations rather than pandemics of population-wide infectious cyanophages. Spacer similarity also showed frequent exposure of M. aeruginosa to small cryptic plasmids that were observed only in a few strains. Thus, the diversification of CRISPR implies that M. aeruginosa has been challenged by diverse communities (almost entirely uncharacterized) of cyanophages and plasmids.
Intricate Interactions between the Bloom-Forming Cyanobacterium Microcystis aeruginosa and Foreign Genetic Elements, Revealed by Diversified Clustered Regularly Interspaced Short Palindromic Repeat (CRISPR) Signatures

PubMed Central

Kuno, Sotaro; Kaneko, Takakazu; Sako, Yoshihiko

2012-01-01

Clustered regularly interspaced short palindromic repeats (CRISPR) confer sequence-dependent, adaptive resistance in prokaryotes against viruses and plasmids via incorporation of short sequences, called spacers, derived from foreign genetic elements. CRISPR loci are thus considered to provide records of past infections. To describe the host-parasite (i.e., cyanophages and plasmids) interactions involving the bloom-forming freshwater cyanobacterium Microcystis aeruginosa, we investigated CRISPR in four M. aeruginosa strains and in two previously sequenced genomes. The number of spacers in each locus was larger than the average among prokaryotes. All spacers were strain specific, except for a string of 11 spacers shared in two closely related strains, suggesting diversification of the loci. Using CRISPR repeat-based PCR, 24 CRISPR genotypes were identified in a natural cyanobacterial community. Among 995 unique spacers obtained, only 10 sequences showed similarity to M. aeruginosa phage Ma-LMM01. Of these, six spacers showed only silent or conservative nucleotide mutations compared to Ma-LMM01 sequences, suggesting a strategy by the cyanophage to avert CRISPR immunity dependent on nucleotide identity. These results imply that host-phage interactions can be divided into M. aeruginosa-cyanophage combinations rather than pandemics of population-wide infectious cyanophages. Spacer similarity also showed frequent exposure of M. aeruginosa to small cryptic plasmids that were observed only in a few strains. Thus, the diversification of CRISPR implies that M. aeruginosa has been challenged by diverse communities (almost entirely uncharacterized) of cyanophages and plasmids. PMID:22636003
Sequences spanning the leader-repeat junction mediate CRISPR adaptation to phage in Streptococcus thermophilus

PubMed Central

Wei, Yunzhou; Chesne, Megan T.; Terns, Rebecca M.; Terns, Michael P.

2015-01-01

CRISPR-Cas systems are RNA-based immune systems that protect prokaryotes from invaders such as phages and plasmids. In adaptation, the initial phase of the immune response, short foreign DNA fragments are captured and integrated into host CRISPR loci to provide heritable defense against encountered foreign nucleic acids. Each CRISPR contains a ∼100–500 bp leader element that typically includes a transcription promoter, followed by an array of captured ∼35 bp sequences (spacers) sandwiched between copies of an identical ∼35 bp direct repeat sequence. New spacers are added immediately downstream of the leader. Here, we have analyzed adaptation to phage infection in Streptococcus thermophilus at the CRISPR1 locus to identify cis-acting elements essential for the process. We show that the leader and a single repeat of the CRISPR locus are sufficient for adaptation in this system. Moreover, we identified a leader sequence element capable of stimulating adaptation at a dormant repeat. We found that sequences within 10 bp of the site of integration, in both the leader and repeat of the CRISPR, are required for the process. Our results indicate that information at the CRISPR leader-repeat junction is critical for adaptation in this Type II-A system and likely other CRISPR-Cas systems. PMID:25589547
The genomic basis of adaptive evolution in threespine sticklebacks

PubMed Central

Jones, Felicity C; Grabherr, Manfred G; Chan, Yingguang Frank; Russell, Pamela; Mauceli, Evan; Johnson, Jeremy; Swofford, Ross; Pirun, Mono; Zody, Michael C; White, Simon; Birney, Ewan; Searle, Stephen; Schmutz, Jeremy; Grimwood, Jane; Dickson, Mark C; Myers, Richard M; Miller, Craig T; Summers, Brian R; Knecht, Anne K; Brady, Shannon D; Zhang, Haili; Pollen, Alex A; Howes, Timothy; Amemiya, Chris; Lander, Eric S; Di Palma, Federica

2012-01-01

Summary Marine stickleback fish have colonized and adapted to innumerable streams and lakes formed since the last ice age, providing an exceptional opportunity to characterize genomic mechanisms underlying repeated ecological adaptation in nature. Here we develop a high quality reference genome assembly for threespine sticklebacks. By sequencing the genomes of 20 additional individuals from a global set of marine and freshwater populations, we identify a genome-wide set of loci that are consistently associated with marine-freshwater divergence. Our results suggest that reuse of globally-shared standing genetic variation, including chromosomal inversions, plays an important role in repeated evolution of distinct marine and freshwater sticklebacks, and in the maintenance of divergent ecotypes during early stages of reproductive isolation. Both coding and regulatory changes occur in the set of loci underlying marine-freshwater evolution, with regulatory changes likely predominating in this classic example of repeated adaptive evolution in nature. PMID:22481358
The genomic basis of adaptive evolution in threespine sticklebacks.

PubMed

Jones, Felicity C; Grabherr, Manfred G; Chan, Yingguang Frank; Russell, Pamela; Mauceli, Evan; Johnson, Jeremy; Swofford, Ross; Pirun, Mono; Zody, Michael C; White, Simon; Birney, Ewan; Searle, Stephen; Schmutz, Jeremy; Grimwood, Jane; Dickson, Mark C; Myers, Richard M; Miller, Craig T; Summers, Brian R; Knecht, Anne K; Brady, Shannon D; Zhang, Haili; Pollen, Alex A; Howes, Timothy; Amemiya, Chris; Baldwin, Jen; Bloom, Toby; Jaffe, David B; Nicol, Robert; Wilkinson, Jane; Lander, Eric S; Di Palma, Federica; Lindblad-Toh, Kerstin; Kingsley, David M

2012-04-04

Marine stickleback fish have colonized and adapted to thousands of streams and lakes formed since the last ice age, providing an exceptional opportunity to characterize genomic mechanisms underlying repeated ecological adaptation in nature. Here we develop a high-quality reference genome assembly for threespine sticklebacks. By sequencing the genomes of twenty additional individuals from a global set of marine and freshwater populations, we identify a genome-wide set of loci that are consistently associated with marine-freshwater divergence. Our results indicate that reuse of globally shared standing genetic variation, including chromosomal inversions, has an important role in repeated evolution of distinct marine and freshwater sticklebacks, and in the maintenance of divergent ecotypes during early stages of reproductive isolation. Both coding and regulatory changes occur in the set of loci underlying marine-freshwater evolution, but regulatory changes appear to predominate in this well known example of repeated adaptive evolution in nature.
Multilocus variable-number tandem repeat analysis for molecular typing and phylogenetic analysis of Shigella flexneri

PubMed Central

2009-01-01

Background Shigella flexneri is one of the causative agents of shigellosis, a major cause of childhood mortality in developing countries. Multilocus variable-number tandem repeat (VNTR) analysis (MLVA) is a prominent subtyping method to resolve closely related bacterial isolates for investigation of disease outbreaks and provide information for establishing phylogenetic patterns among isolates. The present study aimed to develop an MLVA method for S. flexneri and the VNTR loci identified were tested on 242 S. flexneri isolates to evaluate their variability in various serotypes. The isolates were also analyzed by pulsed-field gel electrophoresis (PFGE) to compare the discriminatory power and to evaluate the usefulness of MLVA as a tool for phylogenetic analysis of S. flexneri. Results Thirty-six VNTR loci were identified by exploring the repeat sequence loci in genomic sequences of Shigella species and by testing the loci on nine isolates of different subserotypes. The VNTR loci in different serotype groups differed greatly in their variability. The discriminatory power of an MLVA assay based on four most variable VNTR loci was higher, though not significantly, than PFGE for the total isolates, a panel of 2a isolates, which were relatively diverse, and a panel of 4a/Y isolates, which were closely-related. Phylogenetic groupings based on PFGE patterns and MLVA profiles were considerably concordant. The genetic relationships among the isolates were correlated with serotypes. The phylogenetic trees constructed using PFGE patterns and MLVA profiles presented two distinct clusters for the isolates of serotype 3 and one distinct cluster for each of the serotype groups, 1a/1b/NT, 2a/2b/X/NT, 4a/Y, and 6. Isolates that had different serotypes but had closer genetic relatedness than those with the same serotype were observed between serotype Y and subserotype 4a, serotype X and subserotype 2b, subserotype 1a and 1b, and subserotype 3a and 3b. Conclusions The 36 VNTR loci identified exhibited considerably different degrees of variability among S. flexneri serotype groups. VNTR locus could be highly variable in a serotype but invariable in others. MLVA assay based on four highly variable loci could display a comparable resolving power to PFGE in discriminating isolates. MLVA is also a prominent molecular tool for phylogenetic analysis of S. flexneri; the resulting data are beneficial to establish clear clonal patterns among different serotype groups and to discern clonal groups among isolates within the same serotype. As highly variable VNTR loci could be serotype-specific, a common MLVA protocol that consists of only a small set of loci, for example four to eight loci, and that provides high resolving power to all S. flexneri serotypes may not be obtainable. PMID:20042119
Isolation and Characterization of Eleven Polymorphic Microsatellite Loci for the Valuable Medicinal Plant Dendrobium huoshanense and Cross-Species Amplification

PubMed Central

Wang, Hui; Chen, Nai-Fu; Zheng, Ji-Yang; Wang, Wen-Cai; Pei, Yun-Yun; Zhu, Guo-Ping

2012-01-01

Dendrobium huoshanense (Orchidaceae) is a perennial herb and a widely used medicinal plant in Traditional Chinese medicine (TCM) endemic to Huoshan County town in Anhui province in Southeast China. A microsatellite-enriched genomic DNA library of D. huoshanense was developed and screened to identify marker loci. Eleven polymorphic loci were isolated and analyzed by screening 25 individuals collected from a natural population. The number of alleles per locus ranged from 2 to 5. The observed and expected heterozygosities ranged from 0.227 to 0.818 and from 0.317 to 0.757, respectively. Two loci showed significant deviations from Hardy-Weinberg equilibrium and four of the pairwise comparisons of loci revealed linkage disequilibrium (p < 0.05). These microsatellite loci were cross-amplified for five congeneric species and seven loci can be amplified in all species. These simple sequence repeats (SSR) markers are useful in genetic studies of D. huoshanense and other related species and in conservation decision-making. PMID:23222682

Investigation of a Quadruplex-Forming Repeat Sequence Highly Enriched in Xanthomonas and Nostoc sp.

PubMed

Rehm, Charlotte; Wurmthaler, Lena A; Li, Yuanhao; Frickey, Tancred; Hartig, Jörg S

2015-01-01

In prokaryotes simple sequence repeats (SSRs) with unit sizes of 1-5 nucleotides (nt) are causative for phase and antigenic variation. Although an increased abundance of heptameric repeats was noticed in bacteria, reports about SSRs of 6-9 nt are rare. In particular G-rich repeat sequences with the propensity to fold into G-quadruplex (G4) structures have received little attention. In silico analysis of prokaryotic genomes show putative G4 forming sequences to be abundant. This report focuses on a surprisingly enriched G-rich repeat of the type GGGNATC in Xanthomonas and cyanobacteria such as Nostoc. We studied in detail the genomes of Xanthomonas campestris pv. campestris ATCC 33913 (Xcc), Xanthomonas axonopodis pv. citri str. 306 (Xac), and Nostoc sp. strain PCC7120 (Ana). In all three organisms repeats are spread all over the genome with an over-representation in non-coding regions. Extensive variation of the number of repetitive units was observed with repeat numbers ranging from two up to 26 units. However a clear preference for four units was detected. The strong bias for four units coincides with the requirement of four consecutive G-tracts for G4 formation. Evidence for G4 formation of the consensus repeat sequences was found in biophysical studies utilizing CD spectroscopy. The G-rich repeats are preferably located between aligned open reading frames (ORFs) and are under-represented in coding regions or between divergent ORFs. The G-rich repeats are preferentially located within a distance of 50 bp upstream of an ORF on the anti-sense strand or within 50 bp from the stop codon on the sense strand. Analysis of whole transcriptome sequence data showed that the majority of repeat sequences are transcribed. The genetic loci in the vicinity of repeat regions show increased genomic stability. In conclusion, we introduce and characterize a special class of highly abundant and wide-spread quadruplex-forming repeat sequences in bacteria.
Investigation of a Quadruplex-Forming Repeat Sequence Highly Enriched in Xanthomonas and Nostoc sp.

PubMed Central

Rehm, Charlotte; Wurmthaler, Lena A.; Li, Yuanhao; Frickey, Tancred; Hartig, Jörg S.

2015-01-01

In prokaryotes simple sequence repeats (SSRs) with unit sizes of 1–5 nucleotides (nt) are causative for phase and antigenic variation. Although an increased abundance of heptameric repeats was noticed in bacteria, reports about SSRs of 6–9 nt are rare. In particular G-rich repeat sequences with the propensity to fold into G-quadruplex (G4) structures have received little attention. In silico analysis of prokaryotic genomes show putative G4 forming sequences to be abundant. This report focuses on a surprisingly enriched G-rich repeat of the type GGGNATC in Xanthomonas and cyanobacteria such as Nostoc. We studied in detail the genomes of Xanthomonas campestris pv. campestris ATCC 33913 (Xcc), Xanthomonas axonopodis pv. citri str. 306 (Xac), and Nostoc sp. strain PCC7120 (Ana). In all three organisms repeats are spread all over the genome with an over-representation in non-coding regions. Extensive variation of the number of repetitive units was observed with repeat numbers ranging from two up to 26 units. However a clear preference for four units was detected. The strong bias for four units coincides with the requirement of four consecutive G-tracts for G4 formation. Evidence for G4 formation of the consensus repeat sequences was found in biophysical studies utilizing CD spectroscopy. The G-rich repeats are preferably located between aligned open reading frames (ORFs) and are under-represented in coding regions or between divergent ORFs. The G-rich repeats are preferentially located within a distance of 50 bp upstream of an ORF on the anti-sense strand or within 50 bp from the stop codon on the sense strand. Analysis of whole transcriptome sequence data showed that the majority of repeat sequences are transcribed. The genetic loci in the vicinity of repeat regions show increased genomic stability. In conclusion, we introduce and characterize a special class of highly abundant and wide-spread quadruplex-forming repeat sequences in bacteria. PMID:26695179
Analysis of genetic diversity and population structure of oil palm (Elaeis guineensis) from China and Malaysia based on species-specific simple sequence repeat markers.

PubMed

Zhou, L X; Xiao, Y; Xia, W; Yang, Y D

2015-12-08

Genetic diversity and patterns of population structure of the 94 oil palm lines were investigated using species-specific simple sequence repeat (SSR) markers. We designed primers for 63 SSR loci based on their flanking sequences and conducted amplification in 94 oil palm DNA samples. The amplification result showed that a relatively high level of genetic diversity was observed between oil palm individuals according a set of 21 polymorphic microsatellite loci. The observed heterozygosity (Ho) was 0.3683 and 0.4035, with an average of 0.3859. The Ho value was a reliable determinant of the discriminatory power of the SSR primer combinations. The principal component analysis and unweighted pair-group method with arithmetic averaging cluster analysis showed the 94 oil palm lines were grouped into one cluster. These results demonstrated that the oil palm in Hainan Province of China and the germplasm introduced from Malaysia may be from the same source. The SSR protocol was effective and reliable for assessing the genetic diversity of oil palm. Knowledge of the genetic diversity and population structure will be crucial for establishing appropriate management stocks for this species.
Mature clustered, regularly interspaced, short palindromic repeats RNA (crRNA) length is measured by a ruler mechanism anchored at the precursor processing site.

PubMed

Hatoum-Aslan, Asma; Maniv, Inbal; Marraffini, Luciano A

2011-12-27

Precise RNA processing is fundamental to all small RNA-mediated interference pathways. In prokaryotes, clustered, regularly interspaced, short palindromic repeats (CRISPR) loci encode small CRISPR RNAs (crRNAs) that protect against invasive genetic elements by antisense targeting. CRISPR loci are transcribed as a long precursor that is cleaved within repeat sequences by CRISPR-associated (Cas) proteins. In many organisms, this primary processing generates crRNA intermediates that are subject to additional nucleolytic trimming to render mature crRNAs of specific lengths. The molecular mechanisms underlying this maturation event remain poorly understood. Here, we defined the genetic requirements for crRNA primary processing and maturation in Staphylococcus epidermidis. We show that changes in the position of the primary processing site result in extended or diminished maturation to generate mature crRNAs of constant length. These results indicate that crRNA maturation occurs by a ruler mechanism anchored at the primary processing site. We also show that maturation is mediated by specific cas genes distinct from those genes involved in primary processing, showing that this event is directed by CRISPR/Cas loci.
Virulence Phenotypes and Molecular Genotypes of Puccinia triticina Isolates from Italy

USDA-ARS?s Scientific Manuscript database

Twenty-four isolates of Puccinia triticina from Italy were characterized for virulence to seedlings of 22 common wheat cv. Thatcher isolines each with a different leaf rust resistance gene, and for molecular genotypes at 15 simple sequence repeat (SSR) loci. The isolates were compared with a set of ...
Ancient DNA in human bone remains from Pompeii archaeological site.

PubMed

Cipollaro, M; Di Bernardo, G; Galano, G; Galderisi, U; Guarino, F; Angelini, F; Cascino, A

1998-06-29

aDNA extraction and amplification procedures have been optimized for Pompeian human bone remains whose diagenesis has been determined by histological analysis. Single copy genes amplification (X and Y amelogenin loci and Y specific alphoid repeat sequences) have been performed and compared with anthropometric data on sexing.
Fusarium head blight resistance loci in a stratified population of wheat landraces and varieties

USDA-ARS?s Scientific Manuscript database

To determine if Chinese and Japanese wheat landraces and varieties have unique sources of Fusarium head blight (FHB) resistance, an association mapping panel of 195 wheat accessions including both commercial varieties and landraces was genotyped with 364 genome-wide simple sequence repeat (SSR) and ...
Microsatellite loci in Vallisneria natans (Hydrocharitaceae) and cross-reactivity with V. spinulosa and V. denseserrulata.

PubMed

Wang, Bin; Liao, Hui; Zhao, Yao; Li, Wei; Song, Zhiping

2011-03-01

Microsatellite primers were characterized in Vallisneria natans, a dominant submerged macrophyte occurring in freshwater bodies of tropical and subtropical zones. Using the Microsatellite Sequence Enrichment protocol, 16 novel polymorphic codominant loci were developed and characterized in V. natans. In addition to these, six existing microsatellite loci from V. spinulosa were successfully amplified and characterized for V. natans. These primers amplified di- and trinucleotide repeats with 2-7 alleles per locus. Most primers also amplified successfully in V. spinulosa and V. denseserrulata. These results indicate the utility of primers in V. natans for future studies of population genetic structure, as well as their applicability across the genus.
The complete chloroplast DNA sequence of Eleutherococcus senticosus (Araliaceae); comparative evolutionary analyses with other three asterids.

PubMed

Yi, Dong-Keun; Lee, Hae-Lim; Sun, Byung-Yun; Chung, Mi Yoon; Kim, Ki-Joong

2012-05-01

This study reports the complete chloroplast (cp) DNA sequence of Eleutherococcus senticosus (GenBank: JN 637765), an endangered endemic species. The genome is 156,768 bp in length, and contains a pair of inverted repeat (IR) regions of 25,930 bp each, a large single copy (LSC) region of 86,755 bp and a small single copy (SSC) region of 18,153 bp. The structural organization, gene and intron contents, gene order, AT content, codon usage, and transcription units of the E. senticosus chloroplast genome are similar to that of typical land plant cp DNA. We aligned and analyzed the sequences of 86 coding genes, 19 introns and 113 intergenic spacers (IGS) in three different taxonomic hierarchies; Eleutherococcus vs. Panax, Eleutherococcus vs. Daucus, and Eleutherococcus vs. Nicotiana. The distribution of indels, the number of polymorphic sites and nucleotide diversity indicate that positional constraint is more important than functional constraint for the evolution of cp genome sequences in Asterids. For example, the intron sequences in the LSC region exhibited base substitution rates 5-11-times higher than that of the IR regions, while the intron sequences in the SSC region evolved 7-14-times faster than those in the IR region. Furthermore, the Ka/Ks ratio of the gene coding sequences supports a stronger evolutionary constraint in the IR region than in the LSC or SSC regions. Therefore, our data suggest that selective sweeps by base collection mechanisms more frequently eliminate polymorphisms in the IR region than in other regions. Chloroplast genome regions that have high levels of base substitutions also show higher incidences of indels. Thirty-five simple sequence repeat (SSR) loci were identified in the Eleutherococcus chloroplast genome. Of these, 27 are homopolymers, while six are di-polymers and two are tri-polymers. In addition to the SSR loci, we also identified 18 medium size repeat units ranging from 22 to 79 bp, 11 of which are distributed in the IGS or intron regions. These medium size repeats may contribute to developing a cp genome-specific gene introduction vector because the region may use for specific recombination sites.
CRISPR-cas loci profiling of Cronobacter sakazakii pathovars.

PubMed

Ogrodzki, Pauline; Forsythe, Stephen James

2016-12-01

Cronobacter sakazakii sequence types 1, 4, 8 and 12 are associated with outbreaks of neonatal meningitis and necrotizing enterocolitis infections. However clonality results in strains which are indistinguishable using conventional methods. This study investigated the use of clustered regularly interspaced short palindromic repeats (CRISPR)-cas loci profiling for epidemiological investigations. Seventy whole genomes of C. sakazakii strains from four clonal complexes which were widely distributed temporally, geographically and origin of source were profiled. All strains encoded the same type I-E subtype CRISPR-cas system with a total of 12 different CRISPR spacer arrays. This study demonstrated the greater discriminatory power of CRISPR spacer array profiling compared with multilocus sequence typing, which will be of use in source attribution during Cronobacter outbreak investigations.
Development, characterization and cross species amplification of polymorphic microsatellite markers from expressed sequence tags of turmeric (Curcuma longa L.).

PubMed

Siju, S; Dhanya, K; Syamkumar, S; Sasikumar, B; Sheeja, T E; Bhat, A I; Parthasarathy, V A

2010-02-01

Expressed sequence tags (ESTs) from turmeric (Curcuma longa L.) were used for the screening of type and frequency of Class I (hypervariable) simple sequence repeats (SSRs). A total of 231 microsatellite repeats were detected from 12,593 EST sequences of turmeric after redundancy elimination. The average density of Class I SSRs accounts to one SSR per 17.96 kb of EST. Mononucleotides were the most abundant class of microsatellite repeat in turmeric ESTs followed by trinucleotides. A robust set of 17 polymorphic EST-SSRs were developed and used for evaluating 20 turmeric accessions. The number of alleles detected ranged from 3 to 8 per loci. The developed markers were also evaluated in 13 related species of C. longa confirming high rate (100%) of cross species transferability. The polymorphic microsatellite markers generated from this study could be used for genetic diversity analysis and resolving the taxonomic confusion prevailing in the genus.
Complete Chloroplast Genome Sequence of Coptis chinensis Franch. and Its Evolutionary History

PubMed Central

He, Yang; Deng, Cao; Fan, Gang; Qin, Shishang

2017-01-01

The Coptis chinensis Franch. is an important medicinal plant from the Ranunculales. We used next generation sequencing technology to determine the complete chloroplast genome of C. chinensis. This genome is 155,484 bp long with 38.17% GC content. Two 26,758 bp long inverted repeats separated the genome into a typical quadripartite structure. The C. chinensis chloroplast genome consists of 128 gene loci, including eight rRNA gene loci, 28 tRNA gene loci, and 92 protein-coding gene loci. Most of the SSRs in C. chinensis are poly-A/T. The numbers of mononucleotide SSRs in C. chinensis and other Ranunculaceae species are fewer than those in Berberidaceae species, while the number of dinucleotide SSRs is greater than that in the Berberidaceae. C. chinensis diverged from other Ranunculaceae species an estimated 81 million years ago (Mya). The divergence between Ranunculaceae and Berberidaceae was ~111 Mya, while the Ranunculales and Magnoliaceae shared a common ancestor during the Jurassic, ~153 Mya. Position 104 of the C. chinensis ndhG protein was identified as a positively selected site, indicating possible selection for the photosystem-chlororespiration system in C. chinensis. In summary, the complete sequencing and annotation of the C. chinensis chloroplast genome will facilitate future studies on this important medicinal species. PMID:28698879
Three novel polymorphic microsatellite markers for the glaucoma locus GLC1B by datamining tetranucleotide repeats on chromosome 2p12-q12

PubMed Central

2009-01-01

In order to identify new markers around the glaucoma locus GLC1B as a tool to refine its critical region at 2p11.2-2q11.2, we searched the critical region sequence obtained from the UCSC database for tetranucleotide (GATA)n and (GTCT)n repeats of at least 10 units in length. Three out of four potential microsatellite loci were found to be polymorphic, heterozygosity ranging from 64.56% to 79.59%. The identified markers are useful not only for GLC1B locus but also for the study of other disease loci at 2p11.2-2q11.2, a region with scarcity of microsatellite markers. PMID:21637444
[Reticulate evolution of parthenogenetic species of the Lacertidae rock lizards: inheritance of CLsat tandem repeats and anonymous RAPD markers].

PubMed

Chobanu, D; Rudykh, I A; Riabinina, N L; Grechko, V V; Kramerov, D A; Darevskiĭ, I S

2002-01-01

The genetic relatedness of several bisexual and of four unisexual "Lacerta saxicola complex" lizards was studied, using monomer sequences of the complex-specific CLsat tandem repeats and anonymous RAPD markers. Genomes of parthenospecies were shown to include different satellite monomers. The structure of each such monomer is specific for a certain pair of bisexual species. This fact might be interpreted in favor of co-dominant inheritance of these markers in bisexual species hybridogenesis. This idea is supported by the results obtained with RAPD markers; i.e., unisexual species genomes include only the loci characteristic of certain bisexual species. At the same time, in neither case parthenospecies possess specific, autoapomorphic loci that were not present in this or that bisexual species.
RNA polymerase V-dependent small RNAs in Arabidopsis originate from small, intergenic loci including most SINE repeats.

PubMed

Lee, Tzuu-fen; Gurazada, Sai Guna Ranjan; Zhai, Jixian; Li, Shengben; Simon, Stacey A; Matzke, Marjori A; Chen, Xuemei; Meyers, Blake C

2012-07-01

In plants, heterochromatin is maintained by a small RNA-based gene silencing mechanism known as RNA-directed DNA methylation (RdDM). RdDM requires the non-redundant functions of two plant-specific DNA-dependent RNA polymerases (RNAP), RNAP IV and RNAP V. RNAP IV plays a major role in siRNA biogenesis, while RNAP V may recruit DNA methylation machinery to target endogenous loci for silencing. Although small RNA-generating regions that are dependent on both RNAP IV and RNAP V have been identified previously, the genomic loci targeted by RNAP V for siRNA accumulation and silencing have not been described extensively. To characterize the RNAP V-dependent, heterochromatic siRNA-generating regions in the Arabidopsis genome, we deeply sequenced the small RNA populations of wild-type and RNAP V null mutant (nrpe1) plants. Our results showed that RNAP V-dependent siRNA-generating loci are associated predominately with short repetitive sequences in intergenic regions. Suppression of small RNA production from short repetitive sequences was also prominent in RdDM mutants including dms4, drd1, dms3 and rdm1, reflecting the known association of these RdDM effectors with RNAP V. The genomic regions targeted by RNAP V were small, with an estimated average length of 238 bp. Our results suggest that RNAP V affects siRNA production from genomic loci with features dissimilar to known RNAP IV-dependent loci. RNAP V, along with RNAP IV and DRM1/2, may target and silence a set of small, intergenic transposable elements located in dispersed genomic regions for silencing. Silencing at these loci may be actively reinforced by RdDM.
In silico mapping of quantitative trait loci in maize.

PubMed

Parisseaux, B; Bernardo, R

2004-08-01

Quantitative trait loci (QTL) are most often detected through designed mapping experiments. An alternative approach is in silico mapping, whereby genes are detected using existing phenotypic and genomic databases. We explored the usefulness of in silico mapping via a mixed-model approach in maize (Zea mays L.). Specifically, our objective was to determine if the procedure gave results that were repeatable across populations. Multilocation data were obtained from the 1995-2002 hybrid testing program of Limagrain Genetics in Europe. Nine heterotic patterns comprised 22,774 single crosses. These single crosses were made from 1,266 inbreds that had data for 96 simple sequence repeat (SSR) markers. By a mixed-model approach, we estimated the general combining ability effects associated with marker alleles in each heterotic pattern. The numbers of marker loci with significant effects--37 for plant height, 24 for smut [Ustilago maydis (DC.) Cda.] resistance, and 44 for grain moisture--were consistent with previous results from designed mapping experiments. Each trait had many loci with small effects and few loci with large effects. For smut resistance, a marker in bin 8.05 on chromosome 8 had a significant effect in seven (out of a maximum of 18) instances. For this major QTL, the maximum effect of an allele substitution ranged from 5.4% to 41.9%, with an average of 22.0%. We conclude that in silico mapping via a mixed-model approach can detect associations that are repeatable across different populations. We speculate that in silico mapping will be more useful for gene discovery than for selection in plant breeding programs. Copyright 2004 Springer-Verlag
Investigation into the sequence structure of 23 Y chromosomal STR loci using massively parallel sequencing.

PubMed

Kwon, So Yeun; Lee, Hwan Young; Kim, Eun Hye; Lee, Eun Young; Shin, Kyoung-Jin

2016-11-01

Next-generation sequencing (NGS) can produce massively parallel sequencing (MPS) data for many targeted regions with a high depth of coverage, suggesting its successful application to the amplicons of forensic genetic markers. In the present study, we evaluated the practical utility of MPS in Y-chromosome short tandem repeat (Y-STR) analysis using a multiplex polymerase chain reaction (PCR) system. The multiplex PCR system simultaneously amplified 24 Y-chromosomal markers, including the PowerPlex ® Y23 loci (DYS19, DYS385ab, DYS389I, DYS389II, DYS390, DYS391, DYS392, DYS393, DYS437, DYS438, DYS439, DYS448, DYS456, DYS458, DYS481, DYS533, DYS549, DYS570, DYS576, DYS635, DYS643, and YGATAH4) and the M175 marker with the small-sized amplicons ranging from 85 to 253bp. The barcoded libraries for the amplicons of the 24 Y-chromosomal markers were produced using a simplified PCR-based library preparation method and successfully sequenced using MPS on a MiSeq ® System with samples from 250 unrelated Korean males. The genotyping concordance between MPS and the capillary electrophoresis (CE) method, as well as the sequence structure of the 23 Y-STRs, were investigated. Three samples exhibited discordance between the MPS and CE results at DYS385, DYS439, and DYS576. There were 12 Y-STR loci that showed sequence variations in the alleles by a fragment size determination, and the most varied alleles occurred in DYS389II with a different sequence structure in the repeat region. The largest increase in gene diversity between the CE and MPS results was in DYS437 at +34.41%. Single nucleotide polymorphisms (SNPs), insertions, and deletions (indels) were observed in the flanking regions of DYS481, DYS576, and DYS385, respectively. Stutter and noise ratios of the 23 Y-STRs using the developed MPS system were also investigated. Based on these results, the MPS analysis system used in this study could facilitate the investigation into the sequences of the 23 Y-STRs in forensic genetics laboratories. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
Development of 23 novel polymorphic EST-SSR markers for the endangered relict conifer Metasequoia glyptostroboides.

PubMed

Jin, Yuqing; Bi, Quanxin; Guan, Wenbin; Mao, Jian-Feng

2015-09-01

Metasequoia glyptostroboides is an endangered relict conifer species endemic to China. In this study, expressed sequence tag-simple sequence repeat (EST-SSR) markers were developed using transcriptome mining for future genetic and functional studies. We collected 97,565 unigene sequences generated by 454 pyrosequencing. A bioinformatics analysis identified 2087 unique and putative microsatellites, from which 96 novel microsatellite markers were developed. Fifty-three of the 96 primer sets successfully amplified clear fragments of the expected sizes; 23 of those loci were polymorphic. The number of alleles per locus ranged from two to eight, with an average of three, and the observed and expected heterozygosity values ranged from 0 to 1.0 and 0.117 to 0.813, respectively. These microsatellite loci will enrich the genetic resources to develop functional studies and conservation strategies for this endangered relict species.
Development of 23 novel polymorphic EST-SSR markers for the endangered relict conifer Metasequoia glyptostroboides1

PubMed Central

Jin, Yuqing; Bi, Quanxin; Guan, Wenbin; Mao, Jian-Feng

2015-01-01

Premise of the study: Metasequoia glyptostroboides is an endangered relict conifer species endemic to China. In this study, expressed sequence tag–simple sequence repeat (EST-SSR) markers were developed using transcriptome mining for future genetic and functional studies. Methods and Results: We collected 97,565 unigene sequences generated by 454 pyrosequencing. A bioinformatics analysis identified 2087 unique and putative microsatellites, from which 96 novel microsatellite markers were developed. Fifty-three of the 96 primer sets successfully amplified clear fragments of the expected sizes; 23 of those loci were polymorphic. The number of alleles per locus ranged from two to eight, with an average of three, and the observed and expected heterozygosity values ranged from 0 to 1.0 and 0.117 to 0.813, respectively. Conclusions: These microsatellite loci will enrich the genetic resources to develop functional studies and conservation strategies for this endangered relict species. PMID:26421250
Gene-enriched draft genome of the cattle tick Rhipicephalus microplus: assembly by the hybrid Pacific Biosciences/Illumina approach enabled analysis of the highly repetitive genome.

PubMed

Barrero, Roberto A; Guerrero, Felix D; Black, Michael; McCooke, John; Chapman, Brett; Schilkey, Faye; Pérez de León, Adalberto A; Miller, Robert J; Bruns, Sara; Dobry, Jason; Mikhaylenko, Galina; Stormo, Keith; Bell, Callum; Tao, Quanzhou; Bogden, Robert; Moolhuijzen, Paula M; Hunter, Adam; Bellgard, Matthew I

2017-08-01

The genome of the cattle tick Rhipicephalus microplus, an ectoparasite with global distribution, is estimated to be 7.1Gbp in length and consists of approximately 70% repetitive DNA. We report the draft assembly of a tick genome that utilized a hybrid sequencing and assembly approach to capture the repetitive fractions of the genome. Our hybrid approach produced an assembly consisting of 2.0Gbp represented in 195,170 scaffolds with a N50 of 60,284bp. The Rmi v2.0 assembly is 51.46% repetitive with a large fraction of unclassified repeats, short interspersed elements, long interspersed elements and long terminal repeats. We identified 38,827 putative R. microplus gene loci, of which 24,758 were protein coding genes (≥100 amino acids). OrthoMCL comparative analysis against 11 selected species including insects and vertebrates identified 10,835 and 3,423 protein coding gene loci that are unique to R. microplus or common to both R. microplus and Ixodes scapularis ticks, respectively. We identified 191 microRNA loci, of which 168 have similarity to known miRNAs and 23 represent novel miRNA families. We identified the genomic loci of several highly divergent R. microplus esterases with sequence similarity to acetylcholinesterase. Additionally we report the finding of a novel cytochrome P450 CYP41 homolog that shows similar protein folding structures to known CYP41 proteins known to be involved in acaricide resistance. Copyright © 2017 Australian Society for Parasitology. Published by Elsevier Ltd. All rights reserved.

Multilocus Variable-Number Tandem Repeat Typing of Mycobacterium ulcerans

PubMed Central

Ablordey, Anthony; Swings, Jean; Hubans, Christine; Chemlal, Karim; Locht, Camille; Portaels, Françoise; Supply, Philip

2005-01-01

The apparent genetic homogeneity of Mycobacterium ulcerans contributes to the poorly understood epidemiology of M. ulcerans infection. Here, we report the identification of variable number tandem repeat (VNTR) sequences as novel polymorphic elements in the genome of this species. A total of 19 potential VNTR loci identified in the closely related M. marinum genome sequence were screened in a collection of 23 M. ulcerans isolates, one Mycobacterium species referred to here as an intermediate species, and five M. marinum strains. Nine of the 19 loci were polymorphic in the three species (including the intermediate species) and revealed eight M. ulcerans and five M. marinum genotypes. The results from the VNTR analysis corroborated the genetic relationships of M. ulcerans isolates from various geographical origins, as defined by independent molecular markers. Although these results further highlight the extremely high clonal homogeneity within certain geographic regions, we report for the first time the discrimination of the two South American strains from Surinam and French Guyana. These findings support the potential of a VNTR-based genotyping method for strain discrimination within M. ulcerans and M. marinum. PMID:15814964
Intraspecific and heteroplasmic variations, gene losses and inversions in the chloroplast genome of Astragalus membranaceus.

PubMed

Lei, Wanjun; Ni, Dapeng; Wang, Yujun; Shao, Junjie; Wang, Xincun; Yang, Dan; Wang, Jinsheng; Chen, Haimei; Liu, Chang

2016-02-22

Astragalus membranaceus is an important medicinal plant in Asia. Several of its varieties have been used interchangeably as raw materials for commercial production. High resolution genetic markers are in urgent need to distinguish these varieties. Here, we sequenced and analyzed the chloroplast genome of A. membranaceus (Fisch.) Bunge var. mongholicus (Bunge) P.K. Hsiao using the next generation DNA sequencing technology. The genome was assembled using Abyss and then subjected to gene prediction using CPGAVAS and repeat analysis using MISA, Tandem Repeats Finder, and REPuter. Finally, the genome was subjected phylogenetic and comparative genomic analyses. The complete genome is 123,582 bp long, containing only one copy of the inverted repeat. Gene prediction revealed 110 genes encoding 76 proteins, 30 tRNAs, and four rRNAs. Five intra-specific hypermutation loci were identified, three of which are heteroplasmic. Furthermore, three gene losses and two large inversions were identified. Comparative genomic analyses demonstrated the dynamic nature of the Papilionoideae chloroplast genomes, which showed occurrence of numerous hypermutation loci, frequent gene losses, and fragment inversions. Results obtained herein elucidate the complex evolutionary history of chloroplast genomes and have laid the foundation for the identification of genetic markers to distinguish A. membranaceus varieties.
Molecular Cytogenetic Analysis of Deschampsia antarctica Desv. (Poaceae), Maritime Antarctic.

PubMed

Amosova, Alexandra V; Bolsheva, Nadezhda L; Samatadze, Tatiana E; Twardovska, Maryana O; Zoshchuk, Svyatoslav A; Andreev, Igor O; Badaeva, Ekaterina D; Kunakh, Viktor A; Muravenko, Olga V

2015-01-01

Deschampsia antarctica Desv. (Poaceae) (2n = 26) is one of the two vascular plants adapted to the harshest environment of the Antarctic. Although the species is a valuable model for study of environmental stress tolerance in plants, its karyotype is still poorly investigated. We firstly conducted a comprehensive molecular cytogenetic analysis of D. antarctica collected on four islands of the Maritime Antarctic. D. antarctica karyotypes were studied by Giemsa C- and DAPI/C-banding, Ag-NOR staining, multicolour fluorescence in situ hybridization with repeated DNA probes (pTa71, pTa794, telomere repeats, pSc119.2, pAs1) and the GAA simple sequence repeat probe. We also performed sequential rapid in situ hybridization with genomic DNA of D. caespitosa. Two chromosome pairs bearing transcriptionally active 45S rDNA loci and five pairs with 5S rDNA sites were detected. A weak intercalary site of telomere repeats was revealed on the largest chromosome in addition to telomere hybridization signals at terminal positions. This fact confirms indirectly the hypothesis that chromosome fusion might have been the cause of the unusual for cereals chromosome number in this species. Based on patterns of distribution of the examined molecular cytogenetic markers, all chromosomes in karyotypes were identified, and chromosome idiograms of D. antarctica were constructed. B chromosomes were found in most karyotypes of plants from Darboux Island. A mixoploid plant with mainly triploid cells bearing a Robertsonian rearrangement was detected among typical diploid specimens from Great Jalour Island. The karyotype variability found in D. antarctica is probably an expression of genome instability induced by environmental stress factors. The differences in C-banding patterns and in chromosome distribution of rDNA loci as well as homologous highly repeated DNA sequences detected between genomes of D. antarctica and its related species D. caespitosa indicate that genome reorganization involving coding and noncoding repeated DNA sequences had occurred during the divergence of these species.
Molecular Cytogenetic Analysis of Deschampsia antarctica Desv. (Poaceae), Maritime Antarctic

PubMed Central

Amosova, Alexandra V.; Bolsheva, Nadezhda L.; Samatadze, Tatiana E.; Twardovska, Maryana O.; Zoshchuk, Svyatoslav A.; Andreev, Igor O.; Badaeva, Ekaterina D.; Kunakh, Viktor A.; Muravenko, Olga V.

2015-01-01

Deschampsia antarctica Desv. (Poaceae) (2n = 26) is one of the two vascular plants adapted to the harshest environment of the Antarctic. Although the species is a valuable model for study of environmental stress tolerance in plants, its karyotype is still poorly investigated. We firstly conducted a comprehensive molecular cytogenetic analysis of D. antarctica collected on four islands of the Maritime Antarctic. D. antarctica karyotypes were studied by Giemsa C- and DAPI/C-banding, Ag-NOR staining, multicolour fluorescence in situ hybridization with repeated DNA probes (pTa71, pTa794, telomere repeats, pSc119.2, pAs1) and the GAA simple sequence repeat probe. We also performed sequential rapid in situ hybridization with genomic DNA of D. caespitosa. Two chromosome pairs bearing transcriptionally active 45S rDNA loci and five pairs with 5S rDNA sites were detected. A weak intercalary site of telomere repeats was revealed on the largest chromosome in addition to telomere hybridization signals at terminal positions. This fact confirms indirectly the hypothesis that chromosome fusion might have been the cause of the unusual for cereals chromosome number in this species. Based on patterns of distribution of the examined molecular cytogenetic markers, all chromosomes in karyotypes were identified, and chromosome idiograms of D. antarctica were constructed. B chromosomes were found in most karyotypes of plants from Darboux Island. A mixoploid plant with mainly triploid cells bearing a Robertsonian rearrangement was detected among typical diploid specimens from Great Jalour Island. The karyotype variability found in D. antarctica is probably an expression of genome instability induced by environmental stress factors. The differences in C-banding patterns and in chromosome distribution of rDNA loci as well as homologous highly repeated DNA sequences detected between genomes of D. antarctica and its related species D. caespitosa indicate that genome reorganization involving coding and noncoding repeated DNA sequences had occurred during the divergence of these species. PMID:26394331
Development of EST-derived microsatellite markers in the aquatic macrophyte Ranunculus bungei (Ranunculaceae)1

PubMed Central

Wu, Zhigang; Wu, Jinwei; Wang, Yalin; Hou, Hongwei

2017-01-01

Premise of the study: Microsatellite or simple sequence repeat (SSR) markers were developed to investigate the influence of ecological factors on gene flow and spatial genetic structuring of the submerged plant Ranunculus bungei (Ranunculaceae), which is regarded as an important species for understanding how plants adapt to an aquatic environment. Methods and Results: Twenty-two microsatellite loci were identified from an expressed sequence tag (EST) library. The number of alleles per locus ranged from one to five, and the expected heterozygosity varied from 0.0 to 0.5 in four Chinese populations of R. bungei. Fourteen loci were polymorphic and significantly deviated from Hardy–Weinberg equilibrium. All of the loci were found to be amplifiable in two other species of Ranunculus section Batrachium, and cross-amplification in six riparian and aquatic species of Ranunculaceae was also partially successful. Conclusions: These novel EST-SSR markers will be useful for ecological and evolutionary studies of R. bungei as well as related species. PMID:28791205
[Clustered regularly interspaced short palindromic repeats: structure, function and application--a review].

PubMed

Cui, Yujun; Li, Yanjun; Yan, Yanfeng; Yang, Ruifu

2008-11-01

CRISPRs (Clustered Regularly Interspaced Short Palindromic Repeats), the basis of spoligotyping technology, can provide prokaryotes with heritable adaptive immunity against phages' invasion. Studies on CRISPR loci and their associated elements, including various CAS (CRISPR-associated) proteins and leader sequences, are still in its infant period. We introduce the brief history', structure, function, bioinformatics research and application of this amazing immunity system in prokaryotic organism for inspiring more scientists to find their interest in this developing topic.
Characterization and transferability of microsatellite markers of the cultivated peanut (Arachis hypogaea)

PubMed Central

Gimenes, Marcos A; Hoshino, Andrea A; Barbosa, Andrea VG; Palmieri, Dario A; Lopes, Catalina R

2007-01-01

Background The genus Arachis includes Arachis hypogaea (cultivated peanut) and wild species that are used in peanut breeding or as forage. Molecular markers have been employed in several studies of this genus, but microsatellite markers have only been used in few investigations. Microsatellites are very informative and are useful to assess genetic variability, analyze mating systems and in genetic mapping. The objectives of this study were to develop A. hypogaea microsatellite loci and to evaluate the transferability of these markers to other Arachis species. Results Thirteen loci were isolated and characterized using 16 accessions of A. hypogaea. The level of variation found in A. hypogaea using microsatellites was higher than with other markers. Cross-transferability of the markers was also high. Sequencing of the fragments amplified using the primer pair Ah11 from 17 wild Arachis species showed that almost all wild species had similar repeated sequence to the one observed in A. hypogaea. Sequence data suggested that there is no correlation between taxonomic relationship of a wild species to A. hypogaea and the number of repeats found in its microsatellite loci. Conclusion These results show that microsatellite primer pairs from A. hypogaea have multiple uses. A higher level of variation among A. hypogaea accessions can be detected using microsatellite markers in comparison to other markers, such as RFLP, RAPD and AFLP. The microsatellite primers of A. hypogaea showed a very high rate of transferability to other species of the genus. These primer pairs provide important tools to evaluate the genetic variability and to assess the mating system in Arachis species. PMID:17326826
The Role of the Y-Chromosome in the Establishment of Murine Hybrid Dysgenesis and in the Analysis of the Nucleotide Sequence Organization, Genetic Transmission and Evolution of Repeated Sequences.

NASA Astrophysics Data System (ADS)

Nallaseth, Ferez Soli

The Y-chromosome presents a unique cytogenetic framework for the evolution of nucleotide sequences. Alignment of nine Y-chromosomal fragments in their increasing Y-specific/non Y-specific (male/female) sequence divergence ratios was directly and inversely related to their interspersion on these two respective genomic fractions. Sequence analysis confirmed a direct relationship between divergence ratios and the Alu, LINE-1, Satellite and their derivative oligonucleotide contents. Thus their relocation on the Y-chromosome is followed by sequence divergence rather than the well documented concerted evolution of these non-coding progenitor repeated sequences. Five of the nine Y-chromosomal fragments are non-pseudoautosomal and transcribed into heterogeneous PolyA^+ RNA and thus can be retrotransposed. Evolutionary and computer analysis identified homologous oligonucleotide tracts in several human loci suggesting common and random mechanistic origins. Dysgenic genomes represent the accelerated evolution driving sequence divergence (McClintock, 1984). Sex reversal and sterility characterizing dysgenesis occurs in C57BL/6JY ^{rm Pos} but not in 129/SvY^{rm Pos} derivative strains. High frequency, random, multi-locus deletion products of the feral Y^{ rm Pos}-chromosome are generated in the germlines of F1(C57BL/6J X 129/SvY^{ rm Pos})(male) and C57BL/6JY ^{rm Pos}(male) but not in 129/SvY^{rm Pos}(male). Equal, 10^{-1}, 10^ {-2}, and 0 copies (relative to males) of Y^{rm Pos}-specific deletion products respectively characterize C57BL/6JY ^{rm Pos} (HC), (LC), (T) and (F) females. The testes determining loci of inactive Y^{rm Pos}-chromosomes in C57BL/6JY^{rm Pos} HC females are the preferentially deleted/rearranged Y ^{rm Pos}-sequences. Disruption of regulation of plasma testosterone and hepatic MUP-A mRNA levels, TRD of a 4.7 Kbp EcoR1 fragment suggest disruption of autosomal/X-chromosomal sequences. These data and the highly repeated progenitor (Alu, GATA, LINE-1) sequence content of deletion products confirmed the previously unidentified loss of genetic control of mammalian chromosome biology and hybrid dysgenesis.
Genetic diversity among Puccinia melanocephala isolates from Brazil assessed using simple sequence repeat markers.

PubMed

Peixoto-Junior, R F; Creste, S; Landell, M G A; Nunes, D S; Sanguino, A; Campos, M F; Vencovsky, R; Tambarussi, E V; Figueira, A

2014-09-26

Brown rust (causal agent Puccinia melanocephala) is an important sugarcane disease that is responsible for large losses in yield worldwide. Despite its importance, little is known regarding the genetic diversity of this pathogen in the main Brazilian sugarcane cultivation areas. In this study, we characterized the genetic diversity of 34 P. melanocephala isolates from 4 Brazilian states using loci identified from an enriched simple sequence repeat (SSR) library. The aggressiveness of 3 isolates from major sugarcane cultivation areas was evaluated by inoculating an intermediately resistant and a susceptible cultivar. From the enriched library, 16 SSR-specific primers were developed, which produced scorable alleles. Of these, 4 loci were polymorphic and 12 were monomorphic for all isolates evaluated. The molecular characterization of the 34 isolates of P. melanocephala conducted using 16 SSR loci revealed the existence of low genetic variability among the isolates. The average estimated genetic distance was 0.12. Phenetic analysis based on Nei's genetic distance clustered the isolates into 2 major groups. Groups I and II included 18 and 14 isolates, respectively, and both groups contained isolates from all 4 geographic regions studied. Two isolates did not cluster with these groups. It was not possible to obtain clusters according to location or state of origin. Analysis of disease severity data revealed that the isolates did not show significant differences in aggressiveness between regions.
An evaluation of the International Society for Animal Genetics recommended parentage and identification panel for the domestic pigeon (Columba livia domestica).

PubMed

de Groot, M; van Haeringen, W A

2017-08-01

In this study, the International Society for Animal Genetics (ISAG) recommended panel for the identification of the domestic pigeon (Columba livia domestica) is characterized based on commonly used statistical parameters. The marker panel is based on 16 short tandem repeat (STR) loci (PIGN15, PIGN10, PIGN57, PIGN26, CliμD16, CliμD19, PIGN12, CliμD17, CliμT17, PIGN04, CliμD01, CliμD11, CliμD35, CliμT02, CliμT13, CliμT43). The alleles of the 16 loci consist of a mixture of tri-, tetra-, penta- and hexameric repeat patterns. A sex determination marker was included in the multiplex for quality control. The repeat sequence of the PIGN markers was previously unpublished and therefore sequenced to reveal the sequence pattern. In total, 1421 pigeons were genotyped on 16 STR loci to generate allele frequency data for each locus. For all 16 markers combined, a PE1 (combined non-exclusion probability, first parent) of 0.9986 and PE2 (combined non-exclusion probability, second parent) of >0.9999 was observed. Comparing the alleged father and mother, a PE value of >0.9999 was observed. Two of the markers, CliμD19 and PIGN12, were found to have relatively high Hardy-Weinberg equilibrium and F(null) values. Therefore these markers may be considered to be replaced by other STRs. Another point of discussion may be to add a gender identification marker to the recommended ISAG panel. Not only can this serve as an extra identification marker, but this can also confirm the sex of a sample, because it is challenging to determine the sex based on phenotypical characteristics, especially for chicks. In conclusion, the set of 16 STR markers can be used in routine parentage verification and the identification of individuals. © 2017 Stichting International Foundation for Animal Genetics.
Development of Proteogenomic Approaches to Analyze the Role of Virus-Microbe Interactions in Shaping Natural Microbial Communities

DOE Office of Scientific and Technical Information (OSTI.GOV)

Banfield, Jillian; Breitbart, Mya; VerBerkmoes, Nathan

CRISPRs (clustered regularly interspaced short palindromic repeats) are adaptive immune systems in Bacteria and Archaea. Transcripts of the spacers that separate the repeats confer immunity through sequence identity with a targeted region (proto-spacer) in phage/viral, plasmid, or other foreign DNA. Short sequences immediately flanking the proto-spacer (proto-spacer adjacent motifs—PAMs) are important in both procuring spacers from and providing immunity to targeted sequences. New spacers are incorporated unidirectionally at the leader end of the CRISPR loci, thus recording a timeline of recent viral exposure. In the early phase of our research, we documented extremely rapid diversification of the CRISPR loci inmore » natural populations [Tyson and Banfield, 2008] matched by high levels of sequence variation in natural viral populations [Andersson and Banfield, 2008]. Since then, in a genetically tractable model laboratory system, we have 1) tracked phage mutation and CRISPR diversification, and in a natural model system, we have 2) examined population history via over time, 3) investigated the timescale over which spacers become ineffective and the process by which ineffective spacers are removed, and 4) analyzed viral diversity. In addition to research activities, our group has organized five international CRISPR meetings, the fifth to be held at University of California, Berkeley in June 2012. Most importantly, the project provided the majority of funding support for Christine Sun (Ph.D. 2012).« less
Fourteen polymorphic microsatellite markers for the threatened Arnica montana (Asteraceae)1

PubMed Central

Duwe, Virginia K.; Ismail, Sascha A.; Buser, Andres; Sossai, Esther; Borsch, Thomas; Muller, Ludo A. H.

2015-01-01

• Premise of the study: Microsatellite markers were developed to investigate population genetic structure in the threatened species Arnica montana. • Methods and Results: Fourteen microsatellite markers with di-, tetra-, and hexanucleotide repeat motifs were developed for A. montana using 454 pyrosequencing without and with library-enrichment methods, resulting in 56,545 sequence reads and 14,467 sequence reads, respectively. All loci showed a high level of polymorphism, with allele numbers ranging from four to 11 in five individuals from five populations (25 samples) and an expected heterozygosity ranging from 0.192 to 0.648 across the loci. • Conclusions: This set of microsatellite markers is the first one described for A. montana and will facilitate conservation genetic applications as well as the understanding of phylogeographic patterns in this species. PMID:25606354
The first genetic map of the American cranberry: exploration of synteny conservation and quantitative trait loci.

PubMed

Georgi, Laura; Johnson-Cicalese, Jennifer; Honig, Josh; Das, Sushma Parankush; Rajah, Veeran D; Bhattacharya, Debashish; Bassil, Nahla; Rowland, Lisa J; Polashock, James; Vorsa, Nicholi

2013-03-01

The first genetic map of cranberry (Vaccinium macrocarpon) has been constructed, comprising 14 linkage groups totaling 879.9 cM with an estimated coverage of 82.2 %. This map, based on four mapping populations segregating for field fruit-rot resistance, contains 136 distinct loci. Mapped markers include blueberry-derived simple sequence repeat (SSR) and cranberry-derived sequence-characterized amplified region markers previously used for fingerprinting cranberry cultivars. In addition, SSR markers were developed near cranberry sequences resembling genes involved in flavonoid biosynthesis or defense against necrotrophic pathogens, or conserved orthologous set (COS) sequences. The cranberry SSRs were developed from next-generation cranberry genomic sequence assemblies; thus, the positions of these SSRs on the genomic map provide information about the genomic location of the sequence scaffold from which they were derived. The use of SSR markers near COS and other functional sequences, plus 33 SSR markers from blueberry, facilitates comparisons of this map with maps of other plant species. Regions of the cranberry map were identified that showed conservation of synteny with Vitis vinifera and Arabidopsis thaliana. Positioned on this map are quantitative trait loci (QTL) for field fruit-rot resistance (FFRR), fruit weight, titratable acidity, and sound fruit yield (SFY). The SFY QTL is adjacent to one of the fruit weight QTL and may reflect pleiotropy. Two of the FFRR QTL are in regions of conserved synteny with grape and span defense gene markers, and the third FFRR QTL spans a flavonoid biosynthetic gene.
CRISPR interference and priming varies with individual spacer sequences

PubMed Central

Xue, Chaoyou; Seetharam, Arun S.; Musharova, Olga; Severinov, Konstantin; J. Brouns, Stan J.; Severin, Andrew J.; Sashital, Dipali G.

2015-01-01

CRISPR–Cas (clustered regularly interspaced short palindromic repeats-CRISPR associated) systems allow bacteria to adapt to infection by acquiring ‘spacer’ sequences from invader DNA into genomic CRISPR loci. Cas proteins use RNAs derived from these loci to target cognate sequences for destruction through CRISPR interference. Mutations in the protospacer adjacent motif (PAM) and seed regions block interference but promote rapid ‘primed’ adaptation. Here, we use multiple spacer sequences to reexamine the PAM and seed sequence requirements for interference and priming in the Escherichia coli Type I-E CRISPR–Cas system. Surprisingly, CRISPR interference is far more tolerant of mutations in the seed and the PAM than previously reported, and this mutational tolerance, as well as priming activity, is highly dependent on spacer sequence. We identify a large number of functional PAMs that can promote interference, priming or both activities, depending on the associated spacer sequence. Functional PAMs are preferentially acquired during unprimed ‘naïve’ adaptation, leading to a rapid priming response following infection. Our results provide numerous insights into the importance of both spacer and target sequences for interference and priming, and reveal that priming is a major pathway for adaptation during initial infection. PMID:26586800
Highly diverse variable number tandem repeat loci in the E. coli O157:H7 and O55:H7 genomes for high-resolution molecular typing.

PubMed

Keys, C; Kemper, S; Keim, P

2005-01-01

Evaluation of the Escherichia coli genome for variable number tandem repeat (VNTR) loci in order to provide a subtyping tool with greater discrimination and more efficient capacity. Twenty-nine putative VNTR loci were identified from the E. coli genomic sequence. Their variability was validated by characterizing the number of repeats at each locus in a set of 56 E. coli O157:H7/HN and O55:H7 isolates. An optimized multiplex assay system was developed to facility high capacity analysis. Locus diversity values ranged from 0.23 to 0.95 while the number of alleles ranged from two to 29. This multiple-locus VNTR analysis (MLVA) data was used to describe genetic relationships among these isolates and was compared with PFGE (pulse field gel electrophoresis) data from a subset of the same strains. Genetic similarity values were highly correlated between the two approaches, through MLVA was capable of discrimination amongst closely related isolates when PFGE similar values were equal to 1.0. Highly variable VNTR loci exist in the E. coli O157:H7 genome and are excellent estimators of genetic relationships, in particular for closely related isolates. Escherichia coli O157:H7 MLVA offers a complimentary analysis to the more traditional PFGE approach. Application of MLVA to an outbreak cluster could generate superior molecular epidemiology and result in a more effective public health response.
Methylation of L1Hs promoters is lower on the inactive X, has a tendency of being higher on autosomes in smaller genomes and shows inter-individual variability at some loci.

PubMed

Singer, Heike; Walier, Maja; Nüsgen, Nicole; Meesters, Christian; Schreiner, Felix; Woelfle, Joachim; Fimmers, Rolf; Wienker, Thomas; Kalscheuer, Vera M; Becker, Tim; Schwaab, Rainer; Oldenburg, Johannes; El-Maarri, Osman

2012-01-01

LINE-1 repeats account for ~17% of the human genome. Little is known about their individual methylation patterns, because their repetitive, almost identical sequences make them difficult to be individually targeted. Here, we used bisulfite conversion to study methylation at individual LINE-1 repeats. The loci studied included 39 X-linked loci and 5 autosomal loci. On the X chromosome in women, we found statistically significant less methylation at almost all L1Hs compared with men. Methylation at L1P and L1M did not correlate with the inactivation status of the host DNA, while the majority of L1Hs that were possible to be studied lie in inactivated regions. To investigate whether the male-female differences at L1Hs on the X are linked to the inactivation process itself rather than to a mere influence of gender, we analyzed six of the L1Hs loci on the X chromosome in Turners and Klinefelters which have female and male phenotype, respectively, but with reversed number of X chromosomes. We could confirm that all samples with two X chromosomes are hypomethylated at the L1Hs loci. Therefore, the inactive X is hypomethylated at L1Hs; the latter could play an exclusive role in the X chromosome inactivation process. At autosomal L1Hs, methylation levels showed a correlation tendency between methylation level and genome size, with higher methylation observed at most loci in individuals with one X chromosome and the lowest in XXY individuals. In summary, loci-specific LINE-1 methylation levels show considerable plasticity and depend on genomic position and constitution.
Methylation of L1Hs promoters is lower on the inactive X, has a tendency of being higher on autosomes in smaller genomes and shows inter-individual variability at some loci

PubMed Central

Singer, Heike; Walier, Maja; Nüsgen, Nicole; Meesters, Christian; Schreiner, Felix; Woelfle, Joachim; Fimmers, Rolf; Wienker, Thomas; Kalscheuer, Vera M.; Becker, Tim; Schwaab, Rainer; Oldenburg, Johannes; El-Maarri, Osman

2012-01-01

LINE-1 repeats account for ∼17% of the human genome. Little is known about their individual methylation patterns, because their repetitive, almost identical sequences make them difficult to be individually targeted. Here, we used bisulfite conversion to study methylation at individual LINE-1 repeats. The loci studied included 39 X-linked loci and 5 autosomal loci. On the X chromosome in women, we found statistically significant less methylation at almost all L1Hs compared with men. Methylation at L1P and L1M did not correlate with the inactivation status of the host DNA, while the majority of L1Hs that were possible to be studied lie in inactivated regions. To investigate whether the male–female differences at L1Hs on the X are linked to the inactivation process itself rather than to a mere influence of gender, we analyzed six of the L1Hs loci on the X chromosome in Turners and Klinefelters which have female and male phenotype, respectively, but with reversed number of X chromosomes. We could confirm that all samples with two X chromosomes are hypomethylated at the L1Hs loci. Therefore, the inactive X is hypomethylated at L1Hs; the latter could play an exclusive role in the X chromosome inactivation process. At autosomal L1Hs, methylation levels showed a correlation tendency between methylation level and genome size, with higher methylation observed at most loci in individuals with one X chromosome and the lowest in XXY individuals. In summary, loci-specific LINE-1 methylation levels show considerable plasticity and depend on genomic position and constitution. PMID:21972244
Genome-Wide Association Study Identifies Loci for Salt Tolerance during Germination in Autotetraploid Alfalfa (Medicago sativa L.) Using Genotyping-by-Sequencing

PubMed Central

Yu, Long-Xi; Liu, Xinchun; Boge, William; Liu, Xiang-Ping

2016-01-01

Salinity is one of major abiotic stresses limiting alfalfa (Medicago sativa L.) production in the arid and semi-arid regions in US and other counties. In this study, we used a diverse panel of alfalfa accessions previously described by Zhang et al. (2015) to identify molecular markers associated with salt tolerance during germination using genome-wide association study (GWAS) and genotyping-by-sequencing (GBS). Phenotyping was done by germinating alfalfa seeds under different levels of salt stress. Phenotypic data of adjusted germination rates and SNP markers generated by GBS were used for marker-trait association. Thirty six markers were significantly associated with salt tolerance in at least one level of salt treatments. Alignment of sequence tags to the Medicago truncatula genome revealed genetic locations of the markers on all chromosomes except chromosome 3. Most significant markers were found on chromosomes 1, 2, and 4. BLAST search using the flanking sequences of significant markers identified 14 putative candidate genes linked to 23 significant markers. Most of them were repeatedly identified in two or three salt treatments. Several loci identified in the present study had similar genetic locations to the reported QTL associated with salt tolerance in M. truncatula. A locus identified on chromosome 6 by this study overlapped with that by drought in our previous study. To our knowledge, this is the first report on mapping loci associated with salt tolerance during germination in autotetraploid alfalfa. Further investigation on these loci and their linked genes would provide insight into understanding molecular mechanisms by which salt and drought stresses affect alfalfa growth. Functional markers closely linked to the resistance loci would be useful for MAS to improve alfalfa cultivars with enhanced resistance to drought and salt stresses. PMID:27446182
Genome-Wide Association Study Identifies Loci for Salt Tolerance during Germination in Autotetraploid Alfalfa (Medicago sativa L.) Using Genotyping-by-Sequencing.

PubMed

Yu, Long-Xi; Liu, Xinchun; Boge, William; Liu, Xiang-Ping

2016-01-01

Salinity is one of major abiotic stresses limiting alfalfa (Medicago sativa L.) production in the arid and semi-arid regions in US and other counties. In this study, we used a diverse panel of alfalfa accessions previously described by Zhang et al. (2015) to identify molecular markers associated with salt tolerance during germination using genome-wide association study (GWAS) and genotyping-by-sequencing (GBS). Phenotyping was done by germinating alfalfa seeds under different levels of salt stress. Phenotypic data of adjusted germination rates and SNP markers generated by GBS were used for marker-trait association. Thirty six markers were significantly associated with salt tolerance in at least one level of salt treatments. Alignment of sequence tags to the Medicago truncatula genome revealed genetic locations of the markers on all chromosomes except chromosome 3. Most significant markers were found on chromosomes 1, 2, and 4. BLAST search using the flanking sequences of significant markers identified 14 putative candidate genes linked to 23 significant markers. Most of them were repeatedly identified in two or three salt treatments. Several loci identified in the present study had similar genetic locations to the reported QTL associated with salt tolerance in M. truncatula. A locus identified on chromosome 6 by this study overlapped with that by drought in our previous study. To our knowledge, this is the first report on mapping loci associated with salt tolerance during germination in autotetraploid alfalfa. Further investigation on these loci and their linked genes would provide insight into understanding molecular mechanisms by which salt and drought stresses affect alfalfa growth. Functional markers closely linked to the resistance loci would be useful for MAS to improve alfalfa cultivars with enhanced resistance to drought and salt stresses.
Resistance gene enrichment sequencing (RenSeq) enables reannotation of the NB-LRR gene family from sequenced plant genomes and rapid mapping of resistance loci in segregating populations

PubMed Central

Jupe, Florian; Witek, Kamil; Verweij, Walter; Śliwka, Jadwiga; Pritchard, Leighton; Etherington, Graham J; Maclean, Dan; Cock, Peter J; Leggett, Richard M; Bryan, Glenn J; Cardle, Linda; Hein, Ingo; Jones, Jonathan DG

2013-01-01

Summary RenSeq is a NB-LRR (nucleotide binding-site leucine-rich repeat) gene-targeted, Resistance gene enrichment and sequencing method that enables discovery and annotation of pathogen resistance gene family members in plant genome sequences. We successfully applied RenSeq to the sequenced potato Solanum tuberosum clone DM, and increased the number of identified NB-LRRs from 438 to 755. The majority of these identified R gene loci reside in poorly or previously unannotated regions of the genome. Sequence and positional details on the 12 chromosomes have been established for 704 NB-LRRs and can be accessed through a genome browser that we provide. We compared these NB-LRR genes and the corresponding oligonucleotide baits with the highest sequence similarity and demonstrated that ∼80% sequence identity is sufficient for enrichment. Analysis of the sequenced tomato S. lycopersicum ‘Heinz 1706’ extended the NB-LRR complement to 394 loci. We further describe a methodology that applies RenSeq to rapidly identify molecular markers that co-segregate with a pathogen resistance trait of interest. In two independent segregating populations involving the wild Solanum species S. berthaultii (Rpi-ber2) and S. ruiz-ceballosii (Rpi-rzc1), we were able to apply RenSeq successfully to identify markers that co-segregate with resistance towards the late blight pathogen Phytophthora infestans. These SNP identification workflows were designed as easy-to-adapt Galaxy pipelines. PMID:23937694

Genetic variation and evolutionary stability of the FMR1 CGG repeat in six closed human populations

DOE Office of Scientific and Technical Information (OSTI.GOV)

Eichler, E.E.; Nelson, D.L.

1996-07-12

In an attempt to understand the allelic diversity and mutability of the human FMR1 CGG repeat, we have analyzed the AGG substructure of this locus within six genetically-closed populations (Mbuti pygmy, Baka pygmy, R. surui, Karitiana, Mayan, and Hutterite). Most alleles (61/92 or 66%) possessed two AGG interspersions occurring with a periodicity of one AGG every nine or ten CGG repeats, indicating that this pattern is highly conserved in all human populations. Significant differences in allele distribution were observed among the populations for rare variants possessing fewer or more AGG interruptions than the canonical FMR1 CGG repeat sequence. Comparisons ofmore » expected heterozygosity of the FMR1 CGG repeat locus with 30 other microsatellite loci, demonstrated remarkably similar levels of polymorphism within each population, suggesting that most FMR1 CGG repeat alleles mutate at rates indistinguishable from other microsatellite loci. A single allele (1 out of 92) was identified with a large uninterrupted tract of pure repeats (42 pure CGG triplets). Retrospective pedigree analysis indicated that this allele had been transmitted unstably. Although such alleles mutate rapidly and likely represent evolving premutations, our analysis suggests that in spite of the estimated frequency of their occurrence, these unstable alleles do not significantly alter the expected heterozygosity of the FMR1 CGG repeat in the human population. 45 refs., 1 fig., 2 tabs.« less
Plant-symbiotic fungi as chemical engineers: multi-genome analysis of the clavicipitaceae reveals dynamics of alkaloid loci.

PubMed

Schardl, Christopher L; Young, Carolyn A; Hesse, Uljana; Amyotte, Stefan G; Andreeva, Kalina; Calie, Patrick J; Fleetwood, Damien J; Haws, David C; Moore, Neil; Oeser, Birgitt; Panaccione, Daniel G; Schweri, Kathryn K; Voisey, Christine R; Farman, Mark L; Jaromczyk, Jerzy W; Roe, Bruce A; O'Sullivan, Donal M; Scott, Barry; Tudzynski, Paul; An, Zhiqiang; Arnaoudova, Elissaveta G; Bullock, Charles T; Charlton, Nikki D; Chen, Li; Cox, Murray; Dinkins, Randy D; Florea, Simona; Glenn, Anthony E; Gordon, Anna; Güldener, Ulrich; Harris, Daniel R; Hollin, Walter; Jaromczyk, Jolanta; Johnson, Richard D; Khan, Anar K; Leistner, Eckhard; Leuchtmann, Adrian; Li, Chunjie; Liu, JinGe; Liu, Jinze; Liu, Miao; Mace, Wade; Machado, Caroline; Nagabhyru, Padmaja; Pan, Juan; Schmid, Jan; Sugawara, Koya; Steiner, Ulrike; Takach, Johanna E; Tanaka, Eiji; Webb, Jennifer S; Wilson, Ella V; Wiseman, Jennifer L; Yoshida, Ruriko; Zeng, Zheng

2013-01-01

The fungal family Clavicipitaceae includes plant symbionts and parasites that produce several psychoactive and bioprotective alkaloids. The family includes grass symbionts in the epichloae clade (Epichloë and Neotyphodium species), which are extraordinarily diverse both in their host interactions and in their alkaloid profiles. Epichloae produce alkaloids of four distinct classes, all of which deter insects, and some-including the infamous ergot alkaloids-have potent effects on mammals. The exceptional chemotypic diversity of the epichloae may relate to their broad range of host interactions, whereby some are pathogenic and contagious, others are mutualistic and vertically transmitted (seed-borne), and still others vary in pathogenic or mutualistic behavior. We profiled the alkaloids and sequenced the genomes of 10 epichloae, three ergot fungi (Claviceps species), a morning-glory symbiont (Periglandula ipomoeae), and a bamboo pathogen (Aciculosporium take), and compared the gene clusters for four classes of alkaloids. Results indicated a strong tendency for alkaloid loci to have conserved cores that specify the skeleton structures and peripheral genes that determine chemical variations that are known to affect their pharmacological specificities. Generally, gene locations in cluster peripheries positioned them near to transposon-derived, AT-rich repeat blocks, which were probably involved in gene losses, duplications, and neofunctionalizations. The alkaloid loci in the epichloae had unusual structures riddled with large, complex, and dynamic repeat blocks. This feature was not reflective of overall differences in repeat contents in the genomes, nor was it characteristic of most other specialized metabolism loci. The organization and dynamics of alkaloid loci and abundant repeat blocks in the epichloae suggested that these fungi are under selection for alkaloid diversification. We suggest that such selection is related to the variable life histories of the epichloae, their protective roles as symbionts, and their associations with the highly speciose and ecologically diverse cool-season grasses.
Plant-Symbiotic Fungi as Chemical Engineers: Multi-Genome Analysis of the Clavicipitaceae Reveals Dynamics of Alkaloid Loci

PubMed Central

Schardl, Christopher L.; Young, Carolyn A.; Hesse, Uljana; Amyotte, Stefan G.; Andreeva, Kalina; Calie, Patrick J.; Fleetwood, Damien J.; Haws, David C.; Moore, Neil; Oeser, Birgitt; Panaccione, Daniel G.; Schweri, Kathryn K.; Voisey, Christine R.; Farman, Mark L.; Jaromczyk, Jerzy W.; Roe, Bruce A.; O'Sullivan, Donal M.; Scott, Barry; Tudzynski, Paul; An, Zhiqiang; Arnaoudova, Elissaveta G.; Bullock, Charles T.; Charlton, Nikki D.; Chen, Li; Cox, Murray; Dinkins, Randy D.; Florea, Simona; Glenn, Anthony E.; Gordon, Anna; Güldener, Ulrich; Harris, Daniel R.; Hollin, Walter; Jaromczyk, Jolanta; Johnson, Richard D.; Khan, Anar K.; Leistner, Eckhard; Leuchtmann, Adrian; Li, Chunjie; Liu, JinGe; Liu, Jinze; Liu, Miao; Mace, Wade; Machado, Caroline; Nagabhyru, Padmaja; Pan, Juan; Schmid, Jan; Sugawara, Koya; Steiner, Ulrike; Takach, Johanna E.; Tanaka, Eiji; Webb, Jennifer S.; Wilson, Ella V.; Wiseman, Jennifer L.; Yoshida, Ruriko; Zeng, Zheng

2013-01-01

The fungal family Clavicipitaceae includes plant symbionts and parasites that produce several psychoactive and bioprotective alkaloids. The family includes grass symbionts in the epichloae clade (Epichloë and Neotyphodium species), which are extraordinarily diverse both in their host interactions and in their alkaloid profiles. Epichloae produce alkaloids of four distinct classes, all of which deter insects, and some—including the infamous ergot alkaloids—have potent effects on mammals. The exceptional chemotypic diversity of the epichloae may relate to their broad range of host interactions, whereby some are pathogenic and contagious, others are mutualistic and vertically transmitted (seed-borne), and still others vary in pathogenic or mutualistic behavior. We profiled the alkaloids and sequenced the genomes of 10 epichloae, three ergot fungi (Claviceps species), a morning-glory symbiont (Periglandula ipomoeae), and a bamboo pathogen (Aciculosporium take), and compared the gene clusters for four classes of alkaloids. Results indicated a strong tendency for alkaloid loci to have conserved cores that specify the skeleton structures and peripheral genes that determine chemical variations that are known to affect their pharmacological specificities. Generally, gene locations in cluster peripheries positioned them near to transposon-derived, AT-rich repeat blocks, which were probably involved in gene losses, duplications, and neofunctionalizations. The alkaloid loci in the epichloae had unusual structures riddled with large, complex, and dynamic repeat blocks. This feature was not reflective of overall differences in repeat contents in the genomes, nor was it characteristic of most other specialized metabolism loci. The organization and dynamics of alkaloid loci and abundant repeat blocks in the epichloae suggested that these fungi are under selection for alkaloid diversification. We suggest that such selection is related to the variable life histories of the epichloae, their protective roles as symbionts, and their associations with the highly speciose and ecologically diverse cool-season grasses. PMID:23468653
Cytogenetic Analysis of Populus trichocarpa - Ribosomal DNA, Telomere Repeat Sequence, and Marker-selected BACs

DOE Office of Scientific and Technical Information (OSTI.GOV)

Tuskan, Gerald A; Gunter, Lee E; DiFazio, Stephen P

The 18S-28S rDNA and 5S rDNA loci in Populus trichocarpa were localized using fluorescent in situ hybridization (FISH). Two 18S-28S rDNA sites and one 5S rDNA site were identified and located at the ends of 3 different chromosomes. FISH signals from the Arabidopsis -type telomere repeat sequence were observed at the distal ends of each chromosome. Six BAC clones selected from 2 linkage groups based on genome sequence assembly (LG-I and LG-VI) were localized on 2 chromosomes, as expected. BACs from LG-I hybridized to the longest chromosome in the complement. All BAC positions were found to be concordant with sequencemore » assembly positions. BAC-FISH will be useful for delineating each of the Populus trichocarpa chromosomes and improving the sequence assembly of this model angiosperm tree species.« less
New microsatellite loci for Prosopis alba and P. chilensis (Fabaceae)1

PubMed Central

Bessega, Cecilia F.; Pometti, Carolina L.; Miller, Joe T.; Watts, Richard; Saidman, Beatriz O.; Vilardi, Juan C.

2013-01-01

• Premise of the study: As only six useful microsatellite loci that exhibit broad cross-amplification are so far available for Prosopis species, it is necessary to develop a larger number of codominant markers for population genetic studies. Simple sequence repeat (SSR) markers obtained for Prosopis species from a 454 pyrosequencing run were optimized and characterized for studies in P. alba and P. chilensis. • Methods and Results: Twelve markers that were successfully amplified showed polymorphism in P. alba and P. chilensis. The number of alleles per locus ranged between two and seven and heterozygosity estimates ranged from 0.2 to 0.8. Most of these loci cross-amplify in P. ruscifolia, P. flexuosa, P. kuntzei, P. glandulosa, and P. pallida. • Conclusions: These loci will enable genetic diversity studies of P. alba and P. chilensis and contribute to fine-scale population structure, indirect estimation of relatedness among individuals, and marker-assisted selection. PMID:25202541
Cytogenetic Diversity of Simple Sequences Repeats in Morphotypes of Brassica rapa ssp. chinensis

PubMed Central

Zheng, Jin-shuang; Sun, Cheng-zhen; Zhang, Shu-ning; Hou, Xi-lin; Bonnema, Guusje

2016-01-01

A significant fraction of the nuclear DNA of all eukaryotes is comprised of simple sequence repeats (SSRs). Although these sequences are widely used for studying genetic variation, linkage mapping and evolution, little attention had been paid to the chromosomal distribution and cytogenetic diversity of these sequences. In this paper, we report the distribution characterization of mono-, di-, and tri-nucleotide SSRs in Brassica rapa ssp. chinensis. Fluorescence in situ hybridization was used to characterize the cytogenetic diversity of SSRs among morphotypes of B. rapa ssp. chinensis. The proportion of different SSR motifs varied among morphotypes of B. rapa ssp. chinensis, with tri-nucleotide SSRs being more prevalent in the genome of B. rapa ssp. chinensis. We determined the chromosomal locations of mono-, di-, and tri-nucleotide repeat loci. The results showed that the chromosomal distribution of SSRs in the different morphotypes is non-random and motif-dependent, and allowed us to characterize the relative variability in terms of SSR numbers and similar chromosomal distributions in centromeric/peri-centromeric heterochromatin. The differences between SSR repeats with respect to abundance and distribution indicate that SSRs are a driving force in the genomic evolution of B. rapa species. Our results provide a comprehensive view of the SSR sequence distribution and evolution for comparison among morphotypes B. rapa ssp. chinensis. PMID:27507974
Cytogenetic Diversity of Simple Sequences Repeats in Morphotypes of Brassica rapa ssp. chinensis.

PubMed

Zheng, Jin-Shuang; Sun, Cheng-Zhen; Zhang, Shu-Ning; Hou, Xi-Lin; Bonnema, Guusje

2016-01-01

A significant fraction of the nuclear DNA of all eukaryotes is comprised of simple sequence repeats (SSRs). Although these sequences are widely used for studying genetic variation, linkage mapping and evolution, little attention had been paid to the chromosomal distribution and cytogenetic diversity of these sequences. In this paper, we report the distribution characterization of mono-, di-, and tri-nucleotide SSRs in Brassica rapa ssp. chinensis. Fluorescence in situ hybridization was used to characterize the cytogenetic diversity of SSRs among morphotypes of B. rapa ssp. chinensis. The proportion of different SSR motifs varied among morphotypes of B. rapa ssp. chinensis, with tri-nucleotide SSRs being more prevalent in the genome of B. rapa ssp. chinensis. We determined the chromosomal locations of mono-, di-, and tri-nucleotide repeat loci. The results showed that the chromosomal distribution of SSRs in the different morphotypes is non-random and motif-dependent, and allowed us to characterize the relative variability in terms of SSR numbers and similar chromosomal distributions in centromeric/peri-centromeric heterochromatin. The differences between SSR repeats with respect to abundance and distribution indicate that SSRs are a driving force in the genomic evolution of B. rapa species. Our results provide a comprehensive view of the SSR sequence distribution and evolution for comparison among morphotypes B. rapa ssp. chinensis.
A framework linkage map of perennial ryegrass based on SSR markers

Treesearch

G.P. Gill; P.L. Wilcox; D.J. Whittaker; R.A. Winz; P. Bickerstaff; Craig E. Echt; J. Kent; M.O. Humphreys; K.M. Elborough; R.C. Gardner

2006-01-01

A moderate-density linkage map for Lolium perenne L. has been constructed based on 376 simple sequence repeat (SSR) markers. Approximately one third ( 124) of the SSR markers were developed from GeneThresher libraries that preferentially select genomic DNA clones from the gene-rich unmethylated portion of the genome. The remaining SSR marker loci...
Dog leukocyte antigen class II-associated genetic risk testing for immune disorders of dogs: simplified approaches using Pug dog necrotizing meningoencephalitis as a model.

PubMed

Pedersen, Niels; Liu, Hongwei; Millon, Lee; Greer, Kimberly

2011-01-01

A significantly increased risk for a number of autoimmune and infectious diseases in purebred and mixed-breed dogs has been associated with certain alleles or allele combinations of the dog leukocyte antigen (DLA) class II complex containing the DRB1, DQA1, and DQB1 genes. The exact level of risk depends on the specific disease, the alleles in question, and whether alleles exist in a homozygous or heterozygous state. The gold standard for identifying high-risk alleles and their zygosity has involved direct sequencing of the exon 2 regions of each of the 3 genes. However, sequencing and identification of specific alleles at each of the 3 loci are relatively expensive and sequencing techniques are not ideal for additional parentage or identity determination. However, it is often possible to get the same information from sequencing only 1 gene given the small number of possible alleles at each locus in purebred dogs, extensive homozygosity, and tendency for disease-causing alleles at each of the 3 loci to be strongly linked to each other into haplotypes. Therefore, genetic testing in purebred dogs with immune diseases can be often simplified by sequencing alleles at 1 rather than 3 loci. Further simplification of genetic tests for canine immune diseases can be achieved by the use of alternative genetic markers in the DLA class II region that are also strongly linked with the disease genotype. These markers consist of either simple tandem repeats or single nucleotide polymorphisms that are also in strong linkage with specific DLA class II genotypes and/or haplotypes. The current study uses necrotizing meningoencephalitis of Pug dogs as a paradigm to assess simple alternative genetic tests for disease risk. It was possible to attain identical necrotizing meningoencephalitis risk assessments to 3-locus DLA class II sequencing by sequencing only the DQB1 gene, using 3 DLA class II-linked simple tandem repeat markers, or with a small single nucleotide polymorphism array designed to identify breed-specific DQB1 alleles.
Multiple-locus, variable number of tandem repeat analysis (MLVA) of the fish-pathogen Francisella noatunensis

PubMed Central

2011-01-01

Background Since Francisella noatunensis was first isolated from cultured Atlantic cod in 2004, it has emerged as a global fish pathogen causing disease in both warm and cold water species. Outbreaks of francisellosis occur in several important cultured fish species making a correct management of this disease a matter of major importance. Currently there are no vaccines or treatments available. A strain typing system for use in studies of F. noatunensis epizootics would be an important tool for disease management. However, the high genetic similarity within the Francisella spp. makes strain typing difficult, but such typing of the related human pathogen Francisella tullarensis has been performed successfully by targeting loci with higher genetic variation than the traditional signature sequences. These loci are known as Variable Numbers of Tandem Repeat (VNTR). The aim of this study is to identify possible useful VNTRs in the genome of F. noatunensis. Results Seven polymorphic VNTR loci were identified in the preliminary genome sequence of F. noatunensis ssp. noatunensis GM2212 isolate. These VNTR-loci were sequenced in F. noatunensis isolates collected from Atlantic cod (Gadus morhua) from Norway (n = 21), Three-line grunt (Parapristipoma trilineatum) from Japan (n = 1), Tilapia (Oreochromis spp.) from Indonesia (n = 3) and Atlantic salmon (Salmo salar) from Chile (n = 1). The Norwegian isolates presented in this study show both nine allelic profiles and clades, and that the majority of the farmed isolates belong in two clades only, while the allelic profiles from wild cod are unique. Conclusions VNTRs can be used to separate isolates belonging to both subspecies of F. noatunensis. Low allelic diversity in F. noatunensis isolates from outbreaks in cod culture compared to isolates wild cod, indicate that transmission of these isolates may be a result of human activity. The sequence based MLVA system presented in this study should provide a good starting point for further development of a genotyping system that can be used in studies of epizootics and disease management of francisellosis. PMID:21261955
Alu Sb2 subfamily is present in all higher primates but was most succesfully amplified in humans

DOE Office of Scientific and Technical Information (OSTI.GOV)

Richer, C.; Zietkiewicz, E.; Labuda, D.

Alu repeats can be classified into subfamilies which amplified in primate genomes at different evolutionary time periods. A young Alu subfamily, Sb2, with a characteristic 7-nucleotide duplication at position 256, has been described in seven human loci. An Sb2 insertion found near the HD gene was unique to two HD families, indicating that Sb2 was still retropositionally active. Here, we have shown that the Sb2 insertion in the CHOL locus was similarly rare, being absent in 120 individuals of Caucasian, Oriental and Black origin. In contrast, Sb2 inserts in five other loci were found fixed (non-polymorphic), based on measurements inmore » the same population sample, but absent from orthologous positions in higher apes. This suggest that Sb2 repeats spread relatively early in the human lineage following divergence from other primates and that these elements may be human-specific. By quantitative PCR, we investigated the presence of Sb2 sequences in different primate DNA, using one PCR primer anchored at the 5{prime} Alu-end and the other complementary to the duplicated Sb2-specific segment. With an Sb2-containing plasmid as a standard, we estimated the number of Sb2 repeats at 1500-1800 copies per human haploid equivalent; corresponding numbers in chimpanzee and gorilla were almost two orders of magnitude lower, while the signal observed in orangutan and gibbon DNAs was consistent with the presence of a single copy. The analysis of 22 human, 11 chimpanzee and 10 gorilla sequences indicates that the Alu Sb2 dispersed independently in these three primate lineages; gorilla consensus differs from the human Sb2 sequence by one position, while all chimpanzee repeats have their linker expanded by up to eight A-residues. Should they be thus considered as separate subfamilies? It is possible that sequence modifications with respect to the human consensus are responsible for poor retroposition of Sb2 in apes.« less
A cytological-physical map of 22q11

DOE Office of Scientific and Technical Information (OSTI.GOV)

Lindsay, E.A.; Rizzu, P.; Gaddini, L.

Our laboratory is involved in the construction of a cytological-physical map of 22q11 and isolation of expressed sequences from the region involved in DiGeorge syndrome (DGS) and Velo-Cardio-Facial syndrome (VCFS). One of the goals of the mapping is an understanding of the molecular mechanisms which generate the 22q11 microdeletions observed with high frequency in DGS and VCFS. Our of over 60 deleted patients studied in our laboratory, all but one were deleted for two loci approximately 1-2 Mb apart. There is evidence from patients with balanced and unbalanced translocations that deletion of the whole region is not necessary for determinationmore » of the clinical phenotype. Therefore, it is possible that deletion breakpoints occur as a consequence of structural characteristics of the DNA that predispose to rearrangements. A striking characteristic of the 22q11 region is the abundance of low copy repeat sequences. It is reasonable to think that recombination between these repeats may lead to microdeletions. However, a direct demonstration of such mechanism is not available yet. The presence of repeats makes standard physical mapping techniques based on hybridization or STS mapping often difficult to interpret. For example, we have found clones positive for the same STS that are located in different positions within 22q11. For this reason we have used high resolution cytological mapping as a supporting technique for map validation. We present the current status map which includes known polymorphic and non-polymorphic loci, newly isolated clones and chromosomal deletion breakpoints. The map extends from the loci D22S9/D22S24 to TOP1P2. Extended chromatin hybridization experiments visually demonstrate the presence of at least two repeat islands flanking (or at) the region where chromosomal breakpoints of the commonly deleted region occur.« less
Development of a Multiple-Locus Variable number of tandem repeat Analysis (MLVA) for Leptospira interrogans and its application to Leptospira interrogans serovar Australis isolates from Far North Queensland, Australia

PubMed Central

Slack, Andrew T; Dohnt, Michael F; Symonds, Meegan L; Smythe, Lee D

2005-01-01

Background Leptospirosis is a zoonotic disease caused by the genus, Leptospira. Leptospira interrogans is the most common genomospecies implicated in the disease. Epidemiological investigations are needed to distinguish outbreak situations or to trace reservoirs of the organisms. Current methodologies used for typing Leptospira have significant drawbacks. The development of an easy to perform yet high resolution method is needed for this organism. Methods In this study we have searched the available genomic sequence of L. interrogans serovar Copenhageni strain Fiocruz L1-130 for the presence of tandem repeats [1]. These repeats were evaluated against reference strains for diversity. Six loci were selected to create a Multiple Locus Variable Number of Tandem Repeats (VNTR) Analysis (MLVA) to explore the genetic diversity within L. interrogans serovar Australis clinical isolates from Far North Queensland. Results The 39 reference strains used for the development of the method displayed 39 distinct patterns. Diversity Indexes for the loci varied between 0.80 and 0.93 and the number of repeat units at each locus varied between less than one to 52 repeats. When the MLVA was applied to serovar Australis isolates three large clusters were distinguishable, each comprising various hosts including Rattus species, human and canines. Conclusion The MLVA described in this report, was easy to perform, analyse and was reproducible. The loci selected had high diversity allowing discrimination between serovars and also between strains within a serovar. This method provides a starting point on which improvements to the method and comparisons to other techniques can be made. PMID:15987533
The CRISPRdb database and tools to display CRISPRs and to generate dictionaries of spacers and repeats

PubMed Central

Grissa, Ibtissem; Vergnaud, Gilles; Pourcel, Christine

2007-01-01

Background In Archeae and Bacteria, the repeated elements called CRISPRs for "clustered regularly interspaced short palindromic repeats" are believed to participate in the defence against viruses. Short sequences called spacers are stored in-between repeated elements. In the current model, motifs comprising spacers and repeats may target an invading DNA and lead to its degradation through a proposed mechanism similar to RNA interference. Analysis of intra-species polymorphism shows that new motifs (one spacer and one repeated element) are added in a polarised fashion. Although their principal characteristics have been described, a lot remains to be discovered on the way CRISPRs are created and evolve. As new genome sequences become available it appears necessary to develop automated scanning tools to make available CRISPRs related information and to facilitate additional investigations. Description We have produced a program, CRISPRFinder, which identifies CRISPRs and extracts the repeated and unique sequences. Using this software, a database is constructed which is automatically updated monthly from newly released genome sequences. Additional tools were created to allow the alignment of flanking sequences in search for similarities between different loci and to build dictionaries of unique sequences. To date, almost six hundred CRISPRs have been identified in 475 published genomes. Two Archeae out of thirty-seven and about half of Bacteria do not possess a CRISPR. Fine analysis of repeated sequences strongly supports the current view that new motifs are added at one end of the CRISPR adjacent to the putative promoter. Conclusion It is hoped that availability of a public database, regularly updated and which can be queried on the web will help in further dissecting and understanding CRISPR structure and flanking sequences evolution. Subsequent analyses of the intra-species CRISPR polymorphism will be facilitated by CRISPRFinder and the dictionary creator. CRISPRdb is accessible at PMID:17521438
CRISPRDetect: A flexible algorithm to define CRISPR arrays.

PubMed

Biswas, Ambarish; Staals, Raymond H J; Morales, Sergio E; Fineran, Peter C; Brown, Chris M

2016-05-17

CRISPR (clustered regularly interspaced short palindromic repeats) RNAs provide the specificity for noncoding RNA-guided adaptive immune defence systems in prokaryotes. CRISPR arrays consist of repeat sequences separated by specific spacer sequences. CRISPR arrays have previously been identified in a large proportion of prokaryotic genomes. However, currently available detection algorithms do not utilise recently discovered features regarding CRISPR loci. We have developed a new approach to automatically detect, predict and interactively refine CRISPR arrays. It is available as a web program and command line from bioanalysis.otago.ac.nz/CRISPRDetect. CRISPRDetect discovers putative arrays, extends the array by detecting additional variant repeats, corrects the direction of arrays, refines the repeat/spacer boundaries, and annotates different types of sequence variations (e.g. insertion/deletion) in near identical repeats. Due to these features, CRISPRDetect has significant advantages when compared to existing identification tools. As well as further support for small medium and large repeats, CRISPRDetect identified a class of arrays with 'extra-large' repeats in bacteria (repeats 44-50 nt). The CRISPRDetect output is integrated with other analysis tools. Notably, the predicted spacers can be directly utilised by CRISPRTarget to predict targets. CRISPRDetect enables more accurate detection of arrays and spacers and its gff output is suitable for inclusion in genome annotation pipelines and visualisation. It has been used to analyse all complete bacterial and archaeal reference genomes.
CRISPR/Cas9-mediated knock-in of an optimized TetO repeat for live cell imaging of endogenous loci.

PubMed

Tasan, Ipek; Sustackova, Gabriela; Zhang, Liguo; Kim, Jiah; Sivaguru, Mayandi; HamediRad, Mohammad; Wang, Yuchuan; Genova, Justin; Ma, Jian; Belmont, Andrew S; Zhao, Huimin

2018-06-15

Nuclear organization has an important role in determining genome function; however, it is not clear how spatiotemporal organization of the genome relates to functionality. To elucidate this relationship, a method for tracking any locus of interest is desirable. Recently clustered regularly interspaced short palindromic repeats (CRISPR)/CRISPR-associated protein 9 (Cas9) or transcription activator-like effectors were adapted for imaging endogenous loci; however, they are mostly limited to visualization of repetitive regions. Here, we report an efficient and scalable method named SHACKTeR (Short Homology and CRISPR/Cas9-mediated Knock-in of a TetO Repeat) for live cell imaging of specific chromosomal regions without the need for a pre-existing repetitive sequence. SHACKTeR requires only two modifications to the genome: CRISPR/Cas9-mediated knock-in of an optimized TetO repeat and its visualization by TetR-EGFP expression. Our simplified knock-in protocol, utilizing short homology arms integrated by polymerase chain reaction, was successful at labeling 10 different loci in HCT116 cells. We also showed the feasibility of knock-in into lamina-associated, heterochromatin regions, demonstrating that these regions prefer non-homologous end joining for knock-in. Using SHACKTeR, we were able to observe DNA replication at a specific locus by long-term live cell imaging. We anticipate the general applicability and scalability of our method will enhance causative analyses between gene function and compartmentalization in a high-throughput manner.
CASFISH: CRISPR/Cas9-mediated in situ labeling of genomic loci in fixed cells.

PubMed

Deng, Wulan; Shi, Xinghua; Tjian, Robert; Lionnet, Timothée; Singer, Robert H

2015-09-22

Direct visualization of genomic loci in the 3D nucleus is important for understanding the spatial organization of the genome and its association with gene expression. Various DNA FISH methods have been developed in the past decades, all involving denaturing dsDNA and hybridizing fluorescent nucleic acid probes. Here we report a novel approach that uses in vitro constituted nuclease-deficient clustered regularly interspaced short palindromic repeats (CRISPR)/CRISPR-associated caspase 9 (Cas9) complexes as probes to label sequence-specific genomic loci fluorescently without global DNA denaturation (Cas9-mediated fluorescence in situ hybridization, CASFISH). Using fluorescently labeled nuclease-deficient Cas9 (dCas9) protein assembled with various single-guide RNA (sgRNA), we demonstrated rapid and robust labeling of repetitive DNA elements in pericentromere, centromere, G-rich telomere, and coding gene loci. Assembling dCas9 with an array of sgRNAs tiling arbitrary target loci, we were able to visualize nonrepetitive genomic sequences. The dCas9/sgRNA binary complex is stable and binds its target DNA with high affinity, allowing sequential or simultaneous probing of multiple targets. CASFISH assays using differently colored dCas9/sgRNA complexes allow multicolor labeling of target loci in cells. In addition, the CASFISH assay is remarkably rapid under optimal conditions and is applicable for detection in primary tissue sections. This rapid, robust, less disruptive, and cost-effective technology adds a valuable tool for basic research and genetic diagnosis.
Markers and mapping revisited: finding your gene.

PubMed

Jones, Neil; Ougham, Helen; Thomas, Howard; Pasakinskiene, Izolda

2009-01-01

This paper is an update of our earlier review (Jones et al., 1997, Markers and mapping: we are all geneticists now. New Phytologist 137: 165-177), which dealt with the genetics of mapping, in terms of recombination as the basis of the procedure, and covered some of the first generation of markers, including restriction fragment length polymorphisms (RFLPs), random amplified polymorphic DNA (RAPDs), simple sequence repeats (SSRs) and quantitative trait loci (QTLs). In the intervening decade there have been numerous developments in marker science with many new systems becoming available, which are herein described: cleavage amplification polymorphism (CAP), sequence-specific amplification polymorphism (S-SAP), inter-simple sequence repeat (ISSR), sequence tagged site (STS), sequence characterized amplification region (SCAR), selective amplification of microsatellite polymorphic loci (SAMPL), single nucleotide polymorphism (SNP), expressed sequence tag (EST), sequence-related amplified polymorphism (SRAP), target region amplification polymorphism (TRAP), microarrays, diversity arrays technology (DArT), single-strand conformation polymorphism (SSCP), denaturing gradient gel electrophoresis (DGGE), temperature gradient gel electrophoresis (TGGE) and methylation-sensitive PCR. In addition there has been an explosion of knowledge and databases in the area of genomics and bioinformatics. The number of flowering plant ESTs is c. 19 million and counting, with all the opportunity that this provides for gene-hunting, while the survey of bioinformatics and computer resources points to a rapid growth point for future activities in unravelling and applying the burst of new information on plant genomes. A case study is presented on tracking down a specific gene (stay-green (SGR), a post-transcriptional senescence regulator) using the full suite of mapping tools and comparative mapping resources. We end with a brief speculation on how genome analysis may progress into the future of this highly dynamic arena of plant science.
Filipino DNA variation at 12 X-chromosome short tandem repeat markers.

PubMed

Salvador, Jazelyn M; Apaga, Dame Loveliness T; Delfin, Frederick C; Calacal, Gayvelline C; Dennis, Sheila Estacio; De Ungria, Maria Corazon A

2018-06-08

Demands for solving complex kinship scenarios where only distant relatives are available for testing have risen in the past years. In these instances, other genetic markers such as X-chromosome short tandem repeat (X-STR) markers are employed to supplement autosomal and Y-chromosomal STR DNA typing. However, prior to use, the degree of STR polymorphism in the population requires evaluation through generation of an allele or haplotype frequency population database. This population database is also used for statistical evaluation of DNA typing results. Here, we report X-STR data from 143 unrelated Filipino male individuals who were genotyped via conventional polymerase chain reaction-capillary electrophoresis (PCR-CE) using the 12 X-STR loci included in the Investigator ® Argus X-12 kit (Qiagen) and via massively parallel sequencing (MPS) of seven X-STR loci included in the ForenSeq ™ DNA Signature Prep kit of the MiSeq ® FGx ™ Forensic Genomics System (Illumina). Allele calls between PCR-CE and MPS systems were consistent (100% concordance) across seven overlapping X-STRs. Allele and haplotype frequencies and other parameters of forensic interest were calculated based on length (PCR-CE, 12 X-STRs) and sequence (MPS, seven X-STRs) variations observed in the population. Results of our study indicate that the 12 X-STRs in the PCR-CE system are highly informative for the Filipino population. MPS of seven X-STR loci identified 73 X-STR alleles compared with 55 X-STR alleles that were identified solely by length via PCR-CE. Of the 73 sequence-based alleles observed, six alleles have not been reported in the literature. The population data presented here may serve as a reference Philippine frequency database of X-STRs for forensic casework applications. Copyright © 2018 Elsevier B.V. All rights reserved.
High-utility conserved avian microsatellite markers enable parentage and population studies across a wide range of species

PubMed Central

2013-01-01

Background Microsatellites are widely used for many genetic studies. In contrast to single nucleotide polymorphism (SNP) and genotyping-by-sequencing methods, they are readily typed in samples of low DNA quality/concentration (e.g. museum/non-invasive samples), and enable the quick, cheap identification of species, hybrids, clones and ploidy. Microsatellites also have the highest cross-species utility of all types of markers used for genotyping, but, despite this, when isolated from a single species, only a relatively small proportion will be of utility. Marker development of any type requires skill and time. The availability of sufficient “off-the-shelf” markers that are suitable for genotyping a wide range of species would not only save resources but also uniquely enable new comparisons of diversity among taxa at the same set of loci. No other marker types are capable of enabling this. We therefore developed a set of avian microsatellite markers with enhanced cross-species utility. Results We selected highly-conserved sequences with a high number of repeat units in both of two genetically distant species. Twenty-four primer sets were designed from homologous sequences that possessed at least eight repeat units in both the zebra finch (Taeniopygia guttata) and chicken (Gallus gallus). Each primer sequence was a complete match to zebra finch and, after accounting for degenerate bases, at least 86% similar to chicken. We assessed primer-set utility by genotyping individuals belonging to eight passerine and four non-passerine species. The majority of the new Conserved Avian Microsatellite (CAM) markers amplified in all 12 species tested (on average, 94% in passerines and 95% in non-passerines). This new marker set is of especially high utility in passerines, with a mean 68% of loci polymorphic per species, compared with 42% in non-passerine species. Conclusions When combined with previously described conserved loci, this new set of conserved markers will not only reduce the necessity and expense of microsatellite isolation for a wide range of genetic studies, including avian parentage and population analyses, but will also now enable comparisons of genetic diversity among different species (and populations) at the same set of loci, with no or reduced bias. Finally, the approach used here can be applied to other taxa in which appropriate genome sequences are available. PMID:23497230

Two DNA-binding factors recognize specific sequences at silencers, upstream activating sequences, autonomously replicating sequences, and telomeres in Saccharomyces cerevisiae

DOE Office of Scientific and Technical Information (OSTI.GOV)

Buchman, A.R.; Kimmerly, W.J.; Rine, J.

1988-01-01

Two DNA-binding factors from Saccharomyces cerevisiae have been characterized, GRFI (general regulatory factor I) and ABFI (ARS-binding factor I), that recognize specific sequences within diverse genetic elements. GRFI bound to sequences at the negative regulatory elements (silencers) of the silent mating type loci HML E and HMR E and to the upstream activating sequence (UAS) required for transcription of the MAT ..cap alpha.. genes. A putative conserved UAS located at genes involved in translation (RPG box) was also recognized by GRFI. In addition, GRFI bound with high affinity to sequences within the (C/sub 1-3/A)-repeat region at yeast telomeres. Binding sitesmore » for GRFI with the highest affinity appeared to be of the form 5'-(A/G)(A/C)ACCCAN NCA(T/C)(T/C)-3', where N is any nucleotide. ABFI-binding sites were located next to autonomously replicating sequences (ARSs) at controlling elements of the silent mating type loci HMR E, HMR I, and HML I and were associated with ARS1, ARS2, and the 2..mu..m plasmid ARS. Two tandem ABFI binding sites were found between the HIS3 and DED1 genes, several kilobase pairs from any ARS, indicating that ABFI-binding sites are not restricted to ARSs. The sequences recognized by AFBI showed partial dyad-symmetry and appeared to be variations of the consensus 5'-TATCATTNNNNACGA-3'. GRFI and ABFI were both abundant DNA-binding factors and did not appear to be encoded by the SIR genes, whose product are required for repression of the silent mating type loci. Together, these results indicate that both GRFI and ABFI play multiple roles within the cell.« less
Development and characterization of novel EST-SSR markers and their application for genetic diversity analysis of Jerusalem artichoke (Helianthus tuberosus L.).

PubMed

Mornkham, T; Wangsomnuk, P P; Mo, X C; Francisco, F O; Gao, L Z; Kurzweil, H

2016-10-24

Jerusalem artichoke (Helianthus tuberosus L.) is a perennial tuberous plant and a traditional inulin-rich crop in Thailand. It has become the most important source of inulin and has great potential for use in chemical and food industries. In this study, expressed sequence tag (EST)-based simple sequence repeat (SSR) markers were developed from 40,362 Jerusalem artichoke ESTs retrieved from the NCBI database. Among 23,691 non-redundant identified ESTs, 1949 SSR motifs harboring 2 to 6 nucleotides with varied repeat motifs were discovered from 1676 assembled sequences. Seventy-nine primer pairs were generated from EST sequences harboring SSR motifs. Our results show that 43 primers are polymorphic for the six studied populations, while the remaining 36 were either monomorphic or failed to amplify. These 43 SSR loci exhibited a high level of genetic diversity among populations, with allele numbers varying from 2 to 7, with an average of 3.95 alleles per loci. Heterozygosity ranged from 0.096 to 0.774, with an average of 0.536; polymorphic index content ranged from 0.096 to 0.854, with an average of 0.568. Principal component analysis and neighbor-joining analysis revealed that the six populations could be divided into six clusters. Our results indicate that these newly characterized EST-SSR markers may be useful in the exploration of genetic diversity and range expansion of the Jerusalem artichoke, and in cross-species application for the genus Helianthus.
Characterization of arrangement and expression of the beta-2 microglobulin locus in the sandbar and nurse shark.

PubMed

Chen, Hao; Kshirsagar, Sarika; Jensen, Ingvill; Lau, Kevin; Simonson, Caitlin; Schluter, Samuel F

2010-02-01

Beta 2 microglobulin (beta2m) is an essential subunit of major histocompatibility complex (MHC) type I molecules. In this report, beta2m cDNAs were identified and sequenced from sandbar shark spleen cDNA library. Sandbar shark beta2m gene encodes one amino acid less than most teleost beta2m genes, and 3 amino acids less than mammal beta2m genes. Although sandbar shark beta2m protein contains one beta sheet less than that of human in the predicted protein structure, the overall structure of beta2m proteins is conserved during evolution. Germline gene for the beta2m in sandbar and nurse shark is present as a single locus. It contains three exons and two introns. CpG sites are evenly distributed in the shark beta2m loci. Several DNA repeat elements were also identified in the shark beta2m loci. Sequence analysis suggests that the beta2m locus is not linked to the MHC I loci in the shark genome.
[Observation and analysis on mutation of routine STR locus].

PubMed

Li, Qiu-yang; Feng, Wei-jun; Yang, Qin-gen

2005-05-01

To observe and analyze the characteristic of mutation at STR locus. 27 mutant genes observed in 1211 paternity testing cases were checked by PAGE-silver stained and PowerPlex 16 System Kit and validated by sequencing. Mutant genes locate on 15 loci. The pattern of mutation was accord with stepwise mutation model. The mutation ratio of male-to-female was 8:1 and correlated to the age of father. Mutation rate is correlated to the geometric mean of the number of homogeneous repeats of locus. The higher the mean, the higher the mutation rate. These loci are not so appropriate for use in paternity testing.
Satellite DNA in Plants: More than Just Rubbish.

PubMed

Garrido-Ramos, Manuel A

2015-01-01

For decades, satellite DNAs have been the hidden part of genomes. Initially considered as junk DNA, there is currently an increasing appreciation of the functional significance of satellite DNA repeats and of their sequences. Satellite DNA families accumulate in the heterochromatin in different parts of the eukaryotic chromosomes, mainly in pericentromeric and subtelomeric regions, but they also span the functional centromere. Tandem repeat sequences may spread from subtelomeric to interstitial loci, leading to the formation of chromosome-specific loci or to the accumulation in equilocal sites in different chromosomes. They also appear as the main components of the heterochromatin in the sex-specific region of sex chromosomes. Satellite DNA, required for chromosome organization, also plays a role in pairing and segregation. Some satellite repeats are transcribed and can participate in the formation and maintenance of heterochromatin structure and in the modulation of gene expression. In addition to the identification of the different satellite DNA families, their characteristics and location, we are interested in determining their impact on the genomes, by identifying the mechanisms leading to their appearance and amplification as well as in understanding how they change over time, the factors affecting these changes, and the influence exerted by the evolutionary history of the organisms. On the other hand, satellite DNA sequences are rapidly evolving sequences that may cause reproductive barriers between organisms and promote speciation. The accumulation of experimental data collected in recent years and the emergence of new approaches based on next-generation sequencing and high-throughput genome analysis are opening new perspectives that are changing our understanding of satellite DNA. This review examines recent data to provide a timely update on the overall information gathered about this part of the genome, focusing on the advances in the knowledge of its origin, its evolution, and its potential functional roles. © 2015 S. Karger AG, Basel.
Discovery of Escherichia coli CRISPR sequences in an undergraduate laboratory.

PubMed

Militello, Kevin T; Lazatin, Justine C

2017-05-01

Clustered regularly interspaced short palindromic repeats (CRISPRs) represent a novel type of adaptive immune system found in eubacteria and archaebacteria. CRISPRs have recently generated a lot of attention due to their unique ability to catalog foreign nucleic acids, their ability to destroy foreign nucleic acids in a mechanism that shares some similarity to RNA interference, and the ability to utilize reconstituted CRISPR systems for genome editing in numerous organisms. In order to introduce CRISPR biology into an undergraduate upper-level laboratory, a five-week set of exercises was designed to allow students to examine the CRISPR status of uncharacterized Escherichia coli strains and to allow the discovery of new repeats and spacers. Students started the project by isolating genomic DNA from E. coli and amplifying the iap CRISPR locus using the polymerase chain reaction (PCR). The PCR products were analyzed by Sanger DNA sequencing, and the sequences were examined for the presence of CRISPR repeat sequences. The regions between the repeats, the spacers, were extracted and analyzed with BLASTN searches. Overall, CRISPR loci were sequenced from several previously uncharacterized E. coli strains and one E. coli K-12 strain. Sanger DNA sequencing resulted in the discovery of 36 spacer sequences and their corresponding surrounding repeat sequences. Five of the spacers were homologous to foreign (non-E. coli) DNA. Assessment of the laboratory indicates that improvements were made in the ability of students to answer questions relating to the structure and function of CRISPRs. Future directions of the laboratory are presented and discussed. © 2016 by The International Union of Biochemistry and Molecular Biology, 45(3):262-269, 2017. © 2016 The International Union of Biochemistry and Molecular Biology.
Isolation of human simple repeat loci by hybridization selection.

PubMed

Armour, J A; Neumann, R; Gobert, S; Jeffreys, A J

1994-04-01

We have isolated short tandem repeat arrays from the human genome, using a rapid method involving filter hybridization to enrich for tri- or tetranucleotide tandem repeats. About 30% of clones from the enriched library cross-hybridize with probes containing trimeric or tetrameric tandem arrays, facilitating the rapid isolation of large numbers of clones. In an initial analysis of 54 clones, 46 different tandem arrays were identified. Analysis of these tandem repeat loci by PCR showed that 24 were polymorphic in length; substantially higher levels of polymorphism were displayed by the tetrameric repeat loci isolated than by the trimeric repeats. Primary mapping of these loci by linkage analysis showed that they derive from 17 chromosomes, including the X chromosome. We anticipate the use of this strategy for the efficient isolation of tandem repeats from other sources of genomic DNA, including DNA from flow-sorted chromosomes, and from other species.
The Danish STR sequence database: duplicate typing of 363 Danes with the ForenSeq™ DNA Signature Prep Kit.

PubMed

Hussing, C; Bytyci, R; Huber, C; Morling, N; Børsting, C

2018-05-24

Some STR loci have internal sequence variations, which are not revealed by the standard STR typing methods used in forensic genetics (PCR and fragment length analysis by capillary electrophoresis (CE)). Typing of STRs with next-generation sequencing (NGS) uncovers the sequence variation in the repeat region and in the flanking regions. In this study, 363 Danish individuals were typed for 56 STRs (26 autosomal STRs, 24 Y-STRs, and 6 X-STRs) using the ForenSeq™ DNA Signature Prep Kit to establish a Danish STR sequence database. Increased allelic diversity was observed in 34 STRs by the PCR-NGS assay. The largest increases were found in DYS389II and D12S391, where the numbers of sequenced alleles were around four times larger than the numbers of alleles determined by repeat length alone. Thirteen SNPs and one InDel were identified in the flanking regions of 12 STRs. Furthermore, 36 single positions and five longer stretches in the STR flanking regions were found to have dubious genotyping quality. The combined match probability of the 26 autosomal STRs was 10,000 times larger using the PCR-NGS assay than by using PCR-CE. The typical paternity indices for trios and duos were 500 and 100 times larger, respectively, than those obtained with PCR-CE. The assay also amplified 94 SNPs selected for human identification. Eleven of these loci were not in Hardy-Weinberg equilibrium in the Danish population, most likely because the minimum threshold for allele calling (30 reads) in the ForenSeq™ Universal Analysis Software was too low and frequent allele dropouts were not detected.
Multilocus Sex Determination Revealed in Two Populations of Gynodioecious Wild Strawberry, Fragaria vesca subsp. bracteata

PubMed Central

Ashman, Tia-Lynn; Tennessen, Jacob A.; Dalton, Rebecca M.; Govindarajulu, Rajanikanth; Koski, Matthew H.; Liston, Aaron

2015-01-01

Gynodioecy, the coexistence of females and hermaphrodites, occurs in 20% of angiosperm families and often enables transitions between hermaphroditism and dioecy. Clarifying mechanisms of sex determination in gynodioecious species can thus illuminate sexual system evolution. Genetic determination of gynodioecy, however, can be complex and is not fully characterized in any wild species. We used targeted sequence capture to genetically map a novel nuclear contributor to male sterility in a self-pollinated hermaphrodite of Fragaria vesca subsp. bracteata from the southern portion of its range. To understand its interaction with another identified locus and possibly additional loci, we performed crosses within and between two populations separated by 2000 km, phenotyped the progeny and sequenced candidate markers at both sex-determining loci. The newly mapped locus contains a high density of pentatricopeptide repeat genes, a class commonly involved in restoration of fertility caused by cytoplasmic male sterility. Examination of all crosses revealed three unlinked epistatically interacting loci that determine sexual phenotype and vary in frequency between populations. Fragaria vesca subsp. bracteata represents the first wild gynodioecious species with genomic evidence of both cytoplasmic and nuclear genes in sex determination. We propose a model for the interactions between these loci and new hypotheses for the evolution of sex determining chromosomes in the subdioecious and dioecious Fragaria. PMID:26483011
[Detection of CRISPR and its relationship to drug resistance in Shigella].

PubMed

Wang, Linlin; Wang, Yingfang; Duan, Guangcai; Xue, Zerun; Guo, Xiangjiao; Wang, Pengfei; Xi, Yuanlin; Yang, Haiyan

2015-04-04

To detect clustered regularly interspaced short palindromic repeats (CRISPR) in Shigella, and to analyze its relationship to drug resistance. Four pairs of primers were used for the detection of convincing CRISPR structures CRISPR-S2 and CRISPR-S4, questionable CRISPR structures CRISPR-S1 and CRISPR-S3 in 60 Shigella strains. All primers were designed using sequences in CRISPR database. CRISPR Finder was used to analyze CRISPR and susceptibilities of Shigella strains were tested by agar diffusion method. Furthermore, we analyzed the relationship between drug resistance and CRISPR-S4. The positive rate of convincing CRISPR structures was 95%. The four CRISPR loci formed 12 spectral patterns (A-L), all of which contained convincing CRISPR structures except type K. We found one new repeat and 12 new spacers. The multi-drug resistance rate was 53. 33% . We found no significant difference between CRISPR-S4 and drug resistant. However, the repeat sequence of CRISPR-S4 in multi- or TE-resistance strains was mainly R4.1 with AC deletions in the 3' end, and the spacer sequences of CRISPR-S4 in multi-drug resistance strains were mainly Sp5.1, Sp6.1 and Sp7. CRISPR was common in Shigella. Variations df repeat sequences and diversities of spacer sequences might be related to drug resistance in Shigella.
Alu repeats: A source for the genesis of primate microsatellites

DOE Office of Scientific and Technical Information (OSTI.GOV)

Arcot, S.S.; Batzer, M.A.; Wang, Zhenyuan

1995-09-01

As a result of their abundance, relatively uniform distribution, and high degree of polymorphism, microsatellites and minisatellites have become valuable tools in genetic mapping, forensic identity testing, and population studies. In recent years, a number of microsatellite repeats have been found to be associated with Alu interspersed repeated DNA elements. The association of an Alu element with a microsatellite repeat could result from the integration of an Alu element within a preexisting microsatellite repeat. Alternatively, Alu elements could have a direct role in the origin of microsatellite repeats. Errors introduced during reverse transcription of the primary transcript derived from anmore » Alu {open_quotes}master{close_quote} gene or the accumulation of random mutations in the middle A-rich regions and oligo(dA)-rich tails of Alu elements after insertion and subsequent expansion and contraction of these sequences could result in the genesis of a microsatellite repeat. We have tested these hypotheses by a direct evolutionary comparison of the sequences of some recent Alu elements that are found only in humans and are absent from nonhuman primates, as well as some older Alu elements that are present at orthologous positions in a number of nonhuman primates. The origin of {open_quotes}young{close_quotes} Alu insertions, absence of sequences that resemble microsatellite repeats at the orthologous loci in chimpanzees, and the gradual expansion of microsatellite repeats in some old Alu repeats at orthologous positions within the genomes of a number of nonhuman primates suggest that Alu elements are a source for the genesis of primate microsatellite repeats. 48 refs., 5 figs., 3 tabs.« less
Ribosomal protein S14 transcripts are edited in Oenothera mitochondria.

PubMed Central

Schuster, W; Unseld, M; Wissinger, B; Brennicke, A

1990-01-01

The gene encoding ribosomal protein S14 (rps14) in Oenothera mitochondria is located upstream of the cytochrome b gene (cob). Sequence analysis of independently derived cDNA clones covering the entire rps14 coding region shows two nucleotides edited from the genomic DNA to the mRNA derived sequences by C to U modifications. A third editing event occurs four nucleotides upstream of the AUG initiation codon and improves a potential ribosome binding site. A CGG codon specifying arginine in a position conserved in evolution between chloroplasts and E. coli as a UGG tryptophan codon is not edited in any of the cDNAs analysed. An inverted repeat 3' of an unidentified open reading frame is located upstream of the rps14 gene. The inverted repeat sequence is highly conserved at analogous regions in other Oenothera mitochondrial loci. Images PMID:2326162
Development of microsatellite markers for Anadenanthera colubrina (Leguminosae), a neotropical tree species.

PubMed

Feres, Juliana Massimino; Monteiro, Mariza; Zucchi, Maria I; Pinheiro, José B; Mestriner, Moacyr A; Alzate-Marin, Ana Lilia

2012-04-01

We developed and characterized nuclear microsatellite markers for Anadenanthera colubrina, a tropical tree species widely distributed in South America. Leaf samples of mature A. colubrina trees, popularly called "angico," were collected from an area that is greatly impacted by agricultural practices in the region of Ribeirão Preto in São Paulo State in southeastern Brazil. Twenty simple sequence repeat (SSR) markers were developed, 14 of which had polymorphic loci. A total of 96 alleles were detected with an average of 6.86 alleles per polymorphic locus. The expected heterozygosity, calculated at polymorphic loci, ranged from 0.18 to 0.83. Finally, we demonstrated that 18 loci were cross-amplified in A. peregrina. A total of 14 polymorphic markers suggest a high potential for genetic diversity, gene flow, and mating system analyses in A. colubrina.
Construction of a reference genetic linkage map for carnation (Dianthus caryophyllus L.)

PubMed Central

2013-01-01

Background Genetic linkage maps are important tools for many genetic applications including mapping of quantitative trait loci (QTLs), identifying DNA markers for fingerprinting, and map-based gene cloning. Carnation (Dianthus caryophyllus L.) is an important ornamental flower worldwide. We previously reported a random amplified polymorphic DNA (RAPD)-based genetic linkage map derived from Dianthus capitatus ssp. andrezejowskianus and a simple sequence repeat (SSR)-based genetic linkage map constructed using data from intraspecific F2 populations; however, the number of markers was insufficient, and so the number of linkage groups (LGs) did not coincide with the number of chromosomes (x = 15). Therefore, we aimed to produce a high-density genetic map to improve its usefulness for breeding purposes and genetic research. Results We improved the SSR-based genetic linkage map using SSR markers derived from a genomic library, expression sequence tags, and RNA-seq data. Linkage analysis revealed that 412 SSR loci (including 234 newly developed SSR loci) could be mapped to 17 linkage groups (LGs) covering 969.6 cM. Comparison of five minor LGs covering less than 50 cM with LGs in our previous RAPD-based genetic map suggested that four LGs could be integrated into two LGs by anchoring common SSR loci. Consequently, the number of LGs corresponded to the number of chromosomes (x = 15). We added 192 new SSRs, eight RAPD, and two sequence-tagged site loci to refine the RAPD-based genetic linkage map, which comprised 15 LGs consisting of 348 loci covering 978.3 cM. The two maps had 125 SSR loci in common, and most of the positions of markers were conserved between them. We identified 635 loci in carnation using the two linkage maps. We also mapped QTLs for two traits (bacterial wilt resistance and anthocyanin pigmentation in the flower) and a phenotypic locus for flower-type by analyzing previously reported genotype and phenotype data. Conclusions The improved genetic linkage maps and SSR markers developed in this study will serve as reference genetic linkage maps for members of the genus Dianthus, including carnation, and will be useful for mapping QTLs associated with various traits, and for improving carnation breeding programs. PMID:24160306
Construction of a reference genetic linkage map for carnation (Dianthus caryophyllus L.).

PubMed

Yagi, Masafumi; Yamamoto, Toshiya; Isobe, Sachiko; Hirakawa, Hideki; Tabata, Satoshi; Tanase, Koji; Yamaguchi, Hiroyasu; Onozaki, Takashi

2013-10-26

Genetic linkage maps are important tools for many genetic applications including mapping of quantitative trait loci (QTLs), identifying DNA markers for fingerprinting, and map-based gene cloning. Carnation (Dianthus caryophyllus L.) is an important ornamental flower worldwide. We previously reported a random amplified polymorphic DNA (RAPD)-based genetic linkage map derived from Dianthus capitatus ssp. andrezejowskianus and a simple sequence repeat (SSR)-based genetic linkage map constructed using data from intraspecific F2 populations; however, the number of markers was insufficient, and so the number of linkage groups (LGs) did not coincide with the number of chromosomes (x = 15). Therefore, we aimed to produce a high-density genetic map to improve its usefulness for breeding purposes and genetic research. We improved the SSR-based genetic linkage map using SSR markers derived from a genomic library, expression sequence tags, and RNA-seq data. Linkage analysis revealed that 412 SSR loci (including 234 newly developed SSR loci) could be mapped to 17 linkage groups (LGs) covering 969.6 cM. Comparison of five minor LGs covering less than 50 cM with LGs in our previous RAPD-based genetic map suggested that four LGs could be integrated into two LGs by anchoring common SSR loci. Consequently, the number of LGs corresponded to the number of chromosomes (x = 15). We added 192 new SSRs, eight RAPD, and two sequence-tagged site loci to refine the RAPD-based genetic linkage map, which comprised 15 LGs consisting of 348 loci covering 978.3 cM. The two maps had 125 SSR loci in common, and most of the positions of markers were conserved between them. We identified 635 loci in carnation using the two linkage maps. We also mapped QTLs for two traits (bacterial wilt resistance and anthocyanin pigmentation in the flower) and a phenotypic locus for flower-type by analyzing previously reported genotype and phenotype data. The improved genetic linkage maps and SSR markers developed in this study will serve as reference genetic linkage maps for members of the genus Dianthus, including carnation, and will be useful for mapping QTLs associated with various traits, and for improving carnation breeding programs.
Comprehensive Analysis of Human Endogenous Retrovirus Group HERV-W Locus Transcription in Multiple Sclerosis Brain Lesions by High-Throughput Amplicon Sequencing

PubMed Central

Schmitt, Katja; Richter, Christin; Backes, Christina; Meese, Eckart; Ruprecht, Klemens

2013-01-01

Human endogenous retroviruses (HERVs) of the HERV-W group comprise hundreds of loci in the human genome. Deregulated HERV-W expression and HERV-W locus ERVWE1-encoded Syncytin-1 protein have been implicated in the pathogenesis of multiple sclerosis (MS). However, the actual transcription of HERV-W loci in the MS context has not been comprehensively analyzed. We investigated transcription of HERV-W in MS brain lesions and white matter brain tissue from healthy controls by employing next-generation amplicon sequencing of HERV-W env-specific reverse transcriptase (RT) PCR products, thus revealing transcribed HERV-W loci and the relative transcript levels of those loci. We identified more than 100 HERV-W loci that were transcribed in the human brain, with a limited number of loci being predominantly transcribed. Importantly, relative transcript levels of HERV-W loci were very similar between MS and healthy brain tissue samples, refuting deregulated transcription of HERV-W env in MS brain lesions, including the high-level-transcribed ERVWE1 locus encoding Syncytin-1. Quantitative RT-PCR likewise did not reveal differences in MS regarding HERV-W env general transcript or ERVWE1- and ERVWE2-specific transcript levels. However, we obtained evidence for interindividual differences in HERV-W transcript levels. Reporter gene assays indicated promoter activity of many HERV-W long terminal repeats (LTRs), including structurally incomplete LTRs. Our comprehensive analysis of HERV-W transcription in the human brain thus provides important information on the biology of HERV-W in MS lesions and normal human brain, implications for study design, and mechanisms by which HERV-W may (or may not) be involved in MS. PMID:24109235
Development and Characterization of Microsatellite Markers for the Cape Gooseberry Physalis peruviana

PubMed Central

Simbaqueba, Jaime; Sánchez, Pilar; Sanchez, Erika; Núñez Zarantes, Victor Manuel; Chacon, Maria Isabel; Barrero, Luz Stella; Mariño-Ramírez, Leonardo

2011-01-01

Physalis peruviana, commonly known as Cape gooseberry, is an Andean Solanaceae fruit with high nutritional value and interesting medicinal properties. In the present study we report the development and characterization of microsatellite loci from a P. peruviana commercial Colombian genotype. We identified 932 imperfect and 201 perfect Simple Sequence Repeats (SSR) loci in untranslated regions (UTRs) and 304 imperfect and 83 perfect SSR loci in coding regions from the assembled Physalis peruviana leaf transcriptome. The UTR SSR loci were used for the development of 162 primers for amplification. The efficiency of these primers was tested via PCR in a panel of seven P. peruviana accessions including Colombia, Kenya and Ecuador ecotypes and one closely related species Physalis floridana. We obtained an amplification rate of 83% and a polymorphic rate of 22%. Here we report the first P. peruviana specific microsatellite set, a valuable tool for a wide variety of applications, including functional diversity, conservation and improvement of the species. PMID:22039540
Development and characterization of microsatellite markers for the Cape gooseberry Physalis peruviana.

PubMed

Simbaqueba, Jaime; Sánchez, Pilar; Sanchez, Erika; Núñez Zarantes, Victor Manuel; Chacon, Maria Isabel; Barrero, Luz Stella; Mariño-Ramírez, Leonardo

2011-01-01

Physalis peruviana, commonly known as Cape gooseberry, is an Andean Solanaceae fruit with high nutritional value and interesting medicinal properties. In the present study we report the development and characterization of microsatellite loci from a P. peruviana commercial Colombian genotype. We identified 932 imperfect and 201 perfect Simple Sequence Repeats (SSR) loci in untranslated regions (UTRs) and 304 imperfect and 83 perfect SSR loci in coding regions from the assembled Physalis peruviana leaf transcriptome. The UTR SSR loci were used for the development of 162 primers for amplification. The efficiency of these primers was tested via PCR in a panel of seven P. peruviana accessions including Colombia, Kenya and Ecuador ecotypes and one closely related species Physalis floridana. We obtained an amplification rate of 83% and a polymorphic rate of 22%. Here we report the first P. peruviana specific microsatellite set, a valuable tool for a wide variety of applications, including functional diversity, conservation and improvement of the species.
My-Forensic-Loci-queries (MyFLq) framework for analysis of forensic STR data generated by massive parallel sequencing.

PubMed

Van Neste, Christophe; Vandewoestyne, Mado; Van Criekinge, Wim; Deforce, Dieter; Van Nieuwerburgh, Filip

2014-03-01

Forensic scientists are currently investigating how to transition from capillary electrophoresis (CE) to massive parallel sequencing (MPS) for analysis of forensic DNA profiles. MPS offers several advantages over CE such as virtually unlimited multiplexy of loci, combining both short tandem repeat (STR) and single nucleotide polymorphism (SNP) loci, small amplicons without constraints of size separation, more discrimination power, deep mixture resolution and sample multiplexing. We present our bioinformatic framework My-Forensic-Loci-queries (MyFLq) for analysis of MPS forensic data. For allele calling, the framework uses a MySQL reference allele database with automatically determined regions of interest (ROIs) by a generic maximal flanking algorithm which makes it possible to use any STR or SNP forensic locus. Python scripts were designed to automatically make allele calls starting from raw MPS data. We also present a method to assess the usefulness and overall performance of a forensic locus with respect to MPS, as well as methods to estimate whether an unknown allele, which sequence is not present in the MySQL database, is in fact a new allele or a sequencing error. The MyFLq framework was applied to an Illumina MiSeq dataset of a forensic Illumina amplicon library, generated from multilocus STR polymerase chain reaction (PCR) on both single contributor samples and multiple person DNA mixtures. Although the multilocus PCR was not yet optimized for MPS in terms of amplicon length or locus selection, the results show excellent results for most loci. The results show a high signal-to-noise ratio, correct allele calls, and a low limit of detection for minor DNA contributors in mixed DNA samples. Technically, forensic MPS affords great promise for routine implementation in forensic genomics. The method is also applicable to adjacent disciplines such as molecular autopsy in legal medicine and in mitochondrial DNA research. Copyright © 2013 The Authors. Published by Elsevier Ireland Ltd.. All rights reserved.
Evolution of short inverted repeat in cupressophytes, transfer of accD to nucleus in Sciadopitys verticillata and phylogenetic position of Sciadopityaceae.

PubMed

Li, Jia; Gao, Lei; Chen, Shanshan; Tao, Ke; Su, Yingjuan; Wang, Ting

2016-02-11

Sciadopitys verticillata is an evergreen conifer and an economically valuable tree used in construction, which is the only member of the family Sciadopityaceae. Acquisition of the S. verticillata chloroplast (cp) genome will be useful for understanding the evolutionary mechanism of conifers and phylogenetic relationships among gymnosperm. In this study, we have first reported the complete chloroplast genome of S. verticillata. The total genome is 138,284 bp in length, consisting of 118 unique genes. The S. verticillata cp genome has lost one copy of the canonical inverted repeats and shown distinctive genomic structure comparing with other cupressophytes. Fifty-three simple sequence repeat loci and 18 forward tandem repeats were identified in the S. verticillata cp genome. According to the rearrangement of cupressophyte cp genome, we proposed one mechanism for the formation of inverted repeat: tandem repeat occured first, then rearrangement divided the tandem repeat into inverted repeats located at different regions. Phylogenetic estimates inferred from 59-gene sequences and cpDNA organizations have both shown that S. verticillata was sister to the clade consisting of Cupressaceae, Taxaceae, and Cephalotaxaceae. Moreover, accD gene was found to be lost in the S. verticillata cp genome, and a nucleus copy was identified from two transcriptome data.

Data of 10 SSR markers for genomes of homo sapiens and monkeys.

PubMed

Reddy, K K V V V S; Raju, S Viswanadha; Someswara Rao, Chinta

2017-06-01

In this data, we present 10 Simple Sequence Repeat(SSR) markers TAGA, TCAT, GAAT, AGAT, AGAA, GATA, TATC, CTTT, TCTG and TCTA which are extracted from the genomes of homo sapiens and monkeys using string matching mechanism [1]. All loci showed 4 Base Pair(bp) in allele size, indicating that there are some polymorphisms between individuals correlating to the number of SSR repeats that maybe useful for the detection of similarity among the genotypes. Collectively, these data show that the SSR extraction is a valuable method to illustrate genetic variation of genomes.
Identification of Single-Nucleotide Polymorphic Loci Associated with Biomass Yield under Water Deficit in Alfalfa (Medicago sativa L.) Using Genome-Wide Sequencing and Association Mapping

PubMed Central

Yu, Long-Xi

2017-01-01

Alfalfa is a worldwide grown forage crop and is important due to its high biomass production and nutritional value. However, the production of alfalfa is challenged by adverse environmental factors such as drought and other stresses. Developing drought resistance alfalfa is an important breeding target for enhancing alfalfa productivity in arid and semi-arid regions. In the present study, we used genotyping-by-sequencing and genome-wide association to identify marker loci associated with biomass yield under drought in the field in a panel of diverse germplasm of alfalfa. A total of 28 markers at 22 genetic loci were associated with yield under water deficit, whereas only four markers associated with the same trait under well-watered condition. Comparisons of marker-trait associations between water deficit and well-watered conditions showed non-similarity except one. Most of the markers were identical across harvest periods within the treatment, although different levels of significance were found among the three harvests. The loci associated with biomass yield under water deficit located throughout all chromosomes in the alfalfa genome agreed with previous reports. Our results suggest that biomass yield under drought is a complex quantitative trait with polygenic inheritance and may involve a different mechanism compared to that of non-stress. BLAST searches of the flanking sequences of the associated loci against DNA databases revealed several stress-responsive genes linked to the drought resistance loci, including leucine-rich repeat receptor-like kinase, B3 DNA-binding domain protein, translation initiation factor IF2, and phospholipase-like protein. With further investigation, those markers closely linked to drought resistance can be used for MAS to accelerate the development of new alfalfa cultivars with improved resistance to drought and other abiotic stresses. PMID:28706532
Characterization of genetic sequence variation of 58 STR loci in four major population groups.

PubMed

Novroski, Nicole M M; King, Jonathan L; Churchill, Jennifer D; Seah, Lay Hong; Budowle, Bruce

2016-11-01

Massively parallel sequencing (MPS) can identify sequence variation within short tandem repeat (STR) alleles as well as their nominal allele lengths that traditionally have been obtained by capillary electrophoresis. Using the MiSeq FGx Forensic Genomics System (Illumina), STRait Razor, and in-house excel workbooks, genetic variation was characterized within STR repeat and flanking regions of 27 autosomal, 7 X-chromosome and 24 Y-chromosome STR markers in 777 unrelated individuals from four population groups. Seven hundred and forty six autosomal, 227 X-chromosome, and 324 Y-chromosome STR alleles were identified by sequence compared with 357 autosomal, 107 X-chromosome, and 189 Y-chromosome STR alleles that were identified by length. Within the observed sequence variation, 227 autosomal, 156 X-chromosome, and 112 Y-chromosome novel alleles were identified and described. One hundred and seventy six autosomal, 123 X-chromosome, and 93 Y-chromosome sequence variants resided within STR repeat regions, and 86 autosomal, 39 X-chromosome, and 20 Y-chromosome variants were located in STR flanking regions. Three markers, D18S51, DXS10135, and DYS385a-b had 1, 4, and 1 alleles, respectively, which contained both a novel repeat region variant and a flanking sequence variant in the same nucleotide sequence. There were 50 markers that demonstrated a relative increase in diversity with the variant sequence alleles compared with those of traditional nominal length alleles. These population data illustrate the genetic variation that exists in the commonly used STR markers in the selected population samples and provide allele frequencies for statistical calculations related to STR profiling with MPS data. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
New microsatellite loci for pomegranate, Punica granatum (Lythraceae).

PubMed

Currò, Sergio; Caruso, Marco; Distefano, Gaetano; Gentile, Alessandra; La Malfa, Stefano

2010-07-01

A new set of pomegranate microsatellites was selected and characterized to assess the level of genetic diversity among cultivars and wild genotypes. • Nine Simple Sequence Repeat (SSR) markers were obtained using the Microsatellite-AFLP technique and were successfully amplified in 34 genotypes belonging to Italian, Spanish, and Turkish germplasm collections. The number of alleles per locus ranged from 1 to 5, and the total number of alleles was 22. • Because only a few codominant markers are available for this species, the newly identified SSRs will facilitate genetic diversity studies, fingerprinting, and mapping. In addition, the 9 loci successfully amplified in P. granatum var. nana. No cross transferability was observed for Cuphea micropetala and Lagerstroemia indica (Lythraceae).
Genetic diversity and relationship analysis of Gossypium arboreum accessions.

PubMed

Liu, F; Zhou, Z L; Wang, C Y; Wang, Y H; Cai, X Y; Wang, X X; Zhang, Z S; Wang, K B

2015-11-19

Simple sequence repeat techniques were used to identify the genetic diversity of 101 Gossypium arboreum accessions collected from India, Vietnam, and the southwest of China (Guizhou, Guangxi, and Yunnan provinces). Twenty-six pairs of SSR primers produced a total of 103 polymorphic loci with an average of 3.96 polymorphic loci per primer. The average of the effective number of alleles, Nei's gene diversity, and Shannon's information index were 0.59, 0.2835, and 0.4361, respectively. The diversity varied among different geographic regions. The result of principal component analysis was consistent with that of unweighted pair group method with arithmetic mean clustering analysis. The 101 G. arboreum accessions were clustered into 2 groups.
Isolation and characterization of polymorphic microsatellite loci from Zelkova schneideriana Hand.-Mazz.

PubMed

Liu, H L; Zhang, R Q; Geng, M L; Zhu, J Y; Ma, J L

2014-12-03

Zelkova schneideriana is a highly valued hardwood species. An improved technique for isolating codominant compound microsatellite markers was used to develop simple sequence repeat markers for Z. schneideriana. A total of 12 microsatellite loci were identified. Overall, the number of alleles per locus ranged from 8-19, with an average of 11.75. Observed heterozygosity and expected heterozygosity values ranged from 0.109-0.709 and 0.832-0.929, respectively. Polymorphic information content is from 0.803-0.915, with an average of 0.854. These markers will be very important for future research related to the genetic diversity, population structure, patterns of gene flow, and mating system of this species.
Diversity analysis in Cannabis sativa based on large-scale development of expressed sequence tag-derived simple sequence repeat markers.

PubMed

Gao, Chunsheng; Xin, Pengfei; Cheng, Chaohua; Tang, Qing; Chen, Ping; Wang, Changbiao; Zang, Gonggu; Zhao, Lining

2014-01-01

Cannabis sativa L. is an important economic plant for the production of food, fiber, oils, and intoxicants. However, lack of sufficient simple sequence repeat (SSR) markers has limited the development of cannabis genetic research. Here, large-scale development of expressed sequence tag simple sequence repeat (EST-SSR) markers was performed to obtain more informative genetic markers, and to assess genetic diversity in cannabis (Cannabis sativa L.). Based on the cannabis transcriptome, 4,577 SSRs were identified from 3,624 ESTs. From there, a total of 3,442 complementary primer pairs were designed as SSR markers. Among these markers, trinucleotide repeat motifs (50.99%) were the most abundant, followed by hexanucleotide (25.13%), dinucleotide (16.34%), tetranucloetide (3.8%), and pentanucleotide (3.74%) repeat motifs, respectively. The AAG/CTT trinucleotide repeat (17.96%) was the most abundant motif detected in the SSRs. One hundred and seventeen EST-SSR markers were randomly selected to evaluate primer quality in 24 cannabis varieties. Among these 117 markers, 108 (92.31%) were successfully amplified and 87 (74.36%) were polymorphic. Forty-five polymorphic primer pairs were selected to evaluate genetic diversity and relatedness among the 115 cannabis genotypes. The results showed that 115 varieties could be divided into 4 groups primarily based on geography: Northern China, Europe, Central China, and Southern China. Moreover, the coefficient of similarity when comparing cannabis from Northern China with the European group cannabis was higher than that when comparing with cannabis from the other two groups, owing to a similar climate. This study outlines the first large-scale development of SSR markers for cannabis. These data may serve as a foundation for the development of genetic linkage, quantitative trait loci mapping, and marker-assisted breeding of cannabis.
Diversity Analysis in Cannabis sativa Based on Large-Scale Development of Expressed Sequence Tag-Derived Simple Sequence Repeat Markers

PubMed Central

Cheng, Chaohua; Tang, Qing; Chen, Ping; Wang, Changbiao; Zang, Gonggu; Zhao, Lining

2014-01-01

Cannabis sativa L. is an important economic plant for the production of food, fiber, oils, and intoxicants. However, lack of sufficient simple sequence repeat (SSR) markers has limited the development of cannabis genetic research. Here, large-scale development of expressed sequence tag simple sequence repeat (EST-SSR) markers was performed to obtain more informative genetic markers, and to assess genetic diversity in cannabis (Cannabis sativa L.). Based on the cannabis transcriptome, 4,577 SSRs were identified from 3,624 ESTs. From there, a total of 3,442 complementary primer pairs were designed as SSR markers. Among these markers, trinucleotide repeat motifs (50.99%) were the most abundant, followed by hexanucleotide (25.13%), dinucleotide (16.34%), tetranucloetide (3.8%), and pentanucleotide (3.74%) repeat motifs, respectively. The AAG/CTT trinucleotide repeat (17.96%) was the most abundant motif detected in the SSRs. One hundred and seventeen EST-SSR markers were randomly selected to evaluate primer quality in 24 cannabis varieties. Among these 117 markers, 108 (92.31%) were successfully amplified and 87 (74.36%) were polymorphic. Forty-five polymorphic primer pairs were selected to evaluate genetic diversity and relatedness among the 115 cannabis genotypes. The results showed that 115 varieties could be divided into 4 groups primarily based on geography: Northern China, Europe, Central China, and Southern China. Moreover, the coefficient of similarity when comparing cannabis from Northern China with the European group cannabis was higher than that when comparing with cannabis from the other two groups, owing to a similar climate. This study outlines the first large-scale development of SSR markers for cannabis. These data may serve as a foundation for the development of genetic linkage, quantitative trait loci mapping, and marker-assisted breeding of cannabis. PMID:25329551
Development of Multiple-Locus Variable-Number Tandem-Repeat Analysis for Molecular Subtyping of Campylobacter jejuni by Using Capillary Electrophoresis

PubMed Central

Techaruvichit, Punnida; Vesaratchavest, Mongkol; Keeratipibul, Suwimon; Kuda, Takashi; Kimura, Bon

2015-01-01

Campylobacter jejuni is a common cause of the frequently reported food-borne diseases in developed and developing nations. This study describes the development of multiple-locus variable-number tandem-repeat (VNTR) analysis (MLVA) using capillary electrophoresis as a novel typing method for microbial source tracking and epidemiological investigation of C. jejuni. Among 36 tandem repeat loci detected by the Tandem Repeat Finder program, 7 VNTR loci were selected and used for characterizing 60 isolates recovered from chicken meat samples from retail shops, samples from chicken meat processing factory, and stool samples. The discrimination ability of MLVA was compared with that of multilocus sequence typing (MLST). MLVA (diversity index of 0.97 with 31 MLVA types) provided slightly higher discrimination than MLST (diversity index of 0.95 with 25 MLST types). The overall concordance between MLVA and MLST was estimated at 63% by adjusted Rand coefficient. MLVA predicted MLST type better than MLST predicted MLVA type, as reflected by Wallace coefficient (Wallace coefficient for MLVA to MLST versus MLST to MLVA, 86% versus 51%). MLVA is a useful tool and can be used for effective monitoring of C. jejuni and investigation of epidemics caused by C. jejuni. PMID:26025899
Occurrence of Can-SINEs and intron sequence evolution supports robust phylogeny of pinniped carnivores and their terrestrial relatives.

PubMed

Schröder, Christiane; Bleidorn, Christoph; Hartmann, Stefanie; Tiedemann, Ralph

2009-12-15

Investigating the dog genome we found 178965 introns with a moderate length of 200-1000 bp. A screening of these sequences against 23 different repeat libraries to find insertions of short interspersed elements (SINEs) detected 45276 SINEs. Virtually all of these SINEs (98%) belong to the tRNA-derived Can-SINE family. Can-SINEs arose about 55 million years ago before Carnivora split into two basal groups, the Caniformia (dog-like carnivores) and the Feliformia (cat-like carnivores). Genome comparisons of dog and cat recovered 506 putatively informative SINE loci for caniformian phylogeny. In this study we show how to use such genome information of model organisms to research the phylogeny of related non-model species of interest. Investigating a dataset including representatives of all major caniformian lineages, we analysed 24 randomly chosen loci for 22 taxa. All loci were amplifiable and revealed 17 parsimony-informative SINE insertions. The screening for informative SINE insertions yields a large amount of sequence information, in particular of introns, which contain reliable phylogenetic information as well. A phylogenetic analysis of intron- and SINE sequence data provided a statistically robust phylogeny which is congruent with the absence/presence pattern of our SINE markers. This phylogeny strongly supports a sistergroup relationship of Musteloidea and Pinnipedia. Within Pinnipedia, we see strong support from bootstrapping and the presence of a SINE insertion for a sistergroup relationship of the walrus with the Otariidae.
Microsatellite markers for Senna spectabilis var. excelsa (Caesalpinioideae, Fabaceae)1

PubMed Central

López-Roberts, M. Cristina; Barbosa, Ariane R.; Paganucci de Queiroz, Luciano; van den Berg, Cássio

2016-01-01

Premise of the study: Senna spectabilis var. excelsa (Fabaceae) is a South and Central American tree of great ecological importance and one of the most common species in several sites of seasonally dry forests. Our goal was to develop microsatellite markers to assess the genetic diversity and structure of this species. Methods and Results: We designed and assessed 53 loci obtained from a microsatellite-enriched library and an intersimple sequence repeat library. Fourteen loci were polymorphic, and they presented a total of 39 alleles in a sample of 61 individuals from six populations. The mean values of observed and expected heterozygosities were 0.355 and 0.479, respectively. Polymorphism information content was 0.390 and the Shannon index was 0.778. Conclusions: Polymorphism information content and Shannon index indicate that at least nine of the 14 microsatellite loci developed are moderate to highly informative, and potentially useful for population genetic studies in this species. PMID:26819856
CRISPR-Cas systems: prokaryotes upgrade to adaptive immunity

PubMed Central

Barrangou, Rodolphe; Marraffini, Luciano A.

2014-01-01

Summary Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR), and associated proteins (Cas) comprise the CRISPR-Cas system, which confers adaptive immunity against exogenic elements in many bacteria and most archaea. CRISPR-mediated immunization occurs through the uptake of DNA from invasive genetic elements such as plasmids and viruses, followed by its integration into CRISPR loci. These loci are subsequently transcribed and processed into small interfering RNAs that guide nucleases for specific cleavage of complementary sequences. Conceptually, CRISPR-Cas shares functional features with the mammalian adaptive immune system, while also exhibiting characteristics of Lamarckian evolution. Because immune markers spliced from exogenous agents are integrated iteratively in CRISPR loci, they constitute a genetic record of vaccination events and reflect environmental conditions and changes over time. Cas endonucleases, which can be reprogrammed by small guide RNAs have shown unprecedented potential and flexibility for genome editing, and can be repurposed for numerous DNA targeting applications including transcriptional control. PMID:24766887
Isolation and Characterization of Polymorphic Microsatellite Loci from Metapenaeopsis barbata Using PCR-Based Isolation of Microsatellite Arrays (PIMA)

PubMed Central

Chiang, Tzen-Yuh; Tzeng, Tzong-Der; Lin, Hung-Du; Cho, Ching-Ju; Lin, Feng-Jiau

2012-01-01

The red-spot prawn, Metapenaeopsis barbata, is a commercially important, widely distributed demersal species in the Indo-West Pacific Ocean. Overfishing has made its populations decline in the past decade. To study conservation genetics, eight polymorphic microsatellite loci were isolated. Genetic characteristics of the SSR (simple sequence repeat) fingerprints were estimated in 61 individuals from adjacent seas of Taiwan and China. The number of alleles, ranging from 2 to 4, as well as observed and expected heterozygosities in populations, ranging from 0.048 to 0.538, and 0.048 and 0.654, respectively, were detected. No deviation from Hardy–Weinberg expectations was detected at either locus. No significant linkage disequilibrium was detected in locus pairs. The polymorphic microsatellite loci will be useful for investigations of the genetic variation, population structure, and conservation genetics of this species. PMID:22489123
Genetic variation of Sargassum horneri populations detected by inter-simple sequence repeats.

PubMed

Ren, J R; Yang, R; He, Y Y; Sun, Q H

2015-01-30

The seaweed Sargassum horneri is an important brown alga in the marine environment, and it is an important raw material in the alginate industry. Unfortunately, the fixed resource that was originally reported is now reduced or disappeared, and increased floating populations have been reported in recent years. We sampled a floating population and 4 fixed cultivated populations of S. horneri along the coast of Zhejiang, China. Inter-simple sequence repeat (ISSR) markers were applied in this research to analyze the genetic variation between floating populations and fixed cultivated populations of S. horneri. In total, 220 loci were amplified with 23 ISSR primers. The percentage of polymorphic loci within each population ranged from 53.64 to 95.45%. The highest diversity was observed in population 3, which was the local species that was suspension cultured in the lab and then fixed cultivated in the Nanji Islands before sampling. The lowest diversity was obtained in the floating population 4. The genetic distances among the 5 S. horneri populations ranged from 0.0819 to 0.2889, and the distance tendency confirmed the genetic diversity. The results suggest that the floating population had the lowest genetic diversity and could not be joined into the cluster branch of the fixed cultivated populations.
EAPhy: A Flexible Tool for High-throughput Quality Filtering of Exon-alignments and Data Processing for Phylogenetic Methods.

PubMed

Blom, Mozes P K

2015-08-05

Recently developed molecular methods enable geneticists to target and sequence thousands of orthologous loci and infer evolutionary relationships across the tree of life. Large numbers of genetic markers benefit species tree inference but visual inspection of alignment quality, as traditionally conducted, is challenging with thousands of loci. Furthermore, due to the impracticality of repeated visual inspection with alternative filtering criteria, the potential consequences of using datasets with different degrees of missing data remain nominally explored in most empirical phylogenomic studies. In this short communication, I describe a flexible high-throughput pipeline designed to assess alignment quality and filter exonic sequence data for subsequent inference. The stringency criteria for alignment quality and missing data can be adapted based on the expected level of sequence divergence. Each alignment is automatically evaluated based on the stringency criteria specified, significantly reducing the number of alignments that require visual inspection. By developing a rapid method for alignment filtering and quality assessment, the consistency of phylogenetic estimation based on exonic sequence alignments can be further explored across distinct inference methods, while accounting for different degrees of missing data.
Multilocus Sex Determination Revealed in Two Populations of Gynodioecious Wild Strawberry, Fragaria vesca subsp. bracteata.

PubMed

Ashman, Tia-Lynn; Tennessen, Jacob A; Dalton, Rebecca M; Govindarajulu, Rajanikanth; Koski, Matthew H; Liston, Aaron

2015-10-19

Gynodioecy, the coexistence of females and hermaphrodites, occurs in 20% of angiosperm families and often enables transitions between hermaphroditism and dioecy. Clarifying mechanisms of sex determination in gynodioecious species can thus illuminate sexual system evolution. Genetic determination of gynodioecy, however, can be complex and is not fully characterized in any wild species. We used targeted sequence capture to genetically map a novel nuclear contributor to male sterility in a self-pollinated hermaphrodite of Fragaria vesca subsp. bracteata from the southern portion of its range. To understand its interaction with another identified locus and possibly additional loci, we performed crosses within and between two populations separated by 2000 km, phenotyped the progeny and sequenced candidate markers at both sex-determining loci. The newly mapped locus contains a high density of pentatricopeptide repeat genes, a class commonly involved in restoration of fertility caused by cytoplasmic male sterility. Examination of all crosses revealed three unlinked epistatically interacting loci that determine sexual phenotype and vary in frequency between populations. Fragaria vesca subsp. bracteata represents the first wild gynodioecious species with genomic evidence of both cytoplasmic and nuclear genes in sex determination. We propose a model for the interactions between these loci and new hypotheses for the evolution of sex determining chromosomes in the subdioecious and dioecious Fragaria. Copyright © 2015 Ashman et al.
Rapid microsatellite marker development using next generation pyrosequencing to inform invasive Burmese python -- Python molurus bivittatus -- management

USGS Publications Warehouse

Hunter, Margaret E.; Hart, Kristen M.

2013-01-01

Invasive species represent an increasing threat to native ecosystems, harming indigenous taxa through predation, habitat modification, cross-species hybridization and alteration of ecosystem processes. Additionally, high economic costs are associated with environmental damage, restoration and control measures. The Burmese python, Python molurus bivittatus, is one of the most notable invasive species in the US, due to the threat it poses to imperiled species and the Greater Everglades ecosystem. To address population structure and relatedness, next generation sequencing was used to rapidly produce species-specific microsatellite loci. The Roche 454 GS-FLX Titanium platform provided 6616 di-, tri- and tetra-nucleotide repeats in 117,516 sequences. Using stringent criteria, 24 of 26 selected tri- and tetra-nucleotide loci were polymerase chain reaction (PCR) amplified and 18 were polymorphic. An additional six cross-species loci were amplified, and the resulting 24 loci were incorporated into eight PCR multiplexes. Multi-locus genotypes yielded an average of 61% (39%–77%) heterozygosity and 3.7 (2–6) alleles per locus. Population-level studies using the developed microsatellites will track the invasion front and monitor population-suppression dynamics. Additionally, cross-species amplification was detected in the invasive Ball, P. regius, and Northern African python, P. sebae. These markers can be used to address the hybridization potential of Burmese pythons and the larger, more aggressive P. sebae.
Rapid Microsatellite Marker Development Using Next Generation Pyrosequencing to Inform Invasive Burmese Python—Python molurus bivittatus—Management

PubMed Central

Hunter, Margaret E.; Hart, Kristen M.

2013-01-01

Invasive species represent an increasing threat to native ecosystems, harming indigenous taxa through predation, habitat modification, cross-species hybridization and alteration of ecosystem processes. Additionally, high economic costs are associated with environmental damage, restoration and control measures. The Burmese python, Python molurus bivittatus, is one of the most notable invasive species in the US, due to the threat it poses to imperiled species and the Greater Everglades ecosystem. To address population structure and relatedness, next generation sequencing was used to rapidly produce species-specific microsatellite loci. The Roche 454 GS-FLX Titanium platform provided 6616 di-, tri- and tetra-nucleotide repeats in 117,516 sequences. Using stringent criteria, 24 of 26 selected tri- and tetra-nucleotide loci were polymerase chain reaction (PCR) amplified and 18 were polymorphic. An additional six cross-species loci were amplified, and the resulting 24 loci were incorporated into eight PCR multiplexes. Multi-locus genotypes yielded an average of 61% (39%–77%) heterozygosity and 3.7 (2–6) alleles per locus. Population-level studies using the developed microsatellites will track the invasion front and monitor population-suppression dynamics. Additionally, cross-species amplification was detected in the invasive Ball, P. regius, and Northern African python, P. sebae. These markers can be used to address the hybridization potential of Burmese pythons and the larger, more aggressive P. sebae. PMID:23449030
Short Communication: Genetic linkage map of Cucurbita maxima with molecular and morphological markers.

PubMed

Ge, Y; Li, X; Yang, X X; Cui, C S; Qu, S P

2015-05-22

Cucurbita maxima is one of the most widely cultivated vegetables in China and exhibits distinct morphological characteristics. In this study, genetic linkage analysis with 57 simple-sequence repeats, 21 amplified fragment length polymorphisms, 3 random-amplified polymorphic DNA, and one morphological marker revealed 20 genetic linkage groups of C. maxima covering a genetic distance of 991.5 cM with an average of 12.1 cM between adjacent markers. Genetic linkage analysis identified the simple-sequence repeat marker 'PU078072' 5.9 cM away from the locus 'Rc', which controls rind color. The genetic map in the present study will be useful for better mapping, tagging, and cloning of quantitative trait loci/gene(s) affecting economically important traits and for breeding new varieties of C. maxima through marker-assisted selection.
Development of Genomic Microsatellite Markers in Carthamus tinctorius L. (Safflower) Using Next Generation Sequencing and Assessment of Their Cross-Species Transferability and Utility for Diversity Analysis

PubMed Central

Variath, Murali Tottekkad; Joshi, Gopal; Bali, Sapinder; Agarwal, Manu; Kumar, Amar; Jagannath, Arun; Goel, Shailendra

2015-01-01

Background Safflower (Carthamus tinctorius L.), an Asteraceae member, yields high quality edible oil rich in unsaturated fatty acids and is resilient to dry conditions. The crop holds tremendous potential for improvement through concerted molecular breeding programs due to the availability of significant genetic and phenotypic diversity. Genomic resources that could facilitate such breeding programs remain largely underdeveloped in the crop. The present study was initiated to develop a large set of novel microsatellite markers for safflower using next generation sequencing. Principal Findings Low throughput genome sequencing of safflower was performed using Illumina paired end technology providing ~3.5X coverage of the genome. Analysis of sequencing data allowed identification of 23,067 regions harboring perfect microsatellite loci. The safflower genome was found to be rich in dinucleotide repeats followed by tri-, tetra-, penta- and hexa-nucleotides. Primer pairs were designed for 5,716 novel microsatellite sequences with repeat length ≥ 20 bases and optimal flanking regions. A subset of 325 microsatellite loci was tested for amplification, of which 294 loci produced robust amplification. The validated primers were used for assessment of 23 safflower accessions belonging to diverse agro-climatic zones of the world leading to identification of 93 polymorphic primers (31.6%). The numbers of observed alleles at each locus ranged from two to four and mean polymorphism information content was found to be 0.3075. The polymorphic primers were tested for cross-species transferability on nine wild relatives of cultivated safflower. All primers except one showed amplification in at least two wild species while 25 primers amplified across all the nine species. The UPGMA dendrogram clustered C. tinctorius accessions and wild species separately into two major groups. The proposed progenitor species of safflower, C. oxyacantha and C. palaestinus were genetically closer to cultivated safflower and formed a distinct cluster. The cluster analysis also distinguished diploid and tetraploid wild species of safflower. Conclusion Next generation sequencing of safflower genome generated a large set of microsatellite markers. The novel markers developed in this study will add to the existing repertoire of markers and can be used for diversity analysis, synteny studies, construction of linkage maps and marker-assisted selection. PMID:26287743

Gender Identification in Date Palm Using Molecular Markers.

PubMed

Awan, Faisal Saeed; Maryam; Jaskani, Muhammad J; Sadia, Bushra

2017-01-01

Breeding of date palm is complicated because of its long life cycle and heterozygous nature. Sexual propagation of date palm does not produce true-to-type plants. Sex of date palms cannot be identified until the first flowering stage. Molecular markers such as random amplified polymorphic DNA (RAPD), sequence-characterized amplified regions (SCAR), and simple sequence repeats (SSR) have successfully been used to identify the sex-linked loci in the plant genome and to isolate the corresponding genes. This chapter highlights the use of three molecular markers including RAPD, SCAR, and SSR to identify the gender of date palm seedlings.
The complete chloroplast genome of Capsicum annuum var. glabriusculum using Illumina sequencing.

PubMed

Raveendar, Sebastin; Na, Young-Wang; Lee, Jung-Ro; Shim, Donghwan; Ma, Kyung-Ho; Lee, Sok-Young; Chung, Jong-Wook

2015-07-20

Chloroplast (cp) genome sequences provide a valuable source for DNA barcoding. Molecular phylogenetic studies have concentrated on DNA sequencing of conserved gene loci. However, this approach is time consuming and more difficult to implement when gene organization differs among species. Here we report the complete re-sequencing of the cp genome of Capsicum pepper (Capsicum annuum var. glabriusculum) using the Illumina platform. The total length of the cp genome is 156,817 bp with a 37.7% overall GC content. A pair of inverted repeats (IRs) of 50,284 bp were separated by a small single copy (SSC; 18,948 bp) and a large single copy (LSC; 87,446 bp). The number of cp genes in C. annuum var. glabriusculum is the same as that in other Capsicum species. Variations in the lengths of LSC; SSC and IR regions were the main contributors to the size variation in the cp genome of this species. A total of 125 simple sequence repeat (SSR) and 48 insertions or deletions variants were found by sequence alignment of Capsicum cp genome. These findings provide a foundation for further investigation of cp genome evolution in Capsicum and other higher plants.
Variation of short tandem repeats within and between species belonging to the Canidae family.

PubMed

Fredholm, M; Winterø, A K

1995-01-01

Frequency distribution and allele size in 20 canine microsatellite loci were analyzed in 33 flat-coated retrievers, 32 dachshunds, 10 red foxes, and 10 Arctic foxes. Overall, the major difference between the two dog breeds was the relative allele frequencies rather than the size ranges of alleles at the individual locus. The average heterozygosity within the two dog breeds was not significantly different. Since the average heterozygosity at several polymorphic loci is a relative measure of heterogeneity within the population, analysis of heterozygosity within microsatellite loci is suggested as a measure for the diversity of populations. Eighty percent (16 of 20) of the canine microsatellite primer pairs amplified corresponding loci in the two fox species. This reflects a very high sequence conservation within the Canidae family relative to findings in, for instance, the Muridae family. This indicates that it will be possible to utilize the well-characterized fox karyotype instead of the dog karyotype as a step towards physical mapping of the dog genome. Analysis of exclusion power and probabilities of genetic identity between unrelated animals by use of the seven most informative loci demonstrated that it will be possible to assemble a panel of microsatellite loci that is effective for parentage analysis in all breeds.
Development of EST-SSR markers for Taxillus nigrans (Loranthaceae) in southwestern China using next-generation sequencing1

PubMed Central

Miao, Ning; Zhang, Lei; Li, Maoping; Fan, Liqiang; Mao, Kangshan

2017-01-01

Premise of the study: We developed transcriptome microsatellite markers (simple sequence repeats) for Taxillus nigrans (Loranthaceae) to survey the genetic diversity and population structure of this species. Methods and Results: We used Illumina HiSeq data to reconstruct the transcriptome of T. nigrans by de novo assembly and used the transcriptome to develop a set of simple sequence repeat markers. Overall, 40 primer pairs were designed and tested; 19 of them amplified successfully and demonstrated polymorphisms. Two loci that detected null alleles were eliminated, and the remaining 17, which were subjected to further analyses, yielded two to 21 alleles per locus. Conclusions: The markers will serve as a basis for studies to assess the extent and pattern of distribution of genetic variation in T. nigrans, and they may also be useful in conservation genetic, ecological, and evolutionary studies of the genus Taxillus, a group of plant species of importance in Chinese traditional medicine. PMID:28924510
Analysis of an "off-ladder" allele at the Penta D short tandem repeat locus.

PubMed

Yang, Y L; Wang, J G; Wang, D X; Zhang, W Y; Liu, X J; Cao, J; Yang, S L

2015-11-25

Kinship testing of a father and his son from Guangxi, China, the location of the Zhuang minority people, was performed using the PowerPlex® 18D System with a short tandem repeat typing kit. The results indicated that both the father and his son had an off-ladder allele at the Penta D locus, with a genetic size larger than that of the maximal standard allelic ladder. To further identify this locus, monogenic amplification, gene cloning, and genetic sequencing were performed. Sequencing analysis demonstrated that the fragment size of the Penta D-OL locus was 469 bp and the core sequence was [AAAGA]21, also called Penta D-21. The rare Penta D-21 allele was found to be distributed among the Zhuang population from the Guangxi Zhuang Autonomous Region of China; therefore, this study improved the range of DNA data available for this locus and enhanced our ability for individual identification of gene loci.
Investigation of microsatellite instability in Turkish breast cancer patients.

PubMed

Demokan, Semra; Muslumanoglu, Mahmut; Yazici, H; Igci, Abdullah; Dalay, Nejat

2002-01-01

Multiple somatic and inherited genetic changes that lead to loss of growth control may contribute to the development of breast cancer. Microsatellites are tandem repeats of simple sequences that occur abundantly and at random throughout most eucaryotic genomes. Microsatellite instability (MI), characterized by the presence of random contractions or expansions in the length of simple sequence repeats or microsatellites, is observed in a variety of tumors. The aim of this study was to compare tumor DNA fingerprints with constitutional DNA fingerprints to investigate changes specific to breast cancer and evaluate its correlation with clinical characteristics. Tumor and normal tissue samples of 38 patients with breast cancer were investigated by comparing PCR-amplified microsatellite sequences D2S443 and D21S1436. Microsatellite instability at D21S1436 and D2S443 was found in 5 (13%) and 7 (18%) patients, respectively. Two patients displayed instability at both marker loci. No association was found between MI and age, family history, lymph node involvement and other clinical parameters.
Clustered regularly interspaced short palindromic repeats (CRISPRs) analysis of members of the Mycobacterium tuberculosis complex.

PubMed

Botelho, Ana; Canto, Ana; Leão, Célia; Cunha, Mónica V

2015-01-01

Typical CRISPR (clustered, regularly interspaced, short palindromic repeat) regions are constituted by short direct repeats (DRs), interspersed with similarly sized non-repetitive spacers, derived from transmissible genetic elements, acquired when the cell is challenged with foreign DNA. The analysis of the structure, in number and nature, of CRISPR spacers is a valuable tool for molecular typing since these loci are polymorphic among strains, originating characteristic signatures. The existence of CRISPR structures in the genome of the members of Mycobacterium tuberculosis complex (MTBC) enabled the development of a genotyping method, based on the analysis of the presence or absence of 43 oligonucleotide spacers separated by conserved DRs. This method, called spoligotyping, consists on PCR amplification of the DR chromosomal region and recognition after hybridization of the spacers that are present. The workflow beneath this methodology implies that the PCR products are brought onto a membrane containing synthetic oligonucleotides that have complementary sequences to the spacer sequences. Lack of hybridization of the PCR products to a specific oligonucleotide sequence indicates absence of the correspondent spacer sequence in the examined strain. Spoligotyping gained great notoriety as a robust identification and typing tool for members of MTBC, enabling multiple epidemiological studies on human and animal tuberculosis.
A gene (ETM) for essential tremor maps to chromosome 2p22-p25.

PubMed

Higgins, J J; Pho, L T; Nee, L E

1997-11-01

We report the results of linkage analysis in a large American family of Czech descent with dominantly inherited "pure" essential tremor (ET) and genetic anticipation. Genetic loci on chromosome 2p22-p25 establish linkage to this region with a maximum LOD score (Zmax) = 5.92 for the locus, D2S272. Obligate recombinant events place the ETM gene in a 15-cM candidate interval between the genetic loci D2S168 and D2S224. Repeat expansion detection analysis suggests that expanded CAG trinucleotide sequences are associated with ET. These findings will facilitate the search for an ETM gene and may further our understanding of the human motor system.
Microsatellite loci for the stingless bee Melipona rufiventris (Hymenoptera: Apidae).

PubMed

Lopes, Denilce Meneses; D Silva, Filipe Oliveira; Fernandes Salomão, Tânia Maria; Campos, Lúcio Antônio D Oliveira; Tavares, Mara Garcia

2009-05-01

Eight microsatellite primers were developed from ISSR (intersimple sequence repeats) markers for the stingless bee Melipona rufiventris. These primers were tested in 20 M. rufiventris workers, representing a single population from Minas Gerais state. The number of alleles per locus ranged from 2 to 5 (mean = 2.63) and the observed and expected heterozygosity values ranged from 0.00 to 0.44 (mean = 0.20) and from 0.05 to 0.68 (mean = 0.31), respectively. Several loci were also polymorphic in M. quadrifasciata, M. bicolor, M. mandacaia and Partamona helleri and should prove useful in population studies of other stingless bees. © 2009 The Authors. Journal compilation © 2009 Blackwell Publishing Ltd.
Identification and characterization of short tandem repeats in the Tibetan macaque genome based on resequencing data.

PubMed

Liu, San-Xu; Hou, Wei; Zhang, Xue-Yan; Peng, Chang-Jun; Yue, Bi-Song; Fan, Zhen-Xin; Li, Jing

2018-07-18

The Tibetan macaque, which is endemic to China, is currently listed as a Near Endangered primate species by the International Union for Conservation of Nature (IUCN). Short tandem repeats (STRs) refer to repetitive elements of genome sequence that range in length from 1-6 bp. They are found in many organisms and are widely applied in population genetic studies. To clarify the distribution characteristics of genome-wide STRs and understand their variation among Tibetan macaques, we conducted a genome-wide survey of STRs with next-generation sequencing of five macaque samples. A total of 1 077 790 perfect STRs were mined from our assembly, with an N50 of 4 966 bp. Mono-nucleotide repeats were the most abundant, followed by tetra- and di-nucleotide repeats. Analysis of GC content and repeats showed consistent results with other macaques. Furthermore, using STR analysis software (lobSTR), we found that the proportion of base pair deletions in the STRs was greater than that of insertions in the five Tibetan macaque individuals (P<0.05, t-test). We also found a greater number of homozygous STRs than heterozygous STRs (P<0.05, t-test), with the Emei and Jianyang Tibetan macaques showing more heterozygous loci than Huangshan Tibetan macaques. The proportion of insertions and mean variation of alleles in the Emei and Jianyang individuals were slightly higher than those in the Huangshan individuals, thus revealing differences in STR allele size between the two populations. The polymorphic STR loci identified based on the reference genome showed good amplification efficiency and could be used to study population genetics in Tibetan macaques. The neighbor-joining tree classified the five macaques into two different branches according to their geographical origin, indicating high genetic differentiation between the Huangshan and Sichuan populations. We elucidated the distribution characteristics of STRs in the Tibetan macaque genome and provided an effective method for screening polymorphic STRs. Our results also lay a foundation for future genetic variation studies of macaques.
Molecular typing of Brucella melitensis endemic strains and differentiation from the vaccine strain Rev-1.

PubMed

Noutsios, Georgios T; Papi, Rigini M; Ekateriniadou, Loukia V; Minas, Anastasios; Kyriakidis, Dimitrios A

2012-03-01

In the present study forty-four Greek endemic strains of Br. melitensis and three reference strains were genotyped by Multi locus Variable Number Tandem Repeat (ML-VNTR) analysis based on an eight-base pair tandem repeat sequence that was revealed in eight loci of Br. melitensis genome. The forty-four strains were discriminated from the vaccine strain Rev-1 by Restriction Fragment Length Polymorphism (RFLP) and Denaturant Gradient Gel Electrophoresis (DGGE). The ML-VNTR analysis revealed that endemic, reference and vaccine strains are genetically closely related, while most of the loci tested (1, 2, 4, 5 and 7) are highly polymorphic with Hunter-Gaston Genetic Diversity Index (HGDI) values in the range of 0.939 to 0.775. Analysis of ML-VNTRs loci stability through in vitro passages proved that loci 1 and 5 are non stable. Therefore, vaccine strain can be discriminated from endemic strains by allele's clusters of loci 2, 4, 6 and 7. RFLP and DGGE were also employed to analyse omp2 gene and reveled different patterns among Rev-1 and endemic strains. In RFLP, Rev-1 revealed three fragments (282, 238 and 44 bp), while endemic strains two fragments (238 and 44 bp). As for DGGE, the electrophoretic mobility of Rev-1 is different from the endemic strains due to heterologous binding of DNA chains of omp2a and omp2b gene. Overall, our data show clearly that it is feasible to genotype endemic strains of Br. melitensis and differentiate them from vaccine strain Rev-1 with ML-VNTR, RFLP and DGGE techniques. These tools can be used for conventional investigations in brucellosis outbreaks.
Developmental validation of the MiSeq FGx Forensic Genomics System for Targeted Next Generation Sequencing in Forensic DNA Casework and Database Laboratories.

PubMed

Jäger, Anne C; Alvarez, Michelle L; Davis, Carey P; Guzmán, Ernesto; Han, Yonmee; Way, Lisa; Walichiewicz, Paulina; Silva, David; Pham, Nguyen; Caves, Glorianna; Bruand, Jocelyne; Schlesinger, Felix; Pond, Stephanie J K; Varlaro, Joe; Stephens, Kathryn M; Holt, Cydne L

2017-05-01

Human DNA profiling using PCR at polymorphic short tandem repeat (STR) loci followed by capillary electrophoresis (CE) size separation and length-based allele typing has been the standard in the forensic community for over 20 years. Over the last decade, Next-Generation Sequencing (NGS) matured rapidly, bringing modern advantages to forensic DNA analysis. The MiSeq FGx™ Forensic Genomics System, comprised of the ForenSeq™ DNA Signature Prep Kit, MiSeq FGx™ Reagent Kit, MiSeq FGx™ instrument and ForenSeq™ Universal Analysis Software, uses PCR to simultaneously amplify up to 231 forensic loci in a single multiplex reaction. Targeted loci include Amelogenin, 27 common, forensic autosomal STRs, 24 Y-STRs, 7 X-STRs and three classes of single nucleotide polymorphisms (SNPs). The ForenSeq™ kit includes two primer sets: Amelogenin, 58 STRs and 94 identity informative SNPs (iiSNPs) are amplified using DNA Primer Set A (DPMA; 153 loci); if a laboratory chooses to generate investigative leads using DNA Primer Set B, amplification is targeted to the 153 loci in DPMA plus 22 phenotypic informative (piSNPs) and 56 biogeographical ancestry SNPs (aiSNPs). High-resolution genotypes, including detection of intra-STR sequence variants, are semi-automatically generated with the ForenSeq™ software. This system was subjected to developmental validation studies according to the 2012 Revised SWGDAM Validation Guidelines. A two-step PCR first amplifies the target forensic STR and SNP loci (PCR1); unique, sample-specific indexed adapters or "barcodes" are attached in PCR2. Approximately 1736 ForenSeq™ reactions were analyzed. Studies include DNA substrate testing (cotton swabs, FTA cards, filter paper), species studies from a range of nonhuman organisms, DNA input sensitivity studies from 1ng down to 7.8pg, two-person human DNA mixture testing with three genotype combinations, stability analysis of partially degraded DNA, and effects of five commonly encountered PCR inhibitors. Calculations from ForenSeq™ STR and SNP repeatability and reproducibility studies (1ng template) indicate 100.0% accuracy of the MiSeq FGx™ System in allele calling relative to CE for STRs (1260 samples), and >99.1% accuracy relative to bead array typing for SNPs (1260 samples for iiSNPs, 310 samples for aiSNPs and piSNPs), with >99.0% and >97.8% precision, respectively. Call rates of >99.0% were observed for all STRs and SNPs amplified with both ForenSeq™ primer mixes. Limitations of the MiSeq FGx™ System are discussed. Results described here demonstrate that the MiSeq FGx™ System meets forensic DNA quality assurance guidelines with robust, reliable, and reproducible performance on samples of various quantities and qualities. Copyright © 2017 The Authors. Published by Elsevier B.V. All rights reserved.
Clustured regularly interspersed short palindromic repeats (CRISPR) genetic diversity studies as a mean to reconstruct the evolution of the Mycobacterium tuberculosis complex.

PubMed

Sola, Christophe

2015-06-01

The natural history of tuberculosis may be tackled by various means, among which the record of molecular scars that have been registered by the Mycobacterium tuberculosis complex (MTBC) genomes transmitted from patient to patient for tens of thousands years and possibly more. Recently discovered polymorphic loci, the CRISPR sequences, are indirect witnesses of the historical phage-bacteria struggle, and may be related to the time when the ancestor of today's tubercle bacilli were environmental bacteria, i.e. before becoming intracellular parasites. In this article, we present what are CRISPRs and try to summarize almost 20 years of research results obtained using the genetic diversity of the CRISPR loci in MTBC as a perspective for studying new models. We show that the study of the diversity of CRISPR sequences, thanks to «spoligotyping», has played a great role in our global understanding of the population structure of MTBC. Copyright © 2015 Elsevier Ltd. All rights reserved.
Transcriptome-Derived Tetranucleotide Microsatellites and Their Associated Genes from the Giant Panda (Ailuropoda melanoleuca).

PubMed

Song, Xuhao; Shen, Fujun; Huang, Jie; Huang, Yan; Du, Lianming; Wang, Chengdong; Fan, Zhenxin; Hou, Rong; Yue, Bisong; Zhang, Xiuyue

2016-09-01

Recently, an increasing number of microsatellites or simple sequence repeats (SSRs) have been found and characterized from transcriptomes. Such SSRs can be employed as putative functional markers to easily tag corresponding genes, which play an important role in biomedical studies and genetic analysis. However, the transcriptome-derived SSRs for giant panda (Ailuropoda melanoleuca) are not yet available. In this work, we identified and characterized 20 tetranucleotide microsatellite loci from a transcript database generated from the blood of giant panda. Furthermore, we assigned their predicted transcriptome locations: 16 loci were assigned to untranslated regions (UTRs) and 4 loci were assigned to coding regions (CDSs). Gene identities of 14 transcripts contained corresponding microsatellites were determined, which provide useful information to study the potential contribution of SSRs to gene regulation in giant panda. The polymorphic information content (PIC) values ranged from 0.293 to 0.789 with an average of 0.603 for the 16 UTRs-derived SSRs. Interestingly, 4 CDS-derived microsatellites developed in our study were also polymorphic, and the instability of these 4 CDS-derived SSRs was further validated by re-genotyping and sequencing. The genes containing these 4 CDS-derived SSRs were embedded with various types of repeat motifs. The interaction of all the length-changing SSRs might provide a way against coding region frameshift caused by microsatellite instability. We hope these newly gene-associated biomarkers will pave the way for genetic and biomedical studies for giant panda in the future. In sum, this set of transcriptome-derived markers complements the genetic resources available for giant panda. © The American Genetic Association. 2016. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Analysis of Salmonella enterica Serovar Typhimurium Variable-Number Tandem-Repeat Data for Public Health Investigation Based on Measured Mutation Rates and Whole-Genome Sequence Comparisons

PubMed Central

Dimovski, Karolina; Cao, Hanwei; Wijburg, Odilia L. C.; Strugnell, Richard A.; Mantena, Radha K.; Whipp, Margaret; Hogg, Geoff

2014-01-01

Variable-number tandem repeats (VNTRs) mutate rapidly and can be useful markers for genotyping. While multilocus VNTR analysis (MLVA) is increasingly used in the detection and investigation of food-borne outbreaks caused by Salmonella enterica serovar Typhimurium (S. Typhimurium) and other bacterial pathogens, MLVA data analysis usually relies on simple clustering approaches that may lead to incorrect interpretations. Here, we estimated the rates of copy number change at each of the five loci commonly used for S. Typhimurium MLVA, during in vitro and in vivo passage. We found that loci STTR5, STTR6, and STTR10 changed during passage but STTR3 and STTR9 did not. Relative rates of change were consistent across in vitro and in vivo growth and could be accurately estimated from diversity measures of natural variation observed during large outbreaks. Using a set of 203 isolates from a series of linked outbreaks and whole-genome sequencing of 12 representative isolates, we assessed the accuracy and utility of several alternative methods for analyzing and interpreting S. Typhimurium MLVA data. We show that eBURST analysis was accurate and informative. For construction of MLVA-based trees, a novel distance metric, based on the geometric model of VNTR evolution coupled with locus-specific weights, performed better than the commonly used simple or categorical distance metrics. The data suggest that, for the purpose of identifying potential transmission clusters for further investigation, isolates whose profiles differ at one of the rapidly changing STTR5, STTR6, and STTR10 loci should be collapsed into the same cluster. PMID:24957617
A high-density consensus map of barley linking DArT markers to SSR, RFLP and STS loci and agricultural traits

PubMed Central

Wenzl, Peter; Li, Haobing; Carling, Jason; Zhou, Meixue; Raman, Harsh; Paul, Edie; Hearnden, Phillippa; Maier, Christina; Xia, Ling; Caig, Vanessa; Ovesná, Jaroslava; Cakir, Mehmet; Poulsen, David; Wang, Junping; Raman, Rosy; Smith, Kevin P; Muehlbauer, Gary J; Chalmers, Ken J; Kleinhofs, Andris; Huttner, Eric; Kilian, Andrzej

2006-01-01

Background Molecular marker technologies are undergoing a transition from largely serial assays measuring DNA fragment sizes to hybridization-based technologies with high multiplexing levels. Diversity Arrays Technology (DArT) is a hybridization-based technology that is increasingly being adopted by barley researchers. There is a need to integrate the information generated by DArT with previous data produced with gel-based marker technologies. The goal of this study was to build a high-density consensus linkage map from the combined datasets of ten populations, most of which were simultaneously typed with DArT and Simple Sequence Repeat (SSR), Restriction Enzyme Fragment Polymorphism (RFLP) and/or Sequence Tagged Site (STS) markers. Results The consensus map, built using a combination of JoinMap 3.0 software and several purpose-built perl scripts, comprised 2,935 loci (2,085 DArT, 850 other loci) and spanned 1,161 cM. It contained a total of 1,629 'bins' (unique loci), with an average inter-bin distance of 0.7 ± 1.0 cM (median = 0.3 cM). More than 98% of the map could be covered with a single DArT assay. The arrangement of loci was very similar to, and almost as optimal as, the arrangement of loci in component maps built for individual populations. The locus order of a synthetic map derived from merging the component maps without considering the segregation data was only slightly inferior. The distribution of loci along chromosomes indicated centromeric suppression of recombination in all chromosomes except 5H. DArT markers appeared to have a moderate tendency toward hypomethylated, gene-rich regions in distal chromosome areas. On the average, 14 ± 9 DArT loci were identified within 5 cM on either side of SSR, RFLP or STS loci previously identified as linked to agricultural traits. Conclusion Our barley consensus map provides a framework for transferring genetic information between different marker systems and for deploying DArT markers in molecular breeding schemes. The study also highlights the need for improved software for building consensus maps from high-density segregation data of multiple populations. PMID:16904008
Not all predicted CRISPR-Cas systems are equal: isolated cas genes and classes of CRISPR like elements.

PubMed

Zhang, Quan; Ye, Yuzhen

2017-02-06

The CRISPR-Cas systems in prokaryotes are RNA-guided immune systems that target and deactivate foreign nucleic acids. A typical CRISPR-Cas system consists of a CRISPR array of repeat and spacer units, and a locus of cas genes. The CRISPR and the cas locus are often located next to each other in the genomes. However, there is no quantitative estimate of the co-location. In addition, ad-hoc studies have shown that some non-CRISPR genomic elements contain repeat-spacer-like structures and are mistaken as CRISPRs. Using available genome sequences, we observed that a significant number of genomes have isolated cas loci and/or CRISPRs. We found that 11%, 22% and 28% of the type I, II and III cas loci are isolated (without CRISPRs in the same genomes at all or with CRISPRs distant in the genomes), respectively. We identified a large number of genomic elements that superficially reassemble CRISPRs but don't contain diverse spacers and have no companion cas genes. We called these elements false-CRISPRs and further classified them into groups, including tandem repeats and Staphylococcus aureus repeat (STAR)-like elements. This is the first systematic study to collect and characterize false-CRISPR elements. We demonstrated that false-CRISPRs could be used to reduce the false annotation of CRISPRs, therefore showing them to be useful for improving the annotation of CRISPR-Cas systems.
Interpreting short tandem repeat variations in humans using mutational constraint

PubMed Central

Gymrek, Melissa; Willems, Thomas; Reich, David; Erlich, Yaniv

2017-01-01

Identifying regions of the genome that are depleted of mutations can reveal potentially deleterious variants. Short tandem repeats (STRs), also known as microsatellites, are among the largest contributors of de novo mutations in humans. However, per-locus studies of STR mutations have been limited to highly ascertained panels of several dozen loci. Here, we harnessed bioinformatics tools and a novel analytical framework to estimate mutation parameters for each STR in the human genome by correlating STR genotypes with local sequence heterozygosity. We applied our method to obtain robust estimates of the impact of local sequence features on mutation parameters and used this to create a framework for measuring constraint at STRs by comparing observed vs. expected mutation rates. Constraint scores identified known pathogenic variants with early onset effects. Our metric will provide a valuable tool for prioritizing pathogenic STRs in medical genetics studies. PMID:28892063
Fragile DNA Motifs Trigger Mutagenesis at Distant Chromosomal Loci in Saccharomyces cerevisiae

PubMed Central

Saini, Natalie; Zhang, Yu; Nishida, Yuri; Sheng, Ziwei; Choudhury, Shilpa; Mieczkowski, Piotr; Lobachev, Kirill S.

2013-01-01

DNA sequences capable of adopting non-canonical secondary structures have been associated with gross-chromosomal rearrangements in humans and model organisms. Previously, we have shown that long inverted repeats that form hairpin and cruciform structures and triplex-forming GAA/TTC repeats induce the formation of double-strand breaks which trigger genome instability in yeast. In this study, we demonstrate that breakage at both inverted repeats and GAA/TTC repeats is augmented by defects in DNA replication. Increased fragility is associated with increased mutation levels in the reporter genes located as far as 8 kb from both sides of the repeats. The increase in mutations was dependent on the presence of inverted or GAA/TTC repeats and activity of the translesion polymerase Polζ. Mutagenesis induced by inverted repeats also required Sae2 which opens hairpin-capped breaks and initiates end resection. The amount of breakage at the repeats is an important determinant of mutations as a perfect palindromic sequence with inherently increased fragility was also found to elevate mutation rates even in replication-proficient strains. We hypothesize that the underlying mechanism for mutagenesis induced by fragile motifs involves the formation of long single-stranded regions in the broken chromosome, invasion of the undamaged sister chromatid for repair, and faulty DNA synthesis employing Polζ. These data demonstrate that repeat-mediated breaks pose a dual threat to eukaryotic genome integrity by inducing chromosomal aberrations as well as mutations in flanking genes. PMID:23785298
Molecular diversity analysis of Tetradium ruticarpum (WuZhuYu) in China based on inter-primer binding site (iPBS) markers and inter-simple sequence repeat (ISSR) markers.

PubMed

Xu, Jing-Yuan; Zhu, Yan; Yi, Ze; Wu, Gang; Xie, Guo-Yong; Qin, Min-Jian

2018-01-01

"Wu zhu yu", which is obtained from the dried unripe fruits of Tetradium ruticarpum (A. Jussieu) T. G. Hartley, has been used as a traditional Chinese medicine for treatment of headaches, abdominal colic, and hypertension for thousands of years. The present study was designed to assess the molecular genetic diversity among 25 collected accessions of T. ruticarpum (Wu zhu yu in Chinese) from different areas of China, based on inter-primer binding site (iPBS) markers and inter-simple sequence repeat (ISSR) markers. Thirteen ISSR primers generated 151 amplification bands, of which 130 were polymorphic. Out of 165 bands that were amplified using 10 iPBS primers, 152 were polymorphic. The iPBS markers displayed a higher proportion of polymorphic loci (PPL = 92.5%) than the ISSR markers (PPL = 84.9%). The results showed that T. ruticarpum possessed high loci polymorphism and genetic differentiation occurred in this plant. The combined data of iPBS and ISSR markers scored on 25 accessions produced five clusters that approximately matched the geographic distribution of the species. The results indicated that both iPBS and ISSR markers were reliable and effective tools for analyzing the genetic diversity in T. ruticarpum. Copyright © 2018 China Pharmaceutical University. Published by Elsevier B.V. All rights reserved.

Abundance and Characterization of Perfect Microsatellites on the Cattle Y Chromosome.

PubMed

Ma, Zhi-Jie

2017-07-03

Microsatellites or simple sequence repeats (SSRs) are found in most organisms and play an important role in genomic organization and function. To characterize the abundance of SSRs (1-6 base-pairs [bp]) on the cattle Y chromsome, the relative frequency and density of perfect or uninterrupted SSRs based on the published Y chromosome sequence were examined. A total of 17,273 perfect SSRs were found, with total length of 324.78 kb, indicating that approximately 0.75% of the cattle Y chromosome sequence (43.30 Mb) comprises perfect SSRs, with an average length of 18.80 bp. The relative frequency and density were 398.92 loci/Mb and 7500.62 bp/Mb, respectively. The proportions of the six classes of perfect SSRs were highly variable on the cattle Y chromosome. Mononucleotide repeats had a total number of 8073 (46.74%) and an average length of 15.45 bp, and were the most abundant SSRs class, while the percentages of di-, tetra-, tri-, penta-, and hexa-nucleotide repeats were 22.86%, 11.98%, 11.58%, 6.65%, and 0.19%, respectively. Different classes of SSRs varied in their repeat number, with the highest being 42 for dinucleotides. Results reveal that repeat categories A, AC, AT, AAC, AGC, GTTT, CTTT, ATTT, and AACTG predominate on the Y chromosome. This study provides insight into the organization of cattle Y chromosome repetitive DNA, as well as information useful for developing more polymorphic cattle Y-chromosome-specific SSRs.
Short interspersed element (SINE) depletion and long interspersed element (LINE) abundance are not features universally required for imprinting.

PubMed

Cowley, Michael; de Burca, Anna; McCole, Ruth B; Chahal, Mandeep; Saadat, Ghazal; Oakey, Rebecca J; Schulz, Reiner

2011-04-20

Genomic imprinting is a form of gene dosage regulation in which a gene is expressed from only one of the alleles, in a manner dependent on the parent of origin. The mechanisms governing imprinted gene expression have been investigated in detail and have greatly contributed to our understanding of genome regulation in general. Both DNA sequence features, such as CpG islands, and epigenetic features, such as DNA methylation and non-coding RNAs, play important roles in achieving imprinted expression. However, the relative importance of these factors varies depending on the locus in question. Defining the minimal features that are absolutely required for imprinting would help us to understand how imprinting has evolved mechanistically. Imprinted retrogenes are a subset of imprinted loci that are relatively simple in their genomic organisation, being distinct from large imprinting clusters, and have the potential to be used as tools to address this question. Here, we compare the repeat element content of imprinted retrogene loci with non-imprinted controls that have a similar locus organisation. We observe no significant differences that are conserved between mouse and human, suggesting that the paucity of SINEs and relative abundance of LINEs at imprinted loci reported by others is not a sequence feature universally required for imprinting.
Asymmetric Epigenetic Modification and Elimination of rDNA Sequences by Polyploidization in Wheat[W

PubMed Central

Guo, Xiang

2014-01-01

rRNA genes consist of long tandem repeats clustered on chromosomes, and their products are important functional components of the ribosome. In common wheat (Triticum aestivum), rDNA loci from the A and D genomes were largely lost during the evolutionary process. This biased DNA elimination may be related to asymmetric transcription and epigenetic modifications caused by the polyploid formation. Here, we observed both sets of parental nucleolus organizing regions (NORs) were expressed after hybridization, but asymmetric silencing of one parental NOR was immediately induced by chromosome doubling, and reversing the ploidy status could not reactivate silenced NORs. Furthermore, increased CHG and CHH DNA methylation on promoters was accompanied by asymmetric silencing of NORs. Enrichment of H3K27me3 and H3K9me2 modifications was also observed to be a direct response to increased DNA methylation and transcriptional inactivation of NOR loci. Both A and D genome NOR loci with these modifications started to disappear in the S4 generation and were completely eliminated by the S7 generation in synthetic tetraploid wheat. Our results indicated that asymmetric epigenetic modification and elimination of rDNA sequences between different donor genomes may lead to stable allopolyploid wheat with increased differentiation and diversity. PMID:25415973
Asymmetric epigenetic modification and elimination of rDNA sequences by polyploidization in wheat.

PubMed

Guo, Xiang; Han, Fangpu

2014-11-01

rRNA genes consist of long tandem repeats clustered on chromosomes, and their products are important functional components of the ribosome. In common wheat (Triticum aestivum), rDNA loci from the A and D genomes were largely lost during the evolutionary process. This biased DNA elimination may be related to asymmetric transcription and epigenetic modifications caused by the polyploid formation. Here, we observed both sets of parental nucleolus organizing regions (NORs) were expressed after hybridization, but asymmetric silencing of one parental NOR was immediately induced by chromosome doubling, and reversing the ploidy status could not reactivate silenced NORs. Furthermore, increased CHG and CHH DNA methylation on promoters was accompanied by asymmetric silencing of NORs. Enrichment of H3K27me3 and H3K9me2 modifications was also observed to be a direct response to increased DNA methylation and transcriptional inactivation of NOR loci. Both A and D genome NOR loci with these modifications started to disappear in the S4 generation and were completely eliminated by the S7 generation in synthetic tetraploid wheat. Our results indicated that asymmetric epigenetic modification and elimination of rDNA sequences between different donor genomes may lead to stable allopolyploid wheat with increased differentiation and diversity. © 2014 American Society of Plant Biologists. All rights reserved.
Development of microsatellite markers in Caryophyllaeus laticeps (Cestoda: Caryophyllidea), monozoic fish tapeworm, using next-generation sequencing approach.

PubMed

Králová-Hromadová, Ivica; Minárik, Gabriel; Bazsalovicsová, Eva; Mikulíček, Peter; Oravcová, Alexandra; Pálková, Lenka; Hanzelová, Vladimíra

2015-02-01

Caryophyllaeus laticeps (Pallas 1781) (Cestoda: Caryophyllidea) is a monozoic tapeworm of cyprinid fishes with a distribution area that includes Europe, most of the Palaearctic Asia and northern Africa. Broad geographic distribution, wide range of definitive fish hosts and recently revealed high morphological plasticity of the parasite, which is not in an agreement with molecular findings, make this species to be an interesting model for population biology studies. Microsatellites (short tandem repeat (STR) markers), as predominant markers for population genetics, were designed for C. laticeps using a next-generation sequencing (NGS) approach. Out of 165 marker candidates, 61 yielded PCR products of the expected size and in 25 of the candidates a declared repetitive motif was confirmed by Sanger sequencing. After the fragment analysis, six loci were proved to be polymorphic and tested for heterozygosity, Hardy-Weinberg equilibrium and the presence of null alleles on 59 individuals coming from three geographically widely separated populations (Slovakia, Russia and UK). The number of alleles in particular loci and populations ranged from two to five. Significant deficit of heterozygotes and the presence of null alleles were found in one locus in all three populations. Other loci showed deviations from Hardy-Weinberg equilibrium and the presence of null alleles only in some populations. In spite of relatively low polymorphism and the potential presence of null alleles, newly developed microsatellites may be applied as suitable markers in population genetic studies of C. laticeps.
CRISPR-Cas systems: Prokaryotes upgrade to adaptive immunity.

PubMed

Barrangou, Rodolphe; Marraffini, Luciano A

2014-04-24

Clustered regularly interspaced short palindromic repeats (CRISPR), and associated proteins (Cas) comprise the CRISPR-Cas system, which confers adaptive immunity against exogenic elements in many bacteria and most archaea. CRISPR-mediated immunization occurs through the uptake of DNA from invasive genetic elements such as plasmids and viruses, followed by its integration into CRISPR loci. These loci are subsequently transcribed and processed into small interfering RNAs that guide nucleases for specific cleavage of complementary sequences. Conceptually, CRISPR-Cas shares functional features with the mammalian adaptive immune system, while also exhibiting characteristics of Lamarckian evolution. Because immune markers spliced from exogenous agents are integrated iteratively in CRISPR loci, they constitute a genetic record of vaccination events and reflect environmental conditions and changes over time. Cas endonucleases, which can be reprogrammed by small guide RNAs have shown unprecedented potential and flexibility for genome editing and can be repurposed for numerous DNA targeting applications including transcriptional control. Copyright © 2014 Elsevier Inc. All rights reserved.
Genetic analysis of a novel Xylella fastidiosa subspecies found in the southwestern United States.

PubMed

Randall, Jennifer J; Goldberg, Natalie P; Kemp, John D; Radionenko, Maxim; French, Jason M; Olsen, Mary W; Hanson, Stephen F

2009-09-01

Xylella fastidiosa, the causal agent of several scorch diseases, is associated with leaf scorch symptoms in Chitalpa tashkentensis, a common ornamental landscape plant used throughout the southwestern United States. For a number of years, many chitalpa trees in southern New Mexico and Arizona exhibited leaf scorch symptoms, and the results from a regional survey show that chitalpa trees from New Mexico, Arizona, and California are frequently infected with X. fastidiosa. Phylogenetic analysis of multiple loci was used to compare the X. fastidiosa infecting chitalpa strains from New Mexico, Arizona, and trees imported into New Mexico nurseries with previously reported X. fastidiosa strains. Loci analyzed included the 16S ribosome, 16S-23S ribosomal intergenic spacer region, gyrase-B, simple sequence repeat sequences, X. fastidiosa-specific sequences, and the virulence-associated protein (VapD). This analysis indicates that the X. fastidiosa isolates associated with infected chitalpa trees in the Southwest are a highly related group that is distinct from the four previously defined taxons X. fastidiosa subsp. fastidiosa (piercei), X. fastidiosa subsp. multiplex, X. fastidiosa subsp. sandyi, and X. fastidiosa subsp. pauca. Therefore, the classification proposed for this new subspecies is X. fastidiosa subsp. tashke.
Global population genetic structure and male-mediated gene flow in the green sea turtle (Chelonia mydas): analysis of microsatellite loci.

PubMed Central

Roberts, Mark A; Schwartz, Tonia S; Karl, Stephen A

2004-01-01

We assessed the degree of population subdivision among global populations of green sea turtles, Chelonia mydas, using four microsatellite loci. Previously, a single-copy nuclear DNA study indicated significant male-mediated gene flow among populations alternately fixed for different mitochondrial DNA haplotypes and that genetic divergence between populations in the Atlantic and Pacific Oceans was more common than subdivisions among populations within ocean basins. Even so, overall levels of variation at single-copy loci were low and inferences were limited. Here, the markedly more variable microsatellite loci confirm the presence of male-mediated gene flow among populations within ocean basins. This analysis generally confirms the genetic divergence between the Atlantic and Pacific. As with the previous study, phylogenetic analyses of genetic distances based on the microsatellite loci indicate a close genetic relationship among eastern Atlantic and Indian Ocean populations. Unlike the single-copy study, however, the results here cannot be attributed to an artifact of general low variability and likely represent recent or ongoing migration between ocean basins. Sequence analyses of regions flanking the microsatellite repeat reveal considerable amounts of cryptic variation and homoplasy and significantly aid in our understanding of population connectivity. Assessment of the allele frequency distributions indicates that at least some of the loci may not be evolving by the stepwise mutation model. PMID:15126404
Characterization of microsatellite loci and reliable genotyping in a polyploid plant, Mercurialis perennis (Euphorbiaceae).

PubMed

Pfeiffer, Tanja; Roschanski, Anna M; Pannell, John R; Korbecka, Grazyna; Schnittler, Martin

2011-01-01

For many applications in population genetics, codominant simple sequence repeats (SSRs) may have substantial advantages over dominant anonymous markers such as amplified fragment length polymorphisms (AFLPs). In high polyploids, however, allele dosage of SSRs cannot easily be determined and alleles are not easily attributable to potentially diploidized loci. Here, we argue that SSRs may nonetheless be better than AFLPs for polyploid taxa if they are analyzed as effectively dominant markers because they are more reliable and more precise. We describe the transfer of SSRs developed for diploid Mercurialis huetii to the clonal dioecious M. perennis. Primers were tested on a set of 54 male and female plants from natural decaploid populations. Eight of 65 tested loci produced polymorphic fragments. Binary profiles from 4 different scoring routines were used to define multilocus lineages (MLLs). Allowing for fragment differences within 1 MLL, all analyses revealed the same 14 MLLs without conflicting with merigenet, sex, or plot assignment. For semiautomatic scoring, a combination of as few as 2 of the 4 most polymorphic loci resulted in unambiguous discrimination of clones. Our study demonstrates that microsatellite fingerprinting of polyploid plants is a cost efficient and reliable alternative to AFLPs, not least because fewer loci are required than for diploids.
Core genome conservation of Staphylococcus haemolyticus limits sequence based population structure analysis.

PubMed

Cavanagh, Jorunn Pauline; Klingenberg, Claus; Hanssen, Anne-Merethe; Fredheim, Elizabeth Aarag; Francois, Patrice; Schrenzel, Jacques; Flægstad, Trond; Sollid, Johanna Ericson

2012-06-01

The notoriously multi-resistant Staphylococcus haemolyticus is an emerging pathogen causing serious infections in immunocompromised patients. Defining the population structure is important to detect outbreaks and spread of antimicrobial resistant clones. Currently, the standard typing technique is pulsed-field gel electrophoresis (PFGE). In this study we describe novel molecular typing schemes for S. haemolyticus using multi locus sequence typing (MLST) and multi locus variable number of tandem repeats (VNTR) analysis. Seven housekeeping genes (MLST) and five VNTR loci (MLVF) were selected for the novel typing schemes. A panel of 45 human and veterinary S. haemolyticus isolates was investigated. The collection had diverse PFGE patterns (38 PFGE types) and was sampled over a 20 year-period from eight countries. MLST resolved 17 sequence types (Simpsons index of diversity [SID]=0.877) and MLVF resolved 14 repeat types (SID=0.831). We found a low sequence diversity. Phylogenetic analysis clustered the isolates in three (MLST) and one (MLVF) clonal complexes, respectively. Taken together, neither the MLST nor the MLVF scheme was suitable to resolve the population structure of this S. haemolyticus collection. Future MLVF and MLST schemes will benefit from addition of more variable core genome sequences identified by comparing different fully sequenced S. haemolyticus genomes. Copyright © 2012 Elsevier B.V. All rights reserved.
High-Resolution Whole-Genome Sequencing Reveals That Specific Chromatin Domains from Most Human Chromosomes Associate with Nucleoli

PubMed Central

van Koningsbruggen, Silvana; Gierliński, Marek; Schofield, Pietá; Martin, David; Barton, Geoffey J.; Ariyurek, Yavuz; den Dunnen, Johan T.

2010-01-01

The nuclear space is mostly occupied by chromosome territories and nuclear bodies. Although this organization of chromosomes affects gene function, relatively little is known about the role of nuclear bodies in the organization of chromosomal regions. The nucleolus is the best-studied subnuclear structure and forms around the rRNA repeat gene clusters on the acrocentric chromosomes. In addition to rDNA, other chromatin sequences also surround the nucleolar surface and may even loop into the nucleolus. These additional nucleolar-associated domains (NADs) have not been well characterized. We present here a whole-genome, high-resolution analysis of chromatin endogenously associated with nucleoli. We have used a combination of three complementary approaches, namely fluorescence comparative genome hybridization, high-throughput deep DNA sequencing and photoactivation combined with time-lapse fluorescence microscopy. The data show that specific sequences from most human chromosomes, in addition to the rDNA repeat units, associate with nucleoli in a reproducible and heritable manner. NADs have in common a high density of AT-rich sequence elements, low gene density and a statistically significant enrichment in transcriptionally repressed genes. Unexpectedly, both the direct DNA sequencing and fluorescence photoactivation data show that certain chromatin loci can specifically associate with either the nucleolus, or the nuclear envelope. PMID:20826608
High-resolution whole-genome sequencing reveals that specific chromatin domains from most human chromosomes associate with nucleoli.

PubMed

van Koningsbruggen, Silvana; Gierlinski, Marek; Schofield, Pietá; Martin, David; Barton, Geoffey J; Ariyurek, Yavuz; den Dunnen, Johan T; Lamond, Angus I

2010-11-01

The nuclear space is mostly occupied by chromosome territories and nuclear bodies. Although this organization of chromosomes affects gene function, relatively little is known about the role of nuclear bodies in the organization of chromosomal regions. The nucleolus is the best-studied subnuclear structure and forms around the rRNA repeat gene clusters on the acrocentric chromosomes. In addition to rDNA, other chromatin sequences also surround the nucleolar surface and may even loop into the nucleolus. These additional nucleolar-associated domains (NADs) have not been well characterized. We present here a whole-genome, high-resolution analysis of chromatin endogenously associated with nucleoli. We have used a combination of three complementary approaches, namely fluorescence comparative genome hybridization, high-throughput deep DNA sequencing and photoactivation combined with time-lapse fluorescence microscopy. The data show that specific sequences from most human chromosomes, in addition to the rDNA repeat units, associate with nucleoli in a reproducible and heritable manner. NADs have in common a high density of AT-rich sequence elements, low gene density and a statistically significant enrichment in transcriptionally repressed genes. Unexpectedly, both the direct DNA sequencing and fluorescence photoactivation data show that certain chromatin loci can specifically associate with either the nucleolus, or the nuclear envelope.
Short tandem repeat analysis in Japanese population.

PubMed

Hashiyada, M

2000-01-01

Short tandem repeats (STRs), known as microsatellites, are one of the most informative genetic markers for characterizing biological materials. Because of the relatively small size of STR alleles (generally 100-350 nucleotides), amplification by polymerase chain reaction (PCR) is relatively easy, affording a high sensitivity of detection. In addition, STR loci can be amplified simultaneously in a multiplex PCR. Thus, substantial information can be obtained in a single analysis with the benefits of using less template DNA, reducing labor, and reducing the contamination. We investigated 14 STR loci in a Japanese population living in Sendai by three multiplex PCR kits, GenePrint PowerPlex 1.1 and 2.2. Fluorescent STR System (Promega, Madison, WI, USA) and AmpF/STR Profiler (Perkin-Elmer, Norwalk, CT, USA). Genomic DNA was extracted using sodium dodecyl sulfate (SDS) proteinase K or Chelex 100 treatment followed by the phenol/chloroform extraction. PCR was performed according to the manufacturer's protocols. Electrophoresis was carried out on an ABI 377 sequencer and the alleles were determined by GeneScan 2.0.2 software (Perkin-Elmer). In 14 STRs loci, statistical parameters indicated a relatively high rate, and no significant deviation from Hardy-Weinberg equilibrium was detected. We apply this STR system to paternity testing and forensic casework, e.g., personal identification in rape cases. This system is an effective tool in the forensic sciences to obtain information on individual identification.
Characterization of replication and conjugation of plasmid pWTY27 from a widely distributed Streptomyces species

PubMed Central

2012-01-01

Background Streptomyces species are widely distributed in natural habitats, such as soils, lakes, plants and some extreme environments. Replication loci of several Streptomyces theta-type plasmids have been reported, but are not characterized in details. Conjugation loci of some Streptomyces rolling-circle-type plasmids are identified and mechanism of conjugal transferring are described. Results We report the detection of a widely distributed Streptomyces strain Y27 and its indigenous plasmid pWTY27 from fourteen plants and four soil samples cross China by both culturing and nonculturing methods. The complete nucleotide sequence of pWTY27 consisted of 14,288 bp. A basic locus for plasmid replication comprised repAB genes and an adjacent iteron sequence, to a long inverted-repeat (ca. 105 bp) of which the RepA protein bound specifically in vitro, suggesting that RepA may recognize a second structure (e.g. a long stem-loop) of the iteron DNA. A plasmid containing the locus propagated in linear mode when the telomeres of a linear plasmid were attached, indicating a bi-directional replication mode for pWTY27. As for rolling-circle plasmids, a single traA gene and a clt sequence (covering 16 bp within traA and its adjacent 159 bp) on pWTY27 were required for plasmid transfer. TraA recognized and bound specifically to the two regions of the clt sequence, one containing all the four DC1 of 7 bp (TGACACC) and one DC2 (CCCGCCC) and most of IC1, and another covering two DC2 and part of IC1, suggesting formation of a high-ordered DNA-protein complex. Conclusions This work (i) isolates a widespread Streptomyces strain Y27 and sequences its indigenous theta-type plasmid pWTY27; (ii) identifies the replication and conjugation loci of pWTY27 and; (iii) characterizes the binding sequences of the RepA and TraA proteins. PMID:23134842
Identification of Genetic Elements Associated with EPSPS Gene Amplification

PubMed Central

Gaines, Todd A.; Wright, Alice A.; Molin, William T.; Lorentz, Lothar; Riggins, Chance W.; Tranel, Patrick J.; Beffa, Roland; Westra, Philip; Powles, Stephen B.

2013-01-01

Weed populations can have high genetic plasticity and rapid responses to environmental selection pressures. For example, 100-fold amplification of the 5-enolpyruvylshikimate-3-phosphate synthase (EPSPS) gene evolved in the weed species Amaranthus palmeri to confer resistance to glyphosate, the world’s most important herbicide. However, the gene amplification mechanism is unknown. We sequenced the EPSPS gene and genomic regions flanking EPSPS loci in A. palmeri, and searched for mobile genetic elements or repetitive sequences. The EPSPS gene was 10,229 bp, containing 8 exons and 7 introns. The gene amplification likely proceeded through a DNA-mediated mechanism, as introns exist in the amplified gene copies and the entire amplified sequence is at least 30 kb in length. Our data support the presence of two EPSPS loci in susceptible (S) A. palmeri, and that only one of these was amplified in glyphosate-resistant (R) A. palmeri. The EPSPS gene amplification event likely occurred recently, as no sequence polymorphisms were found within introns of amplified EPSPS copies from R individuals. Sequences with homology to miniature inverted-repeat transposable elements (MITEs) were identified next to EPSPS gene copies only in R individuals. Additionally, a putative Activator (Ac) transposase and a repetitive sequence region were associated with amplified EPSPS genes. The mechanism controlling this DNA-mediated amplification remains unknown. Further investigation is necessary to determine if the gene amplification may have proceeded via DNA transposon-mediated replication, and/or unequal recombination between different genomic regions resulting in replication of the EPSPS gene. PMID:23762434
A comprehensive characterization of simple sequence repeats in pepper genomes provides valuable resources for marker development in Capsicum.

PubMed

Cheng, Jiaowen; Zhao, Zicheng; Li, Bo; Qin, Cheng; Wu, Zhiming; Trejo-Saavedra, Diana L; Luo, Xirong; Cui, Junjie; Rivera-Bustamante, Rafael F; Li, Shuaicheng; Hu, Kailin

2016-01-07

The sequences of the full set of pepper genomes including nuclear, mitochondrial and chloroplast are now available for use. However, the overall of simple sequence repeats (SSR) distribution in these genomes and their practical implications for molecular marker development in Capsicum have not yet been described. Here, an average of 868,047.50, 45.50 and 30.00 SSR loci were identified in the nuclear, mitochondrial and chloroplast genomes of pepper, respectively. Subsequently, systematic comparisons of various species, genome types, motif lengths, repeat numbers and classified types were executed and discussed. In addition, a local database composed of 113,500 in silico unique SSR primer pairs was built using a homemade bioinformatics workflow. As a pilot study, 65 polymorphic markers were validated among a wide collection of 21 Capsicum genotypes with allele number and polymorphic information content value per marker raging from 2 to 6 and 0.05 to 0.64, respectively. Finally, a comparison of the clustering results with those of a previous study indicated the usability of the newly developed SSR markers. In summary, this first report on the comprehensive characterization of SSR motifs in pepper genomes and the very large set of SSR primer pairs will benefit various genetic studies in Capsicum.
A comprehensive characterization of simple sequence repeats in pepper genomes provides valuable resources for marker development in Capsicum

PubMed Central

Cheng, Jiaowen; Zhao, Zicheng; Li, Bo; Qin, Cheng; Wu, Zhiming; Trejo-Saavedra, Diana L.; Luo, Xirong; Cui, Junjie; Rivera-Bustamante, Rafael F.; Li, Shuaicheng; Hu, Kailin

2016-01-01

The sequences of the full set of pepper genomes including nuclear, mitochondrial and chloroplast are now available for use. However, the overall of simple sequence repeats (SSR) distribution in these genomes and their practical implications for molecular marker development in Capsicum have not yet been described. Here, an average of 868,047.50, 45.50 and 30.00 SSR loci were identified in the nuclear, mitochondrial and chloroplast genomes of pepper, respectively. Subsequently, systematic comparisons of various species, genome types, motif lengths, repeat numbers and classified types were executed and discussed. In addition, a local database composed of 113,500 in silico unique SSR primer pairs was built using a homemade bioinformatics workflow. As a pilot study, 65 polymorphic markers were validated among a wide collection of 21 Capsicum genotypes with allele number and polymorphic information content value per marker raging from 2 to 6 and 0.05 to 0.64, respectively. Finally, a comparison of the clustering results with those of a previous study indicated the usability of the newly developed SSR markers. In summary, this first report on the comprehensive characterization of SSR motifs in pepper genomes and the very large set of SSR primer pairs will benefit various genetic studies in Capsicum. PMID:26739748
Cas9 specifies functional viral targets during CRISPR-Cas adaptation.

PubMed

Heler, Robert; Samai, Poulami; Modell, Joshua W; Weiner, Catherine; Goldberg, Gregory W; Bikard, David; Marraffini, Luciano A

2015-03-12

Clustered regularly interspaced short palindromic repeat (CRISPR) loci and their associated (Cas) proteins provide adaptive immunity against viral infection in prokaryotes. Upon infection, short phage sequences known as spacers integrate between CRISPR repeats and are transcribed into small RNA molecules that guide the Cas9 nuclease to the viral targets (protospacers). Streptococcus pyogenes Cas9 cleavage of the viral genome requires the presence of a 5'-NGG-3' protospacer adjacent motif (PAM) sequence immediately downstream of the viral target. It is not known whether and how viral sequences flanked by the correct PAM are chosen as new spacers. Here we show that Cas9 selects functional spacers by recognizing their PAM during spacer acquisition. The replacement of cas9 with alleles that lack the PAM recognition motif or recognize an NGGNG PAM eliminated or changed PAM specificity during spacer acquisition, respectively. Cas9 associates with other proteins of the acquisition machinery (Cas1, Cas2 and Csn2), presumably to provide PAM-specificity to this process. These results establish a new function for Cas9 in the genesis of prokaryotic immunological memory.
The complete chloroplast genome sequence of Actinidia arguta using the PacBio RS II platform

PubMed Central

Lin, Miaomiao; Qi, Xiujuan; Chen, Jinyong; Sun, Leiming; Zhong, Yunpeng; Fang, Jinbao; Hu, Chungen

2018-01-01

Actinidia arguta is the most basal species in a phylogenetically and economically important genus in the family Actinidiaceae. To better understand the molecular basis of the Actinidia arguta chloroplast (cp), we sequenced the complete cp genome from A. arguta using Illumina and PacBio RS II sequencing technologies. The cp genome from A. arguta was 157,611 bp in length and composed of a pair of 24,232 bp inverted repeats (IRs) separated by a 20,463 bp small single copy region (SSC) and an 88,684 bp large single copy region (LSC). Overall, the cp genome contained 113 unique genes. The cp genomes from A. arguta and three other Actinidia species from GenBank were subjected to a comparative analysis. Indel mutation events and high frequencies of base substitution were identified, and the accD and ycf2 genes showed a high degree of variation within Actinidia. Forty-seven simple sequence repeats (SSRs) and 155 repetitive structures were identified, further demonstrating the rapid evolution in Actinidia. The cp genome analysis and the identification of variable loci provide vital information for understanding the evolution and function of the chloroplast and for characterizing Actinidia population genetics. PMID:29795601
Simple sequence repeats in Escherichia coli: abundance, distribution, composition, and polymorphism.

PubMed

Gur-Arie, R; Cohen, C J; Eitan, Y; Shelef, L; Hallerman, E M; Kashi, Y

2000-01-01

Computer-based genome-wide screening of the DNA sequence of Escherichia coli strain K12 revealed tens of thousands of tandem simple sequence repeat (SSR) tracts, with motifs ranging from 1 to 6 nucleotides. SSRs were well distributed throughout the genome. Mononucleotide SSRs were over-represented in noncoding regions and under-represented in open reading frames (ORFs). Nucleotide composition of mono- and dinucleotide SSRs, both in ORFs and in noncoding regions, differed from that of the genomic region in which they occurred, with 93% of all mononucleotide SSRs proving to be of A or T. Computer-based analysis of the fine position of every SSR locus in the noncoding portion of the genome relative to downstream ORFs showed SSRs located in areas that could affect gene regulation. DNA sequences at 14 arbitrarily chosen SSR tracts were compared among E. coli strains. Polymorphisms of SSR copy number were observed at four of seven mononucleotide SSR tracts screened, with all polymorphisms occurring in noncoding regions. SSR polymorphism could prove important as a genome-wide source of variation, both for practical applications (including rapid detection, strain identification, and detection of loci affecting key phenotypes) and for evolutionary adaptation of microbes.

Tandem repeat regions within the Burkholderia pseudomallei genome and their application for high resolution genotyping.

PubMed

U'Ren, Jana M; Schupp, James M; Pearson, Talima; Hornstra, Heidie; Friedman, Christine L Clark; Smith, Kimothy L; Daugherty, Rebecca R Leadem; Rhoton, Shane D; Leadem, Ben; Georgia, Shalamar; Cardon, Michelle; Huynh, Lynn Y; DeShazer, David; Harvey, Steven P; Robison, Richard; Gal, Daniel; Mayo, Mark J; Wagner, David; Currie, Bart J; Keim, Paul

2007-03-30

The facultative, intracellular bacterium Burkholderia pseudomallei is the causative agent of melioidosis, a serious infectious disease of humans and animals. We identified and categorized tandem repeat arrays and their distribution throughout the genome of B. pseudomallei strain K96243 in order to develop a genetic typing method for B. pseudomallei. We then screened 104 of the potentially polymorphic loci across a diverse panel of 31 isolates including B. pseudomallei, B. mallei and B. thailandensis in order to identify loci with varying degrees of polymorphism. A subset of these tandem repeat arrays were subsequently developed into a multiple-locus VNTR analysis to examine 66 B. pseudomallei and 21 B. mallei isolates from around the world, as well as 95 lineages from a serial transfer experiment encompassing ~18,000 generations. B. pseudomallei contains a preponderance of tandem repeat loci throughout its genome, many of which are duplicated elsewhere in the genome. The majority of these loci are composed of repeat motif lengths of 6 to 9 bp with 4 to 10 repeat units and are predominately located in intergenic regions of the genome. Across geographically diverse B. pseudomallei and B.mallei isolates, the 32 VNTR loci displayed between 7 and 28 alleles, with Nei's diversity values ranging from 0.47 and 0.94. Mutation rates for these loci are comparable (>10-5 per locus per generation) to that of the most diverse tandemly repeated regions found in other less diverse bacteria. The frequency, location and duplicate nature of tandemly repeated regions within the B. pseudomallei genome indicate that these tandem repeat regions may play a role in generating and maintaining adaptive genomic variation. Multiple-locus VNTR analysis revealed extensive diversity within the global isolate set containing B. pseudomallei and B. mallei, and it detected genotypic differences within clonal lineages of both species that were identical using previous typing methods. Given the health threat to humans and livestock and the potential for B. pseudomallei to be released intentionally, MLVA could prove to be an important tool for fine-scale epidemiological or forensic tracking of this increasingly important environmental pathogen.
A reference genetic linkage map of apomictic Hieracium species based on expressed markers derived from developing ovule transcripts

PubMed Central

Shirasawa, Kenta; Hand, Melanie L.; Henderson, Steven T.; Okada, Takashi; Johnson, Susan D.; Taylor, Jennifer M.; Spriggs, Andrew; Siddons, Hayley; Hirakawa, Hideki; Isobe, Sachiko; Tabata, Satoshi; Koltunow, Anna M. G.

2015-01-01

Background and Aims Apomixis in plants generates clonal progeny with a maternal genotype through asexual seed formation. Hieracium subgenus Pilosella (Asteraceae) contains polyploid, highly heterozygous apomictic and sexual species. Within apomictic Hieracium, dominant genetic loci independently regulate the qualitative developmental components of apomixis. In H. praealtum, LOSS OF APOMEIOSIS (LOA) enables formation of embryo sacs without meiosis and LOSS OF PARTHENOGENESIS (LOP) enables fertilization-independent seed formation. A locus required for fertilization-independent endosperm formation (AutE) has been identified in H. piloselloides. Additional quantitative loci appear to influence the penetrance of the qualitative loci, although the controlling genes remain unknown. This study aimed to develop the first genetic linkage maps for sexual and apomictic Hieracium species using simple sequence repeat (SSR) markers derived from expressed transcripts within the developing ovaries. Methods RNA from microdissected Hieracium ovule cell types and ovaries was sequenced and SSRs were identified. Two different F1 mapping populations were created to overcome difficulties associated with genome complexity and asexual reproduction. SSR markers were analysed within each mapping population to generate draft linkage maps for apomictic and sexual Hieracium species. Key Results A collection of 14 684 Hieracium expressed SSR markers were developed and linkage maps were constructed for Hieracium species using a subset of the SSR markers. Both the LOA and LOP loci were successfully assigned to linkage groups; however, AutE could not be mapped using the current populations. Comparisons with lettuce (Lactuca sativa) revealed partial macrosynteny between the two Asteraceae species. Conclusions A collection of SSR markers and draft linkage maps were developed for two apomictic and one sexual Hieracium species. These maps will support cloning of controlling genes at LOA and LOP loci in Hieracium and should also assist with identification of quantitative loci that affect the expressivity of apomixis. Future work will focus on mapping AutE using alternative populations. PMID:25538115
The 2.1-kb inverted repeat DNA sequences flank the mat2,3 silent region in two species of Schizosaccharomyces and are involved in epigenetic silencing in Schizosaccharomyces pombe.

PubMed Central

Singh, Gurjeet; Klar, Amar J S

2002-01-01

The mat2,3 region of the fission yeast Schizosaccharomyces pombe exhibits a phenomenon of transcriptional silencing. This region is flanked by two identical DNA sequence elements, 2.1 kb in length, present in inverted orientation: IRL on the left and IRR on the right of the silent region. The repeats do not encode any ORF. The inverted repeat DNA region is also present in a newly identified related species, which we named S. kambucha. Interestingly, the left and right repeats share perfect identity within a species, but show approximately 2% bases interspecies variation. Deletion of IRL results in variegated expression of markers inserted in the silent region, while deletion of the IRR causes their derepression. When deletions of these repeats were genetically combined with mutations in different trans-acting genes previously shown to cause a partial defect in silencing, only mutations in clr1 and clr3 showed additive defects in silencing with the deletion of IRL. The rate of mat1 switching is also affected by deletion of repeats. The IRL or IRR deletion did not cause significant derepression of the mat2 or mat3 loci. These results implicate repeats for maintaining full repression of the mat2,3 region, for efficient mat1 switching, and further support the notion that multiple pathways cooperate to silence the mat2,3 domain. PMID:12399374
Expansion of 50 CAG/CTG repeats excluded in schizophrenia by application of a highly efficient approach using repeat expansion detection and a PCR screening set

DOE Office of Scientific and Technical Information (OSTI.GOV)

Bowen, T.; Guy, C.; Speight, G.

Studies of the transmission of schizophrenia in families with affected members in several generations have suggested that an expanded trinucleotide repeat mechanism may contribute to the genetic inheritance of this disorder. Using repeat expansion detection (RED), we and others have previously found that the distribution of CAG/CTG repeat size is larger in patients with schizophrenia than in controls. In an attempt to identify the specific expanded CAG/CTG locus or loci associated with schizophrenia, we have now used an approach based on a CAG/CTG PCR screening set combined with RED data. This has allowed us to minimize genotyping while excluding 43more » polymorphic autosomal loci and 7 X-chromosomal loci from the screening set as candidates for expansion in schizophrenia with a very high degree of confidence. 18 refs., 1 tab.« less
Non-canonical ribosomal DNA segments in the human genome, and nucleoli functioning.

PubMed

Kupriyanova, Natalia S; Netchvolodov, Kirill K; Sadova, Anastasia A; Cherepanova, Marina D; Ryskov, Alexei P

2015-11-10

Ribosomal DNA (rDNA) in the human genome is represented by tandem repeats of 43 kb nucleotide sequences that form nucleoli organizers (NORs) on each of five pairs of acrocentric chromosomes. RDNA-similar segments of different lengths are also present on (NOR)(-) chromosomes. Many of these segments contain nucleotide substitutions, supplementary microsatellite clusters, and extended deletions. Recently, it was shown that, in addition to ribosome biogenesis, nucleoli exhibit additional functions, such as cell-cycle regulation and response to stresses. In particular, several stress-inducible loci located in the ribosomal intergenic spacer (rIGS) produce stimuli-specific noncoding nucleolus RNAs. By mapping the 5'/3' ends of the rIGS segments scattered throughout (NOR)(-) chromosomes, we discovered that the bonds in the rIGS that were most often susceptible to disruption in the rIGS were adjacent to, or overlapped with stimuli-specific inducible loci. This suggests the interconnection of the two phenomena - nucleoli functioning and the scattering of rDNA-like sequences on (NOR)(-) chromosomes. Copyright © 2015 Elsevier B.V. All rights reserved.
Genome editing technologies to fight infectious diseases.

PubMed

Trevisan, Marta; Palù, Giorgio; Barzon, Luisa

2017-11-01

Genome editing by programmable nucleases represents a promising tool that could be exploited to develop new therapeutic strategies to fight infectious diseases. These nucleases, such as zinc-finger nucleases, transcription activator-like effector nucleases, clustered regularly interspaced short palindromic repeat (CRISPR)-CRISPR-associated protein 9 (Cas9) and homing endonucleases, are molecular scissors that can be targeted at predetermined loci in order to modify the genome sequence of an organism. Areas covered: By perturbing genomic DNA at predetermined loci, programmable nucleases can be used as antiviral and antimicrobial treatment. This approach includes targeting of essential viral genes or viral sequences able, once mutated, to inhibit viral replication; repurposing of CRISPR-Cas9 system for lethal self-targeting of bacteria; targeting antibiotic-resistance and virulence genes in bacteria, fungi, and parasites; engineering arthropod vectors to prevent vector-borne infections. Expert commentary: While progress has been done in demonstrating the feasibility of using genome editing as antimicrobial strategy, there are still many hurdles to overcome, such as the risk of off-target mutations, the raising of escape mutants, and the inefficiency of delivery methods, before translating results from preclinical studies into clinical applications.
Development of Microsatellite Markers for Buffalograss ( Buchloë dactyloides ; Poaceae), a Drought-Tolerant Turfgrass Alternative

DOE PAGES

Hadle, Jacob J.; Konrade, Lauren A.; Beasley, Rochelle R.; ...

2016-08-03

Buchloë dactyloides (Nutt.) Engelm. (buffalograss; Poaceae) is a low-growing, perennial C4 grass that is a dominant component of shortgrass prairies of the North American Great Plains (Shearman et al., 2004). Beyond this significant ecosystem role, buffalograss has been widely adopted as a drought-tolerant turfgrass alternative, particularly notable as a native-species option in North America. Like many dominant Great Plains grasses, B. dactyloides comprises an autopolypoid series, including diploids (2n = 20), tetraploids, pentaploids, and hexaploids (Johnson et al., 2001). Preserving the full range of buffalograss phenotypic and genotypic diversity and utilizing this diversity for crop improvement will require an understandingmore » of the distribution of genetic variation among cytotypes and across its large geographic range. Beyond numerous methodological advantages (Guichoux et al., 2011), microsatellites, or simple sequence repeat (SSR) markers,are an attractive genetic tool for studies of wide-ranging polyploid series given their codominant nature and applicability to museum-derived DNAs. Because SSR data are routinely obtainable from DNA extracted from museum tissue (Wandeler et al., 2007), these samples can be used to quickly and economically obtain comparative genotypic data from all portions of a large geographic range. Currently no buffalograss-specific SSR loci are available, as previous studies have relied on a mixture of dominant and codominant loci that were designed for other taxa (Budak et al., 2004). In this study, a set of SSR loci are designed from B. dactyloides genomic sequence data. The variability of these loci are then evaluated in six populations from numerous portions of the buffalograss range.« less
New polymorphic microsatellite markers derived from hemocyte cDNA library of Manila clam Ruditapes philippinarum challenged by the protozoan parasite Perkinsus olseni

NASA Astrophysics Data System (ADS)

Kang, Hyun-Sil; Hong, Hyun-Ki; Park, Kyung-Il; Cho, Moonjae; Youn, Seok-Hyun; Choi, Kwang-Sik

2017-03-01

Manila clam Ruditapes philippinarum is one of the most important benthic animals in the coastal north Pacific region, where clam populations have been mixed genetically through trade and aquaculture activities. Accordingly, identification of the genetically different clam populations has become one of the most important issues to manage interbreeding of the local and introduced clam populations. To identify genetically different populations of clam populations, we developed 11 expressed sequence tag (EST)-microsatellite loci (i.e., simple sequence repeat, SSR) from 1,128 clam hemocyte cDNA clones challenged by the protozoan parasite Perkinsus olseni. Genotype analysis using the markers developed in this study demonstrated that clams from a tidal flat on the west coast contained 6 to 19 alleles per locus, and a population from Jeju Island had 4 to 20 alleles per locus. The expected heterozygosity of the 2 clam populations ranged from 0.472 to 0.919 for clams from the west coast, and 0.494 to 0.919 for clams from Jeju Island, respectively. Among the 11 loci discovered in this study, 7 loci significantly deviated from the Hardy-Weinberg equilibrium after Bonferroni correction. The 5 loci developed in this study also successfully amplified the SSRs of R. variegatus, a clam species taxonomically very close to R. philippinarum, from Hong Kong and Jeju Island. We believe that the 11 novel polymorphic SSR developed in this study can be utilized successfully in Manila clam genetic diversity analysis, as well as in genetic discrimination of different clam populations.
Development of Microsatellite Markers for Buffalograss ( Buchloë dactyloides ; Poaceae), a Drought-Tolerant Turfgrass Alternative

DOE Office of Scientific and Technical Information (OSTI.GOV)

Hadle, Jacob J.; Konrade, Lauren A.; Beasley, Rochelle R.

Buchloë dactyloides (Nutt.) Engelm. (buffalograss; Poaceae) is a low-growing, perennial C4 grass that is a dominant component of shortgrass prairies of the North American Great Plains (Shearman et al., 2004). Beyond this significant ecosystem role, buffalograss has been widely adopted as a drought-tolerant turfgrass alternative, particularly notable as a native-species option in North America. Like many dominant Great Plains grasses, B. dactyloides comprises an autopolypoid series, including diploids (2n = 20), tetraploids, pentaploids, and hexaploids (Johnson et al., 2001). Preserving the full range of buffalograss phenotypic and genotypic diversity and utilizing this diversity for crop improvement will require an understandingmore » of the distribution of genetic variation among cytotypes and across its large geographic range. Beyond numerous methodological advantages (Guichoux et al., 2011), microsatellites, or simple sequence repeat (SSR) markers,are an attractive genetic tool for studies of wide-ranging polyploid series given their codominant nature and applicability to museum-derived DNAs. Because SSR data are routinely obtainable from DNA extracted from museum tissue (Wandeler et al., 2007), these samples can be used to quickly and economically obtain comparative genotypic data from all portions of a large geographic range. Currently no buffalograss-specific SSR loci are available, as previous studies have relied on a mixture of dominant and codominant loci that were designed for other taxa (Budak et al., 2004). In this study, a set of SSR loci are designed from B. dactyloides genomic sequence data. The variability of these loci are then evaluated in six populations from numerous portions of the buffalograss range.« less
CRISPR/cas Loci of Type II Propionibacterium acnes Confer Immunity against Acquisition of Mobile Elements Present in Type I P. acnes

PubMed Central

Brüggemann, Holger; Lomholt, Hans B.; Tettelin, Hervé; Kilian, Mogens

2012-01-01

Propionibacterium acnes is a skin commensal that occasionally acts as an opportunistic pathogen. The population structure of this species shows three main lineages (I–III). While type I strains are mainly associated with sebaceous follicles of human skin and inflammatory acne, types II and III strains are more often associated with deep tissue infections. We investigated the occurrence and distribution of the clustered regularly interspaced short palindromic repeats (CRISPR) in P. acnes, assessed their immunological memory, and addressed the question if such a system could account for type-specific properties of the species. A collection of 108 clinical isolates covering all known phylotypes of P. acnes was screened for the existence of CRISPR/cas loci. We found that CRISPR loci are restricted to type II P. acnes strains. Sequence analyses of the CRISPR spacers revealed that the system confers immunity to P. acnes-specific phages and to two mobile genetic elements. These elements are found almost exclusively in type I P. acnes strains. Genome sequencing of a type I P. acnes isolate revealed that one element, 54 kb in size, encodes a putative secretion/tight adherence (TAD) system. Thus, CRISPR/cas loci in P. acnes recorded the exposure of type II strains to mobile genetic elements of type I strains. The CRISPR/cas locus is deleted in type I strains, which conceivably accounts for their ability to horizontally acquire fitness or virulence traits and might indicate that type I strains constitute a younger subpopulation of P. acnes. PMID:22479553
Analysis of the leaf transcriptome of Musa acuminata during interaction with Mycosphaerella musicola: gene assembly, annotation and marker development

PubMed Central

2013-01-01

Background Although banana (Musa sp.) is an important edible crop, contributing towards poverty alleviation and food security, limited transcriptome datasets are available for use in accelerated molecular-based breeding in this genus. 454 GS-FLX Titanium technology was employed to determine the sequence of gene transcripts in genotypes of Musa acuminata ssp. burmannicoides Calcutta 4 and M. acuminata subgroup Cavendish cv. Grande Naine, contrasting in resistance to the fungal pathogen Mycosphaerella musicola, causal organism of Sigatoka leaf spot disease. To enrich for transcripts under biotic stress responses, full length-enriched cDNA libraries were prepared from whole plant leaf materials, both uninfected and artificially challenged with pathogen conidiospores. Results The study generated 846,762 high quality sequence reads, with an average length of 334 bp and totalling 283 Mbp. De novo assembly generated 36,384 and 35,269 unigene sequences for M. acuminata Calcutta 4 and Cavendish Grande Naine, respectively. A total of 64.4% of the unigenes were annotated through Basic Local Alignment Search Tool (BLAST) similarity analyses against public databases. Assembled sequences were functionally mapped to Gene Ontology (GO) terms, with unigene functions covering a diverse range of molecular functions, biological processes and cellular components. Genes from a number of defense-related pathways were observed in transcripts from each cDNA library. Over 99% of contig unigenes mapped to exon regions in the reference M. acuminata DH Pahang whole genome sequence. A total of 4068 genic-SSR loci were identified in Calcutta 4 and 4095 in Cavendish Grande Naine. A subset of 95 potential defense-related gene-derived simple sequence repeat (SSR) loci were validated for specific amplification and polymorphism across M. acuminata accessions. Fourteen loci were polymorphic, with alleles per polymorphic locus ranging from 3 to 8 and polymorphism information content ranging from 0.34 to 0.82. Conclusions A large set of unigenes were characterized in this study for both M. acuminata Calcutta 4 and Cavendish Grande Naine, increasing the number of public domain Musa ESTs. This transcriptome is an invaluable resource for furthering our understanding of biological processes elicited during biotic stresses in Musa. Gene-based markers will facilitate molecular breeding strategies, forming the basis of genetic linkage mapping and analysis of quantitative trait loci. PMID:23379821
De novo generation of plant centromeres at tandem repeats.

PubMed

Teo, Chee How; Lermontova, Inna; Houben, Andreas; Mette, Michael Florian; Schubert, Ingo

2013-06-01

Artificial minichromosomes are highly desirable tools for basic research, breeding, and biotechnology purposes. We present an option to generate plant artificial minichromosomes via de novo engineering of plant centromeres in Arabidopsis thaliana by targeting kinetochore proteins to tandem repeat arrays at non-centromeric positions. We employed the bacterial lactose repressor/lactose operator system to guide derivatives of the centromeric histone H3 variant cenH3 to LacO operator sequences. Tethering of cenH3 to non-centromeric loci led to de novo assembly of kinetochore proteins and to dicentric carrier chromosomes which potentially form anaphase bridges. This approach will be further developed and may contribute to generating minichromosomes from preselected genomic regions, potentially even in a diploid background.
Genetic and epigenetic variation in 5S ribosomal RNA genes reveals genome dynamics in Arabidopsis thaliana

PubMed Central

Simon, Lauriane; Rabanal, Fernando A; Dubos, Tristan; Oliver, Cecilia; Lauber, Damien; Poulet, Axel; Vogt, Alexander; Mandlbauer, Ariane; Le Goff, Samuel; Sommer, Andreas; Duborjal, Hervé; Tatout, Christophe

2018-01-01

Abstract Organized in tandem repeat arrays in most eukaryotes and transcribed by RNA polymerase III, expression of 5S rRNA genes is under epigenetic control. To unveil mechanisms of transcriptional regulation, we obtained here in depth sequence information on 5S rRNA genes from the Arabidopsis thaliana genome and identified differential enrichment in epigenetic marks between the three 5S rDNA loci situated on chromosomes 3, 4 and 5. We reveal the chromosome 5 locus as the major source of an atypical, long 5S rRNA transcript characteristic of an open chromatin structure. 5S rRNA genes from this locus translocated in the Landsberg erecta ecotype as shown by linkage mapping and chromosome-specific FISH analysis. These variations in 5S rDNA locus organization cause changes in the spatial arrangement of chromosomes in the nucleus. Furthermore, 5S rRNA gene arrangements are highly dynamic with alterations in chromosomal positions through translocations in certain mutants of the RNA-directed DNA methylation pathway and important copy number variations among ecotypes. Finally, variations in 5S rRNA gene sequence, chromatin organization and transcripts indicate differential usage of 5S rDNA loci in distinct ecotypes. We suggest that both the usage of existing and new 5S rDNA loci resulting from translocations may impact neighboring chromatin organization. PMID:29518237
Genetic and epigenetic variation in 5S ribosomal RNA genes reveals genome dynamics in Arabidopsis thaliana.

PubMed

Simon, Lauriane; Rabanal, Fernando A; Dubos, Tristan; Oliver, Cecilia; Lauber, Damien; Poulet, Axel; Vogt, Alexander; Mandlbauer, Ariane; Le Goff, Samuel; Sommer, Andreas; Duborjal, Hervé; Tatout, Christophe; Probst, Aline V

2018-04-06

Organized in tandem repeat arrays in most eukaryotes and transcribed by RNA polymerase III, expression of 5S rRNA genes is under epigenetic control. To unveil mechanisms of transcriptional regulation, we obtained here in depth sequence information on 5S rRNA genes from the Arabidopsis thaliana genome and identified differential enrichment in epigenetic marks between the three 5S rDNA loci situated on chromosomes 3, 4 and 5. We reveal the chromosome 5 locus as the major source of an atypical, long 5S rRNA transcript characteristic of an open chromatin structure. 5S rRNA genes from this locus translocated in the Landsberg erecta ecotype as shown by linkage mapping and chromosome-specific FISH analysis. These variations in 5S rDNA locus organization cause changes in the spatial arrangement of chromosomes in the nucleus. Furthermore, 5S rRNA gene arrangements are highly dynamic with alterations in chromosomal positions through translocations in certain mutants of the RNA-directed DNA methylation pathway and important copy number variations among ecotypes. Finally, variations in 5S rRNA gene sequence, chromatin organization and transcripts indicate differential usage of 5S rDNA loci in distinct ecotypes. We suggest that both the usage of existing and new 5S rDNA loci resulting from translocations may impact neighboring chromatin organization.
Molecular identification and characterization of clustered regularly interspaced short palindromic repeat (CRISPR) gene cluster in Taylorella equigenitalis.

PubMed

Hara, Yasushi; Hayashi, Kyohei; Nakajima, Takuya; Kagawa, Shizuko; Tazumi, Akihiro; Moore, John E; Matsuda, Motoo

2013-09-01

Clustered regularly interspaced short palindromic repeats (CRISPRs), of approximately 10,000 base pairs (bp) in length, were shown to occur in the Japanese Taylorella equigenitalis strain, EQ59. The locus was composed of the putative CRISPRs-associated with 5 (cas5), RAMP csd1, csd2, recB, cas1, a leader region, 13 CRISPR consensus sequence repeats (each 32 bp; 5'-TCAGCCACGTTCGCGTGGCTGTGTGTTTAAAG-3'). These were in turn separated by 12 non repetitive unique spacer regions of similar length. In addition, a leader region, a transposase/IS protein, a leader region, and cas3 were also seen. All seven putative open reading frames carry their ribosome binding sites. Promoter consensus sequences at the -35 and -10 regions and putative intrinsic ρ-independent transcription terminator regions also occurred. A possible long overlap of 170 bp in length occurred between the recB and cas1 loci. Positive reverse transcription PCR signals of cas5, RAMP csd1, csd2-recB/cas1, and cas3 were generated. A putative secondary structure of the CRISPR consensus repeats was constructed. Following this, CRISPR results of the T. equigenitalis EQ59 isolate were subsequently compared with those from the Taylorella asinigenitalis MCE3 isolate.
Microsatellite Development for an Endangered Bream Megalobrama pellegrini (Teleostei, Cyprinidae) Using 454 Sequencing

PubMed Central

Wang, Jinjin; Yu, Xiaomu; Zhao, Kai; Zhang, Yaoguang; Tong, Jingou; Peng, Zuogang

2012-01-01

Megalobrama pellegrini is an endemic fish species found in the upper Yangtze River basin in China. This species has become endangered due to the construction of the Three Gorges Dam and overfishing. However, the available genetic data for this species is limited. Here, we developed 26 polymorphic microsatellite markers from the M. pellegrini genome using next-generation sequencing techniques. A total of 257,497 raw reads were obtained from a quarter-plate run on 454 GS-FLX titanium platforms and 49,811 unique sequences were generated with an average length of 404 bp; 24,522 (49.2%) sequences contained microsatellite repeats. Of the 53 loci screened, 33 were amplified successfully and 26 were polymorphic. The genetic diversity in M. pellegrini was moderate, with an average of 3.08 alleles per locus, and the mean observed and expected heterozygosity were 0.47 and 0.51, respectively. In addition, we tested cross-species amplification for all 33 loci in four additional breams: M. amblycephala, M. skolkovii, M. terminalis, and Sinibrama wui. The cross-species amplification showed a significant high level of transferability (79%–97%), which might be due to their dramatically close genetic relationships. The polymorphic microsatellites developed in the current study will not only contribute to further conservation genetic studies and parentage analyses of this endangered species, but also facilitate future work on the other closely related species. PMID:22489139
Interactions between Glu-1 and Glu-3 loci and associations of selected molecular markers with quality traits in winter wheat (Triticum aestivum L.) DH lines.

PubMed

Krystkowiak, Karolina; Langner, Monika; Adamski, Tadeusz; Salmanowicz, Bolesław P; Kaczmarek, Zygmunt; Krajewski, Paweł; Surma, Maria

2017-02-01

The quality of wheat depends on a large complex of genes and environmental factors. The objective of this study was to identify quantitative trait loci controlling technological quality traits and their stability across environments, and to assess the impact of interaction between alleles at loci Glu-1 and Glu-3 on grain quality. DH lines were evaluated in field experiments over a period of 4 years, and genotyped using simple sequence repeat markers. Lines were analysed for grain yield (GY), thousand grain weight (TGW), protein content (PC), starch content (SC), wet gluten content (WG), Zeleny sedimentation value (ZS), alveograph parameter W (APW), hectolitre weight (HW), and grain hardness (GH). A number of QTLs for these traits were identified in all chromosome groups. The Glu-D1 locus influenced TGW, PC, SC, WG, ZS, APW, GH, while locus Glu-B1 affected only PC, ZS, and WG. Most important marker-trait associations were found on chromosomes 1D and 5D. Significant effects of interaction between Glu-1 and Glu-3 loci on technological properties were recorded, and in all types of this interaction positive effects of Glu-D1 locus on grain quality were observed, whereas effects of Glu-B1 locus depended on alleles at Glu-3 loci. Effects of Glu-A3 and Glu-D3 loci per se were not significant, while their interaction with alleles present at other loci encoding HMW and LMW were important. These results indicate that selection of wheat genotypes with predicted good bread-making properties should be based on the allelic composition both in Glu-1 and Glu-3 loci, and confirm the predominant effect of Glu-D1d allele on technological properties of wheat grains.
Inter-laboratory comparison of multi-locus variable-number tandem repeat analysis (MLVA) for verocytotoxin-producing Escherichia coli O157 to facilitate data sharing.

PubMed

Holmes, A; Perry, N; Willshaw, G; Hanson, M; Allison, L

2015-01-01

Multi-locus variable number tandem repeat analysis (MLVA) is used in clinical and reference laboratories for subtyping verocytotoxin-producing Escherichia coli O157 (VTEC O157). However, as yet there is no common allelic or profile nomenclature to enable laboratories to easily compare data. In this study, we carried out an inter-laboratory comparison of an eight-loci MLVA scheme using a set of 67 isolates of VTEC O157. We found all but two isolates were identical in profile in the two laboratories, and repeat units were homogeneous in size but some were incomplete. A subset of the isolates (n = 17) were sequenced to determine the actual copy number of representative alleles, thereby enabling alleles to be named according to international consensus guidelines. This work has enabled us to realize the potential of MLVA as a portable, highly discriminatory and convenient subtyping method.
Microsatellite markers used for genome-wide association mapping of partial resistance to Sclerotinia sclerotiorum in a world collection of Brassica napus.

PubMed

Gyawali, Sanjaya; Harrington, Myrtle; Durkin, Jonathan; Horner, Kyla; Parkin, Isobel A P; Hegedus, Dwayne D; Bekkaoui, Diana; Buchwaldt, Lone

The fungal pathogen Sclerotinia sclerotiorum causes stem rot of oilseed rape ( Brassica napus ) worldwide. In preparation for genome-wide association mapping (GWAM) of sclerotinia resistance in B. napus , 152 accessions from diverse geographical regions were screened with a single Canadian isolate, #321. Plants were inoculated by attaching mycelium plugs to the main stem at full flower. Lesion lengths measured 7, 14 and 21 days after inoculation were used to calculate the area under the disease progress curve (AUDPC). Depth of penetration was noted and used to calculate percent soft and collapsed lesions (% s + c). The two disease traits were highly correlated ( r = 0.93). Partially resistant accessions (AUDPC <7 and % s + c <2) were identified primarily from South Korea and Japan with a few from Pakistan, China and Europe. Genotyping of accessions with 84 simple sequence repeat markers provided 690 polymorphic loci for GWAM. The general linear model in TASSEL best fitted the data when adjusted for population structure (STRUCTURE), GLM + Q. After correction for positive false discovery rate, 34 loci were significantly associated with both disease traits of which 21 alleles contributed to resistance, while the remaining enhanced susceptibility. The phenotypic variation explained by the loci ranged from 6 to 25 %. Five loci mapped to published quantitative trait loci conferring sclerotinia resistance in Chinese lines.
mCAL: A New Approach for Versatile Multiplex Action of Cas9 Using One sgRNA and Loci Flanked by a Programmed Target Sequence.

PubMed

Finnigan, Gregory C; Thorner, Jeremy

2016-07-07

Genome editing exploiting CRISPR/Cas9 has been adopted widely in academia and in the biotechnology industry to manipulate DNA sequences in diverse organisms. Molecular engineering of Cas9 itself and its guide RNA, and the strategies for using them, have increased efficiency, optimized specificity, reduced inappropriate off-target effects, and introduced modifications for performing other functions (transcriptional regulation, high-resolution imaging, protein recruitment, and high-throughput screening). Moreover, Cas9 has the ability to multiplex, i.e., to act at different genomic targets within the same nucleus. Currently, however, introducing concurrent changes at multiple loci involves: (i) identification of appropriate genomic sites, especially the availability of suitable PAM sequences; (ii) the design, construction, and expression of multiple sgRNA directed against those sites; (iii) potential difficulties in altering essential genes; and (iv) lingering concerns about "off-target" effects. We have devised a new approach that circumvents these drawbacks, as we demonstrate here using the yeast Saccharomyces cerevisiae First, any gene(s) of interest are flanked upstream and downstream with a single unique target sequence that does not normally exist in the genome. Thereafter, expression of one sgRNA and cotransformation with appropriate PCR fragments permits concomitant Cas9-mediated alteration of multiple genes (both essential and nonessential). The system we developed also allows for maintenance of the integrated, inducible Cas9-expression cassette or its simultaneous scarless excision. Our scheme-dubbed mCAL for " M: ultiplexing of C: as9 at A: rtificial L: oci"-can be applied to any organism in which the CRISPR/Cas9 methodology is currently being utilized. In principle, it can be applied to install synthetic sequences into the genome, to generate genomic libraries, and to program strains or cell lines so that they can be conveniently (and repeatedly) manipulated at multiple loci with extremely high efficiency. Copyright © 2016 Finnigan and Thorner.

Successful development of microsatellite markers in a challenging species: the horizontal borer Austroplatypus incompertus (Coleoptera: Curculionidae).

PubMed

Smith, S; Joss, T; Stow, A

2011-10-01

The analysis of microsatellite loci has allowed significant advances in evolutionary biology and pest management. However, until very recently, the potential benefits have been compromised by the high costs of developing these neutral markers. High-throughput sequencing provides a solution to this problem. We describe the development of 13 microsatellite markers for the eusocial ambrosia beetle, Austroplatypus incompertus, a significant pest of forests in southeast Australia. The frequency of microsatellite repeats in the genome of A. incompertus was determined to be low, and previous attempts at microsatellite isolation using a traditional genomic library were problematic. Here, we utilised two protocols, microsatellite-enriched genomic library construction and high-throughput 454 sequencing and characterised 13 loci which were polymorphic among 32 individuals. Numbers of alleles per locus ranged from 2 to 17, and observed and expected heterozygosities from 0.344 to 0.767 and from 0.507 to 0.860, respectively. These microsatellites have the resolution required to analyse fine-scale colony and population genetic structure. Our work demonstrates the utility of next-generation 454 sequencing as a method for rapid and cost-effective acquisition of microsatellites where other techniques have failed, or for taxa where marker development has historically been both complicated and expensive.
Multidrug-resistant enterococci lack CRISPR-cas.

PubMed

Palmer, Kelli L; Gilmore, Michael S

2010-10-12

Clustered, regularly interspaced short palindromic repeats (CRISPR) provide bacteria and archaea with sequence-specific, acquired defense against plasmids and phage. Because mobile elements constitute up to 25% of the genome of multidrug-resistant (MDR) enterococci, it was of interest to examine the codistribution of CRISPR and acquired antibiotic resistance in enterococcal lineages. A database was built from 16 Enterococcus faecalis draft genome sequences to identify commonalities and polymorphisms in the location and content of CRISPR loci. With this data set, we were able to detect identities between CRISPR spacers and sequences from mobile elements, including pheromone-responsive plasmids and phage, suggesting that CRISPR regulates the flux of these elements through the E. faecalis species. Based on conserved locations of CRISPR and CRISPR-cas loci and the discovery of a new CRISPR locus with associated functional genes, CRISPR3-cas, we screened additional E. faecalis strains for CRISPR content, including isolates predating the use of antibiotics. We found a highly significant inverse correlation between the presence of a CRISPR-cas locus and acquired antibiotic resistance in E. faecalis, and examination of an additional eight E. faecium genomes yielded similar results for that species. A mechanism for CRISPR-cas loss in E. faecalis was identified. The inverse relationship between CRISPR-cas and antibiotic resistance suggests that antibiotic use inadvertently selects for enterococcal strains with compromised genome defense.
A high-resolution, nucleosome position map of C. elegans reveals a lack of universal sequence-dictated positioning

PubMed Central

Valouev, Anton; Ichikawa, Jeffrey; Tonthat, Thaisan; Stuart, Jeremy; Ranade, Swati; Peckham, Heather; Zeng, Kathy; Malek, Joel A.; Costa, Gina; McKernan, Kevin; Sidow, Arend; Fire, Andrew; Johnson, Steven M.

2008-01-01

Using the massively parallel technique of sequencing by oligonucleotide ligation and detection (SOLiD; Applied Biosystems), we have assessed the in vivo positions of more than 44 million putative nucleosome cores in the multicellular genetic model organism Caenorhabditis elegans. These analyses provide a global view of the chromatin architecture of a multicellular animal at extremely high density and resolution. While we observe some degree of reproducible positioning throughout the genome in our mixed stage population of animals, we note that the major chromatin feature in the worm is a diversity of allowed nucleosome positions at the vast majority of individual loci. While absolute positioning of nucleosomes can vary substantially, relative positioning of nucleosomes (in a repeated array structure likely to be maintained at least in part by steric constraints) appears to be a significant property of chromatin structure. The high density of nucleosomal reads enabled a substantial extension of previous analysis describing the usage of individual oligonucleotide sequences along the span of the nucleosome core and linker. We release this data set, via the UCSC Genome Browser, as a resource for the high-resolution analysis of chromatin conformation and DNA accessibility at individual loci within the C. elegans genome. PMID:18477713
Concerted evolution of the tandemly repeated genes encoding primate U2 small nuclear RNA (the RNU2 locus) does not prevent rapid diversification of the (CT){sub n} {center_dot} (GA){sub n} microsatellite embedded within the U2 repeat unit

DOE Office of Scientific and Technical Information (OSTI.GOV)

Liao, D.; Weiner, A.M.

1995-12-10

The RNU2 locus encoding human U2 small nuclear RNA (snRNA) is organized as a nearly perfect tandem array containing 5 to 22 copies of a 5.8-kb repeat unit. Just downstream of the U2 snRNA gene in each 5.8-kb repeat unit lies a large (CT){sub n}{center_dot}(GA){sub n} dinucleotide repeat (n {approx} 70). This form of genomic organization, in which one repeat is embedded within another, provides an unusual opportunity to study the balance of forces maintaining the homogeneity of both kinds of repeats. Using a combination of field inversion gel electrophoresis and polymerase chain reaction, we have been able to studymore » the CT microsatellites within individual U2 tandem arrays. We find that the CT microsatellites within an RNU2 allele exhibit significant length polymorphism, despite the remarkable homogeneity of the surrounding U2 repeat units. Length polymorphism is due primarily to loss or gain of CT dinucleotide repeats, but other types of deletions, insertions, and substitutions are also frequent. Polymorphism is greatly reduced in regions where pure (CT){sub n} tracts are interrupted by occasional G residues, suggesting that irregularities stabilize both the length and the sequence of the dinucleotide repeat. We further show that the RNU2 loci of other catarrhine primates (gorilla, chimpanzee, ogangutan, and baboon) contain orthologous CT microsatellites; these also exhibit length polymorphism, but are highly divergent from each other. Thus, although the CT microsatellite is evolving far more rapidly than the rest of the U2 repeat unit, it has persisted through multiple speciation events spanning >35 Myr. The persistence of the CT microsatellite, despite polymorphism and rapid evolution, suggests that it might play a functional role in concerted evolution of the RNU2 loci, perhaps as an initiation site for recombination and/or gene conversion. 70 refs., 5 figs.« less
The First Complete Chloroplast Genome Sequences in Actinidiaceae: Genome Structure and Comparative Analysis.

PubMed

Yao, Xiaohong; Tang, Ping; Li, Zuozhou; Li, Dawei; Liu, Yifei; Huang, Hongwen

2015-01-01

Actinidia chinensis is an important economic plant belonging to the basal lineage of the asterids. Availability of a complete Actinidia chloroplast genome sequence is crucial to understanding phylogenetic relationships among major lineages of angiosperms and facilitates kiwifruit genetic improvement. We report here the complete nucleotide sequences of the chloroplast genomes for Actinidia chinensis and A. chinensis var deliciosa obtained through de novo assembly of Illumina paired-end reads produced by total DNA sequencing. The total genome size ranges from 155,446 to 157,557 bp, with an inverted repeat (IR) of 24,013 to 24,391 bp, a large single copy region (LSC) of 87,984 to 88,337 bp and a small single copy region (SSC) of 20,332 to 20,336 bp. The genome encodes 113 different genes, including 79 unique protein-coding genes, 30 tRNA genes and 4 ribosomal RNA genes, with 16 duplicated in the inverted repeats, and a tRNA gene (trnfM-CAU) duplicated once in the LSC region. Comparisons of IR boundaries among four asterid species showed that IR/LSC borders were extended into the 5' portion of the psbA gene and IR contraction occurred in Actinidia. The clap gene has been lost from the chloroplast genome in Actinidia, and may have been transferred to the nucleus during chloroplast evolution. Twenty-seven polymorphic simple sequence repeat (SSR) loci were identified in the Actinidia chloroplast genome. Maximum parsimony analyses of a 72-gene, 16 taxa angiosperm dataset strongly support the placement of Actinidiaceae in Ericales within the basal asterids.
Autosomal dominant hereditary spastic paraplegia with axonal sensory motor polyneuropathy maps to chromosome 21q 22.3.

PubMed

Peddareddygari, Leema Reddy; Hanna, Philip A; Igo, Robert P; Luo, Yuqun A; Won, Sungho; Hirano, Michio; Grewal, Raji P

2016-01-01

Hereditary spastic paraplegia (HSP) are a genetically and clinically heterogeneous group of disorders. At present, 19 autosomal dominant loci for HSP have been mapped. We ascertained an American family of European descent segregating an autosomal dominant HSP associated with peripheral neuropathy. A genome wide scan was performed with 410 microsatellite repeat marker (Weber lab screening set 16) and following linkage and haplotype analysis, fine mapping was performed. Established genes or loci for HSP were excluded by direct sequencing or haplotype analysis. All established loci for HSP were excluded. Fine mapping suggested a locus on chromosome 21q22.3 flanked by markers D21S1411 and D21S1446 with a maximum logarithm of odds score of 2.05 and was supported by haplotype analysis. A number of candidate genes in this region were analyzed and no disease-producing mutations were detected. We present the clinical and genetic analysis of an American family with autosomal dominant HSP with axonal sensory motor polyneuropathy mapping to a novel locus on chromosome 21q22.3 designated SPG56.
Forensic Loci Allele Database (FLAD): Automatically generated, permanent identifiers for sequenced forensic alleles.

PubMed

Van Neste, Christophe; Van Criekinge, Wim; Deforce, Dieter; Van Nieuwerburgh, Filip

2016-01-01

It is difficult to predict if and when massively parallel sequencing of forensic STR loci will replace capillary electrophoresis as the new standard technology in forensic genetics. The main benefits of sequencing are increased multiplexing scales and SNP detection. There is not yet a consensus on how sequenced profiles should be reported. We present the Forensic Loci Allele Database (FLAD) service, made freely available on http://forensic.ugent.be/FLAD/. It offers permanent identifiers for sequenced forensic alleles (STR or SNP) and their microvariants for use in forensic allele nomenclature. Analogous to Genbank, its aim is to provide permanent identifiers for forensically relevant allele sequences. Researchers that are developing forensic sequencing kits or are performing population studies, can register on http://forensic.ugent.be/FLAD/ and add loci and allele sequences with a short and simple application interface (API). Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.
Analysis of BAC end sequences in oak, a keystone forest tree species, providing insight into the composition of its genome

PubMed Central

2011-01-01

Background One of the key goals of oak genomics research is to identify genes of adaptive significance. This information may help to improve the conservation of adaptive genetic variation and the management of forests to increase their health and productivity. Deep-coverage large-insert genomic libraries are a crucial tool for attaining this objective. We report herein the construction of a BAC library for Quercus robur, its characterization and an analysis of BAC end sequences. Results The EcoRI library generated consisted of 92,160 clones, 7% of which had no insert. Levels of chloroplast and mitochondrial contamination were below 3% and 1%, respectively. Mean clone insert size was estimated at 135 kb. The library represents 12 haploid genome equivalents and, the likelihood of finding a particular oak sequence of interest is greater than 99%. Genome coverage was confirmed by PCR screening of the library with 60 unique genetic loci sampled from the genetic linkage map. In total, about 20,000 high-quality BAC end sequences (BESs) were generated by sequencing 15,000 clones. Roughly 5.88% of the combined BAC end sequence length corresponded to known retroelements while ab initio repeat detection methods identified 41 additional repeats. Collectively, characterized and novel repeats account for roughly 8.94% of the genome. Further analysis of the BESs revealed 1,823 putative genes suggesting at least 29,340 genes in the oak genome. BESs were aligned with the genome sequences of Arabidopsis thaliana, Vitis vinifera and Populus trichocarpa. One putative collinear microsyntenic region encoding an alcohol acyl transferase protein was observed between oak and chromosome 2 of V. vinifera. Conclusions This BAC library provides a new resource for genomic studies, including SSR marker development, physical mapping, comparative genomics and genome sequencing. BES analysis provided insight into the structure of the oak genome. These sequences will be used in the assembly of a future genome sequence for oak. PMID:21645357
Rapid microsatellite marker development for African mahogany (Khaya senegalensis, Meliaceae) using next-generation sequencing and assessment of its intra-specific genetic diversity.

PubMed

Karan, M; Evans, D S; Reilly, D; Schulte, K; Wright, C; Innes, D; Holton, T A; Nikles, D G; Dickinson, G R

2012-03-01

Khaya senegalensis (African mahogany or dry-zone mahogany) is a high-value hardwood timber species with great potential for forest plantations in northern Australia. The species is distributed across the sub-Saharan belt from Senegal to Sudan and Uganda. Because of heavy exploitation and constraints on natural regeneration and sustainable planting, it is now classified as a vulnerable species. Here, we describe the development of microsatellite markers for K. senegalensis using next-generation sequencing to assess its intra-specific diversity across its natural range, which is a key for successful breeding programs and effective conservation management of the species. Next-generation sequencing yielded 93,943 sequences with an average read length of 234 bp. The assembled sequences contained 1030 simple sequence repeats, with primers designed for 522 microsatellite loci. Twenty-one microsatellite loci were tested with 11 showing reliable amplification and polymorphism in K. senegalensis. The 11 novel microsatellites, together with one previously published, were used to assess 73 accessions belonging to the Australian K. senegalensis domestication program, sampled from across the natural range of the species. STRUCTURE analysis shows two major clusters, one comprising mainly accessions from west Africa (Senegal to Benin) and the second based in the far eastern limits of the range in Sudan and Uganda. Higher levels of genetic diversity were found in material from western Africa. This suggests that new seed collections from this region may yield more diverse genotypes than those originating from Sudan and Uganda in eastern Africa. © 2011 Blackwell Publishing Ltd.
Comparative Genomics Analyses Reveal Extensive Chromosome Colinearity and Novel Quantitative Trait Loci in Eucalyptus.

PubMed

Li, Fagen; Zhou, Changpin; Weng, Qijie; Li, Mei; Yu, Xiaoli; Guo, Yong; Wang, Yu; Zhang, Xiaohong; Gan, Siming

2015-01-01

Dense genetic maps, along with quantitative trait loci (QTLs) detected on such maps, are powerful tools for genomics and molecular breeding studies. In the important woody genus Eucalyptus, the recent release of E. grandis genome sequence allows for sequence-based genomic comparison and searching for positional candidate genes within QTL regions. Here, dense genetic maps were constructed for E. urophylla and E. tereticornis using genomic simple sequence repeats (SSR), expressed sequence tag (EST) derived SSR, EST-derived cleaved amplified polymorphic sequence (EST-CAPS), and diversity arrays technology (DArT) markers. The E. urophylla and E. tereticornis maps comprised 700 and 585 markers across 11 linkage groups, totaling at 1,208.2 and 1,241.4 cM in length, respectively. Extensive synteny and colinearity were observed as compared to three earlier DArT-based eucalypt maps (two maps with E. grandis × E. urophylla and one map of E. globulus) and with the E. grandis genome sequence. Fifty-three QTLs for growth (10-56 months of age) and wood density (56 months) were identified in 22 discrete regions on both maps, in which only one colocalizaiton was found between growth and wood density. Novel QTLs were revealed as compared with those previously detected on DArT-based maps for similar ages in Eucalyptus. Eleven to 585 positional candidate genes were obained for a 56-month-old QTL through aligning QTL confidence interval with the E. grandis genome. These results will assist in comparative genomics studies, targeted gene characterization, and marker-assisted selection in Eucalyptus and the related taxa.
Comparative Genomics Analyses Reveal Extensive Chromosome Colinearity and Novel Quantitative Trait Loci in Eucalyptus

PubMed Central

Weng, Qijie; Li, Mei; Yu, Xiaoli; Guo, Yong; Wang, Yu; Zhang, Xiaohong; Gan, Siming

2015-01-01

Dense genetic maps, along with quantitative trait loci (QTLs) detected on such maps, are powerful tools for genomics and molecular breeding studies. In the important woody genus Eucalyptus, the recent release of E. grandis genome sequence allows for sequence-based genomic comparison and searching for positional candidate genes within QTL regions. Here, dense genetic maps were constructed for E. urophylla and E. tereticornis using genomic simple sequence repeats (SSR), expressed sequence tag (EST) derived SSR, EST-derived cleaved amplified polymorphic sequence (EST-CAPS), and diversity arrays technology (DArT) markers. The E. urophylla and E. tereticornis maps comprised 700 and 585 markers across 11 linkage groups, totaling at 1,208.2 and 1,241.4 cM in length, respectively. Extensive synteny and colinearity were observed as compared to three earlier DArT-based eucalypt maps (two maps with E. grandis × E. urophylla and one map of E. globulus) and with the E. grandis genome sequence. Fifty-three QTLs for growth (10–56 months of age) and wood density (56 months) were identified in 22 discrete regions on both maps, in which only one colocalizaiton was found between growth and wood density. Novel QTLs were revealed as compared with those previously detected on DArT-based maps for similar ages in Eucalyptus. Eleven to 585 positional candidate genes were obained for a 56-month-old QTL through aligning QTL confidence interval with the E. grandis genome. These results will assist in comparative genomics studies, targeted gene characterization, and marker-assisted selection in Eucalyptus and the related taxa. PMID:26695430
From famine to feast? Selecting nuclear DNA sequence loci for plant species-level phylogeny reconstruction

PubMed Central

Hughes, Colin E; Eastwood, Ruth J; Donovan Bailey, C

2005-01-01

Phylogenetic analyses of DNA sequences have prompted spectacular progress in assembling the Tree of Life. However, progress in constructing phylogenies among closely related species, at least for plants, has been less encouraging. We show that for plants, the rapid accumulation of DNA characters at higher taxonomic levels has not been matched by conventional sequence loci at the species level, leaving a lack of well-resolved gene trees that is hindering investigations of many fundamental questions in plant evolutionary biology. The most popular approach to address this problem has been to use low-copy nuclear genes as a source of DNA sequence data. However, this has had limited success because levels of variation among nuclear intron sequences across groups of closely related species are extremely variable and generally lower than conventionally used loci, and because no universally useful low-copy nuclear DNA sequence loci have been developed. This suggests that solutions will, for the most part, be lineage-specific, prompting a move away from ‘universal’ gene thinking for species-level phylogenetics. The benefits and limitations of alternative approaches to locate more variable nuclear loci are discussed and the potential of anonymous non-genic nuclear loci is highlighted. Given the virtually unlimited number of loci that can be generated using these new approaches, it is clear that effective screening will be critical for efficient selection of the most informative loci. Strategies for screening are outlined. PMID:16553318
Analysis of complex repeat sequences within the spinal muscular atrophy (SMA) candidate region in 5q13

DOE Office of Scientific and Technical Information (OSTI.GOV)

Davies, K.E.; Morrison, K.E.; Daniels, R.I.

1994-09-01

We previously reported that the 400 kb interval flanked the polymorphic loci D5S435 and D5S557 contains blocks of a chromosome 5 specific repeat. This interval also defines the SMA candidate region by genetic analysis of recombinant families. A YAC contig of 2-3 Mb encompassing this area has been constructed and a 5.5 kb conserved fragment, isolated from a YAC end clone within the above interval, was used to obtain cDNAs from both fetal and adult brain libraries. We describe the identification of cDNAs with stretches of high DNA sequence homology to exons of {beta} glucuronidase on human chromosome 7. Themore » cDNAs map both to the candidate region and to an area of 5p using FISH and deletion hybrid analysis. Hybridization to bacteriophage and cosmid clones from the YACs localizes the {beta} glucuronidase related sequences within the 400 kb region of the YAC contig. The cDNAs show a polymorphic pattern on hybridization to genomic BamH1 fragments in the size range of 10-250 kb. Further analysis using YAC fragmentation vectors is being used to determine how these {beta} glucuronidase related cDNAs are distributed within 5q13. Dinucleotide repeats within the region are being investigated to determine linkage disequilibrium with the disease locus.« less
Genomic scan for genes predisposing to schizophrenia

DOE Office of Scientific and Technical Information (OSTI.GOV)

Coon, H.; Jensen. S.; Holik, J.

1994-03-15

We initiated a genome-wide search for genes predisposing to schizophrenia by ascertaining 9 families, each containing three to five cases of schizophrenia. The 9 pedigrees were initially genotyped with 329 polymorphic DNA loci distributed throughout the genome. Assuming either autosomal dominant or recessive inheritance, 254 DNA loci yielded lod scores less than -2.0 at {theta} = 0.0, 101 DNA markers gave lod scores less than -2.0 at {theta} = 0.05, while 5 DNA loci produced maximum lod scores greater than 1: D4S35, D14S17, D15S1, D22S84, and D22S55. Of the DNA markers yielding lod scores greater than 1, D4S35 and D22S55more » also were suggestive of linkage when the Affected-Pedigree-Member method was used. The families were then genotyped with four highly polymorphic simple sequence repeat markers; possible linkage diminished with DNA markers mapping nearby D4S35, while suggestive evidence of linkage remained with loci in the region of D22S55. Although follow-up investigation of these chromosomal regions may be warranted, our linkage results should be viewed as preliminary observations, as 35 unaffected persons are not past the age of risk. 90 refs., 3 tabs.« less
Bulk development and stringent selection of microsatellite markers in the western flower thrips Frankliniella occidentalis

PubMed Central

Cao, Li-Jun; Li, Ze-Min; Wang, Ze-Hua; Zhu, Liang; Gong, Ya-Jun; Chen, Min; Wei, Shu-Jun

2016-01-01

Recent improvements in next-generation sequencing technologies have enabled investigation of microsatellites on a genome-wide scale. Faced with a huge amount of candidates, the use of appropriate marker selection criteria is crucial. Here, we used the western flower thrips Frankliniella occidentalis for an empirical microsatellite survey and validation; 132,251 candidate microsatellites were identified, 92,102 of which were perfect. Dinucleotides were the most abundant category, while (AG)n was the most abundant motif. Sixty primer pairs were designed and validated in two natural populations, of which 30 loci were polymorphic, stable, and repeatable, but not all in Hardy–Weinberg equilibrium (HWE) and linkage equilibrium. Four marker panels were constructed to understand effect of marker selection on population genetic analyses: (i) only accept loci with single nucleotide insertions (SNI); (ii) only accept the most polymorphic loci (MP); (iii) only accept loci that did not deviate from HWE, did not show SNIs, and had unambiguous peaks (SS) and (iv) all developed markers (ALL). Although the MP panel resulted in microsatellites of highest genetic diversity followed by the SNI, the SS performed best in individual assignment. Our study proposes stringent criteria for selection of microsatellites from a large-scale number of genomic candidates for population genetic studies. PMID:27197749
Bulk development and stringent selection of microsatellite markers in the western flower thrips Frankliniella occidentalis.

PubMed

Cao, Li-Jun; Li, Ze-Min; Wang, Ze-Hua; Zhu, Liang; Gong, Ya-Jun; Chen, Min; Wei, Shu-Jun

2016-05-20

Recent improvements in next-generation sequencing technologies have enabled investigation of microsatellites on a genome-wide scale. Faced with a huge amount of candidates, the use of appropriate marker selection criteria is crucial. Here, we used the western flower thrips Frankliniella occidentalis for an empirical microsatellite survey and validation; 132,251 candidate microsatellites were identified, 92,102 of which were perfect. Dinucleotides were the most abundant category, while (AG)n was the most abundant motif. Sixty primer pairs were designed and validated in two natural populations, of which 30 loci were polymorphic, stable, and repeatable, but not all in Hardy-Weinberg equilibrium (HWE) and linkage equilibrium. Four marker panels were constructed to understand effect of marker selection on population genetic analyses: (i) only accept loci with single nucleotide insertions (SNI); (ii) only accept the most polymorphic loci (MP); (iii) only accept loci that did not deviate from HWE, did not show SNIs, and had unambiguous peaks (SS) and (iv) all developed markers (ALL). Although the MP panel resulted in microsatellites of highest genetic diversity followed by the SNI, the SS performed best in individual assignment. Our study proposes stringent criteria for selection of microsatellites from a large-scale number of genomic candidates for population genetic studies.
Relatedness of Indian flax genotypes (Linum usitatissimum L.): an inter-simple sequence repeat (ISSR) primer assay.

PubMed

Rajwade, Ashwini V; Arora, Ritu S; Kadoo, Narendra Y; Harsulkar, Abhay M; Ghorpade, Prakash B; Gupta, Vidya S

2010-06-01

The objective of this study was to analyze the genetic relationships, using PCR-based ISSR markers, among 70 Indian flax (Linum usitatissimum L.) genotypes actively utilized in flax breeding programs. Twelve ISSR primers were used for the analysis yielding 136 loci, of which 87 were polymorphic. The average number of amplified loci and the average number of polymorphic loci per primer were 11.3 and 7.25, respectively, while the percent loci polymorphism ranged from 11.1 to 81.8 with an average of 63.9 across all the genotypes. The range of polymorphism information content scores was 0.03-0.49, with an average of 0.18. A dendrogram was generated based on the similarity matrix by the Unweighted Pair Group Method with Arithmetic Mean (UPGMA), wherein the flax genotypes were grouped in five clusters. The Jaccard's similarity coefficient among the genotypes ranged from 0.60 to 0.97. When the omega-3 alpha linolenic acid (ALA) contents of the individual genotypes were correlated with the clusters in the dendrogram, the high ALA containing genotypes were grouped in two clusters. This study identified SLS 50, Ayogi, and Sheetal to be the most diverse genotypes and suggested their use in breeding programs and for developing mapping populations.
Diversity of secretion systems associated with virulence characteristics of the classical bordetellae.

PubMed

Park, Jihye; Zhang, Ying; Chen, Chun; Dudley, Edward G; Harvill, Eric T

2015-12-01

Secretion systems are key virulence factors, modulating interactions between pathogens and the host's immune response. Six potential secretion systems (types 1-6; T1SS-T6SS) have been discussed in classical bordetellae, respiratory commensals/pathogens of mammals. The prototypical Bordetella bronchiseptica strain RB50 genome seems to contain all six systems, whilst two human-restricted subspecies, Bordetella parapertussis and Bordetella pertussis, have lost different subsets of these. This implicates secretion systems in the divergent evolutionary histories that have led to their success in different niches. Based on our previous work demonstrating that changes in secretion systems are associated with virulence characteristics, we hypothesized there would be substantial divergence of the loci encoding each amongst sequenced strains. Here, we describe extensive differences in secretion system loci; 10 of the 11 sequenced strains had lost subsets of genes or one entire secretion system locus. These loci contained genes homologous to those present in the respective loci in distantly related organisms, as well as genes unique to bordetellae, suggesting novel and/or auxiliary functions. The high degree of conservation of the T3SS locus, a complex machine with interdependent parts that must be conserved, stands in dramatic contrast to repeated loss of T5aSS 'autotransporters', which function as an autonomous unit. This comparative analysis provided insights into critical aspects of each pathogen's adaptation to its different niche, and the relative contributions of recombination, mutation and horizontal gene transfer. In addition, the relative conservation of various secretion systems is an important consideration in the ongoing search for more highly conserved protective antigens for the next generation of pertussis vaccines.
Development and characterization of microsatellite markers for the medicinal plant Smilax brasiliensis (Smilacaceae) and related species1

PubMed Central

Martins, Aline R.; Abreu, Aluana G.; Bajay, Miklos M.; Villela, Priscilla M. S.; Batista, Carlos E. A.; Monteiro, Mariza; Alves-Pereira, Alessandro; Figueira, Glyn M.; Pinheiro, José B.; Appezzato-da-Glória, Beatriz; Zucchi, Maria I.

2013-01-01

• Premise of the study: A new set of microsatellite or simple sequence repeat (SSR) markers were developed for Smilax brasiliensis, which is popularly known as sarsaparilla and used in folk medicine as a tonic, antirheumatic, and antisyphilitic. Smilax brasiliensis is sold in Brazilian pharmacies, and its origin and effectiveness are not subject to quality control. • Methods and Results: Using a protocol for genomic library enrichment, primer pairs were developed for 26 microsatellite loci and validated in 17 accessions of S. brasiliensis. Thirteen loci were polymorphic and four were monomorphic. The primers successfully amplified alleles in the congeners S. campestris, S. cissoides, S. fluminensis, S. goyazana, S. polyantha, S. quinquenervia, S. rufescens, S. subsessiliflora, and S. syphilitica. • Conclusions: The new SSR markers described herein are informative tools for genetic diversity and gene flow studies in S. brasiliensis and several congeners. PMID:25202555
[Paternity study in Chilean families using DNA fingerprints and erythrocyte blood markers].

PubMed

Aguirre, R; Blanco, R; Cifuentes, L; Chiffelle, I; Armanet, L; Vargas, J; Jara, L

1992-10-01

In the last decade, the electromorphic phenotype corresponding to extremely polymorphic zones of DNA, that include variable number of tandem repeat loci (VNTR) of oligonucleotide sequences, have been added to classical markers to elucidate the problems of parenthood identification and ascription in human beings. Using VNTR of several loci, a band profile practically unique for each individual is obtained (DNA-fingerprints). Since the pattern of VNTR electrophoretic bands is inherited from parents in a proportion of 50% from each one, this system is extremely useful for paternity ascription or exclusion. Nine nuclear families were studied, randomly selected from a group of 170 families that were analyzed using 5 erythrocyte genetic markers and with VNTRs detected using the multi locus probe (CAC)5, aiming to explore the concordance of both methods. Results were similar for both methods; however for VNTR, there is no information available on population frequency of polymorphisms.

A 1,681-locus consensus genetic map of cultivated cucumber including 67 NB-LRR resistance gene homolog and ten gene loci

PubMed Central

2013-01-01

Background Cucumber is an important vegetable crop that is susceptible to many pathogens, but no disease resistance (R) genes have been cloned. The availability of whole genome sequences provides an excellent opportunity for systematic identification and characterization of the nucleotide binding and leucine-rich repeat (NB-LRR) type R gene homolog (RGH) sequences in the genome. Cucumber has a very narrow genetic base making it difficult to construct high-density genetic maps. Development of a consensus map by synthesizing information from multiple segregating populations is a method of choice to increase marker density. As such, the objectives of the present study were to identify and characterize NB-LRR type RGHs, and to develop a high-density, integrated cucumber genetic-physical map anchored with RGH loci. Results From the Gy14 draft genome, 70 NB-containing RGHs were identified and characterized. Most RGHs were in clusters with uneven distribution across seven chromosomes. In silico analysis indicated that all 70 RGHs had EST support for gene expression. Phylogenetic analysis classified 58 RGHs into two clades: CNL and TNL. Comparative analysis revealed high-degree sequence homology and synteny in chromosomal locations of these RGH members between the cucumber and melon genomes. Fifty-four molecular markers were developed to delimit 67 of the 70 RGHs, which were integrated into a genetic map through linkage analysis. A 1,681-locus cucumber consensus map including 10 gene loci and spanning 730.0 cM in seven linkage groups was developed by integrating three component maps with a bin-mapping strategy. Physically, 308 scaffolds with 193.2 Mbp total DNA sequences were anchored onto this consensus map that covered 52.6% of the 367 Mbp cucumber genome. Conclusions Cucumber contains relatively few NB-LRR RGHs that are clustered and unevenly distributed in the genome. All RGHs seem to be transcribed and shared significant sequence homology and synteny with the melon genome suggesting conservation of these RGHs in the Cucumis lineage. The 1,681-locus consensus genetic-physical map developed and the RGHs identified and characterized herein are valuable genomics resources that may have many applications such as quantitative trait loci identification, map-based gene cloning, association mapping, marker-assisted selection, as well as assembly of a more complete cucumber genome. PMID:23531125
Novel and highly informative Capsicum SSR markers and their cross-species transferability.

PubMed

Buso, G S C; Reis, A M M; Amaral, Z P S; Ferreira, M E

2016-09-23

This study was undertaken primarily to develop new simple sequence repeat (SSR) markers for Capsicum. As part of this project aimed at broadening the use of molecular tools in Capsicum breeding, two genomic libraries enriched for AG/TC repeat sequences were constructed for Capsicum annuum. A total of 475 DNA clones were sequenced from both libraries and 144 SSR markers were tested on cultivated and wild species of Capsicum. Forty-five SSR markers were randomly selected to genotype a panel of 48 accessions of the Capsicum germplasm bank. The number of alleles per locus ranged from 2 to 11, with an average of 6 alleles. The polymorphism information content was on average 0.60, ranging from 0.20 to 0.83. The cross-species transferability to seven cultivated and wild Capsicum species was tested with a set of 91 SSR markers. We found that a high proportion of the loci produced amplicons in all species tested. C. frutescens had the highest number of transferable markers, whereas the wild species had the lowest. Our results indicate that the new markers can be readily used in genetic analyses of Capsicum.
Comparative Analysis of the Orphan CRISPR2 Locus in 242 Enterococcus faecalis Strains

PubMed Central

Hullahalli, Karthik; Rodrigues, Marinelle; Schmidt, Brendan D.; Li, Xiang; Bhardwaj, Pooja; Palmer, Kelli L.

2015-01-01

Clustered, Regularly Interspaced Short Palindromic Repeats and their associated Cas proteins (CRISPR-Cas) provide prokaryotes with a mechanism for defense against mobile genetic elements (MGEs). A CRISPR locus is a molecular memory of MGE encounters. It contains an array of short sequences, called spacers, that generally have sequence identity to MGEs. Three different CRISPR loci have been identified among strains of the opportunistic pathogen Enterococcus faecalis. CRISPR1 and CRISPR3 are associated with the cas genes necessary for blocking MGEs, but these loci are present in only a subset of E. faecalis strains. The orphan CRISPR2 lacks cas genes and is ubiquitous in E. faecalis, although its spacer content varies from strain to strain. Because CRISPR2 is a variable locus occurring in all E. faecalis, comparative analysis of CRISPR2 sequences may provide information about the clonality of E. faecalis strains. We examined CRISPR2 sequences from 228 E. faecalis genomes in relationship to subspecies phylogenetic lineages (sequence types; STs) determined by multilocus sequence typing (MLST), and to a genome phylogeny generated for a representative 71 genomes. We found that specific CRISPR2 sequences are associated with specific STs and with specific branches on the genome tree. To explore possible applications of CRISPR2 analysis, we evaluated 14 E. faecalis bloodstream isolates using CRISPR2 analysis and MLST. CRISPR2 analysis identified two groups of clonal strains among the 14 isolates, an assessment that was confirmed by MLST. CRISPR2 analysis was also used to accurately predict the ST of a subset of isolates. We conclude that CRISPR2 analysis, while not a replacement for MLST, is an inexpensive method to assess clonality among E. faecalis isolates, and can be used in conjunction with MLST to identify recombination events occurring between STs. PMID:26398194
Genetic Fingerprinting Using Microsatellite Markers in a Multiplex PCR Reaction: A Compilation of Methodological Approaches from Primer Design to Detection Systems.

PubMed

Krüger, Jacqueline; Schleinitz, Dorit

2017-01-01

Microsatellites are polymorphic DNA loci comprising repeated sequence motifs of two to five base pairs which are dispersed throughout the genome. Genotyping of microsatellites is a widely accepted tool for diagnostic and research purposes such as forensic investigations and parentage testing, but also in clinics (e.g. monitoring of bone marrow transplantation), as well as for the agriculture and food industries. The co-amplification of several short tandem repeat (STR) systems in a multiplex reaction with simultaneous detection helps to obtain more information from a DNA sample where its availability may be limited. Here, we introduce and describe this commonly used genotyping technique, providing an overview on available resources on STRs, multiplex design, and analysis.
Simultaneous site-directed mutagenesis of duplicated loci in soybean using a single guide RNA.

PubMed

Kanazashi, Yuhei; Hirose, Aya; Takahashi, Ippei; Mikami, Masafumi; Endo, Masaki; Hirose, Sakiko; Toki, Seiichi; Kaga, Akito; Naito, Ken; Ishimoto, Masao; Abe, Jun; Yamada, Tetsuya

2018-03-01

Using a gRNA and Agrobacterium-mediated transformation, we performed simultaneous site-directed mutagenesis of two GmPPD loci in soybean. Mutations in GmPPD loci were confirmed in at least 33% of T 2 seeds. The clustered regularly interspaced short palindromic repeat (CRISPR)/CRISPR-associated endonuclease 9 (Cas9) system is a powerful tool for site-directed mutagenesis in crops. Using a single guide RNA (gRNA) and Agrobacterium-mediated transformation, we performed simultaneous site-directed mutagenesis of two homoeologous loci in soybean (Glycine max), GmPPD1 and GmPPD2, which encode the orthologs of Arabidopsis thaliana PEAPOD (PPD). Most of the T 1 plants had heterozygous and/or chimeric mutations for the targeted loci. The sequencing analysis of T 1 and T 2 generations indicates that putative mutation induced in the T 0 plant is transmitted to the T 1 generation. The inheritable mutation induced in the T 1 plant was also detected. This result indicates that continuous induction of mutations during T 1 plant development increases the occurrence of mutations in germ cells, which ensures the transmission of mutations to the next generation. Simultaneous site-directed mutagenesis in both GmPPD loci was confirmed in at least 33% of T 2 seeds examined. Approximately 19% of double mutants did not contain the Cas9/gRNA expression construct. Double mutants with frameshift mutations in both GmPPD1 and GmPPD2 had dome-shaped trifoliate leaves, extremely twisted pods, and produced few seeds. Taken together, our data indicate that continuous induction of mutations in the whole plant and advancing generations of transgenic plants enable efficient simultaneous site-directed mutagenesis in duplicated loci in soybean.
Development of Seven Microsatellite Markers Using Next Generation Sequencing for the Conservation on the Korean Population of Dorcus hopei (E. Saunders, 1854) (Coleoptera, Lucanidae)

PubMed Central

Kang, Tae Hwa; Han, Sang Hoon; Park, Sun Jae

2015-01-01

We developed microsatellite markers for genetic structural analyses of Dorcus hopei, a stag beetle species, using next generation sequencing and polymerase chain reaction (PCR)-based genotyping for regional populations. A total of 407,070,351 base pairs of genomic DNA containing >4000 microsatellite loci except AT repeats were sequenced. From 76 loci selected for primer design, 27 were polymorphic. Of these 27 markers, 10 were tested on three regional populations: two Chinese (Shichuan and Guangxi) and one Korean (Wanju). Three markers were excluded due to inconsistent amplification, genotyping errors, and Hardy-Weinberg equilibrium (HWE). By multi-locus genotyping, the allele number, observed heterozygosity and polymorphism information content of seven microsatellite loci were ranged 2‒10, 0.1333‒1.0000, and 0.1228‒0.8509, respectively. In an analysis on the genetic differentiation among regional populations including one Japanese population and one cross-breeding population, the individual colored bar-plots showed that both Chinese populations were closer to each other than to the Far East Asian populations. In Far East Asian populations, Wanju and Nirasaki populations could not be distinguished from each other because the frequency of genetic contents was very similar in some individuals of two populations. Moreover, the cross-breeding population contained all patterns of genetic contents shown in Chinese, Korean, and Japanese populations, compared with the genetic content frequency of each regional population. As a result, we examined whether the cross-breeding population might be a hybrid population, and might contain a possibility of interbreeding with Chinese populations in parental generations. Therefore, these markers will be useful for analyses of genetic diversity in populations, genetic relationships between regional populations, genetic structure analyses, and origin tests. PMID:26370965
Genome-wide distribution comparative and composition analysis of the SSRs in Poaceae.

PubMed

Wang, Yi; Yang, Chao; Jin, Qiaojun; Zhou, Dongjie; Wang, Shuangshuang; Yu, Yuanjie; Yang, Long

2015-02-15

The Poaceae family is of great importance to human beings since it comprises the cereal grasses which are the main sources for human food and animal feed. With the rapid growth of genomic data from Poaceae members, comparative genomics becomes a convinent method to study genetics of diffierent species. The SSRs (Simple Sequence Repeats) are widely used markers in the studies of Poaceae for their high abundance and stability. In this study, using the genomic sequences of 9 Poaceae species, we detected 11,993,943 SSR loci and developed 6,799,910 SSR primer pairs. The results show that SSRs are distributed on all the genomic elements in grass. Hexamer is the most frequent motif and AT/TA is the most frequent motif in dimer. The abundance of the SSRs has a positive linear relationship with the recombination rate. SSR sequences in the coding regions involve a higher GC content in the Poaceae than that in the other species. SSRs of 70-80 bp in length showed the highest AT/GC base ratio among all of these loci. The result shows the highest polymorphism rate belongs to the SSRs ranged from 30 bp to 40 bp. Using all the SSR primers of Japonica, nineteen universal primers were selected and located on the genome of the grass family. The information of SSR loci, the SSR primers and the tools of mining and analyzing SSR are provided in the PSSRD (Poaceae SSR Database, http://biodb.sdau.edu.cn/pssrd/). Our study and the PSSRD database provide a foundation for the comparative study in the Poaceae and it will accelerate the study on markers application, gene mapping and molecular breeding.
Simple Sequence Repeats in Escherichia coli: Abundance, Distribution, Composition, and Polymorphism

PubMed Central

Gur-Arie, Riva; Cohen, Cyril J.; Eitan, Yuval; Shelef, Leora; Hallerman, Eric M.; Kashi, Yechezkel

2000-01-01

Computer-based genome-wide screening of the DNA sequence of Escherichia coli strain K12 revealed tens of thousands of tandem simple sequence repeat (SSR) tracts, with motifs ranging from 1 to 6 nucleotides. SSRs were well distributed throughout the genome. Mononucleotide SSRs were over-represented in noncoding regions and under-represented in open reading frames (ORFs). Nucleotide composition of mono- and dinucleotide SSRs, both in ORFs and in noncoding regions, differed from that of the genomic region in which they occurred, with 93% of all mononucleotide SSRs proving to be of A or T. Computer-based analysis of the fine position of every SSR locus in the noncoding portion of the genome relative to downstream ORFs showed SSRs located in areas that could affect gene regulation. DNA sequences at 14 arbitrarily chosen SSR tracts were compared among E. coli strains. Polymorphisms of SSR copy number were observed at four of seven mononucleotide SSR tracts screened, with all polymorphisms occurring in noncoding regions. SSR polymorphism could prove important as a genome-wide source of variation, both for practical applications (including rapid detection, strain identification, and detection of loci affecting key phenotypes) and for evolutionary adaptation of microbes.[The sequence data described in this paper have been submitted to the GenBank data library under accession numbers AF209020–209030 and AF209508–209518.] PMID:10645951
Use of the LUS in sequence allele designations to facilitate probabilistic genotyping of NGS-based STR typing results.

PubMed

Just, Rebecca S; Irwin, Jodi A

2018-05-01

Some of the expected advantages of next generation sequencing (NGS) for short tandem repeat (STR) typing include enhanced mixture detection and genotype resolution via sequence variation among non-homologous alleles of the same length. However, at the same time that NGS methods for forensic DNA typing have advanced in recent years, many caseworking laboratories have implemented or are transitioning to probabilistic genotyping to assist the interpretation of complex autosomal STR typing results. Current probabilistic software programs are designed for length-based data, and were not intended to accommodate sequence strings as the product input. Yet to leverage the benefits of NGS for enhanced genotyping and mixture deconvolution, the sequence variation among same-length products must be utilized in some form. Here, we propose use of the longest uninterrupted stretch (LUS) in allele designations as a simple method to represent sequence variation within the STR repeat regions and facilitate - in the nearterm - probabilistic interpretation of NGS-based typing results. An examination of published population data indicated that a reference LUS region is straightforward to define for most autosomal STR loci, and that using repeat unit plus LUS length as the allele designator can represent greater than 80% of the alleles detected by sequencing. A proof of concept study performed using a freely available probabilistic software demonstrated that the LUS length can be used in allele designations when a program does not require alleles to be integers, and that utilizing sequence information improves interpretation of both single-source and mixed contributor STR typing results as compared to using repeat unit information alone. The LUS concept for allele designation maintains the repeat-based allele nomenclature that will permit backward compatibility to extant STR databases, and the LUS lengths themselves will be concordant regardless of the NGS assay or analysis tools employed. Further, these biologically based, easy-to-derive designations uphold clear relationships between parent alleles and their stutter products, enabling analysis in fully continuous probabilistic programs that model stutter while avoiding the algorithmic complexities that come with string based searches. Though using repeat unit plus LUS length as the allele designator does not capture variation that occurs outside of the core repeat regions, this straightforward approach would permit the large majority of known STR sequence variation to be used for mixture deconvolution and, in turn, result in more informative mixture statistics in the near term. Ultimately, the method could bridge the gap from current length-based probabilistic systems to facilitate broader adoption of NGS by forensic DNA testing laboratories. Copyright © 2018 The Authors. Published by Elsevier B.V. All rights reserved.
DNA Fingerprint Analysis of Three Short Tandem Repeat (STR) Loci for Biochemistry and Forensic Science Laboratory Courses

ERIC Educational Resources Information Center

McNamara-Schroeder, Kathleen; Olonan, Cheryl; Chu, Simon; Montoya, Maria C.; Alviri, Mahta; Ginty, Shannon; Love, John J.

2006-01-01

We have devised and implemented a DNA fingerprinting module for an upper division undergraduate laboratory based on the amplification and analysis of three of the 13 short tandem repeat loci that are required by the Federal Bureau of Investigation Combined DNA Index System (FBI CODIS) data base. Students first collect human epithelial (cheek)…
Coherent Somatic Mutation in Autoimmune Disease

PubMed Central

Ross, Kenneth Andrew

2014-01-01

Background Many aspects of autoimmune disease are not well understood, including the specificities of autoimmune targets, and patterns of co-morbidity and cross-heritability across diseases. Prior work has provided evidence that somatic mutation caused by gene conversion and deletion at segmentally duplicated loci is relevant to several diseases. Simple tandem repeat (STR) sequence is highly mutable, both somatically and in the germ-line, and somatic STR mutations are observed under inflammation. Results Protein-coding genes spanning STRs having markers of mutability, including germ-line variability, high total length, repeat count and/or repeat similarity, are evaluated in the context of autoimmunity. For the initiation of autoimmune disease, antigens whose autoantibodies are the first observed in a disease, termed primary autoantigens, are informative. Three primary autoantigens, thyroid peroxidase (TPO), phogrin (PTPRN2) and filaggrin (FLG), include STRs that are among the eleven longest STRs spanned by protein-coding genes. This association of primary autoantigens with long STR sequence is highly significant (). Long STRs occur within twenty genes that are associated with sixteen common autoimmune diseases and atherosclerosis. The repeat within the TTC34 gene is an outlier in terms of length and a link with systemic lupus erythematosus is proposed. Conclusions The results support the hypothesis that many autoimmune diseases are triggered by immune responses to proteins whose DNA sequence mutates somatically in a coherent, consistent fashion. Other autoimmune diseases may be caused by coherent somatic mutations in immune cells. The coherent somatic mutation hypothesis has the potential to be a comprehensive explanation for the initiation of many autoimmune diseases. PMID:24988487
Androgen receptor and monoamine oxidase polymorphism in wild bonobos

PubMed Central

Garai, Cintia; Furuichi, Takeshi; Kawamoto, Yoshi; Ryu, Heungjin; Inoue-Murayama, Miho

2014-01-01

Androgen receptor gene (AR), monoamine oxidase A gene (MAOA) and monoamine oxidase B gene (MAOB) have been found to have associations with behavioral traits, such as aggressiveness, and disorders in humans. However, the extent to which similar genetic effects might influence the behavior of wild apes is unclear. We examined the loci AR glutamine repeat (ARQ), AR glycine repeat (ARG), MAOA intron 2 dinucleotide repeat (MAin2) and MAOB intron 2 dinucleotide repeat (MBin2) in 32 wild bonobos, Pan paniscus, and compared them with those of chimpanzees, Pan troglodytes, and humans. We found that bonobos were polymorphic on the four loci examined. Both loci MAin2 and MBin2 in bonobos showed a higher diversity than in chimpanzees. Because monoamine oxidase influences aggressiveness, the differences between the polymorphisms of MAin2 and MBin2 in bonobos and chimpanzees may be associated with the differences in aggression between the two species. In order to understand the evolution of these loci and AR, MAOA and MAOB in humans and non-human primates, it would be useful to conduct future studies focusing on the potential association between aggressiveness, and other personality traits, and polymorphisms documented in bonobos. PMID:25606465
Comprehensive mutation analysis of 17 Y-chromosomal short tandem repeat polymorphisms included in the AmpFlSTR Yfiler PCR amplification kit.

PubMed

Goedbloed, Miriam; Vermeulen, Mark; Fang, Rixun N; Lembring, Maria; Wollstein, Andreas; Ballantyne, Kaye; Lao, Oscar; Brauer, Silke; Krüger, Carmen; Roewer, Lutz; Lessig, Rüdiger; Ploski, Rafal; Dobosz, Tadeusz; Henke, Lotte; Henke, Jürgen; Furtado, Manohar R; Kayser, Manfred

2009-11-01

The Y-chromosomal short tandem repeat (Y-STR) polymorphisms included in the AmpFlSTR Yfiler polymerase chain reaction amplification kit have become widely used for forensic and evolutionary applications where a reliable knowledge on mutation properties is necessary for correct data interpretation. Therefore, we investigated the 17 Yfiler Y-STRs in 1,730-1,764 DNA-confirmed father-son pairs per locus and found 84 sequence-confirmed mutations among the 29,792 meiotic transfers covered. Of the 84 mutations, 83 (98.8%) were single-repeat changes and one (1.2%) was a double-repeat change (ratio, 1:0.01), as well as 43 (51.2%) were repeat gains and 41 (48.8%) repeat losses (ratio, 1:0.95). Medians from Bayesian estimation of locus-specific mutation rates ranged from 0.0003 for DYS448 to 0.0074 for DYS458, with a median rate across all 17 Y-STRs of 0.0025. The mean age (at the time of son's birth) of fathers with mutations was with 34.40 (+/-11.63) years higher than that of fathers without ones at 30.32 (+/-10.22) years, a difference that is highly statistically significant (p < 0.001). A Poisson-based modeling revealed that the Y-STR mutation rate increased with increasing father's age on a statistically significant level (alpha = 0.0294, 2.5% quantile = 0.0001). From combining our data with those previously published, considering all together 135,212 meiotic events and 331 mutations, we conclude for the Yfiler Y-STRs that (1) none had a mutation rate of >1%, 12 had mutation rates of >0.1% and four of <0.1%, (2) single-repeat changes were strongly favored over multiple-repeat ones for all loci but 1 and (3) considerable variation existed among loci in the ratio of repeat gains versus losses. Our finding of three Y-STR mutations in one father-son pair (and two pairs with two mutations each) has consequences for determining the threshold of allelic differences to conclude exclusion constellations in future applications of Y-STRs in paternity testing and pedigree analyses.
Sub-typing of extended-spectrum-β-lactamase-producing isolates from a nosocomial outbreak: application of a 10-loci generic Escherichia coli multi-locus variable number tandem repeat analysis.

PubMed

Karami, Nahid; Helldal, Lisa; Welinder-Olsson, Christina; Ahrén, Christina; Moore, Edward R B

2013-01-01

Extended-spectrum β-lactamase producing Escherichia coli (ESBL-E. coli) were isolated from infants hospitalized in a neonatal, post-surgery ward during a four-month-long nosocomial outbreak and six-month follow-up period. A multi-locus variable number tandem repeat analysis (MLVA), using 10 loci (GECM-10), for 'generic' (i.e., non-STEC) E. coli was applied for sub-species-level (i.e., sub-typing) delineation and characterization of the bacterial isolates. Ten distinct GECM-10 types were detected among 50 isolates, correlating with the types defined by pulsed-field gel electrophoresis (PFGE), which is recognized to be the 'gold-standard' method for clinical epidemiological analyses. Multi-locus sequence typing (MLST), multiplex PCR genotyping of bla CTX-M, bla TEM, bla OXA and bla SHV genes and antibiotic resistance profiling, as well as a PCR assay specific for detecting isolates of the pandemic O25b-ST131 strain, further characterized the outbreak isolates. Two clusters of isolates with distinct GECM-10 types (G06-04 and G07-02), corresponding to two major PFGE types and the MLST-based sequence types (STs) 131 and 1444, respectively, were confirmed to be responsible for the outbreak. The application of GECM-10 sub-typing provided reliable, rapid and cost-effective epidemiological characterizations of the ESBL-producing isolates from a nosocomial outbreak that correlated with and may be used to replace the laborious PFGE protocol for analyzing generic E. coli.
A reference genetic linkage map of apomictic Hieracium species based on expressed markers derived from developing ovule transcripts.

PubMed

Shirasawa, Kenta; Hand, Melanie L; Henderson, Steven T; Okada, Takashi; Johnson, Susan D; Taylor, Jennifer M; Spriggs, Andrew; Siddons, Hayley; Hirakawa, Hideki; Isobe, Sachiko; Tabata, Satoshi; Koltunow, Anna M G

2015-03-01

Apomixis in plants generates clonal progeny with a maternal genotype through asexual seed formation. Hieracium subgenus Pilosella (Asteraceae) contains polyploid, highly heterozygous apomictic and sexual species. Within apomictic Hieracium, dominant genetic loci independently regulate the qualitative developmental components of apomixis. In H. praealtum, LOSS OF APOMEIOSIS (LOA) enables formation of embryo sacs without meiosis and LOSS OF PARTHENOGENESIS (LOP) enables fertilization-independent seed formation. A locus required for fertilization-independent endosperm formation (AutE) has been identified in H. piloselloides. Additional quantitative loci appear to influence the penetrance of the qualitative loci, although the controlling genes remain unknown. This study aimed to develop the first genetic linkage maps for sexual and apomictic Hieracium species using simple sequence repeat (SSR) markers derived from expressed transcripts within the developing ovaries. RNA from microdissected Hieracium ovule cell types and ovaries was sequenced and SSRs were identified. Two different F1 mapping populations were created to overcome difficulties associated with genome complexity and asexual reproduction. SSR markers were analysed within each mapping population to generate draft linkage maps for apomictic and sexual Hieracium species. A collection of 14 684 Hieracium expressed SSR markers were developed and linkage maps were constructed for Hieracium species using a subset of the SSR markers. Both the LOA and LOP loci were successfully assigned to linkage groups; however, AutE could not be mapped using the current populations. Comparisons with lettuce (Lactuca sativa) revealed partial macrosynteny between the two Asteraceae species. A collection of SSR markers and draft linkage maps were developed for two apomictic and one sexual Hieracium species. These maps will support cloning of controlling genes at LOA and LOP loci in Hieracium and should also assist with identification of quantitative loci that affect the expressivity of apomixis. Future work will focus on mapping AutE using alternative populations. © The Author 2014. Published by Oxford University Press on behalf of the Annals of Botany Company. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Population genetics structure of glyphosate-resistant Johnsongrass (Sorghum halepense L. Pers) does not support a single origin of the resistance

PubMed Central

Fernández, Luis; de Haro, Luis Alejandro; Distefano, Ana J; Carolina Martínez, Maria; Lía, Verónica; Papa, Juan C; Olea, Ignacio; Tosto, Daniela; Esteban Hopp, Horacio

2013-01-01

Single sequence repeats (SSR) developed for Sorghum bicolor were used to characterize the genetic distance of 46 different Sorghum halepense (Johnsongrass) accessions from Argentina some of which have evolved toward glyphosate resistance. Since Johnsongrass is an allotetraploid and only one subgenome is homologous to cultivated sorghum, some SSR loci amplified up to two alleles while others (presumably more conserved loci) amplified up to four alleles. Twelve SSR providing information of 24 loci representative of Johnsongrass genome were selected for genetic distance characterization. All of them were highly polymorphic, which was evidenced by the number of different alleles found in the samples studied, in some of them up to 20. UPGMA and Mantel analysis showed that Johnsongrass glyphosate-resistant accessions that belong to different geographic regions do not share similar genetic backgrounds. In contrast, they show closer similarity to their neighboring susceptible counterparts. Discriminant Analysis of Principal Components using the clusters identified by K-means support the lack of a clear pattern of association among samples and resistance status or province of origin. Consequently, these results do not support a single genetic origin of glyphosate resistance. Nucleotide sequencing of the 5-enolpyruvylshikimate-3-phosphate synthase (EPSPS) encoding gene from glyphosate-resistant and susceptible accessions collected from different geographic origins showed that none presented expected mutations in aminoacid positions 101 and 106 which are diagnostic of target-site resistance mechanism. PMID:24223277
High polymorphism in Est-SSR loci for cellulose synthase and β-amylase of sugarcane varieties (Saccharum spp.) used by the industrial sector for ethanol production.

PubMed

Augusto, Raphael; Maranho, Rone Charles; Mangolin, Claudete Aparecida; Pires da Silva Machado, Maria de Fátima

2015-01-01

High and low polymorphisms in simple sequence repeats of expressed sequence tag (EST-SSR) for specific proteins and enzymes, such as β-amylase, cellulose synthase, xyloglucan endotransglucosylase, fructose 1,6-bisphosphate aldolase, and fructose 1,6-bisphosphatase, were used to illustrate the genetic divergence within and between varieties of sugarcane (Saccharum spp.) and to guide the technological paths to optimize ethanol production from lignocellulose biomass. The varieties RB72454, RB867515, RB92579, and SP813250 on the second stage of cutting, all grown in the state of Paraná (PR), and the varieties RB92579 and SP813250 cultured in the PR state and in Northeastern Brazil, state of Pernambuco (PE), were analyzed using five EST-SSR primers for EstC66, EstC67, EstC68, EstC69, and EstC91 loci. Genetic divergence was evident in the EstC67 and EstC69 loci for β-amylase and cellulose synthase, respectively, among the four sugarcane varieties. An extremely high level of genetic differentiation was also detected in the EstC67 locus from the RB82579 and SP813250 varieties cultured in the PR and PE states. High polymorphism in SSR of the cellulose synthase locus may explain the high variability of substrates used in pretreatment and enzymatic hydrolysis processes, which has been an obstacle to effective industrial adaptations.
Current and future developments in patents for quantitative trait loci in dairy cattle.

PubMed

Weller, Joel I

2007-01-01

Many studies have proposed that rates of genetic gain in dairy cattle can be increased by direct selection on the individual quantitative loci responsible for the genetic variation in these traits, or selection on linked genetic markers. The development of DNA-level genetic markers has made detection of QTL nearly routine in all major livestock species. The studies that attempted to detect genes affecting quantitative traits can be divided into two categories: analysis of candidate genes, and genome scans based on within-family genetic linkage. To date, 12 patent cooperative treaty (PCT) and US patents have been registered for DNA sequences claimed to be associated with effects on economic traits in dairy cattle. All claim effects on milk production, but other traits are also included in some of the claims. Most of the sequences found by the candidate gene approach are of dubious validity, and have been repeated in only very few independent studies. The two missense mutations on chromosomes 6 and 14 affecting milk concentration derived from genome scans are more solidly based, but the claims are also disputed. A few PCT in dairy cattle are commercialized as genetic tests where commercial dairy farmers are the target market.
A genomic audit of newly-adopted autosomal STRs for forensic identification.

PubMed

Phillips, C

2017-07-01

In preparation for the growing use of massively parallel sequencing (MPS) technology to genotype forensic STRs, a comprehensive genomic audit of 73 STRs was made in 2016 [Parson et al., Forensic Sci. Int. Genet. 22, 54-63]. The loci examined included miniSTRs that were not in widespread use, but had been incorporated into MPS kits or were under consideration for this purpose. The current study expands the genomic analysis of autosomal STRs that are not commonly used, to include the full set of developed miniSTRs and an additional 24 STRs, most of which have been recently included in several supplementary forensic multiplex kits for capillary electrophoresis. The genomic audit of these 47 newly-adopted STRs examined the linkage status of new loci on the same chromosome as established forensic STRs; analyzed world-wide population variation of the newly-adopted STRs using published data; assessed their forensic informativeness; and compiled the sequence characteristics, repeat structures and flanking regions of each STR. A further 44 autosomal STRs developed for forensic analyses but not incorporated into commercial kits, are also briefly described. Copyright © 2017 Elsevier B.V. All rights reserved.
Comparative fine mapping of the Wax 1 (W1) locus in hexaploid wheat.

PubMed

Lu, Ping; Qin, Jinxia; Wang, Guoxin; Wang, Lili; Wang, Zhenzhong; Wu, Qiuhong; Xie, Jingzhong; Liang, Yong; Wang, Yong; Zhang, Deyun; Sun, Qixin; Liu, Zhiyong

2015-08-01

By applying comparative genomics analyses, a high-density genetic linkage map of the Wax 1 ( W1 ) locus was constructed as a framework for map-based cloning. Glaucousness is described as the scattering effect of visible light from wax deposited on the cuticle of plant aerial organs. In wheat, the wax on leaves and stems is mainly controlled by two sets of genes: glaucousness loci (W1 and W2) and non-glaucousness loci (Iw1 and Iw2). Bulked segregant analysis (BSA) and simple sequence repeat (SSR) mapping showed that Wax1 (W1) is located on chromosome arm 2BS between markers Xgwm210 and Xbarc35. By applying comparative genomics analyses, colinearity genomic regions of the W1 locus on wheat 2BS were identified in Brachypodium distachyon chromosome 5, rice chromosome 4 and sorghum chromosome 6, respectively. Four STS markers were developed using the Triticum aestivum cv. Chinese Spring 454 contig sequences and the International Wheat Genome Sequencing Consortium (IWGSC) survey sequences. W1 was mapped into a 0.93 cM genetic interval flanked by markers XWGGC3197 and XWGGC2484, which has synteny with genomic regions of 56.5 kb in Brachypodium, 390 kb in rice and 31.8 kb in sorghum. The fine genetic map can serve as a framework for chromosome landing, physical mapping and map-based cloning of the W1 in wheat.

Small interfering RNA-producing loci in the ancient parasitic eukaryote Trypanosoma brucei

PubMed Central

2012-01-01

Background At the core of the RNA interference (RNAi) pathway in Trypanosoma brucei is a single Argonaute protein, TbAGO1, with an established role in controlling retroposon and repeat transcripts. Recent evidence from higher eukaryotes suggests that a variety of genomic sequences with the potential to produce double-stranded RNA are sources for small interfering RNAs (siRNAs). Results To test whether such endogenous siRNAs are present in T. brucei and to probe the individual role of the two Dicer-like enzymes, we affinity purified TbAGO1 from wild-type procyclic trypanosomes, as well as from cells deficient in the cytoplasmic (TbDCL1) or nuclear (TbDCL2) Dicer, and subjected the bound RNAs to Illumina high-throughput sequencing. In wild-type cells the majority of reads originated from two classes of retroposons. We also considerably expanded the repertoire of trypanosome siRNAs to encompass a family of 147-bp satellite-like repeats, many of the regions where RNA polymerase II transcription converges, large inverted repeats and two pseudogenes. Production of these newly described siRNAs is strictly dependent on the nuclear DCL2. Notably, our data indicate that putative centromeric regions, excluding the CIR147 repeats, are not a significant source for endogenous siRNAs. Conclusions Our data suggest that endogenous RNAi targets may be as evolutionarily old as the mechanism itself. PMID:22925482
Insights into mutagenesis using Escherichia coli chromosomal lacZ strains that enable detection of a wide spectrum of mutational events.

PubMed

Seier, Tracey; Padgett, Dana R; Zilberberg, Gal; Sutera, Vincent A; Toha, Noor; Lovett, Susan T

2011-06-01

Strand misalignments at DNA repeats during replication are implicated in mutational hotspots. To study these events, we have generated strains carrying mutations in the Escherichia coli chromosomal lacZ gene that revert via deletion of a short duplicated sequence or by template switching within imperfect inverted repeat (quasipalindrome, QP) sequences. Using these strains, we demonstrate that mutation of the distal repeat of a quasipalindrome, with respect to replication fork movement, is about 10-fold higher than the proximal repeat, consistent with more common template switching on the leading strand. The leading strand bias was lost in the absence of exonucleases I and VII, suggesting that it results from more efficient suppression of template switching by 3' exonucleases targeted to the lagging strand. The loss of 3' exonucleases has no effect on strand misalignment at direct repeats to produce deletion. To compare these events to other mutations, we have reengineered reporters (designed by Cupples and Miller 1989) that detect specific base substitutions or frameshifts in lacZ with the reverting lacZ locus on the chromosome rather than an F' element. This set allows rapid screening of potential mutagens, environmental conditions, or genetic loci for effects on a broad set of mutational events. We found that hydroxyurea (HU), which depletes dNTP pools, slightly elevated templated mutations at inverted repeats but had no effect on deletions, simple frameshifts, or base substitutions. Mutations in nucleotide diphosphate kinase, ndk, significantly elevated simple mutations but had little effect on the templated class. Zebularine, a cytosine analog, elevated all classes.
A report on identification of sequence polymorphism in barcode region of six commercially important Cymbopogon species.

PubMed

Bishoyi, Ashok Kumar; Kavane, Aarti; Sharma, Anjali; Geetha, K A

2017-02-01

CYMBOPOGON: is an important member of grass family Poaceae, cultivated for essential oils which have greater medicinal and industrial value. Taxonomic identification of Cymbopogon species is determined mainly by morphological markers, odour of essential oils and concentration of bioactive compounds present in the oil matrices which are highly influenced by environment. Authenticated molecular marker based taxonomical identification is also lacking in the genus; hence effort was made to evaluate potential DNA barcode loci in six commercially important Cymbopogon species for their individual discrimination and authentication at the species level. Four widely used DNA barcoding regions viz., ITS 1 & ITS 2 spacers, matK, psbA-trnH and rbcL were taken for the study. Gene sequences of the same or related genera of the concerned loci were mined from NCBI domain and primers were designed and validated for barcode loci amplification. Out of the four loci studied, sequences from matK and ITS spacer loci revealed 0.46% and 5.64% nucleotide sequence diversity, respectively whereas the other two loci i.e., psbA-trnH and rbcL showed 100% sequence homology. The newly developed primers can be used for barcode loci amplification in the genus Cymbopogon. The identified Single Nucleotide Polymorphisms from the studied sequences may be used as barcodes for the six Cymbopogon species. The information generated can also be utilized for barcode development of the genus by including more number of Cymbopgon species in future.
Developing expressed sequence tag libraries and the discovery of simple sequence repeat markers for two species of raspberry (Rubus L.).

PubMed

Bushakra, Jill M; Lewers, Kim S; Staton, Margaret E; Zhebentyayeva, Tetyana; Saski, Christopher A

2015-10-26

Due to a relatively high level of codominant inheritance and transferability within and among taxonomic groups, simple sequence repeat (SSR) markers are important elements in comparative mapping and delineation of genomic regions associated with traits of economic importance. Expressed sequence tags (ESTs) are a source of SSRs that can be used to develop markers to facilitate plant breeding and for more basic research across genera and higher plant orders. Leaf and meristem tissue from 'Heritage' red raspberry (Rubus idaeus) and 'Bristol' black raspberry (R. occidentalis) were utilized for RNA extraction. After conversion to cDNA and library construction, ESTs were sequenced, quality verified, assembled and scanned for SSRs. Primers flanking the SSRs were designed and a subset tested for amplification, polymorphism and transferability across species. ESTs containing SSRs were functionally annotated using the GenBank non-redundant (nr) database and further classified using the gene ontology database. To accelerate development of EST-SSRs in the genus Rubus (Rosaceae), 1149 and 2358 cDNA sequences were generated from red raspberry and black raspberry, respectively. The cDNA sequences were screened using rigorous filtering criteria which resulted in the identification of 121 and 257 SSR loci for red and black raspberry, respectively. Primers were designed from the surrounding sequences resulting in 131 and 288 primer pairs, respectively, as some sequences contained more than one SSR locus. Sequence analysis revealed that the SSR-containing genes span a diversity of functions and share more sequence identity with strawberry genes than with other Rosaceous species. This resource of Rubus-specific, gene-derived markers will facilitate the construction of linkage maps composed of transferable markers for studying and manipulating important traits in this economically important genus.
Karyotype Analysis of Four Vicia Species using In Situ Hybridization with Repetitive Sequences

PubMed Central

NAVRÁTILOVÁ, ALICE; NEUMANN, PAVEL; MACAS, JIŘÍ

2003-01-01

Mitotic chromosomes of four Vicia species (V. sativa, V. grandiflora, V. pannonica and V. narbonensis) were subjected to in situ hybridization with probes derived from conserved plant repetitive DNA sequences (18S–25S and 5S rDNA, telomeres) and genus‐specific satellite repeats (VicTR‐A and VicTR‐B). Numbers and positions of hybridization signals provided cytogenetic landmarks suitable for unambiguous identification of all chromosomes, and establishment of the karyotypes. The VicTR‐A and ‐B sequences, in particular, produced highly informative banding patterns that alone were sufficient for discrimination of all chromosomes. However, these patterns were not conserved among species and thus could not be employed for identification of homologous chromosomes. This fact, together with observed variations in positions and numbers of rDNA loci, suggests considerable divergence between karyotypes of the species studied. PMID:12770847
B-Bolivia, an Allele of the Maize b1 Gene with Variable Expression, Contains a High Copy Retrotransposon-Related Sequence Immediately Upstream1

PubMed Central

Selinger, David A.; Chandler, Vicki L.

2001-01-01

The maize (Zea mays) b1 gene encodes a transcription factor that regulates the anthocyanin pigment pathway. Of the b1 alleles with distinct tissue-specific expression, B-Peru and B-Bolivia are the only alleles that confer seed pigmentation. B-Bolivia produces variable and weaker seed expression but darker, more regular plant expression relative to B-Peru. Our experiments demonstrated that B-Bolivia is not expressed in the seed when transmitted through the male. When transmitted through the female the proportion of kernels pigmented and the intensity of pigment varied. Molecular characterization of B-Bolivia demonstrated that it shares the first 530 bp of the upstream region with B-Peru, a region sufficient for seed expression. Immediately upstream of 530 bp, B-Bolivia is completely divergent from B-Peru. These sequences share sequence similarity to retrotransposons. Transient expression assays of various promoter constructs identified a 33-bp region in B-Bolivia that can account for the reduced aleurone pigment amounts (40%) observed with B-Bolivia relative to B-Peru. Transgenic plants carrying the B-Bolivia promoter proximal region produced pigmented seeds. Similar to native B-Bolivia, some transgene loci are variably expressed in seeds. In contrast to native B-Bolivia, the transgene loci are expressed in seeds when transmitted through both the male and female. Some transgenic lines produced pigment in vegetative tissues, but the tissue-specificity was different from B-Bolivia, suggesting the introduced sequences do not contain the B-Bolivia plant-specific regulatory sequences. We hypothesize that the chromatin context of the B-Bolivia allele controls its epigenetic seed expression properties, which could be influenced by the adjacent highly repeated retrotransposon sequence. PMID:11244116
Sequence capture of ultraconserved elements from bird museum specimens.

PubMed

McCormack, John E; Tsai, Whitney L E; Faircloth, Brant C

2016-09-01

New DNA sequencing technologies are allowing researchers to explore the genomes of the millions of natural history specimens collected prior to the molecular era. Yet, we know little about how well specific next-generation sequencing (NGS) techniques work with the degraded DNA typically extracted from museum specimens. Here, we use one type of NGS approach, sequence capture of ultraconserved elements (UCEs), to collect data from bird museum specimens as old as 120 years. We targeted 5060 UCE loci in 27 western scrub-jays (Aphelocoma californica) representing three evolutionary lineages that could be species, and we collected an average of 3749 UCE loci containing 4460 single nucleotide polymorphisms (SNPs). Despite older specimens producing fewer and shorter loci in general, we collected thousands of markers from even the oldest specimens. More sequencing reads per individual helped to boost the number of UCE loci we recovered from older specimens, but more sequencing was not as successful at increasing the length of loci. We detected contamination in some samples and determined that contamination was more prevalent in older samples that were subject to less sequencing. For the phylogeny generated from concatenated UCE loci, contamination led to incorrect placement of some individuals. In contrast, a species tree constructed from SNPs called within UCE loci correctly placed individuals into three monophyletic groups, perhaps because of the stricter analytical procedures used for SNP calling. This study and other recent studies on the genomics of museum specimens have profound implications for natural history collections, where millions of older specimens should now be considered genomic resources. © 2015 The Authors. Molecular Ecology Resources Published by John Wiley & Sons Ltd.
Evaluation of fire recurrence effect on genetic diversity in maritime pine (Pinus pinaster Ait.) stands using Inter-Simple Sequence Repeat profiles.

PubMed

Lucas-Borja, M E; Ahrazem, O; Candel-Pérez, D; Moya, D; Fonseca, T; Hernández Tecles, E; De Las Heras, J; Gómez-Gómez, L

2016-12-01

The management of maritime pine in fire-prone habitats is a challenging task and fine-scale population genetic analyses are necessary to check if different fire recurrences affect genetic variability. The objective of this study was to assess the effect of fire recurrence on maritime pine genetic diversity using inter-simple sequence repeat markers (ISSR). Three maritime pine (Pinus pinaster Ait.) populations from Northern Portugal were chosen to characterize the genetic variability among populations. In relation to fire recurrence, Seirós population was affected by fire both in 1990 and 2005 whereas Vila Seca-2 population was affected by fire just in 2005. The Vila Seca-1 population has been never affected by fire. Our results showed the highest Nei's genetic diversity (He=0.320), Shannon information index (I=0.474) and polymorphic loci (PPL=87.79%) among samples from twice burned populations (Seirós site). Thus, fire regime plays an important role affecting genetic diversity in the short-term, although not generating maritime pine genetic erosion. Copyright © 2016 Elsevier B.V. All rights reserved.
[Efficient genome editing in human pluripotent stem cells through CRISPR/Cas9].

PubMed

Liu, Gai-gai; Li, Shuang; Wei, Yu-da; Zhang, Yong-xian; Ding, Qiu-rong

2015-11-01

The RNA-guided CRISPR (clustered regularly interspaced short palindromic repeat)-associated Cas9 nuclease has offered a new platform for genome editing with high efficiency. Here, we report the use of CRISPR/Cas9 technology to target a specific genomic region in human pluripotent stem cells. We show that CRISPR/Cas9 can be used to disrupt a gene by introducing frameshift mutations to gene coding region; to knock in specific sequences (e.g. FLAG tag DNA sequence) to targeted genomic locus via homology directed repair; to induce large genomic deletion through dual-guide multiplex. Our results demonstrate the versatile application of CRISPR/Cas9 in stem cell genome editing, which can be widely utilized for functional studies of genes or genome loci in human pluripotent stem cells.
Mosaic microecological differential stress causes adaptive microsatellite divergence in wild barley, Hordeum spontaneum, at Neve Yaar, Israel.

PubMed

Huang, Qingyang; Beharav, Alex; Li, Youchun; Kirzhner, Valery; Nevo, Eviatar

2002-12-01

Genetic diversity at 38 microsatellite (short sequence repeats (SSRs)) loci was studied in a sample of 54 plants representing a natural population of wild barley, Hordeum spontaneum, at the Neve Yaar microsite in Israel. Wild barley at the microsite was organized in a mosaic pattern over an area of 3180 m2 in the open Tabor oak forest, which was subdivided into four microniches: (i) sun-rock (11 genotypes), (ii) sun-soil (18 genotypes), (iii) shade-soil (11 genotypes), and (iv) shade-rock (14 genotypes). Fifty-four genotypes were tested for ecological-genetic microniche correlates. Analysis of 36 loci showed that allele distributions at SSR loci were nonrandom but structured by ecological stresses (climatic and edaphic). Sixteen (45.7%) of 35 polymorphic loci varied significantly (p < 0.05) in allele frequencies among the microniches. Significant genetic divergence and diversity were found among the four subpopulations. The soil and shade subpopulations showed higher genetic diversities at SSR loci than the rock and sun subpopulations, and the lowest genetic diversity was observed in the sun-rock subpopulation, in contrast with the previous allozyme and RAPD studies. On average, of 36 loci, 88.75% of the total genetic diversity exists within the four microniches, while 11.25% exists between the microniches. In a permutation test, G(ST) was lower for 4999 out of 5000 randomized data sets (p < 0.001) when compared with real data (0.1125). The highest genetic distance was between shade-soil and sun-rock (D = 0.222). Our results suggest that diversifying natural selection may act upon some regulatory regions, resulting in adaptive SSR divergence. Fixation of some loci (GMS61, GMS1, and EBMAC824) at a specific microniche seems to suggest directional selection. The pattern of other SSR loci suggests the operation of balancing selection. SSRs may be either direct targets of selection or markers of selected haplotypes (selective sweep).
Polymorphism of CRISPR shows separated natural groupings of Shigella subtypes and evidence of horizontal transfer of CRISPR

PubMed Central

Yang, Chaojie; Li, Peng; Su, Wenli; Li, Hao; Liu, Hongbo; Yang, Guang; Xie, Jing; Yi, Shengjie; Wang, Jian; Cui, Xianyan; Wu, Zhihao; Wang, Ligui; Hao, Rongzhang; Jia, Leili; Qiu, Shaofu; Song, Hongbin

2015-01-01

Clustered, regularly interspaced, short palindromic repeats (CRISPR) act as an adaptive RNA-mediated immune mechanism in bacteria. They can also be used for identification and evolutionary studies based on polymorphisms within the CRISPR locus. We amplified and analyzed 6 CRISPR loci from 237 Shigella strains belonging to the 4 species groups, as well as 13 Escherichia coli strains. The CRISPR-associated (cas) gene sequence arrays of these strains were screened and compared. The CRISPR sequences from Shigella were conserved among subtypes, suggesting that CRISPR may represent a new identification tool for the detection and discrimination of Shigella species. Secondary structure analysis showed a different stem-loop structure at the terminal repeat, suggesting a distinct recognition mechanism in the formation of crRNA. In addition, the presence of “self-target” spacers and polymorphisms within CRISPR in Shigella indicated a selective pressure for inhibition of this system, which has the potential to damage “self DNA.” Homology analysis of spacers showed that CRISPR might be involved in the regulation of virulence transmission. Phylogenetic analysis based on CRISPR sequences from Shigella and E. coli indicated that although phenotypic properties maintain convergent evolution, the 4 Shigella species do not represent natural groupings. Surprisingly, comparative analysis of Shigella repeats with other species provided new evidence for CRISPR horizontal transfer. Our results suggested that CRISPR analysis is applicable for the detection of Shigella species and for investigation of evolutionary relationships. PMID:26327282
Development and application of microsatellites in candidate genes related to wood properties in the Chinese white poplar (Populus tomentosa Carr.).

PubMed

Du, Qingzhang; Gong, Chenrui; Pan, Wei; Zhang, Deqiang

2013-02-01

Gene-derived simple sequence repeats (genic SSRs), also known as functional markers, are often preferred over random genomic markers because they represent variation in gene coding and/or regulatory regions. We characterized 544 genic SSR loci derived from 138 candidate genes involved in wood formation, distributed throughout the genome of Populus tomentosa, a key ecological and cultivated wood production species. Of these SSRs, three-quarters were located in the promoter or intron regions, and dinucleotide (59.7%) and trinucleotide repeat motifs (26.5%) predominated. By screening 15 wild P. tomentosa ecotypes, we identified 188 polymorphic genic SSRs with 861 alleles, 2-7 alleles for each marker. Transferability analysis of 30 random genic SSRs, testing whether these SSRs work in 26 genotypes of five genus Populus sections (outgroup, Salix matsudana), showed that 72% of the SSRs could be amplified in Turanga and 100% could be amplified in Leuce. Based on genotyping of these 26 genotypes, a neighbour-joining analysis showed the expected six phylogenetic groupings. In silico analysis of SSR variation in 220 sequences that are homologous between P. tomentosa and Populus trichocarpa suggested that genic SSR variations between relatives were predominantly affected by repeat motif variations or flanking sequence mutations. Inheritance tests and single-marker associations demonstrated the power of genic SSRs in family-based linkage mapping and candidate gene-based association studies, as well as marker-assisted selection and comparative genomic studies of P. tomentosa and related species.
Polymorphism of CRISPR shows separated natural groupings of Shigella subtypes and evidence of horizontal transfer of CRISPR.

PubMed

Yang, Chaojie; Li, Peng; Su, Wenli; Li, Hao; Liu, Hongbo; Yang, Guang; Xie, Jing; Yi, Shengjie; Wang, Jian; Cui, Xianyan; Wu, Zhihao; Wang, Ligui; Hao, Rongzhang; Jia, Leili; Qiu, Shaofu; Song, Hongbin

2015-01-01

Clustered, regularly interspaced, short palindromic repeats (CRISPR) act as an adaptive RNA-mediated immune mechanism in bacteria. They can also be used for identification and evolutionary studies based on polymorphisms within the CRISPR locus. We amplified and analyzed 6 CRISPR loci from 237 Shigella strains belonging to the 4 species groups, as well as 13 Escherichia coli strains. The CRISPR-associated (cas) gene sequence arrays of these strains were screened and compared. The CRISPR sequences from Shigella were conserved among subtypes, suggesting that CRISPR may represent a new identification tool for the detection and discrimination of Shigella species. Secondary structure analysis showed a different stem-loop structure at the terminal repeat, suggesting a distinct recognition mechanism in the formation of crRNA. In addition, the presence of "self-target" spacers and polymorphisms within CRISPR in Shigella indicated a selective pressure for inhibition of this system, which has the potential to damage "self DNA." Homology analysis of spacers showed that CRISPR might be involved in the regulation of virulence transmission. Phylogenetic analysis based on CRISPR sequences from Shigella and E. coli indicated that although phenotypic properties maintain convergent evolution, the 4 Shigella species do not represent natural groupings. Surprisingly, comparative analysis of Shigella repeats with other species provided new evidence for CRISPR horizontal transfer. Our results suggested that CRISPR analysis is applicable for the detection of Shigella species and for investigation of evolutionary relationships.
Major Quantitative Trait Loci and Putative Candidate Genes for Powdery Mildew Resistance and Fruit-Related Traits Revealed by an Intraspecific Genetic Map for Watermelon (Citrullus lanatus var. lanatus).

PubMed

Kim, Kwang-Hwan; Hwang, Ji-Hyun; Han, Dong-Yeup; Park, Minkyu; Kim, Seungill; Choi, Doil; Kim, Yongjae; Lee, Gung Pyo; Kim, Sun-Tae; Park, Young-Hoon

2015-01-01

An intraspecific genetic map for watermelon was constructed using an F2 population derived from 'Arka Manik' × 'TS34' and transcript sequence variants and quantitative trait loci (QTL) for resistance to powdery mildew (PMR), seed size (SS), and fruit shape (FS) were analyzed. The map consists of 14 linkage groups (LGs) defined by 174 cleaved amplified polymorphic sequences (CAPS), 2 derived-cleaved amplified polymorphic sequence markers, 20 sequence-characterized amplified regions, and 8 expressed sequence tag-simple sequence repeat markers spanning 1,404.3 cM, with a mean marker interval of 6.9 cM and an average of 14.6 markers per LG. Genetic inheritance and QTL analyses indicated that each of the PMR, SS, and FS traits is controlled by an incompletely dominant effect of major QTLs designated as pmr2.1, ss2.1, and fsi3.1, respectively. The pmr2.1, detected on chromosome 2 (Chr02), explained 80.0% of the phenotypic variation (LOD = 30.76). This QTL was flanked by two CAPS markers, wsb2-24 (4.00 cM) and wsb2-39 (13.97 cM). The ss2.1, located close to pmr2.1 and CAPS marker wsb2-13 (1.00 cM) on Chr02, explained 92.3% of the phenotypic variation (LOD = 68.78). The fsi3.1, detected on Chr03, explained 79.7% of the phenotypic variation (LOD = 31.37) and was flanked by two CAPS, wsb3-24 (1.91 cM) and wsb3-9 (7.00 cM). Candidate gene-based CAPS markers were developed from the disease resistance and fruit shape gene homologs located on Chr.02 and Chr03 and were mapped on the intraspecific map. Colocalization of these markers with the major QTLs indicated that watermelon orthologs of a nucleotide-binding site-leucine-rich repeat class gene containing an RPW8 domain and a member of SUN containing the IQ67 domain are candidate genes for pmr2.1 and fsi3.1, respectively. The results presented herein provide useful information for marker-assisted breeding and gene cloning for PMR and fruit-related traits.
Major Quantitative Trait Loci and Putative Candidate Genes for Powdery Mildew Resistance and Fruit-Related Traits Revealed by an Intraspecific Genetic Map for Watermelon (Citrullus lanatus var. lanatus)

PubMed Central

Kim, Kwang-Hwan; Hwang, Ji-Hyun; Han, Dong-Yeup; Park, Minkyu; Kim, Seungill; Choi, Doil; Kim, Yongjae; Lee, Gung Pyo; Kim, Sun-Tae; Park, Young-Hoon

2015-01-01

An intraspecific genetic map for watermelon was constructed using an F2 population derived from ‘Arka Manik’ × ‘TS34’ and transcript sequence variants and quantitative trait loci (QTL) for resistance to powdery mildew (PMR), seed size (SS), and fruit shape (FS) were analyzed. The map consists of 14 linkage groups (LGs) defined by 174 cleaved amplified polymorphic sequences (CAPS), 2 derived-cleaved amplified polymorphic sequence markers, 20 sequence-characterized amplified regions, and 8 expressed sequence tag-simple sequence repeat markers spanning 1,404.3 cM, with a mean marker interval of 6.9 cM and an average of 14.6 markers per LG. Genetic inheritance and QTL analyses indicated that each of the PMR, SS, and FS traits is controlled by an incompletely dominant effect of major QTLs designated as pmr2.1, ss2.1, and fsi3.1, respectively. The pmr2.1, detected on chromosome 2 (Chr02), explained 80.0% of the phenotypic variation (LOD = 30.76). This QTL was flanked by two CAPS markers, wsb2-24 (4.00 cM) and wsb2-39 (13.97 cM). The ss2.1, located close to pmr2.1 and CAPS marker wsb2-13 (1.00 cM) on Chr02, explained 92.3% of the phenotypic variation (LOD = 68.78). The fsi3.1, detected on Chr03, explained 79.7% of the phenotypic variation (LOD = 31.37) and was flanked by two CAPS, wsb3-24 (1.91 cM) and wsb3-9 (7.00 cM). Candidate gene-based CAPS markers were developed from the disease resistance and fruit shape gene homologs located on Chr.02 and Chr03 and were mapped on the intraspecific map. Colocalization of these markers with the major QTLs indicated that watermelon orthologs of a nucleotide-binding site-leucine-rich repeat class gene containing an RPW8 domain and a member of SUN containing the IQ67 domain are candidate genes for pmr2.1 and fsi3.1, respectively. The results presented herein provide useful information for marker-assisted breeding and gene cloning for PMR and fruit-related traits. PMID:26700647
Analysis of genetic stability at SSR loci during somatic embryogenesis in maritime pine (Pinus pinaster).

PubMed

Marum, Liliana; Rocheta, Margarida; Maroco, João; Oliveira, M Margarida; Miguel, Célia

2009-04-01

Somatic embryogenesis (SE) is a propagation tool of particular interest for accelerating the deployment of new high-performance planting stock in multivarietal forestry. However, genetic conformity in in vitro propagated plants should be assessed as early as possible, especially in long-living trees such as conifers. The main objective of this work was to study such conformity based on genetic stability at simple sequence repeat (SSR) loci during somatic embryogenesis in maritime pine (Pinus pinaster Ait.). Embryogenic cell lines (ECLs) subjected to tissue proliferation during 6, 14 or 22 months, as well as emblings regenerated from several ECLs, were analyzed. Genetic variation at seven SSR loci was detected in ECLs under proliferation conditions for all time points, and in 5 out of 52 emblings recovered from somatic embryos. Three of these five emblings showed an abnormal phenotype consisting mainly of plagiotropism and loss of apical dominance. Despite the variation found in somatic embryogenesis-derived plant material, no correlation was established between genetic stability at the analyzed loci and abnormal embling phenotype, present in 64% of the emblings. The use of microsatellites in this work was efficient for monitoring mutation events during the somatic embryogenesis in P. pinaster. These molecular markers should be useful in the implementation of new breeding and deployment strategies for improved trees using SE.
Molecular Linkage Mapping and Marker-Trait Associations with NlRPT, a Downy Mildew Resistance Gene in Nicotiana langsdorffii

PubMed Central

Zhang, Shouan; Gao, Muqiang; Zaitlin, David

2012-01-01

Nicotiana langsdorffii is one of two species of Nicotiana known to express an incompatible interaction with the oomycete Peronospora tabacina, the causal agent of tobacco blue mold disease. We previously showed that incompatibility is due to the hypersensitive response (HR), and plants expressing the HR are resistant to P. tabacina at all stages of growth. Resistance is due to a single dominant gene in N. langsdorffii accession S-4-4 that we have named NlRPT. In further characterizing this unique host-pathogen interaction, NlRPT has been placed on a preliminary genetic map of the N. langsdorffii genome. Allelic scores for five classes of DNA markers were determined for 90 progeny of a “modified backcross” involving two N. langsdorffii inbred lines and the related species N. forgetiana. All markers had an expected segregation ratio of 1:1, and were scored in a common format. The map was constructed with JoinMap 3.0, and loci showing excessive transmission distortion were removed. The linkage map consists of 266 molecular marker loci defined by 217 amplified fragment length polymorphisms (AFLPs), 26 simple-sequence repeats (SSRs), 10 conserved orthologous sequence markers, nine inter-simple sequence repeat markers, and four target region amplification polymorphism markers arranged in 12 linkage groups with a combined length of 1062 cM. NlRPT is located on linkage group three, flanked by four AFLP markers and one SSR. Regions of skewed segregation were detected on LGs 1, 5, and 9. Markers developed for N. langsdorffii are potentially useful genetic tools for other species in Nicotiana section Alatae, as well as in N. benthamiana. We also investigated whether AFLPs could be used to infer genetic relationships within N. langsdorffii and related species from section Alatae. A phenetic analysis of the AFLP data showed that there are two main lineages within N. langsdorffii, and that both contain populations expressing dominant resistance to P. tabacina. PMID:22936937
Developmental and internal validation of a novel 13 loci STR multiplex method for Cannabis sativa DNA profiling.

PubMed

Houston, Rachel; Birck, Matthew; Hughes-Stamm, Sheree; Gangitano, David

2017-05-01

Marijuana (Cannabis sativa L.) is a plant cultivated and trafficked worldwide as a source of fiber (hemp), medicine, and intoxicant. The development of a validated method using molecular techniques such as short tandem repeats (STRs) could serve as an intelligence tool to link multiple cases by means of genetic individualization or association of cannabis samples. For this purpose, a 13 loci STR multiplex method was developed, optimized, and validated according to relevant ISFG and SWGDAM guidelines. The STR multiplex consists of 13 previously described C. sativa STR loci: ANUCS501, 9269, 4910, 5159, ANUCS305, 9043, B05, 1528, 3735, CS1, D02, C11, and H06. A sequenced allelic ladder consisting of 56 alleles was designed to accurately genotype 101 C. sativa samples from three seizures provided by a U.S. Customs and Border Protection crime lab. Using an optimal range of DNA (0.5-1.0ng), validation studies revealed well-balanced electropherograms (inter-locus balance range: 0.500-1.296), relatively balanced heterozygous peaks (mean peak height ratio of 0.83 across all loci) with minimal artifacts and stutter ratio (mean stutter of 0.021 across all loci). This multi-locus system is relatively sensitive (0.13ng of template DNA) with a combined power of discrimination of 1 in 55 million. The 13 STR panel was found to be species specific for C. sativa; however, non-specific peaks were produced with Humulus lupulus. The results of this research demonstrate the robustness and applicability of this 13 loci STR system for forensic DNA profiling of marijuana samples. Copyright © 2017 Elsevier B.V. All rights reserved.
[EST-SSR identification, markers development of Ligusticum chuanxiong based on Ligusticum chuanxiong transcriptome sequences].

PubMed

Yuan, Can; Peng, Fang; Yang, Ze-Mao; Zhong, Wen-Juan; Mou, Fang-Sheng; Gong, Yi-Yun; Ji, Pei-Cheng; Pu, De-Qiang; Huang, Hai-Yan; Yang, Xiao; Zhang, Chao

2017-09-01

Ligusticum chuanxiong is a well-known traditional Chinese medicine plant. The study on its molecular markers development and germplasm resources is very important. In this study, we obtained 24 422 unigenes by assembling transcriptome sequencing reads of L. chuanxiong root. EST-SSR was detected and 4 073 SSR loci were identified. EST-SSR distribution and characteristic analysis results showed that the mono-nucleotide repeats were the main repeat types, accounting for 41.0%. In addition, the sequences containing SSR were functionally annotated in Gene Ontology (GO) and KEGG pathway and were assigned to 49 GO categories, 242 KEGG pathways, among them 2 201 sequences were annotated against Nr database. By validating 235 EST-SSRs,74 primer pairs were ultimately proved to have high quality amplification. Subsequently, genetic diversity analysis, UPGMA cluster analysis, PCoA analysis and population structure analysis of 34 L. chuanxiong germplasm resources were carried out with 74 primer pairs. In both UPGMA tree and PCoA results, L. chuanxiong resources were clustered into two groups, which are believed to be partial related to their geographical distribution. In this study, EST-SSRs in L. chuanxiong was firstly identified, and newly developed molecular markers would contribute significantly to further genetic diversity study, the purity detection, gene mapping, and molecular breeding. Copyright© by the Chinese Pharmaceutical Association.
Construction of an Integrated High Density Simple Sequence Repeat Linkage Map in Cultivated Strawberry (Fragaria × ananassa) and its Applicability

PubMed Central

Isobe, Sachiko N.; Hirakawa, Hideki; Sato, Shusei; Maeda, Fumi; Ishikawa, Masami; Mori, Toshiki; Yamamoto, Yuko; Shirasawa, Kenta; Kimura, Mitsuhiro; Fukami, Masanobu; Hashizume, Fujio; Tsuji, Tomoko; Sasamoto, Shigemi; Kato, Midori; Nanri, Keiko; Tsuruoka, Hisano; Minami, Chiharu; Takahashi, Chika; Wada, Tsuyuko; Ono, Akiko; Kawashima, Kumiko; Nakazaki, Naomi; Kishida, Yoshie; Kohara, Mitsuyo; Nakayama, Shinobu; Yamada, Manabu; Fujishiro, Tsunakazu; Watanabe, Akiko; Tabata, Satoshi

2013-01-01

The cultivated strawberry (Fragaria× ananassa) is an octoploid (2n = 8x = 56) of the Rosaceae family whose genomic architecture is still controversial. Several recent studies support the AAA′A′BBB′B′ model, but its complexity has hindered genetic and genomic analysis of this important crop. To overcome this difficulty and to assist genome-wide analysis of F. × ananassa, we constructed an integrated linkage map by organizing a total of 4474 of simple sequence repeat (SSR) markers collected from published Fragaria sequences, including 3746 SSR markers [Fragaria vesca expressed sequence tag (EST)-derived SSR markers] derived from F. vesca ESTs, 603 markers (F. × ananassa EST-derived SSR markers) from F. × ananassa ESTs, and 125 markers (F. × ananassa transcriptome-derived SSR markers) from F. × ananassa transcripts. Along with the previously published SSR markers, these markers were mapped onto five parent-specific linkage maps derived from three mapping populations, which were then assembled into an integrated linkage map. The constructed map consists of 1856 loci in 28 linkage groups (LGs) that total 2364.1 cM in length. Macrosynteny at the chromosome level was observed between the LGs of F. × ananassa and the genome of F. vesca. Variety distinction on 129 F. × ananassa lines was demonstrated using 45 selected SSR markers. PMID:23248204

[Copy number variation of trinucleotide repeat in dynamic mutation sites of autosomal dominant cerebellar ataxias related genes].

PubMed

Chen, Pu; Ma, Mingyi; Shang, Huifang; Su, Dan; Zhang, Sizhong; Yang, Yuan

2009-12-01

To standardize the experimental procedure of the gene test for autosomal dominant cerebellar ataxias (ADCA), and provide the basis for quantitative criteria of the dynamic mutation of spinocerebellar ataxia (SCA) genes in Chinese population. Genotyping of the dynamic mutation loci of the SCA1, SCA2, SCA3, SCA6 and SCA7 genes was performed, using florescence PCR-capillary electrophoresis followed by DNA sequencing, to investigate the variation range of copy number of CAG tandem repeat of the genes in 263 probands of ADCA pedigrees and 261 non-related normal controls. Based on the sequencing result, the bias of the CAG copy number estimation using capillary electrophoresis with different DNA controls was compared to analyze the technical detailes of the electrophresis method in testing the dynamic mutation sites. PCR products containing dynamic mutation loci of the SCA genes showed significantly higher mobility than that of molecular weigh marker with relatively balanced GC content. This was particularly obvious in the SCA2, SCA 6 and SCA7 genes whereas the deviation of copy number could be corrected to +/-1 when known CAG copy number fragments were used as controls. The mobility of PCR products was primarily related to the copy number of CAG repeat when the fragments contained normal CAG repeat. In the 263 ADCA pedigrees, 6 (2.28%) carried SCA1 gene mutation, 8 (3.04%) had SCA2 mutation and 81 (30.80%) harbored SCA3 mutation. The gene mutation of SCA6 and SCA7 was not found. The normal variation range of the CAG repeat was 17-36 copies in SCA1 gene, 13-30 copies in SCA2, 14-39 copies in SCA3, 6-16 copies in SCA6 and 6-13 copies in SCA7. The heterozygosity was 76.1%, 17.7%, 74.4%, 72.1% and 41.3%, respectively. The mutation range of the CAG repeat was 49-56 copies in SCA1 gene, 36-41 copies in SCA2, 59-81 copies in SCA3. Neither homozygous mutation of an SCA gene nor double heterozygous mutation of the SCA genes was observed in the study. The copy number of the CAG repeat in SCA genes could be calculated accurately based on the result of florescence PCR-capillary electrophoresis when limited amount of known repeat copy number controls were used. Our result supported that the notion that SCA3 gene mutation was the most common cause for ADCA, and the obtained data would be helpful for establishing quantitative criteria of the dynamic mutation of the SCA genes in Chinese.
Testing genotyping strategies for ultra-deep sequencing of a co-amplifying gene family: MHC class I in a passerine bird.

PubMed

Biedrzycka, Aleksandra; Sebastian, Alvaro; Migalska, Magdalena; Westerdahl, Helena; Radwan, Jacek

2017-07-01

Characterization of highly duplicated genes, such as genes of the major histocompatibility complex (MHC), where multiple loci often co-amplify, has until recently been hindered by insufficient read depths per amplicon. Here, we used ultra-deep Illumina sequencing to resolve genotypes at exon 3 of MHC class I genes in the sedge warbler (Acrocephalus schoenobaenus). We sequenced 24 individuals in two replicates and used this data, as well as a simulated data set, to test the effect of amplicon coverage (range: 500-20 000 reads per amplicon) on the repeatability of genotyping using four different genotyping approaches. A third replicate employed unique barcoding to assess the extent of tag jumping, that is swapping of individual tag identifiers, which may confound genotyping. The reliability of MHC genotyping increased with coverage and approached or exceeded 90% within-method repeatability of allele calling at coverages of >5000 reads per amplicon. We found generally high agreement between genotyping methods, especially at high coverages. High reliability of the tested genotyping approaches was further supported by our analysis of the simulated data set, although the genotyping approach relying primarily on replication of variants in independent amplicons proved sensitive to repeatable errors. According to the most repeatable genotyping method, the number of co-amplifying variants per individual ranged from 19 to 42. Tag jumping was detectable, but at such low frequencies that it did not affect the reliability of genotyping. We thus demonstrate that gene families with many co-amplifying genes can be reliably genotyped using HTS, provided that there is sufficient per amplicon coverage. © 2016 John Wiley & Sons Ltd.
Differential expression of a WRKY gene between wild and cultivated soybeans correlates to seed size.

PubMed

Gu, Yongzhe; Li, Wei; Jiang, Hongwei; Wang, Yan; Gao, Huihui; Liu, Miao; Chen, Qingshan; Lai, Yongcai; He, Chaoying

2017-05-17

Soybean (Glycine max) probably originated from the wild soybean (Glycine soja). Glycine max has a significantly larger seed size, but the underlying genomic changes are largely unknown. Candidate regulatory genes were preliminarily proposed by data co-localizing RNA sequencing with the quantitative loci (QTLs) for seed size. The soybean gene locus SoyWRKY15a and its orthologous genes from G. max (GmWRKY15a) and G. soja (GsWRKY15a) were analyzed in detail. The coding sequences were nearly identical between the two orthologs, but GmWRKY15a was significantly more highly expressed than GsWRKY15a. Four haplotypes (H1-H4) were found and they varied in the size of a CT-core microsatellite locus in the 5'-untranslated region of this gene. H1 (with six CT-repeats) was the only allelic version found in G. max, while H3 (with five CT-repeats) was the dominant G. soja allele. Differential expression of this gene in soybean pods was correlated with CT-repeat variation, and manipulation of the CT copy number altered the reporter gene expression, suggesting a regulatory role for the simple sequence repeats. Seed weight of wild soybeans harboring H1 was significantly greater than that of soybeans having haplotypes H2, H3, or H4, and seed weight was correlated with gene expression, suggesting the influence of GsWRKY15a in controlling seed size. However, the seed size might be refractory to increased SoyWRKY15a expression in cultivated soybeans. The evolutionary significance of SoyWRKY15a variation in soybean seed domestication is discussed. © The Author 2017. Published by Oxford University Press on behalf of the Society for Experimental Biology.
Genetic linkage map and QTL identification for adventitious rooting traits in red gum eucalypts.

PubMed

Sumathi, Murugan; Bachpai, Vijaya Kumar Waman; Mayavel, A; Dasgupta, Modhumita Ghosh; Nagarajan, Binai; Rajasugunasekar, D; Sivakumar, Veerasamy; Yasodha, Ramasamy

2018-05-01

The eucalypt species, Eucalyptus tereticornis and Eucalyptus camaldulensis , show tolerance to drought and salinity conditions, respectively, and are widely cultivated in arid and semiarid regions of tropical countries. In this study, genetic linkage map was developed for interspecific cross E. tereticornis × E. camaldulensis using pseudo-testcross strategy with simple sequence repeats (SSRs), intersimple sequence repeats (ISSRs), and sequence-related amplified polymorphism (SRAP) markers. The consensus genetic map comprised totally 283 markers with 84 SSRs, 94 ISSRs, and 105 SRAP markers on 11 linkage groups spanning 1163.4 cM genetic distance. Blasting the SSR sequences against E. grandis sequences allowed an alignment of 64% and the average ratio of genetic-to-physical distance was 1.7 Mbp/cM, which strengths the evidence that high amount of synteny and colinearity exists among eucalypts genome. Blast searches also revealed that 37% of SSRs had homologies with genes, which could potentially be used in the variety of downstream applications including candidate gene polymorphism. Quantitative trait loci (QTL) analysis for adventitious rooting traits revealed six QTL for rooting percent and root length on five chromosomes with interval and composite interval mapping. All the QTL explained 12.0-14.7% of the phenotypic variance, showing the involvement of major effect QTL on adventitious rooting traits. Increasing the density of markers would facilitate the detection of more number of small-effect QTL and also underpinning the genes involved in rooting process.
Phylogenic analysis and forensic genetic characterization of Chinese Uyghur group via autosomal multi STR markers.

PubMed

Jin, Xiaoye; Wei, Yuanyuan; Chen, Jiangang; Kong, Tingting; Mu, Yuling; Guo, Yuxin; Dong, Qian; Xie, Tong; Meng, Haotian; Zhang, Meng; Li, Jianfei; Li, Xiaopeng; Zhu, Bofeng

2017-09-26

We investigated the allelic frequencies and forensic descriptive parameters of 23 autosomal short tandem repeat loci in a randomly selected sample of 1218 unrelated healthy Uyghur individuals residing in the Xinjiang Uyghur Autonomous Region, northwest China. A total of 281 alleles at these loci were identified and their corresponding allelic frequencies ranged from 0.0004 to 0.5390. The combined match probability and combined probability of exclusion of all loci were 5.192 × 10 -29 and 0.9999999996594, respectively. The results of population genetic study manifested that Uyghur had close relationships with those contiguous populations, such as Xibe and Hui groups. In a word, these autosomal short tandem repeat loci were highly informative in Uyghur group and the multiplex PCR system could be used as a valuable tool for forensic caseworks and population genetic analysis.
Comparison of intraspecific, interspecific and intergeneric chloroplast diversity in Cycads

PubMed Central

Jiang, Guo-Feng; Hinsinger, Damien Daniel; Strijk, Joeri Sergej

2016-01-01

Cycads are among the most threatened plant species. Increasing the availability of genomic information by adding whole chloroplast data is a fundamental step in supporting phylogenetic studies and conservation efforts. Here, we assemble a dataset encompassing three taxonomic levels in cycads, including ten genera, three species in the genus Cycas and two individuals of C. debaoensis. Repeated sequences, SSRs and variations of the chloroplast were analyzed at the intraspecific, interspecific and intergeneric scale, and using our sequence data, we reconstruct a phylogenomic tree for cycads. The chloroplast was 162,094 bp in length, with 133 genes annotated, including 87 protein-coding, 37 tRNA and 8 rRNA genes. We found 7 repeated sequences and 39 SSRs. Seven loci showed promising levels of variations for application in DNA-barcoding. The chloroplast phylogeny confirmed the division of Cycadales in two suborders, each of them being monophyletic, revealing a contradiction with the current family circumscription and its evolution. Finally, 10 intraspecific SNPs were found. Our results showed that despite the extremely restricted distribution range of C. debaoensis, using complete chloroplast data is useful not only in intraspecific studies, but also to improve our understanding of cycad evolution and in defining conservation strategies for this emblematic group. PMID:27558458
Cocaine dynamically regulates heterochromatin and repetitive element unsilencing in nucleus accumbens.

PubMed

Maze, Ian; Feng, Jian; Wilkinson, Matthew B; Sun, HaoSheng; Shen, Li; Nestler, Eric J

2011-02-15

Repeated cocaine exposure induces persistent alterations in genome-wide transcriptional regulatory networks, chromatin remodeling activity and, ultimately, gene expression profiles in the brain's reward circuitry. Virtually all previous investigations have centered on drug-mediated effects occurring throughout active euchromatic regions of the genome, with very little known concerning the impact of cocaine exposure on the regulation and maintenance of heterochromatin in adult brain. Here, we report that cocaine dramatically and dynamically alters heterochromatic histone H3 lysine 9 trimethylation (H3K9me3) in the nucleus accumbens (NAc), a key brain reward region. Furthermore, we demonstrate that repeated cocaine exposure causes persistent decreases in heterochromatization in this brain region, suggesting a potential role for heterochromatic regulation in the long-term actions of cocaine. To identify precise genomic loci affected by these alterations, chromatin immunoprecipitation followed by massively parallel DNA sequencing (ChIP-Seq) was performed on NAc. ChIP-Seq analyses confirmed the existence of the H3K9me3 mark mainly within intergenic regions of the genome and identified specific patterns of cocaine-induced H3K9me3 regulation at repetitive genomic sequences. Cocaine-mediated decreases in H3K9me3 enrichment at specific genomic repeats [e.g., long interspersed nuclear element (LINE)-1 repeats] were further confirmed by the increased expression of LINE-1 retrotransposon-associated repetitive elements in NAc. Such increases likely reflect global patterns of genomic destabilization in this brain region after repeated cocaine administration and open the door for future investigations into the epigenetic and genetic basis of drug addiction.
Evolutionary relationships among Pinus (Pinaceae) subsections inferred from multiple low-copy nuclear loci.

Treesearch

John Syring; Ann Willyard; Richard Cronn; Aaron Liston

2005-01-01

Sequence data from nrITS and cpDNA have failed to fully resolve phylogenetic relationships among Pinus species. Four low-copy nuclear genes, developed from the screening of 73 mapped conifer anchor loci, were sequenced from 12 species representing all subsections. Individual loci do not uniformly support either the nrITS or cpDNA hypotheses and in...
The complete chloroplast genome sequence of Mahonia bealei (Berberidaceae) reveals a significant expansion of the inverted repeat and phylogenetic relationship with other angiosperms.

PubMed

Ma, Ji; Yang, Bingxian; Zhu, Wei; Sun, Lianli; Tian, Jingkui; Wang, Xumin

2013-10-10

Mahonia bealei (Berberidaceae) is a frequently-used traditional Chinese medicinal plant with efficient anti-inflammatory ability. This plant is one of the sources of berberine, a new cholesterol-lowering drug with anti-diabetic activity. We have sequenced the complete nucleotide sequence of the chloroplast (cp) genome of M. bealei. The complete cp genome of M. bealei is 164,792 bp in length, and has a typical structure with large (LSC 73,052 bp) and small (SSC 18,591 bp) single-copy regions separated by a pair of inverted repeats (IRs 36,501 bp) of large size. The Mahonia cp genome contains 111 unique genes and 39 genes are duplicated in the IR regions. The gene order and content of M. bealei are almost unarranged which is consistent with the hypothesis that large IRs stabilize cp genome and reduce gene loss-and-gain probabilities during evolutionary process. A large IR expansion of over 12 kb has occurred in M. bealei, 15 genes (rps19, rpl22, rps3, rpl16, rpl14, rps8, infA, rpl36, rps11, petD, petB, psbH, psbN, psbT and psbB) have expanded to have an additional copy in the IRs. The IR expansion rearrangement occurred via a double-strand DNA break and subsequence repair, which is different from the ordinary gene conversion mechanism. Repeat analysis identified 39 direct/inverted repeats 30 bp or longer with a sequence identity ≥ 90%. Analysis also revealed 75 simple sequence repeat (SSR) loci and almost all are composed of A or T, contributing to a distinct bias in base composition. Comparison of protein-coding sequences with ESTs reveals 9 putative RNA edits and 5 of them resulted in non-synonymous modifications in rpoC1, rps2, rps19 and ycf1. Phylogenetic analysis using maximum parsimony (MP) and maximum likelihood (ML) was performed on a dataset composed of 65 protein-coding genes from 25 taxa, which yields an identical tree topology as previous plastid-based trees, and provides strong support for the sister relationship between Ranunculaceae and Berberidaceae. Molecular dating analyses suggest that Ranunculaceae and Berberidaceae diverged between 90 and 84 mya, which is congruent with the fossil records and with recent estimates of the divergence time of these two taxa. © 2013.
Genetic polymorphisms of 20 autosomal STR loci in the Vietnamese population from Yunnan Province, Southwest China.

PubMed

Zhang, Xiufeng; Hu, Liping; Du, Lei; Nie, Aiting; Rao, Min; Pang, Jing Bo; Nie, Shengjie

2017-05-01

The genetic polymorphisms of 20 autosomal short tandem repeat (STR) loci included in the PowerPlex® 21 kit were evaluated in 522 healthy unrelated Vietnamese from Yunnan, China. All of the loci reached the Hardy-Weinberg equilibrium. These loci were examined to determine allele frequencies and forensic statistical parameters. The combined discrimination power and probability of excluding paternity of the 20 STR loci were 0.999999999999999999999991 26 and 0.999999975, respectively. Results suggested that the 20 STR loci are highly polymorphic, which is suitable for forensic personal identification and paternity testing.
The Complete Plastome Sequences of Four Orchid Species: Insights into the Evolution of the Orchidaceae and the Utility of Plastomic Mutational Hotspots

PubMed Central

Niu, Zhitao; Xue, Qingyun; Zhu, Shuying; Sun, Jing; Liu, Wei; Ding, Xiaoyu

2017-01-01

Orchidaceae (orchids) is the largest family in the monocots, including about 25,000 species in 880 genera and five subfamilies. Many orchids are highly valued for their beautiful and long-lasting flowers. However, the phylogenetic relationships among the five orchid subfamilies remain unresolved. The major dispute centers on whether the three one-stamened subfamilies, Epidendroideae, Orchidoideae, and Vanilloideae, are monophyletic or paraphyletic. Moreover, structural changes in the plastid genome (plastome) and the effective genetic loci at the species-level phylogenetics of orchids have rarely been documented. In this study, we compared 53 orchid plastomes, including four newly sequenced ones, that represent four remote genera: Dendrobium, Goodyera, Paphiopedilum, and Vanilla. These differ from one another not only in their lengths of inverted repeats and small single copy regions but also in their retention of ndh genes. Comparative analyses of the plastomes revealed that the expansion of inverted repeats in Paphiopedilum and Vanilla is associated with a loss of ndh genes. In orchid plastomes, mutational hotspots are genus specific. After having carefully examined the data, we propose that the three loci 5′trnK-rps16, trnS-trnG, and rps16-trnQ might be powerful markers for genera within Epidendroideae, and clpP-psbB and rps16-trnQ might be markers for genera within Cypripedioideae. After analyses of a partitioned dataset, we found that our plastid phylogenomic trees were congruent in a topology where two one-stamened subfamilies (i.e., Epidendroideae and Orchidoideae) were sisters to a multi-stamened subfamily (i.e., Cypripedioideae) rather than to the other one-stamened subfamily (Vanilloideae), suggesting that the living one-stamened orchids are paraphyletic. PMID:28515737
Complete chloroplast DNA sequence from a Korean endemic genus, Megaleranthis saniculifolia, and its evolutionary implications.

PubMed

Kim, Young-Kyu; Park, Chong-wook; Kim, Ki-Joong

2009-03-31

The chloroplast DNA sequences of Megaleranthis saniculifolia, an endemic and monotypic endangered plant species, were completed in this study (GenBank FJ597983). The genome is 159,924 bp in length. It harbors a pair of IR regions consisting of 26,608 bp each. The lengths of the LSC and SSC regions are 88,326 bp and 18,382 bp, respectively. The structural organizations, gene and intron contents, gene orders, AT contents, codon usages, and transcription units of the Megaleranthis chloroplast genome are similar to those of typical land plant cp DNAs. However, the detailed features of Megaleranthis chloroplast genomes are substantially different from that of Ranunculus, which belongs to the same family, the Ranunculaceae. First, the Megaleranthis cp DNA was 4,797 bp longer than that of Ranunculus due to an expanded IR region into the SSC region and duplicated sequence elements in several spacer regions of the Megaleranthis cp genome. Second, the chloroplast genomes of Megaleranthis and Ranunculus evidence 5.6% sequence divergence in the coding regions, 8.9% sequence divergence in the intron regions, and 18.7% sequence divergence in the intergenic spacer regions, respectively. In both the coding and noncoding regions, average nucleotide substitution rates differed markedly, depending on the genome position. Our data strongly implicate the positional effects of the evolutionary modes of chloroplast genes. The genes evidencing higher levels of base substitutions also have higher incidences of indel mutations and low Ka/Ks ratios. A total of 54 simple sequence repeat loci were identified from the Megaleranthis cp genome. The existence of rich cp SSR loci in the Megaleranthis cp genome provides a rare opportunity to study the population genetic structures of this endangered species. Our phylogenetic trees based on the two independent markers, the nuclear ITS and chloroplast matK sequences, strongly support the inclusion of the Megaleranthis to the Trollius. Therefore, our molecular trees support Ohwi's original treatment of Megaleranthis saniculiforia to Trollius chosenensis Ohwi.
Dynamic of mutational events in variable number tandem repeats of Escherichia coli O157:H7.

PubMed

Bustamante, A V; Sanso, A M; Segura, D O; Parma, A E; Lucchesi, P M A

2013-01-01

VNTRs regions have been successfully used for bacterial subtyping; however, the hypervariability in VNTR loci is problematic when trying to predict the relationships among isolates. Since few studies have examined the mutation rate of these markers, our aim was to estimate mutation rates of VNTRs specific for verotoxigenic E. coli O157:H7. The knowledge of VNTR mutational rates and the factors affecting them would make MLVA more effective for epidemiological or microbial forensic investigations. For this purpose, we analyzed nine loci performing parallel, serial passage experiments (PSPEs) on 9 O157:H7 strains. The combined 9 PSPE population rates for the 8 mutating loci ranged from 4.4 × 10(-05) to 1.8 × 10(-03) mutations/generation, and the combined 8-loci mutation rate was of 2.5 × 10(-03) mutations/generation. Mutations involved complete repeat units, with only one point mutation detected. A similar proportion between single and multiple repeat changes was detected. Of the 56 repeat mutations, 59% were insertions and 41% were deletions, and 72% of the mutation events corresponded to O157-10 locus. For alleles with up to 13 UR, a constant and low mutation rate was observed; meanwhile longer alleles were associated with higher and variable mutation rates. Our results are useful to interpret data from microevolution and population epidemiology studies and particularly point out that the inclusion or not of O157-10 locus or, alternatively, a differential weighting data according to the mutation rates of loci must be evaluated in relation with the objectives of the proposed study.
Construction of a high-density linkage map and mapping quantitative trait loci for somatic embryogenesis using leaf petioles as explants in upland cotton (Gossypium hirsutum L.).

PubMed

Xu, Zhenzhen; Zhang, Chaojun; Ge, Xiaoyang; Wang, Ni; Zhou, Kehai; Yang, Xiaojie; Wu, Zhixia; Zhang, Xueyan; Liu, Chuanliang; Yang, Zuoren; Li, Changfeng; Liu, Kun; Yang, Zhaoen; Qian, Yuyuan; Li, Fuguang

2015-07-01

The first high-density linkage map was constructed to identify quantitative trait loci (QTLs) for somatic embryogenesis (SE) in cotton ( Gossypium hirsutum L.) using leaf petioles as explants. Cotton transformation is highly limited by only a few regenerable genotypes and the lack of understanding of the genetic and molecular basis of somatic embryogenesis (SE) in cotton (Gossypium hirsutum L.). To construct a more saturated linkage map and further identify quantitative trait loci (QTLs) for SE using leaf petioles as explants, a high embryogenesis frequency line (W10) from the commercial Chinese cotton cultivar CRI24 was crossed with TM-1, a genetic standard upland cotton with no embryogenesis frequency. The genetic map spanned 2300.41 cM in genetic distance and contained 411 polymorphic simple sequence repeat (SSR) loci. Of the 411 mapped loci, 25 were developed from unigenes identified for SE in our previous study. Six QTLs for SE were detected by composite interval mapping method, each explaining 6.88-37.07% of the phenotypic variance. Single marker analysis was also performed to verify the reliability of QTLs detection, and the SSR markers NAU3325 and DPL0209 were detected by the two methods. Further studies on the relatively stable and anchoring QTLs/markers for SE in an advanced population of W10 × TM-1 and other cross combinations with different SE abilities may shed light on the genetic and molecular mechanism of SE in cotton.
Pantoea ananatis Genetic Diversity Analysis Reveals Limited Genomic Diversity as Well as Accessory Genes Correlated with Onion Pathogenicity.

PubMed

Stice, Shaun P; Stumpf, Spencer D; Gitaitis, Ron D; Kvitko, Brian H; Dutta, Bhabesh

2018-01-01

Pantoea ananatis is a member of the family Enterobacteriaceae and an enigmatic plant pathogen with a broad host range. Although P. ananatis strains can be aggressive on onion causing foliar necrosis and onion center rot, previous genomic analysis has shown that P. ananatis lacks the primary virulence secretion systems associated with other plant pathogens. We assessed a collection of fifty P. ananatis strains collected from Georgia over three decades to determine genetic factors that correlated with onion pathogenic potential. Previous genetic analysis studies have compared strains isolated from different hosts with varying diseases potential and isolation sources. Strains varied greatly in their pathogenic potential and aggressiveness on different cultivated Allium species like onion, leek, shallot, and chive. Using multi-locus sequence analysis (MLSA) and repetitive extragenic palindrome repeat (rep)-PCR techniques, we did not observe any correlation between onion pathogenic potential and genetic diversity among strains. Whole genome sequencing and pan-genomic analysis of a sub-set of 10 strains aided in the identification of a novel series of genetic regions, likely plasmid borne, and correlating with onion pathogenicity observed on single contigs of the genetic assemblies. We named these loci Onion Virulence Regions (OVR) A-D. The OVR loci contain genes involved in redox regulation as well as pectate lyase and rhamnogalacturonase genes. Previous studies have not identified distinct genetic loci or plasmids correlating with onion foliar pathogenicity or pathogenicity on a single host pathosystem. The lack of focus on a single host system for this phytopathgenic disease necessitates the pan-genomic analysis performed in this study.
Pantoea ananatis Genetic Diversity Analysis Reveals Limited Genomic Diversity as Well as Accessory Genes Correlated with Onion Pathogenicity

PubMed Central

Stice, Shaun P.; Stumpf, Spencer D.; Gitaitis, Ron D.; Kvitko, Brian H.; Dutta, Bhabesh

2018-01-01

Pantoea ananatis is a member of the family Enterobacteriaceae and an enigmatic plant pathogen with a broad host range. Although P. ananatis strains can be aggressive on onion causing foliar necrosis and onion center rot, previous genomic analysis has shown that P. ananatis lacks the primary virulence secretion systems associated with other plant pathogens. We assessed a collection of fifty P. ananatis strains collected from Georgia over three decades to determine genetic factors that correlated with onion pathogenic potential. Previous genetic analysis studies have compared strains isolated from different hosts with varying diseases potential and isolation sources. Strains varied greatly in their pathogenic potential and aggressiveness on different cultivated Allium species like onion, leek, shallot, and chive. Using multi-locus sequence analysis (MLSA) and repetitive extragenic palindrome repeat (rep)-PCR techniques, we did not observe any correlation between onion pathogenic potential and genetic diversity among strains. Whole genome sequencing and pan-genomic analysis of a sub-set of 10 strains aided in the identification of a novel series of genetic regions, likely plasmid borne, and correlating with onion pathogenicity observed on single contigs of the genetic assemblies. We named these loci Onion Virulence Regions (OVR) A-D. The OVR loci contain genes involved in redox regulation as well as pectate lyase and rhamnogalacturonase genes. Previous studies have not identified distinct genetic loci or plasmids correlating with onion foliar pathogenicity or pathogenicity on a single host pathosystem. The lack of focus on a single host system for this phytopathgenic disease necessitates the pan-genomic analysis performed in this study. PMID:29491851
Microsatellite DNA fingerprinting, differentiation, and genetic relationships of clones, cultivars, and varieties of six poplar species from three sections of the genus Populus.

PubMed

Rahman, Muhammad H; Rajora, Om P

2002-12-01

Accurate identification of Populus clones and cultivars is essential for effective selection, breeding, and genetic resource management programs. The unit of cultivation and breeding in poplars is a clone, and individual cultivars are normally represented by a single clone. Microsatellite DNA markers of 10 simple sequence repeat loci were used for genetic fingerprinting and differentiation of 96 clones/cultivars and varieties belonging to six Populus species (P. deltoides, P. nigra, P. balsamifera, P. trichocarpa, P. grandidentata, and P maximowiczii) from three sections of the genus. All 96 clones/cultivars could be uniquely fingerprinted based on their single- or multilocus microsatellite genotypes. The five P. grandidentata clones could be differentiated based on their single-locus genotypes, while six clones of P. trichocarpa and 11 clones of P. maximowiczii could be identified by their two-locus genotypes. Twenty clones of P. deltoides and 25 clones of P. nigra could be differentiated by their multilocus genotypes employing three loci, and 29 clones of P. balsamifera required the use of multilocus genotypes at five loci for their genetic fingerprinting and differentiation. The loci PTR3, PTR5, and PTR7 were found to be the most informative for genetic fingerprinting and differentiation of the clones. The mean number of alleles per locus ranged from 2.9 in P. trichocarpa or P. grandidentata to 6.0 in P. balsamifera and 11.2 in 96 clones of the six species. The mean number of observed genotypes per locus ranged from 2.4 in P. grandidentata to 7.4 in P. balsamifera and 19.6 in 96 clones of the six species. The mean number of unique genotypes per locus ranged from 1.3 in P. grandidentata to 3.9 in P. deltoides and 8.8 in 96 clones of the six species. The power of discrimination of the microsatellite DNA markers in the 96 clones ranged from 0.726 for PTR4 to 0.939 for PTR7, with a mean of 0.832 over the 10 simple sequence repeat loci. Clones/cultivars from the same species showed higher microsatellite DNA similarities than the clones from the different species. A UPGMA cluster plot constructed from the microsatellite genotypic similarities separated the 96 clones into six major groups corresponding to their species. Populus nigra var. italica clones were genetically differentiated from the P. nigra var. nigra clones. Microsatellite DNA markers could be useful in genetic fingerprinting, identification, classification, certification, and registration of clones, clultivars, and varieties as well as genetic resource management and protection of plant breeders' rights in Populus.
Microsatellite DNA capture from enriched libraries.

PubMed

Gonzalez, Elena G; Zardoya, Rafael

2013-01-01

Microsatellites are DNA sequences of tandem repeats of one to six nucleotides, which are highly polymorphic, and thus the molecular markers of choice in many kinship, population genetic, and conservation studies. There have been significant technical improvements since the early methods for microsatellite isolation were developed, and today the most common procedures take advantage of the hybrid capture methods of enriched-targeted microsatellite DNA. Furthermore, recent advents in sequencing technologies (i.e., next-generation sequencing, NGS) have fostered the mining of microsatellite markers in non-model organisms, affording a cost-effective way of obtaining a large amount of sequence data potentially useful for loci characterization. The rapid improvements of NGS platforms together with the increase in available microsatellite information open new avenues to the understanding of the evolutionary forces that shape genetic structuring in wild populations. Here, we provide detailed methodological procedures for microsatellite isolation based on the screening of GT microsatellite-enriched libraries, either by cloning and Sanger sequencing of positive clones or by direct NGS. Guides for designing new species-specific primers and basic genotyping are also given.
Transcriptome Sequencing of Hevea brasiliensis for Development of Microsatellite Markers and Construction of a Genetic Linkage Map

PubMed Central

Triwitayakorn, Kanokporn; Chatkulkawin, Pornsupa; Kanjanawattanawong, Supanath; Sraphet, Supajit; Yoocha, Thippawan; Sangsrakru, Duangjai; Chanprasert, Juntima; Ngamphiw, Chumpol; Jomchai, Nukoon; Therawattanasuk, Kanikar; Tangphatsornruang, Sithichoke

2011-01-01

To obtain more information on the Hevea brasiliensis genome, we sequenced the transcriptome from the vegetative shoot apex yielding 2 311 497 reads. Clustering and assembly of the reads produced a total of 113 313 unique sequences, comprising 28 387 isotigs and 84 926 singletons. Also, 17 819 expressed sequence tag (EST)-simple sequence repeats (SSRs) were identified from the data set. To demonstrate the use of this EST resource for marker development, primers were designed for 430 of the EST-SSRs. Three hundred and twenty-three primer pairs were amplifiable in H. brasiliensis clones. Polymorphic information content values of selected 47 SSRs among 20 H. brasiliensis clones ranged from 0.13 to 0.71, with an average of 0.51. A dendrogram of genetic similarities between the 20 H. brasiliensis clones using these 47 EST-SSRs suggested two distinct groups that correlated well with clone pedigree. These novel EST-SSRs together with the published SSRs were used for the construction of an integrated parental linkage map of H. brasiliensis based on 81 lines of an F1 mapping population. The map consisted of 97 loci, consisting of 37 novel EST-SSRs and 60 published SSRs, distributed on 23 linkage groups and covered 842.9 cM with a mean interval of 11.9 cM and ∼4 loci per linkage group. Although the numbers of linkage groups exceed the haploid number (18), but with several common markers between homologous linkage groups with the previous map indicated that the F1 map in this study is appropriate for further study in marker-assisted selection. PMID:22086998
Characterization and Transferable Utility of Microsatellite Markers in the Wild and Cultivated Arachis Species.

PubMed

Huang, Li; Wu, Bei; Zhao, Jiaojiao; Li, Haitao; Chen, Weigang; Zheng, Yanli; Ren, Xiaoping; Chen, Yuning; Zhou, Xiaojing; Lei, Yong; Liao, Boshou; Jiang, Huifang

2016-01-01

Microsatellite or simple sequence repeat (SSR) is one of the most widely distributed molecular markers that have been widely utilized to assess genetic diversity and genetic mapping for important traits in plants. However, the understanding of microsatellite characteristics in Arachis species and the currently available amount of high-quality SSR markers remain limited. In this study, we identified 16,435 genome survey sequences SSRs (GSS-SSRs) and 40,199 expressed sequence tag SSRs (EST-SSRs) in Arachis hypogaea and its wild relative species using the publicly available sequence data. The GSS-SSRs had a density of 159.9-239.8 SSRs/Mb for wild Arachis and 1,015.8 SSR/Mb for cultivated Arachis, whereas the EST-SSRs had the density of 173.5-384.4 SSR/Mb and 250.9 SSRs/Mb for wild and cultivated Arachis, respectively. The trinucleotide SSRs were predominant across Arachis species, except that the dinucleotide accounted for most in A. hypogaea GSSs. From Arachis GSS-SSR and EST-SSR sequences, we developed 2,589 novel SSR markers that showed a high polymorphism in six diverse A. hypogaea accessions. A genetic linkage map that contained 540 novel SSR loci and 105 anchor SSR loci was constructed by case of a recombinant inbred lines F6 population. A subset of 82 randomly selected SSR markers were used to screen 39 wild and 22 cultivated Arachis accessions, which revealed a high transferability of the novel SSRs across Arachis species. Our results provided informative clues to investigate microsatellite patterns across A. hypogaea and its wild relative species and potentially facilitate the germplasm evaluation and gene mapping in Arachis species.

A high-density intraspecific SNP linkage map of pigeonpea (Cajanas cajan L. Millsp.)

PubMed Central

Mandal, Paritra; Bhutani, Shefali; Dutta, Sutapa; Kumawat, Giriraj; Singh, Bikram Pratap; Chaudhary, A. K.; Yadav, Rekha; Gaikwad, K.; Sevanthi, Amitha Mithra; Datta, Subhojit; Raje, Ranjeet S.; Sharma, Tilak R.; Singh, Nagendra Kumar

2017-01-01

Pigeonpea (Cajanus cajan (L.) Millsp.) is a major food legume cultivated in semi-arid tropical regions including the Indian subcontinent, Africa, and Southeast Asia. It is an important source of protein, minerals, and vitamins for nearly 20% of the world population. Due to high carbon sequestration and drought tolerance, pigeonpea is an important crop for the development of climate resilient agriculture and nutritional security. However, pigeonpea productivity has remained low for decades because of limited genetic and genomic resources, and sparse utilization of landraces and wild pigeonpea germplasm. Here, we present a dense intraspecific linkage map of pigeonpea comprising 932 markers that span a total adjusted map length of 1,411.83 cM. The consensus map is based on three different linkage maps that incorporate a large number of single nucleotide polymorphism (SNP) markers derived from next generation sequencing data, using Illumina GoldenGate bead arrays, and genotyping with restriction site associated DNA (RAD) sequencing. The genotyping-by-sequencing enhanced the marker density but was met with limited success due to lack of common markers across the genotypes of mapping population. The integrated map has 547 bead-array SNP, 319 RAD-SNP, and 65 simple sequence repeat (SSR) marker loci. We also show here correspondence between our linkage map and published genome pseudomolecules of pigeonpea. The availability of a high-density linkage map will help improve the anchoring of the pigeonpea genome to its chromosomes and the mapping of genes and quantitative trait loci associated with useful agronomic traits. PMID:28654689
RNA degradation and models for post-transcriptional gene-silencing.

PubMed

Meins, F

2000-06-01

Post-transcriptional gene silencing (PTGS) is a form of stable but potentially reversible epigenetic modification, which frequently occurs in transgenic plants. The interaction in trans of genes with similar transcribed sequences results in sequence-specific degradation of RNAs derived from the genes involved. Highly expressed single-copy loci, transcribed inverted repeats, and poorly transcribed complex loci can act as sources of signals that trigger PTGS. In some cases, mobile, sequence-specific silencing signals can move from cell to cell or even over long distances in the plant. Several current models hold that silencing signals are 'aberrant' RNAs (aRNA), which differ in some way from normal mRNAs. The most likely candidates are small antisense RNAs (asRNA) and double-stranded RNAs (dsRNA). Direct evidence that these or other aRNAs found in silent tissues can induce PTGS is still lacking. Most current models assume that silencing signals interact with target RNAs in a sequence-specific fashion. This results in degradation, usually in the cytoplasm, by exonucleolytic as well as endonucleolytic pathways, which are not necessarily PTGS-specific. Biochemical-switch models hold that the silent state is maintained by a positive auto-regulatory loop. One possibility is that concentrations of hypothetical silencing signals above a critical threshold trigger their own production by self-replication, by degradation of target RNAs, or by a combination of both mechanisms. These models can account for the stability, reversibility and multiplicity of silent states; the strong influence of transcription rate of target genes on the incidence and stability of silencing, and the amplification and systemic propagation of motile silencing signals.
Comparative Analysis of the Complete Plastomes of Apostasia wallichii and Neuwiedia singapureana (Apostasioideae) Reveals Different Evolutionary Dynamics of IR/SSC Boundary among Photosynthetic Orchids.

PubMed

Niu, Zhitao; Pan, Jiajia; Zhu, Shuying; Li, Ludan; Xue, Qingyun; Liu, Wei; Ding, Xiaoyu

2017-01-01

Apostasioideae, consists of only two genera, Apostasia and Neuwiedia , which are mainly distributed in Southeast Asia and northern Australia. The floral structure, taxonomy, biogeography, and genome variation of Apostasioideae have been intensively studied. However, detailed analyses of plastome composition and structure and comparisons with those of other orchid subfamilies have not yet been conducted. Here, the complete plastome sequences of Apostasia wallichii and Neuwiedia singapureana were sequenced and compared with 43 previously published photosynthetic orchid plastomes to characterize the plastome structure and evolution in the orchids. Unlike many orchid plastomes (e.g., Paphiopedilum and Vanilla ), the plastomes of Apostasioideae contain a full set of 11 functional NADH dehydrogenase ( ndh ) genes. The distribution of repeat sequences and simple sequence repeat elements enhanced the view that the mutation rate of non-coding regions was higher than that of coding regions. The 10 loci- ndhA intron, matK-5'trnK , clpP-psbB , rps8-rpl14 , trnT-trnL , 3'trnK-matK , clpP intron , psbK-trnK , trnS-psbC , and ndhF-rpl32 -that had the highest degrees of sequence variability were identified as mutational hotspots for the Apostasia plastome. Furthermore, our results revealed that plastid genes exhibited a variable evolution rate within and among different orchid genus. Considering the diversified evolution of both coding and non-coding regions, we suggested that the plastome-wide evolution of orchid species was disproportional. Additionally, the sequences flanking the inverted repeat/small single copy (IR/SSC) junctions of photosynthetic orchid plastomes were categorized into three types according to the presence/absence of ndh genes. Different evolutionary dynamics for each of the three IR/SSC types of photosynthetic orchid plastomes were also proposed.
Estimation of pea (Pisum sativum L.) microsatellite mutation rate based on pedigree and single-seed descent analyses.

PubMed

Cieslarová, Jaroslava; Hanáček, Pavel; Fialová, Eva; Hýbl, Miroslav; Smýkal, Petr

2011-11-01

Microsatellites, or simple sequence repeats (SSRs) are widespread class of repetitive DNA sequences, used in population genetics, genetic diversity and mapping studies. In spite of the SSR utility, the genetic and evolutionary mechanisms are not fully understood. We have investigated three microsatellite loci with different position in the pea (Pisum sativum L.) genome, the A9 locus residing in LTR region of abundant retrotransposon, AD270 as intergenic and AF016458 located in 5'untranslated region of expressed gene. Comparative analysis of a 35 pair samples from seven pea varieties propagated by single-seed descent for ten generations, revealed single 4 bp mutation in 10th generation sample at AD270 locus corresponding to stepwise increase in one additional ATCT repeat unit. The estimated mutation rate was 4.76 × 10(-3) per locus per generation, with a 95% confidence interval of 1.2 × 10(-4) to 2.7 × 10(-2). The comparison of cv. Bohatýr accessions retrieved from different collections, showed intra-, inter-accession variation and differences in flanking and repeat sequences. Fragment size and sequence alternations were also found in long term in vitro organogenic culture, established at 1983, indicative of somatic mutation process. The evidence of homoplasy was detected across of unrelated pea genotypes, which adversaly affects the reliability of diversity estimates not only for diverse germplasm but also highly bred material. The findings of this study have important implications for Pisum phylogeny studies, variety identification and registration process in pea breeding where mutation rate influences the genetic diversity and the effective population size estimates.
A LDR-PCR approach for multiplex polymorphisms genotyping of severely degraded DNA with fragment sizes <100 bp.

PubMed

Zhang, Zhen; Wang, Bao-Jie; Guan, Hong-Yu; Pang, Hao; Xuan, Jin-Feng

2009-11-01

Reducing amplicon sizes has become a major strategy for analyzing degraded DNA typical of forensic samples. However, amplicon sizes in current mini-short tandem repeat-polymerase chain reaction (PCR) and mini-sequencing assays are still not suitable for analysis of severely degraded DNA. In this study, we present a multiplex typing method that couples ligase detection reaction with PCR that can be used to identify single nucleotide polymorphisms and small-scale insertion/deletions in a sample of severely fragmented DNA. This method adopts thermostable ligation for allele discrimination and subsequent PCR for signal enhancement. In this study, four polymorphic loci were used to assess the ability of this technique to discriminate alleles in an artificially degraded sample of DNA with fragment sizes <100 bp. Our results showed clear allelic discrimination of single or multiple loci, suggesting that this method might aid in the analysis of extremely degraded samples in which allelic drop out of larger fragments is observed.
An integrated map of structural variation in 2,504 human genomes.

PubMed

Sudmant, Peter H; Rausch, Tobias; Gardner, Eugene J; Handsaker, Robert E; Abyzov, Alexej; Huddleston, John; Zhang, Yan; Ye, Kai; Jun, Goo; Fritz, Markus Hsi-Yang; Konkel, Miriam K; Malhotra, Ankit; Stütz, Adrian M; Shi, Xinghua; Casale, Francesco Paolo; Chen, Jieming; Hormozdiari, Fereydoun; Dayama, Gargi; Chen, Ken; Malig, Maika; Chaisson, Mark J P; Walter, Klaudia; Meiers, Sascha; Kashin, Seva; Garrison, Erik; Auton, Adam; Lam, Hugo Y K; Mu, Xinmeng Jasmine; Alkan, Can; Antaki, Danny; Bae, Taejeong; Cerveira, Eliza; Chines, Peter; Chong, Zechen; Clarke, Laura; Dal, Elif; Ding, Li; Emery, Sarah; Fan, Xian; Gujral, Madhusudan; Kahveci, Fatma; Kidd, Jeffrey M; Kong, Yu; Lameijer, Eric-Wubbo; McCarthy, Shane; Flicek, Paul; Gibbs, Richard A; Marth, Gabor; Mason, Christopher E; Menelaou, Androniki; Muzny, Donna M; Nelson, Bradley J; Noor, Amina; Parrish, Nicholas F; Pendleton, Matthew; Quitadamo, Andrew; Raeder, Benjamin; Schadt, Eric E; Romanovitch, Mallory; Schlattl, Andreas; Sebra, Robert; Shabalin, Andrey A; Untergasser, Andreas; Walker, Jerilyn A; Wang, Min; Yu, Fuli; Zhang, Chengsheng; Zhang, Jing; Zheng-Bradley, Xiangqun; Zhou, Wanding; Zichner, Thomas; Sebat, Jonathan; Batzer, Mark A; McCarroll, Steven A; Mills, Ryan E; Gerstein, Mark B; Bashir, Ali; Stegle, Oliver; Devine, Scott E; Lee, Charles; Eichler, Evan E; Korbel, Jan O

2015-10-01

Structural variants are implicated in numerous diseases and make up the majority of varying nucleotides among human genomes. Here we describe an integrated set of eight structural variant classes comprising both balanced and unbalanced variants, which we constructed using short-read DNA sequencing data and statistically phased onto haplotype blocks in 26 human populations. Analysing this set, we identify numerous gene-intersecting structural variants exhibiting population stratification and describe naturally occurring homozygous gene knockouts that suggest the dispensability of a variety of human genes. We demonstrate that structural variants are enriched on haplotypes identified by genome-wide association studies and exhibit enrichment for expression quantitative trait loci. Additionally, we uncover appreciable levels of structural variant complexity at different scales, including genic loci subject to clusters of repeated rearrangement and complex structural variants with multiple breakpoints likely to have formed through individual mutational events. Our catalogue will enhance future studies into structural variant demography, functional impact and disease association.
Location analysis for the estrogen receptor-α reveals binding to diverse ERE sequences and widespread binding within repetitive DNA elements

PubMed Central

Mason, Christopher E.; Shu, Feng-Jue; Wang, Cheng; Session, Ryan M.; Kallen, Roland G.; Sidell, Neil; Yu, Tianwei; Liu, Mei Hui; Cheung, Edwin; Kallen, Caleb B.

2010-01-01

Location analysis for estrogen receptor-α (ERα)-bound cis-regulatory elements was determined in MCF7 cells using chromatin immunoprecipitation (ChIP)-on-chip. Here, we present the estrogen response element (ERE) sequences that were identified at ERα-bound loci and quantify the incidence of ERE sequences under two stringencies of detection: <10% and 10–20% nucleotide deviation from the canonical ERE sequence. We demonstrate that ∼50% of all ERα-bound loci do not have a discernable ERE and show that most ERα-bound EREs are not perfect consensus EREs. Approximately one-third of all ERα-bound ERE sequences reside within repetitive DNA sequences, most commonly of the AluS family. In addition, the 3-bp spacer between the inverted ERE half-sites, rather than being random nucleotides, is C(A/T)G-enriched at bona fide receptor targets. Diverse ERα-bound loci were validated using electrophoretic mobility shift assay and ChIP-polymerase chain reaction (PCR). The functional significance of receptor-bound loci was demonstrated using luciferase reporter assays which proved that repetitive element ERE sequences contribute to enhancer function. ChIP-PCR demonstrated estrogen-dependent recruitment of the coactivator SRC3 to these loci in vivo. Our data demonstrate that ERα binds to widely variant EREs with less sequence specificity than had previously been suspected and that binding at repetitive and nonrepetitive genomic targets is favored by specific trinucleotide spacers. PMID:20047966
Location analysis for the estrogen receptor-alpha reveals binding to diverse ERE sequences and widespread binding within repetitive DNA elements.

PubMed

Mason, Christopher E; Shu, Feng-Jue; Wang, Cheng; Session, Ryan M; Kallen, Roland G; Sidell, Neil; Yu, Tianwei; Liu, Mei Hui; Cheung, Edwin; Kallen, Caleb B

2010-04-01

Location analysis for estrogen receptor-alpha (ERalpha)-bound cis-regulatory elements was determined in MCF7 cells using chromatin immunoprecipitation (ChIP)-on-chip. Here, we present the estrogen response element (ERE) sequences that were identified at ERalpha-bound loci and quantify the incidence of ERE sequences under two stringencies of detection: <10% and 10-20% nucleotide deviation from the canonical ERE sequence. We demonstrate that approximately 50% of all ERalpha-bound loci do not have a discernable ERE and show that most ERalpha-bound EREs are not perfect consensus EREs. Approximately one-third of all ERalpha-bound ERE sequences reside within repetitive DNA sequences, most commonly of the AluS family. In addition, the 3-bp spacer between the inverted ERE half-sites, rather than being random nucleotides, is C(A/T)G-enriched at bona fide receptor targets. Diverse ERalpha-bound loci were validated using electrophoretic mobility shift assay and ChIP-polymerase chain reaction (PCR). The functional significance of receptor-bound loci was demonstrated using luciferase reporter assays which proved that repetitive element ERE sequences contribute to enhancer function. ChIP-PCR demonstrated estrogen-dependent recruitment of the coactivator SRC3 to these loci in vivo. Our data demonstrate that ERalpha binds to widely variant EREs with less sequence specificity than had previously been suspected and that binding at repetitive and nonrepetitive genomic targets is favored by specific trinucleotide spacers.
Analysis of MHC class I genes across horse MHC haplotypes

PubMed Central

Tallmadge, Rebecca L.; Campbell, Julie A.; Miller, Donald C.; Antczak, Douglas F.

2010-01-01

The genomic sequences of 15 horse Major Histocompatibility Complex (MHC) class I genes and a collection of MHC class I homozygous horses of five different haplotypes were used to investigate the genomic structure and polymorphism of the equine MHC. A combination of conserved and locus-specific primers was used to amplify horse MHC class I genes with classical and non-classical characteristics. Multiple clones from each haplotype identified three to five classical sequences per homozygous animal, and two to three non-classical sequences. Phylogenetic analysis was applied to these sequences and groups were identified which appear to be allelic series, but some sequences were left ungrouped. Sequences determined from MHC class I heterozygous horses and previously described MHC class I sequences were then added, representing a total of ten horse MHC haplotypes. These results were consistent with those obtained from the MHC homozygous horses alone, and 30 classical sequences were assigned to four previously confirmed loci and three new provisional loci. The non-classical genes had few alleles and the classical genes had higher levels of allelic polymorphism. Alleles for two classical loci with the expected pattern of polymorphism were found in the majority of haplotypes tested, but alleles at two other commonly detected loci had more variation outside of the hypervariable region than within. Our data indicate that the equine Major Histocompatibility Complex is characterized by variation in the complement of class I genes expressed in different haplotypes in addition to the expected allelic polymorphism within loci. PMID:20099063
A modifier of Huntington's disease onset at the MLH1 locus.

PubMed

Lee, Jong-Min; Chao, Michael J; Harold, Denise; Abu Elneel, Kawther; Gillis, Tammy; Holmans, Peter; Jones, Lesley; Orth, Michael; Myers, Richard H; Kwak, Seung; Wheeler, Vanessa C; MacDonald, Marcy E; Gusella, James F

2017-10-01

Huntington's disease (HD) is a dominantly inherited neurodegenerative disease caused by an expanded CAG repeat in HTT. Many clinical characteristics of HD such as age at motor onset are determined largely by the size of HTT CAG repeat. However, emerging evidence strongly supports a role for other genetic factors in modifying the disease pathogenesis driven by mutant huntingtin. A recent genome-wide association analysis to discover genetic modifiers of HD onset age provided initial evidence for modifier loci on chromosomes 8 and 15 and suggestive evidence for a locus on chromosome 3. Here, genotyping of candidate single nucleotide polymorphisms in a cohort of 3,314 additional HD subjects yields independent confirmation of the former two loci and moves the third to genome-wide significance at MLH1, a locus whose mouse orthologue modifies CAG length-dependent phenotypes in a Htt-knock-in mouse model of HD. Both quantitative and dichotomous association analyses implicate a functional variant on ∼32% of chromosomes with the beneficial modifier effect that delays HD motor onset by 0.7 years/allele. Genomic DNA capture and sequencing of a modifier haplotype localize the functional variation to a 78 kb region spanning the 3'end of MLH1 and the 5'end of the neighboring LRRFIP2, and marked by an isoleucine-valine missense variant in MLH1. Analysis of expression Quantitative Trait Loci (eQTLs) provides modest support for altered regulation of MLH1 and LRRFIP2, raising the possibility that the modifier affects regulation of both genes. Finally, polygenic modification score and heritability analyses suggest the existence of additional genetic modifiers, supporting expanded, comprehensive genetic analysis of larger HD datasets. © The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Mapping of disease-associated variants in admixed populations

PubMed Central

2011-01-01

Recent developments in high-throughput genotyping and whole-genome sequencing will enhance the identification of disease loci in admixed populations. We discuss how a more refined estimation of ancestry benefits both admixture mapping and association mapping, making disease loci identification in admixed populations more powerful. High-throughput genotyping and sequencing will enable refined estimation of ancestry, thus enhancing disease loci identification in admixed populations PMID:21635713
On-line resources for bacterial micro-evolution studies using MLVA or CRISPR typing.

PubMed

Grissa, Ibtissem; Bouchon, Patrick; Pourcel, Christine; Vergnaud, Gilles

2008-04-01

The control of bacterial pathogens requires the development of tools allowing the precise identification of strains at the subspecies level. It is now widely accepted that these tools will need to be DNA-based assays (in contrast to identification at the species level, where biochemical based assays are still widely used, even though very powerful 16S DNA sequence databases exist). Typing assays need to be cheap and amenable to the designing of international databases. The success of such subspecies typing tools will eventually be measured by the size of the associated reference databases accessible over the internet. Three methods have shown some potential in this direction, the so-called spoligotyping assay (Mycobacterium tuberculosis, 40,000 entries database), Multiple Loci Sequence Typing (MLST; up to a few thousands entries for the more than 20 bacterial species), and more recently Multiple Loci VNTR Analysis (MLVA; up to a few hundred entries, assays available for more than 20 pathogens). In the present report we will review the current status of the tools and resources we have developed along the past seven years to help in the setting-up or the use of MLVA assays or lately for analysing Clustered Regularly Interspaced Short Palindromic Repeats called CRISPRs which are the basis for spoligotyping assays.
A review of the prevalence, utility, and caveats of using chloroplast simple sequence repeats for studies of plant biology1

PubMed Central

Wheeler, Gregory L.; Dorman, Hanna E.; Buchanan, Alenda; Challagundla, Lavanya; Wallace, Lisa E.

2014-01-01

Microsatellites occur in all plant genomes and provide useful markers for studies of genetic diversity and structure. Chloroplast microsatellites (cpSSRs) are frequently targeted because they are more easily isolated than nuclear microsatellites. Here, we quantified the frequency and uses of cpSSRs based on a literature review of over 400 studies published 1995–2013. These markers are an important and economical tool for plant biologists and continue to be used alongside modern genomics approaches to study genetic diversity and structure, evolutionary history, and hybridization in native and agricultural species. Studies using species-specific primers reported a greater number of polymorphic loci than those employing universal primers. A major disadvantage to cpSSRs is fragment size homoplasy; therefore, we documented its occurrence at several cpSSR loci within and between species of Acmispon (Fabaceae). Based on our empirical data set, we recommend targeted sequencing of a subset of samples combined with fragment genotyping as a cost-efficient, data-rich approach to the use of cpSSRs and as a test of homoplasy. The availability of genomic resources for plants aids in the development of primers for new study systems, thereby enhancing the utility of cpSSRs across plant biology. PMID:25506520
Evidence of Brucella strain ST27 in bottlenose dolphin (Tursiops truncatus) in Europe.

PubMed

Cvetnić, Željko; Duvnjak, Sanja; Đuras, Martina; Gomerčić, Tomislav; Reil, Irena; Zdelar-Tuk, Maja; Špičić, Silvio

2016-11-30

Marine mammal brucellosis has been known for more than 20 years, but recent work suggests it is more widespread than originally thought. Brucella (B.) pinnipedialis has been isolated from pinnipeds, while B. ceti strains have been associated with cetaceans. Here we report a Brucella strain isolated from multiple lymph nodes of one bottlenose dolphin (Tursiops truncatus) during routine examination of dolphin carcasses found in the Croatian part of the northern Adriatic Sea during the summer of 2015. Classical bacteriological biotyping, PCR-based techniques (single, multiplex, PCR-RFLP) and 16S rRNA DNA sequencing were used to identify Brucella spp. Multiple-locus variable number tandem repeat analysis of 16 loci and multilocus sequence typing of 9 loci were used for genotyping and species determination. The combination of bacteriological, molecular and genotyping techniques identified our strain as ST27, previously identified as a human pathogen. This report provides, to our knowledge, the first evidence of ST27 in the Adriatic Sea in particular and in European waters in general. The zoonotic nature of the strain and its presence in the Adriatic, which is inhabited by bottlenose dolphins, suggest that the strain may pose a significant threat to human health. Copyright © 2016 Elsevier B.V. All rights reserved.
Tools to exploit sequence data to find new markers and disease loci in dairy cattle

USDA-ARS?s Scientific Manuscript database

The decrease in cost of Next-Generation Sequencing has brought the technology into the realm of practical applications in livestock genomics. Recently, the 1000 Bulls Project has heralded the possibility of using full sequence data to improve imputation and detect disease loci within select founder ...
Evaluation of advanced multiplex short tandem repeat systems in pairwise kinship analysis.

PubMed

Tamura, Tomonori; Osawa, Motoki; Ochiai, Eriko; Suzuki, Takanori; Nakamura, Takashi

2015-09-01

The AmpFLSTR Identifiler Kit, comprising 15 autosomal short tandem repeat (STR) loci, is commonly employed in forensic practice for calculating match probabilities and parentage testing. The conventional system exhibits insufficient estimation for kinship analysis such as sibship testing because of shortness of examined loci. This study evaluated the power of the PowerPlex Fusion System, GlobalFiler Kit, and PowerPlex 21 System, which comprise more than 20 autosomal STR loci, to estimate pairwise blood relatedness (i.e., parent-child, full siblings, second-degree relatives, and first cousins). The genotypes of all 24 STR loci in 10,000 putative pedigrees were constructed by simulation. The likelihood ratio for each locus was calculated from joint probabilities for relatives and non-relatives. The combined likelihood ratio was calculated according to the product rule. The addition of STR loci improved separation between relatives and non-relatives. However, these systems were less effectively extended to the inference for first cousins. In conclusion, these advanced systems will be useful in forensic personal identification, especially in the evaluation of full siblings and second-degree relatives. Moreover, the additional loci may give rise to two major issues of more frequent mutational events and several pairs of linked loci on the same chromosome. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.
Harnessing CRISPR-Cas systems for bacterial genome editing.

PubMed

Selle, Kurt; Barrangou, Rodolphe

2015-04-01

Manipulation of genomic sequences facilitates the identification and characterization of key genetic determinants in the investigation of biological processes. Genome editing via clustered regularly interspaced short palindromic repeats (CRISPR)-CRISPR-associated (Cas) constitutes a next-generation method for programmable and high-throughput functional genomics. CRISPR-Cas systems are readily reprogrammed to induce sequence-specific DNA breaks at target loci, resulting in fixed mutations via host-dependent DNA repair mechanisms. Although bacterial genome editing is a relatively unexplored and underrepresented application of CRISPR-Cas systems, recent studies provide valuable insights for the widespread future implementation of this technology. This review summarizes recent progress in bacterial genome editing and identifies fundamental genetic and phenotypic outcomes of CRISPR targeting in bacteria, in the context of tool development, genome homeostasis, and DNA repair. Copyright © 2015 Elsevier Ltd. All rights reserved.
Phylogenic analysis and forensic genetic characterization of Chinese Uyghur group via autosomal multi STR markers

PubMed Central

Jin, Xiaoye; Wei, Yuanyuan; Chen, Jiangang; Kong, Tingting; Mu, Yuling; Guo, Yuxin; Dong, Qian; Xie, Tong; Meng, Haotian; Zhang, Meng; Li, Jianfei; Li, Xiaopeng; Zhu, Bofeng

2017-01-01

We investigated the allelic frequencies and forensic descriptive parameters of 23 autosomal short tandem repeat loci in a randomly selected sample of 1218 unrelated healthy Uyghur individuals residing in the Xinjiang Uyghur Autonomous Region, northwest China. A total of 281 alleles at these loci were identified and their corresponding allelic frequencies ranged from 0.0004 to 0.5390. The combined match probability and combined probability of exclusion of all loci were 5.192 × 10−29 and 0.9999999996594, respectively. The results of population genetic study manifested that Uyghur had close relationships with those contiguous populations, such as Xibe and Hui groups. In a word, these autosomal short tandem repeat loci were highly informative in Uyghur group and the multiplex PCR system could be used as a valuable tool for forensic caseworks and population genetic analysis. PMID:29088750
New chloroplast microsatellite markers suitable for assessing genetic diversity of Lolium perenne and other related grass species

PubMed Central

Diekmann, Kerstin; Hodkinson, Trevor R.; Barth, Susanne

2012-01-01

Background and Aims Lolium perenne (perennial ryegrass) is the most important forage grass species of temperate regions. We have previously released the chloroplast genome sequence of L. perenne ‘Cashel’. Here nine chloroplast microsatellite markers are published, which were designed based on knowledge about genetically variable regions within the L. perenne chloroplast genome. These markers were successfully used for characterizing the genetic diversity in Lolium and different grass species. Methods Chloroplast genomes of 14 Poaceae taxa were screened for mononucleotide microsatellite repeat regions and primers designed for their amplification from nine loci. The potential of these markers to assess genetic diversity was evaluated on a set of 16 Irish and 15 European L. perenne ecotypes, nine L. perenne cultivars, other Lolium taxa and other grass species. Key Results All analysed Poaceae chloroplast genomes contained more than 200 mononucleotide repeats (chloroplast simple sequence repeats, cpSSRs) of at least 7 bp in length, concentrated mainly in the large single copy region of the genome. Nucleotide composition varied considerably among subfamilies (with Pooideae biased towards poly A repeats). The nine new markers distinguish L. perenne from all non-Lolium taxa. TeaCpSSR28 was able to distinguish between all Lolium species and Lolium multiflorum due to an elongation of an A8 mononucleotide repeat in L. multiflorum. TeaCpSSR31 detected a considerable degree of microsatellite length variation and single nucleotide polymorphism. TeaCpSSR27 revealed variation within some L. perenne accessions due to a 44-bp indel and was hence readily detected by simple agarose gel electrophoresis. Smaller insertion/deletion events or single nucleotide polymorphisms detected by these new markers could be visualized by polyacrylamide gel electrophoresis or DNA sequencing, respectively. Conclusions The new markers are a valuable tool for plant breeding companies, seed testing agencies and the wider scientific community due to their ability to monitor genetic diversity within breeding pools, to trace maternal inheritance and to distinguish closely related species. PMID:22419761
New chloroplast microsatellite markers suitable for assessing genetic diversity of Lolium perenne and other related grass species.

PubMed

Diekmann, Kerstin; Hodkinson, Trevor R; Barth, Susanne

2012-11-01

Lolium perenne (perennial ryegrass) is the most important forage grass species of temperate regions. We have previously released the chloroplast genome sequence of L. perenne 'Cashel'. Here nine chloroplast microsatellite markers are published, which were designed based on knowledge about genetically variable regions within the L. perenne chloroplast genome. These markers were successfully used for characterizing the genetic diversity in Lolium and different grass species. Chloroplast genomes of 14 Poaceae taxa were screened for mononucleotide microsatellite repeat regions and primers designed for their amplification from nine loci. The potential of these markers to assess genetic diversity was evaluated on a set of 16 Irish and 15 European L. perenne ecotypes, nine L. perenne cultivars, other Lolium taxa and other grass species. All analysed Poaceae chloroplast genomes contained more than 200 mononucleotide repeats (chloroplast simple sequence repeats, cpSSRs) of at least 7 bp in length, concentrated mainly in the large single copy region of the genome. Nucleotide composition varied considerably among subfamilies (with Pooideae biased towards poly A repeats). The nine new markers distinguish L. perenne from all non-Lolium taxa. TeaCpSSR28 was able to distinguish between all Lolium species and Lolium multiflorum due to an elongation of an A(8) mononucleotide repeat in L. multiflorum. TeaCpSSR31 detected a considerable degree of microsatellite length variation and single nucleotide polymorphism. TeaCpSSR27 revealed variation within some L. perenne accessions due to a 44-bp indel and was hence readily detected by simple agarose gel electrophoresis. Smaller insertion/deletion events or single nucleotide polymorphisms detected by these new markers could be visualized by polyacrylamide gel electrophoresis or DNA sequencing, respectively. The new markers are a valuable tool for plant breeding companies, seed testing agencies and the wider scientific community due to their ability to monitor genetic diversity within breeding pools, to trace maternal inheritance and to distinguish closely related species.

Diversity of chromosomal karyotypes in maize and its relatives.

PubMed

Albert, P S; Gao, Z; Danilova, T V; Birchler, J A

2010-07-01

Maize is a highly diverse species on the gene sequence level. With the recent development of methods to distinguish each of the 10 pairs of homologues in somatic root tip spreads, a wide collection of maize lines was subjected to karyotype analysis to serve as a reference for the community and to examine the spectrum of chromosomal features in the species. The core nested association mapping progenitor collection and additional selections of diversity lines were examined. Commonly used inbred lines were included in the analysis. The centromere 4 specific repeat and ribosomal RNA loci were invariant. The CentC centromere repeat exhibited extensive differences in quantity on any particular chromosome across lines. Knob heterochromatin was highly variable with locations at many sites in the genome. Lastly, representative examples from other species in the genus Zea (teosintes) were examined, which provide information on the evolution of chromosomal features. Copyright 2010 S. Karger AG, Basel.
A multiplicity of factors contributes to selective RNA polymerase III occupancy of a subset of RNA polymerase III genes in mouse liver

PubMed Central

Canella, Donatella; Bernasconi, David; Gilardi, Federica; LeMartelot, Gwendal; Migliavacca, Eugenia; Praz, Viviane; Cousin, Pascal; Delorenzi, Mauro; Hernandez, Nouria; Hernandez, Nouria; Delorenzi, Mauro; Deplancke, Bart; Desvergne, Béatrice; Guex, Nicolas; Herr, Winship; Naef, Felix; Rougemont, Jacques; Schibler, Ueli; Deplancke, Bart; Guex, Nicolas; Herr, Winship; Guex, Nicolas; Andersin, Teemu; Cousin, Pascal; Gilardi, Federica; Gos, Pascal; Le Martelot, Gwendal; Lammers, Fabienne; Canella, Donatella; Gilardi, Federica; Raghav, Sunil; Fabbretti, Roberto; Fortier, Arnaud; Long, Li; Vlegel, Volker; Xenarios, Ioannis; Migliavacca, Eugenia; Praz, Viviane; Guex, Nicolas; Naef, Felix; Rougemont, Jacques; David, Fabrice; Jarosz, Yohan; Kuznetsov, Dmitry; Liechti, Robin; Martin, Olivier; Ross, Frederick; Sinclair, Lucas; Cajan, Julia; Krier, Irina; Leleu, Marion; Migliavacca, Eugenia; Molina, Nacho; Naldi, Aurélien; Rey, Guillaume; Symul, Laura; Guex, Nicolas; Naef, Felix; Rougemont, Jacques; Bernasconi, David; Delorenzi, Mauro; Andersin, Teemu; Canella, Donatella; Gilardi, Federica; Le Martelot, Gwendal; Lammers, Fabienne; Raghav, Sunil

2012-01-01

The genomic loci occupied by RNA polymerase (RNAP) III have been characterized in human culture cells by genome-wide chromatin immunoprecipitations, followed by deep sequencing (ChIP-seq). These studies have shown that only ∼40% of the annotated 622 human tRNA genes and pseudogenes are occupied by RNAP-III, and that these genes are often in open chromatin regions rich in active RNAP-II transcription units. We have used ChIP-seq to characterize RNAP-III-occupied loci in a differentiated tissue, the mouse liver. Our studies define the mouse liver RNAP-III-occupied loci including a conserved mammalian interspersed repeat (MIR) as a potential regulator of an RNAP-III subunit-encoding gene. They reveal that synteny relationships can be established between a number of human and mouse RNAP-III genes, and that the expression levels of these genes are significantly linked. They establish that variations within the A and B promoter boxes, as well as the strength of the terminator sequence, can strongly affect RNAP-III occupancy of tRNA genes. They reveal correlations with various genomic features that explain the observed variation of 81% of tRNA scores. In mouse liver, loci represented in the NCBI37/mm9 genome assembly that are clearly occupied by RNAP-III comprise 50 Rn5s (5S RNA) genes, 14 known non-tRNA RNAP-III genes, nine Rn4.5s (4.5S RNA) genes, and 29 SINEs. Moreover, out of the 433 annotated tRNA genes, half are occupied by RNAP-III. Transfer RNA gene expression levels reflect both an underlying genomic organization conserved in dividing human culture cells and resting mouse liver cells, and the particular promoter and terminator strengths of individual genes. PMID:22287103
Species-Level Phylogeny and Polyploid Relationships in Hordeum (Poaceae) Inferred by Next-Generation Sequencing and In Silico Cloning of Multiple Nuclear Loci.

PubMed

Brassac, Jonathan; Blattner, Frank R

2015-09-01

Polyploidization is an important speciation mechanism in the barley genus Hordeum. To analyze evolutionary changes after allopolyploidization, knowledge of parental relationships is essential. One chloroplast and 12 nuclear single-copy loci were amplified by polymerase chain reaction (PCR) in all Hordeum plus six out-group species. Amplicons from each of 96 individuals were pooled, sheared, labeled with individual-specific barcodes and sequenced in a single run on a 454 platform. Reference sequences were obtained by cloning and Sanger sequencing of all loci for nine supplementary individuals. The 454 reads were assembled into contigs representing the 13 loci and, for polyploids, also homoeologues. Phylogenetic analyses were conducted for all loci separately and for a concatenated data matrix of all loci. For diploid taxa, a Bayesian concordance analysis and a coalescent-based dated species tree was inferred from all gene trees. Chloroplast matK was used to determine the maternal parent in allopolyploid taxa. The relative performance of different multilocus analyses in the presence of incomplete lineage sorting and hybridization was also assessed. The resulting multilocus phylogeny reveals for the first time species phylogeny and progenitor-derivative relationships of all di- and polyploid Hordeum taxa within a single analysis. Our study proves that it is possible to obtain a multilocus species-level phylogeny for di- and polyploid taxa by combining PCR with next-generation sequencing, without cloning and without creating a heavy load of sequence data. © The Author(s) 2015. Published by Oxford University Press, on behalf of the Society of Systematic Biologists.
Sequencing and de novo assembly of visceral mass transcriptome of the critically endangered land snail Satsuma myomphala: Annotation and SSR discovery.

PubMed

Kang, Se Won; Patnaik, Bharat Bhusan; Hwang, Hee-Ju; Park, So Young; Chung, Jong Min; Song, Dae Kwon; Patnaik, Hongray Howrelia; Lee, Jae Bong; Kim, Changmu; Kim, Soonok; Park, Hong Seog; Park, Seung-Hwan; Park, Young-Su; Han, Yeon Soo; Lee, Jun Sang; Lee, Yong Seok

2017-03-01

Satsuma myomphala is critically endangered through loss of natural habitats, predation by natural enemies, and indiscriminate collection. It is a protected species in Korea but lacks genomic resources for an understanding of varied functional processes attributable to evolutionary success under natural habitats. For assessing the genetic information of S. myomphala, we performed for the first time, de novo transcriptome sequencing and functional annotation of expressed sequences using Illumina Next-Generation Sequencing (NGS) platform and bioinformatics analysis. We identified 103,774 unigenes of which 37,959, 12,890, and 17,699 were annotated in the PANM (Protostome DB), Unigene, and COG (Clusters of Orthologous Groups) databases, respectively. In addition, 14,451 unigenes were predicted under Gene Ontology functional categories, with 4581 assigned to a single category. Furthermore, 3369 sequences with 646 having Enzyme Commission (EC) numbers were mapped to 122 pathways in the Kyoto Encyclopedia of Genes and Genomes Pathway database. The prominent protein domains included the Zinc finger (C2H2-like), Reverse Transcriptase, Thioredoxin-like fold, and RNA recognition motif domain. Many unigenes with homology to immunity, defense, and reproduction-related genes were screened in the transcriptome. We also detected 3120 putative simple sequence repeats (SSRs) encompassing dinucleotide to hexanucleotide repeat motifs from >1kb unigene sequences. A list of PCR primers of SSR loci have been identified to study the genetic polymorphisms. The transcriptome data represents a valuable resource for further investigations on the species genome structure and biology. The unigenes information and microsatellites would provide an indispensable tool for conservation of the species in natural and adaptive environments. Copyright © 2016 The Authors. Published by Elsevier Inc. All rights reserved.
Simple sequence repeat marker development from bacterial artificial chromosome end sequences and expressed sequence tags of flax (Linum usitatissimum L.).

PubMed

Cloutier, Sylvie; Miranda, Evelyn; Ward, Kerry; Radovanovic, Natasa; Reimer, Elsa; Walichnowski, Andrzej; Datla, Raju; Rowland, Gordon; Duguid, Scott; Ragupathy, Raja

2012-08-01

Flax is an important oilseed crop in North America and is mostly grown as a fibre crop in Europe. As a self-pollinated diploid with a small estimated genome size of ~370 Mb, flax is well suited for fast progress in genomics. In the last few years, important genetic resources have been developed for this crop. Here, we describe the assessment and comparative analyses of 1,506 putative simple sequence repeats (SSRs) of which, 1,164 were derived from BAC-end sequences (BESs) and 342 from expressed sequence tags (ESTs). The SSRs were assessed on a panel of 16 flax accessions with 673 (58 %) and 145 (42 %) primer pairs being polymorphic in the BESs and ESTs, respectively. With 818 novel polymorphic SSR primer pairs reported in this study, the repertoire of available SSRs in flax has more than doubled from the combined total of 508 of all previous reports. Among nucleotide motifs, trinucleotides were the most abundant irrespective of the class, but dinucleotides were the most polymorphic. SSR length was also positively correlated with polymorphism. Two dinucleotide (AT/TA and AG/GA) and two trinucleotide (AAT/ATA/TAA and GAA/AGA/AAG) motifs and their iterations, different from those reported in many other crops, accounted for more than half of all the SSRs and were also more polymorphic (63.4 %) than the rest of the markers (42.7 %). This improved resource promises to be useful in genetic, quantitative trait loci (QTL) and association mapping as well as for anchoring the physical/genetic map with the whole genome shotgun reference sequence of flax.
Sequence Capture versus Restriction Site Associated DNA Sequencing for Shallow Systematics.

PubMed

Harvey, Michael G; Smith, Brian Tilston; Glenn, Travis C; Faircloth, Brant C; Brumfield, Robb T

2016-09-01

Sequence capture and restriction site associated DNA sequencing (RAD-Seq) are two genomic enrichment strategies for applying next-generation sequencing technologies to systematics studies. At shallow timescales, such as within species, RAD-Seq has been widely adopted among researchers, although there has been little discussion of the potential limitations and benefits of RAD-Seq and sequence capture. We discuss a series of issues that may impact the utility of sequence capture and RAD-Seq data for shallow systematics in non-model species. We review prior studies that used both methods, and investigate differences between the methods by re-analyzing existing RAD-Seq and sequence capture data sets from a Neotropical bird (Xenops minutus). We suggest that the strengths of RAD-Seq data sets for shallow systematics are the wide dispersion of markers across the genome, the relative ease and cost of laboratory work, the deep coverage and read overlap at recovered loci, and the high overall information that results. Sequence capture's benefits include flexibility and repeatability in the genomic regions targeted, success using low-quality samples, more straightforward read orthology assessment, and higher per-locus information content. The utility of a method in systematics, however, rests not only on its performance within a study, but on the comparability of data sets and inferences with those of prior work. In RAD-Seq data sets, comparability is compromised by low overlap of orthologous markers across species and the sensitivity of genetic diversity in a data set to an interaction between the level of natural heterozygosity in the samples examined and the parameters used for orthology assessment. In contrast, sequence capture of conserved genomic regions permits interrogation of the same loci across divergent species, which is preferable for maintaining comparability among data sets and studies for the purpose of drawing general conclusions about the impact of historical processes across biotas. We argue that sequence capture should be given greater attention as a method of obtaining data for studies in shallow systematics and comparative phylogeography. © The Author(s) 2016. Published by Oxford University Press, on behalf of the Society of Systematic Biologists. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Utility of next-generation RNA-sequencing in identifying chimeric transcription involving human endogenous retroviruses.

PubMed

Sokol, Martin; Jessen, Karen Margrethe; Pedersen, Finn Skou

2016-01-01

Several studies have shown that human endogenous retroviruses and endogenous retrovirus-like repeats (here collectively HERVs) impose direct regulation on human genes through enhancer and promoter motifs present in their long terminal repeats (LTRs). Although chimeric transcription in which novel gene isoforms containing retroviral and human sequence are transcribed from viral promoters are commonly associated with disease, regulation by HERVs is beneficial in other settings; for example, in human testis chimeric isoforms of TP63 induced by an ERV9 LTR protect the male germ line upon DNA damage by inducing apoptosis, whereas in the human globin locus the γ- and β-globin switch during normal hematopoiesis is mediated by complex interactions of an ERV9 LTR and surrounding human sequence. The advent of deep sequencing or next-generation sequencing (NGS) has revolutionized the way researchers solve important scientific questions and develop novel hypotheses in relation to human genome regulation. We recently applied next-generation paired-end RNA-sequencing (RNA-seq) together with chromatin immunoprecipitation with sequencing (ChIP-seq) to examine ERV9 chimeric transcription in human reference cell lines from Encyclopedia of DNA Elements (ENCODE). This led to the discovery of advanced regulation mechanisms by ERV9s and other HERVs across numerous human loci including transcription of large gene-unannotated genomic regions, as well as cooperative regulation by multiple HERVs and non-LTR repeats such as Alu elements. In this article, well-established examples of human gene regulation by HERVs are reviewed followed by a description of paired-end RNA-seq, and its application in identifying chimeric transcription genome-widely. Based on integrative analyses of RNA-seq and ChIP-seq, data we then present novel examples of regulation by ERV9s of tumor suppressor genes CADM2 and SEMA3A, as well as transcription of an unannotated region. Taken together, this article highlights the high suitability of contemporary sequencing methods in future analyses of human biology in relation to evolutionary acquired retroviruses in the human genome. © 2016 APMIS. Published by John Wiley & Sons Ltd.
Analysis of the type II-A CRISPR-Cas system of Streptococcus agalactiae reveals distinctive features according to genetic lineages

PubMed Central

Lier, Clément; Baticle, Elodie; Horvath, Philippe; Haguenoer, Eve; Valentin, Anne-Sophie; Glaser, Philippe; Mereghetti, Laurent; Lanotte, Philippe

2015-01-01

CRISPR-Cas systems (clustered regularly interspaced short palindromic repeats/CRISPR-associated proteins) are found in 90% of archaea and about 40% of bacteria. In this original system, CRISPR arrays comprise short, almost unique sequences called spacers that are interspersed with conserved palindromic repeats. These systems play a role in adaptive immunity and participate to fight non-self DNA such as integrative and conjugative elements, plasmids, and phages. In Streptococcus agalactiae, a bacterium implicated in colonization and infections in humans since the 1960s, two CRISPR-Cas systems have been described. A type II-A system, characterized by proteins Cas9, Cas1, Cas2, and Csn2, is ubiquitous, and a type I–C system, with the Cas8c signature protein, is present in about 20% of the isolates. Unlike type I–C, which appears to be non-functional, type II-A appears fully functional. Here we studied type II-A CRISPR-cas loci from 126 human isolates of S. agalactiae belonging to different clonal complexes that represent the diversity of the species and that have been implicated in colonization or infection. The CRISPR-cas locus was analyzed both at spacer and repeat levels. Major distinctive features were identified according to the phylogenetic lineages previously defined by multilocus sequence typing, especially for the sequence type (ST) 17, which is considered hypervirulent. Among other idiosyncrasies, ST-17 shows a significantly lower number of spacers in comparison with other lineages. This characteristic could reflect the peculiar virulence or colonization specificities of this lineage. PMID:26124774
Dynamic of Mutational Events in Variable Number Tandem Repeats of Escherichia coli O157:H7

PubMed Central

Bustamante, A. V.; Sanso, A. M.; Segura, D. O.; Parma, A. E.; Lucchesi, P. M. A.

2013-01-01

VNTRs regions have been successfully used for bacterial subtyping; however, the hypervariability in VNTR loci is problematic when trying to predict the relationships among isolates. Since few studies have examined the mutation rate of these markers, our aim was to estimate mutation rates of VNTRs specific for verotoxigenic E. coli O157:H7. The knowledge of VNTR mutational rates and the factors affecting them would make MLVA more effective for epidemiological or microbial forensic investigations. For this purpose, we analyzed nine loci performing parallel, serial passage experiments (PSPEs) on 9 O157:H7 strains. The combined 9 PSPE population rates for the 8 mutating loci ranged from 4.4 × 10−05 to 1.8 × 10−03 mutations/generation, and the combined 8-loci mutation rate was of 2.5 × 10−03 mutations/generation. Mutations involved complete repeat units, with only one point mutation detected. A similar proportion between single and multiple repeat changes was detected. Of the 56 repeat mutations, 59% were insertions and 41% were deletions, and 72% of the mutation events corresponded to O157-10 locus. For alleles with up to 13 UR, a constant and low mutation rate was observed; meanwhile longer alleles were associated with higher and variable mutation rates. Our results are useful to interpret data from microevolution and population epidemiology studies and particularly point out that the inclusion or not of O157-10 locus or, alternatively, a differential weighting data according to the mutation rates of loci must be evaluated in relation with the objectives of the proposed study. PMID:24093095
Application of novel polymorphic microsatellite loci identified in the Korean Pacific Abalone (Haliotis diversicolor supertexta (Haliotidae)) in the genetic characterization of wild and released populations.

PubMed

An, Hye Suck; Lee, Jang Wook; Hong, Seong Wan

2012-01-01

The small abalone, Haliotis diversicolor supertexta, of the family Haliotidae, is one of the most important species of marine shellfish in eastern Asia. Over the past few decades, this species has drastically declined in Korea. Thus, hatchery-bred seeds have been released into natural coastal areas to compensate for the reduced fishery resources. However, information on the genetic background of the small abalone is scarce. In this study, 20 polymorphic microsatellite DNA markers were identified using next-generation sequencing techniques and used to compare allelic variation between wild and released abalone populations in Korea. Using high-throughput genomic sequencing, a total of 1516 (2.26%; average length of 385 bp) reads containing simple sequence repeats were obtained from 86,011 raw reads. Among the 99 loci screened, 28 amplified successfully, and 20 were polymorphic. When comparing allelic variation between wild and released abalone populations, a total of 243 different alleles were observed, with 18.7 alleles per locus. High genetic diversity (mean heterozygosity = 0.81; mean allelic number = 15.5) was observed in both populations. A statistical analysis of the fixation index (F(ST)) and analysis of molecular variance (AMOVA) indicated limited genetic differences between the two populations (F(ST) = 0.002, p > 0.05). Although no significant reductions in the genetic diversity were found in the released population compared with the wild population (p > 0.05), the genetic diversity parameters revealed that the seeds released for stock abundance had a different genetic composition. These differences are likely a result of hatchery selection and inbreeding. Additionally, all the primer pair sets were effectively amplified in another congeneric species, H. diversicolor diversicolor, indicating that these primers are useful for both abalone species. These microsatellite loci may be valuable for future aquaculture and population genetic studies aimed at developing conservation and management plans for these two abalone species.
Short interspersed transposable elements (SINEs) are excluded from imprinted regions in the human genome.

PubMed

Greally, John M

2002-01-08

To test whether regions undergoing genomic imprinting have unique genomic characteristics, imprinted and nonimprinted human loci were compared for nucleotide and retroelement composition. Maternally and paternally expressed subgroups of imprinted genes were found to differ in terms of guanine and cytosine, CpG, and retroelement content, indicating a segregation into distinct genomic compartments. Imprinted regions have been normally permissive to L1 long interspersed transposable element retroposition during mammalian evolution but universally and significantly lack short interspersed transposable elements (SINEs). The primate-specific Alu SINEs, as well as the more ancient mammalian-wide interspersed repeat SINEs, are found at significantly low densities in imprinted regions. The latter paleogenomic signature indicates that the sequence characteristics of currently imprinted regions existed before the mammalian radiation. Transitions from imprinted to nonimprinted genomic regions in cis are characterized by a sharp inflection in SINE content, demonstrating that this genomic characteristic can help predict the presence and extent of regions undergoing imprinting. During primate evolution, SINE accumulation in imprinted regions occurred at a decreased rate compared with control loci. The constraint on SINE accumulation in imprinted regions may be mediated by an active selection process. This selection could be because of SINEs attracting and spreading methylation, as has been found at other loci. Methylation-induced silencing could lead to deleterious consequences at imprinted loci, where inactivation of one allele is already established, and expression is often essential for embryonic growth and survival.
Short interspersed transposable elements (SINEs) are excluded from imprinted regions in the human genome

PubMed Central

Greally, John M.

2002-01-01

To test whether regions undergoing genomic imprinting have unique genomic characteristics, imprinted and nonimprinted human loci were compared for nucleotide and retroelement composition. Maternally and paternally expressed subgroups of imprinted genes were found to differ in terms of guanine and cytosine, CpG, and retroelement content, indicating a segregation into distinct genomic compartments. Imprinted regions have been normally permissive to L1 long interspersed transposable element retroposition during mammalian evolution but universally and significantly lack short interspersed transposable elements (SINEs). The primate-specific Alu SINEs, as well as the more ancient mammalian-wide interspersed repeat SINEs, are found at significantly low densities in imprinted regions. The latter paleogenomic signature indicates that the sequence characteristics of currently imprinted regions existed before the mammalian radiation. Transitions from imprinted to nonimprinted genomic regions in cis are characterized by a sharp inflection in SINE content, demonstrating that this genomic characteristic can help predict the presence and extent of regions undergoing imprinting. During primate evolution, SINE accumulation in imprinted regions occurred at a decreased rate compared with control loci. The constraint on SINE accumulation in imprinted regions may be mediated by an active selection process. This selection could be because of SINEs attracting and spreading methylation, as has been found at other loci. Methylation-induced silencing could lead to deleterious consequences at imprinted loci, where inactivation of one allele is already established, and expression is often essential for embryonic growth and survival. PMID:11756672
Genome Comparison of Barley and Maize Smut Fungi Reveals Targeted Loss of RNA Silencing Components and Species-Specific Presence of Transposable Elements[W

PubMed Central

Laurie, John D.; Ali, Shawkat; Linning, Rob; Mannhaupt, Gertrud; Wong, Philip; Güldener, Ulrich; Münsterkötter, Martin; Moore, Richard; Kahmann, Regine; Bakkeren, Guus; Schirawski, Jan

2012-01-01

Ustilago hordei is a biotrophic parasite of barley (Hordeum vulgare). After seedling infection, the fungus persists in the plant until head emergence when fungal spores develop and are released from sori formed at kernel positions. The 26.1-Mb U. hordei genome contains 7113 protein encoding genes with high synteny to the smaller genomes of the related, maize-infecting smut fungi Ustilago maydis and Sporisorium reilianum but has a larger repeat content that affected genome evolution at important loci, including mating-type and effector loci. The U. hordei genome encodes components involved in RNA interference and heterochromatin formation, normally involved in genome defense, that are lacking in the U. maydis genome due to clean excision events. These excision events were possibly a result of former presence of repetitive DNA and of an efficient homologous recombination system in U. maydis. We found evidence of repeat-induced point mutations in the genome of U. hordei, indicating that smut fungi use different strategies to counteract the deleterious effects of repetitive DNA. The complement of U. hordei effector genes is comparable to the other two smuts but reveals differences in family expansion and clustering. The availability of the genome sequence will facilitate the identification of genes responsible for virulence and evolution of smut fungi on their respective hosts. PMID:22623492
Genome comparison of barley and maize smut fungi reveals targeted loss of RNA silencing components and species-specific presence of transposable elements.

PubMed

Laurie, John D; Ali, Shawkat; Linning, Rob; Mannhaupt, Gertrud; Wong, Philip; Güldener, Ulrich; Münsterkötter, Martin; Moore, Richard; Kahmann, Regine; Bakkeren, Guus; Schirawski, Jan

2012-05-01

Ustilago hordei is a biotrophic parasite of barley (Hordeum vulgare). After seedling infection, the fungus persists in the plant until head emergence when fungal spores develop and are released from sori formed at kernel positions. The 26.1-Mb U. hordei genome contains 7113 protein encoding genes with high synteny to the smaller genomes of the related, maize-infecting smut fungi Ustilago maydis and Sporisorium reilianum but has a larger repeat content that affected genome evolution at important loci, including mating-type and effector loci. The U. hordei genome encodes components involved in RNA interference and heterochromatin formation, normally involved in genome defense, that are lacking in the U. maydis genome due to clean excision events. These excision events were possibly a result of former presence of repetitive DNA and of an efficient homologous recombination system in U. maydis. We found evidence of repeat-induced point mutations in the genome of U. hordei, indicating that smut fungi use different strategies to counteract the deleterious effects of repetitive DNA. The complement of U. hordei effector genes is comparable to the other two smuts but reveals differences in family expansion and clustering. The availability of the genome sequence will facilitate the identification of genes responsible for virulence and evolution of smut fungi on their respective hosts.
Microsatellite alterations as clonal markers for the detection of human cancer.

PubMed Central

Mao, L; Lee, D J; Tockman, M S; Erozan, Y S; Askin, F; Sidransky, D

1994-01-01

Microsatellite instability has been reported to be an important feature of tumors from hereditary nonpolyposis colorectal carcinoma (HNPCC) patients. The recent discovery of genetic instability in small cell lung carcinoma, a neoplasm not associated with HNPCC, led us to investigate the possible presence of microsatellite alterations in other tumor types. We examined 52 microsatellite repeat sequences in the DNA of normal and tumor pairs from 100 head and neck, bladder, and lung cancer patients by the polymerase chain reaction. Although alterations were rare in dinucleotide repeats, larger (tri- or tetranucleotide) repeats were found to be more prone to expansion or deletion. We screened 100 tumors with a panel of nine tri- and tetranucleotide repeat markers and identified 26 (26%) that displayed alterations in at least one locus. This observation prompted us to examine the possibility of using microsatellite alterations as markers to detect clonal tumor-derived cell populations in pathologic samples. The identical microsatellite alterations detected in the primary tumors were successfully identified in corresponding urine, sputum, and surgical margins from affected patients. This study demonstrates that appropriately selected microsatellite loci are commonly altered in many cancers and can serve as clonal markers for their detection. Images PMID:7937908
Concerted copy number variation balances ribosomal DNA dosage in human and mouse genomes

PubMed Central

Gibbons, John G.; Branco, Alan T.; Godinho, Susana A.; Yu, Shoukai; Lemos, Bernardo

2015-01-01

Tandemly repeated ribosomal DNA (rDNA) arrays are among the most evolutionary dynamic loci of eukaryotic genomes. The loci code for essential cellular components, yet exhibit extensive copy number (CN) variation within and between species. CN might be partly determined by the requirement of dosage balance between the 5S and 45S rDNA arrays. The arrays are nonhomologous, physically unlinked in mammals, and encode functionally interdependent RNA components of the ribosome. Here we show that the 5S and 45S rDNA arrays exhibit concerted CN variation (cCNV). Despite 5S and 45S rDNA elements residing on different chromosomes and lacking sequence similarity, cCNV between these loci is strong, evolutionarily conserved in humans and mice, and manifested across individual genotypes in natural populations and pedigrees. Finally, we observe that bisphenol A induces rapid and parallel modulation of 5S and 45S rDNA CN. Our observations reveal a novel mode of genome variation, indicate that natural selection contributed to the evolution and conservation of cCNV, and support the hypothesis that 5S CN is partly determined by the requirement of dosage balance with the 45S rDNA array. We suggest that human disease variation might be traced to disrupted rDNA dosage balance in the genome. PMID:25583482
Polymorphic microsatellite markers for the rare and endangered cactus Uebelmannia pectinifera (Cactaceae) and its congeneric species.

PubMed

Moraes, E M; Cidade, F W; Silva, G A R; Machado, M C

2014-12-04

The cactus genus Uebelmannia includes 3 narrow endemic species associated with rocky savanna habitats in eastern South America. Because of their rarity and illegal over-collection, all of these species are endangered. Taxonomic uncertainties resulting from dramatic local variation in morphology within Uebelmannia species preclude effective conservation efforts, such as the reintroduction or translocation of plants, to restore declining populations. In this study, we developed and characterized 18 perfect, dinucleotide simple-sequence repeat markers for U. pectinifera, the most widely distributed species in the genus, and tested the cross-amplification of these markers in the remaining congeneric species and subspecies. All markers were polymorphic in a sample from 2 U. pectinifera populations. The effective number of alleles ranged from 1.6 to 8.7, with an average per population of 3.3 (SE ± 0.30) and 4.5 (SE ± 0.50). Expected heterozygosity ranged from 0.375 to 0.847 and 8-10 loci showed departures from Hardy- Weinberg equilibrium in the analyzed populations. Based on the observed polymorphism level of each marker, as well as the analysis of null allele presence and evidence of amplification of duplicate loci, a subset of 12 loci can be used as reliable markers to investigate the genetic structure, diversity, and species limits of the Uebelmannia genus.
Genome scans for divergent selection in natural populations of the widespread hardwood species Eucalyptus grandis (Myrtaceae) using microsatellites

PubMed Central

Song, Zhijiao; Zhang, Miaomiao; Li, Fagen; Weng, Qijie; Zhou, Chanpin; Li, Mei; Li, Jie; Huang, Huanhua; Mo, Xiaoyong; Gan, Siming

2016-01-01

Identification of loci or genes under natural selection is important for both understanding the genetic basis of local adaptation and practical applications, and genome scans provide a powerful means for such identification purposes. In this study, genome-wide simple sequence repeats markers (SSRs) were used to scan for molecular footprints of divergent selection in Eucalyptus grandis, a hardwood species occurring widely in costal areas from 32° S to 16° S in Australia. High population diversity levels and weak population structure were detected with putatively neutral genomic SSRs. Using three FST outlier detection methods, a total of 58 outlying SSRs were collectively identified as loci under divergent selection against three non-correlated climatic variables, namely, mean annual temperature, isothermality and annual precipitation. Using a spatial analysis method, nine significant associations were revealed between FST outlier allele frequencies and climatic variables, involving seven alleles from five SSR loci. Of the five significant SSRs, two (EUCeSSR1044 and Embra394) contained alleles of putative genes with known functional importance for response to climatic factors. Our study presents critical information on the population diversity and structure of the important woody species E. grandis and provides insight into the adaptive responses of perennial trees to climatic variations. PMID:27748400
Characterization of EST-based SSR loci in the spruce budworm, Choristoneura fumiferana (Lepidoptera: Tortricidae)

Treesearch

B.M.T. Brunet; D. Doucet; B.R. Sturtevant; F.A.H. Sperling

2013-01-01

After identifying 114 microsatellite loci from Choristoneura fumiferana expressed sequence tags, 87 loci were assayed in a panel of 11 wild-caught individuals, giving 29 polymorphic loci. Further analysis of 20 of these loci on 31 individuals collected from a single population in northern Minnesota identified 14 in Hardy-Weinberg equilibrium.
Chromatin accessibility and guide sequence secondary structure affect CRISPR-Cas9 gene editing efficiency.

PubMed

Jensen, Kristopher Torp; Fløe, Lasse; Petersen, Trine Skov; Huang, Jinrong; Xu, Fengping; Bolund, Lars; Luo, Yonglun; Lin, Lin

2017-07-01

Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR)-associated protein 9 (CRISPR-Cas9) systems have emerged as the method of choice for genome editing, but large variations in on-target efficiencies continue to limit their applicability. Here, we investigate the effect of chromatin accessibility on Cas9-mediated gene editing efficiency for 20 gRNAs targeting 10 genomic loci in HEK293T cells using both SpCas9 and the eSpCas9(1.1) variant. Our study indicates that gene editing is more efficient in euchromatin than in heterochromatin, and we validate this finding in HeLa cells and in human fibroblasts. Furthermore, we investigate the gRNA sequence determinants of CRISPR-Cas9 activity using a surrogate reporter system and find that the efficiency of Cas9-mediated gene editing is dependent on guide sequence secondary structure formation. This knowledge can aid in the further improvement of tools for gRNA design. © 2017 Federation of European Biochemical Societies.

Mutations in Cas9 Enhance the Rate of Acquisition of Viral Spacer Sequences during the CRISPR-Cas Immune Response.

PubMed

Heler, Robert; Wright, Addison V; Vucelja, Marija; Bikard, David; Doudna, Jennifer A; Marraffini, Luciano A

2017-01-05

CRISPR loci and their associated (Cas) proteins encode a prokaryotic immune system that protects against viruses and plasmids. Upon infection, a low fraction of cells acquire short DNA sequences from the invader. These sequences (spacers) are integrated in between the repeats of the CRISPR locus and immunize the host against the matching invader. Spacers specify the targets of the CRISPR immune response through transcription into short RNA guides that direct Cas nucleases to the invading DNA molecules. Here we performed random mutagenesis of the RNA-guided Cas9 nuclease to look for variants that provide enhanced immunity against viral infection. We identified a mutation, I473F, that increases the rate of spacer acquisition by more than two orders of magnitude. Our results highlight the role of Cas9 during CRISPR immunization and provide a useful tool to study this rare process and develop it as a biotechnological application. Copyright © 2017 Elsevier Inc. All rights reserved.
A bacterial Argonaute with noncanonical guide RNA specificity

PubMed Central

Kaya, Emine; Doxzen, Kevin W.; Knoll, Kilian R.; Wilson, Ross C.; Strutt, Steven C.; Kranzusch, Philip J.; Doudna, Jennifer A.

2016-01-01

Eukaryotic Argonaute proteins induce gene silencing by small RNA-guided recognition and cleavage of mRNA targets. Although structural similarities between human and prokaryotic Argonautes are consistent with shared mechanistic properties, sequence and structure-based alignments suggested that Argonautes encoded within CRISPR-cas [clustered regularly interspaced short palindromic repeats (CRISPR)-associated] bacterial immunity operons have divergent activities. We show here that the CRISPR-associated Marinitoga piezophila Argonaute (MpAgo) protein cleaves single-stranded target sequences using 5′-hydroxylated guide RNAs rather than the 5′-phosphorylated guides used by all known Argonautes. The 2.0-Å resolution crystal structure of an MpAgo–RNA complex reveals a guide strand binding site comprising residues that block 5′ phosphate interactions. Using structure-based sequence alignment, we were able to identify other putative MpAgo-like proteins, all of which are encoded within CRISPR-cas loci. Taken together, our data suggest the evolution of an Argonaute subclass with noncanonical specificity for a 5′-hydroxylated guide. PMID:27035975
The association of 22 Y chromosome short tandem repeat loci with initiative-aggressive behavior.

PubMed

Yang, Chun; Ba, Huajie; Zhang, Wei; Zhang, Shuyou; Zhao, Hanqing; Yu, Haiying; Gao, Zhiqin; Wang, Binbin

2018-05-15

Aggressive behavior represents an important public concern and a clinical challenge to behaviorists and psychiatrists. Aggression in humans is known to have an important genetic basis, so to investigate the association of Y chromosome short tandem repeat (Y-STR) loci with initiative-aggressive behavior, we compared allelic and haplotypic distributions of 22 Y-STRs in a group of Chinese males convicted of premeditated extremely violent crimes (n = 271) with a normal control group (n = 492). Allelic distributions of DYS533 and DYS437 loci differed significantly between the two groups (P < 0.05). The case group had higher frequencies of DYS533 allele 14, DYS437 allele 14, and haplotypes 11-14 of DYS533-DYS437 compared with the control group. Additionally, the DYS437 allele 15 frequency was significantly lower in cases than controls. No frequency differences were observed in the other 20 Y-STR loci between these two groups. Our results indicate a genetic role for Y-STR loci in the development of initiative aggression in non-psychiatric subjects. Copyright © 2018 Elsevier B.V. All rights reserved.
A highly polymorphic dinucleotide repeat on the proximal short arm of the human X chromosome: linkage mapping of the synapsin I/A-raf-1 genes.

PubMed Central

Kirchgessner, C U; Trofatter, J A; Mahtani, M M; Willard, H F; DeGennaro, L J

1991-01-01

A compound (AC)n repeat located 1,000 bp downstream from the human synapsin I gene and within the last intron of the A-raf-1 gene has been identified. DNA data-base comparisons of the sequences surrounding the repeat indicate that the synapsin I gene and the A-raf-1 gene lie immediately adjacent to each other, in opposite orientation. PCR amplification of this synapsin I/A-raf-1 associated repeat by using total genomic DNA from members of the 40 reference pedigree families of the Centre d'Etude du Polymorphisme Humaine showed it to be highly polymorphic, with a PIC value of .84 and a minimum of eight alleles. Because the synapsin I gene has been mapped previously to the short arm of the human X chromosome at Xp11.2, linkage analysis was performed with markers on the proximal short arm of the X chromosome. The most likely gene order is DXS7SYN/ARAF1TIMPDXS255DXS146, with a relative probability of 5 x 10(8) as compared with the next most likely order. This highly informative repeat should serve as a valuable marker for disease loci mapped to the Xp11 region. Images Figure 2 PMID:1905878
Genetic diversity of an Azorean endemic and endangered plant species inferred from inter-simple sequence repeat markers.

PubMed

Lopes, Maria S; Mendonça, Duarte; Bettencourt, Sílvia X; Borba, Ana R; Melo, Catarina; Baptista, Cláudio; da Câmara Machado, Artur

2014-06-26

Knowledge of the levels and distribution of genetic diversity is important for designing conservation strategies for threatened and endangered species so as to guarantee sustainable survival of populations and to preserve their evolutionary potential. Picconia azorica is a valuable Azorean endemic species recently classified as endangered. To contribute with information useful for the establishment of conservation programmes, the genetic variability and differentiation among 230 samples from 11 populations collected in three Azorean islands was accessed with eight inter-simple sequence repeat markers. A total of 64 polymorphic loci were detected. The majority of genetic variability was found within populations and no genetic structure was detected between populations and between islands. Also the coefficient of genetic differentiation and the level of gene flow indicate that geographical distances do not act as barriers for gene flow. In order to ensure the survival of populations in situ and ex situ management practices should be considered, including artificial propagation through the use of plant tissue culture techniques, not only for the restoration of habitat but also for the sustainable use of its valuable wood. Published by Oxford University Press on behalf of the Annals of Botany Company.
A SSR-based composite genetic linkage map for the cultivated peanut (Arachis hypogaea L.) genome

PubMed Central

2010-01-01

Background The construction of genetic linkage maps for cultivated peanut (Arachis hypogaea L.) has and continues to be an important research goal to facilitate quantitative trait locus (QTL) analysis and gene tagging for use in a marker-assisted selection in breeding. Even though a few maps have been developed, they were constructed using diploid or interspecific tetraploid populations. The most recently published intra-specific map was constructed from the cross of cultivated peanuts, in which only 135 simple sequence repeat (SSR) markers were sparsely populated in 22 linkage groups. The more detailed linkage map with sufficient markers is necessary to be feasible for QTL identification and marker-assisted selection. The objective of this study was to construct a genetic linkage map of cultivated peanut using simple sequence repeat (SSR) markers derived primarily from peanut genomic sequences, expressed sequence tags (ESTs), and by "data mining" sequences released in GenBank. Results Three recombinant inbred lines (RILs) populations were constructed from three crosses with one common female parental line Yueyou 13, a high yielding Spanish market type. The four parents were screened with 1044 primer pairs designed to amplify SSRs and 901 primer pairs produced clear PCR products. Of the 901 primer pairs, 146, 124 and 64 primer pairs (markers) were polymorphic in these populations, respectively, and used in genotyping these RIL populations. Individual linkage maps were constructed from each of the three populations and a composite map based on 93 common loci were created using JoinMap. The composite linkage maps consist of 22 composite linkage groups (LG) with 175 SSR markers (including 47 SSRs on the published AA genome maps), representing the 20 chromosomes of A. hypogaea. The total composite map length is 885.4 cM, with an average marker density of 5.8 cM. Segregation distortion in the 3 populations was 23.0%, 13.5% and 7.8% of the markers, respectively. These distorted loci tended to cluster on LG1, LG3, LG4 and LG5. There were only 15 EST-SSR markers mapped due to low polymorphism. By comparison, there were potential synteny, collinear order of some markers and conservation of collinear linkage groups among the maps and with the AA genome but not fully conservative. Conclusion A composite linkage map was constructed from three individual mapping populations with 175 SSR markers in 22 composite linkage groups. This composite genetic linkage map is among the first "true" tetraploid peanut maps produced. This map also consists of 47 SSRs that have been used in the published AA genome maps, and could be used in comparative mapping studies. The primers described in this study are PCR-based markers, which are easy to share for genetic mapping in peanuts. All 1044 primer pairs are provided as additional files and the three RIL populations will be made available to public upon request for quantitative trait loci (QTL) analysis and linkage map improvement. PMID:20105299
Linking maternal and somatic 5S rRNA types with different sequence-specific non-LTR retrotransposons

PubMed Central

Pagano, Johanna F.B.; Ensink, Wim A.; van Olst, Marina; van Leeuwen, Selina; Nehrdich, Ulrike; Zhu, Kongju; Spaink, Herman P.; Girard, Geneviève; Rauwerda, Han; Jonker, Martijs J.; Dekker, Rob J.

2017-01-01

5S rRNA is a ribosomal core component, transcribed from many gene copies organized in genomic repeats. Some eukaryotic species have two 5S rRNA types defined by their predominant expression in oogenesis or adult tissue. Our next-generation sequencing study on zebrafish egg, embryo, and adult tissue identified maternal-type 5S rRNA that is exclusively accumulated during oogenesis, replaced throughout the embryogenesis by a somatic-type, and thus virtually absent in adult somatic tissue. The maternal-type 5S rDNA contains several thousands of gene copies on chromosome 4 in tandem repeats with small intergenic regions, whereas the somatic-type is present in only 12 gene copies on chromosome 18 with large intergenic regions. The nine-nucleotide variation between the two 5S rRNA types likely affects TFIII binding and riboprotein L5 binding, probably leading to storage of maternal-type rRNA. Remarkably, these sequence differences are located exactly at the sequence-specific target site for genome integration by the 5S rRNA-specific Mutsu retrotransposon family. Thus, we could define maternal- and somatic-type MutsuDr subfamilies. Furthermore, we identified four additional maternal-type and two new somatic-type MutsuDr subfamilies, each with their own target sequence. This target-site specificity, frequently intact maternal-type retrotransposon elements, plus specific presence of Mutsu retrotransposon RNA and piRNA in egg and adult tissue, suggest an involvement of retrotransposons in achieving the differential copy number of the two types of 5S rDNA loci. PMID:28003516
Genomic Heat Shock Element Sequences Drive Cooperative Human Heat Shock Factor 1 DNA Binding and Selectivity*

PubMed Central

Jaeger, Alex M.; Makley, Leah N.; Gestwicki, Jason E.; Thiele, Dennis J.

2014-01-01

The heat shock transcription factor 1 (HSF1) activates expression of a variety of genes involved in cell survival, including protein chaperones, the protein degradation machinery, anti-apoptotic proteins, and transcription factors. Although HSF1 activation has been linked to amelioration of neurodegenerative disease, cancer cells exhibit a dependence on HSF1 for survival. Indeed, HSF1 drives a program of gene expression in cancer cells that is distinct from that activated in response to proteotoxic stress, and HSF1 DNA binding activity is elevated in cycling cells as compared with arrested cells. Active HSF1 homotrimerizes and binds to a DNA sequence consisting of inverted repeats of the pentameric sequence nGAAn, known as heat shock elements (HSEs). Recent comprehensive ChIP-seq experiments demonstrated that the architecture of HSEs is very diverse in the human genome, with deviations from the consensus sequence in the spacing, orientation, and extent of HSE repeats that could influence HSF1 DNA binding efficacy and the kinetics and magnitude of target gene expression. To understand the mechanisms that dictate binding specificity, HSF1 was purified as either a monomer or trimer and used to evaluate DNA-binding site preferences in vitro using fluorescence polarization and thermal denaturation profiling. These results were compared with quantitative chromatin immunoprecipitation assays in vivo. We demonstrate a role for specific orientations of extended HSE sequences in driving preferential HSF1 DNA binding to target loci in vivo. These studies provide a biochemical basis for understanding differential HSF1 target gene recognition and transcription in neurodegenerative disease and in cancer. PMID:25204655
Meta-Analysis of DNA Tumor-Viral Integration Site Selection Indicates a Role for Repeats, Gene Expression and Epigenetics

PubMed Central

Doolittle-Hall, Janet M.; Cunningham Glasspoole, Danielle L.; Seaman, William T.; Webster-Cyriaque, Jennifer

2015-01-01

Oncoviruses cause tremendous global cancer burden. For several DNA tumor viruses, human genome integration is consistently associated with cancer development. However, genomic features associated with tumor viral integration are poorly understood. We sought to define genomic determinants for 1897 loci prone to hosting human papillomavirus (HPV), hepatitis B virus (HBV) or Merkel cell polyomavirus (MCPyV). These were compared to HIV, whose enzyme-mediated integration is well understood. A comprehensive catalog of integration sites was constructed from the literature and experimentally-determined HPV integration sites. Features were scored in eight categories (genes, expression, open chromatin, histone modifications, methylation, protein binding, chromatin segmentation and repeats) and compared to random loci. Random forest models determined loci classification and feature selection. HPV and HBV integrants were not fragile site associated. MCPyV preferred integration near sensory perception genes. Unique signatures of integration-associated predictive genomic features were detected. Importantly, repeats, actively-transcribed regions and histone modifications were common tumor viral integration signatures. PMID:26569308
The report of my death was an exaggeration: A review for researchers using microsatellites in the 21st century1

PubMed Central

Hodel, Richard G. J.; Segovia-Salcedo, M. Claudia; Landis, Jacob B.; Crowl, Andrew A.; Sun, Miao; Liu, Xiaoxian; Gitzendanner, Matthew A.; Douglas, Norman A.; Germain-Aubrey, Charlotte C.; Chen, Shichao; Soltis, Douglas E.; Soltis, Pamela S.

2016-01-01

Microsatellites, or simple sequence repeats (SSRs), have long played a major role in genetic studies due to their typically high polymorphism. They have diverse applications, including genome mapping, forensics, ascertaining parentage, population and conservation genetics, identification of the parentage of polyploids, and phylogeography. We compare SSRs and newer methods, such as genotyping by sequencing (GBS) and restriction site associated DNA sequencing (RAD-Seq), and offer recommendations for researchers considering which genetic markers to use. We also review the variety of techniques currently used for identifying microsatellite loci and developing primers, with a particular focus on those that make use of next-generation sequencing (NGS). Additionally, we review software for microsatellite development and report on an experiment to assess the utility of currently available software for SSR development. Finally, we discuss the future of microsatellites and make recommendations for researchers preparing to use microsatellites. We argue that microsatellites still have an important place in the genomic age as they remain effective and cost-efficient markers. PMID:27347456
Generation and analysis of expressed sequence tags from a cDNA library of the fruiting body of Ganoderma lucidum

PubMed Central

2010-01-01

Background Little genomic or trancriptomic information on Ganoderma lucidum (Lingzhi) is known. This study aims to discover the transcripts involved in secondary metabolite biosynthesis and developmental regulation of G. lucidum using an expressed sequence tag (EST) library. Methods A cDNA library was constructed from the G. lucidum fruiting body. Its high-quality ESTs were assembled into unique sequences with contigs and singletons. The unique sequences were annotated according to sequence similarities to genes or proteins available in public databases. The detection of simple sequence repeats (SSRs) was preformed by online analysis. Results A total of 1,023 clones were randomly selected from the G. lucidum library and sequenced, yielding 879 high-quality ESTs. These ESTs showed similarities to a diverse range of genes. The sequences encoding squalene epoxidase (SE) and farnesyl-diphosphate synthase (FPS) were identified in this EST collection. Several candidate genes, such as hydrophobin, MOB2, profilin and PHO84 were detected for the first time in G. lucidum. Thirteen (13) potential SSR-motif microsatellite loci were also identified. Conclusion The present study demonstrates a successful application of EST analysis in the discovery of transcripts involved in the secondary metabolite biosynthesis and the developmental regulation of G. lucidum. PMID:20230644
Neptune: a bioinformatics tool for rapid discovery of genomic variation in bacterial populations

PubMed Central

Marinier, Eric; Zaheer, Rahat; Berry, Chrystal; Weedmark, Kelly A.; Domaratzki, Michael; Mabon, Philip; Knox, Natalie C.; Reimer, Aleisha R.; Graham, Morag R.; Chui, Linda; Patterson-Fortin, Laura; Zhang, Jian; Pagotto, Franco; Farber, Jeff; Mahony, Jim; Seyer, Karine; Bekal, Sadjia; Tremblay, Cécile; Isaac-Renton, Judy; Prystajecky, Natalie; Chen, Jessica; Slade, Peter

2017-01-01

Abstract The ready availability of vast amounts of genomic sequence data has created the need to rethink comparative genomics algorithms using ‘big data’ approaches. Neptune is an efficient system for rapidly locating differentially abundant genomic content in bacterial populations using an exact k-mer matching strategy, while accommodating k-mer mismatches. Neptune’s loci discovery process identifies sequences that are sufficiently common to a group of target sequences and sufficiently absent from non-targets using probabilistic models. Neptune uses parallel computing to efficiently identify and extract these loci from draft genome assemblies without requiring multiple sequence alignments or other computationally expensive comparative sequence analyses. Tests on simulated and real datasets showed that Neptune rapidly identifies regions that are both sensitive and specific. We demonstrate that this system can identify trait-specific loci from different bacterial lineages. Neptune is broadly applicable for comparative bacterial analyses, yet will particularly benefit pathogenomic applications, owing to efficient and sensitive discovery of differentially abundant genomic loci. The software is available for download at: http://github.com/phac-nml/neptune. PMID:29048594
Characterization of Mauritius parakeet (Psittacula eques) microsatellite loci and their cross-utility in other parrots (Psittacidae, Aves).

PubMed

Raisin, Claire; Dawson, Deborah A; Greenwood, Andrew G; Jones, Carl G; Groombridge, Jim J

2009-07-01

We characterized 21 polymorphic microsatellite loci in the endangered Mauritius parakeet (Psittacula eques). Loci were isolated from a Mauritius parakeet genomic library that had been enriched separately for eight different repeat motifs. Loci were characterized in up to 43 putatively unrelated Mauritius parakeets from a single population inhabiting the Black River Gorges National Park, Mauritius. Each locus displayed between three and nine alleles, with the observed heterozygosity ranging between 0.39 and 0.96. All loci were tested in 10 other parrot species. Despite testing few individuals, between seven and 21 loci were polymorphic in each of seven species tested. © 2009 Blackwell Publishing Ltd.
Biased distribution of IS629 among strains in different lineages of enterohemorrhagic Escherichia coli serovar O157.

PubMed

Yokoyama, Eiji; Hashimoto, Ruiko; Etoh, Yoshiki; Ichihara, Sachiko; Horikawa, Kazumi; Uchimura, Masako

2011-01-01

The distribution of insertion sequence (IS) 629 among strains of enterohemorrhagic Escherichia coli serovar O157 (O157) was investigated and compared with the strain lineages defined by lineage specific polymorphism assay-6 (LSPA-6) to demonstrate the effectiveness of IS629 analysis for population genetics analysis. Using pulsed-field gel electrophoresis and variable-number tandem repeat typing, 140 strains producing both VT1 and VT2 and 98 strains producing only VT2 were selected from a total of 592 strains isolated from patients and asymptomatic carriers in Chiba Prefecture, Japan, during 2003-2008. By LSPA-6 analysis, six strains had atypical amplicon sizes in their Z5935 loci and five strains had atypical amplicon sizes in their arp-iclR intergenic regions. Sequence analyses of PCR amplified DNAs showed that five of the six loci used for LSPA-6 analysis had tandem repeats and the allele changes were due to changes in the number of tandem repeats. Subculturing and long-term incubation was found to have no detectable effect on the lineages defined by LSPA-6 analysis, demonstrating the robustness of LSPA-6 analysis. Minimum spanning tree analysis reconstruction revealed that strains in lineage I, I/II, and II clustered on separate branches, indicating that the distribution of IS629 was biased among O157 strains in different lineages. Strains with LSPA-6 codes 231111, 211113, and 211114 had atypical amplicon sizes and were clustered in lineage I/II branch, and strains with LSPA-6 codes 212114, 221123, 221223, 222123, 222224, 242123, 252123, and 242222 had atypical amplicon sizes and clustered in lineage II branches. Linkage disequilibrium was observed in strains in every lineage when the standardized index of association was calculated using IS629 distribution data. Therefore, the distribution analysis of IS629 may be effective for population genetics analysis of O157 due to the biased IS629 distribution among strains in the three O157 lineages. Copyright © 2010 Elsevier B.V. All rights reserved.
Evaluation of two new STR loci 9q2h2 and wg3f12 in a Japanese population.

PubMed

Mizutani, M; Huang, X L; Tamaki, K; Yoshimoto, T; Uchihi, R; Yamamoto, T; Katsumata, Y; Armour, J A

1999-09-01

Two short tandem repeat (STR) loci (9q2h2 and wg3f12) have been evaluated in a Japanese population. Ten and seven different alleles were observed in 9q2h2 and wg3f12 respectively. 9q2h2 displayed simple polymorphism in tetrameric repeat structure; by contrast, wg3f12 contained variable numbers of tetrameric repeats and a 30-bp deletion/insertion polymorphism. No "interalleles" were found. The expected heterozygosities of 9q2h2 and wg3fl2 were 0.749 and 0.574, respectively. No deviation from Hardy-Weinberg equilibrium was found.
Rapid and high resolution genotyping of all Escherichia coli serotypes using 10 genomic repeat-containing loci.

PubMed

Løbersli, Inger; Haugum, Kjersti; Lindstedt, Bjørn-Arne

2012-01-01

Our laboratory has previously published two multiple-locus variable-number tandem-repeats analysis (MLVA) methods for rapid genotyping of Escherichia coli (E. coli), which are now in routine use for surveillance and outbreak detection. The first assay developed was specific for E. coli O157:H7; however this assay was not suitable for genotyping other E. coli serotypes. A new generic MLVA-assay was then developed with the capability of genotyping all E. coli serotypes. This generic E. coli MLVA (GECM7) was based on polymorphism in seven variable number of tandem repeats (VNTR) loci. GECM7 worked well with the majority of E. coli serotypes; however we wanted to increase the resolution for this method based in part of comparison with PFGE typing of E. coli O26:H11, where PFGE appeared to display higher resolution. The GECM7 method was improved by adding three new repeat-loci to a total of ten (GECM10), and a considerable increase in resolution was observed (from 296 to 507 genotypes on the same set of strains). Copyright © 2011 Elsevier B.V. All rights reserved.
Characterization of expressed sequence tag-derived simple sequence repeat markers for Aspergillus flavus: emphasis on variability of isolates from the southern United States.

PubMed

Wang, Xinwang; Wadl, Phillip A; Wood-Jones, Alicia; Windham, Gary; Trigiano, Robert N; Scruggs, Mary; Pilgrim, Candace; Baird, Richard

2012-12-01

Simple sequence repeat (SSR) markers were developed from Aspergillus flavus expressed sequence tag (EST) database to conduct an analysis of genetic relationships of Aspergillus isolates from numerous host species and geographical regions, but primarily from the United States. Twenty-nine primers were designed from 362 tri-nucleotide EST-SSR sequences. Eighteen polymorphic loci were used to genotype 96 Aspergillus species isolates. The number of alleles detected per locus ranged from 2 to 24 with a mean of 8.2 alleles. Haploid diversity ranged from 0.28 to 0.91. Genetic distance matrix was used to perform principal coordinates analysis (PCA) and to generate dendrograms using unweighted pair group method with arithmetic mean (UPGMA). Two principal coordinates explained more than 75 % of the total variation among the isolates. One clade was identified for A. flavus isolates (n = 87) with the other Aspergillus species (n = 7) using PCA, but five distinct clusters were present when the others taxa were excluded from the analysis. Six groups were noted when the EST-SSR data were compared using UPGMA. However, the latter PCA or UPGMA comparison resulted in no direct associations with host species, geographical region or aflatoxin production. Furthermore, there was no direct correlation to visible morphological features such as sclerotial types. The isolates from Mississippi Delta region, which contained the largest percentage of isolates, did not show any unusual clustering except for isolates K32, K55, and 199. Further studies of these three isolates are warranted to evaluate their pathogenicity, aflatoxin production potential, additional gene sequences (e.g., RPB2), and morphological comparisons.
The short interspersed repetitive element of Trypanosoma cruzi, SIRE, is part of VIPER, an unusual retroelement related to long terminal repeat retrotransposons

PubMed Central

Vázquez, Martín; Ben-Dov, Claudia; Lorenzi, Hernan; Moore, Troy; Schijman, Alejandro; Levin, Mariano J.

2000-01-01

The short interspersed repetitive element (SIRE) of Trypanosoma cruzi was first detected when comparing the sequences of loci that encode the TcP2β genes. It is present in about 1,500–3,000 copies per genome, depending on the strain, and it is distributed in all chromosomes. An initial analysis of SIRE sequences from 21 genomic fragments allowed us to derive a consensus nucleotide sequence and structure for the element, consisting of three regions (I, II, and III) each harboring distinctive features. Analysis of 158 transcribed SIREs demonstrates that the consensus is highly conserved. The sequences of 51 cDNAs show that SIRE is included in the 3′ end of several mRNAs, always transcribed from the sense strand, contributing the polyadenylation site in 63% of the cases. This study led to the characterization of VIPER (vestigial interposed retroelement), a 2,326-bp-long unusual retroelement. VIPER's 5′ end is formed by the first 182 bp of SIRE, whereas its 3′ end is formed by the last 220 bp of the element. Both SIRE moieties are connected by a 1,924-bp-long fragment that carries a unique ORF encoding a complete reverse transcriptase-RNase H gene whose 15 C-terminal amino acids derive from codons specified by SIRE's region II. The amino acid sequence of VIPER's reverse transcriptase-RNase H shares significant homology to that of long terminal repeat retrotransposons. The fact that SIRE and VIPER sequences are found only in the T. cruzi genome may be of relevance for studies concerning the evolution and the genome flexibility of this protozoan parasite. PMID:10688909
Target sites for the transposition of rat long interspersed repeated DNA elements (LINEs) are not random.

PubMed Central

Furano, A V; Somerville, C C; Tsichlis, P N; D'Ambrosio, E

1986-01-01

The long interspersed repeated DNA family of rats (LINE or L1Rn family) contains about 40,000 6.7-kilobase (kb) long members (1). LINE members may be currently mobile since their presence or absence causes allelic variation at three single copy loci (2, 3): insulin 1, Moloney leukemia virus integration 2 (Mlvi-2) (4), and immunoglobulin heavy chain (Igh). To characterize target sites for LINE insertion, we compared the DNA sequences of the unoccupied Mlvi-2 target site, its LINE-containing allele, and several other LINE-containing sites. Although not homologous overall, the target sites share three characteristics: First, depending on the site, they are from 68% to 86% (A+T) compared to 58% (A+T) for total rat DNA (5). Depending on the site, a 7- to 15-bp target site sequence becomes duplicated and flanks the inserted LINE member. The second is a version (0 or 1 mismatch) of the hexanucleotide, TACTCA, which is also present in the LINE member, in a highly conserved region located just before the A-rich right end of the LINE member. The third is a stretch of alternating purine/pyrimidine (PQ). The A-rich right ends of different LINE members vary in length and composition, and the sequence of a particularly long one suggests that it contains the A-rich target site from a previous transposition. PMID:3012480
Using long ssDNA polynucleotides to amplify STRs loci in degraded DNA samples

PubMed Central

Pérez Santángelo, Agustín; Corti Bielsa, Rodrigo M.; Sala, Andrea; Ginart, Santiago; Corach, Daniel

2017-01-01

Obtaining informative short tandem repeat (STR) profiles from degraded DNA samples is a challenging task usually undermined by locus or allele dropouts and peak-high imbalances observed in capillary electrophoresis (CE) electropherograms, especially for those markers with large amplicon sizes. We hereby show that the current STR assays may be greatly improved for the detection of genetic markers in degraded DNA samples by using long single stranded DNA polynucleotides (ssDNA polynucleotides) as surrogates for PCR primers. These long primers allow a closer annealing to the repeat sequences, thereby reducing the length of the template required for the amplification in fragmented DNA samples, while at the same time rendering amplicons of larger sizes suitable for multiplex assays. We also demonstrate that the annealing of long ssDNA polynucleotides does not need to be fully complementary in the 5’ region of the primers, thus allowing for the design of practically any long primer sequence for developing new multiplex assays. Furthermore, genotyping of intact DNA samples could also benefit from utilizing long primers since their close annealing to the target STR sequences may overcome wrong profiling generated by insertions/deletions present between the STR region and the annealing site of the primers. Additionally, long ssDNA polynucleotides might be utilized in multiplex PCR assays for other types of degraded or fragmented DNA, e.g. circulating, cell-free DNA (ccfDNA). PMID:29099837

Biology and applications of human minisatellite loci.

PubMed

Armour, J A; Jeffreys, A J

1992-12-01

Highly repetitive minisatellites' include the most variable human loci described to date. They have proved invaluable in a wide variety of genetic analyses, and despite some controversies surrounding their practical implementation, have been extensively adopted in civil and forensic casework. Molecular analysis of internal allelic structure has provided detailed insights into the repeat-unit turnover mechanisms operating in germline mutations, which are ultimately responsible for the extreme variability seen at these loci.
A case of false mother included with 46 autosomal STR markers.

PubMed

Li, Li; Lin, Yuan; Liu, Yan; Zhu, Ruxin; Zhao, Zhenmin; Que, Tingzhi

2015-01-01

For solving a maternity case, 19 autosomal short tandem repeats (STRs) were amplified using the AmpFℓSTR(®) Sinofiler(TM) kit and PowerPlex(®) 16 System. Additional 27 autosomal STR loci were analyzed using two domestic kits AGCU 21+1 and STRtyper-10G. The combined maternity index (CMI) was calculated to be 3.3 × 10(13), but the putative mother denied that she had given birth to the child. In order to reach an accurate conclusion, further testing of 20 X-chromosomal short tandem repeats (X-STRs), 40 single nucleotide polymorphism (SNP) loci, and mitochondrial DNA (mtDNA) was carried out. The putative mother and the boy shared at least one allele at all 46 tested autosomal STR loci. But, according to the profile data of 20 X-STR and 40 SNP markers, different genotypes at 13 X-STR loci and five SNP loci excluded maternity. Mitochondrial profiles also clearly excluded the mother as a parent of the son because they have multiple differences. It was finally found that the putative mother is the sister of the biological father. Different kinds of genetic markers needfully supplement the use of autosomal STR loci in case where the putative parent is suspected to be related to the true parent.
Haplotype data for 23 Y-chromosome markers in a reference sample from Bosnia and Herzegovina.

PubMed

Kovačević, Lejla; Fatur-Cerić, Vera; Hadzic, Negra; Čakar, Jasmina; Primorac, Dragan; Marjanović, Damir

2013-06-01

To detect polymorphisms of 23 Y-chromosomal short tandem repeat (STR) loci, including 6 new loci, in a reference database of male population of Bosnia and Herzegovina, as well as to assess the importance of increasing the number of Y-STR loci utilized in forensic DNA analysis. The reference sample consisted of 100 healthy, unrelated men originating from Bosnia and Herzegovina. Sample collection using buccal swabs was performed in all geographical regions of Bosnia and Herzegovina in the period from 2010 to 2011. DNA samples were typed for 23 Y STR loci, including 6 new loci: DYS576, DYS481, DYS549, DYS533, DYS570, and DYS643, which are included in the new PowerPlex® Y 23 amplification kit. The absolute frequency of generated haplotypes was calculated and results showed that 98 samples had unique Y 23 haplotypes, and that only two samples shared the same haplotype. The most polymorphic locus was DYS418, with 14 detected alleles and the least polymorphic loci were DYS389I, DYS391, DYS437, and DYS393. This study showed that by increasing the number of highly polymorphic Y STR markers, to include those tested in our analysis, leads to a reduction of repeating haplotypes, which is very important in the application of forensic DNA analysis.
Mutation rates for 20 STR loci in a population from São Paulo state, Southeast, Brazil.

PubMed

Martinez, Juliana; Braganholi, Danilo Faustino; Ambrósio, Isabela Brunelli; Polverari, Fernanda Silva; Cicarelli, Regina Maria Barretto

2017-11-01

Short tandem repeats (STRs) are genetic markers largely employed in forensic analysis and paternity investigation cases. When an inconsistency between the parent and child is considered as a possible mutation, the mutation rate should be incorporated into paternity index calculations to give a robust result and to reduce the chance of misinterpretation. The aim of this study was to estimate the mutation rates of 20 autosomal STRs loci used for paternity tests. In these loci we analysed 29,831 parent-child allelic transfers from 929 duo or trio paternity tests carried out during 2012?2016 from São Paulo State, Brazil. We identified 35 mutations in 16 loci, and they were more frequent in the paternal germline compared to the maternal germline. The loci with the highest rate were vWA and FGA and the ones with the lowest rate were PENTA E, PENTA D, D21S11, D7S820 and D6S1043. We did not identified any mutation in D2S1338, TH01, TPOX and D16S539 loci. All mutations consisted of losses or gains of one repeat unit. Mutation rates found in the São Paulo population have peculiarities, which justifies the use of regional databases in laboratories.
Developmental validation of a Cannabis sativa STR multiplex system for forensic analysis.

PubMed

Howard, Christopher; Gilmore, Simon; Robertson, James; Peakall, Rod

2008-09-01

A developmental validation study based on recommendations of the Scientific Working Group on DNA Analysis Methods (SWGDAM) was conducted on a multiplex system of 10 Cannabis sativa short tandem repeat loci. Amplification of the loci in four multiplex reactions was tested across DNA from dried root, stem, and leaf sources, and DNA from fresh, frozen, and dried leaf tissue with a template DNA range of 10.0-0.01 ng. The loci were amplified and scored consistently for all DNA sources when DNA template was in the range of 10.0-1.0 ng. Some allelic dropout and PCR failure occurred in reactions with lower template DNA amounts. Overall, amplification was best using 10.0 ng of template DNA from dried leaf tissue indicating that this is the optimal source material. Cross species amplification was observed in Humulus lupulus for three loci but there was no allelic overlap. This is the first study following SWGDAM validation guidelines to validate short tandem repeat markers for forensic use in plants.
Genetic data and de novo mutation rates in father-son pairs of 23 Y-STR loci in Southern Brazil population.

PubMed

Da Fré, Nicole Nascimento; Rodenbusch, Rodrigo; Gastaldo, André Zoratto; Hanson, Erin; Ballantyne, Jack; Alho, Clarice Sampaio

2015-11-01

We evaluated haplotype and allele frequencies, as well as statistical forensic parameters, for 23 Y-chromosome short tandem repeats (STRs) loci of the PowerPlex®Y23 system (DYS19, DYS385a/b, DYS389I/II, DYS390, DYS391, DYS392, DYS393, DYS437, DYS438, DYS439, DYS448, DYS456, DYS458, DYS635, Y-GATA-H4, DYS481, DYS533, DYS549, DYS570, DYS576, DYS643) in a sample of 150 apparently healthy males, resident in South Brazil. A total of 150 different haplotypes were identified. The highest gene diversity (GD) was observed for the single locus marker DYS570 (GD = 0.7888) and for a two-locus system DYS385 (GD = 0.9009). We also examined 150 father-son pairs by the same system, and a total of 13 mutations were identified in the 3450 father-son allelic transfers, with an overall mutation rate across the 23 loci of 3.768 × 10(-3) (95% CI: 3.542 × 10(-3) to 3.944 × 10(-3)). In all cases there was only one locus mutated with gain/loss of repeats in the son (5 one-repeat gains, and 7 one-repeat and 1 two-repeat losses); we observed no instances of mutations involving a non-integral number of repeats.
Development of chromosome-specific markers with high polymorphism for allotetraploid cotton based on genome-wide characterization of simple sequence repeats in diploid cottons (Gossypium arboreum L. and Gossypium raimondii Ulbrich).

PubMed

Lu, Cairui; Zou, Changsong; Zhang, Youping; Yu, Daoqian; Cheng, Hailiang; Jiang, Pengfei; Yang, Wencui; Wang, Qiaolian; Feng, Xiaoxu; Prosper, Mtawa Andrew; Guo, Xiaoping; Song, Guoli

2015-02-06

Tetraploid cotton contains two sets of homologous chromosomes, the At- and Dt-subgenomes. Consequently, many markers in cotton were mapped to multiple positions during linkage genetic map construction, posing a challenge to anchoring linkage groups and mapping economically-important genes to particular chromosomes. Chromosome-specific markers could solve this problem. Recently, the genomes of two diploid species were sequenced whose progenitors were putative contributors of the At- and Dt-subgenomes to tetraploid cotton. These sequences provide a powerful tool for developing chromosome-specific markers given the high level of synteny among tetraploid and diploid cotton genomes. In this study, simple sequence repeats (SSRs) on each chromosome in the two diploid genomes were characterized. Chromosome-specific SSRs were developed by comparative analysis and proved to distinguish chromosomes. A total of 200,744 and 142,409 SSRs were detected on the 13 chromosomes of Gossypium arboreum L. and Gossypium raimondii Ulbrich, respectively. Chromosome-specific SSRs were obtained by comparing SSR flanking sequences from each chromosome with those from the other 25 chromosomes. The average was 7,996 per chromosome. To confirm their chromosome specificity, these SSRs were used to distinguish two homologous chromosomes in tetraploid cotton through linkage group construction. The chromosome-specific SSRs and previously-reported chromosome markers were grouped together, and no marker mapped to another homologous chromosome, proving that the chromosome-specific SSRs were unique and could distinguish homologous chromosomes in tetraploid cotton. Because longer dinucleotide AT-rich repeats were the most polymorphic in previous reports, the SSRs on each chromosome were sorted by motif type and repeat length for convenient selection. The primer sequences of all chromosome-specific SSRs were also made publicly available. Chromosome-specific SSRs are efficient tools for chromosome identification by anchoring linkage groups to particular chromosomes during genetic mapping and are especially useful in mapping of qualitative-trait genes or quantitative trait loci with just a few markers. The SSRs reported here will facilitate a number of genetic and genomic studies in cotton, including construction of high-density genetic maps, positional gene cloning, fingerprinting, and genetic diversity and comparative evolutionary analyses among Gossypium species.
SMCHD1 regulates a limited set of gene clusters on autosomal chromosomes.

PubMed

Mason, Amanda G; Slieker, Roderick C; Balog, Judit; Lemmers, Richard J L F; Wong, Chao-Jen; Yao, Zizhen; Lim, Jong-Won; Filippova, Galina N; Ne, Enrico; Tawil, Rabi; Heijmans, Bas T; Tapscott, Stephen J; van der Maarel, Silvère M

2017-06-06

Facioscapulohumeral muscular dystrophy (FSHD) is in most cases caused by a contraction of the D4Z4 macrosatellite repeat on chromosome 4 (FSHD1) or by mutations in the SMCHD1 or DNMT3B gene (FSHD2). Both situations result in the incomplete epigenetic repression of the D4Z4-encoded retrogene DUX4 in somatic cells, leading to the aberrant expression of DUX4 in the skeletal muscle. In mice, Smchd1 regulates chromatin repression at different loci, having a role in CpG methylation establishment and/or maintenance. To investigate the global effects of harboring heterozygous SMCHD1 mutations on DNA methylation in humans, we combined 450k methylation analysis on mononuclear monocytes from female heterozygous SMCHD1 mutation carriers and unaffected controls with reduced representation bisulfite sequencing (RRBS) on FSHD2 and control myoblast cell lines. Candidate loci were then evaluated for SMCHD1 binding using ChIP-qPCR and expression was evaluated using RT-qPCR. We identified a limited number of clustered autosomal loci with CpG hypomethylation in SMCHD1 mutation carriers: the protocadherin (PCDH) cluster on chromosome 5, the transfer RNA (tRNA) and 5S rRNA clusters on chromosome 1, the HOXB and HOXD clusters on chromosomes 17 and 2, respectively, and the D4Z4 repeats on chromosomes 4 and 10. Furthermore, minor increases in RNA expression were seen in FSHD2 myoblasts for some of the PCDHβ cluster isoforms, tRNA isoforms, and a HOXB isoform in comparison to controls, in addition to the previously reported effects on DUX4 expression. SMCHD1 was bound at DNAseI hypersensitivity sites known to regulate the PCDHβ cluster and at the chromosome 1 tRNA cluster, with decreased binding in SMCHD1 mutation carriers at the PCDHβ cluster sites. Our study is the first to investigate the global methylation effects in humans resulting from heterozygous mutations in SMCHD1. Our results suggest that SMCHD1 acts as a repressor on a limited set of autosomal gene clusters, as an observed reduction in methylation associates with a loss of SMCHD1 binding and increased expression for some of the loci.
The genome sequence of sweet cherry (Prunus avium) for use in genomics-assisted breeding.

PubMed

Shirasawa, Kenta; Isuzugawa, Kanji; Ikenaga, Mitsunobu; Saito, Yutaro; Yamamoto, Toshiya; Hirakawa, Hideki; Isobe, Sachiko

2017-10-01

We determined the genome sequence of sweet cherry (Prunus avium) using next-generation sequencing technology. The total length of the assembled sequences was 272.4 Mb, consisting of 10,148 scaffold sequences with an N50 length of 219.6 kb. The sequences covered 77.8% of the 352.9 Mb sweet cherry genome, as estimated by k-mer analysis, and included >96.0% of the core eukaryotic genes. We predicted 43,349 complete and partial protein-encoding genes. A high-density consensus map with 2,382 loci was constructed using double-digest restriction site-associated DNA sequencing. Comparing the genetic maps of sweet cherry and peach revealed high synteny between the two genomes; thus the scaffolds were integrated into pseudomolecules using map- and synteny-based strategies. Whole-genome resequencing of six modern cultivars found 1,016,866 SNPs and 162,402 insertions/deletions, out of which 0.7% were deleterious. The sequence variants, as well as simple sequence repeats, can be used as DNA markers. The genomic information helps us to identify agronomically important genes and will accelerate genetic studies and breeding programs for sweet cherries. Further information on the genomic sequences and DNA markers is available in DBcherry (http://cherry.kazusa.or.jp (8 May 2017, date last accessed)). © The Author 2017. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
Microsatellite loci discovery from next-generation sequencing data and loci characterization in the epizoic barnacle Chelonibia testudinaria (Linnaeus, 1758)

PubMed Central

Zardus, John D.; Wares, John P.

2016-01-01

Microsatellite markers remain an important tool for ecological and evolutionary research, but are unavailable for many non-model organisms. One such organism with rare ecological and evolutionary features is the epizoic barnacle Chelonibia testudinaria (Linnaeus, 1758). Chelonibia testudinaria appears to be a host generalist, and has an unusual sexual system, androdioecy. Genetic studies on host specificity and mating behavior are impeded by the lack of fine-scale, highly variable markers, such as microsatellite markers. In the present study, we discovered thousands of new microsatellite loci from next-generation sequencing data, and characterized 12 loci thoroughly. We conclude that 11 of these loci will be useful markers in future ecological and evolutionary studies on C. testudinaria. PMID:27231653
Construction of a High-Density American Cranberry (Vaccinium macrocarpon Ait.) Composite Map Using Genotyping-by-Sequencing for Multi-pedigree Linkage Mapping

PubMed Central

Schlautman, Brandon; Covarrubias-Pazaran, Giovanny; Diaz-Garcia, Luis; Iorizzo, Massimo; Polashock, James; Grygleski, Edward; Vorsa, Nicholi; Zalapa, Juan

2017-01-01

The American cranberry (Vaccinium macrocarpon Ait.) is a recently domesticated, economically important, fruit crop with limited molecular resources. New genetic resources could accelerate genetic gain in cranberry through characterization of its genomic structure and by enabling molecular-assisted breeding strategies. To increase the availability of cranberry genomic resources, genotyping-by-sequencing (GBS) was used to discover and genotype thousands of single nucleotide polymorphisms (SNPs) within three interrelated cranberry full-sib populations. Additional simple sequence repeat (SSR) loci were added to the SNP datasets and used to construct bin maps for the parents of the populations, which were then merged to create the first high-density cranberry composite map containing 6073 markers (5437 SNPs and 636 SSRs) on 12 linkage groups (LGs) spanning 1124 cM. Interestingly, higher rates of recombination were observed in maternal than paternal gametes. The large number of markers in common (mean of 57.3) and the high degree of observed collinearity (mean Pair-wise Spearman rank correlations >0.99) between the LGs of the parental maps demonstrates the utility of GBS in cranberry for identifying polymorphic SNP loci that are transferable between pedigrees and populations in future trait-association studies. Furthermore, the high-density of markers anchored within the component maps allowed identification of segregation distortion regions, placement of centromeres on each of the 12 LGs, and anchoring of genomic scaffolds. Collectively, the results represent an important contribution to the current understanding of cranberry genomic structure and to the availability of molecular tools for future genetic research and breeding efforts in cranberry. PMID:28250016
Two Functional Copies of the DGCR6 Gene Are Present on Human Chromosome 22q11 Due to a Duplication of an Ancestral Locus

PubMed Central

Edelmann, Lisa; Stankiewicz, Pavel; Spiteri, Elizabeth; Pandita, Raj K.; Shaffer, Lisa; Lupski, James; Morrow, Bernice E.

2001-01-01

The DGCR6 (DiGeorge critical region) gene encodes a putative protein with sequence similarity to gonadal (gdl), a Drosophila melanogaster gene of unknown function. We mapped the DGCR6 gene to chromosome 22q11 within a low copy repeat, termed sc11.1a, and identified a second copy of the gene, DGCR6L, within the duplicate locus, termed sc11.1b. Both sc11.1 repeats are deleted in most persons with velo-cardio-facial syndrome/DiGeorge syndrome (VCFS/DGS), and they map immediately adjacent and internal to the low copy repeats, termed LCR22, that mediate the deletions associated with VCFS/DGS. We sequenced genomic clones from both loci and determined that the putative initiator methionine is located further upstream than originally described, but in a position similar to the mouse and chicken orthologs. DGCR6L encodes a highly homologous, functional copy of DGCR6, with some base changes rendering amino acid differences. Expression studies of the two genes indicate that both genes are widely expressed in fetal and adult tissues. Evolutionary studies using FISH mapping in several different species of ape combined with sequence analysis of DGCR6 in a number of different primate species indicate that the duplication is at least 12 million years old and may date back to before the divergence of Catarrhines from Platyrrhines, 35 mya. These data suggest that there has been selective evolutionary pressure toward the functional maintenance of both paralogs. Interestingly, a full-length HERV-K provirus integrated into the sc11.1a locus after the divergence of chimpanzees and humans. PMID:11157784
[Standard algorithm of molecular typing of Yersinia pestis strains].

PubMed

Eroshenko, G A; Odinokov, G N; Kukleva, L M; Pavlova, A I; Krasnov, Ia M; Shavina, N Iu; Guseva, N P; Vinogradova, N A; Kutyrev, V V

2012-01-01

Development of the standard algorithm of molecular typing of Yersinia pestis that ensures establishing of subspecies, biovar and focus membership of the studied isolate. Determination of the characteristic strain genotypes of plague infectious agent of main and nonmain subspecies from various natural foci of plague of the Russian Federation and the near abroad. Genotyping of 192 natural Y. pestis strains of main and nonmain subspecies was performed by using PCR methods, multilocus sequencing and multilocus analysis of variable tandem repeat number. A standard algorithm of molecular typing of plague infectious agent including several stages of Yersinia pestis differentiation by membership: in main and nonmain subspecies, various biovars of the main subspecies, specific subspecies; natural foci and geographic territories was developed. The algorithm is based on 3 typing methods--PCR, multilocus sequence typing and multilocus analysis of variable tandem repeat number using standard DNA targets--life support genes (terC, ilvN, inv, glpD, napA, rhaS and araC) and 7 loci of variable tandem repeats (ms01, ms04, ms06, ms07, ms46, ms62, ms70). The effectiveness of the developed algorithm is shown on the large number of natural Y. pestis strains. Characteristic sequence types of Y. pestis strains of various subspecies and biovars as well as MLVA7 genotypes of strains from natural foci of plague of the Russian Federation and the near abroad were established. The application of the developed algorithm will increase the effectiveness of epidemiologic monitoring of plague infectious agent, and analysis of epidemics and outbreaks of plague with establishing the source of origin of the strain and routes of introduction of the infection.
Sequence-Based Typing of Legionella pneumophila Serogroup 1 Offers the Potential for True Portability in Legionellosis Outbreak Investigation

PubMed Central

Gaia, Valeria; Fry, Norman K.; Harrison, Timothy G.; Peduzzi, Raffaele

2003-01-01

Seven gene loci of Legionella pneumophila serogroup 1 were analyzed as potential epidemiological typing markers to aid in the investigation of legionella outbreaks. The genes chosen included four likely to be selectively neutral (acn, groES, groEL, and recA) and three likely to be under selective pressure (flaA, mompS, and proA). Oligonucleotide primers were designed to amplify 279- to 763-bp fragments from each gene. Initial sequence analysis of the seven loci from 10 well-characterized isolates of L. pneumophila serogroup 1 gave excellent reproducibility (R) and epidemiological concordance (E) values (R = 1.00; E = 1.00). The three loci showing greatest discrimination and nucleotide variation, flaA, mompS, and proA, were chosen for further study. Indices of discrimination (D) were calculated using a panel of 79 unrelated isolates. Single loci gave D values ranging from 0.767 to 0.857, and a combination of all three loci resulted in a D value of 0.924. When all three loci were combined with monoclonal antibody subgrouping, the D value was 0.971. Sequence-based typing of L. pneumophila serogroup 1 using only three loci is epidemiologically concordant and highly discriminatory and has the potential to become the new “gold standard” for the epidemiological typing of L. pneumophila. PMID:12843023
A 48 SNP set for grapevine cultivar identification

PubMed Central

2011-01-01

Background Rapid and consistent genotyping is an important requirement for cultivar identification in many crop species. Among them grapevine cultivars have been the subject of multiple studies given the large number of synonyms and homonyms generated during many centuries of vegetative multiplication and exchange. Simple sequence repeat (SSR) markers have been preferred until now because of their high level of polymorphism, their codominant nature and their high profile repeatability. However, the rapid application of partial or complete genome sequencing approaches is identifying thousands of single nucleotide polymorphisms (SNP) that can be very useful for such purposes. Although SNP markers are bi-allelic, and therefore not as polymorphic as microsatellites, the high number of loci that can be multiplexed and the possibilities of automation as well as their highly repeatable results under any analytical procedure make them the future markers of choice for any type of genetic identification. Results We analyzed over 300 SNP in the genome of grapevine using a re-sequencing strategy in a selection of 11 genotypes. Among the identified polymorphisms, we selected 48 SNP spread across all grapevine chromosomes with allele frequencies balanced enough as to provide sufficient information content for genetic identification in grapevine allowing for good genotyping success rate. Marker stability was tested in repeated analyses of a selected group of cultivars obtained worldwide to demonstrate their usefulness in genetic identification. Conclusions We have selected a set of 48 stable SNP markers with a high discrimination power and a uniform genome distribution (2-3 markers/chromosome), which is proposed as a standard set for grapevine (Vitis vinifera L.) genotyping. Any previous problems derived from microsatellite allele confusion between labs or the need to run reference cultivars to identify allele sizes disappear using this type of marker. Furthermore, because SNP markers are bi-allelic, allele identification and genotype naming are extremely simple and genotypes obtained with different equipments and by different laboratories are always fully comparable. PMID:22060012
Development of a High-Resolution Multi-Locus Microsatellite Typing Method for Colletotrichum gloeosporioides.

PubMed

Mehta, Nikita; Hagen, Ferry; Aamir, Sadaf; Singh, Sanjay K; Baghela, Abhishek

2017-12-01

Colletotrichum gloeosporioides is an economically important fungal pathogen causing substantial yield losses indifferent host plants. To understand the genetic diversity and molecular epidemiology of this fungus, we have developed a novel, high-resolution multi-locus microsatellite typing (MLMT) method. Bioinformatic analysis of C. gloeosporioides unannotated genome sequence yielded eight potential microsatellite loci, of which five, CG1 (GT) n , CG2 (GT1) n , CG3 (TC) n , CG4 (CT) n , and CG5 (CT1) n were selected for further study based on their universal amplification potential, reproducibility, and repeat number polymorphism. The selected microsatellites were used to analyze 31 strains of C. gloeosporioides isolated from 20 different host plants from India. All microsatellite loci were found to be polymorphic, and the approximate fragment sizes of microsatellite loci CG1, CG2, CG3, CG4, and CG5 were in ranges of 213-241, 197-227, 231-265, 209-275, and 132-188, respectively. Among the 31 isolates, 55 different genotypes were identified. The Simpson's index of diversity (D) values for the individual locus ranged from 0.79 to 0.92, with the D value of all combined five microsatellite loci being 0.99. Microsatellite data analysis revealed that isolates from Ocimum sanctum , Capsicum annuum (chili pepper), and Mangifera indica (mango) formed distinct clusters, therefore exhibited some level of correlation between certain genotypes and host. The developed MLMT method would be a powerful tool for studying the genetic diversity and any possible genotype-host correlation in C. gloeosporioides .
Construction of a high-density genetic map by specific locus amplified fragment sequencing (SLAF-seq) and its application to Quantitative Trait Loci (QTL) analysis for boll weight in upland cotton (Gossypium hirsutum.).

PubMed

Zhang, Zhen; Shang, Haihong; Shi, Yuzhen; Huang, Long; Li, Junwen; Ge, Qun; Gong, Juwu; Liu, Aiying; Chen, Tingting; Wang, Dan; Wang, Yanling; Palanga, Koffi Kibalou; Muhammad, Jamshed; Li, Weijie; Lu, Quanwei; Deng, Xiaoying; Tan, Yunna; Song, Weiwu; Cai, Juan; Li, Pengtao; Rashid, Harun or; Gong, Wankui; Yuan, Youlu

2016-04-11

Upland Cotton (Gossypium hirsutum) is one of the most important worldwide crops it provides natural high-quality fiber for the industrial production and everyday use. Next-generation sequencing is a powerful method to identify single nucleotide polymorphism markers on a large scale for the construction of a high-density genetic map for quantitative trait loci mapping. In this research, a recombinant inbred lines population developed from two upland cotton cultivars 0-153 and sGK9708 was used to construct a high-density genetic map through the specific locus amplified fragment sequencing method. The high-density genetic map harbored 5521 single nucleotide polymorphism markers which covered a total distance of 3259.37 cM with an average marker interval of 0.78 cM without gaps larger than 10 cM. In total 18 quantitative trait loci of boll weight were identified as stable quantitative trait loci and were detected in at least three out of 11 environments and explained 4.15-16.70 % of the observed phenotypic variation. In total, 344 candidate genes were identified within the confidence intervals of these stable quantitative trait loci based on the cotton genome sequence. These genes were categorized based on their function through gene ontology analysis, Kyoto Encyclopedia of Genes and Genomes analysis and eukaryotic orthologous groups analysis. This research reported the first high-density genetic map for Upland Cotton (Gossypium hirsutum) with a recombinant inbred line population using single nucleotide polymorphism markers developed by specific locus amplified fragment sequencing. We also identified quantitative trait loci of boll weight across 11 environments and identified candidate genes within the quantitative trait loci confidence intervals. The results of this research would provide useful information for the next-step work including fine mapping, gene functional analysis, pyramiding breeding of functional genes as well as marker-assisted selection.
CoLIde: a bioinformatics tool for CO-expression-based small RNA Loci Identification using high-throughput sequencing data.

PubMed

Mohorianu, Irina; Stocks, Matthew Benedict; Wood, John; Dalmay, Tamas; Moulton, Vincent

2013-07-01

Small RNAs (sRNAs) are 20-25 nt non-coding RNAs that act as guides for the highly sequence-specific regulatory mechanism known as RNA silencing. Due to the recent increase in sequencing depth, a highly complex and diverse population of sRNAs in both plants and animals has been revealed. However, the exponential increase in sequencing data has also made the identification of individual sRNA transcripts corresponding to biological units (sRNA loci) more challenging when based exclusively on the genomic location of the constituent sRNAs, hindering existing approaches to identify sRNA loci. To infer the location of significant biological units, we propose an approach for sRNA loci detection called CoLIde (Co-expression based sRNA Loci Identification) that combines genomic location with the analysis of other information such as variation in expression levels (expression pattern) and size class distribution. For CoLIde, we define a locus as a union of regions sharing the same pattern and located in close proximity on the genome. Biological relevance, detected through the analysis of size class distribution, is also calculated for each locus. CoLIde can be applied on ordered (e.g., time-dependent) or un-ordered (e.g., organ, mutant) series of samples both with or without biological/technical replicates. The method reliably identifies known types of loci and shows improved performance on sequencing data from both plants (e.g., A. thaliana, S. lycopersicum) and animals (e.g., D. melanogaster) when compared with existing locus detection techniques. CoLIde is available for use within the UEA Small RNA Workbench which can be downloaded from: http://srna-workbench.cmp.uea.ac.uk.
Signature of genetic associations in oral cancer.

PubMed

Sharma, Vishwas; Nandan, Amrita; Sharma, Amitesh Kumar; Singh, Harpreet; Bharadwaj, Mausumi; Sinha, Dhirendra Narain; Mehrotra, Ravi

2017-10-01

Oral cancer etiology is complex and controlled by multi-factorial events including genetic events. Candidate gene studies, genome-wide association studies, and next-generation sequencing identified various chromosomal loci to be associated with oral cancer. There is no available review that could give us the comprehensive picture of genetic loci identified to be associated with oral cancer by candidate gene studies-based, genome-wide association studies-based, and next-generation sequencing-based approaches. A systematic literature search was performed in the PubMed database to identify the loci associated with oral cancer by exclusive candidate gene studies-based, genome-wide association studies-based, and next-generation sequencing-based study approaches. The information of loci associated with oral cancer is made online through the resource "ORNATE." Next, screening of the loci validated by candidate gene studies and next-generation sequencing approach or by two independent studies within candidate gene studies or next-generation sequencing approaches were performed. A total of 264 loci were identified to be associated with oral cancer by candidate gene studies, genome-wide association studies, and next-generation sequencing approaches. In total, 28 loci, that is, 14q32.33 (AKT1), 5q22.2 (APC), 11q22.3 (ATM), 2q33.1 (CASP8), 11q13.3 (CCND1), 16q22.1 (CDH1), 9p21.3 (CDKN2A), 1q31.1 (COX-2), 7p11.2 (EGFR), 22q13.2 (EP300), 4q35.2 (FAT1), 4q31.3 (FBXW7), 4p16.3 (FGFR3), 1p13.3 (GSTM1-GSTT1), 11q13.2 (GSTP1), 11p15.5 (H-RAS), 3p25.3 (hOGG1), 1q32.1 (IL-10), 4q13.3 (IL-8), 12p12.1 (KRAS), 12q15 (MDM2), 12q13.12 (MLL2), 9q34.3 (NOTCH1), 17p13.1 (p53), 3q26.32 (PIK3CA), 10q23.31 (PTEN), 13q14.2 (RB1), and 5q14.2 (XRCC4), were validated to be associated with oral cancer. "ORNATE" gives a snapshot of genetic loci associated with oral cancer. All 28 loci were validated to be linked to oral cancer for which further fine-mapping followed by gene-by-gene and gene-environment interaction studies is needed to confirm their involvement in modifying oral cancer.
[DNA marker-assisted selection of medicinal plants (Ⅰ) .Breeding research of disease-resistant cultivars of Panax notoginseng].

PubMed

Li, Qing; Li, Biao; Guo, Shun-Xing

2017-01-01

SSR is one of the most important molecular markers used in molecular identification and genetic diversity research of Dendrobium nobile. In order to enrich the library of SSR and establish a method for rapid identification of D. nobile, the SSR information was analyzed in the transcriptome of D. nobile. A total of 32 709 SSRs were obtained from the transcriptome of D. nobile, distributed in 26 742 unigenes with the distribution frequency of 12.90%. SSR loci occurred every 3 748 bp. Mono-nucleotide repeat was the main type, account for as much as 72.18% of all SSRs, followed by di-nucleotide (15.97%) and tri-nucleotide (11.19%). Among all repeat types, A/T was the predominant one followed by AG/CT. Finally a total of 62 157 primer pairs were designed for marker development. Randomly 20 pairs of primers were selected for PCR amplification, 17 amplified on clear and reproducible bands, the amplification rate was 85.0%.Thirteen pairs were polymorphic among the 3 Dendrobium plants. The results indicated that the unigenes generated from transcriptome sequencing in D. nobile can be used as effective source to develop SSR markers. The SSR loci in the transcriptome of D. nobile have the characteristics of type riches, high density and high potential of polymorphism, and these characteristics might applied in the study of molecular identification, genetic diversity and marker-assisted breeding of D. nobile and its closely related species. Copyright© by the Chinese Pharmaceutical Association.

Identification of associated SSR markers for yield component and fiber quality traits based on frame map and Upland cotton collections.

PubMed

Qin, Hongde; Chen, Min; Yi, Xianda; Bie, Shu; Zhang, Cheng; Zhang, Youchang; Lan, Jiayang; Meng, Yanyan; Yuan, Youlu; Jiao, Chunhai

2015-01-01

Detecting QTLs (quantitative trait loci) that enhance cotton yield and fiber quality traits and accelerate breeding has been the focus of many cotton breeders. In the present study, 359 SSR (simple sequence repeat) markers were used for the association mapping of 241 Upland cotton collections. A total of 333 markers, representing 733 polymorphic loci, were detected. The average linkage disequilibrium (LD) decay distances were 8.58 cM (r2 > 0.1) and 5.76 cM (r2 > 0.2). 241 collections were arranged into two subgroups using STRUCTURE software. Mixed linear modeling (MLM) methods (with population structure (Q) and relative kinship matrix (K)) were applied to analyze four phenotypic datasets obtained from four environments (two different locations and two years). Forty-six markers associated with the number of bolls per plant (NB), boll weight (BW), lint percentage (LP), fiber length (FL), fiber strength (FS) and fiber micornaire value (FM) were repeatedly detected in at least two environments. Of 46 associated markers, 32 were identified as new association markers, and 14 had been previously reported in the literature. Nine association markers were near QTLs (at a distance of less than 1-2 LD decay on the reference map) that had been previously described. These results provide new useful markers for marker-assisted selection in breeding programs and new insights for understanding the genetic basis of Upland cotton yields and fiber quality traits at the whole-genome level.
Identification and characterization of pleiotropic and co-located resistance loci to leaf rust and stripe rust in bread wheat cultivar Sujata.

PubMed

Lan, Caixia; Zhang, Yelun; Herrera-Foessel, Sybil A; Basnet, Bhoja R; Huerta-Espino, Julio; Lagudah, Evans S; Singh, Ravi P

2015-03-01

Two new co-located resistance loci, QLr.cim - 1AS/QYr.cim - 1AS and QLr.cim - 7BL/YrSuj , in combination with Lr46 / Yr29 and Lr67/Yr46 , and a new leaf rust resistance quantitative trait loci, conferred high resistance to rusts in adult plant stage. The tall Indian bread wheat cultivar Sujata displays high and low infection types to leaf rust and stripe rust, respectively, at the seedling stage in greenhouse tests. It was also highly resistant to both rusts at adult plant stage in field trials in Mexico. The genetic basis of this resistance was investigated in a population of 148 F5 recombinant inbred lines (RILs) derived from the cross Avocet × Sujata. The parents and RIL population were characterized in field trials for resistance to leaf rust during 2011 at El Batán, and 2012 and 2013 at Ciudad Obregón, Mexico, and for stripe rust during 2011 and 2012 at Toluca, Mexico; they were also characterized three times for stripe rust at seedling stage in the greenhouse. The RILs were genotyped with diversity arrays technology and simple sequence repeat markers. The final genetic map was constructed with 673 polymorphic markers. Inclusive composite interval mapping analysis detected two new significant co-located resistance loci, QLr.cim-1AS/QYr.cim-1AS and QLr.cim-7BL/YrSuj, on chromosomes 1AS and 7BL, respectively. The chromosomal position of QLr.cim-7BL overlapped with the seedling stripe rust resistance gene, temporarily designated as YrSuj. Two previously reported pleiotropic adult plant resistance genes, Lr46/Yr29 and Lr67/Yr46, and a new leaf rust resistance quantitative trait loci derived from Avocet were also mapped in the population. The two new co-located resistance loci are expected to contribute to breeding durable rust resistance in wheat. Closely linked molecular markers can be used to transfer all four resistance loci simultaneously to modern wheat varieties.
Development and characterization of genomic SSR markers in Cynodon transvaalensis Burtt-Davy.

PubMed

Tan, Chengcheng; Wu, Yanqi; Taliaferro, Charles M; Bell, Greg E; Martin, Dennis L; Smith, Mike W

2014-08-01

Simple sequence repeat (SSR) markers are a major molecular tool for genetic and genomic research that have been extensively developed and used in major crops. However, few are available in African bermudagrass (Cynodon transvaalensis Burtt-Davy), an economically important warm-season turfgrass species. African bermudagrass is mainly used for hybridizations with common bermudagrass [C. dactylon var. dactylon (L.) Pers.] in the development of superior interspecific hybrid turfgrass cultivars. Accordingly, the major objective of this study was to develop and characterize a large set of SSR markers. Genomic DNA of C. transvaalensis '4200TN 24-2' from an Oklahoma State University (OSU) turf nursery was extracted for construction of four SSR genomic libraries enriched with [CA](n), [GA](n), [AAG](n), and [AAT](n) as core repeat motifs. A total of 3,064 clones were sequenced at the OSU core facility. The sequences were categorized into singletons and contiguous sequences to exclude redundancy. From the two sequence categories, 1,795 SSR loci were identified. After excluding duplicate SSRs by comparison with previously developed SSR markers using a nucleotide basic local alignment tool, 1,426 unique primer pairs (PPs) were designed. Out of the 1,426 designed PPs, 981 (68.8 %) amplified alleles of the expected size in the donor DNA. Polymorphisms of the SSR PPs tested in eight C. transvaalensis plants were 93 % polymorphic with 544 markers effective in all genotypes. Inheritance of the SSRs was examined in six F(1) progeny of African parents 'T577' × 'Uganda', indicating 917 markers amplified heritable alleles. The SSR markers developed in the study are the first large set of co-dominant markers in African bermudagrass and should be highly valuable for molecular and traditional breeding research.
Linking maternal and somatic 5S rRNA types with different sequence-specific non-LTR retrotransposons.

PubMed

Locati, Mauro D; Pagano, Johanna F B; Ensink, Wim A; van Olst, Marina; van Leeuwen, Selina; Nehrdich, Ulrike; Zhu, Kongju; Spaink, Herman P; Girard, Geneviève; Rauwerda, Han; Jonker, Martijs J; Dekker, Rob J; Breit, Timo M

2017-04-01

5S rRNA is a ribosomal core component, transcribed from many gene copies organized in genomic repeats. Some eukaryotic species have two 5S rRNA types defined by their predominant expression in oogenesis or adult tissue. Our next-generation sequencing study on zebrafish egg, embryo, and adult tissue identified maternal-type 5S rRNA that is exclusively accumulated during oogenesis, replaced throughout the embryogenesis by a somatic-type, and thus virtually absent in adult somatic tissue. The maternal-type 5S rDNA contains several thousands of gene copies on chromosome 4 in tandem repeats with small intergenic regions, whereas the somatic-type is present in only 12 gene copies on chromosome 18 with large intergenic regions. The nine-nucleotide variation between the two 5S rRNA types likely affects TFIII binding and riboprotein L5 binding, probably leading to storage of maternal-type rRNA. Remarkably, these sequence differences are located exactly at the sequence-specific target site for genome integration by the 5S rRNA-specific Mutsu retrotransposon family. Thus, we could define maternal- and somatic-type MutsuDr subfamilies. Furthermore, we identified four additional maternal-type and two new somatic-type MutsuDr subfamilies, each with their own target sequence. This target-site specificity, frequently intact maternal-type retrotransposon elements, plus specific presence of Mutsu retrotransposon RNA and piRNA in egg and adult tissue, suggest an involvement of retrotransposons in achieving the differential copy number of the two types of 5S rDNA loci. © 2017 Locati et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.
Report on the development of putative functional SSR and SNP markers in passion fruits.

PubMed

da Costa, Zirlane Portugal; Munhoz, Carla de Freitas; Vieira, Maria Lucia Carneiro

2017-09-06

Passionflowers Passiflora edulis and Passiflora alata are diploid, outcrossing and understudied fruit bearing species. In Brazil, passion fruit cultivation began relatively recently and has earned the country an outstanding position as the world's top producer of passion fruit. The fruit's main economic value lies in the production of juice, an essential exotic ingredient in juice blends. Currently, crop improvement strategies, including those for underexploited tropical species, tend to incorporate molecular genetic approaches. In this study, we examined a set of P. edulis transcripts expressed in response to infection by Xanthomonas axonopodis, (the passion fruit's main bacterial pathogen that attacks the vines), aiming at the development of putative functional markers, i.e. SSRs (simple sequence repeats) and SNPs (single nucleotide polymorphisms). A total of 210 microsatellites were found in 998 sequences, and trinucleotide repeats were found to be the most frequent (31.4%). Of the sequences selected for designing primers, 80.9% could be used to develop SSR markers, and 60.6% SNP markers for P. alata. SNPs were all biallelic and found within 15 gene fragments of P. alata. Overall, gene fragments generated 10,003 bp. SNP frequency was estimated as one SNP every 294 bp. Polymorphism rates revealed by SSR and SNP loci were 29.4 and 53.6%, respectively. Passiflora edulis transcripts were useful for the development of putative functional markers for P. alata, suggesting a certain level of sequence conservation between these cultivated species. The markers developed herein could be used for genetic mapping purposes and also in diversity studies.
Characterization of 32 microsatellite loci for the Pacific red snapper, Lutjanus peru, through next generation sequencing.

PubMed

Paz-García, David A; Munguía-Vega, Adrián; Plomozo-Lugo, Tomas; Weaver, Amy Hudson

2017-04-01

We developed a set of hypervariable microsatellite markers for the Pacific red snapper (Lutjanus peru), an economically important marine fish for small-scale fisheries in the west coast of Mexico. We performed shotgun genome sequencing with the 454 XL titanium chemistry and used bioinformatic tools to search for perfect microsatellite loci. We selected 66 primer pairs that were synthesized and genotyped in an ABI PRISM 3730XL DNA sequencer in 32 individuals from the Gulf of California. We estimated levels of genetic diversity, deviations from linkage and Hardy-Weinberg equilibrium, estimated the frequency of null alleles and the probability of individual identity for the new markers. We reanalyzed 16 loci in 16 individuals to estimate genotyping error rates. Eighteen loci failed to amplify, 16 loci were discarded due to unspecific amplifications and 32 loci (14 tetranucleotide and 18 dinucleotide) were successfully scored. The average number of alleles per locus was 21 (±6.87, SD) and ranged from 8 to 34. The average observed and expected heterozygosities were 0.787 (±0.144 SD, range 0.250-0.935) and 0.909 (±0.122 SD, range 0.381-0.965), respectively. No significant linkage was detected. Eight loci showed deviations from Hardy-Weinberg equilibrium, and from these, four loci showed moderate null allele frequencies (0.104-0.220). The probability of individual identity for the new loci was 1.46 -62 . Genotyping error rates averaged 9.58%. The new markers will be useful to investigate patterns of larval dispersal, metapopulation dynamics, fine-scale genetic structure and diversity aimed to inform the implementation of spatially explicit fisheries management strategies in the Gulf of California.
Rapid development of microsatellite markers with 454 pyrosequencing in a vulnerable fish, the mottled skate, Raja pulchra.

PubMed

Kang, Jung-Ha; Park, Jung-Youn; Jo, Hyun-Su

2012-01-01

The mottled skate, Raja pulchra, is an economically valuable fish. However, due to a severe population decline, it is listed as a vulnerable species by the International Union for Conservation of Nature. To analyze its genetic structure and diversity, microsatellite markers were developed using 454 pyrosequencing. A total of 17,033 reads containing dinucleotide microsatellite repeat units (mean, 487 base pairs) were identified from 453,549 reads. Among 32 loci containing more than nine repeat units, 20 primer sets (62%) produced strong PCR products, of which 14 were polymorphic. In an analysis of 60 individuals from two R. pulchra populations, the number of alleles per locus ranged from 1-10, and the mean allelic richness was 4.7. No linkage disequilibrium was found between any pair of loci, indicating that the markers were independent. The Hardy-Weinberg equilibrium test showed significant deviation in two of the 28 single-loci after sequential Bonferroni's correction. Using 11 primer sets, cross-species amplification was demonstrated in nine related species from four families within two classes. Among the 11 loci amplified from three other Rajidae family species; three loci were polymorphic. A monomorphic locus was amplified in all three Rajidae family species and the Dasyatidae family. Two Rajidae polymorphic loci amplified monomorphic target DNAs in four species belonging to the Carcharhiniformes class, and another was polymorphic in two Carcharhiniformes species.
Haplotype data for 23 Y-chromosome markers in a reference sample from Bosnia and Herzegovina

PubMed Central

Kovačević, Lejla; Fatur-Cerić, Vera; Hadžić, Negra; Čakar, Jasmina; Primorac, Dragan; Marjanović, Damir

2013-01-01

Aim To detect polymorphisms of 23 Y-chromosomal short tandem repeat (STR) loci, including 6 new loci, in a reference database of male population of Bosnia and Herzegovina, as well as to assess the importance of increasing the number of Y-STR loci utilized in forensic DNA analysis. Methods The reference sample consisted of 100 healthy, unrelated men originating from Bosnia and Herzegovina. Sample collection using buccal swabs was performed in all geographical regions of Bosnia and Herzegovina in the period from 2010 to 2011. DNA samples were typed for 23 Y STR loci, including 6 new loci: DYS576, DYS481, DYS549, DYS533, DYS570, and DYS643, which are included in the new PowerPlex® Y 23 amplification kit. Results The absolute frequency of generated haplotypes was calculated and results showed that 98 samples had unique Y 23 haplotypes, and that only two samples shared the same haplotype. The most polymorphic locus was DYS418, with 14 detected alleles and the least polymorphic loci were DYS389I, DYS391, DYS437, and DYS393. Conclusion This study showed that by increasing the number of highly polymorphic Y STR markers, to include those tested in our analysis, leads to a reduction of repeating haplotypes, which is very important in the application of forensic DNA analysis. PMID:23771760
Rapid Development of Microsatellite Markers with 454 Pyrosequencing in a Vulnerable Fish, the Mottled Skate, Raja pulchra

PubMed Central

Kang, Jung-Ha; Park, Jung-Youn; Jo, Hyun-Su

2012-01-01

The mottled skate, Raja pulchra, is an economically valuable fish. However, due to a severe population decline, it is listed as a vulnerable species by the International Union for Conservation of Nature. To analyze its genetic structure and diversity, microsatellite markers were developed using 454 pyrosequencing. A total of 17,033 reads containing dinucleotide microsatellite repeat units (mean, 487 base pairs) were identified from 453,549 reads. Among 32 loci containing more than nine repeat units, 20 primer sets (62%) produced strong PCR products, of which 14 were polymorphic. In an analysis of 60 individuals from two R. pulchra populations, the number of alleles per locus ranged from 1–10, and the mean allelic richness was 4.7. No linkage disequilibrium was found between any pair of loci, indicating that the markers were independent. The Hardy–Weinberg equilibrium test showed significant deviation in two of the 28 single-loci after sequential Bonferroni’s correction. Using 11 primer sets, cross-species amplification was demonstrated in nine related species from four families within two classes. Among the 11 loci amplified from three other Rajidae family species; three loci were polymorphic. A monomorphic locus was amplified in all three Rajidae family species and the Dasyatidae family. Two Rajidae polymorphic loci amplified monomorphic target DNAs in four species belonging to the Carcharhiniformes class, and another was polymorphic in two Carcharhiniformes species. PMID:22837688
Identification of Molecular Markers Associated with Verticillium Wilt Resistance in Alfalfa (Medicago Sativa L.) Using High-Resolution Melting

PubMed Central

Zhang, Tiejun; Yu, Long-Xi; McCord, Per; Miller, David; Bhamidimarri, Suresh; Johnson, David; Monteros, Maria J.; Ho, Julie; Reisen, Peter; Samac, Deborah A.

2014-01-01

Verticillium wilt, caused by the soilborne fungus, Verticillium alfalfae, is one of the most serious diseases of alfalfa (Medicago sativa L.) worldwide. To identify loci associated with resistance to Verticillium wilt, a bulk segregant analysis was conducted in susceptible or resistant pools constructed from 13 synthetic alfalfa populations, followed by association mapping in two F1 populations consisted of 352 individuals. Simple sequence repeat (SSR) and single nucleotide polymorphism (SNP) markers were used for genotyping. Phenotyping was done by manual inoculation of the pathogen to replicated cloned plants of each individual and disease severity was scored using a standard scale. Marker-trait association was analyzed by TASSEL. Seventeen SNP markers significantly associated with Verticillium wilt resistance were identified and they were located on chromosomes 1, 2, 4, 7 and 8. SNP markers identified on chromosomes 2, 4 and 7 co-locate with regions of Verticillium wilt resistance loci reported in M. truncatula. Additional markers identified on chromosomes 1 and 8 located the regions where no Verticillium resistance locus has been reported. This study highlights the value of SNP genotyping by high resolution melting to identify the disease resistance loci in tetraploid alfalfa. With further validation, the markers identified in this study could be used for improving resistance to Verticillium wilt in alfalfa breeding programs. PMID:25536106
Identification of molecular markers associated with Verticillium wilt resistance in alfalfa (Medicago sativa L.) using high-resolution melting.

PubMed

Zhang, Tiejun; Yu, Long-Xi; McCord, Per; Miller, David; Bhamidimarri, Suresh; Johnson, David; Monteros, Maria J; Ho, Julie; Reisen, Peter; Samac, Deborah A

2014-01-01

Verticillium wilt, caused by the soilborne fungus, Verticillium alfalfae, is one of the most serious diseases of alfalfa (Medicago sativa L.) worldwide. To identify loci associated with resistance to Verticillium wilt, a bulk segregant analysis was conducted in susceptible or resistant pools constructed from 13 synthetic alfalfa populations, followed by association mapping in two F1 populations consisted of 352 individuals. Simple sequence repeat (SSR) and single nucleotide polymorphism (SNP) markers were used for genotyping. Phenotyping was done by manual inoculation of the pathogen to replicated cloned plants of each individual and disease severity was scored using a standard scale. Marker-trait association was analyzed by TASSEL. Seventeen SNP markers significantly associated with Verticillium wilt resistance were identified and they were located on chromosomes 1, 2, 4, 7 and 8. SNP markers identified on chromosomes 2, 4 and 7 co-locate with regions of Verticillium wilt resistance loci reported in M. truncatula. Additional markers identified on chromosomes 1 and 8 located the regions where no Verticillium resistance locus has been reported. This study highlights the value of SNP genotyping by high resolution melting to identify the disease resistance loci in tetraploid alfalfa. With further validation, the markers identified in this study could be used for improving resistance to Verticillium wilt in alfalfa breeding programs.
High-throughput analysis of the satellitome illuminates satellite DNA evolution

NASA Astrophysics Data System (ADS)

Ruiz-Ruano, Francisco J.; López-León, María Dolores; Cabrero, Josefa; Camacho, Juan Pedro M.

2016-07-01

Satellite DNA (satDNA) is a major component yet the great unknown of eukaryote genomes and clearly underrepresented in genome sequencing projects. Here we show the high-throughput analysis of satellite DNA content in the migratory locust by means of the bioinformatic analysis of Illumina reads with the RepeatExplorer and RepeatMasker programs. This unveiled 62 satDNA families and we propose the term “satellitome” for the whole collection of different satDNA families in a genome. The finding that satDNAs were present in many contigs of the migratory locust draft genome indicates that they show many genomic locations invisible by fluorescent in situ hybridization (FISH). The cytological pattern of five satellites showing common descent (belonging to the SF3 superfamily) suggests that non-clustered satDNAs can become into clustered through local amplification at any of the many genomic loci resulting from previous dissemination of short satDNA arrays. The fact that all kinds of satDNA (micro- mini- and satellites) can show the non-clustered and clustered states suggests that all these elements are mostly similar, except for repeat length. Finally, the presence of VNTRs in bacteria, showing similar properties to non-clustered satDNAs in eukaryotes, suggests that this kind of tandem repeats show common properties in all living beings.
Isolation and characterization of 14 tetranucleotide microsatellite loci for the cannonball jellyfish (Stomolophus sp.) by next generation sequencing.

PubMed

Getino-Mamet, Leandro Nicolás; Valdivia-Carrillo, Tania; Gómez Daglio, Liza; García-De León, Francisco Javier

2017-04-01

The Cannonball jellyfish (Stomolophus sp.) is a species of jellyfish with high relevance in artisanal fishing. Studies of their populations do not extend beyond the morphological descriptions knowing that presents a great morphological variability. However, there are no genetic studies to determine the number of independent populations, so microsatellite markers become a suitable option. Since there are no species-specific microsatellite loci, in this paper, 14 new microsatellite loci are characterized. Microsatellite loci were isolated de novo through next generation sequencing, by two runs on Illumina MiSeq. A total of 506,771,269 base pair were obtained, from which 142,616 were microsatellite loci, and 1546 of them could design primers. We tested 14 primer pairs on 32 individuals from Bahía de La Paz, Gulf of California. We observed low genetic variation among loci (mean number of alleles per locus = 4.33, mean observed heterozygosity 0.381, mean expected heterozygosity 0.501). These loci are the first ones described for the species and will be helpful to carry out genetic diversity and population genetics studies.
Phylogenetic Status of an Unrecorded Species of Curvularia, C. spicifera, Based on Current Classification System of Curvularia and Bipolaris Group Using Multi Loci.

PubMed

Jeon, Sun Jeong; Nguyen, Thi Thuong Thuong; Lee, Hyang Burm

2015-09-01

A seed-borne fungus, Curvularia sp. EML-KWD01, was isolated from an indigenous wheat seed by standard blotter method. This fungus was characterized based on the morphological characteristics and molecular phylogenetic analysis. Phylogenetic status of the fungus was determined using sequences of three loci: rDNA internal transcribed spacer, large ribosomal subunit, and glyceraldehyde 3-phosphate dehydrogenase gene. Multi loci sequencing analysis revealed that this fungus was Curvularia spicifera within Curvularia group 2 of family Pleosporaceae.
Significant variance in genetic diversity among populations of Schistosoma haematobium detected using microsatellite DNA loci from a genome-wide database.

PubMed

Glenn, Travis C; Lance, Stacey L; McKee, Anna M; Webster, Bonnie L; Emery, Aidan M; Zerlotini, Adhemar; Oliveira, Guilherme; Rollinson, David; Faircloth, Brant C

2013-10-17

Urogenital schistosomiasis caused by Schistosoma haematobium is widely distributed across Africa and is increasingly being targeted for control. Genome sequences and population genetic parameters can give insight into the potential for population- or species-level drug resistance. Microsatellite DNA loci are genetic markers in wide use by Schistosoma researchers, but there are few primers available for S. haematobium. We sequenced 1,058,114 random DNA fragments from clonal cercariae collected from a snail infected with a single Schistosoma haematobium miracidium. We assembled and aligned the S. haematobium sequences to the genomes of S. mansoni and S. japonicum, identifying microsatellite DNA loci across all three species and designing primers to amplify the loci in S. haematobium. To validate our primers, we screened 32 randomly selected primer pairs with population samples of S. haematobium. We designed >13,790 primer pairs to amplify unique microsatellite loci in S. haematobium, (available at http://www.cebio.org/projetos/schistosoma-haematobium-genome). The three Schistosoma genomes contained similar overall frequencies of microsatellites, but the frequency and length distributions of specific motifs differed among species. We identified 15 primer pairs that amplified consistently and were easily scored. We genotyped these 15 loci in S. haematobium individuals from six locations: Zanzibar had the highest levels of diversity; Malawi, Mauritius, Nigeria, and Senegal were nearly as diverse; but the sample from South Africa was much less diverse. About half of the primers in the database of Schistosoma haematobium microsatellite DNA loci should yield amplifiable and easily scored polymorphic markers, thus providing thousands of potential markers. Sequence conservation among S. haematobium, S. japonicum, and S. mansoni is relatively high, thus it should now be possible to identify markers that are universal among Schistosoma species (i.e., using DNA sequences conserved among species), as well as other markers that are specific to species or species-groups (i.e., using DNA sequences that differ among species). Full genome-sequencing of additional species and specimens of S. haematobium, S. japonicum, and S. mansoni is desirable to better characterize differences within and among these species, to develop additional genetic markers, and to examine genes as well as conserved non-coding elements associated with drug resistance.
Software for rapid time dependent ChIP-sequencing analysis (TDCA).

PubMed

Myschyshyn, Mike; Farren-Dai, Marco; Chuang, Tien-Jui; Vocadlo, David

2017-11-25

Chromatin immunoprecipitation followed by DNA sequencing (ChIP-seq) and associated methods are widely used to define the genome wide distribution of chromatin associated proteins, post-translational epigenetic marks, and modifications found on DNA bases. An area of emerging interest is to study time dependent changes in the distribution of such proteins and marks by using serial ChIP-seq experiments performed in a time resolved manner. Despite such time resolved studies becoming increasingly common, software to facilitate analysis of such data in a robust automated manner is limited. We have designed software called Time-Dependent ChIP-Sequencing Analyser (TDCA), which is the first program to automate analysis of time-dependent ChIP-seq data by fitting to sigmoidal curves. We provide users with guidance for experimental design of TDCA for modeling of time course (TC) ChIP-seq data using two simulated data sets. Furthermore, we demonstrate that this fitting strategy is widely applicable by showing that automated analysis of three previously published TC data sets accurately recapitulates key findings reported in these studies. Using each of these data sets, we highlight how biologically relevant findings can be readily obtained by exploiting TDCA to yield intuitive parameters that describe behavior at either a single locus or sets of loci. TDCA enables customizable analysis of user input aligned DNA sequencing data, coupled with graphical outputs in the form of publication-ready figures that describe behavior at either individual loci or sets of loci sharing common traits defined by the user. TDCA accepts sequencing data as standard binary alignment map (BAM) files and loci of interest in browser extensible data (BED) file format. TDCA accurately models the number of sequencing reads, or coverage, at loci from TC ChIP-seq studies or conceptually related TC sequencing experiments. TC experiments are reduced to intuitive parametric values that facilitate biologically relevant data analysis, and the uncovering of variations in the time-dependent behavior of chromatin. TDCA automates the analysis of TC ChIP-seq experiments, permitting researchers to easily obtain raw and modeled data for specific loci or groups of loci with similar behavior while also enhancing consistency of data analysis of TC data within the genomics field.
Interaction of the putative tyrosine recombinases RipX (UU145), XerC (UU222), and CodV (UU529) of Ureaplasma parvum serovar 3 with specific DNA

PubMed Central

Zimmerman, Carl-Ulrich R; Rosengarten, Renate; Spergser, Joachim

2013-01-01

Phase variation of two loci (‘mba locus’ and ‘UU172 phase-variable element’) in Ureaplasma parvum serovar 3 has been suggested as result of site-specific DNA inversion occurring at short inverted repeats. Three potential tyrosine recombinases (RipX, XerC, and CodV encoded by the genes UU145, UU222, and UU529) have been annotated in the genome of U. parvum serovar 3, which could be mediators in the proposed recombination event. We document that only orthologs of the gene xerC are present in all strains that show phase variation in the two loci. We demonstrate in vitro binding of recombinant maltose-binding protein fusions of XerC to the inverted repeats of the phase-variable loci, of RipX to a direct repeat that flanks a 20-kbp region, which has been proposed as putative pathogenicity island, and of CodV to a putative dif site. Co-transformation of the model organism Mycoplasma pneumoniae M129 with both the ‘mba locus’ and the recombinase gene xerC behind an active promoter region resulted in DNA inversion in the ‘mba locus’. Results suggest that XerC of U. parvum serovar 3 is a mediator in the proposed DNA inversion event of the two phase-variable loci. PMID:23305333
Microsatellite markers characterized in the barn owl (Tyto alba) and of high utility in other owls (Strigiformes: AVES).

PubMed

Klein, Akos; Horsburgh, Gavin J; Küpper, Clemens; Major, Agnes; Lee, Patricia L M; Hoffmann, Gyula; Mátics, Róbert; Dawson, Deborah A

2009-11-01

We have identified 15 polymorphic microsatellite loci for the barn owl (Tyto alba), five from testing published owl loci and 10 from testing non-owl loci, including loci known to be of high utility in passerines and shorebirds. All 15 loci were sequenced in barn owl, and new primer sets were designed for eight loci. The 15 polymorphic loci displayed two to 26 alleles in 56-58 barn owls. When tested in 10 other owl species (n = 1-6 individuals), between four and nine loci were polymorphic per species. These loci are suitable for studies of population structure and parentage in owls. © 2009 Blackwell Publishing Ltd.
Motif mismatches in microsatellites: insights from genome-wide investigation among 20 insect species.

PubMed

Behura, Susanta K; Severson, David W

2015-02-01

We present a detailed genome-wide comparative study of motif mismatches of microsatellites among 20 insect species representing five taxonomic orders. The results show that varying proportions (∼15-46%) of microsatellites identified in these species are imperfect in motif structure, and that they also vary in chromosomal distribution within genomes. It was observed that the genomic abundance of imperfect repeats is significantly associated with the length and number of motif mismatches of microsatellites. Furthermore, microsatellites with a higher number of mismatches tend to have lower abundance in the genome, suggesting that sequence heterogeneity of repeat motifs is a key determinant of genomic abundance of microsatellites. This relationship seems to be a general feature of microsatellites even in unrelated species such as yeast, roundworm, mouse and human. We provide a mechanistic explanation of the evolutionary link between motif heterogeneity and genomic abundance of microsatellites by examining the patterns of motif mismatches and allele sequences of single-nucleotide polymorphisms identified within microsatellite loci. Using Drosophila Reference Genetic Panel data, we further show that pattern of allelic variation modulates motif heterogeneity of microsatellites, and provide estimates of allele age of specific imperfect microsatellites found within protein-coding genes. © The Author 2014. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
Genetic Diversity of Arabica Coffee (Coffea arabica L.) in Nicaragua as Estimated by Simple Sequence Repeat Markers

PubMed Central

Geleta, Mulatu; Herrera, Isabel; Monzón, Arnulfo; Bryngelsson, Tomas

2012-01-01

Coffea arabica L. (arabica coffee), the only tetraploid species in the genus Coffea, represents the majority of the world's coffee production and has a significant contribution to Nicaragua's economy. The present paper was conducted to determine the genetic diversity of arabica coffee in Nicaragua for its conservation and breeding values. Twenty-six populations that represent eight varieties in Nicaragua were investigated using simple sequence repeat (SSR) markers. A total of 24 alleles were obtained from the 12 loci investigated across 260 individual plants. The total Nei's gene diversity (H T) and the within-population gene diversity (H S) were 0.35 and 0.29, respectively, which is comparable with that previously reported from other countries and regions. Among the varieties, the highest diversity was recorded in the variety Catimor. Analysis of variance (AMOVA) revealed that about 87% of the total genetic variation was found within populations and the remaining 13% differentiate the populations (F ST = 0.13; P < 0.001). The variation among the varieties was also significant. The genetic variation in Nicaraguan coffee is significant enough to be used in the breeding programs, and most of this variation can be conserved through ex situ conservation of a low number of populations from each variety. PMID:22701376

Coincidence of synteny breakpoints with malignancy-related deletions on human chromosome 3

PubMed Central

Kost-Alimova, Maria; Kiss, Hajnalka; Fedorova, Ludmila; Yang, Ying; Dumanski, Jan P.; Klein, George; Imreh, Stefan

2003-01-01

We have found previously that during tumor growth intact human chromosome 3 transferred into tumor cells regularly looses certain 3p regions, among them the ≈1.4-Mb common eliminated region 1 (CER1) at 3p21.3. Fluorescence in situ hybridization analysis of 12 mouse orthologous loci revealed that CER1 splits into two segments in mouse and therefore contains a murine/human conservation breakpoint region (CBR). Several breaks occurred in tumors within the region surrounding the CBR, and this sequence has features that characterize unstable chromosomal regions: deletions in yeast artificial chromosome clones, late replication, gene and segment duplications, and pseudogene insertions. Sequence analysis of the entire 3p12-22 revealed that other cancer-associated deletions (regions eliminated from monochromosomal hybrids carrying an intact chromosome 3 during tumor growth and homozygous deletions found in human tumors) colocalized nonrandomly with murine/human CBRs and were characterized by an increased number of local gene duplications and murine/human conservation mismatches (single genes that do not match into the conserved chromosomal segment). The CBR within CER1 contains a simple tandem TATAGA repeat capable of forming a 40-bp-long secondary hairpin-like structure. This repeat is nonrandomly localized within the other tumor-associated deletions and in the vicinity of 3p12-22 CBRs. PMID:12738884
Accurate, high-throughput typing of copy number variation using paralogue ratios from dispersed repeats

PubMed Central

Armour, John A. L.; Palla, Raquel; Zeeuwen, Patrick L. J. M.; den Heijer, Martin; Schalkwijk, Joost; Hollox, Edward J.

2007-01-01

Recent work has demonstrated an unexpected prevalence of copy number variation in the human genome, and has highlighted the part this variation may play in predisposition to common phenotypes. Some important genes vary in number over a high range (e.g. DEFB4, which commonly varies between two and seven copies), and have posed formidable technical challenges for accurate copy number typing, so that there are no simple, cheap, high-throughput approaches suitable for large-scale screening. We have developed a simple comparative PCR method based on dispersed repeat sequences, using a single pair of precisely designed primers to amplify products simultaneously from both test and reference loci, which are subsequently distinguished and quantified via internal sequence differences. We have validated the method for the measurement of copy number at DEFB4 by comparison of results from >800 DNA samples with copy number measurements by MAPH/REDVR, MLPA and array-CGH. The new Paralogue Ratio Test (PRT) method can require as little as 10 ng genomic DNA, appears to be comparable in accuracy to the other methods, and for the first time provides a rapid, simple and inexpensive method for copy number analysis, suitable for application to typing thousands of samples in large case-control association studies. PMID:17175532
Next generation DNA sequencing technology delivers valuable genetic markers for the genomic orphan legume species, Bituminaria bituminosa

PubMed Central

2011-01-01

Background Bituminaria bituminosa is a perennial legume species from the Canary Islands and Mediterranean region that has potential as a drought-tolerant pasture species and as a source of pharmaceutical compounds. Three botanical varieties have previously been identified in this species: albomarginata, bituminosa and crassiuscula. B. bituminosa can be considered a genomic 'orphan' species with very few genomic resources available. New DNA sequencing technologies provide an opportunity to develop high quality molecular markers for such orphan species. Results 432,306 mRNA molecules were sampled from a leaf transcriptome of a single B. bituminosa plant using Roche 454 pyrosequencing, resulting in an average read length of 345 bp (149.1 Mbp in total). Sequences were assembled into 3,838 isotigs/contigs representing putatively unique gene transcripts. Gene ontology descriptors were identified for 3,419 sequences. Raw sequence reads containing simple sequence repeat (SSR) motifs were identified, and 240 primer pairs flanking these motifs were designed. Of 87 primer pairs developed this way, 75 (86.2%) successfully amplified primarily single fragments by PCR. Fragment analysis using 20 primer pairs in 79 accessions of B. bituminosa detected 130 alleles at 21 SSR loci. Genetic diversity analyses confirmed that variation at these SSR loci accurately reflected known taxonomic relationships in original collections of B. bituminosa and provided additional evidence that a division of the botanical variety bituminosa into two according to geographical origin (Mediterranean region and Canary Islands) may be appropriate. Evidence of cross-pollination was also found between botanical varieties within a B. bituminosa breeding programme. Conclusions B. bituminosa can no longer be considered a genomic orphan species, having now a large (albeit incomplete) repertoire of expressed gene sequences that can serve as a resource for future genetic studies. This experimental approach was effective in developing codominant and polymorphic SSR markers for application in diverse genetic studies. These markers have already given new insight into genetic variation in B. bituminosa, providing evidence that a division of the botanical variety bituminosa may be appropriate. This approach is commended to those seeking to develop useful markers for genomic orphan species. PMID:22171578
Maize centromere structure and evolution: sequence analysis of centromeres 2 and 5 reveals dynamic Loci shaped primarily by retrotransposons.

PubMed

Wolfgruber, Thomas K; Sharma, Anupma; Schneider, Kevin L; Albert, Patrice S; Koo, Dal-Hoe; Shi, Jinghua; Gao, Zhi; Han, Fangpu; Lee, Hyeran; Xu, Ronghui; Allison, Jamie; Birchler, James A; Jiang, Jiming; Dawe, R Kelly; Presting, Gernot G

2009-11-01

We describe a comprehensive and general approach for mapping centromeres and present a detailed characterization of two maize centromeres. Centromeres are difficult to map and analyze because they consist primarily of repetitive DNA sequences, which in maize are the tandem satellite repeat CentC and interspersed centromeric retrotransposons of maize (CRM). Centromeres are defined epigenetically by the centromeric histone H3 variant, CENH3. Using novel markers derived from centromere repeats, we have mapped all ten centromeres onto the physical and genetic maps of maize. We were able to completely traverse centromeres 2 and 5, confirm physical maps by fluorescence in situ hybridization (FISH), and delineate their functional regions by chromatin immunoprecipitation (ChIP) with anti-CENH3 antibody followed by pyrosequencing. These two centromeres differ substantially in size, apparent CENH3 density, and arrangement of centromeric repeats; and they are larger than the rice centromeres characterized to date. Furthermore, centromere 5 consists of two distinct CENH3 domains that are separated by several megabases. Succession of centromere repeat classes is evidenced by the fact that elements belonging to the recently active recombinant subgroups of CRM1 colonize the present day centromeres, while elements of the ancestral subgroups are also found in the flanking regions. Using abundant CRM and non-CRM retrotransposons that inserted in and near these two centromeres to create a historical record of centromere location, we show that maize centromeres are fluid genomic regions whose borders are heavily influenced by the interplay of retrotransposons and epigenetic marks. Furthermore, we propose that CRMs may be involved in removal of centromeric DNA (specifically CentC), invasion of centromeres by non-CRM retrotransposons, and local repositioning of the CENH3.
Maize Centromere Structure and Evolution: Sequence Analysis of Centromeres 2 and 5 Reveals Dynamic Loci Shaped Primarily by Retrotransposons

PubMed Central

Albert, Patrice S.; Koo, Dal-Hoe; Shi, Jinghua; Gao, Zhi; Han, Fangpu; Lee, Hyeran; Xu, Ronghui; Allison, Jamie; Birchler, James A.; Jiang, Jiming; Dawe, R. Kelly; Presting, Gernot G.

2009-01-01

We describe a comprehensive and general approach for mapping centromeres and present a detailed characterization of two maize centromeres. Centromeres are difficult to map and analyze because they consist primarily of repetitive DNA sequences, which in maize are the tandem satellite repeat CentC and interspersed centromeric retrotransposons of maize (CRM). Centromeres are defined epigenetically by the centromeric histone H3 variant, CENH3. Using novel markers derived from centromere repeats, we have mapped all ten centromeres onto the physical and genetic maps of maize. We were able to completely traverse centromeres 2 and 5, confirm physical maps by fluorescence in situ hybridization (FISH), and delineate their functional regions by chromatin immunoprecipitation (ChIP) with anti-CENH3 antibody followed by pyrosequencing. These two centromeres differ substantially in size, apparent CENH3 density, and arrangement of centromeric repeats; and they are larger than the rice centromeres characterized to date. Furthermore, centromere 5 consists of two distinct CENH3 domains that are separated by several megabases. Succession of centromere repeat classes is evidenced by the fact that elements belonging to the recently active recombinant subgroups of CRM1 colonize the present day centromeres, while elements of the ancestral subgroups are also found in the flanking regions. Using abundant CRM and non-CRM retrotransposons that inserted in and near these two centromeres to create a historical record of centromere location, we show that maize centromeres are fluid genomic regions whose borders are heavily influenced by the interplay of retrotransposons and epigenetic marks. Furthermore, we propose that CRMs may be involved in removal of centromeric DNA (specifically CentC), invasion of centromeres by non-CRM retrotransposons, and local repositioning of the CENH3. PMID:19956743
Genetic analysis and mapping of adult plant resistance loci to leaf rust in durum wheat cultivar Bairds.

PubMed

Lan, Caixia; Basnet, Bhoja R; Singh, Ravi P; Huerta-Espino, Julio; Herrera-Foessel, Sybil A; Ren, Yong; Randhawa, Mandeep S

2017-03-01

New leaf rust adult plant resistance (APR) QTL QLr.cim - 6BL was mapped and confirmed the known pleotropic APR gene Lr46 effect on leaf rust in durum wheat line Bairds. CIMMYT-derived durum wheat line Bairds displays an adequate level of adult plant resistance (APR) to leaf rust in Mexican field environments. A recombinant inbred line (RIL) population developed from a cross of Bairds with susceptible parent Atred#1 was phenotyped for leaf rust response at Ciudad Obregon, Mexico, during 2013, 2014, 2015 and 2016 under artificially created epidemics of Puccinia triticina (Pt) race BBG/BP. The RIL population and its parents were genotyped with the 50 K diversity arrays technology (DArT) sequence system and simple sequence repeat (SSR) markers. A genetic map comprising 1150 markers was used to map the resistance loci. Four significant quantitative trait loci (QTLs) were detected on chromosomes 1BL, 2BC (centromere region), 5BL and 6BL. These QTLs, named Lr46, QLr.cim-2BC, QLr.cim-5BL and QLr.cim-6BL, respectively, explained 13.5-60.8%, 9.0-14.3%, 2.8-13.9%, and 11.6-29.4%, respectively, of leaf rust severity variation by the inclusive composite interval mapping method. All of these resistance loci were contributed by the resistant parent Bairds, except for QLr.cim-2BC, which came from susceptible parent Atred#1. Among these, the QTL on chromosome 1BL was the known pleiotropic APR gene Lr46, whereas QLr.cim-6BL, a consistently detected locus, should be a new leaf rust resistance locus in durum wheat. The mean leaf rust severity of RILs carrying all four QTLs ranged from 8.0 to 17.5%, whereas it ranged from 10.9 to 38.5% for three QTLs (Lr46 + 5BL + 6BL) derived from the resistant parent Bairds. Two RILs with four QTLs combinations can be used as sources of complex APR in durum wheat breeding.
Determining Y-STR mutation rates in deep-routing genealogies: Identification of haplogroup differences.

PubMed

Claerhout, Sofie; Vandenbosch, Michiel; Nivelle, Kelly; Gruyters, Leen; Peeters, Anke; Larmuseau, Maarten H D; Decorte, Ronny

2018-05-01

Knowledge of Y-chromosomal short tandem repeat (Y-STR) mutation rates is essential to determine the most recent common ancestor (MRCA) in familial searching or genealogy research. Up to now, locus-specific mutation rates have been extensively examined especially for commercially available forensic Y-STRs, while haplogroup specific mutation rates have not yet been investigated in detail. Through 450 patrilineally related namesakes distributed over 212 deep-rooting genealogies, the individual mutation rates of 42 Y-STR loci were determined, including 27 forensic Y-STR loci from the Yfiler ® Plus kit and 15 additional Y-STR loci (DYS388, DYS426, DYS442, DYS447, DYS454, DYS455, DYS459a/b, DYS549, DYS607, DYS643, DYS724a/b and YCAIIa/b). At least 726 mutations were observed over 148,596 meiosis and individual Y-STR mutation rates varied from 2.83 × 10 -4 to 1.86 × 10 -2 . The mutation rate was significantly correlated with the average allele size, the complexity of the repeat motif sequence and the age of the father. Significant differences in average Y-STR mutations rates were observed when haplogroup 'I & J' (4.03 × 10 -3 mutations/generation) was compared to 'R1b' (5.35 × 10 -3 mutations/generation) and to the overall mutation rate (5.03 × 10 -3 mutations/generation). A difference in allele size distribution was identified as the only cause for these haplogroup specific mutation rates. The haplogroup specific mutation rates were also present within the commercially available Y-STR kits (Yfiler ® , PowerPlex ® Y23 System and Yfiler ® Plus). This observation has consequences for applications where an average Y-STR mutation rate is used, e.g. tMRCA estimations in familial searching and genealogy research. Copyright © 2018 Elsevier B.V. All rights reserved.
Fluorescent in situ hybridization shows DIPLOSPOROUS located on one of the NOR chromosomes in apomictic dandelions (Taraxacum) in the absence of a large hemizygous chromosomal region.

PubMed

Vašut, Radim J; Vijverberg, Kitty; van Dijk, Peter J; de Jong, Hans

2014-11-01

Apomixis in dandelions (Taraxacum: Asteraceae) is encoded by two unlinked dominant loci and a third yet undefined genetic factor: diplosporous omission of meiosis (DIPLOSPOROUS, DIP), parthenogenetic embryo development (PARTHENOGENESIS, PAR), and autonomous endosperm formation, respectively. In this study, we determined the chromosomal position of the DIP locus in Taraxacum by using fluorescent in situ hybridization (FISH) with bacterial artificial chromosomes (BACs) that genetically map within 1.2-0.2 cM of DIP. The BACs showed dispersed fluorescent signals, except for S4-BAC 83 that displayed strong unique signals as well. Under stringent blocking of repeats by C0t-DNA fragments, only a few fluorescent foci restricted to defined chromosome regions remained, including one on the nucleolus organizer region (NOR) chromosomes that contains the 45S rDNAs. FISH with S4-BAC 83 alone and optimal blocking showed discrete foci in the middle of the long arm of one of the NOR chromosomes only in triploid and tetraploid diplosporous dandelions, while signals in sexual diploids were lacking. This agrees with the genetic model of a single dose, dominant DIP allele, absent in sexuals. The length of the DIP region is estimated to cover a region of 1-10 Mb. FISH in various accessions of Taraxacum and the apomictic sister species Chondrilla juncea, confirmed the chromosomal position of DIP within Taraxacum but not outside the genus. Our results endorse that, compared to other model apomictic species, expressing either diplospory or apospory, the genome of Taraxacum shows a more similar and less diverged chromosome structure at the DIP locus. The different levels of allele sequence divergence at apomeiosis loci may reflect different terms of asexual reproduction. The association of apomeiosis loci with repetitiveness, dispersed repeats, and retrotransposons commonly observed in apomictic species may imply a functional role of these shared features in apomictic reproduction, as is discussed.
Uropathogenic Escherichia coli are less likely than paired fecal E. coli to have CRISPR loci.

PubMed

Dang, Trang Nguyen Doan; Zhang, Lixin; Zöllner, Sebastian; Srinivasan, Usha; Abbas, Khadija; Marrs, Carl F; Foxman, Betsy

2013-10-01

CRISPRs (Clustered Regularly Interspaced Short Palindromic Repeats) are short fragments of DNA that act as an adaptive immune system protecting bacteria against invasion by phages, plasmids or other forms of foreign DNA. Bacteria without a CRISPR locus may more readily adapt to environmental changes by acquiring foreign genetic material. Uropathogenic Escherichia coli (UPEC) live in a number of environments suggesting an ability to rapidly adapt to new environments. If UPEC are more adaptive than commensal E. coli we would expect that UPEC would have fewer CRISPR loci, and--if loci are present--that they would harbor fewer spacers than CRISPR loci in fecal E. coli. We tested this in vivo by comparing the number of CRISPR loci and spacers, and sensitivity to antibiotics (resistance is often obtained via plasmids) among 81 pairs of UPEC and fecal E. coli isolated from women with urinary tract infection. Each pair included one uropathogen and one commensal (fecal) sample from the same female patient. Fecal isolates had more repeats (p=0.009) and more unique spacers (p<0.0001) at four CRISPR loci than uropathogens. By contrast, uropathogens were more likely than fecal E. coli to be resistant to ampicillin, cefazolin and trimethoprim/sulfamethoxazole. However, no consistent association between CRISPRs and antibiotic resistance was identified. To our knowledge, this is the first study to compare fecal E. coli and pathogenic E. coli from the same individuals, and to test the association of CRISPR loci with antibiotic resistance. Our results suggest that the absence of CRISPR loci may make UPEC more susceptible to infection by phages or plasmids and allow them to adapt more quickly to various environments. Copyright © 2013 Elsevier B.V. All rights reserved.
A study of Huntington disease-like syndromes in black South African patients reveals a single SCA2 mutation and a unique distribution of normal alleles across five repeat loci.

PubMed

Baine, Fiona K; Peerbhai, Nabeelah; Krause, Amanda

2018-07-15

Huntington disease (HD) is a progressive neurodegenerative disease, characterised by a triad of movement disorder, emotional and behavioural disturbances and cognitive impairment. The underlying cause is an expanded CAG repeat in the huntingtin gene. For a small proportion of patients presenting with HD-like symptoms, the mutation in this gene is not identified and they are said to have a HD "phenocopy". South Africa has the highest number of recorded cases of an African-specific phenocopy, Huntington disease-like 2 (HDL2), caused by a repeat expansion in the junctophilin-3 gene. However, a significant proportion of black patients with clinical symptoms suggestive of HD still test negative for HD and HDL2. This study thus aimed to investigate five other loci associated with HD phenocopy syndromes - ATN1, ATXN2, ATXN7, TBP and C9orf72. In a sample of patients in whom HD and HDL2 had been excluded, a single expansion was identified in the ATXN2 gene, confirming a diagnosis of Spinocerebellar ataxia 2. The results indicate that common repeat expansion disorders do not contribute significantly to the HD-like phenotype in black South African patients. Importantly, allele sizing reveals unique distributions of normal repeat lengths across the associated loci in the African population studied. Copyright © 2018 Elsevier B.V. All rights reserved.
Use of the VNTR typing technique to determine the origin of Mycobacterium tuberculosis strains isolated from Filipino patients in Korea.

PubMed

Lee, Jihye; Tupasi, Thelma E; Park, Young Kil

2014-05-01

With increasing international interchange of personnel, international monitoring is necessary to decrease tuberculosis incidence in the world. This study aims to develop a new tool to determine origin of Mycobacterium tuberculosis strains isolated from Filipino patients living in Korea. Thirty-two variable number tandem repeat (VNTR) loci were used for discrimination of 50 Filipino M. tuberculosis strains isolated in the Philippines, 317 Korean strains isolated in Korea, and 8 Filipino strains isolated in Korea. We found that the VNTR loci 0580, 0960, 2531, 2687, 2996, 0802, 2461, 2163a, 4052, 0424, 1955, 2074, 2347, 2401, 3171, 3690, 2372, 3232, and 4156 had different mode among copy numbers or exclusively distinct copy number in VNTR typing between Filipino and Korean M. tuberculosis strains. When these differences of the VNTR loci were applied to 8 Filipino M. tuberculosis strains isolated in Korea, 6 of them revealed Filipino type while 2 of them had Korean type. Using the differences of mode or repeated number of VNTR loci were very useful in distinguishing the Filipino strain from Korean strain.
Multi-Virulence-Locus Sequence Typing of Staphylococcus lugdunensis Generates Results Consistent with a Clonal Population Structure and Is Reliable for Epidemiological Typing

PubMed Central

Didi, Jennifer; Lemée, Ludovic; Gibert, Laure; Pons, Jean-Louis

2014-01-01

Staphylococcus lugdunensis is an emergent virulent coagulase-negative staphylococcus responsible for severe infections similar to those caused by Staphylococcus aureus. To understand its potentially pathogenic capacity and have further detailed knowledge of the molecular traits of this organism, 93 isolates from various geographic origins were analyzed by multi-virulence-locus sequence typing (MVLST), targeting seven known or putative virulence-associated loci (atlLR2, atlLR3, hlb, isdJ, SLUG_09050, SLUG_16930, and vwbl). The polymorphisms of the putative virulence-associated loci were moderate and comparable to those of the housekeeping genes analyzed by multilocus sequence typing (MLST). However, the MVLST scheme generated 43 virulence types (VTs) compared to 20 sequence types (STs) based on MLST, indicating that MVLST was significantly more discriminating (Simpson's index [D], 0.943). No hypervirulent lineage or cluster specific to carriage strains was defined. The results of multilocus sequence analysis of known and putative virulence-associated loci are consistent with a clonal population structure for S. lugdunensis, suggesting a coevolution of these genes with housekeeping genes. Indeed, the nonsynonymous to synonymous evolutionary substitutions (dN/dS) ratio, the Tajima's D test, and Single-likelihood ancestor counting (SLAC) analysis suggest that all virulence-associated loci were under negative selection, even atlLR2 (AtlL protein) and SLUG_16930 (FbpA homologue), for which the dN/dS ratios were higher. In addition, this analysis of virulence-associated loci allowed us to propose a trilocus sequence typing scheme based on the intragenic regions of atlLR3, isdJ, and SLUG_16930, which is more discriminant than MLST for studying short-term epidemiology and further characterizing the lineages of the rare but highly pathogenic S. lugdunensis. PMID:25078912
Infrared fluorescent automated detection of thirteen short tandem repeat polymorphisms and one gender-determining system of the CODIS core system.

PubMed

Ricci, U; Sani, I; Guarducci, S; Biondi, C; Pelagatti, S; Lazzerini, V; Brusaferri, A; Lapini, M; Andreucci, E; Giunti, L; Giovannucci Uzielli, M L

2000-11-01

We used an infrared (IR) automated fluorescence monolaser sequencer for the analysis of 13 autosomal short tandem repeat (STR) systems (TPOX, D3S1358, FGA, CSF1PO, D5S818, D7S820, D8S1179, TH01, vWA, D13S317, D16S359, D18S51, D21S11) and the X-Y homologous gene amelogenin system. These two systems represent the core of the combined DNA index systems (CODIS). Four independent multiplex reactions, based on the polymerase chain reaction (PCR) technique and on the direct labeling of the forward primer of every primer pair, with a new molecule (IRDye800), were set up, permitting the exact characterization of the alleles by comparison with ladders of specific sequenced alleles. This is the first report of the whole analysis of the STRs of the CODIS core using an IR automated DNA sequencer. The protocol was used to solve paternity/maternity tests and for population studies. The electrophoretic system also proved useful for the correct typing of those loci differing in size by only 2 bp. A sensibility study demonstrated that the test can detect an average of 10 pg of undegraded human DNA. We also performed a preliminary study analyzing some forensic samples and mixed stains, which suggested the usefulness of using this analytical system for human identification as well as for forensic purposes.
Characterisation of 12 microsatellite loci in the Vietnamese commercial clam Lutraria rhynchaena Jonas 1844 (Heterodonta: Bivalvia: Mactridae) through next-generation sequencing.

PubMed

Thai, Binh Thanh; Tan, Mun Hua; Lee, Yin Peng; Gan, Han Ming; Tran, Trang Thi; Austin, Christopher M

2016-05-01

The marine clam Lutraria rhynchaena is gaining popularity as an aquaculture species in Asia. Lutraria populations are present in the wild throughout Vietnam and several stocks have been established and translocated for breeding and aquaculture grow-out purposes. In this study, we demonstrate the feasibility of utilising Illumina next-generation sequencing technology to streamline the identification and genotyping of microsatellite loci from this clam species. Based on an initial partial genome scan, 48 microsatellite markers with similar melting temperatures were identified and characterised. The 12 most suitable polymorphic loci were then genotyped using 51 individuals from a population in Quang Ninh Province, North Vietnam. Genetic variation was low (mean number of alleles per locus = 2.6; mean expected heterozygosity = 0.41). Two loci showed significant deviation from Hardy-Weinberg equilibrium (HWE) and the presence of null alleles, but there was no evidence of linkage disequilibrium among loci. Three additional populations were screened (n = 7-36) to test the geographic utility of the 12 loci, which revealed 100 % successful genotyping in two populations from central Vietnam (Nha Trang). However, a second population from north Vietnam (Co To) could not be successfully genotyped and morphological evidence and mitochondrial variation suggests that this population represents a cryptic species of Lutraria. Comparisons of the Qang Ninh and Nha Trang populations, excluding the 2 loci out of HWE, revealed statistically significant allelic variation at 4 loci. We reported the first microsatellite loci set for the marine clam Lutraria rhynchaena and demonstrated its potential in differentiating clam populations. Additionally, a cryptic species population of Lutraria rhynchaena was identified during initial loci development, underscoring the overlooked diversity of marine clam species in Vietnam and the need to genetically characterise population representatives prior to microsatellite development. The rapid identification and validation of microsatellite loci using next-generation sequencing technology warrant its integration into future microsatellite loci development for key aquaculture species in Vietnam and more generally, aquaculture countries in the South East Asia region.
Arrangement and number of clustered regularly interspaced short palindromic repeat spacers are associated with erythromycin susceptibility in emm12, emm75 and emm92 of group A streptococcus.

PubMed

Zheng, P-X; Chiang-Ni, C; Wang, S-Y; Tsai, P-J; Kuo, C-F; Chuang, W-J; Lin, Y-S; Liu, C-C; Wu, J-J

2014-06-01

Clustered regularly interspaced short palindromic repeats (CRISPR) are composed of numerous repeat-spacer units and are considered a prokaryotic defence system against foreign nucleic acids. Since antibiotic-resistant genes are frequently encoded in foreign nucleic acids, the aim of this study was to test whether erythromycin susceptibility in group A streptococcus (Streptococcus pyogenes) is associated with characteristics of CRISPR elements. Erythromycin susceptibility of 330 isolates collected between 1997 and 2003 was analysed. Among 29 emm types, emm12, emm75 and emm92 showed significant changes in erythromycin-resistance rates. By sequencing the spacers from two CRISPR loci, spacer contents in emm12, emm75 and emm92 strains were associated with erythromycin susceptibility. Strains with fewer spacers were more resistant to erythromycin. Moreover, in emm4 strains, which showed no significant change in their annual erythromycin-resistance rate, CRISPR type and number of spacers were not correlated with erythromycin susceptibility. These results highlight a novel association between CRISPR spacer content and erythromycin susceptibility in group A streptococcus. © 2013 The Authors Clinical Microbiology and Infection © 2013 European Society of Clinical Microbiology and Infectious Diseases.
Analysis of expressed sequence tags from Prunus mume flower and fruit and development of simple sequence repeat markers

PubMed Central

2010-01-01

Background Expressed Sequence Tag (EST) has been a cost-effective tool in molecular biology and represents an abundant valuable resource for genome annotation, gene expression, and comparative genomics in plants. Results In this study, we constructed a cDNA library of Prunus mume flower and fruit, sequenced 10,123 clones of the library, and obtained 8,656 expressed sequence tag (EST) sequences with high quality. The ESTs were assembled into 4,473 unigenes composed of 1,492 contigs and 2,981 singletons and that have been deposited in NCBI (accession IDs: GW868575 - GW873047), among which 1,294 unique ESTs were with known or putative functions. Furthermore, we found 1,233 putative simple sequence repeats (SSRs) in the P. mume unigene dataset. We randomly tested 42 pairs of PCR primers flanking potential SSRs, and 14 pairs were identified as true-to-type SSR loci and could amplify polymorphic bands from 20 individual plants of P. mume. We further used the 14 EST-SSR primer pairs to test the transferability on peach and plum. The result showed that nearly 89% of the primer pairs produced target PCR bands in the two species. A high level of marker polymorphism was observed in the plum species (65%) and low in the peach (46%), and the clustering analysis of the three species indicated that these SSR markers were useful in the evaluation of genetic relationships and diversity between and within the Prunus species. Conclusions We have constructed the first cDNA library of P. mume flower and fruit, and our data provide sets of molecular biology resources for P. mume and other Prunus species. These resources will be useful for further study such as genome annotation, new gene discovery, gene functional analysis, molecular breeding, evolution and comparative genomics between Prunus species. PMID:20626882
A palindrome-mediated mechanism distinguishes translocations involving LCR-B of chromosome 22q11.2.

PubMed

Gotter, Anthony L; Shaikh, Tamim H; Budarf, Marcia L; Rhodes, C Harker; Emanuel, Beverly S

2004-01-01

Two known recurrent constitutional translocations, t(11;22) and t(17;22), as well as a non-recurrent t(4;22), display derivative chromosomes that have joined to a common site within the low copy repeat B (LCR-B) region of 22q11.2. This breakpoint is located between two AT-rich inverted repeats that form a nearly perfect palindrome. Breakpoints within the 11q23, 17q11 and 4q35 partner chromosomes also fall near the center of palindromic sequences. In the present work the breakpoints of a fourth translocation involving LCR-B, a balanced ependymoma-associated t(1;22), were characterized not only to localize this junction relative to known genes, but also to further understand the mechanism underlying these rearrangements. FISH mapping was used to localize the 22q11.2 breakpoint to LCR-B and the 1p21 breakpoint to single BAC clones. STS mapping narrowed the 1p21.2 breakpoint to a 1990 bp AT-rich region, and junction fragments were amplified by nested PCR. Junction fragment-derived sequence indicates that the 1p21.2 breakpoint splits a 278 nt palindrome capable of forming stem-loop secondary structure. In contrast, the 1p21.2 reference genomic sequence from clones in the database does not exhibit this configuration, suggesting a predisposition for regional genomic instability perhaps etiologic for this rearrangement. Given its similarity to known chromosomal fragile site (FRA) sequences, this polymorphic 1p21.2 sequence may represent one of the FRA1 loci. Comparative analysis of the secondary structure of sequences surrounding translocation breakpoints that involve LCR-B with those not involving this region indicate a unique ability of the former to form stem-loop structures. The relative likelihood of forming these configurations appears to be related to the rate of translocation occurrence. Further analysis suggests that constitutional translocations in general occur between sequences of similar melting temperature and propensity for secondary structure.
A palindrome-mediated mechanism distinguishes translocations involving LCR-B of chromosome 22q11.2

PubMed Central

Gotter, Anthony L.; Shaikh, Tamim H.; Budarf, Marcia L.; Rhodes, C. Harker; Emanuel, Beverly S.

2010-01-01

Two known recurrent constitutional translocations, t(11;22) and t(17;22), as well as a non-recurrent t(4;22), display derivative chromosomes that have joined to a common site within the low copy repeat B (LCR-B) region of 22q11.2. This breakpoint is located between two AT-rich inverted repeats that form a nearly perfect palindrome. Breakpoints within the 11q23, 17q11 and 4q35 partner chromosomes also fall near the center of palindromic sequences. In the present work the breakpoints of a fourth translocation involving LCR-B, a balanced ependymoma-associated t(1;22), were characterized not only to localize this junction relative to known genes, but also to further understand the mechanism underlying these rearrangements. FISH mapping was used to localize the 22q11.2 breakpoint to LCR-B and the 1p21 breakpoint to single BAC clones. STS mapping narrowed the 1p21.2 breakpoint to a 1990 bp AT-rich region, and junction fragments were amplified by nested PCR. Junction fragment-derived sequence indicates that the 1p21.2 breakpoint splits a 278 nt palindrome capable of forming stem–loop secondary structure. In contrast, the 1p21.2 reference genomic sequence from clones in the database does not exhibit this configuration, suggesting a predisposition for regional genomic instability perhaps etiologic for this rearrangement. Given its similarity to known chromosomal fragile site (FRA) sequences, this polymorphic 1p21.2 sequence may represent one of the FRA1 loci. Comparative analysis of the secondary structure of sequences surrounding translocation breakpoints that involve LCR-B with those not involving this region indicate a unique ability of the former to form stem–loop structures. The relative likelihood of forming these configurations appears to be related to the rate of translocation occurrence. Further analysis suggests that constitutional translocations in general occur between sequences of similar melting temperature and propensity for secondary structure. PMID:14613967
Substructure of a Tunisian Berber population as inferred from 15 autosomal short tandem repeat loci.

PubMed

Khodjet-El-Khil, Houssein; Fadhlaoui-Zid, Karima; Gusmão, Leonor; Alves, Cíntia; Benammar-Elgaaied, Amel; Amorim, Antonio

2008-08-01

Currently, language and cultural practices are the only criteria to distinguish between Berber autochthonous Tunisian populations. To evaluate these populations' possible genetic structure and differentiation, we have analyzed 15 autosomal short tandem repeat loci (CSF1PO, D3S1358, D5S818, D7S820, D8S1179, D13S317, D16S539, D18S51, D21S11, FGA, TH01, TPOX, VWA, D2S1338, and D19S433) in three southern Tunisian Berber groups: Sened, Matmata, and Chenini-Douiret. The exact test of population differentiation based on allele frequencies at the 15 loci shows significant P values at 7 loci between Chenini-Douiret and both Sened and Matmata, whereas just 5 loci show significant P values between Sened and Matmata. Comparative analyses between the three Berber groups based on genetic distances show that P values for F(ST) distances are significant between the three Berber groups. Population analysis performed using Structure shows a clear differentiation between these Berber groups, with strong genetic isolation of Chenini-Douiret. These results confirm at the autosomal level the high degree of heterogeneity of Tunisian Berber populations that had been previously reported for uniparental markers.
[Discriminatory power of variable number on tandem repeats loci for genotyping Mycobacterium tuberculosis strains in China].

PubMed

Chen, H X; Cai, C; Liu, J Y; Zhang, Z G; Yuan, M; Jia, J N; Sun, Z G; Huang, H R; Gao, J M; Li, W M

2017-06-10

Objective: Using the standard genotype method, variable number of tandem repeats (VNTR), we constructed a VNTR database to cover all provinces and proposed a set of optimized VNTR loci combinations for each province, in order to improve the preventive and control programs on tuberculosis, in China. Methods: A total of 15 loci VNTR was used to analyze 4 116 Mycobacterium tuberculosis strains, isolated from national survey of Drug Resistant Tuberculosis, in 2007. Hunter-Gaston Index (HGI) was also used to analyze the discriminatory power of each VNTR site. A set combination of 12-VNTR, 10-VNTR, 8-VNTR and 5-VNTR was respectively constructed for each province, based on 1) epidemic characteristics of M. tuberculosis lineages in China, with high discriminatory power and genetic stability. Results: Through the completed 15 loci VNTR patterns of 3 966 strains under 96.36 % (3 966/4 116) coverage, we found seven high HGI loci (including QUB11b and MIRU26) as well as low stable loci (including QUB26, MIRU16, Mtub21 and QUB11b) in several areas. In all the 31 provinces, we found an optimization VNTR combination as 10-VNTR loci in Inner Mongolia, Chongqing and Heilongjiang, but with 8-VNTR combination shared in other provinces. Conclusions: It is necessary to not only use the VNTR database for tracing the source of infection and cluster of M. tuberculosis in the nation but also using the set of optimized VNTR combinations in monitoring those local epidemics and M. tuberculosis (genetics in local) population.

Identification, characterization and genetic mapping of TLR1 loci in rainbow trout (Oncorhynchus mykiss)

USGS Publications Warehouse

Palti, Y.; Rodriguez, M.F.; Gahr, S.A.; Purcell, M.K.; Rexroad, C. E.; Wiens, G.D.

2010-01-01

Induction of innate immune pathways is critical for early anti-microbial defense but there is limited understanding of how teleosts recognize microbial molecules and activate these pathways. In mammals, Toll-like receptors (TLR) 1 and 2 form a heterodimer involved in recognizing peptidoglycans and lipoproteins of microbial origin. Herein, we identify and describe the rainbow trout (Oncorhynchus mykiss) TLR1 gene ortholog and its mRNA expression. Two TLR1 loci were identified from a rainbow trout bacterial artificial chromosome (BAC) library using DNA sequencing and genetic linkage analyses. Full length cDNA clone and direct sequencing of four BACs revealed an intact omTLR1 open reading frame (ORF) located on chromosome 14 and a second locus on chromosome 25 that contains a TLR1 pseudogene. The duplicated trout loci exhibit conserved synteny with other fish genomes that extends beyond the TLR1 gene sequences. The omTLR1 gene includes a single large coding exon similar to all other described TLR1 genes, but unlike other teleosts it also has a 5??? UTR exon and intron preceding the large coding exon. The omTLR1 ORF is predicted to encode an 808 amino-acid protein with 69% similarity to the Fugu TLR1 and a conserved pattern of predicted leucine-rich repeats (LRR). Phylogenetic analysis grouped omTLR1 with other fish TLR1 genes on a separate branch from the avian TLR1 and mammalian TLR1, 6 and 10. omTLR1 expression levels in rainbow trout anterior kidney leukocytes were not affected by the human TLR2/6 and TLR2/1 agonists diacylated lipoprotein (Pam2CSK4) and triacylated lipoprotein (Pam3CSK4). However, due to the lack of TLR6 and 10 genes in teleost genomes and up-regulation of TLR1 mRNA in response to LPS and bacterial infection in other fish species we hypothesize an important role for omTLR1 in anti-microbial immunity. Therefore, the identification of a TLR2 ortholog in rainbow trout and the development of assays to measure ligand binding and downstream signaling are critical for future elucidation of omTLR1 functions.
Identification, characterization and genetic mapping of TLR1 loci in rainbow trout (Oncorhynchus mykiss)

USGS Publications Warehouse

Palti, Yniv; Rodriguez, M. Fernanda; Gahr, Scott A.; Purcell, Maureen K.; Rexroad, Caird E.; Wiens, Gregory D.

2010-01-01

Induction of innate immune pathways is critical for early anti-microbial defense but there is limited understanding of how teleosts recognize microbial molecules and activate these pathways. In mammals, Toll-like receptors (TLR) 1 and 2 form a heterodimer involved in recognizing peptidoglycans and lipoproteins of microbial origin. Herein, we identify and describe the rainbow trout (Oncorhynchus mykiss) TLR1 gene ortholog and its mRNA expression. Two TLR1 loci were identified from a rainbow trout bacterial artificial chromosome (BAC) library using DNA sequencing and genetic linkage analyses. Full length cDNA clone and direct sequencing of four BACs revealed an intact omTLR1 open reading frame (ORF) located on chromosome 14 and a second locus on chromosome 25 that contains a TLR1 pseudogene. The duplicated trout loci exhibit conserved synteny with other fish genomes that extends beyond the TLR1 gene sequences. The omTLR1 gene includes a single large coding exon similar to all other described TLR1 genes, but unlike other teleosts it also has a 5' UTR exon and intron preceding the large coding exon. The omTLR1 ORF is predicted to encode an 808 amino-acid protein with 69% similarity to the Fugu TLR1 and a conserved pattern of predicted leucine-rich repeats (LRR). Phylogenetic analysis grouped omTLR1 with other fish TLR1 genes on a separate branch from the avian TLR1 and mammalian TLR1, 6 and 10. omTLR1 expression levels in rainbow trout anterior kidney leukocytes were not affected by the human TLR2/6 and TLR2/1 agonists diacylated lipoprotein (Pam2CSK4) and triacylated lipoprotein (Pam3CSK4). However, due to the lack of TLR6 and 10 genes in teleost genomes and up-regulation of TLR1 mRNA in response to LPS and bacterial infection in other fish species we hypothesize an important role for omTLR1 in anti-microbial immunity. Therefore, the identification of a TLR2 ortholog in rainbow trout and the development of assays to measure ligand binding and downstream signaling are critical for future elucidation of omTLR1 functions.
Egg Case Silk Gene Sequences from Argiope Spiders: Evidence for Multiple Loci and a Loss of Function Between Paralogs

PubMed Central

Chaw, R. Crystal; Collin, Matthew; Wimmer, Marjorie; Helmrick, Kara-Leigh; Hayashi, Cheryl Y.

2017-01-01

Spiders swath their eggs with silk to protect developing embryos and hatchlings. Egg case silks, like other fibrous spider silks, are primarily composed of proteins called spidroins (spidroin = spider-fibroin). Silks, and thus spidroins, are important throughout the lives of spiders, yet the evolution of spidroin genes has been relatively understudied. Spidroin genes are notoriously difficult to sequence because they are typically very long (≥ 10 kb of coding sequence) and highly repetitive. Here, we investigate the evolution of spider silk genes through long-read sequencing of Bacterial Artificial Chromosome (BAC) clones. We demonstrate that the silver garden spider Argiope argentata has multiple egg case spidroin loci with a loss of function at one locus. We also use degenerate PCR primers to search the genomic DNA of congeneric species and find evidence for multiple egg case spidroin loci in other Argiope spiders. Comparative analyses show that these multiple loci are more similar at the nucleotide level within a species than between species. This pattern is consistent with concerted evolution homogenizing gene copies within a genome. More complicated explanations include convergent evolution or recent independent gene duplications within each species. PMID:29127108
Universal target-enrichment baits for anthozoan (Cnidaria) phylogenomics: New approaches to long-standing problems.

PubMed

Quattrini, Andrea M; Faircloth, Brant C; Dueñas, Luisa F; Bridge, Tom C L; Brugler, Mercer R; Calixto-Botía, Iván F; DeLeo, Danielle M; Forêt, Sylvain; Herrera, Santiago; Lee, Simon M Y; Miller, David J; Prada, Carlos; Rádis-Baptista, Gandhi; Ramírez-Portilla, Catalina; Sánchez, Juan A; Rodríguez, Estefanía; McFadden, Catherine S

2018-03-01

Anthozoans (e.g., corals, anemones) are an ecologically important and diverse group of marine metazoans that occur from shallow to deep waters worldwide. However, our understanding of the evolutionary relationships among the ~7,500 species within this class is hindered by the lack of phylogenetically informative markers that can be reliably sequenced across a diversity of taxa. We designed and tested 16,306 RNA baits to capture 720 ultraconserved element loci and 1,071 exon loci. Library preparation and target enrichment were performed on 33 taxa from all orders within the class Anthozoa. Following Illumina sequencing and Trinity assembly, we recovered 1,774 of 1,791 targeted loci. The mean number of loci recovered from each species was 638 ± 222, with more loci recovered from octocorals (783 ± 138 loci) than hexacorals (475 ± 187 loci). Parsimony informative sites ranged from 26 to 49% for alignments at differing hierarchical taxonomic levels (e.g., Anthozoa, Octocorallia, Hexacorallia). The per cent of variable sites within each of three genera (Acropora, Alcyonium, and Sinularia) for which multiple species were sequenced ranged from 4.7% to 30%. Maximum-likelihood analyses recovered highly resolved trees with topologies matching those supported by other studies, including the monophyly of the order Scleractinia. Our results demonstrate the utility of this target-enrichment approach to resolve phylogenetic relationships from relatively old to recent divergences. Redesigning the baits with improved affinities to capture loci within each subclass will provide a valuable toolset to address systematic questions, further our understanding of the timing of diversifications and help resolve long-standing controversial relationships in the class Anthozoa. © 2017 John Wiley & Sons Ltd.
Organization of 5S rDNA in species of the fish Leporinus: two different genomic locations are characterized by distinct nontranscribed spacers.

PubMed

Martins, C; Galetti, P M

2001-10-01

To address understanding the organization of the 5S rRNA multigene family in the fish genome, the nucleotide sequence and organization array of 5S rDNA were investigated in the genus Leporinus, a representative freshwater fish group of South American fauna. PCR, subgenomic library screening, genomic blotting, fluorescence in situ hybridization, and DNA sequencing were employed in this study. Two arrays of 5S rDNA were identified for all species investigated, one consisting of monomeric repeat units of around 200 bp and another one with monomers of 900 bp. These 5S rDNA arrays were characterized by distinct NTS sequences (designated NTS-I and NTS-II for the 200- and 900-bp monomers, respectively); however, their coding sequences were nearly identical. The 5S rRNA genes were clustered in two chromosome loci, a major one corresponding to the NTS-I sites and a minor one corresponding to the NTS-II sites. The NTS-I sequence was variable among Leporinus spp., whereas the NTS-II was conserved among them and even in the related genus Schizodon. The distinct 5S rDNA arrays might characterize two 5S rRNA gene subfamilies that have been evolving independently in the genome.
Construction of a High-Density American Cranberry (Vaccinium macrocarpon Ait.) Composite Map Using Genotyping-by-Sequencing for Multi-pedigree Linkage Mapping.

PubMed

Schlautman, Brandon; Covarrubias-Pazaran, Giovanny; Diaz-Garcia, Luis; Iorizzo, Massimo; Polashock, James; Grygleski, Edward; Vorsa, Nicholi; Zalapa, Juan

2017-04-03

The American cranberry ( Vaccinium macrocarpon Ait.) is a recently domesticated, economically important, fruit crop with limited molecular resources. New genetic resources could accelerate genetic gain in cranberry through characterization of its genomic structure and by enabling molecular-assisted breeding strategies. To increase the availability of cranberry genomic resources, genotyping-by-sequencing (GBS) was used to discover and genotype thousands of single nucleotide polymorphisms (SNPs) within three interrelated cranberry full-sib populations. Additional simple sequence repeat (SSR) loci were added to the SNP datasets and used to construct bin maps for the parents of the populations, which were then merged to create the first high-density cranberry composite map containing 6073 markers (5437 SNPs and 636 SSRs) on 12 linkage groups (LGs) spanning 1124 cM. Interestingly, higher rates of recombination were observed in maternal than paternal gametes. The large number of markers in common (mean of 57.3) and the high degree of observed collinearity (mean Pair-wise Spearman rank correlations >0.99) between the LGs of the parental maps demonstrates the utility of GBS in cranberry for identifying polymorphic SNP loci that are transferable between pedigrees and populations in future trait-association studies. Furthermore, the high-density of markers anchored within the component maps allowed identification of segregation distortion regions, placement of centromeres on each of the 12 LGs, and anchoring of genomic scaffolds. Collectively, the results represent an important contribution to the current understanding of cranberry genomic structure and to the availability of molecular tools for future genetic research and breeding efforts in cranberry. Copyright © 2017 Schlautman et al.
[Association of aggressive behaviors of schizophrenia with short tandem repeats loci].

PubMed

Yang, Chun; Ba, Huajie; Tan, Xingqi; Zhao, Hanqing; Zhang, Shuyou; Yu, Haiying

2017-12-10

To assess the association of short tandem repeats (STRs) loci with aggressive behaviors of schizophrenia. Blood samples from 123 schizophrenic patients with aggressive behaviors and 489 schizophrenic patients without aggressive behaviors were collected. DNA from all samples was amplified with a PowerPlex 21 system and separated by electrophoresis to determine the genotypes and allelic frequencies of 20 STR loci including D3S1368, D1S1656, D6S1043, D13S317, Penta E, D16S639, D18S51, D2S1338, CSF1PO, Penta D, TH01, vWA, D21S11, D7S820, D5S818, TPOX, D8S1179, D12S391, D19S433, and FGA. All of the 20 STR loci have reached Hardy-Weinberg equilibrium in both groups. A significant difference was found in allelic and genotypic frequencies of loci Penta D between the two groups (alleles: P=0.042; genotypes: P=0.014) but not for the remaining 19 loci (P> 0.05). Univariate analysis also showed a significant difference for allele 10 and genotypes 10-12 of Penta D between the two groups (P=0.0027, P=0.0001), with the OR being 1.81 (95%CI: 1.22-2.67) and 4.33 (95%CI: 1.95-9.59), respectively. Penta D may be associated with aggressive behaviors of schizophrenia. Allele 10 and genotypes 10-12 of Penta D may confer a risk for the disease.
High-resolution definition of the Vibrio cholerae essential gene set with hidden Markov model–based analyses of transposon-insertion sequencing data

PubMed Central

Chao, Michael C.; Pritchard, Justin R.; Zhang, Yanjia J.; Rubin, Eric J.; Livny, Jonathan; Davis, Brigid M.; Waldor, Matthew K.

2013-01-01

The coupling of high-density transposon mutagenesis to high-throughput DNA sequencing (transposon-insertion sequencing) enables simultaneous and genome-wide assessment of the contributions of individual loci to bacterial growth and survival. We have refined analysis of transposon-insertion sequencing data by normalizing for the effect of DNA replication on sequencing output and using a hidden Markov model (HMM)-based filter to exploit heretofore unappreciated information inherent in all transposon-insertion sequencing data sets. The HMM can smooth variations in read abundance and thereby reduce the effects of read noise, as well as permit fine scale mapping that is independent of genomic annotation and enable classification of loci into several functional categories (e.g. essential, domain essential or ‘sick’). We generated a high-resolution map of genomic loci (encompassing both intra- and intergenic sequences) that are required or beneficial for in vitro growth of the cholera pathogen, Vibrio cholerae. This work uncovered new metabolic and physiologic requirements for V. cholerae survival, and by combining transposon-insertion sequencing and transcriptomic data sets, we also identified several novel noncoding RNA species that contribute to V. cholerae growth. Our findings suggest that HMM-based approaches will enhance extraction of biological meaning from transposon-insertion sequencing genomic data. PMID:23901011
[SSR loci information analysis in transcriptome of Andrographis paniculata].

PubMed

Li, Jun-Ren; Chen, Xiu-Zhen; Tang, Xiao-Ting; He, Rui; Zhan, Ruo-Ting

2018-06-01

To study the SSR loci information and develop molecular markers, a total of 43 683 Unigenes in transcriptome of Andrographis paniculata were used to explore SSR. The distribution frequency of SSR and the basic characteristics of repeat motifs were analyzed using MicroSAtellite software, SSR primers were designed by Primer 3.0 software and then validated by PCR. Moreover, the gene function analysis of SSR Unigene was obtained by Blast. The results showed that 14 135 SSR loci were found in the transcriptome of A. paniculata, which distributed in 9 973 Unigenes with a distribution frequency of 32.36%. Di-nucleotide and Tri-nucleotide repeat were the main types, accounted for 75.54% of all SSRs. The repeat motifs of AT/AT and CCG/CGG were the predominant repeat types of Di-nucleotide and Tri-nucleotide, respectively. A total of 4 740 pairs of SSR primers with the potential to produce polymorphism were designed for maker development. Ten pairs of primers in 20 pairs of randomly picked primers produced fragments with expected molecular size. The gene function of Unigenes containing SSR were mostly related to the basic metabolism function of A. paniculata. The SSR markers in transcriptome of A. paniculata show rich type, strong specificity and high potential of polymorphism, which will benefit the candidate gene mining and marker-assisted breeding. Copyright© by the Chinese Pharmaceutical Association.
Advanced Backcross QTL Analysis of Fiber Strength and Fineness in a Cross between Gossypium hirsutum and G. mustelinum.

PubMed

Wang, Baohua; Zhuang, Zhimin; Zhang, Zhengsheng; Draye, Xavier; Shuang, Lan-Shuan; Shehzad, Tariq; Lubbers, Edward L; Jones, Don; May, O Lloyd; Paterson, Andrew H; Chee, Peng W

2017-01-01

The molecular genetic basis of cotton fiber strength and fineness in crosses between Gossypium mustelinum and Gossypium hirsutum (Upland cotton) was dissected using 21 BC 3 F 2 and 12 corresponding BC 3 F 2:3 and BC 3 F 2:4 families. The BC 3 F 2 families were genotyped with simple sequence repeat markers from a G. hirsutum by G. mustelinum linkage map, and the three generations of BC 3 -derived families were phenotyped for fiber strength (STR) and fineness (Micronaire, MIC). A total of 42 quantitative trait loci (QTLs) were identified through one-way analysis of variance, including 15 QTLs for STR and 27 for MIC, with the percentage of variance explained by individual loci averaging 13.86 and 14.06%, respectively. Eighteen of the 42 QTLs were detected at least twice near the same markers in different generations/families or near linked markers in the same family, and 28 of the 42 QTLs were identified in both mixed model-based composite interval mapping and one-way variance analyses. Alleles from G. mustelinum increased STR for eight of 15 and reduced MIC for 15 of 27 QTLs. Significant among-family genotypic effects ( P < 0.001) were detected in 13 and 10 loci for STR and MIC respectively, and five loci showed significant ( P < 0.001) genotype × family interaction for MIC. These results support the hypothesis that fiber quality improvement for Upland cotton could be realized by introgressing G. mustelinum alleles although complexities due to the different effects of genetic background on introgressed chromatin might be faced. Building on prior work with G. barbadense, G. tomentosum , and G. darwinii , QTL mapping involving introgression of G. mustelinum alleles offers new allelic variation to Upland cotton germplasm.
Assignment of the dystonia-parkinsonism syndrome locus, DYT3, to a small region within a 1.8-Mb YAC contig of Xq13.1

DOE Office of Scientific and Technical Information (OSTI.GOV)

Haberhausen, G.; Schmitt, I.; Koehler, A.

1995-09-01

A YAC contig was constructed of Xq13.1 in order to sublocalize the X-linked dystonia-parkinsonism (XDP) syndrome locus, DYT3. The contig spans a region of {approximately}1.8 Mb and includes loci DXS453/DXS348/IL2R{gamma}/GJB1/CCG1/DXS559. For the construction of the contig, nine sequence-tagged sites and four short tandem repeat polymorphisms (STRPs) were isolated. The STRPs, designated as 4704 No. 6 (DXS7113), 4704 No. 7 (DXS7114), 67601 (DXS7117), and B4Pst (DXS7119) were assigned to a region flanked by DXS348 proximally and by DXS559 distally. Their order was DXS348/4704 No. 6/4704 No. 7/67601/B4Pst/DXS559. They were applied to the analysis of allelic association and of haplotypes in 47more » not-obviously-related XDP patients and in 105 Filipino male controls. The same haplotype was found at loci 67601 (DXS7117) and B4Pst (DXS7119) in 42 of 47 patients. This percentage of common haplotypes decreased at the adjacent loci. The findings, together with the previous demonstration of DXS559 being the distal flanking marker of DYT3, assign the disease locus to a small region in Xq13.1 defined by loci 67601 (DXS7117) and B4Pst (DXS7119). The location of DYT3 was born out by the application of a newly developed likelihood method for the analysis of linkage disequilibrium. 28 refs., 1 fig., 6 tabs.« less
Development of molecular method for sex identification in date palm (Phoenix dactylifera L.) plantlets using novel sex-linked microsatellite markers.

PubMed

Maryam; Jaskani, Muhammad Jafar; Awan, Faisal Saeed; Ahmad, Saeed; Khan, Iqrar A

2016-06-01

Microsatellite markers containing simple sequence repeats (SSRs) are a valuable tool for genetic analysis. Date palm is a dioecious and slow flowering and is very difficult to identify the gender of the trees until it reaches the reproductive age (5-10 years). A total of 12 microsatellite primers were used with 30 date palm samples, 14 parents (8 male + 6 females) and 16 progeny (developed from parents breeding) which showed that microsatellites were highly polymorphic, having a great number of alleles. A total of 124 alleles were characterized in 12 SSR loci. On average, there are 9.08 alleles per locus, with a range from 5 to 16 alleles, for primers mpdCIR15 and mpdCIR57, respectively. These primers produced 15 polymorphic loci specifically in male date palm samples and the seedlings harboring the unique fragments were further characterized as male plants. Increasingly, 38.46 % of these loci were scored as homozygous alleles while 61.53 % heterozygous allelic loci were determined. Primer mpdCIR48 produced a specific locus (250/250) in all male samples whereas the same locus was absent in female samples. Similarly, a locus of 300/310 bp reoccurred in 5 date palm male samples using marker DP-168 which indicated that these are the promising candidate marker to detect the sex in date palm seedlings at early stage. The data resulted from combination of 12 primers enabled the 16 seedling samples progeny (developed from parents breeding) of date palm cultivars to divide into two groups i.e., male and female regarding their sex expression comparative to the parents (male + female) using the principle coordinate analysis.
Characterization and Mapping of Leaf Rust and Stripe Rust Resistance Loci in Hexaploid Wheat Lines UC1110 and PI610750 under Mexican Environments.

PubMed

Lan, Caixia; Hale, Iago L; Herrera-Foessel, Sybil A; Basnet, Bhoja R; Randhawa, Mandeep S; Huerta-Espino, Julio; Dubcovsky, Jorge; Singh, Ravi P

2017-01-01

Growing resistant wheat varieties is a key method of minimizing the extent of yield losses caused by the globally important wheat leaf rust (LR) and stripe rust (YR) diseases. In this study, a population of 186 F 8 recombinant inbred lines (RILs) derived from a cross between a synthetic wheat derivative (PI610750) and an adapted common wheat line (cv. "UC1110") were phenotyped for LR and YR response at both seedling and adult plant stages over multiple seasons. Using a genetic linkage map consisting of single sequence repeats and diversity arrays technology markers, in combination with inclusive composite interval mapping analysis, we detected a new LR adult plant resistance (APR) locus, QLr.cim-2DS , contributed by UC1110. One co-located resistance locus to both rusts, QLr.cim-3DC/QYr.cim-3DC , and the known seedling resistance gene Lr26 were also mapped. QLr.cim-2DS and QLr.cim-3DC showed a marginally significant interaction for LR resistance in the adult plant stage. In addition, two previously reported YR APR loci, QYr.ucw-3BS and Yr48 , were found to exhibit stable performances in rust environments in both Mexico and the United States and showed a highly significant interaction in the field. Yr48 was also observed to confer intermediate seedling resistance against Mexican YR races, thus suggesting it should be re-classified as an all-stage resistance gene. We also identified 5 and 2 RILs that possessed all detected YR and LR resistance loci, respectively. With the closely linked molecular markers reported here, these RILs could be used as donors for multiple resistance loci to both rusts in wheat breeding programs.
Development and characterization of 21 polymorphic microsatellite markers for the barren-ground shrew, Sorex ugyunak (Mammalia: Sorcidae), through next-generation sequencing, and cross-species amplification in the masked shrew, S. cinereus

USGS Publications Warehouse

Sonsthagen, Sarah A.; Sage, G. Kevin; Fowler, Megan C.; Hope, Andrew G.; Cook, J.A.; Talbot, Sandra L.

2013-01-01

We used next generation shotgun sequencing to develop 21 novel microsatellite markers for the barren-ground shrew (Sorex ugyunak), which were polymorphic among individuals from northern Alaska. The loci displayed moderate allelic diversity (averaging 6.81 alleles per locus) and heterozygosity (averaging 70 %). Two loci deviated from Hardy–Weinberg equilibrium (HWE) due to heterozygote deficiency. While the population did not deviate from HWE overall, it showed significant linkage disequilibrium suggesting this population is not in mutation-drift equilibrium. Nineteen of 21 loci were polymorphic in masked shrews (S. cinereus) from interior Alaska and exhibited linkage equilibrium and HWE overall. All loci yielded sufficient variability for use in population studies.
Position-based scanning for comparative genomics and identification of genetic islands in Haemophilus influenzae type b.

PubMed

Bergman, Nicholas H; Akerley, Brian J

2003-03-01

Bacteria exhibit extensive genetic heterogeneity within species. In many cases, these differences account for virulence properties unique to specific strains. Several such loci have been discovered in the genome of the type b serotype of Haemophilus influenzae, a human pathogen able to cause meningitis, pneumonia, and septicemia. Here we report application of a PCR-based scanning procedure to compare the genome of a virulent type b (Hib) strain with that of the laboratory-passaged Rd KW20 strain for which a complete genome sequence is available. We have identified seven DNA segments or H. influenzae genetic islands (HiGIs) present in the type b genome and absent from the Rd genome. These segments vary in size and content and show signs of horizontal gene transfer in that their percent G+C content differs from that of the rest of the H. influenzae genome, they contain genes similar to those found on phages or other mobile elements, or they are flanked by DNA repeats. Several of these loci represent potential pathogenicity islands, because they contain genes likely to mediate interactions with the host. These newly identified genetic islands provide areas of investigation into both the evolution and pathogenesis of H. influenzae. In addition, the genome scanning approach developed to identify these islands provides a rapid means to compare the genomes of phenotypically diverse bacterial strains once the genome sequence of one representative strain has been determined.
Genotyping of Coxiella burnetii from domestic ruminants and human in Hungary: indication of various genotypes.

PubMed

Sulyok, Kinga M; Kreizinger, Zsuzsa; Hornstra, Heidie M; Pearson, Talima; Szigeti, Alexandra; Dán, Ádám; Balla, Eszter; Keim, Paul S; Gyuranecz, Miklós

2014-05-07

Information about the genotypic characteristic of Coxiella burnetii from Hungary is lacking. The aim of this study is to describe the genetic diversity of C. burnetii in Hungary and compare genotypes with those found elsewhere. A total of 12 samples: (cattle, n = 6, sheep, n = 5 and human, n = 1) collected from across Hungary were studied by a 10-loci multispacer sequence typing (MST) and 6-loci multiple-locus variable-number of tandem repeat analysis (MLVA). Phylogenetic relationships among MST genotypes show how these Hungarian samples are related to others collected around the world. Three MST genotypes were identified: sequence type (ST) 20 has also been identified in ruminants from other European countries and the USA, ST28 was previously identified in Kazakhstan, and the proposed ST37 is novel. All MST genotypes yielded different MLVA genotypes and three different MLVA genotypes were identified within ST20 samples alone. Two novel MLVA types 0-9-5-5-6-2 (AG) and 0-8-4-5-6-2 (AF) (Ms23-Ms24-Ms27-Ms28-Ms33-Ms34) were defined in the ovine materials correlated with ST28 and ST37. Samples from different parts of the phylogenetic tree were associated with different hosts, suggesting host-specific adaptations. Even with the limited number of samples analysed, this study revealed high genetic diversity among C. burnetii in Hungary. Understanding the background genetic diversity will be essential in identifying and controlling outbreaks.
Esophageal squamous cell carcinomas with DNA replication errors (RER+) are associated with p16/pRb loss and wild-type p53.

PubMed

Mathew, R; Arora, S; Mathur, M; Chattopadhyay, T K; Ralhan, R

2001-10-01

Microsatellite instability (MSI) as a determinant of propensity to esophageal squamous cell carcinoma (ESCC) at seven microsatellite markers at 2p (2p15-16), 3p (3p13, 3p14.1-3, 3p25, and 3p26) and 16q (16q12.1-3) was investigated to analyze their putative role as indicators of predisposition to esophageal malignancies. Seven microsatellite loci were amplified by polymerase chain reaction, from surgically resected tumor tissues from 30 ESCC patients from Indian population, to assess the loss of heterozygosity (LOH) and replication error repeats (RER) and to correlate these alterations with aberrations in major cell cycle regulatory proteins and histopathological parameters. LOH and RER analyses at these loci demonstrated moderate microsatellite alterations, suggesting the involvement of MSI in esophageal tumorigenesis in a subset of the Indian population. MSI, defined as RER in at least two or more of the loci studied, was observed in ten of 30 (33%) patients. Twenty-two of 30 patients (73%) showed LOH at one or more loci, while 17 of the 30 patients (60%) showed RER in at least one of the loci studied. RER-positive patients showed a trend towards better prognosis when compared to RER-negative patients. MSI demonstrated a significant association with concomitant loss of p16 and pRb (p16-/pRb- phenotype) (P=0.046). Interestingly, we observed an inverse correlation between MSI and p53 mutations (P=0.03) suggesting that MSI may provide a p53-independent pathway for esophageal tumorigenesis in RER+ patients. MSI showed a trend towards longer survival and absence of distant organ metastasis (P=0.06). The present study demonstrates the probable role of MSI in esophageal squamous cell carcinoma in the Indian population. Instability associated with the repetitive sequences--the revealing marks of loss of DNA replication fidelity may serve as an indicator of predisposition to esophageal cancer.
Alu expression in human cell lines and their retrotranspositional potential.

PubMed

Oler, Andrew J; Traina-Dorge, Stephen; Derbes, Rebecca S; Canella, Donatella; Cairns, Brad R; Roy-Engel, Astrid M

2012-06-20

The vast majority of the 1.1 million Alu elements are retrotranspositionally inactive, where only a few loci referred to as 'source elements' can generate new Alu insertions. The first step in identifying the active Alu sources is to determine the loci transcribed by RNA polymerase III (pol III). Previous genome-wide analyses from normal and transformed cell lines identified multiple Alu loci occupied by pol III factors, making them candidate source elements. Analysis of the data from these genome-wide studies determined that the majority of pol III-bound Alus belonged to the older subfamilies Alu S and Alu J, which varied between cell lines from 62.5% to 98.7% of the identified loci. The pol III-bound Alus were further scored for estimated retrotransposition potential (ERP) based on the absence or presence of selected sequence features associated with Alu retrotransposition capability. Our analyses indicate that most of the pol III-bound Alu loci candidates identified lack the sequence characteristics important for retrotransposition. These data suggest that Alu expression likely varies by cell type, growth conditions and transformation state. This variation could extend to where the same cell lines in different laboratories present different Alu expression patterns. The vast majority of Alu loci potentially transcribed by RNA pol III lack important sequence features for retrotransposition and the majority of potentially active Alu loci in the genome (scored high ERP) belong to young Alu subfamilies. Our observations suggest that in an in vivo scenario, the contribution of Alu activity on somatic genetic damage may significantly vary between individuals and tissues.
Microsatellite markers: what they mean and why they are so useful

PubMed Central

Vieira, Maria Lucia Carneiro; Santini, Luciane; Diniz, Augusto Lima; Munhoz, Carla de Freitas

2016-01-01

Abstract Microsatellites or Single Sequence Repeats (SSRs) are extensively employed in plant genetics studies, using both low and high throughput genotyping approaches. Motivated by the importance of these sequences over the last decades this review aims to address some theoretical aspects of SSRs, including definition, characterization and biological function. The methodologies for the development of SSR loci, genotyping and their applications as molecular markers are also reviewed. Finally, two data surveys are presented. The first was conducted using the main database of Web of Science, prospecting for articles published over the period from 2010 to 2015, resulting in approximately 930 records. The second survey was focused on papers that aimed at SSR marker development, published in the American Journal of Botany's Primer Notes and Protocols in Plant Sciences (over 2013 up to 2015), resulting in a total of 87 publications. This scenario confirms the current relevance of SSRs and indicates their continuous utilization in plant science. PMID:27561112
Next Generation Sequencing Plus (NGS+) with Y-chromosomal Markers for Forensic Pedigree Searches.

PubMed

Qian, Xiaoqin; Hou, Jiayi; Wang, Zheng; Ye, Yi; Lang, Min; Gao, Tianzhen; Liu, Jing; Hou, Yiping

2017-09-12

There is high demand for forensic pedigree searches with Y-chromosome short tandem repeat (Y-STR) profiling in large-scale crime investigations. However, when two Y-STR haplotypes have a few mismatched loci, it is difficult to determine if they are from the same male lineage because of the high mutation rate of Y-STRs. Here we design a new strategy to handle cases in which none of pedigree samples shares identical Y-STR haplotype. We combine next generation sequencing (NGS), capillary electrophoresis and pyrosequencing under the term 'NGS+' for typing Y-STRs and Y-chromosomal single nucleotide polymorphisms (Y-SNPs). The high-resolution Y-SNP haplogroup and Y-STR haplotype can be obtained with NGS+. We further developed a new data-driven decision rule, FSindex, for estimating the likelihood for each retrieved pedigree. Our approach enables positive identification of pedigree from mismatched Y-STR haplotypes. It is envisaged that NGS+ will revolutionize forensic pedigree searches, especially when the person of interest was not recorded in forensic DNA database.

CRISPR interference can prevent natural transformation and virulence acquisition during in vivo bacterial infection.

PubMed

Bikard, David; Hatoum-Aslan, Asma; Mucida, Daniel; Marraffini, Luciano A

2012-08-16

Pathogenic bacterial strains emerge largely due to transfer of virulence and antimicrobial resistance genes between bacteria, a process known as horizontal gene transfer (HGT). Clustered, regularly interspaced, short palindromic repeat (CRISPR) loci of bacteria and archaea encode a sequence-specific defense mechanism against bacteriophages and constitute a programmable barrier to HGT. However, the impact of CRISPRs on the emergence of virulence is unknown. We programmed the human pathogen Streptococcus pneumoniae with CRISPR sequences that target capsule genes, an essential pneumococcal virulence factor, and show that CRISPR interference can prevent transformation of nonencapsulated, avirulent pneumococci into capsulated, virulent strains during infection in mice. Further, at low frequencies bacteria can lose CRISPR function, acquire capsule genes, and mount a successful infection. These results demonstrate that CRISPR interference can prevent the emergence of virulence in vivo and that strong selective pressure for virulence or antibiotic resistance can lead to CRISPR loss in bacterial pathogens. Copyright © 2012 Elsevier Inc. All rights reserved.
A novel high-resolution multilocus sequence typing of Giardia intestinalis Assemblage A isolates reveals zoonotic transmission, clonal outbreaks and recombination.

PubMed

Ankarklev, Johan; Lebbad, Marianne; Einarsson, Elin; Franzén, Oscar; Ahola, Harri; Troell, Karin; Svärd, Staffan G

2018-06-01

Molecular epidemiology and genotyping studies of the parasitic protozoan Giardia intestinalis have proven difficult due to multiple factors, such as low discriminatory power in the commonly used genotyping loci, which has hampered molecular analyses of outbreak sources, zoonotic transmission and virulence types. Here we have focused on assemblage A Giardia and developed a high-resolution assemblage-specific multilocus sequence typing (MLST) method. Analyses of sequenced G. intestinalis assemblage A genomes from different sub-assemblages identified a set of six genetic loci with high genetic variability. DNA samples from both humans (n = 44) and animals (n = 18) that harbored Giardia assemblage A infections, were PCR amplified (557-700 bp products) and sequenced at the six novel genetic loci. Bioinformatic analyses showed five to ten-fold higher levels of polymorphic sites than what was previously found among assemblage A samples using the classic genotyping loci. Phylogenetically, a division of two major clusters in assemblage A became apparent, separating samples of human and animal origin. A subset of human samples (n = 9) from a documented Giardia outbreak in a Swedish day-care center, showed full complementarity at nine genetic loci (the six new and the standard BG, TPI and GDH loci), strongly suggesting one source of infection. Furthermore, three samples of human origin displayed MLST profiles that were phylogenetically more closely related to MLST profiles from animal derived samples, suggesting zoonotic transmission. These new genotyping loci enabled us to detect events of recombination between different assemblage A isolates but also between assemblage A and E isolates. In summary, we present a novel and expanded MLST strategy with significantly improved sensitivity for molecular analyses of virulence types, zoonotic potential and source tracking for assemblage A Giardia. Copyright © 2018. Published by Elsevier B.V.
Genetic analysis of 20 autosomal STR loci in the Miao ethnic group from Yunnan Province, Southwest China.

PubMed

Zhang, Xiufeng; Hu, Liping; Du, Lei; Nie, Aiting; Rao, Min; Pang, Jing Bo; Xiran, Zeng; Nie, Shengjie

2017-05-01

The genetic polymorphisms of 20 autosomal short tandem repeat (STR) loci included in the PowerPlex ® 21 kit were evaluated from 748 unrelated healthy individuals of the Miao ethnic minority living in the Yunnan province in southwestern China. All of the loci reached Hardy-Weinberg equilibrium. These loci were examined to determine allele frequencies and forensic statistical parameters. The genetic relationship between the Miao population and other Chinese populations were also estimated. The combined discrimination power and probability of excluding paternity of the 20 STR loci were 0.999 999 999 999 999 999 999 991 26 and 0.999 999 975, respectively. The results suggested that the 20 STR loci were highly polymorphic, which makes them suitable for forensic personal identification and paternity testing. Copyright © 2017 Elsevier B.V. All rights reserved.
Genomics and introgression: discovery and mapping of thousands of species-diagnostic SNPs using RAD sequencing

USGS Publications Warehouse

Hand, Brian K.; Hether, Tyler D; Kovach, Ryan P.; Muhlfeld, Clint C.; Amish, Stephen J.; Boyer, Matthew C.; O’Rourke, Sean M.; Miller, Michael R.; Lowe, Winsor H.; Hohenlohe, Paul A.; Luikart, Gordon

2015-01-01

Invasive hybridization and introgression pose a serious threat to the persistence of many native species. Understanding the effects of hybridization on native populations (e.g., fitness consequences) requires numerous species-diagnostic loci distributed genome-wide. Here we used RAD sequencing to discover thousands of single-nucleotide polymorphisms (SNPs) that are diagnostic between rainbow trout (RBT, Oncorhynchus mykiss), the world’s most widely introduced fish, and native westslope cutthroat trout (WCT, O. clarkii lewisi) in the northern Rocky Mountains, USA. We advanced previous work that identified 4,914 species-diagnostic loci by using longer sequence reads (100 bp vs. 60 bp) and a larger set of individuals (n = 84). We sequenced RAD libraries for individuals from diverse sampling sources, including native populations of WCT and hatchery broodstocks of WCT and RBT. We also took advantage of a newly released reference genome assembly for RBT to align our RAD loci. In total, we discovered 16,788 putatively diagnostic SNPs, 10,267 of which we mapped to anchored chromosome locations on the RBT genome. A small portion of previously discovered putative diagnostic loci (325 of 4,914) were no longer diagnostic (i.e., fixed between species) based on our wider survey of non-hybridized RBT and WCT individuals. Our study suggests that RAD loci mapped to a draft genome assembly could provide the marker density required to identify genes and chromosomal regions influencing selection in admixed populations of conservation concern and evolutionary interest.
Insight into microevolution of Yersinia pestis by clustered regularly interspaced short palindromic repeats.

PubMed

Cui, Yujun; Li, Yanjun; Gorgé, Olivier; Platonov, Mikhail E; Yan, Yanfeng; Guo, Zhaobiao; Pourcel, Christine; Dentovskaya, Svetlana V; Balakhonov, Sergey V; Wang, Xiaoyi; Song, Yajun; Anisimov, Andrey P; Vergnaud, Gilles; Yang, Ruifu

2008-07-09

Yersinia pestis, the pathogen of plague, has greatly influenced human history on a global scale. Clustered Regularly Interspaced Short Palindromic Repeat (CRISPR), an element participating in immunity against phages' invasion, is composed of short repeated sequences separated by unique spacers and provides the basis of the spoligotyping technology. In the present research, three CRISPR loci were analyzed in 125 strains of Y. pestis from 26 natural plague foci of China, the former Soviet Union and Mongolia were analyzed, for validating CRISPR-based genotyping method and better understanding adaptive microevolution of Y. pestis. Using PCR amplification, sequencing and online data processing, a high degree of genetic diversity was revealed in all three CRISPR elements. The distribution of spacers and their arrays in Y. pestis strains is strongly region and focus-specific, allowing the construction of a hypothetic evolutionary model of Y. pestis. This model suggests transmission route of microtus strains that encircled Takla Makan Desert and ZhunGer Basin. Starting from Tadjikistan, one branch passed through the Kunlun Mountains, and moved to the Qinghai-Tibet Plateau. Another branch went north via the Pamirs Plateau, the Tianshan Mountains, the Altai Mountains and the Inner Mongolian Plateau. Other Y. pestis lineages might be originated from certain areas along those routes. CRISPR can provide important information for genotyping and evolutionary research of bacteria, which will help to trace the source of outbreaks. The resulting data will make possible the development of very low cost and high-resolution assays for the systematic typing of any new isolate.
Determination of the genetic diversity of vegetable soybean [Glycine max (L.) Merr.] using EST-SSR markers*

PubMed Central

Zhang, Gu-wen; Xu, Sheng-chun; Mao, Wei-hua; Hu, Qi-zan; Gong, Ya-ming

2013-01-01

The development of expressed sequence tag-derived simple sequence repeats (EST-SSRs) provided a useful tool for investigating plant genetic diversity. In the present study, 22 polymorphic EST-SSRs from grain soybean were identified and used to assess the genetic diversity in 48 vegetable soybean accessions. Among the 22 EST-SSR loci, tri-nucleotides were the most abundant repeats, accounting for 50.00% of the total motifs. GAA was the most common motif among tri-nucleotide repeats, with a frequency of 18.18%. Polymorphic analysis identified a total of 71 alleles, with an average of 3.23 per locus. The polymorphism information content (PIC) values ranged from 0.144 to 0.630, with a mean of 0.386. Observed heterozygosity (H o) values varied from 0.0196 to 1.0000, with an average of 0.6092, while the expected heterozygosity (H e) values ranged from 0.1502 to 0.6840, with a mean value of 0.4616. Principal coordinate analysis and phylogenetic tree analysis indicated that the accessions could be assigned to different groups based to a large extent on their geographic distribution, and most accessions from China were clustered into the same groups. These results suggest that Chinese vegetable soybean accessions have a narrow genetic base. The results of this study indicate that EST-SSRs from grain soybean have high transferability to vegetable soybean, and that these new markers would be helpful in taxonomy, molecular breeding, and comparative mapping studies of vegetable soybean in the future. PMID:23549845
The internal transcribed spacer (ITS) region and trnH-psbA [corrected] are suitable candidate loci for DNA barcoding of tropical tree species of India.

PubMed

Tripathi, Abhinandan Mani; Tyagi, Antariksh; Kumar, Anoop; Singh, Akanksha; Singh, Shivani; Chaudhary, Lal Babu; Roy, Sribash

2013-01-01

DNA barcoding as a tool for species identification has been successful in animals and other organisms, including certain groups of plants. The exploration of this new tool for species identification, particularly in tree species, is very scanty from biodiversity-rich countries like India. rbcL and matK are standard barcode loci while ITS, and trnH-psbA are considered as supplementary loci for plants. Plant barcode loci, namely, rbcL, matK, ITS, trnH-psbA, and the recently proposed ITS2, were tested for their efficacy as barcode loci using 300 accessions of tropical tree species. We tested these loci for PCR, sequencing success, and species discrimination ability using three methods. rbcL was the best locus as far as PCR and sequencing success rate were concerned, but not for the species discrimination ability of tropical tree species. ITS and trnH-psbA were the second best loci in PCR and sequencing success, respectively. The species discrimination ability of ITS ranged from 24.4 percent to 74.3 percent and that of trnH-psbA was 25.6 percent to 67.7 percent, depending upon the data set and the method used. matK provided the least PCR success, followed by ITS2 (59. 0%). Species resolution by ITS2 and rbcL ranged from 9.0 percent to 48.7 percent and 13.2 percent to 43.6 percent, respectively. Further, we observed that the NCBI nucleotide database is poorly represented by the sequences of barcode loci studied here for tree species. Although a conservative approach of a success rate of 60-70 percent by both ITS and trnH-psbA may not be considered as highly successful but would certainly help in large-scale biodiversity inventorization, particularly for tropical tree species, considering the standard success rate of plant DNA barcode program reported so far. The recommended matK and rbcL primers combination may not work in tropical tree species as barcode markers.
Construction, De-Novo Assembly and Analysis of Transcriptome for Identification of Reproduction-Related Genes and Pathways from Rohu, Labeo rohita (Hamilton)

PubMed Central

Sahu, Dinesh Kumar; Panda, Soumya Prasad; Meher, Prem Kumar; Das, Paramananda; Routray, Padmanav; Sundaray, Jitendra Kumar; Jayasankar, Pallipuram; Nandi, Samiran

2015-01-01

Rohu is a leading candidate species for freshwater aquaculture in South-East Asia. Unlike common carp the monsoon breeding habit of rohu restricts its seed production beyond season indicating strong genetic control over spawning. Genetic information is limited in this regard. The problem is exacerbated by the lack of genomic-resources. We identified 182 reproduction-related genes previously by Sanger-sequencing which were less to address the issue of seasonal spawning behaviour of this important carp. Therefore, the present work was taken up to generate transcriptome profile by mRNAseq. 16GB, 72bp paired end (PE) data was generated from the pooled-RNA of twelve-tissues from pre-spawning rohu using IlluminaGA-II-platform. There were 64.97 million high-quality reads producing 62,283 contigs and 88,612 numbers of transcripts using velvet and oases programs, respectively. Gene ontology annotation identified 940 reproduction-related genes consisting of 184 mainly associated with reproduction, 223 related to hormone-activity and receptor-binding, 178 receptor-activity and 355 embryonic-development related-proteins. The important reproduction-relevant pathways found in KEGG analysis were GnRH-signaling, oocyte-meiosis, steroid-biosynthesis, steroid-hormone biosynthesis, progesterone-mediated oocyte-maturation, retinol-metabolism, neuroactive-ligand-receptor interaction, neurotrophin-signaling and photo-transduction. Twenty nine simple sequence repeat containing sequences were also found out of which 12 repeat loci were polymorphic with mean expected-&-observed heterozygosity of 0.471 and 0.983 respectively. Quantitative RT-PCR analyses of 13-known and 6-unknown transcripts revealed differences in expression level between preparatory and post-spawning phase. These transcriptomic sequences have significantly increased the genetic-&-genomic resources for reproduction-research in Labeo rohita. PMID:26148098
Whole Genome Sequence Analysis of Mutations Accumulated in rad27Δ Yeast Strains with Defects in the Processing of Okazaki Fragments Indicates Template-Switching Events

PubMed Central

Omer, Sumita; Lavi, Bar; Mieczkowski, Piotr A.; Covo, Shay; Hazkani-Covo, Einat

2017-01-01

Okazaki fragments that are formed during lagging strand DNA synthesis include an initiating primer consisting of both RNA and DNA. The RNA fragment must be removed before the fragments are joined. In Saccharomyces cerevisiae, a key player in this process is the structure-specific flap endonuclease, Rad27p (human homolog FEN1). To obtain a genomic view of the mutational consequence of loss of RAD27, a S. cerevisiae rad27Δ strain was subcultured for 25 generations and sequenced using Illumina paired-end sequencing. Out of the 455 changes observed in 10 colonies isolated the two most common types of events were insertions or deletions (INDELs) in simple sequence repeats (SSRs) and INDELs mediated by short direct repeats. Surprisingly, we also detected a previously neglected class of 21 template-switching events. These events were presumably generated by quasi-palindrome to palindrome correction, as well as palindrome elongation. The formation of these events is best explained by folding back of the stalled nascent strand and resumption of DNA synthesis using the same nascent strand as a template. Evidence of quasi-palindrome to palindrome correction that could be generated by template switching appears also in yeast genome evolution. Out of the 455 events, 55 events appeared in multiple isolates; further analysis indicates that these loci are mutational hotspots. Since Rad27 acts on the lagging strand when the leading strand should not contain any gaps, we propose a mechanism favoring intramolecular strand switching over an intermolecular mechanism. We note that our results open new ways of understanding template switching that occurs during genome instability and evolution. PMID:28974572
Development of unigene-derived SSR markers in cowpea (Vigna unguiculata) and their transferability to other Vigna species.

PubMed

Gupta, S K; Gopalakrishna, T

2010-07-01

Unigene sequences available in public databases provide a cost-effective and valuable source for the development of molecular markers. In this study, the identification and development of unigene-based SSR markers in cowpea (Vigna unguiculata (L.) Walp.) is presented. A total of 1071 SSRs were identified in 15 740 cowpea unigene sequences downloaded from the National Center for Biotechnology Information. The most frequent SSR motifs present in the unigenes were trinucleotides (59.7%), followed by dinucleotides (34.8%), pentanucleotides (4%), and tetranucleotides (1.5%). The copy number varied from 6 to 33 for dinucleotide, 5 to 29 for trinucleotide, 5 to 7 for tetranucleotide, and 4 to 6 for pentanucleotide repeats. Primer pairs were successfully designed for 803 SSR motifs and 102 SSR markers were finally characterized and validated. Putative function was assigned to 64.7% of the unigene SSR markers based on significant homology to reported proteins. About 31.7% of the SSRs were present in coding sequences and 68.3% in untranslated regions of the genes. About 87% of the SSRs located in the coding sequences were trinucleotide repeats. Allelic variation at 32 SSR loci produced 98 alleles in 20 cowpea genotypes. The polymorphic information content for the SSR markers varied from 0.10 to 0.83 with an average of 0.53. These unigene SSR markers showed a high rate of transferability (88%) across other Vigna species, thereby expanding their utility. Alignment of unigene sequences with soybean genomic sequences revealed the presence of introns in amplified products of some of the SSR markers. This study presents the distribution of SSRs in the expressed portion of the cowpea genome and is the first report of the development of functional unigene-based SSR markers in cowpea. These SSR markers would play an important role in molecular mapping, comparative genomics, and marker-assisted selection strategies in cowpea and other Vigna species.
Forensic massively parallel sequencing data analysis tool: Implementation of MyFLq as a standalone web- and Illumina BaseSpace(®)-application.

PubMed

Van Neste, Christophe; Gansemans, Yannick; De Coninck, Dieter; Van Hoofstat, David; Van Criekinge, Wim; Deforce, Dieter; Van Nieuwerburgh, Filip

2015-03-01

Routine use of massively parallel sequencing (MPS) for forensic genomics is on the horizon. The last few years, several algorithms and workflows have been developed to analyze forensic MPS data. However, none have yet been tailored to the needs of the forensic analyst who does not possess an extensive bioinformatics background. We developed our previously published forensic MPS data analysis framework MyFLq (My-Forensic-Loci-queries) into an open-source, user-friendly, web-based application. It can be installed as a standalone web application, or run directly from the Illumina BaseSpace environment. In the former, laboratories can keep their data on-site, while in the latter, data from forensic samples that are sequenced on an Illumina sequencer can be uploaded to Basespace during acquisition, and can subsequently be analyzed using the published MyFLq BaseSpace application. Additional features were implemented such as an interactive graphical report of the results, an interactive threshold selection bar, and an allele length-based analysis in addition to the sequenced-based analysis. Practical use of the application is demonstrated through the analysis of four 16-plex short tandem repeat (STR) samples, showing the complementarity between the sequence- and length-based analysis of the same MPS data. Copyright © 2014 The Authors. Published by Elsevier Ireland Ltd.. All rights reserved.
A new hybrid approach for MHC genotyping: high-throughput NGS and long read MinION nanopore sequencing, with application to the non-model vertebrate Alpine chamois (Rupicapra rupicapra).

PubMed

Fuselli, S; Baptista, R P; Panziera, A; Magi, A; Guglielmi, S; Tonin, R; Benazzo, A; Bauzer, L G; Mazzoni, C J; Bertorelle, G

2018-03-24

The major histocompatibility complex (MHC) acts as an interface between the immune system and infectious diseases. Accurate characterization and genotyping of the extremely variable MHC loci are challenging especially without a reference sequence. We designed a combination of long-range PCR, Illumina short-reads, and Oxford Nanopore MinION long-reads approaches to capture the genetic variation of the MHC II DRB locus in an Italian population of the Alpine chamois (Rupicapra rupicapra). We utilized long-range PCR to generate a 9 Kb fragment of the DRB locus. Amplicons from six different individuals were fragmented, tagged, and simultaneously sequenced with Illumina MiSeq. One of these amplicons was sequenced with the MinION device, which produced long reads covering the entire amplified fragment. A pipeline that combines short and long reads resolved several short tandem repeats and homopolymers and produced a de novo reference, which was then used to map and genotype the short reads from all individuals. The assembled DRB locus showed a high level of polymorphism and the presence of a recombination breakpoint. Our results suggest that an amplicon-based NGS approach coupled with single-molecule MinION nanopore sequencing can efficiently achieve both the assembly and the genotyping of complex genomic regions in multiple individuals in the absence of a reference sequence.
Locus-specific oligonucleotide probes increase the usefulness of inter-Alu polymorphisms

DOE Office of Scientific and Technical Information (OSTI.GOV)

Jarnik, M.; Tang, J.Q.; Korab-Laskowska, M.

1994-09-01

Most of the mapping approaches are based on single-locus codominant markers of known location. Their multiplex ratio, defined as the number of loci that can be simultaneously tested, is typically one. An increased multiplex ratio was obtained by typing anonymous polymorphisms using PCR primers anchored in ubiquitous Alu-repeats. These so called alumorphs are revealed by inter-Alu-PCR and seen as the presence or absence of an amplified band of a given length. We decided to map alumorphs and to develop locus-specific oligonucleotide (LSO) probes to facilitate their use and transfer among different laboratories. We studied the segregation of alumorphs in eightmore » CEPH families, using two distinct Alu-primers, both directing PCR between the repeats in a tail-to-tail orientation. The segregating bands were assigned to chromosomal locations by two-point linkage analysis with CEPH markers (V6.0). They were excised from dried gels, reamplified, cloned and sequenced. The resulting LSOs were used as hybridization probes (i) to confirm chromosomal assignments in a human/hamster somatic cell hybrid panel, and (ii) to group certain allelic length variants, originally coded as separate dominant markres, into more informative codominant loci. These codominants were then placed by multipoint analysis on a microsatellite Genethon map. Finally, the LSO probes were used as polymorphic STSs, to identify by hybridization the corresponding markers among products of inter-Alu-PCR. The use of LSOs converts alumorphs into a system of non-anonymous, often multiallelic codominant markes which can be simultaneously typed, thus achieving the goal of high multiplex ratio.« less
Bioinformatics analyses of Shigella CRISPR structure and spacer classification.

PubMed

Wang, Pengfei; Zhang, Bing; Duan, Guangcai; Wang, Yingfang; Hong, Lijuan; Wang, Linlin; Guo, Xiangjiao; Xi, Yuanlin; Yang, Haiyan

2016-03-01

Clustered regularly interspaced short palindromic repeats (CRISPR) are inheritable genetic elements of a variety of archaea and bacteria and indicative of the bacterial ecological adaptation, conferring acquired immunity against invading foreign nucleic acids. Shigella is an important pathogen for anthroponosis. This study aimed to analyze the features of Shigella CRISPR structure and classify the spacers through bioinformatics approach. Among 107 Shigella, 434 CRISPR structure loci were identified with two to seven loci in different strains. CRISPR-Q1, CRISPR-Q4 and CRISPR-Q5 were widely distributed in Shigella strains. Comparison of the first and last repeats of CRISPR1, CRISPR2 and CRISPR3 revealed several base variants and different stem-loop structures. A total of 259 cas genes were found among these 107 Shigella strains. The cas gene deletions were discovered in 88 strains. However, there is one strain that does not contain cas gene. Intact clusters of cas genes were found in 19 strains. From comprehensive analysis of sequence signature and BLAST and CRISPRTarget score, the 708 spacers were classified into three subtypes: Type I, Type II and Type III. Of them, Type I spacer referred to those linked with one gene segment, Type II spacer linked with two or more different gene segments, and Type III spacer undefined. This study examined the diversity of CRISPR/cas system in Shigella strains, demonstrated the main features of CRISPR structure and spacer classification, which provided critical information for elucidation of the mechanisms of spacer formation and exploration of the role the spacers play in the function of the CRISPR/cas system.
Construction of a genetic linkage map and analysis of quantitative trait loci associated with the agronomically important traits of Pleurotus eryngii.

PubMed

Im, Chak Han; Park, Young-Hoon; Hammel, Kenneth E; Park, Bokyung; Kwon, Soon Wook; Ryu, Hojin; Ryu, Jae-San

2016-07-01

Breeding new strains with improved traits is a long-standing goal of mushroom breeders that can be expedited by marker-assisted selection (MAS). We constructed a genetic linkage map of Pleurotus eryngii based on segregation analysis of markers in postmeiotic monokaryons from KNR2312. In total, 256 loci comprising 226 simple sequence-repeat (SSR) markers, 2 mating-type factors, and 28 insertion/deletion (InDel) markers were mapped. The map consisted of 12 linkage groups (LGs) spanning 1047.8cM, with an average interval length of 4.09cM. Four independent populations (Pd3, Pd8, Pd14, and Pd15) derived from crossing between four monokaryons from KNR2532 as a tester strain and 98 monokaryons from KNR2312 were used to characterize quantitative trait loci (QTL) for nine traits such as yield, quality, cap color, and earliness. Using composite interval mapping (CIM), 71 QTLs explaining between 5.82% and 33.17% of the phenotypic variations were identified. Clusters of more than five QTLs for various traits were identified in three genomic regions, on LGs 1, 7 and 9. Regardless of the population, 6 of the 9 traits studied and 18 of the 71 QTLs found in this study were identified in the largest cluster, LG1, in the range from 65.4 to 110.4cM. The candidate genes for yield encoding transcription factor, signal transduction, mycelial growth and hydrolase are suggested by using manual and computational analysis of genome sequence corresponding to QTL region with the highest likelihood odds (LOD) for yield. The genetic map and the QTLs established in this study will help breeders and geneticists to develop selection markers for agronomically important characteristics of mushrooms and to identify the corresponding genes. Copyright © 2016 Elsevier Inc. All rights reserved.
Evaluating genetic diversity and constructing core collections of Chinese Lentinula edodes cultivars using ISSR and SRAP markers.

PubMed

Liu, Jun; Wang, Zhuo-Ren; Li, Chuang; Bian, Yin-Bing; Xiao, Yang

2015-06-01

Genetic diversity among 89 Chinese Lentinula edodes cultivars was analyzed by inter-simple sequence repeat (ISSR) and sequence-related amplified polymorphism (SRAP) markers. A 123 out of 126 ISSR loci (97.62%) and 108 out of 129 SRAP loci (83.73%) were polymorphic between two or more strains. A dendrogram constructed by cluster analysis based on the ISSR and SRAP markers separated the L. edodes strains into two major groups, of which group B was further divided into five subgroups. Clustering results also showed a positive correlation with the main agronomic traits of the strains, and that strains with similar traits clustered together into the same groups or subgroups in most cases. The average coefficient of pairwise genetic similarity was 0.820 (range: 0.576-0.988). Compared to the wild strains, Chinese L. edodes cultivars indicated a lower level of genetic diversity. Two preliminary core collections of L. edodes, Core1 and Core2, were established based on the ISSR and SRAP data, respectively. Core1 was constructed by the advanced M (maximization) strategy using the PowerCore version 1.0 software and contained 21 strains, whereas Core2 was created by the allele preferred sampling strategy using the cluster method and contained 18 strains. Both core collections were highly representative of the genetic diversity of the original germplasm, as confirmed by the values of Na (observed number of alleles), Ne (effective number of alleles), H (Nei's gene diversity) and I (Shannon's information index), as well as results of principal coordinate analysis. The loci retention ratio of Core1 (99.61%) was higher than that of Core2 (97.65%). Moreover, Core1 contained strains with more types of agronomic traits than those in Core2. This study builds the basis for further effective protection, management and use of L. edodes germplasm resource. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Deciphering the Theobroma cacao self-incompatibility system: from genomics to diagnostic markers for self-compatibility.

PubMed

Lanaud, Claire; Fouet, Olivier; Legavre, Thierry; Lopes, Uilson; Sounigo, Olivier; Eyango, Marie Claire; Mermaz, Benoit; Da Silva, Marcos Ramos; Loor Solorzano, Rey Gaston; Argout, Xavier; Gyapay, Gabor; Ebaiarrey, Herman Ebai; Colonges, Kelly; Sanier, Christine; Rivallan, Ronan; Mastin, Géraldine; Cryer, Nicholas; Boccara, Michel; Verdeil, Jean-Luc; Efombagn Mousseni, Ives Bruno; Peres Gramacho, Karina; Clément, Didier

2017-10-13

Cocoa self-compatibility is an important yield factor and has been described as being controlled by a late gameto-sporophytic system expressed only at the level of the embryo sac. It results in gametic non-fusion and involves several loci. In this work, we identified two loci, located on chromosomes 1 and 4 (CH1 and CH4), involved in cocoa self-incompatibility by two different processes. Both loci are responsible for gametic selection, but only one (the CH4 locus) is involved in the main fruit drop. The CH1 locus acts prior to the gamete fusion step and independently of the CH4 locus. Using fine-mapping and genome-wide association studies, we focused analyses on restricted regions and identified candidate genes. Some of them showed a differential expression between incompatible and compatible reactions. Immunolocalization experiments provided evidence of CH1 candidate genes expressed in ovule and style tissues. Highly polymorphic simple sequence repeat (SSR) diagnostic markers were designed in the CH4 region that had been identified by fine-mapping. They are characterized by a strong linkage disequilibrium with incompatibility alleles, thus allowing the development of efficient diagnostic markers predicting self-compatibility and fruit setting according to the presence of specific alleles or genotypes. SSR alleles specific to self-compatible Amelonado and Criollo varieties were also identified, thus allowing screening for self-compatible plants in cocoa populations. © The Author 2017. Published by Oxford University Press on behalf of the Society for Experimental Biology.
Genetic architecture and genomic patterns of gene flow between hybridizing species of Picea

PubMed Central

De La Torre, A; Ingvarsson, P K; Aitken, S N

2015-01-01

Hybrid zones provide an opportunity to study the effects of selection and gene flow in natural settings. We employed nuclear microsatellites (single sequence repeat (SSR)) and candidate gene single-nucleotide polymorphism markers (SNPs) to characterize the genetic architecture and patterns of interspecific gene flow in the Picea glauca × P. engelmannii hybrid zone across a broad latitudinal (40–60 degrees) and elevational (350–3500 m) range in western North America. Our results revealed a wide and complex hybrid zone with broad ancestry levels and low interspecific heterozygosity, shaped by asymmetric advanced-generation introgression, and low reproductive barriers between parental species. The clinal variation based on geographic variables, lack of concordance in clines among loci and the width of the hybrid zone points towards the maintenance of species integrity through environmental selection. Congruency between geographic and genomic clines suggests that loci with narrow clines are under strong selection, favoring either one parental species (directional selection) or their hybrids (overdominance) as a result of strong associations with climatic variables such as precipitation as snow and mean annual temperature. Cline movement due to past demographic events (evidenced by allelic richness and heterozygosity shifts from the average cline center) may explain the asymmetry in introgression and predominance of P. engelmannii found in this study. These results provide insights into the genetic architecture and fine-scale patterns of admixture, and identify loci that may be involved in reproductive barriers between the species. PMID:25806545
Evaluation of a 13-loci STR multiplex system for Cannabis sativa genetic identification.

PubMed

Houston, Rachel; Birck, Matthew; Hughes-Stamm, Sheree; Gangitano, David

2016-05-01

Marijuana (Cannabis sativa) is the most commonly used illicit substance in the USA. The development of a validated method using Cannabis short tandem repeats (STRs) could aid in the individualization of samples as well as serve as an intelligence tool to link multiple cases. For this purpose, a modified 13-loci STR multiplex method was optimized and evaluated according to ISFG and SWGDAM guidelines. A real-time PCR quantification method for C. sativa was developed and validated, and a sequenced allelic ladder was also designed to accurately genotype 199 C. sativa samples from 11 U.S. Customs and Border Protection seizures. Distinguishable DNA profiles were generated from 127 samples that yielded full STR profiles. Four duplicate genotypes within seizures were found. The combined power of discrimination of this multilocus system is 1 in 70 million. The sensitivity of the multiplex STR system is 0.25 ng of template DNA. None of the 13 STR markers cross-reacted with any of the studied species, except for Humulus lupulus (hops) which generated unspecific peaks. Phylogenetic analysis and case-to-case pairwise comparison of 11 cases using F st as genetic distance revealed the genetic association of four groups of cases. Moreover, due to their genetic similarity, a subset of samples (N = 97) was found to form a homogeneous population in Hardy-Weinberg and linkage equilibrium. The results of this research demonstrate the applicability of this 13-loci STR system in associating Cannabis cases for intelligence purposes.
Lessons from a Phenotyping Center Revealed by the Genome-Guided Mapping of Powdery Mildew Resistance Loci.

PubMed

Cadle-Davidson, Lance; Gadoury, David; Fresnedo-Ramírez, Jonathan; Yang, Shanshan; Barba, Paola; Sun, Qi; Demmings, Elizabeth M; Seem, Robert; Schaub, Michelle; Nowogrodzki, Anna; Kasinathan, Hema; Ledbetter, Craig; Reisch, Bruce I

2016-10-01

The genomics era brought unprecedented opportunities for genetic analysis of host resistance, but it came with the challenge that accurate and reproducible phenotypes are needed so that genomic results appropriately reflect biology. Phenotyping host resistance by natural infection in the field can produce variable results due to the uncontrolled environment, uneven distribution and genetics of the pathogen, and developmentally regulated resistance among other factors. To address these challenges, we developed highly controlled, standardized methodologies for phenotyping powdery mildew resistance in the context of a phenotyping center, receiving samples of up to 140 grapevine progeny per F 1 family. We applied these methodologies to F 1 families segregating for REN1- or REN2-mediated resistance and validated that some but not all bioassays identified the REN1 or REN2 locus. A point-intercept method (hyphal transects) to quantify colony density objectively at 8 or 9 days postinoculation proved to be the phenotypic response most reproducibly predicted by these resistance loci. Quantitative trait locus (QTL) mapping with genotyping-by-sequencing maps defined the REN1 and REN2 loci at relatively high resolution. In the reference PN40024 genome under each QTL, nucleotide-binding site-leucine-rich repeat candidate resistance genes were identified-one gene for REN1 and two genes for REN2. The methods described here for centralized resistance phenotyping and high-resolution genetic mapping can inform strategies for breeding resistance to powdery mildews and other pathogens on diverse, highly heterozygous hosts.

Some links on this page may take you to non-federal websites. Their policies may differ from this site.