Development of Pineapple Microsatellite Markers and Germplasm Genetic Diversity Analysis
Tong, Helin; Chen, You; Wang, Jingyi; Chen, Yeyuan; Sun, Guangming; He, Junhu; Wu, Yaoting
2013-01-01
Two methods were used to develop pineapple microsatellite markers. Genomic library-based SSR development: using selectively amplified microsatellite assay, 86 sequences were generated from pineapple genomic library. 91 (96.8%) of the 94 Simple Sequence Repeat (SSR) loci were dinucleotide repeats (39 AC/GT repeats and 52 GA/TC repeats, accounting for 42.9% and 57.1%, resp.), and the other three were mononucleotide repeats. Thirty-six pairs of SSR primers were designed; 24 of them generated clear bands of expected sizes, and 13 of them showed polymorphism. EST-based SSR development: 5659 pineapple EST sequences obtained from NCBI were analyzed; among 1397 nonredundant EST sequences, 843 were found containing 1110 SSR loci (217 of them contained more than one SSR locus). Frequency of SSRs in pineapple EST sequences is 1SSR/3.73 kb, and 44 types were found. Mononucleotide, dinucleotide, and trinucleotide repeats dominate, accounting for 95.6% in total. AG/CT and AGC/GCT were the dominant type of dinucleotide and trinucleotide repeats, accounting for 83.5% and 24.1%, respectively. Thirty pairs of primers were designed for each of randomly selected 30 sequences; 26 of them generated clear and reproducible bands, and 22 of them showed polymorphism. Eighteen pairs of primers obtained by the one or the other of the two methods above that showed polymorphism were selected to carry out germplasm genetic diversity analysis for 48 breeds of pineapple; similarity coefficients of these breeds were between 0.59 and 1.00, and they can be divided into four groups accordingly. Amplification products of five SSR markers were extracted and sequenced, corresponding repeat loci were found and locus mutations are mainly in copy number of repeats and base mutations in the flanking region. PMID:24024187
Wang, Zan; Yan, Hongwei; Fu, Xinnian; Li, Xuehui; Gao, Hongwen
2013-04-01
Efficient and robust molecular markers are essential for molecular breeding in plant. Compared to dominant and bi-allelic markers, multiple alleles of simple sequence repeat (SSR) markers are particularly informative and superior in genetic linkage map and QTL mapping in autotetraploid species like alfalfa. The objective of this study was to enrich SSR markers directly from alfalfa expressed sequence tags (ESTs). A total of 12,371 alfalfa ESTs were retrieved from the National Center for Biotechnology Information. Total 774 SSR-containing ESTs were identified from 716 ESTs. On average, one SSR was found per 7.7 kb of EST sequences. Tri-nucleotide repeats (48.8 %) was the most abundant motif type, followed by di-(26.1 %), tetra-(11.5 %), penta-(9.7 %), and hexanucleotide (3.9 %). One hundred EST-SSR primer pairs were successfully designed and 29 exhibited polymorphism among 28 alfalfa accessions. The allele number per marker ranged from two to 21 with an average of 6.8. The PIC values ranged from 0.195 to 0.896 with an average of 0.608, indicating a high level of polymorphism of the EST-SSR markers. Based on the 29 EST-SSR markers, assessment of genetic diversity was conducted and found that Medicago sativa ssp. sativa was clearly different from the other subspecies. The high transferability of those EST-SSR markers was also found for relative species.
Kayesh, E; Bilkish, N; Liu, G S; Chen, W; Leng, X P; Fang, J G
2014-03-31
Among different classes of molecular markers, expressed sequence tags (ESTs) are a new resource for developing simple sequence repeat (SSR) functional markers for genotyping and genetic mapping in F1 hybrid populations of Vitis vinifera L. Recently, because of the availability of an enormous amount of data for ESTs in the public domain, the emphasis has shifted from genomic SSRs to EST-SSRs, which belong to transcribed regions of the genome and may have a role in gene expression or function. The objective of this study was to assess the polymorphisms among 94 F1 hybrids from "Early Rose" and "Red Globe" using 25 EST-derived and 25 non-EST SSR markers. A total collection of 362,375 grape ESTs that were retrieved from the National Center for Biotechnology Information (NCBI) and 2522 EST-SSR sequences were identified. From them, 205 primer pairs were randomly selected, including 176 pairs that were EST-derived and 29 non-EST SSR primer pairs, for polymerase chain reaction amplification. A total of 131 alleles were amplified using 50 pairs of primers; 78 alleles were amplified using EST-derived SSR primers and 53 were from non-EST SSR primers. At most, 6 and 5 alleles were amplified by EST-derived and non-EST SSR primers, respectively. The EST-derived SSR markers showed a maximum polymorphic information content (PIC) value of 1 and a minimum of 0.33 while non-EST SSR markers had maximum and minimum PIC values of 1 and 0.25, respectively. The average PIC value was 0.56 for EST-derived SSR markers and 0.45 for non-EST SSR markers.
USDA-ARS?s Scientific Manuscript database
Simple sequence repeat technology based on expressed sequence tag (EST-SSR) is a useful genomic tool for genome mapping, characterizing plant species relationships, elucidating genome evolution, and tracing genes on alien chromosome segments. EST-SSR primers developed from three perennial diploid T...
SSR allelic variation in almond (Prunus dulcis Mill.).
Xie, Hua; Sui, Yi; Chang, Feng-Qi; Xu, Yong; Ma, Rong-Cai
2006-01-01
Sixteen SSR markers including eight EST-SSR and eight genomic SSRs were used for genetic diversity analysis of 23 Chinese and 15 international almond cultivars. EST- and genomic SSR markers previously reported in species of Prunus, mainly peach, proved to be useful for almond genetic analysis. DNA sequences of 117 alleles of six of the 16 SSR loci were analysed to reveal sequence variation among the 38 almond accessions. For the four SSR loci with AG/CT repeats, no insertions or deletions were observed in the flanking regions of the 98 alleles sequenced. Allelic size variation of these loci resulted exclusively from differences in the structures of repeat motifs, which involved interruptions or occurrences of new motif repeats in addition to varying number of AG/CT repeats. Some alleles had a high number of uninterrupted repeat motifs, indicating that SSR mutational patterns differ among alleles at a given SSR locus within the almond species. Allelic homoplasy was observed in the SSR loci because of base substitutions, interruptions or compound repeat motifs. Substitutions in the repeat regions were found at two SSR loci, suggesting that point mutations operate on SSRs and hinder the further SSR expansion by introducing repeat interruptions to stabilize SSR loci. Furthermore, it was shown that some potential point mutations in the flanking regions are linked with new SSR repeat motif variation in almond and peach.
Sequence analysis reveals genomic factors affecting EST-SSR primer performance and polymorphism
USDA-ARS?s Scientific Manuscript database
Search for simple sequence repeat (SSR) motifs and design of flanking primers in expressed sequence tag (EST) sequences can be easily done at a large scale using bioinformatics programs. However, failed amplification and/or detection, along with lack of polymorphism, is often seen among randomly sel...
Kayesh, E; Zhang, Y Y; Liu, G S; Bilkish, N; Sun, X; Leng, X P; Fang, J G
2013-09-23
The objectives of this investigation were to develop and validate the expressed sequence tag (EST)-simple sequence repeat (SSR) markers from large EST sequences, and to study the segregation and distribution of SSRs within two grapevine parental lines. In total, 94 F₁ lines crossed between "Early Rose" and "Red Globe" were studied. Approximately 2100 EST-SSR sequences of Vitis vinifera L. were searched for SSRs and analyzed for the design of polymerase chain reaction (PCR) primers amplifying the SSR-rich regions. Trinucleotide repeats were found to be the most abundant, followed by other nucleotide repeats. A total of 182 SSR primer pairs were first developed for the study on the parental polymorphism. Among the 182 SSR primers, 142 primer pairs (78%) could amplify the anticipated PCR products, among which only 52 primer pairs (36.62%) showed polymorphism between the two parents. These polymorphic bands were further surveyed among the 94 F₁ lines, and the results showed that a total of 162 bands were amplified, and 98 of them were polymorphic in both parents (60.86% polymorphism), with an average of 1.88 polymorphic DNA bands for each primer pair. After testing with the chi-square test, 33 of the clearly amplified polymorphic bands followed a 3:1 ratio, and 37 followed a 1:1 ratio. The rest showed distorted segregation ratios.
Singh, Swati; Gupta, Sanchita; Mani, Ashutosh; Chaturvedi, Anoop
2012-01-01
Humulus lupulus is commonly known as hops, a member of the family moraceae. Currently many projects are underway leading to the accumulation of voluminous genomic and expressed sequence tag sequences in public databases. The genetically characterized domains in these databases are limited due to non-availability of reliable molecular markers. The large data of EST sequences are available in hops. The simple sequence repeat markers extracted from EST data are used as molecular markers for genetic characterization, in the present study. 25,495 EST sequences were examined and assembled to get full-length sequences. Maximum frequency distribution was shown by mononucleotide SSR motifs i.e. 60.44% in contig and 62.16% in singleton where as minimum frequency are observed for hexanucleotide SSR in contig (0.09%) and pentanucleotide SSR in singletons (0.12%). Maximum trinucleotide motifs code for Glutamic acid (GAA) while AT/TA were the most frequent repeat of dinucleotide SSRs. Flanking primer pairs were designed in-silico for the SSR containing sequences. Functional categorization of SSRs containing sequences was done through gene ontology terms like biological process, cellular component and molecular function. PMID:22368382
Microsatellite DNA in genomic survey sequences and UniGenes of loblolly pine
Craig S Echt; Surya Saha; Dennis L Deemer; C Dana Nelson
2011-01-01
Genomic DNA sequence databases are a potential and growing resource for simple sequence repeat (SSR) marker development in loblolly pine (Pinus taeda L.). Loblolly pine also has many expressed sequence tags (ESTs) available for microsatellite (SSR) marker development. We compared loblolly pine SSR densities in genome survey sequences (GSSs) to those in non-redundant...
In silico search, characterization and validation of new EST-SSR markers in the genus Prunus.
Sorkheh, Karim; Prudencio, Angela S; Ghebinejad, Azim; Dehkordi, Mehrana Kohei; Erogul, Deniz; Rubio, Manuel; Martínez-Gómez, Pedro
2016-07-07
Simple sequence repeats (SSRs) are defined as sequence repeat units between 1 and 6 bp that occur in both coding and non-coding regions abundant in eukaryotic genomes, which may affect the expression of genes. In this study, expressed sequence tags (ESTs) of eight Prunus species were analyzed for in silico mining of EST-SSRs, protein annotation, and open reading frames (ORFs), and the identification of codon repetitions. A total of 316 SSRs were identified using MISA software. Dinucleotide SSR motifs (26.31 %) were found to be the most abundant type of repeats, followed by tri- (14.58 %), tetra- (0.53 %), and penta- (0.27 %) nucleotide motifs. An attempt was made to design primer pairs for 316 identified SSRs but these were successful for only 175 SSR sequences. The positions of SSRs with respect to ORFs were detected, and annotation of sequences containing SSRs was performed to assign function to each sequence. SSRs were also characterized (in terms of position in the reference genome and associated gene) using the two available Prunus reference genomes (mei and peach). Finally, 38 SSR markers were validated across peach, almond, plum, and apricot genotypes. This validation showed a higher transferability level of EST-SSR developed in P. mume (mei) in comparison with the rest of species analyzed. Findings will aid analysis of functionally important molecular markers and facilitate the analysis of genetic diversity.
Isobe, Sachiko N.; Hirakawa, Hideki; Sato, Shusei; Maeda, Fumi; Ishikawa, Masami; Mori, Toshiki; Yamamoto, Yuko; Shirasawa, Kenta; Kimura, Mitsuhiro; Fukami, Masanobu; Hashizume, Fujio; Tsuji, Tomoko; Sasamoto, Shigemi; Kato, Midori; Nanri, Keiko; Tsuruoka, Hisano; Minami, Chiharu; Takahashi, Chika; Wada, Tsuyuko; Ono, Akiko; Kawashima, Kumiko; Nakazaki, Naomi; Kishida, Yoshie; Kohara, Mitsuyo; Nakayama, Shinobu; Yamada, Manabu; Fujishiro, Tsunakazu; Watanabe, Akiko; Tabata, Satoshi
2013-01-01
The cultivated strawberry (Fragaria× ananassa) is an octoploid (2n = 8x = 56) of the Rosaceae family whose genomic architecture is still controversial. Several recent studies support the AAA′A′BBB′B′ model, but its complexity has hindered genetic and genomic analysis of this important crop. To overcome this difficulty and to assist genome-wide analysis of F. × ananassa, we constructed an integrated linkage map by organizing a total of 4474 of simple sequence repeat (SSR) markers collected from published Fragaria sequences, including 3746 SSR markers [Fragaria vesca expressed sequence tag (EST)-derived SSR markers] derived from F. vesca ESTs, 603 markers (F. × ananassa EST-derived SSR markers) from F. × ananassa ESTs, and 125 markers (F. × ananassa transcriptome-derived SSR markers) from F. × ananassa transcripts. Along with the previously published SSR markers, these markers were mapped onto five parent-specific linkage maps derived from three mapping populations, which were then assembled into an integrated linkage map. The constructed map consists of 1856 loci in 28 linkage groups (LGs) that total 2364.1 cM in length. Macrosynteny at the chromosome level was observed between the LGs of F. × ananassa and the genome of F. vesca. Variety distinction on 129 F. × ananassa lines was demonstrated using 45 selected SSR markers. PMID:23248204
Mornkham, T; Wangsomnuk, P P; Mo, X C; Francisco, F O; Gao, L Z; Kurzweil, H
2016-10-24
Jerusalem artichoke (Helianthus tuberosus L.) is a perennial tuberous plant and a traditional inulin-rich crop in Thailand. It has become the most important source of inulin and has great potential for use in chemical and food industries. In this study, expressed sequence tag (EST)-based simple sequence repeat (SSR) markers were developed from 40,362 Jerusalem artichoke ESTs retrieved from the NCBI database. Among 23,691 non-redundant identified ESTs, 1949 SSR motifs harboring 2 to 6 nucleotides with varied repeat motifs were discovered from 1676 assembled sequences. Seventy-nine primer pairs were generated from EST sequences harboring SSR motifs. Our results show that 43 primers are polymorphic for the six studied populations, while the remaining 36 were either monomorphic or failed to amplify. These 43 SSR loci exhibited a high level of genetic diversity among populations, with allele numbers varying from 2 to 7, with an average of 3.95 alleles per loci. Heterozygosity ranged from 0.096 to 0.774, with an average of 0.536; polymorphic index content ranged from 0.096 to 0.854, with an average of 0.568. Principal component analysis and neighbor-joining analysis revealed that the six populations could be divided into six clusters. Our results indicate that these newly characterized EST-SSR markers may be useful in the exploration of genetic diversity and range expansion of the Jerusalem artichoke, and in cross-species application for the genus Helianthus.
Characterization and Amplification of Gene-Based Simple Sequence Repeat (SSR) Markers in Date Palm.
Zhao, Yongli; Keremane, Manjunath; Prakash, Channapatna S; He, Guohao
2017-01-01
The paucity of molecular markers limits the application of genetic and genomic research in date palm (Phoenix dactylifera L.). Availability of expressed sequence tag (EST) sequences in date palm may provide a good resource for developing gene-based markers. This study characterizes a substantial fraction of transcriptome sequences containing simple sequence repeats (SSRs) from the EST sequences in date palm. The EST sequences studied are mainly homologous to those of Elaeis guineensis and Musa acuminata. A total of 911 gene-based SSR markers, characterized with functional annotations, have provided a useful basis not only for discovering candidate genes and understanding genetic basis of traits of interest but also for developing genetic and genomic tools for molecular research in date palm, such as diversity study, quantitative trait locus (QTL) mapping, and molecular breeding. The procedures of DNA extraction, polymerase chain reaction (PCR) amplification of these gene-based SSR markers, and gel electrophoresis of PCR products are described in this chapter.
Gao, Chunsheng; Xin, Pengfei; Cheng, Chaohua; Tang, Qing; Chen, Ping; Wang, Changbiao; Zang, Gonggu; Zhao, Lining
2014-01-01
Cannabis sativa L. is an important economic plant for the production of food, fiber, oils, and intoxicants. However, lack of sufficient simple sequence repeat (SSR) markers has limited the development of cannabis genetic research. Here, large-scale development of expressed sequence tag simple sequence repeat (EST-SSR) markers was performed to obtain more informative genetic markers, and to assess genetic diversity in cannabis (Cannabis sativa L.). Based on the cannabis transcriptome, 4,577 SSRs were identified from 3,624 ESTs. From there, a total of 3,442 complementary primer pairs were designed as SSR markers. Among these markers, trinucleotide repeat motifs (50.99%) were the most abundant, followed by hexanucleotide (25.13%), dinucleotide (16.34%), tetranucloetide (3.8%), and pentanucleotide (3.74%) repeat motifs, respectively. The AAG/CTT trinucleotide repeat (17.96%) was the most abundant motif detected in the SSRs. One hundred and seventeen EST-SSR markers were randomly selected to evaluate primer quality in 24 cannabis varieties. Among these 117 markers, 108 (92.31%) were successfully amplified and 87 (74.36%) were polymorphic. Forty-five polymorphic primer pairs were selected to evaluate genetic diversity and relatedness among the 115 cannabis genotypes. The results showed that 115 varieties could be divided into 4 groups primarily based on geography: Northern China, Europe, Central China, and Southern China. Moreover, the coefficient of similarity when comparing cannabis from Northern China with the European group cannabis was higher than that when comparing with cannabis from the other two groups, owing to a similar climate. This study outlines the first large-scale development of SSR markers for cannabis. These data may serve as a foundation for the development of genetic linkage, quantitative trait loci mapping, and marker-assisted breeding of cannabis.
Cheng, Chaohua; Tang, Qing; Chen, Ping; Wang, Changbiao; Zang, Gonggu; Zhao, Lining
2014-01-01
Cannabis sativa L. is an important economic plant for the production of food, fiber, oils, and intoxicants. However, lack of sufficient simple sequence repeat (SSR) markers has limited the development of cannabis genetic research. Here, large-scale development of expressed sequence tag simple sequence repeat (EST-SSR) markers was performed to obtain more informative genetic markers, and to assess genetic diversity in cannabis (Cannabis sativa L.). Based on the cannabis transcriptome, 4,577 SSRs were identified from 3,624 ESTs. From there, a total of 3,442 complementary primer pairs were designed as SSR markers. Among these markers, trinucleotide repeat motifs (50.99%) were the most abundant, followed by hexanucleotide (25.13%), dinucleotide (16.34%), tetranucloetide (3.8%), and pentanucleotide (3.74%) repeat motifs, respectively. The AAG/CTT trinucleotide repeat (17.96%) was the most abundant motif detected in the SSRs. One hundred and seventeen EST-SSR markers were randomly selected to evaluate primer quality in 24 cannabis varieties. Among these 117 markers, 108 (92.31%) were successfully amplified and 87 (74.36%) were polymorphic. Forty-five polymorphic primer pairs were selected to evaluate genetic diversity and relatedness among the 115 cannabis genotypes. The results showed that 115 varieties could be divided into 4 groups primarily based on geography: Northern China, Europe, Central China, and Southern China. Moreover, the coefficient of similarity when comparing cannabis from Northern China with the European group cannabis was higher than that when comparing with cannabis from the other two groups, owing to a similar climate. This study outlines the first large-scale development of SSR markers for cannabis. These data may serve as a foundation for the development of genetic linkage, quantitative trait loci mapping, and marker-assisted breeding of cannabis. PMID:25329551
Utility of EST-derived SSR in cultivated peanut (Arachis hypogaea L.) and Arachis wild species
Liang, Xuanqiang; Chen, Xiaoping; Hong, Yanbin; Liu, Haiyan; Zhou, Guiyuan; Li, Shaoxiong; Guo, Baozhu
2009-01-01
Background Lack of sufficient molecular markers hinders current genetic research in peanuts (Arachis hypogaea L.). It is necessary to develop more molecular markers for potential use in peanut genetic research. With the development of peanut EST projects, a vast amount of available EST sequence data has been generated. These data offered an opportunity to identify SSR in ESTs by data mining. Results In this study, we investigated 24,238 ESTs for the identification and development of SSR markers. In total, 881 SSRs were identified from 780 SSR-containing unique ESTs. On an average, one SSR was found per 7.3 kb of EST sequence with tri-nucleotide motifs (63.9%) being the most abundant followed by di- (32.7%), tetra- (1.7%), hexa- (1.0%) and penta-nucleotide (0.7%) repeat types. The top six motifs included AG/TC (27.7%), AAG/TTC (17.4%), AAT/TTA (11.9%), ACC/TGG (7.72%), ACT/TGA (7.26%) and AT/TA (6.3%). Based on the 780 SSR-containing ESTs, a total of 290 primer pairs were successfully designed and used for validation of the amplification and assessment of the polymorphism among 22 genotypes of cultivated peanuts and 16 accessions of wild species. The results showed that 251 primer pairs yielded amplification products, of which 26 and 221 primer pairs exhibited polymorphism among the cultivated and wild species examined, respectively. Two to four alleles were found in cultivated peanuts, while 3–8 alleles presented in wild species. The apparent broad polymorphism was further confirmed by cloning and sequencing of amplified alleles. Sequence analysis of selected amplified alleles revealed that allelic diversity could be attributed mainly to differences in repeat type and length in the microsatellite regions. In addition, a few single base mutations were observed in the microsatellite flanking regions. Conclusion This study gives an insight into the frequency, type and distribution of peanut EST-SSRs and demonstrates successful development of EST-SSR markers in cultivated peanut. These EST-SSR markers could enrich the current resource of molecular markers for the peanut community and would be useful for qualitative and quantitative trait mapping, marker-assisted selection, and genetic diversity studies in cultivated peanut as well as related Arachis species. All of the 251 working primer pairs with names, motifs, repeat types, primer sequences, and alleles tested in cultivated and wild species are listed in Additional File 1. PMID:19309524
Loblolly pine SSR markers for shortleaf pine genetics
C. Dana Nelson; Sedley Josserand; Craig S. Echt; Jeff Koppelman
2007-01-01
Simple sequence repeats (SSR) are highly informative DNA-based markers widely used in population genetic and linkage mapping studies. We have been developing PCR primer pairs for amplifying SSR markers for loblolly pine (Pinus taeda L.) using loblolly pine DNA and EST sequence data as starting materials. Fifty primer pairs known to reliably amplify...
Yuan, Can; Peng, Fang; Yang, Ze-Mao; Zhong, Wen-Juan; Mou, Fang-Sheng; Gong, Yi-Yun; Ji, Pei-Cheng; Pu, De-Qiang; Huang, Hai-Yan; Yang, Xiao; Zhang, Chao
2017-09-01
Ligusticum chuanxiong is a well-known traditional Chinese medicine plant. The study on its molecular markers development and germplasm resources is very important. In this study, we obtained 24 422 unigenes by assembling transcriptome sequencing reads of L. chuanxiong root. EST-SSR was detected and 4 073 SSR loci were identified. EST-SSR distribution and characteristic analysis results showed that the mono-nucleotide repeats were the main repeat types, accounting for 41.0%. In addition, the sequences containing SSR were functionally annotated in Gene Ontology (GO) and KEGG pathway and were assigned to 49 GO categories, 242 KEGG pathways, among them 2 201 sequences were annotated against Nr database. By validating 235 EST-SSRs,74 primer pairs were ultimately proved to have high quality amplification. Subsequently, genetic diversity analysis, UPGMA cluster analysis, PCoA analysis and population structure analysis of 34 L. chuanxiong germplasm resources were carried out with 74 primer pairs. In both UPGMA tree and PCoA results, L. chuanxiong resources were clustered into two groups, which are believed to be partial related to their geographical distribution. In this study, EST-SSRs in L. chuanxiong was firstly identified, and newly developed molecular markers would contribute significantly to further genetic diversity study, the purity detection, gene mapping, and molecular breeding. Copyright© by the Chinese Pharmaceutical Association.
A SSR-based genetic linkage map of cultivated peanut (Arachis hypogaea L.)
USDA-ARS?s Scientific Manuscript database
The objective of this study was to construct a molecular linkage map of cultivated tetraploid peanut using simple sequence repeat (SSR) markers derived primarily from peanut genomic sequences, expressed sequence tags (ESTs), and by "data mining" sequences released in GenBank. Three recombinant inbre...
2010-01-01
Background Epimedium sagittatum (Sieb. Et Zucc.) Maxim, a traditional Chinese medicinal plant species, has been used extensively as genuine medicinal materials. Certain Epimedium species are endangered due to commercial overexploition, while sustainable application studies, conservation genetics, systematics, and marker-assisted selection (MAS) of Epimedium is less-studied due to the lack of molecular markers. Here, we report a set of expressed sequence tags (ESTs) and simple sequence repeats (SSRs) identified in these ESTs for E. sagittatum. Results cDNAs of E. sagittatum are sequenced using 454 GS-FLX pyrosequencing technology. The raw reads are cleaned and assembled into a total of 76,459 consensus sequences comprising of 17,231 contigs and 59,228 singlets. About 38.5% (29,466) of the consensus sequences significantly match to the non-redundant protein database (E-value < 1e-10), 22,295 of which are further annotated using Gene Ontology (GO) terms. A total of 2,810 EST-SSRs is identified from the Epimedium EST dataset. Trinucleotide SSR is the dominant repeat type (55.2%) followed by dinucleotide (30.4%), tetranuleotide (7.3%), hexanucleotide (4.9%), and pentanucleotide (2.2%) SSR. The dominant repeat motif is AAG/CTT (23.6%) followed by AG/CT (19.3%), ACC/GGT (11.1%), AT/AT (7.5%), and AAC/GTT (5.9%). Thirty-two SSR-ESTs are randomly selected and primer pairs are synthesized for testing the transferability across 52 Epimedium species. Eighteen primer pairs (85.7%) could be successfully transferred to Epimedium species and sixteen of those show high genetic diversity with 0.35 of observed heterozygosity (Ho) and 0.65 of expected heterozygosity (He) and high number of alleles per locus (11.9). Conclusion A large EST dataset with a total of 76,459 consensus sequences is generated, aiming to provide sequence information for deciphering secondary metabolism, especially for flavonoid pathway in Epimedium. A total of 2,810 EST-SSRs is identified from EST dataset and ~1580 EST-SSR markers are transferable. E. sagittatum EST-SSR transferability to the major Epimedium germplasm is up to 85.7%. Therefore, this EST dataset and EST-SSRs will be a powerful resource for further studies such as taxonomy, molecular breeding, genetics, genomics, and secondary metabolism in Epimedium species. PMID:20141623
Li, Hui; Li, Defang; Chen, Anguo; Tang, Huijuan; Li, Jianjun; Huang, Siqi
2016-01-01
Kenaf (Hibiscus cannabinus L.) is an economically important natural fiber crop grown worldwide. However, only 20 expressed tag sequences (ESTs) for kenaf are available in public databases. The aim of this study was to develop large-scale simple sequence repeat (SSR) markers to lay a solid foundation for the construction of genetic linkage maps and marker-assisted breeding in kenaf. We used Illumina paired-end sequencing technology to generate new EST-simple sequences and MISA software to mine SSR markers. We identified 71,318 unigenes with an average length of 1143 nt and annotated these unigenes using four different protein databases. Overall, 9324 complementary pairs were designated as EST-SSR markers, and their quality was validated using 100 randomly selected SSR markers. In total, 72 primer pairs reproducibly amplified target amplicons, and 61 of these primer pairs detected significant polymorphism among 28 kenaf accessions. Thus, in this study, we have developed large-scale SSR markers for kenaf, and this new resource will facilitate construction of genetic linkage maps, investigation of fiber growth and development in kenaf, and also be of value to novel gene discovery and functional genomic studies. PMID:26960153
ESAP plus: a web-based server for EST-SSR marker development.
Ponyared, Piyarat; Ponsawat, Jiradej; Tongsima, Sissades; Seresangtakul, Pusadee; Akkasaeng, Chutipong; Tantisuwichwong, Nathpapat
2016-12-22
Simple sequence repeats (SSRs) have become widely used as molecular markers in plant genetic studies due to their abundance, high allelic variation at each locus and simplicity to analyze using conventional PCR amplification. To study plants with unknown genome sequence, SSR markers from Expressed Sequence Tags (ESTs), which can be obtained from the plant mRNA (converted to cDNA), must be utilized. With the advent of high-throughput sequencing technology, huge EST sequence data have been generated and are now accessible from many public databases. However, SSR marker identification from a large in-house or public EST collection requires a computational pipeline that makes use of several standard bioinformatic tools to design high quality EST-SSR primers. Some of these computational tools are not users friendly and must be tightly integrated with reference genomic databases. A web-based bioinformatic pipeline, called EST Analysis Pipeline Plus (ESAP Plus), was constructed for assisting researchers to develop SSR markers from a large EST collection. ESAP Plus incorporates several bioinformatic scripts and some useful standard software tools necessary for the four main procedures of EST-SSR marker development, namely 1) pre-processing, 2) clustering and assembly, 3) SSR mining and 4) SSR primer design. The proposed pipeline also provides two alternative steps for reducing EST redundancy and identifying SSR loci. Using public sugarcane ESTs, ESAP Plus automatically executed the aforementioned computational pipeline via a simple web user interface, which was implemented using standard PHP, HTML, CSS and Java scripts. With ESAP Plus, users can upload raw EST data and choose various filtering options and parameters to analyze each of the four main procedures through this web interface. All input EST data and their predicted SSR results will be stored in the ESAP Plus MySQL database. Users will be notified via e-mail when the automatic process is completed and they can download all the results through the web interface. ESAP Plus is a comprehensive and convenient web-based bioinformatic tool for SSR marker development. ESAP Plus offers all necessary EST-SSR development processes with various adjustable options that users can easily use to identify SSR markers from a large EST collection. With familiar web interface, users can upload the raw EST using the data submission page and visualize/download the corresponding EST-SSR information from within ESAP Plus. ESAP Plus can handle considerably large EST datasets. This EST-SSR discovery tool can be accessed directly from: http://gbp.kku.ac.th/esap_plus/ .
Jing, S; Liu, B; Peng, L; Peng, X; Zhu, L; Fu, Q; He, G
2012-02-01
To assess genetic diversity in populations of the brown planthopper (Nilaparvata lugens Stål) (Homoptera: Delphacidae), we have developed and applied microsatellite, or simple sequence repeat (SSR), markers from expressed sequence tags (ESTs). We found that the brown planthopper clusters of ESTs were rich in SSRs with unique frequencies and distributions of SSR motifs. Three hundred and fifty-one EST-SSR markers were developed and yielded clear bands from samples of four brown planthopper populations. High cross-species transferability of these markers was detected in the closely related planthopper N. muiri. The newly developed EST-SSR markers provided sufficient resolution to distinguish within and among biotypes. Analyses based on SSR data revealed host resistance-based genetic differentiation among different brown planthopper populations; the genetic diversity of populations feeding on susceptible rice varieties was lower than that of populations feeding on resistant rice varieties. This is the first large-scale development of brown planthopper SSR markers, which will be useful for future molecular genetics and genomics studies of this serious agricultural pest.
Analysis of SSR information in EST resources of sugarcane
USDA-ARS?s Scientific Manuscript database
Expressed sequence tags ( ESTs) offer the opportunity to exploit single, low -copy, conserved sequence motifs for the development of simple sequence repeats ( SSRs). The total of 262 113 ESTs of sugarcane (Saccharum officinarum) in the database of NCBI were downloaded and analyzed, which resulted in...
Wang, Q Z; Huang, M; Downie, S R; Chen, Z X
2016-05-23
Invasive plants tend to spread aggressively in new habitats and an understanding of their genetic diversity and population structure is useful for their management. In this study, expressed sequence tag-simple sequence repeat (EST-SSR) markers were developed for the invasive plant species Praxelis clematidea (Asteraceae) from 5548 Stevia rebaudiana (Asteraceae) expressed sequence tags (ESTs). A total of 133 microsatellite-containing ESTs (2.4%) were identified, of which 56 (42.1%) were hexanucleotide repeat motifs and 50 (37.6%) were trinucleotide repeat motifs. Of the 24 primer pairs designed from these 133 ESTs, 7 (29.2%) resulted in significant polymorphisms. The number of alleles per locus ranged from 5 to 9. The relatively high genetic diversity (H = 0.2667, I = 0.4212, and P = 100%) of P. clematidea was related to high gene flow (Nm = 1.4996) among populations. The coefficient of population differentiation (GST = 0.2500) indicated that most genetic variation occurred within populations. A Mantel test suggested that there was significant correlation between genetic distance and geographical distribution (r = 0.3192, P = 0.012). These results further support the transferability of EST-SSR markers between closely related genera of the same family.
Kalyana Babu, B; Pandey, Dinesh; Agrawal, P K; Sood, Salej; Kumar, Anil
2014-05-01
In recent years, the increased availability of the DNA sequences has given the possibility to develop and explore the expressed sequence tags (ESTs) derived SSR markers. In the present study, a total of 1956 ESTs of finger millet were used to find the microsatellite type, distribution, frequency and developed a total of 545 primer pairs from the ESTs of finger millet. Thirty-two EST sequences had more than two microsatellites and 1357 sequences did not have any SSR repeats. The most frequent type of repeats was trimeric motif, however the second place was occupied by dimeric motif followed by tetra-, hexa- and penta repeat motifs. The most common dimer repeat motif was GA and in case of trimeric SSRs, it was CGG. The EST sequences of NBS-LRR region of finger millet and rice showed higher synteny and were found on nearly same positions on the rice chromosome map. A total of eight, out of 15 EST based SSR primers were polymorphic among the selected resistant and susceptible finger millet genotypes. The primer FMBLEST5 could able to differentiate them into resistant and susceptible genotypes. The alleles specific to the resistant and susceptible genotypes were sequenced using the ABI 3130XL genetic analyzer and found similarity to NBS-LRR regions of rice and finger millet and contained the characteristic kinase-2 and kinase 3a motifs of plant R-genes belonged to NBS-LRR region. The In-silico and comparative analysis showed that the genes responsible for blast resistance can be identified, mapped and further introgressed through molecular breeding approaches for enhancing the blast resistance in finger millet.
NASA Astrophysics Data System (ADS)
Li, Qi; Shu, Jing; Zhao, Cui; Liu, Shikai; Kong, Lingfeng; Zheng, Xiaodong
2010-01-01
Simple sequence repeat (SSR) markers were developed from the expressed sequence tags (ESTs) of Pacific abalone ( Haliotis discus hannai). Repeat motifs were found in 4.95% of the ESTs at a frequency of one repeat every 10.04 kb of EST sequences, after redundancy elimination. Seventeen polymorphic EST-SSRs were developed. The number of alleles per locus varied from 2-17, with an average of 6.8 alleles per locus. The expected and observed heterozygosities ranged from 0.159 to 0.928 and from 0.132 to 0.922, respectively. Twelve of the 17 loci (70.6%) were successfully amplified in H. diversicolor. Seventeen loci segregated in three families, with three showing the presence of null alleles (17.6%). The adequate level of variability and low frequency of null alleles observed in H. discus hannai, together with the high rate of transportability across Haliotis species, make this set of EST-SSR markers an important tool for comparative mapping, marker-assisted selection, and evolutionary studies, not only in the Pacific abalone, but also in related species.
Bushakra, Jill M; Lewers, Kim S; Staton, Margaret E; Zhebentyayeva, Tetyana; Saski, Christopher A
2015-10-26
Due to a relatively high level of codominant inheritance and transferability within and among taxonomic groups, simple sequence repeat (SSR) markers are important elements in comparative mapping and delineation of genomic regions associated with traits of economic importance. Expressed sequence tags (ESTs) are a source of SSRs that can be used to develop markers to facilitate plant breeding and for more basic research across genera and higher plant orders. Leaf and meristem tissue from 'Heritage' red raspberry (Rubus idaeus) and 'Bristol' black raspberry (R. occidentalis) were utilized for RNA extraction. After conversion to cDNA and library construction, ESTs were sequenced, quality verified, assembled and scanned for SSRs. Primers flanking the SSRs were designed and a subset tested for amplification, polymorphism and transferability across species. ESTs containing SSRs were functionally annotated using the GenBank non-redundant (nr) database and further classified using the gene ontology database. To accelerate development of EST-SSRs in the genus Rubus (Rosaceae), 1149 and 2358 cDNA sequences were generated from red raspberry and black raspberry, respectively. The cDNA sequences were screened using rigorous filtering criteria which resulted in the identification of 121 and 257 SSR loci for red and black raspberry, respectively. Primers were designed from the surrounding sequences resulting in 131 and 288 primer pairs, respectively, as some sequences contained more than one SSR locus. Sequence analysis revealed that the SSR-containing genes span a diversity of functions and share more sequence identity with strawberry genes than with other Rosaceous species. This resource of Rubus-specific, gene-derived markers will facilitate the construction of linkage maps composed of transferable markers for studying and manipulating important traits in this economically important genus.
Ramu, P; Kassahun, B; Senthilvel, S; Ashok Kumar, C; Jayashree, B; Folkertsma, R T; Reddy, L Ananda; Kuruvinashetti, M S; Haussmann, B I G; Hash, C T
2009-11-01
The sequencing and detailed comparative functional analysis of genomes of a number of select botanical models open new doors into comparative genomics among the angiosperms, with potential benefits for improvement of many orphan crops that feed large populations. In this study, a set of simple sequence repeat (SSR) markers was developed by mining the expressed sequence tag (EST) database of sorghum. Among the SSR-containing sequences, only those sharing considerable homology with rice genomic sequences across the lengths of the 12 rice chromosomes were selected. Thus, 600 SSR-containing sorghum EST sequences (50 homologous sequences on each of the 12 rice chromosomes) were selected, with the intention of providing coverage for corresponding homologous regions of the sorghum genome. Primer pairs were designed and polymorphism detection ability was assessed using parental pairs of two existing sorghum mapping populations. About 28% of these new markers detected polymorphism in this 4-entry panel. A subset of 55 polymorphic EST-derived SSR markers were mapped onto the existing skeleton map of a recombinant inbred population derived from cross N13 x E 36-1, which is segregating for Striga resistance and the stay-green component of terminal drought tolerance. These new EST-derived SSR markers mapped across all 10 sorghum linkage groups, mostly to regions expected based on prior knowledge of rice-sorghum synteny. The ESTs from which these markers were derived were then mapped in silico onto the aligned sorghum genome sequence, and 88% of the best hits corresponded to linkage-based positions. This study demonstrates the utility of comparative genomic information in targeted development of markers to fill gaps in linkage maps of related crop species for which sufficient genomic tools are not available.
Huang, Li; Wu, Bei; Zhao, Jiaojiao; Li, Haitao; Chen, Weigang; Zheng, Yanli; Ren, Xiaoping; Chen, Yuning; Zhou, Xiaojing; Lei, Yong; Liao, Boshou; Jiang, Huifang
2016-01-01
Microsatellite or simple sequence repeat (SSR) is one of the most widely distributed molecular markers that have been widely utilized to assess genetic diversity and genetic mapping for important traits in plants. However, the understanding of microsatellite characteristics in Arachis species and the currently available amount of high-quality SSR markers remain limited. In this study, we identified 16,435 genome survey sequences SSRs (GSS-SSRs) and 40,199 expressed sequence tag SSRs (EST-SSRs) in Arachis hypogaea and its wild relative species using the publicly available sequence data. The GSS-SSRs had a density of 159.9-239.8 SSRs/Mb for wild Arachis and 1,015.8 SSR/Mb for cultivated Arachis, whereas the EST-SSRs had the density of 173.5-384.4 SSR/Mb and 250.9 SSRs/Mb for wild and cultivated Arachis, respectively. The trinucleotide SSRs were predominant across Arachis species, except that the dinucleotide accounted for most in A. hypogaea GSSs. From Arachis GSS-SSR and EST-SSR sequences, we developed 2,589 novel SSR markers that showed a high polymorphism in six diverse A. hypogaea accessions. A genetic linkage map that contained 540 novel SSR loci and 105 anchor SSR loci was constructed by case of a recombinant inbred lines F6 population. A subset of 82 randomly selected SSR markers were used to screen 39 wild and 22 cultivated Arachis accessions, which revealed a high transferability of the novel SSRs across Arachis species. Our results provided informative clues to investigate microsatellite patterns across A. hypogaea and its wild relative species and potentially facilitate the germplasm evaluation and gene mapping in Arachis species.
Development and Characterization of 1,906 EST-SSR Markers from Unigenes in Jute (Corchorus spp.)
Zhang, Liwu; Li, Yanru; Tao, Aifen; Fang, Pingping; Qi, Jianmin
2015-01-01
Jute, comprising white and dark jute, is the second important natural fiber crop after cotton worldwide. However, the lack of expressed sequence tag-derived simple sequence repeat (EST-SSR) markers has resulted in a large gap in the improvement of jute. Previously, de novo 48,914 unigenes from white jute were assembled. In this study, 1,906 EST-SSRs were identified from these assembled uingenes. Among these markers, di-, tri- and tetra-nucleotide repeat types were the abundant types (12.0%, 56.9% and 21.6% respectively). The AG-rich or GA-rich nucleotide repeats were the predominant. Subsequently, a sample of 116 SSRs, located in genes encoding transcription factors and cellulose synthases, were selected to survey polymorphisms among12 diverse jute accessions. Of these, 83.6% successfully amplified at least one fragment and detected polymorphism among the 12diverse genotypes, indicating that the newly developed SSRs are of good quality. Furthermore, the genetic similarity coefficients of all the 12 accessions were evaluated using 97 polymorphic SSRs. The cluster analysis divided the jute accessions into two main groups with genetic similarity coefficient of 0.61. These EST-SSR markers not only enrich molecular markers of jute genome, but also facilitate genetic and genomic researches in jute. PMID:26512891
Bioinformatic mining of EST-SSR loci in the Pacific oyster, Crassostrea gigas.
Wang, Y; Ren, R; Yu, Z
2008-06-01
A set of expressed sequence tag-simple sequence repeat (EST-SSR) markers of the Pacific oyster, Crassostrea gigas, was developed through bioinformatic mining of the GenBank public database. As of June 30, 2007, a total of 5132 EST sequences from GenBank were downloaded and screened for di-, tri- and tetra-nucleotide repeats, with criteria set at a minimum of 5, 4 and 4 repeats for the three categories of SSRs respectively. Seventeen polymorphic microsatellite markers were characterized. Allele numbers ranged from 3 to 10, and the observed and expected heterozygosity values varied from 0.125 to 0.770 and from 0.113 to 0.732 respectively. Eleven loci were at Hardy-Weinberg equilibrium (HWE); the other six loci showed significant departure from HWE (P < 0.01), suggesting possible presence of null alleles. Pairwise check of linkage disequilibrium (LD) indicated that 11 of 136 pairs of loci showed significant LD (P < 0.01), likely due to HWE present in single markers. Cross-species amplification was examined for five other Crassostrea species and reasonable results were obtained, promising usefulness of these markers in oyster genetics.
2012-01-01
Background Yellow lupin (Lupinus luteus L.) is a minor legume crop characterized by its high seed protein content. Although grown in several temperate countries, its orphan condition has limited the generation of genomic tools to aid breeding efforts to improve yield and nutritional quality. In this study, we report the construction of 454-expresed sequence tag (EST) libraries, carried out comparative studies between L. luteus and model legume species, developed a comprehensive set of EST-simple sequence repeat (SSR) markers, and validated their utility on diversity studies and transferability to related species. Results Two runs of 454 pyrosequencing yielded 205 Mb and 530 Mb of sequence data for L1 (young leaves, buds and flowers) and L2 (immature seeds) EST- libraries. A combined assembly (L1L2) yielded 71,655 contigs with an average contig length of 632 nucleotides. L1L2 contigs were clustered into 55,309 isotigs. 38,200 isotigs translated into proteins and 8,741 of them were full length. Around 57% of L. luteus sequences had significant similarity with at least one sequence of Medicago, Lotus, Arabidopsis, or Glycine, and 40.17% showed positive matches with all of these species. L. luteus isotigs were also screened for the presence of SSR sequences. A total of 2,572 isotigs contained at least one EST-SSR, with a frequency of one SSR per 17.75 kbp. Empirical evaluation of the EST-SSR candidate markers resulted in 222 polymorphic EST-SSRs. Two hundred and fifty four (65.7%) and 113 (30%) SSR primer pairs were able to amplify fragments from L. hispanicus and L. mutabilis DNA, respectively. Fifty polymorphic EST-SSRs were used to genotype a sample of 64 L. luteus accessions. Neighbor-joining distance analysis detected the existence of several clusters among L. luteus accessions, strongly suggesting the existence of population subdivisions. However, no clear clustering patterns followed the accession’s origin. Conclusion L. luteus deep transcriptome sequencing will facilitate the further development of genomic tools and lupin germplasm. Massive sequencing of cDNA libraries will continue to produce raw materials for gene discovery, identification of polymorphisms (SNPs, EST-SSRs, INDELs, etc.) for marker development, anchoring sequences for genome comparisons and putative gene candidates for QTL detection. PMID:22920992
Parra-González, Lorena B; Aravena-Abarzúa, Gabriela A; Navarro-Navarro, Cristell S; Udall, Joshua; Maughan, Jeff; Peterson, Louis M; Salvo-Garrido, Haroldo E; Maureira-Butler, Iván J
2012-08-24
Yellow lupin (Lupinus luteus L.) is a minor legume crop characterized by its high seed protein content. Although grown in several temperate countries, its orphan condition has limited the generation of genomic tools to aid breeding efforts to improve yield and nutritional quality. In this study, we report the construction of 454-expresed sequence tag (EST) libraries, carried out comparative studies between L. luteus and model legume species, developed a comprehensive set of EST-simple sequence repeat (SSR) markers, and validated their utility on diversity studies and transferability to related species. Two runs of 454 pyrosequencing yielded 205 Mb and 530 Mb of sequence data for L1 (young leaves, buds and flowers) and L2 (immature seeds) EST- libraries. A combined assembly (L1L2) yielded 71,655 contigs with an average contig length of 632 nucleotides. L1L2 contigs were clustered into 55,309 isotigs. 38,200 isotigs translated into proteins and 8,741 of them were full length. Around 57% of L. luteus sequences had significant similarity with at least one sequence of Medicago, Lotus, Arabidopsis, or Glycine, and 40.17% showed positive matches with all of these species. L. luteus isotigs were also screened for the presence of SSR sequences. A total of 2,572 isotigs contained at least one EST-SSR, with a frequency of one SSR per 17.75 kbp. Empirical evaluation of the EST-SSR candidate markers resulted in 222 polymorphic EST-SSRs. Two hundred and fifty four (65.7%) and 113 (30%) SSR primer pairs were able to amplify fragments from L. hispanicus and L. mutabilis DNA, respectively. Fifty polymorphic EST-SSRs were used to genotype a sample of 64 L. luteus accessions. Neighbor-joining distance analysis detected the existence of several clusters among L. luteus accessions, strongly suggesting the existence of population subdivisions. However, no clear clustering patterns followed the accession's origin. L. luteus deep transcriptome sequencing will facilitate the further development of genomic tools and lupin germplasm. Massive sequencing of cDNA libraries will continue to produce raw materials for gene discovery, identification of polymorphisms (SNPs, EST-SSRs, INDELs, etc.) for marker development, anchoring sequences for genome comparisons and putative gene candidates for QTL detection.
Barboza, Karina; Beretta, Vanesa; Kozub, Perla C; Salinas, Cecilia; Morgenfeld, Mauro M; Galmarini, Claudio R; Cavagnaro, Pablo F
2018-04-28
Allium vegetables, such as garlic and onion, have understudied genomes and limited molecular resources, hindering advances in genetic research and breeding of these species. In this study, we characterized and compared the simple sequence repeats (SSR) landscape in the transcriptomes of garlic and related Allium (A. cepa, A. fistulosum, and A. tuberosum) and non-Allium monocot species. In addition, 110 SSR markers were developed from garlic ESTs, and they were characterized-along with 112 previously developed SSRs-at various levels, including transferability across Alliaceae species, and their usefulness for genetic diversity analysis. Among the Allium species analyzed, garlic ESTs had the highest overall SSR density, the lowest frequency of trinucleotides, and the highest of di- and tetranucleotides. When compared to more distantly related monocots, outside the Asparagales order, it was evident that ESTs of Allium species shared major commonalities with regards to SSR density, frequency distribution, sequence motifs, and GC content. A significant fraction of the SSR markers were successfully transferred across Allium species, including crops for which no SSR markers have been developed yet, such as leek, shallot, chives, and elephant garlic. Diversity analysis of garlic cultivars with selected SSRs revealed 36 alleles, with 2-5 alleles/locus, and PIC = 0.38. Cluster analysis grouped the accessions according to their flowering behavior, botanical variety, and ecophysiological characteristics. Results from this study contribute to the characterization of Allium transcriptomes. The new SSR markers developed, along with the data from the polymorphism and transferability analyses, will aid in assisting genetic research and breeding in garlic and other Allium.
Wang, Haibin; Jiang, Jiafu; Chen, Sumei; Qi, Xiangyu; Peng, Hui; Li, Pirui; Song, Aiping; Guan, Zhiyong; Fang, Weimin; Liao, Yuan; Chen, Fadi
2013-01-01
Background Simple sequence repeats (SSRs) are ubiquitous in eukaryotic genomes. Chrysanthemum is one of the largest genera in the Asteraceae family. Only few Chrysanthemum expressed sequence tag (EST) sequences have been acquired to date, so the number of available EST-SSR markers is very low. Methodology/Principal Findings Illumina paired-end sequencing technology produced over 53 million sequencing reads from C. nankingense mRNA. The subsequent de novo assembly yielded 70,895 unigenes, of which 45,789 (64.59%) unigenes showed similarity to the sequences in NCBI database. Out of 45,789 sequences, 107 have hits to the Chrysanthemum Nr protein database; 679 and 277 sequences have hits to the database of Helianthus and Lactuca species, respectively. MISA software identified a large number of putative EST-SSRs, allowing 1,788 primer pairs to be designed from the de novo transcriptome sequence and a further 363 from archival EST sequence. Among 100 primer pairs randomly chosen, 81 markers have amplicons and 20 are polymorphic for genotypes analysis in Chrysanthemum. The results showed that most (but not all) of the assays were transferable across species and that they exposed a significant amount of allelic diversity. Conclusions/Significance SSR markers acquired by transcriptome sequencing are potentially useful for marker-assisted breeding and genetic analysis in the genus Chrysanthemum and its related genera. PMID:23626799
USDA-ARS?s Scientific Manuscript database
Polymorphic genetic markers were identified and characterized using a partial genomic library of Heliothis virescens enriched for simple sequence repeats (SSR) and nucleotide sequences of expressed sequence tags (EST). Nucleotide sequences of 192 clones from the partial genomic library yielded 147 u...
Allegre, Mathilde; Argout, Xavier; Boccara, Michel; Fouet, Olivier; Roguet, Yolande; Bérard, Aurélie; Thévenin, Jean Marc; Chauveau, Aurélie; Rivallan, Ronan; Clement, Didier; Courtois, Brigitte; Gramacho, Karina; Boland-Augé, Anne; Tahi, Mathias; Umaharan, Pathmanathan; Brunel, Dominique; Lanaud, Claire
2012-01-01
Theobroma cacao is an economically important tree of several tropical countries. Its genetic improvement is essential to provide protection against major diseases and improve chocolate quality. We discovered and mapped new expressed sequence tag-single nucleotide polymorphism (EST-SNP) and simple sequence repeat (SSR) markers and constructed a high-density genetic map. By screening 149 650 ESTs, 5246 SNPs were detected in silico, of which 1536 corresponded to genes with a putative function, while 851 had a clear polymorphic pattern across a collection of genetic resources. In addition, 409 new SSR markers were detected on the Criollo genome. Lastly, 681 new EST-SNPs and 163 new SSRs were added to the pre-existing 418 co-dominant markers to construct a large consensus genetic map. This high-density map and the set of new genetic markers identified in this study are a milestone in cocoa genomics and for marker-assisted breeding. The data are available at http://tropgenedb.cirad.fr. PMID:22210604
Sahu, Jagajjit; Das Talukdar, Anupam; Devi, Kamalakshi; Choudhury, Manabendra Dutta; Barooah, Madhumita; Modi, Mahendra Kumar; Sen, Priyabrata
2015-01-01
Abstract Centella asiatica (Gotu Kola) is a plant that grows in tropical swampy regions of the world and has important medicinal and culinary use. It is often considered as part of Ayurvedic medicine, traditional African medicine, and traditional Chinese medicine. The unavailability of genomics resources is significantly impeding its genetic improvement. To date, no attempt has been made to develop Expressed Sequence Tags (ESTs) derived Simple Sequence Repeat (SSR) markers (eSSRs) from the Centella genome. Hence, the present study aimed to develop eSSRs and their further experimental validation and cross-transferability of these markers in different genera of the Apiaceae family to which Centella belongs. An in-house pipeline was developed for the entire analyses by combining bioinformatics tools and perl scripts. A total of 4443 C. asiatica EST sequences from dbEST were processed, which generated 2617 nonredundant high quality EST sequences consisting 441 contigs and 2176 singletons. Out of 1776.5 kb of examined sequences, 417 (15.9%) ESTs containing 686 SSRs were detected with a density of one SSR per 2.59 kb. The gene ontology study revealed 282 functional domains involved in various processes, components, and functions, out of which 64 ESTs were found to have both SSRs and functional domains. Out of 603 designed EST-SSR primers, 18 pairs of primers were selected for validation based on the optimum parameter value. Reproducible amplification was obtained for six primer pairs in C. asiatica that were further tested for cross-transferability in nine other important genera/species of the Apiaceae family. Cross-transferability of the EST-SSR markers among the species were examined and Centella javanica showed highest transferability (83.3%). The study revealed six highly polymorphic EST-SSR primers with an average PIC value of 0.95. In conclusion, these EST-SSR markers hold a big promise for the genomics analysis of Centella asiatica, to facilitate comparative map-based analyses across other related species within the Apiaceae family, and future marker-assisted breeding programs. To the best of our knowledge, this is the first report of development of EST-SSRs in Centella asiatica by in silico approaches, which offers a veritable potential in further use in plant omics research and development.
Hendre, Prasad S.; Aggarwal, Ramesh K.
2014-01-01
Coffee breeding and improvement efforts can be greatly facilitated by availability of a large repository of simple sequence repeats (SSRs) based microsatellite markers, which provides efficiency and high-resolution in genetic analyses. This study was aimed to improve SSR availability in coffee by developing new genic−/genomic-SSR markers using in-silico bioinformatics and streptavidin-biotin based enrichment approach, respectively. The expressed sequence tag (EST) based genic microsatellite markers (EST-SSRs) were developed using the publicly available dataset of 13,175 unigene ESTs, which showed a distribution of 1 SSR/3.4 kb of coffee transcriptome. Genomic SSRs, on the other hand, were developed from an SSR-enriched small-insert partial genomic library of robusta coffee. In total, 69 new SSRs (44 EST-SSRs and 25 genomic SSRs) were developed and validated as suitable genetic markers. Diversity analysis of selected coffee genotypes revealed these to be highly informative in terms of allelic diversity and PIC values, and eighteen of these markers (∼27%) could be mapped on a robusta linkage map. Notably, the markers described here also revealed a very high cross-species transferability. In addition to the validated markers, we have also designed primer pairs for 270 putative EST-SSRs, which are expected to provide another ca. 200 useful genetic markers considering the high success rate (88%) of marker conversion of similar pairs tested/validated in this study. PMID:25461752
Augusto, Raphael; Maranho, Rone Charles; Mangolin, Claudete Aparecida; Pires da Silva Machado, Maria de Fátima
2015-01-01
High and low polymorphisms in simple sequence repeats of expressed sequence tag (EST-SSR) for specific proteins and enzymes, such as β-amylase, cellulose synthase, xyloglucan endotransglucosylase, fructose 1,6-bisphosphate aldolase, and fructose 1,6-bisphosphatase, were used to illustrate the genetic divergence within and between varieties of sugarcane (Saccharum spp.) and to guide the technological paths to optimize ethanol production from lignocellulose biomass. The varieties RB72454, RB867515, RB92579, and SP813250 on the second stage of cutting, all grown in the state of Paraná (PR), and the varieties RB92579 and SP813250 cultured in the PR state and in Northeastern Brazil, state of Pernambuco (PE), were analyzed using five EST-SSR primers for EstC66, EstC67, EstC68, EstC69, and EstC91 loci. Genetic divergence was evident in the EstC67 and EstC69 loci for β-amylase and cellulose synthase, respectively, among the four sugarcane varieties. An extremely high level of genetic differentiation was also detected in the EstC67 locus from the RB82579 and SP813250 varieties cultured in the PR and PE states. High polymorphism in SSR of the cellulose synthase locus may explain the high variability of substrates used in pretreatment and enzymatic hydrolysis processes, which has been an obstacle to effective industrial adaptations.
Luo, C; Zhang, F; Zhang, Q L; Guo, D Y; Luo, Z R
2013-01-09
We developed and characterized expressed sequence tags (ESTs)-simple sequence repeats (SSRs) and targeted region amplified polymorphism (TRAP) markers to examine genetic relationships in the persimmon genus Diospyros gene pool. In total, we characterized 14 EST-SSR primer pairs and 36 TRAP primer combinations, which were amplified across 20 germplasms of 4 species in the genus Diospyros. We used various genetic parameters, including effective multiplex ratio (EMR), diversity index (DI), and marker index (MI), to test the utility of these markers. TRAP markers gave higher EMR (24.85) but lower DI (0.33), compared to EST-SSRs (EMR = 3.65, DI = 0.34). TRAP gave a very high MI (8.08), which was about 8 times than the MI of EST-SSR (1.25). These markers were utilized for phylogenetic inference of 20 genotypes of Diospyros kaki Thunb. and allied species, with a result that all kaki genotypes clustered closely and 3 allied species formed an independent group. These markers could be further exploited for large-scale genetic relationship inference.
Chaudhary, Sakshi; Mishra, Bharat Kumar; Vivek, Thiruvettai; Magadum, Santoshkumar; Yasin, Jeshima Khan
2016-01-01
Simple Sequence Repeats or microsatellites are resourceful molecular genetic markers. There are only few reports of SSR identification and development in pineapple. Complete genome sequence of pineapple available in the public domain can be used to develop numerous novel SSRs. Therefore, an attempt was made to identify SSRs from genomic, chloroplast, mitochondrial and EST sequences of pineapple which will help in deciphering genetic makeup of its germplasm resources. A total of 359511 SSRs were identified in pineapple (356385 from genome sequence, 45 from chloroplast sequence, 249 in mitochondrial sequence and 2832 from EST sequences). The list of EST-SSR markers and their details are available in the database. PineElm_SSRdb is an open source database available for non-commercial academic purpose at http://app.bioelm.com/ with a mapping tool which can develop circular maps of selected marker set. This database will be of immense use to breeders, researchers and graduates working on Ananas spp. and to others working on cross-species transferability of markers, investigating diversity, mapping and DNA fingerprinting.
Genotyping variability of computationally categorized peach microsatellite markers
USDA-ARS?s Scientific Manuscript database
Numerous expressed sequence tag (EST) simple sequence repeat (SSR) primers can be easily mined out. The obstacle to develop them into usable markers is how to optimally select downsized subsets of the primers for genotyping, which accordingly reduces amplification failure and monomorphism often occu...
Siju, S; Dhanya, K; Syamkumar, S; Sasikumar, B; Sheeja, T E; Bhat, A I; Parthasarathy, V A
2010-02-01
Expressed sequence tags (ESTs) from turmeric (Curcuma longa L.) were used for the screening of type and frequency of Class I (hypervariable) simple sequence repeats (SSRs). A total of 231 microsatellite repeats were detected from 12,593 EST sequences of turmeric after redundancy elimination. The average density of Class I SSRs accounts to one SSR per 17.96 kb of EST. Mononucleotides were the most abundant class of microsatellite repeat in turmeric ESTs followed by trinucleotides. A robust set of 17 polymorphic EST-SSRs were developed and used for evaluating 20 turmeric accessions. The number of alleles detected ranged from 3 to 8 per loci. The developed markers were also evaluated in 13 related species of C. longa confirming high rate (100%) of cross species transferability. The polymorphic microsatellite markers generated from this study could be used for genetic diversity analysis and resolving the taxonomic confusion prevailing in the genus.
2010-01-01
Background Expressed Sequence Tag (EST) has been a cost-effective tool in molecular biology and represents an abundant valuable resource for genome annotation, gene expression, and comparative genomics in plants. Results In this study, we constructed a cDNA library of Prunus mume flower and fruit, sequenced 10,123 clones of the library, and obtained 8,656 expressed sequence tag (EST) sequences with high quality. The ESTs were assembled into 4,473 unigenes composed of 1,492 contigs and 2,981 singletons and that have been deposited in NCBI (accession IDs: GW868575 - GW873047), among which 1,294 unique ESTs were with known or putative functions. Furthermore, we found 1,233 putative simple sequence repeats (SSRs) in the P. mume unigene dataset. We randomly tested 42 pairs of PCR primers flanking potential SSRs, and 14 pairs were identified as true-to-type SSR loci and could amplify polymorphic bands from 20 individual plants of P. mume. We further used the 14 EST-SSR primer pairs to test the transferability on peach and plum. The result showed that nearly 89% of the primer pairs produced target PCR bands in the two species. A high level of marker polymorphism was observed in the plum species (65%) and low in the peach (46%), and the clustering analysis of the three species indicated that these SSR markers were useful in the evaluation of genetic relationships and diversity between and within the Prunus species. Conclusions We have constructed the first cDNA library of P. mume flower and fruit, and our data provide sets of molecular biology resources for P. mume and other Prunus species. These resources will be useful for further study such as genome annotation, new gene discovery, gene functional analysis, molecular breeding, evolution and comparative genomics between Prunus species. PMID:20626882
Lewers, Kim S; Saski, Chris A; Cuthbertson, Brandon J; Henry, David C; Staton, Meg E; Main, Dorrie S; Dhanaraj, Anik L; Rowland, Lisa J; Tomkins, Jeff P
2008-01-01
Background The recent development of novel repeat-fruiting types of blackberry (Rubus L.) cultivars, combined with a long history of morphological marker-assisted selection for thornlessness by blackberry breeders, has given rise to increased interest in using molecular markers to facilitate blackberry breeding. Yet no genetic maps, molecular markers, or even sequences exist specifically for cultivated blackberry. The purpose of this study is to begin development of these tools by generating and annotating the first blackberry expressed sequence tag (EST) library, designing primers from the ESTs to amplify regions containing simple sequence repeats (SSR), and testing the usefulness of a subset of the EST-SSRs with two blackberry cultivars. Results A cDNA library of 18,432 clones was generated from expanding leaf tissue of the cultivar Merton Thornless, a progenitor of many thornless commercial cultivars. Among the most abundantly expressed of the 3,000 genes annotated were those involved with energy, cell structure, and defense. From individual sequences containing SSRs, 673 primer pairs were designed. Of a randomly chosen set of 33 primer pairs tested with two blackberry cultivars, 10 detected an average of 1.9 polymorphic PCR products. Conclusion This rate predicts that this library may yield as many as 940 SSR primer pairs detecting 1,786 polymorphisms. This may be sufficient to generate a genetic map that can be used to associate molecular markers with phenotypic traits, making possible molecular marker-assisted breeding to compliment existing morphological marker-assisted breeding in blackberry. PMID:18570660
Wang, Xinwang; Wadl, Phillip A; Wood-Jones, Alicia; Windham, Gary; Trigiano, Robert N; Scruggs, Mary; Pilgrim, Candace; Baird, Richard
2012-12-01
Simple sequence repeat (SSR) markers were developed from Aspergillus flavus expressed sequence tag (EST) database to conduct an analysis of genetic relationships of Aspergillus isolates from numerous host species and geographical regions, but primarily from the United States. Twenty-nine primers were designed from 362 tri-nucleotide EST-SSR sequences. Eighteen polymorphic loci were used to genotype 96 Aspergillus species isolates. The number of alleles detected per locus ranged from 2 to 24 with a mean of 8.2 alleles. Haploid diversity ranged from 0.28 to 0.91. Genetic distance matrix was used to perform principal coordinates analysis (PCA) and to generate dendrograms using unweighted pair group method with arithmetic mean (UPGMA). Two principal coordinates explained more than 75 % of the total variation among the isolates. One clade was identified for A. flavus isolates (n = 87) with the other Aspergillus species (n = 7) using PCA, but five distinct clusters were present when the others taxa were excluded from the analysis. Six groups were noted when the EST-SSR data were compared using UPGMA. However, the latter PCA or UPGMA comparison resulted in no direct associations with host species, geographical region or aflatoxin production. Furthermore, there was no direct correlation to visible morphological features such as sclerotial types. The isolates from Mississippi Delta region, which contained the largest percentage of isolates, did not show any unusual clustering except for isolates K32, K55, and 199. Further studies of these three isolates are warranted to evaluate their pathogenicity, aflatoxin production potential, additional gene sequences (e.g., RPB2), and morphological comparisons.
Jin, Yuqing; Bi, Quanxin; Guan, Wenbin; Mao, Jian-Feng
2015-09-01
Metasequoia glyptostroboides is an endangered relict conifer species endemic to China. In this study, expressed sequence tag-simple sequence repeat (EST-SSR) markers were developed using transcriptome mining for future genetic and functional studies. We collected 97,565 unigene sequences generated by 454 pyrosequencing. A bioinformatics analysis identified 2087 unique and putative microsatellites, from which 96 novel microsatellite markers were developed. Fifty-three of the 96 primer sets successfully amplified clear fragments of the expected sizes; 23 of those loci were polymorphic. The number of alleles per locus ranged from two to eight, with an average of three, and the observed and expected heterozygosity values ranged from 0 to 1.0 and 0.117 to 0.813, respectively. These microsatellite loci will enrich the genetic resources to develop functional studies and conservation strategies for this endangered relict species.
Jin, Yuqing; Bi, Quanxin; Guan, Wenbin; Mao, Jian-Feng
2015-01-01
Premise of the study: Metasequoia glyptostroboides is an endangered relict conifer species endemic to China. In this study, expressed sequence tag–simple sequence repeat (EST-SSR) markers were developed using transcriptome mining for future genetic and functional studies. Methods and Results: We collected 97,565 unigene sequences generated by 454 pyrosequencing. A bioinformatics analysis identified 2087 unique and putative microsatellites, from which 96 novel microsatellite markers were developed. Fifty-three of the 96 primer sets successfully amplified clear fragments of the expected sizes; 23 of those loci were polymorphic. The number of alleles per locus ranged from two to eight, with an average of three, and the observed and expected heterozygosity values ranged from 0 to 1.0 and 0.117 to 0.813, respectively. Conclusions: These microsatellite loci will enrich the genetic resources to develop functional studies and conservation strategies for this endangered relict species. PMID:26421250
Liu, Le; Zhang, Shijie; Lian, Chunlan
2015-01-01
Japanese red pine (Pinus densiflora) is extensively cultivated in Japan, Korea, China, and Russia and is harvested for timber, pulpwood, garden, and paper markets. However, genetic information and molecular markers were very scarce for this species. In this study, over 51 million sequencing clean reads from P. densiflora mRNA were produced using Illumina paired-end sequencing technology. It yielded 83,913 unigenes with a mean length of 751 bp, of which 54,530 (64.98%) unigenes showed similarity to sequences in the NCBI database. Among which the best matches in the NCBI Nr database were Picea sitchensis (41.60%), Amborella trichopoda (9.83%), and Pinus taeda (4.15%). A total of 1953 putative microsatellites were identified in 1784 unigenes using MISA (MicroSAtellite) software, of which the tri-nucleotide repeats were most abundant (50.18%) and 629 EST-SSR (expressed sequence tag- simple sequence repeats) primer pairs were successfully designed. Among 20 EST-SSR primer pairs randomly chosen, 17 markers yielded amplification products of the expected size in P. densiflora. Our results will provide a valuable resource for gene-function analysis, germplasm identification, molecular marker-assisted breeding and resistance-related gene(s) mapping for pine for P. densiflora. PMID:26690126
Pyne, Robert; Honig, Josh; Vaiciunas, Jennifer; Koroch, Adolfina; Wyenandt, Christian; Bonos, Stacy; Simon, James
2017-01-01
Limited understanding of sweet basil (Ocimum basilicum L.) genetics and genome structure has reduced efficiency of breeding strategies. This is evidenced by the rapid, worldwide dissemination of basil downy mildew (Peronospora belbahrii) in the absence of resistant cultivars. In an effort to improve available genetic resources, expressed sequence tag simple sequence repeat (EST-SSR) and single nucleotide polymorphism (SNP) markers were developed and used to genotype the MRI x SB22 F2 mapping population, which segregates for response to downy mildew. SNP markers were generated from genomic sequences derived from double digestion restriction site associated DNA sequencing (ddRADseq). Disomic segregation was observed in both SNP and EST-SSR markers providing evidence of an O. basilicum allotetraploid genome structure and allowing for subsequent analysis of the mapping population as a diploid intercross. A dense linkage map was constructed using 42 EST-SSR and 1,847 SNP markers spanning 3,030.9 cM. Multiple quantitative trait loci (QTL) model (MQM) analysis identified three QTL that explained 37-55% of phenotypic variance associated with downy mildew response across three environments. A single major QTL, dm11.1 explained 21-28% of phenotypic variance and demonstrated dominant gene action. Two minor QTL dm9.1 and dm14.1 explained 5-16% and 4-18% of phenotypic variance, respectively. Evidence is provided for an additive effect between the two minor QTL and the major QTL dm11.1 increasing downy mildew susceptibility. Results indicate that ddRADseq-facilitated SNP and SSR marker genotyping is an effective approach for mapping the sweet basil genome.
Honig, Josh; Vaiciunas, Jennifer; Koroch, Adolfina; Wyenandt, Christian; Bonos, Stacy; Simon, James
2017-01-01
Limited understanding of sweet basil (Ocimum basilicum L.) genetics and genome structure has reduced efficiency of breeding strategies. This is evidenced by the rapid, worldwide dissemination of basil downy mildew (Peronospora belbahrii) in the absence of resistant cultivars. In an effort to improve available genetic resources, expressed sequence tag simple sequence repeat (EST-SSR) and single nucleotide polymorphism (SNP) markers were developed and used to genotype the MRI x SB22 F2 mapping population, which segregates for response to downy mildew. SNP markers were generated from genomic sequences derived from double digestion restriction site associated DNA sequencing (ddRADseq). Disomic segregation was observed in both SNP and EST-SSR markers providing evidence of an O. basilicum allotetraploid genome structure and allowing for subsequent analysis of the mapping population as a diploid intercross. A dense linkage map was constructed using 42 EST-SSR and 1,847 SNP markers spanning 3,030.9 cM. Multiple quantitative trait loci (QTL) model (MQM) analysis identified three QTL that explained 37–55% of phenotypic variance associated with downy mildew response across three environments. A single major QTL, dm11.1 explained 21–28% of phenotypic variance and demonstrated dominant gene action. Two minor QTL dm9.1 and dm14.1 explained 5–16% and 4–18% of phenotypic variance, respectively. Evidence is provided for an additive effect between the two minor QTL and the major QTL dm11.1 increasing downy mildew susceptibility. Results indicate that ddRADseq-facilitated SNP and SSR marker genotyping is an effective approach for mapping the sweet basil genome. PMID:28922359
Brandon Schlautman; Vera Pfeiffer; Juan Zalapa; Johanne Brunet
2014-01-01
Numerous microsatellite markers were developed for Aquilegia formosafrom sequences deposited within the Expressed Sequence Tag (EST), Genomic Survey Sequence (GSS), and Nucleotide databases in NCBI. Microsatellites (SSRs) were identified and primers were designed for 9 SSR containing sequences in the Nucleotide database, 3803 sequences in the EST...
2010-01-01
Background Genetic markers and linkage mapping are basic prerequisites for marker-assisted selection and map-based cloning. In the case of the key grassland species Lolium spp., numerous mapping populations have been developed and characterised for various traits. Although some genetic linkage maps of these populations have been aligned with each other using publicly available DNA markers, the number of common markers among genetic maps is still low, limiting the ability to compare candidate gene and QTL locations across germplasm. Results A set of 204 expressed sequence tag (EST)-derived simple sequence repeat (SSR) markers has been assigned to map positions using eight different ryegrass mapping populations. Marker properties of a subset of 64 EST-SSRs were assessed in six to eight individuals of each mapping population and revealed 83% of the markers to be polymorphic in at least one population and an average number of alleles of 4.88. EST-SSR markers polymorphic in multiple populations served as anchor markers and allowed the construction of the first comprehensive consensus map for ryegrass. The integrated map was complemented with 97 SSRs from previously published linkage maps and finally contained 284 EST-derived and genomic SSR markers. The total map length was 742 centiMorgan (cM), ranging for individual chromosomes from 70 cM of linkage group (LG) 6 to 171 cM of LG 2. Conclusions The consensus linkage map for ryegrass based on eight mapping populations and constructed using a large set of publicly available Lolium EST-SSRs mapped for the first time together with previously mapped SSR markers will allow for consolidating existing mapping and QTL information in ryegrass. Map and markers presented here will prove to be an asset in the development for both molecular breeding of ryegrass as well as comparative genetics and genomics within grass species. PMID:20712870
Construction of new EST-SSRs for Fusarium resistant wheat breeding.
Yumurtaci, Aysen; Sipahi, Hulya; Al-Abdallat, Ayed; Jighly, Abdulqader; Baum, Michael
2017-06-01
Surveying Fusarium resistance in wheat with easy applicable molecular markers such as simple sequence repeats (SSRs) is a prerequest for molecular breeding. Expressed sequence tags (ESTs) are one of the main sources for development of new SSR candidates. Therefore, 18.292 publicly available wheat ESTs were mined and genotyping of newly developed 55 EST-SSR derived primer pairs produced clear fragments in ten wheat cultivars carrying different levels of Fusarium resistance. Among the proved markers, 23 polymorphic EST-SSRs were obtained and related alleles were mostly found on B and D genome. Based on the fragment profiling and similarity analysis, a 327bp amplicon, which was a product of contig 1207 (chromosome 5BL), was detected only in Fusarium head blight (FHB) resistant cultivars (CM82036 and Sumai) and the amino acid sequences showed a similarity to pathogen related proteins. Another FHB resistance related EST-SSR, Contig 556 (chromosome 1BL) produced a 151bp fragment in Sumai and was associated to wax2-like protein. A polymorphic 204bp fragment, derived from Contig 578 (chromosome 1DL), was generated from root rot (FRR) resistant cultivars (2-49; Altay2000 and Sunco). A total of 98 alleles were displayed with an average of 1.8 alleles per locus and the polymorphic information content (PIC) ranged from 0.11 to 0.78. Dendrogram tree with two main and five sub-groups were displayed the highest genetic relationship between FRR resistant cultivars (2-49 and Altay2000), FRR sensitive cultivars (Seri82 and Scout66) and FHB resistant cultivars (CM82036 and Sumai). Thus, exploitation of these candidate EST-SSRs may help to genotype other wheat sources for Fusarium resistance. Copyright © 2017 Elsevier Ltd. All rights reserved.
Wu, Zhigang; Wu, Jinwei; Wang, Yalin; Hou, Hongwei
2017-01-01
Premise of the study: Microsatellite or simple sequence repeat (SSR) markers were developed to investigate the influence of ecological factors on gene flow and spatial genetic structuring of the submerged plant Ranunculus bungei (Ranunculaceae), which is regarded as an important species for understanding how plants adapt to an aquatic environment. Methods and Results: Twenty-two microsatellite loci were identified from an expressed sequence tag (EST) library. The number of alleles per locus ranged from one to five, and the expected heterozygosity varied from 0.0 to 0.5 in four Chinese populations of R. bungei. Fourteen loci were polymorphic and significantly deviated from Hardy–Weinberg equilibrium. All of the loci were found to be amplifiable in two other species of Ranunculus section Batrachium, and cross-amplification in six riparian and aquatic species of Ranunculaceae was also partially successful. Conclusions: These novel EST-SSR markers will be useful for ecological and evolutionary studies of R. bungei as well as related species. PMID:28791205
Zhang, Gu-wen; Xu, Sheng-chun; Mao, Wei-hua; Hu, Qi-zan; Gong, Ya-ming
2013-01-01
The development of expressed sequence tag-derived simple sequence repeats (EST-SSRs) provided a useful tool for investigating plant genetic diversity. In the present study, 22 polymorphic EST-SSRs from grain soybean were identified and used to assess the genetic diversity in 48 vegetable soybean accessions. Among the 22 EST-SSR loci, tri-nucleotides were the most abundant repeats, accounting for 50.00% of the total motifs. GAA was the most common motif among tri-nucleotide repeats, with a frequency of 18.18%. Polymorphic analysis identified a total of 71 alleles, with an average of 3.23 per locus. The polymorphism information content (PIC) values ranged from 0.144 to 0.630, with a mean of 0.386. Observed heterozygosity (H o) values varied from 0.0196 to 1.0000, with an average of 0.6092, while the expected heterozygosity (H e) values ranged from 0.1502 to 0.6840, with a mean value of 0.4616. Principal coordinate analysis and phylogenetic tree analysis indicated that the accessions could be assigned to different groups based to a large extent on their geographic distribution, and most accessions from China were clustered into the same groups. These results suggest that Chinese vegetable soybean accessions have a narrow genetic base. The results of this study indicate that EST-SSRs from grain soybean have high transferability to vegetable soybean, and that these new markers would be helpful in taxonomy, molecular breeding, and comparative mapping studies of vegetable soybean in the future. PMID:23549845
Blair, Matthew W; Hurtado, Natalia; Chavarro, Carolina M; Muñoz-Torres, Monica C; Giraldo, Martha C; Pedraza, Fabio; Tomkins, Jeff; Wing, Rod
2011-03-22
Sequencing of cDNA libraries for the development of expressed sequence tags (ESTs) as well as for the discovery of simple sequence repeats (SSRs) has been a common method of developing microsatellites or SSR-based markers. In this research, our objective was to further sequence and develop common bean microsatellites from leaf and root cDNA libraries derived from the Andean gene pool accession G19833 and the Mesoamerican gene pool accession DOR364, mapping parents of a commonly used reference map. The root libraries were made from high and low phosphorus treated plants. A total of 3,123 EST sequences from leaf and root cDNA libraries were screened and used for direct simple sequence repeat discovery. From these EST sequences we found 184 microsatellites; the majority containing tri-nucleotide motifs, many of which were GC rich (ACC, AGC and AGG in particular). Di-nucleotide motif microsatellites were about half as common as the tri-nucleotide motif microsatellites but most of these were AGn microsatellites with a moderate number of ATn microsatellites in root ESTs followed by few ACn and no GCn microsatellites. Out of the 184 new SSR loci, 120 new microsatellite markers were developed in the BMc (Bean Microsatellites from cDNAs) series and these were evaluated for their capacity to distinguish bean diversity in a germplasm panel of 18 genotypes. We developed a database with images of the microsatellites and their polymorphism information content (PIC), which averaged 0.310 for polymorphic markers. The present study produced information about microsatellite frequency in root and leaf tissues of two important genotypes for common bean genomics: namely G19833, the Andean genotype selected for whole genome shotgun sequencing from race Peru, and DOR364 a race Mesoamerica subgroup 2 genotype that is a small-red seeded, released variety in Central America. Both race Peru and Mesoamerica subgroup 2 (small red beans) have been understudied in comparison to race Nueva Granada and Mesoamerica subgroup 1 (black beans) both with regards to gene expression and as sources of markers. However, we found few differences between SSR type and frequency between the G19833 leaf and DOR364 root tissue-derived ESTs. Overall, our work adds to the analysis of microsatellite frequency evaluation for common bean and provides a new set of 120 BMc markers which combined with the 248 previously developed BMc markers brings the total in this series to 368 markers. Once we include BMd markers, which are derived from GenBank sequences, the current total of gene-based markers from our laboratory surpasses 500 markers. These markers are basic for studies of the transcriptome of common bean and can form anchor points for genetic mapping studies in the future.
Two EST-derived marker systems for cultivar identification in tree peony.
Zhang, J J; Shu, Q Y; Liu, Z A; Ren, H X; Wang, L S; De Keyser, E
2012-02-01
Tree peony (Paeonia suffruticosa Andrews), a woody deciduous shrub, belongs to the section Moutan DC. in the genus of Paeonia of the Paeoniaceae family. To increase the efficiency of breeding, two EST-derived marker systems were developed based on a tree peony expressed sequence tag (EST) database. Using target region amplification polymorphism (TRAP), 19 of 39 primer pairs showed good amplification for 56 accessions with amplicons ranging from 120 to 3,000 bp long, among which 99.3% were polymorphic. In contrast, 7 of 21 primer pairs demonstrated adequate amplification with clear bands for simple sequence repeats (SSRs) developed from ESTs, and a total of 33 alleles were found in 56 accessions. The similarity matrices generated by TRAP and EST-SSR markers were compared, and the Mantel test (r = 0.57778, P = 0.0020) showed a moderate correlation between the two types of molecular markers. TRAP markers were suitable for DNA fingerprinting and EST-SSR markers were more appropriate for discriminating synonyms (the same cultivars with different names due to limited information exchanged among different geographic areas). The two sets of EST-derived markers will be used further for genetic linkage map construction and quantitative trait locus detection in tree peony.
First genetic linkage map of Taraxacum koksaghyz Rodin based on AFLP, SSR, COS and EST-SSR markers.
Arias, Marina; Hernandez, Monica; Remondegui, Naroa; Huvenaars, Koen; van Dijk, Peter; Ritter, Enrique
2016-08-04
Taraxacum koksaghyz Rodin (TKS) has been studied in many occasions as a possible alternative source for natural rubber production of good quality and for inulin production. Some tire companies are already testing TKS tire prototypes. There are also many investigations on the production of bio-fuels from inulin and inulin applications for health improvement and in the food industry. A limited amount of genomic resources exist for TKS and particularly no genetic linkage map is available in this species. We have constructed the first TKS genetic linkage map based on AFLP, COS, SSR and EST-SSR markers. The integrated linkage map with eight linkage groups (LG), representing the eight chromosomes of Russian dandelion, has 185 individual AFLP markers from parent 1, 188 individual AFLP markers from parent 2, 75 common AFLP markers and 6 COS, 1 SSR and 63 EST-SSR loci. Blasting the EST-SSR sequences against known sequences from lettuce allowed a partial alignment of our TKS map with a lettuce map. Blast searches against plant gene databases revealed some homologies with useful genes for downstream applications in the future.
tropiTree: An NGS-Based EST-SSR Resource for 24 Tropical Tree Species
Russell, Joanne R.; Hedley, Peter E.; Cardle, Linda; Dancey, Siobhan; Morris, Jenny; Booth, Allan; Odee, David; Mwaura, Lucy; Omondi, William; Angaine, Peter; Machua, Joseph; Muchugi, Alice; Milne, Iain; Kindt, Roeland; Jamnadass, Ramni; Dawson, Ian K.
2014-01-01
The development of genetic tools for non-model organisms has been hampered by cost, but advances in next-generation sequencing (NGS) have created new opportunities. In ecological research, this raises the prospect for developing molecular markers to simultaneously study important genetic processes such as gene flow in multiple non-model plant species within complex natural and anthropogenic landscapes. Here, we report the use of bar-coded multiplexed paired-end Illumina NGS for the de novo development of expressed sequence tag-derived simple sequence repeat (EST-SSR) markers at low cost for a range of 24 tree species. Each chosen tree species is important in complex tropical agroforestry systems where little is currently known about many genetic processes. An average of more than 5,000 EST-SSRs was identified for each of the 24 sequenced species, whereas prior to analysis 20 of the species had fewer than 100 nucleotide sequence citations. To make results available to potential users in a suitable format, we have developed an open-access, interactive online database, tropiTree (http://bioinf.hutton.ac.uk/tropiTree), which has a range of visualisation and search facilities, and which is a model for the efficient presentation and application of NGS data. PMID:25025376
Li, Zhi-Zhong; Lu, Meng-Xue; Saina, Josphat K; Gichira, Andrew W; Wang, Qing-Feng; Chen, Jin-Ming
2017-11-01
Simple sequence repeat (SSR) markers were derived from transcriptomic data for Ottelia acuminata (Hydrocharitaceae), a species comprising five endemic and highly endangered varieties in China. Sixteen novel SSR markers were developed for O. acuminata var. jingxiensis . One to eight alleles per locus were found, with a mean of 2.896. The observed and expected heterozygosity ranged from 0.000 to 1.000 and 0.000 to 0.793, respectively. Interestingly, in cross-varietal amplification, 13 out of the 16 loci were successfully amplified in O. acuminata var. acuminata , and 12 amplified in each of the other three varieties of O. acuminata . These newly developed SSR markers will facilitate further study of genetic variation and provide important genetic data needed for appropriate conservation of natural populations of all varieties of O. acuminata .
Mousavi, Soraya; Mariotti, Roberto; Regni, Luca; Nasini, Luigi; Bufacchi, Marina; Pandolfi, Saverio; Baldoni, Luciana; Proietti, Primo
2017-01-01
Germplasm collections of tree crop species represent fundamental tools for conservation of diversity and key steps for its characterization and evaluation. For the olive tree, several collections were created all over the world, but only few of them have been fully characterized and molecularly identified. The olive collection of Perugia University (UNIPG), established in the years' 60, represents one of the first attempts to gather and safeguard olive diversity, keeping together cultivars from different countries. In the present study, a set of 370 olive trees previously uncharacterized was screened with 10 standard simple sequence repeats (SSRs) and nine new EST-SSR markers, to correctly and thoroughly identify all genotypes, verify their representativeness of the entire cultivated olive variation, and validate the effectiveness of new markers in comparison to standard genotyping tools. The SSR analysis revealed the presence of 59 genotypes, corresponding to 72 well known cultivars, 13 of them resulting exclusively present in this collection. The new EST-SSRs have shown values of diversity parameters quite similar to those of best standard SSRs. When compared to hundreds of Mediterranean cultivars, the UNIPG olive accessions were splitted into the three main populations (East, Center and West Mediterranean), confirming that the collection has a good representativeness of the entire olive variability. Furthermore, Bayesian analysis, performed on the 59 genotypes of the collection by the use of both sets of markers, have demonstrated their splitting into four clusters, with a well balanced membership obtained by EST respect to standard SSRs. The new OLEST ( Olea expressed sequence tags) SSR markers resulted as effective as the best standard markers. The information obtained from this study represents a high valuable tool for ex situ conservation and management of olive genetic resources, useful to build a common database from worldwide olive cultivar collections, also based on recently developed markers.
Kaur, Sukhjiwan; Cogan, Noel O I; Pembleton, Luke W; Shinozuka, Maiko; Savin, Keith W; Materne, Michael; Forster, John W
2011-05-25
Lentil (Lens culinaris Medik.) is a cool-season grain legume which provides a rich source of protein for human consumption. In terms of genomic resources, lentil is relatively underdeveloped, in comparison to other Fabaceae species, with limited available data. There is hence a significant need to enhance such resources in order to identify novel genes and alleles for molecular breeding to increase crop productivity and quality. Tissue-specific cDNA samples from six distinct lentil genotypes were sequenced using Roche 454 GS-FLX Titanium technology, generating c. 1.38 × 106 expressed sequence tags (ESTs). De novo assembly generated a total of 15,354 contigs and 68,715 singletons. The complete unigene set was sequence-analysed against genome drafts of the model legume species Medicago truncatula and Arabidopsis thaliana to identify 12,639, and 7,476 unique matches, respectively. When compared to the genome of Glycine max, a total of 20,419 unique hits were observed corresponding to c. 31% of the known gene space. A total of 25,592 lentil unigenes were subsequently annoated from GenBank. Simple sequence repeat (SSR)-containing ESTs were identified from consensus sequences and a total of 2,393 primer pairs were designed. A subset of 192 EST-SSR markers was screened for validation across a panel 12 cultivated lentil genotypes and one wild relative species. A total of 166 primer pairs obtained successful amplification, of which 47.5% detected genetic polymorphism. A substantial collection of ESTs has been developed from sequence analysis of lentil genotypes using second-generation technology, permitting unigene definition across a broad range of functional categories. As well as providing resources for functional genomics studies, the unigene set has permitted significant enhancement of the number of publicly-available molecular genetic markers as tools for improvement of this species.
Sahu, Jagajjit; Sen, Priyabrata; Choudhury, Manabendra Dutta; Dehury, Budheswar; Barooah, Madhumita; Modi, Mahendra Kumar
2014-01-01
Abstract Herbal medicines and traditionally used medicinal plants present an untapped potential for novel molecular target discovery using systems science and OMICS biotechnology driven strategies. Since up to 40% of the world's poor people have no access to government health services, traditional and folk medicines are often the only therapeutics available to them. In this vein, North East (NE) India is recognized for its rich bioresources. As part of the Indo-Burma hotspot, it is regarded as an epicenter of biodiversity for several plants having myriad traditional uses, including medicinal use. However, the improvement of these valuable bioresources through molecular breeding strategies, for example, using genic microsatellites or Simple Sequence Repeats (SSRs) or Expressed Sequence Tags (ESTs)-derived SSRs has not been fully utilized in large scale to date. In this study, we identified a total of 47,700 microsatellites from 109,609 ESTs of 11 medicinal plants (pineapple, papaya, noyontara, bitter orange, bermuda brass, ratalu, barbados nut, mango, mulberry, lotus, and guduchi) having proven antidiabetic properties. A total of 58,159 primer pairs were designed for the non-redundant 8060 SSR-positive ESTs and putative functions were assigned to 4483 unique contigs. Among the identified microsatellites, excluding mononucleotide repeats, di-/trinucleotides are predominant, among which repeat motifs of AG/CT and AAG/CTT were most abundant. Similarity search of SSR containing ESTs and antidiabetic gene sequences revealed 11 microsatellites linked to antidiabetic genes in five plants. GO term enrichment analysis revealed a total of 80 enriched GO terms widely distributed in 53 biological processes, 17 molecular functions, and 10 cellular components associated with the 11 markers. The present study therefore provides concrete insights into the frequency and distribution of SSRs in important medicinal resources. The microsatellite markers reported here markedly add to the genetic stock for cross transferability in these plants and the literature on biomarkers and novel drug discovery for common chronic diseases such as diabetes. PMID:24802971
Sahu, Jagajjit; Sen, Priyabrata; Choudhury, Manabendra Dutta; Dehury, Budheswar; Barooah, Madhumita; Modi, Mahendra Kumar; Talukdar, Anupam Das
2014-05-01
Herbal medicines and traditionally used medicinal plants present an untapped potential for novel molecular target discovery using systems science and OMICS biotechnology driven strategies. Since up to 40% of the world's poor people have no access to government health services, traditional and folk medicines are often the only therapeutics available to them. In this vein, North East (NE) India is recognized for its rich bioresources. As part of the Indo-Burma hotspot, it is regarded as an epicenter of biodiversity for several plants having myriad traditional uses, including medicinal use. However, the improvement of these valuable bioresources through molecular breeding strategies, for example, using genic microsatellites or Simple Sequence Repeats (SSRs) or Expressed Sequence Tags (ESTs)-derived SSRs has not been fully utilized in large scale to date. In this study, we identified a total of 47,700 microsatellites from 109,609 ESTs of 11 medicinal plants (pineapple, papaya, noyontara, bitter orange, bermuda brass, ratalu, barbados nut, mango, mulberry, lotus, and guduchi) having proven antidiabetic properties. A total of 58,159 primer pairs were designed for the non-redundant 8060 SSR-positive ESTs and putative functions were assigned to 4483 unique contigs. Among the identified microsatellites, excluding mononucleotide repeats, di-/trinucleotides are predominant, among which repeat motifs of AG/CT and AAG/CTT were most abundant. Similarity search of SSR containing ESTs and antidiabetic gene sequences revealed 11 microsatellites linked to antidiabetic genes in five plants. GO term enrichment analysis revealed a total of 80 enriched GO terms widely distributed in 53 biological processes, 17 molecular functions, and 10 cellular components associated with the 11 markers. The present study therefore provides concrete insights into the frequency and distribution of SSRs in important medicinal resources. The microsatellite markers reported here markedly add to the genetic stock for cross transferability in these plants and the literature on biomarkers and novel drug discovery for common chronic diseases such as diabetes.
First genetic linkage map of Taraxacum koksaghyz Rodin based on AFLP, SSR, COS and EST-SSR markers
Arias, Marina; Hernandez, Monica; Remondegui, Naroa; Huvenaars, Koen; van Dijk, Peter; Ritter, Enrique
2016-01-01
Taraxacum koksaghyz Rodin (TKS) has been studied in many occasions as a possible alternative source for natural rubber production of good quality and for inulin production. Some tire companies are already testing TKS tire prototypes. There are also many investigations on the production of bio-fuels from inulin and inulin applications for health improvement and in the food industry. A limited amount of genomic resources exist for TKS and particularly no genetic linkage map is available in this species. We have constructed the first TKS genetic linkage map based on AFLP, COS, SSR and EST-SSR markers. The integrated linkage map with eight linkage groups (LG), representing the eight chromosomes of Russian dandelion, has 185 individual AFLP markers from parent 1, 188 individual AFLP markers from parent 2, 75 common AFLP markers and 6 COS, 1 SSR and 63 EST-SSR loci. Blasting the EST-SSR sequences against known sequences from lettuce allowed a partial alignment of our TKS map with a lettuce map. Blast searches against plant gene databases revealed some homologies with useful genes for downstream applications in the future. PMID:27488242
2012-01-01
Background There are several reports describing thousands of SSR markers in the peanut (Arachis hypogaea L.) genome. There is a need to integrate various research reports of peanut DNA polymorphism into a single platform. Further, because of lack of uniformity in the labeling of these markers across the publications, there is some confusion on the identities of many markers. We describe below an effort to develop a central comprehensive database of polymorphic SSR markers in peanut. Findings We compiled 1,343 SSR markers as detecting polymorphism (14.5%) within a total of 9,274 markers. Amongst all polymorphic SSRs examined, we found that AG motif (36.5%) was the most abundant followed by AAG (12.1%), AAT (10.9%), and AT (10.3%).The mean length of SSR repeats in dinucleotide SSRs was significantly longer than that in trinucleotide SSRs. Dinucleotide SSRs showed higher polymorphism frequency for genomic SSRs when compared to trinucleotide SSRs, while for EST-SSRs, the frequency of polymorphic SSRs was higher in trinucleotide SSRs than in dinucleotide SSRs. The correlation of the length of SSR and the frequency of polymorphism revealed that the frequency of polymorphism was decreased as motif repeat number increased. Conclusions The assembled polymorphic SSRs would enhance the density of the existing genetic maps of peanut, which could also be a useful source of DNA markers suitable for high-throughput QTL mapping and marker-assisted selection in peanut improvement and thus would be of value to breeders. PMID:22818284
2011-01-01
Background Over recent years, a growing effort has been made to develop microsatellite markers for the genomic analysis of the common bean (Phaseolus vulgaris) to broaden the knowledge of the molecular genetic basis of this species. The availability of large sets of expressed sequence tags (ESTs) in public databases has given rise to an expedient approach for the identification of SSRs (Simple Sequence Repeats), specifically EST-derived SSRs. In the present work, a battery of new microsatellite markers was obtained from a search of the Phaseolus vulgaris EST database. The diversity, degree of transferability and polymorphism of these markers were tested. Results From 9,583 valid ESTs, 4,764 had microsatellite motifs, from which 377 were used to design primers, and 302 (80.11%) showed good amplification quality. To analyze transferability, a group of 167 SSRs were tested, and the results showed that they were 82% transferable across at least one species. The highest amplification rates were observed between the species from the Phaseolus (63.7%), Vigna (25.9%), Glycine (19.8%), Medicago (10.2%), Dipterix (6%) and Arachis (1.8%) genera. The average PIC (Polymorphism Information Content) varied from 0.53 for genomic SSRs to 0.47 for EST-SSRs, and the average number of alleles per locus was 4 and 3, respectively. Among the 315 newly tested SSRs in the BJ (BAT93 X Jalo EEP558) population, 24% (76) were polymorphic. The integration of these segregant loci into a framework map composed of 123 previously obtained SSR markers yielded a total of 199 segregant loci, of which 182 (91.5%) were mapped to 14 linkage groups, resulting in a map length of 1,157 cM. Conclusions A total of 302 newly developed EST-SSR markers, showing good amplification quality, are available for the genetic analysis of Phaseolus vulgaris. These markers showed satisfactory rates of transferability, especially between species that have great economic and genomic values. Their diversity was comparable to genomic SSRs, and they were incorporated in the common bean reference genetic map, which constitutes an important contribution to and advance in Phaseolus vulgaris genomic research. PMID:21554695
Identification and characterization of gene-based SSR markers in date palm (Phoenix dactylifera L.).
Zhao, Yongli; Williams, Roxanne; Prakash, C S; He, Guohao
2012-12-15
Date palm (Phoenix dactylifera L.) is an important tree in the Middle East and North Africa due to the nutritional value of its fruit. Molecular Breeding would accelerate genetic improvement of fruit tree through marker assisted selection. However, the lack of molecular markers in date palm restricts the application of molecular breeding. In this study, we analyzed 28,889 EST sequences from the date palm genome database to identify simple-sequence repeats (SSRs) and to develop gene-based markers, i.e. expressed sequence tag-SSRs (EST-SSRs). We identified 4,609 ESTs as containing SSRs, among which, trinucleotide motifs (69.7%) were the most common, followed by tetranucleotide (10.4%) and dinucleotide motifs (9.6%). The motif AG (85.7%) was most abundant in dinucleotides, while motifs AGG (26.8%), AAG (19.3%), and AGC (16.1%) were most common among trinucleotides. A total of 4,967 primer pairs were designed for EST-SSR markers from the computational data. In a follow up laboratory study, we tested a sample of 20 random selected primer pairs for amplification and polymorphism detection using genomic DNA from date palm cultivars. Nearly one-third of these primer pairs detected DNA polymorphism to differentiate the twelve date palm cultivars used. Functional categorization of EST sequences containing SSRs revealed that 3,108 (67.4%) of such ESTs had homology with known proteins. Date palm EST sequences exhibits a good resource for developing gene-based markers. These genic markers identified in our study may provide a valuable genetic and genomic tool for further genetic research and varietal development in date palm, such as diversity study, QTL mapping, and molecular breeding.
Optimization of sequence alignment for simple sequence repeat regions.
Jighly, Abdulqader; Hamwieh, Aladdin; Ogbonnaya, Francis C
2011-07-20
Microsatellites, or simple sequence repeats (SSRs), are tandemly repeated DNA sequences, including tandem copies of specific sequences no longer than six bases, that are distributed in the genome. SSR has been used as a molecular marker because it is easy to detect and is used in a range of applications, including genetic diversity, genome mapping, and marker assisted selection. It is also very mutable because of slipping in the DNA polymerase during DNA replication. This unique mutation increases the insertion/deletion (INDELs) mutation frequency to a high ratio - more than other types of molecular markers such as single nucleotide polymorphism (SNPs).SNPs are more frequent than INDELs. Therefore, all designed algorithms for sequence alignment fit the vast majority of the genomic sequence without considering microsatellite regions, as unique sequences that require special consideration. The old algorithm is limited in its application because there are many overlaps between different repeat units which result in false evolutionary relationships. To overcome the limitation of the aligning algorithm when dealing with SSR loci, a new algorithm was developed using PERL script with a Tk graphical interface. This program is based on aligning sequences after determining the repeated units first, and the last SSR nucleotides positions. This results in a shifting process according to the inserted repeated unit type.When studying the phylogenic relations before and after applying the new algorithm, many differences in the trees were obtained by increasing the SSR length and complexity. However, less distance between different linage had been observed after applying the new algorithm. The new algorithm produces better estimates for aligning SSR loci because it reflects more reliable evolutionary relations between different linages. It reduces overlapping during SSR alignment, which results in a more realistic phylogenic relationship.
Comparison of simple sequence repeats in 19 Archaea.
Trivedi, S
2006-12-05
All organisms that have been studied until now have been found to have differential distribution of simple sequence repeats (SSRs), with more SSRs in intergenic than in coding sequences. SSR distribution was investigated in Archaea genomes where complete chromosome sequences of 19 Archaea were analyzed with the program SPUTNIK to find di- to penta-nucleotide repeats. The number of repeats was determined for the complete chromosome sequences and for the coding and non-coding sequences. Different from what has been found for other groups of organisms, there is an abundance of SSRs in coding regions of the genome of some Archaea. Dinucleotide repeats were rare and CG repeats were found in only two Archaea. In general, trinucleotide repeats are the most abundant SSR motifs; however, pentanucleotide repeats are abundant in some Archaea. Some of the tetranucleotide and pentanucleotide repeat motifs are organism specific. In general, repeats are short and CG-rich repeats are present in Archaea having a CG-rich genome. Among the 19 Archaea, SSR density was not correlated with genome size or with optimum growth temperature. Pentanucleotide density had an inverse correlation with the CG content of the genome.
2009-01-01
Background Chickpea (Cicer arietinum L.), an important grain legume crop of the world is seriously challenged by terminal drought and salinity stresses. However, very limited number of molecular markers and candidate genes are available for undertaking molecular breeding in chickpea to tackle these stresses. This study reports generation and analysis of comprehensive resource of drought- and salinity-responsive expressed sequence tags (ESTs) and gene-based markers. Results A total of 20,162 (18,435 high quality) drought- and salinity- responsive ESTs were generated from ten different root tissue cDNA libraries of chickpea. Sequence editing, clustering and assembly analysis resulted in 6,404 unigenes (1,590 contigs and 4,814 singletons). Functional annotation of unigenes based on BLASTX analysis showed that 46.3% (2,965) had significant similarity (≤1E-05) to sequences in the non-redundant UniProt database. BLASTN analysis of unique sequences with ESTs of four legume species (Medicago, Lotus, soybean and groundnut) and three model plant species (rice, Arabidopsis and poplar) provided insights on conserved genes across legumes as well as novel transcripts for chickpea. Of 2,965 (46.3%) significant unigenes, only 2,071 (32.3%) unigenes could be functionally categorised according to Gene Ontology (GO) descriptions. A total of 2,029 sequences containing 3,728 simple sequence repeats (SSRs) were identified and 177 new EST-SSR markers were developed. Experimental validation of a set of 77 SSR markers on 24 genotypes revealed 230 alleles with an average of 4.6 alleles per marker and average polymorphism information content (PIC) value of 0.43. Besides SSR markers, 21,405 high confidence single nucleotide polymorphisms (SNPs) in 742 contigs (with ≥ 5 ESTs) were also identified. Recognition sites for restriction enzymes were identified for 7,884 SNPs in 240 contigs. Hierarchical clustering of 105 selected contigs provided clues about stress- responsive candidate genes and their expression profile showed predominance in specific stress-challenged libraries. Conclusion Generated set of chickpea ESTs serves as a resource of high quality transcripts for gene discovery and development of functional markers associated with abiotic stress tolerance that will be helpful to facilitate chickpea breeding. Mapping of gene-based markers in chickpea will also add more anchoring points to align genomes of chickpea and other legume species. PMID:19912666
Highly Informative Simple Sequence Repeat (SSR) Markers for Fingerprinting Hazelnut
USDA-ARS?s Scientific Manuscript database
Simple sequence repeat (SSR) or microsatellite markers have many applications in breeding and genetic studies of plants, including fingerprinting of cultivars and investigations of genetic diversity, and therefore provide information for better management of germplasm collections. They are repeatab...
Sun, Lidan; Yang, Weiru; Zhang, Qixiang; Cheng, Tangren; Pan, Huitang; Xu, Zongda; Zhang, Jie; Chen, Chuguang
2013-01-01
Because of its popularity as an ornamental plant in East Asia, mei (Prunus mume Sieb. et Zucc.) has received increasing attention in genetic and genomic research with the recent shotgun sequencing of its genome. Here, we performed the genome-wide characterization of simple sequence repeats (SSRs) in the mei genome and detected a total of 188,149 SSRs occurring at a frequency of 794 SSR/Mb. Mononucleotide repeats were the most common type of SSR in genomic regions, followed by di- and tetranucleotide repeats. Most of the SSRs in coding sequences (CDS) were composed of tri- or hexanucleotide repeat motifs, but mononucleotide repeats were always the most common in intergenic regions. Genome-wide comparison of SSR patterns among the mei, strawberry (Fragaria vesca), and apple (Malus×domestica) genomes showed mei to have the highest density of SSRs, slightly higher than that of strawberry (608 SSR/Mb) and almost twice as high as that of apple (398 SSR/Mb). Mononucleotide repeats were the dominant SSR motifs in the three Rosaceae species. Using 144 SSR markers, we constructed a 670 cM-long linkage map of mei delimited into eight linkage groups (LGs), with an average marker distance of 5 cM. Seventy one scaffolds covering about 27.9% of the assembled mei genome were anchored to the genetic map, depending on which the macro-colinearity between the mei genome and Prunus T×E reference map was identified. The framework map of mei constructed provides a first step into subsequent high-resolution genetic mapping and marker-assisted selection for this ornamental species. PMID:23555708
Zhu, H; Senalik, D; McCown, B H; Zeldin, E L; Speers, J; Hyman, J; Bassil, N; Hummer, K; Simon, P W; Zalapa, J E
2012-01-01
The American cranberry (Vaccinium macrocarpon Ait.) is a major commercial fruit crop in North America, but limited genetic resources have been developed for the species. Furthermore, the paucity of codominant DNA markers has hampered the advance of genetic research in cranberry and the Ericaceae family in general. Therefore, we used Roche 454 sequencing technology to perform low-coverage whole genome shotgun sequencing of the cranberry cultivar 'HyRed'. After de novo assembly, the obtained sequence covered 266.3 Mb of the estimated 540-590 Mb in cranberry genome. A total of 107,244 SSR loci were detected with an overall density across the genome of 403 SSR/Mb. The AG repeat was the most frequent motif in cranberry accounting for 35% of all SSRs and together with AAG and AAAT accounted for 46% of all loci discovered. To validate the SSR loci, we designed 96 primer-pairs using contig sequence data containing perfect SSR repeats, and studied the genetic diversity of 25 cranberry genotypes. We identified 48 polymorphic SSR loci with 2-15 alleles per locus for a total of 323 alleles in the 25 cranberry genotypes. Genetic clustering by principal coordinates and genetic structure analyzes confirmed the heterogeneous nature of cranberries. The parentage composition of several hybrid cultivars was evident from the structure analyzes. Whole genome shotgun 454 sequencing was a cost-effective and efficient way to identify numerous SSR repeats in the cranberry sequence for marker development.
SSRPrimer and SSR Taxonomy Tree: Biome SSR discovery
Jewell, Erica; Robinson, Andrew; Savage, David; Erwin, Tim; Love, Christopher G.; Lim, Geraldine A. C.; Li, Xi; Batley, Jacqueline; Spangenberg, German C.; Edwards, David
2006-01-01
Simple sequence repeat (SSR) molecular genetic markers have become important tools for a broad range of applications such as genome mapping and genetic diversity studies. SSRs are readily identified within DNA sequence data and PCR primers can be designed for their amplification. These PCR primers frequently cross amplify within related species. We report a web-based tool, SSR Primer, that integrates SPUTNIK, an SSR repeat finder, with Primer3, a primer design program, within one pipeline. On submission of multiple FASTA formatted sequences, the script screens each sequence for SSRs using SPUTNIK. Results are then parsed to Primer3 for locus specific primer design. We have applied this tool for the discovery of SSRs within the complete GenBank database, and have designed PCR amplification primers for over 13 million SSRs. The SSR Taxonomy Tree server provides web-based searching and browsing of species and taxa for the visualisation and download of these SSR amplification primers. These tools are available at . PMID:16845092
SSRPrimer and SSR Taxonomy Tree: Biome SSR discovery.
Jewell, Erica; Robinson, Andrew; Savage, David; Erwin, Tim; Love, Christopher G; Lim, Geraldine A C; Li, Xi; Batley, Jacqueline; Spangenberg, German C; Edwards, David
2006-07-01
Simple sequence repeat (SSR) molecular genetic markers have become important tools for a broad range of applications such as genome mapping and genetic diversity studies. SSRs are readily identified within DNA sequence data and PCR primers can be designed for their amplification. These PCR primers frequently cross amplify within related species. We report a web-based tool, SSR Primer, that integrates SPUTNIK, an SSR repeat finder, with Primer3, a primer design program, within one pipeline. On submission of multiple FASTA formatted sequences, the script screens each sequence for SSRs using SPUTNIK. Results are then parsed to Primer3 for locus specific primer design. We have applied this tool for the discovery of SSRs within the complete GenBank database, and have designed PCR amplification primers for over 13 million SSRs. The SSR Taxonomy Tree server provides web-based searching and browsing of species and taxa for the visualisation and download of these SSR amplification primers. These tools are available at http://bioinformatics.pbcbasc.latrobe.edu.au/ssrdiscovery.html.
USDA-ARS?s Scientific Manuscript database
Simple sequence repeats (SSR) markers were developed from a small insert genomic library for Bipolaris sorokiniana, a mitosporic fungal pathogen that causes spot blotch and root rot in switchgrass. About 59% of sequenced clones (n=384) harbored various SSR motifs. After eliminating the redundant seq...
Cloutier, Sylvie; Miranda, Evelyn; Ward, Kerry; Radovanovic, Natasa; Reimer, Elsa; Walichnowski, Andrzej; Datla, Raju; Rowland, Gordon; Duguid, Scott; Ragupathy, Raja
2012-08-01
Flax is an important oilseed crop in North America and is mostly grown as a fibre crop in Europe. As a self-pollinated diploid with a small estimated genome size of ~370 Mb, flax is well suited for fast progress in genomics. In the last few years, important genetic resources have been developed for this crop. Here, we describe the assessment and comparative analyses of 1,506 putative simple sequence repeats (SSRs) of which, 1,164 were derived from BAC-end sequences (BESs) and 342 from expressed sequence tags (ESTs). The SSRs were assessed on a panel of 16 flax accessions with 673 (58 %) and 145 (42 %) primer pairs being polymorphic in the BESs and ESTs, respectively. With 818 novel polymorphic SSR primer pairs reported in this study, the repertoire of available SSRs in flax has more than doubled from the combined total of 508 of all previous reports. Among nucleotide motifs, trinucleotides were the most abundant irrespective of the class, but dinucleotides were the most polymorphic. SSR length was also positively correlated with polymorphism. Two dinucleotide (AT/TA and AG/GA) and two trinucleotide (AAT/ATA/TAA and GAA/AGA/AAG) motifs and their iterations, different from those reported in many other crops, accounted for more than half of all the SSRs and were also more polymorphic (63.4 %) than the rest of the markers (42.7 %). This improved resource promises to be useful in genetic, quantitative trait loci (QTL) and association mapping as well as for anchoring the physical/genetic map with the whole genome shotgun reference sequence of flax.
Kumar, Pankaj; Chaitanya, Pasumarthy S; Nagarajaram, Hampapathalu A
2011-01-01
PSSRdb (Polymorphic Simple Sequence Repeats database) (http://www.cdfd.org.in/PSSRdb/) is a relational database of polymorphic simple sequence repeats (PSSRs) extracted from 85 different species of prokaryotes. Simple sequence repeats (SSRs) are the tandem repeats of nucleotide motifs of the sizes 1-6 bp and are highly polymorphic. SSR mutations in and around coding regions affect transcription and translation of genes. Such changes underpin phase variations and antigenic variations seen in some bacteria. Although SSR-mediated phase variation and antigenic variations have been well-studied in some bacteria there seems a lot of other species of prokaryotes yet to be investigated for SSR mediated adaptive and other evolutionary advantages. As a part of our on-going studies on SSR polymorphism in prokaryotes we compared the genome sequences of various strains and isolates available for 85 different species of prokaryotes and extracted a number of SSRs showing length variations and created a relational database called PSSRdb. This database gives useful information such as location of PSSRs in genomes, length variation across genomes, the regions harboring PSSRs, etc. The information provided in this database is very useful for further research and analysis of SSRs in prokaryotes.
2010-01-01
Background Little genomic or trancriptomic information on Ganoderma lucidum (Lingzhi) is known. This study aims to discover the transcripts involved in secondary metabolite biosynthesis and developmental regulation of G. lucidum using an expressed sequence tag (EST) library. Methods A cDNA library was constructed from the G. lucidum fruiting body. Its high-quality ESTs were assembled into unique sequences with contigs and singletons. The unique sequences were annotated according to sequence similarities to genes or proteins available in public databases. The detection of simple sequence repeats (SSRs) was preformed by online analysis. Results A total of 1,023 clones were randomly selected from the G. lucidum library and sequenced, yielding 879 high-quality ESTs. These ESTs showed similarities to a diverse range of genes. The sequences encoding squalene epoxidase (SE) and farnesyl-diphosphate synthase (FPS) were identified in this EST collection. Several candidate genes, such as hydrophobin, MOB2, profilin and PHO84 were detected for the first time in G. lucidum. Thirteen (13) potential SSR-motif microsatellite loci were also identified. Conclusion The present study demonstrates a successful application of EST analysis in the discovery of transcripts involved in the secondary metabolite biosynthesis and the developmental regulation of G. lucidum. PMID:20230644
Simple sequence repeat marker loci discovery using SSR primer.
Robinson, Andrew J; Love, Christopher G; Batley, Jacqueline; Barker, Gary; Edwards, David
2004-06-12
Simple sequence repeats (SSRs) have become important molecular markers for a broad range of applications, such as genome mapping and characterization, phenotype mapping, marker assisted selection of crop plants and a range of molecular ecology and diversity studies. With the increase in the availability of DNA sequence information, an automated process to identify and design PCR primers for amplification of SSR loci would be a useful tool in plant breeding programs. We report an application that integrates SPUTNIK, an SSR repeat finder, with Primer3, a PCR primer design program, into one pipeline tool, SSR Primer. On submission of multiple FASTA formatted sequences, the script screens each sequence for SSRs using SPUTNIK. The results are parsed to Primer3 for locus-specific primer design. The script makes use of a Web-based interface, enabling remote use. This program has been written in PERL and is freely available for non-commercial users by request from the authors. The Web-based version may be accessed at http://hornbill.cspp.latrobe.edu.au/
Wu, Jianzhong; Zhao, Qian; Wu, Guangwen; Zhang, Shuquan; Jiang, Tingbo
2016-01-01
Flax ( Linum usitatissimum L.) is a major fiber and oil yielding crop grown in northeastern China. Identification of flax molecular markers is a key step toward improving flax yield and quality via marker-assisted breeding. Simple sequence repeat (SSR) markers, which are based on genomic structural variation, are considered the most valuable type of genetic marker for this purpose. In this study, we screened 1574 microsatellites from Linum usitatissimum L. obtained using reduced representation genome sequencing (RRGS) to systematically identify SSR markers. The resulting set of microsatellites consisted mainly of trinucleotide (56.10%) and dinucleotide (35.23%) repeats, with each motif consisting of 5-8 repeats. We then evaluated marker sensitivity and specificity based on samples of 48 flax isolates obtained from northeastern China. Using the new SSR panel, the results demonstrated that fiber flax and oilseed flax varieties clustered into two well separated groups. The novel SSR markers developed in this study show potential value for selection of varieties for use in flax breeding programs.
Misra, Gopal; Gupta, Sarika; Subramanian, Alagesan; Parida, Swarup Kumar; Chattopadhyay, Debasis; Prasad, Manoj
2013-01-01
Foxtail millet ( Setaria italica L.) is a tractable experimental model crop for studying functional genomics of millets and bioenergy grasses. But the limited availability of genomic resources, particularly expressed sequence-based genic markers is significantly impeding its genetic improvement. Considering this, we attempted to develop EST-derived-SSR (eSSR) markers and utilize them in germplasm characterization, cross-genera transferability and in silico comparative mapping. From 66,027 foxtail millet EST sequences 24,828 non-redundant ESTs were deduced, representing ~16 Mb, which revealed 534 (~2%) eSSRs in 495 SSR containing ESTs at a frequency of 1/30 kb. A total of 447 pp were successfully designed, of which 327 were mapped physically onto nine chromosomes. About 106 selected primer pairs representing the foxtail millet genome showed high-level of cross-genera amplification at an average of ~88% in eight millets and four non-millet species. Broad range of genetic diversity (0.02–0.65) obtained in constructed phylogenetic tree using 40 eSSR markers demonstrated its utility in germplasm characterizations and phylogenetics. Comparative mapping of physically mapped eSSR markers showed considerable proportion of sequence-based orthology and syntenic relationship between foxtail millet chromosomes and sorghum (~68%), maize (~61%) and rice (~42%) chromosomes. Synteny analysis of eSSRs of foxtail millet, rice, maize and sorghum suggested the nested chromosome fusion frequently observed in grass genomes. Thus, for the first time we had generated large-scale eSSR markers in foxtail millet and demonstrated their utility in germplasm characterization, transferability, phylogenetics and comparative mapping studies in millets and bioenergy grass species. PMID:23805325
Kumari, Kajal; Muthamilarasan, Mehanathan; Misra, Gopal; Gupta, Sarika; Subramanian, Alagesan; Parida, Swarup Kumar; Chattopadhyay, Debasis; Prasad, Manoj
2013-01-01
Foxtail millet (Setariaitalica L.) is a tractable experimental model crop for studying functional genomics of millets and bioenergy grasses. But the limited availability of genomic resources, particularly expressed sequence-based genic markers is significantly impeding its genetic improvement. Considering this, we attempted to develop EST-derived-SSR (eSSR) markers and utilize them in germplasm characterization, cross-genera transferability and in silico comparative mapping. From 66,027 foxtail millet EST sequences 24,828 non-redundant ESTs were deduced, representing ~16 Mb, which revealed 534 (~2%) eSSRs in 495 SSR containing ESTs at a frequency of 1/30 kb. A total of 447 pp were successfully designed, of which 327 were mapped physically onto nine chromosomes. About 106 selected primer pairs representing the foxtail millet genome showed high-level of cross-genera amplification at an average of ~88% in eight millets and four non-millet species. Broad range of genetic diversity (0.02-0.65) obtained in constructed phylogenetic tree using 40 eSSR markers demonstrated its utility in germplasm characterizations and phylogenetics. Comparative mapping of physically mapped eSSR markers showed considerable proportion of sequence-based orthology and syntenic relationship between foxtail millet chromosomes and sorghum (~68%), maize (~61%) and rice (~42%) chromosomes. Synteny analysis of eSSRs of foxtail millet, rice, maize and sorghum suggested the nested chromosome fusion frequently observed in grass genomes. Thus, for the first time we had generated large-scale eSSR markers in foxtail millet and demonstrated their utility in germplasm characterization, transferability, phylogenetics and comparative mapping studies in millets and bioenergy grass species.
Li, Fagen; Zhou, Changpin; Weng, Qijie; Li, Mei; Yu, Xiaoli; Guo, Yong; Wang, Yu; Zhang, Xiaohong; Gan, Siming
2015-01-01
Dense genetic maps, along with quantitative trait loci (QTLs) detected on such maps, are powerful tools for genomics and molecular breeding studies. In the important woody genus Eucalyptus, the recent release of E. grandis genome sequence allows for sequence-based genomic comparison and searching for positional candidate genes within QTL regions. Here, dense genetic maps were constructed for E. urophylla and E. tereticornis using genomic simple sequence repeats (SSR), expressed sequence tag (EST) derived SSR, EST-derived cleaved amplified polymorphic sequence (EST-CAPS), and diversity arrays technology (DArT) markers. The E. urophylla and E. tereticornis maps comprised 700 and 585 markers across 11 linkage groups, totaling at 1,208.2 and 1,241.4 cM in length, respectively. Extensive synteny and colinearity were observed as compared to three earlier DArT-based eucalypt maps (two maps with E. grandis × E. urophylla and one map of E. globulus) and with the E. grandis genome sequence. Fifty-three QTLs for growth (10-56 months of age) and wood density (56 months) were identified in 22 discrete regions on both maps, in which only one colocalizaiton was found between growth and wood density. Novel QTLs were revealed as compared with those previously detected on DArT-based maps for similar ages in Eucalyptus. Eleven to 585 positional candidate genes were obained for a 56-month-old QTL through aligning QTL confidence interval with the E. grandis genome. These results will assist in comparative genomics studies, targeted gene characterization, and marker-assisted selection in Eucalyptus and the related taxa.
Weng, Qijie; Li, Mei; Yu, Xiaoli; Guo, Yong; Wang, Yu; Zhang, Xiaohong; Gan, Siming
2015-01-01
Dense genetic maps, along with quantitative trait loci (QTLs) detected on such maps, are powerful tools for genomics and molecular breeding studies. In the important woody genus Eucalyptus, the recent release of E. grandis genome sequence allows for sequence-based genomic comparison and searching for positional candidate genes within QTL regions. Here, dense genetic maps were constructed for E. urophylla and E. tereticornis using genomic simple sequence repeats (SSR), expressed sequence tag (EST) derived SSR, EST-derived cleaved amplified polymorphic sequence (EST-CAPS), and diversity arrays technology (DArT) markers. The E. urophylla and E. tereticornis maps comprised 700 and 585 markers across 11 linkage groups, totaling at 1,208.2 and 1,241.4 cM in length, respectively. Extensive synteny and colinearity were observed as compared to three earlier DArT-based eucalypt maps (two maps with E. grandis × E. urophylla and one map of E. globulus) and with the E. grandis genome sequence. Fifty-three QTLs for growth (10–56 months of age) and wood density (56 months) were identified in 22 discrete regions on both maps, in which only one colocalizaiton was found between growth and wood density. Novel QTLs were revealed as compared with those previously detected on DArT-based maps for similar ages in Eucalyptus. Eleven to 585 positional candidate genes were obained for a 56-month-old QTL through aligning QTL confidence interval with the E. grandis genome. These results will assist in comparative genomics studies, targeted gene characterization, and marker-assisted selection in Eucalyptus and the related taxa. PMID:26695430
Marques, Isabel; Montgomery, Sean A; Barker, Michael S; Macfarlane, Terry D; Conran, John G; Catalán, Pilar; Rieseberg, Loren H; Rudall, Paula J; Graham, Sean W
2016-04-01
Relatively little is known about species-level genetic diversity in flowering plants outside the eudicots and monocots, and it is often unclear how to interpret genetic patterns in lineages with whole-genome duplications. We addressed these issues in a polyploid representative of Hydatellaceae, part of the water-lily order Nymphaeales. We examined a transcriptome of Trithuria submersa for evidence of recent whole-genome duplication, and applied transcriptome-derived microsatellite (expressed-sequence tag simple-sequence repeat (EST-SSR)) primers to survey genetic variation in populations across its range in mainland Australia. A transcriptome-based Ks plot revealed at least one recent polyploidization event, consistent with fixed heterozygous genotypes representing underlying sets of homeologous loci. A strong genetic division coincides with a trans-Nullarbor biogeographic boundary. Patterns of 'allelic' variation (no more than two variants per EST-SSR genotype) and recently published chromosomal evidence are consistent with the predicted polyploidization event and substantial homozygosity underlying fixed heterozygote SSR genotypes, which in turn reflect a selfing mating system. The Nullarbor Plain is a barrier to gene flow between two deep lineages of T. submersa that may represent cryptic species. The markers developed here should also be useful for further disentangling species relationships, and provide a first step towards future genomic studies in Trithuria. © 2015 The Authors. New Phytologist © 2015 New Phytologist Trust.
Meyer, Lucie; Causse, Romain; Pernin, Fanny; Scalone, Romain; Bailly, Géraldine; Chauvel, Bruno; Délye, Christophe; Le Corre, Valérie
2017-01-01
Ambrosia artemisiifolia L., (common ragweed), is an annual invasive and highly troublesome plant species originating from North America that has become widespread across Europe. New sets of genomic and expressed sequence tag (EST) based simple sequence repeats (SSRs) markers were developed in this species using three approaches. After validation, 13 genomic SSRs and 13 EST-SSRs were retained and used to characterize the genetic diversity and population genetic structure of Ambrosia artemisiifolia populations from the native (North America) and invasive (Europe) ranges of the species. Analysing the mating system based on maternal families did not reveal any departure from complete allogamy and excess homozygosity was mostly due the presence of null alleles. High genetic diversity and patterns of genetic structure in Europe suggest two main introduction events followed by secondary colonization events. Cross-species transferability of the newly developed markers to other invasive species of the Ambrosia genus was assessed. Sixty-five percent and 75% of markers, respectively, were transferable from A. artemisiifolia to Ambrosia psilostachya and Ambrosia tenuifolia. 40% were transferable to Ambrosia trifida, this latter species being seemingly more phylogenetically distantly related to A. artemisiifolia than the former two.
USDA-ARS?s Scientific Manuscript database
Watermelon (Citrullus lanatus var. lanatus) is an important vegetable fruit throughout the world. A high number of single nucleotide polymorphism (SNP) and simple sequence repeat (SSR) markers should provide large coverage of the watermelon genome and high phylogenetic resolution of germplasm acces...
A Glance at Microsatellite Motifs from 454 Sequencing Reads of Watermelon Genomic DNA
USDA-ARS?s Scientific Manuscript database
A single 454 (Life Sciences Sequencing Technology) run of Charleston Gray watermelon (Citrullus lanatus var. lanatus) genomic DNA was performed and sequence data were assembled. A large scale identification of simple sequence repeat (SSR) was performed and SSR sequence data were used for the develo...
Dutta, Sutapa; Kumawat, Giriraj; Singh, Bikram P; Gupta, Deepak K; Singh, Sangeeta; Dogra, Vivek; Gaikwad, Kishor; Sharma, Tilak R; Raje, Ranjeet S; Bandhopadhya, Tapas K; Datta, Subhojit; Singh, Mahendra N; Bashasab, Fakrudin; Kulwal, Pawan; Wanjari, K B; K Varshney, Rajeev; Cook, Douglas R; Singh, Nagendra K
2011-01-20
Pigeonpea [Cajanus cajan (L.) Millspaugh], one of the most important food legumes of semi-arid tropical and subtropical regions, has limited genomic resources, particularly expressed sequence based (genic) markers. We report a comprehensive set of validated genic simple sequence repeat (SSR) markers using deep transcriptome sequencing, and its application in genetic diversity analysis and mapping. In this study, 43,324 transcriptome shotgun assembly unigene contigs were assembled from 1.696 million 454 GS-FLX sequence reads of separate pooled cDNA libraries prepared from leaf, root, stem and immature seed of two pigeonpea varieties, Asha and UPAS 120. A total of 3,771 genic-SSR loci, excluding homopolymeric and compound repeats, were identified; of which 2,877 PCR primer pairs were designed for marker development. Dinucleotide was the most common repeat motif with a frequency of 60.41%, followed by tri- (34.52%), hexa- (2.62%), tetra- (1.67%) and pentanucleotide (0.76%) repeat motifs. Primers were synthesized and tested for 772 of these loci with repeat lengths of ≥ 18 bp. Of these, 550 markers were validated for consistent amplification in eight diverse pigeonpea varieties; 71 were found to be polymorphic on agarose gel electrophoresis. Genetic diversity analysis was done on 22 pigeonpea varieties and eight wild species using 20 highly polymorphic genic-SSR markers. The number of alleles at these loci ranged from 4-10 and the polymorphism information content values ranged from 0.46 to 0.72. Neighbor-joining dendrogram showed distinct separation of the different groups of pigeonpea cultivars and wild species. Deep transcriptome sequencing of the two parental lines helped in silico identification of polymorphic genic-SSR loci to facilitate the rapid development of an intra-species reference genetic map, a subset of which was validated for expected allelic segregation in the reference mapping population. We developed 550 validated genic-SSR markers in pigeonpea using deep transcriptome sequencing. From these, 20 highly polymorphic markers were used to evaluate the genetic relationship among species of the genus Cajanus. A comprehensive set of genic-SSR markers was developed as an important genomic resource for diversity analysis and genetic mapping in pigeonpea.
2011-01-01
Background Pigeonpea [Cajanus cajan (L.) Millspaugh], one of the most important food legumes of semi-arid tropical and subtropical regions, has limited genomic resources, particularly expressed sequence based (genic) markers. We report a comprehensive set of validated genic simple sequence repeat (SSR) markers using deep transcriptome sequencing, and its application in genetic diversity analysis and mapping. Results In this study, 43,324 transcriptome shotgun assembly unigene contigs were assembled from 1.696 million 454 GS-FLX sequence reads of separate pooled cDNA libraries prepared from leaf, root, stem and immature seed of two pigeonpea varieties, Asha and UPAS 120. A total of 3,771 genic-SSR loci, excluding homopolymeric and compound repeats, were identified; of which 2,877 PCR primer pairs were designed for marker development. Dinucleotide was the most common repeat motif with a frequency of 60.41%, followed by tri- (34.52%), hexa- (2.62%), tetra- (1.67%) and pentanucleotide (0.76%) repeat motifs. Primers were synthesized and tested for 772 of these loci with repeat lengths of ≥18 bp. Of these, 550 markers were validated for consistent amplification in eight diverse pigeonpea varieties; 71 were found to be polymorphic on agarose gel electrophoresis. Genetic diversity analysis was done on 22 pigeonpea varieties and eight wild species using 20 highly polymorphic genic-SSR markers. The number of alleles at these loci ranged from 4-10 and the polymorphism information content values ranged from 0.46 to 0.72. Neighbor-joining dendrogram showed distinct separation of the different groups of pigeonpea cultivars and wild species. Deep transcriptome sequencing of the two parental lines helped in silico identification of polymorphic genic-SSR loci to facilitate the rapid development of an intra-species reference genetic map, a subset of which was validated for expected allelic segregation in the reference mapping population. Conclusion We developed 550 validated genic-SSR markers in pigeonpea using deep transcriptome sequencing. From these, 20 highly polymorphic markers were used to evaluate the genetic relationship among species of the genus Cajanus. A comprehensive set of genic-SSR markers was developed as an important genomic resource for diversity analysis and genetic mapping in pigeonpea. PMID:21251263
2011-01-01
Background Big sagebrush (Artemisia tridentata) is one of the most widely distributed and ecologically important shrub species in western North America. This species serves as a critical habitat and food resource for many animals and invertebrates. Habitat loss due to a combination of disturbances followed by establishment of invasive plant species is a serious threat to big sagebrush ecosystem sustainability. Lack of genomic data has limited our understanding of the evolutionary history and ecological adaptation in this species. Here, we report on the sequencing of expressed sequence tags (ESTs) and detection of single nucleotide polymorphism (SNP) and simple sequence repeat (SSR) markers in subspecies of big sagebrush. Results cDNA of A. tridentata sspp. tridentata and vaseyana were normalized and sequenced using the 454 GS FLX Titanium pyrosequencing technology. Assembly of the reads resulted in 20,357 contig consensus sequences in ssp. tridentata and 20,250 contigs in ssp. vaseyana. A BLASTx search against the non-redundant (NR) protein database using 29,541 consensus sequences obtained from a combined assembly resulted in 21,436 sequences with significant blast alignments (≤ 1e-15). A total of 20,952 SNPs and 119 polymorphic SSRs were detected between the two subspecies. SNPs were validated through various methods including sequence capture. Validation of SNPs in different individuals uncovered a high level of nucleotide variation in EST sequences. EST sequences of a third, tetraploid subspecies (ssp. wyomingensis) obtained by Illumina sequencing were mapped to the consensus sequences of the combined 454 EST assembly. Approximately one-third of the SNPs between sspp. tridentata and vaseyana identified in the combined assembly were also polymorphic within the two geographically distant ssp. wyomingensis samples. Conclusion We have produced a large EST dataset for Artemisia tridentata, which contains a large sample of the big sagebrush leaf transcriptome. SNP mapping among the three subspecies suggest the origin of ssp. wyomingensis via mixed ancestry. A large number of SNP and SSR markers provide the foundation for future research to address questions in big sagebrush evolution, ecological genetics, and conservation using genomic approaches. PMID:21767398
Wu, Jianzhong; Zhao, Qian; Wu, Guangwen; Zhang, Shuquan; Jiang, Tingbo
2017-01-01
Flax (Linum usitatissimum L.) is a major fiber and oil yielding crop grown in northeastern China. Identification of flax molecular markers is a key step toward improving flax yield and quality via marker-assisted breeding. Simple sequence repeat (SSR) markers, which are based on genomic structural variation, are considered the most valuable type of genetic marker for this purpose. In this study, we screened 1574 microsatellites from Linum usitatissimum L. obtained using reduced representation genome sequencing (RRGS) to systematically identify SSR markers. The resulting set of microsatellites consisted mainly of trinucleotide (56.10%) and dinucleotide (35.23%) repeats, with each motif consisting of 5–8 repeats. We then evaluated marker sensitivity and specificity based on samples of 48 flax isolates obtained from northeastern China. Using the new SSR panel, the results demonstrated that fiber flax and oilseed flax varieties clustered into two well separated groups. The novel SSR markers developed in this study show potential value for selection of varieties for use in flax breeding programs. PMID:28133461
Onyśk, Agnieszka; Boczkowska, Maja
2017-01-01
Simple Sequence Repeat (SSR) markers are one of the most frequently used molecular markers in studies of crop diversity and population structure. This is due to their uniform distribution in the genome, the high polymorphism, reproducibility, and codominant character. Additional advantages are the possibility of automatic analysis and simple interpretation of the results. The M13 tagged PCR reaction significantly reduces the costs of analysis by the automatic genetic analyzers. Here, we also disclose a short protocol of SSR data analysis.
A SSR-based composite genetic linkage map for the cultivated peanut (Arachis hypogaea L.) genome
2010-01-01
Background The construction of genetic linkage maps for cultivated peanut (Arachis hypogaea L.) has and continues to be an important research goal to facilitate quantitative trait locus (QTL) analysis and gene tagging for use in a marker-assisted selection in breeding. Even though a few maps have been developed, they were constructed using diploid or interspecific tetraploid populations. The most recently published intra-specific map was constructed from the cross of cultivated peanuts, in which only 135 simple sequence repeat (SSR) markers were sparsely populated in 22 linkage groups. The more detailed linkage map with sufficient markers is necessary to be feasible for QTL identification and marker-assisted selection. The objective of this study was to construct a genetic linkage map of cultivated peanut using simple sequence repeat (SSR) markers derived primarily from peanut genomic sequences, expressed sequence tags (ESTs), and by "data mining" sequences released in GenBank. Results Three recombinant inbred lines (RILs) populations were constructed from three crosses with one common female parental line Yueyou 13, a high yielding Spanish market type. The four parents were screened with 1044 primer pairs designed to amplify SSRs and 901 primer pairs produced clear PCR products. Of the 901 primer pairs, 146, 124 and 64 primer pairs (markers) were polymorphic in these populations, respectively, and used in genotyping these RIL populations. Individual linkage maps were constructed from each of the three populations and a composite map based on 93 common loci were created using JoinMap. The composite linkage maps consist of 22 composite linkage groups (LG) with 175 SSR markers (including 47 SSRs on the published AA genome maps), representing the 20 chromosomes of A. hypogaea. The total composite map length is 885.4 cM, with an average marker density of 5.8 cM. Segregation distortion in the 3 populations was 23.0%, 13.5% and 7.8% of the markers, respectively. These distorted loci tended to cluster on LG1, LG3, LG4 and LG5. There were only 15 EST-SSR markers mapped due to low polymorphism. By comparison, there were potential synteny, collinear order of some markers and conservation of collinear linkage groups among the maps and with the AA genome but not fully conservative. Conclusion A composite linkage map was constructed from three individual mapping populations with 175 SSR markers in 22 composite linkage groups. This composite genetic linkage map is among the first "true" tetraploid peanut maps produced. This map also consists of 47 SSRs that have been used in the published AA genome maps, and could be used in comparative mapping studies. The primers described in this study are PCR-based markers, which are easy to share for genetic mapping in peanuts. All 1044 primer pairs are provided as additional files and the three RIL populations will be made available to public upon request for quantitative trait loci (QTL) analysis and linkage map improvement. PMID:20105299
Cytogenetic Diversity of Simple Sequences Repeats in Morphotypes of Brassica rapa ssp. chinensis
Zheng, Jin-shuang; Sun, Cheng-zhen; Zhang, Shu-ning; Hou, Xi-lin; Bonnema, Guusje
2016-01-01
A significant fraction of the nuclear DNA of all eukaryotes is comprised of simple sequence repeats (SSRs). Although these sequences are widely used for studying genetic variation, linkage mapping and evolution, little attention had been paid to the chromosomal distribution and cytogenetic diversity of these sequences. In this paper, we report the distribution characterization of mono-, di-, and tri-nucleotide SSRs in Brassica rapa ssp. chinensis. Fluorescence in situ hybridization was used to characterize the cytogenetic diversity of SSRs among morphotypes of B. rapa ssp. chinensis. The proportion of different SSR motifs varied among morphotypes of B. rapa ssp. chinensis, with tri-nucleotide SSRs being more prevalent in the genome of B. rapa ssp. chinensis. We determined the chromosomal locations of mono-, di-, and tri-nucleotide repeat loci. The results showed that the chromosomal distribution of SSRs in the different morphotypes is non-random and motif-dependent, and allowed us to characterize the relative variability in terms of SSR numbers and similar chromosomal distributions in centromeric/peri-centromeric heterochromatin. The differences between SSR repeats with respect to abundance and distribution indicate that SSRs are a driving force in the genomic evolution of B. rapa species. Our results provide a comprehensive view of the SSR sequence distribution and evolution for comparison among morphotypes B. rapa ssp. chinensis. PMID:27507974
Cytogenetic Diversity of Simple Sequences Repeats in Morphotypes of Brassica rapa ssp. chinensis.
Zheng, Jin-Shuang; Sun, Cheng-Zhen; Zhang, Shu-Ning; Hou, Xi-Lin; Bonnema, Guusje
2016-01-01
A significant fraction of the nuclear DNA of all eukaryotes is comprised of simple sequence repeats (SSRs). Although these sequences are widely used for studying genetic variation, linkage mapping and evolution, little attention had been paid to the chromosomal distribution and cytogenetic diversity of these sequences. In this paper, we report the distribution characterization of mono-, di-, and tri-nucleotide SSRs in Brassica rapa ssp. chinensis. Fluorescence in situ hybridization was used to characterize the cytogenetic diversity of SSRs among morphotypes of B. rapa ssp. chinensis. The proportion of different SSR motifs varied among morphotypes of B. rapa ssp. chinensis, with tri-nucleotide SSRs being more prevalent in the genome of B. rapa ssp. chinensis. We determined the chromosomal locations of mono-, di-, and tri-nucleotide repeat loci. The results showed that the chromosomal distribution of SSRs in the different morphotypes is non-random and motif-dependent, and allowed us to characterize the relative variability in terms of SSR numbers and similar chromosomal distributions in centromeric/peri-centromeric heterochromatin. The differences between SSR repeats with respect to abundance and distribution indicate that SSRs are a driving force in the genomic evolution of B. rapa species. Our results provide a comprehensive view of the SSR sequence distribution and evolution for comparison among morphotypes B. rapa ssp. chinensis.
Awasthi, Praveen; Singh, Ashish; Sheikh, Gulfam; Mahajan, Vidushi; Gupta, Ajai Prakash; Gupta, Suphla; Bedi, Yashbir S; Gandhi, Sumit G
2017-10-01
Zingiber officinale is a model spice herb, well known for its medicinal value. It is primarily a vegetatively propagated commercial crop. However, considerable diversity in its morphology, fiber content and chemoprofiles has been reported. The present study explores the utility of EST-derived markers in studying genetic diversity in different accessions of Z. officinale and their cross transferability within the Zingiberaceae family. A total of 38,115 ESTs sequences were assembled to generate 7850 contigs and 10,762 singletons. SSRs were searched in the unigenes and 515 SSR-containing ESTs were identified with a frequency of 1 SSR per 25.21 kb of the genome. These ESTs were also annotated using BLAST2GO. Primers were designed for 349 EST-SSRs and 25 primer pairs were randomly picked for EST SSR study. Out of these, 16 primer pairs could be optimized for amplification in different accessions of Z. officinale as well as other species belonging to Zingiberaceae. GES454, GES466, GES480 and GES486 markers were found to exhibit 100% cross-transferability among different members of Zingiberaceae.
Portis, Ezio; Portis, Flavio; Valente, Luisa; Moglia, Andrea; Barchi, Lorenzo; Lanteri, Sergio; Acquadro, Alberto
2016-01-01
The recently acquired genome sequence of globe artichoke (Cynara cardunculus var. scolymus) has been used to catalog the genome’s content of simple sequence repeat (SSR) markers. More than 177,000 perfect SSRs were revealed, equivalent to an overall density across the genome of 244.5 SSRs/Mbp, but some 224,000 imperfect SSRs were also identified. About 21% of these SSRs were complex (two stretches of repeats separated by <100 nt). Some 73% of the SSRs were composed of dinucleotide motifs. The SSRs were categorized for the numbers of repeats present, their overall length and were allocated to their linkage group. A total of 4,761 perfect and 6,583 imperfect SSRs were present in 3,781 genes (14.11% of the total), corresponding to an overall density across the gene space of 32,5 and 44,9 SSRs/Mbp for perfect and imperfect motifs, respectively. A putative function has been assigned, using the gene ontology approach, to the set of genes harboring at least one SSR. The same search parameters were applied to reveal the SSR content of 14 other plant species for which genome sequence is available. Certain species-specific SSR motifs were identified, along with a hexa-nucleotide motif shared only with the other two Compositae species (sunflower (Helianthus annuus) and horseweed (Conyza canadensis)) included in the study. Finally, a database, called “Cynara cardunculus MicroSatellite DataBase” (CyMSatDB) was developed to provide a searchable interface to the SSR data. CyMSatDB facilitates the retrieval of SSR markers, as well as suggested forward and reverse primers, on the basis of genomic location, genomic vs genic context, perfect vs imperfect repeat, motif type, motif sequence and repeat number. The SSR markers were validated via an in silico based PCR analysis adopting two available assembled transcriptomes, derived from contrasting globe artichoke accessions, as templates. PMID:27648830
Cheng, Jiaowen; Zhao, Zicheng; Li, Bo; Qin, Cheng; Wu, Zhiming; Trejo-Saavedra, Diana L; Luo, Xirong; Cui, Junjie; Rivera-Bustamante, Rafael F; Li, Shuaicheng; Hu, Kailin
2016-01-07
The sequences of the full set of pepper genomes including nuclear, mitochondrial and chloroplast are now available for use. However, the overall of simple sequence repeats (SSR) distribution in these genomes and their practical implications for molecular marker development in Capsicum have not yet been described. Here, an average of 868,047.50, 45.50 and 30.00 SSR loci were identified in the nuclear, mitochondrial and chloroplast genomes of pepper, respectively. Subsequently, systematic comparisons of various species, genome types, motif lengths, repeat numbers and classified types were executed and discussed. In addition, a local database composed of 113,500 in silico unique SSR primer pairs was built using a homemade bioinformatics workflow. As a pilot study, 65 polymorphic markers were validated among a wide collection of 21 Capsicum genotypes with allele number and polymorphic information content value per marker raging from 2 to 6 and 0.05 to 0.64, respectively. Finally, a comparison of the clustering results with those of a previous study indicated the usability of the newly developed SSR markers. In summary, this first report on the comprehensive characterization of SSR motifs in pepper genomes and the very large set of SSR primer pairs will benefit various genetic studies in Capsicum.
Cheng, Jiaowen; Zhao, Zicheng; Li, Bo; Qin, Cheng; Wu, Zhiming; Trejo-Saavedra, Diana L.; Luo, Xirong; Cui, Junjie; Rivera-Bustamante, Rafael F.; Li, Shuaicheng; Hu, Kailin
2016-01-01
The sequences of the full set of pepper genomes including nuclear, mitochondrial and chloroplast are now available for use. However, the overall of simple sequence repeats (SSR) distribution in these genomes and their practical implications for molecular marker development in Capsicum have not yet been described. Here, an average of 868,047.50, 45.50 and 30.00 SSR loci were identified in the nuclear, mitochondrial and chloroplast genomes of pepper, respectively. Subsequently, systematic comparisons of various species, genome types, motif lengths, repeat numbers and classified types were executed and discussed. In addition, a local database composed of 113,500 in silico unique SSR primer pairs was built using a homemade bioinformatics workflow. As a pilot study, 65 polymorphic markers were validated among a wide collection of 21 Capsicum genotypes with allele number and polymorphic information content value per marker raging from 2 to 6 and 0.05 to 0.64, respectively. Finally, a comparison of the clustering results with those of a previous study indicated the usability of the newly developed SSR markers. In summary, this first report on the comprehensive characterization of SSR motifs in pepper genomes and the very large set of SSR primer pairs will benefit various genetic studies in Capsicum. PMID:26739748
Gupta, S K; Gopalakrishna, T
2010-07-01
Unigene sequences available in public databases provide a cost-effective and valuable source for the development of molecular markers. In this study, the identification and development of unigene-based SSR markers in cowpea (Vigna unguiculata (L.) Walp.) is presented. A total of 1071 SSRs were identified in 15 740 cowpea unigene sequences downloaded from the National Center for Biotechnology Information. The most frequent SSR motifs present in the unigenes were trinucleotides (59.7%), followed by dinucleotides (34.8%), pentanucleotides (4%), and tetranucleotides (1.5%). The copy number varied from 6 to 33 for dinucleotide, 5 to 29 for trinucleotide, 5 to 7 for tetranucleotide, and 4 to 6 for pentanucleotide repeats. Primer pairs were successfully designed for 803 SSR motifs and 102 SSR markers were finally characterized and validated. Putative function was assigned to 64.7% of the unigene SSR markers based on significant homology to reported proteins. About 31.7% of the SSRs were present in coding sequences and 68.3% in untranslated regions of the genes. About 87% of the SSRs located in the coding sequences were trinucleotide repeats. Allelic variation at 32 SSR loci produced 98 alleles in 20 cowpea genotypes. The polymorphic information content for the SSR markers varied from 0.10 to 0.83 with an average of 0.53. These unigene SSR markers showed a high rate of transferability (88%) across other Vigna species, thereby expanding their utility. Alignment of unigene sequences with soybean genomic sequences revealed the presence of introns in amplified products of some of the SSR markers. This study presents the distribution of SSRs in the expressed portion of the cowpea genome and is the first report of the development of functional unigene-based SSR markers in cowpea. These SSR markers would play an important role in molecular mapping, comparative genomics, and marker-assisted selection strategies in cowpea and other Vigna species.
Simple sequence repeats in Escherichia coli: abundance, distribution, composition, and polymorphism.
Gur-Arie, R; Cohen, C J; Eitan, Y; Shelef, L; Hallerman, E M; Kashi, Y
2000-01-01
Computer-based genome-wide screening of the DNA sequence of Escherichia coli strain K12 revealed tens of thousands of tandem simple sequence repeat (SSR) tracts, with motifs ranging from 1 to 6 nucleotides. SSRs were well distributed throughout the genome. Mononucleotide SSRs were over-represented in noncoding regions and under-represented in open reading frames (ORFs). Nucleotide composition of mono- and dinucleotide SSRs, both in ORFs and in noncoding regions, differed from that of the genomic region in which they occurred, with 93% of all mononucleotide SSRs proving to be of A or T. Computer-based analysis of the fine position of every SSR locus in the noncoding portion of the genome relative to downstream ORFs showed SSRs located in areas that could affect gene regulation. DNA sequences at 14 arbitrarily chosen SSR tracts were compared among E. coli strains. Polymorphisms of SSR copy number were observed at four of seven mononucleotide SSR tracts screened, with all polymorphisms occurring in noncoding regions. SSR polymorphism could prove important as a genome-wide source of variation, both for practical applications (including rapid detection, strain identification, and detection of loci affecting key phenotypes) and for evolutionary adaptation of microbes.
Biradar, Jyoti; Madhuri, T.; N. Nataraja, Karaba; Sreeman, Sheshshayee M.
2016-01-01
Improving mulberry leaf production with enhanced leaf quality holds the key to sustain the ever increasing demand for silk. Adoption of modern genomic approaches for crop improvement is severely constrained by the lack of sufficient molecular markers in mulberry. Here, we report development and validation of 206 EST derived SSR markers using transcriptome data generated from leaf tissue of a drought tolerant mulberry genotype, Dudia white. Analysis of transcriptome data containing 10169 EST sequences, revealed 1469 sequences with microsatellite repeat motifs. We designed a total of 264 primers to the most appropriate repeat regions, of which 206 were locus specific. These markers were validated with 25 diverse mulberry accessions and their transferability to closely related species belonging to family Moraceae was examined. Of these markers, 189 revealed polymorphism with up to 8 allelic forms across mulberry species, genotypes and varieties with a mean of 3.5 alleles per locus. The markers also revealed higher polymorphic information content of 0.824 among the accessions. These markers effectively segregated the species and genotypes and hence, can be used for both diversity analysis and in breeding applications. Around 40% of these markers were transferable to other closely related species. Along with the other genic and genomic markers, we report a set of over 750 co-dominant markers. Using these markers we constructed the first genetic linkage map of mulberry exclusively with co-dominant markers. PMID:27669004
Sharafi, Ata Allah; Abkenar, Asad Asadi; Sharafi, Ali; Masaeli, Mohammad
2016-01-01
Iran has a long history of acid lime cultivation and propagation. In this study, genetic variation in 28 acid lime accessions from five regions of south of Iran, and their relatedness with other 19 citrus cultivars were analyzed using Simple Sequence Repeat (SSR) and Inter-Simple Sequence Repeat (ISSR) molecular markers. Nine primers for SSR and nine ISSR primers were used for allele scoring. In total, 49 SSR and 131 ISSR polymorphic alleles were detected. Cluster analysis of SSR and ISSR data showed that most of the acid lime accessions (19 genotypes) have hybrid origin and genetically distance with nucellar of Mexican lime (9 genotypes). As nucellar of Mexican lime are susceptible to phytoplasma, these acid lime genotypes can be used to evaluate their tolerance against biotic constricts like lime "witches' broom disease".
Zaki, Noorhariza Mohd; Singh, Rajinder; Rosli, Rozana; Ismail, Ismanizan
2012-01-01
Species-specific simple sequence repeat (SSR) markers are favored for genetic studies and marker-assisted selection (MAS) breeding for oil palm genetic improvement. This report characterizes 20 SSR markers from an Elaeis oleifera genomic library (gSSR). Characterization of the repeat type in 2000 sequences revealed a high percentage of di-nucleotides (63.6%), followed by tri-nucleotides (24.2%). Primer pairs were successfully designed for 394 of the E. oleifera gSSRs. Subsequent analysis showed the ability of the 20 selected E. oleifera gSSR markers to reveal genetic diversity in the genus Elaeis. The average Polymorphism Information Content (PIC) value for the SSRs was 0.402, with the tri-repeats showing the highest average PIC (0.626). Low values of observed heterozygosity (Ho) (0.164) and highly positive fixation indices (Fis) in the E. oleifera germplasm collection, compared to the E. guineensis, indicated an excess of homozygosity in E. oleifera. The transferability of the markers to closely related palms, Elaeis guineensis, Cocos nucifera and ornamental palms is also reported. Sequencing the amplicons of three selected E. oleifera gSSRs across both species and palm taxa revealed variations in the repeat-units. The study showed the potential of E. oleifera gSSR markers to reveal genetic diversity in the genus Elaeis. The markers are also a valuable genetic resource for studying E. oleifera and other genus in the Arecaceae family. PMID:22605966
2012-01-01
Background Cultivated peanut or groundnut (Arachis hypogaea L.) is an important oilseed crop with an allotetraploid genome (AABB, 2n = 4x = 40). Both the low level of genetic variation within the cultivated gene pool and its polyploid nature limit the utilization of molecular markers to explore genome structure and facilitate genetic improvement. Nevertheless, a wealth of genetic diversity exists in diploid Arachis species (2n = 2x = 20), which represent a valuable gene pool for cultivated peanut improvement. Interspecific populations have been used widely for genetic mapping in diploid species of Arachis. However, an intraspecific mapping strategy was essential to detect chromosomal rearrangements among species that could be obscured by mapping in interspecific populations. To develop intraspecific reference linkage maps and gain insights into karyotypic evolution within the genus, we comparatively mapped the A- and B-genome diploid species using intraspecific F2 populations. Exploring genome organization among diploid peanut species by comparative mapping will enhance our understanding of the cultivated tetraploid peanut genome. Moreover, new sources of molecular markers that are highly transferable between species and developed from expressed genes will be required to construct saturated genetic maps for peanut. Results A total of 2,138 EST-SSR (expressed sequence tag-simple sequence repeat) markers were developed by mining a tetraploid peanut EST assembly including 101,132 unigenes (37,916 contigs and 63,216 singletons) derived from 70,771 long-read (Sanger) and 270,957 short-read (454) sequences. A set of 97 SSR markers were also developed by mining 9,517 genomic survey sequences of Arachis. An SSR-based intraspecific linkage map was constructed using an F2 population derived from a cross between K 9484 (PI 298639) and GKBSPSc 30081 (PI 468327) in the B-genome species A. batizocoi. A high degree of macrosynteny was observed when comparing the homoeologous linkage groups between A (A. duranensis) and B (A. batizocoi) genomes. Comparison of the A- and B-genome genetic linkage maps also showed a total of five inversions and one major reciprocal translocation between two pairs of chromosomes under our current mapping resolution. Conclusions Our findings will contribute to understanding tetraploid peanut genome origin and evolution and eventually promote its genetic improvement. The newly developed EST-SSR markers will enrich current molecular marker resources in peanut. PMID:23140574
A framework linkage map of perennial ryegrass based on SSR markers
G.P. Gill; P.L. Wilcox; D.J. Whittaker; R.A. Winz; P. Bickerstaff; Craig E. Echt; J. Kent; M.O. Humphreys; K.M. Elborough; R.C. Gardner
2006-01-01
A moderate-density linkage map for Lolium perenne L. has been constructed based on 376 simple sequence repeat (SSR) markers. Approximately one third ( 124) of the SSR markers were developed from GeneThresher libraries that preferentially select genomic DNA clones from the gene-rich unmethylated portion of the genome. The remaining SSR marker loci...
Pandey, Manmohan; Kumar, Ravindra; Srivastava, Prachi; Agarwal, Suyash; Srivastava, Shreya; Nagpure, Naresh S; Jena, Joy K; Kushwaha, Basdeo
2018-03-16
Mining and characterization of Simple Sequence Repeat (SSR) markers from whole genomes provide valuable information about biological significance of SSR distribution and also facilitate development of markers for genetic analysis. Whole genome sequencing (WGS)-SSR Annotation Tool (WGSSAT) is a graphical user interface pipeline developed using Java Netbeans and Perl scripts which facilitates in simplifying the process of SSR mining and characterization. WGSSAT takes input in FASTA format and automates the prediction of genes, noncoding RNA (ncRNA), core genes, repeats and SSRs from whole genomes followed by mapping of the predicted SSRs onto a genome (classified according to genes, ncRNA, repeats, exonic, intronic, and core gene region) along with primer identification and mining of cross-species markers. The program also generates a detailed statistical report along with visualization of mapped SSRs, genes, core genes, and RNAs. The features of WGSSAT were demonstrated using Takifugu rubripes data. This yielded a total of 139 057 SSR, out of which 113 703 SSR primer pairs were uniquely amplified in silico onto a T. rubripes (fugu) genome. Out of 113 703 mined SSRs, 81 463 were from coding region (including 4286 exonic and 77 177 intronic), 7 from RNA, 267 from core genes of fugu, whereas 105 641 SSR and 601 SSR primer pairs were uniquely mapped onto the medaka genome. WGSSAT is tested under Ubuntu Linux. The source code, documentation, user manual, example dataset and scripts are available online at https://sourceforge.net/projects/wgssat-nbfgr.
Simple Sequence Repeats in Escherichia coli: Abundance, Distribution, Composition, and Polymorphism
Gur-Arie, Riva; Cohen, Cyril J.; Eitan, Yuval; Shelef, Leora; Hallerman, Eric M.; Kashi, Yechezkel
2000-01-01
Computer-based genome-wide screening of the DNA sequence of Escherichia coli strain K12 revealed tens of thousands of tandem simple sequence repeat (SSR) tracts, with motifs ranging from 1 to 6 nucleotides. SSRs were well distributed throughout the genome. Mononucleotide SSRs were over-represented in noncoding regions and under-represented in open reading frames (ORFs). Nucleotide composition of mono- and dinucleotide SSRs, both in ORFs and in noncoding regions, differed from that of the genomic region in which they occurred, with 93% of all mononucleotide SSRs proving to be of A or T. Computer-based analysis of the fine position of every SSR locus in the noncoding portion of the genome relative to downstream ORFs showed SSRs located in areas that could affect gene regulation. DNA sequences at 14 arbitrarily chosen SSR tracts were compared among E. coli strains. Polymorphisms of SSR copy number were observed at four of seven mononucleotide SSR tracts screened, with all polymorphisms occurring in noncoding regions. SSR polymorphism could prove important as a genome-wide source of variation, both for practical applications (including rapid detection, strain identification, and detection of loci affecting key phenotypes) and for evolutionary adaptation of microbes.[The sequence data described in this paper have been submitted to the GenBank data library under accession numbers AF209020–209030 and AF209508–209518.] PMID:10645951
Zhang, Kai; Wu, Zhengdan; Tang, Daobin; Lv, Changwen; Luo, Kai; Zhao, Yong; Liu, Xun; Huang, Yuanxin; Wang, Jichun
2016-01-01
Sweet potato (Ipomoea batatas L.) is a nutritious food crop and, based on the high starch content of its storage root, a potential bioethanol feedstock. Enhancing the nutritional value and starch quantity of storage roots are important goals of sweet potato breeding programs aimed at developing improved varieties for direct consumption, processing, and industrial uses. However, developing improved lines of sweet potato is challenging due to the genetic complexity of this plant and the lack of genome information. Short sequence repeat (SSR) markers are powerful molecular tools for tracking important loci in crops and for molecular-based breeding strategies; however, few SSR markers and marker-trait associations have hitherto been identified in sweet potato. In this study, we identified 1824 SSRs by using a de novo assembly of publicly available ESTs and mRNAs in sweet potato, and designed 1476 primer pairs based on SSR-containing sequences. We mapped 214 pairs of primers in a natural population comprised of 239 germplasms, and identified 1278 alleles with an average of 5.972 alleles per locus and a major allele frequency of 0.7702. Population structure analysis revealed two subpopulations in this panel of germplasms, and phenotypic characterization demonstrated that this panel is suitable for association mapping of starch-related traits. We identified 32, 16, and 17 SSR markers associated with starch content, β-carotene content, and starch composition in the storage root, respectively, using association analysis and further evaluation of a subset of sweet potato genotypes with various characteristics. The SSR markers identified here can be used to select varieties with desired traits and to investigate the genetic mechanism underlying starch and carotenoid formation in the starchy roots of sweet potato. PMID:26973669
Panwar, Preety; Jha, Anand Kumar; Pandey, P K; Gupta, Arun K; Kumar, Anil
2011-06-01
Magnaporthe grisea, the blast fungus is one of the main pathological threats to finger millet crop worldwide. A systematic search for the blast resistance gene analogs was carried out, using functional molecular markers. Three-fourths of the recognition-dependent disease resistance genes (R-genes) identified in plants encodes nucleotide binding site (NBS) leucine-rich repeat (LRR) proteins. NBS-LRR homologs have only been isolated on a limited scale from Eleusine coracana. Genomic DNA sequences sharing homology with NBS region of resistance gene analogs were isolated and characterized from resistant genotypes of finger millet using PCR based approach with primers designed from conserved regions of NBS domain. Attempts were made to identify molecular markers linked to the resistance gene and to differentiate the resistant bulk from the susceptible bulk. A total of 9 NBS-LRR and 11 EST-SSR markers generated 75.6 and 73.5% polymorphism respectively amongst 73 finger millet genotypes. NBS-5, NBS-9, NBS-3 and EST-SSR-04 markers showed a clear polymorphism which differentiated resistant genotypes from susceptible genotypes. By comparing the banding pattern of different resistant and susceptible genotypes, five DNA amplifications of NBS and EST-SSR primers (NBS-05(504,) NBS-09(711), NBS-07(688), NBS-03(509) and EST-SSR-04(241)) were identified as markers for the blast resistance in resistant genotypes. Principal coordinate plot and UPGMA analysis formed similar groups of the genotypes and placed most of the resistant genotypes together showing a high level of genetic relatedness and the susceptible genotypes were placed in different groups on the basis of differential disease score. Our results provided a clue for the cloning of finger millet blast resistance gene analogs which not only facilitate the process of plant breeding but also molecular characterization of blast resistance gene analogs from Eleusine coracana.
Development and validation of new SSR markers from expressed regions in the garlic genome
USDA-ARS?s Scientific Manuscript database
Limited number of simple sequence repeat (SSR) markers is available for the genome of garlic (Allium sativum L.) although SSR markers have become one of the most preferred marker systems because they are typically co-dominant, reproducible, cross species transferable and highly polymorphic. In this ...
Data of 10 SSR markers for genomes of homo sapiens and monkeys.
Reddy, K K V V V S; Raju, S Viswanadha; Someswara Rao, Chinta
2017-06-01
In this data, we present 10 Simple Sequence Repeat(SSR) markers TAGA, TCAT, GAAT, AGAT, AGAA, GATA, TATC, CTTT, TCTG and TCTA which are extracted from the genomes of homo sapiens and monkeys using string matching mechanism [1]. All loci showed 4 Base Pair(bp) in allele size, indicating that there are some polymorphisms between individuals correlating to the number of SSR repeats that maybe useful for the detection of similarity among the genotypes. Collectively, these data show that the SSR extraction is a valuable method to illustrate genetic variation of genomes.
Development and characterization of genomic SSR markers in Cynodon transvaalensis Burtt-Davy.
Tan, Chengcheng; Wu, Yanqi; Taliaferro, Charles M; Bell, Greg E; Martin, Dennis L; Smith, Mike W
2014-08-01
Simple sequence repeat (SSR) markers are a major molecular tool for genetic and genomic research that have been extensively developed and used in major crops. However, few are available in African bermudagrass (Cynodon transvaalensis Burtt-Davy), an economically important warm-season turfgrass species. African bermudagrass is mainly used for hybridizations with common bermudagrass [C. dactylon var. dactylon (L.) Pers.] in the development of superior interspecific hybrid turfgrass cultivars. Accordingly, the major objective of this study was to develop and characterize a large set of SSR markers. Genomic DNA of C. transvaalensis '4200TN 24-2' from an Oklahoma State University (OSU) turf nursery was extracted for construction of four SSR genomic libraries enriched with [CA](n), [GA](n), [AAG](n), and [AAT](n) as core repeat motifs. A total of 3,064 clones were sequenced at the OSU core facility. The sequences were categorized into singletons and contiguous sequences to exclude redundancy. From the two sequence categories, 1,795 SSR loci were identified. After excluding duplicate SSRs by comparison with previously developed SSR markers using a nucleotide basic local alignment tool, 1,426 unique primer pairs (PPs) were designed. Out of the 1,426 designed PPs, 981 (68.8 %) amplified alleles of the expected size in the donor DNA. Polymorphisms of the SSR PPs tested in eight C. transvaalensis plants were 93 % polymorphic with 544 markers effective in all genotypes. Inheritance of the SSRs was examined in six F(1) progeny of African parents 'T577' × 'Uganda', indicating 917 markers amplified heritable alleles. The SSR markers developed in the study are the first large set of co-dominant markers in African bermudagrass and should be highly valuable for molecular and traditional breeding research.
USDA-ARS?s Scientific Manuscript database
Simple sequence repeat (SSR) markers are widely used tools for inferences about genetic diversity, phylogeography and spatial genetic structure. Their applications assume that variation among alleles is essentially caused by an expansion or contraction of the number of repeats and that, accessorily,...
Kishine, Masahiro; Tsutsumi, Katsuji; Kitta, Kazumi
2017-12-01
Simple sequence repeat (SSR) is a popular tool for individual fingerprinting. The long-core motif (e.g. tetra-, penta-, and hexa-nucleotide) simple sequence repeats (SSRs) are preferred because they make it easier to separate and distinguish neighbor alleles. In the present study, a new set of 8 tetra-nucleotide SSRs in potato ( Solanum tuberosum ) is reported. By using these 8 markers, 72 out of 76 cultivars obtained from Japan and the United States were clearly discriminated, while two pairs, both of which arose from natural variation, showed identical profiles. The combined probability of identity between two random cultivars for the set of 8 SSR markers was estimated to be 1.10 × 10 -8 , confirming the usefulness of the proposed SSR markers for fingerprinting analyses of potato.
Construction of a SNP and SSR linkage map in autotetraploid blueberry using genotyping by sequencing
USDA-ARS?s Scientific Manuscript database
A mapping population developed from a cross between two key highbush blueberry cultivars, Draper × Jewel (Vaccinium corymbosum), segregating for a number of important phenotypic traits, has been utilized to produce a genetic linkage map. Data on 233 single sequence repeat (SSR) markers and 1794 sing...
USDA-ARS?s Scientific Manuscript database
Genic microsatellites or simple sequence repeat (genic-SSR) markers were developed in boxwood (Buxus taxa) for genetic diversity analysis, identification of taxa, and to facilitate breeding. cDNA libraries were developed from mRNA extracted from leaves of Buxus sempervirens ‘Vardar Valley’ and seque...
An, Jianyu; Yin, Mengqi; Zhang, Qin; Gong, Dongting; Jia, Xiaowen; Guan, Yajing; Hu, Jin
2017-09-11
Luffa cylindrica (L.) Roem. is an economically important vegetable crop in China. However, the genomic information on this species is currently unknown. In this study, for the first time, a genome survey of L. cylindrica was carried out using next-generation sequencing (NGS) technology. In total, 43.40 Gb sequence data of L. cylindrica , about 54.94× coverage of the estimated genome size of 789.97 Mb, were obtained from HiSeq 2500 sequencing, in which the guanine plus cytosine (GC) content was calculated to be 37.90%. The heterozygosity of genome sequences was only 0.24%. In total, 1,913,731 contigs (>200 bp) with 525 bp N 50 length and 1,410,117 scaffolds (>200 bp) with 885.01 Mb total length were obtained. From the initial assembled L. cylindrica genome, 431,234 microsatellites (SSRs) (≥5 repeats) were identified. The motif types of SSR repeats included 62.88% di-nucleotide, 31.03% tri-nucleotide, 4.59% tetra-nucleotide, 0.96% penta-nucleotide and 0.54% hexa-nucleotide. Eighty genomic SSR markers were developed, and 51/80 primers could be used in both "Zheda 23" and "Zheda 83". Nineteen SSRs were used to investigate the genetic diversity among 32 accessions through SSR-HRM analysis. The unweighted pair group method analysis (UPGMA) dendrogram tree was built by calculating the SSR-HRM raw data. SSR-HRM could be effectively used for genotype relationship analysis of Luffa species.
Development of Genomic Simple Sequence Repeats (SSR) by Enrichment Libraries in Date Palm.
Al-Faifi, Sulieman A; Migdadi, Hussein M; Algamdi, Salem S; Khan, Mohammad Altaf; Al-Obeed, Rashid S; Ammar, Megahed H; Jakse, Jerenj
2017-01-01
Development of highly informative markers such as simple sequence repeats (SSR) for cultivar identification and germplasm characterization and management is essential for date palms genetic studies. The present study documents the development of SSR markers and assesses genetic relationships of commonly grown date palm (Phoenix dactylifera L.) cultivars in different geographical regions of Saudi Arabia. A total of 93 novel simple sequence repeat (SSR) markers were screened for their ability to detect polymorphism in date palm. Around 71% of genomic SSRs are dinucleotide, 25% trinucleotide, 3% tetranucleotide, and 1% pentanucleotide motives and show 100% polymorphism. The Unweighted Pair Group Method with Arithmetic Mean (UPGMA) cluster analysis illustrates that cultivars trend to group according to their class of maturity, region of cultivation, and fruit color. Analysis of molecular variations (AMOVA) reveals genetic variation among and within cultivars of 27% and 73%, respectively, according to the geographical distribution of the cultivars. Developed microsatellite markers are of additional value to date palm characterization, tools which can be used by researchers in population genetics, cultivar identification, as well as genetic resource exploration and management. The cultivars tested exhibited a significant amount of genetic diversity and could be suitable for successful breeding programs. Genomic sequences generated from this study are available at the National Center for Biotechnology Information (NCBI), Sequence Read Archive (Accession numbers. LIBGSS_039019).
Castoe, Todd A.; Poole, Alexander W.; de Koning, A. P. Jason; Jones, Kenneth L.; Tomback, Diana F.; Oyler-McCance, Sara J.; Fike, Jennifer A.; Lance, Stacey L.; Streicher, Jeffrey W.; Smith, Eric N.; Pollock, David D.
2012-01-01
Identification of microsatellites, or simple sequence repeats (SSRs), can be a time-consuming and costly investment requiring enrichment, cloning, and sequencing of candidate loci. Recently, however, high throughput sequencing (with or without prior enrichment for specific SSR loci) has been utilized to identify SSR loci. The direct "Seq-to-SSR" approach has an advantage over enrichment-based strategies in that it does not require a priori selection of particular motifs, or prior knowledge of genomic SSR content. It has been more expensive per SSR locus recovered, however, particularly for genomes with few SSR loci, such as bird genomes. The longer but relatively more expensive 454 reads have been preferred over less expensive Illumina reads. Here, we use Illumina paired-end sequence data to identify potentially amplifiable SSR loci (PALs) from a snake (the Burmese python, Python molurus bivittatus), and directly compare these results to those from 454 data. We also compare the python results to results from Illumina sequencing of two bird genomes (Gunnison Sage-grouse, Centrocercus minimus, and Clark's Nutcracker, Nucifraga columbiana), which have considerably fewer SSRs than the python. We show that direct Illumina Seq-to-SSR can identify and characterize thousands of potentially amplifiable SSR loci for as little as $10 per sample – a fraction of the cost of 454 sequencing. Given that Illumina Seq-to-SSR is effective, inexpensive, and reliable even for species such as birds that have few SSR loci, it seems that there are now few situations for which prior hybridization is justifiable.
Castoe, T.A.; Poole, A.W.; de Koning, A. P. J.; Jones, K.L.; Tomback, D.F.; Oyler-McCance, S.J.; Fike, J.A.; Lance, S.L.; Streicher, J.W.; Smith, E.N.; Pollock, D.D.
2012-01-01
Identification of microsatellites, or simple sequence repeats (SSRs), can be a time-consuming and costly investment requiring enrichment, cloning, and sequencing of candidate loci. Recently, however, high throughput sequencing (with or without prior enrichment for specific SSR loci) has been utilized to identify SSR loci. The direct "Seq-to-SSR" approach has an advantage over enrichment-based strategies in that it does not require a priori selection of particular motifs, or prior knowledge of genomic SSR content. It has been more expensive per SSR locus recovered, however, particularly for genomes with few SSR loci, such as bird genomes. The longer but relatively more expensive 454 reads have been preferred over less expensive Illumina reads. Here, we use Illumina paired-end sequence data to identify potentially amplifiable SSR loci (PALs) from a snake (the Burmese python, Python molurus bivittatus), and directly compare these results to those from 454 data. We also compare the python results to results from Illumina sequencing of two bird genomes (Gunnison Sage-grouse, Centrocercus minimus, and Clark's Nutcracker, Nucifraga columbiana), which have considerably fewer SSRs than the python. We show that direct Illumina Seq-to-SSR can identify and characterize thousands of potentially amplifiable SSR loci for as little as $10 per sample - a fraction of the cost of 454 sequencing. Given that Illumina Seq-to-SSR is effective, inexpensive, and reliable even for species such as birds that have few SSR loci, it seems that there are now few situations for which prior hybridization is justifiable. ?? 2012 Castoe et al.
Castoe, Todd A; Poole, Alexander W; de Koning, A P Jason; Jones, Kenneth L; Tomback, Diana F; Oyler-McCance, Sara J; Fike, Jennifer A; Lance, Stacey L; Streicher, Jeffrey W; Smith, Eric N; Pollock, David D
2012-01-01
Identification of microsatellites, or simple sequence repeats (SSRs), can be a time-consuming and costly investment requiring enrichment, cloning, and sequencing of candidate loci. Recently, however, high throughput sequencing (with or without prior enrichment for specific SSR loci) has been utilized to identify SSR loci. The direct "Seq-to-SSR" approach has an advantage over enrichment-based strategies in that it does not require a priori selection of particular motifs, or prior knowledge of genomic SSR content. It has been more expensive per SSR locus recovered, however, particularly for genomes with few SSR loci, such as bird genomes. The longer but relatively more expensive 454 reads have been preferred over less expensive Illumina reads. Here, we use Illumina paired-end sequence data to identify potentially amplifiable SSR loci (PALs) from a snake (the Burmese python, Python molurus bivittatus), and directly compare these results to those from 454 data. We also compare the python results to results from Illumina sequencing of two bird genomes (Gunnison Sage-grouse, Centrocercus minimus, and Clark's Nutcracker, Nucifraga columbiana), which have considerably fewer SSRs than the python. We show that direct Illumina Seq-to-SSR can identify and characterize thousands of potentially amplifiable SSR loci for as little as $10 per sample--a fraction of the cost of 454 sequencing. Given that Illumina Seq-to-SSR is effective, inexpensive, and reliable even for species such as birds that have few SSR loci, it seems that there are now few situations for which prior hybridization is justifiable.
Novel and highly informative Capsicum SSR markers and their cross-species transferability.
Buso, G S C; Reis, A M M; Amaral, Z P S; Ferreira, M E
2016-09-23
This study was undertaken primarily to develop new simple sequence repeat (SSR) markers for Capsicum. As part of this project aimed at broadening the use of molecular tools in Capsicum breeding, two genomic libraries enriched for AG/TC repeat sequences were constructed for Capsicum annuum. A total of 475 DNA clones were sequenced from both libraries and 144 SSR markers were tested on cultivated and wild species of Capsicum. Forty-five SSR markers were randomly selected to genotype a panel of 48 accessions of the Capsicum germplasm bank. The number of alleles per locus ranged from 2 to 11, with an average of 6 alleles. The polymorphism information content was on average 0.60, ranging from 0.20 to 0.83. The cross-species transferability to seven cultivated and wild Capsicum species was tested with a set of 91 SSR markers. We found that a high proportion of the loci produced amplicons in all species tested. C. frutescens had the highest number of transferable markers, whereas the wild species had the lowest. Our results indicate that the new markers can be readily used in genetic analyses of Capsicum.
Microsatellite DNA as shared genetic markers among conifer species
C.S. Echt; G.G. Vendramin; C. D. Nelson; Paula E. Marquardt
1999-01-01
Polymerase chain reaction (PCR) primer pairs for 21 simple sequence repeat (SSR) loci in Pinus strobus L, and 6 in Pinus radiata D. Don were evaluated to determine whether SSR marker amplification could be achieved in 1O other conifer species. Eighty percent of SSR primer pairs for (AC) loci that were polymorphic in P. ...
Microsatellite DNA as shared genetic markers among conifer species
Craig S. Echt; G.G. Vendramin; C.D. Nelson; P. Marquardt
1999-01-01
Polymerase chain reaction (PCR) primer pairs for 21 simple sequence repeat (SSR) loci in Pinus strobus L. and 6 in Pinus radiata D. Don. were evaluated to determine whether SSR marker amplification could be achieved in 10 other conifer species. Eighty percent of SSR primer pairs for (AC)n loci that were polymorphic in P. ...
Miao, Ning; Zhang, Lei; Li, Maoping; Fan, Liqiang; Mao, Kangshan
2017-01-01
Premise of the study: We developed transcriptome microsatellite markers (simple sequence repeats) for Taxillus nigrans (Loranthaceae) to survey the genetic diversity and population structure of this species. Methods and Results: We used Illumina HiSeq data to reconstruct the transcriptome of T. nigrans by de novo assembly and used the transcriptome to develop a set of simple sequence repeat markers. Overall, 40 primer pairs were designed and tested; 19 of them amplified successfully and demonstrated polymorphisms. Two loci that detected null alleles were eliminated, and the remaining 17, which were subjected to further analyses, yielded two to 21 alleles per locus. Conclusions: The markers will serve as a basis for studies to assess the extent and pattern of distribution of genetic variation in T. nigrans, and they may also be useful in conservation genetic, ecological, and evolutionary studies of the genus Taxillus, a group of plant species of importance in Chinese traditional medicine. PMID:28924510
Lin, Heng-Sheng; Chiang, Chih-Yun; Chang, Song-Bin; Kuoh, Chang-Sheng
2011-01-01
Foxtail millet is one of the world's oldest cultivated crops. It has been adopted as a model organism for providing a deeper understanding of plant biology. In this study, 45 simple sequence repeats (SSR) markers of Setaria italica were developed. These markers showing polymorphism were screened in 223 samples from 12 foxtail millet populations around Taiwan. The most common dinucleotide and trinucleotide repeat motifs are AC/TG (84.21%) and CAT (46.15%). The average number of alleles (N(a)), the average heterozygosities observed (H(o)) and expected (H(e)) are 3.73, 0.714, 0.587, respectively. In addition, 24 SSR markers had shown transferability to six related Poaceae species. These new markers provide tools for examining genetic relatedness among foxtail millet populations and other related species. It is suitable for germplasm management and protection in Poaceae.
Lin, Heng-Sheng; Chiang, Chih-Yun; Chang, Song-Bin; Kuoh, Chang-Sheng
2011-01-01
Foxtail millet is one of the world’s oldest cultivated crops. It has been adopted as a model organism for providing a deeper understanding of plant biology. In this study, 45 simple sequence repeats (SSR) markers of Setaria italica were developed. These markers showing polymorphism were screened in 223 samples from 12 foxtail millet populations around Taiwan. The most common dinucleotide and trinucleotide repeat motifs are AC/TG (84.21%) and CAT (46.15%). The average number of alleles (Na), the average heterozygosities observed (Ho) and expected (He) are 3.73, 0.714, 0.587, respectively. In addition, 24 SSR markers had shown transferability to six related Poaceae species. These new markers provide tools for examining genetic relatedness among foxtail millet populations and other related species. It is suitable for germplasm management and protection in Poaceae. PMID:22174636
Rowland, Lisa J; Alkharouf, Nadim; Darwish, Omar; Ogden, Elizabeth L; Polashock, James J; Bassil, Nahla V; Main, Dorrie
2012-04-02
There has been increased consumption of blueberries in recent years fueled in part because of their many recognized health benefits. Blueberry fruit is very high in anthocyanins, which have been linked to improved night vision, prevention of macular degeneration, anti-cancer activity, and reduced risk of heart disease. Very few genomic resources have been available for blueberry, however. Further development of genomic resources like expressed sequence tags (ESTs), molecular markers, and genetic linkage maps could lead to more rapid genetic improvement. Marker-assisted selection could be used to combine traits for climatic adaptation with fruit and nutritional quality traits. Efforts to sequence the transcriptome of the commercial highbush blueberry (Vaccinium corymbosum) cultivar Bluecrop and use the sequences to identify genes associated with cold acclimation and fruit development and develop SSR markers for mapping studies are presented here. Transcriptome sequences were generated from blueberry fruit at different stages of development, flower buds at different stages of cold acclimation, and leaves by next-generation Roche 454 sequencing. Over 600,000 reads were assembled into approximately 15,000 contigs and 124,000 singletons. The assembled sequences were annotated and functionally mapped to Gene Ontology (GO) terms. Frequency of the most abundant sequences in each of the libraries was compared across all libraries to identify genes that are potentially differentially expressed during cold acclimation and fruit development. Real-time PCR was performed to confirm their differential expression patterns. Overall, 14 out of 17 of the genes examined had differential expression patterns similar to what was predicted from their reads alone. The assembled sequences were also mined for SSRs. From these sequences, 15,886 blueberry EST-SSR loci were identified. Primers were designed from 7,705 of the SSR-containing sequences with adequate flanking sequence. One hundred primer pairs were tested for amplification and polymorphism among parents of two blueberry populations currently being used for genetic linkage map construction. The tetraploid mapping population was based on a cross between the highbush cultivars Draper and Jewel (V. darrowii is also in the background of 'Jewel'). The diploid mapping population was based on a cross between an F1 hybrid of V. darrowii and diploid V. corymbosum and another diploid V. corymbosum. The overall amplification rate of the SSR primers was 68% and the polymorphism rate was 43%. These results indicate that this large collection of 454 ESTs will be a valuable resource for identifying genes that are potentially differentially expressed and play important roles in flower bud development, cold acclimation, chilling unit accumulation, and fruit development in blueberry and related species. In addition, the ESTs have already proved useful for the development of SSR and EST-PCR markers, and are currently being used for construction of genetic linkage maps in blueberry.
2012-01-01
Background There has been increased consumption of blueberries in recent years fueled in part because of their many recognized health benefits. Blueberry fruit is very high in anthocyanins, which have been linked to improved night vision, prevention of macular degeneration, anti-cancer activity, and reduced risk of heart disease. Very few genomic resources have been available for blueberry, however. Further development of genomic resources like expressed sequence tags (ESTs), molecular markers, and genetic linkage maps could lead to more rapid genetic improvement. Marker-assisted selection could be used to combine traits for climatic adaptation with fruit and nutritional quality traits. Results Efforts to sequence the transcriptome of the commercial highbush blueberry (Vaccinium corymbosum) cultivar Bluecrop and use the sequences to identify genes associated with cold acclimation and fruit development and develop SSR markers for mapping studies are presented here. Transcriptome sequences were generated from blueberry fruit at different stages of development, flower buds at different stages of cold acclimation, and leaves by next-generation Roche 454 sequencing. Over 600,000 reads were assembled into approximately 15,000 contigs and 124,000 singletons. The assembled sequences were annotated and functionally mapped to Gene Ontology (GO) terms. Frequency of the most abundant sequences in each of the libraries was compared across all libraries to identify genes that are potentially differentially expressed during cold acclimation and fruit development. Real-time PCR was performed to confirm their differential expression patterns. Overall, 14 out of 17 of the genes examined had differential expression patterns similar to what was predicted from their reads alone. The assembled sequences were also mined for SSRs. From these sequences, 15,886 blueberry EST-SSR loci were identified. Primers were designed from 7,705 of the SSR-containing sequences with adequate flanking sequence. One hundred primer pairs were tested for amplification and polymorphism among parents of two blueberry populations currently being used for genetic linkage map construction. The tetraploid mapping population was based on a cross between the highbush cultivars Draper and Jewel (V. darrowii is also in the background of 'Jewel'). The diploid mapping population was based on a cross between an F1 hybrid of V. darrowii and diploid V. corymbosum and another diploid V. corymbosum. The overall amplification rate of the SSR primers was 68% and the polymorphism rate was 43%. Conclusions These results indicate that this large collection of 454 ESTs will be a valuable resource for identifying genes that are potentially differentially expressed and play important roles in flower bud development, cold acclimation, chilling unit accumulation, and fruit development in blueberry and related species. In addition, the ESTs have already proved useful for the development of SSR and EST-PCR markers, and are currently being used for construction of genetic linkage maps in blueberry. PMID:22471859
An, Jianyu; Yin, Mengqi; Zhang, Qin; Gong, Dongting; Jia, Xiaowen; Guan, Yajing; Hu, Jin
2017-01-01
Luffa cylindrica (L.) Roem. is an economically important vegetable crop in China. However, the genomic information on this species is currently unknown. In this study, for the first time, a genome survey of L. cylindrica was carried out using next-generation sequencing (NGS) technology. In total, 43.40 Gb sequence data of L. cylindrica, about 54.94× coverage of the estimated genome size of 789.97 Mb, were obtained from HiSeq 2500 sequencing, in which the guanine plus cytosine (GC) content was calculated to be 37.90%. The heterozygosity of genome sequences was only 0.24%. In total, 1,913,731 contigs (>200 bp) with 525 bp N50 length and 1,410,117 scaffolds (>200 bp) with 885.01 Mb total length were obtained. From the initial assembled L. cylindrica genome, 431,234 microsatellites (SSRs) (≥5 repeats) were identified. The motif types of SSR repeats included 62.88% di-nucleotide, 31.03% tri-nucleotide, 4.59% tetra-nucleotide, 0.96% penta-nucleotide and 0.54% hexa-nucleotide. Eighty genomic SSR markers were developed, and 51/80 primers could be used in both “Zheda 23” and “Zheda 83”. Nineteen SSRs were used to investigate the genetic diversity among 32 accessions through SSR-HRM analysis. The unweighted pair group method analysis (UPGMA) dendrogram tree was built by calculating the SSR-HRM raw data. SSR-HRM could be effectively used for genotype relationship analysis of Luffa species. PMID:28891982
Katoh, Hiroshi; Subandiyah, Siti; Tomimura, Kenta; Okuda, Mitsuru; Su, Hong-Ji; Iwanami, Toru
2011-01-01
Four highly polymorphic simple sequence repeat (SSR) loci were selected and used to differentiate 84 Japanese isolates of “Candidatus Liberibacter asiaticus.” The Nei's measure of genetic diversity values for these four SSRs ranged from 0.60 to 0.86. The four SSR loci were also highly polymorphic in four isolates from Taiwan and 12 isolates from Indonesia. PMID:21239554
USDA-ARS?s Scientific Manuscript database
In recent years SSR markers have been used widely for the genetic analysis. The objective of present research was to use SSR markers to develop DNA-based genetic identification and analyze genetic relationship of sugarcane cultivars grown in Pakistan either resistant or susceptible to red rot. Twent...
Vogiatzi, Emmanouella; Lagnel, Jacques; Pakaki, Victoria; Louro, Bruno; Canario, Adelino V M; Reinhardt, Richard; Kotoulas, Georgios; Magoulas, Antonios; Tsigenopoulos, Costas S
2011-06-01
We screened for simple sequence repeats (SSRs) found in ESTs derived from an EST-database development project ('Marine Genomics Europe' Network of Excellence). Different motifs of di-, tri-, tetra-, penta- and hexanucleotide SSRs were evaluated for variation in length and position in the expressed sequences, relative abundance and distribution in gilthead sea bream (Sparus aurata). We found 899 ESTs that harbor 997 SSRs (4.94%). On average, one SSR was found per 2.95 kb of EST sequence and the dinucleotide SSRs are the most abundant accounting for 47.6% of the total number. EST-SSRs were used as template for primer design. 664 primer pairs could be successfully identified and a subset of 206 pairs of primers was synthesized, PCR-tested and visualized on ethidium bromide stained agarose gels. The main objective was to further assess the potential of EST-SSRs as informative markers and investigate their cross-species amplification in sixteen teleost fish species: seven sparid species and nine other species from different families. Approximately 78% of the primer pairs gave PCR products of expected size in gilthead sea bream, and as expected, the rate of successful amplification of sea bream EST-SSRs was higher in sparids, lower in other perciforms and even lower in species of the Clupeiform and Gadiform orders. We finally determined the polymorphism and the heterozygosity of 63 markers in a wild gilthead sea bream population; fifty-eight loci were found to be polymorphic with the expected heterozygosity and the number of alleles ranging from 0.089 to 0.946 and from 2 to 27, respectively. These tools and markers are expected to enhance the available genetic linkage map in gilthead sea bream, to assist comparative mapping and genome analyses for this species and further with other model fish species and finally to help advance genetic analysis for cultivated and wild populations and accelerate breeding programs. Copyright © 2011 Elsevier B.V. All rights reserved.
Sathyanarayana, N; Pittala, Ranjith Kumar; Tripathi, Pankaj Kumar; Chopra, Ratan; Singh, Heikham Russiachand; Belamkar, Vikas; Bhardwaj, Pardeep Kumar; Doyle, Jeff J; Egan, Ashley N
2017-05-25
The medicinal legume Mucuna pruriens (L.) DC. has attracted attention worldwide as a source of the anti-Parkinson's drug L-Dopa. It is also a popular green manure cover crop that offers many agronomic benefits including high protein content, nitrogen fixation and soil nutrients. The plant currently lacks genomic resources and there is limited knowledge on gene expression, metabolic pathways, and genetics of secondary metabolite production. Here, we present transcriptomic resources for M. pruriens, including a de novo transcriptome assembly and annotation, as well as differential transcript expression analyses between root, leaf, and pod tissues. We also develop microsatellite markers and analyze genetic diversity and population structure within a set of Indian germplasm accessions. One-hundred ninety-one million two hundred thirty-three thousand two hundred forty-two bp cleaned reads were assembled into 67,561 transcripts with mean length of 626 bp and N50 of 987 bp. Assembled sequences were annotated using BLASTX against public databases with over 80% of transcripts annotated. We identified 7,493 simple sequence repeat (SSR) motifs, including 787 polymorphic repeats between the parents of a mapping population. 134 SSRs from expressed sequenced tags (ESTs) were screened against 23 M. pruriens accessions from India, with 52 EST-SSRs retained after quality control. Population structure analysis using a Bayesian framework implemented in fastSTRUCTURE showed nearly similar groupings as with distance-based (neighbor-joining) and principal component analyses, with most of the accessions clustering per geographical origins. Pair-wise comparison of transcript expression in leaves, roots and pods identified 4,387 differentially expressed transcripts with the highest number occurring between roots and leaves. Differentially expressed transcripts were enriched with transcription factors and transcripts annotated as belonging to secondary metabolite pathways. The M. pruriens transcriptomic resources generated in this study provide foundational resources for gene discovery and development of molecular markers. Polymorphic SSRs identified can be used for genetic diversity, marker-trait analyses, and development of functional markers for crop improvement. The results of differential expression studies can be used to investigate genes involved in L-Dopa synthesis and other key metabolic pathways in M. pruriens.
Wang, Jian-Sheng; He, Jun-Hu; Chen, Hua-Rui; Chen, Ye-Yuan; Qiao, Fei
2017-12-01
Inter simple sequence repeat (ISSR) and simple sequence repeat (SSR) markers were used to assess the genetic diversity of 36 pineapple accessions that were introduced from 10 countries/regions. Thirteen ISSR primers amplified 96 bands, of which 91 (93.65%) were polymorphic, whereas 20 SSR primers amplified 73 bands, of which 70 (96.50%) were polymorphic. Nei's gene diversity (h = 0.28), Shannon's information index (I = 0.43), and polymorphism information content (PIC = 0.29) generated using the SSR primers were higher than that with ISSR primers (h = 0.23, I = 0.37, PIC = 0.24), thereby suggesting that the SSR system is more efficient than the ISSR system in assessing genetic diversity in various pineapple accessions. Mean genetic similarities were 0.74, 0.61, and 0.69, as determined using ISSR, SSR, and combined ISSR/SSR, respectively. These results suggest that the genetic diversity among pineapple accessions is very high. We clustered the 36 pineapple accessions into three or five groups on the basis of the phylogenetic trees constructed based on the results of ISSR, SSR, and combined ISSR/SSR analyses using the unweighted pair-group with arithmetic averaging (UPGMA) method. The results of principal components analysis (PCA) also supported the UPGMA clustering. These results will be useful not only for the scientific conservation and management of pineapple germplasm but also for the improvement of the current pineapple breeding strategies.
High levels of heterozygosity found for 15 SSR loci in Solanum chacoense
USDA-ARS?s Scientific Manuscript database
Genetic variation is a necessary prerequisite for improving domesticated plants through breeding; without it, breeding progress would be impossible. Genetic variation can be readily ascertained with co-dominant DNA markers, such as simple sequence repeats (SSRs). Twenty-four SSR markers specifically...
Genetic differentiation and geographical relationship of Asian barley landraces using SSRs
USDA-ARS?s Scientific Manuscript database
Genetic diversity in 403 morphologically distinctive landraces of barley (Hordeum vulgare L. subsp. vulgare) originating from seven geographical zones of Asia was studied using simple sequence repeat (SSR) markers. The seven polymorphic SSR markers representing each chromosome chosen for this study ...
Joy, Nisha; Asha, Srinivasan; Mallika, Vijayan; Soniya, Eppurathu Vasudevan
2013-01-01
Next generation sequencing has an advantageon transformational development of species with limited available sequence data as it helps to decode the genome and transcriptome. We carried out the de novo sequencing using illuminaHiSeq™ 2000 to generate the first leaf transcriptome of black pepper (Piper nigrum L.), an important spice variety native to South India and also grown in other tropical regions. Despite the economic and biochemical importance of pepper, a scientifically rigorous study at the molecular level is far from complete due to lack of sufficient sequence information and cytological complexity of its genome. The 55 million raw reads obtained, when assembled using Trinity program generated 2,23,386 contigs and 1,28,157 unigenes. Reports suggest that the repeat-rich genomic regions give rise to small non-coding functional RNAs. MicroRNAs (miRNAs) are the most abundant type of non-coding regulatory RNAs. In spite of the widespread research on miRNAs, little is known about the hair-pin precursors of miRNAs bearing Simple Sequence Repeats (SSRs). We used the array of transcripts generated, for the in silico prediction and detection of '43 pre-miRNA candidates bearing different types of SSR motifs'. The analysis identified 3913 different types of SSR motifs with an average of one SSR per 3.04 MB of thetranscriptome. About 0.033% of the transcriptome constituted 'pre-miRNA candidates bearing SSRs'. The abundance, type and distribution of SSR motifs studied across the hair-pin miRNA precursors, showed a significant bias in the position of SSRs towards the downstream of predicted 'pre-miRNA candidates'. The catalogue of transcripts identified, together with the demonstration of reliable existence of SSRs in the miRNA precursors, permits future opportunities for understanding the genetic mechanism of black pepper and likely functions of 'tandem repeats' in miRNAs.
USDA-ARS?s Scientific Manuscript database
Background: Due to a relatively high level of codominant inheritance and transferability within and among taxonomic groups, simple sequence repeat (SSR) markers are important elements in comparative mapping and delineation of genomic regions associated with traits of economic importance. Expressed S...
Mining and Development of Novel SSR Markers Using Next Generation Sequencing (NGS) Data in Plants.
Taheri, Sima; Lee Abdullah, Thohirah; Yusop, Mohd Rafii; Hanafi, Mohamed Musa; Sahebi, Mahbod; Azizi, Parisa; Shamshiri, Redmond Ramin
2018-02-13
Microsatellites, or simple sequence repeats (SSRs), are one of the most informative and multi-purpose genetic markers exploited in plant functional genomics. However, the discovery of SSRs and development using traditional methods are laborious, time-consuming, and costly. Recently, the availability of high-throughput sequencing technologies has enabled researchers to identify a substantial number of microsatellites at less cost and effort than traditional approaches. Illumina is a noteworthy transcriptome sequencing technology that is currently used in SSR marker development. Although 454 pyrosequencing datasets can be used for SSR development, this type of sequencing is no longer supported. This review aims to present an overview of the next generation sequencing, with a focus on the efficient use of de novo transcriptome sequencing (RNA-Seq) and related tools for mining and development of microsatellites in plants.
UPIC: Perl scripts to determine the number of SSR markers to run
USDA-ARS?s Scientific Manuscript database
We have developed Perl Scripts for the cost-effective planning of fingerprinting and genotyping experiments. The UPIC scripts detect the best combination of polymorphic simple sequence repeat (SSR) markers and provide coefficients of the amount of information obtainable (number of alleles of patter...
The art of attrition: development of robust oat microsatellites
USDA-ARS?s Scientific Manuscript database
Microsatellite or simple sequence repeat (SSR) markers are important tools for genetic analyses, especially those targeting diversity, based on the fact that multiple alleles can occur at a given locus. Currently, only 160 genomic-based SSR markers are publicly available for oat, most of which have...
Independently segregating simple sequence repeats (SSR) alleles in polyploid sugarcane
USDA-ARS?s Scientific Manuscript database
The complex nuclear genomic and flower structures of sugarcane cultivars (Saccharum hybrids spp., 2n = 10x = 100 – 130) render sugarcane a difficult subject for genetics research. Using a capillary electrophoresis- and fluorescence-labeling-based SSR genotyping platform, the segregation of a multi-a...
Microsatellite analysis in the genome of Acanthaceae: An in silico approach.
Kaliswamy, Priyadharsini; Vellingiri, Srividhya; Nathan, Bharathi; Selvaraj, Saravanakumar
2015-01-01
Acanthaceae is one of the advanced and specialized families with conventionally used medicinal plants. Simple sequence repeats (SSRs) play a major role as molecular markers for genome analysis and plant breeding. The microsatellites existing in the complete genome sequences would help to attain a direct role in the genome organization, recombination, gene regulation, quantitative genetic variation, and evolution of genes. The current study reports the frequency of microsatellites and appropriate markers for the Acanthaceae family genome sequences. The whole nucleotide sequences of Acanthaceae species were obtained from National Center for Biotechnology Information database and screened for the presence of SSRs. SSR Locator tool was used to predict the microsatellites and inbuilt Primer3 module was used for primer designing. Totally 110 repeats from 108 sequences of Acanthaceae family plant genomes were identified, and the occurrence of dinucleotide repeats was found to be abundant in the genome sequences. The essential amino acid isoleucine was found rich in all the sequences. We also designed the SSR-based primers/markers for 59 sequences of this family that contains microsatellite repeats in their genome. The identified microsatellites and primers might be useful for breeding and genetic studies of plants that belong to Acanthaceae family in the future.
Zhou, L X; Xiao, Y; Xia, W; Yang, Y D
2015-12-08
Genetic diversity and patterns of population structure of the 94 oil palm lines were investigated using species-specific simple sequence repeat (SSR) markers. We designed primers for 63 SSR loci based on their flanking sequences and conducted amplification in 94 oil palm DNA samples. The amplification result showed that a relatively high level of genetic diversity was observed between oil palm individuals according a set of 21 polymorphic microsatellite loci. The observed heterozygosity (Ho) was 0.3683 and 0.4035, with an average of 0.3859. The Ho value was a reliable determinant of the discriminatory power of the SSR primer combinations. The principal component analysis and unweighted pair-group method with arithmetic averaging cluster analysis showed the 94 oil palm lines were grouped into one cluster. These results demonstrated that the oil palm in Hainan Province of China and the germplasm introduced from Malaysia may be from the same source. The SSR protocol was effective and reliable for assessing the genetic diversity of oil palm. Knowledge of the genetic diversity and population structure will be crucial for establishing appropriate management stocks for this species.
Generation and analysis of expressed sequence tags in the extreme large genomes Lilium and Tulipa.
Shahin, Arwa; van Kaauwen, Martijn; Esselink, Danny; Bargsten, Joachim W; van Tuyl, Jaap M; Visser, Richard G F; Arens, Paul
2012-11-20
Bulbous flowers such as lily and tulip (Liliaceae family) are monocot perennial herbs that are economically very important ornamental plants worldwide. However, there are hardly any genetic studies performed and genomic resources are lacking. To build genomic resources and develop tools to speed up the breeding in both crops, next generation sequencing was implemented. We sequenced and assembled transcriptomes of four lily and five tulip genotypes using 454 pyro-sequencing technology. Successfully, we developed the first set of 81,791 contigs with an average length of 514 bp for tulip, and enriched the very limited number of 3,329 available ESTs (Expressed Sequence Tags) for lily with 52,172 contigs with an average length of 555 bp. The contigs together with singletons covered on average 37% of lily and 39% of tulip estimated transcriptome. Mining lily and tulip sequence data for SSRs (Simple Sequence Repeats) showed that di-nucleotide repeats were twice more abundant in UTRs (UnTranslated Regions) compared to coding regions, while tri-nucleotide repeats were equally spread over coding and UTR regions. Two sets of single nucleotide polymorphism (SNP) markers suitable for high throughput genotyping were developed. In the first set, no SNPs flanking the target SNP (50 bp on either side) were allowed. In the second set, one SNP in the flanking regions was allowed, which resulted in a 2 to 3 fold increase in SNP marker numbers compared with the first set. Orthologous groups between the two flower bulbs: lily and tulip (12,017 groups) and among the three monocot species: lily, tulip, and rice (6,900 groups) were determined using OrthoMCL. Orthologous groups were screened for common SNP markers and EST-SSRs to study synteny between lily and tulip, which resulted in 113 common SNP markers and 292 common EST-SSR. Lily and tulip contigs generated were annotated and described according to Gene Ontology terminology. Two transcriptome sets were built that are valuable resources for marker development, comparative genomic studies and candidate gene approaches. Next generation sequencing of leaf transcriptome is very effective; however, deeper sequencing and using more tissues and stages is advisable for extended comparative studies.
Vibrio vulnificus typing based on simple sequence repeats: insights into the biotype 3 group.
Broza, Yoav Y; Danin-Poleg, Yael; Lerner, Larisa; Broza, Meir; Kashi, Yechezkel
2007-09-01
Vibrio vulnificus is an opportunistic, highly invasive human pathogen with worldwide distribution. V. vulnificus strains are commonly divided into three biochemical groups (biotypes), most members of which are pathogenic. Simple sequence repeats (SSR) provide a source of high-level genomic polymorphism used in bacterial typing. Here, we describe the use of variations in mutable SSR loci for accurate and rapid genotyping of V. vulnificus. An in silico screen of the genomes of two V. vulnificus strains revealed thousands of SSR tracts. Twelve SSR with core motifs longer than 5 bp in a panel of 32 characterized and 56 other V. vulnificus isolates, including both clinical and environmental isolates from all three biotypes, were tested for polymorphism. All tested SSR were polymorphic, and diversity indices ranged from 0.17 to 0.90, allowing a high degree of discrimination among isolates (27 of 32 characterized isolates). Genetic analysis of the SSR data resulted in the clear distinction of isolates that belong to the highly virulent biotype 3 group. Despite the clonal nature of this new group, SSR analysis demonstrated high-level discriminatory power within the biotype 3 group, as opposed to other molecular methods that failed to differentiate these isolates. Thus, SSR are suitable for rapid typing and classification of V. vulnificus strains by high-throughput capillary electrophoresis methods. SSR (>/=5 bp) by their nature enable the identification of variations occurring on a small scale and, therefore, may provide new insights into the newly emerged biotype 3 group of V. vulnificus and may be used as an efficient tool in epidemiological studies.
Ramchiary, Nirala; Nguyen, Van Dan; Li, Xiaonan; Hong, Chang Pyo; Dhandapani, Vignesh; Choi, Su Ryun; Yu, Ge; Piao, Zhong Yun; Lim, Yong Pyo
2011-01-01
Genic microsatellite markers, also known as functional markers, are preferred over anonymous markers as they reveal the variation in transcribed genes among individuals. In this study, we developed a total of 707 expressed sequence tag-derived simple sequence repeat markers (EST-SSRs) and used for development of a high-density integrated map using four individual mapping populations of B. rapa. This map contains a total of 1426 markers, consisting of 306 EST-SSRs, 153 intron polymorphic markers, 395 bacterial artificial chromosome-derived SSRs (BAC-SSRs), and 572 public SSRs and other markers covering a total distance of 1245.9 cM of the B. rapa genome. Analysis of allelic diversity in 24 B. rapa germplasm using 234 mapped EST-SSR markers showed amplification of 2 alleles by majority of EST-SSRs, although amplification of alleles ranging from 2 to 8 was found. Transferability analysis of 167 EST-SSRs in 35 species belonging to cultivated and wild brassica relatives showed 42.51% (Sysimprium leteum) to 100% (B. carinata, B. juncea, and B. napus) amplification. Our newly developed EST-SSRs and high-density linkage map based on highly transferable genic markers would facilitate the molecular mapping of quantitative trait loci and the positional cloning of specific genes, in addition to marker-assisted selection and comparative genomic studies of B. rapa with other related species. PMID:21768136
Wada, Takuya; Oku, Koichiro; Nagano, Soichiro; Isobe, Sachiko; Suzuki, Hideyuki; Mori, Miyuki; Takata, Kinuko; Hirata, Chiharu; Shimomura, Katsumi; Tsubone, Masao; Katayama, Takao; Hirashima, Keita; Uchimura, Yosuke; Ikegami, Hidetoshi; Sueyoshi, Takayuki; Obu, Ko-ichi; Hayashida, Tatsuya; Shibato, Yasushi
2017-01-01
A strawberry Multi-parent Advanced Generation Intercrosses (MAGIC) population, derived from crosses using six strawberry cultivars was successfully developed. The population was composed of 338 individuals; genome conformation was evaluated by expressed sequence tag-derived simple short repeat (EST-SSR) markers. Cluster analysis and principal component analysis (PCA) based on EST-SSR marker polymorphisms revealed that the MAGIC population was a mosaic of the six founder cultivars and covered the genomic regions of the six founders evenly. Fruit quality related traits, including days to flowering (DTF), fruit weight (FW), fruit firmness (FF), fruit color (FC), soluble solid content (SC), and titratable acidity (TA), of the MAGIC population were evaluated over two years. All traits showed normal transgressive segregation beyond the founder cultivars and most traits, except for DTF, distributed normally. FC exhibited the highest correlation coefficient overall and was distributed normally regardless of differences in DTF, FW, FF, SC, and TA. These facts were supported by PCA using fruit quality related values as explanatory variables, suggesting that major genetic factors, which are not influenced by fluctuations in other fruit traits, could control the distribution of FC. This MAGIC population is a promising resource for genome-wide association studies and genomic selection for efficient strawberry breeding. PMID:29085247
Du, Wanli; Wang, Jing; Wang, Liangming; Zhang, Jun; Chen, Xinhong; Zhao, Jixin; Yang, Qunhui; Wu, Jun
2013-01-01
The aim of this study was to characterize a Triticum aestivum-Psathyrostachys huashanica Keng (2n = 2x = 14, NsNs) disomic addition line 2-1-6-3. Individual line 2-1-6-3 plants were analyzed using cytological, genomic in situ hybridization (GISH), EST-SSR, and EST-STS techniques. The alien addition line 2-1-6-3 was shown to have two P. huashanica chromosomes, with a meiotic configuration of 2n = 44 = 22 II. We tested 55 EST-SSR and 336 EST-STS primer pairs that mapped onto seven different wheat chromosomes using DNA from parents and the P. huashanica addition line. One EST-SSR and nine EST-STS primer pairs indicated that the additional chromosome of P. huashanica belonged to homoeologous group 7, the diagnostic fragments of five EST-STS markers (BE404955, BE591127, BE637663, BF482781 and CD452422) were cloned, sequenced and compared. The results showed that the amplified polymorphic bands of P. huashanica and disomic addition line 2-1-6-3 shared 100% sequence identity, which was designated as the 7Ns disomic addition line. Disomic addition line 2-1-6-3 was evaluated to test the leaf rust resistance of adult stages in the field. We found that one pair of the 7Ns genome chromosomes carried new leaf rust resistance gene(s). Moreover, wheat line 2-1-6-3 had a superior numbers of florets and grains per spike, which were associated with the introgression of the paired P. huashanica chromosomes. These high levels of disease resistance and stable, excellent agronomic traits suggest that this line could be utilized as a novel donor in wheat breeding programs. PMID:23976963
Moon, Suyun; Lee, Hwa-Yong; Shim, Donghwan; Kim, Myungkil; Ka, Kang-Hyeon; Ryoo, Rhim; Ko, Han-Gyu; Koo, Chang-Duck; Chung, Jong-Wook; Ryu, Hojin
2017-06-01
Sixteen genomic DNA simple sequence repeat (SSR) markers of Lentinula edodes were developed from 205 SSR motifs present in 46.1-Mb long L. edodes genome sequences. The number of alleles ranged from 3-14 and the major allele frequency was distributed from 0.17-0.96. The values of observed and expected heterozygosity ranged from 0.00-0.76 and 0.07-0.90, respectively. The polymorphic information content value ranged from 0.07-0.89. A dendrogram, based on 16 SSR markers clustered by the paired hierarchical clustering' method, showed that 33 shiitake cultivars could be divided into three major groups and successfully identified. These SSR markers will contribute to the efficient breeding of this species by providing diversity in shiitake varieties. Furthermore, the genomic information covered by the markers can provide a valuable resource for genetic linkage map construction, molecular mapping, and marker-assisted selection in the shiitake mushroom.
Kalra, Shikha; Puniya, Bhanwar Lal; Kulshreshtha, Deepika; Kumar, Sunil; Kaur, Jagdeep; Ramachandran, Srinivasan; Singh, Kashmir
2013-01-01
Chlorophytum borivilianum, an endangered medicinal plant species is highly recognized for its aphrodisiac properties provided by saponins present in the plant. The transcriptome information of this species is limited and only few hundred expressed sequence tags (ESTs) are available in the public databases. To gain molecular insight of this plant, high throughput transcriptome sequencing of leaf RNA was carried out using Illumina's HiSeq 2000 sequencing platform. A total of 22,161,444 single end reads were retrieved after quality filtering. Available (e.g., De-Bruijn/Eulerian graph) and in-house developed bioinformatics tools were used for assembly and annotation of transcriptome. A total of 101,141 assembled transcripts were obtained, with coverage size of 22.42 Mb and average length of 221 bp. Guanine-cytosine (GC) content was found to be 44%. Bioinformatics analysis, using non-redundant proteins, gene ontology (GO), enzyme commission (EC) and kyoto encyclopedia of genes and genomes (KEGG) databases, extracted all the known enzymes involved in saponin and flavonoid biosynthesis. Few genes of the alkaloid biosynthesis, along with anticancer and plant defense genes, were also discovered. Additionally, several cytochrome P450 (CYP450) and glycosyltransferase unique sequences were also found. We identified simple sequence repeat motifs in transcripts with an abundance of di-nucleotide simple sequence repeat (SSR; 43.1%) markers. Large scale expression profiling through Reads per Kilobase per Million mapped reads (RPKM) showed major genes involved in different metabolic pathways of the plant. Genes, expressed sequence tags (ESTs) and unique sequences from this study provide an important resource for the scientific community, interested in the molecular genetics and functional genomics of C. borivilianum. PMID:24376689
Kalra, Shikha; Puniya, Bhanwar Lal; Kulshreshtha, Deepika; Kumar, Sunil; Kaur, Jagdeep; Ramachandran, Srinivasan; Singh, Kashmir
2013-01-01
Chlorophytum borivilianum, an endangered medicinal plant species is highly recognized for its aphrodisiac properties provided by saponins present in the plant. The transcriptome information of this species is limited and only few hundred expressed sequence tags (ESTs) are available in the public databases. To gain molecular insight of this plant, high throughput transcriptome sequencing of leaf RNA was carried out using Illumina's HiSeq 2000 sequencing platform. A total of 22,161,444 single end reads were retrieved after quality filtering. Available (e.g., De-Bruijn/Eulerian graph) and in-house developed bioinformatics tools were used for assembly and annotation of transcriptome. A total of 101,141 assembled transcripts were obtained, with coverage size of 22.42 Mb and average length of 221 bp. Guanine-cytosine (GC) content was found to be 44%. Bioinformatics analysis, using non-redundant proteins, gene ontology (GO), enzyme commission (EC) and kyoto encyclopedia of genes and genomes (KEGG) databases, extracted all the known enzymes involved in saponin and flavonoid biosynthesis. Few genes of the alkaloid biosynthesis, along with anticancer and plant defense genes, were also discovered. Additionally, several cytochrome P450 (CYP450) and glycosyltransferase unique sequences were also found. We identified simple sequence repeat motifs in transcripts with an abundance of di-nucleotide simple sequence repeat (SSR; 43.1%) markers. Large scale expression profiling through Reads per Kilobase per Million mapped reads (RPKM) showed major genes involved in different metabolic pathways of the plant. Genes, expressed sequence tags (ESTs) and unique sequences from this study provide an important resource for the scientific community, interested in the molecular genetics and functional genomics of C. borivilianum.
Identification, validation and cross-species transferability of novel Lavandula EST-SSRs.
Adal, Ayelign M; Demissie, Zerihun A; Mahmoud, Soheil S
2015-04-01
We identified and characterized EST-SSRs with strong discrimination power against Lavandula angustifolia and Lavandula x intermedia . The markers also showed considerable cross-species transferability rate into six related Lavandula species. Lavenders (Lavandula) are important economical crops grown around the globe for essential oil production. In an attempt to develop genetic markers for these plants, we analyzed over 13,000 unigenes developed from L. angustifolia and L. x intermedia EST databases, and identified 3,459 simple sequence repeats (SSR), which were dominated by trinucleotides (41.2 %) and dinucleotides (31.45 %). Approximately, 19 % of the unigenes contained at least one SSR marker, over 60 % of which were localized in the UTRs. Only 252 EST-SSRs were 18 bp or longer from which 31 loci were validated, and 24 amplified discrete fragments with 85 % polymorphism in L. x intermedia and L. angustifolia. The average number of alleles in L. x intermedia and L. angustifolia were 3.42 and 3.71 per marker with average PIC values of 0.47 and 0.52, respectively. These values suggest a moderate to strong level of informativeness for the markers, with some loci producing unique fingerprints. The cross-species transferability rate of the markers ranges 50-100 % across eight species. The utility of these markers was assessed in eight Lavandula species and 15 L. angustifolia and L. x intermedia cultivars, and the dendrogram deduced from their similarity indexes successfully delineated the species into their respective sections and the cultivars into their respective species. These markers have potential for application in fingerprinting, diversity studies and marker-assisted breeding of Lavandula.
Microsatellite analysis in the genome of Acanthaceae: An in silico approach
Kaliswamy, Priyadharsini; Vellingiri, Srividhya; Nathan, Bharathi; Selvaraj, Saravanakumar
2015-01-01
Background: Acanthaceae is one of the advanced and specialized families with conventionally used medicinal plants. Simple sequence repeats (SSRs) play a major role as molecular markers for genome analysis and plant breeding. The microsatellites existing in the complete genome sequences would help to attain a direct role in the genome organization, recombination, gene regulation, quantitative genetic variation, and evolution of genes. Objective: The current study reports the frequency of microsatellites and appropriate markers for the Acanthaceae family genome sequences. Materials and Methods: The whole nucleotide sequences of Acanthaceae species were obtained from National Center for Biotechnology Information database and screened for the presence of SSRs. SSR Locator tool was used to predict the microsatellites and inbuilt Primer3 module was used for primer designing. Results: Totally 110 repeats from 108 sequences of Acanthaceae family plant genomes were identified, and the occurrence of dinucleotide repeats was found to be abundant in the genome sequences. The essential amino acid isoleucine was found rich in all the sequences. We also designed the SSR-based primers/markers for 59 sequences of this family that contains microsatellite repeats in their genome. Conclusion: The identified microsatellites and primers might be useful for breeding and genetic studies of plants that belong to Acanthaceae family in the future. PMID:25709226
Joy, Nisha; Asha, Srinivasan; Mallika, Vijayan; Soniya, Eppurathu Vasudevan
2013-01-01
Next generation sequencing has an advantageon transformational development of species with limited available sequence data as it helps to decode the genome and transcriptome. We carried out the de novo sequencing using illuminaHiSeq™ 2000 to generate the first leaf transcriptome of black pepper (Piper nigrum L.), an important spice variety native to South India and also grown in other tropical regions. Despite the economic and biochemical importance of pepper, a scientifically rigorous study at the molecular level is far from complete due to lack of sufficient sequence information and cytological complexity of its genome. The 55 million raw reads obtained, when assembled using Trinity program generated 2,23,386 contigs and 1,28,157 unigenes. Reports suggest that the repeat-rich genomic regions give rise to small non-coding functional RNAs. MicroRNAs (miRNAs) are the most abundant type of non-coding regulatory RNAs. In spite of the widespread research on miRNAs, little is known about the hair-pin precursors of miRNAs bearing Simple Sequence Repeats (SSRs). We used the array of transcripts generated, for the in silico prediction and detection of ‘43 pre-miRNA candidates bearing different types of SSR motifs’. The analysis identified 3913 different types of SSR motifs with an average of one SSR per 3.04 MB of thetranscriptome. About 0.033% of the transcriptome constituted ‘pre-miRNA candidates bearing SSRs’. The abundance, type and distribution of SSR motifs studied across the hair-pin miRNA precursors, showed a significant bias in the position of SSRs towards the downstream of predicted ‘pre-miRNA candidates’. The catalogue of transcripts identified, together with the demonstration of reliable existence of SSRs in the miRNA precursors, permits future opportunities for understanding the genetic mechanism of black pepper and likely functions of ‘tandem repeats’ in miRNAs. PMID:23469176
Wofford, Austin M.; Finch, Kristen; Bigott, Adam; Willyard, Ann
2014-01-01
• Premise of the study: Recently released Pinus plastome sequences support characterization of 15 plastid simple sequence repeat (cpSSR) loci originally published for P. contorta and P. thunbergii. This allows selection of loci for single-tube PCR multiplexed genotyping in any subsection of the genus. • Methods: Unique placement of primers and primer conservation across the genus were investigated, and a set of six loci were selected for single-tube multiplexing. We compared interspecific variation between cpSSRs and nucleotide sequences of ycf1 and tested intraspecific variation for cpSSRs using 911 samples in the P. ponderosa species complex. • Results: The cpSSR loci contain mononucleotide and complex repeats with additional length variation in flanking regions. They are not located in hypervariable regions, and most primers are conserved across the genus. A single PCR per sample multiplexed for six loci yielded 45 alleles in 911 samples. • Discussion: The protocol allows efficient genotyping of many samples. The cpSSR loci are too variable for Pinus phylogenies but are useful for the study of genetic structure within and among populations. The multiplex method could easily be extended to other plant groups by choosing primers for cpSSR loci in a plastome alignment for the target group. PMID:25202625
Shi, Jiaqin; Huang, Shunmou; Zhan, Jiepeng; Yu, Jingyin; Wang, Xinfa; Hua, Wei; Liu, Shengyi; Liu, Guihua; Wang, Hanzhong
2014-01-01
Although much research has been conducted, the pattern of microsatellite distribution has remained ambiguous, and the development/utilization of microsatellite markers has still been limited/inefficient in Brassica, due to the lack of genome sequences. In view of this, we conducted genome-wide microsatellite characterization and marker development in three recently sequenced Brassica crops: Brassica rapa, Brassica oleracea and Brassica napus. The analysed microsatellite characteristics of these Brassica species were highly similar or almost identical, which suggests that the pattern of microsatellite distribution is likely conservative in Brassica. The genomic distribution of microsatellites was highly non-uniform and positively or negatively correlated with genes or transposable elements, respectively. Of the total of 115 869, 185 662 and 356 522 simple sequence repeat (SSR) markers developed with high frequencies (408.2, 343.8 and 356.2 per Mb or one every 2.45, 2.91 and 2.81 kb, respectively), most represented new SSR markers, the majority had determined physical positions, and a large number were genic or putative single-locus SSR markers. We also constructed a comprehensive database for the newly developed SSR markers, which was integrated with public Brassica SSR markers and annotated genome components. The genome-wide SSR markers developed in this study provide a useful tool to extend the annotated genome resources of sequenced Brassica species to genetic study/breeding in different Brassica species. PMID:24130371
Shi, Jiaqin; Huang, Shunmou; Zhan, Jiepeng; Yu, Jingyin; Wang, Xinfa; Hua, Wei; Liu, Shengyi; Liu, Guihua; Wang, Hanzhong
2014-02-01
Although much research has been conducted, the pattern of microsatellite distribution has remained ambiguous, and the development/utilization of microsatellite markers has still been limited/inefficient in Brassica, due to the lack of genome sequences. In view of this, we conducted genome-wide microsatellite characterization and marker development in three recently sequenced Brassica crops: Brassica rapa, Brassica oleracea and Brassica napus. The analysed microsatellite characteristics of these Brassica species were highly similar or almost identical, which suggests that the pattern of microsatellite distribution is likely conservative in Brassica. The genomic distribution of microsatellites was highly non-uniform and positively or negatively correlated with genes or transposable elements, respectively. Of the total of 115 869, 185 662 and 356 522 simple sequence repeat (SSR) markers developed with high frequencies (408.2, 343.8 and 356.2 per Mb or one every 2.45, 2.91 and 2.81 kb, respectively), most represented new SSR markers, the majority had determined physical positions, and a large number were genic or putative single-locus SSR markers. We also constructed a comprehensive database for the newly developed SSR markers, which was integrated with public Brassica SSR markers and annotated genome components. The genome-wide SSR markers developed in this study provide a useful tool to extend the annotated genome resources of sequenced Brassica species to genetic study/breeding in different Brassica species.
Mokhtar, Morad M; Adawy, Sami S; El-Assal, Salah El-Din S; Hussein, Ebtissam H A
2016-01-01
The present investigation was carried out aiming to use the bioinformatics tools in order to identify and characterize, simple sequence repeats within the third Version of the date palm genome and develop a new SSR primers database. In addition single nucleotide polymorphisms (SNPs) that are located within the SSR flanking regions were recognized. Moreover, the pathways for the sequences assigned by SSR primers, the biological functions and gene interaction were determined. A total of 172,075 SSR motifs was identified on date palm genome sequence with a frequency of 450.97 SSRs per Mb. Out of these, 130,014 SSRs (75.6%) were located within the intergenic regions with a frequency of 499 SSRs per Mb. While, only 42,061 SSRs (24.4%) were located within the genic regions with a frequency of 347.5 SSRs per Mb. A total of 111,403 of SSR primer pairs were designed, that represents 291.9 SSR primers per Mb. Out of the 111,403, only 31,380 SSR primers were in the genic regions, while 80,023 primers were in the intergenic regions. A number of 250,507 SNPs were recognized in 84,172 SSR flanking regions, which represents 75.55% of the total SSR flanking regions. Out of 12,274 genes only 463 genes comprising 896 SSR primers were mapped onto 111 pathways using KEGG data base. The most abundant enzymes were identified in the pathway related to the biosynthesis of antibiotics. We tested 1031 SSR primers using both publicly available date palm genome sequences as templates in the in silico PCR reactions. Concerning in vitro validation, 31 SSR primers among those used in the in silico PCR were synthesized and tested for their ability to detect polymorphism among six Egyptian date palm cultivars. All tested primers have successfully amplified products, but only 18 primers detected polymorphic amplicons among the studied date palm cultivars.
Cömertpay, G; Baloch, F S; Derya, M; Andeden, E E; Alsaleh, A; Sürek, H; Özkan, H
2016-02-19
Effective breeding programs based on genetic diversity are needed to broaden the genetic basis of rice (Oryza sativa L.) in Turkey. In this study, 81 commercial varieties from seven countries were studied in order to estimate the genomic relationships among them using nine inter-primer binding site (iPBS)-retrotransposon and 17 simple-sequence repeat (SSR) markers. A total of 59 alleles for the SSR markers and 96 bands for the iPBS-retrotransposon markers were detected, with an average of 3.47 and 10.6 per locus, respectively. Each of the varieties could be unequivocally identified by the SSR and iPBS-retrotransposon profiles. The iPBS-retrotransposon- and SSR-based clustering were identical and closely mirrored each other, with a significantly high correlation (r = 0.73). A neighbor-joining cluster based on the combined SSR and iPBS-retrotransposon data divided the rice varieties into three clusters. The population structure was determined using the STRUCTURE software, and three populations (K = 3) were identified among the varieties studied, showing that the diversity harbored by Turkish rice varieties is low. The results indicate that iPBS-retrotransposon markers are a very powerful technique to determine the genetic diversity of rice varieties.
USDA-ARS?s Scientific Manuscript database
Genomic resources in carrot and other Apiaceae are relatively underdeveloped. The availability of a large set of pcr-based codominant markers, such as simple sequence repeats (SSR), would allow integration of the different carrot genetic maps constructed to date (mainly using anonymous dominant mark...
USDA-ARS?s Scientific Manuscript database
We have developed 384 simple sequence repeat (SSR) markers for the identification of accessions of Chionanthus retusus and four related species. The bark of C. retusus and C. virginicus is used in the industry of natural product to treat inflammation, fever and other illnesses, and with the use of ...
Lu, Taofeng; Sun, Yujiao; Ma, Qin; Zhu, Minghao; Liu, Dan; Ma, Jianzhang; Ma, Yuehui; Chen, Hongyan; Guan, Weijun
2016-12-01
The Siberian tiger, Panthera tigris altaica, is an endangered species, and much more work is needed to protect this species, which is still vulnerable to extinction. Conservation efforts may be supported by the genetic assessment of wild populations, for which highly specific microsatellite markers are required. However, only a limited amount of genetic sequence data is available for this species. To identify the genes involved in the lung transcriptome and to develop additional simple sequence repeat (SSR) markers for the Siberian tiger, we used high-throughput RNA-Seq to characterize the Siberian tiger transcriptome in lung tissue (designated 'PTA-lung') and a pooled tissue sample (designated 'PTA'). Approximately 47.5 % (33,187/69,836) of the lung transcriptome was annotated in four public databases (Nr, Swiss-Prot, KEGG, and COG). The annotated genes formed a potential pool for gene identification in the tiger. An analysis of the genes differentially expressed in the PTA lung, and PTA samples revealed that the tiger may have suffered a series of diseases before death. In total, 1062 non-redundant SSRs were identified in the Siberian tiger transcriptome. Forty-three primer pairs were randomly selected for amplification reactions, and 26 of the 43 pairs were also used to evaluate the levels of genetic polymorphism. Fourteen primer pairs (32.56 %) amplified products that were polymorphic in size in P. tigris altaica. In conclusion, the transcriptome sequences will provide a valuable genomic resource for genetic research, and these new SSR markers comprise a reasonable number of loci for the genetic analysis of wild and captive populations of P. tigris altaica.
USDA-ARS?s Scientific Manuscript database
The genetic relationships and pedigree inferences among peach (Prunus persica (L.) Batsch) accessions and breeding lines used in genetic improvement were evaluated using 15 simple sequence repeat (SSR) markers. A total of 80 alleles were detected among the 37 peach accessions with an average of 5.53...
Atin, K H; Christianus, A; Fatin, N; Lutas, A C; Shabanimofrad, M; Subha, B
2017-08-17
The Malaysian giant prawn is among the most commonly cultured species of the genus Macrobrachium. Stocks of giant prawns from four rivers in Peninsular Malaysia have been used for aquaculture over the past 25 years, which has led to repeated harvesting, restocking, and transplantation between rivers. Consequently, a stock improvement program is now important to avoid the depletion of wild stocks and the loss of genetic diversity. However, the success of such an improvement program depends on our knowledge of the genetic variation of these base populations. The aim of the current study was to estimate genetic variation and differentiation of these riverine sources using novel expressed sequence tag-microsatellite (EST-SSR) markers, which not only are informative on genetic diversity but also provide information on immune and metabolic traits. Our findings indicated that the tested stocks have inbreeding depression due to a significant deficiency in heterozygotes, and F IS was estimated as 0.15538 to 0.31938. An F-statistics analysis suggested that the stocks are composed of one large panmictic population. Among the four locations, stocks from Johor, in the southern region of the peninsular, showed higher allelic and genetic diversity than the other stocks. To overcome inbreeding problems, the Johor population could be used as a base population in a stock improvement program by crossing to the other populations. The study demonstrated that EST-SSR markers can be incorporated in future marker assisted breeding to aid the proper management of the stocks by breeders and stakeholders in Malaysia.
USDA-ARS?s Scientific Manuscript database
The advent of next-generation sequencing technologies has been a boon to the cost-effective development of molecular markers, particularly in non-model species. Here, we demonstrate the efficiency of microsatellite or simple sequence repeat (SSR) marker development from short-read sequences using th...
Shafiei-Astani, Behnam; Ong, Alan Han Kiat; Valdiani, Alireza; Tan, Soon Guan; Yien, Christina Yong Seok; Ahmady, Fatemeh; Alitheen, Noorjahan Banu; Ng, Wei Lun; Kuar, Taranjeet
2015-10-15
Tomistoma schlegelii, also referred to as the "false gharial", is one of the most exclusive and least known of the world's fresh water crocodilians, limited to Southeast Asia. Indeed, lack of economic value for its skin has led to neglect the biodiversity of the species. The current study aimed to investigate the mentioned case using 40 simple sequence repeat (SSR) primer pairs and 45 inter-simple sequence repeat (ISSR) primers. DNA analysis of 17 T. schlegelii samples using the SSR and ISSR markers resulted in producing a total of 49 and 108 polymorphic bands, respectively. Furthermore, the SSR- and ISSR-based cluster analyses both generated two main clusters. However, the SSR based results were found to be more in line with the geographical distributions of the crocodile samples collected across the country as compared with the ISSR-based results. The observed heterozygosity (HO) and expected heterozygosity (HE) of the polymorphic SSRs ranged between 0.588-1 and 0.470-0.891, respectively. The present results suggest that the Malaysian T. schlegelii populations had originated from a core population of crocodiles. In cooperation with the SSR markers, the ISSRs showed high potential for studying the genetic variation of T. schlegelii, and these markers are suitable to be employed in conservation genetic programs of this endangered species. Both SSR- and ISSR-based STRUCTURE analyses suggested that all the individuals of T. schlegelii are genetically similar with each other. Copyright © 2015 Elsevier B.V. All rights reserved.
Hayat Topcu; Nergiz Coban; Keith Woeste; Mehmet Sutyemez; Salih Kafkas
2015-01-01
We attempted to develop new polymorphic SSR primer pairs in walnut using sequences derived from Juglans nigra L. genomic enriched library with GA repeat. The designed 94 SSR primer pairs were subjected to gradient PCR in 12 walnut cultivars to determine their optimum annealing temperatures and to determine whether they produce bands. Then, the...
The genetic map of finger millet, Eleusine coracana.
Dida, Mathews M; Srinivasachary; Ramakrishnan, Sujatha; Bennetzen, Jeffrey L; Gale, Mike D; Devos, Katrien M
2007-01-01
Restriction fragment length polymorphism (RFLP), amplified fragment length polymorphism (AFLP), expressed-sequenced tag (EST), and simple sequence repeat (SSR) markers were used to generate a genetic map of the tetraploid finger millet (Eleusine coracana subsp. coracana) genome (2n = 4x = 36). Because levels of variation in finger millet are low, the map was generated in an inter-subspecific F(2) population from a cross between E. coracana subsp. coracana cv. Okhale-1 and its wild progenitor E. coracana subsp. africana acc. MD-20. Duplicated loci were used to identify homoeologous groups. Assignment of linkage groups to the A and B genome was done by comparing the hybridization patterns of probes in Okhale-1, MD-20, and Eleusine indica acc. MD-36. E. indica is the A genome donor to E. coracana. The maps span 721 cM on the A genome and 787 cM on the B genome and cover all 18 finger millet chromosomes, at least partially. To facilitate the use of marker-assisted selection in finger millet, a first set of 82 SSR markers was developed. The SSRs were identified in small-insert genomic libraries generated using methylation-sensitive restriction enzymes. Thirty-one of the SSRs were mapped. Application of the maps and markers in hybridization-based breeding programs will expedite the improvement of finger millet.
Gong, Wenping; Li, Guangrong; Zhou, Jianping; Li, Genying; Liu, Cheng; Huang, Chengyan; Zhao, Zhendong; Yang, Zujun
2014-09-01
Aegilops uniaristata has many agronomically useful traits that can be used for wheat breeding. So far, a Triticum turgidum - Ae. uniaristata amphiploid and one set of Chinese Spring (CS) - Ae. uniaristata addition lines have been produced. To guide Ae. uniaristata chromatin transformation from these lines into cultivated wheat through chromosome engineering, reliable cytogenetic and molecular markers specific for Ae. uniaristata chromosomes need to be developed. Standard C-banding shows that C-bands mainly exist in the centromeric regions of Ae. uniaristata but rarely at the distal ends. Fluorescence in situ hybridization (FISH) using (GAA)8 as a probe showed that the hybridization signal of chromosomes 1N-7N are different, thus (GAA)8 can be used to identify all Ae. uniaristata chromosomes in wheat background simultaneously. Moreover, a total of 42 molecular markers specific for Ae. uniaristata chromosomes were developed by screening expressed sequence tag - sequence tagged site (EST-STS), expressed sequence tag - simple sequence repeat (EST-SSR), and PCR-based landmark unique gene (PLUG) primers. The markers were subsequently localized using the CS - Ae. uniaristata addition lines and different wheat cultivars as controls. The cytogenetic and molecular markers developed herein will be helpful for screening and identifying wheat - Ae. uniaristata progeny.
Singh, A K; Rai, V P; Chand, R; Singh, R P; Singh, M N
2013-01-01
Genetic diversity and identification of simple sequence repeat markers correlated with Fusarium wilt resistance was performed in a set of 36 elite cultivated pigeonpea genotypes differing in levels of resistance to Fusarium wilt. Twenty-four polymorphic sequence repeat markers were screened across these genotypes, and amplified a total of 59 alleles with an average high polymorphic information content value of 0.52. Cluster analysis, done by UPGMA and PCA, grouped the 36 pigeonpea genotypes into two main clusters according to their Fusarium wilt reaction. Based on the Kruskal-Wallis ANOVA and simple regression analysis, six simple sequence repeat markers were found to be significantly associated with Fusarium wilt resistance. The phenotypic variation explained by these markers ranged from 23.7 to 56.4%. The present study helps in finding out feasibility of prescreened SSR markers to be used in genetic diversity analysis and their potential association with disease resistance.
Generation and analysis of expressed sequence tags in the extreme large genomes Lilium and Tulipa
2012-01-01
Background Bulbous flowers such as lily and tulip (Liliaceae family) are monocot perennial herbs that are economically very important ornamental plants worldwide. However, there are hardly any genetic studies performed and genomic resources are lacking. To build genomic resources and develop tools to speed up the breeding in both crops, next generation sequencing was implemented. We sequenced and assembled transcriptomes of four lily and five tulip genotypes using 454 pyro-sequencing technology. Results Successfully, we developed the first set of 81,791 contigs with an average length of 514 bp for tulip, and enriched the very limited number of 3,329 available ESTs (Expressed Sequence Tags) for lily with 52,172 contigs with an average length of 555 bp. The contigs together with singletons covered on average 37% of lily and 39% of tulip estimated transcriptome. Mining lily and tulip sequence data for SSRs (Simple Sequence Repeats) showed that di-nucleotide repeats were twice more abundant in UTRs (UnTranslated Regions) compared to coding regions, while tri-nucleotide repeats were equally spread over coding and UTR regions. Two sets of single nucleotide polymorphism (SNP) markers suitable for high throughput genotyping were developed. In the first set, no SNPs flanking the target SNP (50 bp on either side) were allowed. In the second set, one SNP in the flanking regions was allowed, which resulted in a 2 to 3 fold increase in SNP marker numbers compared with the first set. Orthologous groups between the two flower bulbs: lily and tulip (12,017 groups) and among the three monocot species: lily, tulip, and rice (6,900 groups) were determined using OrthoMCL. Orthologous groups were screened for common SNP markers and EST-SSRs to study synteny between lily and tulip, which resulted in 113 common SNP markers and 292 common EST-SSR. Lily and tulip contigs generated were annotated and described according to Gene Ontology terminology. Conclusions Two transcriptome sets were built that are valuable resources for marker development, comparative genomic studies and candidate gene approaches. Next generation sequencing of leaf transcriptome is very effective; however, deeper sequencing and using more tissues and stages is advisable for extended comparative studies. PMID:23167289
Siew, Ging Yang; Ng, Wei Lun; Tan, Sheau Wei; Alitheen, Noorjahan Banu; Tan, Soon Guan; Yeap, Swee Keong
2018-01-01
Durian ( Durio zibethinus ) is one of the most popular tropical fruits in Asia. To date, 126 durian types have been registered with the Department of Agriculture in Malaysia based on phenotypic characteristics. Classification based on morphology is convenient, easy, and fast but it suffers from phenotypic plasticity as a direct result of environmental factors and age. To overcome the limitation of morphological classification, there is a need to carry out genetic characterization of the various durian types. Such data is important for the evaluation and management of durian genetic resources in producing countries. In this study, simple sequence repeat (SSR) markers were used to study the genetic variation in 27 durian types from the germplasm collection of Universiti Putra Malaysia. Based on DNA sequences deposited in Genbank, seven pairs of primers were successfully designed to amplify SSR regions in the durian DNA samples. High levels of variation among the 27 durian types were observed (expected heterozygosity, H E = 0.35). The DNA fingerprinting power of SSR markers revealed by the combined probability of identity (PI) of all loci was 2.3×10 -3 . Unique DNA fingerprints were generated for 21 out of 27 durian types using five polymorphic SSR markers (the other two SSR markers were monomorphic). We further tested the utility of these markers by evaluating the clonal status of shared durian types from different germplasm collection sites, and found that some were not clones. The findings in this preliminary study not only shows the feasibility of using SSR markers for DNA fingerprinting of durian types, but also challenges the current classification of durian types, e.g., on whether the different types should be called "clones", "varieties", or "cultivars". Such matters have a direct impact on the regulation and management of durian genetic resources in the region.
Siew, Ging Yang; Tan, Sheau Wei; Tan, Soon Guan; Yeap, Swee Keong
2018-01-01
Durian (Durio zibethinus) is one of the most popular tropical fruits in Asia. To date, 126 durian types have been registered with the Department of Agriculture in Malaysia based on phenotypic characteristics. Classification based on morphology is convenient, easy, and fast but it suffers from phenotypic plasticity as a direct result of environmental factors and age. To overcome the limitation of morphological classification, there is a need to carry out genetic characterization of the various durian types. Such data is important for the evaluation and management of durian genetic resources in producing countries. In this study, simple sequence repeat (SSR) markers were used to study the genetic variation in 27 durian types from the germplasm collection of Universiti Putra Malaysia. Based on DNA sequences deposited in Genbank, seven pairs of primers were successfully designed to amplify SSR regions in the durian DNA samples. High levels of variation among the 27 durian types were observed (expected heterozygosity, HE = 0.35). The DNA fingerprinting power of SSR markers revealed by the combined probability of identity (PI) of all loci was 2.3×10−3. Unique DNA fingerprints were generated for 21 out of 27 durian types using five polymorphic SSR markers (the other two SSR markers were monomorphic). We further tested the utility of these markers by evaluating the clonal status of shared durian types from different germplasm collection sites, and found that some were not clones. The findings in this preliminary study not only shows the feasibility of using SSR markers for DNA fingerprinting of durian types, but also challenges the current classification of durian types, e.g., on whether the different types should be called “clones”, “varieties”, or “cultivars”. Such matters have a direct impact on the regulation and management of durian genetic resources in the region. PMID:29511604
Miller, Mark P.; Knaus, Brian J.; Mullins, Thomas D.; Haig, Susan M.
2013-01-01
SSR_pipeline is a flexible set of programs designed to efficiently identify simple sequence repeats (e.g., microsatellites) from paired-end high-throughput Illumina DNA sequencing data. The program suite contains 3 analysis modules along with a fourth control module that can automate analyses of large volumes of data. The modules are used to 1) identify the subset of paired-end sequences that pass Illumina quality standards, 2) align paired-end reads into a single composite DNA sequence, and 3) identify sequences that possess microsatellites (both simple and compound) conforming to user-specified parameters. The microsatellite search algorithm is extremely efficient, and we have used it to identify repeats with motifs from 2 to 25bp in length. Each of the 3 analysis modules can also be used independently to provide greater flexibility or to work with FASTQ or FASTA files generated from other sequencing platforms (Roche 454, Ion Torrent, etc.). We demonstrate use of the program with data from the brine fly Ephydra packardi (Diptera: Ephydridae) and provide empirical timing benchmarks to illustrate program performance on a common desktop computer environment. We further show that the Illumina platform is capable of identifying large numbers of microsatellites, even when using unenriched sample libraries and a very small percentage of the sequencing capacity from a single DNA sequencing run. All modules from SSR_pipeline are implemented in the Python programming language and can therefore be used from nearly any computer operating system (Linux, Macintosh, and Windows).
Miller, Mark P; Knaus, Brian J; Mullins, Thomas D; Haig, Susan M
2013-01-01
SSR_pipeline is a flexible set of programs designed to efficiently identify simple sequence repeats (e.g., microsatellites) from paired-end high-throughput Illumina DNA sequencing data. The program suite contains 3 analysis modules along with a fourth control module that can automate analyses of large volumes of data. The modules are used to 1) identify the subset of paired-end sequences that pass Illumina quality standards, 2) align paired-end reads into a single composite DNA sequence, and 3) identify sequences that possess microsatellites (both simple and compound) conforming to user-specified parameters. The microsatellite search algorithm is extremely efficient, and we have used it to identify repeats with motifs from 2 to 25 bp in length. Each of the 3 analysis modules can also be used independently to provide greater flexibility or to work with FASTQ or FASTA files generated from other sequencing platforms (Roche 454, Ion Torrent, etc.). We demonstrate use of the program with data from the brine fly Ephydra packardi (Diptera: Ephydridae) and provide empirical timing benchmarks to illustrate program performance on a common desktop computer environment. We further show that the Illumina platform is capable of identifying large numbers of microsatellites, even when using unenriched sample libraries and a very small percentage of the sequencing capacity from a single DNA sequencing run. All modules from SSR_pipeline are implemented in the Python programming language and can therefore be used from nearly any computer operating system (Linux, Macintosh, and Windows).
Shu, Jinshuai; Liu, Yumei; Li, Zhansheng; Zhang, Lili; Fang, Zhiyuan; Yang, Limei; Zhuang, Mu; Zhang, Yangyong; Lv, Honghao
2015-01-01
We previously discovered carpelloid stamens when breeding cytoplasmic male sterile lines in broccoli (Brassica oleracea var. italica). In this study, hybrids and multiple backcrosses were produced from different cytoplasmic male sterile carpelloid stamen sources and maintainer lines. Carpelloid stamens caused dysplasia of the flower structure and led to hooked or coiled siliques with poor seed setting, which were inherited in a maternal fashion. Using four distinct carpelloid stamens and twelve distinct normal stamens from cytoplasmic male sterile sources and one maintainer, we used 21 mitochondrial simple sequence repeat (mtSSR) primers and 32 chloroplast SSR primers to identify a mitochondrial marker, mtSSR2, that can differentiate between the cytoplasm of carpelloid and normal stamens. Thereafter, mtSSR2 was used to identify another 34 broccoli accessions, with an accuracy rate of 100%. Analysis of the polymorphic sequences revealed that the mtSSR2 open reading frame of carpelloid stamen sterile sources had a deletion of 51 bases (encoding 18 amino acids) compared with normal stamen materials. The open reading frame is located in the coding region of orf125 and orf108 of the mitochondrial genomes in Brassica crops and had the highest similarity with Raphanus sativus and Brassica carinata. The current study has not only identified a useful molecular marker to detect the cytoplasm of carpelloid stamens during broccoli breeding, but it also provides evidence that the mitochondrial genome is maternally inherited and provides a basis for studying the effect of the cytoplasm on flower organ development in plants. PMID:26407159
Development and characterization of EST-SSR markers for Begonia luzhaiensis (Begoniaceae)1
Tseng, Yu-Hsin; Huang, Han-Yau; Xu, Wei-Bin; Yang, Hsun-An; Liu, Yan; Peng, Ching-I; Chung, Kuo-Fang
2017-01-01
Premise of the study: Microsatellite primers were developed for Begonia luzhaiensis (Begoniaceae) to assess genetic diversity and population genetic structure. Methods and Results: Based on the transcriptome data of B. luzhaiensis, 60 primer pairs were selected for initial validation, of which 16 yielded polymorphic microsatellite loci in 57 individuals. The number of alleles observed for these 16 loci ranged from one to nine. The observed and expected heterozygosity ranged from 0.000 to 1.000 and from 0.000 to 0.804 with averages of 0.370 and 0.404, respectively. Five loci could be successfully amplified in B. leprosa. Conclusions: The expressed sequence tag–simple sequence repeat markers are the first specifically developed for B. luzhaiensis and the first developed in Begonia sect. Coelocentrum. These markers will be useful for future studies of the genetic structure and phylogeography of B. luzhaiensis. PMID:28529834
Rauscher, Gilda; Simko, Ivan
2013-01-22
Lettuce (Lactuca sativa L.) is the major crop from the group of leafy vegetables. Several types of molecular markers were developed that are effectively used in lettuce breeding and genetic studies. However only a very limited number of microsattelite-based markers are publicly available. We have employed the method of enriched microsatellite libraries to develop 97 genomic SSR markers. Testing of newly developed markers on a set of 36 Lactuca accession (33 L. sativa, and one of each L. serriola L., L. saligna L., and L. virosa L.) revealed that both the genetic heterozygosity (UHe = 0.56) and the number of loci per SSR (Na = 5.50) are significantly higher for genomic SSR markers than for previously developed EST-based SSR markers (UHe = 0.32, Na = 3.56). Fifty-four genomic SSR markers were placed on the molecular linkage map of lettuce. Distribution of markers in the genome appeared to be random, with the exception of possible cluster on linkage group 6. Any combination of 32 genomic SSRs was able to distinguish genotypes of all 36 accessions. Fourteen of newly developed SSR markers originate from fragments with high sequence similarity to resistance gene candidates (RGCs) and RGC pseudogenes. Analysis of molecular variance (AMOVA) of L. sativa accessions showed that approximately 3% of genetic diversity was within accessions, 79% among accessions, and 18% among horticultural types. The newly developed genomic SSR markers were added to the pool of previously developed EST-SSRs markers. These two types of SSR-based markers provide useful tools for lettuce cultivar fingerprinting, development of integrated molecular linkage maps, and mapping of genes.
2013-01-01
Background Lettuce (Lactuca sativa L.) is the major crop from the group of leafy vegetables. Several types of molecular markers were developed that are effectively used in lettuce breeding and genetic studies. However only a very limited number of microsattelite-based markers are publicly available. We have employed the method of enriched microsatellite libraries to develop 97 genomic SSR markers. Results Testing of newly developed markers on a set of 36 Lactuca accession (33 L. sativa, and one of each L. serriola L., L. saligna L., and L. virosa L.) revealed that both the genetic heterozygosity (UHe = 0.56) and the number of loci per SSR (Na = 5.50) are significantly higher for genomic SSR markers than for previously developed EST-based SSR markers (UHe = 0.32, Na = 3.56). Fifty-four genomic SSR markers were placed on the molecular linkage map of lettuce. Distribution of markers in the genome appeared to be random, with the exception of possible cluster on linkage group 6. Any combination of 32 genomic SSRs was able to distinguish genotypes of all 36 accessions. Fourteen of newly developed SSR markers originate from fragments with high sequence similarity to resistance gene candidates (RGCs) and RGC pseudogenes. Analysis of molecular variance (AMOVA) of L. sativa accessions showed that approximately 3% of genetic diversity was within accessions, 79% among accessions, and 18% among horticultural types. Conclusions The newly developed genomic SSR markers were added to the pool of previously developed EST-SSRs markers. These two types of SSR-based markers provide useful tools for lettuce cultivar fingerprinting, development of integrated molecular linkage maps, and mapping of genes. PMID:23339733
A genetic linkage map for hazelnut (Corylus avellana L.) based on RAPD and SSR markerswac
Shawn A. Mehlenbacher; Rebecca N. Brown; Eduardo R. Nouhra; Tufan Gokirmak; Nahla V. Bassil; Thomas L. Kubisiak
2006-01-01
A linkage map for European hazelnut (Corylus avellana L.) was constructed using random amplified polymorphic DNA (RAPD) and simple sequence repeat (SSR) markers and the 2-way pseudotestcross approach. A full-sib population of 144 seedlings from the cross OSU 252.146 x OSU 414.062 was used. RAPD markers in testcross configuration,segregating 1:I, were...
Yan, Xiuqin; Zhang, Xue; Lu, Min; He, Yong; An, Huaming
2015-04-25
Rosa roxburghii Tratt. is a well-known ornamental rose species native to China. In addition, the fruits of this species are valued for their nutritional and medicinal characteristics, especially their high ascorbic acid (AsA) levels. Nevertheless, AsA biosynthesis in R. roxburghii fruit has not been explored in detail because of a lack of genomic resources for this species. High-throughput transcriptomic sequencing generating large volumes of transcript sequence data can aid in gene discovery and molecular marker development. In this study, we generated more than 53 million clean reads using Illumina paired-end sequencing technology. De novo assembly yielded 106,590 unigenes, with an average length of 343 bp. On the basis of sequence similarity to known proteins, 9301 and 2393 unigenes were classified into Gene Ontology and Clusters of Orthologous Group categories, respectively. There were 7480 unigenes assigned to 124 pathways in the Kyoto Encyclopedia of Gene and Genome pathway database. BLASTx searches identified 498 unique putative transcripts encoding various transcription factors, some known to regulate fruit development. qRT-PCR validated the expressions of most of the genes encoding the main enzymes involved in ascorbate biosynthesis. In addition, 9131 potential simple sequence repeat (SSR) loci were identified among the unigenes. One hundred and two primer pairs were synthesized and 71 pairs produced an amplification product during initial screening. Among the amplified products, 30 were polymorphic in the 16 R. roxburghii germplasms tested. Our study was the first to produce a large volume of transcriptome data from R. roxburghii. The resulting sequence collection is a valuable resource for gene discovery and marker-assisted selective breeding in this rose species. Copyright © 2015 Elsevier B.V. All rights reserved.
Rajaram, Vengaldas; Nepolean, Thirunavukkarasu; Senthilvel, Senapathy; Varshney, Rajeev K; Vadez, Vincent; Srivastava, Rakesh K; Shah, Trushar M; Supriya, Ambawat; Kumar, Sushil; Ramana Kumari, Basava; Bhanuprakash, Amindala; Narasu, Mangamoori Lakshmi; Riera-Lizarazu, Oscar; Hash, Charles Thomas
2013-03-09
Pearl millet [Pennisetum glaucum (L.) R. Br.] is a widely cultivated drought- and high-temperature tolerant C4 cereal grown under dryland, rainfed and irrigated conditions in drought-prone regions of the tropics and sub-tropics of Africa, South Asia and the Americas. It is considered an orphan crop with relatively few genomic and genetic resources. This study was undertaken to increase the EST-based microsatellite marker and genetic resources for this crop to facilitate marker-assisted breeding. Newly developed EST-SSR markers (99), along with previously mapped EST-SSR (17), genomic SSR (53) and STS (2) markers, were used to construct linkage maps of four F7 recombinant inbred populations (RIP) based on crosses ICMB 841-P3 × 863B-P2 (RIP A), H 77/833-2 × PRLT 2/89-33 (RIP B), 81B-P6 × ICMP 451-P8 (RIP C) and PT 732B-P2 × P1449-2-P1 (RIP D). Mapped loci numbers were greatest for RIP A (104), followed by RIP B (78), RIP C (64) and RIP D (59). Total map lengths (Haldane) were 615 cM, 690 cM, 428 cM and 276 cM, respectively. A total of 176 loci detected by 171 primer pairs were mapped among the four crosses. A consensus map of 174 loci (899 cM) detected by 169 primer pairs was constructed using MergeMap to integrate the individual linkage maps. Locus order in the consensus map was well conserved for nearly all linkage groups. Eighty-nine EST-SSR marker loci from this consensus map had significant BLAST hits (top hits with e-value ≤ 1E-10) on the genome sequences of rice, foxtail millet, sorghum, maize and Brachypodium with 35, 88, 58, 48 and 38 loci, respectively. The consensus map developed in the present study contains the largest set of mapped SSRs reported to date for pearl millet, and represents a major consolidation of existing pearl millet genetic mapping information. This study increased numbers of mapped pearl millet SSR markers by >50%, filling important gaps in previously published SSR-based linkage maps for this species and will greatly facilitate SSR-based QTL mapping and applied marker-assisted selection programs.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Labbe, Jessy L; Murat, Claude; Morin, Emmanuelle
It is becoming clear that simple sequence repeats (SSRs) play a significant role in fungal genome organization, and they are a large source of genetic markers for population genetics and meiotic maps. We identified SSRs in the Laccaria bicolor genome by in silico survey and analyzed their distribution in the different genomic regions. We also compared the abundance and distribution of SSRs in L. bicolor with those of the following fungal genomes: Phanerochaete chrysosporium, Coprinopsis cinerea, Ustilago maydis, Cryptococcus neoformans, Aspergillus nidulans, Magnaporthe grisea, Neurospora crassa and Saccharomyces cerevisiae. Using the MISA computer program, we detected 277,062 SSRs in themore » L. bicolor genome representing 8% of the assembled genomic sequence. Among the analyzed basidiomycetes, L. bicolor exhibited the highest SSR density although no correlation between relative abundance and the genome sizes was observed. In most genomes the short motifs (mono- to trinucleotides) were more abundant than the longer repeated SSRs. Generally, in each organism, the occurrence, relative abundance, and relative density of SSRs decreased as the repeat unit increased. Furthermore, each organism had its own common and longest SSRs. In the L. bicolor genome, most of the SSRs were located in intergenic regions (73.3%) and the highest SSR density was observed in transposable elements (TEs; 6,706 SSRs/Mb). However, 81% of the protein-coding genes contained SSRs in their exons, suggesting that SSR polymorphism may alter gene phenotypes. Within a L. bicolor offspring, sequence polymorphism of 78 SSRs was mainly detected in non-TE intergenic regions. Unlike previously developed microsatellite markers, these new ones are spread throughout the genome; these markers could have immediate applications in population genetics.« less
2012-01-01
Background In rubber tree, bark is one of important agricultural and biological organs. However, the molecular mechanism involved in the bark formation and development in rubber tree remains largely unknown, which is at least partially due to lack of bark transcriptomic and genomic information. Therefore, it is necessary to carried out high-throughput transcriptome sequencing of rubber tree bark to generate enormous transcript sequences for the functional characterization and molecular marker development. Results In this study, more than 30 million sequencing reads were generated using Illumina paired-end sequencing technology. In total, 22,756 unigenes with an average length of 485 bp were obtained with de novo assembly. The similarity search indicated that 16,520 and 12,558 unigenes showed significant similarities to known proteins from NCBI non-redundant and Swissprot protein databases, respectively. Among these annotated unigenes, 6,867 and 5,559 unigenes were separately assigned to Gene Ontology (GO) and Clusters of Orthologous Group (COG). When 22,756 unigenes searched against the Kyoto Encyclopedia of Genes and Genomes Pathway (KEGG) database, 12,097 unigenes were assigned to 5 main categories including 123 KEGG pathways. Among the main KEGG categories, metabolism was the biggest category (9,043, 74.75%), suggesting the active metabolic processes in rubber tree bark. In addition, a total of 39,257 EST-SSRs were identified from 22,756 unigenes, and the characterizations of EST-SSRs were further analyzed in rubber tree. 110 potential marker sites were randomly selected to validate the assembly quality and develop EST-SSR markers. Among 13 Hevea germplasms, PCR success rate and polymorphism rate of 110 markers were separately 96.36% and 55.45% in this study. Conclusion By assembling and analyzing de novo transcriptome sequencing data, we reported the comprehensive functional characterization of rubber tree bark. This research generated a substantial fraction of rubber tree transcriptome sequences, which were very useful resources for gene annotation and discovery, molecular markers development, genome assembly and annotation, and microarrays development in rubber tree. The EST-SSR markers identified and developed in this study will facilitate marker-assisted selection breeding in rubber tree. Moreover, this study also supported that transcriptome analysis based on Illumina paired-end sequencing is a powerful tool for transcriptome characterization and molecular marker development in non-model species, especially those with large and complex genomes. PMID:22607098
CMD: a Cotton Microsatellite Database resource for Gossypium genomics
Blenda, Anna; Scheffler, Jodi; Scheffler, Brian; Palmer, Michael; Lacape, Jean-Marc; Yu, John Z; Jesudurai, Christopher; Jung, Sook; Muthukumar, Sriram; Yellambalase, Preetham; Ficklin, Stephen; Staton, Margaret; Eshelman, Robert; Ulloa, Mauricio; Saha, Sukumar; Burr, Ben; Liu, Shaolin; Zhang, Tianzhen; Fang, Deqiu; Pepper, Alan; Kumpatla, Siva; Jacobs, John; Tomkins, Jeff; Cantrell, Roy; Main, Dorrie
2006-01-01
Background The Cotton Microsatellite Database (CMD) is a curated and integrated web-based relational database providing centralized access to publicly available cotton microsatellites, an invaluable resource for basic and applied research in cotton breeding. Description At present CMD contains publication, sequence, primer, mapping and homology data for nine major cotton microsatellite projects, collectively representing 5,484 microsatellites. In addition, CMD displays data for three of the microsatellite projects that have been screened against a panel of core germplasm. The standardized panel consists of 12 diverse genotypes including genetic standards, mapping parents, BAC donors, subgenome representatives, unique breeding lines, exotic introgression sources, and contemporary Upland cottons with significant acreage. A suite of online microsatellite data mining tools are accessible at CMD. These include an SSR server which identifies microsatellites, primers, open reading frames, and GC-content of uploaded sequences; BLAST and FASTA servers providing sequence similarity searches against the existing cotton SSR sequences and primers, a CAP3 server to assemble EST sequences into longer transcripts prior to mining for SSRs, and CMap, a viewer for comparing cotton SSR maps. Conclusion The collection of publicly available cotton SSR markers in a centralized, readily accessible and curated web-enabled database provides a more efficient utilization of microsatellite resources and will help accelerate basic and applied research in molecular breeding and genetic mapping in Gossypium spp. PMID:16737546
Li, Yunfeng; Zhou, Zunchun; Tian, Meilin; Tian, Yi; Dong, Ying; Li, Shilei; Liu, Weidong; He, Chongbo
2017-08-01
In this study, single nucleotide polymorphism (SNP), microsatellite (SSR) and differentially expressed genes (DEGs) in the oral parts, gonads, and umbrella parts of the jellyfish Rhopilema esculentum were analyzed by RNA-Seq technology. A total of 76.4 million raw reads and 72.1 million clean reads were generated from deep sequencing. Approximately 119,874 tentative unigenes and 149,239 transcripts were obtained. A total of 1,034,708 SNP markers were detected in the three tissues. For microsatellite mining, 5088 SSRs were identified from the unigene sequences. The most frequent repeat motifs were mononucleotide repeats, which accounted for 61.93%. Transcriptome comparison of the three tissues yielded a total of 8841 DEGs, of which 3560 were up-regulated and 5281 were down-regulated. This study represents the greatest sequencing effort carried out for a jellyfish and provides the first high-throughput transcriptomic resource for jellyfish. Copyright © 2017 Elsevier B.V. All rights reserved.
Fatty Acid Profile and Unigene-Derived Simple Sequence Repeat Markers in Tung Tree (Vernicia fordii)
Zhang, Lin; Jia, Baoguang; Tan, Xiaofeng; Thammina, Chandra S.; Long, Hongxu; Liu, Min; Wen, Shanna; Song, Xianliang; Cao, Heping
2014-01-01
Tung tree (Vernicia fordii) provides the sole source of tung oil widely used in industry. Lack of fatty acid composition and molecular markers hinders biochemical, genetic and breeding research. The objectives of this study were to determine fatty acid profiles and develop unigene-derived simple sequence repeat (SSR) markers in tung tree. Fatty acid profiles of 41 accessions showed that the ratio of α-eleostearic acid was increasing continuously with a parallel trend to the amount of tung oil accumulation while the ratios of other fatty acids were decreasing in different stages of the seeds and that α-eleostearic acid (18∶3) consisted of 77% of the total fatty acids in tung oil. Transcriptome sequencing identified 81,805 unigenes from tung cDNA library constructed using seed mRNA and discovered 6,366 SSRs in 5,404 unigenes. The di- and tri-nucleotide microsatellites accounted for 92% of the SSRs with AG/CT and AAG/CTT being the most abundant SSR motifs. Fifteen polymorphic genic-SSR markers were developed from 98 unigene loci tested in 41 cultivated tung accessions by agarose gel and capillary electrophoresis. Genbank database search identified 10 of them putatively coding for functional proteins. Quantitative PCR demonstrated that all 15 polymorphic SSR-associated unigenes were expressed in tung seeds and some of them were highly correlated with oil composition in the seeds. Dendrogram revealed that most of the 41 accessions were clustered according to the geographic region. These new polymorphic genic-SSR markers will facilitate future studies on genetic diversity, molecular fingerprinting, comparative genomics and genetic mapping in tung tree. The lipid profiles in the seeds of 41 tung accessions will be valuable for biochemical and breeding studies. PMID:25167054
Multiplexed microsatellite recovery using massively parallel sequencing
T.N. Jennings; B.J. Knaus; T.D. Mullins; S.M. Haig; R.C. Cronn
2011-01-01
Conservation and management of natural populations requires accurate and inexpensive genotyping methods. Traditional microsatellite, or simple sequence repeat (SSR), marker analysis remains a popular genotyping method because of the comparatively low cost of marker development, ease of analysis and high power of genotype discrimination. With the availability of...
Gender Identification in Date Palm Using Molecular Markers.
Awan, Faisal Saeed; Maryam; Jaskani, Muhammad J; Sadia, Bushra
2017-01-01
Breeding of date palm is complicated because of its long life cycle and heterozygous nature. Sexual propagation of date palm does not produce true-to-type plants. Sex of date palms cannot be identified until the first flowering stage. Molecular markers such as random amplified polymorphic DNA (RAPD), sequence-characterized amplified regions (SCAR), and simple sequence repeats (SSR) have successfully been used to identify the sex-linked loci in the plant genome and to isolate the corresponding genes. This chapter highlights the use of three molecular markers including RAPD, SCAR, and SSR to identify the gender of date palm seedlings.
Development of New Candidate Gene and EST-Based Molecular Markers for Gossypium Species
Buyyarapu, Ramesh; Kantety, Ramesh V.; Yu, John Z.; Saha, Sukumar; Sharma, Govind C.
2011-01-01
New source of molecular markers accelerate the efforts in improving cotton fiber traits and aid in developing high-density integrated genetic maps. We developed new markers based on candidate genes and G. arboreum EST sequences that were used for polymorphism detection followed by genetic and physical mapping. Nineteen gene-based markers were surveyed for polymorphism detection in 26 Gossypium species. Cluster analysis generated a phylogenetic tree with four major sub-clusters for 23 species while three species branched out individually. CAP method enhanced the rate of polymorphism of candidate gene-based markers between G. hirsutum and G. barbadense. Two hundred A-genome based SSR markers were designed after datamining of G. arboreum EST sequences (Mississippi Gossypium arboreum EST-SSR: MGAES). Over 70% of MGAES markers successfully produced amplicons while 65 of them demonstrated polymorphism between the parents of G. hirsutum and G. barbadense RIL population and formed 14 linkage groups. Chromosomal localization of both candidate gene-based and MGAES markers was assisted by euploid and hypoaneuploid CS-B analysis. Gene-based and MGAES markers were highly informative as they were designed from candidate genes and fiber transcriptome with a potential to be integrated into the existing cotton genetic and physical maps. PMID:22315588
Zhao, Chuanzhi; Qiu, Jingjing; Agarwal, Gaurav; Wang, Jiangshan; Ren, Xuezhen; Xia, Han; Guo, Baozhu; Ma, Changle; Wan, Shubo; Bertioli, David J.; Varshney, Rajeev K.; Pandey, Manish K.; Wang, Xingjun
2017-01-01
Despite several efforts in the last decade toward development of simple sequence repeat (SSR) markers in peanut, there is still a need for more markers for conducting different genetic and breeding studies. With the effort of the International Peanut Genome Initiative, the availability of reference genome for both the diploid progenitors of cultivated peanut allowed us to identify 135,529 and 199,957 SSRs from the A (Arachis duranensis) and B genomes (Arachis ipaensis), respectively. Genome sequence analysis showed uneven distribution of the SSR motifs across genomes with variation in parameters such as SSR type, repeat number, and SSR length. Using the flanking sequences of identified SSRs, primers were designed for 51,354 and 60,893 SSRs with densities of 49 and 45 SSRs per Mb in A. duranensis and A. ipaensis, respectively. In silico PCR analysis of these SSR markers showed high transferability between wild and cultivated Arachis species. Two physical maps were developed for the A genome and the B genome using these SSR markers, and two reported disease resistance quantitative trait loci (QTLs), qF2TSWV5 for tomato spotted wilt virus (TSWV) and qF2LS6 for leaf spot (LS), were mapped in the 8.135 Mb region of chromosome A04 of A. duranensis. From this genomic region, 719 novel SSR markers were developed, which provide the possibility for fine mapping of these QTLs. In addition, this region also harbors 652 genes and 49 of these are defense related genes, including two NB-ARC genes, three LRR receptor-like genes and three WRKY transcription factors. These disease resistance related genes could contribute to resistance to viral (such as TSWV) and fungal (such as LS) diseases in peanut. In summary, this study not only provides a large number of molecular markers for potential use in peanut genetic map development and QTL mapping but also for map-based gene cloning and molecular breeding. PMID:28769940
Du, Qingzhang; Gong, Chenrui; Pan, Wei; Zhang, Deqiang
2013-02-01
Gene-derived simple sequence repeats (genic SSRs), also known as functional markers, are often preferred over random genomic markers because they represent variation in gene coding and/or regulatory regions. We characterized 544 genic SSR loci derived from 138 candidate genes involved in wood formation, distributed throughout the genome of Populus tomentosa, a key ecological and cultivated wood production species. Of these SSRs, three-quarters were located in the promoter or intron regions, and dinucleotide (59.7%) and trinucleotide repeat motifs (26.5%) predominated. By screening 15 wild P. tomentosa ecotypes, we identified 188 polymorphic genic SSRs with 861 alleles, 2-7 alleles for each marker. Transferability analysis of 30 random genic SSRs, testing whether these SSRs work in 26 genotypes of five genus Populus sections (outgroup, Salix matsudana), showed that 72% of the SSRs could be amplified in Turanga and 100% could be amplified in Leuce. Based on genotyping of these 26 genotypes, a neighbour-joining analysis showed the expected six phylogenetic groupings. In silico analysis of SSR variation in 220 sequences that are homologous between P. tomentosa and Populus trichocarpa suggested that genic SSR variations between relatives were predominantly affected by repeat motif variations or flanking sequence mutations. Inheritance tests and single-marker associations demonstrated the power of genic SSRs in family-based linkage mapping and candidate gene-based association studies, as well as marker-assisted selection and comparative genomic studies of P. tomentosa and related species.
Short-Sequence DNA Repeats in Prokaryotic Genomes
van Belkum, Alex; Scherer, Stewart; van Alphen, Loek; Verbrugh, Henri
1998-01-01
Short-sequence DNA repeat (SSR) loci can be identified in all eukaryotic and many prokaryotic genomes. These loci harbor short or long stretches of repeated nucleotide sequence motifs. DNA sequence motifs in a single locus can be identical and/or heterogeneous. SSRs are encountered in many different branches of the prokaryote kingdom. They are found in genes encoding products as diverse as microbial surface components recognizing adhesive matrix molecules and specific bacterial virulence factors such as lipopolysaccharide-modifying enzymes or adhesins. SSRs enable genetic and consequently phenotypic flexibility. SSRs function at various levels of gene expression regulation. Variations in the number of repeat units per locus or changes in the nature of the individual repeat sequences may result from recombination processes or polymerase inadequacy such as slipped-strand mispairing (SSM), either alone or in combination with DNA repair deficiencies. These rather complex phenomena can occur with relative ease, with SSM approaching a frequency of 10−4 per bacterial cell division and allowing high-frequency genetic switching. Bacteria use this random strategy to adapt their genetic repertoire in response to selective environmental pressure. SSR-mediated variation has important implications for bacterial pathogenesis and evolutionary fitness. Molecular analysis of changes in SSRs allows epidemiological studies on the spread of pathogenic bacteria. The occurrence, evolution and function of SSRs, and the molecular methods used to analyze them are discussed in the context of responsiveness to environmental factors, bacterial pathogenicity, epidemiology, and the availability of full-genome sequences for increasing numbers of microorganisms, especially those that are medically relevant. PMID:9618442
Staton, Margaret; Best, Teodora; Khodwekar, Sudhir; Owusu, Sandra; Xu, Tao; Xu, Yi; Jennings, Tara; Cronn, Richard; Arumuganathan, A. Kathiravetpilla; Coggeshall, Mark; Gailing, Oliver; Liang, Haiying; Romero-Severson, Jeanne; Schlarbaum, Scott; Carlson, John E.
2015-01-01
Forest health issues are on the rise in the United States, resulting from introduction of alien pests and diseases, coupled with abiotic stresses related to climate change. Increasingly, forest scientists are finding genetic/genomic resources valuable in addressing forest health issues. For a set of ten ecologically and economically important native hardwood tree species representing a broad phylogenetic spectrum, we used low coverage whole genome sequencing from multiplex Illumina paired ends to economically profile their genomic content. For six species, the genome content was further analyzed by flow cytometry in order to determine the nuclear genome size. Sequencing yielded a depth of 0.8X to 7.5X, from which in silico analysis yielded preliminary estimates of gene and repetitive sequence content in the genome for each species. Thousands of genomic SSRs were identified, with a clear predisposition toward dinucleotide repeats and AT-rich repeat motifs. Flanking primers were designed for SSR loci for all ten species, ranging from 891 loci in sugar maple to 18,167 in redbay. In summary, we have demonstrated that useful preliminary genome information including repeat content, gene content and useful SSR markers can be obtained at low cost and time input from a single lane of Illumina multiplex sequence. PMID:26698853
Genome-wide distribution comparative and composition analysis of the SSRs in Poaceae.
Wang, Yi; Yang, Chao; Jin, Qiaojun; Zhou, Dongjie; Wang, Shuangshuang; Yu, Yuanjie; Yang, Long
2015-02-15
The Poaceae family is of great importance to human beings since it comprises the cereal grasses which are the main sources for human food and animal feed. With the rapid growth of genomic data from Poaceae members, comparative genomics becomes a convinent method to study genetics of diffierent species. The SSRs (Simple Sequence Repeats) are widely used markers in the studies of Poaceae for their high abundance and stability. In this study, using the genomic sequences of 9 Poaceae species, we detected 11,993,943 SSR loci and developed 6,799,910 SSR primer pairs. The results show that SSRs are distributed on all the genomic elements in grass. Hexamer is the most frequent motif and AT/TA is the most frequent motif in dimer. The abundance of the SSRs has a positive linear relationship with the recombination rate. SSR sequences in the coding regions involve a higher GC content in the Poaceae than that in the other species. SSRs of 70-80 bp in length showed the highest AT/GC base ratio among all of these loci. The result shows the highest polymorphism rate belongs to the SSRs ranged from 30 bp to 40 bp. Using all the SSR primers of Japonica, nineteen universal primers were selected and located on the genome of the grass family. The information of SSR loci, the SSR primers and the tools of mining and analyzing SSR are provided in the PSSRD (Poaceae SSR Database, http://biodb.sdau.edu.cn/pssrd/). Our study and the PSSRD database provide a foundation for the comparative study in the Poaceae and it will accelerate the study on markers application, gene mapping and molecular breeding.
De novo transcriptomic analysis and development of EST-SSRs for Sorbus pohuashanensis (Hance) Hedl.
Guan, Xuelian; Fu, Qiang; Zhang, Ze; Hu, Zenghui; Zheng, Jian; Lu, Yizeng; Li, Wei
2017-01-01
Sorbus pohuashanensis is a native tree species of northern China that is used for a variety of ecological purposes. The species is often grown as an ornamental landscape tree because of its beautiful form, silver flowers in early summer, attractive pinnate leaves in summer, and red leaves and fruits in autumn. However, development and further utilization of the species are hindered by the lack of comprehensive genetic information, which impedes research into its genetics and molecular biology. Recent advances in de novo transcriptome sequencing (RNA-seq) technology have provided an effective means to obtain genomic information from non-model species. Here, we applied RNA-seq for sequencing S. pohuashanensis leaves and obtained a total of 137,506 clean reads. After assembly, 96,213 unigenes with an average length of 770 bp were obtained. We found that 64.5% of the unigenes could be annotated using bioinformatics tools to analyze gene function and alignment with the NCBI database. Overall, 59,089 unigenes were annotated using the Nr database(non-redundant protein database), 35,225 unigenes were annotated using the GO (Gene Ontology categories) database, and 33,168 unigenes were annotated using COG (Cluster of Orthologous Groups). Analysis of the unigenes using the KEGG (Kyoto Encyclopedia of Genes and Genomes) database indicated that 13,953 unigenes were involved in 322 metabolic pathways. Finally, simple sequence repeat (SSR) site detection identified 6,604 unigenes that included EST-SSRs and a total of 7,473 EST-SSRs in the unigene sequences. Fifteen polymorphic SSRs were screened and found to be of use for future genetic research. These unigene sequences will provide important genetic resources for genetic improvement and investigation of biochemical processes in S. pohuashanensis. PMID:28614366
Georgi, Laura; Johnson-Cicalese, Jennifer; Honig, Josh; Das, Sushma Parankush; Rajah, Veeran D; Bhattacharya, Debashish; Bassil, Nahla; Rowland, Lisa J; Polashock, James; Vorsa, Nicholi
2013-03-01
The first genetic map of cranberry (Vaccinium macrocarpon) has been constructed, comprising 14 linkage groups totaling 879.9 cM with an estimated coverage of 82.2 %. This map, based on four mapping populations segregating for field fruit-rot resistance, contains 136 distinct loci. Mapped markers include blueberry-derived simple sequence repeat (SSR) and cranberry-derived sequence-characterized amplified region markers previously used for fingerprinting cranberry cultivars. In addition, SSR markers were developed near cranberry sequences resembling genes involved in flavonoid biosynthesis or defense against necrotrophic pathogens, or conserved orthologous set (COS) sequences. The cranberry SSRs were developed from next-generation cranberry genomic sequence assemblies; thus, the positions of these SSRs on the genomic map provide information about the genomic location of the sequence scaffold from which they were derived. The use of SSR markers near COS and other functional sequences, plus 33 SSR markers from blueberry, facilitates comparisons of this map with maps of other plant species. Regions of the cranberry map were identified that showed conservation of synteny with Vitis vinifera and Arabidopsis thaliana. Positioned on this map are quantitative trait loci (QTL) for field fruit-rot resistance (FFRR), fruit weight, titratable acidity, and sound fruit yield (SFY). The SFY QTL is adjacent to one of the fruit weight QTL and may reflect pleiotropy. Two of the FFRR QTL are in regions of conserved synteny with grape and span defense gene markers, and the third FFRR QTL spans a flavonoid biosynthetic gene.
Saxena, Swati; Singh, Archana; Archak, Sunil; Behera, Tushar K; John, Joseph K; Meshram, Sudhir U; Gaikwad, Ambika B
2015-01-01
Microsatellite or simple sequence repeat (SSR) markers are the preferred markers for genetic analyses of crop plants. The availability of a limited number of such markers in bitter gourd (Momordica charantia L.) necessitates the development and characterization of more SSR markers. These were developed from genomic libraries enriched for three dinucleotide, five trinucleotide, and two tetranucleotide core repeat motifs. Employing the strategy of polymerase chain reaction-based screening, the number of clones to be sequenced was reduced by 81 % and 93.7 % of the sequenced clones contained in microsatellite repeats. Unique primer-pairs were designed for 160 microsatellite loci, and amplicons of expected length were obtained for 151 loci (94.4 %). Evaluation of diversity in 54 bitter gourd accessions at 51 loci indicated that 20 % of the loci were polymorphic with the polymorphic information content values ranging from 0.13 to 0.77. Fifteen Indian varieties were clearly distinguished indicative of the usefulness of the developed markers. Markers at 40 loci (78.4 %) were transferable to six species, viz. Momordica cymbalaria, Momordica subangulata subsp. renigera, Momordica balsamina, Momordica dioca, Momordica cochinchinesis, and Momordica sahyadrica. The microsatellite markers reported will be useful in various genetic and molecular genetic studies in bitter gourd, a cucurbit of immense nutritive, medicinal, and economic importance.
USDA-ARS?s Scientific Manuscript database
To discover resistance (R) and/or pathogen-induced (PR) genes involved in disease response, 12 bacterial artificial chromosome (BAC) clones from cv. Acala Maxxa (G. hirsutum) were sequenced at the Clemson University, Genomics Institute, Clemson, SC. These BACs derived MUSB single sequence repeat (SS...
Urasaki, Naoya; Goeku, Satoko; Kaneshima, Risa; Takamine, Tomonori; Tarora, Kazuhiko; Takeuchi, Makoto; Moromizato, Chie; Yonamine, Kaname; Hosaka, Fumiko; Terakami, Shingo; Matsumura, Hideo; Yamamoto, Toshiya; Shoda, Moriyuki
2015-01-01
To explore genome-wide DNA polymorphisms and identify DNA markers for leaf margin phenotypes, a restriction-site-associated DNA sequencing analysis was employed to analyze three bulked DNAs of F1 progeny from a cross between a ‘piping-leaf-type’ cultivar, ‘Yugafu’, and a ‘spiny-tip-leaf-type’ variety, ‘Yonekura’. The parents were both Ananas comosus var. comosus. From the analysis, piping-leaf and spiny-tip-leaf gene-specific restriction-site-associated DNA sequencing tags were obtained and designated as PLSTs and STLSTs, respectively. The five PLSTs and two STSLTs were successfully converted to cleaved amplified polymorphic sequence (CAPS) or simple sequence repeat (SSR) markers using the sequence differences between alleles. Based on the genotyping of the F1 with two SSR and three CAPS markers, the five PLST markers were mapped in the vicinity of the P locus, with the closest marker, PLST1_SSR, being located 1.5 cM from the P locus. The two CAPS markers from STLST1 and STLST3 perfectly assessed the ‘spiny-leaf type’ as homozygotes of the recessive s allele of the S gene. The recombination value between the S locus and STLST loci was 2.4, and STLSTs were located 2.2 cM from the S locus. SSR and CAPS markers are applicable to marker-assisted selection of leaf margin phenotypes in pineapple breeding. PMID:26175625
Urasaki, Naoya; Goeku, Satoko; Kaneshima, Risa; Takamine, Tomonori; Tarora, Kazuhiko; Takeuchi, Makoto; Moromizato, Chie; Yonamine, Kaname; Hosaka, Fumiko; Terakami, Shingo; Matsumura, Hideo; Yamamoto, Toshiya; Shoda, Moriyuki
2015-06-01
To explore genome-wide DNA polymorphisms and identify DNA markers for leaf margin phenotypes, a restriction-site-associated DNA sequencing analysis was employed to analyze three bulked DNAs of F1 progeny from a cross between a 'piping-leaf-type' cultivar, 'Yugafu', and a 'spiny-tip-leaf-type' variety, 'Yonekura'. The parents were both Ananas comosus var. comosus. From the analysis, piping-leaf and spiny-tip-leaf gene-specific restriction-site-associated DNA sequencing tags were obtained and designated as PLSTs and STLSTs, respectively. The five PLSTs and two STSLTs were successfully converted to cleaved amplified polymorphic sequence (CAPS) or simple sequence repeat (SSR) markers using the sequence differences between alleles. Based on the genotyping of the F1 with two SSR and three CAPS markers, the five PLST markers were mapped in the vicinity of the P locus, with the closest marker, PLST1_SSR, being located 1.5 cM from the P locus. The two CAPS markers from STLST1 and STLST3 perfectly assessed the 'spiny-leaf type' as homozygotes of the recessive s allele of the S gene. The recombination value between the S locus and STLST loci was 2.4, and STLSTs were located 2.2 cM from the S locus. SSR and CAPS markers are applicable to marker-assisted selection of leaf margin phenotypes in pineapple breeding.
Liu, Y T; Chen, R K; Lin, S J; Chen, Y C; Chin, S W; Chen, F C; Lee, C Y
2014-04-08
The Orchidaceae is one of the largest and most diverse families of flowering plants. The Dendrobium genus has high economic potential as ornamental plants and for medicinal purposes. In addition, the species of this genus are able to produce large crops. However, many Dendrobium varieties are very similar in outward appearance, making it difficult to distinguish one species from another. This study demonstrated that the 12 Dendrobium species used in this study may be divided into 2 groups by internal transcribed spacer (ITS) sequence analysis. Red and yellow flowers may also be used to separate these species into 2 main groups. In particular, the deciduous characteristic is associated with the ITS genetic diversity of the A group. Of 53 designed simple sequence repeat (SSR) primer pairs, 7 pairs were polymorphic for polymerase chain reaction products that were amplified from a specific band. The results of this study demonstrate that these 7 SSR primer pairs may potentially be used to identify Dendrobium species and their progeny in future studies.
Genome Wide Characterization of Short Tandem Repeat Markers in Sweet Orange (Citrus sinensis)
Biswas, Manosh Kumar; Xu, Qiang; Mayer, Christoph; Deng, Xiuxin
2014-01-01
Sweet orange (Citrus sinensis) is one of the major cultivated and most-consumed citrus species. With the goal of enhancing the genomic resources in citrus, we surveyed, developed and characterized microsatellite markers in the ≈347 Mb sequence assembly of the sweet orange genome. A total of 50,846 SSRs were identified with a frequency of 146.4 SSRs/Mbp. Dinucleotide repeats are the most frequent repeat class and the highest density of SSRs was found in chromosome 4. SSRs are non-randomly distributed in the genome and most of the SSRs (62.02%) are located in the intergenic regions. We found that AT-rich SSRs are more frequent than GC-rich SSRs. A total number of 21,248 SSR primers were successfully developed, which represents 89 SSR markers per Mb of the genome. A subset of 950 developed SSR primer pairs were synthesized and tested by wet lab experiments on a set of 16 citrus accessions. In total we identified 534 (56.21%) polymorphic SSR markers that will be useful in citrus improvement. The number of amplified alleles ranges from 2 to 12 with an average of 4 alleles per marker and an average PIC value of 0.75. The newly developed sweet orange primer sequences, their in silico PCR products, exact position in the genome assembly and putative function are made publicly available. We present the largest number of SSR markers ever developed for a citrus species. Almost two thirds of the markers are transferable to 16 citrus relatives and may be used for constructing a high density linkage map. In addition, they are valuable for marker-assisted selection studies, population structure analyses and comparative genomic studies of C. sinensis with other citrus related species. Altogether, these markers provide a significant contribution to the citrus research community. PMID:25148383
Genome wide characterization of short tandem repeat markers in sweet orange (Citrus sinensis).
Biswas, Manosh Kumar; Xu, Qiang; Mayer, Christoph; Deng, Xiuxin
2014-01-01
Sweet orange (Citrus sinensis) is one of the major cultivated and most-consumed citrus species. With the goal of enhancing the genomic resources in citrus, we surveyed, developed and characterized microsatellite markers in the ≈347 Mb sequence assembly of the sweet orange genome. A total of 50,846 SSRs were identified with a frequency of 146.4 SSRs/Mbp. Dinucleotide repeats are the most frequent repeat class and the highest density of SSRs was found in chromosome 4. SSRs are non-randomly distributed in the genome and most of the SSRs (62.02%) are located in the intergenic regions. We found that AT-rich SSRs are more frequent than GC-rich SSRs. A total number of 21,248 SSR primers were successfully developed, which represents 89 SSR markers per Mb of the genome. A subset of 950 developed SSR primer pairs were synthesized and tested by wet lab experiments on a set of 16 citrus accessions. In total we identified 534 (56.21%) polymorphic SSR markers that will be useful in citrus improvement. The number of amplified alleles ranges from 2 to 12 with an average of 4 alleles per marker and an average PIC value of 0.75. The newly developed sweet orange primer sequences, their in silico PCR products, exact position in the genome assembly and putative function are made publicly available. We present the largest number of SSR markers ever developed for a citrus species. Almost two thirds of the markers are transferable to 16 citrus relatives and may be used for constructing a high density linkage map. In addition, they are valuable for marker-assisted selection studies, population structure analyses and comparative genomic studies of C. sinensis with other citrus related species. Altogether, these markers provide a significant contribution to the citrus research community.
A set of primers for analyzing chloroplast DNA diversity in Citrus and related genera.
Cheng, Yunjiang; de Vicente, M Carmen; Meng, Haijun; Guo, Wenwu; Tao, Nengguo; Deng, Xiuxin
2005-06-01
Chloroplast simple sequence repeat (cpSSR) markers in Citrus were developed and used to analyze chloroplast diversity of Citrus and closely related genera. Fourteen cpSSR primer pairs from the chloroplast genomes of tobacco (Nicotiana tabacum L.) and Arabidopsis were found useful for analyzing the Citrus chloroplast genome (cpDNA) and recoded with the prefix SPCC (SSR Primers for Citrus Chloroplast). Eleven of the 14 primer pairs revealed some degree of polymorphism among 34 genotypes of Citrus, Fortunella, Poncirus and some of their hybrids, with polymorphism information content (PIC) values ranging from 0.057 to 0.732, and 18 haplotypes were identified. The cpSSR data were analyzed with NTSYS-pc software, and the genetic relationships suggested by the unweighted pair group method based on arithmetic means (UPGMA) dendrogram were congruent with previous taxonomic investigations: the results showed that all samples fell into seven major clusters, i.e., Citrus medica L., Poncirus, Fortunella, C. ichangensis Blanco, C. reticulata Swingle, C. aurantifolia (Christm.) Swingle and C. grandis (L.) Osbeck. The results of previous studies combined with our cpSSR analyses revealed that: (1) Calamondin (C. madurensis Swingle) is the result of hybridization between kumquat (Fortunella) and mandarin (C. reticulata), where kumquat acted as the female parent; (2) Ichang papeda (C. ichangensis) has a unique taxonomic status; and (3) although Bendiguangju mandarin (C. reticulata) and Satsuma mandarin (C. reticulata) are similar in fruit shape and leaf morphology, they have different maternal parents. Bendiguangju mandarin has the same cytoplasm as sweet orange (C. sinensis), whereas Satsuma mandarin has the cytoplasm of C. reticulata. Seventeen PCR products from SPCC1 and 21 from SPCC11 were cloned and sequenced. The results revealed that mononucleotide repeats as well as insertions and deletions of small segments of DNA were associated with SPCC1 polymorphism, whereas polymorphism generated by SPCC11 was essentially due to the variation in length of the mononucleotide repeats.
Zhang, Xiaoyan; Hu, Jinguo; Bao, Shiying; Hao, Junjie; Li, Ling; He, Yuhua; Jiang, Junye; Wang, Fang; Tian, Shufang; Zong, Xuxiao
2015-01-01
Pea (Pisum sativum L.) is an important food legume globally, and is the plant species that J.G. Mendel used to lay the foundation of modern genetics. However, genomics resources of pea are limited comparing to other crop species. Application of marker assisted selection (MAS) in pea breeding has lagged behind many other crops. Development of a large number of novel and reliable SSR (simple sequence repeat) or microsatellite markers will help both basic and applied genomics research of this crop. The Illumina HiSeq 2500 System was used to uncover 8,899 putative SSR containing sequences, and 3,275 non-redundant primers were designed to amplify these SSRs. Among the 1,644 SSRs that were randomly selected for primer validation, 841 yielded reliable amplifications of detectable polymorphisms among 24 genotypes of cultivated pea (Pisum sativum L.) and wild relatives (P. fulvum Sm.) originated from diverse geographical locations. The dataset indicated that the allele number per locus ranged from 2 to 10, and that the polymorphism information content (PIC) ranged from 0.08 to 0.82 with an average of 0.38. These 1,644 novel SSR markers were also tested for polymorphism between genotypes G0003973 and G0005527. Finally, 33 polymorphic SSR markers were anchored on the genetic linkage map of G0003973 × G0005527 F2 population. PMID:26440522
Construction of the first genetic linkage map of Japanese gentian (Gentianaceae)
2012-01-01
Background Japanese gentians (Gentiana triflora and Gentiana scabra) are amongst the most popular floricultural plants in Japan. However, genomic resources for Japanese gentians have not yet been developed, mainly because of the heterozygous genome structure conserved by outcrossing, the long juvenile period, and limited knowledge about the inheritance of important traits. In this study, we developed a genetic linkage map to improve breeding programs of Japanese gentians. Results Enriched simple sequence repeat (SSR) libraries from a G. triflora double haploid line yielded almost 20,000 clones using 454 pyrosequencing technology, 6.7% of which could be used to design SSR markers. To increase the number of molecular markers, we identified three putative long terminal repeat (LTR) sequences using the recently developed inter-primer binding site (iPBS) method. We also developed retrotransposon microsatellite amplified polymorphism (REMAP) markers combining retrotransposon and inter-simple sequence repeat (ISSR) markers. In addition to SSR and REMAP markers, modified amplified fragment length polymorphism (AFLP) and random amplification polymorphic DNA (RAPD) markers were developed. Using 93 BC1 progeny from G. scabra backcrossed with a G. triflora double haploid line, 19 linkage groups were constructed with a total of 263 markers (97 SSR, 97 AFLP, 39 RAPD, and 30 REMAP markers). One phenotypic trait (stem color) and 10 functional markers related to genes controlling flower color, flowering time and cold tolerance were assigned to the linkage map, confirming its utility. Conclusions This is the first reported genetic linkage map for Japanese gentians and for any species belonging to the family Gentianaceae. As demonstrated by mapping of functional markers and the stem color trait, our results will help to explain the genetic basis of agronomic important traits, and will be useful for marker-assisted selection in gentian breeding programs. Our map will also be an important resource for further genetic analyses such as mapping of quantitative trait loci and map-based cloning of genes in this species. PMID:23186361
Molecular discrimination of tall fescue morphotypes in association with Festuca relatives
Chekhovskiy, Konstantin
2018-01-01
Tall fescue (Festuca arundinacea Schreb.) is an important cool-season perennial grass species used as forage and turf, and in conservation plantings. There are three morphotypes in hexaploid tall fescue: Continental, Mediterranean and Rhizomatous. This study was conducted to develop morphotype-specific molecular markers to distinguish Continental and Mediterranean tall fescues, and establish their relationships with other species of the Festuca genus for genomic inference. Chloroplast sequence variation and simple sequence repeat (SSR) polymorphism were explored in 12 genotypes of three tall fescue morphotypes and four Festuca species. Hypervariable chloroplast regions were retrieved by using 33 specifically designed primers followed by sequencing the PCR products. SSR polymorphism was studied using 144 tall fescue SSR primers. Four chloroplast (NFTCHL17, NFTCHL43, NFTCHL45 and NFTCHL48) and three SSR (nffa090, nffa204 and nffa338) markers were identified which can distinctly differentiate Continental and Mediterranean morphotypes. A primer pair, NFTCHL45, amplified a 47 bp deletion between the two morphotypes is being routinely used in the Noble Research Institute’s core facility for morphotype discrimination. Both chloroplast sequence variation and SSR diversity showed a close association between Rhizomatous and Continental morphotypes, while the Mediterranean morphotype was in a distant clade. F. pratensis and F. arundinacea var. glaucescens, the P and G1G2 genome donors, respectively, were grouped with the Continental clade, and F. mairei (M1M2 genome) grouped with the Mediterranean clade in chloroplast sequence variation, while both F. pratensis and F. mairei formed independent clade in SSR analysis. Age estimation based on chloroplast sequence variation indicated that the Continental and Mediterranean clades might have been colonized independently during 0.65 ± 0.06 and 0.96 ± 0.1 million years ago (Mya) respectively. The findings of the study will enhance tall fescue breeding for persistence and productivity. PMID:29342197
2014-01-01
Background Foxtail millet (Setaria italica (L.) Beauv.) is an important gramineous grain-food and forage crop. It is grown worldwide for human and livestock consumption. Its small genome and diploid nature have led to foxtail millet fast becoming a novel model for investigating plant architecture, drought tolerance and C4 photosynthesis of grain and bioenergy crops. Therefore, cost-effective, reliable and highly polymorphic molecular markers covering the entire genome are required for diversity, mapping and functional genomics studies in this model species. Result A total of 5,020 highly repetitive microsatellite motifs were isolated from the released genome of the genotype 'Yugu1’ by sequence scanning. Based on sequence comparison between S. italica and S. viridis, a set of 788 SSR primer pairs were designed. Of these primers, 733 produced reproducible amplicons and were polymorphic among 28 Setaria genotypes selected from diverse geographical locations. The number of alleles detected by these SSR markers ranged from 2 to 16, with an average polymorphism information content of 0.67. The result obtained by neighbor-joining cluster analysis of 28 Setaria genotypes, based on Nei’s genetic distance of the SSR data, showed that these SSR markers are highly polymorphic and effective. Conclusions A large set of highly polymorphic SSR markers were successfully and efficiently developed based on genomic sequence comparison between different genotypes of the genus Setaria. The large number of new SSR markers and their placement on the physical map represent a valuable resource for studying diversity, constructing genetic maps, functional gene mapping, QTL exploration and molecular breeding in foxtail millet and its closely related species. PMID:24472631
Zhang, Shuo; Tang, Chanjuan; Zhao, Qiang; Li, Jing; Yang, Lifang; Qie, Lufeng; Fan, Xingke; Li, Lin; Zhang, Ning; Zhao, Meicheng; Liu, Xiaotong; Chai, Yang; Zhang, Xue; Wang, Hailong; Li, Yingtao; Li, Wen; Zhi, Hui; Jia, Guanqing; Diao, Xianmin
2014-01-28
Foxtail millet (Setaria italica (L.) Beauv.) is an important gramineous grain-food and forage crop. It is grown worldwide for human and livestock consumption. Its small genome and diploid nature have led to foxtail millet fast becoming a novel model for investigating plant architecture, drought tolerance and C4 photosynthesis of grain and bioenergy crops. Therefore, cost-effective, reliable and highly polymorphic molecular markers covering the entire genome are required for diversity, mapping and functional genomics studies in this model species. A total of 5,020 highly repetitive microsatellite motifs were isolated from the released genome of the genotype 'Yugu1' by sequence scanning. Based on sequence comparison between S. italica and S. viridis, a set of 788 SSR primer pairs were designed. Of these primers, 733 produced reproducible amplicons and were polymorphic among 28 Setaria genotypes selected from diverse geographical locations. The number of alleles detected by these SSR markers ranged from 2 to 16, with an average polymorphism information content of 0.67. The result obtained by neighbor-joining cluster analysis of 28 Setaria genotypes, based on Nei's genetic distance of the SSR data, showed that these SSR markers are highly polymorphic and effective. A large set of highly polymorphic SSR markers were successfully and efficiently developed based on genomic sequence comparison between different genotypes of the genus Setaria. The large number of new SSR markers and their placement on the physical map represent a valuable resource for studying diversity, constructing genetic maps, functional gene mapping, QTL exploration and molecular breeding in foxtail millet and its closely related species.
Diekmann, Kerstin; Hodkinson, Trevor R.; Barth, Susanne
2012-01-01
Background and Aims Lolium perenne (perennial ryegrass) is the most important forage grass species of temperate regions. We have previously released the chloroplast genome sequence of L. perenne ‘Cashel’. Here nine chloroplast microsatellite markers are published, which were designed based on knowledge about genetically variable regions within the L. perenne chloroplast genome. These markers were successfully used for characterizing the genetic diversity in Lolium and different grass species. Methods Chloroplast genomes of 14 Poaceae taxa were screened for mononucleotide microsatellite repeat regions and primers designed for their amplification from nine loci. The potential of these markers to assess genetic diversity was evaluated on a set of 16 Irish and 15 European L. perenne ecotypes, nine L. perenne cultivars, other Lolium taxa and other grass species. Key Results All analysed Poaceae chloroplast genomes contained more than 200 mononucleotide repeats (chloroplast simple sequence repeats, cpSSRs) of at least 7 bp in length, concentrated mainly in the large single copy region of the genome. Nucleotide composition varied considerably among subfamilies (with Pooideae biased towards poly A repeats). The nine new markers distinguish L. perenne from all non-Lolium taxa. TeaCpSSR28 was able to distinguish between all Lolium species and Lolium multiflorum due to an elongation of an A8 mononucleotide repeat in L. multiflorum. TeaCpSSR31 detected a considerable degree of microsatellite length variation and single nucleotide polymorphism. TeaCpSSR27 revealed variation within some L. perenne accessions due to a 44-bp indel and was hence readily detected by simple agarose gel electrophoresis. Smaller insertion/deletion events or single nucleotide polymorphisms detected by these new markers could be visualized by polyacrylamide gel electrophoresis or DNA sequencing, respectively. Conclusions The new markers are a valuable tool for plant breeding companies, seed testing agencies and the wider scientific community due to their ability to monitor genetic diversity within breeding pools, to trace maternal inheritance and to distinguish closely related species. PMID:22419761
Diekmann, Kerstin; Hodkinson, Trevor R; Barth, Susanne
2012-11-01
Lolium perenne (perennial ryegrass) is the most important forage grass species of temperate regions. We have previously released the chloroplast genome sequence of L. perenne 'Cashel'. Here nine chloroplast microsatellite markers are published, which were designed based on knowledge about genetically variable regions within the L. perenne chloroplast genome. These markers were successfully used for characterizing the genetic diversity in Lolium and different grass species. Chloroplast genomes of 14 Poaceae taxa were screened for mononucleotide microsatellite repeat regions and primers designed for their amplification from nine loci. The potential of these markers to assess genetic diversity was evaluated on a set of 16 Irish and 15 European L. perenne ecotypes, nine L. perenne cultivars, other Lolium taxa and other grass species. All analysed Poaceae chloroplast genomes contained more than 200 mononucleotide repeats (chloroplast simple sequence repeats, cpSSRs) of at least 7 bp in length, concentrated mainly in the large single copy region of the genome. Nucleotide composition varied considerably among subfamilies (with Pooideae biased towards poly A repeats). The nine new markers distinguish L. perenne from all non-Lolium taxa. TeaCpSSR28 was able to distinguish between all Lolium species and Lolium multiflorum due to an elongation of an A(8) mononucleotide repeat in L. multiflorum. TeaCpSSR31 detected a considerable degree of microsatellite length variation and single nucleotide polymorphism. TeaCpSSR27 revealed variation within some L. perenne accessions due to a 44-bp indel and was hence readily detected by simple agarose gel electrophoresis. Smaller insertion/deletion events or single nucleotide polymorphisms detected by these new markers could be visualized by polyacrylamide gel electrophoresis or DNA sequencing, respectively. The new markers are a valuable tool for plant breeding companies, seed testing agencies and the wider scientific community due to their ability to monitor genetic diversity within breeding pools, to trace maternal inheritance and to distinguish closely related species.
Li, Muwang; Shen, Li; Xu, Anying; Miao, Xuexia; Hou, Chengxiang; Sun, Pingjiang; Zhang, Yuehua; Huang, Yongping
2005-10-01
To determine genetic relationships among strains of silkworm, Bombyx mori L., 31 strains with different origins, number of generations per year, number of molts per generation, and morphological characters were studied using simple sequence repeat (SSR) markers. Twenty-six primer pairs flanking microsatellite sequences in the silkworm genome were assayed. All were polymorphic and unambiguously separated silkworm strains from each other. A total of 188 alleles were detected with a mean value of 7.2 alleles/locus (range 2-17). The average heterozygosity value for each SSR locus ranged from 0 to 0.60, and the highest one was 0.96 (Fl0516 in 4013). The mean polymorphism index content (PIC) was 0.66 (range 0.12-0.89). Unweighted pair group method with arithmetic means (UPGMA) cluster analysis of Nei's genetic distance grouped silkworm strains based on their origin. Seven major ecotypic silkworm groups were analyzed. Principal components analysis (PCA) for SSR data support their UPGMA clustering. The results indicated that SSR markers are an efficient tool for fingerprinting cultivars and conducting genetic-diversity studies in the silkworm.
USDA-ARS?s Scientific Manuscript database
To confirm a hybrid swarm population of Pinus densiflora × P. sylvestris in Jilin, China and to study whether shoot apex morphology of 4-year old seedlings can be correlated with the sequence of a chloroplast DNA simple sequence repeat marker (cpDNA SSR), needles and seeds from P. densiflora, P. syl...
Daware, Anurag; Das, Sweta; Srivastava, Rishi; Badoni, Saurabh; Singh, Ashok K.; Agarwal, Pinky; Parida, Swarup K.; Tyagi, Akhilesh K.
2016-01-01
Development and use of genome-wide informative simple sequence repeat (SSR) markers and novel integrated genomic strategies are vital to drive genomics-assisted breeding applications and for efficient dissection of quantitative trait loci (QTLs) underlying complex traits in rice. The present study developed 6244 genome-wide informative SSR markers exhibiting in silico fragment length polymorphism based on repeat-unit variations among genomic sequences of 11 indica, japonica, aus, and wild rice accessions. These markers were mapped on diverse coding and non-coding sequence components of known cloned/candidate genes annotated from 12 chromosomes and revealed a much higher amplification (97%) and polymorphic potential (88%) along with wider genetic/functional diversity level (16–74% with a mean 53%) especially among accessions belonging to indica cultivar group, suggesting their utility in large-scale genomics-assisted breeding applications in rice. A high-density 3791 SSR markers-anchored genetic linkage map (IR 64 × Sonasal) spanning 2060 cM total map-length with an average inter-marker distance of 0.54 cM was generated. This reference genetic map identified six major genomic regions harboring robust QTLs (31% combined phenotypic variation explained with a 5.7–8.7 LOD) governing grain weight on six rice chromosomes. One strong grain weight major QTL region (OsqGW5.1) was narrowed-down by integrating traditional QTL mapping with high-resolution QTL region-specific integrated SSR and single nucleotide polymorphism markers-based QTL-seq analysis and differential expression profiling. This led us to delineate two natural allelic variants in two known cis-regulatory elements (RAV1AAT and CARGCW8GAT) of glycosyl hydrolase and serine carboxypeptidase genes exhibiting pronounced seed-specific differential regulation in low (Sonasal) and high (IR 64) grain weight mapping parental accessions. Our genome-wide SSR marker resource (polymorphic within/between diverse cultivar groups) and integrated genomic strategy can efficiently scan functionally relevant potential molecular tags (markers, candidate genes and alleles) regulating complex agronomic traits (grain weight) and expedite marker-assisted genetic enhancement in rice. PMID:27833617
2011-01-01
Background Pigeonpea [Cajanus cajan (L.) Millsp.] is an important legume crop of rainfed agriculture. Despite of concerted research efforts directed to pigeonpea improvement, stagnated productivity of pigeonpea during last several decades may be accounted to prevalence of various biotic and abiotic constraints and the situation is exacerbated by availability of inadequate genomic resources to undertake any molecular breeding programme for accelerated crop improvement. With the objective of enhancing genomic resources for pigeonpea, this study reports for the first time, large scale development of SSR markers from BAC-end sequences and their subsequent use for genetic mapping and hybridity testing in pigeonpea. Results A set of 88,860 BAC (bacterial artificial chromosome)-end sequences (BESs) were generated after constructing two BAC libraries by using HindIII (34,560 clones) and BamHI (34,560 clones) restriction enzymes. Clustering based on sequence identity of BESs yielded a set of >52K non-redundant sequences, comprising 35 Mbp or >4% of the pigeonpea genome. These sequences were analyzed to develop annotation lists and subdivide the BESs into genome fractions (e.g., genes, retroelements, transpons and non-annotated sequences). Parallel analysis of BESs for microsatellites or simple sequence repeats (SSRs) identified 18,149 SSRs, from which a set of 6,212 SSRs were selected for further analysis. A total of 3,072 novel SSR primer pairs were synthesized and tested for length polymorphism on a set of 22 parental genotypes of 13 mapping populations segregating for traits of interest. In total, we identified 842 polymorphic SSR markers that will have utility in pigeonpea improvement. Based on these markers, the first SSR-based genetic map comprising of 239 loci was developed for this previously uncharacterized genome. Utility of developed SSR markers was also demonstrated by identifying a set of 42 markers each for two hybrids (ICPH 2671 and ICPH 2438) for genetic purity assessment in commercial hybrid breeding programme. Conclusion In summary, while BAC libraries and BESs should be useful for genomics studies, BES-SSR markers, and the genetic map should be very useful for linking the genetic map with a future physical map as well as for molecular breeding in pigeonpea. PMID:21447154
Tong, Zhijun; Xiao, Bingguang; Jiao, Fangchan; Fang, Dunhuang; Zeng, Jianmin; Wu, Xingfu; Chen, Xuejun; Yang, Jiankang; Li, Yongping
2016-01-01
Tobacco (Nicotiana tabacum L.), particularly flue-cured tobacco, is one of the most economically important nonfood crops and is also an important model system in plant biotechnology. Despite its importance, only limited molecular marker resources are available for genome analysis, genetic mapping, and breeding. Simple sequence repeats (SSR) are one of the most widely-used molecular markers, having significant advantages including that they are generally co-dominant, easy to use, abundant in eukaryotic organisms, and produce highly reproducible results. In this study, based on the genome sequence data of flue-cured tobacco (K326), we developed a total of 13,645 mostly novel SSR markers, which were working in a set of eighteen tobacco varieties of four different types. A mapping population of 213 backcross (BC1) individuals, which were derived from an intra-type cross between two flue-cured tobacco varieties, Y3 and K326, was selected for mapping. Based on the newly developed SSR markers as well as published SSR markers, we constructed a genetic map consisting of 626 SSR loci distributed across 24 linkage groups and covering a total length of 1120.45 cM with an average distance of 1.79 cM between adjacent markers, which is the highest density map of flue-cured tobacco till date. PMID:27436948
Yada, Benard; Brown-Guedira, Gina; Alajo, Agnes; Ssemakula, Gorrettie N; Owusu-Mensah, Eric; Carey, Edward E; Mwanga, Robert O M; Yencho, G Craig
2017-03-01
Molecular markers are needed for enhancing the development of elite sweetpotato ( Ipomoea batatas (L.) Lam) cultivars with a wide range of commercially important traits in sub-Saharan Africa. This study was conducted to estimate the heritability and determine trait correlations of storage root yield, dry matter, starch and β-carotene content in a cross between 'New Kawogo' × 'Beauregard'. The study was also conducted to identify simple sequence repeat (SSR) markers associated with these traits. A total of 287 progeny and the parents were evaluated for two seasons at three sites in Uganda and genotyped with 250 SSR markers. Broad sense heritability (H 2 ) for storage root yield, dry matter, starch and β-carotene content were 0.24, 0.68, 0.70 and 0.90, respectively. Storage root β-carotene content was negatively correlated with dry matter (r = -0.59, P < 0.001) and starch (r = -0.93, P < 0.001) content, while storage root yield was positively correlated with dry matter (r = 0.57, P = 0.029) and starch (r = 0.41, P = 0.008) content. Through logistic regression, a total of 12, 4, 6 and 8 SSR markers were associated with storage root yield, dry matter, starch and β-carotene content, respectively. The SSR markers used in this study may be useful for quantitative trait loci analysis and selection for these traits in future.
Yada, Benard; Brown-Guedira, Gina; Alajo, Agnes; Ssemakula, Gorrettie N.; Owusu-Mensah, Eric; Carey, Edward E.; Mwanga, Robert O.M.; Yencho, G. Craig
2017-01-01
Molecular markers are needed for enhancing the development of elite sweetpotato (Ipomoea batatas (L.) Lam) cultivars with a wide range of commercially important traits in sub-Saharan Africa. This study was conducted to estimate the heritability and determine trait correlations of storage root yield, dry matter, starch and β-carotene content in a cross between ‘New Kawogo’ × ‘Beauregard’. The study was also conducted to identify simple sequence repeat (SSR) markers associated with these traits. A total of 287 progeny and the parents were evaluated for two seasons at three sites in Uganda and genotyped with 250 SSR markers. Broad sense heritability (H2) for storage root yield, dry matter, starch and β-carotene content were 0.24, 0.68, 0.70 and 0.90, respectively. Storage root β-carotene content was negatively correlated with dry matter (r = −0.59, P < 0.001) and starch (r = −0.93, P < 0.001) content, while storage root yield was positively correlated with dry matter (r = 0.57, P = 0.029) and starch (r = 0.41, P = 0.008) content. Through logistic regression, a total of 12, 4, 6 and 8 SSR markers were associated with storage root yield, dry matter, starch and β-carotene content, respectively. The SSR markers used in this study may be useful for quantitative trait loci analysis and selection for these traits in future. PMID:28588391
Maternal lineages of peach genotypes
USDA-ARS?s Scientific Manuscript database
Simple sequence repeats (SSRs) in chloroplast genomes are useful markers to determine maternal lineages. The SSR mining results revealed that most chloroplast SSRs among three Prunus chloroplast genomes were conserved in locations and motif types, but polymorphic in motif and/or amplicon lengths. Fi...
Geographic patterns of genetic variation in native pecans
USDA-ARS?s Scientific Manuscript database
A structured collection of eighty seedling pecan trees [Carya illinoinensis (Wangenh.) K. Koch] representing nineteen putatively native pecan populations across the species range were evaluated at three plastid and 14 nuclear microsatellite (simple sequence repeat, SSR) loci. Data were analyzed usi...
Li, Qing; Li, Biao; Guo, Shun-Xing
2017-01-01
SSR is one of the most important molecular markers used in molecular identification and genetic diversity research of Dendrobium nobile. In order to enrich the library of SSR and establish a method for rapid identification of D. nobile, the SSR information was analyzed in the transcriptome of D. nobile. A total of 32 709 SSRs were obtained from the transcriptome of D. nobile, distributed in 26 742 unigenes with the distribution frequency of 12.90%. SSR loci occurred every 3 748 bp. Mono-nucleotide repeat was the main type, account for as much as 72.18% of all SSRs, followed by di-nucleotide (15.97%) and tri-nucleotide (11.19%). Among all repeat types, A/T was the predominant one followed by AG/CT. Finally a total of 62 157 primer pairs were designed for marker development. Randomly 20 pairs of primers were selected for PCR amplification, 17 amplified on clear and reproducible bands, the amplification rate was 85.0%.Thirteen pairs were polymorphic among the 3 Dendrobium plants. The results indicated that the unigenes generated from transcriptome sequencing in D. nobile can be used as effective source to develop SSR markers. The SSR loci in the transcriptome of D. nobile have the characteristics of type riches, high density and high potential of polymorphism, and these characteristics might applied in the study of molecular identification, genetic diversity and marker-assisted breeding of D. nobile and its closely related species. Copyright© by the Chinese Pharmaceutical Association.
2012-01-01
Background Chinese bayberry (Myrica rubra Sieb. and Zucc.) is a subtropical evergreen tree originating in China. It has been cultivated in southern China for several thousand years, and annual production has reached 1.1 million tons. The taste and high level of health promoting characters identified in the fruit in recent years has stimulated its extension in China and introduction to Australia. A limited number of co-dominant markers have been developed and applied in genetic diversity and identity studies. Here we report, for the first time, a survey of whole genome shotgun data to develop a large number of simple sequence repeat (SSR) markers to analyse the genetic diversity of the common cultivated Chinese bayberry and the relationship with three other Myrica species. Results The whole genome shotgun survey of Chinese bayberry produced 9.01Gb of sequence data, about 26x coverage of the estimated genome size of 323 Mb. The genome sequences were highly heterozygous, but with little duplication. From the initial assembled scaffold covering 255 Mb sequence data, 28,602 SSRs (≥5 repeats) were identified. Dinucleotide was the most common repeat motif with a frequency of 84.73%, followed by 13.78% trinucleotide, 1.34% tetranucleotide, 0.12% pentanucleotide and 0.04% hexanucleotide. From 600 primer pairs, 186 polymorphic SSRs were developed. Of these, 158 were used to screen 29 Chinese bayberry accessions and three other Myrica species: 91.14%, 89.87% and 46.84% SSRs could be used in Myrica adenophora, Myrica nana and Myrica cerifera, respectively. The UPGMA dendrogram tree showed that cultivated Myrica rubra is closely related to Myrica adenophora and Myrica nana, originating in southwest China, and very distantly related to Myrica cerifera, originating in America. These markers can be used in the construction of a linkage map and for genetic diversity studies in Myrica species. Conclusion Myrica rubra has a small genome of about 323 Mb with a high level of heterozygosity. A large number of SSRs were identified, and 158 polymorphic SSR markers developed, 91% of which can be transferred to other Myrica species. PMID:22621340
Characterization of 10 new nuclear microsatellite markers in Acca sellowiana (Myrtaceae).
Klabunde, Gustavo H F; Olkoski, Denise; Vilperte, Vinicius; Zucchi, Maria I; Nodari, Rubens O
2014-06-01
Microsatellite primers were identified and characterized in Acca sellowiana in order to expand the limited number of pre-existing polymorphic markers for use in population genetic studies for conservation, phylogeography, breeding, and domestication. • A total of 10 polymorphic microsatellite primers were designed from clones obtained from a simple sequence repeat (SSR)-enriched genomic library. The primers amplified di- and trinucleotide repeats with four to 27 alleles per locus. In all tested populations, the observed heterozygosity ranged from 0.269 to 1.0. • These new polymorphic SSR markers will allow future genetic studies to be denser, either for genetic structure characterization of natural populations or for studies involving genetic breeding and domestication process in A. sellowiana.
Estimation of genetic diversity using SSR markers in sunflower
USDA-ARS?s Scientific Manuscript database
Sunflower is a major oilseed crop in central Asia, but little is known of the molecular diversity among collections of sunflower from Pakistan region. This paper described inherent genetic relationships among sunflower collections using Simple Sequence Repeat molecular markers. Results should help...
Multiplexed microsatellite markers for seven Metarhizium species
USDA-ARS?s Scientific Manuscript database
Cross-species transferability of 41 previously published simple sequence repeat (SSR) markers was assessed for 11 species of the entomopathogenic fungus Metarhizium. A collection of 65 Metarhizium isolates including all 54 used in a recent phylogenetic revision of the genus were characterized. Betwe...
Characterization and multiplexing of EST-SSR primers in Cynodon (Poaceae) species1.
Jewell, Margaret C; Frere, Celine H; Prentis, Peter J; Lambrides, Christopher J; Godwin, Ian D
2010-10-01
Cynodon species are multiple-use grasses that display varying levels of adaptation to biotic and abiotic stress. Previously identified EST-SSR primers were characterized and multiplexed to assess the level of genetic diversity present within a collection of almost 1200 Cynodon accessions from across Australia. • Two multiplex reactions were developed comprising a total of 16 EST-SSR markers. All SSR markers amplified across different Cynodon species and different levels of ploidy. The number of alleles ranged from one to eight per locus and the total number of alleles for the germplasm collection was 79. • The 16 markers show sufficient variation for the characterization of Cynodon core collections and analysis of population genetic diversity in Cynodon grasses.
DNA profiling of pineapple cultivars in Japan discriminated by SSR markers
Shoda, Moriyuki; Urasaki, Naoya; Sakiyama, Sumisu; Terakami, Shingo; Hosaka, Fumiko; Shigeta, Narumi; Nishitani, Chikako; Yamamoto, Toshiya
2012-01-01
We developed 18 polymorphic simple sequence repeat (SSR) markers in pineapple (Ananas comosus) by using genomic libraries enriched for GA and CA motifs. The markers were used to genotype 31 pineapple accessions, including seven cultivars and 11 breeding lines from Okinawa Prefecture, 12 foreign accessions and one from a related species. These SSR loci were highly polymorphic: the 31 accessions contained three to seven alleles per locus, with an average of 4.1. The values of expected heterozygosity ranged from 0.09 to 0.76, with an average of 0.52. All 31 accessions could be successfully differentiated by the 18 SSR markers, with the exception of ‘N67-10’ and ‘Hawaiian Smooth Cayenne’. A single combination of three markers TsuAC004, TsuAC010 and TsuAC041, was enough to distinguish all accessions with one exception. A phenogram based on the SSR genotypes did not show any distinct groups, but it suggested that pineapples bred in Japan are genetically diversed. We reconfirmed the parentage of 14 pineapple accessions by comparing the SSR alleles at 17 SSR loci in each accession and its reported parents. The obtained information will contribute substantially to protecting plant breeders’ rights. PMID:23341750
Lin, Y S; Kuan, C S; Weng, I S; Tsai, C C
2015-11-25
The genetic relationships among 27 pineapple [Ananas comosus (L.) Merr.] cultivars and lines were examined using 16 simple sequence repeat (SSR) markers. The number of alleles per locus of the SSR markers ranged from 2 to 6 (average 3.19), for a total of 51 alleles. Similarity coefficients were calculated on the basis of 51 amplified bands. A dendrogram was created according to the 16 SSR markers by the unweighted pair-group method. The banding patterns obtained from the SSR primers allowed most of the cultivars and lines to be distinguished, with the exception of vegetative clones. According to the dendrogram, the 27 pineapple cultivars and lines were clustered into three main clusters and four individual clusters. As expected, the dendrogram showed that derived cultivars and lines are closely related to their parental cultivars; the genetic relationships between pineapple cultivars agree with the genealogy of their breeding history. In addition, the analysis showed that there is no obvious correlation between SSR markers and morphological characters. In conclusion, SSR analysis is an efficient method for pineapple cultivar identification and can offer valuable informative characters to identify pineapple cultivars in Taiwan.
Liu, Yun-Hua; Zhang, Meiping; Wu, Chengcang; Huang, James J; Zhang, Hong-Bin
2014-01-01
Knowledge of how a genome is structured and organized from its constituent elements is crucial to understanding its biology and evolution. Here, we report the genome structuring and organization pattern as revealed by systems analysis of the sequences of three model species, Arabidopsis, rice and yeast, at the whole-genome and chromosome levels. We found that all fundamental function elements (FFE) constituting the genomes, including genes (GEN), DNA transposable elements (DTE), retrotransposable elements (RTE), simple sequence repeats (SSR), and (or) low complexity repeats (LCR), are structured in a nonrandom and correlative manner, thus leading to a hypothesis that the DNA of the species is structured as a linear "jigsaw puzzle". Furthermore, we showed that different FFE differ in their importance in the formation and evolution of the DNA jigsaw puzzle structure between species. DTE and RTE play more important roles than GEN, LCR, and SSR in Arabidopsis, whereas GEN and RTE play more important roles than LCR, SSR, and DTE in rice. The genes having multiple recognized functions play more important roles than those having single functions. These results provide useful knowledge necessary for better understanding genome biology and evolution of the species and for effective molecular breeding of rice.
Transcriptome Analysis and Development of SSR Molecular Markers in Glycyrrhiza uralensis Fisch.
Liu, Yaling; Zhang, Pengfei; Song, Meiling; Hou, Junling; Qing, Mei; Wang, Wenquan; Liu, Chunsheng
2015-01-01
Licorice is an important traditional Chinese medicine with clinical and industrial applications. Genetic resources of licorice are insufficient for analysis of molecular biology and genetic functions; as such, transcriptome sequencing must be conducted for functional characterization and development of molecular markers. In this study, transcriptome sequencing on the Illumina HiSeq 2500 sequencing platform generated a total of 5.41 Gb clean data. De novo assembly yielded a total of 46,641 unigenes. Comparison analysis using BLAST showed that the annotations of 29,614 unigenes were conserved. Further study revealed 773 genes related to biosynthesis of secondary metabolites of licorice, 40 genes involved in biosynthesis of the terpenoid backbone, and 16 genes associated with biosynthesis of glycyrrhizic acid. Analysis of unigenes larger than 1 Kb with a length of 11,702 nt presented 7,032 simple sequence repeats (SSR). Sixty-four of 69 randomly designed and synthesized SSR pairs were successfully amplified, 33 pairs of primers were polymorphism in in Glycyrrhiza uralensis Fisch., Glycyrrhiza inflata Bat., Glycyrrhiza glabra L. and Glycyrrhiza pallidiflora Maxim. This study not only presents the molecular biology data of licorice but also provides a basis for genetic diversity research and molecular marker-assisted breeding of licorice. PMID:26571372
USDA-ARS?s Scientific Manuscript database
Cultivated peanut (Arachis hypogaea L.) is an important food and oil crop grown in more than 100 countries for providing edible oil and protein. A wide variety of pathogens including fungi, bacteria, viruses, and nematodes severely constrain peanut yield and quality. Therefore, it is very imp...
B.M.T. Brunet; D. Doucet; B.R. Sturtevant; F.A.H. Sperling
2013-01-01
After identifying 114 microsatellite loci from Choristoneura fumiferana expressed sequence tags, 87 loci were assayed in a panel of 11 wild-caught individuals, giving 29 polymorphic loci. Further analysis of 20 of these loci on 31 individuals collected from a single population in northern Minnesota identified 14 in Hardy-Weinberg equilibrium.
Report on the development of putative functional SSR and SNP markers in passion fruits.
da Costa, Zirlane Portugal; Munhoz, Carla de Freitas; Vieira, Maria Lucia Carneiro
2017-09-06
Passionflowers Passiflora edulis and Passiflora alata are diploid, outcrossing and understudied fruit bearing species. In Brazil, passion fruit cultivation began relatively recently and has earned the country an outstanding position as the world's top producer of passion fruit. The fruit's main economic value lies in the production of juice, an essential exotic ingredient in juice blends. Currently, crop improvement strategies, including those for underexploited tropical species, tend to incorporate molecular genetic approaches. In this study, we examined a set of P. edulis transcripts expressed in response to infection by Xanthomonas axonopodis, (the passion fruit's main bacterial pathogen that attacks the vines), aiming at the development of putative functional markers, i.e. SSRs (simple sequence repeats) and SNPs (single nucleotide polymorphisms). A total of 210 microsatellites were found in 998 sequences, and trinucleotide repeats were found to be the most frequent (31.4%). Of the sequences selected for designing primers, 80.9% could be used to develop SSR markers, and 60.6% SNP markers for P. alata. SNPs were all biallelic and found within 15 gene fragments of P. alata. Overall, gene fragments generated 10,003 bp. SNP frequency was estimated as one SNP every 294 bp. Polymorphism rates revealed by SSR and SNP loci were 29.4 and 53.6%, respectively. Passiflora edulis transcripts were useful for the development of putative functional markers for P. alata, suggesting a certain level of sequence conservation between these cultivated species. The markers developed herein could be used for genetic mapping purposes and also in diversity studies.
Perumal, Ramasamy; Nimmakayala, Padmavathi; Erattaimuthu, Saradha R; No, Eun-Gyu; Reddy, Umesh K; Prom, Louis K; Odvody, Gary N; Luster, Douglas G; Magill, Clint W
2008-01-01
Background A recent outbreak of sorghum downy mildew in Texas has led to the discovery of both metalaxyl resistance and a new pathotype in the causal organism, Peronosclerospora sorghi. These observations and the difficulty in resolving among phylogenetically related downy mildew pathogens dramatically point out the need for simply scored markers in order to differentiate among isolates and species, and to study the population structure within these obligate oomycetes. Here we present the initial results from the use of a biotin capture method to discover, clone and develop PCR primers that permit the use of simple sequence repeats (microsatellites) to detect differences at the DNA level. Results Among the 55 primers pairs designed from clones from pathotype 3 of P. sorghi, 36 flanked microsatellite loci containing simple repeats, including 28 (55%) with dinucleotide repeats and 6 (11%) with trinucleotide repeats. A total of 22 microsatellites with CA/AC or GT/TG repeats were the most abundant (40%) and GA/AG or CT/TC types contribute 15% in our collection. When used to amplify DNA from 19 isolates from P. sorghi, as well as from 5 related species that cause downy mildew on other hosts, the number of different bands detected for each SSR primer pair using a LI-COR- DNA Analyzer ranged from two to eight. Successful cross-amplification for 12 primer pairs studied in detail using DNA from downy mildews that attack maize (P. maydis & P. philippinensis), sugar cane (P. sacchari), pearl millet (Sclerospora graminicola) and rose (Peronospora sparsa) indicate that the flanking regions are conserved in all these species. A total of 15 SSR amplicons unique to P. philippinensis (one of the potential threats to US maize production) were detected, and these have potential for development of diagnostic tests. A total of 260 alleles were obtained using 54 microsatellites primer combinations, with an average of 4.8 polymorphic markers per SSR across 34 Peronosclerospora, Peronospora and Sclerospora spp isolates studied. Cluster analysis by UPGMA as well as principal coordinate analysis (PCA) grouped the 34 isolates into three distinct groups (all 19 isolates of Peronosclerospora sorghi in cluster I, five isolates of P. maydis and three isolates of P. sacchari in cluster II and five isolates of Sclerospora graminicola in cluster III). Conclusion To our knowledge, this is the first attempt to extensively develop SSR markers from Peronosclerospora genomic DNA. The newly developed SSR markers can be readily used to distinguish isolates within several species of the oomycetes that cause downy mildew diseases. Also, microsatellite fragments likely include retrotransposon regions of DNA and these sequences can serve as useful genetic markers for strain identification, due to their degree of variability and their widespread occurrence among sorghum, maize, sugarcane, pearl millet and rose downy mildew isolates. PMID:19040756
Nagano, Soichiro; Shirasawa, Kenta; Hirakawa, Hideki; Maeda, Fumi; Ishikawa, Masami; Isobe, Sachiko N
2017-05-12
The strawberry, Fragaria × ananassa, is an allo-octoploid (2n = 8x = 56) and outcrossing species. Although it is the most widely consumed berry crop in the world, its complex genome structure has hindered its genetic and genomic analysis, and thus discrimination of subgenome-specific loci among the homoeologous chromosomes is needed. In the present study, we identified candidate subgenome-specific single nucleotide polymorphism (SNP) and simple sequence repeat (SSR) loci, and constructed a linkage map using an S 1 mapping population of the cultivar 'Reikou' with an IStraw90 Axiom® SNP array and previously published SSR markers. The 'Reikou' linkage map consisted of 11,574 loci (11,002 SNPs and 572 SSR loci) spanning 2816.5 cM of 31 linkage groups. The 11,574 loci were located on 4738 unique positions (bin) on the linkage map. Of the mapped loci, 8999 (8588 SNPs and 411 SSR loci) showed a 1:2:1 segregation ratio of AA:AB:BB allele, which suggested the possibility of deriving loci from candidate subgenome-specific sequences. In addition, 2575 loci (2414 SNPs and 161 SSR loci) showed a 3:1 segregation of AB:BB allele, indicating they were derived from homoeologous genomic sequences. Comparative analysis of the homoeologous linkage groups revealed differences in genome structure among the subgenomes. Our results suggest that candidate subgenome-specific loci are randomly located across the genomes, and that there are small- to large-scale structural variations among the subgenomes. The mapped SNPs and SSR loci on the linkage map are expected to be seed points for the construction of pseudomolecules in the octoploid strawberry.
Genetic Diversity in Lens Species Revealed by EST and Genomic Simple Sequence Repeat Analysis
Dikshit, Harsh Kumar; Singh, Akanksha; Singh, Dharmendra; Aski, Muraleedhar Sidaram; Prakash, Prapti; Jain, Neelu; Meena, Suresh; Kumar, Shiv; Sarker, Ashutosh
2015-01-01
Low productivity of pilosae type lentils grown in South Asia is attributed to narrow genetic base of the released cultivars which results in susceptibility to biotic and abiotic stresses. For enhancement of productivity and production, broadening of genetic base is essentially required. The genetic base of released cultivars can be broadened by using diverse types including bold seeded and early maturing lentils from Mediterranean region and related wild species. Genetic diversity in eighty six accessions of three species of genus Lens was assessed based on twelve genomic and thirty one EST-SSR markers. The evaluated set of genotypes included diverse lentil varieties and advanced breeding lines from Indian programme, two early maturing ICARDA lines and five related wild subspecies/species endemic to the Mediterranean region. Genomic SSRs exhibited higher polymorphism in comparison to EST SSRs. GLLC 598 produced 5 alleles with highest gene diversity value of 0.80. Among the studied subspecies/species 43 SSRs detected maximum number of alleles in L. orientalis. Based on Nei’s genetic distance cultivated lentil L. culinaris subsp. culinaris was found to be close to its wild progenitor L. culinaris subsp. orientalis. The Prichard’s structure of 86 genotypes distinguished different subspecies/species. Higher variability was recorded among individuals within population than among populations. PMID:26381889
SSRscanner: a program for reporting distribution and exact location of simple sequence repeats.
Anwar, Tamanna; Khan, Asad U
2006-02-20
Simple sequence repeats (SSRs) have become important molecular markers for a broad range of applications, such as genome mapping and characterization, phenotype mapping, marker assisted selection of crop plants and a range of molecular ecology and diversity studies. These repeated DNA sequences are found in both prokaryotes and eukaryotes. They are distributed almost at random throughout the genome, ranging from mononucleotide to trinucleotide repeats. They are also found at longer lengths (> 6 repeating units) of tracts. Most of the computer programs that find SSRs do not report its exact position. A computer program SSRscanner was written to find out distribution, frequency and exact location of each SSR in the genome. SSRscanner is user friendly. It can search repeats of any length and produce outputs with their exact position on chromosome and their frequency of occurrence in the sequence. This program has been written in PERL and is freely available for non-commercial users by request from the authors. Please contact the authors by E-mail: huzzi99@hotmail.com.
Srivastava, Deepika; Shanker, Asheesh
2016-12-01
Basal angiosperms or Magnoliids is an important clade of commercially important plants which mainly include spices and edible fruits. In this study, 17 chloroplast genome sequences belonging to clade Magnoliids were screened for the identification of chloroplast simple sequence repeats (cpSSRs). Simple sequence repeats or microsatellites are short stretches of DNA up to 1-6 base pair in length. These repeats are ubiquitous and play important role in the development of molecular markers and to study the mapping of traits of economic, medical or ecological interest. A total of 479 SSRs were detected, showing average density of 1 SSR/6.91 kb. Depending on the repeat units, the length of SSRs ranged from 12 to 24 bp for mono-, 12 to 18 bp for di-, 12 to 26 bp for tri-, 12 to 24 bp for tetra-, 15 bp for penta- and 18 bp for hexanucleotide repeats. Mononucleotide repeats were the most frequent (207, 43.21 %) followed by tetranucleotide repeats (130, 27.13 %). Penta- and hexanucleotide repeats were least frequent or absent in these chloroplast genomes.
[SSR loci information analysis in transcriptome of Andrographis paniculata].
Li, Jun-Ren; Chen, Xiu-Zhen; Tang, Xiao-Ting; He, Rui; Zhan, Ruo-Ting
2018-06-01
To study the SSR loci information and develop molecular markers, a total of 43 683 Unigenes in transcriptome of Andrographis paniculata were used to explore SSR. The distribution frequency of SSR and the basic characteristics of repeat motifs were analyzed using MicroSAtellite software, SSR primers were designed by Primer 3.0 software and then validated by PCR. Moreover, the gene function analysis of SSR Unigene was obtained by Blast. The results showed that 14 135 SSR loci were found in the transcriptome of A. paniculata, which distributed in 9 973 Unigenes with a distribution frequency of 32.36%. Di-nucleotide and Tri-nucleotide repeat were the main types, accounted for 75.54% of all SSRs. The repeat motifs of AT/AT and CCG/CGG were the predominant repeat types of Di-nucleotide and Tri-nucleotide, respectively. A total of 4 740 pairs of SSR primers with the potential to produce polymorphism were designed for maker development. Ten pairs of primers in 20 pairs of randomly picked primers produced fragments with expected molecular size. The gene function of Unigenes containing SSR were mostly related to the basic metabolism function of A. paniculata. The SSR markers in transcriptome of A. paniculata show rich type, strong specificity and high potential of polymorphism, which will benefit the candidate gene mining and marker-assisted breeding. Copyright© by the Chinese Pharmaceutical Association.
An annotated genetic map of loblolly pine based on microsatellite and cDNA markers
Craig S. Echt; Surya Saha; Konstantin V. Krutovsky; Kokulapalan Wimalanathan; John E. Erpelding; Chun Liang; C Dana Nelson
2011-01-01
Previous loblolly pine (Pinus taeda L.) genetic linkage maps have been based on a variety of DNA polymorphisms, such as AFLPs, RAPDs, RFLPs, and ESTPs, but only a few SSRs (simple sequence repeats), also known as simple tandem repeats or microsatellites, have been mapped in P. taeda. The objective of this study was to integrate a large set of SSR markers from a variety...
Linkage mapping in a watermelon population segregating for fusarium wilt resistance
Leigh K. Hawkins; Fenny Dane; Thomas L. Kubisiak; Billy B. Rhodes; Robert L. Jarret
2001-01-01
Isozyme, randomly amplified polymorphic DNA (RAPD), and simple sequence repeats (SSR) markers were used to generate a linkage map in an F2 and F3 watermelon (Citrullus lanatus (Thumb.) Matsum. & Nakai) population derived from a cross between the fusarium wilt (Fusarium oxysporum f....
Characterization of 10 new nuclear microsatellite markers in Acca sellowiana (Myrtaceae)1
Klabunde, Gustavo H. F.; Olkoski, Denise; Vilperte, Vinicius; Zucchi, Maria I.; Nodari, Rubens O.
2014-01-01
• Premise of the study: Microsatellite primers were identified and characterized in Acca sellowiana in order to expand the limited number of pre-existing polymorphic markers for use in population genetic studies for conservation, phylogeography, breeding, and domestication. • Methods and Results: A total of 10 polymorphic microsatellite primers were designed from clones obtained from a simple sequence repeat (SSR)–enriched genomic library. The primers amplified di- and trinucleotide repeats with four to 27 alleles per locus. In all tested populations, the observed heterozygosity ranged from 0.269 to 1.0. • Conclusions: These new polymorphic SSR markers will allow future genetic studies to be denser, either for genetic structure characterization of natural populations or for studies involving genetic breeding and domestication process in A. sellowiana. PMID:25202632
Barth, Susanne; Jankowska, Marta Jolanta; Hodkinson, Trevor Roland; Vellani, Tia; Klaas, Manfred
2016-03-22
Forty three microsatellite markers were developed for further genetic characterisation of a forage and biomass grass crop, for which genomic resources are currently scarce. The microsatellite markers were developed from a normalized EST-SSR library. All of the 43 markers gave a clear banding pattern on 3% Metaphor agarose gels. Eight selected SSR markers were tested in detail for polymorphism across eleven DNA samples of large geographic distribution across Europe. The new set of 43 SSR markers will help future research to characterise the genetic structure and diversity of Phalaris arundinacea, with a potential to further understand its invasive character in North American wetlands, as well as aid in breeding work for desired biomass and forage traits. P. arundinacea is particularly valued in the northern latitude as a crop with high biomass potential, even more so on marginal lands.
Li, Xiaobai; Jin, Feng; Jin, Liang; Jackson, Aaron; Huang, Cheng; Li, Kehu; Shu, Xiaoli
2014-12-05
Cymbidium is a genus of 68 species in the orchid family, with extremely high ornamental value. Marker-assisted selection has proven to be an effective strategy in accelerating plant breeding for many plant species. Analysis of cymbidiums genetic background by molecular markers can be of great value in assisting parental selection and breeding strategy design, however, in plants such as cymbidiums limited genomic resources exist. In order to obtain efficient markers, we deep sequenced the C. ensifolium transcriptome to identify simple sequence repeats derived from gene regions (genic-SSR). The 7,936 genic-SSR markers were identified. A total of 80 genic-SSRs were selected, and primers were designed according to their flanking sequences. Of the 80 genic-SSR primer sets, 62 were amplified in C. ensifolium successfully, and 55 showed polymorphism when cross-tested among 9 Cymbidium species comprising 59 accessions. Unigenes containing the 62 genic-SSRs were searched against Non-redundant (Nr), Gene Ontology database (GO), eukaryotic orthologous groups (KOGs) and Kyoto Encyclopedia of Genes and Genomes (KEGG) database. The search resulted in 53 matching Nr sequences, of which 39 had GO terms, 18 were assigned to KOGs, and 15 were annotated with KEGG. Genetic diversity and population structure were analyzed based on 55 polymorphic genic-SSR data among 59 accessions. The genetic distance averaged 0.3911, ranging from 0.016 to 0.618. The polymorphic index content (PIC) of 55 polymorphic markers averaged 0.407, ranging from 0.033 to 0.863. A model-based clustering analysis revealed that five genetic groups existed in the collection. Accessions from the same species were typically grouped together; however, C. goeringii accessions did not always form a separate cluster, suggesting that C. goeringii accessions were polyphyletic. The genic-SSR identified in this study constitute a set of markers that can be applied across multiple Cymbidium species and used for the evaluation of genetic relationships as well as qualitative and quantitative trait mapping studies. Genic-SSR's coupled with the functional annotations provided by the unigenes will aid in mapping candidate genes of specific function.
Kim, Jin-Hee; Chung, Il Kyung; Kim, Kyung-Min
2017-01-01
The Sweet potato, Ipomoea batatas (L.) Lam, is difficult to study in genetics and genomics because it is a hexaploid. The sweet potato study not have been performed domestically or internationally. In this study was performed to construct genetic map and quantitative trait loci (QTL) analysis. A total of 245 EST-SSR markers were developed, and the map was constructed by using 210 of those markers. The total map length was 1508.1 cM, and the mean distance between markers was 7.2 cM. Fifteen characteristics were investigated for QTLs analysis. According to those, the Four QTLs were identified, and The LOD score was 3.0. Further studies need to develop molecular markers in terms of EST-SSR markers for doing to be capable of efficient breeding. The genetic map created here using EST-SSR markers will facilitate planned breeding of sweet potato cultivars with various desirable traits.
NASA Astrophysics Data System (ADS)
Kang, Hyun-Sil; Hong, Hyun-Ki; Park, Kyung-Il; Cho, Moonjae; Youn, Seok-Hyun; Choi, Kwang-Sik
2017-03-01
Manila clam Ruditapes philippinarum is one of the most important benthic animals in the coastal north Pacific region, where clam populations have been mixed genetically through trade and aquaculture activities. Accordingly, identification of the genetically different clam populations has become one of the most important issues to manage interbreeding of the local and introduced clam populations. To identify genetically different populations of clam populations, we developed 11 expressed sequence tag (EST)-microsatellite loci (i.e., simple sequence repeat, SSR) from 1,128 clam hemocyte cDNA clones challenged by the protozoan parasite Perkinsus olseni. Genotype analysis using the markers developed in this study demonstrated that clams from a tidal flat on the west coast contained 6 to 19 alleles per locus, and a population from Jeju Island had 4 to 20 alleles per locus. The expected heterozygosity of the 2 clam populations ranged from 0.472 to 0.919 for clams from the west coast, and 0.494 to 0.919 for clams from Jeju Island, respectively. Among the 11 loci discovered in this study, 7 loci significantly deviated from the Hardy-Weinberg equilibrium after Bonferroni correction. The 5 loci developed in this study also successfully amplified the SSRs of R. variegatus, a clam species taxonomically very close to R. philippinarum, from Hong Kong and Jeju Island. We believe that the 11 novel polymorphic SSR developed in this study can be utilized successfully in Manila clam genetic diversity analysis, as well as in genetic discrimination of different clam populations.
USDA-ARS?s Scientific Manuscript database
Genetic diversity of thirty five Psidium guajava accessions maintained at the USDA, National Plants Germplasm System, Hilo, HI, was characterized using 20 simple sequence repeat (SSR) markers. Diversity analysis detected a total of 178 alleles ranging from four to 16. The observed mean heterozygosit...
USDA-ARS?s Scientific Manuscript database
One hundred and forty-seven primer pairs originally designed to amplify microsatellites, also known as simple sequence repeats (SSR), in black walnut (Juglans nigra L.) were screened for utility in persian walnut (J. regia L.). Based on scorability and number of informative polymorphisms, the best 1...
Virulence Phenotypes and Molecular Genotypes of Puccinia triticina Isolates from Italy
USDA-ARS?s Scientific Manuscript database
Twenty-four isolates of Puccinia triticina from Italy were characterized for virulence to seedlings of 22 common wheat cv. Thatcher isolines each with a different leaf rust resistance gene, and for molecular genotypes at 15 simple sequence repeat (SSR) loci. The isolates were compared with a set of ...
USDA-ARS?s Scientific Manuscript database
Avocado (Persea americana Mill.) is an economically important tropical fruit native to Mesoamerica. It belongs to the Lauraceae family and is subdivided in three horticultural races (Guatemalan, Mexican, and West Indian) based primarily on ecological adaptation, botanical and physiological traits. T...
Fusarium head blight resistance loci in a stratified population of wheat landraces and varieties
USDA-ARS?s Scientific Manuscript database
To determine if Chinese and Japanese wheat landraces and varieties have unique sources of Fusarium head blight (FHB) resistance, an association mapping panel of 195 wheat accessions including both commercial varieties and landraces was genotyped with 364 genome-wide simple sequence repeat (SSR) and ...
Usefulness of fire ant genetics in insecticide efficacy trials
USDA-ARS?s Scientific Manuscript database
Mature fire ant colonies contain an average of 80,000 worker ants. For this study, eight fire ant workers were randomly sampled from each colony. DNA fingerprints for each individual ant were generated using 21 simple sequence repeats (SSR) markers that were developed from fire ant DNA by other lab...
Transcript and proteomic analysis of developing white lupin (Lupinus albus L.) roots
Tian, Li; Peel, Gregory J; Lei, Zhentian; Aziz, Naveed; Dai, Xinbin; He, Ji; Watson, Bonnie; Zhao, Patrick X; Sumner, Lloyd W; Dixon, Richard A
2009-01-01
Background White lupin (Lupinus albus L.) roots efficiently take up and accumulate (heavy) metals, adapt to phosphate deficiency by forming cluster roots, and secrete antimicrobial prenylated isoflavones during development. Genomic and proteomic approaches were applied to identify candidate genes and proteins involved in antimicrobial defense and (heavy) metal uptake and translocation. Results A cDNA library was constructed from roots of white lupin seedlings. Eight thousand clones were randomly sequenced and assembled into 2,455 unigenes, which were annotated based on homologous matches in the NCBInr protein database. A reference map of developing white lupin root proteins was established through 2-D gel electrophoresis and peptide mass fingerprinting. High quality peptide mass spectra were obtained for 170 proteins. Microsomal membrane proteins were separated by 1-D gel electrophoresis and identified by LC-MS/MS. A total of 74 proteins were putatively identified by the peptide mass fingerprinting and the LC-MS/MS methods. Genomic and proteomic analyses identified candidate genes and proteins encoding metal binding and/or transport proteins, transcription factors, ABC transporters and phenylpropanoid biosynthetic enzymes. Conclusion The combined EST and protein datasets will facilitate the understanding of white lupin's response to biotic and abiotic stresses and its utility for phytoremediation. The root ESTs provided 82 perfect simple sequence repeat (SSR) markers with potential utility in breeding white lupin for enhanced agronomic traits. PMID:19123941
Construction of a reference genetic linkage map for carnation (Dianthus caryophyllus L.)
2013-01-01
Background Genetic linkage maps are important tools for many genetic applications including mapping of quantitative trait loci (QTLs), identifying DNA markers for fingerprinting, and map-based gene cloning. Carnation (Dianthus caryophyllus L.) is an important ornamental flower worldwide. We previously reported a random amplified polymorphic DNA (RAPD)-based genetic linkage map derived from Dianthus capitatus ssp. andrezejowskianus and a simple sequence repeat (SSR)-based genetic linkage map constructed using data from intraspecific F2 populations; however, the number of markers was insufficient, and so the number of linkage groups (LGs) did not coincide with the number of chromosomes (x = 15). Therefore, we aimed to produce a high-density genetic map to improve its usefulness for breeding purposes and genetic research. Results We improved the SSR-based genetic linkage map using SSR markers derived from a genomic library, expression sequence tags, and RNA-seq data. Linkage analysis revealed that 412 SSR loci (including 234 newly developed SSR loci) could be mapped to 17 linkage groups (LGs) covering 969.6 cM. Comparison of five minor LGs covering less than 50 cM with LGs in our previous RAPD-based genetic map suggested that four LGs could be integrated into two LGs by anchoring common SSR loci. Consequently, the number of LGs corresponded to the number of chromosomes (x = 15). We added 192 new SSRs, eight RAPD, and two sequence-tagged site loci to refine the RAPD-based genetic linkage map, which comprised 15 LGs consisting of 348 loci covering 978.3 cM. The two maps had 125 SSR loci in common, and most of the positions of markers were conserved between them. We identified 635 loci in carnation using the two linkage maps. We also mapped QTLs for two traits (bacterial wilt resistance and anthocyanin pigmentation in the flower) and a phenotypic locus for flower-type by analyzing previously reported genotype and phenotype data. Conclusions The improved genetic linkage maps and SSR markers developed in this study will serve as reference genetic linkage maps for members of the genus Dianthus, including carnation, and will be useful for mapping QTLs associated with various traits, and for improving carnation breeding programs. PMID:24160306
Construction of a reference genetic linkage map for carnation (Dianthus caryophyllus L.).
Yagi, Masafumi; Yamamoto, Toshiya; Isobe, Sachiko; Hirakawa, Hideki; Tabata, Satoshi; Tanase, Koji; Yamaguchi, Hiroyasu; Onozaki, Takashi
2013-10-26
Genetic linkage maps are important tools for many genetic applications including mapping of quantitative trait loci (QTLs), identifying DNA markers for fingerprinting, and map-based gene cloning. Carnation (Dianthus caryophyllus L.) is an important ornamental flower worldwide. We previously reported a random amplified polymorphic DNA (RAPD)-based genetic linkage map derived from Dianthus capitatus ssp. andrezejowskianus and a simple sequence repeat (SSR)-based genetic linkage map constructed using data from intraspecific F2 populations; however, the number of markers was insufficient, and so the number of linkage groups (LGs) did not coincide with the number of chromosomes (x = 15). Therefore, we aimed to produce a high-density genetic map to improve its usefulness for breeding purposes and genetic research. We improved the SSR-based genetic linkage map using SSR markers derived from a genomic library, expression sequence tags, and RNA-seq data. Linkage analysis revealed that 412 SSR loci (including 234 newly developed SSR loci) could be mapped to 17 linkage groups (LGs) covering 969.6 cM. Comparison of five minor LGs covering less than 50 cM with LGs in our previous RAPD-based genetic map suggested that four LGs could be integrated into two LGs by anchoring common SSR loci. Consequently, the number of LGs corresponded to the number of chromosomes (x = 15). We added 192 new SSRs, eight RAPD, and two sequence-tagged site loci to refine the RAPD-based genetic linkage map, which comprised 15 LGs consisting of 348 loci covering 978.3 cM. The two maps had 125 SSR loci in common, and most of the positions of markers were conserved between them. We identified 635 loci in carnation using the two linkage maps. We also mapped QTLs for two traits (bacterial wilt resistance and anthocyanin pigmentation in the flower) and a phenotypic locus for flower-type by analyzing previously reported genotype and phenotype data. The improved genetic linkage maps and SSR markers developed in this study will serve as reference genetic linkage maps for members of the genus Dianthus, including carnation, and will be useful for mapping QTLs associated with various traits, and for improving carnation breeding programs.
Genetic diversity and gene differentiation among ten species of Zingiberaceae from Eastern India.
Mohanty, Sujata; Panda, Manoj Kumar; Acharya, Laxmikanta; Nayak, Sanghamitra
2014-08-01
In the present study, genetic fingerprints of ten species of Zingiberaceae from eastern India were developed using PCR-based markers. 19 RAPD (Rapid Amplified polymorphic DNA), 8 ISSR (Inter Simple Sequence Repeats) and 8 SSR (Simple Sequence Repeats) primers were used to elucidate genetic diversity important for utilization, management and conservation. These primers produced 789 loci, out of which 773 loci were polymorphic (including 220 unique loci) and 16 monomorphic loci. Highest number of bands amplified (263) in Curcuma caesia whereas lowest (209) in Zingiber cassumunar. Though all the markers discriminated the species effectively, analysis of combined data of all markers resulted in better distinction of individual species. Highest number of loci was amplified with SSR primers with resolving power in a range of 17.4-39. Dendrogram based on three molecular data using unweighted pair group method with arithmetic mean classified all the species into two clusters. Mantle matrix correspondence test revealed high matrix correlation in all the cases. Correlation values for RAPD, ISSR and SSR were 0.797, 0.84 and 0.8, respectively, with combined data. In both the genera wild and cultivated species were completely separated from each other at genomic level. It also revealed distinct genetic identity between species of Curcuma and Zingiber. High genetic diversity documented in the present study provides a baseline data for optimization of conservation and breeding programme of the studied zingiberacious species.
Peixoto-Junior, R F; Creste, S; Landell, M G A; Nunes, D S; Sanguino, A; Campos, M F; Vencovsky, R; Tambarussi, E V; Figueira, A
2014-09-26
Brown rust (causal agent Puccinia melanocephala) is an important sugarcane disease that is responsible for large losses in yield worldwide. Despite its importance, little is known regarding the genetic diversity of this pathogen in the main Brazilian sugarcane cultivation areas. In this study, we characterized the genetic diversity of 34 P. melanocephala isolates from 4 Brazilian states using loci identified from an enriched simple sequence repeat (SSR) library. The aggressiveness of 3 isolates from major sugarcane cultivation areas was evaluated by inoculating an intermediately resistant and a susceptible cultivar. From the enriched library, 16 SSR-specific primers were developed, which produced scorable alleles. Of these, 4 loci were polymorphic and 12 were monomorphic for all isolates evaluated. The molecular characterization of the 34 isolates of P. melanocephala conducted using 16 SSR loci revealed the existence of low genetic variability among the isolates. The average estimated genetic distance was 0.12. Phenetic analysis based on Nei's genetic distance clustered the isolates into 2 major groups. Groups I and II included 18 and 14 isolates, respectively, and both groups contained isolates from all 4 geographic regions studied. Two isolates did not cluster with these groups. It was not possible to obtain clusters according to location or state of origin. Analysis of disease severity data revealed that the isolates did not show significant differences in aggressiveness between regions.
USDA-ARS?s Scientific Manuscript database
Wild Thinopyrum grasses serve as important gene pools for forage and cereal crops. Knowledge of their chromosome organizations is pivotal for efficient utilization of this important gene pool in germplasm enhancement programs. EST-SSR markers for Th. bessarabicum, Th. elongatum and Th. junceum chr...
Satya, Pratik; Paswan, Pramod Kumar; Ghosh, Swagata; Majumdar, Snehalata; Ali, Nasim
2016-06-01
Cross-species transferability is a quick and economic method to enrich SSR database, particularly for minor crops where little genomic information is available. However, transferability of SSR markers varies greatly between species, genera and families of plant species. We assessed confamiliar transferability of SSR markers from cotton (Gossypium hirsutum) and jute (Corchorus olitorius) to 22 species distributed in different taxonomic groups of Malvaceae. All the species selected were potential industrial crop species having little or no genomic resources or SSR database. Of the 14 cotton SSR loci tested, 13 (92.86 %) amplified in G. arboreum and 71.43 % exhibited cross-genera transferability. Nine out of 11 jute SSRs (81.81 %) showed cross-transferability across genera. SSRs from both the species exhibited high polymorphism and resolving power in other species. The correlation between transferability of cotton and jute SSRs were highly significant (r = 0.813). The difference in transferability among species was also significant for both the marker groups. High transferability was observed at genus, tribe and subfamily level. At tribe level, transferability of jute SSRs (41.04 %) was higher than that of cotton SSRs (33.74 %). The tribe Byttnerieae exhibited highest SSR transferability (48.7 %). The high level of cross-genera transferability (>50 %) in ten species of Malvaceae, where no SSR resource is available, calls for large scale transferability testing from the enriched SSR databases of cotton and jute.
USDA-ARS?s Scientific Manuscript database
Expressed sequence tag (EST) simple sequence repeats (SSRs) in Prunus were mined, and flanking primers designed and used for genome-wide characterization and selection of primers to optimize marker distribution and reliability. A total of 12,618 contigs were assembled from 84,727 ESTs, along with 34...
In silico comparative analysis of SSR markers in plants
2011-01-01
Background The adverse environmental conditions impose extreme limitation to growth and plant development, restricting the genetic potential and reflecting on plant yield losses. The progress obtained by classic plant breeding methods aiming at increasing abiotic stress tolerances have not been enough to cope with increasing food demands. New target genes need to be identified to reach this goal, which requires extensive studies of the related biological mechanisms. Comparative analyses in ancestral plant groups can help to elucidate yet unclear biological processes. Results In this study, we surveyed the occurrence patterns of expressed sequence tag-derived microsatellite markers for model plants. A total of 13,133 SSR markers were discovered using the SSRLocator software in non-redundant EST databases made for all eleven species chosen for this study. The dimer motifs are more frequent in lower plant species, such as green algae and mosses, and the trimer motifs are more frequent for the majority of higher plant groups, such as monocots and dicots. With this in silico study we confirm several microsatellite plant survey results made with available bioinformatics tools. Conclusions The comparative studies of EST-SSR markers among all plant lineages is well suited for plant evolution studies as well as for future studies of transferability of molecular markers. PMID:21247422
2012-01-01
Background Apple is an economically important fruit crop worldwide. Developing a genetic linkage map is a critical step towards mapping and cloning of genes responsible for important horticultural traits in apple. To facilitate linkage map construction, we surveyed and characterized the distribution and frequency of perfect microsatellites in assembled contig sequences of the apple genome. Results A total of 28,538 SSRs have been identified in the apple genome, with an overall density of 40.8 SSRs per Mb. Di-nucleotide repeats are the most frequent microsatellites in the apple genome, accounting for 71.9% of all microsatellites. AT/TA repeats are the most frequent in genomic regions, accounting for 38.3% of all the G-SSRs, while AG/GA dimers prevail in transcribed sequences, and account for 59.4% of all EST-SSRs. A total set of 310 SSRs is selected to amplify eight apple genotypes. Of these, 245 (79.0%) are found to be polymorphic among cultivars and wild species tested. AG/GA motifs in genomic regions have detected more alleles and higher PIC values than AT/TA or AC/CA motifs. Moreover, AG/GA repeats are more variable than any other dimers in apple, and should be preferentially selected for studies, such as genetic diversity and linkage map construction. A total of 54 newly developed apple SSRs have been genetically mapped. Interestingly, clustering of markers with distorted segregation is observed on linkage groups 1, 2, 10, 15, and 16. A QTL responsible for malic acid content of apple fruits is detected on linkage group 8, and accounts for ~13.5% of the observed phenotypic variation. Conclusions This study demonstrates that di-nucleotide repeats are prevalent in the apple genome and that AT/TA and AG/GA repeats are the most frequent in genomic and transcribed sequences of apple, respectively. All SSR motifs identified in this study as well as those newly mapped SSRs will serve as valuable resources for pursuing apple genetic studies, aiding the apple breeding community in marker-assisted breeding, and for performing comparative genomic studies in Rosaceae. PMID:23039990
2011-01-01
Background Alfalfa, [Medicago sativa (L.) sativa], a widely-grown perennial forage has potential for development as a cellulosic ethanol feedstock. However, the genomics of alfalfa, a non-model species, is still in its infancy. The recent advent of RNA-Seq, a massively parallel sequencing method for transcriptome analysis, provides an opportunity to expand the identification of alfalfa genes and polymorphisms, and conduct in-depth transcript profiling. Results Cell walls in stems of alfalfa genotype 708 have higher cellulose and lower lignin concentrations compared to cell walls in stems of genotype 773. Using the Illumina GA-II platform, a total of 198,861,304 expression sequence tags (ESTs, 76 bp in length) were generated from cDNA libraries derived from elongating stem (ES) and post-elongation stem (PES) internodes of 708 and 773. In addition, 341,984 ESTs were generated from ES and PES internodes of genotype 773 using the GS FLX Titanium platform. The first alfalfa (Medicago sativa) gene index (MSGI 1.0) was assembled using the Sanger ESTs available from GenBank, the GS FLX Titanium EST sequences, and the de novo assembled Illumina sequences. MSGI 1.0 contains 124,025 unique sequences including 22,729 tentative consensus sequences (TCs), 22,315 singletons and 78,981 pseudo-singletons. We identified a total of 1,294 simple sequence repeats (SSR) among the sequences in MSGI 1.0. In addition, a total of 10,826 single nucleotide polymorphisms (SNPs) were predicted between the two genotypes. Out of 55 SNPs randomly selected for experimental validation, 47 (85%) were polymorphic between the two genotypes. We also identified numerous allelic variations within each genotype. Digital gene expression analysis identified numerous candidate genes that may play a role in stem development as well as candidate genes that may contribute to the differences in cell wall composition in stems of the two genotypes. Conclusions Our results demonstrate that RNA-Seq can be successfully used for gene identification, polymorphism detection and transcript profiling in alfalfa, a non-model, allogamous, autotetraploid species. The alfalfa gene index assembled in this study, and the SNPs, SSRs and candidate genes identified can be used to improve alfalfa as a forage crop and cellulosic feedstock. PMID:21504589
Microsatellite markers for the yam bean Pachyrhizus (Fabaceae).
Delêtre, Marc; Soengas, Beatriz; Utge, José; Lambourdière, Josie; Sørensen, Marten
2013-07-01
Microsatellite loci were developed for the understudied root crop yam bean (Pachyrhizus spp.) to investigate intraspecific diversity and interspecific relationships within the genus Pachyrhizus. • Seventeen nuclear simple sequence repeat (SSR) markers with perfect di- and trinucleotide repeats were developed from 454 pyrosequencing of SSR-enriched genomic libraries. Loci were characterized in P. ahipa and wild and cultivated populations of four closely related species. All loci successfully cross-amplified and showed high levels of polymorphism, with number of alleles ranging from three to 12 and expected heterozygosity ranging from 0.095 to 0.831 across the genus. • By enabling rapid assessment of genetic diversity in three native neotropical crops, P. ahipa, P. erosus, and P. tuberosus, and two wild relatives, P. ferrugineus and P. panamensis, these markers will allow exploration of the genetic diversity and evolutionary history of the genus Pachyrhizus.
Salgado-Salazar, Catalina; Rivera, Yazmín; Veltri, Daniel; ...
2015-11-10
Premise of the study: Simple sequence repeat (SSR) markers were developed for Plasmopara obducens, the causal agent of the newly emergent downy mildew disease of Impatiens walleriana. Methods and Results: A 202-Mb draft genome assembly was generated from P. obducens using Illumina technology and mined to identify 13,483 SSR motifs. Primers were synthesized for 62 marker candidates, of which 37 generated reliable PCR products. Testing of the 37 markers using 96 P. obducens samples showed 96% of the markers were polymorphic, with 2-6 alleles observed. Observed and expected heterozygosity ranged from 0.000-0.892 and 0.023-0.746, respectively. Just 17 markers were sufficientmore » to identify all multilocus genotypes. Conclusions: These are the first SSR markers available for this pathogen, and one of the first molecular resources. These markers will be useful in assessing variation in pathogen populations and determining the factors contributing to the emergence of destructive impatiens downy mildew disease.« less
Duan, Dong; Jia, Yun; Yang, Jie; Li, Zhong-Hu
2017-01-01
The sex determination in gymnosperms is still poorly characterized due to the lack of genomic/transcriptome resources and useful molecular genetic markers. To enhance our understanding of the molecular mechanisms of the determination of sexual recognition of reproductive structures in conifers, the transcriptome of male and female conelets were characterized in a Chinese endemic conifer species, Pinus bungeana Zucc. ex Endl. The 39.62 Gb high-throughput sequencing reads were obtained from two kinds of sexual conelets. After de novo assembly of the obtained reads, 85,305 unigenes were identified, 53,944 (63.23%) of which were annotated with public databases. A total of 12,073 differentially expressed genes were detected between the two types of sexes in P. bungeana, and 5766 (47.76%) of them were up-regulated in females. The Kyoto Encyclopedia of Genes and Genomes (KEGG) enriched analysis suggested that some of the genes were significantly associated with the sex determination process of P. bungeana, such as those involved in tryptophan metabolism, zeatin biosynthesis, and cysteine and methionine metabolism, and the phenylpropanoid biosynthesis pathways. Meanwhile, some important plant hormone pathways (e.g., the gibberellin (GA) pathway, carotenoid biosynthesis, and brassinosteroid biosynthesis (BR) pathway) that affected sexual determination were also induced in P. bungeana. In addition, 8791 expressed sequence tag-simple sequence repeats (EST-SSRs) from 7859 unigenes were detected in P. bungeana. The most abundant repeat types were dinucleotides (1926), followed by trinucleotides (1711). The dominant classes of the sequence repeat were A/T (4942) in mononucleotides and AT/AT (1283) in dinucleotides. Among these EST-SSRs, 84 pairs of primers were randomly selected for the characterization of potential molecular genetic markers. Finally, 19 polymorphic EST-SSR primers were characterized. We found low to moderate levels of genetic diversity (NA = 1.754; HO = 0.206; HE = 0.205) across natural populations of P. bungeana. The cluster analysis revealed two distinct genetic groups for the six populations that were sampled in this endemic species, which might be caused by the fragmentation of habitats and long-term geographic isolation among different populations. Taken together, this work provides important insights into the molecular mechanisms of sexual identity in the reproductive organs of P. bungeana. The molecular genetic resources that were identified in this study will also facilitate further studies in functional genomics and population genetics in the Pinus species. PMID:29257091
SSRscanner: a program for reporting distribution and exact location of simple sequence repeats
Anwar, Tamanna; Khan, Asad U
2006-01-01
Simple sequence repeats (SSRs) have become important molecular markers for a broad range of applications, such as genome mapping and characterization, phenotype mapping, marker assisted selection of crop plants and a range of molecular ecology and diversity studies. These repeated DNA sequences are found in both prokaryotes and eukaryotes. They are distributed almost at random throughout the genome, ranging from mononucleotide to trinucleotide repeats. They are also found at longer lengths (> 6 repeating units) of tracts. Most of the computer programs that find SSRs do not report its exact position. A computer program SSRscanner was written to find out distribution, frequency and exact location of each SSR in the genome. SSRscanner is user friendly. It can search repeats of any length and produce outputs with their exact position on chromosome and their frequency of occurrence in the sequence. Availability This program has been written in PERL and is freely available for non-commercial users by request from the authors. Please contact the authors by E-mail: huzzi99@hotmail.com PMID:17597863
Clayton, William; Eaton, Carla Jane; Dupont, Pierre-Yves; Gillanders, Tim; Cameron, Nick; Saikia, Sanjay; Scott, Barry
2017-01-01
Epichloë grass endophytes comprise a group of filamentous fungi of both sexual and asexual species. Known for the beneficial characteristics they endow upon their grass hosts, the identification of these endophyte species has been of great interest agronomically and scientifically. The use of simple sequence repeat loci and the variation in repeat elements has been used to rapidly identify endophyte species and strains, however, little is known of how the structure of repeat elements changes between species and strains, and where these repeat elements are located in the fungal genome. We report on an in-depth analysis of the structure and genomic location of the simple sequence repeat locus B10, commonly used for Epichloë endophyte species identification. The B10 repeat was found to be located within an exon of a putative bZIP transcription factor, suggesting possible impacts on polypeptide sequence and thus protein function. Analysis of this repeat in the asexual endophyte hybrid Epichloë uncinata revealed that the structure of B10 alleles reflects the ancestral species that hybridized to give rise to this species. Understanding the structure and sequence of these simple sequence repeats provides a useful set of tools for readily distinguishing strains and for gaining insights into the ancestral species that have undergone hybridization events.
USDA-ARS?s Scientific Manuscript database
Cassava (Manihot esculenta Crantz), a starchy root crop grown in tropical and subtropical climates, is the sixth most important crop in the world after wheat, rice, maize, potato and barley. The repertoire of simple sequence repeat (SSR) markers for cassava is limited and warrants a need for a large...
Li, X Y; Xu, H X; Chen, J W
2014-04-29
Manual cultivar identification diagram is a new strategy for plant cultivar identification based on DNA markers, providing information to efficiently separate cultivars. We tested 25 pairs of apple EST-SSR primers for amplification of PCR products from loquat cultivars. These EST-SSR primers provided clear amplification products from the loquat cultivars, with a relatively high transferability rate of 84% to loquat; 11 pairs of primers amplified polymorphic products. After analysis of 24 red-fleshed loquat accessions, we found that only 7 pairs of primers could clearly separate all of them. A cultivar identification diagram of the 24 cultivars was constructed using polymorphic bands from the DNA fingerprints and EST-SSR primers. Any two of the 24 cultivars could be rapidly separated from each other, according to the polymorphic bands from the cultivars; the corresponding primers were marked in the correct position on the cultivar identification diagram. This red-flesh loquat cultivar identification diagram can separate the 24 red-flesh loquat cultivars, which is of benefit for loquat cultivar identification for germplasm management and breeding programs.
Sahoo, Ambika; Jena, Sudipta; Kar, Basudeba; Sahoo, Suprava; Ray, Asit; Singh, Subhashree; Joshi, Raj Kumar; Acharya, Laxmikanta; Nayak, Sanghamitra
2017-05-01
Turmeric (Curcuma longa L., family Zingiberaceae) is one of the most economically important plants for its use in food, medicine, and cosmetic industries. Cultivar identification is a major constraint in turmeric, owing to high degree of morphological similarity that in turn, affects its commercialization. The present study addresses this constraint, using EST-SSR marker based, molecular identification of 8 elite cultivars and 88 accessions in turmeric. Fifty EST-SSR primers were screened against eight cultivars of turmeric (Suroma, Roma, Lakadong, Megha, Alleppey Supreme, Kedaram, Pratibha, and Suvarna); out of which 11 primers showed polymorphic banding pattern. The polymorphic information content (PIC) of these primers ranged from 0.13 to 0.48. However, only three SSR loci (CSSR 14, CSSR 15, and CSSR 18) gave reproducible unique banding pattern clearly distinguishing the cultivars 'Lakadong' and 'Suvarna' from other cultivars tested. These three unique SSR markers also proved to be effective in identification of 'Lakadong' cultivars when analysed with 88 accessions of turmeric collected from different agro-climatic regions. Furthermore, two identified cultivars (Lakadong and Suvarna) could also be precisely differentiated when analysed and based on phylogenetic tree, with other 94 genotypes of turmeric. The novel SSR markers can be used for identification and authentication of two commercially important turmeric cultivars 'Lakadong' and 'Suvarna'.
Vidal, Newton Medeiros; Grazziotin, Ana Laura; Ramos, Helaine Christine Cancela; Pereira, Messias Gonzaga; Venancio, Thiago Motta
2014-01-01
Carica papaya (papaya) is an economically important tropical fruit. Molecular marker-assisted selection is an inexpensive and reliable tool that has been widely used to improve fruit quality traits and resistance against diseases. In the present study we report the development and validation of an atlas of papaya simple sequence repeat (SSR) markers. We integrated gene predictions and functional annotations to provide a gene-centered perspective for marker-assisted selection studies. Our atlas comprises 160,318 SSRs, from which 21,231 were located in genic regions (i.e. inside exons, exon-intron junctions or introns). A total of 116,453 (72.6%) of all identified repeats were successfully mapped to one of the nine papaya linkage groups. Primer pairs were designed for markers from 9,594 genes (34.5% of the papaya gene complement). Using papaya-tomato orthology assessments, we assembled a list of 300 genes (comprising 785 SSRs) potentially involved in fruit ripening. We validated our atlas by screening 73 SSR markers (including 25 fruit ripening genes), achieving 100% amplification rate and uncovering 26% polymorphism rate between the parental genotypes (Sekati and JS12). The SSR atlas presented here is the first comprehensive gene-centered collection of annotated and genome positioned papaya SSRs. These features combined with thousands of high-quality primer pairs make the atlas an important resource for the papaya research community.
Haider, Nadia
2017-01-01
Investigation of genetic variation and phylogenetic relationships among date palm (Phoenix dactylifera L.) cultivars is useful for their conservation and genetic improvement. Various molecular markers such as restriction fragment length polymorphisms (RFLPs), simple sequence repeat (SSR), representational difference analysis (RDA), and amplified fragment length polymorphism (AFLP) have been developed to molecularly characterize date palm cultivars. PCR-based markers random amplified polymorphic DNA (RAPD) and inter-simple sequence repeat (ISSR) are powerful tools to determine the relatedness of date palm cultivars that are difficult to distinguish morphologically. In this chapter, the principles, materials, and methods of RAPD and ISSR techniques are presented. Analysis of data generated from these two techniques and the use of these data to reveal phylogenetic relationships among date palm cultivars are also discussed.
Microsatellite markers for the yam bean Pachyrhizus (Fabaceae)1
Delêtre, Marc; Soengas, Beatriz; Utge, José; Lambourdière, Josie; Sørensen, Marten
2013-01-01
• Premise of the study: Microsatellite loci were developed for the understudied root crop yam bean (Pachyrhizus spp.) to investigate intraspecific diversity and interspecific relationships within the genus Pachyrhizus. • Methods and Results: Seventeen nuclear simple sequence repeat (SSR) markers with perfect di- and trinucleotide repeats were developed from 454 pyrosequencing of SSR-enriched genomic libraries. Loci were characterized in P. ahipa and wild and cultivated populations of four closely related species. All loci successfully cross-amplified and showed high levels of polymorphism, with number of alleles ranging from three to 12 and expected heterozygosity ranging from 0.095 to 0.831 across the genus. • Conclusions: By enabling rapid assessment of genetic diversity in three native neotropical crops, P. ahipa, P. erosus, and P. tuberosus, and two wild relatives, P. ferrugineus and P. panamensis, these markers will allow exploration of the genetic diversity and evolutionary history of the genus Pachyrhizus. PMID:25202568
Yan, Haidong; Zhang, Yu; Zeng, Bing; Yin, Guohua; Zhang, Xinquan; Ji, Yang; Huang, Linkai; Jiang, Xiaomei; Liu, Xinchun; Peng, Yan; Ma, Xiao; Yan, Yanhong
2016-01-08
Orchardgrass (Dactylis glomerata L.), is a well-known perennial forage species; however, rust diseases have caused a noticeable reduction in the quality and production of orchardgrass. In this study, genetic diversity was assessed and the marker-trait associations for rust were examined using 18 EST-SSR and 21 SCoT markers in 75 orchardgrass accessions. A high level of genetic diversity was detected in orchardgrass with an average genetic diversity index of 0.369. For the EST-SSR and SCoT markers, 164 and 289 total bands were obtained, of which 148 (90.24%) and 272 (94.12%) were polymorphic, respectively. Results from an AMOVA analysis showed that more genetic variance existed within populations (87.57%) than among populations (12.43%). Using a parameter marker index, the efficiencies of the EST-SSR and SCoT markers were compared to show that SCoTs have higher marker efficiency (8.07) than EST-SSRs (4.82). The results of a UPGMA cluster analysis and a STRUCTURE analysis were both correlated with the geographic distribution of the orchardgrass accessions. Linkage disequilibrium analysis revealed an average r² of 0.1627 across all band pairs, indicating a high extent of linkage disequilibrium in the material. An association analysis between the rust trait and 410 bands from the EST-SSR and SCoT markers using TASSEL software revealed 20 band panels were associated with the rust trait in both 2011 and 2012. The 20 bands obtained from association analysis could be used in breeding programs for lineage selection to prevent great losses of orchardgrass caused by rust, and provide valuable information for further association mapping using this collection of orchardgrass.
Chen, Honglin; Wang, Lixia; Liu, Xiaoyan; Hu, Liangliang; Wang, Suhua; Cheng, Xuzhen
2017-07-11
Cowpea [Vigna unguiculata (L.) Walp.] is one of the most important legumes in tropical and semi-arid regions. However, there is relatively little genomic information available for genetic research on and breeding of cowpea. The objectives of this study were to analyse the cowpea transcriptome and develop genic molecular markers for future genetic studies of this genus. Approximately 54 million high-quality cDNA sequence reads were obtained from cowpea based on Illumina paired-end sequencing technology and were de novo assembled to generate 47,899 unigenes with an N50 length of 1534 bp. Sequence similarity analysis revealed 36,289 unigenes (75.8%) with significant similarity to known proteins in the non-redundant (Nr) protein database, 23,471 unigenes (49.0%) with BLAST hits in the Swiss-Prot database, and 20,654 unigenes (43.1%) with high similarity in the Kyoto Encyclopedia of Genes and Genomes (KEGG) database. Further analysis identified 5560 simple sequence repeats (SSRs) as potential genic molecular markers. Validating a random set of 500 SSR markers yielded 54 polymorphic markers among 32 cowpea accessions. This transcriptomic analysis of cowpea provided a valuable set of genomic data for characterizing genes with important agronomic traits in Vigna unguiculata and a new set of genic SSR markers for further genetic studies and breeding in cowpea and related Vigna species.
USDA-ARS?s Scientific Manuscript database
Breeding for disease resistance in peanut cultivars has been constrained due to both a narrow genetic base and a low degree of polymorphism. Earlier attempts have resulted in the development of a few hundreds of simple sequence repeat (SSR) markers in peanut that could define broad QTL on the physic...
A New SNP Haplotype associated with blue disease resistance gene in cotton (Gossypium hirsutum L.)
USDA-ARS?s Scientific Manuscript database
Resistance to cotton blue disease (CBD) was evaluated in 364 F2.3 families of 3 populations derived from resistant variety ‘Delta Opal’. The CBD resistance in ‘Delta Opal’ was controlled by one single dominant gene designated Cbd. Two simple sequence repeat (SSR) markers were identified as linked t...
USDA-ARS?s Scientific Manuscript database
Cowpea (Vigna unguiculata) is an important legume crop with diverse uses. The species is presently a minor crop, and evaluation of its genetic diversity has been very limited. In this study, a total of 200 genic and 100 genomic simple sequence repeat (SSR) markers were developed from cowpea unigene ...
NASA Astrophysics Data System (ADS)
Sun, Xiujun; Li, Dongming; Liu, Zhihong; Zhou, Liqing; Wu, Biao; Yang, Aiguo
2017-10-01
The pen shell ( Atrina pectinata) is a large wedge-shaped bivalve, which belongs to family Pinnidae. Due to its large and nutritious adductor muscle, it is the popular seafood with high commercial value in Asia-Pacific countries. However, limiting genomic and transcriptomic data have hampered its genetic investigations. In this study, the transcriptome of A. pectinata was deeply sequenced using Illumina pair-end sequencing technology. After assembling, a total of 127263 unigenes were obtained. Functional annotation indicated that the highest percentage of unigenes (18.60%) was annotated on GO database, followed by 18.44% on PFAM database and 17.04% on NR database. There were 270 biological pathways matched with those in KEGG database. Furthermore, a total of 23452 potential simple sequence repeats (SSRs) were identified, of them the most abundant type was mono-nucleotide repeats (12902, 55.01%), which was followed by di-nucleotide (8132, 34.68%), tri-nucleotide (2010, 8.57%), tetra-nucleotide (401, 1.71%), and penta-nucleotide (7, 0.03%) repeats. Sixty SSRs were selected for validating and developing genic SSR markers, of them 23 showed polymorphism in a cultured population with the average observed and expected heterozygosities of 0.412 and 0.579, respectively. In this study, we established the first comprehensive transcript dataset of A. pectinata genes. Our results demonstrated that RNA-Seq is a fast and cost-effective method for genic SSR development in non-model species.
Phylogenetic relationships of chrysanthemums in Korea based on novel SSR markers.
Khaing, A A; Moe, K T; Hong, W J; Park, C S; Yeon, K H; Park, H S; Kim, D C; Choi, B J; Jung, J Y; Chae, S C; Lee, K M; Park, Y J
2013-11-07
Chrysanthemums are well known for their esthetic and medicinal values. Characterization of chrysanthemums is vital for their conservation and management as well as for understanding their genetic relationships. We found 12 simple sequence repeat markers (SSRs) of 100 designed primers to be polymorphic. These novel SSR markers were used to evaluate 95 accessions of chrysanthemums (3 indigenous and 92 cultivated accessions). Two hundred alleles were identified, with an average of 16.7 alleles per locus. KNUCRY-77 gave the highest polymorphic information content value (0.879), while KNUCRY-10 gave the lowest (0.218). Similar patterns of grouping were observed with a distance-based dendrogram developed using PowerMarker and model-based clustering with Structure. Three clusters with some admixtures were identified by model-based clustering. These newly developed SSR markers will be useful for further studies of chrysanthemums, such as taxonomy and marker-assisted selection breeding.
Ma, Ji; Yang, Bingxian; Zhu, Wei; Sun, Lianli; Tian, Jingkui; Wang, Xumin
2013-10-10
Mahonia bealei (Berberidaceae) is a frequently-used traditional Chinese medicinal plant with efficient anti-inflammatory ability. This plant is one of the sources of berberine, a new cholesterol-lowering drug with anti-diabetic activity. We have sequenced the complete nucleotide sequence of the chloroplast (cp) genome of M. bealei. The complete cp genome of M. bealei is 164,792 bp in length, and has a typical structure with large (LSC 73,052 bp) and small (SSC 18,591 bp) single-copy regions separated by a pair of inverted repeats (IRs 36,501 bp) of large size. The Mahonia cp genome contains 111 unique genes and 39 genes are duplicated in the IR regions. The gene order and content of M. bealei are almost unarranged which is consistent with the hypothesis that large IRs stabilize cp genome and reduce gene loss-and-gain probabilities during evolutionary process. A large IR expansion of over 12 kb has occurred in M. bealei, 15 genes (rps19, rpl22, rps3, rpl16, rpl14, rps8, infA, rpl36, rps11, petD, petB, psbH, psbN, psbT and psbB) have expanded to have an additional copy in the IRs. The IR expansion rearrangement occurred via a double-strand DNA break and subsequence repair, which is different from the ordinary gene conversion mechanism. Repeat analysis identified 39 direct/inverted repeats 30 bp or longer with a sequence identity ≥ 90%. Analysis also revealed 75 simple sequence repeat (SSR) loci and almost all are composed of A or T, contributing to a distinct bias in base composition. Comparison of protein-coding sequences with ESTs reveals 9 putative RNA edits and 5 of them resulted in non-synonymous modifications in rpoC1, rps2, rps19 and ycf1. Phylogenetic analysis using maximum parsimony (MP) and maximum likelihood (ML) was performed on a dataset composed of 65 protein-coding genes from 25 taxa, which yields an identical tree topology as previous plastid-based trees, and provides strong support for the sister relationship between Ranunculaceae and Berberidaceae. Molecular dating analyses suggest that Ranunculaceae and Berberidaceae diverged between 90 and 84 mya, which is congruent with the fossil records and with recent estimates of the divergence time of these two taxa. © 2013.
Yi, Dong-Keun; Lee, Hae-Lim; Sun, Byung-Yun; Chung, Mi Yoon; Kim, Ki-Joong
2012-05-01
This study reports the complete chloroplast (cp) DNA sequence of Eleutherococcus senticosus (GenBank: JN 637765), an endangered endemic species. The genome is 156,768 bp in length, and contains a pair of inverted repeat (IR) regions of 25,930 bp each, a large single copy (LSC) region of 86,755 bp and a small single copy (SSC) region of 18,153 bp. The structural organization, gene and intron contents, gene order, AT content, codon usage, and transcription units of the E. senticosus chloroplast genome are similar to that of typical land plant cp DNA. We aligned and analyzed the sequences of 86 coding genes, 19 introns and 113 intergenic spacers (IGS) in three different taxonomic hierarchies; Eleutherococcus vs. Panax, Eleutherococcus vs. Daucus, and Eleutherococcus vs. Nicotiana. The distribution of indels, the number of polymorphic sites and nucleotide diversity indicate that positional constraint is more important than functional constraint for the evolution of cp genome sequences in Asterids. For example, the intron sequences in the LSC region exhibited base substitution rates 5-11-times higher than that of the IR regions, while the intron sequences in the SSC region evolved 7-14-times faster than those in the IR region. Furthermore, the Ka/Ks ratio of the gene coding sequences supports a stronger evolutionary constraint in the IR region than in the LSC or SSC regions. Therefore, our data suggest that selective sweeps by base collection mechanisms more frequently eliminate polymorphisms in the IR region than in other regions. Chloroplast genome regions that have high levels of base substitutions also show higher incidences of indels. Thirty-five simple sequence repeat (SSR) loci were identified in the Eleutherococcus chloroplast genome. Of these, 27 are homopolymers, while six are di-polymers and two are tri-polymers. In addition to the SSR loci, we also identified 18 medium size repeat units ranging from 22 to 79 bp, 11 of which are distributed in the IGS or intron regions. These medium size repeats may contribute to developing a cp genome-specific gene introduction vector because the region may use for specific recombination sites.
Cuc, Luu M; Mace, Emma S; Crouch, Jonathan H; Quang, Vu D; Long, Tran D; Varshney, Rajeev K
2008-01-01
Background Cultivated peanut or groundnut (Arachis hypogaea L.) is the fourth most important oilseed crop in the world, grown mainly in tropical, subtropical and warm temperate climates. Due to its origin through a single and recent polyploidization event, followed by successive selection during breeding efforts, cultivated groundnut has a limited genetic background. In such species, microsatellite or simple sequence repeat (SSR) markers are very informative and useful for breeding applications. The low level of polymorphism in cultivated germplasm, however, warrants a need of larger number of polymorphic microsatellite markers for cultivated groundnut. Results A microsatellite-enriched library was constructed from the genotype TMV2. Sequencing of 720 putative SSR-positive clones from a total of 3,072 provided 490 SSRs. 71.2% of these SSRs were perfect type, 13.1% were imperfect and 15.7% were compound. Among these SSRs, the GT/CA repeat motifs were the most common (37.6%) followed by GA/CT repeat motifs (25.9%). The primer pairs could be designed for a total of 170 SSRs and were optimized initially on two genotypes. 104 (61.2%) primer pairs yielded scorable amplicon and 46 (44.2%) primers showed polymorphism among 32 cultivated groundnut genotypes. The polymorphic SSR markers detected 2 to 5 alleles with an average of 2.44 per locus. The polymorphic information content (PIC) value for these markers varied from 0.12 to 0.75 with an average of 0.46. Based on 112 alleles obtained by 46 markers, a phenogram was constructed to understand the relationships among the 32 genotypes. Majority of the genotypes representing subspecies hypogaea were grouped together in one cluster, while the genotypes belonging to subspecies fastigiata were grouped mainly under two clusters. Conclusion Newly developed set of 104 markers extends the repertoire of SSR markers for cultivated groundnut. These markers showed a good level of PIC value in cultivated germplasm and therefore would be very useful for germplasm analysis, linkage mapping, diversity studies and phylogenetic relationships in cultivated groundnut as well as related Arachis species. PMID:18482440
Edwards, J D; Baldo, A M; Mueller, L A
2016-01-01
Ricebase (http://ricebase.org) is an integrative genomic database for rice (Oryza sativa) with an emphasis on combining datasets in a way that maintains the key links between past and current genetic studies. Ricebase includes DNA sequence data, gene annotations, nucleotide variation data and molecular marker fragment size data. Rice research has benefited from early adoption and extensive use of simple sequence repeat (SSR) markers; however, the majority of rice SSR markers were developed prior to the latest rice pseudomolecule assembly. Interpretation of new research using SNPs in the context of literature citing SSRs requires a common coordinate system. A new pipeline, using a stepwise relaxation of stringency, was used to map SSR primers onto the latest rice pseudomolecule assembly. The SSR markers and experimentally assayed amplicon sizes are presented in a relational database with a web-based front end, and are available as a track loaded in a genome browser with links connecting the browser and database. The combined capabilities of Ricebase link genetic markers, genome context, allele states across rice germplasm and potentially user curated phenotypic interpretations as a community resource for genetic discovery and breeding in rice. Published by Oxford University Press 2016. This work is written by US Government employees and is in the public domain in the United States.
USDA-ARS?s Scientific Manuscript database
Simple sequence repeat (SSR) markers from Spodoptera frugiperda (J. E. Smith) were analyzed in crosses of this species between Bacillus thuringiensis (Berliner) (Bacillales: Bacillaceae) (Bt) resistant and susceptible populations to determine a possible association between markers and Bt resistance....
USDA-ARS?s Scientific Manuscript database
Indonesia is the 3rd largest cocoa producing countries in the world, with an annual cacao bean production of 572,000 tons. The currently cultivated cacao varieties in Indonesia were inter-hybrids of various clones introduced from the Americas since the 16th century. Among them, “Java cocoa” is a wel...
Vidal, Newton Medeiros; Grazziotin, Ana Laura; Ramos, Helaine Christine Cancela; Pereira, Messias Gonzaga; Venancio, Thiago Motta
2014-01-01
Carica papaya (papaya) is an economically important tropical fruit. Molecular marker-assisted selection is an inexpensive and reliable tool that has been widely used to improve fruit quality traits and resistance against diseases. In the present study we report the development and validation of an atlas of papaya simple sequence repeat (SSR) markers. We integrated gene predictions and functional annotations to provide a gene-centered perspective for marker-assisted selection studies. Our atlas comprises 160,318 SSRs, from which 21,231 were located in genic regions (i.e. inside exons, exon-intron junctions or introns). A total of 116,453 (72.6%) of all identified repeats were successfully mapped to one of the nine papaya linkage groups. Primer pairs were designed for markers from 9,594 genes (34.5% of the papaya gene complement). Using papaya-tomato orthology assessments, we assembled a list of 300 genes (comprising 785 SSRs) potentially involved in fruit ripening. We validated our atlas by screening 73 SSR markers (including 25 fruit ripening genes), achieving 100% amplification rate and uncovering 26% polymorphism rate between the parental genotypes (Sekati and JS12). The SSR atlas presented here is the first comprehensive gene-centered collection of annotated and genome positioned papaya SSRs. These features combined with thousands of high-quality primer pairs make the atlas an important resource for the papaya research community. PMID:25393538
Identification and characterization of salt responsive miRNA-SSR markers in rice (Oryza sativa).
Mondal, Tapan Kumar; Ganie, Showkat Ahmad
2014-02-10
Salinity is an important abiotic stress that affects agricultural production and productivity. It is a complex trait that is regulated by different molecular mechanisms. miRNAs are non-coding RNAs which are highly conserved and regulate gene expression. Simple sequence repeats (SSRs) are robust molecular markers for studying genetic diversity. Although several SSR markers are available now, challenge remains to identify the trait-specific SSRs which can be used for marker assisted breeding. In order to understand the genetic diversity of salt responsive-miRNA genes in rice, SSR markers were mined from 130 members of salt-responsive miRNA genes of rice and validated among the contrasting panels of tolerant as well as susceptible rice genotypes, each with 12 genotypes. Although 12 miR-SSRs were found to be polymorphic, only miR172b-SSR was able to differentiate the tolerant and susceptible genotypes in 2 different groups. It had also been found that miRNA genes were more diverse in susceptible genotypes than the tolerant one (as indicated by polymorphic index content) which might interfere to form the stem-loop structure of premature miRNA and their subsequent synthesis in susceptible genotypes. Thus, we concluded that length variations of the repeats in salt responsive miRNA genes may be responsible for a possible sensitivity to salinity adaptation. This is the first report of characterization of trait specific miRNA derived SSRs in plants. Copyright © 2013 Elsevier B.V. All rights reserved.
Galvão, K S C; Ramos, H C C; Santos, P H A D; Entringer, G C; Vettorazzi, J C F; Pereira, M G
2015-07-03
This study aimed to improve grain yield in the full-sib reciprocal recurrent selection program of maize from the North Fluminense State University. In the current phase of the program, the goal is to maintain, or even increase, the genetic variability within and among populations, in order to increase heterosis of the 13th cycle of reciprocal recurrent selection. Microsatellite expressed sequence tags (EST-SSRs) were used as a tool to assist the maximization step of genetic variability, targeting the functional genome. Eighty S1 progenies of the 13th recur-rent selection cycle, 40 from each population (CIMMYT and Piranão), were analyzed using 20 EST-SSR loci. Genetic diversity, observed heterozygosity, information content of polymorphism, and inbreeding co-efficient were estimated. Subsequently, analysis of genetic dissimilarity, molecular variance, and a graphical dispersion of genotypes were conducted. The number of alleles in the CIMMYT population ranged from 1 to 6, while in the Piranão population the range was from 2 to 8, with a mean of 3.65 and 4.35, respectively. As evidenced by the number of alleles, the Shannon index showed greater diversity for the Piranão population (1.04) in relation to the CIMMYT population (0.89). The genic SSR markers were effective in clustering genotypes into their respective populations before selection and an increase in the variation between populations after selection was observed. The results indicate that the study populations have expressive genetic diversity, which cor-responds to the functional genome, indicating that this strategy may contribute to genetic gain, especially in association with the grain yield of future hybrids.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ostrander, E.A.; Sprague, G.F. Jr.; Rine, J.
1993-04-01
A large block of simple sequence repeat (SSR) polymorphisms for the dog genome has been isolated and characterized. Screening of primary libraries by conventional hybridization methods as well as by screening of enriched marker-selected libraries led to the isolation of a large number of genomic clones that contained (CA)[sub n] repeats. The sequences of 101 clones showed that the size and complexity of (CA)[sub n] repeats in the dog genome were similar to those reported for these markers in the human genome. Detailed analysis of a representative subset of these markers revealed that most markers were moderately to highly polymorphic,more » with PIC values exceeding 0.70 for 33% of the markers tested. An association between higher PIC values and markers containing longer (CA)[sub n] repeats was observed in these studies, as previously noted for similar markers in the human genome. A list of primer sequences that tag each characterized marker is provided, and a comprehensive system of nomenclature for the dog genome is suggested. 28 refs., 4 figs., 2 tabs.« less
Nayak, Spurthi N.; Varghese, Nicy; Shah, Trushar M.; Penmetsa, R. Varma; Thirunavukkarasu, Nepolean; Gudipati, Srivani; Gaur, Pooran M.; Kulwal, Pawan L.; Upadhyaya, Hari D.; KaviKishor, Polavarapu B.; Winter, Peter; Kahl, Günter; Town, Christopher D.; Kilian, Andrzej; Cook, Douglas R.; Varshney, Rajeev K.
2011-01-01
Chickpea (Cicer arietinum L.) is the third most important cool season food legume, cultivated in arid and semi-arid regions of the world. The goal of this study was to develop novel molecular markers such as microsatellite or simple sequence repeat (SSR) markers from bacterial artificial chromosome (BAC)-end sequences (BESs) and diversity arrays technology (DArT) markers, and to construct a high-density genetic map based on recombinant inbred line (RIL) population ICC 4958 (C. arietinum)×PI 489777 (C. reticulatum). A BAC-library comprising 55,680 clones was constructed and 46,270 BESs were generated. Mining of these BESs provided 6,845 SSRs, and primer pairs were designed for 1,344 SSRs. In parallel, DArT arrays with ca. 15,000 clones were developed, and 5,397 clones were found polymorphic among 94 genotypes tested. Screening of newly developed BES-SSR markers and DArT arrays on the parental genotypes of the RIL mapping population showed polymorphism with 253 BES-SSR markers and 675 DArT markers. Segregation data obtained for these polymorphic markers and 494 markers data compiled from published reports or collaborators were used for constructing the genetic map. As a result, a comprehensive genetic map comprising 1,291 markers on eight linkage groups (LGs) spanning a total of 845.56 cM distance was developed (http://cmap.icrisat.ac.in/cmap/sm/cp/thudi/). The number of markers per linkage group ranged from 68 (LG 8) to 218 (LG 3) with an average inter-marker distance of 0.65 cM. While the developed resource of molecular markers will be useful for genetic diversity, genetic mapping and molecular breeding applications, the comprehensive genetic map with integrated BES-SSR markers will facilitate its anchoring to the physical map (under construction) to accelerate map-based cloning of genes in chickpea and comparative genome evolution studies in legumes. PMID:22102885
Wu, Haibin; He, Xiaoli; Gong, Hao; Luo, Shaobo; Li, Mingzhu; Chen, Junqiu; Zhang, Changyuan; Yu, Ting; Huang, Wangping; Luo, Jianning
2016-01-01
The hybrids between Luffa acutangula (L.) Roxb. and L.cylindrica (L.) Roem. have strong heterosis effects. However, some reproductive isolation traits hindered their normal hybridization and fructification, which was mainly caused by the flowering time and hybrid pollen sterility. In order to study the genetic basis of two interspecific reproductive isolation traits, we constructed a genetic linkage map using an F2 population derived from a cross between S1174 [L. acutangula (L.) Roxb.] and 93075 [L. cylindrica (L.) Roem.]. The map spans 1436.12 CentiMorgans (cM), with an average of 8.11 cM among markers, and consists of 177 EST-SSR markers distributed in 14 linkage groups (LG) with an average of 102.58 cM per LG. Meanwhile, we conducted colinearity analysis between the sequences of EST-SSR markers and the genomic sequences of cucumber, melon and watermelon. On the basis of genetic linkage map, we conducted QTL mapping of two reproductive isolation traits in sponge gourd, which were the flowering time and hybrid male sterility. Two putative QTLs associated with flowering time (FT) were both detected on LG 1. The accumulated contribution of these two QTLs explained 38.07% of the total phenotypic variance (PV), and each QTL explained 15.36 and 22.71% of the PV respectively. Four QTLs for pollen fertility (PF) were identified on LG 1 (qPF1.1 and qPF1.2), LG 3 (qPF3) and LG 7 (qPF7), respectively. The percentage of PF explained by these QTLs varied from 2.91 to 16.79%, and all together the four QTLs accounted for 39.98% of the total PV. Our newly developed EST-SSR markers and linkage map are very useful for gene mapping, comparative genomics and molecular marker-assisted breeding. These QTLs for interspecific reproductive isolation will also contribute to the cloning of genes relating to interspecific reproductive isolation and the utilization of interspecific heterosis in sponge gourd in further studies.
Wu, Haibin; He, Xiaoli; Gong, Hao; Luo, Shaobo; Li, Mingzhu; Chen, Junqiu; Zhang, Changyuan; Yu, Ting; Huang, Wangping; Luo, Jianning
2016-01-01
The hybrids between Luffa acutangula (L.) Roxb. and L.cylindrica (L.) Roem. have strong heterosis effects. However, some reproductive isolation traits hindered their normal hybridization and fructification, which was mainly caused by the flowering time and hybrid pollen sterility. In order to study the genetic basis of two interspecific reproductive isolation traits, we constructed a genetic linkage map using an F2 population derived from a cross between S1174 [L. acutangula (L.) Roxb.] and 93075 [L. cylindrica (L.) Roem.]. The map spans 1436.12 CentiMorgans (cM), with an average of 8.11 cM among markers, and consists of 177 EST-SSR markers distributed in 14 linkage groups (LG) with an average of 102.58 cM per LG. Meanwhile, we conducted colinearity analysis between the sequences of EST-SSR markers and the genomic sequences of cucumber, melon and watermelon. On the basis of genetic linkage map, we conducted QTL mapping of two reproductive isolation traits in sponge gourd, which were the flowering time and hybrid male sterility. Two putative QTLs associated with flowering time (FT) were both detected on LG 1. The accumulated contribution of these two QTLs explained 38.07% of the total phenotypic variance (PV), and each QTL explained 15.36 and 22.71% of the PV respectively. Four QTLs for pollen fertility (PF) were identified on LG 1 (qPF1.1 and qPF1.2), LG 3 (qPF3) and LG 7 (qPF7), respectively. The percentage of PF explained by these QTLs varied from 2.91 to 16.79%, and all together the four QTLs accounted for 39.98% of the total PV. Our newly developed EST-SSR markers and linkage map are very useful for gene mapping, comparative genomics and molecular marker-assisted breeding. These QTLs for interspecific reproductive isolation will also contribute to the cloning of genes relating to interspecific reproductive isolation and the utilization of interspecific heterosis in sponge gourd in further studies. PMID:27458467
Kang, Se Won; Patnaik, Bharat Bhusan; Hwang, Hee-Ju; Park, So Young; Chung, Jong Min; Song, Dae Kwon; Patnaik, Hongray Howrelia; Lee, Jae Bong; Kim, Changmu; Kim, Soonok; Park, Hong Seog; Park, Seung-Hwan; Park, Young-Su; Han, Yeon Soo; Lee, Jun Sang; Lee, Yong Seok
2017-03-01
Satsuma myomphala is critically endangered through loss of natural habitats, predation by natural enemies, and indiscriminate collection. It is a protected species in Korea but lacks genomic resources for an understanding of varied functional processes attributable to evolutionary success under natural habitats. For assessing the genetic information of S. myomphala, we performed for the first time, de novo transcriptome sequencing and functional annotation of expressed sequences using Illumina Next-Generation Sequencing (NGS) platform and bioinformatics analysis. We identified 103,774 unigenes of which 37,959, 12,890, and 17,699 were annotated in the PANM (Protostome DB), Unigene, and COG (Clusters of Orthologous Groups) databases, respectively. In addition, 14,451 unigenes were predicted under Gene Ontology functional categories, with 4581 assigned to a single category. Furthermore, 3369 sequences with 646 having Enzyme Commission (EC) numbers were mapped to 122 pathways in the Kyoto Encyclopedia of Genes and Genomes Pathway database. The prominent protein domains included the Zinc finger (C2H2-like), Reverse Transcriptase, Thioredoxin-like fold, and RNA recognition motif domain. Many unigenes with homology to immunity, defense, and reproduction-related genes were screened in the transcriptome. We also detected 3120 putative simple sequence repeats (SSRs) encompassing dinucleotide to hexanucleotide repeat motifs from >1kb unigene sequences. A list of PCR primers of SSR loci have been identified to study the genetic polymorphisms. The transcriptome data represents a valuable resource for further investigations on the species genome structure and biology. The unigenes information and microsatellites would provide an indispensable tool for conservation of the species in natural and adaptive environments. Copyright © 2016 The Authors. Published by Elsevier Inc. All rights reserved.
Molecular characterization and genetic diversity of Jatropha curcas L. in Costa Rica
Vásquez-Mayorga, Marcela; Fuchs, Eric J.; Hernández, Eduardo J.; Herrera, Franklin; Hernández, Jesús; Moreira, Ileana; Arnáez, Elizabeth
2017-01-01
We estimated the genetic diversity of 50 Jatropha curcas samples from the Costa Rican germplasm bank using 18 EST-SSR, one G-SSR and nrDNA-ITS markers. We also evaluated the phylogenetic relationships among samples using nuclear ribosomal ITS markers. Non-toxicity was evaluated using G-SSRs and SCARs markers. A Neighbor-Joining (NJ) tree and a Maximum Likelihood (ML) tree were constructed using SSR markers and ITS sequences, respectively. Heterozygosity was moderate (He = 0.346), but considerable compared to worldwide values for J. curcas. The PIC (PIC = 0.274) and inbreeding coefficient (f = − 0.102) were both low. Clustering was not related to the geographical origin of accessions. International accessions clustered independently of collection sites, suggesting a lack of genetic structure, probably due to the wide distribution of this crop and ample gene flow. Molecular markers identified only one non-toxic accession (JCCR-24) from Mexico. This work is part of a countrywide effort to characterize the genetic diversity of the Jatropha curcas germplasm bank in Costa Rica. PMID:28289556
2011-01-01
Background Bituminaria bituminosa is a perennial legume species from the Canary Islands and Mediterranean region that has potential as a drought-tolerant pasture species and as a source of pharmaceutical compounds. Three botanical varieties have previously been identified in this species: albomarginata, bituminosa and crassiuscula. B. bituminosa can be considered a genomic 'orphan' species with very few genomic resources available. New DNA sequencing technologies provide an opportunity to develop high quality molecular markers for such orphan species. Results 432,306 mRNA molecules were sampled from a leaf transcriptome of a single B. bituminosa plant using Roche 454 pyrosequencing, resulting in an average read length of 345 bp (149.1 Mbp in total). Sequences were assembled into 3,838 isotigs/contigs representing putatively unique gene transcripts. Gene ontology descriptors were identified for 3,419 sequences. Raw sequence reads containing simple sequence repeat (SSR) motifs were identified, and 240 primer pairs flanking these motifs were designed. Of 87 primer pairs developed this way, 75 (86.2%) successfully amplified primarily single fragments by PCR. Fragment analysis using 20 primer pairs in 79 accessions of B. bituminosa detected 130 alleles at 21 SSR loci. Genetic diversity analyses confirmed that variation at these SSR loci accurately reflected known taxonomic relationships in original collections of B. bituminosa and provided additional evidence that a division of the botanical variety bituminosa into two according to geographical origin (Mediterranean region and Canary Islands) may be appropriate. Evidence of cross-pollination was also found between botanical varieties within a B. bituminosa breeding programme. Conclusions B. bituminosa can no longer be considered a genomic orphan species, having now a large (albeit incomplete) repertoire of expressed gene sequences that can serve as a resource for future genetic studies. This experimental approach was effective in developing codominant and polymorphic SSR markers for application in diverse genetic studies. These markers have already given new insight into genetic variation in B. bituminosa, providing evidence that a division of the botanical variety bituminosa may be appropriate. This approach is commended to those seeking to develop useful markers for genomic orphan species. PMID:22171578
Ziya Motalebipour, Elmira; Kafkas, Salih; Khodaeiaminjan, Mortaza; Çoban, Nergiz; Gözel, Hatice
2016-12-07
Pistachio (Pistacia vera L.) is one of the most important nut crops in the world. There are about 11 wild species in the genus Pistacia, and they have importance as rootstock seed sources for cultivated P. vera and forest trees. Published information on the pistachio genome is limited. Therefore, a genome survey is necessary to obtain knowledge on the genome structure of pistachio by next generation sequencing. Simple sequence repeat (SSR) markers are useful tools for germplasm characterization, genetic diversity analysis, and genetic linkage mapping, and may help to elucidate genetic relationships among pistachio cultivars and species. To explore the genome structure of pistachio, a genome survey was performed using the Illumina platform at approximately 40× coverage depth in the P. vera cv. Siirt. The K-mer analysis indicated that pistachio has a genome that is about 600 Mb in size and is highly heterozygous. The assembly of 26.77 Gb Illumina data produced 27,069 scaffolds at N50 = 3.4 kb with a total of 513.5 Mb. A total of 59,280 SSR motifs were detected with a frequency of 8.67 kb. A total of 206 SSRs were used to characterize 24 P. vera cultivars and 20 wild Pistacia genotypes (four genotypes from each five wild Pistacia species) belonging to P. atlantica, P. integerrima, P. chinenesis, P. terebinthus, and P. lentiscus genotypes. Overall 135 SSR loci amplified in all 44 cultivars and genotypes, 41 were polymorphic in six Pistacia species. The novel SSR loci developed from cultivated pistachio were highly transferable to wild Pistacia species. The results from a genome survey of pistachio suggest that the genome size of pistachio is about 600 Mb with a high heterozygosity rate. This information will help to design whole genome sequencing strategies for pistachio. The newly developed novel polymorphic SSRs in this study may help germplasm characterization, genetic diversity, and genetic linkage mapping studies in the genus Pistacia.
Ipek, M; Ipek, A; Seker, M; Gul, M K
2015-03-27
The purpose of this research was to characterize an olive core collection using some agronomic characters and simple sequence repeat (SSR) markers and to determine SSR markers associated with the content of fatty acids in olive oil. SSR marker analysis demonstrated the presence of a high amount of genetic variation between the olive cultivars analyzed. A UPGMA dendrogram demonstrated that olive cultivars did not cluster on the basis of their geographic origin. Fatty acid components of olive oil in these cultivars were determined. The results also showed that there was a great amount of variation between the olive cultivars in terms of fatty acid composition. For example, oleic acid content ranged from 57.76 to 76.9% with standard deviation of 5.10%. Significant correlations between fatty acids of olive oil were observed. For instance, a very high negative correlation (-0.812) between oleic and linoleic acids was detected. A structured association analysis between the content of fatty acids in olive oil and SSR markers was performed. STRUCTURE analysis assigned olive cultivars to two gene pools (K = 2). Assignment of olive cultivars to these gene pools was not based on geographical origin. Association between fatty acid traits and SSR markers was evaluated using the general linear model of TASSEL. Significant associations were determined between five SSR markers and stearic, oleic, linoleic, and linolenic acids of olive oil. Very high associations (P < 0.001) between ssrOeUA-DCA14 and stearic acid and between GAPU71B and oleic acid indicated that these markers could be used for marker-assisted selection in olive.
Single Amino Acid Repeats in the Proteome World: Structural, Functional, and Evolutionary Insights
Kumar, Amitha Sampath; Sowpati, Divya Tej; Mishra, Rakesh K.
2016-01-01
Microsatellites or simple sequence repeats (SSR) are abundant, highly diverse stretches of short DNA repeats present in all genomes. Tandem mono/tri/hexanucleotide repeats in the coding regions contribute to single amino acids repeats (SAARs) in the proteome. While SSRs in the coding region always result in amino acid repeats, a majority of SAARs arise due to a combination of various codons representing the same amino acid and not as a consequence of SSR events. Certain amino acids are abundant in repeat regions indicating a positive selection pressure behind the accumulation of SAARs. By analysing 22 proteomes including the human proteome, we explored the functional and structural relationship of amino acid repeats in an evolutionary context. Only ~15% of repeats are present in any known functional domain, while ~74% of repeats are present in the disordered regions, suggesting that SAARs add to the functionality of proteins by providing flexibility, stability and act as linker elements between domains. Comparison of SAAR containing proteins across species reveals that while shorter repeats are conserved among orthologs, proteins with longer repeats, >15 amino acids, are unique to the respective organism. Lysine repeats are well conserved among orthologs with respect to their length and number of occurrences in a protein. Other amino acids such as glutamic acid, proline, serine and alanine repeats are generally conserved among the orthologs with varying repeat lengths. These findings suggest that SAARs have accumulated in the proteome under positive selection pressure and that they provide flexibility for optimal folding of functional/structural domains of proteins. The insights gained from our observations can help in effective designing and engineering of proteins with novel features. PMID:27893794
Singh, Amit Kumar; Kumar, Sundeep; Srinivasan, Kalyani; Tyagi, R. K.; Singh, N. K.; Singh, Rakesh
2013-01-01
Simple sequence repeat (SSR) and Single Nucleotide Polymorphic (SNP), the two most robust markers for identifying rice varieties were compared for assessment of genetic diversity and population structure. Total 375 varieties of rice from various regions of India archived at the Indian National GeneBank, NBPGR, New Delhi, were analyzed using thirty six genetic markers, each of hypervariable SSR (HvSSR) and SNP which were distributed across 12 rice chromosomes. A total of 80 alleles were amplified with the SSR markers with an average of 2.22 alleles per locus whereas, 72 alleles were amplified with SNP markers. Polymorphic information content (PIC) values for HvSSR ranged from 0.04 to 0.5 with an average of 0.25. In the case of SNP markers, PIC values ranged from 0.03 to 0.37 with an average of 0.23. Genetic relatedness among the varieties was studied; utilizing an unrooted tree all the genotypes were grouped into three major clusters with both SSR and SNP markers. Analysis of molecular variance (AMOVA) indicated that maximum diversity was partitioned between and within individual level but not between populations. Principal coordinate analysis (PCoA) with SSR markers showed that genotypes were uniformly distributed across the two axes with 13.33% of cumulative variation whereas, in case of SNP markers varieties were grouped into three broad groups across two axes with 45.20% of cumulative variation. Population structure were tested using K values from 1 to 20, but there was no clear population structure, therefore Ln(PD) derived Δk was plotted against the K to determine the number of populations. In case of SSR maximum Δk was at K=5 whereas, in case of SNP maximum Δk was found at K=15, suggesting that resolution of population was higher with SNP markers, but SSR were more efficient for diversity analysis. PMID:24367635
Abbas, Ghulam; Hameed, Amjad; Rizwan, Muhammad; Ahsan, Muhammad; Asghar, Muhammad J; Iqbal, Nayyer
2015-01-01
Molecular confirmation of interspecific recombinants is essential to overcome the issues like self-pollination, environmental influence, and inadequacy of morphological characteristics during interspecific hybridization. The present study was conducted for genetic confirmation of mungbean (female) and mashbean (male) interspecific crosses using molecular markers. Initially, polymorphic random amplified polymorphic DNA (RAPD), universal rice primers (URP), and simple sequence repeats (SSR) markers differentiating parent genotypes were identified. Recombination in hybrids was confirmed using these polymorphic DNA markers. The NM 2006 × Mash 88 was most successful interspecific cross. Most of true recombinants confirmed by molecular markers were from this cross combination. SSR markers were efficient in detecting genetic variability and recombination with reference to specific chromosomes and particular loci. SSR (RIS) and RAPD identified variability dispersed throughout the genome. In conclusion, DNA based marker assisted selection (MAS) efficiently confirmed the interspecific recombinants. The results provided evidence that MAS can enhance the authenticity of selection in mungbean improvement program.
Simbaqueba, Jaime; Sánchez, Pilar; Sanchez, Erika; Núñez Zarantes, Victor Manuel; Chacon, Maria Isabel; Barrero, Luz Stella; Mariño-Ramírez, Leonardo
2011-01-01
Physalis peruviana, commonly known as Cape gooseberry, is an Andean Solanaceae fruit with high nutritional value and interesting medicinal properties. In the present study we report the development and characterization of microsatellite loci from a P. peruviana commercial Colombian genotype. We identified 932 imperfect and 201 perfect Simple Sequence Repeats (SSR) loci in untranslated regions (UTRs) and 304 imperfect and 83 perfect SSR loci in coding regions from the assembled Physalis peruviana leaf transcriptome. The UTR SSR loci were used for the development of 162 primers for amplification. The efficiency of these primers was tested via PCR in a panel of seven P. peruviana accessions including Colombia, Kenya and Ecuador ecotypes and one closely related species Physalis floridana. We obtained an amplification rate of 83% and a polymorphic rate of 22%. Here we report the first P. peruviana specific microsatellite set, a valuable tool for a wide variety of applications, including functional diversity, conservation and improvement of the species. PMID:22039540
Simbaqueba, Jaime; Sánchez, Pilar; Sanchez, Erika; Núñez Zarantes, Victor Manuel; Chacon, Maria Isabel; Barrero, Luz Stella; Mariño-Ramírez, Leonardo
2011-01-01
Physalis peruviana, commonly known as Cape gooseberry, is an Andean Solanaceae fruit with high nutritional value and interesting medicinal properties. In the present study we report the development and characterization of microsatellite loci from a P. peruviana commercial Colombian genotype. We identified 932 imperfect and 201 perfect Simple Sequence Repeats (SSR) loci in untranslated regions (UTRs) and 304 imperfect and 83 perfect SSR loci in coding regions from the assembled Physalis peruviana leaf transcriptome. The UTR SSR loci were used for the development of 162 primers for amplification. The efficiency of these primers was tested via PCR in a panel of seven P. peruviana accessions including Colombia, Kenya and Ecuador ecotypes and one closely related species Physalis floridana. We obtained an amplification rate of 83% and a polymorphic rate of 22%. Here we report the first P. peruviana specific microsatellite set, a valuable tool for a wide variety of applications, including functional diversity, conservation and improvement of the species.
Solmaz, Ilknur; Kacar, Yildiz Aka; Simsek, Ozhan; Sari, Nebahat
2016-08-01
Snake melon is an important cucurbit crop especially in the Southeastern and the Mediterranean region of Turkey. It is consumed as fresh or pickled. The production is mainly done with the local landraces in the country. Turkey is one of the secondary diversification centers of melon and possesses valuable genetic resources which have different morphological characteristics in case of snake melon. Genetic diversity of snake melon genotypes collected from different regions of Turkey and reference genotypes obtained from World Melon Gene Bank in Avignon-France was examined using 13 simple sequence repeat (SSR) markers. A total of 69 alleles were detected, with an average of 5.31 alleles per locus. The polymorphism information content of SSR markers ranged from 0.19 to 0.57 (average 0.38). Based on cluster analysis, two major groups were defined. The first major group included only one accession (61), while the rest of all accessions grouped in the second major group and separated into different sub-clusters. Based on SSR markers, cluster analysis indicated that considerably high genetic variability exists among the examined accessions; however, Turkish snake melon accessions were grouped together with the reference snake melon accessions.
Sablok, Gaurav; Pérez-Pulido, Antonio J.; Do, Thac; Seong, Tan Y.; Casimiro-Soriguer, Carlos S.; La Porta, Nicola; Ralph, Peter J.; Squartini, Andrea; Muñoz-Merida, Antonio; Harikrishna, Jennifer A.
2016-01-01
Analysis of repetitive DNA sequence content and divergence among the repetitive functional classes is a well-accepted approach for estimation of inter- and intra-generic differences in plant genomes. Among these elements, microsatellites, or Simple Sequence Repeats (SSRs), have been widely demonstrated as powerful genetic markers for species and varieties discrimination. We present PlantFuncSSRs platform having more than 364 plant species with more than 2 million functional SSRs. They are provided with detailed annotations for easy functional browsing of SSRs and with information on primer pairs and associated functional domains. PlantFuncSSRs can be leveraged to identify functional-based genic variability among the species of interest, which might be of particular interest in developing functional markers in plants. This comprehensive on-line portal unifies mining of SSRs from first and next generation sequencing datasets, corresponding primer pairs and associated in-depth functional annotation such as gene ontology annotation, gene interactions and its identification from reference protein databases. PlantFuncSSRs is freely accessible at: http://www.bioinfocabd.upo.es/plantssr. PMID:27446111
2013-01-01
Background Although banana (Musa sp.) is an important edible crop, contributing towards poverty alleviation and food security, limited transcriptome datasets are available for use in accelerated molecular-based breeding in this genus. 454 GS-FLX Titanium technology was employed to determine the sequence of gene transcripts in genotypes of Musa acuminata ssp. burmannicoides Calcutta 4 and M. acuminata subgroup Cavendish cv. Grande Naine, contrasting in resistance to the fungal pathogen Mycosphaerella musicola, causal organism of Sigatoka leaf spot disease. To enrich for transcripts under biotic stress responses, full length-enriched cDNA libraries were prepared from whole plant leaf materials, both uninfected and artificially challenged with pathogen conidiospores. Results The study generated 846,762 high quality sequence reads, with an average length of 334 bp and totalling 283 Mbp. De novo assembly generated 36,384 and 35,269 unigene sequences for M. acuminata Calcutta 4 and Cavendish Grande Naine, respectively. A total of 64.4% of the unigenes were annotated through Basic Local Alignment Search Tool (BLAST) similarity analyses against public databases. Assembled sequences were functionally mapped to Gene Ontology (GO) terms, with unigene functions covering a diverse range of molecular functions, biological processes and cellular components. Genes from a number of defense-related pathways were observed in transcripts from each cDNA library. Over 99% of contig unigenes mapped to exon regions in the reference M. acuminata DH Pahang whole genome sequence. A total of 4068 genic-SSR loci were identified in Calcutta 4 and 4095 in Cavendish Grande Naine. A subset of 95 potential defense-related gene-derived simple sequence repeat (SSR) loci were validated for specific amplification and polymorphism across M. acuminata accessions. Fourteen loci were polymorphic, with alleles per polymorphic locus ranging from 3 to 8 and polymorphism information content ranging from 0.34 to 0.82. Conclusions A large set of unigenes were characterized in this study for both M. acuminata Calcutta 4 and Cavendish Grande Naine, increasing the number of public domain Musa ESTs. This transcriptome is an invaluable resource for furthering our understanding of biological processes elicited during biotic stresses in Musa. Gene-based markers will facilitate molecular breeding strategies, forming the basis of genetic linkage mapping and analysis of quantitative trait loci. PMID:23379821
Ott, Alina; Trautschold, Brian; Sandhu, Devinder
2011-01-01
Soybean is a major crop that is an important source of oil and proteins. A number of genetic linkage maps have been developed in soybean. Specifically, hundreds of simple sequence repeat (SSR) markers have been developed and mapped. Recent sequencing of the soybean genome resulted in the generation of vast amounts of genetic information. The objectives of this investigation were to use SSR markers in developing a connection between genetic and physical maps and to determine the physical distribution of recombination on soybean chromosomes. A total of 2,188 SSRs were used for sequence-based physical localization on soybean chromosomes. Linkage information was used from different maps to create an integrated genetic map. Comparison of the integrated genetic linkage maps and sequence based physical maps revealed that the distal 25% of each chromosome was the most marker-dense, containing an average of 47.4% of the SSR markers and 50.2% of the genes. The proximal 25% of each chromosome contained only 7.4% of the markers and 6.7% of the genes. At the whole genome level, the marker density and gene density showed a high correlation (R(2)) of 0.64 and 0.83, respectively with the physical distance from the centromere. Recombination followed a similar pattern with comparisons indicating that recombination is high in telomeric regions, though the correlation between crossover frequency and distance from the centromeres is low (R(2) = 0.21). Most of the centromeric regions were low in recombination. The crossover frequency for the entire soybean genome was 7.2%, with extremes much higher and lower than average. The number of recombination hotspots varied from 1 to 12 per chromosome. A high correlation of 0.83 between the distribution of SSR markers and genes suggested close association of SSRs with genes. The knowledge of distribution of recombination on chromosomes may be applied in characterizing and targeting genes.
The transcriptome of Spodoptera exigua larvae exposed to different types of microbes.
Pascual, Laura; Jakubowska, Agata K; Blanca, Jose M; Cañizares, Joaquin; Ferré, Juan; Gloeckner, Gernot; Vogel, Heiko; Herrero, Salvador
2012-08-01
We have obtained and characterized the transcriptome of Spodoptera exigua larvae with special emphasis on pathogen-induced genes. In order to obtain a highly representative transcriptome, we have pooled RNA from diverse insect colonies, conditions and tissues. Sequenced cDNA included samples from 3 geographically different colonies. Enrichment of RNA from pathogen-related genes was accomplished by exposing larvae to different pathogenic and non-pathogenic microbial agents such as the bacteria Bacillus thuringiensis, Micrococcus luteus, and Escherichia coli, the yeast Saccharomyces cerevisiae, and the S. exigua nucleopolyhedrovirus (SeMNPV). In addition, to avoid the loss of tissue-specific genes we included cDNA from the midgut, fat body, hemocytes and integument derived from pathogen exposed insects. RNA obtained from the different types of samples was pooled, normalized and sequenced. Analysis of the sequences obtained using the Roche 454 FLX and Sanger methods has allowed the generation of the largest public set of ESTs from S. exigua, including a large group of immune genes, and the identification of an important number of SSR (simple sequence repeats) and SNVs (single nucleotide variants: SNPs and INDELs) with potential use as genetic markers. Moreover, data mining has allowed the discovery of novel RNA viruses with potential influence in the insect population dynamics and the larval interactions with the microbial pesticides that are currently in use for the biological control of this pest. Copyright © 2012 Elsevier Ltd. All rights reserved.
Choi, Hong-Il; Kim, Nam Hoon; Kim, Jun Ha; Choi, Beom Soon; Ahn, In-Ok; Lee, Joon-Soo; Yang, Tae-Jin
2011-01-01
Little is known about the genetics or genomics of Panax ginseng. In this study, we developed 70 expressed sequence tag-derived polymorphic simple sequence repeat markers by trials of 140 primer pairs. All of the 70 markers showed reproducible polymorphism among four Panax speciesand 19 of them were polymorphic in six P. ginseng cultivars. These markers segregated 1:2:1 manner of Mendelian inheritance in an F2 population of a cross between two P. ginseng cultivars, ‘Yunpoong’ and ‘Chunpoong’, indicating that these are reproducible and inheritable mappable markers. A phylogenetic analysis using the genotype data showed three distinctive groups: a P. ginseng-P. japonicus clade, P. notoginseng and P. quinquefolius, with similarity coefficients of 0.70. P. japonicus was intermingled with P. ginseng cultivars, indicating that both species have similar genetic backgrounds. P. ginseng cultivars were subdivided into three minor groups: an independent cultivar ‘Chunpoong’, a subgroup with three accessions including two cultivars, ‘Gumpoong’ and ‘Yunpoong’ and one landrace ‘Hwangsook’ and another subgroup with two accessions including one cultivar, ‘Gopoong’ and one landrace ‘Jakyung’. Each primer pair produced 1 to 4 bands, indicating that the ginseng genome has a highly replicated paleopolyploid genome structure. PMID:23717085
Fan, R; Ling, P; Hao, C Y; Li, F P; Huang, L F; Wu, B D; Wu, H S
2015-10-19
Black pepper is a perennial climbing vine. It is widely cultivated because its berries can be utilized not only as a spice in food but also for medicinal use. This study aimed to construct a standardized, high-quality cDNA library to facilitated identification of new Piper hainanense transcripts. For this, 262 unigenes were used to generate raw reads. The average length of these 262 unigenes was 774.8 bp. Of these, 94 genes (35.9%) were newly identified, according to the NCBI protein database. Thus, identification of new genes may broaden the molecular knowledge of P. hainanense on the basis of Clusters of Orthologous Groups and Gene Ontology categories. In addition, certain basic genes linked to physiological processes, which can contribute to disease resistance and thereby to the breeding of black pepper. A total of 26 unigenes were found to be SSR markers. Dinucleotide SSR was the main repeat motif, accounting for 61.54%, followed by trinucleotide SSR (23.07%). Eight primer pairs successfully amplified DNA fragments and detected significant amounts of polymorphism among twenty-one piper germplasm. These results present a novel sequence information of P. hainanense, which can serve as the foundation for further genetic research on this species.
Lee, Hwa-Yong; Moon, Suyun; Shim, Donghwan; Hong, Chang Pyo; Lee, Yi; Koo, Chang-Duck; Chung, Jong-Wook; Ryu, Hojin
2017-01-01
The shiitake mushroom (Lentinula edodes) is one of the most popular edible mushrooms in the world and has attracted attention for its value in medicinal and pharmacological uses. With recent advanced research and techniques, the agricultural cultivation of the shiitake mushroom has been greatly increased, especially in East Asia. Additionally, demand for the development of new cultivars with good agricultural traits has been greatly enhanced, but the development processes are complicated and more challenging than for other edible mushrooms. In this study, we developed 44 novel polymorphic simple sequence repeat (SSR) markers for the determination of shiitake mushroom cultivars based on a whole genome sequencing database of L. edodes. These markers were found to be polymorphic and reliable when screened in 23 shiitake mushroom cultivars. For the 44 SSR markers developed in this study, the major allele frequency ranged from 0.13 to 0.94; the number of genotypes and number of alleles were each 2–11; the observed and expected heterozygosity were 0.00–1.00 and 0.10–0.90, respectively; and the polymorphic information content value ranged from 0.10 to 0.89. These new markers can be used for molecular breeding, the determination of cultivars, and other applications. PMID:28338645
Chak Han Im; Young-Hoon Park; Kenneth E. Hammel; Bokyung Park; Soon Wook Kwon; Hojin Ryu; Jae-San Ryu
2016-01-01
Breeding new strains with improved traits is a long-standing goal of mushroom breeders that can be expedited by marker-assisted selection (MAS). We constructed a genetic linkage map of Pleurotus eryngii based on segregation analysis of markers in postmeiotic monokaryons from KNR2312. In total, 256 loci comprising 226 simple sequence-repeat (SSR) markers, 2 mating-type...
Microsatellite markers: what they mean and why they are so useful
Vieira, Maria Lucia Carneiro; Santini, Luciane; Diniz, Augusto Lima; Munhoz, Carla de Freitas
2016-01-01
Abstract Microsatellites or Single Sequence Repeats (SSRs) are extensively employed in plant genetics studies, using both low and high throughput genotyping approaches. Motivated by the importance of these sequences over the last decades this review aims to address some theoretical aspects of SSRs, including definition, characterization and biological function. The methodologies for the development of SSR loci, genotyping and their applications as molecular markers are also reviewed. Finally, two data surveys are presented. The first was conducted using the main database of Web of Science, prospecting for articles published over the period from 2010 to 2015, resulting in approximately 930 records. The second survey was focused on papers that aimed at SSR marker development, published in the American Journal of Botany's Primer Notes and Protocols in Plant Sciences (over 2013 up to 2015), resulting in a total of 87 publications. This scenario confirms the current relevance of SSRs and indicates their continuous utilization in plant science. PMID:27561112
MSDB: A Comprehensive Database of Simple Sequence Repeats
Avvaru, Akshay Kumar; Saxena, Saketh; Mishra, Rakesh Kumar
2017-01-01
Abstract Microsatellites, also known as Simple Sequence Repeats (SSRs), are short tandem repeats of 1–6 nt motifs present in all genomes, particularly eukaryotes. Besides their usefulness as genome markers, SSRs have been shown to perform important regulatory functions, and variations in their length at coding regions are linked to several disorders in humans. Microsatellites show a taxon-specific enrichment in eukaryotic genomes, and some may be functional. MSDB (Microsatellite Database) is a collection of >650 million SSRs from 6,893 species including Bacteria, Archaea, Fungi, Plants, and Animals. This database is by far the most exhaustive resource to access and analyze SSR data of multiple species. In addition to exploring data in a customizable tabular format, users can view and compare the data of multiple species simultaneously using our interactive plotting system. MSDB is developed using the Django framework and MySQL. It is freely available at http://tdb.ccmb.res.in/msdb. PMID:28854643
2013-01-01
Background Adoption of genomics based breeding has emerged as a promising approach for achieving comprehensive crop improvement. Such an approach is more relevant in the case of perennial species like mulberry. However, unavailability of genomic resources of co-dominant marker systems has been the major constraint for adopting molecular breeding to achieve genetic enhancement of Mulberry. The goal of this study was to develop and characterize a large number of locus specific genic and genomic SSR markers which can be effectively used for molecular characterization of mulberry species/genotypes. Result We analyzed a total of 3485 DNA sequences including genomic and expressed sequences (ESTs) of mulberry (Morus alba L.) genome. We identified 358 sequences to develop appropriate microsatellite primer pairs representing 222 genomic and 136 EST regions. Primers amplifying locus specific regions of Dudia white (a genotype of Morus alba L), were identified and 137 genomic and 51 genic SSR markers were standardized. A two pronged strategy was adopted to assess the applicability of these SSR markers using mulberry species and genotypes along with a few closely related species belonging to the family Moraceae viz., Ficus, Fig and Jackfruit. While 100% of these markers amplified specific loci on the mulberry genome, 79% were transferable to other related species indicating the robustness of these markers and the potential they hold in analyzing the molecular and genetic diversity among mulberry germplasm as well as other related species. The inherent ability of these markers in detecting heterozygosity combined with a high average polymorphic information content (PIC) of 0.559 ranging between 0.076 and 0.943 clearly demonstrates their potential as genomic resources in diversity analysis. The dissimilarity coefficient determined based on Neighbor joining method, revealed that the markers were successful in segregating the mulberry species, genotypes and other related species into distinct clusters. Conclusion We report a total of 188 genomic and genic SSR markers in Morus alba L. A large proportion of these markers (164) were polymorphic both among mulberry species and genotypes. A substantial number of these markers (149) were also transferable to other related species like Ficus, Fig and Jackfruit. The extent of polymorphism revealed and the ability to detect heterozygosity among the cross pollinated mulberry species and genotypes render these markers an invaluable genomic resource that can be utilized in assessing molecular diversity as well as in QTL mapping and subsequently mulberry crop improvement through MAS. PMID:24289047
Penmetsa, R. V.; Dutta, S.; Kulwal, P. L.; Saxena, R. K.; Datta, S.; Sharma, T. R.; Rosen, B.; Carrasquilla-Garcia, N.; Farmer, A. D.; Dubey, A.; Saxena, K. B.; Gao, J.; Fakrudin, B.; Singh, M. N.; Singh, B. P.; Wanjari, K. B.; Yuan, M.; Srivastava, R. K.; Kilian, A.; Upadhyaya, H. D.; Mallikarjuna, N.; Town, C. D.; Bruening, G. E.; He, G.; May, G. D.; McCombie, R.; Jackson, S. A.; Singh, N. K.; Cook, D. R.
2009-01-01
Pigeonpea (Cajanus cajan), an important food legume crop in the semi-arid regions of the world and the second most important pulse crop in India, has an average crop productivity of 780 kg/ha. The relatively low crop yields may be attributed to non-availability of improved cultivars, poor crop husbandry and exposure to a number of biotic and abiotic stresses in pigeonpea growing regions. Narrow genetic diversity in cultivated germplasm has further hampered the effective utilization of conventional breeding as well as development and utilization of genomic tools, resulting in pigeonpea being often referred to as an ‘orphan crop legume’. To enable genomics-assisted breeding in this crop, the pigeonpea genomics initiative (PGI) was initiated in late 2006 with funding from Indian Council of Agricultural Research under the umbrella of Indo-US agricultural knowledge initiative, which was further expanded with financial support from the US National Science Foundation’s Plant Genome Research Program and the Generation Challenge Program. As a result of the PGI, the last 3 years have witnessed significant progress in development of both genetic as well as genomic resources in this crop through effective collaborations and coordination of genomics activities across several institutes and countries. For instance, 25 mapping populations segregating for a number of biotic and abiotic stresses have been developed or are under development. An 11X-genome coverage bacterial artificial chromosome (BAC) library comprising of 69,120 clones have been developed of which 50,000 clones were end sequenced to generate 87,590 BAC-end sequences (BESs). About 10,000 expressed sequence tags (ESTs) from Sanger sequencing and ca. 2 million short ESTs by 454/FLX sequencing have been generated. A variety of molecular markers have been developed from BESs, microsatellite or simple sequence repeat (SSR)-enriched libraries and mining of ESTs and genomic amplicon sequencing. Of about 21,000 SSRs identified, 6,698 SSRs are under analysis along with 670 orthologous genes using a GoldenGate SNP (single nucleotide polymorphism) genotyping platform, with large scale SNP discovery using Solexa, a next generation sequencing technology, is in progress. Similarly a diversity array technology array comprising of ca. 15,000 features has been developed. In addition, >600 unique nucleotide binding site (NBS) domain containing members of the NBS-leucine rich repeat disease resistance homologs were cloned in pigeonpea; 960 BACs containing these sequences were identified by filter hybridization, BES physical maps developed using high information content fingerprinting. To enrich the genomic resources further, sequenced soybean genome is being analyzed to establish the anchor points between pigeonpea and soybean genomes. In addition, Solexa sequencing is being used to explore the feasibility of generating whole genome sequence. In summary, the collaborative efforts of several research groups under the umbrella of PGI are making significant progress in improving molecular tools in pigeonpea and should significantly benefit pigeonpea genetics and breeding. As these efforts come to fruition, and expanded (depending on funding), pigeonpea would move from an ‘orphan legume crop’ to one where genomics-assisted breeding approaches for a sustainable crop improvement are routine. PMID:20976284
Genetic diversity and relationships among different tomato varieties revealed by EST-SSR markers.
Korir, N K; Diao, W; Tao, R; Li, X; Kayesh, E; Li, A; Zhen, W; Wang, S
2014-01-08
The genetic diversity and relationship of 42 tomato varieties sourced from different geographic regions was examined with EST-SSR markers. The genetic diversity was between 0.18 and 0.77, with a mean of 0.49; the polymorphic information content ranged from 0.17 to 0.74, with a mean of 0.45. This indicates a fairly high degree of diversity among these tomato varieties. Based on the cluster analysis using unweighted pair-group method with arithmetic average (UPGMA), all the tomato varieties fell into 5 groups, with no obvious geographical distribution characteristics despite their diverse sources. The principal component analysis (PCA) supported the clustering result; however, relationships among varieties were more complex in the PCA scatterplot than in the UPGMA dendrogram. This information about the genetic relationships between these tomato lines helps distinguish these 42 varieties and will be useful for tomato variety breeding and selection. We confirm that the EST-SSR marker system is useful for studying genetic diversity among tomato varieties. The high degree of polymorphism and the large number of bands obtained per assay shows that SSR is the most informative marker system for tomato genotyping for purposes of rights/protection and for the tomato industry in general. It is recommended that these varieties be subjected to identification using an SSR-based manual cultivar identification diagram strategy or other easy-to-use and referable methods so as to provide a complete set of information concerning genetic relationships and a readily usable means of identifying these varieties.
Hadle, Jacob J.; Konrade, Lauren A.; Beasley, Rochelle R.; ...
2016-08-03
Buchloë dactyloides (Nutt.) Engelm. (buffalograss; Poaceae) is a low-growing, perennial C4 grass that is a dominant component of shortgrass prairies of the North American Great Plains (Shearman et al., 2004). Beyond this significant ecosystem role, buffalograss has been widely adopted as a drought-tolerant turfgrass alternative, particularly notable as a native-species option in North America. Like many dominant Great Plains grasses, B. dactyloides comprises an autopolypoid series, including diploids (2n = 20), tetraploids, pentaploids, and hexaploids (Johnson et al., 2001). Preserving the full range of buffalograss phenotypic and genotypic diversity and utilizing this diversity for crop improvement will require an understandingmore » of the distribution of genetic variation among cytotypes and across its large geographic range. Beyond numerous methodological advantages (Guichoux et al., 2011), microsatellites, or simple sequence repeat (SSR) markers,are an attractive genetic tool for studies of wide-ranging polyploid series given their codominant nature and applicability to museum-derived DNAs. Because SSR data are routinely obtainable from DNA extracted from museum tissue (Wandeler et al., 2007), these samples can be used to quickly and economically obtain comparative genotypic data from all portions of a large geographic range. Currently no buffalograss-specific SSR loci are available, as previous studies have relied on a mixture of dominant and codominant loci that were designed for other taxa (Budak et al., 2004). In this study, a set of SSR loci are designed from B. dactyloides genomic sequence data. The variability of these loci are then evaluated in six populations from numerous portions of the buffalograss range.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hadle, Jacob J.; Konrade, Lauren A.; Beasley, Rochelle R.
Buchloë dactyloides (Nutt.) Engelm. (buffalograss; Poaceae) is a low-growing, perennial C4 grass that is a dominant component of shortgrass prairies of the North American Great Plains (Shearman et al., 2004). Beyond this significant ecosystem role, buffalograss has been widely adopted as a drought-tolerant turfgrass alternative, particularly notable as a native-species option in North America. Like many dominant Great Plains grasses, B. dactyloides comprises an autopolypoid series, including diploids (2n = 20), tetraploids, pentaploids, and hexaploids (Johnson et al., 2001). Preserving the full range of buffalograss phenotypic and genotypic diversity and utilizing this diversity for crop improvement will require an understandingmore » of the distribution of genetic variation among cytotypes and across its large geographic range. Beyond numerous methodological advantages (Guichoux et al., 2011), microsatellites, or simple sequence repeat (SSR) markers,are an attractive genetic tool for studies of wide-ranging polyploid series given their codominant nature and applicability to museum-derived DNAs. Because SSR data are routinely obtainable from DNA extracted from museum tissue (Wandeler et al., 2007), these samples can be used to quickly and economically obtain comparative genotypic data from all portions of a large geographic range. Currently no buffalograss-specific SSR loci are available, as previous studies have relied on a mixture of dominant and codominant loci that were designed for other taxa (Budak et al., 2004). In this study, a set of SSR loci are designed from B. dactyloides genomic sequence data. The variability of these loci are then evaluated in six populations from numerous portions of the buffalograss range.« less
Genetic variation patterns of American chestnut populations at EST-SSRs
Oliver Gailing; C. Dana Nelson
2017-01-01
The objective of this study is to analyze patterns of genetic variation at genic expressed sequence tag - simple sequence repeats (EST-SSRs) and at chloroplast DNA markers in populations of American chestnut (Castanea dentata Borkh.) to assist in conservation and breeding efforts. Allelic diversity at EST-SSRs decreased significantly from southwest to northeast along...
RAPD and SSR Polymorphisms in Mutant Lines of Transgenic Wheat Mediated by Low Energy Ion Beam
NASA Astrophysics Data System (ADS)
Wang, Tiegu; Huang, Qunce; Feng, Weisen
2007-10-01
Two types of markers-random amplified polymorphic DNA (RAPD) and simple sequence repeat DNA (SSR)-have been used to characterize the genetic diversity among nine mutant lines of transgenic wheat intermediated by low energy ion beam and their four receptor cultivars. The objectives of this study were to analyze RAPD-based and SSR-based genetic variance among transgenic wheat lines and with their receptors, and to find specific genetic markers of special traits of transgenic wheat lines. 170 RAPD primers were amplified to 733 fragments in all the experimental materials. There were 121 polymorphic fragments out of the 733 fragments with a ratio of polymorphic fragments of 16.5%. 29 SSR primer pairs were amplified to 83 fragments in all the experiment materials. There were 57 polymorphic fragments out of the 83 fragments with a ratio of polymorphic fragments of 68.7%. The dendrograms were prepared based on a genetic distance matrix using the UPGMA (Unweighted Pair-group Method with Arithmetic averaging) algorithm, which corresponded well to the results of the wheat pedigree analysis and separated the 13 genotypes into four groups. Association analysis between RAPD and SSR markers with the special traits of transgenic wheat mutant lines discovered that three RAPD markers, s1, opt-16, and f14, were significantly associated with the muticate trait, while three SSR markers, Rht8 (Xgwm261), Rht-B1b, and Rht-D1b, highly associated with the dwarf trait. These markers will be useful for marker-assistant breeding and can be used as candidate markers for further gene mapping and cloning.
Sugai, Kyoko; Setsuko, Suzuki; Uchiyama, Kentaro; Murakami, Noriaki; Kato, Hidetoshi; Yoshimaru, Hiroshi
2012-02-01
Expressed sequence tag (EST)-derived microsatellite markers were developed for Elaeocarpus photiniifolia, an endemic taxon of the Bonin Islands. Initially, a complementary DNA (cDNA) library was constructed by de novo pyrosequencing of total RNA extracted from a seedling. A total of 267 primer pairs were designed from the library. Of the 48 tested loci, 25 loci were polymorphic among 41 individuals representing the entire geographical range of the species, with the number of alleles per locus and expected heterozygosity ranging from two to 14 and 0.09 to 0.86, respectively. Most loci were transferable to a related species, E. sylvestris. The developed markers will be useful for evaluating the genetic structure of E. photiniifolia.
NASA Astrophysics Data System (ADS)
Zhao, Cui; Zhang, Xiaojun; Liu, Chengzhang; Huan, Pin; Li, Fuhua; Xiang, Jianhai; Huang, Chao
2012-05-01
Little is known about the genome of Pacific white shrimp ( Litopenaeus vannamei). To address this, we conducted BAC (bacterial artificial chromosome) end sequencing of L. vannamei. We selected and sequenced 7 812 BAC clones from the BAC library LvHE from the two ends of the inserts by Sanger sequencing. After trimming and quality filtering, 11 279 BAC end sequences (BESs) including 4 609 pairedends BESs were obtained. The total length of the BESs was 4 340 753 bp, representing 0.18% of the L. vannamei haploid genome. The lengths of the BESs ranged from 100 bp to 660 bp with an average length of 385 bp. Analysis of the BESs indicated that the L. vannamei genome is AT-rich and that the primary repeats patterns were simple sequence repeats (SSRs) and low complexity sequences. Dinucleotide and hexanucleotide repeats were the most common SSR types in the BESs. The most abundant transposable element was gypsy, which may contribute to the generation of the large genome size of L. vannamei. We successfully annotated 4 519 BESs by BLAST searching, including genes involved in immunity and sex determination. Our results provide an important resource for functional gene studies, map construction and integration, and complete genome assembly for this species.
Raju, Nikku L; Gnanesh, Belaghihalli N; Lekha, Pazhamala; Jayashree, Balaji; Pande, Suresh; Hiremath, Pavana J; Byregowda, Munishamappa; Singh, Nagendra K; Varshney, Rajeev K
2010-03-11
Pigeonpea (Cajanus cajan (L.) Millsp) is one of the major grain legume crops of the tropics and subtropics, but biotic stresses [Fusarium wilt (FW), sterility mosaic disease (SMD), etc.] are serious challenges for sustainable crop production. Modern genomic tools such as molecular markers and candidate genes associated with resistance to these stresses offer the possibility of facilitating pigeonpea breeding for improving biotic stress resistance. Availability of limited genomic resources, however, is a serious bottleneck to undertake molecular breeding in pigeonpea to develop superior genotypes with enhanced resistance to above mentioned biotic stresses. With an objective of enhancing genomic resources in pigeonpea, this study reports generation and analysis of comprehensive resource of FW- and SMD- responsive expressed sequence tags (ESTs). A total of 16 cDNA libraries were constructed from four pigeonpea genotypes that are resistant and susceptible to FW ('ICPL 20102' and 'ICP 2376') and SMD ('ICP 7035' and 'TTB 7') and a total of 9,888 (9,468 high quality) ESTs were generated and deposited in dbEST of GenBank under accession numbers GR463974 to GR473857 and GR958228 to GR958231. Clustering and assembly analyses of these ESTs resulted into 4,557 unique sequences (unigenes) including 697 contigs and 3,860 singletons. BLASTN analysis of 4,557 unigenes showed a significant identity with ESTs of different legumes (23.2-60.3%), rice (28.3%), Arabidopsis (33.7%) and poplar (35.4%). As expected, pigeonpea ESTs are more closely related to soybean (60.3%) and cowpea ESTs (43.6%) than other plant ESTs. Similarly, BLASTX similarity results showed that only 1,603 (35.1%) out of 4,557 total unigenes correspond to known proteins in the UniProt database (
2010-01-01
Background Pigeonpea (Cajanus cajan (L.) Millsp) is one of the major grain legume crops of the tropics and subtropics, but biotic stresses [Fusarium wilt (FW), sterility mosaic disease (SMD), etc.] are serious challenges for sustainable crop production. Modern genomic tools such as molecular markers and candidate genes associated with resistance to these stresses offer the possibility of facilitating pigeonpea breeding for improving biotic stress resistance. Availability of limited genomic resources, however, is a serious bottleneck to undertake molecular breeding in pigeonpea to develop superior genotypes with enhanced resistance to above mentioned biotic stresses. With an objective of enhancing genomic resources in pigeonpea, this study reports generation and analysis of comprehensive resource of FW- and SMD- responsive expressed sequence tags (ESTs). Results A total of 16 cDNA libraries were constructed from four pigeonpea genotypes that are resistant and susceptible to FW ('ICPL 20102' and 'ICP 2376') and SMD ('ICP 7035' and 'TTB 7') and a total of 9,888 (9,468 high quality) ESTs were generated and deposited in dbEST of GenBank under accession numbers GR463974 to GR473857 and GR958228 to GR958231. Clustering and assembly analyses of these ESTs resulted into 4,557 unique sequences (unigenes) including 697 contigs and 3,860 singletons. BLASTN analysis of 4,557 unigenes showed a significant identity with ESTs of different legumes (23.2-60.3%), rice (28.3%), Arabidopsis (33.7%) and poplar (35.4%). As expected, pigeonpea ESTs are more closely related to soybean (60.3%) and cowpea ESTs (43.6%) than other plant ESTs. Similarly, BLASTX similarity results showed that only 1,603 (35.1%) out of 4,557 total unigenes correspond to known proteins in the UniProt database (≤ 1E-08). Functional categorization of the annotated unigenes sequences showed that 153 (3.3%) genes were assigned to cellular component category, 132 (2.8%) to biological process, and 132 (2.8%) in molecular function. Further, 19 genes were identified differentially expressed between FW- responsive genotypes and 20 between SMD- responsive genotypes. Generated ESTs were compiled together with 908 ESTs available in public domain, at the time of analysis, and a set of 5,085 unigenes were defined that were used for identification of molecular markers in pigeonpea. For instance, 3,583 simple sequence repeat (SSR) motifs were identified in 1,365 unigenes and 383 primer pairs were designed. Assessment of a set of 84 primer pairs on 40 elite pigeonpea lines showed polymorphism with 15 (28.8%) markers with an average of four alleles per marker and an average polymorphic information content (PIC) value of 0.40. Similarly, in silico mining of 133 contigs with ≥ 5 sequences detected 102 single nucleotide polymorphisms (SNPs) in 37 contigs. As an example, a set of 10 contigs were used for confirming in silico predicted SNPs in a set of four genotypes using wet lab experiments. Occurrence of SNPs were confirmed for all the 6 contigs for which scorable and sequenceable amplicons were generated. PCR amplicons were not obtained in case of 4 contigs. Recognition sites for restriction enzymes were identified for 102 SNPs in 37 contigs that indicates possibility of assaying SNPs in 37 genes using cleaved amplified polymorphic sequences (CAPS) assay. Conclusion The pigeonpea EST dataset generated here provides a transcriptomic resource for gene discovery and development of functional markers associated with biotic stress resistance. Sequence analyses of this dataset have showed conservation of a considerable number of pigeonpea transcripts across legume and model plant species analysed as well as some putative pigeonpea specific genes. Validation of identified biotic stress responsive genes should provide candidate genes for allele mining as well as candidate markers for molecular breeding. PMID:20222972
Wang, Xiao-Ting; Zhang, Yu-Juan; Qiao, Liang; Chen, Bin
2018-02-27
Simple sequence repeats (SSRs) exist in both eukaryotic and prokaryotic genomes and are the most popular genetic markers, but the SSRs of mosquito genomes are still not well understood. In this study, we identified and analyzed the SSRs in 23 mosquito species using Drosophila melanogaster as reference at the whole-genome level. The results show that SSR numbers (33 076-560 175/genome) and genome sizes (574.57-1342.21 Mb) are significantly positively correlated (R 2 = 0.8992, P < 0.01), but the correlation in individual species varies in these mosquito species. In six types of SSR, mono- to trinucleotide SSRs are dominant with cumulative percentages of 95.14%-99.00% and densities of 195.65/Mb-787.51/Mb, whereas tetra- to hexanucleotide SSRs are rare with 1.12%-4.22% and 3.76/Mb-40.23/Mb. The (A/T)n, (AC/GT)n and (AGC/GCT)n are the most frequent motifs in mononucleotide, dinucleotide and trinucleotide SSRs, respectively, and the motif frequencies of tetra- to hexanucleotide SSRs appear to be species-specific. The 10-20 bp length of SSRs are dominant with the number of 110 561 ± 93 482 and the frequency of 87.25% ± 5.73% on average, and the number and frequency decline with the increase of length. Most SSRs (83.34% ± 7.72%) are located in intergenic regions, followed by intron regions (11.59% ± 5.59%), exon regions (3.74% ± 1.95%), and untranslated regions (1.32% ± 1.39%). The mono-, di- and trinucleotide SSRs are the main SSRs in both gene regions (98.55% ± 0.85%) and exon regions (99.27% ± 0.52%). An average of 42.52% of total genes contains SSRs, and the preference for SSR occurrence in different gene subcategories are species-specific. The study provides useful insights into the SSR diversity, characteristics and distribution in 23 mosquito species of genomes. © 2018 Institute of Zoology, Chinese Academy of Sciences.
McBenedict, Billy; Chimwamurombe, Percy; Kwembeya, Ezekeil; Maggs-Kölling, Gillian
2016-01-01
Current Pennisetum glaucum (L.) R. BR. cultivars in Namibia have overall poor performance posing a threat to the nation's food security because this crop is staple for over 70% of the Namibian population. The crop suffers from undesirable production traits such as susceptibility to diseases, low yield, and prolonged reproductive cycle. This study aimed to understand the genetic diversity of the crop in Namibia by simple sequence repeats (SSRs) and morphology analysis. A total of 1441 genotypes were collected from the National Gene Bank representing all the Namibian landraces. A sample of 96 genotypes was further analyzed by SSR using Shannon-Wiener diversity index and revealed a value of 0.45 indicating low genetic diversity. Ordination using Principal Coordinate Analysis (PCoA) on SSR data confirmed clusters generated by UPGMA for the 96 P. glaucum accessions. UPGMA phenograms of 29 morphological characterized genotypes were generated for SSR and morphology data and the two trees revealed 78% resemblance. Lodging susceptibility, tillering attitude, spike density, fodder yield potential, early vigour, and spike shape were the phenotypic characters upon which some clusters were based in both datasets. It is recommended that efforts should be made to widen the current gene pool in Namibia.
McBenedict, Billy; Chimwamurombe, Percy; Kwembeya, Ezekeil; Maggs-Kölling, Gillian
2016-01-01
Current Pennisetum glaucum (L.) R. BR. cultivars in Namibia have overall poor performance posing a threat to the nation's food security because this crop is staple for over 70% of the Namibian population. The crop suffers from undesirable production traits such as susceptibility to diseases, low yield, and prolonged reproductive cycle. This study aimed to understand the genetic diversity of the crop in Namibia by simple sequence repeats (SSRs) and morphology analysis. A total of 1441 genotypes were collected from the National Gene Bank representing all the Namibian landraces. A sample of 96 genotypes was further analyzed by SSR using Shannon-Wiener diversity index and revealed a value of 0.45 indicating low genetic diversity. Ordination using Principal Coordinate Analysis (PCoA) on SSR data confirmed clusters generated by UPGMA for the 96 P. glaucum accessions. UPGMA phenograms of 29 morphological characterized genotypes were generated for SSR and morphology data and the two trees revealed 78% resemblance. Lodging susceptibility, tillering attitude, spike density, fodder yield potential, early vigour, and spike shape were the phenotypic characters upon which some clusters were based in both datasets. It is recommended that efforts should be made to widen the current gene pool in Namibia. PMID:27433479
Ashrafi, Hamid; Hill, Theresa; Stoffel, Kevin; Kozik, Alexander; Yao, Jiqiang; Chin-Wo, Sebastian Reyes; Van Deynze, Allen
2012-10-30
Molecular breeding of pepper (Capsicum spp.) can be accelerated by developing DNA markers associated with transcriptomes in breeding germplasm. Before the advent of next generation sequencing (NGS) technologies, the majority of sequencing data were generated by the Sanger sequencing method. By leveraging Sanger EST data, we have generated a wealth of genetic information for pepper including thousands of SNPs and Single Position Polymorphic (SPP) markers. To complement and enhance these resources, we applied NGS to three pepper genotypes: Maor, Early Jalapeño and Criollo de Morelos-334 (CM334) to identify SNPs and SSRs in the assembly of these three genotypes. Two pepper transcriptome assemblies were developed with different purposes. The first reference sequence, assembled by CAP3 software, comprises 31,196 contigs from >125,000 Sanger-EST sequences that were mainly derived from a Korean F1-hybrid line, Bukang. Overlapping probes were designed for 30,815 unigenes to construct a pepper Affymetrix GeneChip® microarray for whole genome analyses. In addition, custom Python scripts were used to identify 4,236 SNPs in contigs of the assembly. A total of 2,489 simple sequence repeats (SSRs) were identified from the assembly, and primers were designed for the SSRs. Annotation of contigs using Blast2GO software resulted in information for 60% of the unigenes in the assembly. The second transcriptome assembly was constructed from more than 200 million Illumina Genome Analyzer II reads (80-120 nt) using a combination of Velvet, CLC workbench and CAP3 software packages. BWA, SAMtools and in-house Perl scripts were used to identify SNPs among three pepper genotypes. The SNPs were filtered to be at least 50 bp from any intron-exon junctions as well as flanking SNPs. More than 22,000 high-quality putative SNPs were identified. Using the MISA software, 10,398 SSR markers were also identified within the Illumina transcriptome assembly and primers were designed for the identified markers. The assembly was annotated by Blast2GO and 14,740 (12%) of annotated contigs were associated with functional proteins. Before availability of pepper genome sequence, assembling transcriptomes of this economically important crop was required to generate thousands of high-quality molecular markers that could be used in breeding programs. In order to have a better understanding of the assembled sequences and to identify candidate genes underlying QTLs, we annotated the contigs of Sanger-EST and Illumina transcriptome assemblies. These and other information have been curated in a database that we have dedicated for pepper project.
Genetic diversity in the germplasm of black pepper determined by EST-SSR markers.
Wu, B D; Fan, R; Hu, L S; Wu, H S; Hao, C Y
2016-03-18
This study aimed to assess genetic diversity in the germplasm of black pepper from around the world using SSR markers from EST. In total, 13 markers were selected and successfully amplified the target loci across the black pepper germplasm. All the EST-SSR markers showed high levels of polymorphisms with an average polymorphism information content of 0.93. The genetic similarity coefficients among all accessions ranged from 0.724 to 1.000, with an average of 0.867. These results indicated that black pepper germplasms possess a complex genetic background and high genetic diversity. Based on a cluster analysis, 148 black pepper germplasms were grouped in two major clades: the Neotropics and the Asian tropics. Peperomia pellucida was grouped separately and distantly from all other accessions. These results generally agreed with the genetic and geographic distances. However, the Asian tropics clade did not cluster according to their geographic origins. In addition, compared with the American accessions, the Asian wild accessions and cultivated accessions grouped together, indicating a close genetic relationship. This verified the origin of black pepper. The newly developed EST-SSRs are highly valuable resources for the conservation of black pepper germplasm diversity and for black pepper breeding.
Mahato, Ajay Kumar; Sharma, Nimisha; Singh, Akshay; Srivastav, Manish; Jaiprakash; Singh, Sanjay Kumar; Singh, Anand Kumar; Sharma, Tilak Raj; Singh, Nagendra Kumar
2016-01-01
Mango (Mangifera indica L.) is called "king of fruits" due to its sweetness, richness of taste, diversity, large production volume and a variety of end usage. Despite its huge economic importance genomic resources in mango are scarce and genetics of useful horticultural traits are poorly understood. Here we generated deep coverage leaf RNA sequence data for mango parental varieties 'Neelam', 'Dashehari' and their hybrid 'Amrapali' using next generation sequencing technologies. De-novo sequence assembly generated 27,528, 20,771 and 35,182 transcripts for the three genotypes, respectively. The transcripts were further assembled into a non-redundant set of 70,057 unigenes that were used for SSR and SNP identification and annotation. Total 5,465 SSR loci were identified in 4,912 unigenes with 288 type I SSR (n ≥ 20 bp). One hundred type I SSR markers were randomly selected of which 43 yielded PCR amplicons of expected size in the first round of validation and were designated as validated genic-SSR markers. Further, 22,306 SNPs were identified by aligning high quality sequence reads of the three mango varieties to the reference unigene set, revealing significantly enhanced SNP heterozygosity in the hybrid Amrapali. The present study on leaf RNA sequencing of mango varieties and their hybrid provides useful genomic resource for genetic improvement of mango.
Mahato, Ajay Kumar; Sharma, Nimisha; Singh, Akshay; Srivastav, Manish; Jaiprakash; Singh, Sanjay Kumar; Singh, Anand Kumar; Sharma, Tilak Raj; Singh, Nagendra Kumar
2016-01-01
Mango (Mangifera indica L.) is called “king of fruits” due to its sweetness, richness of taste, diversity, large production volume and a variety of end usage. Despite its huge economic importance genomic resources in mango are scarce and genetics of useful horticultural traits are poorly understood. Here we generated deep coverage leaf RNA sequence data for mango parental varieties ‘Neelam’, ‘Dashehari’ and their hybrid ‘Amrapali’ using next generation sequencing technologies. De-novo sequence assembly generated 27,528, 20,771 and 35,182 transcripts for the three genotypes, respectively. The transcripts were further assembled into a non-redundant set of 70,057 unigenes that were used for SSR and SNP identification and annotation. Total 5,465 SSR loci were identified in 4,912 unigenes with 288 type I SSR (n ≥ 20 bp). One hundred type I SSR markers were randomly selected of which 43 yielded PCR amplicons of expected size in the first round of validation and were designated as validated genic-SSR markers. Further, 22,306 SNPs were identified by aligning high quality sequence reads of the three mango varieties to the reference unigene set, revealing significantly enhanced SNP heterozygosity in the hybrid Amrapali. The present study on leaf RNA sequencing of mango varieties and their hybrid provides useful genomic resource for genetic improvement of mango. PMID:27736892
Genome-Wide Association of Rice Blast Disease Resistance and Yield-Related Components of Rice.
Wang, Xueyan; Jia, Melissa H; Ghai, Pooja; Lee, Fleet N; Jia, Yulin
2015-12-01
Robust disease resistance may require an expenditure of energy that may limit crop yield potential. In the present study, a subset of a United States Department of Agriculture rice core collection consisting of 151 accessions was selected using a major blast resistance (R) gene, Pi-ta, marker and was genotyped with 156 simple sequence repeat (SSR) markers. Disease reactions to Magnaporthe oryzae, the causal agent of rice blast disease, were evaluated under greenhouse and field conditions, and heading date, plant height, paddy and brown seed weight in two field environments were analyzed, using an association mapping approach. A total of 21 SSR markers distributed among rice chromosomes 2 to 12 were associated with blast resistance, and 16 SSR markers were associated with seed weight, heading date, and plant height. Most noticeably, shorter plants were significantly correlated with resistance to blast, rice genomes with Pi-ta were associated with lighter seed weights, and the susceptible alleles of RM171 and RM6544 were associated with heavier seed weight. These findings unraveled a complex relationship between disease resistance and yield-related components.
Molecular Mapping of Restriction-Site Associated DNA Markers In Allotetraploid Upland Cotton.
Wang, Yangkun; Ning, Zhiyuan; Hu, Yan; Chen, Jiedan; Zhao, Rui; Chen, Hong; Ai, Nijiang; Guo, Wangzhen; Zhang, Tianzhen
2015-01-01
Upland cotton (Gossypium hirsutum L., 2n = 52, AADD) is an allotetraploid, therefore the discovery of single nucleotide polymorphism (SNP) markers is difficult. The recent emergence of genome complexity reduction technologies based on the next-generation sequencing (NGS) platform has greatly expedited SNP discovery in crops with highly repetitive and complex genomes. Here we applied restriction-site associated DNA (RAD) sequencing technology for de novo SNP discovery in allotetraploid cotton. We identified 21,109 SNPs between the two parents and used these for genotyping of 161 recombinant inbred lines (RILs). Finally, a high dense linkage map comprising 4,153 loci over 3500-cM was developed based on the previous result. Using this map quantitative trait locus (QTLs) conferring fiber strength and Verticillium Wilt (VW) resistance were mapped to a more accurate region in comparison to the 1576-cM interval determined using the simple sequence repeat (SSR) genetic map. This suggests that the newly constructed map has more power and resolution than the previous SSR map. It will pave the way for the rapid identification of the marker-assisted selection in cotton breeding and cloning of QTL of interest traits.
Development of DArT-based PCR markers for selecting drought-tolerant spring barley.
Fiust, Anna; Rapacz, Marcin; Wójcik-Jagła, Magdalena; Tyrka, Mirosław
2015-08-01
The tolerance of spring barley (Hordeum vulgare L.) cultivars to spring drought is an important agronomic trait affecting crop yield and quality in Poland. Therefore, breeders require new molecular markers to select plants with lower spring drought susceptibility. With the advent of genomic selection technology, simple molecular tools may still be applicable to screen material for markers of the most important traits and in-depth genome scanning. In previous studies, diversity arrays technology (DArT)-based genetic maps were constructed for F2 populations of Polish fodder and malt barley elite breeding lines, and 15 and 18 quantitative trait loci (QTLs) related to spring drought tolerance were identified, respectively. In this paper, we show the results of a conversion of 30 DArT markers corresponding to 11 QTLs into simple sequence repeat (SSR) and sequence tagged site (STS) markers. Twenty-two polymorphic markers were obtained, including 13 DArT-based SSRs. Additionally, 31 SSR markers, located in close proximity to the DArT markers, were selected from the GrainGenes database and tested. Further analyses of 24 advanced breeding lines with different drought tolerances confirmed that five out of the 30 converted markers, as well as three out of the 31 additional SSR markers, were effective in marker-assisted selection for drought tolerance. The possible function of clones related to these markers in drought tolerance is discussed.
Miller, Mark P.; Knaus, Brian J.; Mullins, Thomas D.; Haig, Susan M.
2013-01-01
SSR_pipeline is a flexible set of programs designed to efficiently identify simple sequence repeats (SSRs; for example, microsatellites) from paired-end high-throughput Illumina DNA sequencing data. The program suite contains three analysis modules along with a fourth control module that can be used to automate analyses of large volumes of data. The modules are used to (1) identify the subset of paired-end sequences that pass quality standards, (2) align paired-end reads into a single composite DNA sequence, and (3) identify sequences that possess microsatellites conforming to user specified parameters. Each of the three separate analysis modules also can be used independently to provide greater flexibility or to work with FASTQ or FASTA files generated from other sequencing platforms (Roche 454, Ion Torrent, etc). All modules are implemented in the Python programming language and can therefore be used from nearly any computer operating system (Linux, Macintosh, Windows). The program suite relies on a compiled Python extension module to perform paired-end alignments. Instructions for compiling the extension from source code are provided in the documentation. Users who do not have Python installed on their computers or who do not have the ability to compile software also may choose to download packaged executable files. These files include all Python scripts, a copy of the compiled extension module, and a minimal installation of Python in a single binary executable. See program documentation for more information.
Shirasawa, Kenta; Hand, Melanie L.; Henderson, Steven T.; Okada, Takashi; Johnson, Susan D.; Taylor, Jennifer M.; Spriggs, Andrew; Siddons, Hayley; Hirakawa, Hideki; Isobe, Sachiko; Tabata, Satoshi; Koltunow, Anna M. G.
2015-01-01
Background and Aims Apomixis in plants generates clonal progeny with a maternal genotype through asexual seed formation. Hieracium subgenus Pilosella (Asteraceae) contains polyploid, highly heterozygous apomictic and sexual species. Within apomictic Hieracium, dominant genetic loci independently regulate the qualitative developmental components of apomixis. In H. praealtum, LOSS OF APOMEIOSIS (LOA) enables formation of embryo sacs without meiosis and LOSS OF PARTHENOGENESIS (LOP) enables fertilization-independent seed formation. A locus required for fertilization-independent endosperm formation (AutE) has been identified in H. piloselloides. Additional quantitative loci appear to influence the penetrance of the qualitative loci, although the controlling genes remain unknown. This study aimed to develop the first genetic linkage maps for sexual and apomictic Hieracium species using simple sequence repeat (SSR) markers derived from expressed transcripts within the developing ovaries. Methods RNA from microdissected Hieracium ovule cell types and ovaries was sequenced and SSRs were identified. Two different F1 mapping populations were created to overcome difficulties associated with genome complexity and asexual reproduction. SSR markers were analysed within each mapping population to generate draft linkage maps for apomictic and sexual Hieracium species. Key Results A collection of 14 684 Hieracium expressed SSR markers were developed and linkage maps were constructed for Hieracium species using a subset of the SSR markers. Both the LOA and LOP loci were successfully assigned to linkage groups; however, AutE could not be mapped using the current populations. Comparisons with lettuce (Lactuca sativa) revealed partial macrosynteny between the two Asteraceae species. Conclusions A collection of SSR markers and draft linkage maps were developed for two apomictic and one sexual Hieracium species. These maps will support cloning of controlling genes at LOA and LOP loci in Hieracium and should also assist with identification of quantitative loci that affect the expressivity of apomixis. Future work will focus on mapping AutE using alternative populations. PMID:25538115
Liu, Shaoqun; Li, Wanshun; Wu, Yimin; Chen, Changming; Lei, Jianjun
2013-01-01
The capsaicinoids are a group of compounds produced by chili pepper fruits and are used widely in many fields, especially in medical purposes. The capsaicinoid biosynthetic pathway has not yet been established clearly. To understand more knowledge in biosynthesis of capsaicinoids, we applied RNA-seq for the mixture of placenta and pericarp of pungent pepper (Capsicum frutescens L.). We have assessed the effect of various assembly parameters using different assembly software, and obtained one of the best strategies for de novo assembly of transcriptome data. We obtained a total 54,045 high-quality unigenes (transcripts) using Trinity software. About 92.65% of unigenes showed similarity to the public protein sequences, genome of potato and tomato and pepper (C. annuum) ESTs databases. Our results predicted 3 new structural genes (DHAD, TD, PAT), which filled gaps of the capsaicinoid biosynthetic pathway predicted by Mazourek, and revealed new candidate genes involved in capsaicinoid biosynthesis based on KEGG (Kyoto Encyclopedia of Genes and Genomes) analysis. A significant number of SSR (Simple Sequence Repeat) and SNP (Single Nucleotide Polymorphism) markers were predicted in C. frutescens and C. annuum sequences, which will be helpful in the identification of polymorphisms within chili pepper populations. These data will provide new insights to the pathway of capsaicinoid biosynthesis and subsequent research of chili peppers. In addition, our strategy of de novo transcriptome assembly is applicable to a wide range of similar studies.
Liu, Shaoqun; Li, Wanshun; Wu, Yimin; Chen, Changming; Lei, Jianjun
2013-01-01
The capsaicinoids are a group of compounds produced by chili pepper fruits and are used widely in many fields, especially in medical purposes. The capsaicinoid biosynthetic pathway has not yet been established clearly. To understand more knowledge in biosynthesis of capsaicinoids, we applied RNA-seq for the mixture of placenta and pericarp of pungent pepper (Capsicum frutescens L.). We have assessed the effect of various assembly parameters using different assembly software, and obtained one of the best strategies for de novo assembly of transcriptome data. We obtained a total 54,045 high-quality unigenes (transcripts) using Trinity software. About 92.65% of unigenes showed similarity to the public protein sequences, genome of potato and tomato and pepper (C. annuum) ESTs databases. Our results predicted 3 new structural genes (DHAD, TD, PAT), which filled gaps of the capsaicinoid biosynthetic pathway predicted by Mazourek, and revealed new candidate genes involved in capsaicinoid biosynthesis based on KEGG (Kyoto Encyclopedia of Genes and Genomes) analysis. A significant number of SSR (Simple Sequence Repeat) and SNP (Single Nucleotide Polymorphism) markers were predicted in C. frutescens and C. annuum sequences, which will be helpful in the identification of polymorphisms within chili pepper populations. These data will provide new insights to the pathway of capsaicinoid biosynthesis and subsequent research of chili peppers. In addition, our strategy of de novo transcriptome assembly is applicable to a wide range of similar studies. PMID:23349661
Obaid, Ramiz; Abu-Qaoud, Hassan; Arafeh, Rami
2014-09-03
Eight accessions of olive trees from three common varieties in Palestine, Nabali Baladi, Nabali Mohassan and Surri, were genetically evaluated using five simple sequence repeat (SSR) markers. A total of 17 alleles from 5 loci were observed in which 15 (88.2%) were polymorphic and 2 (11.8%) were monomorphic. An average of 3.4 alleles per locus was found ranging from 2.0 alleles with the primers GAPU-103 and DCA-9 to 5.0 alleles with U9932 and DCA-16. The smallest amplicon size observed was 50 bp with the primer DCA-16, whereas the largest one (450 bp) with the primer U9932. Cluster analysis with the unweighted pair group method with arithmetic average (UPGMA) showed three clusters: a cluster with four accessions from the 'Nabali Baladi' cultivar, another cluster with three accessions that represents the 'Nabali Mohassen' cultivar and finally the 'Surri' cultivar. The similarity coefficient for the eight olive tree samples ranged from a maximum of 100% between two accessions from Nabali Baladi and also in two other samples from Nabali Mohassan, to a minimum similarity coefficient (0.315) between the Surri and two Nabali Baladi accessions. The results in this investigation clearly highlight the genetic dissimilarity between the three main olive cultivars that have been misidentified and mixed up in the past, based on conventional morphological characters.
ssrA (tmRNA) Plays a Role in Salmonella enterica Serovar Typhimurium Pathogenesis
Julio, Steven M.; Heithoff, Douglas M.; Mahan, Michael J.
2000-01-01
Escherichia coli ssrA encodes a small stable RNA molecule, tmRNA, that has many diverse functions, including tagging abnormal proteins for degradation, supporting phage growth, and modulating the activity of DNA binding proteins. Here we show that ssrA plays a role in Salmonella enterica serovar Typhimurium pathogenesis and in the expression of several genes known to be induced during infection. Moreover, the phage-like attachment site, attL, encoded within ssrA, serves as the site of integration of a region of Salmonella-specific sequence; adjacent to the 5′ end of ssrA is another region of Salmonella-specific sequence with extensive homology to predicted proteins encoded within the unlinked Salmonella pathogenicity island SPI4. S. enterica serovar Typhimurium ssrA mutants fail to support the growth of phage P22 and are delayed in their ability to form viable phage particles following induction of a phage P22 lysogen. These data indicate that ssrA plays a role in the pathogenesis of Salmonella, serves as an attachment site for Salmonella-specific sequences, and is required for the growth of phage P22. PMID:10692360
Reisch, Christoph; Schurm, Sophia; Poschlod, Peter
2007-01-01
Background and Aims Many alpine plant species combine clonal and sexual reproduction to minimize the risks of flowering and seed production in high mountain regions. The spatial genetic structure and diversity of these alpine species is strongly affected by different clonal strategies (phalanx or guerrilla) and the proportion of generative and vegetative reproduction. Methods The clonal structure of the alpine plant species Salix herbacea was investigated in a 3 × 3 m plot of an alpine meadow using microsatellite (simple sequence repeat; SSR) analysis. The data obtained were compared with the results of a random amplified polymorphic DNA (RAPD) analysis. Key Results SSR analysis, based on three loci and 16 alleles, revealed 24 different genotypes and a proportion of distinguishable genotypes of 0·18. Six SSR clones were found consisting of at least five samples, 17 clones consisting of more than two samples and seven single genotypes. Mean clone size comprising at least five samples was 0·96 m2, and spatial autocorrelation analysis showed strong similarity of samples up to 130 cm. RAPD analysis revealed a higher level of clonal diversity but a comparable number of larger clones and a similar spatial structure. Conclusions The spatial genetic structure as well as the occurrence of single genotypes revealed in this study suggests both clonal and sexual propagation and repeated seedling recruitment in established populations of S. herbacea and is thus suggestive of a relaxed phalanx strategy. PMID:17242040
Ohyama, Akio; Shirasawa, Kenta; Matsunaga, Hiroshi; Negoro, Satomi; Miyatake, Koji; Yamaguchi, Hirotaka; Nunome, Tsukasa; Iwata, Hiroyoshi; Fukuoka, Hiroyuki; Hayashi, Takeshi
2017-08-01
Using newly developed euchromatin-derived genomic SSR markers and a flexible Bayesian mapping method, 13 significant agricultural QTLs were identified in a segregating population derived from a four-way cross of tomato. So far, many QTL mapping studies in tomato have been performed for progeny obtained from crosses between two genetically distant parents, e.g., domesticated tomatoes and wild relatives. However, QTL information of quantitative traits related to yield (e.g., flower or fruit number, and total or average weight of fruits) in such intercross populations would be of limited use for breeding commercial tomato cultivars because individuals in the populations have specific genetic backgrounds underlying extremely different phenotypes between the parents such as large fruit in domesticated tomatoes and small fruit in wild relatives, which may not be reflective of the genetic variation in tomato breeding populations. In this study, we constructed F 2 population derived from a cross between two commercial F 1 cultivars in tomato to extract QTL information practical for tomato breeding. This cross corresponded to a four-way cross, because the four parental lines of the two F 1 cultivars were considered to be the founders. We developed 2510 new expressed sequence tag (EST)-based (euchromatin-derived) genomic SSR markers and selected 262 markers from these new SSR markers and publicly available SSR markers to construct a linkage map. QTL analysis for ten agricultural traits of tomato was performed based on the phenotypes and marker genotypes of F 2 plants using a flexible Bayesian method. As results, 13 QTL regions were detected for six traits by the Bayesian method developed in this study.
Zhou, Yan; Chen, Lianfu; Fan, Xiuzhi; Bian, Yinbing
2014-01-01
Auricularia polytricha (Mont.) Sacc., a type of edible black-brown mushroom with a gelatinous and modality-specific fruiting body, is in high demand in Asia due to its nutritional and medicinal properties. Illumina Solexa sequenceing technology was used to generate very large transcript sequences from the mycelium and the mature fruiting body of A. polytricha for gene discovery and molecular marker development. De novo assembly generated 36,483 ESTs with an N50 length of 636 bp. A total of 28,108 ESTs demonstrated significant hits with known proteins in the nr database, and 94.03% of the annotated ESTs showed the greatest similarity to A. delicata, a related species of A. polytricha. Functional categorization of the Gene Ontology (GO), Clusters of Orthologous Groups (COG) and Kyoto Encyclopedia of Genes and Genomes (KEGG) metabolic pathways revealed the conservation of genes involved in various biological processes in A. polytricha. Gene expression profile analysis indicated that a total of 2,057 ESTs were differentially expressed, including 1,020 ESTs that were up-regulated in the mycelium and 1,037 up-regulated in the fruiting body. Functional enrichment showed that the ESTs associated with biosynthesis, metabolism and assembly of proteins were more active in fruiting body development. The expression patterns of homologous transcription factors indicated that the molecular mechanisms of fruiting body formation and development were not exactly the same as for other agarics. Interestingly, an EST encoding tyrosinase was significantly up-regulated in the fruiting body, indicating that melanins accumulated during the processes of the formation of the black-brown color of the fruiting body in A. polytricha development. In addition, a total of 1,715 potential SSRs were detected in this transcriptome. The transcriptome analysis of A. polytricha provides valuable sequence resources and numerous molecular markers to facilitate further functional genomics studies and genetic researches on this fungus. PMID:24626227
MSDB: A Comprehensive Database of Simple Sequence Repeats.
Avvaru, Akshay Kumar; Saxena, Saketh; Sowpati, Divya Tej; Mishra, Rakesh Kumar
2017-06-01
Microsatellites, also known as Simple Sequence Repeats (SSRs), are short tandem repeats of 1-6 nt motifs present in all genomes, particularly eukaryotes. Besides their usefulness as genome markers, SSRs have been shown to perform important regulatory functions, and variations in their length at coding regions are linked to several disorders in humans. Microsatellites show a taxon-specific enrichment in eukaryotic genomes, and some may be functional. MSDB (Microsatellite Database) is a collection of >650 million SSRs from 6,893 species including Bacteria, Archaea, Fungi, Plants, and Animals. This database is by far the most exhaustive resource to access and analyze SSR data of multiple species. In addition to exploring data in a customizable tabular format, users can view and compare the data of multiple species simultaneously using our interactive plotting system. MSDB is developed using the Django framework and MySQL. It is freely available at http://tdb.ccmb.res.in/msdb. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Choudhary, Shashi Bhushan; Sharma, Hariom Kumar; Kumar, Arroju Anil; Maruthi, Rangappa Thimmaiah; Mitra, Jiban; Chowdhury, Isholeena; Singh, Binay Kumar; Karmakar, Pran Gobinda
2017-02-01
A total of 130 flax accessions of diverse morphotypes and worldwide origin were assessed for genetic diversity and population structure using 11 morphological traits and microsatellite markers (15 gSSRs and 7 EST-SSRs). Analysis performed after classifying these accessions on the basis of plant height, branching pattern, seed size, Indian/foreign origin into six categories called sub-populations viz. fibre type exotic, fibre type indigenous, intermediate type exotic, intermediate type indigenous, linseed type exotic and linseed type indigenous. The study assessed different diversity indices, AMOVA, population structure and included a principal coordinate analysis based on different marker systems. The highest diversity was exhibited by gSSR markers (SI=0.46; He=0.31; P=85.11). AMOVA based on all markers explained significant difference among fibre type, intermediate type and linseed type populations of flax. In terms of variation explained by different markers, EST-SSR markers (12%) better differentiated flax populations compared to morphological (9%) and gSSR (6%) markers at P=0.01. The maximum Nei's unbiased genetic distance (D=0.11) was observed between fibre type and linseed type exotic sub-populations based on EST-SSR markers. The combined structure analysis by using all markers grouped Indian fibre type accessions (63.4%) in a separate cluster along with the Indian intermediate type (48.7%), whereas Indian accessions (82.16%) of linseed type constituted an independent cluster. These findings were supported by the results of the principal coordinate analysis. Morphological markers employed in the study found complementary with microsatellite based markers in deciphering genetic diversity and population structure of the flax germplasm. Copyright © 2016 Académie des sciences. Published by Elsevier Masson SAS. All rights reserved.
Survey and Analysis of Microsatellites in the Silkworm, Bombyx mori
Prasad, M. Dharma; Muthulakshmi, M.; Madhu, M.; Archak, Sunil; Mita, K.; Nagaraju, J.
2005-01-01
We studied microsatellite frequency and distribution in 21.76-Mb random genomic sequences, 0.67-Mb BAC sequences from the Z chromosome, and 6.3-Mb EST sequences of Bombyx mori. We mined microsatellites of ≥15 bases of mononucleotide repeats and ≥5 repeat units of other classes of repeats. We estimated that microsatellites account for 0.31% of the genome of B. mori. Microsatellite tracts of A, AT, and ATT were the most abundant whereas their number drastically decreased as the length of the repeat motif increased. In general, tri- and hexanucleotide repeats were overrepresented in the transcribed sequences except TAA, GTA, and TGA, which were in excess in genomic sequences. The Z chromosome sequences contained shorter repeat types than the rest of the chromosomes in addition to a higher abundance of AT-rich repeats. Our results showed that base composition of the flanking sequence has an influence on the origin and evolution of microsatellites. Transitions/transversions were high in microsatellites of ESTs, whereas the genomic sequence had an equal number of substitutions and indels. The average heterozygosity value for 23 polymorphic microsatellite loci surveyed in 13 diverse silkmoth strains having 2–14 alleles was 0.54. Only 36 (18.2%) of 198 microsatellite loci were polymorphic between the two divergent silkworm populations and 10 (5%) loci revealed null alleles. The microsatellite map generated using these polymorphic markers resulted in 8 linkage groups. B. mori microsatellite loci were the most conserved in its immediate ancestor, B. mandarina, followed by the wild saturniid silkmoth, Antheraea assama. PMID:15371363
2011-01-01
Background Panax notoginseng (Burk) F.H. Chen is important medicinal plant of the Araliacease family. Triterpene saponins are the bioactive constituents in P. notoginseng. However, available genomic information regarding this plant is limited. Moreover, details of triterpene saponin biosynthesis in the Panax species are largely unknown. Results Using the 454 pyrosequencing technology, a one-quarter GS FLX titanium run resulted in 188,185 reads with an average length of 410 bases for P. notoginseng root. These reads were processed and assembled by 454 GS De Novo Assembler software into 30,852 unique sequences. A total of 70.2% of unique sequences were annotated by Basic Local Alignment Search Tool (BLAST) similarity searches against public sequence databases. The Kyoto Encyclopedia of Genes and Genomes (KEGG) assignment discovered 41 unique sequences representing 11 genes involved in triterpene saponin backbone biosynthesis in the 454-EST dataset. In particular, the transcript encoding dammarenediol synthase (DS), which is the first committed enzyme in the biosynthetic pathway of major triterpene saponins, is highly expressed in the root of four-year-old P. notoginseng. It is worth emphasizing that the candidate cytochrome P450 (Pn02132 and Pn00158) and UDP-glycosyltransferase (Pn00082) gene most likely to be involved in hydroxylation or glycosylation of aglycones for triterpene saponin biosynthesis were discovered from 174 cytochrome P450s and 242 glycosyltransferases by phylogenetic analysis, respectively. Putative transcription factors were detected in 906 unique sequences, including Myb, homeobox, WRKY, basic helix-loop-helix (bHLH), and other family proteins. Additionally, a total of 2,772 simple sequence repeat (SSR) were identified from 2,361 unique sequences, of which, di-nucleotide motifs were the most abundant motif. Conclusion This study is the first to present a large-scale EST dataset for P. notoginseng root acquired by next-generation sequencing (NGS) technology. The candidate genes involved in triterpene saponin biosynthesis, including the putative CYP450s and UGTs, were obtained in this study. Additionally, the identification of SSRs provided plenty of genetic makers for molecular breeding and genetics applications in this species. These data will provide information on gene discovery, transcriptional regulation and marker-assisted selection for P. notoginseng. The dataset establishes an important foundation for the study with the purpose of ensuring adequate drug resources for this species. PMID:22369100
Distefano, Gaetano; Caruso, Marco; La Malfa, Stefano; Gentile, Alessandra; Wu, Shu-Biao
2012-01-01
High resolution melting curve analysis (HRM) has been used as an efficient, accurate and cost-effective tool to detect single nucleotide polymorphisms (SNPs) or insertions or deletions (INDELs). However, its efficiency, accuracy and applicability to discriminate microsatellite polymorphism have not been extensively assessed. The traditional protocols used for SSR genotyping include PCR amplification of the DNA fragment and the separation of the fragments on electrophoresis-based platform. However, post-PCR handling processes are laborious and costly. Furthermore, SNPs present in the sequences flanking repeat motif cannot be detected by polyacrylamide-gel-electrophoresis based methods. In the present study, we compared the discriminating power of HRM with the traditional electrophoresis-based methods and provided a panel of primers for HRM genotyping in Citrus. The results showed that sixteen SSR markers produced distinct polymorphic melting curves among the Citrus spp investigated through HRM analysis. Among those, 10 showed more genotypes by HRM analysis than capillary electrophoresis owing to the presence of SNPs in the amplicons. For the SSR markers without SNPs present in the flanking region, HRM also gave distinct melting curves which detected same genotypes as were shown in capillary electrophoresis (CE) analysis. Moreover, HRM analysis allowed the discrimination of most of the 15 citrus genotypes and the resulting genetic distance analysis clustered them into three main branches. In conclusion, it has been approved that HRM is not only an efficient and cost-effective alternative of electrophoresis-based method for SSR markers, but also a method to uncover more polymorphisms contributed by SNPs present in SSRs. It was therefore suggested that the panel of SSR markers could be used in a variety of applications in the citrus biodiversity and breeding programs using HRM analysis. Furthermore, we speculate that the HRM analysis can be employed to analyse SSR markers in a wide range of applications in all other species.
Kapil, Aditi; Rai, Piyush Kant; Shanker, Asheesh
2014-01-01
Simple sequence repeats (SSRs) are regions in DNA sequence that contain repeating motifs of length 1–6 nucleotides. These repeats are ubiquitously present and are found in both coding and non-coding regions of genome. A total of 534 complete chloroplast genome sequences (as on 18 September 2014) of Viridiplantae are available at NCBI organelle genome resource. It provides opportunity to mine these genomes for the detection of SSRs and store them in the form of a database. In an attempt to properly manage and retrieve chloroplastic SSRs, we designed ChloroSSRdb which is a relational database developed using SQL server 2008 and accessed through ASP.NET. It provides information of all the three types (perfect, imperfect and compound) of SSRs. At present, ChloroSSRdb contains 124 430 mined SSRs, with majority lying in non-coding region. Out of these, PCR primers were designed for 118 249 SSRs. Tetranucleotide repeats (47 079) were found to be the most frequent repeat type, whereas hexanucleotide repeats (6414) being the least abundant. Additionally, in each species statistical analyses were performed to calculate relative frequency, correlation coefficient and chi-square statistics of perfect and imperfect SSRs. In accordance with the growing interest in SSR studies, ChloroSSRdb will prove to be a useful resource in developing genetic markers, phylogenetic analysis, genetic mapping, etc. Moreover, it will serve as a ready reference for mined SSRs in available chloroplast genomes of green plants. Database URL: www.compubio.in/chlorossrdb/ PMID:25380781
Kapil, Aditi; Rai, Piyush Kant; Shanker, Asheesh
2014-01-01
Simple sequence repeats (SSRs) are regions in DNA sequence that contain repeating motifs of length 1-6 nucleotides. These repeats are ubiquitously present and are found in both coding and non-coding regions of genome. A total of 534 complete chloroplast genome sequences (as on 18 September 2014) of Viridiplantae are available at NCBI organelle genome resource. It provides opportunity to mine these genomes for the detection of SSRs and store them in the form of a database. In an attempt to properly manage and retrieve chloroplastic SSRs, we designed ChloroSSRdb which is a relational database developed using SQL server 2008 and accessed through ASP.NET. It provides information of all the three types (perfect, imperfect and compound) of SSRs. At present, ChloroSSRdb contains 124 430 mined SSRs, with majority lying in non-coding region. Out of these, PCR primers were designed for 118 249 SSRs. Tetranucleotide repeats (47 079) were found to be the most frequent repeat type, whereas hexanucleotide repeats (6414) being the least abundant. Additionally, in each species statistical analyses were performed to calculate relative frequency, correlation coefficient and chi-square statistics of perfect and imperfect SSRs. In accordance with the growing interest in SSR studies, ChloroSSRdb will prove to be a useful resource in developing genetic markers, phylogenetic analysis, genetic mapping, etc. Moreover, it will serve as a ready reference for mined SSRs in available chloroplast genomes of green plants. Database URL: www.compubio.in/chlorossrdb/ © The Author(s) 2014. Published by Oxford University Press.
Identification of SNP and SSR Markers in Finger Millet Using Next Generation Sequencing Technologies
Gimode, Davis; Odeny, Damaris A.; de Villiers, Etienne P.; Wanyonyi, Solomon; Dida, Mathews M.; Mneney, Emmarold E.; Muchugi, Alice; Machuka, Jesse; de Villiers, Santie M.
2016-01-01
Finger millet is an important cereal crop in eastern Africa and southern India with excellent grain storage quality and unique ability to thrive in extreme environmental conditions. Since negligible attention has been paid to improving this crop to date, the current study used Next Generation Sequencing (NGS) technologies to develop both Simple Sequence Repeat (SSR) and Single Nucleotide Polymorphism (SNP) markers. Genomic DNA from cultivated finger millet genotypes KNE755 and KNE796 was sequenced using both Roche 454 and Illumina technologies. Non-organelle sequencing reads were assembled into 207 Mbp representing approximately 13% of the finger millet genome. We identified 10,327 SSRs and 23,285 non-homeologous SNPs and tested 101 of each for polymorphism across a diverse set of wild and cultivated finger millet germplasm. For the 49 polymorphic SSRs, the mean polymorphism information content (PIC) was 0.42, ranging from 0.16 to 0.77. We also validated 92 SNP markers, 80 of which were polymorphic with a mean PIC of 0.29 across 30 wild and 59 cultivated accessions. Seventy-six of the 80 SNPs were polymorphic across 30 wild germplasm with a mean PIC of 0.30 while only 22 of the SNP markers showed polymorphism among the 59 cultivated accessions with an average PIC value of 0.15. Genetic diversity analysis using the polymorphic SNP markers revealed two major clusters; one of wild and another of cultivated accessions. Detailed STRUCTURE analysis confirmed this grouping pattern and further revealed 2 sub-populations within wild E. coracana subsp. africana. Both STRUCTURE and genetic diversity analysis assisted with the correct identification of the new germplasm collections. These polymorphic SSR and SNP markers are a significant addition to the existing 82 published SSRs, especially with regard to the previously reported low polymorphism levels in finger millet. Our results also reveal an unexploited finger millet genetic resource that can be included in the regional breeding programs in order to efficiently optimize productivity. PMID:27454301
Gimode, Davis; Odeny, Damaris A; de Villiers, Etienne P; Wanyonyi, Solomon; Dida, Mathews M; Mneney, Emmarold E; Muchugi, Alice; Machuka, Jesse; de Villiers, Santie M
2016-01-01
Finger millet is an important cereal crop in eastern Africa and southern India with excellent grain storage quality and unique ability to thrive in extreme environmental conditions. Since negligible attention has been paid to improving this crop to date, the current study used Next Generation Sequencing (NGS) technologies to develop both Simple Sequence Repeat (SSR) and Single Nucleotide Polymorphism (SNP) markers. Genomic DNA from cultivated finger millet genotypes KNE755 and KNE796 was sequenced using both Roche 454 and Illumina technologies. Non-organelle sequencing reads were assembled into 207 Mbp representing approximately 13% of the finger millet genome. We identified 10,327 SSRs and 23,285 non-homeologous SNPs and tested 101 of each for polymorphism across a diverse set of wild and cultivated finger millet germplasm. For the 49 polymorphic SSRs, the mean polymorphism information content (PIC) was 0.42, ranging from 0.16 to 0.77. We also validated 92 SNP markers, 80 of which were polymorphic with a mean PIC of 0.29 across 30 wild and 59 cultivated accessions. Seventy-six of the 80 SNPs were polymorphic across 30 wild germplasm with a mean PIC of 0.30 while only 22 of the SNP markers showed polymorphism among the 59 cultivated accessions with an average PIC value of 0.15. Genetic diversity analysis using the polymorphic SNP markers revealed two major clusters; one of wild and another of cultivated accessions. Detailed STRUCTURE analysis confirmed this grouping pattern and further revealed 2 sub-populations within wild E. coracana subsp. africana. Both STRUCTURE and genetic diversity analysis assisted with the correct identification of the new germplasm collections. These polymorphic SSR and SNP markers are a significant addition to the existing 82 published SSRs, especially with regard to the previously reported low polymorphism levels in finger millet. Our results also reveal an unexploited finger millet genetic resource that can be included in the regional breeding programs in order to efficiently optimize productivity.
Van Inghelandt, Delphine; Melchinger, Albrecht E; Lebreton, Claude; Stich, Benjamin
2010-05-01
Information about the genetic diversity and population structure in elite breeding material is of fundamental importance for the improvement of crops. The objectives of our study were to (a) examine the population structure and the genetic diversity in elite maize germplasm based on simple sequence repeat (SSR) markers, (b) compare these results with those obtained from single nucleotide polymorphism (SNP) markers, and (c) compare the coancestry coefficient calculated from pedigree records with genetic distance estimates calculated from SSR and SNP markers. Our study was based on 1,537 elite maize inbred lines genotyped with 359 SSR and 8,244 SNP markers. The average number of alleles per locus, of group specific alleles, and the gene diversity (D) were higher for SSRs than for SNPs. Modified Roger's distance (MRD) estimates and membership probabilities of the STRUCTURE matrices were higher for SSR than for SNP markers but the germplasm organization in four heterotic pools was consistent with STRUCTURE results based on SSRs and SNPs. MRD estimates calculated for the two marker systems were highly correlated (0.87). Our results suggested that the same conclusions regarding the structure and the diversity of heterotic pools could be drawn from both markers types. Furthermore, although our results suggested that the ratio of the number of SSRs and SNPs required to obtain MRD or D estimates with similar precision is not constant across the various precision levels, we propose that between 7 and 11 times more SNPs than SSRs should be used for analyzing population structure and genetic diversity.
Microsatellites for the genus Cucurbita and an SSR-based genetic linkage map of Cucurbita pepo L.
Gong, L.; Stift, G.; Kofler, R.; Pachner, M.
2008-01-01
Until recently, only a few microsatellites have been available for Cucurbita, thus their development is highly desirable. The Austrian oil-pumpkin variety Gleisdorfer Ölkürbis (C. pepo subsp. pepo) and the C. moschata cultivar Soler (Puerto Rico) were used for SSR development. SSR-enriched partial genomic libraries were established and 2,400 clones were sequenced. Of these 1,058 (44%) contained an SSR at least four repeats long. Primers were designed for 532 SSRs; 500 primer pairs produced fragments of expected size. Of these, 405 (81%) amplified polymorphic fragments in a set of 12 genotypes: three C. moschata, one C. ecuadorensis, and eight C. pepo representing all eight cultivar groups. On an average, C. pepo and C. moschata produced 3.3 alleles per primer pair, showing high inter-species transferability. There were 187 SSR markers detecting polymorphism between the USA oil-pumpkin variety “Lady Godiva” (O5) and the Italian crookneck variety “Bianco Friulano” (CN), which are the parents of our previous F2 mapping population. It has been used to construct the first published C. pepo map, containing mainly RAPD and AFLP markers. Now the updated map comprises 178 SSRs, 244 AFLPs, 230 RAPDs, five SCARs, and two morphological traits (h and B). It contains 20 linkage groups with a map density of 2.9 cM. The observed genome coverage (Co) is 86.8%. Electronic supplementary material The online version of this article (doi:10.1007/s00122-008-0750-2) contains supplementary material, which is available to authorized users. PMID:18379753
NASA Astrophysics Data System (ADS)
Jiang, Qun; Li, Qi; Yu, Hong; Kong, Lingfeng
2011-06-01
The sea cucumber Apostichopus japonicus is a commercially and ecologically important species in China. A total of 3056 potential unigenes were generated after assembling 7597 A. japonicus expressed sequence tags (ESTs) downloaded from Gen-Bank. Two hundred and fifty microsatellite-containing ESTs (8.18%) and 299 simple sequence repeats (SSRs) were detected. The average density of SSRs was 1 per 7.403 kb of EST after redundancy elimination. Di-nucleotide repeat motifs appeared to be the most abundant type with a percentage of 69.90%. Of the 126 primer pairs designed, 90 amplified the expected products and 43 showed polymorphism in 30 individuals tested. The number of alleles per locus ranged from 2 to 26 with an average of 7.0 alleles, and the observed and expected heterozygosities varied from 0.067 to 1.000 and from 0.066 to 0.959, respectively. These new EST-derived microsatellite markers would provide sufficient polymorphism for population genetic studies and genome mapping of this sea cucumber species.
Martins, Aline R.; Abreu, Aluana G.; Bajay, Miklos M.; Villela, Priscilla M. S.; Batista, Carlos E. A.; Monteiro, Mariza; Alves-Pereira, Alessandro; Figueira, Glyn M.; Pinheiro, José B.; Appezzato-da-Glória, Beatriz; Zucchi, Maria I.
2013-01-01
• Premise of the study: A new set of microsatellite or simple sequence repeat (SSR) markers were developed for Smilax brasiliensis, which is popularly known as sarsaparilla and used in folk medicine as a tonic, antirheumatic, and antisyphilitic. Smilax brasiliensis is sold in Brazilian pharmacies, and its origin and effectiveness are not subject to quality control. • Methods and Results: Using a protocol for genomic library enrichment, primer pairs were developed for 26 microsatellite loci and validated in 17 accessions of S. brasiliensis. Thirteen loci were polymorphic and four were monomorphic. The primers successfully amplified alleles in the congeners S. campestris, S. cissoides, S. fluminensis, S. goyazana, S. polyantha, S. quinquenervia, S. rufescens, S. subsessiliflora, and S. syphilitica. • Conclusions: The new SSR markers described herein are informative tools for genetic diversity and gene flow studies in S. brasiliensis and several congeners. PMID:25202555
Williams, Kelly P.
2003-01-01
A partial screen for genetic elements integrated into completely sequenced bacterial genomes shows more significant bias in specificity for the tmRNA gene (ssrA) than for any type of tRNA gene. Horizontal gene transfer, a major avenue of bacterial evolution, was assessed by focusing on elements using this single attachment locus. Diverse elements use ssrA; among enterobacteria alone, at least four different integrase subfamilies have independently evolved specificity for ssrA, and almost every strain analyzed presents a unique set of integrated elements. Even elements using essentially the same integrase can be very diverse, as is a group with an ssrA-specific integrase of the P4 subfamily. This same integrase appears to promote damage routinely at attachment sites, which may be adaptive. Elements in arrays can recombine; one such event mediated by invertible DNA segments within neighboring elements likely explains the monophasic nature of Salmonella enterica serovar Typhi. One of a limited set of conserved sequences occurs at the attachment site of each enterobacterial element, apparently serving as a transcriptional terminator for ssrA. Elements were usually found integrated into tRNA-like sequence at the 3′ end of ssrA, at subsites corresponding to those used in tRNA genes; an exception was found at the non-tRNA-like 3′ end produced by ssrA gene permutation in cyanobacteria, suggesting that, during the evolution of new site specificity by integrases, tropism toward a conserved 3′ end of an RNA gene may be as strong as toward a tRNA-like sequence. The proximity of ssrA and smpB, which act in concert, was also surveyed. PMID:12533482
Zhao, Jiaojiao; Huang, Li; Ren, Xiaoping; Pandey, Manish K; Wu, Bei; Chen, Yuning; Zhou, Xiaojing; Chen, Weigang; Xia, Youlin; Li, Zeqing; Luo, Huaiyong; Lei, Yong; Varshney, Rajeev K; Liao, Boshou; Jiang, Huifang
2017-01-01
Cultivated peanut ( Arachis hypogaea L.) is an allotetraploid (AABB, 2 n = 4 x = 40), valued for its edible oil and digestible protein. Seed size and weight are important agronomical traits significantly influence the yield and nutritional composition of peanut. However, the genetic basis of seed-related traits remains ambiguous. Association mapping is a powerful approach for quickly and efficiently exploring the genetic basis of important traits in plants. In this study, a total of 104 peanut accessions were used to identify molecular markers associated with seed-related traits using 554 single-locus simple sequence repeat (SSR) markers. Most of the accessions had no or weak relationship in the peanut panel. The linkage disequilibrium (LD) decayed with the genetic distance of 1cM at the genome level and the LD of B subgenome decayed faster than that of the A subgenome. Large phenotypic variation was observed for four seed-related traits in the association panel. Using mixed linear model with population structure and kinship, a total of 30 significant SSR markers were detected to be associated with four seed-related traits ( P < 1.81 × 10 -3 ) in different environments, which explained 11.22-32.30% of the phenotypic variation for each trait. The marker AHGA44686 was simultaneously and repeatedly associated with seed length and hundred-seed weight in multiple environments with large phenotypic variance (26.23 ∼ 32.30%). The favorable alleles of associated markers for each seed-related trait and the optimal combination of favorable alleles of associated markers were identified to significantly enhance trait performance, revealing a potential of utilization of these associated markers in peanut breeding program.
Fernández, Luis; de Haro, Luis Alejandro; Distefano, Ana J; Carolina Martínez, Maria; Lía, Verónica; Papa, Juan C; Olea, Ignacio; Tosto, Daniela; Esteban Hopp, Horacio
2013-01-01
Single sequence repeats (SSR) developed for Sorghum bicolor were used to characterize the genetic distance of 46 different Sorghum halepense (Johnsongrass) accessions from Argentina some of which have evolved toward glyphosate resistance. Since Johnsongrass is an allotetraploid and only one subgenome is homologous to cultivated sorghum, some SSR loci amplified up to two alleles while others (presumably more conserved loci) amplified up to four alleles. Twelve SSR providing information of 24 loci representative of Johnsongrass genome were selected for genetic distance characterization. All of them were highly polymorphic, which was evidenced by the number of different alleles found in the samples studied, in some of them up to 20. UPGMA and Mantel analysis showed that Johnsongrass glyphosate-resistant accessions that belong to different geographic regions do not share similar genetic backgrounds. In contrast, they show closer similarity to their neighboring susceptible counterparts. Discriminant Analysis of Principal Components using the clusters identified by K-means support the lack of a clear pattern of association among samples and resistance status or province of origin. Consequently, these results do not support a single genetic origin of glyphosate resistance. Nucleotide sequencing of the 5-enolpyruvylshikimate-3-phosphate synthase (EPSPS) encoding gene from glyphosate-resistant and susceptible accessions collected from different geographic origins showed that none presented expected mutations in aminoacid positions 101 and 106 which are diagnostic of target-site resistance mechanism. PMID:24223277
Al-Khalifah, Nasser S; Shanavaskhan, A E
2017-01-01
Ambiguity in the total number of date palm cultivars across the world is pointing toward the necessity for an enumerative study using standard morphological and molecular markers. Among molecular markers, DNA markers are more suitable and ubiquitous to most applications. They are highly polymorphic in nature, frequently occurring in genomes, easy to access, and highly reproducible. Various molecular markers such as restriction fragment length polymorphism (RFLP), amplified fragment length polymorphism (AFLP), simple sequence repeats (SSR), inter-simple sequence repeats (ISSR), and random amplified polymorphic DNA (RAPD) markers have been successfully used as efficient tools for analysis of genetic variation in date palm. This chapter explains a stepwise protocol for extracting total genomic DNA from date palm leaves. A user-friendly protocol for RAPD analysis and a table showing the primers used in different molecular techniques that produce polymorphisms in date palm are also provided.
Guo, Yuanwen; Wu, Yanqi; Anderson, Jeff A.; Moss, Justin Q.; Zhu, Lan
2015-01-01
Common bermudagrass [C. dactylon (L.) Pers. var. dactylon] is economically and environmentally the most important member among Cynodon species because of its extensive use for turf, forage and soil erosion control in the world. However, information regarding the inheritance within the taxon is limited. Accordingly, the objective of this study was to determine qualitative inheritance mode in common bermudagrass. Two tetraploid (2n = 4x = 36), first-generation selfed (S1) populations, 228 progenies of ‘Zebra’ and 273 from A12359, were analyzed for segregation with 21 and 12 simple sequence repeat (SSR) markers, respectively. It is concluded that the inheritance mode of tetraploid bermudagrass was complete or near complete disomic. It is evident that the two bermudagrass parents had an allotetraploid genome with two distinct subgenomes since 33 SSR primer pairs amplified 34 loci, each having two alleles. Severe transmission ratio distortions occurred in the Zebra population while less so in the A12359 population. The findings of disomic inheritance and segregation ratio distortion in common bermudagrass is significant in subsequent linkage map construction, quantitative trait locus mapping and marker-assisted selection in the species. PMID:26295707
Guo, Yuanwen; Wu, Yanqi; Anderson, Jeff A; Moss, Justin Q; Zhu, Lan
2015-01-01
Common bermudagrass [C. dactylon (L.) Pers. var. dactylon] is economically and environmentally the most important member among Cynodon species because of its extensive use for turf, forage and soil erosion control in the world. However, information regarding the inheritance within the taxon is limited. Accordingly, the objective of this study was to determine qualitative inheritance mode in common bermudagrass. Two tetraploid (2n = 4x = 36), first-generation selfed (S1) populations, 228 progenies of 'Zebra' and 273 from A12359, were analyzed for segregation with 21 and 12 simple sequence repeat (SSR) markers, respectively. It is concluded that the inheritance mode of tetraploid bermudagrass was complete or near complete disomic. It is evident that the two bermudagrass parents had an allotetraploid genome with two distinct subgenomes since 33 SSR primer pairs amplified 34 loci, each having two alleles. Severe transmission ratio distortions occurred in the Zebra population while less so in the A12359 population. The findings of disomic inheritance and segregation ratio distortion in common bermudagrass is significant in subsequent linkage map construction, quantitative trait locus mapping and marker-assisted selection in the species.
Li, Jingjian; Xiong, Chao; He, Xia; Lu, Zhaocen; Zhang, Xin; Chen, Xiaoyang; Sun, Wei
2018-01-01
Traditional herbal medicines have played important roles in the ways of life of people around the world since ancient times. Despite the advanced medical technology of the modern world, herbal medicines are still used as popular alternatives to synthetic drugs. Due to the increasing demand for herbal medicines, plant species identification has become an important tool to prevent substitution and adulteration. Here we propose a method for biological assessment of the quality of prescribed species in the Chinese Pharmacopoeia by use of high resolution melting (HRM) analysis of microsatellite loci. We tested this method on licorice, a traditional herbal medicine with a long history. Results showed that nine simple sequence repeat (SSR) markers produced distinct melting curve profiles for the five licorice species investigated using HRM analysis. These results were validated by capillary electrophoresis. We applied this protocol to commercially available licorice products, thus enabling the consistent identification of 11 labels with non-declared Glycyrrhiza species. This novel strategy may thus facilitate DNA barcoding as a method of identification of closely related species in herbal medicine products. Based on this study, a brief operating procedure for using the SSR-HRM protocol for herbal authentication is provided.
Li, Jingjian; Xiong, Chao; He, Xia; Lu, Zhaocen; Zhang, Xin; Chen, Xiaoyang; Sun, Wei
2018-01-01
Traditional herbal medicines have played important roles in the ways of life of people around the world since ancient times. Despite the advanced medical technology of the modern world, herbal medicines are still used as popular alternatives to synthetic drugs. Due to the increasing demand for herbal medicines, plant species identification has become an important tool to prevent substitution and adulteration. Here we propose a method for biological assessment of the quality of prescribed species in the Chinese Pharmacopoeia by use of high resolution melting (HRM) analysis of microsatellite loci. We tested this method on licorice, a traditional herbal medicine with a long history. Results showed that nine simple sequence repeat (SSR) markers produced distinct melting curve profiles for the five licorice species investigated using HRM analysis. These results were validated by capillary electrophoresis. We applied this protocol to commercially available licorice products, thus enabling the consistent identification of 11 labels with non-declared Glycyrrhiza species. This novel strategy may thus facilitate DNA barcoding as a method of identification of closely related species in herbal medicine products. Based on this study, a brief operating procedure for using the SSR-HRM protocol for herbal authentication is provided. PMID:29740326
The complete chloroplast genome of Capsicum annuum var. glabriusculum using Illumina sequencing.
Raveendar, Sebastin; Na, Young-Wang; Lee, Jung-Ro; Shim, Donghwan; Ma, Kyung-Ho; Lee, Sok-Young; Chung, Jong-Wook
2015-07-20
Chloroplast (cp) genome sequences provide a valuable source for DNA barcoding. Molecular phylogenetic studies have concentrated on DNA sequencing of conserved gene loci. However, this approach is time consuming and more difficult to implement when gene organization differs among species. Here we report the complete re-sequencing of the cp genome of Capsicum pepper (Capsicum annuum var. glabriusculum) using the Illumina platform. The total length of the cp genome is 156,817 bp with a 37.7% overall GC content. A pair of inverted repeats (IRs) of 50,284 bp were separated by a small single copy (SSC; 18,948 bp) and a large single copy (LSC; 87,446 bp). The number of cp genes in C. annuum var. glabriusculum is the same as that in other Capsicum species. Variations in the lengths of LSC; SSC and IR regions were the main contributors to the size variation in the cp genome of this species. A total of 125 simple sequence repeat (SSR) and 48 insertions or deletions variants were found by sequence alignment of Capsicum cp genome. These findings provide a foundation for further investigation of cp genome evolution in Capsicum and other higher plants.
Le Cunff, Loïc; Fournier-Level, Alexandre; Laucou, Valérie; Vezzulli, Silvia; Lacombe, Thierry; Adam-Blondon, Anne-Françoise; Boursiquot, Jean-Michel; This, Patrice
2008-01-01
Background The first high quality draft of the grape genome sequence has just been published. This is a critical step in accessing all the genes of this species and increases the chances of exploiting the natural genetic diversity through association genetics. However, our basic knowledge of the extent of allelic variation within the species is still not sufficient. Towards this goal, we constructed nested genetic core collections (G-cores) to capture the simple sequence repeat (SSR) diversity of the grape cultivated compartment (Vitis vinifera L. subsp. sativa) from the world's largest germplasm collection (Domaine de Vassal, INRA Hérault, France), containing 2262 unique genotypes. Results Sub-samples of 12, 24, 48 and 92 varieties of V. vinifera L. were selected based on their genotypes for 20 SSR markers using the M-strategy. They represent respectively 58%, 73%, 83% and 100% of total SSR diversity. The capture of allelic diversity was analyzed by sequencing three genes scattered throughout the genome on 233 individuals: 41 single nucleotide polymorphisms (SNPs) were identified using the G-92 core (one SNP for every 49 nucleotides) while only 25 were observed using a larger sample of 141 individuals selected on the basis of 50 morphological traits, thus demonstrating the reliability of the approach. Conclusion The G-12 and G-24 core-collections displayed respectively 78% and 88% of the SNPs respectively, and are therefore of great interest for SNP discovery studies. Furthermore, the nested genetic core collections satisfactorily reflected the geographic and the genetic diversity of grape, which are also of great interest for the study of gene evolution in this species. PMID:18384667
Obaid, Ramiz; Abu-Qaoud, Hassan; Arafeh, Rami
2014-01-01
Eight accessions of olive trees from three common varieties in Palestine, Nabali Baladi, Nabali Mohassan and Surri, were genetically evaluated using five simple sequence repeat (SSR) markers. A total of 17 alleles from 5 loci were observed in which 15 (88.2%) were polymorphic and 2 (11.8%) were monomorphic. An average of 3.4 alleles per locus was found ranging from 2.0 alleles with the primers GAPU-103 and DCA-9 to 5.0 alleles with U9932 and DCA-16. The smallest amplicon size observed was 50 bp with the primer DCA-16, whereas the largest one (450 bp) with the primer U9932. Cluster analysis with the unweighted pair group method with arithmetic average (UPGMA) showed three clusters: a cluster with four accessions from the ‘Nabali Baladi’ cultivar, another cluster with three accessions that represents the ‘Nabali Mohassen’ cultivar and finally the ‘Surri’ cultivar. The similarity coefficient for the eight olive tree samples ranged from a maximum of 100% between two accessions from Nabali Baladi and also in two other samples from Nabali Mohassan, to a minimum similarity coefficient (0.315) between the Surri and two Nabali Baladi accessions. The results in this investigation clearly highlight the genetic dissimilarity between the three main olive cultivars that have been misidentified and mixed up in the past, based on conventional morphological characters. PMID:26019564
Cao, Qianjin; Lu, Bao-Rong; Xia, Hui; Rong, Jun; Sala, Francesco; Spada, Alberto; Grassi, Fabrizio
2006-12-01
Weedy rice (Oryza sativa f. spontanea) is one of the most notorious weeds occurring in rice-planting areas worldwide. The objectives of this study are to determine the genetic diversity and differentiation of weedy rice populations from Liaoning Province in North-eastern China and to explore the possible origin of these weedy populations by comparing their genetic relationships with rice varieties (O. sativa) and wild rice (O. rufipogon) from different sources. Simple sequence repeat (SSR) markers were used to estimate the genetic diversity of 30 weedy rice populations from Liaoning, each containing about 30 individuals, selected rice varieties and wild O. rufipogon. Genetic differentiation and the relationships of weedy rice populations were analysed using cluster analysis (UPGMA) and principle component analysis (PCA). The overall genetic diversity of weedy rice populations from Liaoning was relatively high (H(e) = 0.313, I = 0.572), with about 35 % of the genetic variation found among regions. The Liaoning weedy rice populations were closely related to rice varieties from Liaoning and japonica varieties from other regions but distantly related to indica rice varieties and wild O. rufipogon. Weedy rice populations from Liaoning are considerably variable genetically and most probably originated from Liaoning rice varieties by mutation and intervarietal hybrids. Recent changes in farming practices and cultivation methods along with less weed management may have promoted the re-emergence and divergence of weedy rice in North-eastern China.
2012-01-01
Background Molecular breeding of pepper (Capsicum spp.) can be accelerated by developing DNA markers associated with transcriptomes in breeding germplasm. Before the advent of next generation sequencing (NGS) technologies, the majority of sequencing data were generated by the Sanger sequencing method. By leveraging Sanger EST data, we have generated a wealth of genetic information for pepper including thousands of SNPs and Single Position Polymorphic (SPP) markers. To complement and enhance these resources, we applied NGS to three pepper genotypes: Maor, Early Jalapeño and Criollo de Morelos-334 (CM334) to identify SNPs and SSRs in the assembly of these three genotypes. Results Two pepper transcriptome assemblies were developed with different purposes. The first reference sequence, assembled by CAP3 software, comprises 31,196 contigs from >125,000 Sanger-EST sequences that were mainly derived from a Korean F1-hybrid line, Bukang. Overlapping probes were designed for 30,815 unigenes to construct a pepper Affymetrix GeneChip® microarray for whole genome analyses. In addition, custom Python scripts were used to identify 4,236 SNPs in contigs of the assembly. A total of 2,489 simple sequence repeats (SSRs) were identified from the assembly, and primers were designed for the SSRs. Annotation of contigs using Blast2GO software resulted in information for 60% of the unigenes in the assembly. The second transcriptome assembly was constructed from more than 200 million Illumina Genome Analyzer II reads (80–120 nt) using a combination of Velvet, CLC workbench and CAP3 software packages. BWA, SAMtools and in-house Perl scripts were used to identify SNPs among three pepper genotypes. The SNPs were filtered to be at least 50 bp from any intron-exon junctions as well as flanking SNPs. More than 22,000 high-quality putative SNPs were identified. Using the MISA software, 10,398 SSR markers were also identified within the Illumina transcriptome assembly and primers were designed for the identified markers. The assembly was annotated by Blast2GO and 14,740 (12%) of annotated contigs were associated with functional proteins. Conclusions Before availability of pepper genome sequence, assembling transcriptomes of this economically important crop was required to generate thousands of high-quality molecular markers that could be used in breeding programs. In order to have a better understanding of the assembled sequences and to identify candidate genes underlying QTLs, we annotated the contigs of Sanger-EST and Illumina transcriptome assemblies. These and other information have been curated in a database that we have dedicated for pepper project. PMID:23110314
An integrated molecular cytogenetic map of Cucumis sativus L. chromosome 2.
Han, Yonghua; Zhang, Zhonghua; Huang, Sanwen; Jin, Weiwei
2011-01-27
Integration of molecular, genetic and cytological maps is still a challenge for most plant species. Recent progress in molecular and cytogenetic studies created a basis for developing integrated maps in cucumber (Cucumis sativus L.). In this study, eleven fosmid clones and three plasmids containing 45S rDNA, the centromeric satellite repeat Type III and the pericentriomeric repeat CsRP1 sequences respectively were hybridized to cucumber metaphase chromosomes to assign their cytological location on chromosome 2. Moreover, an integrated molecular cytogenetic map of cucumber chromosomes 2 was constructed by fluorescence in situ hybridization (FISH) mapping of 11 fosmid clones together with the cucumber centromere-specific Type III sequence on meiotic pachytene chromosomes. The cytogenetic map was fully integrated with genetic linkage map since each fosmid clone was anchored by a genetically mapped simple sequence repeat marker (SSR). The relationship between the genetic and physical distances along chromosome was analyzed. Recombination was not evenly distributed along the physical length of chromosome 2. Suppression of recombination was found in centromeric and pericentromeric regions. Our results also indicated that the molecular markers composing the linkage map for chromosome 2 provided excellent coverage of the chromosome.
Genetic relationships among seven sections of genus Arachis studied by using SSR markers
2010-01-01
Background The genus Arachis, originated in South America, is divided into nine taxonomical sections comprising of 80 species. Most of the Arachis species are diploids (2n = 2x = 20) and the tetraploid species (2n = 2x = 40) are found in sections Arachis, Extranervosae and Rhizomatosae. Diploid species have great potential to be used as resistance sources for agronomic traits like pests and diseases, drought related traits and different life cycle spans. Understanding of genetic relationships among wild species and between wild and cultivated species will be useful for enhanced utilization of wild species in improving cultivated germplasm. The present study was undertaken to evaluate genetic relationships among species (96 accessions) belonging to seven sections of Arachis by using simple sequence repeat (SSR) markers developed from Arachis hypogaea genomic library and gene sequences from related genera of Arachis. Results The average transferability rate of 101 SSR markers tested to section Arachis and six other sections was 81% and 59% respectively. Five markers (IPAHM 164, IPAHM 165, IPAHM 407a, IPAHM 409, and IPAHM 659) showed 100% transferability. Cluster analysis of allelic data from a subset of 32 SSR markers on 85 wild and 11 cultivated accessions grouped accessions according to their genome composition, sections and species to which they belong. A total of 109 species specific alleles were detected in different wild species, Arachis pusilla exhibited largest number of species specific alleles (15). Based on genetic distance analysis, the A-genome accession ICG 8200 (A. duranensis) and the B-genome accession ICG 8206 (A. ipaënsis) were found most closely related to A. hypogaea. Conclusion A set of cross species and cross section transferable SSR markers has been identified that will be useful for genetic studies of wild species of Arachis, including comparative genome mapping, germplasm analysis, population genetic structure and phylogenetic inferences among species. The present study provides strong support based on both genomic and genic markers, probably for the first time, on relationships of A. monticola and A. hypogaea as well as on the most probable donor of A and B-genomes of cultivated groundnut. PMID:20089171
Shirasawa, Kenta; Hand, Melanie L; Henderson, Steven T; Okada, Takashi; Johnson, Susan D; Taylor, Jennifer M; Spriggs, Andrew; Siddons, Hayley; Hirakawa, Hideki; Isobe, Sachiko; Tabata, Satoshi; Koltunow, Anna M G
2015-03-01
Apomixis in plants generates clonal progeny with a maternal genotype through asexual seed formation. Hieracium subgenus Pilosella (Asteraceae) contains polyploid, highly heterozygous apomictic and sexual species. Within apomictic Hieracium, dominant genetic loci independently regulate the qualitative developmental components of apomixis. In H. praealtum, LOSS OF APOMEIOSIS (LOA) enables formation of embryo sacs without meiosis and LOSS OF PARTHENOGENESIS (LOP) enables fertilization-independent seed formation. A locus required for fertilization-independent endosperm formation (AutE) has been identified in H. piloselloides. Additional quantitative loci appear to influence the penetrance of the qualitative loci, although the controlling genes remain unknown. This study aimed to develop the first genetic linkage maps for sexual and apomictic Hieracium species using simple sequence repeat (SSR) markers derived from expressed transcripts within the developing ovaries. RNA from microdissected Hieracium ovule cell types and ovaries was sequenced and SSRs were identified. Two different F1 mapping populations were created to overcome difficulties associated with genome complexity and asexual reproduction. SSR markers were analysed within each mapping population to generate draft linkage maps for apomictic and sexual Hieracium species. A collection of 14 684 Hieracium expressed SSR markers were developed and linkage maps were constructed for Hieracium species using a subset of the SSR markers. Both the LOA and LOP loci were successfully assigned to linkage groups; however, AutE could not be mapped using the current populations. Comparisons with lettuce (Lactuca sativa) revealed partial macrosynteny between the two Asteraceae species. A collection of SSR markers and draft linkage maps were developed for two apomictic and one sexual Hieracium species. These maps will support cloning of controlling genes at LOA and LOP loci in Hieracium and should also assist with identification of quantitative loci that affect the expressivity of apomixis. Future work will focus on mapping AutE using alternative populations. © The Author 2014. Published by Oxford University Press on behalf of the Annals of Botany Company. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Du, Wanli; Wang, Jing; Pang, Yuhui; Wang, Liangming; Wu, Jun; Zhao, Jixin; Yang, Qunhui; Chen, Xinhong
2014-01-01
We isolated a wheat germplasm line, 22-2, which was derived from common wheat (Triticum aestivum '7182') and Psathyrostachys huashanica 'Keng' (2n = 2x = 14, NsNs). Genomic composition and homoeologous relationships of 22-2 was analyzed using cytology, genomic in situ hybridization (GISH), EST-SSR, and EST-STS to characterize the alien chromatin in the transfer line. The cytological investigations showed that the chromosome number and configuration were 2n = 44 = 22 II. Mitotic and meiotic GISH using P. huashanica genomic DNA as the probe indicated that 22-2 contained a pair of P. huashanica chromosomes. The genomic affinities of the introduced P. huashanica chromosomes were determined by EST-SSR and EST-STS using multiple-loci markers from seven wheat homoeologous groups between the parents and addition line. One EST-SSR and 17 EST-STS markers, which were located on the homoeologous group 3 chromosomes of wheat, amplified polymorphic bands in 22-2 that were unique to P. huashanica. Thus, these markers suggested that the introduced Ns chromosome pair belonged to homoeologous group 3, so we designated 22-2 as a 3Ns disomic addition line. Based on disease reaction to mixed races (CYR31, CYR32, and Shuiyuan14) of stripe rust in the adult stages, 22-2 was found to have high resistance to stripe rust, which was possibly derived from its P. huashanica parent. Consequently, the new disomic addition line 22-2 could be a valuable donor source for wheat improvement depending on the excellent agronomic traits, especially, the introduction of novel disease resistance genes into wheat during breeding programs.
Liu, Ruifang; Koyanagi, Kanako O; Chen, Sunlu; Kishima, Yuji
2012-12-01
In plant genomes, the incorporation of DNA segments is not a common method of artificial gene transfer. Nevertheless, various segments of pararetroviruses have been found in plant genomes in recent decades. The rice genome contains a number of segments of endogenous rice tungro bacilliform virus-like sequences (ERTBVs), many of which are present between AT dinucleotide repeats (ATrs). Comparison of genomic sequences between two closely related rice subspecies, japonica and indica, allowed us to verify the preferential insertion of ERTBVs into ATrs. In addition to ERTBVs, the comparative analyses showed that ATrs occasionally incorporate repeat sequences including transposable elements, and a wide range of other sequences. Besides the known genomic sequences, the insertion sequences also represented DNAs of unclear origins together with ERTBVs, suggesting that ATrs have integrated episomal DNAs that would have been suspended in the nucleus. Such insertion DNAs might be trapped by ATrs in the genome in a host-dependent manner. Conversely, other simple mono- and dinucleotide sequence repeats (SSR) were less frequently involved in insertion events relative to ATrs. Therefore, ATrs could be regarded as hot spots of double-strand breaks that induce non-homologous end joining. The insertions within ATrs occasionally generated new gene-related sequences or involved structural modifications of existing genes. Likewise, in a comparison between Arabidopsis thaliana and Arabidopsis lyrata, the insertions preferred ATrs to other SSRs. Therefore ATrs in plant genomes could be considered as genomic dumping sites that have trapped various DNA molecules and may have exerted a powerful evolutionary force. © 2012 The Authors. The Plant Journal © 2012 Blackwell Publishing Ltd.
Van Inghelandt, Delphine; Melchinger, Albrecht E.; Lebreton, Claude
2010-01-01
Information about the genetic diversity and population structure in elite breeding material is of fundamental importance for the improvement of crops. The objectives of our study were to (a) examine the population structure and the genetic diversity in elite maize germplasm based on simple sequence repeat (SSR) markers, (b) compare these results with those obtained from single nucleotide polymorphism (SNP) markers, and (c) compare the coancestry coefficient calculated from pedigree records with genetic distance estimates calculated from SSR and SNP markers. Our study was based on 1,537 elite maize inbred lines genotyped with 359 SSR and 8,244 SNP markers. The average number of alleles per locus, of group specific alleles, and the gene diversity (D) were higher for SSRs than for SNPs. Modified Roger’s distance (MRD) estimates and membership probabilities of the STRUCTURE matrices were higher for SSR than for SNP markers but the germplasm organization in four heterotic pools was consistent with STRUCTURE results based on SSRs and SNPs. MRD estimates calculated for the two marker systems were highly correlated (0.87). Our results suggested that the same conclusions regarding the structure and the diversity of heterotic pools could be drawn from both markers types. Furthermore, although our results suggested that the ratio of the number of SSRs and SNPs required to obtain MRD or D estimates with similar precision is not constant across the various precision levels, we propose that between 7 and 11 times more SNPs than SSRs should be used for analyzing population structure and genetic diversity. Electronic supplementary material The online version of this article (doi:10.1007/s00122-009-1256-2) contains supplementary material, which is available to authorized users. PMID:20063144
Carvalho, S I C; Ragassi, C F; Oliveira, I B; Amaral, Z P S; Reifschneider, F J B; Faleiro, F G; Buso, G S C
2015-07-17
In order to support further genetic, diversity, and phylogeny studies of Capsicum species, the transferability of a Capsicum annuum L. simple sequence repeat (SSR) microsatellite set was analyzed for C. frutescens L. ("malagueta" and "tabasco" peppers) and C. chinense Jacq. (smell peppers, among other types). A total of 185 SSR primers were evaluated in 12 accessions from 115 C. frutescens L. and 480 C. chinense Jacq, representing different types within each species. Transferability to C. frutescens L. and C. chinense Jacq. occurred for 116 primers (62.7%). Nineteen (16.37%) were polymorphic in C. frutescens L. and 36 (31.03%) in C. chinense Jacq., 17 of which were coincident and could be used to analyze samples obtained for the 2 species. Among these primers, CA49 showed a different amplitude range of alleles between the 2 species (130-132 base pairs for C. frutescens L. and 120-128 base pairs for C. chinense Jacq.), and could differentiate the species. A total of 55 alleles were identified among the 19 polymorphic SSR loci among accessions of C. frutescens L., with the number of alleles per locus ranging from 2 to 5, a mean of 2.89, and the polymorphic information content ranging from 0.30 to 0.65. The number of alleles identified in C. chinense Jacq. was 119, ranging from 2 to 5 alleles per locus, an average of 3.30, and polymorphic information content from 0.19 to 0.68. The C. annuum L. SSR primers were most often transfer-able and polymorphic for C. frutescens L. and C. chinense Jacq., and we present a set of SSR for each species.
Verification of STS markers for leaf rust resistance genes of wheat by seven European laboratories.
Błaszczyk, Lidia; Chełkowski, Jerzy; Korzun, Victor; Kraic, Jan; Ordon, Frank; Ovesná, Jaroslava; Purnhauser, Laszlo; Tar, Melinda; Vida, Gyula
2004-01-01
A set of Thatcher near-isogenic lines and two breeding lines were used to examine sequence tagged site (STS) markers linked to leaf rust resistance genes Lr9, Lr10, Lr19, Lr24, Lr28, Lr29, Lr35, and a simple sequenced repeat (SSR) marker for Lr39. The selected STS markers for resistance genes Lr9, Lr10, Lr19, Lr24 and Lr28 were identified in seven accessions by seven European laboratories. Near-isogenic lines of the spring wheat Thatcher were used as positive controls. Markers for resistance genes Lr9, Lr10, Lr19, Lr24 were identified in all seven laboratories as amplification products of 1100 bp, 310 bp, 130 bp and 310 bp, respectively. The STS markers linked to resistance genes Lr9, Lr10, Lr19, Lr24, Lr29, Lr35 and the SSR marker for Lr39 were robust and highly specific for these genes and will be useful in marker-assisted selection in wheat. However, the amplification product of 378 bp that corresponded with resistance gene Lr28 was detected in all accessions including genotypes lacking this gene in all seven laboratories. This marker needs to be improved.
Herrmann, Luise; Felbinger, Christine; Haase, Ilka; Rudolph, Barbara; Biermann, Bernhard; Fischer, Markus
2015-05-13
The cocoa type "Colección Castro Naranjal 51" (CCN-51) is known for its resistance to specific climate conditions and its high yield, but it shows a weaker flavor profile and therefore is marketed as bulk cocoa. In a previous study, the two cocoa types Arriba and CCN-51 could easily be distinguished, but differences among the CCN-51 samples were observed. This was unexpected, as CCN-51 is reported to be a clone. To confirm whether CCN-51 is a pure clone, 10 simple sequence repeats (SSR) located on the nuclear genome were used to analyze various CCN-51 samples in comparison to the cocoa varieties Arriba and Criollo. As expected, there are differences in the SSR pattern among CCN-51, Arriba, and Criollo, but a variability within the CCN-51 sample set was detected as well. The previously described sequence variation in the chloroplast genome was confirmed by a variability in the microsatellite loci of the nuclear genome for a comprehensive cultivar collection of CCN-51 of both bean and leaf samples. In summary, beneath somaclonal variation, misidentification of plant collections and also sexual reproduction of CCN-51 can be suggested.
Ge, X J; Liu, M H; Wang, W K; Schaal, B A; Chiang, T Y
2005-04-01
Both demographic history and dispersal mechanisms influence the apportionment of genetic diversity among plant populations across geographical regions. In this study, phylogeography and population structure of wild banana, Musa balbisiana, one of the progenitors of cultivated bananas and plantains in China were investigated by an analysis of genetic diversity of simple sequence repeat (SSR) fingerprint markers and cpDNA PCR-RFLP. A chloroplast DNA (cpDNA) genealogy of 21 haplotypes identified two major clades, which correspond to two geographical regions separated by the Beijiang and Xijiang rivers, suggesting a history of vicariance. Significant genetic differentiation was detected among populations with cpDNA markers, a result consistent with limited seed dispersal in wild banana mediated by foraging of rodents. Nuclear SSR data also revealed significant geographical structuring in banana populations. In western China, however, there was no detected phylogeograpahical pattern, possibly due to frequent pollen flow via fruit bats. In contrast, populations east of the Beijiang River and the population of Hainan Island, where long-range soaring pollinators are absent, are genetically distinct. Colonization-extinction processes may have influenced the evolution of Musa populations, which have a metapopulation structure and are connected by migrating individuals. Effective gene flow via pollen, estimated from the nuclear SSR data, is 3.65 times greater than gene flow via seed, estimated from cpDNA data. Chloroplast and nuclear DNAs provide different insights into phylogeographical patterns of wild banana populations and, taken together, can inform conservation practices.
Hirpara, Darshna G; Gajera, Harsukh P; Hirpara, Hitesh Z; Golakiya, Balubhai A
2017-01-01
The fungus Trichoderma is a teleomorph of the Hypocrea genus and associated with biological control of plant diseases. The microscopic, biochemical, and molecular characterization of Trichoderma was carried out and evaluated for in vitro antagonistic activity against the fungal pathogen Sclerotium rolfsii causing stem rot disease in groundnut. In total, 11 isolates of Trichoderma were examined for antagonism at 6 and 12 days after inoculation (DAI). Out of 11, T. virens NBAII Tvs12 evidenced the highest (87.91%) growth inhibition of the test pathogen followed by T. koningii MTCC 796 (67.03%), T. viride NBAII Tv23 (63.74%), and T. harzianum NBAII Th1 (60.44%). Strong mycoparasitism was observed in the best antagonist Tvs12 strain during 6-12 DAI. The specific activity of cell wall-degrading enzymes - chitinase and β-1,3-glucanase - was positively correlated with growth inhibition of the test pathogen. In total, 18 simple sequence repeat (SSR) polymorphisms were reported to amplify 202 alleles across 11 Trichoderma isolates. The average polymorphism information content for SSR markers was found to be 0.80. The best antagonist Tvs 12 was identified with 7 unique SSR alleles amplified by 5 SSR markers. Clustering patterns of 11 Trichoderma strains showed the best antagonist T. virens NBAII Tvs 12 outgrouped with a minimum 3% similarity from the rest of Trichoderma. © 2017 S. Karger AG, Basel.
Jaiswal, Sarika; Sheoran, Sonia; Arora, Vasu; Angadi, Ulavappa B; Iquebal, Mir A; Raghav, Nishu; Aneja, Bharti; Kumar, Deepender; Singh, Rajender; Sharma, Pradeep; Singh, G P; Rai, Anil; Tiwari, Ratan; Kumar, Dinesh
2017-01-01
Wheat fulfills 20% of global caloric requirement. World needs 60% more wheat for 9 billion population by 2050 but climate change with increasing temperature is projected to affect wheat productivity adversely. Trait improvement and management of wheat germplasm requires genomic resource. Simple Sequence Repeats (SSRs) being highly polymorphic and ubiquitously distributed in the genome, can be a marker of choice but there is no structured marker database with options to generate primer pairs for genotyping on desired chromosome/physical location. Previously associated markers with different wheat trait are also not available in any database. Limitations of in vitro SSR discovery can be overcome by genome-wide in silico mining of SSR. Triticum aestivum SSR database ( TaSSRDb ) is an integrated online database with three-tier architecture, developed using PHP and MySQL and accessible at http://webtom.cabgrid.res.in/wheatssr/. For genotyping, Primer3 standalone code computes primers on user request. Chromosome-wise SSR calling for all the three sub genomes along with choice of motif types is provided in addition to the primer generation for desired marker. We report here a database of highest number of SSRs (476,169) from complex, hexaploid wheat genome (~17 GB) along with previously reported 268 SSR markers associated with 11 traits. Highest (116.93 SSRs/Mb) and lowest (74.57 SSRs/Mb) SSR densities were found on 2D and 3A chromosome, respectively. To obtain homozygous locus, e-PCR was done. Such 30 loci were randomly selected for PCR validation in panel of 18 wheat Advance Varietal Trial (AVT) lines. TaSSRDb can be a valuable genomic resource tool for linkage mapping, gene/QTL (Quantitative trait locus) discovery, diversity analysis, traceability and variety identification. Varietal specific profiling and differentiation can supplement DUS (Distinctiveness, Uniformity, and Stability) testing, EDV (Essentially Derived Variety)/IV (Initial Variety) disputes, seed purity and hybrid wheat testing. All these are required in germplasm management as well as also in the endeavor of wheat productivity.
Jaiswal, Sarika; Sheoran, Sonia; Arora, Vasu; Angadi, Ulavappa B.; Iquebal, Mir A.; Raghav, Nishu; Aneja, Bharti; Kumar, Deepender; Singh, Rajender; Sharma, Pradeep; Singh, G. P.; Rai, Anil; Tiwari, Ratan; Kumar, Dinesh
2017-01-01
Wheat fulfills 20% of global caloric requirement. World needs 60% more wheat for 9 billion population by 2050 but climate change with increasing temperature is projected to affect wheat productivity adversely. Trait improvement and management of wheat germplasm requires genomic resource. Simple Sequence Repeats (SSRs) being highly polymorphic and ubiquitously distributed in the genome, can be a marker of choice but there is no structured marker database with options to generate primer pairs for genotyping on desired chromosome/physical location. Previously associated markers with different wheat trait are also not available in any database. Limitations of in vitro SSR discovery can be overcome by genome-wide in silico mining of SSR. Triticum aestivum SSR database (TaSSRDb) is an integrated online database with three-tier architecture, developed using PHP and MySQL and accessible at http://webtom.cabgrid.res.in/wheatssr/. For genotyping, Primer3 standalone code computes primers on user request. Chromosome-wise SSR calling for all the three sub genomes along with choice of motif types is provided in addition to the primer generation for desired marker. We report here a database of highest number of SSRs (476,169) from complex, hexaploid wheat genome (~17 GB) along with previously reported 268 SSR markers associated with 11 traits. Highest (116.93 SSRs/Mb) and lowest (74.57 SSRs/Mb) SSR densities were found on 2D and 3A chromosome, respectively. To obtain homozygous locus, e-PCR was done. Such 30 loci were randomly selected for PCR validation in panel of 18 wheat Advance Varietal Trial (AVT) lines. TaSSRDb can be a valuable genomic resource tool for linkage mapping, gene/QTL (Quantitative trait locus) discovery, diversity analysis, traceability and variety identification. Varietal specific profiling and differentiation can supplement DUS (Distinctiveness, Uniformity, and Stability) testing, EDV (Essentially Derived Variety)/IV (Initial Variety) disputes, seed purity and hybrid wheat testing. All these are required in germplasm management as well as also in the endeavor of wheat productivity. PMID:29234333
Pervaiz, Tariq; Sun, Xin; Zhang, Yanyi; Tao, Ran; Zhang, Junhuan; Fang, Jinggui
2015-01-16
The nuclear DNA is conventionally used to assess the diversity and relatedness among different species, but variations at the DNA genome level has also been used to study the relationship among different organisms. In most species, mitochondrial and chloroplast genomes are inherited maternally; therefore it is anticipated that organelle DNA remains completely associated. Many research studies were conducted simultaneously on organelle genome. The objectives of this study was to analyze the genetic relationship between chloroplast and mitochondrial DNA in three Chinese Prunus genotypes viz., Prunus persica, Prunus domestica, and Prunus avium. We investigated the genetic diversity of Prunus genotypes using simple sequence repeat (SSR) markers relevant to the chloroplast and mitochondria. Most of the genotypes were genetically similar as revealed by phylogenetic analysis. The Y2 Wu Xing (Cherry) and L2 Hong Xin Li (Plum) genotypes have a high similarity index (0.89), followed by Zi Ye Li (0.85), whereas; L1 Tai Yang Li (plum) has the lowest genetic similarity (0.35). In case of cpSSR, Hong Tao (Peach) and L1 Tai Yang Li (Plum) genotypes demonstrated similarity index of 0.85 and Huang Tao has the lowest similarity index of 0.50. The mtSSR nucleotide sequence analysis revealed that each genotype has similar amplicon length (509 bp) except M5Y1 i.e., 505 bp with CCB256 primer; while in case of NAD6 primer, all genotypes showed different sizes. The MEHO (Peach), MEY1 (Cherry), MEL2 (Plum) and MEL1 (Plum) have 586 bps; while MEY2 (Cherry), MEZI (Plum) and MEHU (Peach) have 585, 584 and 566 bp, respectively. The CCB256 primer showed highly conserved sequences and minute single polymorphic nucleotides with no deletion or mutation. The cpSSR (ARCP511) microsatellites showed the harmonious amplicon length. The CZI (Plum), CHO (Peach) and CL1 (Plum) showed 182 bp; whileCHU (Peach), CY2 (Cherry), CL2 (Plum) and CY1 (Cherry) showed 181 bp amplicon lengths. These results demonstrated high conservation in chloroplast and mitochondrial genome among Prunus species during the evolutionary process. These findings are valuable to study the organelle DNA diversity in different species and genotypes of Prunus to provide in depth insight in to the mitochondrial and chloroplast genomes.
Qi, L L; Long, Y M; Jan, C C; Ma, G J; Gulya, T J
2015-04-01
Pl 17, a novel downy mildew resistance gene independent of known downy mildew resistance genes in sunflowers, was genetically mapped to linkage group 4 of the sunflower genome. Downy mildew (DM), caused by Plasmopara halstedii (Farl.). Berl. et de Toni, is one of the serious sunflower diseases in the world due to its high virulence and the variability of the pathogen. DM resistance in the USDA inbred line, HA 458, has been shown to be effective against all virulent races of P. halstedii currently identified in the USA. To determine the chromosomal location of this resistance, 186 F 2:3 families derived from a cross of HA 458 with HA 234 were phenotyped for their resistance to race 734 of P. halstedii. The segregation ratio of the population supported that the resistance was controlled by a single dominant gene, Pl 17. Simple sequence repeat (SSR) and single nucleotide polymorphism (SNP) primers were used to identify molecular markers linked to Pl 17. Bulked segregant analysis using 849 SSR markers located Pl 17 to linkage group (LG) 4, which is the first DM gene discovered in this linkage group. An F2 population of 186 individuals was screened with polymorphic SSR and SNP primers from LG4. Two flanking markers, SNP SFW04052 and SSR ORS963, delineated Pl 17 in an interval of 3.0 cM. The markers linked to Pl 17 were validated in a BC3 population. A search for the physical location of flanking markers in sunflower genome sequences revealed that the Pl 17 region had a recombination frequency of 0.59 Mb/cM, which was a fourfold higher recombination rate relative to the genomic average. This region can be considered amenable to molecular manipulation for further map-based cloning of Pl 17.
Zhang, Yanjun; Du, Liuwen; Liu, Ao; Chen, Jianjun; Wu, Li; Hu, Weiming; Zhang, Wei; Kim, Kyunghee; Lee, Sang-Choon; Yang, Tae-Jin; Wang, Ying
2016-01-01
Epimedium L. is a phylogenetically and economically important genus in the family Berberidaceae. We here sequenced the complete chloroplast (cp) genomes of four Epimedium species using Illumina sequencing technology via a combination of de novo and reference-guided assembly, which was also the first comprehensive cp genome analysis on Epimedium combining the cp genome sequence of E. koreanum previously reported. The five Epimedium cp genomes exhibited typical quadripartite and circular structure that was rather conserved in genomic structure and the synteny of gene order. However, these cp genomes presented obvious variations at the boundaries of the four regions because of the expansion and contraction of the inverted repeat (IR) region and the single-copy (SC) boundary regions. The trnQ-UUG duplication occurred in the five Epimedium cp genomes, which was not found in the other basal eudicotyledons. The rapidly evolving cp genome regions were detected among the five cp genomes, as well as the difference of simple sequence repeats (SSR) and repeat sequence were identified. Phylogenetic relationships among the five Epimedium species based on their cp genomes showed accordance with the updated system of the genus on the whole, but reminded that the evolutionary relationships and the divisions of the genus need further investigation applying more evidences. The availability of these cp genomes provided valuable genetic information for accurately identifying species, taxonomy and phylogenetic resolution and evolution of Epimedium, and assist in exploration and utilization of Epimedium plants. PMID:27014326
Taheri, S; Abdullah, T L; Abdullah, N A P; Ahmad, Z; Karimi, E; Shabanimofrad, M R
2014-09-05
The genus Curcuma is a member of the ginger family (Zingiberaceae) that has recently become popular for use as flowering pot plants, both indoors and as patio and landscape plants. We used PCR-based molecular markers (SSRs) to elucidate genetic variation and relationships between five varieties of Curcuma (Curcuma alismatifolia) cultivated in Malaysia. Of the primers tested, 8 (of 17) SSR primers were selected for their reproducibility and high rates of polymorphism. The number of presumed alleles revealed by the SSR analysis ranged from two to six alleles, with a mean value of 3.25 alleles per locus. The values of HO and HE ranged from 0 to 0.8 (mean value of 0.2) and 0.1837 to 0.7755 (mean value of 0.5102), respectively. Eight SSR primers yielded 26 total amplified fragments and revealed high rates of polymorphism among the varieties studied. The polymorphic information content varied from 0.26 to 0.73. Dice's similarity coefficient was calculated for all pairwise comparisons and used to construct an unweighted pair group method with arithmetic average (UPGMA) dendrogram. Similarity coefficient values from 0.2105 to 0.6667 (with an average of 0.4386) were found among the five varieties examined. A cluster analysis of data using a UPGMA algorithm divided the five varieties/hybrids into 2 groups.
Identification of molecular markers associated with mite resistance in coconut (Cocos nucifera L.).
Shalini, K V; Manjunatha, S; Lebrun, P; Berger, A; Baudouin, L; Pirany, N; Ranganath, R M; Prasad, D Theertha
2007-01-01
Coconut mite (Aceria guerreronis 'Keifer') has become a major threat to Indian coconut (Coçcos nucifera L.) cultivators and the processing industry. Chemical and biological control measures have proved to be costly, ineffective, and ecologically undesirable. Planting mite-resistant coconut cultivars is the most effective method of preventing yield loss and should form a major component of any integrated pest management stratagem. Coconut genotypes, and mite-resistant and -susceptible accessions were collected from different parts of South India. Thirty-two simple sequence repeat (SSR) and 7 RAPD primers were used for molecular analyses. In single-marker analysis, 9 SSR and 4 RAPD markers associated with mite resistance were identified. In stepwise multiple regression analysis of SSRs, a combination of 6 markers showed 100% association with mite infestation. Stepwise multiple regression analysis for RAPD data revealed that a combination of 3 markers accounted for 83.86% of mite resistance in the selected materials. Combined stepwise multiple regression analysis of RAPD and SSR data showed that a combination of 5 markers explained 100% of the association with mite resistance in coconut. Markers associated with mite resistance are important in coconut breeding programs and will facilitate the selection of mite-resistant plants at an early stage as well as mother plants for breeding programs.
Halász, Júlia; Makovics-Zsohár, Noémi; Szőke, Ferenc; Ercisli, Sezai; Hegedűs, Attila
2017-02-01
Polyploid Prunus spinosa (2n = 4×) and P. insititia (2n = 6×) represent enormous genetic potential in Central Europe, which can be exploited in breeding programmes. In Hungary, 17 cultivar candidates were selected from wild-growing populations including 10 P. spinosa, 4 P. insititia and three P. spinosa × P. domestica hybrids (2n = 5×). Their taxonomic classification was based on their phenotypic characteristics. Six simple sequence repeats (SSRs) and the multiallelic S-locus genotyping were used to characterize genetic variability and reliable identification of the tested accessions. A total of 98 SSR alleles were identified, which presents 19.5 average allele number per locus, and each of the 17 genotypes could be discriminated based on unique SSR fingerprints. A total of 23 S-RNase alleles were identified. The complete and partial S-genotype was determined for 8 and 9 accessions, respectively. The identification of a cross-incompatible pair of cultivar candidates and several semi-compatible combinations help maximize fruit set in commercial orchards. Our results indicate that the S-allele pools of wild-growing P. spinosa and P. insititia are overlapping in Hungary. A phylogenetic and principal component analysis confirmed the high level of diversity and genetic differentiation present within the analysed genotypes and helped clarify doubtful taxonomic identities. Our data confirm that S-locus genotyping is suitable for diversity studies in polyploid Prunus species. The analysed accessions represent huge genetic potential that can be exploited in commercial cultivation.
Moyib, O K; Mkumbira, J; Odunola, O A; Dixon, A G
2012-12-01
Cyanogenic potential (CNp) of cassava constitutes a serious problem for over 500 million people who rely on the crop as their main source of calories. Genetic diversity is a key to successful crop improvement for breeding new improved variability for target traits. Forty-three improved genotypes of cassava developed by International Institute of Tropical Agriculture (ITA), Ibadan, were characterized for CNp trait using 35 Simple Sequence.Repeat (SSR) markers. Essential colorimetry picric test was used for evaluation of CNp on a color scale of 1 to 14. The CNp scores obtained ranged from 3 to 9, with a mean score of 5.48 (+/- 0.09) based on Statistical Analysis System (SAS) package. TMS M98/ 0068 (4.0 +/- 0.25) was identified as the best genotype with low CNp while TMS M98/0028 (7.75 +/- 0.25) was the worst. The 43 genotypes were assigned into 7 phenotypic groups based on rank-sum analysis in SAS. Dissimilarity analysis representatives for windows generated a phylogenetic tree with 5 clusters which represented hybridizing groups. Each of the clusters (except 4) contained low CNp genotypes that could be used for improving the high CNp genotypes in the same or near cluster. The scatter plot of the genotypes showed that there was little or no demarcation for phenotypic CNp groupings in the molecular groupings. The result of this study demonstrated that SSR markers are powerful tools for the assessment of genetic variability, and proper identification and selection of parents for genetic improvement of low CNp trait among the IITA cassava collection.
NASA Astrophysics Data System (ADS)
Ma, Mengli; Lei, En; Meng, Hengling; Wang, Tiantao; Xie, Linyan; Shen, Dong; Xianwang, Zhou; Lu, Bingyue
2017-08-01
Amomum tsao-ko is a commercial plant that used for various purposes in medicinal and food industries. For the present investigation, 44 germplasm samples were collected from Jinping County of Yunnan Province. Clusters analysis and 2-dimensional principal component analysis (PCA) was used to represent the genetic relations among Amomum tsao-ko by using simple sequence repeat (SSR) markers. Clustering analysis clearly distinguished the samples groups. Two major clusters were formed; first (Cluster I) consisted of 34 individuals, the second (Cluster II) consisted of 10 individuals, Cluster I as the main group contained multiple sub-clusters. PCA also showed 2 groups: PCA Group 1 included 29 individuals, PCA Group 2 included 12 individuals, consistent with the results of cluster analysis. The purpose of the present investigation was to provide information on genetic relationship of Amomum tsao-ko germplasm resources in main producing areas, also provide a theoretical basis for the protection and utilization of Amomum tsao-ko resources.
Characterization of Phytophthora infestans populations in northwestern Algeria during 2008-2014.
Rekad, Fatma Zohra; Cooke, David Edward Llewelyn; Puglisi, Ivana; Randall, Eva; Guenaoui, Yamina; Bouznad, Zouaoui; Evoli, Maria; Pane, Antonella; Schena, Leonardo; Magnano di San Lio, Gaetano; Cacciola, Santa Olga
2017-05-01
A total of 161 Phytophthora infestans isolates, collected from infected potato and tomato plants during 2008-2014, were characterized based on mating type, metalaxyl sensitivity and polymorphism at 12 simple sequence repeat (SSR) loci, in order to investigate the population of P. infestans in the north-west of Algeria, an emerging potato production region. The majority of isolates were of A2 mating type (112 isolates). A high percentage (89 %) of resistance to metalaxyl among isolates was detected. The metalaxyl resistant phenotype was present in both mating types with a higher percentage in A2 mating type isolates. SSR-based genotypic analysis of P. infestans population showed a low diversity. Genotype 13_A2 was the predominant in the population with a frequency of 67 % followed by 2_A1 (21 %) and 23_A1 (5 %). Genotype 23_A1 was detected only in tomato and potato isolates collected in 2013 and 2014. Copyright © 2017 British Mycological Society. Published by Elsevier Ltd. All rights reserved.
De Bellis, Fabien; Malapa, Roger; Kagy, Valérie; Lebegin, Stéphane; Billot, Claire; Labouisse, Jean-Pierre
2016-08-01
Using next-generation sequencing technology, new microsatellite loci were characterized in Artocarpus altilis (Moraceae) and two congeners to increase the number of available markers for genotyping breadfruit cultivars. A total of 47,607 simple sequence repeat loci were obtained by sequencing a library of breadfruit genomic DNA with an Illumina MiSeq system. Among them, 50 single-locus markers were selected and assessed using 41 samples (39 A. altilis, one A. camansi, and one A. heterophyllus). All loci were polymorphic in A. altilis, 44 in A. camansi, and 21 in A. heterophyllus. The number of alleles per locus ranged from two to 19. The new markers will be useful for assessing the identity and genetic diversity of breadfruit cultivars on a small geographical scale, gaining a better understanding of farmer management practices, and will help to optimize breadfruit genebank management.
Turchetto, Caroline; Segatto, Ana Lúcia A.; Beduschi, Júlia; Bonatto, Sandro L.; Freitas, Loreta B.
2015-01-01
Identifying the genetic basis of speciation is critical for understanding the evolutionary history of closely related wild species. Recently diverged species facilitate the study of speciation because many genetic and morphological characteristics are still shared by the organisms under study. The Petunia genus grows in South American grasslands and comprises both recently diverged wild species and commercial species. In this work, we analysed two closely related species: Petunia exserta, which has a narrow endemic range and grows exclusively in rocky shelters, and Petunia axillaris, which is widely distributed and comprises three allopatric subspecies. Petunia axillaris ssp. axillaris and P. exserta occur in sympatry, and putative hybrids between them have been identified. Here, we analysed 14 expressed sequence tag-simple sequence repeats (EST-SSRs) in 126 wild individuals and 13 putative morphological hybrids with the goals of identifying differentially encoded alleles to characterize their natural genetic diversity, establishing a genetic profile for each taxon and to verify the presence of hybridization signal. Overall, 143 alleles were identified and all taxa contained private alleles. Four major groups were identified in clustering analyses, which indicated that there are genetic distinctions among the groups. The markers evaluated here will be useful in evolutionary studies involving these species and may help categorize individuals by species, thus enabling the identification of hybrids between both their putative taxa. The individuals with intermediate morphology presented private alleles of their both putative parental species, although they showed a level of genetic mixing that was comparable with some of the individuals with typical P. exserta morphology. The EST-SSR markers scattered throughout the Petunia genome are very efficient tools for characterizing the genetic diversity in wild taxa of this genus and aid in identifying interspecific hybrids based on the presence of private alleles. These properties indicate that these markers will be helpful tools in evolutionary studies. PMID:26187606
WebSat--a web software for microsatellite marker development.
Martins, Wellington Santos; Lucas, Divino César Soares; Neves, Kelligton Fabricio de Souza; Bertioli, David John
2009-01-01
Simple sequence repeats (SSR), also known as microsatellites, have been extensively used as molecular markers due to their abundance and high degree of polymorphism. We have developed a simple to use web software, called WebSat, for microsatellite molecular marker prediction and development. WebSat is accessible through the Internet, requiring no program installation. Although a web solution, it makes use of Ajax techniques, providing a rich, responsive user interface. WebSat allows the submission of sequences, visualization of microsatellites and the design of primers suitable for their amplification. The program allows full control of parameters and the easy export of the resulting data, thus facilitating the development of microsatellite markers. The web tool may be accessed at http://purl.oclc.org/NET/websat/
Identification of apple cultivars on the basis of simple sequence repeat markers.
Liu, G S; Zhang, Y G; Tao, R; Fang, J G; Dai, H Y
2014-09-12
DNA markers are useful tools that play an important role in plant cultivar identification. They are usually based on polymerase chain reaction (PCR) and include simple sequence repeats (SSRs), inter-simple sequence repeats, and random amplified polymorphic DNA. However, DNA markers were not used effectively in the complete identification of plant cultivars because of the lack of known DNA fingerprints. Recently, a novel approach called the cultivar identification diagram (CID) strategy was developed to facilitate the use of DNA markers for separate plant individuals. The CID was designed whereby a polymorphic maker was generated from each PCR that directly allowed for cultivar sample separation at each step. Therefore, it could be used to identify cultivars and varieties easily with fewer primers. In this study, 60 apple cultivars, including a few main cultivars in fields and varieties from descendants (Fuji x Telamon) were examined. Of the 20 pairs of SSR primers screened, 8 pairs gave reproducible, polymorphic DNA amplification patterns. The banding patterns obtained from these 8 primers were used to construct a CID map. Each cultivar or variety in this study was distinguished from the others completely, indicating that this method can be used for efficient cultivar identification. The result contributed to studies on germplasm resources and the seedling industry in fruit trees.
Mahjbi, A; Oueslati, A; Baraket, G; Salhi-Hannachi, A; Zehdi Azouzi, S
2016-05-20
Citrus are one of the most cultivated crops in the world. Economically, they are very important fruit trees in Tunisia. Little is known about the genetic diversity of the Tunisian Citrus germplasm. Exploring this diversity is a prerequisite for the identification and characterization of the local germplasm to circumvent and controlling genetic erosion caused by biotic and abiotic stress to aid its conservation and use. In the present study, we explored the genetic diversity of 20 Tunisian orange cultivars [Citrus sinensis (L.) Osbeck] and established their relationships by using seven simple sequence repeat (SSR) loci. In total, 37 alleles and 44 genotypes were scored. The sizes of alleles ranged from 90 to 280 bp. The number of alleles per locus was from 4 to 7, with an average of 5.28. Polymorphic information content value changed from 0.599 to 0.769 with an average of 0.675. Analysis of the genotypes revealed a heterozygote deficiency across all the genotypes. The observed heterozygosity varied from 0 to 1 (average of 0.671). Cluster analysis showed that three groups could be distinguished and the polymorphism occurred independently of the geographical origin of the studied orange cultivars. The detected SSR genotypes allowed the establishment of an identification key with a discriminating power of 100%. Multivariate analysis and the neighbor-joining phylogenetic tree indicated a narrow genetic base for the orange cultivars. The usefulness of SSR markers for orange fingerprinting and evaluation of the genetic diversity in the Tunisian germplasm are discussed in this paper.
Li, Jingtao; Sun, Xinhua; Yu, Gang; Jia, Chengguo; Liu, Jinliang; Pan, Hongyu
2014-01-01
Little information is available on gene expression profiling of halophyte A. canescens. To elucidate the molecular mechanism for stress tolerance in A. canescens, a full-length complementary DNA library was generated from A. canescens exposed to 400 mM NaCl, and provided 343 high-quality ESTs. In an evaluation of 343 valid EST sequences in the cDNA library, 197 unigenes were assembled, among which 190 unigenes (83.1% ESTs) were identified according to their significant similarities with proteins of known functions. All the 343 EST sequences have been deposited in the dbEST GenBank under accession numbers JZ535802 to JZ536144. According to Arabidopsis MIPS functional category and GO classifications, we identified 193 unigenes of the 311 annotations EST, representing 72 non-redundant unigenes sharing similarities with genes related to the defense response. The sets of ESTs obtained provide a rich genetic resource and 17 up-regulated genes related to salt stress resistance were identified by qRT-PCR. Six of these genes may contribute crucially to earlier and later stage salt stress resistance. Additionally, among the 343 unigenes sequences, 22 simple sequence repeats (SSRs) were also identified contributing to the study of A. canescens resources. PMID:24960361
The complete chloroplast genome sequence of the medicinal plant Salvia miltiorrhiza.
Qian, Jun; Song, Jingyuan; Gao, Huanhuan; Zhu, Yingjie; Xu, Jiang; Pang, Xiaohui; Yao, Hui; Sun, Chao; Li, Xian'en; Li, Chuyuan; Liu, Juyan; Xu, Haibin; Chen, Shilin
2013-01-01
Salvia miltiorrhiza is an important medicinal plant with great economic and medicinal value. The complete chloroplast (cp) genome sequence of Salvia miltiorrhiza, the first sequenced member of the Lamiaceae family, is reported here. The genome is 151,328 bp in length and exhibits a typical quadripartite structure of the large (LSC, 82,695 bp) and small (SSC, 17,555 bp) single-copy regions, separated by a pair of inverted repeats (IRs, 25,539 bp). It contains 114 unique genes, including 80 protein-coding genes, 30 tRNAs and four rRNAs. The genome structure, gene order, GC content and codon usage are similar to the typical angiosperm cp genomes. Four forward, three inverted and seven tandem repeats were detected in the Salvia miltiorrhiza cp genome. Simple sequence repeat (SSR) analysis among the 30 asterid cp genomes revealed that most SSRs are AT-rich, which contribute to the overall AT richness of these cp genomes. Additionally, fewer SSRs are distributed in the protein-coding sequences compared to the non-coding regions, indicating an uneven distribution of SSRs within the cp genomes. Entire cp genome comparison of Salvia miltiorrhiza and three other Lamiales cp genomes showed a high degree of sequence similarity and a relatively high divergence of intergenic spacers. Sequence divergence analysis discovered the ten most divergent and ten most conserved genes as well as their length variation, which will be helpful for phylogenetic studies in asterids. Our analysis also supports that both regional and functional constraints affect gene sequence evolution. Further, phylogenetic analysis demonstrated a sister relationship between Salvia miltiorrhiza and Sesamum indicum. The complete cp genome sequence of Salvia miltiorrhiza reported in this paper will facilitate population, phylogenetic and cp genetic engineering studies of this medicinal plant.
Jia, Huixia; Yang, Haifeng; Sun, Pei; Li, Jianbo; Zhang, Jin; Guo, Yinghua; Han, Xiaojiao; Zhang, Guosheng; Lu, Mengzhu; Hu, Jianjun
2016-01-01
Salix psammophila, a sandy shrub known as desert willow, is regarded as a potential biomass feedstock and plays an important role in maintaining local ecosystems. However, a lack of genomic data and efficient molecular markers limit the study of its population evolution and genetic breeding. In this study, chromosome counts, flow cytometry and SSR analyses indicated that S. psammophila is tetraploid. A total of 6,346 EST-SSRs were detected based on 71,458 de novo assembled unigenes from transcriptome data. Twenty-seven EST-SSR markers were developed to evaluate the genetic diversity and population structure of S. psammophila from eight natural populations in Northern China. High levels of genetic diversity (mean 10.63 alleles per locus; mean HE 0.689) were dectected in S. psammophila. The weak population structure and little genetic differentiation (pairwise FST = 0.006–0.016) were found among Population 1-Population 7 (Pop1-Pop7; Inner Mongolia and Shaanxi), but Pop8 (Ningxia) was clearly separated from Pop1-Pop7 and moderate differentiation (pairwise FST = 0.045–0.055) was detected between them, which may be influenced by local habitat conditions. Molecular variance analyses indicated that most of the genetic variation (94.27%) existed within populations. These results provide valuable genetic informations for natural resource conservation and breeding programme optimisation of S. psammophila. PMID:27995985
Chen, Jinhui; Hao, Zhaodong; Xu, Haibin; Yang, Liming; Liu, Guangxin; Sheng, Yu; Zheng, Chen; Zheng, Weiwei; Cheng, Tielong; Shi, Jisen
2015-01-01
Metasequoia glyptostroboides Hu et Cheng is the only species in the genus Metasequoia Miki ex Hu et Cheng, which belongs to the Cupressaceae family. There were around 10 species in the Metasequoia genus, which were widely spread across the Northern Hemisphere during the Cretaceous of the Mesozoic and in the Cenozoic. M. glyptostroboides is the only remaining representative of this genus. Here, we report the complete chloroplast (cp) genome sequence and the cp genomic features of M. glyptostroboides. The M. glyptostroboides cp genome is 131,887 bp in length, with a total of 117 genes comprised of 82 protein-coding genes, 31 tRNA genes and four rRNA genes. In this genome, 11 forward repeats, nine palindromic repeats, and 15 tandem repeats were detected. A total of 188 perfect microsatellites were detected through simple sequence repeat (SSR) analysis and these were distributed unevenly within the cp genome. Comparison of the cp genome structure and gene order to those of several other land plants indicated that a copy of the inverted repeat (IR) region, which was found to be IR region A (IRA), was lost in the M. glyptostroboides cp genome. The five most divergent and five most conserved genes were determined and further phylogenetic analysis was performed among plant species, especially for related species in conifers. Finally, phylogenetic analysis demonstrated that M. glyptostroboides is a sister species to Cryptomeria japonica (L. F.) D. Don and to Taiwania cryptomerioides Hayata. The complete cp genome sequence information of M. glyptostroboides will be great helpful for further investigations of this endemic relict woody plant and for in-depth understanding of the evolutionary history of the coniferous cp genomes, especially for the position of M. glyptostroboides in plant systematics and evolution.
Chen, Jinhui; Hao, Zhaodong; Xu, Haibin; Yang, Liming; Liu, Guangxin; Sheng, Yu; Zheng, Chen; Zheng, Weiwei; Cheng, Tielong; Shi, Jisen
2015-01-01
Metasequoia glyptostroboides Hu et Cheng is the only species in the genus Metasequoia Miki ex Hu et Cheng, which belongs to the Cupressaceae family. There were around 10 species in the Metasequoia genus, which were widely spread across the Northern Hemisphere during the Cretaceous of the Mesozoic and in the Cenozoic. M. glyptostroboides is the only remaining representative of this genus. Here, we report the complete chloroplast (cp) genome sequence and the cp genomic features of M. glyptostroboides. The M. glyptostroboides cp genome is 131,887 bp in length, with a total of 117 genes comprised of 82 protein-coding genes, 31 tRNA genes and four rRNA genes. In this genome, 11 forward repeats, nine palindromic repeats, and 15 tandem repeats were detected. A total of 188 perfect microsatellites were detected through simple sequence repeat (SSR) analysis and these were distributed unevenly within the cp genome. Comparison of the cp genome structure and gene order to those of several other land plants indicated that a copy of the inverted repeat (IR) region, which was found to be IR region A (IRA), was lost in the M. glyptostroboides cp genome. The five most divergent and five most conserved genes were determined and further phylogenetic analysis was performed among plant species, especially for related species in conifers. Finally, phylogenetic analysis demonstrated that M. glyptostroboides is a sister species to Cryptomeria japonica (L. F.) D. Don and to Taiwania cryptomerioides Hayata. The complete cp genome sequence information of M. glyptostroboides will be great helpful for further investigations of this endemic relict woody plant and for in-depth understanding of the evolutionary history of the coniferous cp genomes, especially for the position of M. glyptostroboides in plant systematics and evolution. PMID:26136762
Li, Juan; Zeng, Yanru; Shen, Dengfeng; Xia, Guohua; Huang, Yinzhi; Huang, Youjun; Chang, Jun; Huang, Jianqin; Wang, Zhengjia
2014-10-01
Hickory (Carya cathayensis Sarg.), an important nut-producing species in Southeastern China, has high economic value, but so far there has been no cultivar bred under species although it is mostly propagated by seeding and some elite individuals have been found. It has been found recently that this species has a certain rate of apomixis and poor knowledge of its genetic background has influenced development of a feasible breeding strategy. Here in this paper we first release SSR (Simple sequence repeat) markers developed in this species and their transferability to other three species of the same genus, Carya. A total of 311 pairs of SSR primers in hickory were developed based on sequenced cDNAs of a fruit development-associated cDNA library and RNA-seq data of developing female floral buds and could be used to distinguish hickory, C. hunanensis Cheng et R. H. Chang ex R. H. Chang et Lu, C. illinoensis K. Koch (pecan) and C. dabieshanensis M. C. Liu et Z. J. Li, but they were monomorphic in both hickory and C. hunanensis although multi-alleles have been identified in all the four species. There is a transferability rate of 63.02% observed between hickory and pecan and the markers can be applied to study genetic diversity of accessions in pecan. When used in C. dabieshanensis, it was revealed that C. dabieshanensis had the number of alleles per locus ranging from 2 to 4, observed heterozygosity from 0 to 0.6667 and expected heterozygosity from 0.333 to 0.8667, respectively, which supports the existence of C. dabieshanensis as a separate species different from hickory and indicates that there is potential for selection and breeding in this species.
Li, Juan; Zeng, Yanru; Shen, Dengfeng; Xia, Guohua; Huang, Yinzhi; Huang, Youjun; Chang, Jun; Huang, Jianqin; Wang, Zhengjia
2014-01-01
Hickory (Carya cathayensis Sarg.), an important nut-producing species in Southeastern China, has high economic value, but so far there has been no cultivar bred under species although it is mostly propagated by seeding and some elite individuals have been found. It has been found recently that this species has a certain rate of apomixis and poor knowledge of its genetic background has influenced development of a feasible breeding strategy. Here in this paper we first release SSR (Simple sequence repeat) markers developed in this species and their transferability to other three species of the same genus, Carya. A total of 311 pairs of SSR primers in hickory were developed based on sequenced cDNAs of a fruit development-associated cDNA library and RNA-seq data of developing female floral buds and could be used to distinguish hickory, C. hunanensis Cheng et R. H. Chang ex R. H. Chang et Lu, C. illinoensis K. Koch (pecan) and C. dabieshanensis M. C. Liu et Z. J. Li, but they were monomorphic in both hickory and C. hunanensis although multi-alleles have been identified in all the four species. There is a transferability rate of 63.02% observed between hickory and pecan and the markers can be applied to study genetic diversity of accessions in pecan. When used in C. dabieshanensis, it was revealed that C. dabieshanensis had the number of alleles per locus ranging from 2 to 4, observed heterozygosity from 0 to 0.6667 and expected heterozygosity from 0.333 to 0.8667, respectively, which supports the existence of C. dabieshanensis as a separate species different from hickory and indicates that there is potential for selection and breeding in this species. PMID:25435799
Biswas, Manosh Kumar; Liu, Yuxuan; Li, Chunyu; Sheng, Ou; Mayer, Christoph; Yi, Ganjun
2015-01-01
The development of organized, informative, robust, user-friendly, and freely accessible molecular markers is imperative to the Musa marker assisted breeding program. Although several hundred SSR markers have already been developed, the number of informative, robust, and freely accessible Musa markers remains inadequate for some breeding applications. In view of this issue, we surveyed SSRs in four different data sets, developed large-scale non-redundant highly informative therapeutic SSR markers, and classified them according to their attributes, as well as analyzed their cross-taxon transferability and utility for the genetic study of Musa and its relatives. A high SSR frequency (177 per Mbp) was found in the Musa genome. AT-rich dinucleotide repeats are predominant, and trinucleotide repeats are the most abundant in transcribed regions. A significant number of Musa SSRs are associated with pre-miRNAs, and 83% of these SSRs are promising candidates for the development of therapeutic SSR markers. Overall, 74% of the SSR markers were polymorphic, and 94% were transferable to at least one Musa spp. Two hundred forty-three markers generated a total of 1047 alleles, with 2-8 alleles each and an average of 4.38 alleles per locus. The PIC values ranged from 0.31 to 0.89 and averaged 0.71. We report the largest set of non-redundant, polymorphic, new SSR markers to be developed in Musa. These additional markers could be a valuable resource for marker-assisted breeding, genetic diversity and genomic studies of Musa and related species. PMID:26121637
Li, Feng; Kitashiba, Hiroyasu; Inaba, Kiyofumi; Nishio, Takeshi
2009-01-01
For identification of genes responsible for varietal differences in flowering time and leaf morphological traits, we constructed a linkage map of Brassica rapa DNA markers including 170 EST-based markers, 12 SSR markers, and 59 BAC sequence-based markers, of which 151 are single nucleotide polymorphism (SNP) markers. By BLASTN, 223 markers were shown to have homologous regions in Arabidopsis thaliana, and these homologous loci covered nearly the whole genome of A. thaliana. Synteny analysis between B. rapa and A. thaliana revealed 33 large syntenic regions. Three quantitative trait loci (QTLs) for flowering time were detected. BrFLC1 and BrFLC2 were linked to the QTLs for bolting time, budding time, and flowering time. Three SNPs in the promoter, which may be the cause of low expression of BrFLC2 in the early-flowering parental line, were identified. For leaf lobe depth and leaf hairiness, one major QTL corresponding to a syntenic region containing GIBBERELLIN 20 OXIDASE 3 and one major QTL containing BrGL1, respectively, were detected. Analysis of nucleotide sequences and expression of these genes suggested possible involvement of these genes in leaf morphological traits. PMID:19884167
Julier, Bernadette; Flajoulot, Sandrine; Barre, Philippe; Cardinet, Gaëlle; Santoni, Sylvain; Huguet, Thierry; Huyghe, Christian
2003-01-01
Background Alfalfa (Medicago sativa) is a major forage crop. The genetic progress is slow in this legume species because of its autotetraploidy and allogamy. The genetic structure of this species makes the construction of genetic maps difficult. To reach this objective, and to be able to detect QTLs in segregating populations, we used the available codominant microsatellite markers (SSRs), most of them identified in the model legume Medicago truncatula from EST database. A genetic map was constructed with AFLP and SSR markers using specific mapping procedures for autotetraploids. The tetrasomic inheritance was analysed in an alfalfa mapping population. Results We have demonstrated that 80% of primer pairs defined on each side of SSR motifs in M. truncatula EST database amplify with the alfalfa DNA. Using a F1 mapping population of 168 individuals produced from the cross of 2 heterozygous parental plants from Magali and Mercedes cultivars, we obtained 599 AFLP markers and 107 SSR loci. All but 3 SSR loci showed a clear tetrasomic inheritance. For most of the SSR loci, the double-reduction was not significant. For the other loci no specific genotypes were produced, so the significant double-reduction could arise from segregation distortion. For each parent, the genetic map contained 8 groups of four homologous chromosomes. The lengths of the maps were 2649 and 3045 cM, with an average distance of 7.6 and 9.0 cM between markers, for Magali and Mercedes parents, respectively. Using only the SSR markers, we built a composite map covering 709 cM. Conclusions Compared to diploid alfalfa genetic maps, our maps cover about 88–100% of the genome and are close to saturation. The inheritance of the codominant markers (SSR) and the pattern of linkage repulsions between markers within each homology group are consistent with the hypothesis of a tetrasomic meiosis in alfalfa. Except for 2 out of 107 SSR markers, we found a similar order of markers on the chromosomes between the tetraploid alfalfa and M. truncatula genomes indicating a high level of colinearity between these two species. These maps will be a valuable tool for alfalfa breeding and are being used to locate QTLs. PMID:14683527
NASA Astrophysics Data System (ADS)
Han, Zhaofang; Xiao, Shijun; Liu, Xiande; Liu, Yang; Li, Jiakai; Xie, Yangjie; Wang, Zhiyong
2017-03-01
The large yellow croaker, Larimichthys crocea is an important marine fish in China with a high economic value. In the last decade, the stock conservation and aquaculture industry of this species have been facing severe challenges because of wild population collapse and degeneration of important economic traits. However, genes contributing to growth and immunity in L. crocea have not been thoroughly analyzed, and available molecular markers are still not sufficient for genetic resource management and molecular selection. In this work, we sequenced the transcriptome in L. crocea liver tissue with a Roche 454 sequencing platform and assembled the transcriptome into 93 801 transcripts. Of them, 38 856 transcripts were successfully annotated in nt, nr, Swiss-Prot, InterPro, COG, GO and KEGG databases. Based on the annotation information, 3 165 unigenes related to growth and immunity were identified. Additionally, a total of 6 391 simple sequence repeats (SSRs) were identified from the transcriptome, among which 4 498 SSRs had enough flanking regions to design primers for polymerase chain reactions (PCR). To access the polymorphism of these markers, 30 primer pairs were randomly selected for PCR amplification and validation in 30 individuals, and 12 primer pairs (40.0%) exhibited obvious length polymorphisms. This work applied RNA-Seq to assemble and analyze a live transcriptome in L. crocea. With gene annotation and sequence information, genes related to growth and immunity were identified and massive SSR markers were developed, providing valuable genetic resources for future gene functional analysis and selective breeding of L. crocea.
Simple sequence repeat markers that identify Claviceps species and strains.
Gilmore, Barbara S; Alderman, Stephen C; Knaus, Brian J; Bassil, Nahla V; Martin, Ruth C; Dombrowski, James E; Dung, Jeremiah K S
2016-01-01
Claviceps purpurea is a pathogen that infects most members of Pooideae, a subfamily of Poaceae, and causes ergot, a floral disease in which the ovary is replaced with a sclerotium. When the ergot body is accidently consumed by either man or animal in high enough quantities, there is extreme pain, limb loss and sometimes death. This study was initiated to develop simple sequence repeat (SSRs) markers for rapid identification of C. purpurea . SSRs were designed from sequence data stored at the National Center for Biotechnology Information database. The study consisted of 74 ergot isolates, from four different host species, Lolium perenne , Poa pratensis , Bromus inermis , and Secale cereale plus three additional Claviceps species, C. pusilla , C. paspali and C. fusiformis. Samples were collected from six different counties in Oregon and Washington over a 5-year period. Thirty-four SSR markers were selected, which enabled the differentiation of each isolate from one another based solely on their molecular fingerprints. Discriminant analysis of principle components was used to identify four isolate groups, CA Group 1, 2, 3, and 4, for subsequent cluster and molecular variance analyses. CA Group 1 consisting of eight isolates from the host species P. pratensis , was separated on the cluster analysis plot from the remaining three groups and this group was later identified as C. humidiphila . The other three groups were distinct from one another, but closely related. These three groups contained samples from all four of the host species. These SSRs are simple to use, reliable and allowed clear differentiation of C. humidiphila from C. purpurea . Isolates from the three separate species, C. pusilla , C. paspali and C. fusiformis , also amplified with these markers. The SSR markers developed in this study will be helpful in defining the population structure and genetics of Claviceps strains. They will also provide valuable tools for plant breeders needing to identify resistance in crops or for researchers examining fungal movements across environments.
WebSat ‐ A web software for microsatellite marker development
Martins, Wellington Santos; Soares Lucas, Divino César; de Souza Neves, Kelligton Fabricio; Bertioli, David John
2009-01-01
Simple sequence repeats (SSR), also known as microsatellites, have been extensively used as molecular markers due to their abundance and high degree of polymorphism. We have developed a simple to use web software, called WebSat, for microsatellite molecular marker prediction and development. WebSat is accessible through the Internet, requiring no program installation. Although a web solution, it makes use of Ajax techniques, providing a rich, responsive user interface. WebSat allows the submission of sequences, visualization of microsatellites and the design of primers suitable for their amplification. The program allows full control of parameters and the easy export of the resulting data, thus facilitating the development of microsatellite markers. Availability The web tool may be accessed at http://purl.oclc.org/NET/websat/ PMID:19255650
Marum, Liliana; Rocheta, Margarida; Maroco, João; Oliveira, M Margarida; Miguel, Célia
2009-04-01
Somatic embryogenesis (SE) is a propagation tool of particular interest for accelerating the deployment of new high-performance planting stock in multivarietal forestry. However, genetic conformity in in vitro propagated plants should be assessed as early as possible, especially in long-living trees such as conifers. The main objective of this work was to study such conformity based on genetic stability at simple sequence repeat (SSR) loci during somatic embryogenesis in maritime pine (Pinus pinaster Ait.). Embryogenic cell lines (ECLs) subjected to tissue proliferation during 6, 14 or 22 months, as well as emblings regenerated from several ECLs, were analyzed. Genetic variation at seven SSR loci was detected in ECLs under proliferation conditions for all time points, and in 5 out of 52 emblings recovered from somatic embryos. Three of these five emblings showed an abnormal phenotype consisting mainly of plagiotropism and loss of apical dominance. Despite the variation found in somatic embryogenesis-derived plant material, no correlation was established between genetic stability at the analyzed loci and abnormal embling phenotype, present in 64% of the emblings. The use of microsatellites in this work was efficient for monitoring mutation events during the somatic embryogenesis in P. pinaster. These molecular markers should be useful in the implementation of new breeding and deployment strategies for improved trees using SE.
Ahmad, Faiz; Hanafi, Mohamed Musa; Hakim, Md Abdul; Rafii, Mohd Y.; Arolu, Ibrahim Wasiu; Akmar Abdullah, Siti Nor
2015-01-01
Coloured rice genotypes have greater nutritious value and consumer demand for these varieties is now greater than ever. The documentation of these genotypes is important for the improvement of the rice plant. In this study, 42 coloured rice genotypes were selected for determination of their genetic divergence using 25 simple sequence repeat (SSR) primers and 15 agro-morphological traits. Twenty-one out of the 25 SSR primers showed distinct, reproducible polymorphism. A dendrogram constructed using the SSR primers clustered the 42 coloured rice genotypes into 7 groups. Further, principle component analysis showed 75.28% of total variations were explained by the first—three components. All agro-morphological traits showed significant difference at the (p≤0.05) and (p≤0.01) levels. From the dendrogram constructed using the agro-morphological traits, all the genotypes were clustered into four distinct groups. Pearson’s correlation coefficient showed that among the 15 agro-morphological traits, the yield contributing factor had positive correlation with the number of tillers, number of panicles, and panicle length. The heritability of the 15 traits ranged from 17.68 to 99.69%. Yield per plant and harvest index showed the highest value for both heritability and genetic advance. The information on the molecular and agro-morphological traits can be used in rice breeding programmes to improve nutritional value and produce higher yields. PMID:26393807
Zhao, Ying; Thammannagowda, Shivegowda; Staton, Margaret; Tang, Sha; Xia, Xinli; Yin, Weilun; Liang, Haiying
2013-03-01
The "living fossil" Metasequoia glyptostroboides Hu et Cheng, commonly known as dawn redwood or Chinese redwood, is the only living species in the genus and is valued for its essential oil and crude extracts that have great potential for anti-fungal activity. Despite its paleontological significance and economical value as a rare relict species, genomic resources of Metasequoia are very limited. In order to gain insight into the molecular mechanisms behind the formation of reproductive buds and the transition from vegetative phase to reproductive phase in Metasequoia, we performed sequencing of expressed sequence tags from Metasequoia vegetative buds and female buds. By using the 454 pyrosequencing technology, a total of 1,571,764 high-quality reads were generated, among which 733,128 were from vegetative buds and 775,636 were from female buds. These EST reads were clustered and assembled into 114,124 putative unique transcripts (PUTs) with an average length of 536 bp. The 97,565 PUTs that were at least 100 bp in length were functionally annotated by a similarity search against public databases and assigned with Gene Ontology (GO) terms. A total of 59 known floral gene families and 190 isotigs involved in hormone regulation were captured in the dataset. Furthermore, a set of PUTs differentially expressed in vegetative and reproductive buds, as well as SSR motifs and high confidence SNPs, were identified. This is the first large-scale expressed sequence tags ever generated in Metasequoia and the first evidence for floral genes in this critically endangered deciduous conifer species.
Vatanparast, Mohammad; Shetty, Prateek; Chopra, Ratan; Doyle, Jeff J; Sathyanarayana, N; Egan, Ashley N
2016-06-30
Winged bean, Psophocarpus tetragonolobus (L.) DC., is similar to soybean in yield and nutritional value but more viable in tropical conditions. Here, we strengthen genetic resources for this orphan crop by producing a de novo transcriptome assembly and annotation of two Sri Lankan accessions (denoted herein as CPP34 [PI 491423] and CPP37 [PI 639033]), developing simple sequence repeat (SSR) markers, and identifying single nucleotide polymorphisms (SNPs) between geographically separated genotypes. A combined assembly based on 804,757 reads from two accessions produced 16,115 contigs with an N50 of 889 bp, over 90% of which has significant sequence similarity to other legumes. Combining contigs with singletons produced 97,241 transcripts. We identified 12,956 SSRs, including 2,594 repeats for which primers were designed and 5,190 high-confidence SNPs between Sri Lankan and Nigerian genotypes. The transcriptomic data sets generated here provide new resources for gene discovery and marker development in this orphan crop, and will be vital for future plant breeding efforts. We also analyzed the soybean trypsin inhibitor (STI) gene family, important plant defense genes, in the context of related legumes and found evidence for radiation of the Kunitz trypsin inhibitor (KTI) gene family within winged bean.
Complete Chloroplast Genome Sequences of Important Oilseed Crop Sesamum indicum L
Yi, Dong-Keun; Kim, Ki-Joong
2012-01-01
Sesamum indicum is an important crop plant species for yielding oil. The complete chloroplast (cp) genome of S. indicum (GenBank acc no. JN637766) is 153,324 bp in length, and has a pair of inverted repeat (IR) regions consisting of 25,141 bp each. The lengths of the large single copy (LSC) and the small single copy (SSC) regions are 85,170 bp and 17,872 bp, respectively. Comparative cp DNA sequence analyses of S. indicum with other cp genomes reveal that the genome structure, gene order, gene and intron contents, AT contents, codon usage, and transcription units are similar to the typical angiosperm cp genomes. Nucleotide diversity of the IR region between Sesamum and three other cp genomes is much lower than that of the LSC and SSC regions in both the coding region and noncoding region. As a summary, the regional constraints strongly affect the sequence evolution of the cp genomes, while the functional constraints weakly affect the sequence evolution of cp genomes. Five short inversions associated with short palindromic sequences that form step-loop structures were observed in the chloroplast genome of S. indicum. Twenty-eight different simple sequence repeat loci have been detected in the chloroplast genome of S. indicum. Almost all of the SSR loci were composed of A or T, so this may also contribute to the A-T richness of the cp genome of S. indicum. Seven large repeated loci in the chloroplast genome of S. indicum were also identified and these loci are useful to developing S. indicum-specific cp genome vectors. The complete cp DNA sequences of S. indicum reported in this paper are prerequisite to modifying this important oilseed crop by cp genetic engineering techniques. PMID:22606240
Development and validation of the first SSR markers for Mimosa scabrella Benth.
Saiki, F A; Bernardi, A P; Reis, M S; Faoro, H; Souza, E M; Pedrosa, F O; Mantovani, A; Guidolin, A F
2017-02-16
Mimosa scabrella Benth., popularly known as ''bracatinga'', is a pioneer and endemic species of Brazil, occurring in Mixed Ombrophilous Forest associated with Brazilian Atlantic Rainforest biomes. It is a fast-growing tree of the Fabaceae family that facilitates the dynamics of ecological succession. SSR development, when there is no genome sequence, is time and labor intensive and there are no molecular markers for M. scabrella. We developed and validated the first microsatellite markers for this tetraploid species, evaluating mother trees and progenies. Using Illumina sequencing, we identified 290 SSR loci and 211 primer pairs. After 31 SSR loci PCR/agarose electrophoresis selection, a subset of 11 primer pairs was synthetized with fluorescence in the forward primer for PCR and capillary electrophoresis validation with leaf DNA of 33 adult and 411 progeny individuals. Polymorphic locus percentage was 36, 4 in 11 loci, 3 chloroplast SSRs, and 1 nuclear SSR. Allele number of polymorphic loci ranged from 2 to 11 alleles considering all sampling. All 11 primer pairs were also tested for cross-species amplification for five Fabaceae-Mimosoideae species, ranging from 2 loci transferred to Calliandra tweedii Benth. and all 11 loci transferred to Mimosa taimbensis Burkart. The assessed and validated SSR markers for M. scabrella are suitable and useful for analysis and population genetic studies.
Schlautman, Brandon; Fajardo, Diego; Bougie, Tierney; Wiesman, Eric; Polashock, James; Vorsa, Nicholi; Steffan, Shawn; Zalapa, Juan
2015-01-27
The American cranberry, Vaccinium macrocarpon Ait., is an economically important North American fruit crop that is consumed because of its unique flavor and potential health benefits. However, a lack of abundant, genome-wide molecular markers has limited the adoption of modern molecular assisted selection approaches in cranberry breeding programs. To increase the number of available markers in the species, this study identified, tested, and validated microsatellite markers from existing nuclear and transcriptome sequencing data. In total, new primers were designed, synthesized, and tested for 979 SSR loci; 697 of the markers amplified allele patterns consistent with single locus segregation in a diploid organism and were considered polymorphic. Of the 697 polymorphic loci, 507 were selected for additional genetic diversity and segregation analyses in 29 cranberry genotypes. More than 95% of the 507 loci did not display segregation distortion at the p < 0.05 level, and contained moderate to high levels of polymorphism with a polymorphic information content >0.25. This comprehensive collection of developed and validated microsatellite loci represents a substantial addition to the molecular tools available for geneticists, genomicists, and breeders in cranberry and Vaccinium.
Chen, Xiaochen; Li, Qiushi; Li, Ying; Qian, Jun; Han, Jianping
2015-01-01
The chloroplast genome (cp genome) of Aconitum barbatum var. puberulum was sequenced using the third-generation sequencing platform based on the single-molecule real-time (SMRT) sequencing approach. To our knowledge, this is the first reported complete cp genome of Aconitum, and we anticipate that it will have great value for phylogenetic studies of the Ranunculaceae family. In total, 23,498 CCS reads and 20,685,462 base pairs were generated, the mean read length was 880 bp, and the longest read was 2,261 bp. Genome coverage of 100% was achieved with a mean coverage of 132× and no gaps. The accuracy of the assembled genome is 99.973%; the assembly was validated using Sanger sequencing of six selected genes from the cp genome. The complete cp genome of A. barbatum var. puberulum is 156,749 bp in length, including a large single-copy region of 87,630 bp and a small single-copy region of 16,941 bp separated by two inverted repeats of 26,089 bp. The cp genome contains 130 genes, including 84 protein-coding genes, 34 tRNA genes and eight rRNA genes. Four forward, five inverted and eight tandem repeats were identified. According to the SSR analysis, the longest poly structure is a 20-T repeat. Our results presented in this paper will facilitate the phylogenetic studies and molecular authentication on Aconitum.
Chen, Xiaochen; Li, Qiushi; Li, Ying; Qian, Jun; Han, Jianping
2015-01-01
The chloroplast genome (cp genome) of Aconitum barbatum var. puberulum was sequenced using the third-generation sequencing platform based on the single-molecule real-time (SMRT) sequencing approach. To our knowledge, this is the first reported complete cp genome of Aconitum, and we anticipate that it will have great value for phylogenetic studies of the Ranunculaceae family. In total, 23,498 CCS reads and 20,685,462 base pairs were generated, the mean read length was 880 bp, and the longest read was 2,261 bp. Genome coverage of 100% was achieved with a mean coverage of 132× and no gaps. The accuracy of the assembled genome is 99.973%; the assembly was validated using Sanger sequencing of six selected genes from the cp genome. The complete cp genome of A. barbatum var. puberulum is 156,749 bp in length, including a large single-copy region of 87,630 bp and a small single-copy region of 16,941 bp separated by two inverted repeats of 26,089 bp. The cp genome contains 130 genes, including 84 protein-coding genes, 34 tRNA genes and eight rRNA genes. Four forward, five inverted and eight tandem repeats were identified. According to the SSR analysis, the longest poly structure is a 20-T repeat. Our results presented in this paper will facilitate the phylogenetic studies and molecular authentication on Aconitum. PMID:25705213
Szczecińska, Monika
2016-01-01
Background Research into the protection of rare and endangered plant species involves genetic analyses to determine their genetic variation and genetic structure. Various categories of genetic markers are used for this purpose. Microsatellites, also known as simple sequence repeats (SSR), are the most popular category of markers in population genetics research. In most cases, microsatellites account for a large part of the noncoding DNA and exert a neutral effect on the genome. Neutrality is a desirable feature in evaluations of genetic differences between populations, but it does not support analyses of a population’s ability to adapt to a given environment or its evolutionary potential. Despite the numerous advantages of microsatellites, non-neutral markers may supply important information in conservation genetics research. They are used to evaluate adaptation to specific environmental conditions and a population’s adaptive potential. The aim of this study was to compare the level of genetic variation in Pulsatilla patens populations revealed by neutral SSR markers and putatively adaptive ISJ markers (intron-exon splice junction). Methods The experiment was conducted on 14 Polish populations of P. patens and three P. patens populations from the nearby region of Vitebsk in Belarus. A total of 345 individuals were examined. Analyses were performed with the use of eight SSR primers specific to P. patens and three ISJ primers. Results SSR markers revealed a higher level of genetic variation than ISJ markers (He = 0.609, He = 0.145, respectively). An analysis of molecular variance (AMOVA) revealed that, the overall genetic diversity between the analyzed populations defined by parameters FST and ΦPT for SSR (20%) and ΦPT for ISJ (21%) markers was similar. Analysis conducted in the Structure program divided analyzed populations into two groups (SSR loci) and three groups (ISJ markers). Mantel test revealed correlations between the geographic distance and genetic diversity of Polish populations of P. patens for ISJ markers, but not for SSR markers. Conclusions The results of the present study suggest that ISJ markers can complement the analyses based on SSRs. However, neutral and adaptive markers should not be alternatively applied. Neutral microsatellite markers cannot depict the full range of genetic variation in a population because they do not enable to analyze functional variation. Although ISJ markers are less polymorphic, they can contribute to the reliability of analyses based on SSRs. PMID:27833793
Yao, Xiaohong; Tang, Ping; Li, Zuozhou; Li, Dawei; Liu, Yifei; Huang, Hongwen
2015-01-01
Actinidia chinensis is an important economic plant belonging to the basal lineage of the asterids. Availability of a complete Actinidia chloroplast genome sequence is crucial to understanding phylogenetic relationships among major lineages of angiosperms and facilitates kiwifruit genetic improvement. We report here the complete nucleotide sequences of the chloroplast genomes for Actinidia chinensis and A. chinensis var deliciosa obtained through de novo assembly of Illumina paired-end reads produced by total DNA sequencing. The total genome size ranges from 155,446 to 157,557 bp, with an inverted repeat (IR) of 24,013 to 24,391 bp, a large single copy region (LSC) of 87,984 to 88,337 bp and a small single copy region (SSC) of 20,332 to 20,336 bp. The genome encodes 113 different genes, including 79 unique protein-coding genes, 30 tRNA genes and 4 ribosomal RNA genes, with 16 duplicated in the inverted repeats, and a tRNA gene (trnfM-CAU) duplicated once in the LSC region. Comparisons of IR boundaries among four asterid species showed that IR/LSC borders were extended into the 5' portion of the psbA gene and IR contraction occurred in Actinidia. The clap gene has been lost from the chloroplast genome in Actinidia, and may have been transferred to the nucleus during chloroplast evolution. Twenty-seven polymorphic simple sequence repeat (SSR) loci were identified in the Actinidia chloroplast genome. Maximum parsimony analyses of a 72-gene, 16 taxa angiosperm dataset strongly support the placement of Actinidiaceae in Ericales within the basal asterids.
Cooper, Laurel D.; Kishore, Venkata K.; Knapp, Steven J.; Kling, Jennifer G.
2015-01-01
The seed oil of meadowfoam, a new crop in the Limnanthaceae family, is highly enriched in very long chain fatty acids that are desaturated at the Δ5 position. The unusual oil is desirable for cosmetics and innovative industrial applications and the seed meal remaining after oil extraction contains glucolimnanthin, a methoxylated benzylglucosinolate whose degradation products are herbicidal and anti-microbial. Here we describe EST analysis of the developing seed transcriptome that identified major genes involved in biosynthesis and assembly of the seed oil and in glucosinolate metabolic pathways. mRNAs encoding acyl-CoA Δ5 desaturase were notably abundant. The library was searched for simple sequence repeats (SSRs) and single nucleotide polymorphisms (SNPs). Fifty-four new SSR markers and eight candidate gene markers were developed and combined with previously developed SSRs to construct a new genetic map for Limnanthes alba. Mapped genes in the lipid biosynthetic pathway encode 3-ketoacyl-CoA synthase (KCS), Δ5 desaturase (Δ5DS), lysophosphatidylacyl-acyl transferase (LPAT), and acyl-CoA diacylglycerol acyl transferase (DGAT). Mapped genes in glucosinolate biosynthetic and degradation pathways encode CYP79A, myrosinase (TGG), and epithiospecifier modifier protein (ESM). The resources developed in this study will further the domestication and improvement of meadowfoam as an oilseed crop. PMID:26038713
De Bellis, Fabien; Malapa, Roger; Kagy, Valérie; Lebegin, Stéphane; Billot, Claire; Labouisse, Jean-Pierre
2016-01-01
Premise of the study: Using next-generation sequencing technology, new microsatellite loci were characterized in Artocarpus altilis (Moraceae) and two congeners to increase the number of available markers for genotyping breadfruit cultivars. Methods and Results: A total of 47,607 simple sequence repeat loci were obtained by sequencing a library of breadfruit genomic DNA with an Illumina MiSeq system. Among them, 50 single-locus markers were selected and assessed using 41 samples (39 A. altilis, one A. camansi, and one A. heterophyllus). All loci were polymorphic in A. altilis, 44 in A. camansi, and 21 in A. heterophyllus. The number of alleles per locus ranged from two to 19. Conclusions: The new markers will be useful for assessing the identity and genetic diversity of breadfruit cultivars on a small geographical scale, gaining a better understanding of farmer management practices, and will help to optimize breadfruit genebank management. PMID:27610273
Xu, Jiajia; Li, Yuanyuan; Ma, Xiuling; Ding, Jianfeng; Wang, Kai; Wang, Sisi; Tian, Ye; Zhang, Hui; Zhu, Xin-Guang
2013-09-01
Setaria viridis is an emerging model species for genetic studies of C4 photosynthesis. Many basic molecular resources need to be developed to support for this species. In this paper, we performed a comprehensive transcriptome analysis from multiple developmental stages and tissues of S. viridis using next-generation sequencing technologies. Sequencing of the transcriptome from multiple tissues across three developmental stages (seed germination, vegetative growth, and reproduction) yielded a total of 71 million single end 100 bp long reads. Reference-based assembly using Setaria italica genome as a reference generated 42,754 transcripts. De novo assembly generated 60,751 transcripts. In addition, 9,576 and 7,056 potential simple sequence repeats (SSRs) covering S. viridis genome were identified when using the reference based assembled transcripts and the de novo assembled transcripts, respectively. This identified transcripts and SSR provided by this study can be used for both reverse and forward genetic studies based on S. viridis.
Shin, Dong-Ho; Webb, Barbara M; Nakao, Miki; Smith, Sylvia L
2009-07-01
Complement factor I is a crucial regulator of mammalian complement activity. Very little is known of complement regulators in non-mammalian species. We isolated and sequenced four highly similar complement factor I cDNAs from the liver of the nurse shark (Ginglymostoma cirratum), designated as GcIf-1, GcIf-2, GcIf-3 and GcIf-4 (previously referred to as nsFI-a, -b, -c and -d) which encode 689, 673, 673 and 657 amino acid residues, respectively. They share 95% (
Shin, Dong-Ho; Webb, Barbara M.; Nakao, Miki; Smith, Sylvia L.
2009-01-01
Complement factor I is a crucial regulator of mammalian complement activity. Very little is known of complement regulators in non-mammalian species. We isolated and sequenced four highly similar complement factor I cDNAs from the liver of the nurse shark (Ginglymostoma cirratum), designated as GcIf-1, GcIf-2, GcIf-3 and GcIf-4 (previously referred to as nsFI-a, -b, -c and –d) which encode 689, 673, 673 and 657 amino acid residues, respectively. They share 95% (≤) amino acid identities with each other, 35.4 ~ 39.6% and 62.8 ~ 65.9% with factor I of mammals and banded houndshark (Triakis scyllium), respectively. The modular structure of the GcIf is similar to that of mammals with one notable exception, the presence of a novel shark-specific sequence between the leader peptide (LP) and the factor I membrane attack complex (FIMAC) domain. The cDNA sequences differ only in the size and composition of the shark-specific region (SSR). Sequence analysis of each SSR has identified within the region two novel short sequences (SS1 and SS2) and three repeat sequences (RS1, 2 and 3). Genomic analysis has revealed the existence of three introns between the leader peptide and the FIMAC domain, tentatively designated intron 1, intron 2, and intron 3 which span 4067, 2293 and 2082 bp, respectively. Southern blot analysis suggests the presence of a single gene copy for each cDNA type. Phylogenetic analysis suggests that complement factor I of cartilaginous fish diverged prior to the emergence of mammals. All four GcIf cDNA species are expressed in four different tissues and the liver is the main tissue in which expression level of all four is high. This suggests that the expression of GcIf isotypes is tissue-dependent. PMID:19423168
Vieira, Leila do Nascimento; Dos Anjos, Karina Goulart; Faoro, Helisson; Fraga, Hugo Pacheco de Freitas; Greco, Thiago Machado; Pedrosa, Fábio de Oliveira; de Souza, Emanuel Maltempi; Rogalski, Marcelo; de Souza, Robson Francisco; Guerra, Miguel Pedro
2016-05-01
The complete plastome sequencing is an efficient option for increasing phylogenetic resolution and evolutionary studies, as well as may greatly facilitate the use of plastid DNA markers in plant population genetic studies. Merostachys and Guadua stand out as the most common and the highest potential utilization bamboos indigenous of Brazil. Here, we sequenced the complete plastome sequences of the Brazilian Guadua chacoensis and Merostachys sp. to perform full plastome phylogeny and characterize the occurrence, type, and distribution of SRRs using 20 Bambuseae species. The determined plastome sequence of Merostachys sp. and G. chacoensis is 136,334 and 135,403 bp in size, respectively, with an identical gene content and typical quadripartite structure consisting of a pair of IRs separated by the LSC and SSC regions. The Maximum Likelihood and Bayesian Inference analyses produced phylogenomic trees identical in topology. These trees supported monophyly of Paleotropical and Neotropical Bamboos clades. The Neotropical bamboos segregated into three well-supported lineages, Chusqueinae, Guaduinae, and Arthrostylidiinae, with the last two forming a well-supported sister relationship. Paleotropical bamboos segregated into two well-supported lineages, Hickeliinae and Bambusinae + Melocanninae. We identified 141.8 cpSSR in Bambuseae plastomes and an inferior value (38.15) for plastome coding sequences. Among them, we identified 16 polymorphic SSR loci, with number of alleles varying from 3 to 10. These 16 polymorphic cpSSR loci in Bambuseae plastome can be assessed for the intraspecific level of polymorphism, leading to innovative highly sensitive phylogeographic and population genetics studies for this tribe.
Zhang, Yu; Zhang, Xiaojuan; Chen, Xi; Sun, Wang; Li, Jiao
2018-01-01
Qinba area has a long history of tea planting and is a northernmost region in China where Camellia sinensis L. is grown. In order to provide basic data for selection and optimization of molecular markers of tea plants. 118 markers, including 40 EST-SSR, 40 SRAP and 38 SCoT markers were used to evaluate the genetic diversity of 50 tea plant ( Camellia sinensis. ) samples collected from Qinb. tea germplasm, assess population structure. In this study, a total of 414 alleles were obtained using 38 pairs of SCoT primers, with an average of 10.89 alleles per primer. The percentage of polymorphic bands (PPB), polymorphism information content (PIC), resolving power (Rp), effective multiplex ratio (EMR), average band informativeness (Ib av ), and marker index (MI) were 96.14%, 0.79, 6.71, 10.47, 0.58, and 6.07 respectively. 338 alleles were amplified via 40 pairs of SRAP (8.45 per primer), with PPB, PIC, Rp, EMR, Ib av, and MI values of 89.35%, 0.77, 5.11, 7.55, 0.61, and 4.61, respectively. Furthermore, 320 alleles have been detected using 40 EST-SSR primers (8.00 per primer), with PPB, PIC, Rp, EMR, Ib av , and MI values of 94.06%, 0.85, 4.48, 7.53, 0.56, and 4.22 respectively. These results indicated that SCoT markers had higher efficiency.Mantel test was used to analyze the genetic distance matrix generated by EST-SSRs, SRAPs and SCoTs. The results showed that the correlation between the genetic distance matrix based on EST-SSR and that based on SRAP was very small ( r = 0.01), followed by SCoT and SRAP ( r = 0.17), then by SCoT and EST-SSR ( r = 0.19).The 50 tea samples were divided into two sub-populations using STRUCTURE, Neighbor-joining (NJ) method and principal component analyses (PCA). The results produced by STRUCTURE were completely consistent with the PCA analysis. Furthermore, there is no obvious relationship between the results produced using sub-populational and geographical data. Among the three types of markers, SCoT markers has many advantages in terms of NPB, PPB, Rp, EMR, and MI. Nevertheless, the values of PIC showed different trends, with the highest values generated with EST-SSR, followed by SCoT and SRAP. The average band informativeness showed similar trends. Correlation between genetic distances produced by three different molecular markers were very small, thus it is not recommended to use a single marker to evaluate genetic diversity and population structure. It is hence suggested that combining of different types of molecular markers should be used to evaluate the genetic diversity and population structure. It also seems crucial to screen out, for each type of molecular markers, core markers of Camellia sinensis . This study revealed that genes of exotic plant varieties have been constantly integrated into the gene pool of Qinba area tea. A low level of genetic diversity was observed; this is shown by an average coefficient of genetic similarity of 0.74.
ESTuber db: an online database for Tuber borchii EST sequences.
Lazzari, Barbara; Caprera, Andrea; Cosentino, Cristian; Stella, Alessandra; Milanesi, Luciano; Viotti, Angelo
2007-03-08
The ESTuber database (http://www.itb.cnr.it/estuber) includes 3,271 Tuber borchii expressed sequence tags (EST). The dataset consists of 2,389 sequences from an in-house prepared cDNA library from truffle vegetative hyphae, and 882 sequences downloaded from GenBank and representing four libraries from white truffle mycelia and ascocarps at different developmental stages. An automated pipeline was prepared to process EST sequences using public software integrated by in-house developed Perl scripts. Data were collected in a MySQL database, which can be queried via a php-based web interface. Sequences included in the ESTuber db were clustered and annotated against three databases: the GenBank nr database, the UniProtKB database and a third in-house prepared database of fungi genomic sequences. An algorithm was implemented to infer statistical classification among Gene Ontology categories from the ontology occurrences deduced from the annotation procedure against the UniProtKB database. Ontologies were also deduced from the annotation of more than 130,000 EST sequences from five filamentous fungi, for intra-species comparison purposes. Further analyses were performed on the ESTuber db dataset, including tandem repeats search and comparison of the putative protein dataset inferred from the EST sequences to the PROSITE database for protein patterns identification. All the analyses were performed both on the complete sequence dataset and on the contig consensus sequences generated by the EST assembly procedure. The resulting web site is a resource of data and links related to truffle expressed genes. The Sequence Report and Contig Report pages are the web interface core structures which, together with the Text search utility and the Blast utility, allow easy access to the data stored in the database.
Hou, Lu; Cui, Yanhong; Li, Xiang; Chen, Wu; Zhang, Zhiyong; Pang, Xiaoming; Li, Yingyue
2018-01-01
Thuja koraiensis Nakai is an endangered conifer of high economic and ecological value in Jilin Province, China. However, studies on its population structure and conservation genetics have been limited by the lack of genomic data. Here, 37,761 microsatellites (simple sequence repeat, SSR) were detected based on 875,792 de novo-assembled contigs using a restriction-associated DNA (RAD) approach. Among these SSRs, 300 were randomly selected to test for polymorphisms and 96 obtained loci were able to amplify a fragment of expected size. Twelve polymorphic SSR markers were developed to analyze the genetic diversity and population structure of three natural populations. High genetic diversity (mean NA = 5.481, HE = 0.548) and moderate population differentiation (pairwise Fst = 0.048–0.078, Nm = 2.940–4.958) were found in this species. Molecular variance analysis suggested that most of the variation (83%) existed within populations. Combining the results of STRUCTURE, principal coordinate, and neighbor-joining analysis, the 232 individuals were divided into three genetic clusters that generally correlated with their geographical distributions. Finally, appropriate conservation strategies were proposed to protect this species. This study provides genetic information for the natural resource conservation and utilization of T. koraiensis and will facilitate further studies of the evolution and phylogeography of the species. PMID:29673217
2011-01-01
Background One of the key goals of oak genomics research is to identify genes of adaptive significance. This information may help to improve the conservation of adaptive genetic variation and the management of forests to increase their health and productivity. Deep-coverage large-insert genomic libraries are a crucial tool for attaining this objective. We report herein the construction of a BAC library for Quercus robur, its characterization and an analysis of BAC end sequences. Results The EcoRI library generated consisted of 92,160 clones, 7% of which had no insert. Levels of chloroplast and mitochondrial contamination were below 3% and 1%, respectively. Mean clone insert size was estimated at 135 kb. The library represents 12 haploid genome equivalents and, the likelihood of finding a particular oak sequence of interest is greater than 99%. Genome coverage was confirmed by PCR screening of the library with 60 unique genetic loci sampled from the genetic linkage map. In total, about 20,000 high-quality BAC end sequences (BESs) were generated by sequencing 15,000 clones. Roughly 5.88% of the combined BAC end sequence length corresponded to known retroelements while ab initio repeat detection methods identified 41 additional repeats. Collectively, characterized and novel repeats account for roughly 8.94% of the genome. Further analysis of the BESs revealed 1,823 putative genes suggesting at least 29,340 genes in the oak genome. BESs were aligned with the genome sequences of Arabidopsis thaliana, Vitis vinifera and Populus trichocarpa. One putative collinear microsyntenic region encoding an alcohol acyl transferase protein was observed between oak and chromosome 2 of V. vinifera. Conclusions This BAC library provides a new resource for genomic studies, including SSR marker development, physical mapping, comparative genomics and genome sequencing. BES analysis provided insight into the structure of the oak genome. These sequences will be used in the assembly of a future genome sequence for oak. PMID:21645357
Cuadrado, A; Cardoso, M; Jouve, N
2008-01-01
A significant fraction of the nuclear DNA of all eukaryotes is occupied by simple sequence repeats (SSRs) or microsatellites. This type of sequence has sparked great interest as a means of studying genetic variation, linkage mapping, gene tagging and evolution. Although SSRs at different positions in a gene help determine the regulation of expression and the function of the protein produced, little attention has been paid to the chromosomal organisation and distribution of these sequences, even in model species. This review discusses the main achievements in the characterisation of long-range SSR organisation in the chromosomes of Triticum aestivum L., Secale cereale L., and Hordeum vulgare L. (all members of Triticeae). We have detected SSRs using an improved FISH technique based on the random primer labelling of synthetic oligonucleotides (15-24 bases) in multi-colour experiments. Detailed information on the presence and distribution of AC, AG and all the possible classes of trinucleotide repeats has been acquired. These data have revealed the motif-dependent and non-random chromosome distributions of SSRs in the different genomes, and allowed the correlation of particular SSRs with chromosome areas characterised by specific features (e.g., heterochromatin, euchromatin and centromeres) in all three species. The present review provides a detailed comparative study of the distribution of these SSRs in each of the seven chromosomes of the genomes A, B and D of wheat, H of barley and R of rye. The importance of SSRs in plant breeding and their possible role in chromosome structure, function and evolution is discussed. 2008 S. Karger AG, Basel
Qin, Hongde; Chen, Min; Yi, Xianda; Bie, Shu; Zhang, Cheng; Zhang, Youchang; Lan, Jiayang; Meng, Yanyan; Yuan, Youlu; Jiao, Chunhai
2015-01-01
Detecting QTLs (quantitative trait loci) that enhance cotton yield and fiber quality traits and accelerate breeding has been the focus of many cotton breeders. In the present study, 359 SSR (simple sequence repeat) markers were used for the association mapping of 241 Upland cotton collections. A total of 333 markers, representing 733 polymorphic loci, were detected. The average linkage disequilibrium (LD) decay distances were 8.58 cM (r2 > 0.1) and 5.76 cM (r2 > 0.2). 241 collections were arranged into two subgroups using STRUCTURE software. Mixed linear modeling (MLM) methods (with population structure (Q) and relative kinship matrix (K)) were applied to analyze four phenotypic datasets obtained from four environments (two different locations and two years). Forty-six markers associated with the number of bolls per plant (NB), boll weight (BW), lint percentage (LP), fiber length (FL), fiber strength (FS) and fiber micornaire value (FM) were repeatedly detected in at least two environments. Of 46 associated markers, 32 were identified as new association markers, and 14 had been previously reported in the literature. Nine association markers were near QTLs (at a distance of less than 1-2 LD decay on the reference map) that had been previously described. These results provide new useful markers for marker-assisted selection in breeding programs and new insights for understanding the genetic basis of Upland cotton yields and fiber quality traits at the whole-genome level.
Xue, Yufei; Chen, Baojun; Wang, Rui; Win, Aung Naing; Li, Jiana; Chai, Yourong
2018-02-01
Rapeseed (Brassica napus) is an important oilseed crop worldwide, and fatty acid (FA) compositions determine the nutritional and economic value of its seed oil. Fatty acid desaturases (FADs) play a pivotal role in regulating FA compositions, but to date, no comprehensive genome-wide analysis of FAD gene family in rapeseed and its parent species has been reported. In this study, using homology searches, 84, 45, and 44 FAD genes were identified in rapeseed, Brassica rapa, and Brassica oleracea genomes, respectively. These FAD genes were unevenly located in 17 chromosomes and 2 scaffolds of rapeseed, 9 chromosomes and 1 scaffold of B. rapa, and all the chromosomes of B. oleracea. Phylogenetic analysis showed that the soluble and membrane-bound FADs in the three Brassica species were divided into four and six subfamilies, respectively. Generally, the soluble FADs contained two conserved histidine boxes, while three highly conserved histidine boxes were harbored in membrane-bound FADs. Exon-intron structure, intron phase, and motif composition and position were highly conserved in each FAD subfamily. Putative subcellular locations of FAD proteins in three Brassica species were consistent with those of corresponding known FADs. In total, 25 of simple sequence repeat (SSR) loci were found in FAD genes of the three Brassica species. Transcripts of selected FAD genes in the three species were examined in various organs/tissues or stress treatments from NCBI expressed sequence tag (EST) database. This study provides a critical molecular basis for quality improvement of rapeseed oil and facilitates our understanding of key roles of FAD genes in plant growth and development and stress response.
Identification of mildew resistance in wild and cultivated Central Asian grape germplasm
2013-01-01
Background Cultivated grapevines, Vitis vinifera subsp. sativa, evolved from their wild relative, V. vinifera subsp. sylvestris. They were domesticated in Central Asia in the absence of the powdery mildew fungus, Erysiphe necator, which is thought to have originated in North America. However, powdery mildew resistance has previously been discovered in two Central Asian cultivars and in Chinese Vitis species. Results A set of 380 unique genotypes were evaluated with data generated from 34 simple sequence repeat (SSR) markers. The set included 306 V. vinifera cultivars, 40 accessions of V. vinifera subsp. sylvestris, and 34 accessions of Vitis species from northern Pakistan, Afghanistan and China. Based on the presence of four SSR alleles previously identified as linked to the powdery mildew resistance locus, Ren1, 10 new mildew resistant genotypes were identified in the test set: eight were V. vinifera cultivars and two were V. vinifera subsp. sylvestris based on flower and seed morphology. Sequence comparison of a 620 bp region that includes the Ren1-linked allele (143 bp) of the co-segregating SSR marker SC8-0071-014, revealed that the ten newly identified genotypes have sequences that are essentially identical to the previously identified mildew resistant V. vinifera cultivars: ‘Kishmish vatkana’ and ‘Karadzhandal’. Kinship analysis determined that three of the newly identified powdery mildew resistant accessions had a relationship with ‘Kishmish vatkana’ and ‘Karadzhandal’, and that six were not related to any other accession in this study set. Clustering procedures assigned accessions into three groups: 1) Chinese species; 2) a mixed group of cultivated and wild V. vinifera; and 3) table grape cultivars, including nine of the powdery mildew resistant accessions. Gene flow was detected among the groups. Conclusions This study provides evidence that powdery mildew resistance is present in V. vinifera subsp. sylvestris, the dioecious wild progenitor of the cultivated grape. Four first-degree parent progeny relationships were discovered among the hermaphroditic powdery mildew resistant cultivars, supporting the existence of intentional grape breeding efforts. Although several Chinese grape species are resistant to powdery mildew, no direct genetic link to the resistance found in V. vinifera could be established. PMID:24093598
Recurrence time statistics: versatile tools for genomic DNA sequence analysis.
Cao, Yinhe; Tung, Wen-Wen; Gao, J B
2004-01-01
With the completion of the human and a few model organisms' genomes, and the genomes of many other organisms waiting to be sequenced, it has become increasingly important to develop faster computational tools which are capable of easily identifying the structures and extracting features from DNA sequences. One of the more important structures in a DNA sequence is repeat-related. Often they have to be masked before protein coding regions along a DNA sequence are to be identified or redundant expressed sequence tags (ESTs) are to be sequenced. Here we report a novel recurrence time based method for sequence analysis. The method can conveniently study all kinds of periodicity and exhaustively find all repeat-related features from a genomic DNA sequence. An efficient codon index is also derived from the recurrence time statistics, which has the salient features of being largely species-independent and working well on very short sequences. Efficient codon indices are key elements of successful gene finding algorithms, and are particularly useful for determining whether a suspected EST belongs to a coding or non-coding region. We illustrate the power of the method by studying the genomes of E. coli, the yeast S. cervisivae, the nematode worm C. elegans, and the human, Homo sapiens. Computationally, our method is very efficient. It allows us to carry out analysis of genomes on the whole genomic scale by a PC.
Avvaru, Akshay Kumar; Sowpati, Divya Tej; Mishra, Rakesh Kumar
2018-03-15
Microsatellites or Simple Sequence Repeats (SSRs) are short tandem repeats of DNA motifs present in all genomes. They have long been used for a variety of purposes in the areas of population genetics, genotyping, marker-assisted selection and forensics. Numerous studies have highlighted their functional roles in genome organization and gene regulation. Though several tools are currently available to identify SSRs from genomic sequences, they have significant limitations. We present a novel algorithm called PERF for extremely fast and comprehensive identification of microsatellites from DNA sequences of any size. PERF is several fold faster than existing algorithms and uses up to 5-fold lesser memory. It provides a clean and flexible command-line interface to change the default settings, and produces output in an easily-parseable tab-separated format. In addition, PERF generates an interactive and stand-alone HTML report with charts and tables for easy downstream analysis. PERF is implemented in the Python programming language. It is freely available on PyPI under the package name perf_ssr, and can be installed directly using pip or easy_install. The documentation of PERF is available at https://github.com/rkmlab/perf. The source code of PERF is deposited in GitHub at https://github.com/rkmlab/perf under an MIT license. tej@ccmb.res.in. Supplementary data are available at Bioinformatics online.
Vatanparast, Mohammad; Shetty, Prateek; Chopra, Ratan; Doyle, Jeff J.; Sathyanarayana, N.; Egan, Ashley N.
2016-01-01
Winged bean, Psophocarpus tetragonolobus (L.) DC., is similar to soybean in yield and nutritional value but more viable in tropical conditions. Here, we strengthen genetic resources for this orphan crop by producing a de novo transcriptome assembly and annotation of two Sri Lankan accessions (denoted herein as CPP34 [PI 491423] and CPP37 [PI 639033]), developing simple sequence repeat (SSR) markers, and identifying single nucleotide polymorphisms (SNPs) between geographically separated genotypes. A combined assembly based on 804,757 reads from two accessions produced 16,115 contigs with an N50 of 889 bp, over 90% of which has significant sequence similarity to other legumes. Combining contigs with singletons produced 97,241 transcripts. We identified 12,956 SSRs, including 2,594 repeats for which primers were designed and 5,190 high-confidence SNPs between Sri Lankan and Nigerian genotypes. The transcriptomic data sets generated here provide new resources for gene discovery and marker development in this orphan crop, and will be vital for future plant breeding efforts. We also analyzed the soybean trypsin inhibitor (STI) gene family, important plant defense genes, in the context of related legumes and found evidence for radiation of the Kunitz trypsin inhibitor (KTI) gene family within winged bean. PMID:27356763
Michael, Todd P.; Park, Sohyun; Kim, Tae-Sung; Booth, Jim; Byer, Amanda; Sun, Qi; Chory, Joanne; Lee, Kwangwon
2007-01-01
Background WHITE COLLAR-1 (WC-1) mediates interactions between the circadian clock and the environment by acting as both a core clock component and as a blue light photoreceptor in Neurospora crassa. Loss of the amino-terminal polyglutamine (NpolyQ) domain in WC-1 results in an arrhythmic circadian clock; this data is consistent with this simple sequence repeat (SSR) being essential for clock function. Methodology/Principal Findings Since SSRs are often polymorphic in length across natural populations, we reasoned that investigating natural variation of the WC-1 NpolyQ may provide insight into its role in the circadian clock. We observed significant phenotypic variation in the period, phase and temperature compensation of circadian regulated asexual conidiation across 143 N. crassa accessions. In addition to the NpolyQ, we identified two other simple sequence repeats in WC-1. The sizes of all three WC-1 SSRs correlated with polymorphisms in other clock genes, latitude and circadian period length. Furthermore, in a cross between two N. crassa accessions, the WC-1 NpolyQ co-segregated with period length. Conclusions/Significance Natural variation of the WC-1 NpolyQ suggests a mechanism by which period length can be varied and selected for by the local environment that does not deleteriously affect WC-1 activity. Understanding natural variation in the N. crassa circadian clock will facilitate an understanding of how fungi exploit their environments. PMID:17726525
New microsatellite loci for pomegranate, Punica granatum (Lythraceae).
Currò, Sergio; Caruso, Marco; Distefano, Gaetano; Gentile, Alessandra; La Malfa, Stefano
2010-07-01
A new set of pomegranate microsatellites was selected and characterized to assess the level of genetic diversity among cultivars and wild genotypes. • Nine Simple Sequence Repeat (SSR) markers were obtained using the Microsatellite-AFLP technique and were successfully amplified in 34 genotypes belonging to Italian, Spanish, and Turkish germplasm collections. The number of alleles per locus ranged from 1 to 5, and the total number of alleles was 22. • Because only a few codominant markers are available for this species, the newly identified SSRs will facilitate genetic diversity studies, fingerprinting, and mapping. In addition, the 9 loci successfully amplified in P. granatum var. nana. No cross transferability was observed for Cuphea micropetala and Lagerstroemia indica (Lythraceae).
Genetic diversity and relationship analysis of Gossypium arboreum accessions.
Liu, F; Zhou, Z L; Wang, C Y; Wang, Y H; Cai, X Y; Wang, X X; Zhang, Z S; Wang, K B
2015-11-19
Simple sequence repeat techniques were used to identify the genetic diversity of 101 Gossypium arboreum accessions collected from India, Vietnam, and the southwest of China (Guizhou, Guangxi, and Yunnan provinces). Twenty-six pairs of SSR primers produced a total of 103 polymorphic loci with an average of 3.96 polymorphic loci per primer. The average of the effective number of alleles, Nei's gene diversity, and Shannon's information index were 0.59, 0.2835, and 0.4361, respectively. The diversity varied among different geographic regions. The result of principal component analysis was consistent with that of unweighted pair group method with arithmetic mean clustering analysis. The 101 G. arboreum accessions were clustered into 2 groups.
Fitts, Douglas A
2017-09-21
The variable criteria sequential stopping rule (vcSSR) is an efficient way to add sample size to planned ANOVA tests while holding the observed rate of Type I errors, α o , constant. The only difference from regular null hypothesis testing is that criteria for stopping the experiment are obtained from a table based on the desired power, rate of Type I errors, and beginning sample size. The vcSSR was developed using between-subjects ANOVAs, but it should work with p values from any type of F test. In the present study, the α o remained constant at the nominal level when using the previously published table of criteria with repeated measures designs with various numbers of treatments per subject, Type I error rates, values of ρ, and four different sample size models. New power curves allow researchers to select the optimal sample size model for a repeated measures experiment. The criteria held α o constant either when used with a multiple correlation that varied the sample size model and the number of predictor variables, or when used with MANOVA with multiple groups and two levels of a within-subject variable at various levels of ρ. Although not recommended for use with χ 2 tests such as the Friedman rank ANOVA test, the vcSSR produces predictable results based on the relation between F and χ 2 . Together, the data confirm the view that the vcSSR can be used to control Type I errors during sequential sampling with any t- or F-statistic rather than being restricted to certain ANOVA designs.
Lee, Imchang; Chalita, Mauricio; Ha, Sung-Min; Na, Seong-In; Yoon, Seok-Hwan; Chun, Jongsik
2017-06-01
Thanks to the recent advancement of DNA sequencing technology, the cost and time of prokaryotic genome sequencing have been dramatically decreased. It has repeatedly been reported that genome sequencing using high-throughput next-generation sequencing is prone to contaminations due to its high depth of sequencing coverage. Although a few bioinformatics tools are available to detect potential contaminations, these have inherited limitations as they only use protein-coding genes. Here we introduce a new algorithm, called ContEst16S, to detect potential contaminations using 16S rRNA genes from genome assemblies. We screened 69 745 prokaryotic genomes from the NCBI Assembly Database using ContEst16S and found that 594 were contaminated by bacteria, human and plants. Of the predicted contaminated genomes, 8 % were not predicted by the existing protein-coding gene-based tool, implying that both methods can be complementary in the detection of contaminations. A web-based service of the algorithm is available at www.ezbiocloud.net/tools/contest16s.
Genetic linkage map and QTL identification for adventitious rooting traits in red gum eucalypts.
Sumathi, Murugan; Bachpai, Vijaya Kumar Waman; Mayavel, A; Dasgupta, Modhumita Ghosh; Nagarajan, Binai; Rajasugunasekar, D; Sivakumar, Veerasamy; Yasodha, Ramasamy
2018-05-01
The eucalypt species, Eucalyptus tereticornis and Eucalyptus camaldulensis , show tolerance to drought and salinity conditions, respectively, and are widely cultivated in arid and semiarid regions of tropical countries. In this study, genetic linkage map was developed for interspecific cross E. tereticornis × E. camaldulensis using pseudo-testcross strategy with simple sequence repeats (SSRs), intersimple sequence repeats (ISSRs), and sequence-related amplified polymorphism (SRAP) markers. The consensus genetic map comprised totally 283 markers with 84 SSRs, 94 ISSRs, and 105 SRAP markers on 11 linkage groups spanning 1163.4 cM genetic distance. Blasting the SSR sequences against E. grandis sequences allowed an alignment of 64% and the average ratio of genetic-to-physical distance was 1.7 Mbp/cM, which strengths the evidence that high amount of synteny and colinearity exists among eucalypts genome. Blast searches also revealed that 37% of SSRs had homologies with genes, which could potentially be used in the variety of downstream applications including candidate gene polymorphism. Quantitative trait loci (QTL) analysis for adventitious rooting traits revealed six QTL for rooting percent and root length on five chromosomes with interval and composite interval mapping. All the QTL explained 12.0-14.7% of the phenotypic variance, showing the involvement of major effect QTL on adventitious rooting traits. Increasing the density of markers would facilitate the detection of more number of small-effect QTL and also underpinning the genes involved in rooting process.
Moolhuijzen, P; Cakir, M; Hunter, A; Schibeci, D; Macgregor, A; Smith, C; Francki, M; Jones, M G K; Appels, R; Bellgard, M
2006-06-01
The identification of markers in legume pasture crops, which can be associated with traits such as protein and lipid production, disease resistance, and reduced pod shattering, is generally accepted as an important strategy for improving the agronomic performance of these crops. It has been demonstrated that many quantitative trait loci (QTLs) identified in one species can be found in other plant species. Detailed legume comparative genomic analyses can characterize the genome organization between model legume species (e.g., Medicago truncatula, Lotus japonicus) and economically important crops such as soybean (Glycine max), pea (Pisum sativum), chickpea (Cicer arietinum), and lupin (Lupinus angustifolius), thereby identifying candidate gene markers that can be used to track QTLs in lupin and pasture legume breeding. LegumeDB is a Web-based bioinformatics resource for legume researchers. LegumeDB analysis of Medicago truncatula expressed sequence tags (ESTs) has identified novel simple sequence repeat (SSR) markers (16 tested), some of which have been putatively linked to symbiosome membrane proteins in root nodules and cell-wall proteins important in plant-pathogen defence mechanisms. These novel markers by preliminary PCR assays have been detected in Medicago truncatula and detected in at least one other legume species, Lotus japonicus, Glycine max, Cicer arietinum, and (or) Lupinus angustifolius (15/16 tested). Ongoing research has validated some of these markers to map them in a range of legume species that can then be used to compile composite genetic and physical maps. In this paper, we outline the features and capabilities of LegumeDB as an interactive application that provides legume genetic and physical comparative maps, and the efficient feature identification and annotation of the vast tracks of model legume sequences for convenient data integration and visualization. LegumeDB has been used to identify potential novel cross-genera polymorphic legume markers that map to agronomic traits, supporting the accelerated identification of molecular genetic factors underpinning important agronomic attributes in lupin.
Multiplexed microsatellite recovery using massively parallel sequencing
Jennings, T.N.; Knaus, B.J.; Mullins, T.D.; Haig, S.M.; Cronn, R.C.
2011-01-01
Conservation and management of natural populations requires accurate and inexpensive genotyping methods. Traditional microsatellite, or simple sequence repeat (SSR), marker analysis remains a popular genotyping method because of the comparatively low cost of marker development, ease of analysis and high power of genotype discrimination. With the availability of massively parallel sequencing (MPS), it is now possible to sequence microsatellite-enriched genomic libraries in multiplex pools. To test this approach, we prepared seven microsatellite-enriched, barcoded genomic libraries from diverse taxa (two conifer trees, five birds) and sequenced these on one lane of the Illumina Genome Analyzer using paired-end 80-bp reads. In this experiment, we screened 6.1 million sequences and identified 356958 unique microreads that contained di- or trinucleotide microsatellites. Examination of four species shows that our conversion rate from raw sequences to polymorphic markers compares favourably to Sanger- and 454-based methods. The advantage of multiplexed MPS is that the staggering capacity of modern microread sequencing is spread across many libraries; this reduces sample preparation and sequencing costs to less than $400 (USD) per species. This price is sufficiently low that microsatellite libraries could be prepared and sequenced for all 1373 organisms listed as 'threatened' and 'endangered' in the United States for under $0.5M (USD).
Protein Degradation in a TX-TL Cell-free Expression System Using ClpXP Protease
2014-07-14
function in TX-TL, as well as bacteriophage assembly [2, 6]. Circuits can also be prototyped from basic parts within 8 hours, avoiding cloning and...mRFP, and Venus and variants eGFP-ssrA, mRFP-ssrA, and Venus-ssrA, coding sequences were cloned into a T7-lacO inducible vector containing a N...12672L12677.! 6.! Shin,!J.,!P.!Jardine,!and!V.!Noireaux,!Genome(Replication,(Synthesis,(and( Assembly(of(the( Bacteriophage (T7(in(a(Single(Cell9Free
Kumar, Bhupender; Talukdar, Akshay; Verma, Khushbu; Bala, Indu; Harish, G D; Gowda, Sarmrat; Lal, S K; Sapra, R L; Singh, K P
2015-02-01
Yellow Mosaic Virus (YMV) is a serious disease of soybean. Resistance to YMV was mapped in 180 soybean genotypes through association mapping approach using 121 simple sequence repeats (SSR) and four resistance gene analogue (RGA)-based markers. The association mapping population (AMP) (96 genotypes) and confirmation population (CP) (84 genotypes) was tested for resistance to YMV at hot-spot consecutively for 3 years (2007-2009). The genotypes exhibited significant variability for YMV resistance (P < 0.01). Molecular genotyping and population structure analysis with 'admixture' co-ancestry model detected seven optimal sub-populations in the AMP. Linkage disequilibrium (LD) between the markers extended up to 35 and 10 cM with r2 > 0.15, and >0.25, respectively. The 4 RGA-based markers showed no association with YMV resistance. Two SSR markers, Satt301 and GMHSP179 on chromosome 17 were found to be in significant LD with YMV resistance. Contingency Chi-square test confirmed the association (P < 0.01) and the utility of the markers was validated in the CP. It would pave the way for marker assisted selection for YMV resistance in soybean. This is the first report of its kind in soybean.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Greenspan, D.S.; Northrup, H.; Au, K.S.
1995-02-10
COL5A1, the gene for the {alpha}1 chain of type V collagen, has been considered a candidate gene for certain diseases based on chromosomal location and/or disease phenotype. We have employed 3{prime}-untranslated region RFLPs to exclude COL5A1 as a candidate gene in families with tuberous sclerosis 1, Ehlers-Danlos syndrome type H, and nail-patella syndrome. In addition, we describe a polymorphic simple sequence repeat (SSR) within a COL5A1 intron. This SSR is used to exclude COL5A1 as a candidate gene in hereditary hemorrhagic telangiectasia (Osler-Rendu-Weber disease) and to add COL5A1 to the existing map of {open_quotes}index{close_quotes} markers of chromosome 9 by evaluationmore » of the COL5A1 locus on the CEPH 40-family reference pedigree set. This genetic mapping places COL5A1 between markers D9S66 and D9S67. 14 refs., 1 fig., 2 tabs.« less
Hodel, Richard G. J.; Segovia-Salcedo, M. Claudia; Landis, Jacob B.; Crowl, Andrew A.; Sun, Miao; Liu, Xiaoxian; Gitzendanner, Matthew A.; Douglas, Norman A.; Germain-Aubrey, Charlotte C.; Chen, Shichao; Soltis, Douglas E.; Soltis, Pamela S.
2016-01-01
Microsatellites, or simple sequence repeats (SSRs), have long played a major role in genetic studies due to their typically high polymorphism. They have diverse applications, including genome mapping, forensics, ascertaining parentage, population and conservation genetics, identification of the parentage of polyploids, and phylogeography. We compare SSRs and newer methods, such as genotyping by sequencing (GBS) and restriction site associated DNA sequencing (RAD-Seq), and offer recommendations for researchers considering which genetic markers to use. We also review the variety of techniques currently used for identifying microsatellite loci and developing primers, with a particular focus on those that make use of next-generation sequencing (NGS). Additionally, we review software for microsatellite development and report on an experiment to assess the utility of currently available software for SSR development. Finally, we discuss the future of microsatellites and make recommendations for researchers preparing to use microsatellites. We argue that microsatellites still have an important place in the genomic age as they remain effective and cost-efficient markers. PMID:27347456
Thakur, Ajay Kumar; Singh, Kunwar Harendra; Singh, Lal; Nanjundan, Joghee; Khan, Yasin Jeshima; Singh, Dhiraj
2018-01-01
Oilseed Brassica represents an important group of oilseed crops with a long history of evolution and cultivation. To understand the origin and evolution of Brassica amphidiploids, simple sequence repeat (SSR) markers were used to unravel genetic variations in three diploids and three amphidiploid Brassica species of U's triangle along with Eruca sativa as an outlier. Of 124 Brassica-derived SSR loci assayed, 100% cross-transferability was obtained for B. juncea and three subspecies of B. rapa , while lowest cross-transferability (91.93%) was obtained for Eruca sativa . The average % age of cross-transferability across all the seven species was 98.15%. The number of alleles detected at each locus ranged from one to six with an average of 3.41 alleles per primer pair. Neighbor-Joining-based dendrogram divided all the 40 accessions into two main groups composed of B. juncea / B. nigra/B. rapa and B. carinata/B. napus/B. oleracea . C-genome of oilseed Brassica species remained relatively more conserved than A- and B-genome. A- genome present in B. juncea and B. napus seems distinct from each other and hence provides great opportunity for generating diversity through synthesizing amphidiploids from different sources of A- genome. B. juncea had least intra-specific distance indicating narrow genetic base. B. rapa appears to be more primitive species from which other two diploid species might have evolved. The SSR marker set developed in this study will assist in DNA fingerprinting of various Brassica species cultivars, evaluating the genetic diversity in Brassica germplasm, genome mapping and construction of linkage maps, gene tagging and various other genomics-related studies in Brassica species. Further, the evolutionary relationship established among various Brassica species would assist in formulating suitable breeding strategies for widening the genetic base of Brassica amphidiploids by exploiting the genetic diversity present in diploid progenitor gene pools.
Huang, Qingyang; Beharav, Alex; Li, Youchun; Kirzhner, Valery; Nevo, Eviatar
2002-12-01
Genetic diversity at 38 microsatellite (short sequence repeats (SSRs)) loci was studied in a sample of 54 plants representing a natural population of wild barley, Hordeum spontaneum, at the Neve Yaar microsite in Israel. Wild barley at the microsite was organized in a mosaic pattern over an area of 3180 m2 in the open Tabor oak forest, which was subdivided into four microniches: (i) sun-rock (11 genotypes), (ii) sun-soil (18 genotypes), (iii) shade-soil (11 genotypes), and (iv) shade-rock (14 genotypes). Fifty-four genotypes were tested for ecological-genetic microniche correlates. Analysis of 36 loci showed that allele distributions at SSR loci were nonrandom but structured by ecological stresses (climatic and edaphic). Sixteen (45.7%) of 35 polymorphic loci varied significantly (p < 0.05) in allele frequencies among the microniches. Significant genetic divergence and diversity were found among the four subpopulations. The soil and shade subpopulations showed higher genetic diversities at SSR loci than the rock and sun subpopulations, and the lowest genetic diversity was observed in the sun-rock subpopulation, in contrast with the previous allozyme and RAPD studies. On average, of 36 loci, 88.75% of the total genetic diversity exists within the four microniches, while 11.25% exists between the microniches. In a permutation test, G(ST) was lower for 4999 out of 5000 randomized data sets (p < 0.001) when compared with real data (0.1125). The highest genetic distance was between shade-soil and sun-rock (D = 0.222). Our results suggest that diversifying natural selection may act upon some regulatory regions, resulting in adaptive SSR divergence. Fixation of some loci (GMS61, GMS1, and EBMAC824) at a specific microniche seems to suggest directional selection. The pattern of other SSR loci suggests the operation of balancing selection. SSRs may be either direct targets of selection or markers of selected haplotypes (selective sweep).
Molecular Mapping of Stripe Rust Resistance Gene Yr76 in Winter Club Wheat Cultivar Tyee.
Xiang, C; Feng, J Y; Wang, M N; Chen, X M; See, D R; Wan, A M; Wang, T
2016-10-01
Tyee, one of the wheat cultivars used to differentiate races of Puccinia striiformis f. sp. tritici in the United States, was identified to have a single gene for all-stage resistance, tentatively named YrTye. To map the gene, Tyee was crossed with 'Avocet Susceptible' (AvS). Genetic analysis of the F 1 , F 2 , F 2:3 , and BC 1 progenies confirmed a single dominant gene for resistance to race PSTv-37 that is avirulent to YrTye. A mapping population of 135 F 2 plants was phenotyped with PSTv-37 and the derived F 2:3 lines were tested with races PSTv-37, PSTv-40, and PSTv-79. The F 2 mapping population was genotyped with simple sequence repeat (SSR) markers. A genetic map comprising 13 SSR markers located YrTye in chromosome 3AS flanked distally by SSR marker wmc11 and proximally by wmc532 at 2.6 and 3.4 cM, respectively. Amplification of Chinese Spring 3A deletion lines placed the gene in the distal bin 3AS4-0.45 to 1.00. Because YrTye is different from all formally named Yr genes in chromosomal location, we permanently name the gene Yr76. A near-isogenic line of spring common wheat was developed and selected by testing F 3 lines derived from a AvS*4/Tyee cross with Tyee-avirulent and virulent races and the flanking markers. The specific SSR alleles flanking Yr76 were validated using cultivars and breeding lines with and without the gene, and showed high polymorphisms. The specificity of Yr76 is useful in differentiating P. striiformis f. sp. tritici races, and its tightly linked markers will be useful in developing resistant cultivars when combining the gene with other genes for resistance to stripe rust.
Kaur, Kuljit; Sharma, Vikas; Singh, Vijay; Wani, Mohammad Saleem; Gupta, Raghbir Chand
2016-12-01
Tribulus terrestris L., commonly called puncture vine and gokhru, is an important member of Zygophyllaceae. The species is highly important in context to therapeutic uses and provides important active principles responsible for treatment of various diseases and also used as tonic. It is widely distributed in tropical regions of India and the world. However, status of its genetic diversity remained concealed due to lack of research work in this species. In present study, genetic diversity and structure of different populations of T. terrestris from north India was examined at molecular level using newly developed Simple Sequence Repeat (SSR) markers. In total, 20 primers produced 48 alleles in a size range of 100-500 bp with maximum (4) fragments amplified by TTMS-1, TTMS-25 and TTMS-33. Mean Polymorphism Information Content (PIC) and Marker Index (MI) were 0.368 and 1.01, respectively. Dendrogram showed three groups, one of which was purely containing accessions from Rajasthan while other two groups corresponded to Punjab and Haryana regions with intermixing of few other accessions. Analysis of molecular variance partitioned 76 % genetic variance within populations and 24 % among populations. Bayesian model based STRUCTURE analysis detected two genetic stocks for analyzed germplasm and also detected some admixed individuals. Different geographical populations of this species showed high level of genetic diversity. Results of present study can be useful in identifying diverse accessions and management of this plant resource. Moreover, the novel SSR markers developed can be utilized for various genetic analyses in this species in future.
Ting, Ngoot-Chin; Jansen, Johannes; Nagappan, Jayanthi; Ishak, Zamzuri; Chin, Cheuk-Weng; Tan, Soon-Guan; Cheah, Suan-Choo; Singh, Rajinder
2013-01-01
Clonal reproduction of oil palm by means of tissue culture is a very inefficient process. Tissue culturability is known to be genotype dependent with some genotypes being more amenable to tissue culture than others. In this study, genetic linkage maps enriched with simple sequence repeat (SSR) markers were developed for dura (ENL48) and pisifera (ML161), the two fruit forms of oil palm, Elaeis guineensis. The SSR markers were mapped onto earlier reported parental maps based on amplified fragment length polymorphism (AFLP) and restriction fragment length polymorphism (RFLP) markers. The new linkage map of ENL48 contains 148 markers (33 AFLPs, 38 RFLPs and 77 SSRs) in 23 linkage groups (LGs), covering a total map length of 798.0 cM. The ML161 map contains 240 markers (50 AFLPs, 71 RFLPs and 119 SSRs) in 24 LGs covering a total of 1,328.1 cM. Using the improved maps, two quantitative trait loci (QTLs) associated with tissue culturability were identified each for callusing rate and embryogenesis rate. A QTL for callogenesis was identified in LGD4b of ENL48 and explained 17.5% of the phenotypic variation. For embryogenesis rate, a QTL was detected on LGP16b in ML161 and explained 20.1% of the variation. This study is the first attempt to identify QTL associated with tissue culture amenity in oil palm which is an important step towards understanding the molecular processes underlying clonal regeneration of oil palm. PMID:23382832
Genetic diversity analysis of common beans based on molecular markers
Gill-Langarica, Homar R.; Muruaga-Martínez, José S.; Vargas-Vázquez, M.L. Patricia; Rosales-Serna, Rigoberto; Mayek-Pérez, Netzahualcoyotl
2011-01-01
A core collection of the common bean (Phaseolus vulgaris L.), representing genetic diversity in the entire Mexican holding, is kept at the INIFAP (Instituto Nacional de Investigaciones Forestales, Agricolas y Pecuarias, Mexico) Germplasm Bank. After evaluation, the genetic structure of this collection (200 accessions) was compared with that of landraces from the states of Oaxaca, Chiapas and Veracruz (10 genotypes from each), as well as a further 10 cultivars, by means of four amplified fragment length polymorphisms (AFLP) +3/+3 primer combinations and seven simple sequence repeats (SSR) loci, in order to define genetic diversity, variability and mutual relationships. Data underwent cluster (UPGMA) and molecular variance (AMOVA) analyses. AFLP analysis produced 530 bands (88.5% polymorphic) while SSR primers amplified 174 alleles, all polymorphic (8.2 alleles per locus). AFLP indicated that the highest genetic diversity was to be found in ten commercial-seed classes from two major groups of accessions from Central Mexico and Chiapas, which seems to be an important center of diversity in the south. A third group included genotypes from Nueva Granada, Mesoamerica, Jalisco and Durango races. Here, SSR analysis indicated a reduced number of shared haplotypes among accessions, whereas the highest genetic components of AMOVA variation were found within accessions. Genetic diversity observed in the common-bean core collection represents an important sample of the total Phaseolus genetic variability at the main Germplasm Bank of INIFAP. Molecular marker strategies could contribute to a better understanding of the genetic structure of the core collection as well as to its improvement and validation. PMID:22215964
Genetic diversity analysis of common beans based on molecular markers.
Gill-Langarica, Homar R; Muruaga-Martínez, José S; Vargas-Vázquez, M L Patricia; Rosales-Serna, Rigoberto; Mayek-Pérez, Netzahualcoyotl
2011-10-01
A core collection of the common bean (Phaseolus vulgaris L.), representing genetic diversity in the entire Mexican holding, is kept at the INIFAP (Instituto Nacional de Investigaciones Forestales, Agricolas y Pecuarias, Mexico) Germplasm Bank. After evaluation, the genetic structure of this collection (200 accessions) was compared with that of landraces from the states of Oaxaca, Chiapas and Veracruz (10 genotypes from each), as well as a further 10 cultivars, by means of four amplified fragment length polymorphisms (AFLP) +3/+3 primer combinations and seven simple sequence repeats (SSR) loci, in order to define genetic diversity, variability and mutual relationships. Data underwent cluster (UPGMA) and molecular variance (AMOVA) analyses. AFLP analysis produced 530 bands (88.5% polymorphic) while SSR primers amplified 174 alleles, all polymorphic (8.2 alleles per locus). AFLP indicated that the highest genetic diversity was to be found in ten commercial-seed classes from two major groups of accessions from Central Mexico and Chiapas, which seems to be an important center of diversity in the south. A third group included genotypes from Nueva Granada, Mesoamerica, Jalisco and Durango races. Here, SSR analysis indicated a reduced number of shared haplotypes among accessions, whereas the highest genetic components of AMOVA variation were found within accessions. Genetic diversity observed in the common-bean core collection represents an important sample of the total Phaseolus genetic variability at the main Germplasm Bank of INIFAP. Molecular marker strategies could contribute to a better understanding of the genetic structure of the core collection as well as to its improvement and validation.
Asea, Godfrey; Vivek, Bindiganavile S; Bigirwa, George; Lipps, Patrick E; Pratt, Richard C
2009-05-01
Maize production in sub-Saharan Africa incurs serious losses to epiphytotics of foliar diseases. Quantitative trait loci conditioning partial resistance (rQTL) to infection by causal agents of gray leaf spot (GLS), northern corn leaf blight (NCLB), and maize streak have been reported. Our objectives were to identify simple-sequence repeat (SSR) molecular markers linked to consensus rQTL and one recently identified rQTL associated with GLS, and to determine their suitability as tools for selection of improved host resistance. We conducted evaluations of disease severity phenotypes in separate field nurseries, each containing 410 F2:3 families derived from a cross between maize inbred CML202 (NCLB and maize streak resistant) and VP31 (a GLS-resistant breeding line) that possess complimentary rQTL. F2:3 families were selected for resistance based on genotypic (SSR marker), phenotypic, or combined data and the selected F3:4 families were reevaluated. Phenotypic values associated with SSR markers for consensus rQTL in bins 4.08 for GLS, 5.04 for NCLB, and 1.04 for maize streak significantly reduced disease severity in both generations based on single-factor analysis of variance and marker-interval analysis. These results were consistent with the presence of homozygous resistant parent alleles, except in bin 8.06, where markers were contributed by the NCLB-susceptible parent. Only one marker associated with resistance could be confirmed in bins 2.09 (GLS) and 3.06 (NCLB), illustrating the need for more robust rQTL discovery, fine-mapping, and validation prior to undertaking marker-based selection.
Xu, Xiaomei; Chao, Juan; Cheng, Xueli; Wang, Rui; Sun, Baojuan; Wang, Hengming; Luo, Shaobo; Xu, Xiaowan; Wu, Tingquan; Li, Ying
2016-01-01
Phytophthora root rot caused by Phytophthora capsici (P. capsici) is a serious limitation to pepper production in Southern China, with high temperature and humidity. Mapping PRR resistance genes can provide linked DNA markers for breeding PRR resistant varieties by molecular marker-assisted selection (MAS). Two BC1 populations and an F2 population derived from a cross between P. capsici-resistant accession, Criollo de Morelos 334 (CM334) and P. capsici-susceptible accession, New Mexico Capsicum Accession 10399 (NMCA10399) were used to investigate the genetic characteristics of PRR resistance. PRR resistance to isolate Byl4 (race 3) was controlled by a single dominant gene, PhR10, that was mapped to an interval of 16.39Mb at the end of the long arm of chromosome 10. Integration of bulked segregant analysis (BSA) and Specific Length Amplified Fragment sequencing (SLAF-seq) provided an efficient genetic mapping strategy. Ten polymorphic Simple Sequence Repeat (SSR) markers were found within this region and used to screen the genotypes of 636 BC1 plants, delimiting PhR10 to a 2.57 Mb interval between markers P52-11-21 (1.5 cM away) and P52-11-41 (1.1 cM). A total of 163 genes were annotated within this region and 31 were predicted to be associated with disease resistance. PhR10 is a novel race specific gene for PRR, and this paper describes linked SSR markers suitable for marker-assisted selection of PRR resistant varieties, also laying a foundation for cloning the resistance gene.
Cieslarová, Jaroslava; Hanáček, Pavel; Fialová, Eva; Hýbl, Miroslav; Smýkal, Petr
2011-11-01
Microsatellites, or simple sequence repeats (SSRs) are widespread class of repetitive DNA sequences, used in population genetics, genetic diversity and mapping studies. In spite of the SSR utility, the genetic and evolutionary mechanisms are not fully understood. We have investigated three microsatellite loci with different position in the pea (Pisum sativum L.) genome, the A9 locus residing in LTR region of abundant retrotransposon, AD270 as intergenic and AF016458 located in 5'untranslated region of expressed gene. Comparative analysis of a 35 pair samples from seven pea varieties propagated by single-seed descent for ten generations, revealed single 4 bp mutation in 10th generation sample at AD270 locus corresponding to stepwise increase in one additional ATCT repeat unit. The estimated mutation rate was 4.76 × 10(-3) per locus per generation, with a 95% confidence interval of 1.2 × 10(-4) to 2.7 × 10(-2). The comparison of cv. Bohatýr accessions retrieved from different collections, showed intra-, inter-accession variation and differences in flanking and repeat sequences. Fragment size and sequence alternations were also found in long term in vitro organogenic culture, established at 1983, indicative of somatic mutation process. The evidence of homoplasy was detected across of unrelated pea genotypes, which adversaly affects the reliability of diversity estimates not only for diverse germplasm but also highly bred material. The findings of this study have important implications for Pisum phylogeny studies, variety identification and registration process in pea breeding where mutation rate influences the genetic diversity and the effective population size estimates.
Kaya, Hilal Betul; Cetin, Oznur; Kaya, Hulya; Sahin, Mustafa; Sefer, Filiz; Kahraman, Abdullah; Tanyolac, Bahattin
2013-01-01
Background The olive tree (Olea europaea L.) is a diploid (2n = 2x = 46) outcrossing species mainly grown in the Mediterranean area, where it is the most important oil-producing crop. Because of its economic, cultural and ecological importance, various DNA markers have been used in the olive to characterize and elucidate homonyms, synonyms and unknown accessions. However, a comprehensive characterization and a full sequence of its transcriptome are unavailable, leading to the importance of an efficient large-scale single nucleotide polymorphism (SNP) discovery in olive. The objectives of this study were (1) to discover olive SNPs using next-generation sequencing and to identify SNP primers for cultivar identification and (2) to characterize 96 olive genotypes originating from different regions of Turkey. Methodology/Principal Findings Next-generation sequencing technology was used with five distinct olive genotypes and generated cDNA, producing 126,542,413 reads using an Illumina Genome Analyzer IIx. Following quality and size trimming, the high-quality reads were assembled into 22,052 contigs with an average length of 1,321 bases and 45 singletons. The SNPs were filtered and 2,987 high-quality putative SNP primers were identified. The assembled sequences and singletons were subjected to BLAST similarity searches and annotated with a Gene Ontology identifier. To identify the 96 olive genotypes, these SNP primers were applied to the genotypes in combination with amplified fragment length polymorphism (AFLP) and simple sequence repeats (SSR) markers. Conclusions/Significance This study marks the highest number of SNP markers discovered to date from olive genotypes using transcriptome sequencing. The developed SNP markers will provide a useful source for molecular genetic studies, such as genetic diversity and characterization, high density quantitative trait locus (QTL) analysis, association mapping and map-based gene cloning in the olive. High levels of genetic variation among Turkish olive genotypes revealed by SNPs, AFLPs and SSRs allowed us to characterize the Turkish olive genotype. PMID:24058483
Yi, Zhenzhen; Song, Weibo; Clamp, John C; Chen, Zigui; Gao, Shan; Zhang, Qianqian
2009-03-01
Comprehensive molecular analyses of phylogenetic relationships within euplotid ciliates are relatively rare, and the relationships among some families remain questionable. We performed phylogenetic analyses of the order Euplotida based on new sequences of the gene coding for small-subunit RNA (SSrRNA) from a variety of taxa across the entire order as well as sequences from some of these taxa of other genes (ITS1-5.8S-ITS2 region and histone H4) that have not been included in previous analyses. Phylogenetic trees based on SSrRNA gene sequences constructed with four different methods had a consistent branching pattern that included the following features: (1) the "typical" euplotids comprised a paraphyletic assemblage composed of two divergent clades (family Uronychiidae and families Euplotidae-Certesiidae-Aspidiscidae-Gastrocirrhidae), (2) in the family Uronychiidae, the genera Uronychia and Paradiophrys formed a clearly outlined, well-supported clade that seemed to be rather divergent from Diophrys and Diophryopsis, suggesting that the Diophrys-complex may have had a longer and more separate evolutionary history than previously supposed, (3) inclusion of 12 new SSrRNA sequences in analyses of Euplotidae revealed two new clades of species within the family and cast additional doubt on the present classification of genera within the family, and (4) the intraspecific divergence among five species of Aspidisca was far greater than those of closely related genera. The ITS1-5.8S-ITS2 coding regions and partial histone H4 genes of six morphospecies in the Diophrys-complex were sequenced along with their SSrRNA genes and used to compare phylogenies constructed from single data sets to those constructed from combined sets. Results indicated that combined analyses could be used to construct more reliable, less ambiguous phylogenies of complex groups like the order Euplotida, because they provide a greater amount and diversity of information.
Sharma, Vishakha; Nandineni, Madhusudan R
2014-04-01
Potato (Solanum tuberosum) is an important non-cereal crop throughout the world and is highly recommended for ensuring global food security. Owing to the complexities in genetics and inheritance pattern of potato, the conventional method of cross breeding for developing improved varieties has been difficult. Identification and tagging of desirable traits with informative molecular markers would aid in the development of improved varieties. Insertional polymorphism of copia-like and gypsy-like long terminal repeat retrotransposons (RTN) were investigated among 47 potato varieties from India using Inter-Retrotransposon Amplified Polymorphism (IRAP) and Retrotransposon Microsatellite Amplified Polymorphism (REMAP) marker techniques and were compared with the DNA profiles obtained with simple sequence repeats (SSRs). The genetic polymorphism, efficiency of polymorphism and effectiveness of marker systems were evaluated to assess the extent of genetic diversity among Indian potato varieties. A total of 139 polymorphic SSR alleles, 270 IRAP and 98 REMAP polymorphic bands, showing polymorphism of 100%, 87.9% and 68.5%, respectively, were used for detailed characterization of the genetic relationships among potato varieties by using cluster analysis and principal coordinate analysis (PCoA). IRAP analysis resulted in the highest number of polymorphic bands with an average of 15 polymorphic bands per assay unit when compared to the other two marker systems. Based on pair-wise comparison, the genetic similarity was calculated using Dice similarity coefficient. The SSRs showed a wide range in genetic similarity values (0.485-0.971) as compared to IRAP (0.69-0.911) and REMAP (0.713-0.947). A Mantel's matrix correspondence test showed a high positive correlation (r=0.6) between IRAP and REMAP, an intermediate value (r=0.58) for IRAP and SSR and the lowest value (r=0.17) for SSR and REMAP. Statistically significant cophenetic correlation coefficient values, of 0.961, 0.941 and 0.905 were observed for REMAP, IRAP and SSR, respectively. The widespread presence and distinct DNA profiles for copia-like and gypsy-like RTNs in the examined genotypes indicate that these elements are active in the genome and may have even contributed to the potato genome organization. Although the three marker systems were capable of distinguishing all the 47 varieties; high reproducibility, low cost and ease of DNA profiling data collection make IRAP and REMAP markers highly efficient whole-genome scanning molecular probes for population genetic studies. Information obtained from the present study regarding the genetic association and distinctiveness provides an useful guide for selection of germplasm for plant breeding and conservation efforts. Copyright © 2014. Published by Elsevier Inc.
Diversity of garlic (Allium sativum L.) using SSR, EST and AFLP markers
USDA-ARS?s Scientific Manuscript database
Germplasm from the center of origin/diversity is important for the breeding and fingerprinting crop plants. In this study we utilized both dominant and co-dominant markers for the characterization of garlic samples from diverse geographic origins to assess the relative utility of these markers to id...
Use of EST-SSR loci flanking regions for phylogenetic analysis of genus Arachis
USDA-ARS?s Scientific Manuscript database
All wild peanut collections in the genus Arachis were assigned to nine taxonomy sections on the bases of cross-compatibility and morphologic character clustering. These nine sections consist of 80 species from the most ancient to the most advanced, providing a diverse genetic resource for phylogenet...
Development and genetic mapping of SSR markers in foxtail millet [Setaria italica (L.) P. Beauv.].
Jia, Xiaoping; Zhang, Zhongbao; Liu, Yinghui; Zhang, Chengwei; Shi, Yunsu; Song, Yanchun; Wang, Tianyu; Li, Yu
2009-02-01
SSR markers are desirable markers in analysis of genetic diversity, quantitative trait loci mapping and gene locating. In this study, SSR markers were developed from two genomic libraries enriched for (GA)n and (CA)n of foxtail millet [Setaria italica (L.) P. Beauv.], a crop of historical importance in China. A total of 100 SSR markers among the 193 primer pairs detected polymorphism between two mapping parents of an F(2) population, i.e. "B100" of cultivated S. italica and "A10" of wild S. viridis. Excluding 14 markers with unclear amplifications, and five markers unlinked with any linkage group, a foxtail millet SSR linkage map was constructed by integrating 81 new developed SSR markers with 20 RFLP anchored markers. The 81 SSRs covered nine chromosomes of foxtail millet. The length of the map was 1,654 cM, with an average interval distance between markers of 16.4 cM. The 81 SSR markers were not evenly distributed throughout the nine chromosomes, with Ch.8 harbouring the least (3 markers) and Ch.9 harbouring the most (18 markers). To verify the usefulness of the SSR markers developed, 37 SSR markers were randomly chosen to analyze genetic diversity of 40 foxtail millet accessions. Totally 228 alleles were detected, with an average 6.16 alleles per locus. Polymorphism information content (PIC) value for each locus ranged from 0.413 to 0.847, with an average of 0.697. A positive correlation between PIC and number of alleles and between PIC and number of repeat unit were found [0.802 and 0.429, respectively (P < 0.01)]. UPGMA analysis revealed that the 40 foxtail millet cultivars could be grouped into five clusters in which the landraces' grouping was largely consistent with ecotypes while the breeding varieties from different provinces in China tended to be grouped together.
Sahu, Dinesh K; Panda, Soumya P; Panda, Sujata; Das, Paramananda; Meher, Prem K; Hazra, Rupenangshu K; Peatman, Eric; Liu, Zhanjiang J; Eknath, Ambekar E; Nandi, Samiran
2013-07-15
Labeo rohita (Ham.) also called rohu is the most important freshwater aquaculture species on the Indian sub continent. Monsoon dependent breeding restricts its seed production beyond season indicating a strong genetic control about which very limited information is available. Additionally, few genomic resources are publicly available for this species. Here we sought to identify reproduction-relevant genes from normalized cDNA libraries of the brain-pituitary-gonad-liver (BPGL-axis) tissues of adult L. rohita collected during post preparatory phase. 6161 random clones sequenced (Sanger-based) from these libraries produced 4642 (75.34%) high-quality sequences. They were assembled into 3631 (78.22%) unique sequences composed of 709 contigs and 2922 singletons. A total of 182 unique sequences were found to be associated with reproduction-related genes, mainly under the GO term categories of reproduction, neuro-peptide hormone activity, hormone and receptor binding, receptor activity, signal transduction, embryonic development, cell-cell signaling, cell death and anti-apoptosis process. Several important reproduction-related genes reported here for the first time in L. rohita are zona pellucida sperm-binding protein 3, aquaporin-12, spermine oxidase, sperm associated antigen 7, testis expressed 261, progesterone receptor membrane component, Neuropeptide Y and Pro-opiomelanocortin. Quantitative RT-PCR-based analyses of 8 known and 8 unknown transcripts during preparatory and post-spawning phase showed increased expression level of most of the transcripts during preparatory phase (except Neuropeptide Y) in comparison to post-spawning phase indicating possible roles in initiation of gonad maturation. Expression of unknown transcripts was also found in prolific breeder common carp and tilapia, but levels of expression were much higher in seasonal breeder rohu. 3631 unique sequences contained 236 (6.49%) putative microsatellites with the AG (28.16%) repeat as the most frequent motif. Twenty loci showed polymorphism in 36 unrelated individuals with allele frequency ranging from 2 to 7 per locus. The observed heterozygosity ranged from 0.096 to 0.774 whereas the expected heterozygosity ranged from 0.109 to 0.801. Identification of 182 important reproduction-related genes and expression pattern of 16 transcripts in preparatory and post-spawning phase along with 20 polymorphic EST-SSRs should be highly useful for the future reproductive molecular studies and selection program in Labeo rohita. Copyright © 2013 Elsevier B.V. All rights reserved.
Zhang, Yu; Yan, Haidong; Jiang, Xiaomei; Wang, Xiaoli; Huang, Linkai; Xu, Bin; Zhang, Xinquan; Zhang, Lexin
2016-01-01
To evaluate genetic variation, population structure, and the extent of linkage disequilibrium (LD), 134 switchgrass ( Panicum virgatum L.) samples were analyzed with 51 markers, including 16 ISSRs, 20 SCoTs, and 15 EST-SSRs. In this study, a high level of genetic variation was observed in the switchgrass samples and they had an average Nei's gene diversity index (H) of 0.311. A total of 793 bands were obtained, of which 708 (89.28 %) were polymorphic. Using a parameter marker index (MI), the efficiency of the three types of markers (ISSR, SCoT, and EST-SSR) in the study were compared and we found that SCoT had a higher marker efficiency than the other two markers. The 134 switchgrass samples could be divided into two sub-populations based on STRUCTURE, UPGMA clustering, and principal coordinate analyses (PCA), and upland and lowland ecotypes could be separated by UPGMA clustering and PCA analyses. Linkage disequilibrium analysis revealed an average r 2 of 0.035 across all 51 markers, indicating a trend of higher LD in sub-population 2 than that in sub-population 1 ( P < 0.01). The population structure revealed in this study will guide the design of future association studies using these switchgrass samples.
Kumar, P Natraj; Sujatha, K; Laha, G S; Rao, K Srinivasa; Mishra, B; Viraktamath, B C; Hari, Y; Reddy, C S; Balachandran, S M; Ram, T; Madhav, M Sheshu; Rani, N Shobha; Neeraja, C N; Reddy, G Ashok; Shaik, H; Sundaram, R M
2012-02-01
Broadening of the genetic base for identification and transfer of genes for resistance to insect pests and diseases from wild relatives of rice is an important strategy in resistance breeding programs across the world. An accession of Oryza nivara, International Rice Germplasm Collection (IRGC) accession number 105710, was identified to exhibit high level and broad-spectrum resistance to Xanthomonas oryzae pv. oryzae. In order to study the genetics of resistance and to tag and map the resistance gene or genes present in IRGC 105710, it was crossed with the bacterial blight (BB)-susceptible varieties 'TN1' and 'Samba Mahsuri' (SM) and then backcrossed to generate backcross mapping populations. Analysis of these populations and their progeny testing revealed that a single dominant gene controls resistance in IRGC 105710. The BC(1)F(2) population derived from the cross IRGC 105710/TN1//TN1 was screened with a set of 72 polymorphic simple-sequence repeat (SSR) markers distributed across the rice genome and the resistance gene was coarse mapped on chromosome 7 between the SSR markers RM5711 and RM6728 at a genetic distance of 17.0 and 19.3 centimorgans (cM), respectively. After analysis involving 49 SSR markers located between the genomic interval spanned by RM5711 and RM6728, and BC(2)F(2) population consisting of 2,011 individuals derived from the cross IRGC 105710/TN1//TN1, the gene was fine mapped between two SSR markers (RMWR7.1 and RMWR7.6) located at a genetic distance of 0.9 and 1.2 cM, respectively, from the gene and flanking it. The linkage distances were validated in a BC(1)F(2) mapping population derived from the cross IRGC 105710/SM//2 × SM. The BB resistance gene present in the O. nivara accession was identified to be novel based on its unique map location on chromosome 7 and wider spectrum of BB resistance; this gene has been named Xa33. The genomic region between the two closely flanking SSR markers was in silico analyzed for putatively expressed candidate genes. In total, eight genes were identified in the region and a putative gene encoding serinethreonine kinase appears to be a candidate for the Xa33 gene.
Resistance Potential of Bread Wheat Genotypes Against Yellow Rust Disease Under Egyptian Climate.
Mahmoud, Amer F; Hassan, Mohamed I; Amein, Karam A
2015-12-01
Yellow rust (stripe rust), caused by Puccinia striiformis f. sp. tritici, is one of the most destructive foliar diseases of wheat in Egypt and worldwide. In order to identify wheat genotypes resistant to yellow rust and develop molecular markers associated with the resistance, fifty F8 recombinant inbred lines (RILs) derived from a cross between resistant and susceptible bread wheat landraces were obtained. Artificial infection of Puccinia striiformis was performed under greenhouse conditions during two growing seasons and relative resistance index (RRI) was calculated. Two Egyptian bread wheat cultivars i.e. Giza-168 (resistant) and Sakha-69 (susceptible) were also evaluated. RRI values of two-year trial showed that 10 RILs responded with RRI value >6 <9 with an average of 7.29, which exceeded the Egyptian bread wheat cultivar Giza-168 (5.58). Thirty three RILs were included among the acceptable range having RRI value >2 <6. However, only 7 RILs showed RRI value <2. Five RILs expressed hypersensitive type of resistance (R) against the pathogen and showed the lowest Average Coefficient of Infection (ACI). Bulked segregant analysis (BSA) with eight simple sequence repeat (SSR), eight sequence-related amplified polymorphism (SRAP) and sixteen random amplified polymorphic DNA (RAPD) markers revealed that three SSR, three SRAP and six RAPD markers were found to be associated with the resistance to yellow rust. However, further molecular analyses would be performed to confirm markers associated with the resistance and suitable for marker-assisted selection. Resistant RILs identified in the study could be efficiently used to improve the resistance to yellow rust in wheat.
Elucidating polyploidization of bermudagrasses as assessed by organelle and nuclear DNA markers.
Gulsen, Osman; Ceylan, Ahmet
2011-12-01
Clarification of relationships among ploidy series of Cynodon accessions could be beneficial to bermudagrass breeding programs, and would enhance our understanding of the evolutionary biology of this warm season grass species. This study was initiated to elucidate polyploidization among Cynodon accessions with different ploidy series collected from Turkey based on chloroplast and nuclear DNA. Forty Cynodon accessions including 7 diploids, 3 triploids, 10 tetraploids, 11 pentaploids, and 9 hexaploids were analyzed using chloroplast DNA restriction fragment-length polymorphism (cpDNA RFLP), chloroplast DNA simple sequence repeat (cpDNA SSR), and nuclear DNA markers based on neighbor-joining (NJ) and principle component analyses (PCA). All three-marker systems with two statistical algorithms clustered the diploids apart from the other ploidy levels. Assuming autopolyploidy, spontaneous polyploidization followed by rapid diversification among the higher ploidy levels than the diploids is likely in Cynodon's evolution. Few tetraploid and hexaploid accessions were clustered with or closely to the group of diploids, supporting the hypothesis above. Eleven haplotypes as estimated by cpDNA RFLP and SSR markers were detected. This study indicated that the diploids had different organelle genome from the rest of the ploidy series and provided valuable insight into relationships among ploidy series of Cynodon accessions based on cp and nuclear DNAs.
Helmersson, Andreas; von Arnold, Sara; Burg, Kornel; Bozhkov, Peter V
2004-10-01
Somatic embryos of Norway spruce (Picea abies (L.) Karst.) differentiate from proembryogenic masses (PEMs), which are subject to autodestruction through programmed cell death. In PEMs, somatic embryo formation and activation of programmed cell death are interrelated processes. We sought to determine if activation of programmed cell death in PEMs is caused by genetic aberrations during somatic embryogenesis. Based on the finding that withdrawal of auxin and cytokinin induces programmed cell death in PEMs, 1-week-old cell suspensions were cultured in medium either with or without auxin and cytokinin and then transferred to maturation medium containing abscisic acid. We analyzed the stability of three nuclear simple sequence repeat (SSR) microsatellite markers at successive stages of somatic embryogenesis in two cell lines. There were no mutations at the SSR loci at any of the successive developmental stages from PEMs to cotyledonary embryos, irrespective of whether or not the proliferation medium in which cell suspensions had been cultured contained auxin or cytokinin. The morphologies of plants regenerated from the cultures were similar, although withdrawal of auxin and cytokinin significantly stimulated the yield of both embryos and plants. We conclude, therefore, that the high genetic stability of somatic embryos in Norway spruce is unaffected by the induction of programmed cell death caused by withdrawal of auxin and cytokinin.
Development of a method for detection and quantification of B. brongniartii and B. bassiana in soil
NASA Astrophysics Data System (ADS)
Canfora, L.; Malusà, E.; Tkaczuk, C.; Tartanus, M.; Łabanowska, B. H.; Pinzari, F.
2016-03-01
A culture independent method based on qPCR was developed for the detection and quantification of two fungal inoculants in soil. The aim was to adapt a genotyping approach based on SSR (Simple Sequence Repeat) marker to a discriminating tracing of two different species of bioinoculants in soil, after their in-field release. Two entomopathogenic fungi, Beauveria bassiana and B. brongniartii, were traced and quantified in soil samples obtained from field trials. These two fungal species were used as biological agents in Poland to control Melolontha melolontha (European cockchafer), whose larvae live in soil menacing horticultural crops. Specificity of SSR markers was verified using controls consisting of: i) soil samples containing fungal spores of B. bassiana and B. brongniartii in known dilutions; ii) the DNA of the fungal microorganisms; iii) soil samples singly inoculated with each fungus species. An initial evaluation of the protocol was performed with analyses of soil DNA and mycelial DNA. Further, the simultaneous detection and quantification of B. bassiana and B. brongniartii in soil was achieved in field samples after application of the bio-inoculants. The protocol can be considered as a relatively low cost solution for the detection, identification and traceability of fungal bio-inoculants in soil.
Development of a method for detection and quantification of B. brongniartii and B. bassiana in soil
Canfora, L.; Malusà, E.; Tkaczuk, C.; Tartanus, M.; Łabanowska, B.H.; Pinzari, F.
2016-01-01
A culture independent method based on qPCR was developed for the detection and quantification of two fungal inoculants in soil. The aim was to adapt a genotyping approach based on SSR (Simple Sequence Repeat) marker to a discriminating tracing of two different species of bioinoculants in soil, after their in-field release. Two entomopathogenic fungi, Beauveria bassiana and B. brongniartii, were traced and quantified in soil samples obtained from field trials. These two fungal species were used as biological agents in Poland to control Melolontha melolontha (European cockchafer), whose larvae live in soil menacing horticultural crops. Specificity of SSR markers was verified using controls consisting of: i) soil samples containing fungal spores of B. bassiana and B. brongniartii in known dilutions; ii) the DNA of the fungal microorganisms; iii) soil samples singly inoculated with each fungus species. An initial evaluation of the protocol was performed with analyses of soil DNA and mycelial DNA. Further, the simultaneous detection and quantification of B. bassiana and B. brongniartii in soil was achieved in field samples after application of the bio-inoculants. The protocol can be considered as a relatively low cost solution for the detection, identification and traceability of fungal bio-inoculants in soil. PMID:26975931
Song, Zhijiao; Zhang, Miaomiao; Li, Fagen; Weng, Qijie; Zhou, Chanpin; Li, Mei; Li, Jie; Huang, Huanhua; Mo, Xiaoyong; Gan, Siming
2016-01-01
Identification of loci or genes under natural selection is important for both understanding the genetic basis of local adaptation and practical applications, and genome scans provide a powerful means for such identification purposes. In this study, genome-wide simple sequence repeats markers (SSRs) were used to scan for molecular footprints of divergent selection in Eucalyptus grandis, a hardwood species occurring widely in costal areas from 32° S to 16° S in Australia. High population diversity levels and weak population structure were detected with putatively neutral genomic SSRs. Using three FST outlier detection methods, a total of 58 outlying SSRs were collectively identified as loci under divergent selection against three non-correlated climatic variables, namely, mean annual temperature, isothermality and annual precipitation. Using a spatial analysis method, nine significant associations were revealed between FST outlier allele frequencies and climatic variables, involving seven alleles from five SSR loci. Of the five significant SSRs, two (EUCeSSR1044 and Embra394) contained alleles of putative genes with known functional importance for response to climatic factors. Our study presents critical information on the population diversity and structure of the important woody species E. grandis and provides insight into the adaptive responses of perennial trees to climatic variations. PMID:27748400
Genetic differentiation and geographical Relationship of Asian barley landraces using SSRs
Naeem, Rehan; Dahleen, Lynn; Mirza, Bushra
2011-01-01
Genetic diversity in 403 morphologically distinct landraces of barley (Hordeum vulgare L. subsp. vulgare) originating from seven geographical zones of Asia was studied using simple sequence repeat (SSR) markers from regions of medium to high recombination in the barley genome. The seven polymorphic SSR markers representing each of the chromosomes chosen for the study revealed a high level of allelic diversity among the landraces. Genetic richness was highest in those from India, followed by Pakistan while it was lowest for Uzbekistan and Turkmenistan. Out of the 50 alleles detected, 15 were unique to a geographic region. Genetic diversity was highest for landraces from Pakistan (0.70 ± 0.06) and lowest for those from Uzbekistan (0.18 ± 0.17). Likewise, polymorphic information content (PIC) was highest for Pakistan (0.67 ± 0.06) and lowest for Uzbekistan (0.15 ± 0.17). Diversity among groups was 40% compared to 60% within groups. Principal component analysis clustered the barley landraces into three groups to predict their domestication patterns. In total 51.58% of the variation was explained by the first two principal components of the barley germplasm. Pakistan landraces were clustered separately from those of India, Iran, Nepal and Iraq, whereas those from Turkmenistan and Uzbekistan were clustered together into a separate group. PMID:21734828
Liu, Minxuan; Xu, Yue; He, Jihong; Zhang, Shuang; Wang, Yinyue; Lu, Ping
2016-01-01
Broomcorn millet (Panicum miliaceum L.), one of the first domesticated crops, has been grown in Northern China for at least 10,000 years. The species is presently a minor crop, and evaluation of its genetic diversity has been very limited. In this study, we analyzed the genetic diversity of 88 accessions of broomcorn millet collected from various provinces of China. Amplification with 67 simple sequence repeat (SSR) primers revealed moderate levels of diversity in the investigated accessions. A total of 179 alleles were detected, with an average of 2.7 alleles per locus. Polymorphism information content and expected heterozygosity ranged from 0.043 to 0.729 (mean = 0.376) and 0.045 to 0.771 (mean = 0.445), respectively. Cluster analysis based on the unweighted pair group method of mathematical averages separated the 88 accessions into four groups at a genetic similarity level of 0.633. A genetic structure assay indicated a close correlation between geographical regions and genetic diversity. The uncovered information will be valuable for defining gene pools and developing breeding programs for broomcorn millet. Furthermore, the millet-specific SSR markers developed in this study should serve as useful tools for assessment of genetic diversity and elucidation of population structure in broomcorn millet. PMID:26985894
QTL mapping of soybean oil content for marker-assisted selection in plant breeding program.
Leite, D C; Pinheiro, J B; Campos, J B; Di Mauro, A O; Unêda-Trevisoli, S H
2016-03-18
The present study was undertaken to detect and map the quantitative trait loci (QTL) related to soybean oil content. We used 244 progenies derived from a bi-parental cross of the Lineage 69 (from Universidade Estadual Paulista "Júlio de Mesquita Filho"/Faculdade de Ciências Agrárias e Veterinárias - Breeding Program) and Tucunaré cultivar. A total of 358 simple sequence repeat (SSR; microsatellite) markers were used to investigate the polymorphism between the parental lines, and for the polymorphic lines all the F2 individuals were tested. Evaluation of the oil content and phenotype was performed with the aid of a Tango equipment by near infra-red reflectance spectroscopy, using single F2 seeds and F2:3 progenies, in triplicate. The data were analyzed by QTL Cartographer program for 56 SSR polymorphic markers. Two oil-content related QTLs were detected on K and H linkage groups. The total phenotypic variation explained by QTLs ranged from 7.8 to 46.75% for oil content. New QTLs were identified for the oil content in addition to those previously identified in other studies. The results reported in this study show that regions different from those already known could be involved in the genetic control of soybean oil content.
The Physalis peruviana leaf transcriptome: assembly, annotation and gene model prediction
2012-01-01
Background Physalis peruviana commonly known as Cape gooseberry is a member of the Solanaceae family that has an increasing popularity due to its nutritional and medicinal values. A broad range of genomic tools is available for other Solanaceae, including tomato and potato. However, limited genomic resources are currently available for Cape gooseberry. Results We report the generation of a total of 652,614 P. peruviana Expressed Sequence Tags (ESTs), using 454 GS FLX Titanium technology. ESTs, with an average length of 371 bp, were obtained from a normalized leaf cDNA library prepared using a Colombian commercial variety. De novo assembling was performed to generate a collection of 24,014 isotigs and 110,921 singletons, with an average length of 1,638 bp and 354 bp, respectively. Functional annotation was performed using NCBI’s BLAST tools and Blast2GO, which identified putative functions for 21,191 assembled sequences, including gene families involved in all the major biological processes and molecular functions as well as defense response and amino acid metabolism pathways. Gene model predictions in P. peruviana were obtained by using the genomes of Solanum lycopersicum (tomato) and Solanum tuberosum (potato). We predict 9,436 P. peruviana sequences with multiple-exon models and conserved intron positions with respect to the potato and tomato genomes. Additionally, to study species diversity we developed 5,971 SSR markers from assembled ESTs. Conclusions We present the first comprehensive analysis of the Physalis peruviana leaf transcriptome, which will provide valuable resources for development of genetic tools in the species. Assembled transcripts with gene models could serve as potential candidates for marker discovery with a variety of applications including: functional diversity, conservation and improvement to increase productivity and fruit quality. P. peruviana was estimated to be phylogenetically branched out before the divergence of five other Solanaceae family members, S. lycopersicum, S. tuberosum, Capsicum spp, S. melongena and Petunia spp. PMID:22533342
The Physalis peruviana leaf transcriptome: assembly, annotation and gene model prediction.
Garzón-Martínez, Gina A; Zhu, Z Iris; Landsman, David; Barrero, Luz S; Mariño-Ramírez, Leonardo
2012-04-25
Physalis peruviana commonly known as Cape gooseberry is a member of the Solanaceae family that has an increasing popularity due to its nutritional and medicinal values. A broad range of genomic tools is available for other Solanaceae, including tomato and potato. However, limited genomic resources are currently available for Cape gooseberry. We report the generation of a total of 652,614 P. peruviana Expressed Sequence Tags (ESTs), using 454 GS FLX Titanium technology. ESTs, with an average length of 371 bp, were obtained from a normalized leaf cDNA library prepared using a Colombian commercial variety. De novo assembling was performed to generate a collection of 24,014 isotigs and 110,921 singletons, with an average length of 1,638 bp and 354 bp, respectively. Functional annotation was performed using NCBI's BLAST tools and Blast2GO, which identified putative functions for 21,191 assembled sequences, including gene families involved in all the major biological processes and molecular functions as well as defense response and amino acid metabolism pathways. Gene model predictions in P. peruviana were obtained by using the genomes of Solanum lycopersicum (tomato) and Solanum tuberosum (potato). We predict 9,436 P. peruviana sequences with multiple-exon models and conserved intron positions with respect to the potato and tomato genomes. Additionally, to study species diversity we developed 5,971 SSR markers from assembled ESTs. We present the first comprehensive analysis of the Physalis peruviana leaf transcriptome, which will provide valuable resources for development of genetic tools in the species. Assembled transcripts with gene models could serve as potential candidates for marker discovery with a variety of applications including: functional diversity, conservation and improvement to increase productivity and fruit quality. P. peruviana was estimated to be phylogenetically branched out before the divergence of five other Solanaceae family members, S. lycopersicum, S. tuberosum, Capsicum spp, S. melongena and Petunia spp.
da Silva, Rangeline Azevedo; Souza, Gustavo; Lemos, Lívia Santos Lima; Lopes, Uilson Vanderlei; Patrocínio, Nara Geórgia Ribeiro Braz; Alves, Rafael Moysés; Marcellino, Lucília Helena; Clement, Didier; Micheli, Fabienne
2017-01-01
The genus Theobroma comprises several trees species native to the Amazon. Theobroma cacao L. plays a key economic role mainly in the chocolate industry. Both cultivated and wild forms are described within the genus. Variations in genome size and chromosome number have been used for prediction purposes including the frequency of interspecific hybridization or inference about evolutionary relationships. In this study, the nuclear DNA content, karyotype and genetic diversity using functional microsatellites (EST-SSR) of seven Theobroma species were characterized. The nuclear content of DNA for all analyzed Theobroma species was 1C = ~ 0.46 pg. These species presented 2n = 20 with small chromosomes and only one pair of terminal heterochromatic bands positively stained (CMA+/DAPI− bands). The small size of Theobroma ssp. genomes was equivalent to other Byttnerioideae species, suggesting that the basal lineage of Malvaceae have smaller genomes and that there was an expansion of 2C values in the more specialized family clades. A set of 20 EST-SSR primers were characterized for related species of Theobroma, in which 12 loci were polymorphic. The polymorphism information content (PIC) ranged from 0.23 to 0.65, indicating a high level of information per locus. Combined results of flow cytometry, cytogenetic data and EST-SSRs markers will contribute to better describe the species and infer about the evolutionary relationships among Theobroma species. In addition, the importance of a core collection for conservation purposes is highlighted. PMID:28187131
da Silva, Rangeline Azevedo; Souza, Gustavo; Lemos, Lívia Santos Lima; Lopes, Uilson Vanderlei; Patrocínio, Nara Geórgia Ribeiro Braz; Alves, Rafael Moysés; Marcellino, Lucília Helena; Clement, Didier; Micheli, Fabienne; Gramacho, Karina Peres
2017-01-01
The genus Theobroma comprises several trees species native to the Amazon. Theobroma cacao L. plays a key economic role mainly in the chocolate industry. Both cultivated and wild forms are described within the genus. Variations in genome size and chromosome number have been used for prediction purposes including the frequency of interspecific hybridization or inference about evolutionary relationships. In this study, the nuclear DNA content, karyotype and genetic diversity using functional microsatellites (EST-SSR) of seven Theobroma species were characterized. The nuclear content of DNA for all analyzed Theobroma species was 1C = ~ 0.46 pg. These species presented 2n = 20 with small chromosomes and only one pair of terminal heterochromatic bands positively stained (CMA+/DAPI- bands). The small size of Theobroma ssp. genomes was equivalent to other Byttnerioideae species, suggesting that the basal lineage of Malvaceae have smaller genomes and that there was an expansion of 2C values in the more specialized family clades. A set of 20 EST-SSR primers were characterized for related species of Theobroma, in which 12 loci were polymorphic. The polymorphism information content (PIC) ranged from 0.23 to 0.65, indicating a high level of information per locus. Combined results of flow cytometry, cytogenetic data and EST-SSRs markers will contribute to better describe the species and infer about the evolutionary relationships among Theobroma species. In addition, the importance of a core collection for conservation purposes is highlighted.
Di Gaspero, G; Cipriani, G; Adam-Blondon, A-F; Testolin, R
2007-05-01
Genetic maps functionally oriented towards disease resistance have been constructed in grapevine by analysing with a simultaneous maximum-likelihood estimation of linkage 502 markers including microsatellites and resistance gene analogs (RGAs). Mapping material consisted of two pseudo-testcrosses, 'Chardonnay' x 'Bianca' and 'Cabernet Sauvignon' x '20/3' where the seed parents were Vitis vinifera genotypes and the male parents were Vitis hybrids carrying resistance to mildew diseases. Individual maps included 320-364 markers each. The simultaneous use of two mapping crosses made with two pairs of distantly related parents allowed mapping as much as 91% of the markers tested. The integrated map included 420 Simple Sequence Repeat (SSR) markers that identified 536 SSR loci and 82 RGA markers that identified 173 RGA loci. This map consisted of 19 linkage groups (LGs) corresponding to the grape haploid chromosome number, had a total length of 1,676 cM and a mean distance between adjacent loci of 3.6 cM. Single-locus SSR markers were randomly distributed over the map (CD = 1.12). RGA markers were found in 18 of the 19 LGs but most of them (83%) were clustered on seven LGs, namely groups 3, 7, 9, 12, 13, 18 and 19. Several RGA clusters mapped to chromosomal regions where phenotypic traits of resistance to fungal diseases such as downy mildew and powdery mildew, bacterial diseases such as Pierce's disease, and pests such as dagger and root-knot nematode, were previously mapped in different segregating populations. The high number of RGA markers integrated into this new map will help find markers linked to genetic determinants of different pest and disease resistances in grape.
Genetic Diversity of Aromatic Rice Germplasm Revealed By SSR Markers.
Jasim Aljumaili, Saba; Rafii, M Y; Latif, M A; Sakimin, Siti Zaharah; Arolu, Ibrahim Wasiu; Miah, Gous
2018-01-01
Aromatic rice cultivars constitute a small but special group of rice and are considered the best in terms of quality and aroma. Aroma is one of the most significant quality traits of rice, and variety with aroma has a higher price in the market. This research was carried out to study the genetic diversity among the 50 aromatic rice accessions from three regions (Peninsular Malaysia, Sabah, and Sarawak) with 3 released varieties as a control using the 32 simple sequence repeat (SSR) markers. The objectives of this research were to quantify the genetic divergence of aromatic rice accessions using SSR markers and to identify the potential accessions for introgression into the existing rice breeding program. Genetic diversity index among the three populations such as Shannon information index ( I ) ranged from 0.25 in control to 0.98 in Sabah population. The mean numbers of effective alleles and Shannon's information index were 0.36 and 64.90%, respectively. Similarly, the allelic diversity was very high with mean expected heterozygosity ( H e ) of 0.60 and mean Nei's gene diversity index of 0.36. The dendrogram based on UPGMA and Nei's genetic distance classified the 53 rice accessions into 10 clusters. Analysis of molecular variance (AMOVA) revealed that 89% of the total variation observed in this germplasm came from within the populations, while 11% of the variation emanated among the populations. These results reflect the high genetic differentiation existing in this aromatic rice germplasm. Using all these criteria and indices, seven accessions (Acc9993, Acc6288, Acc6893, Acc7580, Acc6009, Acc9956, and Acc11816) from three populations have been identified and selected for further evaluation before introgression into the existing breeding program and for future aromatic rice varietal development.
Genetic Diversity of Aromatic Rice Germplasm Revealed By SSR Markers
Jasim Aljumaili, Saba; Sakimin, Siti Zaharah; Arolu, Ibrahim Wasiu; Miah, Gous
2018-01-01
Aromatic rice cultivars constitute a small but special group of rice and are considered the best in terms of quality and aroma. Aroma is one of the most significant quality traits of rice, and variety with aroma has a higher price in the market. This research was carried out to study the genetic diversity among the 50 aromatic rice accessions from three regions (Peninsular Malaysia, Sabah, and Sarawak) with 3 released varieties as a control using the 32 simple sequence repeat (SSR) markers. The objectives of this research were to quantify the genetic divergence of aromatic rice accessions using SSR markers and to identify the potential accessions for introgression into the existing rice breeding program. Genetic diversity index among the three populations such as Shannon information index (I) ranged from 0.25 in control to 0.98 in Sabah population. The mean numbers of effective alleles and Shannon's information index were 0.36 and 64.90%, respectively. Similarly, the allelic diversity was very high with mean expected heterozygosity (He) of 0.60 and mean Nei's gene diversity index of 0.36. The dendrogram based on UPGMA and Nei's genetic distance classified the 53 rice accessions into 10 clusters. Analysis of molecular variance (AMOVA) revealed that 89% of the total variation observed in this germplasm came from within the populations, while 11% of the variation emanated among the populations. These results reflect the high genetic differentiation existing in this aromatic rice germplasm. Using all these criteria and indices, seven accessions (Acc9993, Acc6288, Acc6893, Acc7580, Acc6009, Acc9956, and Acc11816) from three populations have been identified and selected for further evaluation before introgression into the existing breeding program and for future aromatic rice varietal development. PMID:29736396
NASA Astrophysics Data System (ADS)
Wang, Xiaoxia; Xu, Wenteng; Liu, Yang; Wang, Lei; Sun, Hejun; Wang, Lei; Chen, Songlin
2016-11-01
In recent years, Edwardsiella tarda has become one of the most deadly pathogens of Japanese flounder ( Paralichthys olivaceus), causing serious annual losses in commercial production. In contrast to the rapid advances in the aquaculture of P. olivaceus, the study of E. tarda resistance-related markers has lagged behind, hindering the development of a disease-resistant strain. Thus, a marker-trait association analysis was initiated, combining bulked segregant analysis (BSA) and quantitative trait loci (QTL) mapping. Based on 180 microsatellite loci across all chromosomes, 106 individuals from the F1333 (♀: F0768 ×♂: F0915) (Nomenclature rule: F+year+family number) were used to detect simple sequence repeats (SSRs) and QTLs associated with E. tarda resistance. After a genomic scan, three markers (Scaffold 404-21589, Scaffold 404-21594 and Scaffold 270-13812) from the same linkage group (LG)-1 exhibited a significant difference between DNA, pooled/bulked from the resistant and susceptible groups (P <0.001). Therefore, 106 individuals were genotyped using all the SSR markers in LG1 by single marker analysis. Two different analytical models were then employed to detect SSR markers with different levels of significance in LG1, where 17 and 18 SSR markers were identified, respectively. Each model found three resistance-related QTLs by composite interval mapping (CIM). These six QTLs, designated qE1-6, explained 16.0%-89.5% of the phenotypic variance. Two of the QTLs, qE-2 and qE-4, were located at the 66.7 cM region, which was considered a major candidate region for E. tarda resistance. This study will provide valuable data for further investigations of E. tarda resistance genes and facilitate the selective breeding of disease-resistant Japanese flounder in the future.
Mapping of the apple scab-resistance gene Vb.
Erdin, N; Tartarini, S; Broggini, G A L; Gennari, F; Sansavini, S; Gessler, C; Patocchi, A
2006-10-01
Apple scab, caused by the fungus Venturia inaequalis, is the major production constraint in temperate zones with humid springs. Normally, its control relies on frequent and regular fungicide applications. Because this control strategy has come under increasing criticism, major efforts are being directed toward the breeding of scab-resistant apple cultivars. Modern apple breeding programs include the use of molecular markers, making it possible to combine several different scab-resistance genes in 1 apple cultivar (pyramiding) and to speed up the breeding process. The apple scab-resistance gene Vb is derived from the Siberian crab apple 'Hansen's baccata #2', and is 1 of the 6 "historical" major apple scab-resistance genes (Vf, Va, Vr, Vbj, Vm, and Vb). Molecular markers have been published for all these genes, except Vr. In testcross experiments conducted in the 1960s, it was reported that Vb segregated independently from 3 other major resistance genes, including Vf. Recently, however, Vb and Vf have both been mapped on linkage group 1, a result that contrasts with the findings from former testcross experiments. In this study, simple sequence repeat (SSR) markers were used to identify the precise position of Vb in a cross of 'Golden Delicious' (vbvb) and 'Hansen's baccata #2' (Vbvb). A genome scanning approach, a fast method already used to map apple scab-resistance genes Vr2 and Vm, was used, and the Vb locus was identified on linkage group 12, between the SSR markers Hi02d05 and Hi07f01. This finding confirms the independent segregation of Vb from Vf. With the identification of SSR markers linked to Vb, another major apple scab-resistance gene has become available; breeders can use it to develop durable resistant cultivars with several different resistance genes.
Wang, Jingguo; Jiang, Tingbo; Zou, Detang; Zhao, Hongwei; Li, Qiang; Liu, Hualong; Zhou, Changjun
2014-01-01
Genetic diversity and the relationship among nine japonica rice groups consisting of 288 landraces and varieties in different geographical origins of Northeast Asia (China, Japan, Korea, Democratic People's Republic of Korea) and the Russian Far East district of the Russian Federation were evaluated with 154 simple sequence repeat (SSR) markers. A total of 823 alleles were detected. The observed allele numbers (Na) per locus, Nei's gene diversity (He) and the polymorphism information content (PIC) ranged from 2 to 9, 0.061 to 0.869 and 0.060 to 0.856, with an average of 5.344, 0.624 and 0.586, respectively. Five SSR loci, RM1350, RM1369, RM257, RM336 and RM1374, provided the highest PIC values and are potential for exploring the genetic diversity of rice cultivars in Northeast Asia. Molecular variance analysis showed that a significant difference existed both among groups (91.6%) and within each group (8.4%). The low genetic variation within each group indicated that the gene pool is narrow and alien genetic variation should be introduced into the rice breeding program in Northeast Asia. Based on the He and PIC values, the nine groups were ranked in a descending order: Heilongjiang landraces, Jilin landraces, Japanese improved varieties, Heilongjiang improved varieties, Russian Far East district of the Russian Federation improved varieties, Liaoning improved varieties, Jilin improved varieties, Korean improved varieties and Democratic People's Republic of Korea improved varieties. The nine groups were further divided into three subgroups and the 288 varieties into five clusters. This study provided information for parent selection in order to broaden the gene pool of the japonica rice germplasm in Northeast Asia. PMID:26019508
Wang, Jingguo; Jiang, Tingbo; Zou, Detang; Zhao, Hongwei; Li, Qiang; Liu, Hualong; Zhou, Changjun
2014-03-04
Genetic diversity and the relationship among nine japonica rice groups consisting of 288 landraces and varieties in different geographical origins of Northeast Asia (China, Japan, Korea, Democratic People's Republic of Korea) and the Russian Far East district of the Russian Federation were evaluated with 154 simple sequence repeat (SSR) markers. A total of 823 alleles were detected. The observed allele numbers (Na) per locus, Nei's gene diversity (He) and the polymorphism information content (PIC) ranged from 2 to 9, 0.061 to 0.869 and 0.060 to 0.856, with an average of 5.344, 0.624 and 0.586, respectively. Five SSR loci, RM1350, RM1369, RM257, RM336 and RM1374, provided the highest PIC values and are potential for exploring the genetic diversity of rice cultivars in Northeast Asia. Molecular variance analysis showed that a significant difference existed both among groups (91.6%) and within each group (8.4%). The low genetic variation within each group indicated that the gene pool is narrow and alien genetic variation should be introduced into the rice breeding program in Northeast Asia. Based on the He and PIC values, the nine groups were ranked in a descending order: Heilongjiang landraces, Jilin landraces, Japanese improved varieties, Heilongjiang improved varieties, Russian Far East district of the Russian Federation improved varieties, Liaoning improved varieties, Jilin improved varieties, Korean improved varieties and Democratic People's Republic of Korea improved varieties. The nine groups were further divided into three subgroups and the 288 varieties into five clusters. This study provided information for parent selection in order to broaden the gene pool of the japonica rice germplasm in Northeast Asia.