Scaglione, Davide; Acquadro, Alberto; Portis, Ezio; Taylor, Christopher A; Lanteri, Sergio; Knapp, Steven J
2009-01-01
Background The globe artichoke (Cynara cardunculus var. scolymus L.) is a significant crop in the Mediterranean basin. Despite its commercial importance and its both dietary and pharmaceutical value, knowledge of its genetics and genomics remains scant. Microsatellite markers have become a key tool in genetic and genomic analysis, and we have exploited recently acquired EST (expressed sequence tag) sequence data (Composite Genome Project - CGP) to develop an extensive set of microsatellite markers. Results A unigene assembly was created from over 36,000 globe artichoke EST sequences, containing 6,621 contigs and 12,434 singletons. Over 12,000 of these unigenes were functionally assigned on the basis of homology with Arabidopsis thaliana reference proteins. A total of 4,219 perfect repeats, located within 3,308 unigenes was identified and the gene ontology (GO) analysis highlighted some GO term's enrichments among different classes of microsatellites with respect to their position. Sufficient flanking sequence was available to enable the design of primers to amplify 2,311 of these microsatellites, and a set of 300 was tested against a DNA panel derived from 28 C. cardunculus genotypes. Consistent amplification and polymorphism was obtained from 236 of these assays. Their polymorphic information content (PIC) ranged from 0.04 to 0.90 (mean 0.66). Between 176 and 198 of the assays were informative in at least one of the three available mapping populations. Conclusion EST-based microsatellites have provided a large set of de novo genetic markers, which show significant amounts of polymorphism both between and within the three taxa of C. cardunculus. They are thus well suited as assays for phylogenetic analysis, the construction of genetic maps, marker-assisted breeding, transcript mapping and other genomic applications in the species. PMID:19785740
Luo, C; Zhang, Q L; Luo, Z R
2014-04-16
Oriental persimmon (Diospyros kaki Thunb.) (2n = 6x = 90) is a major commercial and deciduous fruit tree that is believed to have originated in China. However, rare transcriptomic and genomic information on persimmon is available. Using Roche 454 sequencing technology, the transcriptome from RNA of the flowers of D. kaki was analyzed. A total of 1,250,893 reads were generated and 83,898 unigenes were assembled. A total of 42,711 SSR loci were identified from 23,494 unigenes and 289 polymerase chain reaction primer pairs were designed. Of these 289 primers, 155 (53.6%) showed robust PCR amplification and 98 revealed polymorphism between 15 persimmon genotypes, indicating a polymorphic rate of 63.23% of the productive primers for characterization and genotyping of the genus Diospyros. Transcriptome sequence data generated from next-generation sequencing technology to identify microsatellite loci appears to be rapid and cost-efficient, particularly for species with no genomic sequence information available.
NASA Astrophysics Data System (ADS)
Jiang, Qun; Li, Qi; Yu, Hong; Kong, Lingfeng
2011-06-01
The sea cucumber Apostichopus japonicus is a commercially and ecologically important species in China. A total of 3056 potential unigenes were generated after assembling 7597 A. japonicus expressed sequence tags (ESTs) downloaded from Gen-Bank. Two hundred and fifty microsatellite-containing ESTs (8.18%) and 299 simple sequence repeats (SSRs) were detected. The average density of SSRs was 1 per 7.403 kb of EST after redundancy elimination. Di-nucleotide repeat motifs appeared to be the most abundant type with a percentage of 69.90%. Of the 126 primer pairs designed, 90 amplified the expected products and 43 showed polymorphism in 30 individuals tested. The number of alleles per locus ranged from 2 to 26 with an average of 7.0 alleles, and the observed and expected heterozygosities varied from 0.067 to 1.000 and from 0.066 to 0.959, respectively. These new EST-derived microsatellite markers would provide sufficient polymorphism for population genetic studies and genome mapping of this sea cucumber species.
Fatty Acid Profile and Unigene-Derived Simple Sequence Repeat Markers in Tung Tree (Vernicia fordii)
Zhang, Lin; Jia, Baoguang; Tan, Xiaofeng; Thammina, Chandra S.; Long, Hongxu; Liu, Min; Wen, Shanna; Song, Xianliang; Cao, Heping
2014-01-01
Tung tree (Vernicia fordii) provides the sole source of tung oil widely used in industry. Lack of fatty acid composition and molecular markers hinders biochemical, genetic and breeding research. The objectives of this study were to determine fatty acid profiles and develop unigene-derived simple sequence repeat (SSR) markers in tung tree. Fatty acid profiles of 41 accessions showed that the ratio of α-eleostearic acid was increasing continuously with a parallel trend to the amount of tung oil accumulation while the ratios of other fatty acids were decreasing in different stages of the seeds and that α-eleostearic acid (18∶3) consisted of 77% of the total fatty acids in tung oil. Transcriptome sequencing identified 81,805 unigenes from tung cDNA library constructed using seed mRNA and discovered 6,366 SSRs in 5,404 unigenes. The di- and tri-nucleotide microsatellites accounted for 92% of the SSRs with AG/CT and AAG/CTT being the most abundant SSR motifs. Fifteen polymorphic genic-SSR markers were developed from 98 unigene loci tested in 41 cultivated tung accessions by agarose gel and capillary electrophoresis. Genbank database search identified 10 of them putatively coding for functional proteins. Quantitative PCR demonstrated that all 15 polymorphic SSR-associated unigenes were expressed in tung seeds and some of them were highly correlated with oil composition in the seeds. Dendrogram revealed that most of the 41 accessions were clustered according to the geographic region. These new polymorphic genic-SSR markers will facilitate future studies on genetic diversity, molecular fingerprinting, comparative genomics and genetic mapping in tung tree. The lipid profiles in the seeds of 41 tung accessions will be valuable for biochemical and breeding studies. PMID:25167054
Duan, Xinle; Wang, Kang; Su, Sha; Tian, Ruizheng; Li, Yuting; Chen, Maohua
2017-01-01
The bird cherry-oat aphid, Rhopalosiphum padi (L.), is one of the most abundant aphid pests of cereals and has a global distribution. Next-generation sequencing (NGS) is a rapid and efficient method for developing molecular markers. However, transcriptomic and genomic resources of R. padi have not been investigated. In this study, we used transcriptome information obtained by RNA-Seq to develop polymorphic microsatellites for investigating population genetics in this species. The transcriptome of R. padi was sequenced on an Illumina HiSeq 2000 platform. A total of 114.4 million raw reads with a GC content of 40.03% was generated. The raw reads were cleaned and assembled into 29,467 unigenes with an N50 length of 1,580 bp. Using several public databases, 82.47% of these unigenes were annotated. Of the annotated unigenes, 8,022 were assigned to COG pathways, 9,895 were assigned to GO pathways, and 14,586 were mapped to 257 KEGG pathways. A total of 7,936 potential microsatellites were identified in 5,564 unigenes, 60 of which were selected randomly and amplified using specific primer pairs. Fourteen loci were found to be polymorphic in the four R. padi populations. The transcriptomic data presented herein will facilitate gene discovery, gene analyses, and development of molecular markers for future studies of R. padi and other closely related aphid species.
Hu, Zhuang; Zhang, Tian; Gao, Xiao-Xiao; Wang, Yang; Zhang, Qiang; Zhou, Hui-Juan; Zhao, Gui-Fang; Wang, Ma-Li; Woeste, Keith E; Zhao, Peng
2016-04-01
Manchurian walnut (Juglans mandshurica Maxim.) is a vulnerable, temperate deciduous tree valued for its wood and nut, but transcriptomic and genomic data for the species are very limited. Next generation sequencing (NGS) has made it possible to develop molecular markers for this species rapidly and efficiently. Our goal is to use transcriptome information from RNA-Seq to understand development in J. mandshurica and develop polymorphic simple sequence repeats (SSRs, microsatellites) to understand the species' population genetics. In this study, more than 47.7 million clean reads were generated using Illumina sequencing technology. De novo assembly yielded 99,869 unigenes with an average length of 747 bp. Based on sequence similarity search with known proteins, a total of 39,708 (42.32 %) genes were identified. Searching against the Kyoto Encyclopedia of Genes and Genomes Pathway database (KEGG) identified 15,903 (16.9 %) unigenes. Further, we identified and characterized 63 new transcriptome-derived microsatellite markers. By testing the markers on 4 to 14 individuals from four populations, we found that 20 were polymorphic and easily amplified. The number of alleles per locus ranged from 2 to 8. The observed and expected heterozygosity per locus ranged from 0.209 to 0.813 and 0.335 to 0.842, respectively. These twenty microsatellite markers will be useful for studies of population genetics, diversity, and genetic structure, and they will undoubtedly benefit future breeding studies of this walnut species. Moreover, the information uncovered in this research will also serve as a useful genetic resource for understanding the transcriptome and development of J. mandshurica and other Juglans species.
Microsatellite DNA in genomic survey sequences and UniGenes of loblolly pine
Craig S Echt; Surya Saha; Dennis L Deemer; C Dana Nelson
2011-01-01
Genomic DNA sequence databases are a potential and growing resource for simple sequence repeat (SSR) marker development in loblolly pine (Pinus taeda L.). Loblolly pine also has many expressed sequence tags (ESTs) available for microsatellite (SSR) marker development. We compared loblolly pine SSR densities in genome survey sequences (GSSs) to those in non-redundant...
Liu, Le; Zhang, Shijie; Lian, Chunlan
2015-01-01
Japanese red pine (Pinus densiflora) is extensively cultivated in Japan, Korea, China, and Russia and is harvested for timber, pulpwood, garden, and paper markets. However, genetic information and molecular markers were very scarce for this species. In this study, over 51 million sequencing clean reads from P. densiflora mRNA were produced using Illumina paired-end sequencing technology. It yielded 83,913 unigenes with a mean length of 751 bp, of which 54,530 (64.98%) unigenes showed similarity to sequences in the NCBI database. Among which the best matches in the NCBI Nr database were Picea sitchensis (41.60%), Amborella trichopoda (9.83%), and Pinus taeda (4.15%). A total of 1953 putative microsatellites were identified in 1784 unigenes using MISA (MicroSAtellite) software, of which the tri-nucleotide repeats were most abundant (50.18%) and 629 EST-SSR (expressed sequence tag- simple sequence repeats) primer pairs were successfully designed. Among 20 EST-SSR primer pairs randomly chosen, 17 markers yielded amplification products of the expected size in P. densiflora. Our results will provide a valuable resource for gene-function analysis, germplasm identification, molecular marker-assisted breeding and resistance-related gene(s) mapping for pine for P. densiflora. PMID:26690126
Li, Yunfeng; Zhou, Zunchun; Tian, Meilin; Tian, Yi; Dong, Ying; Li, Shilei; Liu, Weidong; He, Chongbo
2017-08-01
In this study, single nucleotide polymorphism (SNP), microsatellite (SSR) and differentially expressed genes (DEGs) in the oral parts, gonads, and umbrella parts of the jellyfish Rhopilema esculentum were analyzed by RNA-Seq technology. A total of 76.4 million raw reads and 72.1 million clean reads were generated from deep sequencing. Approximately 119,874 tentative unigenes and 149,239 transcripts were obtained. A total of 1,034,708 SNP markers were detected in the three tissues. For microsatellite mining, 5088 SSRs were identified from the unigene sequences. The most frequent repeat motifs were mononucleotide repeats, which accounted for 61.93%. Transcriptome comparison of the three tissues yielded a total of 8841 DEGs, of which 3560 were up-regulated and 5281 were down-regulated. This study represents the greatest sequencing effort carried out for a jellyfish and provides the first high-throughput transcriptomic resource for jellyfish. Copyright © 2017 Elsevier B.V. All rights reserved.
Singh, R K; Singh, R B; Singh, S P; Sharma, M L
2012-04-01
Sugarcane is an important international commodity as a valuable agricultural crop especially in tropical and subtropical countries. Two bulked DNA used to screen polymorphic primers from commercial hybrids (varieties) with moderately resistant and highly susceptible to red rot disease. Among 145 simple sequence repeat and unigene primers screened, 37 (25%) were found to be highly robust and polymorphic with Polymorphism Information Content values ranging from 0.50 to 1.00 with the mean value of 0.82. Among these microsatellites, twenty one were used in the study of genetic relationships and marker identification in sugarcane varieties for red rot resistance. A total of 105 polymorphic DNA bands were identified, with their fragment size ranging from 54 to 1,280 bp. Jaccard's similarity coefficient value recorded between closely related hybrids was 0.986 while lowest coefficient value of 0.341 was detected with distantly related hybrids. The average similarity coefficient among these hybrids was 0.663. Cluster analysis resulted in a dendrogram with two major clusters separating the moderately resistant varieties from highly susceptible varieties. Three group specific fragments amplified by unigene Saccharum microsatellite primers viz; two markers UGSM316(850) and UGSM316(60) were closely associated with moderately resistant varieties by appearing bands in this region but the bands were absent in highly susceptible varieties. Similarly UGSM316(400) marker was tightly linked with highly susceptible varieties by amplifying uniformly in sugarcane varieties showing highly susceptible reaction to red rot but it was absent in moderately resistant varietal groups. Validation of red rot resistance/susceptibility associated markers on a group of different mapping populations for red rot resistant/susceptible traits is in progress.
Li, Yong; Zhang, Weirui
2015-10-01
Microsatellite markers of Jasminum sambac (Oleaceae) were isolated to investigate wild germplasm resources and provide markers for breeding. Illumina sequencing was used to isolate microsatellite markers from the transcriptome of J. sambac. A total of 1322 microsatellites were identified from 49,772 assembled unigenes. One hundred primer pairs were randomly selected to verify primer amplification efficiency. Out of these tested primer pairs, 31 were successfully amplified: 18 primer pairs yielded a single allele, seven exhibited fixed heterozygosity with two alleles, and only six displayed polymorphisms. This study obtained the first set of microsatellite markers for J. sambac, which will be helpful for the assessment of wild germplasm resources and the development of molecular marker-assisted breeding.
Rawal, Hukam C.; Kumar, Shrawan; Mithra S.V., Amitha; Solanke, Amolkumar U.; Saxena, Swati; Tyagi, Anshika; V., Sureshkumar; Yadav, Neelam R.; Kalia, Pritam; Singh, Narendra Pratap; Singh, Nagendra Kumar; Sharma, Tilak Raj; Gaikwad, Kishor
2017-01-01
Clusterbean (Cyamopsis tetragonoloba L. Taub), is an important industrial, vegetable and forage crop. This crop owes its commercial importance to the presence of guar gum (galactomannans) in its endosperm which is used as a lubricant in a range of industries. Despite its relevance to agriculture and industry, genomic resources available in this crop are limited. Therefore, the present study was undertaken to generate RNA-Seq based transcriptome from leaf, shoot, and flower tissues. A total of 145 million high quality Illumina reads were assembled using Trinity into 127,706 transcripts and 48,007 non-redundant high quality (HQ) unigenes. We annotated 79% unigenes against Plant Genes from the National Center for Biotechnology Information (NCBI), Swiss-Prot, Pfam, gene ontology (GO) and KEGG databases. Among the annotated unigenes, 30,020 were assigned with 116,964 GO terms, 9984 with EC and 6111 with 137 KEGG pathways. At different fragments per kilobase of transcript per millions fragments sequenced (FPKM) levels, genes were found expressed higher in flower tissue followed by shoot and leaf. Additionally, we identified 8687 potential simple sequence repeats (SSRs) with an average frequency of one SSR per 8.75 kb. A total of 28 amplified SSRs in 21 clusterbean genotypes resulted in polymorphism in 13 markers with average polymorphic information content (PIC) of 0.21. We also constructed a database named ‘ClustergeneDB’ for easy retrieval of unigenes and the microsatellite markers. The tissue specific genes identified and the molecular marker resources developed in this study is expected to aid in genetic improvement of clusterbean for its end use. PMID:29120386
Li, Yong; Zhang, Weirui
2015-01-01
Premise of the study: Microsatellite markers of Jasminum sambac (Oleaceae) were isolated to investigate wild germplasm resources and provide markers for breeding. Methods and Results: Illumina sequencing was used to isolate microsatellite markers from the transcriptome of J. sambac. A total of 1322 microsatellites were identified from 49,772 assembled unigenes. One hundred primer pairs were randomly selected to verify primer amplification efficiency. Out of these tested primer pairs, 31 were successfully amplified: 18 primer pairs yielded a single allele, seven exhibited fixed heterozygosity with two alleles, and only six displayed polymorphisms. Conclusions: This study obtained the first set of microsatellite markers for J. sambac, which will be helpful for the assessment of wild germplasm resources and the development of molecular marker–assisted breeding. PMID:26504683
Guo, Rui; Landis, Jacob B.; Moore, Michael J.; Meng, Aiping; Jian, Shuguang; Yao, Xiaohong; Wang, Hengchang
2017-01-01
Actinidia eriantha Benth. is a diploid perennial woody vine native to China and is recognized as a valuable species for commercial kiwifruit improvement with high levels of ascorbic acid as well as having been used in traditional Chinese medicine. Due to the lack of genomic resources for the species, microsatellite markers for population genetics studies are scarce. In this study, RNASeq was conducted on fruit tissue of A. eriantha, yielding 5,678,129 reads with a total output of 3.41 Gb. De novo assembly yielded 69,783 non-redundant unigenes (41.3 Mb), of which 21,730 were annotated using protein databases. A total of 8,658 EST-SSR loci were identified in 7,495 unigene sequences, for which primer pairs were successfully designed for 3,842 loci (44.4%). Among these, 183 primer pairs were assayed for PCR amplification, yielding 69 with detectable polymorphism in A. eriantha. Additionally, 61 of the 69 polymorphic loci could be successfully amplified in at least one other Actinidia species. Of these, 14 polymorphic loci (mean NA = 6.07 ± 2.30) were randomly selected for assessing levels of genetic diversity and population structure within A. eriantha. Finally, a neighbor-joining tree and Bayesian clustering analysis showed distinct clustering into two groups (K = 2), agreeing with the geographical distributions of these populations. Overall, our results will facilitate further studies of genetic diversity within A. eriantha and will aid in discriminating outlier loci involved in local adaptation. PMID:28890721
NASA Astrophysics Data System (ADS)
Feng, Yanwei; Liu, Wenfen; Xu, Xin; Yang, Jianmin; Wang, Weijun; Wei, Xiumei; Liu, Xiangquan; Sun, Guohua
2017-10-01
Amphioctopus fangsiao is one of the most economically important species and has been considered to be a candidate for aquaculture. In order to facilitate its fine-scale genetic analyses, we constructed a normalized full-length library successfully and developed a set of microsatellite markers in this study. The normalized full-length library had a storage capacity of 6.9×105 independent clones. The recombination efficiency was 95% and the average size of inserted fragments was longer than 1000 bp. A total of 3440 high quality ESTs were obtained, which were assembled into 1803 unigenes. Of these unigenes, 450 (25%) were assigned into 33 Gene Ontology terms, 576 (31.9%) into 153 Kyoto Encyclopedia of Genes and Genomes pathways, and 275 (15.3%) into 22 Clusters of Orthologous Groups. Seventy-six polymorphic microsatellite markers were identified. The number of alleles per locus ranged from 4 to 17, and the observed and expected heterozygosities varied between 0.167 and 0.967 and between 0.326 and 0.944, respectively. Twelve loci were significantly deviated from Hardy-Weinberg equilibrium after Bonferroni correction and no linkage disequilibrium was found between different loci. This study provided not only a useful resource for the isolation of the functional genes, but also a set of informative microsatellites for the assessment of population structure and conservation genetics of A. fangsiao.
Ghangal, Rajesh; Raghuvanshi, Saurabh; Sharma, Prakash C
2012-02-01
A cDNA library was constructed from the mature leaves of seabuckthorn (Hippophae rhamnoides). Expressed Sequence Tags (ESTs) were generated by single pass sequencing of 4500 cDNA clones. We submitted 3412 ESTs to dbEST of NCBI. Clustering of these ESTs yielded 1665 unigenes comprising of 345 contigs and 1320 singletons. Out of 1665 unigenes, 1278 unigenes were annotated by similarity search while the remaining 387 unannotated unigenes were considered as organism specific. Gene Ontology (GO) analysis of the unigene dataset showed 691 unigenes related to biological processes, 727 to molecular functions and 588 to cellular component category. On the basis of similarity search and GO annotation, 43 unigenes were found responsive to biotic and abiotic stresses. To validate this observation, 13 genes that are known to be associated with cold stress tolerance from previous studies in Arabidopsis and 3 novel transcripts were examined by Real time RT-PCR to understand the change in expression pattern under cold/freeze stress. In silico study of occurrence of microsatellites in these ESTs revealed the presence of 62 Simple Sequence Repeats (SSRs), some of which are being explored to assess genetic diversity among seabuckthorn collections. This is the first report of generation of transcriptome data providing information about genes involved in managing plant abiotic stress in seabuckthorn, a plant known for its enormous medicinal and ecological value. Copyright © 2011 Elsevier Masson SAS. All rights reserved.
Patterns of microsatellite evolution inferred from the Helianthus annuus (Asteraceae) transcriptome.
Pramod, Sreepriya; Perkins, Andy D; Welch, Mark E
2014-08-01
The distribution of microsatellites in exons, and their association with gene ontology (GO) terms is explored to elucidate patterns of microsatellite evolution in the common sunflower, Helianthus annuus. The relative position, motif, size and level of impurity were estimated for each microsatellite in the unigene database available from the Compositae Genome Project (CGP), and statistical analyses were performed to determine if differences in microsatellite distributions and enrichment within certain GO terms were significant. There are more translated than untranslated microsatellites, implying that many bring about structural changes in proteins. However, the greatest density is observed within the UTRs, particularly 5'UTRs. Further, UTR microsatellites are purer and longer than coding region microsatellites. This suggests that UTR microsatellites are either younger and under more relaxed constraints, or that purifying selection limits impurities, and directional selection favours their expansion. GOs associated with response to various environmental stimuli including water deprivation and salt stress were significantly enriched with microsatellites. This may suggest that these GOs are more labile in plant genomes, or that selection has favoured the maintenance of microsatellites in these genes over others. This study shows that the distribution of transcribed microsatellites in H. annuus is nonrandom, the coding region microsatellites are under greater constraint compared to the UTR microsatellites, and that these sequences are enriched within genes that regulate plant responses to environmental stress and stimuli.
Park, So Young; Patnaik, Bharat Bhusan; Kang, Se Won; Hwang, Hee-Ju; Chung, Jong Min; Song, Dae Kwon; Sang, Min Kyu; Patnaik, Hongray Howrelia; Lee, Jae Bong; Noh, Mi Young; Kim, Changmu; Kim, Soonok; Park, Hong Seog; Lee, Jun Sang; Han, Yeon Soo; Lee, Yong Seok
2016-01-01
An aquatic gastropod belonging to the family Neritidae, Clithon retropictus is listed as an endangered class II species in South Korea. The lack of information on its genomic background limits the ability to obtain functional data resources and inhibits informed conservation planning for this species. In the present study, the transcriptomic sequencing and de novo assembly of C. retropictus generated a total of 241,696,750 high-quality reads. These assembled to 282,838 unigenes with mean and N50 lengths of 736.9 and 1201 base pairs, respectively. Of these, 125,616 unigenes were subjected to annotation analysis with known proteins in Protostome DB, COG, GO, and KEGG protein databases (BLASTX; E ≤ 0.00001) and with known nucleotides in the Unigene database (BLASTN; E ≤ 0.00001). The GO analysis indicated that cellular process, cell, and catalytic activity are the predominant GO terms in the biological process, cellular component, and molecular function categories, respectively. In addition, 2093 unigenes were distributed in 107 different KEGG pathways. Furthermore, 49,280 simple sequence repeats were identified in the unigenes (>1 kilobase sequences). This is the first report on the identification of transcriptomic and microsatellite resources for C. retropictus, which opens up the possibility of exploring traits related to the adaptation and acclimatization of this species. PMID:27455329
Kang, Se Won; Patnaik, Bharat Bhusan; Hwang, Hee-Ju; Park, So Young; Chung, Jong Min; Song, Dae Kwon; Patnaik, Hongray Howrelia; Lee, Jae Bong; Kim, Changmu; Kim, Soonok; Park, Hong Seog; Park, Seung-Hwan; Park, Young-Su; Han, Yeon Soo; Lee, Jun Sang; Lee, Yong Seok
2017-03-01
Satsuma myomphala is critically endangered through loss of natural habitats, predation by natural enemies, and indiscriminate collection. It is a protected species in Korea but lacks genomic resources for an understanding of varied functional processes attributable to evolutionary success under natural habitats. For assessing the genetic information of S. myomphala, we performed for the first time, de novo transcriptome sequencing and functional annotation of expressed sequences using Illumina Next-Generation Sequencing (NGS) platform and bioinformatics analysis. We identified 103,774 unigenes of which 37,959, 12,890, and 17,699 were annotated in the PANM (Protostome DB), Unigene, and COG (Clusters of Orthologous Groups) databases, respectively. In addition, 14,451 unigenes were predicted under Gene Ontology functional categories, with 4581 assigned to a single category. Furthermore, 3369 sequences with 646 having Enzyme Commission (EC) numbers were mapped to 122 pathways in the Kyoto Encyclopedia of Genes and Genomes Pathway database. The prominent protein domains included the Zinc finger (C2H2-like), Reverse Transcriptase, Thioredoxin-like fold, and RNA recognition motif domain. Many unigenes with homology to immunity, defense, and reproduction-related genes were screened in the transcriptome. We also detected 3120 putative simple sequence repeats (SSRs) encompassing dinucleotide to hexanucleotide repeat motifs from >1kb unigene sequences. A list of PCR primers of SSR loci have been identified to study the genetic polymorphisms. The transcriptome data represents a valuable resource for further investigations on the species genome structure and biology. The unigenes information and microsatellites would provide an indispensable tool for conservation of the species in natural and adaptive environments. Copyright © 2016 The Authors. Published by Elsevier Inc. All rights reserved.
Jin, Yuqing; Bi, Quanxin; Guan, Wenbin; Mao, Jian-Feng
2015-09-01
Metasequoia glyptostroboides is an endangered relict conifer species endemic to China. In this study, expressed sequence tag-simple sequence repeat (EST-SSR) markers were developed using transcriptome mining for future genetic and functional studies. We collected 97,565 unigene sequences generated by 454 pyrosequencing. A bioinformatics analysis identified 2087 unique and putative microsatellites, from which 96 novel microsatellite markers were developed. Fifty-three of the 96 primer sets successfully amplified clear fragments of the expected sizes; 23 of those loci were polymorphic. The number of alleles per locus ranged from two to eight, with an average of three, and the observed and expected heterozygosity values ranged from 0 to 1.0 and 0.117 to 0.813, respectively. These microsatellite loci will enrich the genetic resources to develop functional studies and conservation strategies for this endangered relict species.
Jin, Yuqing; Bi, Quanxin; Guan, Wenbin; Mao, Jian-Feng
2015-01-01
Premise of the study: Metasequoia glyptostroboides is an endangered relict conifer species endemic to China. In this study, expressed sequence tag–simple sequence repeat (EST-SSR) markers were developed using transcriptome mining for future genetic and functional studies. Methods and Results: We collected 97,565 unigene sequences generated by 454 pyrosequencing. A bioinformatics analysis identified 2087 unique and putative microsatellites, from which 96 novel microsatellite markers were developed. Fifty-three of the 96 primer sets successfully amplified clear fragments of the expected sizes; 23 of those loci were polymorphic. The number of alleles per locus ranged from two to eight, with an average of three, and the observed and expected heterozygosity values ranged from 0 to 1.0 and 0.117 to 0.813, respectively. Conclusions: These microsatellite loci will enrich the genetic resources to develop functional studies and conservation strategies for this endangered relict species. PMID:26421250
[SSR loci information analysis in transcriptome of Andrographis paniculata].
Li, Jun-Ren; Chen, Xiu-Zhen; Tang, Xiao-Ting; He, Rui; Zhan, Ruo-Ting
2018-06-01
To study the SSR loci information and develop molecular markers, a total of 43 683 Unigenes in transcriptome of Andrographis paniculata were used to explore SSR. The distribution frequency of SSR and the basic characteristics of repeat motifs were analyzed using MicroSAtellite software, SSR primers were designed by Primer 3.0 software and then validated by PCR. Moreover, the gene function analysis of SSR Unigene was obtained by Blast. The results showed that 14 135 SSR loci were found in the transcriptome of A. paniculata, which distributed in 9 973 Unigenes with a distribution frequency of 32.36%. Di-nucleotide and Tri-nucleotide repeat were the main types, accounted for 75.54% of all SSRs. The repeat motifs of AT/AT and CCG/CGG were the predominant repeat types of Di-nucleotide and Tri-nucleotide, respectively. A total of 4 740 pairs of SSR primers with the potential to produce polymorphism were designed for maker development. Ten pairs of primers in 20 pairs of randomly picked primers produced fragments with expected molecular size. The gene function of Unigenes containing SSR were mostly related to the basic metabolism function of A. paniculata. The SSR markers in transcriptome of A. paniculata show rich type, strong specificity and high potential of polymorphism, which will benefit the candidate gene mining and marker-assisted breeding. Copyright© by the Chinese Pharmaceutical Association.
Li, Jitao; Li, Jian; Chen, Ping; Liu, Ping; He, Yuying
2015-01-01
The ridgetail white prawn Exopalaemon carinicauda is one of major economic mariculture species in eastern China. The deficiency of genomic and transcriptomic data is becoming the bottleneck of further researches on its good traits. In the present study, 454 pyrosequencing was undertaken to investigate the transcriptome profiles of E. carinicauda. A collection of 1,028,710 sequence reads (459.59 Mb) obtained from cDNA prepared from eyestalk and hemocytes was assembled into 162,056 expressed sequence tags (ESTs). Of these, 29.88 % of 48,428 contigs and 70.12 % of 113,628 singlets possessed high similarities to sequences in the GenBank non-redundant database, with most significant (E value <1e(-10)) unigenes matches occurring with crustacean and insect sequences. KEGG analysis of unigenes identified putative members of biological pathways related to growth and immunity. In addition, we obtained a total of putative 125,112 SNPs and 13,467 microsatellites. These results will contribute to the understanding of the genome makeup and provide useful information for future functional genomic research in E. carinicauda.
Li, Quanquan; Niu, Zubiao; Bao, Yinguang; Tian, Qiuju; Wang, Honggang; Kong, Lingrang; Feng, Deshun
2016-09-15
Wheat powdery mildew, which is mainly caused by Blumeria graminis f. sp. tritici (Bgt), seriously damages wheat production. The wheat-Thinopyrum intermedium alien addition disomic line germplasm SN6306, being one of the important sources of genes for wheat resistance, is highly resistant to Bgt E09 and to many other powdery mildew physiological races. However, knowledge on the resistance mechanism of SN6306 remains limited. Our study employed high-throughput RNA sequencing based on next-generation sequencing technology (Illumina) to obtain an overview of the transcriptome characteristics of SN6306 and its parent wheat Yannong 15 (YN15) during Bgt infection. The sequencing generated 104,773 unigenes, 9909 of which showed varied expression levels. Among the 9909 unigenes, 1678 unigenes showed 0 reads in YN15. The expression levels in Bgt-inoculated SN6306 and YN15 of exactly 39 unigenes that showed 0 or considerably low reads in YN15 were validated to identify the genes involved in Bgt resistance. Among the 39 unigenes, 12 unigenes were upregulated in SN6306 by 3-45 times. These unigenes mainly encoded kinase, synthase, proteases, and signal transduction proteins, which may play an important role in the resistance against Bgt. To confirm whether the unigenes that showed 0 reads in YN15 are really unique to SN6306, 8 unigenes were cloned and sequenced. Results showed that the selected unigenes are more similar to SN6306 and Th. intermedium than to the wheat cultivar YN15. The sequencing results further confirmed that the unigenes showing 0 reads in YN15 are unique to SN6306 and are most likely derived from Th. intermedium (Host) Nevski. Thus, the genes from Th. intermedium most probably conferred the resistance of SN6306 to Bgt. Copyright © 2016 Elsevier B.V. All rights reserved.
Zhang, Shu; Sui, Zhenghong; Chang, Lianpeng; Kang, Kyoungho; Ma, Jinhua; Kong, Fanna; Zhou, Wei; Wang, Jinguo; Guo, Liliang; Geng, Huili; Zhong, Jie; Ma, Qingxia
2014-03-10
In this article, high-throughput de novo transcriptomic sequencing was performed in Alexandrium catenella, which provided the first view of the gene repertoire in this dinoflagellate based on next-generation sequencing (NGS) technologies. A total of 118,304 unigenes were identified with an average length of 673bp (base pair). Of these unigenes, 77,936 (65.9%) were annotated with known proteins based on sequence similarities, among which 24,149 and 22,956 unigenes were assigned to gene ontology categories (GO) and clusters of orthologous groups (COGs), respectively. Furthermore, 16,467 unigenes were mapped onto 322 pathways using the Kyoto Encyclopedia of Genes and Genomes Pathway database (KEGG). We also detected 1143 simple sequence repeats (SSRs), in which the tri-nucleotide repeat motif (69.3%) was the most abundant. The genetic facts and significance derived from the transcriptome dataset were suggested and discussed. All four core nucleosomal histones and linker histones were detected, in addition to the unigenes involved in histone modifications.190 unigenes were identified as being involved in the endocytosis pathway, and clathrin-dependent endocytosis was suggested to play a role in the heterotrophy of A. catenella. A conserved 22-nt spliced leader (SL) was identified in 21 unigenes which suggested the existence of trans-splicing processing of mRNA in A. catenella. Crown Copyright © 2013. Published by Elsevier B.V. All rights reserved.
Analyses of carnivore microsatellites and their intimate association with tRNA-derived SINEs.
López-Giráldez, Francesc; Andrés, Olga; Domingo-Roura, Xavier; Bosch, Montserrat
2006-10-23
The popularity of microsatellites has greatly increased in the last decade on account of their many applications. However, little is currently understood about the factors that influence their genesis and distribution among and within species genomes. In this work, we analyzed carnivore microsatellite clones from GenBank to study their association with interspersed repeats and elucidate the role of the latter in microsatellite genesis and distribution. We constructed a comprehensive carnivore microsatellite database comprising 1236 clones from GenBank. Thirty-three species of 11 out of 12 carnivore families were represented, although two distantly related species, the domestic dog and cat, were clearly overrepresented. Of these clones, 330 contained tRNALys-derived SINEs and 357 contained other interspersed repeats. Our rough estimates of tRNA SINE copies per haploid genome were much higher than published ones. Our results also revealed a distinct juxtaposition of AG and A-rich repeats and tRNALys-derived SINEs suggesting their coevolution. Both microsatellites arose repeatedly in two regions of the interspersed repeat. Moreover, microsatellites associated with tRNALys-derived SINEs showed the highest complexity and less potential instability. Our results suggest that tRNALys-derived SINEs are a significant source for microsatellite generation in carnivores, especially for AG and A-rich repeat motifs. These observations indicate two modes of microsatellite generation: the expansion and variation of pre-existing tandem repeats and the conversion of sequences with high cryptic simplicity into a repeat array; mechanisms which are not specific to tRNALys-derived SINEs. Microsatellite and interspersed repeat coevolution could also explain different distribution of repeat types among and within species genomes.Finally, due to their higher complexity and lower potential informative content of microsatellites associated with tRNALys-derived SINEs, we recommend avoiding their use as genetic markers.
Zhao, Hansheng; Sun, Huayu; Li, Lichao; Lou, Yongfeng; Li, Rongsheng; Qi, Lianghua; Gao, Zhimin
2017-01-01
Rattan is an important group of regenerating non-wood climbing palm in tropical forests. The cirrus is an essential climbing organ and provides morphological evidence for evolutionary and taxonomic studies. However, limited data are available on the molecular mechanisms underlying the development of the cirrus. Thus, we performed in-depth transcriptomic sequencing analyses to characterize the cirrus development at different developmental stages of Daemonorops jenkinsiana. The result showed 404,875 transcripts were assembled, including 61,569 high-quality unigenes were identified, of which approximately 76.16% were annotated and classified by seven authorized databases. Moreover, a comprehensive analysis of the gene expression profiles identified differentially expressed genes (DEGs) concentrated in developmental pathways, cell wall metabolism, and hook formation between the different stages of the cirri. Among them, 37 DEGs were validated by qRT-PCR. Furthermore, 14,693 transcriptome-based microsatellites were identified. Of the 168 designed SSR primer pairs, 153 were validated and 16 pairs were utilized for the polymorphic analysis of 25 rattan accessions. These findings can be used to interpret the molecular mechanisms of cirrus development, and the developed microsatellites markers provide valuable data for assisting rattan taxonomy and expanding the understanding of genomic study in rattan. PMID:28383053
A genome-wide 20 K citrus microarray for gene expression analysis
Martinez-Godoy, M Angeles; Mauri, Nuria; Juarez, Jose; Marques, M Carmen; Santiago, Julia; Forment, Javier; Gadea, Jose
2008-01-01
Background Understanding of genetic elements that contribute to key aspects of citrus biology will impact future improvements in this economically important crop. Global gene expression analysis demands microarray platforms with a high genome coverage. In the last years, genome-wide EST collections have been generated in citrus, opening the possibility to create new tools for functional genomics in this crop plant. Results We have designed and constructed a publicly available genome-wide cDNA microarray that include 21,081 putative unigenes of citrus. As a functional companion to the microarray, a web-browsable database [1] was created and populated with information about the unigenes represented in the microarray, including cDNA libraries, isolated clones, raw and processed nucleotide and protein sequences, and results of all the structural and functional annotation of the unigenes, like general description, BLAST hits, putative Arabidopsis orthologs, microsatellites, putative SNPs, GO classification and PFAM domains. We have performed a Gene Ontology comparison with the full set of Arabidopsis proteins to estimate the genome coverage of the microarray. We have also performed microarray hybridizations to check its usability. Conclusion This new cDNA microarray replaces the first 7K microarray generated two years ago and allows gene expression analysis at a more global scale. We have followed a rational design to minimize cross-hybridization while maintaining its utility for different citrus species. Furthermore, we also provide access to a website with full structural and functional annotation of the unigenes represented in the microarray, along with the ability to use this site to directly perform gene expression analysis using standard tools at different publicly available servers. Furthermore, we show how this microarray offers a good representation of the citrus genome and present the usefulness of this genomic tool for global studies in citrus by using it to catalogue genes expressed in citrus globular embryos. PMID:18598343
Vigna, Bianca Baccili Zanotto; de Oliveira, Fernanda Ancelmo; de Toledo-Silva, Guilherme; da Silva, Carla Cristina; do Valle, Cacilda Borges; de Souza, Anete Pereira
2016-11-11
Urochloa humidicola (Koronivia grass) is a polyploid (6x to 9x) species that is used as forage in the tropics. Facultative apospory apomixis is present in most of the genotypes of this species, although one individual has been described as sexual. Molecular studies have been restricted to molecular marker approaches for genetic diversity estimations and linkage map construction. The objectives of the present study were to describe and compare the leaf transcriptome of two important genotypes that are highly divergent in terms of their phenotypes and reproduction modes: the sexual BH031 and the aposporous apomictic cultivar BRS Tupi. We sequenced the leaf transcriptome of Koronivia grass using an Illumina GAIIx system, which produced 13.09 Gb of data that consisted of 163,575,526 paired-end reads between the two libraries. We de novo-assembled 76,196 transcripts with an average length of 1,152 bp and filtered 35,093 non-redundant unigenes. A similarity search against the non-redundant National Center of Biotechnology Information (NCBI) protein database returned 65 % hits. We annotated 24,133 unigenes in the Phytozome database and 14,082 unigenes in the UniProtKB/Swiss-Prot database, assigned 108,334 gene ontology terms to 17,255 unigenes and identified 5,324 unigenes in 327 known metabolic pathways. Comparisons with other grasses via a reciprocal BLAST search revealed a larger number of orthologous genes for the Panicum species. The unigenes were involved in C4 photosynthesis, lignocellulose biosynthesis and flooding stress responses. A search for functional molecular markers revealed 4,489 microsatellites and 560,298 single nucleotide polymorphisms (SNPs). A quantitative real-time PCR analysis validated the RNA-seq expression analysis and allowed for the identification of transcriptomic differences between the two evaluated genotypes. Moreover, 192 unannotated sequences were classified as containing complete open reading frames, suggesting that the new, potentially exclusive genes should be further investigated. The present study represents the first whole-transcriptome sequencing of U. humidicola leaves, providing an important public information source of transcripts and functional molecular markers. The qPCR analysis indicated that the expression of certain transcripts confirmed the differential expression observed in silico, which demonstrated that RNA-seq is useful for identifying differentially expressed and unique genes. These results corroborate the findings from previous studies and suggest a hybrid origin for BH031.
High-Throughput Sequencing and De Novo Assembly of the Isatis indigotica Transcriptome
Tang, Xiaoqing; Xiao, Yunhua; Lv, Tingting; Wang, Fangquan; Zhu, QianHao; Zheng, Tianqing; Yang, Jie
2014-01-01
Background Isatis indigotica, the source of the traditional Chinese medicine Radix isatidis (Ban-Lan-Gen), is an extremely important economical crop in China. To facilitate biological, biochemical and molecular research on the medicinal chemicals in I. indigotica, here we report the first I. indigotica transcriptome generated by RNA sequencing (RNA-seq). Results RNA-seq library was created using RNA extracted from a mixed sample including leaf and root. A total of 33,238 unigenes were assembled from more than 28 million of high quality short reads. The quality of the assembly was experimentally examined by cDNA sequencing of seven randomly selected unigenes. Based on blast search 28,184 unigenes had a hit in at least one of the protein and nucleotide databases used in this study, and 8 unigenes were found to be associated with biosynthesis of indole and its derivatives. According to Gene Ontology classification, 22,365 unigenes were categorized into 48 functional groups. Furthermore, Clusters of Orthologous Group and Swiss-Port annotation were assigned for 7,707 and 18,679 unigenes, respectively. Analysis of repeat motifs identified 6,400 simple sequence repeat markers in 4,509 unigenes. Conclusion Our data provide a comprehensive sequence resource for molecular study of I. indigotica. Our results will facilitate studies on the functions of genes involved in the indole alkaloid biosynthesis pathway and on metabolism of nitrogen and indole alkaloids in I. indigotica and its related species. PMID:25259890
Analyses of carnivore microsatellites and their intimate association with tRNA-derived SINEs
López-Giráldez, Francesc; Andrés, Olga; Domingo-Roura, Xavier; Bosch, Montserrat
2006-01-01
Background The popularity of microsatellites has greatly increased in the last decade on account of their many applications. However, little is currently understood about the factors that influence their genesis and distribution among and within species genomes. In this work, we analyzed carnivore microsatellite clones from GenBank to study their association with interspersed repeats and elucidate the role of the latter in microsatellite genesis and distribution. Results We constructed a comprehensive carnivore microsatellite database comprising 1236 clones from GenBank. Thirty-three species of 11 out of 12 carnivore families were represented, although two distantly related species, the domestic dog and cat, were clearly overrepresented. Of these clones, 330 contained tRNALys-derived SINEs and 357 contained other interspersed repeats. Our rough estimates of tRNA SINE copies per haploid genome were much higher than published ones. Our results also revealed a distinct juxtaposition of AG and A-rich repeats and tRNALys-derived SINEs suggesting their coevolution. Both microsatellites arose repeatedly in two regions of the insterspersed repeat. Moreover, microsatellites associated with tRNALys-derived SINEs showed the highest complexity and less potential instability. Conclusion Our results suggest that tRNALys-derived SINEs are a significant source for microsatellite generation in carnivores, especially for AG and A-rich repeat motifs. These observations indicate two modes of microsatellite generation: the expansion and variation of pre-existing tandem repeats and the conversion of sequences with high cryptic simplicity into a repeat array; mechanisms which are not specific to tRNALys-derived SINEs. Microsatellite and interspersed repeat coevolution could also explain different distribution of repeat types among and within species genomes. Finally, due to their higher complexity and lower potential informative content of microsatellites associated with tRNALys-derived SINEs, we recommend avoiding their use as genetic markers. PMID:17059596
Gupta, S K; Gopalakrishna, T
2010-07-01
Unigene sequences available in public databases provide a cost-effective and valuable source for the development of molecular markers. In this study, the identification and development of unigene-based SSR markers in cowpea (Vigna unguiculata (L.) Walp.) is presented. A total of 1071 SSRs were identified in 15 740 cowpea unigene sequences downloaded from the National Center for Biotechnology Information. The most frequent SSR motifs present in the unigenes were trinucleotides (59.7%), followed by dinucleotides (34.8%), pentanucleotides (4%), and tetranucleotides (1.5%). The copy number varied from 6 to 33 for dinucleotide, 5 to 29 for trinucleotide, 5 to 7 for tetranucleotide, and 4 to 6 for pentanucleotide repeats. Primer pairs were successfully designed for 803 SSR motifs and 102 SSR markers were finally characterized and validated. Putative function was assigned to 64.7% of the unigene SSR markers based on significant homology to reported proteins. About 31.7% of the SSRs were present in coding sequences and 68.3% in untranslated regions of the genes. About 87% of the SSRs located in the coding sequences were trinucleotide repeats. Allelic variation at 32 SSR loci produced 98 alleles in 20 cowpea genotypes. The polymorphic information content for the SSR markers varied from 0.10 to 0.83 with an average of 0.53. These unigene SSR markers showed a high rate of transferability (88%) across other Vigna species, thereby expanding their utility. Alignment of unigene sequences with soybean genomic sequences revealed the presence of introns in amplified products of some of the SSR markers. This study presents the distribution of SSRs in the expressed portion of the cowpea genome and is the first report of the development of functional unigene-based SSR markers in cowpea. These SSR markers would play an important role in molecular mapping, comparative genomics, and marker-assisted selection strategies in cowpea and other Vigna species.
Zhang, Zongying; Jiang, Shenghui; Wang, Nan; Li, Min; Ji, Xiaohao; Sun, Shasha; Liu, Jingxuan; Wang, Deyun; Xu, Haifeng; Qi, Sumin; Wu, Shujing; Fei, Zhangjun; Feng, Shouqian; Chen, Xuesen
2015-01-01
Apple is one of the most economically important horticultural fruit crops worldwide. It is critical to gain insights into fruit ripening and softening to improve apple fruit quality and extend shelf life. In this study, forward and reverse suppression subtractive hybridization libraries were generated from ‘Taishanzaoxia’ apple fruits sampled around the ethylene climacteric to isolate ripening- and softening-related genes. A set of 648 unigenes were derived from sequence alignment and cluster assembly of 918 expressed sequence tags. According to gene ontology functional classification, 390 out of 443 unigenes (88%) were assigned to the biological process category, 356 unigenes (80%) were classified in the molecular function category, and 381 unigenes (86%) were allocated to the cellular component category. A total of 26 unigenes differentially expressed during fruit development period were analyzed by quantitative RT-PCR. These genes were involved in cell wall modification, anthocyanin biosynthesis, aroma production, stress response, metabolism, transcription, or were non-annotated. Some genes associated with cell wall modification, anthocyanin biosynthesis and aroma production were up-regulated and significantly correlated with ethylene production, suggesting that fruit texture, coloration and aroma may be regulated by ethylene in ‘Taishanzaoxia’. Some of the identified unigenes associated with fruit ripening and softening have not been characterized in public databases. The results contribute to an improved characterization of changes in gene expression during apple fruit ripening and softening. PMID:26719904
Zeng, Digang; Chen, Xiuli; Xie, Daxiang; Zhao, Yongzhen; Yang, Chunling; Li, Yongmei; Ma, Ning; Peng, Min; Yang, Qiong; Liao, Zhenping; Wang, Hui; Chen, Xiaohan
2013-01-01
The Pacific white shrimp, Litopenaeus vannamei, is a worldwide cultured crustacean species with important commercial value. Over the last two decades, Taura syndrome virus (TSV) has seriously threatened the shrimp aquaculture industry in the Western Hemisphere. To better understand the interaction between shrimp immune and TSV, we performed a transcriptome analysis in the hepatopancreas of L. vannamei challenged with TSV, using the 454 pyrosequencing (Roche) technology. We obtained 126919 and 102181 high-quality reads from TSV-infected and non-infected (control) L. vannamei cDNA libraries, respectively. The overall de novo assembly of cDNA sequence data generated 15004 unigenes, with an average length of 507 bp. Based on BLASTX search (E-value <10-5) against NR, Swissprot, GO, COG and KEGG databases, 10425 unigenes (69.50% of all unigenes) were annotated with gene descriptions, gene ontology terms, or metabolic pathways. In addition, we identified 770 microsatellites and designed 497 sets of primers. Comparative genomic analysis revealed that 1311 genes differentially expressed in the infected shrimp compared to the controls, including 559 up- and 752 down- regulated genes. Among the differentially expressed genes, several are involved in various animal immune functions, such as antiviral, antimicrobial, proteases, protease inhibitors, signal transduction, transcriptional control, cell death and cell adhesion. This study provides valuable information on shrimp gene activities against TSV infection. Results can contribute to the in-depth study of candidate genes in shrimp immunity, and improves our current understanding of this host-virus interaction. In addition, the large amount of transcripts reported in this study provide a rich source for identification of novel genes in shrimp.
Sun, Xiudong; Zhou, Shumei; Meng, Fanlu; Liu, Shiqi
2012-10-01
Garlic is widely used as a spice throughout the world for the culinary value of its flavor and aroma, which are created by the chemical transformation of a series of organic sulfur compounds. To analyze the transcriptome of Allium sativum and discover the genes involved in sulfur metabolism, cDNAs derived from the total RNA of Allium sativum buds were analyzed by Illumina sequencing. Approximately 26.67 million 90 bp paired-end clean reads were achieved in two libraries. A total of 127,933 unigenes were generated by de novo assembly and were compared with the sequences in public databases. Of these, 45,286 unigenes had significant hits to the sequences in the Nr database, 29,514 showed significant similarity to known proteins in the Swiss-Prot database and, 20,706 and 21,952 unigenes had significant similarity to existing sequences in the KEGG and COG databases, respectively. Moreover, genes involved in organic sulfur biosynthesis were identified. These unigenes data will provide the foundation for research on gene expression, genomics and functional genomics in Allium sativum. Key message The obtained unigenes will provide the foundation for research on functional genomics in Allium sativum and its closely related species, and fill the gap of the existing plant EST database.
The testes transcriptome derived from the New World Screwworm, Cochliomyia hominivorax TSA
USDA-ARS?s Scientific Manuscript database
In a collaboration with National Center for Genome Resources researchers, we sequenced and assembled the testes transcriptome derived from the Pacora, Panama, production plant strain of the New World Screwworm, Cochliomyia hominivorax. This transcriptome contains 4,149 unigenes and the Transcriptome...
Ma, Yuewen; Li, Qiulei; Hu, Guibing; Qin, Yonghua
2017-04-20
Seedlessness is an excellent economical trait, and self-incompatibility (SI) is one of important factors resulting in seedless fruit in Citrus. However, SI molecular mechanism in Citrus is still unclear. In this study, RNA-Seq technology was used to identify differentially expressed genes related to SI reaction of 'Wuzishatangju' (Citrus reticulata Blanco). A total of 35.67GB raw RNA-Seq data was generated and was de novo assembled into 50,364 unigenes with an average length of 897bp and N50 value of 1549. Twenty-three candidate unigenes related to SI were analyzed using qPCR at different tissues and stages after self- and cross-pollination. Seven pollen S genes (Unigene0050323, Unigene0001060, Unigene0004230, Unigene0004222, Unigene0012037, Unigene0048889 and Unigene0004272), three pistil S genes (Unigene0019191, Unigene0040115, Unigene0036542) and three genes (Unigene0038751, Unigene0031435 and Unigene0029897) associated with the pathway of ubiquitin-mediated proteolysis were identified. Unigene0031435, Unigene0038751 and Unigene0029897 are probably involved in SI reaction of 'Wuzishatangju' based on expression analyses. The present study provides a new insight into the molecular mechanism of SI in Citrus at the transcriptional level. Copyright © 2017 Elsevier B.V. All rights reserved.
NASA Astrophysics Data System (ADS)
Zhan, Aibin; Bao, Zhenmin; Hu, Xiaoli; Lu, Wei; Hu, Jingjie
2009-06-01
Microsatellite markers have become one kind of the most important molecular tools used in various researches. A large number of microsatellite markers are required for the whole genome survey in the fields of molecular ecology, quantitative genetics and genomics. Therefore, it is extremely necessary to select several versatile, low-cost, efficient and time- and labor-saving methods to develop a large panel of microsatellite markers. In this study, we used Zhikong scallop ( Chlamys farreri) as the target species to compare the efficiency of the five methods derived from three strategies for microsatellite marker development. The results showed that the strategy of constructing small insert genomic DNA library resulted in poor efficiency, while the microsatellite-enriched strategy highly improved the isolation efficiency. Although the mining public database strategy is time- and cost-saving, it is difficult to obtain a large number of microsatellite markers, mainly due to the limited sequence data of non-model species deposited in public databases. Based on the results in this study, we recommend two methods, microsatellite-enriched library construction method and FIASCO-colony hybridization method, for large-scale microsatellite marker development. Both methods were derived from the microsatellite-enriched strategy. The experimental results obtained from Zhikong scallop also provide the reference for microsatellite marker development in other species with large genomes.
Song, Xuhao; Shen, Fujun; Huang, Jie; Huang, Yan; Du, Lianming; Wang, Chengdong; Fan, Zhenxin; Hou, Rong; Yue, Bisong; Zhang, Xiuyue
2016-09-01
Recently, an increasing number of microsatellites or simple sequence repeats (SSRs) have been found and characterized from transcriptomes. Such SSRs can be employed as putative functional markers to easily tag corresponding genes, which play an important role in biomedical studies and genetic analysis. However, the transcriptome-derived SSRs for giant panda (Ailuropoda melanoleuca) are not yet available. In this work, we identified and characterized 20 tetranucleotide microsatellite loci from a transcript database generated from the blood of giant panda. Furthermore, we assigned their predicted transcriptome locations: 16 loci were assigned to untranslated regions (UTRs) and 4 loci were assigned to coding regions (CDSs). Gene identities of 14 transcripts contained corresponding microsatellites were determined, which provide useful information to study the potential contribution of SSRs to gene regulation in giant panda. The polymorphic information content (PIC) values ranged from 0.293 to 0.789 with an average of 0.603 for the 16 UTRs-derived SSRs. Interestingly, 4 CDS-derived microsatellites developed in our study were also polymorphic, and the instability of these 4 CDS-derived SSRs was further validated by re-genotyping and sequencing. The genes containing these 4 CDS-derived SSRs were embedded with various types of repeat motifs. The interaction of all the length-changing SSRs might provide a way against coding region frameshift caused by microsatellite instability. We hope these newly gene-associated biomarkers will pave the way for genetic and biomedical studies for giant panda in the future. In sum, this set of transcriptome-derived markers complements the genetic resources available for giant panda. © The American Genetic Association. 2016. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
GDR (Genome Database for Rosaceae): integrated web-database for Rosaceae genomics and genetics data
Jung, Sook; Staton, Margaret; Lee, Taein; Blenda, Anna; Svancara, Randall; Abbott, Albert; Main, Dorrie
2008-01-01
The Genome Database for Rosaceae (GDR) is a central repository of curated and integrated genetics and genomics data of Rosaceae, an economically important family which includes apple, cherry, peach, pear, raspberry, rose and strawberry. GDR contains annotated databases of all publicly available Rosaceae ESTs, the genetically anchored peach physical map, Rosaceae genetic maps and comprehensively annotated markers and traits. The ESTs are assembled to produce unigene sets of each genus and the entire Rosaceae. Other annotations include putative function, microsatellites, open reading frames, single nucleotide polymorphisms, gene ontology terms and anchored map position where applicable. Most of the published Rosaceae genetic maps can be viewed and compared through CMap, the comparative map viewer. The peach physical map can be viewed using WebFPC/WebChrom, and also through our integrated GDR map viewer, which serves as a portal to the combined genetic, transcriptome and physical mapping information. ESTs, BACs, markers and traits can be queried by various categories and the search result sites are linked to the mapping visualization tools. GDR also provides online analysis tools such as a batch BLAST/FASTA server for the GDR datasets, a sequence assembly server and microsatellite and primer detection tools. GDR is available at http://www.rosaceae.org. PMID:17932055
A method for the further assembly of targeted unigenes in a transcriptome after assembly by Trinity
Xiao, Xinlong; Ma, Jinbiao; Sun, Yufang; Yao, Yinan
2015-01-01
RNA-sequencing has been widely used to obtain high throughput transcriptome sequences in various species, but the assembly of a full set of complete transcripts is still a significant challenge. Judging by the number of expected transcripts and assembled unigenes in a transcriptome library, we believe that some unigenes could be reassembled. In this study, using the nitrate transporter (NRT) gene family and phosphate transporter (PHT) gene family in Salicornia europaea as examples, we introduced an approach to further assemble unigenes found in transcriptome libraries which had been previously generated by Trinity. To find the unigenes of a particular transcript that contained gaps, we respectively selected 16 NRT candidate unigene pairs and 12 PHT candidate unigene pairs for which the two unigenes had the same annotations, the same expression patterns among various RNA-seq samples, and different positions of the proteins coded as mapped to a reference protein. To fill a gap between the two unigenes, PCR was performed using primers that mapped to the two unigenes and the PCR products were sequenced, which demonstrated that 5 unigene pairs of NRT and 3 unigene pairs of PHT could be reassembled when the gaps were filled using the corresponding PCR product sequences. This fast and simple method will reduce the redundancy of targeted unigenes and allow acquisition of complete coding sequences (CDS). PMID:26528307
UniGene Tabulator: a full parser for the UniGene format.
Lenzi, Luca; Frabetti, Flavia; Facchin, Federica; Casadei, Raffaella; Vitale, Lorenza; Canaider, Silvia; Carinci, Paolo; Zannotti, Maria; Strippoli, Pierluigi
2006-10-15
UniGene Tabulator 1.0 provides a solution for full parsing of UniGene flat file format; it implements a structured graphical representation of each data field present in UniGene following import into a common database managing system usable in a personal computer. This database includes related tables for sequence, protein similarity, sequence-tagged site (STS) and transcript map interval (TXMAP) data, plus a summary table where each record represents a UniGene cluster. UniGene Tabulator enables full local management of UniGene data, allowing parsing, querying, indexing, retrieving, exporting and analysis of UniGene data in a relational database form, usable on Macintosh (OS X 10.3.9 or later) and Windows (2000, with service pack 4, XP, with service pack 2 or later) operating systems-based computers. The current release, including both the FileMaker runtime applications, is freely available at http://apollo11.isto.unibo.it/software/
Li, Shi-Weng; Shi, Rui-Fang; Leng, Yan
2015-01-01
Adventitious rooting is the most important mechanism underlying vegetative propagation and an important strategy for plant propagation under environmental stress. The present study was conducted to obtain transcriptomic data and examine gene expression using RNA-Seq and bioinformatics analysis, thereby providing a foundation for understanding the molecular mechanisms controlling adventitious rooting. Three cDNA libraries constructed from mRNA samples from mung bean hypocotyls during adventitious rooting were sequenced. These three samples generated a total of 73 million, 60 million, and 59 million 100-bp reads, respectively. These reads were assembled into 78,697 unigenes with an average length of 832 bp, totaling 65 Mb. The unigenes were aligned against six public protein databases, and 29,029 unigenes (36.77%) were annotated using BLASTx. Among them, 28,225 (35.75%) and 28,119 (35.62%) unigenes had homologs in the TrEMBL and NCBI non-redundant (Nr) databases, respectively. Of these unigenes, 21,140 were assigned to gene ontology classes, and a total of 11,990 unigenes were classified into 25 KOG functional categories. A total of 7,357 unigenes were annotated to 4,524 KOs, and 4,651 unigenes were mapped onto 342 KEGG pathways using BLAST comparison against the KEGG database. A total of 11,717 unigenes were differentially expressed (fold change>2) during the root induction stage, with 8,772 unigenes down-regulated and 2,945 unigenes up-regulated. A total of 12,737 unigenes were differentially expressed during the root initiation stage, with 9,303 unigenes down-regulated and 3,434 unigenes up-regulated. A total of 5,334 unigenes were differentially expressed between the root induction and initiation stage, with 2,167 unigenes down-regulated and 3,167 unigenes up-regulated. qRT-PCR validation of the 39 genes with known functions indicated a strong correlation (92.3%) with the RNA-Seq data. The GO enrichment, pathway mapping, and gene expression profiles reveal molecular traits for root induction and initiation. This study provides a platform for functional genomic research with this species. PMID:26177103
Li, Shi-Weng; Shi, Rui-Fang; Leng, Yan
2015-01-01
Adventitious rooting is the most important mechanism underlying vegetative propagation and an important strategy for plant propagation under environmental stress. The present study was conducted to obtain transcriptomic data and examine gene expression using RNA-Seq and bioinformatics analysis, thereby providing a foundation for understanding the molecular mechanisms controlling adventitious rooting. Three cDNA libraries constructed from mRNA samples from mung bean hypocotyls during adventitious rooting were sequenced. These three samples generated a total of 73 million, 60 million, and 59 million 100-bp reads, respectively. These reads were assembled into 78,697 unigenes with an average length of 832 bp, totaling 65 Mb. The unigenes were aligned against six public protein databases, and 29,029 unigenes (36.77%) were annotated using BLASTx. Among them, 28,225 (35.75%) and 28,119 (35.62%) unigenes had homologs in the TrEMBL and NCBI non-redundant (Nr) databases, respectively. Of these unigenes, 21,140 were assigned to gene ontology classes, and a total of 11,990 unigenes were classified into 25 KOG functional categories. A total of 7,357 unigenes were annotated to 4,524 KOs, and 4,651 unigenes were mapped onto 342 KEGG pathways using BLAST comparison against the KEGG database. A total of 11,717 unigenes were differentially expressed (fold change>2) during the root induction stage, with 8,772 unigenes down-regulated and 2,945 unigenes up-regulated. A total of 12,737 unigenes were differentially expressed during the root initiation stage, with 9,303 unigenes down-regulated and 3,434 unigenes up-regulated. A total of 5,334 unigenes were differentially expressed between the root induction and initiation stage, with 2,167 unigenes down-regulated and 3,167 unigenes up-regulated. qRT-PCR validation of the 39 genes with known functions indicated a strong correlation (92.3%) with the RNA-Seq data. The GO enrichment, pathway mapping, and gene expression profiles reveal molecular traits for root induction and initiation. This study provides a platform for functional genomic research with this species.
NASA Astrophysics Data System (ADS)
Sun, Xiujun; Li, Dongming; Liu, Zhihong; Zhou, Liqing; Wu, Biao; Yang, Aiguo
2017-10-01
The pen shell ( Atrina pectinata) is a large wedge-shaped bivalve, which belongs to family Pinnidae. Due to its large and nutritious adductor muscle, it is the popular seafood with high commercial value in Asia-Pacific countries. However, limiting genomic and transcriptomic data have hampered its genetic investigations. In this study, the transcriptome of A. pectinata was deeply sequenced using Illumina pair-end sequencing technology. After assembling, a total of 127263 unigenes were obtained. Functional annotation indicated that the highest percentage of unigenes (18.60%) was annotated on GO database, followed by 18.44% on PFAM database and 17.04% on NR database. There were 270 biological pathways matched with those in KEGG database. Furthermore, a total of 23452 potential simple sequence repeats (SSRs) were identified, of them the most abundant type was mono-nucleotide repeats (12902, 55.01%), which was followed by di-nucleotide (8132, 34.68%), tri-nucleotide (2010, 8.57%), tetra-nucleotide (401, 1.71%), and penta-nucleotide (7, 0.03%) repeats. Sixty SSRs were selected for validating and developing genic SSR markers, of them 23 showed polymorphism in a cultured population with the average observed and expected heterozygosities of 0.412 and 0.579, respectively. In this study, we established the first comprehensive transcript dataset of A. pectinata genes. Our results demonstrated that RNA-Seq is a fast and cost-effective method for genic SSR development in non-model species.
Zhu, Chen; Ai, Lin; Wang, Li; Yin, Pingping; Liu, Chenglan; Li, Shanshan; Zeng, Huiming
2016-01-01
Zoysia japonica brown spot was caused by necrotrophic fungus Rhizoctonia solani invasion, which led to severe financial loss in city lawn and golf ground maintenance. However, little was known about the molecular mechanism of R. solani pathogenicity in Z. japonica. In this study we examined early stage interaction between R. solani AG1 IA strain and Z. japonica cultivar "Zenith" root by cell ultra-structure analysis, pathogenesis-related proteins assay and transcriptome analysis to explore molecular clues for AG1 IA strain pathogenicity in Z. japonica. No obvious cell structure damage was found in infected roots and most pathogenesis-related protein activities showedg a downward trend especially in 36 h post inoculation, which exhibits AG1 IA strain stealthy invasion characteristic. According to Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) database classification, most DEGs in infected "Zenith" roots dynamically changed especially in three aspects, signal transduction, gene translation, and protein synthesis. Total 3422 unigenes of "Zenith" root were predicted into 14 kinds of resistance (R) gene class. Potential fungal resistance related unigenes of "Zenith" root were involved in ligin biosynthesis, phytoalexin synthesis, oxidative burst, wax biosynthesis, while two down-regulated unigenes encoding leucine-rich repeat receptor protein kinase and subtilisin-like protease might be important for host-derived signal perception to AG1 IA strain invasion. According to Pathogen Host Interaction (PHI) database annotation, 1508 unigenes of AG1 IA strain were predicted and classified into 37 known pathogen species, in addition, unigenes encoding virulence, signaling, host stress tolerance, and potential effector were also predicted. This research uncovered transcriptional profiling during the early phase interaction between R. solani AG1 IA strain and Z. japonica, and will greatly help identify key pathogenicity of AG1 IA strain.
Gupta, Yogesh; Pathak, Ashish K; Singh, Kashmir; Mantri, Shrikant S; Singh, Sudhir P; Tuli, Rakesh
2015-02-14
Annona squamosa L., a popular fruit tree, is the most widely cultivated species of the genus Annona. The lack of transcriptomic and genomic information limits the scope of genome investigations in this important shrub. It bears aggregate fruits with numerous seeds. A few rare accessions with very few seeds have been reported for Annona. A massive pyrosequencing (Roche, 454 GS FLX+) of transcriptome from early stages of fruit development (0, 4, 8 and 12 days after pollination) was performed to produce expression datasets in two genotypes, Sitaphal and NMK-1, that show a contrast in the number of seeds set in fruits. The data reported here is the first source of genome-wide differential transcriptome sequence in two genotypes of A. squamosa, and identifies several candidate genes related to seed development. Approximately 1.9 million high-quality clean reads were obtained in the cDNA library from the developing fruits of both the genotypes, with an average length of about 568 bp. Quality-reads were assembled de novo into 2074 to 11004 contigs in the developing fruit samples at different stages of development. The contig sequence data of all the four stages of each genotype were combined into larger units resulting into 14921 (Sitaphal) and 14178 (NMK-1) unigenes, with a mean size of more than 1 Kb. Assembled unigenes were functionally annotated by querying against the protein sequences of five different public databases (NCBI non redundant, Prunus persica, Vitis vinifera, Fragaria vesca, and Amborella trichopoda), with an E-value cut-off of 10(-5). A total of 4588 (Sitaphal) and 2502 (NMK-1) unigenes did not match any known protein in the NR database. These sequences could be genes specific to Annona sp. or belong to untranslated regions. Several of the unigenes representing pathways related to primary and secondary metabolism, and seed and fruit development expressed at a higher level in Sitaphal, the densely seeded cultivar in comparison to the poorly seeded NMK-1. A total of 2629 (Sitaphal) and 3445 (NMK-1) Simple Sequence Repeat (SSR) motifs were identified respectively in the two genotypes. These could be potential candidates for transcript based microsatellite analysis in A. squamosa. The present work provides early-stage fruit specific transcriptome sequence resource for A. squamosa. This repository will serve as a useful resource for investigating the molecular mechanisms of fruit development, and improvement of fruit related traits in A. squamosa and related species.
Hendre, Prasad S.; Aggarwal, Ramesh K.
2014-01-01
Coffee breeding and improvement efforts can be greatly facilitated by availability of a large repository of simple sequence repeats (SSRs) based microsatellite markers, which provides efficiency and high-resolution in genetic analyses. This study was aimed to improve SSR availability in coffee by developing new genic−/genomic-SSR markers using in-silico bioinformatics and streptavidin-biotin based enrichment approach, respectively. The expressed sequence tag (EST) based genic microsatellite markers (EST-SSRs) were developed using the publicly available dataset of 13,175 unigene ESTs, which showed a distribution of 1 SSR/3.4 kb of coffee transcriptome. Genomic SSRs, on the other hand, were developed from an SSR-enriched small-insert partial genomic library of robusta coffee. In total, 69 new SSRs (44 EST-SSRs and 25 genomic SSRs) were developed and validated as suitable genetic markers. Diversity analysis of selected coffee genotypes revealed these to be highly informative in terms of allelic diversity and PIC values, and eighteen of these markers (∼27%) could be mapped on a robusta linkage map. Notably, the markers described here also revealed a very high cross-species transferability. In addition to the validated markers, we have also designed primer pairs for 270 putative EST-SSRs, which are expected to provide another ca. 200 useful genetic markers considering the high success rate (88%) of marker conversion of similar pairs tested/validated in this study. PMID:25461752
Blair, Matthew W; Hurtado, Natalia; Chavarro, Carolina M; Muñoz-Torres, Monica C; Giraldo, Martha C; Pedraza, Fabio; Tomkins, Jeff; Wing, Rod
2011-03-22
Sequencing of cDNA libraries for the development of expressed sequence tags (ESTs) as well as for the discovery of simple sequence repeats (SSRs) has been a common method of developing microsatellites or SSR-based markers. In this research, our objective was to further sequence and develop common bean microsatellites from leaf and root cDNA libraries derived from the Andean gene pool accession G19833 and the Mesoamerican gene pool accession DOR364, mapping parents of a commonly used reference map. The root libraries were made from high and low phosphorus treated plants. A total of 3,123 EST sequences from leaf and root cDNA libraries were screened and used for direct simple sequence repeat discovery. From these EST sequences we found 184 microsatellites; the majority containing tri-nucleotide motifs, many of which were GC rich (ACC, AGC and AGG in particular). Di-nucleotide motif microsatellites were about half as common as the tri-nucleotide motif microsatellites but most of these were AGn microsatellites with a moderate number of ATn microsatellites in root ESTs followed by few ACn and no GCn microsatellites. Out of the 184 new SSR loci, 120 new microsatellite markers were developed in the BMc (Bean Microsatellites from cDNAs) series and these were evaluated for their capacity to distinguish bean diversity in a germplasm panel of 18 genotypes. We developed a database with images of the microsatellites and their polymorphism information content (PIC), which averaged 0.310 for polymorphic markers. The present study produced information about microsatellite frequency in root and leaf tissues of two important genotypes for common bean genomics: namely G19833, the Andean genotype selected for whole genome shotgun sequencing from race Peru, and DOR364 a race Mesoamerica subgroup 2 genotype that is a small-red seeded, released variety in Central America. Both race Peru and Mesoamerica subgroup 2 (small red beans) have been understudied in comparison to race Nueva Granada and Mesoamerica subgroup 1 (black beans) both with regards to gene expression and as sources of markers. However, we found few differences between SSR type and frequency between the G19833 leaf and DOR364 root tissue-derived ESTs. Overall, our work adds to the analysis of microsatellite frequency evaluation for common bean and provides a new set of 120 BMc markers which combined with the 248 previously developed BMc markers brings the total in this series to 368 markers. Once we include BMd markers, which are derived from GenBank sequences, the current total of gene-based markers from our laboratory surpasses 500 markers. These markers are basic for studies of the transcriptome of common bean and can form anchor points for genetic mapping studies in the future.
Ma, Jun; Kanakala, S; He, Yehua; Zhang, Junli; Zhong, Xiaolan
2015-01-01
Ananas comosus var. bracteatus (Red Pineapple) is an important ornamental plant for its colorful leaves and decorative red fruits. Because of its complex genome, it is difficult to understand the molecular mechanisms involved in the growth and development. Thus high-throughput transcriptome sequencing of Ananas comosus var. bracteatus is necessary to generate large quantities of transcript sequences for the purpose of gene discovery and functional genomic studies. The Ananas comosus var. bracteatus transcriptome was sequenced by the Illumina paired-end sequencing technology. We obtained a total of 23.5 million high quality sequencing reads, 1,555,808 contigs and 41,052 unigenes. In total 41,052 unigenes of Ananas comosus var. bracteatus, 23,275 unigenes were annotated in the NCBI non-redundant protein database and 23,134 unigenes were annotated in the Swiss-Port database. Out of these, 17,748 and 8,505 unigenes were assigned to gene ontology categories and clusters of orthologous groups, respectively. Functional annotation against Kyoto Encyclopedia of Genes and Genomes Pathway database identified 5,825 unigenes which were mapped to 117 pathways. The assembly predicted many unigenes that were previously unknown. The annotated unigenes were compared against pineapple, rice, maize, Arabidopsis, and sorghum. Unigenes that did not match any of those five sequence datasets are considered to be Ananas comosus var. bracteatus unique. We predicted unigenes encoding enzymes involved in terpenoid and phenylpropanoid biosynthesis. The sequence data provide the most comprehensive transcriptomic resource currently available for Ananas comosus var. bracteatus. To our knowledge; this is the first report on the de novo transcriptome sequencing of the Ananas comosus var. bracteatus. Unigenes obtained in this study, may help improve future gene expression, genetic and genomics studies in Ananas comosus var. bracteatus.
Ma, Jun; Kanakala, S.; He, Yehua; Zhang, Junli; Zhong, Xiaolan
2015-01-01
Background Ananas comosus var. bracteatus (Red Pineapple) is an important ornamental plant for its colorful leaves and decorative red fruits. Because of its complex genome, it is difficult to understand the molecular mechanisms involved in the growth and development. Thus high-throughput transcriptome sequencing of Ananas comosus var. bracteatus is necessary to generate large quantities of transcript sequences for the purpose of gene discovery and functional genomic studies. Results The Ananas comosus var. bracteatus transcriptome was sequenced by the Illumina paired-end sequencing technology. We obtained a total of 23.5 million high quality sequencing reads, 1,555,808 contigs and 41,052 unigenes. In total 41,052 unigenes of Ananas comosus var. bracteatus, 23,275 unigenes were annotated in the NCBI non-redundant protein database and 23,134 unigenes were annotated in the Swiss-Port database. Out of these, 17,748 and 8,505 unigenes were assigned to gene ontology categories and clusters of orthologous groups, respectively. Functional annotation against Kyoto Encyclopedia of Genes and Genomes Pathway database identified 5,825 unigenes which were mapped to 117 pathways. The assembly predicted many unigenes that were previously unknown. The annotated unigenes were compared against pineapple, rice, maize, Arabidopsis, and sorghum. Unigenes that did not match any of those five sequence datasets are considered to be Ananas comosus var. bracteatus unique. We predicted unigenes encoding enzymes involved in terpenoid and phenylpropanoid biosynthesis. Conclusion The sequence data provide the most comprehensive transcriptomic resource currently available for Ananas comosus var. bracteatus. To our knowledge; this is the first report on the de novo transcriptome sequencing of the Ananas comosus var. bracteatus. Unigenes obtained in this study, may help improve future gene expression, genetic and genomics studies in Ananas comosus var. bracteatus. PMID:25769053
Li, Yuanjun; Gou, Junbo; Chen, Fangfang; Li, Changfu; Zhang, Yansheng
2016-01-01
Xanthium strumarium L. is a traditional Chinese herb belonging to the Asteraceae family. The major bioactive components of this plant are sesquiterpene lactones (STLs), which include the xanthanolides. To date, the biogenesis of xanthanolides, especially their downstream pathway, remains largely unknown. In X. strumarium, xanthanolides primarily accumulate in its glandular trichomes. To identify putative gene candidates involved in the biosynthesis of xanthanolides, three X. strumarium transcriptomes, which were derived from the young leaves of two different cultivars and the purified glandular trichomes from one of the cultivars, were constructed in this study. In total, 157 million clean reads were generated and assembled into 91,861 unigenes, of which 59,858 unigenes were successfully annotated. All the genes coding for known enzymes in the upstream pathway to the biosynthesis of xanthanolides were present in the X. strumarium transcriptomes. From a comparative analysis of the X. strumarium transcriptomes, this study identified a number of gene candidates that are putatively involved in the downstream pathway to the synthesis of xanthanolides, such as four unigenes encoding CYP71 P450s, 50 unigenes for dehydrogenases, and 27 genes for acetyltransferases. The possible functions of these four CYP71 candidates are extensively discussed. In addition, 116 transcription factors that are highly expressed in X. strumarium glandular trichomes were also identified. Their possible regulatory roles in the biosynthesis of STLs are discussed. The global transcriptomic data for X. strumarium should provide a valuable resource for further research into the biosynthesis of xanthanolides.
Calcitonin intranasal--unigene: Salcatonin intranasal--unigene.
2004-01-01
An intranasal spray formulation of recombinant salmon calcitonin [salcatonin] is in development with Unigene Laboratories as therapy for postmenopausal osteoporosis. Calcitonin is an endogenous polypeptide hormone that regulates calcium and bone metabolism. It is produced by the parafollicular cells of the thyroid gland in humans and other species. Calcitonin inhibits bone loss through the suppression of osteoclast activity. Salmon calcitonin is approximately 40-50 times more potent than natural human calcitonin at inhibiting osteoclast function. It can be obtained naturally from salmon or can be synthesised with the same chemical structure. Calcitonin was originally available only as an injectable formulation, but in recent years more convenient formulations have become available. Unigene is actively seeking to license its intranasal calcitonin product in Europe and other territories outside the US. nigene licensed its intranasal calcitonin product to Upsher-Smith Laboratories in December 2002, under a $US10 million exclusive US licensing agreement. Under the terms of the agreement, Unigene received an upfront payment of $US3 million from Upsher-Smith and will be eligible to receive milestone payments and royalty payments on product sales. Unigene will be responsible for manufacturing the product at its Boonton facility in New Jersey, USA, and will sell finished calcitonin product to Upsher-Smith. Upsher-Smith will package, market and distribute the product nationwide. Unigene granted an exclusive license to Faran Laboratories in September 2003 for its intranasal calcitonin osteoporosis product in Greece. Unigene will sell the finished product to Faran, who will promote and market it throughout the country after Unigene obtains European regulatory approval and local pricing approval. Unigene will receive an upfront payment and is eligible to receive milestone payments prior to product launch. Faran will pay Unigene a fixed price for each unit of product received. Qingdao General Pharmaceutical Company was a licensee for Unigene's injectable and intranasal calcitonin products in the People's Republic of China, and Unigene had received initial payments from Qingdao General Pharmaceutical in 1996. However, in June 2000, Unigene announced that it has entered into a joint venture with Shijiazhuang Pharmaceutical Group Company for the manufacture and distribution of injectable and intranasal calcitonin for the treatment of osteoporosis in China. Unigene initially will be responsible for supplying bulk calcitonin manufactured in its production facility in New Jersey and the joint venture will be responsible for filling, packaging, promoting and marketing the products. Unigene owns 45% of the contractual joint venture. Unigene is also developing oral and injectable formulations of calcitonin. In January 2004, Unigene announced it had received an approvable letter from the US FDA to market its calcitonin intranasal spray for the treatment of osteoporosis. The letter indicates that the FDA will approve the NDA upon finalisation of the labelling and resolution of specific remaining issues, including the submission of additional information and clinical data. Unigene filed the NDA in March 2003. Using the results from a pilot study completed in the UK, Unigene filed an IND with the FDA and began clinical testing of this intranasal calcitonin in the US. Clinical studies were successfully completed by year end 2001, demonstrating significant bone marker activity and similar serum concentrations between this product and that of an existing nasal calcitonin product. The European Union's regulatory authority, the Committee for Proprietary Medicinal Products (CPMP), has confirmed the efficacy of calcitonin formulations for the treatment of postmenopausal osteoporosis and other bone disorders. The CPMP has recommended revisions to and harmonisation within the European Union of the authorised indications for calcitonin formulations. The CPMP has determined that authorised indications for intranasal calcitonin will be approved for "treatment of established post-menopausal osteoporosis in order to reduce the risk of vertebral fractures", and new prescribing information will clarify that intranasal calcitonin does not appear to reduce the number of hip fractures. These recommendations will be implemented in the near future and will eliminate discrepancies between countries and between formulations. The Chinese regulatory authorities have granted Unigene an import license for calcitonin, and Unigene and Shijiazhuang have submitted an NDA in China. If this NDA is approved, the joint venture will have up to 6 years' market exclusivity for intranasal and injectable calcitonin. Unigene was granted a US patent for the intranasal formulation of calcitonin in 2002.
Zhao, M; Wang, T; Adamson, K J; Storey, K B; Cummins, S F
2016-02-08
The land snail Theba pisana is native to the Mediterranean region but has become one of the most abundant invasive species worldwide. Here, we present three transcriptomes of this agriculture pest derived from three tissues: the central nervous system, hepatopancreas (digestive gland), and foot muscle. Sequencing of the three tissues produced 339,479,092 high quality reads and a global de novo assembly generated a total of 250,848 unique transcripts (unigenes). BLAST analysis mapped 52,590 unigenes to NCBI non-redundant protein databases and further functional analysis annotated 21,849 unigenes with gene ontology. We report that T. pisana transcripts have representatives in all functional classes and a comparison of differentially expressed transcripts amongst all three tissues demonstrates enormous differences in their potential metabolic activities. The genes differentially expressed include those with sequence similarity to those genes associated with multiple bacterial diseases and neurological diseases. To provide a valuable resource that will assist functional genomics study, we have implemented a user-friendly web interface, ThebaDB (http://thebadb.bioinfo-minzhao.org/). This online database allows for complex text queries, sequence searches, and data browsing by enriched functional terms and KEGG mapping.
2011-01-01
Background Technological advances are progressively increasing the application of genomics to a wider array of economically and ecologically important species. High-density maps enriched for transcribed genes facilitate the discovery of connections between genes and phenotypes. We report the construction of a high-density linkage map of expressed genes for the heterozygous genome of Eucalyptus using Single Feature Polymorphism (SFP) markers. Results SFP discovery and mapping was achieved using pseudo-testcross screening and selective mapping to simultaneously optimize linkage mapping and microarray costs. SFP genotyping was carried out by hybridizing complementary RNA prepared from 4.5 year-old trees xylem to an SFP array containing 103,000 25-mer oligonucleotide probes representing 20,726 unigenes derived from a modest size expressed sequence tags collection. An SFP-mapping microarray with 43,777 selected candidate SFP probes representing 15,698 genes was subsequently designed and used to genotype SFPs in a larger subset of the segregating population drawn by selective mapping. A total of 1,845 genes were mapped, with 884 of them ordered with high likelihood support on a framework map anchored to 180 microsatellites with average density of 1.2 cM. Using more probes per unigene increased by two-fold the likelihood of detecting segregating SFPs eventually resulting in more genes mapped. In silico validation showed that 87% of the SFPs map to the expected location on the 4.5X draft sequence of the Eucalyptus grandis genome. Conclusions The Eucalyptus 1,845 gene map is the most highly enriched map for transcriptional information for any forest tree species to date. It represents a major improvement on the number of genes previously positioned on Eucalyptus maps and provides an initial glimpse at the gene space for this global tree genome. A general protocol is proposed to build high-density transcript linkage maps in less characterized plant species by SFP genotyping with a concurrent objective of reducing microarray costs. HIgh-density gene-rich maps represent a powerful resource to assist gene discovery endeavors when used in combination with QTL and association mapping and should be especially valuable to assist the assembly of reference genome sequences soon to come for several plant and animal species. PMID:21492453
Wu, Shaohua; Zhang, Shixin; Chao, Jinquan; Deng, Xiaomin; Chen, Yueyi; Shi, Minjing; Tian, Wei-Min
2016-01-01
The secondary laticifer in rubber tree (Hevea brasiliensis Muell. Arg.) is a specific tissue within the secondary phloem. This tissue differentiates from the vascular cambia, and its function is natural rubber biosynthesis and storage. Given that jasmonates play a pivotal role in secondary laticifer differentiation, we established an experimental system with jasmonate (JA) mimic coronatine (COR) for studying the secondary laticifer differentiation: in this system, differentiation occurs within five days of the treatment of epicormic shoots with COR. In the present study, the experimental system was used to perform transcriptome sequencing and gene expression analysis. A total of 67,873 unigenes were assembled, and 50,548 unigenes were mapped at least in one public database. Of these being annotated unigenes, 15,780 unigenes were differentially expressed early after COR treatment, and 19,824 unigenes were differentially expressed late after COR treatment. At the early stage, 8,646 unigenes were up-regulated, while 7,134 unigenes were down-regulated. At the late stage, the numbers of up- and down-regulated unigenes were 7,711 and 12,113, respectively. The annotation data and gene expression analysis of the differentially expressed unigenes suggest that JA-mediated signalling, Ca2+ signal transduction and the CLAVATA-MAPK-WOX signalling pathway may be involved in regulating secondary laticifer differentiation in rubber trees. PMID:27808245
Li, Yuanjun; Gou, Junbo; Chen, Fangfang; Li, Changfu; Zhang, Yansheng
2016-01-01
Xanthium strumarium L. is a traditional Chinese herb belonging to the Asteraceae family. The major bioactive components of this plant are sesquiterpene lactones (STLs), which include the xanthanolides. To date, the biogenesis of xanthanolides, especially their downstream pathway, remains largely unknown. In X. strumarium, xanthanolides primarily accumulate in its glandular trichomes. To identify putative gene candidates involved in the biosynthesis of xanthanolides, three X. strumarium transcriptomes, which were derived from the young leaves of two different cultivars and the purified glandular trichomes from one of the cultivars, were constructed in this study. In total, 157 million clean reads were generated and assembled into 91,861 unigenes, of which 59,858 unigenes were successfully annotated. All the genes coding for known enzymes in the upstream pathway to the biosynthesis of xanthanolides were present in the X. strumarium transcriptomes. From a comparative analysis of the X. strumarium transcriptomes, this study identified a number of gene candidates that are putatively involved in the downstream pathway to the synthesis of xanthanolides, such as four unigenes encoding CYP71 P450s, 50 unigenes for dehydrogenases, and 27 genes for acetyltransferases. The possible functions of these four CYP71 candidates are extensively discussed. In addition, 116 transcription factors that are highly expressed in X. strumarium glandular trichomes were also identified. Their possible regulatory roles in the biosynthesis of STLs are discussed. The global transcriptomic data for X. strumarium should provide a valuable resource for further research into the biosynthesis of xanthanolides. PMID:27625674
De novo transcriptomic analysis and development of EST-SSRs for Sorbus pohuashanensis (Hance) Hedl.
Guan, Xuelian; Fu, Qiang; Zhang, Ze; Hu, Zenghui; Zheng, Jian; Lu, Yizeng; Li, Wei
2017-01-01
Sorbus pohuashanensis is a native tree species of northern China that is used for a variety of ecological purposes. The species is often grown as an ornamental landscape tree because of its beautiful form, silver flowers in early summer, attractive pinnate leaves in summer, and red leaves and fruits in autumn. However, development and further utilization of the species are hindered by the lack of comprehensive genetic information, which impedes research into its genetics and molecular biology. Recent advances in de novo transcriptome sequencing (RNA-seq) technology have provided an effective means to obtain genomic information from non-model species. Here, we applied RNA-seq for sequencing S. pohuashanensis leaves and obtained a total of 137,506 clean reads. After assembly, 96,213 unigenes with an average length of 770 bp were obtained. We found that 64.5% of the unigenes could be annotated using bioinformatics tools to analyze gene function and alignment with the NCBI database. Overall, 59,089 unigenes were annotated using the Nr database(non-redundant protein database), 35,225 unigenes were annotated using the GO (Gene Ontology categories) database, and 33,168 unigenes were annotated using COG (Cluster of Orthologous Groups). Analysis of the unigenes using the KEGG (Kyoto Encyclopedia of Genes and Genomes) database indicated that 13,953 unigenes were involved in 322 metabolic pathways. Finally, simple sequence repeat (SSR) site detection identified 6,604 unigenes that included EST-SSRs and a total of 7,473 EST-SSRs in the unigene sequences. Fifteen polymorphic SSRs were screened and found to be of use for future genetic research. These unigene sequences will provide important genetic resources for genetic improvement and investigation of biochemical processes in S. pohuashanensis. PMID:28614366
Tang, Pei-An; Wu, Hai-Jing; Xue, Hao; Ju, Xing-Rong; Song, Wei; Zhang, Qi-Lin; Yuan, Ming-Long
2017-07-30
The Indian meal moth Plodia interpunctella (Lepidoptera: Pyralidae) is a worldwide pest that causes serious damage to stored foods. Although many efforts have been conducted on this species due to its economic importance, the study of genetic basis of development, behavior and insecticide resistance has been greatly hampered due to lack of genomic information. In this study, we used high throughput sequencing platform to perform a de novo transcriptome assembly and tag-based digital gene expression profiling (DGE) analyses across four different developmental stages of P. interpunctella (egg, third-instar larvae, pupae and adult). We obtained approximate 9gigabyte (GB) of clean data and recovered 84,938 unigenes, including 37,602 clusters and 47,336 singletons. These unigenes were annotated using BLAST against the non-redundant protein databases and then functionally classified based on Gene Ontology (GO), Clusters of Orthologous Groups (COG), and Kyoto Encyclopedia of Genes and Genomes databases (KEGG). A large number of differentially expressed genes were identified by pairwise comparisons among different developmental stages. Gene expression profiles dramatically changed between developmental stage transitions. Some of these differentially expressed genes were related to digestion and cuticularization. Quantitative real-time PCR results of six randomly selected genes conformed the findings in the DGEs. Furthermore, we identified over 8000 microsatellite markers and 97,648 single nucleotide polymorphisms which will be useful for population genetics studies of P. interpunctella. This transcriptomic information provided insight into the developmental basis of P. interpunctella and will be helpful for establishing integrated management strategies and developing new targets of insecticides for this serious pest. Copyright © 2017 Elsevier B.V. All rights reserved.
Nigam, Deepti; Saxena, Swati; Ramakrishna, G.; Singh, Archana; Singh, N. K.; Gaikwad, Kishor
2017-01-01
Pigeonpea [Cajanus cajan (L.) Millsp.] is a heat and drought resilient legume crop grown mostly in Asia and Africa. Pigeonpea is affected by various biotic (diseases and insect pests) and abiotic stresses (salinity and water logging) which limit the yield potential of this crop. However, resistance to all these constraints is not readily available in the cultivated genotypes and some of the wild relatives have been found to withstand these resistances. Thus, the utilization of crop wild relatives (CWR) in pigeonpea breeding has been effective in conferring resistance, quality and breeding efficiency traits to this crop. Bud and leaf tissue of Cajanus scarabaeoides, a wild relative of pigeon pea were used for transcriptome profiling. Approximately 30 million clean reads filtered from raw reads by removal of adaptors, ambiguous reads and low-quality reads (3.02 gigabase pairs) were generated by Illumina paired-end RNA-seq technology. All of these clean reads were pooled and assembled de novo into 1,17,007 transcripts using the Trinity. Finally, a total of 98,664 unigenes were derived with mean length of 396 bp and N50 values of 1393. The assembly produced significant mapping results (73.68%) in BLASTN searches of the Glycine max CDS sequence database (Ensembl). Further, uniprot database of Viridiplantae was used for unigene annotation; 81,799 of 98,664 (82.90%) unigenes were finally annotated with gene descriptions or conserved protein domains. Further, a total of 23,475 SSRs were identified in 27,321 unigenes. This data will provide useful information for mining of functionally important genes and SSR markers for pigeonpea improvement. PMID:28748187
2013-01-01
Background Longan is a tropical/subtropical fruit tree of great economic importance in Southeast Asia. Progress in understanding molecular mechanisms of longan embryogenesis, which is the primary influence on fruit quality and yield, is slowed by lack of transcriptomic and genomic information. Illumina second generation sequencing, which is suitable for generating enormous numbers of transcript sequences that can be used for functional genomic analysis of longan. Results In this study, a longan embryogenic callus (EC) cDNA library was sequenced using an Illumina HiSeq 2000 system. A total of 64,876,258 clean reads comprising 5.84 Gb of nucleotides were assembled into 68,925 unigenes of 448-bp mean length, with unigenes ≥1000 bp accounting for 8.26% of the total. Using BLASTx, 40,634 unigenes were found to have significant similarity with accessions in Nr and Swiss- Prot databases. Of these, 38,845 unigenes were assigned to 43 GO sub-categories and 17,118 unigenes were classified into 25 COG sub-groups. In addition, 17,306 unigenes mapped to 199 KEGG pathways, with the categories of Metabolic pathways, Plant-pathogen interaction, Biosynthesis of secondary metabolites, and Genetic information processing being well represented. Analyses of unigenes ≥1000 bp revealed 328 embryogenesis-related unigenes as well as numerous unigenes expressed in EC associated with functions of reproductive growth, such as flowering, gametophytogenesis, and fertility, and vegetative growth, such as root and shoot growth. Furthermore, 23 unigenes related to embryogenesis and reproductive and vegetative growth were validated by quantitative real time PCR (qPCR) in samples from different stages of longan somatic embryogenesis (SE); their differentially expressions in the various embryogenic cultures indicated their possible roles in longan SE. Conclusions The quantity and variety of expressed EC genes identified in this study is sufficient to serve as a global transcriptome dataset for longan EC and to provide more molecular resources for longan functional genomics. PMID:23957614
Xiao, Da; Tan, Xiaoling; Wang, Wenjuan; Zhang, Fan; Desneux, Nicolas; Wang, Su
2017-01-01
Biological control is usually used in combination with chemical control for practical agricultural applications. Thus, the influence of insecticides on the natural predators used for biological control should be investigated for integrated pest management. The ladybird Harmonia axyridis is an effective predator on aphids and coccids. Beta-cypermethrin is a broad-spectrum insecticide used worldwide for controlling insect pests. H. axyridis is becoming increasingly threatened by this insecticide. Here, we investigated the effect of a sublethal dose of beta-cypermethrin on flight, locomotion, respiration, and detoxification system of H. axyridis. After exposure to beta-cypermethrin, succinic female adults flew more times, longer distances, and during longer time periods. Exposure to a sublethal dose of beta-cypermethrin also promoted an increase in walking rate, walking distance, walking duration, and also an increase in respiratory quotient and respiratory rate. To investigate the effects of beta-cypermethrin on H. axyridis detoxification system, we analyzed the transcriptome of H. axyridis adults, focusing on genes related to detoxification systems. De novo assembly generated 65,509 unigenes with a mean length of 799 bp. From these genes, 26,020 unigenes (40.91% of all unigenes) exhibited clear homology to known genes in the NCBI non-redundant database. In addition, 10,402 unigenes were annotated in the Cluster of Orthologous Groups database, 12,088 unigenes were assigned to the Gene Ontology database and 12,269 unigenes were in the Kyoto Encyclopedia of Genes and Genome (KEGG) database. Exposure to beta-cypermethrin had significant effects on the transcriptome profile of H. axyridis adult. Based on uniquely mapped reads, 3,296 unigenes were differentially expressed, 868 unigenes were up-regulated and 2,248 unigenes were down-regulated. We identified differentially-expressed unigenes related to general detoxification systems in H. axyridis. This assembled, annotated transcriptome provides a valuable genomic resource for further understanding the molecular basis of detoxification mechanisms in H. axyridis. PMID:28239355
Xiao, Da; Tan, Xiaoling; Wang, Wenjuan; Zhang, Fan; Desneux, Nicolas; Wang, Su
2017-01-01
Biological control is usually used in combination with chemical control for practical agricultural applications. Thus, the influence of insecticides on the natural predators used for biological control should be investigated for integrated pest management. The ladybird Harmonia axyridis is an effective predator on aphids and coccids. Beta-cypermethrin is a broad-spectrum insecticide used worldwide for controlling insect pests. H. axyridis is becoming increasingly threatened by this insecticide. Here, we investigated the effect of a sublethal dose of beta-cypermethrin on flight, locomotion, respiration, and detoxification system of H. axyridis . After exposure to beta-cypermethrin, succinic female adults flew more times, longer distances, and during longer time periods. Exposure to a sublethal dose of beta-cypermethrin also promoted an increase in walking rate, walking distance, walking duration, and also an increase in respiratory quotient and respiratory rate. To investigate the effects of beta-cypermethrin on H. axyridis detoxification system, we analyzed the transcriptome of H. axyridis adults, focusing on genes related to detoxification systems. De novo assembly generated 65,509 unigenes with a mean length of 799 bp. From these genes, 26,020 unigenes (40.91% of all unigenes) exhibited clear homology to known genes in the NCBI non-redundant database. In addition, 10,402 unigenes were annotated in the Cluster of Orthologous Groups database, 12,088 unigenes were assigned to the Gene Ontology database and 12,269 unigenes were in the Kyoto Encyclopedia of Genes and Genome (KEGG) database. Exposure to beta-cypermethrin had significant effects on the transcriptome profile of H. axyridis adult. Based on uniquely mapped reads, 3,296 unigenes were differentially expressed, 868 unigenes were up-regulated and 2,248 unigenes were down-regulated. We identified differentially-expressed unigenes related to general detoxification systems in H. axyridis . This assembled, annotated transcriptome provides a valuable genomic resource for further understanding the molecular basis of detoxification mechanisms in H. axyridis .
Is gliomatosis peritonei derived from the associated ovarian teratoma?
Kwan, Man-Yee; Kalle, Wouter; Lau, Gene T C; Chan, John K C
2004-06-01
Gliomatosis peritonei, a rare condition that occurs almost exclusively in the setting of ovarian immature teratoma, is characterized by the occurrence of nodules of mature glial tissues in the peritoneum. It is controversial whether glial tissues are derived from maturation of the associated teratomatous tissue that has implanted in the peritoneum, or glial differentiation of subperitoneal stem cells. In this study, we employed the unique genetic characteristics of ovarian teratomas (often with a duplicated set of maternal chromosomes and thus homozygous at many polymorphic microsatellite loci) versus normal tissues (heterozygous pattern due to presence of maternal and paternal genetic materials) to investigate the origin of gliomatosis peritonei. DNA samples were extracted from microdissected paraffin-embedded tissues, including the glial implants, the associated ovarian teratomas, and normal tissues, to determine their patterns of microsatellite loci in a multiplex polymerase chain reaction system. Two cases were not informative because the ovarian teratoma showed a heterozygous microsatellite pattern. In the 5 informative cases, the normal tissues showed a heterozygous pattern in the microsatellite loci, the associated teratomas showed a homozygous pattern, and the glial tissues showed a heterozygous pattern. Thus, gliomatosis peritonei is genetically unrelated to the associated teratoma but is probably derived from nonteratomatous cells, such as through metaplasia of submesothelial cells.
Qiu, Y C; Zhou, R H; Kong, X Y; Zhang, S S; Jia, J Z
2005-11-01
A powdery mildew resistance gene from Triticum urartu Tum. accession UR206 was successfully transferred into hexaploid wheat (Triticum aestivum L.) through crossing and backcrossing. The F1 plants, which had 28 chromosomes and an average of 5.32 bivalents and 17.36 univalents in meiotic pollen mother cells (PMC), were obtained through embryos rescued owing to shriveling of endosperm in hybrid seed of cross Chinese Spring (CS) x UR206. Hybrid seeds were produced through backcrossing F1 with common wheat parents. The derivative lines had normal chromosome numbers and powdery mildew resistance similar to the donor UR206, indicating that the powdery mildew resistance gene originating from T. urartu accession UR206 was successfully transferred and expressed in a hexaploid wheat background. Genetic analysis indicated that a single dominant gene controlled the powdery mildew resistance at the seedling stage. To map and tag the powdery mildew resistance gene, 143 F2 individuals derived from a cross UR206 x UR203 were used to construct a linkage map. The resistant gene was mapped on the chromosome 7AL based on the mapped microsatellite makers. The map spanned 52.1 cM and the order of these microsatellite loci agreed well with the established microsatellite map of chromosome arm 7AL. The resistance gene was flanked by the microsatellite loci Xwmc273 and Xpsp3003, with the genetic distances of 2.2 cM and 3.8 cM, respectively. On the basis of the origin and chromosomal location of the gene, it was temporarily designated PmU.
Qi, Zhitao; Wu, Ping; Zhang, Qihuan; Wei, Youchuan; Wang, Zisheng; Qiu, Ming; Shao, Rong; Li, Yao; Gao, Qian
2016-02-01
Soiny mullet (Liza haematocheila) is becoming an economically important aquaculture mugilid species in China and other Asian countries. However, increasing incidences of bacterial pathogenic diseases has greatly hampered the production of the soiny mullet. Deeper understanding of the soiny mullet immune system and its related genes in response to bacterial infections are necessary for disease control in this species. In this study, the transcriptomic profile of spleen from soiny mullet challenged with Streptococcus dysgalactiae was analyzed by Illumina-based paired-end sequencing method. After assembly, 86,884 unique transcript fragments (unigenes) were assembled, with an average length of 991 bp. Approximately 41,795 (48.1%) unigenes were annotated in the nr NCBI database and 57.9% of the unigenes were similar to that of the Nile tilapia. A total of 24,299 unigenes were categorized into three Gene Ontology (GO) categories (molecular function, cellular component and biological process), 13,570 unigenes into 25 functional Clusters of Orthologous Groups of proteins (COG) categories, and 30,547 unigenes were grouped into 258 known pathways in the Kyoto Encyclopedia of Genes and Genomes (KEGG) database. Following S. dysgalactiae infection, 11,461 differentially expressed unigenes were identified including 4658 up-regulated unigenes and 6803 down-regulated unigenes. Significant enrichment analysis of these differentially expressed unigenes identified major immune related pathways, including the Toll-like receptor, complement and coagulation cascades, T cell receptor signaling pathway and B cell receptor signaling pathway. In addition, 24,813 simple sequence repeats (SSRs) and 127,503 candidate single nucleotide polymorphisms (SNPs) were identified from the mullet spleen transcriptome. To this date, this study has globally analyzed the transcriptome profile from the spleen of L. haematocheila after S. dysgalactiae infection. Therefore, the results of our study contributes to better on the immune system and defense mechanisms of soiny mullet in response to bacterial infection, and provides valuable references for related studies in mugilidae species which currently lack genomic reference. Copyright © 2015 Elsevier Ltd. All rights reserved.
He, Xueying; Wang, Huan; Yang, Jinfen; Deng, Ke; Wang, Teng
2018-02-01
Amomum villosum Lour. is an important Chinese medicinal plant that has diverse medicinal functions, and mainly contains volatile terpenes. This study aims to explore the WRKY transcription factors (TFs) and terpene synthase (TPS) unigenes that might be involved in terpene biosynthesis in A. villosum, and thus providing some new information on the regulation of terpenes in plants. RNA sequencing of A. villosum induced by methyl jasmonate (MeJA) revealed that the WRKY family was the second largest TF family in the transcriptome. Thirty-six complete WRKY domain sequences were expressed in response to MeJA. Further, six WRKY unigenes were highly correlated with eight deduced TPS unigenes. Ultimately, we combined the terpene abundance with the expression of candidate WRKY TFs and TPS unigenes to presume a possible model wherein AvWRKY61, AvWRKY28, and AvWRKY40 might coordinately trans-activate the AvNeoD promoter. We propose an approach to further investigate TF unigenes that might be involved in terpenoid biosynthesis, and identified four unigenes for further analyses.
Murray, Lee; Mobegi, Victor A; Duffy, Craig W; Assefa, Samuel A; Kwiatkowski, Dominic P; Laman, Eugene; Loua, Kovana M; Conway, David J
2016-05-12
In regions where malaria is endemic, individuals are often infected with multiple distinct parasite genotypes, a situation that may impact on evolution of parasite virulence and drug resistance. Most approaches to studying genotypic diversity have involved analysis of a modest number of polymorphic loci, although whole genome sequencing enables a broader characterisation of samples. PCR-based microsatellite typing of a panel of ten loci was performed on Plasmodium falciparum in 95 clinical isolates from a highly endemic area in the Republic of Guinea, to characterize within-isolate genetic diversity. Separately, single nucleotide polymorphism (SNP) data from genome-wide short-read sequences of the same samples were used to derive within-isolate fixation indices (F ws), an inverse measure of diversity within each isolate compared to overall local genetic diversity. The latter indices were compared with the microsatellite results, and also with indices derived by randomly sampling modest numbers of SNPs. As expected, the number of microsatellite loci with more than one allele in each isolate was highly significantly inversely correlated with the genome-wide F ws fixation index (r = -0.88, P < 0.001). However, the microsatellite analysis revealed that most isolates contained mixed genotypes, even those that had no detectable genome sequence heterogeneity. Random sampling of different numbers of SNPs showed that an F ws index derived from ten or more SNPs with minor allele frequencies of >10 % had high correlation (r > 0.90) with the index derived using all SNPs. Different types of data give highly correlated indices of within-infection diversity, although PCR-based analysis detects low-level minority genotypes not apparent in bulk sequence analysis. When whole-genome data are not obtainable, quantitative assay of ten or more SNPs can yield a reasonably accurate estimate of the within-infection fixation index (F ws).
Duan, Jun; Ladd, Tim; Doucet, Daniel; Cusson, Michel; vanFrankenhuyzen, Kees; Mittapalli, Omprakash; Krell, Peter J; Quan, Guoxing
2015-01-01
The Emerald ash borer (EAB), Agrilus planipennis, is an invasive phloem-feeding insect pest of ash trees. Since its initial discovery near the Detroit, US- Windsor, Canada area in 2002, the spread of EAB has had strong negative economic, social and environmental impacts in both countries. Several transcriptomes from specific tissues including midgut, fat body and antenna have recently been generated. However, the relatively low sequence depth, gene coverage and completeness limited the usefulness of these EAB databases. High-throughput deep RNA-Sequencing (RNA-Seq) was used to obtain 473.9 million pairs of 100 bp length paired-end reads from various life stages and tissues. These reads were assembled into 88,907 contigs using the Trinity strategy and integrated into 38,160 unigenes after redundant sequences were removed. We annotated 11,229 unigenes by searching against the public nr, Swiss-Prot and COG. The EAB transcriptome assembly was compared with 13 other sequenced insect species, resulting in the prediction of 536 unigenes that are Coleoptera-specific. Differential gene expression revealed that 290 unigenes are expressed during larval molting and 3,911 unigenes during metamorphosis from larvae to pupae, respectively (FDR< 0.01 and log2 FC>2). In addition, 1,167 differentially expressed unigenes were identified from larval and adult midguts, 435 unigenes were up-regulated in larval midgut and 732 unigenes were up-regulated in adult midgut. Most of the genes involved in RNA interference (RNAi) pathways were identified, which implies the existence of a system RNAi in EAB. This study provides one of the most fundamental and comprehensive transcriptome resources available for EAB to date. Identification of the tissue- stage- or species- specific unigenes will benefit the further study of gene functions during growth and metamorphosis processes in EAB and other pest insects.
Duan, Jun; Ladd, Tim; Doucet, Daniel; Cusson, Michel; vanFrankenhuyzen, Kees; Mittapalli, Omprakash; Krell, Peter J.; Quan, Guoxing
2015-01-01
Background The Emerald ash borer (EAB), Agrilus planipennis, is an invasive phloem-feeding insect pest of ash trees. Since its initial discovery near the Detroit, US- Windsor, Canada area in 2002, the spread of EAB has had strong negative economic, social and environmental impacts in both countries. Several transcriptomes from specific tissues including midgut, fat body and antenna have recently been generated. However, the relatively low sequence depth, gene coverage and completeness limited the usefulness of these EAB databases. Methodology and Principal Findings High-throughput deep RNA-Sequencing (RNA-Seq) was used to obtain 473.9 million pairs of 100 bp length paired-end reads from various life stages and tissues. These reads were assembled into 88,907 contigs using the Trinity strategy and integrated into 38,160 unigenes after redundant sequences were removed. We annotated 11,229 unigenes by searching against the public nr, Swiss-Prot and COG. The EAB transcriptome assembly was compared with 13 other sequenced insect species, resulting in the prediction of 536 unigenes that are Coleoptera-specific. Differential gene expression revealed that 290 unigenes are expressed during larval molting and 3,911 unigenes during metamorphosis from larvae to pupae, respectively (FDR< 0.01 and log2 FC>2). In addition, 1,167 differentially expressed unigenes were identified from larval and adult midguts, 435 unigenes were up-regulated in larval midgut and 732 unigenes were up-regulated in adult midgut. Most of the genes involved in RNA interference (RNAi) pathways were identified, which implies the existence of a system RNAi in EAB. Conclusions and Significance This study provides one of the most fundamental and comprehensive transcriptome resources available for EAB to date. Identification of the tissue- stage- or species- specific unigenes will benefit the further study of gene functions during growth and metamorphosis processes in EAB and other pest insects. PMID:26244979
Tariq, Mansoor; Chen, Rong; Yuan, Hongyu; Liu, Yanjie; Wu, Yanan; Wang, Junya; Xia, Chun
2015-01-01
Background The Chinese goose is one of the most economically important poultry birds and is a natural reservoir for many avian viruses. However, the nature and regulation of the innate and adaptive immune systems of this waterfowl species are not completely understood due to limited information on the goose genome. Recently, transcriptome sequencing technology was applied in the genomic studies focused on novel gene discovery. Thus, this study described the transcriptome of the goose peripheral blood lymphocytes to identify immunity relevant genes. Principal Findings De novo transcriptome assembly of the goose peripheral blood lymphocytes was sequenced by Illumina-Solexa technology. In total, 211,198 unigenes were assembled from the 69.36 million cleaned reads. The average length, N50 size and the maximum length of the assembled unigenes were 687 bp, 1,298 bp and 18,992 bp, respectively. A total of 36,854 unigenes showed similarity by BLAST search against the NCBI non-redundant (Nr) protein database. For functional classification, 163,161 unigenes were comprised of three Gene Ontology (Go) categories and 67 subcategories. A total of 15,334 unigenes were annotated into 25 eukaryotic orthologous groups (KOGs) categories. Kyoto Encyclopedia of Genes and Genomes (KEGG) database annotated 39,585 unigenes into six biological functional groups and 308 pathways. Among the 2,757 unigenes that participated in the 15 immune system KEGG pathways, 125 of the most important immune relevant genes were summarized and analyzed by STRING analysis to identify gene interactions and relationships. Moreover, 10 genes were confirmed by PCR and analyzed. Of these 125 unigenes, 109 unigenes, approximately 87%, were not previously identified in the goose. Conclusion This de novo transcriptome analysis could provide important Chinese goose sequence information and highlights the value of new gene discovery, pathways investigation and immune system gene identification, and comparison with other avian species as useful tools to understand the goose immune system. PMID:25816068
Microsatellite alterations as clonal markers for the detection of human cancer.
Mao, L; Lee, D J; Tockman, M S; Erozan, Y S; Askin, F; Sidransky, D
1994-01-01
Microsatellite instability has been reported to be an important feature of tumors from hereditary nonpolyposis colorectal carcinoma (HNPCC) patients. The recent discovery of genetic instability in small cell lung carcinoma, a neoplasm not associated with HNPCC, led us to investigate the possible presence of microsatellite alterations in other tumor types. We examined 52 microsatellite repeat sequences in the DNA of normal and tumor pairs from 100 head and neck, bladder, and lung cancer patients by the polymerase chain reaction. Although alterations were rare in dinucleotide repeats, larger (tri- or tetranucleotide) repeats were found to be more prone to expansion or deletion. We screened 100 tumors with a panel of nine tri- and tetranucleotide repeat markers and identified 26 (26%) that displayed alterations in at least one locus. This observation prompted us to examine the possibility of using microsatellite alterations as markers to detect clonal tumor-derived cell populations in pathologic samples. The identical microsatellite alterations detected in the primary tumors were successfully identified in corresponding urine, sputum, and surgical margins from affected patients. This study demonstrates that appropriately selected microsatellite loci are commonly altered in many cancers and can serve as clonal markers for their detection. Images PMID:7937908
Chloroplast and nuclear microsatellite analysis of Aegilops cylindrica.
Gandhi, Harish T; Vales, M Isabel; Watson, Christy J W; Mallory-Smith, Carol A; Mori, Naoki; Rehman, Maqsood; Zemetra, Robert S; Riera-Lizarazu, Oscar
2005-08-01
Aegilops cylindrica Host (2n = 4x = 28, genome CCDD) is an allotetraploid formed by hybridization between the diploid species Ae. tauschii Coss. (2n = 2x = 14, genome DD) and Ae. markgrafii (Greuter) Hammer (2n = 2x = 14, genome CC). Previous research has shown that Ae. tauschii contributed its cytoplasm to Ae. cylindrica. However, our analysis with chloroplast microsatellite markers showed that 1 of the 36 Ae. cylindrica accessions studied, TK 116 (PI 486249), had a plastome derived from Ae. markgrafii rather than Ae. tauschii. Thus, Ae. markgrafii has also contributed its cytoplasm to Ae. cylindrica. Our analysis of chloroplast and nuclear microsatellite markers also suggests that D-type plastome and the D genome in Ae. cylindrica were closely related to, and were probably derived from, the tauschii gene pool of Ae. tauschii. A determination of the likely source of the C genome and the C-type plastome in Ae. cylindrica was not possible.
Alu repeats: A source for the genesis of primate microsatellites
DOE Office of Scientific and Technical Information (OSTI.GOV)
Arcot, S.S.; Batzer, M.A.; Wang, Zhenyuan
1995-09-01
As a result of their abundance, relatively uniform distribution, and high degree of polymorphism, microsatellites and minisatellites have become valuable tools in genetic mapping, forensic identity testing, and population studies. In recent years, a number of microsatellite repeats have been found to be associated with Alu interspersed repeated DNA elements. The association of an Alu element with a microsatellite repeat could result from the integration of an Alu element within a preexisting microsatellite repeat. Alternatively, Alu elements could have a direct role in the origin of microsatellite repeats. Errors introduced during reverse transcription of the primary transcript derived from anmore » Alu {open_quotes}master{close_quote} gene or the accumulation of random mutations in the middle A-rich regions and oligo(dA)-rich tails of Alu elements after insertion and subsequent expansion and contraction of these sequences could result in the genesis of a microsatellite repeat. We have tested these hypotheses by a direct evolutionary comparison of the sequences of some recent Alu elements that are found only in humans and are absent from nonhuman primates, as well as some older Alu elements that are present at orthologous positions in a number of nonhuman primates. The origin of {open_quotes}young{close_quotes} Alu insertions, absence of sequences that resemble microsatellite repeats at the orthologous loci in chimpanzees, and the gradual expansion of microsatellite repeats in some old Alu repeats at orthologous positions within the genomes of a number of nonhuman primates suggest that Alu elements are a source for the genesis of primate microsatellite repeats. 48 refs., 5 figs., 3 tabs.« less
Liu, Qiuxu; Qi, Xiao; Yan, Haidong; Huang, Linkai; Nie, Gang; Zhang, Xinquan
2018-01-16
To select the most stable reference genes in annual ryegrass ( Lolium multiflorum ), we studied annual ryegrass leaf tissues exposed to various abiotic stresses by qRT-PCR and selected 11 candidate reference genes, i.e., 18S rRNA, E2, GAPDH, eIF4A, HIS3, SAMDC, TBP-1, Unigene71, Unigene77, Unigene755, and Unigene14912. We then used GeNorm, NormFinder, and BestKeeper to analyze the expression stability of these 11 genes, and used RefFinder to comprehensively rank genes according to stability. Under different stress conditions, the most suitable reference genes for studies of leaf tissues of annual ryegrass were different. The expression of the eIF4A gene was the most stable under drought stress. Under saline-alkali stress, Unigene14912 has the highest expression stability. Under acidic aluminum stress, SAMDC expression stability was highest. Under heavy metal stress, Unigene71 expression had the highest stability. According to the software analyses, Unigene14912, HIS3, and eIF4A were the most suitable for analyses of abiotic stress in tissues of annual ryegrass. GAPDH was the least suitable reference gene. In conclusion, selecting appropriate reference genes under abiotic stress not only improves the accuracy of annual ryegrass gene expression analyses, but also provides a theoretical reference for the development of reference genes in plants of the genus Lolium .
Zhang, X J; Jiang, H Y; Li, L M; Yuan, L H; Chen, J P
2016-06-20
The aim of this study was to provide comprehensive insights into the genetic background of sturgeon by transcriptome study. We performed a de novo assembly of the Amur sturgeon Acipenser schrenckii transcriptome using Illumina Hiseq 2000 sequencing. A total of 148,817 non-redundant unigenes with base length of approximately 121,698,536 bp and ranges from 201 to 26,789 bp were obtained. All the unigenes were classified into 3368 distinct categories and 145,449 singletons by homologous transcript cluster analysis. In all, 46,865 (31.49%) unigenes showed homologous matches with Nr database and 32,214 (21.65%) unigenes were matched to Nt database. In total, 24,862 unigenes were categorized into significantly enriched 52 function groups by GO analysis, and 38,436 unigenes were classified into 25 groups by KOG prediction, as well as 128 enriched KEGG pathways were identified by 45,598 unigenes (P < 0.05). Subsequently, a total of 19,860 SSRs markers were identified with the abundant di-nucleotide type (10,658; 53.67%) and the most AT/TA motif repeats (2689; 13.54%). A total of 1341 conserved lncRNAs were identified by a customized pipeline. Our study provides new sequence and function information for A. schrenckii, which will be the basis for further genetic studies on sturgeon species. The huge number of potential SSRs and putatively conserved lncRNAs isolated by the transcriptome also shed light on research in many fields, including the evolution, conservation management, and biological processes in sturgeon.
2013-01-01
Background Although banana (Musa sp.) is an important edible crop, contributing towards poverty alleviation and food security, limited transcriptome datasets are available for use in accelerated molecular-based breeding in this genus. 454 GS-FLX Titanium technology was employed to determine the sequence of gene transcripts in genotypes of Musa acuminata ssp. burmannicoides Calcutta 4 and M. acuminata subgroup Cavendish cv. Grande Naine, contrasting in resistance to the fungal pathogen Mycosphaerella musicola, causal organism of Sigatoka leaf spot disease. To enrich for transcripts under biotic stress responses, full length-enriched cDNA libraries were prepared from whole plant leaf materials, both uninfected and artificially challenged with pathogen conidiospores. Results The study generated 846,762 high quality sequence reads, with an average length of 334 bp and totalling 283 Mbp. De novo assembly generated 36,384 and 35,269 unigene sequences for M. acuminata Calcutta 4 and Cavendish Grande Naine, respectively. A total of 64.4% of the unigenes were annotated through Basic Local Alignment Search Tool (BLAST) similarity analyses against public databases. Assembled sequences were functionally mapped to Gene Ontology (GO) terms, with unigene functions covering a diverse range of molecular functions, biological processes and cellular components. Genes from a number of defense-related pathways were observed in transcripts from each cDNA library. Over 99% of contig unigenes mapped to exon regions in the reference M. acuminata DH Pahang whole genome sequence. A total of 4068 genic-SSR loci were identified in Calcutta 4 and 4095 in Cavendish Grande Naine. A subset of 95 potential defense-related gene-derived simple sequence repeat (SSR) loci were validated for specific amplification and polymorphism across M. acuminata accessions. Fourteen loci were polymorphic, with alleles per polymorphic locus ranging from 3 to 8 and polymorphism information content ranging from 0.34 to 0.82. Conclusions A large set of unigenes were characterized in this study for both M. acuminata Calcutta 4 and Cavendish Grande Naine, increasing the number of public domain Musa ESTs. This transcriptome is an invaluable resource for furthering our understanding of biological processes elicited during biotic stresses in Musa. Gene-based markers will facilitate molecular breeding strategies, forming the basis of genetic linkage mapping and analysis of quantitative trait loci. PMID:23379821
Li, Fu-Gui; Chen, Jie; Jiang, Xia-Yun; Zou, Shu-Ming
2015-01-01
The blunt snout bream (Megalobrama amblycephala) is an important freshwater aquaculture species, but it is sensitive to hypoxia. No transcriptome data related to growth and hypoxia response are available for this species. In this study, we performed de novo transcriptome sequencing for the liver and gills of the fast-growth family and slow-growth family derived from ‘Pujiang No.1’ F10 blunt snout bream that were under hypoxic stress and normoxia, respectively. The fish were divided into the following 4 groups: fast-growth family under hypoxic stress, FH; slow-growth family under hypoxic stress, SH; fast-growth family under normoxia, FN; and slow-growth family under normoxia, SN. A total of 185 million high-quality reads were obtained from the normalized cDNA of the pooled samples, which were assembled into 465,582 contigs and 237,172 transcripts. A total of 31,338 transcripts from the same locus (unigenes) were annotated and assigned to 104 functional groups, and 23,103 unigenes were classified into seven main categories, including 45 secondary KEGG pathways. A total of 22,255 (71%) known putative unigenes were found to be shared across the genomes of five model fish species and mammals, and a substantial number (9.4%) of potentially novel genes were identified. When 6,639 unigenes were used in the analysis of differential expression (DE) genes, the number of putative DE genes related to growth pathways in FH, SH, SN and FN was 159, 118, 92 and 65 in both the liver and gills, respectively, and the number of DE genes related to hypoxic response was 57, 33, 23 and 21 in FH, FN, SH and SN, respectively. Our results suggest that growth performance of the fast-growth family should be due to complex mutual gene regulatory mechanisms of these putative DE genes between growth and hypoxia. PMID:26554582
Chen, Jie; Tan, Ren-Ke; Guo, Xiao-Juan; Fu, Zheng-Li; Wang, Zheng; Zhang, Zhi-Yan; Tan, Xiao-Li
2015-01-01
Brassica napus seed is a lipid storage organ containing approximately 40% oil, while its leaves contain many kinds of lipids for many biological roles, but the overall amounts are less than in seeds. Thus, lipid biosynthesis in the developing seeds and the leaves is strictly regulated which results the final difference of lipids. However, there are few reports about the molecular mechanism controlling the difference in lipid biosynthesis between developing seeds and leaves. In this study, we tried to uncover this mechanism by analyzing the transcriptome data for lipid biosynthesis. The transcriptome data were de novo assembled and a total of 47216 unigenes were obtained, which had an N50 length and median of 1271 and 755 bp, respectively. Among these unigenes, 36368 (about 77.02%) were annotated and there were 109 up-regulated unigenes and 72 down-regulated unigenes in the developing seeds lipid synthetic pathway after comparing with leaves. In the oleic acid pathway, 23 unigenes were up-regulated and four unigenes were down-regulated. During triacylglycerol (TAG) synthesis, the key unigenes were all up-regulated, such as phosphatidate phosphatase and diacylglycerol O-acyltransferase. During palmitic acid, palmitoleic acid, stearic acid, linoleic acid and linolenic acid synthesis in leaves, the unigenes were nearly all up-regulated, which indicated that the biosynthesis of these particular fatty acids were more important in leaves. In the developing seeds, almost all the unigenes in the ABI3VP1, RKD, CPP, E2F-DP, GRF, JUMONJI, MYB-related, PHD and REM transcript factorfamilies were up-regulated, which helped us to discern the regulation mechanism underlying lipid biosynthesis. The differential up/down-regulation of the genes and TFs involved in lipid biosynthesis in developing seeds and leaves provided direct evidence that allowed us to map the network that regulates lipid biosynthesis, and the identification of new TFs that are up-regulated in developing seeds will help us to further elucidate the lipids biosynthesis pathway in developing seeds and leaves. PMID:25965272
Zhou, Fengyan; Zhang, Yong; Tang, Wei; Wang, Mei; Gao, Tongchun
2017-12-06
Asia minor bluegrass (Polypogon fugax, P. fugax), a weed that is both distributed across China and associated with winter crops, has evolved resistance to acetyl-CoA carboxylase (ACCase) herbicides, but the resistance mechanism remains unclear. The goal of this study was to analyze the transcriptome between resistant and sensitive populations of P. fugax at the flowering stage. Populations resistant and susceptible to clodinafop-propargyl showed distinct transcriptome profiles. A total of 206,041 unigenes were identified; 165,901 unique sequences were annotated using BLASTX alignment databases. Among them, 5904 unigenes were classified into 58 transcription factor families. Nine families were related to the regulation of plant growth and development and to stress responses. Twelve unigenes were differentially expressed between the clodinafop-propargyl-sensitive and clodinafop-propargyl-resistant populations at the early flowering stage; among those unigenes, three belonged to the ABI3VP1, BHLH, and GRAS families, while the remaining nine belonged to the MADS family. Compared with the clodinafop-propargyl-sensitive plants, the resistant plants exhibited different expression pattern of these 12 unigenes. This study identified differentially expressed unigenes related to ACCase-resistant P. fugax and thus provides a genomic resource for understanding the molecular basis of early flowering.
He, Miao; Wang, Ying; Hua, Wenping; Zhang, Yuan; Wang, Zhezhi
2012-01-01
Background Hypericum perforatum L. (St. John’s wort) is a medicinal plant with pharmacological properties that are antidepressant, anti-inflammatory, antiviral, anti-cancer, and antibacterial. Its major active metabolites are hypericins, hyperforins, and melatonin. However, little genetic information is available for this species, especially that concerning the biosynthetic pathways for active ingredients. Methodology/Principal Findings Using de novo transcriptome analysis, we obtained 59,184 unigenes covering the entire life cycle of these plants. In all, 40,813 unigenes (68.86%) were annotated and 2,359 were assigned to secondary metabolic pathways. Among them, 260 unigenes are involved in the production of hypericin, hyperforin, and melatonin. Another 2,291 unigenes are classified as potential Type III polyketide synthase. Our BlastX search against the AGRIS database reveals 1,772 unigenes that are homologous to 47 known Arabidopsis transcription factor families. Further analysis shows that 10.61% (6,277) of these unigenes contain 7,643 SSRs. Conclusion We have identified a set of putative genes involved in several secondary metabolism pathways, especially those related to the synthesis of its active ingredients. Our results will serve as an important platform for public information about gene expression, genomics, and functional genomics in H. perforatum. PMID:22860059
Liu, S; Liu, L; Tang, Y; Xiong, S; Long, J; Liu, Z; Tian, N
2017-07-01
The regulatory mechanism of flavonoids, which synergise anti-malarial and anti-cancer compounds in Artemisia annua, is still unclear. In this study, an anthocyanidin-accumulating mutant callus was induced from A. annua and comparative transcriptomic analysis of wild-type and mutant calli performed, based on the next-generation Illumina/Solexa sequencing platform and de novo assembly. A total of 82,393 unigenes were obtained and 34,764 unigenes were annotated in the public database. Among these, 87 unigenes were assigned to 14 structural genes involved in the flavonoid biosynthetic pathway and 37 unigenes were assigned to 17 structural genes related to metabolism of flavonoids. More than 30 unigenes were assigned to regulatory genes, including R2R3-MYB, bHLH and WD40, which might regulate flavonoid biosynthesis. A further 29 unigenes encoding flavonoid biosynthetic enzymes or transcription factors were up-regulated in the mutant, while 19 unigenes were down-regulated, compared with the wild type. Expression levels of nine genes involved in the flavonoid pathway were compared using semi-quantitative RT-PCR, and results were consistent with comparative transcriptomic analysis. Finally, a putative flavonol synthase gene (AaFLS1) was identified from enzyme assay in vitro and in vivo through heterogeneous expression, and confirmed comparative transcriptomic analysis of wild-type and mutant callus. The present work has provided important target genes for the regulation of flavonoid biosynthesis in A. annua. © 2017 German Botanical Society and The Royal Botanical Society of the Netherlands.
Chen, Jingchao; Huang, Hongjuan; Wei, Shouhui; Huang, Zhaofeng; Wang, Xu; Zhang, Chaoxian
2017-01-01
Glyphosate is an important non-selective herbicide that is in common use worldwide. However, evolved glyphosate-resistant (GR) weeds significantly affect crop yields. Unfortunately, the mechanisms underlying resistance in GR weeds, such as goosegrass (Eleusine indica (L.) Gaertn.), an annual weed found worldwide, have not been fully elucidated. In this study, transcriptome analysis was conducted to further assess the potential mechanisms of glyphosate resistance in goosegrass. The RNA sequencing libraries generated 24 597 462 clean reads. De novo assembly analysis produced 48 852 UniGenes with an average length of 847 bp. All UniGenes were annotated using seven databases. Sixteen candidate differentially expressed genes selected by digital gene expression analysis were validated by quantitative real-time PCR (qRT-PCR). Among these UniGenes, the EPSPS and PFK genes were constitutively up-regulated in resistant (R) individuals and showed a higher copy number than that in susceptible (S) individuals. The expressions of four UniGenes relevant to photosynthesis were inhibited by glyphosate in S individuals, and this toxic response was confirmed by gas exchange analysis. Two UniGenes annotated as glutathione transferase (GST) were constitutively up-regulated in R individuals, and were induced by glyphosate both in R and S. In addition, the GST activities in R individuals were higher than in S. Our research confirmed that two UniGenes (PFK, EPSPS) were strongly associated with target resistance, and two GST-annotated UniGenes may play a role in metabolic glyphosate resistance in goosegrass. © 2016 The Authors The Plant Journal © 2016 John Wiley & Sons Ltd.
Zhu, Haisheng; Liu, Jianting; Wen, Qingfang; Chen, Mindong; Wang, Bin; Zhang, Qianrong; Xue, Zhuzheng
2017-01-01
Fresh-cut luffa (Luffa cylindrica) fruits commonly undergo browning. However, little is known about the molecular mechanisms regulating this process. We used the RNA-seq technique to analyze the transcriptomic changes occurring during the browning of fresh-cut fruits from luffa cultivar 'Fusi-3'. Over 90 million high-quality reads were assembled into 58,073 Unigenes, and 60.86% of these were annotated based on sequences in four public databases. We detected 35,282 Unigenes with significant hits to sequences in the NCBInr database, and 24,427 Unigenes encoded proteins with sequences that were similar to those of known proteins in the Swiss-Prot database. Additionally, 20,546 and 13,021 Unigenes were similar to existing sequences in the Eukaryotic Orthologous Groups of proteins and Kyoto Encyclopedia of Genes and Genomes databases, respectively. Furthermore, 27,301 Unigenes were differentially expressed during the browning of fresh-cut luffa fruits (i.e., after 1-6 h). Moreover, 11 genes from five gene families (i.e., PPO, PAL, POD, CAT, and SOD) identified as potentially associated with enzymatic browning as well as four WRKY transcription factors were observed to be differentially regulated in fresh-cut luffa fruits. With the assistance of rapid amplification of cDNA ends technology, we obtained the full-length sequences of the 15 Unigenes. We also confirmed these Unigenes were expressed by quantitative real-time polymerase chain reaction analysis. This study provides a comprehensive transcriptome sequence resource, and may facilitate further studies aimed at identifying genes affecting luffa fruit browning for the exploitation of the underlying mechanism.
Liang, Li-Song; Ma, Qing-Hua; Chen, Xin; Zong, Jian-Wei; Wang, Gui-Xi
2015-01-01
Plant WRKY transcription factors are known to regulate various biotic and abiotic stress responses. In this study we identified a total of 30 putative WRKY unigenes in a transcriptome dataset of the Chinese wild Hazel, Corylus heterophylla, a species that is noted for its cold tolerance. Thirteen full-length of these ChWRKY genes were cloned and found to encode complete protein sequences, and they were divided into three groups, based on the number of WRKY domains and the pattern of zinc finger structures. Representatives of each of the groups, Unigene25835 (group I), Unigene37641 (group II) and Unigene20441 (group III), were transiently expressed as fusion proteins with yellow fluorescent fusion protein in Nicotiana benthamiana, where they were observed to accumulate in the nucleus, in accordance with their predicted roles as transcriptional activators. An analysis of the expression patterns of all 30 WRKY genes revealed differences in transcript abundance profiles following exposure to cold, drought and high salinity conditions. Among the stress-inducible genes, 23 were up-regulated by all three abiotic stresses and the WRKY genes collectively exhibited four different patterns of expression in flower buds during the overwintering period from November to April. The organ/tissue related expression analysis showed that 18 WRKY genes were highly expressed in stem but only 2 (Unigene9262 and Unigene43101) were greatest in male anthotaxies. The expression of Unigene37641, a member of the group II WRKY genes, was substantially up-regulated by cold, drought and salinity treatments, and its overexpression in Arabidopsis thaliana resulted in better seedling growth, compared with wild type plants, under cold treatment conditions. The transgenic lines also had exhibited higher soluble protein content, superoxide dismutase and peroxidase activiety and lower levels of malondialdehyde, which collectively suggets that Unigene37641 expression promotes cold tolerance. PMID:26270529
Zhao, Tian-Tian; Zhang, Jin; Liang, Li-Song; Ma, Qing-Hua; Chen, Xin; Zong, Jian-Wei; Wang, Gui-Xi
2015-01-01
Plant WRKY transcription factors are known to regulate various biotic and abiotic stress responses. In this study we identified a total of 30 putative WRKY unigenes in a transcriptome dataset of the Chinese wild Hazel, Corylus heterophylla, a species that is noted for its cold tolerance. Thirteen full-length of these ChWRKY genes were cloned and found to encode complete protein sequences, and they were divided into three groups, based on the number of WRKY domains and the pattern of zinc finger structures. Representatives of each of the groups, Unigene25835 (group I), Unigene37641 (group II) and Unigene20441 (group III), were transiently expressed as fusion proteins with yellow fluorescent fusion protein in Nicotiana benthamiana, where they were observed to accumulate in the nucleus, in accordance with their predicted roles as transcriptional activators. An analysis of the expression patterns of all 30 WRKY genes revealed differences in transcript abundance profiles following exposure to cold, drought and high salinity conditions. Among the stress-inducible genes, 23 were up-regulated by all three abiotic stresses and the WRKY genes collectively exhibited four different patterns of expression in flower buds during the overwintering period from November to April. The organ/tissue related expression analysis showed that 18 WRKY genes were highly expressed in stem but only 2 (Unigene9262 and Unigene43101) were greatest in male anthotaxies. The expression of Unigene37641, a member of the group II WRKY genes, was substantially up-regulated by cold, drought and salinity treatments, and its overexpression in Arabidopsis thaliana resulted in better seedling growth, compared with wild type plants, under cold treatment conditions. The transgenic lines also had exhibited higher soluble protein content, superoxide dismutase and peroxidase activiety and lower levels of malondialdehyde, which collectively suggets that Unigene37641 expression promotes cold tolerance.
2013-01-01
Background Litchi (Litchi chinensis Sonn.) is one of the most important fruit trees cultivated in tropical and subtropical areas. However, a lack of transcriptomic and genomic information hinders our understanding of the molecular mechanisms underlying fruit set and fruit development in litchi. Shading during early fruit development decreases fruit growth and induces fruit abscission. Here, high-throughput RNA sequencing (RNA-Seq) was employed for the de novo assembly and characterization of the fruit transcriptome in litchi, and differentially regulated genes, which are responsive to shading, were also investigated using digital transcript abundance(DTA)profiling. Results More than 53 million paired-end reads were generated and assembled into 57,050 unigenes with an average length of 601 bp. These unigenes were annotated by querying against various public databases, with 34,029 unigenes found to be homologous to genes in the NCBI GenBank database and 22,945 unigenes annotated based on known proteins in the Swiss-Prot database. In further orthologous analyses, 5,885 unigenes were assigned with one or more Gene Ontology terms, 10,234 hits were aligned to the 24 Clusters of Orthologous Groups classifications and 15,330 unigenes were classified into 266 Kyoto Encyclopedia of Genes and Genomes pathways. Based on the newly assembled transcriptome, the DTA profiling approach was applied to investigate the differentially expressed genes related to shading stress. A total of 3.6 million and 3.5 million high-quality tags were generated from shaded and non-shaded libraries, respectively. As many as 1,039 unigenes were shown to be significantly differentially regulated. Eleven of the 14 differentially regulated unigenes, which were randomly selected for more detailed expression comparison during the course of shading treatment, were identified as being likely to be involved in the process of fruitlet abscission in litchi. Conclusions The assembled transcriptome of litchi fruit provides a global description of expressed genes in litchi fruit development, and could serve as an ideal repository for future functional characterization of specific genes. The DTA analysis revealed that more than 1000 differentially regulated unigenes respond to the shading signal, some of which might be involved in the fruitlet abscission process in litchi, shedding new light on the molecular mechanisms underlying organ abscission. PMID:23941440
Chen, Shuangyan; Huang, Xin; Yan, Xueqing; Liang, Ye; Wang, Yuezhu; Li, Xiaofeng; Peng, Xianjun; Ma, Xingyong; Zhang, Lexin; Cai, Yueyue; Ma, Tian; Cheng, Liqin; Qi, Dongmei; Zheng, Huajun; Yang, Xiaohan; Li, Xiaoxia; Liu, Gongshe
2013-01-01
Background Sheepgrass [Leymus chinensis (Trin.) Tzvel.] is an important perennial forage grass across the Eurasian Steppe and is known for its adaptability to various environmental conditions. However, insufficient data resources in public databases for sheepgrass limited our understanding of the mechanism of environmental adaptations, gene discovery and molecular marker development. Results The transcriptome of sheepgrass was sequenced using Roche 454 pyrosequencing technology. We assembled 952,328 high-quality reads into 87,214 unigenes, including 32,416 contigs and 54,798 singletons. There were 15,450 contigs over 500 bp in length. BLAST searches of our database against Swiss-Prot and NCBI non-redundant protein sequences (nr) databases resulted in the annotation of 54,584 (62.6%) of the unigenes. Gene Ontology (GO) analysis assigned 89,129 GO term annotations for 17,463 unigenes. We identified 11,675 core Poaceae-specific and 12,811 putative sheepgrass-specific unigenes by BLAST searches against all plant genome and transcriptome databases. A total of 2,979 specific freezing-responsive unigenes were found from this RNAseq dataset. We identified 3,818 EST-SSRs in 3,597 unigenes, and some SSRs contained unigenes that were also candidates for freezing-response genes. Characterizations of nucleotide repeats and dominant motifs of SSRs in sheepgrass were also performed. Similarity and phylogenetic analysis indicated that sheepgrass is closely related to barley and wheat. Conclusions This research has greatly enriched sheepgrass transcriptome resources. The identified stress-related genes will help us to decipher the genetic basis of the environmental and ecological adaptations of this species and will be used to improve wheat and barley crops through hybridization or genetic transformation. The EST-SSRs reported here will be a valuable resource for future gene-phenotype studies and for the molecular breeding of sheepgrass and other Poaceae species. PMID:23861841
Chen, Shuangyan; Huang, Xin; Yan, Xueqing; Liang, Ye; Wang, Yuezhu; Li, Xiaofeng; Peng, Xianjun; Ma, Xingyong; Zhang, Lexin; Cai, Yueyue; Ma, Tian; Cheng, Liqin; Qi, Dongmei; Zheng, Huajun; Yang, Xiaohan; Li, Xiaoxia; Liu, Gongshe
2013-01-01
Sheepgrass [Leymus chinensis (Trin.) Tzvel.] is an important perennial forage grass across the Eurasian Steppe and is known for its adaptability to various environmental conditions. However, insufficient data resources in public databases for sheepgrass limited our understanding of the mechanism of environmental adaptations, gene discovery and molecular marker development. The transcriptome of sheepgrass was sequenced using Roche 454 pyrosequencing technology. We assembled 952,328 high-quality reads into 87,214 unigenes, including 32,416 contigs and 54,798 singletons. There were 15,450 contigs over 500 bp in length. BLAST searches of our database against Swiss-Prot and NCBI non-redundant protein sequences (nr) databases resulted in the annotation of 54,584 (62.6%) of the unigenes. Gene Ontology (GO) analysis assigned 89,129 GO term annotations for 17,463 unigenes. We identified 11,675 core Poaceae-specific and 12,811 putative sheepgrass-specific unigenes by BLAST searches against all plant genome and transcriptome databases. A total of 2,979 specific freezing-responsive unigenes were found from this RNAseq dataset. We identified 3,818 EST-SSRs in 3,597 unigenes, and some SSRs contained unigenes that were also candidates for freezing-response genes. Characterizations of nucleotide repeats and dominant motifs of SSRs in sheepgrass were also performed. Similarity and phylogenetic analysis indicated that sheepgrass is closely related to barley and wheat. This research has greatly enriched sheepgrass transcriptome resources. The identified stress-related genes will help us to decipher the genetic basis of the environmental and ecological adaptations of this species and will be used to improve wheat and barley crops through hybridization or genetic transformation. The EST-SSRs reported here will be a valuable resource for future gene-phenotype studies and for the molecular breeding of sheepgrass and other Poaceae species.
Xianjun, Peng; Linhong, Teng; Xiaoman, Wang; Yucheng, Wang; Shihua, Shen
2014-01-01
The paper mulberry is one of the multifunctional tree species in agroforestry systems and is also commonly utilized in traditional medicine in China and other Asian countries. However, little is known about its molecular genetics, which hinders research on and exploitation of this valuable resource. To discern the correlation between gene expression and the essential properties of the paper mulberry, we performed a transcriptomics analysis, assembling a total of 37,725 unigenes from 54,638,676 reads generated by RNA-seq. Among these, 22,692 unigenes showed greater than 60% similarity with genes from other species. The lengths of 13,566 annotated unigenes were longer than 1,000 bp. Functional clustering analysis with COG (Cluster of Orthologous Groups) revealed that 17,184 unigenes are primarily involved in transcription, translation, signal transduction, carbohydrate metabolism, secondary metabolism, and energy metabolism. GO (Gene Ontology) annotation suggests enrichment of genes encoding antioxidant activity, transporter activity, biosynthesis, metabolism and stress response, with a total of 30,659 unigenes falling in these categories. KEGG (Kyoto Encyclopedia of Genes and Genomes) metabolic pathway analysis showed that 7,199 unigenes are associated with 119 metabolic pathways. In addition to the basic metabolism, these genes are enriched for plant pathogen interaction, flavonoid metabolism and other secondary metabolic processes. Furthermore, differences in the transcriptomes of leaf, stem and root tissues were analyzed and 7,233 specifically expressed unigenes were identified. This global expression analysis provided novel insights about the molecular mechanisms of the biosynthesis of flavonoid, lignin and cellulose, as well as on the response to biotic and abiotic stresses including the remediation of contaminated soil by the paper mulberry.
Dong, Bin; Wu, Bin; Hong, Wenhong; Li, Xiuping; Li, Zhuo; Xue, Li; Huang, Yongfang
2017-01-01
The tea-oil camellia (Camellia oleifera) is the most important oil plant in southern China, and has a strong resistance to drought and barren soil. Understanding the molecular mechanisms of drought tolerance would greatly promote its cultivation and molecular breeding. In total, we obtained 76,585 unigenes with an average length of 810 bp and an N50 of 1,092 bp. We mapped all the unigenes to the NCBI 'nr' (non-redundant), SwissProt, KEGG, and clusters of orthologous groups (COG) databases, where 52,531 (68.6%) unigenes were functionally annotated. According to the annotation, 46,171 (60.8%) unigenes belong to 338 KEGG pathways. We identified a series of unigenes that are related to the synthesis and regulation of abscisic acid (ABA), the activity of protective enzymes, vitamin B6 metabolism, the metabolism of osmolytes, and pathways related to the biosynthesis of secondary metabolites. After exposed to drought for 12 hours, the number of differentially-expressed genes (DEGs) between treated plants and control plants increased in the G4 cultivar, while there was no significant increase in the drought-tolerant C3 cultivar. DEGs associated with drought stress responsive pathways were identified by KEGG pathway enrichment analysis. Moreover, we found 789 DEGs related to transcription factors. Finally, according to the results of qRT-PCR, the expression levels of the 20 unigenes tested were consistent with the results of next-generation sequencing. In the present study, we identified a large set of cDNA unigenes from C. oleifera annotated using public databases. Further studies of DEGs involved in metabolic pathways related to drought stress and transcription will facilitate the discovery of novel genes involved in resistance to drought stress in this commercially important plant.
Wu, Bin; Hong, Wenhong; Li, Xiuping; Li, Zhuo; Xue, Li; Huang, Yongfang
2017-01-01
Background The tea-oil camellia (Camellia oleifera) is the most important oil plant in southern China, and has a strong resistance to drought and barren soil. Understanding the molecular mechanisms of drought tolerance would greatly promote its cultivation and molecular breeding. Results In total, we obtained 76,585 unigenes with an average length of 810 bp and an N50 of 1,092 bp. We mapped all the unigenes to the NCBI ‘nr’ (non-redundant), SwissProt, KEGG, and clusters of orthologous groups (COG) databases, where 52,531 (68.6%) unigenes were functionally annotated. According to the annotation, 46,171 (60.8%) unigenes belong to 338 KEGG pathways. We identified a series of unigenes that are related to the synthesis and regulation of abscisic acid (ABA), the activity of protective enzymes, vitamin B6 metabolism, the metabolism of osmolytes, and pathways related to the biosynthesis of secondary metabolites. After exposed to drought for 12 hours, the number of differentially-expressed genes (DEGs) between treated plants and control plants increased in the G4 cultivar, while there was no significant increase in the drought-tolerant C3 cultivar. DEGs associated with drought stress responsive pathways were identified by KEGG pathway enrichment analysis. Moreover, we found 789 DEGs related to transcription factors. Finally, according to the results of qRT-PCR, the expression levels of the 20 unigenes tested were consistent with the results of next-generation sequencing. Conclusions In the present study, we identified a large set of cDNA unigenes from C. oleifera annotated using public databases. Further studies of DEGs involved in metabolic pathways related to drought stress and transcription will facilitate the discovery of novel genes involved in resistance to drought stress in this commercially important plant. PMID:28759610
Chen, Mindong; Wang, Bin; Zhang, Qianrong; Xue, Zhuzheng
2017-01-01
Fresh-cut luffa (Luffa cylindrica) fruits commonly undergo browning. However, little is known about the molecular mechanisms regulating this process. We used the RNA-seq technique to analyze the transcriptomic changes occurring during the browning of fresh-cut fruits from luffa cultivar ‘Fusi-3’. Over 90 million high-quality reads were assembled into 58,073 Unigenes, and 60.86% of these were annotated based on sequences in four public databases. We detected 35,282 Unigenes with significant hits to sequences in the NCBInr database, and 24,427 Unigenes encoded proteins with sequences that were similar to those of known proteins in the Swiss-Prot database. Additionally, 20,546 and 13,021 Unigenes were similar to existing sequences in the Eukaryotic Orthologous Groups of proteins and Kyoto Encyclopedia of Genes and Genomes databases, respectively. Furthermore, 27,301 Unigenes were differentially expressed during the browning of fresh-cut luffa fruits (i.e., after 1–6 h). Moreover, 11 genes from five gene families (i.e., PPO, PAL, POD, CAT, and SOD) identified as potentially associated with enzymatic browning as well as four WRKY transcription factors were observed to be differentially regulated in fresh-cut luffa fruits. With the assistance of rapid amplification of cDNA ends technology, we obtained the full-length sequences of the 15 Unigenes. We also confirmed these Unigenes were expressed by quantitative real-time polymerase chain reaction analysis. This study provides a comprehensive transcriptome sequence resource, and may facilitate further studies aimed at identifying genes affecting luffa fruit browning for the exploitation of the underlying mechanism. PMID:29145430
Development of 12 genic microsatellite loci for a biofuel grass, Miscanthus sinensis (Poaceae).
Ho, Chuan-Wen; Wu, Tai-Han; Hsu, Tsai-Wen; Huang, Jao-Ching; Huang, Chi-Chun; Chiang, Tzen-Yuh
2011-08-01
Miscanthus, a nonfood plant with high potential as a biofuel, has been used in Europe and the United States. The selection of a cultivar with high biomass, photosynthetic efficiency, and stress resistance from wild populations has become an important issue. New genic microsatellite markers will aid the assessment of genetic diversity for different strains. Twelve polymorphic microsatellite markers derived from the transcriptome of Miscanthus sinensis fo. glaber were identified and screened on 80 individuals of M. sinensis. The number of alleles per locus ranged from 6 to 12, and the mean expected heterozygosity was 0.75. Cross-taxa transferability revealed that all loci can be applied to all varieties of M. sinensis, as well as the closely related species M. floridulus. These new genic microsatellite markers are useful for characterizing different traits in breeding programs or to select genes useful for biofuel.
Baliraine, F N; Bonizzoni, M; Guglielmino, C R; Osir, E O; Lux, S A; Mulaa, F J; Gomulski, L M; Zheng, L; Quilici, S; Gasperi, G; Malacrida, A R
2004-03-01
A set of 10 microsatellite markers was used to survey the levels of genetic variability and to analyse the genetic aspects of the population dynamics of two potentially invasive pest fruit fly species, Ceratitis rosa and C. fasciventris, in Africa. The loci were derived from the closely related species, C. capitata. The degree of microsatellite polymorphism in C. rosa and C. fasciventris was extensive and comparable to that of C. capitata. In C. rosa, the evolution of microsatellite polymorphism in its distribution area reflects the colonization history of this species. The mainland populations are more polymorphic than the island populations. Low levels of differentiation were found within the Africa mainland area, while greater levels of differentiation affect the islands. Ceratitis fasciventris is a central-east African species. The microsatellite data over the Uganda/Kenya spatial scale suggest a recent expansion and possibly continuing gene flow within this area. The microsatellite variability data from C. rosa and C. fasciventris, together with those of C. capitata, support the hypothesis of an east African origin of the Ceratitis spp.
Zhu, W-C; Sun, J-T; Dai, J; Huang, J-R; Chen, L; Hong, X-Y
2017-11-27
Athetis lepigone (Möschler) (Lepidoptera: Noctuidae) is a new outbreak pest in China. Consequently, it is unclear whether the emergence and spread of the outbreak of this pest are triggered by rapid in situ population size increases in each outbreak area, or by immigrants from a potential source area in China. In order to explore the outbreak process of this pest through a population genetics approach, we developed ten novel polymorphic expressed sequence tags (EST)-derived microsatellites. These new microsatellites had moderately high levels of polymorphism in the tested population. The number of alleles per locus ranged from 3 to 19, with an average of 8.6, and the expected heterozygosity ranged from 0.269 to 0.783. A preliminary population genetic analysis using these new microsatellites revealed a lack of population genetic structure in natural populations of A. lepigone. The estimates of recent migration rate revealed strong gene flow among populations. In conclusion, our study developed the first set of EST-microsatellite markers and shed a new light on the population genetic structure of this pest in China.
Characterization of (CA)n microsatellite repeats from large-insert clones.
Litt, M; Browne, D
2001-05-01
The most laborious part of developing (CA)n microsatellite repeats as genetic markers is constructing DNA clones to permit determination of sequences flanking the microsatellites. When cosmids or large-insert phage clones are used as primary sources of (CA)n repeat markers, they have traditionally been subcloned into plasmid vectors such as pUC18 or M13 mp 18/19 cloning vectors to obtain fragments of suitable size for DNA sequencing. This unit presents an alternative approach whereby a set of degenerate sequencing primers that anneal directly to (CA)n microsatellites can be used to determine sequences that are inaccessible with vector-derived primers. Because the primers anneal to the repeat and not to the vector, they can be used with subclones containing inserts of several kilobases and should, in theory, always give sequence in the regions directly flanking the repeat. Degeneracy at the 3 end of each of these primers prevents elongation of primers that have annealed out-of-register. The most laborious part of developing (CA)n microsatellite repeats as genetic markers is constructing DNA clones to permit.
Harmon, Monica; Lane, Thomas; Staton, Margaret; Coggeshall, Mark V; Best, Teodora; Chen, Chien-Chih; Liang, Haiying; Zembower, Nicole; Drautz-Moses, Daniela I; Hwee, Yap Zhei; Schuster, Stephan C; Schlarbaum, Scott E; Carlson, John E; Gailing, Oliver
2017-08-08
Sugar maple (Acer saccharum Marsh.) is a hardwood tree species native to northeastern North America and economically valued for its wood and sap. Yet, few molecular genetic resources have been developed for this species to date. Microsatellite markers have been a useful tool in population genetics, e.g., to monitor genetic variation and to analyze gene flow patterns. The objective of this study is to develop a reference transcriptome and microsatellite markers in sugar maple. A set of 117,861 putative unique transcripts were assembled using 29.2 Gb of RNA sequencing data derived from different tissues and stress treatments. From this set of sequences a total of 1068 microsatellite motifs were identified. Out of 58 genic microsatellite markers tested on a population of 47 sugar maple trees in upper Michigan, 22 amplified well, of which 16 were polymorphic and 6 were monomorphic. Values for expected heterozygosity varied from 0.224 to 0.726 for individual loci. Of the 16 polymorphic markers, 15 exhibited transferability to other Acer L. species. Genic microsatellite markers can be applied to analyze genetic variation in potentially adaptive genes relative to genomic reference markers as a basis for the management of sugar maple genetic resources in the face of climate change.
Wang, Limin; Yang, Haijiao; Liu, Rongning; Fan, Guoqiang
2015-08-01
Toxic metal pollution is a major environmental problem that has received wide attention. Platanus acerifolia (London plane tree) is an important greening tree species that can adapt to environmental pollution. The genetic basis and molecular mechanisms associated with the ability of P. acerifolia to respond lead (Pb) stress have not been reported so far. In this study, 16,246 unigenes differentially expressed unigenes that were obtained from P. acerifolia under Pb stress using next-generation sequencing. Essential pathways such as photosynthesis, and gibberellins and glutathione metabolism were enriched among the differentially expressed unigenes. Furthermore, many important unigenes, including antioxidant enzymes, plants chelate compounds, and metal transporters involved in defense and detoxification mechanisms, were differentially expressed in response to Pb stress. The unigenes encoding the oxygen-evolving enhancer Psb and OEE protein families were downregulated in Pb-stressed plants, implying that oxygen production might decrease in plants under Pb stress. The relationship between gibberellin and P. acerifolia flowering is also discussed. The information and new insights obtained in this study will contribute to further investigations into the molecular regulation mechanisms of Pb accumulation and tolerance in greening tree species.
Lei, Yanyuan; Zhu, Xun; Xie, Wen; Wu, Qingjun; Wang, Shaoli; Guo, Zhaojiang; Xu, Baoyun; Li, Xianchun; Zhou, Xuguo; Zhang, Youjun
2014-01-01
To investigate the response of Plutella xylostella transcriptome in defending against a Bt toxin, high-throughput RNA-sequencing was carried out to examine Cry1Ac-susceptible and -resistant strains. The comparative analysis indentified over 2900 differentially expressed unigenes (DEUs) between these two strains. Gene Ontology analysis placed these unigenes primarily into cell, cell part, organelle, binding, catalytic, cellular process, metabolic process, and response to stimulus categories. Based on pathway analyses, DEUs were enriched in oxidoreductase activity and membrane lipid metabolic processes, and they were also significantly enriched in pathways related to the metabolic and biosynthesis of secondary metabolites. Most of the unigenes involved in the metabolic pathway were up-regulated in resistant strains. Within the ABC transporter pathway, majority of the down-regulated unigenes belong to ABCC2 and ABCC10, respectively, while up-regulated unigenes were mainly categorized as ABCG2. Furthermore, two aminopeptidases, and four cadherins encoding genes were significantly elevated as well. This study provides a transcriptome foundation for the identification and functional characterization of genes involved in the Bt resistance in an agriculturally important insect pest, P. xylostella. © 2013 Elsevier B.V. All rights reserved.
Yu, Ganjun; Wu, Yanfeng; Wang, Wenying; Xu, Jia; Lv, Xiaoping; Cao, Xuetao; Wan, Tao
2018-04-05
PD-1 blockade has demonstrated impressive clinical outcomes in colorectal cancers that have high microsatellite instability. However, the therapeutic efficacy for patients with tumors with low microsatellite instability or stable microsatellites needs further improvement. Here, we have demonstrated that low-dose decitabine could increase the expression of immune-related genes such as major histocompatibility complex genes and cytokine-related genes as well as the number of lymphocytes at the tumor site in CT26 colorectal cancer-bearing mice. A more significant inhibition of tumor growth and a prolongation of survival were observed in the CT26 mouse model after treatment with a combination of PD-1 blockade and decitabine than in mice treated with decitabine or PD-1 blockade alone. The anti-tumor effect of the PD-1 blockade was enhanced by low-dose decitabine. The results of RNA sequencing and whole-genome bisulfite sequencing of decitabine-treated CT26 cells and tumor samples with microsatellite stability from the patient tumor-derived xenograft model have shown that many immune-related genes, including antigen-processing and antigen-presenting genes, were upregulated, whereas the promoter demethylation was downregulated after decitabine exposure. Therefore, decitabine-based tumor microenvironment re-modulation could improve the effect of the PD-1 blockade. The application of decitabine in PD-1 blockade-based immunotherapy may elicit more potent immune responses, which can provide clinical benefits to the colorectal cancer patients with low microsatellite instability or stable microsatellites.
Tang, Cheng; Lan, Daoliang; Zhang, Huanrong; Ma, Jing; Yue, Hua
2013-01-01
Duck is an economically important poultry and animal model for human viral hepatitis B. However, the molecular mechanisms underlying host-virus interaction remain unclear because of limited information on the duck genome. This study aims to characterize the duck normal liver transcriptome and to identify the differentially expressed transcripts at 24 h after duck hepatitis A virus genotype C (DHAV-C) infection using Illumina-Solexa sequencing. After removal of low-quality sequences and assembly, a total of 52,757 unigenes was obtained from the normal liver group. Further blast analysis showed that 18,918 unigenes successfully matched the known genes in the database. GO analysis revealed that 25,116 unigenes took part in 61 categories of biological processes, cellular components, and molecular functions. Among the 25 clusters of orthologous group categories (COG), the cluster for "General function prediction only" represented the largest group, followed by "Transcription" and "Replication, recombination, and repair." KEGG analysis showed that 17,628 unigenes were involved in 301 pathways. Through comparison of normal and infected transcriptome data, we identified 20 significantly differentially expressed unigenes, which were further confirmed by real-time polymerase chain reaction. Of the 20 unigenes, nine matched the known genes in the database, including three up-regulated genes (virus replicase polyprotein, LRRC3B, and PCK1) and six down-regulated genes (CRP, AICL-like 2, L1CAM, CYB26A1, CHAC1, and ADAM32). The remaining 11 novel unigenes that did not match any known genes in the database may provide a basis for the discovery of new transcripts associated with infection. This study provided a gene expression pattern for normal duck liver and for the previously unrecognized changes in gene transcription that are altered during DHAV-C infection. Our data revealed useful information for future studies on the duck genome and provided new insights into the molecular mechanism of host-DHAV-C interaction.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Chen, Shuangyan; Huang, Xin; Yang, Xiaohan
BACKGROUND: Sheepgrass [Leymus chinensis (Trin.) Tzvel.] is an important perennial forage grass across the Eurasian Steppe and is known for its adaptability to various environmental conditions. However, insufficient data resources in public databases for sheepgrass limited our understanding of the mechanism of environmental adaptations, gene discovery and molecular marker development. RESULTS: The transcriptome of sheepgrass was sequenced using Roche 454 pyrosequencing technology. We assembled 952,328 high-quality reads into 87,214 unigenes, including 32,416 contigs and 54,798 singletons. There were 15,450 contigs over 500 bp in length. BLAST searches of our database against Swiss-Prot and NCBI non-redundant protein sequences (nr) databases resultedmore » in the annotation of 54,584 (62.6%) of the unigenes. Gene Ontology (GO) analysis assigned 89,129 GO term annotations for 17,463 unigenes. We identified 11,675 core Poaceae-specific and 12,811 putative sheepgrass-specific unigenes by BLAST searches against all plant genome and transcriptome databases. A total of 2,979 specific freezing-responsive unigenes were found from this RNAseq dataset. We identified 3,818 EST-SSRs in 3,597 unigenes, and some SSRs contained unigenes that were also candidates for freezing-response genes. Characterizations of nucleotide repeats and dominant motifs of SSRs in sheepgrass were also performed. Similarity and phylogenetic analysis indicated that sheepgrass is closely related to barley and wheat. CONCLUSIONS: This research has greatly enriched sheepgrass transcriptome resources. The identified stress-related genes will help us to decipher the genetic basis of the environmental and ecological adaptations of this species and will be used to improve wheat and barley crops through hybridization or genetic transformation. The EST-SSRs reported here will be a valuable resource for future gene-phenotype studies and for the molecular breeding of sheepgrass and other Poaceae species.« less
Long, Yong; Li, Qing; Zhou, Bolan; Song, Guili; Li, Tao; Cui, Zongbin
2013-01-01
Fish skin serves as the first line of defense against a wide variety of chemical, physical and biological stressors. Secretion of mucus is among the most prominent characteristics of fish skin and numerous innate immune factors have been identified in the epidermal mucus. However, molecular mechanisms underlying the mucus secretion and immune activities of fish skin remain largely unclear due to the lack of genomic and transcriptomic data for most economically important fish species. In this study, we characterized the skin transcriptome of mud loach using Illumia paired-end sequencing. A total of 40364 unigenes were assembled from 86.6 million (3.07 gigabases) filtered reads. The mean length, N50 size and maximum length of assembled transcripts were 387, 611 and 8670 bp, respectively. A total of 17336 (43.76%) unigenes were annotated by blast searches against the NCBI non-redundant protein database. Gene ontology mapping assigned a total of 108513 GO terms to 15369 (38.08%) unigenes. KEGG orthology mapping annotated 9337 (23.23%) unigenes. Among the identified KO categories, immune system is the largest category that contains various components of multiple immune pathways such as chemokine signaling, leukocyte transendothelial migration and T cell receptor signaling, suggesting the complexity of immune mechanisms in fish skin. As for mucin biosynthesis, 37 unigenes were mapped to 7 enzymes of the mucin type O-glycan biosynthesis pathway and 8 members of the polypeptide N-acetylgalactosaminyltransferase family were identified. Additionally, 38 unigenes were mapped to 23 factors of the SNARE interactions in vesicular transport pathway, indicating that the activity of this pathway is required for the processes of epidermal mucus storage and release. Moreover, 1754 simple sequence repeats (SSRs) were detected in 1564 unigenes and dinucleotide repeats represented the most abundant type. These findings have laid the foundation for further understanding the secretary processes and immune functions of loach skin mucus. PMID:23437293
Long, Yong; Li, Qing; Zhou, Bolan; Song, Guili; Li, Tao; Cui, Zongbin
2013-01-01
Fish skin serves as the first line of defense against a wide variety of chemical, physical and biological stressors. Secretion of mucus is among the most prominent characteristics of fish skin and numerous innate immune factors have been identified in the epidermal mucus. However, molecular mechanisms underlying the mucus secretion and immune activities of fish skin remain largely unclear due to the lack of genomic and transcriptomic data for most economically important fish species. In this study, we characterized the skin transcriptome of mud loach using Illumia paired-end sequencing. A total of 40364 unigenes were assembled from 86.6 million (3.07 gigabases) filtered reads. The mean length, N50 size and maximum length of assembled transcripts were 387, 611 and 8670 bp, respectively. A total of 17336 (43.76%) unigenes were annotated by blast searches against the NCBI non-redundant protein database. Gene ontology mapping assigned a total of 108513 GO terms to 15369 (38.08%) unigenes. KEGG orthology mapping annotated 9337 (23.23%) unigenes. Among the identified KO categories, immune system is the largest category that contains various components of multiple immune pathways such as chemokine signaling, leukocyte transendothelial migration and T cell receptor signaling, suggesting the complexity of immune mechanisms in fish skin. As for mucin biosynthesis, 37 unigenes were mapped to 7 enzymes of the mucin type O-glycan biosynthesis pathway and 8 members of the polypeptide N-acetylgalactosaminyltransferase family were identified. Additionally, 38 unigenes were mapped to 23 factors of the SNARE interactions in vesicular transport pathway, indicating that the activity of this pathway is required for the processes of epidermal mucus storage and release. Moreover, 1754 simple sequence repeats (SSRs) were detected in 1564 unigenes and dinucleotide repeats represented the most abundant type. These findings have laid the foundation for further understanding the secretary processes and immune functions of loach skin mucus.
Dufresnes, Christophe; Brelsford, Alan; Béziers, Paul; Perrin, Nicolas
2014-07-01
A simple way to quickly optimize microsatellites in nonmodel organisms is to reuse loci available in closely related taxa; however, this approach can be limited by the stochastic and low cross-amplification success experienced in some groups (e.g. amphibians). An efficient alternative is to develop loci from transcriptome sequences. Transcriptomic microsatellites have been found to vary in their levels of cross-species amplification and variability, but this has to date never been tested in amphibians. Here, we compare the patterns of cross-amplification and levels of polymorphism of 18 published anonymous microsatellites isolated from genomic DNA vs. 17 loci derived from a transcriptome, across nine species of tree frogs (Hyla arborea and Hyla cinerea group). We established a clear negative relationship between divergence time and amplification success, which was much steeper for anonymous than transcriptomic markers, with half-lives (time at which 50% of the markers still amplify) of 1.1 and 37 My, respectively. Transcriptomic markers are significantly less polymorphic than anonymous loci, but remain variable across diverged taxa. We conclude that the exploitation of amphibian transcriptomes for developing microsatellites seems an optimal approach for multispecies surveys (e.g. analyses of hybrid zones, comparative linkage mapping), whereas anonymous microsatellites may be more informative for fine-scale analyses of intraspecific variation. Moreover, our results confirm the pattern that microsatellite cross-amplification is greatly variable among amphibians and should be assessed independently within target lineages. Finally, we provide a bank of microsatellites for Palaearctic tree frogs (so far only available for H. arborea), which will be useful for conservation and evolutionary studies in this radiation. © 2013 John Wiley & Sons Ltd.
Gao, Yuan; He, Xiaoli; Wu, Bin; Long, Qiliang; Shao, Tianwei; Wang, Zi; Wei, Jianhe; Li, Yong; Ding, Wanlong
2016-01-01
Panax ginseng C. A. Meyer is a highly valued medicinal plant. Cylindrocarpon destructans is a destructive pathogen that causes root rot and significantly reduces the quality and yield of P. ginseng. However, an efficient method to control root rot remains unavailable because of insufficient understanding of the molecular mechanism underlying C. destructans-P. ginseng interaction. In this study, C. destructans-induced transcriptomes at different time points were investigated using RNA sequencing (RNA-Seq). De novo assembly produced 73,335 unigenes for the P. ginseng transcriptome after C. destructans infection, in which 3,839 unigenes were up-regulated. Notably, the abundance of the up-regulated unigenes sharply increased at 0.5 d postinoculation to provide effector-triggered immunity. In total, 24 of 26 randomly selected unigenes can be validated using quantitative reverse transcription (qRT)-PCR. Gene ontology enrichment analysis of these unigenes showed that "defense response to fungus", "defense response" and "response to stress" were enriched. In addition, differentially expressed transcription factors involved in the hormone signaling pathways after C. destructans infection were identified. Finally, differentially expressed unigenes involved in reactive oxygen species and ginsenoside biosynthetic pathway during C. destructans infection were indentified. To our knowledge, this study is the first to report on the dynamic transcriptome triggered by C. destructans. These results improve our understanding of disease resistance in P. ginseng and provide a useful resource for quick detection of induced markers in P. ginseng before the comprehensive outbreak of this disease caused by C. destructans.
Kuo, Hsiao-Che; Hsu, Hao-Hsuan; Chua, Chee Shin; Wang, Ting-Yu; Chen, Young-Mao; Chen, Tzong-Yueh
2014-04-30
Most giant groupers in the market are derived from inbred stock. Inbreeding can cause trait depression, compromising the animals' fitness and disease resistance, obligating farmers to apply increased amounts of drugs. In order to solve this problem, a pedigree classification method is needed. Here, microsatellite and mitochondrial DNA were used as genetic markers to analyze the genetic relationships among giant grouper broodstocks. The 776-bp fragment of high polymorphic mitochondrial D-loop sequence was selected for measuring sibling relatedness. In a sample of 118 giant groupers, 42 haplotypes were categorized, with nucleotide diversity (π) of 0.00773 and haplotype diversity (HD) of 0.983. Furthermore, microsatellites were used for investigation of parentage. Six out of 33 microsatellite loci were selected as markers based on having a high number of alleles and compliance with Hardy-Weinberg equilibrium. Microsatellite profiles based on these loci provide high variability with low combined non-exclusion probability, permitting practical use in aquaculture. The method described here could be used to improve grouper broodstock management and lower the chances of inbreeding. This approach is expected to lead to production of higher quality groupers with higher disease resistance, thereby reducing the need for drug application.
Francisco, Flávio de O; Brito, Rute M; Arias, Maria C
2006-01-01
In the present study we compare genetic characteristics (allele diversity and observed heterozygosity) of microsatellite loci, from three stingless bee species (Plebeia remota Holmberg, Partamona mulata Moure In Camargo and Partamona helleri Friese), amplified by using heterospecific primers originally designed for Melipona bicolor Lepeletier and Scaptotrigona postica Latreille. We analyzed 360 individuals of P. remota from 72 nests, 58 individuals of R. mulata from 58 nests, and 47 individuals of P. helleri from 47 nests. The three species studied showed low level of polymorphism for the loci amplified with primers derived from M. bicolor. However, for the loci amplified with primers derived from S. postica, only P. remota presented low level of polymorphism.
Qiu, Fan; Kitchen, Andrew; Beerli, Peter; Miyamoto, Michael M
2013-02-01
A recent study using both mitochondrial DNA (mtDNA) and microsatellite data reported on a population size discrepancy in the eastern tiger salamander where the effective population size (N(e)) estimate of the former exceeded that of the latter. That study suggested, among other hypotheses, that homoplasy of microsatellite alleles is responsible for the discrepancy. In this investigation, we report 10 new cases of a similar discrepancy in five species of tuna. These cases derive from our Bayesian inferences using data from Pacific Bluefin Tuna (Thunnus orientalis) and Yellowfin Tuna (Thunnus albacares), as well as from published estimates of genetic diversity for additional populations of Yellowfin Tuna and three other tuna species. Phylogenetic character analyses of inferred genealogies of Pacific Bluefin and Yellowfin Tuna reveal similar reduced levels of mtDNA and microsatellite homoplasy. Thus, the discrepancy between inferred population sizes from mtDNA and microsatellite data in tuna is most likely not an artifact of the chosen mutation models used in the microsatellite analyses, but may reflect behavioral differences between the sexes such as female-biased philopatry and male-biased dispersal. This explanation now warrants critical testing with more local populations of tuna and with other animal and plant groups that have different life histories. Copyright © 2012 Elsevier Inc. All rights reserved.
Pajuelo, Mónica J; Eguiluz, María; Dahlstrom, Eric; Requena, David; Guzmán, Frank; Ramirez, Manuel; Sheen, Patricia; Frace, Michael; Sammons, Scott; Cama, Vitaliano; Anzick, Sarah; Bruno, Dan; Mahanty, Siddhartha; Wilkins, Patricia; Nash, Theodore; Gonzalez, Armando; García, Héctor H; Gilman, Robert H; Porcella, Steve; Zimic, Mirko
2015-12-01
Infections with Taenia solium are the most common cause of adult acquired seizures worldwide, and are the leading cause of epilepsy in developing countries. A better understanding of the genetic diversity of T. solium will improve parasite diagnostics and transmission pathways in endemic areas thereby facilitating the design of future control measures and interventions. Microsatellite markers are useful genome features, which enable strain typing and identification in complex pathogen genomes. Here we describe microsatellite identification and characterization in T. solium, providing information that will assist in global efforts to control this important pathogen. For genome sequencing, T. solium cysts and proglottids were collected from Huancayo and Puno in Peru, respectively. Using next generation sequencing (NGS) and de novo assembly, we assembled two draft genomes and one hybrid genome. Microsatellite sequences were identified and 36 of them were selected for further analysis. Twenty T. solium isolates were collected from Tumbes in the northern region, and twenty from Puno in the southern region of Peru. The size-polymorphism of the selected microsatellites was determined with multi-capillary electrophoresis. We analyzed the association between microsatellite polymorphism and the geographic origin of the samples. The predicted size of the hybrid (proglottid genome combined with cyst genome) T. solium genome was 111 MB with a GC content of 42.54%. A total of 7,979 contigs (>1,000 nt) were obtained. We identified 9,129 microsatellites in the Puno-proglottid genome and 9,936 in the Huancayo-cyst genome, with 5 or more repeats, ranging from mono- to hexa-nucleotide. Seven microsatellites were polymorphic and 29 were monomorphic within the analyzed isolates. T. solium tapeworms were classified into two genetic groups that correlated with the North/South geographic origin of the parasites. The availability of draft genomes for T. solium represents a significant step towards the understanding the biology of the parasite. We report here a set of T. solium polymorphic microsatellite markers that appear promising for genetic epidemiology studies.
Li, Jingtao; Sun, Xinhua; Yu, Gang; Jia, Chengguo; Liu, Jinliang; Pan, Hongyu
2014-01-01
Little information is available on gene expression profiling of halophyte A. canescens. To elucidate the molecular mechanism for stress tolerance in A. canescens, a full-length complementary DNA library was generated from A. canescens exposed to 400 mM NaCl, and provided 343 high-quality ESTs. In an evaluation of 343 valid EST sequences in the cDNA library, 197 unigenes were assembled, among which 190 unigenes (83.1% ESTs) were identified according to their significant similarities with proteins of known functions. All the 343 EST sequences have been deposited in the dbEST GenBank under accession numbers JZ535802 to JZ536144. According to Arabidopsis MIPS functional category and GO classifications, we identified 193 unigenes of the 311 annotations EST, representing 72 non-redundant unigenes sharing similarities with genes related to the defense response. The sets of ESTs obtained provide a rich genetic resource and 17 up-regulated genes related to salt stress resistance were identified by qRT-PCR. Six of these genes may contribute crucially to earlier and later stage salt stress resistance. Additionally, among the 343 unigenes sequences, 22 simple sequence repeats (SSRs) were also identified contributing to the study of A. canescens resources. PMID:24960361
Genes expressed during the development and ripening of watermelon fruit.
Levi, A; Davis, A; Hernandez, A; Wechter, P; Thimmapuram, J; Trebitsh, T; Tadmor, Y; Katzir, N; Portnoy, V; King, S
2006-11-01
A normalized cDNA library was constructed using watermelon flesh mRNA from three distinct developmental time-points and was subtracted by hybridization with leaf cDNA. Random cDNA clones of the watermelon flesh subtraction library were sequenced from the 5' end in order to identify potentially informative genes associated with fruit setting, development, and ripening. One-thousand and forty-six 5'-end sequences (expressed sequence tags; ESTs) were assembled into 832 non-redundant sequences, designated as "EST-unigenes". Of these 832 "EST-unigenes", 254 ( approximately 30%) have no significant homology to sequences published so far for other plant species. Additionally, 168 "EST-unigenes" ( approximately 20%) correspond to genes with unknown function, whereas 410 "EST-unigenes" ( approximately 50%) correspond to genes with known function in other plant species. These "EST-unigenes" are mainly associated with metabolism, membrane transport, cytoskeleton synthesis and structure, cell wall formation and cell division, signal transduction, nucleic acid binding and transcription factors, defense and stress response, and secondary metabolism. This study provides the scientific community with novel genetic information for watermelon as well as an expanded pool of genes associated with fruit development in watermelon. These genes will be useful targets in future genetic and functional genomic studies of watermelon and its development.
Chen, Chengyu; Wang, Cuicui; Liu, Ying; Shi, Xueyan; Gao, Xiwu
2018-02-07
Pesticide tolerance poses many challenges for pest control, particularly for destructive pests such as Bradysia odoriphaga. Imidacloprid has been used to control B. odoriphaga since 2013, however, imidacloprid resistance in B. odoriphaga has developed in recent years. Identifying actual and potential genes involved in detoxification metabolism of imidacloprid could offer solutions for controlling this insect. In this study, RNA-seq was used to explore differentially expressed genes in B. odoriphaga that respond to imidacloprid treatment. Differential expression data between imidacloprid treatment and the control revealed 281 transcripts (176 with annotations) showing upregulation and 394 transcripts (235 with annotations) showing downregulation. Among them, differential expression levels of seven P450 unigenes were associated with imidacloprid detoxification mechanism, with 4 unigenes that were upregulated and 3 unigenes that were downregulated. The qRT-PCR results of the seven differential expression P450 unigenes after imidacloprid treatment were consistent with RNA-Seq data. Furthermore, oral delivery mediated RNA interference of these four upregulated P450 unigenes followed by an insecticide bioassay significantly increased the mortality of imidacloprid-treated B. odoriphaga. This result indicated that the four upregulated P450s are involved in detoxification of imidacloprid. This study provides a genetic basis for further exploring P450 genes for imidacloprid detoxification in B. odoriphaga.
Wang, Chongbin; Zou, Tonglei; Xu, Nianjun; Sun, Xue
2017-01-01
Culturing the economically important macroalga Gracilariopsis lemaneiformis (Rhodophyta) is limited due to the high temperatures in the summertime on the southern Chinese coast. Previous studies have demonstrated that two phytohormones, salicylic acid (SA) and methyl jasmonate (MJ), can alleviate the adverse effects of high-temperature stress on Gp. lemaneiformis. To elucidate the molecular mechanisms underlying SA- and MJ-mediated heat tolerance, we performed comprehensive analyses of transcriptome-wide gene expression profiles using RNA sequencing (RNA-seq) technology. A total of 14,644 unigenes were assembled, and 10,501 unigenes (71.71%) were annotated to the reference databases. In the SA, MJ and SA/MJ treatment groups, 519, 830, and 974 differentially expressed unigenes were detected, respectively. Unigenes related to photosynthesis and glycometabolism were enriched by SA, while unigenes associated with glycometabolism, protein synthesis, heat shock and signal transduction were increased by MJ. A crosstalk analysis revealed that 216 genes were synergistically regulated, while 18 genes were antagonistically regulated by SA and MJ. The results indicated that the two phytohormones could mitigate the adverse effects of heat on multiple pathways, and they predominantly acted synergistically to resist heat stress. These results will provide new insights into how SA and MJ modulate the molecular mechanisms that counteract heat stress in algae. PMID:28464018
Evidence of Genetic Differentiation for Hawaii Insular False Killer Whales (Pseudorca crassidens)
2010-05-01
derived from beluga whales ( Delphinapterus leucas ) (Buchanan et al., 1996), EV94t derived from humpback whales (Megaptera novaenglia) (Valsecchi and Amos...Buchanan, F. C., Friesen, M. K., Littlejohn, R. P., and Clayton, J. W. 1996. Microsatellites from the beluga whale Delphinapterus leucas . Mol. Ecol. 5:571
Wang, Yanjie; Dong, Chunlan; Xue, Zeyun; Jin, Qijiang; Xu, Yingchun
2016-01-15
Paeonia ostii, an important ornamental and medicinal plant, grows normally on copper (Cu) mines with widespread Cu contamination of soils, and it has the ability to lower Cu contents in the Cu-contaminated soils. However, very little molecular information concerned with Cu resistance of P. ostii is available. In this study, high-throughput de novo transcriptome sequencing was carried out for P. ostii with and without Cu treatment using Illumina HiSeq 2000 platform. A total of 77,704 All-unigenes were obtained with a mean length of 710 bp. Of these unigenes, 47,461 were annotated with public databases based on sequence similarities. Comparative transcript profiling allowed the discovery of 4324 differentially expressed genes (DEGs), with 2207 up-regulated and 2117 down-regulated unigenes in Cu-treated library as compared to the control counterpart. Based on these DEGs, Gene Ontology (GO) enrichment analysis indicated Cu stress-relevant terms, such as 'membrane' and 'antioxidant activity'. Meanwhile, Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analysis uncovered some important pathways, including 'biosynthesis of secondary metabolites' and 'metabolic pathways'. In addition, expression patterns of 12 selected DEGs derived from quantitative real-time polymerase chain reaction (qRT-PCR) were consistent with their transcript abundance changes obtained by transcriptomic analyses, suggesting that all the 12 genes were authentically involved in Cu tolerance in P. ostii. This is the first report to identify genes related to Cu stress responses in P. ostii, which could offer valuable information on the molecular mechanisms of Cu resistance, and provide a basis for further genomics research on this and related ornamental species for phytoremediation. Copyright © 2015 Elsevier B.V. All rights reserved.
Gardiner, Jack; Schroeder, Steven; Polacco, Mary L.; Sanchez-Villeda, Hector; Fang, Zhiwei; Morgante, Michele; Landewe, Tim; Fengler, Kevin; Useche, Francisco; Hanafey, Michael; Tingey, Scott; Chou, Hugh; Wing, Rod; Soderlund, Carol; Coe, Edward H.
2004-01-01
Our goal is to construct a robust physical map for maize (Zea mays) comprehensively integrated with the genetic map. We have used a two-dimensional 24 × 24 overgo pooling strategy to anchor maize expressed sequence tagged (EST) unigenes to 165,888 bacterial artificial chromosomes (BACs) on high-density filters. A set of 70,716 public maize ESTs seeded derivation of 10,723 EST unigene assemblies. From these assemblies, 10,642 overgo sequences of 40 bp were applied as hybridization probes. BAC addresses were obtained for 9,371 overgo probes, representing an 88% success rate. More than 96% of the successful overgo probes identified two or more BACs, while 5% identified more than 50 BACs. The majority of BACs identified (79%) were hybridized with one or two overgos. A small number of BACs hybridized with eight or more overgos, suggesting that these BACs must be gene rich. Approximately 5,670 overgos identified BACs assembled within one contig, indicating that these probes are highly locus specific. A total of 1,795 megabases (Mb; 87%) of the total 2,050 Mb in BAC contigs were associated with one or more overgos, which are serving as sequence-tagged sites for single nucleotide polymorphism development. Overgo density ranged from less than one overgo per megabase to greater than 20 overgos per megabase. The majority of contigs (52%) hit by overgos contained three to nine overgos per megabase. Analysis of approximately 1,022 Mb of genetically anchored BAC contigs indicates that 9,003 of the total 13,900 overgo-contig sites are genetically anchored. Our results indicate overgos are a powerful approach for generating gene-specific hybridization probes that are facilitating the assembly of an integrated genetic and physical map for maize. PMID:15020742
Zhuang, Yongbin; Tripp, Erin A
2017-01-17
New combinations of divergent genomes can give rise to novel genetic functions in resulting hybrid progeny. Such functions may yield opportunities for ecological divergence, contributing ultimately to reproductive isolation and evolutionary longevity of nascent hybrid lineages. In plants, the degree to which transgressive genotypes contribute to floral novelty remains a question of key interest. Here, we generated an F 1 hybrid plant between the red-flowered Ruellia elegans and yellow flowered R. speciosa. RNA-seq technology was used to explore differential gene expression between the hybrid and its two parents, with emphasis on genetic elements involved in the production of floral anthocyanin pigments. The hybrid was purple flowered and produced novel floral delphinidin pigments not manufactured by either parent. We found that nearly a fifth of all 86,475 unigenes expressed were unique to the hybrid. The majority of hybrid unigenes (80.97%) showed a pattern of complete dominance to one parent or the other although this ratio was uneven, suggesting asymmetrical influence of parental genomes on the progeny transcriptome. However, 8.87% of all transcripts within the hybrid were expressed at significantly higher or lower mean levels than observed for either parent. A total of 28 unigenes coding putatively for eight core enzymes in the anthocyanin pathway were recovered, along with three candidate MYBs involved in anthocyanin regulation. Our results suggest that models of gene evolution that explain phenotypic novelty and hybrid establishment in plants may need to include transgressive effects. Additionally, our results lend insight into the potential for floral novelty that derives from unions of divergent genomes. These findings serve as a starting point to further investigate molecular mechanisms involved in flower color transitions in Ruellia.
Brondani, Rosana PV; Williams, Emlyn R; Brondani, Claudio; Grattapaglia, Dario
2006-01-01
Background Eucalypts are the most widely planted hardwood trees in the world occupying globally more than 18 million hectares as an important source of carbon neutral renewable energy and raw material for pulp, paper and solid wood. Quantitative Trait Loci (QTLs) in Eucalyptus have been localized on pedigree-specific RAPD or AFLP maps seriously limiting the value of such QTL mapping efforts for molecular breeding. The availability of a genus-wide genetic map with transferable microsatellite markers has become a must for the effective advancement of genomic undertakings. This report describes the development of a novel set of 230 EMBRA microsatellites, the construction of the first comprehensive microsatellite-based consensus linkage map for Eucalyptus and the consolidation of existing linkage information for other microsatellites and candidate genes mapped in other species of the genus. Results The consensus map covers ~90% of the recombining genome of Eucalyptus, involves 234 mapped EMBRA loci on 11 linkage groups, an observed length of 1,568 cM and a mean distance between markers of 8.4 cM. A compilation of all microsatellite linkage information published in Eucalyptus allowed us to establish the homology among linkage groups between this consensus map and other maps published for E. globulus. Comparative mapping analyses also resulted in the linkage group assignment of other 41 microsatellites derived from other Eucalyptus species as well as candidate genes and QTLs for wood and flowering traits published in the literature. This report significantly increases the availability of microsatellite markers and mapping information for species of Eucalyptus and corroborates the high conservation of microsatellite flanking sequences and locus ordering between species of the genus. Conclusion This work represents an important step forward for Eucalyptus comparative genomics, opening stimulating perspectives for evolutionary studies and molecular breeding applications. The generalized use of an increasingly larger set of interspecific transferable markers and consensus mapping information, will allow faster and more detailed investigations of QTL synteny among species, validation of expression-QTL across variable genetic backgrounds and positioning of a growing number of candidate genes co-localized with QTLs, to be tested in association mapping experiments. PMID:16995939
Yamamoto, Naoki; Takano, Tomoyuki; Tanaka, Keisuke; Ishige, Taichiro; Terashima, Shin; Endo, Chisato; Kurusu, Takamitsu; Yajima, Shunsuke; Yano, Kentaro; Tada, Yuichi
2015-01-01
The turf grass Sporobolus virginicus is halophyte and has high salinity tolerance. To investigate the molecular basis of its remarkable tolerance, we performed Illumina high-throughput RNA sequencing on roots and shoots of a S. virginicus genotype under normal and saline conditions. The 130 million short reads were assembled into 444,242 unigenes. A comparative analysis of the transcriptome with rice and Arabidopsis transcriptome revealed six turf grass-specific unigenes encoding transcription factors. Interestingly, all of them showed root specific expression and five of them encode bZIP type transcription factors. Another remarkable transcriptional feature of S. virginicus was activation of specific pathways under salinity stress. Pathway enrichment analysis suggested transcriptional activation of amino acid, pyruvate, and phospholipid metabolism. Up-regulation of several unigenes, previously shown to respond to salt stress in other halophytes was also observed. Gene Ontology enrichment analysis revealed that unigenes assigned as proteins in response to water stress, such as dehydrin and aquaporin, and transporters such as cation, amino acid, and citrate transporters, and H+-ATPase, were up-regulated in both shoots and roots under salinity. A correspondence analysis of the enriched pathways in turf grass cells, but not in rice cells, revealed two groups of unigenes similarly up-regulated in the turf grass in response to salt stress; one of the groups, showing excessive up-regulation under salinity, included unigenes homologos to salinity responsive genes in other halophytes. Thus, the present study identified candidate genes involved in salt tolerance of S. virginicus. This genetic resource should be valuable for understanding the mechanisms underlying high salt tolerance in S. virginicus. This information can also provide insight into salt tolerance in other halophytes. PMID:25954282
Rai, Amit; Kamochi, Hidetaka; Suzuki, Hideyuki; Nakamura, Michimi; Takahashi, Hiroki; Hatada, Tomoki; Saito, Kazuki; Yamazaki, Mami
2017-01-01
Lonicera japonica is one of the most important medicinal plants with applications in traditional Chinese and Japanese medicine for thousands of years. Extensive studies on the constituents of L. japonica extracts have revealed an accumulation of pharmaceutically active metabolite classes, such as chlorogenic acid, luteolin and other flavonoids, and secoiridoids, which impart characteristic medicinal properties. Despite being a rich source of pharmaceutically active metabolites, little is known about the biosynthetic enzymes involved, and their expression profile across different tissues of L. japonica. In this study, we performed de novo transcriptome assembly for L. japonica, representing transcripts from nine different tissues. A total of 22 Gbps clean RNA-seq reads from nine tissues of L. japonica were used, resulting in 243,185 unigenes, with 99,938 unigenes annotated based on a homology search using blastx against the NCBI-nr protein database. Unsupervised principal component analysis and correlation studies using transcript expression data from all nine tissues of L. japonica showed relationships between tissues, explaining their association at different developmental stages. Homologs for all genes associated with chlorogenic acid, luteolin, and secoiridoid biosynthesis pathways were identified in the L. japonica transcriptome assembly. Expression of unigenes associated with chlorogenic acid was enriched in stems and leaf-2, unigenes from luteolin were enriched in stems and flowers, while unigenes from secoiridoid metabolic pathways were enriched in leaf-1 and shoot apex. Our results showed that different tissues of L. japonica are enriched with sets of unigenes associated with specific pharmaceutically important metabolic pathways and, therefore, possess unique medicinal properties. The present study will serve as a resource for future attempts for functional characterization of enzyme coding genes within key metabolic processes.
Lee, Xuezhu; Yi, Yang; Weng, Shaoping; Zeng, Jie; Zhang, Hetong; He, Jianguo; Dong, Chuanfu
2016-02-01
Cyprinid Herpesvirus 3 (CyHV-3) can infect and specifically cause a huge economic loss in both common carp (Cyprinus carpio) and its ornamental koi variety. The molecular mechanisms underlying CyHV-3 infection are not well understood. In this study, koi spleen tissues of both mock and CyHV-3 infection groups were collected, and high-throughput sequencing technology was used to analyze the differentially expressed genes (DEGs) at the transcriptome level. A total of 105,356,188 clean reads from two libraries were obtained. After the de novo assembly of the transcripts, 129,314 unigenes were generated. Of these unigenes, 70,655 unigenes were matched to the known proteins in the database, while 2190 unigenes were predicted by ESTScan software. Comparing the infection group to the mock group, a total of 23,029 significantly differentially expressed unigenes were identified, including 10,493 up-regulated DEGs and 12,536 down-regulated DEGs. GO (Gene Ontology) annotation and functional enrichment analysis indicated that all of the DEGs were annotated into GO terms in three main GO categories: biological process, cellular component and molecular function. KEGG (Kyoto Encyclopedia of Genes and Genomes) analysis of the DEGs showed that a total of 12,002 DEG unigenes were annotated into 256 pathways classified into 6 main categories. Additionally, 20 differentially expressed genes were validated by quantitative real-time PCR. As the first report of a transcriptome analysis of koi carp with CyHV-3 infection, the data presented here provide knowledge of the innate immune response against CyHV-3 in koi carp and useful data for further research of the molecular mechanism of CyHV-3 infection. Copyright © 2015 Elsevier Ltd. All rights reserved.
Lin, Qiang; Luo, Wei; Wan, Shiming; Gao, Zexia
2016-01-01
Seahorse conservation has been performed utilizing various strategies for many decades, and the deeper understanding of genomic information is necessary to more efficiently protect the germplasm resources of seahorse species. However, little genetic information about seahorses currently exists in the public databases. In this study, high-throughput RNA sequencing for two seahorse species, Hippocampus erectus and H. mohnikei, was carried out, and de novo assembly generated 37,506 unigenes for H. erectus and 36,113 unigenes for H. mohnikei. Among them, 17,338 (46.23%) unigenes for H. erectus and 17,900 (49.57%) for H. mohnikei were successfully annotated based on the information available from the public databases. Through comparing the unigenes of two seahorse species, 7,802 candidate orthologous genes were identified and 5,268 genes among them could be annotated. In addition, gene ontology analysis of two species was similarly performed on biological processes, cellular components, and molecular functions. Twenty-four and twenty-one unigenes in H. erectus and H. mohnikei were annotated in the biosynthesis of unsaturated fatty acids pathways, and both seahorses lacked the Δ12 and Δ15 desaturases. Total of 8,992 and 9,116 SSR loci were obtained from H. erectus and H. mohnikei unigenes, respectively. Dozens of SSR were developed and then applied to assess the population genetic diversity, as well as cross-amplified in a related species, H. trimaculatus. The HO and HE values of the tested populations for H. erectus, H. mohnikei, and H. trimaculatus were medium. These resources would facilitate the conservation of the species through a better understanding of the genomics and comparative genome analysis within the Hippocampus genus. PMID:27128031
Lin, Qiang; Luo, Wei; Wan, Shiming; Gao, Zexia
2016-01-01
Seahorse conservation has been performed utilizing various strategies for many decades, and the deeper understanding of genomic information is necessary to more efficiently protect the germplasm resources of seahorse species. However, little genetic information about seahorses currently exists in the public databases. In this study, high-throughput RNA sequencing for two seahorse species, Hippocampus erectus and H. mohnikei, was carried out, and de novo assembly generated 37,506 unigenes for H. erectus and 36,113 unigenes for H. mohnikei. Among them, 17,338 (46.23%) unigenes for H. erectus and 17,900 (49.57%) for H. mohnikei were successfully annotated based on the information available from the public databases. Through comparing the unigenes of two seahorse species, 7,802 candidate orthologous genes were identified and 5,268 genes among them could be annotated. In addition, gene ontology analysis of two species was similarly performed on biological processes, cellular components, and molecular functions. Twenty-four and twenty-one unigenes in H. erectus and H. mohnikei were annotated in the biosynthesis of unsaturated fatty acids pathways, and both seahorses lacked the Δ12 and Δ15 desaturases. Total of 8,992 and 9,116 SSR loci were obtained from H. erectus and H. mohnikei unigenes, respectively. Dozens of SSR were developed and then applied to assess the population genetic diversity, as well as cross-amplified in a related species, H. trimaculatus. The HO and HE values of the tested populations for H. erectus, H. mohnikei, and H. trimaculatus were medium. These resources would facilitate the conservation of the species through a better understanding of the genomics and comparative genome analysis within the Hippocampus genus.
Zhang, Jianxia; He, Chunmei; Wu, Kunlin; Teixeira da Silva, Jaime A.; Zeng, Songjun; Zhang, Xinhua; Yu, Zhenming; Xia, Haoqiang; Duan, Jun
2016-01-01
Dendrobium officinale is one of the most important Chinese medicinal herbs. Polysaccharides are one of the main active ingredients of D. officinale. To identify the genes that maybe related to polysaccharides synthesis, two cDNA libraries were prepared from juvenile and adult D. officinale, and were named Dendrobium-1 and Dendrobium-2, respectively. Illumina sequencing for Dendrobium-1 generated 102 million high quality reads that were assembled into 93,881 unigenes with an average sequence length of 790 base pairs. The sequencing for Dendrobium-2 generated 86 million reads that were assembled into 114,098 unigenes with an average sequence length of 695 base pairs. Two transcriptome databases were integrated and assembled into a total of 145,791 unigenes. Among them, 17,281 unigenes were assigned to 126 KEGG pathways while 135 unigenes were involved in fructose and mannose metabolism. Gene Ontology analysis revealed that the majority of genes were associated with metabolic and cellular processes. Furthermore, 430 glycosyltransferase and 89 cellulose synthase genes were identified. Comparative analysis of both transcriptome databases revealed a total of 32,794 differential expression genes (DEGs), including 22,051 up-regulated and 10,743 down-regulated genes in Dendrobium-2 compared to Dendrobium-1. Furthermore, a total of 1142 and 7918 unigenes showed unique expression in Dendrobium-1 and Dendrobium-2, respectively. These DEGs were mainly correlated with metabolic pathways and the biosynthesis of secondary metabolites. In addition, 170 DEGs belonged to glycosyltransferase genes, 37 DEGs were related to cellulose synthase genes and 627 DEGs encoded transcription factors. This study substantially expands the transcriptome information for D. officinale and provides valuable clues for identifying candidate genes involved in polysaccharide biosynthesis and elucidating the mechanism of polysaccharide biosynthesis. PMID:26904032
Analysis of Litopenaeus vannamei Transcriptome Using the Next-Generation DNA Sequencing Technique
Li, Chaozheng; Weng, Shaoping; Chen, Yonggui; Yu, Xiaoqiang; Lü, Ling; Zhang, Haiqing; He, Jianguo; Xu, Xiaopeng
2012-01-01
Background Pacific white shrimp (Litopenaeus vannamei), the major species of farmed shrimps in the world, has been attracting extensive studies, which require more and more genome background knowledge. The now available transcriptome data of L. vannamei are insufficient for research requirements, and have not been adequately assembled and annotated. Methodology/Principal Findings This is the first study that used a next-generation high-throughput DNA sequencing technique, the Solexa/Illumina GA II method, to analyze the transcriptome from whole bodies of L. vannamei larvae. More than 2.4 Gb of raw data were generated, and 109,169 unigenes with a mean length of 396 bp were assembled using the SOAP denovo software. 73,505 unigenes (>200 bp) with good quality sequences were selected and subjected to annotation analysis, among which 37.80% can be matched in NCBI Nr database, 37.3% matched in Swissprot, and 44.1% matched in TrEMBL. Using BLAST and BLAST2Go softwares, 11,153 unigenes were classified into 25 Clusters of Orthologous Groups of proteins (COG) categories, 8171 unigenes were assigned into 51 Gene ontology (GO) functional groups, and 18,154 unigenes were divided into 220 Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways. To primarily verify part of the results of assembly and annotations, 12 assembled unigenes that are homologous to many embryo development-related genes were chosen and subjected to RT-PCR for electrophoresis and Sanger sequencing analyses, and to real-time PCR for expression profile analyses during embryo development. Conclusions/Significance The L. vannamei transcriptome analyzed using the next-generation sequencing technique enriches the information of L. vannamei genes, which will facilitate our understanding of the genome background of crustaceans, and promote the studies on L. vannamei. PMID:23071809
Rai, Amit; Yamazaki, Mami; Takahashi, Hiroki; Nakamura, Michimi; Kojoma, Mareshige; Suzuki, Hideyuki; Saito, Kazuki
2016-01-01
The Panax genus has been a source of natural medicine, benefitting human health over the ages, among which the Panax japonicus represents an important species. Our understanding of several key pathways and enzymes involved in the biosynthesis of ginsenosides, a pharmacologically active class of metabolites and a major chemical constituents of the rhizome extracts from the Panax species, are limited. Limited genomic information, and lack of studies on comparative transcriptomics across the Panax species have restricted our understanding of the biosynthetic mechanisms of these and many other important classes of phytochemicals. Herein, we describe Illumina based RNA sequencing analysis to characterize the transcriptome and expression profiles of genes expressed in the five tissues of P. japonicus, and its comparison with other Panax species. RNA sequencing and de novo transcriptome assembly for P. japonicus resulted in a total of 135,235 unigenes with 78,794 (58.24%) unigenes being annotated using NCBI-nr database. Transcriptome profiling, and gene ontology enrichment analysis for five tissues of P. japonicus showed that although overall processes were evenly conserved across all tissues. However, each tissue was characterized by several unique unigenes with the leaves showing the most unique unigenes among the tissues studied. A comparative analysis of the P. japonicus transcriptome assembly with publically available transcripts from other Panax species, namely, P. ginseng, P. notoginseng, and P. quinquefolius also displayed high sequence similarity across all Panax species, with P. japonicus showing highest similarity with P. ginseng. Annotation of P. japonicus transcriptome resulted in the identification of putative genes encoding all enzymes from the triterpene backbone biosynthetic pathways, and identified 24 and 48 unigenes annotated as cytochrome P450 (CYP) and glycosyltransferases (GT), respectively. These CYPs and GTs annotated unigenes were conserved across all Panax species and co-expressed with other the transcripts involved in the triterpenoid backbone biosynthesis pathways. Unigenes identified in this study represent strong candidates for being involved in the triterpenoid saponins biosynthesis, and can serve as a basis for future validation studies. PMID:27148308
Kuo, Hsiao-Che; Hsu, Hao-Hsuan; Chua, Chee Shin; Wang, Ting-Yu; Chen, Young-Mao; Chen, Tzong-Yueh
2014-01-01
Most giant groupers in the market are derived from inbred stock. Inbreeding can cause trait depression, compromising the animals’ fitness and disease resistance, obligating farmers to apply increased amounts of drugs. In order to solve this problem, a pedigree classification method is needed. Here, microsatellite and mitochondrial DNA were used as genetic markers to analyze the genetic relationships among giant grouper broodstocks. The 776-bp fragment of high polymorphic mitochondrial D-loop sequence was selected for measuring sibling relatedness. In a sample of 118 giant groupers, 42 haplotypes were categorized, with nucleotide diversity (π) of 0.00773 and haplotype diversity (HD) of 0.983. Furthermore, microsatellites were used for investigation of parentage. Six out of 33 microsatellite loci were selected as markers based on having a high number of alleles and compliance with Hardy-Weinberg equilibrium. Microsatellite profiles based on these loci provide high variability with low combined non-exclusion probability, permitting practical use in aquaculture. The method described here could be used to improve grouper broodstock management and lower the chances of inbreeding. This approach is expected to lead to production of higher quality groupers with higher disease resistance, thereby reducing the need for drug application. PMID:24796300
Yan, Guoyong; Zhang, Gen; Huang, Jiaomei; Lan, Yi; Sun, Jin; Zeng, Cong; Wang, Yong; Qian, Pei-Yuan; He, Lisheng
2017-10-27
Megabalanus barnacle is one of the model organisms for marine biofouling research. However, further elucidation of molecular mechanisms underlying larval settlement has been hindered due to the lack of genomic information thus far. In the present study, cDNA libraries were constructed for cyprids, the key stage for larval settlement, and adults of Megabalanus volcano . After high-throughput sequencing and de novo assembly, 42,620 unigenes were obtained with a N50 value of 1532 bp. These unigenes were annotated by blasting against the NCBI non-redundant (nr), Swiss-Prot, Cluster of Orthologous Groups (COG), and Kyoto Encyclopedia of Genes and Genomes (KEGG) databases. Finally, 19,522, 15,691, 14,459, and 10,914 unigenes were identified correspondingly. There were 22,158 differentially expressed genes (DEGs) identified between two stages. Compared with the cyprid stage, 8241 unigenes were down-regulated and 13,917 unigenes were up-regulated at the adult stage. The neuroactive ligand-receptor interaction pathway (ko04080) was significantly enriched by KEGG enrichment analysis of the DEGs, suggesting that it possibly involved in larval settlement. Potential functions of three conserved allatostatin neuropeptide-receptor pairs and two light-sensitive opsin proteins were further characterized, indicating that they might regulate attachment and metamorphosis at cyprid stage. These results provided a deeper insight into the molecular mechanisms underlying larval settlement of barnacles.
Yan, Guoyong; Huang, Jiaomei; Lan, Yi; Zeng, Cong; Wang, Yong; Qian, Pei-Yuan; He, Lisheng
2017-01-01
Megabalanus barnacle is one of the model organisms for marine biofouling research. However, further elucidation of molecular mechanisms underlying larval settlement has been hindered due to the lack of genomic information thus far. In the present study, cDNA libraries were constructed for cyprids, the key stage for larval settlement, and adults of Megabalanus volcano. After high-throughput sequencing and de novo assembly, 42,620 unigenes were obtained with a N50 value of 1532 bp. These unigenes were annotated by blasting against the NCBI non-redundant (nr), Swiss-Prot, Cluster of Orthologous Groups (COG), and Kyoto Encyclopedia of Genes and Genomes (KEGG) databases. Finally, 19,522, 15,691, 14,459, and 10,914 unigenes were identified correspondingly. There were 22,158 differentially expressed genes (DEGs) identified between two stages. Compared with the cyprid stage, 8241 unigenes were down-regulated and 13,917 unigenes were up-regulated at the adult stage. The neuroactive ligand-receptor interaction pathway (ko04080) was significantly enriched by KEGG enrichment analysis of the DEGs, suggesting that it possibly involved in larval settlement. Potential functions of three conserved allatostatin neuropeptide-receptor pairs and two light-sensitive opsin proteins were further characterized, indicating that they might regulate attachment and metamorphosis at cyprid stage. These results provided a deeper insight into the molecular mechanisms underlying larval settlement of barnacles. PMID:29077039
Hahn, Daniel A; Ragland, Gregory J; Shoemaker, D DeWayne; Denlinger, David L
2009-01-01
Background Flesh flies in the genus Sarcophaga are important models for investigating endocrinology, diapause, cold hardiness, reproduction, and immunity. Despite the prominence of Sarcophaga flesh flies as models for insect physiology and biochemistry, and in forensic studies, little genomic or transcriptomic data are available for members of this genus. We used massively parallel pyrosequencing on the Roche 454-FLX platform to produce a substantial EST dataset for the flesh fly Sarcophaga crassipalpis. To maximize sequence diversity, we pooled RNA extracted from whole bodies of all life stages and normalized the cDNA pool after reverse transcription. Results We obtained 207,110 ESTs with an average read length of 241 bp. These reads assembled into 20,995 contigs and 31,056 singletons. Using BLAST searches of the NR and NT databases we were able to identify 11,757 unique gene elements (E<0.0001) representing approximately 9,000 independent transcripts. Comparison of the distribution of S. crassipalpis unigenes among GO Biological Process functional groups with that of the Drosophila melanogaster transcriptome suggests that our ESTs are broadly representative of the flesh fly transcriptome. Insertion and deletion errors in 454 sequencing present a serious hurdle to comparative transcriptome analysis. Aided by a new approach to correcting for these errors, we performed a comparative analysis of genetic divergence across GO categories among S. crassipalpis, D. melanogaster, and Anopheles gambiae. The results suggest that non-synonymous substitutions occur at similar rates across categories, although genes related to response to stimuli may evolve slightly faster. In addition, we identified over 500 potential microsatellite loci and more than 12,000 SNPs among our ESTs. Conclusion Our data provides the first large-scale EST-project for flesh flies, a much-needed resource for exploring this model species. In addition, we identified a large number of potential microsatellite and SNP markers that could be used in population and systematic studies of S. crassipalpis and other flesh flies. PMID:19454017
Tada, T; Seki, Y; Kameyama, Y; Kikkawa, Y; Wada, K
2016-12-19
The Ezo red fox (Vulpes vulpes schrencki), a subspecies endemic to Hokkaido island, Japan, is a known host species for the tapeworm Echinococcus multilocularis. To develop tools for molecular ecological studies, we isolated 28 microsatellite regions from the genome of Ezo red fox, and developed 18 polymorphic microsatellite markers. These markers were characterized using 7 individuals and 22 fecal samples of the Ezo red fox. The number of alleles for these markers ranged from 1 to 7, and the observed heterozygosity, estimated on the basis of the genotypes of 7 individuals, ranged from 0.29 to 1.00. All markers, except DvNok5, were in Hardy-Weinberg equilibrium (P > 0.05), and no linkage disequilibrium was detected among these loci, except between DvNok14 and DvNok28 (P = 0.01). Moreover, six microsatellite loci were successfully genotyped using feces-derived DNA from the Ezo red fox. The markers developed in our study might serve as a useful tool for molecular ecological studies of the Ezo red fox.
A Genetic Linkage Map of the Male Goat Genome
Vaiman, D.; Schibler, L.; Bourgeois, F.; Oustry, A.; Amigues, Y.; Cribiu, E. P.
1996-01-01
This paper presents a first genetic linkage map of the goat genome. Primers derived from the flanking sequences of 612 bovine, ovine and goat microsatellite markers were gathered and tested for amplification with goat DNA under standardized PCR conditions. This screen made it possible to choose a set of 55 polymorphic markers that can be used in the three species and to define a panel of 223 microsatellites suitable for the goat. Twelve half-sib paternal goat families were then used to build a linkage map of the goat genome. The linkage analysis made it possible to construct a meiotic map covering 2300 cM, i.e., >80% of the total estimated length of the goat genome. Moreover, eight cosmids containing microsatellites were mapped by fluorescence in situ hybridization in goat and sheep. Together with 11 microsatellite-containing cosmids previously mapped in cattle (and supposing conservation of the banding pattern between this species and the goat) and data from the sheep map, these results made the orientation of 15 linkage groups possible. Furthermore, 12 coding sequences were mapped either genetically or physically, providing useful data for comparative mapping. PMID:8878693
Yu, Jeong-Nam; Won, Changman; Jun, Jumin; Lim, YoungWoon; Kwak, Myounghai
2011-01-01
Background Microsatellites, a special class of repetitive DNA sequence, have become one of the most popular genetic markers for population/conservation genetic studies. However, its application to endangered species has been impeded by high development costs, a lack of available sequences, and technical difficulties. The water deer Hydropotes inermis is the sole existing endangered species of the subfamily Capreolinae. Although population genetics studies are urgently required for conservation management, no species-specific microsatellite marker has been reported. Methods We adopted next-generation sequencing (NGS) to elucidate the microsatellite markers of Korean water deer and overcome these impediments on marker developments. We performed genotyping to determine the efficiency of this method as applied to population genetics. Results We obtained 98 Mbp of nucleotide information from 260,467 sequence reads. A total of 20,101 di-/tri-nucleotide repeat motifs were identified; di-repeats were 5.9-fold more common than tri-repeats. [CA]n and [AAC]n/[AAT]n repeats were the most frequent di- and tri-repeats, respectively. Of the 17,206 di-repeats, 12,471 microsatellite primer pairs were derived. PCR amplification of 400 primer pairs yielded 106 amplicons and 79 polymorphic markers from 20 individual Korean water deer. Polymorphic rates of the 79 new microsatellites varied from 2 to 11 alleles per locus (He: 0.050–0.880; Ho: 0.000–1.000), while those of known microsatellite markers transferred from cattle to Chinese water deer ranged from 4 to 6 alleles per locus (He: 0.279–0.714; Ho: 0.300–0.400). Conclusions Polymorphic microsatellite markers from Korean water deer were successfully identified using NGS without any prior sequence information and deposited into the public database. Thus, the methods described herein represent a rapid and low-cost way to investigate the population genetics of endangered/non-model species. PMID:22069476
2012-01-01
Background Some organisms can survive extreme desiccation by entering into a state of suspended animation known as anhydrobiosis. Panagrolaimus superbus is a free-living anhydrobiotic nematode that can survive rapid environmental desiccation. The mechanisms that P. superbus uses to combat the potentially lethal effects of cellular dehydration may include the constitutive and inducible expression of protective molecules, along with behavioural and/or morphological adaptations that slow the rate of cellular water loss. In addition, inducible repair and revival programmes may also be required for successful rehydration and recovery from anhydrobiosis. Results To identify constitutively expressed candidate anhydrobiotic genes we obtained 9,216 ESTs from an unstressed mixed stage population of P. superbus. We derived 4,009 unigenes from these ESTs. These unigene annotations and sequences can be accessed at http://www.nematodes.org/nembase4/species_info.php?species=PSC. We manually annotated a set of 187 constitutively expressed candidate anhydrobiotic genes from P. superbus. Notable among those is a putative lineage expansion of the lea (late embryogenesis abundant) gene family. The most abundantly expressed sequence was a member of the nematode specific sxp/ral-2 family that is highly expressed in parasitic nematodes and secreted onto the surface of the nematodes' cuticles. There were 2,059 novel unigenes (51.7% of the total), 149 of which are predicted to encode intrinsically disordered proteins lacking a fixed tertiary structure. One unigene may encode an exo-β-1,3-glucanase (GHF5 family), most similar to a sequence from Phytophthora infestans. GHF5 enzymes have been reported from several species of plant parasitic nematodes, with horizontal gene transfer (HGT) from bacteria proposed to explain their evolutionary origin. This P. superbus sequence represents another possible HGT event within the Nematoda. The expression of five of the 19 putative stress response genes tested was upregulated in response to desiccation. These were the antioxidants glutathione peroxidase, dj-1 and 1-Cys peroxiredoxin, an shsp sequence and an lea gene. Conclusions P. superbus appears to utilise a strategy of combined constitutive and inducible gene expression in preparation for entry into anhydrobiosis. The apparent lineage expansion of lea genes, together with their constitutive and inducible expression, suggests that LEA3 proteins are important components of the anhydrobiotic protection repertoire of P. superbus. PMID:22281184
Šarhanová, Petra; Sharbel, Timothy F; Sochor, Michal; Vašut, Radim J; Dancák, Martin; Trávnícek, Bohumil
2017-08-01
Rubus subgenus Rubus is a group of mostly apomictic and polyploid species with a complicated taxonomy and history of ongoing hybridization. The only polyploid series with prevailing sexuality is the series Glandulosi , although the apomictic series Discolores and Radula also retain a high degree of sexuality, which is influenced by environmental conditions and/or pollen donors. The aim of this study is to detect sources of genetic variability, determine the origin of apomictic taxa and validate microsatellite markers by cloning and sequencing. A total of 206 individuals from two central European regions were genotyped for 11 nuclear microsatellite loci and the chloroplast trn L- trn F region. Microsatellite alleles were further sequenced in order to determine the exact repeat number and to detect size homoplasy due to insertions/deletions in flanking regions. The results confirm that apomictic microspecies of ser. Radula are derived from crosses between sexual series Glandulosi and apomictic series Discolores , whereby the apomict acts as pollen donor. Each apomictic microspecies is derived from a single distinct genotype differing from the parental taxa, suggesting stabilized clonal reproduction. Intraspecific variation within apomicts is considerably low compared with sexual series Glandulosi , and reflects somatic mutation accumulation. While facultative apomicts produce clonal offspring, sexual species are the conduits of origin for new genetically different apomictic lineages. One of the main driving forces of evolution and speciation in the highly apomictic subgenus Rubus in central Europe is sexuality in the series Glandulosi . Palaeovegetation data suggest that initial hybridizations took place over different time periods in the two studied regions, and that the successful origin and spread of apomictic microspecies of the series Radula took place over several millennia. Additionally, the cloning and sequencing show that standard evaluations of microsatellite repeat numbers underestimate genetic variability considering homoplasy in allele size. © The Author 2017. Published by Oxford University Press on behalf of the Annals of Botany Company. All rights reserved. For Permissions, please email: journals.permissions@oup.com
Pajuelo, Mónica J.; Eguiluz, María; Dahlstrom, Eric; Requena, David; Guzmán, Frank; Ramirez, Manuel; Sheen, Patricia; Frace, Michael; Sammons, Scott; Cama, Vitaliano; Anzick, Sarah; Bruno, Dan; Mahanty, Siddhartha; Wilkins, Patricia; Nash, Theodore; Gonzalez, Armando; García, Héctor H.; Gilman, Robert H.; Porcella, Steve; Zimic, Mirko
2015-01-01
Background Infections with Taenia solium are the most common cause of adult acquired seizures worldwide, and are the leading cause of epilepsy in developing countries. A better understanding of the genetic diversity of T. solium will improve parasite diagnostics and transmission pathways in endemic areas thereby facilitating the design of future control measures and interventions. Microsatellite markers are useful genome features, which enable strain typing and identification in complex pathogen genomes. Here we describe microsatellite identification and characterization in T. solium, providing information that will assist in global efforts to control this important pathogen. Methods For genome sequencing, T. solium cysts and proglottids were collected from Huancayo and Puno in Peru, respectively. Using next generation sequencing (NGS) and de novo assembly, we assembled two draft genomes and one hybrid genome. Microsatellite sequences were identified and 36 of them were selected for further analysis. Twenty T. solium isolates were collected from Tumbes in the northern region, and twenty from Puno in the southern region of Peru. The size-polymorphism of the selected microsatellites was determined with multi-capillary electrophoresis. We analyzed the association between microsatellite polymorphism and the geographic origin of the samples. Results The predicted size of the hybrid (proglottid genome combined with cyst genome) T. solium genome was 111 MB with a GC content of 42.54%. A total of 7,979 contigs (>1,000 nt) were obtained. We identified 9,129 microsatellites in the Puno-proglottid genome and 9,936 in the Huancayo-cyst genome, with 5 or more repeats, ranging from mono- to hexa-nucleotide. Seven microsatellites were polymorphic and 29 were monomorphic within the analyzed isolates. T. solium tapeworms were classified into two genetic groups that correlated with the North/South geographic origin of the parasites. Conclusions/Significance The availability of draft genomes for T. solium represents a significant step towards the understanding the biology of the parasite. We report here a set of T. solium polymorphic microsatellite markers that appear promising for genetic epidemiology studies. PMID:26697878
USDA-ARS?s Scientific Manuscript database
Jasmonic acid is a natural plant hormone that induces native defense responses in plants. Sugarbeet (Beta vulgaris L.) root unigenes that were differentially expressed 2 and 60 days after a postharvest jasmonic acid treatment are presented. Data include changes in unigene expression relative to wate...
Zheng, Xiasheng; Xu, Hui; Ma, Xinye; Zhan, Ruoting; Chen, Weiwen
2014-01-01
Ilex asprella, which contains abundant α-amyrin type triterpenoid saponins, is an anti-influenza herbal drug widely used in south China. In this work, we first analysed the transcriptome of the I. asprella root using RNA-Seq, which provided a dataset for functional gene mining. mRNA was isolated from the total RNA of the I. asprella root and reverse-transcribed into cDNA. Then, the cDNA library was sequenced using an Illumina HiSeq™ 2000, which generated 55,028,452 clean reads. De novo assembly of these reads generated 51,865 unigenes, in which 39,269 unigenes were annotated (75.71% yield). According to the structures of the triterpenoid saponins of I. asprella, a putative biosynthetic pathway downstream of 2,3-oxidosqualene was proposed and candidate unigenes in the transcriptome data that were potentially involved in the pathway were screened using homology-based BLAST and phylogenetic analysis. Further amplification and functional analysis of these putative unigenes will provide insight into the biosynthesis of Ilex triterpenoid saponins. PMID:24722569
Li, Fang; Xu, Zijian; Sun, Mengli; Cong, Hanqing; Qiao, Fei; Zhong, Xiaohong
2017-01-01
Hedera helix L. is an important traditional medicinal plant in Europe. The main active components are triterpenoid saponins, but none of the potential enzymes involved in triterpenoid saponins biosynthesis have been discovered and annotated. Here is reported the first study of global transcriptome analyses using the Illumina HiSeq™ 2500 platform for H. helix. In total, over 24 million clean reads were produced and 96,333 unigenes were assembled, with an average length of 1385 nt; more than 79,085 unigenes had at least one significant match to an existing gene model. Differentially Expressed Gene analysis identified 6,222 and 7,012 unigenes which were expressed either higher or lower in leaf samples when compared with roots. After functional annotation and classification, two pathways and 410 unigenes related to triterpenoid saponins biosynthesis were discovered. The accuracy of these de novo sequences was validated by RT-qPCR analysis and a RACE clone. These data will enrich our knowledge of triterpenoid saponin biosynthesis and provide a theoretical foundation for molecular research on H. helix. PMID:28771546
Yang, Qing; Sun, Fanyue; Yang, Zhi; Li, Hongjun
2014-01-01
Calanus sinicus Brodsky (Copepoda, Crustacea) is a dominant zooplanktonic species widely distributed in the margin seas of the Northwest Pacific Ocean. In this study, we utilized an RNA-Seq-based approach to develop molecular resources for C. sinicus. Adult samples were sequenced using the Illumina HiSeq 2000 platform. The sequencing data generated 69,751 contigs from 58.9 million filtered reads. The assembled contigs had an average length of 928.8 bp. Gene annotation allowed the identification of 43,417 unigene hits against the NCBI database. Gene ontology (GO) and KEGG pathway mapping analysis revealed various functional genes related to diverse biological functions and processes. Transcripts potentially involved in stress response and lipid metabolism were identified among these genes. Furthermore, 4,871 microsatellites and 110,137 single nucleotide polymorphisms (SNPs) were identified in the C. sinicus transcriptome sequences. SNP validation by the melting temperature (T m)-shift method suggested that 16 primer pairs amplified target products and showed biallelic polymorphism among 30 individuals. The present work demonstrates the power of Illumina-based RNA-Seq for the rapid development of molecular resources in nonmodel species. The validated SNP set from our study is currently being utilized in an ongoing ecological analysis to support a future study of C. sinicus population genetics. PMID:24982883
Yu, Shijiang; Ding, Lili; Luo, Ren; Li, Xiaojiao; Yang, Juan; Liu, Haoqiang; Cong, Lin; Ran, Chun
2016-01-01
Dialeurodes citri is a major pest in citrus producing areas, and large-scale outbreaks have occurred increasingly often in recent years. Lecanicillium attenuatum is an important entomopathogenic fungus that can parasitize and kill D. citri. We separated the fungus from corpses of D. citri larvae. However, the sound immune defense system of pests makes infection by an entomopathogenic fungus difficult. Here we used RNA sequencing technology (RNA-Seq) to build a transcriptome database for D. citri and performed digital gene expression profiling to screen genes that act in the immune defense of D. citri larvae infected with a pathogenic fungus. De novo assembly generated 84,733 unigenes with mean length of 772 nt. All unigenes were searched against GO, Nr, Swiss-Prot, COG, and KEGG databases and a total of 28,190 (33.3%) unigenes were annotated. We identified 129 immunity-related unigenes in transcriptome database that were related to pattern recognition receptors, information transduction factors and response factors. From the digital gene expression profile, we identified 441 unigenes that were differentially expressed in D. citri infected with L. attenuatum. Through calculated Log2Ratio values, we identified genes for which fold changes in expression were obvious, including cuticle protein, vitellogenin, cathepsin, prophenoloxidase, clip-domain serine protease, lysozyme, and others. Subsequent quantitative real-time polymerase chain reaction analysis verified the results. The identified genes may serve as target genes for microbial control of D. citri.
Yu, Shijiang; Ding, Lili; Luo, Ren; Li, Xiaojiao; Yang, Juan; Liu, Haoqiang; Cong, Lin; Ran, Chun
2016-01-01
Dialeurodes citri is a major pest in citrus producing areas, and large-scale outbreaks have occurred increasingly often in recent years. Lecanicillium attenuatum is an important entomopathogenic fungus that can parasitize and kill D. citri. We separated the fungus from corpses of D. citri larvae. However, the sound immune defense system of pests makes infection by an entomopathogenic fungus difficult. Here we used RNA sequencing technology (RNA-Seq) to build a transcriptome database for D. citri and performed digital gene expression profiling to screen genes that act in the immune defense of D. citri larvae infected with a pathogenic fungus. De novo assembly generated 84,733 unigenes with mean length of 772 nt. All unigenes were searched against GO, Nr, Swiss-Prot, COG, and KEGG databases and a total of 28,190 (33.3%) unigenes were annotated. We identified 129 immunity-related unigenes in transcriptome database that were related to pattern recognition receptors, information transduction factors and response factors. From the digital gene expression profile, we identified 441 unigenes that were differentially expressed in D. citri infected with L. attenuatum. Through calculated Log2Ratio values, we identified genes for which fold changes in expression were obvious, including cuticle protein, vitellogenin, cathepsin, prophenoloxidase, clip-domain serine protease, lysozyme, and others. Subsequent quantitative real-time polymerase chain reaction analysis verified the results. The identified genes may serve as target genes for microbial control of D. citri. PMID:27644092
Wei, Hairong; Chen, Xin; Zong, Xiaojuan; Shu, Huairui; Gao, Dongsheng; Liu, Qingzhong
2015-01-01
Background Fruit color is one of the most important economic traits of the sweet cherry (Prunus avium L.). The red coloration of sweet cherry fruit is mainly attributed to anthocyanins. However, limited information is available regarding the molecular mechanisms underlying anthocyanin biosynthesis and its regulation in sweet cherry. Methodology/Principal Findings In this study, a reference transcriptome of P. avium L. was sequenced and annotated to identify the transcriptional determinants of fruit color. Normalized cDNA libraries from red and yellow fruits were sequenced using the next-generation Illumina/Solexa sequencing platform and de novo assembly. Over 66 million high-quality reads were assembled into 43,128 unigenes using a combined assembly strategy. Then a total of 22,452 unigenes were compared to public databases using homology searches, and 20,095 of these unigenes were annotated in the Nr protein database. Furthermore, transcriptome differences between the four stages of fruit ripening were analyzed using Illumina digital gene expression (DGE) profiling. Biological pathway analysis revealed that 72 unigenes were involved in anthocyanin biosynthesis. The expression patterns of unigenes encoding phenylalanine ammonia-lyase (PAL), 4-coumarate-CoA ligase (4CL), chalcone synthase (CHS), chalcone isomerase (CHI), flavanone 3-hydroxylase (F3H), flavanone 3’-hydroxylase (F3’H), dihydroflavonol 4-reductase (DFR), anthocyanidin synthase (ANS) and UDP glucose: flavonol 3-O-glucosyltransferase (UFGT) during fruit ripening differed between red and yellow fruit. In addition, we identified some transcription factor families (such as MYB, bHLH and WD40) that may control anthocyanin biosynthesis. We confirmed the altered expression levels of eighteen unigenes that encode anthocyanin biosynthetic enzymes and transcription factors using quantitative real-time PCR (qRT-PCR). Conclusions/Significance The obtained sweet cherry transcriptome and DGE profiling data provide comprehensive gene expression information that lends insights into the molecular mechanisms underlying anthocyanin biosynthesis. These results will provide a platform for further functional genomic research on this fruit crop. PMID:25799516
Wei, Lin; Li, Shenghua; Liu, Shenggui; He, Anna; Wang, Dan; Wang, Jie; Tang, Yulian; Wu, Xianjin
2014-01-01
Houttuynia cordata Thunb. is an important traditional medical herb in China and other Asian countries, with high medicinal and economic value. However, a lack of available genomic information has become a limitation for research on this species. Thus, we carried out high-throughput transcriptomic sequencing of H. cordata to generate an enormous transcriptome sequence dataset for gene discovery and molecular marker development. Illumina paired-end sequencing technology produced over 56 million sequencing reads from H. cordata mRNA. Subsequent de novo assembly yielded 63,954 unigenes, 39,982 (62.52%) and 26,122 (40.84%) of which had significant similarity to proteins in the NCBI nonredundant protein and Swiss-Prot databases (E-value <10(-5)), respectively. Of these annotated unigenes, 30,131 and 15,363 unigenes were assigned to gene ontology categories and clusters of orthologous groups, respectively. In addition, 24,434 (38.21%) unigenes were mapped onto 128 pathways using the KEGG pathway database and 17,964 (44.93%) unigenes showed homology to Vitis vinifera (Vitaceae) genes in BLASTx analysis. Furthermore, 4,800 cDNA SSRs were identified as potential molecular markers. Fifty primer pairs were randomly selected to detect polymorphism among 30 samples of H. cordata; 43 (86%) produced fragments of expected size, suggesting that the unigenes were suitable for specific primer design and of high quality, and the SSR marker could be widely used in marker-assisted selection and molecular breeding of H. cordata in the future. This is the first application of Illumina paired-end sequencing technology to investigate the whole transcriptome of H. cordata and to assemble RNA-seq reads without a reference genome. These data should help researchers investigating the evolution and biological processes of this species. The SSR markers developed can be used for construction of high-resolution genetic linkage maps and for gene-based association analyses in H. cordata. This work will enable future functional genomic research and research into the distinctive active constituents of this genus.
Li, Guoxi; Zhao, Yinli; Liu, Zhonghu; Gao, Chunsheng; Yan, Fengbin; Liu, Bianzhi; Feng, Jianxin
2015-06-01
Common carp (Cyprinus carpio) is one of the most important aquacultured species of the family Cyprinidae, and breeding this species for disease resistance is becoming more and more important. However, at the genome or transcriptome levels, study of the immunogenetics of disease resistance in the common carp is lacking. In this study, 60,316,906 and 75,200,328 paired-end clean reads were obtained from two cDNA libraries of the common carp spleen by Illumina paired-end sequencing technology. Totally, 130,293 unique transcript fragments (unigenes) were assembled, with an average length of 1400.57 bp. Approximately 105,612 (81.06%) unigenes could be annotated according to their homology with matches in the Nr, Nt, Swiss-Prot, COG, GO, or KEGG databases, and they were found to represent 46,747 non-redundant genes. Comparative analysis showed that 59.82% of the unigenes have significant similarity to zebrafish Refseq proteins. Gene expression comparison revealed that 10,432 and 6889 annotated unigenes were, respectively, up- and down-regulated with at least twofold changes between two developmental stages of the common carp spleen. Gene ontology and KEGG analysis were performed to classify all unigenes into functional categories for understanding gene functions and regulation pathways. In addition, 46,847 simple sequence repeats (SSRs) were detected from 35,618 unigenes, and a large number of single nucleotide polymorphism (SNP) and insertion/deletion (INDEL) sites were identified in the spleen transcriptome of common carp. This study has characterized the spleen transcriptome of the common carp for the first time, providing a valuable resource for a better understanding of the common carp immune system and defense mechanisms. This knowledge will also facilitate future functional studies on common carp immunogenetics that may eventually be applied in breeding programs. Copyright © 2015 Elsevier Ltd. All rights reserved.
Yang, Wanyun; Zheng, Junjun; Jia, Boyin; Wei, Haijun; Wang, Guiwu; Yang, Fuhe
2018-02-15
Every part of the sika deer (Cervus nippon) body is valuable traditional Chinese medicine. And sika deer is the most important semi-domestic medicinal animal that is widely bred in Jilin province northeast of China. But few studies had been conducted to characterize the microsatellite markers derived from sika deer. We firstly used IlluminaHiSeq™2500 sequencing technology obtained 125Mbp genomic data of sika deer. Using microsatellite identification tool (MISA), 22,479 microsatellites were identified. From these data, 100 potential primers were selected for further polymorphic validation, finally, 76 primer pairs were successfully amplified and 29 primer pairs were found to be obvious polymorphic in 8 different individuals. Using those polymorphic microsatellite markers, we analyzed the genetic diversity of Jilin sika deer population. The mean number of alleles of the 29 loci is 9.31 based on genotyping blood DNA from 96 Jilin sika deer; The mean expected heterozygosity and polymorphic information content (PIC) value of the 29 loci is 0.72 and 0.68 respectively, and among which 26 loci are highly polymorphic (PIC>0.50). According to the electrophoretic results and PIC value of these 29 loci, 10 loci with combined paternity exclusion probabilities>99.99% were selected to use in parentage verification for 16 sika deer. All the offspring of a family could be successfully assigned to their biological father. These microsatellite markers generated in this study could greatly facilitate future studies of molecular breeding in sika deer. Copyright © 2017 Elsevier B.V. All rights reserved.
Experimental Estimation of Mutation Rates in a Wheat Population With a Gene Genealogy Approach
Raquin, Anne-Laure; Depaulis, Frantz; Lambert, Amaury; Galic, Nathalie; Brabant, Philippe; Goldringer, Isabelle
2008-01-01
Microsatellite markers are extensively used to evaluate genetic diversity in natural or experimental evolving populations. Their high degree of polymorphism reflects their high mutation rates. Estimates of the mutation rates are therefore necessary when characterizing diversity in populations. As a complement to the classical experimental designs, we propose to use experimental populations, where the initial state is entirely known and some intermediate states have been thoroughly surveyed, thus providing a short timescale estimation together with a large number of cumulated meioses. In this article, we derived four original gene genealogy-based methods to assess mutation rates with limited bias due to relevant model assumptions incorporating the initial state, the number of new alleles, and the genetic effective population size. We studied the evolution of genetic diversity at 21 microsatellite markers, after 15 generations in an experimental wheat population. Compared to the parents, 23 new alleles were found in generation 15 at 9 of the 21 loci studied. We provide evidence that they arose by mutation. Corresponding estimates of the mutation rates ranged from 0 to 4.97 × 10−3 per generation (i.e., year). Sequences of several alleles revealed that length polymorphism was only due to variation in the core of the microsatellite. Among different microsatellite characteristics, both the motif repeat number and an independent estimation of the Nei diversity were correlated with the novel diversity. Despite a reduced genetic effective size, global diversity at microsatellite markers increased in this population, suggesting that microsatellite diversity should be used with caution as an indicator in biodiversity conservation issues. PMID:18689900
Experimental estimation of mutation rates in a wheat population with a gene genealogy approach.
Raquin, Anne-Laure; Depaulis, Frantz; Lambert, Amaury; Galic, Nathalie; Brabant, Philippe; Goldringer, Isabelle
2008-08-01
Microsatellite markers are extensively used to evaluate genetic diversity in natural or experimental evolving populations. Their high degree of polymorphism reflects their high mutation rates. Estimates of the mutation rates are therefore necessary when characterizing diversity in populations. As a complement to the classical experimental designs, we propose to use experimental populations, where the initial state is entirely known and some intermediate states have been thoroughly surveyed, thus providing a short timescale estimation together with a large number of cumulated meioses. In this article, we derived four original gene genealogy-based methods to assess mutation rates with limited bias due to relevant model assumptions incorporating the initial state, the number of new alleles, and the genetic effective population size. We studied the evolution of genetic diversity at 21 microsatellite markers, after 15 generations in an experimental wheat population. Compared to the parents, 23 new alleles were found in generation 15 at 9 of the 21 loci studied. We provide evidence that they arose by mutation. Corresponding estimates of the mutation rates ranged from 0 to 4.97 x 10(-3) per generation (i.e., year). Sequences of several alleles revealed that length polymorphism was only due to variation in the core of the microsatellite. Among different microsatellite characteristics, both the motif repeat number and an independent estimation of the Nei diversity were correlated with the novel diversity. Despite a reduced genetic effective size, global diversity at microsatellite markers increased in this population, suggesting that microsatellite diversity should be used with caution as an indicator in biodiversity conservation issues.
Bi, Lei; Guan, Chun-jie; Yang, Guan-e; Yang, Fei; Yan, Hong-yu; Li, Qing-shan
2016-04-01
The purple photosynthetic bacterium Rhodopseudomonas palustris has been widely applied to enhance the therapeutic effects of traditional Chinese medicine using novel biotransformation technology. However, comprehensive studies of the R. palustris biotransformation mechanism are rare. Therefore, investigation of the expression patterns of genes involved in metabolic pathways that are active during the biotransformation process is essential to elucidate this complicated mechanism. To promote further study of the biotransformation of R. palustris, we assembled all R. palustris transcripts using Trinity software and performed differential expression analysis of the resulting unigenes. A total of 9725, 7341 and 10,963 unigenes were obtained by assembling the alpha-rhamnetin-3-rhamnoside-treated R. palustris (RPB) reads, control R. palustris (RPS) reads and combined RPB&RPS reads, respectively. A total of 9971 unigenes assembled from the RPB&RPS reads were mapped to the nr, nt, Swiss-Prot, Gene Ontology (GO), Clusters of Orthologous Groups (COGs) and Kyoto Encyclopedia of Genes and Genomes (KEGG) (E-value <0.00001) databases using BLAST software. A total of 3360 unique differentially expressed genes (DEGs) in RPB versus RPS were identified, among which 922 unigenes were up-regulated and 2438 were down-regulated. The unigenes were mapped to the KEGG database, resulting in the identification of 7676 pathways among all annotated unigenes and 2586 pathways among the DEGs. Some sets of functional unigenes annotated to important metabolic pathways and environmental information processing were differentially expressed between the RPS and RPB samples, including those involved in energy metabolism (18.4% of total DEGs), carbohydrate metabolism (36.0% of total DEGs), ABC transport (6.0% of total DEGs), the two-component system (8.6% of total DEGs), cell motility (4.3% of total DEGs) and the cell cycle (1.5% of total DEGs). We also identified 19 transcripts annotated as hydrolytic enzymes and other enzymes involved in ARR catabolism in R. palustris. We present the first comparative transcriptome profiles of RPB and RPS samples to facilitate elucidation of the molecular mechanism of biotransformation in R. palustris. Furthermore, we propose two putative ARR biotransformation mechanisms in R. palustris. These analytical results represent a useful genomic resource for in-depth research into the molecular basis of biotransformation and genetic modification in R. palustris. Copyright © 2016 Elsevier GmbH. All rights reserved.
Sun, Peipei; Mao, Yunxiang; Li, Guiyang; Cao, Min; Kong, Fanna; Wang, Li; Bi, Guiqi
2015-06-17
Pyropia yezoensis is a model organism often used to investigate the mechanisms underlying stress tolerance in intertidal zones. The digital gene expression (DGE) approach was used to characterize a genome-wide comparative analysis of differentially expressed genes (DEGs) that influence the physiological, developmental or biochemical processes in samples subjected to 4 treatments: high-temperature stress (HT), chilling stress (CS), freezing stress (FS) and normal temperature (NT). Equal amounts of total RNAs collected from 8 samples (two biological replicates per treatment) were sequenced using the Illumina/Solexa platform. Compared with NT, a total of 2202, 1334 and 592 differentially expressed unigenes were detected in HT, CS and FS respectively. Clustering analysis suggested P. yezoensis acclimates to low and high-temperature stress condition using different mechanisms: In heat stress, the unigenes related to replication and repair of DNA and protein processing in endoplasmic reticulum were active; however at low temperature stresses, unigenes related to carbohydrate metabolism and energy metabolism were active. Analysis of gene differential expression showed that four categories of DEGs functioning as temperature sensors were found, including heat shock proteins, H2A, histone deacetylase complex and transcription factors. Heat stress caused chloroplast genes down-regulated and unigenes encoding metacaspases up-regulated, which is an important regulator of PCD. Cold stress caused an increase in the expression of FAD to improve the proportion of polyunsaturated fatty acids. An up-regulated unigene encoding farnesyl pyrophosphate synthase was found in cold stress, indicating that the plant hormone ABA also played an important role in responding to temperature stress in P. yezoensis. The variation of amount of unigenes and different gene expression pattern under different temperature stresses indicated the complicated and diverse regulation mechanism in response to temperature stress in P. yezoensis. Several common metabolism pathways were found both in P. yezoensis and in higher plants, such as FAD in low-temperature stress and HSP in heat stress. Meanwhile, many chloroplast genes and unigene related to the synthesis of abscisic acid were detected, revealing its unique temperature-regulation mechanism in this intertidal species. This sequencing dataset and analysis may serve as a valuable resource to study the mechanisms involved in abiotic stress tolerance in intertidal seaweeds.
Cushion, Melanie T; Smulian, A George; Slaven, Bradley E; Sesterhenn, Tom; Arnold, Jonathan; Staben, Chuck; Porollo, Aleksey; Adamczak, Rafal; Meller, Jarek
2007-05-09
Members of the genus Pneumocystis are fungal pathogens that cause pneumonia in a wide variety of mammals with debilitated immune systems. Little is known about their basic biological functions, including life cycle, since no species can be cultured continuously outside the mammalian lung. To better understand the pathological process, about 4500 ESTS derived from sequencing of the poly(A) tail ends of P. carinii mRNAs during fulminate infection were annotated and functionally characterized as unassembled reads, and then clustered and reduced to a unigene set with 1042 members. Because of the presence of sequences from other microbial genomes and the rat host, the analysis and compression to a unigene set was necessarily an iterative process. BLASTx analysis of the unassembled reads (UR) vs. the Uni-Prot and TREMBL databases revealed 56% had similarities to existing polypeptides at E values of
Sesterhenn, Tom; Arnold, Jonathan; Staben, Chuck; Porollo, Aleksey; Adamczak, Rafal; Meller, Jarek
2007-01-01
Members of the genus Pneumocystis are fungal pathogens that cause pneumonia in a wide variety of mammals with debilitated immune systems. Little is known about their basic biological functions, including life cycle, since no species can be cultured continuously outside the mammalian lung. To better understand the pathological process, about 4500 ESTS derived from sequencing of the poly(A) tail ends of P. carinii mRNAs during fulminate infection were annotated and functionally characterized as unassembled reads, and then clustered and reduced to a unigene set with 1042 members. Because of the presence of sequences from other microbial genomes and the rat host, the analysis and compression to a unigene set was necessarily an iterative process. BLASTx analysis of the unassembled reads (UR) vs. the Uni-Prot and TREMBL databases revealed 56% had similarities to existing polypeptides at E values of≤10−6, with the remainder lacking any significant homology. The most abundant transcripts in the UR were associated with stress responses, energy production, transcription and translation. Most (70%) of the UR had similarities to proteins from filamentous fungi (e.g., Aspergillus, Neurospora) and existing P. carinii gene products. In contrast, similarities to proteins of the yeast-like fungi, Schizosaccharomyces pombe and Saccharomyces cerevisiae, predominated in the unigene set. Gene Ontology analysis using BLAST2GO revealed P. carinii dedicated most of its transcripts to cellular and physiological processes (∼80%), molecular binding and catalytic activities (∼70%), and were primarily derived from cell and organellar compartments (∼80%). KEGG Pathway mapping showed the putative P. carinii genes represented most standard metabolic pathways and cellular processes, including the tricarboxylic acid cycle, glycolysis, amino acid biosynthesis, cell cycle and mitochondrial function. Several gene homologs associated with mating, meiosis, and sterol biosynthesis in fungi were identified. Genes encoding the major surface glycoprotein family (MSG), heat shock (HSP70), and proteases (PROT/KEX) were the most abundantly expressed of known P. carinii genes. The apparent presence of many metabolic pathways in P. carinii, sexual reproduction within the host, and lack of an invasive infection process in the immunologically intact host suggest members of the genus Pneumocystis may be adapted parasites and have a compatible relationship with their mammalian hosts. This study represents the first characterization of the expressed genes of a non-culturable fungal pathogen of mammals during the infective process. PMID:17487271
White spotting in the domestic cat (Felis catus) maps near KIT on feline chromosome B1
Cooper, MP; Fretwell, N; Bailey, SJ; Lyons, LA
2006-01-01
Summary Five feline-derived microsatellite markers were genotyped in a large pedigree of cats that segregates for ventral white spotting. Both KIT and EDNRB cause similar white spotting phenotypes in other species. Thus, three of the five microsatellite markers chosen were on feline chromosome B1 in close proximity to KIT; the other two markers were on feline chromosome A1 near EDNRB. Pairwise linkage analysis supported linkage of the white spotting with the three chromosome B1 markers but not with the two chromosome A1 markers. This study indicates that KIT, or another gene within the linked region, is a candidate for white spotting in cats. Platelet-derived growth factor alpha (PDGFRA) is also a strong candidate, assuming that the KIT–PDGFRA linkage group, which is conserved in many mammalian species, is also conserved in the cat. PMID:16573531
Rai, Amit; Nakaya, Taiki; Shimizu, Yohei; Rai, Megha; Nakamura, Michimi; Suzuki, Hideyuki; Saito, Kazuki; Yamazaki, Mami
2018-05-29
Lithospermum officinale is a valuable source of bioactive metabolites with medicinal and industrial values. However, little is known about genes involved in the biosynthesis of these metabolites, primarily due to the lack of genome or transcriptome resources. This study presents the first effort to establish and characterize de novo transcriptome assembly resource for L. officinale and expression analysis for three of its tissues, namely leaf, stem, and root. Using over 4Gbps of RNA-sequencing datasets, we obtained de novo transcriptome assembly of L. officinale , consisting of 77,047 unigenes with assembly N50 value as 1524 bps. Based on transcriptome annotation and functional classification, 52,766 unigenes were assigned with putative genes functions, gene ontology terms, and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways. KEGG pathway and gene ontology enrichment analysis using highly expressed unigenes across three tissues and targeted metabolome analysis showed active secondary metabolic processes enriched specifically in the root of L. officinale . Using co-expression analysis, we also identified 20 and 48 unigenes representing different enzymes of lithospermic/chlorogenic acid and shikonin biosynthesis pathways, respectively. We further identified 15 candidate unigenes annotated as cytochrome P450 with the highest expression in the root of L. officinale as novel genes with a role in key biochemical reactions toward shikonin biosynthesis. Thus, through this study, we not only generated a high-quality genomic resource for L. officinale but also propose candidate genes to be involved in shikonin biosynthesis pathways for further functional characterization. Georg Thieme Verlag KG Stuttgart · New York.
NASA Astrophysics Data System (ADS)
Zhang, Hui; Zhai, Yuxiu; Yao, Lin; Jiang, Yanhua; Li, Fengling
2017-05-01
Chlamys farreri is an economically important mollusk that can accumulate excessive amounts of cadmium (Cd). Studying the molecular mechanism of Cd accumulation in bivalves is difficult because of the lack of genome background. Transcriptomic analysis based on high-throughput RNA sequencing has been shown to be an efficient and powerful method for the discovery of relevant genes in non-model and genome reference-free organisms. Here, we constructed two cDNA libraries (control and Cd exposure groups) from the digestive gland of C. farreri and compared the transcriptomic data between them. A total of 227 673 transcripts were assembled into 105 071 unigenes, most of which shared high similarity with sequences in the NCBI non-redundant protein database. For functional classification, 24 493 unigenes were assigned to Gene Ontology terms. Additionally, EuKaryotic Ortholog Groups and Kyoto Encyclopedia of Genes and Genomes analyses assigned 12 028 unigenes to 26 categories and 7 849 unigenes to five pathways, respectively. Comparative transcriptomics analysis identified 3 800 unigenes that were differentially expressed in the Cd-treated group compared with the control group. Among them, genes associated with heavy metal accumulation were screened, including metallothionein, divalent metal transporter, and metal tolerance protein. The functional genes and predicted pathways identified in our study will contribute to a better understanding of the metabolic and immune system in the digestive gland of C. farreri. In addition, the transcriptomic data will provide a comprehensive resource that may contribute to the understanding of molecular mechanisms that respond to marine pollutants in bivalves.
Ouyang, Kunxi; Li, Juncheng; Zhao, Xianhai; Que, Qingmin; Li, Pei; Huang, Hao; Deng, Xiaomei; Singh, Sunil Kumar; Wu, Ai-Min; Chen, Xiaoyang
2016-01-01
Neolamarckia cadamba is a fast-growing tropical hardwood tree that is used extensively for plywood and pulp production, light furniture fabrication, building materials, and as a raw material for the preparation of certain indigenous medicines. Lack of genomic resources hampers progress in the molecular breeding and genetic improvement of this multipurpose tree species. In this study, transcriptome profiling of differentiating stems was performed to understand N. cadamba xylogenesis. The N. cadamba transcriptome was sequenced using Illumina paired-end sequencing technology. This generated 42.49 G of raw data that was then de novo assembled into 55,432 UniGenes with a mean length of 803.2bp. Approximately 47.8% of the UniGenes (26,487) were annotated against publically available protein databases, among which 21,699 and 7,754 UniGenes were assigned to Gene Ontology categories (GO) and Clusters of Orthologous Groups (COG), respectively. 5,589 UniGenes could be mapped onto 116 pathways using the Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway database. Among 6,202 UniGenes exhibiting differential expression during xylogenesis, 1,634 showed significantly higher levels of expression in the basal and middle stem segments compared to the apical stem segment. These genes included NAC and MYB transcription factors related to secondary cell wall biosynthesis, genes related to most metabolic steps of lignin biosynthesis, and CesA genes involved in cellulose biosynthesis. This study lays the foundation for further screening of key genes associated with xylogenesis in N. cadamba as well as enhancing our understanding of the mechanism of xylogenesis in fast-growing trees.
Valade, R; Kenis, M; Hernandez-Lopez, A; Augustin, S; Mari Mena, N; Magnoux, E; Rougerie, R; Lakatos, F; Roques, A; Lopez-Vaamonde, C
2009-08-01
Biological invasions usually start with a small number of founder individuals. These founders are likely to represent a small fraction of the total genetic diversity found in the source population. Our study set out to trace genetically the geographical origin of the horse-chestnut leafminer, Cameraria ohridella, an invasive microlepidopteran whose area of origin is still unkown. Since its discovery in Macedonia 25 years ago, this insect has experienced an explosive westward range expansion, progressively colonizing all of Central and Western Europe. We used cytochrome oxidase I sequences (DNA barcode fragment) and a set of six polymorphic microsatellites to assess the genetic variability of C. ohridella populations, and to test the hypothesis that C. ohridella derives from the southern Balkans (Albania, Macedonia and Greece). Analysis of mtDNA of 486 individuals from 88 localities allowed us to identify 25 geographically structured haplotypes. In addition, 480 individuals from 16 populations from Europe and the southern Balkans were genotyped for 6 polymorphic microsatellite loci. High haplotype diversity and low measures of nucleotide diversities including a significantly negative Tajima's D indicate that C. ohridella has experienced rapid population expansion during its dispersal across Europe. Both mtDNA and microsatellites show a reduction in genetic diversity of C. ohridella populations sampled from artificial habitats (e.g. planted trees in public parks, gardens, along roads in urban or sub-urban areas) across Europe compared with C. ohridella sampled in natural stands of horse-chestnuts in the southern Balkans. These findings suggest that European populations of C. ohridella may indeed derive from the southern Balkans.
Ng, C Y; Wickneswari, R; Choong, C Y
2014-08-07
Calamus palustris Griff. is an economically important dioecious rattan species in Southeast Asia. However, dioecy and onset of flowering at 3-4 years old render uncertainties in desired female:male seedling ratios to establish a productive seed orchard for this rattan species. We constructed a subtractive library for male floral tissue to understand the genetic mechanism for gender determination in C. palustris. The subtractive library produced 1536 clones with 1419 clones of high quality. Reverse Northern screening showed 313 clones with differential expression, and sequence analyses clustered them into 205 unigenes, including 32 contigs and 173 singletons. The subtractive library was further validated with reverse transcription-quantitative polymerase chain reaction analysis. Homology identification classified the unigenes into 12 putative functional proteins with 83% unigenes showing significant match to proteins in databases. Functional annotations of these unigenes revealed genes involved in male flower development, including MADS-box genes, pollen-related genes, phytohormones for flower development, and male flower organ development. Our results showed that the male floral genes may play a vital role in sex determination in C. palustris. The identified genes can be exploited to understand the molecular basis of sex determination in C. palustris.
The Spatial and Temporal Transcriptomic Landscapes of Ginseng, Panax ginseng C. A. Meyer.
Wang, Kangyu; Jiang, Shicui; Sun, Chunyu; Lin, Yanping; Yin, Rui; Wang, Yi; Zhang, Meiping
2015-12-11
Ginseng, including Asian ginseng (Panax ginseng C. A. Meyer) and American ginseng (P. quinquefolius L.), is one of the most important medicinal herbs in Asia and North America, but significantly understudied. This study sequenced and characterized the transcriptomes and expression profiles of genes expressed in 14 tissues and four different aged roots of Asian ginseng. A total of 265.2 million 100-bp clean reads were generated using the high-throughput sequencing platform HiSeq 2000, representing >8.3x of the 3.2-Gb ginseng genome. From the sequences, 248,993 unigenes were assembled for whole plant, 61,912-113,456 unigenes for each tissue and 54,444-65,412 unigenes for different year-old roots. We comprehensively analyzed the unigene sets and gene expression profiles. We found that the number of genes allocated to each functional category is stable across tissues or developmental stages, while the expression profiles of different genes of a gene family or involved in ginsenoside biosynthesis dramatically diversified spatially and temporally. These results provide an overall insight into the spatial and temporal transcriptome dynamics and landscapes of Asian ginseng, and comprehensive resources for advanced research and breeding of ginseng and related species.
Zhu, Fengjiao; Yang, Zongying; Zhang, Yiliu; Hu, Kun; Fang, Wenhong
2017-01-01
Enrofloxacin is the most commonly used antibiotic to control diseases in aquatic animals caused by A. hydrophila. This study conducted de novo transcriptome sequencing and compared the global transcriptomes of enrofloxacin-resistant and enrofloxacin-susceptible strains. We got a total of 4,714 unigenes were assembled. Of these, 4,122 were annotated. A total of 3,280 unigenes were assigned to GO, 3,388 unigenes were classified into Cluster of Orthologous Groups of proteins (COG) using BLAST and BLAST2GO software, and 2,568 were mapped onto pathways using the Kyoto Encyclopedia of Gene and Genomes Pathway database. Furthermore, 218 unigenes were deemed to be DEGs. After enrofloxacin treatment, 135 genes were upregulated and 83 genes were downregulated. The GO terms biological process (126 genes) and metabolic process (136 genes) were the most enriched, and the terms for protein folding, response to stress, and SOS response were also significantly enriched. This study identified enrofloxacin treatment affects multiple biological functions of A. hydrophila. Enrofloxacin resistance in A. hydrophila is closely related to the reduction of intracellular drug accumulation caused by ABC transporters and increased expression of topoisomerase IV.
Yang, Zongying; Zhang, Yiliu; Hu, Kun; Fang, Wenhong
2017-01-01
Enrofloxacin is the most commonly used antibiotic to control diseases in aquatic animals caused by A. hydrophila. This study conducted de novo transcriptome sequencing and compared the global transcriptomes of enrofloxacin-resistant and enrofloxacin-susceptible strains. We got a total of 4,714 unigenes were assembled. Of these, 4,122 were annotated. A total of 3,280 unigenes were assigned to GO, 3,388 unigenes were classified into Cluster of Orthologous Groups of proteins (COG) using BLAST and BLAST2GO software, and 2,568 were mapped onto pathways using the Kyoto Encyclopedia of Gene and Genomes Pathway database. Furthermore, 218 unigenes were deemed to be DEGs. After enrofloxacin treatment, 135 genes were upregulated and 83 genes were downregulated. The GO terms biological process (126 genes) and metabolic process (136 genes) were the most enriched, and the terms for protein folding, response to stress, and SOS response were also significantly enriched. This study identified enrofloxacin treatment affects multiple biological functions of A. hydrophila. Enrofloxacin resistance in A. hydrophila is closely related to the reduction of intracellular drug accumulation caused by ABC transporters and increased expression of topoisomerase IV. PMID:28708867
NASA Astrophysics Data System (ADS)
Haifig, Ives; Vargo, Edward L.; Labadie, Paul; Costa-Leonardo, Ana Maria
2016-02-01
A termite colony is usually founded by a pair of alates, the primary reproductives, which produce all the nestmates. In some species, secondary reproductives appear to either replace the primaries or supplement colony reproduction. In termites, secondary reproductives are generally ergatoids derived from workers or nymphoids derived from nymphs. Silvestritermes euamignathus is a termite species that forms multiple nymphoid reproductives, and to date it was hypothesized that these secondary reproductives were the progeny of the primary founding reproductives. We developed markers for 12 microsatellite loci and used COI mitochondrial DNA (mtDNA) to genotype 59 nymphoid neotenics found in a colony of S. euamignathus to test this hypothesis. Our results showed that nymphoids of S. euamignathus are not all siblings. The microsatellite analysis suggests that the secondary reproductives derived from a minimum of four different pairs of reproductives belonging to at least two different matrilines. This is the first record of non-sibling secondary reproductives occupying the same nest in a higher termite. These unrelated reproductives might be the result of either pleometrotic colony foundation or colony fusion.
Development and Characterization of 1,906 EST-SSR Markers from Unigenes in Jute (Corchorus spp.)
Zhang, Liwu; Li, Yanru; Tao, Aifen; Fang, Pingping; Qi, Jianmin
2015-01-01
Jute, comprising white and dark jute, is the second important natural fiber crop after cotton worldwide. However, the lack of expressed sequence tag-derived simple sequence repeat (EST-SSR) markers has resulted in a large gap in the improvement of jute. Previously, de novo 48,914 unigenes from white jute were assembled. In this study, 1,906 EST-SSRs were identified from these assembled uingenes. Among these markers, di-, tri- and tetra-nucleotide repeat types were the abundant types (12.0%, 56.9% and 21.6% respectively). The AG-rich or GA-rich nucleotide repeats were the predominant. Subsequently, a sample of 116 SSRs, located in genes encoding transcription factors and cellulose synthases, were selected to survey polymorphisms among12 diverse jute accessions. Of these, 83.6% successfully amplified at least one fragment and detected polymorphism among the 12diverse genotypes, indicating that the newly developed SSRs are of good quality. Furthermore, the genetic similarity coefficients of all the 12 accessions were evaluated using 97 polymorphic SSRs. The cluster analysis divided the jute accessions into two main groups with genetic similarity coefficient of 0.61. These EST-SSR markers not only enrich molecular markers of jute genome, but also facilitate genetic and genomic researches in jute. PMID:26512891
2012-01-01
Background In rubber tree, bark is one of important agricultural and biological organs. However, the molecular mechanism involved in the bark formation and development in rubber tree remains largely unknown, which is at least partially due to lack of bark transcriptomic and genomic information. Therefore, it is necessary to carried out high-throughput transcriptome sequencing of rubber tree bark to generate enormous transcript sequences for the functional characterization and molecular marker development. Results In this study, more than 30 million sequencing reads were generated using Illumina paired-end sequencing technology. In total, 22,756 unigenes with an average length of 485 bp were obtained with de novo assembly. The similarity search indicated that 16,520 and 12,558 unigenes showed significant similarities to known proteins from NCBI non-redundant and Swissprot protein databases, respectively. Among these annotated unigenes, 6,867 and 5,559 unigenes were separately assigned to Gene Ontology (GO) and Clusters of Orthologous Group (COG). When 22,756 unigenes searched against the Kyoto Encyclopedia of Genes and Genomes Pathway (KEGG) database, 12,097 unigenes were assigned to 5 main categories including 123 KEGG pathways. Among the main KEGG categories, metabolism was the biggest category (9,043, 74.75%), suggesting the active metabolic processes in rubber tree bark. In addition, a total of 39,257 EST-SSRs were identified from 22,756 unigenes, and the characterizations of EST-SSRs were further analyzed in rubber tree. 110 potential marker sites were randomly selected to validate the assembly quality and develop EST-SSR markers. Among 13 Hevea germplasms, PCR success rate and polymorphism rate of 110 markers were separately 96.36% and 55.45% in this study. Conclusion By assembling and analyzing de novo transcriptome sequencing data, we reported the comprehensive functional characterization of rubber tree bark. This research generated a substantial fraction of rubber tree transcriptome sequences, which were very useful resources for gene annotation and discovery, molecular markers development, genome assembly and annotation, and microarrays development in rubber tree. The EST-SSR markers identified and developed in this study will facilitate marker-assisted selection breeding in rubber tree. Moreover, this study also supported that transcriptome analysis based on Illumina paired-end sequencing is a powerful tool for transcriptome characterization and molecular marker development in non-model species, especially those with large and complex genomes. PMID:22607098
Wei, Lin; Li, Shenghua; Liu, Shenggui; He, Anna; Wang, Dan; Wang, Jie; Tang, Yulian; Wu, Xianjin
2014-01-01
Background Houttuynia cordata Thunb. is an important traditional medical herb in China and other Asian countries, with high medicinal and economic value. However, a lack of available genomic information has become a limitation for research on this species. Thus, we carried out high-throughput transcriptomic sequencing of H. cordata to generate an enormous transcriptome sequence dataset for gene discovery and molecular marker development. Principal Findings Illumina paired-end sequencing technology produced over 56 million sequencing reads from H. cordata mRNA. Subsequent de novo assembly yielded 63,954 unigenes, 39,982 (62.52%) and 26,122 (40.84%) of which had significant similarity to proteins in the NCBI nonredundant protein and Swiss-Prot databases (E-value <10−5), respectively. Of these annotated unigenes, 30,131 and 15,363 unigenes were assigned to gene ontology categories and clusters of orthologous groups, respectively. In addition, 24,434 (38.21%) unigenes were mapped onto 128 pathways using the KEGG pathway database and 17,964 (44.93%) unigenes showed homology to Vitis vinifera (Vitaceae) genes in BLASTx analysis. Furthermore, 4,800 cDNA SSRs were identified as potential molecular markers. Fifty primer pairs were randomly selected to detect polymorphism among 30 samples of H. cordata; 43 (86%) produced fragments of expected size, suggesting that the unigenes were suitable for specific primer design and of high quality, and the SSR marker could be widely used in marker-assisted selection and molecular breeding of H. cordata in the future. Conclusions This is the first application of Illumina paired-end sequencing technology to investigate the whole transcriptome of H. cordata and to assemble RNA-seq reads without a reference genome. These data should help researchers investigating the evolution and biological processes of this species. The SSR markers developed can be used for construction of high-resolution genetic linkage maps and for gene-based association analyses in H. cordata. This work will enable future functional genomic research and research into the distinctive active constituents of this genus. PMID:24392108
Ke, Tao; Yu, Jingyin; Dong, Caihua; Mao, Han; Hua, Wei; Liu, Shengyi
2015-01-21
Oil crop seeds are important sources of fatty acids (FAs) for human and animal nutrition. Despite their importance, there is a lack of an essential bioinformatics resource on gene transcription of oil crops from a comparative perspective. In this study, we developed ocsESTdb, the first database of expressed sequence tag (EST) information on seeds of four large-scale oil crops with an emphasis on global metabolic networks and oil accumulation metabolism that target the involved unigenes. A total of 248,522 ESTs and 106,835 unigenes were collected from the cDNA libraries of rapeseed (Brassica napus), soybean (Glycine max), sesame (Sesamum indicum) and peanut (Arachis hypogaea). These unigenes were annotated by a sequence similarity search against databases including TAIR, NR protein database, Gene Ontology, COG, Swiss-Prot, TrEMBL and Kyoto Encyclopedia of Genes and Genomes (KEGG). Five genome-scale metabolic networks that contain different numbers of metabolites and gene-enzyme reaction-association entries were analysed and constructed using Cytoscape and yEd programs. Details of unigene entries, deduced amino acid sequences and putative annotation are available from our database to browse, search and download. Intuitive and graphical representations of EST/unigene sequences, functional annotations, metabolic pathways and metabolic networks are also available. ocsESTdb will be updated regularly and can be freely accessed at http://ocri-genomics.org/ocsESTdb/ . ocsESTdb may serve as a valuable and unique resource for comparative analysis of acyl lipid synthesis and metabolism in oilseed plants. It also may provide vital insights into improving oil content in seeds of oil crop species by transcriptional reconstruction of the metabolic network.
Deep RNA-Seq to unlock the gene bank of floral development in Sinapis arvensis.
Liu, Jia; Mei, Desheng; Li, Yunchang; Huang, Shunmou; Hu, Qiong
2014-01-01
Sinapis arvensis is a weed with strong biological activity. Despite being a problematic annual weed that contaminates agricultural crop yield, it is a valuable alien germplasm resource. It can be utilized for broadening the genetic background of Brassica crops with desirable agricultural traits like resistance to blackleg (Leptosphaeria maculans), stem rot (Sclerotinia sclerotium) and pod shatter (caused by FRUITFULL gene). However, few genetic studies of S. arvensis were reported because of the lack of genomic resources. In the present study, we performed de novo transcriptome sequencing to produce a comprehensive dataset for S. arvensis for the first time. We used Illumina paired-end sequencing technology to sequence the S. arvensis flower transcriptome and generated 40,981,443 reads that were assembled into 131,278 transcripts. We de novo assembled 96,562 high quality unigenes with an average length of 832 bp. A total of 33,662 full-length ORF complete sequences were identified, and 41,415 unigenes were mapped onto 128 pathways using the KEGG Pathway database. The annotated unigenes were compared against Brassica rapa, B. oleracea, B. napus and Arabidopsis thaliana. Among these unigenes, 76,324 were identified as putative homologs of annotated sequences in the public protein databases, of which 1194 were associated with plant hormone signal transduction and 113 were related to gibberellin homeostasis/signaling. Unigenes that did not match any of those sequence datasets were considered to be unique to S. arvensis. Furthermore, 21,321 simple sequence repeats were found. Our study will enhance the currently available resources for Brassicaceae and will provide a platform for future genomic studies for genetic improvement of Brassica crops.
Deep RNA-Seq to Unlock the Gene Bank of Floral Development in Sinapis arvensis
Liu, Jia; Mei, Desheng; Li, Yunchang; Huang, Shunmou; Hu, Qiong
2014-01-01
Sinapis arvensis is a weed with strong biological activity. Despite being a problematic annual weed that contaminates agricultural crop yield, it is a valuable alien germplasm resource. It can be utilized for broadening the genetic background of Brassica crops with desirable agricultural traits like resistance to blackleg (Leptosphaeria maculans), stem rot (Sclerotinia sclerotium) and pod shatter (caused by FRUITFULL gene). However, few genetic studies of S. arvensis were reported because of the lack of genomic resources. In the present study, we performed de novo transcriptome sequencing to produce a comprehensive dataset for S. arvensis for the first time. We used Illumina paired-end sequencing technology to sequence the S. arvensis flower transcriptome and generated 40,981,443 reads that were assembled into 131,278 transcripts. We de novo assembled 96,562 high quality unigenes with an average length of 832 bp. A total of 33,662 full-length ORF complete sequences were identified, and 41,415 unigenes were mapped onto 128 pathways using the KEGG Pathway database. The annotated unigenes were compared against Brassica rapa, B. oleracea, B. napus and Arabidopsis thaliana. Among these unigenes, 76,324 were identified as putative homologs of annotated sequences in the public protein databases, of which 1194 were associated with plant hormone signal transduction and 113 were related to gibberellin homeostasis/signaling. Unigenes that did not match any of those sequence datasets were considered to be unique to S. arvensis. Furthermore, 21,321 simple sequence repeats were found. Our study will enhance the currently available resources for Brassicaceae and will provide a platform for future genomic studies for genetic improvement of Brassica crops. PMID:25192023
Huang, Ruimin; Huang, Youjun; Sun, Zhichao; Huang, Jianqin; Wang, Zhengjia
2017-05-24
Pecan (Carya illinoinensis) is an important woody tree species because of the high content of healthy oil in its nut. Thus far, the pathways and key genes related to oil biosynthesis in developing pecan seeds remain largely unclear. Our analyses revealed that mature pecan embryo accumulated more than 80% oil, in which 90% was unsaturated fatty acids with abundant oleic acid. RNA sequencing generated 84,643 unigenes in three cDNA libraries prepared from pecan embryos collected at 105, 120, and 165 days after flowering (DAF). We identified 153 unigenes associated with lipid biosynthesis, including 107 unigenes for fatty acid biosynthesis, 34 for triacylglycerol biosynthesis, 7 for oil bodies, and 5 for transcription factors involved in oil synthesis. The genes associated with fatty acid synthesis were the most abundantly expressed genes at 120 DAF. Additionally, the biosynthesis of oil began to increase while crude fat contents increased from 16.61 to 74.45% (165 DAF). We identified four SAD, two FAD2, one FAD6, two FAD7, and two FAD8 unigenes responsible for unsaturated fatty acid biosynthesis. However, FAD3 homologues were not detected. Consequently, we inferred that the linolenic acid in developing pecan embryos is generated by FAD7 and FAD8 in plastids rather than FAD3 in endoplasmic reticula. During pecan embryo development, different unigenes are expressed for plastidial and cytosolic glycolysis. Plastidial glycolysis is more relevant to lipid synthesis than cytosolic glycolysis. The 18 most important genes associated with lipid biosynthesis were evaluated in five stages of developing embryos using quantitative PCR (qPCR). The qPCR data were well consistent with their expression in transcriptomic analyses. Our data would be important for the metabolic engineering of pecans to increase oil contents and modify fatty acid composition.
Xia, Wei; Mason, Annaliese S.; Xia, Zhihui; Qiao, Fei; Zhao, Songlin; Tang, Haoru
2013-01-01
Background Cocos nucifera (coconut), a member of the Arecaceae family, is an economically important woody palm grown in tropical regions. Despite its agronomic importance, previous germplasm assessment studies have relied solely on morphological and agronomical traits. Molecular biology techniques have been scarcely used in assessment of genetic resources and for improvement of important agronomic and quality traits in Cocos nucifera, mostly due to the absence of available sequence information. Methodology/Principal Findings To provide basic information for molecular breeding and further molecular biological analysis in Cocos nucifera, we applied RNA-seq technology and de novo assembly to gain a global overview of the Cocos nucifera transcriptome from mixed tissue samples. Using Illumina sequencing, we obtained 54.9 million short reads and conducted de novo assembly to obtain 57,304 unigenes with an average length of 752 base pairs. Sequence comparison between assembled unigenes and released cDNA sequences of Cocos nucifera and Elaeis guineensis indicated that the assembled sequences were of high quality. Approximately 99.9% of unigenes were novel compared to the released coconut EST sequences. Using BLASTX, 68.2% of unigenes were successfully annotated based on the Genbank non-redundant (Nr) protein database. The annotated unigenes were then further classified using the Gene Ontology (GO), Clusters of Orthologous Groups (COG) and Kyoto Encyclopedia of Genes and Genomes (KEGG) databases. Conclusions/Significance Our study provides a large quantity of novel genetic information for Cocos nucifera. This information will act as a valuable resource for further molecular genetic studies and breeding in coconut, as well as for isolation and characterization of functional genes involved in different biochemical pathways in this important tropical crop species. PMID:23555859
Yan, Hongwei; Cui, Xin; Shen, Xufang; Wang, Lianshun; Jiang, Linan; Liu, Haiying; Liu, Ying; Liu, Qi; Jiang, Chen
2018-06-01
The mantis shrimp Oratosquilla oratoria is a widely distributed, commercially important crustacean species. Although its conservation and the development of successful artificial breeding technologies have recently received considerable attention, there are currently no available data regarding the molecular mechanisms in controlling reproduction. In this study, we performed transcriptome sequencing of the testis, ovary, female and male eyestalks and the androgenic gland of O. oratoria, and compared the expression pattern of transcripts from the testis and ovary libraries to identify genes involved in gonadal development. A total of 147,130,937 clean reads were retrieved after removing the adapters in reads and filtering out low-quality data. All the reads were assembled into 94,990 unigenes (23,133 in testis and ovary) with an average length of 783 base pairs (bp) and N50 of 1502 bp. A search of all-unigenes against COG, GO, KEGG, KOG, Pfam, Swiss-Prot and Nr databases resulted in a total of 19,404 annotated unigenes. Comparison of the sequences in the ovary and testis libraries revealed that 1188 unigenes were up-regulated in the ovary and 2732 were up-regulated in the testis. Twenty ovary-up-regulated and 21 testis-up-regulated unigenes were confirmed by quantitative real-time PCR. Additionally, 13,437 simple sequence repeats (SSRs) and 275,799 putative single nucleotide polymorphisms (SNPs) were identified. The important functional genes and pathways identified here provide a valuable dataset for understanding the molecular mechanisms controlling gonad development in O. oratoria, and the numerous (13,437 SSRs and 275,799 SNPs) molecular markers obtained here will provide fundamental basis for functional genomic and population genetic studies of O. oratoria. Copyright © 2018 Elsevier Inc. All rights reserved.
Zhang, Gaisheng; Ju, Lan; Zhang, Jiao; Yu, Yongang; Niu, Na; Wang, Junwei; Ma, Shoucai
2015-01-01
Wheat (Triticum aestivum L.), one of the world’s most important food crops, is a strictly autogamous (self-pollinating) species with exclusively perfect flowers. Male sterility induced by chemical hybridizing agents has increasingly attracted attention as a tool for hybrid seed production in wheat; however, the molecular mechanisms of male sterility induced by the agent SQ-1 remain poorly understood due to limited whole transcriptome data. Therefore, a comparative analysis of wheat anther transcriptomes for male fertile wheat and SQ-1–induced male sterile wheat was carried out using next-generation sequencing technology. In all, 42,634,123 sequence reads were generated and were assembled into 82,356 high-quality unigenes with an average length of 724 bp. Of these, 1,088 unigenes were significantly differentially expressed in the fertile and sterile wheat anthers, including 643 up-regulated unigenes and 445 down-regulated unigenes. The differentially expressed unigenes with functional annotations were mapped onto 60 pathways using the Kyoto Encyclopedia of Genes and Genomes database. They were mainly involved in coding for the components of ribosomes, photosynthesis, respiration, purine and pyrimidine metabolism, amino acid metabolism, glutathione metabolism, RNA transport and signal transduction, reactive oxygen species metabolism, mRNA surveillance pathways, protein processing in the endoplasmic reticulum, protein export, and ubiquitin-mediated proteolysis. This study is the first to provide a systematic overview comparing wheat anther transcriptomes of male fertile wheat with those of SQ-1–induced male sterile wheat and is a valuable source of data for future research in SQ-1–induced wheat male sterility. PMID:25898130
Fan, Haikuo; Xiao, Yong; Yang, Yaodong; Xia, Wei; Mason, Annaliese S; Xia, Zhihui; Qiao, Fei; Zhao, Songlin; Tang, Haoru
2013-01-01
Cocos nucifera (coconut), a member of the Arecaceae family, is an economically important woody palm grown in tropical regions. Despite its agronomic importance, previous germplasm assessment studies have relied solely on morphological and agronomical traits. Molecular biology techniques have been scarcely used in assessment of genetic resources and for improvement of important agronomic and quality traits in Cocos nucifera, mostly due to the absence of available sequence information. To provide basic information for molecular breeding and further molecular biological analysis in Cocos nucifera, we applied RNA-seq technology and de novo assembly to gain a global overview of the Cocos nucifera transcriptome from mixed tissue samples. Using Illumina sequencing, we obtained 54.9 million short reads and conducted de novo assembly to obtain 57,304 unigenes with an average length of 752 base pairs. Sequence comparison between assembled unigenes and released cDNA sequences of Cocos nucifera and Elaeis guineensis indicated that the assembled sequences were of high quality. Approximately 99.9% of unigenes were novel compared to the released coconut EST sequences. Using BLASTX, 68.2% of unigenes were successfully annotated based on the Genbank non-redundant (Nr) protein database. The annotated unigenes were then further classified using the Gene Ontology (GO), Clusters of Orthologous Groups (COG) and Kyoto Encyclopedia of Genes and Genomes (KEGG) databases. Our study provides a large quantity of novel genetic information for Cocos nucifera. This information will act as a valuable resource for further molecular genetic studies and breeding in coconut, as well as for isolation and characterization of functional genes involved in different biochemical pathways in this important tropical crop species.
Ren, Yipeng; Xue, Junli; Yang, Huanhuan; Pan, Baoping; Bu, Wenjun
2017-05-01
The Manila clam, Ruditapes philippinarum, is one of the most economically important aquatic clams that are harvested on a large scale by the mariculture industry in China. However, increasing reports of bacterial pathogenic diseases have had a negative effect on the aquaculture industry of R. philippinarum. In the present study, the two transcriptome libraries of untreated (termed H) and challenged Vibrio anguillarum (termed HV) hepatopancreas were constructed and sequenced from Manila clam using an Illumina-based paired-end sequencing platform. In total, 75,302,886 and 66,578,976 high-quality clean reads were assembled from 101,080,746 and 99,673,538 raw data points from the two transcriptome libraries described above, respectively. Furthermore, 156,116 unigenes were generated from 210,685 transcripts, with an N50 length of 1125 bp, and from the annotated SwissProt, NR, NT, KO, GO, KOG and KEGG databases. Moreover, a total of 4071 differentially expressed unigenes (HV vs H) were detected, including 903 up-regulated and 3168 down-regulated genes. Among these differentially expressed unigenes, 226 unigenes were annotated using KEGG annotation in 16 immune-related signaling pathways, including Toll-like receptor, NF-kappa B, MAPK, NOD-like receptor, RIG-I-like receptor, and the TNF and chemokine signaling pathways. Finally, 20,341 simple sequence repeats (SSRs) and 214,430 potential single nucleotide polymorphisms (SNPs) were detected from the H and HV transcriptome libraries. In conclusion, these studies identified many candidate immune-related genes and signaling pathways and conducted a comparative analysis of the differentially expressed unigenes from Manila clam hepatopancreas in response to V. anguillarum stimulation. These data laid the foundation for studying the innate immune systems and defense mechanisms in R. philippinarum. Copyright © 2017 Elsevier Ltd. All rights reserved.
Wu, Zhigang; Wu, Jinwei; Wang, Yalin; Hou, Hongwei
2017-01-01
Premise of the study: Microsatellite or simple sequence repeat (SSR) markers were developed to investigate the influence of ecological factors on gene flow and spatial genetic structuring of the submerged plant Ranunculus bungei (Ranunculaceae), which is regarded as an important species for understanding how plants adapt to an aquatic environment. Methods and Results: Twenty-two microsatellite loci were identified from an expressed sequence tag (EST) library. The number of alleles per locus ranged from one to five, and the expected heterozygosity varied from 0.0 to 0.5 in four Chinese populations of R. bungei. Fourteen loci were polymorphic and significantly deviated from Hardy–Weinberg equilibrium. All of the loci were found to be amplifiable in two other species of Ranunculus section Batrachium, and cross-amplification in six riparian and aquatic species of Ranunculaceae was also partially successful. Conclusions: These novel EST-SSR markers will be useful for ecological and evolutionary studies of R. bungei as well as related species. PMID:28791205
Ye, Yuzhen; Wang, Zhongwei; Zhou, Jianfeng; Wu, Qingjiang
2009-08-01
Microsatellite markers and D-loop sequences of mtDNA from a female allotetraploid parent carp and her progenies of generations 1 and 2 induced by sperm of five distant fish species were analyzed. Eleven microsatellite markers were used to identify 48 alleles from the allotetraploid female. The same number of alleles (48) appeared in the first and second generations of the gynogenetic offspring, regardless of the source of the sperm used as an activator. The mtDNA D-loop analysis was performed on the female tetraploid parent, 25 gynogenetic offspring, and 5 sperm-donor species. Fourteen variable sites from the 1,018 bp sequences were observed in the offspring as compared to the female tetraploid parent. Results from D-loop sequence and microsatellite marker analysis showed exclusive maternal transmission, and no genetic information was derived from the father. Our study suggests that progenies of artificial tetraploid carp are genetically stable, which is important for genetic breeding of this tetraploid fish.
Zhang, Xiaodong; Allan, Andrew C.; Li, Caixia; Wang, Yuanzhong; Yao, Qiuyang
2015-01-01
Gentiana rigescens is an important medicinal herb in China. The main validated medicinal component gentiopicroside is synthesized in shoots, but is mainly found in the plant’s roots. The gentiopicroside biosynthetic pathway and its regulatory control remain to be elucidated. Genome resources of gentian are limited. Next-generation sequencing (NGS) technologies can aid in supplying global gene expression profiles. In this study we present sequence and transcript abundance data for the root and leaf transcriptome of G. rigescens, obtained using the Illumina Hiseq2000. Over fifty million clean reads were obtained from leaf and root libraries. This yields 76,717 unigenes with an average length of 753 bp. Among these, 33,855 unigenes were identified as putative homologs of annotated sequences in public protein and nucleotide databases. Digital abundance analysis identified 3306 unigenes differentially enriched between leaf and root. Unigenes found in both tissues were categorized according to their putative functional categories. Of the differentially expressed genes, over 130 were annotated as related to terpenoid biosynthesis. This work is the first study of global transcriptome analyses in gentian. These sequences and putative functional data comprise a resource for future investigation of terpenoid biosynthesis in Gentianaceae species and annotation of the gentiopicroside biosynthetic pathway and its regulatory mechanisms. PMID:26006235
Identification of genes differentially expressed during ripening of banana.
Manrique-Trujillo, Sandra Mabel; Ramírez-López, Ana Cecilia; Ibarra-Laclette, Enrique; Gómez-Lim, Miguel Angel
2007-08-01
The banana (Musa acuminata, subgroup Cavendish 'Grand Nain') is a climacteric fruit of economic importance. A better understanding of the banana ripening process is needed to improve fruit quality and to extend shelf life. Eighty-four up-regulated unigenes were identified by differential screening of a banana fruit cDNA subtraction library at a late ripening stage. The ripening stages in this study were defined according to the peel color index (PCI). Unigene sequences were analyzed with different databases to assign a putative identification. The expression patterns of 36 transcripts confirmed as positive by differential screening were analyzed comparing the PCI 1, PCI 5 and PCI 7 ripening stages. Expression profiles were obtained for unigenes annotated as orcinol O-methyltransferase, putative alcohol dehydrogenase, ubiquitin-protein ligase, chorismate mutase and two unigenes with non-significant matches with any reported sequence. Similar expression profiles were observed in banana pulp and peel. Our results show differential expression of a group of genes involved in processes associated with fruit ripening, such as stress, detoxification, cytoskeleton and biosynthesis of volatile compounds. Some of the identified genes had not been characterized in banana fruit. Besides providing an overview of gene expression programs and metabolic pathways at late stages of banana fruit ripening, this study contributes to increasing the information available on banana fruit ESTs.
Gao, Yi; Wei, Jiankai; Yuan, Jianbo; Zhang, Xiaojun; Li, Fuhua; Xiang, Jianhai
2017-04-24
Exoskeleton construction is an important issue in shrimp. To better understand the molecular mechanism of exoskeleton formation, development and reconstruction, the transcriptome of the entire developmental process in Litopenaeus vannamei, including nine early developmental stages and eight adult-moulting stages, was sequenced and analysed using Illumina RNA-seq technology. A total of 117,539 unigenes were obtained, with 41.2% unigenes predicting the full-length coding sequence. Gene Ontology, Clusters of Orthologous Group (COG), the Kyoto Encyclopedia of Genes and Genomes (KEGG) analysis and functional annotation of all unigenes gave a better understanding of the exoskeleton developmental process in L. vannamei. As a result, more than six hundred unigenes related to exoskeleton development were identified both in the early developmental stages and adult-moulting. A cascade of sequential expression events of exoskeleton-related genes were summarized, including exoskeleton formation, regulation, synthesis, degradation, mineral absorption/reabsorption, calcification and hardening. This new insight on major transcriptional events provide a deep understanding for exoskeleton formation and reconstruction in L. vannamei. In conclusion, this is the first study that characterized the integrated transcriptomic profiles cover the entire exoskeleton development from zygote to adult-moulting in a crustacean, and these findings will serve as significant references for exoskeleton developmental biology and aquaculture research.
Physiological and transcriptome response to cadmium in cosmos (Cosmos bipinnatus Cav.) seedlings.
Liu, Yujing; Yu, Xiaofang; Feng, Yimei; Zhang, Chao; Wang, Chao; Zeng, Jian; Huang, Zhuo; Kang, Houyang; Fan, Xing; Sha, Lina; Zhang, Haiqin; Zhou, Yonghong; Gao, Suping; Chen, Qibing
2017-10-31
To date, several species of Asteraceae have been considered as Cd-accumulators. However, little information on the Cd tolerance and associated mechanisms of Asteraceae species Cosmos bipinnatus, is known. Presently, several physiological indexes and transcriptome profiling under Cd stress were investigated. C. bipinnatus exhibited strong Cd tolerance and recommended as a Cd-accumulator, although the biomasses were reduced by Cd. Meanwhile, Cd stresses reduced Zn and Ca uptake, but increased Fe uptake. Subcellular distribution indicated that the vacuole sequestration in root mainly detoxified Cd under lower Cd stress. Whilst, cell wall binding and vacuole sequestration in root co-detoxified Cd under high Cd exposure. Meanwhile, 66,407 unigenes were assembled and 41,674 (62.75%) unigenes were annotated in at least one database. 2,658 DEGs including 1,292 up-regulated unigenes and 1,366 down-regulated unigenes were identified under 40 μmol/L Cd stress. Among of these DEGs, ZIPs, HMAs, NRAMPs and ABC transporters might participate in Cd uptake, translocation and accumulation. Many DEGs participating in several processes such as cell wall biosynthesis, GSH metabolism, TCA cycle and antioxidant system probably play critical roles in cell wall binding, vacuole sequestration and detoxification. These results provided a novel insight into the physiological and transcriptome response to Cd in C. bipinnatus seedlings.
Molecular mapping and breeding with microsatellite markers.
Lightfoot, David A; Iqbal, Muhammad J
2013-01-01
In genetics databases for crop plant species across the world, there are thousands of mapped loci that underlie quantitative traits, oligogenic traits, and simple traits recognized by association mapping in populations. The number of loci will increase as new phenotypes are measured in more diverse genotypes and genetic maps based on saturating numbers of markers are developed. A period of locus reevaluation will decrease the number of important loci as those underlying mega-environmental effects are recognized. A second wave of reevaluation of loci will follow from developmental series analysis, especially for harvest traits like seed yield and composition. Breeding methods to properly use the accurate maps of QTL are being developed. New methods to map, fine map, and isolate the genes underlying the loci will be critical to future advances in crop biotechnology. Microsatellite markers are the most useful tool for breeders. They are codominant, abundant in all genomes, highly polymorphic so useful in many populations, and both economical and technically easy to use. The selective genotyping approaches, including genotype ranking (indexing) based on partial phenotype data combined with favorable allele data and bulked segregation event (segregant) analysis (BSA), will be increasingly important uses for microsatellites. Examples of the methods for developing and using microsatellites derived from genomic sequences are presented for monogenic, oligogenic, and polygenic traits. Examples of successful mapping, fine mapping, and gene isolation are given. When combined with high-throughput methods for genotyping and a genome sequence, the use of association mapping with microsatellite markers will provide critical advances in the analysis of crop traits.
Jorge, Paulo H; Mastrochirico-Filho, Vito A; Hata, Milene E; Mendes, Natália J; Ariede, Raquel B; de Freitas, Milena Vieira; Vera, Manuel; Porto-Foresti, Fábio; Hashimoto, Diogo T
2018-01-01
The pirapitinga, Piaractus brachypomus (Characiformes, Serrasalmidae), is a fish from the Amazon basin and is considered to be one of the main native species used in aquaculture production in South America. The objectives of this study were: (1) to perform liver transcriptome sequencing of pirapitinga through NGS and then validate a set of microsatellite markers for this species; and (2) to use polymorphic microsatellites for analysis of genetic variability in farmed stocks. The transcriptome sequencing was carried out through the Roche/454 technology, which resulted in 3,696 non-redundant contigs. Of this total, 2,568 contigs had similarity in the non-redundant (nr) protein database (Genbank) and 2,075 sequences were characterized in the categories of Gene Ontology (GO). After the validation process of 30 microsatellite loci, eight markers showed polymorphism. The analysis of these polymorphic markers in farmed stocks revealed that fish farms from North Brazil had a higher genetic diversity than fish farms from Southeast Brazil. AMOVA demonstrated that the highest proportion of variation was presented within the populations. However, when comparing different groups (1: Wild; 2: North fish farms; 3: Southeast fish farms), a considerable variation between the groups was observed. The F ST values showed the occurrence of genetic structure among the broodstocks from different regions of Brazil. The transcriptome sequencing in pirapitinga provided important genetic resources for biological studies in this non-model species, and microsatellite data can be used as the framework for the genetic management of breeding stocks in Brazil, which might provide a basis for a genetic pre-breeding programme.
Mohamed Yusoff, Aini; Tan, Tze King; Hari, Ranjeev; Koepfli, Klaus-Peter; Wee, Wei Yee; Antunes, Agostinho; Sitam, Frankie Thomas; Rovie-Ryan, Jeffrine Japning; Karuppannan, Kayal Vizi; Wong, Guat Jah; Lipovich, Leonard; Warren, Wesley C.; O’Brien, Stephen J.; Choo, Siew Woh
2016-01-01
Pangolins are scale-covered mammals, containing eight endangered species. Maintaining pangolins in captivity is a significant challenge, in part because little is known about their genetics. Here we provide the first large-scale sequencing of the critically endangered Manis javanica transcriptomes from eight different organs using Illumina HiSeq technology, yielding ~75 Giga bases and 89,754 unigenes. We found some unigenes involved in the insect hormone biosynthesis pathway and also 747 lipids metabolism-related unigenes that may be insightful to understand the lipid metabolism system in pangolins. Comparative analysis between M. javanica and other mammals revealed many pangolin-specific genes significantly over-represented in stress-related processes, cell proliferation and external stimulus, probably reflecting the traits and adaptations of the analyzed pregnant female M. javanica. Our study provides an invaluable resource for future functional works that may be highly relevant for the conservation of pangolins. PMID:27618997
Zhang, Yu-Juan; Hao, Youjin; Si, Fengling; Ren, Shuang; Hu, Ganyu; Shen, Li; Chen, Bin
2014-01-01
The onion maggot Delia antiqua is a major insect pest of cultivated vegetables, especially the onion, and a good model to investigate the molecular mechanisms of diapause. To better understand the biology and diapause mechanism of the insect pest species, D. antiqua, the transcriptome was sequenced using Illumina paired-end sequencing technology. Approximately 54 million reads were obtained, trimmed, and assembled into 29,659 unigenes, with an average length of 607 bp and an N50 of 818 bp. Among these unigenes, 21,605 (72.8%) were annotated in the public databases. All unigenes were then compared against Drosophila melanogaster and Anopheles gambiae. Codon usage bias was analyzed and 332 simple sequence repeats (SSRs) were detected in this organism. These data represent the most comprehensive transcriptomic resource currently available for D. antiqua and will facilitate the study of genetics, genomics, diapause, and further pest control of D. antiqua. PMID:24615268
Alifrangis, Michael; Schousboe, Mette L.; Ishengoma, Deus; Lusingu, John; Pota, Hirva; Kavishe, Reginald A.; Pearce, Richard; Ord, Rosalynn; Lynch, Caroline; Dejene, Seyoum; Cox, Jonathan; Rwakimari, John; Minja, Daniel T.R.; Lemnge, Martha M.; Roper, Cally
2014-01-01
Super-resistant Plasmodium falciparum threatens the effectiveness of sulfadoxine–pyrimethamine in intermittent preventive treatment for malaria during pregnancy. It is characterized by the A581G Pfdhps mutation on a background of the double-mutant Pfdhps and the triple-mutant Pfdhfr. Using samples collected during 2004–2008, we investigated the evolutionary origin of the A581G mutation by characterizing microsatellite diversity flanking Pfdhps triple-mutant (437G+540E+581G) alleles from 3 locations in eastern Africa and comparing it with double-mutant (437G+540E) alleles from the same area. In Ethiopia, both alleles derived from 1 lineage that was distinct from those in Uganda and Tanzania. Uganda and Tanzania triple mutants derived from the previously characterized southeastern Africa double-mutant lineage. The A581G mutation has occurred multiple times on local Pfdhps double-mutant backgrounds; however, a novel microsatellite allele incorporated into the Tanzania lineage since 2004 illustrates the local expansion of emergent triple-mutant lineages. PMID:25061906
2012-01-01
Background Chinese fir (Cunninghamia lanceolata) is an important timber species that accounts for 20–30% of the total commercial timber production in China. However, the available genomic information of Chinese fir is limited, and this severely encumbers functional genomic analysis and molecular breeding in Chinese fir. Recently, major advances in transcriptome sequencing have provided fast and cost-effective approaches to generate large expression datasets that have proven to be powerful tools to profile the transcriptomes of non-model organisms with undetermined genomes. Results In this study, the transcriptomes of nine tissues from Chinese fir were analyzed using the Illumina HiSeq™ 2000 sequencing platform. Approximately 40 million paired-end reads were obtained, generating 3.62 gigabase pairs of sequencing data. These reads were assembled into 83,248 unique sequences (i.e. Unigenes) with an average length of 449 bp, amounting to 37.40 Mb. A total of 73,779 Unigenes were supported by more than 5 reads, 42,663 (57.83%) had homologs in the NCBI non-redundant and Swiss-Prot protein databases, corresponding to 27,224 unique protein entries. Of these Unigenes, 16,750 were assigned to Gene Ontology classes, and 14,877 were clustered into orthologous groups. A total of 21,689 (29.40%) were mapped to 119 pathways by BLAST comparison against the Kyoto Encyclopedia of Genes and Genomes (KEGG) database. The majority of the genes encoding the enzymes in the biosynthetic pathways of cellulose and lignin were identified in the Unigene dataset by targeted searches of their annotations. And a number of candidate Chinese fir genes in the two metabolic pathways were discovered firstly. Eighteen genes related to cellulose and lignin biosynthesis were cloned for experimental validating of transcriptome data. Overall 49 Unigenes, covering different regions of these selected genes, were found by alignment. Their expression patterns in different tissues were analyzed by qRT-PCR to explore their putative functions. Conclusions A substantial fraction of transcript sequences was obtained from the deep sequencing of Chinese fir. The assembled Unigene dataset was used to discover candidate genes of cellulose and lignin biosynthesis. This transcriptome dataset will provide a comprehensive sequence resource for molecular genetics research of C. lanceolata. PMID:23171398
Sahu, Jagajjit; Sen, Priyabrata; Choudhury, Manabendra Dutta; Dehury, Budheswar; Barooah, Madhumita; Modi, Mahendra Kumar
2014-01-01
Abstract Herbal medicines and traditionally used medicinal plants present an untapped potential for novel molecular target discovery using systems science and OMICS biotechnology driven strategies. Since up to 40% of the world's poor people have no access to government health services, traditional and folk medicines are often the only therapeutics available to them. In this vein, North East (NE) India is recognized for its rich bioresources. As part of the Indo-Burma hotspot, it is regarded as an epicenter of biodiversity for several plants having myriad traditional uses, including medicinal use. However, the improvement of these valuable bioresources through molecular breeding strategies, for example, using genic microsatellites or Simple Sequence Repeats (SSRs) or Expressed Sequence Tags (ESTs)-derived SSRs has not been fully utilized in large scale to date. In this study, we identified a total of 47,700 microsatellites from 109,609 ESTs of 11 medicinal plants (pineapple, papaya, noyontara, bitter orange, bermuda brass, ratalu, barbados nut, mango, mulberry, lotus, and guduchi) having proven antidiabetic properties. A total of 58,159 primer pairs were designed for the non-redundant 8060 SSR-positive ESTs and putative functions were assigned to 4483 unique contigs. Among the identified microsatellites, excluding mononucleotide repeats, di-/trinucleotides are predominant, among which repeat motifs of AG/CT and AAG/CTT were most abundant. Similarity search of SSR containing ESTs and antidiabetic gene sequences revealed 11 microsatellites linked to antidiabetic genes in five plants. GO term enrichment analysis revealed a total of 80 enriched GO terms widely distributed in 53 biological processes, 17 molecular functions, and 10 cellular components associated with the 11 markers. The present study therefore provides concrete insights into the frequency and distribution of SSRs in important medicinal resources. The microsatellite markers reported here markedly add to the genetic stock for cross transferability in these plants and the literature on biomarkers and novel drug discovery for common chronic diseases such as diabetes. PMID:24802971
Sahu, Jagajjit; Sen, Priyabrata; Choudhury, Manabendra Dutta; Dehury, Budheswar; Barooah, Madhumita; Modi, Mahendra Kumar; Talukdar, Anupam Das
2014-05-01
Herbal medicines and traditionally used medicinal plants present an untapped potential for novel molecular target discovery using systems science and OMICS biotechnology driven strategies. Since up to 40% of the world's poor people have no access to government health services, traditional and folk medicines are often the only therapeutics available to them. In this vein, North East (NE) India is recognized for its rich bioresources. As part of the Indo-Burma hotspot, it is regarded as an epicenter of biodiversity for several plants having myriad traditional uses, including medicinal use. However, the improvement of these valuable bioresources through molecular breeding strategies, for example, using genic microsatellites or Simple Sequence Repeats (SSRs) or Expressed Sequence Tags (ESTs)-derived SSRs has not been fully utilized in large scale to date. In this study, we identified a total of 47,700 microsatellites from 109,609 ESTs of 11 medicinal plants (pineapple, papaya, noyontara, bitter orange, bermuda brass, ratalu, barbados nut, mango, mulberry, lotus, and guduchi) having proven antidiabetic properties. A total of 58,159 primer pairs were designed for the non-redundant 8060 SSR-positive ESTs and putative functions were assigned to 4483 unique contigs. Among the identified microsatellites, excluding mononucleotide repeats, di-/trinucleotides are predominant, among which repeat motifs of AG/CT and AAG/CTT were most abundant. Similarity search of SSR containing ESTs and antidiabetic gene sequences revealed 11 microsatellites linked to antidiabetic genes in five plants. GO term enrichment analysis revealed a total of 80 enriched GO terms widely distributed in 53 biological processes, 17 molecular functions, and 10 cellular components associated with the 11 markers. The present study therefore provides concrete insights into the frequency and distribution of SSRs in important medicinal resources. The microsatellite markers reported here markedly add to the genetic stock for cross transferability in these plants and the literature on biomarkers and novel drug discovery for common chronic diseases such as diabetes.
The first Taxus rhizosphere microbiome revealed by shotgun metagenomic sequencing.
Hao, Da-Cheng; Zhang, Cai-Rong; Xiao, Pei-Gen
2018-06-01
In the present study, the shotgun high throughput metagenomic sequencing was implemented to globally capture the features of Taxus rhizosphere microbiome. Total reads could be assigned to 6925 species belonging to 113 bacteria phyla and 301 species of nine fungi phyla. For archaea and virus, 263 and 134 species were for the first time identified, respectively. More than 720,000 Unigenes were identified by clean reads assembly. The top five assigned phyla were Actinobacteria (363,941 Unigenes), Proteobacteria (182,053), Acidobacteria (44,527), Ascomycota (fungi; 18,267), and Chloroflexi (15,539). KEGG analysis predicted numerous functional genes; 7101 Unigenes belong to "Xenobiotics biodegradation and metabolism." A total of 12,040 Unigenes involved in defense mechanisms (e.g., xenobiotic metabolism) were annotated by eggNOG. Talaromyces addition could influence not only the diversity and structure of microbial communities of Taxus rhizosphere, but also the relative abundance of functional genes, including metabolic genes, antibiotic resistant genes, and genes involved in pathogen-host interaction, bacterial virulence, and bacterial secretion system. The structure and function of rhizosphere microbiome could be sensitive to non-native microbe addition, which could impact on the pollutant degradation. This study, complementary to the amplicon sequencing, more objectively reflects the native microbiome of Taxus rhizosphere and its response to environmental pressure, and lays a foundation for potential combination of phytoremediation and bioaugmentation. © 2018 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Cao, Zhe; Deng, Zhanao
2017-01-01
Roots are vital to plant survival and crop yield, yet few efforts have been made to characterize the expressed genes in the roots of non-model plants (root transcriptomes). This study was conducted to sequence, assemble, annotate, and characterize the root transcriptomes of three caladium cultivars (Caladium × hortulanum) using RNA-Seq. The caladium cultivars used in this study have different levels of resistance to Pythium myriotylum, the most damaging necrotrophic pathogen to caladium roots. Forty-six to 61 million clean reads were obtained for each caladium root transcriptome. De novo assembly of the reads resulted in approximately 130,000 unigenes. Based on bioinformatic analysis, 71,825 (52.3%) caladium unigenes were annotated for putative functions, 48,417 (67.4%) and 31,417 (72.7%) were assigned to Gene Ontology (GO) and Clusters of Orthologous Groups (COG), respectively, and 46,406 (64.6%) unigenes were assigned to 128 Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways. A total of 4518 distinct unigenes were observed only in Pythium-resistant “Candidum” roots, of which 98 seemed to be involved in disease resistance and defense responses. In addition, 28,837 simple sequence repeat sites and 44,628 single nucleotide polymorphism sites were identified among the three caladium cultivars. These root transcriptome data will be valuable for further genetic improvement of caladium and related aroids. PMID:28346370
Chen, Honglin; Wang, Lixia; Liu, Xiaoyan; Hu, Liangliang; Wang, Suhua; Cheng, Xuzhen
2017-07-11
Cowpea [Vigna unguiculata (L.) Walp.] is one of the most important legumes in tropical and semi-arid regions. However, there is relatively little genomic information available for genetic research on and breeding of cowpea. The objectives of this study were to analyse the cowpea transcriptome and develop genic molecular markers for future genetic studies of this genus. Approximately 54 million high-quality cDNA sequence reads were obtained from cowpea based on Illumina paired-end sequencing technology and were de novo assembled to generate 47,899 unigenes with an N50 length of 1534 bp. Sequence similarity analysis revealed 36,289 unigenes (75.8%) with significant similarity to known proteins in the non-redundant (Nr) protein database, 23,471 unigenes (49.0%) with BLAST hits in the Swiss-Prot database, and 20,654 unigenes (43.1%) with high similarity in the Kyoto Encyclopedia of Genes and Genomes (KEGG) database. Further analysis identified 5560 simple sequence repeats (SSRs) as potential genic molecular markers. Validating a random set of 500 SSR markers yielded 54 polymorphic markers among 32 cowpea accessions. This transcriptomic analysis of cowpea provided a valuable set of genomic data for characterizing genes with important agronomic traits in Vigna unguiculata and a new set of genic SSR markers for further genetic studies and breeding in cowpea and related Vigna species.
NASA Astrophysics Data System (ADS)
Tong, Yanli; Sun, Xiuqin; Wang, Bo; Wang, Ling; Li, Yan; Tian, Jinhu; Zheng, Fengrong; Zheng, Minggang
2015-01-01
Platichthys stellatus is an economically important marine bony fish species that is cultured in China on a large scale. However, very little is known about its immune-related genes. In this study, the transcriptome of the immune organs of P. stellatus that were intraperitoneally challenged with the pathogen E dwardsiella ictaluri JCM1680 is analyzed. Total RNA from four tissues (spleen, kidney, liver, and intestine) was mixed equally and then sequenced on an Illumina HiSeq 2000 platform. Overall, 28 465 813 quality reads were generated and assembled into 43 061 unigenes. Similarity searches against public protein sequence databases were used to annotate 28 291 unigenes (65.7% of the total), 368 of which were associated with immunoregulation, including 188 related to immunity response. Additionally, the transcript levels of immunity response unigenes annotated as related to tumor necrosis factor (TNF), TNF receptor, chemokine, major histocompatibility complex, and interleukin-6 were investigated in the different tissues of normal and infected P. stellatus by real-time quantitative PCR. The results confirmed that the unigenes identified in the transcriptome database were indeed expressed and up-regulated in infected P. stellatus. To our knowledge, this is the first report of the sequencing and analysis of the transcriptome of P. stellatus. These findings provide insights into the transcriptomics and immunogenetics of bony fish.
De novo RNA-seq and functional annotation of Ornithonyssus bacoti.
Niu, DongLing; Wang, RuiLing; Zhao, YaE; Yang, Rui; Hu, Li
2018-06-01
Ornithonyssus bacoti (Hirst) (Acari: Macronyssidae) is a vector and reservoir of pathogens causing serious infectious diseases, such as epidemic hemorrhagic fever, endemic typhus, tularemia, and leptospirosis. Its genome and transcriptome data are lacking in public databases. In this study, total RNA was extracted from live O. bacoti to conduct RNA-seq, functional annotation, coding domain sequence (CDS) prediction and simple sequence repeats (SSRs) detection. The results showed that 65.8 million clean reads were generated and assembled into 72,185 unigenes, of which 49.4% were annotated by seven functional databases. 23,121 unigenes were annotated and assigned to 457 species by non-redundant protein sequence database. The BLAST top-two hit species were Metaseiulus occidentalis and Ixodes scapularis. The procedure detected 12,426 SSRs, of which tri- and di-nucleotides were the most abundant types and the representative motifs were AAT/ATT and AC/GT. 26,936 CDS were predicted with a mean length of 711 bp. 87 unigenes of 30 functional genes, which are usually involved in stress responses, drug resistance, movement, metabolism and allergy, were further identified by bioinformatics methods. The unigenes putatively encoding cytochrome P450 proteins were further analyzed phylogenetically. In conclusion, this study completed the RNA-seq and functional annotation of O. bacoti successfully, which provides reliable molecular data for its future studies of gene function and molecular markers.
Insight into the transcriptome of Arthrobotrys conoides using high throughput sequencing.
Ramesh, Pandit; Reena, Patel; Amitbikram, Mohapatra; Chaitanya, Joshi; Anju, Kunjadia
2015-12-01
Arthrobotrys conoides is a nematode-trapping fungus belonging to Orbiliales, Ascomycota group, and traps prey nematodes by means of adhesive network. Fungus has a potential to be used as a biocontrol agent against plant parasitic nematodes. In the present study, we characterized the transcriptome of A. conoides using high-throughput sequencing technology and characterized its virulence unigenes. Total 7,255 cDNA contigs with an average length of 425 bp were generated and 6184 (61.81%) transcripts were functionally annotated and characterized. Majority of unigenes were found analogous to the genes of plant pathogenic fungi. A total of 1749 transcripts were found to be orthologous with eukaryotic proteins of KOG database. Several carbohydrate active enzymes and peptidases were identified. We also analyzed classically and nonclassically secreted proteins and confirmed by BLASTP against fungal secretome database. A total of 916 contigs were analogous to 556 unique proteins of Pathogen Host Interaction (PHI) database. Further, we identified 91 unigenes homologous to the database of fungal virulence factor (DFVF). A total of 104 putative protein kinases coding transcripts were identified by BLASTP against KinBase database, which are major players in signaling pathways. This study provides a comprehensive look at the transcriptome of A. conoides and the identified unigenes might have a role in catching and killing prey nematodes by A. conoides. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Sun, Yepeng; Wang, Fawei; Wang, Nan; Dong, Yuanyuan; Liu, Qi; Zhao, Lei; Chen, Huan; Liu, Weican; Yin, Hailong; Zhang, Xiaomei; Yuan, Yanxi; Li, Haiyan
2013-01-01
Background Leymus chinensis (Trin.) Tzvel. is a high saline-alkaline tolerant forage grass genus of the tribe Gramineae family, which also plays an important role in protection of natural environment. To date, little is known about the saline-alkaline tolerance of L. chinensis on the molecular level. To better understand the molecular mechanism of saline-alkaline tolerance in L. chinensis, 454 pyrosequencing was used for the transcriptome study. Results We used Roche-454 massive parallel pyrosequencing technology to sequence two different cDNA libraries that were built from the two samples of control and under saline-alkaline treatment (optimal stress concentration-Hoagland solution with 100 mM NaCl and 200 mM NaHCO3). A total of 363,734 reads in control group and 526,267 reads in treatment group with an average length of 489 bp and 493 bp were obtained, respectively. The reads were assembled into 104,105 unigenes with MIRA sequence assemable software, among which, 73,665 unigenes were in control group, 88,016 unigenes in treatment group and 57,576 unigenes in both groups. According to the comparative expression analysis between the two groups with the threshold of “log2 Ratio ≥1”, there were 36,497 up-regulated unegenes and 18,218 down-regulated unigenes predicted to be the differentially expressed genes. After gene annotation and pathway enrichment analysis, most of them were involved in stress and tolerant function, signal transduction, energy production and conversion, and inorganic ion transport. Furthermore, 16 of these differentially expressed genes were selected for real-time PCR validation, and they were successfully confirmed with the results of 454 pyrosequencing. Conclusions This work is the first time to study the transcriptome of L. chinensis under saline-alkaline treatment based on the 454-FLX massively parallel DNA sequencing platform. It also deepened studies on molecular mechanisms of saline-alkaline in L. chinensis, and constituted a database for future studies. PMID:23365637
MELOGEN: an EST database for melon functional genomics
Gonzalez-Ibeas, Daniel; Blanca, José; Roig, Cristina; González-To, Mireia; Picó, Belén; Truniger, Verónica; Gómez, Pedro; Deleu, Wim; Caño-Delgado, Ana; Arús, Pere; Nuez, Fernando; Garcia-Mas, Jordi; Puigdomènech, Pere; Aranda, Miguel A
2007-01-01
Background Melon (Cucumis melo L.) is one of the most important fleshy fruits for fresh consumption. Despite this, few genomic resources exist for this species. To facilitate the discovery of genes involved in essential traits, such as fruit development, fruit maturation and disease resistance, and to speed up the process of breeding new and better adapted melon varieties, we have produced a large collection of expressed sequence tags (ESTs) from eight normalized cDNA libraries from different tissues in different physiological conditions. Results We determined over 30,000 ESTs that were clustered into 16,637 non-redundant sequences or unigenes, comprising 6,023 tentative consensus sequences (contigs) and 10,614 unclustered sequences (singletons). Many potential molecular markers were identified in the melon dataset: 1,052 potential simple sequence repeats (SSRs) and 356 single nucleotide polymorphisms (SNPs) were found. Sixty-nine percent of the melon unigenes showed a significant similarity with proteins in databases. Functional classification of the unigenes was carried out following the Gene Ontology scheme. In total, 9,402 unigenes were mapped to one or more ontology. Remarkably, the distributions of melon and Arabidopsis unigenes followed similar tendencies, suggesting that the melon dataset is representative of the whole melon transcriptome. Bioinformatic analyses primarily focused on potential precursors of melon micro RNAs (miRNAs) in the melon dataset, but many other genes potentially controlling disease resistance and fruit quality traits were also identified. Patterns of transcript accumulation were characterised by Real-Time-qPCR for 20 of these genes. Conclusion The collection of ESTs characterised here represents a substantial increase on the genetic information available for melon. A database (MELOGEN) which contains all EST sequences, contig images and several tools for analysis and data mining has been created. This set of sequences constitutes also the basis for an oligo-based microarray for melon that is being used in experiments to further analyse the melon transcriptome. PMID:17767721
Kaur, Sukhjiwan; Cogan, Noel O I; Pembleton, Luke W; Shinozuka, Maiko; Savin, Keith W; Materne, Michael; Forster, John W
2011-05-25
Lentil (Lens culinaris Medik.) is a cool-season grain legume which provides a rich source of protein for human consumption. In terms of genomic resources, lentil is relatively underdeveloped, in comparison to other Fabaceae species, with limited available data. There is hence a significant need to enhance such resources in order to identify novel genes and alleles for molecular breeding to increase crop productivity and quality. Tissue-specific cDNA samples from six distinct lentil genotypes were sequenced using Roche 454 GS-FLX Titanium technology, generating c. 1.38 × 106 expressed sequence tags (ESTs). De novo assembly generated a total of 15,354 contigs and 68,715 singletons. The complete unigene set was sequence-analysed against genome drafts of the model legume species Medicago truncatula and Arabidopsis thaliana to identify 12,639, and 7,476 unique matches, respectively. When compared to the genome of Glycine max, a total of 20,419 unique hits were observed corresponding to c. 31% of the known gene space. A total of 25,592 lentil unigenes were subsequently annoated from GenBank. Simple sequence repeat (SSR)-containing ESTs were identified from consensus sequences and a total of 2,393 primer pairs were designed. A subset of 192 EST-SSR markers was screened for validation across a panel 12 cultivated lentil genotypes and one wild relative species. A total of 166 primer pairs obtained successful amplification, of which 47.5% detected genetic polymorphism. A substantial collection of ESTs has been developed from sequence analysis of lentil genotypes using second-generation technology, permitting unigene definition across a broad range of functional categories. As well as providing resources for functional genomics studies, the unigene set has permitted significant enhancement of the number of publicly-available molecular genetic markers as tools for improvement of this species.
Yang, Deying; Fu, Yan; Wu, Xuhang; Xie, Yue; Nie, Huaming; Chen, Lin; Nong, Xiang; Gu, Xiaobin; Wang, Shuxian; Peng, Xuerong; Yan, Ning; Zhang, Runhui; Zheng, Wanpeng; Yang, Guangyou
2012-01-01
Background Taenia pisiformis is one of the most common intestinal tapeworms and can cause infections in canines. Adult T. pisiformis (canines as definitive hosts) and Cysticercus pisiformis (rabbits as intermediate hosts) cause significant health problems to the host and considerable socio-economic losses as a consequence. No complete genomic data regarding T. pisiformis are currently available in public databases. RNA-seq provides an effective approach to analyze the eukaryotic transcriptome to generate large functional gene datasets that can be used for further studies. Methodology/Principal Findings In this study, 2.67 million sequencing clean reads and 72,957 unigenes were generated using the RNA-seq technique. Based on a sequence similarity search with known proteins, a total of 26,012 unigenes (no redundancy) were identified after quality control procedures via the alignment of four databases. Overall, 15,920 unigenes were mapped to 203 Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways. Through analyzing the glycolysis/gluconeogenesis and axonal guidance pathways, we achieved an in-depth understanding of the biochemistry of T. pisiformis. Here, we selected four unigenes at random and obtained their full-length cDNA clones using RACE PCR. Functional distribution characteristics were gained through comparing four cestode species (72,957 unigenes of T. pisiformis, 30,700 ESTs of T. solium, 1,058 ESTs of Eg+Em [conserved ESTs between Echinococcus granulosus and Echinococcus multilocularis]), with the cluster of orthologous groups (COG) and gene ontology (GO) functional classification systems. Furthermore, the conserved common genes in these four cestode species were obtained and aligned by the KEGG database. Conclusion This study provides an extensive transcriptome dataset obtained from the deep sequencing of T. pisiformis in a non-model whole genome. The identification of conserved genes may provide novel approaches for potential drug targets and vaccinations against cestode infections. Research can now accelerate into the functional genomics, immunity and gene expression profiles of cestode species. PMID:22514598
Deep insight into the Ganoderma lucidum by comprehensive analysis of its transcriptome.
Yu, Guo-Jun; Wang, Man; Huang, Jie; Yin, Ya-Lin; Chen, Yi-Jie; Jiang, Shuai; Jin, Yan-Xia; Lan, Xian-Qing; Wong, Barry Hon Cheung; Liang, Yi; Sun, Hui
2012-01-01
Ganoderma lucidum is a basidiomycete white rot fungus and is of medicinal importance in China, Japan and other countries in the Asiatic region. To date, much research has been performed in identifying the medicinal ingredients in Ganoderma lucidum. Despite its important therapeutic effects in disease, little is known about Ganoderma lucidum at the genomic level. In order to gain a molecular understanding of this fungus, we utilized Illumina high-throughput technology to sequence and analyze the transcriptome of Ganoderma lucidum. We obtained 6,439,690 and 6,416,670 high-quality reads from the mycelium and fruiting body of Ganoderma lucidum, and these were assembled to form 18,892 and 27,408 unigenes, respectively. A similarity search was performed against the NCBI non-redundant nucleotide database and a customized database composed of five fungal genomes. 11,098 and 8, 775 unigenes were matched to the NCBI non-redundant nucleotide database and our customized database, respectively. All unigenes were subjected to annotation by Gene Ontology, Eukaryotic Orthologous Group terms and Kyoto Encyclopedia of Genes and Genomes. Differentially expressed genes from the Ganoderma lucidum mycelium and fruiting body stage were analyzed, resulting in the identification of 13 unigenes which are involved in the terpenoid backbone biosynthesis pathway. Quantitative real-time PCR was used to confirm the expression levels of these unigenes. Ganoderma lucidum was also studied for wood degrading activity and a total of 22 putative FOLymes (fungal oxidative lignin enzymes) and 120 CAZymes (carbohydrate-active enzymes) were predicted from our Ganoderma lucidum transcriptome. Our study provides comprehensive gene expression information on Ganoderma lucidum at the transcriptional level, which will form the foundation for functional genomics studies in this fungus. The use of Illumina sequencing technology has made de novo transcriptome assembly and gene expression analysis possible in species that lack full genome information.
Xia, Jia; Yang, Lili; Chen, Jialin; Wu, Yuping; Yi, Meisheng
2013-01-01
Background The Indo-Pacific humpback dolphin (Sousa chinensis), a marine mammal species inhabited in the waters of Southeast Asia, South Africa and Australia, has attracted much attention because of the dramatic decline in population size in the past decades, which raises the concern of extinction. So far, this species is poorly characterized at molecular level due to little sequence information available in public databases. Recent advances in large-scale RNA sequencing provide an efficient approach to generate abundant sequences for functional genomic analyses in the species with un-sequenced genomes. Principal Findings We performed a de novo assembly of the Indo-Pacific humpback dolphin leucocyte transcriptome by Illumina sequencing. 108,751 high quality sequences from 47,840,388 paired-end reads were generated, and 48,868 and 46,587 unigenes were functionally annotated by BLAST search against the NCBI non-redundant and Swiss-Prot protein databases (E-value<10−5), respectively. In total, 16,467 unigenes were clustered into 25 functional categories by searching against the COG database, and BLAST2GO search assigned 37,976 unigenes to 61 GO terms. In addition, 36,345 unigenes were grouped into 258 KEGG pathways. We also identified 9,906 simple sequence repeats and 3,681 putative single nucleotide polymorphisms as potential molecular markers in our assembled sequences. A large number of unigenes were predicted to be involved in immune response, and many genes were predicted to be relevant to adaptive evolution and cetacean-specific traits. Conclusion This study represented the first transcriptome analysis of the Indo-Pacific humpback dolphin, an endangered species. The de novo transcriptome analysis of the unique transcripts will provide valuable sequence information for discovery of new genes, characterization of gene expression, investigation of various pathways and adaptive evolution, as well as identification of genetic markers. PMID:24015242
Gui, Duan; Jia, Kuntong; Xia, Jia; Yang, Lili; Chen, Jialin; Wu, Yuping; Yi, Meisheng
2013-01-01
The Indo-Pacific humpback dolphin (Sousa chinensis), a marine mammal species inhabited in the waters of Southeast Asia, South Africa and Australia, has attracted much attention because of the dramatic decline in population size in the past decades, which raises the concern of extinction. So far, this species is poorly characterized at molecular level due to little sequence information available in public databases. Recent advances in large-scale RNA sequencing provide an efficient approach to generate abundant sequences for functional genomic analyses in the species with un-sequenced genomes. We performed a de novo assembly of the Indo-Pacific humpback dolphin leucocyte transcriptome by Illumina sequencing. 108,751 high quality sequences from 47,840,388 paired-end reads were generated, and 48,868 and 46,587 unigenes were functionally annotated by BLAST search against the NCBI non-redundant and Swiss-Prot protein databases (E-value<10(-5)), respectively. In total, 16,467 unigenes were clustered into 25 functional categories by searching against the COG database, and BLAST2GO search assigned 37,976 unigenes to 61 GO terms. In addition, 36,345 unigenes were grouped into 258 KEGG pathways. We also identified 9,906 simple sequence repeats and 3,681 putative single nucleotide polymorphisms as potential molecular markers in our assembled sequences. A large number of unigenes were predicted to be involved in immune response, and many genes were predicted to be relevant to adaptive evolution and cetacean-specific traits. This study represented the first transcriptome analysis of the Indo-Pacific humpback dolphin, an endangered species. The de novo transcriptome analysis of the unique transcripts will provide valuable sequence information for discovery of new genes, characterization of gene expression, investigation of various pathways and adaptive evolution, as well as identification of genetic markers.
Deep Insight into the Ganoderma lucidum by Comprehensive Analysis of Its Transcriptome
Yu, Guo-Jun; Wang, Man; Huang, Jie; Yin, Ya-Lin; Chen, Yi-Jie; Jiang, Shuai; Jin, Yan-Xia; Lan, Xian-Qing; Wong, Barry Hon Cheung; Liang, Yi; Sun, Hui
2012-01-01
Background Ganoderma lucidum is a basidiomycete white rot fungus and is of medicinal importance in China, Japan and other countries in the Asiatic region. To date, much research has been performed in identifying the medicinal ingredients in Ganoderma lucidum. Despite its important therapeutic effects in disease, little is known about Ganoderma lucidum at the genomic level. In order to gain a molecular understanding of this fungus, we utilized Illumina high-throughput technology to sequence and analyze the transcriptome of Ganoderma lucidum. Methodology/Principal Findings We obtained 6,439,690 and 6,416,670 high-quality reads from the mycelium and fruiting body of Ganoderma lucidum, and these were assembled to form 18,892 and 27,408 unigenes, respectively. A similarity search was performed against the NCBI non-redundant nucleotide database and a customized database composed of five fungal genomes. 11,098 and 8, 775 unigenes were matched to the NCBI non-redundant nucleotide database and our customized database, respectively. All unigenes were subjected to annotation by Gene Ontology, Eukaryotic Orthologous Group terms and Kyoto Encyclopedia of Genes and Genomes. Differentially expressed genes from the Ganoderma lucidum mycelium and fruiting body stage were analyzed, resulting in the identification of 13 unigenes which are involved in the terpenoid backbone biosynthesis pathway. Quantitative real-time PCR was used to confirm the expression levels of these unigenes. Ganoderma lucidum was also studied for wood degrading activity and a total of 22 putative FOLymes (fungal oxidative lignin enzymes) and 120 CAZymes (carbohydrate-active enzymes) were predicted from our Ganoderma lucidum transcriptome. Conclusions Our study provides comprehensive gene expression information on Ganoderma lucidum at the transcriptional level, which will form the foundation for functional genomics studies in this fungus. The use of Illumina sequencing technology has made de novo transcriptome assembly and gene expression analysis possible in species that lack full genome information. PMID:22952861
Gaynor, Kaitlyn M; Solomon, Joseph W; Siller, Stefanie; Jessell, Linnet; Duffy, J Emmett; Rubenstein, Dustin R
2017-11-01
Molecular markers are powerful tools for studying patterns of relatedness and parentage within populations and for making inferences about social evolution. However, the development of molecular markers for simultaneous study of multiple species presents challenges, particularly when species exhibit genome duplication or polyploidy. We developed microsatellite markers for Synalpheus shrimp, a genus in which species exhibit not only great variation in social organization, but also interspecific variation in genome size and partial genome duplication. From the four primary clades within Synalpheus, we identified microsatellites in the genomes of four species and in the consensus transcriptome of two species. Ultimately, we designed and tested primers for 143 microsatellite markers across 25 species. Although the majority of markers were disomic, many markers were polysomic for certain species. Surprisingly, we found no relationship between genome size and the number of polysomic markers. As expected, markers developed for a given species amplified better for closely related species than for more distant relatives. Finally, the markers developed from the transcriptome were more likely to work successfully and to be disomic than those developed from the genome, suggesting that consensus transcriptomes are likely to be conserved across species. Our findings suggest that the transcriptome, particularly consensus sequences from multiple species, can be a valuable source of molecular markers for taxa with complex, duplicated genomes. © 2017 John Wiley & Sons Ltd.
Kujur, Alice; Bajaj, Deepak; Saxena, Maneesha S.; Tripathi, Shailesh; Upadhyaya, Hari D.; Gowda, C.L.L.; Singh, Sube; Jain, Mukesh; Tyagi, Akhilesh K.; Parida, Swarup K.
2013-01-01
We developed 1108 transcription factor gene-derived microsatellite (TFGMS) and 161 transcription factor functional domain-associated microsatellite (TFFDMS) markers from 707 TFs of chickpea. The robust amplification efficiency (96.5%) and high intra-specific polymorphic potential (34%) detected by markers suggest their immense utilities in efficient large-scale genotyping applications, including construction of both physical and functional transcript maps and understanding population structure. Candidate gene-based association analysis revealed strong genetic association of TFFDMS markers with three major seed and pod traits. Further, TFGMS markers in the 5′ untranslated regions of TF genes showing differential expression during seed development had higher trait association potential. The significance of TFFDMS markers was demonstrated by correlating their allelic variation with amino acid sequence expansion/contraction in the functional domain and alteration of secondary protein structure encoded by genes. The seed weight-associated markers were validated through traditional bi-parental genetic mapping. The determination of gene-specific linkage disequilibrium (LD) patterns in desi and kabuli based on single nucleotide polymorphism-microsatellite marker haplotypes revealed extended LD decay, enhanced LD resolution and trait association potential of genes. The evolutionary history of a strong seed-size/weight-associated TF based on natural variation and haplotype sharing among desi, kabuli and wild unravelled useful information having implication for seed-size trait evolution during chickpea domestication. PMID:23633531
Topdar, N; Kundu, A; Sinha, M K; Sarkar, D; Das, M; Banerjee, S; Kar, C S; Satya, P; Balyan, H S; Mahapatra, B S; Gupta, P K
2013-01-01
We report the first complete microsatellite genetic map of jute (Corchorus olitorius L.; 2n = 2x = 14) using an F6 recombinant inbred population. Of the 403 microsatellite markers screened, 82 were mapped on the seven linkage groups (LGs) that covered a total genetic distance of 799.9 cM, with an average marker interval of 10.7 cM. LG5 had the longest and LG7 the shortest genetic lengths, whereas LG1 had the maximum and LG7 the minimum number of markers. Segregation distortion of microsatellite loci was high (61%), with the majority of them (76%) skewed towards the female parent. Genomewide non-parametric single-marker analysis in combination with multiple quantitative trait loci (QTL)-models (MQM) mapping detected 26 definitive QTLs for bast fibre quality, yield and yield-related traits. These were unevenly distributed on six LGs, as colocalized clusters, at genomic sectors marked by 15 microsatellite loci. LG1 was the QTL-richest map sector, with the densest colocalized clusters of QTLs governing fibre yield, yield-related traits and tensile strength. Expectedly, favorable QTLs were derived from the desirable parents, except for nearly all of those of fibre fineness, which might be due to the creation of new gene combinations. Our results will be a good starting point for further genome analyses in jute.
Woerner, Stefan M.; Yuan, Yan P.; Benner, Axel; Korff, Sebastian; von Knebel Doeberitz, Magnus; Bork, Peer
2010-01-01
About 15% of human colorectal cancers and, at varying degrees, other tumor entities as well as nearly all tumors related to Lynch syndrome are hallmarked by microsatellite instability (MSI) as a result of a defective mismatch repair system. The functional impact of resulting mutations depends on their genomic localization. Alterations within coding mononucleotide repeat tracts (MNRs) can lead to protein truncation and formation of neopeptides, whereas alterations within untranslated MNRs can alter transcription level or transcript stability. These mutations may provide selective advantage or disadvantage to affected cells. They may further concern the biology of microsatellite unstable cells, e.g. by generating immunogenic peptides induced by frameshifts mutations. The Selective Targets database (http://www.seltarbase.org) is a curated database of a growing number of public MNR mutation data in microsatellite unstable human tumors. Regression calculations for various MSI–H tumor entities indicating statistically deviant mutation frequencies predict TGFBR2, BAX, ACVR2A and others that are shown or highly suspected to be involved in MSI tumorigenesis. Many useful tools for further analyzing genomic DNA, derived wild-type and mutated cDNAs and peptides are integrated. A comprehensive database of all human coding, untranslated, non-coding RNA- and intronic MNRs (MNR_ensembl) is also included. Herewith, SelTarbase presents as a plenty instrument for MSI-carcinogenesis-related research, diagnostics and therapy. PMID:19820113
USDA-ARS?s Scientific Manuscript database
Population genetic analysis of genotypes comprised of seven microsatellite loci revealed clonal genetic patterns in each of four populations of the protistan estuarine parasite Perkinsus marinus. Each locus was amplified directly from DNA extracted from infected oysters collected from four geographi...
Isolation of mini- and microsatellite loci from chromosome 19 library
DOE Office of Scientific and Technical Information (OSTI.GOV)
Prosnyak, M.I.; Belajeva, O.V.; Polukarova, L.G.
Mini- and microsatellite sequences are abundant in the human genome and are very useful as genetic markers. We report the isolation of a panel of clones containing marker sequences from chromosome 19. We screened 10,000 clones from the chromosome 19 cosmid library for the presence of di-(CA)n, tri-(TCC)n, (CAC)n microsatellites and M13-like minisatellite sequences. For this we have used synthetic oligonucleotides and polynucleotides, including micro- (CA, TCC, CAC) and minisatellite (M13 core) sequences. Preliminary results indicated that the chromosome 19 cosmid library contained both human and hamster clones. In order to identify human sequences from this library we have developedmore » the technique of colony and blot hybridization with Alu-PCR, L1-PCR and B1-PCR probes. Dozens of clones have been selected, some of which were analyzed by conventional Southern blot analysis and non-radioactive in situ hybridization of chromosomes. Highly informative markers derived from these clones will be used for physical and genetic mapping of chromosome 19.« less
Sun, Xiaoning; Cai, Ruibo; Jin, Xuelin; Shafer, Aaron B A; Hu, Xiaolong; Yang, Shuang; Li, Yimeng; Qi, Lei; Liu, Shuqiang; Hu, Defu
2018-01-12
Forest musk deer (Moschus berezovskii; FMD) are both economically valuable and highly endangered. A problem for FMD captive breeding programs has been the susceptibility of FMD to abscesses. To investigate the mechanisms of abscess development in FMD, the blood transcriptomes of three purulent and three healthy individuals were generated. A total of ~39.68 Gb bases were generated using Illumina HiSeq 4000 sequencing technology and 77,752 unigenes were identified after assembling. All the unigenes were annotated, with 63,531 (81.71%) mapping to at least one database. Based on these functional annotations, 45,798 coding sequences (CDS) were detected, along with 12,697 simple sequence repeats (SSRs) and 65,536 single nucleotide polymorphisms (SNPs). A total of 113 unigenes were found to be differentially expressed between healthy and purulent individuals. Functional annotation indicated that most of these differentially expressed genes were involved in the regulation of immune system processes, particularly those associated with parasitic and bacterial infection pathways.
Zhang, Yu-Juan; Hao, Youjin; Si, Fengling; Ren, Shuang; Hu, Ganyu; Shen, Li; Chen, Bin
2014-03-10
The onion maggot Delia antiqua is a major insect pest of cultivated vegetables, especially the onion, and a good model to investigate the molecular mechanisms of diapause. To better understand the biology and diapause mechanism of the insect pest species, D. antiqua, the transcriptome was sequenced using Illumina paired-end sequencing technology. Approximately 54 million reads were obtained, trimmed, and assembled into 29,659 unigenes, with an average length of 607 bp and an N50 of 818 bp. Among these unigenes, 21,605 (72.8%) were annotated in the public databases. All unigenes were then compared against Drosophila melanogaster and Anopheles gambiae. Codon usage bias was analyzed and 332 simple sequence repeats (SSRs) were detected in this organism. These data represent the most comprehensive transcriptomic resource currently available for D. antiqua and will facilitate the study of genetics, genomics, diapause, and further pest control of D. antiqua. Copyright © 2014 Zhang et al.
Shah, Faheem Afzal; Wang, Qiaojian; Wang, Zhaocheng; Wu, Lifang
2018-01-01
Pecan is an economically important nut crop tree due to its unique texture and flavor properties. The pecan seed is rich of unsaturated fatty acid and protein. However, little is known about the molecular mechanisms of the biosynthesis of fatty acids in the developing seeds. In this study, transcriptome sequencing of the developing seeds was performed using Illumina sequencing technology. Pecan seed embryos at different developmental stages were collected and sequenced. The transcriptomes of pecan seeds at two key developing stages (PA, the initial stage and PS, the fast oil accumulation stage) were also compared. A total of 82,155 unigenes, with an average length of 1,198 bp from seven independent libraries were generated. After functional annotations, we detected approximately 55,854 CDS, among which, 2,807 were Transcription Factor (TF) coding unigenes. Further, there were 13,325 unigenes that showed a 2-fold or greater expression difference between the two groups of libraries (two developmental stages). After transcriptome analysis, we identified abundant unigenes that could be involved in fatty acid biosynthesis, degradation and some other aspects of seed development in pecan. This study presents a comprehensive dataset of transcriptomic changes during the seed development of pecan. It provides insights in understanding the molecular mechanisms responsible for fatty acid biosynthesis in the seed development. The identification of functional genes will also be useful for the molecular breeding work of pecan. PMID:29694395
Xu, Zheng; Ni, Jun; Shah, Faheem Afzal; Wang, Qiaojian; Wang, Zhaocheng; Wu, Lifang; Fu, Songling
2018-01-01
Pecan is an economically important nut crop tree due to its unique texture and flavor properties. The pecan seed is rich of unsaturated fatty acid and protein. However, little is known about the molecular mechanisms of the biosynthesis of fatty acids in the developing seeds. In this study, transcriptome sequencing of the developing seeds was performed using Illumina sequencing technology. Pecan seed embryos at different developmental stages were collected and sequenced. The transcriptomes of pecan seeds at two key developing stages (PA, the initial stage and PS, the fast oil accumulation stage) were also compared. A total of 82,155 unigenes, with an average length of 1,198 bp from seven independent libraries were generated. After functional annotations, we detected approximately 55,854 CDS, among which, 2,807 were Transcription Factor (TF) coding unigenes. Further, there were 13,325 unigenes that showed a 2-fold or greater expression difference between the two groups of libraries (two developmental stages). After transcriptome analysis, we identified abundant unigenes that could be involved in fatty acid biosynthesis, degradation and some other aspects of seed development in pecan. This study presents a comprehensive dataset of transcriptomic changes during the seed development of pecan. It provides insights in understanding the molecular mechanisms responsible for fatty acid biosynthesis in the seed development. The identification of functional genes will also be useful for the molecular breeding work of pecan.
Rebeca, Carballar-Lejarazú; Zhu, Xiaoli; Guo, Yajie; Lin, Qiannan; Hu, Xia; Wang, Rong; Liang, Guanghong; Guan, Xiong
2017-01-01
The pine aphid Cinara pinitabulaeformis Zhang et Zhang is the main pine pest in China, it causes pine needles to produce dense dew (honeydew) which can lead to sooty mold (black filamentous saprophytic ascomycetes). Although common chemical and physical strategies are used to prevent the disease caused by C. pinitabulaeformis Zhang et Zhang, new strategies based on biological and/or genetic approaches are promising to control and eradicate the disease. However, there is no information about genomics, proteomics or transcriptomics to allow the design of new control strategies for this pine aphid. We used next generation sequencing technology to sequence the transcriptome of C. pinitabulaeformis Zhang et Zhang and built a transcriptome database. We identified 80,259 unigenes assigned for Gene Ontology (GO) terms and information for a total of 11,609 classified unigenes was obtained in the Clusters of Orthologous Groups (COGs). A total of 10,806 annotated unigenes were analyzed to identify the represented biological pathways, among them 8,845 unigenes matched with 228 KEGG pathways. In addition, our data describe propagative viruses, nutrition-related genes, detoxification related molecules, olfactory related receptors, stressed-related protein, putative insecticide resistance genes and possible insecticide targets. Moreover, this study provides valuable information about putative insecticide resistance related genes and for the design of new genetic/biological based strategies to manage and control C. pinitabulaeformis Zhang et Zhang populations. PMID:28570707
Cui, Kai; Wang, Haiying; Liao, Shengxi; Tang, Qi; Li, Li; Cui, Yongzhong; He, Yuan
2016-01-01
Dendrocalamus sinicus is the world’s largest bamboo species with strong woody culms, and known for its fast-growing culms. As an economic bamboo species, it was popularized for multi-functional applications including furniture, construction, and industrial paper pulp. To comprehensively elucidate the molecular processes involved in its culm elongation, Illumina paired-end sequencing was conducted. About 65.08 million high-quality reads were produced, and assembled into 81,744 unigenes with an average length of 723 bp. A total of 64,338 (79%) unigenes were annotated for their functions, of which, 56,587 were annotated in the NCBI non-redundant protein database and 35,262 were annotated in the Swiss-Prot database. Also, 42,508 and 21,009 annotated unigenes were allocated to gene ontology (GO) categories and clusters of orthologous groups (COG), respectively. By searching against the Kyoto Encyclopedia of Genes and Genomes Pathway database (KEGG), 33,920 unigenes were assigned to 128 KEGG pathways. Meanwhile, 8,553 simple sequence repeats (SSRs) and 81,534 single-nucleotide polymorphism (SNPs) were identified, respectively. Additionally, 388 transcripts encoding lignin biosynthesis were detected, among which, 27 transcripts encoding Shikimate O-hydroxycinnamoyltransferase (HCT) specifically expressed in D. sinicus when compared to other bamboo species and rice. The phylogenetic relationship between D. sinicus and other plants was analyzed, suggesting functional diversity of HCT unigenes in D. sinicus. We conjectured that HCT might lead to the high lignin content and giant culm. Given that the leaves are not yet formed and culm is covered with sheaths during culm elongation, the existence of photosynthesis of bamboo culm is usually neglected. Surprisedly, 109 transcripts encoding photosynthesis were identified, including photosystem I and II, cytochrome b6/f complex, photosynthetic electron transport and F-type ATPase, and 24 transcripts were characterized as antenna proteins that regarded as the main tool for capturing light of plants, implying stem photosynthesis plays a key role during culm elongation due to the unavailability of its leaf. By real-time quantitative PCR, the expression level of 6 unigenes was detected. The results showed the expression level of all genes accorded with the transcriptome data, which confirm the reliability of the transcriptome data. As we know, this is the first study underline the D. sinicus transcriptome, which will deepen the understanding of the molecular mechanisms of culm development. The results may help variety improvement and resource utilization of bamboos. PMID:27304219
Cui, Kai; Wang, Haiying; Liao, Shengxi; Tang, Qi; Li, Li; Cui, Yongzhong; He, Yuan
2016-01-01
Dendrocalamus sinicus is the world's largest bamboo species with strong woody culms, and known for its fast-growing culms. As an economic bamboo species, it was popularized for multi-functional applications including furniture, construction, and industrial paper pulp. To comprehensively elucidate the molecular processes involved in its culm elongation, Illumina paired-end sequencing was conducted. About 65.08 million high-quality reads were produced, and assembled into 81,744 unigenes with an average length of 723 bp. A total of 64,338 (79%) unigenes were annotated for their functions, of which, 56,587 were annotated in the NCBI non-redundant protein database and 35,262 were annotated in the Swiss-Prot database. Also, 42,508 and 21,009 annotated unigenes were allocated to gene ontology (GO) categories and clusters of orthologous groups (COG), respectively. By searching against the Kyoto Encyclopedia of Genes and Genomes Pathway database (KEGG), 33,920 unigenes were assigned to 128 KEGG pathways. Meanwhile, 8,553 simple sequence repeats (SSRs) and 81,534 single-nucleotide polymorphism (SNPs) were identified, respectively. Additionally, 388 transcripts encoding lignin biosynthesis were detected, among which, 27 transcripts encoding Shikimate O-hydroxycinnamoyltransferase (HCT) specifically expressed in D. sinicus when compared to other bamboo species and rice. The phylogenetic relationship between D. sinicus and other plants was analyzed, suggesting functional diversity of HCT unigenes in D. sinicus. We conjectured that HCT might lead to the high lignin content and giant culm. Given that the leaves are not yet formed and culm is covered with sheaths during culm elongation, the existence of photosynthesis of bamboo culm is usually neglected. Surprisedly, 109 transcripts encoding photosynthesis were identified, including photosystem I and II, cytochrome b6/f complex, photosynthetic electron transport and F-type ATPase, and 24 transcripts were characterized as antenna proteins that regarded as the main tool for capturing light of plants, implying stem photosynthesis plays a key role during culm elongation due to the unavailability of its leaf. By real-time quantitative PCR, the expression level of 6 unigenes was detected. The results showed the expression level of all genes accorded with the transcriptome data, which confirm the reliability of the transcriptome data. As we know, this is the first study underline the D. sinicus transcriptome, which will deepen the understanding of the molecular mechanisms of culm development. The results may help variety improvement and resource utilization of bamboos.
Jiang, Xinyi; Fitch, Sergio; Wang, Christine; Wilson, Christy; Li, Jianfeng; Grant, Gerald A.; Yang, Fan
2016-01-01
Glioblastoma multiforme (GBM) is one of the most intractable of human cancers, principally because of the highly infiltrative nature of these neoplasms. Tracking and eradicating infiltrating GBM cells and tumor microsatellites is of utmost importance for the treatment of this devastating disease, yet effective strategies remain elusive. Here we report polymeric nanoparticle-engineered human adipose-derived stem cells (hADSCs) overexpressing tumor necrosis factor-related apoptosis-inducing ligand (TRAIL) as drug-delivery vehicles for targeting and eradicating GBM cells in vivo. Our results showed that polymeric nanoparticle-mediated transfection led to robust up-regulation of TRAIL in hADSCs, and that TRAIL-expressing hADSCs induced tumor-specific apoptosis. When transplanted in a mouse intracranial xenograft model of patient-derived glioblastoma cells, hADSCs exhibited long-range directional migration and infiltration toward GBM tumor. Importantly, TRAIL-overexpressing hADSCs inhibited GBM growth, extended survival, and reduced the occurrence of microsatellites. Repetitive injection of TRAIL-overexpressing hADSCs significantly prolonged animal survival compared with single injection of these cells. Taken together, our data suggest that nanoparticle-engineered TRAIL-expressing hADSCs exhibit the therapeutically relevant behavior of “seek-and-destroy” tumortropic migration and could be a promising therapeutic approach to improve the treatment outcomes of patients with malignant brain tumors. PMID:27849590
USDA-ARS?s Scientific Manuscript database
Cacao (Theobroma cacao L.), the tree from which cocoa butter and chocolate is derived, is conserved in field genebanks. The largest of these ex situ collections in the public domain is the International Cocoa Genebank, Trinidad (ICG,T). Reduction of genetic redundancy is essential to improve the acc...
USDA-ARS?s Scientific Manuscript database
Polymorphic genetic markers were identified and characterized using a partial genomic library of Heliothis virescens enriched for simple sequence repeats (SSR) and nucleotide sequences of expressed sequence tags (EST). Nucleotide sequences of 192 clones from the partial genomic library yielded 147 u...
Silva, P R O; Jesus, O N J; Creste, S; Figueira, A; Amorim, E P; Ferreira, C F
2015-09-25
Microsatellite markers have been widely used in the quantification of genetic variability and for genetic breeding in Musa spp. The objective of the present study was to evaluate the discriminatory power of microsatellite markers derived from 'Calcutta 4' and 'Ouro' genomic libraries, and to analyze the genetic variability among 30 banana accessions. Thirty-eight markers were used: 15 from the 'Ouro' library and 23 from the 'Calcutta 4' library. Genetic diversity was evaluated by considering SSR markers as both dominant markers because of the presence of triploid accessions, and co-dominant markers. For the dominant analysis, polymorphism information content (PIC) values for 44 polymorphic markers ranged from 0.063 to 0.533, with a mean value of 0.24. A dendrogram analysis separated the BGB-Banana accessions into 4 groups: the 'Ouro' and 'Muísa Tia' accessions were the most dissimilar (93% dissimilarity), while the most similar accessions were 'Pacovan' and 'Walha'. The mean genetic distance between samples was 0.74. For the analysis considering SSR markers as co-dominants, using only diploid accessions, two groups were separated based on their genome contents (A and B). The PIC values for the markers from the 'Calcutta 4' library varied from 0.4836 to 0.7886, whereas those from the 'Ouro' library ranged from 0.3800 to 0.7521. Given the high PIC values, the markers from both the libraries showed high discriminatory power, and can therefore be widely applied for analysis of genetic diversity, population structures, and linkage mapping in Musa spp.
Weckworth, Byron V; Musiani, Marco; McDevitt, Allan D; Hebblewhite, Mark; Mariani, Stefano
2012-07-01
The role of Beringia as a refugium and route for trans-continental exchange of fauna during glacial cycles of the past 2million years are well documented; less apparent is its contribution as a significant reservoir of genetic diversity. Using mitochondrial DNA sequences and 14 microsatellite loci, we investigate the phylogeographic history of caribou (Rangifer tarandus) in western North America. Patterns of genetic diversity reveal two distinct groups of caribou. Caribou classified as a Northern group, of Beringian origin, exhibited greater number and variability in mtDNA haplotypes compared to a Southern group originating from refugia south of glacial ice. Results indicate that subspecies R. t. granti of Alaska and R. t. groenlandicus of northern Canada do not constitute distinguishable units at mtDNA or microsatellites, belying their current status as separate subspecies. Additionally, the Northern Mountain ecotype of woodland caribou (presently R. t. caribou) has closer kinship to caribou classified as granti or groenlandicus. Comparisons of mtDNA and microsatellite data suggest that behavioural and ecological specialization is a more recently derived life history characteristic. Notably, microsatellite differentiation among Southern herds is significantly greater, most likely as a result of human-induced landscape fragmentation and genetic drift due to smaller population sizes. These results not only provide important insight into the evolutionary history of northern species such as caribou, but also are important indicators for managers evaluating conservation measures for this threatened species. © 2012 Blackwell Publishing Ltd.
Global characterization of Artemisia annua glandular trichome transcriptome using 454 pyrosequencing
Wang, Wei; Wang, Yejun; Zhang, Qing; Qi, Yan; Guo, Dianjing
2009-01-01
Background Glandular trichomes produce a wide variety of commercially important secondary metabolites in many plant species. The most prominent anti-malarial drug artemisinin, a sesquiterpene lactone, is produced in glandular trichomes of Artemisia annua. However, only limited genomic information is currently available in this non-model plant species. Results We present a global characterization of A. annua glandular trichome transcriptome using 454 pyrosequencing. Sequencing runs using two normalized cDNA collections from glandular trichomes yielded 406,044 expressed sequence tags (average length = 210 nucleotides), which assembled into 42,678 contigs and 147,699 singletons. Performing a second sequencing run only increased the number of genes identified by ~30%, indicating that massively parallel pyrosequencing provides deep coverage of the A. annua trichome transcriptome. By BLAST search against the NCBI non-redundant protein database, putative functions were assigned to over 28,573 unigenes, including previously undescribed enzymes likely involved in sesquiterpene biosynthesis. Comparison with ESTs derived from trichome collections of other plant species revealed expressed genes in common functional categories across different plant species. RT-PCR analysis confirmed the expression of selected unigenes and novel transcripts in A. annua glandular trichomes. Conclusion The presence of contigs corresponding to enzymes for terpenoids and flavonoids biosynthesis suggests important metabolic activity in A. annua glandular trichomes. Our comprehensive survey of genes expressed in glandular trichome will facilitate new gene discovery and shed light on the regulatory mechanism of artemisinin metabolism and trichome function in A. annua. PMID:19818120
2014-01-01
Background Syntrichia caninervis is a desiccation-tolerant moss and the dominant bryophyte of the Biological Soil Crusts (BSCs) found in the Mojave and Gurbantunggut deserts. Next generation high throughput sequencing technologies offer an efficient and economic choice for characterizing non-model organism transcriptomes with little or no prior molecular information available. Results In this study, we employed next generation, high-throughput, Illumina RNA-Seq to analyze the poly-(A) + mRNA from hydrated, dehydrating and desiccated S. caninervis gametophores. Approximately 58.0 million paired-end short reads were obtained and 92,240 unigenes were assembled with an average size of 493 bp, N50 value of 662 bp and a total size of 45.48 Mbp. Sequence similarity searches against five public databases (NR, Swiss-Prot, COSMOSS, KEGG and COG) found 54,125 unigenes (58.7%) with significant similarity to an existing sequence (E-value ≤ 1e-5) and could be annotated. Gene Ontology (GO) annotation assigned 24,183 unigenes to the three GO terms: Biological Process, Cellular Component or Molecular Function. GO comparison between P. patens and S. caninervis demonstrated similar sequence enrichment across all three GO categories. 29,370 deduced polypeptide sequences were assigned Pfam domain information and categorized into 4,212 Pfam domains/families. Using the PlantTFDB, 778 unigenes were predicted to be involved in the regulation of transcription and were classified into 49 transcription factor families. Annotated unigenes were mapped to the KEGG pathways and further annotated using MapMan. Comparative genomics revealed that 44% of protein families are shared in common by S. caninervis, P. patens and Arabidopsis thaliana and that 80% are shared by both moss species. Conclusions This study is one of the first comprehensive transcriptome analyses of the moss S. caninervis. Our data extends our knowledge of bryophyte transcriptomes, provides an insight to plants adapted to the arid regions of central Asia, and continues the development of S. caninervis as a model for understanding the molecular aspects of desiccation-tolerance. PMID:25086984
Zhou, Fan; Wang, Guirong; An, Chunju
2014-01-01
Background The Asian corn borer (Ostrinia furnacalis (Guenée)) is one of the most serious corn pests in Asia. Control of this pest with entomopathogenic fungus Beauveria bassiana has been proposed. However, the molecular mechanisms involved in the interactions between O. furnacalis and B. bassiana are unclear, especially under the conditions that the genomic information of O. furnacalis is currently unavailable. So we sequenced and characterized the transcriptome of O. furnacalis larvae infected by B. bassiana with special emphasis on immunity-related genes. Methodology/Principal Findings Illumina Hiseq2000 was used to sequence 4.64 and 4.72 Gb of the transcriptome from water-injected and B. bassiana-injected O. furnacalis larvae, respectively. De novo assembly generated 62,382 unigenes with mean length of 729 nt. All unigenes were searched against Nt, Nr, Swiss-Prot, COG, and KEGG databases for annotations using BLASTN or BLASTX algorithm with an E-value cut-off of 10−5. A total of 35,700 (57.2%) unigenes were annotated to at least one database. Pairwise comparisons resulted in 13,890 differentially expressed genes, with 5,843 up-regulated and 8,047 down-regulated. Based on sequence similarity to homologs known to participate in immune responses, we totally identified 190 potential immunity-related unigenes. They encode 45 pattern recognition proteins, 33 modulation proteins involved in the prophenoloxidase activation cascade, 46 signal transduction molecules, and 66 immune responsive effectors, respectively. The obtained transcriptome contains putative orthologs for nearly all components of the Toll, Imd, and JAK/STAT pathways. We randomly selected 24 immunity-related unigenes and investigated their expression profiles using quantitative RT-PCR assay. The results revealed variant expression patterns in response to the infection of B. bassiana. Conclusions/Significance This study provides the comprehensive sequence resource and expression profiles of the immunity-related genes of O. furnacalis. The obtained data gives an insight into better understanding the molecular mechanisms of innate immune processes in O. furnacalis larvae against B. bassiana. PMID:24466095
Yang, Wei; Yang, Chunping; Zhang, Jin; Yang, Yang; Wang, Baoxin; Guan, Fengrong
2018-01-01
The white-striped longhorn beetle Batocera horsfieldi (Coleoptera: Cerambycidae) is a polyphagous wood-boring pest that causes substantial damage to the lumber industry. Moreover olfactory proteins are crucial components to function in related processes, but the B. horsfieldi genome is not readily available for olfactory proteins analysis. In the present study, developmental transcriptomes of larvae from the first instar to the prepupal stage, pupae, and adults (females and males) from emergence to mating were built by RNA sequencing to establish a genetic background that may help understand olfactory genes. Approximately 199 million clean reads were obtained and assembled into 171,664 transcripts, which were classified into 23,380, 26,511, 22,393, 30,270, and 87, 732 unigenes for larvae, pupae, females, males, and combined datasets, respectively. The unigenes were annotated against NCBI’s non-redundant nucleotide and protein sequences, Swiss-Prot, Gene Ontology (GO), Pfam, Clusters of Eukaryotic Orthologous Groups (KOG), and KEGG Orthology (KO) databases. A total of 43,197 unigenes were annotated into 55 sub-categories under the three main GO categories; 25,237 unigenes were classified into 26 functional KOG categories, and 25,814 unigenes were classified into five functional KEGG Pathway categories. RSEM software identified 2,983, 3,097, 870, 2,437, 5,161, and 2,882 genes that were differentially expressed between larvae and males, larvae and pupae, larvae and females, males and females, males and pupae, and females and pupae, respectively. Among them, genes encoding seven candidate odorant binding proteins (OBPs) and three chemosensory proteins (CSPs) were identified. RT-PCR and RT-qPCR analyses showed that BhorOBP3, BhorCSP2, and BhorOBPC1/C3/C4 were highly expressed in the antenna of males, indicating these genes may may play key roles in foraging and host-orientation in B. horsfieldi. Our results provide valuable molecular information about the olfactory system in B. horsfieldi and will help guide future functional studies on olfactory genes. PMID:29474419
Niu, Jun; An, Jiyong; Wang, Libing; Fang, Chengliang; Ha, Denglong; Fu, Chengyu; Qiu, Lin; Yu, Haiyan; Zhao, Haiyan; Hou, Xinyu; Xiang, Zheng; Zhou, Sufan; Zhang, Zhixiang; Feng, Xinyi; Lin, Shanzhi
2015-01-01
Siberian apricot (Prunus sibirica L.) has emerged as a novel potential source of biodiesel in China, but the molecular regulatory mechanism of oil accumulation in Siberian apricot seed kernels (SASK) is still unknown at present. To better develop SASK oil as woody biodiesel, it is essential to profile transcriptome and to identify the full repertoire of potential unigenes involved in the formation and accumulation of oil SASK during the different developing stages. We firstly detected the temporal patterns for oil content and fatty acid (FA) compositions of SASK in 7 different developing stages. The best time for obtaining the high quality and quantity of SASK oil was characterized at 60 days after flowering (DAF), and the representative periods (10, 30, 50, 60, and 70 DAF) were selected for transcriptomic analysis. By Illumina/Solexa sequencings, approximately 65 million short reads (average length = 96 bp) were obtained, and then assembled into 124,070 unigenes by Trinity strategy (mean size = 829.62 bp). A total of 3,000, 2,781, 2,620, and 2,675 differentially expressed unigenes were identified at 30, 50, 60, and 70 DAF (10 DAF as the control) by DESeq method, respectively. The relationship between the unigene transcriptional profiles and the oil dynamic patterns in developing SASK was comparatively analyzed, and the specific unigenes encoding some known enzymes and transcription factors involved in acetyl-coenzyme A (acetyl-CoA) formation and oil accumulation were determined. Additionally, 5 key metabolic genes implicated in SASK oil accumulation were experimentally validated by quantitative real-time PCR (qRT-PCR). Our findings could help to construction of oil accumulated pathway and to elucidate the molecular regulatory mechanism of increased oil production in developing SASK. This is the first study of oil temporal patterns, transcriptome sequencings, and differential profiles in developing SASK. All our results will serve as the important foundation to further deeply explore the regulatory mechanism of SASK high-quality oil accumulation, and may also provide some reference for researching the woody biodiesel plants.
De Novo Adult Transcriptomes of Two European Brittle Stars: Spotlight on Opsin-Based Photoreception
Mallefet, Jérôme; Flammang, Patrick
2016-01-01
Next generation sequencing (NGS) technology allows to obtain a deeper and more complete view of transcriptomes. For non-model or emerging model marine organisms, NGS technologies offer a great opportunity for rapid access to genetic information. In this study, paired-end Illumina HiSeqTM technology has been employed to analyse transcriptomes from the arm tissues of two European brittle star species, Amphiura filiformis and Ophiopsila aranea. About 48 million Illumina reads were generated and 136,387 total unigenes were predicted from A. filiformis arm tissues. For O. aranea arm tissues, about 47 million reads were generated and 123,324 total unigenes were obtained. Twenty-four percent of the total unigenes from A. filiformis show significant matches with sequences present in reference online databases, whereas, for O. aranea, this percentage amounts to 23%. In both species, around 50% of the predicted annotated unigenes were significantly similar to transcripts from the purple sea urchin, the closest species to date that has undergone complete genome sequencing and annotation. GO, COG and KEGG analyses were performed on predicted brittle star unigenes. We focused our analyses on the phototransduction actors involved in light perception. Firstly, two new echinoderm opsins were identified in O. aranea: one rhabdomeric opsin (homologous to vertebrate melanopsin) and one RGR opsin. The RGR-opsin is supposed to be involved in retinal regeneration while the r-opsin is suspected to play a role in visual-like behaviour. Secondly, potential phototransduction actors were identified in both transcriptomes using the fly (rhabdomeric) and mammal (ciliary) classical phototransduction pathways as references. Finally, the sensitivity of O.aranea to monochromatic light was investigated to complement data available for A. filiformis. The presence of microlens-like structures at the surface of dorsal arm plate of O. aranea could potentially explain phototactic behaviour differences between the two species. The results confirm (i) the ability of these brittle stars to perceive light using opsin-based photoreception, (ii) suggest the co-occurrence of both rhabdomeric and ciliary photoreceptors, and (iii) emphasise the complexity of light perception in this echinoderm class. PMID:27119739
Hwang, Jee Youn; Markkandan, Kesavan; Kwon, Mun Gyeong; Seo, Jung Soo; Yoo, Seung-Il; Hwang, Seong Don; Son, Maeng-Hyun; Park, Junhyung
2018-05-01
Olive flounder (Paralichthys olivaceus) is one of the most valuable marine aquatic species in South Korea and faces tremendous exposure to the viral hemorrhagic septicemia virus (VHSV). Given the growing importance of flounder, it is therefore essential to understand the host defense of P. olivaceus against VHSV infection, but studies on its immune mechanism are hindered by the lack of genomic resources. In this study, the P. olivaceus was infected with disease-causing VHSV isolates, ADC-VHS2012-11 and ADC-VHS2014-5 which showed moderate virulent (20% mortality) and high virulent (65% mortality), in order to investigate the effect of difference in pathogenicity in head kidney during 1, 3, 7 days of post-infection using Illumina sequencing. After removing low-quality sequences, we obtained 144,933,160 high quality reads from thirty-six libraries which were further assembled into 53,384 unigenes with an average length of 563 bp with a range of 200 to 9605 bp. Transcriptome annotation revealed that 30,475 unigenes with a cut-off e-value of 10 -5 were functionally annotated. In total, 10,046 unigenes were clustered into 26 functional categories by searching against the eggNOG database, and 22,233 unigenes to 52 GO terms. In addition, 12,985 unigenes were grouped into 387 KEGG pathways. Among the 13,270 differently expressed genes, 6578 and 6692 were differentially expressed only in moderate and high virulent, respectively. Based on our sequence analysis, many candidate genes with fundamental roles in innate immune system including, pattern recognition receptors (TLRs & RLRs), Mx, complement proteins, lectins, and cytokines (chemokines, IFN, IRF, IL, TRF) were differentially expressed. Furthermore, GO enrichment analysis for these genes revealed gene response to defense response to virus, apoptotic process and transcription factor activity. In summary, this study identifies several putative immune pathways and candidate genes deserving further investigation in the context of novel gene discovery, gene expression and regulation studies and lays the foundation for fish immunology especially in P. olivaceus against VHSV. Copyright © 2018. Published by Elsevier Ltd.
Xu, Xing-Li; Cheng, Tian-Yin; Yang, Hu; Yan, Fen; Yang, Ya
2015-06-01
Saliva plays an important role in feeding and pathogen transmission, identification and analysis of tick salivary gland (SG) proteins is considered as a hot spot in anti-tick researching area. Herein, we present the first description of SG transcriptome of Haemaphysalis flava using next-generation sequencing (NGS). A total of over 143 million high-quality reads were assembled into 54,357 unigenes, of which 20,145 (37.06%) had significant similarities to proteins in the Swiss-Prot database. 13,513 annotated sequences were associated with GO terms. Kyoto Encyclopedia of Genes and Genomes (KEGG) analysis showed that 14,280 unigenes were assigned to 279 KEGG pathways in total. Reads per kb per million reads (RPKM) analysis showed that there were 3035 down-regulated unigenes and 2260 up-regulated unigenes in the engorged ticks (ET) compared with the semi-engorged one (SET). Several important genes are associated with blood feeding and ingestion as secreted salivary proteins, concluding cysteine, longipain, 4D8, calreticulin, metalloproteases, serine protease inhibitor, enolase, heat shock protein and AV422 in SG, were identified. The qRT-PCR results confirmed that patterns of these genes (except for the longipain gene) expression were consistent with RNA-seq results. This de novo assembly of SG transcriptome of H. flava not only provides more chance for screening and cloning functional genes, but also forms a solid basis for further insight into the changes of salivary proteins during blood-feeding. Copyright © 2015 Elsevier B.V. All rights reserved.
Liu, Shanshan; Chen, Guanxing; Xu, Haidong; Zou, Weibin; Yan, Wenrui; Wang, Qianqian; Deng, Hengwei; Zhang, Heqian; Yu, Guojiao; He, Jianguo; Weng, Shaoping
2017-01-01
Mud crab (Scylla paramamosain) is an economically important marine cultured species in China's coastal area. Mud crab reovirus (MCRV) is the most important pathogen of mud crab, resulting in large economic losses in crab farming. In this paper, next-generation sequencing technology and bioinformatics analysis are used to study transcriptome differences between MCRV-infected mud crab and normal control. A total of 104.3 million clean reads were obtained, including 52.7 million and 51.6 million clean reads from MCRV-infected (CA) and controlled (HA) mud crabs respectively. 81,901, 70,059 and 67,279 unigenes were gained respectively from HA reads, CA reads and HA&CA reads. A total of 32,547 unigenes from HA&CA reads called All-Unigenes were matched to at least one database among Nr, Nt, Swiss-prot, COG, GO and KEGG databases. Among these, 13,039, 20,260 and 11,866 unigenes belonged to the 3, 258 and 25 categories of GO, KEGG pathway, and COG databases, respectively. Solexa/Illumina's DGE platform was also used, and about 13,856 differentially expressed genes (DEGs), including 4444 significantly upregulated and 9412 downregulated DEGs were detected in diseased crabs compared with the control. KEGG pathway analysis revealed that DEGs were obviously enriched in the pathways related to different diseases or infections. This transcriptome analysis provided valuable information on gene functions associated with the response to MCRV in mud crab, as well as detail information for identifying novel genes in the absence of the mud crab genome database. Copyright © 2016. Published by Elsevier Ltd.
Transcriptome analysis of sika deer in China.
Jia, Bo-Yin; Ba, Heng-Xing; Wang, Gui-Wu; Yang, Ying; Cui, Xue-Zhe; Peng, Ying-Hua; Zheng, Jun-Jun; Xing, Xiu-Mei; Yang, Fu-He
2016-10-01
Sika deer is of great commercial value because their antlers are used in tonics and alternative medicine and their meat is healthy and delicious. The goal of this study was to generate transcript sequences from sika deer for functional genomic analyses and to identify the transcripts that demonstrate tissue-specific, age-dependent differential expression patterns. These sequences could enhance our understanding of the molecular mechanisms underlying sika deer growth and development. In the present study, we performed de novo transcriptome assembly and profiling analysis across ten tissue types and four developmental stages (juvenile, adolescent, adult, and aged) of sika deer, using Illumina paired-end tag (PET) sequencing technology. A total of 1,752,253 contigs with an average length of 799 bp were generated, from which 1,348,618 unigenes with an average length of 590 bp were defined. Approximately 33.2 % of these (447,931 unigenes) were then annotated in public protein databases. Many sika deer tissue-specific, age-dependent unigenes were identified. The testes have the largest number of tissue-enriched unigenes, and some of them were prone to develop new functions for other tissues. Additionally, our transcriptome revealed that the juvenile-adolescent transition was the most complex and important stage of the sika deer life cycle. The present work represents the first multiple tissue transcriptome analysis of sika deer across four developmental stages. The generated data not only provide a functional genomics resource for future biological research on sika deer but also guide the selection and manipulation of genes controlling growth and development.
Yang, Minglei; Wu, Ying; Jin, Shan; Hou, Jinyan; Mao, Yingji; Liu, Wenbo; Shen, Yangcheng; Wu, Lifang
2015-01-01
Sapium sebiferum (Linn.) Roxb. (Chinese Tallow Tree) is a perennial woody tree and its seeds are rich in oil which hold great potential for biodiesel production. Despite a traditional woody oil plant, our understanding on S. sebiferum genetics and molecular biology remains scant. In this study, the first comprehensive transcriptome of S. sebiferum flower has been generated by sequencing and de novo assembly. A total of 149,342 unigenes were generated from raw reads, of which 24,289 unigenes were successfully matched to public database. A total of 61 MADS box genes and putative pathways involved in S. sebiferum flower development have been identified. Abiotic stress response network was also constructed in this work, where 2,686 unigenes are involved in the pathway. As for lipid biosynthesis, 161 unigenes have been identified in fatty acid (FA) and triacylglycerol (TAG) biosynthesis. Besides, the G-Quadruplexes in RNA of S. sebiferum also have been predicted. An interesting finding is that the stress-induced flowering was observed in S. sebiferum for the first time. According to the results of semi-quantitative PCR, expression tendencies of flowering-related genes, GA1, AP2 and CRY2, accorded with stress-related genes, such as GRX50435 and PRXⅡ39562. This transcriptome provides functional genomic information for further research of S. sebiferum, especially for the genetic engineering to shorten the juvenile period and improve yield by regulating flower development. It also offers a useful database for the research of other Euphorbiaceae family plants.
Guo, Wei-Li; Chen, Ru-Gang; Gong, Zhen-Hui; Yin, Yan-Xu; Li, Da-Wei
2013-01-01
Low temperature is one of the major factors limiting pepper (Capsicum annuum L.) production during winter and early spring in non-tropical regions. Application of exogenous abscisic acid (ABA) effectively alleviates the symptoms of chilling injury, such as wilting and formation of necrotic lesions on pepper leaves; however, the underlying molecular mechanism is not understood. The aim of this study was to identify genes that are differentially up- or downregulated in ABA-pretreated hot pepper seedlings incubated at 6°C for 48 h, using a suppression subtractive hybridization (SSH) method. A total of 235 high-quality ESTs were isolated, clustered and assembled into a collection of 73 unigenes including 18 contigs and 55 singletons. A total of 37 unigenes (50.68%) showed similarities to genes with known functions in the non-redundant database; the other 36 unigenes (49.32%) showed low similarities or unknown functions. Gene ontology analysis revealed that the 37 unigenes could be classified into nine functional categories. The expression profiles of 18 selected genes were analyzed using quantitative RT-PCR; the expression levels of 10 of these genes were at least two-fold higher in the ABA-pretreated seedlings under chilling stress than water-pretreated (control) plants under chilling stress. In contrast, the other eight genes were downregulated in ABA-pretreated seedlings under chilling stress, with expression levels that were one-third or less of the levels observed in control seedlings under chilling stress. These results suggest that ABA can positively and negatively regulate genes in pepper plants under chilling stress.
Qu, Cheng; Fu, Ningning; Xu, Yihua
2016-01-01
The sycamore lace bug, Corythucha ciliata (Hemiptera: Tingidae), is an invasive forestry pest rapidly expanding in many countries. This pest poses a considerable threat to the urban forestry ecosystem, especially to Platanus spp. However, its molecular biology and biochemistry are poorly understood. This study reports the first C. ciliata transcriptome, encompassing three different life stages (Nymphs, adults female (AF) and adults male (AM)). In total, 26.53 GB of clean data and 60,879 unigenes were obtained from three RNA-seq libraries. These unigenes were annotated and classified by Nr (NCBI non-redundant protein sequences), Nt (NCBI non-redundant nucleotide sequences), Pfam (Protein family), KOG/COG (Clusters of Orthologous Groups of proteins), Swiss-Prot (A manually annotated and reviewed protein sequence database), and KO (KEGG Ortholog database). After all pairwise comparisons between these three different samples, a large number of differentially expressed genes were revealed. The dramatic differences in global gene expression profiles were found between distinct life stages (nymphs and AF, nymphs and AM) and sex difference (AF and AM), with some of the significantly differentially expressed genes (DEGs) being related to metamorphosis, digestion, immune and sex difference. The different express of unigenes were validated through quantitative Real-Time PCR (qRT-PCR) for 16 randomly selected unigenes. In addition, 17,462 potential simple sequence repeat molecular markers were identified in these transcriptome resources. These comprehensive C. ciliata transcriptomic information can be utilized to promote the development of environmentally friendly methodologies to disrupt the processes of metamorphosis, digestion, immune and sex differences. PMID:27494615
A New Omics Data Resource of Pleurocybella porrigens for Gene Discovery
Dohra, Hideo; Someya, Takumi; Takano, Tomoyuki; Harada, Kiyonori; Omae, Saori; Hirai, Hirofumi; Yano, Kentaro; Kawagishi, Hirokazu
2013-01-01
Background Pleurocybella porrigens is a mushroom-forming fungus, which has been consumed as a traditional food in Japan. In 2004, 55 people were poisoned by eating the mushroom and 17 people among them died of acute encephalopathy. Since then, the Japanese government has been alerting Japanese people to take precautions against eating the P . porrigens mushroom. Unfortunately, despite efforts, the molecular mechanism of the encephalopathy remains elusive. The genome and transcriptome sequence data of P . porrigens and the related species, however, are not stored in the public database. To gain the omics data in P . porrigens , we sequenced genome and transcriptome of its fruiting bodies and mycelia by next generation sequencing. Methodology/Principal Findings Short read sequences of genomic DNAs and mRNAs in P . porrigens were generated by Illumina Genome Analyzer. Genome short reads were de novo assembled into scaffolds using Velvet. Comparisons of genome signatures among Agaricales showed that P . porrigens has a unique genome signature. Transcriptome sequences were assembled into contigs (unigenes). Biological functions of unigenes were predicted by Gene Ontology and KEGG pathway analyses. The majority of unigenes would be novel genes without significant counterparts in the public omics databases. Conclusions Functional analyses of unigenes present the existence of numerous novel genes in the basidiomycetes division. The results mean that the omics information such as genome, transcriptome and metabolome in basidiomycetes is short in the current databases. The large-scale omics information on P . porrigens , provided from this research, will give a new data resource for gene discovery in basidiomycetes. PMID:23936076
Transcriptomic analysis of Aegilops tauschii during long-term salinity stress.
Mansouri, Mehdi; Naghavi, Mohammad Reza; Alizadeh, Hoshang; Mohammadi-Nejad, Ghasem; Mousavi, Seyed Ahmad; Salekdeh, Ghasem Hosseini; Tada, Yuichi
2018-06-21
Aegilops tauschii is the diploid progenitor of the bread wheat D-genome. It originated from Iran and is a source of abiotic stress tolerance genes. However, little is known about the molecular events of salinity tolerance in Ae. tauschii. This study investigates the leaf transcriptional changes associated with long-term salt stress. Total RNA extracted from leaf tissues of control and salt-treated samples was sequenced using the Illumina technology, and more than 98 million high-quality reads were assembled into 255,446 unigenes with an average length of 1398 bp and an N50 of 2269 bp. Functional annotation of the unigenes showed that 93,742 (36.69%) had at least a significant BLAST hit in the SwissProt database, while 174,079 (68.14%) showed significant similarity to proteins in the NCBI nr database. Differential expression analysis identified 4506 salt stress-responsive unigenes. Bioinformatic analysis of the differentially expressed unigenes (DEUs) revealed a number of biological processes and pathways involved in the establishment of ion homeostasis, signaling processes, carbohydrate metabolism, and post-translational modifications. Fine regulation of starch and sucrose content may be important features involved in salt tolerance in Ae. tauschii. Moreover, 82% of DEUs mapped to the D-subgenome, including known QTL for salt tolerance, and these DEUs showed similar salt stress responses in other accessions of Ae. tauschii. These results could provide fundamental insight into the regulatory process underlying salt tolerance in Ae. tauschii and wheat and facilitate identification of genes involved in their salt tolerance mechanisms.
Li, Qianqian; Liu, Jianguo; Zhang, Litao; Liu, Qian
2014-01-01
Background Algae in the order Trentepohliales have a broad geographic distribution and are generally characterized by the presence of abundant β-carotene. The many monographs published to date have mainly focused on their morphology, taxonomy, phylogeny, distribution and reproduction; molecular studies of this order are still rare. High-throughput RNA sequencing (RNA-Seq) technology provides a powerful and efficient method for transcript analysis and gene discovery in Trentepohlia jolithus. Methods/Principal Findings Illumina HiSeq 2000 sequencing generated 55,007,830 Illumina PE raw reads, which were assembled into 41,328 assembled unigenes. Based on NR annotation, 53.28% of the unigenes (22,018) could be assigned to gene ontology classes with 54 subcategories and 161,451 functional terms. A total of 26,217 (63.44%) assembled unigenes were mapped to 128 KEGG pathways. Furthermore, a set of 5,798 SSRs in 5,206 unigenes and 131,478 putative SNPs were identified. Moreover, the fact that all of the C4 photosynthesis genes exist in T. jolithus suggests a complex carbon acquisition and fixation system. Similarities and differences between T. jolithus and other algae in carotenoid biosynthesis are also described in depth. Conclusions/Significance This is the first broad transcriptome survey for T. jolithus, increasing the amount of molecular data available for the class Ulvophyceae. As well as providing resources for functional genomics studies, the functional genes and putative pathways identified here will contribute to a better understanding of carbon fixation and fatty acid and carotenoid biosynthesis in T. jolithus. PMID:25254555
Jia, Zhiying; Wang, Qiai; Wu, Kaikai; Wei, Zhenlin; Zhou, Zunchun; Liu, Xiaolin
2017-09-01
Strongylocentrotus nudus is an edible sea urchin, mainly harvested in China. Correlation studies indicated that S. nudus with larger diameter have a prolonged marketing time and better palatability owing to their precocious gonads and extended maturation process. However, the molecular mechanism underlying this phenomenon is still unknown. Here, transcriptome sequencing was applied to study the ovaries of adult S. nudus with different shell diameters to explore the possible mechanism. In this study, four independent cDNA libraries were constructed, including two from the big size urchins and two from the small ones using a HiSeq™2500 platform. A total of 88,581 unigenes were acquired with a mean length of 1354bp, of which 66,331 (74.88%) unigenes could be annotated using six major publicly available databases. Comparative analysis revealed that 353 unigenes were differentially expressed (with log2(ratio)≥1, FDR≤0.001) between the two groups. Of these, 20 differentially expressed genes (DEGs) were selected to confirm the accuracy of RNA-seq data by quantitative real-time RT-PCR. Furthermore, gene ontology and KEGG pathway enrichment analyses were performed to find the putative genes and pathways related to ovarian maturity. Eight unigenes were identified as significant DEGs involved in reproduction related pathways; these included Mos, Cdc20, Rec8, YP30, cytochrome P450 2U1, ovoperoxidase, proteoliaisin, and rendezvin. Our research fills the gap in the studies on the S. nudus ovaries using transcriptome analysis. Copyright © 2017 Elsevier Inc. All rights reserved.
Comparative transcript profiling of the fertile and sterile flower buds of pol CMS in B. napus.
An, Hong; Yang, Zonghui; Yi, Bin; Wen, Jing; Shen, Jinxiong; Tu, Jinxing; Ma, Chaozhi; Fu, Tingdong
2014-04-03
The Polima (pol) system of cytoplasmic male sterility (CMS) and its fertility restoration gene Rfp have been used in hybrid breeding in Brassica napus, which has greatly improved the yield of rapeseed. However, the mechanism of the male sterility transition in pol CMS remains to be determined. To investigate the transcriptome during the male sterility transition in pol CMS, a near-isogenic line (NIL) of pol CMS was constructed. The phenotypic features and sterility stage were confirmed by anatomical analysis. Subsequently, we compared the genomic expression profiles of fertile and sterile young flower buds by RNA-Seq. A total of 105,481,136 sequences were successfully obtained. These reads were assembled into 112,770 unigenes, which composed the transcriptome of the bud. Among these unigenes, 72,408 (64.21%) were annotated using public protein databases and classified into functional clusters. In addition, we investigated the changes in expression of the fertile and sterile buds; the RNA-seq data showed 1,148 unigenes had significantly different expression and they were mainly distributed in metabolic and protein synthesis pathways. Additionally, some unigenes controlling anther development were dramatically down-regulated in sterile buds. These results suggested that an energy deficiency caused by orf224/atp6 may inhibit a series of genes that regulate pollen development through nuclear-mitochondrial interaction. This results in the sterility of pol CMS by leading to the failure of sporogenous cell differentiation. This study may provide assistance for detailed molecular analysis and a better understanding of pol CMS in B. napus.
Tong, Jun; Dong, Yanfang; Xu, Dongyun; Mao, Jing; Zhou, Yuan
2017-01-01
Rhododendron spp. is an important ornamental species that is widely cultivated for landscape worldwide. Heat stress is a major obstacle for its cultivation in south China. Previous studies on rhododendron principally focused on its physiological and biochemical processes, which are involved in a series of stress tolerance. However, molecular or genetic properties of rhododendron’s response to heat stress are still poorly understood. The phenotype and chlorophyll fluorescence kinetics parameters of four rhododendron cultivars were compared under normal or heat stress conditions, and a cultivar with highest heat tolerance, “Yanzhimi” (R. obtusum) was selected for transcriptome sequencing. A total of 325,429,240 high quality reads were obtained and assembled into 395,561 transcripts and 92,463 unigenes. Functional annotation showed that 38,724 unigenes had sequence similarity to known genes in at least one of the proteins or nucleotide databases used in this study. These 38,724 unigenes were categorized into 51 functional groups based on Gene Ontology classification and were blasted to 24 known cluster of orthologous groups. A total of 973 identified unigenes belonged to 57 transcription factor families, including the stress-related HSF, DREB, ZNF, and NAC genes. Photosynthesis was significantly enriched in the Kyoto Encyclopedia of Genes and Genomes pathway, and the changed expression pattern was illustrated. The key pathways and signaling components that contribute to heat tolerance in rhododendron were revealed. These results provide a potentially valuable resource that can be used for heat-tolerance breeding. PMID:29059200
Kelly Ivors; Matteo Garbelotto; Ineke De Vries; Peter Bonants
2006-01-01
Investigating the population genetics of Phytophthora ramorum, the causal agent of sudden oak death (SOD), is critical to understanding the biology and epidemiology of this important phytopathogen. Raw sequence data (445,000 reads) of P. ramorum was provided by the Joint Genome Institute. Our objective was to develop and utilize...
Transcriptomic Analysis of Flower Blooming in Jasminum sambac through De Novo RNA Sequencing.
Li, Yong-Hua; Zhang, Wei; Li, Yong
2015-06-10
Flower blooming is a critical and complicated plant developmental process in flowering plants. However, insufficient information is available about the complex network that regulates flower blooming in Jasminum sambac. In this study, we used the RNA-Seq platform to analyze the molecular regulation of flower blooming in J. sambac by comparing the transcript profiles at two flower developmental stages: budding and blooming. A total of 4577 differentially-expressed genes (DEGs) were identified between the two floral stages. The Gene Ontology and the Kyoto Encyclopedia of Genes and Genomes pathway enrichment analyses revealed that the DEGs in the "oxidation-reduction process", "extracellular region", "steroid biosynthesis", "glycosphingolipid biosynthesis", "plant hormone signal transduction" and "pentose and glucuronate interconversions" might be associated with flower development. A total of 103 and 92 unigenes exhibited sequence similarities to the known flower development and floral scent genes from other plants. Among these unigenes, five flower development and 19 floral scent unigenes exhibited at least four-fold differences in expression between the two stages. Our results provide abundant genetic resources for studying the flower blooming mechanisms and molecular breeding of J. sambac.
Wang, Long; Chen, Yun; Wang, Suke; Xue, Huabai; Su, Yanli; Yang, Jian; Li, Xiugen
2018-01-01
Pear ( Pyrus spp.) is a popular fruit that is commercially cultivated in most temperate regions. In fruits, sugar metabolism and accumulation are important factors for fruit organoleptic quality. Post-harvest ripening is a special feature of 'Red Clapp's Favorite'. In this study, transcriptome sequencing based on the Illumina platform generated 23.8 - 35.8 million unigenes of nine cDNA libraries constructed using RNAs from the 'Red Clapp's Favorite' pear variety with different treatments, in which 2629 new genes were discovered, and 2121 of them were annotated. A total of 2146 DEGs, 3650 DEGs, 1830 DEGs from each comparison were assembled. Moreover, the gene expression patterns of 8 unigenes related to sugar metabolism revealed by qPCR. The main constituents of soluble sugars were fructose and glucose after pear fruit post-harvest ripening, and five unigenes involved in sugar metabolism were discovered. Our study not only provides a large-scale assessment of transcriptome resources of 'Red Clapp's Favorite' but also lays the foundation for further research into genes correlated with sugar metabolism.
Duan, Dong; Jia, Yun; Yang, Jie; Li, Zhong-Hu
2017-01-01
The sex determination in gymnosperms is still poorly characterized due to the lack of genomic/transcriptome resources and useful molecular genetic markers. To enhance our understanding of the molecular mechanisms of the determination of sexual recognition of reproductive structures in conifers, the transcriptome of male and female conelets were characterized in a Chinese endemic conifer species, Pinus bungeana Zucc. ex Endl. The 39.62 Gb high-throughput sequencing reads were obtained from two kinds of sexual conelets. After de novo assembly of the obtained reads, 85,305 unigenes were identified, 53,944 (63.23%) of which were annotated with public databases. A total of 12,073 differentially expressed genes were detected between the two types of sexes in P. bungeana, and 5766 (47.76%) of them were up-regulated in females. The Kyoto Encyclopedia of Genes and Genomes (KEGG) enriched analysis suggested that some of the genes were significantly associated with the sex determination process of P. bungeana, such as those involved in tryptophan metabolism, zeatin biosynthesis, and cysteine and methionine metabolism, and the phenylpropanoid biosynthesis pathways. Meanwhile, some important plant hormone pathways (e.g., the gibberellin (GA) pathway, carotenoid biosynthesis, and brassinosteroid biosynthesis (BR) pathway) that affected sexual determination were also induced in P. bungeana. In addition, 8791 expressed sequence tag-simple sequence repeats (EST-SSRs) from 7859 unigenes were detected in P. bungeana. The most abundant repeat types were dinucleotides (1926), followed by trinucleotides (1711). The dominant classes of the sequence repeat were A/T (4942) in mononucleotides and AT/AT (1283) in dinucleotides. Among these EST-SSRs, 84 pairs of primers were randomly selected for the characterization of potential molecular genetic markers. Finally, 19 polymorphic EST-SSR primers were characterized. We found low to moderate levels of genetic diversity (NA = 1.754; HO = 0.206; HE = 0.205) across natural populations of P. bungeana. The cluster analysis revealed two distinct genetic groups for the six populations that were sampled in this endemic species, which might be caused by the fragmentation of habitats and long-term geographic isolation among different populations. Taken together, this work provides important insights into the molecular mechanisms of sexual identity in the reproductive organs of P. bungeana. The molecular genetic resources that were identified in this study will also facilitate further studies in functional genomics and population genetics in the Pinus species. PMID:29257091
Li, Y. H.; Chu, H. P.; Jiang, Y. N.; Lin, C. Y.; Li, S. H.; Li, K. T.; Weng, G. J.; Cheng, C. C.; Lu, D. J.; Ju, Y. T.
2014-01-01
The Lanyu is a miniature pig breed indigenous to Lanyu Island, Taiwan. It is distantly related to Asian and European pig breeds. It has been inbred to generate two breeds and crossed with Landrace and Duroc to produce two hybrids for laboratory use. Selecting sets of informative genetic markers to track the genetic qualities of laboratory animals and stud stock is an important function of genetic databases. For more than two decades, Lanyu derived breeds of common ancestry and crossbreeds have been used to examine the effectiveness of genetic marker selection and optimal approaches for individual assignment. In this paper, these pigs and the following breeds: Berkshire, Duroc, Landrace and Yorkshire, Meishan and Taoyuan, TLRI Black Pig No. 1, and Kaohsiung Animal Propagation Station Black pig are studied to build a genetic reference database. Nineteen microsatellite markers (loci) provide information on genetic variation and differentiation among studied breeds. High differentiation index (FST) and Cavalli-Sforza chord distances give genetic differentiation among breeds, including Lanyu’s inbred populations. Inbreeding values (FIS) show that Lanyu and its derived inbred breeds have significant loss of heterozygosity. Individual assignment testing of 352 animals was done with different numbers of microsatellite markers in this study. The testing assigned 99% of the animals successfully into their correct reference populations based on 9 to 14 markers ranking D-scores, allelic number, expected heterozygosity (HE) or FST, respectively. All miss-assigned individuals came from close lineage Lanyu breeds. To improve individual assignment among close lineage breeds, microsatellite markers selected from Lanyu populations with high polymorphic, heterozygosity, FST and D-scores were used. Only 6 to 8 markers ranking HE, FST or allelic number were required to obtain 99% assignment accuracy. This result suggests empirical examination of assignment-error rates is required if discernible levels of co-ancestry exist. In the reference group, optimum assignment accuracy was achievable achieved through a combination of different markers by ranking the heterozygosity, FST and allelic number of close lineage populations. PMID:25049996
Microsatellite DNA library for Caiman latirostris.
Zucoloto, Rodrigo Barban; Verdade, Luciano Martins; Coutinho, Luiz Lehmann
2002-12-15
New genetic markers were characterized for the broad-snouted caiman (Caiman latirostris) by constructing libraries enriched for microsatellite DNA. Construction and characterization of these libraries are described in the present study. One microsatellite marker was developed from a (ACC-TGG)(n)enriched microsatellite DNA library, and 12 microsatellite markers were developed from a (AC-TG)(n)enriched microsatellite DNA library. These markers were tested in wild-caught animals, and these tests resulted in ten new polymorphic microsatellites for C. latirostris. Copyright 2002 Wiley-Liss, Inc.
An Autosomal Genetic Linkage Map of the Sheep Genome
Crawford, A. M.; Dodds, K. G.; Ede, A. J.; Pierson, C. A.; Montgomery, G. W.; Garmonsway, H. G.; Beattie, A. E.; Davies, K.; Maddox, J. F.; Kappes, S. W.; Stone, R. T.; Nguyen, T. C.; Penty, J. M.; Lord, E. A.; Broom, J. E.; Buitkamp, J.; Schwaiger, W.; Epplen, J. T.; Matthew, P.; Matthews, M. E.; Hulme, D. J.; Beh, K. J.; McGraw, R. A.; Beattie, C. W.
1995-01-01
We report the first extensive ovine genetic linkage map covering 2070 cM of the sheep genome. The map was generated from the linkage analysis of 246 polymorphic markers, in nine three-generation fullsib pedigrees, which make up the AgResearch International Mapping Flock. We have exploited many markers from cattle so that valuable comparisons between these two ruminant linkage maps can be made. The markers, used in the segregation analyses, comprised 86 anonymous microsatellite markers derived from the sheep genome, 126 anonymous microsatellites from cattle, one from deer, and 33 polymorphic markers of various types associated with known genes. The maximum number of informative meioses within the mapping flock was 222. The average number of informative meioses per marker was 140 (range 18-209). Linkage groups have been assigned to all 26 sheep autosomes. PMID:7498748
Reuschenbach, Miriam; Kloor, Matthias; Morak, Monika; Wentzensen, Nicolas; Germann, Anja; Garbe, Yvette; Tariverdian, Mirjam; Findeisen, Peter; Neumaier, Michael; Holinski-Feder, Elke; Doeberitz, Magnus von Knebel
2014-01-01
High level microsatellite instability (MSI-H) occurs in about 15% of colorectal cancer (CRCs), either as sporadic cancers or in the context of hereditary non-polyposis cancer (HNPCC) or Lynch syndrome. In MSI-H CRC, mismatch repair deficiency leads to insertion/deletion mutations at coding microsatellites (cMS) and thus to the translation of frameshift peptides (FSPs). FSPs are potent inductors of T cell responses in vitro and in vivo. The present study aims at the identification of FSP-specific humoral immune responses in MSI-H CRC and Lynch syndrome. Sera from patients with history of MSI-H CRC (n=69), healthy Lynch syndrome mutation carriers (n=31) and healthy controls (n=52) were analyzed for antibodies against FSPs using peptide ELISA. Reactivities were measured against FSPs derived from genes frequently mutated in MSI-H CRCs, AIM2, TGFBR2, CASP5, TAF1B, ZNF294, and MARCKS. Antibody reactivity against FSPs was significantly higher in MSI-H CRC patients than in healthy controls (p=0.036, Mann-Whitney) and highest in patients with shortest interval between tumor resection and serum sampling. Humoral immune responses in patients were most frequently directed against FSPs derived from mutated TAF1B (11.6%, 8/69) and TGFBR2 (10.1%, 7/69). Low level FSP-specific antibodies were also detected in healthy mutation carriers. Our results show that antibody responses against FSPs are detectable in MSI-H CRC patients and healthy Lynch syndrome mutation carriers. Based on the high number of defined FSP antigens, measuring FSP-specific humoral immune responses is a highly promising tool for future diagnostic application in MSI-H cancer patients. PMID:19957108
Reuschenbach, Miriam; Kloor, Matthias; Morak, Monika; Wentzensen, Nicolas; Germann, Anja; Garbe, Yvette; Tariverdian, Mirjam; Findeisen, Peter; Neumaier, Michael; Holinski-Feder, Elke; von Knebel Doeberitz, Magnus
2010-06-01
High level microsatellite instability (MSI-H) occurs in about 15% of colorectal cancer (CRCs), either as sporadic cancers or in the context of hereditary non-polyposis cancer or Lynch syndrome. In MSI-H CRC, mismatch repair deficiency leads to insertion/deletion mutations at coding microsatellites and thus to the translation of frameshift peptides (FSPs). FSPs are potent inductors of T cell responses in vitro and in vivo. The present study aims at the identification of FSP-specific humoral immune responses in MSI-H CRC and Lynch syndrome. Sera from patients with history of MSI-H CRC (n = 69), healthy Lynch syndrome mutation carriers (n = 31) and healthy controls (n = 52) were analyzed for antibodies against FSPs using peptide ELISA. Reactivities were measured against FSPs derived from genes frequently mutated in MSI-H CRCs, AIM2, TGFBR2, CASP5, TAF1B, ZNF294, and MARCKS. Antibody reactivity against FSPs was significantly higher in MSI-H CRC patients than in healthy controls (P = 0.036, Mann-Whitney) and highest in patients with shortest interval between tumor resection and serum sampling. Humoral immune responses in patients were most frequently directed against FSPs derived from mutated TAF1B (11.6%, 8/69) and TGFBR2 (10.1%, 7/69). Low level FSP-specific antibodies were also detected in healthy mutation carriers. Our results show that antibody responses against FSPs are detectable in MSI-H CRC patients and healthy Lynch syndrome mutation carriers. Based on the high number of defined FSP antigens, measuring FSP-specific humoral immune responses is a highly promising tool for future diagnostic application in MSI-H cancer patients.
MacDonald, A J; Fitzsimmons, N N; Chambers, B; Renfree, M B; Sarre, S D
2014-03-01
The emerging availability of microsatellite markers from mammalian sex chromosomes provides opportunities to investigate both male- and female-mediated gene flow in wild populations, identifying patterns not apparent from the analysis of autosomal markers alone. Tammar wallabies (Macropus eugenii), once spread over the southern mainland, have been isolated on several islands off the Western Australian and South Australian coastlines for between 10,000 and 13,000 years. Here, we combine analyses of autosomal, Y-linked and X-linked microsatellite loci to investigate genetic variation in populations of this species on two islands (Kangaroo Island, South Australia and Garden Island, Western Australia). All measures of diversity were higher for the larger Kangaroo Island population, in which genetic variation was lowest at Y-linked markers and highest at autosomal markers (θ=3.291, 1.208 and 0.627 for autosomal, X-linked and Y-linked data, respectively). Greater relatedness among females than males provides evidence for male-biased dispersal in this population, while sex-linked markers identified genetic lineages not apparent from autosomal data alone. Overall genetic diversity in the Garden Island population was low, especially on the Y chromosome where most males shared a common haplotype, and we observed high levels of inbreeding and relatedness among individuals. Our findings highlight the utility of this approach for management actions, such as the selection of animals for translocation or captive breeding, and the ecological insights that may be gained by combining analyses of microsatellite markers on sex chromosomes with those derived from autosomes.
Liu, Fuli; Hu, Zimin; Liu, Wenhui; Li, Jingjing; Wang, Wenjun; Liang, Zhourui; Wang, Feijiu; Sun, Xiutao
2016-01-01
Using transcriptome data to mine microsatellite and develop markers has growingly become prevalent. However, characterizing the possible function of microsatellite is relatively rare. In this study, we explored microsatellites in the transcriptome of the brown alga Sargassum thunbergii and characterized the frequencies, distribution, function and evolution, and developed primers to validate these microsatellites. Our results showed that Tri-nucleotide is the most abundant, followed by di- and mono-nucleotide. The length of microsatellite was significantly affected by the repeat motif size. The density of microsatellite in the CDS region is significantly lower than that in the UTR region. The annotation of the transcripts containing microsatellite showed that 573 transcripts have GO terms and can be categorized into 42 groups. Pathways enrichment showed that microsatellites were significantly overrepresented in the genes involved in pathways such as Ubiquitin mediated proteolysis, RNA degradation, Spliceosome, etc. Primers flanking 961 microsatellite loci were designed, and among the 30 pairs of primer selected randomly for availability test, 23 were proved to be efficient. These findings provided new insight into the function and evolution of microsatellite in transcriptome, and the identified microsatellite loci within the annotated gene will be useful for developing functional markers in S. thunbergii. PMID:26732855
Shi, Jiaqin; Huang, Shunmou; Fu, Donghui; Yu, Jinyin; Wang, Xinfa; Hua, Wei; Liu, Shengyi; Liu, Guihua; Wang, Hanzhong
2013-01-01
Despite their ubiquity and functional importance, microsatellites have been largely ignored in comparative genomics, mostly due to the lack of genomic information. In the current study, microsatellite distribution was characterized and compared in the whole genomes and both the coding and non-coding DNA sequences of the sequenced Brassica, Arabidopsis and other angiosperm species to investigate their evolutionary dynamics in plants. The variation in the microsatellite frequencies of these angiosperm species was much smaller than those for their microsatellite numbers and genome sizes, suggesting that microsatellite frequency may be relatively stable in plants. The microsatellite frequencies of these angiosperm species were significantly negatively correlated with both their genome sizes and transposable elements contents. The pattern of microsatellite distribution may differ according to the different genomic regions (such as coding and non-coding sequences). The observed differences in many important microsatellite characteristics (especially the distribution with respect to motif length, type and repeat number) of these angiosperm species were generally accordant with their phylogenetic distance, which suggested that the evolutionary dynamics of microsatellite distribution may be generally consistent with plant divergence/evolution. Importantly, by comparing these microsatellite characteristics (especially the distribution with respect to motif type) the angiosperm species (aside from a few species) all clustered into two obviously different groups that were largely represented by monocots and dicots, suggesting a complex and generally dichotomous evolutionary pattern of microsatellite distribution in angiosperms. Polyploidy may lead to a slight increase in microsatellite frequency in the coding sequences and a significant decrease in microsatellite frequency in the whole genome/non-coding sequences, but have little effect on the microsatellite distribution with respect to motif length, type and repeat number. Interestingly, several microsatellite characteristics seemed to be constant in plant evolution, which can be well explained by the general biological rules. PMID:23555856
Zhang, Jia-Jin; Shu, Li-Ping; Zhang, Wei; Long, Guang-Qiang; Liu, Tao; Meng, Zheng-Gui; Chen, Jun-Wen; Yang, Sheng-Chao
2014-01-01
Background Erigeron breviscapus (Vant.) Hand-Mazz. is a famous medicinal plant. Scutellarin and chlorogenic acids are the primary active components in this herb. However, the mechanisms of biosynthesis and regulation for scutellarin and chlorogenic acids in E. breviscapus are considerably unknown. In addition, genomic information of this herb is also unavailable. Principal Findings Using Illumina sequencing on GAIIx platform, a total of 64,605,972 raw sequencing reads were generated and assembled into 73,092 non-redundant unigenes. Among them, 44,855 unigenes (61.37%) were annotated in the public databases Nr, Swiss-Prot, KEGG, and COG. The transcripts encoding the known enzymes involved in flavonoids and in chlorogenic acids biosynthesis were discovered in the Illumina dataset. Three candidate cytochrome P450 genes were discovered which might encode flavone 6-hydroase converting apigenin to scutellarein. Furthermore, 4 unigenes encoding the homologues of maize P1 (R2R3-MYB transcription factors) were defined, which might regulate the biosynthesis of scutellarin. Additionally, a total of 11,077 simple sequence repeat (SSR) were identified from 9,255 unigenes. Of SSRs, tri-nucleotide motifs were the most abundant motif. Thirty-six primer pairs for SSRs were randomly selected for validation of the amplification and polymorphism. The result revealed that 34 (94.40%) primer pairs were successfully amplified and 19 (52.78%) primer pairs exhibited polymorphisms. Conclusion Using next generation sequencing (NGS) technology, this study firstly provides abundant genomic data for E. breviscapus. The candidate genes involved in the biosynthesis and transcriptional regulation of scutellarin and chlorogenic acids were obtained in this study. Additionally, a plenty of genetic makers were generated by identification of SSRs, which is a powerful tool for molecular breeding and genetics applications in this herb. PMID:24956277
Li, Xihong; Cui, Zhaoxia; Liu, Yuan; Song, Chengwen; Shi, Guohui
2013-01-01
Background The Chinese mitten crab Eriocheir sinensis is an important economic crustacean and has been seriously attacked by various diseases, which requires more and more information for immune relevant genes on genome background. Recently, high-throughput RNA sequencing (RNA-seq) technology provides a powerful and efficient method for transcript analysis and immune gene discovery. Methods/Principal Findings A cDNA library from hepatopancreas of E. sinensis challenged by a mixture of three pathogen strains (Gram-positive bacteria Micrococcus luteus, Gram-negative bacteria Vibrio alginolyticus and fungi Pichia pastoris; 108 cfu·mL−1) was constructed and randomly sequenced using Illumina technique. Totally 39.76 million clean reads were assembled to 70,300 unigenes. After ruling out short-length and low-quality sequences, 52,074 non-redundant unigenes were compared to public databases for homology searching and 17,617 of them showed high similarity to sequences in NCBI non-redundant protein (Nr) database. For function classification and pathway assignment, 18,734 (36.00%) unigenes were categorized to three Gene Ontology (GO) categories, 12,243 (23.51%) were classified to 25 Clusters of Orthologous Groups (COG), and 8,983 (17.25%) were assigned to six Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways. Potentially, 24, 14, 47 and 132 unigenes were characterized to be involved in Toll, IMD, JAK-STAT and MAPK pathways, respectively. Conclusions/Significance This is the first systematical transcriptome analysis of components relating to innate immune pathways in E. sinensis. Functional genes and putative pathways identified here will contribute to better understand immune system and prevent various diseases in crab. PMID:23874555
Wu, Shuanghua; Lei, Jianjun; Chen, Guoju; Chen, Hancai; Cao, Bihao; Chen, Changming
2017-01-01
Chinese kale, a vegetable of the cruciferous family, is a popular crop in southern China and Southeast Asia due to its high glucosinolate content and nutritional qualities. However, there is little research on the molecular genetics and genes involved in glucosinolate metabolism and its regulation in Chinese kale. In this study, we sequenced and characterized the transcriptomes and expression profiles of genes expressed in 11 tissues of Chinese kale. A total of 216 million 150-bp clean reads were generated using RNA-sequencing technology. From the sequences, 98,180 unigenes were assembled for the whole plant, and 49,582~98,423 unigenes were assembled for each tissue. Blast analysis indicated that a total of 80,688 (82.18%) unigenes exhibited similarity to known proteins. The functional annotation and classification tools used in this study suggested that genes principally expressed in Chinese kale, were mostly involved in fundamental processes, such as cellular and molecular functions, the signal transduction, and biosynthesis of secondary metabolites. The expression levels of all unigenes were analyzed in various tissues of Chinese kale. A large number of candidate genes involved in glucosinolate metabolism and its regulation were identified, and the expression patterns of these genes were analyzed. We found that most of the genes involved in glucosinolate biosynthesis were highly expressed in the root, petiole, and in senescent leaves. The expression patterns of ten glucosinolate biosynthetic genes from RNA-seq were validated by quantitative RT-PCR in different tissues. These results provided an initial and global overview of Chinese kale gene functions and expression activities in different tissues. PMID:28228764
Huang, Lin; Li, Guiyang; Mo, Zhaolan; Xiao, Peng; Li, Jie; Huang, Jie
2015-01-01
Background Japanese flounder (Paralichthys olivaceus) is an economically important marine fish in Asia and has suffered from disease outbreaks caused by various pathogens, which requires more information for immune relevant genes on genome background. However, genomic and transcriptomic data for Japanese flounder remain scarce, which limits studies on the immune system of this species. In this study, we characterized the Japanese flounder spleen transcriptome using an Illumina paired-end sequencing platform to identify putative genes involved in immunity. Methodology/Principal Findings A cDNA library from the spleen of P. olivaceus was constructed and randomly sequenced using an Illumina technique. The removal of low quality reads generated 12,196,968 trimmed reads, which assembled into 96,627 unigenes. A total of 21,391 unigenes (22.14%) were annotated in the NCBI Nr database, and only 1.1% of the BLASTx top-hits matched P. olivaceus protein sequences. Approximately 12,503 (58.45%) unigenes were categorized into three Gene Ontology groups, 19,547 (91.38%) were classified into 26 Cluster of Orthologous Groups, and 10,649 (49.78%) were assigned to six Kyoto Encyclopedia of Genes and Genomes pathways. Furthermore, 40,928 putative simple sequence repeats and 47, 362 putative single nucleotide polymorphisms were identified. Importantly, we identified 1,563 putative immune-associated unigenes that mapped to 15 immune signaling pathways. Conclusions/Significance The P. olivaceus transciptome data provides a rich source to discover and identify new genes, and the immune-relevant sequences identified here will facilitate our understanding of the mechanisms involved in the immune response. Furthermore, the plentiful potential SSRs and SNPs found in this study are important resources with respect to future development of a linkage map or marker assisted breeding programs for the flounder. PMID:25723398
Gene Discovery through Transcriptome Sequencing for the Invasive Mussel Limnoperna fortunei
Uliano-Silva, Marcela; Americo, Juliana Alves; Brindeiro, Rodrigo; Dondero, Francesco; Prosdocimi, Francisco; de Freitas Rebelo, Mauro
2014-01-01
The success of the Asian bivalve Limnoperna fortunei as an invader in South America is related to its high acclimation capability. It can inhabit waters with a wide range of temperatures and salinity and handle long-term periods of air exposure. We describe the transcriptome of L. fortunei aiming to give a first insight into the phenotypic plasticity that allows non-native taxa to become established and widespread. We sequenced 95,219 reads from five main tissues of the mussel L. fortunei using Roche’s 454 and assembled them to form a set of 84,063 unigenes (contigs and singletons) representing partial or complete gene sequences. We annotated 24,816 unigenes using a BLAST sequence similarity search against a NCBI nr database. Unigenes were divided into 20 eggNOG functional categories and 292 KEGG metabolic pathways. From the total unigenes, 1,351 represented putative full-length genes of which 73.2% were functionally annotated. We described the first partial and complete gene sequences in order to start understanding bivalve invasiveness. An expansion of the hsp70 gene family, seen also in other bivalves, is present in L. fortunei and could be involved in its adaptation to extreme environments, e.g. during intertidal periods. The presence of toll-like receptors gives a first insight into an immune system that could be more complex than previously assumed and may be involved in the prevention of disease and extinction when population densities are high. Finally, the apparent lack of special adaptations to extremely low O2 levels is a target worth pursuing for the development of a molecular control approach. PMID:25047650
Characterization of mango (Mangifera indica L.) transcriptome and chloroplast genome.
Azim, M Kamran; Khan, Ishtaiq A; Zhang, Yong
2014-05-01
We characterized mango leaf transcriptome and chloroplast genome using next generation DNA sequencing. The RNA-seq output of mango transcriptome generated >12 million reads (total nucleotides sequenced >1 Gb). De novo transcriptome assembly generated 30,509 unigenes with lengths in the range of 300 to ≥3,000 nt and 67× depth of coverage. Blast searching against nonredundant nucleotide databases and several Viridiplantae genomic datasets annotated 24,593 mango unigenes (80% of total) and identified Citrus sinensis as closest neighbor of mango with 9,141 (37%) matched sequences. The annotation with gene ontology and Clusters of Orthologous Group terms categorized unigene sequences into 57 and 25 classes, respectively. More than 13,500 unigenes were assigned to 293 KEGG pathways. Besides major plant biology related pathways, KEGG based gene annotation pointed out active presence of an array of biochemical pathways involved in (a) biosynthesis of bioactive flavonoids, flavones and flavonols, (b) biosynthesis of terpenoids and lignins and (c) plant hormone signal transduction. The mango transcriptome sequences revealed 235 proteases belonging to five catalytic classes of proteolytic enzymes. The draft genome of mango chloroplast (cp) was obtained by a combination of Sanger and next generation sequencing. The draft mango cp genome size is 151,173 bp with a pair of inverted repeats of 27,093 bp separated by small and large single copy regions, respectively. Out of 139 genes in mango cp genome, 91 found to be protein coding. Sequence analysis revealed cp genome of C. sinensis as closest neighbor of mango. We found 51 short repeats in mango cp genome supposed to be associated with extensive rearrangements. This is the first report of transcriptome and chloroplast genome analysis of any Anacardiaceae family member.
Guan, Qijie; Yu, Jiaojiao; Zhu, Wei; Yang, Bingxian; Li, Yaohan; Zhang, Lin; Tian, Jingkui
2018-03-01
Ultraviolet-B (UVB) irradiation induces oxidative stress in plant cells due to the generation of excessive reactive oxygen species. Morus alba L. (M. abla) is an important medicinal plant used for the treatment of human diseases. Also, its leaves are widely used as food for silkworms. In our previous research, we found that a high level of UVB irradiation with dark incubation led to the accumulation of secondary metabolites in M. abla leaf. The aim of the present study was to describe and compare M. alba leaf transcriptomics with different treatments (control, UVB, UVB+dark). Leaf transcripts from M. alba were sequenced using an Illumina Hiseq 2000 system, which produced 14.27Gb of data including 153,204,462 paired-end reads among the three libraries. We de novo assembled 133,002 transcripts with an average length of 1270bp and filtered 69,728 non-redundant unigenes. A similarity search was performed against the non-redundant National Center of Biotechnology Information (NCBI) protein database, which returned 41.08% hits. Among the 20,040 unigenes annotated in UniProtKB/SwissProt database, 16,683 unigenes were assigned 102,232 gene ontology terms and 6667 unigenes were identified in 287 known metabolic pathways. Results of differential gene expression analysis together with real-time quantitative PCR tests indicated that UVB irradiation with dark incubation enhanced the flavonoid biosynthesis in M. alba leaf. Our findings provided a valuable proof for a better understanding of the metabolic mechanism under abiotic stresses in M. alba leaf. Copyright © 2017 Elsevier B.V. All rights reserved.
De novo assembly and transcriptomic profiling of the grazing response in Stipa grandis.
Wan, Dongli; Wan, Yongqing; Hou, Xiangyang; Ren, Weibo; Ding, Yong; Sa, Rula
2015-01-01
Stipa grandis (Poaceae) is one of the dominant species in a typical steppe of the Inner Mongolian Plateau. However, primarily due to heavy grazing, the grasslands have become seriously degraded, and S. grandis has developed a special growth-inhibition phenotype against the stressful habitat. Because of the lack of transcriptomic and genomic information, the understanding of the molecular mechanisms underlying the grazing response of S. grandis has been prohibited. Using the Illumina HiSeq 2000 platform, two libraries prepared from non-grazing (FS) and overgrazing samples (OS) were sequenced. De novo assembly produced 94,674 unigenes, of which 65,047 unigenes had BLAST hits in the National Center for Biotechnology Information (NCBI) non-redundant (nr) database (E-value < 10-5). In total, 47,747, 26,156 and 40,842 unigenes were assigned to the Gene Ontology (GO), Clusters of Orthologous Group (COG), and Kyoto Encyclopedia of Genes and Genomes (KEGG) databases, respectively. A total of 13,221 unigenes showed significant differences in expression under the overgrazing condition, with a threshold false discovery rate ≤ 0.001 and an absolute value of log2Ratio ≥ 1. These differentially expressed genes (DEGs) were assigned to 43,257 GO terms and were significantly enriched in 32 KEGG pathways (q-value ≤ 0.05). The alterations in the wound-, drought- and defense-related genes indicate that stressors have an additive effect on the growth inhibition of this species. This first large-scale transcriptome study will provide important information for further gene expression and functional genomics studies, and it facilitated our investigation of the molecular mechanisms of the S. grandis grazing response and the associated morphological and physiological characteristics.
De novo Assembly and Transcriptomic Profiling of the Grazing Response in Stipa grandis
Hou, Xiangyang; Ren, Weibo; Ding, Yong; Sa, Rula
2015-01-01
Background Stipa grandis (Poaceae) is one of the dominant species in a typical steppe of the Inner Mongolian Plateau. However, primarily due to heavy grazing, the grasslands have become seriously degraded, and S. grandis has developed a special growth-inhibition phenotype against the stressful habitat. Because of the lack of transcriptomic and genomic information, the understanding of the molecular mechanisms underlying the grazing response of S. grandis has been prohibited. Results Using the Illumina HiSeq 2000 platform, two libraries prepared from non-grazing (FS) and overgrazing samples (OS) were sequenced. De novo assembly produced 94,674 unigenes, of which 65,047 unigenes had BLAST hits in the National Center for Biotechnology Information (NCBI) non-redundant (nr) database (E-value < 10-5). In total, 47,747, 26,156 and 40,842 unigenes were assigned to the Gene Ontology (GO), Clusters of Orthologous Group (COG), and Kyoto Encyclopedia of Genes and Genomes (KEGG) databases, respectively. A total of 13,221 unigenes showed significant differences in expression under the overgrazing condition, with a threshold false discovery rate ≤ 0.001 and an absolute value of log2Ratio ≥ 1. These differentially expressed genes (DEGs) were assigned to 43,257 GO terms and were significantly enriched in 32 KEGG pathways (q-value ≤ 0.05). The alterations in the wound-, drought- and defense-related genes indicate that stressors have an additive effect on the growth inhibition of this species. Conclusions This first large-scale transcriptome study will provide important information for further gene expression and functional genomics studies, and it facilitated our investigation of the molecular mechanisms of the S. grandis grazing response and the associated morphological and physiological characteristics. PMID:25875617
An, Jing; Shen, Xuefeng; Ma, Qibin; Yang, Cunyi; Liu, Simin; Chen, Yong
2014-01-01
Goosegrass (Eleusine indica L.), a serious annual weed in the world, has evolved resistance to several herbicides including paraquat, a non-selective herbicide. The mechanism of paraquat resistance in weeds is only partially understood. To further study the molecular mechanism underlying paraquat resistance in goosegrass, we performed transcriptome analysis of susceptible and resistant biotypes of goosegrass with or without paraquat treatment. The RNA-seq libraries generated 194,716,560 valid reads with an average length of 91.29 bp. De novo assembly analysis produced 158,461 transcripts with an average length of 1153.74 bp and 100,742 unigenes with an average length of 712.79 bp. Among these, 25,926 unigenes were assigned to 65 GO terms that contained three main categories. A total of 13,809 unigenes with 1,208 enzyme commission numbers were assigned to 314 predicted KEGG metabolic pathways, and 12,719 unigenes were categorized into 25 KOG classifications. Furthermore, our results revealed that 53 genes related to reactive oxygen species scavenging, 10 genes related to polyamines and 18 genes related to transport were differentially expressed in paraquat treatment experiments. The genes related to polyamines and transport are likely potential candidate genes that could be further investigated to confirm their roles in paraquat resistance of goosegrass. This is the first large-scale transcriptome sequencing of E. indica using the Illumina platform. Potential genes involved in paraquat resistance were identified from the assembled sequences. The transcriptome data may serve as a reference for further analysis of gene expression and functional genomics studies, and will facilitate the study of paraquat resistance at the molecular level in goosegrass.
An, Jing; Shen, Xuefeng; Ma, Qibin; Yang, Cunyi; Liu, Simin; Chen, Yong
2014-01-01
Background Goosegrass (Eleusine indica L.), a serious annual weed in the world, has evolved resistance to several herbicides including paraquat, a non-selective herbicide. The mechanism of paraquat resistance in weeds is only partially understood. To further study the molecular mechanism underlying paraquat resistance in goosegrass, we performed transcriptome analysis of susceptible and resistant biotypes of goosegrass with or without paraquat treatment. Results The RNA-seq libraries generated 194,716,560 valid reads with an average length of 91.29 bp. De novo assembly analysis produced 158,461 transcripts with an average length of 1153.74 bp and 100,742 unigenes with an average length of 712.79 bp. Among these, 25,926 unigenes were assigned to 65 GO terms that contained three main categories. A total of 13,809 unigenes with 1,208 enzyme commission numbers were assigned to 314 predicted KEGG metabolic pathways, and 12,719 unigenes were categorized into 25 KOG classifications. Furthermore, our results revealed that 53 genes related to reactive oxygen species scavenging, 10 genes related to polyamines and 18 genes related to transport were differentially expressed in paraquat treatment experiments. The genes related to polyamines and transport are likely potential candidate genes that could be further investigated to confirm their roles in paraquat resistance of goosegrass. Conclusion This is the first large-scale transcriptome sequencing of E. indica using the Illumina platform. Potential genes involved in paraquat resistance were identified from the assembled sequences. The transcriptome data may serve as a reference for further analysis of gene expression and functional genomics studies, and will facilitate the study of paraquat resistance at the molecular level in goosegrass. PMID:24927422
Li, You-Zhi; Pan, Ying-Hua; Sun, Chang-Bin; Dong, Hai-Tao; Luo, Xing-Lu; Wang, Zhi-Qiang; Tang, Ji-Liang; Chen, Baoshan
2010-12-01
A cDNA library was constructed from the root tissues of cassava variety Huanan 124 at the root bulking stage. A total of 9,600 cDNA clones from the library were sequenced with single-pass from the 5'-terminus to establish a catalogue of expressed sequence tags (ESTs). Assembly of the resulting EST sequences resulted in 2,878 putative unigenes. Blastn analysis showed that 62.6% of the unigenes matched with known cassava ESTs and the rest had no 'hits' against the cassava database in the integrative PlantGDB database. Blastx analysis showed that 1,715 (59.59%) of the unigenes matched with one or more GenBank protein entries and 1,163 (40.41%) had no 'hits'. A cDNA microarray with 2,878 unigenes was developed and used to analyze gene expression profiling of Huanan 124 at key growth stages including seedling, formation of root system, root bulking, and starch maturity. Array data analysis revealed that (1) the higher ratio of up-regulated ribosome-related genes was accompanied by a high ratio of up-regulated ubiquitin, proteasome-related and protease genes in cassava roots; (2) starch formation and degradation simultaneously occur at the early stages of root development but starch degradation is declined partially due to decrease in UDP-glucose dehydrogenase activity with root maturity; (3) starch may also be synthesized in situ in roots; (4) starch synthesis, translocation, and accumulation are also associated probably with signaling pathways that parallel Wnt, LAM, TCS and ErbB signaling pathways in animals; (5) constitutive expression of stress-responsive genes may be due to the adaptation of cassava to harsh environments during long-term evolution.
Gong, Zhen-Hui; Yin, Yan-Xu; Li, Da-Wei
2013-01-01
Low temperature is one of the major factors limiting pepper (Capsicum annuum L.) production during winter and early spring in non-tropical regions. Application of exogenous abscisic acid (ABA) effectively alleviates the symptoms of chilling injury, such as wilting and formation of necrotic lesions on pepper leaves; however, the underlying molecular mechanism is not understood. The aim of this study was to identify genes that are differentially up- or downregulated in ABA-pretreated hot pepper seedlings incubated at 6°C for 48 h, using a suppression subtractive hybridization (SSH) method. A total of 235 high-quality ESTs were isolated, clustered and assembled into a collection of 73 unigenes including 18 contigs and 55 singletons. A total of 37 unigenes (50.68%) showed similarities to genes with known functions in the non-redundant database; the other 36 unigenes (49.32%) showed low similarities or unknown functions. Gene ontology analysis revealed that the 37 unigenes could be classified into nine functional categories. The expression profiles of 18 selected genes were analyzed using quantitative RT-PCR; the expression levels of 10 of these genes were at least two-fold higher in the ABA-pretreated seedlings under chilling stress than water-pretreated (control) plants under chilling stress. In contrast, the other eight genes were downregulated in ABA-pretreated seedlings under chilling stress, with expression levels that were one-third or less of the levels observed in control seedlings under chilling stress. These results suggest that ABA can positively and negatively regulate genes in pepper plants under chilling stress. PMID:23825555
Jiang, Qiang; Yang, Chun Hong; Zhang, Yan; Sun, Yan; Li, Rong Ling; Wang, Chang Fa; Zhong, Ji Feng; Huang, Jin Ming
2016-01-01
Alternative splicing (AS) contributes to the complexity of the mammalian proteome and plays an important role in diseases, including infectious diseases. The differential AS patterns of these transcript sequences between the healthy (HS3A) and mastitic (HS8A) cows naturally infected by Staphylococcus aureus were compared to understand the molecular mechanisms underlying mastitis resistance and susceptibility. In this study, using the Illumina paired-end RNA sequencing method, 1352 differentially expressed genes (DEGs) with higher than twofold changes were found in the HS3A and HS8A mammary gland tissues. Gene ontology and KEGG pathway analyses revealed that the cytokine–cytokine receptor interaction pathway is the most significantly enriched pathway. Approximately 16k annotated unigenes were respectively identified in two libraries, based on the bovine Bos taurus UMD3.1 sequence assembly and search. A total of 52.62% and 51.24% annotated unigenes were alternatively spliced in term of exon skipping, intron retention, alternative 5′ splicing and alternative 3ʹ splicing. Additionally, 1,317 AS unigenes were HS3A-specific, whereas 1,093 AS unigenes were HS8A-specific. Some immune-related genes, such as ITGB6, MYD88, ADA, ACKR1, and TNFRSF1B, and their potential relationships with mastitis were highlighted. From Chromosome 2, 4, 6, 7, 10, 13, 14, 17, and 20, 3.66% (HS3A) and 5.4% (HS8A) novel transcripts, which harbor known quantitative trait locus associated with clinical mastitis, were identified. Many DEGs in the healthy and mastitic mammary glands are involved in immune, defense, and inflammation responses. These DEGs, which exhibit diverse and specific splicing patterns and events, can endow dairy cattle with the potential complex genetic resistance against mastitis. PMID:27459697
Sun, Haiyue; Liu, Yushan; Gai, Yuzhuo; Geng, Jinman; Chen, Li; Liu, Hongdi; Kang, Limin; Tian, Youwen; Li, Yadong
2015-09-02
Cranberries (Vaccinium macrocarpon Ait.), renowned for their excellent health benefits, are an important berry crop. Here, we performed transcriptome sequencing of one cranberry cultivar, from fruits at two different developmental stages, on the Illumina HiSeq 2000 platform. Our main goals were to identify putative genes for major metabolic pathways of bioactive compounds and compare the expression patterns between white fruit (W) and red fruit (R) in cranberry. In this study, two cDNA libraries of W and R were constructed. Approximately 119 million raw sequencing reads were generated and assembled de novo, yielding 57,331 high quality unigenes with an average length of 739 bp. Using BLASTx, 38,460 unigenes were identified as putative homologs of annotated sequences in public protein databases, including NCBI NR, NT, Swiss-Prot, KEGG, COG and GO. Of these, 21,898 unigenes mapped to 128 KEGG pathways, with the metabolic pathways, secondary metabolites, glycerophospholipid metabolism, ether lipid metabolism, starch and sucrose metabolism, purine metabolism, and pyrimidine metabolism being well represented. Among them, many candidate genes were involved in flavonoid biosynthesis, transport and regulation. Furthermore, digital gene expression (DEG) analysis identified 3,257 unigenes that were differentially expressed between the two fruit developmental stages. In addition, 14,473 simple sequence repeats (SSRs) were detected. Our results present comprehensive gene expression information about the cranberry fruit transcriptome that could facilitate our understanding of the molecular mechanisms of fruit development in cranberries. Although it will be necessary to validate the functions carried out by these genes, these results could be used to improve the quality of breeding programs for the cranberry and related species.
AmpuBase: a transcriptome database for eight species of apple snails (Gastropoda: Ampullariidae).
Ip, Jack C H; Mu, Huawei; Chen, Qian; Sun, Jin; Ituarte, Santiago; Heras, Horacio; Van Bocxlaer, Bert; Ganmanee, Monthon; Huang, Xin; Qiu, Jian-Wen
2018-03-05
Gastropoda, with approximately 80,000 living species, is the largest class of Mollusca. Among gastropods, apple snails (family Ampullariidae) are globally distributed in tropical and subtropical freshwater ecosystems and many species are ecologically and economically important. Ampullariids exhibit various morphological and physiological adaptations to their respective habitats, which make them ideal candidates for studying adaptation, population divergence, speciation, and larger-scale patterns of diversity, including the biogeography of native and invasive populations. The limited availability of genomic data, however, hinders in-depth ecological and evolutionary studies of these non-model organisms. Using Illumina Hiseq platforms, we sequenced 1220 million reads for seven species of apple snails. Together with the previously published RNA-Seq data of two apple snails, we conducted de novo transcriptome assembly of eight species that belong to five genera of Ampullariidae, two of which represent Old World lineages and the other three New World lineages. There were 20,730 to 35,828 unigenes with predicted open reading frames for the eight species, with N50 (shortest sequence length at 50% of the unigenes) ranging from 1320 to 1803 bp. 69.7% to 80.2% of these unigenes were functionally annotated by searching against NCBI's non-redundant, Gene Ontology database and the Kyoto Encyclopaedia of Genes and Genomes. With these data we developed AmpuBase, a relational database that features online BLAST functionality for DNA/protein sequences, keyword searching for unigenes/functional terms, and download functions for sequences and whole transcriptomes. In summary, we have generated comprehensive transcriptome data for multiple ampullariid genera and species, and created a publicly accessible database with a user-friendly interface to facilitate future basic and applied studies on ampullariids, and comparative molecular studies with other invertebrates.
NASA Astrophysics Data System (ADS)
Shi, Pengju; Dong, Shihang; Zhang, Huanjun; Wang, Peiliang; Niu, Zhuang; Fang, Yan
2018-03-01
Polybrominated diphenyl ethers (PBDEs) are ubiquitous global pollutants, which are known to have immune, development, reproduction, and endocrine toxicity in aquatic organisms, including bivalves. 2,2',4,4'-Tetrabromodiphenyl ether (BDE-47) is the predominant PBDE congener detected in environmental samples and the tissues of organisms. However, the mechanism of its toxicity remains unclear. In this study, high-throughput sequencing was performed using the clam Mactra veneriformis, a good model for toxicological research, to clarify the transcriptomic response to BDE-47 and the mechanism responsible for the toxicity of BDE-47. The clams were exposed to 5 μg/L BDE-47 for 3 days and the digestive glands were sampled for high-throughput sequencing analysis. We obtained 127 648, 154 225, and 124 985 unigenes by de novo assembly of the control group reads (CG), BDE-47 group reads (BDEG), and control and BDE-47 reads (CG & BDEG), respectively. We annotated 32 176 unigenes from the CG & BDEG reads using the NR database. We categorized 24 401 unigenes into 25 functional COG clusters and 21 749 unigenes were assigned to 259 KEGG pathways. Moreover, 17 625 differentially expressed genes (DEGs) were detected, with 10 028 upregulated DEGs and 7 597 downregulated DEGs. Functional enrichment analysis showed that the DEGs were involved with detoxification, antioxidant defense, immune response, apoptosis, and other functions. The mRNA expression levels of 26 DEGs were verified by quantitative real-time PCR, which demonstrated the high agreement between the two methods. These results provide a good basis for future research using the M. veneriformis model into the mechanism of PBDEs toxicity and molecular biomarkers for BDE-47 pollution. The regulation and interaction of the DEGs would be studied in the future for clarifying the mechanism of PBDEs toxicity.
Yan, Xiuqin; Zhang, Xue; Lu, Min; He, Yong; An, Huaming
2015-04-25
Rosa roxburghii Tratt. is a well-known ornamental rose species native to China. In addition, the fruits of this species are valued for their nutritional and medicinal characteristics, especially their high ascorbic acid (AsA) levels. Nevertheless, AsA biosynthesis in R. roxburghii fruit has not been explored in detail because of a lack of genomic resources for this species. High-throughput transcriptomic sequencing generating large volumes of transcript sequence data can aid in gene discovery and molecular marker development. In this study, we generated more than 53 million clean reads using Illumina paired-end sequencing technology. De novo assembly yielded 106,590 unigenes, with an average length of 343 bp. On the basis of sequence similarity to known proteins, 9301 and 2393 unigenes were classified into Gene Ontology and Clusters of Orthologous Group categories, respectively. There were 7480 unigenes assigned to 124 pathways in the Kyoto Encyclopedia of Gene and Genome pathway database. BLASTx searches identified 498 unique putative transcripts encoding various transcription factors, some known to regulate fruit development. qRT-PCR validated the expressions of most of the genes encoding the main enzymes involved in ascorbate biosynthesis. In addition, 9131 potential simple sequence repeat (SSR) loci were identified among the unigenes. One hundred and two primer pairs were synthesized and 71 pairs produced an amplification product during initial screening. Among the amplified products, 30 were polymorphic in the 16 R. roxburghii germplasms tested. Our study was the first to produce a large volume of transcriptome data from R. roxburghii. The resulting sequence collection is a valuable resource for gene discovery and marker-assisted selective breeding in this rose species. Copyright © 2015 Elsevier B.V. All rights reserved.
Wang, Wenlei; Li, Huanqin; Lin, Xiangzhi; Yang, Shanjun; Wang, Zhaokai; Fang, Baishan
2015-12-11
Tissue culture could solve the problems associated with Gracilaria cultivation, including the consistent supply of high-quality seed stock, strain improvement, and efficient mass culture of high-yielding commercial strains. However, STC lags behind that of higher plants because of the paucity of genomic information. Transcriptome analysis and the identification of potential unigenes involved in the formation and regeneration of callus or direct induction of ABs are essential. Herein, the CK, EWAB and NPA G. lichenoides transcriptomes were analyzed using the Illumina sequencing platform in first time. A total of 17,922,453,300 nucleotide clean bases were generated and assembled into 21,294 unigenes, providing a total gene space of 400,912,038 nucleotides with an average length of 1,883 and N 50 of 5,055 nucleotides and a G + C content of 52.02%. BLAST analysis resulted in the assignment of 13,724 (97.5%), 3,740 (26.6%), 9,934 (70.6%), 10,611 (75.4%), 9,490 (67.4%), and 7,773 (55.2%) unigenes were annotated to the NR, NT, Swiss-Prot, KEGG, COG, and GO databases, respectively, and the total of annotated unigenes was 14,070. A total of 17,099 transcripts were predicted to possess open reading frames, including 3,238 predicted and 13,861 blasted based on protein databases. In addition, 3,287 SSRs were detected in G.lichenoides, providing further support for genetic variation and marker-assisted selection in the future. Our results suggest that auxin polar transport, auxin signal transduction, crosstalk with other endogenous plant hormones and antioxidant systems, play important roles for ABs formation in G. lichenoides explants in vitro. The present findings will facilitate further studies on gene discovery and on the molecular mechanisms underlying the tissue culture of seaweed.
Fukushima, Atsushi; Nakamura, Michimi; Suzuki, Hideyuki; Yamazaki, Mami; Knoch, Eva; Mori, Tetsuya; Umemoto, Naoyuki; Morita, Masaki; Hirai, Go; Sodeoka, Mikiko; Saito, Kazuki
2016-01-01
The genus Physalis in the Solanaceae family contains several species of benefit to humans. Examples include P. alkekengi (Chinese-lantern plant, hôzuki in Japanese) used for medicinal and for decorative purposes, and P. peruviana, also known as Cape gooseberry, which bears an edible, vitamin-rich fruit. Members of the Physalis genus are a valuable resource for phytochemicals needed for the development of medicines and functional foods. To fully utilize the potential of these phytochemicals we need to understand their biosynthesis, and for this we need genomic data, especially comprehensive transcriptome datasets for gene discovery. We report the de novo assembly of the transcriptome from leaves of P. alkekengi and P. peruviana using Illumina RNA-seq technologies. We identified 75,221 unigenes in P. alkekengi and 54,513 in P. peruviana. All unigenes were annotated with gene ontology (GO), Enzyme Commission (EC) numbers, and pathway information from the Kyoto Encyclopedia of Genes and Genomes (KEGG). We classified unigenes encoding enzyme candidates putatively involved in the secondary metabolism and identified more than one unigenes for each step in terpenoid backbone- and steroid biosynthesis in P. alkekengi and P. peruviana. To measure the variability of the withanolides including physalins and provide insights into their chemical diversity in Physalis, we also analyzed the metabolite content in leaves of P. alkekengi and P. peruviana at five different developmental stages by liquid chromatography-mass spectrometry. We discuss that comprehensive transcriptome approaches within a family can yield a clue for gene discovery in Physalis and provide insights into their complex chemical diversity. The transcriptome information we submit here will serve as an important public resource for further studies of the specialized metabolism of Physalis species. PMID:28066454
Fukushima, Atsushi; Nakamura, Michimi; Suzuki, Hideyuki; Yamazaki, Mami; Knoch, Eva; Mori, Tetsuya; Umemoto, Naoyuki; Morita, Masaki; Hirai, Go; Sodeoka, Mikiko; Saito, Kazuki
2016-01-01
The genus Physalis in the Solanaceae family contains several species of benefit to humans. Examples include P. alkekengi (Chinese-lantern plant, hôzuki in Japanese) used for medicinal and for decorative purposes, and P. peruviana , also known as Cape gooseberry, which bears an edible, vitamin-rich fruit. Members of the Physalis genus are a valuable resource for phytochemicals needed for the development of medicines and functional foods. To fully utilize the potential of these phytochemicals we need to understand their biosynthesis, and for this we need genomic data, especially comprehensive transcriptome datasets for gene discovery. We report the de novo assembly of the transcriptome from leaves of P. alkekengi and P. peruviana using Illumina RNA-seq technologies. We identified 75,221 unigenes in P. alkekengi and 54,513 in P. peruviana . All unigenes were annotated with gene ontology (GO), Enzyme Commission (EC) numbers, and pathway information from the Kyoto Encyclopedia of Genes and Genomes (KEGG). We classified unigenes encoding enzyme candidates putatively involved in the secondary metabolism and identified more than one unigenes for each step in terpenoid backbone- and steroid biosynthesis in P. alkekengi and P. peruviana . To measure the variability of the withanolides including physalins and provide insights into their chemical diversity in Physalis , we also analyzed the metabolite content in leaves of P. alkekengi and P. peruviana at five different developmental stages by liquid chromatography-mass spectrometry. We discuss that comprehensive transcriptome approaches within a family can yield a clue for gene discovery in Physalis and provide insights into their complex chemical diversity. The transcriptome information we submit here will serve as an important public resource for further studies of the specialized metabolism of Physalis species.
Jiang, Ni-Hao; Zhang, Guang-Hui; Zhang, Jia-Jin; Shu, Li-Ping; Zhang, Wei; Long, Guang-Qiang; Liu, Tao; Meng, Zheng-Gui; Chen, Jun-Wen; Yang, Sheng-Chao
2014-01-01
Erigeron breviscapus (Vant.) Hand-Mazz. is a famous medicinal plant. Scutellarin and chlorogenic acids are the primary active components in this herb. However, the mechanisms of biosynthesis and regulation for scutellarin and chlorogenic acids in E. breviscapus are considerably unknown. In addition, genomic information of this herb is also unavailable. Using Illumina sequencing on GAIIx platform, a total of 64,605,972 raw sequencing reads were generated and assembled into 73,092 non-redundant unigenes. Among them, 44,855 unigenes (61.37%) were annotated in the public databases Nr, Swiss-Prot, KEGG, and COG. The transcripts encoding the known enzymes involved in flavonoids and in chlorogenic acids biosynthesis were discovered in the Illumina dataset. Three candidate cytochrome P450 genes were discovered which might encode flavone 6-hydroase converting apigenin to scutellarein. Furthermore, 4 unigenes encoding the homologues of maize P1 (R2R3-MYB transcription factors) were defined, which might regulate the biosynthesis of scutellarin. Additionally, a total of 11,077 simple sequence repeat (SSR) were identified from 9,255 unigenes. Of SSRs, tri-nucleotide motifs were the most abundant motif. Thirty-six primer pairs for SSRs were randomly selected for validation of the amplification and polymorphism. The result revealed that 34 (94.40%) primer pairs were successfully amplified and 19 (52.78%) primer pairs exhibited polymorphisms. Using next generation sequencing (NGS) technology, this study firstly provides abundant genomic data for E. breviscapus. The candidate genes involved in the biosynthesis and transcriptional regulation of scutellarin and chlorogenic acids were obtained in this study. Additionally, a plenty of genetic makers were generated by identification of SSRs, which is a powerful tool for molecular breeding and genetics applications in this herb.
Xue, Shuxia; Liu, Yichen; Zhang, Yichen; Sun, Yan; Geng, Xuyun; Sun, Jinsheng
2013-01-01
White spot syndrome virus (WSSV) is a causative pathogen found in most shrimp farming areas of the world and causes large economic losses to the shrimp aquaculture. The mechanism underlying the molecular pathogenesis of the highly virulent WSSV remains unknown. To better understand the virus-host interactions at the molecular level, the transcriptome profiles in hemocytes of unchallenged and WSSV-challenged shrimp (Litopenaeus vannamei) were compared using a short-read deep sequencing method (Illumina). RNA-seq analysis generated more than 25.81 million clean pair end (PE) reads, which were assembled into 52,073 unigenes (mean size = 520 bp). Based on sequence similarity searches, 23,568 (45.3%) genes were identified, among which 6,562 and 7,822 unigenes were assigned to gene ontology (GO) categories and clusters of orthologous groups (COG), respectively. Searches in the Kyoto Encyclopedia of Genes and Genomes Pathway database (KEGG) mapped 14,941 (63.4%) unigenes to 240 KEGG pathways. Among all the annotated unigenes, 1,179 were associated with immune-related genes. Digital gene expression (DGE) analysis revealed that the host transcriptome profile was slightly changed in the early infection (5 hours post injection) of the virus, while large transcriptional differences were identified in the late infection (48 hpi) of WSSV. The differentially expressed genes mainly involved in pattern recognition genes and some immune response factors. The results indicated that antiviral immune mechanisms were probably involved in the recognition of pathogen-associated molecular patterns. This study provided a global survey of host gene activities against virus infection in a non-model organism, pacific white shrimp. Results can contribute to the in-depth study of candidate genes in white shrimp, and help to improve the current understanding of host-pathogen interactions.
Transcriptomic immune response of Tenebrio molitor pupae to parasitization by Scleroderma guani.
Zhu, Jia-Ying; Yang, Pu; Zhang, Zhong; Wu, Guo-Xing; Yang, Bin
2013-01-01
Host and parasitoid interaction is one of the most fascinating relationships of insects, which is currently receiving an increasing interest. Understanding the mechanisms evolved by the parasitoids to evade or suppress the host immune system is important for dissecting this interaction, while it was still poorly known. In order to gain insight into the immune response of Tenebrio molitor to parasitization by Scleroderma guani, the transcriptome of T. molitor pupae was sequenced with focus on immune-related gene, and the non-parasitized and parasitized T. molitor pupae were analyzed by digital gene expression (DGE) analysis with special emphasis on parasitoid-induced immune-related genes using Illumina sequencing. In a single run, 264,698 raw reads were obtained. De novo assembly generated 71,514 unigenes with mean length of 424 bp. Of those unigenes, 37,373 (52.26%) showed similarity to the known proteins in the NCBI nr database. Via analysis of the transcriptome data in depth, 430 unigenes related to immunity were identified. DGE analysis revealed that parasitization by S. guani had considerable impacts on the transcriptome profile of T. molitor pupae, as indicated by the significant up- or down-regulation of 3,431 parasitism-responsive transcripts. The expression of a total of 74 unigenes involved in immune response of T. molitor was significantly altered after parasitization. obtained T. molitor transcriptome, in addition to establishing a fundamental resource for further research on functional genomics, has allowed the discovery of a large group of immune genes that might provide a meaningful framework to better understand the immune response in this species and other beetles. The DGE profiling data provides comprehensive T. molitor immune gene expression information at the transcriptional level following parasitization, and sheds valuable light on the molecular understanding of the host-parasitoid interaction.
Zhang, Yanzhao; Xu, Shuzhen; Cheng, Yanwei; Peng, Zhengfeng; Han, Jianming
2018-01-01
Red leaf lettuce ( Lactuca sativa L.) is popular due to its high anthocyanin content, but poor leaf coloring often occurs under low light intensity. In order to reveal the mechanisms of anthocyanins affected by light intensity, we compared the transcriptome of L. sativa L. var. capitata under light intensities of 40 and 100 μmol m -2 s -1 . A total of 62,111 unigenes were de novo assembled with an N50 of 1,681 bp, and 48,435 unigenes were functionally annotated in public databases. A total of 3,899 differentially expressed genes (DEGs) were detected, of which 1,377 unigenes were up-regulated and 2,552 unigenes were down-regulated in the high light samples. By Kyoto Encyclopedia of Genes and Genomes enrichment analysis, the DEGs were significantly enriched in 14 pathways. Using gene annotation and phylogenetic analysis, we identified seven anthocyanin structural genes, including CHS , CHI , F3H , F3'H , DFR , ANS , and 3GT , and two anthocyanin transport genes, GST and MATE . In terms of anthocyanin regulatory genes, five MYBs and one bHLH gene were identified. An HY5 gene was discovered, which may respond to light-signaling and regulate anthocyanin structural genes. These genes showed a log2FC of 2.7-9.0 under high irradiance, and were validated using quantitative real-time-PCR. In conclusion, our results indicated transcriptome variance in red leaf lettuce under low and high light intensity, and observed a anthocyanin biosynthesis and regulation pattern. The data should further help to unravel the molecular mechanisms of anthocyanins influenced by light intensity.
Hayat Topcu; Nergiz Coban; Keith Woeste; Mehmet Sutyemez; Salih Kafkas
2015-01-01
We attempted to develop new polymorphic SSR primer pairs in walnut using sequences derived from Juglans nigra L. genomic enriched library with GA repeat. The designed 94 SSR primer pairs were subjected to gradient PCR in 12 walnut cultivars to determine their optimum annealing temperatures and to determine whether they produce bands. Then, the...
Microsatellite analysis of medfly bioinfestations in California.
Bonizzoni, M; Zheng, L; Guglielmino, C R; Haymer, D S; Gasperi, G; Gomulski, L M; Malacrida, A R
2001-10-01
The Mediterranean fruit fly, Ceratitis capitata, is a destructive agricultural pest with a long history of invasion success. This pest has been affecting different regions of the United States for the past 30 years, but a number of studies of medfly bioinfestations has focused on the situation in California. Although some progress has been made in terms of establishing the origin of infestations, the overall status of this pest in this area remains controversial. Specifically, do flies captured over the years represent independent infestations or the persistence of a resident population? We present an effort to answer this question based on the use of multilocus genotyping. Ten microsatellite loci were used to analyse 109 medflies captured in several infestations within California between 1992 and 1998. Using these same markers, 242 medflies from regions of the world having 'established' populations of this pest including Hawaii, Guatemala, El Salvador, Ecuador, Brazil, Argentina and Peru, were also analysed. Although phylogenetic analysis, amova analysis, the IMMANC assignment test and geneclass exclusion test analysis suggest that some of the medflies captured in California are derived from independent invasion events, analysis of specimens from the Los Angeles basin provides support for the hypothesis that an endemic population, probably derived from Guatemala, has been established.
A caprine chimera produced by injection of embryonic germ cells into a blastocyst.
Jia, W; Yang, W; Lei, A; Gao, Z; Yang, C; Hua, J; Huang, W; Ma, X; Wang, H; Dou, Z
2008-02-01
This report details a chimeric goat derived by injecting caprine embryonic germ (EG) cells into a host blastocyst. The EG cells, isolated from the primordial genital ridge of white Guanzhong goat fetuses (28-42 days of pregnancy), had alkaline phosphatase activity and several stem cell markers, including SSEA-1, c-kit, and Nanog. Ten to 20EG cells were microinjected into the blastocoelic cavity of a host blastocyst collected from a black goat following natural service. Twenty-nine injected blastocysts were transferred into nine white surrogate goats. One of the recipients maintained pregnancy to term and gave birth to three kids: one male, one female, and a dead, malformed fetus of undetermined gender; all three fetuses were black, but the female and the malformed fetus each had a large white spot on their head. Based on PCR and microsatellite DNA assay, the female and the malformed fetus were monozygotic twins and chimeras. Microsatellite assay on various tissues from the dead fetus (including skin, blood, liver, placenta, lung, heart, spleen, muscle, and brain), revealed that these tissues and organs were chimeric and contained cells derived from EG cells. In conclusion, caprine EG cells differentiated into all three germ layers in vivo.
Low abundance of microsatellite repeats in the genome of the Brown-headed Cowbird (Molothrus ater)
Longmire, Jonathan L.; Hahn, D.C.; Roach, J.L.
1999-01-01
A cosmid library made from brown-headed cowbird (Molothrus ater) DNA was examined for representation of 17 distinct microsatellite motifs including all possible mono-, di-, and trinucleotide microsatellites, and the tetranucleotide repeat (GATA)n. The overall density of microsatellites within cowbird DNA was found to be one repeat per 89 kb and the frequency of the most abundant motif, (AGC)n, was once every 382 kb. The abundance of microsatellites within the cowbird genome is estimated to be reduced approximately 15-fold compared to humans. The reduced frequency of microsatellites seen in this study is consistent with previous observations indicating reduced numbers of microsatellites and other interspersed repeats in avian DNA. In addition to providing new information concerning the abundance of microsatellites within an avian genome, these results provide useful insights for selecting cloning strategies that might be used in the development of locus-specific microsatellite markers for avian studies.
NASA Astrophysics Data System (ADS)
Li, Qi; Akihiro, Kijima
2007-01-01
The microsatellite-enriched library was constructed using magnetic bead hybridization selection method, and the microsatellite DNA sequences were analyzed in Pacific abalone Haliotis discus hannai. Three hundred and fifty white colonies were screened using PCR-based technique, and 84 clones were identified to potentially contain microsatellite repeat motif. The 84 clones were sequenced, and 42 microsatellites and 4 minisatellites with a minimum of five repeats were found (13.1% of white colonies screened). Besides the motif of CA contained in the oligoprobe, we also found other 16 types of microsatellite repeats including a dinucleotide repeat, two tetranucleotide repeats, twelve pentanucleotide repeats and a hexanucleotide repeat. According to Weber (1990), the microsatellite sequences obtained could be categorized structurally into perfect repeats (73.3%), imperfect repeats (13.3%), and compound repeats (13.4%). Among the microsatellite repeats, relatively short arrays (<20 repeats) were most abundant, accounting for 75.0%. The largest length of microsatellites was 48 repeats, and the average number of repeats was 13.4. The data on the composition and length distribution of microsatellites obtained in the present study can be useful for choosing the repeat motifs for microsatellite isolation in other abalone species.
Genetics and epigenetics of small bowel adenocarcinoma: the interactions of CIN, MSI, and CIMP.
Warth, Arne; Kloor, Matthias; Schirmacher, Peter; Bläker, Hendrik
2011-04-01
Characterization of tumor genetics and epigenetics allows to stratify a tumor entity according to molecular pathways and may shed light on the interactions of different types of DNA alterations during tumorigenesis. Small intestinal adenocarcinoma is rare, and to date the interrelation of genomic instability and epigenetics has not been investigated in this tumor type. We therefore analyzed 37 primary small bowel carcinomas with known microsatellite instability and KRAS status for chromosomal instability using comparative genomic hybridization, for the presence of aberrant methylation (CpG island methylation phenotype) by methylation-specific polymerase chain reaction, and for BRAF mutations. Chromosomal instability was detected in 22 of 37 (59%) tumors (3 of 9 microsatellite instable, and 19 of 28 microsatellite stable carcinomas). Nine carcinomas (24%) were microsatellite and chromosomally stable. High-level DNA methylation was found in 16% of chromosomal instable tumors and in 44% of both microsatellite instable and microsatellite and chromosomally stable carcinomas. KRAS was mutated in 55, 0, and 10% of chromosomal instable, microsatellite instable, and microsatellite and chromosomally stable tumors, respectively whereas the frequencies of BRAF mutations were 6% for chromosomal instable and 22% for both microsatellite instable and microsatellite and chromosomally stable carcinomas. In conclusion, in this study we show that chromosomal instable carcinomas of the small intestine are distinguished from microsatellite instable and microsatellite and chromosomally stable tumors by a high frequency of KRAS mutations, low frequencies of CpG island methylation phenotype, and BRAF mutations. In microsatellite instable and microsatellite and chromosomally stable cancers, CpG island methylation phenotype and BRAF/KRAS mutations are similarly distributed, indicating common mechanisms of tumor initiation or progression in their molecular pathogenesis.
[Locus HS.633957 expression in human gastrointestinal tract and tumors].
Polev, D E; Krukovskaia, L L; Kozlov, A P
2011-01-01
Human locus HS.633957 corresponds to its namesake cluster in the UniGene database http:/www.ncbi.nlm.nih.gov/unigene. It is located on chromosome 7 and is 3.7 tpn in size. It does not seem to encode proteins nor has its function been identified. According to bioinformation evidence, its expression is tumor-specific. PCR assay on kDNA samples from different intact human tissues detected its slight expression in liver, heart, embryonal brain and kidney as well as in a wide spectrum of tumors. This work features locus Hs.633957 expression in different parts of human gastrointestinal tract and tumors.
Transcriptome Analysis and Development of SSR Molecular Markers in Glycyrrhiza uralensis Fisch.
Liu, Yaling; Zhang, Pengfei; Song, Meiling; Hou, Junling; Qing, Mei; Wang, Wenquan; Liu, Chunsheng
2015-01-01
Licorice is an important traditional Chinese medicine with clinical and industrial applications. Genetic resources of licorice are insufficient for analysis of molecular biology and genetic functions; as such, transcriptome sequencing must be conducted for functional characterization and development of molecular markers. In this study, transcriptome sequencing on the Illumina HiSeq 2500 sequencing platform generated a total of 5.41 Gb clean data. De novo assembly yielded a total of 46,641 unigenes. Comparison analysis using BLAST showed that the annotations of 29,614 unigenes were conserved. Further study revealed 773 genes related to biosynthesis of secondary metabolites of licorice, 40 genes involved in biosynthesis of the terpenoid backbone, and 16 genes associated with biosynthesis of glycyrrhizic acid. Analysis of unigenes larger than 1 Kb with a length of 11,702 nt presented 7,032 simple sequence repeats (SSR). Sixty-four of 69 randomly designed and synthesized SSR pairs were successfully amplified, 33 pairs of primers were polymorphism in in Glycyrrhiza uralensis Fisch., Glycyrrhiza inflata Bat., Glycyrrhiza glabra L. and Glycyrrhiza pallidiflora Maxim. This study not only presents the molecular biology data of licorice but also provides a basis for genetic diversity research and molecular marker-assisted breeding of licorice. PMID:26571372
De novo characterization of Lentinula edodes C(91-3) transcriptome by deep Solexa sequencing.
Zhong, Mintao; Liu, Ben; Wang, Xiaoli; Liu, Lei; Lun, Yongzhi; Li, Xingyun; Ning, Anhong; Cao, Jing; Huang, Min
2013-02-01
Lentinula edodes, has been utilized as food, as well as, in popular medicine, moreover, its extract isolated from its mycelium and fruiting body have shown several therapeutic properties. Yet little is understood about its genes involved in these properties, and the absence of L.edodes genomes has been a barrier to the development of functional genomics research. However, high throughput sequencing technologies are now being widely applied to non-model species. To facilitate research on L.edodes, we leveraged Solexa sequencing technology in de novo assembly of L.edodes C(91-3) transcriptome. In a single run, we produced more than 57 million sequencing reads. These reads were assembled into 28,923 unigene sequences (mean size=689bp) including 18,120 unigenes with coding sequence (CDS). Based on similarity search with known proteins, assembled unigene sequences were annotated with gene descriptions, gene ontology (GO) and clusters of orthologous group (COG) terms. Our data provides the first comprehensive sequence resource available for functional genomics studies in L.edodes, and demonstrates the utility of Illumina/Solexa sequencing for de novo transcriptome characterization and gene discovery in a non-model mushroom. Copyright © 2012 Elsevier Inc. All rights reserved.
Gao, Zhengquan; Li, Yan; Wu, Guanxun; Li, Guoqiang; Sun, Haifeng; Deng, Suzhen; Shen, Yicheng; Chen, Guoqiang; Zhang, Ruihao; Meng, Chunxiao; Zhang, Xiaowen
2015-01-01
Haematococcus pluvialis is an astaxanthin-rich microalga that can increase its astaxanthin production by salicylic acid (SA) or jasmonic acid (JA) induction. The genetic transcriptome details of astaxanthin biosynthesis were analyzed by exposing the algal cells to 25 mg/L of SA and JA for 1, 6 and 24 hours, plus to the control (no stress). Based on the RNA-seq analysis, 56,077 unigenes (51.7%) were identified with functions in response to the hormone stress. The top five identified subcategories were cell, cellular process, intracellular, catalytic activity and cytoplasm, which possessed 5600 (~9.99%), 5302 (~9.45%), 5242 (~9.35%), 4407 (~7.86%) and 4195 (~7.48%) unigenes, respectively. Furthermore, 59 unigenes were identified and assigned to 26 putative transcription factors (TFs), including 12 plant-specific TFs. They were likely associated with astaxanthin biosynthesis in Haematococcus upon SA and JA stress. In comparison, the up-regulation of differential expressed genes occurred much earlier, with higher transcript levels in the JA treatment (about 6 h later) than in the SA treatment (beyond 24 h). These results provide valuable information for directing metabolic engineering efforts to improve astaxanthin biosynthesis in H. pluvialis.
2013-01-01
Background Cymbidium sinense belongs to the Orchidaceae, which is one of the most abundant angiosperm families. C. sinense, a high-grade traditional potted flower, is most prevalent in China and some Southeast Asian countries. The control of flowering time is a major bottleneck in the industrialized development of C. sinense. Little is known about the mechanisms responsible for floral development in this orchid. Moreover, genome references for entire transcriptome sequences do not currently exist for C. sinense. Thus, transcriptome and expression profiling data for this species are needed as an important resource to identify genes and to better understand the biological mechanisms of floral development in C. sinense. Results In this study, de novo transcriptome assembly and gene expression analysis using Illumina sequencing technology were performed. Transcriptome analysis assembles gene-related information related to vegetative and reproductive growth of C. sinense. Illumina sequencing generated 54,248,006 high quality reads that were assembled into 83,580 unigenes with an average sequence length of 612 base pairs, including 13,315 clusters and 70,265 singletons. A total of 41,687 (49.88%) unique sequences were annotated, 23,092 of which were assigned to specific metabolic pathways by the Kyoto Encyclopedia of Genes and Genomes (KEGG). Gene Ontology (GO) analysis of the annotated unigenes revealed that the majority of sequenced genes were associated with metabolic and cellular processes, cell and cell parts, catalytic activity and binding. Furthermore, 120 flowering-associated unigenes, 73 MADS-box unigenes and 28 CONSTANS-LIKE (COL) unigenes were identified from our collection. In addition, three digital gene expression (DGE) libraries were constructed for the vegetative phase (VP), floral differentiation phase (FDP) and reproductive phase (RP). The specific expression of many genes in the three development phases was also identified. 32 genes among three sub-libraries with high differential expression were selected as candidates connected with flower development. Conclusion RNA-seq and DGE profiling data provided comprehensive gene expression information at the transcriptional level that could facilitate our understanding of the molecular mechanisms of floral development at three development phases of C. sinense. This data could be used as an important resource for investigating the genetics of the flowering pathway and various biological mechanisms in this orchid. PMID:23617896
Huang, Xianzhong; Yang, Lifei; Jin, Yuhuan; Lin, Jun; Liu, Fang
2017-01-01
Arabidopsis pumila is an ephemeral plant, and a close relative of the model plant Arabidopsis thaliana , but it possesses higher photosynthetic efficiency, higher propagation rate, and higher salinity tolerance compared to those A. thaliana , thus providing a candidate plant system for gene mining for environmental adaption and salt tolerance. However, A. pumila is an under-explored resource for understanding the genetic mechanisms underlying abiotic stress adaptation. To improve our understanding of the molecular and genetic mechanisms of salt stress adaptation, more than 19,900 clones randomly selected from a cDNA library constructed previously from leaf tissue exposed to high-salinity shock were sequenced. A total of 16,014 high-quality expressed sequence tags (ESTs) were generated, which have been deposited in the dbEST GenBank under accession numbers JZ932319 to JZ948332. Clustering and assembly of these ESTs resulted in the identification of 8,835 unique sequences, consisting of 2,469 contigs and 6,366 singletons. The blastx results revealed 8,011 unigenes with significant similarity to known genes, while only 425 unigenes remained uncharacterized. Functional classification demonstrated an abundance of unigenes involved in binding, catalytic, structural or transporter activities, and in pathways of energy, carbohydrate, amino acid, or lipid metabolism. At least seven main classes of genes were related to salt-tolerance among the 8,835 unigenes. Many previously reported salt tolerance genes were also manifested in this library, for example VP1, H + -ATPase, NHX1, SOS2, SOS3, NAC, MYB, ERF, LEA, P5CS1 . In addition, 251 transcription factors were identified from the library, classified into 42 families. Lastly, changes in expression of the 12 most abundant unigenes, 12 transcription factor genes, and 19 stress-related genes in the first 24 h of exposure to high-salinity stress conditions were monitored by qRT-PCR. The large-scale EST library obtained in this study provides first-hand information on gene sequences expressed in young leaves of A. pumila exposed to salt shock. The rapid discovery of known or unknown genes related to salinity stress response in A. pumila will facilitate the understanding of complex adaptive mechanisms for ephemerals.
Schwarz, Jodi A; Brokstein, Peter B; Voolstra, Christian; Terry, Astrid Y; Miller, David J; Szmant, Alina M; Coffroth, Mary Alice; Medina, Mónica
2008-01-01
Background Scleractinian corals are the foundation of reef ecosystems in tropical marine environments. Their great success is due to interactions with endosymbiotic dinoflagellates (Symbiodinium spp.), with which they are obligately symbiotic. To develop a foundation for studying coral biology and coral symbiosis, we have constructed a set of cDNA libraries and generated and annotated ESTs from two species of corals, Acropora palmata and Montastraea faveolata. Results We generated 14,588 (Ap) and 3,854 (Mf) high quality ESTs from five life history/symbiosis stages (spawned eggs, early-stage planula larvae, late-stage planula larvae either infected with symbionts or uninfected, and adult coral). The ESTs assembled into a set of primarily stage-specific clusters, producing 4,980 (Ap), and 1,732 (Mf) unigenes. The egg stage library, relative to the other developmental stages, was enriched in genes functioning in cell division and proliferation, transcription, signal transduction, and regulation of protein function. Fifteen unigenes were identified as candidate symbiosis-related genes as they were expressed in all libraries constructed from the symbiotic stages and were absent from all of the non symbiotic stages. These include several DNA interacting proteins, and one highly expressed unigene (containing 17 cDNAs) with no significant protein-coding region. A significant number of unigenes (25) encode potential pattern recognition receptors (lectins, scavenger receptors, and others), as well as genes that may function in signaling pathways involved in innate immune responses (toll-like signaling, NFkB p105, and MAP kinases). Comparison between the A. palmata and an A. millepora EST dataset identified ferritin as a highly expressed gene in both datasets that appears to be undergoing adaptive evolution. Five unigenes appear to be restricted to the Scleractinia, as they had no homology to any sequences in the nr databases nor to the non-scleractinian cnidarians Nematostella vectensis and Hydra magnipapillata. Conclusion Partial sequencing of 5 cDNA libraries each for A. palmata and M. faveolata has produced a rich set of candidate genes (4,980 genes from A. palmata, and 1,732 genes from M. faveolata) that we can use as a starting point for examining the life history and symbiosis of these two species, as well as to further expand the dataset of cnidarian genes for comparative genomics and evolutionary studies. PMID:18298846
Transcriptomes of Arbuscular Mycorrhizal Fungi and Litchi Host Interaction after Tree Girdling
Shu, Bo; Li, Weicai; Liu, Liqin; Wei, Yongzan; Shi, Shengyou
2016-01-01
Trunk girdling can increase carbohydrate content above the girdling site and is an important strategy for inhibiting new shoot growth to promote flowering in cultivated litchi (Litchi chinensis Sonn.). However, girdling inhibits carbohydrate transport to the root in nearly all of the fruit development periods and consequently decreases root absorption. The mechanism through which carbohydrates regulate root development in arbuscular mycorrhiza (AM) remains largely unknown. Carbohydrate content, AM colonization, and transcriptome in the roots were analyzed to elucidate the interaction between host litchi and AM fungi when carbohydrate content decreases. Girdling decreased glucose, fructose, sucrose, quebrachitol, and starch contents in the litchi mycorrhizal roots, thereby reducing AM colonization. RNA-seq achieved approximately 60 million reads of each sample, with an average length of reads reaching 100 bp. Assembly of all the reads of the 30 samples produced 671,316 transcripts and 381,429 unigenes, with average lengths of 780 and 643 bp, respectively. Litchi (54,100 unigenes) and AM fungi unigenes (33,120 unigenes) were achieved through sequence annotation during decreased carbohydrate content. Analysis of differentially expressed genes (DEG) showed that flavonoids, alpha-linolenic acid, and linoleic acid are the main factors that regulate AM colonization in litchi. However, flavonoids may play a role in detecting the stage at which carbohydrate content decreases; alpha-linolenic acid or linoleic acid may affect AM formation under the adaptation process. Litchi trees stimulated the expression of defense-related genes and downregulated symbiosis signal-transduction genes to inhibit new AM colonization. Moreover, transcription factors of the AP2, ERF, Myb, WRKY, bHLH families, and lectin genes altered maintenance of litchi mycorrhizal roots in the post-symbiotic stage for carbohydrate starvation. Similar to those of the litchi host, the E3 ubiquitin ligase complex SCF subunit scon-3 and polyubiquitin of AM fungi were upregulated at the perceived stages. This occurrence suggested that ubiquitination plays an important role in perceiving carbohydrate decrease in AM fungi. The transcription of cytochrome b-245 and leucine-rich repeat was detected in the DEG database, implying that the transcripts were involved in AM fungal adaptation under carbohydrate starvation. The transcriptome data might suggest novel functions of unigenes in carbohydrate shortage of mycorrhizal roots. PMID:27065972
Zhang, Jianxia; Wu, Kunlin; Zeng, Songjun; Teixeira da Silva, Jaime A; Zhao, Xiaolan; Tian, Chang-En; Xia, Haoqiang; Duan, Jun
2013-04-24
Cymbidium sinense belongs to the Orchidaceae, which is one of the most abundant angiosperm families. C. sinense, a high-grade traditional potted flower, is most prevalent in China and some Southeast Asian countries. The control of flowering time is a major bottleneck in the industrialized development of C. sinense. Little is known about the mechanisms responsible for floral development in this orchid. Moreover, genome references for entire transcriptome sequences do not currently exist for C. sinense. Thus, transcriptome and expression profiling data for this species are needed as an important resource to identify genes and to better understand the biological mechanisms of floral development in C. sinense. In this study, de novo transcriptome assembly and gene expression analysis using Illumina sequencing technology were performed. Transcriptome analysis assembles gene-related information related to vegetative and reproductive growth of C. sinense. Illumina sequencing generated 54,248,006 high quality reads that were assembled into 83,580 unigenes with an average sequence length of 612 base pairs, including 13,315 clusters and 70,265 singletons. A total of 41,687 (49.88%) unique sequences were annotated, 23,092 of which were assigned to specific metabolic pathways by the Kyoto Encyclopedia of Genes and Genomes (KEGG). Gene Ontology (GO) analysis of the annotated unigenes revealed that the majority of sequenced genes were associated with metabolic and cellular processes, cell and cell parts, catalytic activity and binding. Furthermore, 120 flowering-associated unigenes, 73 MADS-box unigenes and 28 CONSTANS-LIKE (COL) unigenes were identified from our collection. In addition, three digital gene expression (DGE) libraries were constructed for the vegetative phase (VP), floral differentiation phase (FDP) and reproductive phase (RP). The specific expression of many genes in the three development phases was also identified. 32 genes among three sub-libraries with high differential expression were selected as candidates connected with flower development. RNA-seq and DGE profiling data provided comprehensive gene expression information at the transcriptional level that could facilitate our understanding of the molecular mechanisms of floral development at three development phases of C. sinense. This data could be used as an important resource for investigating the genetics of the flowering pathway and various biological mechanisms in this orchid.
Yang, Wei; Yang, Chunping; Lu, Lin; Chen, Zhangming
2017-01-01
Cyrtotrachelus buqueti is an extremely harmful bamboo borer, and the larvae of this pest attack clumping bamboo shoots. Pheromone-binding proteins (PBPs) play an important role in identifying insect sex pheromones, but the C. buqueti genome is not readily available for PBP analysis. Developmental transcriptomes of eggs, larvae from the first instar to the prepupal stage, pupae, and adults (females and males) from emergence to mating were built by RNA sequencing (RNA-Seq) in the present study to establish a sequence background of C. buqueti to help understand PBPs. Approximately 164.8 million clean reads were obtained and annotated into 108,854 transcripts. These were assembled into 24,338, 21,597, 24,798, 21,886, 24,642, and 83,115 unigenes for eggs, larvae, pupae, females, males, and the combined datasets, respectively. Unigenes were annotated against NCBI non-redundant protein sequences, NCBI non-redundant nucleotide sequences, Gene Ontology (GO), Protein family, Clusters of Orthologous Groups of Proteins/ Clusters of Eukaryotic Orthologous Groups (KOG), Swiss-Prot, and KEGG Orthology databases. A total of 17,213 unigenes were annotated into 55 sub-categories belonging to three main GO categories; 10,672 unigenes were classified into 26 functional categories by KOG classification, and 8,063 unigenes were classified into five functional KEGG categories. RSEM software for RNA sequencing showed that 4,816, 3,176, 3,661, 2,898, 4,316, 8,019, 7,273, 5,922, 5,844, and 4,570 genes were differentially expressed between larvae and males, larvae and eggs, larvae and pupae, larvae and females, males and females, males and eggs, males and pupae, females and eggs, females and pupae, and eggs and pupae, respectively. Of these, three were confirmed to be significantly differentially expressed between larvae, females, and males. Furthermore, PBP Cbuq7577_g1 was highly expressed in the antenna of males. A comprehensive sequence resource of a desirable quality was constructed from developmental transcriptomes of C. buqueti eggs, larvae, pupae, and adults. This work enriches the genomic data of C. buqueti, and facilitates our understanding of its metamorphosis, development, and response to environmental change. The identified candidate PBP Cbuq7577_g1 might play a crucial role in identifying sex pheromones, and could be used as a targeted gene to control C. buqueti numbers by disrupting sex pheromone communication. PMID:28662071
Schwarz, Jodi A.; Brokstein, Peter B.; Voolstra, Christian R.; ...
2008-02-25
Scleractinian corals are the foundation of reef ecosystems in tropical marine environments. Their great success is due to interactions with endosymbiotic dinoflagellates (Symbiodinium spp.), with which they are obligately symbiotic. To develop a foundation for studying coral biology and coral symbiosis, we have constructed a set of cDNA libraries and generated and annotated ESTs from two species of corals, Acropora palmata and Montastraea faveolata. Here we generated 14,588 (Ap) and 3,854 (Mf) high quality ESTs from five life history/symbiosis stages (spawned eggs, early-stage planula larvae, late-stage planula larvae either infected with symbionts or uninfected, and adult coral). The ESTs assembledmore » into a set of primarily stage-specific clusters, producing 4,980 (Ap), and 1,732 (Mf) unigenes. The egg stage library, relative to the other developmental stages, was enriched in genes functioning in cell division and proliferation, transcription, signal transduction, and regulation of protein function. Fifteen unigenes were identified as candidate symbiosis-related genes as they were expressed in all libraries constructed from the symbiotic stages and were absent from all of the non symbiotic stages. These include several DNA interacting proteins, and one highly expressed unigene (containing 17 cDNAs) with no significant protein-coding region. A significant number of unigenes (25) encode potential pattern recognition receptors (lectins, scavenger receptors, and others), as well as genes that may function in signaling pathways involved in innate immune responses (toll-like signaling, NFkB p105, and MAP kinases). Comparison between the A. palmata and an A. millepora EST dataset identified ferritin as a highly expressed gene in both datasets that appears to be undergoing adaptive evolution. Five unigenes appear to be restricted to the Scleractinia, as they had no homology to any sequences in the nr databases nor to the non-scleractinian cnidarians Nematostella vectensis and Hydra magnipapillata. In conclusion, partial sequencing of 5 cDNA libraries each for A. palmata and M. faveolata has produced a rich set of candidate genes (4,980 genes from A. palmata, and 1,732 genes from M. faveolata) that we can use as a starting point for examining the life history and symbiosis of these two species, as well as to further expand the dataset of cnidarian genes for comparative genomics and evolutionary studies.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Shi, CY; Yang, H; Wei, CL
Tea is one of the most popular non-alcoholic beverages worldwide. However, the tea plant, Camellia sinensis, is difficult to culture in vitro, to transform, and has a large genome, rendering little genomic information available. Recent advances in large-scale RNA sequencing (RNA-seq) provide a fast, cost-effective, and reliable approach to generate large expression datasets for functional genomic analysis, which is especially suitable for non-model species with un-sequenced genomes. Using high-throughput Illumina RNA-seq, the transcriptome from poly (A){sup +} RNA of C. sinensis was analyzed at an unprecedented depth (2.59 gigabase pairs). Approximate 34.5 million reads were obtained, trimmed, and assembled intomore » 127,094 unigenes, with an average length of 355 bp and an N50 of 506 bp, which consisted of 788 contig clusters and 126,306 singletons. This number of unigenes was 10-fold higher than existing C. sinensis sequences deposited in GenBank (as of August 2010). Sequence similarity analyses against six public databases (Uniprot, NR and COGs at NCBI, Pfam, InterPro and KEGG) found 55,088 unigenes that could be annotated with gene descriptions, conserved protein domains, or gene ontology terms. Some of the unigenes were assigned to putative metabolic pathways. Targeted searches using these annotations identified the majority of genes associated with several primary metabolic pathways and natural product pathways that are important to tea quality, such as flavonoid, theanine and caffeine biosynthesis pathways. Novel candidate genes of these secondary pathways were discovered. Comparisons with four previously prepared cDNA libraries revealed that this transcriptome dataset has both a high degree of consistency with previous EST data and an approximate 20 times increase in coverage. Thirteen unigenes related to theanine and flavonoid synthesis were validated. Their expression patterns in different organs of the tea plant were analyzed by RT-PCR and quantitative real time PCR (qRT-PCR). An extensive transcriptome dataset has been obtained from the deep sequencing of tea plant. The coverage of the transcriptome is comprehensive enough to discover all known genes of several major metabolic pathways. This transcriptome dataset can serve as an important public information platform for gene expression, genomics, and functional genomic studies in C. sinensis.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Schwarz, Jodi A.; Brokstein, Peter B.; Voolstra, Christian R.
Scleractinian corals are the foundation of reef ecosystems in tropical marine environments. Their great success is due to interactions with endosymbiotic dinoflagellates (Symbiodinium spp.), with which they are obligately symbiotic. To develop a foundation for studying coral biology and coral symbiosis, we have constructed a set of cDNA libraries and generated and annotated ESTs from two species of corals, Acropora palmata and Montastraea faveolata. Here we generated 14,588 (Ap) and 3,854 (Mf) high quality ESTs from five life history/symbiosis stages (spawned eggs, early-stage planula larvae, late-stage planula larvae either infected with symbionts or uninfected, and adult coral). The ESTs assembledmore » into a set of primarily stage-specific clusters, producing 4,980 (Ap), and 1,732 (Mf) unigenes. The egg stage library, relative to the other developmental stages, was enriched in genes functioning in cell division and proliferation, transcription, signal transduction, and regulation of protein function. Fifteen unigenes were identified as candidate symbiosis-related genes as they were expressed in all libraries constructed from the symbiotic stages and were absent from all of the non symbiotic stages. These include several DNA interacting proteins, and one highly expressed unigene (containing 17 cDNAs) with no significant protein-coding region. A significant number of unigenes (25) encode potential pattern recognition receptors (lectins, scavenger receptors, and others), as well as genes that may function in signaling pathways involved in innate immune responses (toll-like signaling, NFkB p105, and MAP kinases). Comparison between the A. palmata and an A. millepora EST dataset identified ferritin as a highly expressed gene in both datasets that appears to be undergoing adaptive evolution. Five unigenes appear to be restricted to the Scleractinia, as they had no homology to any sequences in the nr databases nor to the non-scleractinian cnidarians Nematostella vectensis and Hydra magnipapillata. In conclusion, partial sequencing of 5 cDNA libraries each for A. palmata and M. faveolata has produced a rich set of candidate genes (4,980 genes from A. palmata, and 1,732 genes from M. faveolata) that we can use as a starting point for examining the life history and symbiosis of these two species, as well as to further expand the dataset of cnidarian genes for comparative genomics and evolutionary studies.« less
2011-01-01
Background Cucurbita pepo belongs to the Cucurbitaceae family. The "Zucchini" types rank among the highest-valued vegetables worldwide, and other C. pepo and related Cucurbita spp., are food staples and rich sources of fat and vitamins. A broad range of genomic tools are today available for other cucurbits that have become models for the study of different metabolic processes. However, these tools are still lacking in the Cucurbita genus, thus limiting gene discovery and the process of breeding. Results We report the generation of a total of 512,751 C. pepo EST sequences, using 454 GS FLX Titanium technology. ESTs were obtained from normalized cDNA libraries (root, leaves, and flower tissue) prepared using two varieties with contrasting phenotypes for plant, flowering and fruit traits, representing the two C. pepo subspecies: subsp. pepo cv. Zucchini and subsp. ovifera cv Scallop. De novo assembling was performed to generate a collection of 49,610 Cucurbita unigenes (average length of 626 bp) that represent the first transcriptome of the species. Over 60% of the unigenes were functionally annotated and assigned to one or more Gene Ontology terms. The distributions of Cucurbita unigenes followed similar tendencies than that reported for Arabidopsis or melon, suggesting that the dataset may represent the whole Cucurbita transcriptome. About 34% unigenes were detected to have known orthologs of Arabidopsis or melon, including genes potentially involved in disease resistance, flowering and fruit quality. Furthermore, a set of 1,882 unigenes with SSR motifs and 9,043 high confidence SNPs between Zucchini and Scallop were identified, of which 3,538 SNPs met criteria for use with high throughput genotyping platforms, and 144 could be detected as CAPS. A set of markers were validated, being 80% of them polymorphic in a set of variable C. pepo and C. moschata accessions. Conclusion We present the first broad survey of gene sequences and allelic variation in C. pepo, where limited prior genomic information existed. The transcriptome provides an invaluable new tool for biological research. The developed molecular markers are the basis for future genetic linkage and quantitative trait loci analysis, and will be essential to speed up the process of breeding new and better adapted squash varieties. PMID:21310031
Transcriptomes of Arbuscular Mycorrhizal Fungi and Litchi Host Interaction after Tree Girdling.
Shu, Bo; Li, Weicai; Liu, Liqin; Wei, Yongzan; Shi, Shengyou
2016-01-01
Trunk girdling can increase carbohydrate content above the girdling site and is an important strategy for inhibiting new shoot growth to promote flowering in cultivated litchi (Litchi chinensis Sonn.). However, girdling inhibits carbohydrate transport to the root in nearly all of the fruit development periods and consequently decreases root absorption. The mechanism through which carbohydrates regulate root development in arbuscular mycorrhiza (AM) remains largely unknown. Carbohydrate content, AM colonization, and transcriptome in the roots were analyzed to elucidate the interaction between host litchi and AM fungi when carbohydrate content decreases. Girdling decreased glucose, fructose, sucrose, quebrachitol, and starch contents in the litchi mycorrhizal roots, thereby reducing AM colonization. RNA-seq achieved approximately 60 million reads of each sample, with an average length of reads reaching 100 bp. Assembly of all the reads of the 30 samples produced 671,316 transcripts and 381,429 unigenes, with average lengths of 780 and 643 bp, respectively. Litchi (54,100 unigenes) and AM fungi unigenes (33,120 unigenes) were achieved through sequence annotation during decreased carbohydrate content. Analysis of differentially expressed genes (DEG) showed that flavonoids, alpha-linolenic acid, and linoleic acid are the main factors that regulate AM colonization in litchi. However, flavonoids may play a role in detecting the stage at which carbohydrate content decreases; alpha-linolenic acid or linoleic acid may affect AM formation under the adaptation process. Litchi trees stimulated the expression of defense-related genes and downregulated symbiosis signal-transduction genes to inhibit new AM colonization. Moreover, transcription factors of the AP2, ERF, Myb, WRKY, bHLH families, and lectin genes altered maintenance of litchi mycorrhizal roots in the post-symbiotic stage for carbohydrate starvation. Similar to those of the litchi host, the E3 ubiquitin ligase complex SCF subunit scon-3 and polyubiquitin of AM fungi were upregulated at the perceived stages. This occurrence suggested that ubiquitination plays an important role in perceiving carbohydrate decrease in AM fungi. The transcription of cytochrome b-245 and leucine-rich repeat was detected in the DEG database, implying that the transcripts were involved in AM fungal adaptation under carbohydrate starvation. The transcriptome data might suggest novel functions of unigenes in carbohydrate shortage of mycorrhizal roots.
2011-01-01
Background Tea is one of the most popular non-alcoholic beverages worldwide. However, the tea plant, Camellia sinensis, is difficult to culture in vitro, to transform, and has a large genome, rendering little genomic information available. Recent advances in large-scale RNA sequencing (RNA-seq) provide a fast, cost-effective, and reliable approach to generate large expression datasets for functional genomic analysis, which is especially suitable for non-model species with un-sequenced genomes. Results Using high-throughput Illumina RNA-seq, the transcriptome from poly (A)+ RNA of C. sinensis was analyzed at an unprecedented depth (2.59 gigabase pairs). Approximate 34.5 million reads were obtained, trimmed, and assembled into 127,094 unigenes, with an average length of 355 bp and an N50 of 506 bp, which consisted of 788 contig clusters and 126,306 singletons. This number of unigenes was 10-fold higher than existing C. sinensis sequences deposited in GenBank (as of August 2010). Sequence similarity analyses against six public databases (Uniprot, NR and COGs at NCBI, Pfam, InterPro and KEGG) found 55,088 unigenes that could be annotated with gene descriptions, conserved protein domains, or gene ontology terms. Some of the unigenes were assigned to putative metabolic pathways. Targeted searches using these annotations identified the majority of genes associated with several primary metabolic pathways and natural product pathways that are important to tea quality, such as flavonoid, theanine and caffeine biosynthesis pathways. Novel candidate genes of these secondary pathways were discovered. Comparisons with four previously prepared cDNA libraries revealed that this transcriptome dataset has both a high degree of consistency with previous EST data and an approximate 20 times increase in coverage. Thirteen unigenes related to theanine and flavonoid synthesis were validated. Their expression patterns in different organs of the tea plant were analyzed by RT-PCR and quantitative real time PCR (qRT-PCR). Conclusions An extensive transcriptome dataset has been obtained from the deep sequencing of tea plant. The coverage of the transcriptome is comprehensive enough to discover all known genes of several major metabolic pathways. This transcriptome dataset can serve as an important public information platform for gene expression, genomics, and functional genomic studies in C. sinensis. PMID:21356090
Comparative genomics reveals conservative evolution of the xylem transcriptome in vascular plants.
Li, Xinguo; Wu, Harry X; Southerton, Simon G
2010-06-21
Wood is a valuable natural resource and a major carbon sink. Wood formation is an important developmental process in vascular plants which played a crucial role in plant evolution. Although genes involved in xylem formation have been investigated, the molecular mechanisms of xylem evolution are not well understood. We use comparative genomics to examine evolution of the xylem transcriptome to gain insights into xylem evolution. The xylem transcriptome is highly conserved in conifers, but considerably divergent in angiosperms. The functional domains of genes in the xylem transcriptome are moderately to highly conserved in vascular plants, suggesting the existence of a common ancestral xylem transcriptome. Compared to the total transcriptome derived from a range of tissues, the xylem transcriptome is relatively conserved in vascular plants. Of the xylem transcriptome, cell wall genes, ancestral xylem genes, known proteins and transcription factors are relatively more conserved in vascular plants. A total of 527 putative xylem orthologs were identified, which are unevenly distributed across the Arabidopsis chromosomes with eight hot spots observed. Phylogenetic analysis revealed that evolution of the xylem transcriptome has paralleled plant evolution. We also identified 274 conifer-specific xylem unigenes, all of which are of unknown function. These xylem orthologs and conifer-specific unigenes are likely to have played a crucial role in xylem evolution. Conifers have highly conserved xylem transcriptomes, while angiosperm xylem transcriptomes are relatively diversified. Vascular plants share a common ancestral xylem transcriptome. The xylem transcriptomes of vascular plants are more conserved than the total transcriptomes. Evolution of the xylem transcriptome has largely followed the trend of plant evolution.
Groves, Ryan A.; Hagel, Jillian M.; Zhang, Ye; Kilpatrick, Korey; Levy, Asaf; Marsolais, Frédéric; Lewinsohn, Efraim; Sensen, Christoph W.; Facchini, Peter J.
2015-01-01
Amphetamine analogues are produced by plants in the genus Ephedra and by khat (Catha edulis), and include the widely used decongestants and appetite suppressants (1S,2S)-pseudoephedrine and (1R,2S)-ephedrine. The production of these metabolites, which derive from L-phenylalanine, involves a multi-step pathway partially mapped out at the biochemical level using knowledge of benzoic acid metabolism established in other plants, and direct evidence using khat and Ephedra species as model systems. Despite the commercial importance of amphetamine-type alkaloids, only a single step in their biosynthesis has been elucidated at the molecular level. We have employed Illumina next-generation sequencing technology, paired with Trinity and Velvet-Oases assembly platforms, to establish data-mining frameworks for Ephedra sinica and khat plants. Sequence libraries representing a combined 200,000 unigenes were subjected to an annotation pipeline involving direct searches against public databases. Annotations included the assignment of Gene Ontology (GO) terms used to allocate unigenes to functional categories. As part of our functional genomics program aimed at novel gene discovery, the databases were mined for enzyme candidates putatively involved in alkaloid biosynthesis. Queries used for mining included enzymes with established roles in benzoic acid metabolism, as well as enzymes catalyzing reactions similar to those predicted for amphetamine alkaloid metabolism. Gene candidates were evaluated based on phylogenetic relationships, FPKM-based expression data, and mechanistic considerations. Establishment of expansive sequence resources is a critical step toward pathway characterization, a goal with both academic and industrial implications. PMID:25806807
Comparative genomics reveals conservative evolution of the xylem transcriptome in vascular plants
2010-01-01
Background Wood is a valuable natural resource and a major carbon sink. Wood formation is an important developmental process in vascular plants which played a crucial role in plant evolution. Although genes involved in xylem formation have been investigated, the molecular mechanisms of xylem evolution are not well understood. We use comparative genomics to examine evolution of the xylem transcriptome to gain insights into xylem evolution. Results The xylem transcriptome is highly conserved in conifers, but considerably divergent in angiosperms. The functional domains of genes in the xylem transcriptome are moderately to highly conserved in vascular plants, suggesting the existence of a common ancestral xylem transcriptome. Compared to the total transcriptome derived from a range of tissues, the xylem transcriptome is relatively conserved in vascular plants. Of the xylem transcriptome, cell wall genes, ancestral xylem genes, known proteins and transcription factors are relatively more conserved in vascular plants. A total of 527 putative xylem orthologs were identified, which are unevenly distributed across the Arabidopsis chromosomes with eight hot spots observed. Phylogenetic analysis revealed that evolution of the xylem transcriptome has paralleled plant evolution. We also identified 274 conifer-specific xylem unigenes, all of which are of unknown function. These xylem orthologs and conifer-specific unigenes are likely to have played a crucial role in xylem evolution. Conclusions Conifers have highly conserved xylem transcriptomes, while angiosperm xylem transcriptomes are relatively diversified. Vascular plants share a common ancestral xylem transcriptome. The xylem transcriptomes of vascular plants are more conserved than the total transcriptomes. Evolution of the xylem transcriptome has largely followed the trend of plant evolution. PMID:20565927
Behura, Susanta K; Severson, David W
2015-02-01
We present a detailed genome-wide comparative study of motif mismatches of microsatellites among 20 insect species representing five taxonomic orders. The results show that varying proportions (∼15-46%) of microsatellites identified in these species are imperfect in motif structure, and that they also vary in chromosomal distribution within genomes. It was observed that the genomic abundance of imperfect repeats is significantly associated with the length and number of motif mismatches of microsatellites. Furthermore, microsatellites with a higher number of mismatches tend to have lower abundance in the genome, suggesting that sequence heterogeneity of repeat motifs is a key determinant of genomic abundance of microsatellites. This relationship seems to be a general feature of microsatellites even in unrelated species such as yeast, roundworm, mouse and human. We provide a mechanistic explanation of the evolutionary link between motif heterogeneity and genomic abundance of microsatellites by examining the patterns of motif mismatches and allele sequences of single-nucleotide polymorphisms identified within microsatellite loci. Using Drosophila Reference Genetic Panel data, we further show that pattern of allelic variation modulates motif heterogeneity of microsatellites, and provide estimates of allele age of specific imperfect microsatellites found within protein-coding genes. © The Author 2014. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
NASA Astrophysics Data System (ADS)
Han, Zhaofang; Xiao, Shijun; Liu, Xiande; Liu, Yang; Li, Jiakai; Xie, Yangjie; Wang, Zhiyong
2017-03-01
The large yellow croaker, Larimichthys crocea is an important marine fish in China with a high economic value. In the last decade, the stock conservation and aquaculture industry of this species have been facing severe challenges because of wild population collapse and degeneration of important economic traits. However, genes contributing to growth and immunity in L. crocea have not been thoroughly analyzed, and available molecular markers are still not sufficient for genetic resource management and molecular selection. In this work, we sequenced the transcriptome in L. crocea liver tissue with a Roche 454 sequencing platform and assembled the transcriptome into 93 801 transcripts. Of them, 38 856 transcripts were successfully annotated in nt, nr, Swiss-Prot, InterPro, COG, GO and KEGG databases. Based on the annotation information, 3 165 unigenes related to growth and immunity were identified. Additionally, a total of 6 391 simple sequence repeats (SSRs) were identified from the transcriptome, among which 4 498 SSRs had enough flanking regions to design primers for polymerase chain reactions (PCR). To access the polymorphism of these markers, 30 primer pairs were randomly selected for PCR amplification and validation in 30 individuals, and 12 primer pairs (40.0%) exhibited obvious length polymorphisms. This work applied RNA-Seq to assemble and analyze a live transcriptome in L. crocea. With gene annotation and sequence information, genes related to growth and immunity were identified and massive SSR markers were developed, providing valuable genetic resources for future gene functional analysis and selective breeding of L. crocea.
Massa, Sónia I; Pearson, Gareth A; Aires, Tânia; Kube, Michael; Olsen, Jeanine L; Reinhardt, Richard; Serrão, Ester A; Arnaud-Haond, Sophie
2011-09-01
Predicted global climate change threatens the distributional ranges of species worldwide. We identified genes expressed in the intertidal seagrass Zostera noltii during recovery from a simulated low tide heat-shock exposure. Five Expressed Sequence Tag (EST) libraries were compared, corresponding to four recovery times following sub-lethal temperature stress, and a non-stressed control. We sequenced and analyzed 7009 sequence reads from 30min, 2h, 4h and 24h after the beginning of the heat-shock (AHS), and 1585 from the control library, for a total of 8594 sequence reads. Among 51 Tentative UniGenes (TUGs) exhibiting significantly different expression between libraries, 19 (37.3%) were identified as 'molecular chaperones' and were over-expressed following heat-shock, while 12 (23.5%) were 'photosynthesis TUGs' generally under-expressed in heat-shocked plants. A time course analysis of expression showed a rapid increase in expression of the molecular chaperone class, most of which were heat-shock proteins; which increased from 2 sequence reads in the control library to almost 230 in the 30min AHS library, followed by a slow decrease during further recovery. In contrast, 'photosynthesis TUGs' were under-expressed 30min AHS compared with the control library, and declined progressively with recovery time in the stress libraries, with a total of 29 sequence reads 24h AHS, compared with 125 in the control. A total of 4734 TUGs were screened for EST-Single Sequence Repeats (EST-SSRs) and 86 microsatellites were identified. Copyright © 2011 Elsevier B.V. All rights reserved.
Jiang, Yanliang; Feng, Shuaisheng; Xu, Jian; Zhang, Songhao; Li, Shangqi; Sun, Xiaoqing; Xu, Peng
2016-10-01
Aerial breathing in fish was an important adaption for successful survival in hypoxic water. All aerial breathing fish are bimodal breathers. It is intriguing that they can obtain oxygen from both air and water. However, the genetic basis underlying bimodal breathing has not been extensively studied. In this study, we performed next-generation sequencing on a bimodal breathing fish, the Northern snakehead, Channa argus, and generated a transcriptome profiling of C. argus. A total of 53,591 microsatellites and 26,378 SNPs were identified and classified. A Ka/Ks analysis of the unigenes indicated that 63 genes were under strong positive selection. Furthermore, the transcriptomes from the aquatic breathing organ (gill) and the aerial breathing organ (suprabranchial chamber) were sequenced and compared, and the results showed 1,966 genes up-regulated in the gill and 2,727 genes up-regulated in the suprabranchial chamber. A gene pathway analysis concluded that four functional categories were significant, of which angiogenesis and elastic fibre formation were up-regulated in the suprabranchial chamber, indicating that the aerial breathing organ may be more efficient for gas exchange due to its highly vascularized and elastic structure. In contrast, ion uptake and transport and acid-base balance were up-regulated in the gill, indicating that the aquatic breathing organ functions in ion homeostasis and acid-base balance, in addition to breathing. Understanding the genetic mechanism underlying bimodal breathing will shed light on the initiation and importance of aerial breathing in the evolution of vertebrates. Copyright © 2016 Elsevier B.V. All rights reserved.
Barker, F. Keith; Oyler-McCance, Sara; Tomback, Diana F.
2015-01-01
Next generation sequencing methods allow rapid, economical accumulation of data that have many applications, even at relatively low levels of genome coverage. However, the utility of shotgun sequencing data sets for specific goals may vary depending on the biological nature of the samples sequenced. We show that the ability to assemble mitogenomes from three avian samples of two different tissue types varies widely. In particular, data with coverage typical of microsatellite development efforts (∼1×) from DNA extracted from avian blood failed to cover even 50% of the mitogenome, relative to at least 500-fold coverage from muscle-derived data. Researchers should consider possible applications of their data and select the tissue source for their work accordingly. Practitioners analyzing low-coverage shotgun sequencing data (including for microsatellite locus development) should consider the potential benefits of mitogenome assembly, including internal barcode verification of species identity, mitochondrial primer development, and phylogenetics.
An autosomal genetic linkage map of the sheep genome
DOE Office of Scientific and Technical Information (OSTI.GOV)
Crawford, A.M.; Ede, A.J.; Pierson, C.A.
1995-06-01
We report the first extensive ovine genetic linkage map covering 2070 cM of the sheep genome. The map was generated from the linkage analysis of 246 polymorphic markers, in nine three-generation full-sib pedigrees, which make up the AgResearch International Mapping Flock. We have exploited many markers from cattle so that valuable comparisons between these two ruminant linkage maps can be made. The markers, used in the segregation analyses, comprised 86 anonymous microsatellite markers derived from the sheep genome, 126 anonymous microsatellites from cattle, one from deer, and 33 polymorphic markers of various types associated with known genes. The maximum numbermore » of informative meioses within the mapping flock was 22. The average number of informative meioses per marker was 140 (range 18-209). Linkage groups have been assigned to all 26 sheep autosomes. 102 refs., 8 figs., 5 tabs.« less
[Genetic diversity of microsatellite loci in captive Amur tigers].
Zhang, Yu-Gaung; Li, Di-Qiang; Xiao, Qi-Ming; Rao, Li-Qun; Zhang, Xue-Wen
2004-09-01
The tiger is one of the most threatened wildlife species since the abundance and distribution of tiger have decreased dramatically in the last century. The wild Amur tiger (Panthera tigris altaica) only distributed in northeast China, the far east area of Russia and the north Korea and its size of wild population is about 450 in the world and 20 in China. Several hundred captive populations of Amur tigers are the main source to protect gene library of tiger and the source of recovering the wild populations. The Breeding Center for Felidae at Hengdaohezi and Haoerbin Tiger Park in Heilongjiang Province is the biggest captive breeding base in China. How to make clear the genetic pedigree and establish reasonable breeding system is the urgent issues. So we use the microsatellite DNA markers and non-invasive technology to research on the genetic diversity of captive Amur tiger in this study. Ten microsatellite loci (Fca005, Fca075, Fca094, Fca152, Fca161, Fca294, Pti002, Pti003, Pti007 and Pti010), highly variable nuclear markers, were studied their genetic diversity in 113 captive Amur tigers. The PCR amplified products of microsatellite loci were detected by non-denatured polyacrylamide gel electrophoresis. Allele numbers, allelic frequency, gene heterozygosity(H(e)), polymorphism information content(PIC) and effective number of allele(N(e)) were calculated. 41 alleles were found and their size were ranged from 110bp to 250bp in ten microsatellite loci, Fca152 had 6 alleles, Fca075, Fca094 and Fca294 had 5 alleles, Fca005 and Pti002 had 4 alleles and the others had 3 alleles in all tiger samples, respectively. The allelic frequencies were from 0.009 to 0.767; The He ranged from 0.385 to 0.707, and Fca294 and Pti010 locus had the highest and lowest value; the PIC were from 0.353 to 0.658, Fca294 and Pti010 locus had the highest and lowest value; and N(e) were from 1.626 to 3.409, Fca294 and Pti010 locus had the highest and lowest value, which showed the ten microsatellie loci had high or medium polymorphism in these Amur tigers and had high genetic diversity. At the same time, we only found even bases variability which showed the even bases repeat sequence (CA/GT) maybe the basic unit for length variability of microsatellite in all loci. In this study, the samples were made up of 75 hair specimens, 23 blood specimens and 15 tissue specimens, we obtained the genome DNA from hairs using the non-invasive DNA technology and demonstrated that DNA derived from hair samples is as good as that obtained from blood samples for the analysis of microsatellite polymorphism. These results imply that microsatellite DNA markers and non-invasive DNA technology can help study the genetic diversity of Amur tiger. This method could be used in the captive management of other endangered species.
Y.A. Kapetanakos; I.J. Lovette; T.E. Katzner
2014-01-01
Southeast Asian vultures have been greatly reduced in range and population numbers, but it is challenging to use traditional tagging and monitoring techniques to track changes in their populations. Genotypes derived from non-invasively collected feather samples provide an alternative and effective means to 'capture' individual vultures for mark-recapture...
Molecular Population Genetics of the Northern Elephant Seal Mirounga angustirostris.
Abadía-Cardoso, Alicia; Freimer, Nelson B; Deiner, Kristy; Garza, John Carlos
2017-09-01
The northern elephant seal, Mirounga angustirostris, was heavily hunted and declared extinct in the 19th century. However, a colony remained on remote Guadalupe Island, Mexico and the species has since repopulated most of its historical distribution. Here, we present a comprehensive evaluation of genetic variation in the species. First, we assess the effect of the demographic bottleneck on microsatellite variability and compare it with that found in other pinnipeds, demonstrating levels of variation similar to that in species that continue to be threatened with extinction. Next, we use sequence data from these markers to demonstrate that some of the limited polymorphism predates the bottleneck. However, most contemporary variation appears to have arisen recently and persisted due to exponential growth. We also describe how we use the range in allele size of microsatellites to estimate ancestral effective population size before the bottleneck, demonstrating a large reduction in effective size. We then employ a classical method for bacteria to estimate the microsatellite mutation rate in the species, deriving an estimate that is extremely similar to that estimated for a similar set of loci in humans, indicating consistency of microsatellite mutation rates in mammals. Finally, we find slight significant structure between some geographically separated colonies, although its biological significance is unclear. This work demonstrates that genetic analysis can be useful for evaluating the population biology of the northern elephant seal, in spite of the bottleneck that removed most genetic variation from the species. Published by Oxford University Press on behalf of The American Genetic Association 2017. This work is written by (a) US Government employee(s) and is in the public domain in the US.
Tokarskaya, O N; Kan, N G; Petrosyan, V G; Martirosyan, I A; Grechko, V V; Danielyan, F D; Darevsky, I S; Ryskov, A P
2001-07-01
Multilocus DNA fingerprinting has been used to study the variability of some mini- and microsatellite sequences in parthenogenetic species of Caucasian rock lizards of the genus Lacerta (L. dahli, L. armeniaca and L. unisexualis). We demonstrate that these clonally reproducing lizards possess species-specific DNA fingerprints with a low degree of intra- and interpopulation variation. Mean indices of similarity obtained using M13 DNA, (GACA)4 and (TCC)50 as probes were 0.962 and 0.966 in L. dahli and L. armeniaca, respectively. The mean index of similarity obtained using M 13 and GATA probes in L. unisexualis was estimated to be 0.95. However, despite the high degree of band-sharing, variable DNA fragments were revealed in all populations with the microsatellite probes. An particularly high level of variability was observed for (TCC)n microsatellites in populations of L. unisexualis. In fact TCC-derived DNA fingerprints were close to being individual-specific, with a mean index of similarity of 0.824. Fingerprint analysis of parthenogenetic families of L. armeniaca showed that all maternal fragments were inherited together by the progeny, and no differences in fingerprint patterns were observed. On the other hand, while identical DNA fingerprints were obtained from L. unisexualis families with M13 and (GATA)4 probes, use of the (TCC)50 probe revealed remarkable intrafamily variation in this species. It is assumed that the genetic heterogeneity observed in parthenogenetic populations may be explained, at least in part, by the existence of genetically unstable microsatellite loci. Our data serve to illustrate processes of spontaneous mutagenesis and the initial stages of clonal differentiation in natural populations of the lizard species studied.
Plechakova, Olga; Tranchant-Dubreuil, Christine; Benedet, Fabrice; Couderc, Marie; Tinaut, Alexandra; Viader, Véronique; De Block, Petra; Hamon, Perla; Campa, Claudine; de Kochko, Alexandre; Hamon, Serge; Poncet, Valérie
2009-01-01
Background In the past few years, functional genomics information has been rapidly accumulating on Rubiaceae species and especially on those belonging to the Coffea genus (coffee trees). An increasing number of expressed sequence tag (EST) data and EST- or genomic-derived microsatellite markers have been generated, together with Conserved Ortholog Set (COS) markers. This considerably facilitates comparative genomics or map-based genetic studies through the common use of orthologous loci across different species. Similar genomic information is available for e.g. tomato or potato, members of the Solanaceae family. Since both Rubiaceae and Solanaceae belong to the Euasterids I (lamiids) integration of information on genetic markers would be possible and lead to more efficient analyses and discovery of key loci involved in important traits such as fruit development, quality, and maturation, or adaptation. Our goal was to develop a comprehensive web data source for integrated information on validated orthologous markers in Rubiaceae. Description MoccaDB is an online MySQL-PHP driven relational database that houses annotated and/or mapped microsatellite markers in Rubiaceae. In its current release, the database stores 638 markers that have been defined on 259 ESTs and 379 genomic sequences. Marker information was retrieved from 11 published works, and completed with original data on 132 microsatellite markers validated in our laboratory. DNA sequences were derived from three Coffea species/hybrids. Microsatellite markers were checked for similarity, in vitro tested for cross-amplification and diversity/polymorphism status in up to 38 Rubiaceae species belonging to the Cinchonoideae and Rubioideae subfamilies. Functional annotation was provided and some markers associated with described metabolic pathways were also integrated. Users can search the database for marker, sequence, map or diversity information through multi-option query forms. The retrieved data can be browsed and downloaded, along with protocols used, using a standard web browser. MoccaDB also integrates bioinformatics tools (CMap viewer and local BLAST) and hyperlinks to related external data sources (NCBI GenBank and PubMed, SOL Genomic Network database). Conclusion We believe that MoccaDB will be extremely useful for all researchers working in the areas of comparative and functional genomics and molecular evolution, in general, and population analysis and association mapping of Rubiaceae and Solanaceae species, in particular. PMID:19788737
Detecting microsatellites within genomes: significant variation among algorithms.
Leclercq, Sébastien; Rivals, Eric; Jarne, Philippe
2007-04-18
Microsatellites are short, tandemly-repeated DNA sequences which are widely distributed among genomes. Their structure, role and evolution can be analyzed based on exhaustive extraction from sequenced genomes. Several dedicated algorithms have been developed for this purpose. Here, we compared the detection efficiency of five of them (TRF, Mreps, Sputnik, STAR, and RepeatMasker). Our analysis was first conducted on the human X chromosome, and microsatellite distributions were characterized by microsatellite number, length, and divergence from a pure motif. The algorithms work with user-defined parameters, and we demonstrate that the parameter values chosen can strongly influence microsatellite distributions. The five algorithms were then compared by fixing parameters settings, and the analysis was extended to three other genomes (Saccharomyces cerevisiae, Neurospora crassa and Drosophila melanogaster) spanning a wide range of size and structure. Significant differences for all characteristics of microsatellites were observed among algorithms, but not among genomes, for both perfect and imperfect microsatellites. Striking differences were detected for short microsatellites (below 20 bp), regardless of motif. Since the algorithm used strongly influences empirical distributions, studies analyzing microsatellite evolution based on a comparison between empirical and theoretical size distributions should therefore be considered with caution. We also discuss why a typological definition of microsatellites limits our capacity to capture their genomic distributions.
Detecting microsatellites within genomes: significant variation among algorithms
Leclercq, Sébastien; Rivals, Eric; Jarne, Philippe
2007-01-01
Background Microsatellites are short, tandemly-repeated DNA sequences which are widely distributed among genomes. Their structure, role and evolution can be analyzed based on exhaustive extraction from sequenced genomes. Several dedicated algorithms have been developed for this purpose. Here, we compared the detection efficiency of five of them (TRF, Mreps, Sputnik, STAR, and RepeatMasker). Results Our analysis was first conducted on the human X chromosome, and microsatellite distributions were characterized by microsatellite number, length, and divergence from a pure motif. The algorithms work with user-defined parameters, and we demonstrate that the parameter values chosen can strongly influence microsatellite distributions. The five algorithms were then compared by fixing parameters settings, and the analysis was extended to three other genomes (Saccharomyces cerevisiae, Neurospora crassa and Drosophila melanogaster) spanning a wide range of size and structure. Significant differences for all characteristics of microsatellites were observed among algorithms, but not among genomes, for both perfect and imperfect microsatellites. Striking differences were detected for short microsatellites (below 20 bp), regardless of motif. Conclusion Since the algorithm used strongly influences empirical distributions, studies analyzing microsatellite evolution based on a comparison between empirical and theoretical size distributions should therefore be considered with caution. We also discuss why a typological definition of microsatellites limits our capacity to capture their genomic distributions. PMID:17442102
Undermethylated DNA as a source of microsatellites from a conifer genome.
Zhou, Y; Bui, T; Auckland, L D; Williams, C G
2002-02-01
Developing microsatellites from the large, highly duplicated conifer genome requires special tools. To improve the efficiency of developing Pinus taeda L. microsatellites, undermethylated (UM) DNA fragments were used to construct a microsatellite-enriched copy library. A methylation-sensitive restriction enzyme, McrBC, was used to enrich for UM DNA before library construction. Digested DNA fragments larger than 9 kb were then excised and digested with RsaI and used to construct nine dinucleotide and trinucleotide libraries. A total of 1016 microsatellite-positive clones were detected among 11 904 clones and 620 of these were unique. Of 245 primer sets that produced a PCR product, 113 could be developed as UM microsatellite markers and 70 were polymorphic. Inheritance and marker informativeness were tested for a random sample of 36 polymorphic markers using a three-generation outbred pedigree. Thirty-one microsatellites (86%) had single-locus inheritance despite the highly duplicated nature of the P. taeda genome. Nineteen UM microsatellites had highly informative intercross mating type configurations. Allele number and frequency were estimated for eleven UM microsatellites using a population survey. Allele numbers for these UM microsatellites ranged from 3 to 12 with an average of 5.7 alleles/locus. Frequencies for the 63 alleles were mostly in the low-common range; only 14 of the 63 were in the rare allele (q < 0.05) class. Enriching for UM DNA was an efficient method for developing polymorphic microsatellites from a large plant genome.
Microsatellites in the Genome of the Edible Mushroom, Volvariella volvacea
Chen, Mingjie; Wang, Hong; Bao, Dapeng
2014-01-01
Using bioinformatics software and database, we have characterized the microsatellite pattern in the V. volvacea genome and compared it with microsatellite patterns found in the genomes of four other edible fungi: Coprinopsis cinerea, Schizophyllum commune, Agaricus bisporus, and Pleurotus ostreatus. A total of 1346 microsatellites have been identified, with mono-nucleotides being the most frequent motif. The relative abundance of microsatellites was lower in coding regions with 21 No./Mb. However, the microsatellites in the V. volvacea gene models showed a greater tendency to be located in the CDS regions. There was also a higher preponderance of trinucleotide repeats, especially in the kinase genes, which implied a possible role in phenotypic variation. Among the five fungal genomes, microsatellite abundance appeared to be unrelated to genome size. Furthermore, the short motifs (mono- to tri-nucleotides) outnumbered other categories although these differed in proportion. Data analysis indicated a possible relationship between the most frequent microsatellite types and the genetic distance between the five fungal genomes. PMID:24575404
Microsatellites in the genome of the edible mushroom, Volvariella volvacea.
Wang, Ying; Chen, Mingjie; Wang, Hong; Wang, Jing-Fang; Bao, Dapeng
2014-01-01
Using bioinformatics software and database, we have characterized the microsatellite pattern in the V. volvacea genome and compared it with microsatellite patterns found in the genomes of four other edible fungi: Coprinopsis cinerea, Schizophyllum commune, Agaricus bisporus, and Pleurotus ostreatus. A total of 1346 microsatellites have been identified, with mono-nucleotides being the most frequent motif. The relative abundance of microsatellites was lower in coding regions with 21 No./Mb. However, the microsatellites in the V. volvacea gene models showed a greater tendency to be located in the CDS regions. There was also a higher preponderance of trinucleotide repeats, especially in the kinase genes, which implied a possible role in phenotypic variation. Among the five fungal genomes, microsatellite abundance appeared to be unrelated to genome size. Furthermore, the short motifs (mono- to tri-nucleotides) outnumbered other categories although these differed in proportion. Data analysis indicated a possible relationship between the most frequent microsatellite types and the genetic distance between the five fungal genomes.
Microsatellite diversity of isolates of the parasitic nematode Haemonchus contortus.
Otsen, M; Plas, M E; Lenstra, J A; Roos, M H; Hoekstra, R
2000-09-01
The alarming development of anthelmintic resistance in important gastrointestinal nematode parasites of man and live-stock is caused by selection for specific genotypes. In order to provide genetic tools to study the nematode populations and the consequences of anthelmintic treatment, we isolated and sequenced 59 microsatellites of the sheep and goat parasite Haemonchus contortus. These microsatellites consist typically of 2-10 tandems CA/GT repeats that are interrupted by sequences of 1-10 bp. A predominant cause of the imperfect structure of the microsatellites appeared mutations of G/C bp in the tandem repeat. About 44% of the microsatellites were associated with the HcREP1 direct repeat, and it was demonstrated that a generic HcREP1 primer could be used to amplify HcREP1-associated microsatellites. Thirty microsatellites could be typed by polymerase chain reaction (PCR) of which 27 were polymorphic. A number of these markers were used to detect genetic contamination of an experimental inbred population. The microsatellites may also contribute to the genetic mapping of drug resistance genes.
2011-01-01
derived from killer whales (Orcinus orca; Hoelzel et al. 1998a); loci D5 and D12 from beluga whales ( Delphinapterus leucas ; Buchanan et al. 1996); and... Delphinapterus leucas . Molecular Ecology 5:571–575. Carretta, J. V., K. A. Forney, M. S. Lowry, et al. 2009. U.S. Pacific marine mammal stock assessments
Ismael, Nour El Hoda S; El Sheikh, Samar A; Talaat, Suzan M; Salem, Eman M
2017-03-15
Colorectal cancer (CRC) is one of the most common cancers worldwide. Microsatellite instability (MSI) is detected in about 15% of all colorectal cancers. CRC with MSI has particular characteristics such as improved survival rates and better prognosis. They also have a distinct sensitivity to the action of chemotherapy. The aim of the study was to detect microsatellite instability in a cohort of colorectal cancer Egyptian patients using the immunohistochemical expression of mismatch repair proteins (MLH1, MSH2, MSH6 and PMS2). Cases were divided into Microsatellite stable (MSS), Microsatellite unstable low (MSI-L) and Microsatellite unstable high (MSI-H). This Microsatellite stability status was correlated with different clinicopathological parameters. There was a statistically significant correlation between the age of cases, tumor site & grade and the microsatellite stability status. There was no statistically significant correlation between the gender of patients, tumor subtype, stage, mucoid change, necrosis, tumor borders, lymphocytic response, lymphovascular emboli and the microsatellite stability status. Testing for MSI should be done for all colorectal cancer patients, especially those younger than 50 years old, right sided and high-grade CRCs.
Song, Huwei; Zhao, Xiangxiang; Hu, Weicheng; Wang, Xinfeng; Shen, Ting; Yang, Liming
2016-11-04
Loquat ( Eriobotrya japonica Lindl.) is an important non-climacteric fruit and rich in essential nutrients such as minerals and carotenoids. During fruit development and ripening, thousands of the differentially expressed genes (DEGs) from various metabolic pathways cause a series of physiological and biochemical changes. To better understand the underlying mechanism of fruit development, the Solexa/Illumina RNA-seq high-throughput sequencing was used to evaluate the global changes of gene transcription levels. More than 51,610,234 high quality reads from ten runs of fruit development were sequenced and assembled into 48,838 unigenes. Among 3256 DEGs, 2304 unigenes could be annotated to the Gene Ontology database. These DEGs were distributed into 119 pathways described in the Kyoto Encyclopedia of Genes and Genomes (KEGG) database. A large number of DEGs were involved in carbohydrate metabolism, hormone signaling, and cell-wall degradation. The real-time reverse transcription (qRT)-PCR analyses revealed that several genes related to cell expansion, auxin signaling and ethylene response were differentially expressed during fruit development. Other members of transcription factor families were also identified. There were 952 DEGs considered as novel genes with no annotation in any databases. These unigenes will serve as an invaluable genetic resource for loquat molecular breeding and postharvest storage.
Salt-Responsive Transcriptome Profiling of Suaeda glauca via RNA Sequencing
Jin, Hangxia; Dong, Dekun; Yang, Qinghua; Zhu, Danhua
2016-01-01
Background Suaeda glauca, a succulent halophyte of the Chenopodiaceae family, is widely distributed in coastal areas of China. Suaeda glauca is highly resistant to salt and alkali stresses. In the present study, the salt-responsive transcriptome of Suaeda glauca was analyzed to identify genes involved in salt tolerance and study halophilic mechanisms in this halophyte. Results Illumina HiSeq 2500 was used to sequence cDNA libraries from salt-treated and control samples with three replicates each treatment. De novo assembly of the six transcriptomes identified 75,445 unigenes. A total of 23,901 (31.68%) unigenes were annotated. Compared with transcriptomes from the three salt-treated and three salt-free samples, 231 differentially expressed genes (DEGs) were detected (including 130 up-regulated genes and 101 down-regulated genes), and 195 unigenes were functionally annotated. Based on the Gene Ontology (GO), Clusters of Orthologous Groups (COG) and Kyoto Encyclopedia of Genes and Genomes (KEGG) classifications of the DEGs, more attention should be paid to transcripts associated with signal transduction, transporters, the cell wall and growth, defense metabolism and transcription factors involved in salt tolerance. Conclusions This report provides a genome-wide transcriptional analysis of a halophyte, Suaeda glauca, under salt stress. Further studies of the genetic basis of salt tolerance in halophytes are warranted. PMID:26930632
Bian, Hai-Xu; Ma, Hong-Fang; Zheng, Xi-Xi; Peng, Ming-Hui; Li, Yu-Ping; Su, Jun-Fang; Wang, Huan; Li, Qun; Xia, Run-Xi; Liu, Yan-Qun; Jiang, Xing-Fu
2017-05-24
The oriental armyworm Mythimna separate is an economically important insect with a wide distribution and strong migratory activity. However, knowledge about the molecular mechanisms regulating the physiological and behavioural responses of the oriental armyworm is scarce. In the present study, we took a transcriptomic approach to characterize the gene network in the adult head of M. separate. The sequencing and de novo assembly yielded 63,499 transcripts, which were further assembled into 46,459 unigenes with an N50 of 1,153 bp. In the head transcriptome data, unigenes involved in the 'signal transduction mechanism' are the most abundant. In total, 937 signal transduction unigenes were assigned to 22 signalling pathways. The circadian clock, melanin synthesis, and non-receptor protein of olfactory gene families were then identified, and phylogenetic analyses were performed with these M. separate genes, the model insect Bombyx mori and other insects. Furthermore, 1,372 simple sequence repeats of 2-6 bp in unit length were identified. The transcriptome data represent a comprehensive molecular resource for the adult head of M. separate, and these identified genes can be valid targets for further gene function research to address the molecular mechanisms regulating the migratory and olfaction genes of the oriental armyworm.
Tran, Hung Bao; Lee, Yen-Hung; Guo, Jiin-Ju; Cheng, Ta-Chih
2018-06-01
Cobia, Rachycentron canadum, one of the most important aquatic species in Taiwan, has suffered heavy losses from Photobacterium damselae subsp. piscicida, which is the causal agent of photobacteriosis. In this study, the transcriptomic profiles of livers and spleens from Pdp-infected and non-infected cobia were obtained for the first time by Illumina-based paired-end sequencing method with a focus on immune-related genes. In total, 164,882 high quality unigenes were obtained in four libraries. Following Pdp infection, 7302 differentially expressed unigenes from liver and 8600 differentially expressed unigenes from spleen were identified. Twenty-seven of the differently expressed genes were further validated by RT-qPCR (average correlation coefficient 0.839, p-value <0.01). Results indicated a negative regulation of complement components and increased expression of genes involved in MyD88-independent pathway. Moreover, a remarkable finding was the increased expression of IL-10, implying an inadequacy of immune responses. This study not only characterized several putative immune pathways, but also provided a better understanding of the molecular responses to photobacteriosis in cobia. Copyright © 2018 Elsevier Ltd. All rights reserved.
Leyva-Pérez, María de la O; Valverde-Corredor, Antonio; Valderrama, Raquel; Jiménez-Ruiz, Jaime; Muñoz-Merida, Antonio; Trelles, Oswaldo; Barroso, Juan Bautista; Mercado-Blanco, Jesús; Luque, Francisco
2015-01-01
Low temperature severely affects plant growth and development. To overcome this constraint, several plant species from regions having a cool season have evolved an adaptive response, called cold acclimation. We have studied this response in olive tree (Olea europaea L.) cv. Picual. Biochemical stress markers and cold-stress symptoms were detected after the first 24 h as sagging leaves. After 5 days, the plants were found to have completely recovered. Control and cold-stressed plants were sequenced by Illumina HiSeq 1000 paired-end technique. We also assembled a new olive transcriptome comprising 157,799 unigenes and found 6,309 unigenes differentially expressed in response to cold. Three types of response that led to cold acclimation were found: short-term transient response, early long-term response, and late long-term response. These subsets of unigenes were related to different biological processes. Early responses involved many cold-stress-responsive genes coding for, among many other things, C-repeat binding factor transcription factors, fatty acid desaturases, wax synthesis, and oligosaccharide metabolism. After long-term exposure to cold, a large proportion of gene down-regulation was found, including photosynthesis and plant growth genes. Up-regulated genes after long-term cold exposure were related to organelle fusion, nucleus organization, and DNA integration, including retrotransposons. PMID:25324298
Wang, Yonglin; Xiong, Dianguang; Jiang, Ning; Li, Xuewu; Yang, Qiqing; Tian, Chengming
2016-01-01
Arceuthobium (dwarf mistletoes) are hemiparasites that may cause great damage to infected trees belonging to Pinaceae and Cupressaceae. Currently, dwarf mistletoe control involves the use of the ethylene-producing product ethephon (ETH), which acts by inducing dwarf mistletoe shoot abscission. However, the process by which ETH functions is mostly unknown. Therefore, the transcriptome of the ETH-exposed plants was compared to non-exposed controls to identify genes associated with the response to ethephon. In this study, the reference transcriptome was contained 120,316 annotated unigenes, with a total of 21,764 ETH-responsive differentially expressed unigenes were identified. These ETH-associated genes clustered into 20 distinctly expressed pattern groups, providing a view of molecular events with good spatial and temporal resolution. As expected, the greatest number of unigenes with changed expression were observed at the onset of abscission, suggesting induction by ethylene. ETH also affected genes associated with shoot abscission processes including hormone biosynthesis and signaling, cell wall hydrolysis and modification, lipid transference, and more. The comprehensive transcriptome data set provides a wealth of genomic resources for dwarf mistletoe communities and contributes to a better understanding of the molecular regulatory mechanism of ethylene-caused shoots abscission. PMID:27941945
Transcriptomes That Confer to Plant Defense against Powdery Mildew Disease in Lagerstroemia indica
Shi, Weibing; Rinehart, Timothy
2015-01-01
Transcriptome analysis was conducted in two popular Lagerstroemia cultivars: “Natchez” (NAT), a white flower and powdery mildew resistant interspecific hybrid and “Carolina Beauty” (CAB), a red flower and powdery mildew susceptible L. indica cultivar. RNA-seq reads were generated from Erysiphe australiana infected leaves and de novo assembled. A total of 37,035 unigenes from 224,443 assembled contigs in both genotypes were identified. Approximately 85% of these unigenes have known function. Of them, 475 KEGG genes were found significantly different between the two genotypes. Five of the top ten differentially expressed genes (DEGs) involved in the biosynthesis of secondary metabolites (plant defense) and four in flavonoid biosynthesis pathway (antioxidant activities or flower coloration). Furthermore, 5 of the 12 assembled unigenes in benzoxazinoid biosynthesis and 7 of 11 in flavonoid biosynthesis showed higher transcript abundance in NAT. The relative abundance of transcripts for 16 candidate DEGs (9 from CAB and 7 from NAT) detected by qRT-PCR showed general agreement with the abundances of the assembled transcripts in NAT. This study provided the first transcriptome analyses in L. indica. The differential transcript abundance between two genotypes indicates that it is possible to identify candidate genes that are associated with the plant defenses or flower coloration. PMID:26247009
Fan, R; Ling, P; Hao, C Y; Li, F P; Huang, L F; Wu, B D; Wu, H S
2015-10-19
Black pepper is a perennial climbing vine. It is widely cultivated because its berries can be utilized not only as a spice in food but also for medicinal use. This study aimed to construct a standardized, high-quality cDNA library to facilitated identification of new Piper hainanense transcripts. For this, 262 unigenes were used to generate raw reads. The average length of these 262 unigenes was 774.8 bp. Of these, 94 genes (35.9%) were newly identified, according to the NCBI protein database. Thus, identification of new genes may broaden the molecular knowledge of P. hainanense on the basis of Clusters of Orthologous Groups and Gene Ontology categories. In addition, certain basic genes linked to physiological processes, which can contribute to disease resistance and thereby to the breeding of black pepper. A total of 26 unigenes were found to be SSR markers. Dinucleotide SSR was the main repeat motif, accounting for 61.54%, followed by trinucleotide SSR (23.07%). Eight primer pairs successfully amplified DNA fragments and detected significant amounts of polymorphism among twenty-one piper germplasm. These results present a novel sequence information of P. hainanense, which can serve as the foundation for further genetic research on this species.
Li, Meng-Yao; Tan, Hua-Wei; Wang, Feng; Jiang, Qian; Xu, Zhi-Sheng; Tian, Chang; Xiong, Ai-Sheng
2014-01-01
Parsley is an important biennial Apiaceae species that is widely cultivated as herb, spice, and vegetable. Previous studies on parsley principally focused on its physiological and biochemical properties, including phenolic compound and volatile oil contents. However, little is known about the molecular and genetic properties of parsley. In this study, 23,686,707 high-quality reads were obtained and assembled into 81,852 transcripts and 50,161 unigenes for the first time. Functional annotation showed that 30,516 unigenes had sequence similarity to known genes. In addition, 3,244 putative simple sequence repeats were detected in curly parsley. Finally, 1,569 of the identified unigenes belonged to 58 transcription factor families. Various abiotic stresses have a strong detrimental effect on the yield and quality of parsley. AP2/ERF transcription factors have important functions in plant development, hormonal regulation, and abiotic response. A total of 88 putative AP2/ERF factors were identified from the transcriptome sequence of parsley. Seven AP2/ERF transcription factors were selected in this study to analyze the expression profiles of parsley under different abiotic stresses. Our data provide a potentially valuable resource that can be used for intensive parsley research.
Wang, Feng; Jiang, Qian; Xu, Zhi-Sheng; Tian, Chang; Xiong, Ai-Sheng
2014-01-01
Parsley is an important biennial Apiaceae species that is widely cultivated as herb, spice, and vegetable. Previous studies on parsley principally focused on its physiological and biochemical properties, including phenolic compound and volatile oil contents. However, little is known about the molecular and genetic properties of parsley. In this study, 23,686,707 high-quality reads were obtained and assembled into 81,852 transcripts and 50,161 unigenes for the first time. Functional annotation showed that 30,516 unigenes had sequence similarity to known genes. In addition, 3,244 putative simple sequence repeats were detected in curly parsley. Finally, 1,569 of the identified unigenes belonged to 58 transcription factor families. Various abiotic stresses have a strong detrimental effect on the yield and quality of parsley. AP2/ERF transcription factors have important functions in plant development, hormonal regulation, and abiotic response. A total of 88 putative AP2/ERF factors were identified from the transcriptome sequence of parsley. Seven AP2/ERF transcription factors were selected in this study to analyze the expression profiles of parsley under different abiotic stresses. Our data provide a potentially valuable resource that can be used for intensive parsley research. PMID:25268141
Zhou, Xiaoxu; Wang, Hongdi; Cui, Jun; Qiu, Xuemei; Chang, Yaqing; Wang, Xiuli
2016-12-01
Tube foot as one of the ambulacral appendages types in Aspidochirote holothurioids, is known for their functions in locomotion, feeding, chemoreception, light sensitivity and respiration. In this study, we explored the characteristic of transcriptome in the tube foot of sea cucumber (Apostichopus japonicus). Our results showed that among 390 unigenes which specifically expressed in the tube foot, 190 of them were annotated. Based on the assembly transcriptome, we found 219,860 SNPs from 34,749 unigenes, 97,683, 53,624, 27,767 and 40,786 were located in CDSs, 5'-UTRs, 3'-UTRs and non-CDS separately. Furthermore, 12,114 SSRs were detected from 7394 unigenes. Target genes of four specifically expressed miRNAs (miR-29a, miR-29b, miR-278-3p and miR-2005) in tube foot were also predicted based on the transcriptome, which contain immune-related factors (MBL, VLRA, AjC3, MyD88, CFB), skin pigmentation (MITF), candidate regeneration factor (TRP) and holothurians autolysis-related factor (CL). These results develop a relatively large number of molecular markers and transcriptome resources, and will provide a foundation for further analyses on the function and molecular mechanisms underlying A. japonicas tube foot. Copyright © 2016 Elsevier Inc. All rights reserved.
USDA-ARS?s Scientific Manuscript database
Microsatellite markers are highly variable and very commonly used in population genetics studies. However, microsatellite loci are typically poorly conserved and cannot be used in distant related species. Thus, development of clade-specific microsatellite markers would increase efficiency and allow ...
Kamali, Maryam; Sharakhova, Maria V; Baricheva, Elina; Karagodin, Dmitrii; Tu, Zhijian; Sharakhov, Igor V
2011-01-01
Anopheles stephensi is one of the major vectors of malaria in the Middle East and Indo-Pakistan subcontinent. Understanding the population genetic structure of malaria mosquitoes is important for developing adequate and successful vector control strategies. Commonly used markers for inferring anopheline taxonomic and population status include microsatellites and chromosomal inversions. Knowledge about chromosomal locations of microsatellite markers with respect to polymorphic inversions could be useful for better understanding a genetic structure of natural populations. However, fragments with microsatellites used in population genetic studies are usually too short for successful labeling and hybridization with chromosomes. We designed new primers for amplification of microsatellite loci identified in the A. stephensi genome sequenced with next-generation technologies. Twelve microsatellites were mapped to polytene chromosomes from ovarian nurse cells of A. stephensi using fluorescent in situ hybridization. All microsatellites hybridized to unique locations on autosomes, and 7 of them localized to the largest arm 2R. Ten microsatellites were mapped inside the previously described polymorphic chromosomal inversions, including 4 loci located inside the widespread inversion 2Rb. We analyzed microsatellite-based population genetic data available for A. stephensi in light of our mapping results. This study demonstrates that the chromosomal position of microsatellites may affect estimates of population genetic parameters and highlights the importance of developing physical maps for nonmodel organisms.
Zhao, Hansheng; Yang, Li; Peng, Zhenhua; Sun, Huayu; Yue, Xianghua; Lou, Yongfeng; Dong, Lili; Wang, Lili; Gao, Zhimin
2015-01-26
Morphology-based taxonomy via exiguously reproductive organ has severely limitation on bamboo taxonomy, mainly owing to infrequent and unpredictable flowering events of bamboo. Here, we present the first genome-wide analysis and application of microsatellites based on the genome of moso bamboo (Phyllostachys edulis) to assist bamboo taxonomy. Of identified 127,593 microsatellite repeat-motifs, the primers of 1,451 microsatellites were designed and 1,098 markers were physically mapped on the genome of moso bamboo. A total of 917 markers were successfully validated in 9 accessions with ~39.8% polymorphic potential. Retrieved from validated microsatellite markers, 23 markers were selected for polymorphic analysis among 78 accessions and 64 alleles were detected with an average of 2.78 alleles per primers. The cluster result indicated the majority of the accessions were consistent with their current taxonomic classification, confirming the suitability and effectiveness of the developed microsatellite markers. The variations of microsatellite marker in different species were confirmed by sequencing and in silico comparative genome mapping were investigated. Lastly, a bamboo microsatellites database (http://www.bamboogdb.org/ssr) was implemented to browse and search large information of bamboo microsatellites. Consequently, our results of microsatellite marker development are valuable for assisting bamboo taxonomy and investigating genomic studies in bamboo and related grass species.
Survey and Analysis of Microsatellites in the Silkworm, Bombyx mori
Prasad, M. Dharma; Muthulakshmi, M.; Madhu, M.; Archak, Sunil; Mita, K.; Nagaraju, J.
2005-01-01
We studied microsatellite frequency and distribution in 21.76-Mb random genomic sequences, 0.67-Mb BAC sequences from the Z chromosome, and 6.3-Mb EST sequences of Bombyx mori. We mined microsatellites of ≥15 bases of mononucleotide repeats and ≥5 repeat units of other classes of repeats. We estimated that microsatellites account for 0.31% of the genome of B. mori. Microsatellite tracts of A, AT, and ATT were the most abundant whereas their number drastically decreased as the length of the repeat motif increased. In general, tri- and hexanucleotide repeats were overrepresented in the transcribed sequences except TAA, GTA, and TGA, which were in excess in genomic sequences. The Z chromosome sequences contained shorter repeat types than the rest of the chromosomes in addition to a higher abundance of AT-rich repeats. Our results showed that base composition of the flanking sequence has an influence on the origin and evolution of microsatellites. Transitions/transversions were high in microsatellites of ESTs, whereas the genomic sequence had an equal number of substitutions and indels. The average heterozygosity value for 23 polymorphic microsatellite loci surveyed in 13 diverse silkmoth strains having 2–14 alleles was 0.54. Only 36 (18.2%) of 198 microsatellite loci were polymorphic between the two divergent silkworm populations and 10 (5%) loci revealed null alleles. The microsatellite map generated using these polymorphic markers resulted in 8 linkage groups. B. mori microsatellite loci were the most conserved in its immediate ancestor, B. mandarina, followed by the wild saturniid silkmoth, Antheraea assama. PMID:15371363
Fu, Minghui; Jiang, Lihua; Li, Yuanmei; Yan, Guohua; Zheng, Lijun; Jinping, Peng
2014-12-01
Eichhornia crassipes is an aquatic plant native to the Amazon River Basin. It has become a serious weed in freshwater habitats in rivers, lakes and reservoirs both in tropical and warm temperate areas worldwide. Some research has stated that it can be used for water phytoremediation, due to its strong assimilation of nitrogen and phosphorus, and the accumulation of heavy metals, and its growth and spread may play an important role in environmental ecology. In order to explore the molecular mechanism of E. crassipes to responses to nitrogen deficiency, we constructed forward and reversed subtracted cDNA libraries for E. crassipes roots under nitrogen deficient condition using a suppressive subtractive hybridization (SSH) method. The forward subtraction included 2,100 clones, and the reversed included 2,650 clones. One thousand clones were randomly selected from each library for sequencing. About 737 (527 unigenes) clones from the forward library and 757 (483 unigenes) clones from the reversed library were informative. Sequence BlastX analysis showed that there were more transporters and adenosylhomocysteinase-like proteins in E. crassipes cultured in nitrogen deficient medium; while, those cultured in nitrogen replete medium had more proteins such as UBR4-like e3 ubiquitin-protein ligase and fasciclin-like arabinogalactan protein 8-like, as well as more cytoskeletal proteins, including actin and tubulin. Cluster of Orthologous Group (COG) analysis also demonstrated that in the forward library, the most ESTs were involved in coenzyme transportation and metabolism. In the reversed library, cytoskeletal ESTs were the most abundant. Gene Ontology (GO) analysis categories demonstrated that unigenes involved in binding, cellular process and electron carrier were the most differentially expressed unigenes between the forward and reversed libraries. All these results suggest that E. crassipes can respond to different nitrogen status by efficiently regulating and controlling some transporter gene expressions, certain metabolism processes, specific signal transduction pathways and cytoskeletal construction.
Jia, Tianqi; Wei, Danfeng; Meng, Shan; Allan, Andrew C.; Zeng, Lihui
2014-01-01
Longan (Dimocarpus longan L.) is a tropical/subtropical fruit tree of significant economic importance in Southeast Asia. However, a lack of transcriptomic and genomic information hinders research on longan traits, such as the control of flowering. In this study, high-throughput RNA sequencing (RNA-Seq) was used to investigate differentially expressed genes between a unique longan cultivar ‘Sijimi’(S) which flowers throughout the year and a more typical cultivar ‘Lidongben’(L) which flowers only once in the season, with the aim of identifying candidate genes associated with continuous flowering. 36,527 and 40,982 unigenes were obtained by de novo assembly of the clean reads from cDNA libraries of L and S cultivars. Additionally 40,513 unigenes were assembled from combined reads of these libraries. A total of 32,475 unigenes were annotated by BLAST search to NCBI non-redundant protein (NR), Swiss-Prot, Clusters of Orthologous Groups (COGs) and Kyoto Encyclopedia of Genes and Genomes (KEGG) databases. Of these, almost fifteen thousand unigenes were identified as significantly differentially expressed genes (DEGs) by using Reads Per kb per Million reads (RPKM) method. A total of 6,415 DEGs were mapped to 128 KEGG pathways, and 8,743 DEGs were assigned to 54 Gene Ontology categories. After blasting the DEGs to public sequence databases, 539 potential flowering-related DEGs were identified. In addition, 107 flowering-time genes were identified in longan, their expression levels between two longan samples were compared by RPKM method, of which the expression levels of 15 were confirmed by real-time quantitative PCR. Our results suggest longan homologues of SHORT VEGETATIVE PHASE (SVP), GIGANTEA (GI), F-BOX 1 (FKF1) and EARLY FLOWERING 4 (ELF4) may be involved this flowering trait and ELF4 may be a key gene. The identification of candidate genes related to continuous flowering will provide new insight into the molecular process of regulating flowering time in woody plants. PMID:25479005
Mohanty, Jatindra Nath; Nayak, Sanghamitra; Jha, Sumita; Joshi, Raj Kumar
2017-08-30
Dioecious species offer an inclusive structure to study the molecular basis of sexual dimorphism in angiosperms. Despite having a small genome and heteromorphic sex chromosomes, Coccinia grandis is a highly neglected dioecious species with little information available on its physical state, genetic orientation and key sex-defining elements. In the present study, we performed RNA-Seq and DGE analysis of male (MB) and female (FB) buds in C. grandis to gain insights into the molecular basis of sex determination in this plant. De novo assembly of 75 million clean reads resulted in 72,479 unigenes for male library and 63,308 unigenes for female library with a mean length of 736bp. 61,458 (85.57%) unigenes displayed significant similarity with protein sequences from publicly available databases. Comparative transcriptome analyses revealed 1410 unigenes as differentially expressed (DEGs) between MB and FB samples. A consistent correlation between the expression levels of DEGs was observed for the RNA-Seq pattern and qRT-PCR validation. Functional annotation showed high enrichment of DEGs involved in phytohormone biosynthesis, hormone signaling and transduction, transcriptional regulation and methyltransferase activity. High induction of hormone responsive genes such as ARF6, ACC synthase1, SNRK2 and BRI1-associated receptor kinase 1 (BAK1) suggest that multiple phytohormones and their signaling crosstalk play crucial role in sex determination in this species. Beside, the transcription factors such as zinc fingers, homeodomain leucine zippers and MYBs were identified as major determinants of male specific expression. Moreover, the detection of multiple DEGs as the miRNA target site implies that a small RNA mediated gene silencing cascade may also be regulating gender differentiation in C. grandis. Overall, the present transcriptome resources provide us a large number of DEGs involved in sex expression and could form the groundwork for unravelling the molecular mechanism of sex determination in C. grandis. Copyright © 2017 Elsevier B.V. All rights reserved.
Wu, Qing-jun; Wang, Shao-li; Yang, Xin; Yang, Ni-na; Li, Ru-mei; Jiao, Xiao-guo; Pan, Hui-peng; Liu, Bai-ming; Su, Qi; Xu, Bao-yun; Hu, Song-nian; Zhou, Xu-guo; Zhang, You-jun
2012-01-01
Background Bemisia tabaci (Gennadius) is a phloem-feeding insect poised to become one of the major insect pests in open field and greenhouse production systems throughout the world. The high level of resistance to insecticides is a main factor that hinders continued use of insecticides for suppression of B. tabaci. Despite its prevalence, little is known about B. tabaci at the genome level. To fill this gap, an invasive B. tabaci B biotype was subjected to pyrosequencing-based transcriptome analysis to identify genes and gene networks putatively involved in various physiological and toxicological processes. Methodology and Principal Findings Using Roche 454 pyrosequencing, 857,205 reads containing approximately 340 megabases were obtained from the B. tabaci transcriptome. De novo assembly generated 178,669 unigenes including 30,980 from insects, 17,881 from bacteria, and 129,808 from the nohit. A total of 50,835 (28.45%) unigenes showed similarity to the non-redundant database in GenBank with a cut-off E-value of 10–5. Among them, 40,611 unigenes were assigned to one or more GO terms and 6,917 unigenes were assigned to 288 known pathways. De novo metatranscriptome analysis revealed highly diverse bacterial symbionts in B. tabaci, and demonstrated the host-symbiont cooperation in amino acid production. In-depth transcriptome analysis indentified putative molecular markers, and genes potentially involved in insecticide resistance and nutrient digestion. The utility of this transcriptome was validated by a thiamethoxam resistance study, in which annotated cytochrome P450 genes were significantly overexpressed in the resistant B. tabaci in comparison to its susceptible counterparts. Conclusions This transcriptome/metatranscriptome analysis sheds light on the molecular understanding of symbiosis and insecticide resistance in an agriculturally important phloem-feeding insect pest, and lays the foundation for future functional genomics research of the B. tabaci complex. Moreover, current pyrosequencing effort greatly enriched the existing whitefly EST database, and makes RNAseq a viable option for future genomic analysis. PMID:22558125
Transcriptome Analysis and Comparison of Marmota monax and Marmota himalayana.
Liu, Yanan; Wang, Baoju; Wang, Lu; Vikash, Vikash; Wang, Qin; Roggendorf, Michael; Lu, Mengji; Yang, Dongliang; Liu, Jia
2016-01-01
The Eastern woodchuck (Marmota monax) is a classical animal model for studying hepatitis B virus (HBV) infection and hepatocellular carcinoma (HCC) in humans. Recently, we found that Marmota himalayana, an Asian animal species closely related to Marmota monax, is susceptible to woodchuck hepatitis virus (WHV) infection and can be used as a new mammalian model for HBV infection. However, the lack of genomic sequence information of both Marmota models strongly limited their application breadth and depth. To address this major obstacle of the Marmota models, we utilized Illumina RNA-Seq technology to sequence the cDNA libraries of liver and spleen samples of two Marmota monax and four Marmota himalayana. In total, over 13 billion nucleotide bases were sequenced and approximately 1.5 billion clean reads were obtained. Following assembly, 106,496 consensus sequences of Marmota monax and 78,483 consensus sequences of Marmota himalayana were detected. For functional annotation, in total 73,603 Unigenes of Marmota monax and 78,483 Unigenes of Marmota himalayana were identified using different databases (NR, NT, Swiss-Prot, KEGG, COG, GO). The Unigenes were aligned by blastx to protein databases to decide the coding DNA sequences (CDS) and in total 41,247 CDS of Marmota monax and 34,033 CDS of Marmota himalayana were predicted. The single nucleotide polymorphisms (SNPs) and the simple sequence repeats (SSRs) were also analyzed for all Unigenes obtained. Moreover, a large-scale transcriptome comparison was performed and revealed a high similarity in transcriptome sequences between the two marmota species. Our study provides an extensive amount of novel sequence information for Marmota monax and Marmota himalayana. This information may serve as a valuable genomics resource for further molecular, developmental and comparative evolutionary studies, as well as for the identification and characterization of functional genes that are involved in WHV infection and HCC development in the woodchuck model.
Transcriptome Analysis and Comparison of Marmota monax and Marmota himalayana
Wang, Lu; Vikash, Vikash; Wang, Qin; Roggendorf, Michael; Lu, Mengji; Yang, Dongliang; Liu, Jia
2016-01-01
The Eastern woodchuck (Marmota monax) is a classical animal model for studying hepatitis B virus (HBV) infection and hepatocellular carcinoma (HCC) in humans. Recently, we found that Marmota himalayana, an Asian animal species closely related to Marmota monax, is susceptible to woodchuck hepatitis virus (WHV) infection and can be used as a new mammalian model for HBV infection. However, the lack of genomic sequence information of both Marmota models strongly limited their application breadth and depth. To address this major obstacle of the Marmota models, we utilized Illumina RNA-Seq technology to sequence the cDNA libraries of liver and spleen samples of two Marmota monax and four Marmota himalayana. In total, over 13 billion nucleotide bases were sequenced and approximately 1.5 billion clean reads were obtained. Following assembly, 106,496 consensus sequences of Marmota monax and 78,483 consensus sequences of Marmota himalayana were detected. For functional annotation, in total 73,603 Unigenes of Marmota monax and 78,483 Unigenes of Marmota himalayana were identified using different databases (NR, NT, Swiss-Prot, KEGG, COG, GO). The Unigenes were aligned by blastx to protein databases to decide the coding DNA sequences (CDS) and in total 41,247 CDS of Marmota monax and 34,033 CDS of Marmota himalayana were predicted. The single nucleotide polymorphisms (SNPs) and the simple sequence repeats (SSRs) were also analyzed for all Unigenes obtained. Moreover, a large-scale transcriptome comparison was performed and revealed a high similarity in transcriptome sequences between the two marmota species. Our study provides an extensive amount of novel sequence information for Marmota monax and Marmota himalayana. This information may serve as a valuable genomics resource for further molecular, developmental and comparative evolutionary studies, as well as for the identification and characterization of functional genes that are involved in WHV infection and HCC development in the woodchuck model. PMID:27806133
Chen, Xin; Zhang, Jin; Liu, Qingzhong; Guo, Wei; Zhao, Tiantian; Ma, Qinghua; Wang, Guixi
2014-01-01
The genus Corylus is an important woody species in Northeast China. Its products, hazelnuts, constitute one of the most important raw materials for the pastry and chocolate industry. However, limited genetic research has focused on Corylus because of the lack of genomic resources. The advent of high-throughput sequencing technologies provides a turning point for Corylus research. In the present study, we performed de novo transcriptome sequencing for the first time to produce a comprehensive database for the Corylus heterophylla Fisch floral buds. The C. heterophylla Fisch floral buds transcriptome was sequenced using the Illumina paired-end sequencing technology. We produced 28,930,890 raw reads and assembled them into 82,684 contigs. A total of 40,941 unigenes were identified, among which 30,549 were annotated in the NCBI Non-redundant (Nr) protein database and 18,581 were annotated in the Swiss-Prot database. Of these annotated unigenes, 25,311 and 10,514 unigenes were assigned to gene ontology (GO) categories and clusters of orthologous groups (COG), respectively. We could map 17,207 unigenes onto 128 pathways using the Kyoto Encyclopedia of Genes and Genomes Pathway (KEGG) database. Additionally, based on the transcriptome, we constructed a candidate cold tolerance gene set of C. heterophylla Fisch floral buds. The expression patterns of selected genes during four stages of cold acclimation suggested that these genes might be involved in different cold responsive stages in C. heterophylla Fisch floral buds. The transcriptome of C. heterophylla Fisch floral buds was deep sequenced, de novo assembled, and annotated, providing abundant data to better understand the C. heterophylla Fisch floral buds transcriptome. Candidate genes potentially involved in cold tolerance were identified, providing a material basis for future molecular mechanism analysis of C. heterophylla Fisch floral buds tolerant to cold stress.
Hill, Theresa A.; Ashrafi, Hamid; Reyes-Chin-Wo, Sebastian; Yao, JiQiang; Stoffel, Kevin; Truco, Maria-Jose; Kozik, Alexander; Michelmore, Richard W.; Van Deynze, Allen
2013-01-01
The widely cultivated pepper, Capsicum spp., important as a vegetable and spice crop world-wide, is one of the most diverse crops. To enhance breeding programs, a detailed characterization of Capsicum diversity including morphological, geographical and molecular data is required. Currently, molecular data characterizing Capsicum genetic diversity is limited. The development and application of high-throughput genome-wide markers in Capsicum will facilitate more detailed molecular characterization of germplasm collections, genetic relationships, and the generation of ultra-high density maps. We have developed the Pepper GeneChip® array from Affymetrix for polymorphism detection and expression analysis in Capsicum. Probes on the array were designed from 30,815 unigenes assembled from expressed sequence tags (ESTs). Our array design provides a maximum redundancy of 13 probes per base pair position allowing integration of multiple hybridization values per position to detect single position polymorphism (SPP). Hybridization of genomic DNA from 40 diverse C. annuum lines, used in breeding and research programs, and a representative from three additional cultivated species (C. frutescens, C. chinense and C. pubescens) detected 33,401 SPP markers within 13,323 unigenes. Among the C. annuum lines, 6,426 SPPs covering 3,818 unigenes were identified. An estimated three-fold reduction in diversity was detected in non-pungent compared with pungent lines, however, we were able to detect 251 highly informative markers across these C. annuum lines. In addition, an 8.7 cM region without polymorphism was detected around Pun1 in non-pungent C. annuum. An analysis of genetic relatedness and diversity using the software Structure revealed clustering of the germplasm which was confirmed with statistical support by principle components analysis (PCA) and phylogenetic analysis. This research demonstrates the effectiveness of parallel high-throughput discovery and application of genome-wide transcript-based markers to assess genetic and genomic features among Capsicum annuum. PMID:23409153
Hill, Theresa A; Ashrafi, Hamid; Reyes-Chin-Wo, Sebastian; Yao, JiQiang; Stoffel, Kevin; Truco, Maria-Jose; Kozik, Alexander; Michelmore, Richard W; Van Deynze, Allen
2013-01-01
The widely cultivated pepper, Capsicum spp., important as a vegetable and spice crop world-wide, is one of the most diverse crops. To enhance breeding programs, a detailed characterization of Capsicum diversity including morphological, geographical and molecular data is required. Currently, molecular data characterizing Capsicum genetic diversity is limited. The development and application of high-throughput genome-wide markers in Capsicum will facilitate more detailed molecular characterization of germplasm collections, genetic relationships, and the generation of ultra-high density maps. We have developed the Pepper GeneChip® array from Affymetrix for polymorphism detection and expression analysis in Capsicum. Probes on the array were designed from 30,815 unigenes assembled from expressed sequence tags (ESTs). Our array design provides a maximum redundancy of 13 probes per base pair position allowing integration of multiple hybridization values per position to detect single position polymorphism (SPP). Hybridization of genomic DNA from 40 diverse C. annuum lines, used in breeding and research programs, and a representative from three additional cultivated species (C. frutescens, C. chinense and C. pubescens) detected 33,401 SPP markers within 13,323 unigenes. Among the C. annuum lines, 6,426 SPPs covering 3,818 unigenes were identified. An estimated three-fold reduction in diversity was detected in non-pungent compared with pungent lines, however, we were able to detect 251 highly informative markers across these C. annuum lines. In addition, an 8.7 cM region without polymorphism was detected around Pun1 in non-pungent C. annuum. An analysis of genetic relatedness and diversity using the software Structure revealed clustering of the germplasm which was confirmed with statistical support by principle components analysis (PCA) and phylogenetic analysis. This research demonstrates the effectiveness of parallel high-throughput discovery and application of genome-wide transcript-based markers to assess genetic and genomic features among Capsicum annuum.
Wang, Yan; Li, Kui; Zheng, Baoqiang; Miao, Kun
2015-01-01
Tree peony (Paeonia suffruticosa Andrews) is a very famous traditional ornamental plant in China. P. delavayi is a species endemic to Southwest China that has aroused great interest from researchers as a precious genetic resource for flower color breeding. However, the current understanding of the molecular mechanisms of flower pigmentation in this plant is limited, hindering the genetic engineering of novel flower color in tree peonies. In this study, we conducted a large-scale transcriptome analysis based on Illumina HiSeq sequencing of cDNA libraries generated from yellow and purple-red P. delavayi petals. A total of 90,202 unigenes were obtained by de novo assembly, with an average length of 721 nt. Using Blastx, 44,811 unigenes (49.68%) were found to have significant similarity to accessions in the NR, NT, and Swiss-Prot databases. We also examined COG, GO and KEGG annotations to better understand the functions of these unigenes. Further analysis of the two digital transcriptomes revealed that 6,855 unigenes were differentially expressed between yellow and purple-red flower petals, with 3,430 up-regulated and 3,425 down-regulated. According to the RNA-Seq data and qRT-PCR analysis, we proposed that four up-regulated key structural genes, including F3H, DFR, ANS and 3GT, might play an important role in purple-red petal pigmentation, while high co-expression of THC2'GT, CHI and FNS II ensures the accumulation of pigments contributing to the yellow color. We also found 50 differentially expressed transcription factors that might be involved in flavonoid biosynthesis. This study is the first to report genetic information for P. delavayi. The large number of gene sequences produced by transcriptome sequencing and the candidate genes identified using pathway mapping and expression profiles will provide a valuable resource for future association studies aimed at better understanding the molecular mechanisms underlying flower pigmentation in tree peonies. PMID:26267644
Gao, Jinjun; Yu, Xinxin; Ma, Fengming; Li, Jing
2014-01-01
Background Broccoli (Brassica oleracea var. italica), a member of Cruciferae, is an important vegetable containing high concentration of various nutritive and functional molecules especially the anticarcinogenic glucosinolates. The sprouts of broccoli contain 10–100 times higher level of glucoraphanin, the main contributor of the anticarcinogenesis, than the edible florets. Despite the broccoli sprouts’ functional importance, currently available genetic and genomic tools for their studies are very limited, which greatly restricts the development of this functionally important vegetable. Results A total of ∼85 million 251 bp reads were obtained. After de novo assembly and searching the assembled transcripts against the Arabidopsis thaliana and NCBI nr databases, 19,441 top-hit transcripts were clustered as unigenes with an average length of 2,133 bp. These unigenes were classified according to their putative functional categories. Cluster analysis of total unigenes with similar expression patterns and differentially expressed unigenes among different tissues, as well as transcription factor analysis were performed. We identified 25 putative glucosinolate metabolism genes sharing 62.04–89.72% nucleotide sequence identity with the Arabidopsis orthologs. This established a broccoli glucosinolate metabolic pathway with high colinearity to Arabidopsis. Many of the biosynthetic and degradation genes showed higher expression after germination than in seeds; especially the expression of the myrosinase TGG2 was 20–130 times higher. These results along with the previous reports about these genes’ studies in Arabidopsis and the glucosinolate concentration in broccoli sprouts indicate the breakdown products of glucosinolates may play important roles in the stage of broccoli seed germination and sprout development. Conclusion Our study provides the largest genetic resource of broccoli to date. These data will pave the way for further studies and genetic engineering of broccoli sprouts and will also provide new insight into the genomic research of this species and its relatives. PMID:24586398
Gao, Jinjun; Yu, Xinxin; Ma, Fengming; Li, Jing
2014-01-01
Broccoli (Brassica oleracea var. italica), a member of Cruciferae, is an important vegetable containing high concentration of various nutritive and functional molecules especially the anticarcinogenic glucosinolates. The sprouts of broccoli contain 10-100 times higher level of glucoraphanin, the main contributor of the anticarcinogenesis, than the edible florets. Despite the broccoli sprouts' functional importance, currently available genetic and genomic tools for their studies are very limited, which greatly restricts the development of this functionally important vegetable. A total of ∼85 million 251 bp reads were obtained. After de novo assembly and searching the assembled transcripts against the Arabidopsis thaliana and NCBI nr databases, 19,441 top-hit transcripts were clustered as unigenes with an average length of 2,133 bp. These unigenes were classified according to their putative functional categories. Cluster analysis of total unigenes with similar expression patterns and differentially expressed unigenes among different tissues, as well as transcription factor analysis were performed. We identified 25 putative glucosinolate metabolism genes sharing 62.04-89.72% nucleotide sequence identity with the Arabidopsis orthologs. This established a broccoli glucosinolate metabolic pathway with high colinearity to Arabidopsis. Many of the biosynthetic and degradation genes showed higher expression after germination than in seeds; especially the expression of the myrosinase TGG2 was 20-130 times higher. These results along with the previous reports about these genes' studies in Arabidopsis and the glucosinolate concentration in broccoli sprouts indicate the breakdown products of glucosinolates may play important roles in the stage of broccoli seed germination and sprout development. Our study provides the largest genetic resource of broccoli to date. These data will pave the way for further studies and genetic engineering of broccoli sprouts and will also provide new insight into the genomic research of this species and its relatives.
Liu, Dan; Wang, Qianqian; Ruan, Zengliang; He, Qian; Zhang, Liming
2015-01-01
Background Jellyfish contain diverse toxins and other bioactive components. However, large-scale identification of novel toxins and bioactive components from jellyfish has been hampered by the low efficiency of traditional isolation and purification methods. Results We performed de novo transcriptome sequencing of the tentacle tissue of the jellyfish Cyanea capillata. A total of 51,304,108 reads were obtained and assembled into 50,536 unigenes. Of these, 21,357 unigenes had homologues in public databases, but the remaining unigenes had no significant matches due to the limited sequence information available and species-specific novel sequences. Functional annotation of the unigenes also revealed general gene expression profile characteristics in the tentacle of C. capillata. A primary goal of this study was to identify putative toxin transcripts. As expected, we screened many transcripts encoding proteins similar to several well-known toxin families including phospholipases, metalloproteases, serine proteases and serine protease inhibitors. In addition, some transcripts also resembled molecules with potential toxic activities, including cnidarian CfTX-like toxins with hemolytic activity, plancitoxin-1, venom toxin-like peptide-6, histamine-releasing factor, neprilysin, dipeptidyl peptidase 4, vascular endothelial growth factor A, angiotensin-converting enzyme-like and endothelin-converting enzyme 1-like proteins. Most of these molecules have not been previously reported in jellyfish. Interestingly, we also characterized a number of transcripts with similarities to proteins relevant to several degenerative diseases, including Huntington’s, Alzheimer’s and Parkinson’s diseases. This is the first description of degenerative disease-associated genes in jellyfish. Conclusion We obtained a well-categorized and annotated transcriptome of C. capillata tentacle that will be an important and valuable resource for further understanding of jellyfish at the molecular level and information on the underlying molecular mechanisms of jellyfish stinging. The findings of this study may also be used in comparative studies of gene expression profiling among different jellyfish species. PMID:26551022
Liu, Guoyan; Zhou, Yonghong; Liu, Dan; Wang, Qianqian; Ruan, Zengliang; He, Qian; Zhang, Liming
2015-01-01
Jellyfish contain diverse toxins and other bioactive components. However, large-scale identification of novel toxins and bioactive components from jellyfish has been hampered by the low efficiency of traditional isolation and purification methods. We performed de novo transcriptome sequencing of the tentacle tissue of the jellyfish Cyanea capillata. A total of 51,304,108 reads were obtained and assembled into 50,536 unigenes. Of these, 21,357 unigenes had homologues in public databases, but the remaining unigenes had no significant matches due to the limited sequence information available and species-specific novel sequences. Functional annotation of the unigenes also revealed general gene expression profile characteristics in the tentacle of C. capillata. A primary goal of this study was to identify putative toxin transcripts. As expected, we screened many transcripts encoding proteins similar to several well-known toxin families including phospholipases, metalloproteases, serine proteases and serine protease inhibitors. In addition, some transcripts also resembled molecules with potential toxic activities, including cnidarian CfTX-like toxins with hemolytic activity, plancitoxin-1, venom toxin-like peptide-6, histamine-releasing factor, neprilysin, dipeptidyl peptidase 4, vascular endothelial growth factor A, angiotensin-converting enzyme-like and endothelin-converting enzyme 1-like proteins. Most of these molecules have not been previously reported in jellyfish. Interestingly, we also characterized a number of transcripts with similarities to proteins relevant to several degenerative diseases, including Huntington's, Alzheimer's and Parkinson's diseases. This is the first description of degenerative disease-associated genes in jellyfish. We obtained a well-categorized and annotated transcriptome of C. capillata tentacle that will be an important and valuable resource for further understanding of jellyfish at the molecular level and information on the underlying molecular mechanisms of jellyfish stinging. The findings of this study may also be used in comparative studies of gene expression profiling among different jellyfish species.
Transcriptomic Immune Response of Tenebrio molitor Pupae to Parasitization by Scleroderma guani
Zhu, Jia-Ying; Yang, Pu; Zhang, Zhong; Wu, Guo-Xing; Yang, Bin
2013-01-01
Background Host and parasitoid interaction is one of the most fascinating relationships of insects, which is currently receiving an increasing interest. Understanding the mechanisms evolved by the parasitoids to evade or suppress the host immune system is important for dissecting this interaction, while it was still poorly known. In order to gain insight into the immune response of Tenebrio molitor to parasitization by Scleroderma guani, the transcriptome of T. molitor pupae was sequenced with focus on immune-related gene, and the non-parasitized and parasitized T. molitor pupae were analyzed by digital gene expression (DGE) analysis with special emphasis on parasitoid-induced immune-related genes using Illumina sequencing. Methodology/Principal Findings In a single run, 264,698 raw reads were obtained. De novo assembly generated 71,514 unigenes with mean length of 424 bp. Of those unigenes, 37,373 (52.26%) showed similarity to the known proteins in the NCBI nr database. Via analysis of the transcriptome data in depth, 430 unigenes related to immunity were identified. DGE analysis revealed that parasitization by S. guani had considerable impacts on the transcriptome profile of T. molitor pupae, as indicated by the significant up- or down-regulation of 3,431 parasitism-responsive transcripts. The expression of a total of 74 unigenes involved in immune response of T. molitor was significantly altered after parasitization. Conclusions/Significance obtained T. molitor transcriptome, in addition to establishing a fundamental resource for further research on functional genomics, has allowed the discovery of a large group of immune genes that might provide a meaningful framework to better understand the immune response in this species and other beetles. The DGE profiling data provides comprehensive T. molitor immune gene expression information at the transcriptional level following parasitization, and sheds valuable light on the molecular understanding of the host-parasitoid interaction. PMID:23342153
Isolation and characterization of microsatellite Loci for Cornus sanguniea (Cornaceae) 1
USDA-ARS?s Scientific Manuscript database
Premise of the study: Microsatellite loci were developed for Cornus sanguinea and will permit genetic and conservation studies of the species. Methods and Results: A microsatellite-enriched library was used to develop 16 polymorphic microsatellite loci for C. sanguinea. The loci amplified 5-11 allel...
Development of microsatellite loci for the endangered species Pityopsis ruthii (Asteraceae)1
USDA-ARS?s Scientific Manuscript database
Premise of the study: Microsatellite loci were developed for the endangered species Pityopsis ruthii and will permit genetic and conservation studies of the species. Methods and Results:A microsatellite enriched library was used to develop 12 polymorphic microsatellite loci for P. ruthii. The loci ...
Li, Shi-Weng; Shi, Rui-Fang; Leng, Yan; Zhou, Yuan
2016-01-12
Auxin plays a critical role in inducing adventitious rooting in many plants. Indole-3-butyric acid (IBA) is the most widely employed auxin for adventitious rooting. However, the molecular mechanisms by which auxin regulate the process of adventitious rooting are less well known. The RNA-Seq data analysis indicated that IBA treatment greatly increased the amount of clean reads and the amount of expressed unigenes by 24.29 % and 27.42 % and by 4.3 % and 5.04 % at two time points, respectively, and significantly increased the numbers of unigenes numbered with RPKM = 10-100 and RPKM = 500-1000 by 13.04 % and 3.12 % and by 24.66 % and 108.2 % at two time points, respectively. Gene Ontology (GO) enrichment analysis indicated that the enrichment of down-regulated GOs was 2.87-fold higher than that of up-regulated GOs at stage 1, suggesting that IBA significantly down-regulated gene expression at 6 h. The GO functional category indicated that IBA significantly up- or down-regulated processes associated with auxin signaling, ribosome assembly and protein synthesis, photosynthesis, oxidoreductase activity and extracellular region, secondary cell wall biogenesis, and the cell wall during the development process. Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment indicated that ribosome biogenesis, plant hormone signal transduction, pentose and glucuronate interconversions, photosynthesis, phenylpropanoid biosynthesis, sesquiterpenoid and triterpenoid biosynthesis, ribosome, cutin, flavonoid biosynthesis, and phenylalanine metabolism were the pathways most highly regulated by IBA. A total of 6369 differentially expressed (2-fold change > 2) unigenes (DEGs) with 3693 (58 %) that were up-regulated and 2676 (42 %) down-regulated, 5433 unigenes with 2208 (40.6 %) that were up-regulated and 3225 (59.4 %) down-regulated, and 7664 unigenes with 3187 (41.6 %) that were up-regulated and 4477 (58.4 %) down-regulated were detected at stage 1, stage 2, and between stage 1 and stage 2, respectively, suggesting that IBA treatment increased the number of DEGs. A total of 143 DEGs specifically involved in plant hormone signaling and 345 transcription factor (TF) genes were also regulated by IBA. qRT-PCR validation of the 36 genes with known functions indicated a strong correlation with the RNA-Seq data. The changes in GO functional categories, KEGG pathways, and global DEG profiling during adventitious rooting induced by IBA were analyzed. These results provide valuable information about the molecular traits of IBA regulation of adventitious rooting.
Zhang, Cheng-Cai; Wang, Li-Yuan; Wei, Kang; Wu, Li-Yun; Li, Hai-Lin; Zhang, Fen; Cheng, Hao; Ni, De-Jiang
2016-05-17
Self-incompatibility (SI) is under genetic control and prevents inbreeding depression in angiosperms. SI mechanisms are quite complicated and still poorly understood in many plants. Tea (Camellia sinensis L.) belonging to the family of Theaceae, exhibits high levels of SI and high heterozygosity. Uncovering the molecular basis of SI of the tea plant may enhance breeding and simplify genomics research for the whole family. The growth of pollen tubes following selfing and crossing was observed using fluorescence microscopy. Self-pollen tubes grew slower than cross treatments from 24 h to 72 h after pollination. RNA-seq was employed to explore the molecular mechanisms of SI and to identify SI-related genes in C. sinensis. Self and cross-pollinated styles were collected at 24 h, 48 h and 72 h after pollination. Six RNA-seq libraries (SP24, SP48, SP72, CP24 CP48 and CP72; SP = self-pollinated, CP = cross-pollinated) were constructed and separately sequenced. In total, 299.327 million raw reads were generated. Following assembly, 63,762 unigenes were identified, and 27,264 (42.76 %) unigenes were annotated in five public databases: NR, KOG, KEGG, Swiss-Port and GO. To identify SI-related genes, the fragments per kb per million mapped reads (FPKM) values of each unigene were evaluated. Comparisons of CP24 vs. SP24, CP48 vs. SP48 and CP72 vs. SP72 revealed differential expression of 3,182, 3,575 and 3,709 genes, respectively. Consequently, several ubiquitin-mediated proteolysis, Ca(2+) signaling, apoptosis and defense-associated genes were obtained. The temporal expression pattern of genes following CP and SP was analyzed; 6 peroxidase, 1 polyphenol oxidase and 7 salicylic acid biosynthetic process-related genes were identified. The RNA-seq data were validated by qRT-PCR of 15 unigenes. Finally, a unigene (CL25983Contig1) with strong homology to the S-RNase was analyzed. It was mainly expressed in styles, with dramatically higher expression in self-pollinated versus cross-pollinated tissues at 24 h post-pollination. The present study reports the transcriptome of styles after cross- and self-pollination in tea and offers novel insights into the molecular mechanism behind SI in C. sinensis. We believe that this RNA-seq dataset will be useful for improvement in C. sinensis as well as other plants in the Theaceae family.
Zhao, Daqiu; Jiang, Yao; Ning, Chuanlong; Meng, Jiasong; Lin, Shasha; Ding, Wen; Tao, Jun
2014-08-19
Herbaceous peony (Paeonia lactiflora Pall.) is a traditional flower in China and a wedding attractive flower in worldwide. In its flower colour, yellow is the rarest which is ten times the price of the other colours. However, the breeding of new yellow P. lactiflora varieties using genetic engineering is severely limited due to the little-known biochemical and molecular mechanisms underlying its characteristic formation. In this study, two cDNA libraries generated from P. lactiflora chimaera with red outer-petal and yellow inner-petal were sequenced using an Illumina HiSeq™ 2000 platform. 66,179,398 and 65,481,444 total raw reads from red outer-petal and yellow inner-petal cDNA libraries were generated, which were assembled into 61,431 and 70,359 Unigenes with an average length of 628 and 617 nt, respectively. Moreover, 61,408 non-redundant All-unigenes were obtained, with 37,511 All-unigenes (61.08%) annotated in public databases. In addition, 6,345 All-unigenes were differentially expressed between the red outer-petal and yellow inner-petal, with 3,899 up-regulated and 2,446 down-regulated All-unigenes, and the flavonoid metabolic pathway related to colour development was identified using the Kyoto encyclopedia of genes and genomes database (KEGG). Subsequently, the expression patterns of 10 candidate differentially expressed genes (DEGs) involved in the flavonoid metabolic pathway were examined, and flavonoids were qualitatively and quantitatively analysed. Numerous anthoxanthins (flavone and flavonol) and a few anthocyanins were detected in the yellow inner-petal, which were all lower than those in the red outer-petal due to the low expression levels of the phenylalanine ammonialyase gene (PlPAL), flavonol synthase gene (PlFLS), dihydroflavonol 4-reductase gene (PlDFR), anthocyanidin synthase gene (PlANS), anthocyanidin 3-O-glucosyltransferase gene (Pl3GT) and anthocyanidin 5-O-glucosyltransferase gene (Pl5GT). Transcriptome sequencing (RNA-Seq) analysis based on the high throughput sequencing technology was an efficient approach to identify critical genes in P. lactiflora and other non-model plants. The flavonoid metabolic pathway and glucide metabolic pathway were identified as relatived yellow formation in P. lactiflora, PlPAL, PlFLS, PlDFR, PlANS, Pl3GT and Pl5GT were selected as potential candidates involved in flavonoid metabolic pathway, which inducing inhibition of anthocyanin biosynthesis mediated yellow formation in P. lactiflora. This study could lay a theoretical foundation for breeding new yellow P. lactiflora varieties.
Genetic diversity of loquat germplasm (Eriobotrya japonica (Thunb) Lindl) assessed by SSR markers.
Soriano, José Miguel; Romero, Carlos; Vilanova, Santiago; Llácer, Gerardo; Badenes, María Luisa
2005-02-01
Genetic relationships among 40 loquat (Eriobotrya japonica (Thunb) Lindl) accessions that originated from different countries and that are part of the germplasm collection of the Instituto Valenciano de Investigaciones Agrarias (IVIA) (Valencia, Spain) were evaluated using microsatellites. Thirty primer pairs flanking microsatellites previously identified in Malus x domestica (Borkh.) were assayed. Thirteen of them amplified polymorphic products and unambiguously distinguished 34 genotypes from the 40 accessions analyzed. Six accessions showing identical marker patterns were Spanish local varieties thought to have been derived from 'Algerie' by a mutational process very common in loquat species. A total of 39 alleles were detected in the population studied, with a mean value of 2.4 alleles per locus. The expected and observed heterozygosities were 0.46 and 51% on average, respectively, leading to a negative value of the Wright's fixation index (-0.20). The values of these parameters indicate a smaller degree of genetic diversity in the set of loquat accessions analyzed than in other members of the Rosaceae family. Unweighted pair-group method (UPGMA) cluster analysis, based on Nei's genetic distance, generally grouped genotypes according to their geographic origins and pedigrees. The high number of alleles and the high expected heterozygosity detected with SSR markers developed in Malus x domestica (Borkh.) make them a suitable tool for loquat cultivar identification, confirming microsatellite marker transportability among genera in the Rosaceae family.
Characterization and mapping of complementary lesion-mimic genes lm1 and lm2 in common wheat.
Yao, Qin; Zhou, Ronghua; Fu, Tihua; Wu, Weiren; Zhu, Zhendong; Li, Aili; Jia, Jizeng
2009-10-01
A lesion-mimic phenotype appeared in a segregating population of common wheat cross Yanzhan 1/Zaosui 30. The parents had non-lesion normal phenotypes. Shading treatment and histochemical analyses showed that the lesions were caused by light-dependent cell death and were not associated with pathogens. Studies over two cropping seasons showed that some lines with more highly expressed lesion-mimic phenotypes exhibited significantly lower grain yields than those with the normal phenotype, but there were no significant effects in the lines with weakly expressed lesion-mimic phenotypes. Among yield traits, one-thousand grain weight was the most affected by lesion-mimic phenotypes. Genetic analysis indicated that this was a novel type of lesion mimic, which was caused by interaction of recessive genes derived from each parent. The lm1 (lesion mimic 1) locus from Zaosui 30 was flanked by microsatellite markers Xwmc674 and Xbarc133/Xbarc147 on chromosome 3BS, at genetic distances of 1.2 and 3.8 cM, respectively, whereas lm2 from Yanzhan 1 was mapped between microsatellite markers Xgwm513 and Xksum154 on chromosome 4BL, at genetic distances of 1.5 and 3 cM, respectively. The linked microsatellite makers identified in this study might be useful for evaluating whether potential parents with normal phenotype are carriers of lesion-mimic alleles.
Ashworth, V E T M; Clegg, M T
2003-01-01
Twenty-five microsatellite markers uniquely differentiated 35 avocado cultivars and two wild relatives. Average heterozygosity was high (60.7%), ranging from 32% in P. steyermarkii to 84% in Fuerte and Bacon. In a subset of 15 cultivars, heterozygosity averaged 63.5% for microsatellites, compared to 41.8% for restriction fragment length polymorphisms (RFLPs). A neighbor-joining tree, according to average shared allele distances, consisted of three clusters likely corresponding to the botanical races of avocado and intermediate clusters uniting genotypes of presumably racially hybrid origin. Several results were at odds with existing botanical assignments that are sometimes rendered difficult by incomplete pedigree information, the complexity of the hybrid status (multiple backcrossing), or both. For example, cv. Harvest clustered with the Guatemalan race cultivars, yet it is derived from the Guatemalan x Mexican hybrid cv. Gwen. Persea schiedeana grouped with cv. Bacon. The rootstock G875 emerged as the most divergent genotype in our data set. Considerable diversity was found particularly among accessions from Guatemala, including G810 (West Indian race), G6 (Mexican race), G755A (hybrid Guatemalan x P. schiedeana), and G875 (probably not P. americana). Low bootstrap support, even upon exclusion of (known) hybrid genotypes from the data matrix, suggests the existence of ancient hybridization or that the botanical races originated more recently than previously thought.
Carroll, E L; Baker, C S; Watson, M; Alderman, R; Bannister, J; Gaggiotti, O E; Gröcke, D R; Patenaude, N; Harcourt, R
2015-11-09
Fidelity to migratory destinations is an important driver of connectivity in marine and avian species. Here we assess the role of maternally directed learning of migratory habitats, or migratory culture, on the population structure of the endangered Australian and New Zealand southern right whale. Using DNA profiles, comprising mitochondrial DNA (mtDNA) haplotypes (500 bp), microsatellite genotypes (17 loci) and sex from 128 individually-identified whales, we find significant differentiation among winter calving grounds based on both mtDNA haplotype (FST = 0.048, ΦST = 0.109, p < 0.01) and microsatellite allele frequencies (FST = 0.008, p < 0.01), consistent with long-term fidelity to calving areas. However, most genetic comparisons of calving grounds and migratory corridors were not significant, supporting the idea that whales from different calving grounds mix in migratory corridors. Furthermore, we find a significant relationship between δ(13)C stable isotope profiles of 66 Australian southern right whales, a proxy for feeding ground location, and both mtDNA haplotypes and kinship inferred from microsatellite-based estimators of relatedness. This indicates migratory culture may influence genetic structure on feeding grounds. This fidelity to migratory destinations is likely to influence population recovery, as long-term estimates of historical abundance derived from estimates of genetic diversity indicate the South Pacific calving grounds remain at <10% of pre-whaling abundance.
Pumping capacity and reliability of cryogenic micro-pump for micro-satellite applications
NASA Astrophysics Data System (ADS)
Zhang, Xin; Zhao, Yi; Li, Biao; Ludlow, Daryl
2004-10-01
In micro-satellites, delicate instruments are compacted into a limited space. This raises concerns of active cooling and remote cooling. Silicon based micro-pump arrays are employed thanks to manufacturing simplicity, a small cryogen charge, etc, and keep the instruments within a narrow cryogenic temperature range. The pumping capacity and reliability of the micro-pump are critical in terms of heat balance calculation and lifetime evaluation. The pumping capacity is associated with the diaphragm deflection while the reliability is associated with stress and fatigue. Both of them heavily depend on the silicon diaphragm, one of the key components. This paper examines the pumping capacity and reliability of the micro-pump under cryogenic temperature for micro-satellite applications. In this work, differential pressure was used for the actuation of a single-crystal silicon diaphragm. Diaphragm deflection and stress distribution were achieved using interferometry and micro-Raman spectroscopy, respectively. As a result, smaller pumping capacity was derived under cryogenic temperature, compared to that under room temperature, indicating a stiffer material. From stress mapping, the edge centers were believed to be the most vulnerable to fracture, which was further validated by analyzing the fracture diaphragm. Moreover, a fatigue testing was conducted for 1.8 million cycles with no damage found, verifying silicon as a viable material for long time operation in a cryogenic environment.
Identification of common, unique and polymorphic microsatellites among 73 cyanobacterial genomes.
Kabra, Ritika; Kapil, Aditi; Attarwala, Kherunnisa; Rai, Piyush Kant; Shanker, Asheesh
2016-04-01
Microsatellites also known as Simple Sequence Repeats are short tandem repeats of 1-6 nucleotides. These repeats are found in coding as well as non-coding regions of both prokaryotic and eukaryotic genomes and play a significant role in the study of gene regulation, genetic mapping, DNA fingerprinting and evolutionary studies. The availability of 73 complete genome sequences of cyanobacteria enabled us to mine and statistically analyze microsatellites in these genomes. The cyanobacterial microsatellites identified through bioinformatics analysis were stored in a user-friendly database named CyanoSat, which is an efficient data representation and query system designed using ASP.net. The information in CyanoSat comprises of perfect, imperfect and compound microsatellites found in coding, non-coding and coding-non-coding regions. Moreover, it contains PCR primers with 200 nucleotides long flanking region. The mined cyanobacterial microsatellites can be freely accessed at www.compubio.in/CyanoSat/home.aspx. In addition to this 82 polymorphic, 13,866 unique and 2390 common microsatellites were also detected. These microsatellites will be useful in strain identification and genetic diversity studies of cyanobacteria.
Microsatellite DNA capture from enriched libraries.
Gonzalez, Elena G; Zardoya, Rafael
2013-01-01
Microsatellites are DNA sequences of tandem repeats of one to six nucleotides, which are highly polymorphic, and thus the molecular markers of choice in many kinship, population genetic, and conservation studies. There have been significant technical improvements since the early methods for microsatellite isolation were developed, and today the most common procedures take advantage of the hybrid capture methods of enriched-targeted microsatellite DNA. Furthermore, recent advents in sequencing technologies (i.e., next-generation sequencing, NGS) have fostered the mining of microsatellite markers in non-model organisms, affording a cost-effective way of obtaining a large amount of sequence data potentially useful for loci characterization. The rapid improvements of NGS platforms together with the increase in available microsatellite information open new avenues to the understanding of the evolutionary forces that shape genetic structuring in wild populations. Here, we provide detailed methodological procedures for microsatellite isolation based on the screening of GT microsatellite-enriched libraries, either by cloning and Sanger sequencing of positive clones or by direct NGS. Guides for designing new species-specific primers and basic genotyping are also given.
Microsatellite analysis of Fasciola spp. in Egypt.
Dar, Yasser; Amer, Said; Courtioux, Bertrand; Dreyfuss, Gilles
2011-12-01
Recently, the topic of diversity in Fasciola population in Egypt is controversial. The present study was performed to study the genetic diversity of isolated flukes based on microsatellites markers. Fasciola worms were collected from different hosts and geographical locations in Egypt. Control samples of Fasciola hepatica from France as well as Fasciola gigantica from Cameroon were included in the study. Collected flukes were identified morphologically and subjected for analysis using four microsatellite markers. Results of microsatellite profile (FM1 and FM2) proved that both species of Fasciola are distributed in Egypt irrespective of geographical location and host. Nevertheless, the microsatellite profile of some analyzed loci (FM2 and FM3) proved that Egyptian flukes showed more alleles compared to the reference ones. Differences of microsatellite profile in Egyptian isolates than that of corresponding reference samples indicate the remarkable diversity of these isolates. The present results highlighted the utility of microsatellite profile to discriminate between Fasciola species and to elucidate the diversity within the species. To our knowledge, this is the first time to study microsatellite polymorphism in Fasciola populations in Egypt.
NASA Astrophysics Data System (ADS)
Zhang, Wenjing; Yu, Yuhe; Shen, Yunfen; Miao, Wei; Feng, Weisong
2004-04-01
In this paper, we took the lead in studying on specificity of the microsatellite DNA loci and applicability of microsatellite DNA primers in protozoa. In order to study characters of microsatellites in free-living protozoa, eight microsatellite loci primers developed from Trypanosoma cruzi (MCLE01, SCLE10, MCLE08, SCLE11, MCLF10, MCLG10, MCL03, MCL05) were employed to amplify microsatellite in four free-living protozoa, including Bodo designis, Euglena gracilis FACHB848, Paramecium bruzise and Tetrahymena thermophila BF1. In the amplification systems of P. bruzise, four loci (SCLE10, SCLE11, MCLF10, MCL03) were amplified successfully, and four amplification fragments were in proper size. In genome of E. gracilis FACHB848, five of eight primers brought five clear amplification bands. In B. designis, three (No.4, 5 and 7) of eight loci produced clear and sharp products without stutter bands, whereas no bands appeared in T. thermophila BF1. Further, eight 300 500 bp amplification fragments were cloned and sequenced. Nevertheless, all sequenced products did not contain corresponding microsatellite sequence, although Bodo is in the same order and has the nearest phylogenetic relation with Trypanosoma among these four species. Thus, the microsatellite DNA primers can not be applied among order or more far taxa, and the specificity of microsatellite DNA is very high in protozoa. The results of this study will contribute to our understanding of microsatellite DNA in protozoa.
Nguyen, Thao T B; Arimatsu, Yuji; Hong, Sung-Jong; Brindley, Paul J; Blair, David; Laha, Thewarach; Sripa, Banchob
2015-06-01
Clonorchis sinensis is an important carcinogenic human liver fluke endemic in East and Southeast Asia. There are several conventional molecular markers that have been used for identification and genetic diversity; however, no information about microsatellites of this liver fluke is published so far. We here report microsatellite characterization and marker development for a genetic diversity study in C. sinensis, using a genome-wide bioinformatics approach. Based on our search criteria, a total of 256,990 microsatellites (≥12 base pairs) were identified from a genome database of C. sinensis, with hexanucleotide motif being the most abundant (51%) followed by pentanucleotide (18.3%) and trinucleotide (12.7%). The tetranucleotide, dinucleotide, and mononucleotide motifs accounted for 9.75, 7.63, and 0.14%, respectively. The total length of all microsatellites accounts for 0. 72% of 547 Mb of the whole genome size, and the frequency of microsatellites was found to be one microsatellite in every 2.13 kb of DNA. For the di-, tri-, and tetranucleotide, the repeat numbers redundant are six (28%), four (45%), and three (76%), respectively. The ATC repeat is the most abundant microsatellites followed by AT, AAT, and AC, respectively. Within 40 microsatellite loci developed, 24 microsatellite markers showed potential to differentiate between C. sinensis and Opisthorchis viverrini. Seven out of 24 loci showed to be heterozygous with observed heterozygosity that ranged from 0.467 to 1. Four primer sets could amplify both C. sinensis and O. viverrini DNA with different sizes. This study provides basic information of C. sinensis microsatellites, and the genome-wide markers developed may be a useful tool for the genetic study of C. sinensis.
Huang, Jie; Li, Yu-Zhi; Du, Lian-Ming; Yang, Bo; Shen, Fu-Jun; Zhang, He-Min; Zhang, Zhi-He; Zhang, Xiu-Yue; Yue, Bi-Song
2015-02-07
The giant panda (Ailuropoda melanoleuca) is a critically endangered species endemic to China. Microsatellites have been preferred as the most popular molecular markers and proven effective in estimating population size, paternity test, genetic diversity for the critically endangered species. The availability of the giant panda complete genome sequences provided the opportunity to carry out genome-wide scans for all types of microsatellites markers, which now opens the way for the analysis and development of microsatellites in giant panda. By screening the whole genome sequence of giant panda in silico mining, we identified microsatellites in the genome of giant panda and analyzed their frequency and distribution in different genomic regions. Based on our search criteria, a repertoire of 855,058 SSRs was detected, with mono-nucleotides being the most abundant. SSRs were found in all genomic regions and were more abundant in non-coding regions than coding regions. A total of 160 primer pairs were designed to screen for polymorphic microsatellites using the selected tetranucleotide microsatellite sequences. The 51 novel polymorphic tetranucleotide microsatellite loci were discovered based on genotyping blood DNA from 22 captive giant pandas in this study. Finally, a total of 15 markers, which showed good polymorphism, stability, and repetition in faecal samples, were used to establish the novel microsatellite marker system for giant panda. Meanwhile, a genotyping database for Chengdu captive giant pandas (n = 57) were set up using this standardized system. What's more, a universal individual identification method was established and the genetic diversity were analysed in this study as the applications of this marker system. The microsatellite abundance and diversity were characterized in giant panda genomes. A total of 154,677 tetranucleotide microsatellites were identified and 15 of them were discovered as the polymorphic and stable loci. The individual identification method and the genetic diversity analysis method in this study provided adequate material for the future study of giant panda.
Ritschel, Patricia Silva; Lins, Tulio Cesar de Lima; Tristan, Rodrigo Lourenço; Buso, Gláucia Salles Cortopassi; Buso, José Amauri; Ferreira, Márcio Elias
2004-01-01
Background Despite the great advances in genomic technology observed in several crop species, the availability of molecular tools such as microsatellite markers has been limited in melon (Cucumis melo L.) and cucurbit species. The development of microsatellite markers will have a major impact on genetic analysis and breeding of melon, especially on the generation of marker saturated genetic maps and implementation of marker assisted breeding programs. Genomic microsatellite enriched libraries can be an efficient alternative for marker development in such species. Results Seven hundred clones containing microsatellite sequences from a Tsp-AG/TC microsatellite enriched library were identified and one-hundred and forty-four primer pairs designed and synthesized. When 67 microsatellite markers were tested on a panel of melon and other cucurbit accessions, 65 revealed DNA polymorphisms among the melon accessions. For some cucurbit species, such as Cucumis sativus, up to 50% of the melon microsatellite markers could be readily used for DNA polymophism assessment, representing a significant reduction of marker development costs. A random sample of 25 microsatellite markers was extracted from the new microsatellite marker set and characterized on 40 accessions of melon, generating an allelic frequency database for the species. The average expected heterozygosity was 0.52, varying from 0.45 to 0.70, indicating that a small set of selected markers should be sufficient to solve questions regarding genotype identity and variety protection. Genetic distances based on microsatellite polymorphism were congruent with data obtained from RAPD marker analysis. Mapping analysis was initiated with 55 newly developed markers and most primers showed segregation according to Mendelian expectations. Linkage analysis detected linkage between 56% of the markers, distributed in nine linkage groups. Conclusions Genomic library microsatellite enrichment is an efficient procedure for marker development in melon. One-hundred and forty-four new markers were developed from Tsp-AG/TC genomic library. This is the first reported attempt of successfully using enriched library for microsatellite marker development in the species. A sample of the microsatellite markers tested proved efficient for genetic analysis of melon, including genetic distance estimates and identity tests. Linkage analysis indicated that the markers developed are dispersed throughout the genome and should be very useful for genetic analysis of melon. PMID:15149552
Chloroplast microsatellite primers for cacao (Theobroma cacao) and other Malvaceae.
Yang, Ji Y; Motilal, Lambert A; Dempewolf, Hannes; Maharaj, Kamaldeo; Cronk, Q C B
2011-12-01
Chloroplast microsatellites were developed in Theobroma cacao to examine the genetic diversity of cacao cultivars in Trinidad and Tobago. Nine polymorphic microsatellites were designed from the chloroplast genomes of two T. cacao accessions. These microsatellites were tested in 95 hybrid accessions from Trinidad and Tobago. An average of 2.9 alleles per locus was found. These chloroplast microsatellites, particularly the highly polymorphic pentameric repeat, were useful in assessing genetic variation in T. cacao. In addition, these markers should also prove to be useful for population genetic studies in other species of Malvaceae.
Kamphuis, Lars G; Hane, James K; Nelson, Matthew N; Gao, Lingling; Atkins, Craig A; Singh, Karam B
2015-01-01
Narrow-leafed lupin (NLL; Lupinus angustifolius L.) is an important grain legume crop that is valuable for sustainable farming and is becoming recognized as a human health food. NLL breeding is directed at improving grain production, disease resistance, drought tolerance and health benefits. However, genetic and genomic studies have been hindered by a lack of extensive genomic resources for the species. Here, the generation, de novo assembly and annotation of transcriptome datasets derived from five different NLL tissue types of the reference accession cv. Tanjil are described. The Tanjil transcriptome was compared to transcriptomes of an early domesticated cv. Unicrop, a wild accession P27255, as well as accession 83A:476, together being the founding parents of two recombinant inbred line (RIL) populations. In silico predictions for transcriptome-derived gene-based length and SNP polymorphic markers were conducted and corroborated using a survey assembly sequence for NLL cv. Tanjil. This yielded extensive indel and SNP polymorphic markers for the two RIL populations. A total of 335 transcriptome-derived markers and 66 BAC-end sequence-derived markers were evaluated, and 275 polymorphic markers were selected to genotype the reference NLL 83A:476 × P27255 RIL population. This significantly improved the completeness, marker density and quality of the reference NLL genetic map. PMID:25060816
Guivier, Emmanuel; Galan, Maxime; Malé, Pierre-Jean G; Kallio, Eva R; Voutilainen, Liina; Henttonen, Heikki; Olsson, Gert E; Lundkvist, Ake; Tersago, Katrien; Augot, Denis; Cosson, Jean-François; Charbonnel, Nathalie
2010-10-01
We analysed the influence of MHC class II Dqa and Drb genes on Puumala virus (PUUV) infection in bank voles (Myodes glareolus). We considered voles sampled in five European localities or derived from a previous experiment that showed variable infection success of PUUV. The genetic variation observed in the Dqa and Drb genes was assessed by using single-strand conformation polymorphism and pyrosequencing methods, respectively. Patterns were compared with those obtained from 13 microsatellites. We revealed significant genetic differentiation between PUUV-seronegative and -seropositive bank voles sampled in wild populations, at the Drb gene only. The absence of genetic differentiation observed at neutral microsatellites confirmed the important role of selective pressures in shaping these Drb patterns. Also, we found no significant associations between infection success and MHC alleles among laboratory-colonized bank voles, which is explained by a loss of genetic variability that occurred during the captivity of these voles.
Taylor, Steve M.; Antonia, Alejandro L.; Parobek, Christian M.; Juliano, Jonathan J.; Janko, Mark; Emch, Michael; Alam, Md Tauqeer; Udhayakumar, Venkatachalam; Tshefu, Antoinette K.; Meshnick, Steven R.
2013-01-01
Understanding the spatial clustering of Plasmodium falciparum populations can assist efforts to contain drug-resistant parasites and maintain the efficacy of future drugs. We sequenced single nucleotide polymorphisms (SNPs) in the dihydropteroate synthase gene (dhps) associated with sulfadoxine resistance and 5 microsatellite loci flanking dhps in order to investigate the genetic backgrounds, genetic relatedness, and geographic clustering of falciparum parasites in the Democratic Republic of the Congo (DRC). Resistant haplotypes were clustered into subpopulations: one in the northeast DRC, and the other in the balance of the DRC. Network and clonal lineage analyses of the flanking microsatellites indicate that geographically-distinct mutant dhps haplotypes derive from separate lineages. The DRC is therefore a watershed for haplotypes associated with sulfadoxine resistance. Given the importance of central Africa as a corridor for the spread of antimalarial resistance, the identification of the mechanisms of this transit can inform future policies to contain drug-resistant parasite strains. PMID:23372922
A microsatellite genetic linkage map of black rockfish ( Sebastes schlegeli)
NASA Astrophysics Data System (ADS)
Chu, Guannan; Jiang, Liming; He, Yan; Yu, Haiyang; Wang, Zhigang; Jiang, Haibin; Zhang, Quanqi
2014-12-01
Ovoviviparous black rockfish ( Sebastes schlegeli) is an important marine fish species for aquaculture and fisheries in China. Genetic information of this species is scarce because of the lack of microsatellite markers. In this study, a large number of microsatellite markers of black rockfish were isolated by constructing microsatellite-enriched libraries. Female- and male-specific genetic linkage maps were constructed using 435 microsatellite markers genotyped in a full-sib family of the fish species. The female linkage map contained 140 microsatellite markers, in which 23 linkage groups had a total genetic length of 1334.1 cM and average inter-marker space of 13.3 cM. The male linkage map contained 156 microsatellite markers, in which 25 linkage groups had a total genetic length of 1359.6 cM and average inter-marker distance of 12.4 cM. The genome coverage of the female and male linkage maps was 68.6% and 69.3%, respectively. The female-to-male ratio of the recombination rate was approximately 1.07:1 in adjacent microsatellite markers. This paper presents the first genetic linkage map of microsatellites in black rockfish. The collection of polymorphic markers and sex-specific linkage maps of black rockfish could be useful for further investigations on parental assignment, population genetics, quantitative trait loci mapping, and marker-assisted selection in related breeding programs.
CMD: a Cotton Microsatellite Database resource for Gossypium genomics
Blenda, Anna; Scheffler, Jodi; Scheffler, Brian; Palmer, Michael; Lacape, Jean-Marc; Yu, John Z; Jesudurai, Christopher; Jung, Sook; Muthukumar, Sriram; Yellambalase, Preetham; Ficklin, Stephen; Staton, Margaret; Eshelman, Robert; Ulloa, Mauricio; Saha, Sukumar; Burr, Ben; Liu, Shaolin; Zhang, Tianzhen; Fang, Deqiu; Pepper, Alan; Kumpatla, Siva; Jacobs, John; Tomkins, Jeff; Cantrell, Roy; Main, Dorrie
2006-01-01
Background The Cotton Microsatellite Database (CMD) is a curated and integrated web-based relational database providing centralized access to publicly available cotton microsatellites, an invaluable resource for basic and applied research in cotton breeding. Description At present CMD contains publication, sequence, primer, mapping and homology data for nine major cotton microsatellite projects, collectively representing 5,484 microsatellites. In addition, CMD displays data for three of the microsatellite projects that have been screened against a panel of core germplasm. The standardized panel consists of 12 diverse genotypes including genetic standards, mapping parents, BAC donors, subgenome representatives, unique breeding lines, exotic introgression sources, and contemporary Upland cottons with significant acreage. A suite of online microsatellite data mining tools are accessible at CMD. These include an SSR server which identifies microsatellites, primers, open reading frames, and GC-content of uploaded sequences; BLAST and FASTA servers providing sequence similarity searches against the existing cotton SSR sequences and primers, a CAP3 server to assemble EST sequences into longer transcripts prior to mining for SSRs, and CMap, a viewer for comparing cotton SSR maps. Conclusion The collection of publicly available cotton SSR markers in a centralized, readily accessible and curated web-enabled database provides a more efficient utilization of microsatellite resources and will help accelerate basic and applied research in molecular breeding and genetic mapping in Gossypium spp. PMID:16737546
Koi, Minoru; Tseng-Rogenski, Stephanie S; Carethers, John M
2018-01-15
Microsatellite alterations within genomic DNA frameshift as a result of defective DNA mismatch repair (MMR). About 15% of sporadic colorectal cancers (CRCs) manifest hypermethylation of the DNA MMR gene MLH1 , resulting in mono- and di-nucleotide frameshifts to classify it as microsatellite instability-high (MSI-H) and hypermutated, and due to frameshifts at coding microsatellites generating neo-antigens, produce a robust protective immune response that can be enhanced with immune checkpoint blockade. More commonly, approximately 50% of sporadic non-MSI-H CRCs demonstrate frameshifts at di- and tetra-nucleotide microsatellites to classify it as MSI-low/elevated microsatellite alterations at selected tetranucleotide repeats (EMAST) as a result of functional somatic inactivation of the DNA MMR protein MSH3 via a nuclear-to-cytosolic displacement. The trigger for MSH3 displacement appears to be inflammation and/or oxidative stress, and unlike MSI-H CRC patients, patients with MSI-L/EMAST CRCs show poor prognosis. These inflammatory-associated microsatellite alterations are a consequence of the local tumor microenvironment, and in theory, if the microenvironment is manipulated to lower inflammation, the microsatellite alterations and MSH3 dysfunction should be corrected. Here we describe the mechanisms and significance of inflammatory-associated microsatellite alterations, and propose three areas to deeply explore the consequences and prevention of inflammation's effect upon the DNA MMR system.
Koi, Minoru; Tseng-Rogenski, Stephanie S; Carethers, John M
2018-01-01
Microsatellite alterations within genomic DNA frameshift as a result of defective DNA mismatch repair (MMR). About 15% of sporadic colorectal cancers (CRCs) manifest hypermethylation of the DNA MMR gene MLH1, resulting in mono- and di-nucleotide frameshifts to classify it as microsatellite instability-high (MSI-H) and hypermutated, and due to frameshifts at coding microsatellites generating neo-antigens, produce a robust protective immune response that can be enhanced with immune checkpoint blockade. More commonly, approximately 50% of sporadic non-MSI-H CRCs demonstrate frameshifts at di- and tetra-nucleotide microsatellites to classify it as MSI-low/elevated microsatellite alterations at selected tetranucleotide repeats (EMAST) as a result of functional somatic inactivation of the DNA MMR protein MSH3 via a nuclear-to-cytosolic displacement. The trigger for MSH3 displacement appears to be inflammation and/or oxidative stress, and unlike MSI-H CRC patients, patients with MSI-L/EMAST CRCs show poor prognosis. These inflammatory-associated microsatellite alterations are a consequence of the local tumor microenvironment, and in theory, if the microenvironment is manipulated to lower inflammation, the microsatellite alterations and MSH3 dysfunction should be corrected. Here we describe the mechanisms and significance of inflammatory-associated microsatellite alterations, and propose three areas to deeply explore the consequences and prevention of inflammation’s effect upon the DNA MMR system. PMID:29375743
Uesugi, Noriyuki; Sugai, Tamotsu; Sugimoto, Ryo; Eizuka, Makoto; Fujita, Yasuko; Sato, Ayaka; Osakabe, Mitsumasa; Ishida, Kazuyuki; Koeda, Keisuke; Sasaki, Akira; Matsumoto, Takayuki
2017-10-01
The molecular alterations and pathological features of gastric papillary adenocarcinoma (GPA) remain unknown. We examined GPA samples and compared their molecular and pathological characteristics with those of gastric tubular adenocarcinoma (GTA). Additionally, we identified pathological and molecular features of GPA that vary with microsatellite stability. In the present study, samples from 63 GPA patients and 47 GTA patients were examined using a combination of polymerase chain reaction (PCR)-microsatellite assays and PCR-pyrosequencing in order to detect microsatellite instability (microsatellite instability, MSI; microsatellite stable, MSS), methylation status (low methylation, intermediate methylation and high methylation level), and chromosomal AI in multiple cancer-related loci. Additionally, the expression levels of TP53 and Her2 were evaluated using immunohistochemistry. GTA and GPA are statistically different in their frequency of pathological features, including mucinous, poorly differentiated and invasive micropapillary components. Clear genetic patterns differentiating GPA and GTA could not be identified with a hierarchical cluster analysis, but microsatellite stability was linked with TP53 and Her2 overexpression. Methylation status in GPA was also associated with the development of high microsatellite instability. However, no pathological differences were associated with microsatellite stability. We suggest that although molecular alterations in a subset of GPAs are closely associated with microsatellite stability, they play a minor role in GPA carcinogenesis. Copyright © 2017 Royal College of Pathologists of Australasia. Published by Elsevier B.V. All rights reserved.
Kejia Pang; Keith Woeste; Charles Michler
2017-01-01
A set of eight microsatellite markers was used to genotype 25 black walnut (Juglans nigra L.) clones within the Purdue University germplasm repository. The identities of 212 ramets were verified using the same eight microsatellite markers. Some trees were mislabeled and corrected as to clone using analysis of microsatellite markers. A genetic...
Gerchen, Jörn F.; Reichert, Samuel J.; Röhr, Johannes T.; Dieterich, Christoph; Kloas, Werner
2016-01-01
Large genome size, including immense repetitive and non-coding fractions, still present challenges for capacity, bioinformatics and thus affordability of whole genome sequencing in most amphibians. Here, we test the performance of a single transcriptome to understand whether it can provide a cost-efficient resource for species with large unknown genomes. Using RNA from six different tissues from a single Palearctic green toad (Bufo viridis) specimen and Hiseq2000, we obtained 22,5 Mio reads and publish >100,000 unigene sequences. To evaluate efficacy and quality, we first use this data to identify green toad specific candidate genes, known from other vertebrates for their role in sex determination and differentiation. Of a list of 37 genes, the transcriptome yielded 32 (87%), many of which providing the first such data for this non-model anuran species. However, for many of these genes, only fragments could be retrieved. In order to allow also applications to population genetics, we further used the transcriptome for the targeted development of 21 non-anonymous microsatellites and tested them in genetic families and backcrosses. Eleven markers were specifically developed to be located on the B. viridis sex chromosomes; for eight markers we can indeed demonstrate sex-specific transmission in genetic families. Depending on phylogenetic distance, several markers, which are sex-linked in green toads, show high cross-amplification success across the anuran phylogeny, involving nine systematic anuran families. Our data support the view that single transcriptome sequencing (based on multiple tissues) provides a reliable genomic resource and cost-efficient method for non-model amphibian species with large genome size and, despite limitations, should be considered as long as genome sequencing remains unaffordable for most species. PMID:27232626
Differentially expressed transcripts in stomach of Penaeus monodon in response to AHPND infection.
Soonthornchai, Wipasiri; Chaiyapechara, Sage; Klinbunga, Sirawut; Thongda, Wilawan; Tangphatsornruang, Sithichoke; Yoocha, Thippawan; Jarayabhand, Padermsak; Jiravanichpaisal, Pikul
2016-12-01
Acute Hepatopancreatic Necrosis Disease (AHPND) is an emerging disease in aquacultured shrimp caused by a pathogenic strain of Vibrio parahaemolyticus. As with several pathogenic bacteria, colonization of the stomach appeared to be the initial step of the infection for AHPND-causing Vibrio. To understand the immune responses in the stomach of black tiger shrimp (Penaeus monodon), differentially expressed transcripts (DETs) in the stomach during V. parahaemolyticus strain 3HP (VP3HP) infection was examined using Ion Torrent sequencing. From the total 42,998 contigs obtained, 1585 contigs representing 1513 unigenes were significantly differentially expressed with 1122 and 391 unigenes up- and down-regulated, respectively. Among the DETs, there were 141 immune-related unigenes in 10 functional categories: antimicrobial peptide, signal transduction pathway, proPO system, oxidative stress, proteinases/proteinase inhibitors, apoptotic tumor-related protein, pathogen recognition immune regulator, blood clotting system, adhesive protein and heat shock protein. Expression profiles of 20 of 22 genes inferred from RNA sequencing were confirmed with the results from qRT-PCR. Additionally, a novel isoform of anti-lipopolysaccharide factor, PmALF7 whose transcript was induced in the stomach after challenge with VP3HP was discovered. This study provided a fundamental information on the molecular response in the shrimp stomach during the AHPND infection that would be beneficial for future research. Copyright © 2016 Elsevier Ltd. All rights reserved.
Mahato, Ajay Kumar; Sharma, Nimisha; Singh, Akshay; Srivastav, Manish; Jaiprakash; Singh, Sanjay Kumar; Singh, Anand Kumar; Sharma, Tilak Raj; Singh, Nagendra Kumar
2016-01-01
Mango (Mangifera indica L.) is called "king of fruits" due to its sweetness, richness of taste, diversity, large production volume and a variety of end usage. Despite its huge economic importance genomic resources in mango are scarce and genetics of useful horticultural traits are poorly understood. Here we generated deep coverage leaf RNA sequence data for mango parental varieties 'Neelam', 'Dashehari' and their hybrid 'Amrapali' using next generation sequencing technologies. De-novo sequence assembly generated 27,528, 20,771 and 35,182 transcripts for the three genotypes, respectively. The transcripts were further assembled into a non-redundant set of 70,057 unigenes that were used for SSR and SNP identification and annotation. Total 5,465 SSR loci were identified in 4,912 unigenes with 288 type I SSR (n ≥ 20 bp). One hundred type I SSR markers were randomly selected of which 43 yielded PCR amplicons of expected size in the first round of validation and were designated as validated genic-SSR markers. Further, 22,306 SNPs were identified by aligning high quality sequence reads of the three mango varieties to the reference unigene set, revealing significantly enhanced SNP heterozygosity in the hybrid Amrapali. The present study on leaf RNA sequencing of mango varieties and their hybrid provides useful genomic resource for genetic improvement of mango.
Mahato, Ajay Kumar; Sharma, Nimisha; Singh, Akshay; Srivastav, Manish; Jaiprakash; Singh, Sanjay Kumar; Singh, Anand Kumar; Sharma, Tilak Raj; Singh, Nagendra Kumar
2016-01-01
Mango (Mangifera indica L.) is called “king of fruits” due to its sweetness, richness of taste, diversity, large production volume and a variety of end usage. Despite its huge economic importance genomic resources in mango are scarce and genetics of useful horticultural traits are poorly understood. Here we generated deep coverage leaf RNA sequence data for mango parental varieties ‘Neelam’, ‘Dashehari’ and their hybrid ‘Amrapali’ using next generation sequencing technologies. De-novo sequence assembly generated 27,528, 20,771 and 35,182 transcripts for the three genotypes, respectively. The transcripts were further assembled into a non-redundant set of 70,057 unigenes that were used for SSR and SNP identification and annotation. Total 5,465 SSR loci were identified in 4,912 unigenes with 288 type I SSR (n ≥ 20 bp). One hundred type I SSR markers were randomly selected of which 43 yielded PCR amplicons of expected size in the first round of validation and were designated as validated genic-SSR markers. Further, 22,306 SNPs were identified by aligning high quality sequence reads of the three mango varieties to the reference unigene set, revealing significantly enhanced SNP heterozygosity in the hybrid Amrapali. The present study on leaf RNA sequencing of mango varieties and their hybrid provides useful genomic resource for genetic improvement of mango. PMID:27736892
Leyva-Pérez, María de la O; Valverde-Corredor, Antonio; Valderrama, Raquel; Jiménez-Ruiz, Jaime; Muñoz-Merida, Antonio; Trelles, Oswaldo; Barroso, Juan Bautista; Mercado-Blanco, Jesús; Luque, Francisco
2015-02-01
Low temperature severely affects plant growth and development. To overcome this constraint, several plant species from regions having a cool season have evolved an adaptive response, called cold acclimation. We have studied this response in olive tree (Olea europaea L.) cv. Picual. Biochemical stress markers and cold-stress symptoms were detected after the first 24 h as sagging leaves. After 5 days, the plants were found to have completely recovered. Control and cold-stressed plants were sequenced by Illumina HiSeq 1000 paired-end technique. We also assembled a new olive transcriptome comprising 157,799 unigenes and found 6,309 unigenes differentially expressed in response to cold. Three types of response that led to cold acclimation were found: short-term transient response, early long-term response, and late long-term response. These subsets of unigenes were related to different biological processes. Early responses involved many cold-stress-responsive genes coding for, among many other things, C-repeat binding factor transcription factors, fatty acid desaturases, wax synthesis, and oligosaccharide metabolism. After long-term exposure to cold, a large proportion of gene down-regulation was found, including photosynthesis and plant growth genes. Up-regulated genes after long-term cold exposure were related to organelle fusion, nucleus organization, and DNA integration, including retrotransposons. © The Author 2014. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
Wang, Haibin; Jiang, Jiafu; Chen, Sumei; Qi, Xiangyu; Peng, Hui; Li, Pirui; Song, Aiping; Guan, Zhiyong; Fang, Weimin; Liao, Yuan; Chen, Fadi
2013-01-01
Background Simple sequence repeats (SSRs) are ubiquitous in eukaryotic genomes. Chrysanthemum is one of the largest genera in the Asteraceae family. Only few Chrysanthemum expressed sequence tag (EST) sequences have been acquired to date, so the number of available EST-SSR markers is very low. Methodology/Principal Findings Illumina paired-end sequencing technology produced over 53 million sequencing reads from C. nankingense mRNA. The subsequent de novo assembly yielded 70,895 unigenes, of which 45,789 (64.59%) unigenes showed similarity to the sequences in NCBI database. Out of 45,789 sequences, 107 have hits to the Chrysanthemum Nr protein database; 679 and 277 sequences have hits to the database of Helianthus and Lactuca species, respectively. MISA software identified a large number of putative EST-SSRs, allowing 1,788 primer pairs to be designed from the de novo transcriptome sequence and a further 363 from archival EST sequence. Among 100 primer pairs randomly chosen, 81 markers have amplicons and 20 are polymorphic for genotypes analysis in Chrysanthemum. The results showed that most (but not all) of the assays were transferable across species and that they exposed a significant amount of allelic diversity. Conclusions/Significance SSR markers acquired by transcriptome sequencing are potentially useful for marker-assisted breeding and genetic analysis in the genus Chrysanthemum and its related genera. PMID:23626799
De Novo Transcriptome Analysis for Kentucky Bluegrass Dwarf Mutants Induced by Space Mutation
Gan, Lu; Di, Rong; Chao, Yuehui; Han, Liebao; Chen, Xingwu; Wu, Chao; Yin, Shuxia
2016-01-01
Kentucky bluegrass (Poa pratensis L.) is a major cool-season turfgrass requiring frequent mowing. Utilization of cultivars with slow growth is a promising method to decrease mowing frequency. In this study, two dwarf mutant selections of Kentucky bluegrass (A12 and A16) induced by space mutation were analyzed for the differentially expressed genes compared with the wild type (WT) by the high-throughput RNA-Seq technology. 253,909 unigenes were obtained by de novo assembly. 24.20% of the unigenes had a significant level of amino acid sequence identity to Brachypodium distachyon proteins, followed by Hordeum vulgare with 18.72% among the non-redundant (NR) Blastx top hits. Assembled unigenes were associated with 32 pathways using KEGG orthology terms and their respective KEGG maps. Between WT and A16 libraries, 4,203 differentially expressed genes (DEGs) were identified, whereas there were 883 DEGs between WT and A12 libraries. Further investigation revealed that the DEG pathways were mainly involved in terpenoid biosynthesis and plant hormone metabolism, which might account for the differences of plant height and leaf blade color between dwarf mutant and WT plants. Our study presents the first comprehensive transcriptomic data and gene function analysis of Poa pratensis L., providing a valuable resource for future studies in plant dwarfing breeding and comparative genome analysis for Pooideae plants. PMID:27010560
Serial analysis of gene expression (SAGE) in normal human trabecular meshwork.
Liu, Yutao; Munro, Drew; Layfield, David; Dellinger, Andrew; Walter, Jeffrey; Peterson, Katherine; Rickman, Catherine Bowes; Allingham, R Rand; Hauser, Michael A
2011-04-08
To identify the genes expressed in normal human trabecular meshwork tissue, a tissue critical to the pathogenesis of glaucoma. Total RNA was extracted from human trabecular meshwork (HTM) harvested from 3 different donors. Extracted RNA was used to synthesize individual SAGE (serial analysis of gene expression) libraries using the I-SAGE Long kit from Invitrogen. Libraries were analyzed using SAGE 2000 software to extract the 17 base pair sequence tags. The extracted sequence tags were mapped to the genome using SAGE Genie map. A total of 298,834 SAGE tags were identified from all HTM libraries (96,842, 88,126, and 113,866 tags, respectively). Collectively, there were 107,325 unique tags. There were 10,329 unique tags with a minimum of 2 counts from a single library. These tags were mapped to known unique Unigene clusters. Approximately 29% of the tags (orphan tags) did not map to a known Unigene cluster. Thirteen percent of the tags mapped to at least 2 Unigene clusters. Sequence tags from many glaucoma-related genes, including myocilin, optineurin, and WD repeat domain 36, were identified. This is the first time SAGE analysis has been used to characterize the gene expression profile in normal HTM. SAGE analysis provides an unbiased sampling of gene expression of the target tissue. These data will provide new and valuable information to improve understanding of the biology of human aqueous outflow.
Huang, Haijiao; Chen, Su; Li, Huiyu; Jiang, Jing
2015-09-01
Overexpression of BpAP1 could cause early flowering in birch. BpAP1 affected the expression of many flowering-related unigenes and diterpenoid biosynthesis in transgenic birch, and BpPI was a putative target gene of BpAP1. APETALA1 (AP1) is an MADS-box transcription factor that is involved in the flowering process in plants and has been a focus of genetic studies examining flower development. Here, we carried out transcriptome analysis of birch (Betula platyphylla Suk.), including BpAP1 overexpression lines, BpAP1 suppression lines, and non-transgenic line (NT). Compared with NT, we detected 8302 and 7813 differentially expressed unigenes in 35S::BpAP1 and 35S::BpAP1RNAi transgenic lines, respectively. Overexpression and suppression of BpAP1 in birch affected diterpenoid biosynthesis and altered expression of many flowering-related unigenes. Moreover, combining information from the RNA-seq database and the birch genome, we predicted downstream target genes of BpAP1. Among the 166 putative target genes of BpAP1, there was a positive correlation between BpAP1 and BpPI. These results provide references for further examining the relationship between BpAP1 and its target genes, and reveal that BpAP1 functions as a transcription regulator in birch.
Gene discovery using next-generation pyrosequencing to develop ESTs for Phalaenopsis orchids
2011-01-01
Background Orchids are one of the most diversified angiosperms, but few genomic resources are available for these non-model plants. In addition to the ecological significance, Phalaenopsis has been considered as an economically important floriculture industry worldwide. We aimed to use massively parallel 454 pyrosequencing for a global characterization of the Phalaenopsis transcriptome. Results To maximize sequence diversity, we pooled RNA from 10 samples of different tissues, various developmental stages, and biotic- or abiotic-stressed plants. We obtained 206,960 expressed sequence tags (ESTs) with an average read length of 228 bp. These reads were assembled into 8,233 contigs and 34,630 singletons. The unigenes were searched against the NCBI non-redundant (NR) protein database. Based on sequence similarity with known proteins, these analyses identified 22,234 different genes (E-value cutoff, e-7). Assembled sequences were annotated with Gene Ontology, Gene Family and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways. Among these annotations, over 780 unigenes encoding putative transcription factors were identified. Conclusion Pyrosequencing was effective in identifying a large set of unigenes from Phalaenopsis. The informative EST dataset we developed constitutes a much-needed resource for discovery of genes involved in various biological processes in Phalaenopsis and other orchid species. These transcribed sequences will narrow the gap between study of model organisms with many genomic resources and species that are important for ecological and evolutionary studies. PMID:21749684
DOE Office of Scientific and Technical Information (OSTI.GOV)
Gao, Jian; Luo, Mao; Zhu, Ye
2015-03-27
Viola yedoensis Makino is an important Chinese traditional medicine plant adapted to cadmium (Cd) pollution regions. Illumina sequencing technology was used to sequence the transcriptome of V. yedoensis Makino. We sequenced Cd-treated (VIYCd) and untreated (VIYCK) samples of V. yedoensis, and obtained 100,410,834 and 83,587,676 high quality reads, respectively. After de novo assembly and quantitative assessment, 109,800 unigenes were finally generated with an average length of 661 bp. We then obtained functional annotations by aligning unigenes with public protein databases including NR, NT, SwissProt, KEGG and COG. In addition, 892 differentially expressed genes (DEGs) were investigated between the two libraries ofmore » untreated (VIYCK) and Cd-treated (VIYCd) plants. Moreover, 15 randomly selected DEGs were further validated with qRT-PCR and the results were highly accordant with the Solexa analysis. This study firstly generated a successful global analysis of the V. yedoensis transcriptome and it will provide for further studies on gene expression, genomics, and functional genomics in Violaceae. - Highlights: • A de novo assembly generated 109,800 unigenes and 5,4479 of them were annotated. • 31,285 could be classified into 26 COG categories. • 263 biosynthesis pathways were predicted and classified into five categories. • 892 DEGs were detected and 15 of them were validated by qRT-PCR.« less
Liu, Su; Rao, Xiang-Jun; Li, Mao-Ye; Feng, Ming-Feng; He, Meng-Zhu; Li, Shi-Guang
2015-03-01
We present the first antennal transcriptome sequencing information for the yellow mealworm beetle, Tenebrio molitor (Coleoptera: Tenebrionidae). Analysis of the transcriptome dataset obtained 52,216,616 clean reads, from which 35,363 unigenes were assembled. Of these, 18,820 unigenes showed significant similarity (E-value <10(-5)) to known proteins in the NCBI non-redundant protein database. Gene ontology (GO) and Cluster of Orthologous Groups (COG) analyses were used for functional classification of these unigenes. We identified 19 putative odorant-binding protein (OBP) genes, 12 chemosensory protein (CSP) genes, 20 olfactory receptor (OR) genes, 6 ionotropic receptor (IR) genes and 2 sensory neuron membrane protein (SNMP) genes. BLASTX best hit results indicated that these chemosensory genes were most identical to their respective orthologs from Tribolium castaneum. Phylogenetic analyses also revealed that the T. molitor OBPs and CSPs are closely related to those of T. castaneum. Real-time quantitative PCR assays showed that eight TmolOBP genes were antennae-specific. Of these, TmolOBP5, TmolOBP7 and TmolOBP16 were found to be predominantly expressed in male antennae, while TmolOBP17 was expressed mainly in the legs of males. Several other genes were identified that were neither tissue-specific nor sex-specific. These results establish a firm foundation for future studies of the chemosensory genes in T. molitor. Copyright © 2015 Elsevier Inc. All rights reserved.
Identification of genes associated with low furanocoumarin content in grapefruit.
Chen, Chunxian; Yu, Qibin; Wei, Xu; Cancalon, Paul F; Gmitter, Fred G
2014-10-01
Some furanocoumarins in grapefruit (Citrus paradisi) are associated with the so-called grapefruit juice effect. Previous phytochemical quantification and genetic analysis suggested that the synthesis of these furanocoumarins may be controlled by a single gene in the pathway. In this study, cDNA-amplified fragment length polymorphism (cDNA-AFLP) analysis of fruit tissues was performed to identify the candidate gene(s) likely associated with low furanocoumarin content in grapefruit. Fifteen tentative differentially expressed fragments were cloned through the cDNA-AFLP analysis of the grapefruit variety Foster and its spontaneous low-furanocoumarin mutant Low Acid Foster. Sequence analysis revealed a cDNA-AFLP fragment, Contig 6, was homologous to a substrate-proved psoralen synthase gene, CYP71A22, and was part of citrus unigenes Cit.3003 and Csi.1332, and predicted genes Ciclev10004717m in mandarin and orange1.1g041507m in sweet orange. The two predicted genes contained the highly conserved motifs at one of the substrate recognition sites of CYP71A22. Digital gene expression profile showed the unigenes were expressed only in fruit and seed. Quantitative real-time PCR also proved Contig 6 was down-regulated in Low Acid Foster. These results showed the differentially expressed Contig 6 was related to the reduced furanocoumarin levels in the mutant. The identified fragment, homologs, unigenes, and genes may facilitate further furanocoumarin genetic study and grapefruit variety improvement.
Guan, Bi-Cai; Gong, Xi; Zhou, Shi-Liang
2011-08-01
The development of compound microsatellite markers was conducted in Dysosma pleiantha to investigate genetic diversity and population genetic structure of this threatened medicinal plant. Using the compound microsatellite marker technique, 14 microsatellite markers that were successfully amplified showed polymorphism when tested on 38 individuals from three populations in eastern China. Overall, the number of alleles per locus ranged from 2 to 14, with an average of 7.71 alleles per locus. These results indicate that these microsatellite markers are adequate for detecting and characterizing population genetic structure and genetic diversity in Dysosma pleiantha.
Nguyen, Thao T.B.; Arimatsu, Yuji; Hong, Sung-Jong; Brindley, Paul J.; Blair, David; Laha, Thewarach; Sripa, Banchob
2015-01-01
Clonorchis sinensis is an important carcinogenic human liver fluke endemic in East and Southeast Asia. There are several conventional molecular markers have been used for identification and genetic diversity, however, no information about microsatellites of this liver fluke published so far. We here report microsatellite characterization and marker development for genetic diversity study in C. sinensis using genome-wide bioinformatics approach. Based on our search criteria, a total of 256,990 microsatellites (≥ 12 base pairs) were identified from genome database of C. sinensis with hexa-nucleotide motif being the most abundant (51%) followed by penta-nucleotide (18.3%) and tri-nucleotide (12.7%). The tetra-nucleotide, di-nucleotide and mononucleotide motifs accounted for 9.75 %, 7.63% and 0.14%, respectively. The total length of all microsatellites accounts for 0. 72 % of 547 Mb of the whole genome size and the frequency of microsatellites were found to be one microsatellite in every 2.13 kb of DNA. For the di-, tri, and tetra-nucleotide, the repeat numbers redundant are six (28%), four (45%) and three (76%), respectively. The ATC repeat is the most abundant microsatellites followed by AT, AAT and AC, respectively. Within 40 microsatellite loci developed, 24 microsatellite markers showed potential to differentiate between C. sinensis and O. viverrini. Seven out of 24 loci showed heterozygous with observed heterozygosity ranged from 0.467 to 1. Four-primer sets could amplify both C. sinensis and O. viverrini DNA with different sizes. This study provides basic information of C. sinensis microsatellites and the genome-wide markers developed may be a useful tool for genetic study of C. sinensis. PMID:25782682
Ramchiary, Nirala; Nguyen, Van Dan; Li, Xiaonan; Hong, Chang Pyo; Dhandapani, Vignesh; Choi, Su Ryun; Yu, Ge; Piao, Zhong Yun; Lim, Yong Pyo
2011-01-01
Genic microsatellite markers, also known as functional markers, are preferred over anonymous markers as they reveal the variation in transcribed genes among individuals. In this study, we developed a total of 707 expressed sequence tag-derived simple sequence repeat markers (EST-SSRs) and used for development of a high-density integrated map using four individual mapping populations of B. rapa. This map contains a total of 1426 markers, consisting of 306 EST-SSRs, 153 intron polymorphic markers, 395 bacterial artificial chromosome-derived SSRs (BAC-SSRs), and 572 public SSRs and other markers covering a total distance of 1245.9 cM of the B. rapa genome. Analysis of allelic diversity in 24 B. rapa germplasm using 234 mapped EST-SSR markers showed amplification of 2 alleles by majority of EST-SSRs, although amplification of alleles ranging from 2 to 8 was found. Transferability analysis of 167 EST-SSRs in 35 species belonging to cultivated and wild brassica relatives showed 42.51% (Sysimprium leteum) to 100% (B. carinata, B. juncea, and B. napus) amplification. Our newly developed EST-SSRs and high-density linkage map based on highly transferable genic markers would facilitate the molecular mapping of quantitative trait loci and the positional cloning of specific genes, in addition to marker-assisted selection and comparative genomic studies of B. rapa with other related species. PMID:21768136
Informative genomic microsatellite markers for efficient genotyping applications in sugarcane.
Parida, Swarup K; Kalia, Sanjay K; Kaul, Sunita; Dalal, Vivek; Hemaprabha, G; Selvi, Athiappan; Pandit, Awadhesh; Singh, Archana; Gaikwad, Kishor; Sharma, Tilak R; Srivastava, Prem Shankar; Singh, Nagendra K; Mohapatra, Trilochan
2009-01-01
Genomic microsatellite markers are capable of revealing high degree of polymorphism. Sugarcane (Saccharum sp.), having a complex polyploid genome requires more number of such informative markers for various applications in genetics and breeding. With the objective of generating a large set of microsatellite markers designated as Sugarcane Enriched Genomic MicroSatellite (SEGMS), 6,318 clones from genomic libraries of two hybrid sugarcane cultivars enriched with 18 different microsatellite repeat-motifs were sequenced to generate 4.16 Mb high-quality sequences. Microsatellites were identified in 1,261 of the 5,742 non-redundant clones that accounted for 22% enrichment of the libraries. Retro-transposon association was observed for 23.1% of the identified microsatellites. The utility of the microsatellite containing genomic sequences were demonstrated by higher primer designing potential (90%) and PCR amplification efficiency (87.4%). A total of 1,315 markers including 567 class I microsatellite markers were designed and placed in the public domain for unrestricted use. The level of polymorphism detected by these markers among sugarcane species, genera, and varieties was 88.6%, while cross-transferability rate was 93.2% within Saccharum complex and 25% to cereals. Cloning and sequencing of size variant amplicons revealed that the variation in the number of repeat-units was the main source of SEGMS fragment length polymorphism. High level of polymorphism and wide range of genetic diversity (0.16-0.82 with an average of 0.44) assayed with the SEGMS markers suggested their usefulness in various genotyping applications in sugarcane.
Bonatelli, Isabel A S; Carstens, Bryan C; Moraes, Evandro M
2015-01-01
Microsatellite markers (also known as SSRs, Simple Sequence Repeats) are widely used in plant science and are among the most informative molecular markers for population genetic investigations, but the development of such markers presents substantial challenges. In this report, we discuss how next generation sequencing can replace the cloning, Sanger sequencing, identification of polymorphic loci, and testing cross-amplification that were previously required to develop microsatellites. We report the development of a large set of microsatellite markers for five species of the Neotropical cactus genus Pilosocereus using a restriction-site-associated DNA sequencing (RAD-seq) on a Roche 454 platform. We identified an average of 165 microsatellites per individual, with the absolute numbers across individuals proportional to the sequence reads obtained per individual. Frequency distribution of the repeat units was similar in the five species, with shorter motifs such as di- and trinucleotide being the most abundant repeats. In addition, we provide 72 microsatellites that could be potentially amplified in the sampled species and 22 polymorphic microsatellites validated in two populations of the species Pilosocereus machrisii. Although low coverage sequencing among individuals was observed for most of the loci, which we suggest to be more related to the nature of the microsatellite markers and the possible bias inserted by the restriction enzymes than to the genome size, our work demonstrates that an NGS approach is an efficient method to isolate multispecies microsatellites even in non-model organisms.
Bonatelli, Isabel A. S.; Carstens, Bryan C.; Moraes, Evandro M.
2015-01-01
Microsatellite markers (also known as SSRs, Simple Sequence Repeats) are widely used in plant science and are among the most informative molecular markers for population genetic investigations, but the development of such markers presents substantial challenges. In this report, we discuss how next generation sequencing can replace the cloning, Sanger sequencing, identification of polymorphic loci, and testing cross-amplification that were previously required to develop microsatellites. We report the development of a large set of microsatellite markers for five species of the Neotropical cactus genus Pilosocereus using a restriction-site-associated DNA sequencing (RAD-seq) on a Roche 454 platform. We identified an average of 165 microsatellites per individual, with the absolute numbers across individuals proportional to the sequence reads obtained per individual. Frequency distribution of the repeat units was similar in the five species, with shorter motifs such as di- and trinucleotide being the most abundant repeats. In addition, we provide 72 microsatellites that could be potentially amplified in the sampled species and 22 polymorphic microsatellites validated in two populations of the species Pilosocereus machrisii. Although low coverage sequencing among individuals was observed for most of the loci, which we suggest to be more related to the nature of the microsatellite markers and the possible bias inserted by the restriction enzymes than to the genome size, our work demonstrates that an NGS approach is an efficient method to isolate multispecies microsatellites even in non-model organisms. PMID:26561396
Regidor-Cerrillo, Javier; Díez-Fuertes, Francisco; García-Culebras, Alicia; Moore, Dadín P.; González-Warleta, Marta; Cuevas, Carmen; Schares, Gereon; Katzer, Frank; Pedraza-Díaz, Susana; Mezo, Mercedes; Ortega-Mora, Luis M.
2013-01-01
The cyst-forming protozoan parasite Neospora caninum is one of the main causes of bovine abortion worldwide and is of great economic importance in the cattle industry. Recent studies have revealed extensive genetic variation among N . caninum isolates based on microsatellite sequences (MSs). MSs may be suitable molecular markers for inferring the diversity of parasite populations, molecular epidemiology and the basis for phenotypic variations in N . caninum , which have been poorly defined. In this study, we evaluated nine MS markers using a panel of 11 N . caninum -derived reference isolates from around the world and 96 N . caninum bovine clinical samples and one ovine clinical sample collected from four countries on two continents, including Spain, Argentina, Germany and Scotland, over a 10-year period. These markers were used as molecular tools to investigate the genetic diversity, geographic distribution and population structure of N . caninum . Multilocus microsatellite genotyping based on 7 loci demonstrated high levels of genetic diversity in the samples from all of the different countries, with 96 microsatellite multilocus genotypes (MLGs) identified from 108 N . caninum samples. Geographic sub-structuring was present in the country populations according to pairwise F ST. Principal component analysis (PCA) and Neighbor Joining tree topologies also suggested MLG segregation partially associated with geographical origin. An analysis of the MLG relationships, using eBURST, confirmed that the close genetic relationship observed between the Spanish and Argentinean populations may be the result of parasite migration (i.e., the introduction of novel MLGs from Spain to South America) due to cattle movement. The eBURST relationships also revealed genetically different clusters associated with the abortion. The presence of linkage disequilibrium, the co-existence of specific MLGs to individual farms and eBURST MLG relationships suggest a predominant clonal propagation for Spanish N . caninum MLGs in cattle. PMID:23940816
Micro-satellite constellations for monitoring cryospheric processes and related natural hazards
NASA Astrophysics Data System (ADS)
Kaeaeb, A.; Altena, B.; Mascaro, J.
2016-12-01
Currently, several micro-satellite constellations for earth-observation are planned or under build-up. Here, we assess the potential of the well-advanced Planet satellite constellation for investigating cryospheric processes. In its final stage, the Planet constellation will consist of 150 free-flying micro-satellites in near-polar and ISS orbits. The instruments carry RGB+NIR frame cameras that image the Earth surface in nadir direction with resolutions of 3-5 m, covering 20 x 13 km per image. In its final set-up, the constellation will be able to image the (almost) entire land surface at least once per day, under the limitation of cloud cover. Here, we explore new possibilities for insight into cryospheric processes that this very high repeat cycle combined with high image resolution offer. Based on repeat Planet imagery we derive repeat glacier velocity fields for example glaciers in the northern and southern hemispheres. We find it especially useful to monitor the ice velocities near calving fronts and simultaneously detect changes of the front, pointing to calving events. We also explore deformation fields over creeping mountain permafrost, so-called rockglaciers. As a second, very promising cryospheric application we suggest monitoring of glacier and permafrost related natural hazards. In cases such as temporary lakes, lake outbursts, landslides, rock avalanches, visual information over remote areas and at high frequencies are crucial for hazard assessment, early warning or disaster management. Based on several examples, we demonstrate that massive micro-satellite constellations such Planet's are exactly able to provide this type of information. As a third promising example, we show how such high-repeat optical satellite data are useful to monitor river ice and related jams and flooding. At certain latitudes, the repeat frequency of the data is even high enough to track river ice floes and thus water velocities.
Smith, S; Joss, T; Stow, A
2011-10-01
The analysis of microsatellite loci has allowed significant advances in evolutionary biology and pest management. However, until very recently, the potential benefits have been compromised by the high costs of developing these neutral markers. High-throughput sequencing provides a solution to this problem. We describe the development of 13 microsatellite markers for the eusocial ambrosia beetle, Austroplatypus incompertus, a significant pest of forests in southeast Australia. The frequency of microsatellite repeats in the genome of A. incompertus was determined to be low, and previous attempts at microsatellite isolation using a traditional genomic library were problematic. Here, we utilised two protocols, microsatellite-enriched genomic library construction and high-throughput 454 sequencing and characterised 13 loci which were polymorphic among 32 individuals. Numbers of alleles per locus ranged from 2 to 17, and observed and expected heterozygosities from 0.344 to 0.767 and from 0.507 to 0.860, respectively. These microsatellites have the resolution required to analyse fine-scale colony and population genetic structure. Our work demonstrates the utility of next-generation 454 sequencing as a method for rapid and cost-effective acquisition of microsatellites where other techniques have failed, or for taxa where marker development has historically been both complicated and expensive.
Vianna, Juliana A.; Noll, Daly; Mura-Jornet, Isidora; Valenzuela-Guerra, Paulina; González-Acuña, Daniel; Navarro, Cristell; Loyola, David E.; Dantas, Gisele P. M.
2017-01-01
Abstract Microsatellites are valuable molecular markers for evolutionary and ecological studies. Next generation sequencing is responsible for the increasing number of microsatellites for non-model species. Penguins of the Pygoscelis genus are comprised of three species: Adélie (P. adeliae), Chinstrap (P. antarcticus) and Gentoo penguin (P. papua), all distributed around Antarctica and the sub-Antarctic. The species have been affected differently by climate change, and the use of microsatellite markers will be crucial to monitor population dynamics. We characterized a large set of genome-wide microsatellites and evaluated polymorphisms in all three species. SOLiD reads were generated from the libraries of each species, identifying a large amount of microsatellite loci: 33,677, 35,265 and 42,057 for P. adeliae, P. antarcticus and P. papua, respectively. A large number of dinucleotide (66,139), trinucleotide (29,490) and tetranucleotide (11,849) microsatellites are described. Microsatellite abundance, diversity and orthology were characterized in penguin genomes. We evaluated polymorphisms in 170 tetranucleotide loci, obtaining 34 polymorphic loci in at least one species and 15 polymorphic loci in all three species, which allow to perform comparative studies. Polymorphic markers presented here enable a number of ecological, population, individual identification, parentage and evolutionary studies of Pygoscelis, with potential use in other penguin species. PMID:28898354
Carroll, E. L.; Baker, C. S.; Watson, M.; Alderman, R.; Bannister, J.; Gaggiotti, O. E.; Gröcke, D. R.; Patenaude, N.; Harcourt, R.
2015-01-01
Fidelity to migratory destinations is an important driver of connectivity in marine and avian species. Here we assess the role of maternally directed learning of migratory habitats, or migratory culture, on the population structure of the endangered Australian and New Zealand southern right whale. Using DNA profiles, comprising mitochondrial DNA (mtDNA) haplotypes (500 bp), microsatellite genotypes (17 loci) and sex from 128 individually-identified whales, we find significant differentiation among winter calving grounds based on both mtDNA haplotype (FST = 0.048, ΦST = 0.109, p < 0.01) and microsatellite allele frequencies (FST = 0.008, p < 0.01), consistent with long-term fidelity to calving areas. However, most genetic comparisons of calving grounds and migratory corridors were not significant, supporting the idea that whales from different calving grounds mix in migratory corridors. Furthermore, we find a significant relationship between δ13C stable isotope profiles of 66 Australian southern right whales, a proxy for feeding ground location, and both mtDNA haplotypes and kinship inferred from microsatellite-based estimators of relatedness. This indicates migratory culture may influence genetic structure on feeding grounds. This fidelity to migratory destinations is likely to influence population recovery, as long-term estimates of historical abundance derived from estimates of genetic diversity indicate the South Pacific calving grounds remain at <10% of pre-whaling abundance. PMID:26548756
García, Graciela; Ríos, Néstor; Gutiérrez, Verónica; Varela, Jorge Guerra; Bouza Fernández, Carmen; Pardo, Belén Gómez; Portela, Paulino Martínez
2014-01-01
The present paper integrates phylogenetic and population genetics analyses based on mitochondrial and nuclear molecular markers in silversides, genus Odontesthes, from a non-sampled area in the SW Atlantic Ocean to address species discrimination and to define Managements Units for sustainable conservation. All phylogenetic analyses based on the COI mitochondrial gene were consistent to support the monophyly of the genus Odontesthes and to include O. argentinensis, O. perugiae-humensis and some O. bonariensis haplotypes in a basal polytomy conforming a major derivative clade. Microsatellites data revealed somewhat higher genetic variability values in the O. argentinensis-perugia populations than in O. bonariensis and O. perugia-humensis taxa. Contrasting population genetics structuring emerged from mitochondrial and microsatellites analyses in these taxa. Whereas mitochondrial data supported two major groups (O. argentinensis-perugia-humensis vs. O. bonariensis-perugiae-humensis populations), microsatellite data detected three major genetic entities represented by O. bonariensis, O. perugiae-humensis and an admixture of populations belonging to O. argentinensis-perugiae respectively. Therefore, the star COI polytomy in the tree topology involving these taxa could be interpreted by several hypothetic scenarios such as the existence of shared ancestral polymorphisms, incomplete lineage sorting in a radiating speciation process and/or reticulation events. Present findings support that promiscuous and recent contact between incipient species sharing asymmetric gene flow exchanges, blurs taxa boundaries yielding complicated taxonomy and Management Units delimitation in silverside genus Odontesthes from SW Atlantic Ocean basins. PMID:25126842
García, Graciela; Ríos, Néstor; Gutiérrez, Verónica; Varela, Jorge Guerra; Bouza Fernández, Carmen; Pardo, Belén Gómez; Portela, Paulino Martínez
2014-01-01
The present paper integrates phylogenetic and population genetics analyses based on mitochondrial and nuclear molecular markers in silversides, genus Odontesthes, from a non-sampled area in the SW Atlantic Ocean to address species discrimination and to define Managements Units for sustainable conservation. All phylogenetic analyses based on the COI mitochondrial gene were consistent to support the monophyly of the genus Odontesthes and to include O. argentinensis, O. perugiae-humensis and some O. bonariensis haplotypes in a basal polytomy conforming a major derivative clade. Microsatellites data revealed somewhat higher genetic variability values in the O. argentinensis-perugia populations than in O. bonariensis and O. perugia-humensis taxa. Contrasting population genetics structuring emerged from mitochondrial and microsatellites analyses in these taxa. Whereas mitochondrial data supported two major groups (O. argentinensis-perugia-humensis vs. O. bonariensis-perugiae-humensis populations), microsatellite data detected three major genetic entities represented by O. bonariensis, O. perugiae-humensis and an admixture of populations belonging to O. argentinensis-perugiae respectively. Therefore, the star COI polytomy in the tree topology involving these taxa could be interpreted by several hypothetic scenarios such as the existence of shared ancestral polymorphisms, incomplete lineage sorting in a radiating speciation process and/or reticulation events. Present findings support that promiscuous and recent contact between incipient species sharing asymmetric gene flow exchanges, blurs taxa boundaries yielding complicated taxonomy and Management Units delimitation in silverside genus Odontesthes from SW Atlantic Ocean basins.
Hodel, Richard G. J.; Segovia-Salcedo, M. Claudia; Landis, Jacob B.; Crowl, Andrew A.; Sun, Miao; Liu, Xiaoxian; Gitzendanner, Matthew A.; Douglas, Norman A.; Germain-Aubrey, Charlotte C.; Chen, Shichao; Soltis, Douglas E.; Soltis, Pamela S.
2016-01-01
Microsatellites, or simple sequence repeats (SSRs), have long played a major role in genetic studies due to their typically high polymorphism. They have diverse applications, including genome mapping, forensics, ascertaining parentage, population and conservation genetics, identification of the parentage of polyploids, and phylogeography. We compare SSRs and newer methods, such as genotyping by sequencing (GBS) and restriction site associated DNA sequencing (RAD-Seq), and offer recommendations for researchers considering which genetic markers to use. We also review the variety of techniques currently used for identifying microsatellite loci and developing primers, with a particular focus on those that make use of next-generation sequencing (NGS). Additionally, we review software for microsatellite development and report on an experiment to assess the utility of currently available software for SSR development. Finally, we discuss the future of microsatellites and make recommendations for researchers preparing to use microsatellites. We argue that microsatellites still have an important place in the genomic age as they remain effective and cost-efficient markers. PMID:27347456
No clustering for linkage map based on low-copy and undermethylated microsatellites.
Zhou, Yi; Gwaze, David P; Reyes-Valdés, M Humberto; Bui, Thomas; Williams, Claire G
2003-10-01
Clustering has been reported for conifer genetic maps based on hypomethylated or low-copy molecular markers, resulting in uneven marker distribution. To test this, a framework genetic map was constructed from three types of microsatellites: low-copy, undermethylated, and genomic. These Pinus taeda L. microsatellites were mapped using a three-generation pedigree with 118 progeny. The microsatellites were highly informative; of the 32 markers in intercross configuration, 29 were segregating for three or four alleles in the progeny. The sex-averaged map placed 51 of the 95 markers in 15 linkage groups at LOD > 4.0. No clustering or uneven distribution across the genome was observed. The three types of P. taeda microsatellites were randomly dispersed within each linkage group. The 51 microsatellites covered a map distance of 795 cM, an average distance of 21.8 cM between markers, roughly half of the estimated total map length. The minimum and maximum distances between any two bins was 4.4 and 45.3 cM, respectively. These microsatellites provided anchor points for framework mapping for polymorphism in P. taeda and other closely related hard pines.
Lewis, Emily M; Fant, Jeremie B; Moore, Michael J; Hastings, Amy P; Larson, Erica L; Agrawal, Anurag A; Skogen, Krissa A
2016-02-01
Eleven nuclear and four plastid microsatellite markers were screened for two gypsum endemic species, Oenothera gayleana and O. hartwegii subsp. filifolia, and tested for cross-amplification in the remaining 11 taxa within Oenothera sect. Calylophus (Onagraceae). Microsatellite markers were tested in two to three populations spanning the ranges of both O. gayleana and O. hartwegii subsp. filifolia. The nuclear microsatellite loci consisted of both di- and trinucleotide repeats with one to 17 alleles per population. Several loci showed significant deviation from Hardy-Weinberg equilibrium, which may be evidence of chromosomal rings. The plastid microsatellite markers identified one to seven haplotypes per population. The transferability of these markers was confirmed in all 11 taxa within Oenothera sect. Calylophus. The microsatellite loci characterized here are the first developed and tested in Oenothera sect. Calylophus. These markers will be used to assess whether pollinator foraging distance influences population genetic parameters in predictable ways.
Witherup, Colby; Ragone, Diane; Wiesner-Hanks, Tyr; Irish, Brian; Scheffler, Brian; Simpson, Sheron; Zee, Francis; Zuberi, M Iqbal; Zerega, Nyree J C
2013-07-01
Microsatellite loci were isolated and characterized from enriched genomic libraries of Artocarpus altilis (breadfruit) and tested in four Artocarpus species and one hybrid. The microsatellite markers provide new tools for further studies in Artocarpus. • A total of 25 microsatellite loci were evaluated across four Artocarpus species and one hybrid. Twenty-one microsatellite loci were evaluated on A. altilis (241), A. camansi (34), A. mariannensis (15), and A. altilis × mariannensis (64) samples. Nine of those loci plus four additional loci were evaluated on A. heterophyllus (jackfruit, 426) samples. All loci are polymorphic for at least one species. The average number of alleles ranges from two to nine within taxa. • These microsatellite primers will facilitate further studies on the genetic structure and evolutionary and domestication history of Artocarpus species. They will aid in cultivar identification and establishing germplasm conservation strategies for breadfruit and jackfruit.
Brenner, Eric D; Katari, Manpreet S; Stevenson, Dennis W; Rudd, Stephen A; Douglas, Andrew W; Moss, Walter N; Twigg, Richard W; Runko, Suzan J; Stellari, Giulia M; McCombie, WR; Coruzzi, Gloria M
2005-01-01
Background Ginkgo biloba L. is the only surviving member of one of the oldest living seed plant groups with medicinal, spiritual and horticultural importance worldwide. As an evolutionary relic, it displays many characters found in the early, extinct seed plants and extant cycads. To establish a molecular base to understand the evolution of seeds and pollen, we created a cDNA library and EST dataset from the reproductive structures of male (microsporangiate), female (megasporangiate), and vegetative organs (leaves) of Ginkgo biloba. Results RNA from newly emerged male and female reproductive organs and immature leaves was used to create three distinct cDNA libraries from which 6,434 ESTs were generated. These 6,434 ESTs from Ginkgo biloba were clustered into 3,830 unigenes. A comparison of our Ginkgo unigene set against the fully annotated genomes of rice and Arabidopsis, and all available ESTs in Genbank revealed that 256 Ginkgo unigenes match only genes among the gymnosperms and non-seed plants – many with multiple matches to genes in non-angiosperm plants. Conversely, another group of unigenes in Gingko had highly significant homology to transcription factors in angiosperms involved in development, including MADS box genes as well as post-transcriptional regulators. Several of the conserved developmental genes found in Ginkgo had top BLAST homology to cycad genes. We also note here the presence of ESTs in G. biloba similar to genes that to date have only been found in gymnosperms and an additional 22 Ginkgo genes common only to genes from cycads. Conclusion Our analysis of an EST dataset from G. biloba revealed genes potentially unique to gymnosperms. Many of these genes showed homology to fully sequenced clones from our cycad EST dataset found in common only with gymnosperms. Other Ginkgo ESTs are similar to developmental regulators in higher plants. This work sets the stage for future studies on Ginkgo to better understand seed and pollen evolution, and to resolve the ambiguous phylogenetic relationship of G. biloba among the gymnosperms. PMID:16225698
Zhang, Min; Zhou, Yuwen; Wang, Hui; Jones, Huw; Gao, Qiang; Wang, Dahai; Ma, Youzhi; Xia, Lanqin
2013-08-16
The grain aphid (Sitobion avenae F.) is a major agricultural pest which causes significant yield losses of wheat in China, Europe and North America annually. Transcriptome profiling of the grain aphid alimentary canal after feeding on wheat plants could provide comprehensive gene expression information involved in feeding, ingestion and digestion. Furthermore, selection of aphid-specific RNAi target genes would be essential for utilizing a plant-mediated RNAi strategy to control aphids via a non-toxic mode of action. However, due to the tiny size of the alimentary canal and lack of genomic information on grain aphid as a whole, selection of the RNAi targets is a challenging task that as far as we are aware, has never been documented previously. In this study, we performed de novo transcriptome assembly and gene expression analyses of the alimentary canals of grain aphids before and after feeding on wheat plants using Illumina RNA sequencing. The transcriptome profiling generated 30,427 unigenes with an average length of 664 bp. Furthermore, comparison of the transcriptomes of alimentary canals of pre- and post feeding grain aphids indicated that 5490 unigenes were differentially expressed, among which, diverse genes and/or pathways were identified and annotated. Based on the RPKM values of these unigenes, 16 of them that were significantly up or down-regulated upon feeding were selected for dsRNA artificial feeding assay. Of these, 5 unigenes led to higher mortality and developmental stunting in an artificial feeding assay due to the down-regulation of the target gene expression. Finally, by adding fluorescently labelled dsRNA into the artificial diet, the spread of fluorescence signal in the whole body tissues of grain aphid was observed. Comparison of the transcriptome profiles of the alimentary canals of pre- and post-feeding grain aphids on wheat plants provided comprehensive gene expression information that could facilitate our understanding of the molecular mechanisms underlying feeding, ingestion and digestion. Furthermore, five novel and effective potential RNAi target genes were identified in grain aphid for the first time. This finding would provide a fundamental basis for aphid control in wheat through plant mediated RNAi strategy.
2012-01-01
Background High-resolution genetic maps are needed in many crops to help characterize the genetic diversity that determines agriculturally important traits. Hybridization to microarrays to detect single feature polymorphisms is a powerful technique for marker discovery and genotyping because of its highly parallel nature. However, microarrays designed for gene expression analysis rarely provide sufficient gene coverage for optimal detection of nucleotide polymorphisms, which limits utility in species with low rates of polymorphism such as lettuce (Lactuca sativa). Results We developed a 6.5 million feature Affymetrix GeneChip® for efficient polymorphism discovery and genotyping, as well as for analysis of gene expression in lettuce. Probes on the microarray were designed from 26,809 unigenes from cultivated lettuce and an additional 8,819 unigenes from four related species (L. serriola, L. saligna, L. virosa and L. perennis). Where possible, probes were tiled with a 2 bp stagger, alternating on each DNA strand; providing an average of 187 probes covering approximately 600 bp for each of over 35,000 unigenes; resulting in up to 13 fold redundancy in coverage per nucleotide. We developed protocols for hybridization of genomic DNA to the GeneChip® and refined custom algorithms that utilized coverage from multiple, high quality probes to detect single position polymorphisms in 2 bp sliding windows across each unigene. This allowed us to detect greater than 18,000 polymorphisms between the parental lines of our core mapping population, as well as numerous polymorphisms between cultivated lettuce and wild species in the lettuce genepool. Using marker data from our diversity panel comprised of 52 accessions from the five species listed above, we were able to separate accessions by species using both phylogenetic and principal component analyses. Additionally, we estimated the diversity between different types of cultivated lettuce and distinguished morphological types. Conclusion By hybridizing genomic DNA to a custom oligonucleotide array designed for maximum gene coverage, we were able to identify polymorphisms using two approaches for pair-wise comparisons, as well as a highly parallel method that compared all 52 genotypes simultaneously. PMID:22583801
Stoffel, Kevin; van Leeuwen, Hans; Kozik, Alexander; Caldwell, David; Ashrafi, Hamid; Cui, Xinping; Tan, Xiaoping; Hill, Theresa; Reyes-Chin-Wo, Sebastian; Truco, Maria-Jose; Michelmore, Richard W; Van Deynze, Allen
2012-05-14
High-resolution genetic maps are needed in many crops to help characterize the genetic diversity that determines agriculturally important traits. Hybridization to microarrays to detect single feature polymorphisms is a powerful technique for marker discovery and genotyping because of its highly parallel nature. However, microarrays designed for gene expression analysis rarely provide sufficient gene coverage for optimal detection of nucleotide polymorphisms, which limits utility in species with low rates of polymorphism such as lettuce (Lactuca sativa). We developed a 6.5 million feature Affymetrix GeneChip® for efficient polymorphism discovery and genotyping, as well as for analysis of gene expression in lettuce. Probes on the microarray were designed from 26,809 unigenes from cultivated lettuce and an additional 8,819 unigenes from four related species (L. serriola, L. saligna, L. virosa and L. perennis). Where possible, probes were tiled with a 2 bp stagger, alternating on each DNA strand; providing an average of 187 probes covering approximately 600 bp for each of over 35,000 unigenes; resulting in up to 13 fold redundancy in coverage per nucleotide. We developed protocols for hybridization of genomic DNA to the GeneChip® and refined custom algorithms that utilized coverage from multiple, high quality probes to detect single position polymorphisms in 2 bp sliding windows across each unigene. This allowed us to detect greater than 18,000 polymorphisms between the parental lines of our core mapping population, as well as numerous polymorphisms between cultivated lettuce and wild species in the lettuce genepool. Using marker data from our diversity panel comprised of 52 accessions from the five species listed above, we were able to separate accessions by species using both phylogenetic and principal component analyses. Additionally, we estimated the diversity between different types of cultivated lettuce and distinguished morphological types. By hybridizing genomic DNA to a custom oligonucleotide array designed for maximum gene coverage, we were able to identify polymorphisms using two approaches for pair-wise comparisons, as well as a highly parallel method that compared all 52 genotypes simultaneously.
Gimenes, Marcos A; Hoshino, Andrea A; Barbosa, Andrea VG; Palmieri, Dario A; Lopes, Catalina R
2007-01-01
Background The genus Arachis includes Arachis hypogaea (cultivated peanut) and wild species that are used in peanut breeding or as forage. Molecular markers have been employed in several studies of this genus, but microsatellite markers have only been used in few investigations. Microsatellites are very informative and are useful to assess genetic variability, analyze mating systems and in genetic mapping. The objectives of this study were to develop A. hypogaea microsatellite loci and to evaluate the transferability of these markers to other Arachis species. Results Thirteen loci were isolated and characterized using 16 accessions of A. hypogaea. The level of variation found in A. hypogaea using microsatellites was higher than with other markers. Cross-transferability of the markers was also high. Sequencing of the fragments amplified using the primer pair Ah11 from 17 wild Arachis species showed that almost all wild species had similar repeated sequence to the one observed in A. hypogaea. Sequence data suggested that there is no correlation between taxonomic relationship of a wild species to A. hypogaea and the number of repeats found in its microsatellite loci. Conclusion These results show that microsatellite primer pairs from A. hypogaea have multiple uses. A higher level of variation among A. hypogaea accessions can be detected using microsatellite markers in comparison to other markers, such as RFLP, RAPD and AFLP. The microsatellite primers of A. hypogaea showed a very high rate of transferability to other species of the genus. These primer pairs provide important tools to evaluate the genetic variability and to assess the mating system in Arachis species. PMID:17326826
Microsatellites as targets of natural selection.
Haasl, Ryan J; Payseur, Bret A
2013-02-01
The ability to survey polymorphism on a genomic scale has enabled genome-wide scans for the targets of natural selection. Theory that connects patterns of genetic variation to evidence of natural selection most often assumes a diallelic locus and no recurrent mutation. Although these assumptions are suitable to selection that targets single nucleotide variants, fundamentally different types of mutation generate abundant polymorphism in genomes. Moreover, recent empirical results suggest that mutationally complex, multiallelic loci including microsatellites and copy number variants are sometimes targeted by natural selection. Given their abundance, the lack of inference methods tailored to the mutational peculiarities of these types of loci represents a notable gap in our ability to interrogate genomes for signatures of natural selection. Previous theoretical investigations of mutation-selection balance at multiallelic loci include assumptions that limit their application to inference from empirical data. Focusing on microsatellites, we assess the dynamics and population-level consequences of selection targeting mutationally complex variants. We develop general models of a multiallelic fitness surface, a realistic model of microsatellite mutation, and an efficient simulation algorithm. Using these tools, we explore mutation-selection-drift equilibrium at microsatellites and investigate the mutational history and selective regime of the microsatellite that causes Friedreich's ataxia. We characterize microsatellite selective events by their duration and cost, note similarities to sweeps from standing point variation, and conclude that it is premature to label microsatellites as ubiquitous agents of efficient adaptive change. Together, our models and simulation algorithm provide a powerful framework for statistical inference, which can be used to test the neutrality of microsatellites and other multiallelic variants.
Microsatellites as Targets of Natural Selection
Haasl, Ryan J.; Payseur, Bret A.
2013-01-01
The ability to survey polymorphism on a genomic scale has enabled genome-wide scans for the targets of natural selection. Theory that connects patterns of genetic variation to evidence of natural selection most often assumes a diallelic locus and no recurrent mutation. Although these assumptions are suitable to selection that targets single nucleotide variants, fundamentally different types of mutation generate abundant polymorphism in genomes. Moreover, recent empirical results suggest that mutationally complex, multiallelic loci including microsatellites and copy number variants are sometimes targeted by natural selection. Given their abundance, the lack of inference methods tailored to the mutational peculiarities of these types of loci represents a notable gap in our ability to interrogate genomes for signatures of natural selection. Previous theoretical investigations of mutation-selection balance at multiallelic loci include assumptions that limit their application to inference from empirical data. Focusing on microsatellites, we assess the dynamics and population-level consequences of selection targeting mutationally complex variants. We develop general models of a multiallelic fitness surface, a realistic model of microsatellite mutation, and an efficient simulation algorithm. Using these tools, we explore mutation-selection-drift equilibrium at microsatellites and investigate the mutational history and selective regime of the microsatellite that causes Friedreich’s ataxia. We characterize microsatellite selective events by their duration and cost, note similarities to sweeps from standing point variation, and conclude that it is premature to label microsatellites as ubiquitous agents of efficient adaptive change. Together, our models and simulation algorithm provide a powerful framework for statistical inference, which can be used to test the neutrality of microsatellites and other multiallelic variants. PMID:23104080
Gupta, Sarika; Kumari, Kajal; Sahu, Pranav Pankaj; Vidapu, Sudhakar; Prasad, Manoj
2012-02-01
The unavailability of microsatellite markers and saturated genetic linkage map has restricted the genetic improvement of foxtail millet [Setaria italica (L.) P. Beauv.], despite the fact that in recent times it has been documented as a new model species for biofuel grasses. With the objective to generate a good number of microsatellite markers in foxtail millet cultivar 'Prasad', 690 clones were sequenced which generated 112.95 kb high quality sequences obtained from three genomic libraries each enriched with different microsatellite repeat motifs. Microsatellites were identified in 512 (74.2%) of the 690 positive clones and 172 primer pairs (pp) were successfully designed from 249 (48.6%) unique SSR-containing clones. The efficacies of the microsatellite containing genomic sequences were established by superior primer designing ability (69%), PCR amplification efficiency (85.5%) and polymorphic potential (52%) in the parents of F(2) mapping population. Out of 172 pp, functional 147 markers showed high level of cross-species amplification (~74%) in six grass species. Higher polymorphism rate and broad range of genetic diversity (0.30-0.69 averaging 0.58) obtained in constructed phylogenetic tree using 52 microsatellite markers, demonstrated the utility of markers in germplasm characterizations. In silico comparative mapping of 147 foxtail millet microsatellite containing sequences against the mapping data of sorghum (~18%), maize (~16%) and rice (~5%) indicated the presence of orthologous sequences of the foxtail millet in the respective species. The result thus demonstrates the applicability of microsatellite markers in various genotyping applications, determining phylogenetic relationships and comparative mapping in several important grass species.
Souza, Helena A V; Collevatti, Rosane G; Lemos-Filho, José P; Santos, Fabrício R; Lovato, Maria Bernadete
2012-03-01
Microsatellite markers were developed for Dimorphandra mollis (Leguminosae), a widespread tree in the Brazilian cerrado (a savanna-like vegetation). Microsatellite markers were developed from an enriched library. The analyses of polymorphism were based on 56 individuals from three populations. Nine microsatellite loci were polymorphic, with the number of alleles per locus ranging from three to 10 across populations. The observed and expected heterozygosities per locus and population ranged from 0.062 to 0.850 and from 0.062 to 0.832, respectively. These microsatellites provide an efficient tool for population genetics studies and will be used to assess the genetic diversity and spatial genetic structure of D. mollis.
Genomic characterization of EmsB microsatellite loci in Echinococcus multilocularis.
Valot, Benoît; Knapp, Jenny; Umhang, Gérald; Grenouillet, Frédéric; Millon, Laurence
2015-06-01
EmsB is a molecular marker applied to Echinococcus multilocularis genotyping studies. This marker has largely been used to investigate the epidemiology of the parasite in different endemic foci. The present study has lifted the veil on the genetic structure of this microsatellite. By in silico analysis on the E. multilocularis genome the microsatellite was described in about 40 copies on the chromosome 5 of the parasite. Similar structure was found in the relative parasite Echinococcus granulosus, where the microsatellite was firstly described. The present study completes the first investigations made on the EmsB microsatellite origins and confirms the reliability of this highly discriminant molecular marker. Copyright © 2015 Elsevier B.V. All rights reserved.
Estoup, Arnaud; Jarne, Philippe; Cornuet, Jean-Marie
2002-09-01
Homoplasy has recently attracted the attention of population geneticists, as a consequence of the popularity of highly variable stepwise mutating markers such as microsatellites. Microsatellite alleles generally refer to DNA fragments of different size (electromorphs). Electromorphs are identical in state (i.e. have identical size), but are not necessarily identical by descent due to convergent mutation(s). Homoplasy occurring at microsatellites is thus referred to as size homoplasy. Using new analytical developments and computer simulations, we first evaluate the effect of the mutation rate, the mutation model, the effective population size and the time of divergence between populations on size homoplasy at the within and between population levels. We then review the few experimental studies that used various molecular techniques to detect size homoplasious events at some microsatellite loci. The relationship between this molecularly accessible size homoplasy size and the actual amount of size homoplasy is not trivial, the former being considerably influenced by the molecular structure of microsatellite core sequences. In a third section, we show that homoplasy at microsatellite electromorphs does not represent a significant problem for many types of population genetics analyses realized by molecular ecologists, the large amount of variability at microsatellite loci often compensating for their homoplasious evolution. The situations where size homoplasy may be more problematic involve high mutation rates and large population sizes together with strong allele size constraints.
Microsatellites for Lindera species
Craig S. Echt; D. Deemer; T.L. Kubisiak; C.D. Nelson
2006-01-01
Microsatellite markers were developed for conservation genetic studies of Lindera melissifolia (pondberry), a federally endangered shrub of southern bottomland ecosystems. Microsatellite sequences were obtained from DNA libraries that were enriched for the (AC)n simple sequence repeat motif. From 35 clone sequences, 20 primer...
Polido, C A; Mantello, C C; Moraes, A P; Souza, A P; Forni-Martins, E R
2014-12-04
Aeschynomene falcata is an important forage species; however, because of low seed production, it is underutilized as forage species. Aeschynomene is a polyphyletic genus with a challenging taxonomic position. Two subgenera have been proposed, and it is suggested that Aeschynomene can be split in 2 genera. Thus, new markers, such as microsatellite sequences, are desirable for improving breeding programs for A. falcata. Based on transferability and in situ localization, these microsatellite sequences can be applied as chromosome markers in the genus Aeschynomene and closely related genera. Here, we report the first microsatellite library developed for this genus; 11 microsatellites were characterized, with observed and expected heterozygosities ranging from 0.0000 to 0.7143 and from 0.1287 to 0.8360, respectively. Polymorphic information content varied from 0.1167 to 0.7786. The departure from Hardy-Weinberg equilibrium may have resulted from frequent autogamy, which is characteristic of A. falcata. Of the 11 microsatellites, 9 loci were cross-amplified in A. brevipes and A. paniculata and 7 in Dalbergia nigra and Machaerium vestitum. Five of these 7 cross-amplified microsatellites were applied as probes during the in situ hybridization assay and 2 showed clear signals on A. falcata chromosomes, ensuring their viability as chromosome markers.
Warner, Jeffery F; Harpole, Michael G; Crerar, Lorelei D
2017-01-01
Microsatellite DNA can provide more detailed population genetic information than mitochondrial DNA which is normally used to research ancient bone. The methods detailed in this chapter can be utilized for any type of bone. However, for this example, four microsatellite loci were isolated from Steller's sea cow (Hydrodamalis gigas) using published primers for manatee and dugong microsatellites. The primers DduC05 (Broderick et al., Mol Ecol Notes 6:1275-1277, 2007), Tmakb60, TmaSC5 (Pause et al., Mol Ecol Notes 6: 1073-1076, 2007), and TmaE11 (Garcia-Rodriguez et al., Mol Ecol 12:2161-2163, 2000) all successfully amplified microsatellites from H. gigas. The DNA samples were from bone collected on Bering or St. Lawrence Islands. DNA was analyzed using primers with the fluorescent label FAM-6. Sequenced alleles were then used to indicate a difference in the number of repeats and thus a difference in individuals. This is the first time that H. gigas microsatellite loci have been isolated. These techniques for ancient bone microsatellite analysis allow an estimate of population size for a newly discovered St. Lawrence Island sea cow population.
Wu, Jinwei; Zhao, Hua-Bin; Yu, Dan; Xu, Xinwei
2017-01-31
Waterlogging or flooding is one of the most challenging abiotic stresses experienced by plants. Unlike many flooding-tolerant plants, floating-leaved aquatic plants respond actively to flooding stress by fast growth and elongation of its petioles to make leaves re-floating. However, the molecular mechanisms of this plant group responding to flood have not been investigated before. Here, we investigated the genetic basis of this adaptive response by characterizing the petiole transcriptomes of a floating-leaved species Nymphoides peltata under normal and flooding conditions. Clean reads under normal and flooding conditions with pooled sampling strategy were assembled into 124,302 unigenes. A total of 8883 unigenes were revealed to be differentially expressed between normal and flooding conditions. Among them, top ranked differentially expressed genes were mainly involved in antioxidant process, photosynthesis process and carbohydrate metabolism, including the glycolysis and a modified tricarboxylic acid cycle - alanine metabolism. Eight selected unigenes with significantly differentiated expression changes between normal and flooding conditions were validated by qRT-PCR. Among these processes, antioxidant process and glycolysis are commonly induced by waterlogging or flooding environment in plants, whereas photosynthesis and alanine metabolism are rarely occurred in other flooding-tolerant plants, suggesting the significant contributions of the two processes in the active response of N. peltata to flooding stress. Our results provide a valuable genomic resource for future studies on N. peltata and deepen our understanding of the genetic basis underlying the response to flooding stress in aquatic plants.
Zhang, Shi-Xin; Wu, Shao-Hua; Chen, Yue-Yi; Tian, Wei-Min
2015-01-01
The secondary laticifer in the secondary phloem is differentiated from the vascular cambia of the rubber tree (Hevea brasiliensis Muell. Arg.). The number of secondary laticifers is closely related to the rubber yield potential of Hevea. Pharmacological data show that jasmonic acid and its precursor linolenic acid are effective in inducing secondary laticifer differentiation in epicormic shoots of the rubber tree. In the present study, an experimental system of coronatine-induced laticifer differentiation was developed to perform SSH identification of genes with differential expression. A total of 528 positive clones were obtained by blue-white screening, of which 248 clones came from the forward SSH library while 280 clones came from the reverse SSH library. Approximately 215 of the 248 clones and 171 of the 280 clones contained cDNA inserts by colony PCR screening. A total of 286 of the 386 ESTs were detected to be differentially expressed by reverse northern blot and sequenced. Approximately 147 unigenes with an average length of 497 bp from the forward and 109 unigenes with an average length of 514 bp from the reverse SSH libraries were assembled and annotated. The unigenes were associated with the stress/defense response, plant hormone signal transduction and structure development. It is suggested that Ca2+ signal transduction and redox seem to be involved in differentiation, while PGA and EIF are associated with the division of cambium initials for COR-induced secondary laticifer differentiation in the rubber tree. PMID:26147807
Zhou, Wen-Zhao; Zhang, Yan-Mei; Lu, Jun-Ying; Li, Jun-Feng
2012-01-01
To provide a resource of sisal-specific expressed sequence data and facilitate this powerful approach in new gene research, the preparation of normalized cDNA libraries enriched with full-length sequences is necessary. Four libraries were produced with RNA pooled from Agave sisalana multiple tissues to increase efficiency of normalization and maximize the number of independent genes by SMART™ method and the duplex-specific nuclease (DSN). This procedure kept the proportion of full-length cDNAs in the subtracted/normalized libraries and dramatically enhanced the discovery of new genes. Sequencing of 3875 cDNA clones of libraries revealed 3320 unigenes with an average insert length about 1.2 kb, indicating that the non-redundancy of libraries was about 85.7%. These unigene functions were predicted by comparing their sequences to functional domain databases and extensively annotated with Gene Ontology (GO) terms. Comparative analysis of sisal unigenes and other plant genomes revealed that four putative MADS-box genes and knotted-like homeobox (knox) gene were obtained from a total of 1162 full-length transcripts. Furthermore, real-time PCR showed that the characteristics of their transcripts mainly depended on the tight expression regulation of a number of genes during the leaf and flower development. Analysis of individual library sequence data indicated that the pooled-tissue approach was highly effective in discovering new genes and preparing libraries for efficient deep sequencing. PMID:23202944
Gao, Huanhuan; Zhai, Yifan; Wang, Wenbo; Chen, Hao; Zhou, Xianhong; Zhuang, Qianying; Yu, Yi; Li, Rumei
2016-01-01
Bradysia odoriphaga (Diptera: Sciaridae) is the most important pest of Chinese chive (Allium tuberosum) in Asia; however, the molecular genetics are poorly understood. To explore the molecular biological mechanism of development, Illumina sequencing and de novo assembly were performed in the third-instar, fourth-instar, and pupal B. odoriphaga. The study resulted in 16.2 Gb of clean data and 47,578 unigenes (≥125 bp) contained in 7,632,430 contigs, 46.21% of which were annotated from non-redundant protein (NR), Gene Ontology (GO), Clusters of Orthologous Groups (COG), Eukaryotic Orthologous Groups (KOG), and Kyoto Encyclopedia of Genes and Genomes (KEGG) databases. It was found that 19.67% of unigenes matched the homologous species mainly, including Aedes aegypti, Culex quinquefasciatus, Ceratitis capitata, and Anopheles gambiae. According to differentially expressed gene (DEG) analysis, 143, 490, and 309 DEGs were annotated as involved in the developmental process in the GO database respectively, in the comparisons of third-instar and fourth-instar larvae, third-instar larvae and pupae, and fourth-instar larvae and pupae. Twenty-five genes were closely related to these processes, including developmental process, reproduction process, and reproductive organs development and programmed cell death (PCD). The information of unigenes assembled in B. odoriphaga through transcriptome and DEG analyses could provide a detailed genetic basis and regulated information for elaborating the developmental mechanism from the larval, pre-pupal to pupal stages of B. odoriphaga.
Sun, Luchao; Rai, Amit; Rai, Megha; Nakamura, Michimi; Kawano, Noriaki; Yoshimatsu, Kayo; Suzuki, Hideyuki; Kawahara, Nobuo; Saito, Kazuki; Yamazaki, Mami
2018-05-07
The three Forsythia species, F. suspensa, F. viridissima and F. koreana, have been used as herbal medicines in China, Japan and Korea for centuries and they are known to be rich sources of numerous pharmaceutical metabolites, forsythin, forsythoside A, arctigenin, rutin and other phenolic compounds. In this study, de novo transcriptome sequencing and assembly was performed on these species. Using leaf and flower tissues of F. suspensa, F. viridissima and F. koreana, 1.28-2.45-Gbp sequences of Illumina based pair-end reads were obtained and assembled into 81,913, 88,491 and 69,458 unigenes, respectively. Classification of the annotated unigenes in gene ontology terms and KEGG pathways was used to compare the transcriptome of three Forsythia species. The expression analysis of orthologous genes across all three species showed the expression in leaf tissues being highly correlated. The candidate genes presumably involved in the biosynthetic pathway of lignans and phenylethanoid glycosides were screened as co-expressed genes. They express highly in the leaves of F. viridissima and F. koreana. Furthermore, the three unigenes annotated as acyltransferase were predicted to be associated with the biosynthesis of acteoside and forsythoside A from the expression pattern and phylogenetic analysis. This study is the first report on comparative transcriptome analyses of medicinally important Forsythia genus and will serve as an important resource to facilitate further studies on biosynthesis and regulation of therapeutic compounds in Forsythia species.
RNA-Seq reveals leaf cuticular wax-related genes in Welsh onion.
Liu, Qianchun; Wen, Changlong; Zhao, Hong; Zhang, Liying; Wang, Jian; Wang, Yongqin
2014-01-01
The waxy cuticle plays a very important role in plant resistance to various biotic and abiotic stresses and is an important characteristic of Welsh onions. Two different types of biangan Welsh onions (BG) were selected for this study: BG, a wild-type covered by wax, which forms a continuous lipid membrane on its epidermal cells, and GLBG, a glossy mutant of BG whose epidermal cells are not covered by wax. To elucidate the waxy cuticle-related gene expression changes, we used RNA-Seq to compare these two Welsh onion varieties with distinct differences in cuticular wax. The de novo assembly yielded 42,881 putative unigenes, 25.41% of which are longer than 1,000 bp. Among the high-quality unique sequences, 22,289 (52.0%) had at least one significant match to an existing gene model. A total of 798 genes, representing 1.86% of the total putative unigenes, were differentially expressed between these two Welsh onion varieties. The expression patterns of four important unigenes that are related to waxy cuticle biosynthesis were confirmed by RT-qPCR and COG class annotation, which demonstrated that these genes play an important role in defense mechanisms and lipid transport and metabolism. To our knowledge, this study is the first exploration of the Welsh onion waxy cuticle. These results may help to reveal the molecular mechanisms underlying the waxy cuticle and will be useful for waxy gene cloning, genetics and breeding as well as phylogenetic and evolutionary studies of the Welsh onion.
RNA-Seq Reveals Leaf Cuticular Wax-Related Genes in Welsh Onion
Zhao, Hong; Zhang, Liying; Wang, Jian; Wang, Yongqin
2014-01-01
The waxy cuticle plays a very important role in plant resistance to various biotic and abiotic stresses and is an important characteristic of Welsh onions. Two different types of biangan Welsh onions (BG) were selected for this study: BG, a wild-type covered by wax, which forms a continuous lipid membrane on its epidermal cells, and GLBG, a glossy mutant of BG whose epidermal cells are not covered by wax. To elucidate the waxy cuticle-related gene expression changes, we used RNA-Seq to compare these two Welsh onion varieties with distinct differences in cuticular wax. The de novo assembly yielded 42,881 putative unigenes, 25.41% of which are longer than 1,000 bp. Among the high-quality unique sequences, 22,289 (52.0%) had at least one significant match to an existing gene model. A total of 798 genes, representing 1.86% of the total putative unigenes, were differentially expressed between these two Welsh onion varieties. The expression patterns of four important unigenes that are related to waxy cuticle biosynthesis were confirmed by RT-qPCR and COG class annotation, which demonstrated that these genes play an important role in defense mechanisms and lipid transport and metabolism. To our knowledge, this study is the first exploration of the Welsh onion waxy cuticle. These results may help to reveal the molecular mechanisms underlying the waxy cuticle and will be useful for waxy gene cloning, genetics and breeding as well as phylogenetic and evolutionary studies of the Welsh onion. PMID:25415343
Zhang, Yong; Zhang, Shu-Fei; Lin, Lin; Wang, Da-Zhi
2014-01-01
The dinoflagellates and cyanobacteria are two major kingdoms of life producing paralytic shellfish toxins (PSTs), a large group of neurotoxic alkaloids causing paralytic shellfish poisonings around the world. In contrast to the well elucidated PST biosynthetic genes in cyanobacteria, little is known about the dinoflagellates. This study compared transcriptome profiles of a toxin-producing dinoflagellate, Alexandrium catenella (ACHK-T), and its non-toxic mutant form (ACHK-NT) using RNA-seq. All clean reads were assembled de novo into a total of 113,674 unigenes, and 66,812 unigenes were annotated in the known databases. Out of them, 35 genes were found to express differentially between the two strains. The up-regulated genes in ACHK-NT were involved in photosynthesis, carbon fixation and amino acid metabolism processes, indicating that more carbon and energy were utilized for cell growth. Among the down-regulated genes, expression of a unigene assigned to the long isoform of sxtA, the initiator of toxin biosynthesis in cyanobacteria, was significantly depressed, suggesting that this long transcript of sxtA might be directly involved in toxin biosynthesis and its depression resulted in the loss of the ability to synthesize PSTs in ACHK-NT. In addition, 101 putative homologs of 12 cyanobacterial sxt genes were identified, and the sxtO and sxtZ genes were identified in dinoflagellates for the first time. The findings of this study should shed light on the biosynthesis of PSTs in the dinoflagellates. PMID:25421324
Genome-wide transcriptome profiling reveals novel insights into Luffa cylindrica browning.
Chen, Xia; Tan, Taiming; Xu, Changcheng; Huang, Shuping; Tan, Jie; Zhang, Min; Wang, Chunli; Xie, Conghua
2015-08-07
Luffa cylindrica (sponge gourd) is one of the most popular vegetables in China. Production and consumption of L. cylindrica are limited due to postharvest browning; however, little is known about the genetic regulation of the browning process. In the present study, transcriptome profiles of L. cylindrica cultivars, YLB05 (browning resistant) and XTR05 (browning sensitive), were analyzed using next-generation sequencing to clarify the genes and mechanisms associated with browning. A total of 9.1 Gb of valid data including 116,703 unigenes (>200 bp) were obtained and 39,473 sequences were annotated by alignment against five public databases. Of these, there were 27,407 genes assigned to 747 Gene Ontology functional categories; and 12,350 genes were annotated with 25 Eukaryotic Orthologous Groups (KOG) categories with 343 KOG functional terms. Additionally, by searching against the Kyoto Encyclopedia of Genes and Genomes database, 8689 unigenes were mapped to 189 pathways. Furthermore, there were 24,556 sequences found to be differentially regulated, including 4344 annotated unigenes. Several genes potentially associated with phenolic oxidation, carbohydrate and hormone metabolism were found differentially regulated between the cultivars of different browning sensitivities. Our results suggest that elements involved in enzymatic processes and other pathways might be responsible for L. cylindrica browning. The present study provides a comprehensive transcriptome sequence resource, which will facilitate further studies on gene discovery and exploiting the fruit browning mechanism of L. cylindrica. Copyright © 2015 Elsevier Inc. All rights reserved.
Rodamilans, Bernardo; San León, David; Mühlberger, Louisa; Candresse, Thierry; Neumüller, Michael; Oliveros, Juan Carlos; García, Juan Antonio
2014-01-01
Plum pox virus (PPV) infects Prunus trees around the globe, posing serious fruit production problems and causing severe economic losses. One variety of Prunus domestica, named 'Jojo', develops a hypersensitive response to viral infection. Here we compared infected and non-infected samples using next-generation RNA sequencing to characterize the genetic complexity of the viral population in infected samples and to identify genes involved in development of the resistance response. Analysis of viral reads from the infected samples allowed reconstruction of a PPV-D consensus sequence. De novo reconstruction showed a second viral isolate of the PPV-Rec strain. RNA-seq analysis of PPV-infected 'Jojo' trees identified 2,234 and 786 unigenes that were significantly up- or downregulated, respectively (false discovery rate; FDR≤0.01). Expression of genes associated with defense was generally enhanced, while expression of those related to photosynthesis was repressed. Of the total of 3,020 differentially expressed unigenes, 154 were characterized as potential resistance genes, 10 of which were included in the NBS-LRR type. Given their possible role in plant defense, we selected 75 additional unigenes as candidates for further study. The combination of next-generation sequencing and a Prunus variety that develops a hypersensitive response to PPV infection provided an opportunity to study the factors involved in this plant defense mechanism. Transcriptomic analysis presented an overview of the changes that occur during PPV infection as a whole, and identified candidates suitable for further functional characterization.
Chen, Hanting; Deng, Cao; Nie, Hu; Fan, Gang; He, Yang
2017-01-01
Coptis chinensis Franch., the Chinese goldthread ('Weilian' in Chinese), one of the most important medicinal plants from the family Ranunculaceae, and its rhizome has been widely used in Traditional Chinese Medicine for centuries. Here, we analyzed the chemical components and the transcriptome of the Chinese goldthread from three biotopes, including Zhenping, Zunyi and Shizhu. We built comprehensive, high-quality de novo transcriptome assemblies of the Chinese goldthread from short-read RNA-Sequencing data, obtaining 155,710 transcripts and 56,071 unigenes. More than 98.39% and 95.97% of core eukaryotic genes were found in the transcripts and unigenes respectively, indicating that this unigene set capture the majority of the coding genes. A total of 520,462, 493,718, and 507,247 heterozygous SNPs were identified in the three accessions from Zhenping, Zunyi, and Shizhu respectively, indicating high polymorphism in coding regions of the Chinese goldthread (∼1%). Chemical analyses of the rhizome identified six major components, including berberine, palmatine, coptisine, epiberberine, columbamine, and jatrorrhizine. Berberine has the highest concentrations, followed by coptisine, palmatine, and epiberberine sequentially for all the three accessions. The drug quality of the accession from Shizhu may be the highest among these accessions. Differential analyses of the transcriptome identified four pivotal candidate enzymes, including aspartate aminotransferaseprotein, polyphenol oxidase, primary-amine oxidase, and tyrosine decarboxylase, were significantly differentially expressed and may be responsible for the difference of alkaloids contents in the accessions from different biotopes.
Li, Lingli; Zhang, Hehua; Liu, Zhongshuai; Cui, Xiaoyue; Zhang, Tong; Li, Yanfang; Zhang, Lingyun
2016-10-12
Blueberry is an economically important fruit crop in Ericaceae family. The substantial quantities of flavonoids in blueberry have been implicated in a broad range of health benefits. However, the information regarding fruit development and flavonoid metabolites based on the transcriptome level is still limited. In the present study, the transcriptome and gene expression profiling over berry development, especially during color development were initiated. A total of approximately 13.67 Gbp of data were obtained and assembled into 186,962 transcripts and 80,836 unigenes from three stages of blueberry fruit and color development. A large number of simple sequence repeats (SSRs) and candidate genes, which are potentially involved in plant development, metabolic and hormone pathways, were identified. A total of 6429 sequences containing 8796 SSRs were characterized from 15,457 unigenes and 1763 unigenes contained more than one SSR. The expression profiles of key genes involved in anthocyanin biosynthesis were also studied. In addition, a comparison between our dataset and other published results was carried out. Our high quality reads produced in this study are an important advancement and provide a new resource for the interpretation of high-throughput data for blueberry species whether regarding sequencing data depth or species extension. The use of this transcriptome data will serve as a valuable public information database for the studies of blueberry genome and would greatly boost the research of fruit and color development, flavonoid metabolisms and regulation and breeding of more healthful blueberries.
Yang, Zemao; Lu, Ruike; Dai, Zhigang; Yan, An; Tang, Qing; Cheng, Chaohua; Xu, Ying; Yang, Wenting; Su, Jianguang
2017-01-01
High salinity is a major environmental stressor for crops. To understand the regulatory mechanisms underlying salt tolerance, we conducted a comparative transcriptome analysis between salt-tolerant and salt-sensitive jute (Corchorus spp.) genotypes in leaf and root tissues under salt stress and control conditions. In total, 68,961 unigenes were identified. Additionally, 11,100 unigenes (including 385 transcription factors (TFs)) exhibited significant differential expression in salt-tolerant or salt-sensitive genotypes. Numerous common and unique differentially expressed unigenes (DEGs) between the two genotypes were discovered. Fewer DEGs were observed in salt-tolerant jute genotypes whether in root or leaf tissues. These DEGs were involved in various pathways, such as ABA signaling, amino acid metabolism, etc. Among the enriched pathways, plant hormone signal transduction (ko04075) and cysteine/methionine metabolism (ko00270) were the most notable. Eight common DEGs across both tissues and genotypes with similar expression profiles were part of the PYL-ABA-PP2C (pyrabactin resistant-like/regulatory components of ABA receptors-abscisic acid-protein phosphatase 2C). The methionine metabolism pathway was only enriched in salt-tolerant jute root tissue. Twenty-three DEGs were involved in methionine metabolism. Overall, numerous common and unique salt-stress response DEGs and pathways between salt-tolerant and salt-sensitive jute have been discovered, which will provide valuable information regarding salt-stress response mechanisms and help improve salt-resistance molecular breeding in jute. PMID:28927022
Genomic and genotyping characterization of haplotype-based polymorphic microsatellites in Prunus
USDA-ARS?s Scientific Manuscript database
Efficient utilization of microsatellites in genetic studies remains impeded largely due to the unknown status of their primer reliability, chromosomal location, and allele polymorphism. Discovery and characterization of microsatellite polymorphisms in a taxon will disclose the unknowns and gain new ...
Microsatellite primers for red drum (Sciaenops ocellatus)
USDA-ARS?s Scientific Manuscript database
In this note, we document polymerase-chain-reaction (PCR) primer pairs for 101, nuclear-encoded microsatellites designed and developed from a red drum (Sciaenops ocellatus) genomic library. The 101 microsatellites (Genbank Accession Numbers EU015882-EU015982) were amplified successfully and used to...
De Barba, M; Miquel, C; Lobréaux, S; Quenette, P Y; Swenson, J E; Taberlet, P
2017-05-01
Microsatellite markers have played a major role in ecological, evolutionary and conservation research during the past 20 years. However, technical constrains related to the use of capillary electrophoresis and a recent technological revolution that has impacted other marker types have brought to question the continued use of microsatellites for certain applications. We present a study for improving microsatellite genotyping in ecology using high-throughput sequencing (HTS). This approach entails selection of short markers suitable for HTS, sequencing PCR-amplified microsatellites on an Illumina platform and bioinformatic treatment of the sequence data to obtain multilocus genotypes. It takes advantage of the fact that HTS gives direct access to microsatellite sequences, allowing unambiguous allele identification and enabling automation of the genotyping process through bioinformatics. In addition, the massive parallel sequencing abilities expand the information content of single experimental runs far beyond capillary electrophoresis. We illustrated the method by genotyping brown bear samples amplified with a multiplex PCR of 13 new microsatellite markers and a sex marker. HTS of microsatellites provided accurate individual identification and parentage assignment and resulted in a significant improvement of genotyping success (84%) of faecal degraded DNA and costs reduction compared to capillary electrophoresis. The HTS approach holds vast potential for improving success, accuracy, efficiency and standardization of microsatellite genotyping in ecological and conservation applications, especially those that rely on profiling of low-quantity/quality DNA and on the construction of genetic databases. We discuss and give perspectives for the implementation of the method in the light of the challenges encountered in wildlife studies. © 2016 John Wiley & Sons Ltd.
Research of test fault diagnosis method for micro-satellite PSS
NASA Astrophysics Data System (ADS)
Wu, Haichao; Wang, Jinqi; Yang, Zhi; Yan, Meizhi
2017-11-01
Along with the increase in the number of micro-satellite and the shortening of the product's lifecycle, negative effects of satellite ground test failure become more and more serious. Real-time and efficient fault diagnosis becomes more and more necessary. PSS plays an important role in the satellite ground test's safety and reliability as one of the most important subsystems that guarantees the safety of micro-satellite energy. Take test fault diagnosis method of micro-satellite PSS as research object. On the basis of system features of PSS and classic fault diagnosis methods, propose a kind of fault diagnosis method based on the layered and loose coupling way. This article can provide certain reference for fault diagnosis methods research of other subsystems of micro-satellite.
Ley, Alexandra C; Hardy, Olivier J
2016-03-01
Microsatellite markers were developed for the species Haumania danckelmaniana (Marantaceae) from central tropical Africa. Microsatellite isolation was performed simultaneously on three different species of Marantaceae through a procedure that combines multiplex microsatellite enrichment and next-generation sequencing. From 80 primers selected for initial screening, 20 markers positively amplified in H. danckelmaniana, of which 10 presented unambiguous amplification products within the expected size range and eight were polymorphic with four to nine alleles per locus. Positive transferability with the related species H. liebrechtsiana was observed for the same 10 markers. The polymorphic microsatellite markers are suitable for studies in genetic diversity and structure, mating system, and gene flow in H. danckelmaniana and the closely related species H. liebrechtsiana.
BRIEF-REPORT New set of microsatellites for Chinese tallow tree, Triadica sebifera.
Zhuang, Y F; Wang, Z F; Wu, L F
2017-04-05
Chinese tallow (Triadica sebifera) is an important crop and ornamental tree. After it was introduced into the USA, it gradually became a noxious invasive tree in south-eastern America since the middle of the 1900s. Because only six microsatellites were reported previously in T. sebifera, to better understand the genetic diversity and population dynamics of such species, we reported here 28 new microsatellite markers. For these 28 microsatellites, the number of alleles per locus ranged from 2-16. The expected heterozygosity and the expected heterozygosity corrected for sample size varied from 0.0796 to 0.9081 and from 0.0805 to 0.9176, respectively. These microsatellites will provide additional choice to investigate the genetic diversity and structure in T. sebifera.
WebSat--a web software for microsatellite marker development.
Martins, Wellington Santos; Lucas, Divino César Soares; Neves, Kelligton Fabricio de Souza; Bertioli, David John
2009-01-01
Simple sequence repeats (SSR), also known as microsatellites, have been extensively used as molecular markers due to their abundance and high degree of polymorphism. We have developed a simple to use web software, called WebSat, for microsatellite molecular marker prediction and development. WebSat is accessible through the Internet, requiring no program installation. Although a web solution, it makes use of Ajax techniques, providing a rich, responsive user interface. WebSat allows the submission of sequences, visualization of microsatellites and the design of primers suitable for their amplification. The program allows full control of parameters and the easy export of the resulting data, thus facilitating the development of microsatellite markers. The web tool may be accessed at http://purl.oclc.org/NET/websat/
Hatmaker, E. Anne; Wadl, Phillip A.; Mantooth, Kristie; Scheffler, Brian E.; Ownley, Bonnie H.; Trigiano, Robert N.
2015-01-01
Premise of the study: We developed microsatellites from Fothergilla ×intermedia to establish loci capable of distinguishing species and cultivars, and to assess genetic diversity for use by ornamental breeders and to transfer within Hamamelidaceae. Methods and Results: We sequenced a small insert genomic library enriched for microsatellites to develop 12 polymorphic microsatellite loci. The number of alleles detected ranged from four to 15 across five genera within Hamamelidaceae. Shannon’s information index ranged from 0.07 to 0.14. Conclusions: These microsatellite loci provide a set of markers to evaluate genetic diversity of natural and cultivated collections and assist ornamental plant breeders for genetic studies of five popular genera of woody ornamental plants. PMID:25909044
Lewis, Emily M.; Fant, Jeremie B.; Moore, Michael J.; Hastings, Amy P.; Larson, Erica L.; Agrawal, Anurag A.; Skogen, Krissa A.
2016-01-01
Premise of the study: Eleven nuclear and four plastid microsatellite markers were screened for two gypsum endemic species, Oenothera gayleana and O. hartwegii subsp. filifolia, and tested for cross-amplification in the remaining 11 taxa within Oenothera sect. Calylophus (Onagraceae). Methods and Results: Microsatellite markers were tested in two to three populations spanning the ranges of both O. gayleana and O. hartwegii subsp. filifolia. The nuclear microsatellite loci consisted of both di- and trinucleotide repeats with one to 17 alleles per population. Several loci showed significant deviation from Hardy–Weinberg equilibrium, which may be evidence of chromosomal rings. The plastid microsatellite markers identified one to seven haplotypes per population. The transferability of these markers was confirmed in all 11 taxa within Oenothera sect. Calylophus. Conclusions: The microsatellite loci characterized here are the first developed and tested in Oenothera sect. Calylophus. These markers will be used to assess whether pollinator foraging distance influences population genetic parameters in predictable ways. PMID:26949578
Witherup, Colby; Ragone, Diane; Wiesner-Hanks, Tyr; Irish, Brian; Scheffler, Brian; Simpson, Sheron; Zee, Francis; Zuberi, M. Iqbal; Zerega, Nyree J. C.
2013-01-01
• Premise of the study: Microsatellite loci were isolated and characterized from enriched genomic libraries of Artocarpus altilis (breadfruit) and tested in four Artocarpus species and one hybrid. The microsatellite markers provide new tools for further studies in Artocarpus. • Methods and Results: A total of 25 microsatellite loci were evaluated across four Artocarpus species and one hybrid. Twenty-one microsatellite loci were evaluated on A. altilis (241), A. camansi (34), A. mariannensis (15), and A. altilis × mariannensis (64) samples. Nine of those loci plus four additional loci were evaluated on A. heterophyllus (jackfruit, 426) samples. All loci are polymorphic for at least one species. The average number of alleles ranges from two to nine within taxa. • Conclusions: These microsatellite primers will facilitate further studies on the genetic structure and evolutionary and domestication history of Artocarpus species. They will aid in cultivar identification and establishing germplasm conservation strategies for breadfruit and jackfruit. PMID:25202565
Investigation of microsatellite instability in Turkish breast cancer patients.
Demokan, Semra; Muslumanoglu, Mahmut; Yazici, H; Igci, Abdullah; Dalay, Nejat
2002-01-01
Multiple somatic and inherited genetic changes that lead to loss of growth control may contribute to the development of breast cancer. Microsatellites are tandem repeats of simple sequences that occur abundantly and at random throughout most eucaryotic genomes. Microsatellite instability (MI), characterized by the presence of random contractions or expansions in the length of simple sequence repeats or microsatellites, is observed in a variety of tumors. The aim of this study was to compare tumor DNA fingerprints with constitutional DNA fingerprints to investigate changes specific to breast cancer and evaluate its correlation with clinical characteristics. Tumor and normal tissue samples of 38 patients with breast cancer were investigated by comparing PCR-amplified microsatellite sequences D2S443 and D21S1436. Microsatellite instability at D21S1436 and D2S443 was found in 5 (13%) and 7 (18%) patients, respectively. Two patients displayed instability at both marker loci. No association was found between MI and age, family history, lymph node involvement and other clinical parameters.
Kets, C M; van Krieken, J H J M; van Erp, P E J; Feuth, T; Jacobs, Y H A; Brunner, H G; Ligtenberg, M J L; Hoogerbrugge, N
2008-02-15
Most colorectal cancers show either microsatellite or chromosomal instability. A subset of colorectal cancers, especially those diagnosed at young age, is known to show neither of these forms of genetic instability and thus might have a distinct pathogenesis. Colorectal cancers diagnosed at young age are suggestive for hereditary predisposition. We investigate whether such early-onset microsatellite and chromosomally stable colorectal cancers are a hallmark of a genetic susceptibility syndrome. The ploidy status of microsatellite stable (familial) colorectal cancers of patients diagnosed before age 50 (n = 127) was analyzed in relation to the histopathological characteristics and family history. As a control the ploidy status of sporadic colorectal cancer, with normal staining of mismatch repair proteins, diagnosed at the age of 69 years or above (n = 70) was determined. A diploid DNA content was used as a marker for chromosomal stability. Within the group of patients with (familial) early onset microsatellite stable colorectal cancer the chromosomally stable tumors did not differ from chromosomally unstable tumors with respect to mean age at diagnosis, fulfillment of Amsterdam criteria or pathological characteristics. Segregation analysis did not reveal any family with microsatellite and chromosomally stable colorectal cancer in 2 relatives. The prevalence of microsatellite and chromosomally stable colorectal cancer was not significantly different for the early and late onset group (28 and 21%, respectively). We find no evidence that early-onset microsatellite and chromosomally stable colorectal cancer is a hallmark of a hereditary colorectal cancer syndrome. (c) 2007 Wiley-Liss, Inc.
Batch Isolation of Microsatellites for Tropical Plant Species Pyrosequencing
USDA-ARS?s Scientific Manuscript database
Microsatellites were developed for ten tropical species using a method recently developed in our laboratory that involves a combination of two adapters at the SSR-enrichment stage and allows for cost saving and simultaneous loading of samples. The species for which microsatellites were isolated are...
USDA-ARS?s Scientific Manuscript database
Several available Prunus chloroplast genomes have not been exploited to develop polymorphic chloroplast microsatellites that could be useful in Prunus maternal lineage and phylogenetic analysis. In this study, using available bioinformatics tools, 80, 75, and 78 microsatellites were identified from ...
Castro, Juan C; Maddox, J Dylan; Cobos, Marianela; Requena, David; Zimic, Mirko; Bombarely, Aureliano; Imán, Sixto A; Cerdeira, Luis A; Medina, Andersson E
2015-11-24
Myrciaria dubia is an Amazonian fruit shrub that produces numerous bioactive phytochemicals, but is best known by its high L-ascorbic acid (AsA) content in fruits. Pronounced variation in AsA content has been observed both within and among individuals, but the genetic factors responsible for this variation are largely unknown. The goals of this research, therefore, were to assemble, characterize, and annotate the fruit transcriptome of M. dubia in order to reconstruct metabolic pathways and determine if multiple pathways contribute to AsA biosynthesis. In total 24,551,882 high-quality sequence reads were de novo assembled into 70,048 unigenes (mean length = 1150 bp, N50 = 1775 bp). Assembled sequences were annotated using BLASTX against public databases such as TAIR, GR-protein, FB, MGI, RGD, ZFIN, SGN, WB, TIGR_CMR, and JCVI-CMR with 75.2 % of unigenes having annotations. Of the three core GO annotation categories, biological processes comprised 53.6 % of the total assigned annotations, whereas cellular components and molecular functions comprised 23.3 and 23.1 %, respectively. Based on the KEGG pathway assignment of the functionally annotated transcripts, five metabolic pathways for AsA biosynthesis were identified: animal-like pathway, myo-inositol pathway, L-gulose pathway, D-mannose/L-galactose pathway, and uronic acid pathway. All transcripts coding enzymes involved in the ascorbate-glutathione cycle were also identified. Finally, we used the assembly to identified 6314 genic microsatellites and 23,481 high quality SNPs. This study describes the first next-generation sequencing effort and transcriptome annotation of a non-model Amazonian plant that is relevant for AsA production and other bioactive phytochemicals. Genes encoding key enzymes were successfully identified and metabolic pathways involved in biosynthesis of AsA, anthocyanins, and other metabolic pathways have been reconstructed. The identification of these genes and pathways is in agreement with the empirically observed capability of M. dubia to synthesize and accumulate AsA and other important molecules, and adds to our current knowledge of the molecular biology and biochemistry of their production in plants. By providing insights into the mechanisms underpinning these metabolic processes, these results can be used to direct efforts to genetically manipulate this organism in order to enhance the production of these bioactive phytochemicals. The accumulation of AsA precursor and discovery of genes associated with their biosynthesis and metabolism in M. dubia is intriguing and worthy of further investigation. The sequences and pathways produced here present the genetic framework required for further studies. Quantitative transcriptomics in concert with studies of the genome, proteome, and metabolome under conditions that stimulate production and accumulation of AsA and their precursors are needed to provide a more comprehensive view of how these pathways for AsA metabolism are regulated and linked in this species.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Willard, H.F.; Cremers, F.; Mandel, J.L.
A high-quality integrated genetic and physical map of the X chromosome from telomere to telomere, based primarily on YACs formatted with probes and STSs, is increasingly close to reality. At the Fifth International X Chromosome Workshop, organized by A.M. Poustka and D. Schlessinger in Heidelberg, Germany, April 24--27, 1994, substantial progress was recorded on extension and refinement of the physical map, on the integration of genetic and cytogenetic data, on attempts to use the map to direct gene searches, and on nascent large-scale sequencing efforts. This report summarizes physical and genetic mapping information presented at the workshop and/or published sincemore » the reports of the fourth International X Chromosome Workshop. The principle aim of the workshop was to derive a consensus map of the chromosome, in terms of physical contigs emphasizing the location of genes and microsatellite markers. The resulting map is presented and updates previous versions. This report also updates the list of highly informative microsatellites. The text highlights the working state of the map, the genes known to reside on the X, and the progress toward integration of various types of data.« less
Molecular traceability of beef from synthetic Mexican bovine breeds.
Rodríguez-Ramírez, R; Arana, A; Alfonso, L; González-Córdova, A F; Torrescano, G; Guerrero Legarreta, I; Vallejo-Cordoba, B
2011-10-06
Traceability ensures a link between carcass, quarters or cuts of beef and the individual animal or the group of animals from which they are derived. Meat traceability is an essential tool for successful identification and recall of contaminated products from the market during a food crisis. Meat traceability is also extremely important for protection and value enhancement of good-quality brands. Molecular meat traceability would allow verification of conventional methods used for beef tracing in synthetic Mexican bovine breeds. We evaluated a set of 11 microsatellites for their ability to identify animals belonging to these synthetic breeds, Brangus and Charolais/Brahman (78 animals). Seven microsatellite markers allowed sample discrimination with a match probability, defined as the probability of finding two individuals sharing by chance the same genotypic profile, of 10(-8). The practical application of the marker set was evaluated by testing eight samples from carcasses and pieces of meat at the slaughterhouse and at the point of sale. The DNA profiles of the two samples obtained at these two different points in the production-commercialization chain always proved that they came from the same animal.
Akinyi, Sheila; Hayden, Tonya; Gamboa, Dionicia; Torres, Katherine; Bendezu, Jorge; Abdallah, Joseph F.; Griffing, Sean M.; Quezada, Wilmer Marquiño; Arrospide, Nancy; De Oliveira, Alexandre Macedo; Lucas, Carmen; Magill, Alan J.; Bacon, David J.; Barnwell, John W.; Udhayakumar, Venkatachalam
2013-01-01
The majority of malaria rapid diagnostic tests (RDTs) detect Plasmodium falciparum histidine-rich protein 2 (PfHRP2), encoded by the pfhrp2 gene. Recently, P. falciparum isolates from Peru were found to lack pfhrp2 leading to false-negative RDT results. We hypothesized that pfhrp2-deleted parasites in Peru derived from a single genetic event. We evaluated the parasite population structure and pfhrp2 haplotype of samples collected between 1998 and 2005 using seven neutral and seven chromosome 8 microsatellite markers, respectively. Five distinct pfhrp2 haplotypes, corresponding to five neutral microsatellite-based clonal lineages, were detected in 1998-2001; pfhrp2 deletions occurred within four haplotypes. In 2003-2005, outcrossing among the parasite lineages resulted in eight population clusters that inherited the five pfhrp2 haplotypes seen previously and a new haplotype; pfhrp2 deletions occurred within four of these haplotypes. These findings indicate that the genetic origin of pfhrp2 deletion in Peru was not a single event, but likely occurred multiple times. PMID:24077522
The slick hair coat locus maps to chromosome 20 in Senepol-derived cattle.
Mariasegaram, M; Chase, C C; Chaparro, J X; Olson, T A; Brenneman, R A; Niedz, R P
2007-02-01
The ability to maintain normal temperatures during heat stress is an important attribute for cattle in the subtropics and tropics. Previous studies have shown that Senepol cattle and their crosses with Holstein, Charolais and Angus animals are as heat tolerant as Brahman cattle. This has been attributed to the slick hair coat of Senepol cattle, which is thought to be controlled by a single dominant gene. In this study, a genome scan using a DNA-pooling strategy indicated that the slick locus is most likely on bovine chromosome 20 (BTA20). Interval mapping confirmed the BTA20 assignment and refined the location of the locus. In total, 14 microsatellite markers were individually genotyped in two pedigrees consisting of slick and normal-haired cattle (n = 36), representing both dairy and beef breeds. The maximum LOD score was 9.4 for a 4.4-cM support interval between markers DIK2416 and BM4107. By using additional microsatellite markers in this region, and genotyping in six more pedigrees (n = 86), the slick locus was further localized to the DIK4835 - DIK2930 interval.
Conservation genetics of bull trout: Geographic distribution of variation at microsatellite loci.
P. Spruell; A.R. Hemmingsen; P.J. Howell; N. Kanda; F.W. Allendorf
2003-01-01
We describe the genetic population structure of 65 bull trout (Salvelinus confluentus) populations from the northwestern United States using four microsatellite loci. The distribution of genetic variation as measured by microsatellites is consistent with previous allozyme and mitochondrial DNA analysis. There is relatively little genetic variation...
Jennifer Koch; Dave Carey; M.E. Mason
2010-01-01
Cross-species amplification of six microsatellite markers from European beech (Fagus sylvatica Linn) and nine markers from Japanese beech (Fagus crenata Blume) was tested in American beech (Fagus grandifolia Ehrh.). Three microsatellites from each species were successfully adapted for use in American beech...
Detection of Bladder CA by Microsatellite Analysis (MSA) — EDRN Public Portal
Goal 1: To determine sensitivity and specificity of microsatellite analysis (MSA) of urine sediment, using a panel of 15 microsatellite markers, in detecting bladder cancer in participants requiring cystoscopy. This technique will be compared to the diagnostic standard of cystoscopy, as well as to urine cytology. Goal 2: To determine the temporal performance characteristics of microsatellite analysis of urine sediment. Goal 3: To determine which of the 15 individual markers or combination of markers that make up the MSA test are most predictive of the presence of bladder cancer.
Microsatellite primers for the endangered aquatic herb, Ottelia acuminata (Hydrocharitaceae).
Xu, Chao; Du, Zhi-Yuan; Chen, Jin-Ming; Wang, Qing-Feng
2012-06-01
Microsatellite primers were developed in the endangered aquatic herb, Ottelia acuminata, to characterize its genetic diversity and understand its population structure. Eight polymorphic microsatellite markers were developed from two populations of O. acuminata in China. The number of alleles per locus ranged from one to 15; the observed and expected heterozygosities ranged from 0 to 0.885 and from 0 to 0.888, respectively, in the two populations. Selected loci also amplified successfully in O. sinensis. These microsatellite markers will facilitate further studies on the conservation genetics and evolutionary history of O. acuminata.
WebSat ‐ A web software for microsatellite marker development
Martins, Wellington Santos; Soares Lucas, Divino César; de Souza Neves, Kelligton Fabricio; Bertioli, David John
2009-01-01
Simple sequence repeats (SSR), also known as microsatellites, have been extensively used as molecular markers due to their abundance and high degree of polymorphism. We have developed a simple to use web software, called WebSat, for microsatellite molecular marker prediction and development. WebSat is accessible through the Internet, requiring no program installation. Although a web solution, it makes use of Ajax techniques, providing a rich, responsive user interface. WebSat allows the submission of sequences, visualization of microsatellites and the design of primers suitable for their amplification. The program allows full control of parameters and the easy export of the resulting data, thus facilitating the development of microsatellite markers. Availability The web tool may be accessed at http://purl.oclc.org/NET/websat/ PMID:19255650
Ley, Alexandra C.; Hardy, Olivier J.
2016-01-01
Premise of the study: Microsatellite markers were developed for the species Haumania danckelmaniana (Marantaceae) from central tropical Africa. Methods and Results: Microsatellite isolation was performed simultaneously on three different species of Marantaceae through a procedure that combines multiplex microsatellite enrichment and next-generation sequencing. From 80 primers selected for initial screening, 20 markers positively amplified in H. danckelmaniana, of which 10 presented unambiguous amplification products within the expected size range and eight were polymorphic with four to nine alleles per locus. Positive transferability with the related species H. liebrechtsiana was observed for the same 10 markers. Conclusions: The polymorphic microsatellite markers are suitable for studies in genetic diversity and structure, mating system, and gene flow in H. danckelmaniana and the closely related species H. liebrechtsiana. PMID:27011899
Balaresque, Patricia; King, Turi E; Parkin, Emma J; Heyer, Evelyne; Carvalho-Silva, Denise; Kraaijenbrink, Thirsa; de Knijff, Peter; Tyler-Smith, Chris; Jobling, Mark A
2014-01-01
The male-specific region of the human Y chromosome (MSY) contains eight large inverted repeats (palindromes), in which high-sequence similarity between repeat arms is maintained by gene conversion. These palindromes also harbor microsatellites, considered to evolve via a stepwise mutation model (SMM). Here, we ask whether gene conversion between palindrome microsatellites contributes to their mutational dynamics. First, we study the duplicated tetranucleotide microsatellite DYS385a,b lying in palindrome P4. We show, by comparing observed data with simulated data under a SMM within haplogroups, that observed heteroallelic combinations in which the modal repeat number difference between copies was large, can give rise to homoallelic combinations with zero-repeats difference, equivalent to many single-step mutations. These are unlikely to be generated under a strict SMM, suggesting the action of gene conversion. Second, we show that the intercopy repeat number difference for a large set of duplicated microsatellites in all palindromes in the MSY reference sequence is significantly reduced compared with that for nonpalindrome-duplicated microsatellites, suggesting that the former are characterized by unusual evolutionary dynamics. These observations indicate that gene conversion violates the SMM for microsatellites in palindromes, homogenizing copies within individual Y chromosomes, but increasing overall haplotype diversity among chromosomes within related groups. PMID:24610746
Ribeiro, Guilherme Galvarros Bueno Lobo; De Lima, Reginaldo Ramos; Wiezel, Cláudia Emília Vieira; Ferreira, Luzitano Brandão; Sousa, Sandra Mara Bispo; Rocha, Dulce Maria Sucena; Canas, Maria do Carmo Tomitão; Nardelli-Costa, Juliana; Klautau-Guimarães, Maria De Nazaré; Simões, Aguinaldo Luiz; Oliveira, Silviene Fabiana
2009-01-01
The genetic constitution of Afro-derived Brazilian populations is barely studied. To improve that knowledge, we investigated the AluYAP element and five Y-chromosome STRs (DYS19, DYS390, DYS391, DYS392, and DYS393) to estimate ethnic male contribution in the constitution of four Brazilian quilombos remnants: Mocambo, Rio das Rãs, Kalunga, and Riacho de Sacutiaba. Results indicated significant differences among communities, corroborating historical information about the Brazilian settlement. We concluded that besides African contribution, there was a great European participation in the constitution of these four populations and that observed haplotype variability could be explained by gene flow to quilombos remnants and mutational events in microsatellites (STRs). (c) 2009 Wiley-Liss, Inc.
Schuller, Dorit; Casal, Margarida
2007-02-01
From the analysis of six polymorphic microsatellite loci performed in 361 Saccharomyces cerevisiae isolates, 93 alleles were identified, 52 of them being described for the first time. All these isolates have a distinct mtDNA RFLP pattern. They are derived from a pool of 1620 isolates obtained from spontaneous fermentations of grapes collected in three vineyards of the Vinho Verde Region in Portugal, during the 2001-2003 harvest seasons. For all loci analyzed, observed heterozygosity was 3-4 times lower than the expected value supposing a Hardy-Weinberg equilibrium (random mating and no evolutionary mechanisms acting), indicating a clonal structure and strong populational substructuring. Genetic differences among S. cerevisiae populations were apparent mainly from gradations in allele frequencies rather than from distinctive "diagnostic" genotypes, and the accumulation of small allele-frequency differences across six loci allowed the identification of population structures. Genetic differentiation in the same vineyard in consecutive years was of the same order of magnitude as the differences verified among the different vineyards. Correlation of genetic differentiation with the distance between sampling points within a vineyard suggested a pattern of isolation-by-distance, where genetic divergence in a vineyard increased with size. The continuous use of commercial yeasts has a limited influence on the autochthonous fermentative yeast population collected from grapes and may just slightly change populational structures of strains isolated from sites very close to the winery where they have been used. The present work is the first large-scale approach using microsatellite typing allowing a very fine resolution of indigenous S. cerevisiae populations isolated from vineyards.
Vera, Manuel; Bello, Xabier; Álvarez-Dios, Jose-Antonio; Pardo, Belen G; Sánchez, Laura; Carlsson, Jens; Carlsson, Jeanette E L; Bartolomé, Carolina; Maside, Xulio; Martinez, Paulino
2015-12-01
The flat oyster (Ostrea edulis) is one of the most appreciated molluscs in Europe, but its production has been greatly reduced by the parasite Bonamia ostreae. Here, new generation genomic resources were used to analyse the repetitive fraction of the oyster genome, with the aim of developing molecular markers to face this main oyster production challenge. The resulting oyster database, consists of two sets of 10,318 and 7159 unique contigs (4.8 Mbp and 6.8 Mbp in total length) representing the oyster's genome (WG) and haemocyte transcriptome (HT), respectively. A total of 1083 sequences were identified as TE-derived, which corresponded to 4.0% of WG and 1.1% of HT. They were clustered into 142 homology groups, most of which were assigned to the Penelope order of retrotransposons, and to the Helitron and TIR DNA-transposons. Simple repeats and rRNA pseudogenes, also made a significant contribution to the oyster's genome (0.5% and 0.3% of WG and HT, respectively).The most frequent short tandem repeats identified in WG were tetranucleotide motifs while trinucleotide motifs were in HT. Forty identified microsatellite loci, 20 from each database, were selected for technical validation. Success was much lower among WG than HT microsatellites (15% vs 55%), which could reflect higher variation in anonymous regions interfering with primer annealing. All microsatellites developed adjusted to Hardy-Weinberg proportions and represent a useful tool to support future breeding programmes and to manage genetic resources of natural flat oyster beds. Copyright © 2015 Elsevier B.V. All rights reserved.
Kolleck, Jakob; Yang, Mouyu; Zinner, Dietmar; Roos, Christian
2013-01-01
To evaluate the conservation status of a species or population it is necessary to gain insight into its ecological requirements, reproduction, genetic population structure, and overall genetic diversity. In our study we examined the genetic diversity of Rhinopithecus brelichi by analyzing microsatellite data and compared them with already existing data derived from mitochondrial DNA, which revealed that R. brelichi exhibits the lowest mitochondrial diversity of all so far studied Rhinopithecus species. In contrast, the genetic diversity of nuclear DNA is high and comparable to other Rhinopithecus species, i.e. the examined microsatellite loci are similarly highly polymorphic as in other species of the genus. An explanation for these differences in mitochondrial and nuclear genetic diversity could be a male biased dispersal. Females most likely stay within their natal band and males migrate between bands, thus mitochondrial DNA will not be exchanged between bands but nuclear DNA via males. A Bayesian Skyline Plot based on mitochondrial DNA sequences shows a strong decrease of the female effective population size (Nef) starting about 3,500 to 4,000 years ago, which concurs with the increasing human population in the area and respective expansion of agriculture. Given that we found no indication for a loss of nuclear DNA diversity in R. brelichi it seems that this factor does not represent the most prominent conservation threat for the long-term survival of the species. Conservation efforts should therefore focus more on immediate threats such as development of tourism and habitat destruction. PMID:24009761
Castagnone-Sereno, Philippe; Danchin, Etienne G J; Deleury, Emeline; Guillemaud, Thomas; Malausa, Thibaut; Abad, Pierre
2010-10-25
Microsatellites are the most popular source of molecular markers for studying population genetic variation in eukaryotes. However, few data are currently available about their genomic distribution and abundance across the phylum Nematoda. The recent completion of the genomes of several nematode species, including Meloidogyne incognita, a major agricultural pest worldwide, now opens the way for a comparative survey and analysis of microsatellites in these organisms. Using MsatFinder, the total numbers of 1-6 bp perfect microsatellites detected in the complete genomes of five nematode species (Brugia malayi, Caenorhabditis elegans, M. hapla, M. incognita, Pristionchus pacificus) ranged from 2,842 to 61,547, and covered from 0.09 to 1.20% of the nematode genomes. Under our search criteria, the most common repeat motifs for each length class varied according to the different nematode species considered, with no obvious relation to the AT-richness of their genomes. Overall, (AT)n, (AG)n and (CT)n were the three most frequent dinucleotide microsatellite motifs found in the five genomes considered. Except for two motifs in P. pacificus, all the most frequent trinucleotide motifs were AT-rich, with (AAT)n and (ATT)n being the only common to the five nematode species. A particular attention was paid to the microsatellite content of the plant-parasitic species M. incognita. In this species, a repertoire of 4,880 microsatellite loci was identified, from which 2,183 appeared suitable to design markers for population genetic studies. Interestingly, 1,094 microsatellites were identified in 801 predicted protein-coding regions, 99% of them being trinucleotides. When compared against the InterPro domain database, 497 of these CDS were successfully annotated, and further assigned to Gene Ontology terms. Contrasted patterns of microsatellite abundance and diversity were characterized in five nematode genomes, even in the case of two closely related Meloidogyne species. 2,245 di- to hexanucleotide loci were identified in the genome of M. incognita, providing adequate material for the future development of a wide range of microsatellite markers in this major plant parasite.
Chen, Jing; Zhang, Hanping; Feng, Mingfeng; Zuo, Dengpan; Hu, Yahui; Jiang, Tong
2016-07-13
Woodland strawberry (Fragaria vesca) infected with Strawberry vein banding virus (SVBV) exhibits chlorotic symptoms along the leaf veins. However, little is known about the molecular mechanism of strawberry disease caused by SVBV. We performed the next-generation sequencing (RNA-Seq) study to identify gene expression changes induced by SVBV in woodland strawberry using mock-inoculated plants as a control. Using RNA-Seq, we have identified 36,850 unigenes, of which 517 were differentially expressed in the virus-infected plants (DEGs). The unigenes were annotated and classified with Gene Ontology (GO), Clusters of Orthologous Group (COG) and Kyoto Encyclopedia of Genes and Genomes (KEGG) analyses. The KEGG pathway analysis of these genes suggested that strawberry disease caused by SVBV may affect multiple processes including pigment metabolism, photosynthesis and plant-pathogen interactions. Our research provides comprehensive transcriptome information regarding SVBV infection in strawberry.
Transcriptomic Analysis of Phenotypic Changes in Birch (Betula platyphylla) Autotetraploids
Mu, Huai-Zhi; Liu, Zi-Jia; Lin, Lin; Li, Hui-Yu; Jiang, Jing; Liu, Gui-Feng
2012-01-01
Plant breeders have focused much attention on polyploid trees because of their importance to forestry. To evaluate the impact of intraspecies genome duplication on the transcriptome, a series of Betula platyphylla autotetraploids and diploids were generated from four full-sib families. The phenotypes and transcriptomes of these autotetraploid individuals were compared with those of diploid trees. Autotetraploids were generally superior in breast-height diameter, volume, leaf, fruit and stoma and were generally inferior in height compared to diploids. Transcriptome data revealed numerous changes in gene expression attributable to autotetraploidization, which resulted in the upregulation of 7052 unigenes and the downregulation of 3658 unigenes. Pathway analysis revealed that the biosynthesis and signal transduction of indoleacetate (IAA) and ethylene were altered after genome duplication, which may have contributed to phenotypic changes. These results shed light on variations in birch autotetraploidization and help identify important genes for the genetic engineering of birch trees. PMID:23202935
Mehta, Nikita; Hagen, Ferry; Aamir, Sadaf; Singh, Sanjay K; Baghela, Abhishek
2017-12-01
Colletotrichum gloeosporioides is an economically important fungal pathogen causing substantial yield losses indifferent host plants. To understand the genetic diversity and molecular epidemiology of this fungus, we have developed a novel, high-resolution multi-locus microsatellite typing (MLMT) method. Bioinformatic analysis of C. gloeosporioides unannotated genome sequence yielded eight potential microsatellite loci, of which five, CG1 (GT) n , CG2 (GT1) n , CG3 (TC) n , CG4 (CT) n , and CG5 (CT1) n were selected for further study based on their universal amplification potential, reproducibility, and repeat number polymorphism. The selected microsatellites were used to analyze 31 strains of C. gloeosporioides isolated from 20 different host plants from India. All microsatellite loci were found to be polymorphic, and the approximate fragment sizes of microsatellite loci CG1, CG2, CG3, CG4, and CG5 were in ranges of 213-241, 197-227, 231-265, 209-275, and 132-188, respectively. Among the 31 isolates, 55 different genotypes were identified. The Simpson's index of diversity (D) values for the individual locus ranged from 0.79 to 0.92, with the D value of all combined five microsatellite loci being 0.99. Microsatellite data analysis revealed that isolates from Ocimum sanctum , Capsicum annuum (chili pepper), and Mangifera indica (mango) formed distinct clusters, therefore exhibited some level of correlation between certain genotypes and host. The developed MLMT method would be a powerful tool for studying the genetic diversity and any possible genotype-host correlation in C. gloeosporioides .
USDA-ARS?s Scientific Manuscript database
The evolution of individual microsatellite loci is often complex and homoplasy is common but often goes undetected. Sequencing alleles at a microsatellite locus can provide a more complete picture of the common evolutionary mechanisms occurring at that locus and can reveal cases of homoplasy. Within...
Evolution and homoplasy at the bem6 microsatellite locus in three Bemisia tabaci cryptic species
USDA-ARS?s Scientific Manuscript database
The evolution of individual microsatellite loci is often complex and homoplasy is common but often goes undetected. Sequencing alleles at a microsatellite locus can provide a more complete picture of the common evolutionary mechanisms occurring at that locus and can reveal cases of homoplasy. Within...
Sarah E. Hopkins; D. Lee Taylor
2011-01-01
Microsatellite primers were developed for the first time in the species Corallorhiza maculata, a nonphotosynthetic orchid that is becoming a model for studying mycorrhizal specificity. Eight polymorphic microsatellite markers were developed using an enrichment and cloning protocol. The number of alleles for each locus ranged from two to seven. The...
Morris, Katrina M.; Kirby, Katherine; Beatty, Julia A.; Barrs, Vanessa R.; Cattley, Sonia; David, Victor; O’Brien, Stephen J.; Menotti-Raymond, Marilyn
2014-01-01
Diversity within the major histocompatibility complex (MHC) reflects the immunological fitness of a population. MHC-linked microsatellite markers provide a simple and an inexpensive method for studying MHC diversity in large-scale studies. We have developed 6 MHC-linked microsatellite markers in the domestic cat and used these, in conjunction with 5 neutral microsatellites, to assess MHC diversity in domestic mixed breed (n = 129) and purebred Burmese (n = 61) cat populations in Australia. The MHC of outbred Australian cats is polymorphic (average allelic richness = 8.52), whereas the Burmese population has significantly lower MHC diversity (average allelic richness = 6.81; P < 0.01). The MHC-linked microsatellites along with MHC cloning and sequencing demonstrated moderate MHC diversity in cheetahs (n = 13) and extremely low diversity in Gir lions (n = 13). Our MHC-linked microsatellite markers have potential future use in diversity and disease studies in other populations and breeds of cats as well as in wild felid species. PMID:24620003
GREENHOUSE, BRYAN; MYRICK, ALISSA; DOKOMAJILAR, CHRISTIAN; WOO, JONATHAN M.; CARLSON, ELAINE J.; ROSENTHAL, PHILIP J.; DORSEY, GRANT
2006-01-01
Genotyping methods for Plasmodium falciparum drug efficacy trials have not been standardized and may fail to accurately distinguish recrudescence from new infection, especially in high transmission areas where polyclonal infections are common. We developed a simple method for genotyping using previously identified microsatellites and capillary electrophoresis, validated this method using mixtures of laboratory clones, and applied the method to field samples. Two microsatellite markers produced accurate results for single-clone but not polyclonal samples. Four other microsatellite markers were as sensitive as, and more specific than, commonly used genotyping techniques based on merozoite surface proteins 1 and 2. When applied to samples from 15 patients in Burkina Faso with recurrent parasitemia after treatment with sulphadoxine-pyrimethamine, the addition of these four microsatellite markers to msp1 and msp2 genotyping resulted in a reclassification of outcomes that strengthened the association between dhfr 59R, an anti-folate resistance mutation, and recrudescence (P = 0.31 versus P = 0.03). Four microsatellite markers performed well on polyclonal samples and may provide a valuable addition to genotyping for clinical drug efficacy studies in high transmission areas. PMID:17123974
Getino-Mamet, Leandro Nicolás; Valdivia-Carrillo, Tania; Gómez Daglio, Liza; García-De León, Francisco Javier
2017-04-01
The Cannonball jellyfish (Stomolophus sp.) is a species of jellyfish with high relevance in artisanal fishing. Studies of their populations do not extend beyond the morphological descriptions knowing that presents a great morphological variability. However, there are no genetic studies to determine the number of independent populations, so microsatellite markers become a suitable option. Since there are no species-specific microsatellite loci, in this paper, 14 new microsatellite loci are characterized. Microsatellite loci were isolated de novo through next generation sequencing, by two runs on Illumina MiSeq. A total of 506,771,269 base pair were obtained, from which 142,616 were microsatellite loci, and 1546 of them could design primers. We tested 14 primer pairs on 32 individuals from Bahía de La Paz, Gulf of California. We observed low genetic variation among loci (mean number of alleles per locus = 4.33, mean observed heterozygosity 0.381, mean expected heterozygosity 0.501). These loci are the first ones described for the species and will be helpful to carry out genetic diversity and population genetics studies.
Chen, Hongdan; Lai, Wenxiang; Fu, Qiang; Lou, Yonggen
2014-01-01
Background The brown planthopper (BPH), Nilaparvata lugens (Stål), one of the most serious rice insect pests in Asia, can quickly overcome rice resistance by evolving new virulent populations. The insect fat body plays essential roles in the life cycles of insects and in plant-insect interactions. However, whether differences in fat body transcriptomes exist between insect populations with different virulence levels and whether the transcriptomic differences are related to insect virulence remain largely unknown. Methodology/Principal Findings In this study, we performed transcriptome-wide analyses on the fat bodies of two BPH populations with different virulence levels in rice. The populations were derived from rice variety TN1 (TN1 population) and Mudgo (M population). In total, 33,776 and 32,332 unigenes from the fat bodies of TN1 and M populations, respectively, were generated using Illumina technology. Gene ontology annotations and Kyoto Encyclopedia of Genes and Genomes (KEGG) orthology classifications indicated that genes related to metabolism and immunity were significantly active in the fat bodies. In addition, a total of 339 unigenes showed homology to genes of yeast-like symbionts (YLSs) from 12 genera and endosymbiotic bacteria Wolbachia. A comparative analysis of the two transcriptomes generated 7,860 differentially expressed genes. GO annotations and enrichment analysis of KEGG pathways indicated these differentially expressed transcripts might be involved in metabolism and immunity. Finally, 105 differentially expressed genes from YLSs and Wolbachia were identified, genes which might be associated with the formation of different virulent populations. Conclusions/Significance This study was the first to compare the fat-body transcriptomes of two BPH populations having different virulence traits and to find genes that may be related to this difference. Our findings provide a molecular resource for future investigations of fat bodies and will be useful in examining the interactions between the fat body and virulence variation in the BPH. PMID:24533099
Chen, Hongdan; Ye, Wenfeng; Li, Shaohui; Lou, Yonggen
2013-01-01
Background The brown planthopper (BPH), Nilaparvata lugens (Stål), a destructive rice pest in Asia, can quickly overcome rice resistance by evolving new virulent populations. Herbivore saliva plays an important role in plant–herbivore interactions, including in plant defense and herbivore virulence. However, thus far little is known about BPH saliva at the molecular level, especially its role in virulence and BPH–rice interaction. Methodology/Principal Findings Using cDNA amplification in combination with Illumina short-read sequencing technology, we sequenced the salivary-gland transcriptomes of two BPH populations with different virulence; the populations were derived from rice variety TN1 (TN1 population) and Mudgo (M population). In total, 37,666 and 38,451 unigenes were generated from the salivary glands of these populations, respectively. When combined, a total of 43,312 unigenes were obtained, about 18 times more than the number of expressed sequence tags previously identified from these glands. Gene ontology annotations and KEGG orthology classifications indicated that genes related to metabolism, binding and transport were significantly active in the salivary glands. A total of 352 genes were predicted to encode secretory proteins, and some might play important roles in BPH feeding and BPH–rice interactions. Comparative analysis of the transcriptomes of the two populations revealed that the genes related to ‘metabolism,’ ‘digestion and absorption,’ and ‘salivary secretion’ might be associated with virulence. Moreover, 67 genes encoding putative secreted proteins were differentially expressed between the two populations, suggesting these genes may contribute to the change in virulence. Conclusions/Significance This study was the first to compare the salivary-gland transcriptomes of two BPH populations having different virulence traits and to find genes that may be related to this difference. Our data provide a rich molecular resource for future functional studies on salivary glands and will be useful for elucidating the molecular mechanisms underlying BPH feeding and virulence differences. PMID:24244529
Pang, Chaoyou; Fan, Shuli; Song, Meizhen; Yu, Shuxun
2013-01-01
Background Cotton (Gossypium hirsutum L.) is one of the world’s most economically-important crops. However, its entire genome has not been sequenced, and limited resources are available in GenBank for understanding the molecular mechanisms underlying leaf development and senescence. Methodology/Principal Findings In this study, 9,874 high-quality ESTs were generated from a normalized, full-length cDNA library derived from pooled RNA isolated from throughout leaf development during the plant blooming stage. After clustering and assembly of these ESTs, 5,191 unique sequences, representative 1,652 contigs and 3,539 singletons, were obtained. The average unique sequence length was 682 bp. Annotation of these unique sequences revealed that 84.4% showed significant homology to sequences in the NCBI non-redundant protein database, and 57.3% had significant hits to known proteins in the Swiss-Prot database. Comparative analysis indicated that our library added 2,400 ESTs and 991 unique sequences to those known for cotton. The unigenes were functionally characterized by gene ontology annotation. We identified 1,339 and 200 unigenes as potential leaf senescence-related genes and transcription factors, respectively. Moreover, nine genes related to leaf senescence and eleven MYB transcription factors were randomly selected for quantitative real-time PCR (qRT-PCR), which revealed that these genes were regulated differentially during senescence. The qRT-PCR for three GhYLSs revealed that these genes express express preferentially in senescent leaves. Conclusions/Significance These EST resources will provide valuable sequence information for gene expression profiling analyses and functional genomics studies to elucidate their roles, as well as for studying the mechanisms of leaf development and senescence in cotton and discovering candidate genes related to important agronomic traits of cotton. These data will also facilitate future whole-genome sequence assembly and annotation in G. hirsutum and comparative genomics among Gossypium species. PMID:24146870
2011-01-01
Background Nocturnal insects such as moths are ideal models to study the molecular bases of olfaction that they use, among examples, for the detection of mating partners and host plants. Knowing how an odour generates a neuronal signal in insect antennae is crucial for understanding the physiological bases of olfaction, and also could lead to the identification of original targets for the development of olfactory-based control strategies against herbivorous moth pests. Here, we describe an Expressed Sequence Tag (EST) project to characterize the antennal transcriptome of the noctuid pest model, Spodoptera littoralis, and to identify candidate genes involved in odour/pheromone detection. Results By targeting cDNAs from male antennae, we biased gene discovery towards genes potentially involved in male olfaction, including pheromone reception. A total of 20760 ESTs were obtained from a normalized library and were assembled in 9033 unigenes. 6530 were annotated based on BLAST analyses and gene prediction software identified 6738 ORFs. The unigenes were compared to the Bombyx mori proteome and to ESTs derived from Lepidoptera transcriptome projects. We identified a large number of candidate genes involved in odour and pheromone detection and turnover, including 31 candidate chemosensory receptor genes, but also genes potentially involved in olfactory modulation. Conclusions Our project has generated a large collection of antennal transcripts from a Lepidoptera. The normalization process, allowing enrichment in low abundant genes, proved to be particularly relevant to identify chemosensory receptors in a species for which no genomic data are available. Our results also suggest that olfactory modulation can take place at the level of the antennae itself. These EST resources will be invaluable for exploring the mechanisms of olfaction and pheromone detection in S. littoralis, and for ultimately identifying original targets to fight against moth herbivorous pests. PMID:21276261
The Use of EST Expression Matrixes for the Quality Control of Gene Expression Data
Milnthorpe, Andrew T.; Soloviev, Mikhail
2012-01-01
EST expression profiling provides an attractive tool for studying differential gene expression, but cDNA libraries' origins and EST data quality are not always known or reported. Libraries may originate from pooled or mixed tissues; EST clustering, EST counts, library annotations and analysis algorithms may contain errors. Traditional data analysis methods, including research into tissue-specific gene expression, assume EST counts to be correct and libraries to be correctly annotated, which is not always the case. Therefore, a method capable of assessing the quality of expression data based on that data alone would be invaluable for assessing the quality of EST data and determining their suitability for mRNA expression analysis. Here we report an approach to the selection of a small generic subset of 244 UniGene clusters suitable for identification of the tissue of origin for EST libraries and quality control of the expression data using EST expression information alone. We created a small expression matrix of UniGene IDs using two rounds of selection followed by two rounds of optimisation. Our selection procedures differ from traditional approaches to finding “tissue-specific” genes and our matrix yields consistency high positive correlation values for libraries with confirmed tissues of origin and can be applied for tissue typing and quality control of libraries as small as just a few hundred total ESTs. Furthermore, we can pick up tissue correlations between related tissues e.g. brain and peripheral nervous tissue, heart and muscle tissues and identify tissue origins for a few libraries of uncharacterised tissue identity. It was possible to confirm tissue identity for some libraries which have been derived from cancer tissues or have been normalised. Tissue matching is affected strongly by cancer progression or library normalisation and our approach may potentially be applied for elucidating the stage of normalisation in normalised libraries or for cancer staging. PMID:22412959
Yu, Haixin; Ji, Rui; Ye, Wenfeng; Chen, Hongdan; Lai, Wenxiang; Fu, Qiang; Lou, Yonggen
2014-01-01
The brown planthopper (BPH), Nilaparvata lugens (Stål), one of the most serious rice insect pests in Asia, can quickly overcome rice resistance by evolving new virulent populations. The insect fat body plays essential roles in the life cycles of insects and in plant-insect interactions. However, whether differences in fat body transcriptomes exist between insect populations with different virulence levels and whether the transcriptomic differences are related to insect virulence remain largely unknown. In this study, we performed transcriptome-wide analyses on the fat bodies of two BPH populations with different virulence levels in rice. The populations were derived from rice variety TN1 (TN1 population) and Mudgo (M population). In total, 33,776 and 32,332 unigenes from the fat bodies of TN1 and M populations, respectively, were generated using Illumina technology. Gene ontology annotations and Kyoto Encyclopedia of Genes and Genomes (KEGG) orthology classifications indicated that genes related to metabolism and immunity were significantly active in the fat bodies. In addition, a total of 339 unigenes showed homology to genes of yeast-like symbionts (YLSs) from 12 genera and endosymbiotic bacteria Wolbachia. A comparative analysis of the two transcriptomes generated 7,860 differentially expressed genes. GO annotations and enrichment analysis of KEGG pathways indicated these differentially expressed transcripts might be involved in metabolism and immunity. Finally, 105 differentially expressed genes from YLSs and Wolbachia were identified, genes which might be associated with the formation of different virulent populations. This study was the first to compare the fat-body transcriptomes of two BPH populations having different virulence traits and to find genes that may be related to this difference. Our findings provide a molecular resource for future investigations of fat bodies and will be useful in examining the interactions between the fat body and virulence variation in the BPH.
Ji, Rui; Yu, Haixin; Fu, Qiang; Chen, Hongdan; Ye, Wenfeng; Li, Shaohui; Lou, Yonggen
2013-01-01
The brown planthopper (BPH), Nilaparvata lugens (Stål), a destructive rice pest in Asia, can quickly overcome rice resistance by evolving new virulent populations. Herbivore saliva plays an important role in plant-herbivore interactions, including in plant defense and herbivore virulence. However, thus far little is known about BPH saliva at the molecular level, especially its role in virulence and BPH-rice interaction. Using cDNA amplification in combination with Illumina short-read sequencing technology, we sequenced the salivary-gland transcriptomes of two BPH populations with different virulence; the populations were derived from rice variety TN1 (TN1 population) and Mudgo (M population). In total, 37,666 and 38,451 unigenes were generated from the salivary glands of these populations, respectively. When combined, a total of 43,312 unigenes were obtained, about 18 times more than the number of expressed sequence tags previously identified from these glands. Gene ontology annotations and KEGG orthology classifications indicated that genes related to metabolism, binding and transport were significantly active in the salivary glands. A total of 352 genes were predicted to encode secretory proteins, and some might play important roles in BPH feeding and BPH-rice interactions. Comparative analysis of the transcriptomes of the two populations revealed that the genes related to 'metabolism,' 'digestion and absorption,' and 'salivary secretion' might be associated with virulence. Moreover, 67 genes encoding putative secreted proteins were differentially expressed between the two populations, suggesting these genes may contribute to the change in virulence. This study was the first to compare the salivary-gland transcriptomes of two BPH populations having different virulence traits and to find genes that may be related to this difference. Our data provide a rich molecular resource for future functional studies on salivary glands and will be useful for elucidating the molecular mechanisms underlying BPH feeding and virulence differences.
Zhang, F Q; Lei, S Y; Gao, Q B; Khan, G; Xing, R; Yang, H L; Chen, S L
2015-05-18
Rhodiola alsia, which has been used widely in traditional Chinese medicine for a considerable time, grows on moist habitats at high altitude near the snow line. Microsatellite loci were developed for R. alsia to investigate its population genetics. In total, 17 polymorphic microsatellites were developed based on ESTs from the Illumina HiSeq(TM) 2000 platform. The microsatellite loci were checked for variability using 80 individuals of R. alsia sampled from four locations on the Qinghai-Tibet Plateau. The total number of alleles per locus ranged from 10 to 20, and the observed heterozygosity ranged from 0.000 to 1.000. The null allele frequency ranged from 0.000 to 0.324. These microsatellites are expected to be helpful in future studies of population genetics in R. alsia and related species.
Pećina-Šlaus, Nives; Kafka, Anja; Bukovac, Anja; Vladušić, Tomislav; Tomas, Davor; Hrašćan, Reno
2017-07-01
Postreplicative mismatch repair safeguards the stability of our genome. The defects in its functioning will give rise to microsatellite instability. In this study, 50 meningiomas were investigated for microsatellite instability. Two major mismatch repair genes, MLH1 and MSH2, were analyzed using microsatellite markers D1S1611 and BAT26 amplified by polymerase chain reaction and visualized by gel electrophoresis on high-resolution gels. Furthermore, genes DVL3 (D3S1262), AXIN1 (D16S3399), and CDH1 (D16S752) were also investigated for microsatellite instability. Our study revealed constant presence of microsatellite instability in meningioma patients when compared to their autologous blood DNA. Altogether 38% of meningiomas showed microsatellite instability at one microsatellite locus, 16% on two, and 13.3% on three loci. The percent of detected microsatellite instability for MSH2 gene was 14%, and for MLH1, it was 26%, for DVL3 22.9%, for AXIN1 17.8%, and for CDH1 8.3%. Since markers also allowed for the detection of loss of heterozygosity, gross deletions of MLH1 gene were found in 24% of meningiomas. Genetic changes between MLH1 and MSH2 were significantly positively correlated (p = 0.032). We also noted a positive correlation between genetic changes of MSH2 and DVL3 genes (p = 0.034). No significant associations were observed when MLH1 or MSH2 was tested against specific histopathological meningioma subtype or World Health Organization grade. However, genetic changes in DVL3 were strongly associated with anaplastic histology of meningioma (χ 2 = 9.14; p = 0.01). Our study contributes to better understanding of the genetic profile of human intracranial meningiomas and suggests that meningiomas harbor defective cellular DNA mismatch repair mechanisms.
Telfer, Emily J; Stovold, Grahame T; Li, Yongjun; Silva-Junior, Orzenil B; Grattapaglia, Dario G; Dungey, Heidi S
2015-01-01
Pedigree reconstruction using molecular markers enables efficient management of inbreeding in open-pollinated breeding strategies, replacing expensive and time-consuming controlled pollination. This is particularly useful in preferentially outcrossed, insect pollinated Eucalypts known to suffer considerable inbreeding depression from related matings. A single nucleotide polymorphism (SNP) marker panel consisting of 106 markers was selected for pedigree reconstruction from the recently developed high-density Eucalyptus Infinium SNP chip (EuCHIP60K). The performance of this SNP panel for pedigree reconstruction in open-pollinated progenies of two Eucalyptus nitens seed orchards was compared with that of two microsatellite panels with 13 and 16 markers respectively. The SNP marker panel out-performed one of the microsatellite panels in the resolution power to reconstruct pedigrees and out-performed both panels with respect to data quality. Parentage of all but one offspring in each clonal seed orchard was correctly matched to the expected seed parent using the SNP marker panel, whereas parentage assignment to less than a third of the expected seed parents were supported using the 13-microsatellite panel. The 16-microsatellite panel supported all but one of the recorded seed parents, one better than the SNP panel, although there was still a considerable level of missing and inconsistent data. SNP marker data was considerably superior to microsatellite data in accuracy, reproducibility and robustness. Although microsatellites and SNPs data provide equivalent resolution for pedigree reconstruction, microsatellite analysis requires more time and experience to deal with the uncertainties of allele calling and faces challenges for data transferability across labs and over time. While microsatellite analysis will continue to be useful for some breeding tasks due to the high information content, existing infrastructure and low operating costs, the multi-species SNP resource available with the EuCHIP60k, opens a whole new array of opportunities for high-throughput, genome-wide or targeted genotyping in species of Eucalyptus.
Hackler, J.C.; Van Den Bussche, Ronald A.; Leslie, David M.
2007-01-01
Two trinucleotide and seven tetranucleotide microsatellite loci were isolated from an alligator snapping turtle Macrochelys temminckii. To assess the degree of variability in these nine microsatellite loci, we genotyped 174 individuals collected from eight river drainage basins in the southeastern USA. These markers revealed a moderate degree of allelic diversity (six to 16 alleles per locus) and observed heterozygosity (0.166-0.686). These polymorphic microsatellite loci provide powerful tools for population genetic studies for a species that is afforded some level of conservation protection in every state in which it occurs. ?? 2006 The Authors.
NASA Technical Reports Server (NTRS)
Chang, Dong Kyung; Metzgar, David; Wills, Christopher; Boland, C. Richard
2003-01-01
All "minor" components of the human DNA mismatch repair (MMR) system-MSH3, MSH6, PMS2, and the recently discovered MLH3-contain mononucleotide microsatellites in their coding sequences. This intriguing finding contrasts with the situation found in the major components of the DNA MMR system-MSH2 and MLH1-and, in fact, most human genes. Although eukaryotic genomes are rich in microsatellites, non-triplet microsatellites are rare in coding regions. The recurring presence of exonal mononucleotide repeat sequences within a single family of human genes would therefore be considered exceptional.
Zhang, Chan; Liang, Jian; Yang, Le; Sun, Baoguo; Wang, Chengtao
2017-01-01
Monascus purpureus is an important medicinal and edible microbial resource. To facilitate biological, biochemical, and molecular research on medicinal components of M. purpureus, we investigated the M. purpureus transcriptome by RNA sequencing (RNA-seq). An RNA-seq library was created using RNA extracted from a mixed sample of M. purpureus expressing different levels of monacolin K output. In total 29,713 unigenes were assembled from more than 60 million high-quality short reads. A BLAST search revealed hits for 21,331 unigenes in at least one of the protein or nucleotide databases used in this study. The 22,365 unigenes were categorized into 48 functional groups based on Gene Ontology classification. Owing to the economic and medicinal importance of M. purpureus, most studies on this organism have focused on the pharmacological activity of chemical components and the molecular function of genes involved in their biogenesis. In this study, we performed quantitative real-time PCR to detect the expression of genes related to monacolin K (mokA-mokI) at different phases (2, 5, 8, and 12 days) of M. purpureus M1 and M1-36. Our study found that mokF modulates monacolin K biogenesis in M. purpureus. Nine genes were suggested to be associated with the monacolin K biosynthesis. Studies on these genes could provide useful information on secondary metabolic processes in M. purpureus. These results indicate a detailed resource through genetic engineering of monacolin K biosynthesis in M. purpureus and related species. PMID:28114365
Munoz, Sergio; Guerrero, Felix D.; Kellogg, Anastasia; Heekin, Andrew M.
2017-01-01
The cattle tick of Australia, Rhipicephalus australis, is a vector for microbial parasites that cause serious bovine diseases. The Haller’s organ, located in the tick’s forelegs, is crucial for host detection and mating. To facilitate the development of new technologies for better control of this agricultural pest, we aimed to sequence and annotate the transcriptome of the R. australis forelegs and associated tissues, including the Haller's organ. As G protein-coupled receptors (GPCRs) are an important family of eukaryotic proteins studied as pharmaceutical targets in humans, we prioritized the identification and classification of the GPCRs expressed in the foreleg tissues. The two forelegs from adult R. australis were excised, RNA extracted, and pyrosequenced with 454 technology. Reads were assembled into unigenes and annotated by sequence similarity. Python scripts were written to find open reading frames (ORFs) from each unigene. These ORFs were analyzed by different GPCR prediction approaches based on sequence alignments, support vector machines, hidden Markov models, and principal component analysis. GPCRs consistently predicted by multiple methods were further studied by phylogenetic analysis and 3D homology modeling. From 4,782 assembled unigenes, 40,907 possible ORFs were predicted. Using Blastp, Pfam, GPCRpred, TMHMM, and PCA-GPCR, a basic set of 46 GPCR candidates were compiled and a phylogenetic tree was constructed. With further screening of tertiary structures predicted by RaptorX, 6 likely GPCRs emerged and the strongest candidate was classified by PCA-GPCR to be a GABAB receptor. PMID:28231302
Munoz, Sergio; Guerrero, Felix D; Kellogg, Anastasia; Heekin, Andrew M; Leung, Ming-Ying
2017-01-01
The cattle tick of Australia, Rhipicephalus australis, is a vector for microbial parasites that cause serious bovine diseases. The Haller's organ, located in the tick's forelegs, is crucial for host detection and mating. To facilitate the development of new technologies for better control of this agricultural pest, we aimed to sequence and annotate the transcriptome of the R. australis forelegs and associated tissues, including the Haller's organ. As G protein-coupled receptors (GPCRs) are an important family of eukaryotic proteins studied as pharmaceutical targets in humans, we prioritized the identification and classification of the GPCRs expressed in the foreleg tissues. The two forelegs from adult R. australis were excised, RNA extracted, and pyrosequenced with 454 technology. Reads were assembled into unigenes and annotated by sequence similarity. Python scripts were written to find open reading frames (ORFs) from each unigene. These ORFs were analyzed by different GPCR prediction approaches based on sequence alignments, support vector machines, hidden Markov models, and principal component analysis. GPCRs consistently predicted by multiple methods were further studied by phylogenetic analysis and 3D homology modeling. From 4,782 assembled unigenes, 40,907 possible ORFs were predicted. Using Blastp, Pfam, GPCRpred, TMHMM, and PCA-GPCR, a basic set of 46 GPCR candidates were compiled and a phylogenetic tree was constructed. With further screening of tertiary structures predicted by RaptorX, 6 likely GPCRs emerged and the strongest candidate was classified by PCA-GPCR to be a GABAB receptor.
Chen, Zexiong; Tang, Ning; You, Yuming; Lan, Jianbin; Liu, Yiqing; Li, Zhengguo
2015-01-01
Lonicera macranthoides Hand.-Mazz (L. macranthoides) is a medicinal herb that is widely distributed in southern China. The biosynthetic and metabolic pathways for a core secondary metabolite in L. macranthoides, chlorogenic acid (CGA), have been elucidated in many species. However, the mechanisms of CGA biosynthesis and the related gene regulatory network in L. macranthoides are still not well understood. In this study, CGA content was quantified by high performance liquid chromatography (HPLC), and CGA levels differed significantly among three tissues; specifically, the CGA content in young leaves (YL) was greater than that in young stems (YS), which was greater than that in mature flowers (MF). Transcriptome analysis of L. macranthoides yielded a total of 53,533,014 clean reads (average length 90 bp) and 76,453 unigenes (average length 703 bp). A total of 3,767 unigenes were involved in biosynthesis pathways of secondary metabolites. Of these unigenes, 80 were possibly related to CGA biosynthesis. Furthermore, differentially expressed genes (DEGs) were screened in different tissues including YL, MF and YS. In these tissues, 24 DEGs were found to be associated with CGA biosynthesis, including six phenylalanine ammonia lyase (PAL) genes, six 4-coumarate coenzyme A ligase (4CL) genes, four cinnamate 4-Hydroxylase (C4H) genes, seven hydroxycinnamoyl transferase/hydroxycinnamoyl-CoA quinate transferase HCT/HQT genes and one coumarate 3-hydroxylase (C3H) gene.These results further the understanding of CGA biosynthesis and the related regulatory network in L. macranthoides. PMID:26381882
Ye, Hua; Xiao, Shijun; Wang, Xiaoqing; Wang, Zhiyong; Zhang, Zhengshi; Zhu, Chengke; Hu, Bingjie; Lv, Changhuan; Zheng, Shuming; Luo, Hui
2018-04-01
Schizothorax prenanti (S. prenanti) is an indigenous fish species and is popularly cultured in southwestern China. In recent years, intensive farming of S. prenanti and water quality deterioration has increased the susceptibility of this fish to various pathogens, including Aeromonas hydrophila (A. hydrophila), which has caused severe damage to S. prenanti production. However, the understanding of molecular immune response of S. prenanti to A. hydrophila infection is still lacking. In order to better comprehend the S. prenanti time series immune response process against A. hydrophila, we conducted the first transcriptomic comparison in S. prenanti spleen at 4, 24, and 48 h after the infection challenge of A. hydrophila against their control counterparts. In total, 628 million clean reads were obtained from 18 libraries and assembled into 262,745 transcripts. After eliminating sequence redundancy, 69,373 unigenes with an average length of 1476 bp were obtained. Comparative analysis revealed 1890 unigenes with significantly differential expression, including 172, 455, 589 upregulated and 27, 676, 551 unigenes downregulated genes for 4, 24, and 48 h post-infection, respectively. Differentially expressed genes (DEGs) were validated using qPCR for 15 randomly selected genes. Enrichment and pathway analysis of DEGs was carried out to understand the functions of the immune-related genes. Our results revealed that many important functional genes relating to complement and coagulation cascades, chemokine signaling pathway, toll-like receptor signaling pathway, NOD-like receptor signaling pathway and leukocyte transendothelial migration were regulated during the infection of A. hydrophila, and the expression of those genes reflected the transcriptome profiles during the challenging stages.
Transcriptome Analysis of Salt Tolerant Common Bean (Phaseolus vulgaris L.) under Saline Conditions
Hiz, Mahmut Can; Canher, Balkan; Niron, Harun; Turet, Muge
2014-01-01
Salinity is one of the important abiotic stress factors that limit crop production. Common bean, Phaseolus vulgaris L., a major protein source in developing countries, is highly affected by soil salinity and the information on genes that play a role in salt tolerance is scarce. We aimed to identify differentially expressed genes (DEGs) and related pathways by comprehensive analysis of transcriptomes of both root and leaf tissues of the tolerant genotype grown under saline and control conditions in hydroponic system. We have generated a total of 158 million high-quality reads which were assembled into 83,774 all-unigenes with a mean length of 813 bp and N50 of 1,449 bp. Among the all-unigenes, 58,171 were assigned with Nr annotations after homology analyses. It was revealed that 6,422 and 4,555 all-unigenes were differentially expressed upon salt stress in leaf and root tissues respectively. Validation of the RNA-seq quantifications (RPKM values) was performed by qRT-PCR (Quantitative Reverse Transcription PCR) analyses. Enrichment analyses of DEGs based on GO and KEGG databases have shown that both leaf and root tissues regulate energy metabolism, transmembrane transport activity, and secondary metabolites to cope with salinity. A total of 2,678 putative common bean transcription factors were identified and classified under 59 transcription factor families; among them 441 were salt responsive. The data generated in this study will help in understanding the fundamentals of salt tolerance in common bean and will provide resources for functional genomic studies. PMID:24651267
Gao, Jian Ping; Wang, Dong; Cao, Ling Ya; Sun, Hai Feng
2015-01-01
Background Codonopsis pilosula (Franch.) Nannf. is one of the most widely used medicinal plants. Although chemical and pharmacological studies have shown that codonopsis polysaccharides (CPPs) are bioactive compounds and that their composition is variable, their biosynthetic pathways remain largely unknown. Next-generation sequencing is an efficient and high-throughput technique that allows the identification of candidate genes involved in secondary metabolism. Principal Findings To identify the components involved in CPP biosynthesis, a transcriptome library, prepared using root and other tissues, was assembled with the help of Illumina sequencing. A total of 9.2 Gb of clean nucleotides was obtained comprising 91,175,044 clean reads, 102,125 contigs, and 45,511 unigenes. After aligning the sequences to the public protein databases, 76.1% of the unigenes were annotated. Among these annotated unigenes, 26,189 were assigned to Gene Ontology categories, 11,415 to Clusters of Orthologous Groups, and 18,848 to Kyoto Encyclopedia of Genes and Genomes pathways. Analysis of abundance of transcripts in the library showed that genes, including those encoding metallothionein, aquaporin, and cysteine protease that are related to stress responses, were in the top list. Among genes involved in the biosynthesis of CPP, those responsible for the synthesis of UDP-L-arabinose and UDP-xylose were highly expressed. Significance To our knowledge, this is the first study to provide a public transcriptome dataset prepared from C. pilosula and an outline of the biosynthetic pathway of polysaccharides in a medicinal plant. Identified candidate genes involved in CPP biosynthesis provide understanding of the biosynthesis and regulation of CPP at the molecular level. PMID:25719364
Shi, Jiaqin; Huang, Shunmou; Zhan, Jiepeng; Yu, Jingyin; Wang, Xinfa; Hua, Wei; Liu, Shengyi; Liu, Guihua; Wang, Hanzhong
2014-01-01
Although much research has been conducted, the pattern of microsatellite distribution has remained ambiguous, and the development/utilization of microsatellite markers has still been limited/inefficient in Brassica, due to the lack of genome sequences. In view of this, we conducted genome-wide microsatellite characterization and marker development in three recently sequenced Brassica crops: Brassica rapa, Brassica oleracea and Brassica napus. The analysed microsatellite characteristics of these Brassica species were highly similar or almost identical, which suggests that the pattern of microsatellite distribution is likely conservative in Brassica. The genomic distribution of microsatellites was highly non-uniform and positively or negatively correlated with genes or transposable elements, respectively. Of the total of 115 869, 185 662 and 356 522 simple sequence repeat (SSR) markers developed with high frequencies (408.2, 343.8 and 356.2 per Mb or one every 2.45, 2.91 and 2.81 kb, respectively), most represented new SSR markers, the majority had determined physical positions, and a large number were genic or putative single-locus SSR markers. We also constructed a comprehensive database for the newly developed SSR markers, which was integrated with public Brassica SSR markers and annotated genome components. The genome-wide SSR markers developed in this study provide a useful tool to extend the annotated genome resources of sequenced Brassica species to genetic study/breeding in different Brassica species. PMID:24130371
Genome-Wide Identification and Transferability of Microsatellite Markers between Palmae Species
Xiao, Yong; Xia, Wei; Ma, Jianwei; Mason, Annaliese S.; Fan, Haikuo; Shi, Peng; Lei, Xintao; Ma, Zilong; Peng, Ming
2016-01-01
The Palmae family contains 202 genera and approximately 2800 species. Except for Elaeis guineensis and Phoenix dactylifera, almost no genetic and genomic information is available for Palmae species. Therefore, this is an obstacle to the conservation and genetic assessment of Palmae species, especially those that are currently endangered. The study was performed to develop a large number of microsatellite markers which can be used for genetic analysis in different Palmae species. Based on the assembled genome of E. guineensis and P. dactylifera, a total of 814 383 and 371 629 microsatellites were identified. Among these microsatellites identified in E. guineensis, 734 509 primer pairs could be designed from the flanking sequences of these microsatellites. The majority (618 762) of these designed primer pairs had in silico products in the genome of E. guineensis. These 618 762 primer pairs were subsequently used to in silico amplify the genome of P. dactylifera. A total of 7 265 conserved microsatellites were identified between E. guineensis and P. dactylifera. One hundred and thirty-five primer pairs flanking the conserved SSRs were stochastically selected and validated to have high cross-genera transferability, varying from 16.7 to 93.3% with an average of 73.7%. These genome-wide conserved microsatellite markers will provide a useful tool for genetic assessment and conservation of different Palmae species in the future. PMID:27826307
Shi, Jiaqin; Huang, Shunmou; Zhan, Jiepeng; Yu, Jingyin; Wang, Xinfa; Hua, Wei; Liu, Shengyi; Liu, Guihua; Wang, Hanzhong
2014-02-01
Although much research has been conducted, the pattern of microsatellite distribution has remained ambiguous, and the development/utilization of microsatellite markers has still been limited/inefficient in Brassica, due to the lack of genome sequences. In view of this, we conducted genome-wide microsatellite characterization and marker development in three recently sequenced Brassica crops: Brassica rapa, Brassica oleracea and Brassica napus. The analysed microsatellite characteristics of these Brassica species were highly similar or almost identical, which suggests that the pattern of microsatellite distribution is likely conservative in Brassica. The genomic distribution of microsatellites was highly non-uniform and positively or negatively correlated with genes or transposable elements, respectively. Of the total of 115 869, 185 662 and 356 522 simple sequence repeat (SSR) markers developed with high frequencies (408.2, 343.8 and 356.2 per Mb or one every 2.45, 2.91 and 2.81 kb, respectively), most represented new SSR markers, the majority had determined physical positions, and a large number were genic or putative single-locus SSR markers. We also constructed a comprehensive database for the newly developed SSR markers, which was integrated with public Brassica SSR markers and annotated genome components. The genome-wide SSR markers developed in this study provide a useful tool to extend the annotated genome resources of sequenced Brassica species to genetic study/breeding in different Brassica species.
Yang, Lei; Cheng, Tian-Yin; Zhao, Fei-Yan
2017-02-22
Although Pomacea canaliculata is native to South and Central America, it has become one of the most abundant invasive species worldwide and causes extensive damage to agriculture and horticulture. Conventional physical and chemical techniques have been used to eliminate P. canaliculata, but the effects are not ideal. Therefore, it is important to devise a new method based on a greater understanding of the biology of P. canaliculata. However, the molecular mechanisms underlying digestion and absorption in P. canaliculata are not well understood due to the lack of available genomic information for this species, particularly for digestive enzyme genes. In the present study, hepatopancreas transcriptome sequencing produced over 223 million high-quality reads, and a global de novo assembly generated a total of 87,766 unique transcripts (unigenes), of which 19,942 (22.7%) had significant similarities to proteins in the UniProt database. In addition, 296,675 annotated sequences were associated with Gene Ontology (GO) terms. A Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment was performed for the unique unigenes, and 262 pathways (p-value < 10 -5 ) in P. canaliculata were found to be predominantly related to plant consumption and coarse fiber digestion and absorption. These transcripts were classified into four large categories: hydrolase, transferase, isomerase and cytochrome P450. The Reads Per Kilobase of transcript per Million mapped reads (RPKM) analysis showed that there were 523 down-regulated unigenes and 406 up-regulated unigenes in the starving apple snails compared with the satiated apple snails. Several important genes are associated with digestion and absorption in plants: endo-beta-1, 4-glucanase, xylanase, cellulase, cellulase EGX1, cellulase EGX3 and G-type lysozyme genes were identified. The qRT-PCR results confirmed that the expression patterns of these genes (except for the longipain gene) were consistent with the RNA-Seq results. Our results provide a more comprehensive understanding of the molecular genes associated with hepatopancreas functioning. Differentially expressed genes corresponding to critical metabolic pathways were detected in the transcriptome of starving apple snails compared with satiated apple snails. In addition to the cellulase gene, other genes were identified that may be important factors in plant matter metabolism in P. canaliculata, and this information has the potential to expedite the study of digestive physiology in apple snails.
Sahu, Binod B; Shaw, Birendra P
2009-01-01
Background Despite wealth of information generated on salt tolerance mechanism, its basics still remain elusive. Thus, there is a need of continued effort to understand the salt tolerance mechanism using suitable biotechnological techniques and test plants (species) to enable development of salt tolerant cultivars of interest. Therefore, the present study was undertaken to generate information on salt stress responsive genes in a natural halophyte, Suaeda maritima, using PCR-based suppression subtractive hybridization (PCR-SSH) technique. Results Forward and reverse SSH cDNA libraries were constructed after exposing the young plants to 425 mM NaCl for 24 h. From the forward SSH cDNA library, 429 high quality ESTs were obtained. BLASTX search and TIGR assembler programme revealed overexpression of 167 unigenes comprising 89 singletons and 78 contigs with ESTs redundancy of 81.8%. Among the unigenes, 32.5% were found to be of special interest, indicating novel function of these genes with regard to salt tolerance. Literature search for the known unigenes revealed that only 17 of them were salt-inducible. A comparative analysis of the existing SSH cDNA libraries for NaCl stress in plants showed that only a few overexpressing unigenes were common in them. Moreover, the present study also showed increased expression of phosphoethanolamine N-methyltransferase gene, indicating the possible accumulation of a much studied osmoticum, glycinebetaine, in halophyte under salt stress. Functional categorization of the proteins as per the Munich database in general revealed that salt tolerance could be largely determined by the proteins involved in transcription, signal transduction, protein activity regulation and cell differentiation and organogenesis. Conclusion The study provided a clear indication of possible vital role of glycinebetaine in the salt tolerance process in S. maritima. However, the salt-induced expression of a large number of genes involved in a wide range of cellular functions was indicative of highly complex nature of the process as such. Most of the salt inducible genes, nonetheless, appeared to be species-specific. In light of the observations made, it is reasonable to emphasize that a comparative analysis of ESTs from SSH cDNA libraries generated systematically for a few halophytes with varying salt exposure time may clearly identify the key salt tolerance determinant genes to a minimum number, highly desirable for any genetic manipulation adventure. PMID:19497134