agglutinin-like sequence gene: Topics by Science.gov

Sample records for agglutinin-like sequence gene

Candida albicans Agglutinin-Like Sequence (Als) Family Vignettes: A Review of Als Protein Structure and Function

PubMed Central

Hoyer, Lois L.; Cota, Ernesto

2016-01-01

Approximately two decades have passed since the description of the first gene in the Candida albicans ALS (agglutinin-like sequence) family. Since that time, much has been learned about the composition of the family and the function of its encoded cell-surface glycoproteins. Solution of the structure of the Als adhesive domain provides the opportunity to evaluate the molecular basis for protein function. This review article is formatted as a series of fundamental questions and explores the diversity of the Als proteins, as well as their role in ligand binding, aggregative effects, and attachment to abiotic surfaces. Interaction of Als proteins with each other, their functional equivalence, and the effects of protein abundance on phenotypic conclusions are also examined. Structural features of Als proteins that may facilitate invasive function are considered. Conclusions that are firmly supported by the literature are presented while highlighting areas that require additional investigation to reveal basic features of the Als proteins, their relatedness to each other, and their roles in C. albicans biology. PMID:27014205
Characterization of Urtica dioica agglutinin isolectins and the encoding gene family.

PubMed

Does, M P; Ng, D K; Dekker, H L; Peumans, W J; Houterman, P M; Van Damme, E J; Cornelissen, B J

1999-01-01

Urtica dioica agglutinin (UDA) has previously been found in roots and rhizomes of stinging nettles as a mixture of UDA-isolectins. Protein and cDNA sequencing have shown that mature UDA is composed of two hevein domains and is processed from a precursor protein. The precursor contains a signal peptide, two in-tandem hevein domains, a hinge region and a carboxyl-terminal chitinase domain. Genomic fragments encoding precursors for UDA-isolectins have been amplified by five independent polymerase chain reactions on genomic DNA from stinging nettle ecotype Weerselo. One amplified gene was completely sequenced. As compared to the published cDNA sequence, the genomic sequence contains, besides two basepair substitutions, two introns located at the same positions as in other plant chitinases. By partial sequence analysis of 40 amplified genes, 16 different genes were identified which encode seven putative UDA-isolectins. The deduced amino acid sequences share 78.9-98.9% identity. In extracts of roots and rhizomes of stinging nettle ecotype Weerselo six out of these seven isolectins were detected by mass spectrometry. One of them is an acidic form, which has not been identified before. Our results demonstrate that UDA is encoded by a large gene family.
The primary structure of stinging nettle (Urtica dioica) agglutinin. A two-domain member of the hevein family.

PubMed

Beintema, J J; Peumans, W J

1992-03-09

The primary structure of stinging nettle (Urtica dioica) agglutinin has been determined by sequence analysis of peptides obtained from three overlapping proteolytic digests. The sequence of 80 residues consists of two hevein-like domains with the same spacing of half-cystine residues and several other conserved residues as observed earlier in other proteins with hevein-like domains. The hinge region between the two domains is four residues longer than those between the four domains in cereal lectins like wheat germ agglutinin.
Mouse mammary tumor virus-like gene sequences are present in lung patient specimens

PubMed Central

2011-01-01

Background Previous studies have reported on the presence of Murine Mammary Tumor Virus (MMTV)-like gene sequences in human cancer tissue specimens. Here, we search for MMTV-like gene sequences in lung diseases including carcinomas specimens from a Mexican population. This study was based on our previous study reporting that the INER51 lung cancer cell line, from a pleural effusion of a Mexican patient, contains MMTV-like env gene sequences. Results The MMTV-like env gene sequences have been detected in three out of 18 specimens studied, by PCR using a specific set of MMTV-like primers. The three identified MMTV-like gene sequences, which were assigned as INER6, HZ101, and HZ14, were 99%, 98%, and 97% homologous, respectively, as compared to GenBank sequence accession number AY161347. The INER6 and HZ-101 samples were isolated from lung cancer specimens, and the HZ-14 was isolated from an acute inflammatory lung infiltrate sample. Two of the env sequences exhibited disruption of the reading frame due to mutations. Conclusion In summary, we identified the presence of MMTV-like gene sequences in 2 out of 11 (18%) of the lung carcinomas and 1 out of 7 (14%) of acute inflamatory lung infiltrate specimens studied of a Mexican Population. PMID:21943279
The gene for stinging nettle lectin (Urtica dioica agglutinin) encodes both a lectin and a chitinase.

PubMed

Lerner, D R; Raikhel, N V

1992-06-05

Chitin-binding proteins are present in a wide range of plant species, including both monocots and dicots, even though these plants contain no chitin. To investigate the relationship between in vitro antifungal and insecticidal activities of chitin-binding proteins and their unknown endogenous functions, the stinging nettle lectin (Urtica dioica agglutinin, UDA) cDNA was cloned using a synthetic gene as the probe. The nettle lectin cDNA clone contained an open reading frame encoding 374 amino acids. Analysis of the deduced amino acid sequence revealed a 21-amino acid putative signal sequence and the 86 amino acids encoding the two chitin-binding domains of nettle lectin. These domains were fused to a 19-amino acid "spacer" domain and a 244-amino acid carboxyl extension with partial identity to a chitinase catalytic domain. The authenticity of the cDNA clone was confirmed by deduced amino acid sequence identity with sequence data obtained from tryptic digests, RNA gel blot, and polymerase chain reaction analyses. RNA gel blot analysis also showed the nettle lectin message was present primarily in rhizomes and inflorescence (with immature seeds) but not in leaves or stems. Chitinase enzymatic activity was found when the chitinase-like domain alone or the chitinase-like domain with the chitin-binding domains were expressed in Escherichia coli. This is the first example of a chitin-binding protein with both a duplication of the 43-amino acid chitin-binding domain and a fusion of the chitin-binding domains to a structurally unrelated domain, the chitinase domain.
Cloning of a CACTA transposon-like insertion in intron I of tomato invertase Lin5 gene and identification of transposase-like sequences of Solanaceae species.

PubMed

Proels, Reinhard K; Roitsch, Thomas

2006-03-01

Very few CACTA transposon-like sequences have been described in Solanaceae species. Sequence information has been restricted to partial transposase (TPase)-like fragments, and no target gene of CACTA-like transposon insertion has been described in tomato to date. In this manuscript, we report on a CACTA transposon-like insertion in intron I of tomato (Lycopersicon esculentum) invertase gene Lin5 and TPase-like sequences of several Solanaceae species. Consensus primers deduced from the TPase region of the tomato CACTA transposon-like element allowed the amplification of similar sequences from various Solanaceae species of different subfamilies including Solaneae (Solanum tuberosum), Cestreae (Nicotiana tabacum) and Datureae (Datura stramonium). This demonstrates the ubiquitous presence of CACTA-like elements in Solanaceae genomes. The obtained partial sequences are highly conserved, and allow further detection and detailed analysis of CACTA-like transposons throughout Solanaceae species. CACTA-like transposon sequences make possible the evaluation of their use for genome analysis, functional studies of genes and the evolutionary relationships between plant species.
Salivary agglutinin, which binds Streptococcus mutans and Helicobacter pylori, is the lung scavenger receptor cysteine-rich protein gp-340.

PubMed

Prakobphol, A; Xu, F; Hoang, V M; Larsson, T; Bergstrom, J; Johansson, I; Frängsmyr, L; Holmskov, U; Leffler, H; Nilsson, C; Borén, T; Wright, J R; Strömberg, N; Fisher, S J

2000-12-22

Salivary agglutinin is a high molecular mass component of human saliva that binds Streptococcus mutans, an oral bacterium implicated in dental caries. To study its protein sequence, we isolated the agglutinin from human parotid saliva. After trypsin digestion, a portion was analyzed by matrix-assisted laser/desorption ionization time-of-flight mass spectrometry (MALDI-TOF MS), which gave the molecular mass of 14 unique peptides. The remainder of the digest was subjected to high performance liquid chromatography, and the separated peptides were analyzed by MALDI-TOF/post-source decay; the spectra gave the sequences of five peptides. The molecular mass and peptide sequence information showed that salivary agglutinin peptides were identical to sequences in lung (lavage) gp-340, a member of the scavenger receptor cysteine-rich protein family. Immunoblotting with antibodies that specifically recognized either lung gp-340 or the agglutinin confirmed that the salivary agglutinin was gp-340. Immunoblotting with an antibody specific to the sialyl Le(x) carbohydrate epitope detected expression on the salivary but not the lung glycoprotein, possible evidence of different glycoforms. The salivary agglutinin also interacted with Helicobacter pylori, implicated in gastritis and peptic ulcer disease, Streptococcus agalactiae, implicated in neonatal meningitis, and several oral commensal streptococci. These results identify the salivary agglutinin as gp-340 and suggest it binds bacteria that are important determinants of either the oral ecology or systemic diseases.
Llama-derived single domain antibodies specific for Abrus agglutinin.

PubMed

Goldman, Ellen R; Anderson, George P; Zabetakis, Dan; Walper, Scott; Liu, Jinny L; Bernstein, Rachael; Calm, Alena; Carney, James P; O'Brien, Thomas W; Walker, Jennifer L; Garber, Eric A E

2011-11-01

Llama derived single domain antibodies (sdAb), the recombinantly expressed variable heavy domains from the unique heavy-chain only antibodies of camelids, were isolated from a library derived from llamas immunized with a commercial abrin toxoid preparation. Abrin is a potent toxin similar to ricin in structure, sequence and mechanism of action. The selected sdAb were evaluated for their ability to bind to commercial abrin as well as abrax (a recombinant abrin A-chain), purified abrin fractions, Abrus agglutinin (a protein related to abrin but with lower toxicity), ricin, and unrelated proteins. Isolated sdAb were also evaluated for their ability to refold after heat denaturation and ability to be used in sandwich assays as both capture and reporter elements. The best binders were specific for the Abrus agglutinin, showing minimal binding to purified abrin fractions or unrelated proteins. These binders had sub nM affinities and regained most of their secondary structure after heating to 95 °C. They functioned well in sandwich assays. Through gel analysis and the behavior of anti-abrin monoclonal antibodies, we determined that the commercial toxoid preparation used for the original immunizations contained a high percentage of Abrus agglutinin, explaining the selection of Abrus agglutinin binders. Used in conjunction with anti-abrin monoclonal and polyclonal antibodies, these reagents can fill a role to discriminate between the highly toxic abrin and the related, but much less toxic, Abrus agglutinin and distinguish between different crude preparations.
Llama-Derived Single Domain Antibodies Specific for Abrus Agglutinin

PubMed Central

Goldman, Ellen R.; Anderson, George P.; Zabetakis, Dan; Walper, Scott; Liu, Jinny L.; Bernstein, Rachael; Calm, Alena; Carney, James P.; O’Brien, Thomas W.; Walker, Jennifer L.; Garber, Eric A. E.

2011-01-01

Llama derived single domain antibodies (sdAb), the recombinantly expressed variable heavy domains from the unique heavy-chain only antibodies of camelids, were isolated from a library derived from llamas immunized with a commercial abrin toxoid preparation. Abrin is a potent toxin similar to ricin in structure, sequence and mechanism of action. The selected sdAb were evaluated for their ability to bind to commercial abrin as well as abrax (a recombinant abrin A-chain), purified abrin fractions, Abrus agglutinin (a protein related to abrin but with lower toxicity), ricin, and unrelated proteins. Isolated sdAb were also evaluated for their ability to refold after heat denaturation and ability to be used in sandwich assays as both capture and reporter elements. The best binders were specific for the Abrus agglutinin, showing minimal binding to purified abrin fractions or unrelated proteins. These binders had sub nM affinities and regained most of their secondary structure after heating to 95 °C. They functioned well in sandwich assays. Through gel analysis and the behavior of anti-abrin monoclonal antibodies, we determined that the commercial toxoid preparation used for the original immunizations contained a high percentage of Abrus agglutinin, explaining the selection of Abrus agglutinin binders. Used in conjunction with anti-abrin monoclonal and polyclonal antibodies, these reagents can fill a role to discriminate between the highly toxic abrin and the related, but much less toxic, Abrus agglutinin and distinguish between different crude preparations. PMID:22174977
The specificity of Centruroides sculpturatus Ewing (Arizona lethal scorpion) hemolymph agglutinins.

PubMed

Vasta, G R; Cohen, E

1982-01-01

C. sculpturatus sera agglutinate human erythrocytes independently of the ABO blood group, enzyme treatment, incubation temperature or sex of the scorpions. Tested with human lymphocytes and reptile and bird erythrocytes, C. sculpturatus serum reacts like an anti-sialic acid agglutinin. With leukemic lymphocytes, titers are higher than with normal lymphocytes. Mammalian erythrocytes show characteristic agglutination patterns for C. sculpturatus for Limulus polyphemus (horseshoe crab) that suggest different receptors for agglutinins of both species. Cross absorption and elution experiments indicate the presence of at least two specific agglutinins in C. sculpturatus serum. Agglutination is inhibited by N-acetylneuraminic acid and N-glycolyneuraminic acid, for all erythrocytes tested. Calcium is required for optimal activity of C. sculpturatus agglutinins. C. sculpturatus agglutinating activity is destroyed at 65% degrees C for 20 minutes. Titers are decreased by 2-mercaptoethanol, and more so after alkylation with iodoacetic acid suggesting that disulfide bonds are present in C. sculpturatus agglutinin molecules.
The Urtica dioica Agglutinin Is a Complex Mixture of Isolectins 1

PubMed Central

Van Damme, Els J. M.; Broekaert, Willem F.; Peumans, Willy J.

1988-01-01

Rhizomes of stinging nettle (Urtica dioica) contain a complex mixture of isolectins. Ion exchange chromatography with a high resolution fast protein liquid chromatography system revealed six isoforms which exhibit identical agglutination properties and carbohydrate-binding specificity and in addition have the same molecular structure and virtually identical biochemical properties. However, since the U. dioica agglutinin isolectins differ definitely with respect to their amino acid composition, it is likely that at least some of them are different polypeptides coded for by different genes. Images Fig. 3 PMID:16665952
A ZFY-like sequence in fish, with comments on the evolution of the ZFY family of genes in vertebrates

DOE Office of Scientific and Technical Information (OSTI.GOV)

Zimmerer, E.J.; Threlkeld, L.

1995-08-01

ZFY-like genes have been observed in a variety of vertebrate species. Although originally implicated as the primary testis-determining gene in humans and other placental mammals, more recent evidence indicates a role(s) outside that of testis determination. In this study, DNA from five species of fish, Carasius auratus, Rivulus marmoratus, Xiphophorus maculatus, X. milleri, and X. nigrensis was subjected to Southern blot analysis using a PCR-amplified fragment of mouse ZFY-like sequence as a probe. Restriction fragment patterns were not polymorphic between sexes in any one species but showed a different pattern for each species. With one exception, Rivulus, a 3.1-kb bandmore » from the EcoRI digestion was common to all. Sequence and open reading frame analysis of this fragment showed a strong homology to other known vertebrate ZFY-like genes. Of particular interest in this gene is a novel third finger domain similar to one human and one alligator ZFY-like gene. Our studies and others provide evidence for a family of vertebrate ZFY genes, with those having this novel third finger being representative of the ancestral condition. 30 refs., 3 figs., 3 tabs.« less
Molecular Characterization and Comparative Sequence Analysis of Defense-Related Gene, Oryza rufipogon Receptor-Like Protein Kinase 1

PubMed Central

Law, Yee-Song; Gudimella, Ranganath; Song, Beng-Kah; Ratnam, Wickneswari; Harikrishna, Jennifer Ann

2012-01-01

Many of the plant leucine rich repeat receptor-like kinases (LRR-RLKs) have been found to regulate signaling during plant defense processes. In this study, we selected and sequenced an LRR-RLK gene, designated as Oryza rufipogon receptor-like protein kinase 1 (OrufRPK1), located within yield QTL yld1.1 from the wild rice Oryza rufipogon (accession IRGC105491). A 2055 bp coding region and two exons were identified. Southern blotting determined OrufRPK1 to be a single copy gene. Sequence comparison with cultivated rice orthologs (OsI219RPK1, OsI9311RPK1 and OsJNipponRPK1, respectively derived from O. sativa ssp. indica cv. MR219, O. sativa ssp. indica cv. 9311 and O. sativa ssp. japonica cv. Nipponbare) revealed the presence of 12 single nucleotide polymorphisms (SNPs) with five non-synonymous substitutions, and 23 insertion/deletion sites. The biological role of the OrufRPK1 as a defense related LRR-RLK is proposed on the basis of cDNA sequence characterization, domain subfamily classification, structural prediction of extra cellular domains, cluster analysis and comparative gene expression. PMID:22942769
Sequence and expression variations suggest an adaptive role for the DA1-like gene family in the evolution of soybeans.

PubMed

Zhao, Man; Gu, Yongzhe; He, Lingli; Chen, Qingshan; He, Chaoying

2015-05-15

The DA1 gene family is plant-specific and Arabidopsis DA1 regulates seed and organ size, but the functions in soybeans are unknown. The cultivated soybean (Glycine max) is believed to be domesticated from the annual wild soybeans (Glycine soja). To evaluate whether DA1-like genes were involved in the evolution of soybeans, we compared variation at both sequence and expression levels of DA1-like genes from G. max (GmaDA1) and G. soja (GsoDA1). Sequence identities were extremely high between the orthologous pairs between soybeans, while the paralogous copies in a soybean species showed a relatively high divergence. Moreover, the expression variation of DA1-like paralogous genes in soybean was much greater than the orthologous gene pairs between the wild and cultivated soybeans during development and challenging abiotic stresses such as salinity. We further found that overexpressing GsoDA1 genes did not affect seed size. Nevertheless, overexpressing them reduced transgenic Arabidopsis seed germination sensitivity to salt stress. Moreover, most of these genes could improve salt tolerance of the transgenic Arabidopsis plants, corroborated by a detection of expression variation of several key genes in the salt-tolerance pathways. Our work suggested that expression diversification of DA1-like genes is functionally associated with adaptive radiation of soybeans, reinforcing that the plant-specific DA1 gene family might have contributed to the successful adaption to complex environments and radiation of the plants.
Gene end-like sequences within the 3' non-coding region of the Nipah virus genome attenuate viral gene transcription.

PubMed

Sugai, Akihiro; Sato, Hiroki; Yoneda, Misako; Kai, Chieko

2017-08-01

The regulation of transcription during Nipah virus (NiV) replication is poorly understood. Using a bicistronic minigenome system, we investigated the involvement of non-coding regions (NCRs) in the transcriptional re-initiation efficiency of NiV RNA polymerase. Reporter assays revealed that attenuation of NiV gene expression was not constant at each gene junction, and that the attenuating property was controlled by the 3' NCR. However, this regulation was independent of the gene-end, gene-start and intergenic regions. Northern blot analysis indicated that regulation of viral gene expression by the phosphoprotein (P) and large protein (L) 3' NCRs occurred at the transcription level. We identified uridine-rich tracts within the L 3' NCR that are similar to gene-end signals. These gene-end-like sequences were recognized as weak transcription termination signals by the viral RNA polymerase, thereby reducing downstream gene transcription. Thus, we suggest that NiV has a unique mechanism of transcriptional regulation. Copyright © 2017 Elsevier Inc. All rights reserved.
Intestinal flora of FAP patients containing APC-like sequences.

PubMed

Hainova, K; Adamcikova, Z; Ciernikova, S; Stevurkova, V; Tyciakova, S; Zajac, V

2014-01-01

Colorectal cancer mortality is one of the most common cause of cancer-related mortality. A multiple risk factors are associated with colorectal cancer, including hereditary, enviromental and inflammatory syndromes affecting the gastrointestinal tract. Familial adenomatous polyposis (FAP) is characterized by the emergence of hundreds to thousands of colorectal adenomatous polyps and FAP syndrome is caused by mutations within the adenomatous polyposis coli (APC) tumor suppressor gene. We analyzed 21 rectal bacterial subclones isolated from FAP patient 41-1 with confirmed 5bp ACAAA deletion within codons 1060-1063 for the presence of APC-like sequences in longest exon 15. The studied section was defined by primers 15Efor-15Erev, what correlates with mutation cluster region (MCR) in which the 75% of all APC germline mutations were detected. More than 90% homology was showed by sequencing and subsequent software comparison. The expression of APC-like sequences was demostrated by Western blot analysis using monoclonal and polyclonal antibodies against APC protein. To study missing link between the DNA analysis (PCR, DNA sequencing) and protein expresion experiments (Western blotting) we analyzed bacterial transcripts containing the 15Efor-15Erev sequence of APC gene by reverse transcription-PCR, what indicated that an APC gene derived fragment may be produced. We observed 97-100 % homology after computer comparison of cDNA PCR products. Our results suggest that presence of APC-like sequences in intestinal/rectal bacteria is enrichment of bacterial genetic information in which horizontal gene transfer between humans and microflora play an important role.
Identification of canine T lymphocytes by membrane receptor to peanut agglutinin: T-lymphocyte identification in dogs with lupus-like syndrome.

PubMed

Rigal, D; Bendali-Ahcène, S; Monier, J C; Mohana, K; Fournel, C

1983-09-01

Canine T lymphocytes were detected, using fluorescent peanut agglutinin (PNA) as a marker. Using a fluorescent technique and cytofluorometry, 70 +/- 11% and 72.4%, respectively, of peripheral blood lymphocytes were bound to PNA. Of thymocytes, 97 +/- 4.5% were detected by fluorescent PNA, but less than 1% were detected for lymphocytes from bone marrow. The T-lymphocyte depletion and enrichment indicated that PNA was bound to lymphocytes recognized by anti-T-lymphocyte heterologous serum. A T-lymphocyte deficiency was detected among 8 dogs with a lupus-like syndrome.
Properties of a U1 RNA enhancer-like sequence.

PubMed Central

Ciliberto, G; Palla, F; Tebb, G; Mattaj, I W; Philipson, L

1987-01-01

The properties of a X.laevis U1B snRNA gene enhancer have been studied by microinjection in Xenopus oocytes. The enhancer-like sequence, defined as a short DNA stretch that is able to activate transcription in an orientation independent manner, is interchangeable between different U snRNA genes. The enhancer sequence alone does not, however, efficiently activate transcription from an SV40 pol II promoter but regains its activity when combined with the U-gene specific proximal sequence element. DNase I protection experiments show that the X.laevis U1B enhancer can interact specifically with a nuclear factor present in mammalian cells. Images PMID:3031597
Diverse nucleotide compositions and sequence fluctuation in Rubisco protein genes

NASA Astrophysics Data System (ADS)

Holden, Todd; Dehipawala, S.; Cheung, E.; Bienaime, R.; Ye, J.; Tremberger, G., Jr.; Schneider, P.; Lieberman, D.; Cheung, T.

2011-10-01

The Rubisco protein-enzyme is arguably the most abundance protein on Earth. The biology dogma of transcription and translation necessitates the study of the Rubisco genes and Rubisco-like genes in various species. Stronger correlation of fractal dimension of the atomic number fluctuation along a DNA sequence with Shannon entropy has been observed in the studied Rubisco-like gene sequences, suggesting a more diverse evolutionary pressure and constraints in the Rubisco sequences. The strategy of using metal for structural stabilization appears to be an ancient mechanism, with data from the porphobilinogen deaminase gene in Capsaspora owczarzaki and Monosiga brevicollis. Using the chi-square distance probability, our analysis supports the conjecture that the more ancient Rubisco-like sequence in Microcystis aeruginosa would have experienced very different evolutionary pressure and bio-chemical constraint as compared to Bordetella bronchiseptica, the two microbes occupying either end of the correlation graph. Our exploratory study would indicate that high fractal dimension Rubisco sequence would support high carbon dioxide rate via the Michaelis- Menten coefficient; with implication for the control of the whooping cough pathogen Bordetella bronchiseptica, a microbe containing a high fractal dimension Rubisco-like sequence (2.07). Using the internal comparison of chi-square distance probability for 16S rRNA (~ E-22) versus radiation repair Rec-A gene (~ E-05) in high GC content Deinococcus radiodurans, our analysis supports the conjecture that high GC content microbes containing Rubisco-like sequence are likely to include an extra-terrestrial origin, relative to Deinococcus radiodurans. Similar photosynthesis process that could utilize host star radiation would not compete with radiation resistant process from the biology dogma perspective in environments such as Mars and exoplanets.
In vitro agglutinin production by earthworm leukocytes.

PubMed

Stein, E A; Cooper, E L

1988-01-01

Leukocytes of the earthworm, Lumbricus terrestris, secrete agglutinins in vitro, as shown by measuring agglutinin titers of the culture medium and by observing secretory rosette formation by leukocytes with erythrocytes. Leukocytes form the highest percentages of secretory rosettes with rabbit erythrocytes (RBC) and with other RBC species in the order: rat, guinea pig, mouse, calf, sheep, horse, goat. Leukocytes displayed allotypic specificity by forming rosettes selectively with erythrocytes from different individual rabbits. Eight sugars inhibited rosette formation, along with the polysaccharide mannan and the glycoproteins thyroglobulin and bovine submaxillary mucin. Cyclohexamide did not affect rosette formation, suggesting that agglutinins may be preformed and stored in leukocytes prior to secretion. Leukocytes also formed E-type rosettes with erythrocytes, but apparently utilized different receptors from those of secretory rosettes since they were not inhibited by the same sugars.

High time for a roll call: gene duplication and phylogenetic relationships of TCP-like genes in monocots

PubMed Central

Mondragón-Palomino, Mariana; Trontin, Charlotte

2011-01-01

Background and Aims The TCP family is an ancient group of plant developmental transcription factors that regulate cell division in vegetative and reproductive structures and are essential in the establishment of flower zygomorphy. In-depth research on eudicot TCPs has documented their evolutionary and developmental role. This has not happened to the same extent in monocots, although zygomorphy has been critical for the diversification of Orchidaceae and Poaceae, the largest families of this group. Investigating the evolution and function of TCP-like genes in a wider group of monocots requires a detailed phylogenetic analysis of all available sequence information and a system that facilitates comparing genetic and functional information. Methods The phylogenetic relationships of TCP-like genes in monocots were investigated by analysing sequences from the genomes of Zea mays, Brachypodium distachyon, Oryza sativa and Sorghum bicolor, as well as EST data from several other monocot species. Key Results All available monocot TCP-like sequences are associated in 20 major groups with an average identity ≥64 % and most correspond to well-supported clades of the phylogeny. Their sequence motifs and relationships of orthology were documented and it was found that 67 % of the TCP-like genes of Sorghum, Oryza, Zea and Brachypodium are in microsyntenic regions. This analysis suggests that two rounds of whole genome duplication drove the expansion of TCP-like genes in these species. Conclusions A system of classification is proposed where putative or recognized monocot TCP-like genes are assigned to a specific clade of PCF-, CIN- or CYC/tb1-like genes. Specific biases in sequence data of this family that must be tackled when studying its molecular evolution and phylogeny are documented. Finally, the significant retention of duplicated TCP genes from Zea mays is considered in the context of balanced gene drive. PMID:21444336
Transcriptional Signatures in Response to Wheat Germ Agglutinin and Starvation in Drosophila melanogaster Larval Midgut

USDA-ARS?s Scientific Manuscript database

One function of plant lectins such as wheat germ agglutinin (WGA) is to serve as defenses against herbivorous insects. The midgut is one critical site affected by dietary lectins. We observed marked cellular, structural, and gene expression changes in the midguts of Drosophila melanogaster third-i...
International interlaboratory study comparing single organism 16S rRNA gene sequencing data: Beyond consensus sequence comparisons

PubMed Central

Olson, Nathan D.; Lund, Steven P.; Zook, Justin M.; Rojas-Cornejo, Fabiola; Beck, Brian; Foy, Carole; Huggett, Jim; Whale, Alexandra S.; Sui, Zhiwei; Baoutina, Anna; Dobeson, Michael; Partis, Lina; Morrow, Jayne B.

2015-01-01

This study presents the results from an interlaboratory sequencing study for which we developed a novel high-resolution method for comparing data from different sequencing platforms for a multi-copy, paralogous gene. The combination of PCR amplification and 16S ribosomal RNA gene (16S rRNA) sequencing has revolutionized bacteriology by enabling rapid identification, frequently without the need for culture. To assess variability between laboratories in sequencing 16S rRNA, six laboratories sequenced the gene encoding the 16S rRNA from Escherichia coli O157:H7 strain EDL933 and Listeria monocytogenes serovar 4b strain NCTC11994. Participants performed sequencing methods and protocols available in their laboratories: Sanger sequencing, Roche 454 pyrosequencing®, or Ion Torrent PGM®. The sequencing data were evaluated on three levels: (1) identity of biologically conserved position, (2) ratio of 16S rRNA gene copies featuring identified variants, and (3) the collection of variant combinations in a set of 16S rRNA gene copies. The same set of biologically conserved positions was identified for each sequencing method. Analytical methods using Bayesian and maximum likelihood statistics were developed to estimate variant copy ratios, which describe the ratio of nucleotides at each identified biologically variable position, as well as the likely set of variant combinations present in 16S rRNA gene copies. Our results indicate that estimated variant copy ratios at biologically variable positions were only reproducible for high throughput sequencing methods. Furthermore, the likely variant combination set was only reproducible with increased sequencing depth and longer read lengths. We also demonstrate novel methods for evaluating variable positions when comparing multi-copy gene sequence data from multiple laboratories generated using multiple sequencing technologies. PMID:27077030
Report of cold agglutinins in a patient with acute ischemic stroke.

PubMed

Jin, Haiqiang; Sun, Wei; Sun, Yongan; Huang, Yining; Sun, Yunchuang

2015-10-30

Studies on the role of cold agglutinins in the pathogenesis of acute ischemic stroke are scarce. We present a case of an elderly man with acute cerebral infarction probably due to cold agglutinin disease. On a cold morning, a 71-year-old male of Han nationality with a complaint of sudden onset left-sided weakness and difficulty in speaking was brought to the emergency department. Diffusion weighted magnetic resonance imaging of the brain showed a high-intensity area in the right basal ganglia and corona radiata. Laboratory test showed the presence of high titers of cold agglutinins. There was no history of common risk factors of atherosclerosis, such as hypertension, diabetes mellitus, coronary artery disease or smoking. After being exposed to warm temperature, and with corticosteroid therapy and blood transfusion, the patient's symptoms relieved rapidly. We report here the first case of cerebral infarction probably due to the cold agglutinin disease. The underlying mechanism of cold agglutinins in the pathogenesis of acute ischemic stroke needs to be investigated further.
Antibody interactions with Ricinus communis agglutinins studied by biolayer interferometry

USDA-ARS?s Scientific Manuscript database

Two related agglutinins are present in the seeds of Ricinus communis (castor): ricin, a dichain ribosome-inactivating protein and Ricinus communis agglutinin-1 (RCA-1), a much less toxic hemagglutinin. Because ricin has been used for experimental cancer chemotherapy as well as for intentional poison...
Dynamics of Agglutinin-Like Sequence (ALS) Protein Localization on the Surface of Candida Albicans

ERIC Educational Resources Information Center

Coleman, David Andrew

2009-01-01

The ALS gene family encodes large cell-surface glycoproteins associated with "C. albicans" pathogenesis. Als proteins are thought to act as adhesin molecules binding to host tissues. Wide variation in expression levels among the ALS genes exists and is related to cell morphology and environmental conditions. "ALS1," "ALS3," and "ALS4" are three of…
Small tandemly repeated DNA sequences of higher plants likely originate from a tRNA gene ancestor.

PubMed Central

Benslimane, A A; Dron, M; Hartmann, C; Rode, A

1986-01-01

Several monomers (177 bp) of a tandemly arranged repetitive nuclear DNA sequence of Brassica oleracea have been cloned and sequenced. They share up to 95% homology between one another and up to 80% with other satellite DNA sequences of Cruciferae, suggesting a common ancestor. Both strands of these monomers show more than 50% homology with many tRNA genes; the best homologies have been obtained with Lys and His yeast mitochondrial tRNA genes (respectively 64% and 60%). These results suggest that small tandemly repeated DNA sequences of plants may have evolved from a tRNA gene ancestor. These tandem repeats have probably arisen via a process involving reverse transcription of polymerase III RNA intermediates, as is the case for interspersed DNA sequences of mammalians. A model is proposed to explain the formation of such small tandemly repeated DNA sequences. Images PMID:3774553
Folding and Homodimerization of Wheat Germ Agglutinin

PubMed Central

Portillo-Téllez, María del Carmen; Bello, Martiniano; Salcedo, Guillermo; Gutiérrez, Gabriel; Gómez-Vidales, Virginia; García-Hernández, Enrique

2011-01-01

Wheat germ agglutinin (WGA) is emblematic of proteins that specialize in the recognition of carbohydrates. It was the first lectin reported to have a capacity for discriminating between normal and malignant cells. Since then, it has become a preferred model for basic research and is frequently considered in the development of biomedical and biotechnological applications. However, the molecular basis for the structural stability of this homodimeric lectin remains largely unknown, a situation that limits the rational manipulation and modification of its function. In this work we performed a thermodynamic characterization of WGA folding and self-association processes as a function of pH and temperature by using differential scanning and isothermal dilution calorimetry. WGA is monomeric at pH 2, and one of its four hevein-like domains is unfolded at room temperature. Under such conditions, the agglutinin exhibits a fully reversible thermal unfolding that consists of three two-state transitions. At higher pH values, the protein forms weak, nonobligate dimers. This behavior contrasts with that observed for the other plant lectins studied thus far, which form strong, obligate oligomers, indicating a distinctly different molecular basis for WGA function. For dimer formation, the four domains must be properly folded. Nevertheless, depending on the solution conditions, self-association may be coupled with folding of the labile domain. Therefore, dimerization may proceed as a rigid-body-like association or a folding-by-binding event. This hybrid behavior is not seen in other plant lectins. The emerging molecular picture for the WGA assembly highlights the need for a reexamination of existing ligand-binding data in the literature. PMID:21943423
Studies on chemical modification of cold agglutinin from the snail Achatina fulica.

PubMed Central

Sarkar, M; Mitra, D; Sen, A K

1987-01-01

The cold agglutinin isolated from the albumin gland of the snail Achatina fulica was modified with various chemical reagents in order to detect the amino acids and/or carbohydrate residues present in its carbohydrate-binding sites. Treatment with reagents considered specific for modification of lysine, arginine and tryptophan residues of the cold agglutinin did not affect the carbohydrate-binding activity of the agglutinin. Modification of tyrosine residues showed some change. However, modification with carbodiimide followed by alpha-aminobutyric acid methyl ester causes almost complete loss of its binding activity, indicating the involvement of aspartic acid and glutamic acid in its carbohydrate-binding activity. The carbohydrate residues of the cold agglutinin were removed by beta-elimination reaction, indicating that the sugars are O-glycosidically linked to protein part of the molecule. Removal of galactose residues from the cold agglutinin by the action of beta-galactosidase indicated that the galactose molecules are beta-linked. These carbohydrate-modified glycoproteins showed a marked change in agglutination property, i.e. they agglutinated rabbit erythrocytes at both 10 degrees C and 25 degrees C, indicating that the galactose residues of the glycoprotein play an important role in the cold-agglutination property of the glycoprotein. The c.d. data showed the presence of an almost identical type of random-coil conformation in the native cold agglutinin at 10 degrees C and in the carbohydrate-modified glycoprotein at 10 degrees C and 25 degrees C. This particular random-coil conformation is essential for carbohydrate-binding property of the agglutinin. Images Fig. 1. PMID:3118867
Transgenic tobacco expressing Pinellia ternata agglutinin confers enhanced resistance to aphids.

PubMed

Yao, Jianhong; Pang, Yongzhen; Qi, Huaxiong; Wan, Bingliang; Zhao, Xiuyun; Kong, Weiwen; Sun, Xiaofen; Tang, Kexuan

2003-12-01

Tobacco leaf discs were transformed with a plasmid, pBIPTA, containing the selectable marker neomycin phosphotransferase gene (nptII) and Pinellia ternata agglutinin gene (pta) via Agrobacterium tumefaciens-mediated transformation. Thirty-two independent transgenic tobacco plants were regenerated. PCR and Southern blot analyses confirmed that the pta gene had integrated into the plant genome and northern blot analysis revealed transgene expression at various levels in transgenic plants. Genetic analysis confirmed Mendelian segregation of the transgene in T1 progeny. Insect bioassays showed that transgenic plants expressing PTA inhibited significantly the growth of peach potato aphid (Myzus persicae Sulzer). This is the first report that transgenic plants expressing pta confer enhanced resistance to aphids. Our study indicates that the pta gene can be used as a supplement to the snowdrop (Galanthus nivalis) lectin gene (gna) in the control of aphids, a sap-sucking insect pest causing significant yield losses of crops.
Mitochondrial gene rearrangements confirm the parallel evolution of the crab-like form.

PubMed Central

Morrison, C L; Harvey, A W; Lavery, S; Tieu, K; Huang, Y; Cunningham, C W

2002-01-01

The repeated appearance of strikingly similar crab-like forms in independent decapod crustacean lineages represents a remarkable case of parallel evolution. Uncertainty surrounding the phylogenetic relationships among crab-like lineages has hampered evolutionary studies. As is often the case, aligned DNA sequences by themselves were unable to fully resolve these relationships. Four nested mitochondrial gene rearrangements--including one of the few reported movements of an arthropod protein-coding gene--are congruent with the DNA phylogeny and help to resolve a crucial node. A phylogenetic analysis of DNA sequences, and gene rearrangements, supported five independent origins of the crab-like form, and suggests that the evolution of the crab-like form may be irreversible. This result supports the utility of mitochondrial gene rearrangements in phylogenetic reconstruction. PMID:11886621
Comparative and evolutionary studies of vertebrate ALDH1A-like genes and proteins.

PubMed

Holmes, Roger S

2015-06-05

Vertebrate ALDH1A-like genes encode cytosolic enzymes capable of metabolizing all-trans-retinaldehyde to retinoic acid which is a molecular 'signal' guiding vertebrate development and adipogenesis. Bioinformatic analyses of vertebrate and invertebrate genomes were undertaken using known ALDH1A1, ALDH1A2 and ALDH1A3 amino acid sequences. Comparative analyses of the corresponding human genes provided evidence for distinct modes of gene regulation and expression with putative transcription factor binding sites (TFBS), CpG islands and micro-RNA binding sites identified for the human genes. ALDH1A-like sequences were identified for all mammalian, bird, lizard and frog genomes examined, whereas fish genomes displayed a more restricted distribution pattern for ALDH1A1 and ALDH1A3 genes. The ALDH1A1 gene was absent in many bony fish genomes examined, with the ALDH1A3 gene also absent in the medaka and tilapia genomes. Multiple ALDH1A1-like genes were identified in mouse, rat and marsupial genomes. Vertebrate ALDH1A1, ALDH1A2 and ALDH1A3 subunit sequences were highly conserved throughout vertebrate evolution. Comparative amino acid substitution rates showed that mammalian ALDH1A2 sequences were more highly conserved than for the ALDH1A1 and ALDH1A3 sequences. Phylogenetic studies supported an hypothesis for ALDH1A2 as a likely primordial gene originating in invertebrate genomes and undergoing sequential gene duplication to generate two additional genes, ALDH1A1 and ALDH1A3, in most vertebrate genomes. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.
Gene finding in metatranscriptomic sequences.

PubMed

Ismail, Wazim Mohammed; Ye, Yuzhen; Tang, Haixu

2014-01-01

Metatranscriptomic sequencing is a highly sensitive bioassay of functional activity in a microbial community, providing complementary information to the metagenomic sequencing of the community. The acquisition of the metatranscriptomic sequences will enable us to refine the annotations of the metagenomes, and to study the gene activities and their regulation in complex microbial communities and their dynamics. In this paper, we present TransGeneScan, a software tool for finding genes in assembled transcripts from metatranscriptomic sequences. By incorporating several features of metatranscriptomic sequencing, including strand-specificity, short intergenic regions, and putative antisense transcripts into a Hidden Markov Model, TranGeneScan can predict a sense transcript containing one or multiple genes (in an operon) or an antisense transcript. We tested TransGeneScan on a mock metatranscriptomic data set containing three known bacterial genomes. The results showed that TranGeneScan performs better than metagenomic gene finders (MetaGeneMark and FragGeneScan) on predicting protein coding genes in assembled transcripts, and achieves comparable or even higher accuracy than gene finders for microbial genomes (Glimmer and GeneMark). These results imply, with the assistance of metatranscriptomic sequencing, we can obtain a broad and precise picture about the genes (and their functions) in a microbial community. TransGeneScan is available as open-source software on SourceForge at https://sourceforge.net/projects/transgenescan/.
EFFECTS OF X RAYS ON THE PRODUCTION OF AGGLUTININ IN GUINEA PIGS (in Russian)

DOE Office of Scientific and Technical Information (OSTI.GOV)

Kovtunovich, L.G.

1958-11-01

Experiments were made with 62 guinea pigs in order to determine the effects of single whole-body exposure to 100 or 500 r on the production of typhoid agglutinins. A considerable depression of agglutinin production was observed only after the first vaccination. After repeated vaccinations, an increase in agglutinin level was observed in all irradiated animals regardless of the dose or length of exposure. (R.V.J.)
Isolation and purification of wheat germ agglutinin and analysis of its properties

NASA Astrophysics Data System (ADS)

Wang, Han

2017-12-01

In this paper, the wheat germ agglutinin was isolated and purified by affinity chromatography of chicken ovomucoid as ligand. The physicochemical properties were analyzed. The chicken ovomucoid was isolated from egg white and conjugated to affinity chromatography column agarose gel to prepare affinity adsorbent. The crude extract of wheat germ was freezedried by affinity chromatography. The physicochemical properties were analyzed by SDSpolyacrylamide gel electrophoresis and isoelectric focusing electrophoresis. And the relative molecular mass and isoelectric point of wheat germ agglutinin were obtained, and the high efficiency of purification of wheat germ agglutinin was proved by affinity chromatography.
The Intolerance of Regulatory Sequence to Genetic Variation Predicts Gene Dosage Sensitivity

PubMed Central

Wang, Quanli; Halvorsen, Matt; Han, Yujun; Weir, William H.; Allen, Andrew S.; Goldstein, David B.

2015-01-01

Noncoding sequence contains pathogenic mutations. Yet, compared with mutations in protein-coding sequence, pathogenic regulatory mutations are notoriously difficult to recognize. Most fundamentally, we are not yet adept at recognizing the sequence stretches in the human genome that are most important in regulating the expression of genes. For this reason, it is difficult to apply to the regulatory regions the same kinds of analytical paradigms that are being successfully applied to identify mutations among protein-coding regions that influence risk. To determine whether dosage sensitive genes have distinct patterns among their noncoding sequence, we present two primary approaches that focus solely on a gene’s proximal noncoding regulatory sequence. The first approach is a regulatory sequence analogue of the recently introduced residual variation intolerance score (RVIS), termed noncoding RVIS, or ncRVIS. The ncRVIS compares observed and predicted levels of standing variation in the regulatory sequence of human genes. The second approach, termed ncGERP, reflects the phylogenetic conservation of a gene’s regulatory sequence using GERP++. We assess how well these two approaches correlate with four gene lists that use different ways to identify genes known or likely to cause disease through changes in expression: 1) genes that are known to cause disease through haploinsufficiency, 2) genes curated as dosage sensitive in ClinGen’s Genome Dosage Map, 3) genes judged likely to be under purifying selection for mutations that change expression levels because they are statistically depleted of loss-of-function variants in the general population, and 4) genes judged unlikely to cause disease based on the presence of copy number variants in the general population. We find that both noncoding scores are highly predictive of dosage sensitivity using any of these criteria. In a similar way to ncGERP, we assess two ensemble-based predictors of regional noncoding importance
Structural and functional analysis of an enhancer GPEI having a phorbol 12-O-tetradecanoate 13-acetate responsive element-like sequence found in the rat glutathione transferase P gene.

PubMed

Okuda, A; Imagawa, M; Maeda, Y; Sakai, M; Muramatsu, M

1989-10-05

We have recently identified a typical enhancer, termed GPEI, located about 2.5 kilobases upstream from the transcription initiation site of the rat glutathione transferase P gene. Analyses of 5' and 3' deletion mutants revealed that the cis-acting sequence of GPEI contained the phorbol 12-O-tetradecanoate 13-acetate responsive element (TRE)-like sequence in it. For the maximal activity, however, GPEI required an adjacent upstream sequence of about 19 base pairs in addition to the TRE-like sequence. With the DNA binding gel-shift assay, we could detect protein(s) that specifically binds to the TRE-like sequence of GPEI fragment, which was possibly c-jun.c-fos complex or a similar protein complex. The sequence immediately upstream of the TRE-like sequence did not have any activity by itself, but augmented the latter activity by about 5-fold.
Characterization and sequence analysis of pilin from F-like plasmids.

PubMed Central

Frost, L S; Finlay, B B; Opgenorth, A; Paranchych, W; Lee, J S

1985-01-01

Conjugative pili are expressed by derepressed plasmids and initiate cell-to-cell contact during bacterial conjugation. They are also the site of attachment for pilus-specific phages (f1, f2, and QB). In this study, the number of pili per cell and their ability to retract in the presence of cyanide was estimated for 13 derepressed plasmids. Selected pilus types were further characterized for reactivity with anti-F and anti-ColB2 pilus antisera as well as two F pilus-specific monoclonal antibodies, one of which is specific for a sequence common to most F-like pilin types (JEL92) and one which is specific for the amino terminus of F pilin (JEL93). The pilin genes from eight of these plasmids were cloned and sequenced, and the results were compared with information on F, ColB2, and pED208 pilin. Six pilus groups were defined: I, was F-like [F, pED202(R386), ColV2-K94, and ColVBtrp]; IIA was ColB2-like in sequence but had a lowered sensitivity to f1 phage due to its decreased ability for pilus retraction [pED236(ColB2) and pED203(ColB4)]; IIB was ColB2-like but retained f1 sensitivity [pED200(R124) and pED207(R538-1)]; III contained R1-19, which had a ColB2-like amino terminus but had an additional lysine residue at its carboxy terminus which may affect its phage sensitivity pattern and its antigenicity; IV was R100-1-like [R100-1 and presumably pED241(R136) and pED204(R6)] which had a unique amino-terminal sequence combined with a carboxy terminus similar to that of F. pED208(Folac) formed group V, which was multipiliated and exhibited poor pilus retraction although it retained full sensitivity to f1 phage. The pED208 pilin gene could not be cloned at this time since it shared no homology with the pilin gene of the F plasmid. Images PMID:2999074
Single molecule real-time sequencing of Xanthomonas oryzae genomes reveals a dynamic structure and complex TAL (transcription activator-like) effector gene relationships

PubMed Central

Booher, Nicholas J.; Carpenter, Sara C. D.; Sebra, Robert P.; Wang, Li; Salzberg, Steven L.; Leach, Jan E.

2015-01-01

Pathogen-injected, direct transcriptional activators of host genes, TAL (transcription activator-like) effectors play determinative roles in plant diseases caused by Xanthomonas spp. A large domain of nearly identical, 33–35 aa repeats in each protein mediates DNA recognition. This modularity makes TAL effectors customizable and thus important also in biotechnology. However, the repeats render TAL effector (tal) genes nearly impossible to assemble using next-generation, short reads. Here, we demonstrate that long-read, single molecule real-time (SMRT) sequencing solves this problem. Taking an ensemble approach to first generate local, tal gene contigs, we correctly assembled de novo the genomes of two strains of the rice pathogen X. oryzae completed previously using the Sanger method and even identified errors in those references. Sequencing two more strains revealed a dynamic genome structure and a striking plasticity in tal gene content. Our results pave the way for population-level studies to inform resistance breeding, improve biotechnology and probe TAL effector evolution. PMID:27148456
Germline whole exome sequencing and large-scale replication identifies FANCM as a likely high grade serous ovarian cancer susceptibility gene.

PubMed

Dicks, Ed; Song, Honglin; Ramus, Susan J; Oudenhove, Elke Van; Tyrer, Jonathan P; Intermaggio, Maria P; Kar, Siddhartha; Harrington, Patricia; Bowtell, David D; Group, Aocs Study; Cicek, Mine S; Cunningham, Julie M; Fridley, Brooke L; Alsop, Jennifer; Jimenez-Linan, Mercedes; Piskorz, Anna; Goranova, Teodora; Kent, Emma; Siddiqui, Nadeem; Paul, James; Crawford, Robin; Poblete, Samantha; Lele, Shashi; Sucheston-Campbell, Lara; Moysich, Kirsten B; Sieh, Weiva; McGuire, Valerie; Lester, Jenny; Odunsi, Kunle; Whittemore, Alice S; Bogdanova, Natalia; Dürst, Matthias; Hillemanns, Peter; Karlan, Beth Y; Gentry-Maharaj, Aleksandra; Menon, Usha; Tischkowitz, Marc; Levine, Douglas; Brenton, James D; Dörk, Thilo; Goode, Ellen L; Gayther, Simon A; Pharoah, D P Paul

2017-08-01

We analyzed whole exome sequencing data in germline DNA from 412 high grade serous ovarian cancer (HGSOC) cases from The Cancer Genome Atlas Project and identified 5,517 genes harboring a predicted deleterious germline coding mutation in at least one HGSOC case. Gene-set enrichment analysis showed enrichment for genes involved in DNA repair (p = 1.8×10 -3 ). Twelve DNA repair genes - APEX1, APLF, ATX, EME1, FANCL, FANCM, MAD2L2, PARP2, PARP3, POLN, RAD54L and SMUG1 - were prioritized for targeted sequencing in up to 3,107 HGSOC cases, 1,491 cases of other epithelial ovarian cancer (EOC) subtypes and 3,368 unaffected controls of European origin. We estimated mutation prevalence for each gene and tested for associations with disease risk. Mutations were identified in both cases and controls in all genes except MAD2L2 , where we found no evidence of mutations in controls. In FANCM we observed a higher mutation frequency in HGSOC cases compared to controls (29/3,107 cases, 0.96 percent; 13/3,368 controls, 0.38 percent; P=0.008) with little evidence for association with other subtypes (6/1,491, 0.40 percent; P=0.82). The relative risk of HGSOC associated with deleterious FANCM mutations was estimated to be 2.5 (95% CI 1.3 - 5.0; P=0.006). In summary, whole exome sequencing of EOC cases with large-scale replication in case-control studies has identified FANCM as a likely novel susceptibility gene for HGSOC, with mutations associated with a moderate increase in risk. These data may have clinical implications for risk prediction and prevention approaches for high-grade serous ovarian cancer in the future and a significant impact on reducing disease mortality.

Urtica dioica agglutinin, a V beta 8.3-specific superantigen, prevents the development of the systemic lupus erythematosus-like pathology of MRL lpr/lpr mice.

PubMed

Musette, P; Galelli, A; Chabre, H; Callard, P; Peumans, W; Truffa-Bachi, P; Kourilsky, P; Gachelin, G

1996-08-01

The V beta 8.3-specific superantigenic lectin Urtica dioica agglutinin (UDA) was used to delete the V beta 8.3+ T cells in MRL lpr/lpr mice. In contrast to the systemic lupus erythematosus-like pathology which progresses with age in the phosphate-buffered saline-injected MRL lpr/lpr controls, UDA-treated animals did not develop overt clinical signs of lupus and nephritis. The pathogenic T cell clones thus reside within the V beta 8.3+ T cell population, which includes an expanded T cell clone described previously. Finally, UDA alters the production of autoantibodies in a sex-dependent manner.
Beware Cold Agglutinins in Organ Donors! Ex Vivo Lung Perfusion From an Uncontrolled Donation After Circulatory-Determination-of-Death Donor With a Cold Agglutinin: A Case Report.

PubMed

Venkataraman, A; Blackwell, J W; Funkhouser, W K; Birchard, K R; Beamer, S E; Simmons, W T; Randell, S H; Egan, T M

2017-09-01

We began to recover lungs from uncontrolled donation after circulatory determination of death to assess for transplant suitability by means of ex vivo lung perfusion (EVLP) and computerized tomographic (CT) scan. Our first case had a cold agglutinin with an interesting outcome. A 60-year-old man collapsed at home and was pronounced dead by Emergency Medical Services personnel. Next-of-kin consented to lung retrieval, and the decedent was ventilated and transported. Lungs were flushed with cold Perfadex, removed, and stored cold. The lungs did not flush well. Medical history revealed a recent hemolytic anemia and a known cold agglutinin. Warm nonventilated ischemia time was 51 minutes. O 2 -ventilated ischemia time was 141 minutes. Total cold ischemia time was 6.5 hours. At cannulation for EVLP, established clots were retrieved from both pulmonary arteries. At initiation of EVLP with Steen solution, tiny red aggregates were observed initially. With warming, the aggregates disappeared and the perfusate became red. After 1 hour, EVLP was stopped because of florid pulmonary edema. The lungs were cooled to 20°C; tiny red aggregates formed again in the perfusate. Ex vivo CT scan showed areas of pulmonary edema and a pyramidal right middle lobe opacity. Dissection showed multiple pulmonary emboli-the likely cause of death. However, histology showed agglutinated red blood cells in the microvasculature in pre- and post-EVLP biopsies, which may have contributed to inadequate parenchymal preservation. Organ donors with cold agglutinins may not be suitable owing to the impact of hypothermic preservation. Copyright © 2017 Elsevier Inc. All rights reserved.
Cold Agglutinin Disease; A Laboratory Challenge.

PubMed

Nikousefat, Zahra; Javdani, Moosa; Hashemnia, Mohammad; Haratyan, Abbas; Jalili, Ali

2015-10-01

Autoimmune haemolytic anemia (AIHA) is a complex process characterized by an immune reaction against red blood cell self-antigens. The analysis of specimens, drawn from patients with cold auto-immune hemolytic anemia is a difficult problem for automated hematology analyzer. This paper was written to alert technologists and pathologists to the presence of cold agglutinins and its effect on laboratory tests. A 72-year-old female presented to the Shafa laboratory for hematology profile evaluation. CBC indices showed invalid findings with the Sysmex automated hematology analyzer. Checking the laboratory process showed precipitation residue sticking to the sides of the tube. After warming the tubes, results become valid and the problem attributed to cold agglutinin disease. In this situation, aggregation of RBCs, which occurs at t < 30°C, causes invalid findings meanwhile working with automated hematology analyzer. Knowledge of this phenomenon can help prevent wasting too much time and make an early and accurate diagnosis.
Individual sequences in large sets of gene sequences may be distinguished efficiently by combinations of shared sub-sequences

PubMed Central

Gibbs, Mark J; Armstrong, John S; Gibbs, Adrian J

2005-01-01

Background Most current DNA diagnostic tests for identifying organisms use specific oligonucleotide probes that are complementary in sequence to, and hence only hybridise with the DNA of one target species. By contrast, in traditional taxonomy, specimens are usually identified by 'dichotomous keys' that use combinations of characters shared by different members of the target set. Using one specific character for each target is the least efficient strategy for identification. Using combinations of shared bisectionally-distributed characters is much more efficient, and this strategy is most efficient when they separate the targets in a progressively binary way. Results We have developed a practical method for finding minimal sets of sub-sequences that identify individual sequences, and could be targeted by combinations of probes, so that the efficient strategy of traditional taxonomic identification could be used in DNA diagnosis. The sizes of minimal sub-sequence sets depended mostly on sequence diversity and sub-sequence length and interactions between these parameters. We found that 201 distinct cytochrome oxidase subunit-1 (CO1) genes from moths (Lepidoptera) were distinguished using only 15 sub-sequences 20 nucleotides long, whereas only 8–10 sub-sequences 6–10 nucleotides long were required to distinguish the CO1 genes of 92 species from the 9 largest orders of insects. Conclusion The presence/absence of sub-sequences in a set of gene sequences can be used like the questions in a traditional dichotomous taxonomic key; hybridisation probes complementary to such sub-sequences should provide a very efficient means for identifying individual species, subtypes or genotypes. Sequence diversity and sub-sequence length are the major factors that determine the numbers of distinguishing sub-sequences in any set of sequences. PMID:15817134
CRISPR-like sequences in Helicobacter pylori and application in genotyping.

PubMed

Bangpanwimon, Khotchawan; Sottisuporn, Jaksin; Mittraparp-Arthorn, Pimonsri; Ueaphatthanaphanich, Warattaya; Rattanasupar, Attapon; Pourcel, Christine; Vuddhakul, Varaporn

2017-01-01

Many bacteria and archaea possess a defense system called clustered regularly interspaced short palindromic repeats (CRISPR) associated proteins (CRISPR-Cas system) against invaders such as phages or plasmids. This system has not been demonstrated in Helicobacter pylori . The numbers of spacer in CRISPR array differ among bacterial strains and can be used as a genetic marker for bacterial typing. A total of 36 H. pylori isolates were collected from patients in three hospitals located in the central (PBH) and southern (SKH) regions of Thailand. It is of interest that CRISPR-like sequences of this bacterium were detected in vlpC encoded for VacA-like protein C. Virulence genes were investigated and the most pathogenic genotype ( cagA vacA s1m1) was detected in 17 out of 29 (58.6%) isolates from PBH and 5 out of 7 (71.4%) from SKH. vapD gene was identified in each one isolate from PBH and SKH. CRISPR-like sequences and virulence genes of 20 isolates of H. pylori obtained in this study were analyzed and CRISPR-virulence typing was constructed and compared to profiles obtained by the random amplification of polymorphic DNA (RAPD) technique. The discriminatory power (DI) of CRISPR-virulence typing was not different from RAPD typing. CRISPR-virulence typing in H. pylori is easy and reliable for epidemiology and can be used for inter-laboratory interpretation.
Identification, Characterization and Expression of Methuselah-Like Genes in Dastarcus helophoroides (Coleoptera: Bothrideridae).

PubMed

Zhang, Zhengqing; Wang, Huapeng; Hao, Chunfeng; Zhang, Wei; Yang, Miaomiao; Chang, Yong; Li, Menglou

2016-10-21

Dastarcus helophoroides , which has a relatively longer lifespan compared to other insects, is one of the most effective natural enemies of many large-body long-horned beetles. Methuselah (Mth) is associated with the lifespan, stress resistance, and reproduction in Drosophila melanogaster , but Mth is not present in non-drosophiline insects. A number of methuselah-like genes ( mth-likes , mthls ) have been identified in non-drosophiline insects, but it is still unknown whether they are present in Dastarcus helophoroides . We identified three novel mth-like genes in D. helophoroides : mth-like1 , mth-like2 , and mth-like5 , and carried out bioinformatic analysis based on the full-length nucleic acid sequences and deduced amino acid sequences. Real-time quantitative polymerase chain reaction (RT-qPCR) showed variations in expression patterns of mth-like genes in different tissues (highly expressed in reproductive systems) and at different developmental stages, indicating that mth-likes were likely be involved in reproduction and development. The altered mRNA expression in aging adults and under oxidation, high temperature, and starvation stress, indicated that mth-like genes were likely to be involved in aging and the resistance of oxidation, high temperature, and starvation. These results characterize, for the first time, the basic properties of three mth-like genes from D. helophoroides that probably play important roles in development, aging, reproduction, and stress resistance.
A Rare Non-Hemolytic Case of Idiopathic Cold Agglutinin Disease.

PubMed

Erkus, Edip; Kocak, Mehmet Z; Aktas, Gulali; Ozen, Mehmet; Atak, Burcin M; Duman, Tuba T; Tekce, Buket K; Savli, Haluk

2018-06-01

Cold agglutinin disease is a very rare condition associated with agglutination of erythrocytes in cold environment usually due to IgM type antibodies. Other than hemolytic anemias, it may interfere with routine hemogram tests due to miscalculation of red blood cell count (RBC) and other hemogram parameters calculated with involvement of RBC. Awareness of the condition is important to overcome laboratory errors. We studied a peripheral blood smear and repeated the hemogram test at 37°C to establish the diagnosis of cold agglutinin disease. Initial hemogram test results of the fifty-eight year-old man was as follows: RBC: 1.34 M/µL, hemoglobin (Hb): 12.4 g/dL, hematocrit (Htc): 11.8%, mean corpuscular hemoglobin (MCH): 92.4 pg, and mean corpuscular hemoglobin concentration (MCHC): 105 gr/dL. Despite the standard indirect Coombs test being negative, repeated tests at room temperature was 4+. We suspected cold agglutinin disease and repeated the hemogram test using the Bain-Marie method at 37°C and the test results showed RBC: 3.4 M/µL, hemoglobin: 12.6 g/dL, hematocrit: 30.2%, MCH: 31.7 pg, and MCHC: 41.8 g/dL. Inappropriate hemogram results may be a sign of underlying cold agglutinin disease. Hemolytic anemia not always accompanies the disease; however, cold exposure may trigger erythrocyte agglutination in vitro and may cause erratic laboratory results.
Complete sequence of a plasmid from a bovine methicillin-resistant Staphylococcus aureus harbouring a novel ica-like gene cluster in addition to antimicrobial and heavy metal resistance genes.

PubMed

Feßler, Andrea T; Zhao, Qin; Schoenfelder, Sonja; Kadlec, Kristina; Brenner Michael, Geovana; Wang, Yang; Ziebuhr, Wilma; Shen, Jianzhong; Schwarz, Stefan

2017-02-01

The multiresistance plasmid pAFS11, obtained from a bovine methicillin-resistant Staphylococcus aureus (MRSA) isolate, was completely sequenced and analysed for its structure and organisation. Moreover, the susceptibility to the heavy metals cadmium and copper was determined by broth macrodilution. The 49,189-bp plasmid harboured the apramycin resistance gene apmA, two copies of the macrolide/lincosamide/streptogramin B resistance gene erm(B) (both located on remnants of a truncated transposon Tn917), the kanamycin/neomycin resistance gene aadD, the tetracycline resistance gene tet(L) and the trimethoprim resistance gene dfrK. The latter three genes were part of a 7,284-bp segment which was bracketed by two copies of IS431. In addition, the cadmium resistance operon cadDX as well as the copper resistance genes copA and mco were located on the plasmid and mediated a reduced susceptibility to cadmium and copper. Moreover, a complete novel ica-like gene cluster of so far unknown genetic origin was detected on this plasmid. The ica-like gene cluster comprised four different genes whose products showed 64.4-76.9% homology to the Ica proteins known to be involved in biofilm formation of the S. aureus strains Mu50, Mu3 and N315. However, 96.2-99.4% homology was seen to proteins from S. sciuri NS1 indicating an S. sciuri origin. The finding of five different antibiotic resistance genes co-located on a plasmid with heavy metal resistance genes and an ica-like gene cluster is alarming. With the acquisition of this plasmid, antimicrobial multiresistance, heavy metal resistances and potential virulence properties may be co-selected and spread via a single horizontal gene transfer event. Copyright © 2016 Elsevier B.V. All rights reserved.
Evolutionary maintenance of filovirus-like genes in bat genomes

PubMed Central

2011-01-01

Background Little is known of the biological significance and evolutionary maintenance of integrated non-retroviral RNA virus genes in eukaryotic host genomes. Here, we isolated novel filovirus-like genes from bat genomes and tested for evolutionary maintenance. We also estimated the age of filovirus VP35-like gene integrations and tested the phylogenetic hypotheses that there is a eutherian mammal clade and a marsupial/ebolavirus/Marburgvirus dichotomy for filoviruses. Results We detected homologous copies of VP35-like and NP-like gene integrations in both Old World and New World species of Myotis (bats). We also detected previously unknown VP35-like genes in rodents that are positionally homologous. Comprehensive phylogenetic estimates for filovirus NP-like and VP35-like loci support two main clades with a marsupial and a rodent grouping within the ebolavirus/Lloviu virus/Marburgvirus clade. The concordance of VP35-like, NP-like and mitochondrial gene trees with the expected species tree supports the notion that the copies we examined are orthologs that predate the global spread and radiation of the genus Myotis. Parametric simulations were consistent with selective maintenance for the open reading frame (ORF) of VP35-like genes in Myotis. The ORF of the filovirus-like VP35 gene has been maintained in bat genomes for an estimated 13. 4 MY. ORFs were disrupted for the NP-like genes in Myotis. Likelihood ratio tests revealed that a model that accommodates positive selection is a significantly better fit to the data than a model that does not allow for positive selection for VP35-like sequences. Moreover, site-by-site analysis of selection using two methods indicated at least 25 sites in the VP35-like alignment are under positive selection in Myotis. Conclusions Our results indicate that filovirus-like elements have significance beyond genomic imprints of prior infection. That is, there appears to be, or have been, functionally maintained copies of such genes in
Identification, inheritance, and linkage of B-G-like and MHC class I genes in cranes

USGS Publications Warehouse

Jarvi, S.I.; Goto, R.M.; Gee, G.F.; Briles, W.E.; Miller, M.M.

1999-01-01

We identified B-G-like genes in the whooping and Florida sandhill cranes and linked them to the major histocompatibility complex (MHC). We evaluated the inheritance of B-G-like genes in families of whooping and Florida sandhill cranes using restriction fragment patterns (RFPs). Two B-G-like genes, designated wcbgl and wcbg2, were located within 8 kb of one another. The fully sequenced wcbg2 gene encodes a B-G IgV-like domain, an additional Ig-like domain, a transmembrane domain, and a single heptad domain typical of '-helical coiled coils. Patterns of restriction fragments in DNA from the whooping crane and from a number of other species indicate that the B-G-like gene families of cranes are large with diverse sequences. Segregation of RFPs in families of Florida sandhill cranes provide evidence for genetic polymorphism in the B-G-like genes. The restriction fragments generally segregated in concert with MHC haplotypes assigned by serological typing and by single stranded conformational polymorphism (SSCP) assays based in the second exon of the crane MHC class I genes. This study supports the concept of a long-term association of polymorphic B-G-like genes with the MHC. It also establishes SSCP as a means for evaluating MHC genetic variability in cranes.
Identification, inheritance, and linkage of B-G-like and MHC class I genes in cranes.

PubMed

Jarvi, S I; Goto, R M; Gee, G F; Briles, W E; Miller, M M

1999-01-01

We identified B-G-like genes in the whooping and Florida sandhill cranes and linked them to the major histocompatibility complex (MHC). We evaluated the inheritance of B-G-like genes in families of whooping and Florida sandhill cranes using restriction fragment patterns (RFPs). Two B-G-like genes, designated wcbg1 and wcbg2, were located within 8 kb of one another. The fully sequenced wcbg2 gene encodes a B-G IgV-like domain, an additional Ig-like domain, a transmembrane domain, and a single heptad domain typical of alpha-helical coiled coils. Patterns of restriction fragments in DNA from the whooping crane and from a number of other species indicate that the B-G-like gene families of cranes are large with diverse sequences. Segregation of RFPs in families of Florida sandhill cranes provide evidence for genetic polymorphism in the B-G-like genes. The restriction fragments generally segregated in concert with MHC haplotypes assigned by serological typing and by single stranded conformational polymorphism (SSCP) assays based in the second exon of the crane MHC class I genes. This study supports the concept of a long-term association of polymorphic B-G-like genes with the MHC. It also establishes SSCP as a means for evaluating MHC genetic variability in cranes.
Expression of the Galanthus nivalis agglutinin (GNA) gene in transgenic potato plants confers resistance to aphids.

PubMed

Mi, Xiaoxiao; Liu, Xue; Yan, Haolu; Liang, Lina; Zhou, Xiangyan; Yang, Jiangwei; Si, Huaijun; Zhang, Ning

2017-01-01

Aphids, the largest group of sap-sucking pests, cause significant yield losses in agricultural crops worldwide every year. The massive use of pesticides to combat this pest causes severe damage to the environment, putting in risk the human health. In this study, transgenic potato plants expressing Galanthus nivalis agglutinin (GNA) gene were developed using CaMV 35S and ST-LS1 promoters generating six transgenic lines (35S1-35S3 and ST1-ST3 corresponding to the first and second promoter, respectively). Quantitative real-time polymerase chain reaction (qRT-PCR) analysis indicated that the GNA gene was expressed in leaves, stems and roots of transgenic plants under the control of the CaMV 35S promoter, while it was only expressed in leaves and stems under the control of the ST-LS1 promoter. The levels of aphid mortality after 5 days of the inoculation in the assessed transgenic lines ranged from 20 to 53.3%. The range of the aphid population in transgenic plants 15 days after inoculation was between 17.0±1.43 (ST2) and 36.6±0.99 (35S3) aphids per plant, which corresponds to 24.9-53.5% of the aphid population in non-transformed plants. The results of our study suggest that GNA expressed in transgenic potato plants confers a potential tolerance to aphid attack, which appears to be an alternative against the use of pesticides in the future. Copyright © 2016 Académie des sciences. Published by Elsevier SAS. All rights reserved.
Gene Discovery in the Apicomplexa as Revealed by EST Sequencing and Assembly of a Comparative Gene Database

PubMed Central

Li, Li; Brunk, Brian P.; Kissinger, Jessica C.; Pape, Deana; Tang, Keliang; Cole, Robert H.; Martin, John; Wylie, Todd; Dante, Mike; Fogarty, Steven J.; Howe, Daniel K.; Liberator, Paul; Diaz, Carmen; Anderson, Jennifer; White, Michael; Jerome, Maria E.; Johnson, Emily A.; Radke, Jay A.; Stoeckert, Christian J.; Waterston, Robert H.; Clifton, Sandra W.; Roos, David S.; Sibley, L. David

2003-01-01

Large-scale EST sequencing projects for several important parasites within the phylum Apicomplexa were undertaken for the purpose of gene discovery. Included were several parasites of medical importance (Plasmodium falciparum, Toxoplasma gondii) and others of veterinary importance (Eimeria tenella, Sarcocystis neurona, and Neospora caninum). A total of 55,192 ESTs, deposited into dbEST/GenBank, were included in the analyses. The resulting sequences have been clustered into nonredundant gene assemblies and deposited into a relational database that supports a variety of sequence and text searches. This database has been used to compare the gene assemblies using BLAST similarity comparisons to the public protein databases to identify putative genes. Of these new entries, ∼15%–20% represent putative homologs with a conservative cutoff of p < 10−9, thus identifying many conserved genes that are likely to share common functions with other well-studied organisms. Gene assemblies were also used to identify strain polymorphisms, examine stage-specific expression, and identify gene families. An interesting class of genes that are confined to members of this phylum and not shared by plants, animals, or fungi, was identified. These genes likely mediate the novel biological features of members of the Apicomplexa and hence offer great potential for biological investigation and as possible therapeutic targets. [The sequence data from this study have been submitted to dbEST division of GenBank under accession nos.: Toxoplasma gondii: –, –, –, –, – , –, –, –, –. Plasmodium falciparum: –, –, –, –. Sarcocystis neurona: , , , , , , , , , , , , , –, –, –, –, –. Eimeria tenella: –, –, –, –, –, –, –, –, – , –, –, –, –, –, –, –, –, –, –, –. Neospora caninum: –, –, , – , –, –.] PMID:12618375
Allergenicity Assessment of Allium sativum Leaf Agglutinin, a Potential Candidate Protein for Developing Sap Sucking Insect Resistant Food Crops

PubMed Central

Mondal, Hossain Ali; Chakraborti, Dipankar; Majumder, Pralay; Roy, Pampa; Roy, Amit; Bhattacharya, Swati Gupta; Das, Sampa

2011-01-01

Background Mannose-binding Allium sativum leaf agglutinin (ASAL) is highly antinutritional and toxic to various phloem-feeding hemipteran insects. ASAL has been expressed in a number of agriculturally important crops to develop resistance against those insects. Awareness of the safety aspect of ASAL is absolutely essential for developing ASAL transgenic plants. Methodology/Principal Findings Following the guidelines framed by the Food and Agriculture Organization/World Health Organization, the source of the gene, its sequence homology with potent allergens, clinical tests on mammalian systems, and the pepsin resistance and thermostability of the protein were considered to address the issue. No significant homology to the ASAL sequence was detected when compared to known allergenic proteins. The ELISA of blood sera collected from known allergy patients also failed to show significant evidence of cross-reactivity. In vitro and in vivo assays both indicated the digestibility of ASAL in the presence of pepsin in a minimum time period. Conclusions/Significance With these experiments, we concluded that ASAL does not possess any apparent features of an allergen. This is the first report regarding the monitoring of the allergenicity of any mannose-binding monocot lectin having insecticidal efficacy against hemipteran insects. PMID:22110739
A Gene Encoding a Hevein-Like Protein from Elderberry Fruits Is Homologous to PR-4 and Class V Chitinase Genes1

PubMed Central

Van Damme, Els J.M.; Charels, Diana; Roy, Soma; Tierens, Koenraad; Barre, Annick; Martins, José C.; Rougé, Pierre; Van Leuven, Fred; Does, Mirjam; Peumans, Willy J.

1999-01-01

We isolated SN-HLPf (Sambucus nigra hevein-like fruit protein), a hevein-like chitin-binding protein, from mature elderberry fruits. Cloning of the corresponding gene demonstrated that SN-HLPf is synthesized as a chimeric precursor consisting of an N-terminal chitin-binding domain corresponding to the mature elderberry protein and an unrelated C-terminal domain. Sequence comparisons indicated that the N-terminal domain of this precursor has high sequence similarity with the N-terminal domain of class I PR-4 (pathogenesis-related) proteins, whereas the C terminus is most closely related to that of class V chitinases. On the basis of these sequence homologies the gene encoding SN-HLPf can be considered a hybrid between a PR-4 and a class V chitinase gene. PMID:10198114
Gene discovery in the hamster: a comparative genomics approach for gene annotation by sequencing of hamster testis cDNAs

PubMed Central

Oduru, Sreedhar; Campbell, Janee L; Karri, SriTulasi; Hendry, William J; Khan, Shafiq A; Williams, Simon C

2003-01-01

Background Complete genome annotation will likely be achieved through a combination of computer-based analysis of available genome sequences combined with direct experimental characterization of expressed regions of individual genomes. We have utilized a comparative genomics approach involving the sequencing of randomly selected hamster testis cDNAs to begin to identify genes not previously annotated on the human, mouse, rat and Fugu (pufferfish) genomes. Results 735 distinct sequences were analyzed for their relatedness to known sequences in public databases. Eight of these sequences were derived from previously unidentified genes and expression of these genes in testis was confirmed by Northern blotting. The genomic locations of each sequence were mapped in human, mouse, rat and pufferfish, where applicable, and the structure of their cognate genes was derived using computer-based predictions, genomic comparisons and analysis of uncharacterized cDNA sequences from human and macaque. Conclusion The use of a comparative genomics approach resulted in the identification of eight cDNAs that correspond to previously uncharacterized genes in the human genome. The proteins encoded by these genes included a new member of the kinesin superfamily, a SET/MYND-domain protein, and six proteins for which no specific function could be predicted. Each gene was expressed primarily in testis, suggesting that they may play roles in the development and/or function of testicular cells. PMID:12783626
Further characterization of the cold agglutinin from the snail Achatina fulica.

PubMed Central

Mitra, D; Sarkar, M; Allen, A K

1987-01-01

The cold agglutinin from the albumin gland of the snail Achatina fulica was purified to homogeneity by using sheep gastric mucin-Sepharose 4B as affinity column followed by gel filtration on Bio-Gel P-300. The homogeneity was checked by alkaline gel electrophoresis, immunodiffusion and immunoelectrophoresis. The purified cold agglutinin is a glycoprotein of native M2 220,000 consisting of three non-covalently bound subunits of Mr 84,000, 74,000 and 62,000 and having a pI value of 4.5. The predominant amino acids are aspartic acid and glutamic acid (or amides) and serine, which account for 39% of the residues. About 3% of the residues are half-cystine. The lectin is a glycoprotein with about 30.7% carbohydrate, the most abundant sugars being galactose, N-acetylgalactosamine and N-acetylglucosamine. Mannose, xylose and fucose are also present. The inhibition of agglutination of human umbilical-cord erythrocytes by the cold agglutinin is specific for methyl beta-D-galactoside and also for glycolipids present on cord erythrocytes. The c.d. data show only negative ellipticity values in the far-u.v. region for the protein at various concentrations and temperatures and also in the presence of the hapten lactose (at different concentrations), indicating the presence of a random-coil conformation in the agglutinin that varies according to temperature. Images Fig. 2. Fig. 3. Fig. 5. Fig. 6. PMID:3593252
Gene Unprediction with Spurio: A tool to identify spurious protein sequences.

PubMed

Höps, Wolfram; Jeffryes, Matt; Bateman, Alex

2018-01-01

We now have access to the sequences of tens of millions of proteins. These protein sequences are essential for modern molecular biology and computational biology. The vast majority of protein sequences are derived from gene prediction tools and have no experimental supporting evidence for their translation. Despite the increasing accuracy of gene prediction tools there likely exists a large number of spurious protein predictions in the sequence databases. We have developed the Spurio tool to help identify spurious protein predictions in prokaryotes. Spurio searches the query protein sequence against a prokaryotic nucleotide database using tblastn and identifies homologous sequences. The tblastn matches are used to score the query sequence's likelihood of being a spurious protein prediction using a Gaussian process model. The most informative feature is the appearance of stop codons within the presumed translation of homologous DNA sequences. Benchmarking shows that the Spurio tool is able to distinguish spurious from true proteins. However, transposon proteins are prone to be predicted as spurious because of the frequency of degraded homologs found in the DNA sequence databases. Our initial experiments suggest that less than 1% of the proteins in the UniProtKB sequence database are likely to be spurious and that Spurio is able to identify over 60 times more spurious proteins than the AntiFam resource. The Spurio software and source code is available under an MIT license at the following URL: https://bitbucket.org/bateman-group/spurio.
Ricin, ricin agglutinin, and the ricin binding subunit structural comparison by Raman spectroscopy

NASA Astrophysics Data System (ADS)

Brandt, N. N.; Chikishev, A. Yu.; Sotnikov, A. I.; Savochkina, Yu. A.; Agapov, I. I.; Tonevitsky, A. G.

2005-02-01

Raman spectroscopy is used to study conformation-sensitive vibrational bands of the plant toxins ricin and ricin agglutinin and the ricin binding subunit in aqueous solution. The analysis of the Raman data yields the conformational state of the protein molecules differing from that predicted by the X-ray data. The differences and similarities in the conformational state of ricin, ricin agglutinin, and ricin binding subunit are discussed.
Resolution of Serologic Problems Due to Cold Agglutinins in Chronic Lymphocytic Leukemia.

PubMed

Javed, Rizwan; Datta, Suvro Sankha; Basu, Sabita; Chakrapani, Anupam

2016-06-01

Autoimmune hemolytic anemia can be classified depending on presence of warm, cold or mixed type of autoantibodies that are directed against antigens on the red blood cell surface. Here we report a case of pathological cold agglutinin disease which was eventually detected due to blood group discrepancy. A request was sent to the blood bank for two units of packed red cells in a diagnosed case of CLL which showed type IV discrepancy during blood grouping.The discrepancy was subsequently resolved after warm saline washing of red cells along with repetition of reverse grouping with pre-warmed serum. The direct antiglobulin test was positive and revealed autoanibodies against C3b/C3d only. Indirect antiglobulin test was performed with 3-cell panel in a polyspecific gel card (IgG+C3d) showed a pan-reactive pattern along with a positive autocontrol. Subsequently a cold agglutinin titration was performed and titers of 1024 at 4 °C; titer of 2 at room temperature were detected. Dithiothreitol (DTT) treatment of serum was undertaken and IgM type of autoantibody was detected in this case confirming a case of secondary cold agglutinin disease in this patient. Two units of red cells were transfused to this patient after successfully performing cross-match with pre-warmed serum. It was advised from the blood bank that the blood should be transfused slowly through a blood-warmer and patient should be kept in warm condition to avoid in-vivo hemolysis due to high titer of cold agglutinin. The transfusion was uneventful and patient is on regular follow-up till now. Thus we concluded that serological discrepancies observed in blood bank can successfully guide the bedside transfusion protocol in case of cold agglutinin disease.

Structural analysis of the RH-like blood group gene products in nonhuman primates

DOE Office of Scientific and Technical Information (OSTI.GOV)

Salvignol, I.; Calvas, P.; Blancher, A.

1995-03-01

Rh-related transcripts present in bone marrow samples from several species of nonhuman primates (chimpanzee, gorilla, gibbon, crab-eating macaque) have been amplified by RT-polymerase chain reaction using primers deduced from the sequence of human RH genes. Nucleotide sequence analysis of the nonhuman transcripts revealed a high degree of similarity to human blood group Rh sequences, suggesting a great conservation of the RH genes throughout evolution. Full-length transcripts, potentially encoding 417 amino acid long proteins homologous to Rh polypeptides, were characterized, as well as mRNA isoforms which harbored nucleotide deletions or insertions and potentially encode truncated proteins. Proteins of 30-40,000 M{sub r},more » immunologically related to human Rh proteins, were detected by western blot analysis with antipeptide antibodies, indicating that Rh-like transcripts are translated into membrane proteins. Comparison of human and nonhuman protein sequences was pivotal in clarifying the molecular basis of the blood group C/c polymorphism, showing that only the Pro103Ser substitution was correlated with C/c polymorphism. In addition, it was shown that a proline residue at position 102 was critical in the expression of C and c epitopes, most likely by providing an appropriate conformation of Rh polypeptides. From these data a phylogenetic reconstruction of the RH locus evolution has been calculated from which an unrooted phylogenetic tree could be proposed, indicating that African ape Rh-like genes would be closer to the human RhD gene than to the human RhCE gene. 55 refs., 4 figs., 1 tab.« less
Changing patterns of peanut agglutinin labelling in the dorsal cochlear nucleus correspond to axonal ingrowth.

PubMed Central

Riggs, G H; Schweitzer, L

1994-01-01

Various studies have suggested that glycoconjugates may influence connectivity and lamination in the developing central nervous system and may function as barriers to neuritic extension. It has been proposed that the peanut agglutinin lectin labels a glycoconjugate subserving a barrier function. We chose to investigate the distribution of this peanut-agglutinin-labelled glycoconjugate in the dorsal cochlear nucleus of the developing hamster since the development of the dorsal cochlear nucleus is well characterised and its axons obey laminar boundaries. The distribution of peanut agglutinin label throughout the cochlear nucleus delineated zones that cochlear axons fail to invade. In the dorsal cochlear nucleus, laminar differences were reduced on postnatal d 13 and virtually disappearing by postnatal d 23. Label in the molecular layer dissipated as axons and dendrites grew into this layer. These patterns of peanut agglutinin binding correspond to axonal ingrowth and are consistent with a barrier function for glycoconjugates in the molecular layer. Images Fig. 1 Fig. 2 Fig. 4 PMID:7961144
Whole-Genome Sequencing of Sordaria macrospora Mutants Identifies Developmental Genes.

PubMed

Nowrousian, Minou; Teichert, Ines; Masloff, Sandra; Kück, Ulrich

2012-02-01

The study of mutants to elucidate gene functions has a long and successful history; however, to discover causative mutations in mutants that were generated by random mutagenesis often takes years of laboratory work and requires previously generated genetic and/or physical markers, or resources like DNA libraries for complementation. Here, we present an alternative method to identify defective genes in developmental mutants of the filamentous fungus Sordaria macrospora through Illumina/Solexa whole-genome sequencing. We sequenced pooled DNA from progeny of crosses of three mutants and the wild type and were able to pinpoint the causative mutations in the mutant strains through bioinformatics analysis. One mutant is a spore color mutant, and the mutated gene encodes a melanin biosynthesis enzyme. The causative mutation is a G to A change in the first base of an intron, leading to a splice defect. The second mutant carries an allelic mutation in the pro41 gene encoding a protein essential for sexual development. In the mutant, we detected a complex pattern of deletion/rearrangements at the pro41 locus. In the third mutant, a point mutation in the stop codon of a transcription factor-encoding gene leads to the production of immature fruiting bodies. For all mutants, transformation with a wild type-copy of the affected gene restored the wild-type phenotype. Our data demonstrate that whole-genome sequencing of mutant strains is a rapid method to identify developmental genes in an organism that can be genetically crossed and where a reference genome sequence is available, even without prior mapping information.
Whole-Genome Sequencing of Sordaria macrospora Mutants Identifies Developmental Genes

PubMed Central

Nowrousian, Minou; Teichert, Ines; Masloff, Sandra; Kück, Ulrich

2012-01-01

The study of mutants to elucidate gene functions has a long and successful history; however, to discover causative mutations in mutants that were generated by random mutagenesis often takes years of laboratory work and requires previously generated genetic and/or physical markers, or resources like DNA libraries for complementation. Here, we present an alternative method to identify defective genes in developmental mutants of the filamentous fungus Sordaria macrospora through Illumina/Solexa whole-genome sequencing. We sequenced pooled DNA from progeny of crosses of three mutants and the wild type and were able to pinpoint the causative mutations in the mutant strains through bioinformatics analysis. One mutant is a spore color mutant, and the mutated gene encodes a melanin biosynthesis enzyme. The causative mutation is a G to A change in the first base of an intron, leading to a splice defect. The second mutant carries an allelic mutation in the pro41 gene encoding a protein essential for sexual development. In the mutant, we detected a complex pattern of deletion/rearrangements at the pro41 locus. In the third mutant, a point mutation in the stop codon of a transcription factor-encoding gene leads to the production of immature fruiting bodies. For all mutants, transformation with a wild type-copy of the affected gene restored the wild-type phenotype. Our data demonstrate that whole-genome sequencing of mutant strains is a rapid method to identify developmental genes in an organism that can be genetically crossed and where a reference genome sequence is available, even without prior mapping information. PMID:22384404
The first trimeric Galanthus nivalis agglutinin-related lectin of Orchidaceae was found in Dendrobium pendulum: purification, characterization, and effects of stress factors.

PubMed

Siripipatthana, Patthraporn; Phaonakrop, Narumon; Roytrakul, Sittiruk; Senawong, Gulsiri; Mudalige-Jayawickrama, Rasika G; Sattayasai, Nison

2015-07-01

Trimeric Galanthus nivalis agglutinin-related lectin of Orchidaceae with two conformational forms was first studied in Dendrobium pendulum . It was highly expressed by stress factors. Using mannan-agarose column chromatography, a mannose-binding protein was purified from Dendrobium pendulum Roxb. pseudobulb. After heating in the presence of sodium dodecyl sulfate (SDS) with or without 2-mercaptoethanol, the protein showed one band with molecular mass of 14.0 kDa on SDS-polyacrylamide gel electrophoresis (PAGE). Without heating, three bands were found at positions of 14.0, 39.4, and 41.5 kDa, but a higher amount of 39.4 and 41.5 kDa protein bands were seen in the presence of 2-mercaptoethanol. Liquid chromatography-tandem mass spectrometry and database search indicated that the 14.0 kDa protein band contained three peptide fragments identical to parts of a lectin precursor from Dendrobiu m findleyanum Parish & Rchb.f. Native-PAGE and Ferguson plot showed that the purified protein had two native forms with molecular masses of 44.2 and 45.3 kDa, indicating three 14.0 kDa polypeptide subunits. The purified protein exhibited the agglutination activity with trypsinized chicken erythrocytes. It was then recognized as a Galanthus nivalis agglutinin-related lectin and named D. pendulum agglutinin (DPA). Using reverse transcription-polymerase chain reaction and DNA sequencing, the deduced amino acid sequence of DPA precursor showed the highest homology (96.4%) with a lectin precursor of D. findleyanum and contained three mannose-binding sites. Greater amounts of DPA were found when the pseudobulbs were treated with stress factors including ultraviolet light, abscisic acid, hydrogen peroxide, and acetylene gas.
Molecular genetic analysis reveals that a nonribosomal peptide synthetase-like (NRPS-like) gene in Aspergillus nidulans is responsible for microperfuranone biosynthesis

DOE Office of Scientific and Technical Information (OSTI.GOV)

Yeh, Hsu-Hua; Chiang, Yi Ming; Entwistle, Ruth

2012-04-10

Genome sequencing of Aspergillus species including A. nidulans has revealed that there are far more secondary metabolite biosynthetic gene clusters than secondary metabolites isolated from these organisms. This implies that these organisms can produce additional secondary metabolites have not yet been elucidated. The A. nidulans genome contains twelve nonribosomal peptide synthetase (NRPS), one hybrid polyketide synthase/nonribosomal peptide synthetase (PKS/NRPS), and fourteen NRPS-like genes. The only NRPS-like gene in A. nidulans with a known product is tdiA which is involved in terrequinone A biosynthesis. To attempt to identify the products of these NRPS-like genes, we replaced the native promoters of themore » NRPS-like genes with the inducible alcohol dehydrogenase (alcA) promoter. Our results demonstrated that induction of the single NRPS-like gene AN3396.4 led to the enhanced production of microperfuranone. Furthermore, heterologous expression of AN3396.4 in A. niger confirmed that only one NRPS-like gene, AN3396.4, is necessary for the production of microperfuranone.« less
Identification of a DNA sequence motif required for expression of iron-regulated genes in pseudomonads.

PubMed

Rombel, I T; McMorran, B J; Lamont, I L

1995-02-20

Many bacteria respond to a lack of iron in the environment by synthesizing siderophores, which act as iron-scavenging compounds. Fluorescent pseudomonads synthesize strain-specific but chemically related siderophores called pyoverdines or pseudobactins. We have investigated the mechanisms by which iron controls expression of genes involved in pyoverdine metabolism in Pseudomonas aeruginosa. Transcription of these genes is repressed by the presence of iron in the growth medium. Three promoters from these genes were cloned and the activities of the promoters were dependent on the amounts of iron in the growth media. Two of the promoters were sequenced and the transcriptional start site were identified by S1 nuclease analysis. Sequences similar to the consensus binding site for the Fur repressor protein, which controls expression of iron-repressible genes in several gram-negative species, were not present in the promoters, suggesting that they are unlikely to have a high affinity for Fur. However, comparison of the promoter sequences with those of iron-regulated genes from other Pseudomonas species and also the iron-regulated exotoxin gene of P. aeruginosa allowed identification of a shared sequence element, with the consensus sequence (G/C)CTAAAT-CCC, which is likely to act as a binding site for a transcriptional activator protein. Mutations in this sequence greatly reduced the activities of the promoters characterized here as well as those of other iron-regulated promoters. The requirement for this motif in the promoters of iron-regulated genes of different Pseudomonas species indicates that similar mechanisms are likely to be involved in controlling expression of a range of iron-regulated genes in pseudomonads.
Distribution of nitrogen fixation and nitrogenase-like sequences amongst microbial genomes

PubMed Central

2012-01-01

Background The metabolic capacity for nitrogen fixation is known to be present in several prokaryotic species scattered across taxonomic groups. Experimental detection of nitrogen fixation in microbes requires species-specific conditions, making it difficult to obtain a comprehensive census of this trait. The recent and rapid increase in the availability of microbial genome sequences affords novel opportunities to re-examine the occurrence and distribution of nitrogen fixation genes. The current practice for computational prediction of nitrogen fixation is to use the presence of the nifH and/or nifD genes. Results Based on a careful comparison of the repertoire of nitrogen fixation genes in known diazotroph species we propose a new criterion for computational prediction of nitrogen fixation: the presence of a minimum set of six genes coding for structural and biosynthetic components, namely NifHDK and NifENB. Using this criterion, we conducted a comprehensive search in fully sequenced genomes and identified 149 diazotrophic species, including 82 known diazotrophs and 67 species not known to fix nitrogen. The taxonomic distribution of nitrogen fixation in Archaea was limited to the Euryarchaeota phylum; within the Bacteria domain we predict that nitrogen fixation occurs in 13 different phyla. Of these, seven phyla had not hitherto been known to contain species capable of nitrogen fixation. Our analyses also identified protein sequences that are similar to nitrogenase in organisms that do not meet the minimum-gene-set criteria. The existence of nitrogenase-like proteins lacking conserved co-factor ligands in both diazotrophs and non-diazotrophs suggests their potential for performing other, as yet unidentified, metabolic functions. Conclusions Our predictions expand the known phylogenetic diversity of nitrogen fixation, and suggest that this trait may be much more common in nature than it is currently thought. The diverse phylogenetic distribution of nitrogenase-like
A new yeast gene with a myosin-like heptad repeat structure.

PubMed

Kölling, R; Nguyen, T; Chen, E Y; Botstein, D

1993-03-01

We isolated a gene encoding a 218 kDa myosin-like protein from Saccharomyces cerevisiae using a monoclonal antibody directed against human platelet myosin as a probe. The protein sequence encoded by the MLP1 gene (for myosin-like protein) contains extensive stretches of a heptad-repeat pattern suggesting that the protein can form coiled coils typical of myosins. Immunolocalization experiments using affinity-purified antibodies raised against a TrpE-MLP1 fusion protein showed a dot-like structure adjacent to the nucleus in yeast cells bearing the MLP1 gene on a multicopy plasmid. In mouse epithelial cells the yeast anti-MLP1 antibodies stained the nucleus. Mutants bearing disruptions of the MLP1 gene were viable, but more sensitive to ultraviolet light than wild-type strains, suggesting an involvement of MLP1 in DNA repair. The MLP1 gene was mapped to chromosome 11, 25 cM from met1.
Isolation of Soybean Agglutinin (SBA) from Soy Meal.

ERIC Educational Resources Information Center

Sattsangi, Prem D.; And Others

1982-01-01

Describes a straight-forward and relatively inexpensive method for routine isolation of purified soybean agglutinin, suitable for use as a starting material in most studies, especially for fluorescent-labeling experiments. The process is used as a project to provide advanced laboratory training at a two-year college. (Author/JN)
Spindle Epithelial Tumor with Thymus-Like Differentiation (SETTLE): A Next-Generation Sequencing Study.

PubMed

Stevens, Todd M; Morlote, Diana; Swensen, Jeff; Ellis, Michelle; Harada, Shuko; Spencer, Sharon; Prieto-Granada, Carlos N; Folpe, Andrew L; Gatalica, Zoran

2018-05-07

Spindle epithelial tumor with thymus-like differentiation (SETTLE) is a malignant biphasic neoplasm of the thyroid or neck with propensity for late metastasis. Unlike synovial sarcoma, its main morphologic mimic, SETTLE lacks synovial sarcoma-associated translocations. A single case of SETTLE has shown a KRAS mutation but to date no comprehensive next generation sequencing studies of this rare neoplasm have been undertaken. Herein, we subjected 5 well defined cases of SETTLE to direct sequence analysis of 592 genes and fusion gene analysis of 52 genes frequently rearranged in human cancers. We identified one case with two pathogenic variants in the KMT2D gene, one being in an intron splice site (c.674-1A>G) and the other being a frameshift variant (p.M2829fs). This same case also had a pathogenic nonsense variant in the KMT2C gene (p.R1237*). A second case of SETTLE carried a pathogenic NRAS missense variant, Q61R. No other molecular alterations, microsatellite instability, gene fusions or amplifications were identified.
Phylogenetic and expression analysis of the NPR1-like gene family from Persea americana (Mill.).

PubMed

Backer, Robert; Mahomed, Waheed; Reeksting, Bianca J; Engelbrecht, Juanita; Ibarra-Laclette, Enrique; van den Berg, Noëlani

2015-01-01

The NONEXPRESSOR OF PATHOGENESIS-RELATED GENES1 (NPR1) forms an integral part of the salicylic acid (SA) pathway in plants and is involved in cross-talk between the SA and jasmonic acid/ethylene (JA/ET) pathways. Therefore, NPR1 is essential to the effective response of plants to pathogens. Avocado (Persea americana) is a commercially important crop worldwide. Significant losses in production result from Phytophthora root rot, caused by the hemibiotroph, Phytophthora cinnamomi. This oomycete infects the feeder roots of avocado trees leading to an overall decline in health and eventual death. The interaction between avocado and P. cinnamomi is poorly understood and as such limited control strategies exist. Thus uncovering the role of NPR1 in avocado could provide novel insights into the avocado - P. cinnamomi interaction. A total of five NPR1-like sequences were identified. These sequences were annotated using FGENESH and a maximum-likelihood tree was constructed using 34 NPR1-like protein sequences from other plant species. The conserved protein domains and functional motifs of these sequences were predicted. Reverse transcription quantitative PCR was used to analyze the expression of the five NPR1-like sequences in the roots of avocado after treatment with salicylic and jasmonic acid, P. cinnamomi infection, across different tissues and in P. cinnamomi infected tolerant and susceptible rootstocks. Of the five NPR1-like sequences three have strong support for a defensive role while two are most likely involved in development. Significant differences in the expression profiles of these five NPR1-like genes were observed, assisting in functional classification. Understanding the interaction of avocado and P. cinnamomi is essential to developing new control strategies. This work enables further classification of these genes by means of functional annotation and is a crucial step in understanding the role of NPR1 during P. cinnamomi infection.
Phylogenetic and expression analysis of the NPR1-like gene family from Persea americana (Mill.)

PubMed Central

Backer, Robert; Mahomed, Waheed; Reeksting, Bianca J.; Engelbrecht, Juanita; Ibarra-Laclette, Enrique; van den Berg, Noëlani

2015-01-01

The NONEXPRESSOR OF PATHOGENESIS-RELATED GENES1 (NPR1) forms an integral part of the salicylic acid (SA) pathway in plants and is involved in cross-talk between the SA and jasmonic acid/ethylene (JA/ET) pathways. Therefore, NPR1 is essential to the effective response of plants to pathogens. Avocado (Persea americana) is a commercially important crop worldwide. Significant losses in production result from Phytophthora root rot, caused by the hemibiotroph, Phytophthora cinnamomi. This oomycete infects the feeder roots of avocado trees leading to an overall decline in health and eventual death. The interaction between avocado and P. cinnamomi is poorly understood and as such limited control strategies exist. Thus uncovering the role of NPR1 in avocado could provide novel insights into the avocado – P. cinnamomi interaction. A total of five NPR1-like sequences were identified. These sequences were annotated using FGENESH and a maximum-likelihood tree was constructed using 34 NPR1-like protein sequences from other plant species. The conserved protein domains and functional motifs of these sequences were predicted. Reverse transcription quantitative PCR was used to analyze the expression of the five NPR1-like sequences in the roots of avocado after treatment with salicylic and jasmonic acid, P. cinnamomi infection, across different tissues and in P. cinnamomi infected tolerant and susceptible rootstocks. Of the five NPR1-like sequences three have strong support for a defensive role while two are most likely involved in development. Significant differences in the expression profiles of these five NPR1-like genes were observed, assisting in functional classification. Understanding the interaction of avocado and P. cinnamomi is essential to developing new control strategies. This work enables further classification of these genes by means of functional annotation and is a crucial step in understanding the role of NPR1 during P. cinnamomi infection. PMID:25972890
An example of auto-anti-A1 agglutinins.

PubMed

Wright, J; Lim, F C; Freedman, J

1980-10-01

The serum of an elderly man, group A, Le(a+b-), contained an IgM antibody that agglutinated his own cells and the cells of random group A1 donors. Over a period of 5 months, the titre of these auto-anti-A1 agglutinins was 4 at 22 degrees C.
Mouse Vk gene classification by nucleic acid sequence similarity.

PubMed

Strohal, R; Helmberg, A; Kroemer, G; Kofler, R

1989-01-01

Analyses of immunoglobulin (Ig) variable (V) region gene usage in the immune response, estimates of V gene germline complexity, and other nucleic acid hybridization-based studies depend on the extent to which such genes are related (i.e., sequence similarity) and their organization in gene families. While mouse Igh heavy chain V region (VH) gene families are relatively well-established, a corresponding systematic classification of Igk light chain V region (Vk) genes has not been reported. The present analysis, in the course of which we reviewed the known extent of the Vk germline gene repertoire and Vk gene usage in a variety of responses to foreign and self antigens, provides a classification of mouse Vk genes in gene families composed of members with greater than 80% overall nucleic acid sequence similarity. This classification differed in several aspects from that of VH genes: only some Vk gene families were as clearly separated (by greater than 25% sequence dissimilarity) as typical VH gene families; most Vk gene families were closely related and, in several instances, members from different families were very similar (greater than 80%) over large sequence portions; frequently, classification by nucleic acid sequence similarity diverged from existing classifications based on amino-terminal protein sequence similarity. Our data have implications for Vk gene analyses by nucleic acid hybridization and describe potentially important differences in sequence organization between VH and Vk genes.
Distinguishing the genotype 1 genes and proteins of human Wa-like rotaviruses vs. porcine rotaviruses

PubMed Central

Silva, Fernanda D.F.; Gregori, F.; McDonald, Sarah M.

2016-01-01

Group A rotaviruses (RVAs) are 11-segmented, double-stranded RNA viruses and important causes of gastroenteritis in the young of many animal species. Previous studies have suggested that human Wa-like RVAs share a close evolutionary relationship with porcine RVAs. Specifically, the VP1-VP3 and NSP2-5/6 genes of these viruses are usually classified as genotype 1 with >81% nucleotide sequence identity. Yet, it remains unknown whether the genotype 1 genes and proteins of human Wa-like strains are distinguishable from those of porcine strains. To investigate this, we performed comprehensive bioinformatic analyses using all known genotype 1 gene sequences. The RVAs analyzed represent wildtype strains isolated from humans or pigs at various geographical locations during the years of 2004–2013, including 11 newly-sequenced porcine RVAs from Brazil. We also analyzed archival strains that were isolated during the years of 1977–1992 as well as atypical strains involved in inter-species transmission between humans and pigs. We found that, in general, the genotype 1 genes of typical modern human Wa-like RVAs clustered together in phylogenetic trees and were separate from those of typical modern porcine RVAs. The only exception was for the NSP5/6 gene, which showed no host-specific phylogenetic clustering. Using amino acid sequence alignments, we identified 34 positions that differentiated the VP1-VP3, NSP2, and NSP3 genotype 1 proteins of typical modern human Wa-like RVAs versus typical modern porcine RVAs and documented how these positions vary in the archival/unusual isolates. No host-specific amino acid positions were identified for NSP4, NSP5, or NSP6. Altogether, the results of this study support the notion that human Wa-like RVAs and porcine RVAs are evolutionarily related, but indicate that some of their genotype 1 genes and proteins have diverged over time possibly as a reflection of sequestered replication and protein co-adaptation in their respective hosts. PMID
Establishing gene models from the Pinus pinaster genome using gene capture and BAC sequencing.

PubMed

Seoane-Zonjic, Pedro; Cañas, Rafael A; Bautista, Rocío; Gómez-Maldonado, Josefa; Arrillaga, Isabel; Fernández-Pozo, Noé; Claros, M Gonzalo; Cánovas, Francisco M; Ávila, Concepción

2016-02-27

In the era of DNA throughput sequencing, assembling and understanding gymnosperm mega-genomes remains a challenge. Although drafts of three conifer genomes have recently been published, this number is too low to understand the full complexity of conifer genomes. Using techniques focused on specific genes, gene models can be established that can aid in the assembly of gene-rich regions, and this information can be used to compare genomes and understand functional evolution. In this study, gene capture technology combined with BAC isolation and sequencing was used as an experimental approach to establish de novo gene structures without a reference genome. Probes were designed for 866 maritime pine transcripts to sequence genes captured from genomic DNA. The gene models were constructed using GeneAssembler, a new bioinformatic pipeline, which reconstructed over 82% of the gene structures, and a high proportion (85%) of the captured gene models contained sequences from the promoter regulatory region. In a parallel experiment, the P. pinaster BAC library was screened to isolate clones containing genes whose cDNA sequence were already available. BAC clones containing the asparagine synthetase, sucrose synthase and xyloglucan endotransglycosylase gene sequences were isolated and used in this study. The gene models derived from the gene capture approach were compared with the genomic sequences derived from the BAC clones. This combined approach is a particularly efficient way to capture the genomic structures of gene families with a small number of members. The experimental approach used in this study is a valuable combined technique to study genomic gene structures in species for which a reference genome is unavailable. It can be used to establish exon/intron boundaries in unknown gene structures, to reconstruct incomplete genes and to obtain promoter sequences that can be used for transcriptional studies. A bioinformatics algorithm (GeneAssembler) is also provided as a
Complete cDNA sequence of SAP-like pentraxin from Limulus polyphemus: implications for pentraxin evolution.

PubMed

Tharia, Hazel A; Shrive, Annette K; Mills, John D; Arme, Chris; Williams, Gwyn T; Greenhough, Trevor J

2002-02-22

The serum amyloid P component (SAP)-like pentraxin Limulus polyphemus SAP is a recently discovered, distinct pentraxin species, of known structure, which does not bind phosphocholine and whose N-terminal sequence has been shown to differ markedly from the highly conserved N terminus of all other known horseshoe crab pentraxins. The complete cDNA sequence of Limulus SAP, and the derived amino acid sequence, the first invertebrate SAP-like pentraxin sequence, have been determined. Two sequences were identified that differed only in the length of the 3' untranslated region. Limulus SAP is synthesised as a precursor protein of 234 amino acid residues, the first 17 residues encoding a signal peptide that is absent from the mature protein. Phylogenetic analysis clusters Limulus SAP pentraxin with the horseshoe crab C-reactive proteins (CRPs) rather than the mammalian SAPs, which are clustered with mammalian CRPs. The deduced amino acid sequence shares 22% identity with both human SAP and CRP, which are 51% identical, and 31-35% with horseshoe crab CRPs. These analyses indicate that gene duplication of CRP (or SAP), followed by sequence divergence and the evolution of CRP and/or SAP function, occurred independently along the chordate and arthropod evolutionary lines rather than in a common ancestor. They further indicate that the CRP/SAP gene duplication event in Limulus occurred before both the emergence of the Limulus CRP variants and the mammalian CRP/SAP gene duplication. Limulus SAP, which does not exhibit the CRP characteristic of calcium-dependent binding to phosphocholine, is established as a pentraxin species distinct from all other known horseshoe crab pentraxins that exist in many variant forms sharing a high level of sequence homology. Copyright 2002 Elsevier Science Ltd.
Sequence Composition and Gene Content of the Short Arm of Rye (Secale cereale) Chromosome 1

PubMed Central

Fluch, Silvia; Kopecky, Dieter; Burg, Kornel; Šimková, Hana; Taudien, Stefan; Petzold, Andreas; Kubaláková, Marie; Platzer, Matthias; Berenyi, Maria; Krainer, Siegfried; Doležel, Jaroslav; Lelley, Tamas

2012-01-01

Background The purpose of the study is to elucidate the sequence composition of the short arm of rye chromosome 1 (Secale cereale) with special focus on its gene content, because this portion of the rye genome is an integrated part of several hundreds of bread wheat varieties worldwide. Methodology/Principal Findings Multiple Displacement Amplification of 1RS DNA, obtained from flow sorted 1RS chromosomes, using 1RS ditelosomic wheat-rye addition line, and subsequent Roche 454FLX sequencing of this DNA yielded 195,313,589 bp sequence information. This quantity of sequence information resulted in 0.43× sequence coverage of the 1RS chromosome arm, permitting the identification of genes with estimated probability of 95%. A detailed analysis revealed that more than 5% of the 1RS sequence consisted of gene space, identifying at least 3,121 gene loci representing 1,882 different gene functions. Repetitive elements comprised about 72% of the 1RS sequence, Gypsy/Sabrina (13.3%) being the most abundant. More than four thousand simple sequence repeat (SSR) sites mostly located in gene related sequence reads were identified for possible marker development. The existence of chloroplast insertions in 1RS has been verified by identifying chimeric chloroplast-genomic sequence reads. Synteny analysis of 1RS to the full genomes of Oryza sativa and Brachypodium distachyon revealed that about half of the genes of 1RS correspond to the distal end of the short arm of rice chromosome 5 and the proximal region of the long arm of Brachypodium distachyon chromosome 2. Comparison of the gene content of 1RS to 1HS barley chromosome arm revealed high conservation of genes related to chromosome 5 of rice. Conclusions The present study revealed the gene content and potential gene functions on this chromosome arm and demonstrated numerous sequence elements like SSRs and gene-related sequences, which can be utilised for future research as well as in breeding of wheat and rye. PMID:22328922
Intragenome Diversity of Gene Families Encoding Toxin-like Proteins in Venomous Animals.

PubMed

Rodríguez de la Vega, Ricardo C; Giraud, Tatiana

2016-11-01

The evolution of venoms is the story of how toxins arise and of the processes that generate and maintain their diversity. For animal venoms these processes include recruitment for expression in the venom gland, neofunctionalization, paralogous expansions, and functional divergence. The systematic study of these processes requires the reliable identification of the venom components involved in antagonistic interactions. High-throughput sequencing has the potential of uncovering the entire set of toxins in a given organism, yet the existence of non-venom toxin paralogs and the misleading effects of partial census of the molecular diversity of toxins make necessary to collect complementary evidence to distinguish true toxins from their non-venom paralogs. Here, we analyzed the whole genomes of two scorpions, one spider and one snake, aiming at the identification of the full repertoires of genes encoding toxin-like proteins. We classified the entire set of protein-coding genes into paralogous groups and monotypic genes, identified genes encoding toxin-like proteins based on known toxin families, and quantified their expression in both venom-glands and pooled tissues. Our results confirm that genes encoding toxin-like proteins are part of multigene families, and that these families arise by recruitment events from non-toxin genes followed by limited expansions of the toxin-like protein coding genes. We also show that failing to account for sequence similarity with non-toxin proteins has a considerable misleading effect that can be greatly reduced by comparative transcriptomics. Our study overall contributes to the understanding of the evolutionary dynamics of proteins involved in antagonistic interactions. © The Author 2016. Published by Oxford University Press on behalf of the Society for Integrative and Comparative Biology. All rights reserved. For permissions please email: journals.permissions@oup.com.

Bioinformatics analysis of plant orthologous introns: identification of an intronic tRNA-like sequence.

PubMed

Akkuratov, Evgeny E; Walters, Lorraine; Saha-Mandal, Arnab; Khandekar, Sushant; Crawford, Erin; Zirbel, Craig L; Leisner, Scott; Prakash, Ashwin; Fedorova, Larisa; Fedorov, Alexei

2014-09-10

Orthologous introns have identical positions relative to the coding sequence in orthologous genes of different species. By analyzing the complete genomes of five plants we generated a database of 40,512 orthologous intron groups of dicotyledonous plants, 28,519 orthologous intron groups of angiosperms, and 15,726 of land plants (moss and angiosperms). Multiple sequence alignments of each orthologous intron group were obtained using the Mafft algorithm. The number of conserved regions in plant introns appeared to be hundreds of times fewer than that in mammals or vertebrates. Approximately three quarters of conserved intronic regions among angiosperms and dicots, in particular, correspond to alternatively-spliced exonic sequences. We registered only a handful of conserved intronic ncRNAs of flowering plants. However, the most evolutionarily conserved intronic region, which is ubiquitous for all plants examined in this study, including moss, possessed multiple structural features of tRNAs, which caused us to classify it as a putative tRNA-like ncRNA. Intronic sequences encoding tRNA-like structures are not unique to plants. Bioinformatics examination of the presence of tRNA inside introns revealed an unusually long-term association of four glycine tRNAs inside the Vac14 gene of fish, amniotes, and mammals. Copyright © 2014 Elsevier B.V. All rights reserved.
Aspergillus collagen-like genes (acl): identification, sequence polymorphism, and assessment for PCR-based pathogen detection.

PubMed

Tuntevski, Kiril; Durney, Brandon C; Snyder, Anna K; Lasala, P Rocco; Nayak, Ajay P; Green, Brett J; Beezhold, Donald H; Rio, Rita V M; Holland, Lisa A; Lukomski, Slawomir

2013-12-01

The genus Aspergillus is a burden to public health due to its ubiquitous presence in the environment, its production of allergens, and wide demographic susceptibility among cystic fibrosis, asthmatic, and immunosuppressed patients. Current methods of detection of Aspergillus colonization and infection rely on lengthy morphological characterization or nonstandardized serological assays that are restricted to identifying a fungal etiology. Collagen-like genes have been shown to exhibit species-specific conservation across the noncollagenous regions as well as strain-specific polymorphism in the collagen-like regions. Here we assess the conserved region of the Aspergillus collagen-like (acl) genes and explore the application of PCR amplicon size-based discrimination among the five most common etiologic species of the Aspergillus genus, including Aspergillus fumigatus, A. flavus, A. nidulans, A. niger, and A. terreus. Genetic polymorphism and phylogenetic analysis of the aclF1 gene were additionally examined among the available strains. Furthermore, the applicability of the PCR-based assay to identification of these five species in cultures derived from sputum and bronchoalveolar fluid from 19 clinical samples was explored. Application of capillary electrophoresis on nanogels was additionally demonstrated to improve the discrimination between Aspergillus species. Overall, this study demonstrated that Aspergillus acl genes could be used as PCR targets to discriminate between clinically relevant Aspergillus species. Future studies aim to utilize the detection of Aspergillus acl genes in PCR and microfluidic applications to determine the sensitivity and specificity for the identification of Aspergillus colonization and invasive aspergillosis in immunocompromised subjects.
Delimiting regulatory sequences of the Drosophila melanogaster Ddc gene.

PubMed Central

Hirsh, J; Morgan, B A; Scholnick, S B

1986-01-01

We delimited sequences necessary for in vivo expression of the Drosophila melanogaster dopa decarboxylase gene Ddc. The expression of in vitro-altered genes was assayed following germ line integration via P-element vectors. Sequences between -209 and -24 were necessary for normally regulated expression, although genes lacking these sequences could be expressed at 10 to 50% of wild-type levels at specific developmental times. These genes showed components of normal developmental expression, which suggests that they retain some regulatory elements. All Ddc genes lacking the normal immediate 5'-flanking sequences were grossly deficient in larval central nervous system expression. Thus, this upstream region must contain at least one element necessary for this expression. A mutated Ddc gene without a normal TATA boxlike sequence used the normal RNA start points, indicating that this sequences is not required for start point specificity. Images PMID:3099170
Gene and translation initiation site prediction in metagenomic sequences

DOE Office of Scientific and Technical Information (OSTI.GOV)

Hyatt, Philip Douglas; LoCascio, Philip F; Hauser, Loren John

2012-01-01

Gene prediction in metagenomic sequences remains a difficult problem. Current sequencing technologies do not achieve sufficient coverage to assemble the individual genomes in a typical sample; consequently, sequencing runs produce a large number of short sequences whose exact origin is unknown. Since these sequences are usually smaller than the average length of a gene, algorithms must make predictions based on very little data. We present MetaProdigal, a metagenomic version of the gene prediction program Prodigal, that can identify genes in short, anonymous coding sequences with a high degree of accuracy. The novel value of the method consists of enhanced translationmore » initiation site identification, ability to identify sequences that use alternate genetic codes and confidence values for each gene call. We compare the results of MetaProdigal with other methods and conclude with a discussion of future improvements.« less
Global sequence diversity of the lactate dehydrogenase gene in Plasmodium falciparum.

PubMed

Simpalipan, Phumin; Pattaradilokrat, Sittiporn; Harnyuttanakorn, Pongchai

2018-01-09

Antigen-detecting rapid diagnostic tests (RDTs) have been recommended by the World Health Organization for use in remote areas to improve malaria case management. Lactate dehydrogenase (LDH) of Plasmodium falciparum is one of the main parasite antigens employed by various commercial RDTs. It has been hypothesized that the poor detection of LDH-based RDTs is attributed in part to the sequence diversity of the gene. To test this, the present study aimed to investigate the genetic diversity of the P. falciparum ldh gene in Thailand and to construct the map of LDH sequence diversity in P. falciparum populations worldwide. The ldh gene was sequenced for 50 P. falciparum isolates in Thailand and compared with hundreds of sequences from P. falciparum populations worldwide. Several indices of molecular variation were calculated, including the proportion of polymorphic sites, the average nucleotide diversity index (π), and the haplotype diversity index (H). Tests of positive selection and neutrality tests were performed to determine signatures of natural selection on the gene. Mean genetic distance within and between species of Plasmodium ldh was analysed to infer evolutionary relationships. Nucleotide sequences of P. falciparum ldh could be classified into 9 alleles, encoding 5 isoforms of LDH. L1a was the most common allelic type and was distributed in P. falciparum populations worldwide. Plasmodium falciparum ldh sequences were highly conserved, with haplotype and nucleotide diversity values of 0.203 and 0.0004, respectively. The extremely low genetic diversity was maintained by purifying selection, likely due to functional constraints. Phylogenetic analysis inferred the close genetic relationship of P. falciparum to malaria parasites of great apes, rather than to other human malaria parasites. This study revealed the global genetic variation of the ldh gene in P. falciparum, providing knowledge for improving detection of LDH-based RDTs and supporting the candidacy of
Biological safety assessment of mutant variant of Allium sativum leaf agglutinin (mASAL), a novel antifungal protein for future transgenic application.

PubMed

Ghosh, Prithwi; Roy, Amit; Chakraborty, Joydeep; Das, Sampa

2013-12-04

Genetic engineering has established itself to be an important tool for crop improvement. Despite the success, there is always a risk of food allergy induced by alien gene products. The present study assessed the biosafety of mutant Allium sativum leaf agglutinin (mASAL), a potent antifungal protein generated by site directed mutagenesis of Allium sativum leaf agglutinin (ASAL). mASAL was cloned in pET28a+ and expressed in E. coli, and the safety assessment was carried out according to the FAO/WHO guideline (2001). Bioinformatics analysis, pepsin digestion, and thermal stability assay showed the protein to be nonallergenic. Targeted sera screening revealed no significant IgE affinity of mASAL. Furthermore, mASAL sensitized Balb/c mice showed normal histopathology of lung and gut tissue. All results indicated the least possibility of mASAL being an allergen. Thus, mASAL appears to be a promising antifungal candidate protein suitable for agronomical biotechnology.
Genome sequence analysis of predicted polyprenol reductase gene from mangrove plant kandelia obovata

NASA Astrophysics Data System (ADS)

Basyuni, M.; Sagami, H.; Baba, S.; Oku, H.

2018-03-01

It has been previously reported that dolichols but not polyprenols were predominated in mangrove leaves and roots. Therefore, the occurrence of larger amounts of dolichol in leaves of mangrove plants implies that polyprenol reductase is responsible for the conversion of polyprenol to dolichol may be active in mangrove leaves. Here we report the early assessment of probably polyprenol reductase gene from genome sequence of mangrove plant Kandelia obovata. The functional assignment of the gene was based on a homology search of the sequences against the non-redundant (nr) peptide database of NCBI using Blastx. The degree of sequence identity between DNA sequence and known polyprenol reductase was confirmed using the Blastx probability E-value, total score, and identity. The genome sequence data resulted in three partial sequences, termed c23157 (700 bp), c23901 (960 bp), and c24171 (531 bp). The c23157 gene showed the highest similarity (61%) to predicted polyprenol reductase 2- like from Gossypium raimondii with E-value 2e-100. The second gene was c23901 to exhibit high similarity (78%) to the steroid 5-alpha-reductase Det2 from J. curcas with E-value 2e-140. Furthermore, the c24171 gene depicted highest similarity (79%) to the polyprenol reductase 2 isoform X1 from Jatropha curcas with E- value 7e-21.The present study suggested that the c23157, c23901, and c24171, genes may encode predicted polyprenol reductase. The c23157, c23901, c24171 are therefore the new type of predicted polyprenol reductase from K. obovata.
Bacterial-like PPP protein phosphatases: novel sequence alterations in pathogenic eukaryotes and peculiar features of bacterial sequence similarity.

PubMed

Kerk, David; Uhrig, R Glen; Moorhead, Greg B

2013-01-01

Reversible phosphorylation is a widespread modification affecting the great majority of eukaryotic cellular proteins, and whose effects influence nearly every cellular function. Protein phosphatases are increasingly recognized as exquisitely regulated contributors to these changes. The PPP (phosphoprotein phosphatase) family comprises enzymes, which catalyze dephosphorylation at serine and threonine residues. Nearly a decade ago, "bacterial-like" enzymes were recognized with similarity to proteins from various bacterial sources: SLPs (Shewanella-like phosphatases), RLPHs (Rhizobiales-like phosphatases), and ALPHs (ApaH-like phosphatases). A recent article from our laboratory appearing in Plant Physiology characterizes their extensive organismal distribution, abundance in plant species, predicted subcellular localization, motif organization, and sequence evolution. One salient observation is the distinct evolutionary trajectory followed by SLP genes and proteins in photosynthetic eukaryotes vs. animal and plant pathogens derived from photosynthetic ancestors. We present here a closer look at sequence data that emphasizes the distinctiveness of pathogen SLP proteins and that suggests that they might represent novel drug targets. A second observation in our original report was the high degree of similarity between the bacterial-like PPPs of eukaryotes and closely related proteins of the "eukaryotic-like" phyla Myxococcales and Planctomycetes. We here reflect on the possible implications of these observations and their importance for future research.
Maturity onset diabetes of youth (MODY) in Turkish children: sequence analysis of 11 causative genes by next generation sequencing.

PubMed

Ağladıoğlu, Sebahat Yılmaz; Aycan, Zehra; Çetinkaya, Semra; Baş, Veysel Nijat; Önder, Aşan; Peltek Kendirci, Havva Nur; Doğan, Haldun; Ceylaner, Serdar

2016-04-01

Maturity-onset diabetes of the youth (MODY), is a genetically and clinically heterogeneous group of diseasesand is often misdiagnosed as type 1 or type 2 diabetes. The aim of this study is to investigate both novel and proven mutations of 11 MODY genes in Turkish children by using targeted next generation sequencing. A panel of 11 MODY genes were screened in 43 children with MODY diagnosed by clinical criterias. Studies of index cases was done with MISEQ-ILLUMINA, and family screenings and confirmation studies of mutations was done by Sanger sequencing. We identified 28 (65%) point mutations among 43 patients. Eighteen patients have GCK mutations, four have HNF1A, one has HNF4A, one has HNF1B, two have NEUROD1, one has PDX1 gene variations and one patient has both HNF1A and HNF4A heterozygote mutations. This is the first study including molecular studies of 11 MODY genes in Turkish children. GCK is the most frequent type of MODY in our study population. Very high frequency of novel mutations (42%) in our study population, supports that in heterogenous disorders like MODY sequence analysis provides rapid, cost effective and accurate genetic diagnosis.
Aspergillus Collagen-Like Genes (acl): Identification, Sequence Polymorphism, and Assessment for PCR-Based Pathogen Detection

PubMed Central

Tuntevski, Kiril; Durney, Brandon C.; Snyder, Anna K.; LaSala, P. Rocco; Nayak, Ajay P.; Green, Brett J.; Beezhold, Donald H.; Rio, Rita V. M.; Holland, Lisa A.

2013-01-01

The genus Aspergillus is a burden to public health due to its ubiquitous presence in the environment, its production of allergens, and wide demographic susceptibility among cystic fibrosis, asthmatic, and immunosuppressed patients. Current methods of detection of Aspergillus colonization and infection rely on lengthy morphological characterization or nonstandardized serological assays that are restricted to identifying a fungal etiology. Collagen-like genes have been shown to exhibit species-specific conservation across the noncollagenous regions as well as strain-specific polymorphism in the collagen-like regions. Here we assess the conserved region of the Aspergillus collagen-like (acl) genes and explore the application of PCR amplicon size-based discrimination among the five most common etiologic species of the Aspergillus genus, including Aspergillus fumigatus, A. flavus, A. nidulans, A. niger, and A. terreus. Genetic polymorphism and phylogenetic analysis of the aclF1 gene were additionally examined among the available strains. Furthermore, the applicability of the PCR-based assay to identification of these five species in cultures derived from sputum and bronchoalveolar fluid from 19 clinical samples was explored. Application of capillary electrophoresis on nanogels was additionally demonstrated to improve the discrimination between Aspergillus species. Overall, this study demonstrated that Aspergillus acl genes could be used as PCR targets to discriminate between clinically relevant Aspergillus species. Future studies aim to utilize the detection of Aspergillus acl genes in PCR and microfluidic applications to determine the sensitivity and specificity for the identification of Aspergillus colonization and invasive aspergillosis in immunocompromised subjects. PMID:24123732
The petunia AGL6 gene has a SEPALLATA-like function in floral patterning.

PubMed

Rijpkema, Anneke S; Zethof, Jan; Gerats, Tom; Vandenbussche, Michiel

2009-10-01

SEPALLATA (SEP) MADS-box genes are required for the regulation of floral meristem determinacy and the specification of sepals, petals, stamens, carpels and ovules, specifically in angiosperms. The SEP subfamily is closely related to the AGAMOUS LIKE6 (AGL6) and SQUAMOSA (SQUA) subfamilies. So far, of these three groups only AGL6-like genes have been found in extant gymnosperms. AGL6 genes are more similar to SEP than to SQUA genes, both in sequence and in expression pattern. Despite the ancestry and wide distribution of AGL6-like MADS-box genes, not a single loss-of-function mutant exhibiting a clear phenotype has yet been reported; consequently the function of AGL6-like genes has remained elusive. Here, we characterize the Petunia hybrida AGL6 (PhAGL6, formerly called PETUNIA MADS BOX GENE4/pMADS4) gene, and show that it functions redundantly with the SEP genes FLORAL BINDING PROTEIN2 (FBP2) and FBP5 in petal and anther development. Moreover, expression analysis suggests a function for PhAGL6 in ovary and ovule development. The PhAGL6 and FBP2 proteins interact in in vitro experiments overall with the same partners, indicating that the two proteins are biochemically quite similar. It will be interesting to determine the functions of AGL6-like genes of other species, especially those of gymnosperms.
A direct repeat of E-box-like elements is required for cell-autonomous circadian rhythm of clock genes

PubMed Central

Nakahata, Yasukazu; Yoshida, Mayumi; Takano, Atsuko; Soma, Haruhiko; Yamamoto, Takuro; Yasuda, Akio; Nakatsu, Toru; Takumi, Toru

2008-01-01

Background The circadian expression of the mammalian clock genes is based on transcriptional feedback loops. Two basic helix-loop-helix (bHLH) PAS (for Period-Arnt-Sim) domain-containing transcriptional activators, CLOCK and BMAL1, are known to regulate gene expression by interacting with a promoter element termed the E-box (CACGTG). The non-canonical E-boxes or E-box-like sequences have also been reported to be necessary for circadian oscillation. Results We report a new cis-element required for cell-autonomous circadian transcription of clock genes. This new element consists of a canonical E-box or a non-canonical E-box and an E-box-like sequence in tandem with the latter with a short interval, 6 base pairs, between them. We demonstrate that both E-box or E-box-like sequences are needed to generate cell-autonomous oscillation. We also verify that the spacing nucleotides with constant length between these 2 E-elements are crucial for robust oscillation. Furthermore, by in silico analysis we conclude that several clock and clock-controlled genes possess a direct repeat of the E-box-like elements in their promoter region. Conclusion We propose a novel possible mechanism regulated by double E-box-like elements, not to a single E-box, for circadian transcriptional oscillation. The direct repeat of the E-box-like elements identified in this study is the minimal required element for the generation of cell-autonomous transcriptional oscillation of clock and clock-controlled genes. PMID:18177499
Ab initio gene identification in metagenomic sequences

PubMed Central

Zhu, Wenhan; Lomsadze, Alexandre; Borodovsky, Mark

2010-01-01

We describe an algorithm for gene identification in DNA sequences derived from shotgun sequencing of microbial communities. Accurate ab initio gene prediction in a short nucleotide sequence of anonymous origin is hampered by uncertainty in model parameters. While several machine learning approaches could be proposed to bypass this difficulty, one effective method is to estimate parameters from dependencies, formed in evolution, between frequencies of oligonucleotides in protein-coding regions and genome nucleotide composition. Original version of the method was proposed in 1999 and has been used since for (i) reconstructing codon frequency vector needed for gene finding in viral genomes and (ii) initializing parameters of self-training gene finding algorithms. With advent of new prokaryotic genomes en masse it became possible to enhance the original approach by using direct polynomial and logistic approximations of oligonucleotide frequencies, as well as by separating models for bacteria and archaea. These advances have increased the accuracy of model reconstruction and, subsequently, gene prediction. We describe the refined method and assess its accuracy on known prokaryotic genomes split into short sequences. Also, we show that as a result of application of the new method, several thousands of new genes could be added to existing annotations of several human and mouse gut metagenomes. PMID:20403810
FrameD: A flexible program for quality check and gene prediction in prokaryotic genomes and noisy matured eukaryotic sequences.

PubMed

Schiex, Thomas; Gouzy, Jérôme; Moisan, Annick; de Oliveira, Yannick

2003-07-01

We describe FrameD, a program that predicts coding regions in prokaryotic and matured eukaryotic sequences. Initially targeted at gene prediction in bacterial GC rich genomes, the gene model used in FrameD also allows to predict genes in the presence of frameshifts and partially undetermined sequences which makes it also very suitable for gene prediction and frameshift correction in unfinished sequences such as EST and EST cluster sequences. Like recent eukaryotic gene prediction programs, FrameD also includes the ability to take into account protein similarity information both in its prediction and its graphical output. Its performances are evaluated on different bacterial genomes. The web site (http://genopole.toulouse.inra.fr/bioinfo/FrameD/FD) allows direct prediction, sequence correction and translation and the ability to learn new models for new organisms.
A Large Family of AvrLm6-like Genes in the Apple and Pear Scab Pathogens, Venturia inaequalis and Venturia pirina

PubMed Central

Shiller, Jason; Van de Wouw, Angela P.; Taranto, Adam P.; Bowen, Joanna K.; Dubois, David; Robinson, Andrew; Deng, Cecilia H.; Plummer, Kim M.

2015-01-01

Venturia inaequalis and V. pirina are Dothideomycete fungi that cause apple scab and pear scab disease, respectively. Whole genome sequencing of V. inaequalis and V. pirina isolates has revealed predicted proteins with sequence similarity to AvrLm6, a Leptosphaeria maculans effector that triggers a resistance response in Brassica napus and B. juncea carrying the resistance gene, Rlm6. AvrLm6-like genes are present as large families (>15 members) in all sequenced strains of V. inaequalis and V. pirina, while in L. maculans, only AvrLm6 and a single paralog have been identified. The Venturia AvrLm6-like genes are located in gene-poor regions of the genomes, and mostly in close proximity to transposable elements, which may explain the expansion of these gene families. An AvrLm6-like gene from V. inaequalis with the highest sequence identity to AvrLm6 was unable to trigger a resistance response in Rlm6-carrying B. juncea. RNA-seq and qRT-PCR gene expression analyses, of in planta- and in vitro-grown V. inaequalis, has revealed that many of the AvrLm6-like genes are expressed during infection. An AvrLm6 homolog from V. inaequalis that is up-regulated during infection was shown (using an eYFP-fusion protein construct) to be localized to the sub-cuticular stroma during biotrophic infection of apple hypocotyls. PMID:26635823
Prediction of operon-like gene clusters in the Arabidopsis thaliana genome based on co-expression analysis of neighboring genes.

PubMed

Wada, Masayoshi; Takahashi, Hiroki; Altaf-Ul-Amin, Md; Nakamura, Kensuke; Hirai, Masami Y; Ohta, Daisaku; Kanaya, Shigehiko

2012-07-15

Operon-like arrangements of genes occur in eukaryotes ranging from yeasts and filamentous fungi to nematodes, plants, and mammals. In plants, several examples of operon-like gene clusters involved in metabolic pathways have recently been characterized, e.g. the cyclic hydroxamic acid pathways in maize, the avenacin biosynthesis gene clusters in oat, the thalianol pathway in Arabidopsis thaliana, and the diterpenoid momilactone cluster in rice. Such operon-like gene clusters are defined by their co-regulation or neighboring positions within immediate vicinity of chromosomal regions. A comprehensive analysis of the expression of neighboring genes therefore accounts a crucial step to reveal the complete set of operon-like gene clusters within a genome. Genome-wide prediction of operon-like gene clusters should contribute to functional annotation efforts and provide novel insight into evolutionary aspects acquiring certain biological functions as well. We predicted co-expressed gene clusters by comparing the Pearson correlation coefficient of neighboring genes and randomly selected gene pairs, based on a statistical method that takes false discovery rate (FDR) into consideration for 1469 microarray gene expression datasets of A. thaliana. We estimated that A. thaliana contains 100 operon-like gene clusters in total. We predicted 34 statistically significant gene clusters consisting of 3 to 22 genes each, based on a stringent FDR threshold of 0.1. Functional relationships among genes in individual clusters were estimated by sequence similarity and functional annotation of genes. Duplicated gene pairs (determined based on BLAST with a cutoff of E<10(-5)) are included in 27 clusters. Five clusters are associated with metabolism, containing P450 genes restricted to the Brassica family and predicted to be involved in secondary metabolism. Operon-like clusters tend to include genes encoding bio-machinery associated with ribosomes, the ubiquitin/proteasome system, secondary
Alu sequence involvement in transcriptional insulation of the keratin 18 gene in transgenic mice.

PubMed Central

Thorey, I S; Ceceña, G; Reynolds, W; Oshima, R G

1993-01-01

The human keratin 18 (K18) gene is expressed in a variety of adult simple epithelial tissues, including liver, intestine, lung, and kidney, but is not normally found in skin, muscle, heart, spleen, or most of the brain. Transgenic animals derived from the cloned K18 gene express the transgene in appropriate tissues at levels directly proportional to the copy number and independently of the sites of integration. We have investigated in transgenic mice the dependence of K18 gene expression on the distal 5' and 3' flanking sequences and upon the RNA polymerase III promoter of an Alu repetitive DNA transcription unit immediately upstream of the K18 promoter. Integration site-independent expression of tandemly duplicated K18 transgenes requires the presence of either an 825-bp fragment of the 5' flanking sequence or the 3.5-kb 3' flanking sequence. Mutation of the RNA polymerase III promoter of the Alu element within the 825-bp fragment abolishes copy number-dependent expression in kidney but does not abolish integration site-independent expression when assayed in the absence of the 3' flanking sequence of the K18 gene. The characteristics of integration site-independent expression and copy number-dependent expression are separable. In addition, the formation of the chromatin state of the K18 gene, which likely restricts the tissue-specific expression of this gene, is not dependent upon the distal flanking sequences of the 10-kb K18 gene but rather may depend on internal regulatory regions of the gene. Images PMID:7692231
Genome-wide analysis of esterase-like genes in the striped rice stem borer, Chilo suppressalis.

PubMed

Wang, Baoju; Wang, Ying; Zhang, Yang; Han, Ping; Li, Fei; Han, Zhaojun

2015-06-01

The striped rice stem borer, Chilo suppressalis, a destructive pest of rice, has developed high levels of resistance to certain insecticides. Esterases are reported to be involved in insecticide resistance in several insects. Therefore, this study systematically analyzed esterase-like genes in C. suppressalis. Fifty-one esterase-like genes were identified in the draft genomic sequences of the species, and 20 cDNA sequences were derived which encoded full- or nearly full-length proteins. The putative esterase proteins derived from these full-length genes are overall highly diversified. However, key residues that are functionally important including the serine residue in the active site are conserved in 18 out of the 20 proteins. Phylogenetic analysis revealed that most of these genes have homologues in other lepidoptera insects. Genes CsuEst6, CsuEst10, CsuEst11, and CsuEst51 were induced by the insecticide triazophos, and genes CsuEst9, CsuEst11, CsuEst14, and CsuEst51 were induced by the insecticide chlorantraniliprole. Our results provide a foundation for future studies of insecticide resistance in C. suppressalis and for comparative research with esterase genes from other insect species.
Structural organization of the porcine and human genes coding for a leydig cell-specific insulin-like peptide (LEY I-L) and chromosomal localization of the human gene (INSL3)

DOE Office of Scientific and Technical Information (OSTI.GOV)

Burkhardt E.; Adham, I.M.; Brosig, B.

1994-03-01

Leydig insulin-like protein (LEY I-L) is a member of the insulin-like hormone superfamily. The LEY I-L gene (designated INSL3) is expressed exclusively in prenatal and postnatal Leydig cells. The authors report here the cloning and nucleotide sequence of porcine and human LEY I-L genes including the 5[prime] regions. Both genes consist of two exons and one intron. The organization of the LEY I-L gene is similar to that of insulin and relaxin. The transcription start site in the porcine and human LEY I-L gene is localized 13 and 14 bp upstream of the translation start site, respectively. Alignment of themore » 5[prime] flanking regions of both genes reveals that the first 107 nucleotides upstream of the transcription start site exhibit an overall sequence similarity of 80%. This conserved region contains a consensus TATAA box, a CAAT-like element (GAAT), and a consensus SP1 sequence (GGGCGG) at equivalent positions in both genes and therefore may play a role in regulation of expression of the LEY I-L gene. The porcine and human genome contains a single copy of the LEY I-L gene. By in situ hybridization, the human gene was assigned to bands p13.2-p12 of the short arm of chromosome 19. 25 refs., 6 figs.« less
Automated DNA mutation detection using universal conditions direct sequencing: application to ten muscular dystrophy genes

PubMed Central

2009-01-01

Background One of the most common and efficient methods for detecting mutations in genes is PCR amplification followed by direct sequencing. Until recently, the process of designing PCR assays has been to focus on individual assay parameters rather than concentrating on matching conditions for a set of assays. Primers for each individual assay were selected based on location and sequence concerns. The two primer sequences were then iteratively adjusted to make the individual assays work properly. This generally resulted in groups of assays with different annealing temperatures that required the use of multiple thermal cyclers or multiple passes in a single thermal cycler making diagnostic testing time-consuming, laborious and expensive. These factors have severely hampered diagnostic testing services, leaving many families without an answer for the exact cause of a familial genetic disease. A search of GeneTests for sequencing analysis of the entire coding sequence for genes that are known to cause muscular dystrophies returns only a small list of laboratories that perform comprehensive gene panels. The hypothesis for the study was that a complete set of universal assays can be designed to amplify and sequence any gene or family of genes using computer aided design tools. If true, this would allow automation and optimization of the mutation detection process resulting in reduced cost and increased throughput. Results An automated process has been developed for the detection of deletions, duplications/insertions and point mutations in any gene or family of genes and has been applied to ten genes known to bear mutations that cause muscular dystrophy: DMD; CAV3; CAPN3; FKRP; TRIM32; LMNA; SGCA; SGCB; SGCG; SGCD. Using this process, mutations have been found in five DMD patients and four LGMD patients (one in the FKRP gene, one in the CAV3 gene, and two likely causative heterozygous pairs of variations in the CAPN3 gene of two other patients). Methods and assay

Automated DNA mutation detection using universal conditions direct sequencing: application to ten muscular dystrophy genes.

PubMed

Bennett, Richard R; Schneider, Hal E; Estrella, Elicia; Burgess, Stephanie; Cheng, Andrew S; Barrett, Caitlin; Lip, Va; Lai, Poh San; Shen, Yiping; Wu, Bai-Lin; Darras, Basil T; Beggs, Alan H; Kunkel, Louis M

2009-10-18

One of the most common and efficient methods for detecting mutations in genes is PCR amplification followed by direct sequencing. Until recently, the process of designing PCR assays has been to focus on individual assay parameters rather than concentrating on matching conditions for a set of assays. Primers for each individual assay were selected based on location and sequence concerns. The two primer sequences were then iteratively adjusted to make the individual assays work properly. This generally resulted in groups of assays with different annealing temperatures that required the use of multiple thermal cyclers or multiple passes in a single thermal cycler making diagnostic testing time-consuming, laborious and expensive.These factors have severely hampered diagnostic testing services, leaving many families without an answer for the exact cause of a familial genetic disease. A search of GeneTests for sequencing analysis of the entire coding sequence for genes that are known to cause muscular dystrophies returns only a small list of laboratories that perform comprehensive gene panels.The hypothesis for the study was that a complete set of universal assays can be designed to amplify and sequence any gene or family of genes using computer aided design tools. If true, this would allow automation and optimization of the mutation detection process resulting in reduced cost and increased throughput. An automated process has been developed for the detection of deletions, duplications/insertions and point mutations in any gene or family of genes and has been applied to ten genes known to bear mutations that cause muscular dystrophy: DMD; CAV3; CAPN3; FKRP; TRIM32; LMNA; SGCA; SGCB; SGCG; SGCD. Using this process, mutations have been found in five DMD patients and four LGMD patients (one in the FKRP gene, one in the CAV3 gene, and two likely causative heterozygous pairs of variations in the CAPN3 gene of two other patients). Methods and assay sequences are reported in
Diversification of the insulin-like growth factor 1 gene in mammals.

PubMed

Rotwein, Peter

2017-01-01

Insulin-like growth factor 1 (IGF1), a small, secreted peptide growth factor, is involved in a variety of physiological and patho-physiological processes, including somatic growth, tissue repair, and metabolism of carbohydrates, proteins, and lipids. IGF1 gene expression appears to be controlled by several different signaling cascades in the few species in which it has been evaluated, with growth hormone playing a major role by activating a pathway involving the Stat5b transcription factor. Here, genes encoding IGF1 have been evaluated in 25 different mammalian species representing 15 different orders and ranging over ~180 million years of evolutionary diversification. Parts of the IGF1 gene have been fairly well conserved. Like rat Igf1 and human IGF1, 21 of 23 other genes are composed of 6 exons and 5 introns, and all 23 also contain recognizable tandem promoters, each with a unique leader exon. Exon and intron lengths are similar in most species, and DNA sequence conservation is moderately high in orthologous exons and proximal promoter regions. In contrast, putative growth hormone-activated Stat5b-binding enhancers found in analogous locations in rodent Igf1 and in human IGF1 loci, have undergone substantial variation in other mammals, and a processed retro-transposed IGF1 pseudogene is found in the sloth locus, but not in other mammalian genomes. Taken together, the fairly high level of organizational and nucleotide sequence similarity in the IGF1 gene among these 25 species supports the contention that some common regulatory pathways had existed prior to the beginning of mammalian speciation.
The genome of flax (Linum usitatissimum) assembled de novo from short shotgun sequence reads.

PubMed

Wang, Zhiwen; Hobson, Neil; Galindo, Leonardo; Zhu, Shilin; Shi, Daihu; McDill, Joshua; Yang, Linfeng; Hawkins, Simon; Neutelings, Godfrey; Datla, Raju; Lambert, Georgina; Galbraith, David W; Grassa, Christopher J; Geraldes, Armando; Cronk, Quentin C; Cullis, Christopher; Dash, Prasanta K; Kumar, Polumetla A; Cloutier, Sylvie; Sharpe, Andrew G; Wong, Gane K-S; Wang, Jun; Deyholos, Michael K

2012-11-01

Flax (Linum usitatissimum) is an ancient crop that is widely cultivated as a source of fiber, oil and medicinally relevant compounds. To accelerate crop improvement, we performed whole-genome shotgun sequencing of the nuclear genome of flax. Seven paired-end libraries ranging in size from 300 bp to 10 kb were sequenced using an Illumina genome analyzer. A de novo assembly, comprised exclusively of deep-coverage (approximately 94× raw, approximately 69× filtered) short-sequence reads (44-100 bp), produced a set of scaffolds with N(50) =694 kb, including contigs with N(50)=20.1 kb. The contig assembly contained 302 Mb of non-redundant sequence representing an estimated 81% genome coverage. Up to 96% of published flax ESTs aligned to the whole-genome shotgun scaffolds. However, comparisons with independently sequenced BACs and fosmids showed some mis-assembly of regions at the genome scale. A total of 43384 protein-coding genes were predicted in the whole-genome shotgun assembly, and up to 93% of published flax ESTs, and 86% of A. thaliana genes aligned to these predicted genes, indicating excellent coverage and accuracy at the gene level. Analysis of the synonymous substitution rates (K(s) ) observed within duplicate gene pairs was consistent with a recent (5-9 MYA) whole-genome duplication in flax. Within the predicted proteome, we observed enrichment of many conserved domains (Pfam-A) that may contribute to the unique properties of this crop, including agglutinin proteins. Together these results show that de novo assembly, based solely on whole-genome shotgun short-sequence reads, is an efficient means of obtaining nearly complete genome sequence information for some plant species. © 2012 The Authors. The Plant Journal © 2012 Blackwell Publishing Ltd.
Lateral Transfer of a Lectin-Like Antifreeze Protein Gene in Fishes

PubMed Central

Graham, Laurie A.; Lougheed, Stephen C.; Ewart, K. Vanya; Davies, Peter L.

2008-01-01

Fishes living in icy seawater are usually protected from freezing by endogenous antifreeze proteins (AFPs) that bind to ice crystals and stop them from growing. The scattered distribution of five highly diverse AFP types across phylogenetically disparate fish species is puzzling. The appearance of radically different AFPs in closely related species has been attributed to the rapid, independent evolution of these proteins in response to natural selection caused by sea level glaciations within the last 20 million years. In at least one instance the same type of simple repetitive AFP has independently originated in two distant species by convergent evolution. But, the isolated occurrence of three very similar type II AFPs in three distantly related species (herring, smelt and sea raven) cannot be explained by this mechanism. These globular, lectin-like AFPs have a unique disulfide-bonding pattern, and share up to 85% identity in their amino acid sequences, with regions of even higher identity in their genes. A thorough search of current databases failed to find a homolog in any other species with greater than 40% amino acid sequence identity. Consistent with this result, genomic Southern blots showed the lectin-like AFP gene was absent from all other fish species tested. The remarkable conservation of both intron and exon sequences, the lack of correlation between evolutionary distance and mutation rate, and the pattern of silent vs non-silent codon changes make it unlikely that the gene for this AFP pre-existed but was lost from most branches of the teleost radiation. We propose instead that lateral gene transfer has resulted in the occurrence of the type II AFPs in herring, smelt and sea raven and allowed these species to survive in an otherwise lethal niche. PMID:18612417
[Undifferentiated cutaneous angiosarcoma of the head: identification by the endothelial marker Ulex europaeus agglutinin I].

PubMed

Bork, K; Fries, J; Hoede, N; Korting, G W; Dienes, P

1985-06-01

Cutaneous angiosarcoma of the head is a rare tumor of the elderly and can occur in an undifferentiated form without any clinical or histological signs of the vascular origin of this tumor. In these cases, the tumor can be identified by using endothelial cell markers, such as factor-VIII-related antigen and ulex europaeus agglutinin I, in an immunofluorescence technique or a peroxidase-antiperoxidase method. A 78-year-old patient is described who died within 18 months from such a tumor, which was diagnosed using the endothelial cell marker, ulex europaeus agglutinin I.
A lectin histochemical study on carbohydrate moieties of the gonadotropin-like substance in the epithelial cells of Hatschek's pit of Branchiostoma belcheri

NASA Astrophysics Data System (ADS)

Fang, Y. Q.; Welsch, U.

1997-03-01

The present light microscopic lectin, histochemical study suggests for the first time that the vertebrate gonadotropin-like substance in the basal part of the epithelial cells of Hatschek's pit is a sialic acid-containing glycoprotein. The binding intensity of the epithelial cells in Hatschek's pit to 6 lectins ( Limulus polyphemus agglutinin (LPA), Wheat germ agglutinin (WGA), Helix pomatia agglutinin (HPA), Concanavalin A (Con A), Ulex europaeus agglutinin I (UEA I) and Ricinus communis agglutinin I (RCA I)) indicate that the carbohydrate composition of the gonadotrophic glycoprotein is similar to that of mammals and fish, and that N-acetyl-D-galactosamine, sialic acid, glucosamine, D-mannose and L-fucose are components of the carbohydrate portion.
Recognition of Yeast Species from Gene Sequence Comparisons

USDA-ARS?s Scientific Manuscript database

This review discusses recognition of yeast species from gene sequence comparisons, which have been responsible for doubling the number of known yeasts over the past decade. The resolution provided by various single gene sequences is examined for both ascomycetous and basidiomycetous species, and th...
Next-generation sequencing using a pre-designed gene panel for the molecular diagnosis of congenital disorders in pediatric patients.

PubMed

Lim, Eileen C P; Brett, Maggie; Lai, Angeline H M; Lee, Siew-Peng; Tan, Ee-Shien; Jamuar, Saumya S; Ng, Ivy S L; Tan, Ene-Choo

2015-12-14

Next-generation sequencing (NGS) has revolutionized genetic research and offers enormous potential for clinical application. Sequencing the exome has the advantage of casting the net wide for all known coding regions while targeted gene panel sequencing provides enhanced sequencing depths and can be designed to avoid incidental findings in adult-onset conditions. A HaloPlex panel consisting of 180 genes within commonly altered chromosomal regions is available for use on both the Ion Personal Genome Machine (PGM) and MiSeq platforms to screen for causative mutations in these genes. We used this Haloplex ICCG panel for targeted sequencing of 15 patients with clinical presentations indicative of an abnormality in one of the 180 genes. Sequencing runs were done using the Ion 318 Chips on the Ion Torrent PGM. Variants were filtered for known polymorphisms and analysis was done to identify possible disease-causing variants before validation by Sanger sequencing. When possible, segregation of variants with phenotype in family members was performed to ascertain the pathogenicity of the variant. More than 97% of the target bases were covered at >20×. There was an average of 9.6 novel variants per patient. Pathogenic mutations were identified in five genes for six patients, with two novel variants. There were another five likely pathogenic variants, some of which were unreported novel variants. In a cohort of 15 patients, we were able to identify a likely genetic etiology in six patients (40%). Another five patients had candidate variants for which further evaluation and segregation analysis are ongoing. Our results indicate that the HaloPlex ICCG panel is useful as a rapid, high-throughput and cost-effective screening tool for 170 of the 180 genes. There is low coverage for some regions in several genes which might have to be supplemented by Sanger sequencing. However, comparing the cost, ease of analysis, and shorter turnaround time, it is a good alternative to exome
Genotypic and Antimicrobial Susceptibility of Carbapenem-resistant Acinetobacter baumannii: Analysis of ISAba Elements and blaOXA-23-like Genes Including a New Variant

PubMed Central

Bahador, Abbas; Raoofian, Reza; Pourakbari, Babak; Taheri, Mohammad; Hashemizadeh, Zahra; Hashemi, Farhad B.

2015-01-01

Carbapenem-resistant Acinetobacter baumannii (CR-AB) causes serious nosocomial infections, especially in ICU wards of hospitals, worldwide. Expression of blaOXA genes is the chief mechanism of conferring carbapenem resistance among CR-AB. Although some blaOXA genes have been studied among CR-AB isolates from Iran, their blaOXA-23-like genes have not been investigated. We used a multiplex-PCR to detect Ambler class A, B, and D carbapenemases of 85 isolates, and determined that 34 harbored blaOXA-23-like genes. Amplified fragment length polymorphism (AFLP) genotyping, followed by DNA sequencing of blaOXA-23-like amplicons of CR-AB from each AFLP group was used to characterize their blaOXA-23-like genes. We also assessed the antimicrobial susceptibility pattern of CR-AB isolates, and tested whether they harbored insertion sequences ISAba1 and ISAba4. Sequence comparison with reference strain A. baumannii (NCTC12156) revealed five types of mutations in blaOXA-23-like genes; including one novel variant and four mutants that were already reported from China and the USA. All of the blaOXA-23-like genes mutations were associated with increased minimum inhibitory concentrations (MICs) against imipenem. ISAba1 and ISAba4 sequences were detected upstream of blaOXA-23 genes in 19 and 7% of isolates, respectively. The isolation of CR-AB with new blaOXA-23 mutations including some that have been reported from the USA and China highlights CR-AB pervasive distribution, which underscores the importance of concerted national and global efforts to control the spread of CR-AB isolates worldwide. PMID:26617588
[Characterization of Black and Dichothrix Cyanobacteria Based on the 16S Ribosomal RNA Gene Sequence

NASA Technical Reports Server (NTRS)

Ortega, Maya

2010-01-01

My project focuses on characterizing different cyanobacteria in thrombolitic mats found on the island of Highborn Cay, Bahamas. Thrombolites are interesting ecosystems because of the ability of bacteria in these mats to remove carbon dioxide from the atmosphere and mineralize it as calcium carbonate. In the future they may be used as models to develop carbon sequestration technologies, which could be used as part of regenerative life systems in space. These thrombolitic communities are also significant because of their similarities to early communities of life on Earth. I targeted two cyanobacteria in my research, Dichothrix spp. and whatever black is, since they are believed to be important to carbon sequestration in these thrombolitic mats. The goal of my summer research project was to molecularly identify these two cyanobacteria. DNA was isolated from each organism through mat dissections and DNA extractions. I ran Polymerase Chain Reactions (PCR) to amplify the 16S ribosomal RNA (rRNA) gene in each cyanobacteria. This specific gene is found in almost all bacteria and is highly conserved, meaning any changes in the sequence are most likely due to evolution. As a result, the 16S rRNA gene can be used for bacterial identification of different species based on the sequence of their 16S rRNA gene. Since the exact sequence of the Dichothrix gene was unknown, I designed different primers that flanked the gene based on the known sequences from other taxonomically similar cyanobacteria. Once the 16S rRNA gene was amplified, I cloned the gene into specialized Escherichia coli cells and sent the gene products for sequencing. Once the sequence is obtained, it will be added to a genetic database for future reference to and classification of other Dichothrix sp.
Characterization of CYCLOIDEA-like genes in Proteaceae, a basal eudicot family with multiple shifts in floral symmetry

PubMed Central

Citerne, Hélène L.; Reyes, Elisabeth; Le Guilloux, Martine; Delannoy, Etienne; Simonnet, Franck; Sauquet, Hervé; Weston, Peter H.; Nadot, Sophie; Damerval, Catherine

2017-01-01

Background and Aims The basal eudicot family Proteaceae (approx. 1700 species) shows considerable variation in floral symmetry but has received little attention in studies of evolutionary development at the genetic level. A framework for understanding the shifts in floral symmetry in Proteaceae is provided by reconstructing ancestral states on an upated phylogeny of the family, and homologues of CYCLOIDEA (CYC), a key gene for the control of floral symmetry in both monocots and eudicots, are characterized. Methods Perianth symmetry transitions were reconstructed on a new species-level tree using parsimony and maximum likelihood. CYC-like genes in 35 species (31 genera) of Proteaceae were sequenced and their phylogeny was reconstructed. Shifts in selection pressure following gene duplication were investigated using nested branch-site models of sequence evolution. Expression patterns of CYC homologues were characterized in three species of Grevillea with different types of floral symmetry. Key Results Zygomorphy has evolved 10–18 times independently in Proteaceae from actinomorphic ancestors, with at least four reversals to actinomorphy. A single duplication of CYC-like genes occurred prior to the diversification of Proteaceae, with putative loss or divergence of the ProtCYC1 paralogue in more than half of the species sampled. No shifts in selection pressure were detected in the branches subtending the two ProtCYC paralogues. However, the amino acid sequence preceding the TCP domain is strongly divergent in Grevillea ProtCYC1 compared with other species. ProtCYC genes were expressed in developing flowers of both actinomorphic and zygomorphic Grevillea species, with late asymmetric expression in the perianth of the latter. Conclusion Proteaceae is a remarkable family in terms of the number of transitions in floral symmetry. Furthermore, although CYC-like genes in Grevillea have unusual sequence characteristics, they display patterns of expression that make them good
SINA: accurate high-throughput multiple sequence alignment of ribosomal RNA genes.

PubMed

Pruesse, Elmar; Peplies, Jörg; Glöckner, Frank Oliver

2012-07-15

In the analysis of homologous sequences, computation of multiple sequence alignments (MSAs) has become a bottleneck. This is especially troublesome for marker genes like the ribosomal RNA (rRNA) where already millions of sequences are publicly available and individual studies can easily produce hundreds of thousands of new sequences. Methods have been developed to cope with such numbers, but further improvements are needed to meet accuracy requirements. In this study, we present the SILVA Incremental Aligner (SINA) used to align the rRNA gene databases provided by the SILVA ribosomal RNA project. SINA uses a combination of k-mer searching and partial order alignment (POA) to maintain very high alignment accuracy while satisfying high throughput performance demands. SINA was evaluated in comparison with the commonly used high throughput MSA programs PyNAST and mothur. The three BRAliBase III benchmark MSAs could be reproduced with 99.3, 97.6 and 96.1 accuracy. A larger benchmark MSA comprising 38 772 sequences could be reproduced with 98.9 and 99.3% accuracy using reference MSAs comprising 1000 and 5000 sequences. SINA was able to achieve higher accuracy than PyNAST and mothur in all performed benchmarks. Alignment of up to 500 sequences using the latest SILVA SSU/LSU Ref datasets as reference MSA is offered at http://www.arb-silva.de/aligner. This page also links to Linux binaries, user manual and tutorial. SINA is made available under a personal use license.
Low-grade parasitaemias and cold agglutinins in patients with hyper-reactive malarious splenomegaly and acute haemolysis.

PubMed

Torres, J R; Villegas, L; Perez, H; Suarez, L; Torres V, M A; Campos, M

2003-03-01

A cluster of 16 cases of hyper-reactive malarious splenomegaly (HMS) with severe, acute haemolysis, from an isolated, Venezuelan, Yanomami population, was prospectively investigated. Nine (69%) of the 13 HMS sera investigated but only one (7%) of 14 control sera (P < 0.005) contained elevated titres (of at least 1:32) of complement-fixing IgM cold agglutinins (CA). The CA detected had specificity for both the I and i blood-group antigens (with a relative predominance of anti-I) and wide thermal stability. The mean reciprocal CA titre was much higher for the HMS sera than for the control samples (59.16 v. 2.28; P < 0.001). Indirect tests for antiglobulin were positive for two of the 13 HMS cases (but none of 14 controls) investigated; all of the direct tests for antiglobulin gave negative results. The seven HMS cases checked, using an assay based on a nested PCR which amplified species-specific ribosomal sequences from Plasmodium vivax or P. falciparum, each yielded the PCR product that indicated P. vivax infection. However, only six (25%) of the 24 control samples (collected, at the same time as the HMS samples, from asymptomatic adults from the same Yanomami population) were PCR-positive (P < 0.001). In some cases at least, the acute severe episodes of haemolysis occasionally seen in HMS appear to be associated with an auto-immune, cold-agglutinin-mediated response triggered by non-patent parasitaemias.
Molecular Cloning and Sequence Analysis of a Phenylalanine Ammonia-Lyase Gene from Dendrobium

PubMed Central

Cai, Yongping; Lin, Yi

2013-01-01

In this study, a phenylalanine ammonia-lyase (PAL) gene was cloned from Dendrobium candidum using homology cloning and RACE. The full-length sequence and catalytic active sites that appear in PAL proteins of Arabidopsis thaliana and Nicotiana tabacum are also found: PAL cDNA of D. candidum (designated Dc-PAL1, GenBank No. JQ765748) has 2,458 bps and contains a complete open reading frame (ORF) of 2,142 bps, which encodes 713 amino acid residues. The amino acid sequence of DcPAL1 has more than 80% sequence identity with the PAL genes of other plants, as indicated by multiple alignments. The dominant sites and catalytic active sites, which are similar to that showing in PAL proteins of Arabidopsis thaliana and Nicotiana tabacum, are also found in DcPAL1. Phylogenetic tree analysis revealed that DcPAL is more closely related to PALs from orchidaceae plants than to those of other plants. The differential expression patterns of PAL in protocorm-like body, leaf, stem, and root, suggest that the PAL gene performs multiple physiological functions in Dendrobium candidum. PMID:23638048
Defining the carbohydrate specificities of Abrus precatorius agglutinin as T (Gal beta 1----3GalNAc) greater than I/II (Gal beta 1----3/4GlcNAc).

PubMed

Wu, A M; Lin, S R; Chin, L K; Chow, L P; Lin, J Y

1992-09-25

The combining site of the nontoxic carbohydrate binding protein (Abrus precatorius agglutinin, APA) purified from the needs of Abrus precatorius (Jequirity bean), was studied by quantitative precipitin and precipitin-inhibition assays. Of 26 glycoproteins and polysaccharides tested, all, except sialic acid-containing glycoproteins and desialized ovine salivary glycoproteins, reacted strongly with the lectin, and precipitated over 70% of the lectin added, indicating that APA has a broad range of affinity and recognizes (internal) Gal beta 1----sequences of carbohydrate chains. The strong reaction with desialized porcine and rat salivary glycoproteins as well as pneumococcus type XIV polysaccharide suggests that APA has affinity for one or more of the following carbohydrate sequences: Thomsen-Friedenreich (T, Gal beta 1----3GalNAc), blood group precursor type I and/or type II (Gal beta 1----3/4GlcNAc) disaccharide determinants of complex carbohydrates. Among the oligosaccharides tested, the T structure was the best inhibitor; it was 2.4 and 3.2 times more active than type II and type I sequences, respectively. The blood group I Ma-active trisaccharide, Gal beta 1----4GlcNAc beta 1----6Gal, was about as active as the corresponding disaccharide (II). From the above results, we conclude that the size of the combining site of the A. precatorius agglutinin is probably as large as a disaccharide and most strongly complementary to the Gal beta 1----3GalNAc (T determinant) sequence. The carbohydrate specificities of this lectin will be further investigated once the related oligosaccharide structures become available.
Three Cases of Anaerobiospirillum succiniciproducens Bacteremia Confirmed by 16S rRNA Gene Sequencing

PubMed Central

Tee, Wee; Korman, Tony M.; Waters, Mary Jo; Macphee, Andrew; Jenney, Adam; Joyce, Linda; Dyall-Smith, Michael L.

1998-01-01

We describe three cases of Anaerobiospirillum succiniciproducens bacteremia from Australia. We believe one of these cases represents the first report of A. succiniciproducens bacteremia in a human immunodeficiency virus (HIV)-infected individual. The other two patients had an underlying disorder (one patient had bleeding esophageal varices complicating alcohol liver disease and one patient had non-Hodgkin’s lymphoma). A motile, gram-negative, spiral anaerobe was isolated by culturing blood from all patients. Electron microscopy showed a curved bacterium with bipolar tufts of flagella resembling Anaerobiospirillum spp. Sequencing of the 16S rRNA genes of the isolates revealed no close relatives (organisms likely to be in the same genus) in the sequence databases, nor were any sequence data available for A. succiniciproducens. This report presents for the first time the 16S rRNA gene sequence of the type strain of A. succiniciproducens, strain ATCC 29305. Two of the three clinical isolates have sequences identical to that of the type strain, while the sequence of the other strain differs from that of the type strain at 4 nucleotides. PMID:9574678
Gene Discovery through Transcriptome Sequencing for the Invasive Mussel Limnoperna fortunei

PubMed Central

Uliano-Silva, Marcela; Americo, Juliana Alves; Brindeiro, Rodrigo; Dondero, Francesco; Prosdocimi, Francisco; de Freitas Rebelo, Mauro

2014-01-01

The success of the Asian bivalve Limnoperna fortunei as an invader in South America is related to its high acclimation capability. It can inhabit waters with a wide range of temperatures and salinity and handle long-term periods of air exposure. We describe the transcriptome of L. fortunei aiming to give a first insight into the phenotypic plasticity that allows non-native taxa to become established and widespread. We sequenced 95,219 reads from five main tissues of the mussel L. fortunei using Roche’s 454 and assembled them to form a set of 84,063 unigenes (contigs and singletons) representing partial or complete gene sequences. We annotated 24,816 unigenes using a BLAST sequence similarity search against a NCBI nr database. Unigenes were divided into 20 eggNOG functional categories and 292 KEGG metabolic pathways. From the total unigenes, 1,351 represented putative full-length genes of which 73.2% were functionally annotated. We described the first partial and complete gene sequences in order to start understanding bivalve invasiveness. An expansion of the hsp70 gene family, seen also in other bivalves, is present in L. fortunei and could be involved in its adaptation to extreme environments, e.g. during intertidal periods. The presence of toll-like receptors gives a first insight into an immune system that could be more complex than previously assumed and may be involved in the prevention of disease and extinction when population densities are high. Finally, the apparent lack of special adaptations to extremely low O2 levels is a target worth pursuing for the development of a molecular control approach. PMID:25047650
[Target gene sequence capture and next generation sequencing technology to diagnose four children with Alagille syndrome].

PubMed

Gao, M L; Zhong, X M; Ma, X; Ning, H J; Zhu, D; Zou, J Z

2016-06-02

To make genetic diagnosis of Alagille syndrome (ALGS) patients using target gene sequence capture and next generation sequencing technology. Target gene sequence capture and next generation sequencing were used to detect ALGS gene of 4 patients. They were hospitalized at the Affiliated Hospital, Capital Institute of Pediatrics between January 2014 and December 2015, referred to clinical diagnosis of ALGS typical and atypical respectively in 2 cases. Blood samples were collected from patients and their parents and genomic DNA was extracted from lymphocytes. Target gene sequence capture and next generation sequencing was detected. Sanger sequencing was used to confirm the results of the patients and their parents. Cholestasis, heart defects, inverted triangular face and butterfly vertebrae were presented as main clinical features in 4 male patients. The first hospital visiting ages ranged from 3 months and 14 days to 3 years and 1 month. The age of onset ranged from 3 days to 42 days (median 23 days). According to the clinical diagnostic criteria of ALGS, patient 1 and patient 2 were considered as typical ALGS. The other 2 patients were considered as atypical ALGS. Four Jagged 1(JAG1) pathogenic mutations were detected. Three different missense mutations were detected in patient 1 to patient 3 with ALGS(c.839C>T(p.W280X), c. 703G>A(p.R235X), c. 1720C>T(p.V574M)). The JAG1 mutation of patient 3 was first reported. Patient 4 had one novel insertion mutation (c.1779_1780insA(p.Ile594AsnfsTer23)). Parental analysis verified that the JAG1 missense mutation of 3 patients were de novo. The results of sanger sequencing was consistent with the results of the next generation sequencing. Target gene sequence capture combined with next generation sequencing can detect two pathogenic genes in ALGS and test genes of other related diseases in infantile cholestatic diseases simultaneously and presents a high throughput, high efficiency and low cost. It may provide molecular
Engineering disease resistance with pectate lyase-like genes

DOEpatents

Vogel, John; Somerville, Shauna

2005-03-08

A mutant gene coding for pectate lyase and homologs thereof is provided, which when incorporated in transgenic plants effect an increased level disease resistance in such plants. Also is provided the polypeptide sequence for the pectate lyase of the present invention. Methods of obtaining the mutant gene, producing transgenic plants which include the nucleotide sequence for the mutant gene and producing improved disease resistance in a crop of such transgenic plants are also provided.
[Blue-light induced expression of S-adenosy-L-homocysteine hydrolase-like gene in Mucor amphibiorum RCS1].

PubMed

Gao, Ya; Wang, Shu; Fu, Mingjia; Zhong, Guolin

2013-09-04

To determine blue-light induced expression of S-adenosyl-L-homocysteine hydrolase-like (sahhl) gene in fungus Mucor amphibiorum RCS1. In the random process of PCR, a sequence of 555 bp was obtained from M. amphibiorum RCS1. The 555 bp sequence was labeled with digoxin to prepare the probe for northern hybridization. By northern hybridization, the transcription of sahhl gene was analyzed in M. amphibiorum RCS1 mycelia culture process from darkness to blue light to darkness. Simultaneously real-time PCR method was used to the sahhl gene expression analysis. Compared with the sequence of sahh gene from Homo sapiens, Mus musculus and some fungi species, a high homology of the 555 bp sequence was confirmed. Therefore, the preliminary confirmation has supported that the 555 bp sequence should be sahhl gene from M. amphibiorum RCS1. Under the dark pre-culture in 24 h, a large amounts of transcript of sahhl gene in the mycelia can be detected by northern hybridization and real-time PCR in the condition of 24 h blue light. But a large amounts of transcript of sahhl gene were not found in other detection for the dark pre-culture of 48 h, even though M. amphibiorum RCS1 mycelia were induced by blue light. Blue light can induce the expression of sahhl gene in the vigorous growth of M. amphibiorum RCS1 mycelia.

Influence of 5'-flanking sequence on 4.5SI RNA gene transcription by RNA polymerase III.

PubMed

Gogolevskaya, Irina K; Stasenko, Danil V; Tatosyan, Karina A; Kramerov, Dmitri A

2018-05-01

Short nuclear 4.5SI RNA can be found in three related rodent families. Its function remains unknown. The genes of 4.5SI RNA contain an internal promoter of RNA polymerase III composed of the boxes A and B. Here, the effect of the sequence immediately upstream of the mouse 4.5SI RNA gene on its transcription was studied. The gene with deletions and substitutions in the 5'-flanking sequence was used to transfect HeLa cells and its transcriptional activity was evaluated from the cellular level of 4.5SI RNA. Single-nucleotide substitutions in the region adjacent to the transcription start site (positions -2 to -8) decreased the expression activity of the gene down to 40%-60% of the control. The substitution of the conserved pentanucleotide AGAAT (positions -14 to -18) could either decrease (43%-56%) or increase (134%) the gene expression. A TATA-like box (TACATGA) was found at positions -24 to -30 of the 4.5SI RNA gene. Its replacement with a polylinker fragment of the vector did not decrease the transcription level, while its replacement with a GC-rich sequence almost completely (down to 2%-5%) suppressed the transcription of the 4.5SI RNA gene. The effect of plasmid sequences bordering the gene on its transcription by RNA polymerase III is discussed.
Complete genome sequence of Fer-de-Lance Virus reveals a novel gene in reptilian Paramyxoviruses

USGS Publications Warehouse

Kurath, G.; Batts, W.N.; Ahne, W.; Winton, J.R.

2004-01-01

The complete RNA genome sequence of the archetype reptilian paramyxovirus, Fer-de-Lance virus (FDLV), has been determined. The genome is 15,378 nucleotides in length and consists of seven nonoverlapping genes in the order 3??? N-U-P-M-F-HN-L 5???, coding for the nucleocapsid, unknown, phospho-, matrix, fusion, hemagglutinin-neuraminidase, and large polymerase proteins, respectively. The gene junctions contain highly conserved transcription start and stop signal sequences and tri-nucleotide intergenic regions similar to those of other Paramyxoviridae. The FDLV P gene expression strategy is like that of rubulaviruses, which express the accessory V protein from the primary transcript and edit a portion of the mRNA to encode P and I proteins. There is also an overlapping open reading frame potentially encoding a small basic protein in the P gene. The gene designated U (unknown), encodes a deduced protein of 19.4 kDa that has no counterpart in other paramyxoviruses and has no similarity with sequences in the National Center for Biotechnology Information database. Active transcription of the U gene in infected cells was demonstrated by Northern blot analysis, and bicistronic N-U mRNA was also evident. The genomes of two other snake paramyxovirus genotypes were also found to have U genes, with 11 to 16% nucleotide divergence from the FDLV U gene. Pairwise comparisons of amino acid identities and phylogenetic analyses of all deduced FDLV protein sequences with homologous sequences from other Paramyxoviridae indicate that FDLV represents a new genus within the subfamily Paramyxovirinae. We suggest the name Ferlavirus for the new genus, with FDLV as the type species.
CLAVATA3-like genes are differentially expressed in grape vine (Vitis vinifera) tissues.

PubMed

Tominaga-Wada, Rumi; Nukumizu, Yuka; Wada, Takuji; Sawa, Shinichiro; Tetsumura, Takuya

2013-10-15

The CLAVATA3 (CLV3)/endosperm surrounding region [(ESR) CLE] peptides function as intercellular signaling molecules that regulate various physiological and developmental processes in diverse plant species. We identified five CLV3-like genes from grape vine (Vitis vinifera var. Pinot Noir): VvCLE 6, VvCLE 25-1, VvCLE 25-2, VvCLE 43 and VvCLE TDIF. These CLV3-like genes encode short proteins containing 43-128 amino acids. Except VvCLE TDIF, grape vine CLV3-like proteins possess a consensus amino acid sequence known as the CLE domain. Phylogenic analysis suggests that the VvCLE 6, VvCLE25-1, VvCLE25-2 and VvCLE43 genes have evolved from a single common ancestor to the Arabidopsis CLV3 gene. Expression analyses showed that the five grape CLV3-like genes are expressed in leaves, stems, roots and axillary buds with significant differences in their levels of expression. For example, while all of them were strongly expressed in axillary buds, VvCLE6 and VvCLE43 expression prevailed in roots, and VvCLE25-1, VvCLE25-2 and VvCLE TDIF expression in stems. The differential expression of the five grape CLV3-like peptides suggests that they play different roles in different organs and developmental stages. Copyright © 2013 Elsevier GmbH. All rights reserved.
htsint: a Python library for sequencing pipelines that combines data through gene set generation.

PubMed

Richards, Adam J; Herrel, Anthony; Bonneaud, Camille

2015-09-24

Sequencing technologies provide a wealth of details in terms of genes, expression, splice variants, polymorphisms, and other features. A standard for sequencing analysis pipelines is to put genomic or transcriptomic features into a context of known functional information, but the relationships between ontology terms are often ignored. For RNA-Seq, considering genes and their genetic variants at the group level enables a convenient way to both integrate annotation data and detect small coordinated changes between experimental conditions, a known caveat of gene level analyses. We introduce the high throughput data integration tool, htsint, as an extension to the commonly used gene set enrichment frameworks. The central aim of htsint is to compile annotation information from one or more taxa in order to calculate functional distances among all genes in a specified gene space. Spectral clustering is then used to partition the genes, thereby generating functional modules. The gene space can range from a targeted list of genes, like a specific pathway, all the way to an ensemble of genomes. Given a collection of gene sets and a count matrix of transcriptomic features (e.g. expression, polymorphisms), the gene sets produced by htsint can be tested for 'enrichment' or conditional differences using one of a number of commonly available packages. The database and bundled tools to generate functional modules were designed with sequencing pipelines in mind, but the toolkit nature of htsint allows it to also be used in other areas of genomics. The software is freely available as a Python library through GitHub at https://github.com/ajrichards/htsint.
Cold agglutinin activity in 2 dogs.

PubMed

Rojas-Temahuay, Gabriela; Crain, Sarah; Benson, Catherine; Sharkey, Leslie; Nothnagel, Geneva

2014-09-01

A 5-year-old neutered male Mastiff and an 8-year-old spayed female Labrador Retriever were presented to the University of Minnesota Veterinary Medical Center. The Mastiff was presented for evaluation of lameness and pyoderma one month prior in Missouri, where he tested positive for Ehrlichia canis by serum ELISA test, treated with doxycycline. PCR for Ehrlichia sp, Anaplasma sp, Babesia sp, and Bartonella sp, and PCR for antigen receptor rearrangement were negative, serum protein electrophoresis (SPE) revealed polyclonal gammopathy, and mildly reactive lymphoid cells were seen cytologically. The Labrador presented with a proliferative rostral mandibular gingival mass and lipomas for further presurgical evaluation of cold agglutinin activity documented by a commercial laboratory 2 years earlier prior to removal of a grade II mast cell tumor. This dog had a negative SNAP4Dx, normal SPE, and persistently increased serum ALP activity and polyuria/polydipsia suggestive for hyperadrenocorticism. Both dogs had markedly agglutinated RBC in the EDTA samples that dispersed with warming, and normal plasma color. Cold agglutinin activity was demonstrated by direct saline agglutination testing using whole blood and washed erythrocytes demonstrating agglutination at 30°C, 25°C, 15°C, and 4°C, but not at 37°C. CBC results (ADVIA 2120i) from the Mastiff revealed no significant differences in the RBC results obtained at room temperature (RT) and at 37°C; however, the RT run demonstrated negative bias in neutrophil and platelet concentrations attributed to rapid RBC settling. This uncommon hematologic condition may cause artifacts on the automated leukogram and platelet count, and may be subclinical for long periods. © 2014 American Society for Veterinary Clinical Pathology and European Society for Veterinary Clinical Pathology.
Nucleotide Sequence and Genetic Structure of a Novel Carbaryl Hydrolase Gene (cehA) from Rhizobium sp. Strain AC100

PubMed Central

Hashimoto, Masayuki; Fukui, Mitsuru; Hayano, Kouichi; Hayatsu, Masahito

2002-01-01

Rhizobium sp. strain AC100, which is capable of degrading carbaryl (1-naphthyl-N-methylcarbamate), was isolated from soil treated with carbaryl. This bacterium hydrolyzed carbaryl to 1-naphthol and methylamine. Carbaryl hydrolase from the strain was purified to homogeneity, and its N-terminal sequence, molecular mass (82 kDa), and enzymatic properties were determined. The purified enzyme hydrolyzed 1-naphthyl acetate and 4-nitrophenyl acetate indicating that the enzyme is an esterase. We then cloned the carbaryl hydrolase gene (cehA) from the plasmid DNA of the strain and determined the nucleotide sequence of the 10-kb region containing cehA. No homologous sequences were found by a database homology search using the nucleotide and deduced amino acid sequences of the cehA gene. Six open reading frames including the cehA gene were found in the 10-kb region, and sequencing analysis shows that the cehA gene is flanked by two copies of insertion sequence-like sequence, suggesting that it makes part of a composite transposon. PMID:11872471
DNA sequence responsible for the amplification of adjacent genes.

PubMed

Pasion, S G; Hartigan, J A; Kumar, V; Biswas, D K

1987-10-01

A 10.3-kb DNA fragment in the 5'-flanking region of the rat prolactin (rPRL) gene was isolated from F1BGH(1)2C1, a strain of rat pituitary tumor cells (GH cells) that produces prolactin in response to 5-bromodeoxyuridine (BrdU). Following transfection and integration into genomic DNA of recipient mouse L cells, this DNA induced amplification of the adjacent thymidine kinase gene from Herpes simplex virus type 1 (HSV1TK). We confirmed the ability of this "Amplicon" sequence to induce amplification of other linked or unlinked genes in DNA-mediated gene transfer studies. When transferred into the mouse L cells with the 10.3-5'rPRL gene sequence of BrdU-responsive cells, both the human growth hormone and the HSV1TK genes are amplified in response to 5-bromodeoxyuridine. This observation is substantiated by BrdU-induced amplification of the cotransferred bacterial Neo gene. Cotransfection studies reveal that the BrdU-induced amplification capability is associated with a 4-kb DNA sequence in the 5'-flanking region of the rPRL gene of BrdU-responsive cells. These results demonstrate that genes of heterologous origin, linked or unlinked, and selected or unselected, can be coamplified when located within the amplification boundary of the Amplicon sequence.
[Sequencing technology in gene diagnosis and its application].

PubMed

Yibin, Guo

2014-11-01

The study of gene mutation is one of the hot topics in the field of life science nowadays, and the related detection methods and diagnostic technology have been developed rapidly. Sequencing technology plays an indispensable role in the definite diagnosis and classification of genetic diseases. In this review, we summarize the research progress in sequencing technology, evaluate the advantages and disadvantages of 1(st) ~3(rd) generation of sequencing technology, and describe its application in gene diagnosis. Also we made forecasts and prospects on its development trend.
New data on epizootiology and genetics of piroplasms based on sequences of small ribosomal subunit and cytochrome b genes.

PubMed

Criado, A; Martinez, J; Buling, A; Barba, J C; Merino, S; Jefferies, R; Irwin, P J

2006-12-20

As a continuation of our studies on molecular epizootiology of piroplasmosis in Spain and other countries, we present in this contribution the finding of new hosts for some piroplasms, as well as information on their 18S rRNA gene sequences. Genetic data were complemented with sequences of apocytochrome b gene (whenever possible). The following conclusions were drawn from these molecular studies: Theileria annulata is capable of infecting dogs, since it was diagnosed in a symptomatic animal. According to cytochrome b sequences, isolates from cows and dog present slight differences. The same isolates showed, however, identical sequence in the 18S rRNA gene. This exemplifies well the usefulness of the mitochondrial gene for examining infra-specific variation. Babesia bovis is an occasional parasite of equines, since it was detected in two symptomatic horses. We found evidence of genetic polymorphism occurring in the 18S rRNA gene of Spanish T. equi-like and B. ovis isolates. B. bennetti from Spanish seagull is loosely related to B. ovis, and might represent a genetically distinct branch of babesids. A partial sequence of a cytochrome b pseudogene was obtained for the first time in Babesia canis rossi from South Africa. The pseudogene is distantly related to B. bigemina cytochrome b gene. These new findings confirm the ability of some piroplasms to infect multiple hosts, as well as the existence of a relatively wide genetic polymorphisms with respect to the cytochrome b gene. On the other hand, the existence of mtDNA-like pseudogenes of possible nuclear location in piroplasms is interesting due to their possible impact on molecular phylogeny studies.
Advanced colorectal adenoma related gene expression signature may predict prognostic for colorectal cancer patients with adenoma-carcinoma sequence.

PubMed

Li, Bing; Shi, Xiao-Yu; Liao, Dai-Xiang; Cao, Bang-Rong; Luo, Cheng-Hua; Cheng, Shu-Jun

2015-01-01

There are still no absolute parameters predicting progression of adenoma into cancer. The present study aimed to characterize functional differences on the multistep carcinogenetic process from the adenoma-carcinoma sequence. All samples were collected and mRNA expression profiling was performed by using Agilent Microarray high-throughput gene-chip technology. Then, the characteristics of mRNA expression profiles of adenoma-carcinoma sequence were described with bioinformatics software, and we analyzed the relationship between gene expression profiles of adenoma-adenocarcinoma sequence and clinical prognosis of colorectal cancer. The mRNA expressions of adenoma-carcinoma sequence were significantly different between high-grade intraepithelial neoplasia group and adenocarcinoma group. The biological process of gene ontology function enrichment analysis on differentially expressed genes between high-grade intraepithelial neoplasia group and adenocarcinoma group showed that genes enriched in the extracellular structure organization, skeletal system development, biological adhesion and itself regulated growth regulation, with the P value after FDR correction of less than 0.05. In addition, IPR-related protein mainly focused on the insulin-like growth factor binding proteins. The variable trends of gene expression profiles for adenoma-carcinoma sequence were mainly concentrated in high-grade intraepithelial neoplasia and adenocarcinoma. The differentially expressed genes are significantly correlated between high-grade intraepithelial neoplasia group and adenocarcinoma group. Bioinformatics analysis is an effective way to study the gene expression profiles in the adenoma-carcinoma sequence, and may provide an effective tool to involve colorectal cancer research strategy into colorectal adenoma or advanced adenoma.
The gene transfer agent-like particle of the marine phototrophic bacterium Rhodovulum sulfidophilum.

PubMed

Nagao, Nobuyoshi; Yamamoto, Junya; Komatsu, Hiroyuki; Suzuki, Hiromichi; Hirose, Yuu; Umekage, So; Ohyama, Takashi; Kikuchi, Yo

2015-12-01

Gene transfer agents (GTAs) are shaped like bacteriophage particles but have many properties that distinguish them from bacteriophages. GTAs play a role in horizontal gene transfer in nature and thus affect the evolution of prokaryotic genomes. In the course of studies on the extracellular production of designed RNAs using the marine bacterium Rhodovulum sulfidophilum , we found that this bacterium produces a GTA-like particle. The particle contains DNA fragments of 4.5 kb, which consist of randomly fragmented genomic DNA from the bacterium. This 4.5-kb DNA production was prevented while quorum sensing was inhibited. Direct observation of the particle by transmission electron microscopy revealed that the particle resembles a tailed phage and has a head diameter of about 40 nm and a tail length of about 60 nm. We also identified the structural genes for the GTA in the genome. Translated amino acid sequences and gene positions are closely related to those of the genes that encode the Rhodobacter capsulatus GTA. This is the first report of a GTA-like particle from the genus Rhodovulum . However, gene transfer activity of this particle has not yet been confirmed. The differences between this particle and other GTAs are discussed.
Fluorescence Imaging of Streptococcus pneumoniae with the Helix pomatia agglutinin (HPA) As a Potential, Rapid Diagnostic Tool

PubMed Central

Domenech, Mirian; García, Ernesto

2017-01-01

Streptococcus pneumoniae is a common human pathogen and a major causal agent of life-threatening infections that can either be respiratory or non-respiratory. It is well known that the Helix pomatia (edible snail) agglutinin (HPA) lectin shows specificity for terminal αGalNAc residues present, among other locations, in the Forssman pentasaccharide (αGalNAc1→3βGalNAc1→3αGal1→4βGal1→4βGlc). Based on experiments involving choline-independent mutants and different growth conditions, we propose here that HPA recognizes the αGalNAc terminal residues of the cell wall teichoic and lipoteichoic acids of S. pneumoniae. In addition, experimental evidence showing that pneumococci can be specifically labeled with HPA when growing as planktonic cultures as well as in mixed biofilms of S. pneumoniae and Haemophilus influenzae has been obtained. It should be underlined that pneumococci were HPA-labeled despite of the presence of a capsule. Although some non-pneumococcal species also bind the agglutinin, HPA-binding combined with fluorescence microscopy constitutes a suitable tool for identifying S. pneumoniae and, if used in conjunction with Gram staining and/or other suitable technique like antigen detection, it may potentially facilitate a fast and accurate diagnosis of pneumococcal infections. PMID:28769901
rpoB Gene Sequencing for Identification of Corynebacterium Species

PubMed Central

Khamis, Atieh; Raoult, Didier; La Scola, Bernard

2004-01-01

The genus Corynebacterium is a heterogeneous group of species comprising human and animal pathogens and environmental bacteria. It is defined on the basis of several phenotypic characters and the results of DNA-DNA relatedness and, more recently, 16S rRNA gene sequencing. However, the 16S rRNA gene is not polymorphic enough to ensure reliable phylogenetic studies and needs to be completely sequenced for accurate identification. The almost complete rpoB sequences of 56 Corynebacterium species were determined by both PCR and genome walking methods. In all cases the percent similarities between different species were lower than those observed by 16S rRNA gene sequencing, even for those species with degrees of high similarity. Several clusters supported by high bootstrap values were identified. In order to propose a method for strain identification which does not require sequencing of the complete rpoB sequence (approximately 3,500 bp), we identified an area with a high degree of polymorphism, bordered by conserved sequences that can be used as universal primers for PCR amplification and sequencing. The sequence of this fragment (434 to 452 bp) allows accurate species identification and may be used in the future for routine sequence-based identification of Corynebacterium species. PMID:15364970
Characterization of the product of a nonribosomal peptide synthetase-like (NRPS-like) gene using the doxycycline dependent Tet-on system in Aspergillus terreus.

PubMed

Sun, Wei-Wen; Guo, Chun-Jun; Wang, Clay C C

2016-04-01

Genome sequencing of the fungus Aspergillus terreus uncovered a number of silent core structural biosynthetic genes encoding enzymes presumed to be involved in the production of cryptic secondary metabolites. There are five nonribosomal peptide synthetase (NRPS)-like genes with the predicted A-T-TE domain architecture within the A. terreus genome. Among the five genes, only the product of pgnA remains unknown. The Tet-on system is an inducible, tunable and metabolism-independent expression system originally developed for Aspergillus niger. Here we report the adoption of the Tet-on system as an effective gene activation tool in A. terreus. Application of this system in A. terreus allowed us to uncover the product of the cryptic NRPS-like gene, pgnA. Furthermore expression of pgnA in the heterologous Aspergillus nidulans host suggested that the pgnA gene alone is necessary for phenguignardic acid (1) biosynthesis. Copyright © 2016 Elsevier Inc. All rights reserved.
Gene Discovery through Genomic Sequencing of Brucella abortus

PubMed Central

Sánchez, Daniel O.; Zandomeni, Ruben O.; Cravero, Silvio; Verdún, Ramiro E.; Pierrou, Ester; Faccio, Paula; Diaz, Gabriela; Lanzavecchia, Silvia; Agüero, Fernán; Frasch, Alberto C. C.; Andersson, Siv G. E.; Rossetti, Osvaldo L.; Grau, Oscar; Ugalde, Rodolfo A.

2001-01-01

Brucella abortus is the etiological agent of brucellosis, a disease that affects bovines and human. We generated DNA random sequences from the genome of B. abortus strain 2308 in order to characterize molecular targets that might be useful for developing immunological or chemotherapeutic strategies against this pathogen. The partial sequencing of 1,899 clones allowed the identification of 1,199 genomic sequence surveys (GSSs) with high homology (BLAST expect value < 10−5) to sequences deposited in the GenBank databases. Among them, 925 represent putative novel genes for the Brucella genus. Out of 925 nonredundant GSSs, 470 were classified in 15 categories based on cellular function. Seven hundred GSSs showed no significant database matches and remain available for further studies in order to identify their function. A high number of GSSs with homology to Agrobacterium tumefaciens and Rhizobium meliloti proteins were observed, thus confirming their close phylogenetic relationship. Among them, several GSSs showed high similarity with genes related to nodule nitrogen fixation, synthesis of nod factors, nodulation protein symbiotic plasmid, and nodule bacteroid differentiation. We have also identified several B. abortus homologs of virulence and pathogenesis genes from other pathogens, including a homolog to both the Shda gene from Salmonella enterica serovar Typhimurium and the AidA-1 gene from Escherichia coli. Other GSSs displayed significant homologies to genes encoding components of the type III and type IV secretion machineries, suggesting that Brucella might also have an active type III secretion machinery. PMID:11159979
Primer development to obtain complete coding sequence of HA and NA genes of influenza A/H3N2 virus.

PubMed

Agustiningsih, Agustiningsih; Trimarsanto, Hidayat; Setiawaty, Vivi; Artika, I Made; Muljono, David Handojo

2016-08-30

Influenza is an acute respiratory illness and has become a serious public health problem worldwide. The need to study the HA and NA genes in influenza A virus is essential since these genes frequently undergo mutations. This study describes the development of primer sets for RT-PCR to obtain complete coding sequence of Hemagglutinin (HA) and Neuraminidase (NA) genes of influenza A/H3N2 virus from Indonesia. The primers were developed based on influenza A/H3N2 sequence worldwide from Global Initiative on Sharing All Influenza Data (GISAID) and further tested using Indonesian influenza A/H3N2 archived samples of influenza-like illness (ILI) surveillance from 2008 to 2009. An optimum RT-PCR condition was acquired for all HA and NA fragments designed to cover complete coding sequence of HA and NA genes. A total of 71 samples were successfully sequenced for complete coding sequence both of HA and NA genes out of 145 samples of influenza A/H3N2 tested. The developed primer sets were suitable for obtaining complete coding sequences of HA and NA genes of Indonesian samples from 2008 to 2009.
Suppression of a NAC-like transcription factor gene improves boron-toxicity tolerance in rice.

PubMed

Ochiai, Kumiko; Shimizu, Akifumi; Okumoto, Yutaka; Fujiwara, Toru; Matoh, Toru

2011-07-01

We identified a gene responsible for tolerance to boron (B) toxicity in rice (Oryza sativa), named BORON EXCESS TOLERANT1. Using recombinant inbred lines derived from the B-toxicity-sensitive indica-ecotype cultivar IR36 and the tolerant japonica-ecotype cultivar Nekken 1, the region responsible for tolerance to B toxicity was narrowed to 49 kb on chromosome 4. Eight genes are annotated in this region. The DNA sequence in this region was compared between the B-toxicity-sensitive japonica cultivar Wataribune and the B-toxicity-tolerant japonica cultivar Nipponbare by eco-TILLING analysis and revealed a one-base insertion mutation in the open reading frame sequence of the gene Os04g0477300. The gene encodes a NAC (NAM, ATAF, and CUC)-like transcription factor and the function of the transcript is abolished in B-toxicity-tolerant cultivars. Transgenic plants in which the expression of Os04g0477300 is abolished by RNA interference gain tolerance to B toxicity.
Exome Sequencing Identifies Three Novel Candidate Genes Implicated in Intellectual Disability

PubMed Central

Azam, Maleeha; Ayub, Humaira; Vissers, Lisenka E. L. M.; Gilissen, Christian; Ali, Syeda Hafiza Benish; Riaz, Moeen; Veltman, Joris A.; Pfundt, Rolph; van Bokhoven, Hans; Qamar, Raheel

2014-01-01

Intellectual disability (ID) is a major health problem mostly with an unknown etiology. Recently exome sequencing of individuals with ID identified novel genes implicated in the disease. Therefore the purpose of the present study was to identify the genetic cause of ID in one syndromic and two non-syndromic Pakistani families. Whole exome of three ID probands was sequenced. Missense variations in two plausible novel genes implicated in autosomal recessive ID were identified: lysine (K)-specific methyltransferase 2B (KMT2B), zinc finger protein 589 (ZNF589), as well as hedgehog acyltransferase (HHAT) with a de novo mutation with autosomal dominant mode of inheritance. The KMT2B recessive variant is the first report of recessive Kleefstra syndrome-like phenotype. Identification of plausible causative mutations for two recessive and a dominant type of ID, in genes not previously implicated in disease, underscores the large genetic heterogeneity of ID. These results also support the viewpoint that large number of ID genes converge on limited number of common networks i.e. ZNF589 belongs to KRAB-domain zinc-finger proteins previously implicated in ID, HHAT is predicted to affect sonic hedgehog, which is involved in several disorders with ID, KMT2B associated with syndromic ID fits the epigenetic module underlying the Kleefstra syndromic spectrum. The association of these novel genes in three different Pakistani ID families highlights the importance of screening these genes in more families with similar phenotypes from different populations to confirm the involvement of these genes in pathogenesis of ID. PMID:25405613
Genomic structure and chromosomal localization of GML (GPI-anchored molecule-like protein), a gene induced by p53

DOE Office of Scientific and Technical Information (OSTI.GOV)

Kimura, Yasutoshi; Furuhata, Tomohisa; Nakamura, Yusuke

1997-05-01

Among its known functions, tumor suppressor gene p53 serves as a transcriptional regulator and mediates various signals through activation of downstream genes. We recently identified a novel gene, GML (glycosylphosphatidylinositol (GPI)-anchored molecule-like protein), whose expression is specifically induced by wildtype p53. To characterize the GML gene further, we determined 35.8 kb of DNA sequence that included a consensus binding sequence for p53 and the entire GML gene. The GML gene consists of four exons, and the p53-binding sequence is present in the 5{prime}-flanking region. In genomic organization this gene resembles genes encoding murine Ly-6 glycoproteins, a human homologue of themore » Ly-6 family called RIG-E, and CD59; products of these genes, known as GPI-anchored proteins, are variously involved in signal transduction, cell-cell adhesion, and cell-matrix attachment. FISH analysis revealed that the GML gene is located on human chromosome 8q24.3. Genes encoding at least two other GPI-anchored molecules, E48 and RIG-E, are also located in this region. 20 refs., 2 figs., 1 tab.« less
Selectable antibiotic resistance marker gene-free transgenic rice harbouring the garlic leaf lectin gene exhibits resistance to sap-sucking planthoppers.

PubMed

Sengupta, Subhadipa; Chakraborti, Dipankar; Mondal, Hossain A; Das, Sampa

2010-03-01

Rice, the major food crop of world is severely affected by homopteran sucking pests. We introduced coding sequence of Allium sativum leaf agglutinin, ASAL, in rice cultivar IR64 to develop sustainable resistance against sap-sucking planthoppers as well as eliminated the selectable antibiotic-resistant marker gene hygromycin phosphotransferase (hpt) exploiting cre/lox site-specific recombination system. An expression vector was constructed containing the coding sequence of ASAL, a potent controlling agent against green leafhoppers (GLH, Nephotettix virescens) and brown planthopper (BPH, Nilaparvata lugens). The selectable marker (hpt) gene cassette was cloned within two lox sites of the same vector. Alongside, another vector was developed with chimeric cre recombinase gene cassette. Reciprocal crosses were performed between three single-copy T(0) plants with ASAL- lox-hpt-lox T-DNA and three single-copy T(0) plants with cre-bar T-DNA. Marker gene excisions were detected in T(1) hybrids through hygromycin sensitivity assay. Molecular analysis of T(1) plants exhibited 27.4% recombination efficiency. T(2) progenies of L03C04(1) hybrid parent showed 25% cre negative ASAL-expressing plants. Northern blot, western blot and ELISA showed significant level of ASAL expression in five marker-free T(2) progeny plants. In planta bioassay of GLH and BPH performed on these T(2) progenies exhibited radical reduction in survivability and fecundity compared with the untransformed control plants.

Evolution and structural diversification of Nictaba-like lectin genes in food crops with a focus on soybean (Glycine max).

PubMed

Van Holle, Sofie; Rougé, Pierre; Van Damme, Els J M

2017-03-01

The Nictaba family groups all proteins that show homology to Nictaba, the tobacco lectin. So far, Nictaba and an Arabidopsis thaliana homologue have been shown to be implicated in the plant stress response. The availability of more than 50 sequenced plant genomes provided the opportunity for a genome-wide identification of Nictaba -like genes in 15 species, representing members of the Fabaceae, Poaceae, Solanaceae, Musaceae, Arecaceae, Malvaceae and Rubiaceae. Additionally, phylogenetic relationships between the different species were explored. Furthermore, this study included domain organization analysis, searching for orthologous genes in the legume family and transcript profiling of the Nictaba -like lectin genes in soybean. Using a combination of BLASTp, InterPro analysis and hidden Markov models, the genomes of Medicago truncatula , Cicer arietinum , Lotus japonicus , Glycine max , Cajanus cajan , Phaseolus vulgaris , Theobroma cacao , Solanum lycopersicum , Solanum tuberosum , Coffea canephora , Oryza sativa , Zea mays, Sorghum bicolor , Musa acuminata and Elaeis guineensis were searched for Nictaba -like genes. Phylogenetic analysis was performed using RAxML and additional protein domains in the Nictaba-like sequences were identified using InterPro. Expression analysis of the soybean Nictaba -like genes was investigated using microarray data. Nictaba -like genes were identified in all studied species and analysis of the duplication events demonstrated that both tandem and segmental duplication contributed to the expansion of the Nictaba gene family in angiosperms. The single-domain Nictaba protein and the multi-domain F-box Nictaba architectures are ubiquitous among all analysed species and microarray analysis revealed differential expression patterns for all soybean Nictaba-like genes. Taken together, the comparative genomics data contributes to our understanding of the Nictaba -like gene family in species for which the occurrence of Nictaba domains had not
Massively Parallel Sequencing of Patients with Intellectual Disability, Congenital Anomalies and/or Autism Spectrum Disorders with a Targeted Gene Panel

PubMed Central

Brett, Maggie; McPherson, John; Zang, Zhi Jiang; Lai, Angeline; Tan, Ee-Shien; Ng, Ivy; Ong, Lai-Choo; Cham, Breana; Tan, Patrick; Rozen, Steve; Tan, Ene-Choo

2014-01-01

Developmental delay and/or intellectual disability (DD/ID) affects 1–3% of all children. At least half of these are thought to have a genetic etiology. Recent studies have shown that massively parallel sequencing (MPS) using a targeted gene panel is particularly suited for diagnostic testing for genetically heterogeneous conditions. We report on our experiences with using massively parallel sequencing of a targeted gene panel of 355 genes for investigating the genetic etiology of eight patients with a wide range of phenotypes including DD/ID, congenital anomalies and/or autism spectrum disorder. Targeted sequence enrichment was performed using the Agilent SureSelect Target Enrichment Kit and sequenced on the Illumina HiSeq2000 using paired-end reads. For all eight patients, 81–84% of the targeted regions achieved read depths of at least 20×, with average read depths overlapping targets ranging from 322× to 798×. Causative variants were successfully identified in two of the eight patients: a nonsense mutation in the ATRX gene and a canonical splice site mutation in the L1CAM gene. In a third patient, a canonical splice site variant in the USP9X gene could likely explain all or some of her clinical phenotypes. These results confirm the value of targeted MPS for investigating DD/ID in children for diagnostic purposes. However, targeted gene MPS was less likely to provide a genetic diagnosis for children whose phenotype includes autism. PMID:24690944
Crystal structure of a dimeric mannose-specific agglutinin from garlic: quaternary association and carbohydrate specificity.

PubMed

Chandra, N R; Ramachandraiah, G; Bachhawat, K; Dam, T K; Surolia, A; Vijayan, M

1999-01-22

A mannose-specific agglutinin, isolated from garlic bulbs, has been crystallized in the presence of a large excess of alpha-d-mannose, in space group C2 and cell dimensions, a=203.24, b=43.78, c=79.27 A, beta=112.4 degrees, with two dimers in the asymmetric unit. X-ray diffraction data were collected up to a nominal resolution of 2.4 A and the structure was solved by molecular replacement. The structure, refined to an R-factor of 22.6 % and an Rfree of 27.8 % reveals a beta-prism II fold, similar to that in the snowdrop lectin, comprising three antiparallel four-stranded beta-sheets arranged as a 12-stranded beta-barrel, with an approximate internal 3-fold symmetry. This agglutinin is, however, a dimer unlike snowdrop lectin which exists as a tetramer, despite a high degree of sequence similarity between them. A comparison of the two structures reveals a few substitutions in the garlic lectin which stabilise it into a dimer and prevent tetramer formation. Three mannose molecules have been identified on each subunit. In addition, electron density is observed for another possible mannose molecule per dimer resulting in a total of seven mannose molecules in each dimer. Although the mannose binding sites and the overall structure are similar in the subunits of snowdrop and garlic lectin, their specificities to glycoproteins such as GP120 vary considerably. These differences appear, in part, to be a direct consequence of the differences in oligomerisation, implying that variation in quaternary association may be a mode of achieving oligosaccharide specificity in bulb lectins. Copyright 1998 Academic Press.
Reconstructing the Evolutionary History of Paralogous APETALA1/FRUITFULL-Like Genes in Grasses (Poaceae)

PubMed Central

Preston, Jill C.; Kellogg, Elizabeth A.

2006-01-01

Gene duplication is an important mechanism for the generation of evolutionary novelty. Paralogous genes that are not silenced may evolve new functions (neofunctionalization) that will alter the developmental outcome of preexisting genetic pathways, partition ancestral functions (subfunctionalization) into divergent developmental modules, or function redundantly. Functional divergence can occur by changes in the spatio-temporal patterns of gene expression and/or by changes in the activities of their protein products. We reconstructed the evolutionary history of two paralogous monocot MADS-box transcription factors, FUL1 and FUL2, and determined the evolution of sequence and gene expression in grass AP1/FUL-like genes. Monocot AP1/FUL-like genes duplicated at the base of Poaceae and codon substitutions occurred under relaxed selection mostly along the branch leading to FUL2. Following the duplication, FUL1 was apparently lost from early diverging taxa, a pattern consistent with major changes in grass floral morphology. Overlapping gene expression patterns in leaves and spikelets indicate that FUL1 and FUL2 probably share some redundant functions, but that FUL2 may have become temporally restricted under partial subfunctionalization to particular stages of floret development. These data have allowed us to reconstruct the history of AP1/FUL-like genes in Poaceae and to hypothesize a role for this gene duplication in the evolution of the grass spikelet. PMID:16816429
Use of the heteroduplex mobility assay and cell sorting to select genome sequences of the CCR5 gene in HEK 293T cells edited by transcription activator-like effector nucleases

PubMed Central

Nerys-Junior, Arildo; Costa, Lendel C.; Braga-Dias, Luciene P.; Oliveira, Márcia; Rossi, Átila D.; da Cunha, Rodrigo Delvecchio; Gonçalves, Gabriel S.; Tanuri, Amilcar

2014-01-01

Engineered nucleases such as zinc finger nucleases (ZFN) and transcription activator-like effector nucleases (TALEN) are one of the most promising tools for modifying genomes. These site-specific enzymes cause double-strand breaks that allow gene disruption or gene insertion, thereby facilitating genetic manipulation. The major problem associated with this approach is the labor-intensive procedures required to screen and confirm the cellular modification by nucleases. In this work, we produced a TALEN that targets the human CCR5 gene and developed a heteroduplex mobility assay for HEK 293T cells to select positive colonies for sequencing. This approach provides a useful tool for the quick detection and easy assessment of nuclease activity. PMID:24688299
Use of the heteroduplex mobility assay and cell sorting to select genome sequences of the CCR5 gene in HEK 293T cells edited by transcription activator-like effector nucleases.

PubMed

Nerys-Junior, Arildo; Costa, Lendel C; Braga-Dias, Luciene P; Oliveira, Márcia; Rossi, Atila D; da Cunha, Rodrigo Delvecchio; Gonçalves, Gabriel S; Tanuri, Amilcar

2014-03-01

Engineered nucleases such as zinc finger nucleases (ZFN) and transcription activator-like effector nucleases (TALEN) are one of the most promising tools for modifying genomes. These site-specific enzymes cause double-strand breaks that allow gene disruption or gene insertion, thereby facilitating genetic manipulation. The major problem associated with this approach is the labor-intensive procedures required to screen and confirm the cellular modification by nucleases. In this work, we produced a TALEN that targets the human CCR5 gene and developed a heteroduplex mobility assay for HEK 293T cells to select positive colonies for sequencing. This approach provides a useful tool for the quick detection and easy assessment of nuclease activity.
Serum antileptospiral agglutinins in freshwater turtles from Southern Brazil

PubMed Central

Silva, Éverton F; Seyffert, Núbia; Cerqueira, Gustavo M.; Leihs, Karl P.; Athanazio, Daniel A.; Valente, Ana L. S.; Dellagostin, Odir A.; Brod, Claudiomar S.

2009-01-01

In this study, we observed the presence of antileptospiral agglutinins in freshwater turtles of two urban lakes of Pelotas, Southern Brazil. Forty animals (29 Trachemys dorbigny and 11 Phrynops hilarii) were captured and studied. Attempts to isolate leptospires from blood and urine samples were unsuccessful. Serum samples (titer > 100) reactive to pathogenic strains were observed in 11 animals. These data encourage surveys of pet turtles to evaluate the risk of transmission of pathogenic leptospires to humans. PMID:24031348
Effect of the lectins wheat germ agglutinin (WGA) and Ulex europaeus agglutinin (UEA-I) on the alpha-amylase secretion of rat pancreas in vitro and in vivo.

PubMed

Mikkat, U; Damm, I; Schröder, G; Schmidt, K; Wirth, C; Weber, H; Jonas, L

1998-05-01

Lectins are able to bind to cholecystokinin (CCK) receptors and other glycosylated membrane proteins. The lectins wheat germ agglutinin (WGA) and Ulex europaeus agglutinin (UEA-I) are used for affinity chromatography to isolate the highly glycosylated CCK-A receptor of pancreatic acinar cells. According to the working hypothesis that lectin binding to the CCK receptor should alter the ligand-receptor interaction, the effect of WGA and UEA-I on CCK-8-induced enzyme secretion was studied on isolated rat pancreatic acini in vitro. In vitro both lectins showed a dosage-dependent inhibition of CCK-8-induced alpha-amylase secretion of acini over 60 min. WGA showed a strong inhibitory effect on amylase secretion, approximately 40%, in vitro. UEA-I caused a smaller, but significant decrease, approximately 20%, in enzyme secretion of isolated acini. Additionally, both lectins inhibited cerulein/secretin- or cerulein-induced pancreatic secretion of rats in vivo, but not after secretin alone. The results are discussed with respect to a possible influence of both lectins on the interaction of CCK or cerulein with the CCK-A receptor.
Nucleotide Sequences of Genes Coding for Fimbrial Proteins in a Cryptic Genospecies of Haemophilus spp. Isolated from Neonatal and Genital Tract Infections

PubMed Central

Gousset, Nathalie; Rosenau, Agnes; Sizaret, Pierre-Yves; Quentin, Roland

1999-01-01

Nineteen isolates belonging to a cryptic genospecies of Haemophilus (referred to here as genital strains) isolated from genital tract infections (6 strains) and from neonatal infections (13 strains) were studied for fimbrial genes. Sixteen strains exhibit peritrichous fimbriae observed by electron microscopy. By PCR with primers corresponding to the extreme ends of the Haemophilus influenzae type b (Hib) hifA and hifD genes and Southern blotting, a hifA-like gene (named ghfA) and a hifD-like gene (named ghfD) were identified in 6 of the 19 strains. Five of these six strains were from the genital tracts of adults, and one was from a neonate. For each gene, the nucleotide sequence was identical for the six strains. A hifE-like gene (named ghfE) was amplified from only one of the 19 genital strains of Haemophilus, but the ghfE probe gave a signal in Southern hybridization with the five other strains positive for ghfA and ghfD. Therefore, these strains may carry a ghfE-like gene. The Hib fimbrial gene cluster is located between the purE and pepN genes as previously described. For the 13 genital Haemophilus strains that lack fimbrial genes, this region corresponds to a noncoding sequence. Another major fimbrial gene designated the fimbrin gene was previously identified in a nontypeable H. influenzae strain. A fimbrin-like gene was identified for all of our 19 genital strains. This gene is similar to the ompP5 gene of many Haemophilus strains. Therefore, other, unidentified genes may explain the piliation observed in electron microscopy on genital Haemophilus strains which do not possess LKP-like fimbrial genes. Fimbrial genes were significantly associated with strains isolated from the genital tract. They may confer on the strain the ability to survive in the genital tract. PMID:9864189
Identification of a novel prophage-like gene cluster actively expressed in both virulent and avirulent strains of Leptospira interrogans serovar Lai.

PubMed

Qin, Jin-Hong; Zhang, Qing; Zhang, Zhi-Ming; Zhong, Yi; Yang, Yang; Hu, Bao-Yu; Zhao, Guo-Ping; Guo, Xiao-Kui

2008-06-01

DNA microarray analysis was used to compare the differential gene expression profiles between Leptospira interrogans serovar Lai type strain 56601 and its corresponding attenuated strain IPAV. A 22-kb genomic island covering a cluster of 34 genes (i.e., genes LA0186 to LA0219) was actively expressed in both strains but concomitantly upregulated in strain 56601 in contrast to that of IPAV. Reverse transcription-PCR assays proved that the gene cluster comprised five transcripts. Gene annotation of this cluster revealed characteristics of a putative prophage-like remnant with at least 8 of 34 sequences encoding prophage-like proteins, of which the LA0195 protein is probably a putative prophage CI-like regulator. The transcription initiation activities of putative promoter-regulatory sequences of transcripts I, II, and III, all proximal to the LA0195 gene, were further analyzed in the Escherichia coli promoter probe vector pKK232-8 by assaying the reporter chloramphenicol acetyltransferase (CAT) activities. The strong promoter activities of both transcripts I and II indicated by the E. coli CAT assay were well correlated with the in vitro sequence-specific binding of the recombinant LA0195 protein to the corresponding promoter probes detected by the electrophoresis mobility shift assay. On the other hand, the promoter activity of transcript III was very low in E. coli and failed to show active binding to the LA0195 protein in vitro. These results suggested that the LA0195 protein is likely involved in the transcription of transcripts I and II. However, the identical complete DNA sequences of this prophage remnant from these two strains strongly suggests that possible regulatory factors or signal transduction systems residing outside of this region within the genome may be responsible for the differential expression profiling in these two strains.
Transgenic expression of a maize geranyl geranyl transferase gene sequence in maize callus increases resistance to ear rot pathogens

USDA-ARS?s Scientific Manuscript database

Determining the genes responsible for pest resistance in maize can allow breeders to develop varieties with lower losses and less contamination with undesirable toxins. A gene sequence coding for a geranyl geranyl transferase-like protein located in a fungal ear rot resistance quantitative trait loc...
Structural insights into the anti-HIV activity of the Oscillatoria agardhii agglutinin homolog lectin family.

PubMed

Koharudin, Leonardus M I; Kollipara, Sireesha; Aiken, Christopher; Gronenborn, Angela M

2012-09-28

Oscillatoria agardhii agglutinin homolog (OAAH) proteins belong to a recently discovered lectin family. All members contain a sequence repeat of ~66 amino acids, with the number of repeats varying among different family members. Apart from data for the founding member OAA, neither three-dimensional structures, information about carbohydrate binding specificities, nor antiviral activity data have been available up to now for any other members of the OAAH family. To elucidate the structural basis for the antiviral mechanism of OAAHs, we determined the crystal structures of Pseudomonas fluorescens and Myxococcus xanthus lectins. Both proteins exhibit the same fold, resembling the founding family member, OAA, with minor differences in loop conformations. Carbohydrate binding studies by NMR and x-ray structures of glycan-lectin complexes reveal that the number of sugar binding sites corresponds to the number of sequence repeats in each protein. As for OAA, tight and specific binding to α3,α6-mannopentaose was observed. All the OAAH proteins described here exhibit potent anti-HIV activity at comparable levels. Altogether, our results provide structural details of the protein-carbohydrate interaction for this novel lectin family and insights into the molecular basis of their HIV inactivation properties.
Identification and characterization of rhizospheric microbial diversity by 16S ribosomal RNA gene sequencing.

PubMed

Naveed, Muhammad; Mubeen, Samavia; Khan, SamiUllah; Ahmed, Iftikhar; Khalid, Nauman; Suleria, Hafiz Ansar Rasul; Bano, Asghari; Mumtaz, Abdul Samad

2014-01-01

In the present study, samples of rhizosphere and root nodules were collected from different areas of Pakistan to isolate plant growth promoting rhizobacteria. Identification of bacterial isolates was made by 16S rRNA gene sequence analysis and taxonomical confirmation on EzTaxon Server. The identified bacterial strains were belonged to 5 genera i.e. Ensifer, Bacillus, Pseudomona, Leclercia and Rhizobium. Phylogenetic analysis inferred from 16S rRNA gene sequences showed the evolutionary relationship of bacterial strains with the respective genera. Based on phylogenetic analysis, some candidate novel species were also identified. The bacterial strains were also characterized for morphological, physiological, biochemical tests and glucose dehydrogenase (gdh) gene that involved in the phosphate solublization using cofactor pyrroloquinolone quinone (PQQ). Seven rhizoshperic and 3 root nodulating stains are positive for gdh gene. Furthermore, this study confirms a novel association between microbes and their hosts like field grown crops, leguminous and non-leguminous plants. It was concluded that a diverse group of bacterial population exist in the rhizosphere and root nodules that might be useful in evaluating the mechanisms behind plant microbial interactions and strains QAU-63 and QAU-68 have sequence similarity of 97 and 95% which might be declared as novel after further taxonomic characterization.
Identification and characterization of rhizospheric microbial diversity by 16S ribosomal RNA gene sequencing

PubMed Central

Naveed, Muhammad; Mubeen, Samavia; khan, SamiUllah; Ahmed, Iftikhar; Khalid, Nauman; Suleria, Hafiz Ansar Rasul; Bano, Asghari; Mumtaz, Abdul Samad

2014-01-01

In the present study, samples of rhizosphere and root nodules were collected from different areas of Pakistan to isolate plant growth promoting rhizobacteria. Identification of bacterial isolates was made by 16S rRNA gene sequence analysis and taxonomical confirmation on EzTaxon Server. The identified bacterial strains were belonged to 5 genera i.e. Ensifer, Bacillus, Pseudomona, Leclercia and Rhizobium. Phylogenetic analysis inferred from 16S rRNA gene sequences showed the evolutionary relationship of bacterial strains with the respective genera. Based on phylogenetic analysis, some candidate novel species were also identified. The bacterial strains were also characterized for morphological, physiological, biochemical tests and glucose dehydrogenase (gdh) gene that involved in the phosphate solublization using cofactor pyrroloquinolone quinone (PQQ). Seven rhizoshperic and 3 root nodulating stains are positive for gdh gene. Furthermore, this study confirms a novel association between microbes and their hosts like field grown crops, leguminous and non-leguminous plants. It was concluded that a diverse group of bacterial population exist in the rhizosphere and root nodules that might be useful in evaluating the mechanisms behind plant microbial interactions and strains QAU-63 and QAU-68 have sequence similarity of 97 and 95% which might be declared as novel after further taxonomic characterization. PMID:25477935
Evolution of bacterial-like phosphoprotein phosphatases in photosynthetic eukaryotes features ancestral mitochondrial or archaeal origin and possible lateral gene transfer.

PubMed

Uhrig, R Glen; Kerk, David; Moorhead, Greg B

2013-12-01

Protein phosphorylation is a reversible regulatory process catalyzed by the opposing reactions of protein kinases and phosphatases, which are central to the proper functioning of the cell. Dysfunction of members in either the protein kinase or phosphatase family can have wide-ranging deleterious effects in both metazoans and plants alike. Previously, three bacterial-like phosphoprotein phosphatase classes were uncovered in eukaryotes and named according to the bacterial sequences with which they have the greatest similarity: Shewanella-like (SLP), Rhizobiales-like (RLPH), and ApaH-like (ALPH) phosphatases. Utilizing the wealth of data resulting from recently sequenced complete eukaryotic genomes, we conducted database searching by hidden Markov models, multiple sequence alignment, and phylogenetic tree inference with Bayesian and maximum likelihood methods to elucidate the pattern of evolution of eukaryotic bacterial-like phosphoprotein phosphatase sequences, which are predominantly distributed in photosynthetic eukaryotes. We uncovered a pattern of ancestral mitochondrial (SLP and RLPH) or archaeal (ALPH) gene entry into eukaryotes, supplemented by possible instances of lateral gene transfer between bacteria and eukaryotes. In addition to the previously known green algal and plant SLP1 and SLP2 protein forms, a more ancestral third form (SLP3) was found in green algae. Data from in silico subcellular localization predictions revealed class-specific differences in plants likely to result in distinct functions, and for SLP sequences, distinctive and possibly functionally significant differences between plants and nonphotosynthetic eukaryotes. Conserved carboxyl-terminal sequence motifs with class-specific patterns of residue substitutions, most prominent in photosynthetic organisms, raise the possibility of complex interactions with regulatory proteins.
Analysis of conifer FLOWERING LOCUS T/TERMINAL FLOWER1-like genes provides evidence for dramatic biochemical evolution in the angiosperm FT lineage.

PubMed

Klintenäs, Maria; Pin, Pierre A; Benlloch, Reyes; Ingvarsson, Pär K; Nilsson, Ove

2012-12-01

In flowering plants, homologs of the Arabidopsis phosphatidylethanolamine-binding protein (PEBP) FLOWERING LOCUS T (FT) are key components in controlling flowering time. We show here that, although FT homologs are found in all angiosperms with completed genome sequences, there is no evidence to date that FT-like genes exist in other groups of plants. Through phylogeny reconstructions and heterologous expression, we examined the biochemical function of the Picea (spruces) and Pinus (pines) PEBP families - two gymnosperm taxa phylogenetically distant from the angiosperms. We have defined a lineage of gymnosperm PEBP genes, termed the FT/TERMINAL FLOWER1 (TFL1)-like genes, that share sequence characteristics with both the angiosperm FT- and TFL1-like clades. When expressed in Arabidopsis, FT/TFL1-like genes repressed flowering, indicating that the proteins are biochemically more similar to the angiosperm TFL1-like proteins than to the FT-like proteins. This suggests that the regulation of the vegetative-to-reproductive switch might differ in gymnosperms compared with angiosperms. Molecular evolution studies suggest that plasticity at exon 4 contributes to the divergence of FT-like function in floral promotion. In addition, the presence of FT-like genes in basal angiosperms indicates that the FT-like function emerged at an early stage during the evolution of flowering plants as a means to regulate flowering time. © 2012 The Authors. New Phytologist © 2012 New Phytologist Trust.
X-exome sequencing of 405 unresolved families identifies seven novel intellectual disability genes.

PubMed

Hu, H; Haas, S A; Chelly, J; Van Esch, H; Raynaud, M; de Brouwer, A P M; Weinert, S; Froyen, G; Frints, S G M; Laumonnier, F; Zemojtel, T; Love, M I; Richard, H; Emde, A-K; Bienek, M; Jensen, C; Hambrock, M; Fischer, U; Langnick, C; Feldkamp, M; Wissink-Lindhout, W; Lebrun, N; Castelnau, L; Rucci, J; Montjean, R; Dorseuil, O; Billuart, P; Stuhlmann, T; Shaw, M; Corbett, M A; Gardner, A; Willis-Owen, S; Tan, C; Friend, K L; Belet, S; van Roozendaal, K E P; Jimenez-Pocquet, M; Moizard, M-P; Ronce, N; Sun, R; O'Keeffe, S; Chenna, R; van Bömmel, A; Göke, J; Hackett, A; Field, M; Christie, L; Boyle, J; Haan, E; Nelson, J; Turner, G; Baynam, G; Gillessen-Kaesbach, G; Müller, U; Steinberger, D; Budny, B; Badura-Stronka, M; Latos-Bieleńska, A; Ousager, L B; Wieacker, P; Rodríguez Criado, G; Bondeson, M-L; Annerén, G; Dufke, A; Cohen, M; Van Maldergem, L; Vincent-Delorme, C; Echenne, B; Simon-Bouy, B; Kleefstra, T; Willemsen, M; Fryns, J-P; Devriendt, K; Ullmann, R; Vingron, M; Wrogemann, K; Wienker, T F; Tzschach, A; van Bokhoven, H; Gecz, J; Jentsch, T J; Chen, W; Ropers, H-H; Kalscheuer, V M

2016-01-01

X-linked intellectual disability (XLID) is a clinically and genetically heterogeneous disorder. During the past two decades in excess of 100 X-chromosome ID genes have been identified. Yet, a large number of families mapping to the X-chromosome remained unresolved suggesting that more XLID genes or loci are yet to be identified. Here, we have investigated 405 unresolved families with XLID. We employed massively parallel sequencing of all X-chromosome exons in the index males. The majority of these males were previously tested negative for copy number variations and for mutations in a subset of known XLID genes by Sanger sequencing. In total, 745 X-chromosomal genes were screened. After stringent filtering, a total of 1297 non-recurrent exonic variants remained for prioritization. Co-segregation analysis of potential clinically relevant changes revealed that 80 families (20%) carried pathogenic variants in established XLID genes. In 19 families, we detected likely causative protein truncating and missense variants in 7 novel and validated XLID genes (CLCN4, CNKSR2, FRMPD4, KLHL15, LAS1L, RLIM and USP27X) and potentially deleterious variants in 2 novel candidate XLID genes (CDK16 and TAF1). We show that the CLCN4 and CNKSR2 variants impair protein functions as indicated by electrophysiological studies and altered differentiation of cultured primary neurons from Clcn4(-/-) mice or after mRNA knock-down. The newly identified and candidate XLID proteins belong to pathways and networks with established roles in cognitive function and intellectual disability in particular. We suggest that systematic sequencing of all X-chromosomal genes in a cohort of patients with genetic evidence for X-chromosome locus involvement may resolve up to 58% of Fragile X-negative cases.
X-exome sequencing of 405 unresolved families identifies seven novel intellectual disability genes

PubMed Central

Hu, H; Haas, S A; Chelly, J; Van Esch, H; Raynaud, M; de Brouwer, A P M; Weinert, S; Froyen, G; Frints, S G M; Laumonnier, F; Zemojtel, T; Love, M I; Richard, H; Emde, A-K; Bienek, M; Jensen, C; Hambrock, M; Fischer, U; Langnick, C; Feldkamp, M; Wissink-Lindhout, W; Lebrun, N; Castelnau, L; Rucci, J; Montjean, R; Dorseuil, O; Billuart, P; Stuhlmann, T; Shaw, M; Corbett, M A; Gardner, A; Willis-Owen, S; Tan, C; Friend, K L; Belet, S; van Roozendaal, K E P; Jimenez-Pocquet, M; Moizard, M-P; Ronce, N; Sun, R; O'Keeffe, S; Chenna, R; van Bömmel, A; Göke, J; Hackett, A; Field, M; Christie, L; Boyle, J; Haan, E; Nelson, J; Turner, G; Baynam, G; Gillessen-Kaesbach, G; Müller, U; Steinberger, D; Budny, B; Badura-Stronka, M; Latos-Bieleńska, A; Ousager, L B; Wieacker, P; Rodríguez Criado, G; Bondeson, M-L; Annerén, G; Dufke, A; Cohen, M; Van Maldergem, L; Vincent-Delorme, C; Echenne, B; Simon-Bouy, B; Kleefstra, T; Willemsen, M; Fryns, J-P; Devriendt, K; Ullmann, R; Vingron, M; Wrogemann, K; Wienker, T F; Tzschach, A; van Bokhoven, H; Gecz, J; Jentsch, T J; Chen, W; Ropers, H-H; Kalscheuer, V M

2016-01-01

X-linked intellectual disability (XLID) is a clinically and genetically heterogeneous disorder. During the past two decades in excess of 100 X-chromosome ID genes have been identified. Yet, a large number of families mapping to the X-chromosome remained unresolved suggesting that more XLID genes or loci are yet to be identified. Here, we have investigated 405 unresolved families with XLID. We employed massively parallel sequencing of all X-chromosome exons in the index males. The majority of these males were previously tested negative for copy number variations and for mutations in a subset of known XLID genes by Sanger sequencing. In total, 745 X-chromosomal genes were screened. After stringent filtering, a total of 1297 non-recurrent exonic variants remained for prioritization. Co-segregation analysis of potential clinically relevant changes revealed that 80 families (20%) carried pathogenic variants in established XLID genes. In 19 families, we detected likely causative protein truncating and missense variants in 7 novel and validated XLID genes (CLCN4, CNKSR2, FRMPD4, KLHL15, LAS1L, RLIM and USP27X) and potentially deleterious variants in 2 novel candidate XLID genes (CDK16 and TAF1). We show that the CLCN4 and CNKSR2 variants impair protein functions as indicated by electrophysiological studies and altered differentiation of cultured primary neurons from Clcn4−/− mice or after mRNA knock-down. The newly identified and candidate XLID proteins belong to pathways and networks with established roles in cognitive function and intellectual disability in particular. We suggest that systematic sequencing of all X-chromosomal genes in a cohort of patients with genetic evidence for X-chromosome locus involvement may resolve up to 58% of Fragile X-negative cases. PMID:25644381
Two Multiplex Real-Time PCR Assays to Detect and Differentiate Acinetobacter baumannii and Non- baumannii Acinetobacter spp. Carrying blaNDM, blaOXA-23-Like, blaOXA-40-Like, blaOXA-51-Like, and blaOXA-58-Like Genes

PubMed Central

Yang, Qiu; Rui, Yongyu

2016-01-01

Nosocomial infections caused by Acinetobacter spp. resistant to carbapenems are increasingly reported worldwide. Carbapenem-resistant Acinetobacter (CRA) is becoming a serious concern with increasing patient morbidity, mortality, and lengths of hospital stay. Therefore, the rapid detection of CRA is essential for epidemiological surveillance. Polymerase chain reaction (PCR) has been extensively used for the rapid identification of most pathogens. In this study, we have developed two multiplex real-time PCR assays to detect and differentiate A. baumannii and non-A. baumannii Acinetobacter spp, and common carbapenemase genes, including blaNDM, blaOXA-23-like, blaOXA-40-like, blaOXA-51-like, and blaOXA-58-like. We demonstrate the potential utility of these assays for the direct detection of blaNDM-, blaOXA-23-like-, blaOXA-40-like-, blaOXA-51-like-, and blaOXA-58-like-positive CRA in clinical specimens. Primers were specifically designed, and two multiplex real-time PCR assays were developed: multiplex real-time PCR assay1 for the detection of Acinetobacter baumannii 16S–23S rRNA internal transcribed spacer sequence, the Acinetobacter recA gene, and class-B-metalloenzyme-encoding gene blaNDM; and multiplex real-time PCR assay2 to detect class-D-oxacillinase-encoding genes (blaOXA-23-like, blaOXA-40-like, blaOXA-51-like,and blaOXA-58-like). The assays were performed on an ABI Prism 7500 FAST Real-Time PCR System. CRA isolates were used to compare the assays with conventional PCR and sequencing. Known amounts of CRA cells were added to sputum and fecal specimens and used to test the multiplex real-time PCR assays. The results for target and nontarget amplification showed that the multiplex real-time PCR assays were specific, the limit of detection for each target was 10 copies per 20 μL reaction volume, the assays were linear over six log dilutions of the target genes (r2 > 0.99), and the Ct values of the coefficients of variation for intra- and interassay
Sequence and expression analyses of porcine ISG15 and ISG43 genes.

PubMed

Huang, Jiangnan; Zhao, Shuhong; Zhu, Mengjin; Wu, Zhenfang; Yu, Mei

2009-08-01

The coding sequences of porcine interferon-stimulated gene 15 (ISG15) and the interferon-stimulated gene (ISG43) were cloned from swine spleen mRNA. The amino acid sequences deduced from porcine ISG15 and ISG43 genes coding sequence shared 24-75% and 29-83% similarity with ISG15s and ISG43s from other vertebrates, respectively. Structural analyses revealed that porcine ISG15 comprises two ubiquitin homologues motifs (UBQ) domain and a conserved C-terminal LRLRGG conjugating motif. Porcine ISG43 contains an ubiquitin-processing proteases-like domain. Phylogenetic analyses showed that porcine ISG15 and ISG43 were mostly related to rat ISG15 and cattle ISG43, respectively. Using quantitative real-time PCR assay, significant increased expression levels of porcine ISG15 and ISG43 genes were detected in porcine kidney endothelial cells (PK15) cells treated with poly I:C. We also observed the enhanced mRNA expression of three members of dsRNA pattern-recognition receptors (PRR), TLR3, DDX58 and IFIH1, which have been reported to act as critical receptors in inducing the mRNA expression of ISG15 and ISG43 genes. However, we did not detect any induced mRNA expression of IFNalpha and IFNbeta, suggesting that transcriptional activations of ISG15 and ISG43 were mediated through IFN-independent signaling pathway in the poly I:C treated PK15 cells. Association analyses in a Landrace pig population revealed that ISG15 c.347T>C (BstUI) polymorphism and the ISG43 c.953T>G (BccI) polymorphism were significantly associated with hematological parameters and immune-related traits.

Demonstration of vascular endothelium in thyroid carcinomas using Ulex europaeus I agglutinin.

PubMed

González-Cámpora, R; Montero, C; Martin-Lacave, I; Galera, H

1986-03-01

The usefulness of using peroxidase-labelled Ulex europaeus agglutinin I for the staining of small vessels and capillaries in the capsule of thyroid tumours is demonstrated. With this procedure the scanning for small tumour deposits in those vessels and, consequently, the diagnosis of follicular carcinoma of the thyroid is facilitated.
[Successful treatment with rituximab in a patient with splenic marginal zone B-cell lymphoma accompanied by cold agglutinin disease].

PubMed

Yasuyama, Masako; Kawauchi, Kiyotaka; Otsuka, Kuniaki; Tamura, Hiroyuki; Fujibayashi, Mariko

2014-01-01

An 81-year-old man was admitted to our hospital due to dyspnea in July 2008. A physical examination revealed marked splenomegaly, and the results of laboratory tests were as follows: hemoglobin (Hb)=7.0 g/dL, Ret=6.4%, WBC=24,100/μL (Ly: 20,003/μL), indirect bilirubin=3.6 mg/dL, LDH=232 IU/L. The cold agglutinin titer was 1 : 8,192, and a direct antiglobulin test was positive. A PET scan showed abnormal accumulation in the spleen and bone marrow. A bone marrow aspirate examination and biopsy demonstrated diffuse involvement of abnormal lymphocytes that were found to be positive for CD20 and negative for CD5, CD10, and cyclin D1. The immunoglobulin genes were clonally rearranged. Based on these findings, splenic marginal zone B-cell lymphoma (SMZL) associated with cold agglutinin disease (CAD) was diagnosed. Because the patient refused splenectomy, he was treated with four cycles of rituximab therapy (375 mg/kg, once a week). The Hb level and lymphocyte count subsequently normalized and the splenomegaly resolved. One year later, he relapsed and was again treated with rituximab therapy with complete remission. CAD accompanied by SMZL is very rare. Rituximab may be chosen as an alternative and effective therapeutic option in patients with SMZL-particularly those with autoimmune hemolytic anemia.
Cloning, sequencing, and expression of the Zymomonas mobilis phosphoglycerate mutase gene (pgm) in Escherichia coli.

PubMed Central

Yomano, L P; Scopes, R K; Ingram, L O

1993-01-01

Phosphoglycerate mutase is an essential glycolytic enzyme for Zymomonas mobilis, catalyzing the reversible interconversion of 3-phosphoglycerate and 2-phosphoglycerate. The pgm gene encoding this enzyme was cloned on a 5.2-kbp DNA fragment and expressed in Escherichia coli. Recombinants were identified by using antibodies directed against purified Z. mobilis phosphoglycerate mutase. The pgm gene contains a canonical ribosome-binding site, a biased pattern of codon usage, a long upstream untranslated region, and four promoters which share sequence homology. Interestingly, adhA and a D-specific 2-hydroxyacid dehydrogenase were found on the same DNA fragment and appear to form a cluster of genes which function in central metabolism. The translated sequence for Z. mobilis pgm was in full agreement with the 40 N-terminal amino acid residues determined by protein sequencing. The primary structure of the translated sequence is highly conserved (52 to 60% identity with other phosphoglycerate mutases) and also shares extensive homology with bisphosphoglycerate mutases (51 to 59% identity). Since Southern blots indicated the presence of only a single copy of pgm in the Z. mobilis chromosome, it is likely that the cloned pgm gene functions to provide both activities. Z. mobilis phosphoglycerate mutase is unusual in that it lacks the flexible tail and lysines at the carboxy terminus which are present in the enzyme isolated from all other organisms examined. Images PMID:8320209
Suppression of a NAC-Like Transcription Factor Gene Improves Boron-Toxicity Tolerance in Rice1

PubMed Central

Ochiai, Kumiko; Shimizu, Akifumi; Okumoto, Yutaka; Fujiwara, Toru; Matoh, Toru

2011-01-01

We identified a gene responsible for tolerance to boron (B) toxicity in rice (Oryza sativa), named BORON EXCESS TOLERANT1. Using recombinant inbred lines derived from the B-toxicity-sensitive indica-ecotype cultivar IR36 and the tolerant japonica-ecotype cultivar Nekken 1, the region responsible for tolerance to B toxicity was narrowed to 49 kb on chromosome 4. Eight genes are annotated in this region. The DNA sequence in this region was compared between the B-toxicity-sensitive japonica cultivar Wataribune and the B-toxicity-tolerant japonica cultivar Nipponbare by eco-TILLING analysis and revealed a one-base insertion mutation in the open reading frame sequence of the gene Os04g0477300. The gene encodes a NAC (NAM, ATAF, and CUC)-like transcription factor and the function of the transcript is abolished in B-toxicity-tolerant cultivars. Transgenic plants in which the expression of Os04g0477300 is abolished by RNA interference gain tolerance to B toxicity. PMID:21543724
Kinetics of photobleaching of aqueous solutions of ricin agglutinin in the presence of guanidine chloride

NASA Astrophysics Data System (ADS)

Brandt, Nikolai N.; Chikishev, Andrey Y.

2002-05-01

Kinetics of background decay in Raman spectra of aqueous solutions of ricin agglutinin in the presence of guanidine chloride were measured. The differences in the kinetics of photobleaching are discussed.
Effect of Dactylogyrus catlaius (Jain 1961) infection in Labeo rohita (Hamilton 1822): innate immune responses and expression profile of some immune related genes.

PubMed

Dash, Pujarini; Kar, Banya; Mishra, Arpita; Sahoo, P K

2014-03-01

The monogenean ectoparasite, Dactylogyrus sp. is a major pathogen in freshwater aquaculture. The immune responses in parasitized fish were analyzed by quantitation of innate immune factors (natural agglutinin level, haemolysin titre, antiprotease, lysozyme and myeloperoxidase activities) in serum and immune-relevant gene expression in gill and anterior kidney. The antiprotease activity and natural agglutinin level were found to be significantly higher and lysozyme activity was significantly lower in parasitized fish. Most of the genes viz., beta2-microglobulin (beta2M), major histocompatibility complex I (MHCI), MHCII, tumor necrosis factor alpha (TNFalpha) and toll-like receptor 22 (TLR22) in gill samples were significantly down-regulated in the experimental group. In the anterior kidney, the expression of superoxide dismutase and interleukin 1beta (IL1beta) were significantly up-regulated whereas a significant down regulation of MHCII and TNFalpha was also observed. The down-regulation of most of the genes viz, MHCI, beta2M, MHCII, TLR22 and TNFalpha in infected gills indicated a well evolved mechanism in this parasite to escape the host immune response. The modulation of innate and adaptive immunity by this parasite can be further explored to understand host susceptibility.
Recombination-dependent replication and gene conversion homogenize repeat sequences and diversify plastid genome structure.

PubMed

Ruhlman, Tracey A; Zhang, Jin; Blazier, John C; Sabir, Jamal S M; Jansen, Robert K

2017-04-01

There is a misinterpretation in the literature regarding the variable orientation of the small single copy region of plastid genomes (plastomes). The common phenomenon of small and large single copy inversion, hypothesized to occur through intramolecular recombination between inverted repeats (IR) in a circular, single unit-genome, in fact, more likely occurs through recombination-dependent replication (RDR) of linear plastome templates. If RDR can be primed through both intra- and intermolecular recombination, then this mechanism could not only create inversion isomers of so-called single copy regions, but also an array of alternative sequence arrangements. We used Illumina paired-end and PacBio single-molecule real-time (SMRT) sequences to characterize repeat structure in the plastome of Monsonia emarginata (Geraniaceae). We used OrgConv and inspected nucleotide alignments to infer ancestral nucleotides and identify gene conversion among repeats and mapped long (>1 kb) SMRT reads against the unit-genome assembly to identify alternative sequence arrangements. Although M. emarginata lacks the canonical IR, we found that large repeats (>1 kilobase; kb) represent ∼22% of the plastome nucleotide content. Among the largest repeats (>2 kb), we identified GC-biased gene conversion and mapping filtered, long SMRT reads to the M. emarginata unit-genome assembly revealed alternative, substoichiometric sequence arrangements. We offer a model based on RDR and gene conversion between long repeated sequences in the M. emarginata plastome and provide support that both intra-and intermolecular recombination between large repeats, particularly in repeat-rich plastomes, varies unit-genome structure while homogenizing the nucleotide sequence of repeats. © 2017 Botanical Society of America.
Analysis of castor by ELISAs that distinguish Ricin and Ricinus communis agglutinin (RCA)

USDA-ARS?s Scientific Manuscript database

To facilitate the analysis of castor (Ricinus communis L.) seed fractions and germplasm for ricin content, we investigated the use of enzyme-linked immunosorbent assay (ELISA) methods to differentiate between ricin toxin and the related Ricinus communis agglutinin (RCA). Both proteins are based on ...
Expression of homing endonuclease gene and insertion-like element in sea anemone mitochondrial genomes: Lesson learned from Anemonia viridis.

PubMed

Chi, Sylvia Ighem; Urbarova, Ilona; Johansen, Steinar D

2018-04-30

The mitochondrial genomes of sea anemones are dynamic in structure. Invasion by genetic elements, such as self-catalytic group I introns or insertion-like sequences, contribute to sea anemone mitochondrial genome expansion and complexity. By using next generation sequencing we investigated the complete mtDNAs and corresponding transcriptomes of the temperate sea anemone Anemonia viridis and its closer tropical relative Anemonia majano. Two versions of fused homing endonuclease gene (HEG) organization were observed among the Actiniidae sea anemones; in-frame gene fusion and pseudo-gene fusion. We provided support for the pseudo-gene fusion organization in Anemonia species, resulting in a repressed HEG from the COI-884 group I intron. orfA, a putative protein-coding gene with insertion-like features, was present in both Anemonia species. Interestingly, orfA and COI expression were significantly up-regulated upon long-term environmental stress corresponding to low seawater pH conditions. This study provides new insights to the dynamics of sea anemone mitochondrial genome structure and function. Copyright © 2018 Elsevier B.V. All rights reserved.
Molecular genetic characterization of the RD-114 gene family of endogenous feline retroviral sequences.

PubMed Central

Reeves, R H; O'Brien, S J

1984-01-01

RD-114 is a replication-competent, xenotropic retrovirus which is homologous to a family of moderately repetitive DNA sequences present at ca. 20 copies in the normal cellular genome of domestic cats. To examine the extent and character of genomic divergence of the RD-114 gene family as well as to assess their positional association within the cat genome, we have prepared a series of molecular clones of endogenous RD-114 DNA segments from a genomic library of cat cellular DNA. Their restriction endonuclease maps were compared with each other as well as to that of the prototype-inducible RD-114 which was molecularly cloned from a chronically infected human cell line. The endogenous sequences analyzed were similar to each other in that they were colinear with RD-114 proviral DNA, were bounded by long terminal redundancies, and conserved many restriction sites in the gag and pol regions. However, the env regions of many of the sequences examined were substantially deleted. Several of the endogenous RD-114 genomes contained a novel envelope sequence which was unrelated to the env gene of the prototype RD-114 env gene but which, like RD-114 and endogenous feline leukemia virus provirus, was found only in species of the genus Felis, and not in other closely related Felidae genera. The endogenous RD-114 sequences each had a distinct cellular flank which indicates that these sequences are not tandem but dispersed nonspecifically throughout the genome. Southern analysis of cat cellular DNA confirmed the conclusions about conserved restriction sites in endogenous sequences and indicated that a single locus may be responsible for the production of the major inducible form of RD-114. Images PMID:6090693
Identification of Putative Precursor Genes for the Biosynthesis of Cannabinoid-Like Compound in Radula marginata

PubMed Central

Hussain, Tajammul; Plunkett, Blue; Ejaz, Mahwish; Espley, Richard V.; Kayser, Oliver

2018-01-01

The liverwort Radula marginata belongs to the bryophyte division of land plants and is a prospective alternate source of cannabinoid-like compounds. However, mechanistic insights into the molecular pathways directing the synthesis of these cannabinoid-like compounds have been hindered due to the lack of genetic information. This prompted us to do deep sequencing, de novo assembly and annotation of R. marginata transcriptome, which resulted in the identification and validation of the genes for cannabinoid biosynthetic pathway. In total, we have identified 11,421 putative genes encoding 1,554 enzymes from 145 biosynthetic pathways. Interestingly, we have identified all the upstream genes of the central precursor of cannabinoid biosynthesis, cannabigerolic acid (CBGA), including its two first intermediates, stilbene acid (SA) and geranyl diphosphate (GPP). Expression of all these genes was validated using quantitative real-time PCR. We have characterized the protein structure of stilbene synthase (STS), which is considered as a homolog of olivetolic acid in R. marginata. Moreover, the metabolomics approach enabled us to identify CBGA-analogous compounds using electrospray ionization mass spectrometry (ESI-MS/MS) and gas chromatography mass spectrometry (GC-MS). Transcriptomic analysis revealed 1085 transcription factors (TF) from 39 families. Comparative analysis showed that six TF families have been uniquely predicted in R. marginata. In addition, the bioinformatics analysis predicted a large number of simple sequence repeats (SSRs) and non-coding RNAs (ncRNAs). Our results collectively provide mechanistic insights into the putative precursor genes for the biosynthesis of cannabinoid-like compounds and a novel transcriptomic resource for R. marginata. The large-scale transcriptomic resource generated in this study would further serve as a reference transcriptome to explore the Radulaceae family.
Identification of genes associated with prophage-like gene transfer agents in the pathogenic intestinal spirochaetes Brachyspira hyodysenteriae, Brachyspira pilosicoli and Brachyspira intermedia.

PubMed

Motro, Yair; La, Tom; Bellgard, Matthew I; Dunn, David S; Phillips, Nyree D; Hampson, David J

2009-03-02

VSH-1 is an unusual prophage-like gene transfer agent (GTA) that has been described in the intestinal spirochaete Brachyspira hyodysenteriae. The GTA does not self-propagate, but it assembles into a virus-like particle and transfers random 7.5kb fragments of host DNA to other B. hyodysenteriae cells. To date the GTA VSH-1 has only been analysed in B. hyodysenteriae strain B204, in which 11 late function genes encoding prophage capsid, tail and lysis elements have been described. The aim of the current study was to look for these 11 genes in the near-complete genomes of B. hyodysenteriae WA1, B. pilosicoli 95/1000 and B. intermedia HB60. All 11 genes were found in the three new strains. The GTA genes in WA1 and 95/1000 were contiguous, whilst some of those in HB60 were not-although in all three strains some gene rearrangements were present. A new predicted open reading frame with potential functional importance was found in a consistent position associated with all four GTAs, located between the genes for head protein Hvp24 and tail protein Hvp53, overlapping with the hvp24 sequence. Differences in the nucleotide and predicted amino acid sequences of the GTA genes in the spirochaete strains were consistent with the overall genetic distances between the strains. Hence the GTAs in the two B. hyodysenteriae strains were considered to be strain specific variants, and were designated GTA/Bh-B204 and GTA/Bh-WA1 respectively. The GTAs in the strains of B. intermedia and B. pilosicoli were designated GTA/Bint-HB60 and GTA/Bp-95/1000 respectively. Further work is required to determine the extent to which these GTAs can transfer host genes between different Brachyspira species and strains.
Single molecule targeted sequencing for cancer gene mutation detection.

PubMed

Gao, Yan; Deng, Liwei; Yan, Qin; Gao, Yongqian; Wu, Zengding; Cai, Jinsen; Ji, Daorui; Li, Gailing; Wu, Ping; Jin, Huan; Zhao, Luyang; Liu, Song; Ge, Liangjin; Deem, Michael W; He, Jiankui

2016-05-19

With the rapid decline in cost of sequencing, it is now affordable to examine multiple genes in a single disease-targeted clinical test using next generation sequencing. Current targeted sequencing methods require a separate step of targeted capture enrichment during sample preparation before sequencing. Although there are fast sample preparation methods available in market, the library preparation process is still relatively complicated for physicians to use routinely. Here, we introduced an amplification-free Single Molecule Targeted Sequencing (SMTS) technology, which combined targeted capture and sequencing in one step. We demonstrated that this technology can detect low-frequency mutations using artificially synthesized DNA sample. SMTS has several potential advantages, including simple sample preparation thus no biases and errors are introduced by PCR reaction. SMTS has the potential to be an easy and quick sequencing technology for clinical diagnosis such as cancer gene mutation detection, infectious disease detection, inherited condition screening and noninvasive prenatal diagnosis.
Simultaneous differentiation and quantification of ricin and agglutinin by an antibody-sandwich surface plasmon resonance sensor.

PubMed

Stern, Daniel; Pauly, Diana; Zydek, Martin; Müller, Christian; Avondet, Marc A; Worbs, Sylvia; Lisdat, Fred; Dorner, Martin B; Dorner, Brigitte G

2016-04-15

Ricin is one of the most toxic plant toxins known. Its accessibility and relative ease of preparation makes it a potential agent for criminal or bio-terrorist attacks. Detection of ricin from unknown samples requires differentiation of ricin from the highly homologous Ricinus communis agglutinin which is currently not feasible using immunological methods. Here we have developed a simple and sensitive surface plasmon resonance (SPR) sensing system for rapid differentiation between ricin and agglutinin done in real time. Both lectins were quantified in a sandwich immunoassay-like setting by capturing with a cross-reactive antibody (R109) binding to both proteins while differentiating by injection of a ricin-specific antibody (R18) in a subsequent enhancement step. The SPR-assay was reproducible and sensitive for different R. communis cultivars, showing no false positive results when other lectins were tested. Quantification and differentiation of both molecules was also demonstrated from a crude castor bean extract and complex matrices. For the first time, we have demonstrated how the closely related lectins can be discerned and quantified in a single assay based on immunological methods. This novel approach delivers crucial information regarding the composition, purity, concentration, and toxicity of suspicious samples containing ricin in less than 30 minutes. Furthermore, we show how enhancement injections during SPR-measurements can be used to determine the ratio of two related proteins independently of the actual protein concentration by comparing normalized enhancement response levels. Copyright © 2015 Elsevier B.V. All rights reserved.
SxtA gene sequence analysis of dinoflagellate Alexandrium minutum

NASA Astrophysics Data System (ADS)

Norshaha, Safida Anira; Latib, Norhidayu Abdul; Usup, Gires; Yusof, Nurul Yuziana Mohd

2015-09-01

The dinoflagellate Alexandrium minutum is typically known for the production of potent neurotoxins such as saxitoxin, affecting the health of human seafood consumers via paralytic shellfish poisoning (PSP). These phenomena is related to the harmful algal blooms (HABs) that is believed to be influenced by environmental and nutritional factors. Previous study has revealed that SxtA gene is a starting gene that involved in the saxitoxin production pathway. The aim of this study was to analyse the sequence of the sxtA gene in A. minutum. The dinoflagellates culture was cultured at temperature 26°C with 16:8-hour light:dark photocycle. After the samples were harvested, RNA was extracted, complementary DNA (cDNA) was synthesised and amplified by polymerase chain reaction (PCR). The PCR products were then purified and cloned before sequenced. The SxtA sequence obtained was then analyzed in order to identify the presence of SxtA gene in Alexandrium minutum.
Variation of clinical expression in patients with Stargardt dystrophy and sequence variations in the ABCR gene.

PubMed

Fishman, G A; Stone, E M; Grover, S; Derlacki, D J; Haines, H L; Hockey, R R

1999-04-01

To report the spectrum of ophthalmic findings in patients with Stargardt dystrophy or fundus flavimaculatus who have a specific sequence variation in the ABCR gene. Twenty-nine patients with Stargardt dystrophy or fundus flavimaculatus from different pedigrees were identified with possible disease-causing sequence variations in the ABCR gene from a group of 66 patients who were screened for sequence variations in this gene. Patients underwent a routine ocular examination, including slitlamp biomicroscopy and a dilated fundus examination. Fluorescein angiography was performed on 22 patients, and electroretinographic measurements were obtained on 24 of 29 patients. Kinetic visual fields were measured with a Goldmann perimeter in 26 patients. Single-strand conformation polymorphism analysis and DNA sequencing were used to identify variations in coding sequences of the ABCR gene. Three clinical phenotypes were observed among these 29 patients. In phenotype I, 9 of 12 patients had a sequence change in exon 42 of the ABCR gene in which the amino acid glutamic acid was substituted for glycine (Gly1961Glu). In only 4 of these 9 patients was a second possible disease-causing mutation found on the other ABCR allele. In addition to an atrophic-appearing macular lesion, phenotype I was characterized by localized perifoveal yellowish white flecks, the absence of a dark choroid, and normal electroretinographic amplitudes. Phenotype II consisted of 10 patients who showed a dark choroid and more diffuse yellowish white flecks in the fundus. None exhibited the Gly1961Glu change. Phenotype III consisted of 7 patients who showed extensive atrophic-appearing changes of the retinal pigment epithelium. Electroretinographic cone and rod amplitudes were reduced. One patient showed the Gly1961Glu change. A wide variation in clinical phenotype can occur in patients with sequence changes in the ABCR gene. In individual patients, a certain phenotype seems to be associated with the presence of
Nucleotide sequence of the L1 ribosomal protein gene of Xenopus laevis: remarkable sequence homology among introns.

PubMed Central

Loreni, F; Ruberti, I; Bozzoni, I; Pierandrei-Amaldi, P; Amaldi, F

1985-01-01

Ribosomal protein L1 is encoded by two genes in Xenopus laevis. The comparison of two cDNA sequences shows that the two L1 gene copies (L1a and L1b) have diverged in many silent sites and very few substitution sites; moreover a small duplication occurred at the very end of the coding region of the L1b gene which thus codes for a product five amino acids longer than that coded by L1a. Quantitatively the divergence between the two L1 genes confirms that a whole genome duplication took place in Xenopus laevis approximately 30 million years ago. A genomic fragment containing one of the two L1 gene copies (L1a), with its nine introns and flanking regions, has been completely sequenced. The 5' end of this gene has been mapped within a 20-pyridimine stretch as already found for other vertebrate ribosomal protein genes. Four of the nine introns have a 60-nucleotide sequence with 80% homology; within this region some boxes, one of which is 16 nucleotides long, are 100% homologous among the four introns. This feature of L1a gene introns is interesting since we have previously shown that the activity of this gene is regulated at a post-transcriptional level and it involves the block of the normal splicing of some intron sequences. Images Fig. 3. Fig. 5. PMID:3841512
Nucleotide sequences of the tet(M) genes from the American and Dutch type tetracycline resistance plasmids of Neisseria gonorrhoeae.

PubMed

Gascoyne-Binzi, D M; Heritage, J; Hawkey, P M

1993-11-01

High-level tetracycline-resistant Neisseria gonorrhoeae (TRNG) has been associated with the presence of a plasmid approximately 25.2 MDa in size which carries a Tet M tetracycline resistance determinant. Two different plasmid types, American and Dutch, have previously been described, based on the restriction endonuclease digestion pattern. In this study, the tet(M) genes from the two plasmid types have been amplified by the polymerase chain reaction (PCR) and then sequenced. The gene sequences from the two plasmids shared 96.8% identity, and showed similarities with different segments of the tet(M) gene sequences from Tn1545, Tn916 and Ureaplasma urealyticum. The data suggest that it is highly likely that the Tet M determinant found in the American type plasmid has a different origin from that present in the Dutch plasmid.
Informational structure of genetic sequences and nature of gene splicing

NASA Astrophysics Data System (ADS)

Trifonov, E. N.

1991-10-01

Only about 1/20 of DNA of higher organisms codes for proteins, by means of classical triplet code. The rest of DNA sequences is largely silent, with unclear functions, if any. The triplet code is not the only code (message) carried by the sequences. There are three levels of molecular communication, where the same sequence ``talks'' to various bimolecules, while having, respectively, three different appearances: DNA, RNA and protein. Since the molecular structures and, hence, sequence specific preferences of these are substantially different, the original DNA sequence has to carry simultaneously three types of sequence patterns (codes, messages), thus, being a composite structure in which one had the same letter (nucleotide) is frequently involved in several overlapping codes of different nature. This multiplicity and overlapping of the codes is a unique feature of the Gnomic, language of genetic sequences. The coexisting codes have to be degenerate in various degrees to allow an optimal and concerted performance of all the encoded functions. There is an obvious conflict between the best possible performance of a given function and necessity to compromise the quality of a given sequence pattern in favor of other patterns. It appears that the major role of various changes in the sequences on their ``ontogenetic'' way from DNA to RNA to protein, like RNA editing and splicing, or protein post-translational modifications is to resolve such conflicts. New data are presented strongly indicating that the gene splicing is such a device to resolve the conflict between the code of DNA folding in chromatin and the triplet code for protein synthesis.
A novel sodium bicarbonate cotransporter-like gene in an ancient duplicated region: SLC4A9 at 5q31

PubMed Central

Lipovich, Leonard; Lynch, Eric D; Lee, Ming K; King, Mary-Claire

2001-01-01

Background: Sodium bicarbonate cotransporter (NBC) genes encode proteins that execute coupled Na+ and HCO3- transport across epithelial cell membranes. We report the discovery, characterization, and genomic context of a novel human NBC-like gene, SLC4A9, on chromosome 5q31. Results: SLC4A9 was initially discovered by genomic sequence annotation and further characterized by sequencing of long-insert cDNA library clones. The predicted protein of 990 amino acids has 12 transmembrane domains and high sequence similarity to other NBCs. The 23-exon gene has 14 known mRNA isoforms. In three regions, mRNA sequence variation is generated by the inclusion or exclusion of portions of an exon. Noncoding SLC4A9 cDNAs were recovered multiple times from different libraries. The 3' untranslated region is fragmented into six alternatively spliced exons and contains expressed Alu, LINE and MER repeats. SLC4A9 has two alternative stop codons and six polyadenylation sites. Its expression is largely restricted to the kidney. In silico approaches were used to characterize two additional novel SLC4A genes and to place SLC4A9 within the context of multiple paralogous gene clusters containing members of the epidermal growth factor (EGF), ankyrin (ANK) and fibroblast growth factor (FGF) families. Seven human EGF-SLC4A-ANK-FGF clusters were found. Conclusion: The novel sodium bicarbonate cotransporter-like gene SLC4A9 demonstrates abundant alternative mRNA processing. It belongs to a growing class of functionally diverse genes characterized by inefficient highly variable splicing. The evolutionary history of the EGF-SLC4A-ANK-FGF gene clusters involves multiple rounds of duplication, apparently followed by large insertions and deletions at paralogous loci and genome-wide gene shuffling. PMID:11305939

SFM: A novel sequence-based fusion method for disease genes identification and prioritization.

PubMed

Yousef, Abdulaziz; Moghadam Charkari, Nasrollah

2015-10-21

The identification of disease genes from human genome is of great importance to improve diagnosis and treatment of disease. Several machine learning methods have been introduced to identify disease genes. However, these methods mostly differ in the prior knowledge used to construct the feature vector for each instance (gene), the ways of selecting negative data (non-disease genes) where there is no investigational approach to find them and the classification methods used to make the final decision. In this work, a novel Sequence-based fusion method (SFM) is proposed to identify disease genes. In this regard, unlike existing methods, instead of using a noisy and incomplete prior-knowledge, the amino acid sequence of the proteins which is universal data has been carried out to present the genes (proteins) into four different feature vectors. To select more likely negative data from candidate genes, the intersection set of four negative sets which are generated using distance approach is considered. Then, Decision Tree (C4.5) has been applied as a fusion method to combine the results of four independent state-of the-art predictors based on support vector machine (SVM) algorithm, and to make the final decision. The experimental results of the proposed method have been evaluated by some standard measures. The results indicate the precision, recall and F-measure of 82.6%, 85.6% and 84, respectively. These results confirm the efficiency and validity of the proposed method. Copyright © 2015 Elsevier Ltd. All rights reserved.
Mumps virus F gene and HN gene sequencing as a molecular tool to study mumps virus transmission.

PubMed

Gouma, Sigrid; Cremer, Jeroen; Parkkali, Saara; Veldhuijzen, Irene; van Binnendijk, Rob S; Koopmans, Marion P G

2016-11-01

Various mumps outbreaks have occurred in the Netherlands since 2004, particularly among persons who had received 2 doses of measles, mumps, and rubella (MMR) vaccination. Genomic typing of pathogens can be used to track outbreaks, but the established genotyping of mumps virus based on the small hydrophobic (SH) gene sequences did not provide sufficient resolution. Therefore, we expanded the sequencing to include fusion (F) gene and haemagglutinin-neuraminidase (HN) gene sequences in addition to the SH gene sequences from 109 mumps virus genotype G strains obtained between 2004 and mid 2015 in the Netherlands. When the molecular information from these 3 genes was combined, we were able to identify separate mumps virus clusters and track mumps virus transmission. The analyses suggested that multiple mumps virus introductions occurred in the Netherlands between 2004 and 2015 resulting in several mumps outbreaks throughout this period, whereas during some local outbreaks the molecular data pointed towards endemic circulation. Combined analysis of epidemiological data and sequence data collected in 2015 showed good support for the phylogenetic clustering. Copyright Â© 2016 Elsevier B.V. All rights reserved.
Uptake, Results, and Outcomes of Germline Multiple-Gene Sequencing After Diagnosis of Breast Cancer.

PubMed

Kurian, Allison W; Ward, Kevin C; Hamilton, Ann S; Deapen, Dennis M; Abrahamse, Paul; Bondarenko, Irina; Li, Yun; Hawley, Sarah T; Morrow, Monica; Jagsi, Reshma; Katz, Steven J

2018-05-10

Low-cost sequencing of multiple genes is increasingly available for cancer risk assessment. Little is known about uptake or outcomes of multiple-gene sequencing after breast cancer diagnosis in community practice. To examine the effect of multiple-gene sequencing on the experience and treatment outcomes for patients with breast cancer. For this population-based retrospective cohort study, patients with breast cancer diagnosed from January 2013 to December 2015 and accrued from SEER registries across Georgia and in Los Angeles, California, were surveyed (n = 5080, response rate = 70%). Responses were merged with SEER data and results of clinical genetic tests, either BRCA1 and BRCA2 (BRCA1/2) sequencing only or including additional other genes (multiple-gene sequencing), provided by 4 laboratories. Type of testing (multiple-gene sequencing vs BRCA1/2-only sequencing), test results (negative, variant of unknown significance, or pathogenic variant), patient experiences with testing (timing of testing, who discussed results), and treatment (strength of patient consideration of, and surgeon recommendation for, prophylactic mastectomy), and prophylactic mastectomy receipt. We defined a patient subgroup with higher pretest risk of carrying a pathogenic variant according to practice guidelines. Among 5026 patients (mean [SD] age, 59.9 [10.7]), 1316 (26.2%) were linked to genetic results from any laboratory. Multiple-gene sequencing increasingly replaced BRCA1/2-only testing over time: in 2013, the rate of multiple-gene sequencing was 25.6% and BRCA1/2-only testing, 74.4%;in 2015 the rate of multiple-gene sequencing was 66.5% and BRCA1/2-only testing, 33.5%. Multiple-gene sequencing was more often ordered by genetic counselors (multiple-gene sequencing, 25.5% and BRCA1/2-only testing, 15.3%) and delayed until after surgery (multiple-gene sequencing, 32.5% and BRCA1/2-only testing, 19.9%). Multiple-gene sequencing substantially increased rate of detection of any
Occurrence and expression of gene transfer agent genes in marine bacterioplankton.

PubMed

Biers, Erin J; Wang, Kui; Pennington, Catherine; Belas, Robert; Chen, Feng; Moran, Mary Ann

2008-05-01

Genes with homology to the transduction-like gene transfer agent (GTA) were observed in genome sequences of three cultured members of the marine Roseobacter clade. A broader search for homologs for this host-controlled virus-like gene transfer system identified likely GTA systems in cultured Alphaproteobacteria, and particularly in marine bacterioplankton representatives. Expression of GTA genes and extracellular release of GTA particles ( approximately 50 to 70 nm) was demonstrated experimentally for the Roseobacter clade member Silicibacter pomeroyi DSS-3, and intraspecific gene transfer was documented. GTA homologs are surprisingly infrequent in marine metagenomic sequence data, however, and the role of this lateral gene transfer mechanism in ocean bacterioplankton communities remains unclear.
Characterization of a rabbit germ-line VH gene that is a candidate donor for VH gene conversion in mutant Alicia rabbits.

PubMed

Chen, H T; Alexander, C B; Mage, R G

1995-06-15

Normal rabbits preferentially rearrange the 3'-most VH gene, VH1, to encode Igs with VHa allotypes, which constitute the majority of rabbit serum Igs. A gene conversion-like mechanism is employed to diversify the primary Ab repertoire. In mutant Alicia rabbits that derived from a rabbit with VHa2 allotype, the VH1 gene was deleted. Our previous studies showed that the first functional gene (VH4) or VH4-like genes were rearranged in 2- to 8-wk-old homozygous Alicia. The VH1a2-like sequences that were found in splenic mRNA from 6-wk and older Alicia rabbits still had some residues that were typical of VH4. The appearances of sequences resembling that of VH1a2 may have been caused by gene conversions that altered the sequences of the rearranged VH or there may have been rearrangement of upstream VH1a2-like genes later in development. To investigate this further, we constructed a cosmid library and isolated a VH1a2-like gene, VH12-1-6, with a sequence almost identical to VH1a2. This gene had a deleted base in the heptamer of its recombination signal sequence. However, even if this defect diminished or eliminated its ability to rearrange, the a2-like gene could have acted as a donor for gene-conversion-like alteration of rearranged VH genes. Sequence comparisons suggested that this gene or a gene like it could have acted as a donor for gene conversion in mutant Alicia and in normal rabbits.
VH mutant rabbits lacking the VH1a2 gene develop a2+ B cells in the appendix by gene conversion-like alteration of a rearranged VH4 gene.

PubMed

Sehgal, D; Mage, R G; Schiaffella, E

1998-02-01

We investigated the molecular basis for the appearance of V(H)a2 allotype-bearing B cells in mutant Alicia rabbits. The mutation arose in an a2 rabbit; mutants exhibit altered expression of V(H) genes because of a small deletion encompassing V(H)1a2, the 3'-most gene in the V(H) locus. The V(H)1 gene is the major source of V(H)a allotype because this gene is preferentially rearranged in normal rabbits. In young homozygous ali/ali animals, the levels of a2 molecules found in the serum increase with age. In adult ali/ali rabbits, 20 to 50% of serum Igs and B cells bear a2 allotypic determinants. Previous studies suggested that positive selection results in expansion of a2 allotype-bearing B cells in the appendix of young mutant ali/ali rabbits. We separated appendix cells from a 6-wk-old Alicia rabbit by FACS based on the expression of surface IgM and a2 allotype. The VDJ portion of the expressed Ig mRNA was amplified from the IgM+ a2+ and IgM+ a2- populations by reverse transcriptase-PCR. The cDNAs from both populations were cloned and sequenced. Analysis of these sequences suggested that, in a2+ B cells, the first D proximal functional gene in Alicia rabbits, V(H)4a2, rearranged and was altered further by a gene conversion-like mechanism. Upstream V(H) genes were identified as potential gene sequence donors; V(H)9 was found to be the most frequently used gene donor. Among the a2- B cells, y33 was the most frequently rearranged gene.
Activating human genes with zinc finger proteins, transcription activator-like effectors and CRISPR/Cas9 for gene therapy and regenerative medicine.

PubMed

Gersbach, Charles A; Perez-Pinera, Pablo

2014-08-01

New technologies have recently been developed to control the expression of human genes in their native genomic context by engineering synthetic transcription factors that can be targeted to any DNA sequence. The ability to precisely regulate any gene as it occurs naturally in the genome provides a means to address a variety of diseases and disorders. This approach also circumvents some of the traditional challenges of gene therapy. In this editorial, we review the technologies that have enabled targeted human gene activation, including the engineering of transcription factors based on zinc finger proteins, transcription activator-like effectors and the CRISPR/Cas9 system. Additionally, we highlight examples in which these methods have been developed for therapeutic applications and discuss challenges and opportunities.
Rebamipide increases the mucin-like glycoprotein production in corneal epithelial cells.

PubMed

Takeji, Yasuhiro; Urashima, Hiroki; Aoki, Akihiro; Shinohara, Hisashi

2012-06-01

Dry eye is a multifactorial disease of tears and the ocular surface due to tear deficiency or excessive tear evaporation. Tear film instability is due to a disturbance in ocular surface mucin leading to a dysfunction of mucin, resulting in dry eye. In this study, we examined the effect of rebamipide, an anti-ulcer agent, on glycoconjugate production, as an indicator of mucin-like glycoprotein in cultured corneal epithelial cells. Further, we investigated the effect of rebamipide on the gene expression of membrane-associated mucins. Confluent cultured human corneal epithelial cells were incubated with rebamipide for 24 h. The glycoconjugate content in the supernatant and the cell extracts was measured by wheat germ agglutinin-enzyme-linked lectin assay combined gel-filtration method. In the experiment on mucin gene expression, cultured human corneal epithelial cells were collected at 0, 3, 6, and 12 h after administration of rebamipide. Real-time quantitative polymerase chain reaction was used to analyze the quantity of MUC1, MUC 4, and MUC16 gene expression. Rebamipide significantly increased the glycoconjugate contents in the supernatant and cell extract. In the mucin gene expression in the cells, rebamipide increased MUC1 and MUC4 gene expression, but did not increase MUC16 gene expression. Rebamipide promoted glycoconjugate, which has a property as a mucin-like glycoprotein, in human corneal epithelial cells. The increased production was mediated by MUC1 and MUC4 gene expression.
Computational sequence analysis of predicted long dsRNA transcriptomes of major crops reveals sequence complementarity with human genes.

PubMed

Jensen, Peter D; Zhang, Yuanji; Wiggins, B Elizabeth; Petrick, Jay S; Zhu, Jin; Kerstetter, Randall A; Heck, Gregory R; Ivashuta, Sergey I

2013-01-01

Long double-stranded RNAs (long dsRNAs) are precursors for the effector molecules of sequence-specific RNA-based gene silencing in eukaryotes. Plant cells can contain numerous endogenous long dsRNAs. This study demonstrates that such endogenous long dsRNAs in plants have sequence complementarity to human genes. Many of these complementary long dsRNAs have perfect sequence complementarity of at least 21 nucleotides to human genes; enough complementarity to potentially trigger gene silencing in targeted human cells if delivered in functional form. However, the number and diversity of long dsRNA molecules in plant tissue from crops such as lettuce, tomato, corn, soy and rice with complementarity to human genes that have a long history of safe consumption supports a conclusion that long dsRNAs do not present a significant dietary risk.
Typing of artiodactyl MHC-DRB genes with the help of intronic simple repeated DNA sequences.

PubMed

Schwaiger, F W; Buitkamp, J; Weyers, E; Epplen, J T

1993-02-01

An efficient oligonucleotide typing method for the highly polymorphic MHC-DRB genes is described for artiodactyls like cattle, sheep and goat. By means of the polymerase chain reaction, the second exon of MHC-DRB is amplified as well as part of the adjacent intron containing a mixed simple repeat sequence. Using this primer combination we were able to amplify the MHC-DRB exons 2 and adjacent introns from all of the investigated 10 species of the family of Bovidae and giraffes. Therefore, the DRB genes of novel artiodactyl species can also be readily studied. Oligonucleotide probes specific for the polymorphisms of ungulate DRB genes are used with which sequences differing in at least one single base can be distinguished. Exonic polymorphism was found to be correlated with the allele lengths and the patterns of the repeat structures. Hence oligonucleotide probes specific for different simple repeats and polymorphic positions serve also for typing across species barriers. The strict correlation of sequence length and exonic polymorphism permits a preselection of specific oligonucleotides for hybridization. Thus more than 20 alleles can already be differentiated from each of the three species.
Transgene vaccination using Ulex europaeus agglutinin I (UEA-1) for targeted mucosal immunization against HIV-1 envelope.

PubMed

Wang, Xinhai; Kochetkova, Irina; Haddad, Asmahan; Hoyt, Teri; Hone, David M; Pascual, David W

2005-05-31

Receptor-mediated gene transfer using an M cell ligand has been shown to be an efficient method for mucosal DNA immunization. To investigate further into alternative M cell ligands, the plant lectin, Ulex europaeus agglutinin I (UEA-1), was tested. UEA-1 binds to human intestinal Caco-2 cells, and these cells can be transfected with poly-l-lysine (PL)-conjugated UEA-1 for expression of reporter cDNAs. When tested in vivo, mice nasally immunized with UEA-1-PL complexed to plasmid encoding HIV-1 envelope showed elevated systemic and mucosal antibody responses, and these were supported by tissue antibody-forming cells. Likewise, elevated envelope-specific CTLs were induced. Thus, UEA-1 mediated DNA delivery represents an alternative mucosal formulation for inducing humoral and cellular immunity against HIV-1.
Comparative genome sequencing of Drosophila pseudoobscura: Chromosomal, gene, and cis-element evolution

PubMed Central

Richards, Stephen; Liu, Yue; Bettencourt, Brian R.; Hradecky, Pavel; Letovsky, Stan; Nielsen, Rasmus; Thornton, Kevin; Hubisz, Melissa J.; Chen, Rui; Meisel, Richard P.; Couronne, Olivier; Hua, Sujun; Smith, Mark A.; Zhang, Peili; Liu, Jing; Bussemaker, Harmen J.; van Batenburg, Marinus F.; Howells, Sally L.; Scherer, Steven E.; Sodergren, Erica; Matthews, Beverly B.; Crosby, Madeline A.; Schroeder, Andrew J.; Ortiz-Barrientos, Daniel; Rives, Catharine M.; Metzker, Michael L.; Muzny, Donna M.; Scott, Graham; Steffen, David; Wheeler, David A.; Worley, Kim C.; Havlak, Paul; Durbin, K. James; Egan, Amy; Gill, Rachel; Hume, Jennifer; Morgan, Margaret B.; Miner, George; Hamilton, Cerissa; Huang, Yanmei; Waldron, Lenée; Verduzco, Daniel; Clerc-Blankenburg, Kerstin P.; Dubchak, Inna; Noor, Mohamed A.F.; Anderson, Wyatt; White, Kevin P.; Clark, Andrew G.; Schaeffer, Stephen W.; Gelbart, William; Weinstock, George M.; Gibbs, Richard A.

2005-01-01

We have sequenced the genome of a second Drosophila species, Drosophila pseudoobscura, and compared this to the genome sequence of Drosophila melanogaster, a primary model organism. Throughout evolution the vast majority of Drosophila genes have remained on the same chromosome arm, but within each arm gene order has been extensively reshuffled, leading to a minimum of 921 syntenic blocks shared between the species. A repetitive sequence is found in the D. pseudoobscura genome at many junctions between adjacent syntenic blocks. Analysis of this novel repetitive element family suggests that recombination between offset elements may have given rise to many paracentric inversions, thereby contributing to the shuffling of gene order in the D. pseudoobscura lineage. Based on sequence similarity and synteny, 10,516 putative orthologs have been identified as a core gene set conserved over 25–55 million years (Myr) since the pseudoobscura/melanogaster divergence. Genes expressed in the testes had higher amino acid sequence divergence than the genome-wide average, consistent with the rapid evolution of sex-specific proteins. Cis-regulatory sequences are more conserved than random and nearby sequences between the species—but the difference is slight, suggesting that the evolution of cis-regulatory elements is flexible. Overall, a pattern of repeat-mediated chromosomal rearrangement, and high coadaptation of both male genes and cis-regulatory sequences emerges as important themes of genome divergence between these species of Drosophila. PMID:15632085
Genomic Sequence around Butterfly Wing Development Genes: Annotation and Comparative Analysis

PubMed Central

Conceição, Inês C.; Long, Anthony D.; Gruber, Jonathan D.; Beldade, Patrícia

2011-01-01

Background Analysis of genomic sequence allows characterization of genome content and organization, and access beyond gene-coding regions for identification of functional elements. BAC libraries, where relatively large genomic regions are made readily available, are especially useful for species without a fully sequenced genome and can increase genomic coverage of phylogenetic and biological diversity. For example, no butterfly genome is yet available despite the unique genetic and biological properties of this group, such as diversified wing color patterns. The evolution and development of these patterns is being studied in a few target species, including Bicyclus anynana, where a whole-genome BAC library allows targeted access to large genomic regions. Methodology/Principal Findings We characterize ∼1.3 Mb of genomic sequence around 11 selected genes expressed in B. anynana developing wings. Extensive manual curation of in silico predictions, also making use of a large dataset of expressed genes for this species, identified repetitive elements and protein coding sequence, and highlighted an expansion of Alcohol dehydrogenase genes. Comparative analysis with orthologous regions of the lepidopteran reference genome allowed assessment of conservation of fine-scale synteny (with detection of new inversions and translocations) and of DNA sequence (with detection of high levels of conservation of non-coding regions around some, but not all, developmental genes). Conclusions The general properties and organization of the available B. anynana genomic sequence are similar to the lepidopteran reference, despite the more than 140 MY divergence. Our results lay the groundwork for further studies of new interesting findings in relation to both coding and non-coding sequence: 1) the Alcohol dehydrogenase expansion with higher similarity between the five tandemly-repeated B. anynana paralogs than with the corresponding B. mori orthologs, and 2) the high conservation of non
GeneMachine: gene prediction and sequence annotation.

PubMed

Makalowska, I; Ryan, J F; Baxevanis, A D

2001-09-01

A number of free-standing programs have been developed in order to help researchers find potential coding regions and deduce gene structure for long stretches of what is essentially 'anonymous DNA'. As these programs apply inherently different criteria to the question of what is and is not a coding region, multiple algorithms should be used in the course of positional cloning and positional candidate projects to assure that all potential coding regions within a previously-identified critical region are identified. We have developed a gene identification tool called GeneMachine which allows users to query multiple exon and gene prediction programs in an automated fashion. BLAST searches are also performed in order to see whether a previously-characterized coding region corresponds to a region in the query sequence. A suite of Perl programs and modules are used to run MZEF, GENSCAN, GRAIL 2, FGENES, RepeatMasker, Sputnik, and BLAST. The results of these runs are then parsed and written into ASN.1 format. Output files can be opened using NCBI Sequin, in essence using Sequin as both a workbench and as a graphical viewer. The main feature of GeneMachine is that the process is fully automated; the user is only required to launch GeneMachine and then open the resulting file with Sequin. Annotations can then be made to these results prior to submission to GenBank, thereby increasing the intrinsic value of these data. GeneMachine is freely-available for download at http://genome.nhgri.nih.gov/genemachine. A public Web interface to the GeneMachine server for academic and not-for-profit users is available at http://genemachine.nhgri.nih.gov. The Web supplement to this paper may be found at http://genome.nhgri.nih.gov/genemachine/supplement/.
Accelerated Evolution of PAK3- and PIM1-like Kinase Gene Families in the Zebra Finch, Taeniopygia guttata

PubMed Central

Kong, Lesheng; Lovell, Peter V.; Heger, Andreas; Mello, Claudio V.; Ponting, Chris P.

2010-01-01

Genes encoding protein kinases tend to evolve slowly over evolutionary time, and only rarely do they appear as recent duplications in sequenced vertebrate genomes. Consequently, it was a surprise to find two families of kinase genes that have greatly and recently expanded in the zebra finch (Taeniopygia guttata) lineage. In contrast to other amniotic genomes (including chicken) that harbor only single copies of p21-activated serine/threonine kinase 3 (PAK3) and proviral integration site 1 (PIM1) genes, the zebra finch genome appeared at first to additionally contain 67 PAK3-like (PAK3L) and 51 PIM1-like (PIM1L) protein kinase genes. An exhaustive analysis of these gene models, however, revealed most to be incomplete, owing to the absence of terminal exons. After reprediction, 31 PAK3L genes and 10 PIM1L genes remain, and all but three are predicted, from the retention of functional sites and open reading frames, to be enzymatically active. PAK3L, but not PIM1L, gene sequences show evidence of recurrent episodes of positive selection, concentrated within structures spatially adjacent to N- and C-terminal protein regions that have been discarded from zebra finch PAK3L genes. At least seven zebra finch PAK3L genes were observed to be expressed in testis, whereas two sequences were found transcribed in the brain, one broadly including the song nuclei and the other in the ventricular zone and in cells resembling Bergmann's glia in the cerebellar Purkinje cell layer. Two PIM1L sequences were also observed to be expressed with broad distributions in the zebra finch brain, one in both the ventricular zone and the cerebellum and apparently associated with glial cells and the other showing neuronal cell expression and marked enrichment in midbrain/thalamic nuclei. These expression patterns do not correlate with zebra finch-specific features such as vocal learning. Nevertheless, our results show how ancient and conserved intracellular signaling molecules can be co
Antimicrobial peptide-like genes in Nasonia vitripennis: a genomic perspective

PubMed Central

2010-01-01

Background Antimicrobial peptides (AMPs) are an essential component of innate immunity which can rapidly respond to diverse microbial pathogens. Insects, as a rich source of AMPs, attract great attention of scientists in both understanding of the basic biology of the immune system and searching molecular templates for anti-infective drug design. Despite a large number of AMPs have been identified from different insect species, little information in terms of these peptides is available from parasitic insects. Results By using integrated computational approaches to systemically mining the Hymenopteran parasitic wasp Nasonia vitripennis genome, we establish the first AMP repertoire whose members exhibit extensive sequence and structural diversity and can be distinguished into multiple molecular types, including insect and fungal defensin-like peptides (DLPs) with the cysteine-stabilized α-helical and β-sheet (CSαβ) fold; Pro- or Gly-rich abaecins and hymenoptaecins; horseshoe crab tachystatin-type AMPs with the inhibitor cystine knot (ICK) fold; and a linear α-helical peptide. Inducible expression pattern of seven N. vitripennis AMP genes were verified, and two representative peptides were synthesized and functionally identified to be antibacterial. In comparison with Apis mellifera (Hymenoptera) and several non-Hymenopteran model insects, N. vitripennis has evolved a complex antimicrobial immune system with more genes and larger protein precursors. Three classical strategies that are likely responsible for the complexity increase have been recognized: 1) Gene duplication; 2) Exon duplication; and 3) Exon-shuffling. Conclusion The present study established the N. vitripennis peptidome associated with antimicrobial immunity by using a combined computational and experimental strategy. As the first AMP repertoire of a parasitic wasp, our results offer a basic platform for further studying the immunological and evolutionary significances of these newly discovered AMP-like
Sequencing of the Litchi Downy Blight Pathogen Reveals It Is a Phytophthora Species With Downy Mildew-Like Characteristics.

PubMed

Ye, Wenwu; Wang, Yang; Shen, Danyu; Li, Delong; Pu, Tianhuizi; Jiang, Zide; Zhang, Zhengguang; Zheng, Xiaobo; Tyler, Brett M; Wang, Yuanchao

2016-07-01

On the basis of its downy mildew-like morphology, the litchi downy blight pathogen was previously named Peronophythora litchii. Recently, however, it was proposed to transfer this pathogen to Phytophthora clade 4. To better characterize this unusual oomycete species and important fruit pathogen, we obtained the genome sequence of Phytophthora litchii and compared it to those from other oomycete species. P. litchii has a small genome with tightly spaced genes. On the basis of a multilocus phylogenetic analysis, the placement of P. litchii in the genus Phytophthora is strongly supported. Effector proteins predicted included 245 RxLR, 30 necrosis-and-ethylene-inducing protein-like, and 14 crinkler proteins. The typical motifs, phylogenies, and activities of these effectors were typical for a Phytophthora species. However, like the genome features of the analyzed downy mildews, P. litchii exhibited a streamlined genome with a relatively small number of genes in both core and species-specific protein families. The low GC content and slight codon preferences of P. litchii sequences were similar to those of the analyzed downy mildews and a subset of Phytophthora species. Taken together, these observations suggest that P. litchii is a Phytophthora pathogen that is in the process of acquiring downy mildew-like genomic and morphological features. Thus P. litchii may provide a novel model for investigating morphological development and genomic adaptation in oomycete pathogens.
Identification of a second flagellin gene and functional characterization of a sigma70-like promoter upstream of a Leptospira borgpetersenii flaB gene.

PubMed

Lin, Min; Dan, Hanhong; Li, Yijing

2004-02-01

Leptospira borgpetersenii, one of the causative agents of leptospirosis in both animals and humans, is a bacterial pathogen with characteristic motility that is mediated by the rotation of two periplasmic flagella (PF). The flaB gene coding for a core polypeptide subunit of PF was previously characterized by sequence analysis of its open reading frame (ORF) (M. Lin, J Biochem Mol Biol Biophys 2:181-187, 1999). The present study was undertaken to isolate and clone the uncharacterized sequence upstream of the flaB gene by using a PCR-based genome walking procedure. This has resulted in a 1470-bp genomic DNA sequence in which an 846-bp ORF coding for a 281-amino acid polypeptide (31.3 kDa) is identified 455 bp upstream from the flaB start codon. The encoded protein exhibits 72% amino acid identity to the deduced FlaB protein sequence of L. borgpetersenii and a high degree of sequence homology to the FlaB proteins of other spirochaetes. This has demonstrated for the first time that a second flaB gene homolog is present in a Leptospira species. The newly identified gene is designated flaB1, and the previously cloned flaB renamed flaB2. Within the intergenic sequence between flaB1 and flaB2, a potential stem-loop structure (12-bp inverted repeats) was identified 25 bp downstream of the flaB1 stop codon; this could serve as a transcription terminator for the flaB1 mRNA. Three E. coli-like promoter regions (I, II, and III) for binding Esigma(70), a regulatory sequence uncommonly found in flagellar genes, were predicted upstream of the flaB2 ORF. Only promoter region II contains a promoter that is functional in E. coli, as revealed at phenotypic and transcriptional levels by its capability of directing the expression of the chloramphenicol acetyltransferase (CAT) gene in the promoter probe vector pKK232-8. These observations may suggest that flaB1 and flaB2 are transcribed separately and do not form a transcriptional operon controlled by a single promoter.
PUTATIVE GENE PROMOTER SEQUENCES IN THE CHLORELLA VIRUSES

PubMed Central

Fitzgerald, Lisa A.; Boucher, Philip T.; Yanai-Balser, Giane; Suhre, Karsten; Graves, Michael V.; Van Etten, James L.

2008-01-01

Three short (7 to 9 nucleotides) highly conserved nucleotide sequences were identified in the putative promoter regions (150 bp upstream and 50 bp downstream of the ATG translation start site) of three members of the genus Chlorovirus, family Phycodnaviridae. Most of these sequences occurred in similar locations within the defined promoter regions. The sequence and location of the motifs were often conserved among homologous ORFs within the Chlorovirus family. One of these conserved sequences (AATGACA) is predominately associated with genes expressed early in virus replication. PMID:18768195
Resistance gene enrichment sequencing (RenSeq) enables reannotation of the NB-LRR gene family from sequenced plant genomes and rapid mapping of resistance loci in segregating populations

PubMed Central

Jupe, Florian; Witek, Kamil; Verweij, Walter; Śliwka, Jadwiga; Pritchard, Leighton; Etherington, Graham J; Maclean, Dan; Cock, Peter J; Leggett, Richard M; Bryan, Glenn J; Cardle, Linda; Hein, Ingo; Jones, Jonathan DG

2013-01-01

Summary RenSeq is a NB-LRR (nucleotide binding-site leucine-rich repeat) gene-targeted, Resistance gene enrichment and sequencing method that enables discovery and annotation of pathogen resistance gene family members in plant genome sequences. We successfully applied RenSeq to the sequenced potato Solanum tuberosum clone DM, and increased the number of identified NB-LRRs from 438 to 755. The majority of these identified R gene loci reside in poorly or previously unannotated regions of the genome. Sequence and positional details on the 12 chromosomes have been established for 704 NB-LRRs and can be accessed through a genome browser that we provide. We compared these NB-LRR genes and the corresponding oligonucleotide baits with the highest sequence similarity and demonstrated that ∼80% sequence identity is sufficient for enrichment. Analysis of the sequenced tomato S. lycopersicum ‘Heinz 1706’ extended the NB-LRR complement to 394 loci. We further describe a methodology that applies RenSeq to rapidly identify molecular markers that co-segregate with a pathogen resistance trait of interest. In two independent segregating populations involving the wild Solanum species S. berthaultii (Rpi-ber2) and S. ruiz-ceballosii (Rpi-rzc1), we were able to apply RenSeq successfully to identify markers that co-segregate with resistance towards the late blight pathogen Phytophthora infestans. These SNP identification workflows were designed as easy-to-adapt Galaxy pipelines. PMID:23937694

Stepwise evolution of corolla symmetry in CYCLOIDEA2-like and RADIALIS-like gene expression patterns in Lamiales.

PubMed

Zhong, Jinshun; Kellogg, Elizabeth A

2015-08-01

• CYCLOIDEA2 (CYC2)-like and RADIALIS (RAD)-like genes are needed for the normal development of corolla bilateral symmetry in Antirrhinum majus L. (snapdragon, Plantaginaceae, Lamiales). However, if and how changes in expression of CYC2-like and RAD-like genes correlate with the origin of corolla bilateral symmetry early in Lamiales remains largely unknown. The asymmetrical expression of CYC2-like and/or RAD-like genes during floral meristem development could be ancestral or derived in Plantaginaceae.• We used in situ RNA localization to examine the expression of CYC2-like and RAD-like genes in two early-diverging Lamiales.• CYC2-like and RAD-like genes are expressed broadly in the floral meristems in early-diverging Lamiales with radially symmetrical corollas, in contrast to their restricted expression in adaxial/lateral regions in core Lamiales. The expression pattern of CYC2-like genes has evolved in stepwise fashion, in that CYC2-like genes are likely expressed briefly in the floral meristem during flower development in sampled Oleaceae; prolonged expression of CYC2-like genes in petals originated in the common ancestor of Tetrachondraceae and core Lamiales, and asymmetrical expression in adaxial/lateral petals appeared later, in the common ancestor of the core Lamiales. Likewise, expression of RAD-like genes in petals appeared in early-diverging Lamiales or earlier; asymmetrical expression in adaxial/lateral petals then appeared in core Lamiales.• These data plus published reports of CYC2-like and RAD-like genes show that asymmetrical expression of these two genes is likely derived and correlates with the origins of corolla bilateral symmetry. © 2015 Botanical Society of America, Inc.
Identification of co-expression gene networks, regulatory genes and pathways for obesity based on adipose tissue RNA Sequencing in a porcine model.

PubMed

Kogelman, Lisette J A; Cirera, Susanna; Zhernakova, Daria V; Fredholm, Merete; Franke, Lude; Kadarmideen, Haja N

2014-09-30

Obesity is a complex metabolic condition in strong association with various diseases, like type 2 diabetes, resulting in major public health and economic implications. Obesity is the result of environmental and genetic factors and their interactions, including genome-wide genetic interactions. Identification of co-expressed and regulatory genes in RNA extracted from relevant tissues representing lean and obese individuals provides an entry point for the identification of genes and pathways of importance to the development of obesity. The pig, an omnivorous animal, is an excellent model for human obesity, offering the possibility to study in-depth organ-level transcriptomic regulations of obesity, unfeasible in humans. Our aim was to reveal adipose tissue co-expression networks, pathways and transcriptional regulations of obesity using RNA Sequencing based systems biology approaches in a porcine model. We selected 36 animals for RNA Sequencing from a previously created F2 pig population representing three extreme groups based on their predicted genetic risks for obesity. We applied Weighted Gene Co-expression Network Analysis (WGCNA) to detect clusters of highly co-expressed genes (modules). Additionally, regulator genes were detected using Lemon-Tree algorithms. WGCNA revealed five modules which were strongly correlated with at least one obesity-related phenotype (correlations ranging from -0.54 to 0.72, P < 0.001). Functional annotation identified pathways enlightening the association between obesity and other diseases, like osteoporosis (osteoclast differentiation, P = 1.4E-7), and immune-related complications (e.g. Natural killer cell mediated cytotoxity, P = 3.8E-5; B cell receptor signaling pathway, P = 7.2E-5). Lemon-Tree identified three potential regulator genes, using confident scores, for the WGCNA module which was associated with osteoclast differentiation: CCR1, MSR1 and SI1 (probability scores respectively 95.30, 62.28, and 34.58). Moreover, detection
X-linked lymphocyte regulated gene 5c-like (Xlr5c-like) Is a Novel Target of Progesterone Action in Granulosa Cells of Periovulatory Rat Ovaries

PubMed Central

Mishra, Birendra; Park, Ji Yeon; Wilson, Kalin; Jo, Misung

2015-01-01

Progesterone (P4), acting through its nuclear receptor (PGR), plays an essential role in ovulation by mediating the expression of genes involved in ovulation and/or luteal formation. To identify ovulatory specific PGR-regulated genes, a preliminary microarray analysis was performed using rat granulosa cells treated with hCG ± RU486 (PGR antagonist). The transcript most highly down-regulated by RU486 was an EST (Expressed Sequence Tag) sequence (gb: BI289578.1) that matches with predicted sequence for Xlr5c-like mRNA. Since nothing is known about Xlr5c-like, we first characterized the expression pattern of Xlr5c-like mRNA in the rat ovary. The level of mRNA for Xlr5c-like is transiently up-regulated in granulosa cells of periovulatory follicles after hCG stimulation in PMSG-primed rat ovaries. The transient induction of Xlr5c-like mRNA was mimicked by hCG treatment in cultured granulosa cells from preovulatory ovaries. We further demonstrated that the LH-activated PKA, MEK, PI3K, and p38 signaling is involved in the increase in Xlr5c-like mRNA. The increase in Xlr5c-like mRNA was abolished by RU486. The inhibitory effect of RU486 was reversed by MPA (synthetic progestin), but not by dexamethasone (synthetic glucocorticoid). Furthermore, mutation of SP1/SP3 and PGR response element sites in the promoter region of Xlr5c-like decreased Xlr5c-like reporter activity. RU486 also inhibited Xlr5c-like reporter activity. ChIP assay verified the binding of PGR and SP3 to the Xlr5c-like promoter in periovulatory granulosa cells. Functionally, siRNA-mediated Xlr5c-like knockdown in granulosa cell cultures resulted in reduced levels of mRNA for Snap25, Cxcr4, and Adamts1. Recombinant Xlr5c-like protein expressed using an adenoviral approach was localized predominantly to the nucleus and to a lesser extent to the cytoplasm of rat granulosa cells. In conclusion, this is the first report showing the spatiotemporally regulated expression of Xlr5c-like mRNA by hCG in rat
Species identification of mutans streptococci by groESL gene sequence.

PubMed

Hung, Wei-Chung; Tsai, Jui-Chang; Hsueh, Po-Ren; Chia, Jean-San; Teng, Lee-Jene

2005-09-01

The near full-length sequences of the groESL genes were determined and analysed among eight reference strains (serotypes a to h) representing five species of mutans group streptococci. The groES sequences from these reference strains revealed that there are two lengths (285 and 288 bp) in the five species. The intergenic spacer between groES and groEL appears to be a unique marker for species, with a variable size (ranging from 111 to 310 bp) and sequence. Phylogenetic analysis of groES and groEL separated the eight serotypes into two major clusters. Strains of serotypes b, c, e and f were highly related and had groES gene sequences of the same length, 288 bp, while strains of serotypes a, d, g and h were also closely related and their groES gene sequence lengths were 285 bp. The groESL sequences in clinical isolates of three serotypes of S. mutans were analysed for intraspecies polymorphism. The results showed that the groESL sequences could provide information for differentiation among species, but were unable to distinguish serotypes of the same species. Based on the determined sequences, a PCR assay was developed that could differentiate members of the mutans streptococci by amplicon size and provide an alternative way for distinguishing mutans streptococci from other viridans streptococci.
Occurrence and Expression of Gene Transfer Agent Genes in Marine Bacterioplankton▿

PubMed Central

Biers, Erin J.; Wang, Kui; Pennington, Catherine; Belas, Robert; Chen, Feng; Moran, Mary Ann

2008-01-01

Genes with homology to the transduction-like gene transfer agent (GTA) were observed in genome sequences of three cultured members of the marine Roseobacter clade. A broader search for homologs for this host-controlled virus-like gene transfer system identified likely GTA systems in cultured Alphaproteobacteria, and particularly in marine bacterioplankton representatives. Expression of GTA genes and extracellular release of GTA particles (∼50 to 70 nm) was demonstrated experimentally for the Roseobacter clade member Silicibacter pomeroyi DSS-3, and intraspecific gene transfer was documented. GTA homologs are surprisingly infrequent in marine metagenomic sequence data, however, and the role of this lateral gene transfer mechanism in ocean bacterioplankton communities remains unclear. PMID:18359833
Molecular evolution of the CPP-like gene family in plants: insights from comparative genomics of Arabidopsis and rice.

PubMed

Yang, Zefeng; Gu, Shiliang; Wang, Xuefeng; Li, Wenjuan; Tang, Zaixiang; Xu, Chenwu

2008-09-01

CPP-like genes are members of a small family which features the existence of two similar Cys-rich domains termed CXC domains in their protein products and are distributed widely in plants and animals but do not exist in yeast. The members of this family in plants play an important role in development of reproductive tissue and control of cell division. To gain insights into how CPP-like genes evolved in plants, we conducted a comparative phylogenetic and molecular evolutionary analysis of the CPP-like gene family in Arabidopsis and rice. The results of phylogeny revealed that both gene loss and species-specific expansion contributed to the evolution of this family in Arabidopsis and rice. Both intron gain and intron loss were observed through intron/exon structure analysis for duplicated genes. Our results also suggested that positive selection was a major force during the evolution of CPP-like genes in plants, and most amino acid residues under positive selection were disproportionately located in the region outside the CXC domains. Further analysis revealed that two CXC domains and sequences connecting them might have coevolved during the long evolutionary period.
Thermodynamic parameters of the interaction of Urtica dioica agglutinin with N-acetylglucosamine and its oligomers.

PubMed

Lee, R T; Gabius, H J; Lee, Y C

1998-07-01

The interaction between Urtica dioica agglutinin (UDA) and N-acetylglucosamine (GlcNAc) and its beta(1-4)-linked oligomers was studied by fluorescence titration and isothermal titration microcalorimetry. UDA possesses one significant binding site that can be measured calorimetrically. This site is composed of three subsites, each subsite accommodating one GlcNAc residue. The interaction is enthalpically driven, and the binding area of UDA is characterized by a deltaH of interaction for a given oligosaccharide considerably smaller than that of wheat germ agglutinin (WGA), despite the fact that they both belong to a family of proteins composed entirely of hevein domains. Relatively high deltaCp values of the UDA-carbohydrate interactions and more favorable entropy term compared to WGA suggest that binding of the carbohydrate ligands by UDA has a higher hydrophobic component than that of WGA.
Extensive sequence analysis of CFTR, SCNN1A, SCNN1B, SCNN1G and SERPINA1 suggests an oligogenic basis for cystic fibrosis-like phenotypes.

PubMed

Ramos, M D; Trujillano, D; Olivar, R; Sotillo, F; Ossowski, S; Manzanares, J; Costa, J; Gartner, S; Oliva, C; Quintana, E; Gonzalez, M I; Vazquez, C; Estivill, X; Casals, T

2014-07-01

The term cystic fibrosis (CF)-like disease is used to describe patients with a borderline sweat test and suggestive CF clinical features but without two CFTR(cystic fibrosis transmembrane conductance regulator) mutations. We have performed the extensive molecular analysis of four candidate genes (SCNN1A, SCNN1B, SCNN1G and SERPINA1) in a cohort of 10 uncharacterized patients with CF and CF-like disease. We have used whole-exome sequencing to characterize mutations in the CFTR gene and these four candidate genes. CFTR molecular analysis allowed a complete characterization of three of four CF patients. Candidate variants in SCNN1A, SCNN1B, SCNN1G and SERPINA1 in six patients with CF-like phenotypes were confirmed by Sanger sequencing and were further supported by in silico predictive analysis, pedigree studies, sweat test in other family members, and analysis in CF patients and healthy subjects. Our results suggest that CF-like disease probably results from complex genotypes in several genes in an oligogenic form, with rare variants interacting with environmental factors. © 2013 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Discovery and evolution of bunyavirids in arctic phantom midges and ancient bunyavirid-like sequences in insect genomes.

PubMed

Ballinger, Matthew J; Bruenn, Jeremy A; Hay, John; Czechowski, Donna; Taylor, Derek J

2014-08-01

Bunyaviridae is a large family of RNA viruses chiefly comprised of vertebrate and plant pathogens. We discovered novel bunyavirids that are approximately equally divergent from each of the five known genera. We characterized novel genome sequences for two bunyavirids, namely, Kigluaik phantom virus (KIGV), from tundra-native phantom midges (Chaoborus), and Nome phantom virus (NOMV), from tundra-invading phantom midges, and demonstrated that these bunyavirid-like sequences belong to an infectious virus by passaging KIGV in mosquito cell culture, although the infection does not seem to be well sustained beyond a few passages. Virus and host gene sequences from individuals collected on opposite ends of North America, a region spanning 4,000 km, support a long-term, vertically transmitted infection of KIGV in Chaoborus trivittatus. KIGV-like sequences ranging from single genes to full genomes are present in transcriptomes and genomes of insects belonging to six taxonomic orders, suggesting an ancient association of this clade with insect hosts. In Drosophila, endogenous virus genes have been coopted, forming an orthologous tandem gene family that has been maintained by selection during the radiation of the host genus. Our findings indicate that bunyavirid-host interactions in nonbloodsucking arthropods have been much more extensive than previously thought. Very little is known about the viral diversity in polar freshwater ponds, and perhaps less is known about the effects that climate-induced habitat changes in these regions will have on virus-host interactions in the coming years. Our results show that at the tundra-boreal boundary, a hidden viral landscape is being altered as infected boreal phantom midges colonize tundra ponds. Likewise, relatively little is known of the deeper evolutionary history of bunyavirids that has led to the stark lifestyle contrasts between some genera. The discovery of this novel bunyavirid group suggests that ancient and highly divergent
Reanalysis of RNA-Sequencing Data Reveals Several Additional Fusion Genes with Multiple Isoforms

PubMed Central

Kangaspeska, Sara; Hultsch, Susanne; Edgren, Henrik; Nicorici, Daniel; Murumägi, Astrid; Kallioniemi, Olli

2012-01-01

RNA-sequencing and tailored bioinformatic methodologies have paved the way for identification of expressed fusion genes from the chaotic genomes of solid tumors. We have recently successfully exploited RNA-sequencing for the discovery of 24 novel fusion genes in breast cancer. Here, we demonstrate the importance of continuous optimization of the bioinformatic methodology for this purpose, and report the discovery and experimental validation of 13 additional fusion genes from the same samples. Integration of copy number profiling with the RNA-sequencing results revealed that the majority of the gene fusions were promoter-donating events that occurred at copy number transition points or involved high-level DNA-amplifications. Sequencing of genomic fusion break points confirmed that DNA-level rearrangements underlie selected fusion transcripts. Furthermore, a significant portion (>60%) of the fusion genes were alternatively spliced. This illustrates the importance of reanalyzing sequencing data as gene definitions change and bioinformatic methods improve, and highlights the previously unforeseen isoform diversity among fusion transcripts. PMID:23119097
Reanalysis of RNA-sequencing data reveals several additional fusion genes with multiple isoforms.

PubMed

Kangaspeska, Sara; Hultsch, Susanne; Edgren, Henrik; Nicorici, Daniel; Murumägi, Astrid; Kallioniemi, Olli

2012-01-01

RNA-sequencing and tailored bioinformatic methodologies have paved the way for identification of expressed fusion genes from the chaotic genomes of solid tumors. We have recently successfully exploited RNA-sequencing for the discovery of 24 novel fusion genes in breast cancer. Here, we demonstrate the importance of continuous optimization of the bioinformatic methodology for this purpose, and report the discovery and experimental validation of 13 additional fusion genes from the same samples. Integration of copy number profiling with the RNA-sequencing results revealed that the majority of the gene fusions were promoter-donating events that occurred at copy number transition points or involved high-level DNA-amplifications. Sequencing of genomic fusion break points confirmed that DNA-level rearrangements underlie selected fusion transcripts. Furthermore, a significant portion (>60%) of the fusion genes were alternatively spliced. This illustrates the importance of reanalyzing sequencing data as gene definitions change and bioinformatic methods improve, and highlights the previously unforeseen isoform diversity among fusion transcripts.
Origin and diversification of leucine-rich repeat receptor-like protein kinase (LRR-RLK) genes in plants.

PubMed

Liu, Ping-Li; Du, Liang; Huang, Yuan; Gao, Shu-Min; Yu, Meng

2017-02-07

Leucine-rich repeat receptor-like protein kinases (LRR-RLKs) are the largest group of receptor-like kinases in plants and play crucial roles in development and stress responses. The evolutionary relationships among LRR-RLK genes have been investigated in flowering plants; however, no comprehensive studies have been performed for these genes in more ancestral groups. The subfamily classification of LRR-RLK genes in plants, the evolutionary history and driving force for the evolution of each LRR-RLK subfamily remain to be understood. We identified 119 LRR-RLK genes in the Physcomitrella patens moss genome, 67 LRR-RLK genes in the Selaginella moellendorffii lycophyte genome, and no LRR-RLK genes in five green algae genomes. Furthermore, these LRR-RLK sequences, along with previously reported LRR-RLK sequences from Arabidopsis thaliana and Oryza sativa, were subjected to evolutionary analyses. Phylogenetic analyses revealed that plant LRR-RLKs belong to 19 subfamilies, eighteen of which were established in early land plants, and one of which evolved in flowering plants. More importantly, we found that the basic structures of LRR-RLK genes for most subfamilies are established in early land plants and conserved within subfamilies and across different plant lineages, but divergent among subfamilies. In addition, most members of the same subfamily had common protein motif compositions, whereas members of different subfamilies showed variations in protein motif compositions. The unique gene structure and protein motif compositions of each subfamily differentiate the subfamily classifications and, more importantly, provide evidence for functional divergence among LRR-RLK subfamilies. Maximum likelihood analyses showed that some sites within four subfamilies were under positive selection. Much of the diversity of plant LRR-RLK genes was established in early land plants. Positive selection contributed to the evolution of a few LRR-RLK subfamilies.
Exome-wide Sequencing Shows Low Mutation Rates and Identifies Novel Mutated Genes in Seminomas.

PubMed

Cutcutache, Ioana; Suzuki, Yuka; Tan, Iain Beehuat; Ramgopal, Subhashini; Zhang, Shenli; Ramnarayanan, Kalpana; Gan, Anna; Lee, Heng Hong; Tay, Su Ting; Ooi, Aikseng; Ong, Choon Kiat; Bolthouse, Jonathan T; Lane, Brian R; Anema, John G; Kahnoski, Richard J; Tan, Patrick; Teh, Bin Tean; Rozen, Steven G

2015-07-01

Testicular germ cell tumors are the most common cancer diagnosed in young men, and seminomas are the most common type of these cancers. There have been no exome-wide examinations of genes mutated in seminomas or of overall rates of nonsilent somatic mutations in these tumors. The objective was to analyze somatic mutations in seminomas to determine which genes are affected and to determine rates of nonsilent mutations. Eight seminomas and matched normal samples were surgically obtained from eight patients. DNA was extracted from tissue samples and exome sequenced on massively parallel Illumina DNA sequencers. Single-nucleotide polymorphism chip-based copy number analysis was also performed to assess copy number alterations. The DNA sequencing read data were analyzed to detect somatic mutations including single-nucleotide substitutions and short insertions and deletions. The detected mutations were validated by independent sequencing and further checked for subclonality. The rate of nonsynonymous somatic mutations averaged 0.31 mutations/Mb. We detected nonsilent somatic mutations in 96 genes that were not previously known to be mutated in seminomas, of which some may be driver mutations. Many of the mutations appear to have been present in subclonal populations. In addition, two genes, KIT and KRAS, were affected in two tumors each with mutations that were previously observed in other cancers and are presumably oncogenic. Our study, the first report on exome sequencing of seminomas, detected somatic mutations in 96 new genes, several of which may be targetable drivers. Furthermore, our results show that seminoma mutation rates are five times higher than previously thought, but are nevertheless low compared to other common cancers. Similar low rates are seen in other cancers that also have excellent rates of remission achieved with chemotherapy. We examined the DNA sequences of seminomas, the most common type of testicular germ cell cancer. Our study identified 96
Cloning and sequencing of a cellobiohydrolase gene from Trichoderma harzianum FP108

Treesearch

Patrick Guilfoile; Ron Burns; Zu-Yi Gu; Matt Amundson; Fu-Hsian Chang

1999-01-01

A cbbl cellobiohydrolase gene was cloned and sequenced from the fungus Trichoderrna harzianum FP108. The cloning was performed by PCR amplification of T. harzianum genomic DNA, using PCR primers whose sequence was based on the cbbl gene from Tricboderma reesei. The 3' end of the gene was isolated by inverse...
16S-23S rRNA gene internal transcribed spacer sequences for analysis of the phylogenetic relationships among species of the genus Porphyromonas.

PubMed

Conrads, Georg; Citron, Diane M; Tyrrell, Kerin L; Horz, Hans-Peter; Goldstein, Ellie J C

2005-03-01

The 16S-23S rRNA gene internal transcribed spacer (ITS) regions of 11 reference strains of Porphyromonas species, together with Bacteroides distasonis and Tannerella forsythensis, were analysed to examine interspecies relationships. Compared with the phylogenetic tree generated using 16S rRNA gene sequences, the resolution of the ITS sequence-based tree was higher, but species positioning and clustering were similar with both approaches. The recent separation of Porphyromonas gulae and Porphyromonas gingivalis into distinct species was confirmed by the ITS data. In addition, analysis of the ITS sequences of 24 clinical isolates of Porphyromonas asaccharolytica plus the type strain ATCC 25260(T) divided the sequences into two clusters, of which one was alpha-fucosidase-positive (like the type strain) while the other was alpha-fucosidase-negative. The latter resembled the previously studied unusual extra-oral isolates of 'Porphyromonas endodontalis-like organisms' (PELOs) which could therefore be called 'Porphyromonas asaccharolytica-like organisms' (PALOs), based on the genetic identification. Moreover, the proposal of alpha-fucosidase-negative P. asaccharolytica strains as a new species should also be considered.
Automated conserved non-coding sequence (CNS) discovery reveals differences in gene content and promoter evolution among grasses

PubMed Central

Turco, Gina; Schnable, James C.; Pedersen, Brent; Freeling, Michael

2013-01-01

Conserved non-coding sequences (CNS) are islands of non-coding sequence that, like protein coding exons, show less divergence in sequence between related species than functionless DNA. Several CNSs have been demonstrated experimentally to function as cis-regulatory regions. However, the specific functions of most CNSs remain unknown. Previous searches for CNS in plants have either anchored on exons and only identified nearby sequences or required years of painstaking manual annotation. Here we present an open source tool that can accurately identify CNSs between any two related species with sequenced genomes, including both those immediately adjacent to exons and distal sequences separated by >12 kb of non-coding sequence. We have used this tool to characterize new motifs, associate CNSs with additional functions, and identify previously undetected genes encoding RNA and protein in the genomes of five grass species. We provide a list of 15,363 orthologous CNSs conserved across all grasses tested. We were also able to identify regulatory sequences present in the common ancestor of grasses that have been lost in one or more extant grass lineages. Lists of orthologous gene pairs and associated CNSs are provided for reference inbred lines of arabidopsis, Japonica rice, foxtail millet, sorghum, brachypodium, and maize. PMID:23874343
Mitochondrial gene sequences alone or combined with ITS region sequences provide firm molecular criteria for the classification of Lecanicillium species.

PubMed

Kouvelis, Vassili N; Sialakouma, Aphrodite; Typas, Milton A

2008-07-01

The recent revision of Verticillium sect. Prostrata led to the introduction of the genus Lecanicillium, which comprises the majority of the entomopathogenic strains. Sixty-five strains previously classified as Verticillium lecanii or Verticillium sp. from different geographical regions and hosts were examined and their phylogenetic relationships were determined using sequences from three mitochondrial (mt) genes [the small rRNA subunit (rns), the NADH dehydrogenase subunits 1 (nad1) and 3 (nad3)] and the ITS region. In general, single gene phylogenetic trees differentiated and placed the strains examined in well-supported (by BS analysis) groups of L. lecanii, L. longisporum, L. muscarium, and L. nodulosum, although in some cases a few uncertainties still remained. nad1 was the most informative single gene in phylogenetic analyses and was also found to contain group I introns with putative open reading frames (ORFs) encoding for GIY-YIG endonucleases. The combined use of mt gene sequences resolved taxonomic uncertainties arisen from ITS analysis and, alone or in combination with ITS sequences, helped in placing uncharacterised Verticillium lecanii and Verticillium sp. firmly into Lecanicillium species. Combined gene data from all the mt genes and all the mt genes and the ITS region together, were very similar. Furthermore, a relaxed correlation with host specificity -- at least for Homoptera -- was indicated for the rns and the combined mt gene sequences. Thus, the usefulness of mt gene sequences as a convenient molecular tool in phylogenetic studies of entomopathogenic fungi was demonstrated.
Cloning and sequence analysis of chitin synthase gene fragments of Demodex mites.

PubMed

Zhao, Ya-e; Wang, Zheng-hang; Xu, Yang; Xu, Ji-ru; Liu, Wen-yan; Wei, Meng; Wang, Chu-ying

2012-10-01

To our knowledge, few reports on Demodex studied at the molecular level are available at present. In this study our group, for the first time, cloned, sequenced and analyzed the chitin synthase (CHS) gene fragments of Demodex folliculorum, Demodex brevis, and Demodex canis (three isolates from each species) from Xi'an China, by designing specific primers based on the only partial sequence of the CHS gene of D. canis from Japan, retrieved from GenBank. Results show that amplification was successful only in three D. canis isolates and one D. brevis isolate out of the nine Demodex isolates. The obtained fragments were sequenced to be 339 bp for D. canis and 338 bp for D. brevis. The CHS gene sequence similarities between the three Xi'an D. canis isolates and one Japanese D. canis isolate ranged from 99.7% to 100.0%, and those between four D. canis isolates and one D. brevis isolate were 99.1%-99.4%. Phylogenetic trees based on maximum parsimony (MP) and maximum likelihood (ML) methods shared the same clusters, according with the traditional classification. Two open reading frames (ORFs) were identified in each CHS gene sequenced, and their corresponding amino acid sequences were located at the catalytic domain. The relatively conserved sequences could be deduced to be a CHS class A gene, which is associated with chitin synthesis in the integument of Demodex mites.
Prevalence of ColE1-like plasmids and kanamycin resistance genes in Salmonella enterica serovars.

PubMed

Chen, Chin-Yi; Lindsey, Rebecca L; Strobaugh, Terence P; Frye, Jonathan G; Meinersmann, Richard J

2010-10-01

Multi-antimicrobial-resistant Salmonella enterica strains frequently carry resistance genes on plasmids. Recent studies focus heavily on large conjugative plasmids, and the role that small plasmids play in resistance gene transfer is largely unknown. To expand our previous studies in assessing the prevalence of the isolates harboring ColE1-like plasmids carrying the aph gene responsible for kanamycin resistance (Kan(r)) phenotypes, 102 Kan(r) Salmonella isolates collected through the National Antimicrobial Resistance Monitoring System (NARMS) in 2005 were screened by PCR using ColE1 primer sets. Thirty isolates were found to be positive for ColE1-like replicon. Plasmids from 23 isolates were able to propagate in Escherichia coli and were subjected to further characterization. Restriction mapping revealed three major plasmid groups found in three or more isolates, with each group consisting of two to three subtypes. The aph genes from the Kan(r) Salmonella isolates were amplified by PCR, sequenced, and showed four different aph(3')-I genes. The distribution of the ColE1 plasmid groups in association with the aph gene, Salmonella serovar, and isolate source demonstrated a strong linkage of the plasmid with S. enterica serovar Typhimurium DT104. Due to their high copy number and mobility, the ColE1-like plasmids may play a critical role in transmission of antibiotic resistance genes among enteric pathogens, and these findings warrant a close monitoring of this plasmid incompatibility group.
Sequences 5' to translation start regulate expression of petunia rbcS genes.

PubMed Central

Dean, C; Favreau, M; Bedbrook, J; Dunsmuir, P

1989-01-01

The promoter sequences that contribute to quantitative differences in expression of the petunia genes (rbcS) encoding the small subunit of ribulose bisphosphate carboxylase have been characterized. The promoter regions of the two most abundantly expressed petunia rbcS genes, SSU301 and SSU611, show sequence similarity not present in other rbcS genes. We investigated the significance of these and other sequences by adding specific regions from the SSU301 promoter (the most strongly expressed gene) to equivalent regions in the SSU911 promoter (the least strongly expressed gene) and assaying the expression of the fusions in transgenic tobacco plants. In this way, we characterized an SSU301 promoter region (either from -285 to -178 or -291 to -204) which, when added to SSU911, in either orientation, increased SSU911 expression 25-fold. This increase was equivalent to that caused by addition of the entire SSU301 5'-flanking region. Replacement of SSU911 promoter sequences between -198 and the start codon with sequences from the equivalent region of SSU301 did not increase SSU911 expression significantly. The -291 to -204 SSU301 promoter fragment contributes significantly to quantitative differences in expression between the petunia rbcS genes. PMID:2535543

Sequences required for induction of neurotensin receptor gene expression during neuronal differentiation of N1E-115 neuroblastoma cells.

PubMed

Tavares, D; Tully, K; Dobner, P R

1999-10-15

The promoter region of the mouse high affinity neurotensin receptor (Ntr-1) gene was characterized, and sequences required for expression in neuroblastoma cell lines that express high affinity NT-binding sites were characterized. Me(2)SO-induced neuronal differentiation of N1E-115 neuroblastoma cells increased both the expression of the endogenous Ntr-1 gene and reporter genes driven by NTR-1 promoter sequences by 3-4-fold. Deletion analysis revealed that an 83-base pair promoter region containing the transcriptional start site is required for Me(2)SO activation. Detailed mutational analysis of this region revealed that a CACCC box and the central region of a large GC-rich palindrome are the crucial cis-regulatory elements required for Me(2)SO induction. The CACCC box is bound by at least one factor that is induced upon Me(2)SO treatment of N1E-115 cells. The Me(2)SO effect was found to be both selective and cell type-restricted. Basal expression in the neuroblastoma cell lines required a distinct set of sequences, including an Sp1-like sequence, and a sequence resembling an NGFI-A-binding site; however, a more distal 5' sequence was found to repress basal activity in N1E-115 cells. These results provide evidence that Ntr-1 gene regulation involves both positive and negative regulatory elements located in the 5'-flanking region and that Ntr-1 gene activation involves the coordinate activation or induction of several factors, including a CACCC box binding complex.
Phylogeny and expression profiling of CAD and CAD-like genes in hybrid Populus (P. deltoides × P. nigra): evidence from herbivore damage for subfunctionalization and functional divergence

PubMed Central

2010-01-01

Background Cinnamyl Alcohol Dehydrogenase (CAD) proteins function in lignin biosynthesis and play a critical role in wood development and plant defense against stresses. Previous phylogenetic studies did not include genes from seedless plants and did not reflect the deep evolutionary history of this gene family. We reanalyzed the phylogeny of CAD and CAD-like genes using a representative dataset including lycophyte and bryophyte sequences. Many CAD/CAD-like genes do not seem to be associated with wood development under normal growth conditions. To gain insight into the functional evolution of CAD/CAD-like genes, we analyzed their expression in Populus plant tissues in response to feeding damage by gypsy moth larvae (Lymantria dispar L.). Expression of CAD/CAD-like genes in Populus tissues (xylem, leaves, and barks) was analyzed in herbivore-treated and non-treated plants by real time quantitative RT-PCR. Results CAD family genes were distributed in three classes based on sequence conservation. All the three classes are represented by seedless as well as seed plants, including the class of bona fide lignin pathway genes. The expression of some CAD/CAD-like genes that are not associated with xylem development were induced following herbivore damage in leaves, while other genes were induced in only bark or xylem tissues. Five of the CAD/CAD-like genes, however, showed a shift in expression from one tissue to another between non-treated and herbivore-treated plants. Systemic expression of the CAD/CAD-like genes was generally suppressed. Conclusions Our results indicated a correlation between the evolution of the CAD gene family and lignin and that the three classes of genes may have evolved in the ancestor of land plants. Our results also suggest that the CAD/CAD-like genes have evolved a diversity of expression profiles and potentially different functions, but that they are nonetheless co-regulated under stress conditions. PMID:20509918
Intronic sequences are required for AINTEGUMENTA-LIKE6 expression in Arabidopsis flowers.

PubMed

Krizek, Beth A

2015-10-12

The AINTEGUMENTA-LIKE6/PLETHORA3 (AIL6/PLT3) gene of Arabidopsis thaliana is a key regulator of growth and patterning in both shoots and roots. AIL6 encodes an AINTEGUMENTA-LIKE/PLETHORA (AIL/PLT) transcription factor that is expressed in the root stem cell niche, the peripheral region of the shoot apical meristem and young lateral organ primordia. In flowers, AIL6 acts redundantly with AINTEGUMENTA (ANT) to regulate floral organ positioning, growth, identity and patterning. Experiments were undertaken to define the genomic regions required for AIL6 function and expression in flowers. Transgenic plants expressing a copy of the coding region of AIL6 in the context of 7.7 kb of 5' sequence and 919 bp of 3' sequence (AIL6:cAIL6-3') fail to fully complement AIL6 function when assayed in the ant-4 ail6-2 double mutant background. In contrast, a genomic copy of AIL6 with the same amount of 5' and 3' sequence (AIL6:gAIL6-3') can fully complement ant-4 ail6-2. In addition, a genomic copy of AIL6 with 590 bp of 5' sequence and 919 bp of 3' sequence (AIL6m:gAIL6-3') complements ant-4 ail6-2 and contains all regulatory elements needed to confer normal AIL6 expression in inflorescences. Efforts to map cis-regulatory elements reveal that the third intron of AIL6 contains enhancer elements that confer expression in young flowers but in a broader pattern than that of AIL6 mRNA in wild-type flowers. Some AIL6:gAIL6-3' and AIL6m:gAIL6-3' lines confer an over-rescue phenotype in the ant-4 ail6-2 background that is correlated with higher levels of AIL6 mRNA accumulation. The results presented here indicate that AIL6 intronic sequences serve as transcriptional enhancer elements. In addition, the results show that increased expression of AIL6 can partially compensate for loss of ANT function in flowers.
Development of mixed-type autoimmune hemolytic anemia and Evans' syndrome following chicken pox infection in a case of low-titer cold agglutinin disease.

PubMed

Tanaka, Yumi; Masuya, Masahiro; Katayama, Naoyuki; Miyata, Eri; Sugimoto, Yuka; Shibasaki, Tetsunori; Yamamura, Kentaro; Ohishi, Kohshi; Minami, Nobuyuki; Shiku, Hiroshi; Nobori, Tsutomu

2006-10-01

We describe a patient with low-titer cold agglutinin disease (CAD) who developed mixed-type autoimmune hemolytic anemia (AIHA) and idiopathic thrombocytopenia following chicken pox infection. At least 1 year before admission to hospital, the patient had mild hemolytic anemia associated with low-titer cold agglutinins. A severe hemolytic crisis and thrombocytopenia (Evans' syndrome) occurred several days after infection with chicken pox, and the patient was referred to our hospital. Serological findings revealed the presence of both cold agglutinins and warm-reactive autoantibodies against erythrocytes, and the diagnosis was mixed-type AIHA. Following steroid therapy, the hemoglobin (Hb) level and platelet count improved. The patient was closely followed over a 10-year period with recurrent documented hemolysis after viral or bacterial infections. Warm-reactive autoantibodies have not been detected in the last 2 years, and only the immunoglobulin M anti-I cold agglutinins with a low titer and wide thermal amplitude have remained unchanged. Therefore, the patient has received at least 10 mg prednisolone daily to maintain a Hb level of 10 g/dL. To the best of our knowledge, no adult case of low-titer CAD that has evolved into mixed-type AIHA and Evans' syndrome after chicken pox infection has been previously reported in the literature.
Compatibility of garlic (Allium sativum L.) leaf agglutinin and Cry1Ac δ-endotoxin for gene pyramiding.

PubMed

Upadhyay, Santosh Kumar; Singh, Seema; Chandrashekar, Krishnappa; Tuli, Rakesh; Singh, Pradhyumna Kumar

2012-03-01

δ-Endotoxins produced by Bacillus thuringiensis (Bt) have been used as bio-pesticides for the control of lepidopteran insect pests. Garlic (Allium sativum L.) leaf agglutinin (ASAL), being toxic to several sap-sucking pests and some lepidopteran pests, may be a good candidate for pyramiding with δ-endotoxins in transgenic plants for enhancing the range of resistance to insect pests. Since ASAL shares the midgut receptors with Cry1Ac in Helicoverpa armigera, there is possibility of antagonism in their toxicity. Our study demonstrated that ASAL increased the toxicity of Cry1Ac against H. armigera while Cry1Ac did not alter the toxicity of ASAL against cotton aphids. The two toxins interacted and increased binding of each other to brush border membrane vesicle (BBMV) proteins and to the two important receptors, alkaline phosphatase (ALP) and aminopeptidase N (APN). The results indicated that the toxins had different binding sites on the ALP and APN but influenced mutual binding. We conclude that ASAL can be safely employed with Cry1Ac for developing transgenic crops for wider insect resistance.
Selective Constraints on Coding Sequences of Nervous System Genes Are a Major Determinant of Duplicate Gene Retention in Vertebrates

PubMed Central

Roux, Julien; Liu, Jialin; Robinson-Rechavi, Marc

2017-01-01

Abstract The evolutionary history of vertebrates is marked by three ancient whole-genome duplications: two successive rounds in the ancestor of vertebrates, and a third one specific to teleost fishes. Biased loss of most duplicates enriched the genome for specific genes, such as slow evolving genes, but this selective retention process is not well understood. To understand what drives the long-term preservation of duplicate genes, we characterized duplicated genes in terms of their expression patterns. We used a new method of expression enrichment analysis, TopAnat, applied to in situ hybridization data from thousands of genes from zebrafish and mouse. We showed that the presence of expression in the nervous system is a good predictor of a higher rate of retention of duplicate genes after whole-genome duplication. Further analyses suggest that purifying selection against the toxic effects of misfolded or misinteracting proteins, which is particularly strong in nonrenewing neural tissues, likely constrains the evolution of coding sequences of nervous system genes, leading indirectly to the preservation of duplicate genes after whole-genome duplication. Whole-genome duplications thus greatly contributed to the expansion of the toolkit of genes available for the evolution of profound novelties of the nervous system at the base of the vertebrate radiation. PMID:28981708
Egg Case Silk Gene Sequences from Argiope Spiders: Evidence for Multiple Loci and a Loss of Function Between Paralogs

PubMed Central

Chaw, R. Crystal; Collin, Matthew; Wimmer, Marjorie; Helmrick, Kara-Leigh; Hayashi, Cheryl Y.

2017-01-01

Spiders swath their eggs with silk to protect developing embryos and hatchlings. Egg case silks, like other fibrous spider silks, are primarily composed of proteins called spidroins (spidroin = spider-fibroin). Silks, and thus spidroins, are important throughout the lives of spiders, yet the evolution of spidroin genes has been relatively understudied. Spidroin genes are notoriously difficult to sequence because they are typically very long (≥ 10 kb of coding sequence) and highly repetitive. Here, we investigate the evolution of spider silk genes through long-read sequencing of Bacterial Artificial Chromosome (BAC) clones. We demonstrate that the silver garden spider Argiope argentata has multiple egg case spidroin loci with a loss of function at one locus. We also use degenerate PCR primers to search the genomic DNA of congeneric species and find evidence for multiple egg case spidroin loci in other Argiope spiders. Comparative analyses show that these multiple loci are more similar at the nucleotide level within a species than between species. This pattern is consistent with concerted evolution homogenizing gene copies within a genome. More complicated explanations include convergent evolution or recent independent gene duplications within each species. PMID:29127108
Pooled Enrichment Sequencing Identifies Diversity and Evolutionary Pressures at NLR Resistance Genes within a Wild Tomato Population

PubMed Central

Stam, Remco; Scheikl, Daniela; Tellier, Aurélien

2016-01-01

Nod-like receptors (NLRs) are nucleotide-binding domain and leucine-rich repeats containing proteins that are important in plant resistance signaling. Many of the known pathogen resistance (R) genes in plants are NLRs and they can recognize pathogen molecules directly or indirectly. As such, divergence and copy number variants at these genes are found to be high between species. Within populations, positive and balancing selection are to be expected if plants coevolve with their pathogens. In order to understand the complexity of R-gene coevolution in wild nonmodel species, it is necessary to identify the full range of NLRs and infer their evolutionary history. Here we investigate and reveal polymorphism occurring at 220 NLR genes within one population of the partially selfing wild tomato species Solanum pennellii. We use a combination of enrichment sequencing and pooling ten individuals, to specifically sequence NLR genes in a resource and cost-effective manner. We focus on the effects which different mapping and single nucleotide polymorphism calling software and settings have on calling polymorphisms in customized pooled samples. Our results are accurately verified using Sanger sequencing of polymorphic gene fragments. Our results indicate that some NLRs, namely 13 out of 220, have maintained polymorphism within our S. pennellii population. These genes show a wide range of πN/πS ratios and differing site frequency spectra. We compare our observed rate of heterozygosity with expectations for this selfing and bottlenecked population. We conclude that our method enables us to pinpoint NLR genes which have experienced natural selection in their habitat. PMID:27189991
Restricting nonclassical MHC genes coevolve with TRAV genes used by innate-like T cells in mammals

PubMed Central

Boudinot, Pierre; Mondot, Stanislas; Jouneau, Luc; Teyton, Luc; Lefranc, Marie-Paule; Lantz, Olivier

2016-01-01

Whereas major histocompatibility class-1 (MH1) proteins present peptides to T cells displaying a large T-cell receptor (TR) repertoire, MH1Like proteins, such as CD1D and MR1, present glycolipids and microbial riboflavin precursor derivatives, respectively, to T cells expressing invariant TR-α (iTRA) chains. The groove of such MH1Like, as well as iTRA chains used by mucosal-associated invariant T (MAIT) and natural killer T (NKT) cells, respectively, may result from a coevolution under particular selection pressures. Herein, we investigated the evolutionary patterns of the iTRA of MAIT and NKT cells and restricting MH1Like proteins: MR1 appeared 170 Mya and is highly conserved across mammals, evolving more slowly than other MH1Like. It has been pseudogenized or independently lost three times in carnivores, the armadillo, and lagomorphs. The corresponding TRAV1 gene also evolved slowly and harbors highly conserved complementarity determining regions 1 and 2. TRAV1 is absent exclusively from species in which MR1 is lacking, suggesting that its loss released the purifying selection on MR1. In the rabbit, which has very few NKT and no MAIT cells, a previously unrecognized iTRA was identified by sequencing leukocyte RNA. This iTRA uses TRAV41, which is highly conserved across several groups of mammals. A rabbit MH1Like gene was found that appeared with mammals and is highly conserved. It was independently lost in a few groups in which MR1 is present, like primates and Muridae, illustrating compensatory emergences of new MH1Like/Invariant T-cell combinations during evolution. Deciphering their role is warranted to search similar effector functions in humans. PMID:27170188
Spliced synthetic genes as internal controls in RNA sequencing experiments.

PubMed

Hardwick, Simon A; Chen, Wendy Y; Wong, Ted; Deveson, Ira W; Blackburn, James; Andersen, Stacey B; Nielsen, Lars K; Mattick, John S; Mercer, Tim R

2016-09-01

RNA sequencing (RNA-seq) can be used to assemble spliced isoforms, quantify expressed genes and provide a global profile of the transcriptome. However, the size and diversity of the transcriptome, the wide dynamic range in gene expression and inherent technical biases confound RNA-seq analysis. We have developed a set of spike-in RNA standards, termed 'sequins' (sequencing spike-ins), that represent full-length spliced mRNA isoforms. Sequins have an entirely artificial sequence with no homology to natural reference genomes, but they align to gene loci encoded on an artificial in silico chromosome. The combination of multiple sequins across a range of concentrations emulates alternative splicing and differential gene expression, and it provides scaling factors for normalization between samples. We demonstrate the use of sequins in RNA-seq experiments to measure sample-specific biases and determine the limits of reliable transcript assembly and quantification in accompanying human RNA samples. In addition, we have designed a complementary set of sequins that represent fusion genes arising from rearrangements of the in silico chromosome to aid in cancer diagnosis. RNA sequins provide a qualitative and quantitative reference with which to navigate the complexity of the human transcriptome.
Identification of rare genetic variants in Italian patients with dementia by targeted gene sequencing.

PubMed

Bartoletti-Stella, Anna; Baiardi, Simone; Stanzani-Maserati, Michelangelo; Piras, Silvia; Caffarra, Paolo; Raggi, Alberto; Pantieri, Roberta; Baldassari, Sara; Caporali, Leonardo; Abu-Rumeileh, Samir; Linarello, Simona; Liguori, Rocco; Parchi, Piero; Capellari, Sabina

2018-06-01

Genetics is intricately involved in the etiology of neurodegenerative dementias. The incidence of monogenic dementia among all neurodegenerative forms is unknown due to the lack of systematic studies and of patient/clinician access to extensive diagnostic procedures. In this study, we conducted targeted sequencing in 246 clinically heterogeneous patients, mainly with early-onset and/or familial neurodegenerative dementia, using a custom-designed next-generation sequencing panel covering 27 genes known to harbor mutations that can cause different types of dementia, in addition to the detection of C9orf72 repeat expansions. Forty-nine patients (19.9%) carried known pathogenic or novel, likely pathogenic, variants, involving both common (presenilin 1, presenilin 2, C9orf72, and granulin) and rare (optineurin, serpin family I member 1 and protein kinase cyclic adenosine monophosphate (cAMP)-dependent type I regulatory subunit beta) dementia-associated genes. Our results support the use of an extended next-generation sequencing panels as a quick, accurate, and cost-effective method for diagnosis in clinical practice. This approach could have a significant impact on the proportion of tested patients, especially among those with an early disease onset. Copyright © 2018 Elsevier Inc. All rights reserved.
Cloning and sequence analysis of chitin synthase gene fragments of Demodex mites*

PubMed Central

Zhao, Ya-e; Wang, Zheng-hang; Xu, Yang; Xu, Ji-ru; Liu, Wen-yan; Wei, Meng; Wang, Chu-ying

2012-01-01

To our knowledge, few reports on Demodex studied at the molecular level are available at present. In this study our group, for the first time, cloned, sequenced and analyzed the chitin synthase (CHS) gene fragments of Demodex folliculorum, Demodex brevis, and Demodex canis (three isolates from each species) from Xi’an China, by designing specific primers based on the only partial sequence of the CHS gene of D. canis from Japan, retrieved from GenBank. Results show that amplification was successful only in three D. canis isolates and one D. brevis isolate out of the nine Demodex isolates. The obtained fragments were sequenced to be 339 bp for D. canis and 338 bp for D. brevis. The CHS gene sequence similarities between the three Xi’an D. canis isolates and one Japanese D. canis isolate ranged from 99.7% to 100.0%, and those between four D. canis isolates and one D. brevis isolate were 99.1%–99.4%. Phylogenetic trees based on maximum parsimony (MP) and maximum likelihood (ML) methods shared the same clusters, according with the traditional classification. Two open reading frames (ORFs) were identified in each CHS gene sequenced, and their corresponding amino acid sequences were located at the catalytic domain. The relatively conserved sequences could be deduced to be a CHS class A gene, which is associated with chitin synthesis in the integument of Demodex mites. PMID:23024043
Population genetic implications from sequence variation in four Y chromosome genes.

PubMed

Shen, P; Wang, F; Underhill, P A; Franco, C; Yang, W H; Roxas, A; Sung, R; Lin, A A; Hyman, R W; Vollrath, D; Davis, R W; Cavalli-Sforza, L L; Oefner, P J

2000-06-20

Some insight into human evolution has been gained from the sequencing of four Y chromosome genes. Primary genomic sequencing determined gene SMCY to be composed of 27 exons that comprise 4,620 bp of coding sequence. The unfinished sequencing of the 5' portion of gene UTY1 was completed by primer walking, and a total of 20 exons were found. By using denaturing HPLC, these two genes, as well as DBY and DFFRY, were screened for polymorphic sites in 53-72 representatives of the five continents. A total of 98 variants were found, yielding nucleotide diversity estimates of 2.45 x 10(-5), 5. 07 x 10(-5), and 8.54 x 10(-5) for the coding regions of SMCY, DFFRY, and UTY1, respectively, with no variant having been observed in DBY. In agreement with most autosomal genes, diversity estimates for the noncoding regions were about 2- to 3-fold higher and ranged from 9. 16 x 10(-5) to 14.2 x 10(-5) for the four genes. Analysis of the frequencies of derived alleles for all four genes showed that they more closely fit the expectation of a Luria-Delbrück distribution than a distribution expected under a constant population size model, providing evidence for exponential population growth. Pairwise nucleotide mismatch distributions date the occurrence of population expansion to approximately 28,000 years ago. This estimate is in accord with the spread of Aurignacian technology and the disappearance of the Neanderthals.
Targeted gene panel sequencing in children with very early onset inflammatory bowel disease--evaluation and prospective analysis.

PubMed

Kammermeier, Jochen; Drury, Suzanne; James, Chela T; Dziubak, Robert; Ocaka, Louise; Elawad, Mamoun; Beales, Philip; Lench, Nicholas; Uhlig, Holm H; Bacchelli, Chiara; Shah, Neil

2014-11-01

Multiple monogenetic conditions with partially overlapping phenotypes can present with inflammatory bowel disease (IBD)-like intestinal inflammation. With novel genotype-specific therapies emerging, establishing a molecular diagnosis is becoming increasingly important. We have introduced targeted next-generation sequencing (NGS) technology as a prospective screening tool in children with very early onset IBD (VEOIBD). We evaluated the coverage of 40 VEOIBD genes in two separate cohorts undergoing targeted gene panel sequencing (TGPS) (n=25) and whole exome sequencing (WES) (n=20). TGPS revealed causative mutations in four genes (IL10RA, EPCAM, TTC37 and SKIV2L) discovered unexpected phenotypes and directly influenced clinical decision making by supporting as well as avoiding haematopoietic stem cell transplantation. TGPS resulted in significantly higher median coverage when compared with WES, fewer coverage deficiencies and improved variant detection across established VEOIBD genes. Excluding or confirming known VEOIBD genotypes should be considered early in the disease course in all cases of therapy-refractory VEOIBD, as it can have a direct impact on patient management. To combine both described NGS technologies would compensate for the limitations of WES for disease-specific application while offering the opportunity for novel gene discovery in the research setting. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://group.bmj.com/group/rights-licensing/permissions.
Massive sequencing of 70 genes reveals a myriad of missing genes or mechanisms to be uncovered in hereditary spastic paraplegias

PubMed Central

Morais, Sara; Raymond, Laure; Mairey, Mathilde; Coutinho, Paula; Brandão, Eva; Ribeiro, Paula; Loureiro, José Leal; Sequeiros, Jorge; Brice, Alexis; Alonso, Isabel; Stevanin, Giovanni

2017-01-01

Hereditary spastic paraplegias (HSP) are neurodegenerative disorders characterized by lower limb spasticity and weakness that can be complicated by other neurological or non-neurological signs. Despite a high genetic heterogeneity (>60 causative genes), 40–70% of the families remain without a molecular diagnosis. Analysis of one of the pioneer cohorts of 193 HSP families generated in the early 1990s in Portugal highlighted that SPAST and SPG11 are the most frequent diagnoses. We have now explored 98 unsolved families from this series using custom next generation sequencing panels analyzing up to 70 candidate HSP genes. We identified the likely disease-causing variant in 20 of the 98 families with KIF5A being the most frequently mutated gene. We also found 52 variants of unknown significance (VUS) in 38% of the cases. These new diagnoses resulted in 42% of solved cases in the full Portuguese cohort (81/193). Segregation of the variants was not always compatible with the presumed inheritance, indicating that the analysis of all HSP genes regardless of the inheritance mode can help to explain some cases. Our results show that there is still a large set of unknown genes responsible for HSP and most likely novel mechanisms or inheritance modes leading to the disease to be uncovered, but this will require international collaborative efforts, particularly for the analysis of VUS. PMID:28832565
Repertoire, genealogy and genomic organization of cruzipain and homologous genes in Trypanosoma cruzi, T. cruzi-like and other trypanosome species.

PubMed

Lima, Luciana; Ortiz, Paola A; da Silva, Flávia Maia; Alves, João Marcelo P; Serrano, Myrna G; Cortez, Alane P; Alfieri, Silvia C; Buck, Gregory A; Teixeira, Marta M G

2012-01-01

Trypanosoma cruzi, the agent of Chagas disease, is a complex of genetically diverse isolates highly phylogenetically related to T. cruzi-like species, Trypanosoma cruzi marinkellei and Trypanosoma dionisii, all sharing morphology of blood and culture forms and development within cells. However, they differ in hosts, vectors and pathogenicity: T. cruzi is a human pathogen infective to virtually all mammals whilst the other two species are non-pathogenic and bat restricted. Previous studies suggest that variations in expression levels and genetic diversity of cruzipain, the major isoform of cathepsin L-like (CATL) enzymes of T. cruzi, correlate with levels of cellular invasion, differentiation, virulence and pathogenicity of distinct strains. In this study, we compared 80 sequences of genes encoding cruzipain from 25 T. cruzi isolates representative of all discrete typing units (DTUs TcI-TcVI) and the new genotype Tcbat and 10 sequences of homologous genes from other species. The catalytic domain repertoires diverged according to DTUs and trypanosome species. Relatively homogeneous sequences are found within and among isolates of the same DTU except TcV and TcVI, which displayed sequences unique or identical to those of TcII and TcIII, supporting their origin from the hybridization between these two DTUs. In network genealogies, sequences from T. cruzi clustered tightly together and closer to T. c. marinkellei than to T. dionisii and largely differed from homologues of T. rangeli and T. b. brucei. Here, analysis of isolates representative of the overall biological and genetic diversity of T. cruzi and closest T. cruzi-like species evidenced DTU- and species-specific polymorphisms corroborating phylogenetic relationships inferred with other genes. Comparison of both phylogenetically close and distant trypanosomes is valuable to understand host-parasite interactions, virulence and pathogenicity. Our findings corroborate cruzipain as valuable target for drugs, vaccine
Identification of a precursor genomic segment that provided a sequence unique to glycophorin B and E genes

DOE Office of Scientific and Technical Information (OSTI.GOV)

Onda, M.; Kudo, S.; Fukuda, M.

Human glycophorin A, B, and E (GPA, GPB, and GPE) genes belong to a gene family located at the long arm of chromosome 4. These three genes are homologous from the 5'-flanking sequence to the Alu sequence, which is 1 kb downstream from the exon encoding the transmembrane domain. Analysis of the Alu sequence and flanking direct repeat sequences suggested that the GPA gene most closely resembles the ancestral gene, whereas the GPB and GPE gene arose by homologous recombination within the Alu sequence, acquiring 3' sequences from an unrelated precursor genomic segment. Here the authors describe the identification ofmore » this putative precursor genomic segment. A human genomic library was screened by using the sequence of the 3' region of the GPB gene as a probe. The genomic clones isolated were found to contain an Alu sequence that appeared to be involved in the recombination. Downstream from the Alu sequence, the nucleotide sequence of the precursor genomic segment is almost identical to that of the GPB or GPE gene. In contrast, the upstream sequence of the genomic segment differs entirely from that of the GPA, GPB, and GPE genes. Conservation of the direct repeats flanking the Alu sequence of the genomic segment strongly suggests that the sequence of this genomic segment has been maintained during evolution. This identified genomic segment was found to reside downstream from the GPA gene by both gene mapping and in situ chromosomal localization. The precursor genomic segment was also identified in the orangutan genome, which is known to lack GPB and GPE genes. These results indicate that one of the duplicated ancestral glycophorin genes acquired a unique 3' sequence by unequal crossing-over through its Alu sequence and the further downstream Alu sequence present in the duplicated gene. Further duplication and divergence of this gene yielded the GPB and GPE genes. 37 refs., 5 figs.« less
Tribonacci-Like Sequences and Generalized Pascal's Pyramids

ERIC Educational Resources Information Center

Anatriello, Giuseppina; Vincenzi, Giovanni

2014-01-01

A well-known result of Feinberg and Shannon states that the tribonacci sequence can be detected by the so-called "Pascal's pyramid." Here we will show that any tribonacci-like sequence can be obtained by the diagonals of the "Feinberg's triangle" associated to a suitable "generalized Pascal's pyramid."…
Genome sequence comparison reveals a candidate gene involved in male-hermaphrodite differentiation in papaya (Carica papaya) trees.

PubMed

Ueno, Hiroki; Urasaki, Naoya; Natsume, Satoshi; Yoshida, Kentaro; Tarora, Kazuhiko; Shudo, Ayano; Terauchi, Ryohei; Matsumura, Hideo

2015-04-01

The sex type of papaya (Carica papaya) is determined by the pair of sex chromosomes (XX, female; XY, male; and XY(h), hermaphrodite), in which there is a non-recombining genomic region in the Y and Y(h) chromosomes. This region is presumed to be involved in determination of males and hermaphrodites; it is designated as the male-specific region in the Y chromosome (MSY) and the hermaphrodite-specific region in the Y(h) chromosome (HSY). Here, we identified the genes determining male and hermaphrodite sex types by comparing MSY and HSY genomic sequences. In the MSY and HSY genomic regions, we identified 14,528 nucleotide substitutions and 965 short indels with a large gap and two highly diverged regions. In the predicted genes expressed in flower buds, we found no nucleotide differences leading to amino acid changes between the MSY and HSY. However, we found an HSY-specific transposon insertion in a gene (SVP like) showing a similarity to the Short Vegetative Phase (SVP) gene. Study of SVP-like transcripts revealed that the MSY allele encoded an intact protein, while the HSY allele encoded a truncated protein. Our findings demonstrated that the SVP-like gene is a candidate gene for male-hermaphrodite determination in papaya.
Combinatorial Pooling Enables Selective Sequencing of the Barley Gene Space

PubMed Central

Lonardi, Stefano; Duma, Denisa; Alpert, Matthew; Cordero, Francesca; Beccuti, Marco; Bhat, Prasanna R.; Wu, Yonghui; Ciardo, Gianfranco; Alsaihati, Burair; Ma, Yaqin; Wanamaker, Steve; Resnik, Josh; Bozdag, Serdar; Luo, Ming-Cheng; Close, Timothy J.

2013-01-01

For the vast majority of species – including many economically or ecologically important organisms, progress in biological research is hampered due to the lack of a reference genome sequence. Despite recent advances in sequencing technologies, several factors still limit the availability of such a critical resource. At the same time, many research groups and international consortia have already produced BAC libraries and physical maps and now are in a position to proceed with the development of whole-genome sequences organized around a physical map anchored to a genetic map. We propose a BAC-by-BAC sequencing protocol that combines combinatorial pooling design and second-generation sequencing technology to efficiently approach denovo selective genome sequencing. We show that combinatorial pooling is a cost-effective and practical alternative to exhaustive DNA barcoding when preparing sequencing libraries for hundreds or thousands of DNA samples, such as in this case gene-bearing minimum-tiling-path BAC clones. The novelty of the protocol hinges on the computational ability to efficiently compare hundred millions of short reads and assign them to the correct BAC clones (deconvolution) so that the assembly can be carried out clone-by-clone. Experimental results on simulated data for the rice genome show that the deconvolution is very accurate, and the resulting BAC assemblies have high quality. Results on real data for a gene-rich subset of the barley genome confirm that the deconvolution is accurate and the BAC assemblies have good quality. While our method cannot provide the level of completeness that one would achieve with a comprehensive whole-genome sequencing project, we show that it is quite successful in reconstructing the gene sequences within BACs. In the case of plants such as barley, this level of sequence knowledge is sufficient to support critical end-point objectives such as map-based cloning and marker-assisted breeding. PMID:23592960

Combinatorial pooling enables selective sequencing of the barley gene space.

PubMed

Lonardi, Stefano; Duma, Denisa; Alpert, Matthew; Cordero, Francesca; Beccuti, Marco; Bhat, Prasanna R; Wu, Yonghui; Ciardo, Gianfranco; Alsaihati, Burair; Ma, Yaqin; Wanamaker, Steve; Resnik, Josh; Bozdag, Serdar; Luo, Ming-Cheng; Close, Timothy J

2013-04-01

For the vast majority of species - including many economically or ecologically important organisms, progress in biological research is hampered due to the lack of a reference genome sequence. Despite recent advances in sequencing technologies, several factors still limit the availability of such a critical resource. At the same time, many research groups and international consortia have already produced BAC libraries and physical maps and now are in a position to proceed with the development of whole-genome sequences organized around a physical map anchored to a genetic map. We propose a BAC-by-BAC sequencing protocol that combines combinatorial pooling design and second-generation sequencing technology to efficiently approach denovo selective genome sequencing. We show that combinatorial pooling is a cost-effective and practical alternative to exhaustive DNA barcoding when preparing sequencing libraries for hundreds or thousands of DNA samples, such as in this case gene-bearing minimum-tiling-path BAC clones. The novelty of the protocol hinges on the computational ability to efficiently compare hundred millions of short reads and assign them to the correct BAC clones (deconvolution) so that the assembly can be carried out clone-by-clone. Experimental results on simulated data for the rice genome show that the deconvolution is very accurate, and the resulting BAC assemblies have high quality. Results on real data for a gene-rich subset of the barley genome confirm that the deconvolution is accurate and the BAC assemblies have good quality. While our method cannot provide the level of completeness that one would achieve with a comprehensive whole-genome sequencing project, we show that it is quite successful in reconstructing the gene sequences within BACs. In the case of plants such as barley, this level of sequence knowledge is sufficient to support critical end-point objectives such as map-based cloning and marker-assisted breeding.
Network inference analysis identifies an APRR2-like gene linked to pigment accumulation in tomato and pepper fruits.

PubMed

Pan, Yu; Bradley, Glyn; Pyke, Kevin; Ball, Graham; Lu, Chungui; Fray, Rupert; Marshall, Alexandra; Jayasuta, Subhalai; Baxter, Charles; van Wijk, Rik; Boyden, Laurie; Cade, Rebecca; Chapman, Natalie H; Fraser, Paul D; Hodgman, Charlie; Seymour, Graham B

2013-03-01

Carotenoids represent some of the most important secondary metabolites in the human diet, and tomato (Solanum lycopersicum) is a rich source of these health-promoting compounds. In this work, a novel and fruit-related regulator of pigment accumulation in tomato has been identified by artificial neural network inference analysis and its function validated in transgenic plants. A tomato fruit gene regulatory network was generated using artificial neural network inference analysis and transcription factor gene expression profiles derived from fruits sampled at various points during development and ripening. One of the transcription factor gene expression profiles with a sequence related to an Arabidopsis (Arabidopsis thaliana) ARABIDOPSIS PSEUDO RESPONSE REGULATOR2-LIKE gene (APRR2-Like) was up-regulated at the breaker stage in wild-type tomato fruits and, when overexpressed in transgenic lines, increased plastid number, area, and pigment content, enhancing the levels of chlorophyll in immature unripe fruits and carotenoids in red ripe fruits. Analysis of the transcriptome of transgenic lines overexpressing the tomato APPR2-Like gene revealed up-regulation of several ripening-related genes in the overexpression lines, providing a link between the expression of this tomato gene and the ripening process. A putative ortholog of the tomato APPR2-Like gene in sweet pepper (Capsicum annuum) was associated with pigment accumulation in fruit tissues. We conclude that the function of this gene is conserved across taxa and that it encodes a protein that has an important role in ripening.
Conifer R2R3-MYB transcription factors: sequence analyses and gene expression in wood-forming tissues of white spruce (Picea glauca)

PubMed Central

Bedon, Frank; Grima-Pettenati, Jacqueline; Mackay, John

2007-01-01

Background Several members of the R2R3-MYB family of transcription factors act as regulators of lignin and phenylpropanoid metabolism during wood formation in angiosperm and gymnosperm plants. The angiosperm Arabidopsis has over one hundred R2R3-MYBs genes; however, only a few members of this family have been discovered in gymnosperms. Results We isolated and characterised full-length cDNAs encoding R2R3-MYB genes from the gymnosperms white spruce, Picea glauca (13 sequences), and loblolly pine, Pinus taeda L. (five sequences). Sequence similarities and phylogenetic analyses placed the spruce and pine sequences in diverse subgroups of the large R2R3-MYB family, although several of the sequences clustered closely together. We searched the highly variable C-terminal region of diverse plant MYBs for conserved amino acid sequences and identified 20 motifs in the spruce MYBs, nine of which have not previously been reported and three of which are specific to conifers. The number and length of the introns in spruce MYB genes varied significantly, but their positions were well conserved relative to angiosperm MYB genes. Quantitative RTPCR of MYB genes transcript abundance in root and stem tissues revealed diverse expression patterns; three MYB genes were preferentially expressed in secondary xylem, whereas others were preferentially expressed in phloem or were ubiquitous. The MYB genes expressed in xylem, and three others, were up-regulated in the compression wood of leaning trees within 76 hours of induction. Conclusion Our survey of 18 conifer R2R3-MYB genes clearly showed a gene family structure similar to that of Arabidopsis. Three of the sequences are likely to play a role in lignin metabolism and/or wood formation in gymnosperm trees, including a close homolog of the loblolly pine PtMYB4, shown to regulate lignin biosynthesis in transgenic tobacco. PMID:17397551
Comparative analysis of grapevine whole-genome gene predictions, functional annotation, categorization and integration of the predicted gene sequences

PubMed Central

2012-01-01

Background The first draft assembly and gene prediction of the grapevine genome (8X base coverage) was made available to the scientific community in 2007, and functional annotation was developed on this gene prediction. Since then additional Sanger sequences were added to the 8X sequences pool and a new version of the genomic sequence with superior base coverage (12X) was produced. Results In order to more efficiently annotate the function of the genes predicted in the new assembly, it is important to build on as much of the previous work as possible, by transferring 8X annotation of the genome to the 12X version. The 8X and 12X assemblies and gene predictions of the grapevine genome were compared to answer the question, “Can we uniquely map 8X predicted genes to 12X predicted genes?” The results show that while the assemblies and gene structure predictions are too different to make a complete mapping between them, most genes (18,725) showed a one-to-one relationship between 8X predicted genes and the last version of 12X predicted genes. In addition, reshuffled genomic sequence structures appeared. These highlight regions of the genome where the gene predictions need to be taken with caution. Based on the new grapevine gene functional annotation and in-depth functional categorization, twenty eight new molecular networks have been created for VitisNet while the existing networks were updated. Conclusions The outcomes of this study provide a functional annotation of the 12X genes, an update of VitisNet, the system of the grapevine molecular networks, and a new functional categorization of genes. Data are available at the VitisNet website (http://www.sdstate.edu/ps/research/vitis/pathways.cfm). PMID:22554261
Thermodynamics-based models of transcriptional regulation with gene sequence.

PubMed

Wang, Shuqiang; Shen, Yanyan; Hu, Jinxing

2015-12-01

Quantitative models of gene regulatory activity have the potential to improve our mechanistic understanding of transcriptional regulation. However, the few models available today have been based on simplistic assumptions about the sequences being modeled or heuristic approximations of the underlying regulatory mechanisms. In this work, we have developed a thermodynamics-based model to predict gene expression driven by any DNA sequence. The proposed model relies on a continuous time, differential equation description of transcriptional dynamics. The sequence features of the promoter are exploited to derive the binding affinity which is derived based on statistical molecular thermodynamics. Experimental results show that the proposed model can effectively identify the activity levels of transcription factors and the regulatory parameters. Comparing with the previous models, the proposed model can reveal more biological sense.
Functional Analysis of RNA Interference-Related Soybean Pod Borer (Lepidoptera) Genes Based on Transcriptome Sequences.

PubMed

Meng, Fanli; Yang, Mingyu; Li, Yang; Li, Tianyu; Liu, Xinxin; Wang, Guoyue; Wang, Zhanchun; Jin, Xianhao; Li, Wenbin

2018-01-01

RNA interference (RNAi) is useful for controlling pests of agriculturally important crops. The soybean pod borer (SPB) is the most important soybean pest in Northeastern Asia. In an earlier study, we confirmed that the SPB could be controlled via transgenic plant-mediated RNAi. Here, the SPB transcriptome was sequenced to identify RNAi-related genes, and also to establish an RNAi-of-RNAi assay system for evaluating genes involved in the SPB systemic RNAi response. The core RNAi genes, as well as genes potentially involved in double-stranded RNA (dsRNA) uptake were identified based on SPB transcriptome sequences. A phylogenetic analysis and the characterization of these core components as well as dsRNA uptake related genes revealed that they contain conserved domains essential for the RNAi pathway. The results of the RNAi-of-RNAi assay involving Laccas e 2 (a critical cuticle pigmentation gene) as a marker showed that genes encoding the sid-like ( Sil1 ), scavenger receptor class C ( Src ), and scavenger receptor class B ( Srb3 and Srb4 ) proteins of the endocytic pathway were required for SPB cellular uptake of dsRNA. The SPB response was inferred to contain three functional small RNA pathways (i.e., miRNA, siRNA, and piRNA pathways). Additionally, the SPB systemic RNA response may rely on systemic RNA interference deficient transmembrane channel-mediated and receptor-mediated endocytic pathways. The results presented herein may be useful for developing RNAi-mediated methods to control SPB infestations in soybean.
Functional Analysis of RNA Interference-Related Soybean Pod Borer (Lepidoptera) Genes Based on Transcriptome Sequences

PubMed Central

Meng, Fanli; Yang, Mingyu; Li, Yang; Li, Tianyu; Liu, Xinxin; Wang, Guoyue; Wang, Zhanchun; Jin, Xianhao; Li, Wenbin

2018-01-01

RNA interference (RNAi) is useful for controlling pests of agriculturally important crops. The soybean pod borer (SPB) is the most important soybean pest in Northeastern Asia. In an earlier study, we confirmed that the SPB could be controlled via transgenic plant-mediated RNAi. Here, the SPB transcriptome was sequenced to identify RNAi-related genes, and also to establish an RNAi-of-RNAi assay system for evaluating genes involved in the SPB systemic RNAi response. The core RNAi genes, as well as genes potentially involved in double-stranded RNA (dsRNA) uptake were identified based on SPB transcriptome sequences. A phylogenetic analysis and the characterization of these core components as well as dsRNA uptake related genes revealed that they contain conserved domains essential for the RNAi pathway. The results of the RNAi-of-RNAi assay involving Laccase 2 (a critical cuticle pigmentation gene) as a marker showed that genes encoding the sid-like (Sil1), scavenger receptor class C (Src), and scavenger receptor class B (Srb3 and Srb4) proteins of the endocytic pathway were required for SPB cellular uptake of dsRNA. The SPB response was inferred to contain three functional small RNA pathways (i.e., miRNA, siRNA, and piRNA pathways). Additionally, the SPB systemic RNA response may rely on systemic RNA interference deficient transmembrane channel-mediated and receptor-mediated endocytic pathways. The results presented herein may be useful for developing RNAi-mediated methods to control SPB infestations in soybean. PMID:29773992
Comparative analysis of the prion protein gene sequences in African lion.

PubMed

Wu, Chang-De; Pang, Wan-Yong; Zhao, De-Ming

2006-10-01

The prion protein gene of African lion (Panthera Leo) was first cloned and polymorphisms screened. The results suggest that the prion protein gene of eight African lions is highly homogenous. The amino acid sequences of the prion protein (PrP) of all samples tested were identical. Four single nucleotide polymorphisms (C42T, C81A, C420T, T600C) in the prion protein gene (Prnp) of African lion were found, but no amino acid substitutions. Sequence analysis showed that the higher homology is observed to felis catus AF003087 (96.7%) and to sheep number M31313.1 (96.2%) Genbank accessed. With respect to all the mammalian prion protein sequences compared, the African lion prion protein sequence has three amino acid substitutions. The homology might in turn affect the potential intermolecular interactions critical for cross species transmission of prion disease.
Diversity of Two-Domain Laccase-Like Multicopper Oxidase Genes in Streptomyces spp.: Identification of Genes Potentially Involved in Extracellular Activities and Lignocellulose Degradation during Composting of Agricultural Waste

PubMed Central

Lu, Lunhui; Zhang, Jiachao; Chen, Anwei; Chen, Ming; Jiang, Min; Yuan, Yujie; Wu, Haipeng; Lai, Mingyong; He, Yibin

2014-01-01

Traditional three-domain fungal and bacterial laccases have been extensively studied for their significance in various biotechnological applications. Growing molecular evidence points to a wide occurrence of more recently recognized two-domain laccase-like multicopper oxidase (LMCO) genes in Streptomyces spp. However, the current knowledge about their ecological role and distribution in natural or artificial ecosystems is insufficient. The aim of this study was to investigate the diversity and composition of Streptomyces two-domain LMCO genes in agricultural waste composting, which will contribute to the understanding of the ecological function of Streptomyces two-domain LMCOs with potential extracellular activity and ligninolytic capacity. A new specific PCR primer pair was designed to target the two conserved copper binding regions of Streptomyces two-domain LMCO genes. The obtained sequences mainly clustered with Streptomyces coelicolor, Streptomyces violaceusniger, and Streptomyces griseus. Gene libraries retrieved from six composting samples revealed high diversity and a rapid succession of Streptomyces two-domain LMCO genes during composting. The obtained sequence types cluster in 8 distinct clades, most of which are homologous with Streptomyces two-domain LMCO genes, but the sequences of clades III and VIII do not match with any reference sequence of known streptomycetes. Both lignocellulose degradation rates and phenol oxidase activity at pH 8.0 in the composting process were found to be positively associated with the abundance of Streptomyces two-domain LMCO genes. These observations provide important clues that Streptomyces two-domain LMCOs are potentially involved in bacterial extracellular phenol oxidase activities and lignocellulose breakdown during agricultural waste composting. PMID:24657870
Repeats of base oligomers as the primordial coding sequences of the primeval earth and their vestiges in modern genes.

PubMed

Ohno, S

1984-01-01

Three outstanding properties uniquely qualify repeats of base oligomers as the primordial coding sequences of all polypeptide chains. First, when compared with randomly generated base sequences in general, they are more likely to have long open reading frames. Second, periodical polypeptide chains specified by such repeats are more likely to assume either alpha-helical or beta-sheet secondary structures than are polypeptide chains of random sequence. Third, provided that the number of bases in the oligomeric unit is not a multiple of 3, these internally repetitious coding sequences are impervious to randomly sustained base substitutions, deletions, and insertions. This is because the recurring periodicity of their polypeptide chains is given by three consecutive copies of the oligomeric unit translated in three different reading frames. Accordingly, when one reading frame is open, the other two are automatically open as well, all three being capable of coding for polypeptide chains of identical periodicity. Under this circumstance, a frame shift due to the deletion or insertion of a number of bases that is not a multiple of 3 fails to alter the down-stream amino acid sequence, and even a base change causing premature chain-termination can silence only one of the three potential coding units. Newly arisen coding sequences in modern organisms are oligomeric repeats, and most of the older genes retain various vestiges of their original internal repetitions. Some of the genes (e.g., oncogenes) have even inherited the property of being impervious to randomly sustained base changes.
The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.

PubMed

Motamayor, Juan C; Mockaitis, Keithanne; Schmutz, Jeremy; Haiminen, Niina; Livingstone, Donald; Cornejo, Omar; Findley, Seth D; Zheng, Ping; Utro, Filippo; Royaert, Stefan; Saski, Christopher; Jenkins, Jerry; Podicheti, Ram; Zhao, Meixia; Scheffler, Brian E; Stack, Joseph C; Feltus, Frank A; Mustiga, Guiliana M; Amores, Freddy; Phillips, Wilbert; Marelli, Jean Philippe; May, Gregory D; Shapiro, Howard; Ma, Jianxin; Bustamante, Carlos D; Schnell, Raymond J; Main, Dorrie; Gilbert, Don; Parida, Laxmi; Kuhn, David N

2013-06-03

Theobroma cacao L. cultivar Matina 1-6 belongs to the most cultivated cacao type. The availability of its genome sequence and methods for identifying genes responsible for important cacao traits will aid cacao researchers and breeders. We describe the sequencing and assembly of the genome of Theobroma cacao L. cultivar Matina 1-6. The genome of the Matina 1-6 cultivar is 445 Mbp, which is significantly larger than a sequenced Criollo cultivar, and more typical of other cultivars. The chromosome-scale assembly, version 1.1, contains 711 scaffolds covering 346.0 Mbp, with a contig N50 of 84.4 kbp, a scaffold N50 of 34.4 Mbp, and an evidence-based gene set of 29,408 loci. Version 1.1 has 10x the scaffold N50 and 4x the contig N50 as Criollo, and includes 111 Mb more anchored sequence. The version 1.1 assembly has 4.4% gap sequence, while Criollo has 10.9%. Through a combination of haplotype, association mapping and gene expression analyses, we leverage this robust reference genome to identify a promising candidate gene responsible for pod color variation. We demonstrate that green/red pod color in cacao is likely regulated by the R2R3 MYB transcription factor TcMYB113, homologs of which determine pigmentation in Rosaceae, Solanaceae, and Brassicaceae. One SNP within the target site for a highly conserved trans-acting siRNA in dicots, found within TcMYB113, seems to affect transcript levels of this gene and therefore pod color variation. We report a high-quality sequence and annotation of Theobroma cacao L. and demonstrate its utility in identifying candidate genes regulating traits.
EUGENE'HOM: A generic similarity-based gene finder using multiple homologous sequences.

PubMed

Foissac, Sylvain; Bardou, Philippe; Moisan, Annick; Cros, Marie-Josée; Schiex, Thomas

2003-07-01

EUGENE'HOM is a gene prediction software for eukaryotic organisms based on comparative analysis. EUGENE'HOM is able to take into account multiple homologous sequences from more or less closely related organisms. It integrates the results of TBLASTX analysis, splice site and start codon prediction and a robust coding/non-coding probabilistic model which allows EUGENE'HOM to handle sequences from a variety of organisms. The current target of EUGENE'HOM is plant sequences. The EUGENE'HOM web site is available at http://genopole.toulouse.inra.fr/bioinfo/eugene/EuGeneHom/cgi-bin/EuGeneHom.pl.
Characteristics of the Lotus japonicus gene repertoire deduced from large-scale expressed sequence tag (EST) analysis.

PubMed

Asamizu, Erika; Nakamura, Yasukazu; Sato, Shusei; Tabata, Satoshi

2004-02-01

To perform a comprehensive analysis of genes expressed in a model legume, Lotus japonicus, a total of 74472 3'-end expressed sequence tags (EST) were generated from cDNA libraries produced from six different organs. Clustering of sequences was performed with an identity criterion of 95% for 50 bases, and a total of 20457 non-redundant sequences, 8503 contigs and 11954 singletons were generated. EST sequence coverage was analyzed by using the annotated L. japonicus genomic sequence and 1093 of the 1889 predicted protein-encoding genes (57.9%) were hit by the EST sequence(s). Gene content was compared to several plant species. Among the 8503 contigs, 471 were identified as sequences conserved only in leguminous species and these included several disease resistance-related genes. This suggested that in legumes, these genes may have evolved specifically to resist pathogen attack. The rate of gene sequence divergence was assessed by comparing similarity level and functional category based on the Gene Ontology (GO) annotation of Arabidopsis genes. This revealed that genes encoding ribosomal proteins, as well as those related to translation, photosynthesis, and cellular structure were more abundantly represented in the highly conserved class, and that genes encoding transcription factors and receptor protein kinases were abundantly represented in the less conserved class. To make the sequence information and the cDNA clones available to the research community, a Web database with useful services was created at http://www.kazusa.or.jp/en/plant/lotus/EST/.
Minimum information about a marker gene sequence (MIMARKS) and minimum information about any (x) sequence (MIxS) specifications

PubMed Central

Yilmaz, Pelin; Kottmann, Renzo; Field, Dawn; Knight, Rob; Cole, James R; Amaral-Zettler, Linda; Gilbert, Jack A; Karsch-Mizrachi, Ilene; Johnston, Anjanette; Cochrane, Guy; Vaughan, Robert; Hunter, Christopher; Park, Joonhong; Morrison, Norman; Rocca-Serra, Philippe; Sterk, Peter; Arumugam, Manimozhiyan; Bailey, Mark; Baumgartner, Laura; Birren, Bruce W; Blaser, Martin J; Bonazzi, Vivien; Booth, Tim; Bork, Peer; Bushman, Frederic D; Buttigieg, Pier Luigi; Chain, Patrick S G; Charlson, Emily; Costello, Elizabeth K; Huot-Creasy, Heather; Dawyndt, Peter; DeSantis, Todd; Fierer, Noah; Fuhrman, Jed A; Gallery, Rachel E; Gevers, Dirk; Gibbs, Richard A; Gil, Inigo San; Gonzalez, Antonio; Gordon, Jeffrey I; Guralnick, Robert; Hankeln, Wolfgang; Highlander, Sarah; Hugenholtz, Philip; Jansson, Janet; Kau, Andrew L; Kelley, Scott T; Kennedy, Jerry; Knights, Dan; Koren, Omry; Kuczynski, Justin; Kyrpides, Nikos; Larsen, Robert; Lauber, Christian L; Legg, Teresa; Ley, Ruth E; Lozupone, Catherine A; Ludwig, Wolfgang; Lyons, Donna; Maguire, Eamonn; Methé, Barbara A; Meyer, Folker; Muegge, Brian; Nakielny, Sara; Nelson, Karen E; Nemergut, Diana; Neufeld, Josh D; Newbold, Lindsay K; Oliver, Anna E; Pace, Norman R; Palanisamy, Giriprakash; Peplies, Jörg; Petrosino, Joseph; Proctor, Lita; Pruesse, Elmar; Quast, Christian; Raes, Jeroen; Ratnasingham, Sujeevan; Ravel, Jacques; Relman, David A; Assunta-Sansone, Susanna; Schloss, Patrick D; Schriml, Lynn; Sinha, Rohini; Smith, Michelle I; Sodergren, Erica; Spor, Aymé; Stombaugh, Jesse; Tiedje, James M; Ward, Doyle V; Weinstock, George M; Wendel, Doug; White, Owen; Whiteley, Andrew; Wilke, Andreas; Wortman, Jennifer R; Yatsunenko, Tanya; Glöckner, Frank Oliver

2012-01-01

Here we present a standard developed by the Genomic Standards Consortium (GSC) for reporting marker gene sequences—the minimum information about a marker gene sequence (MIMARKS). We also introduce a system for describing the environment from which a biological sample originates. The ‘environmental packages’ apply to any genome sequence of known origin and can be used in combination with MIMARKS and other GSC checklists. Finally, to establish a unified standard for describing sequence data and to provide a single point of entry for the scientific community to access and learn about GSC checklists, we present the minimum information about any (x) sequence (MIxS). Adoption of MIxS will enhance our ability to analyze natural genetic diversity documented by massive DNA sequencing efforts from myriad ecosystems in our ever-changing biosphere. PMID:21552244
Prevalence of ColE1-Like Plasmids and Kanamycin Resistance Genes in Salmonella enterica Serovars ▿

PubMed Central

Chen, Chin-Yi; Lindsey, Rebecca L.; Strobaugh, Terence P.; Frye, Jonathan G.; Meinersmann, Richard J.

2010-01-01

Multi-antimicrobial-resistant Salmonella enterica strains frequently carry resistance genes on plasmids. Recent studies focus heavily on large conjugative plasmids, and the role that small plasmids play in resistance gene transfer is largely unknown. To expand our previous studies in assessing the prevalence of the isolates harboring ColE1-like plasmids carrying the aph gene responsible for kanamycin resistance (Kanr) phenotypes, 102 Kanr Salmonella isolates collected through the National Antimicrobial Resistance Monitoring System (NARMS) in 2005 were screened by PCR using ColE1 primer sets. Thirty isolates were found to be positive for ColE1-like replicon. Plasmids from 23 isolates were able to propagate in Escherichia coli and were subjected to further characterization. Restriction mapping revealed three major plasmid groups found in three or more isolates, with each group consisting of two to three subtypes. The aph genes from the Kanr Salmonella isolates were amplified by PCR, sequenced, and showed four different aph(3′)-I genes. The distribution of the ColE1 plasmid groups in association with the aph gene, Salmonella serovar, and isolate source demonstrated a strong linkage of the plasmid with S. enterica serovar Typhimurium DT104. Due to their high copy number and mobility, the ColE1-like plasmids may play a critical role in transmission of antibiotic resistance genes among enteric pathogens, and these findings warrant a close monitoring of this plasmid incompatibility group. PMID:20693446
Inferring gene expression from ribosomal promoter sequences, a crowdsourcing approach

PubMed Central

Meyer, Pablo; Siwo, Geoffrey; Zeevi, Danny; Sharon, Eilon; Norel, Raquel; Segal, Eran; Stolovitzky, Gustavo; Siwo, Geoffrey; Rider, Andrew K.; Tan, Asako; Pinapati, Richard S.; Emrich, Scott; Chawla, Nitesh; Ferdig, Michael T.; Tung, Yi-An; Chen, Yong-Syuan; Chen, Mei-Ju May; Chen, Chien-Yu; Knight, Jason M.; Sahraeian, Sayed Mohammad Ebrahim; Esfahani, Mohammad Shahrokh; Dreos, Rene; Bucher, Philipp; Maier, Ezekiel; Saeys, Yvan; Szczurek, Ewa; Myšičková, Alena; Vingron, Martin; Klein, Holger; Kiełbasa, Szymon M.; Knisley, Jeff; Bonnell, Jeff; Knisley, Debra; Kursa, Miron B.; Rudnicki, Witold R.; Bhattacharjee, Madhuchhanda; Sillanpää, Mikko J.; Yeung, James; Meysman, Pieter; Rodríguez, Aminael Sánchez; Engelen, Kristof; Marchal, Kathleen; Huang, Yezhou; Mordelet, Fantine; Hartemink, Alexander; Pinello, Luca; Yuan, Guo-Cheng

2013-01-01

The Gene Promoter Expression Prediction challenge consisted of predicting gene expression from promoter sequences in a previously unknown experimentally generated data set. The challenge was presented to the community in the framework of the sixth Dialogue for Reverse Engineering Assessments and Methods (DREAM6), a community effort to evaluate the status of systems biology modeling methodologies. Nucleotide-specific promoter activity was obtained by measuring fluorescence from promoter sequences fused upstream of a gene for yellow fluorescence protein and inserted in the same genomic site of yeast Saccharomyces cerevisiae. Twenty-one teams submitted results predicting the expression levels of 53 different promoters from yeast ribosomal protein genes. Analysis of participant predictions shows that accurate values for low-expressed and mutated promoters were difficult to obtain, although in the latter case, only when the mutation induced a large change in promoter activity compared to the wild-type sequence. As in previous DREAM challenges, we found that aggregation of participant predictions provided robust results, but did not fare better than the three best algorithms. Finally, this study not only provides a benchmark for the assessment of methods predicting activity of a specific set of promoters from their sequence, but it also shows that the top performing algorithm, which used machine-learning approaches, can be improved by the addition of biological features such as transcription factor binding sites. PMID:23950146
Pooled Enrichment Sequencing Identifies Diversity and Evolutionary Pressures at NLR Resistance Genes within a Wild Tomato Population.

PubMed

Stam, Remco; Scheikl, Daniela; Tellier, Aurélien

2016-06-02

Nod-like receptors (NLRs) are nucleotide-binding domain and leucine-rich repeats containing proteins that are important in plant resistance signaling. Many of the known pathogen resistance (R) genes in plants are NLRs and they can recognize pathogen molecules directly or indirectly. As such, divergence and copy number variants at these genes are found to be high between species. Within populations, positive and balancing selection are to be expected if plants coevolve with their pathogens. In order to understand the complexity of R-gene coevolution in wild nonmodel species, it is necessary to identify the full range of NLRs and infer their evolutionary history. Here we investigate and reveal polymorphism occurring at 220 NLR genes within one population of the partially selfing wild tomato species Solanum pennellii. We use a combination of enrichment sequencing and pooling ten individuals, to specifically sequence NLR genes in a resource and cost-effective manner. We focus on the effects which different mapping and single nucleotide polymorphism calling software and settings have on calling polymorphisms in customized pooled samples. Our results are accurately verified using Sanger sequencing of polymorphic gene fragments. Our results indicate that some NLRs, namely 13 out of 220, have maintained polymorphism within our S. pennellii population. These genes show a wide range of πN/πS ratios and differing site frequency spectra. We compare our observed rate of heterozygosity with expectations for this selfing and bottlenecked population. We conclude that our method enables us to pinpoint NLR genes which have experienced natural selection in their habitat. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Sequence, distribution and chromosomal context of class I and class II pilin genes of Neisseria meningitidis identified in whole genome sequences

PubMed Central

2014-01-01

Background Neisseria meningitidis expresses type four pili (Tfp) which are important for colonisation and virulence. Tfp have been considered as one of the most variable structures on the bacterial surface due to high frequency gene conversion, resulting in amino acid sequence variation of the major pilin subunit (PilE). Meningococci express either a class I or a class II pilE gene and recent work has indicated that class II pilins do not undergo antigenic variation, as class II pilE genes encode conserved pilin subunits. The purpose of this work was to use whole genome sequences to further investigate the frequency and variability of the class II pilE genes in meningococcal isolate collections. Results We analysed over 600 publically available whole genome sequences of N. meningitidis isolates to determine the sequence and genomic organization of pilE. We confirmed that meningococcal strains belonging to a limited number of clonal complexes (ccs, namely cc1, cc5, cc8, cc11 and cc174) harbour a class II pilE gene which is conserved in terms of sequence and chromosomal context. We also identified pilS cassettes in all isolates with class II pilE, however, our analysis indicates that these do not serve as donor sequences for pilE/pilS recombination. Furthermore, our work reveals that the class II pilE locus lacks the DNA sequence motifs that enable (G4) or enhance (Sma/Cla repeat) pilin antigenic variation. Finally, through analysis of pilin genes in commensal Neisseria species we found that meningococcal class II pilE genes are closely related to pilE from Neisseria lactamica and Neisseria polysaccharea, suggesting horizontal transfer among these species. Conclusions Class II pilins can be defined by their amino acid sequence and genomic context and are present in meningococcal isolates which have persisted and spread globally. The absence of G4 and Sma/Cla sequences adjacent to the class II pilE genes is consistent with the lack of pilin subunit variation in these
Community-Level Analysis of psbA Gene Sequences and Irgarol Tolerance in Marine Periphyton▿

PubMed Central

Eriksson, K. M.; Clarke, A. K.; Franzen, L.-G.; Kuylenstierna, M.; Martinez, K.; Blanck, H.

2009-01-01

This study analyzes psbA gene sequences, predicted D1 protein sequences, species relative abundance, and pollution-induced community tolerance in marine periphyton communities exposed to the antifouling compound Irgarol 1051. The mechanism of action of Irgarol is the inhibition of photosynthetic electron transport at photosystem II by binding to the D1 protein. The metagenome of the communities was used to produce clone libraries containing fragments of the psbA gene encoding the D1 protein. Community tolerance was quantified with a short-term test for the inhibition of photosynthesis. The communities were established in a continuous flow of natural seawater through microcosms with or without added Irgarol. The selection pressure from Irgarol resulted in an altered species composition and an inducted community tolerance to Irgarol. Moreover, there was a very high diversity in the psbA gene sequences in the periphyton, and the composition of psbA and D1 fragments within the communities was dramatically altered by increased Irgarol exposure. Even though tolerance to this type of compound in land plants often depends on a single amino acid substitution (Ser264→Gly) in the D1 protein, this was not the case for marine periphyton species. Instead, the tolerance mechanism likely involves increased degradation of D1. When we compared sequences from low and high Irgarol exposure, differences in nonconserved amino acids were found only in the so-called PEST region of D1, which is involved in regulating its degradation. Our results suggest that environmental contamination with Irgarol has led to selection for high-turnover D1 proteins in marine periphyton communities at the west coast of Sweden. PMID:19088321
Gene discovery by chemical mutagenesis and whole-genome sequencing in Dictyostelium.

PubMed

Li, Cheng-Lin Frank; Santhanam, Balaji; Webb, Amanda Nicole; Zupan, Blaž; Shaulsky, Gad

2016-09-01

Whole-genome sequencing is a useful approach for identification of chemical-induced lesions, but previous applications involved tedious genetic mapping to pinpoint the causative mutations. We propose that saturation mutagenesis under low mutagenic loads, followed by whole-genome sequencing, should allow direct implication of genes by identifying multiple independent alleles of each relevant gene. We tested the hypothesis by performing three genetic screens with chemical mutagenesis in the social soil amoeba Dictyostelium discoideum Through genome sequencing, we successfully identified mutant genes with multiple alleles in near-saturation screens, including resistance to intense illumination and strong suppressors of defects in an allorecognition pathway. We tested the causality of the mutations by comparison to published data and by direct complementation tests, finding both dominant and recessive causative mutations. Therefore, our strategy provides a cost- and time-efficient approach to gene discovery by integrating chemical mutagenesis and whole-genome sequencing. The method should be applicable to many microbial systems, and it is expected to revolutionize the field of functional genomics in Dictyostelium by greatly expanding the mutation spectrum relative to other common mutagenesis methods. © 2016 Li et al.; Published by Cold Spring Harbor Laboratory Press.

Exome sequencing and arrayCGH detection of gene sequence and copy number variation between ILS and ISS mouse strains.

PubMed

Dumas, Laura; Dickens, C Michael; Anderson, Nathan; Davis, Jonathan; Bennett, Beth; Radcliffe, Richard A; Sikela, James M

2014-06-01

It has been well documented that genetic factors can influence predisposition to develop alcoholism. While the underlying genomic changes may be of several types, two of the most common and disease associated are copy number variations (CNVs) and sequence alterations of protein coding regions. The goal of this study was to identify CNVs and single-nucleotide polymorphisms that occur in gene coding regions that may play a role in influencing the risk of an individual developing alcoholism. Toward this end, two mouse strains were used that have been selectively bred based on their differential sensitivity to alcohol: the Inbred long sleep (ILS) and Inbred short sleep (ISS) mouse strains. Differences in initial response to alcohol have been linked to risk for alcoholism, and the ILS/ISS strains are used to investigate the genetics of initial sensitivity to alcohol. Array comparative genomic hybridization (arrayCGH) and exome sequencing were conducted to identify CNVs and gene coding sequence differences, respectively, between ILS and ISS mice. Mouse arrayCGH was performed using catalog Agilent 1 × 244 k mouse arrays. Subsequently, exome sequencing was carried out using an Illumina HiSeq 2000 instrument. ArrayCGH detected 74 CNVs that were strain-specific (38 ILS/36 ISS), including several ISS-specific deletions that contained genes implicated in brain function and neurotransmitter release. Among several interesting coding variations detected by exome sequencing was the gain of a premature stop codon in the alpha-amylase 2B (AMY2B) gene specifically in the ILS strain. In total, exome sequencing detected 2,597 and 1,768 strain-specific exonic gene variants in the ILS and ISS mice, respectively. This study represents the most comprehensive and detailed genomic comparison of ILS and ISS mouse strains to date. The two complementary genome-wide approaches identified strain-specific CNVs and gene coding sequence variations that should provide strong candidates to
Selective Constraints on Coding Sequences of Nervous System Genes Are a Major Determinant of Duplicate Gene Retention in Vertebrates.

PubMed

Roux, Julien; Liu, Jialin; Robinson-Rechavi, Marc

2017-11-01

The evolutionary history of vertebrates is marked by three ancient whole-genome duplications: two successive rounds in the ancestor of vertebrates, and a third one specific to teleost fishes. Biased loss of most duplicates enriched the genome for specific genes, such as slow evolving genes, but this selective retention process is not well understood. To understand what drives the long-term preservation of duplicate genes, we characterized duplicated genes in terms of their expression patterns. We used a new method of expression enrichment analysis, TopAnat, applied to in situ hybridization data from thousands of genes from zebrafish and mouse. We showed that the presence of expression in the nervous system is a good predictor of a higher rate of retention of duplicate genes after whole-genome duplication. Further analyses suggest that purifying selection against the toxic effects of misfolded or misinteracting proteins, which is particularly strong in nonrenewing neural tissues, likely constrains the evolution of coding sequences of nervous system genes, leading indirectly to the preservation of duplicate genes after whole-genome duplication. Whole-genome duplications thus greatly contributed to the expansion of the toolkit of genes available for the evolution of profound novelties of the nervous system at the base of the vertebrate radiation. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Polyhedron-like inclusion body formation by a mutant nucleopolyhedrovirus expressing the granulin gene from a granulovirus.

PubMed

Zhou, C E; Ko, R; Maeda, S

1998-01-20

The polyhedrin gene in Bombyx mori nucleopolyhedrovirus (BmNPV) was replaced with the granulin gene of Trichoplusia ni granulovirus (TnGV). The substitution was verified by Southern hybridization, and expression of granulin by the mutant virus, BmGran, was demonstrated by sodium dodecyl sulfate-polyacrylamide gel electrophoresis and by amino acid sequencing of the predominant protein of BmGran inclusion bodies (IBs). Light and electron microscopy examination of BmGran-infected B. mori and BmN cells revealed large, cuboidal, polyhedron-like IBs in the nucleus and cytoplasm, but granules were not seen. IBs contained small, parallel, electron-dense streaks, which defined the geometric pattern of crystallization. Geometric patterns of nuclear IBs were frequently disrupted by occlusion of polyhedron envelope fragments, resulting in IB instability and fracturing. Virions were not embedded in most of the polyhedron-like IBs, but accumulated with polyhedron envelope fragments. Some virions were coated with matrix protein and were partially wrapped by polyhedron envelope. These results suggested that (1) the amino acid sequence of granulin insufficient for determining IB morphology in TnGV-infected cells, and TnGV may have genes, not present in BmNPV, that control granule formation, and (2) interactions among the virion, the IB envelope, and the matrix protein may be important in virion occlusion and IB morphology and stability.
Sequence analysis and gene expression of putative oil palm chitinase and chitinase-like proteins in response to colonization of Ganoderma boninense and Trichoderma harzianum.

PubMed

Yeoh, K-A; Othman, A; Meon, S; Abdullah, F; Ho, C-L

2013-01-01

Chitinases are glycosyl hydrolases that cleave the β-1,4-glycosidic linkages between N-acetylglucosamine residues in chitin which is a major component of fungal cell wall. Plant chitinases hydrolyze fungal chitin to chitin oligosaccharides that serve as elicitors of plant defense system against fungal pathogens. However, plants synthesize many chitinase isozymes and some of them are not pathogenesis-related. In this study, three full-length cDNA sequences encoding a putative chitinase (EgChit3-1) and two chitinase-like proteins (EgChit1-1 and EgChit5-1) have been cloned from oil palm (Elaeis guineensis) by polymerase chain reaction (PCR). The abundance of these transcripts in the roots and leaves of oil palm seedlings treated with Ganoderma boninense (a fungal pathogen) or Trichoderma harzianum (an avirulent symbiont), and a combination of both fungi at 3, 6 and 12 weeks post infection were profiled by real time quantitative reverse-transcription (qRT)-PCR. Our findings showed that the gene expression of EgChit3-1 increased significantly in the roots of oil palm seedlings treated with either G. boninense or T. harzianum and a combination of both; whereas the gene expression of EgChit1-1 in the treated roots of oil palm seedlings was not significantly higher compared to those of the untreated oil palm roots. The gene expression of EgChit5-1 was only higher in the roots of oil palm seedlings treated with T. harzianum compared to those of the untreated oil palm roots. In addition, the gene expression of EgChit1-1 and EgChit3-1 showed a significantly higher gene expression in the leaf samples of oil palm seedlings treated with either G. boninense or T. harzianum.
Nucleotide sequence of the gag gene and gag-pol junction of feline leukemia virus.

PubMed Central

Laprevotte, I; Hampe, A; Sherr, C J; Galibert, F

1984-01-01

The nucleotide sequence of the gag gene of feline leukemia virus and its flanking sequences were determined and compared with the corresponding sequences of two strains of feline sarcoma virus and with that of the Moloney strain of murine leukemia virus. A high degree of nucleotide sequence homology between the feline leukemia virus and murine leukemia virus gag genes was observed, suggesting that retroviruses of domestic cats and laboratory mice have a common, proximal evolutionary progenitor. The predicted structure of the complete feline leukemia virus gag gene precursor suggests that the translation of nonglycosylated and glycosylated gag gene polypeptides is initiated at two different AUG codons. These initiator codons fall in the same reading frame and are separated by a 222-base-pair segment which encodes an amino terminal signal peptide. The nucleotide sequence predicts the order of amino acids in each of the individual gag-coded proteins (p15, p12, p30, p10), all of which derive from the gag gene precursor. Stable stem-and-loop secondary structures are proposed for two regions of viral RNA. The first falls within sequences at the 5' end of the viral genome, together with adjacent palindromic sequences which may play a role in dimer linkage of RNA subunits. The second includes coding sequences at the gag-pol junction and is proposed to be involved in translation of the pol gene product. Sequence analysis of the latter region shows that the gag and pol genes are translated in different reading frames. Classical consensus splice donor and acceptor sequences could not be localized to regions which would permit synthesis of the expected gag-pol precursor protein. Alternatively, we suggest that the pol gene product (RNA-dependent DNA polymerase) could be translated by a frameshift suppressing mechanism which could involve cleavage modification of stems and loops in a manner similar to that observed in tRNA processing. PMID:6328019
Mapping by sequencing in cotton (Gossypium hirsutum) line MD52ne identified candidate genes for fiber strength and its related quality attributes.

PubMed

Islam, Md S; Zeng, Linghe; Thyssen, Gregory N; Delhom, Christopher D; Kim, Hee Jin; Li, Ping; Fang, David D

2016-06-01

Three QTL regions controlling three fiber quality traits were validated and further fine-mapped with 27 new single nucleotide polymorphism (SNP) markers. Transcriptome analysis suggests that receptor-like kinases found within the validated QTLs are potential candidate genes responsible for superior fiber strength in cotton line MD52ne. Fiber strength, length, maturity and fineness determine the market value of cotton fibers and the quality of spun yarn. Cotton fiber strength has been recognized as a critical quality attribute in the modern textile industry. Fine mapping along with quantitative trait loci (QTL) validation and candidate gene prediction can uncover the genetic and molecular basis of fiber quality traits. Four previously-identified QTLs (qFBS-c3, qSFI-c14, qUHML-c14 and qUHML-c24) related to fiber bundle strength, short fiber index and fiber length, respectively, were validated using an F3 population that originated from a cross of MD90ne × MD52ne. A group of 27 new SNP markers generated from mapping-by-sequencing (MBS) were placed in QTL regions to improve and validate earlier maps. Our refined QTL regions spanned 4.4, 1.8 and 3.7 Mb of physical distance in the Gossypium raimondii reference genome. We performed RNA sequencing (RNA-seq) of 15 and 20 days post-anthesis fiber cells from MD52ne and MD90ne and aligned reads to the G. raimondii genome. The QTL regions contained 21 significantly differentially expressed genes (DEGs) between the two near-isogenic parental lines. SNPs that result in non-synonymous substitutions to amino acid sequences of annotated genes were identified within these DEGs, and mapped. Taken together, transcriptome and amino acid mutation analysis indicate that receptor-like kinase pathway genes are likely candidates for superior fiber strength and length in MD52ne. MBS along with RNA-seq demonstrated a powerful strategy to elucidate candidate genes for the QTLs that control complex traits in a complex genome like tetraploid
Using variable rate models to identify genes under selection in sequence pairs: their validity and limitations for EST sequences.

PubMed

Church, Sheri A; Livingstone, Kevin; Lai, Zhao; Kozik, Alexander; Knapp, Steven J; Michelmore, Richard W; Rieseberg, Loren H

2007-02-01

Using likelihood-based variable selection models, we determined if positive selection was acting on 523 EST sequence pairs from two lineages of sunflower and lettuce. Variable rate models are generally not used for comparisons of sequence pairs due to the limited information and the inaccuracy of estimates of specific substitution rates. However, previous studies have shown that the likelihood ratio test (LRT) is reliable for detecting positive selection, even with low numbers of sequences. These analyses identified 56 genes that show a signature of selection, of which 75% were not identified by simpler models that average selection across codons. Subsequent mapping studies in sunflower show four of five of the positively selected genes identified by these methods mapped to domestication QTLs. We discuss the validity and limitations of using variable rate models for comparisons of sequence pairs, as well as the limitations of using ESTs for identification of positively selected genes.
High-throughput discovery of mutations in tef semi-dwarfing genes by next-generation sequencing analysis.

PubMed

Zhu, Qihui; Smith, Shavannor M; Ayele, Mulu; Yang, Lixing; Jogi, Ansuya; Chaluvadi, Srinivasa R; Bennetzen, Jeffrey L

2012-11-01

Tef (Eragrostis tef) is a major cereal crop in Ethiopia. Lodging is the primary constraint to increasing productivity in this allotetraploid species, accounting for losses of ∼15-45% in yield each year. As a first step toward identifying semi-dwarf varieties that might have improved lodging resistance, an ∼6× fosmid library was constructed and used to identify both homeologues of the dw3 semi-dwarfing gene of Sorghum bicolor. An EMS mutagenized population, consisting of ∼21,210 tef plants, was planted and leaf materials were collected into 23 superpools. Two dwarfing candidate genes, homeologues of dw3 of sorghum and rht1 of wheat, were sequenced directly from each superpool with 454 technology, and 120 candidate mutations were identified. Out of 10 candidates tested, six independent mutations were validated by Sanger sequencing, including two predicted detrimental mutations in both dw3 homeologues with a potential to improve lodging resistance in tef through further breeding. This study demonstrates that high-throughput sequencing can identify potentially valuable mutations in under-studied plant species like tef and has provided mutant lines that can now be combined and tested in breeding programs for improved lodging resistance.
Molecular Cloning and Sequencing of Hemoglobin-Beta Gene of Channel Catfish, Ictalurus Punctatus Rafinesque

USDA-ARS?s Scientific Manuscript database

: Hemoglobin-y gene of channel catfish , lctalurus punctatus, was cloned and sequenced . Total RNA from head kidneys was isolated, reverse transcribed and amplified . The sequence of the channel catfish hemoglobin-y gene consists of 600 nucleotides . Analysis of the nucleotide sequence reveals one o...
Plasmodium vivax rhomboid-like protease 1 gene diversity in Thailand.

PubMed

Mataradchakul, Touchchapol; Uthaipibull, Chairat; Nosten, Francois; Vega-Rodriguez, Joel; Jacobs-Lorena, Marcelo; Lek-Uthai, Usa

2017-10-01

Plasmodium vivax infection remains a major public health problem, especially along the Thailand border regions. We examined the genetic diversity of this parasite by analyzing single-nucleotide polymorphisms (SNPs) of the P. vivax rhomboid-like protease 1 gene (Pvrom1) in parasites collected from western (Tak province, Thai-Myanmar border) and eastern (Chanthaburi province, Thai-Cambodia border) regions. Data were collected by a cross-sectional survey, consisting of 47 and 45 P. vivax-infected filter paper-spotted blood samples from the western and eastern regions of Thailand, respectively during September 2013 to May 2014. Extracted DNA was examined for presence of P. vivax using Plasmodium species-specific nested PCR. Pvrom1 gene was PCR amplified, sequenced and the SNP diversity was analyzed using F-STAT, DnaSP, MEGA and LIAN programs. Comparison of sequences of the 92 Pvrom1 831-base open reading frames with that of a reference sequence (GenBank acc. no. XM001615211) revealed 17 samples with a total of 8 polymorphic sites, consisting of singleton (exon 3, nt 645) and parsimony informative (exon 1, nt 22 and 39; exon 3, nt 336, 537 and 656; and exon 4, nt 719 and 748) sites, which resulted in six different deduced Pvrom1 variants. Non-synonymous to synonymous substitutions ratio estimated by the DnaSP program was 1.65 indicating positive selection, but the Z-tests of selection showed no significant deviations from neutrality for Pvrom1 samples from western region of Thailand. In addition McDonald Kreitman test (MK) showed not significant, and Fst values are not different between the two regions and the regions combined. Interestingly, only Pvrom1 exon 2 was the most conserved sequences among the four exons. The relatively high degree of Pvrom1 polymorphism suggests that the protein is important for parasite survival in face of changes in both insect vector and human populations. These polymorphisms could serve as a sensitive marker for studying plasmodial
The Unique hmuY Gene Sequence as a Specific Marker of Porphyromonas gingivalis

PubMed Central

Mackiewicz, Paweł; Radwan-Oczko, Małgorzata; Kantorowicz, Małgorzata; Chomyszyn-Gajewska, Maria; Frąszczak, Magdalena; Bielecki, Marcin; Olczak, Mariusz; Olczak, Teresa

2013-01-01

Porphyromonas gingivalis, a major etiological agent of chronic periodontitis, acquires heme from host hemoproteins using the HmuY hemophore. The aim of this study was to develop a specific P. gingivalis marker based on a hmuY gene sequence. Subgingival samples were collected from 66 patients with chronic periodontitis and 40 healthy subjects and the entire hmuY gene was analyzed in positive samples. Phylogenetic analyses demonstrated that both the amino acid sequence of the HmuY protein and the nucleotide sequence of the hmuY gene are unique among P. gingivalis strains/isolates and show low identity to sequences found in other species (below 50 and 56%, respectively). In agreement with these findings, a set of hmuY gene-based primers and standard/real-time PCR with SYBR Green chemistry allowed us to specifically detect P. gingivalis in patients with chronic periodontitis (77.3%) and healthy subjects (20%), the latter possessing lower number of P. gingivalis cells and total bacterial cells. Isolates from healthy subjects possess the hmuY gene-based nucleotide sequence pattern occurring in W83/W50/A7436 (n = 4), 381/ATCC 33277 (n = 3) or TDC60 (n = 1) strains, whereas those from patients typically have TDC60 (n = 21), W83/W50/A7436 (n = 17) and 381/ATCC 33277 (n = 13) strains. We observed a significant correlation between periodontal index of risk of infectiousness (PIRI) and the presence/absence of P. gingivalis (regardless of the hmuY gene-based sequence pattern of the isolate identified [r = 0.43; P = 0.0002] and considering particular isolate pattern [r = 0.38; P = 0.0012]). In conclusion, we demonstrated that the hmuY gene sequence or its fragments may be used as one of the molecular markers of P. gingivalis. PMID:23844074
IDP-ASE: haplotyping and quantifying allele-specific expression at the gene and gene isoform level by hybrid sequencing

PubMed Central

Deonovic, Benjamin; Wang, Yunhao; Weirather, Jason; Wang, Xiu-Jie; Au, Kin Fai

2017-01-01

Abstract Allele-specific expression (ASE) is a fundamental problem in studying gene regulation and diploid transcriptome profiles, with two key challenges: (i) haplotyping and (ii) estimation of ASE at the gene isoform level. Existing ASE analysis methods are limited by a dependence on haplotyping from laborious experiments or extra genome/family trio data. In addition, there is a lack of methods for gene isoform level ASE analysis. We developed a tool, IDP-ASE, for full ASE analysis. By innovative integration of Third Generation Sequencing (TGS) long reads with Second Generation Sequencing (SGS) short reads, the accuracy of haplotyping and ASE quantification at the gene and gene isoform level was greatly improved as demonstrated by the gold standard data GM12878 data and semi-simulation data. In addition to methodology development, applications of IDP-ASE to human embryonic stem cells and breast cancer cells indicate that the imbalance of ASE and non-uniformity of gene isoform ASE is widespread, including tumorigenesis relevant genes and pluripotency markers. These results show that gene isoform expression and allele-specific expression cooperate to provide high diversity and complexity of gene regulation and expression, highlighting the importance of studying ASE at the gene isoform level. Our study provides a robust bioinformatics solution to understand ASE using RNA sequencing data only. PMID:27899656
The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color

PubMed Central

2013-01-01

Background Theobroma cacao L. cultivar Matina 1-6 belongs to the most cultivated cacao type. The availability of its genome sequence and methods for identifying genes responsible for important cacao traits will aid cacao researchers and breeders. Results We describe the sequencing and assembly of the genome of Theobroma cacao L. cultivar Matina 1-6. The genome of the Matina 1-6 cultivar is 445 Mbp, which is significantly larger than a sequenced Criollo cultivar, and more typical of other cultivars. The chromosome-scale assembly, version 1.1, contains 711 scaffolds covering 346.0 Mbp, with a contig N50 of 84.4 kbp, a scaffold N50 of 34.4 Mbp, and an evidence-based gene set of 29,408 loci. Version 1.1 has 10x the scaffold N50 and 4x the contig N50 as Criollo, and includes 111 Mb more anchored sequence. The version 1.1 assembly has 4.4% gap sequence, while Criollo has 10.9%. Through a combination of haplotype, association mapping and gene expression analyses, we leverage this robust reference genome to identify a promising candidate gene responsible for pod color variation. We demonstrate that green/red pod color in cacao is likely regulated by the R2R3 MYB transcription factor TcMYB113, homologs of which determine pigmentation in Rosaceae, Solanaceae, and Brassicaceae. One SNP within the target site for a highly conserved trans-acting siRNA in dicots, found within TcMYB113, seems to affect transcript levels of this gene and therefore pod color variation. Conclusions We report a high-quality sequence and annotation of Theobroma cacao L. and demonstrate its utility in identifying candidate genes regulating traits. PMID:23731509
Genome-wide identification and expression analysis of SBP-like transcription factor genes in Moso Bamboo (Phyllostachys edulis).

PubMed

Pan, Feng; Wang, Yue; Liu, Huanglong; Wu, Min; Chu, Wenyuan; Chen, Danmei; Xiang, Yan

2017-06-27

The SQUAMOSA promoter binding protein-like (SPL) proteins are plant-specific transcription factors (TFs) that function in a variety of developmental processes including growth, flower development, and signal transduction. SPL proteins are encoded by a gene family, and these genes have been characterized in two model grass species, Zea mays and Oryza sativa. The SPL gene family has not been well studied in moso bamboo (Phyllostachys edulis), a woody grass species. We identified 32 putative PeSPL genes in the P. edulis genome. Phylogenetic analysis arranged the PeSPL protein sequences in eight groups. Similarly, phylogenetic analysis of the SBP-like and SBP proteins from rice and maize clustered them into eight groups analogous to those from P. edulis. Furthermore, the deduced PeSPL proteins in each group contained very similar conserved sequence motifs. Our analyses indicate that the PeSPL genes experienced a large-scale duplication event ~15 million years ago (MYA), and that divergence between the PeSPL and OsSPL genes occurred 34 MYA. The stress-response expression profiles and tissue-specificity of the putative PeSPL gene promoter regions showed that SPL genes in moso bamboo have potential biological functions in stress resistance as well as in growth and development. We therefore examined PeSPL gene expression in response to different plant hormone and drought (polyethylene glycol-6000; PEG) treatments to mimic biotic and abiotic stresses. Expression of three (PeSPL10, -12, -17), six (PeSPL1, -10, -12, -17, -20, -31), and nine (PeSPL5, -8, -9, -14, -15, -19, -20, -31, -32) genes remained relatively stable after treating with salicylic acid (SA), gibberellic acid (GA), and PEG, respectively, while the expression patterns of other genes changed. In addition, analysis of tissue-specific expression of the moso bamboo SPL genes during development showed differences in their spatiotemporal expression patterns, and many were expressed at high levels in flowers and
Discrimination of the Lactobacillus acidophilus group using sequencing, species-specific PCR and SNaPshot mini-sequencing technology based on the recA gene.

PubMed

Huang, Chien-Hsun; Chang, Mu-Tzu; Huang, Mu-Chiou; Wang, Li-Tin; Huang, Lina; Lee, Fwu-Ling

2012-10-01

To clearly identify specific species and subspecies of the Lactobacillus acidophilus group using phenotypic and genotypic (16S rDNA sequence analysis) techniques alone is difficult. The aim of this study was to use the recA gene for species discrimination in the L. acidophilus group, as well as to develop a species-specific primer and single nucleotide polymorphism primer based on the recA gene sequence for species and subspecies identification. The average sequence similarity for the recA gene among type strains was 80.0%, and most members of the L. acidophilus group could be clearly distinguished. The species-specific primer was designed according to the recA gene sequencing, which was employed for polymerase chain reaction with the template DNA of Lactobacillus strains. A single 231-bp species-specific band was found only in L. delbrueckii. A SNaPshot mini-sequencing assay using recA as a target gene was also developed. The specificity of the mini-sequencing assay was evaluated using 31 strains of L. delbrueckii species and was able to unambiguously discriminate strains belonging to the subspecies L. delbrueckii subsp. bulgaricus. The phylogenetic relationships of most strains in the L. acidophilus group can be resolved using recA gene sequencing, and a novel method to identify the species and subspecies of the L. delbrueckii and L. delbrueckii subsp. bulgaricus was developed by species-specific polymerase chain reaction combined with SNaPshot mini-sequencing. Copyright © 2012 Society of Chemical Industry.
Antinutritive effects of wheat-germ agglutinin and other N-acetylglucosamine-specific lectins.

PubMed

Pusztai, A; Ewen, S W; Grant, G; Brown, D S; Stewart, J C; Peumans, W J; Van Damme, E J; Bardocz, S

1993-07-01

Incorporation of N-acetylglucosamine-specific agglutinins from wheat germ (Triticum aestivum; WGA), thorn apple (Datura stramonium) or nettle (Urtica dioica) rhizomes in the diet at the level of 7 g/kg reduced the apparent digestibility and utilization of dietary proteins and the growth of rats, with WGA being the most damaging. As a result of their binding and endocytosis by the epithelial cells of the small intestine, all three lectins were growth factors for the gut and interfered with its metabolism and function to varying degrees. WGA was particularly effective; it induced extensive polyamine-dependent hyperplastic and hypertrophic growth of the small bowel by increasing its content of proteins, RNA and DNA. Furthermore, an appreciable portion of the endocytosed WGA was transported across the gut wall into the systemic circulation, where it was deposited in the walls of the blood and lymphatic vessels. WGA also induced the hypertrophic growth of the pancreas and caused thymus atrophy. Although the transfer of the gene of WGA into crop plants has been advocated to increase their insect resistance, as the presence of this lectin in the diet may harm higher animals at the concentrations required to be effective against most pests, its use in plants as natural insecticide is not without health risks for man.
DNA sequences of three beta-1,4-endoglucanase genes from Thermomonospora fusca.

PubMed Central

Lao, G; Ghangas, G S; Jung, E D; Wilson, D B

1991-01-01

The DNA sequences of the Thermomonospora fusca genes encoding cellulases E2 and E5 and the N-terminal end of E4 were determined. Each sequence contains an identical 14-bp inverted repeat upstream of the initiation codon. There were no significant homologies between the coding regions of the three genes. The E2 gene is 73% identical to the celA gene from Microbispora bispora, but this was the only homology found with other cellulase genes. E2 belongs to a family of cellulases that includes celA from M. bispora, cenA from Cellulomonas fimi, casA from an alkalophilic Streptomyces strain, and cellobiohydrolase II from Trichoderma reesei. E4 shows 44% identity to an avocado cellulase, while E5 belongs to the Bacillus cellulase family. There were strong similarities between the amino acid sequences of the E2 and E5 cellulose binding domains, and these regions also showed homology with C. fimi and Pseudomonas fluorescens cellulose binding domains. PMID:1904434
Conserved regulatory elements of the promoter sequence of the gene rpoH of enteric bacteria

PubMed Central

Ramírez-Santos, Jesús; Collado-Vides, Julio; García-Varela, Martin; Gómez-Eichelmann, M. Carmen

2001-01-01

The rpoH regulatory region of different members of the enteric bacteria family was sequenced or downloaded from GenBank and compared. In addition, the transcriptional start sites of rpoH of Yersinia frederiksenii and Proteus mirabilis, two distant members of this family, were determined. Sequences similar to the σ70 promoters P1, P4 and P5, to the σE promoter P3 and to boxes DnaA1, DnaA2, cAMP receptor protein (CRP) boxes CRP1, CRP2 and box CytR present in Escherichia coli K12, were identified in sequences of closely related bacteria such as: E.coli, Shigella flexneri, Salmonella enterica serovar Typhimurium, Citrobacter freundii, Enterobacter cloacae and Klebsiella pneumoniae. In more distant bacteria, Y.frederiksenii and P.mirabilis, the rpoH regulatory region has a distal P1-like σ70 promoter and two proximal promoters: a heat-induced σE-like promoter and a σ70 promoter. Sequences similar to the regulatory boxes were not identified in these bacteria. This study suggests that the general pattern of transcription of the rpoH gene in enteric bacteria includes a distal σ70 promoter, >200 nt upstream of the initiation codon, and two proximal promoters: a heat-induced σE-like promoter and a σ70 promoter. A second proximal σ70 promoter under catabolite-regulation is probably present only in bacteria closely related to E.coli. PMID:11139607
Characterization and Amplification of Gene-Based Simple Sequence Repeat (SSR) Markers in Date Palm.

PubMed

Zhao, Yongli; Keremane, Manjunath; Prakash, Channapatna S; He, Guohao

2017-01-01

The paucity of molecular markers limits the application of genetic and genomic research in date palm (Phoenix dactylifera L.). Availability of expressed sequence tag (EST) sequences in date palm may provide a good resource for developing gene-based markers. This study characterizes a substantial fraction of transcriptome sequences containing simple sequence repeats (SSRs) from the EST sequences in date palm. The EST sequences studied are mainly homologous to those of Elaeis guineensis and Musa acuminata. A total of 911 gene-based SSR markers, characterized with functional annotations, have provided a useful basis not only for discovering candidate genes and understanding genetic basis of traits of interest but also for developing genetic and genomic tools for molecular research in date palm, such as diversity study, quantitative trait locus (QTL) mapping, and molecular breeding. The procedures of DNA extraction, polymerase chain reaction (PCR) amplification of these gene-based SSR markers, and gel electrophoresis of PCR products are described in this chapter.
Automated analysis of high-throughput B-cell sequencing data reveals a high frequency of novel immunoglobulin V gene segment alleles.

PubMed

Gadala-Maria, Daniel; Yaari, Gur; Uduman, Mohamed; Kleinstein, Steven H

2015-02-24

Individual variation in germline and expressed B-cell immunoglobulin (Ig) repertoires has been associated with aging, disease susceptibility, and differential response to infection and vaccination. Repertoire properties can now be studied at large-scale through next-generation sequencing of rearranged Ig genes. Accurate analysis of these repertoire-sequencing (Rep-Seq) data requires identifying the germline variable (V), diversity (D), and joining (J) gene segments used by each Ig sequence. Current V(D)J assignment methods work by aligning sequences to a database of known germline V(D)J segment alleles. However, existing databases are likely to be incomplete and novel polymorphisms are hard to differentiate from the frequent occurrence of somatic hypermutations in Ig sequences. Here we develop a Tool for Ig Genotype Elucidation via Rep-Seq (TIgGER). TIgGER analyzes mutation patterns in Rep-Seq data to identify novel V segment alleles, and also constructs a personalized germline database containing the specific set of alleles carried by a subject. This information is then used to improve the initial V segment assignments from existing tools, like IMGT/HighV-QUEST. The application of TIgGER to Rep-Seq data from seven subjects identified 11 novel V segment alleles, including at least one in every subject examined. These novel alleles constituted 13% of the total number of unique alleles in these subjects, and impacted 3% of V(D)J segment assignments. These results reinforce the highly polymorphic nature of human Ig V genes, and suggest that many novel alleles remain to be discovered. The integration of TIgGER into Rep-Seq processing pipelines will increase the accuracy of V segment assignments, thus improving B-cell repertoire analyses.

RNA sequencing analysis of human podocytes reveals glucocorticoid regulated gene networks targeting non-immune pathways

PubMed Central

Jiang, Lulu; Hindmarch, Charles C. T.; Rogers, Mark; Campbell, Colin; Waterfall, Christy; Coghill, Jane; Mathieson, Peter W.; Welsh, Gavin I.

2016-01-01

Glucocorticoids are steroids that reduce inflammation and are used as immunosuppressive drugs for many diseases. They are also the mainstay for the treatment of minimal change nephropathy (MCN), which is characterised by an absence of inflammation. Their mechanisms of action remain elusive. Evidence suggests that immunomodulatory drugs can directly act on glomerular epithelial cells or ‘podocytes’, the cell type which is the main target of injury in MCN. To understand the nature of glucocorticoid effects on non-immune cell functions, we generated RNA sequencing data from human podocyte cell lines and identified the genes that are significantly regulated in dexamethasone-treated podocytes compared to vehicle-treated cells. The upregulated genes are of functional relevance to cytoskeleton-related processes, whereas the downregulated genes mostly encode pro-inflammatory cytokines and growth factors. We observed a tendency for dexamethasone-upregulated genes to be downregulated in MCN patients. Integrative analysis revealed gene networks composed of critical signaling pathways that are likely targeted by dexamethasone in podocytes. PMID:27774996
Genome-wide characterization of the Pectate Lyase-like (PLL) genes in Brassica rapa.

PubMed

Jiang, Jingjing; Yao, Lina; Miao, Ying; Cao, Jiashu

2013-11-01

Pectate lyases (PL) depolymerize demethylated pectin (pectate, EC 4.2.2.2) by catalyzing the eliminative cleavage of α-1,4-glycosidic linked galacturonan. Pectate Lyase-like (PLL) genes are one of the largest and most complex families in plants. However, studies on the phylogeny, gene structure, and expression of PLL genes are limited. To understand the potential functions of PLL genes in plants, we characterized their intron-exon structure, phylogenetic relationships, and protein structures, and measured their expression patterns in various tissues, specifically the reproductive tissues in Brassica rapa. Sequence alignments revealed two characteristic motifs in PLL genes. The chromosome location analysis indicated that 18 of the 46 PLL genes were located in the least fractionated sub-genome (LF) of B. rapa, while 16 were located in the medium fractionated sub-genome (MF1) and 12 in the more fractionated sub-genome (MF2). Quantitative RT-PCR analysis showed that BrPLL genes were expressed in various tissues, with most of them being expressed in flowers. Detailed qRT-PCR analysis identified 11 pollen specific PLL genes and several other genes with unique spatial expression patterns. In addition, some duplicated genes showed similar expression patterns. The phylogenetic analysis identified three PLL gene subfamilies in plants, among which subfamily II might have evolved from gene neofunctionalization or subfunctionalization. Therefore, this study opens the possibility for exploring the roles of PLL genes during plant development.
Sequence of the chloroplast 16S rRNA gene and its surrounding regions of Chlamydomonas reinhardii.

PubMed Central

Dron, M; Rahire, M; Rochaix, J D

1982-01-01

The sequence of a 2 kb DNA fragment containing the chloroplast 16S ribosomal RNA gene from Chlamydomonas reinhardii and its flanking regions has been determined. The algal 16S rRNA sequence (1475 nucleotides) and secondary structure are highly related to those found in bacteria and in the chloroplasts of higher plants. In contrast, the flanking regions are very different. In C. reinhardii the 16S rRNA gene is surrounded by AT rich segments of about 180 bases, which are followed by a long stretch of complementary bases separated from each other by 1833 nucleotides. It is likely that these structures play an important role in the folding and processing of the precursor of 16S rRNA. The primary and secondary structures of the binding sites of two ribosomal proteins in the 16SrRNAs of E. coli and C. reinhardii are considerably related. Images PMID:6296784
Sequence heterogeneity in the two 16S rRNA genes of Phormium yellow leaf phytoplasma.

PubMed Central

Liefting, L W; Andersen, M T; Beever, R E; Gardner, R C; Forster, R L

1996-01-01

Phormium yellow leaf (PYL) phytoplasma causes a lethal disease of the monocotyledon, New Zealand flax (Phormium tenax). The 16S rRNA genes of PYL phytoplasma were amplified from infected flax by PCR and cloned, and the nucleotide sequences were determined. DNA sequencing and Southern hybridization analysis of genomic DNA indicated the presence of two copies of the 16S rRNA gene. The two 16S rRNA genes exhibited sequence heterogeneity in 4 nucleotide positions and could be distinguished by the restriction enzymes BpmI and BsrI. This is the first record in which sequence heterogeneity in the 16S rRNA genes of a phytoplasma has been determined by sequence analysis. A phylogenetic tree based on 16S rRNA gene sequences showed that PYL phytoplasma is most closely related to the stolbur and German grapevine yellows phytoplasmas, which form the stolbur subgroup of the aster yellows group. This phylogenetic position of PYL phytoplasma was supported by 16S/23S spacer region sequence data. PMID:8795200
Peculiar Evolutionary History of miR390-Guided TAS3-Like Genes in Land Plants

PubMed Central

Krasnikova, Maria S.; Goryunov, Denis V.; Troitsky, Alexey V.; Solovyev, Andrey G.; Ozerova, Lydmila V.; Morozov, Sergey Y.

2013-01-01

PCR-based approach was used as a phylogenetic profiling tool to probe genomic DNA samples from representatives of evolutionary distant moss taxa, namely, classes Bryopsida, Tetraphidopsida, Polytrichopsida, Andreaeopsida, and Sphagnopsida. We found relatives of all Physcomitrella patens miR390 and TAS3-like loci in these plant taxa excluding Sphagnopsida. Importantly, cloning and sequencing of Marchantia polymorpha genomic DNA showed miR390 and TAS3-like sequences which were also found among genomic reads of M. polymorpha at NCBI database. Our data suggest that the ancient plant miR390-dependent TAS molecular machinery firstly evolved to target AP2-like mRNAs in Marchantiophyta and only then both ARF- and AP2-specific mRNAs in mosses. The presented analysis shows that moss TAS3 families may undergone losses of tasiAP2 sites during evolution toward ferns and seed plants. These data confirm that miR390-guided genes coding for ARF- and AP2-specific ta-siRNAs have been gradually changed during land plant evolution. PMID:24302881
SGP-1: Prediction and Validation of Homologous Genes Based on Sequence Alignments

PubMed Central

Wiehe, Thomas; Gebauer-Jung, Steffi; Mitchell-Olds, Thomas; Guigó, Roderic

2001-01-01

Conventional methods of gene prediction rely on the recognition of DNA-sequence signals, the coding potential or the comparison of a genomic sequence with a cDNA, EST, or protein database. Reasons for limited accuracy in many circumstances are species-specific training and the incompleteness of reference databases. Lately, comparative genome analysis has attracted increasing attention. Several analysis tools that are based on human/mouse comparisons are already available. Here, we present a program for the prediction of protein-coding genes, termed SGP-1 (Syntenic Gene Prediction), which is based on the similarity of homologous genomic sequences. In contrast to most existing tools, the accuracy of SGP-1 depends little on species-specific properties such as codon usage or the nucleotide distribution. SGP-1 may therefore be applied to nonstandard model organisms in vertebrates as well as in plants, without the need for extensive parameter training. In addition to predicting genes in large-scale genomic sequences, the program may be useful to validate gene structure annotations from databases. To this end, SGP-1 output also contains comparisons between predicted and annotated gene structures in HTML format. The program can be accessed via a Web server at http://soft.ice.mpg.de/sgp-1. The source code, written in ANSI C, is available on request from the authors. PMID:11544202
The genome of Paenibacillus sabinae T27 provides insight into evolution, organization and functional elucidation of nif and nif-like genes.

PubMed

Li, Xinxin; Deng, Zhiping; Liu, Zhanzhi; Yan, Yongliang; Wang, Tianshu; Xie, Jianbo; Lin, Min; Cheng, Qi; Chen, Sanfeng

2014-08-27

Most biological nitrogen fixation is catalyzed by the molybdenum nitrogenase. This enzyme is a complex which contains the MoFe protein encoded by nifDK and the Fe protein encoded by nifH. In addition to nifHDK, nifHDK-like genes were found in some Archaea and Firmicutes, but their function is unclear. We sequenced the genome of Paenibacillus sabinae T27. A total of 4,793 open reading frames were predicted from its 5.27 Mb genome. The genome of P. sabinae T27 contains fifteen nitrogen fixation (nif) genes, including three nifH, one nifD, one nifK, four nifB, two nifE, two nifN, one nifX and one nifV. Of the 15 nif genes, eight nif genes (nifB, nifH, nifD, nifK, nifE, nifN, nifX and nifV) and two non-nif genes (orf1 and hesA) form a complete nif gene cluster. In addition to the nif genes, there are nitrogenase-like genes, including two nifH-like genes and five pairs of nifDK-like genes. IS elements on the flanking regions of nif and nif-like genes imply that these genes might have been obtained by horizontal gene transfer. Phylogenies of the concatenated 8 nif gene (nifB, nifH, nifD, nifK, nifE, nifN, nifX and nifV) products suggest that P. sabinae T27 is closely related to Frankia. RT-PCR analysis showed that the complete nif gene cluster is organized as an operon. We demonstrated that the complete nif gene cluster under the control of σ70-dependent promoter enabled Escherichia coli JM109 to fix nitrogen. Also, here for the first time we demonstrated that unlike nif genes, the transcriptions of nifHDK-like genes were not regulated by ammonium and oxygen, and nifH-like or nifD-like gene could not restore the nitrogenase activity of Klebsiella pneumonia nifH- and nifD- mutant strains, respectively, suggesting that nifHDK-like genes were not involved in nitrogen fixation. Our data and analysis reveal the contents and distribution of nif and nif-like genes and contribute to the study of evolutionary history of nitrogen fixation in Paenibacillus. For the first time we
Transcriptome Profiling of Bovine Milk Oligosaccharide Metabolism Genes Using RNA-Sequencing

PubMed Central

Wickramasinghe, Saumya; Hua, Serenus; Rincon, Gonzalo; Islas-Trejo, Alma; German, J. Bruce; Lebrilla, Carlito B.; Medrano, Juan F.

2011-01-01

This study examines the genes coding for enzymes involved in bovine milk oligosaccharide metabolism by comparing the oligosaccharide profiles with the expressions of glycosylation-related genes. Fresh milk samples (n = 32) were collected from four Holstein and Jersey cows at days 1, 15, 90 and 250 of lactation and free milk oligosaccharide profiles were analyzed. RNA was extracted from milk somatic cells at days 15 and 250 of lactation (n = 12) and gene expression analysis was conducted by RNA-Sequencing. A list was created of 121 glycosylation-related genes involved in oligosaccharide metabolism pathways in bovine by analyzing the oligosaccharide profiles and performing an extensive literature search. No significant differences were observed in either oligosaccharide profiles or expressions of glycosylation-related genes between Holstein and Jersey cows. The highest concentrations of free oligosaccharides were observed in the colostrum samples and a sharp decrease was observed in the concentration of free oligosaccharides on day 15, followed by progressive decrease on days 90 and 250. Ninety-two glycosylation-related genes were expressed in milk somatic cells. Most of these genes exhibited higher expression in day 250 samples indicating increases in net glycosylation-related metabolism in spite of decreases in free milk oligosaccharides in late lactation milk. Even though fucosylated free oligosaccharides were not identified, gene expression indicated the likely presence of fucosylated oligosaccharides in bovine milk. Fucosidase genes were expressed in milk and a possible explanation for not detecting fucosylated free oligosaccharides is the degradation of large fucosylated free oligosaccharides by the fucosidases. Detailed characterization of enzymes encoded by the 92 glycosylation-related genes identified in this study will provide the basic knowledge for metabolic network analysis of oligosaccharides in mammalian milk. These candidate genes will guide
Intervening sequences in a plant gene-comparison of the partial sequence of cDNA and genomic DNA of French bean phaseolin

NASA Astrophysics Data System (ADS)

Sun, S. M.; Slightom, J. L.; Hall, T. C.

1981-01-01

A plant gene coding for the major storage protein (phaseolin, G1-globulin) of the French bean was isolated from a genomic library constructed in the phage vector Charon 24A. Comparison of the nucleotide sequence of part of the gene with that of the cloned messenger RNA (cDNA) revealed the presence of three intervening sequences, all beginning with GTand ending with AG. The 5' and 3' boundaries of intervening sequences TVS-A (88 base pairs) and IVS-B (124 base pairs) are similar to those described for animal and viral genes, but the 3' boundary of IVS-C (129 base pairs) shows some differences. A sequence of 185 amino acids deduced from the cloned DMAs represents about 40% of a phaseolin polypeptide.
Application of Stochastic Labeling with Random-Sequence Barcodes for Simultaneous Quantification and Sequencing of Environmental 16S rRNA Genes.

PubMed

Hoshino, Tatsuhiko; Inagaki, Fumio

2017-01-01

Next-generation sequencing (NGS) is a powerful tool for analyzing environmental DNA and provides the comprehensive molecular view of microbial communities. For obtaining the copy number of particular sequences in the NGS library, however, additional quantitative analysis as quantitative PCR (qPCR) or digital PCR (dPCR) is required. Furthermore, number of sequences in a sequence library does not always reflect the original copy number of a target gene because of biases caused by PCR amplification, making it difficult to convert the proportion of particular sequences in the NGS library to the copy number using the mass of input DNA. To address this issue, we applied stochastic labeling approach with random-tag sequences and developed a NGS-based quantification protocol, which enables simultaneous sequencing and quantification of the targeted DNA. This quantitative sequencing (qSeq) is initiated from single-primer extension (SPE) using a primer with random tag adjacent to the 5' end of target-specific sequence. During SPE, each DNA molecule is stochastically labeled with the random tag. Subsequently, first-round PCR is conducted, specifically targeting the SPE product, followed by second-round PCR to index for NGS. The number of random tags is only determined during the SPE step and is therefore not affected by the two rounds of PCR that may introduce amplification biases. In the case of 16S rRNA genes, after NGS sequencing and taxonomic classification, the absolute number of target phylotypes 16S rRNA gene can be estimated by Poisson statistics by counting random tags incorporated at the end of sequence. To test the feasibility of this approach, the 16S rRNA gene of Sulfolobus tokodaii was subjected to qSeq, which resulted in accurate quantification of 5.0 × 103 to 5.0 × 104 copies of the 16S rRNA gene. Furthermore, qSeq was applied to mock microbial communities and environmental samples, and the results were comparable to those obtained using digital PCR and
The first myriapod genome sequence reveals conservative arthropod gene content and genome organisation in the centipede Strigamia maritima.

PubMed

Chipman, Ariel D; Ferrier, David E K; Brena, Carlo; Qu, Jiaxin; Hughes, Daniel S T; Schröder, Reinhard; Torres-Oliva, Montserrat; Znassi, Nadia; Jiang, Huaiyang; Almeida, Francisca C; Alonso, Claudio R; Apostolou, Zivkos; Aqrawi, Peshtewani; Arthur, Wallace; Barna, Jennifer C J; Blankenburg, Kerstin P; Brites, Daniela; Capella-Gutiérrez, Salvador; Coyle, Marcus; Dearden, Peter K; Du Pasquier, Louis; Duncan, Elizabeth J; Ebert, Dieter; Eibner, Cornelius; Erikson, Galina; Evans, Peter D; Extavour, Cassandra G; Francisco, Liezl; Gabaldón, Toni; Gillis, William J; Goodwin-Horn, Elizabeth A; Green, Jack E; Griffiths-Jones, Sam; Grimmelikhuijzen, Cornelis J P; Gubbala, Sai; Guigó, Roderic; Han, Yi; Hauser, Frank; Havlak, Paul; Hayden, Luke; Helbing, Sophie; Holder, Michael; Hui, Jerome H L; Hunn, Julia P; Hunnekuhl, Vera S; Jackson, LaRonda; Javaid, Mehwish; Jhangiani, Shalini N; Jiggins, Francis M; Jones, Tamsin E; Kaiser, Tobias S; Kalra, Divya; Kenny, Nathan J; Korchina, Viktoriya; Kovar, Christie L; Kraus, F Bernhard; Lapraz, François; Lee, Sandra L; Lv, Jie; Mandapat, Christigale; Manning, Gerard; Mariotti, Marco; Mata, Robert; Mathew, Tittu; Neumann, Tobias; Newsham, Irene; Ngo, Dinh N; Ninova, Maria; Okwuonu, Geoffrey; Ongeri, Fiona; Palmer, William J; Patil, Shobha; Patraquim, Pedro; Pham, Christopher; Pu, Ling-Ling; Putman, Nicholas H; Rabouille, Catherine; Ramos, Olivia Mendivil; Rhodes, Adelaide C; Robertson, Helen E; Robertson, Hugh M; Ronshaugen, Matthew; Rozas, Julio; Saada, Nehad; Sánchez-Gracia, Alejandro; Scherer, Steven E; Schurko, Andrew M; Siggens, Kenneth W; Simmons, DeNard; Stief, Anna; Stolle, Eckart; Telford, Maximilian J; Tessmar-Raible, Kristin; Thornton, Rebecca; van der Zee, Maurijn; von Haeseler, Arndt; Williams, James M; Willis, Judith H; Wu, Yuanqing; Zou, Xiaoyan; Lawson, Daniel; Muzny, Donna M; Worley, Kim C; Gibbs, Richard A; Akam, Michael; Richards, Stephen

2014-11-01

Myriapods (e.g., centipedes and millipedes) display a simple homonomous body plan relative to other arthropods. All members of the class are terrestrial, but they attained terrestriality independently of insects. Myriapoda is the only arthropod class not represented by a sequenced genome. We present an analysis of the genome of the centipede Strigamia maritima. It retains a compact genome that has undergone less gene loss and shuffling than previously sequenced arthropods, and many orthologues of genes conserved from the bilaterian ancestor that have been lost in insects. Our analysis locates many genes in conserved macro-synteny contexts, and many small-scale examples of gene clustering. We describe several examples where S. maritima shows different solutions from insects to similar problems. The insect olfactory receptor gene family is absent from S. maritima, and olfaction in air is likely effected by expansion of other receptor gene families. For some genes S. maritima has evolved paralogues to generate coding sequence diversity, where insects use alternate splicing. This is most striking for the Dscam gene, which in Drosophila generates more than 100,000 alternate splice forms, but in S. maritima is encoded by over 100 paralogues. We see an intriguing linkage between the absence of any known photosensory proteins in a blind organism and the additional absence of canonical circadian clock genes. The phylogenetic position of myriapods allows us to identify where in arthropod phylogeny several particular molecular mechanisms and traits emerged. For example, we conclude that juvenile hormone signalling evolved with the emergence of the exoskeleton in the arthropods and that RR-1 containing cuticle proteins evolved in the lineage leading to Mandibulata. We also identify when various gene expansions and losses occurred. The genome of S. maritima offers us a unique glimpse into the ancestral arthropod genome, while also displaying many adaptations to its specific
The First Myriapod Genome Sequence Reveals Conservative Arthropod Gene Content and Genome Organisation in the Centipede Strigamia maritima

PubMed Central

Chipman, Ariel D.; Ferrier, David E. K.; Brena, Carlo; Qu, Jiaxin; Hughes, Daniel S. T.; Schröder, Reinhard; Torres-Oliva, Montserrat; Znassi, Nadia; Jiang, Huaiyang; Almeida, Francisca C.; Alonso, Claudio R.; Apostolou, Zivkos; Aqrawi, Peshtewani; Arthur, Wallace; Barna, Jennifer C. J.; Blankenburg, Kerstin P.; Brites, Daniela; Capella-Gutiérrez, Salvador; Coyle, Marcus; Dearden, Peter K.; Du Pasquier, Louis; Duncan, Elizabeth J.; Ebert, Dieter; Eibner, Cornelius; Erikson, Galina; Evans, Peter D.; Extavour, Cassandra G.; Francisco, Liezl; Gabaldón, Toni; Gillis, William J.; Goodwin-Horn, Elizabeth A.; Green, Jack E.; Griffiths-Jones, Sam; Grimmelikhuijzen, Cornelis J. P.; Gubbala, Sai; Guigó, Roderic; Han, Yi; Hauser, Frank; Havlak, Paul; Hayden, Luke; Helbing, Sophie; Holder, Michael; Hui, Jerome H. L.; Hunn, Julia P.; Hunnekuhl, Vera S.; Jackson, LaRonda; Javaid, Mehwish; Jhangiani, Shalini N.; Jiggins, Francis M.; Jones, Tamsin E.; Kaiser, Tobias S.; Kalra, Divya; Kenny, Nathan J.; Korchina, Viktoriya; Kovar, Christie L.; Kraus, F. Bernhard; Lapraz, François; Lee, Sandra L.; Lv, Jie; Mandapat, Christigale; Manning, Gerard; Mariotti, Marco; Mata, Robert; Mathew, Tittu; Neumann, Tobias; Newsham, Irene; Ngo, Dinh N.; Ninova, Maria; Okwuonu, Geoffrey; Ongeri, Fiona; Palmer, William J.; Patil, Shobha; Patraquim, Pedro; Pham, Christopher; Pu, Ling-Ling; Putman, Nicholas H.; Rabouille, Catherine; Ramos, Olivia Mendivil; Rhodes, Adelaide C.; Robertson, Helen E.; Robertson, Hugh M.; Ronshaugen, Matthew; Rozas, Julio; Saada, Nehad; Sánchez-Gracia, Alejandro; Scherer, Steven E.; Schurko, Andrew M.; Siggens, Kenneth W.; Simmons, DeNard; Stief, Anna; Stolle, Eckart; Telford, Maximilian J.; Tessmar-Raible, Kristin; Thornton, Rebecca; van der Zee, Maurijn; von Haeseler, Arndt; Williams, James M.; Willis, Judith H.; Wu, Yuanqing; Zou, Xiaoyan; Lawson, Daniel; Muzny, Donna M.; Worley, Kim C.; Gibbs, Richard A.; Akam, Michael; Richards, Stephen

2014-01-01

Myriapods (e.g., centipedes and millipedes) display a simple homonomous body plan relative to other arthropods. All members of the class are terrestrial, but they attained terrestriality independently of insects. Myriapoda is the only arthropod class not represented by a sequenced genome. We present an analysis of the genome of the centipede Strigamia maritima. It retains a compact genome that has undergone less gene loss and shuffling than previously sequenced arthropods, and many orthologues of genes conserved from the bilaterian ancestor that have been lost in insects. Our analysis locates many genes in conserved macro-synteny contexts, and many small-scale examples of gene clustering. We describe several examples where S. maritima shows different solutions from insects to similar problems. The insect olfactory receptor gene family is absent from S. maritima, and olfaction in air is likely effected by expansion of other receptor gene families. For some genes S. maritima has evolved paralogues to generate coding sequence diversity, where insects use alternate splicing. This is most striking for the Dscam gene, which in Drosophila generates more than 100,000 alternate splice forms, but in S. maritima is encoded by over 100 paralogues. We see an intriguing linkage between the absence of any known photosensory proteins in a blind organism and the additional absence of canonical circadian clock genes. The phylogenetic position of myriapods allows us to identify where in arthropod phylogeny several particular molecular mechanisms and traits emerged. For example, we conclude that juvenile hormone signalling evolved with the emergence of the exoskeleton in the arthropods and that RR-1 containing cuticle proteins evolved in the lineage leading to Mandibulata. We also identify when various gene expansions and losses occurred. The genome of S. maritima offers us a unique glimpse into the ancestral arthropod genome, while also displaying many adaptations to its specific
Discrimination of germline V genes at different sequencing lengths and mutational burdens: A new tool for identifying and evaluating the reliability of V gene assignment.

PubMed

Zhang, Bochao; Meng, Wenzhao; Prak, Eline T Luning; Hershberg, Uri

2015-12-01

Immune repertoires are collections of lymphocytes that express diverse antigen receptor gene rearrangements consisting of Variable (V), (Diversity (D) in the case of heavy chains) and Joining (J) gene segments. Clonally related cells typically share the same germline gene segments and have highly similar junctional sequences within their third complementarity determining regions. Identifying clonal relatedness of sequences is a key step in the analysis of immune repertoires. The V gene is the most important for clone identification because it has the longest sequence and the greatest number of sequence variants. However, accurate identification of a clone's germline V gene source is challenging because there is a high degree of similarity between different germline V genes. This difficulty is compounded in antibodies, which can undergo somatic hypermutation. Furthermore, high-throughput sequencing experiments often generate partial sequences and have significant error rates. To address these issues, we describe a novel method to estimate which germline V genes (or alleles) cannot be discriminated under different conditions (read lengths, sequencing errors or somatic hypermutation frequencies). Starting with any set of germline V genes, this method measures their similarity using different sequencing lengths and calculates their likelihood of unambiguous assignment under different levels of mutation. Hence, one can identify, under different experimental and biological conditions, the germline V genes (or alleles) that cannot be uniquely identified and bundle them together into groups of specific V genes with highly similar sequences. Copyright © 2015 Elsevier B.V. All rights reserved.
Unique Trichomonas vaginalis gene sequences identified in multinational regions of Northwest China.

PubMed

Liu, Jun; Feng, Meng; Wang, Xiaolan; Fu, Yongfeng; Ma, Cailing; Cheng, Xunjia

2017-07-24

Trichomonas vaginalis (T. vaginalis) is a flagellated protozoan parasite that infects humans worldwide. This study determined the sequence of the 18S ribosomal RNA gene of T. vaginalis infecting both females and males in Xinjiang, China. Samples from 73 females and 28 males were collected and confirmed for infection with T. vaginalis, a total of 110 sequences were identified when the T. vaginalis 18S ribosomal RNA gene was sequenced. These sequences were used to prepare a phylogenetic network. The rooted network comprised three large clades and several independent branches. Most of the Xinjiang sequences were in one group. Preliminary results suggest that Xinjiang T. vaginalis isolates might be genetically unique, as indicated by the sequence of their 18S ribosomal RNA gene. Low migration rate of local people in this province may contribute to a genetic conservativeness of T. vaginalis. The unique genetic feature of our isolates may suggest a different clinical presentation of trichomoniasis, including metronidazole susceptibility, T. vaginalis virus or Mycoplasma co-infection characteristics. The transmission and evolution of Xinjiang T. vaginalis is of interest and should be studied further. More attention should be given to T. vaginalis infection in both females and males in Xinjiang.
Molecular cloning of actin genes in Trichomonas vaginalis and phylogeny inferred from actin sequences.

PubMed

Bricheux, G; Brugerolle, G

1997-08-01

The parasitic protozoan Trichomonas vaginalis is known to contain the ubiquitous and highly conserved protein actin. A genomic library and a cDNA library have been screened to identify and clone the actin gene(s) of T. vaginalis. The nucleotide sequence of one gene and its flanking regions have been determined. The open reading frame encodes a protein of 376 amino acids. The sequence is not interrupted by any introns and the promoter could be represented by a 10 bp motif close to a consensus motif also found upstream of most sequenced T. vaginalis genes. The five different clones isolated from the cDNA library have similar sequences and encode three actin proteins differing only by one or two amino acids. A phylogenetic analysis of 31 actin sequences by distance matrix and parsimony methods, using centractin as outgroup, gives congruent trees with Parabasala branching above Diplomonadida.
Targeted next generation sequencing identifies functionally deleterious germline mutations in novel genes in early-onset/familial prostate cancer.

PubMed

Paulo, Paula; Maia, Sofia; Pinto, Carla; Pinto, Pedro; Monteiro, Augusta; Peixoto, Ana; Teixeira, Manuel R

2018-04-01

Considering that mutations in known prostate cancer (PrCa) predisposition genes, including those responsible for hereditary breast/ovarian cancer and Lynch syndromes, explain less than 5% of early-onset/familial PrCa, we have sequenced 94 genes associated with cancer predisposition using next generation sequencing (NGS) in a series of 121 PrCa patients. We found monoallelic truncating/functionally deleterious mutations in seven genes, including ATM and CHEK2, which have previously been associated with PrCa predisposition, and five new candidate PrCa associated genes involved in cancer predisposing recessive disorders, namely RAD51C, FANCD2, FANCI, CEP57 and RECQL4. Furthermore, using in silico pathogenicity prediction of missense variants among 18 genes associated with breast/ovarian cancer and/or Lynch syndrome, followed by KASP genotyping in 710 healthy controls, we identified "likely pathogenic" missense variants in ATM, BRIP1, CHEK2 and TP53. In conclusion, this study has identified putative PrCa predisposing germline mutations in 14.9% of early-onset/familial PrCa patients. Further data will be necessary to confirm the genetic heterogeneity of inherited PrCa predisposition hinted in this study.
Characterization of onion lectin (Allium cepa agglutinin) as an immunomodulatory protein inducing Th1-type immune response in vitro.

PubMed

Prasanna, Vaddi K; Venkatesh, Yeldur P

2015-06-01

Onion (Allium cepa), a bulb crop of economic importance, is known to have many health benefits. The major objective of the present study is to address the immunomodulatory properties of onion lectin (A. cepa agglutinin; ACA). ACA was purified from onion extract by D-mannose-agarose chromatography (yield: ~1 mg/kg). ACA is non-glycosylated and showed a molecular mass of ~12 kDa under reducing/non-reducing SDS-PAGE; glutaraldehyde cross-linking indicated that ACA is a non-covalent tetramer of ~12 kDa subunits. Its N-terminal sequence (RNVLLNNEGL; UniProt KB Accn. C0HJM8) showed 70-90% homology to mannose-specific Allium agglutinins. ACA showed specific hemagglutination activity of 8200 units/mg and is stable in the pH range 6-10 and up to 45° C. The immunomodulatory activity of ACA was assessed using the macrophage cell line, RAW264.7 and rat peritoneal macrophages; at 0.1 μg/well, it showed a significant increase (6-8-fold vs. control) in the production of nitric oxide at 24h, and significantly stimulated (2-4-fold vs. control) the production of pro-inflammatory cytokines (TNF-α and IL-12) at 24h. ACA (0.1 μg/well) enhanced the proliferation of murine thymocytes by ~4 fold (vs. control) at 24h; however, ACA does not proliferate B cell-enriched rat splenocytes. Further, it significantly elevated the expression levels of cytokines (IFN-γ and IL-2) over the control in murine thymocytes. Taken together, purified ACA induces a Th1-type immune response in vitro. Though present in low amounts, ACA may contribute to the immune-boosting potential of the popular spice onion since considerable amounts are consumed on a daily basis universally. Copyright © 2015 Elsevier B.V. All rights reserved.
Statistical learning of music- and language-like sequences and tolerance for spectral shifts.

PubMed

Daikoku, Tatsuya; Yatomi, Yutaka; Yumoto, Masato

2015-02-01

In our previous study (Daikoku, Yatomi, & Yumoto, 2014), we demonstrated that the N1m response could be a marker for the statistical learning process of pitch sequence, in which each tone was ordered by a Markov stochastic model. The aim of the present study was to investigate how the statistical learning of music- and language-like auditory sequences is reflected in the N1m responses based on the assumption that both language and music share domain generality. By using vowel sounds generated by a formant synthesizer, we devised music- and language-like auditory sequences in which higher-ordered transitional rules were embedded according to a Markov stochastic model by controlling fundamental (F0) and/or formant frequencies (F1-F2). In each sequence, F0 and/or F1-F2 were spectrally shifted in the last one-third of the tone sequence. Neuromagnetic responses to the tone sequences were recorded from 14 right-handed normal volunteers. In the music- and language-like sequences with pitch change, the N1m responses to the tones that appeared with higher transitional probability were significantly decreased compared with the responses to the tones that appeared with lower transitional probability within the first two-thirds of each sequence. Moreover, the amplitude difference was even retained within the last one-third of the sequence after the spectral shifts. However, in the language-like sequence without pitch change, no significant difference could be detected. The pitch change may facilitate the statistical learning in language and music. Statistically acquired knowledge may be appropriated to process altered auditory sequences with spectral shifts. The relative processing of spectral sequences may be a domain-general auditory mechanism that is innate to humans. Copyright © 2014 Elsevier Inc. All rights reserved.
Expansion of the receptor-like kinase/Pelle gene family and receptor-like proteins in Arabidopsis.

PubMed

Shiu, Shin Han; Bleecker, Anthony B

2003-06-01

Receptor-like kinases (RLKs) are a family of transmembrane proteins with versatile N-terminal extracellular domains and C-terminal intracellular kinases. They control a wide range of physiological responses in plants and belong to one of the largest gene families in the Arabidopsis genome with more than 600 members. Interestingly, this gene family constitutes 60% of all kinases in Arabidopsis and accounts for nearly all transmembrane kinases in Arabidopsis. Analysis of four fungal, six metazoan, and two Plasmodium sp. genomes indicates that the family was represented in all but fungal genomes, indicating an ancient origin for the family with a more recent expansion only in the plant lineages. The RLK/Pelle family can be divided into several subfamilies based on three independent criteria: the phylogeny based on kinase domain sequences, the extracellular domain identities, and intron locations and phases. A large number of receptor-like proteins (RLPs) resembling the extracellular domains of RLKs are also found in the Arabidopsis genome. However, not all RLK subfamilies have corresponding RLPs. Several RLK/Pelle subfamilies have undergone differential expansions. More than 33% of the RLK/Pelle members are found in tandem clusters, substantially higher than the genome average. In addition, 470 of the RLK/Pelle family members are located within the segmentally duplicated regions in the Arabidopsis genome and 268 of them have a close relative in the corresponding regions. Therefore, tandem duplications and segmental/whole-genome duplications represent two of the major mechanisms for the expansion of the RLK/Pelle family in Arabidopsis.
Gene expression distribution deconvolution in single-cell RNA sequencing.

PubMed

Wang, Jingshu; Huang, Mo; Torre, Eduardo; Dueck, Hannah; Shaffer, Sydney; Murray, John; Raj, Arjun; Li, Mingyao; Zhang, Nancy R

2018-06-26

Single-cell RNA sequencing (scRNA-seq) enables the quantification of each gene's expression distribution across cells, thus allowing the assessment of the dispersion, nonzero fraction, and other aspects of its distribution beyond the mean. These statistical characterizations of the gene expression distribution are critical for understanding expression variation and for selecting marker genes for population heterogeneity. However, scRNA-seq data are noisy, with each cell typically sequenced at low coverage, thus making it difficult to infer properties of the gene expression distribution from raw counts. Based on a reexamination of nine public datasets, we propose a simple technical noise model for scRNA-seq data with unique molecular identifiers (UMI). We develop deconvolution of single-cell expression distribution (DESCEND), a method that deconvolves the true cross-cell gene expression distribution from observed scRNA-seq counts, leading to improved estimates of properties of the distribution such as dispersion and nonzero fraction. DESCEND can adjust for cell-level covariates such as cell size, cell cycle, and batch effects. DESCEND's noise model and estimation accuracy are further evaluated through comparisons to RNA FISH data, through data splitting and simulations and through its effectiveness in removing known batch effects. We demonstrate how DESCEND can clarify and improve downstream analyses such as finding differentially expressed genes, identifying cell types, and selecting differentiation markers. Copyright © 2018 the Author(s). Published by PNAS.

RT-PCR detection of Candida albicans ALS gene expression in the reconstituted human epithelium (RHE) model of oral candidiasis and in model biofilms.

PubMed

Green, Clayton B; Cheng, Georgina; Chandra, Jyotsna; Mukherjee, Pranab; Ghannoum, Mahmoud A; Hoyer, Lois L

2004-02-01

An RT-PCR assay was developed to analyse expression patterns of genes in the Candida albicans ALS (agglutinin-like sequence) family. Inoculation of a reconstituted human buccal epithelium (RHE) model of mucocutaneous candidiasis with strain SC5314 showed destruction of the epithelial layer by C. albicans and also formation of an upper fungal layer that had characteristics similar to a biofilm. RT-PCR analysis of total RNA samples extracted from C. albicans-inoculated buccal RHE showed that ALS1, ALS2, ALS3, ALS4, ALS5 and ALS9 were consistently detected over time as destruction of the RHE progressed. Detection of transcripts from ALS7, and particularly from ALS6, was more sporadic, but not associated with a strictly temporal pattern. The expression pattern of ALS genes in C. albicans cultures used to inoculate the RHE was similar to that observed in the RHE model, suggesting that contact of C. albicans with buccal RHE does little to alter ALS gene expression. RT-PCR analysis of RNA samples extracted from model denture and catheter biofilms showed similar gene expression patterns to the buccal RHE specimens. Results from the RT-PCR analysis of biofilm RNA specimens were consistent between various C. albicans strains during biofilm development and were comparable to gene expression patterns in planktonic cells. The RT-PCR assay described here will be useful for analysis of human clinical specimens and samples from other disease models. The method will provide further insight into the role of ALS genes and their encoded proteins in the diverse interactions between C. albicans and its host.
Expressed sequence tags from Atta laevigata and identification of candidate genes for the control of pest leaf-cutting ants

PubMed Central

2011-01-01

Background Leafcutters are the highest evolved within Neotropical ants in the tribe Attini and model systems for studying caste formation, labor division and symbiosis with microorganisms. Some species of leafcutters are agricultural pests controlled by chemicals which affect other animals and accumulate in the environment. Aiming to provide genetic basis for the study of leafcutters and for the development of more specific and environmentally friendly methods for the control of pest leafcutters, we generated expressed sequence tag data from Atta laevigata, one of the pest ants with broad geographic distribution in South America. Results The analysis of the expressed sequence tags allowed us to characterize 2,006 unique sequences in Atta laevigata. Sixteen of these genes had a high number of transcripts and are likely positively selected for high level of gene expression, being responsible for three basic biological functions: energy conservation through redox reactions in mitochondria; cytoskeleton and muscle structuring; regulation of gene expression and metabolism. Based on leafcutters lifestyle and reports of genes involved in key processes of other social insects, we identified 146 sequences potential targets for controlling pest leafcutters. The targets are responsible for antixenobiosis, development and longevity, immunity, resistance to pathogens, pheromone function, cell signaling, behavior, polysaccharide metabolism and arginine kynase activity. Conclusion The generation and analysis of expressed sequence tags from Atta laevigata have provided important genetic basis for future studies on the biology of leaf-cutting ants and may contribute to the development of a more specific and environmentally friendly method for the control of agricultural pest leafcutters. PMID:21682882
Expressed sequence tags from Atta laevigata and identification of candidate genes for the control of pest leaf-cutting ants.

PubMed

Rodovalho, Cynara M; Ferro, Milene; Fonseca, Fernando Pp; Antonio, Erik A; Guilherme, Ivan R; Henrique-Silva, Flávio; Bacci, Maurício

2011-06-17

Leafcutters are the highest evolved within Neotropical ants in the tribe Attini and model systems for studying caste formation, labor division and symbiosis with microorganisms. Some species of leafcutters are agricultural pests controlled by chemicals which affect other animals and accumulate in the environment. Aiming to provide genetic basis for the study of leafcutters and for the development of more specific and environmentally friendly methods for the control of pest leafcutters, we generated expressed sequence tag data from Atta laevigata, one of the pest ants with broad geographic distribution in South America. The analysis of the expressed sequence tags allowed us to characterize 2,006 unique sequences in Atta laevigata. Sixteen of these genes had a high number of transcripts and are likely positively selected for high level of gene expression, being responsible for three basic biological functions: energy conservation through redox reactions in mitochondria; cytoskeleton and muscle structuring; regulation of gene expression and metabolism. Based on leafcutters lifestyle and reports of genes involved in key processes of other social insects, we identified 146 sequences potential targets for controlling pest leafcutters. The targets are responsible for antixenobiosis, development and longevity, immunity, resistance to pathogens, pheromone function, cell signaling, behavior, polysaccharide metabolism and arginine kynase activity. The generation and analysis of expressed sequence tags from Atta laevigata have provided important genetic basis for future studies on the biology of leaf-cutting ants and may contribute to the development of a more specific and environmentally friendly method for the control of agricultural pest leafcutters.
Automated Gene Ontology annotation for anonymous sequence data.

PubMed

Hennig, Steffen; Groth, Detlef; Lehrach, Hans

2003-07-01

Gene Ontology (GO) is the most widely accepted attempt to construct a unified and structured vocabulary for the description of genes and their products in any organism. Annotation by GO terms is performed in most of the current genome projects, which besides generality has the advantage of being very convenient for computer based classification methods. However, direct use of GO in small sequencing projects is not easy, especially for species not commonly represented in public databases. We present a software package (GOblet), which performs annotation based on GO terms for anonymous cDNA or protein sequences. It uses the species independent GO structure and vocabulary together with a series of protein databases collected from various sites, to perform a detailed GO annotation by sequence similarity searches. The sensitivity and the reference protein sets can be selected by the user. GOblet runs automatically and is available as a public service on our web server. The paper also addresses the reliability of automated GO annotations by using a reference set of more than 6000 human proteins. The GOblet server is accessible at http://goblet.molgen.mpg.de.
Construction and Evaluation of Normalized cDNA Libraries Enriched with Full-Length Sequences for Rapid Discovery of New Genes from Sisal (Agave sisalana Perr.) Different Developmental Stages

PubMed Central

Zhou, Wen-Zhao; Zhang, Yan-Mei; Lu, Jun-Ying; Li, Jun-Feng

2012-01-01

To provide a resource of sisal-specific expressed sequence data and facilitate this powerful approach in new gene research, the preparation of normalized cDNA libraries enriched with full-length sequences is necessary. Four libraries were produced with RNA pooled from Agave sisalana multiple tissues to increase efficiency of normalization and maximize the number of independent genes by SMART™ method and the duplex-specific nuclease (DSN). This procedure kept the proportion of full-length cDNAs in the subtracted/normalized libraries and dramatically enhanced the discovery of new genes. Sequencing of 3875 cDNA clones of libraries revealed 3320 unigenes with an average insert length about 1.2 kb, indicating that the non-redundancy of libraries was about 85.7%. These unigene functions were predicted by comparing their sequences to functional domain databases and extensively annotated with Gene Ontology (GO) terms. Comparative analysis of sisal unigenes and other plant genomes revealed that four putative MADS-box genes and knotted-like homeobox (knox) gene were obtained from a total of 1162 full-length transcripts. Furthermore, real-time PCR showed that the characteristics of their transcripts mainly depended on the tight expression regulation of a number of genes during the leaf and flower development. Analysis of individual library sequence data indicated that the pooled-tissue approach was highly effective in discovering new genes and preparing libraries for efficient deep sequencing. PMID:23202944
ABCG-like transporter of Trypanosoma cruzi involved in benznidazole resistance: gene polymorphisms disclose inter-strain intragenic recombination in hybrid isolates.

PubMed

Franco, Jaques; Ferreira, Renata C; Ienne, Susan; Zingales, Bianca

2015-04-01

Benznidazole (BZ) is one of the two drugs for Chagas disease treatment. In a previous study we showed that the Trypanosoma cruzi ABCG-like transporter gene, named TcABCG1, is over-expressed in parasite strains naturally resistant to BZ and that the gene of TcI BZ-resistant strains exhibited several single nucleotide polymorphisms (SNPs) as compared to the gene of CL Brener BZ-susceptible strain. Here we report the sequence of TcABCG1 gene of fourteen T. cruzi strains, with diverse degrees of BZ sensitivity and belonging to different discrete typing units (DTUs) and Tcbat group. Although DTU-specific SNPs and amino acid changes were identified, no direct correlation with BZ-resistance phenotype was found. Thus, it is plausible that the transporter abundance is a determinant factor for drug resistance, as pointed out above. Sequence data were used for Bayesian phylogenies and network genealogy analysis. The network showed a high degree of reticulation suggesting genetic exchange between the parasites. TcI and TcII clades were clearly separated. Tcbat sequences were close to TcI. A fourth clade clustered TcABCG1 haplotypes of TcV, TcVI and TcIII strains, with closer proximity to TcI. Analysis of the recombination patterns indicated that hybrid strains contain haplotypes that are mosaics most likely derived by intragenic recombination of parental sequences. The data confirm that TcII and TcIII as the parentals of TcV and TcVI DTUs. Since genetic fingerprint of TcI was found in TcIII, we sustain the previously proposed "Two Hybridization model" for the origin of hybrid strains. Among the twenty best BLASTP hits in databases, orthologues of TcABCG1 transporter were found in Leishmania spp. and African trypanosomes, though their function remains undescribed. Copyright © 2015 Elsevier B.V. All rights reserved.
Conservation of regulatory sequences and gene expression patterns in the disintegrating Drosophila Hox gene complex

PubMed Central

Negre, Bárbara; Casillas, Sònia; Suzanne, Magali; Sánchez-Herrero, Ernesto; Akam, Michael; Nefedov, Michael; Barbadilla, Antonio; de Jong, Pieter; Ruiz, Alfredo

2005-01-01

Homeotic (Hox) genes are usually clustered and arranged in the same order as they are expressed along the anteroposterior body axis of metazoans. The mechanistic explanation for this colinearity has been elusive, and it may well be that a single and universal cause does not exist. The Hox-gene complex (HOM-C) has been rearranged differently in several Drosophila species, producing a striking diversity of Hox gene organizations. We investigated the genomic and functional consequences of the two HOM-C splits present in Drosophila buzzatii. Firstly, we sequenced two regions of the D. buzzatii genome, one containing the genes labial and abdominal A, and another one including proboscipedia, and compared their organization with that of D. melanogaster and D. pseudoobscura in order to map precisely the two splits. Then, a plethora of conserved noncoding sequences, which are putative enhancers, were identified around the three Hox genes closer to the splits. The position and order of these enhancers are conserved, with minor exceptions, between the three Drosophila species. Finally, we analyzed the expression patterns of the same three genes in embryos and imaginal discs of four Drosophila species with different Hox-gene organizations. The results show that their expression patterns are conserved despite the HOM-C splits. We conclude that, in Drosophila, Hox-gene clustering is not an absolute requirement for proper function. Rather, the organization of Hox genes is modular, and their clustering seems the result of phylogenetic inertia more than functional necessity. PMID:15867430
PMS2 gene mutational analysis: direct cDNA sequencing to circumvent pseudogene interference.

PubMed

Wimmer, Katharina; Wernstedt, Annekatrin

2014-01-01

The presence of highly homologous pseudocopies can compromise the mutation analysis of a gene of interest. In particular, when using PCR-based strategies, pseudogene co-amplification has to be effectively prevented. This is often achieved by using primers designed to be parental gene specific according to the reference sequence and by applying stringent PCR conditions. However, there are cases in which this approach is of limited utility. For example, it has been shown that the PMS2 gene exchanges sequences with one of its pseudogenes, named PMS2CL. This results in functional PMS2 alleles containing pseudogene-derived sequences at their 3'-end and in nonfunctional PMS2CL pseudogene alleles that contain gene-derived sequences. Hence, the paralogues cannot be distinguished according to the reference sequence. This shortcoming can be effectively circumvented by using direct cDNA sequencing. This approach is based on the selective amplification of PMS2 transcripts in two overlapping 1.6-kb RT-PCR products. In addition to avoiding pseudogene co-amplification and allele dropout, this method has also the advantage that it allows to effectively identify deletions, splice mutations, and de novo retrotransposon insertions that escape the detection of most DNA-based mutation analysis protocols.
A gene-specific non-enhancer sequence is critical for expression from the promoter of the small heat shock protein gene αB-crystallin

PubMed Central

2014-01-01

Background Deciphering of the information content of eukaryotic promoters has remained confined to universal landmarks and conserved sequence elements such as enhancers and transcription factor binding motifs, which are considered sufficient for gene activation and regulation. Gene-specific sequences, interspersed between the canonical transacting factor binding sites or adjoining them within a promoter, are generally taken to be devoid of any regulatory information and have therefore been largely ignored. An unanswered question therefore is, do gene-specific sequences within a eukaryotic promoter have a role in gene activation? Here, we present an exhaustive experimental analysis of a gene-specific sequence adjoining the heat shock element (HSE) in the proximal promoter of the small heat shock protein gene, αB-crystallin (cryab). These sequences are highly conserved between the rodents and the humans. Results Using human retinal pigment epithelial cells in culture as the host, we have identified a 10-bp gene-specific promoter sequence (GPS), which, unlike an enhancer, controls expression from the promoter of this gene, only when in appropriate position and orientation. Notably, the data suggests that GPS in comparison with the HSE works in a context-independent fashion. Additionally, when moved upstream, about a nucleosome length of DNA (−154 bp) from the transcription start site (TSS), the activity of the promoter is markedly inhibited, suggesting its involvement in local promoter access. Importantly, we demonstrate that deletion of the GPS results in complete loss of cryab promoter activity in transgenic mice. Conclusions These data suggest that gene-specific sequences such as the GPS, identified here, may have critical roles in regulating gene-specific activity from eukaryotic promoters. PMID:24589182
Purification of cold-shock-like proteins from Stigmatella aurantiaca - molecular cloning and characterization of the cspA gene.

PubMed

Stamm, I; Leclerque, A; Plaga, W

1999-09-01

Prominent low-molecular-weight proteins were isolated from vegetative cells of the myxobacterium Stigmatella aurantiaca and were found to be members of the cold-shock protein family. A first gene of this family (cspA) was cloned and sequenced. It encodes a protein of 68 amino acid residues that displays up to 71% sequence identity with other bacterial cold-shock(-like) proteins. A cysteine residue within the RNP-2 motif is a peculiarity of Stigmatella CspA. A cspA::(Deltatrp-lacZ) fusion gene construct was introduced into Stigmatella by electroporation, a method that has not been used previously for this strain. Analysis of the resultant transformants revealed that cspA transcription occurs at high levels during vegetative growth at 20 and 32 degrees C, and during fruiting body formation.
Expression of alpha-expansin and expansin-like genes in deepwater rice.

PubMed

Lee, Yi; Kende, Hans

2002-11-01

Previously, we have studied the expression and regulation of four alpha- and 14 beta-expansin genes in deepwater rice (Oryza sativa). We now report on the structure, expression, and regulation of 22 additional alpha-expansin (Os-EXP) genes, four expansin-like (Os-EXPL) genes, and one expansin-related (Os-EXPR) gene, which have recently been identified in the expressed sequence tag and genomic databases of rice. Alpha-expansins are characterized by a series of conserved Cys residues in the N-terminal half of the protein, a histidine-phenylalanine-aspartate (HFD) motif in the central region, and a series of tryptophan residues near the carboxyl terminus. Of the 22 additional alpha-expansin genes, five are expressed in internodes and leaves, three in coleoptiles, and nine in roots, with high transcript levels in the growing regions of these organs. Transcripts of five alpha-expansin genes were found in roots only. Expression of five alpha-expansin genes was induced in the internode by treatment with gibberellin (GA) and by wounding. The wound response resulted from excising stem sections or from piercing pinholes into the stem of intact plants. EXPL proteins lack the HFD motif and have two additional Cys residues in their C- and N-terminal regions. The positions of conserved tryptophan residues at the C-terminal region are different from those of alpha- and beta-expansins. Expression of the Os-EXPL3 gene is correlated with elongation and slightly induced by applied GA. However, the expression of the Os-EXPL1 and Os-EXPL2 genes showed limited correlation with cell elongation and was not induced by GA. We found no expression of the Os-EXPR1 gene in the organs examined.
[Binding studies with Ulex europaeus agglutinin I (UEA-I) of the vascular endothelium of the synovial membrane].

PubMed

Zschäbitz, A; Stofft, E

1988-01-01

The lectin binding sites of the synovium of patients with rheumatoid arthritis and osteoarthritis were investigated. It was shown that Ulex europaeus agglutinin is a constant marker of the vascular endothelium and is not induced during the course of inflammatory process in rheumatoid arthritis.
Molecular characterization of genes encoding trypsin-like enzymes from Aedes aegypti larvae and identification of digestive enzymes.

PubMed

Soares, Tatiane S; Watanabe, Renata M O; Lemos, Francisco J A; Tanaka, Aparecida S

2011-12-10

Trypsin-like enzymes play an important role in the Aedes aegypti digestive process. The trypsin-like enzymes present in adults were characterized previously, but little is known about trypsins in larvae. In the present work, we identified one of the trypsin enzymes from Ae. aegypti larval midgut using a library of trypsin gene fragments, which was the sequence known as AAEL005607 from the Ae. aegypti genome. Quantitative PCR analysis showed that AAEL005607 was transcribed in all larval instars, but it was not present in adult midgut. In order to confirm transcription data, the trypsin-like enzymes from 4th instar larvae of Ae. aegypti midgut were purified and sequenced. Purified trypsin showed identity with the amino-terminal sequence of AAEL005607, AAEL005609 and AAEL005614. These three trypsins have high amino acids identity, and could all be used as a template for the design of inhibitors. In conclusion, for the first time, digestive enzymes of 4th larval instar of Ae. aegypti were purified and characterized. The knowledge of digestive enzymes present in Ae. aegypti larvae may be helpful in the development of a larvicide. Copyright © 2011 Elsevier B.V. All rights reserved.
Sequence variations of the alpha-globin genes: scanning of high CG content genes with DHPLC and DG-DGGE.

PubMed

Lacerra, Giuseppina; Fiorito, Mirella; Musollino, Gennaro; Di Noce, Francesca; Esposito, Maria; Nigro, Vincenzo; Gaudiano, Carlo; Carestia, Clementina

2004-10-01

The alpha-globin chains are encoded by two duplicated genes (HBA2 and HBA1, 5'-3') showing overall sequence homology >96% and average CG content >60%. alpha-Thalassemia, the most prevalent worldwide autosomal recessive disorder, is a hereditary anemia caused by sequence variations of these genes in about 25% of carriers. We evaluated the overall sensitivity and suitability of DHPLC and DG-DGGE in scanning both the alpha-globin genes by carrying out a retrospective analysis of 19 variant alleles in 29 genotypes. The HBA2 alleles c.1A>G, c.79G>A, and c.281T>G, and the HBA1 allele c.475C>A were new. Three pathogenic sequence variations were associated in cis with nonpathogenic variations in all families studied; they were the HBA2 variation c.2T>C associated with c.-24C>G, and the HBA2 variations c.391G>C and c.427T>C, both associated with c.565G>A. We set up original experimental conditions for DHPLC and DG-DGGE and analyzed 10 normal subjects, 46 heterozygotes, seven homozygotes, seven compound heterozygotes, and six compound heterozygotes for a hybrid gene. Both the methodologies gave reproducible results and no false-positive was detected. DHPLC showed 100% sensitivity and DG-DGGE nearly 90%. About 100% of the sequence from the cap site to the polyA addition site could be scanned by DHPLC, about 87% by DG-DGGE. It is noteworthy that the three most common pathogenic sequence variations (HBA2 alleles c.2T>C, c.95+2_95+6del, and c.523A>G) were unambiguously detected by both the methodologies. Genotype diagnosis must be confirmed with PCR sequencing of single amplicons or with an allele-specific method. This study can be helpful for scanning genes with high CG content and offers a model suitable for duplicated genes with high homology. Copyright 2004 Wiley-Liss, Inc.
Cloning and characterization of chsD, a chitin synthase-like gene of Aspergillus fumigatus.

PubMed

Mellado, E; Specht, C A; Robbins, P W; Holden, D W

1996-09-15

A chitin synthase-like gene (chsD) was isolated from an Aspergillus fumigatus genomic DNA library. Comparisons with the predicted amino acid sequence from chsD reveals low but significant similarity to chitin synthases, to other N-acetylglucosaminyltransferases (NodC from Rhizopus spp., HasA from Streptococcus spp. and DG42 from vertebrates. A chsD- mutant strain constructed by gene disruption has a 20% reduction in total mycelial chitin content; however, no differences between the wild-type strain and the chsD- strain were found with respect to morphology, chitin synthase activity or virulence in a neutropenic murine model of aspergillosis. The results show that the chsD product has an important but inessential role in the synthesis of chitin in A. fumigatus.
On construction of stochastic genetic networks based on gene expression sequences.

PubMed

Ching, Wai-Ki; Ng, Michael M; Fung, Eric S; Akutsu, Tatsuya

2005-08-01

Reconstruction of genetic regulatory networks from time series data of gene expression patterns is an important research topic in bioinformatics. Probabilistic Boolean Networks (PBNs) have been proposed as an effective model for gene regulatory networks. PBNs are able to cope with uncertainty, corporate rule-based dependencies between genes and discover the sensitivity of genes in their interactions with other genes. However, PBNs are unlikely to use directly in practice because of huge amount of computational cost for obtaining predictors and their corresponding probabilities. In this paper, we propose a multivariate Markov model for approximating PBNs and describing the dynamics of a genetic network for gene expression sequences. The main contribution of the new model is to preserve the strength of PBNs and reduce the complexity of the networks. The number of parameters of our proposed model is O(n2) where n is the number of genes involved. We also develop efficient estimation methods for solving the model parameters. Numerical examples on synthetic data sets and practical yeast data sequences are given to demonstrate the effectiveness of the proposed model.
Phylogenetic analysis of genes involved in mycosporine-like amino acid biosynthesis in symbiotic dinoflagellates.

PubMed

Rosic, Nedeljka N

2012-04-01

Mycosporine-like amino acids (MAAs) are multifunctional secondary metabolites involved in photoprotection in many marine organisms. As well as having broad ultraviolet (UV) absorption spectra (310-362 nm), these biological sunscreens are also involved in the prevention of oxidative stress. More than 20 different MAAs have been discovered so far, characterized by distinctive chemical structures and a broad ecological distribution. Additionally, UV-screening MAA metabolites have been investigated and used in biotechnology and cosmetics. The biosynthesis of MAAs has been suggested to occur via either the shikimate or pentose phosphate pathways. Despite their wide distribution in marine and freshwater species and also the commercial application in cosmetic products, there are still a number of uncertainties regarding the genetic, biochemical, and evolutionary origin of MAAs. Here, using a transcriptome-mining approach, we identify the gene counterparts from the shikimate or pentose phosphate pathway involved in MAA biosynthesis within the sequences of the reef-building coral symbiotic dinoflagellates (genus Symbiodinium). We also report the highly similar sequences of genes from the proposed MAA biosynthetic pathway involved in the metabolism of 4-deoxygadusol (direct MAA precursor) in various Symbiodinium strains confirming their algal origin and conserved nature. Finally, we reveal the separate identity of two O-methyltransferase genes, possibly involved in MAA biosynthesis, as well as nonribosomal peptide synthetase and adenosine triphosphate grasp homologs in symbiotic dinoflagellates. This study provides a biochemical and phylogenetic overview of the genes from the proposed MAA biosynthetic pathway with a focus on coral endosymbionts.
Definition of the Cattle Killer Cell Ig–like Receptor Gene Family: Comparison with Aurochs and Human Counterparts

PubMed Central

Sanderson, Nicholas D.; Norman, Paul J.; Guethlein, Lisbeth A.; Ellis, Shirley A.; Williams, Christina; Breen, Matthew; Park, Steven D. E.; Magee, David A.; Babrzadeh, Farbod; Warry, Andrew; Watson, Mick; Bradley, Daniel G.; MacHugh, David E.; Parham, Peter

2014-01-01

Under selection pressure from pathogens, variable NK cell receptors that recognize polymorphic MHC class I evolved convergently in different species of placental mammal. Unexpectedly, diversified killer cell Ig–like receptors (KIRs) are shared by simian primates, including humans, and cattle, but not by other species. Whereas much is known of human KIR genetics and genomics, knowledge of cattle KIR is limited to nine cDNA sequences. To facilitate comparison of the cattle and human KIR gene families, we determined the genomic location, structure, and sequence of two cattle KIR haplotypes and defined KIR sequences of aurochs, the extinct wild ancestor of domestic cattle. Larger than its human counterpart, the cattle KIR locus evolved through successive duplications of a block containing ancestral KIR3DL and KIR3DX genes that existed before placental mammals. Comparison of two cattle KIR haplotypes and aurochs KIR show the KIR are polymorphic and the gene organization and content appear conserved. Of 18 genes, 8 are functional and 10 were inactivated by point mutation. Selective inactivation of KIR3DL and activating receptor genes leaves a functional cohort of one inhibitory KIR3DL, one activating KIR3DX, and six inhibitory KIR3DX. Functional KIR diversity evolved from KIR3DX in cattle and from KIR3DL in simian primates. Although independently evolved, cattle and human KIR gene families share important function-related properties, indicating that cattle KIR are NK cell receptors for cattle MHC class I. Combinations of KIR and MHC class I are the major genetic factors associated with human disease and merit investigation in cattle. PMID:25398326
Mining and gene ontology based annotation of SSR markers from expressed sequence tags of Humulus lupulus

PubMed Central

Singh, Swati; Gupta, Sanchita; Mani, Ashutosh; Chaturvedi, Anoop

2012-01-01

Humulus lupulus is commonly known as hops, a member of the family moraceae. Currently many projects are underway leading to the accumulation of voluminous genomic and expressed sequence tag sequences in public databases. The genetically characterized domains in these databases are limited due to non-availability of reliable molecular markers. The large data of EST sequences are available in hops. The simple sequence repeat markers extracted from EST data are used as molecular markers for genetic characterization, in the present study. 25,495 EST sequences were examined and assembled to get full-length sequences. Maximum frequency distribution was shown by mononucleotide SSR motifs i.e. 60.44% in contig and 62.16% in singleton where as minimum frequency are observed for hexanucleotide SSR in contig (0.09%) and pentanucleotide SSR in singletons (0.12%). Maximum trinucleotide motifs code for Glutamic acid (GAA) while AT/TA were the most frequent repeat of dinucleotide SSRs. Flanking primer pairs were designed in-silico for the SSR containing sequences. Functional categorization of SSRs containing sequences was done through gene ontology terms like biological process, cellular component and molecular function. PMID:22368382
Well-characterized sequence features of eukaryote genomes and implications for ab initio gene prediction.

PubMed

Huang, Ying; Chen, Shi-Yi; Deng, Feilong

2016-01-01

In silico analysis of DNA sequences is an important area of computational biology in the post-genomic era. Over the past two decades, computational approaches for ab initio prediction of gene structure from genome sequence alone have largely facilitated our understanding on a variety of biological questions. Although the computational prediction of protein-coding genes has already been well-established, we are also facing challenges to robustly find the non-coding RNA genes, such as miRNA and lncRNA. Two main aspects of ab initio gene prediction include the computed values for describing sequence features and used algorithm for training the discriminant function, and by which different combinations are employed into various bioinformatic tools. Herein, we briefly review these well-characterized sequence features in eukaryote genomes and applications to ab initio gene prediction. The main purpose of this article is to provide an overview to beginners who aim to develop the related bioinformatic tools.

Retrograde and transganglionic transport of horseradish peroxidase-conjugated cholera toxin B subunit, wheatgerm agglutinin and isolectin B4 from Griffonia simplicifolia I in primary afferent neurons innervating the rat urinary bladder.

PubMed

Wang, H F; Shortland, P; Park, M J; Grant, G

1998-11-01

myelinated fibres). Double labelling with other neuronal markers showed that 71%, 43% and 36% of the cholera toxin B subunit-immunoreactive cells were calcitonin gene-related peptide-, isolectin B4-binding- and substance P-positive, respectively. A few cholera toxin B subunit cells showed galanin-immunoreactivity, but none were somatostatin-, vasoactive intestinal polypeptide-, or neuropeptide Y-immunoreactive or contained fluoride-resistant acid phosphatase. The results show that cholera toxin B subunit-horseradish peroxidase is a more effective retrograde and transganglionic tracer for pelvic primary afferents from the urinary bladder than wheat germ agglutinin-horseradish peroxidase and isolectin B4-horseradish peroxidase, but in contrast to somatic nerves, it is transported mainly by unmyelinated fibres in the visceral afferents.
Cloning and sequencing of the allophycocyanin genes from Spirulina maxima (Cyanophyta)

NASA Astrophysics Data System (ADS)

Qin, Song; Hiroyuki, Kojima; Yoshikazu, Kawata; Shin-Ichi, Yano; Zeng, Cheng-Kui

1998-03-01

The genes coding for the α-and β-subunit of allophycocyanin ( apcA and apcB) from the cyanophyte Spirulina maxima were cloned and sequenced. The results revealed 44.4% of nucleotide sequence similarity and 30.4% of similarity of deduced amino acid sequence between them. The amino acid sequence identities between S. maxima and S. platensis are 99.4% for α subunit and 100% for β subunit.
Combined sequence and sequence-structure-based methods for analyzing RAAS gene SNPs: a computational approach.

PubMed

Singh, Kh Dhanachandra; Karthikeyan, Muthusamy

2014-12-01

The renin-angiotensin-aldosterone system (RAAS) plays a key role in the regulation of blood pressure (BP). Mutations on the genes that encode components of the RAAS have played a significant role in genetic susceptibility to hypertension and have been intensively scrutinized. The identification of such probably causal mutations not only provides insight into the RAAS but may also serve as antihypertensive therapeutic targets and diagnostic markers. The methods for analyzing the SNPs from the huge dataset of SNPs, containing both functional and neutral SNPs is challenging by the experimental approach on every SNPs to determine their biological significance. To explore the functional significance of genetic mutation (SNPs), we adopted combined sequence and sequence-structure-based SNP analysis algorithm. Out of 3864 SNPs reported in dbSNP, we found 108 missense SNPs in the coding region and remaining in the non-coding region. In this study, we are reporting only those SNPs in coding region to be deleterious when three or more tools are predicted to be deleterious and which have high RMSD from the native structure. Based on these analyses, we have identified two SNPs of REN gene, eight SNPs of AGT gene, three SNPs of ACE gene, two SNPs of AT1R gene, three SNPs of CYP11B2 gene and three SNPs of CMA1 gene in the coding region were found to be deleterious. Further this type of study will be helpful in reducing the cost and time for identification of potential SNP and also helpful in selecting potential SNP for experimental study out of SNP pool.
EUGÈNE'HOM: a generic similarity-based gene finder using multiple homologous sequences

PubMed Central

Foissac, Sylvain; Bardou, Philippe; Moisan, Annick; Cros, Marie-Josée; Schiex, Thomas

2003-01-01

EUGÈNE'HOM is a gene prediction software for eukaryotic organisms based on comparative analysis. EUGÈNE'HOM is able to take into account multiple homologous sequences from more or less closely related organisms. It integrates the results of TBLASTX analysis, splice site and start codon prediction and a robust coding/non-coding probabilistic model which allows EUGÈNE'HOM to handle sequences from a variety of organisms. The current target of EUGÈNE'HOM is plant sequences. The EUGÈNE'HOM web site is available at http://genopole.toulouse.inra.fr/bioinfo/eugene/EuGeneHom/cgi-bin/EuGeneHom.pl. PMID:12824408
Human ribosomal RNA gene: nucleotide sequence of the transcription initiation region and comparison of three mammalian genes.

PubMed Central

Financsek, I; Mizumoto, K; Mishima, Y; Muramatsu, M

1982-01-01

The transcription initiation site of the human ribosomal RNA gene (rDNA) was located by using the single-strand specific nuclease protection method and by determining the first nucleotide of the in vitro capped 45S preribosomal RNA. The sequence of 1,211 nucleotides surrounding the initiation site was determined. The sequenced region was found to consist of 75% G and C and to contain a number of short direct and inverted repeats and palindromes. By comparison of the corresponding initiation regions of three mammalian species, several conserved sequences were found upstream and downstream from the transcription starting point. Two short A + T-rich sequences are present on human, mouse, and rat ribosomal RNA genes between the initiation site and 40 nucleotides upstream, and a C + T cluster is located at a position around -60. At and downstream from the initiation site, a common sequence, T-AG-C-T-G-A-C-A-C-G-C-T-G-T-C-C-T-CT-T, was found in the three genes from position -1 through +18. The strong conservation of these sequences suggests their functional significance in rDNA. The S1 nuclease protection experiments with cloned rDNA fragments indicated the presence in human 45S RNA of molecules several hundred nucleotides shorter than the supposed primary transcript. The first 19 nucleotides of these molecules appear identical--except for one mismatch--to the nucleotide sequence of the 5' end of a supposed early processing product of the mouse 45S RNA. Images PMID:6954460
Whole exome sequencing as a diagnostic tool for patients with ciliopathy-like phenotypes.

PubMed

Castro-Sánchez, Sheila; Álvarez-Satta, María; Tohamy, Mohamed A; Beltran, Sergi; Derdak, Sophia; Valverde, Diana

2017-01-01

Ciliopathies are a group of rare disorders characterized by a high genetic and phenotypic variability, which complicates their molecular diagnosis. Hence the need to use the latest powerful approaches to faster identify the genetic defect in these patients. We applied whole exome sequencing to six consanguineous families clinically diagnosed with ciliopathy-like disease, and for which mutations in predominant Bardet-Biedl syndrome (BBS) genes had previously been excluded. Our strategy, based on first applying several filters to ciliary variants and using many of the bioinformatics tools available, allowed us to identify causal mutations in BBS2, ALMS1 and CRB1 genes in four families, thus confirming the molecular diagnosis of ciliopathy. In the remaining two families, after first rejecting the presence of pathogenic variants in common cilia-related genes, we adopted a new filtering strategy combined with prioritisation tools to rank the final candidate genes for each case. Thus, we propose CORO2B, LMO7 and ZNF17 as novel candidate ciliary genes, but further functional studies will be needed to confirm their role. Our data show the usefulness of this strategy to diagnose patients with unclear phenotypes, and therefore the success of applying such technologies to achieve a rapid and reliable molecular diagnosis, improving genetic counselling for these patients. In addition, the described pipeline also highlights the common pitfalls associated to the large volume of data we have to face and the difficulty of assigning a functional role to these changes, hence the importance of designing the most appropriate strategy according to each case.
Sequence variants of Toll-like receptor 4 and susceptibility to prostate cancer.

PubMed

Chen, Yen-Ching; Giovannucci, Edward; Lazarus, Ross; Kraft, Peter; Ketkar, Shamika; Hunter, David J

2005-12-15

Chronic inflammation has been hypothesized to be a risk factor for prostate cancer. The Toll-like receptor 4 (TLR4) presents the bacterial lipopolysaccharide (LPS), which interacts with ligand-binding protein and CD14 (LPS receptor) and activates expression of inflammatory genes through nuclear factor-kappaB and mitogen-activated protein kinase signaling. A previous case-control study found a modest association of a polymorphism in the TLR4 gene [11381G/C, GG versus GC/CC: odds ratio (OR), 1.26] with risk of prostate cancer. We assessed if sequence variants of TLR4 were associated with the risk of prostate cancer. In a nested case-control design within the Health Professionals Follow-up Study, we identified 700 participants with prostate cancer diagnosed after they had provided a blood specimen in 1993 and before January 2000. Controls were 700 age-matched men without prostate cancer who had had a prostate-specific antigen test after providing a blood specimen. We genotyped 16 common (>5%) single nucleotide polymorphisms (SNP) discovered in a resequencing study spanning TLR4 to test for association between sequence variation in TLR4 and prostate cancer. Homozygosity for the variant alleles of eight SNPs was associated with a statistically significantly lower risk of prostate cancer (TLR4_1893, TLR4_2032, TLR4_2437, TLR4_7764, TLR4_11912, TLR4_16649, TLR4_17050, and TLR4_17923), but the TLR4_15844 polymorphism corresponding to 11381G/C was not associated with prostate cancer (GG versus CG/CC: OR, 1.01; 95% confidence interval, 0.79-1.29). Six common haplotypes (cumulative frequency, 81%) were observed; the global test for association between haplotypes and prostate cancer was statistically significant (chi(2) = 14.8 on 6 degrees of freedom; P = 0.02). Two common haplotypes were statistically significantly associated with altered risk of prostate cancer. Inherited polymorphisms of the innate immune gene TLR4 are associated with risk of prostate cancer.
Sequence variations of the partially dominant DELLA gene Rht-B1c in wheat and their functional impacts

PubMed Central

Ma, Zhengqiang

2013-01-01

Rht-B1c, allelic to the DELLA protein-encoding gene Rht-B1a, is a natural mutation documented in common wheat (Triticum aestivum). It confers variation to a number of traits related to cell and plant morphology, seed dormancy, and photosynthesis. The present study was conducted to examine the sequence variations of Rht-B1c and their functional impacts. The results showed that Rht-B1c was partially dominant or co-dominant for plant height, and exhibited an increased dwarfing effect. At the sequence level, Rht-B1c differed from Rht-B1a by one 2kb Veju retrotransposon insertion, three coding region single nucleotide polymorphisms (SNPs), one 197bp insertion, and four SNPs in the 1kb upstream sequence. Haplotype investigations, association analyses, transient expression assays, and expression profiling showed that the Veju insertion was primarily responsible for the extreme dwarfing effect. It was found that the Veju insertion changed processing of the Rht-B1c transcripts and resulted in DELLA motif primary structure disruption. Expression assays showed that Rht-B1c caused reduction of total Rht-1 transcript levels, and up-regulation of GATA-like transcription factors and genes positively regulated by these factors, suggesting that one way in which Rht-1 proteins affect plant growth and development is through GATA-like transcription factor regulation. PMID:23918966
Myelin protein zero gene sequencing diagnoses Charcot-Marie-Tooth Type 1B disease

DOE Office of Scientific and Technical Information (OSTI.GOV)

Su, Y.; Zhang, H.; Madrid, R.

1994-09-01

Charcot-Marie-Tooth disease (CMT), the most common genetic neuropathy, affects about 1 in 2600 people in Norway and is found worldwide. CMT Type 1 (CMT1) has slow nerve conduction with demyelinated Schwann cells. Autosomal dominant CMT Type 1B (CMT1B) results from mutations in the myelin protein zero gene which directs the synthesis of more than half of all Schwann cell protein. This gene was mapped to the chromosome 1q22-1q23.1 borderline by fluorescence in situ hybridization. The first 7 of 7 reported CMT1B mutations are unique. Thus the most effective means to identify CMT1B mutations in at-risk family members and fetuses ismore » to sequence the entire coding sequence in dominant or sporadic CMT patients without the CMT1A duplication. Of the 19 primers used in 16 pars to uniquely amplify the entire MPZ coding sequence, 6 primer pairs were used to amplify and sequence the 6 exons. The DyeDeoxy Terminator cycle sequencing method used with four different color fluorescent lables was superior to manual sequencing because it sequences more bases unambiguously from extracted genomic DNA samples within 24 hours. This protocol was used to test 28 CMT and Dejerine-Sottas patients without CMT1A gene duplication. Sequencing MPZ gene-specific amplified fragments identified 9 polymorphic sites within the 6 exons that encode the 248 amino acid MPZ protein. The large number of major CMT1B mutations identified by single strand sequencing are being verified by reverse strand sequencing and when possible, by restriction enzyme analysis. This protocol can be used to distringuish CMT1B patients from othre CMT phenotypes and to determine the CMT1B status of relatives both presymptomatically and prenatally.« less
Cloning, sequencing and characterization of lipase genes from a polyhydroxyalkanoate- (PHA-) synthesizing Pseudomonas resinovorans

USDA-ARS?s Scientific Manuscript database

Lipase (lip) and lipase-specific foldase (lif) genes of a biodegradable polyhydroxyalkanoate- (PHA-) synthesizing Pseudomonas resinovorans NRRL B-2649 were cloned using primers based on consensus sequences, followed by PCR-based genome walking. Sequence analyses showed a putative Lip gene-product (...
DjhnRNPA2/B1-like gene is required for planarian regeneration and tissue homeostasis.

PubMed

Dong, Zimei; Yang, Tong; Yang, Yibo; Dou, He; Chen, Guangwen

2017-10-30

The hnRNPs play important roles in physiological processes in eukaryotic organisms by regulation of pre-mRNA after transcription, including pre-mRNA splicing, mRNA stability, DNA replication and repair and telomere maintenance and so on. However, it remains unclear about the specific functions of these genes. In this study, the full-length cDNA sequence of hnRNPA2/B1-like was first cloned from Dugesia japonica, and its roles were investigated by WISH and RNAi. The results showed that: (1) DjhnRNPA2/B1-like was highly conserved during animal evolution; (2) DjhnRNPA2/B1-like mRNA was mainly distributed each side of the body in intact worms and regenerative blastemas, and its expression levels were up-regulated on days 0 and 5 after amputation; (3) the intact and regenerating worms gradually lysed or lost regeneration capacity after DjhnRNPA2/B1-like RNAi; and (4) DjhnRNPA2/B1-like expression is induced by temperature and heavy metal ion stress. The data suggests that DjhnRNPA2/B1-like is a multiple functional gene, it plays important roles in regeneration and homeostatic maintenance and it is also involved in stress responses in planarians. Our work provides basic data for the study of regenerative mechanism and stress responses in freshwater planarians. Copyright © 2017. Published by Elsevier B.V.
De novo transcriptome sequencing of axolotl blastema for identification of differentially expressed genes during limb regeneration

PubMed Central

2013-01-01

Background Salamanders are unique among vertebrates in their ability to completely regenerate amputated limbs through the mediation of blastema cells located at the stump ends. This regeneration is nerve-dependent because blastema formation and regeneration does not occur after limb denervation. To obtain the genomic information of blastema tissues, de novo transcriptomes from both blastema tissues and denervated stump ends of Ambystoma mexicanum (axolotls) 14 days post-amputation were sequenced and compared using Solexa DNA sequencing. Results The sequencing done for this study produced 40,688,892 reads that were assembled into 307,345 transcribed sequences. The N50 of transcribed sequence length was 562 bases. A similarity search with known proteins identified 39,200 different genes to be expressed during limb regeneration with a cut-off E-value exceeding 10-5. We annotated assembled sequences by using gene descriptions, gene ontology, and clusters of orthologous group terms. Targeted searches using these annotations showed that the majority of the genes were in the categories of essential metabolic pathways, transcription factors and conserved signaling pathways, and novel candidate genes for regenerative processes. We discovered and confirmed numerous sequences of the candidate genes by using quantitative polymerase chain reaction and in situ hybridization. Conclusion The results of this study demonstrate that de novo transcriptome sequencing allows gene expression analysis in a species lacking genome information and provides the most comprehensive mRNA sequence resources for axolotls. The characterization of the axolotl transcriptome can help elucidate the molecular mechanisms underlying blastema formation during limb regeneration. PMID:23815514
The Genome Sequence of the Cyanobacterium Oscillatoria sp. PCC 6506 Reveals Several Gene Clusters Responsible for the Biosynthesis of Toxins and Secondary Metabolites▿

PubMed Central

Méjean, Annick; Mazmouz, Rabia; Mann, Stéphane; Calteau, Alexandra; Médigue, Claudine; Ploux, Olivier

2010-01-01

We report a draft sequence of the genome of Oscillatoria sp. PCC 6506, a cyanobacterium that produces anatoxin-a and homoanatoxin-a, two neurotoxins, and cylindrospermopsin, a cytotoxin. Beside the clusters of genes responsible for the biosynthesis of these toxins, we have found other clusters of genes likely involved in the biosynthesis of not-yet-identified secondary metabolites. PMID:20675499
Exome sequencing in amyotrophic lateral sclerosis identifies risk genes and pathways.

PubMed

Cirulli, Elizabeth T; Lasseigne, Brittany N; Petrovski, Slavé; Sapp, Peter C; Dion, Patrick A; Leblond, Claire S; Couthouis, Julien; Lu, Yi-Fan; Wang, Quanli; Krueger, Brian J; Ren, Zhong; Keebler, Jonathan; Han, Yujun; Levy, Shawn E; Boone, Braden E; Wimbish, Jack R; Waite, Lindsay L; Jones, Angela L; Carulli, John P; Day-Williams, Aaron G; Staropoli, John F; Xin, Winnie W; Chesi, Alessandra; Raphael, Alya R; McKenna-Yasek, Diane; Cady, Janet; Vianney de Jong, J M B; Kenna, Kevin P; Smith, Bradley N; Topp, Simon; Miller, Jack; Gkazi, Athina; Al-Chalabi, Ammar; van den Berg, Leonard H; Veldink, Jan; Silani, Vincenzo; Ticozzi, Nicola; Shaw, Christopher E; Baloh, Robert H; Appel, Stanley; Simpson, Ericka; Lagier-Tourenne, Clotilde; Pulst, Stefan M; Gibson, Summer; Trojanowski, John Q; Elman, Lauren; McCluskey, Leo; Grossman, Murray; Shneider, Neil A; Chung, Wendy K; Ravits, John M; Glass, Jonathan D; Sims, Katherine B; Van Deerlin, Vivianna M; Maniatis, Tom; Hayes, Sebastian D; Ordureau, Alban; Swarup, Sharan; Landers, John; Baas, Frank; Allen, Andrew S; Bedlack, Richard S; Harper, J Wade; Gitler, Aaron D; Rouleau, Guy A; Brown, Robert; Harms, Matthew B; Cooper, Gregory M; Harris, Tim; Myers, Richard M; Goldstein, David B

2015-03-27

Amyotrophic lateral sclerosis (ALS) is a devastating neurological disease with no effective treatment. We report the results of a moderate-scale sequencing study aimed at increasing the number of genes known to contribute to predisposition for ALS. We performed whole-exome sequencing of 2869 ALS patients and 6405 controls. Several known ALS genes were found to be associated, and TBK1 (the gene encoding TANK-binding kinase 1) was identified as an ALS gene. TBK1 is known to bind to and phosphorylate a number of proteins involved in innate immunity and autophagy, including optineurin (OPTN) and p62 (SQSTM1/sequestosome), both of which have also been implicated in ALS. These observations reveal a key role of the autophagic pathway in ALS and suggest specific targets for therapeutic intervention. Copyright © 2015, American Association for the Advancement of Science.
EXONSAMPLER: a computer program for genome-wide and candidate gene exon sampling for targeted next-generation sequencing.

PubMed

Cosart, Ted; Beja-Pereira, Albano; Luikart, Gordon

2014-11-01

The computer program EXONSAMPLER automates the sampling of thousands of exon sequences from publicly available reference genome sequences and gene annotation databases. It was designed to provide exon sequences for the efficient, next-generation gene sequencing method called exon capture. The exon sequences can be sampled by a list of gene name abbreviations (e.g. IFNG, TLR1), or by sampling exons from genes spaced evenly across chromosomes. It provides a list of genomic coordinates (a bed file), as well as a set of sequences in fasta format. User-adjustable parameters for collecting exon sequences include a minimum and maximum acceptable exon length, maximum number of exonic base pairs (bp) to sample per gene, and maximum total bp for the entire collection. It allows for partial sampling of very large exons. It can preferentially sample upstream (5 prime) exons, downstream (3 prime) exons, both external exons, or all internal exons. It is written in the Python programming language using its free libraries. We describe the use of EXONSAMPLER to collect exon sequences from the domestic cow (Bos taurus) genome for the design of an exon-capture microarray to sequence exons from related species, including the zebu cow and wild bison. We collected ~10% of the exome (~3 million bp), including 155 candidate genes, and ~16,000 exons evenly spaced genomewide. We prioritized the collection of 5 prime exons to facilitate discovery and genotyping of SNPs near upstream gene regulatory DNA sequences, which control gene expression and are often under natural selection. © 2014 John Wiley & Sons Ltd.
Next-Generation Sequence Analysis of the Genome of RFHVMn, the Macaque Homolog of Kaposi's Sarcoma (KS)-Associated Herpesvirus, from a KS-Like Tumor of a Pig-Tailed Macaque

PubMed Central

Bruce, A. Gregory; Ryan, Jonathan T.; Thomas, Mathew J.; Peng, Xinxia; Grundhoff, Adam; Tsai, Che-Chung

2013-01-01

The complete sequence of retroperitoneal fibromatosis-associated herpesvirus Macaca nemestrina (RFHVMn), the pig-tailed macaque homolog of Kaposi's sarcoma-associated herpesvirus (KSHV), was determined by next-generation sequence analysis of a Kaposi's sarcoma (KS)-like macaque tumor. Colinearity of genes was observed with the KSHV genome, and the core herpesvirus genes had strong sequence homology to the corresponding KSHV genes. RFHVMn lacked homologs of open reading frame 11 (ORF11) and KSHV ORFs K5 and K6, which appear to have been generated by duplication of ORFs K3 and K4 after the divergence of KSHV and RFHV. RFHVMn contained positional homologs of all other unique KSHV genes, although some showed limited sequence similarity. RFHVMn contained a number of candidate microRNA genes. Although there was little sequence similarity with KSHV microRNAs, one candidate contained the same seed sequence as the positional homolog, kshv-miR-K12-10a, suggesting functional overlap. RNA transcript splicing was highly conserved between RFHVMn and KSHV, and strong sequence conservation was noted in specific promoters and putative origins of replication, predicting important functional similarities. Sequence comparisons indicated that RFHVMn and KSHV developed in long-term synchrony with the evolution of their hosts, and both viruses phylogenetically group within the RV1 lineage of Old World primate rhadinoviruses. RFHVMn is the closest homolog of KSHV to be completely sequenced and the first sequenced RV1 rhadinovirus homolog of KSHV from a nonhuman Old World primate. The strong genetic and sequence similarity between RFHVMn and KSHV, coupled with similarities in biology and pathology, demonstrate that RFHVMn infection in macaques offers an important and relevant model for the study of KSHV in humans. PMID:24109218
Organization and sequence of four flagellin-encoding genes of Edwardsiella icataluri

USDA-ARS?s Scientific Manuscript database

Edwardsiella ictaluri, the cause of enteric septicemia in channel catfish (Ictalurus punctatus), is motile by means of peritrichous flagella. We determined the complete flagellin gene sequences and their organization in E. ictaluri by sequencing genomic segments selected from a lambda-ZAP phage gen...
Jump-and-return sandwiches: A new family of binomial-like selective inversion sequences with improved performance

NASA Astrophysics Data System (ADS)

Brenner, Tom; Chen, Johnny; Stait-Gardner, Tim; Zheng, Gang; Matsukawa, Shingo; Price, William S.

2018-03-01

A new family of binomial-like inversion sequences, named jump-and-return sandwiches (JRS), has been developed by inserting a binomial-like sequence into a standard jump-and-return sequence, discovered through use of a stochastic Genetic Algorithm optimisation. Compared to currently used binomial-like inversion sequences (e.g., 3-9-19 and W5), the new sequences afford wider inversion bands and narrower non-inversion bands with an equal number of pulses. As an example, two jump-and-return sandwich 10-pulse sequences achieved 95% inversion at offsets corresponding to 9.4% and 10.3% of the non-inversion band spacing, compared to 14.7% for the binomial-like W5 inversion sequence, i.e., they afforded non-inversion bands about two thirds the width of the W5 non-inversion band.
Chloroplast and nuclear gene sequences indicate late Pennsylvanian time for the last common ancestor of extant seed plants.

PubMed Central

Savard, L; Li, P; Strauss, S H; Chase, M W; Michaud, M; Bousquet, J

1994-01-01

We have estimated the time for the last common ancestor of extant seed plants by using molecular clocks constructed from the sequences of the chloroplastic gene coding for the large subunit of ribulose-1,5-bisphosphate carboxylase/oxygenase (rbcL) and the nuclear gene coding for the small subunit of rRNA (Rrn18). Phylogenetic analyses of nucleotide sequences indicated that the earliest divergence of extant seed plants is likely represented by a split between conifer-cycad and angiosperm lineages. Relative-rate tests were used to assess homogeneity of substitution rates among lineages, and annual angiosperms were found to evolve at a faster rate than other taxa for rbcL and, thus, these sequences were excluded from construction of molecular clocks. Five distinct molecular clocks were calibrated using substitution rates for the two genes and four divergence times based on fossil and published molecular clock estimates. The five estimated times for the last common ancestor of extant seed plants were in agreement with one another, with an average of 285 million years and a range of 275-290 million years. This implies a substantially more recent ancestor of all extant seed plants than suggested by some theories of plant evolution. PMID:8197201
Complexity of genetic sequences modified by horizontal gene transfer and degraded-DNA uptake

NASA Astrophysics Data System (ADS)

Tremberger, George; Dehipawala, S.; Nguyen, A.; Cheung, E.; Sullivan, R.; Holden, T.; Lieberman, D.; Cheung, T.

2015-09-01

Horizontal gene transfer has been a major vehicle for efficient transfer of genetic materials among living species and could be one of the sources for noncoding DNA incorporation into a genome. Our previous study of lnc- RNA sequence complexity in terms of fractal dimension and information entropy shows a tight regulation among the studied genes in numerous diseases. The role of sequence complexity in horizontal transferred genes was investigated with Mealybug in symbiotic relation with a 139K genome microbe and Deinococcus radiodurans as examples. The fractal dimension and entropy showed correlation R-sq of 0.82 (N = 6) for the studied Deinococcus radiodurans sequences. For comparison the Deinococcus radiodurans oxidative stress tolerant catalase and superoxide dismutase genes under extracellular dGMP growth condition showed R-sq ~ 0.42 (N = 6); and the studied arsenate reductase horizontal transferred genes for toxicity survival in several microorganisms showed no correlation. Simulation results showed that R-sq < 0.4 would be improbable at less than one percent chance, suggestive of additional selection pressure when compared to the R-sq ~ 0.29 (N = 21) in the studied transferred genes in Mealybug. The mild correlation of R-sq ~ 0.5 for fractal dimension versus transcription level in the studied Deinococcus radiodurans sequences upon extracellular dGMP growth condition would suggest that lower fractal dimension with less electron density fluctuation favors higher transcription level.

De Novo RNA Sequencing and Transcriptome Analysis of Colletotrichum gloeosporioides ES026 Reveal Genes Related to Biosynthesis of Huperzine A

PubMed Central

Zhang, Xiangmei; Xia, Qianqian; Zhao, Xinmei; Ahn, Youngjoon; Ahmed, Nevin; Cosoveanu, Andreea; Wang, Mo; Wang, Jialu; Shu, Shaohua

2015-01-01

Huperzine A is important in the treatment of Alzheimer’s disease. There are major challenges for the mass production of huperzine A from plants due to the limited number of huperzine-A-producing plants, as well as the low content of huperzine A in these plants. Various endophytic fungi produce huperzine A. Colletotrichum gloeosporioides ES026 was previously isolated from a huperzine-A-producing plant Huperzia serrata, and this fungus also produces huperzine A. In this study, de novo RNA sequencing of C. gloeosporioides ES026 was carried out with an Illumina HiSeq2000. A total of 4,324,299,051 bp from 50,442,617 high-quality sequence reads of ES026 were obtained. These raw data were assembled into 24,998 unigenes, 40,536,684 residues and 19,790 genes. The majority of the unique sequences were assigned to corresponding putative functions based on BLAST searches of public databases. The molecular functions, biological processes and biochemical pathways of these unique sequences were determined using gene ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) assignments. A gene encoding copper amine oxidase (CAO) (unigene 9322) was annotated for the conversion of cadaverine to 5-aminopentanal in the biosynthesis of huperzine A. This gene was also detected in the root, stem and leaf of H. serrata. Furthermore, a close relationship was observed between expression of the CAO gene (unigene 9322) and quantity of crude huperzine A extracted from ES026. Therefore, CAO might be involved in the biosynthesis of huperzine A and it most likely plays a key role in regulating the content of huperzine A in ES026. PMID:25799531
A useful method for the detection of ethylenediaminetetraacetic acid- and cold agglutinin-dependent pseudothrombocytopenia.

PubMed

Ozcelik, Fatih; Arslan, Erol; Serdar, Muhittin A; Yiginer, Omer; Oztosun, Muzaffer; Kayadibi, Huseyin; Kurt, Ismail

2012-11-01

Pseudothrombocytopenia (PTCP), caused by platelet (PLT) aggregation, is usually associated with ethylenediaminetetraacetic acid (EDTA)-dependent antibodies and cold aggluti-nins against PLT antigens. The aim of this study was to identify the PTCP and discover the most practical method to distinguish it from real thrombocytopenia. This study included 85 patients without hemorrhagic abnormalities and suspected PTCP. Blood samples containing EDTA, citrate and EDTA-kanamycin (KN) were analyzed at room temperature and 37°C. PTCP was detected in 24 of 85 patients. In 23 of 24 patients, EDTA-dependent pseudothrombocytopenia (EDTA-PTCP) was detected; 5 of whom had also the cold agglutinin-dependent PTCP. In only 1 of 24 patients, the cold agglu-tinin-dependent PTCP was found. In this study, no significant difference was observed in leukocyte counts comparing EDTA and citrate blood samples in cases with EDTA-PTCP. In clinical laboratories, a significant portion of the cases with low PLT counts was attributable to EDTA-PTCP and, therefore, did not require treatment. Even if these cases can be detected by bringing the blood samples containing EDTA to 37°C or by adding KN to blood samples containing EDTA, the use of blood samples containing citrate taken for erythrocyte sedimentation rate analysis is a more practical priority method.
Network Inference Analysis Identifies an APRR2-Like Gene Linked to Pigment Accumulation in Tomato and Pepper Fruits1[W][OA

PubMed Central

Pan, Yu; Bradley, Glyn; Pyke, Kevin; Ball, Graham; Lu, Chungui; Fray, Rupert; Marshall, Alexandra; Jayasuta, Subhalai; Baxter, Charles; van Wijk, Rik; Boyden, Laurie; Cade, Rebecca; Chapman, Natalie H.; Fraser, Paul D.; Hodgman, Charlie; Seymour, Graham B.

2013-01-01

Carotenoids represent some of the most important secondary metabolites in the human diet, and tomato (Solanum lycopersicum) is a rich source of these health-promoting compounds. In this work, a novel and fruit-related regulator of pigment accumulation in tomato has been identified by artificial neural network inference analysis and its function validated in transgenic plants. A tomato fruit gene regulatory network was generated using artificial neural network inference analysis and transcription factor gene expression profiles derived from fruits sampled at various points during development and ripening. One of the transcription factor gene expression profiles with a sequence related to an Arabidopsis (Arabidopsis thaliana) ARABIDOPSIS PSEUDO RESPONSE REGULATOR2-LIKE gene (APRR2-Like) was up-regulated at the breaker stage in wild-type tomato fruits and, when overexpressed in transgenic lines, increased plastid number, area, and pigment content, enhancing the levels of chlorophyll in immature unripe fruits and carotenoids in red ripe fruits. Analysis of the transcriptome of transgenic lines overexpressing the tomato APPR2-Like gene revealed up-regulation of several ripening-related genes in the overexpression lines, providing a link between the expression of this tomato gene and the ripening process. A putative ortholog of the tomato APPR2-Like gene in sweet pepper (Capsicum annuum) was associated with pigment accumulation in fruit tissues. We conclude that the function of this gene is conserved across taxa and that it encodes a protein that has an important role in ripening. PMID:23292788
Inhibition of herpes simplex virus 1 gene expression and replication by RNase P-associated external guide sequences.

PubMed

Liu, Jin; Shao, Luyao; Trang, Phong; Yang, Zhu; Reeves, Michael; Sun, Xu; Vu, Gia-Phong; Wang, Yu; Li, Hongjian; Zheng, Congyi; Lu, Sangwei; Liu, Fenyong

2016-06-09

An external guide sequence (EGS) is a RNA sequence which can interact with a target mRNA to form a tertiary structure like a pre-tRNA and recruit intracellular ribonuclease P (RNase P), a tRNA processing enzyme, to degrade target mRNA. Previously, an in vitro selection procedure has been used by us to engineer new EGSs that are more robust in inducing human RNase P to cleave their targeted mRNAs. In this study, we constructed EGSs from a variant to target the mRNA encoding herpes simplex virus 1 (HSV-1) major transcription regulator ICP4, which is essential for the expression of viral early and late genes and viral growth. The EGS variant induced human RNase P cleavage of ICP4 mRNA sequence 60 times better than the EGS generated from a natural pre-tRNA. A decrease of about 97% and 75% in the level of ICP4 gene expression and an inhibition of about 7,000- and 500-fold in viral growth were observed in HSV infected cells expressing the variant and the pre-tRNA-derived EGS, respectively. This study shows that engineered EGSs can inhibit HSV-1 gene expression and viral growth. Furthermore, these results demonstrate the potential for engineered EGS RNAs to be developed and used as anti-HSV therapeutics.
Inhibition of herpes simplex virus 1 gene expression and replication by RNase P-associated external guide sequences

PubMed Central

Liu, Jin; Shao, Luyao; Trang, Phong; Yang, Zhu; Reeves, Michael; Sun, Xu; Vu, Gia-Phong; Wang, Yu; Li, Hongjian; Zheng, Congyi; Lu, Sangwei; Liu, Fenyong

2016-01-01

An external guide sequence (EGS) is a RNA sequence which can interact with a target mRNA to form a tertiary structure like a pre-tRNA and recruit intracellular ribonuclease P (RNase P), a tRNA processing enzyme, to degrade target mRNA. Previously, an in vitro selection procedure has been used by us to engineer new EGSs that are more robust in inducing human RNase P to cleave their targeted mRNAs. In this study, we constructed EGSs from a variant to target the mRNA encoding herpes simplex virus 1 (HSV-1) major transcription regulator ICP4, which is essential for the expression of viral early and late genes and viral growth. The EGS variant induced human RNase P cleavage of ICP4 mRNA sequence 60 times better than the EGS generated from a natural pre-tRNA. A decrease of about 97% and 75% in the level of ICP4 gene expression and an inhibition of about 7,000- and 500-fold in viral growth were observed in HSV infected cells expressing the variant and the pre-tRNA-derived EGS, respectively. This study shows that engineered EGSs can inhibit HSV-1 gene expression and viral growth. Furthermore, these results demonstrate the potential for engineered EGS RNAs to be developed and used as anti-HSV therapeutics. PMID:27279482
Whole Wiskott‑Aldrich syndrome protein gene deletion identified by high throughput sequencing.

PubMed

He, Xiangling; Zou, Runying; Zhang, Bing; You, Yalan; Yang, Yang; Tian, Xin

2017-11-01

Wiskott‑Aldrich syndrome (WAS) is a rare X‑linked recessive immunodeficiency disorder, characterized by thrombocytopenia, small platelets, eczema and recurrent infections associated with increased risk of autoimmunity and malignancy disorders. Mutations in the WAS protein (WASP) gene are responsible for WAS. To date, WASP mutations, including missense/nonsense, splicing, small deletions, small insertions, gross deletions, and gross insertions have been identified in patients with WAS. In addition, WASP‑interacting proteins are suspected in patients with clinical features of WAS, in whom the WASP gene sequence and mRNA levels are normal. The present study aimed to investigate the application of next generation sequencing in definitive diagnosis and clinical therapy for WAS. A 5 month‑old child with WAS who displayed symptoms of thrombocytopenia was examined. Whole exome sequence analysis of genomic DNA showed that the coverage and depth of WASP were extremely low. Quantitative polymerase chain reaction indicated total WASP gene deletion in the proband. In conclusion, high throughput sequencing is useful for the verification of WAS on the genetic profile, and has implications for family planning guidance and establishment of clinical programs.
The complete coding region sequence of river buffalo (Bubalus bubalis) SRY gene.

PubMed

Parma, Pietro; Feligini, Maria; Greppi, Gianfranco; Enne, Giuseppe

2004-02-01

The Y-linked SRY gene is responsible for testis determination in mammals. Mutations in this gene can lead to XY Gonadal Dysgenesis, an abnormal sexual phenotype described in humans, cattle, horses and river buffalo. We report here the complete river buffalo SRY sequence in order to enable the genetic diagnosis of this disease. The SRY sequence was also used to confirm the evolutionary divergence time between cattle and river buffalo 10 million years ago.
Cloning, expression, and sequence analysis of the Bacillus methanolicus C1 methanol dehydrogenase gene.

PubMed Central

de Vries, G E; Arfman, N; Terpstra, P; Dijkhuizen, L

1992-01-01

The gene (mdh) coding for methanol dehydrogenase (MDH) of thermotolerant, methylotroph Bacillus methanolicus C1 has been cloned and sequenced. The deduced amino acid sequence of the mdh gene exhibited similarity to those of five other alcohol dehydrogenase (type III) enzymes, which are distinct from the long-chain zinc-containing (type I) or short-chain zinc-lacking (type II) enzymes. Highly efficient expression of the mdh gene in Escherichia coli was probably driven from its own promoter sequence. After purification of MDH from E. coli, the kinetic and biochemical properties of the enzyme were investigated. The physiological effect of MDH synthesis in E. coli and the role of conserved sequence patterns in type III alcohol dehydrogenases have been analyzed and are discussed. Images PMID:1644761
'2A-Like' Signal Sequences Mediating Translational Recoding: A Novel Form of Dual Protein Targeting.

PubMed

Roulston, Claire; Luke, Garry A; de Felipe, Pablo; Ruan, Lin; Cope, Jonathan; Nicholson, John; Sukhodub, Andriy; Tilsner, Jens; Ryan, Martin D

2016-08-01

We report the initial characterization of an N-terminal oligopeptide '2A-like' sequence that is able to function both as a signal sequence and as a translational recoding element. Owing to this translational recoding activity, two forms of nascent polypeptide are synthesized: (i) when 2A-mediated translational recoding has not occurred: the nascent polypeptide is fused to the 2A-like N-terminal signal sequence and the fusion translation product is targeted to the exocytic pathway, and, (ii) a translation product where 2A-mediated translational recoding has occurred: the 2A-like signal sequence is synthesized as a separate translation product and, therefore, the nascent (downstream) polypeptide lacks the 2A-like signal sequence and is localized to the cytoplasm. This type of dual-functional signal sequence results, therefore, in the partitioning of the translation products between the two sub-cellular sites and represents a newly described form of dual protein targeting. © 2016 The Authors. Traffic published by John Wiley & Sons Ltd.
Expression of Colocasia esculenta tuber agglutinin in Indian mustard provides resistance against Lipaphis erysimi and the expressed protein is non-allergenic.

PubMed

Das, Ayan; Ghosh, Prithwi; Das, Sampa

2018-06-01

Transgenic Brassica juncea plants expressing Colocasia esculenta tuber agglutinin (CEA) shows the non-allergenic nature of the expressed protein leading to enhanced mortality and reduced fecundity of mustard aphid-Lipaphis erysimi. Lipaphis erysimi (common name: mustard aphid) is the most devastating sucking insect pest of Indian mustard (Brassica juncea L.). Colocasia esculenta tuber agglutinin (CEA), a GNA (Galanthus nivalis agglutinin)-related lectin has previously been reported by the present group to be effective against a wide array of hemipteran insects in artificial diet-based bioassays. In the present study, efficacy of CEA in controlling L. erysimi has been established through the development of transgenic B. juncea expressing this novel lectin. Southern hybridization of the transgenic plants confirmed stable integration of cea gene. Expression of CEA in T 0 , T 1 and T 2 transgenic plants was confirmed through western blot analysis. Level of expression of CEA in the T 2 transgenic B. juncea ranged from 0.2 to 0.47% of the total soluble protein. In the in planta insect bioassays, the CEA expressing B. juncea lines exhibited enhanced insect mortality of 70-81.67%, whereas fecundity of L. erysimi was reduced by 49.35-62.11% compared to the control plants. Biosafety assessment of the transgenic B. juncea protein containing CEA was carried out by weight of evidence approach following the recommendations by FAO/WHO (Evaluation of the allergenicity of genetically modified foods: report of a joint FAO/WHO expert consultation, 22-25 Jan, Rome, http://www.fao.org/docrep/007/y0820e/y0820e00.HTM , 2001), Codex (Codex principles and guidelines on foods derived from biotechnology, Food and Agriculture Organization of the United Nations, Rome; Codex, Codex principles and guidelines on foods derived from biotechnology, Food and Agriculture Organization of the United Nations, Rome, 2003) and ICMR (Indian Council of Medical Research, guidelines for safety assessment of
A 454 multiplex sequencing method for rapid and reliable genotyping of highly polymorphic genes in large-scale studies.

PubMed

Galan, Maxime; Guivier, Emmanuel; Caraux, Gilles; Charbonnel, Nathalie; Cosson, Jean-François

2010-05-11

High-throughput sequencing technologies offer new perspectives for biomedical, agronomical and evolutionary research. Promising progresses now concern the application of these technologies to large-scale studies of genetic variation. Such studies require the genotyping of high numbers of samples. This is theoretically possible using 454 pyrosequencing, which generates billions of base pairs of sequence data. However several challenges arise: first in the attribution of each read produced to its original sample, and second, in bioinformatic analyses to distinguish true from artifactual sequence variation. This pilot study proposes a new application for the 454 GS FLX platform, allowing the individual genotyping of thousands of samples in one run. A probabilistic model has been developed to demonstrate the reliability of this method. DNA amplicons from 1,710 rodent samples were individually barcoded using a combination of tags located in forward and reverse primers. Amplicons consisted in 222 bp fragments corresponding to DRB exon 2, a highly polymorphic gene in mammals. A total of 221,789 reads were obtained, of which 153,349 were finally assigned to original samples. Rules based on a probabilistic model and a four-step procedure, were developed to validate sequences and provide a confidence level for each genotype. The method gave promising results, with the genotyping of DRB exon 2 sequences for 1,407 samples from 24 different rodent species and the sequencing of 392 variants in one half of a 454 run. Using replicates, we estimated that the reproducibility of genotyping reached 95%. This new approach is a promising alternative to classical methods involving electrophoresis-based techniques for variant separation and cloning-sequencing for sequence determination. The 454 system is less costly and time consuming and may enhance the reliability of genotypes obtained when high numbers of samples are studied. It opens up new perspectives for the study of evolutionary
Jump-and-return sandwiches: A new family of binomial-like selective inversion sequences with improved performance.

PubMed

Brenner, Tom; Chen, Johnny; Stait-Gardner, Tim; Zheng, Gang; Matsukawa, Shingo; Price, William S

2018-03-01

A new family of binomial-like inversion sequences, named jump-and-return sandwiches (JRS), has been developed by inserting a binomial-like sequence into a standard jump-and-return sequence, discovered through use of a stochastic Genetic Algorithm optimisation. Compared to currently used binomial-like inversion sequences (e.g., 3-9-19 and W5), the new sequences afford wider inversion bands and narrower non-inversion bands with an equal number of pulses. As an example, two jump-and-return sandwich 10-pulse sequences achieved 95% inversion at offsets corresponding to 9.4% and 10.3% of the non-inversion band spacing, compared to 14.7% for the binomial-like W5 inversion sequence, i.e., they afforded non-inversion bands about two thirds the width of the W5 non-inversion band. Copyright © 2018 Elsevier Inc. All rights reserved.
Molecular phylogeny, population genetics, and evolution of heterocystous cyanobacteria using nifH gene sequences.

PubMed

Singh, Prashant; Singh, Satya Shila; Elster, Josef; Mishra, Arun Kumar

2013-06-01

In order to assess phylogeny, population genetics, and approximation of future course of cyanobacterial evolution based on nifH gene sequences, 41 heterocystous cyanobacterial strains collected from all over India have been used in the present study. NifH gene sequence analysis data confirm that the heterocystous cyanobacteria are monophyletic while the stigonematales show polyphyletic origin with grave intermixing. Further, analysis of nifH gene sequence data using intricate mathematical extrapolations revealed that the nucleotide diversity and recombination frequency is much greater in Nostocales than the Stigonematales. Similarly, DNA divergence studies showed significant values of divergence with greater gene conversion tracts in the unbranched (Nostocales) than the branched (Stigonematales) strains. Our data strongly support the origin of true branching cyanobacterial strains from the unbranched strains.
Fine-tuning gene networks using simple sequence repeats

PubMed Central

Egbert, Robert G.; Klavins, Eric

2012-01-01

The parameters in a complex synthetic gene network must be extensively tuned before the network functions as designed. Here, we introduce a simple and general approach to rapidly tune gene networks in Escherichia coli using hypermutable simple sequence repeats embedded in the spacer region of the ribosome binding site. By varying repeat length, we generated expression libraries that incrementally and predictably sample gene expression levels over a 1,000-fold range. We demonstrate the utility of the approach by creating a bistable switch library that programmatically samples the expression space to balance the two states of the switch, and we illustrate the need for tuning by showing that the switch’s behavior is sensitive to host context. Further, we show that mutation rates of the repeats are controllable in vivo for stability or for targeted mutagenesis—suggesting a new approach to optimizing gene networks via directed evolution. This tuning methodology should accelerate the process of engineering functionally complex gene networks. PMID:22927382
DNA-binding proteins from marine bacteria expand the known sequence diversity of TALE-like repeats

PubMed Central

de Lange, Orlando; Wolf, Christina; Thiel, Philipp; Krüger, Jens; Kleusch, Christian; Kohlbacher, Oliver; Lahaye, Thomas

2015-01-01

Transcription Activator-Like Effectors (TALEs) of Xanthomonas bacteria are programmable DNA binding proteins with unprecedented target specificity. Comparative studies into TALE repeat structure and function are hindered by the limited sequence variation among TALE repeats. More sequence-diverse TALE-like proteins are known from Ralstonia solanacearum (RipTALs) and Burkholderia rhizoxinica (Bats), but RipTAL and Bat repeats are conserved with those of TALEs around the DNA-binding residue. We study two novel marine-organism TALE-like proteins (MOrTL1 and MOrTL2), the first to date of non-terrestrial origin. We have assessed their DNA-binding properties and modelled repeat structures. We found that repeats from these proteins mediate sequence specific DNA binding conforming to the TALE code, despite low sequence similarity to TALE repeats, and with novel residues around the BSR. However, MOrTL1 repeats show greater sequence discriminating power than MOrTL2 repeats. Sequence alignments show that there are only three residues conserved between repeats of all TALE-like proteins including the two new additions. This conserved motif could prove useful as an identifier for future TALE-likes. Additionally, comparing MOrTL repeats with those of other TALE-likes suggests a common evolutionary origin for the TALEs, RipTALs and Bats. PMID:26481363
Cloning and sequencing of a laccase gene from the lignin-degrading basidiomycete Pleurotus ostreatus.

PubMed Central

Giardina, P; Cannio, R; Martirani, L; Marzullo, L; Palmieri, G; Sannia, G

1995-01-01

The gene (pox1) encoding a phenol oxidase from Pleurotus ostreatus, a lignin-degrading basidiomycete, was cloned and sequenced, and the corresponding pox1 cDNA was also synthesized and sequenced. The isolated gene consists of 2,592 bp, with the coding sequence being interrupted by 19 introns and flanked by an upstream region in which putative CAAT and TATA consensus sequences could be identified at positions -174 and -84, respectively. The isolation of a second cDNA (pox2 cDNA), showing 84% similarity, and of the corresponding truncated genomic clones demonstrated the existence of a multigene family coding for isoforms of laccase in P. ostreatus. PCR amplifications of specific regions on the DNA of isolated monokaryons proved that the two genes are not allelic forms. The POX1 amino acid sequence deduced was compared with those of other known laccases from different fungi. PMID:7793961
Genetic diversity of avenin-like b genes in Aegilops tauschii Coss.

PubMed

Cao, Dong; Wang, Hongxia; Zhang, Bo; Liu, Baolong; Liu, Dengcai; Chen, Wenjie; Zhang, Huaigang

2018-02-01

Avenin-like storage proteins influence the rheological properties and processing quality in common wheat, and the discovery of new alleles will benefit wheat quality improvement. In this study, 13 avenin-like b alleles (TaALPb7D-A-M) were discovered in 108 Aegilops tauschii Coss. accessions. Ten alleles were reported for the first time, while the remaining three alleles were the same as alleles in other species. A total of 15 nucleotide changes were detected in the 13 alleles, resulting in only 11 amino acid changes because of synonymous mutations. Alleles TaALPb7D-E, TaALPb7D-G, and TaALPb7D-J encoded the same protein. These polymorphic sites existed in the N-terminus, Repetitive region (Left), Repetitive region (Right) and C-terminus domains, with no polymorphisms in the signal peptide sequence nor in those encoding the 18 conserved cysteine residues. Phylogenetic analysis divided the TaALPb7Ds into four clades. The Ae. tauschii alleles were distributed in all four clades, while the alleles derived from common wheat, TaALPb7D-G and TaALPb7D-C, belonged to clade III and IV, respectively. Alleles TaALPb7D-G and TaALPb7D-C were the most widely distributed, being present in nine and six countries, respectively. Iran and Turkey exhibited the highest genetic diversity with respect to TaALPb7D alleles, accessions from these countries carrying seven and six alleles, respectively, which implied that these countries were the centers of origin of the avenin-like b gene. The new alleles discovered and the phylogenetic analysis of avenin-like b genes will provide breeding materials and a theoretical basis for wheat quality improvement.
Isolation and characterization of a SEPALLATA-like gene, ZjMADS1, from marine angiosperm Zostera japonica.

PubMed

Kakinuma, Makoto; Inoue, Miho; Morita, Teruwo; Tominaga, Hiroshi; Maegawa, Miyuki; Coury, Daniel A; Amano, Hideomi

2012-05-01

In flowering plants, floral homeotic MADS-box genes, which constitute a large multigene family, play important roles in the specification of floral organs as defined by the ABCDE model. In this study, a MADS-box gene, ZjMADS1, was isolated and characterized from the marine angiosperm Zostera japonica. The predicted length of the ZjMADS1 protein was 246 amino acids (AA), and the AA sequence was most similar to those of the SEPALLATA (SEP) subfamily, corresponding to E-function genes. Southern blot analysis suggested the presence of two SEP3-like genes in the Z. japonica genome. ZjMADS1 mRNA levels were extremely high in the spadices, regardless of the developmental stage, compared to other organs from the reproductive and vegetative shoots. These results suggest that the ZjMADS1 gene may be involved in spadix development in Z. japonica and act as an E-function gene in floral organ development in marine angiosperms. Copyright © 2011 Elsevier Ltd. All rights reserved.
AST: An Automated Sequence-Sampling Method for Improving the Taxonomic Diversity of Gene Phylogenetic Trees

PubMed Central

Zhou, Chan; Mao, Fenglou; Yin, Yanbin; Huang, Jinling; Gogarten, Johann Peter; Xu, Ying

2014-01-01

A challenge in phylogenetic inference of gene trees is how to properly sample a large pool of homologous sequences to derive a good representative subset of sequences. Such a need arises in various applications, e.g. when (1) accuracy-oriented phylogenetic reconstruction methods may not be able to deal with a large pool of sequences due to their high demand in computing resources; (2) applications analyzing a collection of gene trees may prefer to use trees with fewer operational taxonomic units (OTUs), for instance for the detection of horizontal gene transfer events by identifying phylogenetic conflicts; and (3) the pool of available sequences is biased towards extensively studied species. In the past, the creation of subsamples often relied on manual selection. Here we present an Automated sequence-Sampling method for improving the Taxonomic diversity of gene phylogenetic trees, AST, to obtain representative sequences that maximize the taxonomic diversity of the sampled sequences. To demonstrate the effectiveness of AST, we have tested it to solve four problems, namely, inference of the evolutionary histories of the small ribosomal subunit protein S5 of E. coli, 16 S ribosomal RNAs and glycosyl-transferase gene family 8, and a study of ancient horizontal gene transfers from bacteria to plants. Our results show that the resolution of our computational results is almost as good as that of manual inference by domain experts, hence making the tool generally useful to phylogenetic studies by non-phylogeny specialists. The program is available at http://csbl.bmb.uga.edu/~zhouchan/AST.php. PMID:24892935
AST: an automated sequence-sampling method for improving the taxonomic diversity of gene phylogenetic trees.

PubMed

Zhou, Chan; Mao, Fenglou; Yin, Yanbin; Huang, Jinling; Gogarten, Johann Peter; Xu, Ying

2014-01-01

A challenge in phylogenetic inference of gene trees is how to properly sample a large pool of homologous sequences to derive a good representative subset of sequences. Such a need arises in various applications, e.g. when (1) accuracy-oriented phylogenetic reconstruction methods may not be able to deal with a large pool of sequences due to their high demand in computing resources; (2) applications analyzing a collection of gene trees may prefer to use trees with fewer operational taxonomic units (OTUs), for instance for the detection of horizontal gene transfer events by identifying phylogenetic conflicts; and (3) the pool of available sequences is biased towards extensively studied species. In the past, the creation of subsamples often relied on manual selection. Here we present an Automated sequence-Sampling method for improving the Taxonomic diversity of gene phylogenetic trees, AST, to obtain representative sequences that maximize the taxonomic diversity of the sampled sequences. To demonstrate the effectiveness of AST, we have tested it to solve four problems, namely, inference of the evolutionary histories of the small ribosomal subunit protein S5 of E. coli, 16 S ribosomal RNAs and glycosyl-transferase gene family 8, and a study of ancient horizontal gene transfers from bacteria to plants. Our results show that the resolution of our computational results is almost as good as that of manual inference by domain experts, hence making the tool generally useful to phylogenetic studies by non-phylogeny specialists. The program is available at http://csbl.bmb.uga.edu/~zhouchan/AST.php.

Using complementary DNA from MyoD-transduced fibroblasts to sequence large muscle genes.

PubMed

Waddell, Leigh B; Monnier, Nicole; Cooper, Sandra T; North, Kathryn N; Clarke, Nigel F

2011-08-01

Large muscle genes are often sequenced using complementary DNA (cDNA) made from muscle messenger RNA (mRNA) to reduce the cost and workload associated with sequencing from genomic DNA. Two potential barriers are the availability of a frozen muscle biopsy, and difficulties in detecting nonsense mutations due to nonsense-mediated mRNA decay (NMD). We present patient examples showing that use of MyoD-transduced fibroblasts as a source of muscle-specific mRNA overcomes these potential difficulties in sequencing large muscle-related genes. Copyright © 2011 Wiley Periodicals, Inc.
Shewanella species as the origin of blaOXA-48 genes: insights into gene diversity, associated phenotypes and possible transfer mechanisms.

PubMed

Tacão, Marta; Araújo, Susana; Vendas, Maria; Alves, Artur; Henriques, Isabel

2018-03-01

Chromosome-encoded beta-lactamases of Shewanella spp. have been indicated as probable progenitors of bla OXA-48 -like genes. However, these have been detected in few Shewanella spp. and dissemination mechanisms are unclear. Thus, our main objective was to confirm the role of Shewanella species as progenitors of bla OXA-48 -like genes. In silico analysis of Shewanella genomes was performed to detect bla OXA-48 -like genes and context, and 43 environmental Shewanella spp. were characterised. Clonal relatedness was determined by BOX-PCR. Phylogenetic affiliation was assessed by 16S rDNA and gyrB sequencing. Antibiotic susceptibility phenotypes were determined. The bla OXA-48 -like genes and genetic context were inspected by PCR, hybridisation and sequence analysis. Gene variants were cloned in Escherichia coli and MICs were determined. Shewanella isolates were screened for integrons, plasmids and insertion sequences. Analysis of Shewanella spp. genomes showed that putative bla OXA-48 -like is present in the majority and in an identical context. Isolates presenting unique BOX profiles affiliated with 11 Shewanella spp. bla OXA-48 -like genes were detected in 22 isolates from 6 species. Genes encoded enzymes identical to OXA-48, OXA-204, OXA-181, and 7 new variants differing from OXA-48 from 2 to 82 amino acids. IS1999 was detected in 24 isolates, although not in the vicinity of bla OXA-48 genes. Recombinant E. coli strains presented altered MICs. The presence/absence of bla OXA-48 -like genes was species-related. Gene variants encoded enzymes with hydrolytic spectra similar to OXA-48-like from non-shewanellae. From the mobile elements previously described in association with bla OXA-48 -like genes, only the IS1999 was found in Shewanella, which indicates its relevance in bla OXA-48 -like genes transfer to other hosts. Copyright © 2017 Elsevier B.V. and International Society of Chemotherapy. All rights reserved.
Virus-specific DNA sequences present in cells which carry the herpes simplex virus thymidine kinase gene.

PubMed

Minson, A C; Darby, G K; Wildy, P

1979-11-01

Two independently derived cell lines which carry the herpes simplex type 2 thymidine kinase gene have been examined for the presence of HSV-2-specific DNA sequences. Both cell lines contained 1 to 3 copies per cell of a sequence lying within map co-ordinates 0.2 to 0.4 of the HSV-2 genome. Revertant cells, which contained no detectable thymidine kinase, did not contain this DNA sequence. The failure of EcoR1-restricted HSV-2 DNA to act as a donor of the thymidine kinase gene in transformation experiments suggests that the gene lies close to the EcoR1 restriction site within this sequence at a map position of approx. 0.3. The HSV-2 kinase gene is therefore approximately co-linear with the HSV-1 gene.
Cloning and sequencing of the alcohol dehydrogenase II gene from Zymomonas mobilis

DOEpatents

Ingram, Lonnie O.; Conway, Tyrrell

1992-01-01

The alcohol dehydrogenase II gene from Zymomonas mobilis has been cloned and sequenced. This gene can be expressed at high levels in other organisms to produce acetaldehyde or to convert acetaldehyde to ethanol.
VRprofile: gene-cluster-detection-based profiling of virulence and antibiotic resistance traits encoded within genome sequences of pathogenic bacteria.

PubMed

Li, Jun; Tai, Cui; Deng, Zixin; Zhong, Weihong; He, Yongqun; Ou, Hong-Yu

2017-01-10

VRprofile is a Web server that facilitates rapid investigation of virulence and antibiotic resistance genes, as well as extends these trait transfer-related genetic contexts, in newly sequenced pathogenic bacterial genomes. The used backend database MobilomeDB was firstly built on sets of known gene cluster loci of bacterial type III/IV/VI/VII secretion systems and mobile genetic elements, including integrative and conjugative elements, prophages, class I integrons, IS elements and pathogenicity/antibiotic resistance islands. VRprofile is thus able to co-localize the homologs of these conserved gene clusters using HMMer or BLASTp searches. With the integration of the homologous gene cluster search module with a sequence composition module, VRprofile has exhibited better performance for island-like region predictions than the other widely used methods. In addition, VRprofile also provides an integrated Web interface for aligning and visualizing identified gene clusters with MobilomeDB-archived gene clusters, or a variety set of bacterial genomes. VRprofile might contribute to meet the increasing demands of re-annotations of bacterial variable regions, and aid in the real-time definitions of disease-relevant gene clusters in pathogenic bacteria of interest. VRprofile is freely available at http://bioinfo-mml.sjtu.edu.cn/VRprofile. © The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Changing Hydrozoan Bauplans by Silencing Hox-Like Genes

PubMed Central

Jakob, Wolfgang; Schierwater, Bernd

2007-01-01

Regulatory genes of the Antp class have been a major factor for the invention and radiation of animal bauplans. One of the most diverse animal phyla are the Cnidaria, which are close to the root of metazoan life and which often appear in two distinct generations and a remarkable variety of body forms. Hox-like genes have been known to be involved in axial patterning in the Cnidaria and have been suspected to play roles in the genetic control of many of the observed bauplan changes. Unfortunately RNAi mediated gene silencing studies have not been satisfactory for marine invertebrate organisms thus far. No direct evidence supporting Hox-like gene induced bauplan changes in cnidarians have been documented as of yet. Herein, we report a protocol for RNAi transfection of marine invertebrates and demonstrate that knock downs of Hox-like genes in Cnidaria create substantial bauplan alterations, including the formation of multiple oral poles (“heads”) by Cnox-2 and Cnox-3 inhibition, deformation of the main body axis by Cnox-5 inhibition and duplication of tentacles by Cnox-1 inhibition. All phenotypes observed in the course of the RNAi studies were identical to those obtained by morpholino antisense oligo experiments and are reminiscent of macroevolutionary bauplan changes. The reported protocol will allow routine RNAi studies in marine invertebrates to be established. PMID:17668071
How the Sequence of a Gene Specifies Structural Symmetry in Proteins

PubMed Central

Shen, Xiaojuan; Huang, Tongcheng; Wang, Guanyu; Li, Guanglin

2015-01-01

Internal symmetry is commonly observed in the majority of fundamental protein folds. Meanwhile, sufficient evidence suggests that nascent polypeptide chains of proteins have the potential to start the co-translational folding process and this process allows mRNA to contain additional information on protein structure. In this paper, we study the relationship between gene sequences and protein structures from the viewpoint of symmetry to explore how gene sequences code for structural symmetry in proteins. We found that, for a set of two-fold symmetric proteins from left-handed beta-helix fold, intragenic symmetry always exists in their corresponding gene sequences. Meanwhile, codon usage bias and local mRNA structure might be involved in modulating translation speed for the formation of structural symmetry: a major decrease of local codon usage bias in the middle of the codon sequence can be identified as a common feature; and major or consecutive decreases in local mRNA folding energy near the boundaries of the symmetric substructures can also be observed. The results suggest that gene duplication and fusion may be an evolutionarily conserved process for this protein fold. In addition, the usage of rare codons and the formation of higher order of secondary structure near the boundaries of symmetric substructures might have coevolved as conserved mechanisms to slow down translation elongation and to facilitate effective folding of symmetric substructures. These findings provide valuable insights into our understanding of the mechanisms of translation and its evolution, as well as the design of proteins via symmetric modules. PMID:26641668
Mitochondrial sequences of Seriatopora corals show little agreement with morphology and reveal the duplication of a tRNA gene near the control region

NASA Astrophysics Data System (ADS)

Flot, J.-F.; Licuanan, W. Y.; Nakano, Y.; Payri, C.; Cruaud, C.; Tillier, S.

2008-12-01

The taxonomy of corals of the genus Seriatopora has not previously been studied using molecular sequence markers. As a first step toward a re-evaluation of species boundaries in this genus, mitochondrial sequence variability was analyzed in 51 samples collected from Okinawa, New Caledonia, and the Philippines. Four clusters of sequences were detected that showed little concordance with species currently recognized on a morphological basis. The most likely explanation is that the skeletal characters used for species identification are highly variable (polymorphic or phenotypically plastic); alternative explanations include introgression/hybridization, or deep coalescence and the retention of ancestral mitochondrial polymorphisms. In all individuals sequenced, two copies of trnW were found on either side of the atp8 gene near the putative D-loop, a novel mitochondrial gene arrangement that may have arisen from a duplication of the trnW-atp8 region followed by a deletion of one atp8.
Three copies of a single protein II-encoding sequence in the genome of Neisseria gonorrhoeae JS3: evidence for gene conversion and gene duplication.

PubMed

van der Ley, P

1988-11-01

Gonococci express a family of related outer membrane proteins designated protein II (P.II). These surface proteins are subject to both phase variation and antigenic variation. The P.II gene repertoire of Neisseria gonorrhoeae strain JS3 was found to consist of at least ten genes, eight of which were cloned. Sequence analysis and DNA hybridization studies revealed that one particular P.II-encoding sequence is present in three distinct, but almost identical, copies in the JS3 genome. These genes encode the P.II protein that was previously identified as P.IIc. Comparison of their sequences shows that the multiple copies of this P.IIc-encoding gene might have been generated by both gene conversion and gene duplication.
Successful Recovery of Nuclear Protein-Coding Genes from Small Insects in Museums Using Illumina Sequencing.

PubMed

Kanda, Kojun; Pflug, James M; Sproul, John S; Dasenko, Mark A; Maddison, David R

2015-01-01

In this paper we explore high-throughput Illumina sequencing of nuclear protein-coding, ribosomal, and mitochondrial genes in small, dried insects stored in natural history collections. We sequenced one tenebrionid beetle and 12 carabid beetles ranging in size from 3.7 to 9.7 mm in length that have been stored in various museums for 4 to 84 years. Although we chose a number of old, small specimens for which we expected low sequence recovery, we successfully recovered at least some low-copy nuclear protein-coding genes from all specimens. For example, in one 56-year-old beetle, 4.4 mm in length, our de novo assembly recovered about 63% of approximately 41,900 nucleotides in a target suite of 67 nuclear protein-coding gene fragments, and 70% using a reference-based assembly. Even in the least successfully sequenced carabid specimen, reference-based assembly yielded fragments that were at least 50% of the target length for 34 of 67 nuclear protein-coding gene fragments. Exploration of alternative references for reference-based assembly revealed few signs of bias created by the reference. For all specimens we recovered almost complete copies of ribosomal and mitochondrial genes. We verified the general accuracy of the sequences through comparisons with sequences obtained from PCR and Sanger sequencing, including of conspecific, fresh specimens, and through phylogenetic analysis that tested the placement of sequences in predicted regions. A few possible inaccuracies in the sequences were detected, but these rarely affected the phylogenetic placement of the samples. Although our sample sizes are low, an exploratory regression study suggests that the dominant factor in predicting success at recovering nuclear protein-coding genes is a high number of Illumina reads, with success at PCR of COI and killing by immersion in ethanol being secondary factors; in analyses of only high-read samples, the primary significant explanatory variable was body length, with small beetles
Successful Recovery of Nuclear Protein-Coding Genes from Small Insects in Museums Using Illumina Sequencing

PubMed Central

Dasenko, Mark A.

2015-01-01

In this paper we explore high-throughput Illumina sequencing of nuclear protein-coding, ribosomal, and mitochondrial genes in small, dried insects stored in natural history collections. We sequenced one tenebrionid beetle and 12 carabid beetles ranging in size from 3.7 to 9.7 mm in length that have been stored in various museums for 4 to 84 years. Although we chose a number of old, small specimens for which we expected low sequence recovery, we successfully recovered at least some low-copy nuclear protein-coding genes from all specimens. For example, in one 56-year-old beetle, 4.4 mm in length, our de novo assembly recovered about 63% of approximately 41,900 nucleotides in a target suite of 67 nuclear protein-coding gene fragments, and 70% using a reference-based assembly. Even in the least successfully sequenced carabid specimen, reference-based assembly yielded fragments that were at least 50% of the target length for 34 of 67 nuclear protein-coding gene fragments. Exploration of alternative references for reference-based assembly revealed few signs of bias created by the reference. For all specimens we recovered almost complete copies of ribosomal and mitochondrial genes. We verified the general accuracy of the sequences through comparisons with sequences obtained from PCR and Sanger sequencing, including of conspecific, fresh specimens, and through phylogenetic analysis that tested the placement of sequences in predicted regions. A few possible inaccuracies in the sequences were detected, but these rarely affected the phylogenetic placement of the samples. Although our sample sizes are low, an exploratory regression study suggests that the dominant factor in predicting success at recovering nuclear protein-coding genes is a high number of Illumina reads, with success at PCR of COI and killing by immersion in ethanol being secondary factors; in analyses of only high-read samples, the primary significant explanatory variable was body length, with small beetles
Rhizobium meliloti anthranilate synthase gene: cloning, sequence, and expression in Escherichia coli.

PubMed Central

Bae, Y M; Holmgren, E; Crawford, I P

1989-01-01

We determined the DNA sequence of the Rhizobium meliloti gene encoding anthranilate synthase, the first enzyme of the tryptophan pathway. Sequences similar to those seen for the two subunits of the enzyme as found in all other procaryotic species studied are present in a single open reading frame of 729 codons. This apparent gene fusion joins the C terminus of the large subunit (TrpE) to the N terminus of the small subunit (TrpG) through a short connecting segment. We designate the fused gene trpE(G). The gene is flanked by a typical rho-independent terminator at the 3' end and a complex regulatory region at the 5' end resembling those of operons under transcriptional attenuation control. The location of the promoter was determined by S1 nuclease protection, using Rhizobium mRNA. Although this promoter was inactive in Escherichia coli, mutations eliciting activity were easily obtained. One of these was a C----T change at position -9 in the -10 region. The +1 position of the mRNA is the first base of the initiation codon of the leader peptide, implying that unlike trpE(G), which has a normal Shine-Dalgarno sequence, the leader peptide gene lacks a ribosome-binding site. Images PMID:2656657
Genome-wide identification, phylogeny and expression analyses of SCARECROW-LIKE(SCL) genes in millet (Setaria italica).

PubMed

Liu, Hongyun; Qin, Jiajia; Fan, Hui; Cheng, Jinjin; Li, Lin; Liu, Zheng

2017-07-01

As a member of the GRAS gene family, SCARECROW - LIKE ( SCL ) genes encode transcriptional regulators that are involved in plant information transmission and signal transduction. In this study, 44 SCL genes including two SCARECROW genes in millet were identified to be distributed on eight chromosomes, except chromosome 6. All the millet genes contain motifs 6-8, indicating that these motifs are conserved during the evolution. SCL genes of millet were divided into eight groups based on the phylogenetic relationship and classification of Arabidopsis SCL genes. Several putative millet orthologous genes in Arabidopsis , maize and rice were identified. High throughput RNA sequencing revealed that the expressions of millet SCL genes in root, stem, leaf, spica, and along leaf gradient varied greatly. Analyses combining the gene expression patterns, gene structures, motif compositions, promoter cis -elements identification, alternative splicing of transcripts and phylogenetic relationship of SCL genes indicate that the these genes may play diverse functions. Functionally characterized SCL genes in maize, rice and Arabidopsis would provide us some clues for future characterization of their homologues in millet. To the best of our knowledge, this is the first study of millet SCL genes at the genome wide level. Our work provides a useful platform for functional analysis of SCL genes in millet, a model crop for C 4 photosynthesis and bioenergy studies.
Differential gene expression in the siphonophore Nanomia bijuga (Cnidaria) assessed with multiple next-generation sequencing workflows.

PubMed

Siebert, Stefan; Robinson, Mark D; Tintori, Sophia C; Goetz, Freya; Helm, Rebecca R; Smith, Stephen A; Shaner, Nathan; Haddock, Steven H D; Dunn, Casey W

2011-01-01

We investigated differential gene expression between functionally specialized feeding polyps and swimming medusae in the siphonophore Nanomia bijuga (Cnidaria) with a hybrid long-read/short-read sequencing strategy. We assembled a set of partial gene reference sequences from long-read data (Roche 454), and generated short-read sequences from replicated tissue samples that were mapped to the references to quantify expression. We collected and compared expression data with three short-read expression workflows that differ in sample preparation, sequencing technology, and mapping tools. These workflows were Illumina mRNA-Seq, which generates sequence reads from random locations along each transcript, and two tag-based approaches, SOLiD SAGE and Helicos DGE, which generate reads from particular tag sites. Differences in expression results across workflows were mostly due to the differential impact of missing data in the partial reference sequences. When all 454-derived gene reference sequences were considered, Illumina mRNA-Seq detected more than twice as many differentially expressed (DE) reference sequences as the tag-based workflows. This discrepancy was largely due to missing tag sites in the partial reference that led to false negatives in the tag-based workflows. When only the subset of reference sequences that unambiguously have tag sites was considered, we found broad congruence across workflows, and they all identified a similar set of DE sequences. Our results are promising in several regards for gene expression studies in non-model organisms. First, we demonstrate that a hybrid long-read/short-read sequencing strategy is an effective way to collect gene expression data when an annotated genome sequence is not available. Second, our replicated sampling indicates that expression profiles are highly consistent across field-collected animals in this case. Third, the impacts of partial reference sequences on the ability to detect DE can be mitigated through
Differential Gene Expression in the Siphonophore Nanomia bijuga (Cnidaria) Assessed with Multiple Next-Generation Sequencing Workflows

PubMed Central

Siebert, Stefan; Robinson, Mark D.; Tintori, Sophia C.; Goetz, Freya; Helm, Rebecca R.; Smith, Stephen A.; Shaner, Nathan; Haddock, Steven H. D.; Dunn, Casey W.

2011-01-01

We investigated differential gene expression between functionally specialized feeding polyps and swimming medusae in the siphonophore Nanomia bijuga (Cnidaria) with a hybrid long-read/short-read sequencing strategy. We assembled a set of partial gene reference sequences from long-read data (Roche 454), and generated short-read sequences from replicated tissue samples that were mapped to the references to quantify expression. We collected and compared expression data with three short-read expression workflows that differ in sample preparation, sequencing technology, and mapping tools. These workflows were Illumina mRNA-Seq, which generates sequence reads from random locations along each transcript, and two tag-based approaches, SOLiD SAGE and Helicos DGE, which generate reads from particular tag sites. Differences in expression results across workflows were mostly due to the differential impact of missing data in the partial reference sequences. When all 454-derived gene reference sequences were considered, Illumina mRNA-Seq detected more than twice as many differentially expressed (DE) reference sequences as the tag-based workflows. This discrepancy was largely due to missing tag sites in the partial reference that led to false negatives in the tag-based workflows. When only the subset of reference sequences that unambiguously have tag sites was considered, we found broad congruence across workflows, and they all identified a similar set of DE sequences. Our results are promising in several regards for gene expression studies in non-model organisms. First, we demonstrate that a hybrid long-read/short-read sequencing strategy is an effective way to collect gene expression data when an annotated genome sequence is not available. Second, our replicated sampling indicates that expression profiles are highly consistent across field-collected animals in this case. Third, the impacts of partial reference sequences on the ability to detect DE can be mitigated through
Typing of Panton-Valentine Leukocidin-Encoding Phages and lukSF-PV Gene Sequence Variation in Staphylococcus aureus from China.

PubMed

Zhao, Huanqiang; Hu, Fupin; Jin, Shu; Xu, Xiaogang; Zou, Yuhan; Ding, Baixing; He, Chunyan; Gong, Fang; Liu, Qingzhong

2016-01-01

Panton-Valentine leukocidin (PVL, encoded by lukSF-PV genes), a bi-component and pore-forming toxin, is carried by different staphylococcal bacteriophages. The prevalence of PVL in Staphylococcus aureus has been reported around the globe. However, the data on PVL-encoding phage types, lukSF-PV gene variation and chromosomal phage insertion sites for PVL-positive S. aureus are limited, especially in China. In order to obtain a more complete understanding of the molecular epidemiology of PVL-positive S. aureus, an integrated and modified PCR-based scheme was applied to detect the PVL-encoding phage types. Phage insertion locus and the lukSF-PV variant were determined by PCR and sequencing. Meanwhile, the genetic background was characterized by staphylococcal cassette chromosome mec (SCCmec) typing, staphylococcal protein A (spa) gene polymorphisms typing, pulsed-field gel electrophoresis (PFGE) typing, accessory gene regulator (agr) locus typing and multilocus sequence typing (MLST). Seventy eight (78/1175, 6.6%) isolates possessed the lukSF-PV genes and 59.0% (46/78) of PVL-positive strains belonged to CC59 lineage. Eight known different PVL-encoding phage types were detected, and Φ7247PVL/ΦST5967PVL (n = 13) and ΦPVL (n = 12) were the most prevalent among them. While 25 (25/78, 32.1%) isolates, belonging to ST30, and ST59 clones, were unable to be typed by the modified PCR-based scheme. Single nucleotide polymorphisms (SNPs) were identified at five locations in the lukSF-PV genes, two of which were non-synonymous. Maximum-likelihood tree analysis of attachment sites sequences detected six SNP profiles for attR and eight for attL, respectively. In conclusion, the PVL-positive S. aureus mainly harbored Φ7247PVL/ΦST5967PVL and ΦPVL in the regions studied. lukSF-PV gene sequences, PVL-encoding phages, and phage insertion locus generally varied with lineages. Moreover, PVL-positive clones that have emerged worldwide likely carry distinct phages.
The human myelin oligodendrocyte glycoprotein (MOG) gene: Complete nucleotide sequence and structural characterization

DOE Office of Scientific and Technical Information (OSTI.GOV)

Paule Roth, M.; Malfroy, L.; Offer, C.

1995-07-20

Human myelin oligodendrocyte glycoprotein (MOG), a myelin component of the central nervous system, is a candidate target antigen for autoimmune-mediated demyelination. We have isolated and sequenced part of a cosmid clone that contains the entire human MOG gene. The primary nuclear transcript, extending from the putative start of transcription to the site of poly(A) addition, is 15,561 nucleotides in length. The human MOG gene contains 8 exons, separated by 7 introns; canonical intron/exon boundary sites are observed at each junction. The introns vary in size from 242 to 6484 bp and contain numerous repetitive DNA elements, including 14 Alu sequencesmore » within 3 introns. Another Alu element is located in the 3{prime}-untranslated region of the gene. Alu sequences were classified with respect to subfamily assignment. Seven hundred sixty-three nucleotides 5{prime} of the transcription start and 1214 nucleotides 3{prime} of the poly(A) addition sites were also sequenced. The 5{prime}-flanking region revealed the presence of several consensus sequences that could be relevant in the transcription of the MOG gene, in particular binding sites in common with other myelin gene promoters. Two polymorphic intragenic dinucleotide (CA){sub n} and tetranucleotide (TAAA){sub n} repeats were identified and may provide genetic marker tools for association and linkage studies. 50 refs., 3 figs., 3 tabs.« less
Evaluation of second-generation sequencing of 19 dilated cardiomyopathy genes for clinical applications.

PubMed

Gowrisankar, Sivakumar; Lerner-Ellis, Jordan P; Cox, Stephanie; White, Emily T; Manion, Megan; LeVan, Kevin; Liu, Jonathan; Farwell, Lisa M; Iartchouk, Oleg; Rehm, Heidi L; Funke, Birgit H

2010-11-01

Medical sequencing for diseases with locus and allelic heterogeneities has been limited by the high cost and low throughput of traditional sequencing technologies. "Second-generation" sequencing (SGS) technologies allow the parallel processing of a large number of genes and, therefore, offer great promise for medical sequencing; however, their use in clinical laboratories is still in its infancy. Our laboratory offers clinical resequencing for dilated cardiomyopathy (DCM) using an array-based platform that interrogates 19 of more than 30 genes known to cause DCM. We explored both the feasibility and cost effectiveness of using PCR amplification followed by SGS technology for sequencing these 19 genes in a set of five samples enriched for known sequence alterations (109 unique substitutions and 27 insertions and deletions). While the analytical sensitivity for substitutions was comparable to that of the DCM array (98%), SGS technology performed better than the DCM array for insertions and deletions (90.6% versus 58%). Overall, SGS performed substantially better than did the current array-based testing platform; however, the operational cost and projected turnaround time do not meet our current standards. Therefore, efficient capture methods and/or sample pooling strategies that shorten the turnaround time and decrease reagent and labor costs are needed before implementing this platform into routine clinical applications.
DNA-binding proteins from marine bacteria expand the known sequence diversity of TALE-like repeats.

PubMed

de Lange, Orlando; Wolf, Christina; Thiel, Philipp; Krüger, Jens; Kleusch, Christian; Kohlbacher, Oliver; Lahaye, Thomas

2015-11-16

Transcription Activator-Like Effectors (TALEs) of Xanthomonas bacteria are programmable DNA binding proteins with unprecedented target specificity. Comparative studies into TALE repeat structure and function are hindered by the limited sequence variation among TALE repeats. More sequence-diverse TALE-like proteins are known from Ralstonia solanacearum (RipTALs) and Burkholderia rhizoxinica (Bats), but RipTAL and Bat repeats are conserved with those of TALEs around the DNA-binding residue. We study two novel marine-organism TALE-like proteins (MOrTL1 and MOrTL2), the first to date of non-terrestrial origin. We have assessed their DNA-binding properties and modelled repeat structures. We found that repeats from these proteins mediate sequence specific DNA binding conforming to the TALE code, despite low sequence similarity to TALE repeats, and with novel residues around the BSR. However, MOrTL1 repeats show greater sequence discriminating power than MOrTL2 repeats. Sequence alignments show that there are only three residues conserved between repeats of all TALE-like proteins including the two new additions. This conserved motif could prove useful as an identifier for future TALE-likes. Additionally, comparing MOrTL repeats with those of other TALE-likes suggests a common evolutionary origin for the TALEs, RipTALs and Bats. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
Complete nucleotide sequence of the freshwater unicellular cyanobacterium Synechococcus elongatus PCC 6301 chromosome: gene content and organization.

PubMed

Sugita, Chieko; Ogata, Koretsugu; Shikata, Masamitsu; Jikuya, Hiroyuki; Takano, Jun; Furumichi, Miho; Kanehisa, Minoru; Omata, Tatsuo; Sugiura, Masahiro; Sugita, Mamoru

2007-01-01

The entire genome of the unicellular cyanobacterium Synechococcus elongatus PCC 6301 (formerly Anacystis nidulans Berkeley strain 6301) was sequenced. The genome consisted of a circular chromosome 2,696,255 bp long. A total of 2,525 potential protein-coding genes, two sets of rRNA genes, 45 tRNA genes representing 42 tRNA species, and several genes for small stable RNAs were assigned to the chromosome by similarity searches and computer predictions. The translated products of 56% of the potential protein-coding genes showed sequence similarities to experimentally identified and predicted proteins of known function, and the products of 35% of the genes showed sequence similarities to the translated products of hypothetical genes. The remaining 9% of genes lacked significant similarities to genes for predicted proteins in the public DNA databases. Some 139 genes coding for photosynthesis-related components were identified. Thirty-seven genes for two-component signal transduction systems were also identified. This is the smallest number of such genes identified in cyanobacteria, except for marine cyanobacteria, suggesting that only simple signal transduction systems are found in this strain. The gene arrangement and nucleotide sequence of Synechococcus elongatus PCC 6301 were nearly identical to those of a closely related strain Synechococcus elongatus PCC 7942, except for the presence of a 188.6 kb inversion. The sequences as well as the gene information shown in this paper are available in the Web database, CYORF (http://www.cyano.genome.jp/).

Cloning and sequence analysis of the invertase gene INV 1 from the yeast Pichia anomala.

PubMed

Pérez, J A; Rodríguez, J; Rodríguez, L; Ruiz, T

1996-02-01

A genomic library from the yeast Pichia anomala has been constructed and employed to clone the gene encoding the sucrose-hydrolysing enzyme invertase by complementation of a sucrose non-fermenting mutant of Saccharomyces cerevisiae. The cloned gene, INV1, was sequenced and found to encode a polypeptide of 550 amino acids which contained a 22 amino-acid signal sequence and ten potential glycosylation sites. The amino-acid sequence shows significant identity with other yeast invertases and also with Kluyveromyces marxianus inulinase, a yeast beta-fructofuranosidase which has a different substrate specificity. The nucleotide sequences of the 5' and 3' non-coding regions were found to contain several consensus motifs probably involved in the initiation and termination of gene transcription.
Comprehensive sequence analysis of nine Usher syndrome genes in the UK National Collaborative Usher Study.

PubMed

Le Quesne Stabej, Polona; Saihan, Zubin; Rangesh, Nell; Steele-Stallard, Heather B; Ambrose, John; Coffey, Alison; Emmerson, Jenny; Haralambous, Elene; Hughes, Yasmin; Steel, Karen P; Luxon, Linda M; Webster, Andrew R; Bitner-Glindzicz, Maria

2012-01-01

Usher syndrome (USH) is an autosomal recessive disorder comprising retinitis pigmentosa, hearing loss and, in some cases, vestibular dysfunction. It is clinically and genetically heterogeneous with three distinctive clinical types (I-III) and nine Usher genes identified. This study is a comprehensive clinical and genetic analysis of 172 Usher patients and evaluates the contribution of digenic inheritance. The genes MYO7A, USH1C, CDH23, PCDH15, USH1G, USH2A, GPR98, WHRN, CLRN1 and the candidate gene SLC4A7 were sequenced in 172 UK Usher patients, regardless of clinical type. No subject had definite mutations (nonsense, frameshift or consensus splice site mutations) in two different USH genes. Novel missense variants were classified UV1-4 (unclassified variant): UV4 is 'probably pathogenic', based on control frequency <0.23%, identification in trans to a pathogenic/probably pathogenic mutation and segregation with USH in only one family; and UV3 ('likely pathogenic') as above, but no information on phase. Overall 79% of identified pathogenic/UV4/UV3 variants were truncating and 21% were missense changes. MYO7A accounted for 53.2%, and USH1C for 14.9% of USH1 families (USH1C:c.496+1G>A being the most common USH1 mutation in the cohort). USH2A was responsible for 79.3% of USH2 families and GPR98 for only 6.6%. No mutations were found in USH1G, WHRN or SLC4A7. One or two pathogenic/likely pathogenic variants were identified in 86% of cases. No convincing cases of digenic inheritance were found. It is concluded that digenic inheritance does not make a significant contribution to Usher syndrome; the observation of multiple variants in different genes is likely to reflect polymorphic variation, rather than digenic effects.
Frequent genes in rare diseases: panel-based next generation sequencing to disclose causal mutations in hereditary neuropathies.

PubMed

Dohrn, Maike F; Glöckle, Nicola; Mulahasanovic, Lejla; Heller, Corina; Mohr, Julia; Bauer, Christine; Riesch, Erik; Becker, Andrea; Battke, Florian; Hörtnagel, Konstanze; Hornemann, Thorsten; Suriyanarayanan, Saranya; Blankenburg, Markus; Schulz, Jörg B; Claeys, Kristl G; Gess, Burkhard; Katona, Istvan; Ferbert, Andreas; Vittore, Debora; Grimm, Alexander; Wolking, Stefan; Schöls, Ludger; Lerche, Holger; Korenke, G Christoph; Fischer, Dirk; Schrank, Bertold; Kotzaeridou, Urania; Kurlemann, Gerhard; Dräger, Bianca; Schirmacher, Anja; Young, Peter; Schlotter-Weigel, Beate; Biskup, Saskia

2017-12-01

Hereditary neuropathies comprise a wide variety of chronic diseases associated to more than 80 genes identified to date. We herein examined 612 index patients with either a Charcot-Marie-Tooth phenotype, hereditary sensory neuropathy, familial amyloid neuropathy, or small fiber neuropathy using a customized multigene panel based on the next generation sequencing technique. In 121 cases (19.8%), we identified at least one putative pathogenic mutation. Of these, 54.4% showed an autosomal dominant, 33.9% an autosomal recessive, and 11.6% an X-linked inheritance. The most frequently affected genes were PMP22 (16.4%), GJB1 (10.7%), MPZ, and SH3TC2 (both 9.9%), and MFN2 (8.3%). We further detected likely or known pathogenic variants in HINT1, HSPB1, NEFL, PRX, IGHMBP2, NDRG1, TTR, EGR2, FIG4, GDAP1, LMNA, LRSAM1, POLG, TRPV4, AARS, BIC2, DHTKD1, FGD4, HK1, INF2, KIF5A, PDK3, REEP1, SBF1, SBF2, SCN9A, and SPTLC2 with a declining frequency. Thirty-four novel variants were considered likely pathogenic not having previously been described in association with any disorder in the literature. In one patient, two homozygous mutations in HK1 were detected in the multigene panel, but not by whole exome sequencing. A novel missense mutation in KIF5A was considered pathogenic because of the highly compatible phenotype. In one patient, the plasma sphingolipid profile could functionally prove the pathogenicity of a mutation in SPTLC2. One pathogenic mutation in MPZ was identified after being previously missed by Sanger sequencing. We conclude that panel based next generation sequencing is a useful, time- and cost-effective approach to assist clinicians in identifying the correct diagnosis and enable causative treatment considerations. © 2017 International Society for Neurochemistry.
Porcine MYF6 gene: sequence, homology analysis, and variation in the promoter region.

PubMed

Wyszyńska-Koko, J; Kurył, J

2004-01-01

MYF6 gene codes for the bHLH transcription factor belonging to MyoD family. Its expression accompanies the processes of differentiation and maturation of myotubes during embriogenesis and continues on a relatively high level after birth, affecting the muscle phenotype. The porcine MYF6 gene was amplified and sequenced and compared with MYF6 gene sequences of other species. The amino acid sequence was deduced and an interspecies homology analysis was performed. Myf-6 protein shows a high conservation among species of 99 and 97% identity when comparing pig with cow and human, respectively, and of 93% when comparing pig with mouse and rat. The single nucleotide polymorphism (SNP) was revealed within the promoter region, which appeared to be T --> C transition recognized by a MspI restriction enzyme.
Trichomonas vaginalis vast BspA-like gene family: evidence for functional diversity from structural organisation and transcriptomics

PubMed Central

2010-01-01

Background Trichomonas vaginalis is the most common non-viral human sexually transmitted pathogen and importantly, contributes to facilitating the spread of HIV. Yet very little is known about its surface and secreted proteins mediating interactions with, and permitting the invasion and colonisation of, the host mucosa. Initial annotations of T. vaginalis genome identified a plethora of candidate extracellular proteins. Results Data mining of the T. vaginalis genome identified 911 BspA-like entries (TvBspA) sharing TpLRR-like leucine-rich repeats, which represent the largest gene family encoding potential extracellular proteins for the pathogen. A broad range of microorganisms encoding BspA-like proteins was identified and these are mainly known to live on mucosal surfaces, among these T. vaginalis is endowed with the largest gene family. Over 190 TvBspA proteins with inferred transmembrane domains were characterised by a considerable structural diversity between their TpLRR and other types of repetitive sequences and two subfamilies possessed distinct classic sorting signal motifs for endocytosis. One TvBspA subfamily also shared a glycine-rich protein domain with proteins from Clostridium difficile pathogenic strains and C. difficile phages. Consistent with the hypothesis that TvBspA protein structural diversity implies diverse roles, we demonstrated for several TvBspA genes differential expression at the transcript level in different growth conditions. Identified variants of repetitive segments between several TvBspA paralogues and orthologues from two clinical isolates were also consistent with TpLRR and other repetitive sequences to be functionally important. For one TvBspA protein cell surface expression and antibody responses by both female and male T. vaginalis infected patients were also demonstrated. Conclusions The biased mucosal habitat for microbial species encoding BspA-like proteins, the characterisation of a vast structural diversity for the Tv
Identification and Characterization of a Pesticide Degrading Flavobacterium Species EMBS0145 by 16S rRNA Gene Sequencing.

PubMed

Nayarisseri, Anuraj; Suppahia, Anjana; Nadh, Anuroopa G; Nair, Achuthsankar S

2015-06-01

Organophosphates like chlorpyrifos, diazinon, or malathion have become most common and indisputably most toxic pest control agents that adversely affects the human nervous system even at low levels of exposure. Because of their relatively low cost and ability to be applied on a wide range of target insects and crop, organophosphorus pesticides account for a large share of all insecticides used in India, and this in turn raises severe health concerns. In this view, the present investigation was aimed to identify novel species of Flavobacterium bacteria which is bestowed with the capacity to degrade pesticides like chlorpyrifos, diazinon, or malathion. The bacterium was isolated from agricultural soil collected from Guntur District, Andhra Pradesh, India. The samples were serially diluted, and the aliquots were incubated for a suitable time following which the suspected colony was subjected to 16S rRNA gene sequencing. The sequence thus obtained was aligned pairwise against Flavobacterium species, which resulted in identification of novel species of Flavobacterium later which was named as EMBS0145 and sequence was deposited in GenBank with Accession Number: JN794045.
Identification and characterization of a pesticide degrading flavobacterium species EMBS0145 by 16S rRNA gene sequencing.

PubMed

Nayarisseri, Anuraj; Suppahia, Anjana; Nadh, Anuroopa G; Nair, Achuthsankar S

2014-08-09

Organophosphates (OPs) like chlorpyrifos, diazinon, or malathion have become most common and indisputably most toxic pest-control agents that adversely affects the human nervous system even at low levels of exposure. Because of their relatively low cost and ability to be applied on a wide range of target insects and crop, organophosphorus pesticides account for a large share of all insecticides used in India, this in turn raises severe health concerns. In this view, the present investigation was aimed to identify novel species of Flavobacterium bacteria which is bestowed with the capacity to degrade pesticides like chlorpyrifos, diazinon or malathion. The bacterium was isolated from agricultural soil collected from Guntur District, Andhra Pradesh, India. The samples were serially diluted and the aliquots were incubated for a suitable time following which the suspected colony was subjected to 16S rRNA gene sequencing. The sequence thus obtained was aligned pairwise against Flavobacterium species, which resulted in identification of novel species of Flavobacterium later which was named as EMBS0145 and sequence was deposited in GenBank with accession number JN794045.
Suitability of partial 16S ribosomal RNA gene sequence analysis for the identification of dangerous bacterial pathogens.

PubMed

Ruppitsch, W; Stöger, A; Indra, A; Grif, K; Schabereiter-Gurtner, C; Hirschl, A; Allerberger, F

2007-03-01

In a bioterrorism event a rapid tool is needed to identify relevant dangerous bacteria. The aim of the study was to assess the usefulness of partial 16S rRNA gene sequence analysis and the suitability of diverse databases for identifying dangerous bacterial pathogens. For rapid identification purposes a 500-bp fragment of the 16S rRNA gene of 28 isolates comprising Bacillus anthracis, Brucella melitensis, Burkholderia mallei, Burkholderia pseudomallei, Francisella tularensis, Yersinia pestis, and eight genus-related and unrelated control strains was amplified and sequenced. The obtained sequence data were submitted to three public and two commercial sequence databases for species identification. The most frequent reason for incorrect identification was the lack of the respective 16S rRNA gene sequences in the database. Sequence analysis of a 500-bp 16S rDNA fragment allows the rapid identification of dangerous bacterial species. However, for discrimination of closely related species sequencing of the entire 16S rRNA gene, additional sequencing of the 23S rRNA gene or sequencing of the 16S-23S rRNA intergenic spacer is essential. This work provides comprehensive information on the suitability of partial 16S rDNA analysis and diverse databases for rapid and accurate identification of dangerous bacterial pathogens.
Phylogenetic relationships among the major lineages of the birds-of-paradise (Paradisaeidae) using mitochondrial DNA gene sequences.

PubMed

Nunn, G B; Cracraft, J

1996-06-01

Complete mitochondrial cytochrome b gene sequences were determined from 12 species of the Australo-Papuan birds-of-paradise (Paradisaeidae) representing 9 genera. Phylogenetic analysis of these and 5 previously published sequences reveals a radiation of the main paradisaeinine lineages that took place over a relatively short evolutionary time scale. The core paradisaeinines are resolved as the monophyletic sister-group to the crow-like manucodines. The genus Parotia is basal to other paradisaeinines and is not closely related to the morphologically similar genera Ptiloris and Lophorina. Three major clades within the paradisaeinine ingroup include: (1) Cicinnurus and Diphyllodes, (2) Ptiloris and Lophorina, and (3) the genus Paradisaea. The monotypic genus Seleucidis is apparently closely related to clades (1) and (2). Cytochrome b sequences did not provide evidence for the monophyly of the sicklebill genera Epimachus and Drepanornis. The paradisaeid tree is characterized by short internodal distances. Thus, some clades cannot be strongly resolved by cytochrome b sequences alone.
Diagnosis and molecular basis of mitochondrial respiratory chain disorders: exome sequencing for disease gene identification.

PubMed

Ohtake, A; Murayama, K; Mori, M; Harashima, H; Yamazaki, T; Tamaru, S; Yamashita, Y; Kishita, Y; Nakachi, Y; Kohda, M; Tokuzawa, Y; Mizuno, Y; Moriyama, Y; Kato, H; Okazaki, Y

2014-04-01

Mitochondrial disorders have the highest incidence among congenital metabolic diseases, and are thought to occur at a rate of 1 in 5000 births. About 25% of the diseases diagnosed as mitochondrial disorders in the field of pediatrics have mitochondrial DNA abnormalities, while the rest occur due to defects in genes encoded in the nucleus. The most important function of the mitochondria is biosynthesis of ATP. Mitochondrial disorders are nearly synonymous with mitochondrial respiratory chain disorder, as respiratory chain complexes serve a central role in ATP biosynthesis. By next-generation sequencing of the exome, we analyzed 104 patients with mitochondrial respiratory chain disorders. The results of analysis to date were 18 patients with novel variants in genes previously reported to be disease-causing, and 27 patients with mutations in genes suggested to be associated in some way with mitochondria, and it is likely that they are new disease-causing genes in mitochondrial disorders. This article is part of a Special Issue entitled Frontiers of Mitochondrial Research. Copyright © 2014 The Authors. Published by Elsevier B.V. All rights reserved.
DNA sequence requirements for the accurate transcription of a protein-coding plastid gene in a plastid in vitro system from mustard (Sinapis alba L.)

PubMed Central

Link, Gerhard

1984-01-01

A nuclease-treated plastid extract from mustard (Sinapis alba L.) allows efficient transcription of cloned plastid DNA templates. In this in vitro system, the major runoff transcript of the truncated gene for the 32 000 mol. wt. photosystem II protein was accurately initiated from a site close to or identical with the in vivo start site. By using plasmids with deletions in the 5'-flanking region of this gene as templates, a DNA region required for efficient and selective initiation was detected ˜28-35 nucleotides upstream of the transcription start site. This region contains the sequence element TTGACA, which matches the consensus sequence for prokaryotic `−35' promoter elements. In the absence of this region, a region ˜13-27 nucleotides upstream of the start site still enables a basic level of specific transcription. This second region contains the sequence element TATATAA, which matches the consensus sequence for the `TATA' box of genes transcribed by RNA polymerase II (or B). The region between the `TATA'-like element and the transcription start site is not sufficient but may be required for specific transcription of the plastid gene. This latter region contains the sequence element TATACT, which resembles the prokaryotic `−10' (Pribnow) box. Based on the structural and transcriptional features of the 5' upstream region, a `promoter switch' mechanism is proposed, which may account for the developmentally regulated expression of this plastid gene. ImagesFig. 1.Fig. 2.Fig. 3.Fig. 4.Figure 5. PMID:16453540
GeneWiz browser: An Interactive Tool for Visualizing Sequenced Chromosomes.

PubMed

Hallin, Peter F; Stærfeldt, Hans-Henrik; Rotenberg, Eva; Binnewies, Tim T; Benham, Craig J; Ussery, David W

2009-09-25

We present an interactive web application for visualizing genomic data of prokaryotic chromosomes. The tool (GeneWiz browser) allows users to carry out various analyses such as mapping alignments of homologous genes to other genomes, mapping of short sequencing reads to a reference chromosome, and calculating DNA properties such as curvature or stacking energy along the chromosome. The GeneWiz browser produces an interactive graphic that enables zooming from a global scale down to single nucleotides, without changing the size of the plot. Its ability to disproportionally zoom provides optimal readability and increased functionality compared to other browsers. The tool allows the user to select the display of various genomic features, color setting and data ranges. Custom numerical data can be added to the plot allowing, for example, visualization of gene expression and regulation data. Further, standard atlases are pre-generated for all prokaryotic genomes available in GenBank, providing a fast overview of all available genomes, including recently deposited genome sequences. The tool is available online from http://www.cbs.dtu.dk/services/gwBrowser. Supplemental material including interactive atlases is available online at http://www.cbs.dtu.dk/services/gwBrowser/suppl/.
Closed Genome Sequence of Chryseobacterium piperi Strain CTMT/ATCC BAA-1782, a Gram-Negative Bacterium with Clostridial Neurotoxin-Like Coding Sequences

PubMed Central

Wentz, Travis G.; Muruvanda, Tim; Thirunavukkarasu, Nagarajan; Hoffmann, Maria; Allard, Marc W.; Hodge, David R.; Pillai, Segaran P.; Hammack, Thomas S.; Brown, Eric W.

2017-01-01

ABSTRACT Clostridial neurotoxins, including botulinum and tetanus neurotoxins, are among the deadliest known bacterial toxins. Until recently, the horizontal mobility of this toxin gene family appeared to be limited to the genus Clostridium. We report here the closed genome sequence of Chryseobacterium piperi, a Gram-negative bacterium containing coding sequences with homology to clostridial neurotoxin family proteins. PMID:29192076
Genome-wide analysis of the cellulose synthase-like (Csl) gene family in bread wheat (Triticum aestivum L.).

PubMed

Kaur, Simerjeet; Dhugga, Kanwarpal S; Beech, Robin; Singh, Jaswinder

2017-11-03

Hemicelluloses are a diverse group of complex, non-cellulosic polysaccharides, which constitute approximately one-third of the plant cell wall and find use as dietary fibres, food additives and raw materials for biofuels. Genes involved in hemicellulose synthesis have not been extensively studied in small grain cereals. In efforts to isolate the sequences for the cellulose synthase-like (Csl) gene family from wheat, we identified 108 genes (hereafter referred to as TaCsl). Each gene was represented by two to three homeoalleles, which are named as TaCslXY_ZA, TaCslXY_ZB, or TaCslXY_ZD, where X denotes the Csl subfamily, Y the gene number and Z the wheat chromosome where it is located. A quarter of these genes were predicted to have 2 to 3 splice variants, resulting in a total of 137 putative translated products. Approximately 45% of TaCsl genes were located on chromosomes 2 and 3. Sequences from the subfamilies C and D were interspersed between the dicots and grasses but those from subfamily A clustered within each group of plants. Proximity of the dicot-specific subfamilies B and G, to the grass-specific subfamilies H and J, respectively, points to their common origin. In silico expression analysis in different tissues revealed that most of the genes were expressed ubiquitously and some were tissue-specific. More than half of the genes had introns in phase 0, one-third in phase 2, and a few in phase 1. Detailed characterization of the wheat Csl genes has enhanced the understanding of their structural, functional, and evolutionary features. This information will be helpful in designing experiments for genetic manipulation of hemicellulose synthesis with the goal of developing improved cultivars for biofuel production and increased tolerance against various stresses.
Transcriptome analyses of the Dof-like gene family in grapevine reveal its involvement in berry, flower and seed development.

PubMed

da Silva, Danielle Costenaro; da Silveira Falavigna, Vítor; Fasoli, Marianna; Buffon, Vanessa; Porto, Diogo Denardi; Pappas, Georgios Joannis; Pezzotti, Mario; Pasquali, Giancarlo; Revers, Luís Fernando

2016-01-01

The Dof (DNA-binding with one finger) protein family spans a group of plant transcription factors involved in the regulation of several functions, such as plant responses to stress, hormones and light, phytochrome signaling and seed germination. Here we describe the Dof-like gene family in grapevine (Vitis vinifera L.), which consists of 25 genes coding for Dof. An extensive in silico characterization of the VviDofL gene family was performed. Additionally, the expression of the entire gene family was assessed in 54 grapevine tissues and organs using an integrated approach with microarray (cv Corvina) and real-time PCR (cv Pinot Noir) analyses. The phylogenetic analysis comparing grapevine sequences with those of Arabidopsis, tomato, poplar and already described Dof genes in other species allowed us to identify several duplicated genes. The diversification of grapevine DofL genes during evolution likely resulted in a broader range of biological roles. Furthermore, distinct expression patterns were identified between samples analyzed, corroborating such hypothesis. Our expression results indicate that several VviDofL genes perform their functional roles mainly during flower, berry and seed development, highlighting their importance for grapevine growth and production. The identification of similar expression profiles between both approaches strongly suggests that these genes have important regulatory roles that are evolutionally conserved between grapevine cvs Corvina and Pinot Noir.
Transcriptome analyses of the Dof-like gene family in grapevine reveal its involvement in berry, flower and seed development

PubMed Central

da Silva, Danielle Costenaro; da Silveira Falavigna, Vítor; Fasoli, Marianna; Buffon, Vanessa; Porto, Diogo Denardi; Pappas, Georgios Joannis; Pezzotti, Mario; Pasquali, Giancarlo; Revers, Luís Fernando

2016-01-01

The Dof (DNA-binding with one finger) protein family spans a group of plant transcription factors involved in the regulation of several functions, such as plant responses to stress, hormones and light, phytochrome signaling and seed germination. Here we describe the Dof-like gene family in grapevine (Vitis vinifera L.), which consists of 25 genes coding for Dof. An extensive in silico characterization of the VviDofL gene family was performed. Additionally, the expression of the entire gene family was assessed in 54 grapevine tissues and organs using an integrated approach with microarray (cv Corvina) and real-time PCR (cv Pinot Noir) analyses. The phylogenetic analysis comparing grapevine sequences with those of Arabidopsis, tomato, poplar and already described Dof genes in other species allowed us to identify several duplicated genes. The diversification of grapevine DofL genes during evolution likely resulted in a broader range of biological roles. Furthermore, distinct expression patterns were identified between samples analyzed, corroborating such hypothesis. Our expression results indicate that several VviDofL genes perform their functional roles mainly during flower, berry and seed development, highlighting their importance for grapevine growth and production. The identification of similar expression profiles between both approaches strongly suggests that these genes have important regulatory roles that are evolutionally conserved between grapevine cvs Corvina and Pinot Noir. PMID:27610237
A flexible and economical barcoding approach for highly multiplexed amplicon sequencing of diverse target genes

PubMed Central

Herbold, Craig W.; Pelikan, Claus; Kuzyk, Orest; Hausmann, Bela; Angel, Roey; Berry, David; Loy, Alexander

2015-01-01

High throughput sequencing of phylogenetic and functional gene amplicons provides tremendous insight into the structure and functional potential of complex microbial communities. Here, we introduce a highly adaptable and economical PCR approach to barcoding and pooling libraries of numerous target genes. In this approach, we replace gene- and sequencing platform-specific fusion primers with general, interchangeable barcoding primers, enabling nearly limitless customized barcode-primer combinations. Compared to barcoding with long fusion primers, our multiple-target gene approach is more economical because it overall requires lower number of primers and is based on short primers with generally lower synthesis and purification costs. To highlight our approach, we pooled over 900 different small-subunit rRNA and functional gene amplicon libraries obtained from various environmental or host-associated microbial community samples into a single, paired-end Illumina MiSeq run. Although the amplicon regions ranged in size from approximately 290 to 720 bp, we found no significant systematic sequencing bias related to amplicon length or gene target. Our results indicate that this flexible multiplexing approach produces large, diverse, and high quality sets of amplicon sequence data for modern studies in microbial ecology. PMID:26236305
Differential effects of simple repeating DNA sequences on gene expression from the SV40 early promoter.

PubMed

Amirhaeri, S; Wohlrab, F; Wells, R D

1995-02-17

The influence of simple repeat sequences, cloned into different positions relative to the SV40 early promoter/enhancer, on the transient expression of the chloramphenicol acetyltransferase (CAT) gene was investigated. Insertion of (G)29.(C)29 in either orientation into the 5'-untranslated region of the CAT gene reduced expression in CV-1 cells 50-100 fold when compared with controls with random sequence inserts. Analysis of CAT-specific mRNA levels demonstrated that the effect was due to a reduction of CAT mRNA production rather than to posttranscriptional events. In contrast, insertion of the same insert in either orientation upstream of the promoter-enhancer or downstream of the gene stimulated gene expression 2-3-fold. These effects could be reversed by cotransfection of a competitor plasmid carrying (G)25.(C)25 sequences. The results suggest that a G.C-binding transcription factor modulates gene expression in this system and that promoter strength can be regulated by providing protein-binding sites in trans. Although constructs containing longer tracts of alternating (C-G), (T-G), or (A-T) sequences inhibited CAT expression when inserted in the 5'-untranslated region of the CAT gene, the amount of CAT mRNA was unaffected. Hence, these inhibitions must be due to posttranscriptional events, presumably at the level of translation. These effects of microsatellite sequences on gene expression are discussed with respect to recent data on related simple repeat sequences which cause several human genetic diseases.
A Multiplexed Amplicon Approach for Detecting Gene Fusions by Next-Generation Sequencing.

PubMed

Beadling, Carol; Wald, Abigail I; Warrick, Andrea; Neff, Tanaya L; Zhong, Shan; Nikiforov, Yuri E; Corless, Christopher L; Nikiforova, Marina N

2016-03-01

Chromosomal rearrangements that result in oncogenic gene fusions are clinically important drivers of many cancer types. Rapid and sensitive methods are therefore needed to detect a broad range of gene fusions in clinical specimens that are often of limited quantity and quality. We describe a next-generation sequencing approach that uses a multiplex PCR-based amplicon panel to interrogate fusion transcripts that involve 19 driver genes and 94 partners implicated in solid tumors. The panel also includes control assays that evaluate the 3'/5' expression ratios of 12 oncogenic kinases, which might be used to infer gene fusion events when the partner is unknown or not included on the panel. There was good concordance between the solid tumor fusion gene panel and other methods, including fluorescence in situ hybridization, real-time PCR, Sanger sequencing, and other next-generation sequencing panels, because 40 specimens known to harbor gene fusions were correctly identified. No specific fusion reads were observed in 59 fusion-negative specimens. The 3'/5' expression ratio was informative for fusions that involved ALK, RET, and NTRK1 but not for BRAF or ROS1 fusions. However, among 37 ALK or RET fusion-negative specimens, four exhibited elevated 3'/5' expression ratios, indicating that fusions predicted solely by 3'/5' read ratios require confirmatory testing. Copyright © 2016 American Society for Investigative Pathology and the Association for Molecular Pathology. Published by Elsevier Inc. All rights reserved.
Identification and Characterization of Three Differentially Expressed Genes, Encoding S-Adenosylhomocysteine Hydrolase, Methionine Aminopeptidase, and a Histone-Like Protein, in the Toxic Dinoflagellate Alexandrium fundyense†

PubMed Central

Taroncher-Oldenburg, Gaspar; Anderson, Donald M.

2000-01-01

Genes showing differential expression related to the early G1 phase of the cell cycle during synchronized circadian growth of the toxic dinoflagellate Alexandrium fundyense were identified and characterized by differential display (DD). The determination in our previous work that toxin production in Alexandrium is relegated to a narrow time frame in early G1 led to the hypothesis that transcriptionally up- or downregulated genes during this subphase of the cell cycle might be related to toxin biosynthesis. Three genes, encoding S-adenosylhomocysteine hydrolase (Sahh), methionine aminopeptidase (Map), and a histone-like protein (HAf), were isolated. Sahh was downregulated, while Map and HAf were upregulated, during the early G1 phase of the cell cycle. Sahh and Map encoded amino acid sequences with about 90 and 70% similarity to those encoded by several eukaryotic and prokaryotic Sahh and Map genes, respectively. The partial Map sequence also contained three cobalt binding motifs characteristic of all Map genes. HAf encoded an amino acid sequence with 60% similarity to those of two histone-like proteins from the dinoflagellate Crypthecodinium cohnii Biecheler. This study documents the potential of applying DD to the identification of genes that are related to physiological processes or cell cycle events in phytoplankton under conditions where small sample volumes represent an experimental constraint. The identification of an additional 21 genes with various cell cycle-related DD patterns also provides evidence for the importance of pretranslational or transcriptional regulation in dinoflagellates, contrary to previous reports suggesting the possibility that translational mechanisms are the primary means of circadian regulation in this group of organisms. PMID:10788388

Visualization and Enumeration of Bacteria Carrying a Specific Gene Sequence by In Situ Rolling Circle Amplification

PubMed Central

Maruyama, Fumito; Kenzaka, Takehiko; Yamaguchi, Nobuyasu; Tani, Katsuji; Nasu, Masao

2005-01-01

Rolling circle amplification (RCA) generates large single-stranded and tandem repeats of target DNA as amplicons. This technique was applied to in situ nucleic acid amplification (in situ RCA) to visualize and count single Escherichia coli cells carrying a specific gene sequence. The method features (i) one short target sequence (35 to 39 bp) that allows specific detection; (ii) maintaining constant fluorescent intensity of positive cells permeabilized extensively after amplicon detection by fluorescence in situ hybridization, which facilitates the detection of target bacteria in various physiological states; and (iii) reliable enumeration of target bacteria by concentration on a gelatin-coated membrane filter. To test our approach, the presence of the following genes were visualized by in situ RCA: green fluorescent protein gene, the ampicillin resistance gene and the replication origin region on multicopy pUC19 plasmid, as well as the single-copy Shiga-like toxin gene on chromosomes inside E. coli cells. Fluorescent antibody staining after in situ RCA also simultaneously identified cells harboring target genes and determined the specificity of in situ RCA. E. coli cells in a nonculturable state from a prolonged incubation were periodically sampled and used for plasmid uptake study. The numbers of cells taking up plasmids determined by in situ RCA was up to 106-fold higher than that measured by selective plating. In addition, in situ RCA allowed the detection of cells taking up plasmids even when colony-forming cells were not detected during the incubation period. By optimizing the cell permeabilization condition for in situ RCA, this method can become a valuable tool for studying free DNA uptake, especially in nonculturable bacteria. PMID:16332770
Improved efficiency in amplification of Escherichia coli o-antigen gene clusters using genome-wide sequence comparison

USDA-ARS?s Scientific Manuscript database

Background: In many bacteria including E. coli, genes encoding O-antigens are clustered in the chromosome, with a 39-bp JUMPstart sequence and gnd gene located upstream and downstream of the cluster, respectively. For determining the DNA sequence of the E. coli O-antigen gene cluster, one set of P...
In vitro infectivity and differential gene expression of Leishmania infantum metacyclic promastigotes: negative selection with peanut agglutinin in culture versus isolation from the stomodeal valve of Phlebotomus perniciosus.

PubMed

Alcolea, Pedro J; Alonso, Ana; Degayón, María A; Moreno-Paz, Mercedes; Jiménez, Maribel; Molina, Ricardo; Larraga, Vicente

2016-05-20

Leishmania infantum is the protozoan parasite responsible for zoonotic visceral leishmaniasis in the Mediterranean basin. A recent outbreak in humans has been reported in this area. The life cycle of the parasite is digenetic. The promastigote stage develops within the gut of phlebotomine sand flies, whereas amastigotes survive and multiply within phagolysosomes of mammalian host phagocytes. The major vector of L. infantum in Spain is Phlebotomus perniciosus. The axenic culture model of promastigotes is generally used because it is able to mimic the conditions of the natural environment (i.e. the sand fly vector gut). However, infectivity decreases with culture passages and infection of laboratory animals is frequently required. Enrichment of the stationary phase population in highly infective metacyclic promastigotes is achieved by negative selection with peanut agglutinin (PNA), which is possible only in certain Leishmania species such as L. major and L. infantum. In this study, in vitro infectivity and differential gene expression of cultured PNA-negative promastigotes (Pro-PNA(-)) and metacyclic promastigotes isolated from the sand fly anterior thoracic midgut (Pro-Pper) have been compared. In vitro infectivity is about 30 % higher in terms of rate of infected cells and number of amastigotes per infected cell in Pro-Pper than in Pro-PNA(-). This finding is in agreement with up-regulation of a leishmanolysin gene (gp63) and genes involved in biosynthesis of glycosylinositolphospholipids (GIPL), lipophosphoglycan (LPG) and proteophosphoglycan (PPG) in Pro-Pper. In addition, differences between Pro-Pper and Pro-PNA(-) in genes involved in important cellular processes (e.g. signaling and regulation of gene expression) have been found. Pro-Pper are significantly more infective than peanut lectin non-agglutinating ones. Therefore, negative selection with PNA is an appropriate method for isolating metacyclic promastigotes in stationary phase of axenic culture but it
Identification of rat serum alkaline phosphatase isoenzyme by means of wheat germ agglutinin.

PubMed

Wada, H; Niwa, N; Hayakawa, T; Tsuge, H

1997-01-01

Wheat germ agglutinin (WGA) precipitates bone type serum alkaline phosphatase (sALP) isoenzyme specifically. The precipitates are composed of the macromolecules of WGA and "bone type sALP" (WGA-ALP complex). In order to use bone type sALP as a marker in polyacrylamide gel electrophoresis (PAGE), a method to separate "bone type sALP" from the "WGA-ALP complex" was established by using N-acetyl-D-glucosamine (GlcNAc)-Sepharose 6E column chromatography. It was concluded that this method is useful for clinical examination in the rat.
Role of an expansin-like molecule in Dictyostelium morphogenesis and regulation of its gene expression by the signal transducer and activator of transcription protein Dd-STATa.

PubMed

Ogasawara, Shun; Shimada, Nao; Kawata, Takefumi

2009-02-01

Expansins are proteins involved in plant morphogenesis, exerting their effects on cellulose to extend cell walls. Dictyostelium is an organism that possesses expansin-like molecules, but their functions are not known. In this study, we analyzed the expL7 (expansin-like 7) gene, which has been identified as a putative target of Dd-STATa, a Dictyostelium homolog of the metazoan signal transducer and activator of transcription (STAT) proteins. Promoter fragments of the expL7 were fused to a lacZ reporter and the expression patterns determined. As expected from the behavior of the endogenous expL7 gene, the expL7/lacZ fusion gene was downregulated in Dd-STATa null slugs. In the parental strain, the expL7 promoter was activated in the anterior tip region. Mutational analysis of the promoter identified a sequence that was necessary for expression in tip cells. In addition, an activator sequence for pstAB cells was identified. These sequences act in combination with the repressor region to prevent ectopic expL7 expression in the prespore and prestalk regions of the slug and culminant. Although the expL7 null mutant showed no phenotypic change, the expL7 overexpressor showed aberrant stalk formation. These results indicate that the expansin-like molecule is important for morphogenesis in Dictyostelium.
Genetic testing of the FBN1 gene in Chinese patients with Marfan/Marfan-like syndrome.

PubMed

Yang, Hang; Luo, Mingyao; Chen, Qianlong; Fu, Yuanyuan; Zhang, Jing; Qian, Xiangyang; Sun, Xiaogang; Fan, Yuxin; Zhou, Zhou; Chang, Qian

2016-08-01

Marfan syndrome (MFS) is an autosomal dominant connective tissue disorder typically involving the ocular, skeletal and cardiovascular systems, and aortic aneurysms/dissection mainly contributes to its mortality. Here, we performed genetic testing of the FBN1 gene in 39 Chinese probands with Marfan/Marfan-like syndrome and their related family members by Sanger sequencing. In total, 29 pathogenic/likely pathogenic FBN1 mutations, including 17 novel ones, were identified. In addition, most MFS patients with aortic disease (62%) had a truncating or splicing mutation. These results expand the FBN1 mutation spectrum and enrich our knowledge of genotype-phenotype correlations. Genetic testing for MFS and its related aortic diseases is increasingly important for early intervention and treatment. Copyright © 2016 Elsevier B.V. All rights reserved.
[Sequences and expression pattern of mce gene in Leptospira interrogans of different serogroups].

PubMed

Zhang, Lei; Xue, Feng; Yan, Jie; Mao, Ya-fei; Li, Li-wei

2008-11-01

To determine the frequency of mce gene in Leptospira interrogans, and to investigate the gene transcription levels of L. interrogans before and after infecting cells. The segments of entire mce genes from 13 L.interrogans strains and 1 L.biflexa strain were amplified by PCR and then sequenced after T-A cloning. A prokaryotic expression system of mce gene was constructed; the expression and output of the target recombinant protein rMce were examined by SDS-PAGE and Western Blot assay. Rabbits were intradermally immunized with rMce to prepare the antiserum, the titer of antiserum was measured by immunodiffusion test. The transcription levels of mce gene in L.interrogans serogroup Icterohaemorrhagiae serovar lai strain 56601 before and after infecting J774A.1 cells were monitored by real-time fluorescence quantitative RT-PCR. mce gene was carried in all tested L.interrogans strains, but not in L.biflexa serogroup Semaranga serovar patoc strain Patoc I. The similarities of nucleotide and putative amino acid sequences of the cloned mce genes to the reported sequences (GenBank accession No: NP712236) were 99.02%-100% and 97.91%-100%, respectively. The constructed prokaryotic expression system of mce gene expressed rMce and the output of rMce was about 5% of the total bacterial proteins. The antiserum against whole cell of L.interrogans strain 56601 efficiently recognized rMce. After infecting J774A.1 cells, transcription levels of the mce gene in L.interrogans strain 56601 were remarkably up-regulated. The constructed prokaryotic expression system of mce gene and the prepared antiserum against rMce provide useful tools for further study of the gene function.
CDKL5 gene status in female patients with epilepsy and Rett-like features: two new mutations in the catalytic domain

PubMed Central

2012-01-01

Background Mutations in the cyclin-dependent kinase-like 5 gene (CDKL5) located in the Xp22 region have been shown to cause a subset of atypical Rett syndrome with infantile spasms or early seizures starting in the first postnatal months. Methods We performed mutation screening of CDKL5 in 60 female patients who had been identified as negative for the methyl CpG-binding protein 2 gene (MECP2) mutations, but who had current or past epilepsy, regardless of the age of onset, type, and severity. All the exons in the CDKL5 gene and their neighbouring sequences were examined, and CDKL5 rearrangements were studied by multiplex ligation-dependent probe amplification (MLPA). Results Six previously unidentified DNA changes were detected, two of which were disease-causing mutations in the catalytic domain: a frameshift mutation (c.509_510insGT; p.Glu170GlyfsX36) and a complete deletion of exon 10. Both were found in patients with seizures that started in the first month of life. Conclusions This study demonstrated the importance of CDKL5 mutations as etiological factors in neurodevelopmental disorders, and indicated that a thorough analysis of the CDKL5 gene sequence and its rearrangements should be considered in females with Rett syndrome-like phenotypes, severe encephalopathy and epilepsy with onset before 5 months of age. This study also confirmed the usefulness of MLPA as a diagnostic screening method for use in clinical practice. PMID:22867051
CDKL5 gene status in female patients with epilepsy and Rett-like features: two new mutations in the catalytic domain.

PubMed

Maortua, Hiart; Martínez-Bouzas, Cristina; Calvo, María-Teresa; Domingo, Maria-Rosario; Ramos, Feliciano; García-Ribes, Ainhoa; Martínez, María-Jesús; López-Aríztegui, María-Asunción; Puente, Nerea; Rubio, Izaskun; Tejada, María-Isabel

2012-08-06

Mutations in the cyclin-dependent kinase-like 5 gene (CDKL5) located in the Xp22 region have been shown to cause a subset of atypical Rett syndrome with infantile spasms or early seizures starting in the first postnatal months. We performed mutation screening of CDKL5 in 60 female patients who had been identified as negative for the methyl CpG-binding protein 2 gene (MECP2) mutations, but who had current or past epilepsy, regardless of the age of onset, type, and severity. All the exons in the CDKL5 gene and their neighbouring sequences were examined, and CDKL5 rearrangements were studied by multiplex ligation-dependent probe amplification (MLPA). Six previously unidentified DNA changes were detected, two of which were disease-causing mutations in the catalytic domain: a frameshift mutation (c.509_510insGT; p.Glu170GlyfsX36) and a complete deletion of exon 10. Both were found in patients with seizures that started in the first month of life. This study demonstrated the importance of CDKL5 mutations as etiological factors in neurodevelopmental disorders, and indicated that a thorough analysis of the CDKL5 gene sequence and its rearrangements should be considered in females with Rett syndrome-like phenotypes, severe encephalopathy and epilepsy with onset before 5 months of age. This study also confirmed the usefulness of MLPA as a diagnostic screening method for use in clinical practice.
Epstein-Barr virus latent gene sequences as geographical markers of viral origin: unique EBNA3 gene signatures identify Japanese viruses as distinct members of the Asian virus family.

PubMed

Sawada, Akihisa; Croom-Carter, Deborah; Kondo, Osamu; Yasui, Masahiro; Koyama-Sato, Maho; Inoue, Masami; Kawa, Keisei; Rickinson, Alan B; Tierney, Rosemary J

2011-05-01

Polymorphisms in Epstein-Barr virus (EBV) latent genes can identify virus strains from different human populations and individual strains within a population. An Asian EBV signature has been defined almost exclusively from Chinese viruses, with little information from other Asian countries. Here we sequenced polymorphic regions of the EBNA1, 2, 3A, 3B, 3C and LMP1 genes of 31 Japanese strains from control donors and EBV-associated T/NK-cell lymphoproliferative disease (T/NK-LPD) patients. Though identical to Chinese strains in their dominant EBNA1 and LMP1 alleles, Japanese viruses were subtly different at other loci. Thus, while Chinese viruses mainly fall into two families with strongly linked 'Wu' or 'Li' alleles at EBNA2 and EBNA3A/B/C, Japanese viruses all have the consensus Wu EBNA2 allele but fall into two families at EBNA3A/B/C. One family has variant Li-like sequences at EBNA3A and 3B and the consensus Li sequence at EBNA3C; the other family has variant Wu-like sequences at EBNA3A, variants of a low frequency Chinese allele 'Sp' at EBNA3B and a consensus Sp sequence at EBNA3C. Thus, EBNA3A/B/C allelotypes clearly distinguish Japanese from Chinese strains. Interestingly, most Japanese viruses also lack those immune-escape mutations in the HLA-A11 epitope-encoding region of EBNA3B that are so characteristic of viruses from the highly A11-positive Chinese population. Control donor-derived and T/NK-LPD-derived strains were similarly distributed across allelotypes and, by using allelic polymorphisms to track virus strains in patients pre- and post-haematopoietic stem-cell transplant, we show that a single strain can induce both T/NK-LPD and B-cell-lymphoproliferative disease in the same patient.
Rapid Sequencing of Complete env Genes from Primary HIV-1 Samples

PubMed Central

Eren, Kemal; Ignacio, Caroline; Landais, Elise; Weaver, Steven; Phung, Pham; Ludka, Colleen; Hepler, Lance; Caballero, Gemma; Pollner, Tristan; Guo, Yan; Richman, Douglas; Poignard, Pascal; Paxinos, Ellen E.; Kosakovsky Pond, Sergei L.

2016-01-01

Abstract The ability to study rapidly evolving viral populations has been constrained by the read length of next-generation sequencing approaches and the sampling depth of single-genome amplification methods. Here, we develop and characterize a method using Pacific Biosciences’ Single Molecule, Real-Time (SMRT®) sequencing technology to sequence multiple, intact full-length human immunodeficiency virus-1 env genes amplified from viral RNA populations circulating in blood, and provide computational tools for analyzing and visualizing these data. PMID:29492273
Rapid Sequencing of Complete env Genes from Primary HIV-1 Samples.

PubMed

Laird Smith, Melissa; Murrell, Ben; Eren, Kemal; Ignacio, Caroline; Landais, Elise; Weaver, Steven; Phung, Pham; Ludka, Colleen; Hepler, Lance; Caballero, Gemma; Pollner, Tristan; Guo, Yan; Richman, Douglas; Poignard, Pascal; Paxinos, Ellen E; Kosakovsky Pond, Sergei L; Smith, Davey M

2016-07-01

The ability to study rapidly evolving viral populations has been constrained by the read length of next-generation sequencing approaches and the sampling depth of single-genome amplification methods. Here, we develop and characterize a method using Pacific Biosciences' Single Molecule, Real-Time (SMRT®) sequencing technology to sequence multiple, intact full-length human immunodeficiency virus-1 env genes amplified from viral RNA populations circulating in blood, and provide computational tools for analyzing and visualizing these data.
Chasing Migration Genes: A Brain Expressed Sequence Tag Resource for Summer and Migratory Monarch Butterflies (Danaus plexippus)

PubMed Central

Zhu, Haisun; Casselman, Amy; Reppert, Steven M.

2008-01-01

North American monarch butterflies (Danaus plexippus) undergo a spectacular fall migration. In contrast to summer butterflies, migrants are juvenile hormone (JH) deficient, which leads to reproductive diapause and increased longevity. Migrants also utilize time-compensated sun compass orientation to help them navigate to their overwintering grounds. Here, we describe a brain expressed sequence tag (EST) resource to identify genes involved in migratory behaviors. A brain EST library was constructed from summer and migrating butterflies. Of 9,484 unique sequences, 6068 had positive hits with the non-redundant protein database; the EST database likely represents ∼52% of the gene-encoding potential of the monarch genome. The brain transcriptome was cataloged using Gene Ontology and compared to Drosophila. Monarch genes were well represented, including those implicated in behavior. Three genes involved in increased JH activity (allatotropin, juvenile hormone acid methyltransfersase, and takeout) were upregulated in summer butterflies, compared to migrants. The locomotion-relevant turtle gene was marginally upregulated in migrants, while the foraging and single-minded genes were not differentially regulated. Many of the genes important for the monarch circadian clock mechanism (involved in sun compass orientation) were in the EST resource, including the newly identified cryptochrome 2. The EST database also revealed a novel Na+/K+ ATPase allele predicted to be more resistant to the toxic effects of milkweed than that reported previously. Potential genetic markers were identified from 3,486 EST contigs and included 1599 double-hit single nucleotide polymorphisms (SNPs) and 98 microsatellite polymorphisms. These data provide a template of the brain transcriptome for the monarch butterfly. Our “snap-shot” analysis of the differential regulation of candidate genes between summer and migratory butterflies suggests that unbiased, comprehensive transcriptional profiling
High-Throughput Sequencing of Arabidopsis microRNAs: Evidence for Frequent Birth and Death of MIRNA Genes

PubMed Central

Fahlgren, Noah; Howell, Miya D.; Kasschau, Kristin D.; Chapman, Elisabeth J.; Sullivan, Christopher M.; Cumbie, Jason S.; Givan, Scott A.; Law, Theresa F.; Grant, Sarah R.; Dangl, Jeffery L.; Carrington, James C.

2007-01-01

In plants, microRNAs (miRNAs) comprise one of two classes of small RNAs that function primarily as negative regulators at the posttranscriptional level. Several MIRNA genes in the plant kingdom are ancient, with conservation extending between angiosperms and the mosses, whereas many others are more recently evolved. Here, we use deep sequencing and computational methods to identify, profile and analyze non-conserved MIRNA genes in Arabidopsis thaliana. 48 non-conserved MIRNA families, nearly all of which were represented by single genes, were identified. Sequence similarity analyses of miRNA precursor foldback arms revealed evidence for recent evolutionary origin of 16 MIRNA loci through inverted duplication events from protein-coding gene sequences. Interestingly, these recently evolved MIRNA genes have taken distinct paths. Whereas some non-conserved miRNAs interact with and regulate target transcripts from gene families that donated parental sequences, others have drifted to the point of non-interaction with parental gene family transcripts. Some young MIRNA loci clearly originated from one gene family but form miRNAs that target transcripts in another family. We suggest that MIRNA genes are undergoing relatively frequent birth and death, with only a subset being stabilized by integration into regulatory networks. PMID:17299599
Characterization of a gene family abundantly expressed in Oenothera organensis pollen that shows sequence similarity to polygalacturonase.

PubMed Central

Brown, S M; Crouch, M L

1990-01-01

We have isolated and characterized cDNA clones of a gene family (P2) expressed in Oenothera organensis pollen. This family contains approximately six to eight family members and is expressed at high levels only in pollen. The predicted protein sequence from a near full-length cDNA clone shows that the protein products of these genes are at least 38,000 daltons. We identified the protein encoded by one of the cDNAs in this family by using antibodies to beta-galactosidase/pollen cDNA fusion proteins. Immunoblot analysis using these antibodies identifies a family of proteins of approximately 40 kilodaltons that is present in mature pollen, indicating that these mRNAs are not stored solely for translation after pollen germination. These proteins accumulate late in pollen development and are not detectable in other parts of the plant. Although not present in unpollinated or self-pollinated styles, the 40-kilodalton to 45-kilodalton antigens are detectable in extracts from cross-pollinated styles, suggesting that the proteins are present in pollen tubes growing through the style during pollination. The proteins are also present in pollen tubes growing in vitro. Both nucleotide and amino acid sequences are similar to the published sequences for cDNAs encoding the enzyme polygalacturonase, which suggests that the P2 gene family may function in depolymerizing pectin during pollen development, germination, and tube growth. Cross-hybridizing RNAs and immunoreactive proteins were detected in pollen from a wide variety of plant species, which indicates that the P2 family of polygalacturonase-like genes are conserved and may be expressed in the pollen from many angiosperms. PMID:2152116
Decreased expression of cell adhesion genes in cancer stem-like cells isolated from primary oral squamous cell carcinomas.

PubMed

Mishra, Amrendra; Sriram, Harshini; Chandarana, Pinal; Tanavde, Vivek; Kumar, Rekha V; Gopinath, Ashok; Govindarajan, Raman; Ramaswamy, S; Sadasivam, Subhashini

2018-05-01

The goal of this study was to isolate cancer stem-like cells marked by high expression of CD44, a putative cancer stem cell marker, from primary oral squamous cell carcinomas and identify distinctive gene expression patterns in these cells. From 1 October 2013 to 4 September 2015, 76 stage III-IV primary oral squamous cell carcinoma of the gingivobuccal sulcus were resected. In all, 13 tumours were analysed by immunohistochemistry to visualise CD44-expressing cells. Expression of CD44 within The Cancer Genome Atlas-Head and Neck Squamous Cell Carcinoma RNA-sequencing data was also assessed. Seventy resected tumours were dissociated into single cells and stained with antibodies to CD44 as well as CD45 and CD31 (together referred as Lineage/Lin). From 45 of these, CD44 + Lin - and CD44 - Lin - subpopulations were successfully isolated using fluorescence-activated cell sorting, and good-quality RNA was obtained from 14 such sorted pairs. Libraries from five pairs were sequenced and the results analysed using bioinformatics tools. Reverse transcription quantitative polymerase chain reaction was performed to experimentally validate the differential expression of selected candidate genes identified from the transcriptome sequencing in the same 5 and an additional 9 tumours. CD44 was expressed on the surface of poorly differentiated tumour cells, and within the The Cancer Genome Atlas-Head and Neck Squamous Cell Carcinoma samples, its messenger RNA levels were higher in tumours compared to normal. Transcriptomics revealed that 102 genes were upregulated and 85 genes were downregulated in CD44 + Lin - compared to CD44 - Lin - cells in at least 3 of the 5 tumours sequenced. The upregulated genes included those involved in immune regulation, while the downregulated genes were enriched for genes involved in cell adhesion. Decreased expression of PCDH18, MGP, SPARCL1 and KRTDAP was confirmed by reverse transcription quantitative polymerase chain reaction. Lower expression of
Targeted gene enrichment and high-throughput sequencing for environmental biomonitoring: a case study using freshwater macroinvertebrates.

PubMed

Dowle, Eddy J; Pochon, Xavier; C Banks, Jonathan; Shearer, Karen; Wood, Susanna A

2016-09-01

Recent studies have advocated biomonitoring using DNA techniques. In this study, two high-throughput sequencing (HTS)-based methods were evaluated: amplicon metabarcoding of the cytochrome C oxidase subunit I (COI) mitochondrial gene and gene enrichment using MYbaits (targeting nine different genes including COI). The gene-enrichment method does not require PCR amplification and thus avoids biases associated with universal primers. Macroinvertebrate samples were collected from 12 New Zealand rivers. Macroinvertebrates were morphologically identified and enumerated, and their biomass determined. DNA was extracted from all macroinvertebrate samples and HTS undertaken using the illumina miseq platform. Macroinvertebrate communities were characterized from sequence data using either six genes (three of the original nine were not used) or just the COI gene in isolation. The gene-enrichment method (all genes) detected the highest number of taxa and obtained the strongest Spearman rank correlations between the number of sequence reads, abundance and biomass in 67% of the samples. Median detection rates across rare (<1% of the total abundance or biomass), moderately abundant (1-5%) and highly abundant (>5%) taxa were highest using the gene-enrichment method (all genes). Our data indicated primer biases occurred during amplicon metabarcoding with greater than 80% of sequence reads originating from one taxon in several samples. The accuracy and sensitivity of both HTS methods would be improved with more comprehensive reference sequence databases. The data from this study illustrate the challenges of using PCR amplification-based methods for biomonitoring and highlight the potential benefits of using approaches, such as gene enrichment, which circumvent the need for an initial PCR step. © 2015 John Wiley & Sons Ltd.
Nucleotide sequence of the gene encoding the nitrogenase iron protein of Thiobacillus ferrooxidans

DOE Office of Scientific and Technical Information (OSTI.GOV)

Pretorius, I.M.; Rawlings, D.E.; O'Neill, E.G.

1987-01-01

The DNA sequence was determined for the cloned Thiobacillus ferrooxidans nifH and part of the nifD genes. The DNA chains were radiolabeled with (..cap alpha..-/sup 32/P)dCTP (3000 Ci/mmol) or (..cap alpha..-/sup 35/S)dCTP (400 Ci/mmol). A putative T. ferrooxidans nifH promoter was identified whose sequences showed perfect consensus with those of the Klebsiella pneumoniae nif promoter. Two putative consensus upstream activator sequences were also identified. The amino acid sequence was deduced from the DNA sequence. In a comparison of nifH DNA sequences from T. ferrooxidans and eight other nitrogen-fixing microbes, a Rhizobium sp. isolated from Parasponia andersonii showed the greatest homologymore » (74%) and Clostridium pasteurianum (nifH1) showed the least homology (54%). In the comparison of the amino acid sequences of the Fe proteins, the Rhizobium sp. and Rhizobium japonicum showed the greatest homology (both 86%) and C. pasteurianum (nifH1 gene product) demonstrated the least homology (56%) to the T. ferrooxidans Fe protein.« less
Sequence-based identification of inositol monophosphatase-like histidinol-phosphate phosphatases (HisN) in Corynebacterium glutamicum, Actinobacteria, and beyond.

PubMed

Kulis-Horn, Robert Kasimir; Rückert, Christian; Kalinowski, Jörn; Persicke, Marcus

2017-07-18

The eighth step of L-histidine biosynthesis is carried out by an enzyme called histidinol-phosphate phosphatase (HolPase). Three unrelated HolPase families are known so far. Two of them are well studied: HAD-type HolPases known from Gammaproteobacteria like Escherichia coli or Salmonella enterica and PHP-type HolPases known from yeast and Firmicutes like Bacillus subtilis. However, the third family of HolPases, the inositol monophosphatase (IMPase)-like HolPases, present in Actinobacteria like Corynebacterium glutamicum (HisN) and plants, are poorly characterized. Moreover, there exist several IMPase-like proteins in bacteria (e.g. CysQ, ImpA, and SuhB) which are very similar to HisN but most likely do not participate in L-histidine biosynthesis. Deletion of hisN, the gene encoding the IMPase-like HolPase in C. glutamicum, does not result in complete L-histidine auxotrophy. Out of four hisN homologs present in the genome of C. glutamicum (impA, suhB, cysQ, and cg0911), only cg0911 encodes an enzyme with HolPase activity. The enzymatic properties of HisN and Cg0911 were determined, delivering the first available kinetic data for IMPase-like HolPases. Additionally, we analyzed the amino acid sequences of potential HisN, ImpA, SuhB, CysQ and Cg0911 orthologs from bacteria and identified six conserved sequence motifs for each group of orthologs. Mutational studies confirmed the importance of a highly conserved aspartate residue accompanied by several aromatic amino acid residues present in motif 5 for HolPase activity. Several bacterial proteins containing all identified HolPase motifs, but showing only moderate sequence similarity to HisN from C. glutamicum, were experimentally confirmed as IMPase-like HolPases, demonstrating the value of the identified motifs. Based on the confirmed IMPase-like HolPases two profile Hidden Markov Models (HMMs) were build using an iterative approach. These HMMs allow the fast, reliable detection and differentiation of the two
Nucleotide sequence of the beta-lactamase gene from Enterococcus faecalis HH22 and its similarity to staphylococcal beta-lactamase genes.

PubMed Central

Zscheck, K K; Murray, B E

1991-01-01

The nucleotide sequence of the constitutively produced beta-lactamase (Bla) gene from Enterococcus faecalis HH22 was shown to be identical to the published sequences of three of four staphylococcal type A beta-lactamase genes; more differences were seen with the genes for staphylococcal type C and D enzymes. One hundred forty nucleotides upstream of the beta-lactamase start codon were determined for an inducible staphylococcal beta-lactamase and were identical to those of the constitutively expressed enterococcal gene, indicating that the changes resulting in constitutive expression are not due to changes in the promoter or operator region. Moreover, complementation studies indicated that production of the enterococcal enzyme could be repressed. The genes for the enterococcal Bla and an inducible staphylococcal Bla were each cloned into a shuttle vector and transformed into enterococcal and staphylococcal recipients. The major difference between the backgrounds of the two hosts was that more enzyme was produced by the staphylococcal host, regardless of the source of the gene. The location of the enzyme was found to be host dependent, since each cloned gene generated extracellular (free) enzyme in the staphylococcus and cell-bound enzyme in the enterococcus. On the basis of the identities of the enterococcal Bla and several staphylococcal Bla sequences, these data suggest the recent spread of beta-lactamase to enterococci and also suggest the loss of a functional repressor. PMID:1952840

Clinical evaluation of panel testing by next-generation sequencing (NGS) for gene mutations in myeloid neoplasms.

PubMed

Au, Chun Hang; Wa, Anna; Ho, Dona N; Chan, Tsun Leung; Ma, Edmond S K

2016-01-22

Genomic techniques in recent years have allowed the identification of many mutated genes important in the pathogenesis of acute myeloid leukemia (AML). Together with cytogenetic aberrations, these gene mutations are powerful prognostic markers in AML and can be used to guide patient management, for example selection of optimal post-remission therapy. The mutated genes also hold promise as therapeutic targets themselves. We evaluated the applicability of a gene panel for the detection of AML mutations in a diagnostic molecular pathology laboratory. Fifty patient samples comprising 46 AML and 4 other myeloid neoplasms were accrued for the study. They consisted of 19 males and 31 females at a median age of 60 years (range: 18-88 years). A total of 54 genes (full coding exons of 15 genes and exonic hotspots of 39 genes) were targeted by 568 amplicons that ranged from 225 to 275 bp. The combined coverage was 141 kb in sequence length. Amplicon libraries were prepared by TruSight myeloid sequencing panel (Illumina, CA) and paired-end sequencing runs were performed on a MiSeq (Illumina) genome sequencer. Sequences obtained were analyzed by in-house bioinformatics pipeline, namely BWA-MEM, Samtools, GATK, Pindel, Ensembl Variant Effect Predictor and a novel algorithm ITDseek. The mean count of sequencing reads obtained per sample was 3.81 million and the mean sequencing depth was over 3000X. Seventy-seven mutations in 24 genes were detected in 37 of 50 samples (74 %). On average, 2 mutations (range 1-5) were detected per positive sample. TP53 gene mutations were found in 3 out of 4 patients with complex and unfavorable cytogenetics. Comparing NGS results with that of conventional molecular testing showed a concordance rate of 95.5 %. After further resolution and application of a novel bioinformatics algorithm ITDseek to aid the detection of FLT3 internal tandem duplication (ITD), the concordance rate was revised to 98.2 %. Gene panel testing by NGS approach was
Bioinformatics evidence for the transfer of mycosporine-like amino acid core (4-deoxygadusol) synthesizing gene from cyanobacteria to dinoflagellates and an attempt to mutate the same gene (YP_324358) in Anabaena variabilis PCC 7937.

PubMed

Singh, Shailendra P; Häder, Donat-P; Sinha, Rajeshwar P

2012-06-01

We have identified a homologue of 4-deoxygadusol (core of mycosporine-like amino acids) synthesizing gene (ZP_05036788) from Synechococcus sp. PCC 7335 that was found to have additional functionally unknown N-terminal domain similar to homologues from dinoflagellates based on the ClustalW analysis. Phylogenetic analysis revealed that Synechococcus sp. (ZP_05036788) makes a clade together with dinoflagellates and was closest to the Oxyrrhis marina. This study shows for the first time that N-terminal additional sequences that possess upstream plastid targeting sequence in Heterocapsa triquetra and Karlodinium micrum were already evolved in cyanobacteria, and plastid targeting sequence were evolved later in dinoflagellates after divergence from chloroplast lacking Oxyrrhis marina. Thus, MAAs synthesizing genes were transferred from cyanobacteria to dinoflagellates and possibly Synechococcus sp. PCC 7335 acted as a donor during lateral gene transfer event. In addition, we also tried to mutate 4-deoxygadusol synthesizing gene (YP_324358) of Anabaena variabilis PCC 7937 by homologous recombination, however, all approaches to get complete segregation of the mutants from the wild-type were unsuccessful, showing the essentiality of YP_324358 for A. variabilis PCC 7937. Copyright © 2012 Elsevier B.V. All rights reserved.
Rett-like phenotypes: expanding the genetic heterogeneity to the KCNA2 gene and first familial case of CDKL5-related disease.

PubMed

Allou, L; Julia, S; Amsallem, D; El Chehadeh, S; Lambert, L; Thevenon, J; Duffourd, Y; Saunier, A; Bouquet, P; Pere, S; Moustaïne, A; Ruaud, L; Roth, V; Jonveaux, P; Philippe, C

2017-03-01

Several genes have been implicated in Rett syndrome (RTT) in its typical and variant forms. We applied next-generation sequencing (NGS) to evaluate for mutations in known or new candidate genes in patients with variant forms of Rett or Rett-like phenotypes of unknown molecular aetiology. In the first step, we used NGS with a custom panel including MECP2, CDKL5, FOXG1, MEF2C and IQSEC2. In addition to a FOXG1 mutation in a patient with all core features of the congenital variant of RTT, we identified a missense (p.Ser240Thr) in CDKL5 in a patient who appeared to be seizure free. This missense was maternally inherited with opposite allele expression ratios in the proband and her mother. In the asymptomatic mother, the mutated copy of the CDKL5 gene was inactivated in 90% of blood cells. We also identified a premature stop codon (p.Arg926*) in IQSEC2 in a patient with a Rett-like phenotype. Finally, exome sequencing enabled us to characterize a heterozygous de novo missense (p.Val408Ala) in KCNA2 encoding the potassium channel Kv 1.2 in a girl with infantile-onset seizures variant of RTT. Our study expands the genetic heterogeneity of RTT and RTT-like phenotypes. Moreover, we report the first familial case of CDKL5-related disease. © 2016 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Characterization of fusion genes and the significantly expressed fusion isoforms in breast cancer by hybrid sequencing

PubMed Central

Weirather, Jason L.; Afshar, Pegah Tootoonchi; Clark, Tyson A.; Tseng, Elizabeth; Powers, Linda S.; Underwood, Jason G.; Zabner, Joseph; Korlach, Jonas; Wong, Wing Hung; Au, Kin Fai

2015-01-01

We developed an innovative hybrid sequencing approach, IDP-fusion, to detect fusion genes, determine fusion sites and identify and quantify fusion isoforms. IDP-fusion is the first method to study gene fusion events by integrating Third Generation Sequencing long reads and Second Generation Sequencing short reads. We applied IDP-fusion to PacBio data and Illumina data from the MCF-7 breast cancer cells. Compared with the existing tools, IDP-fusion detects fusion genes at higher precision and a very low false positive rate. The results show that IDP-fusion will be useful for unraveling the complexity of multiple fusion splices and fusion isoforms within tumorigenesis-relevant fusion genes. PMID:26040699
RNase-Resistant Virus-Like Particles Containing Long Chimeric RNA Sequences Produced by Two-Plasmid Coexpression System▿

PubMed Central

Wei, Yuxiang; Yang, Changmei; Wei, Baojun; Huang, Jie; Wang, Lunan; Meng, Shuang; Zhang, Rui; Li, Jinming

2008-01-01

RNase-resistant, noninfectious virus-like particles containing exogenous RNA sequences (armored RNA) are good candidates as RNA controls and standards in RNA virus detection. However, the length of RNA packaged in the virus-like particles with high efficiency is usually less than 500 bases. In this study, we describe a method for producing armored L-RNA. Armored L-RNA is a complex of MS2 bacteriophage coat protein and RNA produced in Escherichia coli by the induction of a two-plasmid coexpression system in which the coat protein and maturase are expressed from one plasmid and the target RNA sequence with modified MS2 stem-loop (pac site) is transcribed from another plasmid. A 3V armored L-RNA of 2,248 bases containing six gene fragments—hepatitis C virus, severe acute respiratory syndrome coronavirus (SARS-CoV1, SARS-CoV2, and SARS-CoV3), avian influenza virus matrix gene (M300), and H5N1 avian influenza virus (HA300)—was successfully expressed by the two-plasmid coexpression system and was demonstrated to have all of the characteristics of armored RNA. We evaluated the 3V armored L-RNA as a calibrator for multiple virus assays. We used the WHO International Standard for HCV RNA (NIBSC 96/790) to calibrate the chimeric armored L-RNA, which was diluted by 10-fold serial dilutions to obtain samples containing 106 to 102 copies. In conclusion, the approach we used for armored L-RNA preparation is practical and could reduce the labor and cost of quality control in multiplex RNA virus assays. Furthermore, we can assign the chimeric armored RNA with an international unit for quantitative detection. PMID:18305135
Identification and application of self-binding zipper-like sequences in SARS-CoV spike protein.

PubMed

Zhang, Si Min; Liao, Ying; Neo, Tuan Ling; Lu, Yanning; Liu, Ding Xiang; Vahlne, Anders; Tam, James P

2018-05-22

Self-binding peptides containing zipper-like sequences, such as the Leu/Ile zipper sequence within the coiled coil regions of proteins and the cross-β spine steric zippers within the amyloid-like fibrils, could bind to the protein-of-origin through homophilic sequence-specific zipper motifs. These self-binding sequences represent opportunities for the development of biochemical tools and/or therapeutics. Here, we report on the identification of a putative self-binding β-zipper-forming peptide within the severe acute respiratory syndrome-associated coronavirus spike (S) protein and its application in viral detection. Peptide array scanning of overlapping peptides covering the entire length of S protein identified 34 putative self-binding peptides of six clusters, five of which contained octapeptide core consensus sequences. The Cluster I consensus octapeptide sequence GINITNFR was predicted by the Eisenberg's 3D profile method to have high amyloid-like fibrillation potential through steric β-zipper formation. Peptide C6 containing the Cluster I consensus sequence was shown to oligomerize and form amyloid-like fibrils. Taking advantage of this, C6 was further applied to detect the S protein expression in vitro by fluorescence staining. Meanwhile, the coiled-coil-forming Leu/Ile heptad repeat sequences within the S protein were under-represented during peptide array scanning, in agreement with that long peptide lengths were required to attain high helix-mediated interaction avidity. The data suggest that short β-zipper-like self-binding peptides within the S protein could be identified through combining the peptide scanning and predictive methods, and could be exploited as biochemical detection reagents for viral infection. Copyright © 2018. Published by Elsevier Ltd.
Sequence diversity within the reovirus S2 gene: reovirus genes reassort in nature, and their termini are predicted to form a panhandle motif.

PubMed Central

Chapell, J D; Goral, M I; Rodgers, S E; dePamphilis, C W; Dermody, T S

1994-01-01

To better understand genetic diversity within mammalian reoviruses, we determined S2 nucleotide and deduced sigma 2 amino acid sequences of nine reovirus strains and compared these sequences with those of prototype strains of the three reovirus serotypes. The S2 gene and sigma 2 protein are highly conserved among the four type 1, one type 2, and seven type 3 strains studied. Phylogenetic analyses based on S2 nucleotide sequences of the 12 reovirus strains indicate that diversity within the S2 gene is independent of viral serotype. Additionally, we found marked topological differences between phylogenetic trees generated from S1 and S2 gene nucleotide sequences of the seven type 3 strains. These results demonstrate that reovirus S1 and S2 genes have distinct evolutionary histories, thus providing phylogenetic evidence for lateral transfer of reovirus genes in nature. When variability among the 12 sigma 2-encoding S2 nucleotide sequences was analyzed at synonymous positions, we found that approximately 60 nucleotides at the 5' terminus and 30 nucleotides at the 3' terminus were markedly conserved in comparison with other sigma 2-encoding regions of S2. Predictions of RNA secondary structures indicate that the more conserved S2 sequences participate in the formation of an extended region of duplex RNA interrupted by a pair of stem-loops. Among the 12 deduced sigma 2 amino acid sequences examined, substitutions were observed at only 11% of amino acid positions. This finding suggests that constraints on the structure or function of sigma 2, perhaps in part because of its location in the virion core, have limited sequence diversity within this protein. PMID:8289378
Analysis of selected genes associated with cardiomyopathy by next-generation sequencing.

PubMed

Szabadosova, Viktoria; Boronova, Iveta; Ferenc, Peter; Tothova, Iveta; Bernasovska, Jarmila; Zigova, Michaela; Kmec, Jan; Bernasovsky, Ivan

2018-02-01

As the leading cause of congestive heart failure, cardiomyopathy represents a heterogenous group of heart muscle disorders. Despite considerable progress being made in the genetic diagnosis of cardiomyopathy by detection of the mutations in the most prevalent cardiomyopathy genes, the cause remains unsolved in many patients. High-throughput mutation screening in the disease genes for cardiomyopathy is now possible because of using target enrichment followed by next-generation sequencing. The aim of the study was to analyze a panel of genes associated with dilated or hypertrophic cardiomyopathy based on previously published results in order to identify the subjects at risk. The method of next-generation sequencing by IlluminaHiSeq 2500 platform was used to detect sequence variants in 16 individuals diagnosed with dilated or hypertrophic cardiomyopathy. Detected variants were filtered and the functional impact of amino acid changes was predicted by computational programs. DNA samples of the 16 patients were analyzed by whole exome sequencing. We identified six nonsynonymous variants that were shown to be pathogenic in all used prediction softwares: rs3744998 (EPG5), rs11551768 (MGME1), rs148374985 (MURC), rs78461695 (PLEC), rs17158558 (RET) and rs2295190 (SYNE1). Two of the analyzed sequence variants had minor allele frequency (MAF)<0.01: rs148374985 (MURC), rs34580776 (MYBPC3). Our data support the potential role of the detected variants in pathogenesis of dilated or hypertrophic cardiomyopathy; however, the possibility that these variants might not be true disease-causing variants but are susceptibility alleles that require additional mutations or injury to cause the clinical phenotype of disease must be considered. © 2017 Wiley Periodicals, Inc.
Molecular evolution of the actin-like MreB protein gene family in wall-less bacteria.

PubMed

Ku, Chuan; Lo, Wen-Sui; Kuo, Chih-Horng

2014-04-18

The mreB gene family encodes actin-like proteins that determine cell shape by directing cell wall synthesis and often exists in one to three copies in the genomes of non-spherical bacteria. Intriguingly, while most wall-less bacteria do not have this gene, five to seven mreB homologs are found in Spiroplasma and Haloplasma, which are both characterized by cell contractility. To investigate the molecular evolution of this gene family in wall-less bacteria, we sampled the available genome sequences from these two genera and other related lineages for comparative analysis. The gene phylogenies indicated that the mreB homologs in Haloplasma are more closely related to those in Firmicutes, whereas those in Spiroplasma form a separate clade. This finding suggests that the gene family expansions in these two lineages are the results of independent ancient duplications. Moreover, the Spiroplasma mreB homologs can be classified into five clades, of which the genomic positions are largely conserved. The inference of gene gains and losses suggests that there has been an overall trend to retain only one homolog from each of the five mreB clades in the evolutionary history of Spiroplasma. Copyright © 2014 The Authors. Published by Elsevier Inc. All rights reserved.
Efficacy of Exome-Targeted Capture Sequencing to Detect Mutations in Known Cerebellar Ataxia Genes.

PubMed

Coutelier, Marie; Hammer, Monia B; Stevanin, Giovanni; Monin, Marie-Lorraine; Davoine, Claire-Sophie; Mochel, Fanny; Labauge, Pierre; Ewenczyk, Claire; Ding, Jinhui; Gibbs, J Raphael; Hannequin, Didier; Melki, Judith; Toutain, Annick; Laugel, Vincent; Forlani, Sylvie; Charles, Perrine; Broussolle, Emmanuel; Thobois, Stéphane; Afenjar, Alexandra; Anheim, Mathieu; Calvas, Patrick; Castelnovo, Giovanni; de Broucker, Thomas; Vidailhet, Marie; Moulignier, Antoine; Ghnassia, Robert T; Tallaksen, Chantal; Mignot, Cyril; Goizet, Cyril; Le Ber, Isabelle; Ollagnon-Roman, Elisabeth; Pouget, Jean; Brice, Alexis; Singleton, Andrew; Durr, Alexandra

2018-05-01

Molecular diagnosis is difficult to achieve in disease groups with a highly heterogeneous genetic background, such as cerebellar ataxia (CA). In many patients, candidate gene sequencing or focused resequencing arrays do not allow investigators to reach a genetic conclusion. To assess the efficacy of exome-targeted capture sequencing to detect mutations in genes broadly linked to CA in a large cohort of undiagnosed patients and to investigate their prevalence. Three hundred nineteen index patients with CA and without a history of dominant transmission were included in the this cohort study by the Spastic Paraplegia and Ataxia Network. Centralized storage was in the DNA and cell bank of the Brain and Spine Institute, Salpetriere Hospital, Paris, France. Patients were classified into 6 clinical groups, with the largest being those with spastic ataxia (ie, CA with pyramidal signs [n = 100]). Sequencing was performed from January 1, 2014, through December 31, 2016. Detected variants were classified as very probably or definitely causative, possibly causative, or of unknown significance based on genetic evidence and genotype-phenotype considerations. Identification of variants in genes broadly linked to CA, classified in pathogenicity groups. The 319 included patients had equal sex distribution (160 female [50.2%] and 159 male patients [49.8%]; mean [SD] age at onset, 27.9 [18.6] years). The age at onset was younger than 25 years for 131 of 298 patients (44.0%) with complete clinical information. Consanguinity was present in 101 of 298 (33.9%). Very probable or definite diagnoses were achieved for 72 patients (22.6%), with an additional 19 (6.0%) harboring possibly pathogenic variants. The most frequently mutated genes were SPG7 (n = 14), SACS (n = 8), SETX (n = 7), SYNE1 (n = 6), and CACNA1A (n = 6). The highest diagnostic rate was obtained for patients with an autosomal recessive CA with oculomotor apraxia-like phenotype (6 of 17 [35.3%]) or
Extraordinary Sequence Divergence at Tsga8, an X-linked Gene Involved in Mouse Spermiogenesis

PubMed Central

Good, Jeffrey M.; Vanderpool, Dan; Smith, Kimberly L.; Nachman, Michael W.

2011-01-01

The X chromosome plays an important role in both adaptive evolution and speciation. We used a molecular evolutionary screen of X-linked genes potentially involved in reproductive isolation in mice to identify putative targets of recurrent positive selection. We then sequenced five very rapidly evolving genes within and between several closely related species of mice in the genus Mus. All five genes were involved in male reproduction and four of the genes showed evidence of recurrent positive selection. The most remarkable evolutionary patterns were found at Testis-specific gene a8 (Tsga8), a spermatogenesis-specific gene expressed during postmeiotic chromatin condensation and nuclear transformation. Tsga8 was characterized by extremely high levels of insertion–deletion variation of an alanine-rich repetitive motif in natural populations of Mus domesticus and M. musculus, differing in length from the reference mouse genome by up to 89 amino acids (27% of the total protein length). This population-level variation was coupled with striking divergence in protein sequence and length between closely related mouse species. Although no clear orthologs had previously been described for Tsga8 in other mammalian species, we have identified a highly divergent hypothetical gene on the rat X chromosome that shares clear orthology with the 5′ and 3′ ends of Tsga8. Further inspection of this ortholog verified that it is expressed in rat testis and shares remarkable similarity with mouse Tsga8 across several general features of the protein sequence despite no conservation of nucleotide sequence across over 60% of the rat-coding domain. Overall, Tsga8 appears to be one of the most rapidly evolving genes to have been described in rodents. We discuss the potential evolutionary causes and functional implications of this extraordinary divergence and the possible contribution of Tsga8 and the other four genes we examined to reproductive isolation in mice. PMID:21186189
Massively Parallel Sequencing Detected a Mutation in the MFN2 Gene Missed by Sanger Sequencing Due to a Primer Mismatch on an SNP Site.

PubMed

Neupauerová, Jana; Grečmalová, Dagmar; Seeman, Pavel; Laššuthová, Petra

2016-05-01

We describe a patient with early onset severe axonal Charcot-Marie-Tooth disease (CMT2) with dominant inheritance, in whom Sanger sequencing failed to detect a mutation in the mitofusin 2 (MFN2) gene because of a single nucleotide polymorphism (rs2236057) under the PCR primer sequence. The severe early onset phenotype and the family history with severely affected mother (died after delivery) was very suggestive of CMT2A and this suspicion was finally confirmed by a MFN2 mutation. The mutation p.His361Tyr was later detected in the patient by massively parallel sequencing with a gene panel for hereditary neuropathies. According to this information, new primers for amplification and sequencing were designed which bind away from the polymorphic sites of the patient's DNA. Sanger sequencing with these new primers then confirmed the heterozygous mutation in the MFN2 gene in this patient. This case report shows that massively parallel sequencing may in some rare cases be more sensitive than Sanger sequencing and highlights the importance of accurate primer design which requires special attention. © 2016 John Wiley & Sons Ltd/University College London.
An enhancer-like region regulates hrp3 promoter stage-specific gene expression in the human malaria parasite Plasmodium falciparum

PubMed Central

López-Estraño, Carlos; Gopalakrishnan, Anusha M.; Semblat, Jean-Philippe; Fergus, M. Ross; Mazier, Dominique; Haldar, Kasturi

2008-01-01

The asexual blood stage of Plasmodium falciparum is comprised of morphologically distinct ring, trophozoite and schizont stages. Each of these developmental stages possesses a distinct pattern of gene expression. Regulation of P. falciparum gene expression is thought to occur, at least in part, at the promoter level. Previously, we have found that although the RNA of the P. falciparum hrp3 gene is only seen in ring-stage parasites, deletion of a specific sequensce in the 5’ end of the promoter region decreased ring-stage expression of hrp3 and enabled detection of its transcripts in trophozoite-stage parasites. In order to investigate this stage specific regulation of gene expression, we employed a series of nested deletions of the 1.7-kb hrp3 promoter. Firefly luciferase gene was used as a reporter to evaluate the role of promoter sequences in gene regulation. Using this approach, we identified a ring-stage specific regulatory region on the hrp3 promoter located between -1.7-kb and -1.1-kb from the ATG initiation codon. Small 100–150 bp truncations on this enhancer-like region failed to uncover discrete regulatory sequences, suggesting the multipartite nature of this element. The data presented in this study demonstrates that stage specific promoter activity of the hrp3 gene in P. falciparum blood stage parasites is supported, at least in-part, by a small promoter region that can function in the absence of a larger chromosomal context. PMID:17570541
A massive parallel sequencing workflow for diagnostic genetic testing of mismatch repair genes

PubMed Central

Hansen, Maren F; Neckmann, Ulrike; Lavik, Liss A S; Vold, Trine; Gilde, Bodil; Toft, Ragnhild K; Sjursen, Wenche

2014-01-01

The purpose of this study was to develop a massive parallel sequencing (MPS) workflow for diagnostic analysis of mismatch repair (MMR) genes using the GS Junior system (Roche). A pathogenic variant in one of four MMR genes, (MLH1, PMS2, MSH6, and MSH2), is the cause of Lynch Syndrome (LS), which mainly predispose to colorectal cancer. We used an amplicon-based sequencing method allowing specific and preferential amplification of the MMR genes including PMS2, of which several pseudogenes exist. The amplicons were pooled at different ratios to obtain coverage uniformity and maximize the throughput of a single-GS Junior run. In total, 60 previously identified and distinct variants (substitutions and indels), were sequenced by MPS and successfully detected. The heterozygote detection range was from 19% to 63% and dependent on sequence context and coverage. We were able to distinguish between false-positive and true-positive calls in homopolymeric regions by cross-sample comparison and evaluation of flow signal distributions. In addition, we filtered variants according to a predefined status, which facilitated variant annotation. Our study shows that implementation of MPS in routine diagnostics of LS can accelerate sample throughput and reduce costs without compromising sensitivity, compared to Sanger sequencing. PMID:24689082
Comparison of the aflR gene sequences of strains in Aspergillus section Flavi.

PubMed

Lee, Chao-Zong; Liou, Guey-Yuh; Yuan, Gwo-Fang

2006-01-01

Aflatoxins are polyketide-derived secondary metabolites produced by Aspergillus parasiticus, Aspergillus flavus, Aspergillus nomius and a few other species. The toxic effects of aflatoxins have adverse consequences for human health and agricultural economics. The aflR gene, a regulatory gene for aflatoxin biosynthesis, encodes a protein containing a zinc-finger DNA-binding motif. Although Aspergillus oryzae and Aspergillus sojae, which are used in fermented foods and in ingredient manufacture, have no record of producing aflatoxin, they have been shown to possess an aflR gene. This study examined 34 strains of Aspergillus section Flavi. The aflR gene of 23 of these strains was successfully amplified and sequenced. No aflR PCR products were found in five A. sojae strains or six strains of A. oryzae. These PCR results suggested that the aflR gene is absent or significantly different in some A. sojae and A. oryzae strains. The sequenced aflR genes from the 23 positive strains had greater than 96.6 % similarity, which was particularly conserved in the zinc-finger DNA-binding domain. The aflR gene of A. sojae has two obvious characteristics: an extra CTCATG sequence fragment and a C to T transition that causes premature termination of AFLR protein synthesis. Differences between A. parasiticus/A. sojae and A. flavus/A. oryzae aflR genes were also identified. Some strains of A. flavus as well as A. flavus var. viridis, A. oryzae var. viridis and A. oryzae var. effuses have an A. oryzae-type aflR gene. For all strains with the A. oryzae-type aflR gene, there was no evidence of aflatoxin production. It is suggested that for safety reasons, the aflR gene could be examined to assess possible aflatoxin production by Aspergillus section Flavi strains.
Dose-sensitivity, conserved non-coding sequences, and duplicate gene retention through multiple tetraploidies in the grasses.

PubMed

Schnable, James C; Pedersen, Brent S; Subramaniam, Sabarinath; Freeling, Michael

2011-01-01

Whole genome duplications, or tetraploidies, are an important source of increased gene content. Following whole genome duplication, duplicate copies of many genes are lost from the genome. This loss of genes is biased both in the classes of genes deleted and the subgenome from which they are lost. Many or all classes are genes preferentially retained as duplicate copies are engaged in dose sensitive protein-protein interactions, such that deletion of any one duplicate upsets the status quo of subunit concentrations, and presumably lowers fitness as a result. Transcription factors are also preferentially retained following every whole genome duplications studied. This has been explained as a consequence of protein-protein interactions, just as for other highly retained classes of genes. We show that the quantity of conserved noncoding sequences (CNSs) associated with genes predicts the likelihood of their retention as duplicate pairs following whole genome duplication. As many CNSs likely represent binding sites for transcriptional regulators, we propose that the likelihood of gene retention following tetraploidy may also be influenced by dose-sensitive protein-DNA interactions between the regulatory regions of CNS-rich genes - nicknamed bigfoot genes - and the proteins that bind to them. Using grass genomes, we show that differential loss of CNSs from one member of a pair following the pre-grass tetraploidy reduces its chance of retention in the subsequent maize lineage tetraploidy.
Dose–Sensitivity, Conserved Non-Coding Sequences, and Duplicate Gene Retention Through Multiple Tetraploidies in the Grasses

PubMed Central

Schnable, James C.; Pedersen, Brent S.; Subramaniam, Sabarinath; Freeling, Michael

2011-01-01

Whole genome duplications, or tetraploidies, are an important source of increased gene content. Following whole genome duplication, duplicate copies of many genes are lost from the genome. This loss of genes is biased both in the classes of genes deleted and the subgenome from which they are lost. Many or all classes are genes preferentially retained as duplicate copies are engaged in dose sensitive protein–protein interactions, such that deletion of any one duplicate upsets the status quo of subunit concentrations, and presumably lowers fitness as a result. Transcription factors are also preferentially retained following every whole genome duplications studied. This has been explained as a consequence of protein–protein interactions, just as for other highly retained classes of genes. We show that the quantity of conserved noncoding sequences (CNSs) associated with genes predicts the likelihood of their retention as duplicate pairs following whole genome duplication. As many CNSs likely represent binding sites for transcriptional regulators, we propose that the likelihood of gene retention following tetraploidy may also be influenced by dose–sensitive protein–DNA interactions between the regulatory regions of CNS-rich genes – nicknamed bigfoot genes – and the proteins that bind to them. Using grass genomes, we show that differential loss of CNSs from one member of a pair following the pre-grass tetraploidy reduces its chance of retention in the subsequent maize lineage tetraploidy. PMID:22645525
Sequence determination and analysis of the NSs genes of two tospoviruses.

PubMed

Hallwass, Mariana; Leastro, Mikhail O; Lima, Mirtes F; Inoue-Nagata, Alice K; Resende, Renato O

2012-03-01

The tospoviruses groundnut ringspot virus (GRSV) and zucchini lethal chlorosis virus (ZLCV) cause severe losses in many crops, especially in solanaceous and cucurbit species. In this study, the non-structural NSs gene and the 5'UTRs of these two biologically distinct tospoviruses were cloned and sequenced. The NSs sequence of GRSV and ZLCV were both 1,404 nucleotides long. Pairwise comparison showed that the NSs amino acid sequence of GRSV shared 69.6% identity with that of ZLCV and 75.9% identity with that of TSWV, while the NSs sequence of ZLCV and TSWV shared 67.9% identity. Phylogenetic analysis based on NSs sequences confirmed that these viruses cluster in the American clade.
Cis-acting sequences from a human surfactant protein gene confer pulmonary-specific gene expression in transgenic mice

DOE Office of Scientific and Technical Information (OSTI.GOV)

Korfhagen, T.R.; Glasser, S.W.; Wert, S.E.

1990-08-01

Pulmonary surfactant is produced in late gestation by developing type II epithelial cells lining the alveolar epithelium of the lung. Lack of surfactant at birth is associated with respiratory distress syndrome in premature infants. Surfactant protein C (SP-C) is a highly hydrophobic peptide isolated from pulmonary tissue that enhances the biophysical activity of surfactant phospholipids. Like surfactant phospholipid, SP-C is produced by epithelial cells in the distal respiratory epithelium, and its expression increases during the latter part of gestation. A chimeric gene containing 3.6 kilobases of the promoter and 5{prime}-flanking sequences of the human SP-C gene was used to expressmore » diphtheria toxin A. The SP-C-diphtheria toxin A fusion gene was injected into fertilized mouse eggs to produce transgenic mice. Affected mice developed respiratory failure in the immediate postnatal period. Morphologic analysis of lungs from affected pups showed variable but severe cellular injury confined to pulmonary tissues. Ultrastructural changes consistent with cell death and injury were prominent in the distal respiratory epithelium. Proximal components of the tracheobronchial tree were not severely affected. Transgenic animals were of normal size at birth, and structural abnormalities were not detected in nonpulmonary tissues. Lung-specific diphtheria toxin A expression controlled by the human SP-C gene injured type II epithelial cells and caused extensive necrosis of the distal respiratory epithelium. The absence of type I epithelial cells in the most severely affected transgenic animals supports the concept that developing type II cells serve as precursors to type I epithelial cells.« less
Deep developmental transcriptome sequencing uncovers numerous new genes and enhances gene annotation in the sponge Amphimedon queenslandica.

PubMed

Fernandez-Valverde, Selene L; Calcino, Andrew D; Degnan, Bernard M

2015-05-15

The demosponge Amphimedon queenslandica is amongst the few early-branching metazoans with an assembled and annotated draft genome, making it an important species in the study of the origin and early evolution of animals. Current gene models in this species are largely based on in silico predictions and low coverage expressed sequence tag (EST) evidence. Amphimedon queenslandica protein-coding gene models are improved using deep RNA-Seq data from four developmental stages and CEL-Seq data from 82 developmental samples. Over 86% of previously predicted genes are retained in the new gene models, although 24% have additional exons; there is also a marked increase in the total number of annotated 3' and 5' untranslated regions (UTRs). Importantly, these new developmental transcriptome data reveal numerous previously unannotated protein-coding genes in the Amphimedon genome, increasing the total gene number by 25%, from 30,060 to 40,122. In general, Amphimedon genes have introns that are markedly smaller than those in other animals and most of the alternatively spliced genes in Amphimedon undergo intron-retention; exon-skipping is the least common mode of alternative splicing. Finally, in addition to canonical polyadenylation signal sequences, Amphimedon genes are enriched in a number of unique AT-rich motifs in their 3' UTRs. The inclusion of developmental transcriptome data has substantially improved the structure and composition of protein-coding gene models in Amphimedon queenslandica, providing a more accurate and comprehensive set of genes for functional and comparative studies. These improvements reveal the Amphimedon genome is comprised of a remarkably high number of tightly packed genes. These genes have small introns and there is pervasive intron retention amongst alternatively spliced transcripts. These aspects of the sponge genome are more similar unicellular opisthokont genomes than to other animal genomes.

A Bayesian taxonomic classification method for 16S rRNA gene sequences with improved species-level accuracy.

PubMed

Gao, Xiang; Lin, Huaiying; Revanna, Kashi; Dong, Qunfeng

2017-05-10

Species-level classification for 16S rRNA gene sequences remains a serious challenge for microbiome researchers, because existing taxonomic classification tools for 16S rRNA gene sequences either do not provide species-level classification, or their classification results are unreliable. The unreliable results are due to the limitations in the existing methods which either lack solid probabilistic-based criteria to evaluate the confidence of their taxonomic assignments, or use nucleotide k-mer frequency as the proxy for sequence similarity measurement. We have developed a method that shows significantly improved species-level classification results over existing methods. Our method calculates true sequence similarity between query sequences and database hits using pairwise sequence alignment. Taxonomic classifications are assigned from the species to the phylum levels based on the lowest common ancestors of multiple database hits for each query sequence, and further classification reliabilities are evaluated by bootstrap confidence scores. The novelty of our method is that the contribution of each database hit to the taxonomic assignment of the query sequence is weighted by a Bayesian posterior probability based upon the degree of sequence similarity of the database hit to the query sequence. Our method does not need any training datasets specific for different taxonomic groups. Instead only a reference database is required for aligning to the query sequences, making our method easily applicable for different regions of the 16S rRNA gene or other phylogenetic marker genes. Reliable species-level classification for 16S rRNA or other phylogenetic marker genes is critical for microbiome research. Our software shows significantly higher classification accuracy than the existing tools and we provide probabilistic-based confidence scores to evaluate the reliability of our taxonomic classification assignments based on multiple database matches to query sequences. Despite
Analysis of resistance genes of clinical Pannonibacter phragmitetus strain 31801 by complete genome sequencing.

PubMed

Ming, De-Song; Chen, Qing-Qing; Chen, Xiao-Tin

2018-05-14

To clarify the resistance mechanisms of Pannonibacter phragmitetus 31801, isolated from the blood of a liver abscess patient, at the genomic level, we performed whole genomic sequencing using a PacBio RS II single-molecule real-time long-read sequencer. Bioinformatic analysis of the resulting sequence was then carried out to identify any possible resistance genes. Analyses included Basic Local Alignment Search Tool searches against the Antibiotic Resistance Genes Database, ResFinder analysis of the genome sequence, and Resistance Gene Identifier analysis within the Comprehensive Antibiotic Resistance Database. Prophages, clustered regularly interspaced short palindromic repeats (CRISPR), and other putative virulence factors were also identified using PHAST, CRISPRfinder, and the Virulence Factors Database, respectively. The circular chromosome and single plasmid of P. phragmitetus 31801 contained multiple antibiotic resistance genes, including those coding for three different types of β-lactamase [NPS β-lactamase (EC 3.5.2.6), β-lactamase class C, and a metal-dependent hydrolase of β-lactamase superfamily I]. In addition, genes coding for subunits of several multidrug-resistance efflux pumps were identified, including those targeting macrolides (adeJ, cmeB), tetracycline (acrB, adeAB), fluoroquinolones (acrF, ceoB), and aminoglycosides (acrD, amrB, ceoB, mexY, smeB). However, apart from the tripartite macrolide efflux pump macAB-tolC, the genome did not appear to contain the complete complement of subunit genes required for production of most of the major multidrug-resistance efflux pumps.
The study of two barley Type I-like MADS-box genes as potential targets of epigenetic regulation during seed development

PubMed Central

2012-01-01

Background MADS-box genes constitute a large family of transcription factors functioning as key regulators of many processes during plant vegetative and reproductive development. Type II MADS-box genes have been intensively investigated and are mostly involved in vegetative and flowering development. A growing number of studies of Type I MADS-box genes in Arabidopsis, have assigned crucial roles for these genes in gamete and seed development and have demonstrated that a number of Type I MADS-box genes are epigenetically regulated by DNA methylation and histone modifications. However, reports on agronomically important cereals such as barley and wheat are scarce. Results Here we report the identification and characterization of two Type I-like MADS-box genes, from barley (Hordeum vulgare), a monocot cereal crop of high agronomic importance. Protein sequence and phylogenetic analysis showed that the putative proteins are related to Type I MADS-box proteins, and classified them in a distinct cereal clade. Significant differences in gene expression among seed developmental stages and between barley cultivars with varying seed size were revealed for both genes. One of these genes was shown to be induced by the seed development- and stress-related hormones ABA and JA whereas in situ hybridizations localized the other gene to specific endosperm sub-compartments. The genomic organization of the latter has high conservation with the cereal Type I-like MADS-box homologues and the chromosomal position of both genes is close to markers associated with seed quality traits. DNA methylation differences are present in the upstream and downstream regulatory regions of the barley Type I-like MADS-box genes in two different developmental stages and in response to ABA treatment which may be associated with gene expression differences. Conclusions Two barley MADS-box genes were studied that are related to Type I MADS-box genes. Differential expression in different seed developmental
The complete chloroplast genome sequence of the chlorophycean green alga Scenedesmus obliquus reveals a compact gene organization and a biased distribution of genes on the two DNA strands

PubMed Central

de Cambiaire, Jean-Charles; Otis, Christian; Lemieux, Claude; Turmel, Monique

2006-01-01

Background The phylum Chlorophyta contains the majority of the green algae and is divided into four classes. While the basal position of the Prasinophyceae is well established, the divergence order of the Ulvophyceae, Trebouxiophyceae and Chlorophyceae (UTC) remains uncertain. The five complete chloroplast DNA (cpDNA) sequences currently available for representatives of these classes display considerable variability in overall structure, gene content, gene density, intron content and gene order. Among these genomes, that of the chlorophycean green alga Chlamydomonas reinhardtii has retained the least ancestral features. The two single-copy regions, which are separated from one another by the large inverted repeat (IR), have similar sizes, rather than unequal sizes, and differ radically in both gene contents and gene organizations relative to the single-copy regions of prasinophyte and ulvophyte cpDNAs. To gain insights into the various changes that underwent the chloroplast genome during the evolution of chlorophycean green algae, we have sequenced the cpDNA of Scenedesmus obliquus, a member of a distinct chlorophycean lineage. Results The 161,452 bp IR-containing genome of Scenedesmus features single-copy regions of similar sizes, encodes 96 genes, i.e. only two additional genes (infA and rpl12) relative to its Chlamydomonas homologue and contains seven group I and two group II introns. It is clearly more compact than the four UTC algal cpDNAs that have been examined so far, displays the lowest proportion of short repeats among these algae and shows a stronger bias in clustering of genes on the same DNA strand compared to Chlamydomonas cpDNA. Like the latter genome, Scenedesmus cpDNA displays only a few ancestral gene clusters. The two chlorophycean genomes share 11 gene clusters that are not found in previously sequenced trebouxiophyte and ulvophyte cpDNAs as well as a few genes that have an unusual structure; however, their single-copy regions differ
CLINICAL PROGRESS IN INHERITED RETINAL DEGENERATIONS: GENE THERAPY CLINICAL TRIALS AND ADVANCES IN GENETIC SEQUENCING.

PubMed

Hafler, Brian P

2017-03-01

Inherited retinal dystrophies are a significant cause of vision loss and are characterized by the loss of photoreceptors and the retinal pigment epithelium (RPE). Mutations in approximately 250 genes cause inherited retinal degenerations with a high degree of genetic heterogeneity. New techniques in next-generation sequencing are allowing the comprehensive analysis of all retinal disease genes thus changing the approach to the molecular diagnosis of inherited retinal dystrophies. This review serves to analyze clinical progress in genetic diagnostic testing and implications for retinal gene therapy. A literature search of PubMed and OMIM was conducted to relevant articles in inherited retinal dystrophies. Next-generation genetic sequencing allows the simultaneous analysis of all the approximately 250 genes that cause inherited retinal dystrophies. Reported diagnostic rates range are high and range from 51% to 57%. These new sequencing tools are highly accurate with sensitivities of 97.9% and specificities of 100%. Retinal gene therapy clinical trials are underway for multiple genes including RPE65, ABCA4, CHM, RS1, MYO7A, CNGA3, CNGB3, ND4, and MERTK for which a molecular diagnosis may be beneficial for patients. Comprehensive next-generation genetic sequencing of all retinal dystrophy genes is changing the paradigm for how retinal specialists perform genetic testing for inherited retinal degenerations. Not only are high diagnostic yields obtained, but mutations in genes with novel clinical phenotypes are also identified. In the era of retinal gene therapy clinical trials, identifying specific genetic defects will increasingly be of use to identify patients who may enroll in clinical studies and benefit from novel therapies.
Preparative purification of a high-mannose type N-glycan from soy bean agglutinin by hydrazinolysis and tyrosinamide derivatization.

PubMed

Evers, D L; Hung, R L; Thomas, V H; Rice, K G

1998-12-15

The N-linked oligosaccharide from soy bean agglutinin (Man9) was isolated on a preparative scale following derivatization with Boc-tyrosine. The procedure utilized preparative hydrazinolysis to release the oligosaccharide and yielded multi-micromol quantities of Boc-tyrosine-Man9 which was characterized by 1H NMR and ES-MS. Copyright 1998 Academic Press.
The Repeat Sequences and Elevated Substitution Rates of the Chloroplast accD Gene in Cupressophytes

PubMed Central

Li, Jia; Su, Yingjuan; Wang, Ting

2018-01-01

The plastid accD gene encodes a subunit of the acetyl-CoA carboxylase (ACCase) enzyme. The length of accD gene has been supposed to expand in Cryptomeria japonica, Taiwania cryptomerioides, Cephalotaxus, Taxus chinensis, and Podocarpus lambertii, and the main reason for this phenomenon was the existence of tandemly repeated sequences. However, it is still unknown whether the accD gene length in other cupressophytes has expanded. Here, in order to investigate how widespread this phenomenon was, 18 accD sequences and its surrounding regions of cupressophyte were sequenced and analyzed. Together with 39 GenBank sequence data, our taxon sampling covered all the extant gymnosperm orders. The repetitive elements and substitution rates of accD among 57 gymnosperm species were analyzed, the results show: (1) Reading frame length of accD gene in 18 cupressophytes species has also expanded. (2) Many repetitive elements were identified in accD gene of cupressophyte lineages. (3) The synonymous and non-synonymous substitution rates of accD were accelerated in cupressophytes. (4) accD was located in rearrangement endpoints. These results suggested that repetitive elements may mediate the chloroplast genome rearrangement and accelerated the substitution rates. PMID:29731764
Rice MEL2, the RNA recognition motif (RRM) protein, binds in vitro to meiosis-expressed genes containing U-rich RNA consensus sequences in the 3'-UTR.

PubMed

Miyazaki, Saori; Sato, Yutaka; Asano, Tomoya; Nagamura, Yoshiaki; Nonomura, Ken-Ichi

2015-10-01

Post-transcriptional gene regulation by RNA recognition motif (RRM) proteins through binding to cis-elements in the 3'-untranslated region (3'-UTR) is widely used in eukaryotes to complete various biological processes. Rice MEIOSIS ARRESTED AT LEPTOTENE2 (MEL2) is the RRM protein that functions in the transition to meiosis in proper timing. The MEL2 RRM preferentially associated with the U-rich RNA consensus, UUAGUU[U/A][U/G][A/U/G]U, dependently on sequences and proportionally to MEL2 protein amounts in vitro. The consensus sequences were located in the putative looped structures of the RNA ligand. A genome-wide survey revealed a tendency of MEL2-binding consensus appearing in 3'-UTR of rice genes. Of 249 genes that conserved the consensus in their 3'-UTR, 13 genes spatiotemporally co-expressed with MEL2 in meiotic flowers, and included several genes whose function was supposed in meiosis; such as Replication protein A and OsMADS3. The proteome analysis revealed that the amounts of small ubiquitin-related modifier-like protein and eukaryotic translation initiation factor3-like protein were dramatically altered in mel2 mutant anthers. Taken together with transcriptome and gene ontology results, we propose that the rice MEL2 is involved in the translational regulation of key meiotic genes on 3'-UTRs to achieve the faithful transition of germ cells to meiosis.
Quantitative statistical analysis of cis-regulatory sequences in ABA/VP1- and CBF/DREB1-regulated genes of Arabidopsis.

PubMed

Suzuki, Masaharu; Ketterling, Matthew G; McCarty, Donald R

2005-09-01

We have developed a simple quantitative computational approach for objective analysis of cis-regulatory sequences in promoters of coregulated genes. The program, designated MotifFinder, identifies oligo sequences that are overrepresented in promoters of coregulated genes. We used this approach to analyze promoter sequences of Viviparous1 (VP1)/abscisic acid (ABA)-regulated genes and cold-regulated genes, respectively, of Arabidopsis (Arabidopsis thaliana). We detected significantly enriched sequences in up-regulated genes but not in down-regulated genes. This result suggests that gene activation but not repression is mediated by specific and common sequence elements in promoters. The enriched motifs include several known cis-regulatory sequences as well as previously unidentified motifs. With respect to known cis-elements, we dissected the flanking nucleotides of the core sequences of Sph element, ABA response elements (ABREs), and the C repeat/dehydration-responsive element. This analysis identified the motif variants that may correlate with qualitative and quantitative differences in gene expression. While both VP1 and cold responses are mediated in part by ABA signaling via ABREs, these responses correlate with unique ABRE variants distinguished by nucleotides flanking the ACGT core. ABRE and Sph motifs are tightly associated uniquely in the coregulated set of genes showing a strict dependence on VP1 and ABA signaling. Finally, analysis of distribution of the enriched sequences revealed a striking concentration of enriched motifs in a proximal 200-base region of VP1/ABA and cold-regulated promoters. Overall, each class of coregulated genes possesses a discrete set of the enriched motifs with unique distributions in their promoters that may account for the specificity of gene regulation.
Whole exome sequencing reveals concomitant mutations of multiple FA genes in individual Fanconi anemia patients

PubMed Central

2014-01-01

Background Fanconi anemia (FA) is a rare inherited genetic syndrome with highly variable clinical manifestations. Fifteen genetic subtypes of FA have been identified. Traditional complementation tests for grouping studies have been used generally in FA patients and in stepwise methods to identify the FA type, which can result in incomplete genetic information from FA patients. Methods We diagnosed five pediatric patients with FA based on clinical manifestations, and we performed exome sequencing of peripheral blood specimens from these patients and their family members. The related sequencing data were then analyzed by bioinformatics, and the FANC gene mutations identified by exome sequencing were confirmed by PCR re-sequencing. Results Homozygous and compound heterozygous mutations of FANC genes were identified in all of the patients. The FA subtypes of the patients included FANCA, FANCM and FANCD2. Interestingly, four FA patients harbored multiple mutations in at least two FA genes, and some of these mutations have not been previously reported. These patients’ clinical manifestations were vastly different from each other, as were their treatment responses to androstanazol and prednisone. This finding suggests that heterozygous mutation(s) in FA genes could also have diverse biological and/or pathophysiological effects on FA patients or FA gene carriers. Interestingly, we were not able to identify de novo mutations in the genes implicated in DNA repair pathways when the sequencing data of patients were compared with those of their parents. Conclusions Our results indicate that Chinese FA patients and carriers might have higher and more complex mutation rates in FANC genes than have been conventionally recognized. Testing of the fifteen FANC genes in FA patients and their family members should be a regular clinical practice to determine the optimal care for the individual patient, to counsel the family and to obtain a better understanding of FA pathophysiology
Whole exome sequencing reveals concomitant mutations of multiple FA genes in individual Fanconi anemia patients.

PubMed

Chang, Lixian; Yuan, Weiping; Zeng, Huimin; Zhou, Quanquan; Wei, Wei; Zhou, Jianfeng; Li, Miaomiao; Wang, Xiaomin; Xu, Mingjiang; Yang, Fengchun; Yang, Yungui; Cheng, Tao; Zhu, Xiaofan

2014-05-15

Fanconi anemia (FA) is a rare inherited genetic syndrome with highly variable clinical manifestations. Fifteen genetic subtypes of FA have been identified. Traditional complementation tests for grouping studies have been used generally in FA patients and in stepwise methods to identify the FA type, which can result in incomplete genetic information from FA patients. We diagnosed five pediatric patients with FA based on clinical manifestations, and we performed exome sequencing of peripheral blood specimens from these patients and their family members. The related sequencing data were then analyzed by bioinformatics, and the FANC gene mutations identified by exome sequencing were confirmed by PCR re-sequencing. Homozygous and compound heterozygous mutations of FANC genes were identified in all of the patients. The FA subtypes of the patients included FANCA, FANCM and FANCD2. Interestingly, four FA patients harbored multiple mutations in at least two FA genes, and some of these mutations have not been previously reported. These patients' clinical manifestations were vastly different from each other, as were their treatment responses to androstanazol and prednisone. This finding suggests that heterozygous mutation(s) in FA genes could also have diverse biological and/or pathophysiological effects on FA patients or FA gene carriers. Interestingly, we were not able to identify de novo mutations in the genes implicated in DNA repair pathways when the sequencing data of patients were compared with those of their parents. Our results indicate that Chinese FA patients and carriers might have higher and more complex mutation rates in FANC genes than have been conventionally recognized. Testing of the fifteen FANC genes in FA patients and their family members should be a regular clinical practice to determine the optimal care for the individual patient, to counsel the family and to obtain a better understanding of FA pathophysiology.
Comparative analysis of myostatin gene and promoter sequences of Qinchuan and Red Angus cattle.

PubMed

He, Y L; Wu, Y H; Quan, F S; Liu, Y G; Zhang, Y

2013-09-04

To better understand the function of the myostatin gene and its promoter region in bovine, we amplified and sequenced the myostatin gene and promoter from the blood of Qinchuan and Red Angus cattle by using polymerase chain reaction. The sequences of Qinchuan and Red Angus cattle were compared with those of other cattle breeds available in GenBank. Exon splice sites were confirmed by mRNA sequencing. Compared to the published sequence (GenBank accession No. AF320998), 69 single nucleotide polymorphisms (SNPs) were identified in the Qinchuan myostatin gene, only one of which was an insertion mutation in Qinchuan cattle. There was a 16-bp insertion in the first 705-bp intron in 3 Qinchuan cattle. A total of 7 SNPs were identified in exon 3, in which the mutation occurred in the third base of the codon and was synonymous. On comparing the Qinchuan myostatin gene sequence to that of Red Angus cattle, a total of 50 SNPs were identified in the first and third exons. In addition, there were 18 SNPs identified in the Qinchuan cattle promoter region compared with those of other cattle compared to the Red Angus cattle myostatin promoter region. breeds (GenBank accession No. AF348479), but only 14 SNPs when compared to the Red Angus cattle myostatin promoter region.
Sequence variants in four genes underlying Bardet-Biedl syndrome in consanguineous families

PubMed Central

Ullah, Asmat; Umair, Muhammad; Yousaf, Maryam; Khan, Sher Alam; Nazim-ud-din, Muhammad; Shah, Khadim; Ahmad, Farooq; Azeem, Zahid; Ali, Ghazanfar; Alhaddad, Bader; Rafique, Afzal; Jan, Abid; Haack, Tobias B.; Strom, Tim M.; Meitinger, Thomas; Ghous, Tahseen

2017-01-01

Purpose To investigate the molecular basis of Bardet-Biedl syndrome (BBS) in five consanguineous families of Pakistani origin. Methods Linkage in two families (A and B) was established to BBS7 on chromosome 4q27, in family C to BBS8 on chromosome 14q32.1, and in family D to BBS10 on chromosome 12q21.2. Family E was investigated directly with exome sequence analysis. Results Sanger sequencing revealed two novel mutations and three previously reported mutations in the BBS genes. These mutations include two deletions (c.580_582delGCA, c.1592_1597delTTCCAG) in the BBS7 gene, a missense mutation (p.Gln449His) in the BBS8 gene, a frameshift mutation (c.271_272insT) in the BBS10 gene, and a nonsense mutation (p.Ser40*) in the MKKS (BBS6) gene. Conclusions Two novel mutations and three previously reported variants, identified in the present study, further extend the body of evidence implicating BBS6, BBS7, BBS8, and BBS10 in causing BBS. PMID:28761321
Differentiation of Xylella fastidiosa Strains via Multilocus Sequence Analysis of Environmentally Mediated Genes (MLSA-E)

PubMed Central

Parker, Jennifer K.; Havird, Justin C.

2012-01-01

Isolates of the plant pathogen Xylella fastidiosa are genetically very similar, but studies on their biological traits have indicated differences in virulence and infection symptomatology. Taxonomic analyses have identified several subspecies, and phylogenetic analyses of housekeeping genes have shown broad host-based genetic differences; however, results are still inconclusive for genetic differentiation of isolates within subspecies. This study employs multilocus sequence analysis of environmentally mediated genes (MLSA-E; genes influenced by environmental factors) to investigate X. fastidiosa relationships and differentiate isolates with low genetic variability. Potential environmentally mediated genes, including host colonization and survival genes related to infection establishment, were identified a priori. The ratio of the rate of nonsynonymous substitutions to the rate of synonymous substitutions (dN/dS) was calculated to select genes that may be under increased positive selection compared to previously studied housekeeping genes. Nine genes were sequenced from 54 X. fastidiosa isolates infecting different host plants across the United States. Results of maximum likelihood (ML) and Bayesian phylogenetic (BP) analyses are in agreement with known X. fastidiosa subspecies clades but show novel within-subspecies differentiation, including geographic differentiation, and provide additional information regarding host-based isolate variation and specificity. dN/dS ratios of environmentally mediated genes, though <1 due to high sequence similarity, are significantly greater than housekeeping gene dN/dS ratios and correlate with increased sequence variability. MLSA-E can more precisely resolve relationships between closely related bacterial strains with low genetic variability, such as X. fastidiosa isolates. Discovering the genetic relationships between X. fastidiosa isolates will provide new insights into the epidemiology of populations of X. fastidiosa, allowing
Differentiation of Xylella fastidiosa strains via multilocus sequence analysis of environmentally mediated genes (MLSA-E).

PubMed

Parker, Jennifer K; Havird, Justin C; De La Fuente, Leonardo

2012-03-01

Isolates of the plant pathogen Xylella fastidiosa are genetically very similar, but studies on their biological traits have indicated differences in virulence and infection symptomatology. Taxonomic analyses have identified several subspecies, and phylogenetic analyses of housekeeping genes have shown broad host-based genetic differences; however, results are still inconclusive for genetic differentiation of isolates within subspecies. This study employs multilocus sequence analysis of environmentally mediated genes (MLSA-E; genes influenced by environmental factors) to investigate X. fastidiosa relationships and differentiate isolates with low genetic variability. Potential environmentally mediated genes, including host colonization and survival genes related to infection establishment, were identified a priori. The ratio of the rate of nonsynonymous substitutions to the rate of synonymous substitutions (dN/dS) was calculated to select genes that may be under increased positive selection compared to previously studied housekeeping genes. Nine genes were sequenced from 54 X. fastidiosa isolates infecting different host plants across the United States. Results of maximum likelihood (ML) and Bayesian phylogenetic (BP) analyses are in agreement with known X. fastidiosa subspecies clades but show novel within-subspecies differentiation, including geographic differentiation, and provide additional information regarding host-based isolate variation and specificity. dN/dS ratios of environmentally mediated genes, though <1 due to high sequence similarity, are significantly greater than housekeeping gene dN/dS ratios and correlate with increased sequence variability. MLSA-E can more precisely resolve relationships between closely related bacterial strains with low genetic variability, such as X. fastidiosa isolates. Discovering the genetic relationships between X. fastidiosa isolates will provide new insights into the epidemiology of populations of X. fastidiosa, allowing
alpha-Amylase gene of Streptomyces limosus: nucleotide sequence, expression motifs, and amino acid sequence homology to mammalian and invertebrate alpha-amylases.

PubMed Central

Long, C M; Virolle, M J; Chang, S Y; Chang, S; Bibb, M J

1987-01-01

The nucleotide sequence of the coding and regulatory regions of the alpha-amylase gene (aml) of Streptomyces limosus was determined. High-resolution S1 mapping was used to locate the 5' end of the transcript and demonstrated that the gene is transcribed from a unique promoter. The predicted amino acid sequence has considerable identity to mammalian and invertebrate alpha-amylases, but not to those of plant, fungal, or eubacterial origin. Consistent with this is the susceptibility of the enzyme to an inhibitor of mammalian alpha-amylases. The amino-terminal sequence of the extracellular enzyme was determined, revealing the presence of a typical signal peptide preceding the mature form of the alpha-amylase. Images PMID:3500166
Isolation and characterization of a novel wheat cysteine-rich receptor-like kinase gene induced by Rhizoctonia cerealis

NASA Astrophysics Data System (ADS)

Yang, Kun; Rong, Wei; Qi, Lin; Li, Jiarui; Wei, Xuening; Zhang, Zengyan

2013-10-01

Cysteine-rich receptor kinases (CRKs) belong to the receptor-like kinase family. Little is known about CRK genes in wheat. We isolated a wheat CRK gene TaCRK1 from Rhizoctonia cerealis-resistant wheat CI12633 based on a differentially expressed sequence identified by RNA-Sequencing (RNA-Seq) analysis. TaCRK1 was more highly expressed in CI12633 than in susceptible Wenmai 6. Transcription of TaCRK1 in wheat was induced in CI12633 after R. cerealis infection and exogenous abscisic acid (ABA) treatment. The deduced TaCRK1 protein contained a signal peptide, two DUF26 domains, a transmembrane domain, and a serine/threonine protein kinase domain. Transient expression of a green fluorescence protein fused with TaCRK1 in wheat and onion indicated that TaCRK1 may localize to plasma membranes. Characterization of TaCRK1 silencing induced by virus-mediated method in CI12633 showed that the downregulation of TaCRK1 transcript did not obviously impair resistance to R. cerealis. This study paves the way to further CRK research in wheat.
Conserved features of eukaryotic hsp70 genes revealed by comparison with the nucleotide sequence of human hsp70.

PubMed Central

Hunt, C; Morimoto, R I

1985-01-01

We have determined the nucleotide sequence of the human hsp70 gene and 5' flanking region. The hsp70 gene is transcribed as an uninterrupted primary transcript of 2440 nucleotides composed of a 5' noncoding leader sequence of 212 nucleotides, a 3' noncoding region of 242 nucleotides, and a continuous open reading frame of 1986 nucleotides that encodes a protein with predicted molecular mass of 69,800 daltons. Upstream of the 5' terminus are the canonical TATAAA box, the sequence ATTGG that corresponds in the inverted orientation to the CCAAT motif, and the dyad sequence CTGGAAT/ATTCCCG that shares homology in 12 of 14 positions with the consensus transcription regulatory sequence common to Drosophila heat shock genes. Comparison of the predicted amino acid sequences of human hsp70 with the published sequences of Drosophila hsp70 and Escherichia coli dnaK reveals that human hsp70 is 73% identical to Drosophila hsp70 and 47% identical to E. coli dnaK. Surprisingly, the nucleotide sequences of the human and Drosophila genes are 72% identical and human and E. coli genes are 50% identical, which is more highly conserved than necessary given the degeneracy of the genetic code. The lack of accumulated silent nucleotide substitutions leads us to propose that there may be additional information in the nucleotide sequence of the hsp70 gene or the corresponding mRNA that precludes the maximum divergence allowed in the silent codon positions. PMID:3931075
Development of transgenic cotton lines expressing Allium sativum agglutinin (ASAL) for enhanced resistance against major sap-sucking pests.

PubMed

Vajhala, Chakravarthy S K; Sadumpati, Vijaya Kumar; Nunna, Hariprasad Rao; Puligundla, Sateesh Kumar; Vudem, Dashavantha Reddy; Khareedu, Venkateswara Rao

2013-01-01

Mannose-specific Allium sativum leaf agglutinin encoding gene (ASAL) and herbicide tolerance gene (BAR) were introduced into an elite cotton inbred line (NC-601) employing Agrobacterium-mediated genetic transformation. Cotton transformants were produced from the phosphinothricin (PPT)-resistant shoots obtained after co-cultivation of mature embryos with the Agrobacterium strain EHA105 harbouring recombinant binary vector pCAMBIA3300-ASAL-BAR. PCR and Southern blot analysis confirmed the presence and stable integration of ASAL and BAR genes in various transformants of cotton. Basta leaf-dip assay, northern blot, western blot and ELISA analyses disclosed variable expression of BAR and ASAL transgenes in different transformants. Transgenes, ASAL and BAR, were stably inherited and showed co-segregation in T1 generation in a Mendelian fashion for both PPT tolerance and insect resistance. In planta insect bioassays on T2 and T3 homozygous ASAL-transgenic lines revealed potent entomotoxic effects of ASAL on jassid and whitefly insects, as evidenced by significant decreases in the survival, development and fecundity of the insects when compared to the untransformed controls. Furthermore, the transgenic cotton lines conferred higher levels of resistance (1-2 score) with minimal plant damage against these major sucking pests when bioassays were carried out employing standard screening techniques. The developed transgenics could serve as a potential genetic resource in recombination breeding aimed at improving the pest resistance of cotton. This study represents the first report of its kind dealing with the development of transgenic cotton resistant to two major sap-sucking insects.
Development of Transgenic Cotton Lines Expressing Allium sativum Agglutinin (ASAL) for Enhanced Resistance against Major Sap-Sucking Pests

PubMed Central

Nunna, Hariprasad Rao; Puligundla, Sateesh Kumar; Vudem, Dashavantha Reddy; Khareedu, Venkateswara Rao

2013-01-01

Mannose-specific Allium sativum leaf agglutinin encoding gene (ASAL) and herbicide tolerance gene (BAR) were introduced into an elite cotton inbred line (NC-601) employing Agrobacterium-mediated genetic transformation. Cotton transformants were produced from the phosphinothricin (PPT)-resistant shoots obtained after co-cultivation of mature embryos with the Agrobacterium strain EHA105 harbouring recombinant binary vector pCAMBIA3300-ASAL-BAR. PCR and Southern blot analysis confirmed the presence and stable integration of ASAL and BAR genes in various transformants of cotton. Basta leaf-dip assay, northern blot, western blot and ELISA analyses disclosed variable expression of BAR and ASAL transgenes in different transformants. Transgenes, ASAL and BAR, were stably inherited and showed co-segregation in T1 generation in a Mendelian fashion for both PPT tolerance and insect resistance. In planta insect bioassays on T2 and T3 homozygous ASAL-transgenic lines revealed potent entomotoxic effects of ASAL on jassid and whitefly insects, as evidenced by significant decreases in the survival, development and fecundity of the insects when compared to the untransformed controls. Furthermore, the transgenic cotton lines conferred higher levels of resistance (1–2 score) with minimal plant damage against these major sucking pests when bioassays were carried out employing standard screening techniques. The developed transgenics could serve as a potential genetic resource in recombination breeding aimed at improving the pest resistance of cotton. This study represents the first report of its kind dealing with the development of transgenic cotton resistant to two major sap-sucking insects. PMID:24023750

Molecular cloning, sequence analysis and homology modeling of the first caudata amphibian antifreeze-like protein in axolotl (Ambystoma mexicanum).

PubMed

Zhang, Songyan; Gao, Jiuxiang; Lu, Yiling; Cai, Shasha; Qiao, Xue; Wang, Yipeng; Yu, Haining

2013-08-01

Antifreeze proteins (AFPs) refer to a class of polypeptides that are produced by certain vertebrates, plants, fungi, and bacteria and which permit their survival in subzero environments. In this study, we report the molecular cloning, sequence analysis and three-dimensional structure of the axolotl antifreeze-like protein (AFLP) by homology modeling of the first caudate amphibian AFLP. We constructed a full-length spleen cDNA library of axolotl (Ambystoma mexicanum). An EST having highest similarity (∼42%) with freeze-responsive liver protein Li16 from Rana sylvatica was identified, and the full-length cDNA was subsequently obtained by RACE-PCR. The axolotl antifreeze-like protein sequence represents an open reading frame for a putative signal peptide and the mature protein composed of 93 amino acids. The calculated molecular mass and the theoretical isoelectric point (pl) of this mature protein were 10128.6 Da and 8.97, respectively. The molecular characterization of this gene and its deduced protein were further performed by detailed bioinformatics analysis. The three-dimensional structure of current AFLP was predicted by homology modeling, and the conserved residues required for functionality were identified. The homology model constructed could be of use for effective drug design. This is the first report of an antifreeze-like protein identified from a caudate amphibian.
Snake Genome Sequencing: Results and Future Prospects

PubMed Central

Kerkkamp, Harald M. I.; Kini, R. Manjunatha; Pospelov, Alexey S.; Vonk, Freek J.; Henkel, Christiaan V.; Richardson, Michael K.

2016-01-01

Snake genome sequencing is in its infancy—very much behind the progress made in sequencing the genomes of humans, model organisms and pathogens relevant to biomedical research, and agricultural species. We provide here an overview of some of the snake genome projects in progress, and discuss the biological findings, with special emphasis on toxinology, from the small number of draft snake genomes already published. We discuss the future of snake genomics, pointing out that new sequencing technologies will help overcome the problem of repetitive sequences in assembling snake genomes. Genome sequences are also likely to be valuable in examining the clustering of toxin genes on the chromosomes, in designing recombinant antivenoms and in studying the epigenetic regulation of toxin gene expression. PMID:27916957
Snake Genome Sequencing: Results and Future Prospects.

PubMed

Kerkkamp, Harald M I; Kini, R Manjunatha; Pospelov, Alexey S; Vonk, Freek J; Henkel, Christiaan V; Richardson, Michael K

2016-12-01

Snake genome sequencing is in its infancy-very much behind the progress made in sequencing the genomes of humans, model organisms and pathogens relevant to biomedical research, and agricultural species. We provide here an overview of some of the snake genome projects in progress, and discuss the biological findings, with special emphasis on toxinology, from the small number of draft snake genomes already published. We discuss the future of snake genomics, pointing out that new sequencing technologies will help overcome the problem of repetitive sequences in assembling snake genomes. Genome sequences are also likely to be valuable in examining the clustering of toxin genes on the chromosomes, in designing recombinant antivenoms and in studying the epigenetic regulation of toxin gene expression.
Envelope-like retrotransposons in the plant kingdom: evidence of their presence in gymnosperms (Pinus pinaster).

PubMed

Miguel, Célia; Simões, Marta; Oliveira, Maria Margarida; Rocheta, Margarida

2008-11-01

Retroviruses differ from retrotransposons due to their infective capacity, which depends critically on the encoded envelope. Some plant retroelements contain domains reminiscent of the env of animal retroviruses but the number of such elements described to date is restricted to angiosperms. We show here the first evidence of the presence of putative env-like gene sequences in a gymnosperm species, Pinus pinaster (maritime pine). Using a degenerate primer approach for conserved domains of RNaseH gene, three clones from putative envelope-like retrotransposons (PpRT2, PpRT3, and PpRT4) were identified. The env-like sequences of P. pinaster clones are predicted to encode proteins with transmembrane domains. These sequences showed identity scores of up to 30% with env-like sequences belonging to different organisms. A phylogenetic analysis based on protein alignment of deduced aminoacid sequences revealed that these clones clustered with env-containing plant retrotransposons, as well as with retrotransposons from invertebrate organisms. The differences found among the sequences of maritime pine clones isolated here suggest the existence of different putative classes of env-like retroelements. The identification for the first time of env-like genes in a gymnosperm species may support the ancestrality of retroviruses among plants shedding light on their role in plant evolution.
Identification of genes in anonymous DNA sequences. Annual performance report, February 1, 1991--January 31, 1992

DOE Office of Scientific and Technical Information (OSTI.GOV)

Fields, C.A.

1996-06-01

The objective of this project is the development of practical software to automate the identification of genes in anonymous DNA sequences from the human, and other higher eukaryotic genomes. A software system for automated sequence analysis, gm (gene modeler) has been designed, implemented, tested, and distributed to several dozen laboratories worldwide. A significantly faster, more robust, and more flexible version of this software, gm 2.0 has now been completed, and is being tested by operational use to analyze human cosmid sequence data. A range of efforts to further understand the features of eukaryoyic gene sequences are also underway. This progressmore » report also contains papers coming out of the project including the following: gm: a Tool for Exploratory Analysis of DNA Sequence Data; The Human THE-LTR(O) and MstII Interspersed Repeats are subfamilies of a single widely distruted highly variable repeat family; Information contents and dinucleotide compostions of plant intron sequences vary with evolutionary origin; Splicing signals in Drosophila: intron size, information content, and consensus sequences; Integration of automated sequence analysis into mapping and sequencing projects; Software for the C. elegans genome project.« less
IS1598 (IsPg4) distributed to abscess-forming strains of Porphyromonas gingivalis may enhance virulence through upregulation of nrdD-like gene expression.

PubMed

Sonoi, Norihiro; Maeda, Hiroshi; Murauchi, Toshimitsu; Yamamoto, Tadashi; Omori, Kazuhiro; Kokeguchi, Susumu; Naruishi, Koji; Takashiba, Shogo

2018-01-01

An insertion sequence, IS1598 (IsPg4) has been found in virulent strains of Porphyromonas gingivalis in a murine abscess model. The present study was performed to investigate the effects of genetic rearrangements by IS1598 on the phenotypic characteristics of the virulent strains. For this purpose, we searched for a common insertion site of IS1598 among the virulent strains. Through cloning and database search, a common insertion site was identified beside an nrdD-like gene in the virulent FDC 381, W83 and W50 strains. In this region, predicted promoters of the nrdD-like gene and IS1598 are located in tandem, and accumulation of nrdD-like gene mRNA was 5-fold higher in virulent strains (W83, W50, FDC 381) than avirulent strains (ATCC33277, SU63, SUNY1021, ESO59 without IS1598). The role of the nrdD-like gene in virulence of P. gingivalis was investigated by constructing a nrdD-deficient mutant. In the murine abscess model, the parental W83 strain produced necrotic abscesses, while the nrdD-deficient mutant had almost lost this ability. Insertion of IS1598 into the nrdD-like gene promoter region may be related to the phenotypic differences in virulence among P. gingivalis strains through upregulation of the expression of this gene.
Cloning and sequence analysis demonstrate the chromate reduction ability of a novel chromate reductase gene from Serratia sp.

PubMed

Deng, Peng; Tan, Xiaoqing; Wu, Ying; Bai, Qunhua; Jia, Yan; Xiao, Hong

2015-03-01

The ChrT gene encodes a chromate reductase enzyme which catalyzes the reduction of Cr(VI). The chromate reductase is also known as flavin mononucleotide (FMN) reductase (FMN_red). The aim of the present study was to clone the full-length ChrT DNA from Serratia sp. CQMUS2 and analyze the deduced amino acid sequence and three-dimensional structure. The putative ChrT gene fragment of Serratia sp. CQMUS2 was isolated by polymerase chain reaction (PCR), according to the known FMN_red gene sequence from Serratia sp. AS13. The flanking sequences of the ChrT gene were obtained by high efficiency TAIL-PCR, while the full-length gene of ChrT was cloned in Escherichia coli for subsequent sequencing. The nucleotide sequence of ChrT was submitted onto GenBank under the accession number, KF211434. Sequence analysis of the gene and amino acids was conducted using the Basic Local Alignment Search Tool, and open reading frame (ORF) analysis was performed using ORF Finder software. The ChrT gene was found to be an ORF of 567 bp that encodes a 188-amino acid enzyme with a calculated molecular weight of 20.4 kDa. In addition, the ChrT protein was hypothesized to be an NADPH-dependent FMN_red and a member of the flavodoxin-2 superfamily. The amino acid sequence of ChrT showed high sequence similarity to the FMN reductase genes of Klebsiella pneumonia and Raoultella ornithinolytica , which belong to the flavodoxin-2 superfamily. Furthermore, ChrT was shown to have a 85.6% similarity to the three-dimensional structure of Escherichia coli ChrR, sharing four common enzyme active sites for chromate reduction. Therefore, ChrT gene cloning and protein structure determination demonstrated the ability of the gene for chromate reduction. The results of the present study provide a basis for further studies on ChrT gene expression and protein function.
Cloning and sequence analysis demonstrate the chromate reduction ability of a novel chromate reductase gene from Serratia sp

PubMed Central

DENG, PENG; TAN, XIAOQING; WU, YING; BAI, QUNHUA; JIA, YAN; XIAO, HONG

2015-01-01

The ChrT gene encodes a chromate reductase enzyme which catalyzes the reduction of Cr(VI). The chromate reductase is also known as flavin mononucleotide (FMN) reductase (FMN_red). The aim of the present study was to clone the full-length ChrT DNA from Serratia sp. CQMUS2 and analyze the deduced amino acid sequence and three-dimensional structure. The putative ChrT gene fragment of Serratia sp. CQMUS2 was isolated by polymerase chain reaction (PCR), according to the known FMN_red gene sequence from Serratia sp. AS13. The flanking sequences of the ChrT gene were obtained by high efficiency TAIL-PCR, while the full-length gene of ChrT was cloned in Escherichia coli for subsequent sequencing. The nucleotide sequence of ChrT was submitted onto GenBank under the accession number, KF211434. Sequence analysis of the gene and amino acids was conducted using the Basic Local Alignment Search Tool, and open reading frame (ORF) analysis was performed using ORF Finder software. The ChrT gene was found to be an ORF of 567 bp that encodes a 188-amino acid enzyme with a calculated molecular weight of 20.4 kDa. In addition, the ChrT protein was hypothesized to be an NADPH-dependent FMN_red and a member of the flavodoxin-2 superfamily. The amino acid sequence of ChrT showed high sequence similarity to the FMN reductase genes of Klebsiella pneumonia and Raoultella ornithinolytica, which belong to the flavodoxin-2 superfamily. Furthermore, ChrT was shown to have a 85.6% similarity to the three-dimensional structure of Escherichia coli ChrR, sharing four common enzyme active sites for chromate reduction. Therefore, ChrT gene cloning and protein structure determination demonstrated the ability of the gene for chromate reduction. The results of the present study provide a basis for further studies on ChrT gene expression and protein function. PMID:25667630
Cloning and sequencing of a gene encoding a 21-kilodalton outer membrane protein from Bordetella avium and expression of the gene in Salmonella typhimurium.

PubMed Central

Gentry-Weeks, C R; Hultsch, A L; Kelly, S M; Keith, J M; Curtiss, R

1992-01-01

Three gene libraries of Bordetella avium 197 DNA were prepared in Escherichia coli LE392 by using the cosmid vectors pCP13 and pYA2329, a derivative of pCP13 specifying spectinomycin resistance. The cosmid libraries were screened with convalescent-phase anti-B. avium turkey sera and polyclonal rabbit antisera against B. avium 197 outer membrane proteins. One E. coli recombinant clone produced a 56-kDa protein which reacted with convalescent-phase serum from a turkey infected with B. avium 197. In addition, five E. coli recombinant clones were identified which produced B. avium outer membrane proteins with molecular masses of 21, 38, 40, 43, and 48 kDa. At least one of these E. coli clones, which encoded the 21-kDa protein, reacted with both convalescent-phase turkey sera and antibody against B. avium 197 outer membrane proteins. The gene for the 21-kDa outer membrane protein was localized by Tn5seq1 mutagenesis, and the nucleotide sequence was determined by dideoxy sequencing. DNA sequence analysis of the 21-kDa protein revealed an open reading frame of 582 bases that resulted in a predicted protein of 194 amino acids. Comparison of the predicted amino acid sequence of the gene encoding the 21-kDa outer membrane protein with protein sequences in the National Biomedical Research Foundation protein sequence data base indicated significant homology to the OmpA proteins of Shigella dysenteriae, Enterobacter aerogenes, E. coli, and Salmonella typhimurium and to Neisseria gonorrhoeae outer membrane protein III, Haemophilus influenzae protein P6, and Pseudomonas aeruginosa porin protein F. The gene (ompA) encoding the B. avium 21-kDa protein hybridized with 4.1-kb DNA fragments from EcoRI-digested, chromosomal DNA of Bordetella pertussis and Bordetella bronchiseptica and with 6.0- and 3.2-kb DNA fragments from EcoRI-digested, chromosomal DNA of B. avium and B. avium-like DNA, respectively. A 6.75-kb DNA fragment encoding the B. avium 21-kDa protein was subcloned into the
Diversity and duplication of DQB and DRB-like genes of the MHC in baleen whales (suborder: Mysticeti).

PubMed

Baker, C S; Vant, M D; Dalebout, M L; Lento, G M; O'Brien, S J; Yuhki, N

2006-05-01

The molecular diversity and phylogenetic relationships of two class II genes of the baleen whale major histocompatibility complex were investigated and compared to toothed whales and out-groups. Amplification of the DQB exon 2 provided sequences showing high within-species and between-species nucleotide diversity and uninterrupted reading frames consistent with functional class II loci found in related mammals (e.g., ruminants). Cloning of amplified products indicated gene duplication in the humpback whale and triplication in the southern right whale, with average nucleotide diversity of 5.9 and 6.3%, respectively, for alleles of each species. Significantly higher nonsynonymous divergence at sites coding for peptide binding (32% for humpback and 40% for southern right) suggested that these loci were subject to positive (overdominant) selection. A population survey of humpback whales detected 23 alleles, differing by up to 21% of their inferred amino acid sequences. Amplification of the DRB exon 2 resulted in two groups of sequences. One was most similar to the DRB3 of the cow and present in all whales screened to date, including toothed whales. The second was most similar to the DRB2 of the cow and was found only in the bowhead and right whales. Both loci showed low diversity among species and apparent loss of function or altered function including interruption of reading frames. Finally, comparison of inferred protein sequence of the DRB3-like locus suggested convergence with the DQB, perhaps resulting from intergenic conversion or recombination.
Isolation of nine gene sequences induced by silica in murine macrophages

DOE Office of Scientific and Technical Information (OSTI.GOV)

Segade, F.; Claudio, E.; Wrobel, K.

1995-03-01

Macrophage activation by silica is the initial step in the development of silicosis. To identify genes that might be involved in silica-mediated activation, RAW 264.7 mouse macrophages were treated with silica for 48 h, and a subtracted cDNA library enriched for silica-induced genes (SIG) was constructed and differently screened. Nine cDNA clones (designated SIG-12, -14, -20, -41, -61, -81, -91, and -111) were partially sequenced and compared with sequences in GenBank/EMBL databases. SIG-12, -14, and -20 corresponded to the genes for ribosomal proteins L13A, L32, and L26, respectively. SIG-61 is the mouse homologue of p21 RhoC. SIG-91 is identical tomore » the 67-kDa high-affinity laminin receptor. Four genes were not identified and are novel. All of the mRNAs corresponding to the nine cloned cDNAs were inducible by silica. Steady-state levels of mRNAs in RAW 264.7 cells treated with various macrophage activators and inducers of signal transduction pathways were determined. A complex pattern of induction and repression was found, indicating that upon phagocytosis of silica particles, many regulatory mechanisms of genes expression are simultaneously triggered. 55 refs., 4 figs., 1 tab.« less
Presence and Expression of Microbial Genes Regulating Soil Nitrogen Dynamics Along the Tanana River Successional Sequence

NASA Astrophysics Data System (ADS)

Boone, R. D.; Rogers, S. L.

2004-12-01

We report on work to assess the functional gene sequences for soil microbiota that control nitrogen cycle pathways along the successional sequence (willow, alder, poplar, white spruce, black spruce) on the Tanana River floodplain, Interior Alaska. Microbial DNA and mRNA were extracted from soils (0-10 cm depth) for amoA (ammonium monooxygenase), nifH (nitrogenase reductase), napA (nitrate reductase), and nirS and nirK (nitrite reductase) genes. Gene presence was determined by amplification of a conserved sequence of each gene employing sequence specific oligonucleotide primers and Polymerase Chain Reaction (PCR). Expression of the genes was measured via nested reverse transcriptase PCR amplification of the extracted mRNA. Amplified PCR products were visualized on agarose electrophoresis gels. All five successional stages show evidence for the presence and expression of microbial genes that regulate N fixation (free-living), nitrification, and nitrate reduction. We detected (1) nifH, napA, and nirK presence and amoA expression (mRNA production) for all five successional stages and (2) nirS and amoA presence and nifH, nirK, and napA expression for early successional stages (willow, alder, poplar). The results highlight that the existing body of previous process-level work has not sufficiently considered the microbial potential for a nitrate economy and free-living N fixation along the complete floodplain successional sequence.
Identification of a receptor-like protein kinase gene rapidly induced by abscisic acid, dehydration, high salt, and cold treatments in Arabidopsis thaliana.

PubMed Central

Hong, S W; Jon, J H; Kwak, J M; Nam, H G

1997-01-01

A cDNA clone for a receptor-like protein kinase gene (RPK1) was isolated from Arabidopsis thaliana. The clone is 1952 bp long with 1623 bp of an open reading frame encoding a peptide of 540 amino acids. The deduced peptide (RPK1) contains four distinctive domains characteristic of receptor kinases: (a) a putative amino-terminal signal sequence domain; (b) a domain with five extracellular leucine-rich repeat sequences; (c) a membrane-spanning domain; and (d) a cytoplasmic protein kinase domain that contains all of the 11 subdomains conserved among protein kinases. The RPK1 gene is expressed in flowers, stems, leaves, and roots. Expression of the RPK1 gene is induced within 1 h after treatment with abscisic acid (ABA). The gene is also rapidly induced by several environmental stresses such as dehydration, high salt, and low temperature, suggesting that the gene is involved in a general stress response. The dehydration-induced expression is not impaired in aba-1, abi1-1, abi2-1, and abi3-1 mutants, suggesting that the dehydration-induced expression of the RPK1 gene is ABA-independent. A possible role of this gene in the signal transduction pathway of ABA and the environmental stresses is discussed. PMID:9112773
The tripartite leader sequence is required for ectopic expression of HAdV-B and HAdV-E E3 CR1 genes.

PubMed

Bair, Camden R; Kotha Lakshmi Narayan, Poornima; Kajon, Adriana E

2017-05-01

The unique repertoire of genes that characterizes the early region 3 (E3) of the different species of human adenovirus (HAdV) likely contributes to their distinct pathogenic traits. The function of many E3 CR1 proteins remains unknown possibly due to unidentified intrinsic properties that make them difficult to express ectopically. This study shows that the species HAdV-B- and HAdV-E-specific E3 CR1 genes can be expressed from vectors carrying the HAdV tripartite leader (TPL) sequence but not from traditional mammalian expression vectors. Insertion of the TPL sequence upstream of the HAdV-B and HAdV-E E3 CR1 open reading frames was sufficient to rescue protein expression from pCI-neo constructs in transfected 293T cells. The detection of higher levels of HAdV-B and HAdV-E E3 CR1 transcripts suggests that the TPL sequence may enhance gene expression at both the transcriptional and translational levels. Our findings will facilitate the characterization of additional AdV E3 proteins. Copyright © 2017 Elsevier Inc. All rights reserved.
Cloning, sequencing, and expression of the gene encoding the high-molecular-weight cytochrome c from Desulfovibrio vulgaris Hildenborough.

PubMed Central

Pollock, W B; Loutfi, M; Bruschi, M; Rapp-Giles, B J; Wall, J D; Voordouw, G

1991-01-01

By using a synthetic deoxyoligonucleotide probe designed to recognize the structural gene for cytochrome cc3 from Desulfovibrio vulgaris Hildenborough, a 3.7-kb XhoI genomic DNA fragment containing the cc3 gene was isolated. The gene encodes a precursor polypeptide of 58.9 kDa, with an NH2-terminal signal sequence of 31 residues. The mature polypeptide (55.7 kDa) has 16 heme binding sites of the form C-X-X-C-H. Covalent binding of heme to these 16 sites gives a holoprotein of 65.5 kDa with properties similar to those of the high-molecular-weight cytochrome c (Hmc) isolated from the same strain by Higuchi et al. (Y. Higuchi, K. Inaka, N. Yasuoka, and T. Yagi, Biochim. Biophys. Acta 911:341-348, 1987). Since the data indicate that cytochrome cc3 and Hmc are the same protein, the gene has been named hmc. The Hmc polypeptide contains 31 histidinyl residues, 16 of which are integral to heme binding sites. Thus, only 15 of the 16 hemes can have bis-histidinyl coordination. A comparison of the arrangement of heme binding sites and coordinated histidines in the amino acid sequences of cytochrome c3 and Hmc from D. vulgaris Hildenborough suggests that the latter contains three cytochrome c3-like domains. Cloning of the D. vulgaris Hildenborough hmc gene into the broad-host-range vector pJRD215 and subsequent conjugational transfer of the recombinant plasmid into D. desulfuricans G200 led to expression of a periplasmic Hmc gene product with covalently bound hemes. Images PMID:1846136
GAVIN: Gene-Aware Variant INterpretation for medical sequencing.

PubMed

van der Velde, K Joeri; de Boer, Eddy N; van Diemen, Cleo C; Sikkema-Raddatz, Birgit; Abbott, Kristin M; Knopperts, Alain; Franke, Lude; Sijmons, Rolf H; de Koning, Tom J; Wijmenga, Cisca; Sinke, Richard J; Swertz, Morris A

2017-01-16

We present Gene-Aware Variant INterpretation (GAVIN), a new method that accurately classifies variants for clinical diagnostic purposes. Classifications are based on gene-specific calibrations of allele frequencies from the ExAC database, likely variant impact using SnpEff, and estimated deleteriousness based on CADD scores for >3000 genes. In a benchmark on 18 clinical gene sets, we achieve a sensitivity of 91.4% and a specificity of 76.9%. This accuracy is unmatched by 12 other tools. We provide GAVIN as an online MOLGENIS service to annotate VCF files and as an open source executable for use in bioinformatic pipelines. It can be found at http://molgenis.org/gavin .
Targeted Sequencing of Venom Genes from Cone Snail Genomes Improves Understanding of Conotoxin Molecular Evolution

PubMed Central

Mahardika, Gusti N

2018-01-01

Abstract To expand our capacity to discover venom sequences from the genomes of venomous organisms, we applied targeted sequencing techniques to selectively recover venom gene superfamilies and nontoxin loci from the genomes of 32 cone snail species (family, Conidae), a diverse group of marine gastropods that capture their prey using a cocktail of neurotoxic peptides (conotoxins). We were able to successfully recover conotoxin gene superfamilies across all species with high confidence (> 100× coverage) and used these data to provide new insights into conotoxin evolution. First, we found that conotoxin gene superfamilies are composed of one to six exons and are typically short in length (mean = ∼85 bp). Second, we expanded our understanding of the following genetic features of conotoxin evolution: 1) positive selection, where exons coding the mature toxin region were often three times more divergent than their adjacent noncoding regions, 2) expression regulation, with comparisons to transcriptome data showing that cone snails only express a fraction of the genes available in their genome (24–63%), and 3) extensive gene turnover, where Conidae species varied from 120 to 859 conotoxin gene copies. Finally, using comparative phylogenetic methods, we found that while diet specificity did not predict patterns of conotoxin evolution, dietary breadth was positively correlated with total conotoxin gene diversity. Overall, the targeted sequencing technique demonstrated here has the potential to radically increase the pace at which venom gene families are sequenced and studied, reshaping our ability to understand the impact of genetic changes on ecologically relevant phenotypes and subsequent diversification. PMID:29514313
CRISPR/Cas9-mediated gene knockout screens and target identification via whole-genome sequencing uncover host genes required for picornavirus infection.

PubMed

Kim, Heon Seok; Lee, Kyungjin; Bae, Sangsu; Park, Jeongbin; Lee, Chong-Kyo; Kim, Meehyein; Kim, Eunji; Kim, Minju; Kim, Seokjoong; Kim, Chonsaeng; Kim, Jin-Soo

2017-06-23

Several groups have used genome-wide libraries of lentiviruses encoding small guide RNAs (sgRNAs) for genetic screens. In most cases, sgRNA expression cassettes are integrated into cells by using lentiviruses, and target genes are statistically estimated by the readout of sgRNA sequences after targeted sequencing. We present a new virus-free method for human gene knockout screens using a genome-wide library of CRISPR/Cas9 sgRNAs based on plasmids and target gene identification via whole-genome sequencing (WGS) confirmation of authentic mutations rather than statistical estimation through targeted amplicon sequencing. We used 30,840 pairs of individually synthesized oligonucleotides to construct the genome-scale sgRNA library, collectively targeting 10,280 human genes ( i.e. three sgRNAs per gene). These plasmid libraries were co-transfected with a Cas9-expression plasmid into human cells, which were then treated with cytotoxic drugs or viruses. Only cells lacking key factors essential for cytotoxic drug metabolism or viral infection were able to survive. Genomic DNA isolated from cells that survived these challenges was subjected to WGS to directly identify CRISPR/Cas9-mediated causal mutations essential for cell survival. With this approach, we were able to identify known and novel genes essential for viral infection in human cells. We propose that genome-wide sgRNA screens based on plasmids coupled with WGS are powerful tools for forward genetics studies and drug target discovery. © 2017 by The American Society for Biochemistry and Molecular Biology, Inc.
Molecular phylogeny of some avian species using Cytochrome b gene sequence analysis

PubMed Central

Awad, A; Khalil, S. R; Abd-Elhakim, Y. M

2015-01-01

Veritable identification and differentiation of avian species is a vital step in conservative, taxonomic, forensic, legal and other ornithological interventions. Therefore, this study involved the application of molecular approach to identify some avian species i.e. Chicken (Gallus gallus), Muskovy duck (Cairina moschata), Japanese quail (Coturnix japonica), Laughing dove (Streptopelia senegalensis), and Rock pigeon (Columba livia). Genomic DNA was extracted from blood samples and partial sequence of the mitochondrial cytochrome b gene (358 bp) was amplified and sequenced using universal primers. Sequences alignment and phylogenetic analyses were performed by CLC main workbench program. The obtained five sequences were deposited in GenBank and compared with those previously registered in GenBank. The similarity percentage was 88.60% between Gallus gallus and Coturnix japonica and 80.46% between Gallus gallus and Columba livia. The percentage of identity between the studied species and GenBank species ranged from 77.20% (Columba oenas and Anas platyrhynchos) to 100% (Gallus gallus and Gallus sonneratii, Coturnix coturnix and Coturnix japonica, Meleagris gallopavo and Columba livia). Amplification of the partial sequence of mitochondrial cytochrome b gene proved to be practical for identification of an avian species unambiguously. PMID:27175180
tRNADB-CE: tRNA gene database well-timed in the era of big sequence data.

PubMed

Abe, Takashi; Inokuchi, Hachiro; Yamada, Yuko; Muto, Akira; Iwasaki, Yuki; Ikemura, Toshimichi

2014-01-01

The tRNA gene data base curated by experts "tRNADB-CE" (http://trna.ie.niigata-u.ac.jp) was constructed by analyzing 1,966 complete and 5,272 draft genomes of prokaryotes, 171 viruses', 121 chloroplasts', and 12 eukaryotes' genomes plus fragment sequences obtained by metagenome studies of environmental samples. 595,115 tRNA genes in total, and thus two times of genes compiled previously, have been registered, for which sequence, clover-leaf structure, and results of sequence-similarity and oligonucleotide-pattern searches can be browsed. To provide collective knowledge with help from experts in tRNA researches, we added a column for enregistering comments to each tRNA. By grouping bacterial tRNAs with an identical sequence, we have found high phylogenetic preservation of tRNA sequences, especially at the phylum level. Since many species-unknown tRNAs from metagenomic sequences have sequences identical to those found in species-known prokaryotes, the identical sequence group (ISG) can provide phylogenetic markers to investigate the microbial community in an environmental ecosystem. This strategy can be applied to a huge amount of short sequences obtained from next-generation sequencers, as showing that tRNADB-CE is a well-timed database in the era of big sequence data. It is also discussed that batch-learning self-organizing-map with oligonucleotide composition is useful for efficient knowledge discovery from big sequence data.

Ubiquitous and gene-specific regulatory 5' sequences in a sea urchin histone DNA clone coding for histone protein variants.

PubMed Central

Busslinger, M; Portmann, R; Irminger, J C; Birnstiel, M L

1980-01-01

The DNA sequences of the entire structural H4, H3, H2A and H2B genes and of their 5' flanking regions have been determined in the histone DNA clone h19 of the sea urchin Psammechinus miliaris. In clone h19 the polarity of transcription and the relative arrangement of the histone genes is identical to that in clone h22 of the same species. The histone proteins encoded by h19 DNA differ in their primary structure from those encoded by clone h22 and have been compared to histone protein sequences of other sea urchin species as well as other eukaryotes. A comparative analysis of the 5' flanking DNA sequences of the structural histone genes in both clones revealed four ubiquitous sequence motifs; a pentameric element GATCC, followed at short distance by the Hogness box GTATAAATAG, a conserved sequence PyCATTCPu, in or near which the 5' ends of the mRNAs map in h22 DNA and lastly a sequence A, containing the initiation codon. These sequences are also found, sometimes in modified version, in front of other eukaryotic genes transcribed by polymerase II. When prelude sequences of isocoding histone genes in clone h19 and h22 are compared areas of homology are seen to extend beyond the ubiquitous sequence motifs towards the divergent AT-rich spacer and terminate between approximately 140 and 240 nucleotides away from the structural gene. These prelude regions contain quite large conservative sequence blocks which are specific for each type of histone genes. Images PMID:7443547
Cloning and characterization of the major histone H2A genes completes the cloning and sequencing of known histone genes of Tetrahymena thermophila.

PubMed Central

Liu, X; Gorovsky, M A

1996-01-01

A truncated cDNA clone encoding Tetrahymena thermophila histone H2A2 was isolated using synthetic degenerate oligonucleotide probes derived from H2A protein sequences of Tetrahymena pyriformis. The cDNA clone was used as a homologous probe to isolate a truncated genomic clone encoding H2A1. The remaining regions of the genes for H2A1 (HTA1) and H2A2 (HTA2) were then isolated using inverse PCR on circularized genomic DNA fragments. These partial clones were assembled into intact HTA1 and HTA2 clones. Nucleotide sequences of the two genes were highly homologous within the coding region but not in the noncoding regions. Comparison of the deduced amino acid sequences with protein sequences of T. pyriformis H2As showed only two and three differences respectively, in a total of 137 amino acids for H2A1, and 132 amino acids for H2A2, indicating the two genes arose before the divergence of these two species. The HTA2 gene contains a TAA triplet within the coding region, encoding a glutamine residue. In contrast with the T. thermophila HHO and HTA3 genes, no introns were identified within the two genes. The 5'- and 3'-ends of the histone H2A mRNAs; were determined by RNase protection and by PCR mapping using RACE and RLM-RACE methods. Both genes encode polyadenylated mRNAs and are highly expressed in vegetatively growing cells but only weakly expressed in starved cultures. With the inclusion of these two genes, T. thermophila is the first organism whose entire complement of known core and linker histones, including replication-dependent and basal variants, has been cloned and sequenced. PMID:8760889
CLINICAL PROGRESS IN INHERITED RETINAL DEGENERATIONS: GENE THERAPY CLINICAL TRIALS AND ADVANCES IN GENETIC SEQUENCING

PubMed Central

HAFLER, BRIAN P.

2017-01-01

Purpose Inherited retinal dystrophies are a significant cause of vision loss and are characterized by the loss of photoreceptors and the retinal pigment epithelium (RPE). Mutations in approximately 250 genes cause inherited retinal degenerations with a high degree of genetic heterogeneity. New techniques in next-generation sequencing are allowing the comprehensive analysis of all retinal disease genes thus changing the approach to the molecular diagnosis of inherited retinal dystrophies. This review serves to analyze clinical progress in genetic diagnostic testing and implications for retinal gene therapy. Methods A literature search of PubMed and OMIM was conducted to relevant articles in inherited retinal dystrophies. Results Next-generation genetic sequencing allows the simultaneous analysis of all the approximately 250 genes that cause inherited retinal dystrophies. Reported diagnostic rates range are high and range from 51% to 57%. These new sequencing tools are highly accurate with sensitivities of 97.9% and specificities of 100%. Retinal gene therapy clinical trials are underway for multiple genes including RPE65, ABCA4, CHM, RS1, MYO7A, CNGA3, CNGB3, ND4, and MERTK for which a molecular diagnosis may be beneficial for patients. Conclusion Comprehensive next-generation genetic sequencing of all retinal dystrophy genes is changing the paradigm for how retinal specialists perform genetic testing for inherited retinal degenerations. Not only are high diagnostic yields obtained, but mutations in genes with novel clinical phenotypes are also identified. In the era of retinal gene therapy clinical trials, identifying specific genetic defects will increasingly be of use to identify patients who may enroll in clinical studies and benefit from novel therapies. PMID:27753762
Isoform-level gene expression patterns in single-cell RNA-sequencing data.

PubMed

Vu, Trung Nghia; Wills, Quin F; Kalari, Krishna R; Niu, Nifang; Wang, Liewei; Pawitan, Yudi; Rantalainen, Mattias

2018-02-27

RNA sequencing of single cells enables characterization of transcriptional heterogeneity in seemingly homogeneous cell populations. Single-cell sequencing has been applied in a wide range of researches fields. However, few studies have focus on characterization of isoform-level expression patterns at the single-cell level. In this study we propose and apply a novel method, ISOform-Patterns (ISOP), based on mixture modeling, to characterize the expression patterns of isoform pairs from the same gene in single-cell isoform-level expression data. We define six principal patterns of isoform expression relationships and describe a method for differential-pattern analysis. We demonstrate ISOP through analysis of single-cell RNA-sequencing data from a breast cancer cell line, with replication in three independent datasets. We assigned the pattern types to each of 16,562 isoform-pairs from 4,929 genes. Among those, 26% of the discovered patterns were significant (p<0.05), while remaining patterns are possibly effects of transcriptional bursting, drop-out and stochastic biological heterogeneity. Furthermore, 32% of genes discovered through differential-pattern analysis were not detected by differential-expression analysis. The effect of drop-out events, mean expression level, and properties of the expression distribution on the performances of ISOP were also investigated through simulated datasets. To conclude, ISOP provides a novel approach for characterization of isoformlevel preference, commitment and heterogeneity in single-cell RNA-sequencing data. The ISOP method has been implemented as a R package and is available at https://github.com/nghiavtr/ISOP under a GPL-3 license. mattias.rantalainen@ki.se. Supplementary data are available at Bioinformatics online.
Identification of Mycobacterium spp. of veterinary importance using rpoB gene sequencing

PubMed Central

2011-01-01

Background Studies conducted on Mycobacterium spp. isolated from human patients indicate that sequencing of a 711 bp portion of the rpoB gene can be useful in assigning a species identity, particularly for members of the Mycobacterium avium complex (MAC). Given that MAC are important pathogens in livestock, companion animals, and zoo/exotic animals, we were interested in evaluating the use of rpoB sequencing for identification of Mycobacterium isolates of veterinary origin. Results A total of 386 isolates, collected over 2008 - June 2011 from 378 animals (amphibians, reptiles, birds, and mammals) underwent PCR and sequencing of a ~ 711 bp portion of the rpoB gene; 310 isolates (80%) were identified to the species level based on similarity at ≥ 98% with a reference sequence. The remaining 76 isolates (20%) displayed < 98% similarity with reference sequences and were assigned to a clade based on their location in a neighbor-joining tree containing reference sequences. For a subset of 236 isolates that received both 16S rRNA and rpoB sequencing, 167 (70%) displayed a similar species/clade assignation for both sequencing methods. For the remaining 69 isolates, species/clade identities were different with each sequencing method. Mycobacterium avium subsp. hominissuis was the species most frequently isolated from specimens from pigs, cervids, companion animals, cattle, and exotic/zoo animals. Conclusions rpoB sequencing proved useful in identifying Mycobacterium isolates of veterinary origin to clade, species, or subspecies levels, particularly for assemblages (such as the MAC) where 16S rRNA sequencing alone is not adequate to demarcate these taxa. rpoB sequencing can represent a cost-effective identification tool suitable for routine use in the veterinary diagnostic laboratory. PMID:22118247
Urtica dioica agglutinin. A superantigenic lectin from stinging nettle rhizome.

PubMed

Galelli, A; Truffa-Bachi, P

1993-08-15

Urtica dioica agglutinin (UDA) is an unusual plant lectin that differs from all other known plant lectins with respect to its molecular structure and its extremely low specific agglutination activity. We recently reported that this small lectin (8.5 kDa) is a T cell mitogen distinguishable from classical T cell lectin mitogens by its ability to discriminate a particular population of CD4+ and CD8+ T cells as well as its capacity to induce an original pattern of T cell activation and cytokine production. The mechanism by which UDA activates T cells was investigated and compared with the conventional T cell mitogen Con A and the known superantigen staphylococcal enterotoxin B. Our data show that T cell proliferation induced by UDA is strictly dependent on AC expressing MHC class II molecules but is not MHC restricted. This proliferation can be partially inhibited by anti-I-A or anti-I-E mAb and completely blocked by a mAb recognizing monomorphic determinants on the Ia molecule. UDA indeed binds to specific carbohydrate structures present on class II molecules. UDA-induced T cell stimulation is dependent on TCR recognition of the unprocessed intact molecule in association with various Ia molecules. T cell response to UDA is clonally expressed and correlates with particular TCR V beta gene families usage. This stimulation leads to a sixfold enrichment of V beta 8.3+ T cells within 3 days. Therefore, UDA appears to use the same molecular mechanism as structurally unrelated bacterial or retroviral superantigens and we propose that this lectin is a superantigen. UDA, which is not a pathogenicity factor, could provide a useful probe for the analysis of T cell activation by superantigens.
Isolation and characterization of lymphocyte-like cells from a lamprey

PubMed Central

Mayer, Werner E.; Uinuk-ool, Tatiana; Tichy, Herbert; Gartland, Lanier A.; Klein, Jan; Cooper, Max D.

2002-01-01

Lymphocyte-like cells in the intestine of the sea lamprey, Petromyzon marinus, were isolated by flow cytometry under light-scatter conditions used for the purification of mouse intestinal lymphocytes. The purified lamprey cells were morphologically indistinguishable from mammalian lymphocytes. A cDNA library was prepared from the lamprey lymphocyte-like cells, and more than 8,000 randomly selected clones were sequenced. Homology searches comparing these ESTs with sequences deposited in the databases led to the identification of numerous genes homologous to those predominantly or characteristically expressed in mammalian lymphocytes, which included genes controlling lymphopoiesis, intracellular signaling, proliferation, migration, and involvement of lymphocytes in innate immune responses. Genes closely related to those that in gnathostomes control antigen processing and transport of antigenic peptides could be ascertained, although no sequences with significant similarity to MHC, T cell receptor, or Ig genes were found. The data suggest that the evolution of lymphocytes in the lamprey has reached a stage poised for the emergence of adaptive immunity. PMID:12388781
Clinical characterization and diagnosis of cystic fibrosis through exome sequencing in Chinese infants with Bartter-syndrome-like hypokalemia alkalosis.

PubMed

Qiu, Liru; Yang, Fengjie; He, Yonghua; Yuan, Huiqing; Zhou, Jianhua

2018-03-09

Cystic fibrosis (CF) is a fatal autosomal-recessive disease caused by mutations in the CF transmembrane conductance regulator (CFTR) gene. CF is characterized by recurrent pulmonary infection with obstructive pulmonary disease. CF is common in the Caucasian population but is rare in the Chinese population. The symptoms of early-stage CF are often untypical and may sometimes manifest as Bartter syndrome (BS)-like hypokalemic alkalosis. Therefore, the ability of doctors to differentiate CF from BS-like hypokalemic alkalosis in Chinese infants is a great challenge in the timely and accurate diagnosis of CF. In China, sporadic CF has not been diagnosed in children younger than three years of age to date. Three infants, who were initially admitted to our hospital over the period of June 2013 to September 2014 with BS-like hypokalemic alkalosis, were diagnosed with CF through exome sequencing and sweat chloride measurement. The compound heterozygous mutations of the CFTR gene were detected in two infants, and a homozygous missense mutation was found in one infant. Among the six identified mutations, two are novel point mutations (c.1526G > C and c.3062C > T) that are possibly pathogenic. The three infants are the youngest Chinese patients to have been diagnosed with sporadic CF at a very early stage. Follow-up examination showed that all of the cases remained symptom-free after early intervention, indicating the potential benefit of very early diagnosis and timely intervention in children with CF. Our results demonstrate the necessity of distinguishing CF from BS in Chinese infants with hypokalemic alkalosis and the significant diagnostic value of powerful exome sequencing for rare genetic diseases. Furthermore, our findings expand the CFTR mutation spectrum associated with CF.
[Sequence analysis of LEAFY homologous gene from Dendrobium moniliforme and application for identification of medicinal Dendrobium].

PubMed

Xing, Wen-Rui; Hou, Bei-Wei; Guan, Jing-Jiao; Luo, Jing; Ding, Xiao-Yu

2013-04-01

The LEAFY (LFY) homologous gene of Dendrobium moniliforme (L.) Sw. was cloned by new primers which were designed based on the conservative region of known sequences of orchid LEAFY gene. Partial LFY homologous gene was cloned by common PCR, then we got the complete LFY homologous gene Den LFY by Tail-PCR. The complete sequence of DenLFY gene was 3 575 bp which contained three exons and two introns. Using BLAST method, comparison analysis among the exon of LFY homologous gene indicted that the DenLFY gene had high identity with orchids LFY homologous, including the related fragment of PhalLFY (84%) in Phalaenopsis hybrid cultivar, LFY homologous gene in Oncidium (90%) and in other orchid (over 80%). Using MP analysis, Dendrobium is found to be the sister to Oncidium and Phalaenopsis. Homologous analysis demonstrated that the C-terminal amino acids were highly conserved. When the exons and introns were separately considered, exons and the sequence of amino acid were good markers for the function research of DenLFY gene. The second intron can be used in authentication research of Dendrobium based on the length polymorphism between Dendrobium moniliforme and Dendrobium officinale.
Characterization of fusion genes and the significantly expressed fusion isoforms in breast cancer by hybrid sequencing.

PubMed

Weirather, Jason L; Afshar, Pegah Tootoonchi; Clark, Tyson A; Tseng, Elizabeth; Powers, Linda S; Underwood, Jason G; Zabner, Joseph; Korlach, Jonas; Wong, Wing Hung; Au, Kin Fai

2015-10-15

We developed an innovative hybrid sequencing approach, IDP-fusion, to detect fusion genes, determine fusion sites and identify and quantify fusion isoforms. IDP-fusion is the first method to study gene fusion events by integrating Third Generation Sequencing long reads and Second Generation Sequencing short reads. We applied IDP-fusion to PacBio data and Illumina data from the MCF-7 breast cancer cells. Compared with the existing tools, IDP-fusion detects fusion genes at higher precision and a very low false positive rate. The results show that IDP-fusion will be useful for unraveling the complexity of multiple fusion splices and fusion isoforms within tumorigenesis-relevant fusion genes. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
Comprehensive sequence analysis of nine Usher syndrome genes in the UK National Collaborative Usher Study

PubMed Central

Le Quesne Stabej, Polona; Saihan, Zubin; Rangesh, Nell; Steele-Stallard, Heather B; Ambrose, John; Coffey, Alison; Emmerson, Jenny; Haralambous, Elene; Hughes, Yasmin; Steel, Karen P; Luxon, Linda M; Webster, Andrew R

2011-01-01

Background Usher syndrome (USH) is an autosomal recessive disorder comprising retinitis pigmentosa, hearing loss and, in some cases, vestibular dysfunction. It is clinically and genetically heterogeneous with three distinctive clinical types (I–III) and nine Usher genes identified. This study is a comprehensive clinical and genetic analysis of 172 Usher patients and evaluates the contribution of digenic inheritance. Methods The genes MYO7A, USH1C, CDH23, PCDH15, USH1G, USH2A, GPR98, WHRN, CLRN1 and the candidate gene SLC4A7 were sequenced in 172 UK Usher patients, regardless of clinical type. Results No subject had definite mutations (nonsense, frameshift or consensus splice site mutations) in two different USH genes. Novel missense variants were classified UV1-4 (unclassified variant): UV4 is ‘probably pathogenic’, based on control frequency <0.23%, identification in trans to a pathogenic/probably pathogenic mutation and segregation with USH in only one family; and UV3 (‘likely pathogenic’) as above, but no information on phase. Overall 79% of identified pathogenic/UV4/UV3 variants were truncating and 21% were missense changes. MYO7A accounted for 53.2%, and USH1C for 14.9% of USH1 families (USH1C:c.496+1G>A being the most common USH1 mutation in the cohort). USH2A was responsible for 79.3% of USH2 families and GPR98 for only 6.6%. No mutations were found in USH1G, WHRN or SLC4A7. Conclusions One or two pathogenic/likely pathogenic variants were identified in 86% of cases. No convincing cases of digenic inheritance were found. It is concluded that digenic inheritance does not make a significant contribution to Usher syndrome; the observation of multiple variants in different genes is likely to reflect polymorphic variation, rather than digenic effects. PMID:22135276
The complete genome sequences of poxviruses isolated from a penguin and a pigeon in South Africa and comparison to other sequenced avipoxviruses.

PubMed

Offerman, Kristy; Carulei, Olivia; van der Walt, Anelda Philine; Douglass, Nicola; Williamson, Anna-Lise

2014-06-12

Two novel avipoxviruses from South Africa have been sequenced, one from a Feral Pigeon (Columba livia) (FeP2) and the other from an African penguin (Spheniscus demersus) (PEPV). We present a purpose-designed bioinformatics pipeline for analysis of next generation sequence data of avian poxviruses and compare the different avipoxviruses sequenced to date with specific emphasis on their evolution and gene content. The FeP2 (282 kbp) and PEPV (306 kbp) genomes encode 271 and 284 open reading frames respectively and are more closely related to one another (94.4%) than to either fowlpox virus (FWPV) (85.3% and 84.0% respectively) or Canarypox virus (CNPV) (62.0% and 63.4% respectively). Overall, FeP2, PEPV and FWPV have syntenic gene arrangements; however, major differences exist throughout their genomes. The most striking difference between FeP2 and the FWPV-like avipoxviruses is a large deletion of ~16 kbp from the central region of the genome of FeP2 deleting a cc-chemokine-like gene, two Variola virus B22R orthologues, an N1R/p28-like gene and a V-type Ig domain family gene. FeP2 and PEPV both encode orthologues of vaccinia virus C7L and Interleukin 10. PEPV contains a 77 amino acid long orthologue of Ubiquitin sharing 97% amino acid identity to human ubiquitin. The genome sequences of FeP2 and PEPV have greatly added to the limited repository of genomic information available for the Avipoxvirus genus. In the comparison of FeP2 and PEPV to existing sequences, FWPV and CNPV, we have established insights into African avipoxvirus evolution. Our data supports the independent evolution of these South African avipoxviruses from a common ancestral virus to FWPV and CNPV.
Nucleotide sequence analysis of the L gene of Newcastle disease virus: homologies with Sendai and vesicular stomatitis viruses.

PubMed Central

Yusoff, K; Millar, N S; Chambers, P; Emmerson, P T

1987-01-01

The nucleotide sequence of the L gene of the Beaudette C strain of Newcastle disease virus (NDV) has been determined. The L gene is 6704 nucleotides long and encodes a protein of 2204 amino acids with a calculated molecular weight of 248822. Mung bean nuclease mapping of the 5' terminus of the L gene mRNA indicates that the transcription of the L gene is initiated 11 nucleotides upstream of the translational start site. Comparison with the amino acid sequences of the L genes of Sendai virus and vesicular stomatitis virus (VSV) suggests that there are several regions of homology between the sequences. These data provide further evidence for an evolutionary relationship between the Paramyxoviridae and the Rhabdoviridae. A non-coding sequence of 46 nucleotides downstream of the presumed polyadenylation site of the L gene may be part of a negative strand leader RNA. Images PMID:3035486
Gene identification and analysis of transcripts differentially regulated in fracture healing by EST sequencing in the domestic sheep.

PubMed

Hecht, Jochen; Kuhl, Heiner; Haas, Stefan A; Bauer, Sebastian; Poustka, Albert J; Lienau, Jasmin; Schell, Hanna; Stiege, Asita C; Seitz, Volkhard; Reinhardt, Richard; Duda, Georg N; Mundlos, Stefan; Robinson, Peter N

2006-07-05

The sheep is an important model animal for testing novel fracture treatments and other medical applications. Despite these medical uses and the well known economic and cultural importance of the sheep, relatively little research has been performed into sheep genetics, and DNA sequences are available for only a small number of sheep genes. In this work we have sequenced over 47 thousand expressed sequence tags (ESTs) from libraries developed from healing bone in a sheep model of fracture healing. These ESTs were clustered with the previously available 10 thousand sheep ESTs to a total of 19087 contigs with an average length of 603 nucleotides. We used the newly identified sequences to develop RT-PCR assays for 78 sheep genes and measured differential expression during the course of fracture healing between days 7 and 42 postfracture. All genes showed significant shifts at one or more time points. 23 of the genes were differentially expressed between postfracture days 7 and 10, which could reflect an important role for these genes for the initiation of osteogenesis. The sequences we have identified in this work are a valuable resource for future studies on musculoskeletal healing and regeneration using sheep and represent an important head-start for genomic sequencing projects for Ovis aries, with partial or complete sequences being made available for over 5,800 previously unsequenced sheep genes.
On the Sequence-Directed Nature of Human Gene Mutation: The Role of Genomic Architecture and the Local DNA Sequence Environment in Mediating Gene Mutations Underlying Human Inherited Disease

PubMed Central

Cooper, David N.; Bacolla, Albino; Férec, Claude; Vasquez, Karen M.; Kehrer-Sawatzki, Hildegard; Chen, Jian-Min

2011-01-01

Different types of human gene mutation may vary in size, from structural variants (SVs) to single base-pair substitutions, but what they all have in common is that their nature, size and location are often determined either by specific characteristics of the local DNA sequence environment or by higher-order features of the genomic architecture. The human genome is now recognized to contain ‘pervasive architectural flaws’ in that certain DNA sequences are inherently mutation-prone by virtue of their base composition, sequence repetitivity and/or epigenetic modification. Here we explore how the nature, location and frequency of different types of mutation causing inherited disease are shaped in large part, and often in remarkably predictable ways, by the local DNA sequence environment. The mutability of a given gene or genomic region may also be influenced indirectly by a variety of non-canonical (non-B) secondary structures whose formation is facilitated by the underlying DNA sequence. Since these non-B DNA structures can interfere with subsequent DNA replication and repair, and may serve to increase mutation frequencies in generalized fashion (i.e. both in the context of subtle mutations and SVs), they have the potential to serve as a unifying concept in studies of mutational mechanisms underlying human inherited disease. PMID:21853507
Root parasitic plant Orobanche aegyptiaca and shoot parasitic plant Cuscuta australis obtained Brassicaceae-specific strictosidine synthase-like genes by horizontal gene transfer

PubMed Central

2014-01-01

Background Besides gene duplication and de novo gene generation, horizontal gene transfer (HGT) is another important way of acquiring new genes. HGT may endow the recipients with novel phenotypic traits that are important for species evolution and adaption to new ecological niches. Parasitic systems expectedly allow the occurrence of HGT at relatively high frequencies due to their long-term physical contact. In plants, a number of HGT events have been reported between the organelles of parasites and the hosts, but HGT between host and parasite nuclear genomes has rarely been found. Results A thorough transcriptome screening revealed that a strictosidine synthase-like (SSL) gene in the root parasitic plant Orobanche aegyptiaca and the shoot parasitic plant Cuscuta australis showed much higher sequence similarities with those in Brassicaceae than with those in their close relatives, suggesting independent gene horizontal transfer events from Brassicaceae to these parasites. These findings were strongly supported by phylogenetic analysis and their identical unique amino acid residues and deletions. Intriguingly, the nucleus-located SSL genes in Brassicaceae belonged to a new member of SSL gene family, which were originated from gene duplication. The presence of introns indicated that the transfer occurred directly by DNA integration in both parasites. Furthermore, positive selection was detected in the foreign SSL gene in O. aegyptiaca but not in C. australis. The expression of the foreign SSL genes in these two parasitic plants was detected in multiple development stages and tissues, and the foreign SSL gene was induced after wounding treatment in C. australis stems. These data imply that the foreign genes may still retain certain functions in the recipient species. Conclusions Our study strongly supports that parasitic plants can gain novel nuclear genes from distantly related host species by HGT and the foreign genes may execute certain functions in the new hosts
Root parasitic plant Orobanche aegyptiaca and shoot parasitic plant Cuscuta australis obtained Brassicaceae-specific strictosidine synthase-like genes by horizontal gene transfer.

PubMed

Zhang, Dale; Qi, Jinfeng; Yue, Jipei; Huang, Jinling; Sun, Ting; Li, Suoping; Wen, Jian-Fan; Hettenhausen, Christian; Wu, Jinsong; Wang, Lei; Zhuang, Huifu; Wu, Jianqiang; Sun, Guiling

2014-01-13

Besides gene duplication and de novo gene generation, horizontal gene transfer (HGT) is another important way of acquiring new genes. HGT may endow the recipients with novel phenotypic traits that are important for species evolution and adaption to new ecological niches. Parasitic systems expectedly allow the occurrence of HGT at relatively high frequencies due to their long-term physical contact. In plants, a number of HGT events have been reported between the organelles of parasites and the hosts, but HGT between host and parasite nuclear genomes has rarely been found. A thorough transcriptome screening revealed that a strictosidine synthase-like (SSL) gene in the root parasitic plant Orobanche aegyptiaca and the shoot parasitic plant Cuscuta australis showed much higher sequence similarities with those in Brassicaceae than with those in their close relatives, suggesting independent gene horizontal transfer events from Brassicaceae to these parasites. These findings were strongly supported by phylogenetic analysis and their identical unique amino acid residues and deletions. Intriguingly, the nucleus-located SSL genes in Brassicaceae belonged to a new member of SSL gene family, which were originated from gene duplication. The presence of introns indicated that the transfer occurred directly by DNA integration in both parasites. Furthermore, positive selection was detected in the foreign SSL gene in O. aegyptiaca but not in C. australis. The expression of the foreign SSL genes in these two parasitic plants was detected in multiple development stages and tissues, and the foreign SSL gene was induced after wounding treatment in C. australis stems. These data imply that the foreign genes may still retain certain functions in the recipient species. Our study strongly supports that parasitic plants can gain novel nuclear genes from distantly related host species by HGT and the foreign genes may execute certain functions in the new hosts.
Digital Gene Expression Analysis Based on De Novo Transcriptome Assembly Reveals New Genes Associated with Floral Organ Differentiation of the Orchid Plant Cymbidium ensifolium

PubMed Central

Yang, Fengxi; Zhu, Genfa

2015-01-01

Cymbidium ensifolium belongs to the genus Cymbidium of the orchid family. Owing to its spectacular flower morphology, C. ensifolium has considerable ecological and cultural value. However, limited genetic data is available for this non-model plant, and the molecular mechanism underlying floral organ identity is still poorly understood. In this study, we characterize the floral transcriptome of C. ensifolium and present, for the first time, extensive sequence and transcript abundance data of individual floral organs. After sequencing, over 10 Gb clean sequence data were generated and assembled into 111,892 unigenes with an average length of 932.03 base pairs, including 1,227 clusters and 110,665 singletons. Assembled sequences were annotated with gene descriptions, gene ontology, clusters of orthologous group terms, the Kyoto Encyclopedia of Genes and Genomes, and the plant transcription factor database. From these annotations, 131 flowering-associated unigenes, 61 CONSTANS-LIKE (COL) unigenes and 90 floral homeotic genes were identified. In addition, four digital gene expression libraries were constructed for the sepal, petal, labellum and gynostemium, and 1,058 genes corresponding to individual floral organ development were identified. Among them, eight MADS-box genes were further investigated by full-length cDNA sequence analysis and expression validation, which revealed two APETALA1/AGL9-like MADS-box genes preferentially expressed in the sepal and petal, two AGAMOUS-like genes particularly restricted to the gynostemium, and four DEF-like genes distinctively expressed in different floral organs. The spatial expression of these genes varied distinctly in different floral mutant corresponding to different floral morphogenesis, which validated the specialized roles of them in floral patterning and further supported the effectiveness of our in silico analysis. This dataset generated in our study provides new insights into the molecular mechanisms underlying floral
The utility of Next Generation Sequencing for molecular diagnostics in Rett syndrome.

PubMed

Vidal, Silvia; Brandi, Núria; Pacheco, Paola; Gerotina, Edgar; Blasco, Laura; Trotta, Jean-Rémi; Derdak, Sophia; Del Mar O'Callaghan, Maria; Garcia-Cazorla, Àngels; Pineda, Mercè; Armstrong, Judith

2017-09-25

Rett syndrome (RTT) is an early-onset neurodevelopmental disorder that almost exclusively affects girls and is totally disabling. Three genes have been identified that cause RTT: MECP2, CDKL5 and FOXG1. However, the etiology of some of RTT patients still remains unknown. Recently, next generation sequencing (NGS) has promoted genetic diagnoses because of the quickness and affordability of the method. To evaluate the usefulness of NGS in genetic diagnosis, we present the genetic study of RTT-like patients using different techniques based on this technology. We studied 1577 patients with RTT-like clinical diagnoses and reviewed patients who were previously studied and thought to have RTT genes by Sanger sequencing. Genetically, 477 of 1577 patients with a RTT-like suspicion have been diagnosed. Positive results were found in 30% by Sanger sequencing, 23% with a custom panel, 24% with a commercial panel and 32% with whole exome sequencing. A genetic study using NGS allows the study of a larger number of genes associated with RTT-like symptoms simultaneously, providing genetic study of a wider group of patients as well as significantly reducing the response time and cost of the study.
Sequence of a second gene encoding bovine submaxillary mucin: implication for mucin heterogeneity and cloning.

PubMed

Jiang, W; Woitach, J T; Gupta, D; Bhavanandan, V P

1998-10-20

Secreted epithelial mucins are extremely large and heterogeneous glycoproteins. We report the 5 kilobase DNA sequence of a second gene, BSM2, which encodes bovine submaxillary mucin. The determined nucleotide and deduced amino acid sequences of BSM2 are 95.2% and 92. 2% identical, respectively, to those of the previously described BSM1 gene isolated from the same cow. Further, the five predicted protein domains of the two genes are 100%, 94%, 93%, 77%, and 88% identical. Based on the above results, we propose that expression of multiple homologous core proteins from a single animal is a factor in generating diversity of saccharides in mucins and in providing resistance of the molecules to proteolysis. In addition, this work raises several important issues in mucin cloning such as assembling sequences from seemingly overlapping clones and deducing consensus sequences for nearly identical tandem repeats. Copyright 1998 Academic Press.

Some links on this page may take you to non-federal websites. Their policies may differ from this site.