Sample records for dna sequence segments

  1. Sequential addition of short DNA oligos in DNA-polymerase-based synthesis reactions

    DOEpatents

    Gardner, Shea N; Mariella, Jr., Raymond P; Christian, Allen T; Young, Jennifer A; Clague, David S

    2013-06-25

    A method of preselecting a multiplicity of DNA sequence segments that will comprise the DNA molecule of user-defined sequence, separating the DNA sequence segments temporally, and combining the multiplicity of DNA sequence segments with at least one polymerase enzyme wherein the multiplicity of DNA sequence segments join to produce the DNA molecule of user-defined sequence. Sequence segments may be of length n, where n is an odd integer. In one embodiment the length of desired hybridizing overlap is specified by the user and the sequences and the protocol for combining them are guided by computational (bioinformatics) predictions. In one embodiment sequence segments are combined from multiple reading frames to span the same region of a sequence, so that multiple desired hybridizations may occur with different overlap lengths.

  2. Sequential addition of short DNA oligos in DNA-polymerase-based synthesis reactions

    DOEpatents

    Gardner, Shea N [San Leandro, CA; Mariella, Jr., Raymond P.; Christian, Allen T [Tracy, CA; Young, Jennifer A [Berkeley, CA; Clague, David S [Livermore, CA

    2011-01-18

    A method of fabricating a DNA molecule of user-defined sequence. The method comprises the steps of preselecting a multiplicity of DNA sequence segments that will comprise the DNA molecule of user-defined sequence, separating the DNA sequence segments temporally, and combining the multiplicity of DNA sequence segments with at least one polymerase enzyme wherein the multiplicity of DNA sequence segments join to produce the DNA molecule of user-defined sequence. Sequence segments may be of length n, where n is an even or odd integer. In one embodiment the length of desired hybridizing overlap is specified by the user and the sequences and the protocol for combining them are guided by computational (bioinformatics) predictions. In one embodiment sequence segments are combined from multiple reading frames to span the same region of a sequence, so that multiple desired hybridizations may occur with different overlap lengths. In one embodiment starting sequence fragments are of different lengths, n, n+1, n+2, etc.

  3. Synthesis of DNA

    DOEpatents

    Mariella, Jr., Raymond P.

    2008-11-18

    A method of synthesizing a desired double-stranded DNA of a predetermined length and of a predetermined sequence. Preselected sequence segments that will complete the desired double-stranded DNA are determined. Preselected segment sequences of DNA that will be used to complete the desired double-stranded DNA are provided. The preselected segment sequences of DNA are assembled to produce the desired double-stranded DNA.

  4. First Complete Squash leaf curl China virus Genomic Segment DNA-A Sequence from East Timor

    PubMed Central

    Maina, Solomon; Edwards, Owain R.; de Almeida, Luis; Ximenes, Abel

    2017-01-01

    ABSTRACT We present here the first complete Squash leaf curl China virus (SLCCV) genomic segment DNA-A sequence from East Timor. It was isolated from a pumpkin plant. When compared with 15 complete SLCCV DNA-A genome sequences from other world regions, it most resembled the Malaysian isolate MC1 sequence. PMID:28619789

  5. Negatively supercoiled simian virus 40 DNA contains Z-DNA segments within transcriptional enhancer sequences

    NASA Technical Reports Server (NTRS)

    Nordheim, A.; Rich, A.

    1983-01-01

    Three 8-base pair (bp) segments of alternating purine-pyrimidine from the simian virus 40 enhancer region form Z-DNA on negative supercoiling; minichromosome DNase I-hypersensitive sites determined by others bracket these three segments. A survey of transcriptional enhancer sequences reveals a pattern of potential Z-DNA-forming regions which occur in pairs 50-80 bp apart. This may influence local chromatin structure and may be related to transcriptional activation.

  6. Long-range correlations and charge transport properties of DNA sequences

    NASA Astrophysics Data System (ADS)

    Liu, Xiao-liang; Ren, Yi; Xie, Qiong-tao; Deng, Chao-sheng; Xu, Hui

    2010-04-01

    By using Hurst's analysis and transfer approach, the rescaled range functions and Hurst exponents of human chromosome 22 and enterobacteria phage lambda DNA sequences are investigated and the transmission coefficients, Landauer resistances and Lyapunov coefficients of finite segments based on above genomic DNA sequences are calculated. In a comparison with quasiperiodic and random artificial DNA sequences, we find that λ-DNA exhibits anticorrelation behavior characterized by a Hurst exponent 0.5

  7. Relations between Shannon entropy and genome order index in segmenting DNA sequences.

    PubMed

    Zhang, Yi

    2009-04-01

    Shannon entropy H and genome order index S are used in segmenting DNA sequences. Zhang [Phys. Rev. E 72, 041917 (2005)] found that the two schemes are equivalent when a DNA sequence is converted to a binary sequence of S (strong H bond) and W (weak H bond). They left the mathematical proof to mathematicians who are interested in this issue. In this paper, a possible mathematical explanation is given. Moreover, we find that Chargaff parity rule 2 is the necessary condition of the equivalence, and the equivalence disappears when a DNA sequence is regarded as a four-symbol sequence. At last, we propose that S-2(-H) may be related to species evolution.

  8. New Stopping Criteria for Segmenting DNA Sequences

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Li, Wentian

    2001-06-18

    We propose a solution on the stopping criterion in segmenting inhomogeneous DNA sequences with complex statistical patterns. This new stopping criterion is based on Bayesian information criterion in the model selection framework. When this criterion is applied to telomere of S.cerevisiae and the complete sequence of E.coli, borders of biologically meaningful units were identified, and a more reasonable number of domains was obtained. We also introduce a measure called segmentation strength which can be used to control the delineation of large domains. The relationship between the average domain size and the threshold of segmentation strength is determined for several genomemore » sequences.« less

  9. A comprehensive list of cloned human DNA sequences

    PubMed Central

    Schmidtke, Jörg; Cooper, David N.

    1987-01-01

    A list of DNA sequences cloned from the human genome is presented. Intended as a guide to clone availability, this list includes published reports of cDNA, genomic and synthetic clones comprising gene and pseudogene sequences, uncharacterised DNA segments and repetitive DNA elements. PMID:3575113

  10. A comprehensive list of cloned human DNA sequences

    PubMed Central

    Schmidtke, Jörg; Cooper, David N.

    1990-01-01

    A list of DNA sequences cloned from the human genome is presented. Intended as a guide to clone availability, this list includes published reports of cDNA, genomic and synthetic clones comprising gene and pseudogene sequences, uncharacterised DNA segments and repetitive DNA elements. PMID:2333227

  11. A comprehensive list of cloned human DNA sequences

    PubMed Central

    Schmidtke, Jörg; Cooper, David N.

    1988-01-01

    A list of DNA sequences cloned from the human genome is presented. Intended as a guide to clone availability, this list includes published reports of cDNA, genomic and synthetic clones comprising gene and pseudogene sequences, uncharacterised DNA segments and repetitive DNA elements. PMID:3368330

  12. A comprehensive list of cloned human DNA sequences

    PubMed Central

    Schmidtke, Jörg; Cooper, David N.

    1989-01-01

    A list of DNA sequences cloned from the human genome is presented. Intended as a guide to clone availability, this list includes published reports of cDNA, genomic and synthetic clones comprising gene and pseudogene sequences, uncharacterised DNA segments and repetitive DNA elements. PMID:2654889

  13. DsaV methyltransferase and its isoschizomers contain a conserved segment that is similar to the segment in Hhai methyltransferase that is in contact with DNA bases.

    PubMed Central

    Gopal, J; Yebra, M J; Bhagwat, A S

    1994-01-01

    The methyltransferase (MTase) in the DsaV restriction--modification system methylates within 5'-CCNGG sequences. We have cloned the gene for this MTase and determined its sequence. The predicted sequence of the MTase protein contains sequence motifs conserved among all cytosine-5 MTases and is most similar to other MTases that methylate CCNGG sequences, namely M.ScrFI and M.SsoII. All three MTases methylate the internal cytosine within their recognition sequence. The 'variable' region within the three enzymes that methylate CCNGG can be aligned with the sequences of two enzymes that methylate CCWGG sequences. Remarkably, two segments within this region contain significant similarity with the region of M.HhaI that is known to contact DNA bases. These alignments suggest that many cytosine-5 MTases are likely to interact with DNA using a similar structural framework. Images PMID:7971279

  14. Detecting the borders between coding and non-coding DNA regions in prokaryotes based on recursive segmentation and nucleotide doublets statistics

    PubMed Central

    2012-01-01

    Background Detecting the borders between coding and non-coding regions is an essential step in the genome annotation. And information entropy measures are useful for describing the signals in genome sequence. However, the accuracies of previous methods of finding borders based on entropy segmentation method still need to be improved. Methods In this study, we first applied a new recursive entropic segmentation method on DNA sequences to get preliminary significant cuts. A 22-symbol alphabet is used to capture the differential composition of nucleotide doublets and stop codon patterns along three phases in both DNA strands. This process requires no prior training datasets. Results Comparing with the previous segmentation methods, the experimental results on three bacteria genomes, Rickettsia prowazekii, Borrelia burgdorferi and E.coli, show that our approach improves the accuracy for finding the borders between coding and non-coding regions in DNA sequences. Conclusions This paper presents a new segmentation method in prokaryotes based on Jensen-Rényi divergence with a 22-symbol alphabet. For three bacteria genomes, comparing to A12_JR method, our method raised the accuracy of finding the borders between protein coding and non-coding regions in DNA sequences. PMID:23282225

  15. Genes encoding Xenopus laevis Ig L chains: Implications for the evolution of [kappa] and [lambda] chains

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Zezza, D.J.; Stewart, S.E.; Steiner, L.A.

    1992-12-15

    Xenopus laevis Ig contain two distinct types of L chains, designated [rho] or L1 and [sigma] or L2. The authors have analyzed Xenopus genomic DNA by Southern blotting with cDNA probes specific for L1 V and C regions. Many fragments hybridized to the V probe, but only one or two fragments hybridized to the C probe. Corresponding C, J, and V gene segments were identified on clones isolated from a genomic library prepared from the same DNA. One clone contains a C gene segment separated from a J gene segment by an intron of 3.4 kb. The J and Cmore » gene segments are nearly identical in sequence to cDNA clones analyzed previously. The C segment is somewhat more similar and the J segment considerably more similar in sequence to the corresponding segments of mammalian [kappa] chains than to those of mammalian [lambda] chains. Upstream of the J segment is a typical recombination signal sequence with a spacer of 23 bp, as in J[kappa]. A second clone from the library contains four V gene segments, separated by 2.1 to 3.6 kb. Two of these, V1 and V3, have the expected structural and regulatory features of V genes, and are very similar in sequence to each other and to mammalian V[kappa]. A third gene segment, V2, resembles V1 and V3 in its coding region and nearby 5[prime]-flanking region, but diverges in sequence 5[prime] to position [minus]95 with loss of the octamer promoter element. The fourth V-like segment is similar to the others at the 3[prime]-end, but upstream of codon 64 bears no resemblance in sequence to any Ig V region. All four V segments have typical recombination signal sequences with 12-bp spacers at their 3[prime]-ends, as in V[kappa]. Taken together, the data suggest that Xenopus L1 L chain genes are members of the [kappa] gene family. 80 refs., 9 figs.« less

  16. Method for introducing unidirectional nested deletions

    DOEpatents

    Dunn, John J.; Quesada, Mark A.; Randesi, Matthew

    2001-01-01

    Disclosed is a method for the introduction of unidirectional deletions in a cloned DNA segment in the context of a cloning vector which contains an f1 endonuclease recognition sequence adjacent to the insertion site of the DNA segment. Also disclosed is a method for producing single-stranded DNA probes utilizing the same cloning vector. An optimal vector, PZIP is described. Methods for introducing unidirectional deletions into a terminal location of a cloned DNA sequence which is inserted into the vector of the present invention are also disclosed. These methods are useful for introducing deletions into either or both ends of a cloned DNA insert, for high throughput sequencing of any DNA of interest.

  17. A compositional segmentation of the human mitochondrial genome is related to heterogeneities in the guanine mutation rate

    PubMed Central

    Samuels, David C.; Boys, Richard J.; Henderson, Daniel A.; Chinnery, Patrick F.

    2003-01-01

    We applied a hidden Markov model segmentation method to the human mitochondrial genome to identify patterns in the sequence, to compare these patterns to the gene structure of mtDNA and to see whether these patterns reveal additional characteristics important for our understanding of genome evolution, structure and function. Our analysis identified three segmentation categories based upon the sequence transition probabilities. Category 2 segments corresponded to the tRNA and rRNA genes, with a greater strand-symmetry in these segments. Category 1 and 3 segments covered the protein- coding genes and almost all of the non-coding D-loop. Compared to category 1, the mtDNA segments assigned to category 3 had much lower guanine abundance. A comparison to two independent databases of mitochondrial mutations and polymorphisms showed that the high substitution rate of guanine in human mtDNA is largest in the category 3 segments. Analysis of synonymous mutations showed the same pattern. This suggests that this heterogeneity in the mutation rate is partly independent of respiratory chain function and is a direct property of the genome sequence itself. This has important implications for our understanding of mtDNA evolution and its use as a ‘molecular clock’ to determine the rate of population and species divergence. PMID:14530452

  18. Formation of rings from segments of HeLa-cell nuclear deoxyribonucleic acid

    PubMed Central

    Hardman, Norman

    1974-01-01

    Duplex segments of HeLa-cell nuclear DNA were generated by cleavage with DNA restriction endonuclease from Haemophilus influenzae. About 20–25% of the DNA segments produced, when partly degraded with exonuclease III and annealed, were found to form rings visible in the electron microscope. A further 5% of the DNA segments formed structures that were branched in configuration. Similar structures were generated from HeLa-cell DNA, without prior treatment with restriction endonuclease, when the complementary polynucleotide chains were exposed by exonuclease III action at single-chain nicks. After exposure of an average single-chain length of 1400 nucleotides per terminus at nicks in HeLa-cell DNA by exonuclease III, followed by annealing, the physical length of ring closures was estimated and found to be 0.02–0.1μm, or 50–300 base pairs. An almost identical distribution of lengths was recorded for the regions of complementary base sequence responsible for branch formation. It is proposed that most of the rings and branches are formed from classes of reiterated base sequence with an average length of 180 base pairs arranged intermittenly in HeLa-cell DNA. From the rate of formation of branched structures when HeLa-cell DNA segments were heat-denatured and annealed, it is estimated that the reiterated sequences are in families containing approximately 2400–24000 copies. ImagesPLATE 2PLATE 1 PMID:4462738

  19. MSuPDA: A Memory Efficient Algorithm for Sequence Alignment.

    PubMed

    Khan, Mohammad Ibrahim; Kamal, Md Sarwar; Chowdhury, Linkon

    2016-03-01

    Space complexity is a million dollar question in DNA sequence alignments. In this regard, memory saving under pushdown automata can help to reduce the occupied spaces in computer memory. Our proposed process is that anchor seed (AS) will be selected from given data set of nucleotide base pairs for local sequence alignment. Quick splitting techniques will separate the AS from all the DNA genome segments. Selected AS will be placed to pushdown automata's (PDA) input unit. Whole DNA genome segments will be placed into PDA's stack. AS from input unit will be matched with the DNA genome segments from stack of PDA. Match, mismatch and indel of nucleotides will be popped from the stack under the control unit of pushdown automata. During the POP operation on stack, it will free the memory cell occupied by the nucleotide base pair.

  20. Method of artificial DNA splicing by directed ligation (SDL).

    PubMed Central

    Lebedenko, E N; Birikh, K R; Plutalov, O V; Berlin YuA

    1991-01-01

    An approach to directed genetic recombination in vitro has been devised, which allows for joining together, in a predetermined way, a series of DNA segments to give a precisely spliced polynucleotide sequence (DNA splicing by directed ligation, SDL). The approach makes use of amplification, by means of several polymerase chain reactions (PCR), of a chosen set of DNA segments. Primers for the amplifications contain recognition sites of the class IIS restriction endonucleases, which transform blunt ends of the amplification products into protruding ends of unique primary structures, the ends to be used for joining segments together being mutually complementary. Ligation of the mixture of the segments so synthesized gives the desired sequence in an unambiguous way. The suggested approach has been exemplified by the synthesis of a totally processed (intronless) gene encoding human mature interleukin-1 alpha. Images PMID:1662363

  1. Nucleotide sequence analysis establishes the role of endogenous murine leukemia virus DNA segments in formation of recombinant mink cell focus-forming murine leukemia viruses.

    PubMed Central

    Khan, A S

    1984-01-01

    The sequence of 363 nucleotides near the 3' end of the pol gene and 564 nucleotides from the 5' terminus of the env gene in an endogenous murine leukemia viral (MuLV) DNA segment, cloned from AKR/J mouse DNA and designated as A-12, was obtained. For comparison, the nucleotide sequence in an analogous portion of AKR mink cell focus-forming (MCF) 247 MuLV provirus was also determined. Sequence features unique to MCF247 MuLV DNA in the 3' pol and 5' env regions were identified by comparison with nucleotide sequences in analogous regions of NFS -Th-1 xenotropic and AKR ecotropic MuLV proviruses. These included (i) an insertion of 12 base pairs encoding four amino acids located 60 base pairs from the 3' terminus of the pol gene and immediately preceding the env gene, (ii) the deletion of 12 base pairs (encoding four amino acids) and the insertion of 3 base pairs (encoding one amino acid) in the 5' portion of the env gene, and (iii) single base substitutions resulting in 2 MCF247 -specific amino acids in the 3' pol and 23 in the 5' env regions. Nucleotide sequence comparison involving the 3' pol and 5' env regions of AKR MCF247 , NFS xenotropic, and AKR ecotropic MuLV proviruses with the cloned endogenous MuLV DNA indicated that MCF247 proviral DNA sequences were conserved in the cloned endogenous MuLV proviral segment. In fact, total nucleotide sequence identity existed between the endogenous MuLV DNA and the MCF247 MuLV provirus in the 3' portion of the pol gene. In the 5' env region, only 4 of 564 nucleotides were different, resulting in three amino acid changes between AKR MCF247 MuLV DNA and the endogenous MuLV DNA present in clone A-12. In addition, nucleotide sequence comparison indicated that Moloney-and Friend-MCF MuLVs were also highly related in the 3' pol and 5' env regions to the cloned endogenous MuLV DNA. These results establish the role of endogenous MuLV DNA segments in generation of recombinant MCF viruses. PMID:6328017

  2. MSuPDA: A memory efficient algorithm for sequence alignment.

    PubMed

    Khan, Mohammad Ibrahim; Kamal, Md Sarwar; Chowdhury, Linkon

    2015-01-16

    Space complexity is a million dollar question in DNA sequence alignments. In this regards, MSuPDA (Memory Saving under Pushdown Automata) can help to reduce the occupied spaces in computer memory. Our proposed process is that Anchor Seed (AS) will be selected from given data set of Nucleotides base pairs for local sequence alignment. Quick Splitting (QS) techniques will separate the Anchor Seed from all the DNA genome segments. Selected Anchor Seed will be placed to pushdown Automata's (PDA) input unit. Whole DNA genome segments will be placed into PDA's stack. Anchor Seed from input unit will be matched with the DNA genome segments from stack of PDA. Whatever matches, mismatches or Indel, of Nucleotides will be POP from the stack under the control of control unit of Pushdown Automata. During the POP operation on stack it will free the memory cell occupied by the Nucleotide base pair.

  3. Clones from a shooty tobacco crown gall tumor I: deletions, rearrangements and amplifications resulting in irregular T-DNA structures and organizations.

    PubMed

    Peerbolte, R; Leenhouts, K; Hooykaas-van Slogteren, G M; Hoge, J H; Wullems, G J; Schilperoort, R A

    1986-07-01

    Transformed clones from a shooty tobacco crown gall tumor, induced byAgrobacterium tumefaciens strain LBA1501, having a Tn1831 insertion in the auxin locus, were investigated for their T-DNA structure and expression. In addition to clones with the expected phenotype, i.e. phytohormone autonomy, regeneration of non-rooting shoots and octopine synthesis (Aut(+)Reg(+)Ocs(+) 'type I' clones), clones were obtained with an aberrant phenotype. Among these were the Aut(-)Reg(-)Ocs(+) 'type II' clones. Two shooty type I clones and three type II callus clones (all randomly chosen) as well as a rooting shoot regenerated from a type II clone via a high kinetin treatment, all had a T-DNA structure which differed significantly from 'regular' T-DNA structures. No Tn1831 DNA sequences were detected in these clones. The two type I clones were identical: they both contained the same highly truncated T-DNA segments. One TL-DNA segment of approximately 0.7 kb, originating form the left part of the TL-region, was present at one copy per diploid tobacco genome. Another segment with a maximum size of about 7 kb was derived from the right hand part of the TL-region and was present at minimally two copies. Three copies of a truncated TR-DNA segment were detected, probably starting at the right TR-DNA border repeat and ending halfway the regular TR-region. Indications have been obtained that at least some of the T-DNA segments are closely linked, sometimes via intervening plant DNA sequences. The type I clones harbored TL-DNA transcripts 4, 6a/b and 3 as well as TR-DNA transcript 0'. The type II clones harbored three to six highly truncated T-DNA segments, originating from the right part of the TL-region. In addition they had TR-DNA segments, similar to those of the type I clones. On Northern blots TR-DNA transcripts 0' and 1' were detected as well as the TL-DNA transcripts 3 and 6a/b and an 1800 bp hybrid transcript (tr.Y) containing gene 6b sequences. Possible origins of the observed irregularities in T-DNA structures are discussed in relation to fidelity of transformation of plant cells viaAgrobacterium.

  4. Presence of a consensus DNA motif at nearby DNA sequence of the mutation susceptible CG nucleotides.

    PubMed

    Chowdhury, Kaushik; Kumar, Suresh; Sharma, Tanu; Sharma, Ankit; Bhagat, Meenakshi; Kamai, Asangla; Ford, Bridget M; Asthana, Shailendra; Mandal, Chandi C

    2018-01-10

    Complexity in tissues affected by cancer arises from somatic mutations and epigenetic modifications in the genome. The mutation susceptible hotspots present within the genome indicate a non-random nature and/or a position specific selection of mutation. An association exists between the occurrence of mutations and epigenetic DNA methylation. This study is primarily aimed at determining mutation status, and identifying a signature for predicting mutation prone zones of tumor suppressor (TS) genes. Nearby sequences from the top five positions having a higher mutation frequency in each gene of 42 TS genes were selected from a cosmic database and were considered as mutation prone zones. The conserved motifs present in the mutation prone DNA fragments were identified. Molecular docking studies were done to determine putative interactions between the identified conserved motifs and enzyme methyltransferase DNMT1. Collective analysis of 42 TS genes found GC as the most commonly replaced and AT as the most commonly formed residues after mutation. Analysis of the top 5 mutated positions of each gene (210 DNA segments for 42 TS genes) identified that CG nucleotides of the amino acid codons (e.g., Arginine) are most susceptible to mutation, and found a consensus DNA "T/AGC/GAGGA/TG" sequence present in these mutation prone DNA segments. Similar to TS genes, analysis of 54 oncogenes not only found CG nucleotides of the amino acid Arg as the most susceptible to mutation, but also identified the presence of similar consensus DNA motifs in the mutation prone DNA fragments (270 DNA segments for 54 oncogenes) of oncogenes. Docking studies depicted that, upon binding of DNMT1 methylates to this consensus DNA motif (C residues of CpG islands), mutation was likely to occur. Thus, this study proposes that DNMT1 mediated methylation in chromosomal DNA may decrease if a foreign DNA segment containing this consensus sequence along with CG nucleotides is exogenously introduced to dividing cancer cells. Copyright © 2017 Elsevier B.V. All rights reserved.

  5. The nucleotide sequence of a segment of Trypanosoma brucei mitochondrial maxi-circle DNA that contains the gene for apocytochrome b and some unusual unassigned reading frames.

    PubMed Central

    Benne, R; De Vries, B F; Van den Burg, J; Klaver, B

    1983-01-01

    The nucleotide sequence of a 2.5-kb segment of the maxi-circle of Trypanosoma brucei mtDNA has been determined. The segment contains the gene for apocytochrome b, which displays about 25% homology at the amino acid level to the apocytochrome b gene from fungal and mammalian mtDNAs. Northern blot and S1 nuclease analyses have yielded accurate map positions of an RNA species in an area that coincides with the reading frame. The segment also contains two pairs of overlapping unassigned reading frames, which lack homology with any known mitochondrial gene or URF. The DNA sequence in these areas is AG-rich (70%), resulting in URFs with an unusually high level of glycine and charged amino acids (60%). They may not encode proteins, in spite of their size and the fact that abundant transcripts are mapped in these areas. Images PMID:6314266

  6. Spliced DNA Sequences in the Paramecium Germline: Their Properties and Evolutionary Potential

    PubMed Central

    Catania, Francesco; McGrath, Casey L.; Doak, Thomas G.; Lynch, Michael

    2013-01-01

    Despite playing a crucial role in germline-soma differentiation, the evolutionary significance of developmentally regulated genome rearrangements (DRGRs) has received scant attention. An example of DRGR is DNA splicing, a process that removes segments of DNA interrupting genic and/or intergenic sequences. Perhaps, best known for shaping immune-system genes in vertebrates, DNA splicing plays a central role in the life of ciliated protozoa, where thousands of germline DNA segments are eliminated after sexual reproduction to regenerate a functional somatic genome. Here, we identify and chronicle the properties of 5,286 sequences that putatively undergo DNA splicing (i.e., internal eliminated sequences [IESs]) across the genomes of three closely related species of the ciliate Paramecium (P. tetraurelia, P. biaurelia, and P. sexaurelia). The study reveals that these putative IESs share several physical characteristics. Although our results are consistent with excision events being largely conserved between species, episodes of differential IES retention/excision occur, may have a recent origin, and frequently involve coding regions. Our findings indicate interconversion between somatic—often coding—DNA sequences and noncoding IESs, and provide insights into the role of DNA splicing in creating potentially functional genetic innovation. PMID:23737328

  7. Multisegment nanowire sensors for the detection of DNA molecules.

    PubMed

    Wang, Xu; Ozkan, Cengiz S

    2008-02-01

    We describe a novel application for detecting specific single strand DNA sequences using multisegment nanowires via a straightforward surface functionalization method. Nanowires comprising CdTe-Au-CdTe segments are fabricated using electrochemical deposition, and electrical characterization indicates a p-type behavior for the multisegment nanostructures, in a back-to-back Schottky diode configuration. Such nanostructures modified with thiol-terminated probe DNA fragments could function as high fidelity sensors for biomolecules at very low concentration. The gold segment is utilized for functionalization and binding of single strand DNA (ssDNA) fragments while the CdTe segments at both ends serve to modulate the equilibrium Fermi level of the heterojunction device upon hybridization of the complementary DNA fragments (cDNA) to the ssDNA over the Au segment. Employing such multisegment nanowires could lead to the fabrication more sophisticated and high multispecificity biosensors via selective functionalization of individual segments for biowarfare sensing and medical diagnostics applications.

  8. The LINE-1 DNA sequences in four mammalian orders predict proteins that conserve homologies to retrovirus proteins.

    PubMed Central

    Fanning, T; Singer, M

    1987-01-01

    Recent work suggests that one or more members of the highly repeated LINE-1 (L1) DNA family found in all mammals may encode one or more proteins. Here we report the sequence of a portion of an L1 cloned from the domestic cat (Felis catus). These data permit comparison of the L1 sequences in four mammalian orders (Carnivore, Lagomorph, Rodent and Primate) and the comparison supports the suggested coding potential. In two separate, noncontiguous regions in the carboxy terminal half of the proteins predicted from the DNA sequences, there are several strongly conserved segments. In one region, these share homology with known or suspected reverse transcriptases, as described by others in rodents and primates. In the second region, closer to the carboxy terminus, the strongly conserved segments are over 90% homologous among the four orders. One of the latter segments is cysteine rich and resembles the putative metal binding domains of nucleic acid binding proteins, including those of TFIIIA and retroviruses. PMID:3562227

  9. The primary structures of two yeast enolase genes. Homology between the 5' noncoding flanking regions of yeast enolase and glyceraldehyde-3-phosphate dehydrogenase genes.

    PubMed

    Holland, M J; Holland, J P; Thill, G P; Jackson, K A

    1981-02-10

    Segments of yeast genomic DNA containing two enolase structural genes have been isolated by subculture cloning procedures using a cDNA hybridization probe synthesized from purified yeast enolase mRNA. Based on restriction endonuclease and transcriptional maps of these two segments of yeast DNA, each hybrid plasmid contains a region of extensive nucleotide sequence homology which forms hybrids with the cDNA probe. The DNA sequences which flank this homologous region in the two hybrid plasmids are nonhomologous indicating that these sequences are nontandemly repeated in the yeast genome. The complete nucleotide sequence of the coding as well as the flanking noncoding regions of these genes has been determined. The amino acid sequence predicted from one reading frame of both structural genes is extremely similar to that determined for yeast enolase (Chin, C. C. Q., Brewer, J. M., Eckard, E., and Wold, F. (1981) J. Biol. Chem. 256, 1370-1376), confirming that these isolated structural genes encode yeast enolase. The nucleotide sequences of the coding regions of the genes are approximately 95% homologous, and neither gene contains an intervening sequence. Codon utilization in the enolase genes follows the same biased pattern previously described for two yeast glyceraldehyde-3-phosphate dehydrogenase structural genes (Holland, J. P., and Holland, M. J. (1980) J. Biol. Chem. 255, 2596-2605). DNA blotting analysis confirmed that the isolated segments of yeast DNA are colinear with yeast genomic DNA and that there are two nontandemly repeated enolase genes per haploid yeast genome. The noncoding portions of the two enolase genes adjacent to the initiation and termination codons are approximately 70% homologous and contain sequences thought to be involved in the synthesis and processing messenger RNA. Finally there are regions of extensive homology between the two enolase structural genes and two yeast glyceraldehyde-3-phosphate dehydrogenase structural genes within the 5- noncoding portions of these glycolytic genes.

  10. Molecular phylogeny of grey mullets (Teleostei: Mugilidae) in Greece: evidence from sequence analysis of mtDNA segments.

    PubMed

    Papasotiropoulos, Vasilis; Klossa-Kilia, Elena; Alahiotis, Stamatis N; Kilias, George

    2007-08-01

    Mitochondrial DNA sequence analysis has been used to explore genetic differentiation and phylogenetic relationships among five species of the Mugilidae family, Mugil cephalus, Chelon labrosus, Liza aurata, Liza ramada, and Liza saliens. DNA was isolated from samples originating from the Messolongi Lagoon in Greece. Three mtDNA segments (12s rRNA, 16s rRNA, and CO I) were PCR amplified and sequenced. Sequencing analysis revealed that the greatest genetic differentiation was observed between M. cephalus and all the other species studied, while C. labrosus and L. aurata were the closest taxa. Dendrograms obtained by the neighbor-joining method and Bayesian inference analysis exhibited the same topology. According to this topology, M. cephalus is the most distinct species and the remaining taxa are clustered together, with C. labrosus and L. aurata forming a single group. The latter result brings into question the monophyletic origin of the genus Liza.

  11. Transposon-containing DNA cloning vector and uses thereof

    DOEpatents

    Berg, C.M.; Berg, D.E.; Wang, G.

    1997-07-08

    The present invention discloses a rapid method of restriction mapping, sequencing or localizing genetic features in a segment of deoxyribonucleic acid (DNA) that is up to 42 kb in size. The method in part comprises cloning of the DNA segment in a specialized cloning vector and then isolating nested deletions in either direction in vivo by intramolecular transposition into the cloned DNA. A plasmid has been prepared and disclosed. 4 figs.

  12. Transposon-containing DNA cloning vector and uses thereof

    DOEpatents

    Berg, Claire M.; Berg, Douglas E.; Wang, Gan

    1997-01-01

    The present invention discloses a rapid method of restriction mapping, sequencing or localizing genetic features in a segment of deoxyribonucleic acid (DNA) that is up to 42 kb in size. The method in part comprises cloning of the DNA segment in a specialized cloning vector and then isolating nested deletions in either direction in vivo by intramolecular transposition into the cloned DNA. A plasmid has been prepared and disclosed.

  13. Markov models of genome segmentation

    NASA Astrophysics Data System (ADS)

    Thakur, Vivek; Azad, Rajeev K.; Ramaswamy, Ram

    2007-01-01

    We introduce Markov models for segmentation of symbolic sequences, extending a segmentation procedure based on the Jensen-Shannon divergence that has been introduced earlier. Higher-order Markov models are more sensitive to the details of local patterns and in application to genome analysis, this makes it possible to segment a sequence at positions that are biologically meaningful. We show the advantage of higher-order Markov-model-based segmentation procedures in detecting compositional inhomogeneity in chimeric DNA sequences constructed from genomes of diverse species, and in application to the E. coli K12 genome, boundaries of genomic islands, cryptic prophages, and horizontally acquired regions are accurately identified.

  14. Human Chromosome 7: DNA Sequence and Biology

    PubMed Central

    Scherer, Stephen W.; Cheung, Joseph; MacDonald, Jeffrey R.; Osborne, Lucy R.; Nakabayashi, Kazuhiko; Herbrick, Jo-Anne; Carson, Andrew R.; Parker-Katiraee, Layla; Skaug, Jennifer; Khaja, Razi; Zhang, Junjun; Hudek, Alexander K.; Li, Martin; Haddad, May; Duggan, Gavin E.; Fernandez, Bridget A.; Kanematsu, Emiko; Gentles, Simone; Christopoulos, Constantine C.; Choufani, Sanaa; Kwasnicka, Dorota; Zheng, Xiangqun H.; Lai, Zhongwu; Nusskern, Deborah; Zhang, Qing; Gu, Zhiping; Lu, Fu; Zeesman, Susan; Nowaczyk, Malgorzata J.; Teshima, Ikuko; Chitayat, David; Shuman, Cheryl; Weksberg, Rosanna; Zackai, Elaine H.; Grebe, Theresa A.; Cox, Sarah R.; Kirkpatrick, Susan J.; Rahman, Nazneen; Friedman, Jan M.; Heng, Henry H. Q.; Pelicci, Pier Giuseppe; Lo-Coco, Francesco; Belloni, Elena; Shaffer, Lisa G.; Pober, Barbara; Morton, Cynthia C.; Gusella, James F.; Bruns, Gail A. P.; Korf, Bruce R.; Quade, Bradley J.; Ligon, Azra H.; Ferguson, Heather; Higgins, Anne W.; Leach, Natalia T.; Herrick, Steven R.; Lemyre, Emmanuelle; Farra, Chantal G.; Kim, Hyung-Goo; Summers, Anne M.; Gripp, Karen W.; Roberts, Wendy; Szatmari, Peter; Winsor, Elizabeth J. T.; Grzeschik, Karl-Heinz; Teebi, Ahmed; Minassian, Berge A.; Kere, Juha; Armengol, Lluis; Pujana, Miguel Angel; Estivill, Xavier; Wilson, Michael D.; Koop, Ben F.; Tosi, Sabrina; Moore, Gudrun E.; Boright, Andrew P.; Zlotorynski, Eitan; Kerem, Batsheva; Kroisel, Peter M.; Petek, Erwin; Oscier, David G.; Mould, Sarah J.; Döhner, Hartmut; Döhner, Konstanze; Rommens, Johanna M.; Vincent, John B.; Venter, J. Craig; Li, Peter W.; Mural, Richard J.; Adams, Mark D.; Tsui, Lap-Chee

    2010-01-01

    DNA sequence and annotation of the entire human chromosome 7, encompassing nearly 158 million nucleotides of DNA and 1917 gene structures, are presented. To generate a higher order description, additional structural features such as imprinted genes, fragile sites, and segmental duplications were integrated at the level of the DNA sequence with medical genetic data, including 440 chromosome rearrangement breakpoints associated with disease. This approach enabled the discovery of candidate genes for developmental diseases including autism. PMID:12690205

  15. BAC sequencing using pooled methods.

    PubMed

    Saski, Christopher A; Feltus, F Alex; Parida, Laxmi; Haiminen, Niina

    2015-01-01

    Shotgun sequencing and assembly of a large, complex genome can be both expensive and challenging to accurately reconstruct the true genome sequence. Repetitive DNA arrays, paralogous sequences, polyploidy, and heterozygosity are main factors that plague de novo genome sequencing projects that typically result in highly fragmented assemblies and are difficult to extract biological meaning. Targeted, sub-genomic sequencing offers complexity reduction by removing distal segments of the genome and a systematic mechanism for exploring prioritized genomic content through BAC sequencing. If one isolates and sequences the genome fraction that encodes the relevant biological information, then it is possible to reduce overall sequencing costs and efforts that target a genomic segment. This chapter describes the sub-genome assembly protocol for an organism based upon a BAC tiling path derived from a genome-scale physical map or from fine mapping using BACs to target sub-genomic regions. Methods that are described include BAC isolation and mapping, DNA sequencing, and sequence assembly.

  16. Characterization of proviruses cloned from mink cell focus-forming virus-infected cellular DNA.

    PubMed Central

    Khan, A S; Repaske, R; Garon, C F; Chan, H W; Rowe, W P; Martin, M A

    1982-01-01

    Two proviruses were cloned from EcoRI-digested DNA extracted from mink cells chronically infected with AKR mink cell focus-forming (MCF) 247 murine leukemia virus (MuLV), using a lambda phage host vector system. One cloned MuLV DNA fragment (designated MCF 1) contained sequences extending 6.8 kilobases from an EcoRI restriction site in the 5' long terminal repeat (LTR) to an EcoRI site located in the envelope (env) region and was indistinguishable by restriction endonuclease mapping for 5.1 kilobases (except for the EcoRI site in the LTR) from the 5' end of AKR ecotropic proviral DNA. The DNA segment extending from 5.1 to 6.8 kilobases contained several restriction sites that were not present in the AKR ecotropic provirus. A 0.5-kilobase DNA segment located at the 3' end of MCF 1 DNA contained sequences which hybridized to a xenotropic env-specific DNA probe but not to labeled ecotropic env-specific DNA. This dual character of MCF 1 proviral DNA was also confirmed by analyzing heteroduplex molecules by electron microscopy. The second cloned proviral DNA (designated MCF 2) was a 6.9-kilobase EcoRI DNA fragment which contained LTR sequences at each end and a 2.0-kilobase deletion encompassing most of the env region. The MCF 2 proviral DNA proved to be a useful reagent for detecting LTRs electron microscopically due to the presence of nonoverlapping, terminally located LTR sequences which effected its circularization with DNAs containing homologous LTR sequences. Nucleotide sequence analysis demonstrated the presence of a 104-base-pair direct repeat in the LTR of MCF 2 DNA. In contrast, only a single copy of the reiterated component of the direct repeat was present in MCF 1 DNA. Images PMID:6281459

  17. Transposon-like properties of the major, long repetitive sequence family in the genome of Physarum polycephalum

    PubMed Central

    Pearston, Douglas H.; Gordon, Mairi; Hardman, Norman

    1985-01-01

    A family of long, highly-repetitive sequences, referred to previously as `HpaII-repeats', dominates the genome of the eukaryotic slime mould Physarum polycephalum. These sequences are found exclusively in scrambled clusters. They account for about one-half of the total complement of repetitive DNA in Physarum, and represent the major sequence component found in hypermethylated, 20-50 kb segments of Physarum genomic DNA that fail to be cleaved using the restriction endonuclease HpaII. The structure of this abundant repetitive element was investigated by analysing cloned segments derived from the hypermethylated genomic DNA compartment. We show that the `HpaII-repeat' forms part of a larger repetitive DNA structure, ∼8.6 kb in length, with several structural features in common with recognised eukaryotic transposable genetic elements. Scrambled clusters of the sequence probably arise as a result of transposition-like events, during which the element preferentially recombines in either orientation with target sites located in other copies of the same repeated sequence. The target sites for transposition/recombination are not related in sequence but in all cases studied they are potentially capable of promoting the formation of small `cruciforms' or `Z-DNA' structures which might be recognised during the recombination process. ImagesFig. 3.Fig. 4. PMID:16453652

  18. Fine structure of the 21S ribosomal RNA region on yeast mitochondrial DNA. III. Physical location of mitochondrial genetic markers and the molecular nature of omega.

    PubMed

    Heyting, C; Menke, H H

    1979-01-11

    1. We have determined the physical location of mitochondrial genetic markers in the 21S region of yeast mtDNA by genetic analysis of petite mutants whose mtDNA has been physically mapped on the wild-type mtDNA. 2. The order of loci, determined in this study, is in agreement with the order deduced from recombination analysis and coretention analysis except for the position of omega+: we conclude that omega+ is located between C321 (RIB-1) and E514 (RIB-3). 3. The marker E514 (RIB-3) has been localized on a DNA segment of 3800 bp, and the markers E354, E553 and cs23 (RIB-2) on a DNA segment of 1100 base pairs; both these segments overlap the 21S rRNA cistron. The marker C321 (RIB-1) has been localized within a segment of 240 bp which also overlaps the 21S rRNA cistron, and we infer on the basis of indirect evidence that this marker lies within this cistron. 4. In all our rho+ as well as rho- strains there is a one-to-one correlation between the omega+ phenotype, the ability to transmit the omega+ allele and the presence of a mtDNA segment of about 1000 bp long, located between sequences specifying RIB-3 and sequences corresponding to the loci RIB-1 and RIB-2. This segment may be inserted at this same position into omega- mtDNA by recombination. 5. The role which the different allelic forms of omega may play in the polarity of recombination is discussed.

  19. Non-canonical ribosomal DNA segments in the human genome, and nucleoli functioning.

    PubMed

    Kupriyanova, Natalia S; Netchvolodov, Kirill K; Sadova, Anastasia A; Cherepanova, Marina D; Ryskov, Alexei P

    2015-11-10

    Ribosomal DNA (rDNA) in the human genome is represented by tandem repeats of 43 kb nucleotide sequences that form nucleoli organizers (NORs) on each of five pairs of acrocentric chromosomes. RDNA-similar segments of different lengths are also present on (NOR)(-) chromosomes. Many of these segments contain nucleotide substitutions, supplementary microsatellite clusters, and extended deletions. Recently, it was shown that, in addition to ribosome biogenesis, nucleoli exhibit additional functions, such as cell-cycle regulation and response to stresses. In particular, several stress-inducible loci located in the ribosomal intergenic spacer (rIGS) produce stimuli-specific noncoding nucleolus RNAs. By mapping the 5'/3' ends of the rIGS segments scattered throughout (NOR)(-) chromosomes, we discovered that the bonds in the rIGS that were most often susceptible to disruption in the rIGS were adjacent to, or overlapped with stimuli-specific inducible loci. This suggests the interconnection of the two phenomena - nucleoli functioning and the scattering of rDNA-like sequences on (NOR)(-) chromosomes. Copyright © 2015 Elsevier B.V. All rights reserved.

  20. Relaxation dynamics of internal segments of DNA chains in nanochannels

    NASA Astrophysics Data System (ADS)

    Jain, Aashish; Muralidhar, Abhiram; Dorfman, Kevin; Dorfman Group Team

    We will present relaxation dynamics of internal segments of a DNA chain confined in nanochannel. The results have direct application in genome mapping technology, where long DNA molecules containing sequence-specific fluorescent probes are passed through an array of nanochannels to linearize them, and then the distances between these probes (the so-called ``DNA barcode'') are measured. The relaxation dynamics of internal segments set the experimental error due to dynamic fluctuations. We developed a multi-scale simulation algorithm, combining a Pruned-Enriched Rosenbluth Method (PERM) simulation of a discrete wormlike chain model with hard spheres with Brownian dynamics (BD) simulations of a bead-spring chain. Realistic parameters such as the bead friction coefficient and spring force law parameters are obtained from PERM simulations and then mapped onto the bead-spring model. The BD simulations are carried out to obtain the extension autocorrelation functions of various segments, which furnish their relaxation times. Interestingly, we find that (i) corner segments relax faster than the center segments and (ii) relaxation times of corner segments do not depend on the contour length of DNA chain, whereas the relaxation times of center segments increase linearly with DNA chain size.

  1. A simple, rapid, high-fidelity and cost-effective PCR-based two-step DNA synthesis method for long gene sequences.

    PubMed

    Xiong, Ai-Sheng; Yao, Quan-Hong; Peng, Ri-He; Li, Xian; Fan, Hui-Qin; Cheng, Zong-Ming; Li, Yi

    2004-07-07

    Chemical synthesis of DNA sequences provides a powerful tool for modifying genes and for studying gene function, structure and expression. Here, we report a simple, high-fidelity and cost-effective PCR-based two-step DNA synthesis (PTDS) method for synthesis of long segments of DNA. The method involves two steps. (i) Synthesis of individual fragments of the DNA of interest: ten to twelve 60mer oligonucleotides with 20 bp overlap are mixed and a PCR reaction is carried out with high-fidelity DNA polymerase Pfu to produce DNA fragments that are approximately 500 bp in length. (ii) Synthesis of the entire sequence of the DNA of interest: five to ten PCR products from the first step are combined and used as the template for a second PCR reaction using high-fidelity DNA polymerase pyrobest, with the two outermost oligonucleotides as primers. Compared with the previously published methods, the PTDS method is rapid (5-7 days) and suitable for synthesizing long segments of DNA (5-6 kb) with high G + C contents, repetitive sequences or complex secondary structures. Thus, the PTDS method provides an alternative tool for synthesizing and assembling long genes with complex structures. Using the newly developed PTDS method, we have successfully obtained several genes of interest with sizes ranging from 1.0 to 5.4 kb.

  2. Short segment search method for phylogenetic analysis using nested sliding windows

    NASA Astrophysics Data System (ADS)

    Iskandar, A. A.; Bustamam, A.; Trimarsanto, H.

    2017-10-01

    To analyze phylogenetics in Bioinformatics, coding DNA sequences (CDS) segment is needed for maximal accuracy. However, analysis by CDS cost a lot of time and money, so a short representative segment by CDS, which is envelope protein segment or non-structural 3 (NS3) segment is necessary. After sliding window is implemented, a better short segment than envelope protein segment and NS3 is found. This paper will discuss a mathematical method to analyze sequences using nested sliding window to find a short segment which is representative for the whole genome. The result shows that our method can find a short segment which more representative about 6.57% in topological view to CDS segment than an Envelope segment or NS3 segment.

  3. Sequence-Dependent Persistence Length of Long DNA

    NASA Astrophysics Data System (ADS)

    Chuang, Hui-Min; Reifenberger, Jeffrey G.; Cao, Han; Dorfman, Kevin D.

    2017-12-01

    Using a high-throughput genome-mapping approach, we obtained circa 50 million measurements of the extension of internal human DNA segments in a 41 nm ×41 nm nanochannel. The underlying DNA sequences, obtained by mapping to the reference human genome, are 2.5-393 kilobase pairs long and contain percent GC contents between 32.5% and 60%. Using Odijk's theory for a channel-confined wormlike chain, these data reveal that the DNA persistence length increases by almost 20% as the percent GC content increases. The increased persistence length is rationalized by a model, containing no adjustable parameters, that treats the DNA as a statistical terpolymer with a sequence-dependent intrinsic persistence length and a sequence-independent electrostatic persistence length.

  4. Exon trapping: a genetic screen to identify candidate transcribed sequences in cloned mammalian genomic DNA.

    PubMed

    Duyk, G M; Kim, S W; Myers, R M; Cox, D R

    1990-11-01

    Identification and recovery of transcribed sequences from cloned mammalian genomic DNA remains an important problem in isolating genes on the basis of their chromosomal location. We have developed a strategy that facilitates the recovery of exons from random pieces of cloned genomic DNA. The basis of this "exon trapping" strategy is that, during a retroviral life cycle, genomic sequences of nonviral origin are correctly spliced and may be recovered as a cDNA copy of the introduced segment. By using this genetic assay for cis-acting sequences required for RNA splicing, we have screened approximately 20 kilobase pairs of cloned genomic DNA and have recovered all four predicted exons.

  5. Exon trapping: a genetic screen to identify candidate transcribed sequences in cloned mammalian genomic DNA.

    PubMed Central

    Duyk, G M; Kim, S W; Myers, R M; Cox, D R

    1990-01-01

    Identification and recovery of transcribed sequences from cloned mammalian genomic DNA remains an important problem in isolating genes on the basis of their chromosomal location. We have developed a strategy that facilitates the recovery of exons from random pieces of cloned genomic DNA. The basis of this "exon trapping" strategy is that, during a retroviral life cycle, genomic sequences of nonviral origin are correctly spliced and may be recovered as a cDNA copy of the introduced segment. By using this genetic assay for cis-acting sequences required for RNA splicing, we have screened approximately 20 kilobase pairs of cloned genomic DNA and have recovered all four predicted exons. PMID:2247475

  6. Specific and non-specific interactions of ParB with DNA: implications for chromosome segregation

    PubMed Central

    Taylor, James A.; Pastrana, Cesar L.; Butterer, Annika; Pernstich, Christian; Gwynn, Emma J.; Sobott, Frank; Moreno-Herrero, Fernando; Dillingham, Mark S.

    2015-01-01

    The segregation of many bacterial chromosomes is dependent on the interactions of ParB proteins with centromere-like DNA sequences called parS that are located close to the origin of replication. In this work, we have investigated the binding of Bacillus subtilis ParB to DNA in vitro using a variety of biochemical and biophysical techniques. We observe tight and specific binding of a ParB homodimer to the parS sequence. Binding of ParB to non-specific DNA is more complex and displays apparent positive co-operativity that is associated with the formation of larger, poorly defined, nucleoprotein complexes. Experiments with magnetic tweezers demonstrate that non-specific binding leads to DNA condensation that is reversible by protein unbinding or force. The condensed DNA structure is not well ordered and we infer that it is formed by many looping interactions between neighbouring DNA segments. Consistent with this view, ParB is also able to stabilize writhe in single supercoiled DNA molecules and to bridge segments from two different DNA molecules in trans. The experiments provide no evidence for the promotion of non-specific DNA binding and/or condensation events by the presence of parS sequences. The implications of these observations for chromosome segregation are discussed. PMID:25572315

  7. Binding of resveratrol to the minor groove of DNA sequences with AATT and TTAA segments induces differential stability.

    PubMed

    Nair, Maya S; D'Mello, Samar; Pant, Rashmi; Poluri, Krishna Mohan

    2017-05-01

    Interactions of a natural stilbene compound, resveratrol with two DNA sequences containing AATT/TTAA segments have been studied. Resveratrol is found to interact with both the sequences. The mode of interaction has been studied using absorption, steady state fluorescence and circular dichroism spectroscopic techniques. UV-visible absorption and fluorescence studies provided the information regarding the binding constants and the stoichiometry of binding, whereas circular dichroism studies depicted the structural changes in DNA upon resveratrol binding. Our results evidenced that, though resveratrol showed similar affinity to both the sequences, the mode of interactions was different. The binding constants of resveratrol to AATT/TTAA sequences were found to be 7.55×10 5 M -1 and 5.42×10 5 M -1 respectively. Spectroscopic data evidenced for a groove binding interaction. Melting studies showed that the binding of resveratrol induces differential stability to the DNA sequences d(CGTTAACG) 2 and d(CGAATTCG) 2 . Fluorescence data showed a stoichiometry of 1:1 for d(CGAATTCG) 2 -resveratrol complex and 1:4 for d(CGTTAACG) 2 -resveratrol complex. Molecular docking studies demonstrated that resveratrol binds to the minor groove region of both the sequences to form stable complexes with varied atomic contacts to the DNA bases or backbone. Both the complexes are stabilized by hydrogen bond formation. Our results evidenced that modulation of DNA sequence within the same bases can greatly alter the binding geometry and stability of the complex upon binding to small molecule inhibitor compounds like resveratrol. Copyright © 2017 Elsevier B.V. All rights reserved.

  8. Repeated sequence sets in mitochondrial DNA molecules of root knot nematodes (Meloidogyne): nucleotide sequences, genome location and potential for host-race identification.

    PubMed Central

    Okimoto, R; Chamberlin, H M; Macfarlane, J L; Wolstenholme, D R

    1991-01-01

    Within a 7 kb segment of the mtDNA molecule of the root knot nematode, Meloidogyne javanica, that lacks standard mitochondrial genes, are three sets of strictly tandemly arranged, direct repeat sequences: approximately 36 copies of a 102 ntp sequence that contains a TaqI site; 11 copies of a 63 ntp sequence, and 5 copies of an 8 ntp sequence. The 7 kb repeat-containing segment is bounded by putative tRNAasp and tRNAf-met genes and the arrangement of sequences within this segment is: the tRNAasp gene; a unique 1,528 ntp segment that contains two highly stable hairpin-forming sequences; the 102 ntp repeat set; the 8 ntp repeat set; a unique 1,068 ntp segment; the 63 ntp repeat set; and the tRNAf-met gene. The nucleotide sequences of the 102 ntp copies and the 63 ntp copies have been conserved among the species examined. Data from Southern hybridization experiments indicate that 102 ntp and 63 ntp repeats occur in the mtDNAs of three, two and two races of M.incognita, M.hapla and M.arenaria, respectively. Nucleotide sequences of the M.incognita Race-3 102 ntp repeat were found to be either identical or highly similar to those of the M.javanica 102 ntp repeat. Differences in migration distance and number of 102 ntp repeat-containing bands seen in Southern hybridization autoradiographs of restriction-digested mtDNAs of M.javanica and the different host races of M.incognita, M.hapla and M.arenaria are sufficient to distinguish the different host races of each species. Images PMID:2027769

  9. The Gene Construction Kit: a new computer program for manipulating and presenting DNA constructs.

    PubMed

    Gross, R H

    1990-06-01

    The Gene Construction Kit is a new tool for manipulating and displaying DNA sequence information. Constructs can be displayed either graphically or as formatted sequence. Segments of DNA can be cut out with restriction enzymes and pasted into other sites. The program keeps track of staggered ends and notifies the user of incompatibilities and offers a choice of ligation options. Each segment of a construct can have its own defined thickness, pattern, direction and color. The sequence listing can be displayed in any font and style in user defined grouping. Nucleotide positions can be displayed as can restriction sites and protein sequences. The DNA can be displayed as either single- or double-stranded. Restriction sites can be readily marked. Alternative views of the DNA can be maintained and the history of the construct automatically stored. Gel electrophoresis patterns can be generated and can be used in cloning project design. Extensive comments can be stored with the construct and can be searched rapidly for key words. High quality illustrations showing multiple editable constructs with added graphics and text information can be generated for slides, posters or publication.

  10. A family of long intergenic non-coding RNA genes in human chromosomal region 22q11.2 carry a DNA translocation breakpoint/AT-rich sequence

    PubMed Central

    2018-01-01

    FAM230C, a long intergenic non-coding RNA (lincRNA) gene in human chromosome 13 (chr13) is a member of lincRNA genes termed family with sequence similarity 230. An analysis using bioinformatics search tools and alignment programs was undertaken to determine properties of FAM230C and its related genes. Results reveal that the DNA translocation element, the Translocation Breakpoint Type A (TBTA) sequence, which consists of satellite DNA, Alu elements, and AT-rich sequences is embedded in the FAM230C gene. Eight lincRNA genes related to FAM230C also carry the TBTA sequences. These genes were formed from a large segment of the 3’ half of the FAM230C sequence duplicated in chr22, and are specifically in regions of low copy repeats (LCR22)s, in or close to the 22q.11.2 region. 22q11.2 is a chromosomal segment that undergoes a high rate of DNA translocation and is prone to genetic deletions. FAM230C-related genes present in other chromosomes do not carry the TBTA motif and were formed from the 5’ half region of the FAM230C sequence. These findings identify a high specificity in lincRNA gene formation by gene sequence duplication in different chromosomes. PMID:29668722

  11. Cloning and sequence analysis of complementary DNA encoding an aberrantly rearranged human T-cell gamma chain.

    PubMed Central

    Dialynas, D P; Murre, C; Quertermous, T; Boss, J M; Leiden, J M; Seidman, J G; Strominger, J L

    1986-01-01

    Complementary DNA (cDNA) encoding a human T-cell gamma chain has been cloned and sequenced. At the junction of the variable and joining regions, there is an apparent deletion of two nucleotides in the human cDNA sequence relative to the murine gamma-chain cDNA sequence, resulting simultaneously in the generation of an in-frame stop codon and in a translational frameshift. For this reason, the sequence presented here encodes an aberrantly rearranged human T-cell gamma chain. There are several surprising differences between the deduced human and murine gamma-chain amino acid sequences. These include poor homology in the variable region, poor homology in a discrete segment of the constant region precisely bounded by the expected junctions of exon CII, and the presence in the human sequence of five potential sites for N-linked glycosylation. Images PMID:3458221

  12. Phylogeographic Differentiation of Mitochondrial DNA in Han Chinese

    PubMed Central

    Yao, Yong-Gang; Kong, Qing-Peng; Bandelt, Hans-Jürgen; Kivisild, Toomas; Zhang, Ya-Ping

    2002-01-01

    To characterize the mitochondrial DNA (mtDNA) variation in Han Chinese from several provinces of China, we have sequenced the two hypervariable segments of the control region and the segment spanning nucleotide positions 10171–10659 of the coding region, and we have identified a number of specific coding-region mutations by direct sequencing or restriction-fragment–length–polymorphism tests. This allows us to define new haplogroups (clades of the mtDNA phylogeny) and to dissect the Han mtDNA pool on a phylogenetic basis, which is a prerequisite for any fine-grained phylogeographic analysis, the interpretation of ancient mtDNA, or future complete mtDNA sequencing efforts. Some of the haplogroups under study differ considerably in frequencies across different provinces. The southernmost provinces show more pronounced contrasts in their regional Han mtDNA pools than the central and northern provinces. These and other features of the geographical distribution of the mtDNA haplogroups observed in the Han Chinese make an initial Paleolithic colonization from south to north plausible but would suggest subsequent migration events in China that mainly proceeded from north to south and east to west. Lumping together all regional Han mtDNA pools into one fictive general mtDNA pool or choosing one or two regional Han populations to represent all Han Chinese is inappropriate for prehistoric considerations as well as for forensic purposes or medical disease studies. PMID:11836649

  13. Genome Partitioner: A web tool for multi-level partitioning of large-scale DNA constructs for synthetic biology applications.

    PubMed

    Christen, Matthias; Del Medico, Luca; Christen, Heinz; Christen, Beat

    2017-01-01

    Recent advances in lower-cost DNA synthesis techniques have enabled new innovations in the field of synthetic biology. Still, efficient design and higher-order assembly of genome-scale DNA constructs remains a labor-intensive process. Given the complexity, computer assisted design tools that fragment large DNA sequences into fabricable DNA blocks are needed to pave the way towards streamlined assembly of biological systems. Here, we present the Genome Partitioner software implemented as a web-based interface that permits multi-level partitioning of genome-scale DNA designs. Without the need for specialized computing skills, biologists can submit their DNA designs to a fully automated pipeline that generates the optimal retrosynthetic route for higher-order DNA assembly. To test the algorithm, we partitioned a 783 kb Caulobacter crescentus genome design. We validated the partitioning strategy by assembling a 20 kb test segment encompassing a difficult to synthesize DNA sequence. Successful assembly from 1 kb subblocks into the 20 kb segment highlights the effectiveness of the Genome Partitioner for reducing synthesis costs and timelines for higher-order DNA assembly. The Genome Partitioner is broadly applicable to translate DNA designs into ready to order sequences that can be assembled with standardized protocols, thus offering new opportunities to harness the diversity of microbial genomes for synthetic biology applications. The Genome Partitioner web tool can be accessed at https://christenlab.ethz.ch/GenomePartitioner.

  14. The Complete Nucleotide Sequence of the Human Immunoglobulin Heavy Chain Variable Region Locus

    PubMed Central

    Matsuda, Fumihiko; Ishii, Kazuo; Bourvagnet, Patrice; Kuma, Kei-ichi; Hayashida, Hidenori; Miyata, Takashi; Honjo, Tasuku

    1998-01-01

    The complete nucleotide sequence of the 957-kb DNA of the human immunoglobulin heavy chain variable (VH) region locus was determined and 43 novel VH segments were identified. The region contains 123 VH segments classifiable into seven different families, of which 79 are pseudogenes. Of the 44 VH segments with an open reading frame, 39 are expressed as heavy chain proteins and 1 as mRNA, while the remaining 4 are not found in immunoglobulin cDNAs. Combinatorial diversity of VH region was calculated to be ∼6,000. Conservation of the promoter and recombination signal sequences was observed to be higher in functional VH segments than in pseudogenes. Phylogenetic analysis of 114 VH segments clearly showed clustering of the VH segments of each family. However, an independent branch in the tree contained a single VH, V4-44.1P, sharing similar levels of homology to human VH families and to those of other vertebrates. Comparison between different copies of homologous units that appear repeatedly across the locus clearly demonstrates that dynamic DNA reorganization of the locus took place at least eight times between 133 and 10 million years ago. One nonimmunoglobulin gene of unknown function was identified in the intergenic region. PMID:9841928

  15. Determination of Trichuris skrjabini by sequencing of the ITS1-5.8S-ITS2 segment of the ribosomal DNA: comparative molecular study of different species of trichurids.

    PubMed

    Cutillas, C; Oliveros, R; de Rojas, M; Guevara, D C

    2004-06-01

    Adults of Trichuris skrjahini have been isolated from the cecum of caprine hosts (Capra hircus), Trichuris ovis and Trichuris globulosa from Ovis aries (sheep) and C. hircus (goats), and Trichuris leporis from Lepus europaeus (rabbits) in Spain. Genomic DNA was isolated and the ITS1-5.8S-ITS2 segment from the ribosomal DNA (rDNA) was amplified and sequenced by polymerase chain reaction (PCR) techniques. The ITS1 of T. skrjabini, T. ovis, T. globulosa, and T. leporis was 495, 757, 757, and 536 nucleotides in length, respectively, and had G + C contents of 59.6, 58.7, 58.7, and 60.8%, respectively. Intraindividual variation was detected in the ITSI sequences of the 4 species. Furthermore, the 5.8S sequences of T. skrjabini, T. ovis, T. globulosa, and T. leporis were compared. A total of 157, 152, 153, and 157 nucleotides in length was observed in the 5.8S sequences of these 4 species, respectively. There were no sequence differences of ITS1 and 5.8S products between T. ovis and T. globulosa. Nevertheless, clear differences were detected between the ITS1 sequences of T. skrjabini, T. ovis, T. leporis, Trichuris muris, and T. arvicolae. The ITS2 fragment from the rDNA of T. skrjabini was sequenced. A comparative study of the ITS2 sequence of T. skrjabini with the previously published ITS2 sequence data of T. ovis, T. leporis, T. muris, and T. arvicolae suggested that the combined use of sequence data from both spacers would be useful in the molecular characterization of trichurid parasites.

  16. 50 years of DNA ‘Breathing’: Reflections on Old and New Approaches

    PubMed Central

    von Hippel, Peter H.; Johnson, Neil P.; Marcus, Andrew H.

    2015-01-01

    Summary The coding sequences for genes, and much other regulatory information involved in genome expression, are located ‘inside’ the DNA duplex. Thus the ‘macromolecular machines’ that read-out this information from the base sequence of the DNA must somehow access the DNA ‘interior’. Double-stranded (ds) DNA is a highly structured and cooperatively stabilized system at physiological temperatures, but is also only marginally stable and undergoes a cooperative ‘melting phase transition’ at temperatures not far above physiological. Furthermore, due to its length and heterogeneous sequence, with AT-rich segments being less stable than GC-rich segments, the DNA genome ‘melts’ in a multistate fashion. Therefore the DNA genome must also manifest thermally driven structural (‘breathing’) fluctuations at physiological temperatures that should reflect the heterogeneity of the dsDNA stability near the melting temperature. Thus many of the breathing fluctuations of dsDNA are likely also to be sequence dependent, and could well contain information that should be ‘readable’ and useable by regulatory proteins and protein complexes in site-specific binding reactions involving dsDNA ‘opening’. Our laboratory has been involved in studying the breathing fluctuations of duplex DNA for about 50 years. In this ‘Reflections’ article we present a relatively chronological overview of these studies, starting with the use of simple chemical probes (such as hydrogen exchange, formaldehyde and simple DNA ‘melting’ proteins) to examine the local stability of the dsDNA structure, and culminating in sophisticated spectroscopic approaches that can be used to monitor the breathing-dependent interactions of regulatory complexes with their duplex DNA targets in ‘real time’. PMID:23840028

  17. Nucleotide sequence of a cluster of early and late genes in a conserved segment of the vaccinia virus genome.

    PubMed Central

    Plucienniczak, A; Schroeder, E; Zettlmeissl, G; Streeck, R E

    1985-01-01

    The nucleotide sequence of a 7.6 kb vaccinia DNA segment from a genomic region conserved among different orthopox virus has been determined. This segment contains a tight cluster of 12 partly overlapping open reading frames most of which can be correlated with previously identified early and late proteins and mRNAs. Regulatory signals used by vaccinia virus have been studied. Presumptive promoter regions are rich in A, T and carry the consensus sequences TATA and AATAA spaced at 20-24 base pairs. Tandem repeats of a CTATTC consensus sequence are proposed to be involved in the termination of early transcription. PMID:2987815

  18. Sequence analysis of the canine mitochondrial DNA control region from shed hair samples in criminal investigations.

    PubMed

    Berger, C; Berger, B; Parson, W

    2012-01-01

    In recent years, evidence from domestic dogs has increasingly been analyzed by forensic DNA testing. Especially, canine hairs have proved most suitable and practical due to the high rate of hair transfer occurring between dogs and humans. Starting with the description of a contamination-free sample handling procedure, we give a detailed workflow for sequencing hypervariable segments (HVS) of the mtDNA control region from canine evidence. After the hair material is lysed and the DNA extracted by Phenol/Chloroform, the amplification and sequencing strategy comprises the HVS I and II of the canine control region and is optimized for DNA of medium-to-low quality and quantity. The sequencing procedure is based on the Sanger Big-dye deoxy-terminator method and the separation of the sequencing reaction products is performed on a conventional multicolor fluorescence detection capillary electrophoresis platform. Finally, software-aided base calling and sequence interpretation are addressed exemplarily.

  19. Molecular cloning and sequencing of the cDNA and gene for a novel elastinolytic metalloproteinase from Aspergillus fumigatus and its expression in Escherichia coli.

    PubMed Central

    Sirakova, T D; Markaryan, A; Kolattukudy, P E

    1994-01-01

    An extracellular elastinolytic metalloproteinase, purified from Aspergillus fumigatus isolated from an aspergillosis and patient/and an internal peptide derived from it were subjected to N-terminal sequencing. Oligonucleotide primers based on these sequences were used to PCR amplify a segment of the metalloproteinase cDNA, which was used as a probe to isolate the cDNA and gene for this enzyme. The gene sequence matched exactly with the cDNA sequence except for the four introns that interrupted the open reading frame. According to the deduced amino acid sequence, the metalloproteinase has a signal sequence and 227 additional amino acids preceding the sequence for the mature protein of 389 amino acids with a calculated molecular mass of 42 kDa, which is close to the size of the purified mature fungal proteinase. This sequence contains segments that matched both the N terminus of the mature protein and the internal peptide. A. fumigatus metalloproteinase contains some of the conserved zinc-binding and active-site motifs characteristic of metalloproteinases but shows no overall homology with known metalloproteinases. The cDNA of the mature protein when introduced into Escherichia coli directed the expression of a protein with a size, N-terminal sequence, and immunological cross-reactivity identical to those of the native fungal enzyme. Although the enzyme in the inclusion bodies could not be renatured, expression at 30 degrees C yielded soluble enzyme that showed chromatographic behavior identical to that of the native fungal enzyme and catalyzed hydrolysis of elastin. The metalloproteinase gene described here was not found in Aspergillus flavus. Images PMID:7927676

  20. Amplification and chromosomal dispersion of human endogenous retroviral sequences

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Steele, P.E.; Martin, M.A.; Rabson, A.B.

    1986-09-01

    Endogenous retroviral sequences have undergone amplification events involving both viral and flanking cellular sequences. The authors cloned members of an amplified family of full-length endogenous retroviral sequences. Genomic blotting, employing a flanking cellular DNA probe derived from a member of this family, revealed a similar array of reactive bands in both humans and chimpanzees, indicating that an amplification event involving retroviral and associated cellular DNA sequences occurred before the evolutionary separation of these two primates. Southern analyses of restricted somatic cell hybrid DNA preparations suggested that endogenous retroviral segments are widely dispersed in the human genome and that amplification andmore » dispersion events may be linked.« less

  1. Chiron: translating nanopore raw signal directly into nucleotide sequence using deep learning.

    PubMed

    Teng, Haotian; Cao, Minh Duc; Hall, Michael B; Duarte, Tania; Wang, Sheng; Coin, Lachlan J M

    2018-05-01

    Sequencing by translocating DNA fragments through an array of nanopores is a rapidly maturing technology that offers faster and cheaper sequencing than other approaches. However, accurately deciphering the DNA sequence from the noisy and complex electrical signal is challenging. Here, we report Chiron, the first deep learning model to achieve end-to-end basecalling and directly translate the raw signal to DNA sequence without the error-prone segmentation step. Trained with only a small set of 4,000 reads, we show that our model provides state-of-the-art basecalling accuracy, even on previously unseen species. Chiron achieves basecalling speeds of more than 2,000 bases per second using desktop computer graphics processing units.

  2. Spreadsheet-based program for alignment of overlapping DNA sequences.

    PubMed

    Anbazhagan, R; Gabrielson, E

    1999-06-01

    Molecular biology laboratories frequently face the challenge of aligning small overlapping DNA sequences derived from a long DNA segment. Here, we present a short program that can be used to adapt Excel spreadsheets as a tool for aligning DNA sequences, regardless of their orientation. The program runs on any Windows or Macintosh operating system computer with Excel 97 or Excel 98. The program is available for use as an Excel file, which can be downloaded from the BioTechniques Web site. Upon execution, the program opens a specially designed customized workbook and is capable of identifying overlapping regions between two sequence fragments and displaying the sequence alignment. It also performs a number of specialized functions such as recognition of restriction enzyme cutting sites and CpG island mapping without costly specialized software.

  3. Intramolecular transposition by a synthetic IS50 (Tn5) derivative

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Tomcsanyi, T.; Phadnis, S.H.; Berg, D.E.

    1990-11-01

    We report the formation of deletions and inversions by intramolecular transposition of Tn5-derived mobile elements. The synthetic transposons used contained the IS50 O and I end segments and the transposase gene, a contraselectable gene encoding sucrose sensitivity (sacB), antibiotic resistance genes, and a plasmid replication origin. Both deletions and inversions were associated with loss of a 300-bp segment that is designated the vector because it is outside of the transposon. Deletions were severalfold more frequent than inversions, perhaps reflecting constraints on DNA twisting or abortive transposition. Restriction and DNA sequence analyses showed that both types of rearrangements extended from onemore » transposon end to many different sites in target DNA. In the case of inversions, transposition generated 9-bp direct repeats of target sequences.« less

  4. Superstatistical model of bacterial DNA architecture

    NASA Astrophysics Data System (ADS)

    Bogachev, Mikhail I.; Markelov, Oleg A.; Kayumov, Airat R.; Bunde, Armin

    2017-02-01

    Understanding the physical principles that govern the complex DNA structural organization as well as its mechanical and thermodynamical properties is essential for the advancement in both life sciences and genetic engineering. Recently we have discovered that the complex DNA organization is explicitly reflected in the arrangement of nucleotides depicted by the universal power law tailed internucleotide interval distribution that is valid for complete genomes of various prokaryotic and eukaryotic organisms. Here we suggest a superstatistical model that represents a long DNA molecule by a series of consecutive ~150 bp DNA segments with the alternation of the local nucleotide composition between segments exhibiting long-range correlations. We show that the superstatistical model and the corresponding DNA generation algorithm explicitly reproduce the laws governing the empirical nucleotide arrangement properties of the DNA sequences for various global GC contents and optimal living temperatures. Finally, we discuss the relevance of our model in terms of the DNA mechanical properties. As an outlook, we focus on finding the DNA sequences that encode a given protein while simultaneously reproducing the nucleotide arrangement laws observed from empirical genomes, that may be of interest in the optimization of genetic engineering of long DNA molecules.

  5. Genome Partitioner: A web tool for multi-level partitioning of large-scale DNA constructs for synthetic biology applications

    PubMed Central

    Del Medico, Luca; Christen, Heinz; Christen, Beat

    2017-01-01

    Recent advances in lower-cost DNA synthesis techniques have enabled new innovations in the field of synthetic biology. Still, efficient design and higher-order assembly of genome-scale DNA constructs remains a labor-intensive process. Given the complexity, computer assisted design tools that fragment large DNA sequences into fabricable DNA blocks are needed to pave the way towards streamlined assembly of biological systems. Here, we present the Genome Partitioner software implemented as a web-based interface that permits multi-level partitioning of genome-scale DNA designs. Without the need for specialized computing skills, biologists can submit their DNA designs to a fully automated pipeline that generates the optimal retrosynthetic route for higher-order DNA assembly. To test the algorithm, we partitioned a 783 kb Caulobacter crescentus genome design. We validated the partitioning strategy by assembling a 20 kb test segment encompassing a difficult to synthesize DNA sequence. Successful assembly from 1 kb subblocks into the 20 kb segment highlights the effectiveness of the Genome Partitioner for reducing synthesis costs and timelines for higher-order DNA assembly. The Genome Partitioner is broadly applicable to translate DNA designs into ready to order sequences that can be assembled with standardized protocols, thus offering new opportunities to harness the diversity of microbial genomes for synthetic biology applications. The Genome Partitioner web tool can be accessed at https://christenlab.ethz.ch/GenomePartitioner. PMID:28531174

  6. Method for introducing unidirectional nested deletions

    DOEpatents

    Dunn, J.J.; Quesada, M.A.; Randesi, M.

    1999-07-27

    Disclosed is a method for the introduction of unidirectional deletions in a cloned DNA segment. More specifically, the method comprises providing a recombinant DNA construct comprising a DNA segment of interest inserted in a cloning vector. The cloning vector has an f1 endonuclease recognition sequence adjacent to the insertion site of the DNA segment of interest. The recombinant DNA construct is then contacted with the protein pII encoded by gene II of phage f1 thereby generating a single-stranded nick. The nicked DNA is then contacted with E. coli Exonuclease III thereby expanding the single-stranded nick into a single-stranded gap. The single-stranded gapped DNA is then contacted with a single-strand-specific endonuclease thereby producing a linearized DNA molecule containing a double-stranded deletion corresponding in size to the single-stranded gap. The DNA treated in this manner is then incubated with DNA ligase under conditions appropriate for ligation. Also disclosed is a method for producing single-stranded DNA probes. In this embodiment, single-stranded gapped DNA, produced as described above, is contacted with a DNA polymerase in the presence of labeled nucleotides to fill in the gap. This DNA is then linearized by digestion with a restriction enzyme which cuts outside the DNA segment of interest. The product of this digestion is then denatured to produce a labeled single-stranded nucleic acid probe. 1 fig.

  7. Method for introducing unidirectional nested deletions

    DOEpatents

    Dunn, John J.; Quesada, Mark A.; Randesi, Matthew

    1999-07-27

    Disclosed is a method for the introduction of unidirectional deletions in a cloned DNA segment. More specifically, the method comprises providing a recombinant DNA construct comprising a DNA segment of interest inserted in a cloning vector, the cloning vector having an f1 endonuclease recognition sequence adjacent to the insertion site of the DNA segment of interest. The recombinant DNA construct is then contacted with the protein pII encoded by gene II of phage f1 thereby generating a single-stranded nick. The nicked DNA is then contacted with E. coli Exonuclease III thereby expanding the single-stranded nick into a single-stranded gap. The single-stranded gapped DNA is then contacted with a single-strand-specific endonuclease thereby producing a linearized DNA molecule containing a double-stranded deletion corresponding in size to the single-stranded gap. The DNA treated in this manner is then incubated with DNA ligase under conditions appropriate for ligation. Also disclosed is a method for producing single-stranded DNA probes. In this embodiment, single-stranded gapped DNA, produced as described above, is contacted with a DNA polymerase in the presence of labeled nucleotides to fill in the gap. This DNA is then linearized by digestion with a restriction enzyme which cuts outside the DNA segment of interest. The product of this digestion is then denatured to produce a labeled single-stranded nucleic acid probe.

  8. Method for producing labeled single-stranded nucleic acid probes

    DOEpatents

    Dunn, John J.; Quesada, Mark A.; Randesi, Matthew

    1999-10-19

    Disclosed is a method for the introduction of unidirectional deletions in a cloned DNA segment. More specifically, the method comprises providing a recombinant DNA construct comprising a DNA segment of interest inserted in a cloning vector, the cloning vector having an f1 endonuclease recognition sequence adjacent to the insertion site of the DNA segment of interest. The recombinant DNA construct is then contacted with the protein pII encoded by gene II of phage f1 thereby generating a single-stranded nick. The nicked DNA is then contacted with E. coli Exonuclease III thereby expanding the single-stranded nick into a single-stranded gap. The single-stranded gapped DNA is then contacted with a single-strand-specific endonuclease thereby producing a linearized DNA molecule containing a double-stranded deletion corresponding in size to the single-stranded gap. The DNA treated in this manner is then incubated with DNA ligase under conditions appropriate for ligation. Also disclosed is a method for producing single-stranded DNA probes. In this embodiment, single-stranded gapped DNA, produced as described above, is contacted with a DNA polymerase in the presence of labeled nucleotides to fill in the gap. This DNA is then linearized by digestion with a restriction enzyme which cuts outside the DNA segment of interest. The product of this digestion is then denatured to produce a labeled single-stranded nucleic acid probe.

  9. DNABIT Compress - Genome compression algorithm.

    PubMed

    Rajarajeswari, Pothuraju; Apparao, Allam

    2011-01-22

    Data compression is concerned with how information is organized in data. Efficient storage means removal of redundancy from the data being stored in the DNA molecule. Data compression algorithms remove redundancy and are used to understand biologically important molecules. We present a compression algorithm, "DNABIT Compress" for DNA sequences based on a novel algorithm of assigning binary bits for smaller segments of DNA bases to compress both repetitive and non repetitive DNA sequence. Our proposed algorithm achieves the best compression ratio for DNA sequences for larger genome. Significantly better compression results show that "DNABIT Compress" algorithm is the best among the remaining compression algorithms. While achieving the best compression ratios for DNA sequences (Genomes),our new DNABIT Compress algorithm significantly improves the running time of all previous DNA compression programs. Assigning binary bits (Unique BIT CODE) for (Exact Repeats, Reverse Repeats) fragments of DNA sequence is also a unique concept introduced in this algorithm for the first time in DNA compression. This proposed new algorithm could achieve the best compression ratio as much as 1.58 bits/bases where the existing best methods could not achieve a ratio less than 1.72 bits/bases.

  10. Plant DNA sequences from feces: potential means for assessing diets of wild primates.

    PubMed

    Bradley, Brenda J; Stiller, Mathias; Doran-Sheehy, Diane M; Harris, Tara; Chapman, Colin A; Vigilant, Linda; Poinar, Hendrik

    2007-06-01

    Analyses of plant DNA in feces provides a promising, yet largely unexplored, means of documenting the diets of elusive primates. Here we demonstrate the promise and pitfalls of this approach using DNA extracted from fecal samples of wild western gorillas (Gorilla gorilla) and black and white colobus monkeys (Colobus guereza). From these DNA extracts we amplified, cloned, and sequenced small segments of chloroplast DNA (part of the rbcL gene) and plant nuclear DNA (ITS-2). The obtained sequences were compared to sequences generated from known plant samples and to those in GenBank to identify plant taxa in the feces. With further optimization, this method could provide a basic evaluation of minimum primate dietary diversity even when knowledge of local flora is limited. This approach may find application in studies characterizing the diets of poorly-known, unhabituated primate species or assaying consumer-resource relationships in an ecosystem. (c) 2007 Wiley-Liss, Inc.

  11. Extensive structural variations between mitochondrial genomes of CMS and normal peppers (Capsicum annuum L.) revealed by complete nucleotide sequencing.

    PubMed

    Jo, Yeong Deuk; Choi, Yoomi; Kim, Dong-Hwan; Kim, Byung-Dong; Kang, Byoung-Cheorl

    2014-07-04

    Cytoplasmic male sterility (CMS) is an inability to produce functional pollen that is caused by mutation of the mitochondrial genome. Comparative analyses of mitochondrial genomes of lines with and without CMS in several species have revealed structural differences between genomes, including extensive rearrangements caused by recombination. However, the mitochondrial genome structure and the DNA rearrangements that may be related to CMS have not been characterized in Capsicum spp. We obtained the complete mitochondrial genome sequences of the pepper CMS line FS4401 (507,452 bp) and the fertile line Jeju (511,530 bp). Comparative analysis between mitochondrial genomes of peppers and tobacco that are included in Solanaceae revealed extensive DNA rearrangements and poor conservation in non-coding DNA. In comparison between pepper lines, FS4401 and Jeju mitochondrial DNAs contained the same complement of protein coding genes except for one additional copy of an atp6 gene (ψatp6-2) in FS4401. In terms of genome structure, we found eighteen syntenic blocks in the two mitochondrial genomes, which have been rearranged in each genome. By contrast, sequences between syntenic blocks, which were specific to each line, accounted for 30,380 and 17,847 bp in FS4401 and Jeju, respectively. The previously-reported CMS candidate genes, orf507 and ψatp6-2, were located on the edges of the largest sequence segments that were specific to FS4401. In this region, large number of small sequence segments which were absent or found on different locations in Jeju mitochondrial genome were combined together. The incorporation of repeats and overlapping of connected sequence segments by a few nucleotides implied that extensive rearrangements by homologous recombination might be involved in evolution of this region. Further analysis using mtDNA pairs from other plant species revealed common features of DNA regions around CMS-associated genes. Although large portion of sequence context was shared by mitochondrial genomes of CMS and male-fertile pepper lines, extensive genome rearrangements were detected. CMS candidate genes located on the edges of highly-rearranged CMS-specific DNA regions and near to repeat sequences. These characteristics were detected among CMS-associated genes in other species, implying a common mechanism might be involved in the evolution of CMS-associated genes.

  12. A 21.7 kb DNA segment on the left arm of yeast chromosome XIV carries WHI3, GCR2, SPX18, SPX19, an homologue to the heat shock gene SSB1 and 8 new open reading frames of unknown function.

    PubMed

    Jonniaux, J L; Coster, F; Purnelle, B; Goffeau, A

    1994-12-01

    We report the amino acid sequence of 13 open reading frames (ORF > 299 bp) located on a 21.7 kb DNA segment from the left arm of chromosome XIV of Saccharomyces cerevisiae. Five open reading frames had been entirely or partially sequenced previously: WHI3, GCR2, SPX19, SPX18 and a heat shock gene similar to SSB1. The products of 8 other ORFs are new putative proteins among which N1394 is probably a membrane protein. N1346 contains a leucine zipper pattern and the corresponding ORF presents an HAP (global regulator of respiratory genes) upstream activating sequence in the promoting region. N1386 shares homologies with the DNA structure-specific recognition protein family SSRPs and the corresponding ORF is preceded by an MCB (MluI cell cycle box) upstream activating factor.

  13. Linear and Nonlinear Statistical Characterization of DNA

    NASA Astrophysics Data System (ADS)

    Norio Oiwa, Nestor; Goldman, Carla; Glazier, James

    2002-03-01

    We find spatial order in the distribution of protein-coding (including RNAs) and control segments of GenBank genomic sequences, irrespective of ATCG content. This is achieved by correlations, histograms, fractal dimensions and singularity spectra. Estimates of these quantities in complete nuclear genome indicate that coding sequences are long-range correlated and their disposition are self-similar (multifractal) for eukaryotes. These characteristics are absent in prokaryotes, where there are few noncoding sequences, suggesting the `junk' DNA play a relevant role to the genome structure and function. Concerning the genetic message of ATCG sequences, we build a random walk (Levy flight), using DNA symmetry arguments, where we associate A, T, C and G as left, right, down and up steps, respectively. Nonlinear analysis of mitochondrial DNA walks reveal multifractal pattern based on palindromic sequences, which fold in hairpins and loops.

  14. DNA Data Visualization (DDV): Software for Generating Web-Based Interfaces Supporting Navigation and Analysis of DNA Sequence Data of Entire Genomes.

    PubMed

    Neugebauer, Tomasz; Bordeleau, Eric; Burrus, Vincent; Brzezinski, Ryszard

    2015-01-01

    Data visualization methods are necessary during the exploration and analysis activities of an increasingly data-intensive scientific process. There are few existing visualization methods for raw nucleotide sequences of a whole genome or chromosome. Software for data visualization should allow the researchers to create accessible data visualization interfaces that can be exported and shared with others on the web. Herein, novel software developed for generating DNA data visualization interfaces is described. The software converts DNA data sets into images that are further processed as multi-scale images to be accessed through a web-based interface that supports zooming, panning and sequence fragment selection. Nucleotide composition frequencies and GC skew of a selected sequence segment can be obtained through the interface. The software was used to generate DNA data visualization of human and bacterial chromosomes. Examples of visually detectable features such as short and long direct repeats, long terminal repeats, mobile genetic elements, heterochromatic segments in microbial and human chromosomes, are presented. The software and its source code are available for download and further development. The visualization interfaces generated with the software allow for the immediate identification and observation of several types of sequence patterns in genomes of various sizes and origins. The visualization interfaces generated with the software are readily accessible through a web browser. This software is a useful research and teaching tool for genetics and structural genomics.

  15. The most conserved genome segments for life detection on Earth and other planets.

    PubMed

    Isenbarger, Thomas A; Carr, Christopher E; Johnson, Sarah Stewart; Finney, Michael; Church, George M; Gilbert, Walter; Zuber, Maria T; Ruvkun, Gary

    2008-12-01

    On Earth, very simple but powerful methods to detect and classify broad taxa of life by the polymerase chain reaction (PCR) are now standard practice. Using DNA primers corresponding to the 16S ribosomal RNA gene, one can survey a sample from any environment for its microbial inhabitants. Due to massive meteoritic exchange between Earth and Mars (as well as other planets), a reasonable case can be made for life on Mars or other planets to be related to life on Earth. In this case, the supremely sensitive technologies used to study life on Earth, including in extreme environments, can be applied to the search for life on other planets. Though the 16S gene has become the standard for life detection on Earth, no genome comparisons have established that the ribosomal genes are, in fact, the most conserved DNA segments across the kingdoms of life. We present here a computational comparison of full genomes from 13 diverse organisms from the Archaea, Bacteria, and Eucarya to identify genetic sequences conserved across the widest divisions of life. Our results identify the 16S and 23S ribosomal RNA genes as well as other universally conserved nucleotide sequences in genes encoding particular classes of transfer RNAs and within the nucleotide binding domains of ABC transporters as the most conserved DNA sequence segments across phylogeny. This set of sequences defines a core set of DNA regions that have changed the least over billions of years of evolution and provides a means to identify and classify divergent life, including ancestrally related life on other planets.

  16. Compositional segmentation and complexity measurement in stock indices

    NASA Astrophysics Data System (ADS)

    Wang, Haifeng; Shang, Pengjian; Xia, Jianan

    2016-01-01

    In this paper, we introduce a complexity measure based on the entropic segmentation called sequence compositional complexity (SCC) into the analysis of financial time series. SCC was first used to deal directly with the complex heterogeneity in nonstationary DNA sequences. We already know that SCC was found to be higher in sequences with long-range correlation than those with low long-range correlation, especially in the DNA sequences. Now, we introduce this method into financial index data, subsequently, we find that the values of SCC of some mature stock indices, such as S & P 500 (simplified with S & P in the following) and HSI, are likely to be lower than the SCC value of Chinese index data (such as SSE). What is more, we find that, if we classify the indices with the method of SCC, the financial market of Hong Kong has more similarities with mature foreign markets than Chinese ones. So we believe that a good correspondence is found between the SCC of the index sequence and the complexity of the market involved.

  17. Isolation and sequence characterization of DNA-A genome of a new begomovirus strain associated with severe leaf curling symptoms of Jatropha curcas L.

    PubMed

    Chauhan, Sushma; Rahman, Hifzur; Mastan, Shaik G; Pamidimarri, D V N Sudheer; Reddy, Muppala P

    2018-07-20

    Begomoviruses belong to the family Geminiviridae are associated with several disease symptoms, such as mosaic and leaf curling in Jatropha curcas. The molecular characterization of these viral strains will help in developing management strategies to control the disease. In this study, J. curcas that was infected with begomovirus and showed acute leaf curling symptoms were identified. DNA-A segment from pathogenic viral strain was isolated and sequenced. The sequenced genome was assembled and characterized in detail. The full-length DNA-A sequence was covered by primer walking. The genome sequence showed the general organization of DNA-A from begomovirus by the distribution of ORFs in both viral and anti-viral strands. The genome size ranged from 2844 bp-2852 bp. Three strains with minor nucleotide variations were identified, and a phylogenetic analysis was performed by comparing the DNA-A segments from other reported begomovirus isolates. The maximum sequence similarity was observed with Euphorbia yellow mosaic virus (FN435995). In the phylogenetic tree, no clustering was observed with previously reported begomovirus strains isolated from J. curcas host. The strains isolated in this study belong to new begomoviral strain that elicits symptoms of leaf curling in J. curcas. The results indicate that the probable origin of the strains is from Jatropha mosaic virus infecting J. gassypifolia. The strains isolated in this study are referred as Jatropha curcas leaf curl India virus (JCLCIV) based on the major symptoms exhibited by host J. curcas. Copyright © 2018 Elsevier B.V. All rights reserved.

  18. SeeGH--a software tool for visualization of whole genome array comparative genomic hybridization data.

    PubMed

    Chi, Bryan; DeLeeuw, Ronald J; Coe, Bradley P; MacAulay, Calum; Lam, Wan L

    2004-02-09

    Array comparative genomic hybridization (CGH) is a technique which detects copy number differences in DNA segments. Complete sequencing of the human genome and the development of an array representing a tiling set of tens of thousands of DNA segments spanning the entire human genome has made high resolution copy number analysis throughout the genome possible. Since array CGH provides signal ratio for each DNA segment, visualization would require the reassembly of individual data points into chromosome profiles. We have developed a visualization tool for displaying whole genome array CGH data in the context of chromosomal location. SeeGH is an application that translates spot signal ratio data from array CGH experiments to displays of high resolution chromosome profiles. Data is imported from a simple tab delimited text file obtained from standard microarray image analysis software. SeeGH processes the signal ratio data and graphically displays it in a conventional CGH karyotype diagram with the added features of magnification and DNA segment annotation. In this process, SeeGH imports the data into a database, calculates the average ratio and standard deviation for each replicate spot, and links them to chromosome regions for graphical display. Once the data is displayed, users have the option of hiding or flagging DNA segments based on user defined criteria, and retrieve annotation information such as clone name, NCBI sequence accession number, ratio, base pair position on the chromosome, and standard deviation. SeeGH represents a novel software tool used to view and analyze array CGH data. The software gives users the ability to view the data in an overall genomic view as well as magnify specific chromosomal regions facilitating the precise localization of genetic alterations. SeeGH is easily installed and runs on Microsoft Windows 2000 or later environments.

  19. Modular structural elements in the replication origin region of Tetrahymena rDNA.

    PubMed Central

    Du, C; Sanzgiri, R P; Shaiu, W L; Choi, J K; Hou, Z; Benbow, R M; Dobbs, D L

    1995-01-01

    Computer analyses of the DNA replication origin region in the amplified rRNA genes of Tetrahymena thermophila identified a potential initiation zone in the 5'NTS [Dobbs, Shaiu and Benbow (1994), Nucleic Acids Res. 22, 2479-2489]. This region consists of a putative DNA unwinding element (DUE) aligned with predicted bent DNA segments, nuclear matrix or scaffold associated region (MAR/SAR) consensus sequences, and other common modular sequence elements previously shown to be clustered in eukaryotic chromosomal origin regions. In this study, two mung bean nuclease-hypersensitive sites in super-coiled plasmid DNA were localized within the major DUE-like element predicted by thermodynamic analyses. Three restriction fragments of the 5'NTS region predicted to contain bent DNA segments exhibited anomalous migration characteristic of bent DNA during electrophoresis on polyacrylamide gels. Restriction fragments containing the 5'NTS region bound Tetrahymena nuclear matrices in an in vitro binding assay, consistent with an association of the replication origin region with the nuclear matrix in vivo. The direct demonstration in a protozoan origin region of elements previously identified in Drosophila, chick and mammalian origin regions suggests that clusters of modular structural elements may be a conserved feature of eukaryotic chromosomal origins of replication. Images PMID:7784181

  20. HapFABIA: Identification of very short segments of identity by descent characterized by rare variants in large sequencing data

    PubMed Central

    Hochreiter, Sepp

    2013-01-01

    Identity by descent (IBD) can be reliably detected for long shared DNA segments, which are found in related individuals. However, many studies contain cohorts of unrelated individuals that share only short IBD segments. New sequencing technologies facilitate identification of short IBD segments through rare variants, which convey more information on IBD than common variants. Current IBD detection methods, however, are not designed to use rare variants for the detection of short IBD segments. Short IBD segments reveal genetic structures at high resolution. Therefore, they can help to improve imputation and phasing, to increase genotyping accuracy for low-coverage sequencing and to increase the power of association studies. Since short IBD segments are further assumed to be old, they can shed light on the evolutionary history of humans. We propose HapFABIA, a computational method that applies biclustering to identify very short IBD segments characterized by rare variants. HapFABIA is designed to detect short IBD segments in genotype data that were obtained from next-generation sequencing, but can also be applied to DNA microarray data. Especially in next-generation sequencing data, HapFABIA exploits rare variants for IBD detection. HapFABIA significantly outperformed competing algorithms at detecting short IBD segments on artificial and simulated data with rare variants. HapFABIA identified 160 588 different short IBD segments characterized by rare variants with a median length of 23 kb (mean 24 kb) in data for chromosome 1 of the 1000 Genomes Project. These short IBD segments contain 752 000 single nucleotide variants (SNVs), which account for 39% of the rare variants and 23.5% of all variants. The vast majority—152 000 IBD segments—are shared by Africans, while only 19 000 and 11 000 are shared by Europeans and Asians, respectively. IBD segments that match the Denisova or the Neandertal genome are found significantly more often in Asians and Europeans but also, in some cases exclusively, in Africans. The lengths of IBD segments and their sharing between continental populations indicate that many short IBD segments from chromosome 1 existed before humans migrated out of Africa. Thus, rare variants that tag these short IBD segments predate human migration from Africa. The software package HapFABIA is available from Bioconductor. All data sets, result files and programs for data simulation, preprocessing and evaluation are supplied at http://www.bioinf.jku.at/research/short-IBD. PMID:24174545

  1. Mini-DNA barcode in identification of the ornamental fish: A case study from Northeast India.

    PubMed

    Dhar, Bishal; Ghosh, Sankar Kumar

    2017-09-05

    The ornamental fishes were exported under the trade names or generic names, thus creating problems in species identification. In this regard, DNA barcoding could effectively elucidate the actual species status. However, the problem arises if the specimen is having taxonomic disputes, falsified by trade/generic names, etc., On the other hand, barcoding the archival museum specimens would be of greater benefit to address such issues as it would create firm, error-free reference database for rapid identification of any species. This can be achieved only by generating short sequences as DNA from chemically preserved are mostly degraded. Here we aimed to identify a short stretch of informative sites within the full-length barcode segment, capable of delineating diverse group of ornamental fish species, commonly traded from NE India. We analyzed 287 full-length barcode sequences from the major fish orders and compared the interspecific K2P distance with nucleotide substitutions patterns and found a strong correlation of interspecies distance with transversions (0.95, p<0.001). We, therefore, proposed a short stretch of 171bp (transversion rich) segment as mini-barcode. The proposed segment was compared with the full-length barcodes and found to delineate the species effectively. Successful PCR amplification and sequencing of the 171bp segment using designed primers for different orders validated it as mini-barcodes for ornamental fishes. Thus, our findings would be helpful in strengthening the global database with the sequence of archived fish species as well as an effective identification tool of the traded ornamental fish species, as a less time consuming, cost effective field-based application. Copyright © 2017 Elsevier B.V. All rights reserved.

  2. DNABIT Compress – Genome compression algorithm

    PubMed Central

    Rajarajeswari, Pothuraju; Apparao, Allam

    2011-01-01

    Data compression is concerned with how information is organized in data. Efficient storage means removal of redundancy from the data being stored in the DNA molecule. Data compression algorithms remove redundancy and are used to understand biologically important molecules. We present a compression algorithm, “DNABIT Compress” for DNA sequences based on a novel algorithm of assigning binary bits for smaller segments of DNA bases to compress both repetitive and non repetitive DNA sequence. Our proposed algorithm achieves the best compression ratio for DNA sequences for larger genome. Significantly better compression results show that “DNABIT Compress” algorithm is the best among the remaining compression algorithms. While achieving the best compression ratios for DNA sequences (Genomes),our new DNABIT Compress algorithm significantly improves the running time of all previous DNA compression programs. Assigning binary bits (Unique BIT CODE) for (Exact Repeats, Reverse Repeats) fragments of DNA sequence is also a unique concept introduced in this algorithm for the first time in DNA compression. This proposed new algorithm could achieve the best compression ratio as much as 1.58 bits/bases where the existing best methods could not achieve a ratio less than 1.72 bits/bases. PMID:21383923

  3. A magnetic bead-based method for concentrating DNA from human urine for downstream detection.

    PubMed

    Bordelon, Hali; Russ, Patricia K; Wright, David W; Haselton, Frederick R

    2013-01-01

    Due to the presence of PCR inhibitors, PCR cannot be used directly on most clinical samples, including human urine, without pre-treatment. A magnetic bead-based strategy is one potential method to collect biomarkers from urine samples and separate the biomarkers from PCR inhibitors. In this report, a 1 mL urine sample was mixed within the bulb of a transfer pipette containing lyophilized nucleic acid-silica adsorption buffer and silica-coated magnetic beads. After mixing, the sample was transferred from the pipette bulb to a small diameter tube, and captured biomarkers were concentrated using magnetic entrainment of beads through pre-arrayed wash solutions separated by small air gaps. Feasibility was tested using synthetic segments of the 140 bp tuberculosis IS6110 DNA sequence spiked into pooled human urine samples. DNA recovery was evaluated by qPCR. Despite the presence of spiked DNA, no DNA was detectable in unextracted urine samples, presumably due to the presence of PCR inhibitors. However, following extraction with the magnetic bead-based method, we found that ∼50% of spiked TB DNA was recovered from human urine containing roughly 5×10(3) to 5×10(8) copies of IS6110 DNA. In addition, the DNA was concentrated approximately ten-fold into water. The final concentration of DNA in the eluate was 5×10(6), 14×10(6), and 8×10(6) copies/µL for 1, 3, and 5 mL urine samples, respectively. Lyophilized and freshly prepared reagents within the transfer pipette produced similar results, suggesting that long-term storage without refrigeration is possible. DNA recovery increased with the length of the spiked DNA segments from 10±0.9% for a 75 bp DNA sequence to 42±4% for a 100 bp segment and 58±9% for a 140 bp segment. The estimated LOD was 77 copies of DNA/µL of urine. The strategy presented here provides a simple means to achieve high nucleic acid recovery from easily obtained urine samples, which does not contain inhibitors of PCR.

  4. A Magnetic Bead-Based Method for Concentrating DNA from Human Urine for Downstream Detection

    PubMed Central

    Bordelon, Hali; Russ, Patricia K.; Wright, David W.; Haselton, Frederick R.

    2013-01-01

    Due to the presence of PCR inhibitors, PCR cannot be used directly on most clinical samples, including human urine, without pre-treatment. A magnetic bead-based strategy is one potential method to collect biomarkers from urine samples and separate the biomarkers from PCR inhibitors. In this report, a 1 mL urine sample was mixed within the bulb of a transfer pipette containing lyophilized nucleic acid-silica adsorption buffer and silica-coated magnetic beads. After mixing, the sample was transferred from the pipette bulb to a small diameter tube, and captured biomarkers were concentrated using magnetic entrainment of beads through pre-arrayed wash solutions separated by small air gaps. Feasibility was tested using synthetic segments of the 140 bp tuberculosis IS6110 DNA sequence spiked into pooled human urine samples. DNA recovery was evaluated by qPCR. Despite the presence of spiked DNA, no DNA was detectable in unextracted urine samples, presumably due to the presence of PCR inhibitors. However, following extraction with the magnetic bead-based method, we found that ∼50% of spiked TB DNA was recovered from human urine containing roughly 5×103 to 5×108 copies of IS6110 DNA. In addition, the DNA was concentrated approximately ten-fold into water. The final concentration of DNA in the eluate was 5×106, 14×106, and 8×106 copies/µL for 1, 3, and 5 mL urine samples, respectively. Lyophilized and freshly prepared reagents within the transfer pipette produced similar results, suggesting that long-term storage without refrigeration is possible. DNA recovery increased with the length of the spiked DNA segments from 10±0.9% for a 75 bp DNA sequence to 42±4% for a 100 bp segment and 58±9% for a 140 bp segment. The estimated LOD was 77 copies of DNA/µL of urine. The strategy presented here provides a simple means to achieve high nucleic acid recovery from easily obtained urine samples, which does not contain inhibitors of PCR. PMID:23861895

  5. [An intriguing model for 5S rDNA sequences dispersion in the genome of freshwater stingray Potamotrygon motoro (Chondrichthyes: Potamotrygonidae)].

    PubMed

    Cruz, V P; Oliveira, C; Foresti, F

    2015-01-01

    5S rDNA genes of the stingray Potamotrygon motoro were PCR replicated, purified, cloned and sequenced. Two distinct classes of segments of different sizes were obtained. The smallest, with 342 bp units, was classified as class I, and the largest, with 1900 bp units, was designated as class II. Alignment with the consensus sequences for both classes showed changes in a few bases in the 5S rDNA genes. TATA-like sequences were detected in the nontranscribed spacer (NTS) regions of class I and a microsatellite (GCT) 10 sequence was detected in the NTS region of class II. The results obtained can help to understand the molecular organization of ribosomal genes and the mechanism of gene dispersion.

  6. Capsicum annuum dehydrin, an osmotic-stress gene in hot pepper plants.

    PubMed

    Chung, Eunsook; Kim, Soo-Yong; Yi, So Young; Choi, Doil

    2003-06-30

    Osmotic stress-related genes were selected from an EST database constructed from 7 cDNA libraries from different tissues of the hot pepper. A full-length cDNA of Capsicum annuum dehydrin (Cadhn), a late embryogenesis abundant (lea) gene, was selected from the 5' single pass sequenced cDNA clones and sequenced. The deduced polypeptide has 87% identity with potato dehydrin C17, but very little identity with the dehydrin genes of other organisms. It contains a serine-tract (S-segment) and 3 conserved lysine-rich domains (K-segments). Southern blot analysis showed that 2 copies are present in the hot pepper genome. Cadhn was induced by osmotic stress in leaf tissues as well as by the application of abscisic acid. The RNA was most abundant in green fruit. The expression of several osmotic stress-related genes was examined and Cadhn proved to be the most abundantly expressed of these in response to osmotic stress.

  7. Amplification of the entire kanamycin biosynthetic gene cluster during empirical strain improvement of Streptomyces kanamyceticus.

    PubMed

    Yanai, Koji; Murakami, Takeshi; Bibb, Mervyn

    2006-06-20

    Streptomyces kanamyceticus 12-6 is a derivative of the wild-type strain developed for industrial kanamycin (Km) production. Southern analysis and DNA sequencing revealed amplification of a large genomic segment including the entire Km biosynthetic gene cluster in the chromosome of strain 12-6. At 145 kb, the amplifiable unit of DNA (AUD) is the largest AUD reported in Streptomyces. Striking repetitive DNA sequences belonging to the clustered regularly interspaced short palindromic repeats family were found in the AUD and may play a role in its amplification. Strain 12-6 contains a mixture of different chromosomes with varying numbers of AUDs, sometimes exceeding 36 copies and producing an amplified region >5.7 Mb. The level of Km production depended on the copy number of the Km biosynthetic gene cluster, suggesting that DNA amplification occurred during strain improvement as a consequence of selection for increased Km resistance. Amplification of DNA segments including entire antibiotic biosynthetic gene clusters might be a common mechanism leading to increased antibiotic production in industrial strains.

  8. Deletion endpoint allele-specificity in the developmentally regulated elimination of an internal sequence (IES) in Paramecium.

    PubMed Central

    Dubrana, K; Le Mouël, A; Amar, L

    1997-01-01

    Ciliated protozoa undergo thousands of site-specific DNA deletion events during the programmed development of micronuclear genomes to macronuclear genomes. Two deletion elements, W1 and W2, were identified in the Paramecium primaurelia wild-type 156 strain. Here, we report the characterization of both elements in wild-type strain 168 and show that they display variant deletion patterns when compared with those of strain 156. The W1 ( 168 ) element is defective for deletion. The W2 ( 168 ) element is excised utilizing two alternative boundaries on one side, both are different from the boundary utilized to excise the W2156 element. By crossing the 156 and 168 strains, we demonstrate that the definition of all deletion endpoints are each controlled by cis -acting determinant(s) rather than by strain-specific trans-acting factor(s). Sequence comparison of all deleted DNA segments indicates that the 5'-TA-3'terminal sequence is strictly required at their ends. Furthermore the identity of the first eight base pairs of these ends to a previously established consensus sequence correlates with the frequency of the corresponding deletion events. Our data implies the existence of an adaptive convergent evolution of these Paramecium deleted DNA segment end sequences. PMID:9171098

  9. Entropic Profiler – detection of conservation in genomes using information theory

    PubMed Central

    Fernandes, Francisco; Freitas, Ana T; Almeida, Jonas S; Vinga, Susana

    2009-01-01

    Background In the last decades, with the successive availability of whole genome sequences, many research efforts have been made to mathematically model DNA. Entropic Profiles (EP) were proposed recently as a new measure of continuous entropy of genome sequences. EP represent local information plots related to DNA randomness and are based on information theory and statistical concepts. They express the weighed relative abundance of motifs for each position in genomes. Their study is very relevant because under or over-representation segments are often associated with significant biological meaning. Findings The Entropic Profiler application here presented is a new tool designed to detect and extract under and over-represented DNA segments in genomes by using EP. It allows its computation in a very efficient way by recurring to improved algorithms and data structures, which include modified suffix trees. Available through a web interface and as downloadable source code, it allows to study positions and to search for motifs inside the whole sequence or within a specified range. DNA sequences can be entered from different sources, including FASTA files, pre-loaded examples or resuming a previously saved work. Besides the EP value plots, p-values and z-scores for each motif are also computed, along with the Chaos Game Representation of the sequence. Conclusion EP are directly related with the statistical significance of motifs and can be considered as a new method to extract and classify significant regions in genomes and estimate local scales in DNA. The present implementation establishes an efficient and useful tool for whole genome analysis. PMID:19416538

  10. The last Viking King: a royal maternity case solved by ancient DNA analysis.

    PubMed

    Dissing, Jørgen; Binladen, Jonas; Hansen, Anders; Sejrsen, Birgitte; Willerslev, Eske; Lynnerup, Niels

    2007-02-14

    The last of the Danish Viking Kings, Sven Estridsen, died in a.d. 1074 and is entombed in Roskilde Cathedral with other Danish kings and queens. Sven's mother, Estrid, is entombed in a pillar across the chancel. However, while there is no reasonable doubt about the identity of Sven, there have been doubts among historians whether the woman entombed was indeed Estrid. To shed light on this problem, we have extracted and analysed mitochondrial DNA (mtDNA) from pulp of teeth from each of the two royals. Four overlapping DNA-fragments covering about 400bp of hypervariable region 1 (HVR-1) of the D-loop were PCR amplified, cloned and a number of clones with each segment were sequenced. Also a segment containing the H/non-H specific nucleotide 7028 was sequenced. Consensus sequences were determined and D-loop results were replicated in an independent laboratory. This allowed the assignment of King Sven Estridsen to haplogroup H; Estrid's sequence differed from that of Sven at two positions in HVR-1, 16093T-->C and 16304T-->C, indicating that she belongs to subgroup H5a. Given the maternal inheritance of mtDNA, offspring will have the same mtDNA sequence as their mother with the exception of rare cases where the sequence has been altered by a germ line mutation. Therefore, the observation of two sequence differences makes it highly unlikely that the entombed woman was the mother of Sven. In addition, physical examination of the skeleton and the teeth strongly indicated that this woman was much younger (approximately 35 years) at the time of death than the 70 years history records tell. Although the entombed woman cannot be the Estrid, she may well be one of Sven's two daughters-in-law who were also called Estrid and who both became queens.

  11. Pea chloroplast DNA encodes homologues of Escherichia coli ribosomal subunit S2 and the beta'-subunit of RNA polymerase.

    PubMed Central

    Cozens, A L; Walker, J E

    1986-01-01

    The nucleotide sequence has been determined of a segment of 4680 bases of the pea chloroplast genome. It adjoins a sequence described elsewhere that encodes subunits of the F0 membrane domain of the ATP-synthase complex. The sequence contains a potential gene encoding a protein which is strongly related to the S2 polypeptide of Escherichia coli ribosomes. It also encodes an incomplete protein which contains segments that are homologous to the beta'-subunit of E. coli RNA polymerase and to yeast RNA polymerases II and III. PMID:3530249

  12. When Maxwellian demon meets action at a distance. Comment on "Disentangling DNA molecules" by Alexander Vologodskii

    NASA Astrophysics Data System (ADS)

    Rybenkov, Valentin V.

    2016-09-01

    The ability of living systems to defy thermodynamics without explicitly violating it is a continued source of inspiration to many biophysicists. The story of type-2 DNA topoisomerases is a beautiful example from that book. DNA topoisomerases catalyze a concerted DNA cleavage-religation reaction, which is interjected by a strand passage event. This sequence of events results in a seemingly unhindered transfer of one piece of DNA through another upon their random collision. An obvious consequence of such transfer is a change in the topological state of the colliding DNAs; hence the name of the enzymes, topoisomerases. There are several classes of topoisomerases, which differ in how they capture the cleaved and transported DNA segments (which are often referred to as the gate and transfer segments; or the G- and T-segments, to be short). Type-2 topoisomerases have two cleavage-religation centers. They open a gate in double stranded DNA and transfer another piece of double stranded DNA through it [1]. And in doing so, they manage to collect information about the rest of the DNA and perform strand passage in a directional manner so as to take the molecule away from the thermodynamic equilibrium [2].

  13. Interactions between the promoter and first intron are involved in transcriptional control of alpha 1(I) collagen gene expression.

    PubMed Central

    Bornstein, P; McKay, J; Liska, D J; Apone, S; Devarayalu, S

    1988-01-01

    The first intron of the human collagen alpha 1(I) gene contains several positively and negatively acting elements. We have studied the transcription of collagen-human growth hormone fusion genes, containing deletions and rearrangements of collagen intronic sequences, by transient transfection of chick tendon fibroblasts and NIH 3T3 cells. In chick tendon fibroblasts, but not in 3T3 cells, inversion of intronic sequences containing a previously studied 274-base-pair segment, A274, resulted in markedly reduced human growth hormone mRNA levels as determined by an RNase protection assay. This inhibitory effect was largely alleviated when deletions were introduced in the collagen promoter of plasmids containing negatively oriented intronic sequences. Evidence for interaction of the promoter with the intronic segment, A274, was obtained by gel mobility shift assays. We suggest that promoter-intron interactions, mediated by DNA-binding proteins, regulate collagen gene transcription. Inversion of intronic segments containing critical interactive elements might then lead to an altered geometry and reduced activity of a transcriptional complex in those cells with sufficiently high levels of appropriate transcription factors. We further suggest that the deleted promoter segment plays a key role in directing DNA interactions involved in transcriptional control. Images PMID:3211130

  14. Structure of the Mecl Repressor from Staphylococcus aureus in Complex with the Cognate DNA Operator of mec

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Safo,M.; Ko, T.; Musayev, F.

    The dimeric repressor MecI regulates the mecA gene that encodes the penicillin-binding protein PBP-2a in methicillin-resistant Staphylococcus aureus (MRSA). MecI is similar to BlaI, the repressor for the blaZ gene of {beta}-lactamase. MecI and BlaI can bind to both operator DNA sequences. The crystal structure of MecI in complex with the 32 base-pair cognate DNA of mec was determined to 3.8 Angstroms resolution. MecI is a homodimer and each monomer consists of a compact N-terminal winged-helix domain, which binds to DNA, and a loosely packed C-terminal helical domain, which intertwines with its counter-monomer. The crystal contains horizontal layers of virtualmore » DNA double helices extending in three directions, which are separated by perpendicular DNA segments. Each DNA segment is bound to two MecI dimers. Similar to the BlaI-mec complex, but unlike the MecI-bla complex, the MecI repressors bind to both sides of the mec DNA dyad that contains four conserved sequences of TACA/TGTA. The results confirm the up-and-down binding to the mec operator, which may account for cooperative effect of the repressor.« less

  15. The implication of DNA bending energy for nucleosome positioning and sliding.

    PubMed

    Liu, Guoqing; Xing, Yongqiang; Zhao, Hongyu; Cai, Lu; Wang, Jianying

    2018-06-11

    Nucleosome not only directly affects cellular processes, such as DNA replication, recombination, and transcription, but also severs as a fundamentally important target of epigenetic modifications. Our previous study indicated that the bending property of DNA is important in nucleosome formation, particularly in predicting the dyad positions of nucleosomes on a DNA segment. Here, we investigated the role of bending energy in nucleosome positioning and sliding in depth to decipher sequence-directed mechanism. The results show that bending energy is a good physical index to predict the free energy in the process of nucleosome reconstitution in vitro. Our data also imply that there are at least 20% of the nucleosomes in budding yeast do not adopt canonical positioning, in which underlying sequences wrapped around histones are structurally symmetric. We also revealed distinct patterns of bending energy profile for distinctly organized chromatin structures, such as well-positioned nucleosomes, fuzzy nucleosomes, and linker regions and discussed nucleosome sliding in terms of bending energy. We proposed that the stability of a nucleosome is positively correlated with the strength of the bending anisotropy of DNA segment, and both accessibility and directionality of nucleosome sliding is likely to be modulated by diverse patterns of DNA bending energy profile.

  16. Intragenomic sequence variation at the ITS1 - ITS2 region and at the 18S and 28S nuclear ribosomal DNA genes of the New Zealand mud snail, Potamopyrgus antipodarum (Hydrobiidae: mollusca)

    USGS Publications Warehouse

    Hoy, Marshal S.; Rodriguez, Rusty J.

    2013-01-01

    Molecular genetic analysis was conducted on two populations of the invasive non-native New Zealand mud snail (Potamopyrgus antipodarum), one from a freshwater ecosystem in Devil's Lake (Oregon, USA) and the other from an ecosystem of higher salinity in the Columbia River estuary (Hammond Harbor, Oregon, USA). To elucidate potential genetic differences between the two populations, three segments of nuclear ribosomal DNA (rDNA), the ITS1-ITS2 regions and the 18S and 28S rDNA genes were cloned and sequenced. Variant sequences within each individual were found in all three rDNA segments. Folding models were utilized for secondary structure analysis and results indicated that there were many sequences which contained structure-altering polymorphisms, which suggests they could be nonfunctional pseudogenes. In addition, analysis of molecular variance (AMOVA) was used for hierarchical analysis of genetic variance to estimate variation within and among populations and within individuals. AMOVA revealed significant variation in the ITS region between the populations and among clones within individuals, while in the 5.8S rDNA significant variation was revealed among individuals within the two populations. High levels of intragenomic variation were found in the ITS regions, which are known to be highly variable in many organisms. More interestingly, intragenomic variation was also found in the 18S and 28S rDNA, which has rarely been observed in animals and is so far unreported in Mollusca. We postulate that in these P. antipodarum populations the effects of concerted evolution are diminished due to the fact that not all of the rDNA genes in their polyploid genome should be essential for sustaining cellular function. This could lead to a lessening of selection pressures, allowing mutations to accumulate in some copies, changing them into variant sequences.                   

  17. Sequence-dependent modelling of local DNA bending phenomena: curvature prediction and vibrational analysis.

    PubMed

    Vlahovicek, K; Munteanu, M G; Pongor, S

    1999-01-01

    Bending is a local conformational micropolymorphism of DNA in which the original B-DNA structure is only distorted but not extensively modified. Bending can be predicted by simple static geometry models as well as by a recently developed elastic model that incorporate sequence dependent anisotropic bendability (SDAB). The SDAB model qualitatively explains phenomena including affinity of protein binding, kinking, as well as sequence-dependent vibrational properties of DNA. The vibrational properties of DNA segments can be studied by finite element analysis of a model subjected to an initial bending moment. The frequency spectrum is obtained by applying Fourier analysis to the displacement values in the time domain. This analysis shows that the spectrum of the bending vibrations quite sensitively depends on the sequence, for example the spectrum of a curved sequence is characteristically different from the spectrum of straight sequence motifs of identical basepair composition. Curvature distributions are genome-specific, and pronounced differences are found between protein-coding and regulatory regions, respectively, that is, sites of extreme curvature and/or bendability are less frequent in protein-coding regions. A WWW server is set up for the prediction of curvature and generation of 3D models from DNA sequences (http:@www.icgeb.trieste.it/dna).

  18. A computer aided thermodynamic approach for predicting the formation of Z-DNA in naturally occurring sequences

    NASA Technical Reports Server (NTRS)

    Ho, P. S.; Ellison, M. J.; Quigley, G. J.; Rich, A.

    1986-01-01

    The ease with which a particular DNA segment adopts the left-handed Z-conformation depends largely on the sequence and on the degree of negative supercoiling to which it is subjected. We describe a computer program (Z-hunt) that is designed to search long sequences of naturally occurring DNA and retrieve those nucleotide combinations of up to 24 bp in length which show a strong propensity for Z-DNA formation. Incorporated into Z-hunt is a statistical mechanical model based on empirically determined energetic parameters for the B to Z transition accumulated to date. The Z-forming potential of a sequence is assessed by ranking its behavior as a function of negative superhelicity relative to the behavior of similar sized randomly generated nucleotide sequences assembled from over 80,000 combinations. The program makes it possible to compare directly the Z-forming potential of sequences with different base compositions and different sequence lengths. Using Z-hunt, we have analyzed the DNA sequences of the bacteriophage phi X174, plasmid pBR322, the animal virus SV40 and the replicative form of the eukaryotic adenovirus-2. The results are compared with those previously obtained by others from experiments designed to locate Z-DNA forming regions in these sequences using probes which show specificity for the left-handed DNA conformation.

  19. An overview on genome organization of marine organisms.

    PubMed

    Costantini, Maria

    2015-12-01

    In this review we will concentrate on some general genome features of marine organisms and their evolution, ranging from vertebrate to invertebrates until unicellular organisms. Before genome sequencing, the ultracentrifugation in CsCl led to high resolution of mammalian DNA (without seeing at the sequence). The analytical profile of human DNA showed that the vertebrate genome is a mosaic of isochores, typically megabase-size DNA segments that belong in a small number of families characterized by different GC levels. The recent availability of a number of fully sequenced genomes allowed mapping very precisely the isochores, based on DNA sequences. Since isochores are tightly linked to biological properties such as gene density, replication timing and recombination, the new level of detail provided by the isochore map helped the understanding of genome structure, function and evolution. This led the current level of knowledge and to further insights. Copyright © 2015. Published by Elsevier B.V.

  20. Isolation of a sex-linked DNA sequence in cranes.

    PubMed

    Duan, W; Fuerst, P A

    2001-01-01

    A female-specific DNA fragment (CSL-W; crane sex-linked DNA on W chromosome) was cloned from female whooping cranes (Grus americana). From the nucleotide sequence of CSL-W, a set of polymerase chain reaction (PCR) primers was identified which amplify a 227-230 bp female-specific fragment from all existing crane species and some other noncrane species. A duplicated versions of the DNA segment, which is found to have a larger size (231-235 bp) than CSL-W in both sexes, was also identified, and was designated CSL-NW (crane sex-linked DNA on non-W chromosome). The nucleotide similarity between the sequences of CSL-W and CSL-NW from whooping cranes was 86.3%. The CSL primers do not amplify any sequence from mammalian DNA, limiting the potential for contamination from human sources. Using the CSL primers in combination with a quick DNA extraction method allows the noninvasive identification of crane gender in less than 10 h. A test of the methodology was carried out on fully developed body feathers from 18 captive cranes and resulted in 100% successful identification.

  1. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Serwer, Philip, E-mail: serwer@uthscsa.edu; Wright, Elena T.; Liu, Zheng

    DNA packaging of phages phi29, T3 and T7 sometimes produces incompletely packaged DNA with quantized lengths, based on gel electrophoretic band formation. We discover here a packaging ATPase-free, in vitro model for packaged DNA length quantization. We use directed evolution to isolate a five-site T3 point mutant that hyper-produces tail-free capsids with mature DNA (heads). Three tail gene mutations, but no head gene mutations, are present. A variable-length DNA segment leaks from some mutant heads, based on DNase I-protection assay and electron microscopy. The protected DNA segment has quantized lengths, based on restriction endonuclease analysis: six sharp bands of DNAmore » missing 3.7–12.3% of the last end packaged. Native gel electrophoresis confirms quantized DNA expulsion and, after removal of external DNA, provides evidence that capsid radius is the quantization-ruler. Capsid-based DNA length quantization possibly evolved via selection for stalling that provides time for feedback control during DNA packaging and injection. - Graphical abstract: Highlights: • We implement directed evolution- and DNA-sequencing-based phage assembly genetics. • We purify stable, mutant phage heads with a partially leaked mature DNA molecule. • Native gels and DNase-protection show leaked DNA segments to have quantized lengths. • Native gels after DNase I-removal of leaked DNA reveal the capsids to vary in radius. • Thus, we hypothesize leaked DNA quantization via variably quantized capsid radius.« less

  2. Factors influencing the specific interaction of Neisseria gonorrhoeae with transforming DNA.

    PubMed Central

    Goodman, S D; Scocca, J J

    1991-01-01

    The specific interaction of transformable Neisseria gonorrhoeae with DNA depends on the recognition of specific 10-residue target sequences. The relative affinity for DNA between 3 and 17 kb in size appears to be linearly related to the frequency of targets on the segment and is unaffected by absolute size. The average frequency of targets in chromosomal DNA of N. gonorrhoeae appears to be approximately one per 1,000 bp. PMID:1909325

  3. Structure of the MecI repressor from Staphylococcus aureus in complex with the cognate DNA operator of mec

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Safo, Martin K., E-mail: msafo@vcu.edu; Ko, Tzu-Ping; Musayev, Faik N.

    The up-and-down binding of dimeric MecI to mecA dyad DNA may account for the cooperative effect of the repressor. The dimeric repressor MecI regulates the mecA gene that encodes the penicillin-binding protein PBP-2a in methicillin-resistant Staphylococcus aureus (MRSA). MecI is similar to BlaI, the repressor for the blaZ gene of β-lactamase. MecI and BlaI can bind to both operator DNA sequences. The crystal structure of MecI in complex with the 32 base-pair cognate DNA of mec was determined to 3.8 Å resolution. MecI is a homodimer and each monomer consists of a compact N-terminal winged-helix domain, which binds to DNA,more » and a loosely packed C-terminal helical domain, which intertwines with its counter-monomer. The crystal contains horizontal layers of virtual DNA double helices extending in three directions, which are separated by perpendicular DNA segments. Each DNA segment is bound to two MecI dimers. Similar to the BlaI–mec complex, but unlike the MecI–bla complex, the MecI repressors bind to both sides of the mec DNA dyad that contains four conserved sequences of TACA/TGTA. The results confirm the up-and-down binding to the mec operator, which may account for cooperative effect of the repressor.« less

  4. Application of a time-dependent coalescence process for inferring the history of population size changes from DNA sequence data.

    PubMed

    Polanski, A; Kimmel, M; Chakraborty, R

    1998-05-12

    Distribution of pairwise differences of nucleotides from data on a sample of DNA sequences from a given segment of the genome has been used in the past to draw inferences about the past history of population size changes. However, all earlier methods assume a given model of population size changes (such as sudden expansion), parameters of which (e.g., time and amplitude of expansion) are fitted to the observed distributions of nucleotide differences among pairwise comparisons of all DNA sequences in the sample. Our theory indicates that for any time-dependent population size, N(tau) (in which time tau is counted backward from present), a time-dependent coalescence process yields the distribution, p(tau), of the time of coalescence between two DNA sequences randomly drawn from the population. Prediction of p(tau) and N(tau) requires the use of a reverse Laplace transform known to be unstable. Nevertheless, simulated data obtained from three models of monotone population change (stepwise, exponential, and logistic) indicate that the pattern of a past population size change leaves its signature on the pattern of DNA polymorphism. Application of the theory to the published mtDNA sequences indicates that the current mtDNA sequence variation is not inconsistent with a logistic growth of the human population.

  5. Synthetic transcripts of double-stranded Birnavirus genome are infectious.

    PubMed Central

    Mundt, E; Vakharia, V N

    1996-01-01

    We have developed a system for generation of infectious bursal disease virus (IBDV), a segmented double-stranded RNA virus of the Birnaviridae family, with the use of synthetic transcripts derived from cloned cDNA. Independent full-length cDNA clones were constructed that contained the entire coding and noncoding regions of RNA segments A and B of two distinguishable IBDV strains of serotype I. Segment A encodes all of the structural (VP2, VP4, and VP3) and nonstructural (VP5) proteins, whereas segment B encodes the RNA-dependent RNA polymerase (VP1). Synthetic RNAs of both segments were produced by in vitro transcription of linearized plasmids with T7 RNA polymerase. Transfection of Vero cells with combined plus-sense transcripts of both segments generated infectious virus as early as 36 hr after transfection. The infectivity and specificity of the recovered chimeric virus was ascertained by the appearance of cytopathic effect in chicken embryo cells, by immunofluorescence staining of infected Vero cells with rabbit anti-IBDV serum, and by nucleotide sequence analysis of the recovered virus, respectively. In addition, transfectant viruses containing genetically tagged sequences in either segment A or segment B of IBDV were generated to confirm the feasibility of this system. The development of a reverse genetics system for double-stranded RNA viruses will greatly facilitate studies of the regulation of viral gene expression, pathogenesis, and design of a new generation of live vaccines. Images Fig. 2 Fig. 3 Fig. 4 PMID:8855321

  6. Phylogenetic Network for European mtDNA

    PubMed Central

    Finnilä, Saara; Lehtonen, Mervi S.; Majamaa, Kari

    2001-01-01

    The sequence in the first hypervariable segment (HVS-I) of the control region has been used as a source of evolutionary information in most phylogenetic analyses of mtDNA. Population genetic inference would benefit from a better understanding of the variation in the mtDNA coding region, but, thus far, complete mtDNA sequences have been rare. We determined the nucleotide sequence in the coding region of mtDNA from 121 Finns, by conformation-sensitive gel electrophoresis and subsequent sequencing and by direct sequencing of the D loop. Furthermore, 71 sequences from our previous reports were included, so that the samples represented all the mtDNA haplogroups present in the Finnish population. We found a total of 297 variable sites in the coding region, which allowed the compilation of unambiguous phylogenetic networks. The D loop harbored 104 variable sites, and, in most cases, these could be localized within the coding-region networks, without discrepancies. Interestingly, many homoplasies were detected in the coding region. Nucleotide variation in the rRNA and tRNA genes was 6%, and that in the third nucleotide positions of structural genes amounted to 22% of that in the HVS-I. The complete networks enabled the relationships between the mtDNA haplogroups to be analyzed. Phylogenetic networks based on the entire coding-region sequence in mtDNA provide a rich source for further population genetic studies, and complete sequences make it easier to differentiate between disease-causing mutations and rare polymorphisms. PMID:11349229

  7. Sequence analysis of cultivated strawberry (Fragaria × ananassa Duch.) using microdissected single somatic chromosomes.

    PubMed

    Yanagi, Tomohiro; Shirasawa, Kenta; Terachi, Mayuko; Isobe, Sachiko

    2017-01-01

    Cultivated strawberry ( Fragaria  ×  ananassa Duch.) has homoeologous chromosomes because of allo-octoploidy. For example, two homoeologous chromosomes that belong to different sub-genome of allopolyploids have similar base sequences. Thus, when conducting de novo assembly of DNA sequences, it is difficult to determine whether these sequences are derived from the same chromosome. To avoid the difficulties associated with homoeologous chromosomes and demonstrate the possibility of sequencing allopolyploids using single chromosomes, we conducted sequence analysis using microdissected single somatic chromosomes of cultivated strawberry. Three hundred and ten somatic chromosomes of the Japanese octoploid strawberry 'Reiko' were individually selected under a light microscope using a microdissection system. DNA from 288 of the dissected chromosomes was successfully amplified using a DNA amplification kit. Using next-generation sequencing, we decoded the base sequences of the amplified DNA segments, and on the basis of mapping, we identified DNA sequences from 144 samples that were best matched to the reference genomes of the octoploid strawberry, F.  ×  ananassa , and the diploid strawberry, F. vesca . The 144 samples were classified into seven pseudo-molecules of F. vesca . The coverage rates of the DNA sequences from the single chromosome onto all pseudo-molecular sequences varied from 3 to 29.9%. We demonstrated an efficient method for sequence analysis of allopolyploid plants using microdissected single chromosomes. On the basis of our results, we believe that whole-genome analysis of allopolyploid plants can be enhanced using methodology that employs microdissected single chromosomes.

  8. Targeted isolation, sequence assembly and characterization of two white spruce (Picea glauca) BAC clones for terpenoid synthase and cytochrome P450 genes involved in conifer defence reveal insights into a conifer genome

    PubMed Central

    2009-01-01

    Background Conifers are a large group of gymnosperm trees which are separated from the angiosperms by more than 300 million years of independent evolution. Conifer genomes are extremely large and contain considerable amounts of repetitive DNA. Currently, conifer sequence resources exist predominantly as expressed sequence tags (ESTs) and full-length (FL)cDNAs. There is no genome sequence available for a conifer or any other gymnosperm. Conifer defence-related genes often group into large families with closely related members. The goals of this study are to assess the feasibility of targeted isolation and sequence assembly of conifer BAC clones containing specific genes from two large gene families, and to characterize large segments of genomic DNA sequence for the first time from a conifer. Results We used a PCR-based approach to identify BAC clones for two target genes, a terpene synthase (3-carene synthase; 3CAR) and a cytochrome P450 (CYP720B4) from a non-arrayed genomic BAC library of white spruce (Picea glauca). Shotgun genomic fragments isolated from the BAC clones were sequenced to a depth of 15.6- and 16.0-fold coverage, respectively. Assembly and manual curation yielded sequence scaffolds of 172 kbp (3CAR) and 94 kbp (CYP720B4) long. Inspection of the genomic sequences revealed the intron-exon structures, the putative promoter regions and putative cis-regulatory elements of these genes. Sequences related to transposable elements (TEs), high complexity repeats and simple repeats were prevalent and comprised approximately 40% of the sequenced genomic DNA. An in silico simulation of the effect of sequencing depth on the quality of the sequence assembly provides direction for future efforts of conifer genome sequencing. Conclusion We report the first targeted cloning, sequencing, assembly, and annotation of large segments of genomic DNA from a conifer. We demonstrate that genomic BAC clones for individual members of multi-member gene families can be isolated in a gene-specific fashion. The results of the present work provide important new information about the structure and content of conifer genomic DNA that will guide future efforts to sequence and assemble conifer genomes. PMID:19656416

  9. Targeted isolation, sequence assembly and characterization of two white spruce (Picea glauca) BAC clones for terpenoid synthase and cytochrome P450 genes involved in conifer defence reveal insights into a conifer genome.

    PubMed

    Hamberger, Björn; Hall, Dawn; Yuen, Mack; Oddy, Claire; Hamberger, Britta; Keeling, Christopher I; Ritland, Carol; Ritland, Kermit; Bohlmann, Jörg

    2009-08-06

    Conifers are a large group of gymnosperm trees which are separated from the angiosperms by more than 300 million years of independent evolution. Conifer genomes are extremely large and contain considerable amounts of repetitive DNA. Currently, conifer sequence resources exist predominantly as expressed sequence tags (ESTs) and full-length (FL)cDNAs. There is no genome sequence available for a conifer or any other gymnosperm. Conifer defence-related genes often group into large families with closely related members. The goals of this study are to assess the feasibility of targeted isolation and sequence assembly of conifer BAC clones containing specific genes from two large gene families, and to characterize large segments of genomic DNA sequence for the first time from a conifer. We used a PCR-based approach to identify BAC clones for two target genes, a terpene synthase (3-carene synthase; 3CAR) and a cytochrome P450 (CYP720B4) from a non-arrayed genomic BAC library of white spruce (Picea glauca). Shotgun genomic fragments isolated from the BAC clones were sequenced to a depth of 15.6- and 16.0-fold coverage, respectively. Assembly and manual curation yielded sequence scaffolds of 172 kbp (3CAR) and 94 kbp (CYP720B4) long. Inspection of the genomic sequences revealed the intron-exon structures, the putative promoter regions and putative cis-regulatory elements of these genes. Sequences related to transposable elements (TEs), high complexity repeats and simple repeats were prevalent and comprised approximately 40% of the sequenced genomic DNA. An in silico simulation of the effect of sequencing depth on the quality of the sequence assembly provides direction for future efforts of conifer genome sequencing. We report the first targeted cloning, sequencing, assembly, and annotation of large segments of genomic DNA from a conifer. We demonstrate that genomic BAC clones for individual members of multi-member gene families can be isolated in a gene-specific fashion. The results of the present work provide important new information about the structure and content of conifer genomic DNA that will guide future efforts to sequence and assemble conifer genomes.

  10. Complex structure of knob DNA on maize chromosome 9. Retrotransposon invasion into heterochromatin.

    PubMed Central

    Ananiev, E V; Phillips, R L; Rines, H W

    1998-01-01

    The recovery of maize (Zea mays L.) chromosome addition lines of oat (Avena sativa L.) from oat x maize crosses enables us to analyze the structure and composition of specific regions, such as knobs, of individual maize chromosomes. A DNA hybridization blot panel of eight individual maize chromosome addition lines revealed that 180-bp repeats found in knobs are present in each of these maize chromosomes, but the copy number varies from approximately 100 to 25, 000. Cosmid clones with knob DNA segments were isolated from a genomic library of an oat-maize chromosome 9 addition line with the help of the 180-bp knob-associated repeated DNA sequence used as a probe. Cloned knob DNA segments revealed a complex organization in which blocks of tandemly arranged 180-bp repeating units are interrupted by insertions of other repeated DNA sequences, mostly represented by individual full size copies of retrotransposable elements. There is an obvious preference for the integration of retrotransposable elements into certain sites (hot spots) of the 180-bp repeat. Sequence microheterogeneity including point mutations and duplications was found in copies of 180-bp repeats. The 180-bp repeats within an array all had the same polarity. Restriction maps constructed for 23 cloned knob DNA fragments revealed the positions of polymorphic sites and sites of integration of insertion elements. Discovery of the interspersion of retrotransposable elements among blocks of tandem repeats in maize and some other organisms suggests that this pattern may be basic to heterochromatin organization for eukaryotes. PMID:9691055

  11. The organization of repeating units in mitochondrial DNA from yeast petite mutants.

    PubMed

    Bos, J L; Heyting, C; Van der Horst, G; Borst, P

    1980-04-01

    We have reinvestigated the linkage orientation of repeating units in mtDNAs of yeast ρ(-) petite mutants containing an inverted duplication. All five petite mtDNAs studied contain a continuous segment of wild-type mtDNA, part of which is duplicated and present in inverted form in the repeat. We show by restriction enzyme analysis that the non-duplicated segments between the inverted duplications are present in random orientation in all five petite mtDNAs. There is no segregation of sub-types with unique orientation. We attribute this to the high rate of intramolecular recombination between the inverted duplications. The results provide additional evidence for the high rate of recombination of yeast mtDNA even in haploid ρ(-) petite cells.We conclude that only two types of stable sequence organization exist in petite mtDNA: petites without an inverted duplication have repeats linked in straight head-to-tail arrangement (abcabc); petites with an inverted duplication have repeats in which the non-duplicated segments are present in random orientation.

  12. Organizational differences between cytoplasmic male sterile and male fertile Brassica mitochondrial genomes are confined to a single transposed locus.

    PubMed Central

    L'Homme, Y; Brown, G G

    1993-01-01

    Comparison of the physical maps of male fertile (cam) and male sterile (pol) mitochondrial genomes of Brassica napus indicates that structural differences between the two mtDNAs are confined to a region immediately upstream of the atp6 gene. Relative to cam mtDNA, pol mtDNA possesses a 4.5 kb segment at this locus that includes a chimeric gene that is cotranscribed with atp6 and lacks an approximately 1kb region located upstream of the cam atp6 gene. The 4.5 kb pol segment is present and similarly organized in the mitochondrial genome of the common nap B.napus cytoplasm; however, the nap and pol DNA regions flanking this segment are different and the nap sequences are not expressed. The 4.5 kb CMS-associated pol segment has thus apparently undergone transposition during the evolution of the nap and pol cytoplasms and has been lost in the cam genome subsequent to the pol-cam divergence. This 4.5 kb segment comprises the single DNA region that is expressed differently in fertile, pol CMS and fertility restored pol cytoplasm plants. The finding that this locus is part of the single mtDNA region organized differently in the fertile and male sterile mitochondrial genomes provides strong support for the view that it specifies the pol CMS trait. Images PMID:8388101

  13. Phylogenetic analysis of mtDNA lineages in South American mummies.

    PubMed

    Monsalve, M V; Cardenas, F; Guhl, F; Delaney, A D; Devine, D V

    1996-07-01

    Some studies of mtDNA propose that contemporary Amerindians have descended from four haplotype groups, each defined by specific sets of polymorphisms. One recent study also found evidence of other potential founder haplotypes. We wanted to determine whether the four haplotypes in modern populations were also present in ancient South American aboriginals. We subjected mtDNA from Colombian mummies (470 to 1849 AD) to PCR amplification and restriction endonuclease analysis. The mtDNA D-loop region was surveyed for sequence variation by restriction analysis and a segment of this region was sequenced for each mummy to characterize the haplotypes. Our mummies exhibited three of the four major characteristic haplotypes of Amerindian populations defined by four markers. With sequence data obtained in the ancient samples and published data on contemporary Amerindians it was possible to infer the origin of these six mummies.

  14. A theory that may explain the Hayflick limit--a means to delete one copy of a repeating sequence during each cell cycle in certain human cells such as fibroblasts.

    PubMed

    Naveilhan, P; Baudet, C; Jabbour, W; Wion, D

    1994-09-01

    A model that may explain the limited division potential of certain cells such as human fibroblasts in culture is presented. The central postulate of this theory is that there exists, prior to certain key exons that code for materials needed for cell division, a unique sequence of specific repeating segments of DNA. One copy of such repeating segments is deleted during each cell cycle in cells that are not protected from such deletion through methylation of their cytosine residues. According to this theory, the means through which such repeated sequences are removed, one per cycle, is through the sequential action of enzymes that act much as bacterial restriction enzymes do--namely to produce scissions in both strands of DNA in areas that correspond to the DNA base sequence recognition specificities of such enzymes. After the first scission early in a replicative cycle, that enzyme becomes inhibited, but the cleavage of the first site exposes the closest site in the repetitive element to the action of a second restriction enzyme after which that enzyme also becomes inhibited. Then repair occurs, regenerating the original first site. Through this sequential activation and inhibition of two different restriction enzymes, only one copy of the repeating sequence is deleted during each cell cycle. In effect, the repeating sequence operates as a precise counter of the numbers of cell doubling that have occurred since the cells involved differentiated during development.

  15. Oligo Design: a computer program for development of probes for oligonucleotide microarrays.

    PubMed

    Herold, Keith E; Rasooly, Avraham

    2003-12-01

    Oligonucleotide microarrays have demonstrated potential for the analysis of gene expression, genotyping, and mutational analysis. Our work focuses primarily on the detection and identification of bacteria based on known short sequences of DNA. Oligo Design, the software described here, automates several design aspects that enable the improved selection of oligonucleotides for use with microarrays for these applications. Two major features of the program are: (i) a tiling algorithm for the design of short overlapping temperature-matched oligonucleotides of variable length, which are useful for the analysis of single nucleotide polymorphisms and (ii) a set of tools for the analysis of multiple alignments of gene families and related short DNA sequences, which allow for the identification of conserved DNA sequences for PCR primer selection and variable DNA sequences for the selection of unique probes for identification. Note that the program does not address the full genome perspective but, instead, is focused on the genetic analysis of short segments of DNA. The program is Internet-enabled and includes a built-in browser and the automated ability to download sequences from GenBank by specifying the GI number. The program also includes several utilities, including audio recital of a DNA sequence (useful for verifying sequences against a written document), a random sequence generator that provides insight into the relationship between melting temperature and GC content, and a PCR calculator.

  16. Sequence and Analysis of the Tomato JOINTLESS Locus1

    PubMed Central

    Mao, Long; Begum, Dilara; Goff, Stephen A.; Wing, Rod A.

    2001-01-01

    A 119-kb bacterial artificial chromosome from the JOINTLESS locus on the tomato (Lycopersicon esculentum) chromosome 11 contained 15 putative genes. Repetitive sequences in this region include one copia-like LTR retrotransposon, 13 simple sequence repeats, three copies of a novel type III foldback transposon, and four putative short DNA repeats. Database searches showed that the foldback transposon and the short DNA repeats seemed to be associated preferably with genes. The predicted tomato genes were compared with the complete Arabidopsis genome. Eleven out of 15 tomato open reading frames were found to be colinear with segments on five Arabidopsis bacterial artificial chromosome/P1-derived artificial chromosome clones. The synteny patterns, however, did not reveal duplicated segments in Arabidopsis, where over half of the genome is duplicated. Our analysis indicated that the microsynteny between the tomato and Arabidopsis genomes was still conserved at a very small scale but was complicated by the large number of gene families in the Arabidopsis genome. PMID:11457984

  17. (S)-3-hydroxy-3-methylglutaryl coenzyme A reductase, a product of the mva operon of Pseudomonas mevalonii, is regulated at the transcriptional level.

    PubMed Central

    Wang, Y L; Beach, M J; Rodwell, V W

    1989-01-01

    We have cloned and sequenced a 505-base-pair (bp) segment of DNA situated upstream of mvaA, the structural gene for (S)-3-hydroxy-3-methylglutaryl coenzyme A reductase (EC 1.1.1.88) of Pseudomonas mevalonii. The DNA segment that we characterized includes the promoter region for the mva operon. Nuclease S1 mapping and primer extension analysis showed that mvaA is the promoter-proximal gene of the mva operon. Transcription initiates at -56 bp relative to the first A (+1) of the translation start site. Transcription in vivo was induced by mevalonate. Structural features of the mva promoter region include an 80-bp A + T-rich region, and -12, -24 consensus sequences that resemble sequences of sigma 54 promoters in enteric organisms. The relative amplitudes of catalytic activity, enzyme protein, and mvaA mRNA are consistent with a model of regulation of this operon at the transcriptional level. Images PMID:2477360

  18. Unusual DNA Structures Associated With Germline Genetic Activity in Caenorhabditis elegans

    PubMed Central

    Fire, Andrew; Alcazar, Rosa; Tan, Frederick

    2006-01-01

    We describe a surprising long-range periodicity that underlies a substantial fraction of C. elegans genomic sequence. Extended segments (up to several hundred nucleotides) of the C. elegans genome show a strong bias toward occurrence of AA/TT dinucleotides along one face of the helix while little or no such constraint is evident on the opposite helical face. Segments with this characteristic periodicity are highly overrepresented in intron sequences and are associated with a large fraction of genes with known germline expression in C. elegans. In addition to altering the path and flexibility of DNA in vitro, sequences of this character have been shown by others to constrain DNA∷nucleosome interactions, potentially producing a structure that could resist the assembly of highly ordered (phased) nucleosome arrays that have been proposed as a precursor to heterochromatin. We propose a number of ways that the periodic occurrence of An/Tn clusters could reflect evolution and function of genes that express in the germ cell lineage of C. elegans. PMID:16648589

  19. Role of DNA secondary structures in fragile site breakage along human chromosome 10

    PubMed Central

    Dillon, Laura W.; Pierce, Levi C. T.; Ng, Maggie C. Y.; Wang, Yuh-Hwa

    2013-01-01

    The formation of alternative DNA secondary structures can result in DNA breakage leading to cancer and other diseases. Chromosomal fragile sites, which are regions of the genome that exhibit chromosomal breakage under conditions of mild replication stress, are predicted to form stable DNA secondary structures. DNA breakage at fragile sites is associated with regions that are deleted, amplified or rearranged in cancer. Despite the correlation, unbiased examination of the ability to form secondary structures has not been evaluated in fragile sites. Here, using the Mfold program, we predict potential DNA secondary structure formation on the human chromosome 10 sequence, and utilize this analysis to compare fragile and non-fragile DNA. We found that aphidicolin (APH)-induced common fragile sites contain more sequence segments with potential high secondary structure-forming ability, and these segments clustered more densely than those in non-fragile DNA. Additionally, using a threshold of secondary structure-forming ability, we refined legitimate fragile sites within the cytogenetically defined boundaries, and identified potential fragile regions within non-fragile DNA. In vitro detection of alternative DNA structure formation and a DNA breakage cell assay were used to validate the computational predictions. Many of the regions identified by our analysis coincide with genes mutated in various diseases and regions of copy number alteration in cancer. This study supports the role of DNA secondary structures in common fragile site instability, provides a systematic method for their identification and suggests a mechanism by which DNA secondary structures can lead to human disease. PMID:23297364

  20. Recognition of the Xenopus ribosomal core promoter by the transcription factor xUBF involves multiple HMG box domains and leads to an xUBF interdomain interaction.

    PubMed

    Leblanc, B; Read, C; Moss, T

    1993-02-01

    The interaction of the ribosomal transcription factor xUBF with the RNA polymerase I core promoter of Xenopus laevis has been studied both at the DNA and protein levels. It is shown that a single xUBF-DNA complex forms over the 40S initiation site (+1) and involves at least the DNA sequences between -20 and +60 bp. DNA sequences upstream of +10 and downstream of +18 are each sufficient to direct complex formation independently. HMG box 1 of xUBF independently recognizes the sequences -20 to -1 and +1 to +22 and the addition of the N-terminal dimerization domain to HMG box 1 stabilizes its interaction with these sequences approximately 10-fold. HMG boxes 2/3 interact with the DNA downstream of +22 and can independently position xUBF across the initiation site. The C-terminal segment of xUBF, HMG boxes 4, 5 or the acidic domain, directly or indirectly interact with HMG box 1, making the core promoter sequences between -11 and -15 hypersensitive to DNase. This interaction also requires the DNA sequences between +17 and +32, i.e. the HMG box 2/3 binding site. The data suggest extensive folding of the core promoter within the xUBF complex.

  1. Cloning, sequencing, and expression of cDNA for human. beta. -glucuronidase

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Oshima, A.; Kyle, J.W.; Miller, R.D.

    1987-02-01

    The authors report here the cDNA sequence for human placental ..beta..-glucuronidase (..beta..-D-glucuronoside glucuronosohydrolase, EC 3.2.1.31) and demonstrate expression of the human enzyme in transfected COS cells. They also sequenced a partial cDNA clone from human fibroblasts that contained a 153-base-pair deletion within the coding sequence and found a second type of cDNA clone from placenta that contained the same deletion. Nuclease S1 mapping studies demonstrated two types of mRNAs in human placenta that corresponded to the two types of cDNA clones isolated. The NH/sub 2/-terminal amino acid sequence determined for human spleen ..beta..-glucuronidase agreed with that inferred from the DNAmore » sequence of the two placental clones, beginning at amino acid 23, suggesting a cleaved signal sequence of 22 amino acids. When transfected into COS cells, plasmids containing either placental clone expressed an immunoprecipitable protein that contained N-linked oligosaccharides as evidenced by sensitivity to endoglycosidase F. However, only transfection with the clone containing the 153-base-pair segment led to expression of human ..beta..-glucuronidase activity. These studies provide the sequence for the full-length cDNA for human ..beta..-glucuronidase, demonstrate the existence of two populations of mRNA for ..beta..-glucuronidase in human placenta, only one of which specifies a catalytically active enzyme, and illustrate the importance of expression studies in verifying that a cDNA is functionally full-length.« less

  2. Evolutionary force of AT-rich repeats to trap genomic and episomal DNAs into the rice genome: lessons from endogenous pararetrovirus.

    PubMed

    Liu, Ruifang; Koyanagi, Kanako O; Chen, Sunlu; Kishima, Yuji

    2012-12-01

    In plant genomes, the incorporation of DNA segments is not a common method of artificial gene transfer. Nevertheless, various segments of pararetroviruses have been found in plant genomes in recent decades. The rice genome contains a number of segments of endogenous rice tungro bacilliform virus-like sequences (ERTBVs), many of which are present between AT dinucleotide repeats (ATrs). Comparison of genomic sequences between two closely related rice subspecies, japonica and indica, allowed us to verify the preferential insertion of ERTBVs into ATrs. In addition to ERTBVs, the comparative analyses showed that ATrs occasionally incorporate repeat sequences including transposable elements, and a wide range of other sequences. Besides the known genomic sequences, the insertion sequences also represented DNAs of unclear origins together with ERTBVs, suggesting that ATrs have integrated episomal DNAs that would have been suspended in the nucleus. Such insertion DNAs might be trapped by ATrs in the genome in a host-dependent manner. Conversely, other simple mono- and dinucleotide sequence repeats (SSR) were less frequently involved in insertion events relative to ATrs. Therefore, ATrs could be regarded as hot spots of double-strand breaks that induce non-homologous end joining. The insertions within ATrs occasionally generated new gene-related sequences or involved structural modifications of existing genes. Likewise, in a comparison between Arabidopsis thaliana and Arabidopsis lyrata, the insertions preferred ATrs to other SSRs. Therefore ATrs in plant genomes could be considered as genomic dumping sites that have trapped various DNA molecules and may have exerted a powerful evolutionary force. © 2012 The Authors. The Plant Journal © 2012 Blackwell Publishing Ltd.

  3. Simultaneous detection of human mitochondrial DNA and nuclear-inserted mitochondrial-origin sequences (NumtS) using forensic mtDNA amplification strategies and pyrosequencing technology.

    PubMed

    Bintz, Brittania J; Dixon, Groves B; Wilson, Mark R

    2014-07-01

    Next-generation sequencing technologies enable the identification of minor mitochondrial DNA variants with higher sensitivity than Sanger methods, allowing for enhanced identification of minor variants. In this study, mixtures of human mtDNA control region amplicons were subjected to pyrosequencing to determine the detection threshold of the Roche GS Junior(®) instrument (Roche Applied Science, Indianapolis, IN). In addition to expected variants, a set of reproducible variants was consistently found in reads from one particular amplicon. A BLASTn search of the variant sequence revealed identity to a segment of a 611-bp nuclear insertion of the mitochondrial control region (NumtS) spanning the primer-binding sites of this amplicon (Nature 1995;378:489). Primers (Hum Genet 2012;131:757; Hum Biol 1996;68:847) flanking the insertion were used to confirm the presence or absence of the NumtS in buccal DNA extracts from twenty donors. These results further our understanding of human mtDNA variation and are expected to have a positive impact on the interpretation of mtDNA profiles using deep-sequencing methods in casework. © 2014 American Academy of Forensic Sciences.

  4. Development and validation of a D-loop mtDNA SNP assay for the screening of specimens in forensic casework.

    PubMed

    Chemale, Gustavo; Paneto, Greiciane Gaburro; Menezes, Meiga Aurea Mendes; de Freitas, Jorge Marcelo; Jacques, Guilherme Silveira; Cicarelli, Regina Maria Barretto; Fagundes, Paulo Roberto

    2013-05-01

    Mitochondrial DNA (mtDNA) analysis is usually a last resort in routine forensic DNA casework. However, it has become a powerful tool for the analysis of highly degraded samples or samples containing too little or no nuclear DNA, such as old bones and hair shafts. The gold standard methodology still constitutes the direct sequencing of polymerase chain reaction (PCR) products or cloned amplicons from the HVS-1 and HVS-2 (hypervariable segment) control region segments. Identifications using mtDNA are time consuming, expensive and can be very complex, depending on the amount and nature of the material being tested. The main goal of this work is to develop a less labour-intensive and less expensive screening method for mtDNA analysis, in order to aid in the exclusion of non-matching samples and as a presumptive test prior to final confirmatory DNA sequencing. We have selected 14 highly discriminatory single nucleotide polymorphisms (SNPs) based on simulations performed by Salas and Amigo (2010) to be typed using SNaPShot(TM) (Applied Biosystems, Foster City, CA, USA). The assay was validated by typing more than 100 HVS-1/HVS-2 sequenced samples. No differences were observed between the SNP typing and DNA sequencing when results were compared, with the exception of allelic dropouts observed in a few haplotypes. Haplotype diversity simulations were performed using 172 mtDNA sequences representative of the Brazilian population and a score of 0.9794 was obtained when the 14 SNPs were used, showing that the theoretical prediction approach for the selection of highly discriminatory SNPs suggested by Salas and Amigo (2010) was confirmed in the population studied. As the main goal of the work is to develop a screening assay to skip the sequencing of all samples in a particular case, a pair-wise comparison of the sequences was done using the selected SNPs. When both HVS-1/HVS-2 SNPs were used for simulations, at least two differences were observed in 93.2% of the comparisons performed. The assay was validated with casework samples. Results show that the method is straightforward and can be used for exclusionary purposes, saving time and laboratory resources. The assay confirms the theoretic prediction suggested by Salas and Amigo (2010). All forensic advantages, such as high sensitivity and power of discrimination, as also the disadvantages, such as the occurrence of allele dropouts, are discussed throughout the article. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.

  5. Bridging two scholarly islands enriches both: COI DNA barcodes for species identification versus human mitochondrial variation for the study of migrations and pathologies.

    PubMed

    Thaler, David S; Stoeckle, Mark Y

    2016-10-01

    DNA barcodes for species identification and the analysis of human mitochondrial variation have developed as independent fields even though both are based on sequences from animal mitochondria. This study finds questions within each field that can be addressed by reference to the other. DNA barcodes are based on a 648-bp segment of the mitochondrially encoded cytochrome oxidase I. From most species, this segment is the only sequence available. It is impossible to know whether it fairly represents overall mitochondrial variation. For modern humans, the entire mitochondrial genome is available from thousands of healthy individuals. SNPs in the human mitochondrial genome are evenly distributed across all protein-encoding regions arguing that COI DNA barcode is representative. Barcode variation among related species is largely based on synonymous codons. Data on human mitochondrial variation support the interpretation that most - possibly all - synonymous substitutions in mitochondria are selectively neutral. DNA barcodes confirm reports of a low variance in modern humans compared to nonhuman primates. In addition, DNA barcodes allow the comparison of modern human variance to many other extant animal species. Birds are a well-curated group in which DNA barcodes are coupled with census and geographic data. Putting modern human variation in the context of intraspecies variation among birds shows humans to be a single breeding population of average variance.

  6. Characterization of a tandemly repeated DNA sequence family originally derived by retroposition of tRNA(Glu) in the newt.

    PubMed

    Nagahashi, S; Endoh, H; Suzuki, Y; Okada, N

    1991-11-20

    A previous report from this laboratory showed that in vitro transcription of total genomic DNA of the newt Cynopus pyrrhogaster resulted in a discrete sized 8 S RNA, which represented highly repetitive and transcribable sequences with a glutamic acid tRNA-like structure in the newt genome. We isolated four independent clones from a newt genomic library and determined the complete sequences of three 2000 to 2400 base-pair PstI fragments spanning the 8 S RNA gene. The glutamic acid tRNA-related segment in the 8 S RNA gene contains the CCA sequence expected as the 3' terminus of a tRNA molecule. Further, the 11 nucleotides located 13 nucleotides upstream from one of the two transcription initiation sites of the 8 S RNA were found to be repeated in the region upstream from the termination site, suggesting that the original unit, which is shorter than the 8 S RNA, was retrotransposed via cDNA intermediates from the PolIII transcript. In the upstream region of the 8 S RNA gene, a 360 nucleotide unit containing the glutamic acid tRNA-related segment was found to be duplicated (clones NE1 and NE10) or triplicated (clone NE3). Except for the difference in the number of the 360 nucleotide unit, the three sequences of the 2000 to 2400 base-pair PstI fragment were essentially the same with only a few mutations and minor deletions. Inverse polymerase chain reaction and sequence determination of the products, together with a Southern hybridization experiment, demonstrated that the family consists of a tandemly repeated unit of 3300, 3700 or 4100 base-pairs. Thus during evolution, this family in the newt was created by retroposition via cDNA intermediates, followed by duplication or triplication of the 360 nucleotide unit and multiplication of the 3300 to 4100 base-pair region at the DNA level.

  7. Phylogenetic Analysis of Ruminant Theileria spp. from China Based on 28S Ribosomal RNA Gene

    PubMed Central

    Gou, Huitian; Guan, Guiquan; Ma, Miling; Liu, Aihong; Liu, Zhijie; Xu, Zongke; Ren, Qiaoyun; Li, Youquan; Yang, Jifei; Chen, Ze

    2013-01-01

    Species identification using DNA sequences is the basis for DNA taxonomy. In this study, we sequenced the ribosomal large-subunit RNA gene sequences (3,037-3,061 bp) in length of 13 Chinese Theileria stocks that were infective to cattle and sheep. The complete 28S rRNA gene is relatively difficult to amplify and its conserved region is not important for phylogenetic study. Therefore, we selected the D2-D3 region from the complete 28S rRNA sequences for phylogenetic analysis. Our analyses of 28S rRNA gene sequences showed that the 28S rRNA was useful as a phylogenetic marker for analyzing the relationships among Theileria spp. in ruminants. In addition, the D2-D3 region was a short segment that could be used instead of the whole 28S rRNA sequence during the phylogenetic analysis of Theileria, and it may be an ideal DNA barcode. PMID:24327775

  8. Phylogenetic analysis of ruminant Theileria spp. from China based on 28S ribosomal RNA gene.

    PubMed

    Gou, Huitian; Guan, Guiquan; Ma, Miling; Liu, Aihong; Liu, Zhijie; Xu, Zongke; Ren, Qiaoyun; Li, Youquan; Yang, Jifei; Chen, Ze; Yin, Hong; Luo, Jianxun

    2013-10-01

    Species identification using DNA sequences is the basis for DNA taxonomy. In this study, we sequenced the ribosomal large-subunit RNA gene sequences (3,037-3,061 bp) in length of 13 Chinese Theileria stocks that were infective to cattle and sheep. The complete 28S rRNA gene is relatively difficult to amplify and its conserved region is not important for phylogenetic study. Therefore, we selected the D2-D3 region from the complete 28S rRNA sequences for phylogenetic analysis. Our analyses of 28S rRNA gene sequences showed that the 28S rRNA was useful as a phylogenetic marker for analyzing the relationships among Theileria spp. in ruminants. In addition, the D2-D3 region was a short segment that could be used instead of the whole 28S rRNA sequence during the phylogenetic analysis of Theileria, and it may be an ideal DNA barcode.

  9. An Evolutionary Classification of Genomic Function

    PubMed Central

    Graur, Dan; Zheng, Yichen; Azevedo, Ricardo B.R.

    2015-01-01

    The pronouncements of the ENCODE Project Consortium regarding “junk DNA” exposed the need for an evolutionary classification of genomic elements according to their selected-effect function. In the classification scheme presented here, we divide the genome into “functional DNA,” that is, DNA sequences that have a selected-effect function, and “rubbish DNA,” that is, sequences that do not. Functional DNA is further subdivided into “literal DNA” and “indifferent DNA.” In literal DNA, the order of nucleotides is under selection; in indifferent DNA, only the presence or absence of the sequence is under selection. Rubbish DNA is further subdivided into “junk DNA” and “garbage DNA.” Junk DNA neither contributes to nor detracts from the fitness of the organism and, hence, evolves under selective neutrality. Garbage DNA, on the other hand, decreases the fitness of its carriers. Garbage DNA exists in the genome only because natural selection is neither omnipotent nor instantaneous. Each of these four functional categories can be 1) transcribed and translated, 2) transcribed but not translated, or 3) not transcribed. The affiliation of a DNA segment to a particular functional category may change during evolution: Functional DNA may become junk DNA, junk DNA may become garbage DNA, rubbish DNA may become functional DNA, and so on; however, determining the functionality or nonfunctionality of a genomic sequence must be based on its present status rather than on its potential to change (or not to change) in the future. Changes in functional affiliation are divided into pseudogenes, Lazarus DNA, zombie DNA, and Jekyll-to-Hyde DNA. PMID:25635041

  10. Transposition-mediated DNA re-replication in maize

    PubMed Central

    Zhang, Jianbo; Zuo, Tao; Wang, Dafang; Peterson, Thomas

    2014-01-01

    Every DNA segment in a eukaryotic genome normally replicates once and only once per cell cycle to maintain genome stability. We show here that this restriction can be bypassed through alternative transposition, a transposition reaction that utilizes the termini of two separate, nearby transposable elements (TEs). Our results suggest that alternative transposition during S phase can induce re-replication of the TEs and their flanking sequences. The DNA re-replication can spontaneously abort to generate double-strand breaks, which can be repaired to generate Composite Insertions composed of transposon termini flanking segmental duplications of various lengths. These results show how alternative transposition coupled with DNA replication and repair can significantly alter genome structure and may have contributed to rapid genome evolution in maize and possibly other eukaryotes. DOI: http://dx.doi.org/10.7554/eLife.03724.001 PMID:25406063

  11. Zuotin, a putative Z-DNA binding protein in Saccharomyces cerevisiae

    NASA Technical Reports Server (NTRS)

    Zhang, S.; Lockshin, C.; Herbert, A.; Winter, E.; Rich, A.

    1992-01-01

    A putative Z-DNA binding protein, named zuotin, was purified from a yeast nuclear extract by means of a Z-DNA binding assay using [32P]poly(dG-m5dC) and [32P]oligo(dG-Br5dC)22 in the presence of B-DNA competitor. Poly(dG-Br5dC) in the Z-form competed well for the binding of a zuotin containing fraction, but salmon sperm DNA, poly(dG-dC) and poly(dA-dT) were not effective. Negatively supercoiled plasmid pUC19 did not compete, whereas an otherwise identical plasmid pUC19(CG), which contained a (dG-dC)7 segment in the Z-form was an excellent competitor. A Southwestern blot using [32P]poly(dG-m5dC) as a probe in the presence of MgCl2 identified a protein having a molecular weight of 51 kDa. The 51 kDa zuotin was partially sequenced at the N-terminal and the gene, ZUO1, was cloned, sequenced and expressed in Escherichia coli; the expressed zuotin showed similar Z-DNA binding activity, but with lower affinity than zuotin that had been partially purified from yeast. Zuotin was deduced to have a number of potential phosphorylation sites including two CDC28 (homologous to the human and Schizosaccharomyces pombe cdc2) phosphorylation sites. The hexapeptide motif KYHPDK was found in zuotin as well as in several yeast proteins, DnaJ of E.coli, csp29 and csp32 proteins of Drosophila and the small t and large T antigens of the polyoma virus. A 60 amino acid segment of zuotin has similarity to several histone H1 sequences. Disruption of ZUO1 in yeast resulted in a slow growth phenotype.

  12. Classification of European Mtdnas from an Analysis of Three European Populations

    PubMed Central

    Torroni, A.; Huoponen, K.; Francalacci, P.; Petrozzi, M.; Morelli, L.; Scozzari, R.; Obinu, D.; Savontaus, M. L.; Wallace, D. C.

    1996-01-01

    Mitochondrial DNA (mtDNA) sequence variation was examined in Finns, Swedes and Tuscans by PCR amplification and restriction analysis. About 99% of the mtDNAs were subsumed within 10 mtDNA haplogroups (H, I, J, K, M, T, U, V, W, and X) suggesting that the identified haplogroups could encompass virtually all European mtDNAs. Because both hypervariable segments of the mtDNA control region were previously sequenced in the Tuscan samples, the mtDNA haplogroups and control region sequences could be compared. Using a combination of haplogroup-specific restriction site changes and control region nucleotide substitutions, the distribution of the haplogroups was surveyed through the published restriction site polymorphism and control region sequence data of Caucasoids. This supported the conclusion that most haplogroups observed in Europe are Caucasoid-specific, and that at least some of them occur at varying frequencies in different Caucasoid populations. The classification of almost all European mtDNA variation in a number of well defined haplogroups could provide additional insights about the origin and relationships of Caucasoid populations and the process of human colonization of Europe, and is valuable for the definition of the role played by mtDNA backgrounds in the expression of pathological mtDNA mutations PMID:8978068

  13. 'Mitominis': multiplex PCR analysis of reduced size amplicons for compound sequence analysis of the entire mtDNA control region in highly degraded samples.

    PubMed

    Eichmann, Cordula; Parson, Walther

    2008-09-01

    The traditional protocol for forensic mitochondrial DNA (mtDNA) analyses involves the amplification and sequencing of the two hypervariable segments HVS-I and HVS-II of the mtDNA control region. The primers usually span fragment sizes of 300-400 bp each region, which may result in weak or failed amplification in highly degraded samples. Here we introduce an improved and more stable approach using shortened amplicons in the fragment range between 144 and 237 bp. Ten such amplicons were required to produce overlapping fragments that cover the entire human mtDNA control region. These were co-amplified in two multiplex polymerase chain reactions and sequenced with the individual amplification primers. The primers were carefully selected to minimize binding on homoplasic and haplogroup-specific sites that would otherwise result in loss of amplification due to mis-priming. The multiplexes have successfully been applied to ancient and forensic samples such as bones and teeth that showed a high degree of degradation.

  14. Analysis of sequence variability in the macronuclear DNA of Paramecium tetraurelia: A somatic view of the germline

    PubMed Central

    Duret, Laurent; Cohen, Jean; Jubin, Claire; Dessen, Philippe; Goût, Jean-François; Mousset, Sylvain; Aury, Jean-Marc; Jaillon, Olivier; Noël, Benjamin; Arnaiz, Olivier; Bétermier, Mireille; Wincker, Patrick; Meyer, Eric; Sperling, Linda

    2008-01-01

    Ciliates are the only unicellular eukaryotes known to separate germinal and somatic functions. Diploid but silent micronuclei transmit the genetic information to the next sexual generation. Polyploid macronuclei express the genetic information from a streamlined version of the genome but are replaced at each sexual generation. The macronuclear genome of Paramecium tetraurelia was recently sequenced by a shotgun approach, providing access to the gene repertoire. The 72-Mb assembly represents a consensus sequence for the somatic DNA, which is produced after sexual events by reproducible rearrangements of the zygotic genome involving elimination of repeated sequences, precise excision of unique-copy internal eliminated sequences (IES), and amplification of the cellular genes to high copy number. We report use of the shotgun sequencing data (>106 reads representing 13× coverage of a completely homozygous clone) to evaluate variability in the somatic DNA produced by these developmental genome rearrangements. Although DNA amplification appears uniform, both of the DNA elimination processes produce sequence heterogeneity. The variability that arises from IES excision allowed identification of hundreds of putative new IESs, compared to 42 that were previously known, and revealed cases of erroneous excision of segments of coding sequences. We demonstrate that IESs in coding regions are under selective pressure to introduce premature termination of translation in case of excision failure. PMID:18256234

  15. Homology between DNA polymerases of poxviruses, herpesviruses, and adenoviruses: nucleotide sequence of the vaccinia virus DNA polymerase gene.

    PubMed Central

    Earl, P L; Jones, E V; Moss, B

    1986-01-01

    A 5400-base-pair segment of the vaccinia virus genome was sequenced and an open reading frame of 938 codons was found precisely where the DNA polymerase had been mapped by transfer of a phosphonoacetate-resistance marker. A single nucleotide substitution changing glycine at position 347 to aspartic acid accounts for the drug resistance of the mutant vaccinia virus. The 5' end of the DNA polymerase mRNA was located 80 base pairs before the methionine codon initiating the open reading frame. Correspondence between the predicted Mr 108,577 polypeptide and the 110,000 purified enzyme indicates that little or no proteolytic processing occurs. Extensive homology, extending over 435 amino acids, was found upon comparing the DNA polymerase of vaccinia virus and DNA polymerase of Epstein-Barr virus. A highly conserved sequence of 14 amino acids in the carboxyl-terminal regions of the above DNA polymerases is also present at a similar location in adenovirus DNA polymerase. This structure, which is predicted to form a turn flanked by beta-pleated sheets, may form part of an essential binding or catalytic site that accounts for its presence in DNA polymerases of poxviruses, herpesviruses, and adenoviruses. Images PMID:3012524

  16. Further delineation of nonhomologous-based recombination and evidence for subtelomeric segmental duplications in 1p36 rearrangements.

    PubMed

    D'Angelo, Carla S; Gajecka, Marzena; Kim, Chong A; Gentles, Andrew J; Glotzbach, Caron D; Shaffer, Lisa G; Koiffmann, Célia P

    2009-06-01

    The mechanisms involved in the formation of subtelomeric rearrangements are now beginning to be elucidated. Breakpoint sequencing analysis of 1p36 rearrangements has made important contributions to this line of inquiry. Despite the unique architecture of segmental duplications inherent to human subtelomeres, no common mechanism has been identified thus far and different nonexclusive recombination-repair mechanisms seem to predominate. In order to gain further insights into the mechanisms of chromosome breakage, repair, and stabilization mediating subtelomeric rearrangements in humans, we investigated the constitutional rearrangements of 1p36. Cloning of the breakpoint junctions in a complex rearrangement and three non-reciprocal translocations revealed similarities at the junctions, such as microhomology of up to three nucleotides, along with no significant sequence identity in close proximity to the breakpoint regions. All the breakpoints appeared to be unique and their occurrence was limited to non-repetitive, unique DNA sequences. Several recombination- or cleavage-associated motifs that may promote non-homologous recombination were observed in close proximity to the junctions. We conclude that NHEJ is likely the mechanism of DNA repair that generates these rearrangements. Additionally, two apparently pure terminal deletions were also investigated, and the refinement of the breakpoint regions identified two distinct genomic intervals ~25-kb apart, each containing a series of 1p36 specific segmental duplications with 90-98% identity. Segmental duplications can serve as substrates for ectopic homologous recombination or stimulate genomic rearrangements.

  17. Lineage-specific evolutionary rate in plants: Contributions of a screening for Cereus (Cactaceae).

    PubMed

    Romeiro-Brito, Monique; Moraes, Evandro M; Taylor, Nigel P; Zappi, Daniela C; Franco, Fernando F

    2016-01-01

    Predictable chloroplast DNA (cpDNA) sequences have been listed for the shallowest taxonomic studies in plants. We investigated whether plastid regions that vary between closely allied species could be applied for intraspecific studies and compared the variation of these plastid segments with two nuclear regions. We screened 16 plastid and two nuclear intronic regions for species of the genus Cereus (Cactaceae) at three hierarchical levels (species from different clades, species of the same clade, and allopatric populations). Ten plastid regions presented interspecific variation, and six of them showed variation at the intraspecific level. The two nuclear regions showed both inter- and intraspecific variation, and in general they showed higher levels of variability in almost all hierarchical levels than the plastid segments. Our data suggest no correspondence between variation of plastid regions at the interspecific and intraspecific level, probably due to lineage-specific variation in cpDNA, which appears to have less effect in nuclear data. Despite the heterogeneity in evolutionary rates of cpDNA, we highlight three plastid segments that may be considered in initial screenings in plant phylogeographic studies.

  18. Genetic instability of an oligomycin resistance mutation in yeast is associated with an amplification of a mitochondrial DNA segment.

    PubMed Central

    Ragnini, A; Fukuhara, H

    1989-01-01

    In the yeast Kluyveromyces lactis, mutations affecting mitochondrial functions are often highly unstable. In order to understand the basis of this genetic instability, we examined the case of an oligomycin resistant mutant. When the mutant was grown in the absence of the drug, the resistance was rapidly lost. This character showed a typical cytoplasmic inheritance. The unstable resistance was found to be associated with the presence of a repetitive DNA in which the repeating unit was a specific segment of the mitochondrial DNA. The amplified molecules were co-replicating with the wild type genome in the mutant cells. The spontaneous loss of the drug resistance was accompanied by the disappearance of the amplified DNA. The repetitive sequence came from a 405 base-pair segment immediately downstream of a cluster of two transfer RNA genes (threonyl 2 and glutamyl). Modified processing of these tRNAs was detected in the mutant. A possible mechanism by which these events could lead to drug resistance is discussed. Images PMID:2780315

  19. A tick-borne segmented RNA virus contains genome segments derived from unsegmented viral ancestors

    PubMed Central

    Qin, Xin-Cheng; Shi, Mang; Tian, Jun-Hua; Lin, Xian-Dan; Gao, Dong-Ya; He, Jin-Rong; Wang, Jian-Bo; Li, Ci-Xiu; Kang, Yan-Jun; Yu, Bin; Zhou, Dun-Jin; Xu, Jianguo; Plyusnin, Alexander; Holmes, Edward C.; Zhang, Yong-Zhen

    2014-01-01

    Although segmented and unsegmented RNA viruses are commonplace, the evolutionary links between these two very different forms of genome organization are unclear. We report the discovery and characterization of a tick-borne virus—Jingmen tick virus (JMTV)—that reveals an unexpected connection between segmented and unsegmented RNA viruses. The JMTV genome comprises four segments, two of which are related to the nonstructural protein genes of the genus Flavivirus (family Flaviviridae), whereas the remaining segments are unique to this virus, have no known homologs, and contain a number of features indicative of structural protein genes. Remarkably, homology searching revealed that sequences related to JMTV were present in the cDNA library from Toxocara canis (dog roundworm; Nematoda), and that shared strong sequence and structural resemblances. Epidemiological studies showed that JMTV is distributed in tick populations across China, especially Rhipicephalus and Haemaphysalis spp., and experiences frequent host-switching and genomic reassortment. To our knowledge, JMTV is the first example of a segmented RNA virus with a genome derived in part from unsegmented viral ancestors. PMID:24753611

  20. Highly conserved D-loop-like nuclear mitochondrial sequences (Numts) in tiger (Panthera tigris).

    PubMed

    Zhang, Wenping; Zhang, Zhihe; Shen, Fujun; Hou, Rong; Lv, Xiaoping; Yue, Bisong

    2006-08-01

    Using oligonucleotide primers designed to match hypervariable segments I (HVS-1) of Panthera tigris mitochondrial DNA (mtDNA), we amplified two different PCR products (500 bp and 287 bp) in the tiger (Panthera tigris), but got only one PCR product (287 bp) in the leopard (Panthera pardus). Sequence analyses indicated that the sequence of 287 bp was a D-loop-like nuclear mitochondrial sequence (Numts), indicating a nuclear transfer that occurred approximately 4.8-17 million years ago in the tiger and 4.6-16 million years ago in the leopard. Although the mtDNA D-loop sequence has a rapid rate of evolution, the 287-bp Numts are highly conserved; they are nearly identical in tiger subspecies and only 1.742% different between tiger and leopard. Thus, such sequences represent molecular 'fossils' that can shed light on evolution of the mitochondrial genome and may be the most appropriate outgroup for phylogenetic analysis. This is also proved by comparing the phylogenetic trees reconstructed using the D-loop sequence of snow leopard and the 287-bp Numts as outgroup.

  1. Forensic strategy to ensure the quality of sequencing data of mitochondrial DNA in highly degraded samples.

    PubMed

    Adachi, Noboru; Umetsu, Kazuo; Shojo, Hideki

    2014-01-01

    Mitochondrial DNA (mtDNA) is widely used for DNA analysis of highly degraded samples because of its polymorphic nature and high number of copies in a cell. However, as endogenous mtDNA in deteriorated samples is scarce and highly fragmented, it is not easy to obtain reliable data. In the current study, we report the risks of direct sequencing mtDNA in highly degraded material, and suggest a strategy to ensure the quality of sequencing data. It was observed that direct sequencing data of the hypervariable segment (HVS) 1 by using primer sets that generate an amplicon of 407 bp (long-primer sets) was different from results obtained by using newly designed primer sets that produce an amplicon of 120-139 bp (mini-primer sets). The data aligned with the results of mini-primer sets analysis in an amplicon length-dependent manner; the shorter the amplicon, the more evident the endogenous sequence became. Coding region analysis using multiplex amplified product-length polymorphisms revealed the incongruence of single nucleotide polymorphisms between the coding region and HVS 1 caused by contamination with exogenous mtDNA. Although the sequencing data obtained using long-primer sets turned out to be erroneous, it was unambiguous and reproducible. These findings suggest that PCR primers that produce amplicons shorter than those currently recognized should be used for mtDNA analysis in highly degraded samples. Haplogroup motif analysis of the coding region and HVS should also be performed to improve the reliability of forensic mtDNA data. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.

  2. The first determination of Trichuris sp. from roe deer by amplification and sequenation of the ITS1-5.8S-ITS2 segment of ribosomal DNA.

    PubMed

    Salaba, O; Rylková, K; Vadlejch, J; Petrtýl, M; Scháňková, S; Brožová, A; Jankovská, I; Jebavý, L; Langrová, I

    2013-03-01

    Trichuris nematodes were isolated from roe deer (Capreolus capreolus). At first, nematodes were determined using morphological and biometrical methods. Subsequently genomic DNA was isolated and the ITS1-5.8S-ITS2 segment from ribosomal DNA (RNA) was amplified and sequenced using PCR techniques. With u sing morphological and biometrical methods, female nematodes were identified as Trichuris globulosa, and the only male was identified as Trichuris ovis. The females were classified into four morphotypes. However, analysis of the internal transcribed spacers (ITS1-5.8S-ITS2) of specimens did not confirm this classification. Moreover, the female individuals morphologically determined as T. globulosa were molecularly identified as Trichuris discolor. In the case of the only male molecular analysis match the result of the molecular identification. Furthermore, a comparative phylogenetic study was carried out with the ITS1 and ITS2 sequences of the Trichuris species from various hosts. A comparison of biometric information from T. discolor individuals from this study was also conducted.

  3. The role of DNA repair in herpesvirus pathogenesis.

    PubMed

    Brown, Jay C

    2014-10-01

    In cells latently infected with a herpesvirus, the viral DNA is present in the cell nucleus, but it is not extensively replicated or transcribed. In this suppressed state the virus DNA is vulnerable to mutagenic events that affect the host cell and have the potential to destroy the virus' genetic integrity. Despite the potential for genetic damage, however, herpesvirus sequences are well conserved after reactivation from latency. To account for this apparent paradox, I have tested the idea that host cell-encoded mechanisms of DNA repair are able to control genetic damage to latent herpesviruses. Studies were focused on homologous recombination-dependent DNA repair (HR). Methods of DNA sequence analysis were employed to scan herpesvirus genomes for DNA features able to activate HR. Analyses were carried out with a total of 39 herpesvirus DNA sequences, a group that included viruses from the alpha-, beta- and gamma-subfamilies. The results showed that all 39 genome sequences were enriched in two or more of the eight recombination-initiating features examined. The results were interpreted to indicate that HR can stabilize latent herpesvirus genomes. The results also showed, unexpectedly, that repair-initiating DNA features differed in alpha- compared to gamma-herpesviruses. Whereas inverted and tandem repeats predominated in alpha-herpesviruses, gamma-herpesviruses were enriched in short, GC-rich initiation sequences such as CCCAG and depleted in repeats. In alpha-herpesviruses, repair-initiating repeat sequences were found to be concentrated in a specific region (the S segment) of the genome while repair-initiating short sequences were distributed more uniformly in gamma-herpesviruses. The results suggest that repair pathways are activated differently in alpha- compared to gamma-herpesviruses. Copyright © 2014. Published by Elsevier Inc.

  4. The spatial and temporal expression of Ch-en, the engrailed gene in the polychaete Chaetopterus, does not support a role in body axis segmentation

    NASA Technical Reports Server (NTRS)

    Seaver, E. C.; Paulson, D. A.; Irvine, S. Q.; Martindale, M. Q.

    2001-01-01

    We are interested in understanding whether the annelids and arthropods shared a common segmented ancestor and have approached this question by characterizing the expression pattern of the segment polarity gene engrailed (en) in a basal annelid, the polychaete Chaetopterus. We have isolated an en gene, Ch-en, from a Chaetopterus cDNA library. Genomic Southern blotting suggests that this is the only en class gene in this animal. The predicted protein sequence of the 1.2-kb cDNA clone contains all five domains characteristic of en proteins in other taxa, including the en class homeobox. Whole-mount in situ hybridization reveals that Ch-en is expressed throughout larval life in a complex spatial and temporal pattern. The Ch-en transcript is initially detected in a small number of neurons associated with the apical organ and in the posterior portion of the prototrochophore. At later stages, Ch-en is expressed in distinct patterns in the three segmented body regions (A, B, and C) of Chaetopterus. In all segments, Ch-en is expressed in a small set of segmentally iterated cells in the CNS. In the A region, Ch-en is also expressed in a small group of mesodermal cells at the base of the chaetal sacs. In the B region, Ch-en is initially expressed broadly in the mesoderm that then resolves into one band/segment coincident with morphological segmentation. The mesodermal expression in the B region is located in the anterior region of each segment, as defined by the position of ganglia in the ventral nerve cord, and is involved in the morphogenesis of segment-specific feeding structures late in larval life. We observe banded mesodermal and ectodermal staining in an anterior-posterior sequence in the C region. We do not observe a segment polarity pattern of expression of Ch-en in the ectoderm, as is observed in arthropods. Copyright 2001 Academic Press.

  5. Rapid discrimination of sequences flanking and within T-DNA insertions in the Arabidopsis genome.

    PubMed

    Ponce, M R; Quesada, V; Micol, J L

    1998-05-01

    An improvement to previous methods for recovering Arabidopsis thaliana genomic DNA flanking T-DNA insertions is presented that allows for the avoidance of some of the cloning difficulties caused by the concatameric nature of T-DNA inserts. The principle of the procedure is to categorize by size restriction fragments of mutant DNA, produced in separate digestions with NdeI and Bst1107I. Given that the sites for these two enzymes are contiguous within the pGV3850:1003 T-DNA construct, the restriction fragments obtained fall into two categories: those showing identical size in both digestions, which correspond to sequences internal to T-DNA concatamers; and those of different sizes, that contain the junctions between plant DNA and the T-DNA insert. Such a criterion makes it possible to easily distinguish the digestion products corresponding to internal T-DNA parts, which do not deserve further attention, and those which presumably include a segment of the locus of interest. Discrimination between restriction fragments of genomic mutant DNA can be made on rescued plasmids, inverse PCR amplification products or bands in a genomic blot.

  6. Molecular structure of r/GCG/d/TATACGC/ - A DNA-RNA hybrid helix joined to double helical DNA

    NASA Technical Reports Server (NTRS)

    Wang, A. H.-J.; Fujii, S.; Rich, A.; Van Boom, J. H.; Van Der Marel, G. A.; Van Boeckel, S. A. A.

    1982-01-01

    The molecule r(GCG)d(TATACGC) is self-complementary and forms two DNA-RNA hybrid segments surrounding a central region of double helical DNA; its molecular structure has been solved by X-ray analysis. All three parts of the molecule adopt a conformation which is close to that seen in the 11-fold RNA double helix. The conformation of the ribonucleotides is partly determined by water molecules bridging between the ribose O2' hydroxyl group and cytosine O2. The hybrid-DNA duplex junction contains no structural discontinuities. However, the central DNA TATA sequence has some structural irregularities.

  7. Nucleotide Sequence Database Comparison for Routine Dermatophyte Identification by Internal Transcribed Spacer 2 Genetic Region DNA Barcoding.

    PubMed

    Normand, A C; Packeu, A; Cassagne, C; Hendrickx, M; Ranque, S; Piarroux, R

    2018-05-01

    Conventional dermatophyte identification is based on morphological features. However, recent studies have proposed to use the nucleotide sequences of the rRNA internal transcribed spacer (ITS) region as an identification barcode of all fungi, including dermatophytes. Several nucleotide databases are available to compare sequences and thus identify isolates; however, these databases often contain mislabeled sequences that impair sequence-based identification. We evaluated five of these databases on a clinical isolate panel. We selected 292 clinical dermatophyte strains that were prospectively subjected to an ITS2 nucleotide sequence analysis. Sequences were analyzed against the databases, and the results were compared to clusters obtained via DNA alignment of sequence segments. The DNA tree served as the identification standard throughout the study. According to the ITS2 sequence identification, the majority of strains (255/292) belonged to the genus Trichophyton , mainly T. rubrum complex ( n = 184), T. interdigitale ( n = 40), T. tonsurans ( n = 26), and T. benhamiae ( n = 5). Other genera included Microsporum (e.g., M. canis [ n = 21], M. audouinii [ n = 10], Nannizzia gypsea [ n = 3], and Epidermophyton [ n = 3]). Species-level identification of T. rubrum complex isolates was an issue. Overall, ITS DNA sequencing is a reliable tool to identify dermatophyte species given that a comprehensive and correctly labeled database is consulted. Since many inaccurate identification results exist in the DNA databases used for this study, reference databases must be verified frequently and amended in line with the current revisions of fungal taxonomy. Before describing a new species or adding a new DNA reference to the available databases, its position in the phylogenetic tree must be verified. Copyright © 2018 American Society for Microbiology.

  8. The DNA of ciliated protozoa.

    PubMed Central

    Prescott, D M

    1994-01-01

    Ciliates contain two types of nuclei: a micronucleus and a macronucleus. The micronucleus serves as the germ line nucleus but does not express its genes. The macronucleus provides the nuclear RNA for vegetative growth. Mating cells exchange haploid micronuclei, and a new macronucleus develops from a new diploid micronucleus. The old macronucleus is destroyed. This conversion consists of amplification, elimination, fragmentation, and splicing of DNA sequences on a massive scale. Fragmentation produces subchromosomal molecules in Tetrahymena and Paramecium cells and much smaller, gene-sized molecules in hypotrichous ciliates to which telomere sequences are added. These molecules are then amplified, some to higher copy numbers than others. rDNA is differentially amplified to thousands of copies per macronucleus. Eliminated sequences include transposonlike elements and sequences called internal eliminated sequences that interrupt gene coding regions in the micronuclear genome. Some, perhaps all, of these are excised as circular molecules and destroyed. In at least some hypotrichs, segments of some micronuclear genes are scrambled in a nonfunctional order and are recorded during macronuclear development. Vegetatively growing ciliates appear to possess a mechanism for adjusting copy numbers of individual genes, which corrects gene imbalances resulting from random distribution of DNA molecules during amitosis of the macronucleus. Other distinctive features of ciliate DNA include an altered use of the conventional stop codons. Images PMID:8078435

  9. An accurate algorithm for the detection of DNA fragments from dilution pool sequencing experiments.

    PubMed

    Bansal, Vikas

    2018-01-01

    The short read lengths of current high-throughput sequencing technologies limit the ability to recover long-range haplotype information. Dilution pool methods for preparing DNA sequencing libraries from high molecular weight DNA fragments enable the recovery of long DNA fragments from short sequence reads. These approaches require computational methods for identifying the DNA fragments using aligned sequence reads and assembling the fragments into long haplotypes. Although a number of computational methods have been developed for haplotype assembly, the problem of identifying DNA fragments from dilution pool sequence data has not received much attention. We formulate the problem of detecting DNA fragments from dilution pool sequencing experiments as a genome segmentation problem and develop an algorithm that uses dynamic programming to optimize a likelihood function derived from a generative model for the sequence reads. This algorithm uses an iterative approach to automatically infer the mean background read depth and the number of fragments in each pool. Using simulated data, we demonstrate that our method, FragmentCut, has 25-30% greater sensitivity compared with an HMM based method for fragment detection and can also detect overlapping fragments. On a whole-genome human fosmid pool dataset, the haplotypes assembled using the fragments identified by FragmentCut had greater N50 length, 16.2% lower switch error rate and 35.8% lower mismatch error rate compared with two existing methods. We further demonstrate the greater accuracy of our method using two additional dilution pool datasets. FragmentCut is available from https://bansal-lab.github.io/software/FragmentCut. vibansal@ucsd.edu. Supplementary data are available at Bioinformatics online. © The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com

  10. Electronic Transport in Single-Stranded DNA Molecule Related to Huntington's Disease

    NASA Astrophysics Data System (ADS)

    Sarmento, R. G.; Silva, R. N. O.; Madeira, M. P.; Frazão, N. F.; Sousa, J. O.; Macedo-Filho, A.

    2018-04-01

    We report a numerical analysis of the electronic transport in single chain DNA molecule consisting of 182 nucleotides. The DNA chains studied were extracted from a segment of the human chromosome 4p16.3, which were modified by expansion of CAG (cytosine-adenine-guanine) triplet repeats to mimics Huntington's disease. The mutated DNA chains were connected between two platinum electrodes to analyze the relationship between charge propagation in the molecule and Huntington's disease. The computations were performed within a tight-binding model, together with a transfer matrix technique, to investigate the current-voltage (I-V) of 23 types of DNA sequence and compare them with the distributions of the related CAG repeat numbers with the disease. All DNA sequences studied have a characteristic behavior of a semiconductor. In addition, the results showed a direct correlation between the current-voltage curves and the distributions of the CAG repeat numbers, suggesting possible applications in the development of DNA-based biosensors for molecular diagnostics.

  11. DNA extraction protocols cause differences in 16S rRNA amplicon sequencing efficiency but not in community profile composition or structure

    DOE PAGES

    None

    2014-12-01

    The recent development of methods applying next-generation sequencing to microbial community characterization has led to the proliferation of these studies in a wide variety of sample types. Yet, variation in the physical properties of environmental samples demands that optimal DNA extraction techniques be explored for each new environment. The microbiota associated with many species of insects offer an extraction challenge as they are frequently surrounded by an armored exoskeleton, inhibiting disruption of the tissues within. In this study, we examine the efficacy of several commonly used protocols for extracting bacterial DNA from ants. While bacterial community composition recovered using Illuminamore » 16S rRNA amplicon sequencing was not detectably biased by any method, the quantity of bacterial DNA varied drastically, reducing the number of samples that could be amplified and sequenced. These results indicate that the concentration necessary for dependable sequencing is around 10,000 copies of target DNA per microliter. Exoskeletal pulverization and tissue digestion increased the reliability of extractions, suggesting that these steps should be included in any study of insect-associated microorganisms that relies on obtaining microbial DNA from intact body segments. Although laboratory and analysis techniques should be standardized across diverse sample types as much as possible, minimal modifications such as these will increase the number of environments in which bacterial communities can be successfully studied.« less

  12. Sequence-dependent response of DNA to torsional stress: a potential biological regulation mechanism.

    PubMed

    Reymer, Anna; Zakrzewska, Krystyna; Lavery, Richard

    2018-02-28

    Torsional restraints on DNA change in time and space during the life of the cell and are an integral part of processes such as gene expression, DNA repair and packaging. The mechanical behavior of DNA under torsional stress has been studied on a mesoscopic scale, but little is known concerning its response at the level of individual base pairs and the effects of base pair composition. To answer this question, we have developed a geometrical restraint that can accurately control the total twist of a DNA segment during all-atom molecular dynamics simulations. By applying this restraint to four different DNA oligomers, we are able to show that DNA responds to both under- and overtwisting in a very heterogeneous manner. Certain base pair steps, in specific sequence environments, are able to absorb most of the torsional stress, leaving other steps close to their relaxed conformation. This heterogeneity also affects the local torsional modulus of DNA. These findings suggest that modifying torsional stress on DNA could act as a modulator for protein binding via the heterogeneous changes in local DNA structure.

  13. Sequence-dependent response of DNA to torsional stress: a potential biological regulation mechanism

    PubMed Central

    Reymer, Anna; Zakrzewska, Krystyna; Lavery, Richard

    2018-01-01

    Abstract Torsional restraints on DNA change in time and space during the life of the cell and are an integral part of processes such as gene expression, DNA repair and packaging. The mechanical behavior of DNA under torsional stress has been studied on a mesoscopic scale, but little is known concerning its response at the level of individual base pairs and the effects of base pair composition. To answer this question, we have developed a geometrical restraint that can accurately control the total twist of a DNA segment during all-atom molecular dynamics simulations. By applying this restraint to four different DNA oligomers, we are able to show that DNA responds to both under- and overtwisting in a very heterogeneous manner. Certain base pair steps, in specific sequence environments, are able to absorb most of the torsional stress, leaving other steps close to their relaxed conformation. This heterogeneity also affects the local torsional modulus of DNA. These findings suggest that modifying torsional stress on DNA could act as a modulator for protein binding via the heterogeneous changes in local DNA structure. PMID:29267977

  14. Escaping introns in COI through cDNA barcoding of mushrooms: Pleurotus as a test case.

    PubMed

    Avin, Farhat A; Subha, Bhassu; Tan, Yee-Shin; Braukmann, Thomas W A; Vikineswary, Sabaratnam; Hebert, Paul D N

    2017-09-01

    DNA barcoding involves the use of one or more short, standardized DNA fragments for the rapid identification of species. A 648-bp segment near the 5' terminus of the mitochondrial cytochrome c oxidase subunit I (COI) gene has been adopted as the universal DNA barcode for members of the animal kingdom, but its utility in mushrooms is complicated by the frequent occurrence of large introns. As a consequence, ITS has been adopted as the standard DNA barcode marker for mushrooms despite several shortcomings. This study employed newly designed primers coupled with cDNA analysis to examine COI sequence diversity in six species of Pleurotus and compared these results with those for ITS. The ability of the COI gene to discriminate six species of Pleurotus , the commonly cultivated oyster mushroom, was examined by analysis of cDNA. The amplification success, sequence variation within and among species, and the ability to design effective primers was tested. We compared ITS sequences to their COI cDNA counterparts for all isolates. ITS discriminated between all six species, but some sequence results were uninterpretable, because of length variation among ITS copies. By comparison, a complete COI sequences were recovered from all but three individuals of Pleurotus giganteus where only the 5' region was obtained. The COI sequences permitted the resolution of all species when partial data was excluded for P. giganteus . Our results suggest that COI can be a useful barcode marker for mushrooms when cDNA analysis is adopted, permitting identifications in cases where ITS cannot be recovered or where it offers higher resolution when fresh tissue is. The suitability of this approach remains to be confirmed for other mushrooms.

  15. Chromosome evolution in the Thermotogales: large-scale inversions and strain diversification of CRISPR sequences.

    PubMed

    DeBoy, Robert T; Mongodin, Emmanuel F; Emerson, Joanne B; Nelson, Karen E

    2006-04-01

    In the present study, the chromosomes of two members of the Thermotogales were compared. A whole-genome alignment of Thermotoga maritima MSB8 and Thermotoga neapolitana NS-E has revealed numerous large-scale DNA rearrangements, most of which are associated with CRISPR DNA repeats and/or tRNA genes. These DNA rearrangements do not include the putative origin of DNA replication but move within the same replichore, i.e., the same replicating half of the chromosome (delimited by the replication origin and terminus). Based on cumulative GC skew analysis, both the T. maritima and T. neapolitana lineages contain one or two major inverted DNA segments. Also, based on PCR amplification and sequence analysis of the DNA joints that are associated with the major rearrangements, the overall chromosome architecture was found to be conserved at most DNA joints for other strains of T. neapolitana. Taken together, the results from this analysis suggest that the observed chromosomal rearrangements in the Thermotogales likely occurred by successive inversions after their divergence from a common ancestor and before strain diversification. Finally, sequence analysis shows that size polymorphisms in the DNA joints associated with CRISPRs can be explained by expansion and possibly contraction of the DNA repeat and spacer unit, providing a tool for discerning the relatedness of strains from different geographic locations.

  16. Caught in the act: the lifetime of synaptic intermediates during the search for homology on DNA

    PubMed Central

    Mani, Adam; Braslavsky, Ido; Arbel-Goren, Rinat; Stavans, Joel

    2010-01-01

    Homologous recombination plays pivotal roles in DNA repair and in the generation of genetic diversity. To locate homologous target sequences at which strand exchange can occur within a timescale that a cell’s biology demands, a single-stranded DNA-recombinase complex must search among a large number of sequences on a genome by forming synapses with chromosomal segments of DNA. A key element in the search is the time it takes for the two sequences of DNA to be compared, i.e. the synapse lifetime. Here, we visualize for the first time fluorescently tagged individual synapses formed by RecA, a prokaryotic recombinase, and measure their lifetime as a function of synapse length and differences in sequence between the participating DNAs. Surprisingly, lifetimes can be ∼10 s long when the DNAs are fully heterologous, and much longer for partial homology, consistently with ensemble FRET measurements. Synapse lifetime increases rapidly as the length of a region of full homology at either the 3′- or 5′-ends of the invading single-stranded DNA increases above 30 bases. A few mismatches can reduce dramatically the lifetime of synapses formed with nearly homologous DNAs. These results suggest the need for facilitated homology search mechanisms to locate homology successfully within the timescales observed in vivo. PMID:20044347

  17. Analysis of a library of macaque nuclear mitochondrial sequences confirms macaque origin of divergent sequences from old oral polio vaccine samples.

    PubMed

    Vartanian, Jean-Pierre; Wain-Hobson, Simon

    2002-05-28

    Nuclear mtDNA sequences (numts) are a widespread family of paralogs evolving as pseudogenes in chromosomal DNA [Zhang, D. E. & Hewitt, G. M. (1996) TREE 11, 247-251 and Bensasson, D., Zhang, D., Hartl, D. L. & Hewitt, G. M. (2001) TREE 16, 314-321]. When trying to identify the species origin of an unknown DNA sample by way of an mtDNA locus, PCR may amplify both mtDNA and numts. Indeed, occasionally numts dominate confounding attempts at species identification [Bensasson, D., Zhang, D. X. & Hewitt, G. M. (2000) Mol. Biol. Evol. 17, 406-415; Wallace, D. C., et al. (1997) Proc. Natl. Acad. Sci. USA 94, 14900-14905]. Rhesus and cynomolgus macaque mtDNA haplotypes were identified in a study of oral polio vaccine samples dating from the late 1950s [Blancou, P., et al. (2001) Nature (London) 410, 1045-1046]. They were accompanied by a number of putative numts. To confirm that these putative numts were of macaque origin, a library of numts corresponding to a small segment of 12S rDNA locus has been made by using DNA from a Chinese rhesus macaque. A broad distribution was found with up to 30% sequence variation. Phylogenetic analysis showed that the evolutionary trajectories of numts and bona fide mtDNA haplotypes do not overlap with the signal exception of the host species; mtDNA fragments are continually crossing over into the germ line. In the case of divergent mtDNA sequences from old oral polio vaccine samples [Blancou, P., et al. (2001) Nature (London) 410, 1045-1046], all were closely related to numts in the Chinese macaque library.

  18. Sequence-Level Mechanisms of Human Epigenome Evolution

    PubMed Central

    Prendergast, James G.D.; Chambers, Emily V.; Semple, Colin A.M.

    2014-01-01

    DNA methylation and chromatin states play key roles in development and disease. However, the extent of recent evolutionary divergence in the human epigenome and the influential factors that have shaped it are poorly understood. To determine the links between genome sequence and human epigenome evolution, we examined the divergence of DNA methylation and chromatin states following segmental duplication events in the human lineage. Chromatin and DNA methylation states were found to have been generally well conserved following a duplication event, with the evolution of the epigenome largely uncoupled from the total number of genetic changes in the surrounding DNA sequence. However, the epigenome at tissue-specific, distal regulatory regions was observed to be unusually prone to diverge following duplication, with particular sequence differences, altering known sequence motifs, found to be associated with divergence in patterns of DNA methylation and chromatin. Alu elements were found to have played a particularly prominent role in shaping human epigenome evolution, and we show that human-specific AluY insertion events are strongly linked to the evolution of the DNA methylation landscape and gene expression levels, including at key neurological genes in the human brain. Studying paralogous regions within the same sample enables the study of the links between genome and epigenome evolution while controlling for biological and technical variation. We show DNA methylation and chromatin divergence between duplicated regions are linked to the divergence of particular genetic motifs, with Alu elements having played a disproportionate role in the evolution of the epigenome in the human lineage. PMID:24966180

  19. Genome Calligrapher: A Web Tool for Refactoring Bacterial Genome Sequences for de Novo DNA Synthesis.

    PubMed

    Christen, Matthias; Deutsch, Samuel; Christen, Beat

    2015-08-21

    Recent advances in synthetic biology have resulted in an increasing demand for the de novo synthesis of large-scale DNA constructs. Any process improvement that enables fast and cost-effective streamlining of digitized genetic information into fabricable DNA sequences holds great promise to study, mine, and engineer genomes. Here, we present Genome Calligrapher, a computer-aided design web tool intended for whole genome refactoring of bacterial chromosomes for de novo DNA synthesis. By applying a neutral recoding algorithm, Genome Calligrapher optimizes GC content and removes obstructive DNA features known to interfere with the synthesis of double-stranded DNA and the higher order assembly into large DNA constructs. Subsequent bioinformatics analysis revealed that synthesis constraints are prevalent among bacterial genomes. However, a low level of codon replacement is sufficient for refactoring bacterial genomes into easy-to-synthesize DNA sequences. To test the algorithm, 168 kb of synthetic DNA comprising approximately 20 percent of the synthetic essential genome of the cell-cycle bacterium Caulobacter crescentus was streamlined and then ordered from a commercial supplier of low-cost de novo DNA synthesis. The successful assembly into eight 20 kb segments indicates that Genome Calligrapher algorithm can be efficiently used to refactor difficult-to-synthesize DNA. Genome Calligrapher is broadly applicable to recode biosynthetic pathways, DNA sequences, and whole bacterial genomes, thus offering new opportunities to use synthetic biology tools to explore the functionality of microbial diversity. The Genome Calligrapher web tool can be accessed at https://christenlab.ethz.ch/GenomeCalligrapher  .

  20. Conformation of Tax-response elements in the human T-cell leukemia virus type I promoter.

    PubMed

    Cox, J M; Sloan, L S; Schepartz, A

    1995-12-01

    HTLV-I Tax is believed to activate viral gene expression by binding bZIP proteins (such as CREB) and increasing their affinities for proviral TRE target sites. Each 21 bp TRE target site contains an imperfect copy of the intrinsically bent CRE target site (the TRE core) surrounded by highly conserved flanking sequences. These flanking sequences are essential for maximal increases in DNA affinity and transactivation, but they are not, apparently, contacted by protein. Here we employ non-denaturing gel electrophoresis to evaluate TRE conformation in the presence and absence of bZIP proteins, and to explore the role of DNA conformation in viral transactivation. Our results show that the TRE-1 flanking sequences modulate the structure and modestly increase the affinity of a CREB bZIP peptide for the TRE-1 core recognition sequence. These flanking sequences are also essential for a maximal increase in stability of the CREB-DNA complex in the presence of Tax. The CRE-like TRE core and the TRE flanking sequences are both essential for formation of stable CREB-TRE-1 and Tax-CREB-TRE-1 complexes. These two DNA segments may have co-evolved into a unique structure capable of recognizing Tax and a bZIP protein.

  1. Zinc-binding Domain of the Bacteriophage T7 DNA Primase Modulates Binding to the DNA Template*

    PubMed Central

    Lee, Seung-Joo; Zhu, Bin; Akabayov, Barak; Richardson, Charles C.

    2012-01-01

    The zinc-binding domain (ZBD) of prokaryotic DNA primases has been postulated to be crucial for recognition of specific sequences in the single-stranded DNA template. To determine the molecular basis for this role in recognition, we carried out homolog-scanning mutagenesis of the zinc-binding domain of DNA primase of bacteriophage T7 using a bacterial homolog from Geobacillus stearothermophilus. The ability of T7 DNA primase to catalyze template-directed oligoribonucleotide synthesis is eliminated by substitution of any five-amino acid residue-long segment within the ZBD. The most significant defect occurs upon substitution of a region (Pro-16 to Cys-20) spanning two cysteines that coordinate the zinc ion. The role of this region in primase function was further investigated by generating a protein library composed of multiple amino acid substitutions for Pro-16, Asp-18, and Asn-19 followed by genetic screening for functional proteins. Examination of proteins selected from the screening reveals no change in sequence-specific recognition. However, the more positively charged residues in the region facilitate DNA binding, leading to more efficient oligoribonucleotide synthesis on short templates. The results suggest that the zinc-binding mode alone is not responsible for sequence recognition, but rather its interaction with the RNA polymerase domain is critical for DNA binding and for sequence recognition. Consequently, any alteration in the ZBD that disturbs its conformation leads to loss of DNA-dependent oligoribonucleotide synthesis. PMID:23024359

  2. Phylogenetic relationships in three species of canine Demodex mite based on partial sequences of mitochondrial 16S rDNA.

    PubMed

    Sastre, Natalia; Ravera, Ivan; Villanueva, Sergio; Altet, Laura; Bardagí, Mar; Sánchez, Armand; Francino, Olga; Ferrer, Lluís

    2012-12-01

    The historical classification of Demodex mites has been based on their hosts and morphological features. Genome sequencing has proved to be a very effective taxonomic tool in phylogenetic studies and has been applied in the classification of Demodex. Mitochondrial 16S rDNA has been demonstrated to be an especially useful marker to establish phylogenetic relationships. To amplify and sequence a segment of the mitochondrial 16S rDNA from Demodex canis and Demodex injai, as well as from the short-bodied mite called, unofficially, D. cornei and to determine their genetic proximity. Demodex mites were examined microscopically and classified as Demodex folliculorum (one sample), D. canis (four samples), D. injai (two samples) or the short-bodied species D. cornei (three samples). DNA was extracted, and a 338 bp fragment of the 16S rDNA was amplified and sequenced. The sequences of the four D. canis mites were identical and shared 99.6 and 97.3% identity with two D. canis sequences available at GenBank. The sequences of the D. cornei isolates were identical and showed 97.8, 98.2 and 99.6% identity with the D. canis isolates. The sequences of the two D. injai isolates were also identical and showed 76.6% identity with the D. canis sequence. Demodex canis and D. injai are two different species, with a genetic distance of 23.3%. It would seem that the short-bodied Demodex mite D. cornei is a morphological variant of D. canis. © 2012 The Authors. Veterinary Dermatology © 2012 ESVD and ACVD.

  3. Lineage-specific evolutionary rate in plants: Contributions of a screening for Cereus (Cactaceae)1

    PubMed Central

    Romeiro-Brito, Monique; Moraes, Evandro M.; Taylor, Nigel P.; Zappi, Daniela C.; Franco, Fernando F.

    2016-01-01

    Premise of the study: Predictable chloroplast DNA (cpDNA) sequences have been listed for the shallowest taxonomic studies in plants. We investigated whether plastid regions that vary between closely allied species could be applied for intraspecific studies and compared the variation of these plastid segments with two nuclear regions. Methods: We screened 16 plastid and two nuclear intronic regions for species of the genus Cereus (Cactaceae) at three hierarchical levels (species from different clades, species of the same clade, and allopatric populations). Results: Ten plastid regions presented interspecific variation, and six of them showed variation at the intraspecific level. The two nuclear regions showed both inter- and intraspecific variation, and in general they showed higher levels of variability in almost all hierarchical levels than the plastid segments. Discussion: Our data suggest no correspondence between variation of plastid regions at the interspecific and intraspecific level, probably due to lineage-specific variation in cpDNA, which appears to have less effect in nuclear data. Despite the heterogeneity in evolutionary rates of cpDNA, we highlight three plastid segments that may be considered in initial screenings in plant phylogeographic studies. PMID:26819857

  4. Reversal of a Neurospora Translocation by Crossing over Involving Displaced Rdna, and Methylation of the Rdna Segments That Result from Recombination

    PubMed Central

    Perkins, David D.; Metzenberg, Robert L.; Raju, Namboori B.; Selker, Eric U.; Barry, Edward G.

    1986-01-01

    In translocation OY321 of Neurospora crassa, the nucleolus organizer is divided into two segments, a proximal portion located interstitially in one interchange chromosome, and a distal portion now located terminally on another chromosome, linkage group I. In crosses of Translocation x Translocation, exceptional progeny are recovered nonselectively in which the chromosome sequence has apparently reverted to Normal. Genetic, cytological, and molecular evidence indicates that reversion is the result of meiotic crossing over between homologous displaced rDNA repeats. Marker linkages are wild type in these exceptional progeny. They differ from wild type, however, in retaining an interstitial block of rRNA genes which can be demonstrated cytologically by the presence of a second, small interstitial nucleolus and genetically by linkage of an rDNA restriction site polymorphism to the mating-type locus in linkage group I. The interstitial rDNA is more highly methylated than the terminal rDNA. The mechanism by which methylation enzymes distinguish between interstitial rDNA and terminal rDNA is unknown. Some hypotheses are considered. PMID:2947829

  5. Bloom DNA Helicase Facilitates Homologous Recombination between Diverged Homologous Sequences*

    PubMed Central

    Kikuchi, Koji; Abdel-Aziz, H. Ismail; Taniguchi, Yoshihito; Yamazoe, Mitsuyoshi; Takeda, Shunichi; Hirota, Kouji

    2009-01-01

    Bloom syndrome caused by inactivation of the Bloom DNA helicase (Blm) is characterized by increases in the level of sister chromatid exchange, homologous recombination (HR) associated with cross-over. It is therefore believed that Blm works as an anti-recombinase. Meanwhile, in Drosophila, DmBlm is required specifically to promote the synthesis-dependent strand anneal (SDSA), a type of HR not associating with cross-over. However, conservation of Blm function in SDSA through higher eukaryotes has been a matter of debate. Here, we demonstrate the function of Blm in SDSA type HR in chicken DT40 B lymphocyte line, where Ig gene conversion diversifies the immunoglobulin V gene through intragenic HR between diverged homologous segments. This reaction is initiated by the activation-induced cytidine deaminase enzyme-mediated uracil formation at the V gene, which in turn converts into abasic site, presumably leading to a single strand gap. Ig gene conversion frequency was drastically reduced in BLM−/− cells. In addition, BLM−/− cells used limited donor segments harboring higher identity compared with other segments in Ig gene conversion event, suggesting that Blm can promote HR between diverged sequences. To further understand the role of Blm in HR between diverged homologous sequences, we measured the frequency of gene targeting induced by an I-SceI-endonuclease-mediated double-strand break. BLM−/− cells showed a severer defect in the gene targeting frequency as the number of heterologous sequences increased at the double-strand break site. Conversely, the overexpression of Blm, even an ATPase-defective mutant, strongly stimulated gene targeting. In summary, Blm promotes HR between diverged sequences through a novel ATPase-independent mechanism. PMID:19661064

  6. Analysis of the DNA sequence of a 15,500 bp fragment near the left telomere of chromosome XV from Saccharomyces cerevisiae reveals a putative sugar transporter, a carboxypeptidase homologue and two new open reading frames.

    PubMed

    Gamo, F J; Lafuente, M J; Casamayor, A; Ariño, J; Aldea, M; Casas, C; Herrero, E; Gancedo, C

    1996-06-15

    We report the sequence of a 15.5 kb DNA segment located near the left telomere of chromosome XV of Saccharomyces cerevisiae. The sequence contains nine open reading frames (ORFs) longer than 300 bp. Three of them are internal to other ones. One corresponds to the gene LGT3 that encodes a putative sugar transporter. Three adjacent ORFs were separated by two stop codons in frame. These ORFs presented homology with the gene CPS1 that encodes carboxypeptidase S. The stop codons were not found in the same sequence derived from another yeast strain. Two other ORFs without significant homology in databases were also found. One of them, O0420, is very rich in serine and threonine and presents a series of repeated or similar amino acid stretches along the sequence.

  7. Molecular characterization of the canine mitochondrial DNA control region for forensic applications.

    PubMed

    Eichmann, Cordula; Parson, Walther

    2007-09-01

    The canine mitochondrial DNA (mtDNA) control region of 133 dogs living in the area around Innsbruck, Austria was sequenced. A total of 40 polymorphic sites were observed in the first hypervariable segment and 15 in the second, which resulted in the differentiation of 40 distinct haplotypes. We observed five nucleotide positions that were highly polymorphic within different haplogroups, and they represent good candidates for mtDNA screening. We found five point heteroplasmic positions; all located in HVS-I and a polythymine region in HVS-II, the latter often being associated with length heteroplasmy. In contrast to human mtDNA, the canine control region contains a hypervariable 10 nucleotide repeat region, which is located between the two hypervariable regions. In our population sample, we observed eight different repeat types, which we characterized by direct sequencing and fragment length analysis. The discrimination power of the canine mtDNA control region was 0.93, not taking the polymorphic repeat region into consideration.

  8. Variation in the number of nucleoli and incomplete homogenization of 18S ribosomal DNA sequences in leaf cells of the cultivated Oriental ginseng (Panax ginseng Meyer).

    PubMed

    Chelomina, Galina N; Rozhkovan, Konstantin V; Voronova, Anastasia N; Burundukova, Olga L; Muzarok, Tamara I; Zhuravlev, Yuri N

    2016-04-01

    Wild ginseng, Panax ginseng Meyer, is an endangered species of medicinal plants. In the present study, we analyzed variations within the ribosomal DNA (rDNA) cluster to gain insight into the genetic diversity of the Oriental ginseng, P. ginseng, at artificial plant cultivation. The roots of wild P. ginseng plants were sampled from a nonprotected natural population of the Russian Far East. The slides were prepared from leaf tissues using the squash technique for cytogenetic analysis. The 18S rDNA sequences were cloned and sequenced. The distribution of nucleotide diversity, recombination events, and interspecific phylogenies for the total 18S rDNA sequence data set was also examined. In mesophyll cells, mononucleolar nuclei were estimated to be dominant (75.7%), while the remaining nuclei contained two to four nucleoli. Among the analyzed 18S rDNA clones, 20% were identical to the 18S rDNA sequence of P. ginseng from Japan, and other clones differed in one to six substitutions. The nucleotide polymorphism was more expressed at the positions 440-640 bp, and distributed in variable regions, expansion segments, and conservative elements of core structure. The phylogenetic analysis confirmed conspecificity of ginseng plants cultivated in different regions, with two fixed mutations between P. ginseng and other species. This study identified the evidences of the intragenomic nucleotide polymorphism in the 18S rDNA sequences of P. ginseng. These data suggest that, in cultivated plants, the observed genome instability may influence the synthesis of biologically active compounds, which are widely used in traditional medicine.

  9. Variation in the number of nucleoli and incomplete homogenization of 18S ribosomal DNA sequences in leaf cells of the cultivated Oriental ginseng (Panax ginseng Meyer)

    PubMed Central

    Chelomina, Galina N.; Rozhkovan, Konstantin V.; Voronova, Anastasia N.; Burundukova, Olga L.; Muzarok, Tamara I.; Zhuravlev, Yuri N.

    2015-01-01

    Background Wild ginseng, Panax ginseng Meyer, is an endangered species of medicinal plants. In the present study, we analyzed variations within the ribosomal DNA (rDNA) cluster to gain insight into the genetic diversity of the Oriental ginseng, P. ginseng, at artificial plant cultivation. Methods The roots of wild P. ginseng plants were sampled from a nonprotected natural population of the Russian Far East. The slides were prepared from leaf tissues using the squash technique for cytogenetic analysis. The 18S rDNA sequences were cloned and sequenced. The distribution of nucleotide diversity, recombination events, and interspecific phylogenies for the total 18S rDNA sequence data set was also examined. Results In mesophyll cells, mononucleolar nuclei were estimated to be dominant (75.7%), while the remaining nuclei contained two to four nucleoli. Among the analyzed 18S rDNA clones, 20% were identical to the 18S rDNA sequence of P. ginseng from Japan, and other clones differed in one to six substitutions. The nucleotide polymorphism was more expressed at the positions 440–640 bp, and distributed in variable regions, expansion segments, and conservative elements of core structure. The phylogenetic analysis confirmed conspecificity of ginseng plants cultivated in different regions, with two fixed mutations between P. ginseng and other species. Conclusion This study identified the evidences of the intragenomic nucleotide polymorphism in the 18S rDNA sequences of P. ginseng. These data suggest that, in cultivated plants, the observed genome instability may influence the synthesis of biologically active compounds, which are widely used in traditional medicine. PMID:27158239

  10. Reading of the non-template DNA by transcription elongation factors.

    PubMed

    Svetlov, Vladimir; Nudler, Evgeny

    2018-05-14

    Unlike transcription initiation and termination, which have easily discernable signals such as promoters and terminators, elongation is regulated through a dynamic network involving RNA/DNA pause signals and states- rather than sequence-specific protein interactions. A report by Nedialkov et al. (in press) provides experimental evidence for sequence-specific recruitment of elongation factor RfaH to transcribing RNA polymerase (RNAP) and outlines the mechanism of gene expression regulation by restraint ("locking") of the DNA non-template strand. According to this model, the elongation complex pauses at the so called "operon polarity sequence" (found in some long bacterial operons coding for virulence genes), when the usually flexible non-template DNA strand adopts a distinct hairpin-loop conformation on the surface of transcribing RNAP. Sequence-specific binding of RfaH to this DNA segment facilitates conversion of RfaH from its inactive closed to its active open conformation. The interaction network formed between RfaH, non-template DNA, and RNAP locks DNA in a conformation that renders the elongation complex resistant to pausing and termination. The effects of such locking on transcript elongation can be mimicked by restraint of the non-template strand due to its shortening. This work advances our understanding of regulation of transcript elongation and has important implications for the action of general transcription factors, such as NusG, which lack apparent sequence-specificity, as well as for the mechanisms of other processes linked to transcription such as transcription-coupled DNA repair. This article is protected by copyright. All rights reserved. © 2018 John Wiley & Sons Ltd.

  11. B-DNA to Z-DNA structural transitions in the SV40 enhancer: stabilization of Z-DNA in negatively supercoiled DNA minicircles

    NASA Technical Reports Server (NTRS)

    Gruskin, E. A.; Rich, A.

    1993-01-01

    During replication and transcription, the SV40 control region is subjected to significant levels of DNA unwinding. There are three, alternating purine-pyrimidine tracts within this region that can adopt the Z-DNA conformation in response to negative superhelix density: a single copy of ACACACAT and two copies of ATGCATGC. Since the control region is essential for both efficient transcription and replication, B-DNA to Z-DNA transitions in these vital sequence tracts may have significant biological consequences. We have synthesized DNA minicircles to detect B-DNA to Z-DNA transitions in the SV40 enhancer, and to determine the negative superhelix density required to stabilize the Z-DNA. A variety of DNA sequences, including the entire SV40 enhancer and the two segments of the enhancer with alternating purine-pyrimidine tracts, were incorporated into topologically relaxed minicircles. Negative supercoils were generated, and the resulting topoisomers were resolved by electrophoresis. Using an anti-Z-DNA Fab and an electrophoretic mobility shift assay, Z-DNA was detected in the enhancer-containing minicircles at a superhelix density of -0.05. Fab saturation binding experiments demonstrated that three, independent Z-DNA tracts were stabilized in the supercoiled minicircles. Two other minicircles, each with one of the two alternating purine-pyrimidine tracts, also contained single Z-DNA sites. These results confirm the identities of the Z-DNA-forming sequences within the control region. Moreover, the B-DNA to Z-DNA transitions were detected at superhelix densities observed during normal replication and transcription processes in the SV40 life cycle.

  12. The signature of somatic hypermutation appears to be written into the germline IgV segment repertoire.

    PubMed

    Blanden, R V; Rothenfluh, H S; Zylstra, P; Weiller, G F; Steele, E J

    1998-04-01

    We present here a unifying hypothesis for the molecular mechanism of somatic hypermutation and somatic gene conversion in IgV genes involving reverse transcription using RNA templates from the V-gene loci to produce cDNA which undergoes homologous recombination with chromosomal V(D)J DNA. Experimental evidence produced over the last 20 years is essentially consistent with this hypothesis. We also review evidence suggesting that somatically generated IgV sequences from B lymphocytes have been fed back to germline DNA over evolutionary time.

  13. Single haplotype assembly of the human genome from a hydatidiform mole.

    PubMed

    Steinberg, Karyn Meltz; Schneider, Valerie A; Graves-Lindsay, Tina A; Fulton, Robert S; Agarwala, Richa; Huddleston, John; Shiryev, Sergey A; Morgulis, Aleksandr; Surti, Urvashi; Warren, Wesley C; Church, Deanna M; Eichler, Evan E; Wilson, Richard K

    2014-12-01

    A complete reference assembly is essential for accurately interpreting individual genomes and associating variation with phenotypes. While the current human reference genome sequence is of very high quality, gaps and misassemblies remain due to biological and technical complexities. Large repetitive sequences and complex allelic diversity are the two main drivers of assembly error. Although increasing the length of sequence reads and library fragments can improve assembly, even the longest available reads do not resolve all regions. In order to overcome the issue of allelic diversity, we used genomic DNA from an essentially haploid hydatidiform mole, CHM1. We utilized several resources from this DNA including a set of end-sequenced and indexed BAC clones and 100× Illumina whole-genome shotgun (WGS) sequence coverage. We used the WGS sequence and the GRCh37 reference assembly to create an assembly of the CHM1 genome. We subsequently incorporated 382 finished BAC clone sequences to generate a draft assembly, CHM1_1.1 (NCBI AssemblyDB GCA_000306695.2). Analysis of gene, repetitive element, and segmental duplication content show this assembly to be of excellent quality and contiguity. However, comparison to assembly-independent resources, such as BAC clone end sequences and PacBio long reads, indicate misassembled regions. Most of these regions are enriched for structural variation and segmental duplication, and can be resolved in the future. This publicly available assembly will be integrated into the Genome Reference Consortium curation framework for further improvement, with the ultimate goal being a completely finished gap-free assembly. © 2014 Steinberg et al.; Published by Cold Spring Harbor Laboratory Press.

  14. Single haplotype assembly of the human genome from a hydatidiform mole

    PubMed Central

    Steinberg, Karyn Meltz; Schneider, Valerie A.; Graves-Lindsay, Tina A.; Fulton, Robert S.; Agarwala, Richa; Huddleston, John; Shiryev, Sergey A.; Morgulis, Aleksandr; Surti, Urvashi; Warren, Wesley C.; Church, Deanna M.; Eichler, Evan E.; Wilson, Richard K.

    2014-01-01

    A complete reference assembly is essential for accurately interpreting individual genomes and associating variation with phenotypes. While the current human reference genome sequence is of very high quality, gaps and misassemblies remain due to biological and technical complexities. Large repetitive sequences and complex allelic diversity are the two main drivers of assembly error. Although increasing the length of sequence reads and library fragments can improve assembly, even the longest available reads do not resolve all regions. In order to overcome the issue of allelic diversity, we used genomic DNA from an essentially haploid hydatidiform mole, CHM1. We utilized several resources from this DNA including a set of end-sequenced and indexed BAC clones and 100× Illumina whole-genome shotgun (WGS) sequence coverage. We used the WGS sequence and the GRCh37 reference assembly to create an assembly of the CHM1 genome. We subsequently incorporated 382 finished BAC clone sequences to generate a draft assembly, CHM1_1.1 (NCBI AssemblyDB GCA_000306695.2). Analysis of gene, repetitive element, and segmental duplication content show this assembly to be of excellent quality and contiguity. However, comparison to assembly-independent resources, such as BAC clone end sequences and PacBio long reads, indicate misassembled regions. Most of these regions are enriched for structural variation and segmental duplication, and can be resolved in the future. This publicly available assembly will be integrated into the Genome Reference Consortium curation framework for further improvement, with the ultimate goal being a completely finished gap-free assembly. PMID:25373144

  15. Biparental inheritance of organelles in Pelargonium: evidence for intergenomic recombination of mitochondrial DNA.

    PubMed

    Apitz, Janina; Weihe, Andreas; Pohlheim, Frank; Börner, Thomas

    2013-02-01

    While uniparental transmission of mtDNA is widespread and dominating in eukaryotes leaving mutation as the major source of genotypic diversity, recently, biparental inheritance of mitochondrial genes has been demonstrated in reciprocal crosses of Pelargonium zonale and P. inquinans. The thereby arising heteroplasmy carries the potential for recombination between mtDNAs of different descent, i.e. between the parental mitochondrial genomes. We have analyzed these Pelargonium hybrids for mitochondrial intergenomic recombination events by examining differences in DNA blot hybridization patterns of the mitochondrial genes atp1 and cob. Further investigation of these genes and their flanking regions using nucleotide sequence polymorphisms and PCR revealed DNA segments in the progeny, which contained both P. zonale and P. inquinans sequences suggesting an intergenomic recombination in hybrids of Pelargonium. This turns Pelargonium into an interesting subject for studies of recombination and evolutionary dynamics of mitochondrial genomes.

  16. Organizational heterogeneity of vertebrate genomes.

    PubMed

    Frenkel, Svetlana; Kirzhner, Valery; Korol, Abraham

    2012-01-01

    Genomes of higher eukaryotes are mosaics of segments with various structural, functional, and evolutionary properties. The availability of whole-genome sequences allows the investigation of their structure as "texts" using different statistical and computational methods. One such method, referred to as Compositional Spectra (CS) analysis, is based on scoring the occurrences of fixed-length oligonucleotides (k-mers) in the target DNA sequence. CS analysis allows generating species- or region-specific characteristics of the genome, regardless of their length and the presence of coding DNA. In this study, we consider the heterogeneity of vertebrate genomes as a joint effect of regional variation in sequence organization superimposed on the differences in nucleotide composition. We estimated compositional and organizational heterogeneity of genome and chromosome sequences separately and found that both heterogeneity types vary widely among genomes as well as among chromosomes in all investigated taxonomic groups. The high correspondence of heterogeneity scores obtained on three genome fractions, coding, repetitive, and the remaining part of the noncoding DNA (the genome dark matter--GDM) allows the assumption that CS-heterogeneity may have functional relevance to genome regulation. Of special interest for such interpretation is the fact that natural GDM sequences display the highest deviation from the corresponding reshuffled sequences.

  17. Controlled enzymatic cutting of DNA molecules adsorbed on surfaces using soft lithography

    NASA Astrophysics Data System (ADS)

    Auerbach, Alyssa; Budassi, Julia; Shea, Emily; Zhu, Ke; Sokolov, Jonathan

    2013-03-01

    The enzyme DNase I was applied to adsorbed and aligned DNA molecules (Lamda, 48.5 kilobase pairs (kbp), and T4, 165.6 kbp), stretched linearly on a surface, by stamping with a polydimethylsiloxane (PDMS) grating. The DNAs were cut by the enzyme into separated, micron-sized segments along the length of the molecules at positions determined by the grating dimensions (3-20 microns). Ozone-treated PDMS stamps were coated with DNase I solutions and placed in contact with surface-adsorbed DNA molecules deposited on a 750 polymethylmethacrylate (PMMA) film spun-cast onto a silicon substrate. The stamps were applied under pressure for times up to 15 minutes at 37 C. The cutting was observed by fluorescence microscopy imaging of DNA labeled with YOYO dye. Cutting was found to be efficient despite the steric hindrance due to surface attachment of the molecules. Methods for detaching and separating the cut segments for sequencing applications will be discussed. Supported by NSF-DMR program.

  18. Impacts of Chromatin States and Long-Range Genomic Segments on Aging and DNA Methylation

    PubMed Central

    Sun, Dan; Yi, Soojin V.

    2015-01-01

    Understanding the fundamental dynamics of epigenome variation during normal aging is critical for elucidating key epigenetic alterations that affect development, cell differentiation and diseases. Advances in the field of aging and DNA methylation strongly support the aging epigenetic drift model. Although this model aligns with previous studies, the role of other epigenetic marks, such as histone modification, as well as the impact of sampling specific CpGs, must be evaluated. Ultimately, it is crucial to investigate how all CpGs in the human genome change their methylation with aging in their specific genomic and epigenomic contexts. Here, we analyze whole genome bisulfite sequencing DNA methylation maps of brain frontal cortex from individuals of diverse ages. Comparisons with blood data reveal tissue-specific patterns of epigenetic drift. By integrating chromatin state information, divergent degrees and directions of aging-associated methylation in different genomic regions are revealed. Whole genome bisulfite sequencing data also open a new door to investigate whether adjacent CpG sites exhibit coordinated DNA methylation changes with aging. We identified significant ‘aging-segments’, which are clusters of nearby CpGs that respond to aging by similar DNA methylation changes. These segments not only capture previously identified aging-CpGs but also include specific functional categories of genes with implications on epigenetic regulation of aging. For example, genes associated with development are highly enriched in positive aging segments, which are gradually hyper-methylated with aging. On the other hand, regions that are gradually hypo-methylated with aging (‘negative aging segments’) in the brain harbor genes involved in metabolism and protein ubiquitination. Given the importance of protein ubiquitination in proteome homeostasis of aging brains and neurodegenerative disorders, our finding suggests the significance of epigenetic regulation of this posttranslational modification pathway in the aging brain. Utilizing aging segments rather than individual CpGs will provide more comprehensive genomic and epigenomic contexts to understand the intricate associations between genomic neighborhoods and developmental and aging processes. These results complement the aging epigenetic drift model and provide new insights. PMID:26091484

  19. Complete nucleotide sequences of a new bipartite begomovirus from Malvastrum sp. plants with bright yellow mosaic symptoms in South Texas.

    PubMed

    Alabi, Olufemi J; Villegas, Cecilia; Gregg, Lori; Murray, K Daniel

    2016-06-01

    Two isolates of a novel bipartite begomovirus, tentatively named malvastrum bright yellow mosaic virus (MaBYMV), were molecularly characterized from naturally infected plants of the genus Malvastrum showing bright yellow mosaic disease symptoms in South Texas. Six complete DNA-A and five DNA-B genome sequences of MaBYMV obtained from the isolates ranged in length from 2,608 to 2,609 nucleotides (nt) and 2,578 to 2,605 nt, respectively. Both genome segments shared a 178- to 180-nt common region. In pairwise comparisons, the complete DNA-A and DNA-B sequences of MaBYMV were most similar (87-88 % and 79-81 % identity, respectively) and phylogenetically related to the corresponding sequences of sida mosaic Sinaloa virus-[MX-Gua-06]. Further analysis revealed that MaBYMV is a putative recombinant virus, thus supporting the notion that malvaceous hosts may be influencing the evolution of several begomoviruses. The design of new diagnostic primers enabled the detection of MaBYMV in cohorts of Bemisia tabaci collected from symptomatic Malvastrum sp. plants, thus implicating whiteflies as potential vectors of the virus.

  20. DNA sequence divergence among derivatives of Escherichia coli K-12 detected by arbitrary primer PCR (random amplified polymorphic DNA) fingerprinting.

    PubMed Central

    Brikun, I; Suziedelis, K; Berg, D E

    1994-01-01

    Derivatives of Escherichia coli K-12 of known ancestry were characterized by random amplified polymorphic DNA (RAPD) fingerprinting to better understand genome evolution in this family of closely related strains. This sensitive method entails PCR amplification with arbitrary primers at low stringency and yields arrays of anonymous DNA fragments that are strain specific. Among 150 fragments scored, eight were polymorphic in that they were produced from some but not all strains. Seven polymorphic bands were chromosomal, and one was from the F-factor plasmid. Five of the six mapped polymorphic chromosomal bands came from just 7% of the genome, a 340-kb segment that includes the terminus of replication. Two of these were from the cryptic Rac prophage, and the inability to amplify them from strains was attributable to deletion (excision) or to rearrangement of Rac. Two other terminus-region segments that resulted in polymorphic bands appeared to have sustained point mutations that affected the ability to amplify them. Control experiments showed that RAPD bands from the 340-kb terminus-region segment and also from two plasmids (P1 and F) were represented in approximate proportion to their size. Optimization experiments showed that the concentration of thermostable polymerase strongly affected the arrays of RAPD products obtained. Comparison of RAPD polymorphisms and positions of strains exhibiting them in the pedigree suggests that many sequence changes occurred in these historic E. coli strains during their storage. We propose that the clustering of such mutations near the terminus reflects errors during completion of chromosome replication, possibly during slow growth in the stab cultures that were often used to store E. coli strains in the early years of bacterial genetics. Images PMID:8132463

  1. Birth and death of genes linked to chromosomal inversion

    PubMed Central

    Furuta, Yoshikazu; Kawai, Mikihiko; Yahara, Koji; Takahashi, Noriko; Handa, Naofumi; Tsuru, Takeshi; Oshima, Kenshiro; Yoshida, Masaru; Azuma, Takeshi; Hattori, Masahira; Uchiyama, Ikuo; Kobayashi, Ichizo

    2011-01-01

    The birth and death of genes is central to adaptive evolution, yet the underlying genome dynamics remain elusive. The availability of closely related complete genome sequences helps to follow changes in gene contents and clarify their relationship to overall genome organization. Helicobacter pylori, bacteria in our stomach, are known for their extreme genome plasticity through mutation and recombination and will make a good target for such an analysis. In comparing their complete genome sequences, we found that gain and loss of genes (loci) for outer membrane proteins, which mediate host interaction, occurred at breakpoints of chromosomal inversions. Sequence comparison there revealed a unique mechanism of DNA duplication: DNA duplication associated with inversion. In this process, a DNA segment at one chromosomal locus is copied and inserted, in an inverted orientation, into a distant locus on the same chromosome, while the entire region between these two loci is also inverted. Recognition of this and three more inversion modes, which occur through reciprocal recombination between long or short sequence similarity or adjacent to a mobile element, allowed reconstruction of synteny evolution through inversion events in this species. These results will guide the interpretation of extensive DNA sequencing results for understanding long- and short-term genome evolution in various organisms and in cancer cells. PMID:21212362

  2. Violation of an Evolutionarily Conserved Immunoglobulin Diversity Gene Sequence Preference Promotes Production of dsDNA-Specific IgG Antibodies

    PubMed Central

    Silva-Sanchez, Aaron; Liu, Cun Ren; Vale, Andre M.; Khass, Mohamed; Kapoor, Pratibha; Elgavish, Ada; Ivanov, Ivaylo I.; Ippolito, Gregory C.; Schelonka, Robert L.; Schoeb, Trenton R.; Burrows, Peter D.; Schroeder, Harry W.

    2015-01-01

    Variability in the developing antibody repertoire is focused on the third complementarity determining region of the H chain (CDR-H3), which lies at the center of the antigen binding site where it often plays a decisive role in antigen binding. The power of VDJ recombination and N nucleotide addition has led to the common conception that the sequence of CDR-H3 is unrestricted in its variability and random in its composition. Under this view, the immune response is solely controlled by somatic positive and negative clonal selection mechanisms that act on individual B cells to promote production of protective antibodies and prevent the production of self-reactive antibodies. This concept of a repertoire of random antigen binding sites is inconsistent with the observation that diversity (DH) gene segment sequence content by reading frame (RF) is evolutionarily conserved, creating biases in the prevalence and distribution of individual amino acids in CDR-H3. For example, arginine, which is often found in the CDR-H3 of dsDNA binding autoantibodies, is under-represented in the commonly used DH RFs rearranged by deletion, but is a frequent component of rarely used inverted RF1 (iRF1), which is rearranged by inversion. To determine the effect of altering this germline bias in DH gene segment sequence on autoantibody production, we generated mice that by genetic manipulation are forced to utilize an iRF1 sequence encoding two arginines. Over a one year period we collected serial serum samples from these unimmunized, specific pathogen-free mice and found that more than one-fifth of them contained elevated levels of dsDNA-binding IgG, but not IgM; whereas mice with a wild type DH sequence did not. Thus, germline bias against the use of arginine enriched DH sequence helps to reduce the likelihood of producing self-reactive antibodies. PMID:25706374

  3. Simian virus 40 major late promoter: an upstream DNA sequence required for efficient in vitro transcription.

    PubMed Central

    Brady, J; Radonovich, M; Thoren, M; Das, G; Salzman, N P

    1984-01-01

    We have previously identified an 11-base DNA sequence, 5'-G-G-T-A-C-C-T-A-A-C-C-3' (simian virus 40 [SV40] map position 294 to 304), which is important in the control of SV40 late RNA expression in vitro and in vivo (Brady et al., Cell 31:625-633, 1982). We report here the identification of another domain of the SV40 late promoter. A series of mutants with deletions extending from SV40 map position 0 to 300 was prepared by nuclease BAL 31 treatment. The cloned templates were then analyzed for efficiency and accuracy of late SV40 RNA expression in the Manley in vitro transcription system. Our studies showed that, in addition to the promoter domain near map position 300, there are essential DNA sequences between nucleotide positions 74 and 95 that are required for efficient expression of late SV40 RNA. Included in this SV40 DNA sequence were two of the six GGGCGG SV40 repeat sequences and an 11-nucleotide segment which showed strong homology with the upstream sequences required for the efficient in vitro and in vivo expression of the histone H2A gene. This upstream promoter sequence supported transcription with the same efficiency even when it was moved 72 nucleotides closer to the major late cap site. In vitro promoter competition analysis demonstrated that the upstream promoter sequence, independent of the 294 to 304 promoter element, is capable of binding polymerase-transcription factors required for SV40 late gene transcription. Finally, we show that DNA sequences which control the specificity of RNA initiation at nucleotide 325 lie downstream of map position 294. Images PMID:6321950

  4. Chicken immunoglobulin gamma-heavy chains: limited VH gene repertoire, combinatorial diversification by D gene segments and evolution of the heavy chain locus.

    PubMed

    Parvari, R; Avivi, A; Lentner, F; Ziv, E; Tel-Or, S; Burstein, Y; Schechter, I

    1988-03-01

    cDNA clones encoding the variable and constant regions of chicken immunoglobulin (Ig) gamma-chains were obtained from spleen cDNA libraries. Southern blots of kidney DNA show that the variable region sequences of eight cDNA clones reveal the same set of bands corresponding to approximately 30 cross-hybridizing VH genes of one subgroup. Since the VH clones were randomly selected, it is likely that the bulk of chicken H-chains are encoded by a single VH subgroup. Nucleotide sequence determinations of two cDNA clones reveal VH, D, JH and the constant region. The VH segments are closely related to each other (83% homology) as expected for VH or the same subgroup. The JHs are 15 residues long and differ by one amino acid. The Ds differ markedly in sequence (20% homology) and size (10 and 20 residues). These findings strongly indicate multiple (at least two) D genes which by a combinatorial joining mechanism diversify the H-chains, a mechanism which is not operative in the chicken L-chain locus. The most notable among the chicken Igs is the so-called 7S IgG because its H-chain differs in many important aspects from any mammalian IgG. The sequence of the C gamma cDNA reported here resolves this issue. The chicken C gamma is 426 residues long with four CH domains (unlike mammalian C gamma which has three CH domains) and it shows 25% homology to the chicken C mu. The chicken C gamma is most related to the mammalian C epsilon in length, the presence of four CH domains and the distribution of cysteines in the CH1 and CH2 domains. We propose that the unique chicken C gamma is the ancestor of the mammalian C epsilon and C gamma subclasses, and discuss the evolution of the H-chain locus from that of chicken with presumably three genes (mu, gamma, alpha) to the mammalian loci with 8-10 H-chain genes.

  5. DNA sequence analysis of a 10 624 bp fragment of the left arm of chromosome XV from Saccharomyces cerevisiae reveals a RNA binding protein, a mitochondrial protein, two ribosomal proteins and two new open reading frames.

    PubMed

    Lafuente, M J; Gamo, F J; Gancedo, C

    1996-09-01

    We have determined the sequence of a 10624 bp DNA segment located in the left arm of chromosome XV of Saccharomyces cerevisiae. The sequence contains eight open reading frames (ORFs) longer than 100 amino acids. Two of them do not present significant homology with sequences found in the databases. The product of ORF o0553 is identical to the protein encoded by the gene SMF1. Internal to it there is another ORF, o0555 that is apparently expressed. The proteins encoded by ORFs o0559 and o0565 are identical to ribosomal proteins S19.e and L18 respectively. ORF o0550 encodes a protein with an RNA binding signature including RNP motifs and stretches rich in asparagine, glutamine and arginine.

  6. Sequencing degraded DNA from non-destructively sampled museum specimens for RAD-tagging and low-coverage shotgun phylogenetics.

    PubMed

    Tin, Mandy Man-Ying; Economo, Evan Philip; Mikheyev, Alexander Sergeyevich

    2014-01-01

    Ancient and archival DNA samples are valuable resources for the study of diverse historical processes. In particular, museum specimens provide access to biotas distant in time and space, and can provide insights into ecological and evolutionary changes over time. However, archival specimens are difficult to handle; they are often fragile and irreplaceable, and typically contain only short segments of denatured DNA. Here we present a set of tools for processing such samples for state-of-the-art genetic analysis. First, we report a protocol for minimally destructive DNA extraction of insect museum specimens, which produced sequenceable DNA from all of the samples assayed. The 11 specimens analyzed had fragmented DNA, rarely exceeding 100 bp in length, and could not be amplified by conventional PCR targeting the mitochondrial cytochrome oxidase I gene. Our approach made these samples amenable to analysis with commonly used next-generation sequencing-based molecular analytic tools, including RAD-tagging and shotgun genome re-sequencing. First, we used museum ant specimens from three species, each with its own reference genome, for RAD-tag mapping. Were able to use the degraded DNA sequences, which were sequenced in full, to identify duplicate reads and filter them prior to base calling. Second, we re-sequenced six Hawaiian Drosophila species, with millions of years of divergence, but with only a single available reference genome. Despite a shallow coverage of 0.37 ± 0.42 per base, we could recover a sufficient number of overlapping SNPs to fully resolve the species tree, which was consistent with earlier karyotypic studies, and previous molecular studies, at least in the regions of the tree that these studies could resolve. Although developed for use with degraded DNA, all of these techniques are readily applicable to more recent tissue, and are suitable for liquid handling automation.

  7. Self-organizing approach for meta-genomes.

    PubMed

    Zhu, Jianfeng; Zheng, Wei-Mou

    2014-12-01

    We extend the self-organizing approach for annotation of a bacterial genome to analyze the raw sequencing data of the human gut metagenome without sequence assembling. The original approach divides the genomic sequence of a bacterium into non-overlapping segments of equal length and assigns to each segment one of seven 'phases', among which one is for the noncoding regions, three for the direct coding regions to indicate the three possible codon positions of the segment starting site, and three for the reverse coding regions. The noncoding phase and the six coding phases are described by two frequency tables of the 64 triplet types or 'codon usages'. A set of codon usages can be used to update the phase assignment and vice versa. An iteration after an initialization leads to a convergent phase assignment to give an annotation of the genome. In the extension of the approach to a metagenome, we consider a mixture model of a number of categories described by different codon usages. The Illumina Genome Analyzer sequencing data of the total DNA from faecal samples are then examined to understand the diversity of the human gut microbiome. Copyright © 2014 Elsevier Ltd. All rights reserved.

  8. Novel pedigree analysis implicates DNA repair and chromatin remodeling in multiple myeloma risk

    PubMed Central

    Curtin, Karen; Rajamanickam, Venkatesh; Jayabalan, David; Atanackovic, Djordje; Rajkumar, S. Vincent; Kumar, Shaji; Slager, Susan; Galia, Perrine; Demangel, Delphine; Salama, Mohamed; Joseph, Vijai; Lipkin, Steven M.; Dumontet, Charles; Vachon, Celine M.

    2018-01-01

    The high-risk pedigree (HRP) design is an established strategy to discover rare, highly-penetrant, Mendelian-like causal variants. Its success, however, in complex traits has been modest, largely due to challenges of genetic heterogeneity and complex inheritance models. We describe a HRP strategy that addresses intra-familial heterogeneity, and identifies inherited segments important for mapping regulatory risk. We apply this new Shared Genomic Segment (SGS) method in 11 extended, Utah, multiple myeloma (MM) HRPs, and subsequent exome sequencing in SGS regions of interest in 1063 MM / MGUS (monoclonal gammopathy of undetermined significance–a precursor to MM) cases and 964 controls from a jointly-called collaborative resource, including cases from the initial 11 HRPs. One genome-wide significant 1.8 Mb shared segment was found at 6q16. Exome sequencing in this region revealed predicted deleterious variants in USP45 (p.Gln691* and p.Gln621Glu), a gene known to influence DNA repair through endonuclease regulation. Additionally, a 1.2 Mb segment at 1p36.11 is inherited in two Utah HRPs, with coding variants identified in ARID1A (p.Ser90Gly and p.Met890Val), a key gene in the SWI/SNF chromatin remodeling complex. Our results provide compelling statistical and genetic evidence for segregating risk variants for MM. In addition, we demonstrate a novel strategy to use large HRPs for risk-variant discovery more generally in complex traits. PMID:29389935

  9. Novel pedigree analysis implicates DNA repair and chromatin remodeling in multiple myeloma risk.

    PubMed

    Waller, Rosalie G; Darlington, Todd M; Wei, Xiaomu; Madsen, Michael J; Thomas, Alun; Curtin, Karen; Coon, Hilary; Rajamanickam, Venkatesh; Musinsky, Justin; Jayabalan, David; Atanackovic, Djordje; Rajkumar, S Vincent; Kumar, Shaji; Slager, Susan; Middha, Mridu; Galia, Perrine; Demangel, Delphine; Salama, Mohamed; Joseph, Vijai; McKay, James; Offit, Kenneth; Klein, Robert J; Lipkin, Steven M; Dumontet, Charles; Vachon, Celine M; Camp, Nicola J

    2018-02-01

    The high-risk pedigree (HRP) design is an established strategy to discover rare, highly-penetrant, Mendelian-like causal variants. Its success, however, in complex traits has been modest, largely due to challenges of genetic heterogeneity and complex inheritance models. We describe a HRP strategy that addresses intra-familial heterogeneity, and identifies inherited segments important for mapping regulatory risk. We apply this new Shared Genomic Segment (SGS) method in 11 extended, Utah, multiple myeloma (MM) HRPs, and subsequent exome sequencing in SGS regions of interest in 1063 MM / MGUS (monoclonal gammopathy of undetermined significance-a precursor to MM) cases and 964 controls from a jointly-called collaborative resource, including cases from the initial 11 HRPs. One genome-wide significant 1.8 Mb shared segment was found at 6q16. Exome sequencing in this region revealed predicted deleterious variants in USP45 (p.Gln691* and p.Gln621Glu), a gene known to influence DNA repair through endonuclease regulation. Additionally, a 1.2 Mb segment at 1p36.11 is inherited in two Utah HRPs, with coding variants identified in ARID1A (p.Ser90Gly and p.Met890Val), a key gene in the SWI/SNF chromatin remodeling complex. Our results provide compelling statistical and genetic evidence for segregating risk variants for MM. In addition, we demonstrate a novel strategy to use large HRPs for risk-variant discovery more generally in complex traits.

  10. Selection of a DNA barcode for Nectriaceae from fungal whole-genomes.

    PubMed

    Zeng, Zhaoqing; Zhao, Peng; Luo, Jing; Zhuang, Wenying; Yu, Zhihe

    2012-01-01

    A DNA barcode is a short segment of sequence that is able to distinguish species. A barcode must ideally contain enough variation to distinguish every individual species and be easily obtained. Fungi of Nectriaceae are economically important and show high species diversity. To establish a standard DNA barcode for this group of fungi, the genomes of Neurospora crassa and 30 other filamentous fungi were compared. The expect value was treated as a criterion to recognize homologous sequences. Four candidate markers, Hsp90, AAC, CDC48, and EF3, were tested for their feasibility as barcodes in the identification of 34 well-established species belonging to 13 genera of Nectriaceae. Two hundred and fifteen sequences were analyzed. Intra- and inter-specific variations and the success rate of PCR amplification and sequencing were considered as important criteria for estimation of the candidate markers. Ultimately, the partial EF3 gene met the requirements for a good DNA barcode: No overlap was found between the intra- and inter-specific pairwise distances. The smallest inter-specific distance of EF3 gene was 3.19%, while the largest intra-specific distance was 1.79%. In addition, there was a high success rate in PCR and sequencing for this gene (96.3%). CDC48 showed sufficiently high sequence variation among species, but the PCR and sequencing success rate was 84% using a single pair of primers. Although the Hsp90 and AAC genes had higher PCR and sequencing success rates (96.3% and 97.5%, respectively), overlapping occurred between the intra- and inter-specific variations, which could lead to misidentification. Therefore, we propose the EF3 gene as a possible DNA barcode for the nectriaceous fungi.

  11. Molecular genetic characterization of the RD-114 gene family of endogenous feline retroviral sequences.

    PubMed Central

    Reeves, R H; O'Brien, S J

    1984-01-01

    RD-114 is a replication-competent, xenotropic retrovirus which is homologous to a family of moderately repetitive DNA sequences present at ca. 20 copies in the normal cellular genome of domestic cats. To examine the extent and character of genomic divergence of the RD-114 gene family as well as to assess their positional association within the cat genome, we have prepared a series of molecular clones of endogenous RD-114 DNA segments from a genomic library of cat cellular DNA. Their restriction endonuclease maps were compared with each other as well as to that of the prototype-inducible RD-114 which was molecularly cloned from a chronically infected human cell line. The endogenous sequences analyzed were similar to each other in that they were colinear with RD-114 proviral DNA, were bounded by long terminal redundancies, and conserved many restriction sites in the gag and pol regions. However, the env regions of many of the sequences examined were substantially deleted. Several of the endogenous RD-114 genomes contained a novel envelope sequence which was unrelated to the env gene of the prototype RD-114 env gene but which, like RD-114 and endogenous feline leukemia virus provirus, was found only in species of the genus Felis, and not in other closely related Felidae genera. The endogenous RD-114 sequences each had a distinct cellular flank which indicates that these sequences are not tandem but dispersed nonspecifically throughout the genome. Southern analysis of cat cellular DNA confirmed the conclusions about conserved restriction sites in endogenous sequences and indicated that a single locus may be responsible for the production of the major inducible form of RD-114. Images PMID:6090693

  12. Genetic analysis of 7 medieval skeletons from Aragonese Pyrenees

    PubMed Central

    Núńez, Carolina; Sosa, Cecilia; Baeta, Miriam; Geppert, Maria; Turnbough, Meredith; Phillips, Nicole; Casalod, Yolanda; Bolea, Miguel; Roby, Rhonda; Budowle, Bruce; Martínez-Jarreta, Begońa

    2011-01-01

    Aim To perform a genetic characterization of 7 skeletons from medieval age found in a burial site in the Aragonese Pyrenees. Methods Allele frequencies of autosomal short tandem repeats (STR) loci were determined by 3 different STR systems. Mitochondrial DNA (mtDNA) and Y-chromosome haplogroups were determined by sequencing of the hypervariable segment 1 of mtDNA and typing of phylogenetic Y chromosome single nucleotide polymorphisms (Y-SNP) markers, respectively. Possible familial relationships were also investigated. Results Complete or partial STR profiles were obtained in 3 of the 7 samples. Mitochondrial DNA haplogroup was determined in 6 samples, with 5 of them corresponding to the haplogroup H and 1 to the haplogroup U5a. Y-chromosome haplogroup was determined in 2 samples, corresponding to the haplogroup R. In one of them, the sub-branch R1b1b2 was determined. mtDNA sequences indicated that some of the individuals could be maternally related, while STR profiles indicated no direct family relationships. Conclusions Despite the antiquity of the samples and great difficulty that genetic analyses entail, the combined use of autosomal STR markers, Y-chromosome informative SNPs, and mtDNA sequences allowed us to genotype a group of skeletons from the medieval age. PMID:21674829

  13. Megabase sequencing of human genome by ordered-shotgun-sequencing (OSS) strategy

    NASA Astrophysics Data System (ADS)

    Chen, Ellson Y.

    1997-05-01

    So far we have used OSS strategy to sequence over 2 megabases DNA in large-insert clones from regions of human X chromosomes with different characteristic levels of GC content. The method starts by randomly fragmenting a BAC, YAC or PAC to 8-12 kb pieces and subcloning those into lambda phage. Insert-ends of these clones are sequenced and overlapped to create a partial map. Complete sequencing is then done on a minimal tiling path of selected subclones, recursively focusing on those at the edges of contigs to facilitate mergers of clones across the entire target. To reduce manual labor, PCR processes have been adapted to prepare sequencing templates throughout the entire operation. The streamlined process can thus lend itself to further automation. The OSS approach is suitable for large- scale genomic sequencing, providing considerable flexibility in the choice of subclones or regions for more or less intensive sequencing. For example, subclones containing contaminating host cell DNA or cloning vector can be recognized and ignored with minimal sequencing effort; regions overlapping a neighboring clone already sequenced need not be redone; and segments containing tandem repeats or long repetitive sequences can be spotted early on and targeted for additional attention.

  14. Sinorhizobium meliloti strains TII7 and A5 by Multilocus Sequence Typing (MLST) have chromsomes identical with Rm1021 and form an effective and ineffective symbiosis with Medicago truncatula line Jemalong A17, respectively

    USDA-ARS?s Scientific Manuscript database

    The strains TII7 and A5 formed an effective and ineffective symbiosis with Medicago truncatula Jemalong A17, respectively. Both were shown to have identical chromsomes with strains Rm1021 and RCR2011 using a Multilocus Sequence Typing method. The 2260 bp segments of DNA stretching from the 3’ end ...

  15. String Mining in Bioinformatics

    NASA Astrophysics Data System (ADS)

    Abouelhoda, Mohamed; Ghanem, Moustafa

    Sequence analysis is a major area in bioinformatics encompassing the methods and techniques for studying the biological sequences, DNA, RNA, and proteins, on the linear structure level. The focus of this area is generally on the identification of intra- and inter-molecular similarities. Identifying intra-molecular similarities boils down to detecting repeated segments within a given sequence, while identifying inter-molecular similarities amounts to spotting common segments among two or multiple sequences. From a data mining point of view, sequence analysis is nothing but string- or pattern mining specific to biological strings. For a long time, this point of view, however, has not been explicitly embraced neither in the data mining nor in the sequence analysis text books, which may be attributed to the co-evolution of the two apparently independent fields. In other words, although the word "data-mining" is almost missing in the sequence analysis literature, its basic concepts have been implicitly applied. Interestingly, recent research in biological sequence analysis introduced efficient solutions to many problems in data mining, such as querying and analyzing time series [49,53], extracting information from web pages [20], fighting spam mails [50], detecting plagiarism [22], and spotting duplications in software systems [14].

  16. String Mining in Bioinformatics

    NASA Astrophysics Data System (ADS)

    Abouelhoda, Mohamed; Ghanem, Moustafa

    Sequence analysis is a major area in bioinformatics encompassing the methods and techniques for studying the biological sequences, DNA, RNA, and proteins, on the linear structure level. The focus of this area is generally on the identification of intra- and inter-molecular similarities. Identifying intra-molecular similarities boils down to detecting repeated segments within a given sequence, while identifying inter-molecular similarities amounts to spotting common segments among two or multiple sequences. From a data mining point of view, sequence analysis is nothing but string- or pattern mining specific to biological strings. For a long time, this point of view, however, has not been explicitly embraced neither in the data mining nor in the sequence analysis text books, which may be attributed to the co-evolution of the two apparently independent fields. In other words, although the word “data-mining” is almost missing in the sequence analysis literature, its basic concepts have been implicitly applied. Interestingly, recent research in biological sequence analysis introduced efficient solutions to many problems in data mining, such as querying and analyzing time series [49,53], extracting information from web pages [20], fighting spam mails [50], detecting plagiarism [22], and spotting duplications in software systems [14].

  17. Identification of genes from pattern formation, tyrosine kinase, and potassium channel families by DNA amplification

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Kamb, A.; Weir, M.; Rudy, B.

    1989-06-01

    The study of gene family members has been aided by the isolation of related genes on the basis of DNA homology. The authors have adapted the polymerase chain reaction to screen animal genomes very rapidly and reliably for likely gene family members. Using conserved amino acid sequences to design degenerate oligonucleotide primers, they have shown that the genome of the nematode Caenorhabditis elegans contains sequences homologous to many Drosophila genes involved in pattern formation, including the segment polarity gene wingless (vertebrate int-1), and homeobox sequences characteristic of the Antennapedia, engrailed, and paired families. In addition, they have used this methodmore » to show that C. elegans contains at least five different sequences homologous to genes in the tyrosine kinase family. Lastly, they have isolated six potassium channel sequences from humans, a result that validates the utility of the method with large genomes and suggests that human potassium channel gene diversity may be extensive.« less

  18. Discovery of a novel HLA-B*51 variant, B*51:112, in a Taiwanese bone marrow donor and identification of the plausible HLA haplotype in association with B*51:112.

    PubMed

    Yang, K L; Lee, S K; Lin, P Y

    2012-10-01

    The sequence of B*51:112 is identical to the sequence of B*51:01:01 in exons 2, 3 and 4, except the nucleotides at positions 206 (C→A) and 213 (C→G). The nucleotide replacement caused one amino acid substitution at residue 45 (T→K). The plausible HLA-A, -B and -DRB1 haplotype in association with B*51:112 may be deduced as HLA-A*02-B*51:112-DRB1*12. The generation of B*51:112 was probably as the result of a DNA recombination event where B*40:01:01 acted as a sequence donor donating a segment of the DNA sequence to the recipient sequence B*51:01:01. The donor carrying B*51:112 was a Minna Taiwanese whose ancestor came to Taiwan from the southern region of China. © 2012 Blackwell Publishing Ltd.

  19. Simultaneous Binding of Hybrid Molecules Constructed with Dual DNA-Binding Components to a G-Quadruplex and Its Proximal Duplex.

    PubMed

    Asamitsu, Sefan; Obata, Shunsuke; Phan, Anh Tuân; Hashiya, Kaori; Bando, Toshikazu; Sugiyama, Hiroshi

    2018-03-20

    A G-quadruplex (quadruplex) is a nucleic acid secondary structure adopted by guanine-rich sequences and is considered to be relevant to various pharmacological and biological contexts. Although a number of researchers have endeavored to discover and develop quadruplex-interactive molecules, poor ligand designability originating from topological similarity of the skeleton of diverse quadruplexes has remained a bottleneck for gaining specificity for individual quadruplexes. This work reports on hybrid molecules that were constructed with dual DNA-binding components, a cyclic imidazole/lysine polyamide (cIKP), and a hairpin pyrrole/imidazole polyamide (hPIP), with the aim toward specific quadruplex targeting by reading out the local duplex DNA sequence adjacent to designated quadruplexes in the genome. By means of circular dichroism (CD), fluorescence resonance energy transfer (FRET), surface plasmon resonance (SPR), and NMR techniques, we showed the dual and simultaneous recognition of the respective segment via hybrid molecules, and the synergistic and mutual effect of each binding component that was appropriately linked on higher binding affinity and modest sequence specificity. Monitoring quadruplex and duplex imino protons of the quadruplex/duplex motif titrated with hybrid molecules clearly revealed distinct features of the binding of hybrid molecules to the respective segments upon their simultaneous recognition. A series of the systematic and detailed binding assays described here showed that the concept of simultaneous recognition of quadruplex and its proximal duplex by hybrid molecules constructed with the dual DNA-binding components may provide a new strategy for ligand design, enabling targeting of a large variety of designated quadruplexes at specific genome locations. © 2018 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.

  20. Evolutionary Origin of OwlRep, a Megasatellite DNA Associated with Adaptation of Owl Monkeys to Nocturnal Lifestyle

    PubMed Central

    Nishihara, Hidenori; Stanyon, Roscoe; Kusumi, Junko; Hirai, Hirohisa

    2018-01-01

    Abstract Rod cells of many nocturnal mammals have a “non-standard” nuclear architecture, which is called the inverted nuclear architecture. Heterochromatin localizes to the central region of the nucleus. This leads to an efficient light transmission to the outer segments of photoreceptors. Rod cells of diurnal mammals have the conventional nuclear architecture. Owl monkeys (genus Aotus) are the only taxon of simian primates that has a nocturnal or cathemeral lifestyle, and this adaptation is widely thought to be secondary. Their rod cells were shown to exhibit an intermediate chromatin distribution: a spherical heterochromatin block was found in the central region of the nucleus although it was less complete than that of typical nocturnal mammals. We recently demonstrated that the primary DNA component of this heterochromatin block was OwlRep, a megasatellite DNA consisting of 187-bp-long repeat units. However, the origin of OwlRep was not known. Here we show that OwlRep was derived from HSAT6, a simple repeat sequence found in the centromere regions of human chromosomes. HSAT6 occurs widely in primates, suggesting that it was already present in the last common ancestor of extant primates. Notably, Strepsirrhini and Tarsiformes apparently carry a single HSAT6 copy, whereas many species of Simiiformes contain multiple copies. Comparison of nucleotide sequences of these copies revealed the entire process of the OwlRep formation. HSAT6, with or without flanking sequences, was segmentally duplicated in New World monkeys. Then, in the owl monkey linage after its divergence from other New World monkeys, a copy of HSAT6 was tandemly amplified, eventually forming a megasatellite DNA. PMID:29294004

  1. Mechanism of duplex DNA destabilization by RNA-guided Cas9 nuclease during target interrogation

    PubMed Central

    Mekler, Vladimir; Minakhin, Leonid; Severinov, Konstantin

    2017-01-01

    The prokaryotic clustered regularly interspaced short palindromic repeats (CRISPR)-associated 9 (Cas9) endonuclease cleaves double-stranded DNA sequences specified by guide RNA molecules and flanked by a protospacer adjacent motif (PAM) and is widely used for genome editing in various organisms. The RNA-programmed Cas9 locates the target site by scanning genomic DNA. We sought to elucidate the mechanism of initial DNA interrogation steps that precede the pairing of target DNA with guide RNA. Using fluorometric and biochemical assays, we studied Cas9/guide RNA complexes with model DNA substrates that mimicked early intermediates on the pathway to the final Cas9/guide RNA–DNA complex. The results show that Cas9/guide RNA binding to PAM favors separation of a few PAM-proximal protospacer base pairs allowing initial target interrogation by guide RNA. The duplex destabilization is mediated, in part, by Cas9/guide RNA affinity for unpaired segments of nontarget strand DNA close to PAM. Furthermore, our data indicate that the entry of double-stranded DNA beyond a short threshold distance from PAM into the Cas9/single-guide RNA (sgRNA) interior is hindered. We suggest that the interactions unfavorable for duplex DNA binding promote DNA bending in the PAM-proximal region during early steps of Cas9/guide RNA–DNA complex formation, thus additionally destabilizing the protospacer duplex. The mechanism that emerges from our analysis explains how the Cas9/sgRNA complex is able to locate the correct target sequence efficiently while interrogating numerous nontarget sequences associated with correct PAMs. PMID:28484024

  2. Mechanism of duplex DNA destabilization by RNA-guided Cas9 nuclease during target interrogation.

    PubMed

    Mekler, Vladimir; Minakhin, Leonid; Severinov, Konstantin

    2017-05-23

    The prokaryotic clustered regularly interspaced short palindromic repeats (CRISPR)-associated 9 (Cas9) endonuclease cleaves double-stranded DNA sequences specified by guide RNA molecules and flanked by a protospacer adjacent motif (PAM) and is widely used for genome editing in various organisms. The RNA-programmed Cas9 locates the target site by scanning genomic DNA. We sought to elucidate the mechanism of initial DNA interrogation steps that precede the pairing of target DNA with guide RNA. Using fluorometric and biochemical assays, we studied Cas9/guide RNA complexes with model DNA substrates that mimicked early intermediates on the pathway to the final Cas9/guide RNA-DNA complex. The results show that Cas9/guide RNA binding to PAM favors separation of a few PAM-proximal protospacer base pairs allowing initial target interrogation by guide RNA. The duplex destabilization is mediated, in part, by Cas9/guide RNA affinity for unpaired segments of nontarget strand DNA close to PAM. Furthermore, our data indicate that the entry of double-stranded DNA beyond a short threshold distance from PAM into the Cas9/single-guide RNA (sgRNA) interior is hindered. We suggest that the interactions unfavorable for duplex DNA binding promote DNA bending in the PAM-proximal region during early steps of Cas9/guide RNA-DNA complex formation, thus additionally destabilizing the protospacer duplex. The mechanism that emerges from our analysis explains how the Cas9/sgRNA complex is able to locate the correct target sequence efficiently while interrogating numerous nontarget sequences associated with correct PAMs.

  3. Somatic hypermutation and junctional diversification at Ig heavy chain loci in the nurse shark.

    PubMed

    Malecek, Karolina; Brandman, Julie; Brodsky, Jennie E; Ohta, Yuko; Flajnik, Martin F; Hsu, Ellen

    2005-12-15

    We estimate there are approximately 15 IgM H chain loci in the nurse shark genome and have characterized one locus. It consists of one V, two D, and one J germline gene segments, and the constant (C) region can be distinguished from all of the others by a unique combination of restriction endonuclease sites in Cmu2. On the basis of these Cmu2 markers, 22 cDNA clones were selected from an epigonal organ cDNA library from the same individual; their C region sequences proved to be the same up to the polyadenylation site. With the identification of the corresponding germline gene segments, CDR3 from shark H chain rearrangements could be analyzed precisely, for the first time. Considerable diversity was generated by trimming and N addition at the three junctions and by varied recombination patterns of the two D gene segments. The cDNA sequences originated from independent rearrangements events, and most carried both single and contiguous substitutions. The 53 point mutations occurred with a bias for transition changes (53%), whereas the 78 tandem substitutions, mostly 2-4 bp long, do not (36%). The nature of the substitution patterns is the same as for mutants from six loci of two nurse shark L chain isotypes, showing that somatic hypermutation events are very similar at both H and L chain genes in this early vertebrate. The cis-regulatory elements targeting somatic hypermutation must have already existed in the ancestral Ig gene, before H and L chain divergence.

  4. Application of molecular genetics method for differentiating Martes zibellina L. heart from its adulterants in traditional Chinese medicine based on mitochondrial cytochrome b gene.

    PubMed

    Li, Mingcheng; Xia, Wei; Wang, Miao; Yang, Mingyan; Zhang, Lihua; Guo, Jie

    2014-02-01

    The use of Martes zibellina L. heart as a famous kind of traditional Chinese medicine has been documented for many years in China. Identification of its authenticity as raw materials became a key in controlling of herbal preparations. In this study, the characteristics of mitochondrial cytochrome b (Cyt b) gene from four species of Martes were explored, and a specific molecular genetics technique for identifying the heart of M. zibellina L. in addition to some close relatives from their counterfeits was established. The bioinformatics was carried out to design the primers for the Cyt b gene based on the different species of Martes. PCR and sequencing technology were performed. The mt DNA was extracted from all of fresh M. zibellina L., Martes melampus. Martes flavigula. Martes martes heart samples and dry M. zibellina L. heart powder through the modified alkaline extracting method in addition to its counterfeits including the chicken heart, duck heart, goose heart, rabbit heart and Mustela vison. The complete mt DNA was separated from all samples used in the study, and the Cyt b gene with 310 bp segments was amplified only from M. zibellina L. heart as DNA template by the PCR technique. The sequencing indicated that the segment amplified by the PCR was homologous with the species of M. zibellina in GenBank. The data revealed that the primers and selected segment could be used as the genetic markers to identify M. zibellina L. heart from its counterfeits among different animal species.

  5. Genetic relationships of meadow vole (Microtus pennsylvanicus) populations in central Appalachian wetlands

    Treesearch

    K. E. Francl; T. C. Glenn; S. B. Castleberry; W. M. Ford

    2008-01-01

    We sequenced and compared variation within a 375-base-pair segment of the mitochondrial DNA control region of 323 meadow voles (Microtus pennsylvanicus (Ord. 1815)) among 14 populations to determine the influence of past and present landscape connectivity among isolated wetlands in the central Appalachian Mountains. To best explain observed...

  6. Generation and reactivation of T-cell receptor A joining region pseudogenes in primates

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Thiel, C.; Lanchbury, J.S.; Otting, N.

    1996-06-01

    Tandemly duplicated T-cell receptor (Tcr) AJ (J{alpha}) segments contribute significantly to TCRA chain junctional region diversity in mammals. Since only limited data exists on TCRA diversity in nonhuman primates, we examined the TCRAJ regions of 37 chimpanzee and 71 rhesus macaque TCRA cDNA clones derived from inverse polymerase chain reaction on peripheral blood mononuclear cell cDNA of healthy animals. Twenty-five different TCRAJ regions were characterized in the chimpanzee and 36 in the rhesus macaque. Each bears a close structural relationship to an equivalent human TCRAJ region. Conserved amino acid motifs are shared between all three species. There are indications thatmore » differences between nonhuman primates and humans exist in the generation of TCRAJ pseudogenes. The nucleotide and amino acid sequences of the various characterized TCRAJ of each species are reported and we compare our results to the available information on human genomic sequences. Although we provide evidence of dynamic processes modifying TCRAJ segments during primate evolution, their repertoire and primary structure appears to be relatively conserved. 21 refs., 2 figs.« less

  7. In and out of the rRNA genes: characterization of Pokey elements in the sequenced Daphnia genome

    PubMed Central

    2013-01-01

    Background Only a few transposable elements are known to exhibit site-specific insertion patterns, including the well-studied R-element retrotransposons that insert into specific sites within the multigene rDNA. The only known rDNA-specific DNA transposon, Pokey (superfamily: piggyBac) is found in the freshwater microcrustacean, Daphnia pulex. Here, we present a genome-wide analysis of Pokey based on the recently completed whole genome sequencing project for D. pulex. Results Phylogenetic analysis of Pokey elements recovered from the genome sequence revealed the presence of four lineages corresponding to two divergent autonomous families and two related lineages of non-autonomous miniature inverted repeat transposable elements (MITEs). The MITEs are also found at the same 28S rRNA gene insertion site as the Pokey elements, and appear to have arisen as deletion derivatives of autonomous elements. Several copies of the full-length Pokey elements may be capable of producing an active transposase. Surprisingly, both families of Pokey possess a series of 200 bp repeats upstream of the transposase that is derived from the rDNA intergenic spacer (IGS). The IGS sequences within the Pokey elements appear to be evolving in concert with the rDNA units. Finally, analysis of the insertion sites of Pokey elements outside of rDNA showed a target preference for sites similar to the specific sequence that is targeted within rDNA. Conclusions Based on the target site preference of Pokey elements and the concerted evolution of a segment of the element with the rDNA unit, we propose an evolutionary path by which the ancestors of Pokey elements have invaded the rDNA niche. We discuss how specificity for the rDNA unit may have evolved and how this specificity has played a role in the long-term survival of these elements in the subgenus Daphnia. PMID:24059783

  8. Phylogeographic Analysis of Mitochondrial DNA in Northern Asian Populations

    PubMed Central

    Derenko, Miroslava ; Malyarchuk, Boris ; Grzybowski, Tomasz ; Denisova, Galina ; Dambueva, Irina ; Perkova, Maria ; Dorzhu, Choduraa ; Luzina, Faina ; Lee, Hong Kyu ; Vanecek, Tomas ; Villems, Richard ; Zakharov, Ilia 

    2007-01-01

    To elucidate the human colonization process of northern Asia and human dispersals to the Americas, a diverse subset of 71 mitochondrial DNA (mtDNA) lineages was chosen for complete genome sequencing from the collection of 1,432 control-region sequences sampled from 18 autochthonous populations of northern, central, eastern, and southwestern Asia. On the basis of complete mtDNA sequencing, we have revised the classification of haplogroups A, D2, G1, M7, and I; identified six new subhaplogroups (I4, N1e, G1c, M7d, M7e, and J1b2a); and fully characterized haplogroups N1a and G1b, which were previously described only by the first hypervariable segment (HVS1) sequencing and coding-region restriction-fragment–length polymorphism analysis. Our findings indicate that the southern Siberian mtDNA pool harbors several lineages associated with the Late Upper Paleolithic and/or early Neolithic dispersals from both eastern Asia and southwestern Asia/southern Caucasus. Moreover, the phylogeography of the D2 lineages suggests that southern Siberia is likely to be a geographical source for the last postglacial maximum spread of this subhaplogroup to northern Siberia and that the expansion of the D2b branch occurred in Beringia ∼7,000 years ago. In general, a detailed analysis of mtDNA gene pools of northern Asians provides the additional evidence to rule out the existence of a northern Asian route for the initial human colonization of Asia. PMID:17924343

  9. Phylogeographic analysis of mitochondrial DNA in northern Asian populations.

    PubMed

    Derenko, Miroslava; Malyarchuk, Boris; Grzybowski, Tomasz; Denisova, Galina; Dambueva, Irina; Perkova, Maria; Dorzhu, Choduraa; Luzina, Faina; Lee, Hong Kyu; Vanecek, Tomas; Villems, Richard; Zakharov, Ilia

    2007-11-01

    To elucidate the human colonization process of northern Asia and human dispersals to the Americas, a diverse subset of 71 mitochondrial DNA (mtDNA) lineages was chosen for complete genome sequencing from the collection of 1,432 control-region sequences sampled from 18 autochthonous populations of northern, central, eastern, and southwestern Asia. On the basis of complete mtDNA sequencing, we have revised the classification of haplogroups A, D2, G1, M7, and I; identified six new subhaplogroups (I4, N1e, G1c, M7d, M7e, and J1b2a); and fully characterized haplogroups N1a and G1b, which were previously described only by the first hypervariable segment (HVS1) sequencing and coding-region restriction-fragment-length polymorphism analysis. Our findings indicate that the southern Siberian mtDNA pool harbors several lineages associated with the Late Upper Paleolithic and/or early Neolithic dispersals from both eastern Asia and southwestern Asia/southern Caucasus. Moreover, the phylogeography of the D2 lineages suggests that southern Siberia is likely to be a geographical source for the last postglacial maximum spread of this subhaplogroup to northern Siberia and that the expansion of the D2b branch occurred in Beringia ~7,000 years ago. In general, a detailed analysis of mtDNA gene pools of northern Asians provides the additional evidence to rule out the existence of a northern Asian route for the initial human colonization of Asia.

  10. Local Renyi entropic profiles of DNA sequences.

    PubMed

    Vinga, Susana; Almeida, Jonas S

    2007-10-16

    In a recent report the authors presented a new measure of continuous entropy for DNA sequences, which allows the estimation of their randomness level. The definition therein explored was based on the Rényi entropy of probability density estimation (pdf) using the Parzen's window method and applied to Chaos Game Representation/Universal Sequence Maps (CGR/USM). Subsequent work proposed a fractal pdf kernel as a more exact solution for the iterated map representation. This report extends the concepts of continuous entropy by defining DNA sequence entropic profiles using the new pdf estimations to refine the density estimation of motifs. The new methodology enables two results. On the one hand it shows that the entropic profiles are directly related with the statistical significance of motifs, allowing the study of under and over-representation of segments. On the other hand, by spanning the parameters of the kernel function it is possible to extract important information about the scale of each conserved DNA region. The computational applications, developed in Matlab m-code, the corresponding binary executables and additional material and examples are made publicly available at http://kdbio.inesc-id.pt/~svinga/ep/. The ability to detect local conservation from a scale-independent representation of symbolic sequences is particularly relevant for biological applications where conserved motifs occur in multiple, overlapping scales, with significant future applications in the recognition of foreign genomic material and inference of motif structures.

  11. Local Renyi entropic profiles of DNA sequences

    PubMed Central

    Vinga, Susana; Almeida, Jonas S

    2007-01-01

    Background In a recent report the authors presented a new measure of continuous entropy for DNA sequences, which allows the estimation of their randomness level. The definition therein explored was based on the Rényi entropy of probability density estimation (pdf) using the Parzen's window method and applied to Chaos Game Representation/Universal Sequence Maps (CGR/USM). Subsequent work proposed a fractal pdf kernel as a more exact solution for the iterated map representation. This report extends the concepts of continuous entropy by defining DNA sequence entropic profiles using the new pdf estimations to refine the density estimation of motifs. Results The new methodology enables two results. On the one hand it shows that the entropic profiles are directly related with the statistical significance of motifs, allowing the study of under and over-representation of segments. On the other hand, by spanning the parameters of the kernel function it is possible to extract important information about the scale of each conserved DNA region. The computational applications, developed in Matlab m-code, the corresponding binary executables and additional material and examples are made publicly available at . Conclusion The ability to detect local conservation from a scale-independent representation of symbolic sequences is particularly relevant for biological applications where conserved motifs occur in multiple, overlapping scales, with significant future applications in the recognition of foreign genomic material and inference of motif structures. PMID:17939871

  12. An innovative platform for quick and flexible joining of assorted DNA fragments

    DOE PAGES

    De Paoli, Henrique Cestari; Tuskan, Gerald A.; Yang, Xiaohan

    2016-01-13

    Successful synthetic biology efforts rely on conceptual and experimental designs in combination with testing of multi-gene constructs. Despite recent progresses, several limitations still hinder the ability to flexibly assemble and collectively share different types of DNA segments. We describe an advanced system for joining DNA fragments from a universal library that automatically maintains open reading frames (ORFs) and does not require linkers, adaptors, sequence homology, amplification or mutation (domestication) of fragments in order to work properly. Moreover, we find that this system, which is enhanced by a unique buffer formulation, provides unforeseen capabilities for testing, and sharing, complex multi-gene circuitrymore » assembled from different DNA fragments.« less

  13. Mitochondrial DNA history of Sri Lankan ethnic people: their relations within the island and with the Indian subcontinental populations.

    PubMed

    Ranaweera, Lanka; Kaewsutthi, Supannee; Win Tun, Aung; Boonyarit, Hathaichanoke; Poolsuwan, Samerchai; Lertrit, Patcharee

    2014-01-01

    Located only a short distance off the southernmost shore of the Greater Indian subcontinent, the island of Sri Lanka has long been inhabited by various ethnic populations. Mainly comprising the Vedda, Sinhalese (Up- and Low-country) and Tamil (Sri Lankan and Indian); their history of settlements on the island and the biological relationships among them have remained obscure. It has been hypothesized that the Vedda was probably the earliest inhabitants of the area, followed by Sinhalese and Tamil from the Indian mainland. This study, in which 271 individuals, representing the Sri Lankan ethnic populations mentioned, were typed for their mitochondrial DNA (mtDNA) hypervariable segment 1 (HVS-1) and part of hypervariable segment 2 (HVS-2), provides implications for their settlement history on the island. From the phylogenetic, principal coordinate and analysis of molecular variance results, the Vedda occupied a position separated from all other ethnic people of the island, who formed relatively close affiliations among themselves, suggesting a separate origin of the former. The haplotypes and analysis of molecular variance revealed that Vedda people's mitochondrial sequences are more related to the Sinhalese and Sri Lankan Tamils' than the Indian Tamils' sequences. MtDNA haplogroup analysis revealed that several West Eurasian haplogroups as well as Indian-specific mtDNA clades were found amongst the Sri Lankan populations. Through a comparison with the mtDNA HVS-1 and part of HVS-2 of Indian database, both Tamils and Sinhalese clusters were affiliated with Indian subcontinent populations than Vedda people who are believed to be the native population of the island of Sri Lanka.

  14. Knockdown resistance (kdr)-like mutations in the voltage-gated sodium channel of a malaria vector Anopheles stephensi and PCR assays for their detection.

    PubMed

    Singh, Om P; Dykes, Cherry L; Lather, Manila; Agrawal, Om P; Adak, Tridibes

    2011-03-14

    Knockdown resistance (kdr) in insects, resulting from mutation(s) in the voltage-gated sodium channel (vgsc) gene is one of the mechanisms of resistance against DDT and pyrethroid-group of insecticides. The most common mutation(s) associated with knockdown resistance in insects, including anophelines, has been reported to be present at residue Leu1014 in the IIS6 transmembrane segment of the vgsc gene. This study reports the presence of two alternative kdr-like mutations, L1014S and L1014F, at this residue in a major malaria vector Anopheles stephensi and describes new PCR assays for their detection. Part of the vgsc (IIS4-S5 linker-to-IIS6 transmembrane segment) of An. stephensi collected from Alwar (Rajasthan, India) was PCR-amplified from genomic DNA, sequenced and analysed for the presence of deduced amino acid substitution(s). Analysis of DNA sequences revealed the presence of two alternative non-synonymous point mutations at L1014 residue in the IIS6 transmembrane segment of vgsc, i.e., T>C mutation on the second position and A>T mutation on the third position of the codon, leading to Leu (TTA)-to-Ser (TCA) and -Phe (TTT) amino acid substitutions, respectively. Polymerase chain reaction (PCR) assays were developed for identification of each of these two point mutations. Genotyping of An. stephensi mosquitoes from Alwar by PCR assays revealed the presence of both mutations, with a high frequency of L1014S. The PCR assays developed for detection of the kdr mutations were specific as confirmed by DNA sequencing of PCR-genotyped samples. Two alternative kdr-like mutations, L1014S and L1014F, were detected in An. stephensi with a high allelic frequency of L1014S. The occurrence of L1014S is being reported for the first time in An. stephensi. Two specific PCR assays were developed for detection of two kdr-like mutations in An. stephensi.

  15. Cloning, sequencing, and expression of the Pseudomonas testosteroni gene encoding 3-oxosteroid delta 1-dehydrogenase.

    PubMed Central

    Plesiat, P; Grandguillot, M; Harayama, S; Vragar, S; Michel-Briand, Y

    1991-01-01

    Pseudomonas testosteroni ATCC 17410 is able to grow on testosterone. This strain was mutagenized by Tn5, and 41 mutants defective in the utilization of testosterone were isolated. One of them, called mutant 06, expressed 3-oxosteroid delta 1- and 3-oxosteroid delta 4-5 alpha-dehydrogenases only at low levels. The DNA region around the Tn5 insertion in mutant 06 was cloned into pUC19, and the 1-kbp EcoRI-BamHI segment neighbor to the Tn5 insertion was used to probe DNA from the wild-type strain. The probe hybridized to a 7.8-kbp SalI fragment. Plasmid pTES5, which is a pUC19 derivative containing this 7.8-kbp SalI fragment, was isolated after the screening by the 1-kbp EcoRI-BamHI probe. This plasmid expressed delta 1-dehydrogenase in Escherichia coli cells. The 2.2-kbp KpnI-KpnI segment of pTES5 was subcloned into pUC18, and pTEK21 was constructed. In E. coli containing the lacIq plasmid pRG1 and pTEK21, the expression of delta 1-dehydrogenase was induced by isopropyl-beta-D-thiogalactopyranoside (IPTG). The induced level was about 40 times higher than the induced level in P. testosteroni. Delta 1-Dehydrogenase synthesized in E. coli was localized in the inner membrane fraction. The minicell experiments showed that a 59-kDa polypeptide was synthesized from pTEK21, and this polypeptide was located in the inner membrane fraction. The complete nucleotide sequence of the 2.2-kbp KpnI-KpnI segment of pTEK21 was determined. An open reading frame which encodes a 62.4-kDa polypeptide and which is preceded by a Shine-Dalgarno-like sequence was identified. The first 44 amino acids of the putative product exhibited significant sequence similarity to the N-terminal sequences of lipoamide dehydrogenases. Images FIG. 4 PMID:1657885

  16. Sequential cloning of chromosomes

    DOEpatents

    Lacks, Sanford A.

    1995-07-18

    A method for sequential cloning of chromosomal DNA of a target organism is disclosed. A first DNA segment homologous to the chromosomal DNA to be sequentially cloned is isolated. The first segment has a first restriction enzyme site on either side. A first vector product is formed by ligating the homologous segment into a suitably designed vector. The first vector product is circularly integrated into the target organism's chromosomal DNA. The resulting integrated chromosomal DNA segment includes the homologous DNA segment at either end of the integrated vector segment. The integrated chromosomal DNA is cleaved with a second restriction enzyme and ligated to form a vector-containing plasmid, which is replicated in a host organism. The replicated plasmid is then cleaved with the first restriction enzyme. Next, a DNA segment containing the vector and a segment of DNA homologous to a distal portion of the previously isolated DNA segment is isolated. This segment is then ligated to form a plasmid which is replicated within a suitable host. This plasmid is then circularly integrated into the target chromosomal DNA. The chromosomal DNA containing the circularly integrated vector is treated with a third, retrorestriction (class IIS) enzyme. The cleaved DNA is ligated to give a plasmid that is used to transform a host permissive for replication of its vector. The sequential cloning process continues by repeated cycles of circular integration and excision. The excision is carried out alternately with the second and third enzymes.

  17. Highly conserved intragenic HSV-2 sequences: Results from next-generation sequencing of HSV-2 UL and US regions from genital swabs collected from 3 continents.

    PubMed

    Johnston, Christine; Magaret, Amalia; Roychoudhury, Pavitra; Greninger, Alexander L; Cheng, Anqi; Diem, Kurt; Fitzgibbon, Matthew P; Huang, Meei-Li; Selke, Stacy; Lingappa, Jairam R; Celum, Connie; Jerome, Keith R; Wald, Anna; Koelle, David M

    2017-10-01

    Understanding the variability in circulating herpes simplex virus type 2 (HSV-2) genomic sequences is critical to the development of HSV-2 vaccines. Genital lesion swabs containing ≥ 10 7 log 10 copies HSV DNA collected from Africa, the USA, and South America underwent next-generation sequencing, followed by K-mer based filtering and de novo genomic assembly. Sites of heterogeneity within coding regions in unique long and unique short (U L _U S ) regions were identified. Phylogenetic trees were created using maximum likelihood reconstruction. Among 46 samples from 38 persons, 1468 intragenic base-pair substitutions were identified. The maximum nucleotide distance between strains for concatenated U L_ U S segments was 0.4%. Phylogeny did not reveal geographic clustering. The most variable proteins had non-synonymous mutations in < 3% of amino acids. Unenriched HSV-2 DNA can undergo next-generation sequencing to identify intragenic variability. The use of clinical swabs for sequencing expands the information that can be gathered directly from these specimens. Copyright © 2017 Elsevier Inc. All rights reserved.

  18. Characterization of a highly polymorphic region 5′ to JH in the human immunoglobulin heavy chain

    PubMed Central

    Silva, Alcino J.; Johnson, John P.; White, Raymond L.

    1987-01-01

    A cloned DNA segment 1.25 kilobases (kb) upstream from the joining segments of the human heavy chain immunoglobulin gene revealed extensive polymorphic variation at this locus, and the polymorphic pattern was stably transmitted to the next generation. Genomic restriction analysis showed that the polymorphism was caused by insertions/deletions within an MspI/BamHI fragment. Sequencing of one allele, 848 base pairs (bp) long, revealed eleven 50-base-pair tandem repeats. A second allele, 648 bp long, was cloned from a human genomic cosmid library, sequenced, and found to contain four fewer repeats than the first allele. A survey of 186 chromosomes from unrelated individuals of primarily northern European descent revealed at least six alleles. Images PMID:2884636

  19. Nonparametric Bayesian clustering to detect bipolar methylated genomic loci.

    PubMed

    Wu, Xiaowei; Sun, Ming-An; Zhu, Hongxiao; Xie, Hehuang

    2015-01-16

    With recent development in sequencing technology, a large number of genome-wide DNA methylation studies have generated massive amounts of bisulfite sequencing data. The analysis of DNA methylation patterns helps researchers understand epigenetic regulatory mechanisms. Highly variable methylation patterns reflect stochastic fluctuations in DNA methylation, whereas well-structured methylation patterns imply deterministic methylation events. Among these methylation patterns, bipolar patterns are important as they may originate from allele-specific methylation (ASM) or cell-specific methylation (CSM). Utilizing nonparametric Bayesian clustering followed by hypothesis testing, we have developed a novel statistical approach to identify bipolar methylated genomic regions in bisulfite sequencing data. Simulation studies demonstrate that the proposed method achieves good performance in terms of specificity and sensitivity. We used the method to analyze data from mouse brain and human blood methylomes. The bipolar methylated segments detected are found highly consistent with the differentially methylated regions identified by using purified cell subsets. Bipolar DNA methylation often indicates epigenetic heterogeneity caused by ASM or CSM. With allele-specific events filtered out or appropriately taken into account, our proposed approach sheds light on the identification of cell-specific genes/pathways under strong epigenetic control in a heterogeneous cell population.

  20. Molecular Darwinism: The Contingency of Spontaneous Genetic Variation

    PubMed Central

    Arber, Werner

    2011-01-01

    The availability of spontaneously occurring genetic variants is an important driving force of biological evolution. Largely thanks to experimental investigations by microbial geneticists, we know today that several different molecular mechanisms contribute to the overall genetic variations. These mechanisms can be assigned to three natural strategies to generate genetic variants: 1) local sequence changes, 2) intragenomic reshuffling of DNA segments, and 3) acquisition of a segment of foreign DNA. In these processes, specific gene products are involved in cooperation with different nongenetic elements. Some genetic variations occur fully at random along the DNA filaments, others rather with a statistical reproducibility, although at many possible sites. We have to be aware that evolution in natural ecosystems is of higher complexity than under most laboratory conditions, not at least in view of symbiotic associations and the occurrence of horizontal gene transfer. The encountered contingency of genetic variation can possibly best ensure a long-term persistence of life under steadily changing living conditions. PMID:21979160

  1. Molecular Darwinism: the contingency of spontaneous genetic variation.

    PubMed

    Arber, Werner

    2011-01-01

    The availability of spontaneously occurring genetic variants is an important driving force of biological evolution. Largely thanks to experimental investigations by microbial geneticists, we know today that several different molecular mechanisms contribute to the overall genetic variations. These mechanisms can be assigned to three natural strategies to generate genetic variants: 1) local sequence changes, 2) intragenomic reshuffling of DNA segments, and 3) acquisition of a segment of foreign DNA. In these processes, specific gene products are involved in cooperation with different nongenetic elements. Some genetic variations occur fully at random along the DNA filaments, others rather with a statistical reproducibility, although at many possible sites. We have to be aware that evolution in natural ecosystems is of higher complexity than under most laboratory conditions, not at least in view of symbiotic associations and the occurrence of horizontal gene transfer. The encountered contingency of genetic variation can possibly best ensure a long-term persistence of life under steadily changing living conditions.

  2. Mechanism of DNA-binding enhancement by the human T-cell leukaemia virus transactivator Tax.

    PubMed

    Baranger, A M; Palmer, C R; Hamm, M K; Giebler, H A; Brauweiler, A; Nyborg, J K; Schepartz, A

    1995-08-17

    Tax protein activates transcription of the human T-cell leukaemia virus type I (HTLV-I) genome through three imperfect cyclic AMP-responsive element (CRE) target sites located within the viral promoter. Previous work has shown that Tax interacts with the bZIP element of proteins that bind the CRE target site to promote peptide dimerization, suggesting an association between Tax and bZIP coiled coil. Here we show that the site of interaction with Tax is not the coiled coil, but the basic segment. This interaction increases the stability of the GCN4 bZIP dimer by 1.7 kcal mol-1 and the DNA affinity of the dimer by 1.9 kcal mol-1. The differential effect of Tax on several bZip-DNA complexes that differ in peptide sequence or DNA conformation suggests a model for Tax action based on stabilization of a distinct DNA-bound protein structure. This model may explain how Tax interacts with transcription factors of considerable sequence diversity to alter patterns of gene expression.

  3. Sequential cloning of chromosomes

    DOEpatents

    Lacks, S.A.

    1995-07-18

    A method for sequential cloning of chromosomal DNA of a target organism is disclosed. A first DNA segment homologous to the chromosomal DNA to be sequentially cloned is isolated. The first segment has a first restriction enzyme site on either side. A first vector product is formed by ligating the homologous segment into a suitably designed vector. The first vector product is circularly integrated into the target organism`s chromosomal DNA. The resulting integrated chromosomal DNA segment includes the homologous DNA segment at either end of the integrated vector segment. The integrated chromosomal DNA is cleaved with a second restriction enzyme and ligated to form a vector-containing plasmid, which is replicated in a host organism. The replicated plasmid is then cleaved with the first restriction enzyme. Next, a DNA segment containing the vector and a segment of DNA homologous to a distal portion of the previously isolated DNA segment is isolated. This segment is then ligated to form a plasmid which is replicated within a suitable host. This plasmid is then circularly integrated into the target chromosomal DNA. The chromosomal DNA containing the circularly integrated vector is treated with a third, retrorestriction (class IIS) enzyme. The cleaved DNA is ligated to give a plasmid that is used to transform a host permissive for replication of its vector. The sequential cloning process continues by repeated cycles of circular integration and excision. The excision is carried out alternately with the second and third enzymes. 9 figs.

  4. Superior ab initio identification, annotation and characterisation of TEs and segmental duplications from genome assemblies.

    PubMed

    Zeng, Lu; Kortschak, R Daniel; Raison, Joy M; Bertozzi, Terry; Adelson, David L

    2018-01-01

    Transposable Elements (TEs) are mobile DNA sequences that make up significant fractions of amniote genomes. However, they are difficult to detect and annotate ab initio because of their variable features, lengths and clade-specific variants. We have addressed this problem by refining and developing a Comprehensive ab initio Repeat Pipeline (CARP) to identify and cluster TEs and other repetitive sequences in genome assemblies. The pipeline begins with a pairwise alignment using krishna, a custom aligner. Single linkage clustering is then carried out to produce families of repetitive elements. Consensus sequences are then filtered for protein coding genes and then annotated using Repbase and a custom library of retrovirus and reverse transcriptase sequences. This process yields three types of family: fully annotated, partially annotated and unannotated. Fully annotated families reflect recently diverged/young known TEs present in Repbase. The remaining two types of families contain a mixture of novel TEs and segmental duplications. These can be resolved by aligning these consensus sequences back to the genome to assess copy number vs. length distribution. Our pipeline has three significant advantages compared to other methods for ab initio repeat identification: 1) we generate not only consensus sequences, but keep the genomic intervals for the original aligned sequences, allowing straightforward analysis of evolutionary dynamics, 2) consensus sequences represent low-divergence, recently/currently active TE families, 3) segmental duplications are annotated as a useful by-product. We have compared our ab initio repeat annotations for 7 genome assemblies to other methods and demonstrate that CARP compares favourably with RepeatModeler, the most widely used repeat annotation package.

  5. Superior ab initio identification, annotation and characterisation of TEs and segmental duplications from genome assemblies

    PubMed Central

    Zeng, Lu; Kortschak, R. Daniel; Raison, Joy M.

    2018-01-01

    Transposable Elements (TEs) are mobile DNA sequences that make up significant fractions of amniote genomes. However, they are difficult to detect and annotate ab initio because of their variable features, lengths and clade-specific variants. We have addressed this problem by refining and developing a Comprehensive ab initio Repeat Pipeline (CARP) to identify and cluster TEs and other repetitive sequences in genome assemblies. The pipeline begins with a pairwise alignment using krishna, a custom aligner. Single linkage clustering is then carried out to produce families of repetitive elements. Consensus sequences are then filtered for protein coding genes and then annotated using Repbase and a custom library of retrovirus and reverse transcriptase sequences. This process yields three types of family: fully annotated, partially annotated and unannotated. Fully annotated families reflect recently diverged/young known TEs present in Repbase. The remaining two types of families contain a mixture of novel TEs and segmental duplications. These can be resolved by aligning these consensus sequences back to the genome to assess copy number vs. length distribution. Our pipeline has three significant advantages compared to other methods for ab initio repeat identification: 1) we generate not only consensus sequences, but keep the genomic intervals for the original aligned sequences, allowing straightforward analysis of evolutionary dynamics, 2) consensus sequences represent low-divergence, recently/currently active TE families, 3) segmental duplications are annotated as a useful by-product. We have compared our ab initio repeat annotations for 7 genome assemblies to other methods and demonstrate that CARP compares favourably with RepeatModeler, the most widely used repeat annotation package. PMID:29538441

  6. Contrasting Patterns of rDNA Homogenization within the Zygosaccharomyces rouxii Species Complex

    PubMed Central

    Chand Dakal, Tikam; Giudici, Paolo; Solieri, Lisa

    2016-01-01

    Arrays of repetitive ribosomal DNA (rDNA) sequences are generally expected to evolve as a coherent family, where repeats within such a family are more similar to each other than to orthologs in related species. The continuous homogenization of repeats within individual genomes is a recombination process termed concerted evolution. Here, we investigated the extent and the direction of concerted evolution in 43 yeast strains of the Zygosaccharomyces rouxii species complex (Z. rouxii, Z. sapae, Z. mellis), by analyzing two portions of the 35S rDNA cistron, namely the D1/D2 domains at the 5’ end of the 26S rRNA gene and the segment including the internal transcribed spacers (ITS) 1 and 2 (ITS regions). We demonstrate that intra-genomic rDNA sequence variation is unusually frequent in this clade and that rDNA arrays in single genomes consist of an intermixing of Z. rouxii, Z. sapae and Z. mellis-like sequences, putatively evolved by reticulate evolutionary events that involved repeated hybridization between lineages. The levels and distribution of sequence polymorphisms vary across rDNA repeats in different individuals, reflecting four patterns of rDNA evolution: I) rDNA repeats that are homogeneous within a genome but are chimeras derived from two parental lineages via recombination: Z. rouxii in the ITS region and Z. sapae in the D1/D2 region; II) intra-genomic rDNA repeats that retain polymorphisms only in ITS regions; III) rDNA repeats that vary only in their D1/D2 domains; IV) heterogeneous rDNA arrays that have both polymorphic ITS and D1/D2 regions. We argue that an ongoing process of homogenization following allodiplodization or incomplete lineage sorting gave rise to divergent evolutionary trajectories in different strains, depending upon temporal, structural and functional constraints. We discuss the consequences of these findings for Zygosaccharomyces species delineation and, more in general, for yeast barcoding. PMID:27501051

  7. Sequence-length variation of mtDNA HVS-I C-stretch in Chinese ethnic groups.

    PubMed

    Chen, Feng; Dang, Yong-hui; Yan, Chun-xia; Liu, Yan-ling; Deng, Ya-jun; Fulton, David J R; Chen, Teng

    2009-10-01

    The purpose of this study was to investigate mitochondrial DNA (mtDNA) hypervariable segment-I (HVS-I) C-stretch variations and explore the significance of these variations in forensic and population genetics studies. The C-stretch sequence variation was studied in 919 unrelated individuals from 8 Chinese ethnic groups using both direct and clone sequencing approaches. Thirty eight C-stretch haplotypes were identified, and some novel and population specific haplotypes were also detected. The C-stretch genetic diversity (GD) values were relatively high, and probability (P) values were low. Additionally, C-stretch length heteroplasmy was observed in approximately 9% of individuals studied. There was a significant correlation (r=-0.961, P<0.01) between the expansion of the cytosine sequence length in the C-stretch of HVS-I and a reduction in the number of upstream adenines. These results indicate that the C-stretch could be a useful genetic maker in forensic identification of Chinese populations. The results from the Fst and dA genetic distance matrix, neighbor-joining tree, and principal component map also suggest that C-stretch could be used as a reliable genetic marker in population genetics.

  8. Sequence and expression of two cry8 genes from Bacillus thuringiensis INTA Fr7-4, a native strain from Argentina.

    PubMed

    Navas, Laura E; Berretta, Marcelo F; Pérez, Melisa P; Amadio, Ariel F; Ortiz, Elio M; Sauka, Diego H; Benintende, Graciela B; Zandomeni, Rubén O

    2014-01-01

    We found and characterized two cry8 genes from the Bacillus thuringiensis strain INTA Fr7-4 isolated in Argentina. These genes, cry8Kb3 and cry8Pa3, are located in a tandem array within a 13,200-bp DNA segment sequenced from a preparation of total DNA. They encode 1,169- and 1,176-amino-acid proteins, respectively. Both genes were cloned with their promoter sequences and the proteins were expressed separately in an acrystalliferous strain of B. thuringiensis leading to the formation of ovoid crystals in the recombinant strains. The toxicity against larvae of Anthonomus grandis Bh. (Coleoptera: Curculionidae) of a spore-crystal suspension from the recombinant strain containing cry8Pa3 was similar to that of the parent strain INTA Fr7-4. © 2014 S. Karger AG, Basel.

  9. Principles of regulatory information conservation between mouse and human.

    PubMed

    Cheng, Yong; Ma, Zhihai; Kim, Bong-Hyun; Wu, Weisheng; Cayting, Philip; Boyle, Alan P; Sundaram, Vasavi; Xing, Xiaoyun; Dogan, Nergiz; Li, Jingjing; Euskirchen, Ghia; Lin, Shin; Lin, Yiing; Visel, Axel; Kawli, Trupti; Yang, Xinqiong; Patacsil, Dorrelyn; Keller, Cheryl A; Giardine, Belinda; Kundaje, Anshul; Wang, Ting; Pennacchio, Len A; Weng, Zhiping; Hardison, Ross C; Snyder, Michael P

    2014-11-20

    To broaden our understanding of the evolution of gene regulation mechanisms, we generated occupancy profiles for 34 orthologous transcription factors (TFs) in human-mouse erythroid progenitor, lymphoblast and embryonic stem-cell lines. By combining the genome-wide transcription factor occupancy repertoires, associated epigenetic signals, and co-association patterns, here we deduce several evolutionary principles of gene regulatory features operating since the mouse and human lineages diverged. The genomic distribution profiles, primary binding motifs, chromatin states, and DNA methylation preferences are well conserved for TF-occupied sequences. However, the extent to which orthologous DNA segments are bound by orthologous TFs varies both among TFs and with genomic location: binding at promoters is more highly conserved than binding at distal elements. Notably, occupancy-conserved TF-occupied sequences tend to be pleiotropic; they function in several tissues and also co-associate with many TFs. Single nucleotide variants at sites with potential regulatory functions are enriched in occupancy-conserved TF-occupied sequences.

  10. Control of DNA strand displacement kinetics using toehold exchange.

    PubMed

    Zhang, David Yu; Winfree, Erik

    2009-12-02

    DNA is increasingly being used as the engineering material of choice for the construction of nanoscale circuits, structures, and motors. Many of these enzyme-free constructions function by DNA strand displacement reactions. The kinetics of strand displacement can be modulated by toeholds, short single-stranded segments of DNA that colocalize reactant DNA molecules. Recently, the toehold exchange process was introduced as a method for designing fast and reversible strand displacement reactions. Here, we characterize the kinetics of DNA toehold exchange and model it as a three-step process. This model is simple and quantitatively predicts the kinetics of 85 different strand displacement reactions from the DNA sequences. Furthermore, we use toehold exchange to construct a simple catalytic reaction. This work improves the understanding of the kinetics of nucleic acid reactions and will be useful in the rational design of dynamic DNA and RNA circuits and nanodevices.

  11. Ribo HRM--detection of inter- and intra-species polymorphisms within ribosomal DNA by high resolution melting analysis supported by application of artificial allelic standards.

    PubMed

    Masny, Aleksander; Jagiełło, Agata; Płucienniczak, Grażyna; Golab, Elzbieta

    2012-09-01

    Ribo HRM, a single-tube PCR and high resolution melting (HRM) assay for detection of polymorphisms in the large subunit ribosomal DNA expansion segment V, was developed on a Trichinella model. Four Trichinella species: T. spiralis (isolates ISS3 and ISS160), T. nativa (isolates ISS10 and ISS70), T. britovi (isolates ISS2 and ISS392) and T. pseudospiralis (isolates ISS13 and ISS1348) were genotyped. Cloned allelic variants of the expansion segment V were used as standards to prepare reference HRM curves characteristic for single sequences and mixtures of several cloned sequences imitating allelic composition detected in Trichinella isolates. Using the primer pair Tsr1 and Trich1bi, it was possible to amplify a fragment of the ESV and detect PCR products obtained from the genomic DNA of pools of larvae belonging to the four investigated species: T. pseudospiralis, T. spiralis, T. britovi and T. nativa, in a single tube Real-Time PCR reaction. Differences in the shape of the HRM curves of Trichinella isolates suggested the presence of differences between examined isolates of T. nativa, T. britovi and T. pseudospiralis species. No differences were observed between T. spiralis isolates. The presence of polymorphisms within the amplified ESV sequence fragment of T. nativa T. britovi and T. pseudospiralis was confirmed by sequencing of the cloned PCR products. Novel sequences were discovered and deposited in GenBank (GenBank IDs: JN971020-JN971027, JN120902.1, JN120903.1, JN120904.1, JN120906.1, JN120905.1). Screening the ESV region of Trichinella for polymorphism is possible using the genotyping assay Ribo HRM at the current state of its development. The Ribo HRM assay could be useful in phylogenetic studies of the Trichinella genus. Copyright © 2012 Elsevier B.V. All rights reserved.

  12. Evidence for louse-transmitted diseases in soldiers of Napoleon's Grand Army in Vilnius.

    PubMed

    Raoult, Didier; Dutour, Olivier; Houhamdi, Linda; Jankauskas, Rimantas; Fournier, Pierre-Edouard; Ardagna, Yann; Drancourt, Michel; Signoli, Michel; La, Vu Dang; Macia, Yves; Aboudharam, Gerard

    2006-01-01

    Many soldiers in Napoleon's Grand Army died of infectious diseases during its retreat from Russia. Because soldiers were commonly infested with body lice, it has been speculated that louse-borne infectious diseases, such as epidemic typhus (caused by Rickettsia prowazekii), were common. We investigated this possibility during recent excavations of a mass grave of Napoleon's soldiers in Vilnius, Lithuania. Segments of 5 body lice, identified morphologically and by polymerase chain reaction (PCR) amplification and sequencing, were found in earth from the grave that also contained fragments of soldiers' uniforms. DNA of Bartonella quintana (the agent of trench fever) was identified by PCR and sequencing in 3 of the lice. Similarly, PCR and sequencing of dental pulp from the remains of 35 soldiers revealed DNA of B. quintana in 7 soldiers and DNA of R. prowazekii in 3 other soldiers. Our results show that louse-borne infectious diseases affected nearly one-third of Napoleon's soldiers buried in Vilnius and indicate that these diseases might have been a major factor in the French retreat from Russia.

  13. Sequence conservation on the Y chromosome

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Gibson, L.H.; Yang-Feng, L.; Lau, C.

    The Y chromosome is present in all mammals and is considered to be essential to sex determination. Despite intense genomic research, only a few genes have been identified and mapped to this chromosome in humans. Several of them, such as SRY and ZFY, have been demonstrated to be conserved and Y-located in other mammals. In order to address the issue of sequence conservation on the Y chromosome, we performed fluorescence in situ hybridization (FISH) with DNA from a human Y cosmid library as a probe to study the Y chromosomes from other mammalian species. Total DNA from 3,000-4,500 cosmid poolsmore » were labeled with biotinylated-dUTP and hybridized to metaphase chromosomes. For human and primate preparations, human cot1 DNA was included in the hybridization mixture to suppress the hybridization from repeat sequences. FISH signals were detected on the Y chromosomes of human, gorilla, orangutan and baboon (Old World monkey) and were absent on those of squirrel monkey (New World monkey), Indian munjac, wood lemming, Chinese hamster, rat and mouse. Since sequence analysis suggested that specific genes, e.g. SRY and ZFY, are conserved between these two groups, the lack of detectable hybridization in the latter group implies either that conservation of the human Y sequences is limited to the Y chromosomes of the great apes and Old World monkeys, or that the size of the syntenic segment is too small to be detected under the resolution of FISH, or that homologeous sequences have undergone considerable divergence. Further studies with reduced hybridization stringency are currently being conducted. Our results provide some clues as to Y-sequence conservation across species and demonstrate the limitations of FISH across species with total DNA sequences from a particular chromosome.« less

  14. Distinct Copy Number, Coding Sequence, and Locus Methylation Patterns Underlie Rhg1-Mediated Soybean Resistance to Soybean Cyst Nematode1[W][OPEN

    PubMed Central

    Cook, David E.; Bayless, Adam M.; Wang, Kai; Guo, Xiaoli; Song, Qijian; Jiang, Jiming; Bent, Andrew F.

    2014-01-01

    Copy number variation of kilobase-scale genomic DNA segments, beyond presence/absence polymorphisms, can be an important driver of adaptive traits. Resistance to Heterodera glycines (Rhg1) is a widely utilized quantitative trait locus that makes the strongest known contribution to resistance against soybean cyst nematode (SCN), Heterodera glycines, the most damaging pathogen of soybean (Glycine max). Rhg1 was recently discovered to be a complex locus at which resistance-conferring haplotypes carry up to 10 tandem repeat copies of a 31-kb DNA segment, and three disparate genes present on each repeat contribute to SCN resistance. Here, we use whole-genome sequencing, fiber-FISH (fluorescence in situ hybridization), and other methods to discover the genetic variation at Rhg1 across 41 diverse soybean accessions. Based on copy number variation, transcript abundance, nucleic acid polymorphisms, and differentially methylated DNA regions, we find that SCN resistance is associated with multicopy Rhg1 haplotypes that form two distinct groups. The tested high-copy-number Rhg1 accessions, including plant introduction (PI) 88788, contain a flexible number of copies (seven to 10) of the 31-kb Rhg1 repeat. The identified low-copy-number Rhg1 group, including PI 548402 (Peking) and PI 437654, contains three copies of the Rhg1 repeat and a newly identified allele of Glyma18g02590 (a predicted α-SNAP [α-soluble N-ethylmaleimide–sensitive factor attachment protein]). There is strong evidence for a shared origin of the two resistance-conferring multicopy Rhg1 groups and subsequent independent evolution. Differentially methylated DNA regions also were identified within Rhg1 that correlate with SCN resistance. These data provide insights into copy number variation of multigene segments, using as the example a disease resistance trait of high economic importance. PMID:24733883

  15. Transcriptional mapping of the ribosomal RNA region of mouse L-cell mitochondrial DNA.

    PubMed Central

    Nagley, P; Clayton, D A

    1980-01-01

    The map positions in mouse mitochondrial DNA of the two ribosomal RNA genes and adjacent genes coding several small transcripts have been determined precisely by application of a procedure in which DNA-RNA hybrids have been subjected to digestion by S1 nuclease under conditions of varying severity. Digestion of the DNA-RNA hybrids with S1 nuclease yielded a series of species which were shown to contain ribosomal RNA molecules together with adjacent transcripts hybridized conjointly to a continuous segment of mitochondrial DNA. There is one small transcript about 60 bases long whose gene adjoins the sequences coding the 5'-end of the small ribosomal RNA (950 bases) and which lies approximately 200 nucleotides from the D-loop origin of heavy strand mitochondrial DNA synthesis. An 80-base transcript lies between the small and large ribosomal RNA genes, and genes for two further short transcript (each about 80 bases in length) abut the sequences coding the 3'-end of the large ribosomal RNA (approximately 1500 bases). The ability to isolate a discrete DNA-RNA hybrid species approximately 2700 base pairs in length containing all these transcripts suggests that there can be few nucleotides in this region of mouse mitochondrial DNA which are not represented as stable RNA species. Images PMID:6253898

  16. A novel gene, RSD-3/HSD-3.1, encodes a meiotic-related protein expressed in rat and human testis.

    PubMed

    Zhang, Xiaodong; Liu, Huixian; Zhang, Yan; Qiao, Yuan; Miao, Shiying; Wang, Linfang; Zhang, Jianchao; Zong, Shudong; Koide, S S

    2003-06-01

    The expression of stage-specific genes during spermatogenesis was determined by isolating two segments of rat seminiferous tubule at different stages of the germinal epithelium cycle delineated by transillumination-delineated microdissection, combined with differential display polymerase chain reaction to identify the differential transcripts formed. A total of 22 cDNAs were identified and accepted by GenBank as new expressed sequence tags. One of the expressed sequence tags was radiolabeled and used as a probe to screen a rat testis cDNA library. A novel full-length cDNA composed of 2228 bp, designated as RSD-3 (rat sperm DNA no.3, GenBank accession no. AF094609) was isolated and characterized. The reading frame encodes a polypeptide consisting of 526 amino acid residues, containing a number of DNA binding motifs and phosphorylation sites for PKC, CK-II, and p34cdc2. Northern blot of mRNA prepared from various tissues of adult rats showed that RSD-3 is expressed only in the testis. The initial expression of the RSD-3 gene was detected in the testis on the 30th postnatal day and attained adult level on the 60th postnatal day. Immunolocalization of RSD-3 in germ cells of rat testis showed that its expression is restricted to primary spermatocytes, undergoing meiosis division I. A human testis homologue of RSD-3 cDNA, designated as HSD-3.1 (GenBank accession no. AF144487) was isolated by screening the Human Testis Rapid-Screen arrayed cDNA library panels by RT-PCR. The exon-intron boundaries of HSD-3.1 gene were determined by aligning the cDNA sequence with the corresponding genome sequence. The cDNA consisted of 12 exons that span approximately 52.8 kb of the genome sequence and was mapped to chromosome 14q31.3.

  17. Modeling the Lac repressor-operator assembly: The influence of DNA looping on Lac repressor conformation

    PubMed Central

    Swigon, David; Coleman, Bernard D.; Olson, Wilma K.

    2006-01-01

    Repression of transcription of the Escherichia coli Lac operon by the Lac repressor (LacR) is accompanied by the simultaneous binding of LacR to two operators and the formation of a DNA loop. A recently developed theory of sequence-dependent DNA elasticity enables one to relate the fine structure of the LacR–DNA complex to a wide range of heretofore-unconnected experimental observations. Here, that theory is used to calculate the configuration and free energy of the DNA loop as a function of its length and base-pair sequence, its linking number, and the end conditions imposed by the LacR tetramer. The tetramer can assume two types of conformations. Whereas a rigid V-shaped structure is observed in the crystal, EM images show extended forms in which two dimer subunits are flexibly joined. Upon comparing our computed loop configurations with published experimental observations of permanganate sensitivities, DNase I cutting patterns, and loop stabilities, we conclude that linear DNA segments of short-to-medium chain length (50–180 bp) give rise to loops with the extended form of LacR and that loops formed within negatively supercoiled plasmids induce the V-shaped structure. PMID:16785444

  18. Mechanistically Distinct Pathways of Divergent Regulatory DNA Creation Contribute to Evolution of Human-Specific Genomic Regulatory Networks Driving Phenotypic Divergence of Homo sapiens

    PubMed Central

    Glinsky, Gennadi V.

    2016-01-01

    Abstract Thousands of candidate human-specific regulatory sequences (HSRS) have been identified, supporting the hypothesis that unique to human phenotypes result from human-specific alterations of genomic regulatory networks. Collectively, a compendium of multiple diverse families of HSRS that are functionally and structurally divergent from Great Apes could be defined as the backbone of human-specific genomic regulatory networks. Here, the conservation patterns analysis of 18,364 candidate HSRS was carried out requiring that 100% of bases must remap during the alignments of human, chimpanzee, and bonobo sequences. A total of 5,535 candidate HSRS were identified that are: (i) highly conserved in Great Apes; (ii) evolved by the exaptation of highly conserved ancestral DNA; (iii) defined by either the acceleration of mutation rates on the human lineage or the functional divergence from non-human primates. The exaptation of highly conserved ancestral DNA pathway seems mechanistically distinct from the evolution of regulatory DNA segments driven by the species-specific expansion of transposable elements. Genome-wide proximity placement analysis of HSRS revealed that a small fraction of topologically associating domains (TADs) contain more than half of HSRS from four distinct families. TADs that are enriched for HSRS and termed rapidly evolving in humans TADs (revTADs) comprise 0.8–10.3% of 3,127 TADs in the hESC genome. RevTADs manifest distinct correlation patterns between placements of human accelerated regions, human-specific transcription factor-binding sites, and recombination rates. There is a significant enrichment within revTAD boundaries of hESC-enhancers, primate-specific CTCF-binding sites, human-specific RNAPII-binding sites, hCONDELs, and H3K4me3 peaks with human-specific enrichment at TSS in prefrontal cortex neurons (P < 0.0001 in all instances). Present analysis supports the idea that phenotypic divergence of Homo sapiens is driven by the evolution of human-specific genomic regulatory networks via at least two mechanistically distinct pathways of creation of divergent sequences of regulatory DNA: (i) recombination-associated exaptation of the highly conserved ancestral regulatory DNA segments; (ii) human-specific insertions of transposable elements. PMID:27503290

  19. Transient kinetics measured with force steps discriminate between double-stranded DNA elongation and melting and define the reaction energetics

    PubMed Central

    Bongini, Lorenzo; Melli, Luca; Lombardi, Vincenzo; Bianco, Pasquale

    2014-01-01

    Under a tension of ∼65 pN, double-stranded DNA undergoes an overstretching transition from its basic (B-form) conformation to a 1.7 times longer conformation whose nature is only recently starting to be understood. Here we provide a structural and thermodynamic characterization of the transition by recording the length transient following force steps imposed on the λ-phage DNA with different melting degrees and temperatures (10–25°C). The shortening transient following a 20–35 pN force drop from the overstretching force shows a sequence of fast shortenings of double-stranded extended (S-form) segments and pauses owing to reannealing of melted segments. The lengthening transients following a 2–35 pN stretch to the overstretching force show the kinetics of a two-state reaction and indicate that the whole 70% extension is a B-S transition that precedes and is independent of melting. The temperature dependence of the lengthening transient shows that the entropic contribution to the B-S transition is one-third of the entropy change of thermal melting, reinforcing the evidence for a double-stranded S-form that maintains a significant fraction of the interstrand bonds. The cooperativity of the unitary elongation (22 bp) is independent of temperature, suggesting that structural factors, such as the nucleic acid sequence, control the transition. PMID:24353317

  20. Structural polymorphism at LCR and its role in beta-globin gene regulation.

    PubMed

    Kukreti, Shrikant; Kaur, Harpreet; Kaushik, Mahima; Bansal, Aparna; Saxena, Sarika; Kaushik, Shikha; Kukreti, Ritushree

    2010-09-01

    Information on the secondary structures and conformational manifestations of eukaryotic DNA and their biological significance with reference to gene regulation and expression is limited. The human beta-globin gene Locus Control Region (LCR), a dominant regulator of globin gene expression, is a contiguous piece of DNA with five tissue-specific DNase I-hypersensitive sites (HSs). Since these HSs have a high density of transcription factor binding sites, structural interdependencies between HSs and different promoters may directly or indirectly regulate LCR functions. Mutations and SNPs may stabilize or destabilize the local secondary structures, affecting the gene expression by changes in the protein-DNA recognition patterns. Various palindromic or quasi-palindromic segments within LCR, could cause structural polymorphism and geometrical switching of DNA. This emphasizes the importance of understanding of the sequence-dependent variations of the DNA structure. Such structural motifs might act as regulatory elements. The local conformational variability of a DNA segment or action of a DNA specific protein is key to create and maintain active chromatin domains and affect transcription of various tissue specific beta-globin genes. We, summarize here the current status of beta-globin LCR structure and function. Further structural studies at molecular level and functional genomics might solve the regulatory puzzles that control the beta-globin gene locus. Copyright (c) 2010 Elsevier Masson SAS. All rights reserved.

  1. DNA mutation motifs in the genes associated with inherited diseases.

    PubMed

    Růžička, Michal; Kulhánek, Petr; Radová, Lenka; Čechová, Andrea; Špačková, Naďa; Fajkusová, Lenka; Réblová, Kamila

    2017-01-01

    Mutations in human genes can be responsible for inherited genetic disorders and cancer. Mutations can arise due to environmental factors or spontaneously. It has been shown that certain DNA sequences are more prone to mutate. These sites are termed hotspots and exhibit a higher mutation frequency than expected by chance. In contrast, DNA sequences with lower mutation frequencies than expected by chance are termed coldspots. Mutation hotspots are usually derived from a mutation spectrum, which reflects particular population where an effect of a common ancestor plays a role. To detect coldspots/hotspots unaffected by population bias, we analysed the presence of germline mutations obtained from HGMD database in the 5-nucleotide segments repeatedly occurring in genes associated with common inherited disorders, in particular, the PAH, LDLR, CFTR, F8, and F9 genes. Statistically significant sequences (mutational motifs) rarely associated with mutations (coldspots) and frequently associated with mutations (hotspots) exhibited characteristic sequence patterns, e.g. coldspots contained purine tract while hotspots showed alternating purine-pyrimidine bases, often with the presence of CpG dinucleotide. Using molecular dynamics simulations and free energy calculations, we analysed the global bending properties of two selected coldspots and two hotspots with a G/T mismatch. We observed that the coldspots were inherently more flexible than the hotspots. We assume that this property might be critical for effective mismatch repair as DNA with a mutation recognized by MutSα protein is noticeably bent.

  2. Sequence-independent construction of ordered combinatorial libraries with predefined crossover points.

    PubMed

    Jézéquel, Laetitia; Loeper, Jacqueline; Pompon, Denis

    2008-11-01

    Combinatorial libraries coding for mosaic enzymes with predefined crossover points constitute useful tools to address and model structure-function relationships and for functional optimization of enzymes based on multivariate statistics. The presented method, called sequence-independent generation of a chimera-ordered library (SIGNAL), allows easy shuffling of any predefined amino acid segment between two or more proteins. This method is particularly well adapted to the exchange of protein structural modules. The procedure could also be well suited to generate ordered combinatorial libraries independent of sequence similarities in a robotized manner. Sequence segments to be recombined are first extracted by PCR from a single-stranded template coding for an enzyme of interest using a biotin-avidin-based method. This technique allows the reduction of parental template contamination in the final library. Specific PCR primers allow amplification of two complementary mosaic DNA fragments, overlapping in the region to be exchanged. Fragments are finally reassembled using a fusion PCR. The process is illustrated via the construction of a set of mosaic CYP2B enzymes using this highly modular approach.

  3. [Study of three ciguatera fish poisoning cases in Xiamen city, in 2005].

    PubMed

    Luo, He-dong; Bai, Yan-yan; Zhou, Na

    2011-06-01

    To find out the reason of three ciguatera fish poisoning cases in Xiamen in 2005 and identify the fish species. The grouper implicated in food poisoning and seven other coral reef fishes collected from market were tested by mice bioassay and ciguatoxin-test kit. The mtDNA was extracted from toxic grouper meat, and Cty b gene segment was amplified and the PCR products were sequenced. The sequences were compared with those in the GenBank. The result turned out to be positive by the ciguatoxin-test kit, while the toxicity of the toxic grouper implicated in food poisoning was 0.11 mouse unit (MU)/g by mice bioassay. A 475 bp segments of Cty b gene was amplified by PCR and the sequence was 99% homologous with Epinephelus fuscoguttatus (GenBank: AY950695).No ciguatoxin in six grouper species collected from market was detected. All three food poisoning cases were caused by consumption of ciguatoxin-carrying groupers.

  4. Random and non-random monoallelic expression.

    PubMed

    Chess, Andrew

    2013-01-01

    Monoallelic expression poses an intriguing problem in epigenetics because it requires the unequal treatment of two segments of DNA that are present in the same nucleus and which can have absolutely identical sequences. This review will consider different known types of monoallelic expression. For all monoallelically expressed genes, their respective allele-specific patterns of expression have the potential to affect brain function and dysfunction.

  5. Standard operating procedure for calculating genome-to-genome distances based on high-scoring segment pairs.

    PubMed

    Auch, Alexander F; Klenk, Hans-Peter; Göker, Markus

    2010-01-28

    DNA-DNA hybridization (DDH) is a widely applied wet-lab technique to obtain an estimate of the overall similarity between the genomes of two organisms. To base the species concept for prokaryotes ultimately on DDH was chosen by microbiologists as a pragmatic approach for deciding about the recognition of novel species, but also allowed a relatively high degree of standardization compared to other areas of taxonomy. However, DDH is tedious and error-prone and first and foremost cannot be used to incrementally establish a comparative database. Recent studies have shown that in-silico methods for the comparison of genome sequences can be used to replace DDH. Considering the ongoing rapid technological progress of sequencing methods, genome-based prokaryote taxonomy is coming into reach. However, calculating distances between genomes is dependent on multiple choices for software and program settings. We here provide an overview over the modifications that can be applied to distance methods based in high-scoring segment pairs (HSPs) or maximally unique matches (MUMs) and that need to be documented. General recommendations on determining HSPs using BLAST or other algorithms are also provided. As a reference implementation, we introduce the GGDC web server (http://ggdc.gbdp.org).

  6. Nanopores and nucleic acids: prospects for ultrarapid sequencing

    NASA Technical Reports Server (NTRS)

    Deamer, D. W.; Akeson, M.

    2000-01-01

    DNA and RNA molecules can be detected as they are driven through a nanopore by an applied electric field at rates ranging from several hundred microseconds to a few milliseconds per molecule. The nanopore can rapidly discriminate between pyrimidine and purine segments along a single-stranded nucleic acid molecule. Nanopore detection and characterization of single molecules represents a new method for directly reading information encoded in linear polymers. If single-nucleotide resolution can be achieved, it is possible that nucleic acid sequences can be determined at rates exceeding a thousand bases per second.

  7. Automated analysis of time-lapse fluorescence microscopy images: from live cell images to intracellular foci.

    PubMed

    Dzyubachyk, Oleh; Essers, Jeroen; van Cappellen, Wiggert A; Baldeyron, Céline; Inagaki, Akiko; Niessen, Wiro J; Meijering, Erik

    2010-10-01

    Complete, accurate and reproducible analysis of intracellular foci from fluorescence microscopy image sequences of live cells requires full automation of all processing steps involved: cell segmentation and tracking followed by foci segmentation and pattern analysis. Integrated systems for this purpose are lacking. Extending our previous work in cell segmentation and tracking, we developed a new system for performing fully automated analysis of fluorescent foci in single cells. The system was validated by applying it to two common tasks: intracellular foci counting (in DNA damage repair experiments) and cell-phase identification based on foci pattern analysis (in DNA replication experiments). Experimental results show that the system performs comparably to expert human observers. Thus, it may replace tedious manual analyses for the considered tasks, and enables high-content screening. The described system was implemented in MATLAB (The MathWorks, Inc., USA) and compiled to run within the MATLAB environment. The routines together with four sample datasets are available at http://celmia.bigr.nl/. The software is planned for public release, free of charge for non-commercial use, after publication of this article.

  8. Dynamic evolution at pericentromeres.

    PubMed

    Hall, Anne E; Kettler, Gregory C; Preuss, Daphne

    2006-03-01

    Pericentromeres are exceptional genomic regions: in animals they contain extensive segmental duplications implicated in gene creation, and in plants they sustain rearrangements and insertions uncommon in euchromatin. To examine the mechanisms and patterns of plant pericentromere evolution, we compared pericentromere sequence from four Brassicaceae species separated by <15 million years (Myr). This flowering plant family is ideal for studying relationships between genome reorganization and pericentromere evolution-its members have undergone recent polyploidization and hybridization, with close relatives changing in genome size and chromosome number. Through sequence and hybridization analyses, we examined regions from Arabidopsis arenosa, Capsella rubella, and Olimarabidopsis pumila that are homologous to Arabidopsis thaliana pericentromeres (peri-CENs) III and V, and used FISH to demonstrate they have been maintained near centromere satellite arrays in each species. Sequence analysis revealed a set of highly conserved genes, yet we discovered substantial differences in intergenic length and species-specific changes in sequence content and gene density. We discovered that A. thaliana has undergone recent, significant expansions within its pericentromeres, in some cases measuring hundreds of kilobases; these findings are in marked contrast to euchromatic segments in these species that exhibit only minor length changes. While plant pericentromeres do contain some duplications, we did not find evidence of extensive segmental duplications, as has been documented in primates. Our data support a model in which plant pericentromeres may experience selective pressures distinct from euchromatin, tolerating rapid, dynamic changes in structure and sequence content, including large insertions of mobile elements, 5S rDNA arrays and pseudogenes.

  9. Revisiting the phylogeny of Zoanthidea (Cnidaria: Anthozoa): Staggered alignment of hypervariable sequences improves species tree inference.

    PubMed

    Swain, Timothy D

    2018-01-01

    The recent rapid proliferation of novel taxon identification in the Zoanthidea has been accompanied by a parallel propagation of gene trees as a tool of species discovery, but not a corresponding increase in our understanding of phylogeny. This disparity is caused by the trade-off between the capabilities of automated DNA sequence alignment and data content of genes applied to phylogenetic inference in this group. Conserved genes or segments are easily aligned across the order, but produce poorly resolved trees; hypervariable genes or segments contain the evolutionary signal necessary for resolution and robust support, but sequence alignment is daunting. Staggered alignments are a form of phylogeny-informed sequence alignment composed of a mosaic of local and universal regions that allow phylogenetic inference to be applied to all nucleotides from both hypervariable and conserved gene segments. Comparisons between species tree phylogenies inferred from all data (staggered alignment) and hypervariable-excluded data (standard alignment) demonstrate improved confidence and greater topological agreement with other sources of data for the complete-data tree. This novel phylogeny is the most comprehensive to date (in terms of taxa and data) and can serve as an expandable tool for evolutionary hypothesis testing in the Zoanthidea. Spanish language abstract available in Text S1. Translation by L. O. Swain, DePaul University, Chicago, Illinois, 60604, USA. Copyright © 2017 Elsevier Inc. All rights reserved.

  10. Simultaneous non-contiguous deletions using large synthetic DNA and site-specific recombinases

    PubMed Central

    Krishnakumar, Radha; Grose, Carissa; Haft, Daniel H.; Zaveri, Jayshree; Alperovich, Nina; Gibson, Daniel G.; Merryman, Chuck; Glass, John I.

    2014-01-01

    Toward achieving rapid and large scale genome modification directly in a target organism, we have developed a new genome engineering strategy that uses a combination of bioinformatics aided design, large synthetic DNA and site-specific recombinases. Using Cre recombinase we swapped a target 126-kb segment of the Escherichia coli genome with a 72-kb synthetic DNA cassette, thereby effectively eliminating over 54 kb of genomic DNA from three non-contiguous regions in a single recombination event. We observed complete replacement of the native sequence with the modified synthetic sequence through the action of the Cre recombinase and no competition from homologous recombination. Because of the versatility and high-efficiency of the Cre-lox system, this method can be used in any organism where this system is functional as well as adapted to use with other highly precise genome engineering systems. Compared to present-day iterative approaches in genome engineering, we anticipate this method will greatly speed up the creation of reduced, modularized and optimized genomes through the integration of deletion analyses data, transcriptomics, synthetic biology and site-specific recombination. PMID:24914053

  11. The fate of deleted DNA produced during programmed genomic deletion events in Tetrahymena thermophila.

    PubMed Central

    Saveliev, S V; Cox, M M

    1994-01-01

    Thousands of DNA deletion events occur during macronuclear development in the ciliate Tetrahymena thermophila. In two deleted genomic regions, designated M and R, the eliminated sequences form circles that can be detected by PCR. However, the circles are not normal products of the reaction pathway. The circular forms occur at very low levels in conjugating cells, but are stable. Sequencing analysis showed that many of the circles (as many as 50% of those examined) reflected a precise deletion in the M and R regions. The remaining circles were either smaller or larger and contained varying lengths of sequences derived from the chromosomal DNA surrounding the eliminated region. The chromosomal junctions left behind after deletion were more precise, although deletions in either the M or R regions can generate any of several alternative junctions (1). Some new chromosomal junctions were detected in the present study. The results suggest that the deleted segment is released as a linear DNA species that is degraded rapidly. The species is only rarely converted to the stable circles we detect. The deletion mechanism is different from those proposed for deletion events in hypotrichous ciliates (2-4), and does not reflect a conservative site-specific recombination process such as that promoted by the bacteriophage lambda integrase (5). Images PMID:7838724

  12. Argo_CUDA: Exhaustive GPU based approach for motif discovery in large DNA datasets.

    PubMed

    Vishnevsky, Oleg V; Bocharnikov, Andrey V; Kolchanov, Nikolay A

    2018-02-01

    The development of chromatin immunoprecipitation sequencing (ChIP-seq) technology has revolutionized the genetic analysis of the basic mechanisms underlying transcription regulation and led to accumulation of information about a huge amount of DNA sequences. There are a lot of web services which are currently available for de novo motif discovery in datasets containing information about DNA/protein binding. An enormous motif diversity makes their finding challenging. In order to avoid the difficulties, researchers use different stochastic approaches. Unfortunately, the efficiency of the motif discovery programs dramatically declines with the query set size increase. This leads to the fact that only a fraction of top "peak" ChIP-Seq segments can be analyzed or the area of analysis should be narrowed. Thus, the motif discovery in massive datasets remains a challenging issue. Argo_Compute Unified Device Architecture (CUDA) web service is designed to process the massive DNA data. It is a program for the detection of degenerate oligonucleotide motifs of fixed length written in 15-letter IUPAC code. Argo_CUDA is a full-exhaustive approach based on the high-performance GPU technologies. Compared with the existing motif discovery web services, Argo_CUDA shows good prediction quality on simulated sets. The analysis of ChIP-Seq sequences revealed the motifs which correspond to known transcription factor binding sites.

  13. SRY, like HMG1, recognizes sharp angles in DNA.

    PubMed Central

    Ferrari, S; Harley, V R; Pontiggia, A; Goodfellow, P N; Lovell-Badge, R; Bianchi, M E

    1992-01-01

    HMG boxes are DNA binding domains present in chromatin proteins, general transcription factors for nucleolar and mitochondrial RNA polymerases, and gene- and tissue-specific transcriptional regulators. The HMG boxes of HMG1, an abundant component of chromatin, interact specifically with four-way junctions, DNA structures that are cross-shaped and contain angles of approximately 60 and 120 degrees between their arms. We show here also that the HMG box of SRY, the protein that determines the expression of male-specific genes in humans, recognizes four-way junction DNAs irrespective of their sequence. In addition, when SRY binds to linear duplex DNA containing its specific target AACAAAG, it produces a sharp bend. Therefore, the interaction between HMG boxes and DNA appears to be predominantly structure-specific. The production of the recognition of a kink in DNA can serve several distinct functions, such as the repair of DNA lesions, the folding of DNA segments with bound transcriptional factors into productive complexes or the wrapping of DNA in chromatin. Images PMID:1425584

  14. Cytogenetic and Sequence Analyses of Mitochondrial DNA Insertions in Nuclear Chromosomes of Maize

    PubMed Central

    Lough, Ashley N.; Faries, Kaitlyn M.; Koo, Dal-Hoe; Hussain, Abid; Roark, Leah M.; Langewisch, Tiffany L.; Backes, Teresa; Kremling, Karl A. G.; Jiang, Jiming; Birchler, James A.; Newton, Kathleen J.

    2015-01-01

    The transfer of mitochondrial DNA (mtDNA) into nuclear genomes is a regularly occurring process that has been observed in many species. Few studies, however, have focused on the variation of nuclear-mtDNA sequences (NUMTs) within a species. This study examined mtDNA insertions within chromosomes of a diverse set of Zea mays ssp. mays (maize) inbred lines by the use of fluorescence in situ hybridization. A relatively large NUMT on the long arm of chromosome 9 (9L) was identified at approximately the same position in four inbred lines (B73, M825, HP301, and Oh7B). Further examination of the similarly positioned 9L NUMT in two lines, B73 and M825, indicated that the large size of these sites is due to the presence of a majority of the mitochondrial genome; however, only portions of this NUMT (∼252 kb total) were found in the publically available B73 nuclear sequence for chromosome 9. Fiber-fluorescence in situ hybridization analysis estimated the size of the B73 9L NUMT to be ∼1.8 Mb and revealed that the NUMT is methylated. Two regions of mtDNA (2.4 kb and 3.3 kb) within the 9L NUMT are not present in the B73 mitochondrial NB genome; however, these 2.4-kb and 3.3-kb segments are present in other Zea mitochondrial genomes, including that of Zea mays ssp. parviglumis, a progenitor of domesticated maize. PMID:26333837

  15. Recombination, rearrangement, reshuffling, and divergence in a centromeric region of rice.

    PubMed

    Ma, Jianxin; Bennetzen, Jeffrey L

    2006-01-10

    Centromeres have many unusual biological properties, including kinetochore attachment and severe repression of local meiotic recombination. These properties are partly an outcome, partly a cause, of unusual DNA structure in the centromeric region. Although several plant and animal genomes have been sequenced, most centromere sequences have not been completed or analyzed in depth. To shed light on the unique organization, variability, and evolution of centromeric DNA, detailed analysis of a 1.97-Mb sequence that includes centromere 8 (CEN8) of japonica rice was undertaken. Thirty-three long-terminal repeat (LTR)-retrotransposon families (including 11 previously unknown) were identified in the CEN8 region, totaling 245 elements and fragments that account for 67% of the region. The ratio of solo LTRs to intact elements in the CEN8 region is approximately 0.9:1, compared with approximately 2.2:1 in noncentromeric regions of rice. However, the ratio of solo LTRs to intact elements in the core of the CEN8 region ( approximately 2.5:1) is higher than in any other region investigated in rice, suggesting a hotspot for unequal recombination. Comparison of the CEN8 region of japonica and its orthologous segments from indica rice indicated that approximately 15% of the intact retrotransposons and solo LTRs were inserted into CEN8 after the divergence of japonica and indica from a common ancestor, compared with approximately 50% for previously studied euchromatic regions. Frequent DNA rearrangements were observed in the CEN8 region, including a 212-kb subregion that was found to be composed of three rearranged tandem repeats. Phylogenetic analysis also revealed recent segmental duplication and extensive rearrangement and reshuffling of the CentO satellite repeats.

  16. UrQt: an efficient software for the Unsupervised Quality trimming of NGS data.

    PubMed

    Modolo, Laurent; Lerat, Emmanuelle

    2015-04-29

    Quality control is a necessary step of any Next Generation Sequencing analysis. Although customary, this step still requires manual interventions to empirically choose tuning parameters according to various quality statistics. Moreover, current quality control procedures that provide a "good quality" data set, are not optimal and discard many informative nucleotides. To address these drawbacks, we present a new quality control method, implemented in UrQt software, for Unsupervised Quality trimming of Next Generation Sequencing reads. Our trimming procedure relies on a well-defined probabilistic framework to detect the best segmentation between two segments of unreliable nucleotides, framing a segment of informative nucleotides. Our software only requires one user-friendly parameter to define the minimal quality threshold (phred score) to consider a nucleotide to be informative, which is independent of both the experiment and the quality of the data. This procedure is implemented in C++ in an efficient and parallelized software with a low memory footprint. We tested the performances of UrQt compared to the best-known trimming programs, on seven RNA and DNA sequencing experiments and demonstrated its optimality in the resulting tradeoff between the number of trimmed nucleotides and the quality objective. By finding the best segmentation to delimit a segment of good quality nucleotides, UrQt greatly increases the number of reads and of nucleotides that can be retained for a given quality objective. UrQt source files, binary executables for different operating systems and documentation are freely available (under the GPLv3) at the following address: https://lbbe.univ-lyon1.fr/-UrQt-.html .

  17. Variability and repertoire size of T-cell receptor V alpha gene segments.

    PubMed

    Becker, D M; Pattern, P; Chien, Y; Yokota, T; Eshhar, Z; Giedlin, M; Gascoigne, N R; Goodnow, C; Wolf, R; Arai, K

    The immune system of higher organisms is composed largely of two distinct cell types, B lymphocytes and T lymphocytes, each of which is independently capable of recognizing an enormous number of distinct entities through their antigen receptors; surface immunoglobulin in the case of the former, and the T-cell receptor (TCR) in the case of the latter. In both cell types, the genes encoding the antigen receptors consist of multiple gene segments which recombine during maturation to produce many possible peptides. One striking difference between B- and T-cell recognition that has not yet been resolved by the structural data is the fact that T cells generally require a major histocompatibility determinant together with an antigen whereas, in most cases, antibodies recognize antigen alone. Recently, we and others have found that a series of TCR V beta gene sequences show conservation of many of the same residues that are conserved between heavy- and light-chain immunoglobulin V regions, and these V beta sequences are predicted to have an immunoglobulin-like secondary structure. To extend these studies, we have isolated and sequenced eight additional alpha-chain complementary cDNA clones and compared them with published sequences. Analyses of these sequences, reported here, indicate that V alpha regions have many of the characteristics of V beta gene segments but differ in that they almost always occur as cross-hybridizing gene families. We conclude that there may be very different selective pressures operating on V alpha and V beta sequences and that the V alpha repertoire may be considerably larger than that of V beta.

  18. Amino acid and nucleotide recurrence in aligned sequences: synonymous substitution patterns in association with global and local base compositions.

    PubMed

    Nishizawa, M; Nishizawa, K

    2000-10-01

    The tendency for repetitiveness of nucleotides in DNA sequences has been reported for a variety of organisms. We show that the tendency for repetitive use of amino acids is widespread and is observed even for segments conserved between human and Drosophila melanogaster at the level of >50% amino acid identity. This indicates that repetitiveness influences not only the weakly constrained segments but also those sequence segments conserved among phyla. Not only glutamine (Q) but also many of the 20 amino acids show a comparable level of repetitiveness. Repetitiveness in bases at codon position 3 is stronger for human than for D.melanogaster, whereas local repetitiveness in intron sequences is similar between the two organisms. While genes for immune system-specific proteins, but not ancient human genes (i.e. human homologs of Escherichia coli genes), have repetitiveness at codon bases 1 and 2, repetitiveness at codon base 3 for these groups is similar, suggesting that the human genome has at least two mechanisms generating local repetitiveness. Neither amino acid nor nucleotide repetitiveness is observed beyond the exon boundary, denying the possibility that such repetitiveness could mainly stem from natural selection on mRNA or protein sequences. Analyses of mammalian sequence alignments show that while the 'between gene' GC content heterogeneity, which is linked to 'isochores', is a principal factor associated with the bias in substitution patterns in human, 'within gene' heterogeneity in nucleotide composition is also associated with such bias on a more local scale. The relationship amongst the various types of repetitiveness is discussed.

  19. Amino acid and nucleotide recurrence in aligned sequences: synonymous substitution patterns in association with global and local base compositions

    PubMed Central

    Nishizawa, Manami; Nishizawa, Kazuhisa

    2000-01-01

    The tendency for repetitiveness of nucleotides in DNA sequences has been reported for a variety of organisms. We show that the tendency for repetitive use of amino acids is widespread and is observed even for segments conserved between human and Drosophila melanogaster at the level of >50% amino acid identity. This indicates that repetitiveness influences not only the weakly constrained segments but also those sequence segments conserved among phyla. Not only glutamine (Q) but also many of the 20 amino acids show a comparable level of repetitiveness. Repetitiveness in bases at codon position 3 is stronger for human than for D.melanogaster, whereas local repetitiveness in intron sequences is similar between the two organisms. While genes for immune system-specific proteins, but not ancient human genes (i.e. human homologs of Escherichia coli genes), have repetitiveness at codon bases 1 and 2, repetitiveness at codon base 3 for these groups is similar, suggesting that the human genome has at least two mechanisms generating local repetitiveness. Neither amino acid nor nucleotide repetitiveness is observed beyond the exon boundary, denying the possibility that such repetitiveness could mainly stem from natural selection on mRNA or protein sequences. Analyses of mammalian sequence alignments show that while the ‘between gene’ GC content heterogeneity, which is linked to ‘isochores’, is a principal factor associated with the bias in substitution patterns in human, ‘within gene’ heterogeneity in nucleotide composition is also associated with such bias on a more local scale. The relationship amongst the various types of repetitiveness is discussed. PMID:11000273

  20. Chromosomal instability mediated by non-B DNA: cruciform conformation and not DNA sequence is responsible for recurrent translocation in humans.

    PubMed

    Inagaki, Hidehito; Ohye, Tamae; Kogo, Hiroshi; Kato, Takema; Bolor, Hasbaira; Taniguchi, Mariko; Shaikh, Tamim H; Emanuel, Beverly S; Kurahashi, Hiroki

    2009-02-01

    Chromosomal aberrations have been thought to be random events. However, recent findings introduce a new paradigm in which certain DNA segments have the potential to adopt unusual conformations that lead to genomic instability and nonrandom chromosomal rearrangement. One of the best-studied examples is the palindromic AT-rich repeat (PATRR), which induces recurrent constitutional translocations in humans. Here, we established a plasmid-based model that promotes frequent intermolecular rearrangements between two PATRRs in HEK293 cells. In this model system, the proportion of PATRR plasmid that extrudes a cruciform structure correlates to the levels of rearrangement. Our data suggest that PATRR-mediated translocations are attributable to unusual DNA conformations that confer a common pathway for chromosomal rearrangements in humans.

  1. GRIM-Filter: Fast seed location filtering in DNA read mapping using processing-in-memory technologies.

    PubMed

    Kim, Jeremie S; Senol Cali, Damla; Xin, Hongyi; Lee, Donghyuk; Ghose, Saugata; Alser, Mohammed; Hassan, Hasan; Ergin, Oguz; Alkan, Can; Mutlu, Onur

    2018-05-09

    Seed location filtering is critical in DNA read mapping, a process where billions of DNA fragments (reads) sampled from a donor are mapped onto a reference genome to identify genomic variants of the donor. State-of-the-art read mappers 1) quickly generate possible mapping locations for seeds (i.e., smaller segments) within each read, 2) extract reference sequences at each of the mapping locations, and 3) check similarity between each read and its associated reference sequences with a computationally-expensive algorithm (i.e., sequence alignment) to determine the origin of the read. A seed location filter comes into play before alignment, discarding seed locations that alignment would deem a poor match. The ideal seed location filter would discard all poor match locations prior to alignment such that there is no wasted computation on unnecessary alignments. We propose a novel seed location filtering algorithm, GRIM-Filter, optimized to exploit 3D-stacked memory systems that integrate computation within a logic layer stacked under memory layers, to perform processing-in-memory (PIM). GRIM-Filter quickly filters seed locations by 1) introducing a new representation of coarse-grained segments of the reference genome, and 2) using massively-parallel in-memory operations to identify read presence within each coarse-grained segment. Our evaluations show that for a sequence alignment error tolerance of 0.05, GRIM-Filter 1) reduces the false negative rate of filtering by 5.59x-6.41x, and 2) provides an end-to-end read mapper speedup of 1.81x-3.65x, compared to a state-of-the-art read mapper employing the best previous seed location filtering algorithm. GRIM-Filter exploits 3D-stacked memory, which enables the efficient use of processing-in-memory, to overcome the memory bandwidth bottleneck in seed location filtering. We show that GRIM-Filter significantly improves the performance of a state-of-the-art read mapper. GRIM-Filter is a universal seed location filter that can be applied to any read mapper. We hope that our results provide inspiration for new works to design other bioinformatics algorithms that take advantage of emerging technologies and new processing paradigms, such as processing-in-memory using 3D-stacked memory devices.

  2. High Mitochondrial DNA Stability in B-Cell Chronic Lymphocytic Leukemia

    PubMed Central

    Cerezo, María; Bandelt, Hans-Jürgen; Martín-Guerrero, Idoia; Ardanaz, Maite; Vega, Ana; Carracedo, Ángel; García-Orad, África; Salas, Antonio

    2009-01-01

    Background Chronic Lymphocytic Leukemia (CLL) leads to progressive accumulation of lymphocytes in the blood, bone marrow, and lymphatic tissues. Previous findings have suggested that the mtDNA could play an important role in CLL. Methodology/Principal Findings The mitochondrial DNA (mtDNA) control-region was analyzed in lymphocyte cell DNA extracts and compared with their granulocyte counterpart extract of 146 patients suffering from B-Cell CLL; B-CLL (all recruited from the Basque country). Major efforts were undertaken to rule out methodological artefacts that would render a high false positive rate for mtDNA instabilities and thus lead to erroneous interpretation of sequence instabilities. Only twenty instabilities were finally confirmed, most of them affecting the homopolymeric stretch located in the second hypervariable segment (HVS-II) around position 310, which is well known to constitute an extreme mutational hotspot of length polymorphism, as these mutations are frequently observed in the general human population. A critical revision of the findings in previous studies indicates a lack of proper methodological standards, which eventually led to an overinterpretation of the role of the mtDNA in CLL tumorigenesis. Conclusions/Significance Our results suggest that mtDNA instability is not the primary causal factor in B-CLL. A secondary role of mtDNA mutations cannot be fully ruled out under the hypothesis that the progressive accumulation of mtDNA instabilities could finally contribute to the tumoral process. Recommendations are given that would help to minimize erroneous interpretation of sequencing results in mtDNA studies in tumorigenesis. PMID:19924307

  3. Recombinational hotspot specific to female meiosis in the mouse major histocompatibility complex.

    PubMed

    Shiroishi, T; Hanzawa, N; Sagai, T; Ishiura, M; Gojobori, T; Steinmetz, M; Moriwaki, K

    1990-01-01

    The wm7 haplotype of the major histocompatibility complex (MHC), derived from the Japanese wild mouse Mus musculus molossinus, enhances recombination specific to female meiosis in the K/A beta interval of the MHC. We have mapped crossover points of fifteen independent recombinants from genetic crosses of the wm7 and laboratory haplotypes. Most of them were confined to a short segment of approximately 1 kilobase (kb) of DNA between the A beta 3 and A beta 2 genes, indicating the presence of a female-specific recombinational hotspot. Its location overlaps with a sex-independent hotspot previously identified in the Mus musculus castaneus CAS3 haplotype. We have cloned and sequenced DNA fragments surrounding the hotspot from the wm7 haplotype and the corresponding regions from the hotspot-negative B10.A and C57BL/10 strains. There is no significant difference between the sequences of these three strains, or between these and the published sequences of the CAS3 and C57BL/6 strains. However, a comparison of this A beta 3/A beta 2 hotspot with a previously characterized hotspot in the E beta gene revealed that they have a very similar molecular organization. Each hotspot consists of two elements, the consensus sequence of the mouse middle repetitive MT family and the tetrameric repeated sequences, which are separated by 1 kb of DNA.

  4. Assessing the Robustness of Complete Bacterial Genome Segmentations

    NASA Astrophysics Data System (ADS)

    Devillers, Hugo; Chiapello, Hélène; Schbath, Sophie; El Karoui, Meriem

    Comparison of closely related bacterial genomes has revealed the presence of highly conserved sequences forming a "backbone" that is interrupted by numerous, less conserved, DNA fragments. Segmentation of bacterial genomes into backbone and variable regions is particularly useful to investigate bacterial genome evolution. Several software tools have been designed to compare complete bacterial chromosomes and a few online databases store pre-computed genome comparisons. However, very few statistical methods are available to evaluate the reliability of these software tools and to compare the results obtained with them. To fill this gap, we have developed two local scores to measure the robustness of bacterial genome segmentations. Our method uses a simulation procedure based on random perturbations of the compared genomes. The scores presented in this paper are simple to implement and our results show that they allow to discriminate easily between robust and non-robust bacterial genome segmentations when using aligners such as MAUVE and MGA.

  5. The basic helix-loop-helix region of the transcriptional repressor hairy and enhancer of split 1 is preorganized to bind DNA.

    PubMed

    Popovic, Matija; Wienk, Hans; Coglievina, Maristella; Boelens, Rolf; Pongor, Sándor; Pintar, Alessandro

    2014-04-01

    Hairy and enhancer of split 1, one of the main downstream effectors in Notch signaling, is a transcriptional repressor of the basic helix-loop-helix (bHLH) family. Using nuclear magnetic resonance methods, we have determined the structure and dynamics of a recombinant protein, H1H, which includes an N-terminal segment, b1, containing functionally important phosphorylation sites, the basic region b2, required for binding to DNA, and the HLH domain. We show that a proline residue in the sequence divides the protein in two parts, a flexible and disordered N-terminal region including b1 and a structured, mainly helical region comprising b2 and the HLH domain. Binding of H1H to a double strand DNA oligonucleotide was monitored through the chemical shift perturbation of backbone amide resonances, and showed that the interaction surface involves not only the b2 segment but also several residues in the b1 and HLH regions. Copyright © 2014 Wiley Periodicals, Inc.

  6. High Quality Maize Centromere 10 Sequence Reveals Evidence of Frequent Recombination Events

    PubMed Central

    Wolfgruber, Thomas K.; Nakashima, Megan M.; Schneider, Kevin L.; Sharma, Anupma; Xie, Zidian; Albert, Patrice S.; Xu, Ronghui; Bilinski, Paul; Dawe, R. Kelly; Ross-Ibarra, Jeffrey; Birchler, James A.; Presting, Gernot G.

    2016-01-01

    The ancestral centromeres of maize contain long stretches of the tandemly arranged CentC repeat. The abundance of tandem DNA repeats and centromeric retrotransposons (CR) has presented a significant challenge to completely assembling centromeres using traditional sequencing methods. Here, we report a nearly complete assembly of the 1.85 Mb maize centromere 10 from inbred B73 using PacBio technology and BACs from the reference genome project. The error rates estimated from overlapping BAC sequences are 7 × 10−6 and 5 × 10−5 for mismatches and indels, respectively. The number of gaps in the region covered by the reassembly was reduced from 140 in the reference genome to three. Three expressed genes are located between 92 and 477 kb from the inferred ancestral CentC cluster, which lies within the region of highest centromeric repeat density. The improved assembly increased the count of full-length CR from 5 to 55 and revealed a 22.7 kb segmental duplication that occurred approximately 121,000 years ago. Our analysis provides evidence of frequent recombination events in the form of partial retrotransposons, deletions within retrotransposons, chimeric retrotransposons, segmental duplications including higher order CentC repeats, a deleted CentC monomer, centromere-proximal inversions, and insertion of mitochondrial sequences. Double-strand DNA break (DSB) repair is the most plausible mechanism for these events and may be the major driver of centromere repeat evolution and diversity. In many cases examined here, DSB repair appears to be mediated by microhomology, suggesting that tandem repeats may have evolved to efficiently repair frequent DSBs in centromeres. PMID:27047500

  7. DNA Barcoding in the Cycadales: Testing the Potential of Proposed Barcoding Markers for Species Identification of Cycads

    PubMed Central

    Sass, Chodon; Little, Damon P.; Stevenson, Dennis Wm.; Specht, Chelsea D.

    2007-01-01

    Barcodes are short segments of DNA that can be used to uniquely identify an unknown specimen to species, particularly when diagnostic morphological features are absent. These sequences could offer a new forensic tool in plant and animal conservation—especially for endangered species such as members of the Cycadales. Ideally, barcodes could be used to positively identify illegally obtained material even in cases where diagnostic features have been purposefully removed or to release confiscated organisms into the proper breeding population. In order to be useful, a DNA barcode sequence must not only easily PCR amplify with universal or near-universal reaction conditions and primers, but also contain enough variation to generate unique identifiers at either the species or population levels. Chloroplast regions suggested by the Plant Working Group of the Consortium for the Barcode of Life (CBoL), and two alternatives, the chloroplast psbA-trnH intergenic spacer and the nuclear ribosomal internal transcribed spacer (nrITS), were tested for their utility in generating unique identifiers for members of the Cycadales. Ease of amplification and sequence generation with universal primers and reaction conditions was determined for each of the seven proposed markers. While none of the proposed markers provided unique identifiers for all species tested, nrITS showed the most promise in terms of variability, although sequencing difficulties remain a drawback. We suggest a workflow for DNA barcoding, including database generation and management, which will ultimately be necessary if we are to succeed in establishing a universal DNA barcode for plants. PMID:17987130

  8. Random and Non-Random Monoallelic Expression

    PubMed Central

    Chess, Andrew

    2013-01-01

    Monoallelic expression poses an intriguing problem in epigenetics because it requires the unequal treatment of two segments of DNA that are present in the same nucleus and which can have absolutely identical sequences. This review will consider different known types of monoallelic expression. For all monoallelically expressed genes, their respective allele-specific patterns of expression have the potential to affect brain function and dysfunction. PMID:22763620

  9. Colorimetric Detection of Specific DNA Segments Amplified by Polymerase Chain Reactions

    NASA Astrophysics Data System (ADS)

    Kemp, David J.; Smith, Donald B.; Foote, Simon J.; Samaras, N.; Peterson, M. Gregory

    1989-04-01

    The polymerase chain reaction (PCR) procedure has many potential applications in mass screening. We describe here a general assay for colorimetric detection of amplified DNA. The target DNA is first amplified by PCR, and then a second set of oligonucleotides, nested between the first two, is incorporated by three or more PCR cycles. These oligonucleotides bear ligands: for example, one can be biotinylated and the other can contain a site for a double-stranded DNA-binding protein. After linkage to an immobilized affinity reagent (such as a cloned DNA-binding protein, which we describe here) and labeling with a second affinity reagent (for example, avidin) linked to horseradish peroxidase, reaction with a chromogenic substrate allows detection of the amplified DNA. This amplified DNA assay (ADA) is rapid, is readily applicable to mass screening, and uses routine equipment. We show here that it can be used to detect human immunodeficiency virus sequences specifically against a background of human DNA.

  10. The 193-base pair Gsg2 (haspin) promoter region regulates germ cell-specific expression bidirectionally and synchronously.

    PubMed

    Tokuhiro, Keizo; Miyagawa, Yasushi; Yamada, Shuichi; Hirose, Mika; Ohta, Hiroshi; Nishimune, Yoshitake; Tanaka, Hiromitsu

    2007-03-01

    Haspin is a unique protein kinase expressed predominantly in haploid male germ cells. The genomic structure of haspin (Gsg2) has revealed it to be intronless, and the entire transcription unit is in an intron of the integrin alphaE (Itgae) gene. Transcription occurs from a bidirectional promoter that also generates an alternatively spliced integrin alphaE-derived mRNA (Aed). In mice, the testis-specific alternative splicing of Aed is expressed bidirectionally downstream from the Gsg2 transcription initiation site, and a segment consisting of 26 bp transcribes both genomic DNA strands between Gsg2 and the Aed transcription initiation sites. To investigate the mechanisms for this unique gene regulation, we cloned and characterized the Gsg2 promoter region. The 193-bp genomic fragment from the 5' end of the Gsg2 and Aed genes, fused with EGFP and DsRed genes, drove the expression of both proteins in haploid germ cells of transgenic mice. This promoter element contained only a GC-rich sequence, and not the previously reported DNA sequences known to bind various transcription factors--with the exception of E2F1, TCFAP2A1 (AP2), and SP1. Here, we show that the 193-bp DNA sequence is sufficient for the specific, bidirectional, and synchronous expression in germ cells in the testis. We also demonstrate the existence of germ cell nuclear factors specifically bound to the promoter sequence. This activity may be regulated by binding to the promoter sequence with germ cell-specific nuclear complex(es) without regulation via DNA methylation.

  11. Functional display of platelet-binding VWF fragments on filamentous bacteriophage.

    PubMed

    Yee, Andrew; Tan, Fen-Lai; Ginsburg, David

    2013-01-01

    von Willebrand factor (VWF) tethers platelets to sites of vascular injury via interaction with the platelet surface receptor, GPIb. To further define the VWF sequences required for VWF-platelet interaction, a phage library displaying random VWF protein fragments was screened against formalin-fixed platelets. After 3 rounds of affinity selection, DNA sequencing of platelet-bound clones identified VWF peptides mapping exclusively to the A1 domain. Aligning these sequences defined a minimal, overlapping segment spanning P1254-A1461, which encompasses the C1272-C1458 cystine loop. Analysis of phage carrying a mutated A1 segment (C1272/1458A) confirmed the requirement of the cystine loop for optimal binding. Four rounds of affinity maturation of a randomly mutagenized A1 phage library identified 10 and 14 unique mutants associated with enhanced platelet binding in the presence and absence of botrocetin, respectively, with 2 mutants (S1370G and I1372V) common to both conditions. These results demonstrate the utility of filamentous phage for studying VWF protein structure-function and identify a minimal, contiguous peptide that bind to formalin-fixed platelets, confirming the importance of the VWF A1 domain with no evidence for another independently platelet-binding segment within VWF. These findings also point to key structural elements within the A1 domain that regulate VWF-platelet adhesion.

  12. CNV-CH: A Convex Hull Based Segmentation Approach to Detect Copy Number Variations (CNV) Using Next-Generation Sequencing Data

    PubMed Central

    De, Rajat K.

    2015-01-01

    Copy number variation (CNV) is a form of structural alteration in the mammalian DNA sequence, which are associated with many complex neurological diseases as well as cancer. The development of next generation sequencing (NGS) technology provides us a new dimension towards detection of genomic locations with copy number variations. Here we develop an algorithm for detecting CNVs, which is based on depth of coverage data generated by NGS technology. In this work, we have used a novel way to represent the read count data as a two dimensional geometrical point. A key aspect of detecting the regions with CNVs, is to devise a proper segmentation algorithm that will distinguish the genomic locations having a significant difference in read count data. We have designed a new segmentation approach in this context, using convex hull algorithm on the geometrical representation of read count data. To our knowledge, most algorithms have used a single distribution model of read count data, but here in our approach, we have considered the read count data to follow two different distribution models independently, which adds to the robustness of detection of CNVs. In addition, our algorithm calls CNVs based on the multiple sample analysis approach resulting in a low false discovery rate with high precision. PMID:26291322

  13. CNV-CH: A Convex Hull Based Segmentation Approach to Detect Copy Number Variations (CNV) Using Next-Generation Sequencing Data.

    PubMed

    Sinha, Rituparna; Samaddar, Sandip; De, Rajat K

    2015-01-01

    Copy number variation (CNV) is a form of structural alteration in the mammalian DNA sequence, which are associated with many complex neurological diseases as well as cancer. The development of next generation sequencing (NGS) technology provides us a new dimension towards detection of genomic locations with copy number variations. Here we develop an algorithm for detecting CNVs, which is based on depth of coverage data generated by NGS technology. In this work, we have used a novel way to represent the read count data as a two dimensional geometrical point. A key aspect of detecting the regions with CNVs, is to devise a proper segmentation algorithm that will distinguish the genomic locations having a significant difference in read count data. We have designed a new segmentation approach in this context, using convex hull algorithm on the geometrical representation of read count data. To our knowledge, most algorithms have used a single distribution model of read count data, but here in our approach, we have considered the read count data to follow two different distribution models independently, which adds to the robustness of detection of CNVs. In addition, our algorithm calls CNVs based on the multiple sample analysis approach resulting in a low false discovery rate with high precision.

  14. A new species of Pseudopaludicola (Anura, Leiuperinae) from Espírito Santo, Brazil

    PubMed Central

    Baldo, Diego; Pupin, Nadya; Gasparini, João Luiz; Baptista Haddad, Célio F.

    2018-01-01

    We describe a new anuran species of the genus Pseudopaludicola that inhabits sandy areas in resting as associated to the Atlantic Forest biome in the state of Espírito Santo, Brazil. The new species is characterized by: SVL 11.7–14.6 mm in males, 14.0–16.7 mm in females; body slender; fingertips knobbed, with a central groove; hindlimbs short; abdominal fold complete; arytenoid cartilages wide; prepollex with base and two segments; prehallux with base and one segment; frontoparietal fontanelle partially exposed; advertisement call with one note composed of two isolated pulses per call; call dominant frequency ranging 4,380–4,884 Hz; diploid chromosome number 22; and Ag-NORs on 8q subterminal. In addition, its 16S rDNA sequence shows high genetic distances when compared to sequences of related species, which provides strong evidence that the new species is an independent lineage. PMID:29785347

  15. Single-molecule Protein Unfolding in Solid State Nanopores

    PubMed Central

    Talaga, David S.; Li, Jiali

    2009-01-01

    We use single silicon nitride nanopores to study folded, partially folded and unfolded single proteins by measuring their excluded volumes. The DNA-calibrated translocation signals of β-lactoglobulin and histidine-containing phosphocarrier protein match quantitatively with that predicted by a simple sum of the partial volumes of the amino acids in the polypeptide segment inside the pore when translocation stalls due to the primary charge sequence. Our analysis suggests that the majority of the protein molecules were linear or looped during translocation and that the electrical forces present under physiologically relevant potentials can unfold proteins. Our results show that the nanopore translocation signals are sensitive enough to distinguish the folding state of a protein and distinguish between proteins based on the excluded volume of a local segment of the polypeptide chain that transiently stalls in the nanopore due to the primary sequence of charges. PMID:19530678

  16. Molecular identification of Trichuris vulpis and Trichuris suis isolated from different hosts.

    PubMed

    Cutillas, Cristina; de Rojas, Manuel; Ariza, Concepción; Ubeda, José Manuel; Guevara, Diego

    2007-01-01

    Trichuris suis was isolated from the cecum of two different hosts (Sus scrofa domestica -- swine and Sus scrofa scrofa -- wild boar) and Trichuris vulpis from dogs in Sevilla, Spain. Genomic DNA was isolated and internal transcribed spacers (ITS)1-5.8S-ITS2 segment from the ribosomal DNA (rDNA) was amplified and sequenced using polymerase chain reaction techniques. The sequence of T. suis from both hosts was 1,396 bp in length while that of T. vulpis was 1,044 bp. ITS1 of both populations isolated of T. suis was 661 nucleotides in length, while the ITS2 was 534 nucleotides in length. Furthermore, the ITS1 of T. vulpis was 410 nucleotides in length, while the ITS2 was 433 nucleotides in length. One hundred fifty-four nucleotides were observed along the 5.8S gene of T. suis and T. vulpis. Intraindividual and intraspecific variations were detected in the rDNA of both species. The presence of microsatellites was observed in all the individuals assayed. Sequence analysis of the ITSs and the 5.8S gene has demonstrated no sequence differences between T. suis isolated from both hosts (S. scrofa domestica -- swine and S. scrofa scrofa -- wild boar). Nevertheless, clear differences were detected between the ITS1 and ITS2 of T. suis and T. vulpis. Furthermore, a comparative molecular analysis between both species and the previously published ITS1-5.8S-ITS2 sequence data of Trichuris ovis, Trichuris leporis, Trichuris muris, Trichuris arvicolae, and Trichuris skrjabini was carried out. A common homology zone was detected in the ITS1 sequence of all species of trichurids.

  17. Identification and analysis of pig chimeric mRNAs using RNA sequencing data

    PubMed Central

    2012-01-01

    Background Gene fusion is ubiquitous over the course of evolution. It is expected to increase the diversity and complexity of transcriptomes and proteomes through chimeric sequence segments or altered regulation. However, chimeric mRNAs in pigs remain unclear. Here we identified some chimeric mRNAs in pigs and analyzed the expression of them across individuals and breeds using RNA-sequencing data. Results The present study identified 669 putative chimeric mRNAs in pigs, of which 251 chimeric candidates were detected in a set of RNA-sequencing data. The 618 candidates had clear trans-splicing sites, 537 of which obeyed the canonical GU-AG splice rule. Only two putative pig chimera variants whose fusion junction was overlapped with that of a known human chimeric mRNA were found. A set of unique chimeric events were considered middle variances in the expression across individuals and breeds, and revealed non-significant variance between sexes. Furthermore, the genomic region of the 5′ partner gene shares a similar DNA sequence with that of the 3′ partner gene for 458 putative chimeric mRNAs. The 81 of those shared DNA sequences significantly matched the known DNA-binding motifs in the JASPAR CORE database. Four DNA motifs shared in parental genomic regions had significant similarity with known human CTCF binding sites. Conclusions The present study provided detailed information on some pig chimeric mRNAs. We proposed a model that trans-acting factors, such as CTCF, induced the spatial organisation of parental genes to the same transcriptional factory so that parental genes were coordinatively transcribed to give birth to chimeric mRNAs. PMID:22925561

  18. Fast discovery and visualization of conserved regions in DNA sequences using quasi-alignment

    PubMed Central

    2013-01-01

    Background Next Generation Sequencing techniques are producing enormous amounts of biological sequence data and analysis becomes a major computational problem. Currently, most analysis, especially the identification of conserved regions, relies heavily on Multiple Sequence Alignment and its various heuristics such as progressive alignment, whose run time grows with the square of the number and the length of the aligned sequences and requires significant computational resources. In this work, we present a method to efficiently discover regions of high similarity across multiple sequences without performing expensive sequence alignment. The method is based on approximating edit distance between segments of sequences using p-mer frequency counts. Then, efficient high-throughput data stream clustering is used to group highly similar segments into so called quasi-alignments. Quasi-alignments have numerous applications such as identifying species and their taxonomic class from sequences, comparing sequences for similarities, and, as in this paper, discovering conserved regions across related sequences. Results In this paper, we show that quasi-alignments can be used to discover highly similar segments across multiple sequences from related or different genomes efficiently and accurately. Experiments on a large number of unaligned 16S rRNA sequences obtained from the Greengenes database show that the method is able to identify conserved regions which agree with known hypervariable regions in 16S rRNA. Furthermore, the experiments show that the proposed method scales well for large data sets with a run time that grows only linearly with the number and length of sequences, whereas for existing multiple sequence alignment heuristics the run time grows super-linearly. Conclusion Quasi-alignment-based algorithms can detect highly similar regions and conserved areas across multiple sequences. Since the run time is linear and the sequences are converted into a compact clustering model, we are able to identify conserved regions fast or even interactively using a standard PC. Our method has many potential applications such as finding characteristic signature sequences for families of organisms and studying conserved and variable regions in, for example, 16S rRNA. PMID:24564200

  19. Fast discovery and visualization of conserved regions in DNA sequences using quasi-alignment.

    PubMed

    Nagar, Anurag; Hahsler, Michael

    2013-01-01

    Next Generation Sequencing techniques are producing enormous amounts of biological sequence data and analysis becomes a major computational problem. Currently, most analysis, especially the identification of conserved regions, relies heavily on Multiple Sequence Alignment and its various heuristics such as progressive alignment, whose run time grows with the square of the number and the length of the aligned sequences and requires significant computational resources. In this work, we present a method to efficiently discover regions of high similarity across multiple sequences without performing expensive sequence alignment. The method is based on approximating edit distance between segments of sequences using p-mer frequency counts. Then, efficient high-throughput data stream clustering is used to group highly similar segments into so called quasi-alignments. Quasi-alignments have numerous applications such as identifying species and their taxonomic class from sequences, comparing sequences for similarities, and, as in this paper, discovering conserved regions across related sequences. In this paper, we show that quasi-alignments can be used to discover highly similar segments across multiple sequences from related or different genomes efficiently and accurately. Experiments on a large number of unaligned 16S rRNA sequences obtained from the Greengenes database show that the method is able to identify conserved regions which agree with known hypervariable regions in 16S rRNA. Furthermore, the experiments show that the proposed method scales well for large data sets with a run time that grows only linearly with the number and length of sequences, whereas for existing multiple sequence alignment heuristics the run time grows super-linearly. Quasi-alignment-based algorithms can detect highly similar regions and conserved areas across multiple sequences. Since the run time is linear and the sequences are converted into a compact clustering model, we are able to identify conserved regions fast or even interactively using a standard PC. Our method has many potential applications such as finding characteristic signature sequences for families of organisms and studying conserved and variable regions in, for example, 16S rRNA.

  20. Principles of regulatory information conservation between mouse and human

    DOE PAGES

    Cheng, Yong; Ma, Zhihai; Kim, Bong-Hyun; ...

    2014-11-19

    To broaden our understanding of the evolution of gene regulation mechanisms, we generated occupancy profiles for 34 orthologous transcription factors (TFs) in human–mouse erythroid progenitor, lymphoblast and embryonic stem-cell lines. By combining the genome-wide transcription factor occupancy repertoires, associated epigenetic signals, and co-association patterns, here we deduce several evolutionary principles of gene regulatory features operating since the mouse and human lineages diverged. The genomic distribution profiles, primary binding motifs, chromatin states, and DNA methylation preferences are well conserved for TF-occupied sequences. However, the extent to which orthologous DNA segments are bound by orthologous TFs varies both among TFs and withmore » genomic location: binding at promoters is more highly conserved than binding at distal elements. Notably, occupancy-conserved TF-occupied sequences tend to be pleiotropic; they function in several tissues and also co-associate with many TFs. Lastly, single nucleotide variants at sites with potential regulatory functions are enriched in occupancy-conserved TF-occupied sequences.« less

  1. Single-Cell-Based Platform for Copy Number Variation Profiling through Digital Counting of Amplified Genomic DNA Fragments.

    PubMed

    Li, Chunmei; Yu, Zhilong; Fu, Yusi; Pang, Yuhong; Huang, Yanyi

    2017-04-26

    We develop a novel single-cell-based platform through digital counting of amplified genomic DNA fragments, named multifraction amplification (mfA), to detect the copy number variations (CNVs) in a single cell. Amplification is required to acquire genomic information from a single cell, while introducing unavoidable bias. Unlike prevalent methods that directly infer CNV profiles from the pattern of sequencing depth, our mfA platform denatures and separates the DNA molecules from a single cell into multiple fractions of a reaction mix before amplification. By examining the sequencing result of each fraction for a specific fragment and applying a segment-merge maximum likelihood algorithm to the calculation of copy number, we digitize the sequencing-depth-based CNV identification and thus provide a method that is less sensitive to the amplification bias. In this paper, we demonstrate a mfA platform through multiple displacement amplification (MDA) chemistry. When performing the mfA platform, the noise of MDA is reduced; therefore, the resolution of single-cell CNV identification can be improved to 100 kb. We can also determine the genomic region free of allelic drop-out with mfA platform, which is impossible for conventional single-cell amplification methods.

  2. Self-entanglement of long linear DNA vectors using transient non-B-DNA attachment points: a new concept for improvement of non-viral therapeutic gene delivery.

    PubMed

    Tolmachov, Oleg E

    2012-05-01

    The cell-specific and long-term expression of therapeutic transgenes often requires a full array of native gene control elements including distal enhancers, regulatory introns and chromatin organisation sequences. The delivery of such extended gene expression modules to human cells can be accomplished with non-viral high-molecular-weight DNA vectors, in particular with several classes of linear DNA vectors. All high-molecular-weight DNA vectors are susceptible to damage by shear stress, and while for some of the vectors the harmful impact of shear stress can be minimised through the transformation of the vectors to compact topological configurations by supercoiling and/or knotting, linear DNA vectors with terminal loops or covalently attached terminal proteins cannot be self-compacted in this way. In this case, the only available self-compacting option is self-entangling, which can be defined as the folding of single DNA molecules into a configuration with mutual restriction of molecular motion by the individual segments of bent DNA. A negatively charged phosphate backbone makes DNA self-repulsive, so it is reasonable to assume that a certain number of 'sticky points' dispersed within DNA could facilitate the entangling by bringing DNA segments into proximity and by interfering with the DNA slipping away from the entanglement. I propose that the spontaneous entanglement of vector DNA can be enhanced by the interlacing of the DNA with sites capable of mutual transient attachment through the formation of non-B-DNA forms, such as interacting cruciform structures, inter-segment triplexes, slipped-strand DNA, left-handed duplexes (Z-forms) or G-quadruplexes. It is expected that the non-B-DNA based entanglement of the linear DNA vectors would consist of the initial transient and co-operative non-B-DNA mediated binding events followed by tight self-ensnarement of the vector DNA. Once in the nucleoplasm of the target human cells, the DNA can be disentangled by type II topoisomerases. The technology for such self-entanglement can be an avenue for the improvement of gene delivery with high-molecular-weight naked DNA using therapeutically important methods associated with considerable shear stress. Priority applications include in vivo muscle electroporation and sonoporation for Duchenne muscular dystrophy patients, aerosol inhalation to reach the target lung cells of cystic fibrosis patients and bio-ballistic delivery to skin melanomas with the vector DNA adsorbed on gold or tungsten projectiles. Copyright © 2012 Elsevier Ltd. All rights reserved.

  3. The impact of targeting repetitive BamHI-W sequences on the sensitivity and precision of EBV DNA quantification.

    PubMed

    Sanosyan, Armen; Fayd'herbe de Maudave, Alexis; Bollore, Karine; Zimmermann, Valérie; Foulongne, Vincent; Van de Perre, Philippe; Tuaillon, Edouard

    2017-01-01

    Viral load monitoring and early Epstein-Barr virus (EBV) DNA detection are essential in routine laboratory testing, especially in preemptive management of Post-transplant Lymphoproliferative Disorder. Targeting the repetitive BamHI-W sequence was shown to increase the sensitivity of EBV DNA quantification, but the variability of BamHI-W reiterations was suggested to be a source of quantification bias. We aimed to assess the extent of variability associated with BamHI-W PCR and its impact on the sensitivity of EBV DNA quantification using the 1st WHO international standard, EBV strains and clinical samples. Repetitive BamHI-W- and LMP2 single- sequences were amplified by in-house qPCRs and BXLF-1 sequence by a commercial assay (EBV R-gene™, BioMerieux). Linearity and limits of detection of in-house methods were assessed. The impact of repeated versus single target sequences on EBV DNA quantification precision was tested on B95.8 and Raji cell lines, possessing 11 and 7 copies of the BamHI-W sequence, respectively, and on clinical samples. BamHI-W qPCR demonstrated a lower limit of detection compared to LMP2 qPCR (2.33 log10 versus 3.08 log10 IU/mL; P = 0.0002). BamHI-W qPCR underestimated the EBV DNA load on Raji strain which contained fewer BamHI-W copies than the WHO standard derived from the B95.8 EBV strain (mean bias: - 0.21 log10; 95% CI, -0.54 to 0.12). Comparison of BamHI-W qPCR versus LMP2 and BXLF-1 qPCR showed an acceptable variability between EBV DNA levels in clinical samples with the mean bias being within 0.5 log10 IU/mL EBV DNA, whereas a better quantitative concordance was observed between LMP2 and BXLF-1 assays. Targeting BamHI-W resulted to a higher sensitivity compared to LMP2 but the variable reiterations of BamHI-W segment are associated with higher quantification variability. BamHI-W can be considered for clinical and therapeutic monitoring to detect an early EBV DNA and a dynamic change in viral load.

  4. The impact of targeting repetitive BamHI-W sequences on the sensitivity and precision of EBV DNA quantification

    PubMed Central

    Fayd’herbe de Maudave, Alexis; Bollore, Karine; Zimmermann, Valérie; Foulongne, Vincent; Van de Perre, Philippe; Tuaillon, Edouard

    2017-01-01

    Background Viral load monitoring and early Epstein-Barr virus (EBV) DNA detection are essential in routine laboratory testing, especially in preemptive management of Post-transplant Lymphoproliferative Disorder. Targeting the repetitive BamHI-W sequence was shown to increase the sensitivity of EBV DNA quantification, but the variability of BamHI-W reiterations was suggested to be a source of quantification bias. We aimed to assess the extent of variability associated with BamHI-W PCR and its impact on the sensitivity of EBV DNA quantification using the 1st WHO international standard, EBV strains and clinical samples. Methods Repetitive BamHI-W- and LMP2 single- sequences were amplified by in-house qPCRs and BXLF-1 sequence by a commercial assay (EBV R-gene™, BioMerieux). Linearity and limits of detection of in-house methods were assessed. The impact of repeated versus single target sequences on EBV DNA quantification precision was tested on B95.8 and Raji cell lines, possessing 11 and 7 copies of the BamHI-W sequence, respectively, and on clinical samples. Results BamHI-W qPCR demonstrated a lower limit of detection compared to LMP2 qPCR (2.33 log10 versus 3.08 log10 IU/mL; P = 0.0002). BamHI-W qPCR underestimated the EBV DNA load on Raji strain which contained fewer BamHI-W copies than the WHO standard derived from the B95.8 EBV strain (mean bias: - 0.21 log10; 95% CI, -0.54 to 0.12). Comparison of BamHI-W qPCR versus LMP2 and BXLF-1 qPCR showed an acceptable variability between EBV DNA levels in clinical samples with the mean bias being within 0.5 log10 IU/mL EBV DNA, whereas a better quantitative concordance was observed between LMP2 and BXLF-1 assays. Conclusions Targeting BamHI-W resulted to a higher sensitivity compared to LMP2 but the variable reiterations of BamHI-W segment are associated with higher quantification variability. BamHI-W can be considered for clinical and therapeutic monitoring to detect an early EBV DNA and a dynamic change in viral load. PMID:28850597

  5. Exploring the environmental diversity of kinetoplastid flagellates in the high-throughput DNA sequencing era

    PubMed Central

    d’Avila-Levy, Claudia Masini; Boucinha, Carolina; Kostygov, Alexei; Santos, Helena Lúcia Carneiro; Morelli, Karina Alessandra; Grybchuk-Ieremenko, Anastasiia; Duval, Linda; Votýpka, Jan; Yurchenko, Vyacheslav; Grellier, Philippe; Lukeš, Julius

    2015-01-01

    The class Kinetoplastea encompasses both free-living and parasitic species from a wide range of hosts. Several representatives of this group are responsible for severe human diseases and for economic losses in agriculture and livestock. While this group encompasses over 30 genera, most of the available information has been derived from the vertebrate pathogenic genera Leishmaniaand Trypanosoma. Recent studies of the previously neglected groups of Kinetoplastea indicated that the actual diversity is much higher than previously thought. This article discusses the known segment of kinetoplastid diversity and how gene-directed Sanger sequencing and next-generation sequencing methods can help to deepen our knowledge of these interesting protists. PMID:26602872

  6. Laser mass spectrometry for DNA fingerprinting for forensic applications

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Chen, C.H.; Tang, K.; Taranenko, N.I.

    The application of DNA fingerprinting has become very broad in forensic analysis, patient identification, diagnostic medicine, and wildlife poaching, since every individual`s DNA structure is identical within all tissues of their body. DNA fingerprinting was initiated by the use of restriction fragment length polymorphisms (RFLP). In 1987, Nakamura et al. found that a variable number of tandem repeats (VNTR) often occurred in the alleles. The probability of different individuals having the same number of tandem repeats in several different alleles is very low. Thus, the identification of VNTR from genomic DNA became a very reliable method for identification of individuals.more » DNA fingerprinting is a reliable tool for forensic analysis. In DNA fingerprinting, knowledge of the sequence of tandem repeats and restriction endonuclease sites can provide the basis for identification. The major steps for conventional DNA fingerprinting include (1) specimen processing (2) amplification of selected DNA segments by PCR, and (3) gel electrophoresis to do the final DNA analysis. In this work we propose to use laser desorption mass spectrometry for fast DNA fingerprinting. The process and advantages are discussed.« less

  7. A rare variant of the mtDNA HVS1 sequence in the hairs of Napoléon's family.

    PubMed

    Lucotte, Gérard

    2010-10-04

    This paper describes the finding of a rare variant in the sequence of the hypervariable segment (HVS1) of mitochondrial (mtDNA) extracted from two preserved hairs, authenticated as belonging to the French Emperor Napoléon I (Napoléon Bonaparte). This rare variant is a mutation that changes the base C to T at position 16,184 (16184C→T), and it constitutes the only mutation found in this HVS1 sequence. This mutation is rare, because it was not found in a reference database (P < 0.05). In a personal database (M. Pala) comprising 37,000 different sequences, the 16184C→T mutation was found in only three samples, thus in this database the mutation frequency was 0.00008%. This mutation 16184C→T was also the only variant found subsequently in the HVS1 sequences of mtDNAs extracted from Napoléon's mother (Letizia) and from his youngest sister (Caroline), confirming that this mutation is maternally inherited. This 16184C→T variant could be used for genetic verification to authenticate any doubtful material and determine whether it should indeed be attributed to Napoléon.

  8. A rare variant of the mtDNA HVS1 sequence in the hairs of Napoléon's family

    PubMed Central

    2010-01-01

    This paper describes the finding of a rare variant in the sequence of the hypervariable segment (HVS1) of mitochondrial (mtDNA) extracted from two preserved hairs, authenticated as belonging to the French Emperor Napoléon I (Napoléon Bonaparte). This rare variant is a mutation that changes the base C to T at position 16,184 (16184C→T), and it constitutes the only mutation found in this HVS1 sequence. This mutation is rare, because it was not found in a reference database (P < 0.05). In a personal database (M. Pala) comprising 37,000 different sequences, the 16184C→T mutation was found in only three samples, thus in this database the mutation frequency was 0.00008%. This mutation 16184C→T was also the only variant found subsequently in the HVS1 sequences of mtDNAs extracted from Napoléon's mother (Letizia) and from his youngest sister (Caroline), confirming that this mutation is maternally inherited. This 16184C→T variant could be used for genetic verification to authenticate any doubtful material and determine whether it should indeed be attributed to Napoléon. PMID:21092341

  9. Estimation of a Killer Whale (Orcinus orca) Population’s Diet Using Sequencing Analysis of DNA from Feces

    PubMed Central

    Ford, Michael J.; Hempelmann, Jennifer; Hanson, M. Bradley; Ayres, Katherine L.; Baird, Robin W.; Emmons, Candice K.; Lundin, Jessica I.; Schorr, Gregory S.; Wasser, Samuel K.; Park, Linda K.

    2016-01-01

    Estimating diet composition is important for understanding interactions between predators and prey and thus illuminating ecosystem function. The diet of many species, however, is difficult to observe directly. Genetic analysis of fecal material collected in the field is therefore a useful tool for gaining insight into wild animal diets. In this study, we used high-throughput DNA sequencing to quantitatively estimate the diet composition of an endangered population of wild killer whales (Orcinus orca) in their summer range in the Salish Sea. We combined 175 fecal samples collected between May and September from five years between 2006 and 2011 into 13 sample groups. Two known DNA composition control groups were also created. Each group was sequenced at a ~330bp segment of the 16s gene in the mitochondrial genome using an Illumina MiSeq sequencing system. After several quality controls steps, 4,987,107 individual sequences were aligned to a custom sequence database containing 19 potential fish prey species and the most likely species of each fecal-derived sequence was determined. Based on these alignments, salmonids made up >98.6% of the total sequences and thus of the inferred diet. Of the six salmonid species, Chinook salmon made up 79.5% of the sequences, followed by coho salmon (15%). Over all years, a clear pattern emerged with Chinook salmon dominating the estimated diet early in the summer, and coho salmon contributing an average of >40% of the diet in late summer. Sockeye salmon appeared to be occasionally important, at >18% in some sample groups. Non-salmonids were rarely observed. Our results are consistent with earlier results based on surface prey remains, and confirm the importance of Chinook salmon in this population’s summer diet. PMID:26735849

  10. Estimation of a Killer Whale (Orcinus orca) Population's Diet Using Sequencing Analysis of DNA from Feces.

    PubMed

    Ford, Michael J; Hempelmann, Jennifer; Hanson, M Bradley; Ayres, Katherine L; Baird, Robin W; Emmons, Candice K; Lundin, Jessica I; Schorr, Gregory S; Wasser, Samuel K; Park, Linda K

    2016-01-01

    Estimating diet composition is important for understanding interactions between predators and prey and thus illuminating ecosystem function. The diet of many species, however, is difficult to observe directly. Genetic analysis of fecal material collected in the field is therefore a useful tool for gaining insight into wild animal diets. In this study, we used high-throughput DNA sequencing to quantitatively estimate the diet composition of an endangered population of wild killer whales (Orcinus orca) in their summer range in the Salish Sea. We combined 175 fecal samples collected between May and September from five years between 2006 and 2011 into 13 sample groups. Two known DNA composition control groups were also created. Each group was sequenced at a ~330bp segment of the 16s gene in the mitochondrial genome using an Illumina MiSeq sequencing system. After several quality controls steps, 4,987,107 individual sequences were aligned to a custom sequence database containing 19 potential fish prey species and the most likely species of each fecal-derived sequence was determined. Based on these alignments, salmonids made up >98.6% of the total sequences and thus of the inferred diet. Of the six salmonid species, Chinook salmon made up 79.5% of the sequences, followed by coho salmon (15%). Over all years, a clear pattern emerged with Chinook salmon dominating the estimated diet early in the summer, and coho salmon contributing an average of >40% of the diet in late summer. Sockeye salmon appeared to be occasionally important, at >18% in some sample groups. Non-salmonids were rarely observed. Our results are consistent with earlier results based on surface prey remains, and confirm the importance of Chinook salmon in this population's summer diet.

  11. N-terminal segments modulate the α-helical propensities of the intrinsically disordered basic regions of bZIP proteins.

    PubMed

    Das, Rahul K; Crick, Scott L; Pappu, Rohit V

    2012-02-17

    Basic region leucine zippers (bZIPs) are modular transcription factors that play key roles in eukaryotic gene regulation. The basic regions of bZIPs (bZIP-bRs) are necessary and sufficient for DNA binding and specificity. Bioinformatic predictions and spectroscopic studies suggest that unbound monomeric bZIP-bRs are uniformly disordered as isolated domains. Here, we test this assumption through a comparative characterization of conformational ensembles for 15 different bZIP-bRs using a combination of atomistic simulations and circular dichroism measurements. We find that bZIP-bRs have quantifiable preferences for α-helical conformations in their unbound monomeric forms. This helicity varies from one bZIP-bR to another despite a significant sequence similarity of the DNA binding motifs (DBMs). Our analysis reveals that intramolecular interactions between DBMs and eight-residue segments directly N-terminal to DBMs are the primary modulators of bZIP-bR helicities. We test the accuracy of this inference by designing chimeras of bZIP-bRs to have either increased or decreased overall helicities. Our results yield quantitative insights regarding the relationship between sequence and the degree of intrinsic disorder within bZIP-bRs, and might have general implications for other intrinsically disordered proteins. Understanding how natural sequence variations lead to modulation of disorder is likely to be important for understanding the evolution of specificity in molecular recognition through intrinsically disordered regions (IDRs). Copyright © 2011 Elsevier Ltd. All rights reserved.

  12. On the connection between inherent DNA flexure and preferred binding of hydroxymethyluracil-containing DNA by the type II DNA-binding protein TF1.

    PubMed

    Grove, A; Galeone, A; Mayol, L; Geiduschek, E P

    1996-07-12

    TF1 is a member of the family of type II DNA-binding proteins, which also includes the bacterial HU proteins and the Escherichia coli integration host factor (IHF). Distinctive to TF1, which is encoded by the Bacillus subtilis bacteriophage SPO1, is its preferential binding to DNA in which thymine is replaced by 5-hydroxymethyluracil (hmU), as it is in the phage genome. TF1 binds to preferred sites within the phage genome and generates pronounced DNA bending. The extent to which DNA flexibility contributes to the sequence-specific binding of TF1, and the connection between hmU preference and DNA flexibility has been examined. Model flexible sites, consisting of consecutive mismatches, increase the affinity of thymine-containing DNA for TF1. In particular, tandem mismatches separated by nine base-pairs generate an increase, by orders of magnitude, in the affinity of TF1 for T-containing DNA with the sequence of a preferred TF1 binding site, and fully match the affinity of TF1 for this cognate site in hmU-containing DNA (Kd approximately 3 nM). Other placements of loops generate suboptimal binding. This is consistent with a significant contribution of site-specific DNA flexibility to complex formation. Analysis of complexes with hmU-DNA of decreasing length shows that a major part of the binding affinity is generated within a central 19 bp segment (delta G0 = 41.7 kJ mol-1) with more-distal DNA contributing modestly to the affinity (delta delta G = -0.42 kJ mol-1 bp-1 on increasing duplex length to 37 bp). However, a previously characterised thermostable and more tightly binding mutant TF1, TF1(E15G/T32I), derives most of its extra affinity from interaction with flanking DNA. We propose that inherent but sequence-dependent deformability of hmU-containing DNA underlies the preferential binding of TF1 and that TF1-induced DNA bendings is a result of distortions at two distinct sites separated by 9 bp of duplex DNA.

  13. The cyc1-11 mutation in yeast reverts by recombination with a nonallelic gene: composite genes determining the iso-cytochromes c.

    PubMed Central

    Ernst, J F; Stewart, J W; Sherman, F

    1981-01-01

    DNA sequence analysis of a cloned fragment directly established that the cyc1-11 mutation of iso-1-cytochrome c in the yeast Saccharomyces cerevisiae is a two-base-pair substitution that changes the CCA proline codon at amino acid position 76 to a UAA nonsense codon. Analysis of 11 revertant proteins and one cloned revertant gene showed that reversion of the cyc1-11 mutation can occur in three ways: a single base-pair substitution, which produces a serine replacement at position 76; recombination with the nonallelic CYC7 gene of iso-2-cytochrome c, which causes replacement of a segment in the cyc1-11 gene by the corresponding segment of the CYC7 gene; and either a two-base-pair substitution or recombination with the CYC7 gene, which causes the formation of the normal iso-1-cytochrome c sequence. These results demonstrate the occurrence of low frequencies of recombination between nonallelic genes having extensive but not complete homology. The formation of composite genes that share sequences from nonallelic genes may be an evolutionary mechanism for producing protein diversities and for maintaining identical sequences at different loci. Images PMID:6273865

  14. Horizontal Transfer of Segments of the 16S rRNA Genes between Species of the Streptococcus anginosus Group

    PubMed Central

    Schouls, Leo M.; Schot, Corrie S.; Jacobs, Jan A.

    2003-01-01

    The nature in variation of the 16S rRNA gene of members of the Streptococcus anginosus group was investigated by hybridization and DNA sequencing. A collection of 708 strains was analyzed by reverse line blot hybridization. This revealed the presence of distinct reaction patterns representing 11 different hybridization groups. The 16S rRNA genes of two strains of each hybridization group were sequenced to near-completion, and the sequence data confirmed the reverse line blot hybridization results. Closer inspection of the sequences revealed mosaic-like structures, strongly suggesting horizontal transfer of segments of the 16S rRNA gene between different species belonging to the Streptococcus anginosus group. Southern blot hybridization further showed that within a single strain all copies of the 16S rRNA gene had the same composition, indicating that the apparent mosaic structures were not PCR-induced artifacts. These findings indicate that the highly conserved rRNA genes are also subject to recombination and that these events may be fixed in the population. Such recombination may lead to the construction of incorrect phylogenetic trees based on the 16S rRNA genes. PMID:14645285

  15. The Mitochondrial Genome and a 60-kb Nuclear DNA Segment from Naegleria fowleri, the Causative Agent of Primary Amoebic Meningoencephalitis

    PubMed Central

    Herman, Emily K.; Greninger, Alexander L.; Visvesvara, Govinda S.; Marciano-Cabral, Francine; Dacks, Joel B.; Chiu, Charles Y.

    2013-01-01

    Naegleria fowleri is a unicellular eukaryote causing primary amoebic meningoencephalitis, a neuropathic disease killing 99% of those infected, usually within 7–14 days. N. fowleri is found globally in regions including the US and Australia. The genome of the related non-pathogenic species Naegleria gruberi has been sequenced, but the genetic basis for N. fowleri pathogenicity is unclear. To generate such insight, we sequenced and assembled the mitochondrial genome and a 60-kb segment of nuclear genome from N. fowleri. The mitochondrial genome is highly similar to its counterpart in N. gruberi in gene complement and organization, while distinct lack of synteny is observed for the nuclear segments. Even in this short (60-kb) segment, we identified examples of potential factors for pathogenesis, including ten novel N. fowleri-specific genes. We also identified a homologue of cathepsin B; proteases proposed to be involved in the pathogenesis of diverse eukaryotic pathogens, including N. fowleri. Finally, we demonstrate a likely case of horizontal gene transfer between N. fowleri and two unrelated amoebae, one of which causes granulomatous amoebic encephalitis. This initial look into the N. fowleri nuclear genome has revealed several examples of potential pathogenesis factors, improving our understanding of a neglected pathogen of increasing global importance. PMID:23360210

  16. Amplification of Mitochondrial DNA for detection of Plasmodiumvivax in Balochistan.

    PubMed

    Shahwani, Muhammad Naeem; Nisar, Samia; Aleem, Abdul; Panezai, Marina; Afridi, Sarwat; Malik, Shaukat Iqbal

    2017-05-01

    To access a new step using PCR to amplify the targeted mtDNA sequence for detecting specifically Plasmodium vivax and its co-infections, false positive and false negative results with Plasmodium falciparum. In this study we have standardized a new technical approach in which the target mitochondrial DNA sequence (mtDNA) was amplified by using a PCR technique as a tool to detect Plasmodium spp. Species specific primers were designed to hybridize with cytochrome c oxidase gene of P. vivax (cox I) and P. falciparum (cox III). Two hundred blood samples were collected on the basis of clinical symptoms which were initially examined through microscopic analysis after preparing Giemsa stained thick and thin blood smears. Afterwards genomic DNA was extracted from all samples and was then subjected to PCR amplification by using species specific primers and amplified segments were sequenced for confirmation of results. One-hundred and thirty-two blood samples were detected as positive for malaria by PCR, out of which 64 were found to be positive by PCR and 53 by both microscopy and PCR for P.vivax infection. Nine samples were found to be false negative, one P.vivax mono infection was declared as co infection by PCR and 3 samples identified as having P.falciparum gametes were confirmed as P.vivax by PCR amplification. Sensitivity and specificity were found to be 85% and 92% respectively. Results obtained through PCR method were comparatively better and reliable than microscopy.

  17. Mechanistically Distinct Pathways of Divergent Regulatory DNA Creation Contribute to Evolution of Human-Specific Genomic Regulatory Networks Driving Phenotypic Divergence of Homo sapiens.

    PubMed

    Glinsky, Gennadi V

    2016-09-19

    Thousands of candidate human-specific regulatory sequences (HSRS) have been identified, supporting the hypothesis that unique to human phenotypes result from human-specific alterations of genomic regulatory networks. Collectively, a compendium of multiple diverse families of HSRS that are functionally and structurally divergent from Great Apes could be defined as the backbone of human-specific genomic regulatory networks. Here, the conservation patterns analysis of 18,364 candidate HSRS was carried out requiring that 100% of bases must remap during the alignments of human, chimpanzee, and bonobo sequences. A total of 5,535 candidate HSRS were identified that are: (i) highly conserved in Great Apes; (ii) evolved by the exaptation of highly conserved ancestral DNA; (iii) defined by either the acceleration of mutation rates on the human lineage or the functional divergence from non-human primates. The exaptation of highly conserved ancestral DNA pathway seems mechanistically distinct from the evolution of regulatory DNA segments driven by the species-specific expansion of transposable elements. Genome-wide proximity placement analysis of HSRS revealed that a small fraction of topologically associating domains (TADs) contain more than half of HSRS from four distinct families. TADs that are enriched for HSRS and termed rapidly evolving in humans TADs (revTADs) comprise 0.8-10.3% of 3,127 TADs in the hESC genome. RevTADs manifest distinct correlation patterns between placements of human accelerated regions, human-specific transcription factor-binding sites, and recombination rates. There is a significant enrichment within revTAD boundaries of hESC-enhancers, primate-specific CTCF-binding sites, human-specific RNAPII-binding sites, hCONDELs, and H3K4me3 peaks with human-specific enrichment at TSS in prefrontal cortex neurons (P < 0.0001 in all instances). Present analysis supports the idea that phenotypic divergence of Homo sapiens is driven by the evolution of human-specific genomic regulatory networks via at least two mechanistically distinct pathways of creation of divergent sequences of regulatory DNA: (i) recombination-associated exaptation of the highly conserved ancestral regulatory DNA segments; (ii) human-specific insertions of transposable elements. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  18. Thermodynamics-Based Models of Transcriptional Regulation by Enhancers: The Roles of Synergistic Activation, Cooperative Binding and Short-Range Repression

    PubMed Central

    He, Xin; Samee, Md. Abul Hassan; Blatti, Charles; Sinha, Saurabh

    2010-01-01

    Quantitative models of cis-regulatory activity have the potential to improve our mechanistic understanding of transcriptional regulation. However, the few models available today have been based on simplistic assumptions about the sequences being modeled, or heuristic approximations of the underlying regulatory mechanisms. We have developed a thermodynamics-based model to predict gene expression driven by any DNA sequence, as a function of transcription factor concentrations and their DNA-binding specificities. It uses statistical thermodynamics theory to model not only protein-DNA interaction, but also the effect of DNA-bound activators and repressors on gene expression. In addition, the model incorporates mechanistic features such as synergistic effect of multiple activators, short range repression, and cooperativity in transcription factor-DNA binding, allowing us to systematically evaluate the significance of these features in the context of available expression data. Using this model on segmentation-related enhancers in Drosophila, we find that transcriptional synergy due to simultaneous action of multiple activators helps explain the data beyond what can be explained by cooperative DNA-binding alone. We find clear support for the phenomenon of short-range repression, where repressors do not directly interact with the basal transcriptional machinery. We also find that the binding sites contributing to an enhancer's function may not be conserved during evolution, and a noticeable fraction of these undergo lineage-specific changes. Our implementation of the model, called GEMSTAT, is the first publicly available program for simultaneously modeling the regulatory activities of a given set of sequences. PMID:20862354

  19. DNA Barcodes of Asian Houbara Bustard (Chlamydotis undulata macqueenii)

    PubMed Central

    Arif, Ibrahim A.; Khan, Haseeb A.; Williams, Joseph B.; Shobrak, Mohammad; Arif, Waad I.

    2012-01-01

    Populations of Houbara Bustards have dramatically declined in recent years. Captive breeding and reintroduction programs have had limited success in reviving population numbers and thus new technological solutions involving molecular methods are essential for the long term survival of this species. In this study, we sequenced the 694 bp segment of COI gene of the four specimens of Asian Houbara Bustard (Chlamydotis undulata macqueenii). We also compared these sequences with earlier published barcodes of 11 individuals comprising different families of the orders Gruiformes, Ciconiiformes, Podicipediformes and Crocodylia (out group). The pair-wise sequence comparison showed a total of 254 variable sites across all the 15 sequences from different taxa. Three of the four specimens of Houbara Bustard had an identical sequence of COI gene and one individual showed a single nucleotide difference (G > A transition at position 83). Within the bustard family (Otididae), comparison among the three species (Asian Houbara Bustard, Great Bustard (Otis tarda) and the Little Bustard (Tetrax tetrax)), representing three different genera, showed 116 variable sites. For another family (Rallidae), the intra-family variable sites among the individuals of four different genera were found to be 146. The COI genetic distances among the 15 individuals varied from 0.000 to 0.431. Phylogenetic analysis using 619 bp nucleotide segment of COI clearly discriminated all the species representing different genera, families and orders. All the four specimens of Houbara Bustard formed a single clade and are clearly separated from other two individuals of the same family (Otis tarda and Tetrax tetrax). The nucleotide sequence of partial segment of COI gene effectively discriminated the closely related species. This is the first study reporting the barcodes of Houbara Bustard and would be helpful in future molecular studies, particularly for the conservation of this threatened bird in Saudi Arabia. PMID:22408462

  20. Gene transfer and gene mapping in mammalian cells in culture.

    PubMed

    Shows, T B; Sakaguchi, A Y

    1980-01-01

    The ability to transfer mammalian genes parasexually has opened new possibilities for gene mapping and fine structure mapping and offers great potential for contributing to several aspects of mammalian biology, including gene expression and genetic engineering. The DNA transferred has ranged from whole genomes to single genes and smaller segments of DNA. The transfer of whole genomes by cell fusion forms cell hybrids, which has promoted the extensive mapping of human and mouse genes. Transfer, by cell fusion, of rearranged chromosomes has contributed significantly to determining close linkage and the assignment of genes to specific chromosomal regions. Transfer of single chromosomes has been achieved utilizing microcells fused to recipient cells. Metaphase chromosomes have been isolated and used to transfer single-to-multigenic DNA segments. DNA-mediated gene transfer, simulating bacterial transformation, has achieved transfer of single-copy genes. By utilizing DNA cleaved with restriction endonucleases, gene transfer is being empolyed as a bioassay for the purification of genes. Gene mapping and the fate of transferred genes can be examined now at the molecular level using sequence-specific probles. Recently, single genes have been cloned into eucaryotic and procaryotic vectors for transfer into mammalian cells. Moreover, recombinant libraries in which entire mammalian genomes are represented collectively are a rich new source of transferable genes. Methodology for transferring mammalian genetic information and applications for mapping mammalian genes is presented and prospects for the future discussed.

  1. Negative Feedback Regulation of HIV-1 by Gene Editing Strategy.

    PubMed

    Kaminski, Rafal; Chen, Yilan; Salkind, Julian; Bella, Ramona; Young, Won-Bin; Ferrante, Pasquale; Karn, Jonathan; Malcolm, Thomas; Hu, Wenhui; Khalili, Kamel

    2016-08-16

    The CRISPR/Cas9 gene editing method is comprised of the guide RNA (gRNA) to target a specific DNA sequence for cleavage and the Cas9 endonuclease for introducing breaks in the double-stranded DNA identified by the gRNA. Co-expression of both a multiplex of HIV-1-specific gRNAs and Cas9 in cells results in the modification and/or excision of the segment of viral DNA, leading to replication-defective virus. In this study, we have personalized the activity of CRISPR/Cas9 by placing the gene encoding Cas9 under the control of a minimal promoter of HIV-1 that is activated by the HIV-1 Tat protein. We demonstrate that functional activation of CRISPR/Cas9 by Tat during the course of viral infection excises the designated segment of the integrated viral DNA and consequently suppresses viral expression. This strategy was also used in a latently infected CD4+ T-cell model after treatment with a variety of HIV-1 stimulating agents including PMA and TSA. Controlled expression of Cas9 by Tat offers a new strategy for safe implementation of the Cas9 technology for ablation of HIV-1 at a very early stage of HIV-1 replication during the course of the acute phase of infection and the reactivation of silent proviral DNA in latently infected cells.

  2. Chimeras of human complement C9 reveal the site recognized by complement regulatory protein CD59.

    PubMed

    Hüsler, T; Lockert, D H; Kaufman, K M; Sodetz, J M; Sims, P J

    1995-02-24

    CD59 antigen is a membrane glycoprotein that inhibits the activity of the C9 component of the C5b-9 membrane attack complex, thereby protecting human cells from lysis by human complement. The complement-inhibitory activity of CD59 is species-selective and is most effective toward C9 derived from human or other primate plasma. By contrast, rabbit C9, which can substitute for human C9 in the membrane attack complex, mediates unrestricted lysis of human cells. To identify the peptide segment of human C9 that is recognized by CD59, rabbit C9 cDNA clones were isolated, characterized, and used to construct hybrid cDNAs for expression of full-length human/rabbit C9 chimeras in COS-7 cells. All resulting chimeras were hemolytically active, when tested against chicken erythrocytes bearing C5b-8 complexes. Assays performed in the presence or absence of CD59 revealed that this inhibitor reduced the hemolytic activity of those chimeras containing human C9 sequence between residues 334-415, irrespective of whether the remainder of the protein contained human or rabbit sequence. By contrast, when this segment of C9 contained rabbit sequence, lytic activity was unaffected by CD59. These data establish that human C9 residues 334-415 contain the site recognized by CD59, and they suggest that sequence variability within this segment of C9 is responsible for the observed species-selective inhibitory activity of CD59.

  3. A free VP3 C-terminus is essential for the replication of infectious bursal disease virus.

    PubMed

    Mosley, Yung-Yi C; Wu, Ching Ching; Lin, Tsang Long

    2017-03-15

    Green fluorescent protein (GFP) has been successfully incorporated into the viral-like particles of infectious bursal disease virus (IBDV) with a linker at the C-terminus of VP3 in a baculovirus system. However, when the same locus in segment A was used to express GFP by a reverse genetic (RG) system, no viable GFP-expressing IBDV was recovered. To elucidate the underlying mechanism, cDNA construct of segment A with only the linker sequence (9 amino acids) was applied to generate RG IBDV virus (rIBDV). Similarly, no rIBDV was recovered. Moreover, when the incubation after transfection was extended, wildtype rIBDV without the linker was recovered suggesting a free C-terminus of VP3 might be necessary for IBDV replication. On the other hand, rIBDV could be recovered when additional sequence (up to 40 nucleotides) were inserted at the 3' noncoding region (NCR) adjacent to the stop codon of VP3, suggesting that the burden of the linker sequence was not in the stretched genome size but the disruption of the VP3 function. Finally, when the stop codon of VP3 was deleted in segment A to extend the translation into the 3' NCR without introducing additional genomic sequence, no rIBDV was recovered. Our data suggest that a free VP3 C-terminus is essential for IBDV replication. Copyright © 2017 Elsevier B.V. All rights reserved.

  4. Use of Lambda Phage DNA as a Hybrid Internal Control in a PCR-Enzyme Immunoassay To Detect Chlamydia pneumoniae

    PubMed Central

    Pham, Dien G.; Madico, Guillermo E.; Quinn, Thomas C.; Enzler, Mark J.; Smith, Thomas F.; Gaydos, Charlotte A.

    1998-01-01

    An inherent problem in the diagnostic PCR assay is the presence of ill-defined inhibitors of amplification which may cause false-negative results. Addition of an amplifiable fragment of foreign DNA in the PCR to serve as a hybrid internal control (HIC) would allow for a simple way to identify specimens containing inhibitors. Two oligonucleotide hybrid primers were synthesized to contain nucleic acid sequences of the Chlamydia pneumoniae 16S rRNA primers in a position flanking two primers that target the sequences of a 650-bp lambda phage DNA segment. By using the hybrid primers, hybrid DNA comprising a large sequence of lambda phage DNA flanked by short pieces of chlamydia DNA was subsequently generated by PCR, cloned into a plasmid vector, and purified. Plasmids containing the hybrid DNA were diluted and used as a HIC by adding them to each C. pneumoniae PCR test. Consequently, C. pneumoniae primers were able to amplify both chlamydia DNA and the HIC DNA. The production of a 689-bp HIC DNA band on an acrylamide gel indicated that the specimen contained no inhibitors and that internal conditions were compatible with PCR. Subsequently, a biotinylated RNA probe for the HIC was transcribed from a nested sequence of the HIC and was used for its hybridization. Detection of the HIC DNA-RNA hybrid was achieved by enzyme immunoassay (EIA). This PCR-EIA system with a HIC was initially tested with 12 previously PCR-positive and 14 previously PCR-negative specimens. Of the 12 PCR-positive specimens, 11 were reconfirmed as positive; 1 had a negative HIC value, indicating inhibition. Of the 14 previously PCR-negative specimens, 13 were confirmed as true negative; 1 had a negative HIC value, indicating inhibition. The assay was then used with 237 nasopharyngeal specimens from patients with pneumonia. Twenty-one of 237 (8.9%) were positive for C. pneumoniae, and 42 (17.7%) were found to inhibit the PCR. Specimens showing inhibitory activity were diluted 1:10 and were retested. Ten specimens were still inhibitory to the PCR and required further DNA purification. No additional positive samples were detected and 3 nasopharyngeal specimens remained inhibitory to PCR. Coamplification of a HIC DNA can help confirm true-negative PCR results by ruling out the presence of inhibitors of DNA amplification. PMID:9650936

  5. The genome of Eimeria spp., with special reference to Eimeria tenella--a coccidium from the chicken.

    PubMed

    Shirley, M W

    2000-04-10

    Eimeria spp. contain at least four genomes. The nuclear genome is best studied in the avian species Eimeria tenella and comprises about 60 Mbp DNA contained within ca. 14 chromosomes; other avian and lupine species appear to possess a nuclear genome of similar size. In addition, sequence data and hybridisation studies have provided direct evidence for extrachromosomal mitochondrial and plastid DNA genomes, and double-stranded RNA segments have also been described. The unique phenotype of "precocious" development that characterises some selected lines of Eimeria spp. not only provides the basis for the first generation of live attenuated vaccines, but offers a significant entrée into studies on the regulation of an apicomplexan life-cycle. With a view to identifying loci implicated in the trait of precocious development, a genetic linkage map of the genome of E. tenella is being constructed in this laboratory from analyses of the inheritance of over 400 polymorphic DNA markers in the progeny of a cross between complementary drug-resistant and precocious parents. Other projects that impinge directly or indirectly on the genome and/or genetics of Eimeria spp. are currently in progress in several laboratories, and include the derivation of expressed sequence tag data and the development of ancillary technologies such as transfection techniques. No large-scale genomic DNA sequencing projects have been reported.

  6. Direct mapping of symbolic DNA sequence into frequency domain in global repeat map algorithm

    PubMed Central

    Glunčić, Matko; Paar, Vladimir

    2013-01-01

    The main feature of global repeat map (GRM) algorithm (www.hazu.hr/grm/software/win/grm2012.exe) is its ability to identify a broad variety of repeats of unbounded length that can be arbitrarily distant in sequences as large as human chromosomes. The efficacy is due to the use of complete set of a K-string ensemble which enables a new method of direct mapping of symbolic DNA sequence into frequency domain, with straightforward identification of repeats as peaks in GRM diagram. In this way, we obtain very fast, efficient and highly automatized repeat finding tool. The method is robust to substitutions and insertions/deletions, as well as to various complexities of the sequence pattern. We present several case studies of GRM use, in order to illustrate its capabilities: identification of α-satellite tandem repeats and higher order repeats (HORs), identification of Alu dispersed repeats and of Alu tandems, identification of Period 3 pattern in exons, implementation of ‘magnifying glass’ effect, identification of complex HOR pattern, identification of inter-tandem transitional dispersed repeat sequences and identification of long segmental duplications. GRM algorithm is convenient for use, in particular, in cases of large repeat units, of highly mutated and/or complex repeats, and of global repeat maps for large genomic sequences (chromosomes and genomes). PMID:22977183

  7. Absence of ancient DNA in sub-fossil insect inclusions preserved in 'Anthropocene' Colombian copal.

    PubMed

    Penney, David; Wadsworth, Caroline; Fox, Graeme; Kennedy, Sandra L; Preziosi, Richard F; Brown, Terence A

    2013-01-01

    Insects preserved in copal, the sub-fossilized resin precursor of amber, have potential value in molecular ecological studies of recently-extinct species and of extant species that have never been collected as living specimens. The objective of the work reported in this paper was therefore to determine if ancient DNA is present in insects preserved in copal. We prepared DNA libraries from two stingless bees (Apidae: Meliponini: Trigonisca ameliae) preserved in 'Anthropocene' Colombian copal, dated to 'post-Bomb' and 10,612±62 cal yr BP, respectively, and obtained sequence reads using the GS Junior 454 System. Read numbers were low, but were significantly higher for DNA extracts prepared from crushed insects compared with extracts obtained by a non-destructive method. The younger specimen yielded sequence reads up to 535 nucleotides in length, but searches of these sequences against the nucleotide database revealed very few significant matches. None of these hits was to stingless bees though one read of 97 nucleotides aligned with two non-contiguous segments of the mitochondrial cytochrome oxidase subunit I gene of the East Asia bumblebee Bombus hypocrita. The most significant hit was for 452 nucleotides of a 470-nucleotide read that aligned with part of the genome of the root-nodulating bacterium Bradyrhizobium japonicum. The other significant hits were to proteobacteria and an actinomycete. Searches directed specifically at Apidae nucleotide sequences only gave short and insignificant alignments. All of the reads from the older specimen appeared to be artefacts. We were therefore unable to obtain any convincing evidence for the preservation of ancient DNA in either of the two copal inclusions that we studied, and conclude that DNA is not preserved in this type of material. Our results raise further doubts about claims of DNA extraction from fossil insects in amber, many millions of years older than copal.

  8. Absence of Ancient DNA in Sub-Fossil Insect Inclusions Preserved in ‘Anthropocene’ Colombian Copal

    PubMed Central

    Penney, David; Wadsworth, Caroline; Fox, Graeme; Kennedy, Sandra L.; Preziosi, Richard F.; Brown, Terence A.

    2013-01-01

    Insects preserved in copal, the sub-fossilized resin precursor of amber, have potential value in molecular ecological studies of recently-extinct species and of extant species that have never been collected as living specimens. The objective of the work reported in this paper was therefore to determine if ancient DNA is present in insects preserved in copal. We prepared DNA libraries from two stingless bees (Apidae: Meliponini: Trigonisca ameliae) preserved in ‘Anthropocene’ Colombian copal, dated to ‘post-Bomb’ and 10,612±62 cal yr BP, respectively, and obtained sequence reads using the GS Junior 454 System. Read numbers were low, but were significantly higher for DNA extracts prepared from crushed insects compared with extracts obtained by a non-destructive method. The younger specimen yielded sequence reads up to 535 nucleotides in length, but searches of these sequences against the nucleotide database revealed very few significant matches. None of these hits was to stingless bees though one read of 97 nucleotides aligned with two non-contiguous segments of the mitochondrial cytochrome oxidase subunit I gene of the East Asia bumblebee Bombus hypocrita. The most significant hit was for 452 nucleotides of a 470-nucleotide read that aligned with part of the genome of the root-nodulating bacterium Bradyrhizobium japonicum. The other significant hits were to proteobacteria and an actinomycete. Searches directed specifically at Apidae nucleotide sequences only gave short and insignificant alignments. All of the reads from the older specimen appeared to be artefacts. We were therefore unable to obtain any convincing evidence for the preservation of ancient DNA in either of the two copal inclusions that we studied, and conclude that DNA is not preserved in this type of material. Our results raise further doubts about claims of DNA extraction from fossil insects in amber, many millions of years older than copal. PMID:24039876

  9. Context based computational analysis and characterization of ARS consensus sequences (ACS) of Saccharomyces cerevisiae genome.

    PubMed

    Singh, Vinod Kumar; Krishnamachari, Annangarachari

    2016-09-01

    Genome-wide experimental studies in Saccharomyces cerevisiae reveal that autonomous replicating sequence (ARS) requires an essential consensus sequence (ACS) for replication activity. Computational studies identified thousands of ACS like patterns in the genome. However, only a few hundreds of these sites act as replicating sites and the rest are considered as dormant or evolving sites. In a bid to understand the sequence makeup of replication sites, a content and context-based analysis was performed on a set of replicating ACS sequences that binds to origin-recognition complex (ORC) denoted as ORC-ACS and non-replicating ACS sequences (nrACS), that are not bound by ORC. In this study, DNA properties such as base composition, correlation, sequence dependent thermodynamic and DNA structural profiles, and their positions have been considered for characterizing ORC-ACS and nrACS. Analysis reveals that ORC-ACS depict marked differences in nucleotide composition and context features in its vicinity compared to nrACS. Interestingly, an A-rich motif was also discovered in ORC-ACS sequences within its nucleosome-free region. Profound changes in the conformational features, such as DNA helical twist, inclination angle and stacking energy between ORC-ACS and nrACS were observed. Distribution of ACS motifs in the non-coding segments points to the locations of ORC-ACS which are found far away from the adjacent gene start position compared to nrACS thereby enabling an accessible environment for ORC-proteins. Our attempt is novel in considering the contextual view of ACS and its flanking region along with nucleosome positioning in the S. cerevisiae genome and may be useful for any computational prediction scheme.

  10. First report of the Phe1534Cys kdr mutation in natural populations of Aedes albopictus from Brazil.

    PubMed

    Aguirre-Obando, Oscar Alexander; Martins, Ademir Jesus; Navarro-Silva, Mário Antônio

    2017-03-27

    Knockdown resistance (kdr), caused by alterations in the voltage-gated sodium channel (Na V ), is one of the mechanisms responsible for pyrethroid (PY) resistance. In the Asian tiger mosquito, Aedes albopictus, at least four different mutations were described in the IIIS6 Na V segment in populations from Asia, North America and Europe. In contrast, in Aedes aegypti at least 12 non-synonymous mutations have been reported at nine different codons, mostly in the IIS6 and IIIS6 Na V segments. The Phe1534Cys kdr mutation in the IIIS6 Na V segment is the most prevalent in populations of Ae. aegypti worldwide, also found in Ae. albopictus from Singapore. Herein, we investigated the DNA diversity corresponding to the IIS6 and IIIS6 Na V segments in natural populations of Ae. albopictus from Brazil. DNA from eight Brazilian Ae. albopictus natural populations were individually extracted and pooled by states of origin, amplified, cloned and sequenced for the corresponding IIS6 and IIIS6 Na V segments. Additionally, samples from each location were individually genotyped by an allelic specific PCR (AS-PCR) approach to obtain the genotypic and allelic frequencies for the 1534 Na V site. No non-synonymous substitutions were observed in the IIS6 sequences. However, the Phe1534Cys kdr mutation was evidenced in the Ae. albopictus Na V IIIS6 segment sequences from Paraná (PR) and Rondônia (RO) states, but not from Mato Grosso (MT) state. The 1534Cys kdr allele varied from 3% (Marilena/PR and Porto Velho/RO) to 10% (Foz do Iguaçu/PR). To our knowledge, this paper reports the first occurrence and provides distribution data of a possible kdr mutation in Ae. albopictus in South America. The emergence of a likely kdr mutation in Ae. albopitus natural populations is a signal of alert for vector control measures since PY are the most popular insecticides adopted by residents. Additionally, once the kdr allele is present, its frequency tends to increase faster under exposition to those compounds. Although the Asian tiger mosquito is not incriminated as an important vector of dengue, chikungunya and Zika viruses in South America, its importance in this regard has been extensively discussed since Ae. albopictus is rapidly spreading and can also migrate between sylvatic and urban environments. Therefore, insecticide resistance monitoring initiatives should also be extended to Ae. albopictus in Brazil in order to maintain chemical compounds as an efficient vector control tool when needed.

  11. DNA Barcode Sequence Identification Incorporating Taxonomic Hierarchy and within Taxon Variability

    PubMed Central

    Little, Damon P.

    2011-01-01

    For DNA barcoding to succeed as a scientific endeavor an accurate and expeditious query sequence identification method is needed. Although a global multiple–sequence alignment can be generated for some barcoding markers (e.g. COI, rbcL), not all barcoding markers are as structurally conserved (e.g. matK). Thus, algorithms that depend on global multiple–sequence alignments are not universally applicable. Some sequence identification methods that use local pairwise alignments (e.g. BLAST) are unable to accurately differentiate between highly similar sequences and are not designed to cope with hierarchic phylogenetic relationships or within taxon variability. Here, I present a novel alignment–free sequence identification algorithm–BRONX–that accounts for observed within taxon variability and hierarchic relationships among taxa. BRONX identifies short variable segments and corresponding invariant flanking regions in reference sequences. These flanking regions are used to score variable regions in the query sequence without the production of a global multiple–sequence alignment. By incorporating observed within taxon variability into the scoring procedure, misidentifications arising from shared alleles/haplotypes are minimized. An explicit treatment of more inclusive terminals allows for separate identifications to be made for each taxonomic level and/or for user–defined terminals. BRONX performs better than all other methods when there is imperfect overlap between query and reference sequences (e.g. mini–barcode queries against a full–length barcode database). BRONX consistently produced better identifications at the genus–level for all query types. PMID:21857897

  12. Site-Specific Integration of Foreign DNA into Minimal Bacterial and Human Target Sequences Mediated by a Conjugative Relaxase

    PubMed Central

    Agúndez, Leticia; González-Prieto, Coral; Machón, Cristina; Llosa, Matxalen

    2012-01-01

    Background Bacterial conjugation is a mechanism for horizontal DNA transfer between bacteria which requires cell to cell contact, usually mediated by self-transmissible plasmids. A protein known as relaxase is responsible for the processing of DNA during bacterial conjugation. TrwC, the relaxase of conjugative plasmid R388, is also able to catalyze site-specific integration of the transferred DNA into a copy of its target, the origin of transfer (oriT), present in a recipient plasmid. This reaction confers TrwC a high biotechnological potential as a tool for genomic engineering. Methodology/Principal Findings We have characterized this reaction by conjugal mobilization of a suicide plasmid to a recipient cell with an oriT-containing plasmid, selecting for the cointegrates. Proteins TrwA and IHF enhanced integration frequency. TrwC could also catalyze integration when it is expressed from the recipient cell. Both Y18 and Y26 catalytic tyrosil residues were essential to perform the reaction, while TrwC DNA helicase activity was dispensable. The target DNA could be reduced to 17 bp encompassing TrwC nicking and binding sites. Two human genomic sequences resembling the 17 bp segment were accepted as targets for TrwC-mediated site-specific integration. TrwC could also integrate the incoming DNA molecule into an oriT copy present in the recipient chromosome. Conclusions/Significance The results support a model for TrwC-mediated site-specific integration. This reaction may allow R388 to integrate into the genome of non-permissive hosts upon conjugative transfer. Also, the ability to act on target sequences present in the human genome underscores the biotechnological potential of conjugative relaxase TrwC as a site-specific integrase for genomic modification of human cells. PMID:22292089

  13. High-sensitive electrochemical detection of point mutation based on polymerization-induced enzymatic amplification.

    PubMed

    Feng, Kejun; Zhao, Jingjin; Wu, Zai-Sheng; Jiang, Jianhui; Shen, Guoli; Yu, Ruqin

    2011-03-15

    Here a highly sensitive electrochemical method is described for the detection of point mutation in DNA. Polymerization extension reaction is applied to specifically initiate enzymatic electrochemical amplification to improve the sensitivity and enhance the performance of point mutation detection. In this work, 5'-thiolated DNA probe sequences complementary to the wild target DNA are assembled on the gold electrode. In the presence of wild target DNA, the probe is extended by DNA polymerase over the free segment of target as the template. After washing with NaOH solution, the target DNA is removed while the elongated probe sequence remains on the sensing surface. Via hybridizing to the designed biotin-labeled detection probe, the extended sequence is capable of capturing detection probe. After introducing streptavidin-conjugated alkaline phosphatase (SA-ALP), the specific binding between streptavidin and biotin mediates a catalytic reaction of ascorbic acid 2-phosphate (AA-P) substrate to produce a reducing agent ascorbic acid (AA). Then the silver ions in solution are reduced by AA, leading to the deposition of silver metal onto the electrode surface. The amount of deposited silver which is determined by the amount of wild target can be quantified by the linear sweep voltammetry (LSV). The present approach proved to be capable of detecting the wild target DNA down to a detection limit of 1.0×10(-14) M in a wide target concentration range and identifying -28 site (A to G) of the β-thalassemia gene, demonstrating that this scheme offers a highly sensitive and specific approach for point mutation detection. Copyright © 2010 Elsevier B.V. All rights reserved.

  14. African-American mitochondrial DNAs often match mtDNAs found in multiple African ethnic groups

    PubMed Central

    Ely, Bert; Wilson, Jamie Lee; Jackson, Fatimah; Jackson, Bruce A

    2006-01-01

    Background Mitochondrial DNA (mtDNA) haplotypes have become popular tools for tracing maternal ancestry, and several companies offer this service to the general public. Numerous studies have demonstrated that human mtDNA haplotypes can be used with confidence to identify the continent where the haplotype originated. Ideally, mtDNA haplotypes could also be used to identify a particular country or ethnic group from which the maternal ancestor emanated. However, the geographic distribution of mtDNA haplotypes is greatly influenced by the movement of both individuals and population groups. Consequently, common mtDNA haplotypes are shared among multiple ethnic groups. We have studied the distribution of mtDNA haplotypes among West African ethnic groups to determine how often mtDNA haplotypes can be used to reconnect Americans of African descent to a country or ethnic group of a maternal African ancestor. The nucleotide sequence of the mtDNA hypervariable segment I (HVS-I) usually provides sufficient information to assign a particular mtDNA to the proper haplogroup, and it contains most of the variation that is available to distinguish a particular mtDNA haplotype from closely related haplotypes. In this study, samples of general African-American and specific Gullah/Geechee HVS-I haplotypes were compared with two databases of HVS-I haplotypes from sub-Saharan Africa, and the incidence of perfect matches recorded for each sample. Results When two independent African-American samples were analyzed, more than half of the sampled HVS-I mtDNA haplotypes exactly matched common haplotypes that were shared among multiple African ethnic groups. Another 40% did not match any sequence in the database, and fewer than 10% were an exact match to a sequence from a single African ethnic group. Differences in the regional distribution of haplotypes were observed in the African database, and the African-American haplotypes were more likely to match haplotypes found in ethnic groups from West or West Central Africa than those found in eastern or southern Africa. Fewer than 14% of the African-American mtDNA sequences matched sequences from only West Africa or only West Central Africa. Conclusion Our database of sub-Saharan mtDNA sequences includes the most common haplotypes that are shared among ethnic groups from multiple regions of Africa. These common haplotypes have been found in half of all sub-Saharan Africans. More than 60% of the remaining haplotypes differ from the common haplotypes at a single nucleotide position in the HVS-I region, and they are likely to occur at varying frequencies within sub-Saharan Africa. However, the finding that 40% of the African-American mtDNAs analyzed had no match in the database indicates that only a small fraction of the total number of African haplotypes has been identified. In addition, the finding that fewer than 10% of African-American mtDNAs matched mtDNA sequences from a single African region suggests that few African Americans might be able to trace their mtDNA lineages to a particular region of Africa, and even fewer will be able to trace their mtDNA to a single ethnic group. However, no firm conclusions should be made until a much larger database is available. It is clear, however, that when identical mtDNA haplotypes are shared among many ethnic groups from different parts of Africa, it is impossible to determine which single ethnic group was the source of a particular maternal ancestor based on the mtDNA sequence. PMID:17038170

  15. Variola Type IB DNA Topoisomerase: DNA Binding and Supercoil Unwinding Using Engineered DNA Minicircles

    PubMed Central

    2015-01-01

    Type IB topoisomerases unwind positive and negative DNA supercoils and play a key role in removing supercoils that would otherwise accumulate at replication and transcription forks. An interesting question is whether topoisomerase activity is regulated by the topological state of the DNA, thereby providing a mechanism for targeting the enzyme to highly supercoiled DNA domains in genomes. The type IB enzyme from variola virus (vTopo) has proven to be useful in addressing mechanistic questions about topoisomerase function because it forms a reversible 3′-phosphotyrosyl adduct with the DNA backbone at a specific target sequence (5′-CCCTT-3′) from which DNA unwinding can proceed. We have synthesized supercoiled DNA minicircles (MCs) containing a single vTopo target site that provides highly defined substrates for exploring the effects of supercoil density on DNA binding, strand cleavage and ligation, and unwinding. We observed no topological dependence for binding of vTopo to these supercoiled MC DNAs, indicating that affinity-based targeting to supercoiled DNA regions by vTopo is unlikely. Similarly, the cleavage and religation rates of the MCs were not topologically dependent, but topoisomers with low superhelical densities were found to unwind more slowly than highly supercoiled topoisomers, suggesting that reduced torque at low superhelical densities leads to an increased number of cycles of cleavage and ligation before a successful unwinding event. The K271E charge reversal mutant has an impaired interaction with the rotating DNA segment that leads to an increase in the number of supercoils that were unwound per cleavage event. This result provides evidence that interactions of the enzyme with the rotating DNA segment can restrict the number of supercoils that are unwound. We infer that both superhelical density and transient contacts between vTopo and the rotating DNA determine the efficiency of supercoil unwinding. Such determinants are likely to be important in regulating the steady-state superhelical density of DNA domains in the cell. PMID:24945825

  16. Mechanism for DNA transposons to generate introns on genomic scales

    PubMed Central

    Huff, Jason T.; Zilberman, Daniel; Roy, Scott W.

    2017-01-01

    Discovered four decades ago, the existence of introns was one of the most unexpected findings in molecular biology1. Introns are sequences interrupting genes that must be removed as part of mRNA production. Genome sequencing projects have documented that most eukaryotic genes contain at least one and frequently many introns2,3. Comparison of these genomes reveals a history of long evolutionary periods with little intron gain punctuated by episodes of rapid, extensive gain2,3. However, no detailed mechanism for such episodic intron generation has been empirically supported on a sufficient scale, despite several proposals4–8. Here we show how short non-autonomous DNA transposons independently generated hundreds to thousands of introns in the prasinophyte Micromonas pusilla and the pelagophyte Aureococcus anophagefferens. Each transposon carries one splice site. The other splice site is co-opted from gene sequence duplicated upon transposon insertion, allowing perfect splicing out of RNA. The distributions of sequences that can be co-opted are biased with respect to codons, and phasing of transposon-generated introns is similarly biased. These transposons insert between preexisting nucleosomes, so that multiple nearby insertions generate nucleosome-sized intervening segments. Thus, transposon insertion and sequence co-option may explain the intron phase biases2 and prevalence of nucleosome-sized exons9 observed in eukaryotes. Overall, the two independent examples of proliferating elements illustrate a general DNA transposon mechanism plausibly accounting for episodes of rapid, extensive intron gain during eukaryotic evolution2,3. PMID:27760113

  17. Physical and genetic mapping of the dipeptidase gene DPEP1 to 16q24. 3

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Austruy, E.; Jeanpierre, C.; Junien, C.

    1993-03-01

    The authors report the subregional physical and genetic mapping on chromosome 16q of a cDNA clone selected as a potential tumor/growth suppressor sequence. By DNA sequencing and RNA expression pattern, this clone was identified as part of the renal dipeptidase gene (DPEP1). Using somatic cell hybrids carrying either different human chromosomes or chromosome 16 segments, they confirm and refine the physical mapping of DPEP1 to the chromosome 16 subregion q24.3. Two RFLPs, a biallelic polymorphism detected by TaqI and a VNTR detected by BamHI, EcoRI, and BglII, are described. Using the VNTR polymorphism, DPEP1 was shown to be linked tomore » D16S7 with a maximum lod score of 5.8 at a recombination fraction of 0.03. 14 refs., 2 figs., 2 tabs.« less

  18. [Analysis on genetic polymorphism of 5 STR loci selected from X chromosome].

    PubMed

    Liu, Qi-ji; Gong, Yao-qin; Zhang, Xi-yu; Gao, Gui-min; Li, Jiang-xia; Guo, Yi-shou

    2005-02-01

    To select short tandem repeats(STR) from X chromosome. STR is a universal genetic marker that has changeable polymorphism and stable heredity in human genome. It is a specific DNA segment composed of 2-6 base pairs as its core sequence. It is an ideal DNA marker used in linkage analysis and gene mapping. In this study, 8 short tandem repeats were selected from two genomic clones on X chromosome by using BCM Search Launcher. Primers amplifying the STR loci were designed by using Primer 3.0 according to the unique sequence flanking the STRs. Polymorphisms of the short tandem repeats in Chinese population were evaluated by PCR amplification and PAGE. Five of these STRs were polymorphic. Chi-square test indicated that the distribution of genotypes agreed with Hardy-Weinberg equilibrium (P>0.05). Five polymorphic short tandem repeats have been identified on chromosome X and will be useful for linkage analysis and gene mapping.

  19. Building a dictionary for genomes: Identification of presumptive regulatory sites by statistical analysis

    PubMed Central

    Bussemaker, Harmen J.; Li, Hao; Siggia, Eric D.

    2000-01-01

    The availability of complete genome sequences and mRNA expression data for all genes creates new opportunities and challenges for identifying DNA sequence motifs that control gene expression. An algorithm, “MobyDick,” is presented that decomposes a set of DNA sequences into the most probable dictionary of motifs or words. This method is applicable to any set of DNA sequences: for example, all upstream regions in a genome or all genes expressed under certain conditions. Identification of words is based on a probabilistic segmentation model in which the significance of longer words is deduced from the frequency of shorter ones of various lengths, eliminating the need for a separate set of reference data to define probabilities. We have built a dictionary with 1,200 words for the 6,000 upstream regulatory regions in the yeast genome; the 500 most significant words (some with as few as 10 copies in all of the upstream regions) match 114 of 443 experimentally determined sites (a significance level of 18 standard deviations). When analyzing all of the genes up-regulated during sporulation as a group, we find many motifs in addition to the few previously identified by analyzing the subclusters individually to the expression subclusters. Applying MobyDick to the genes derepressed when the general repressor Tup1 is deleted, we find known as well as putative binding sites for its regulatory partners. PMID:10944202

  20. Cloning and characterization of a DNA polymerase beta gene from Trypanosoma cruzi.

    PubMed

    Venegas, Juan A; Aslund, Lena; Solari, Aldo

    2009-06-01

    A gene coding for a DNA polymerase beta from the Trypanosoma cruzi Miranda clone, belonging to the TcI lineage, was cloned (Miranda Tcpol beta), using the information from eight peptides of the T. cruzi beta-like DNA polymerase purified previously. The gene encodes for a protein of 403 amino acids which is very similar to the two T. cruzi CL Brener (TcIIe lineage) sequences published, but has three different residues in highly conserved segments. At the amino acid level, the identity of TcI-pol beta with mitochondrial pol beta and pol beta-PAK from other trypanosomatids was between 68-80% and 22-30%, respectively. Miranda Tc-pol beta protein has an N-terminal sequence similar to that described in the mitochondrial Crithidia fasciculata pol beta, which suggests that the TcI-pol beta plays a role in the organelle. Northern and Western analyses showed that this T. cruzi gene is highly expressed both in proliferative and non-proliferative developmental forms. These results suggest that, in addition to replication of kDNA in proliferative cells, this enzyme may have another function in non-proliferative cells, such as DNA repair role similar to that which has extensively been described in a vast spectrum of eukaryotic cells.

  1. Isolation of a single rice chromosome by optical micromanipulation

    NASA Astrophysics Data System (ADS)

    Wang, Haowei; Liu, Xiaohui; Li, Yinmei; Han, Bin; Lou, Liren; Wang, Kangjun

    2004-01-01

    A new method based on optical tweezers technology is reported for the isolation of a single chromosome. A rice cell suspended in liquid was first fragmented by laser pulses (optical scalpel). Then a single released chromosome from the cell was manipulated and pulled away from other cells and oddments by optical tweezers without any direct mechanical contact. Finally the isolated single chromosome was extracted individually into a glass capillary nearby. After molecular cloning of the isolated chromosome, we obtained some specific DNA segments from the single chromosome. All these segments can be used for rice genomic sequencing. Different methods of extracting a single chromosome are compared. The advantages of optical micromanipulation method are summarized.

  2. Genes for cytochrome c oxidase subunit I, URF2, and three tRNAs in Drosophila mitochondrial DNA.

    PubMed Central

    Clary, D O; Wolstenholme, D R

    1983-01-01

    Genes for URF2, tRNAtrp, tRNAcys, tRNAtyr and cytochrome c oxidase subunit I (COI) have been identified within a sequenced segment of the Drosophila yakuba mtDNA molecule. The five genes are arranged in the order given. Transcription of the tRNAcys and tRNAtyr genes is in the same direction as replication, while transcription of the URF2, tRNAtrp and COI genes is in the opposite direction. A similar arrangement of these genes is found in mammalian mtDNA except that in the latter, the tRNAala and tRNAasn genes are located between the tRNAtrp and tRNAcys genes. Also, a sequence found between the tRNAasn and tRNAcys genes in mammalian mtDNA, which is associated with the initiation of second strand DNA synthesis, is not found in this region of the D. yakuba mtDNA molecule. As the D. yakuba COI gene lacks a standard translation initiation codon, we consider the possibility that the quadruplet ATAA may serve this function. As in other D. yakuba mitochondrial polypeptide genes, AGA codons in the URF2 and COI genes do not correspond in position to arginine-specifying codons in the equivalent genes of mouse and yeast mtDNAs, but do most frequently correspond to serine-specifying codons. PMID:6314262

  3. Molecular detection of Bartonella coopersplainsensis and B. henselae in rats from New Zealand.

    PubMed

    Vijayan Genitha Helan, J N; Grinberg, A; Gedye, K; Potter, M A; Harrus, S

    2018-06-25

    To identify Bartonella spp. in rats from New Zealand using molecular methods. DNA was extracted from the spleens of 143 black rats (Rattus rattus) captured in the Tongariro National Park, New Zealand. PCR was performed using Bartonella genus-specific primers amplifying segments of the 16S-23S rRNA internal transcribed spacer and citrate synthase (gltA) and beta subunit of the RNA polymerase (rpoB) genes. PCR products were sequenced and compared online with sequences stored in the database of the National Center for Biotechnology Information of the United States of America. DNA sequences matching Bartonella coopersplainsensis and B. henselae were detected in samples from 22/143 (15.4%) and 3/143 (2.1%) rats, respectively. Co-occurrence of B. coopersplainsensis and B. henselae sequences was observed in the sample from one rat. Gram-negative fastidious bacteria belonging to the genus Bartonella are associated with a range of human diseases. Rodents play an important role as reservoirs of a broad range of Bartonella species. To our knowledge, this is the first report of a molecular detection of Bartonella spp. DNA in rodents from New Zealand, and the first identification of B. henselae DNA in rats, worldwide. Whereas the public health significance of B. coopersplainsensis remains undefined, B. henselae is the agent of cat scratch disease, and the presence of this bacterium in rats may have public health implications. Our results are preliminary and additional analyses of larger samples, preferably by bacterial culture, would provide more information on the prevalence and diversity of Bartonella spp., in particular B. henselae, in rats.

  4. NADH:ubiquinone oxidoreductase from bovine heart mitochondria. cDNA sequences of the import precursors of the nuclear-encoded 39 kDa and 42 kDa subunits.

    PubMed Central

    Fearnley, I M; Finel, M; Skehel, J M; Walker, J E

    1991-01-01

    The 39 kDa and 42 kDa subunits of NADH:ubiquinone oxidoreductase from bovine heart mitochondria are nuclear-coded components of the hydrophobic protein fraction of the enzyme. Their amino acid sequences have been deduced from the sequences of overlapping cDNA clones. These clones were amplified from total bovine heart cDNA by means of the polymerase chain reaction, with the use of complex mixtures of oligonucleotide primers based upon fragments of protein sequence determined at the N-terminals of the proteins and at internal sites. The protein sequences of the 39 kDa and 42 kDa subunits are 345 and 320 amino acid residues long respectively, and their calculated molecular masses are 39,115 Da and 36,693 Da. Both proteins are predominantly hydrophilic, but each contains one or two hydrophobic segments that could possibly be folded into transmembrane alpha-helices. The bovine 39 kDa protein sequence is related to that of a 40 kDa subunit from complex I from Neurospora crassa mitochondria; otherwise, it is not related significantly to any known sequence, including redox proteins and two polypeptides involved in import of proteins into mitochondria, known as the mitochondrial processing peptidase and the processing-enhancing protein. Therefore the functions of the 39 kDa and 42 kDa subunits of complex I are unknown. The mitochondrial gene product, ND4, a hydrophobic component of complex I with an apparent molecular mass of about 39 kDa, has been identified in preparations of the enzyme. This subunit stains faintly with Coomassie Blue dye, and in many gel systems it is not resolved from the nuclearcoded 36 kDa subunit. Images Fig. 1. PMID:1832859

  5. Distinct patterns of alteration of myc genes associated with integration of human papillomavirus type 16 or type 45 DNA in two genital tumours.

    PubMed

    Sastre-Garau, X; Favre, M; Couturier, J; Orth, G

    2000-08-01

    We previously described two genital carcinomas (IC2, IC4) containing human papillomavirus type 16 (HPV-16)- or HPV-18-related sequences integrated in chromosomal bands containing the c-myc (8q24) or N-myc (2p24) gene, respectively. The c-myc gene was rearranged and amplified in IC2 cells without evidence of overexpression. The N-myc gene was amplified and highly transcribed in IC4 cells. Here, the sequence of an 8039 bp IC4 DNA fragment containing the integrated viral sequences and the cellular junctions is reported. A 3948 bp segment of the genome of HPV-45 encompassing the upstream regulatory region and the E6 and E7 ORFs was integrated into the untranslated part of N-myc exon 3, upstream of the N-myc polyadenylation signal. Both N-myc and HPV-45 sequences were amplified 10- to 20-fold. The 3' ends of the major N-myc transcript were mapped upstream of the 5' junction. A minor N-myc/HPV-45 fusion transcript was also identified, as well as two abundant transcripts from the HPV-45 E6-E7 region. Large amounts of N-myc protein were detected in IC4 cells. A major alteration of c-myc sequences in IC2 cells involved the insertion of a non-coding sequence into the second intron and their co-amplification with the third exon, without any evidence for the integration of HPV-16 sequences within or close to the gene. Different patterns of myc gene alterations may thus be associated with integration of HPV DNA in genital tumours, including the activation of the protooncogene via a mechanism of insertional mutagenesis and/or gene amplification.

  6. JNSViewer—A JavaScript-based Nucleotide Sequence Viewer for DNA/RNA secondary structures

    PubMed Central

    Dong, Min; Graham, Mitchell; Yadav, Nehul

    2017-01-01

    Many tools are available for visualizing RNA or DNA secondary structures, but there is scarce implementation in JavaScript that provides seamless integration with the increasingly popular web computational platforms. We have developed JNSViewer, a highly interactive web service, which is bundled with several popular tools for DNA/RNA secondary structure prediction and can provide precise and interactive correspondence among nucleotides, dot-bracket data, secondary structure graphs, and genic annotations. In JNSViewer, users can perform RNA secondary structure predictions with different programs and settings, add customized genic annotations in GFF format to structure graphs, search for specific linear motifs, and extract relevant structure graphs of sub-sequences. JNSViewer also allows users to choose a transcript or specific segment of Arabidopsis thaliana genome sequences and predict the corresponding secondary structure. Popular genome browsers (i.e., JBrowse and BrowserGenome) were integrated into JNSViewer to provide powerful visualizations of chromosomal locations, genic annotations, and secondary structures. In addition, we used StructureFold with default settings to predict some RNA structures for Arabidopsis by incorporating in vivo high-throughput RNA structure profiling data and stored the results in our web server, which might be a useful resource for RNA secondary structure studies in plants. JNSViewer is available at http://bioinfolab.miamioh.edu/jnsviewer/index.html. PMID:28582416

  7. Probing the Potential Role of Non-B DNA Structures at Yeast Meiosis-Specific DNA Double-Strand Breaks.

    PubMed

    Kshirsagar, Rucha; Khan, Krishnendu; Joshi, Mamata V; Hosur, Ramakrishna V; Muniyappa, K

    2017-05-23

    A plethora of evidence suggests that different types of DNA quadruplexes are widely present in the genome of all organisms. The existence of a growing number of proteins that selectively bind and/or process these structures underscores their biological relevance. Moreover, G-quadruplex DNA has been implicated in the alignment of four sister chromatids by forming parallel guanine quadruplexes during meiosis; however, the underlying mechanism is not well defined. Here we show that a G/C-rich motif associated with a meiosis-specific DNA double-strand break (DSB) in Saccharomyces cerevisiae folds into G-quadruplex, and the C-rich sequence complementary to the G-rich sequence forms an i-motif. The presence of G-quadruplex or i-motif structures upstream of the green fluorescent protein-coding sequence markedly reduces the levels of gfp mRNA expression in S. cerevisiae cells, with a concomitant decrease in green fluorescent protein abundance, and blocks primer extension by DNA polymerase, thereby demonstrating the functional significance of these structures. Surprisingly, although S. cerevisiae Hop1, a component of synaptonemal complex axial/lateral elements, exhibits strong affinity to G-quadruplex DNA, it displays a much weaker affinity for the i-motif structure. However, the Hop1 C-terminal but not the N-terminal domain possesses strong i-motif binding activity, implying that the C-terminal domain has a distinct substrate specificity. Additionally, we found that Hop1 promotes intermolecular pairing between G/C-rich DNA segments associated with a meiosis-specific DSB site. Our results support the idea that the G/C-rich motifs associated with meiosis-specific DSBs fold into intramolecular G-quadruplex and i-motif structures, both in vitro and in vivo, thus revealing an important link between non-B form DNA structures and Hop1 in meiotic chromosome synapsis and recombination. Copyright © 2017 Biophysical Society. Published by Elsevier Inc. All rights reserved.

  8. Space pruning monotonic search for the non-unique probe selection problem.

    PubMed

    Pappalardo, Elisa; Ozkok, Beyza Ahlatcioglu; Pardalos, Panos M

    2014-01-01

    Identification of targets, generally viruses or bacteria, in a biological sample is a relevant problem in medicine. Biologists can use hybridisation experiments to determine whether a specific DNA fragment, that represents the virus, is presented in a DNA solution. A probe is a segment of DNA or RNA, labelled with a radioactive isotope, dye or enzyme, used to find a specific target sequence on a DNA molecule by hybridisation. Selecting unique probes through hybridisation experiments is a difficult task, especially when targets have a high degree of similarity, for instance in a case of closely related viruses. After preliminary experiments, performed by a canonical Monte Carlo method with Heuristic Reduction (MCHR), a new combinatorial optimisation approach, the Space Pruning Monotonic Search (SPMS) method, is introduced. The experiments show that SPMS provides high quality solutions and outperforms the current state-of-the-art algorithms.

  9. DNA analyses of the remains of the Prince Branciforte Barresi family.

    PubMed

    Rickards, O; Martínez-Labarga, C; Favaro, M; Frezza, D; Mallegni, F

    2001-01-01

    The five skeletons found buried in the church of Militello di Catania, Sicily, were tentatively identified by morphological analysis and historical reports as the remains of Prince Branciforte Barresi, two of his children, his brother and another juvenile member of the family (sixteenth and seventeenth centuries). In order to attempt to clarify the degree of relationships of the five skeletons, sex testing and mitochondrial DNA (mtDNA) sequence analysis of the hypervariable segments I and II (HV1 and HV2) of control region were performed. Moreover, the 9 bp-deletion marker of region V (COII/tRNAlys) was examined. Molecular genetic analyses were consistent with historical expectations, although they did not directly demonstrate that these are in fact the remains of the Prince and his relatives, due to the impossibility of obtaining DNA from living maternal relatives of the Prince.

  10. Structural mechanics of DNA wrapping in the nucleosome.

    PubMed

    Battistini, Federica; Hunter, Christopher A; Gardiner, Eleanor J; Packer, Martin J

    2010-02-19

    Experimental X-ray crystal structures and a database of calculated structural parameters of DNA octamers were used in combination to analyse the mechanics of DNA bending in the nucleosome core complex. The 1kx5 X-ray crystal structure of the nucleosome core complex was used to determine the relationship between local structure at the base-step level and the global superhelical conformation observed for nucleosome-bound DNA. The superhelix is characterised by a large curvature (597 degrees) in one plane and very little curvature (10 degrees) in the orthogonal plane. Analysis of the curvature at the level of 10-step segments shows that there is a uniform curvature of 30 degrees per helical turn throughout most of the structure but that there are two sharper kinks of 50 degrees at +/-2 helical turns from the central dyad base pair. The curvature is due almost entirely to the base-step parameter roll. There are large periodic variations in roll, which are in phase with the helical twist and account for 500 degrees of the total curvature. Although variations in the other base-step parameters perturb the local path of the DNA, they make minimal contributions to the total curvature. This implies that DNA bending in the nucleosome is achieved using the roll-slide-twist degree of freedom previously identified as the major degree of freedom in naked DNA oligomers. The energetics of bending into a nucleosome-bound conformation were therefore analysed using a database of structural parameters that we have previously developed for naked DNA oligomers. The minimum energy roll, the roll flexibility force constant and the maximum and minimum accessible roll values were obtained for each base step in the relevant octanucleotide context to account for the effects of conformational coupling that vary with sequence context. The distribution of base-step roll values and corresponding strain energy required to bend DNA into the nucleosome-bound conformation defined by the 1kx5 structure were obtained by applying a constant bending moment. When a single bending moment was applied to the entire sequence, the local details of the calculated structure did not match the experiment. However, when local 10-step bending moments were applied separately, the calculated structure showed excellent agreement with experiment. This implies that the protein applies variable bending forces along the DNA to maintain the superhelical path required for nucleosome wrapping. In particular, the 50 degrees kinks are constraints imposed by the protein rather than a feature of the 1kx5 DNA sequence. The kinks coincide with a relatively flexible region of the sequence, and this is probably a prerequisite for high-affinity nucleosome binding, but the bending strain energy is significantly higher at these points than for the rest of the sequence. In the most rigid regions of the sequence, a higher strain energy is also required to achieve the standard 30 degrees curvature per helical turn. We conclude that matching of the DNA sequence to the local roll periodicity required to achieve bending, together with the increased flexibility required at the kinks, determines the sequence selectivity of DNA wrapping in the nucleosome. 2009 Elsevier Ltd. All rights reserved.

  11. Coval: Improving Alignment Quality and Variant Calling Accuracy for Next-Generation Sequencing Data

    PubMed Central

    Kosugi, Shunichi; Natsume, Satoshi; Yoshida, Kentaro; MacLean, Daniel; Cano, Liliana; Kamoun, Sophien; Terauchi, Ryohei

    2013-01-01

    Accurate identification of DNA polymorphisms using next-generation sequencing technology is challenging because of a high rate of sequencing error and incorrect mapping of reads to reference genomes. Currently available short read aligners and DNA variant callers suffer from these problems. We developed the Coval software to improve the quality of short read alignments. Coval is designed to minimize the incidence of spurious alignment of short reads, by filtering mismatched reads that remained in alignments after local realignment and error correction of mismatched reads. The error correction is executed based on the base quality and allele frequency at the non-reference positions for an individual or pooled sample. We demonstrated the utility of Coval by applying it to simulated genomes and experimentally obtained short-read data of rice, nematode, and mouse. Moreover, we found an unexpectedly large number of incorrectly mapped reads in ‘targeted’ alignments, where the whole genome sequencing reads had been aligned to a local genomic segment, and showed that Coval effectively eliminated such spurious alignments. We conclude that Coval significantly improves the quality of short-read sequence alignments, thereby increasing the calling accuracy of currently available tools for SNP and indel identification. Coval is available at http://sourceforge.net/projects/coval105/. PMID:24116042

  12. Maternal Gametic Transmission of Translocations or Inversions of Human Chromosome 11p15.5 Results in Regional DNA Hypermethylation and Downregulation of CDKN1C Expression

    PubMed Central

    Smith, Adam C.; Suzuki, Masako; Thompson, Reid; Choufani, Sanaa; Higgins, Michael J.; Chiu, Idy W.; Squire, Jeremy A.; Greally, John M.; Weksberg, Rosanna

    2015-01-01

    Beckwith-Wiedemann syndrome (BWS) is an overgrowth syndrome associated with genetic or epigenetic alterations in one of two imprinted domains on chromosome 11p15.5. Rarely, chromosomal translocations or inversions of chromosome 11p15.5 are associated with BWS but the molecular pathophysiology in such cases is not understood. In our series of 3 translocation and 2 inversion patients with BWS, the chromosome 11p15.5 breakpoints map within the centromeric imprinted domain, 2. We hypothesized that either microdeletions/microduplications adjacent to the breakpoints could disrupt genomic sequences important for imprinted gene regulation. An alternate hypothesis was that epigenetic alterations of as yet unknown regulatory DNA sequences, result in the BWS phenotype. A high resolution Nimblegen custom microarray was designed representing all non-repetitive sequences in the telomeric 33 MB of the short arm of human chromosome 11. For the BWS-associated chromosome 11p15.5 translocations and inversions, we found no evidence of microdeletions/microduplications. DNA methylation was also tested on this microarray using the HpaII tiny fragment enrichment by ligation-mediated PCR (HELP) assay. This high-resolution DNA methylation microarray analysis revealed a gain of DNA methylation in the translocation/inversion patients affecting the p-ter segment of chromosome 11p15, including both imprinted domains. BWS patients that inherited a maternal translocation or inversion also demonstrated reduced expression of the growth suppressing imprinted gene, CDKN1C in Domain 2. In summary, our data demonstrate that translocations and inversions involving imprinted domain 2 on chromosome 11p15.5, alter regional DNA methylation patterns and imprinted gene expression in cis, suggesting that these epigenetic alterations are generated by an alteration in “chromatin context”. PMID:22079941

  13. Estimating Exceptionally Rare Germline and Somatic Mutation Frequencies via Next Generation Sequencing

    PubMed Central

    Yoon, Song-Ro; Arnheim, Norman; Calabrese, Peter

    2016-01-01

    We used targeted next generation deep-sequencing (Safe Sequencing System) to measure ultra-rare de novo mutation frequencies in the human male germline by attaching a unique identifier code to each target DNA molecule. Segments from three different human genes (FGFR3, MECP2 and PTPN11) were studied. Regardless of the gene segment, the particular testis donor or the 73 different testis pieces used, the frequencies for any one of the six different mutation types were consistent. Averaging over the C>T/G>A and G>T/C>A mutation types the background mutation frequency was 2.6x10-5 per base pair, while for the four other mutation types the average background frequency was lower at 1.5x10-6 per base pair. These rates far exceed the well documented human genome average frequency per base pair (~10−8) suggesting a non-biological explanation for our data. By computational modeling and a new experimental procedure to distinguish between pre-mutagenic lesion base mismatches and a fully mutated base pair in the original DNA molecule, we argue that most of the base-dependent variation in background frequency is due to a mixture of deamination and oxidation during the first two PCR cycles. Finally, we looked at a previously studied disease mutation in the PTPN11 gene and could easily distinguish true mutations from the SSS background. We also discuss the limits and possibilities of this and other methods to measure exceptionally rare mutation frequencies, and we present calculations for other scientists seeking to design their own such experiments. PMID:27341568

  14. Molecular Evolution and Mosaicism of Leptospiral Outer Membrane Proteins Involves Horizontal DNA Transfer

    PubMed Central

    Haake, David A.; Suchard, Marc A.; Kelley, Melissa M.; Dundoo, Manjula; Alt, David P.; Zuerner, Richard L.

    2004-01-01

    Leptospires belong to a genus of parasitic bacterial spirochetes that have adapted to a broad range of mammalian hosts. Mechanisms of leptospiral molecular evolution were explored by sequence analysis of four genes shared by 38 strains belonging to the core group of pathogenic Leptospira species: L. interrogans, L. kirschneri, L. noguchii, L. borgpetersenii, L. santarosai, and L. weilii. The 16S rRNA and lipL32 genes were highly conserved, and the lipL41 and ompL1 genes were significantly more variable. Synonymous substitutions are distributed throughout the ompL1 gene, whereas nonsynonymous substitutions are clustered in four variable regions encoding surface loops. While phylogenetic trees for the 16S, lipL32, and lipL41 genes were relatively stable, 8 of 38 (20%) ompL1 sequences had mosaic compositions consistent with horizontal transfer of DNA between related bacterial species. A novel Bayesian multiple change point model was used to identify the most likely sites of recombination and to determine the phylogenetic relatedness of the segments of the mosaic ompL1 genes. Segments of the mosaic ompL1 genes encoding two of the surface-exposed loops were likely acquired by horizontal transfer from a peregrine allele of unknown ancestry. Identification of the most likely sites of recombination with the Bayesian multiple change point model, an approach which has not previously been applied to prokaryotic gene sequence analysis, serves as a model for future studies of recombination in molecular evolution of genes. PMID:15090524

  15. A novel begomovirus isolated from sida contains putative cis- and trans-acting replication specificity determinants that have evolved independently in several geographical lineages.

    PubMed

    Mauricio-Castillo, J A; Torres-Herrera, S I; Cárdenas-Conejo, Y; Pastor-Palacios, G; Méndez-Lozano, J; Argüello-Astorga, G R

    2014-09-01

    A novel begomovirus isolated from a Sida rhombifolia plant collected in Sinaloa, Mexico, was characterized. The genomic components of sida mosaic Sinaloa virus (SiMSinV) shared highest sequence identity with DNA-A and DNA-B components of chino del tomate virus (CdTV), suggesting a vertical evolutionary relationship between these viruses. However, recombination analysis indicated that a short segment of SiMSinV DNA-A encompassing the plus-strand replication origin and the 5´-proximal 43 codons of the Rep gene was derived from tomato mottle Taino virus (ToMoTV). Accordingly, the putative cis- and trans-acting replication specificity determinants of SiMSinV were identical to those of ToMoTV but differed from those of CdTV. Modeling of the SiMSinV and CdTV Rep proteins revealed significant differences in the region comprising the small β1/β5 sheet element, where five putative DNA-binding specificity determinants (SPDs) of Rep (i.e., amino acid residues 5, 8, 10, 69 and 71) were previously identified. Computer-assisted searches of public databases led to identification of 33 begomoviruses from three continents encoding proteins with SPDs identical to those of the Rep encoded by SiMSinV. Sequence analysis of the replication origins demonstrated that all 33 begomoviruses harbor potential Rep-binding sites identical to those of SiMSinV. These data support the hypothesis that the Rep β1/β5 sheet region determines specificity of this protein for DNA replication origin sequences.

  16. Randomized DNA libraries construction tool: a new 3-bp 'frequent cutter' TthHB27I/sinefungin endonuclease with chemically-induced specificity.

    PubMed

    Krefft, Daria; Papkov, Aliaksei; Prusinowski, Maciej; Zylicz-Stachula, Agnieszka; Skowron, Piotr M

    2018-05-11

    Acoustic or hydrodynamic shearing, sonication and enzymatic digestion are used to fragment DNA. However, these methods have several disadvantages, such as DNA damage, difficulties in fragmentation control, irreproducibility and under-representation of some DNA segments. The DNA fragmentation tool would be a gentle enzymatic method, offering cleavage frequency high enough to eliminate DNA fragments distribution bias and allow for easy control of partial digests. Only three such frequently cleaving natural restriction endonucleases (REases) were discovered: CviJI, SetI and FaiI. Therefore, we have previously developed two artificial enzymatic specificities, cleaving DNA approximately every ~ 3-bp: TspGWI/sinefungin (SIN) and TaqII/SIN. In this paper we present the third developed specificity: TthHB27I/SIN(SAM) - a new genomic tool, based on Type IIS/IIC/IIG Thermus-family REases-methyltransferases (MTases). In the presence of dimethyl sulfoxide (DMSO) and S-adenosyl-L-methionine (SAM) or its analogue SIN, the 6-bp cognate TthHB27I recognition sequence 5'-CAARCA-3' is converted into a combined 3.2-3.0-bp 'site' or its statistical equivalent, while a cleavage distance of 11/9 nt is retained. Protocols for various modes of limited DNA digestions were developed. In the presence of DMSO and SAM or SIN, TthHB27I is transformed from rare 6-bp cutter to a very frequent one, approximately 3-bp. Thus, TthHB27I/SIN(SAM) comprises a new tool in the very low-represented segment of such prototype REases specificities. Moreover, this modified TthHB27I enzyme is uniquely suited for controlled DNA fragmentation, due to partial DNA cleavage, which is an inherent feature of the Thermus-family enzymes. Such tool can be used for quasi-random libraries generation as well as for other DNA manipulations, requiring high frequency cleavage and uniform distribution of cuts along DNA.

  17. A heterozygous putative null mutation in ROM1 without a mutation in peripherin/RDS in a family with retinitis pigmentosa

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Sakuma, Hitoshi; Inana, G.; Murakami, Akira

    1995-05-20

    ROM1 is a 351-amino-acid, 37-kDa outer segment membrane protein of rod photoreceptors. ROM1 is related to peripherin/RDS, another outer segment membrane protein found in both rods and cones. The precise function of ROM1 or peripherin/RDS is not known, but they have been suggested to play important roles in the function and/or structure of the rod photoreceptor outer segment disks. A recent report implicated ROM1 in disease by suggesting that RP can be caused by a heterozygous null mutation in ROM1 but only in combination with another heterozygous mutation in peripherin/RDS. Screening of the ROM1 gene using polymerase chain reaction amplification,more » denaturing gradient gel electrophoresis, and direct DNA sequencing identified the same heterozygous putative null mutation in a family with RP.« less

  18. Structured oligonucleotides for target indexing to allow single-vessel PCR amplification and solid support microarray hybridization.

    PubMed

    Girard, Laurie D; Boissinot, Karel; Peytavi, Régis; Boissinot, Maurice; Bergeron, Michel G

    2015-02-07

    The combination of molecular diagnostic technologies is increasingly used to overcome limitations on sensitivity, specificity or multiplexing capabilities, and provide efficient lab-on-chip devices. Two such techniques, PCR amplification and microarray hybridization are used serially to take advantage of the high sensitivity and specificity of the former combined with high multiplexing capacities of the latter. These methods are usually performed in different buffers and reaction chambers. However, these elaborate methods have high complexity and cost related to reagent requirements, liquid storage and the number of reaction chambers to integrate into automated devices. Furthermore, microarray hybridizations have a sequence dependent efficiency not always predictable. In this work, we have developed the concept of a structured oligonucleotide probe which is activated by cleavage from polymerase exonuclease activity. This technology is called SCISSOHR for Structured Cleavage Induced Single-Stranded Oligonucleotide Hybridization Reaction. The SCISSOHR probes enable indexing the target sequence to a tag sequence. The SCISSOHR technology also allows the combination of nucleic acid amplification and microarray hybridization in a single vessel in presence of the PCR buffer only. The SCISSOHR technology uses an amplification probe that is irreversibly modified in presence of the target, releasing a single-stranded DNA tag for microarray hybridization. Each tag is composed of a 3-nucleotide sequence-dependent segment and a unique "target sequence-independent" 14-nucleotide segment allowing for optimal hybridization with minimal cross-hybridization. We evaluated the performance of five (5) PCR buffers to support microarray hybridization, compared to a conventional hybridization buffer. Finally, as a proof of concept, we developed a multiplexed assay for the amplification, detection, and identification of three (3) DNA targets. This new technology will facilitate the design of lab-on-chip microfluidic devices, while also reducing consumable costs. At term, it will allow the cost-effective automation of highly multiplexed assays for detection and identification of genetic targets.

  19. The mitochondrial genome and a 60-kb nuclear DNA segment from Naegleria fowleri, the causative agent of primary amoebic meningoencephalitis.

    PubMed

    Herman, Emily K; Greninger, Alexander L; Visvesvara, Govinda S; Marciano-Cabral, Francine; Dacks, Joel B; Chiu, Charles Y

    2013-01-01

    Naegleria fowleri is a unicellular eukaryote causing primary amoebic meningoencephalitis, a neuropathic disease killing 99% of those infected, usually within 7-14 days. Naegleria fowleri is found globally in regions including the US and Australia. The genome of the related nonpathogenic species Naegleria gruberi has been sequenced, but the genetic basis for N. fowleri pathogenicity is unclear. To generate such insight, we sequenced and assembled the mitochondrial genome and a 60-kb segment of nuclear genome from N. fowleri. The mitochondrial genome is highly similar to its counterpart in N. gruberi in gene complement and organization, while distinct lack of synteny is observed for the nuclear segments. Even in this short (60-kb) segment, we identified examples of potential factors for pathogenesis, including ten novel N. fowleri-specific genes. We also identified a homolog of cathepsin B; proteases proposed to be involved in the pathogenesis of diverse eukaryotic pathogens, including N. fowleri. Finally, we demonstrate a likely case of horizontal gene transfer between N. fowleri and two unrelated amoebae, one of which causes granulomatous amoebic encephalitis. This initial look into the N. fowleri nuclear genome has revealed several examples of potential pathogenesis factors, improving our understanding of a neglected pathogen of increasing global importance. © 2013 The Author(s) Journal of Eukaryotic Microbiology © 2013 International Society of Protistologists.

  20. mtDNA variation in caste populations of Andhra Pradesh, India.

    PubMed

    Bamshad, M; Fraley, A E; Crawford, M H; Cann, R L; Busi, B R; Naidu, J M; Jorde, L B

    1996-02-01

    Various anthropological analyses have documented extensive regional variation among populations on the subcontinent of India using morphological, protein, blood group, and nuclear DNA polymorphisms. These patterns are the product of complex population structure (genetic drift, gene flow) and a population history noted for numerous branching events. As a result, the interpretation of relationships among caste populations of South India and between Indians and continental populations remains controversial. The Hindu caste system is a general model of genetic differentiation among endogamous populations stratified by social forces (e.g., religion and occupation). The mitochondrial DNA (mtDNA) molecule has unique properties that facilitate the exploration of population structure. We analyzed 36 Hindu men born in Andhra Pradesh who were unrelated matrilineally through at least 3 generations and who represent 4 caste populations: Brahmin (9), Yadava (10), Kapu (7), and Relli (10). Individuals from Africa (36), Asia (36), and Europe (36) were sampled for comparison. A 200-base-pair segment of hypervariable segment 2 (HVS2) of the mtDNA control region was sequenced in all individuals. In the Indian castes 25 distinct haplotypes are identified. Aside from the Cambridge reference sequence, only two haplotypes are shared between caste populations. Middle castes form a highly supported cluster in a neighbor-joining network. Mean nucleotide diversity within each caste is 0.015, 0.012, 0.011, and 0.012 for the Brahmin, Yadava, Kapu, and Relli, respectively. mtDNA variation is highly structured between castes (GST = 0.17; p < 0.002). The effects of social structure on mtDNA variation are much greater than those on variation measured by traditional markers. Explanations for this discordance include (1) the higher resolving power of mtDNA, (2) sex-dependent gene flow, (3) differences in male and female effective population sizes, and (4) elements of the kinship structure. Thirty distinct haplotypes are found in Africans, 17 in Asians, and 13 in Europeans. Mean nucleotide diversity is 0.019, 0.014, 0.009, and 0.007 for Africans, Indians, Asians, and Europeans, respectively. These populations are highly structured geographically (GST = 0.15; p < 0.001). The caste populations of Andhra Pradesh cluster more often with Africans than with Asians or Europeans. This is suggestive of admixture with African populations.

  1. Molecular basis of splotch and Waardenburg Pax-3 mutations.

    PubMed Central

    Chalepakis, G; Goulding, M; Read, A; Strachan, T; Gruss, P

    1994-01-01

    Pax genes control certain aspects of development, as mutations result in (semi)dominant defects apparent during embryogenesis. Pax-3 has been associated with the mouse mutant splotch (Sp) and the human Waardenburg syndrome type 1 (WS1). We have examined the molecular basis of splotch and WS1 by studying the effect of mutations on DNA binding, using a defined target sequence. Pax-3 contains two different types of functional DNA-binding domains, a paired domain and a homeodomain. Mutational analysis of Pax-3 reveals different modes of DNA binding depending on the presence of these domains. A segment of Pax-3 located between the two DNA-binding domains, including a conserved octapeptide, participates in protein homodimerization. Pax-3 mutations found in splotch alleles and WS1 individuals change DNA binding and, in the case of a protein product of the Sp allele, dimerization. These findings were taken as a basis to define the molecular nature of the mutants. Images PMID:7909605

  2. Forensic analysis of mtDNA haplotypes from two rural communities in Haiti reflects their population history.

    PubMed

    Wilson, Jamie L; Saint-Louis, Vertus; Auguste, Jensen O; Jackson, Bruce A

    2012-11-01

    Very little genetic data exist on Haitians, an estimated 1.2 million of whom, not including illegal immigrants, reside in the United States. The absence of genetic data on a population of this size reduces the discriminatory power of criminal and missing-person DNA databases in the United States and Caribbean. We present a forensic population study that provides the first genetic data set for Haiti. This study uses hypervariable segment one (HVS-1) mitochondrial DNA (mtDNA) nucleotide sequences from 291 subjects primarily from rural areas of northern and southern Haiti, where admixture would be minimal. Our results showed that the African maternal genetic component of Haitians had slightly higher West-Central African admixture than African-Americans and Dominicans, but considerably less than Afro-Brazilians. These results lay the foundation for further forensic genetics studies in the Haitian population and serve as a model for forensic mtDNA identification of individuals in other isolated or rural communities. © 2012 American Academy of Forensic Sciences.

  3. SV40 host-substituted variants: a new look at the monkey DNA inserts and recombinant junctions.

    PubMed

    Singer, Maxine; Winocour, Ernest

    2011-04-10

    The available monkey genomic data banks were examined in order to determine the chromosomal locations of the host DNA inserts in 8 host-substituted SV40 variant DNAs. Five of the 8 variants contained more than one linked monkey DNA insert per tandem repeat unit and in all cases but one, the 19 monkey DNA inserts in the 8 variants mapped to different locations in the monkey genome. The 50 parental DNAs (32 monkey and 18 SV40 DNA segments) which spanned the crossover and flanking regions that participated in monkey/monkey and monkey/SV40 recombinations were characterized by substantial levels of microhomology of up to 8 nucleotides in length; the parental DNAs also exhibited direct and inverted repeats at or adjacent to the crossover sequences. We discuss how the host-substituted SV40 variants arose and the nature of the recombination mechanisms involved. Copyright © 2011 Elsevier Inc. All rights reserved.

  4. Regulation of Immunoglobulin Class-Switch Recombination: Choreography of Noncoding Transcription, Targeted DNA Deamination, and Long-Range DNA Repair

    PubMed Central

    Matthews, Allysia J.; Zheng, Simin; DiMenna, Lauren J.; Chaudhuri, Jayanta

    2014-01-01

    Upon encountering antigens, mature IgM-positive B lymphocytes undergo class-switch recombination (CSR) wherein exons encoding the default Cμ constant coding gene segment of the immunoglobulin (Ig) heavy-chain (Igh) locus are excised and replaced with a new constant gene segment (referred to as “Ch genes”, e.g., Cγ, Cε, or Cα). The B cell thereby changes from expressing IgM to one producing IgG, IgE, or IgA, with each antibody isotype having a different effector function during an immune reaction. CSR is a DNA deletional-recombination reaction that proceeds through the generation of DNA double-strand breaks (DSBs) in repetitive switch (S) sequences preceding each Ch gene and is completed by end-joining between donor Sμ and acceptor S regions. CSR is a multistep reaction requiring transcription through S regions, the DNA cytidine deaminase AID, and the participation of several general DNA repair pathways including base excision repair, mismatch repair, and classical nonhomologous end-joining. In this review, we discuss our current understanding of how transcription through S regions generates substrates for AID-mediated deamination and how AID participates not only in the initiation of CSR but also in the conversion of deaminated residues into DSBs. Additionally, we review the multiple processes that regulate AID expression and facilitate its recruitment specifically to the Ig loci, and how deregulation of AID specificity leads to oncogenic translocations. Finally, we summarize recent data on the potential role of AID in the maintenance of the pluripotent stem cell state during epigenetic reprogramming. PMID:24507154

  5. APOBEC3A efficiently deaminates methylated, but not TET-oxidized, cytosine bases in DNA

    PubMed Central

    Schutsky, Emily K.; Nabel, Christopher S.; Davis, Amy K. F.; DeNizio, Jamie E.

    2017-01-01

    Abstract AID/APOBEC family enzymes are best known for deaminating cytosine bases to uracil in single-stranded DNA, with characteristic sequence preferences that can produce mutational signatures in targets such as retroviral and cancer cell genomes. These deaminases have also been proposed to function in DNA demethylation via deamination of either 5-methylcytosine (mC) or TET-oxidized mC bases (ox-mCs), which include 5-hydroxymethylcytosine, 5-formylcytosine and 5-carboxylcytosine. One specific family member, APOBEC3A (A3A), has been shown to readily deaminate mC, raising the prospect of broader activity on ox-mCs. To investigate this claim, we developed a novel assay that allows for parallel profiling of activity on all modified cytosines. Our steady-state kinetic analysis reveals that A3A discriminates against all ox-mCs by >3700-fold, arguing that ox-mC deamination does not contribute substantially to demethylation. A3A is, by contrast, highly proficient at C/mC deamination. Under conditions of excess enzyme, C/mC bases can be deaminated to completion in long DNA segments, regardless of sequence context. Interestingly, under limiting A3A, the sequence preferences observed with targeting unmodified cytosine are further exaggerated when deaminating mC. Our study informs how methylation, oxidation, and deamination can interplay in the genome and suggests A3A's potential utility as a biotechnological tool to discriminate between cytosine modification states. PMID:28472485

  6. Towards the delineation of the ancestral eutherian genome organization: comparative genome maps of human and the African elephant (Loxodonta africana) generated by chromosome painting.

    PubMed Central

    Frönicke, Lutz; Wienberg, Johannes; Stone, Gary; Adams, Lisa; Stanyon, Roscoe

    2003-01-01

    This study presents a whole-genome comparison of human and a representative of the Afrotherian clade, the African elephant, generated by reciprocal Zoo-FISH. An analysis of Afrotheria genomes is of special interest, because recent DNA sequence comparisons identify them as the oldest placental mammalian clade. Complete sets of whole-chromosome specific painting probes for the African elephant and human were constructed by degenerate oligonucleotide-primed PCR amplification of flow-sorted chromosomes. Comparative genome maps are presented based on their hybridization patterns. These maps show that the elephant has a moderately rearranged chromosome complement when compared to humans. The human paint probes identified 53 evolutionary conserved segments on the 27 autosomal elephant chromosomes and the X chromosome. Reciprocal experiments with elephant probes delineated 68 conserved segments in the human genome. The comparison with a recent aardvark and elephant Zoo-FISH study delineates new chromosomal traits which link the two Afrotherian species phylogenetically. In the absence of any morphological evidence the chromosome painting data offer the first non-DNA sequence support for an Afrotherian clade. The comparative human and elephant genome maps provide new insights into the karyotype organization of the proto-afrotherian, the ancestor of extant placental mammals, which most probably consisted of 2n=46 chromosomes. PMID:12965023

  7. Effect of long-term exposure to mobile phone radiation on alpha-Int1 gene sequence of Candida albicans

    PubMed Central

    Shahin-jafari, Ariyo; Bayat, Mansour; Shahhosseiny, Mohammad Hassan; Tajik, Parviz; Roudbar-mohammadi, Shahla

    2015-01-01

    Over the last decade, communication industries have witnessed a tremendous expansion, while, the biological effects of electromagnetic waves have not been fully elucidated. Current study aimed at evaluating the mutagenic effect of long-term exposure to 900-MHz radiation on alpha-Int1 gene sequences of Candida albicans. A standard 900 MHz radiation generator was used for radiation. 10 ml volumes from a stock suspension of C. albicans were transferred into 10 polystyrene tubes. Five tubes were exposed at 4 °C to a fixed magnitude of radiation with different time periods of 10, 70, 210, 350 and 490 h. The other 5 tubes were kept far enough from radiation. The samples underwent genomic DNA extraction. PCR amplification of alpha-Int1 gene sequence was done using one set of primers. PCR products were resolved using agarose gel electrophoresis and the nucleotide sequences were determined. All samples showed a clear electrophoretic band around 441 bp and further sequencing revealed the amplified DNA segments are related to alpha-Int1 gene of the yeast. No mutations in the gene were seen in radiation exposed samples. Long-term exposure of the yeast to mobile phone radiation under the above mentioned conditions had no mutagenic effect on alpha-Int1 gene sequence. PMID:27081370

  8. Effect of long-term exposure to mobile phone radiation on alpha-Int1 gene sequence of Candida albicans.

    PubMed

    Shahin-Jafari, Ariyo; Bayat, Mansour; Shahhosseiny, Mohammad Hassan; Tajik, Parviz; Roudbar-Mohammadi, Shahla

    2016-05-01

    Over the last decade, communication industries have witnessed a tremendous expansion, while, the biological effects of electromagnetic waves have not been fully elucidated. Current study aimed at evaluating the mutagenic effect of long-term exposure to 900-MHz radiation on alpha-Int1 gene sequences of Candida albicans. A standard 900 MHz radiation generator was used for radiation. 10 ml volumes from a stock suspension of C. albicans were transferred into 10 polystyrene tubes. Five tubes were exposed at 4 °C to a fixed magnitude of radiation with different time periods of 10, 70, 210, 350 and 490 h. The other 5 tubes were kept far enough from radiation. The samples underwent genomic DNA extraction. PCR amplification of alpha-Int1 gene sequence was done using one set of primers. PCR products were resolved using agarose gel electrophoresis and the nucleotide sequences were determined. All samples showed a clear electrophoretic band around 441 bp and further sequencing revealed the amplified DNA segments are related to alpha-Int1 gene of the yeast. No mutations in the gene were seen in radiation exposed samples. Long-term exposure of the yeast to mobile phone radiation under the above mentioned conditions had no mutagenic effect on alpha-Int1 gene sequence.

  9. Novel and canine genotypes of Giardia duodenalis in harbor seals ( Phoca vitulina richardsi).

    PubMed

    Gaydos, J K; Miller, W A; Johnson, C; Zornetzer, H; Melli, A; Packham, A; Jeffries, S J; Lance, M M; Conrad, P A

    2008-12-01

    Feces of harbor seals (Phoca vitulina richardsi) and hybrid glaucous-winged/western gulls (Larus glaucescens / occidentalis) from Washington State's inland marine waters were examined for Giardia and Cryptosporidium spp. to determine if genotypes carried by these wildlife species were the same genotypes that commonly infect humans and domestic animals. Using immunomagnetic separation followed by direct fluorescent antibody detection, Giardia spp. cysts were detected in 42% of seal fecal samples (41/97). Giardia-positive samples came from 90% of the sites (9/10) and the prevalence of positive seal fecal samples differed significantly among study sites. Fecal samples collected from seal haulout sites with over 400 animals were 4.7 times more likely to have Giardia spp. cysts than samples collected at smaller haulout sites. In gulls, a single Giardia sp. cyst was detected in 4% of fecal samples (3/78). Cryptosporidium spp. oocysts were not detected in any of the seals or gulls tested. Sequence analysis of a 398 bp segment of G. duodenalis DNA at the glutamate dehydrogenase locus suggested that 11 isolates originating from seals throughout the region were a novel genotype and 3 isolates obtained from a single site in south Puget Sound were the G. duodenalis canine genotype D. Real-time TaqMan PCR amplification and subsequent sequencing of a 52 bp small subunit ribosomal DNA region from novel harbor seal genotype isolates showed sequence homology to canine genotypes C and D. Sequence analysis of the 52 bp small subunit ribosomal DNA products from the 3 canine genotype isolates from seals produced mixed sequences at could not be evaluated.

  10. Genomic characterisation of the feline sarcoid-associated papillomavirus and proposed classification as Bos taurus papillomavirus type 14.

    PubMed

    Munday, John S; Thomson, Neroli; Dunowska, Magda; Knight, Cameron G; Laurie, Rebecca E; Hills, Simon

    2015-06-12

    Feline sarcoids are rare mesenchymal neoplasms of domestic and exotic cats. Previous studies have consistently detected short DNA sequences from a papillomavirus (PV), designated feline sarcoid-associated papillomavirus (FeSarPV), in these neoplasms. The FeSarPV sequence has never been detected in any non-sarcoid sample from cats but has been amplified from the skin of cattle suggesting that feline sarcoids are caused by cross-species infection by a bovine papillomavirus (BPV). The aim of the present study was to determine the genome of the PV that contains the FeSarPV sequence. Using the circular nature of PV DNA, four specifically designed 'outward facing' primers were used to amplify two approximately 4,000 bp DNA segments from a feline sarcoid. The two PCR products were sequenced using next generation sequencing and the full genome of the PV, consisting 7,966 bp, was assembled and analysed. Phylogenetic analysis revealed the PV was closely related to the species 4 delta BPVs-1, -2, and -13, but distantly related to any carnivoran PV genus. These results are consistent with feline sarcoids being caused by a BPV type and we propose a classification of BPV-14 for this novel PV. Initial analysis suggests that, like other delta BPVs, the BPV-14 E5 protein could cause mesenchymal proliferation by binding to the platelet derived growth factor beta receptor. Interestingly BPV-14 has not been detected in any equine sarcoid suggesting that BPV-14 has a host range that is limited to bovids and felids. Copyright © 2015 Elsevier B.V. All rights reserved.

  11. Complex alternative splicing of acetylcholinesterase transcripts in Torpedo electric organ; primary structure of the precursor of the glycolipid-anchored dimeric form.

    PubMed Central

    Sikorav, J L; Duval, N; Anselmet, A; Bon, S; Krejci, E; Legay, C; Osterlund, M; Reimund, B; Massoulié, J

    1988-01-01

    In this paper, we show the existence of alternative splicing in the 3' region of the coding sequence of Torpedo acetylcholinesterase (AChE). We describe two cDNA structures which both diverge from the previously described coding sequence of the catalytic subunit of asymmetric (A) forms (Schumacher et al., 1986; Sikorav et al., 1987). They both contain a coding sequence followed by a non-coding sequence and a poly(A) stretch. Both of these structures were shown to exist in poly(A)+ RNAs, by S1 mapping experiments. The divergent region encoded by the first sequence corresponds to the precursor of the globular dimeric form (G2a), since it contains the expected C-terminal amino acids, Ala-Cys. These amino acids are followed by a 29 amino acid extension which contains a hydrophobic segment and must be replaced by a glycolipid in the mature protein. Analyses of intact G2a AChE showed that the common domain of the protein contains intersubunit disulphide bonds. The divergent region of the second type of cDNA consists of an adjacent genomic sequence, which is removed as an intron in A and Ga mRNAs, but may encode a distinct, less abundant catalytic subunit. The structures of the cDNA clones indicate that they are derived from minor mRNAs, shorter than the three major transcripts which have been described previously (14.5, 10.5 and 5.5 kb). Oligonucleotide probes specific for the asymmetric and globular terminal regions hybridize with the three major transcripts, indicating that their size is determined by 3'-untranslated regions which are not related to the differential splicing leading to A and Ga forms. Images PMID:3181125

  12. Effect of DNA sequence of Fab fragment on yield characteristics and cell growth of E. coli.

    PubMed

    Kulmala, Antti; Huovinen, Tuomas; Lamminmäki, Urpo

    2017-06-19

    Codon usage is one of the factors influencing recombinant protein expression. We were interested in the codon usage of an antibody Fab fragment gene exhibiting extreme toxicity in the E. coli host. The toxic synthetic human Fab gene contained domains optimized by the "one amino acid-one codon" method. We redesigned five segments of the Fab gene with a "codon harmonization" method described by Angov et al. and studied the effects of these changes on cell viability, Fab yield and display on filamentous phage using different vectors and bacterial strains. The harmonization considerably reduced toxicity, increased Fab expression from negligible levels to 10 mg/l, and restored the display on phage. Testing the impact of the individual redesigned segments revealed that the most significant effects were conferred by changes in the constant domain of the light chain. For some of the Fab gene variants, we also observed striking differences in protein yields when cloned from a chloramphenicol resistant vector into an identical vector, except with ampicillin resistance. In conclusion, our results show that the expression of a heterodimeric secretory protein can be improved by harmonizing selected DNA segments by synonymous codons and reveal additional complexity involved in heterologous protein expression.

  13. Ag nanoclusters could efficiently quench the photoresponse of CdS quantum dots for novel energy transfer-based photoelectrochemical bioanalysis.

    PubMed

    Zhang, Ling; Sun, Yue; Liang, Yan-Yu; He, Jian-Ping; Zhao, Wei-Wei; Xu, Jing-Juan; Chen, Hong-Yuan

    2016-11-15

    Herein the influence of ultrasmall Ag nanoclusters (Ag NCs) against CdS quantum dots (QDs) in a photoelectrochemical (PEC) nanosystem was exploited for the first time, based on which a novel PEC bioanalysis was successfully developed via the efficient quenching effect of Ag NCs against the CdS QDs. In a model system, DNA assay was achieved by using molecular beacon (MB) probes anchored on a CdS QDs modified electrode, and the MB probes contain two segments that can hybridize with both target DNA sequence and the label of DNA encapsulated Ag NCs. After the MB probe was unfolded by the target DNA sequence, the labels of oligonucleotide encapsulated Ag NCs would be brought in close proximity to the CdS QDs electrode surface, and efficient photocurrent quenching of QDs could be resulted from an energy transfer process that originated from NCs. Thus, by monitoring the attenuation in the photocurrent signal, an elegant and sensitive PEC DNA bioanalysis could be accomplished. The developed biosensor displayed a linear range from 1.0pM to 10nM and the detection limit was experimentally found to be of 0.3pM. This work presents a feasible signaling principle that could act as a common basis for general PEC bioanalysis development. Copyright © 2016 Elsevier B.V. All rights reserved.

  14. Amino acid sequence of the human fibronectin receptor

    PubMed Central

    1987-01-01

    The amino acid sequence deduced from cDNA of the human placental fibronectin receptor is reported. The receptor is composed of two subunits: an alpha subunit of 1,008 amino acids which is processed into two polypeptides disulfide bonded to one another, and a beta subunit of 778 amino acids. Each subunit has near its COOH terminus a hydrophobic segment. This and other sequence features suggest a structure for the receptor in which the hydrophobic segments serve as transmembrane domains anchoring each subunit to the membrane and dividing each into a large ectodomain and a short cytoplasmic domain. The alpha subunit ectodomain has five sequence elements homologous to consensus Ca2+- binding sites of several calcium-binding proteins, and the beta subunit contains a fourfold repeat strikingly rich in cysteine. The alpha subunit sequence is 46% homologous to the alpha subunit of the vitronectin receptor. The beta subunit is 44% homologous to the human platelet adhesion receptor subunit IIIa and 47% homologous to a leukocyte adhesion receptor beta subunit. The high degree of homology (85%) of the beta subunit with one of the polypeptides of a chicken adhesion receptor complex referred to as integrin complex strongly suggests that the latter polypeptide is the chicken homologue of the fibronectin receptor beta subunit. These receptor subunit homologies define a superfamily of adhesion receptors. The availability of the entire protein sequence for the fibronectin receptor will facilitate studies on the functions of these receptors. PMID:2958481

  15. Regulation of nucleic acid and protein synthesis: a background study related to the biological effects of radiation. Final report on research activities

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Not Available

    The existence of an intricate interplay of nucleic acids and nucleotides in the chain of events leading from free amino acid to completed polypeptide chain has been determined. To this was added another participant to the nucleotides in protein synthesis - diadenosine-5', 5''', p'p/sup 4/-tetraphosphate (Ap4A). Ap/sub 4/A serves as an initiation primer for DNA synthesis in a eukaryotic system catalyzed by DNA polymerase ..cap alpha... Thus the initial step in protein synthesis is linked to the first step in DNA synthesis by a small molecular weight, unique dinucleotide signal. Advances in the methodology of nucleic acid sequencing have mademore » it possible to examine the relationship between specific short segments of DNA and RNA and their function in the metabolism of the living cell. The triester method of synthesizing deoxynucleotide polymers has made it feasible to synthesize and use specific oligomeric deoxynucleotide sequences as probes of genetic function and potential viral inhibitors. The synthesis of ribonucleotide polymers has been more difficult, due almost entirely to the presence of the 2' ribosyl hydroxyl group. The possibility is now emerging, however, of employing ribonucleotide polymers as specific RNA-virus inhibitors.« less

  16. Colorimetric detection of genetically modified organisms based on exonuclease III-assisted target recycling and hemin/G-quadruplex DNAzyme amplification.

    PubMed

    Zhang, Decai; Wang, Weijia; Dong, Qian; Huang, Yunxiu; Wen, Dongmei; Mu, Yuejing; Yuan, Yong

    2017-12-21

    An isothermal colorimetric method is described for amplified detection of the CaMV 35S promoter sequence in genetically modified organism (GMO). It is based on (a) target DNA-triggered unlabeled molecular beacon (UMB) termini binding, and (b) exonuclease III (Exo III)-assisted target recycling, and (c) hemin/G-quadruplex (DNAzyme) based signal amplification. The specific binding of target to the G-quadruplex sequence-locked UMB triggers the digestion of Exo III. This, in turn, releases an active G-quadruplex segment and target DNA for successive hybridization and cleavage. The Exo III impellent recycling of targets produces numerous G-quadruplex sequences. These further associate with hemin to form DNAzymes and hence will catalyze H 2 O 2 -mediated oxidation of the chromogenic enzyme substrate ABTS 2- causing the formation of a green colored product. This finding enables a sensitive colorimetric determination of GMO DNA (at an analytical wavelength of 420 nm) at concentrations as low as 0.23 nM. By taking advantage of isothermal incubation, this method does not require sophisticated equipment or complicated syntheses. Analyses can be performed within 90 min. The method also discriminates single base mismatches. In our perception, it has a wide scope in that it may be applied to the detection of many other GMOs. Graphical abstract An isothermal and sensitive colorimetric method is described for amplified detection of CaMV 35S promoter sequence in genetically modified organism (GMO). It is based on target DNA-triggered molecular beacon (UMB) termini-binding and exonuclease III assisted target recycling, and on hemin/G-quadruplex (DNAzyme) signal amplification.

  17. T cell receptor repertoires of mice and humans are clustered in similarity networks around conserved public CDR3 sequences

    PubMed Central

    Madi, Asaf; Poran, Asaf; Shifrut, Eric; Reich-Zeliger, Shlomit; Greenstein, Erez; Zaretsky, Irena; Arnon, Tomer; Laethem, Francois Van; Singer, Alfred; Lu, Jinghua; Sun, Peter D; Cohen, Irun R; Friedman, Nir

    2017-01-01

    Diversity of T cell receptor (TCR) repertoires, generated by somatic DNA rearrangements, is central to immune system function. However, the level of sequence similarity of TCR repertoires within and between species has not been characterized. Using network analysis of high-throughput TCR sequencing data, we found that abundant CDR3-TCRβ sequences were clustered within networks generated by sequence similarity. We discovered a substantial number of public CDR3-TCRβ segments that were identical in mice and humans. These conserved public sequences were central within TCR sequence-similarity networks. Annotated TCR sequences, previously associated with self-specificities such as autoimmunity and cancer, were linked to network clusters. Mechanistically, CDR3 networks were promoted by MHC-mediated selection, and were reduced following immunization, immune checkpoint blockade or aging. Our findings provide a new view of T cell repertoire organization and physiology, and suggest that the immune system distributes its TCR sequences unevenly, attending to specific foci of reactivity. DOI: http://dx.doi.org/10.7554/eLife.22057.001 PMID:28731407

  18. Analysis of 16S libraries of mouse gastrointestinal microflora reveals a large new group of mouse intestinal bacteria.

    PubMed

    Salzman, Nita H; de Jong, Hendrik; Paterson, Yvonne; Harmsen, Hermie J M; Welling, Gjalt W; Bos, Nicolaas A

    2002-11-01

    Total genomic DNA from samples of intact mouse small intestine, large intestine, caecum and faeces was used as template for PCR amplification of 16S rRNA gene sequences with conserved bacterial primers. Phylogenetic analysis of the amplification products revealed 40 unique 16S rDNA sequences. Of these sequences, 25% (10/40) corresponded to described intestinal organisms of the mouse, including Lactobacillus spp., Helicobacter spp., segmented filamentous bacteria and members of the altered Schaedler flora (ASF360, ASF361, ASF502 and ASF519); 75% (30/40) represented novel sequences. A large number (11/40) of the novel sequences revealed a new operational taxonomic unit (OTU) belonging to the Cytophaga-Flavobacter-Bacteroides phylum, which the authors named 'mouse intestinal bacteria'. 16S rRNA probes were developed for this new OTU. Upon analysis of the novel sequences, eight were found to cluster within the Eubacterium rectale-Clostridium coccoides group and three clustered within the Bacteroides group. One of the novel sequences was distantly related to Verrucomicrobium spinosum and one was distantly related to Bacillus mycoides. Oligonucleotide probes specific for the 16S rRNA of these novel clones were generated. Using a combination of four previously described and four newly designed probes, approximately 80% of bacteria recovered from the murine large intestine and 71% of bacteria recovered from the murine caecum could be identified by fluorescence in situ hybridization (FISH).

  19. Discovery of human inversion polymorphisms by comparative analysis of human and chimpanzee DNA sequence assemblies.

    PubMed

    Feuk, Lars; MacDonald, Jeffrey R; Tang, Terence; Carson, Andrew R; Li, Martin; Rao, Girish; Khaja, Razi; Scherer, Stephen W

    2005-10-01

    With a draft genome-sequence assembly for the chimpanzee available, it is now possible to perform genome-wide analyses to identify, at a submicroscopic level, structural rearrangements that have occurred between chimpanzees and humans. The goal of this study was to investigate chromosomal regions that are inverted between the chimpanzee and human genomes. Using the net alignments for the builds of the human and chimpanzee genome assemblies, we identified a total of 1,576 putative regions of inverted orientation, covering more than 154 mega-bases of DNA. The DNA segments are distributed throughout the genome and range from 23 base pairs to 62 mega-bases in length. For the 66 inversions more than 25 kilobases (kb) in length, 75% were flanked on one or both sides by (often unrelated) segmental duplications. Using PCR and fluorescence in situ hybridization we experimentally validated 23 of 27 (85%) semi-randomly chosen regions; the largest novel inversion confirmed was 4.3 mega-bases at human Chromosome 7p14. Gorilla was used as an out-group to assign ancestral status to the variants. All experimentally validated inversion regions were then assayed against a panel of human samples and three of the 23 (13%) regions were found to be polymorphic in the human genome. These polymorphic inversions include 730 kb (at 7p22), 13 kb (at 7q11), and 1 kb (at 16q24) fragments with a 5%, 30%, and 48% minor allele frequency, respectively. Our results suggest that inversions are an important source of variation in primate genome evolution. The finding of at least three novel inversion polymorphisms in humans indicates this type of structural variation may be a more common feature of our genome than previously realized.

  20. Comparative analysis of ribosomal protein L5 sequences from bacteria of the genus Thermus.

    PubMed

    Jahn, O; Hartmann, R K; Boeckh, T; Erdmann, V A

    1991-06-01

    The genes for the ribosomal 5S rRNA binding protein L5 have been cloned from three extremely thermophilic eubacteria, Thermus flavus, Thermus thermophilus HB8 and Thermus aquaticus (Jahn et al, submitted). Genes for protein L5 from the three Thermus strains display 95% G/C in third positions of codons. Amino acid sequences deduced from the DNA sequence were shown to be identical for T flavus and T thermophilus, although the corresponding DNA sequences differed by two T to C transitions in the T thermophilus gene. Protein L5 sequences from T flavus and T thermophilus are 95% homologous to L5 from T aquaticus and 56.5% homologous to the corresponding E coli sequence. The lowest degrees of homology were found between the T flavus/T thermophilus L5 proteins and those of yeast L16 (27.5%), Halobacterium marismortui (34.0%) and Methanococcus vannielii (36.6%). From sequence comparison it becomes clear that thermostability of Thermus L5 proteins is achieved by an increase in hydrophobic interactions and/or by restriction of steric flexibility due to the introduction of amino acids with branched aliphatic side chains such as leucine. Alignment of the nine protein sequences equivalent to Thermus L5 proteins led to identification of a conserved internal segment, rich in acidic amino acids, which shows homology to subsequences of E coli L18 and L25. The occurrence of conserved sequence elements in 5S rRNA binding proteins and ribosomal proteins in general is discussed in terms of evolution and function.

  1. Public antibodies to malaria antigens generated by two LAIR1 insertion modalities.

    PubMed

    Pieper, Kathrin; Tan, Joshua; Piccoli, Luca; Foglierini, Mathilde; Barbieri, Sonia; Chen, Yiwei; Silacci-Fregni, Chiara; Wolf, Tobias; Jarrossay, David; Anderle, Marica; Abdi, Abdirahman; Ndungu, Francis M; Doumbo, Ogobara K; Traore, Boubacar; Tran, Tuan M; Jongo, Said; Zenklusen, Isabelle; Crompton, Peter D; Daubenberger, Claudia; Bull, Peter C; Sallusto, Federica; Lanzavecchia, Antonio

    2017-08-31

    In two previously described donors, the extracellular domain of LAIR1, a collagen-binding inhibitory receptor encoded on chromosome 19 (ref. 1), was inserted between the V and DJ segments of an antibody. This insertion generated, through somatic mutations, broadly reactive antibodies against RIFINs, a type of variant antigen expressed on the surface of Plasmodium falciparum-infected erythrocytes. To investigate how frequently such antibodies are produced in response to malaria infection, we screened plasma from two large cohorts of individuals living in malaria-endemic regions. Here we report that 5-10% of malaria-exposed individuals, but none of the European blood donors tested, have high levels of LAIR1-containing antibodies that dominate the response to infected erythrocytes without conferring enhanced protection against febrile malaria. By analysing the antibody-producing B cell clones at the protein, cDNA and gDNA levels, we characterized additional LAIR1 insertions between the V and DJ segments and discovered a second insertion modality whereby the LAIR1 exon encoding the extracellular domain and flanking intronic sequences are inserted into the switch region. By exon shuffling, this mechanism leads to the production of bispecific antibodies in which the LAIR1 domain is precisely positioned at the elbow between the VH and CH1 domains. Additionally, in one donor the genomic DNA encoding the VH and CH1 domains was deleted, leading to the production of a camel-like LAIR1-containing antibody. Sequencing of the switch regions of memory B cells from European blood donors revealed frequent templated inserts originating from transcribed genes that, in rare cases, comprised exons with orientations and frames compatible with expression. These results reveal different modalities of LAIR1 insertion that lead to public and dominant antibodies against infected erythrocytes and suggest that insertion of templated DNA represents an additional mechanism of antibody diversification that can be selected in the immune response against pathogens and exploited for B cell engineering.

  2. Gelsolin in Onychophora and Tardigrada with notes on its variability in the Ecdysozoa.

    PubMed

    Thiruketheeswaran, Prasath; Greven, Hartmut; D'Haese, Jochen

    2017-01-01

    Rearrangements of the filamentous actin network involve a broad range of actin binding proteins. Among these, the gelsolin proteins sever actin filaments, cap their fast growing end and nucleate actin assembly in a calcium-dependent manner. Here, we focus on the gelsolin of the onychophoran Peripatoides novaezealandiae and the eutardigrade Hypsibius dujardini. From the cDNA of P. novaezealandiae we obtained the complete coding sequence with an open reading frame of 2178bp. It encodes a protein of 726 amino acids with a calculated molecular mass of 82,610.9Da and a pI of 5.57. This sequence is comprised of six segments (S1-S6). However, analysis of data from TardiBase reveals that the gelsolin of the eutardigrade Hypsibius dujardini has only three segments (S1-S3). The coding sequence consist of 1119bp for 373 amino acids with a calculated molecular mass of 42,440.95Da and a pI of 6.17. The Peripatoides and Hypsibius gelsolin revealed both conserved binding motifs for G-actin, F-actin and phosphatidylinositol 4,5-bisphosphate (PIP 2 ), along with a full set of type-1 and type-2 Ca 2+ -binding sites which could result in the binding of eight and four calcium ions, respectively. Both gelsolin proteins lack a C-terminal latch-helix indicating a more rapid activation in the submicromolar Ca 2+ range. We suggest that a gelsolin with three segments was present in the last common ancestor of the ecdysozoan clade Panarthropoda (Onychophora, Tardigrada, Arthropoda), primarily because the gelsolin of all non-Ecdysozoa studied so far (except Chordata) reveals this number of segments. Mapping of our molecular data onto a well-established phylogeny revealed that the number of gelsolin segments does not correlate with the phylogenetic lineage but rather with particular functional demands to alter the kinetics of actin polymerization. Copyright © 2016 Elsevier Inc. All rights reserved.

  3. Reverse genetics: Its origins and prospects

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Berg, P.

    1991-04-01

    The nucleotide sequence of a gene and its flanking segments alone will not tell us how its expression is regulated during development and differentiation, or in response to environmental changes. To comprehend the physiological significance of the molecular details requires biological analysis. Recombinant DNA techniques provide a powerful experimental approach. A strategy termed reverse genetics' utilizes the analysis of the activities of mutant and normal genes and experimentally constructed mutants to explore the relationship between gene structure and function thereby helping elucidate the relationship between genotype and phenotype.

  4. Identification of a precursor genomic segment that provided a sequence unique to glycophorin B and E genes

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Onda, M.; Kudo, S.; Fukuda, M.

    Human glycophorin A, B, and E (GPA, GPB, and GPE) genes belong to a gene family located at the long arm of chromosome 4. These three genes are homologous from the 5'-flanking sequence to the Alu sequence, which is 1 kb downstream from the exon encoding the transmembrane domain. Analysis of the Alu sequence and flanking direct repeat sequences suggested that the GPA gene most closely resembles the ancestral gene, whereas the GPB and GPE gene arose by homologous recombination within the Alu sequence, acquiring 3' sequences from an unrelated precursor genomic segment. Here the authors describe the identification ofmore » this putative precursor genomic segment. A human genomic library was screened by using the sequence of the 3' region of the GPB gene as a probe. The genomic clones isolated were found to contain an Alu sequence that appeared to be involved in the recombination. Downstream from the Alu sequence, the nucleotide sequence of the precursor genomic segment is almost identical to that of the GPB or GPE gene. In contrast, the upstream sequence of the genomic segment differs entirely from that of the GPA, GPB, and GPE genes. Conservation of the direct repeats flanking the Alu sequence of the genomic segment strongly suggests that the sequence of this genomic segment has been maintained during evolution. This identified genomic segment was found to reside downstream from the GPA gene by both gene mapping and in situ chromosomal localization. The precursor genomic segment was also identified in the orangutan genome, which is known to lack GPB and GPE genes. These results indicate that one of the duplicated ancestral glycophorin genes acquired a unique 3' sequence by unequal crossing-over through its Alu sequence and the further downstream Alu sequence present in the duplicated gene. Further duplication and divergence of this gene yielded the GPB and GPE genes. 37 refs., 5 figs.« less

  5. Highly sensitive detection of ESR1 mutations in cell-free DNA from patients with metastatic breast cancer using molecular barcode sequencing.

    PubMed

    Masunaga, Nanae; Kagara, Naofumi; Motooka, Daisuke; Nakamura, Shota; Miyake, Tomohiro; Tanei, Tomonori; Naoi, Yasuto; Shimoda, Masafumi; Shimazu, Kenzo; Kim, Seung Jin; Noguchi, Shinzaburo

    2018-01-01

    We aimed to develop a highly sensitive method to detect ESR1 mutations in cell-free DNA (cfDNA) using next-generation sequencing with molecular barcode (MB-NGS) targeting the hotspot segment (c.1600-1713). The sensitivity of MB-NGS was tested using serially diluted ESR1 mutant DNA and then cfDNA samples from 34 patients with metastatic breast cancer were analyzed with MB-NGS. The results of MB-NGS were validated in comparison with conventional NGS and droplet digital PCR (ddPCR). MB-NGS showed a higher sensitivity (0.1%) than NGS without barcode (1%) by reducing background errors. Of the cfDNA samples from 34 patients with metastatic breast cancer, NGS without barcode revealed seven mutations in six patients (17.6%) and MB-NGS revealed six additional mutations including three mutations not reported in the COSMIC database of breast cancer, resulting in total 13 ESR1 mutations in ten patients (29.4%). Regarding the three hotspot mutations, all the patients with mutations detected by MB-NGS had identical mutations detected by droplet digital PCR (ddPCR), and mutant allele frequency correlated very well between both (r = 0.850, p < 0.01). Moreover, all the patients without these mutations by MB-NGS were found to have no mutations by ddPCR. In conclusion, MB-NGS could successfully detect ESR1 mutations in cfDNA with a higher sensitivity of 0.1% than conventional NGS and was considered as clinically useful as ddPCR.

  6. [Association of muscle segment homeobox gene 1 polymorphisms with nonsyndromic cleft lip with or without cleft palate].

    PubMed

    Zhang, Li; Tang, Jun-Ling; Liang, Shang-Zheng

    2008-06-01

    Muscle segment homeobox gene (MSX)1 has been proposed as a gene in which mutations may contribute to nonsyndromic cleft lip with or without cleft palate (NSCL/P). To study MSX1 polymorphisms in NSCL/ P by means of polymerase chain reaction-single-strand conformation polymorphism (PCR-SSCP), and investigate the association of MSX1 exons 1 polymorphisms with NSCL/P. DNA were extracted from blood samples from NSCL/P and unrelated normal subjects. Genome DNA from peripheral leukocyte with these blood samples were extracted, which was used as template to amplify desired gene fragment of MSX1 exons 1 by means of polymerase chain reaction (PCR). The PCR products were examined by single-strand conformation polymorphism (SSCP). The MSX1 exons 1 polymorphisms were examined by sequencing if mutations were found. MSX1 genes of exon 1 mutation was not been found in the NSCL/P and unrelated normal subjects by SSCP. No correlation between MSX1 exon 1 and NSCL/P was found. MSX1 exon 1 may not be a key gene (susceptibility gene) in NSCL/P.

  7. Errant processing and structural alterations of genomes present in a varicella-zoster virus vaccine.

    PubMed Central

    Vlazny, D A; Hyman, R W

    1985-01-01

    Five minority populations of aberrant, varicella-zoster virus (VZV)-derived genomes were identified among the encapsidated DNAs obtained from the nuclear and cytoplasmic fractions of an in vitro infection initiated with a lyophilized sample of the BIKEN VZV vaccine (strain Oka). These were (i) VZV genomes, present within nuclear but not cytoplasmic viral capsids, which had been cleaved at a specific site within the short segment and which were, therefore, 3.15 megadaltons (approximately 4% of the VZV genome length) short of full length; (ii) highly deleted, repetitive VZV genomes which contained the errant cleavage site but not the usual VZV genome terminal sequences; (iii) VZV genomes into which multiples of 1 through 5 defective genome repeat units had been inserted into a homologous site; (iv) VZV genomes with additions of 0.1 or 0.18 megadaltons of DNA at both the terminal and internal ends of the short segment; and (v) VZV DNA which had lost the HindIII restriction site at map position 0.11. Images PMID:2993670

  8. First detection of multiple knockdown resistance (kdr)-like mutations in voltage-gated sodium channel using three new genotyping methods in Anopheles sinensis from Guangxi Province, China.

    PubMed

    Tan, Wei L; Li, Chun X; Wang, Zhong M; Liu, Mei D; Dong, Yan D; Feng, Xiang Y; Wu, Zhi M; Guo, Xiao X; Xing, Dan; Zhang, Ying M; Wang, Zhong C; Zhao, Tong Y

    2012-09-01

    To investigate knockdown resistance (kdr)-like mutations associated with pyrethroid resistance in Anopheles sinensis (Wiedemann, 1828), from Guangxi province, southwest China, a segment of a sodium channel gene was sequenced and genotyped using three new genotyping assays. Direct sequencing revealed the presence of TTG-to-TCG and TG-to-TTT mutations at allele position L1014, which led to L1014S and L1014F substitutions in a few individual and two novel substitutions of N1013S and L1014W in two DNA templates. A low frequency of the kdr allele mostly in the heterozygous state of L1014S and L1014F was observed in this mosquito population. In this study, the genotyping of An. sinensis using three polymerase chain reaction-based methods generated consistent results, which agreed with the results of DNA sequencing. In total, 52 mosquitoes were genotyped using a direct sequencing assay. The number of mosquitoes and their genotypes were as follows: L/L = 24, L/S = 19, L/F = 8, and F/W = 1. The allelic frequency of L1014, 1014S, and 1014F were 72, 18, and 9%, respectively.

  9. Genome-Wide Stochastic Adaptive DNA Amplification at Direct and Inverted DNA Repeats in the Parasite Leishmania

    PubMed Central

    Plourde, Marie; Gingras, Hélène; Roy, Gaétan; Lapointe, Andréanne; Leprohon, Philippe; Papadopoulou, Barbara; Corbeil, Jacques; Ouellette, Marc

    2014-01-01

    Gene amplification of specific loci has been described in all kingdoms of life. In the protozoan parasite Leishmania, the product of amplification is usually part of extrachromosomal circular or linear amplicons that are formed at the level of direct or inverted repeated sequences. A bioinformatics screen revealed that repeated sequences are widely distributed in the Leishmania genome and the repeats are chromosome-specific, conserved among species, and generally present in low copy number. Using sensitive PCR assays, we provide evidence that the Leishmania genome is continuously being rearranged at the level of these repeated sequences, which serve as a functional platform for constitutive and stochastic amplification (and deletion) of genomic segments in the population. This process is adaptive as the copy number of advantageous extrachromosomal circular or linear elements increases upon selective pressure and is reversible when selection is removed. We also provide mechanistic insights on the formation of circular and linear amplicons through RAD51 recombinase-dependent and -independent mechanisms, respectively. The whole genome of Leishmania is thus stochastically rearranged at the level of repeated sequences, and the selection of parasite subpopulations with changes in the copy number of specific loci is used as a strategy to respond to a changing environment. PMID:24844805

  10. Chromosome Rearrangements That Involve the Nucleolus Organizer Region in Neurospora

    PubMed Central

    Perkins, D. D.; Raju, N. B.; Barry, E. G.; Butler, D. K.

    1995-01-01

    In ~3% of Neurospora crassa rearrangements, part of a chromosome arm becomes attached to the nucleolus organizer region (NOR) at one end of chromosome 2 (linkage group V). Investigations with one inversion and nine translocations of this type are reported here. They appear genetically to be nonreciprocal and terminal. When a rearrangement is heterozygous, about one-third of viable progeny are segmental aneuploids with the translocated segment present in two copies, one in normal position and one associated with the NOR. Duplications from many of the rearrangements are highly unstable, breaking down by loss of the NOR-attached segment to restore normal chromosome sequence. When most of the rearrangements are homozygous, attenuated strands can be seen extending through the unstained nucleolus at pachytene, joining the translocated distal segment to the remainder of chromosome 2. Although the rearrangements appear genetically to be nonreciprocal, molecular evidence shows that at least several of them are physically reciprocal, with a block of rDNA repeats translocated away from the NOR. Evidence that NOR-associated breakpoints are nonterminal is also provided by intercrosses between pairs of translocations that transfer different-length segments of the same donor-chromosome arm to the NOR. PMID:8582636

  11. Is the simian virus SV40 associated with idiopathic focal segmental glomerulosclerosis in humans?

    PubMed

    Galdenzi, Gabriella; Lupo, Antonio; Anglani, Franca; Perini, Marino; Galeazzi, Luciano; Giunta, Sergio; Marcantoni, Carmelita; Del Prete, Dorella; Graziotto, Romina; D'angelo, Angela; Maschio, Giuseppe; Gambaro, Giovanni

    2003-01-01

    Glomerulosclerosis was reported in mice transgenic for the simian polyomavirus SV40 early region that contains the transforming sequences encoding the SV40 large T-antigen (TAG). This was discovered when an SV40 epidemic occurred following the use of contaminated polio vaccines during 1955-1963, and led to investigations that showed an association between SV40 infection and tumors in humans. We investigated the possible association of SV40 infection and idiopathic focal segmental glomerulosclerosis (FSGS). The study was performed in 17 Bouin-fixed, paraffin-embedded renal biopsies from FSGS patients and 10 matched biopsies from patients with IgA glomerulonephritis; all patients had undergone polio vaccination in the early 1960s. Extracted DNA was polymerase chain reaction (PCR) amplified using SV.for3/SV.rev primers and GabE1/GabE2 primers; both sets of primers map in the region of SV40 TAG sequences, and amplify a fragment of respectively 105-bp and 135-bp. The biopsies considered were those in which the DNA was sufficiently intact to allow amplification of a fragment of 102-bp of the ApoE gene. Three FSGS and none of the IgA biopsies were positive for the SV.for3/SV.rev fragment. Conversely, amplification with GabE1/GabE2 primers did not lead to any specific product in either the IgA or FSGS biopsies. Restriction fragment length polymorphism and sequencing analyses revealed that the positive results obtained with the SV.for3/SV.rev primers were due to amplicons generated by multiple dimerization of forward and reverse primers. With the limited number of patients investigated, this study excludes the hypothesis that SV40 is associated with idiopathic FSGS.

  12. Abnormal plasma DNA profiles in early ovarian cancer using a non-invasive prenatal testing platform: implications for cancer screening.

    PubMed

    Cohen, Paul A; Flowers, Nicola; Tong, Stephen; Hannan, Natalie; Pertile, Mark D; Hui, Lisa

    2016-08-24

    Non-invasive prenatal testing (NIPT) identifies fetal aneuploidy by sequencing cell-free DNA in the maternal plasma. Pre-symptomatic maternal malignancies have been incidentally detected during NIPT based on abnormal genomic profiles. This low coverage sequencing approach could have potential for ovarian cancer screening in the non-pregnant population. Our objective was to investigate whether plasma DNA sequencing with a clinical whole genome NIPT platform can detect early- and late-stage high-grade serous ovarian carcinomas (HGSOC). This is a case control study of prospectively-collected biobank samples comprising preoperative plasma from 32 women with HGSOC (16 'early cancer' (FIGO I-II) and 16 'advanced cancer' (FIGO III-IV)) and 32 benign controls. Plasma DNA from cases and controls were sequenced using a commercial NIPT platform and chromosome dosage measured. Sequencing data were blindly analyzed with two methods: (1) Subchromosomal changes were called using an open source algorithm WISECONDOR (WIthin-SamplE COpy Number aberration DetectOR). Genomic gains or losses ≥ 15 Mb were prespecified as "screen positive" calls, and mapped to recurrent copy number variations reported in an ovarian cancer genome atlas. (2) Selected whole chromosome gains or losses were reported using the routine NIPT pipeline for fetal aneuploidy. We detected 13/32 cancer cases using the subchromosomal analysis (sensitivity 40.6 %, 95 % CI, 23.7-59.4 %), including 6/16 early and 7/16 advanced HGSOC cases. Two of 32 benign controls had subchromosomal gains ≥ 15 Mb (specificity 93.8 %, 95 % CI, 79.2-99.2 %). Twelve of the 13 true positive cancer cases exhibited specific recurrent changes reported in HGSOC tumors. The NIPT pipeline resulted in one "monosomy 18" call from the cancer group, and two "monosomy X" calls in the controls. Low coverage plasma DNA sequencing used for prenatal testing detected 40.6 % of all HGSOC, including 38 % of early stage cases. Our findings demonstrate the potential of a high throughput sequencing platform to screen for early HGSOC in plasma based on characteristic multiple segmental chromosome gains and losses. The performance of this approach may be further improved by refining bioinformatics algorithms and targeting selected cancer copy number variations.

  13. A functional analysis of the spacer of V(D)J recombination signal sequences.

    PubMed

    Lee, Alfred Ian; Fugmann, Sebastian D; Cowell, Lindsay G; Ptaszek, Leon M; Kelsoe, Garnett; Schatz, David G

    2003-10-01

    During lymphocyte development, V(D)J recombination assembles antigen receptor genes from component V, D, and J gene segments. These gene segments are flanked by a recombination signal sequence (RSS), which serves as the binding site for the recombination machinery. The murine Jbeta2.6 gene segment is a recombinationally inactive pseudogene, but examination of its RSS reveals no obvious reason for its failure to recombine. Mutagenesis of the Jbeta2.6 RSS demonstrates that the sequences of the heptamer, nonamer, and spacer are all important. Strikingly, changes solely in the spacer sequence can result in dramatic differences in the level of recombination. The subsequent analysis of a library of more than 4,000 spacer variants revealed that spacer residues of particular functional importance are correlated with their degree of conservation. Biochemical assays indicate distinct cooperation between the spacer and heptamer/nonamer along each step of the reaction pathway. The results suggest that the spacer serves not only to ensure the appropriate distance between the heptamer and nonamer but also regulates RSS activity by providing additional RAG:RSS interaction surfaces. We conclude that while RSSs are defined by a "digital" requirement for absolutely conserved nucleotides, the quality of RSS function is determined in an "analog" manner by numerous complex interactions between the RAG proteins and the less-well conserved nucleotides in the heptamer, the nonamer, and, importantly, the spacer. Those modulatory effects are accurately predicted by a new computational algorithm for "RSS information content." The interplay between such binary and multiplicative modes of interactions provides a general model for analyzing protein-DNA interactions in various biological systems.

  14. Assessing information content and interactive relationships of subgenomic DNA sequences of the MHC using complexity theory approaches based on the non-extensive statistical mechanics

    NASA Astrophysics Data System (ADS)

    Karakatsanis, L. P.; Pavlos, G. P.; Iliopoulos, A. C.; Pavlos, E. G.; Clark, P. M.; Duke, J. L.; Monos, D. S.

    2018-09-01

    This study combines two independent domains of science, the high throughput DNA sequencing capabilities of Genomics and complexity theory from Physics, to assess the information encoded by the different genomic segments of exonic, intronic and intergenic regions of the Major Histocompatibility Complex (MHC) and identify possible interactive relationships. The dynamic and non-extensive statistical characteristics of two well characterized MHC sequences from the homozygous cell lines, PGF and COX, in addition to two other genomic regions of comparable size, used as controls, have been studied using the reconstructed phase space theorem and the non-extensive statistical theory of Tsallis. The results reveal similar non-linear dynamical behavior as far as complexity and self-organization features. In particular, the low-dimensional deterministic nonlinear chaotic and non-extensive statistical character of the DNA sequences was verified with strong multifractal characteristics and long-range correlations. The nonlinear indices repeatedly verified that MHC sequences, whether exonic, intronic or intergenic include varying levels of information and reveal an interaction of the genes with intergenic regions, whereby the lower the number of genes in a region, the less the complexity and information content of the intergenic region. Finally we showed the significance of the intergenic region in the production of the DNA dynamics. The findings reveal interesting content information in all three genomic elements and interactive relationships of the genes with the intergenic regions. The results most likely are relevant to the whole genome and not only to the MHC. These findings are consistent with the ENCODE project, which has now established that the non-coding regions of the genome remain to be of relevance, as they are functionally important and play a significant role in the regulation of expression of genes and coordination of the many biological processes of the cell.

  15. Structure of the dimeric exonuclease TREX1 in complex with DNA displays a proline-rich binding site for WW Domains.

    PubMed

    Brucet, Marina; Querol-Audí, Jordi; Serra, Maria; Ramirez-Espain, Ximena; Bertlik, Kamila; Ruiz, Lidia; Lloberas, Jorge; Macias, Maria J; Fita, Ignacio; Celada, Antonio

    2007-05-11

    TREX1 is the most abundant mammalian 3' --> 5' DNA exonuclease. It has been described to form part of the SET complex and is responsible for the Aicardi-Goutières syndrome in humans. Here we show that the exonuclease activity is correlated to the binding preferences toward certain DNA sequences. In particular, we have found three motifs that are selected, GAG, ACA, and CTGC. To elucidate how the discrimination occurs, we determined the crystal structures of two murine TREX1 complexes, with a nucleotide product of the exonuclease reaction, and with a single-stranded DNA substrate. Using confocal microscopy, we observed TREX1 both in nuclear and cytoplasmic subcellular compartments. Remarkably, the presence of TREX1 in the nucleus requires the loss of a C-terminal segment, which we named leucine-rich repeat 3. Furthermore, we detected the presence of a conserved proline-rich region on the surface of TREX1. This observation points to interactions with proline-binding domains. The potential interacting motif "PPPVPRPP" does not contain aromatic residues and thus resembles other sequences that select SH3 and/or Group 2 WW domains. By means of nuclear magnetic resonance titration experiments, we show that, indeed, a polyproline peptide derived from the murine TREX1 sequence interacted with the WW2 domain of the elongation transcription factor CA150. Co-immunoprecipitation studies confirmed this interaction with the full-length TREX1 protein, thereby suggesting that TREX1 participates in more functional complexes than previously thought.

  16. APOBEC3A efficiently deaminates methylated, but not TET-oxidized, cytosine bases in DNA.

    PubMed

    Schutsky, Emily K; Nabel, Christopher S; Davis, Amy K F; DeNizio, Jamie E; Kohli, Rahul M

    2017-07-27

    AID/APOBEC family enzymes are best known for deaminating cytosine bases to uracil in single-stranded DNA, with characteristic sequence preferences that can produce mutational signatures in targets such as retroviral and cancer cell genomes. These deaminases have also been proposed to function in DNA demethylation via deamination of either 5-methylcytosine (mC) or TET-oxidized mC bases (ox-mCs), which include 5-hydroxymethylcytosine, 5-formylcytosine and 5-carboxylcytosine. One specific family member, APOBEC3A (A3A), has been shown to readily deaminate mC, raising the prospect of broader activity on ox-mCs. To investigate this claim, we developed a novel assay that allows for parallel profiling of activity on all modified cytosines. Our steady-state kinetic analysis reveals that A3A discriminates against all ox-mCs by >3700-fold, arguing that ox-mC deamination does not contribute substantially to demethylation. A3A is, by contrast, highly proficient at C/mC deamination. Under conditions of excess enzyme, C/mC bases can be deaminated to completion in long DNA segments, regardless of sequence context. Interestingly, under limiting A3A, the sequence preferences observed with targeting unmodified cytosine are further exaggerated when deaminating mC. Our study informs how methylation, oxidation, and deamination can interplay in the genome and suggests A3A's potential utility as a biotechnological tool to discriminate between cytosine modification states. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

  17. Sampling and pyrosequencing methods for characterizing bacterial communities in the human gut using 16S sequence tags.

    PubMed

    Wu, Gary D; Lewis, James D; Hoffmann, Christian; Chen, Ying-Yu; Knight, Rob; Bittinger, Kyle; Hwang, Jennifer; Chen, Jun; Berkowsky, Ronald; Nessel, Lisa; Li, Hongzhe; Bushman, Frederic D

    2010-07-30

    Intense interest centers on the role of the human gut microbiome in health and disease, but optimal methods for analysis are still under development. Here we present a study of methods for surveying bacterial communities in human feces using 454/Roche pyrosequencing of 16S rRNA gene tags. We analyzed fecal samples from 10 individuals and compared methods for storage, DNA purification and sequence acquisition. To assess reproducibility, we compared samples one cm apart on a single stool specimen for each individual. To analyze storage methods, we compared 1) immediate freezing at -80 degrees C, 2) storage on ice for 24 or 3) 48 hours. For DNA purification methods, we tested three commercial kits and bead beating in hot phenol. Variations due to the different methodologies were compared to variation among individuals using two approaches--one based on presence-absence information for bacterial taxa (unweighted UniFrac) and the other taking into account their relative abundance (weighted UniFrac). In the unweighted analysis relatively little variation was associated with the different analytical procedures, and variation between individuals predominated. In the weighted analysis considerable variation was associated with the purification methods. Particularly notable was improved recovery of Firmicutes sequences using the hot phenol method. We also carried out surveys of the effects of different 454 sequencing methods (FLX versus Titanium) and amplification of different 16S rRNA variable gene segments. Based on our findings we present recommendations for protocols to collect, process and sequence bacterial 16S rDNA from fecal samples--some major points are 1) if feasible, bead-beating in hot phenol or use of the PSP kit improves recovery; 2) storage methods can be adjusted based on experimental convenience; 3) unweighted (presence-absence) comparisons are less affected by lysis method.

  18. miR-ID: A novel, circularization-based platform for detection of microRNAs

    PubMed Central

    Kumar, Pavan; Johnston, Brian H.; Kazakov, Sergei A.

    2011-01-01

    MicroRNAs (miRNAs) are important regulators of gene expression and have great potential as biomarkers, prognostic indicators, and therapeutic targets. Determining the expression patterns of these molecules is essential for elucidating their biogenesis, regulation, relation to disease, and response to therapy. Although PCR-based assays are commonly used for expression profiling of miRNAs, the small size, sequence heterogeneity, and (in some cases) end modifications of miRNAs constrain the performance of existing PCR methods. Here we introduce miR-ID, a novel method that avoids these constraints while providing superior sensitivity and sequence specificity at a lower cost. It also has the unique ability to differentiate unmodified small RNAs from those carrying 2′-OMe groups at their 3′-ends while detecting both forms. miR-ID is comprised of the following steps: (1) circularization of the miRNA by a ligase; (2) reverse transcription of the circularized miRNA (RTC), producing tandem repeats of a DNA sequence complementary to the miRNA; and (3) qPCR amplification of segments of this multimeric cDNA using 5′-overlapping primers and a nonspecific dye such as SYBR Green. No chemically modified probes (e.g., TaqMan) or primers (e.g., LNA) are required. The circular RNA and multimeric cDNA templates provide unmatched flexibility in the positioning of primers, which may include straddling the boundaries between these repetitive miRNA sequences. miR-ID is based on new findings that are themselves of general interest, including reverse transcription of small RNA circles and the use of 5′-overlapping primers for detection of repetitive sequences by qPCR. PMID:21169480

  19. Architecture of the 99 bp DNA-six-protein regulatory complex of the lambda att site.

    PubMed

    Sun, Xingmin; Mierke, Dale F; Biswas, Tapan; Lee, Sang Yeol; Landy, Arthur; Radman-Livaja, Marta

    2006-11-17

    The highly directional and tightly regulated recombination reaction used to site-specifically excise the bacteriophage lambda chromosome out of its E. coli host chromosome requires the binding of six sequence-specific proteins to a 99 bp segment of the phage att site. To gain structural insights into this recombination pathway, we measured 27 FRET distances between eight points on the 99 bp regulatory DNA bound with all six proteins. Triangulation of these distances using a metric matrix distance-geometry algorithm provided coordinates for these eight points. The resulting path for the protein-bound regulatory DNA, which fits well with the genetics, biochemistry, and X-ray crystal structures describing the individual proteins and their interactions with DNA, provides a new structural perspective into the molecular mechanism and regulation of the recombination reaction and illustrates a design by which different families of higher-order complexes can be assembled from different numbers and combinations of the same few proteins.

  20. Easi-CRISPR for creating knock-in and conditional knockout mouse models using long ssDNA donors.

    PubMed

    Miura, Hiromi; Quadros, Rolen M; Gurumurthy, Channabasavaiah B; Ohtsuka, Masato

    2018-01-01

    CRISPR/Cas9-based genome editing can easily generate knockout mouse models by disrupting the gene sequence, but its efficiency for creating models that require either insertion of exogenous DNA (knock-in) or replacement of genomic segments is very poor. The majority of mouse models used in research involve knock-in (reporters or recombinases) or gene replacement (e.g., conditional knockout alleles containing exons flanked by LoxP sites). A few methods for creating such models have been reported that use double-stranded DNA as donors, but their efficiency is typically 1-10% and therefore not suitable for routine use. We recently demonstrated that long single-stranded DNAs (ssDNAs) serve as very efficient donors, both for insertion and for gene replacement. We call this method efficient additions with ssDNA inserts-CRISPR (Easi-CRISPR) because it is a highly efficient technology (efficiency is typically 30-60% and reaches as high as 100% in some cases). The protocol takes ∼2 months to generate the founder mice.

  1. The Accuracy of Molecular Processes

    NASA Astrophysics Data System (ADS)

    Stavans, Joel

    Recombination is arguably one of the most fundamental mechanisms driving genetic diversity during evolution. Recombination takes place in one way or another from viruses such as HIV and polio, to bacteria, and finally to man. In both prokaryotes and eukaryotes, homologous recombination is assisted by enzymes, recombinases, that promote the exchange of strands between two segments of DNA, thereby creating new genetic combinations. In bacteria, homologous recombination takes place as a pathway for the repair of DNA lesions and also during horizontal or lateral gene transfer processes, in which cells take in exogenous pieces of DNA. This allows bacteria to evolve rapidly by acquiring large sequences of DNA, a process which would take too long by gene duplications and single mutations. I will survey recent results on the fidelity of homologous recombination as catalyzed by the bacterial recombinase RecA. These results show discrimination up to the level of single base mismatches, during the initial stages of the recombination process. A cascaded kinetic proofreading process is proposed to explain this high discrimination. Kinetic proofreading ideas are also reviewed.

  2. SOV_refine: A further refined definition of segment overlap score and its significance for protein structure similarity.

    PubMed

    Liu, Tong; Wang, Zheng

    2018-01-01

    The segment overlap score (SOV) has been used to evaluate the predicted protein secondary structures, a sequence composed of helix (H), strand (E), and coil (C), by comparing it with the native or reference secondary structures, another sequence of H, E, and C. SOV's advantage is that it can consider the size of continuous overlapping segments and assign extra allowance to longer continuous overlapping segments instead of only judging from the percentage of overlapping individual positions as Q3 score does. However, we have found a drawback from its previous definition, that is, it cannot ensure increasing allowance assignment when more residues in a segment are further predicted accurately. A new way of assigning allowance has been designed, which keeps all the advantages of the previous SOV score definitions and ensures that the amount of allowance assigned is incremental when more elements in a segment are predicted accurately. Furthermore, our improved SOV has achieved a higher correlation with the quality of protein models measured by GDT-TS score and TM-score, indicating its better abilities to evaluate tertiary structure quality at the secondary structure level. We analyzed the statistical significance of SOV scores and found the threshold values for distinguishing two protein structures (SOV_refine  > 0.19) and indicating whether two proteins are under the same CATH fold (SOV_refine > 0.94 and > 0.90 for three- and eight-state secondary structures respectively). We provided another two example applications, which are when used as a machine learning feature for protein model quality assessment and comparing different definitions of topologically associating domains. We proved that our newly defined SOV score resulted in better performance. The SOV score can be widely used in bioinformatics research and other fields that need to compare two sequences of letters in which continuous segments have important meanings. We also generalized the previous SOV definitions so that it can work for sequences composed of more than three states (e.g., it can work for the eight-state definition of protein secondary structures). A standalone software package has been implemented in Perl with source code released. The software can be downloaded from http://dna.cs.miami.edu/SOV/.

  3. Unusual intraindividual variation of the nuclear 18S rRNA gene is widespread within the Acipenseridae.

    PubMed

    Krieger, Jeannette; Hett, Anne Kathrin; Fuerst, Paul A; Birstein, Vadim J; Ludwig, Arne

    2006-01-01

    Significant intraindividual variation in the sequence of the 18S rRNA gene is unusual in animal genomes. In a previous study, multiple 18S rRNA gene sequences were observed within individuals of eight species of sturgeon from North America but not in the North American paddlefish, Polyodon spathula, in two species of Polypterus (Polypterus delhezi and Polypterus senegalus), in other primitive fishes (Erpetoichthys calabaricus, Lepisosteus osseus, Amia calva) or in a lungfish (Protopterus sp.). These observations led to the hypothesis that this unusual genetic characteristic arose within the Acipenseriformes after the presumed divergence of the sturgeon and paddlefish families. In the present study, a survey of nearly all Eurasian acipenseriform species was conducted to examine 18S rDNA variation. Intraindividual variation was not found in the polyodontid species, the Chinese paddlefish, Psephurus gladius, but variation was detected in all Eurasian acipenserid species. The comparison of sequences from two major segments of the 18S rRNA gene and identification of sites where insertion/deletion events have occurred are placed in the context of evolutionary relationships within the Acipenseriformes and the evolution of rDNA variation in this group.

  4. Structured oligonucleotides for target indexing to allow single-vessel PCR amplification and solid support microarray hybridization

    PubMed Central

    Girard, Laurie D.; Boissinot, Karel; Peytavi, Régis; Boissinot, Maurice; Bergeron, Michel G.

    2014-01-01

    The combination of molecular diagnostic technologies is increasingly used to overcome limitations on sensitivity, specificity or multiplexing capabilities, and provide efficient lab-on-chip devices. Two such techniques, PCR amplification and microarray hybridization are used serially to take advantage of the high sensitivity and specificity of the former combined with high multiplexing capacities of the latter. These methods are usually performed in different buffers and reaction chambers. However, these elaborate methods have a high complexity cost related to reagent requirements, liquid storage and the number of reaction chambers to integrate into automated devices. Furthermore, microarray hybridizations have a sequence dependent efficiency not always predictable. In this work, we have developed the concept of a structured oligonucleotide probe which is activated by cleavage from polymerase exonuclease activity. This technology is called SCISSOHR for Structured Cleavage Induced Single-Stranded Oligonucleotide Hybridization Reaction. The SCISSOHR probes enable indexing the target sequence to a tag sequence. The SCISSOHR technology also allows the combination of nucleic acid amplification and microarray hybridization in a single vessel in presence of the PCR buffer only. The SCISSOHR technology uses an amplification probe that is irreversibly modified in presence of the target, releasing a single-stranded DNA tag for microarray hybridization. Each tag is composed of a 3-nucleotidesequence-dependent segment and a unique “target sequence-independent” 14-nucleotide segment allowing for optimal hybridization with minimal cross-hybridization. We evaluated the performance of five (5) PCR buffers to support microarray hybridization, compared to a conventional hybridization buffer. Finally, as a proof of concept, we developed a multiplexed assay for the amplification, detection, and identification of three (3) DNA targets. This new technology will facilitate the design of lab-on-chip microfluidic devices, while also reducing consumable costs. At term, it will allow the cost-effective automation of highly multiplexed assays for detection and identification of genetic targets. PMID:25489607

  5. An efficient approach for cloning the dNDP-glucose synthase gene from actinomycetes and its application in Streptomyces spectabilis, a spectinomycin producer.

    PubMed

    Hyun, C; Kim, S S; Sohng, J K; Hahn, J; Kim, J; Suh, J

    2000-02-01

    Specifically designed PCR primers were applied to amplify a segment of dTDP-glucose synthase gene from six actinomycete strains. About 300-bp or 580-bp DNA fragments were obtained from all the organisms tested. By DNA sequence analysis, seven amplified fragments showed high homology with dTDP-glucose synthase genes that participate in the biosynthesis of secondary metabolites or in deoxy-sugar moieties in lipopolysaccharides. In addition, we have cloned a 45-kb region of DNA from Streptomyces spectabilis ATCC27741, a spectinomycin producer which contained the dTDP-glucose synthase and dTDP-glucose 4,6-dehydratase genes named spcD and spcE, respectively. The spcE gene was expressed in Escherichia coli and the activity was assayed in cell extracts. The enzyme showed substrate specificity only to dTDP-glucose.

  6. Cloning and Expression of cDNA for Rat Heme Oxygenase

    NASA Astrophysics Data System (ADS)

    Shibahara, Shigeki; Muller, Rita; Taguchi, Hayao; Yoshida, Tadashi

    1985-12-01

    Two cDNA clones for rat heme oxygenase have been isolated from a rat spleen cDNA library in λ gt11 by immunological screening using a specific polyclonal antibody. One of these clones has an insert of 1530 nucleotides that contains the entire protein-coding region. To confirm that the isolated cDNA encodes heme oxygenase, we transfected monkey kidney cells (COS-7) with the cDNA carried in a simian virus 40 vector. The heme oxygenase was highly expressed in endoplasmic reticulum of transfected cells. The nucleotide sequence of the cloned cDNA was determined and the primary structure of heme oxygenase was deduced. Heme oxygenase is composed of 289 amino acids and has one hydrophobic segment at its carboxyl terminus, which is probably important for the insertion of heme oxygenase into endoplasmic reticulum. The cloned cDNA was used to analyze the induction of heme oxygenase in rat liver by treatment with CoCl2 or with hemin. RNA blot analysis showed that both CoCl2 and hemin increased the amount of hybridizable mRNA, suggesting that these substances may act at the transcriptional level to increase the amount of heme oxygenase.

  7. Genomic organization and sequence of the Gus-s/sup a/ allele of the murine. beta. -glucuronidase gene

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Funkenstein, B.; Leary, S.L.; Stein, J.C.

    1988-03-01

    The Gus-s/sup ..cap alpha../ allele of the mouse ..beta..-glucuronidase gene exhibits a high degree of inducibility by androgens due to its linkage with the Gus-r/sup ..cap alpha../ regulatory locus. The authors isolated Gus-s/sup ..cap alpha../ on a 28-kilobase pair fragment of mouse chromosome 5 and found that it contains 12 exons and 11 intervening sequences spanning 14 kilobase pairs of this genomic segment. The mRNA cap site was identified by ribonuclease protection and primer extension analyses which revealed an unusually short 5' noncoding sequence of 12 nucleotides. Proximal regulatory sequences in the 5'-flanking DNA and the complete sequence of themore » Gus-s/sup ..cap alpha../ mRNA transcript were also determined. Comparison of the amino acid sequence determined from the Gus-s/sup ..cap alpha../ nucleotide sequence with that of human ..beta..-glucuronidase indicated that the two human mRNA species differ due to alternate splicing of an exon homologous to exon 6 of the mouse gene.« less

  8. Performance evaluation of DNA copy number segmentation methods.

    PubMed

    Pierre-Jean, Morgane; Rigaill, Guillem; Neuvial, Pierre

    2015-07-01

    A number of bioinformatic or biostatistical methods are available for analyzing DNA copy number profiles measured from microarray or sequencing technologies. In the absence of rich enough gold standard data sets, the performance of these methods is generally assessed using unrealistic simulation studies, or based on small real data analyses. To make an objective and reproducible performance assessment, we have designed and implemented a framework to generate realistic DNA copy number profiles of cancer samples with known truth. These profiles are generated by resampling publicly available SNP microarray data from genomic regions with known copy-number state. The original data have been extracted from dilutions series of tumor cell lines with matched blood samples at several concentrations. Therefore, the signal-to-noise ratio of the generated profiles can be controlled through the (known) percentage of tumor cells in the sample. This article describes this framework and its application to a comparison study between methods for segmenting DNA copy number profiles from SNP microarrays. This study indicates that no single method is uniformly better than all others. It also helps identifying pros and cons of the compared methods as a function of biologically informative parameters, such as the fraction of tumor cells in the sample and the proportion of heterozygous markers. This comparison study may be reproduced using the open source and cross-platform R package jointseg, which implements the proposed data generation and evaluation framework: http://r-forge.r-project.org/R/?group_id=1562. © The Author 2014. Published by Oxford University Press.

  9. Molecular mechanism of transcription inhibition by phage T7 gp2 protein.

    PubMed

    Mekler, Vladimir; Minakhin, Leonid; Sheppard, Carol; Wigneshweraraj, Sivaramesh; Severinov, Konstantin

    2011-11-11

    Escherichia coli T7 bacteriophage gp2 protein is a potent inhibitor of host RNA polymerase (RNAP). gp2 inhibits formation of open promoter complex by binding to the β' jaw, an RNAP domain that interacts with downstream promoter DNA. Here, we used an engineered promoter with an optimized sequence to obtain and characterize a specific promoter complex containing RNAP and gp2. In this complex, localized melting of promoter DNA is initiated but does not propagate to include the point of the transcription start. As a result, the complex is transcriptionally inactive. Using a highly sensitive RNAP beacon assay, we performed quantitative real-time measurements of specific binding of the RNAP-gp2 complex to promoter DNA and various promoter fragments. In this way, the effect of gp2 on RNAP interaction with promoters was dissected. As expected, gp2 greatly decreased RNAP affinity to downstream promoter duplex. However, gp2 also inhibited RNAP binding to promoter fragments that lacked downstream promoter DNA that interacts with the β' jaw. The inhibition was caused by gp2-mediated decrease of the RNAP binding affinity to template and non-template strand segments of the transcription bubble downstream of the -10 promoter element. The inhibition of RNAP interactions with single-stranded segments of the transcription bubble by gp2 is a novel effect, which may occur via allosteric mechanism that is set in motion by the gp2 binding to the β' jaw. Copyright © 2011 Elsevier Ltd. All rights reserved.

  10. Position-based scanning for comparative genomics and identification of genetic islands in Haemophilus influenzae type b.

    PubMed

    Bergman, Nicholas H; Akerley, Brian J

    2003-03-01

    Bacteria exhibit extensive genetic heterogeneity within species. In many cases, these differences account for virulence properties unique to specific strains. Several such loci have been discovered in the genome of the type b serotype of Haemophilus influenzae, a human pathogen able to cause meningitis, pneumonia, and septicemia. Here we report application of a PCR-based scanning procedure to compare the genome of a virulent type b (Hib) strain with that of the laboratory-passaged Rd KW20 strain for which a complete genome sequence is available. We have identified seven DNA segments or H. influenzae genetic islands (HiGIs) present in the type b genome and absent from the Rd genome. These segments vary in size and content and show signs of horizontal gene transfer in that their percent G+C content differs from that of the rest of the H. influenzae genome, they contain genes similar to those found on phages or other mobile elements, or they are flanked by DNA repeats. Several of these loci represent potential pathogenicity islands, because they contain genes likely to mediate interactions with the host. These newly identified genetic islands provide areas of investigation into both the evolution and pathogenesis of H. influenzae. In addition, the genome scanning approach developed to identify these islands provides a rapid means to compare the genomes of phenotypically diverse bacterial strains once the genome sequence of one representative strain has been determined.

  11. Detection of Ophiocordyceps sinensis and Its Common Adulterates Using Species-Specific Primers

    PubMed Central

    Liu, Yang; Wang, Xiao-yue; Gao, Zi-tong; Han, Jian-ping; Xiang, Li

    2017-01-01

    Ophiocordyceps sinensis is a fungus that infects Hepialidae caterpillars, mummifying the larvae and producing characteristic fruiting bodies (stromata) that are processed into one of the most valued traditional Chinese medicines (TCM). The product commands a very high price due to a high demand but a very limited supply. Adulteration with other fungi is a common problem and there is a need to test preparation for the presence of the correct fungus. In the current study, a PCR-based approach for the identification of O. sinensis based on a segment of the internal transcribed spacer (ITS) region was developed. The segments is 146-bp in size and is likely to be amplified even in materials where processing led to DNA fragmentation. Primer development was based on the alignment of sequence data generated from a total of 89 samples of O. sinensis and potential adulterants as well as sequences date from 41 Ophiocordyceps species and 26 Cordyceps species available in GenBank. Tests with primer pair, DCF4/DCR4, demonstrated generation of an amplicon from DNA extracted from O. sinensis stromata, but not from extracts derived from adulterants. Species-specific primer pairs were also developed and tested for detection of the common adulterants, Cordyceps gunnii, Cordyceps cicadae, Cordyceps militaris, Cordyceps liangshanensis and Ophiocordyceps nutans. The collection of primers developed in the present study will be useful for the authentication of preparation claiming to only contain O. sinensis and for the detection of fungi used as adulterants in these preparations. PMID:28680424

  12. Current sequencing technology makes microhaplotypes a powerful new type of genetic marker for forensics.

    PubMed

    Kidd, Kenneth K; Pakstis, Andrew J; Speed, William C; Lagacé, Robert; Chang, Joseph; Wootton, Sharon; Haigh, Eva; Kidd, Judith R

    2014-09-01

    SNPs that are molecularly very close (<10kb) will generally have extremely low recombination rates, much less than 10(-4). Multiple haplotypes will often exist because of the history of the origins of the variants at the different sites, rare recombinants, and the vagaries of random genetic drift and/or selection. Such multiallelic haplotype loci are potentially important in forensic work for individual identification, for defining ancestry, and for identifying familial relationships. The new DNA sequencing capabilities currently available make possible continuous runs of a few hundred base pairs so that we can now determine the allelic combination of multiple SNPs on each chromosome of an individual, i.e., the phase, for multiple SNPs within a small segment of DNA. Therefore, we have begun to identify regions, encompassing two to four SNPs with an extent of <200bp that define multiallelic haplotype loci. We have identified candidate regions and have collected pilot data on many candidate microhaplotype loci. Here we present 31 microhaplotype loci that have at least three alleles, have high heterozygosity, are globally informative, and are statistically independent at the population level. This study of microhaplotype loci (microhaps) provides proof of principle that such markers exist and validates their usefulness for ancestry inference, lineage-clan-family inference, and individual identification. The true value of microhaplotypes will come with sequencing methods that can establish alleles unambiguously, including disentangling of mixtures, because a single sequencing run on a single strand of DNA will encompass all of the SNPs. Copyright © 2014 The Authors. Published by Elsevier Ireland Ltd.. All rights reserved.

  13. On the Origin of Reverse Transcriptase-Using CRISPR-Cas Systems and Their Hyperdiverse, Enigmatic Spacer Repertoires.

    PubMed

    Silas, Sukrit; Makarova, Kira S; Shmakov, Sergey; Páez-Espino, David; Mohr, Georg; Liu, Yi; Davison, Michelle; Roux, Simon; Krishnamurthy, Siddharth R; Fu, Becky Xu Hua; Hansen, Loren L; Wang, David; Sullivan, Matthew B; Millard, Andrew; Clokie, Martha R; Bhaya, Devaki; Lambowitz, Alan M; Kyrpides, Nikos C; Koonin, Eugene V; Fire, Andrew Z

    2017-07-11

    Cas1 integrase is the key enzyme of the clustered regularly interspaced short palindromic repeat (CRISPR)-Cas adaptation module that mediates acquisition of spacers derived from foreign DNA by CRISPR arrays. In diverse bacteria, the cas1 gene is fused (or adjacent) to a gene encoding a reverse transcriptase (RT) related to group II intron RTs. An RT-Cas1 fusion protein has been recently shown to enable acquisition of CRISPR spacers from RNA. Phylogenetic analysis of the CRISPR-associated RTs demonstrates monophyly of the RT-Cas1 fusion, and coevolution of the RT and Cas1 domains. Nearly all such RTs are present within type III CRISPR-Cas loci, but their phylogeny does not parallel the CRISPR-Cas type classification, indicating that RT-Cas1 is an autonomous functional module that is disseminated by horizontal gene transfer and can function with diverse type III systems. To compare the sequence pools sampled by RT-Cas1-associated and RT-lacking CRISPR-Cas systems, we obtained samples of a commercially grown cyanobacterium- Arthrospira platensis Sequencing of the CRISPR arrays uncovered a highly diverse population of spacers. Spacer diversity was particularly striking for the RT-Cas1-containing type III-B system, where no saturation was evident even with millions of sequences analyzed. In contrast, analysis of the RT-lacking type III-D system yielded a highly diverse pool but reached a point where fewer novel spacers were recovered as sequencing depth was increased. Matches could be identified for a small fraction of the non-RT-Cas1-associated spacers, and for only a single RT-Cas1-associated spacer. Thus, the principal source(s) of the spacers, particularly the hypervariable spacer repertoire of the RT-associated arrays, remains unknown. IMPORTANCE While the majority of CRISPR-Cas immune systems adapt to foreign genetic elements by capturing segments of invasive DNA, some systems carry reverse transcriptases (RTs) that enable adaptation to RNA molecules. From analysis of available bacterial sequence data, we find evidence that RT-based RNA adaptation machinery has been able to join with CRISPR-Cas immune systems in many, diverse bacterial species. To investigate whether the abilities to adapt to DNA and RNA molecules are utilized for defense against distinct classes of invaders in nature, we sequenced CRISPR arrays from samples of commercial-scale open-air cultures of Arthrospira platensis , a cyanobacterium that contains both RT-lacking and RT-containing CRISPR-Cas systems. We uncovered a diverse pool of naturally occurring immune memories, with the RT-lacking locus acquiring a number of segments matching known viral or bacterial genes, while the RT-containing locus has acquired spacers from a distinct sequence pool for which the source remains enigmatic. Copyright © 2017 Silas et al.

  14. Genetic localization of diuron- and mucidin-resistant mutants relative to a group of loci of the mitochondrial DNA controlling coenzyme QH2-cytochrome c reductase in Saccharomyces cerevisiae.

    PubMed

    Colson, A M; Slonimski, P P

    1979-01-02

    Diuron-resistance, DIU (Colson et al., 1977), antimycin-resistance, ANA (Michaelis, 1976; Burger et al., 1976), funiculosin-resistance, FUN (Pratje and Michaelis, 1977; Burger et al., 1977) and mucidin-resistance, MUC (Subik et al., 1977) are each coded by a pair of genetic loci on the mit DNA of S. cerevisiae. In the present paper, these respiratiory-competent, drug-resistant loci are localized relative to respiratory-deficient BOX mutants deficient in coenzyme QH2-cytochrome c reductase (Kotylak and Slonimski, 1976, 1977) using deletion and recombination mapping. Three drug-resistant loci possessing distinct mutated allelic forms are distinguished. DIU1 is allelic or closely linked to ANA2, FUN1 and BOX1; DIU2 is allelic or closely linked to ANA1, MUC1 and BOX4/5; MUC2 is allelic to BOX6. The high recombinant frequencies observed between the three loci (13% on the average for 33 various combinations analyzed) suggest the existence of either three genes coding for three distinct polypeptides or of a single gene coding for a single polypeptide but subdivided into three easily separable segments. The resistance of the respiratory-chain observed in vitro in the drug-resistant mutants and the allelism relationships between respiratory-competent, drug-resistant loci and coQH2-cyt c reductase deficient, BOX, loci strongly suggest that each of the three drug-resistant loci codes for a structural gene-product which is essential for the normal coQH2-cyt c reductase activity and is obviously a good candidate for a gene product of the drug-resistant loci mapped in this paper. Polypeptide length modifications of cytochrome b were observed in mutants deficient in the coQH2-cyt c red and localized at the BOX1, BOX4 and BOX6 genetic loci (Claisse et al., 1977, 1978) which are precisely the loci allelic to drug resistant mutants as shown in the present work. Taken together these two sets of data provide a strong evidence in favor of the idea that there exist three non contiguous segments of the mitochondrial DNA sequence which code for a single polypeptide sequence of cytochrome b. In each segment mutations which modify the polypeptide sequence can occur leading to the loss (BOX mutants) or to a modification (drug resistant mutants) of the enzyme activity.

  15. DNA Motion Capture Reveals the Mechanical Properties of DNA at the Mesoscale

    PubMed Central

    Price, Allen C.; Pilkiewicz, Kevin R.; Graham, Thomas G.W.; Song, Dan; Eaves, Joel D.; Loparo, Joseph J.

    2015-01-01

    Single-molecule studies probing the end-to-end extension of long DNAs have established that the mechanical properties of DNA are well described by a wormlike chain force law, a polymer model where persistence length is the only adjustable parameter. We present a DNA motion-capture technique in which DNA molecules are labeled with fluorescent quantum dots at specific sites along the DNA contour and their positions are imaged. Tracking these positions in time allows us to characterize how segments within a long DNA are extended by flow and how fluctuations within the molecule are correlated. Utilizing a linear response theory of small fluctuations, we extract elastic forces for the different, ∼2-μm-long segments along the DNA backbone. We find that the average force-extension behavior of the segments can be well described by a wormlike chain force law with an anomalously small persistence length. PMID:25992731

  16. Genetic analysis of hybridization and introgression between wild mongoose and brown lemurs.

    PubMed

    Pastorini, Jennifer; Zaramody, Alphonse; Curtis, Deborah J; Nievergelt, Caroline M; Mundy, Nicholas I

    2009-02-05

    Hybrid zones generally represent areas of secondary contact after speciation. The nature of the interaction between genes of individuals in a hybrid zone is of interest in the study of evolutionary processes. In this study, data from nuclear microsatellites and mitochondrial DNA sequences were used to genetically characterize hybridization between wild mongoose lemurs (Eulemur mongoz) and brown lemurs (E. fulvus) at Anjamena in west Madagascar. Two segments of mtDNA have been sequenced and 12 microsatellite loci screened in 162 brown lemurs and mongoose lemurs. Among the mongoose lemur population at Anjamena, we identified two F1 hybrids (one also having the mtDNA haplotype of E. fulvus) and six other individuals with putative introgressed alleles in their genotype. Principal component analysis groups both hybrids as intermediate between E. mongoz and E. fulvus and admixture analyses revealed an admixed genotype for both animals. Paternity testing proved one F1 hybrid to be fertile. Of the eight brown lemurs genotyped, all have either putative introgressed microsatellite alleles and/or the mtDNA haplotype of E. mongoz. Introgression is bidirectional for the two species, with an indication that it is more frequent in brown lemurs than in mongoose lemurs. We conclude that this hybridization occurs because mongoose lemurs have expanded their range relatively recently. Introgressive hybridization may play an important role in the unique lemur radiation, as has already been shown in other rapidly evolving animals.

  17. Role of the RS1 sequence of the cholera vibrio in amplification of the segment of plasmid DNA carrying the gene of resistance to tetracycline and the genes of cholera toxin

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Fil'kova, S.L.; Il'ina, T.S.; Gintsburg, A.L.

    1988-11-01

    The hybrid plasmid pCO107, representing cointegrate 14(2)-5(2) of two plasmids, an F-derivative (pOX38) and a PBR322-derivative (pCT105) with an RS1 sequence of the cholera vibrio cloned in its makeup, contains two copes of RS1 at the sites of union of the two plasmids. Using a tetracycline resistance marker (Tc/sup R/) of the plasmid pCT105, clones were isolated which have an elevated level of resistance to tetracycline (an increase of from 4- to 30-fold). Using restriction analysis and the Southern blot method of hybridization it was shown that the increase in the level of resistance of tetracycline is associated with themore » amplification of pCT105 portion of the cointegrate, and that the process of amplification is governed by the presence of direct repeats of the RS1 sequence at its ends. The increase in the number of copies of the pCT105 segment, which contains in its composition the genes of cholera toxin (vct), is accompanied by an increase in toxin production.« less

  18. SS-Wrapper: a package of wrapper applications for similarity searches on Linux clusters.

    PubMed

    Wang, Chunlin; Lefkowitz, Elliot J

    2004-10-28

    Large-scale sequence comparison is a powerful tool for biological inference in modern molecular biology. Comparing new sequences to those in annotated databases is a useful source of functional and structural information about these sequences. Using software such as the basic local alignment search tool (BLAST) or HMMPFAM to identify statistically significant matches between newly sequenced segments of genetic material and those in databases is an important task for most molecular biologists. Searching algorithms are intrinsically slow and data-intensive, especially in light of the rapid growth of biological sequence databases due to the emergence of high throughput DNA sequencing techniques. Thus, traditional bioinformatics tools are impractical on PCs and even on dedicated UNIX servers. To take advantage of larger databases and more reliable methods, high performance computation becomes necessary. We describe the implementation of SS-Wrapper (Similarity Search Wrapper), a package of wrapper applications that can parallelize similarity search applications on a Linux cluster. Our wrapper utilizes a query segmentation-search (QS-search) approach to parallelize sequence database search applications. It takes into consideration load balancing between each node on the cluster to maximize resource usage. QS-search is designed to wrap many different search tools, such as BLAST and HMMPFAM using the same interface. This implementation does not alter the original program, so newly obtained programs and program updates should be accommodated easily. Benchmark experiments using QS-search to optimize BLAST and HMMPFAM showed that QS-search accelerated the performance of these programs almost linearly in proportion to the number of CPUs used. We have also implemented a wrapper that utilizes a database segmentation approach (DS-BLAST) that provides a complementary solution for BLAST searches when the database is too large to fit into the memory of a single node. Used together, QS-search and DS-BLAST provide a flexible solution to adapt sequential similarity searching applications in high performance computing environments. Their ease of use and their ability to wrap a variety of database search programs provide an analytical architecture to assist both the seasoned bioinformaticist and the wet-bench biologist.

  19. SS-Wrapper: a package of wrapper applications for similarity searches on Linux clusters

    PubMed Central

    Wang, Chunlin; Lefkowitz, Elliot J

    2004-01-01

    Background Large-scale sequence comparison is a powerful tool for biological inference in modern molecular biology. Comparing new sequences to those in annotated databases is a useful source of functional and structural information about these sequences. Using software such as the basic local alignment search tool (BLAST) or HMMPFAM to identify statistically significant matches between newly sequenced segments of genetic material and those in databases is an important task for most molecular biologists. Searching algorithms are intrinsically slow and data-intensive, especially in light of the rapid growth of biological sequence databases due to the emergence of high throughput DNA sequencing techniques. Thus, traditional bioinformatics tools are impractical on PCs and even on dedicated UNIX servers. To take advantage of larger databases and more reliable methods, high performance computation becomes necessary. Results We describe the implementation of SS-Wrapper (Similarity Search Wrapper), a package of wrapper applications that can parallelize similarity search applications on a Linux cluster. Our wrapper utilizes a query segmentation-search (QS-search) approach to parallelize sequence database search applications. It takes into consideration load balancing between each node on the cluster to maximize resource usage. QS-search is designed to wrap many different search tools, such as BLAST and HMMPFAM using the same interface. This implementation does not alter the original program, so newly obtained programs and program updates should be accommodated easily. Benchmark experiments using QS-search to optimize BLAST and HMMPFAM showed that QS-search accelerated the performance of these programs almost linearly in proportion to the number of CPUs used. We have also implemented a wrapper that utilizes a database segmentation approach (DS-BLAST) that provides a complementary solution for BLAST searches when the database is too large to fit into the memory of a single node. Conclusions Used together, QS-search and DS-BLAST provide a flexible solution to adapt sequential similarity searching applications in high performance computing environments. Their ease of use and their ability to wrap a variety of database search programs provide an analytical architecture to assist both the seasoned bioinformaticist and the wet-bench biologist. PMID:15511296

  20. BCL2 oncogene translocation is mediated by a chi-like consensus

    PubMed Central

    1992-01-01

    Examination of 64 translocations involving the major breakpoint region (mbr) of the BCL2 oncogene and the immunoglobulin heavy chain locus identified three short (14, 16, and 18 bp) segments within the mbr at which translocations occurred with very high frequency. Each of these clusters was associated with a 15-bp region of sequence homology, the principal one containing an octamer related to chi, the procaryotic activator of recombination. The presence of short deletions and N nucleotide additions at the breakpoints, as well as involvement of JH and DH coding regions, suggested that these sequences served as signals capable of interacting with the VDJ recombinase complex, even though no homology with the traditional heptamer/spacer/nonamer (IgRSS) existed. Furthermore, the BCL2 signal sequences were employed in a bidirectional fashion and could mediate recombination of one mbr region with another. Segments homologous to the BCL2 signal sequences flanked individual members of the XP family of diversity gene segments, which were themselves highly overrepresented in the reciprocal products (18q-) of BCL2 translocation. We propose that the chi-like signal sequences of BCL2 represent a distinct class of recognition sites for the recombinase complex, responsible for initiating interactions between regions of DNA separated by great distances, and that BCL2 translocation begins by a recombination event between mbr and DXP chi signals. Since recombinant joints containing chi, not IgRSS, occur in brain cells expressing RAG-1 (Matsuoka, M., F. Nagawa, K. Okazaki, L. Kingsbury, K. Yoshida, U. Muller, D. T. Larue, J. A. Winer, and H. Sakano. 1991. Science [Wash. DC]. 254:81; reference 1), we further suggest that the product of this gene could mediate both BCL2 translocation and the first step of normal DJ assembly through the creation of chi joints, rather than signal or coding joints. PMID:1588282

  1. Overview of post Cohen-Boyer methods for single segment cloning and for multisegment DNA assembly

    PubMed Central

    Sands, Bryan; Brent, Roger

    2016-01-01

    In 1973, Cohen and coworkers published a foundational paper describing the cloning of DNA fragments into plasmid vectors. In it, they used DNA segments made by digestion with restriction enzymes and joined these in vitro with DNA ligase. These methods established working recombinant DNA technology and enabled the immediate start of the biotechnology industry. Since then, “classical” recombinant DNA technology using restriction enzymes and DNA ligase has matured. At the same time, researchers have developed numerous ways to generate large, complex, multisegment DNA constructions that offer advantages over classical techniques. Here, we provide an overview of “post-Cohen-Boyer” techniques used for cloning single segments into vectors (T/A, Topo cloning, Gateway and Recombineering) and for multisegment DNA assembly (Biobricks, Golden Gate, Gibson, Yeast homologous recombination in vivo, and Ligase Cycling Reaction). We compare and contrast these methods and also discuss issues that researchers should consider before choosing a particular multisegment DNA assembly method. PMID:27152131

  2. A Segmentation Method for Lung Parenchyma Image Sequences Based on Superpixels and a Self-Generating Neural Forest

    PubMed Central

    Liao, Xiaolei; Zhao, Juanjuan; Jiao, Cheng; Lei, Lei; Qiang, Yan; Cui, Qiang

    2016-01-01

    Background Lung parenchyma segmentation is often performed as an important pre-processing step in the computer-aided diagnosis of lung nodules based on CT image sequences. However, existing lung parenchyma image segmentation methods cannot fully segment all lung parenchyma images and have a slow processing speed, particularly for images in the top and bottom of the lung and the images that contain lung nodules. Method Our proposed method first uses the position of the lung parenchyma image features to obtain lung parenchyma ROI image sequences. A gradient and sequential linear iterative clustering algorithm (GSLIC) for sequence image segmentation is then proposed to segment the ROI image sequences and obtain superpixel samples. The SGNF, which is optimized by a genetic algorithm (GA), is then utilized for superpixel clustering. Finally, the grey and geometric features of the superpixel samples are used to identify and segment all of the lung parenchyma image sequences. Results Our proposed method achieves higher segmentation precision and greater accuracy in less time. It has an average processing time of 42.21 seconds for each dataset and an average volume pixel overlap ratio of 92.22 ± 4.02% for four types of lung parenchyma image sequences. PMID:27532214

  3. DNA-based nanobiostructured devices: The role of quasiperiodicity and correlation effects

    NASA Astrophysics Data System (ADS)

    Albuquerque, E. L.; Fulco, U. L.; Freire, V. N.; Caetano, E. W. S.; Lyra, M. L.; de Moura, F. A. B. F.

    2014-02-01

    The purpose of this review is to present a comprehensive and up-to-date account of the main physical properties of DNA-based nanobiostructured devices, stressing the role played by their quasi-periodicity arrangement and correlation effects. Although the DNA-like molecule is usually described as a short-ranged correlated random ladder, artificial segments can be grown following quasiperiodic sequences as, for instance, the Fibonacci and Rudin-Shapiro ones. They have interesting properties like a complex fractal spectra of energy, which can be considered as their indelible mark, and collective properties that are not shared by their constituents. These collective properties are due to the presence of long-range correlations, which are expected to be reflected somehow in their various spectra (electronic transmission, density of states, etc.) defining another description of disorder. Although long-range correlations are responsible for the effective electronic transport at specific resonant energies of finite DNA segments, much of the anomalous spread of an initially localized electron wave-packet can be accounted by short-range pair correlations, suggesting that an approach based on the inclusion of further short-range correlations on the nucleotide distribution leads to an adequate description of the electronic properties of DNA segments. The introduction of defects may generate states within the gap, and substantially improves the conductance, specially of finite branches. They usually become exponentially localized for any amount of disorder, and have the property to tailor the electronic transport properties of DNA-based nanoelectronic devices. In particular, symmetric and antisymmetric correlations have quite distinct influence on the nature of the electronic states, and a diluted distribution of defects lead to an anomalous diffusion of the electronic wave-packet. Nonlinear contributions, arising from the coupling between electrons and the molecular vibrations, promote an electronic self-trapping, thus opening up the possibility of controlling the spreading of the electronic density by an external field. The main features of DNA-based nanobiostructured devices presented in this review will include their electronic density of states, energy profiles, thermodynamic properties, localization, correlation effects, scale laws, fractal and multifractal analysis, and anhydrous crystals of their bases, among others. New features, like other nanobiostructured devices, as well as the future directions in this field are also presented and discussed.

  4. HOXBES2: a novel epididymal HOXB2 homeoprotein and its domain-specific association with spermatozoa.

    PubMed

    Prabagaran, E; Bandivdekar, A H; Dighe, V; Raghavan, V P

    2007-02-01

    The sperm from the testis acquires complete fertilizing ability and forward progressive motility following its transit through the epididymis. Acquisition of these characteristics results from the modification of the sperm proteome following interactions with epididymal secretions. In our attempts to identify epididymis-specific sperm plasma membrane proteins, a partial 2.83-kb clone was identified by immunoscreening a monkey epididymal cDNA library with an agglutinating monoclonal antibody raised against washed human spermatozoa. The sequence of the 2.83-kb clone exhibited homology to the region between 1 and 1097 bp of the homeobox gene, Hoxb2. This sequence was found to be species conserved, as revealed by RT-PCR analysis. To obtain a full-length clone of the sequence, 5' RACE-PCR (rapid amplification of cDNA ends PCR) was carried out using rat epididymal RNA as the template. It resulted in a full-length 1.657-kb cDNA encoding a 32.9-kDa putative protein. The protein designated HOXBES2 exhibited homology to the conserved 61-amino acid homeodomain region of the HOXB2 homeoprotein. However, characteristic differences were noted in its amino and carboxyl termini compared with HOXB2. A putative 30-kDa protein was detected in the tissue extracts from adult rat epididymis and caudal spermatozoa, and a 37-kDa protein was detected in the rat embryo when probed with a polyclonal antibody against HOXB2 protein. Multiple tissue Western blot and immunohistochemical analysis further indicated its expression in the cytoplasm of the principal and basal epithelial cells, with maximal expression in the distal epididymal segments. Northern blot analysis detected a single approximately 2.5-kb transcript from the adult epididymis. Indirect immunofluorescence localized the protein to the acrosome, midpiece, and equatorial segments of rat caudal and ejaculated human and monkey spermatozoa, respectively. In conclusion, we have identified and characterized a novel epididymal homeoprotein different from HOXB2 protein and hereafter referred to as HOXBES2, (HOXB2 homeodomain containing epididymis-specific sperm protein) with a probable role in fertilization.

  5. Local site preference rationalizes disentangling by DNA topoisomerases

    NASA Astrophysics Data System (ADS)

    Liu, Zhirong; Zechiedrich, Lynn; Chan, Hue Sun

    2010-03-01

    To rationalize the disentangling action of type II topoisomerases, an improved wormlike DNA model was used to delineate the degree of unknotting and decatenating achievable by selective segment passage at specific juxtaposition geometries and to determine how these activities were affected by DNA circle size and solution ionic strength. We found that segment passage at hooked geometries can reduce knot populations as dramatically as seen in experiments. Selective segment passage also provided theoretical underpinning for an intriguing empirical scaling relation between unknotting and decatenating potentials.

  6. DNA sequence homology induces cytosine-to-thymine mutation by a heterochromatin-related pathway in Neurospora

    PubMed Central

    Gladyshev, Eugene; Kleckner, Nancy

    2017-01-01

    Eukaryotic genomes contain substantial amounts of repetitive DNA organized in the form of constitutive heterochromatin and associated with repressive epigenetic modifications, such as H3K9me3 and C5-cytosine methylation (5mC). In the fungus Neurospora crassa, H3K9me3 and 5mC are catalyzed, respectively, by a conserved SUV39 histone methyltransferase DIM-5 and a DNMT1-like cytosine methyltransferase DIM-2. Here we show that DIM-2 can also mediate Repeat-Induced Point mutation (RIP) of repetitive DNA in N. crassa. We further show that DIM-2-dependent RIP requires DIM-5, HP1, and other known heterochromatin factors, implying the role of a repeat-induced heterochromatin-related process. Our previous findings suggest that the mechanism of repeat recognition for RIP involves direct interactions between homologous double-stranded (ds) DNA segments. We thus now propose that, in somatic cells, homologous dsDNA/dsDNA interactions between a small number of repeat copies can nucleate a transient heterochromatic state, which, on longer repeat arrays, may lead to the formation of constitutive heterochromatin. PMID:28459455

  7. Influence of Electron–Holes on DNA Sequence-Specific Mutation Rates

    PubMed Central

    Suárez-Villagrán, Martha Y; Azevedo, Ricardo B R; Miller, John H

    2018-01-01

    Abstract Biases in mutation rate can influence molecular evolution, yielding rates of evolution that vary widely in different parts of the genome and even among neighboring nucleotides. Here, we explore one possible mechanism of influence on sequence-specific mutation rates, the electron–hole, which can localize and potentially trigger a replication mismatch. A hole is a mobile site of positive charge created during one-electron oxidation by, for example, radiation, contact with a mutagenic agent, or oxidative stress. Its quantum wavelike properties cause it to localize at various sites with probabilities that vary widely, by orders of magnitude, and depend strongly on the local sequence. We find significant correlations between hole probabilities and mutation rates within base triplets, observed in published mutation accumulation experiments on four species of bacteria. We have also computed hole probability spectra for hypervariable segment I of the human mtDNA control region, which contains several mutational hotspots, and for heptanucleotides in noncoding regions of the human genome, whose polymorphism levels have recently been reported. We observe significant correlations between hole probabilities, and context-specific mutation and substitution rates. The correlation with hole probability cannot be explained entirely by CpG methylation in the heptanucleotide data. Peaks in hole probability tend to coincide with mutational hotspots, even in mtDNA where CpG methylation is rare. Our results suggest that hole-enhanced mutational mechanisms, such as oxidation-stabilized tautomerization and base deamination, contribute to molecular evolution. PMID:29617801

  8. Phage T4 SegB protein is a homing endonuclease required for the preferred inheritance of T4 tRNA gene region occurring in co-infection with a related phage.

    PubMed

    Brok-Volchanskaya, Vera S; Kadyrov, Farid A; Sivogrivov, Dmitry E; Kolosov, Peter M; Sokolov, Andrey S; Shlyapnikov, Michael G; Kryukov, Valentine M; Granovsky, Igor E

    2008-04-01

    Homing endonucleases initiate nonreciprocal transfer of DNA segments containing their own genes and the flanking sequences by cleaving the recipient DNA. Bacteriophage T4 segB gene, which is located in a cluster of tRNA genes, encodes a protein of unknown function, homologous to homing endonucleases of the GIY-YIG family. We demonstrate that SegB protein is a site-specific endonuclease, which produces mostly 3' 2-nt protruding ends at its DNA cleavage site. Analysis of SegB cleavage sites suggests that SegB recognizes a 27-bp sequence. It contains 11-bp conserved sequence, which corresponds to a conserved motif of tRNA TpsiC stem-loop, whereas the remainder of the recognition site is rather degenerate. T4-related phages T2L, RB1 and RB3 contain tRNA gene regions that are homologous to that of phage T4 but lack segB gene and several tRNA genes. In co-infections of phages T4 and T2L, segB gene is inherited with nearly 100% of efficiency. The preferred inheritance depends absolutely on the segB gene integrity and is accompanied by the loss of the T2L tRNA gene region markers. We suggest that SegB is a homing endonuclease that functions to ensure spreading of its own gene and the surrounding tRNA genes among T4-related phages.

  9. Uneven distribution of expressed sequence tag loci on maize pachytene chromosomes

    PubMed Central

    Anderson, Lorinda K.; Lai, Ann; Stack, Stephen M.; Rizzon, Carene; Gaut, Brandon S.

    2006-01-01

    Examining the relationships among DNA sequence, meiotic recombination, and chromosome structure at a genome-wide scale has been difficult because only a few markers connect genetic linkage maps with physical maps. Here, we have positioned 1195 genetically mapped expressed sequence tag (EST) markers onto the 10 pachytene chromosomes of maize by using a newly developed resource, the RN-cM map. The RN-cM map charts the distribution of crossing over in the form of recombination nodules (RNs) along synaptonemal complexes (SCs, pachytene chromosomes) and allows genetic cM distances to be converted into physical micrometer distances on chromosomes. When this conversion is made, most of the EST markers used in the study are located distally on the chromosomes in euchromatin. ESTs are significantly clustered on chromosomes, even when only euchromatic chromosomal segments are considered. Gene density and recombination rate (as measured by EST and RN frequencies, respectively) are strongly correlated. However, crossover frequencies for telomeric intervals are much higher than was expected from their EST frequencies. For pachytene chromosomes, EST density is about fourfold higher in euchromatin compared with heterochromatin, while DNA density is 1.4 times higher in heterochromatin than in euchromatin. Based on DNA density values and the fraction of pachytene chromosome length that is euchromatic, we estimate that ∼1500 Mbp of the maize genome is in euchromatin. This overview of the organization of the maize genome will be useful in examining genome and chromosome evolution in plants. PMID:16339046

  10. Pseudomonas aeruginosa Microcolonies in Coronary Thrombi from Patients with ST-Segment Elevation Myocardial Infarction

    PubMed Central

    Hansen, Gorm Mørk; Belstrøm, Daniel; Nilsson, Martin; Helqvist, Steffen; Nielsen, Claus Henrik; Holmstrup, Palle; Tolker-Nielsen, Tim; Givskov, Michael; Hansen, Peter Riis

    2016-01-01

    Chronic infection is associated with an increased risk of atherothrombotic disease and direct bacterial infection of arteries has been suggested to contribute to the development of unstable atherosclerotic plaques. In this study, we examined coronary thrombi obtained in vivo from patients with ST-segment elevation myocardial infarction (STEMI) for the presence of bacterial DNA and bacteria. Aspirated coronary thrombi from 22 patients with STEMI were collected during primary percutaneous coronary intervention and arterial blood control samples were drawn from radial or femoral artery sheaths. Analyses were performed using 16S polymerase chain reaction and with next-generation sequencing to determine bacterial taxonomic classification. In selected thrombi with the highest relative abundance of Pseudomonas aeruginosa DNA, peptide nucleic acid fluorescence in situ hybridization (PNA-FISH) with universal and species specific probes was performed to visualize bacteria within thrombi. From the taxonomic analysis we identified a total of 55 different bacterial species. DNA from Pseudomonas aeruginosa represented the only species that was significantly associated with either thrombi or blood and was >30 times more abundant in thrombi than in arterial blood (p<0.0001). Whole and intact bacteria present as biofilm microcolonies were detected in selected thrombi using universal and P. aeruginosa-specific PNA-FISH probes. P. aeruginosa and vascular biofilm infection in culprit lesions may play a role in STEMI, but causal relationships remain to be determined. PMID:28030624

  11. Development of a nested polymerase chain reaction for amplification of a sequence of the p57 gene of Renibacterium salmoninarum that provides a highly sensitive method for detection of the bacterium in salmonid kidney

    USGS Publications Warehouse

    Chase, D.M.; Pascho, R.J.

    1998-01-01

    Nucleic acid-based assays have shown promise for diagnosing Renibacterium salmoninarum in tissues and body fluids of salmonids. DeVelopment of a nested polymerase chain reaction (PCR) method to detect a 320 bp DNA segment of the gene encoding the p57 protein of R. salmoninarum is described. Whereas a conventional PCR for a 383 bp segment of the p57 gene reliably detected 1000 R. salmoninarum cells per reaction in kidney tissue, the nested PCR detected as few as 10 R. salmoninarum per reaction in kidney tissue. Two DNA extraction methods for the nested PCR were compared and the correlation between replicate samples was generally higher in samples extracted by the QIAamp system compared with those extracted by the phenol/chloroform method. The specificity of the nested PCR was confirmed by testing DNA extracts of common bacterial fish pathogens and a panel of bacterial species reported to cause false-positive reactions in the enzyme-linked immunosorbent assay (ELISA) and the fluorescent antibody test (FAT) for R. salmoninarum. Kidney samples from 74 naturally infected chinook Salmon were examined by the nested PCR, the ELISA, and the FAT, and the detected prevalences of R. salmoninarum were 61, 47, and 43%, respectively.

  12. Sequence of the chloroplast 16S rRNA gene and its surrounding regions of Chlamydomonas reinhardii.

    PubMed Central

    Dron, M; Rahire, M; Rochaix, J D

    1982-01-01

    The sequence of a 2 kb DNA fragment containing the chloroplast 16S ribosomal RNA gene from Chlamydomonas reinhardii and its flanking regions has been determined. The algal 16S rRNA sequence (1475 nucleotides) and secondary structure are highly related to those found in bacteria and in the chloroplasts of higher plants. In contrast, the flanking regions are very different. In C. reinhardii the 16S rRNA gene is surrounded by AT rich segments of about 180 bases, which are followed by a long stretch of complementary bases separated from each other by 1833 nucleotides. It is likely that these structures play an important role in the folding and processing of the precursor of 16S rRNA. The primary and secondary structures of the binding sites of two ribosomal proteins in the 16SrRNAs of E. coli and C. reinhardii are considerably related. Images PMID:6296784

  13. Prediction of a rare chromosomal aberration simultaneously with next generation sequencing-based comprehensive chromosome screening in human preimplantation embryos for recurrent pregnancy loss.

    PubMed

    Lee, Yi-Xuan; Chen, Chien-Wen; Lin, Yi-Hui; Tzeng, Chii-Ruey; Chen, Chi-Huang

    2018-01-01

    Preimplantation genetic testing has been used widely in recent years as a part of assisted reproductive technology (ART) owing to the breakthrough development of deoxyribonucleic acid (DNA) sequencing. With the advancement of technology and increased resolution of next generation sequencing (NGS), extensive comprehensive chromosome screening along with small clinically significant deletions and duplications can possibly be performed simultaneously. Here, we present a case of rare chromosomal aberrations: 46,XY,dup(15)(q11.2q13),t(16;18)(q23;p11.2), which resulted in a normally developed adult but abnormal gametes leading to recurrent pregnancy loss (RPL). To our best knowledge, this is the first report of t(16;18) translocation with such a small exchanged segment detected by NGS platform of MiSeq system in simultaneous 24-chromosome aneuploidy screening.

  14. Ecology of the microbiome of the infected root canal system: a comparison between apical and coronal root segments

    PubMed Central

    Özok, A.R.; Persoon, I.F.; Huse, S.M.; Keijser, B.J.F.; Wesselink, P.R.; Crielaard, W.; Zaura, E.

    2016-01-01

    Aim To evaluate the microbial ecology of the coronal and apical segments of infected root canal systems using a complete sampling technique and next-generation sequencing. Methodology The roots of 23 extracted teeth with apical periodontitis were sectioned in half, horizontally, and cryo-pulverized. Bacterial communities were profiled using tagged 454 pyrosequencing of the 16S rDNA hypervariable V5–V6 region. Results The sequences were classified into 606 taxa (species or higher taxon), representing 24 bacterial phyla or candidate divisions and one archaeal phylum. Proteobacteria were more abundant in the apical samples (p<0.05), while Actinobacteria were in significantly higher proportions in the coronal samples. The apical samples harbored statistically significantly more taxa than the coronal samples (p=0.01), and showed a higher microbial diversity. Several taxa belonging to fastidious obligate anaerobes were significantly more abundant in the apical segments of the roots compared to their coronal counterparts. Conclusions Endodontic infections are more complex than reported previously. The apical part of the root canal system drives the selection of a more diverse and more anaerobe community than the coronal part. The presence of a distinct ecological niche in the apical region explains the difficulty of eradication of the infection, and emphasizes the need that new treatment approaches should be developed. PMID:22251411

  15. A Multiplex Real-Time PCR Assay to Diagnose and Separate Helicoverpa armigera and H. zea (Lepidoptera: Noctuidae) in the New World

    PubMed Central

    Gilligan, Todd M.; Tembrock, Luke R.; Farris, Roxanne E.; Barr, Norman B.; van der Straten, Marja J.; van de Vossenberg, Bart T. L. H.; Metz-Verschure, Eveline

    2015-01-01

    The Old World bollworm, Helicoverpa armigera (Hübner), and the corn earworm, H. zea (Boddie), are two of the most important agricultural pests in the world. Diagnosing these two species is difficult—adults can only be separated with a complex dissection, and larvae cannot be identified to species using morphology, necessitating the use of geographic origin for identification in most instances. With the discovery of H. armigera in the New World, identification of immature Helicoverpa based on origin is no longer possible because H. zea also occurs in all of the geographic regions where H. armigera has been discovered. DNA barcoding and restriction fragment length polymorphism (RFLP) analyses have been reported in publications to distinguish these species, but these methods both require post-PCR processing (i.e., DNA sequencing or restriction digestion) to complete. We report the first real-time PCR assay to distinguish these pests based on two hydrolysis probes that bind to a segment of the internal transcribed spacer region 2 (ITS2) amplified using a single primer pair. One probe targets H. armigera, the second probe targets H. zea, and a third probe that targets a conserved segment of 18S rDNA is used as a control of DNA quality. The assay can be completed in 50 minutes when using isolated DNA and is successfully tested on larvae intercepted at ports of entry and adults captured during domestic surveys. We demonstrate that the assay can be run in triplex with no negative effects on sensitivity, can be run using alternative real-time PCR reagents and instruments, and does not cross react with other New World Heliothinae. PMID:26558366

  16. A Multiplex Real-Time PCR Assay to Diagnose and Separate Helicoverpa armigera and H. zea (Lepidoptera: Noctuidae) in the New World.

    PubMed

    Gilligan, Todd M; Tembrock, Luke R; Farris, Roxanne E; Barr, Norman B; van der Straten, Marja J; van de Vossenberg, Bart T L H; Metz-Verschure, Eveline

    2015-01-01

    The Old World bollworm, Helicoverpa armigera (Hübner), and the corn earworm, H. zea (Boddie), are two of the most important agricultural pests in the world. Diagnosing these two species is difficult-adults can only be separated with a complex dissection, and larvae cannot be identified to species using morphology, necessitating the use of geographic origin for identification in most instances. With the discovery of H. armigera in the New World, identification of immature Helicoverpa based on origin is no longer possible because H. zea also occurs in all of the geographic regions where H. armigera has been discovered. DNA barcoding and restriction fragment length polymorphism (RFLP) analyses have been reported in publications to distinguish these species, but these methods both require post-PCR processing (i.e., DNA sequencing or restriction digestion) to complete. We report the first real-time PCR assay to distinguish these pests based on two hydrolysis probes that bind to a segment of the internal transcribed spacer region 2 (ITS2) amplified using a single primer pair. One probe targets H. armigera, the second probe targets H. zea, and a third probe that targets a conserved segment of 18S rDNA is used as a control of DNA quality. The assay can be completed in 50 minutes when using isolated DNA and is successfully tested on larvae intercepted at ports of entry and adults captured during domestic surveys. We demonstrate that the assay can be run in triplex with no negative effects on sensitivity, can be run using alternative real-time PCR reagents and instruments, and does not cross react with other New World Heliothinae.

  17. A distinct class of homeodomain proteins is encoded by two sequentially expressed Drosophila genes from the 93D/E cluster.

    PubMed Central

    Jagla, K; Stanceva, I; Dretzen, G; Bellard, F; Bellard, M

    1994-01-01

    Homeodomains appear to be one of the most frequently employed DNA-binding domains in a superfamily of transacting factors. It is likely that during evolution several sub-types of homeodomain have evolved from a common ancestral domain, resulting in distinct but closely related DNA-binding preferences. Here we describe the conservation of a distinct type of homeodomain encoded by the Drosophila lady-bird-late (lbl) gene, previously named nkch4 (1). Using degenerate PCR primers corresponding to the most divergent regions of the first and third helix of the Lbl homeodomain we have amplified, from genomic DNA of the fly, a lady-bird-like homeobox fragment. The Drosophila PCR products contained both the lbl (1) and a highly related homeobox sequence, which we named lady-bird-early (lbe). This new Drosophila gene resides directly upstream to lbl and together with tinman/NK4 (2, 3, 4, 5), bagpipe/NK3 (2, 4) S59/NK1 (4, 6) and 93Bal (7) compose the 93D/E homeobox gene cluster. Ibe and lbl are transcribed from the same strand and in a temporal order corresponding to their 5'-3' chromosomal location. Transcripts of both genes are found in the epiderm of Drosophila embryos, in cells known to express a segment polarity gene wingless (8), and their spatial and temporal colinearity of expression strongly suggests that they cooperate during segmentation. The amino-acid composition of both Lady-bird homeodomains differ from that of Antp-type at several positions involved in DNA recognition. These substitutions appear to modify DNA-binding preferences since Lbl homeodomain is unable to recognize the most common homeodomain binding TAAT motif in gel retardation experiments. Images PMID:7909370

  18. [Molecular cloning of the DNA sequence of activin beta A subunit gene mature peptides from panda and related species and its application in the research of phylogeny and taxonomy].

    PubMed

    Wang, Xiao-Jing; Wang, Xiao-Xing; Wang, Ya-Jun; Wang, Xi-Zhong; He, Guang-Xin; Chen, Hong-Wei; Fei, Li-Song

    2002-09-01

    Activin, which is included in the transforming growth factor-beta (TGF beta) superfamily of proteins and receptors, is known to have broad-ranging effects in the creatures. The mature peptide of beta A subunit of this gene, one of the most highly conserved sequence, can elevate the basal secretion of follicle-stimulating hormone (FSH) in the pituitary and FSH is pivotal to organism's reproduction. Reproduction block is one of the main reasons which cause giant panda to extinct. The sequence of Activin beta A subunit gene mature peptides has been successfully amplified from giant panda, red panda and malayan sun bear's genomic DNA by using polymerase chain reaction (PCR) with a pair of degenerate primers. The PCR products were cloned into the vector pBlueScript+ of Esherichia coli. Sequence analysis of Activin beta A subunit gene mature peptides shows that the length of this gene segment is the same (359 bp) and there is no intron in all three species. The sequence encodes a peptide of 119 amino acid residues. The homology comparison demonstrates 93.9% DNA homology and 99% homology in amino acid among these three species. Both GenBank blast search result and restriction enzyme map reveal that the sequences of Activin beta A subunit gene mature peptides of different species are highly conserved during the evolution process. Phylogeny analysis is performed with PHYLIP software package. A consistent phylogeny tree has been drawn with three different methods. The software analysis outcome accords with the academic view that giant panda has a closer relationship to the malayan sun bear than the red panda. Giant panda should be grouped into the bear family (Uersidae) with the malayan sun bear. As to the red panda, it would be better that this animal be grouped into the unique family (red panda family) because of great difference between the red panda and the bears (Uersidae).

  19. Primer-independent RNA sequencing with bacteriophage phi6 RNA polymerase and chain terminators.

    PubMed

    Makeyev, E V; Bamford, D H

    2001-05-01

    Here we propose a new general method for directly determining RNA sequence based on the use of the RNA-dependent RNA polymerase from bacteriophage phi6 and the chain terminators (RdRP sequencing). The following properties of the polymerase render it appropriate for this application: (1) the phi6 polymerase can replicate a number of single-stranded RNA templates in vitro. (2) In contrast to the primer-dependent DNA polymerases utilized in the sequencing procedure by Sanger et al. (Proc Natl Acad Sci USA, 1977, 74:5463-5467), it initiates nascent strand synthesis without a primer, starting the polymerization on the very 3'-terminus of the template. (3) The polymerase can incorporate chain-terminating nucleotide analogs into the nascent RNA chain to produce a set of base-specific termination products. Consequently, 3' proximal or even complete sequence of many target RNA molecules can be rapidly deduced without prior sequence information. The new technique proved useful for sequencing several synthetic ssRNA templates. Furthermore, using genomic segments of the bluetongue virus we show that RdRP sequencing can also be applied to naturally occurring dsRNA templates. This suggests possible uses of the method in the RNA virus research and diagnostics.

  20. Library Resources for Bac End Sequencing. Final Technical Report

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Pieter J. de Jong

    2000-10-01

    Studies directed towards the specific aims outlined for this research award are summarized. The RPCI II Human Bac Library has been expanded by the addition of 6.9-fold genomic coverage. This segment has been generated from a MBOI partial digest of the same anonymous donor DNA used for the rest of the library. A new cloning vector, pTARBAC1, has been constructed and used in the construction of RPCI-II segment 5. This new cloning vector provides a new strategy in identifying targeted genomic regions and will greatly facilitate a large-scale analysis for positional cloning. A new maleCS7BC/6J mouse BAC library has beenmore » constructed. RPCI-23 contain 576 plates (approx 210,000 clones) and represents approximately 11-fold coverage of the mouse genome.« less

  1. Cytochrome c oxidase subunit I barcoding of the green bee-eater (Merops orientalis).

    PubMed

    Arif, I A; Khan, H A; Shobrak, M; Williams, J

    2011-10-21

    DNA barcoding using mitochondrial cytochrome c oxidase subunit I (COI) is regarded as a standard method for species identification. Recent reports have also shown extended applications of COI gene analysis in phylogeny and molecular diversity studies. The bee-eaters are a group of near passerine birds in the family Meropidae. There are 26 species worldwide; five of them are found in Saudi Arabia. Until now, GenBank included a COI barcode for only one species of bee-eater, the European bee-eater (Merops apiaster). We sequenced the 694-bp segment of the COI gene of the green bee-eater M. orientalis and compared the sequences with those of M. apiaster. Pairwise sequence comparison showed 66 variable sites across all the eight sequences from both species, with an interspecific genetic distance of 0.0362. Two and one within-species variable sites were found, with genetic distances of 0.0005 and 0.0003 for M. apiaster and M. orientalis, respectively. This is the first study reporting barcodes for M. orientalis.

  2. Mitochondrial DNA variation in bull trout (Salvelinus confluentus) from northwestern North America: implications for zoogeography and conservation.

    PubMed

    Taylor, E B; Pollard, S; Louie, D

    1999-07-01

    Bull trout, Salvelinus confluentus (Salmonidae), are distributed in northwestern North America from Nevada to Yukon Territory, largely in interior drainages. The species is of conservation concern owing to declines in abundance, particularly in southern portions of its range. To investigate phylogenetic structure within bull trout that might form the basis for the delineation of major conservation units, we conducted a mitochondrial DNA (mtDNA) survey in bull trout from throughout its range. Restriction fragment length polymorphism (RFLP) analysis of four segments of the mtDNA genome with 11 restriction enzymes resolved 21 composite haplotypes that differed by an average of 0.5% in sequence. One group of haplotypes predominated in 'coastal' areas (west of the coastal mountain ranges) while another predominated in 'interior' regions (east of the coastal mountains). The two putative lineages differed by 0.8% in sequence and were also resolved by sequencing a portion of the ND1 gene in a representative of each RFLP haplotype. Significant variation existed within individual sample sites (12% of total variation) and among sites within major geographical regions (33%), but most variation (55%) was associated with differences between coastal and interior regions. We concluded that: (i) bull trout are subdivided into coastal and interior lineages; (ii) this subdivision reflects recent historical isolation in two refugia south of the Cordilleran ice sheet during the Pleistocene: the Chehalis and Columbia refugia; and (iii) most of the molecular variation resides at the interpopulation and inter-region levels. Conservation efforts, therefore, should focus on maintaining as many populations as possible across as many geographical regions as possible within both coastal and interior lineages.

  3. Effective gene prediction by high resolution frequency estimator based on least-norm solution technique

    PubMed Central

    2014-01-01

    Linear algebraic concept of subspace plays a significant role in the recent techniques of spectrum estimation. In this article, the authors have utilized the noise subspace concept for finding hidden periodicities in DNA sequence. With the vast growth of genomic sequences, the demand to identify accurately the protein-coding regions in DNA is increasingly rising. Several techniques of DNA feature extraction which involves various cross fields have come up in the recent past, among which application of digital signal processing tools is of prime importance. It is known that coding segments have a 3-base periodicity, while non-coding regions do not have this unique feature. One of the most important spectrum analysis techniques based on the concept of subspace is the least-norm method. The least-norm estimator developed in this paper shows sharp period-3 peaks in coding regions completely eliminating background noise. Comparison of proposed method with existing sliding discrete Fourier transform (SDFT) method popularly known as modified periodogram method has been drawn on several genes from various organisms and the results show that the proposed method has better as well as an effective approach towards gene prediction. Resolution, quality factor, sensitivity, specificity, miss rate, and wrong rate are used to establish superiority of least-norm gene prediction method over existing method. PMID:24386895

  4. The utility of mtDNA and rDNA for barcoding and phylogeny of plant-parasitic nematodes from Longidoridae (Nematoda, Enoplea).

    PubMed

    Palomares-Rius, J E; Cantalapiedra-Navarrete, C; Archidona-Yuste, A; Subbotin, S A; Castillo, P

    2017-09-07

    The traditional identification of plant-parasitic nematode species by morphology and morphometric studies is very difficult because of high morphological variability that can lead to considerable overlap of many characteristics and their ambiguous interpretation. For this reason, it is essential to implement approaches to ensure accurate species identification. DNA barcoding aids in identification and advances species discovery. This study sought to unravel the use of the mitochondrial marker cytochrome c oxidase subunit 1 (coxI) as barcode for Longidoridae species identification, and as a phylogenetic marker. The results showed that mitochondrial and ribosomal markers could be used as barcoding markers, except for some species from the Xiphinema americanum group. The ITS1 region showed a promising role in barcoding for species identification because of the clear molecular variability among species. Some species presented important molecular variability in coxI. The analysis of the newly provided sequences and the sequences deposited in GenBank showed plausible misidentifications, and the use of voucher species and topotype specimens is a priority for this group of nematodes. The use of coxI and D2 and D3 expansion segments of the 28S rRNA gene did not clarify the phylogeny at the genus level.

  5. Integration of narrow-host-range vectors from Escherichia coli into the genomes of amino acid-producing corynebacteria after intergeneric conjugation.

    PubMed

    Mateos, L M; Schäfer, A; Kalinowski, J; Martin, J F; Pühler, A

    1996-10-01

    Conjugative transfer of mobilizable derivatives of the Escherichia coli narrow-host-range plasmids pBR322, pBR325, pACYC177, and pACYC184 from E. coli to species of the gram-positive genera Corynebacterium and Brevibacterium resulted in the integration of the plasmids into the genomes of the recipient bacteria. Transconjugants appeared at low frequencies and reproducibly with a delay of 2 to 3 days compared with matings with replicative vectors. Southern analysis of corynebacterial transconjugants and nucleotide sequences from insertion sites revealed that integration occurs at different locations and that different parts of the vector are involved in the process. Integration is not dependent on indigenous insertion sequence elements but results from recombination between very short homologous DNA segments (8 to 12 bp) present in the vector and in the host DNA. In the majority of the cases (90%), integration led to cointegrate formation, and in some cases, deletions or rearrangements occurred during the recombination event. Insertions were found to be quite stable even in the absence of selective pressure.

  6. Cloning, sequencing, and analysis of the griseusin polyketide synthase gene cluster from Streptomyces griseus.

    PubMed Central

    Yu, T W; Bibb, M J; Revill, W P; Hopwood, D A

    1994-01-01

    A fragment of DNA was cloned from the Streptomyces griseus K-63 genome by using genes (act) for the actinorhodin polyketide synthase (PKS) of Streptomyces coelicolor as a probe. Sequencing of a 5.4-kb segment of the cloned DNA revealed a set of five gris open reading frames (ORFs), corresponding to the act PKS genes, in the following order: ORF1 for a ketosynthase, ORF2 for a chain length-determining factor, ORF3 for an acyl carrier protein, ORF5 for a ketoreductase, and ORF4 for a cyclase-dehydrase. Replacement of the gris genes with a marker gene in the S. griseus genome by using a single-stranded suicide vector propagated in Escherichia coli resulted in loss of the ability to produce griseusins A and B, showing that the five gris genes do indeed encode the type II griseusin PKS. These genes, encoding a PKS that is programmed differently from those for other aromatic PKSs so far available, will provide further valuable material for analysis of the programming mechanism by the construction and analysis of strains carrying hybrid PKS. Images PMID:8169211

  7. Integration of narrow-host-range vectors from Escherichia coli into the genomes of amino acid-producing corynebacteria after intergeneric conjugation.

    PubMed Central

    Mateos, L M; Schäfer, A; Kalinowski, J; Martin, J F; Pühler, A

    1996-01-01

    Conjugative transfer of mobilizable derivatives of the Escherichia coli narrow-host-range plasmids pBR322, pBR325, pACYC177, and pACYC184 from E. coli to species of the gram-positive genera Corynebacterium and Brevibacterium resulted in the integration of the plasmids into the genomes of the recipient bacteria. Transconjugants appeared at low frequencies and reproducibly with a delay of 2 to 3 days compared with matings with replicative vectors. Southern analysis of corynebacterial transconjugants and nucleotide sequences from insertion sites revealed that integration occurs at different locations and that different parts of the vector are involved in the process. Integration is not dependent on indigenous insertion sequence elements but results from recombination between very short homologous DNA segments (8 to 12 bp) present in the vector and in the host DNA. In the majority of the cases (90%), integration led to cointegrate formation, and in some cases, deletions or rearrangements occurred during the recombination event. Insertions were found to be quite stable even in the absence of selective pressure. PMID:8824624

  8. Specification of anteroposterior cell fates in Caenorhabditis elegans by Drosophila Hox proteins.

    PubMed

    Hunter, C P; Kenyon, C

    1995-09-21

    Antennapedia class homeobox (Hox) genes specify cell fates in successive anteroposterior body domains in vertebrates, insects and nematodes. The DNA-binding homeodomain sequences are very similar between vertebrate and Drosophila Hox proteins, and this similarity allows vertebrate Hox proteins to function in Drosophila. In contrast, the Caenorhabditis elegans homeodomains are substantially divergent. Further, C. elegans differs from both insects and vertebrates in having a non-segmented body as well as a distinctive mode of development that involves asymmetric early cleavages and invariant cell lineages. Here we report that, despite these differences, Drosophila Hox proteins expressed in C. elegans can substitute for C. elegans Hox proteins in the control of three different cell-fate decisions: the regulation of cell migration, the specification of serotonergic neurons, and the specification of a sensory structure. We also show that the specificity of one C. elegans Hox protein is partly determined by two amino acids that have been implicated in sequence-specific DNA binding. Together these findings suggest that factors important for target recognition by specific Hox proteins have been conserved throughout much of the animal kingdom.

  9. A segmentation method for lung nodule image sequences based on superpixels and density-based spatial clustering of applications with noise

    PubMed Central

    Zhang, Wei; Zhang, Xiaolong; Qiang, Yan; Tian, Qi; Tang, Xiaoxian

    2017-01-01

    The fast and accurate segmentation of lung nodule image sequences is the basis of subsequent processing and diagnostic analyses. However, previous research investigating nodule segmentation algorithms cannot entirely segment cavitary nodules, and the segmentation of juxta-vascular nodules is inaccurate and inefficient. To solve these problems, we propose a new method for the segmentation of lung nodule image sequences based on superpixels and density-based spatial clustering of applications with noise (DBSCAN). First, our method uses three-dimensional computed tomography image features of the average intensity projection combined with multi-scale dot enhancement for preprocessing. Hexagonal clustering and morphological optimized sequential linear iterative clustering (HMSLIC) for sequence image oversegmentation is then proposed to obtain superpixel blocks. The adaptive weight coefficient is then constructed to calculate the distance required between superpixels to achieve precise lung nodules positioning and to obtain the subsequent clustering starting block. Moreover, by fitting the distance and detecting the change in slope, an accurate clustering threshold is obtained. Thereafter, a fast DBSCAN superpixel sequence clustering algorithm, which is optimized by the strategy of only clustering the lung nodules and adaptive threshold, is then used to obtain lung nodule mask sequences. Finally, the lung nodule image sequences are obtained. The experimental results show that our method rapidly, completely and accurately segments various types of lung nodule image sequences. PMID:28880916

  10. DNA motion capture reveals the mechanical properties of DNA at the mesoscale.

    PubMed

    Price, Allen C; Pilkiewicz, Kevin R; Graham, Thomas G W; Song, Dan; Eaves, Joel D; Loparo, Joseph J

    2015-05-19

    Single-molecule studies probing the end-to-end extension of long DNAs have established that the mechanical properties of DNA are well described by a wormlike chain force law, a polymer model where persistence length is the only adjustable parameter. We present a DNA motion-capture technique in which DNA molecules are labeled with fluorescent quantum dots at specific sites along the DNA contour and their positions are imaged. Tracking these positions in time allows us to characterize how segments within a long DNA are extended by flow and how fluctuations within the molecule are correlated. Utilizing a linear response theory of small fluctuations, we extract elastic forces for the different, ∼2-μm-long segments along the DNA backbone. We find that the average force-extension behavior of the segments can be well described by a wormlike chain force law with an anomalously small persistence length. Copyright © 2015 Biophysical Society. Published by Elsevier Inc. All rights reserved.

  11. Programmed self-assembly of DNA/RNA for biomedical applications

    NASA Astrophysics Data System (ADS)

    Wang, Pengfei

    Three self-assembly strategies were utilized for assembly of novel functional DNA/RNA nanostructures. RNA-DNA hybrid origami method was developed to fabricate nano-objects (ribbon, rectangle, and triangle) with precisely controlled geometry. Unlike conventional DNA origami which use long DNA single strand as scaffold, a long RNA single strand was used instead, which was folded by short DNA single strands (staples) into prescribed objects through sequence specific hybridization between RNA and DNA. Single stranded tiles (SST) and RNA-DNA hybrid origami were utilized to fabricate a variety of barcode-like nanostructures with unique patterns by expanding a plain rectangle via introducing spacers (10-bp dsDNA segment) between parallel duplexes. Finally, complex 2D array and 3D polyhedrons with multiple patterns within one structure were assembled from simple DNA motifs. Two demonstrations of biomedical applications of DNA nanotechnology were presented. Firstly, lambda-DNA was used as template to direct the fabrication of multi-component magnetic nanoparticle chains. Nuclear magnetic relaxation (NMR) characterization showed superb magnetic relaxativity of the nanoparticle chains which have large potential to be utilized as MRI contrast agents. Secondly, DNA nanotechnology was introduced into the conformational study of a routinely used catalytic DNAzyme, the RNA-cleaving 10-23 DNAzyme. The relative angle between two flanking duplexes of the catalytic core was determined (94.8°), which shall be able to provide a clue to further understanding of the cleaving mechanism of this DNAzyme from a conformational perspective.

  12. Accelerated Evolution of the ASPM Gene Controlling Brain Size Begins Prior to Human Brain Expansion

    PubMed Central

    Solomon, Gregory; Gersch, William; Yoon, Young-Ho; Collura, Randall; Ruvolo, Maryellen; Barrett, J. Carl; Woods, C. Geoffrey; Walsh, Christopher A

    2004-01-01

    Primary microcephaly (MCPH) is a neurodevelopmental disorder characterized by global reduction in cerebral cortical volume. The microcephalic brain has a volume comparable to that of early hominids, raising the possibility that some MCPH genes may have been evolutionary targets in the expansion of the cerebral cortex in mammals and especially primates. Mutations in ASPM, which encodes the human homologue of a fly protein essential for spindle function, are the most common known cause of MCPH. Here we have isolated large genomic clones containing the complete ASPM gene, including promoter regions and introns, from chimpanzee, gorilla, orangutan, and rhesus macaque by transformation-associated recombination cloning in yeast. We have sequenced these clones and show that whereas much of the sequence of ASPM is substantially conserved among primates, specific segments are subject to high Ka/Ks ratios (nonsynonymous/synonymous DNA changes) consistent with strong positive selection for evolutionary change. The ASPM gene sequence shows accelerated evolution in the African hominoid clade, and this precedes hominid brain expansion by several million years. Gorilla and human lineages show particularly accelerated evolution in the IQ domain of ASPM. Moreover, ASPM regions under positive selection in primates are also the most highly diverged regions between primates and nonprimate mammals. We report the first direct application of TAR cloning technology to the study of human evolution. Our data suggest that evolutionary selection of specific segments of the ASPM sequence strongly relates to differences in cerebral cortical size. PMID:15045028

  13. Full-Genome Sequencing as a Basis for Molecular Epidemiology Studies of Bluetongue Virus in India

    PubMed Central

    Maan, Sushila; Maan, Narender S.; Belaganahalli, Manjunatha N.; Rao, Pavuluri Panduranga; Singh, Karam Pal; Hemadri, Divakar; Putty, Kalyani; Kumar, Aman; Batra, Kanisht; Krishnajyothi, Yadlapati; Chandel, Bharat S.; Reddy, G. Hanmanth; Nomikou, Kyriaki; Reddy, Yella Narasimha; Attoui, Houssam; Hegde, Nagendra R.; Mertens, Peter P. C.

    2015-01-01

    Since 1998 there have been significant changes in the global distribution of bluetongue virus (BTV). Ten previously exotic BTV serotypes have been detected in Europe, causing severe disease outbreaks in naïve ruminant populations. Previously exotic BTV serotypes were also identified in the USA, Israel, Australia and India. BTV is transmitted by biting midges (Culicoides spp.) and changes in the distribution of vector species, climate change, increased international travel and trade are thought to have contributed to these events. Thirteen BTV serotypes have been isolated in India since first reports of the disease in the country during 1964. Efficient methods for preparation of viral dsRNA and cDNA synthesis, have facilitated full-genome sequencing of BTV strains from the region. These studies introduce a new approach for BTV characterization, based on full-genome sequencing and phylogenetic analyses, facilitating the identification of BTV serotype, topotype and reassortant strains. Phylogenetic analyses show that most of the equivalent genome-segments of Indian BTV strains are closely related, clustering within a major eastern BTV ‘topotype’. However, genome-segment 5 (Seg-5) encoding NS1, from multiple post 1982 Indian isolates, originated from a western BTV topotype. All ten genome-segments of BTV-2 isolates (IND2003/01, IND2003/02 and IND2003/03) are closely related (>99% identity) to a South African BTV-2 vaccine-strain (western topotype). Similarly BTV-10 isolates (IND2003/06; IND2005/04) show >99% identity in all genome segments, to the prototype BTV-10 (CA-8) strain from the USA. These data suggest repeated introductions of western BTV field and/or vaccine-strains into India, potentially linked to animal or vector-insect movements, or unauthorised use of ‘live’ South African or American BTV-vaccines in the country. The data presented will help improve nucleic acid based diagnostics for Indian serotypes/topotypes, as part of control strategies. PMID:26121128

  14. Synthetic biology approach for plant protection using dsRNA.

    PubMed

    Niehl, Annette; Soininen, Marjukka; Poranen, Minna M; Heinlein, Manfred

    2018-02-26

    Pathogens induce severe damages on cultivated plants and represent a serious threat to global food security. Emerging strategies for crop protection involve the external treatment of plants with double-stranded (ds)RNA to trigger RNA interference. However, applying this technology in greenhouses and fields depends on dsRNA quality, stability and efficient large-scale production. Using components of the bacteriophage phi6, we engineered a stable and accurate in vivo dsRNA production system in Pseudomonas syringae bacteria. Unlike other in vitro or in vivo dsRNA production systems that rely on DNA transcription and postsynthetic alignment of single-stranded RNA molecules, the phi6 system is based on the replication of dsRNA by an RNA-dependent RNA polymerase, thus allowing production of high-quality, long dsRNA molecules. The phi6 replication complex was reprogrammed to multiply dsRNA sequences homologous to tobacco mosaic virus (TMV) by replacing the coding regions within two of the three phi6 genome segments with TMV sequences and introduction of these constructs into P. syringae together with the third phi6 segment, which encodes the components of the phi6 replication complex. The stable production of TMV dsRNA was achieved by combining all the three phi6 genome segments and by maintaining the natural dsRNA sizes and sequence elements required for efficient replication and packaging of the segments. The produced TMV-derived dsRNAs inhibited TMV propagation when applied to infected Nicotiana benthamiana plants. The established dsRNA production system enables the broad application of dsRNA molecules as an efficient, highly flexible, nontransgenic and environmentally friendly approach for protecting crops against viruses and other pathogens. © 2018 The Authors. Plant Biotechnology Journal published by Society for Experimental Biology and The Association of Applied Biologists and John Wiley & Sons Ltd.

  15. Performances of Different Fragment Sizes for Reduced Representation Bisulfite Sequencing in Pigs.

    PubMed

    Yuan, Xiao-Long; Zhang, Zhe; Pan, Rong-Yang; Gao, Ning; Deng, Xi; Li, Bin; Zhang, Hao; Sangild, Per Torp; Li, Jia-Qi

    2017-01-01

    Reduced representation bisulfite sequencing (RRBS) has been widely used to profile genome-scale DNA methylation in mammalian genomes. However, the applications and technical performances of RRBS with different fragment sizes have not been systematically reported in pigs, which serve as one of the important biomedical models for humans. The aims of this study were to evaluate capacities of RRBS libraries with different fragment sizes to characterize the porcine genome. We found that the Msp I-digested segments between 40 and 220 bp harbored a high distribution peak at 74 bp, which were highly overlapped with the repetitive elements and might reduce the unique mapping alignment. The RRBS library of 110-220 bp fragment size had the highest unique mapping alignment and the lowest multiple alignment. The cost-effectiveness of the 40-110 bp, 110-220 bp and 40-220 bp fragment sizes might decrease when the dataset size was more than 70, 50 and 110 million reads for these three fragment sizes, respectively. Given a 50-million dataset size, the average sequencing depth of the detected CpG sites in the 110-220 bp fragment size appeared to be deeper than in the 40-110 bp and 40-220 bp fragment sizes, and these detected CpG sties differently located in gene- and CpG island-related regions. In this study, our results demonstrated that selections of fragment sizes could affect the numbers and sequencing depth of detected CpG sites as well as the cost-efficiency. No single solution of RRBS is optimal in all circumstances for investigating genome-scale DNA methylation. This work provides the useful knowledge on designing and executing RRBS for investigating the genome-wide DNA methylation in tissues from pigs.

  16. rbcL and matK earn two thumbs up as the core DNA barcode for ferns.

    PubMed

    Li, Fay-Wei; Kuo, Li-Yaung; Rothfels, Carl J; Ebihara, Atsushi; Chiou, Wen-Liang; Windham, Michael D; Pryer, Kathleen M

    2011-01-01

    DNA barcoding will revolutionize our understanding of fern ecology, most especially because the accurate identification of the independent but cryptic gametophyte phase of the fern's life history--an endeavor previously impossible--will finally be feasible. In this study, we assess the discriminatory power of the core plant DNA barcode (rbcL and matK), as well as alternatively proposed fern barcodes (trnH-psbA and trnL-F), across all major fern lineages. We also present plastid barcode data for two genera in the hyperdiverse polypod clade--Deparia (Woodsiaceae) and the Cheilanthes marginata group (currently being segregated as a new genus of Pteridaceae)--to further evaluate the resolving power of these loci. Our results clearly demonstrate the value of matK data, previously unavailable in ferns because of difficulties in amplification due to a major rearrangement of the plastid genome. With its high sequence variation, matK complements rbcL to provide a two-locus barcode with strong resolving power. With sequence variation comparable to matK, trnL-F appears to be a suitable alternative barcode region in ferns, and perhaps should be added to the core barcode region if universal primer development for matK fails. In contrast, trnH-psbA shows dramatically reduced sequence variation for the majority of ferns. This is likely due to the translocation of this segment of the plastid genome into the inverted repeat regions, which are known to have a highly constrained substitution rate. Our study provides the first endorsement of the two-locus barcode (rbcL+matK) in ferns, and favors trnL-F over trnH-psbA as a potential back-up locus. Future work should focus on gathering more fern matK sequence data to facilitate universal primer development.

  17. Complete mitochondrial DNA sequence of oyster Crassostrea hongkongensis-a case of "Tandem duplication-random loss" for genome rearrangement in Crassostrea?

    PubMed Central

    Yu, Ziniu; Wei, Zhengpeng; Kong, Xiaoyu; Shi, Wei

    2008-01-01

    Background Mitochondrial DNA sequences are extensively used as genetic markers not only for studies of population or ecological genetics, but also for phylogenetic and evolutionary analyses. Complete mt-sequences can reveal information about gene order and its variation, as well as gene and genome evolution when sequences from multiple phyla are compared. Mitochondrial gene order is highly variable among mollusks, with bivalves exhibiting the most variability. Of the 41 complete mt genomes sequenced so far, 12 are from bivalves. We determined, in the current study, the complete mitochondrial DNA sequence of Crassostrea hongkongensis. We present here an analysis of features of its gene content and genome organization in comparison with two other Crassostrea species to assess the variation within bivalves and among main groups of mollusks. Results The complete mitochondrial genome of C. hongkongensis was determined using long PCR and a primer walking sequencing strategy with genus-specific primers. The genome is 16,475 bp in length and contains 12 protein-coding genes (the atp8 gene is missing, as in most bivalves), 22 transfer tRNA genes (including a suppressor tRNA gene), and 2 ribosomal RNA genes, all of which appear to be transcribed from the same strand. A striking finding of this study is that a DNA segment containing four tRNA genes (trnk1, trnC, trnQ1 and trnN) and two duplicated or split rRNA gene (rrnL5' and rrnS) are absent from the genome, when compared with that of two other extant Crassostrea species, which is very likely a consequence of loss of a single genomic region present in ancestor of C. hongkongensis. It indicates this region seem to be a "hot spot" of genomic rearrangements over the Crassostrea mt-genomes. The arrangement of protein-coding genes in C. hongkongensis is identical to that of Crassostrea gigas and Crassostrea virginica, but higher amino acid sequence identities are shared between C. hongkongensis and C. gigas than between other pairs. There exists significant codon bias, favoring codons ending in A or T and against those ending with C. Pair analysis of genome rearrangements showed that the rearrangement distance is great between C. gigas-C. hongkongensis and C. virginica, indicating a high degree of rearrangements within Crassostrea. The determination of complete mt-genome of C. hongkongensis has yielded useful insight into features of gene order, variation, and evolution of Crassostrea and bivalve mt-genomes. Conclusion The mt-genome of C. hongkongensis shares some similarity with, and interesting differences to, other Crassostrea species and bivalves. The absence of trnC and trnN genes and duplicated or split rRNA genes from the C. hongkongensis genome is a completely novel feature not previously reported in Crassostrea species. The phenomenon is likely due to the loss of a segment that is present in other Crassostrea species and was present in ancestor of C. hongkongensis, thus a case of "tandem duplication-random loss (TDRL)". The mt-genome and new feature presented here reveal and underline the high level variation of gene order and gene content in Crassostrea and bivalves, inspiring more research to gain understanding to mechanisms underlying gene and genome evolution in bivalves and mollusks. PMID:18847502

  18. Random Amplification and Pyrosequencing for Identification of Novel Viral Genome Sequences

    PubMed Central

    Hang, Jun; Forshey, Brett M.; Kochel, Tadeusz J.; Li, Tao; Solórzano, Víctor Fiestas; Halsey, Eric S.; Kuschner, Robert A.

    2012-01-01

    ssRNA viruses have high levels of genomic divergence, which can lead to difficulty in genomic characterization of new viruses using traditional PCR amplification and sequencing methods. In this study, random reverse transcription, anchored random PCR amplification, and high-throughput pyrosequencing were used to identify orthobunyavirus sequences from total RNA extracted from viral cultures of acute febrile illness specimens. Draft genome sequence for the orthobunyavirus L segment was assembled and sequentially extended using de novo assembly contigs from pyrosequencing reads and orthobunyavirus sequences in GenBank as guidance. Accuracy and continuous coverage were achieved by mapping all reads to the L segment draft sequence. Subsequently, RT-PCR and Sanger sequencing were used to complete the genome sequence. The complete L segment was found to be 6936 bases in length, encoding a 2248-aa putative RNA polymerase. The identified L segment was distinct from previously published South American orthobunyaviruses, sharing 63% and 54% identity at the nucleotide and amino acid level, respectively, with the complete Oropouche virus L segment and 73% and 81% identity at the nucleotide and amino acid level, respectively, with a partial Caraparu virus L segment. The result demonstrated the effectiveness of a sequence-independent amplification and next-generation sequencing approach for obtaining complete viral genomes from total nucleic acid extracts and its use in pathogen discovery. PMID:22468136

  19. Identity of the segment of human complement C8 recognized by complement regulatory protein CD59.

    PubMed

    Lockert, D H; Kaufman, K M; Chang, C P; Hüsler, T; Sodetz, J M; Sims, P J

    1995-08-25

    CD59 antigen is a membrane glycoprotein that inhibits the activity of the C5b-9 membrane attack complex (MAC), thereby protecting human cells from lysis by human complement. The inhibitory function of CD59 derives from its capacity to interact with both the C8 and C9 components of MAC, preventing assembly of membrane-inserted C9 polymer. MAC-inhibitory activity of CD59 is species-selective and is most effective when both C8 and C9 derive from human or other primate plasma. Rabbit C8 and C9, which can substitute for human C8 and C9 in MAC, mediate virtually unrestricted lysis of human cells expressing CD59. In order to identify the segment of human C8 that is recognized by CD59, recombinant peptides containing human or rabbit C8 sequence were expressed in Escherichia coli and purified. CD59 was found to specifically bind to a peptide corresponding to residues 334-385 of the human C8 alpha-subunit, and to require a disulfide bond between Cys345 and Cys369. No specific binding was observed to the corresponding sequence from rabbit C8 alpha (residues 334-386). To obtain functional evidence that this segment of human C8 alpha is selectively recognized by CD59, recombinant C8 proteins were prepared by co-transfecting COS-7 cells with human/rabbit chimeras of the C8 alpha cDNA, and cDNAs encoding the C8 beta and C8 gamma chains. Hemolytic activity of MAC formed with chimeric C8 was analyzed using target cells reconstituted with CD59. These experiments confirmed that CD59 recognizes a conformationally sensitive epitope that is within a segment of human C8 alpha internal to residues 320-415. Our data also suggest that optimal interaction of CD59 with this segment of human C8 alpha is influenced by N-terminal flanking sequence in C8 alpha and by human C8 beta, but is unaffected by C8 gamma.

  20. The mouse neuronal cell surface protein F3: a phosphatidylinositol- anchored member of the immunoglobulin superfamily related to chicken contactin

    PubMed Central

    1989-01-01

    Several members of the Ig superfamily are expressed on neural cells where they participate in surface interactions between cell bodies and processes. Their Ig domains are more closely related to each other than to Ig variable and constant domains and have been grouped into the C2 set. Here, we report the cloning and characterization of another member of this group, the mouse neuronal cell surface antigen F3. The F3 cDNA sequence contains an open reading frame that could encode a 1,020-amino acid protein consisting of a signal sequence, six Ig-like domains of the C2 type, a long premembrane region containing two segments that exhibit sequence similarity to fibronectin type III repeats and a moderately hydrophobic COOH-terminal sequence. The protein does not contain a typical transmembrane segment but appears to be attached to the membrane by a phosphatidylinositol anchor. Antibodies against the F3 protein recognize a prominent 135-kD protein in mouse brain. In fetal brain cultures, they stain the neuronal cell surface and, in cultures maintained in chemically defined medium, most prominently neurites and neurite bundles. The mouse f3 gene maps to band F of chromosome 15. The gene transcripts detected in the brain by F3 cDNA probes are developmentally regulated, the highest amounts being expressed between 1 and 2 wk after birth. The F3 nucleotide and deduced amino acid sequence show striking similarity to the recently published sequence of the chicken neuronal cell surface protein contactin. However, there are important differences between the two molecules. In contrast to F3, contactin has a transmembrane and a cytoplasmic domain. Whereas contactin is insoluble in nonionic detergent and is tightly associated with the cytoskeleton, about equal amounts of F3 distribute between buffer-soluble, nonionic detergent-soluble, and detergent- insoluble fractions. Among other neural cell surface proteins, F3 most resembles the neuronal cell adhesion protein L1, with 25% amino acid identity between their extracellular domains. Based on its structural similarity with known cell adhesion proteins of nervous tissue and with L1 in particular, we propose that F3 mediates cell surface interactions during nervous system development. PMID:2474555

  1. Genetic Diversity of Crimean Congo Hemorrhagic Fever Virus Strains from Iran

    PubMed Central

    Chinikar, Sadegh; Bouzari, Saeid; Shokrgozar, Mohammad Ali; Mostafavi, Ehsan; Jalali, Tahmineh; Khakifirouz, Sahar; Nowotny, Norbert; Fooks, Anthony R.; Shah-Hosseini, Nariman

    2016-01-01

    Background: Crimean Congo hemorrhagic fever virus (CCHFV) is a member of the Bunyaviridae family and Nairovirus genus. It has a negative-sense, single stranded RNA genome approximately 19.2 kb, containing the Small, Medium, and Large segments. CCHFVs are relatively divergent in their genome sequence and grouped in seven distinct clades based on S-segment sequence analysis and six clades based on M-segment sequences. Our aim was to obtain new insights into the molecular epidemiology of CCHFV in Iran. Methods: We analyzed partial and complete nucleotide sequences of the S and M segments derived from 50 Iranian patients. The extracted RNA was amplified using one-step RT-PCR and then sequenced. The sequences were analyzed using Mega5 software. Results: Phylogenetic analysis of partial S segment sequences demonstrated that clade IV-(Asia 1), clade IV-(Asia 2) and clade V-(Europe) accounted for 80 %, 4 % and 14 % of the circulating genomic variants of CCHFV in Iran respectively. However, one of the Iranian strains (Iran-Kerman/22) was associated with none of other sequences and formed a new clade (VII). The phylogenetic analysis of complete S-segment nucleotide sequences from selected Iranian CCHFV strains complemented with representative strains from GenBank revealed similar topology as partial sequences with eight major clusters. A partial M segment phylogeny positioned the Iranian strains in either association with clade III (Asia-Africa) or clade V (Europe). Conclusion: The phylogenetic analysis revealed subtle links between distant geographic locations, which we propose might originate either from international livestock trade or from long-distance carriage of CCHFV by infected ticks via bird migration. PMID:27308271

  2. Sequence Segmentation with changeptGUI.

    PubMed

    Tasker, Edward; Keith, Jonathan M

    2017-01-01

    Many biological sequences have a segmental structure that can provide valuable clues to their content, structure, and function. The program changept is a tool for investigating the segmental structure of a sequence, and can also be applied to multiple sequences in parallel to identify a common segmental structure, thus providing a method for integrating multiple data types to identify functional elements in genomes. In the previous edition of this book, a command line interface for changept is described. Here we present a graphical user interface for this package, called changeptGUI. This interface also includes tools for pre- and post-processing of data and results to facilitate investigation of the number and characteristics of segment classes.

  3. Genomic and chromatin features shaping meiotic double-strand break formation and repair in mice

    PubMed Central

    Jasin, Maria; Lange, Julian

    2017-01-01

    ABSTRACT The SPO11-generated DNA double-strand breaks (DSBs) that initiate meiotic recombination occur non-randomly across genomes, but mechanisms shaping their distribution and repair remain incompletely understood. Here, we expand on recent studies of nucleotide-resolution DSB maps in mouse spermatocytes. We find that trimethylation of histone H3 lysine 36 around DSB hotspots is highly correlated, both spatially and quantitatively, with trimethylation of H3 lysine 4, consistent with coordinated formation and action of both PRDM9-dependent histone modifications. In contrast, the DSB-responsive kinase ATM contributes independently of PRDM9 to controlling hotspot activity, and combined action of ATM and PRDM9 can explain nearly two-thirds of the variation in DSB frequency between hotspots. DSBs were modestly underrepresented in most repetitive sequences such as segmental duplications and transposons. Nonetheless, numerous DSBs form within repetitive sequences in each meiosis and some classes of repeats are preferentially targeted. Implications of these findings are discussed for evolution of PRDM9 and its role in hybrid strain sterility in mice. Finally, we document the relationship between mouse strain-specific DNA sequence variants within PRDM9 recognition motifs and attendant differences in recombination outcomes. Our results provide further insights into the complex web of factors that influence meiotic recombination patterns. PMID:28820351

  4. Multiplex APLP System for High-Resolution Haplogrouping of Extremely Degraded East-Asian Mitochondrial DNAs

    PubMed Central

    Kakuda, Tsuneo; Shojo, Hideki; Tanaka, Mayumi; Nambiar, Phrabhakaran; Minaguchi, Kiyoshi; Umetsu, Kazuo; Adachi, Noboru

    2016-01-01

    Mitochondrial DNA (mtDNA) serves as a powerful tool for exploring matrilineal phylogeographic ancestry, as well as for analyzing highly degraded samples, because of its polymorphic nature and high copy numbers per cell. The recent advent of complete mitochondrial genome sequencing has led to improved techniques for phylogenetic analyses based on mtDNA, and many multiplex genotyping methods have been developed for the hierarchical analysis of phylogenetically important mutations. However, few high-resolution multiplex genotyping systems for analyzing East-Asian mtDNA can be applied to extremely degraded samples. Here, we present a multiplex system for analyzing mitochondrial single nucleotide polymorphisms (mtSNPs), which relies on a novel amplified product-length polymorphisms (APLP) method that uses inosine-flapped primers and is specifically designed for the detailed haplogrouping of extremely degraded East-Asian mtDNAs. We used fourteen 6-plex polymerase chain reactions (PCRs) and subsequent electrophoresis to examine 81 haplogroup-defining SNPs and 3 insertion/deletion sites, and we were able to securely assign the studied mtDNAs to relevant haplogroups. Our system requires only 1×10−13 g (100 fg) of crude DNA to obtain a full profile. Owing to its small amplicon size (<110 bp), this new APLP system was successfully applied to extremely degraded samples for which direct sequencing of hypervariable segments using mini-primer sets was unsuccessful, and proved to be more robust than conventional APLP analysis. Thus, our new APLP system is effective for retrieving reliable data from extremely degraded East-Asian mtDNAs. PMID:27355212

  5. Multiplex APLP System for High-Resolution Haplogrouping of Extremely Degraded East-Asian Mitochondrial DNAs.

    PubMed

    Kakuda, Tsuneo; Shojo, Hideki; Tanaka, Mayumi; Nambiar, Phrabhakaran; Minaguchi, Kiyoshi; Umetsu, Kazuo; Adachi, Noboru

    2016-01-01

    Mitochondrial DNA (mtDNA) serves as a powerful tool for exploring matrilineal phylogeographic ancestry, as well as for analyzing highly degraded samples, because of its polymorphic nature and high copy numbers per cell. The recent advent of complete mitochondrial genome sequencing has led to improved techniques for phylogenetic analyses based on mtDNA, and many multiplex genotyping methods have been developed for the hierarchical analysis of phylogenetically important mutations. However, few high-resolution multiplex genotyping systems for analyzing East-Asian mtDNA can be applied to extremely degraded samples. Here, we present a multiplex system for analyzing mitochondrial single nucleotide polymorphisms (mtSNPs), which relies on a novel amplified product-length polymorphisms (APLP) method that uses inosine-flapped primers and is specifically designed for the detailed haplogrouping of extremely degraded East-Asian mtDNAs. We used fourteen 6-plex polymerase chain reactions (PCRs) and subsequent electrophoresis to examine 81 haplogroup-defining SNPs and 3 insertion/deletion sites, and we were able to securely assign the studied mtDNAs to relevant haplogroups. Our system requires only 1×10-13 g (100 fg) of crude DNA to obtain a full profile. Owing to its small amplicon size (<110 bp), this new APLP system was successfully applied to extremely degraded samples for which direct sequencing of hypervariable segments using mini-primer sets was unsuccessful, and proved to be more robust than conventional APLP analysis. Thus, our new APLP system is effective for retrieving reliable data from extremely degraded East-Asian mtDNAs.

  6. Patterns of DNA barcode variation in Canadian marine molluscs.

    PubMed

    Layton, Kara K S; Martel, André L; Hebert, Paul D N

    2014-01-01

    Molluscs are the most diverse marine phylum and this high diversity has resulted in considerable taxonomic problems. Because the number of species in Canadian oceans remains uncertain, there is a need to incorporate molecular methods into species identifications. A 648 base pair segment of the cytochrome c oxidase subunit I gene has proven useful for the identification and discovery of species in many animal lineages. While the utility of DNA barcoding in molluscs has been demonstrated in other studies, this is the first effort to construct a DNA barcode registry for marine molluscs across such a large geographic area. This study examines patterns of DNA barcode variation in 227 species of Canadian marine molluscs. Intraspecific sequence divergences ranged from 0-26.4% and a barcode gap existed for most taxa. Eleven cases of relatively deep (>2%) intraspecific divergence were detected, suggesting the possible presence of overlooked species. Structural variation was detected in COI with indels found in 37 species, mostly bivalves. Some indels were present in divergent lineages, primarily in the region of the first external loop, suggesting certain areas are hotspots for change. Lastly, mean GC content varied substantially among orders (24.5%-46.5%), and showed a significant positive correlation with nearest neighbour distances. DNA barcoding is an effective tool for the identification of Canadian marine molluscs and for revealing possible cases of overlooked species. Some species with deep intraspecific divergence showed a biogeographic partition between lineages on the Atlantic, Arctic and Pacific coasts, suggesting the role of Pleistocene glaciations in the subdivision of their populations. Indels were prevalent in the barcode region of the COI gene in bivalves and gastropods. This study highlights the efficacy of DNA barcoding for providing insights into sequence variation across a broad taxonomic group on a large geographic scale.

  7. SEQassembly: A Practical Tools Program for Coding Sequences Splicing

    NASA Astrophysics Data System (ADS)

    Lee, Hongbin; Yang, Hang; Fu, Lei; Qin, Long; Li, Huili; He, Feng; Wang, Bo; Wu, Xiaoming

    CDS (Coding Sequences) is a portion of mRNA sequences, which are composed by a number of exon sequence segments. The construction of CDS sequence is important for profound genetic analysis such as genotyping. A program in MATLAB environment is presented, which can process batch of samples sequences into code segments under the guide of reference exon models, and splice these code segments of same sample source into CDS according to the exon order in queue file. This program is useful in transcriptional polymorphism detection and gene function study.

  8. [Study on the genetic difference of SEO type Hantaviruses].

    PubMed

    Zhang, X; Zhou, S; Wang, H; Hu, J; Guan, Z; Liu, H

    2000-10-01

    To understand the genetic type of Hantaviruses and the difference between them caused by rodents in Beijing and to furhter explore the source of the infectious factors. Hantavirus RNA, isolated from lungs of rodents captured in Beijing and positive with Hantavirus antigens with frozen sectioning and Immunofluorescent assay, were reverse-transcribed and amplified with PCR with Hantavirus-specific primers. Five of the PCR amplifications were discovered and sequenced with 300 bp sequence data of M segments (from 2003 - 2302nt according cDNA of seoul 8039 strain). Nucleotide sequence homology showed that they were sequences of SEO-type Hantavirus. Compared with SEO type Hantavirus, the nucleotide sequence homology of these samples was more than 94% while the homology of amonia acid sequence was more than 98%. When compared with HNT type Hantavirus, the homology of nucleotide sequence became less than 72% with the homology of amonia acid sequence less than 81%. Similar to other Hantavirus of SEO type, their nucleotide sequences and deduced amino acid sequences were highly preserved. Phylogenetic tree analysis showed that the five viruses could be divided into at least 4 branches. It was quite likely that there were at least two sub-type SEO viruses with 4 branches that were circulating in Beijing.

  9. Universality of long-range correlations in expansion randomization systems

    NASA Astrophysics Data System (ADS)

    Messer, P. W.; Lässig, M.; Arndt, P. F.

    2005-10-01

    We study the stochastic dynamics of sequences evolving by single-site mutations, segmental duplications, deletions, and random insertions. These processes are relevant for the evolution of genomic DNA. They define a universality class of non-equilibrium 1D expansion-randomization systems with generic stationary long-range correlations in a regime of growing sequence length. We obtain explicitly the two-point correlation function of the sequence composition and the distribution function of the composition bias in sequences of finite length. The characteristic exponent χ of these quantities is determined by the ratio of two effective rates, which are explicitly calculated for several specific sequence evolution dynamics of the universality class. Depending on the value of χ, we find two different scaling regimes, which are distinguished by the detectability of the initial composition bias. All analytic results are accurately verified by numerical simulations. We also discuss the non-stationary build-up and decay of correlations, as well as more complex evolutionary scenarios, where the rates of the processes vary in time. Our findings provide a possible example for the emergence of universality in molecular biology.

  10. Developing Single-Molecule TPM Experiments for Direct Observation of Successful RecA-Mediated Strand Exchange Reaction

    PubMed Central

    Fan, Hsiu-Fang; Cox, Michael M.; Li, Hung-Wen

    2011-01-01

    RecA recombinases play a central role in homologous recombination. Once assembled on single-stranded (ss) DNA, RecA nucleoprotein filaments mediate the pairing of homologous DNA sequences and strand exchange processes. We have designed two experiments based on tethered particle motion (TPM) to investigate the fates of the invading and the outgoing strands during E. coli RecA-mediated pairing and strand exchange at the single-molecule level in the absence of force. TPM experiments measure the tethered bead Brownian motion indicative of the DNA tether length change resulting from RecA binding and dissociation. Experiments with beads labeled on either the invading strand or the outgoing strand showed that DNA pairing and strand exchange occurs successfully in the presence of either ATP or its non-hydrolyzable analog, ATPγS. The strand exchange rates and efficiencies are similar under both ATP and ATPγS conditions. In addition, the Brownian motion time-courses suggest that the strand exchange process progresses uni-directionally in the 5′-to-3′ fashion, using a synapse segment with a wide and continuous size distribution. PMID:21765895

  11. A structural-alphabet-based strategy for finding structural motifs across protein families

    PubMed Central

    Wu, Chih Yuan; Chen, Yao Chi; Lim, Carmay

    2010-01-01

    Proteins with insignificant sequence and overall structure similarity may still share locally conserved contiguous structural segments; i.e. structural/3D motifs. Most methods for finding 3D motifs require a known motif to search for other similar structures or functionally/structurally crucial residues. Here, without requiring a query motif or essential residues, a fully automated method for discovering 3D motifs of various sizes across protein families with different folds based on a 16-letter structural alphabet is presented. It was applied to structurally non-redundant proteins bound to DNA, RNA, obligate/non-obligate proteins as well as free DNA-binding proteins (DBPs) and proteins with known structures but unknown function. Its usefulness was illustrated by analyzing the 3D motifs found in DBPs. A non-specific motif was found with a ‘corner’ architecture that confers a stable scaffold and enables diverse interactions, making it suitable for binding not only DNA but also RNA and proteins. Furthermore, DNA-specific motifs present ‘only’ in DBPs were discovered. The motifs found can provide useful guidelines in detecting binding sites and computational protein redesign. PMID:20525797

  12. Avian acute leukemia viruses MC29 and MH2 share specific RNA sequences: Evidence for a second class of transforming genes

    PubMed Central

    Duesberg, Peter H.; Vogt, Peter K.

    1979-01-01

    The genome of the defective avian tumor virus MH2 was identified as a RNA of 5.7 kilobases by its presence in different MH2-helper virus complexes and its absence from pure helper virus, by its unique fingerprint pattern of RNase T1-resistant (T1) oligonucleotides that differed from those of two helper virus RNAs, and by its structural analogy to the RNA of MC29, another avian acute leukemia virus. Two sets of sequences were distinguished in MH2 RNA: 66% hybridized with DNA complementary to helper-independent avian tumor viruses, termed group-specific, and 34% were specific. The percentage of specific sequences is considered a minimal estimate because the MH2 RNA used was about 30% contaminated by helper virus RNA. No sequences related to the transforming src gene of avian sarcoma viruses were found in MH2. MH2 shared three large T1 oligonucleotides with MC29, two of which could also be isolated from a RNase A- and T1-resistant hybrid formed between MH2 RNA and MC29 specific cDNA. These oligonucleotides belong to a group of six that define the specific segment of MC29 RNA described previously. The group-specific sequences of MH2 and MC29 RNA shared only the two smallest out of about 20 T1 oligonucleotides associated with MH2 RNA. It is concluded that the specific sequences of MH2 and MC29 are related, and it is proposed that they are necessary for, or identical with, the onc genes of these viruses. These sequences would define a related class of transforming genes in avian tumor viruses that differs from the src genes of avian sarcoma viruses. Images PMID:221900

  13. Euglena gracilis chloroplast DNA: analysis of a 1.6 kb intron of the psb C gene containing an open reading frame of 458 codons.

    PubMed

    Montandon, P E; Vasserot, A; Stutz, E

    1986-01-01

    We retrieved a 1.6 kbp intron separating two exons of the psb C gene which codes for the 44 kDa reaction center protein of photosystem II. This intron is 3 to 4 times the size of all previously sequenced Euglena gracilis chloroplast introns. It contains an open reading frame of 458 codons potentially coding for a basic protein of 54 kDa of yet unknown function. The intron boundaries follow consensus sequences established for chloroplast introns related to class II and nuclear pre-mRNA introns. Its 3'-terminal segment has structural features similar to class II mitochondrial introns with an invariant base A as possible branch point for lariat formation.

  14. Segmentation and Recognition of Continuous Human Activity

    DTIC Science & Technology

    2001-01-01

    This paper presents a methodology for automatic segmentation and recognition of continuous human activity . We segment a continuous human activity into...commencement or termination. We use single action sequences for the training data set. The test sequences, on the other hand, are continuous sequences of human ... activity that consist of three or more actions in succession. The system has been tested on continuous activity sequences containing actions such as

  15. Gene capture from across the grass family in the allohexaploid Elymus repens (L.) Gould (Poaceae, Triticeae) as evidenced by ITS, GBSSI, and molecular cytogenetics.

    PubMed

    Mahelka, Václav; Kopecký, David

    2010-06-01

    Four accessions of hexaploid Elymus repens from its native Central European distribution area were analyzed using sequencing of multicopy (internal transcribed spacer, ITS) and single-copy (granule-bound starch synthase I, GBSSI) DNA in concert with genomic and fluorescent in situ hybridization (GISH and FISH) to disentangle its allopolyploid origin. Despite extensive ITS homogenization, nrDNA in E. repens allowed us to identify at least four distinct lineages. Apart from Pseudoroegneria and Hordeum, representing the major genome constituents, the presence of further unexpected alien genetic material, originating from species outside the Triticeae and close to Panicum (Paniceae) and Bromus (Bromeae), was revealed. GBSSI sequences provided information complementary to the ITS. Apart from Pseudoroegneria and Hordeum, two additional gene variants from within the Triticeae were discovered: One was Taeniatherum-like, but the other did not have a close relationship with any of the diploids sampled. GISH results were largely congruent with the sequence-based markers. GISH clearly confirmed Pseudoroegneria and Hordeum as major genome constituents and further showed the presence of a small chromosome segment corresponding to Panicum. It resided in the Hordeum subgenome and probably represents an old acquisition of a Hordeum progenitor. Spotty hybridization signals across all chromosomes after GISH with Taeniatherum and Bromus probes suggested that gene acquisition from these species is more likely due to common ancestry of the grasses or early introgression than to recent hybridization or allopolyploid origin of E. repens. Physical mapping of rDNA loci using FISH revealed that all rDNA loci except one minor were located on Pseudoroegneria-derived chromosomes, which suggests the loss of all Hordeum-derived loci but one. Because homogenization mechanisms seem to operate effectively among Pseudoroegneria-like copies in this species, incomplete ITS homogenization in our samples is probably due to an interstitial position of an individual minor rDNA locus located within the Hordeum-derived subgenome.

  16. Identification of polycomb and trithorax group responsive elements in the regulatory region of the Drosophila homeotic gene Sex combs reduced

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Gindhart, J.G. Jr.; Kaufman, T.C.

    1995-02-01

    The Drosophilia homeotic gene Sex combs reduced (Scr) is necessary for the establishment and maintenance of the morphological identity of the labial and prothoracic segments. In the early embryo, its expression pattern is established through the activity of several gap and segmentation gene products, as well as other transcription factors. Once established, the Polycomb group (Pc-G) and trithorax group (trx-G) gene products maintain the spatial pattern of Scr expression for the remainder of development. We report the identification of DNA fragments in the Scr regulatory region that may be important for its regulation by Polycomb and trithorax group gene products.more » When DNA fragments containing these regulatory sequences are subcloned into P-element vectors containing a white minigene, transformants containing these constructs exhibit mosaic patterns of pigmentation in the adult eye, indicating that white minigene expression is repressed in a clonally heritable manner. The size of pigmented and nonpigmented clones in the adult eye suggests that the event determining whether a cell in the eye anlagen will express white occurs at least as early as the first larval instar. The amount of white minigene repression is reduced in some Polycomb group mutants, whereas repression is enhanced in flies mutant for a subset of trithorax group loci. The repressor activity of one fragment, normally located in Scr Intron 2, is increased when it is able to homologously pair, a property consistent with genetic data suggesting that Scr exhibits transvection. Another Scr regulatory fragment, normally located 40 kb upstream of the Scr promoter, silences ectopic expression of an Scr-lacZ fusion gene in the embryo and does so in a Polycomb-dependent manner. We propose that the regulatory sequences located within these DNA fragments may normally mediate the regulation of Scr by proteins encoded by members of Polycomb and trithorax group loci. 98 refs., 6 figs., 4 tabs.« less

  17. Modeling the relaxation of internal DNA segments during genome mapping in nanochannels.

    PubMed

    Jain, Aashish; Sheats, Julian; Reifenberger, Jeffrey G; Cao, Han; Dorfman, Kevin D

    2016-09-01

    We have developed a multi-scale model describing the dynamics of internal segments of DNA in nanochannels used for genome mapping. In addition to the channel geometry, the model takes as its inputs the DNA properties in free solution (persistence length, effective width, molecular weight, and segmental hydrodynamic radius) and buffer properties (temperature and viscosity). Using pruned-enriched Rosenbluth simulations of a discrete wormlike chain model with circa 10 base pair resolution and a numerical solution for the hydrodynamic interactions in confinement, we convert these experimentally available inputs into the necessary parameters for a one-dimensional, Rouse-like model of the confined chain. The resulting coarse-grained model resolves the DNA at a length scale of approximately 6 kilobase pairs in the absence of any global hairpin folds, and is readily studied using a normal-mode analysis or Brownian dynamics simulations. The Rouse-like model successfully reproduces both the trends and order of magnitude of the relaxation time of the distance between labeled segments of DNA obtained in experiments. The model also provides insights that are not readily accessible from experiments, such as the role of the molecular weight of the DNA and location of the labeled segments that impact the statistical models used to construct genome maps from data acquired in nanochannels. The multi-scale approach used here, while focused towards a technologically relevant scenario, is readily adapted to other channel sizes and polymers.

  18. Transfection and heat-inducible expression of molluscan promoter-luciferase reporter gene constructs in the Biomphalaria glabrata embryonic snail cell line.

    PubMed

    Yoshino, T P; Wu, X J; Liu, H D

    1998-09-01

    Studies were initiated to begin developing a genetic transformation system for cells derived from the freshwater gastropod, Biomphalaria glabrata, an intermediate host of the human blood fluke Schistosoma mansoni. Using a 70-kD heat-shock protein (HSP70) cDNA probe obtained from the B. glabrata embryonic (Bge) cell line, we cloned from Bge cells a complete HSP70 gene including a 1-kb genomic DNA fragment in its 5'-flanking region containing sequences indicative of a HSP promoter. Identified in the 5'-half (416 nucleotides) of this genomic fragment were TATA and CAAT boxes, two putative transcription initiation sites, and a series of palindromic DNA repeats with shared homology to the heat-shock element consensus sequence (Bge HSP70(0.5k) promoter). The 3'-half of this upstream flanking region was comprised of a 508-base intron located immediately 5' of the ATG start codon. To determine the functionality of the putative snail promoter sequence, Bge HSP promoter/luciferase (Luc) reporter gene constructs were introduced into Bge cells by N-(1-(2,3-dioleoyloxy) propyl)-N,N,N-trimethylammonium methylsulfate (DOTAP)-mediated transfection methods, and assayed for Luc activity 48 hr following a 1.5-hr heat-shock treatment (40 degrees C). Compared with control vectors or the Bge HSP70(0.5k/1.0k) promoter constructs at 26 degrees C, a 10- to 300-fold increase in Luc expression was obtained only in the Bge HSP70 promoter/Luc-transfected cells following heat-shock. Results of transfection experiments demonstrate that the Bge HSP70(0.5k) DNA segment contains appropriate promoter sequences for driving temperature-inducible gene expression in the Bge snail cell line. This report represents the first isolation and functional characterization of an inducible promoter from a freshwater gastropod mollusc. Successful transient expression of a foreign reporter gene in Bge cells using a homologous, inducible promoter sequence now paves the way for development of methods for stable integration and expression of snail genes of interest into the Bge cell line.

  19. Scar-less multi-part DNA assembly design automation

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Hillson, Nathan J.

    The present invention provides a method of a method of designing an implementation of a DNA assembly. In an exemplary embodiment, the method includes (1) receiving a list of DNA sequence fragments to be assembled together and an order in which to assemble the DNA sequence fragments, (2) designing DNA oligonucleotides (oligos) for each of the DNA sequence fragments, and (3) creating a plan for adding flanking homology sequences to each of the DNA oligos. In an exemplary embodiment, the method includes (1) receiving a list of DNA sequence fragments to be assembled together and an order in which tomore » assemble the DNA sequence fragments, (2) designing DNA oligonucleotides (oligos) for each of the DNA sequence fragments, and (3) creating a plan for adding optimized overhang sequences to each of the DNA oligos.« less

  20. Identification of Genes Encoding Conjugated Bile Salt Hydrolase and Transport in Lactobacillus johnsonii 100-100

    PubMed Central

    Elkins, Christopher A.; Savage, Dwayne C.

    1998-01-01

    Cytosolic extracts of Lactobacillus johnsonii 100-100 (previously reported as Lactobacillus sp. strain 100-100) contain four heterotrimeric isozymes composed of two peptides, α and β, with conjugated bile salt hydrolase (BSH) activity. We now report cloning, from the genome of strain 100-100, a 2,977-bp DNA segment that expresses BSH activity in Escherichia coli. The sequencing of this segment showed that it contained one complete and two partial open reading frames (ORFs). The 3′ partial ORF (927 nucleotides) was predicted by BLAST and confirmed with 5′ and 3′ deletions to be a BSH gene. Thermal asymmetric interlaced PCR was used to extend and complete the 948-nucleotide sequence of the BSH gene 3′ of the cloned segment. The predicted amino acid sequence of the 5′ partial ORF (651 nucleotides) was about 80% similar to the C-terminal half of the largest, complete ORF (1,353 nucleotides), and these two putative proteins were similar to several amine, multidrug resistance, and sugar transport proteins of the major facilitator superfamily. E. coli DH5α cells transformed with a construct containing these ORFs, in concert with an extracellular factor produced by strain 100-100, demonstrated levels of uptake of [14C]taurocholic acid that were increased as much as threefold over control levels. [14C]Cholic acid was taken up in similar amounts by strain DH5α pSportI (control) and DH5α p2000 (transport clones). These findings support a hypothesis that the ORFs are conjugated bile salt transport genes which may be arranged in an operon with BSH genes. PMID:9721268

  1. Genetic diversity and genetic structure of farmed and wild Chinese mitten crab (Eriocheir sinensis) populations from three major basins by mitochondrial DNA COI and Cyt b gene sequences.

    PubMed

    Zhang, Cheng; Li, Qingqing; Wu, Xugan; Liu, Qing; Cheng, Yongxu

    2017-11-20

    The Chinese mitten crab, Eriocheir sinensis, is one of the important native crab species in East Asian region, which has been widely cultured throughout China, particularly in river basins of Yangtze, Huanghe and Liaohe. This study was designed to evaluate the genetic diversity and genetic structure of cultured and wild E. sinensis populations from the three river basins based on mitochondrial DNA (mtDNA) cytochrome oxidase subunit I (COI) and cytochrome b (Cyt b). The results showed that there were 62 variable sites and 30 parsimony informative sites in the 647 bp of sequenced mtDNA COI from 335 samples. Similarly, a 637 bp segment of Cyt b provided 59 variable sites and 26 parsimony informative sites. AMOVA showed that the levels of genetic differentiation were low among six populations. Although the haplotype diversity and nucleotide diversity of Huanghe wild population had slightly higher than the other populations, there were no significant differences. There was no significant differentiation between the genetic and geographic distance of the six populations, and haplotype network diagram indicated that there may exist genetic hybrids of E. sinensis from different river basins. The results of clustering and neutrality tests revealed that the distance of geographical locations were not completely related to their genetic distance values for the six populations. In conclusion, these results have great significance for the evaluation and exploitation of germplasm resources of E. sinensis.

  2. Distant neighbor base sequence context effects in human nucleotide excision repair of a benzo[a]pyrene-derived DNA lesion

    PubMed Central

    Cai, Yuqin; Kropachev, Konstantin; Xu, Rong; Tang, Yijin; Kolbanovskii, Marina; Kolbanovskii, Alexander; Amin, Shantu; Patel, Dinshaw J.; Broyde, Suse; Geacintov, Nicholas E.

    2010-01-01

    Summary The effects of non-nearest base sequences, beyond the nucleotides flanking a DNA lesion on either side, on nucleotide excision repair (NER) in extracts from human cells were investigated. We constructed two duplexes containing the same minor groove-aligned 10S (+)-trans-anti-B[a]P-N2-dG (G*) DNA adduct, derived from the environmental carcinogen benzo[a]pyrene (B[a]P): 5′-C-C-A-T-C-G*-C-T-A-C-C-3′ (CG*C-I), and 5′-C-A-C3-A4-C5-G*-C-A-C-A-C-3′ (CG*C-II). We utilized gel electrophoresis to compare the extent of DNA bending, and molecular dynamics (MD) simulations to analyze the structural characteristics of these two DNA duplexes. The NER efficiencies are 1.6 ± 0.2 times greater in the case of the CG*C-II than the CG*C-I sequence context in 135-mer duplexes. Gel electrophoresis and self-ligation circularization experiments revealed that the CG*C-II duplex is more bent than the CG*C-I duplex, while MD simulations showed that the unique -C3-A4-C5- segment in the CG*C-II duplex plays a key role. The presence of a minor groove-positioned guanine amino group, namely, the Watson-Crick partner to C3, acts as a wedge; facilitated by a highly deformable local -C3-A4- base step, this amino group allows the B[a]P ring system to produce a more enlarged minor groove in CG*C-II than in CG*C-I, as well as a local untwisting and enlarged and flexible Roll only in the CG*C-II sequence. These structural properties fit well with our prior findings that in the case of the family of minor groove 10S (+)-trans-anti-B[a]P-N2-dG lesions, flexible bends and enlarged minor groove widths (Cai et al. (2009) J. Mol. Biol., 385: 30–44) constitute NER recognition signals, and extend our understanding of sequence context effects on NER to the neighbors that are distant to the lesion. PMID:20399214

  3. Variations in 5S rDNAs in diploid and tetraploid offspring of red crucian carp × common carp.

    PubMed

    Ye, Lihai; Zhang, Chun; Tang, Xiaojun; Chen, Yiyi; Liu, Shaojun

    2017-08-08

    The allotetraploid hybrid fish (4nAT) that was created in a previous study through an intergeneric cross between red crucian carp (Carassius auratus red var., ♀) and common carp (Cyprinus carpio L., ♂) provided an excellent platform to investigate the effect of hybridization and polyploidization on the evolution of 5S rDNA. The 5S rDNAs of paternal common carp were made up of a coding sequence (CDS) and a non-transcribed spacer (NTS) unit, and while the 5S rDNAs of maternal red crucian carp contained a CDS and a NTS unit, they also contained a variable number of interposed regions (IPRs). The CDSs of the 5S rDNAs in both parental fishes were conserved, while their NTS units seemed to have been subjected to rapid evolution. The diploid hybrid 2nF 1 inherited all the types of 5S rDNAs in both progenitors and there were no signs of homeologous recombination in the 5S rDNAs of 2nF 1 by sequencing of PCR products. We obtained two segments of 5S rDNA with a total length of 16,457 bp from allotetraploid offspring 4nAT through bacterial artificial chromosome (BAC) sequencing. Using this sequence together with the 5S rDNA sequences amplified from the genomic DNA of 4nAT, we deduced that the 5S rDNAs of 4nAT might be inherited from the maternal progenitor red crucian carp. Additionally, the IPRs in the 5S rDNAs of 4nAT contained A-repeats and TA-repeats, which was not the case for the IPRs in the 5S rDNAs of 2nF 1 . We also detected two signals of a 200-bp fragment of 5S rDNA in the chromosomes of parental progenitors and hybrid progenies by fluorescence in situ hybridization (FISH). We deduced that during the evolution of 5S rDNAs in different ploidy hybrid fishes, interlocus gene conversion events and tandem repeat insertion events might occurred in the process of polyploidization. This study provided new insights into the relationship among the evolution of 5S rDNAs, hybridization and polyploidization, which were significant in clarifying the genome evolution of polyploid fish.

  4. Genomic cloning and chromosomal localization of HRY, the human homolog to the Drosophila segmentation gene, hairy

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Feder, J.N.; Jan, L.Y.; Jan, Y.N.

    The Drosophila hairy gene encodes a basic helix- loop-helix protein that functions in at least two steps during Drosophila development: (1) during embryogenesis, when it partakes in the establishment of segments, and (2) during the larval stage, when it functions negatively in determining the pattern of sensory bristles on the adult fly. In the rat, a structurally homologous gene (RHL) behaves as an immediate-early gene in its response to growth factors and can, like that in Drosophila, suppress neuronal differentiation events. Here, the authors report the genomic cloning of the human hairy gene homolog (HRY). The coding region of themore » gene is contained within four exons. The predicted amino acid sequence reveals only four amino acid differences between the human and rat genes. Analysis of the DNA sequence 5[prime] to the coding region reveals a putatitve untranslated exon. To increase the value of the HRY gene as a genetic marker and to assess its potential involvement in genetic disorders, they sublocalized the locus to chromosome 3q28-q29 by fluorescence in situ hybridization. 34 refs., 4 figs., 1 tab.« less

  5. Evolution of oesophageal adenocarcinoma from metaplastic columnar epithelium without goblet cells in Barrett's oesophagus.

    PubMed

    Lavery, Danielle L; Martinez, Pierre; Gay, Laura J; Cereser, Biancastella; Novelli, Marco R; Rodriguez-Justo, Manuel; Meijer, Sybren L; Graham, Trevor A; McDonald, Stuart A C; Wright, Nicholas A; Jansen, Marnix

    2016-06-01

    Barrett's oesophagus commonly presents as a patchwork of columnar metaplasia with and without goblet cells in the distal oesophagus. The presence of metaplastic columnar epithelium with goblet cells on oesophageal biopsy is a marker of cancer progression risk, but it is unclear whether clonal expansion and progression in Barrett's oesophagus is exclusive to columnar epithelium with goblet cells. We developed a novel method to trace the clonal ancestry of an oesophageal adenocarcinoma across an entire Barrett's segment. Clonal expansions in Barrett's mucosa were identified using cytochrome c oxidase enzyme histochemistry. Somatic mutations were identified through mitochondrial DNA sequencing and single gland whole exome sequencing. By tracing the clonal origin of an oesophageal adenocarcinoma across an entire Barrett's segment through a combination of histopathological spatial mapping and clonal ordering, we find that this cancer developed from a premalignant clonal expansion in non-dysplastic ('cardia-type') columnar metaplasia without goblet cells. Our data demonstrate the premalignant potential of metaplastic columnar epithelium without goblet cells in the context of Barrett's oesophagus. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://www.bmj.com/company/products-services/rights-and-licensing/

  6. Compositional searching of CpG islands in the human genome

    NASA Astrophysics Data System (ADS)

    Luque-Escamilla, Pedro Luis; Martínez-Aroza, José; Oliver, José L.; Gómez-Lopera, Juan Francisco; Román-Roldán, Ramón

    2005-06-01

    We report on an entropic edge detector based on the local calculation of the Jensen-Shannon divergence with application to the search for CpG islands. CpG islands are pieces of the genome related to gene expression and cell differentiation, and thus to cancer formation. Searching for these CpG islands is a major task in genetics and bioinformatics. Some algorithms have been proposed in the literature, based on moving statistics in a sliding window, but its size may greatly influence the results. The local use of Jensen-Shannon divergence is a completely different strategy: the nucleotide composition inside the islands is different from that in their environment, so a statistical distance—the Jensen-Shannon divergence—between the composition of two adjacent windows may be used as a measure of their dissimilarity. Sliding this double window over the entire sequence allows us to segment it compositionally. The fusion of those segments into greater ones that satisfy certain identification criteria must be achieved in order to obtain the definitive results. We find that the local use of Jensen-Shannon divergence is very suitable in processing DNA sequences for searching for compositionally different structures such as CpG islands, as compared to other algorithms in literature.

  7. Dynamic distribution patterns of ribosomal DNA and chromosomal evolution in Paphiopedilum, a lady's slipper orchid

    PubMed Central

    2011-01-01

    Background Paphiopedilum is a horticulturally and ecologically important genus of ca. 80 species of lady's slipper orchids native to Southeast Asia. These plants have long been of interest regarding their chromosomal evolution, which involves a progressive aneuploid series based on either fission or fusion of centromeres. Chromosome number is positively correlated with genome size, so rearrangement processes must include either insertion or deletion of DNA segments. We have conducted Fluorescence In Situ Hybridization (FISH) studies using 5S and 25S ribosomal DNA (rDNA) probes to survey for rearrangements, duplications, and phylogenetically-correlated variation within Paphiopedilum. We further studied sequence variation of the non-transcribed spacers of 5S rDNA (5S-NTS) to examine their complex duplication history, including the possibility that concerted evolutionary forces may homogenize diversity. Results 5S and 25S rDNA loci among Paphiopedilum species, representing all key phylogenetic lineages, exhibit a considerable diversity that correlates well with recognized evolutionary groups. 25S rDNA signals range from 2 (representing 1 locus) to 9, the latter representing hemizygosity. 5S loci display extensive structural variation, and show from 2 specific signals to many, both major and minor and highly dispersed. The dispersed signals mainly occur at centromeric and subtelomeric positions, which are hotspots for chromosomal breakpoints. Phylogenetic analysis of cloned 5S rDNA non-transcribed spacer (5S-NTS) sequences showed evidence for both ancient and recent post-speciation duplication events, as well as interlocus and intralocus diversity. Conclusions Paphiopedilum species display many chromosomal rearrangements - for example, duplications, translocations, and inversions - but only weak concerted evolutionary forces among highly duplicated 5S arrays, which suggests that double-strand break repair processes are dynamic and ongoing. These results make the genus a model system for the study of complex chromosomal evolution in plants. PMID:21910890

  8. Meta-Analysis of Mitochondrial DNA Variation in the Iberian Peninsula.

    PubMed

    Barral-Arca, Ruth; Pischedda, Sara; Gómez-Carballa, Alberto; Pastoriza, Ana; Mosquera-Miguel, Ana; López-Soto, Manuel; Martinón-Torres, Federico; Álvarez-Iglesias, Vanesa; Salas, Antonio

    2016-01-01

    The Iberian Peninsula has been the focus of attention of numerous studies dealing with mitochondrial DNA (mtDNA) variation, most of them targeting the control region segment. In the present study we sequenced the control region of 3,024 Spanish individuals from areas where available data were still limited. We also compiled mtDNA haplotypes from the literature involving 4,588 sequences and 28 population groups or small regions. We meta-analyzed all these data in order to shed further light on patterns of geographic variation, taking advantage of the large sample size and geographic coverage, in contrast with the atomized sampling strategy of previous work. The results indicate that the main mtDNA haplogroups show primarily clinal geographic patterns across the Iberian geography, roughly along a North-South axis. Haplogroup HV0 (where haplogroup U is nested) is more prevalent in the Franco Cantabrian region, in good agreement with previous findings that identified this area as a climate refuge during the Last Glacial Maximum (LGM), prior to a subsequent demographic re-expansion towards Central Europe and the Mediterranean. Typical sub-Saharan and North African lineages are slightly more prevalent in South Iberia, although at low frequencies; this pattern has been shaped mainly by the transatlantic slave trade and the Arab invasion of the Iberian Peninsula. The results also indicate that summary statistics that aim to measure molecular variation, or AMOVA, have limited sensitivity to detect population substructure, in contrast to patterns revealed by phylogeographic analysis. Overall, the results suggest that mtDNA variation in Iberia is substantially stratified. These patterns might be relevant in biomedical studies given that stratification is a common cause of false positives in case-control mtDNA association studies, and should be also considered when weighting the DNA evidence in forensic casework, which is strongly dependent on haplotype frequencies.

  9. Dynamic distribution patterns of ribosomal DNA and chromosomal evolution in Paphiopedilum, a lady's slipper orchid.

    PubMed

    Lan, Tianying; Albert, Victor A

    2011-09-12

    Paphiopedilum is a horticulturally and ecologically important genus of ca. 80 species of lady's slipper orchids native to Southeast Asia. These plants have long been of interest regarding their chromosomal evolution, which involves a progressive aneuploid series based on either fission or fusion of centromeres. Chromosome number is positively correlated with genome size, so rearrangement processes must include either insertion or deletion of DNA segments. We have conducted Fluorescence In Situ Hybridization (FISH) studies using 5S and 25S ribosomal DNA (rDNA) probes to survey for rearrangements, duplications, and phylogenetically-correlated variation within Paphiopedilum. We further studied sequence variation of the non-transcribed spacers of 5S rDNA (5S-NTS) to examine their complex duplication history, including the possibility that concerted evolutionary forces may homogenize diversity. 5S and 25S rDNA loci among Paphiopedilum species, representing all key phylogenetic lineages, exhibit a considerable diversity that correlates well with recognized evolutionary groups. 25S rDNA signals range from 2 (representing 1 locus) to 9, the latter representing hemizygosity. 5S loci display extensive structural variation, and show from 2 specific signals to many, both major and minor and highly dispersed. The dispersed signals mainly occur at centromeric and subtelomeric positions, which are hotspots for chromosomal breakpoints. Phylogenetic analysis of cloned 5S rDNA non-transcribed spacer (5S-NTS) sequences showed evidence for both ancient and recent post-speciation duplication events, as well as interlocus and intralocus diversity. Paphiopedilum species display many chromosomal rearrangements--for example, duplications, translocations, and inversions--but only weak concerted evolutionary forces among highly duplicated 5S arrays, which suggests that double-strand break repair processes are dynamic and ongoing. These results make the genus a model system for the study of complex chromosomal evolution in plants.

  10. Meta-Analysis of Mitochondrial DNA Variation in the Iberian Peninsula

    PubMed Central

    Barral-Arca, Ruth; Pischedda, Sara; Gómez-Carballa, Alberto; Pastoriza, Ana; Mosquera-Miguel, Ana; López-Soto, Manuel; Martinón-Torres, Federico; Álvarez-Iglesias, Vanesa; Salas, Antonio

    2016-01-01

    The Iberian Peninsula has been the focus of attention of numerous studies dealing with mitochondrial DNA (mtDNA) variation, most of them targeting the control region segment. In the present study we sequenced the control region of 3,024 Spanish individuals from areas where available data were still limited. We also compiled mtDNA haplotypes from the literature involving 4,588 sequences and 28 population groups or small regions. We meta-analyzed all these data in order to shed further light on patterns of geographic variation, taking advantage of the large sample size and geographic coverage, in contrast with the atomized sampling strategy of previous work. The results indicate that the main mtDNA haplogroups show primarily clinal geographic patterns across the Iberian geography, roughly along a North-South axis. Haplogroup HV0 (where haplogroup U is nested) is more prevalent in the Franco Cantabrian region, in good agreement with previous findings that identified this area as a climate refuge during the Last Glacial Maximum (LGM), prior to a subsequent demographic re-expansion towards Central Europe and the Mediterranean. Typical sub-Saharan and North African lineages are slightly more prevalent in South Iberia, although at low frequencies; this pattern has been shaped mainly by the transatlantic slave trade and the Arab invasion of the Iberian Peninsula. The results also indicate that summary statistics that aim to measure molecular variation, or AMOVA, have limited sensitivity to detect population substructure, in contrast to patterns revealed by phylogeographic analysis. Overall, the results suggest that mtDNA variation in Iberia is substantially stratified. These patterns might be relevant in biomedical studies given that stratification is a common cause of false positives in case-control mtDNA association studies, and should be also considered when weighting the DNA evidence in forensic casework, which is strongly dependent on haplotype frequencies. PMID:27441366

  11. Integrated circuit layer image segmentation

    NASA Astrophysics Data System (ADS)

    Masalskis, Giedrius; Petrauskas, Romas

    2010-09-01

    In this paper we present IC layer image segmentation techniques which are specifically created for precise metal layer feature extraction. During our research we used many samples of real-life de-processed IC metal layer images which were obtained using optical light microscope. We have created sequence of various image processing filters which provides segmentation results of good enough precision for our application. Filter sequences were fine tuned to provide best possible results depending on properties of IC manufacturing process and imaging technology. Proposed IC image segmentation filter sequences were experimentally tested and compared with conventional direct segmentation algorithms.

  12. Alternative reverse genetics system for influenza viruses based on a synthesized swine 45S rRNA promoter.

    PubMed

    Wang, Kai; Huang, Qi; Yang, Zhiwei; Qi, Kezong; Liu, Hongmei; Chen, Hongjun

    2017-08-01

    We generated an alternative reverse genetics (RG) system based on a synthesized swine 45S rRNA promoter to rescue the H3N2 subtype swine influenza virus. All eight flanking segment cassettes of A/swine/Henan/7/2010 (H3N2) were amplified with ambisense expression elements from RG plasmids. All segments were then recombined with the pHC2014 vector, which contained the synthesized swine 45S rRNA promoter (spol1) and its terminal sequence (t1) in a pcDNA3 backbone. As a result, we obtained a set of RG plasmids carrying the corresponding eight-segment cassettes. We efficiently generated the H3N2 virus after transfection into 293T/PK15, PK15, and 293T cells. The efficiency of spol1-driven influenza virus rescue in PK15 cells was similar to that in 293T cells by titration using the human pol1 RG system. Our approach suggests that an alternative spol1-based RG system can produce influenza viruses.

  13. Localization of the human mitochondrial citrate transporter protein gene to chromosome 22q11 in the DiGeorge syndrome critical region

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Heisterkamp, N.; Hoeve, J.T.; Groffen, J.

    A high percentage of patients with DiGeorge syndrome and velo-cardio-facial syndrome have interstitial deletions on chromosome 22q11. The shortest region of overlap is currently estimated to be around 500 kb. Two segments of DNA from chromosome 22q11, located 160 kb apart, were cloned because they contained NotI restriction enzyme sites. In the current study we demonstrate that these segments are absent from chromosomes 22 carrying microdeletions of two different DiGeorge patients. Fluorescence in situ and Southern blot hybridization was further used to show that this locus is within the DiGeorge critical region. Phylogenetically conserved sequences adjacent to one of themore » two NotI sites hybridized to mRNAs in different human cell lines. cDNAs isolated with a probe from this segment showed it to contain the gene for the human mitochondrial citrate transporter protein. Deletion of this gene in DiGeorge may contribute to the mental deficiency seen in the patients. 35 refs., 5 figs.« less

  14. Analysis of the genetic diversity of ovine herpesvirus 2 in samples from livestock with malignant catarrhal fever.

    PubMed

    Russell, George C; Scholes, Sandra F; Twomey, David F; Courtenay, Ann E; Grant, Dawn M; Lamond, Bruce; Norris, David; Willoughby, Kim; Haig, David M; Stewart, James P

    2014-08-06

    In order to define better virus isolates from animals with malignant catarrhal fever (MCF), segments of three genes of ovine herpesvirus-2 were amplified from diagnostic samples representing MCF cases with a range of clinical presentations in cattle, including head and eye, alimentary and neurological. The variation within each gene segment was estimated by DNA sequencing, which confirmed that the newly-annotated Ov9.5 gene was significantly more polymorphic than either of the other loci tested (segments of ORF50 and ORF75), with alleles that differed at over 60% of nucleotide positions. Despite this, the nine Ov9.5 alleles characterised had identical predicted splicing patterns and could be translated into Ov9.5 polypeptides with at least 49% amino acid identity. This multi-locus approach has potential for use in epidemiological studies and in charactering chains of infection. However there was no association between specific variants of OvHV-2 and the clinical/pathological presentation of MCF in the cattle analysed. Copyright © 2014 Elsevier B.V. All rights reserved.

  15. Mitochondrial and Y-chromosomal profile of the Kazakh population from East Kazakhstan

    PubMed Central

    Tarlykov, Pavel V.; Zholdybayeva, Elena V.; Akilzhanova, Ainur R.; Nurkina, Zhannur M.; Sabitov, Zhaxylyk M.; Rakhypbekov, Tolebay K.; Ramanculov, Erlan M.

    2013-01-01

    Aim To study the genetic relationship of Kazakhs from East Kazakhstan to other Eurasian populations by examining paternal and maternal DNA lineages. Methods Whole blood samples were collected in 2010 from 160 unrelated healthy Kazakhs residing in East Kazakhstan. Genomic DNA was extracted with Wizard® genomic DNA Purification Kit. Nucleotide sequence of hypervariable segment I of mitochondrial DNA (mtDNA) was determined and analyzed. Seventeen Y-short tandem repeat (STR) loci were studied in 67 samples with the AmpFiSTR Y-filer PCR Amplification Kit. In addition, mtDNA data for 2701 individuals and Y-STR data for 677 individuals were retrieved from the literature for comparison. Results There was a high degree of genetic differentiation on the level of mitochondrial DNA. The majority of maternal lineages belonged to haplogroups common in Central Asia. In contrast, Y-STR data showed very low genetic diversity, with the relative frequency of the predominant haplotype of 0.612. Conclusion The results revealed different migration patterns in the population sample, showing there had been more migration among women. mtDNA genetic diversity in this population was equivalent to that in other Central Asian populations. Genetic evidence suggests the existence of a single paternal founder lineage in the population of East Kazakhstan, which is consistent with verbal genealogical data of the local tribes. PMID:23444242

  16. Complete genome sequence and phylogenetic analyses of an aquabirnavirus isolated from a diseased marbled eel culture in Taiwan.

    PubMed

    Wen, Chiu-Ming

    2017-08-01

    An aquabirnavirus was isolated from diseased marbled eels (Anguilla marmorata; MEIPNV1310) with gill haemorrhages and associated mortality. Its genome segment sequences were obtained through next-generation sequencing and compared with published aquabirnavirus sequences. The results indicated that the genome sequence of MEIPNV1310 contains segment A (3099 nucleotides) and segment B (2789 nucleotides). Phylogenetic analysis showed that MEIPNV1310 is closely related to the infectious pancreatic necrosis Ab strain within genogroup II. This genome sequence is beneficial for studying the geographic distribution and evolution of aquabirnaviruses.

  17. Genome-wide identification of conserved intronic non-coding sequences using a Bayesian segmentation approach.

    PubMed

    Algama, Manjula; Tasker, Edward; Williams, Caitlin; Parslow, Adam C; Bryson-Richardson, Robert J; Keith, Jonathan M

    2017-03-27

    Computational identification of non-coding RNAs (ncRNAs) is a challenging problem. We describe a genome-wide analysis using Bayesian segmentation to identify intronic elements highly conserved between three evolutionarily distant vertebrate species: human, mouse and zebrafish. We investigate the extent to which these elements include ncRNAs (or conserved domains of ncRNAs) and regulatory sequences. We identified 655 deeply conserved intronic sequences in a genome-wide analysis. We also performed a pathway-focussed analysis on genes involved in muscle development, detecting 27 intronic elements, of which 22 were not detected in the genome-wide analysis. At least 87% of the genome-wide and 70% of the pathway-focussed elements have existing annotations indicative of conserved RNA secondary structure. The expression of 26 of the pathway-focused elements was examined using RT-PCR, providing confirmation that they include expressed ncRNAs. Consistent with previous studies, these elements are significantly over-represented in the introns of transcription factors. This study demonstrates a novel, highly effective, Bayesian approach to identifying conserved non-coding sequences. Our results complement previous findings that these sequences are enriched in transcription factors. However, in contrast to previous studies which suggest the majority of conserved sequences are regulatory factor binding sites, the majority of conserved sequences identified using our approach contain evidence of conserved RNA secondary structures, and our laboratory results suggest most are expressed. Functional roles at DNA and RNA levels are not mutually exclusive, and many of our elements possess evidence of both. Moreover, ncRNAs play roles in transcriptional and post-transcriptional regulation, and this may contribute to the over-representation of these elements in introns of transcription factors. We attribute the higher sensitivity of the pathway-focussed analysis compared to the genome-wide analysis to improved alignment quality, suggesting that enhanced genomic alignments may reveal many more conserved intronic sequences.

  18. Sampling and pyrosequencing methods for characterizing bacterial communities in the human gut using 16S sequence tags

    PubMed Central

    2010-01-01

    Intense interest centers on the role of the human gut microbiome in health and disease, but optimal methods for analysis are still under development. Here we present a study of methods for surveying bacterial communities in human feces using 454/Roche pyrosequencing of 16S rRNA gene tags. We analyzed fecal samples from 10 individuals and compared methods for storage, DNA purification and sequence acquisition. To assess reproducibility, we compared samples one cm apart on a single stool specimen for each individual. To analyze storage methods, we compared 1) immediate freezing at -80°C, 2) storage on ice for 24 or 3) 48 hours. For DNA purification methods, we tested three commercial kits and bead beating in hot phenol. Variations due to the different methodologies were compared to variation among individuals using two approaches--one based on presence-absence information for bacterial taxa (unweighted UniFrac) and the other taking into account their relative abundance (weighted UniFrac). In the unweighted analysis relatively little variation was associated with the different analytical procedures, and variation between individuals predominated. In the weighted analysis considerable variation was associated with the purification methods. Particularly notable was improved recovery of Firmicutes sequences using the hot phenol method. We also carried out surveys of the effects of different 454 sequencing methods (FLX versus Titanium) and amplification of different 16S rRNA variable gene segments. Based on our findings we present recommendations for protocols to collect, process and sequence bacterial 16S rDNA from fecal samples--some major points are 1) if feasible, bead-beating in hot phenol or use of the PSP kit improves recovery; 2) storage methods can be adjusted based on experimental convenience; 3) unweighted (presence-absence) comparisons are less affected by lysis method. PMID:20673359

  19. Switch Transcripts in Immunoglobulin Class Switching

    NASA Astrophysics Data System (ADS)

    Lorenz, Matthias; Jung, Steffen; Radbruch, Andreas

    1995-03-01

    B cells can exchange gene segments for the constant region of the immunoglobulin heavy chain, altering the class and effector function of the antibodies that they produce. Class switching is directed to distinct classes by cytokines, which induce transcription of the targeted DNA sequences. These transcripts are processed, resulting in spliced "switch" transcripts. Switch recombination can be directed to immunoglobulin G1 (IgG1) by the heterologous human metallothionein II_A promoter in mutant mice. Induction of the structurally conserved, spliced switch transcripts is sufficient to target switch recombination to IgG1, whereas transcription alone is not.

  20. Nucleic Acid i-Motif Structures in Analytical Chemistry.

    PubMed

    Alba, Joan Josep; Sadurní, Anna; Gargallo, Raimundo

    2016-09-02

    Under the appropriate experimental conditions of pH and temperature, cytosine-rich segments in DNA or RNA sequences may produce a characteristic folded structure known as an i-motif. Besides its potential role in vivo, which is still under investigation, this structure has attracted increasing interest in other fields due to its sharp, fast and reversible pH-driven conformational changes. This "on/off" switch at molecular level is being used in nanotechnology and analytical chemistry to develop nanomachines and sensors, respectively. This paper presents a review of the latest applications of this structure in the field of chemical analysis.

  1. Amplification of Herpes Simplex Virus Types 1 and 2 and Human Herpes Virus Type 5 Polymerase Gene Segment From Formalin-Fixed Brain Tissue From Alzheimer’s Disease Patients

    DTIC Science & Technology

    2005-08-01

    The neuronal nitric oxide synthase (NOS1) gene target was amplified and sequenced in all samples tested, in addition to HSV1 , HSV2 , or Human Herpes...Triphosphate DNA Deoxyribonucleic acid GAPDH Glyceraldehyde-3 -phosphate dehydrogenase HSV Herpes Simplex Virus HSV1 Herpes Simplex Virus Type 1 HSV2 Herpes... HSV2 ) share 50-70 % homology. HSV1 is primarily associated with oral and ocular lesions, while HSV2 is primarily associated with genital and anal lesions

  2. A comparative study of the inner ear structures of artiodactyls and early cetaceans

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Klingshirn, M.A.; Luo, Z.

    1994-12-31

    It has been suggested that the order Cetacea (whales and porpoises) are closely related to artiodactyls, even-hoofed ungulate mammals such as the pig and cow. Paleontological and molecular data strongly supports this concept of phylogenetic relationships. In a study of DNA sequences of two mitochondrial ribosomal gene segments of cetaceans, the artiodactyls were found to be closest related to Cetaceans. These well accepted studies on the phylogenetic affinities of artiodactyls and cetaceans cause us to conduct a comparative study of the bony structure of the inner ear of these two taxa.

  3. Phage T4 SegB protein is a homing endonuclease required for the preferred inheritance of T4 tRNA gene region occurring in co-infection with a related phage

    PubMed Central

    Brok-Volchanskaya, Vera S.; Kadyrov, Farid A.; Sivogrivov, Dmitry E.; Kolosov, Peter M.; Sokolov, Andrey S.; Shlyapnikov, Michael G.; Kryukov, Valentine M.; Granovsky, Igor E.

    2008-01-01

    Homing endonucleases initiate nonreciprocal transfer of DNA segments containing their own genes and the flanking sequences by cleaving the recipient DNA. Bacteriophage T4 segB gene, which is located in a cluster of tRNA genes, encodes a protein of unknown function, homologous to homing endonucleases of the GIY-YIG family. We demonstrate that SegB protein is a site-specific endonuclease, which produces mostly 3′ 2-nt protruding ends at its DNA cleavage site. Analysis of SegB cleavage sites suggests that SegB recognizes a 27-bp sequence. It contains 11-bp conserved sequence, which corresponds to a conserved motif of tRNA TψC stem-loop, whereas the remainder of the recognition site is rather degenerate. T4-related phages T2L, RB1 and RB3 contain tRNA gene regions that are homologous to that of phage T4 but lack segB gene and several tRNA genes. In co-infections of phages T4 and T2L, segB gene is inherited with nearly 100% of efficiency. The preferred inheritance depends absolutely on the segB gene integrity and is accompanied by the loss of the T2L tRNA gene region markers. We suggest that SegB is a homing endonuclease that functions to ensure spreading of its own gene and the surrounding tRNA genes among T4-related phages. PMID:18281701

  4. Nucleotide sequence of soybean chloroplast DNA regions which contain the psb A and trn H genes and cover the ends of the large single copy region and one end of the inverted repeats.

    PubMed

    Spielmann, A; Stutz, E

    1983-10-25

    The soybean chloroplast psb A gene (photosystem II thylakoid membrane protein of Mr 32 000, lysine-free) and the trn H gene (tRNAHisGUG), which both map in the large single copy region adjacent to one of the inverted repeat structures (IR1), have been sequenced including flanking regions. The psb A gene shows in its structural part 92% sequence homology with the corresponding genes of spinach and N. debneyi and contains also an open reading frame for 353 aminoacids. The aminoacid sequence of a potential primary translation product (calculated Mr, 38 904, no lysine) diverges from that of spinach and N. debneyi in only two positions in the C-terminal part. The trn H gene has the same polarity as the psb A gene and the coding region is located at the very end of the large single copy region. The deduced sequence of the soybean chloroplast tRNAHisGUG is identical with that of Zea mays chloroplasts. Both ends of the large single copy region were sequenced including a small segment of the adjacent IR1 and IR2.

  5. DNA Inversion on Conjugative Plasmid pVT745

    PubMed Central

    Chen, Jinbiao; Leblanc, Donald J.; Galli, Dominique M.

    2002-01-01

    Plasmid pVT745 from Actinobacillus actinomycetemcomitans strain VT745 can be transferred to other A. actinomycetemcomitans strains at a frequency of 10−6. Screening of transconjugants revealed that the DNA of pDMG21A, a pVT745 derivative containing a kanamycin resistance gene, displayed a structural rearrangement after transfer. A 9-kb segment on the plasmid had switched orientation. The inversion was independent of RecA and required the activity of the pVT745-encoded site-specific recombinase. This recombinase, termed Inv, was highly homologous to invertases of the Din family. Two recombination sites of 22 bp, which are arranged in opposite orientation and which function as DNA crossover sequences, were identified on pVT745. One of the sites was located adjacent to the 5′ end of the invertase gene, inv. Inversion of the 9-kb segment on pVT745 derivatives has been observed in all A. actinomycetemcomitans strains tested except for the original host, VT745. This would suggest that a host factor that is either inactive or absent in VT745 is required for efficient recombination. Inactivation of the invertase in the donor strain resulted in a 1,000-fold increase in the number of transconjugants upon plasmid transfer. It is proposed that an activated invertase causes the immediate loss of the plasmid in most recipient cells after mating. No biological role has been associated with the invertase as of yet. PMID:12374826

  6. Genetic and Epigenetic Changes in Oilseed Rape (Brassica napus L.) Extracted from Intergeneric Allopolyploid and Additions with Orychophragmus.

    PubMed

    Gautam, Mayank; Dang, Yanwei; Ge, Xianhong; Shao, Yujiao; Li, Zaiyun

    2016-01-01

    Allopolyploidization with the merger of the genomes from different species has been shown to be associated with genetic and epigenetic changes. But the maintenance of such alterations related to one parental species after the genome is extracted from the allopolyploid remains to be detected. In this study, the genome of Brassica napus L. (2n = 38, genomes AACC) was extracted from its intergeneric allohexaploid (2n = 62, genomes AACCOO) with another crucifer Orychophragmus violaceus (2n = 24, genome OO), by backcrossing and development of alien addition lines. B. napus-type plants identified in the self-pollinated progenies of nine monosomic additions were analyzed by the methods of amplified fragment length polymorphism, sequence-specific amplified polymorphism, and methylation-sensitive amplified polymorphism. They showed modifications to certain extents in genomic components (loss and gain of DNA segments and transposons, introgression of alien DNA segments) and DNA methylation, compared with B. napus donor. The significant differences in the changes between the B. napus types extracted from these additions likely resulted from the different effects of individual alien chromosomes. Particularly, the additions which harbored the O. violaceus chromosome carrying dominant rRNA genes over those of B. napus tended to result in the development of plants which showed fewer changes, suggesting a role of the expression levels of alien rRNA genes in genomic stability. These results provided new cues for the genetic alterations in one parental genome that are maintained even after the genome becomes independent.

  7. Genetic and Epigenetic Changes in Oilseed Rape (Brassica napus L.) Extracted from Intergeneric Allopolyploid and Additions with Orychophragmus

    PubMed Central

    Gautam, Mayank; Dang, Yanwei; Ge, Xianhong; Shao, Yujiao; Li, Zaiyun

    2016-01-01

    Allopolyploidization with the merger of the genomes from different species has been shown to be associated with genetic and epigenetic changes. But the maintenance of such alterations related to one parental species after the genome is extracted from the allopolyploid remains to be detected. In this study, the genome of Brassica napus L. (2n = 38, genomes AACC) was extracted from its intergeneric allohexaploid (2n = 62, genomes AACCOO) with another crucifer Orychophragmus violaceus (2n = 24, genome OO), by backcrossing and development of alien addition lines. B. napus-type plants identified in the self-pollinated progenies of nine monosomic additions were analyzed by the methods of amplified fragment length polymorphism, sequence-specific amplified polymorphism, and methylation-sensitive amplified polymorphism. They showed modifications to certain extents in genomic components (loss and gain of DNA segments and transposons, introgression of alien DNA segments) and DNA methylation, compared with B. napus donor. The significant differences in the changes between the B. napus types extracted from these additions likely resulted from the different effects of individual alien chromosomes. Particularly, the additions which harbored the O. violaceus chromosome carrying dominant rRNA genes over those of B. napus tended to result in the development of plants which showed fewer changes, suggesting a role of the expression levels of alien rRNA genes in genomic stability. These results provided new cues for the genetic alterations in one parental genome that are maintained even after the genome becomes independent. PMID:27148282

  8. Evidence for degenerate tetraploidy in bdelloid rotifers.

    PubMed

    Mark Welch, David B; Mark Welch, Jessica L; Meselson, Matthew

    2008-04-01

    Rotifers of class Bdelloidea have evolved for millions of years apparently without sexual reproduction. We have sequenced 45- to 70-kb regions surrounding the four copies of the hsp82 gene of the bdelloid rotifer Philodina roseola, each of which is on a separate chromosome. The four regions comprise two colinear gene-rich pairs with gene content, order, and orientation conserved within each pair. Only a minority of genes are common to both pairs, also in the same orientation and order, but separated by gene-rich segments present in only one or the other pair. The pattern is consistent with degenerate tetraploidy with numerous segmental deletions, some in one pair of colinear chromosomes and some in the other. Divergence in 1,000-bp windows varies along an alignment of a colinear pair, from zero to as much as 20% in a pattern consistent with gene conversion associated with recombinational repair of DNA double-strand breaks. Although pairs of colinear chromosomes are a characteristic of sexually reproducing diploids and polyploids, a quite different explanation for their presence in bdelloids is suggested by the recent finding that bdelloid rotifers can recover and resume reproduction after suffering hundreds of radiation-induced DNA double-strand breaks per oocyte nucleus. Because bdelloid primary oocytes are in G(1) and therefore lack sister chromatids, we propose that bdelloid colinear chromosome pairs are maintained as templates for the repair of DNA double-strand breaks caused by the frequent desiccation and rehydration characteristic of bdelloid habitats.

  9. Genetic divergence and phylogenetic relationships in grey mullets (Teleostei: Mugilidae) based on PCR-RFLP analysis of mtDNA segments.

    PubMed

    Papasotiropoulos, V; Klossa-Kilia, E; Kilias, G; Alahiotis, S

    2002-04-01

    The genetic differentiation and phylogenetic relationships among five species of the Mugilidae family (Mugil cephalus, Chelon labrosus, Liza aurata, Liza ramada, and Liza saliens) were investigated at the mtDNA level, on samples taken from Messolongi lagoon-Greece. RFLP analysis of three PCR-amplified mtDNA gene segments (12s rRNA, 16s rRNA, and CO I) was used. Ten, eight, and nine restriction enzymes were found to have at least one recognition site at 12s rRNA, 16s rRNA, and CO I genes, respectively. Several fragment patterns were revealed to be species-specific, and thus they could be useful in species taxonomy as diagnostic markers, as well as for further evolutionary studies. Seven different haplotypes were detected. The greatest amount of genetic differentiation was observed at the interspecific level, while little variation was revealed at the intraspecific level. The highest values of nucleotide sequence divergence were observed between M. cephalus and all the other species, while the lowest was found between C. labrosus and L. saliens. Dendrograms obtained by the three different methods (UPGMA, Neighbor-Joining, and Dollo parsimony), were found to exhibit in all cases the same topology. According to this, the most distinct species is M. cephalus, while the other species are clustered in two separate groups, thefirst one containing L. aurata and L. ramada, the other L. saliens and C. labrosus. This last clustering makes the monophyletic origin of the genus Liza questionable.

  10. Genomic segments RNA1 and RNA2 of Prunus necrotic ringspot virus codetermine viral pathogenicity to adapt to alternating natural Prunus hosts.

    PubMed

    Cui, Hongguang; Hong, Ni; Wang, Guoping; Wang, Aiming

    2013-05-01

    Prunus necrotic ringspot virus (PNRSV) affects Prunus fruit production worldwide. To date, numerous PNRSV isolates with diverse pathological properties have been documented. To study the pathogenicity of PNRSV, which directly or indirectly determines the economic losses of infected fruit trees, we have recently sequenced the complete genome of peach isolate Pch12 and cherry isolate Chr3, belonging to the pathogenically aggressive PV32 group and mild PV96 group, respectively. Here, we constructed the Chr3- and Pch12-derived full-length cDNA clones that were infectious in the experimental host cucumber and their respective natural Prunus hosts. Pch12-derived clones induced much more severe symptoms than Chr3 in cucumber, and the pathogenicity discrepancy between Chr3 and Pch12 was associated with virus accumulation. By reassortment of genomic segments, swapping of partial genomic segments, and site-directed mutagenesis, we identified the 3' terminal nucleotide sequence (1C region) in RNA1 and amino acid K at residue 279 in RNA2-encoded P2 as the severe virulence determinants in Pch12. Gain-of-function experiments demonstrated that both the 1C region and K279 of Pch12 were required for severe virulence and high levels of viral accumulation. Our results suggest that PNRSV RNA1 and RNA2 codetermine viral pathogenicity to adapt to alternating natural Prunus hosts, likely through mediating viral accumulation.

  11. Metagenomic assembly through the lens of validation: recent advances in assessing and improving the quality of genomes assembled from metagenomes.

    PubMed

    Olson, Nathan D; Treangen, Todd J; Hill, Christopher M; Cepeda-Espinoza, Victoria; Ghurye, Jay; Koren, Sergey; Pop, Mihai

    2017-08-07

    Metagenomic samples are snapshots of complex ecosystems at work. They comprise hundreds of known and unknown species, contain multiple strain variants and vary greatly within and across environments. Many microbes found in microbial communities are not easily grown in culture making their DNA sequence our only clue into their evolutionary history and biological function. Metagenomic assembly is a computational process aimed at reconstructing genes and genomes from metagenomic mixtures. Current methods have made significant strides in reconstructing DNA segments comprising operons, tandem gene arrays and syntenic blocks. Shorter, higher-throughput sequencing technologies have become the de facto standard in the field. Sequencers are now able to generate billions of short reads in only a few days. Multiple metagenomic assembly strategies, pipelines and assemblers have appeared in recent years. Owing to the inherent complexity of metagenome assembly, regardless of the assembly algorithm and sequencing method, metagenome assemblies contain errors. Recent developments in assembly validation tools have played a pivotal role in improving metagenomics assemblers. Here, we survey recent progress in the field of metagenomic assembly, provide an overview of key approaches for genomic and metagenomic assembly validation and demonstrate the insights that can be derived from assemblies through the use of assembly validation strategies. We also discuss the potential for impact of long-read technologies in metagenomics. We conclude with a discussion of future challenges and opportunities in the field of metagenomic assembly and validation. © The Author 2017. Published by Oxford University Press.

  12. Direct Sequence Detection of Structured H5 Influenza Viral RNA

    PubMed Central

    Kerby, Matthew B.; Freeman, Sarah; Prachanronarong, Kristina; Artenstein, Andrew W.; Opal, Steven M.; Tripathi, Anubhav

    2008-01-01

    We describe the development of sequence-specific molecular beacons (dual-labeled DNA probes) for identification of the H5 influenza subtype, cleavage motif, and receptor specificity when hybridized directly with in vitro transcribed viral RNA (vRNA). The cloned hemagglutinin segment from a highly pathogenic H5N1 strain, A/Hanoi/30408/2005(H5N1), isolated from humans was used as template for in vitro transcription of sense-strand vRNA. The hybridization behavior of vRNA and a conserved subtype probe was characterized experimentally by varying conditions of time, temperature, and Mg2+ to optimize detection. Comparison of the hybridization rates of probe to DNA and RNA targets indicates that conformational switching of influenza RNA structure is a rate-limiting step and that the secondary structure of vRNA dominates the binding kinetics. The sensitivity and specificity of probe recognition of other H5 strains was calculated from sequence matches to the National Center for Biotechnology Information influenza database. The hybridization specificity of the subtype probes was experimentally verified with point mutations within the probe loop at five locations corresponding to the other human H5 strains. The abundance frequencies of the hemagglutinin cleavage motif and sialic acid recognition sequences were experimentally tested for H5 in all host viral species. Although the detection assay must be coupled with isothermal amplification on the chip, the new probes form the basis of a portable point-of-care diagnostic device for influenza subtyping. PMID:18403607

  13. FISH Finder: a high-throughput tool for analyzing FISH images

    PubMed Central

    Shirley, James W.; Ty, Sereyvathana; Takebayashi, Shin-ichiro; Liu, Xiuwen; Gilbert, David M.

    2011-01-01

    Motivation: Fluorescence in situ hybridization (FISH) is used to study the organization and the positioning of specific DNA sequences within the cell nucleus. Analyzing the data from FISH images is a tedious process that invokes an element of subjectivity. Automated FISH image analysis offers savings in time as well as gaining the benefit of objective data analysis. While several FISH image analysis software tools have been developed, they often use a threshold-based segmentation algorithm for nucleus segmentation. As fluorescence signal intensities can vary significantly from experiment to experiment, from cell to cell, and within a cell, threshold-based segmentation is inflexible and often insufficient for automatic image analysis, leading to additional manual segmentation and potential subjective bias. To overcome these problems, we developed a graphical software tool called FISH Finder to automatically analyze FISH images that vary significantly. By posing the nucleus segmentation as a classification problem, compound Bayesian classifier is employed so that contextual information is utilized, resulting in reliable classification and boundary extraction. This makes it possible to analyze FISH images efficiently and objectively without adjustment of input parameters. Additionally, FISH Finder was designed to analyze the distances between differentially stained FISH probes. Availability: FISH Finder is a standalone MATLAB application and platform independent software. The program is freely available from: http://code.google.com/p/fishfinder/downloads/list Contact: gilbert@bio.fsu.edu PMID:21310746

  14. DoOP: Databases of Orthologous Promoters, collections of clusters of orthologous upstream sequences from chordates and plants

    PubMed Central

    Barta, Endre; Sebestyén, Endre; Pálfy, Tamás B.; Tóth, Gábor; Ortutay, Csaba P.; Patthy, László

    2005-01-01

    DoOP (http://doop.abc.hu/) is a database of eukaryotic promoter sequences (upstream regions) aiming to facilitate the recognition of regulatory sites conserved between species. The annotated first exons of human and Arabidopsis thaliana genes were used as queries in BLAST searches to collect the most closely related orthologous first exon sequences from Chordata and Viridiplantae species. Up to 3000 bp DNA segments upstream from these first exons constitute the clusters in the chordate and plant sections of the Database of Orthologous Promoters. Release 1.0 of DoOP contains 21 061 chordate clusters from 284 different species and 7548 plant clusters from 269 different species. The database can be used to find and retrieve promoter sequences of a given gene from various species and it is also suitable to see the most trivial conserved sequence blocks in the orthologous upstream regions. Users can search DoOP with either sequence or text (annotation) to find promoter clusters of various genes. In addition to the sequence data, the positions of the conserved sequence blocks derived from multiple alignments, the positions of repetitive elements and the positions of transcription start sites known from the Eukaryotic Promoter Database (EPD) can be viewed graphically. PMID:15608291

  15. DoOP: Databases of Orthologous Promoters, collections of clusters of orthologous upstream sequences from chordates and plants.

    PubMed

    Barta, Endre; Sebestyén, Endre; Pálfy, Tamás B; Tóth, Gábor; Ortutay, Csaba P; Patthy, László

    2005-01-01

    DoOP (http://doop.abc.hu/) is a database of eukaryotic promoter sequences (upstream regions) aiming to facilitate the recognition of regulatory sites conserved between species. The annotated first exons of human and Arabidopsis thaliana genes were used as queries in BLAST searches to collect the most closely related orthologous first exon sequences from Chordata and Viridiplantae species. Up to 3000 bp DNA segments upstream from these first exons constitute the clusters in the chordate and plant sections of the Database of Orthologous Promoters. Release 1.0 of DoOP contains 21,061 chordate clusters from 284 different species and 7548 plant clusters from 269 different species. The database can be used to find and retrieve promoter sequences of a given gene from various species and it is also suitable to see the most trivial conserved sequence blocks in the orthologous upstream regions. Users can search DoOP with either sequence or text (annotation) to find promoter clusters of various genes. In addition to the sequence data, the positions of the conserved sequence blocks derived from multiple alignments, the positions of repetitive elements and the positions of transcription start sites known from the Eukaryotic Promoter Database (EPD) can be viewed graphically.

  16. Synthetic oligonucleotide probes deduced from amino acid sequence data. Theoretical and practical considerations.

    PubMed

    Lathe, R

    1985-05-05

    Synthetic probes deduced from amino acid sequence data are widely used to detect cognate coding sequences in libraries of cloned DNA segments. The redundancy of the genetic code dictates that a choice must be made between (1) a mixture of probes reflecting all codon combinations, and (2) a single longer "optimal" probe. The second strategy is examined in detail. The frequency of sequences matching a given probe by chance alone can be determined and also the frequency of sequences closely resembling the probe and contributing to the hybridization background. Gene banks cannot be treated as random associations of the four nucleotides, and probe sequences deduced from amino acid sequence data occur more often than predicted by chance alone. Probe lengths must be increased to confer the necessary specificity. Examination of hybrids formed between unique homologous probes and their cognate targets reveals that short stretches of perfect homology occurring by chance make a significant contribution to the hybridization background. Statistical methods for improving homology are examined, taking human coding sequences as an example, and considerations of codon utilization and dinucleotide frequencies yield an overall homology of greater than 82%. Recommendations for probe design and hybridization are presented, and the choice between using multiple probes reflecting all codon possibilities and a unique optimal probe is discussed.

  17. Analysis of the 9p21.3 sequence associated with coronary artery disease reveals a tendency for duplication in a CAD patient

    PubMed Central

    Kouprina, Natalay; Noskov, Vladimir N.; Waterfall, Joshua J.; Walker, Robert L.; Meltzer, Paul S.; Topol, Eric J.; Larionov, Vladimir

    2018-01-01

    Tandem segmental duplications (SDs) greater than 10 kb are widespread in complex genomes. They provide material for gene divergence and evolutionary adaptation, while formation of specific de novo SDs is a hallmark of cancer and some human diseases. Most SDs map to distinct genomic regions termed ‘duplication blocks’. SDs organization within these blocks is often poorly characterized as they are mosaics of ancestral duplicons juxtaposed with younger duplicons arising from more recent duplication events. Structural and functional analysis of SDs is further hampered as long repetitive DNA structures are underrepresented in existing BAC and YAC libraries. We applied Transformation-Associated Recombination (TAR) cloning, a versatile technique for large DNA manipulation, to selectively isolate the coronary artery disease (CAD) interval sequence within the 9p21.3 chromosome locus from a patient with coronary artery disease and normal individuals. Four tandem head-to-tail duplicons, each ∼50 kb long, were recovered in the patient but not in normal individuals. Sequence analysis revealed that the repeats varied by 10-15 SNPs between each other and by 82 SNPs between the human genome sequence (version hg19). SNPs polymorphism within the junctions between repeats allowed two junction types to be distinguished, Type 1 and Type 2, which were found at a 2:1 ratio. The junction sequences contained an Alu element, a sequence previously shown to play a role in duplication. Knowledge of structural variation in the CAD interval from more patients could help link this locus to cardiovascular diseases susceptibility, and maybe relevant to other cases of regional amplification, including cancer. PMID:29632643

  18. Characterization of the novel antifungal protein PgAFP and the encoding gene of Penicillium chrysogenum.

    PubMed

    Rodríguez-Martín, Andrea; Acosta, Raquel; Liddell, Susan; Núñez, Félix; Benito, M José; Asensio, Miguel A

    2010-04-01

    The strain RP42C from Penicillium chrysogenum produces a small protein PgAFP that inhibits the growth of some toxigenic molds. The molecular mass of the protein determined by electrospray ionization mass spectrometry (ESI-MS) was 6 494Da. PgAFP showed a cationic character with an estimated pI value of 9.22. Upon chemical and enzymatic treatments of PgAFP, no evidence for N- or O-glycosylations was obtained. Five partial sequences of PgAFP were obtained by Edman degradation and by ESI-MS/MS after trypsin and chymotrypsin digestions. Using degenerate primers from these peptide sequences, a segment of 70bp was amplified by PCR from pgafp gene. 5'- and 3'-ends of pgafp were obtained by RACE-PCR with gene-specific primers designed from the 70bp segment. The complete pgafp sequence of 404bp was obtained using primers designed from 5'- and 3'-ends. Comparison of genomic and cDNA sequences revealed a 279bp coding region interrupted by two introns of 63 and 62bp. The precursor of the antifungal protein consists of 92 amino acids and appears to be processed to the mature 58 amino acids PgAFP. The deduced amino acid sequence of the mature protein shares 79% identity to the antifungal protein Anafp from Aspergillus niger. PgAFP is a new protein that belongs to the group of small, cysteine-rich, and basic proteins with antifungal activity produced by ascomycetes. Given that P. chrysogenum is regarded as safe mold commonly found in foods, PgAFP may be useful to prevent growth of toxigenic molds in food and agricultural products. Copyright (c) 2009 Elsevier Inc. All rights reserved.

  19. Application of Quaternion in improving the quality of global sequence alignment scores for an ambiguous sequence target in Streptococcus pneumoniae DNA

    NASA Astrophysics Data System (ADS)

    Lestari, D.; Bustamam, A.; Novianti, T.; Ardaneswari, G.

    2017-07-01

    DNA sequence can be defined as a succession of letters, representing the order of nucleotides within DNA, using a permutation of four DNA base codes including adenine (A), guanine (G), cytosine (C), and thymine (T). The precise code of the sequences is determined using DNA sequencing methods and technologies, which have been developed since the 1970s and currently become highly developed, advanced and highly throughput sequencing technologies. So far, DNA sequencing has greatly accelerated biological and medical research and discovery. However, in some cases DNA sequencing could produce any ambiguous and not clear enough sequencing results that make them quite difficult to be determined whether these codes are A, T, G, or C. To solve these problems, in this study we can introduce other representation of DNA codes namely Quaternion Q = (PA, PT, PG, PC), where PA, PT, PG, PC are the probability of A, T, G, C bases that could appear in Q and PA + PT + PG + PC = 1. Furthermore, using Quaternion representations we are able to construct the improved scoring matrix for global sequence alignment processes, by applying a dot product method. Moreover, this scoring matrix produces better and higher quality of the match and mismatch score between two DNA base codes. In implementation, we applied the Needleman-Wunsch global sequence alignment algorithm using Octave, to analyze our target sequence which contains some ambiguous sequence data. The subject sequences are the DNA sequences of Streptococcus pneumoniae families obtained from the Genebank, meanwhile the target DNA sequence are received from our collaborator database. As the results we found the Quaternion representations improve the quality of the sequence alignment score and we can conclude that DNA sequence target has maximum similarity with Streptococcus pneumoniae.

  20. Complete Coding Genome Sequence for Mogiana Tick Virus, a Jingmenvirus Isolated from Ticks in Brazil

    DTIC Science & Technology

    2017-05-04

    and capable of infecting a wide range of animal hosts (1–5). Here, we report the complete coding genome sequence (i.e., only missing portions of...segmented nature of the genome was not under- stood. Therefore, only the two genome segments with detectable sequence homolo- gies to flaviviruses were...originally reported (2). We revisited the data set of Maruyama et al. (2) and assembled the complete coding sequences for all four genome segments. We

Top