genome sequence migs: Topics by Science.gov

Sample records for genome sequence migs

Enriching public descriptions of marine phages using the Genomic Standards Consortium MIGS standard

PubMed Central

Duhaime, Melissa Beth; Kottmann, Renzo; Field, Dawn; Glöckner, Frank Oliver

2011-01-01

In any sequencing project, the possible depth of comparative analysis is determined largely by the amount and quality of the accompanying contextual data. The structure, content, and storage of this contextual data should be standardized to ensure consistent coverage of all sequenced entities and facilitate comparisons. The Genomic Standards Consortium (GSC) has developed the “Minimum Information about Genome/Metagenome Sequences (MIGS/MIMS)” checklist for the description of genomes, and here we annotate all 30 publicly available marine bacteriophage sequences to the MIGS standard. These annotations build on existing International Nucleotide Sequence Database Collaboration (INSDC) records, and confirm, as expected, that current submissions lack most MIGS fields. MIGS fields were manually curated from the literature and placed in XML format as specified by the Genomic Contextual Data Markup Language (GCDML). These “machine-readable” reports were then analyzed to highlight patterns describing this collection of genomes. Completed reports are provided in GCDML. This work represents one step towards the annotation of our complete collection of genome sequences and shows the utility of capturing richer metadata along with raw sequences. PMID:21677864
A standard MIGS/MIMS compliant XML Schema: toward the development of the Genomic Contextual Data Markup Language (GCDML).

PubMed

Kottmann, Renzo; Gray, Tanya; Murphy, Sean; Kagan, Leonid; Kravitz, Saul; Lombardot, Thierry; Field, Dawn; Glöckner, Frank Oliver

2008-06-01

The Genomic Contextual Data Markup Language (GCDML) is a core project of the Genomic Standards Consortium (GSC) that implements the "Minimum Information about a Genome Sequence" (MIGS) specification and its extension, the "Minimum Information about a Metagenome Sequence" (MIMS). GCDML is an XML Schema for generating MIGS/MIMS compliant reports for data entry, exchange, and storage. When mature, this sample-centric, strongly-typed schema will provide a diverse set of descriptors for describing the exact origin and processing of a biological sample, from sampling to sequencing, and subsequent analysis. Here we describe the need for such a project, outline design principles required to support the project, and make an open call for participation in defining the future content of GCDML. GCDML is freely available, and can be downloaded, along with documentation, from the GSC Web site (http://gensc.org).
The minimum information about a genome sequence (MIGS) specification

PubMed Central

Field, Dawn; Garrity, George; Gray, Tanya; Morrison, Norman; Selengut, Jeremy; Sterk, Peter; Tatusova, Tatiana; Thomson, Nicholas; Allen, Michael J; Angiuoli, Samuel V; Ashburner, Michael; Axelrod, Nelson; Baldauf, Sandra; Ballard, Stuart; Boore, Jeffrey; Cochrane, Guy; Cole, James; Dawyndt, Peter; De Vos, Paul; dePamphilis, Claude; Edwards, Robert; Faruque, Nadeem; Feldman, Robert; Gilbert, Jack; Gilna, Paul; Glöckner, Frank Oliver; Goldstein, Philip; Guralnick, Robert; Haft, Dan; Hancock, David; Hermjakob, Henning; Hertz-Fowler, Christiane; Hugenholtz, Phil; Joint, Ian; Kagan, Leonid; Kane, Matthew; Kennedy, Jessie; Kowalchuk, George; Kottmann, Renzo; Kolker, Eugene; Kravitz, Saul; Kyrpides, Nikos; Leebens-Mack, Jim; Lewis, Suzanna E; Li, Kelvin; Lister, Allyson L; Lord, Phillip; Maltsev, Natalia; Markowitz, Victor; Martiny, Jennifer; Methe, Barbara; Mizrachi, Ilene; Moxon, Richard; Nelson, Karen; Parkhill, Julian; Proctor, Lita; White, Owen; Sansone, Susanna-Assunta; Spiers, Andrew; Stevens, Robert; Swift, Paul; Taylor, Chris; Tateno, Yoshio; Tett, Adrian; Turner, Sarah; Ussery, David; Vaughan, Bob; Ward, Naomi; Whetzel, Trish; Gil, Ingio San; Wilson, Gareth; Wipat, Anil

2008-01-01

With the quantity of genomic data increasing at an exponential rate, it is imperative that these data be captured electronically, in a standard format. Standardization activities must proceed within the auspices of open-access and international working bodies. To tackle the issues surrounding the development of better descriptions of genomic investigations, we have formed the Genomic Standards Consortium (GSC). Here, we introduce the minimum information about a genome sequence (MIGS) specification with the intent of promoting participation in its development and discussing the resources that will be required to develop improved mechanisms of metadata capture and exchange. As part of its wider goals, the GSC also supports improving the ‘transparency’ of the information contained in existing genomic databases. PMID:18464787
Submitting MIGS, MIMS, MIENS Information to EMBL and Standards and the Sequencing Pipelines of the Gordon and Betty Moore Foundation (GSC8 Meeting)

ScienceCinema

Vaughan, Bob; Kaye, Jon

2018-01-24

The Genomic Standards Consortium was formed in September 2005. It is an international, open-membership working body which promotes standardization in the description of genomes and the exchange and integration of genomic data. The 2009 meeting was an activity of a five-year "Research Coordination Network" funded by the National Science Foundation and was held at the DOE Joint Genome Institute with organizational support provided by the JGI and by the University of California - San Diego. Bob Vaughan of EMBL talks about submitting MIGS/MIMS/MIENS information to EMBL-EBI's system, followed by a brief talk from Jon Kaye of the Gordon and Betty Moore Foundation on standards and the foundation's sequencing pipelines at the Genomic Standards Consortium's 8th meeting at the DOE JGI in Walnut Creek, CA on Sept. 9, 2009.
Submitting MIGS, MIMS, MIENS Information to EMBL and Standards and the Sequencing Pipelines of the Gordon and Betty Moore Foundation (GSC8 Meeting)

DOE Office of Scientific and Technical Information (OSTI.GOV)

Vaughan, Bob; Kaye, Jon

2009-09-09

The Genomic Standards Consortium was formed in September 2005. It is an international, open-membership working body which promotes standardization in the description of genomes and the exchange and integration of genomic data. The 2009 meeting was an activity of a five-year "Research Coordination Network" funded by the National Science Foundation and was held at the DOE Joint Genome Institute with organizational support provided by the JGI and by the University of California - San Diego. Bob Vaughan of EMBL talks about submitting MIGS/MIMS/MIENS information to EMBL-EBI's system, followed by a brief talk from Jon Kaye of the Gordon and Betty Moore Foundation on standards and the foundation's sequencing pipelines at the Genomic Standards Consortium's 8th meeting at the DOE JGI in Walnut Creek, CA on Sept. 9, 2009.
Toward Universal Forward Genetics: Using a Draft Genome Sequence of the Nematode Oscheius tipulae To Identify Mutations Affecting Vulva Development

PubMed Central

Besnard, Fabrice; Koutsovoulos, Georgios; Dieudonné, Sana; Blaxter, Mark; Félix, Marie-Anne

2017-01-01

Mapping-by-sequencing has become a standard method to map and identify phenotype-causing mutations in model species. Here, we show that a fragmented draft assembly is sufficient to perform mapping-by-sequencing in nonmodel species. We generated a draft assembly and annotation of the genome of the free-living nematode Oscheius tipulae, a distant relative of the model Caenorhabditis elegans. We used this draft to identify the likely causative mutations at the O. tipulae cov-3 locus, which affect vulval development. The cov-3 locus encodes the O. tipulae ortholog of C. elegans mig-13, and we further show that Cel-mig-13 mutants also have an unsuspected vulval-development phenotype. In a virtuous circle, we were able to use the linkage information collected during mutant mapping to improve the genome assembly. These results showcase the promise of genome-enabled forward genetics in nonmodel species. PMID:28630114
Toward Universal Forward Genetics: Using a Draft Genome Sequence of the Nematode Oscheius tipulae To Identify Mutations Affecting Vulva Development.

PubMed

Besnard, Fabrice; Koutsovoulos, Georgios; Dieudonné, Sana; Blaxter, Mark; Félix, Marie-Anne

2017-08-01

Mapping-by-sequencing has become a standard method to map and identify phenotype-causing mutations in model species. Here, we show that a fragmented draft assembly is sufficient to perform mapping-by-sequencing in nonmodel species. We generated a draft assembly and annotation of the genome of the free-living nematode Oscheius tipulae, a distant relative of the model Caenorhabditis elegans. We used this draft to identify the likely causative mutations at the O. tipulae cov-3 locus, which affect vulval development. The cov-3 locus encodes the O. tipulae ortholog of C. elegans mig-13, and we further show that Cel-mig-13 mutants also have an unsuspected vulval-development phenotype. In a virtuous circle, we were able to use the linkage information collected during mutant mapping to improve the genome assembly. These results showcase the promise of genome-enabled forward genetics in nonmodel species. Copyright © 2017 by the Genetics Society of America.
Minimum information about a single amplified genome (MISAG) and a metagenome-assembled genome (MIMAG) of bacteria and archaea

DOE PAGES

Bowers, Robert M.; Kyrpides, Nikos C.; Stepanauskas, Ramunas; ...

2017-08-08

Here, we present two standards developed by the Genomic Standards Consortium (GSC) for reporting bacterial and archaeal genome sequences. Both are extensions of the Minimum Information about Any (x) Sequence (MIxS). The standards are the Minimum Information about a Single Amplified Genome (MISAG) and the Minimum Information about a Metagenome-Assembled Genome (MIMAG), including, but not limited to, assembly quality, and estimates of genome completeness and contamination. These standards can be used in combination with other GSC checklists, including the Minimum Information about a Genome Sequence (MIGS), Minimum Information about a Metagenomic Sequence (MIMS), and Minimum Information about a Marker Gene Sequence (MIMARKS). Community-wide adoption of MISAG and MIMAG will facilitate more robust comparative genomic analyses of bacterial and archaeal diversity.
Minimum information about a single amplified genome (MISAG) and a metagenome-assembled genome (MIMAG) of bacteria and archaea

DOE Office of Scientific and Technical Information (OSTI.GOV)

Bowers, Robert M.; Kyrpides, Nikos C.; Stepanauskas, Ramunas

Here, we present two standards developed by the Genomic Standards Consortium (GSC) for reporting bacterial and archaeal genome sequences. Both are extensions of the Minimum Information about Any (x) Sequence (MIxS). The standards are the Minimum Information about a Single Amplified Genome (MISAG) and the Minimum Information about a Metagenome-Assembled Genome (MIMAG), including, but not limited to, assembly quality, and estimates of genome completeness and contamination. These standards can be used in combination with other GSC checklists, including the Minimum Information about a Genome Sequence (MIGS), Minimum Information about a Metagenomic Sequence (MIMS), and Minimum Information about a Marker Gene Sequence (MIMARKS). Community-wide adoption of MISAG and MIMAG will facilitate more robust comparative genomic analyses of bacterial and archaeal diversity.
Minimum information about a single amplified genome (MISAG) and a metagenome-assembled genome (MIMAG) of bacteria and archaea

DOE Office of Scientific and Technical Information (OSTI.GOV)

Bowers, Robert M.; Kyrpides, Nikos C.; Stepanauskas, Ramunas

We present two standards developed by the Genomic Standards Consortium (GSC) for reporting bacterial and archaeal genome sequences. Both are extensions of the Minimum Information about Any (x) Sequence (MIxS). The standards are the Minimum Information about a Single Amplified Genome (MISAG) and the Minimum Information about a Metagenome-Assembled Genome (MIMAG), including, but not limited to, assembly quality, and estimates of genome completeness and contamination. These standards can be used in combination with other GSC checklists, including the Minimum Information about a Genome Sequence (MIGS), Minimum Information about a Metagenomic Sequence (MIMS), and Minimum Information about a Marker Gene Sequence (MIMARKS). Community-wide adoption of MISAG and MIMAG will facilitate more robust comparative genomic analyses of bacterial and archaeal diversity.
Defining Linkages between the GSC and NSF's LTER Program: How the Ecological Metadata Language (EML) Relates to GCDML and Other Outcomes

Treesearch

Inigo San Gil; Wade Sheldon; Tom Schmidt; Mark Servilla; Raul Aguilar; Corinna Gries; Tanya Gray; Dawn Field; James Cole; Jerry Yun Pan; Giri Palanisamy; Donald Henshaw; Margaret O'Brien; Linda Kinkel; Kathrine McMahon; Renzo Kottmann; Linda Amaral-Zettler; John Hobbie; Philip Goldstein; Robert P. Guralnick; James Brunt; William K. Michener

2008-01-01

The Genomic Standards Consortium (GSC) invited a representative of the Long-Term Ecological Research (LTER) to its fifth workshop to present the Ecological Metadata Language (EML) metadata standard and its relationship to the Minimum Information about a Genome/Metagenome Sequence (MIGS/MIMS) and its implementation, the Genomic Contextual Data Markup Language (GCDML)....
The Genomes On Line Database (GOLD) in 2007: status of genomic and metagenomic projects and their associated metadata.

PubMed

Liolios, Konstantinos; Mavromatis, Konstantinos; Tavernarakis, Nektarios; Kyrpides, Nikos C

2008-01-01

The Genomes On Line Database (GOLD) is a comprehensive resource that provides information on genome and metagenome projects worldwide. Complete and ongoing projects and their associated metadata can be accessed in GOLD through pre-computed lists and a search page. As of September 2007, GOLD contains information on more than 2900 sequencing projects, out of which 639 have been completed and their sequence data deposited in the public databases. GOLD continues to expand with the goal of providing metadata information related to the projects and the organisms/environments towards the 'Minimum Information about a Genome Sequence' (MIGS) guideline. GOLD is available at http://www.genomesonline.org and has a mirror site at the Institute of Molecular Biology and Biotechnology, Crete, Greece at http://gold.imbb.forth.gr/
Transcription factor clusters regulate genes in eukaryotic cells

PubMed Central

Hedlund, Erik G; Friemann, Rosmarie; Hohmann, Stefan

2017-01-01

Transcription is regulated through the binding of factors to gene promoters to activate or repress expression; however, the mechanisms by which factors find targets remain unclear. Using single-molecule fluorescence microscopy, we determined in vivo stoichiometry and spatiotemporal dynamics of a GFP-tagged repressor, Mig1, from a paradigm signaling pathway of Saccharomyces cerevisiae. We find the repressor operates in clusters, which upon extracellular signal detection, translocate from the cytoplasm, bind to nuclear targets and turn over. Simulations of Mig1 configuration within a 3D yeast genome model combined with a promoter-specific, fluorescent translation reporter confirmed clusters are the functional unit of gene regulation. In vitro and structural analysis on reconstituted Mig1 suggests that clusters are stabilized by depletion forces between intrinsically disordered sequences. We observed similar clusters of a co-regulatory activator from a different pathway, supporting a generalized cluster model for transcription factors that reduces promoter search times through intersegment transfer while stabilizing gene expression. PMID:28841133
The Genomes On Line Database (GOLD) in 2009: status of genomic and metagenomic projects and their associated metadata.

PubMed

Liolios, Konstantinos; Chen, I-Min A; Mavromatis, Konstantinos; Tavernarakis, Nektarios; Hugenholtz, Philip; Markowitz, Victor M; Kyrpides, Nikos C

2010-01-01

The Genomes On Line Database (GOLD) is a comprehensive resource for centralized monitoring of genome and metagenome projects worldwide. Both complete and ongoing projects, along with their associated metadata, can be accessed in GOLD through precomputed tables and a search page. As of September 2009, GOLD contains information for more than 5800 sequencing projects, of which 1100 have been completed and their sequence data deposited in a public repository. GOLD continues to expand, moving toward the goal of providing the most comprehensive repository of metadata information related to the projects and their organisms/environments in accordance with the Minimum Information about a (Meta)Genome Sequence (MIGS/MIMS) specification. GOLD is available at: http://www.genomesonline.org and has a mirror site at the Institute of Molecular Biology and Biotechnology, Crete, Greece, at: http://gold.imbb.forth.gr/
The Genomes On Line Database (GOLD) in 2009: status of genomic and metagenomic projects and their associated metadata

PubMed Central

Liolios, Konstantinos; Chen, I-Min A.; Mavromatis, Konstantinos; Tavernarakis, Nektarios; Hugenholtz, Philip; Markowitz, Victor M.; Kyrpides, Nikos C.

2010-01-01

The Genomes On Line Database (GOLD) is a comprehensive resource for centralized monitoring of genome and metagenome projects worldwide. Both complete and ongoing projects, along with their associated metadata, can be accessed in GOLD through precomputed tables and a search page. As of September 2009, GOLD contains information for more than 5800 sequencing projects, of which 1100 have been completed and their sequence data deposited in a public repository. GOLD continues to expand, moving toward the goal of providing the most comprehensive repository of metadata information related to the projects and their organisms/environments in accordance with the Minimum Information about a (Meta)Genome Sequence (MIGS/MIMS) specification. GOLD is available at: http://www.genomesonline.org and has a mirror site at the Institute of Molecular Biology and Biotechnology, Crete, Greece, at: http://gold.imbb.forth.gr/ PMID:19914934
Minimum information about a single amplified genome (MISAG) and a metagenome-assembled genome (MIMAG) of bacteria and archaea

DOE Office of Scientific and Technical Information (OSTI.GOV)

Bowers, Robert M.; Kyrpides, Nikos C.; Stepanauskas, Ramunas

The number of genomes from uncultivated microbes will soon surpass the number of isolate genomes in public databases (Hugenholtz, Skarshewski, & Parks, 2016). Technological advancements in high-throughput sequencing and assembly, including single-cell genomics and the computational extraction of genomes from metagenomes (GFMs), are largely responsible. Here we propose community standards for reporting the Minimum Information about a Single-Cell Genome (MIxS-SCG) and Minimum Information about Genomes extracted From Metagenomes (MIxS-GFM) specific for Bacteria and Archaea. The standards have been developed in the context of the International Genomics Standards Consortium (GSC) community (Field et al., 2014) and can be viewed as a supplement to other GSC checklists including the Minimum Information about a Genome Sequence (MIGS), Minimum information about a Metagenomic Sequence(s) (MIMS) (Field et al., 2008) and Minimum Information about a Marker Gene Sequence (MIMARKS) (P. Yilmaz et al., 2011). Community-wide acceptance of MIxS-SCG and MIxS-GFM for Bacteria and Archaea will enable broad comparative analyses of genomes from the majority of taxa that remain uncultivated, improving our understanding of microbial function, ecology, and evolution.
The Genomes On Line Database (GOLD) in 2007: status of genomic and metagenomic projects and their associated metadata

PubMed Central

Liolios, Konstantinos; Mavromatis, Konstantinos; Tavernarakis, Nektarios; Kyrpides, Nikos C.

2008-01-01

The Genomes On Line Database (GOLD) is a comprehensive resource that provides information on genome and metagenome projects worldwide. Complete and ongoing projects and their associated metadata can be accessed in GOLD through pre-computed lists and a search page. As of September 2007, GOLD contains information on more than 2900 sequencing projects, out of which 639 have been completed and their sequence data deposited in the public databases. GOLD continues to expand with the goal of providing metadata information related to the projects and the organisms/environments towards the ‘Minimum Information about a Genome Sequence’ (MIGS) guideline. GOLD is available at http://www.genomesonline.org and has a mirror site at the Institute of Molecular Biology and Biotechnology, Crete, Greece at http://gold.imbb.forth.gr/ PMID:17981842
[Inhibitory effect of Mig-7 silencing by retrovirus-mediated shRNA on vasculogenic mimicry, invasion and metastasis of human hepatocellular carcinoma cells in vitro].

PubMed

Qu, Bo; Sheng, Guan-Nan; Yu, Fei; Chen, Guan-Nan; Lv, Qi; Mao, Zhong-Peng; Guo, Long; Lv, Yi

2016-11-20

To explore the inhibitory effect of migration-inducing gene 7 (Mig-7) gene silencing induced by retroviral-mediated small hairpin RNA (shRNA) on vasculogenic mimicry (VM), invasion and metastasis of human hepatocellular carcinoma (HCC) cells in vitro. Two target sequences (Mig-7 shRNA-1 and Mig-7 shRNA-2) and one negative control sequence (Mig-7 shRNA-N) were synthesized. The recombinant retroviral vectors carrying Mig-7 shRNA were constructed, and the HCC cell line MHCC-97H was transfected with Mig-7 shRNA-1, Mig-7 shRNA-2, Mig-7 shRNA-N, or the empty vector, or treated with 125 µg/mL recombinant human endostatin (ES). Mig-7 expression in the treated cells was detected using semi-quantitative PCR and Western blotting. The inhibitory effect of Mig-7 silencing on VM formation was investigated in a 3-dimensional cell culture system; the changes in cell adhesion, invasion and migration were assessed with intercellular adhesion assay, Transwell invasion assay and Transwell migration assay, respectively. The expression of Mig-7 at both mRNA and protein levels decreased significantly, VM formation, invasion and metastasis were suppressed, while intercellular adhesion increased significantly in MHCC-97H cells in Mig-7 shRNA-1 and Mig-7 shRNA-2 groups (P<0.05); such changes were not observed in cells transfected with Mig-7 shRNA-N or the empty vector, nor in cells treated with ES. Mig-7 silencing by retroviral-mediated shRNA significantly inhibits VM formation, invasion and metastasis and increases the intercellular adhesion of the HCC cells, while ES does not have such inhibitory effects.
MIG-seq: an effective PCR-based method for genome-wide single-nucleotide polymorphism genotyping using the next-generation sequencing platform

PubMed Central

Suyama, Yoshihisa; Matsuki, Yu

2015-01-01

Restriction-enzyme (RE)-based next-generation sequencing methods have revolutionized marker-assisted genetic studies; however, the use of REs has limited their widespread adoption, especially in field samples with low-quality DNA and/or small quantities of DNA. Here, we developed a PCR-based procedure to construct reduced representation libraries without RE digestion steps, representing de novo single-nucleotide polymorphism discovery, and its genotyping using next-generation sequencing. Using multiplexed inter-simple sequence repeat (ISSR) primers, thousands of genome-wide regions were amplified effectively from a wide variety of genomes, without prior genetic information. We demonstrated: 1) Mendelian gametic segregation of the discovered variants; 2) reproducibility of genotyping by checking its applicability for individual identification; and 3) applicability in a wide variety of species by checking standard population genetic analysis. This approach, called multiplexed ISSR genotyping by sequencing, should be applicable to many marker-assisted genetic studies with a wide range of DNA qualities and quantities. PMID:26593239
Isolation of the MIG1 gene from Candida albicans and effects of its disruption on catabolite repression.

PubMed

Zaragoza, O; Rodríguez, C; Gancedo, C

2000-01-01

We have cloned a Candida albicans gene (CaMIG1) that encodes a protein homologous to the DNA-binding protein Mig1 from Saccharomyces cerevisiae (ScMig1). The C. albicans Mig1 protein (CaMig1) differs from ScMig1, in that, among other things, it lacks a putative phosphorylation site for Snf1 and presents several long stretches rich in glutamine or in asparagine, serine, and threonine and has the effector domain located at some distance (50 amino acids) from the carboxy terminus. Expression of CaMIG1 was low and was similar in glucose-, sucrose-, or ethanol-containing media. Disruption of the two CaMIG1 genomic copies had no effect on filamentation or infectivity. Levels of a glucose-repressible alpha-glucosidase, implicated in both sucrose and maltose utilization, were similar in wild-type or mig1/mig1 cells. Disruption of CaMIG1 also had no effect on the expression of the glucose-repressed gene CaGAL1. CaMIG1 was functional in S. cerevisiae, as judged by its ability to suppress the phenotypes produced by mig1 or tps1 mutations. In addition, CaMig1 formed specific complexes with the URS1 region of the S. cerevisiae FBP1 gene. The existence of a possible functional analogue of CaMIG1 in C. albicans was suggested by the results of band shift experiments.

Isolation of the MIG1 Gene from Candida albicans and Effects of Its Disruption on Catabolite Repression

PubMed Central

Zaragoza, Oscar; Rodríguez, Cristina; Gancedo, Carlos

2000-01-01

We have cloned a Candida albicans gene (CaMIG1) that encodes a protein homologous to the DNA-binding protein Mig1 from Saccharomyces cerevisiae (ScMig1). The C. albicans Mig1 protein (CaMig1) differs from ScMig1, in that, among other things, it lacks a putative phosphorylation site for Snf1 and presents several long stretches rich in glutamine or in asparagine, serine, and threonine and has the effector domain located at some distance (50 amino acids) from the carboxy terminus. Expression of CaMIG1 was low and was similar in glucose-, sucrose-, or ethanol-containing media. Disruption of the two CaMIG1 genomic copies had no effect on filamentation or infectivity. Levels of a glucose-repressible α-glucosidase, implicated in both sucrose and maltose utilization, were similar in wild-type or mig1/mig1 cells. Disruption of CaMIG1 also had no effect on the expression of the glucose-repressed gene CaGAL1. CaMIG1 was functional in S. cerevisiae, as judged by its ability to suppress the phenotypes produced by mig1 or tps1 mutations. In addition, CaMig1 formed specific complexes with the URS1 region of the S. cerevisiae FBP1 gene. The existence of a possible functional analogue of CaMIG1 in C. albicans was suggested by the results of band shift experiments. PMID:10629176
Standards and the INSDC: Submission of MIGS, MIMS, MIENS (GSC8 Meeting)

ScienceCinema

Mizrachi, Ilene

2017-12-21

The Genomic Standards Consortium was formed in September 2005. It is an international, open-membership working body which promotes standardization in the description of genomes and the exchange and integration of genomic data. The 2009 meeting was an activity of a five-year Research Coordination Network funded by the National Science Foundation and was held at the DOE Joint Genome Institute with organizational support provided by the JGI and by the University of California - San Diego. Ilene Mizrachi of the NCBI talks about submission of MIGS/MIMS/MIENS information at the Genomic Standards Consortium's 8th meeting at the DOE JGI in Walnut Creek, Calif. on Sept. 9, 2009.
Spontaneous mutations in CYC8 and MIG1 suppress the short chronological lifespan of budding yeast lacking SNF1/AMPK

PubMed Central

Maqani, Nazif; Fine, Ryan D.; Shahid, Mehreen; Li, Mingguang; Enriquez-Hesles, Elisa; Smith, Jeffrey S.

2018-01-01

Chronologically aging yeast cells are prone to adaptive regrowth, whereby mutants with a survival advantage spontaneously appear and re-enter the cell cycle in stationary phase cultures. Adaptive regrowth is especially noticeable with short-lived strains, including those defective for SNF1, the homolog of mammalian AMP-activated protein kinase (AMPK). SNF1 becomes active in response to multiple environmental stresses that occur in chronologically aging cells, including glucose depletion and oxidative stress. SNF1 is also required for the extension of chronological lifespan (CLS) by caloric restriction (CR), defined as limiting glucose at the time of culture inoculation. To identify specific downstream SNF1 targets responsible for CLS extension during CR, we screened for adaptive regrowth mutants that restore chronological longevity to a short-lived snf1∆ parental strain. Whole genome sequencing of the adapted mutants revealed missense mutations in TPR motifs 9 and 10 of the transcriptional co-repressor Cyc8 that specifically mediate repression through the transcriptional repressor Mig1. Another mutation occurred in MIG1 itself, thus implicating the activation of Mig1-repressed genes as a key function of SNF1 in maintaining CLS. Consistent with this conclusion, the cyc8 TPR mutations partially restored growth on alternative carbon sources and significantly extended CLS compared to the snf1∆ parent. Furthermore, cyc8 TPR mutations reactivated multiple Mig1-repressed genes, including the transcription factor gene CAT8, which is responsible for activating genes of the glyoxylate and gluconeogenesis pathways. Deleting CAT8 completely blocked CLS extension by the cyc8 TPR mutations, identifying these pathways as key Snf1-regulated CLS determinants.
Recombinant yeast with improved ethanol tolerance and related methods of use

DOEpatents

Gasch, Audrey P [Madison, WI]; Lewis, Jeffrey A [Madison, WI]

2012-05-15

The present invention provides isolated Elo1 and Mig3 nucleic acid sequences capable of conferring increased ethanol tolerance on recombinant yeast and methods of using same in biofuel production, particularly ethanol production. Methods of bioengineering yeast using the Elo1 and/or Mig3 nucleic acid sequences are also provided.
Improving Xylose Utilization of Saccharomyces cerevisiae by Expressing the MIG1 Mutant from the Self-Flocculating Yeast SPSC01.

PubMed

Xu, Jian-Ren; Zhao, Xin-Qing; Liu, Chen-Guang; Bai, Feng-Wu

2018-01-01

The major carbohydrate components of lignocellulosic biomass are cellulose and hemicelluloses. Saccharomyces cerevisiae cannot efficiently utilize xylose derived upon the hydrolysis of hemicelluloses. Although engineering the yeast with xylose metabolic pathway has been intensively studied, challenges are still ahead for developing robust strains for lignocellulosic bioethanol production. The main objective of this study was to reveal the role of the MIG1 mutant isolated from the self-flocculating S. cerevisiae SPSC01 in xylose utilization, glucose repression and ethanol fermentation by S. cerevisiae. The MIG1 mutant was amplified from S. cerevisiae SPSC01 by PCR and MIG1- overexpression-cassette was transformed into S. cerevisiae S288c and xylose-metabolizing strain YB-2625-T through homologous recombination. Yeast growth was measured by colony assay on plates with or without xylose supplementation. Then xylose utilization and ethanol production were further evaluated through flask fermentation when mixed sugars of glucose and xylose at 3:1 and 2:1, respectively, were supplied. Fermentation products were detected by HPLC, and activities of xylose reductase (XR), xylitol dehydrogenase (XDH) and xylulokinase (XK) were also measured. The transcription of genes regulated by the expression of the MIG1 mutant was analyzed by RTqPCR. Evolutionary relationship of various MIG1s was developed by gene sequencing and sequence alignment. No difference was observed for S288c growing with xylose when it was engineered with the overexpression or deletion of its native MIG1, but its growth was enhanced when overexpressing the MIG1 mutant from SPSC01. The submerged culture of YB-2625-T MIG1-SPSC engineered with xylose-metabolic pathway and the MIG1 mutant indicated that xylitol accumulation was decreased, and consequently, more biomass was accumulated. Furthermore, improved activities of the key enzymes such as XR, XDH and XK were detected in YB-2625-T MIG1-SPSC. 
Evolutionary analysis of MIG1s amplified from S. cerevisiae strains commonly used for ethanol production revealed a close relationship of SPSC01 and YB-2625. Our results demonstrated the effect of the overexpression of the MIG1 mutant from SPSC01 on xylose utilization of S. cerevisiae. This study could be an alternative strategy for engineering S. cerevisiae with improved xylose utilization. Copyright © Bentham Science Publishers.
"A New Arm of the GSC: the RCN4GSC" and "Curation of MIGS-compliant Data" (GSC 8 Meeting)

DOE Office of Scientific and Technical Information (OSTI.GOV)

Field, Dawn; Sterk, Peter

2009-09-09

The Genomic Standards Consortium was formed in September 2005. It is an international, open-membership working body which promotes standardization in the description of genomes and the exchange and integration of genomic data. The 2009 meeting was an activity of a five-year "Research Coordination Network" grant from the National Science Foundation and was held at the DOE Joint Genome Institute, with organizational support provided by the JGI and by the University of California - San Diego. Dawn Field of the NERC Centre for Ecology & Hydrology briefly describes RCN4GSC, and Peter Sterk of the NERC Centre for Ecology & Hydrology follows with a talk on curation of MIGS-compliant data at the Genomic Standards Consortium's 8th meeting at the DOE JGI in Walnut Creek, Calif. on Sept. 9, 2009.
A New Arm of the GSC: The RCN4GSC and Curation of MIGS-compliant Data (GSC8 Meeting)

DOE Office of Scientific and Technical Information (OSTI.GOV)

Field, Dawn; Sterk, Peter

2009-09-09

The Genomic Standards Consortium was formed in September 2005. It is an international, open-membership working body which promotes standardization in the description of genomes and the exchange and integration of genomic data. The 2009 meeting was an activity of a five-year Research Coordination Network grant from the National Science Foundation and was held at the DOE Joint Genome Institute, with organizational support provided by the JGI and by the University of California - San Diego. Dawn Field of the NERC Centre for Ecology and Hydrology briefly describes RCN4GSC, and Peter Sterk of the NERC Centre for Ecology and Hydrology follows with a talk on curation of MIGS-compliant data at the Genomic Standards Consortium 8th meeting at the DOE JGI in Walnut Creek, Calif. on Sept. 9, 2009.
A New Arm of the GSC: The RCN4GSC and Curation of MIGS-compliant Data (GSC8 Meeting)

ScienceCinema

Field, Dawn; Sterk, Peter

2018-01-09

The Genomic Standards Consortium was formed in September 2005. It is an international, open-membership working body which promotes standardization in the description of genomes and the exchange and integration of genomic data. The 2009 meeting was an activity of a five-year Research Coordination Network grant from the National Science Foundation and was held at the DOE Joint Genome Institute, with organizational support provided by the JGI and by the University of California - San Diego. Dawn Field of the NERC Centre for Ecology and Hydrology briefly describes RCN4GSC, and Peter Sterk of the NERC Centre for Ecology and Hydrology follows with a talk on curation of MIGS-compliant data at the Genomic Standards Consortium 8th meeting at the DOE JGI in Walnut Creek, Calif. on Sept. 9, 2009.
[Inhibitory effect of migration-inducing gene-7-shRNA recombinant retrovirus combined with endostatin on growth and metastasis of hepatoma xenograft].

PubMed

Qu, B; Chen, G N; Sheng, G N; Yu, F; Lyu, Q; Gu, Y J; Guo, L; Lyu, Y

2016-09-20

Objective: To investigate the inhibitory effect of retrovirus-mediated short hairpin RNA (shRNA) interference targeting migration-inducing gene-7 (Mig-7), combined with recombinant human endostatin (ES), on the growth and metastasis of subcutaneous xenografts of human hepatoma cells in nude mice. Methods: Two Mig-7 mRNA oligonucleotide sequences (Mig-7-shRNA-1 and Mig-7-shRNA-2) and one negative control sequence (Mig-7-shRNA-N) were designed. The specific Mig-7-shRNA recombinant retrovirus expression vector plasmid was constructed and used for the transfection of human hepatoma MHCC-97H cells with high expression of Mig-7. The subcutaneous xenograft tumor model of human hepatocellular carcinoma (HCC) in nude mice was established, and according to the condition of transfection and administration, the nude mice were divided into pSIREN-M1 group, pSIREN-MN group, ES group, and pSIREN-M1+ES group. The xenograft tumor volume, mass, and metastasis were compared between groups. Immunohistochemistry was used to observe the formation of vasculogenic mimicry (VM) in xenograft tumor and the difference in tumor microvascular density (MVD), and Western blot was used to measure the expression of Mig-7 and vascular endothelial growth factor (VEGF) in each group. A one-way analysis of variance was used for comparison between groups, and Fisher's exact test was used for comparison of categorical data between groups. Results: Compared with the pSIREN-MN group, the pSIREN-M1 group had significantly lower xenograft tumor volume, mass, and metastasis rate, Mig-7 expression, and formation of VM (P < 0.05), as well as significantly higher VEGF expression and MVD (P < 0.05). Compared with the pSIREN-MN group, the ES group had significantly lower xenograft tumor volume, mass, and metastasis rate, VEGF expression, and MVD (P < 0.05), as well as significantly higher Mig-7 expression and formation of VM (P < 0.05).
Compared with the pSIREN-M1 group and the ES group, the pSIREN-M1+ES group had significantly lower xenograft tumor volume, mass, and metastasis rate, Mig-7 expression, formation of VM, VEGF expression, and MVD (P < 0.05). Conclusion: Mig-7-shRNA recombinant retrovirus combined with ES has a better inhibitory effect on the growth and metastasis of HCC xenograft tumor than Mig-7-shRNA recombinant retrovirus or ES alone. The anti-tumor angiogenesis therapy alone, which targets vascular endothelial cells in vivo, has a limited effect, since it may promote the formation of VM.
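Fisher's exact test, named in the Methods above, operates on counts in a contingency table, such as the number of animals with and without metastasis in two groups. The following is a minimal sketch of a two-sided test for a 2x2 table using only the Python standard library. The function name and the counts are hypothetical illustrations and are not taken from the study.

```python
from math import comb


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher's exact test p-value for the 2x2 table [[a, b], [c, d]]."""
    row1, row2 = a + b, c + d
    col1 = a + c
    total = comb(row1 + row2, col1)

    def prob(x):
        # Hypergeometric probability of x events in row 1 with all margins fixed.
        return comb(row1, x) * comb(row2, col1 - x) / total

    p_obs = prob(a)
    lo = max(0, col1 - row2)
    hi = min(row1, col1)
    # Sum the probabilities of every table that is no more likely than the observed one.
    return sum(p for p in (prob(x) for x in range(lo, hi + 1))
               if p <= p_obs * (1 + 1e-9))


# Hypothetical counts (NOT from the study): (with metastasis, without metastasis).
control_group = (5, 1)
treated_group = (1, 5)
p_value = fisher_exact_two_sided(*control_group, *treated_group)
print(f"two-sided p = {p_value:.4f}")
```

With group sizes this small, even a large difference in proportions may not reach P < 0.05, which is one reason exact tests are preferred over chi-square approximations for such tables.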
Hippo kinases maintain polarity during directional cell migration in Caenorhabditis elegans.

PubMed

Feng, Guoxin; Zhu, Zhiwen; Li, Wen-Jun; Lin, Qirong; Chai, Yongping; Dong, Meng-Qiu; Ou, Guangshuo

2017-02-01

Precise positioning of cells is crucial for metazoan development. Despite immense progress in the elucidation of the attractive cues of cell migration, the repulsive mechanisms that prevent the formation of secondary leading edges remain less investigated. Here, we demonstrate that Caenorhabditis elegans Hippo kinases promote cell migration along the anterior-posterior body axis via the inhibition of dorsal-ventral (DV) migration. Ectopic DV polarization was also demonstrated in gain-of-function mutant animals for C. elegans RhoG MIG-2. We identified serine 139 of MIG-2 as a novel conserved Hippo kinase phosphorylation site and demonstrated that purified Hippo kinases directly phosphorylate MIG-2 S139. Live imaging analysis of genome-edited animals indicates that MIG-2 S139 phosphorylation impedes actin assembly in migrating cells. Intriguingly, Hippo kinases are excluded from the leading edge in wild-type cells, while MIG-2 loss induces uniform distribution of Hippo kinases. We provide evidence that Hippo kinases inhibit RhoG activity locally and are in turn restricted to the cell body by RhoG-mediated polarization. Therefore, we propose that the Hippo-RhoG feedback regulation maintains cell polarity during directional cell motility. © 2016 The Authors.
Investigation of a miRNA-Induced Gene Silencing Technique in Petunia Reveals Alterations in miR173 Precursor Processing and the Accumulation of Secondary siRNAs from Endogenous Genes.

PubMed

Han, Yao; Zhang, Bin; Qin, Xiaoting; Li, Mingyang; Guo, Yulong

2015-01-01

MIGS (miRNA-induced gene silencing) is a straightforward and efficient gene silencing technique in Arabidopsis. It works by exploiting miR173 to trigger the production of phasiRNAs (phased small interfering RNAs). MIGS can be used in plant species other than Arabidopsis by co-expression of miR173 and target gene fragments fused to an upstream miR173 target site. However, the efficiency and technical mechanisms have not been thoroughly investigated in other plants. In this work, two vectors, pMIGS-chs and pMIGS-pds, were constructed and transformed into petunia plants. The transgenic plants showed CHS (chalcone synthase) and PDS (phytoene desaturase) gene-silencing phenotypes respectively, indicating that MIGS functions in petunia. MIGS-chs plants were used to investigate the mechanisms of this technique in petunia. Results of 5'- RACE showed that the miR173 target site was cleaved at the expected position and that endogenous CHS genes were cut at multiple positions. Small RNA deep sequencing analysis showed that the processing of Arabidopsis miR173 precursors in MIGS-chs transgenic petunia plants did not occur in exactly the same way as in Arabidopsis, suggesting differences in the machinery of miRNA processing between plant species. Small RNAs in-phase with the miR173 cleavage register were produced immediately downstream from the cleavage site and out-of-phase small RNAs were accumulated at relatively high levels from processing cycle 5 onwards. Secondary siRNAs were generated from multiple sites of endogenous CHS-A and CHS-J genes, indicating that miR173 cleavage induced siRNAs have the same ability to initiate siRNA transitivity as the siRNAs functioning in co-suppression and hpRNA silencing. On account of the simplicity of vector construction and the transitive amplification of signals from endogenous transcripts, MIGS is a good alternative gene silencing method for plants, especially for silencing a cluster of homologous genes with redundant functions.
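The distinction drawn above between in-phase and out-of-phase small RNAs, and the reference to "processing cycle 5", can be illustrated with a little arithmetic on mapping coordinates. The sketch below assumes a 21-nt phasing register (the abstract does not state the length), same-strand reads, and 1-based coordinates in which the cleavage site is the first nucleotide downstream of the cut. All names and positions are hypothetical.

```python
PHASE_LENGTH = 21  # assumed register length in nucleotides; not stated in the abstract


def phase_info(start, cleavage_site, phase_length=PHASE_LENGTH):
    """Return (cycle, offset) for a small RNA whose 5' end maps to `start`.

    `cycle` counts processing cycles from 1 at the cleavage site;
    `offset` is the distance in nucleotides from the start of that cycle.
    """
    distance = start - cleavage_site
    if distance < 0:
        raise ValueError("read maps upstream of the cleavage site")
    cycle, offset = divmod(distance, phase_length)
    return cycle + 1, offset


def is_in_phase(start, cleavage_site, phase_length=PHASE_LENGTH):
    """A read is in phase when its 5' end falls exactly on a register boundary."""
    return phase_info(start, cleavage_site, phase_length)[1] == 0


# Hypothetical 5' end positions of mapped small RNAs (illustration only).
cleavage_site = 100
for start in (100, 121, 130, 184):
    cycle, offset = phase_info(start, cleavage_site)
    label = "in phase" if offset == 0 else f"out of phase by {offset} nt"
    print(f"start {start}: cycle {cycle}, {label}")
```

A real analysis would also handle reads from the opposite strand, whose register is shifted by the 2-nt 3' overhang left by Dicer-like processing; that case is omitted here for brevity.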
Investigation of a miRNA-Induced Gene Silencing Technique in Petunia Reveals Alterations in miR173 Precursor Processing and the Accumulation of Secondary siRNAs from Endogenous Genes

PubMed Central

Han, Yao; Zhang, Bin; Qin, Xiaoting; Li, Mingyang; Guo, Yulong

2015-01-01

MIGS (miRNA-induced gene silencing) is a straightforward and efficient gene silencing technique in Arabidopsis. It works by exploiting miR173 to trigger the production of phasiRNAs (phased small interfering RNAs). MIGS can be used in plant species other than Arabidopsis by co-expression of miR173 and target gene fragments fused to an upstream miR173 target site. However, the efficiency and technical mechanisms have not been thoroughly investigated in other plants. In this work, two vectors, pMIGS-chs and pMIGS-pds, were constructed and transformed into petunia plants. The transgenic plants showed CHS (chalcone synthase) and PDS (phytoene desaturase) gene-silencing phenotypes respectively, indicating that MIGS functions in petunia. MIGS-chs plants were used to investigate the mechanisms of this technique in petunia. Results of 5′- RACE showed that the miR173 target site was cleaved at the expected position and that endogenous CHS genes were cut at multiple positions. Small RNA deep sequencing analysis showed that the processing of Arabidopsis miR173 precursors in MIGS-chs transgenic petunia plants did not occur in exactly the same way as in Arabidopsis, suggesting differences in the machinery of miRNA processing between plant species. Small RNAs in-phase with the miR173 cleavage register were produced immediately downstream from the cleavage site and out-of-phase small RNAs were accumulated at relatively high levels from processing cycle 5 onwards. Secondary siRNAs were generated from multiple sites of endogenous CHS-A and CHS-J genes, indicating that miR173 cleavage induced siRNAs have the same ability to initiate siRNA transitivity as the siRNAs functioning in co-suppression and hpRNA silencing. On account of the simplicity of vector construction and the transitive amplification of signals from endogenous transcripts, MIGS is a good alternative gene silencing method for plants, especially for silencing a cluster of homologous genes with redundant functions. 
PMID:26658695
Habitat-Lite: A GSC case study based on free text terms for environmental metadata

DOE Office of Scientific and Technical Information (OSTI.GOV)

Kyrpides, Nikos; Hirschman, Lynette; Clark, Cheryl

2008-04-01

There is an urgent need to capture metadata on the rapidly growing number of genomic, metagenomic and related sequences, such as 16S ribosomal genes. This need is a major focus within the Genomic Standards Consortium (GSC), and Habitat is a key metadata descriptor in the proposed 'Minimum Information about a Genome Sequence' (MIGS) specification. The goal of the work described here is to provide a light-weight, easy-to-use (small) set of terms ('Habitat-Lite') that captures high-level information about habitat while preserving a mapping to the recently launched Environment Ontology (EnvO). Our motivation for building Habitat-Lite is to meet the needs of multiple users, such as annotators curating these data, database providers hosting the data, and biologists and bioinformaticians alike who need to search and employ such data in comparative analyses. Here, we report a case study based on semi-automated identification of terms from GenBank and GOLD. We estimate that the terms in the initial version of Habitat-Lite would provide useful labels for over 60% of the kinds of information found in the GenBank isolation-source field, and around 85% of the terms in the GOLD habitat field. We present a revised version of Habitat-Lite and invite the community's feedback on its further development in order to provide a minimum list of terms to capture high-level habitat information and to provide classification bins needed for future studies.
Comparison between hybrid laser-MIG welding and MIG welding for the invar36 alloy

NASA Astrophysics Data System (ADS)

Zhan, Xiaohong; Li, Yubo; Ou, Wenmin; Yu, Fengyi; Chen, Jie; Wei, Yanhong

2016-11-01

The invar36 alloy is suitable for producing molds for composite material structures because its thermal expansion coefficient is similar to that of composite materials. In the present paper, MIG welding and laser-MIG hybrid welding are compared to identify the more appropriate method for overcoming the poor weldability of the invar36 alloy. Analysis of the experimental and simulated results shows that the combined Gauss and cone heat source model characterizes the laser-MIG hybrid welding heat source well. The total welding time of MIG welding is 8 times that of hybrid laser-MIG welding. The welding material consumption of MIG welding is about 4 times that of hybrid laser-MIG welding. The stress and deformation simulations indicate that the peak value of deformation during MIG welding is 3 times larger than that of hybrid laser-MIG welding.
Mig-6 regulates endometrial genes involved in cell cycle and progesterone signaling

DOE Office of Scientific and Technical Information (OSTI.GOV)

Yoo, Jung-Yoon; Kim, Tae Hoon; Lee, Jae Hee

2015-07-10

Mitogen inducible gene 6 (Mig-6) is an important mediator of progesterone (P4) signaling to inhibit estrogen (E2) signaling in the uterus. Ablation of Mig-6 in the murine uterus leads to the development of endometrial hyperplasia and E2-induced endometrial cancer. To identify the molecular pathways regulated by Mig-6, we performed microarray analysis on the uterus of ovariectomized Mig-6^f/f and PGR^cre/+ Mig-6^f/f (Mig-6^d/d) mice treated with vehicle or P4 for 6 h. The results revealed that 772 transcripts were significantly regulated in the Mig-6^d/d uterus treated with vehicle as compared with Mig-6^f/f mice. The pathway analysis showed that Mig-6 suppressed the expression of genes related to cell cycle regulation in the absence of ovarian steroid hormone. The epithelium of Mig-6^d/d mice showed a significant increase in the number of proliferative cells compared to Mig-6^f/f mice. This microarray analysis also revealed that 324 genes are regulated by P4 as well as Mig-6. Cited2, the developmentally important transcription factor, was identified as being regulated by the P4-Mig-6 axis. To determine the role of Cited2 in the uterus, we used mice in which Cited2 was conditionally ablated in progesterone receptor-positive cells (PGR^cre/+ Cited2^f/f; Cited2^d/d). Ablation of Cited2 in the uterus resulted in a significant reduction in the ability of the uterus to undergo a hormonally induced decidual reaction. Identification and analysis of these responsive genes will help define the role of P4 as well as Mig-6 in regulating uterine biology. - Highlights: • We identify Mig-6- and P4-regulated uterine genes by microarray analysis. • Mig-6 suppresses cell cycle progression and epithelial cell proliferation in uterus. • We identify the Mig-6-dependent genes induced by P4. • Cited2 plays an important role for decidualization as a P4 and Mig-6 target gene.
Migraine and Complex Regional Pain Syndrome: A Case-Referent Clinical Study

PubMed Central

Cooley, Corinne; Foley-Saldena, Katharine; Cowan, Robert P.

2017-01-01

We studied clinical phenotype differences between migraineurs with CRPS (Mig + CRPS) and those without (Mig − CRPS). Mig + CRPS cases and Mig − CRPS referents aged ≥18 years were enrolled. Diagnosis was made in accordance with International Classification of Headache Disorders-3 beta (ICHD-3 beta) for migraine and Budapest Criteria for CRPS. Migraines both with and without aura were included. A total of 70 Mig + CRPS cases (13% males, mean age 48 years) and 80 Mig − CRPS referents (17% males, mean age 51 years) were included. 33% of Mig + CRPS and 38% of Mig − CRPS exhibited episodic migraine (EM) while 66% of Mig + CRPS and 62% of Mig − CRPS had chronic migraine (CM) (OR = 0.98, CI 0.36, 2.67). Median duration of CRPS was 3 years among EM + CRPS and 6 years among CM + CRPS cohort (p < 0.02). Mig + CRPS (57%) carried higher psychological and medical comorbidities compared to Mig − CRPS (6%) (OR 16.7, CI 10.2, 23.6). Higher migraine frequency was associated with longer CRPS duration. Migraineurs who developed CRPS had higher prevalence of psychological and medical disorders. Alleviating migraineurs' psychological and medical comorbidities may help lower CRPS occurrence. PMID:29214172
Exploring the interactions of EGFR with phosphorylated Mig6 by molecular dynamics simulations and MM-PBSA calculations.

PubMed

Zhang, Yue; Zheng, Qing-Chuan

2018-06-14

Mig6, a negative regulator, binds directly to the epidermal growth factor receptor (EGFR) via two regions, Mig6-segment1 and Mig6-segment2. Mig6 requires phosphorylation of Y394 on Mig6-segment2 in order to inhibit EGFR. Two phosphorylation pathways for Y394 have been previously reported, and in the first, phosphorylation of Y394 may be primed by Y395 phosphorylation. However, the binding mechanism of phosphorylated Mig6-segment2 with EGFR has not been clearly elucidated. Focusing on the EGFR complex with phosphorylated Mig6-segment2, molecular dynamics (MD) simulations were performed to explore the interactions of Mig6-segment2 with EGFR. Our results indicate a probable phosphorylation pathway for Y394 and show that some key residues of EGFR play important roles in binding to phosphorylated Mig6-segment2. In addition, a special L-shaped structure was found to be possibly associated with irreversible inhibition of EGFR by Mig6. Our work can give meaningful information to better understand the phosphorylation pathways for Y394 and the interactions of EGFR binding to phosphorylated Mig6-segment2. Copyright © 2018 Elsevier Ltd. All rights reserved.
Study on factors affecting the droplet temperature in plasma MIG welding process

NASA Astrophysics Data System (ADS)

Mamat, Sarizam Bin; Tashiro, Shinichi; Tanaka, Manabu; Yusoff, Mahani

2018-04-01

In the present study, the mechanism to control droplet temperature in plasma MIG welding was discussed based on measurements of the droplet temperature for a wide range of MIG currents with different plasma electrode diameters. The measurements of the droplet temperatures were conducted using a two-color temperature measurement method. The droplet temperatures in plasma MIG welding were then compared with those in conventional MIG welding. As a result, the droplet temperature in plasma MIG welding was found to be reduced in comparison with conventional MIG welding under the same MIG current. Especially when the small plasma electrode diameter was used, the decrease in the droplet temperature reached up to 500 K. Also, for a particular wire feed speed (WFS), the droplet temperatures in plasma MIG welding were lower than those in conventional MIG welding. It is suggested that the use of plasma contributes to reducing the local heat input into the base metal by the droplet. The presence of the plasma surrounding the wire is considered to increase the electron density in its vicinity, resulting in the arc attachment expanding upwards along the wire surface to disperse the MIG current. This dispersion of MIG current causes a decrease in current density on the droplet surface, lowering the droplet temperature. Furthermore, dispersed MIG current also weakens the electromagnetic pinch force acting on the neck of the wire above the droplet. This leads to a larger droplet diameter with increased surface area through lower frequency of droplet detachment, which decreases the MIG current density on the droplet surface as compared to conventional MIG welding at the same MIG current. Thus, the lower droplet temperature is caused by the reduction of heat flux into the droplet. Consequently, the mechanism to control droplet temperature in plasma MIG welding was clarified.
The impact of MIG1 and/or MIG2 disruption on aerobic metabolism of succinate dehydrogenase negative Saccharomyces cerevisiae.

PubMed

Cao, Hailong; Yue, Min; Li, Shuguang; Bai, Xuefang; Zhao, Xiaoming; Du, Yuguang

2011-02-01

The zinc finger proteins Mig1 and Mig2 play important roles in glucose repression of Saccharomyces cerevisiae. To investigate whether the alleviation of glucose effect would result in an increase in aerobic succinate production, MIG1 and/or MIG2 were disrupted in a succinate dehydrogenase (SDH)-negative S. cerevisiae strain. Moreover, their impacts on physiology of the SDH-negative S. cerevisiae strain were studied under fully aerobic conditions when glucose was the sole carbon source. Our results showed that the succinate production for the SDH-negative S. cerevisiae was very low even under fully aerobic conditions. Furthermore, deletion of MIG1 and/or MIG2 did not result in an increase in succinate production in the SDH-negative S. cerevisiae strain. However, the synthesis of acetate was significantly affected by MIG1 deletion or in combination with MIG2 deletion. The acetate production for the mig1/mig2 double mutant BS2M was reduced by 69.72% compared to the parent strain B2S. In addition, the amount of ethanol produced by BS2M was slightly decreased. With the mig2 mutant BSM2, the concentrations of pyruvate and glycerol were increased by 26.23% and 15.28%, respectively, compared to the parent strain B2S.
The Transcriptional Response of Candida albicans to Weak Organic Acids, Carbon Source, and MIG1 Inactivation Unveils a Role for HGT16 in Mediating the Fungistatic Effect of Acetic Acid

PubMed Central

Cottier, Fabien; Tan, Alrina Shin Min; Yurieva, Marina; Liao, Webber; Lum, Josephine; Poidinger, Michael; Zolezzi, Francesca; Pavelka, Norman

2017-01-01

Candida albicans is a resident fungus of the human intestinal microflora. Commonly isolated at low abundance in healthy people, C. albicans outcompetes local microbiota during candidiasis episodes. Under normal conditions, members of the human gastrointestinal (GI) microbiota were shown to keep C. albicans colonization under control. By releasing weak organic acids (WOAs), bacteria are able to moderate yeast growth. This mechanism displays a synergistic effect in vitro with the absence of glucose in medium of culture, which underlines the complex interactions that C. albicans faces in its natural environment. Inactivation of the transcriptional regulator MIG1 in C. albicans results in a lack of sensitivity to this synergistic outcome. To decipher C. albicans transcriptional responses to glucose, WOAs, and the role of MIG1, we performed RNA sequencing (RNA-seq) on four biological replicates exposed to combinations of these three parameters. We were able to characterize the (i) glucose response, (ii) response to acetic and butyric acid, (iii) MIG1 regulation of C. albicans, and (iv) genes responsible for WOA resistance. We identified a group of six genes linked to WOA sensitivity in a glucose-MIG1-dependent manner and inactivated one of these genes, the putative glucose transporter HGT16, in a SC5314 wild-type background. As expected, the mutant displayed a partial complementation to WOA resistance in the absence of glucose. This result points toward a mechanism of WOA sensitivity in C. albicans involving membrane transporters, which could be exploited to control yeast colonization in human body niches. PMID:28877970

Type I γ Phosphatidylinositol Phosphate 5-Kinase i5 Controls the Ubiquitination and Degradation of the Tumor Suppressor Mitogen-inducible Gene 6

PubMed Central

Sun, Ming; Cai, Jinyang; Anderson, Richard A.; Sun, Yue

2016-01-01

Mitogen-inducible gene 6 (Mig6) is a tumor suppressor, and the disruption of Mig6 expression is associated with cancer development. Mig6 directly interacts with epidermal growth factor receptor (EGFR) to suppress the activation and downstream signaling of EGFR. Therefore, loss of Mig6 enhances EGFR-mediated signaling and promotes EGFR-dependent carcinogenesis. The molecular mechanism modulating Mig6 expression in cancer remains unclear. Here we demonstrate that type I γ phosphatidylinositol phosphate 5-kinase i5 (PIPKIγi5), an enzyme producing phosphatidylinositol 4,5-bisphosphate (PtdIns(4,5)P2), stabilizes Mig6 expression. Knockdown of PIPKIγi5 leads to the loss of Mig6 expression, which dramatically enhances and prolongs EGFR-mediated cell signaling. Loss of PIPKIγi5 significantly promotes Mig6 protein degradation via proteasomes, but it does not affect the Mig6 mRNA level. PIPKIγi5 directly interacts with the E3 ubiquitin ligase neuronal precursor cell-expressed developmentally down-regulated 4-1 (NEDD4-1). The C-terminal domain of PIPKIγi5 and the WW1 and WW2 domains of NEDD4-1 are required for their interaction. The C2 domain of NEDD4-1 is required for its interaction with PtdIns(4,5)P2. By binding with NEDD4-1 and producing PtdIns(4,5)P2, PIPKIγi5 perturbs NEDD4-1-mediated Mig6 ubiquitination and the subsequent proteasomal degradation. Thus, loss of NEDD4-1 can rescue Mig6 expression in PIPKIγi5 knockdown cells. In this way, PIPKIγi5, NEDD4-1, and Mig6 form a novel molecular nexus that controls EGFR activation and downstream signaling. PMID:27557663
Study on Microstructure and Mechanical Properties of 304 Stainless Steel Joints by Tig-Mig Hybrid Welding

NASA Astrophysics Data System (ADS)

Ogundimu, Emmanuel O.; Akinlabi, Esther T.; Erinosho, Mutiu F.

Stainless steel is a family of Fe-based alloys with excellent resistance to corrosion and, as such, has been used widely for kitchen utensils, transportation, building construction and much more. This paper presents the work conducted on the material characterization of a tungsten inert gas (TIG)-metal inert gas (MIG) hybrid welded joint of type 304 austenitic stainless steel. The welding processes were conducted in three phases: MIG welding using a current of 170 A, TIG welding using a current of 190 A, and hybrid TIG-MIG welding with currents of 190/170 A, respectively. The MIG, TIG, and hybrid TIG-MIG weldments were characterized by incomplete penetration, full penetration and excess penetration of the weld, respectively. Intergranular austenite was created toward the transition and heat-affected zones. The delta ferrite (δ-Fe) formed in the microstructure of the TIG weld is thicker than that formed in the microstructures of the MIG and hybrid TIG-MIG welds. The TIG-MIG hybrid weld produced at currents of 190/170 A has the highest ultimate tensile strength and percentage elongation, 397.72 MPa and 35.7%, respectively. TIG-MIG hybrid welding can be recommended for high-tech industrial applications such as the nuclear, aircraft, food processing, and automobile industries.
Type I γ Phosphatidylinositol Phosphate 5-Kinase i5 Controls the Ubiquitination and Degradation of the Tumor Suppressor Mitogen-inducible Gene 6.

PubMed

Sun, Ming; Cai, Jinyang; Anderson, Richard A; Sun, Yue

2016-10-07

Mitogen-inducible gene 6 (Mig6) is a tumor suppressor, and the disruption of Mig6 expression is associated with cancer development. Mig6 directly interacts with epidermal growth factor receptor (EGFR) to suppress the activation and downstream signaling of EGFR. Therefore, loss of Mig6 enhances EGFR-mediated signaling and promotes EGFR-dependent carcinogenesis. The molecular mechanism modulating Mig6 expression in cancer remains unclear. Here we demonstrate that type I γ phosphatidylinositol phosphate 5-kinase i5 (PIPKIγi5), an enzyme producing phosphatidylinositol 4,5-bisphosphate (PtdIns(4,5)P2), stabilizes Mig6 expression. Knockdown of PIPKIγi5 leads to the loss of Mig6 expression, which dramatically enhances and prolongs EGFR-mediated cell signaling. Loss of PIPKIγi5 significantly promotes Mig6 protein degradation via proteasomes, but it does not affect the Mig6 mRNA level. PIPKIγi5 directly interacts with the E3 ubiquitin ligase neuronal precursor cell-expressed developmentally down-regulated 4-1 (NEDD4-1). The C-terminal domain of PIPKIγi5 and the WW1 and WW2 domains of NEDD4-1 are required for their interaction. The C2 domain of NEDD4-1 is required for its interaction with PtdIns(4,5)P2. By binding with NEDD4-1 and producing PtdIns(4,5)P2, PIPKIγi5 perturbs NEDD4-1-mediated Mig6 ubiquitination and the subsequent proteasomal degradation. Thus, loss of NEDD4-1 can rescue Mig6 expression in PIPKIγi5 knockdown cells. In this way, PIPKIγi5, NEDD4-1, and Mig6 form a novel molecular nexus that controls EGFR activation and downstream signaling. © 2016 by The American Society for Biochemistry and Molecular Biology, Inc.
Influence of shielding gas pressure on welding characteristics in CO2 laser-MIG hybrid welding process

NASA Astrophysics Data System (ADS)

Chen, Yanbin; Lei, Zhenglong; Li, Liqun; Wu, Lin

2006-01-01

The droplet transfer behavior and weld characteristics have been investigated under different pressures of shielding gas in CO2 laser and metal inert/active gas (laser-MIG) hybrid welding process. The experimental results indicate that the inherent droplet transfer frequency and stable welding range of conventional MIG arc are changed due to the interaction between CO2 laser beam and MIG arc in laser-MIG hybrid welding process, and the shielding gas pressure has a crucial effect on welding characteristics. When the pressure of shielding gas is low in comparison with MIG welding, the frequency of droplet transfer decreases, and the droplet transfer becomes unstable in laser-MIG hybrid welding. So the penetration depth decreases, which shows the characteristic of unstable hybrid welding. However, when the pressure of shielding gas increases to a critical value, the hybrid welding characteristic is changed from unstable hybrid welding to stable hybrid welding, and the frequency of droplet transfer and the penetration depth increase significantly.
Mitogen-Inducible Gene-6 Mediates Feedback Inhibition from Mutated BRAF towards the Epidermal Growth Factor Receptor and Thereby Limits Malignant Transformation

PubMed Central

Milewska, Malgorzata; Romano, David; Herrero, Ana; Guerriero, Maria Luisa; Birtwistle, Marc; Quehenberger, Franz; Hatzl, Stefan; Kholodenko, Boris N.; Segatto, Oreste; Kolch, Walter; Zebisch, Armin

2015-01-01

BRAF functions in the RAS-extracellular signal-regulated kinase (ERK) signaling cascade. Activation of this pathway is necessary to mediate the transforming potential of oncogenic BRAF, however, it may also cause a negative feedback that inhibits the epidermal growth factor receptor (EGFR). Mitogen-inducible gene-6 (MIG-6) is a potent inhibitor of the EGFR and has been demonstrated to function as a tumor suppressor. As MIG-6 can be induced via RAS-ERK signaling, we investigated its potential involvement in this negative regulatory loop. Focus formation assays were performed and demonstrated that MIG-6 significantly reduces malignant transformation induced by oncogenic BRAF. Although this genetic interaction was mirrored by a physical interaction between MIG-6 and BRAF, we did not observe a direct regulation of BRAF kinase activity by MIG-6. Interestingly, a selective chemical EGFR inhibitor suppressed transformation to a similar degree as MIG-6, whereas combining these approaches had no synergistic effect. By analyzing a range of BRAF mutated and wildtype cell line models, we could show that BRAF V600E causes a strong upregulation of MIG-6, which was mediated at the transcriptional level via the RAS-ERK pathway and resulted in downregulation of EGFR activation. This feedback loop is operational in tumors, as shown by the analysis of almost 400 patients with papillary thyroid cancer (PTC). Presence of BRAF V600E correlated with increased MIG-6 expression on the one hand, and with inactivation of the EGFR and of PI3K/AKT signaling on the other hand. Importantly, we also observed a more aggressive disease phenotype when BRAF V600E coexisted with low MIG-6 expression. Finally, analysis of methylation data was performed and revealed that higher methylation of MIG-6 correlated to its decreased expression. 
Taken together, we demonstrate that MIG-6 efficiently reduces cellular transformation driven by oncogenic BRAF by orchestrating a negative feedback circuit directed towards the EGFR. PMID:26065894
Assessment of the biological effects of welding fumes emitted from metal inert gas welding processes of aluminium and zinc-plated materials in humans.

PubMed

Hartmann, L; Bauer, M; Bertram, J; Gube, M; Lenz, K; Reisgen, U; Schettgen, T; Kraus, T; Brand, P

2014-03-01

The aim of this study was to investigate biological effects and potential health risks due to two different metal-inert-gas (MIG) welding fumes (MIG welding of aluminium and MIG soldering of zinc coated steel) in healthy humans. In a threefold cross-over design study 12 male subjects were exposed to three different exposure scenarios. Exposures were performed under controlled conditions in the Aachener Workplace Simulation Laboratory (AWSL). On three different days the subjects were either exposed to filtered ambient air, to welding fumes from MIG welding of aluminium, or to fumes from MIG soldering of zinc coated materials. Exposure was performed for 6 h and the average fume concentration was 2.5 mg m⁻³. Before, directly after, 1 day after, and 7 days after exposure spirometric and impulse oscillometric measurements were performed, exhaled breath condensate (EBC) was collected and blood samples were taken and analyzed for inflammatory markers. During MIG welding of aluminium high ozone concentrations (up to 250 μg m⁻³) were observed, whereas ozone was negligible for MIG soldering. For MIG soldering, concentrations of high-sensitivity CRP (hsCRP) and factor VIII were significantly increased but remained mostly within the normal range. The concentration of neutrophils tended to increase. For MIG welding of aluminium, the lung function showed significant decreases in Peak Expiratory Flow (PEF) and Mean Expiratory Flow at 75% vital capacity (MEF 75) 7 days after exposure. The concentration of ristocetin cofactor was increased. The observed increase of hsCRP during MIG soldering can be understood as an indicator for asymptomatic systemic inflammation probably due to zinc (zinc concentration 1.5 mg m⁻³). The change in lung function observed after MIG welding of aluminium may be attributed to ozone inhalation, although the late response (7 days after exposure) is surprising. Copyright © 2013 Elsevier GmbH. All rights reserved.
MIG-6 negatively regulates STAT3 phosphorylation in uterine epithelial cells

PubMed Central

Yoo, Jung-Yoon; Yang, Woo Sub; Lee, Jae Hee; Kim, Byung Gak; Broaddus, Russell R.; Lim, Jeong M.; Kim, Tae Hoon; Jeong, Jae-Wook

2017-01-01

Endometrial cancer is the most common malignancy of the female genital tract. Progesterone (P4) has been used for several decades in endometrial cancer treatment, especially in women who wish to retain fertility. However, it is unpredictable which patients will respond to P4 treatment and which may have a P4 resistant cancer. Therefore, identifying the mechanism of P4 resistance is essential to improve the therapies for endometrial cancer. Mitogen-inducible gene 6 (Mig-6) is a critical mediator of progesterone receptor (PGR) action in the uterus. In order to study the function of Mig-6 in P4 resistance, we generated a mouse model in which we specifically ablated Mig-6 in uterine epithelial cells using Sprr2f-cre mice (Sprr2fcre+Mig-6f/f). Female mutant mice develop endometrial hyperplasia due to aberrant phosphorylation of STAT3 and proliferation of the endometrial epithelial cells. The results from our immunoprecipitation and cell culture experiments showed that MIG-6 inhibited phosphorylation of STAT3 via protein interactions. Our previous study showed P4 resistance in mice with Mig-6 ablation in Pgr positive cells (Pgrcre/+Mig-6f/f). However, Sprr2fcre+Mig-6f/f mice were P4 responsive. P4 treatment significantly decreased STAT3 phosphorylation and epithelial proliferation in the uterus of mutant mice. We showed that Mig-6 has an important function of tumor suppressor via inhibition of STAT3 phosphorylation in uterine epithelial cells and the anti-tumor effects of P4 are mediated by the endometrial stroma. This data helps to develop a new signaling pathway in the regulation of steroid hormones in the uterus, and to overcome P4 resistance in human reproductive diseases, such as endometrial cancer. PMID:28925396
Postoperative Outcomes of Minimally Invasive Gastrectomy Versus Open Gastrectomy During the Early Introduction of Minimally Invasive Gastrectomy in the Netherlands: A Population-based Cohort Study.

PubMed

Brenkman, Hylke J F; Gisbertz, Suzanne S; Slaman, Annelijn E; Goense, Lucas; Ruurda, Jelle P; van Berge Henegouwen, Mark I; van Hillegersberg, Richard

2017-11-01

To compare postoperative outcomes of minimally invasive gastrectomy (MIG) to open gastrectomy (OG) for cancer during the introduction of MIG in the Netherlands. Between 2011 and 2015, the use of MIG increased from 4% to 53% in the Netherlands. This population-based cohort study included all patients with curable gastric adenocarcinoma that underwent gastrectomy between 2011 and 2015, registered in the Dutch Upper GI Cancer Audit. Patients with missing preoperative data, and patients in whom no lymphadenectomy or reconstruction was performed were excluded. Propensity score matching was applied to create comparable groups between patients receiving MIG or OG, using year of surgery and other potential confounders. Morbidity, mortality, and hospital stay were evaluated. Of the 1697 eligible patients, 813 were discarded after propensity score matching; 442 and 442 patients who underwent MIG and OG, respectively, remained. Conversions occurred in 10% of the patients during MIG. Although the overall postoperative morbidity (37% vs 40%, P = 0.489) and mortality rates (6% vs 4%, P = 0.214) were comparable between the 2 groups, patients who underwent MIG experienced less wound complications (2% vs 5%, P = 0.006). Anastomotic leakage occurred in 8% of the patients after MIG, and in 7% after OG (P = 0.525). The median hospital stay declined over the years for both procedures (11 to 8 days, P < 0.001). Overall, hospital stay was shorter after MIG compared with OG (8 vs 10 days, P < 0.001). MIG was safely introduced in the Netherlands, with overall morbidity and mortality comparable with OG, less wound complications and shorter hospitalization.
Defining linkages between the GSC and NSF's LTER program: How the Ecological Metadata Language (EML) relates to GCDML and other outcomes

DOE Office of Scientific and Technical Information (OSTI.GOV)

Inigo, Gil San; Servilla, Mark; Brunt, James

2008-06-01

The Genomic Standards Consortium (GSC) invited a representative of the Long-Term Ecological Research (LTER) to its fifth workshop to present the Ecological Metadata Language (EML) metadata standard and its relationship to the Minimum Information about a Genome/Metagenome Sequence (MIGS/MIMS) and its implementation, the Genomic Contextual Data Markup Language (GCDML). The LTER is one of the top National Science Foundation (NSF) programs in biology since 1980, representing diverse ecosystems and creating long-term, interdisciplinary research, synthesis of information, and theory. The adoption of EML as the LTER network standard has been key to building network synthesis architectures based on high-quality standardized metadata. EML is the NSF-recognized metadata standard for LTER, and EML is a criterion used to review the LTER program progress. At the workshop, a potential crosswalk between the GCDML and EML was explored. Also, collaboration between the LTER and GSC developers was proposed to join efforts toward a common metadata cataloging designer's tool. The community adoption success of a metadata standard depends, among other factors, on the tools and trainings developed to use the standard. LTER's experience in embracing EML may help GSC to achieve similar success. A possible collaboration between LTER and GSC to provide training opportunities for GCDML and the associated tools is being explored. Finally, LTER is investigating EML enhancements to better accommodate genomics data, possibly integrating the GCDML schema into EML. All these action items have been accepted by the LTER contingent, and further collaboration between the GSC and LTER is expected.
Defining linkages between the GSC and NSF's LTER program: how the Ecological Metadata Language (EML) relates to GCDML and other outcomes.

PubMed

Gil, Inigo San; Sheldon, Wade; Schmidt, Tom; Servilla, Mark; Aguilar, Raul; Gries, Corinna; Gray, Tanya; Field, Dawn; Cole, James; Pan, Jerry Yun; Palanisamy, Giri; Henshaw, Donald; O'Brien, Margaret; Kinkel, Linda; McMahon, Katherine; Kottmann, Renzo; Amaral-Zettler, Linda; Hobbie, John; Goldstein, Philip; Guralnick, Robert P; Brunt, James; Michener, William K

2008-06-01

The Genomic Standards Consortium (GSC) invited a representative of the Long-Term Ecological Research (LTER) to its fifth workshop to present the Ecological Metadata Language (EML) metadata standard and its relationship to the Minimum Information about a Genome/Metagenome Sequence (MIGS/MIMS) and its implementation, the Genomic Contextual Data Markup Language (GCDML). The LTER is one of the top National Science Foundation (NSF) programs in biology since 1980, representing diverse ecosystems and creating long-term, interdisciplinary research, synthesis of information, and theory. The adoption of EML as the LTER network standard has been key to building network synthesis architectures based on high-quality standardized metadata. EML is the NSF-recognized metadata standard for LTER, and EML is a criterion used to review the LTER program progress. At the workshop, a potential crosswalk between the GCDML and EML was explored. Also, collaboration between the LTER and GSC developers was proposed to join efforts toward a common metadata cataloging designer's tool. The community adoption success of a metadata standard depends, among other factors, on the tools and trainings developed to use the standard. LTER's experience in embracing EML may help GSC to achieve similar success. A possible collaboration between LTER and GSC to provide training opportunities for GCDML and the associated tools is being explored. Finally, LTER is investigating EML enhancements to better accommodate genomics data, possibly integrating the GCDML schema into EML. All these action items have been accepted by the LTER contingent, and further collaboration between the GSC and LTER is expected.
Transmembrane protein MIG-13 links the Wnt signaling and Hox genes to the cell polarity in neuronal migration

PubMed Central

Wang, Xiangming; Zhou, Fanli; Lv, Sijing; Yi, Peishan; Zhu, Zhiwen; Yang, Yihong; Feng, Guoxin; Li, Wei; Ou, Guangshuo

2013-01-01

Directional cell migration is a fundamental process in neural development. In Caenorhabditis elegans, Q neuroblasts on the left (QL) and right (QR) sides of the animal generate cells that migrate in opposite directions along the anteroposterior body axis. The homeobox (Hox) gene lin-39 promotes the anterior migration of QR descendants (QR.x), whereas the canonical Wnt signaling pathway activates another Hox gene, mab-5, to ensure the QL descendants’ (QL.x) posterior migration. However, the regulatory targets of LIN-39 and MAB-5 remain elusive. Here, we showed that MIG-13, an evolutionarily conserved transmembrane protein, cell-autonomously regulates the asymmetric distribution of the actin cytoskeleton in the leading migratory edge. We identified mig-13 as a cellular target of LIN-39 and MAB-5. LIN-39 establishes QR.x anterior polarity by binding to the mig-13 promoter and promoting mig-13 expression, whereas MAB-5 inhibits QL.x anterior polarity by associating with the lin-39 promoter and downregulating lin-39 and mig-13 expression. Thus, MIG-13 links the Wnt signaling and Hox genes that guide migrations, to the actin cytoskeleton, which executes the motility response in neuronal migration. PMID:23784779
Minimally-invasive glaucoma surgeries (MIGS) for open angle glaucoma: A systematic review and meta-analysis

PubMed Central

Maule, Milena; Ceccarelli, Manuela; Fea, Antonio Maria

2017-01-01

Background: MIGS have been developed as a surgical alternative for glaucomatous patients. Purpose: To analyze the change in intraocular pressure (IOP) and glaucoma medications using different MIGS devices (Trabectome, iStent, Excimer Laser Trabeculotomy (ELT), iStent Supra, CyPass, XEN, Hydrus, Fugo Blade, Ab interno canaloplasty, Gonioscopy-assisted transluminal trabeculotomy) as a solo procedure or in association with phacoemulsification. Methods: Randomized control trials (RCT) and non-RCT (non randomized comparative studies, NRS, and before-after studies) were included. Studies with at least one year of follow-up in patients affected by primary open angle glaucoma, pseudoexfoliative glaucoma or pigmentary glaucoma were considered. Risk of Bias assessment was performed using the Cochrane Risk of Bias and the ROBINS-I tools. The main outcome was the effect of MIGS devices compared to medical therapy, cataract surgery, other glaucoma surgeries and other MIGS on both IOP and use of glaucoma medications 12 months after surgery. Outcomes measures were the mean difference in the change of IOP and glaucoma medication compared to baseline at one and two years and all ocular adverse events. The current meta-analysis is registered on PROSPERO (reference n° CRD42016037280). Results: Over a total of 3,069 studies, nine RCT and 21 case series with a total of 2,928 eyes were included. Main concerns about risk of bias in RCTs were lack of blinding, allocation concealment and attrition bias while in non-RCTs they were represented by patients’ selection, masking of participants and co-intervention management. Limited evidence was found based on both RCTs and non RCTs that compared MIGS surgery with medical therapy or other MIGS. In before-after series, MIGS surgery seemed effective in lowering both IOP and glaucoma drug use. MIGS showed a good safety profile: IOP spikes were the most frequent complications and no cases of infection or BCVA loss due to glaucoma were reported.
Conclusions: Although MIGS seem efficient in the reduction of the IOP and glaucoma medication and show good safety profile, this evidence is mainly derived from non-comparative studies and further, good quality RCTs are warranted. PMID:28850575
Minimally-invasive glaucoma surgeries (MIGS) for open angle glaucoma: A systematic review and meta-analysis.

PubMed

Lavia, Carlo; Dallorto, Laura; Maule, Milena; Ceccarelli, Manuela; Fea, Antonio Maria

2017-01-01

MIGS have been developed as a surgical alternative for glaucomatous patients. To analyze the change in intraocular pressure (IOP) and glaucoma medications using different MIGS devices (Trabectome, iStent, Excimer Laser Trabeculotomy (ELT), iStent Supra, CyPass, XEN, Hydrus, Fugo Blade, Ab interno canaloplasty, Gonioscopy-assisted transluminal trabeculotomy) as a solo procedure or in association with phacoemulsification. Randomized control trials (RCT) and non-RCT (non randomized comparative studies, NRS, and before-after studies) were included. Studies with at least one year of follow-up in patients affected by primary open angle glaucoma, pseudoexfoliative glaucoma or pigmentary glaucoma were considered. Risk of Bias assessment was performed using the Cochrane Risk of Bias and the ROBINS-I tools. The main outcome was the effect of MIGS devices compared to medical therapy, cataract surgery, other glaucoma surgeries and other MIGS on both IOP and use of glaucoma medications 12 months after surgery. Outcomes measures were the mean difference in the change of IOP and glaucoma medication compared to baseline at one and two years and all ocular adverse events. The current meta-analysis is registered on PROSPERO (reference n° CRD42016037280). Over a total of 3,069 studies, nine RCT and 21 case series with a total of 2,928 eyes were included. Main concerns about risk of bias in RCTs were lack of blinding, allocation concealment and attrition bias while in non-RCTs they were represented by patients' selection, masking of participants and co-intervention management. Limited evidence was found based on both RCTs and non RCTs that compared MIGS surgery with medical therapy or other MIGS. In before-after series, MIGS surgery seemed effective in lowering both IOP and glaucoma drug use. MIGS showed a good safety profile: IOP spikes were the most frequent complications and no cases of infection or BCVA loss due to glaucoma were reported.
Although MIGS seem efficient in the reduction of the IOP and glaucoma medication and show good safety profile, this evidence is mainly derived from non-comparative studies and further, good quality RCTs are warranted.
Interactions of UNC-34 Enabled With Rac GTPases and the NIK Kinase MIG-15 in Caenorhabditis elegans Axon Pathfinding and Neuronal Migration

PubMed Central

Shakir, M. Afaq; Gill, Jason S.; Lundquist, Erik A.

2006-01-01

Many genes that affect axon pathfinding and cell migration have been identified. Mechanisms by which these genes and the molecules they encode interact with one another in pathways and networks to control developmental events are unclear. Rac GTPases, the cytoskeletal signaling molecule Enabled, and NIK kinase have all been implicated in regulating axon pathfinding and cell migration. Here we present evidence that, in Caenorhabditis elegans, three Rac GTPases, CED-10, RAC-2, and MIG-2, define three redundant pathways that each control axon pathfinding, and that the NIK kinase MIG-15 acts in each Rac pathway. Furthermore, we show that the Enabled molecule UNC-34 defines a fourth partially redundant pathway that acts in parallel to Rac/MIG-15 signaling in axon pathfinding. Enabled and the three Racs also act redundantly to mediate AQR and PQR neuronal cell migration. The Racs and UNC-34 Ena might all control the formation of actin-based protrusive structures (lamellipodia and filopodia) that mediate growth cone outgrowth and cell migration. MIG-15 does not act with the three Racs in execution of cell migration. Rather, MIG-15 affects direction of PQR neuronal migration, similar to UNC-40 and DPY-19, which control initial Q cell polarity, and Wnt signaling, which acts later to control Q cell-directed migration. MIG-2 Rac, which acts with CED-10 Rac, RAC-2 Rac, and UNC-34 Ena in axon pathfinding and cell migration, also acts with MIG-15 in PQR directional migration. PMID:16204220
Microprobe investigation of brittle segregates in aluminum MIG and TIG welds

NASA Technical Reports Server (NTRS)

Larssen, P. A.; Miller, E. L.

1968-01-01

Quantitative microprobe analysis of segregated particles in aluminum MIG (metal inert gas) and TIG (tungsten inert gas) welds indicated that there were about ten different kinds of particles, corresponding to ten different intermetallic compounds. Differences between MIG and TIG welds related to the individual cooling rates of these welds.
Savings estimate for a Medicare insured group

PubMed Central

Birnbaum, Howard; Holland, Stephen K.; Lenhart, Gregory; Reilly, Helena L.; Hoffman, Kevin; Pardo, Dennis P.

1991-01-01

Estimates of the savings potential of a managed-care program for a Medicare retiree population in Michigan under a hypothetical Medicare insured group (MIG) are presented in this article. In return for receiving an experience-rated capitation payment, a MIG would administer all Medicare and employer complementary benefits for its enrollees. A study of the financial and operational feasibility of implementing a MIG for retirees of a national corporation involving an analysis of 1986 claims data finds that selected managed-care initiatives implemented by a MIG would generate an annual savings of 3.8 percent of total (Medicare plus complementary) expenditures. Although savings are less than the 5 percent to be retained by Medicare, this finding illustrates the potential for savings from managed-care initiatives to Medicare generally and to MIGs elsewhere, where savings may be greater if constraints are less restrictive. PMID:10113700
IP-10 and MIG are compartmentalized at the site of disease during pleural and meningeal tuberculosis and are decreased after antituberculosis treatment.

PubMed

Yang, Qianting; Cai, Yi; Zhao, Wei; Wu, Fan; Zhang, Mingxia; Luo, Kai; Zhang, Yan; Liu, Haiying; Zhou, Boping; Kornfeld, Hardy; Chen, Xinchun

2014-12-01

The diagnosis of active tuberculosis (TB) disease remains a challenge, especially in high-burden settings. Cytokines and chemokines are important in the pathogenesis of TB. Here we investigate the usefulness of circulating and compartmentalized cytokines/chemokines for diagnosis of TB. The levels of multiple cytokines/chemokines in plasma, pleural fluid (PF), and cerebrospinal fluid (CSF) were determined by Luminex liquid array-based multiplexed immunoassays. Three of 26 cytokines/chemokines in plasma were significantly different between TB and latent tuberculosis infection (LTBI). Among them, IP-10 and MIG had the highest diagnostic values, with an area under the receiver operating characteristic curve (ROC AUC) of 0.92 for IP-10 and 0.86 for MIG for distinguishing TB from LTBI. However, IP-10 and MIG levels in plasma were not different between TB and non-TB lung disease. In contrast, compartmentalized IP-10 and MIG in the PF and CSF showed promising diagnostic values in discriminating TB and non-TB pleural effusion (AUC = 0.87 for IP-10 and 0.93 for MIG), as well as TB meningitis and non-TB meningitis (AUC = 0.9 for IP-10 and 0.95 for MIG). A longitudinal study showed that the plasma levels of IP-10, MIG, granulocyte colony-stimulating factor (G-CSF), and gamma interferon (IFN-γ) decreased, while the levels of MCP-1/CCL2 and eotaxin-1/CCL11 increased, after successful treatment of TB. Our findings provide a practical methodology for discriminating active TB from LTBI by sequential IFN-γ release assays (IGRAs) and plasma IP-10 testing, while increased IP-10 and MIG at the site of infection (PF or CSF) can be used as a marker for distinguishing pleural effusion and meningitis caused by TB from those of non-TB origins. Copyright © 2014, American Society for Microbiology. All Rights Reserved.
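The ROC AUC figures quoted in this abstract have a simple pairwise reading: an AUC of 0.92 means that, for a randomly chosen TB patient and a randomly chosen LTBI subject, the TB patient shows the higher marker level about 92% of the time. A minimal Python sketch of that interpretation follows. The function name and the concentrations are invented for illustration; they are not data or code from the study.

```python
from typing import Sequence


def roc_auc(cases: Sequence[float], controls: Sequence[float]) -> float:
    """AUC as the fraction of (case, control) pairs in which the case
    has the higher marker level; ties count as half a pair."""
    if not cases or not controls:
        raise ValueError("both groups must be non-empty")
    wins = 0.0
    for case_value in cases:
        for control_value in controls:
            if case_value > control_value:
                wins += 1.0
            elif case_value == control_value:
                wins += 0.5
    return wins / (len(cases) * len(controls))


# Hypothetical plasma marker levels (arbitrary units), not study data.
tb_levels = [900.0, 1200.0, 450.0, 1500.0]
ltbi_levels = [300.0, 500.0, 200.0, 400.0]

auc = roc_auc(tb_levels, ltbi_levels)
print(auc)  # 15 of the 16 pairs favour the TB group
```

The pairwise loop is quadratic in the group sizes, which is acceptable for cohort-sized data; a rank-based computation gives the same value for larger inputs.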
Transcriptional responses to glucose at different glycolytic rates in Saccharomyces cerevisiae.

PubMed

Elbing, Karin; Ståhlberg, Anders; Hohmann, Stefan; Gustafsson, Lena

2004-12-01

The addition of glucose to Saccharomyces cerevisiae cells causes reprogramming of gene expression. Glucose is sensed by membrane receptors as well as (so far elusive) intracellular sensing mechanisms. The availability of four yeast strains that display different hexose uptake capacities allowed us to study glucose-induced effects at different glycolytic rates. Rapid glucose responses were observed in all strains able to take up glucose, consistent with intracellular sensing. The degree of long-term responses, however, clearly correlated with the glycolytic rate: glucose-stimulated expression of genes encoding enzymes of the lower part of glycolysis showed an almost linear correlation with the glycolytic rate, while expression levels of genes encoding gluconeogenic enzymes and invertase (SUC2) showed an inverse correlation. Glucose control of SUC2 expression is mediated by the Snf1-Mig1 pathway. Mig1 dephosphorylation upon glucose addition is known to lead to repression of target genes. Mig1 was initially dephosphorylated upon glucose addition in all strains able to take up glucose, but remained dephosphorylated only at high glycolytic rates. Remarkably, transient Mig1-dephosphorylation was accompanied by the repression of SUC2 expression at high glycolytic rates, but stimulated SUC2 expression at low glycolytic rates. This suggests that Mig1-mediated repression can be overruled by factors mediating induction via a low glucose signal. At low and moderate glycolytic rates, Mig1 was partly dephosphorylated both in the presence of phosphorylated, active Snf1, and unphosphorylated, inactive Snf1, indicating that Mig1 was actively phosphorylated and dephosphorylated simultaneously, suggesting independent control of both processes. Taken together, it appears that glucose addition affects the expression of SUC2 as well as Mig1 activity by both Snf1-dependent and -independent mechanisms that can now be dissected and resolved as early and late/sustained responses.
Influence of laser on the droplet behavior in short-circuiting, globular, and spray modes of hybrid fiber laser-MIG welding

NASA Astrophysics Data System (ADS)

Cai, Chuang; Feng, Jiecai; Li, Liqun; Chen, Yanbin

2016-09-01

The effects of laser on the droplet behavior in short-circuiting, globular, and spray modes of hybrid fiber laser-MIG welding were studied. Transfer sequence of a droplet, welding current wave and morphology of plasma in the three modes of arc welding and hybrid welding were comparatively investigated. Compared with arc welding, the transfer frequency and landing location of droplet in the three modes of hybrid welding changed. In short-circuiting and globular modes, the droplet transfer was promoted by the laser, while the droplet transfer was hindered by the laser in spray mode. The magnitudes and directions of electromagnetic force and plasma drag force acting on the droplet were the keys to affect the droplet behavior. The magnitudes and directions of electromagnetic force and plasma drag force were converted due to the variation of the current distribution into the droplet, which were caused by the laser induced plasma with low ionization potential.
MIGS-GPU: Microarray Image Gridding and Segmentation on the GPU.

PubMed

Katsigiannis, Stamos; Zacharia, Eleni; Maroulis, Dimitris

2017-05-01

Complementary DNA (cDNA) microarray is a powerful tool for simultaneously studying the expression level of thousands of genes. Nevertheless, the analysis of microarray images remains an arduous and challenging task due to the poor quality of the images that often suffer from noise, artifacts, and uneven background. In this study, the MIGS-GPU [Microarray Image Gridding and Segmentation on Graphics Processing Unit (GPU)] software for gridding and segmenting microarray images is presented. MIGS-GPU's computations are performed on the GPU by means of the compute unified device architecture (CUDA) in order to achieve fast performance and increase the utilization of available system resources. Evaluation on both real and synthetic cDNA microarray images showed that MIGS-GPU provides better performance than state-of-the-art alternatives, while the proposed GPU implementation achieves significantly lower computational times compared to the respective CPU approaches. Consequently, MIGS-GPU can be an advantageous and useful tool for biomedical laboratories, offering a user-friendly interface that requires minimum input in order to run.

The effects of welding parameters on ultra-violet light emissions, ozone and CrVI formation in MIG welding.

PubMed

Dennis, J H; Mortazavi, S B; French, M J; Hewitt, P J; Redding, C R

1997-01-01

This paper describes the relationships between ultra-violet emission, ozone generation and CrVI production in MIG welding which were measured as a function of shield gas flow rate, welding voltage, electrode stick-out and shield gas composition using an automatic welding rig that permitted MIG welding under reproducible conditions. The experimental results are interpreted in terms of the physico-chemical processes occurring in the micro- and macro-environments of the arc as part of research into process modification to reduce occupational exposure to ozone and CrVI production rates in MIG welding. We believe the techniques described here, and in particular the use of what we have termed u.v.-ozone measurements, will prove useful in further study of ozone generation and CrVI formation and may be applied in the investigation of engineering control of occupational exposure in MIG and other welding process such as Manual Metal Arc (MMA) and Tungsten Inert Gas (TIG).
Interleukin-18, Interferon-γ, IP-10, and Mig Expression in Epstein-Barr Virus-Induced Infectious Mononucleosis and Posttransplant Lymphoproliferative Disease

PubMed Central

Setsuda, Joyce; Teruya-Feldstein, Julie; Harris, Nancy L.; Ferry, Judith A.; Sorbara, Lynn; Gupta, Ghanshyam; Jaffe, Elaine S.; Tosato, Giovanna

1999-01-01

T cell immunodeficiency plays an important role in the pathogenesis of posttransplant lymphoproliferative disease (PTLD) by permitting the unbridled expansion of Epstein-Barr virus (EBV)-infected B lymphocytes. However, factors other than T cell function may contribute to PTLD pathogenesis because PTLD infrequently develops even in the context of severe T cell immunodeficiency, and athymic mice that are T-cell-immunodeficient can reject EBV-immortalized cells. Here we report that PTLD tissues express significantly lower levels of IL-18, interferon-γ (IFN-γ), Mig, and RANTES compared to lymphoid tissues diagnosed with acute EBV-induced infectious mononucleosis, as assessed by semiquantitative RT-PCR analysis. Other cytokines and chemokines are expressed at similar levels. Immunohistochemistry confirmed that PTLD tissues contain less IL-18 and Mig protein than tissues with infectious mononucleosis. IL-18, primarily a monocyte product, promotes the secretion of IFN-γ, which stimulates Mig and RANTES expression. Both IL-18 and Mig display antitumor activity in mice involving inhibition of angiogenesis. These results document greater expression of IL-18, IFN-γ, Mig, and RANTES in lymphoid tissues with acute EBV-induced infectious mononucleosis compared to tissues with PTLD and raise the possibility that these mediators participate in critical host responses to EBV infection. PMID:10393857
Evaluation of Cathode Heater Assembly for 42 GHz, 200 kW Gyrotron

NASA Astrophysics Data System (ADS)

Sharma, S. K.; Singh, Narendra Kumar; Singh, Udaybir; Khatun, Hasina; Kumar, Nitin; Alaria, M. K.; Raju, R. S.; Jain, P. K.; Sinha, A. K.

2014-09-01

In this paper, the evaluation of the cathode-heater assembly of a magnetron injection gun (MIG) for a 42 GHz, 200 kW gyrotron is presented. The cathode-heater assembly was purchased from M/S SEMICON. The assembly is experimentally studied in three different conditions: in a bell jar system, during vacuum processing of the MIG, and during MIG testing, to ensure the required rise of cathode surface temperature for pre-set heater power.
The Metadata Coverage Index (MCI): A standardized metric for quantifying database metadata richness.

PubMed

Liolios, Konstantinos; Schriml, Lynn; Hirschman, Lynette; Pagani, Ioanna; Nosrat, Bahador; Sterk, Peter; White, Owen; Rocca-Serra, Philippe; Sansone, Susanna-Assunta; Taylor, Chris; Kyrpides, Nikos C; Field, Dawn

2012-07-30

Variability in the extent of the descriptions of data ('metadata') held in public repositories forces users to assess the quality of records individually, which rapidly becomes impractical. The scoring of records on the richness of their description provides a simple, objective proxy measure for quality that enables filtering that supports downstream analysis. Pivotally, such descriptions should spur on improvements. Here, we introduce such a measure - the 'Metadata Coverage Index' (MCI): the percentage of available fields actually filled in a record or description. MCI scores can be calculated across a database, for individual records or for their component parts (e.g., fields of interest). There are many potential uses for this simple metric: for example; to filter, rank or search for records; to assess the metadata availability of an ad hoc collection; to determine the frequency with which fields in a particular record type are filled, especially with respect to standards compliance; to assess the utility of specific tools and resources, and of data capture practice more generally; to prioritize records for further curation; to serve as performance metrics of funded projects; or to quantify the value added by curation. Here we demonstrate the utility of MCI scores using metadata from the Genomes Online Database (GOLD), including records compliant with the 'Minimum Information about a Genome Sequence' (MIGS) standard developed by the Genomic Standards Consortium. We discuss challenges and address the further application of MCI scores; to show improvements in annotation quality over time, to inform the work of standards bodies and repository providers on the usability and popularity of their products, and to assess and credit the work of curators. Such an index provides a step towards putting metadata capture practices and in the future, standards compliance, into a quantitative and objective framework.
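The definition in the record above (the percentage of available fields actually filled) can be sketched in a few lines of Python. This is a minimal illustration, not the authors' implementation: the field names and sample records are hypothetical, treating `None` and blank strings as unfilled is an assumed convention, and averaging per-record scores is only one assumed way to score a whole database.

```python
from typing import Any, Mapping, Sequence


def is_filled(value: Any) -> bool:
    """Count a field as filled unless it is None or a blank string (assumed convention)."""
    if value is None:
        return False
    if isinstance(value, str) and not value.strip():
        return False
    return True


def record_mci(record: Mapping[str, Any], fields: Sequence[str]) -> float:
    """Percentage of the available fields that are actually filled in one record."""
    if not fields:
        raise ValueError("at least one field is required")
    filled = sum(1 for name in fields if is_filled(record.get(name)))
    return 100.0 * filled / len(fields)


def database_mci(records: Sequence[Mapping[str, Any]], fields: Sequence[str]) -> float:
    """Mean of the per-record scores (an assumed aggregation across a database)."""
    if not records:
        raise ValueError("at least one record is required")
    return sum(record_mci(r, fields) for r in records) / len(records)


def field_fill_rate(records: Sequence[Mapping[str, Any]], field: str) -> float:
    """Percentage of records in which one field of interest is filled."""
    if not records:
        raise ValueError("at least one record is required")
    return 100.0 * sum(1 for r in records if is_filled(r.get(field))) / len(records)


# Hypothetical field list and records, for illustration only.
FIELDS = ["investigation_type", "project_name", "collection_date", "lat_lon"]
RECORDS = [
    {"investigation_type": "virus", "project_name": "Phage A",
     "collection_date": "", "lat_lon": None},
    {"investigation_type": "virus", "project_name": "Phage B",
     "collection_date": "2009-05-01", "lat_lon": "54.18 7.90"},
]

if __name__ == "__main__":
    for rec in RECORDS:
        print(rec["project_name"], record_mci(rec, FIELDS))
    print("database", database_mci(RECORDS, FIELDS))
    print("collection_date", field_fill_rate(RECORDS, "collection_date"))
```

Scoring per record supports filtering and ranking, while the per-field rate corresponds to the use the abstract describes as determining how often fields in a record type are filled.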
Symbiotic Fungi Control Plant Root Cortex Development through the Novel GRAS Transcription Factor MIG1.

PubMed

Heck, Carolin; Kuhn, Hannah; Heidt, Sven; Walter, Stefanie; Rieger, Nina; Requena, Natalia

2016-10-24

In an approaching scenario of soil nutrient depletion, root association with soil microorganisms can be key for plant health and sustainability [1-3]. Symbiotic arbuscular mycorrhizal (AM) fungi are major players in helping plants growing under nutrient starvation conditions. They provide plants with minerals like phosphate and, furthermore, act as modulators of plant growth altering the root developmental program [4, 5]. However, the precise mechanisms involved in this latter process are not well understood. Here, we show that AM fungi are able to modulate root cortex development in Medicago truncatula by activating a novel GRAS-domain transcription factor, MIG1, that determines the size of cortical root cells. MIG1 expression peaks in arbuscule-containing cells, suggesting a role in cell remodeling during fungal accommodation. Roots ectopically expressing MIG1 become thicker due to an increase in the number and width of cortical cells. This phenotype is fully counteracted by gibberellin (GA) and phenocopied with a GA biosynthesis inhibitor or by expression of a dominant DELLA (Δ18DELLA1) protein. MIG1 downregulation leads to malformed arbuscules, a phenotype rescued by Δ18DELLA1, suggesting that MIG1 intersects with the GA signaling to control cell morphogenesis through DELLA1. DELLA1 was shown to be a central node controlling arbuscule branching [6-8]. Now we provide evidence that, together with MIG1, DELLA1 is responsible for radial cortical cell expansion during arbuscule development. Our data point toward DELLA proteins being not only longitudinal root growth repressors [9] but also positive regulators of cortical radial cell expansion, extending the knowledge of how DELLAs control root growth. Copyright © 2016 Elsevier Ltd. All rights reserved.
[Study on the arc spectral information for welding quality diagnosis].

PubMed

Li, Zhi-Yong; Gu, Xiao-Yan; Li, Huan; Yang, Li-Jun

2009-03-01

By collecting the spectral signals of TIG and MIG welding arcs with a spectrometer, the arc light radiation was analyzed based on the basic theory of plasma physics. The radiation of the welding arc is distributed over a broad range of frequencies, from infrared to ultraviolet. The arc spectrum is composed of line spectra and continuous spectra. Due to the variation of metal density in the welding arc, there is a great difference between the welding arc spectra of TIG and MIG in both their intensity and distribution. The MIG welding arc provides more metal line spectra, and its radiation intensity is greater than that of TIG. The arc spectrum of TIG welding is stable during the welding process; disturbance factors that cause spectral variations are reflected by the spectral lines of the corresponding element entering the welding arc. The arc spectrum of MIG welding fluctuates severely due to droplet transfer, which produces "noise" in the line spectrum aggregation zone. So for MIG welding, a spectral zone lacking spectral lines is suitable for welding quality diagnosis. According to the characteristics of TIG and MIG, special spectral zones were selected for welding quality diagnosis. For TIG welding, the selected zone is in the ultraviolet (230-300 nm). For MIG welding, the selected zone is in the visible (570-590 nm). With the basic theory provided for welding quality diagnosis, the integral intensity of the spectral signal in the selected zone during welding with a disturbing factor was studied to prove the theory. The results show that welding quality and disturbance factors can be diagnosed with a good signal-to-noise ratio in the selected spectral zone compared with the signal in other spectral zones. The spectral signal can be used for real-time diagnosis of welding quality.
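The "integral intensity of spectral signal in the selected zone" mentioned above can be illustrated with a short Python sketch. The abstract does not state how the integral was computed, so the trapezoidal rule used here is an assumption, as are the function name and the sample data; only the band limits (230-300 nm for TIG, 570-590 nm for MIG) come from the record.

```python
from typing import Sequence, Tuple

# Band limits in nm, as quoted in the abstract.
TIG_BAND: Tuple[float, float] = (230.0, 300.0)
MIG_BAND: Tuple[float, float] = (570.0, 590.0)


def band_integral(wavelengths_nm: Sequence[float],
                  intensities: Sequence[float],
                  band: Tuple[float, float]) -> float:
    """Integrate intensity over a wavelength band with the trapezoidal rule.

    Assumes wavelengths are sorted in ascending order. Only samples that
    fall inside the band are used; no interpolation is done at the edges.
    """
    if len(wavelengths_nm) != len(intensities):
        raise ValueError("wavelengths and intensities must have equal length")
    lo, hi = band
    points = [(w, i) for w, i in zip(wavelengths_nm, intensities) if lo <= w <= hi]
    if len(points) < 2:
        raise ValueError("need at least two samples inside the band")
    total = 0.0
    for (w0, i0), (w1, i1) in zip(points, points[1:]):
        total += 0.5 * (i0 + i1) * (w1 - w0)
    return total


if __name__ == "__main__":
    # Hypothetical spectrum sampled every 5 nm with a flat intensity of 2.0.
    wl = [560.0 + 5.0 * k for k in range(9)]  # 560 ... 600 nm
    flat = [2.0] * len(wl)
    print(band_integral(wl, flat, MIG_BAND))
```

Tracking this single number per acquired spectrum over time is one simple way such a band-limited signal could feed a real-time monitor.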
Update on Minimally Invasive Glaucoma Surgery (MIGS) and New Implants

PubMed Central

Brandão, Lívia M.; Grieshaber, Matthias C.

2013-01-01

Traditional glaucoma surgery has been challenged by the advent of innovative techniques and new implants in the past few years. There is an increasing demand for safer glaucoma surgery offering patients a timely surgical solution in reducing intraocular pressure (IOP) and improving their quality of life. The new procedures and devices aim to lower IOP with a higher safety profile than fistulating surgery (trabeculectomy/drainage tubes) and are collectively termed “minimally invasive glaucoma surgery (MIGS).” The main advantage of MIGS is that they are nonpenetrating and/or bleb-independent procedures, thus avoiding the major complications of fistulating surgery related to blebs and hypotony. In this review, the clinical results of the latest techniques and devices are presented by their approach, ab interno (trabeculotomy, excimer laser trabeculotomy, trabecular microbypass, suprachoroidal shunt, and intracanalicular scaffold) and ab externo (canaloplasty, Stegmann Canal Expander, suprachoroidal Gold microshunt). The drawback of MIGS is that some of these procedures produce a limited IOP reduction compared to trabeculectomy. Currently, MIGS is performed in glaucoma patients with early to moderate disease and preferably in combination with cataract surgery. PMID:24369494
Numerical Simulation of MIG for 42 GHz, 200 kW Gyrotron

NASA Astrophysics Data System (ADS)

Singh, Udaybir; Bera, Anirban; Kumar, Narendra; Purohit, L. P.; Sinha, Ashok K.

2010-06-01

A triode type magnetron injection gun (MIG) of a 42 GHz, 200 kW gyrotron for an Indian TOKAMAK system is designed by using the commercially available code EGUN. The operating voltages of the modulating anode and the accelerating anode are 29 kV and 65 kV respectively. The operating mode of the gyrotron is TE03 and it is operated in fundamental harmonic. The simulated results of MIG obtained with the EGUN code are validated with another trajectory code TRAK.
Survey report: control technology for autobody repair and painting shops at Church Brother's Collision Repair, Greenwood, Indiana, October 10-11, 1991

DOE Office of Scientific and Technical Information (OSTI.GOV)

Heitbrink, W.A.; Cooper, T.C.; Edmonds, M.A.

1992-03-01

A study was made to evaluate and document the effectiveness of a metal inert gas (MIG) welder with built in ventilation to control potentially hazardous conditions at Church Brother's Collision Repair (SIC-7531), Greenwood, Indiana. Air contaminant exposures were measured during a 1 hour repair job while using a ventilated MIG welder and while using a conventional MIG welder. The ventilation system of the MIG did reduce worker exposure to welding fumes. However, the sampling was done on a single repair job, thus limiting the conclusions which can be drawn from the study. Some welding fumes were not captured by the ventilated welder, suggesting that the MIG with ventilation provided incomplete control of the generated fumes. In some cases the metal on the other side of the welding area became sufficiently hot to generate its own fumes. The car body itself appears to block the capture of these fumes by the ventilated MIG welder. When welding inside the car without the ventilated welder, the fumes generated were more concentrated than those generated by welding outside of the car under similar conditions. There is a decreased dilution of the fumes inside the car due to a lack of air movement. The authors conclude that while the control technique appeared to lessen exposure to welding fumes, additional investigation is needed to verify the data.
Arc/Arg3.1 governs inflammatory dendritic cell migration from the skin and thereby controls T cell activation.

PubMed

Ufer, Friederike; Vargas, Pablo; Engler, Jan Broder; Tintelnot, Joseph; Schattling, Benjamin; Winkler, Hana; Bauer, Simone; Kursawe, Nina; Willing, Anne; Keminer, Oliver; Ohana, Ora; Salinas-Riester, Gabriela; Pless, Ole; Kuhl, Dietmar; Friese, Manuel A

2016-09-23

Skin-migratory dendritic cells (migDCs) are pivotal antigen-presenting cells that continuously transport antigens to draining lymph nodes and regulate immune responses. However, identification of migDCs is complicated by the lack of distinguishing markers, and it remains unclear which molecules determine their migratory capacity during inflammation. We show that, in the skin, the neuronal plasticity molecule activity-regulated cytoskeleton-associated protein/activity-regulated gene 3.1 (Arc/Arg3.1) was strictly confined to migDCs. Mechanistically, Arc/Arg3.1 was required for accelerated DC migration during inflammation because it regulated actin dynamics through nonmuscle myosin II. Accordingly, Arc/Arg3.1-dependent DC migration was critical for mounting T cell responses in experimental autoimmune encephalomyelitis and allergic contact dermatitis. Thus, Arc/Arg3.1 was restricted to migDCs in the skin and drove fast DC migration by exclusively coordinating cytoskeletal changes in response to inflammatory challenges. These findings commend Arc/Arg3.1 as a universal switch in migDCs that may be exploited to selectively modify immune responses. Copyright © 2016, American Association for the Advancement of Science.
Transmembrane proteins UNC-40/DCC, PTP-3/LAR, and MIG-21 control anterior-posterior neuroblast migration with left-right functional asymmetry in Caenorhabditis elegans.

PubMed

Sundararajan, Lakshmi; Lundquist, Erik A

2012-12-01

Migration of neurons and neural crest cells is of central importance to the development of nervous systems. In Caenorhabditis elegans, the QL neuroblast on the left migrates posteriorly, and QR on the right migrates anteriorly, despite similar lineages and birth positions with regard to the left-right axis. Initial migration is independent of a Wnt signal that controls later anterior-posterior Q descendant migration. Previous studies showed that the transmembrane proteins UNC-40/DCC and MIG-21, a novel thrombospondin type I repeat containing protein, act redundantly in left-side QL posterior migration. Here we show that the LAR receptor protein tyrosine phosphatase PTP-3 acts with MIG-21 in parallel to UNC-40 in QL posterior migration. We also show that in right-side QR, the UNC-40 and PTP-3/MIG-21 pathways mutually inhibit each other's role in posterior migration, allowing anterior QR migration. Finally, we present evidence that these proteins act autonomously in the Q neuroblasts. These studies indicate an inherent left-right asymmetry in the Q neuroblasts with regard to UNC-40, PTP-3, and MIG-21 function that results in posterior vs. anterior migration.
MIG1-dependent and MIG1-independent regulation of GAL gene expression in Saccharomyces cerevisiae: role of Imp2p.

PubMed

Alberti, Adriana; Lodi, Tiziana; Ferrero, Iliana; Donnini, Claudia

2003-10-15

Imp2p (Yil154c) is a transcriptional activator involved in glucose derepression of the maltose, galactose and raffinose utilization pathways and in resistance to thermal, oxidative or osmotic stress. We analysed the role of Imp2 in the regulation of GAL genes. Imp2 was shown to have a positive effect on glucose derepression of Leloir pathway genes and their activator gene GAL4. The effect of Imp2 on galactose metabolism was shown to be partially dependent on Mig1p. The Mig1-independent role depends on Nrg1p. However, disruption of both MIG1 and NRG1 only partially relieves the glucose repression of GAL genes in the Δimp2 mutant, indicating that Imp2 must also have other function(s). Moreover, the interaction between IMP2 and GAL6/BLH1, a recently isolated gene involved in the regulation of GAL genes that shares with Imp2 the ability to protect cells from the glycopeptide bleomycin, was also analysed. The results suggest a major role of Imp2 in a GAL6-independent pathway. Copyright 2003 John Wiley & Sons, Ltd.
Research on Novel High-Power Microwave/Millimeter Wave Sources and Applications

DTIC Science & Technology

2010-08-28

density with acceptable operating temperature and lifetime. The MIG is optimized with the EGUN code for a cathode voltage Vb of 100 kV and a beam...emission suppression. Figure 2 is an EGUN drawing of the MIG configuration/dimensions and electron trajectories. The design is flexible TABLE I. Predicted...and measured MIG parameters. EGUN prediction smooth cathode Measurement Voltage kV 100.0 100.0 Current A 8.0 8.0 0 1.40 1.40 vz /vz0 3.5% 4.6
Safety and feasibility of minimally invasive gastrectomy during the early introduction in the Netherlands: short-term oncological outcomes comparable to open gastrectomy.

PubMed

Brenkman, H J F; Ruurda, J P; Verhoeven, R H A; van Hillegersberg, R

2017-09-01

Minimally invasive techniques for gastric cancer surgery have recently been introduced in the Netherlands, based on a proctoring program. The aim of this population-based cohort study was to evaluate the short-term oncological outcomes of minimally invasive gastrectomy (MIG) during its introduction in the Netherlands. The Netherlands Cancer Registry identified all patients with gastric adenocarcinoma who underwent gastrectomy with curative intent between 2010 and 2014. Multivariable analysis was performed to compare MIG and open gastrectomy (OG) on lymph node yield (≥15), R0 resection rate, and 1-year overall survival. The pooled learning curve per center of MIG was evaluated by groups of five subsequent procedures. Between 2010 and 2014, a total of 277 (14%) patients underwent MIG and 1633 (86%) patients underwent OG. During this period, the use of MIG and neoadjuvant chemotherapy increased from 4% to 39% (p < 0.001) and from 47% to 62% (p < 0.001), respectively. The median lymph node yield increased from 12 to 20 (p < 0.001), and the R0 resection rate remained stable, from 86% to 91% (p = 0.080). MIG and OG had a comparable lymph node yield (OR, 1.01; 95% CI, 0.75-1.36), R0 resection rate (OR, 0.86; 95% CI, 0.54-1.37), and 1-year overall survival (HR, 0.99; 95% CI, 0.75-1.32). A pooled learning curve of ten procedures was demonstrated for MIG, after which the conversion rate (13%-2%; p = 0.001) and lymph node yield were at a desired level (18-21; p = 0.045). With a proctoring program, the introduction of minimally invasive gastrectomy in Western countries is feasible and can be performed safely.
C3 glomerulopathy associated with monoclonal Ig is a distinct subtype.

PubMed

Ravindran, Aishwarya; Fervenza, Fernando C; Smith, Richard J H; Sethi, Sanjeev

2018-05-02

Monoclonal immunoglobulins (MIg) may play a causal role in C3 glomerulopathy (C3G) by impairing regulation of the alternative pathway of complement. Ninety-five patients with C3G were tested for MIg of which 36 were positive. Their mean age at diagnosis was 60 years and among patients 50 years and older, 65.1% had a MIg. At presentation, median serum creatinine and proteinuria were 1.9 mg/dL and 3.0 g/24 hours. Hematuria was present in 32 (88.9%) patients. Twelve (34.3%) patients had low C3 levels. C3 nephritic factor was detected in 45.8% patients; pathogenic variants in complement protein genes were rare. Hematologic evaluation revealed monoclonal gammopathy of renal significance in 26 patients, multiple myeloma in five, smoldering multiple myeloma in two, and chronic lymphocytic leukemia, lymphoma, or type I cryoglobulin each in one patient. After a median follow-up of 43.6 months, the median serum creatinine and proteinuria were 1.4 mg/dL and 0.8 g/24 hours. Nine patients developed ESRD. Sixteen patients received MIg-targeted treatment, 17 patients received non-targeted treatment while three patients were managed conservatively. Of the 16 patients receiving MIg-targeted treatment, ten achieved complete/very good/partial hematologic response. Of these, seven achieved a complete/partial/stable renal response. Five patients receiving targeted treatment did not achieve hematologic response, none had a renal response. Patients receiving targeted treatment were more likely to have multiple myeloma/smoldering multiple myeloma. Patients receiving non-targeted treatment were more likely to have monoclonal gammopathy of renal significance. Thus, C3G with MIg is seen in older patients, C3 nephritic factor is the most common autoantibody detected, and MIg-targeted treatment may result in remission and stabilization of kidney function in a subset of these patients. Copyright © 2018 International Society of Nephrology. Published by Elsevier Inc. All rights reserved.
A Nonlinear Mixed Effects Approach for Modeling the Cell-To-Cell Variability of Mig1 Dynamics in Yeast

PubMed Central

Almquist, Joachim; Bendrioua, Loubna; Adiels, Caroline Beck; Goksör, Mattias; Hohmann, Stefan; Jirstrand, Mats

2015-01-01

The last decade has seen a rapid development of experimental techniques that allow data collection from individual cells. These techniques have enabled the discovery and characterization of variability within a population of genetically identical cells. Nonlinear mixed effects (NLME) modeling is an established framework for studying variability between individuals in a population, frequently used in pharmacokinetics and pharmacodynamics, but its potential for studies of cell-to-cell variability in molecular cell biology is yet to be exploited. Here we take advantage of this novel application of NLME modeling to study cell-to-cell variability in the dynamic behavior of the yeast transcription repressor Mig1. In particular, we investigate a recently discovered phenomenon where Mig1 during a short and transient period exits the nucleus when cells experience a shift from high to intermediate levels of extracellular glucose. A phenomenological model based on ordinary differential equations describing the transient dynamics of nuclear Mig1 is introduced, and according to the NLME methodology the parameters of this model are in turn modeled by a multivariate probability distribution. Using time-lapse microscopy data from nearly 200 cells, we estimate this parameter distribution according to the approach of maximizing the population likelihood. Based on the estimated distribution, parameter values for individual cells are furthermore characterized and the resulting Mig1 dynamics are compared to the single cell times-series data. The proposed NLME framework is also compared to the intuitive but limited standard two-stage (STS) approach. We demonstrate that the latter may overestimate variabilities by up to almost five fold. Finally, Monte Carlo simulations of the inferred population model are used to predict the distribution of key characteristics of the Mig1 transient response. 
We find that with decreasing levels of post-shift glucose, the transient response of Mig1 tends to be faster and more extended, and displays an increased cell-to-cell variability. PMID:25893847
Transgenic Expression of Bcl-xL or Bcl-2 by Murine B Cells Enhances the In Vivo Antipolysaccharide, but Not Antiprotein, Response to Intact Streptococcus pneumoniae

DTIC Science & Technology

2007-01-01

primary IgG anti-PS, vs antiprotein, response and a greater dependence on B cell membrane Ig (mIg) signaling, mediated by Bruton’s tyrosine kinase (Btk...WT, wild type; TI, T cell independent; TD, T cell dependent; Pn, Streptococcus pneumoniae; mIg, membrane Ig; Btk, Bruton’s tyrosine kinase; GC...were more dependent than antiprotein responses, on Btk-dependent mIg FIGURE 2. Anti-PS responses to PPS14-PspA C-PS-PspA conjugate vaccine or to
Transmembrane Proteins UNC-40/DCC, PTP-3/LAR, and MIG-21 Control Anterior–Posterior Neuroblast Migration with Left–Right Functional Asymmetry in Caenorhabditis elegans

PubMed Central

Sundararajan, Lakshmi; Lundquist, Erik A.

2012-01-01

Migration of neurons and neural crest cells is of central importance to the development of nervous systems. In Caenorhabditis elegans, the QL neuroblast on the left migrates posteriorly, and QR on the right migrates anteriorly, despite similar lineages and birth positions with regard to the left–right axis. Initial migration is independent of a Wnt signal that controls later anterior–posterior Q descendant migration. Previous studies showed that the transmembrane proteins UNC-40/DCC and MIG-21, a novel thrombospondin type I repeat containing protein, act redundantly in left-side QL posterior migration. Here we show that the LAR receptor protein tyrosine phosphatase PTP-3 acts with MIG-21 in parallel to UNC-40 in QL posterior migration. We also show that in right-side QR, the UNC-40 and PTP-3/MIG-21 pathways mutually inhibit each other’s role in posterior migration, allowing anterior QR migration. Finally, we present evidence that these proteins act autonomously in the Q neuroblasts. These studies indicate an inherent left–right asymmetry in the Q neuroblasts with regard to UNC-40, PTP-3, and MIG-21 function that results in posterior vs. anterior migration. PMID:23051647
Regulation of the ErbB network by the MIG6 feedback loop in physiology, tumor suppression and responses to oncogene-targeted therapeutics.

PubMed

Anastasi, Sergio; Lamberti, Dante; Alemà, Stefano; Segatto, Oreste

2016-02-01

The ErbB signaling network instructs the execution of key cellular programs, such as cell survival, proliferation and motility, through the generation of robust signals of defined strength and duration. In contrast, unabated ErbB signaling disrupts tissue homeostasis and leads to cell transformation. Cells oppose the threat inherent in excessive ErbB activity through several mechanisms of negative feedback regulation. Inducible feedback inhibitors (IFIs) are expressed in the context of transcriptional responses triggered by ErbB signaling, thus being uniquely suited to regulate ErbB activity during the execution of complex cellular programs. This review focuses on MIG6, an IFI that restrains ErbB signaling by mediating ErbB kinase suppression and receptor down-regulation. We will review key issues in MIG6 function, regulation and tumor suppressor activity. Subsequently, the role for MIG6 loss in the pathogenesis of tumors driven by ErbB oncogenes as well as in the generation of cellular addiction to ErbB signaling will be discussed. We will conclude by analyzing feedback inhibition by MIG6 in the context of therapies directed against ErbB and non-ErbB oncogenes. Copyright © 2015 Elsevier Ltd. All rights reserved.
The β subunit of yeast AMP-activated protein kinase directs substrate specificity in response to alkaline stress.

PubMed

Chandrashekarappa, Dakshayini G; McCartney, Rhonda R; O'Donnell, Allyson F; Schmidt, Martin C

2016-12-01

Saccharomyces cerevisiae express three isoforms of Snf1 kinase that differ by which β subunit is present, Gal83, Sip1 or Sip2. Here we investigate the abundance, activation, localization and signaling specificity of the three Snf1 isoforms. The relative abundance of these isoforms was assessed by quantitative immunoblotting using two different protein extraction methods and by fluorescence microscopy. The Gal83 containing isoform is the most abundant in all assays while the abundance of the Sip1 and Sip2 isoforms is typically underestimated especially in glass-bead extractions. Earlier studies to assess Snf1 isoform function utilized gene deletions as a means to inactivate specific isoforms. Here we use point mutations in Gal83 and Sip2 and a 17 amino acid C-terminal truncation of Sip1 to inactivate specific isoforms without affecting their abundance or association with the other subunits. The effect of low glucose and alkaline stresses was examined for two Snf1 phosphorylation substrates, the Mig1 and Mig2 proteins. Any of the three isoforms was capable of phosphorylating Mig1 in response to glucose stress. In contrast, the Gal83 isoform of Snf1 was both necessary and sufficient for the phosphorylation of the Mig2 protein in response to alkaline stress. Alkaline stress led to the activation of all three isoforms yet only the Gal83 isoform translocates to the nucleus and phosphorylates Mig2. Deletion of the SAK1 gene blocked nuclear translocation of Gal83 and signaling to Mig2. These data strongly support the idea that Snf1 signaling specificity is mediated by localization of the different Snf1 isoforms. Copyright © 2016 Elsevier Inc. All rights reserved.

The β subunit of yeast AMP-activated protein kinase directs substrate specificity in response to alkaline stress

PubMed Central

Chandrashekarappa, Dakshayini G.; McCartney, Rhonda R.; O’Donnell, Allyson F.; Schmidt, Martin C.

2016-01-01

Saccharomyces cerevisiae express three isoforms of Snf1 kinase that differ by which β subunit is present, Gal83, Sip1 or Sip2. Here we investigate the abundance, activation, localization and signaling specificity of the three Snf1 isoforms. The relative abundance of these isoforms was assessed by quantitative immunoblotting using two different protein extraction methods and by fluorescence microscopy. The Gal83 containing isoform is the most abundant in all assays while the abundance of the Sip1 and Sip2 isoforms is typically underestimated especially in glass-bead extractions. Earlier studies to assess Snf1 isoform function utilized gene deletions as a means to inactivate specific isoforms. Here we use point mutations in Gal83 and Sip2 and a 17 amino acid C-terminal truncation of Sip1 to inactivate specific isoforms without affecting their abundance or association with the other subunits. The effect of low glucose and alkaline stresses was examined for two Snf1 phosphorylation substrates, the Mig1 and Mig2 proteins. Any of the three isoforms was capable of phosphorylating Mig1 in response to glucose stress. In contrast, the Gal83 isoform of Snf1 was both necessary and sufficient for the phosphorylation of the Mig2 protein in response to alkaline stress. Alkaline stress led to the activation of all three isoforms yet only the Gal83 isoform translocates to the nucleus and phosphorylates Mig2. Deletion of the SAK1 gene blocked nuclear translocation of Gal83 and signaling to Mig2. These data strongly support the idea that Snf1 signaling specificity is mediated by localization of the different Snf1 isoforms. PMID:27592031
Analysis of WC/Ni-Based Coatings Deposited by Controlled Short-Circuit MIG Welding

NASA Astrophysics Data System (ADS)

Vespa, P.; Pinard, P. T.; Gauvin, R.; Brochu, M.

2012-06-01

This study investigates the recently developed controlled short-circuit metal inert gas (CSC-MIG) welding system for depositing WC/Ni-based claddings on carbon steel substrates. WC/Ni-based coatings deposited by CSC-MIG were analyzed by optical light microscopy and scanning electron microscopy (SEM) equipped with energy dispersive spectroscopy (EDS) and electron backscatter diffraction (EBSD) capabilities. X-ray diffraction (XRD) and hardness measurements of depositions are also reported. The CSC-MIG welding system provides a significant amount of user control over the current waveform during welding and has lower heat input when compared with traditional MIG welding. Heat input for the analyzed coatings ranged from 10.1 to 108.7 J/mm. Metallurgically bonded coatings free from spatter and with 0.75% average porosity were produced. It was found that the detrimental decarburization of the WC particles seen in thermal spray systems does not occur when welding with the CSC-MIG. Precipitation of a reaction layer around the reinforcing phase was identified as WC; the average thickness of which increases from 3.8 to 7.2 μm for the low and high heat input condition, respectively. Precipitation of newly formed WC particles was observed; their size distribution increased from D50 of 2.4 μm in the low heat input weldment to 6.75 μm in the high heat input weldment. The level of dilution of the reinforcing phase increases significantly with heat input. The hardness of the deposited coatings decreases from 587 HV10 to 410 HV10 when the energy input was increased from 10.1 to 108.7 J/mm.
IFN-γ, IL-2, IP-10, and MIG as Biomarkers of Exposure to Leishmania spp., and of Cure in Human Visceral Leishmaniasis.

PubMed

Ibarra-Meneses, Ana V; Ghosh, Prakash; Hossain, Faria; Chowdhury, Rajashree; Mondal, Dinesh; Alvar, Jorge; Moreno, Javier; Carrillo, Eugenia

2017-01-01

New biomarkers are needed for monitoring the effectiveness of treatment for visceral leishmaniasis (VL). They might also improve the detection of the asymptomatic population in Leishmania-endemic areas. This paper examines the IL-2, IFN-γ, IFN-γ-induced protein 10 (IP-10), and monokine-induced-by-IFN-γ (MIG) levels in whole blood, stimulated in vitro with soluble Leishmania antigen (SLA), taken from asymptomatic individuals and patients treated for VL living in a post-outbreak (Leishmania infantum) area in Spain, and in an endemic (Leishmania donovani) area of Bangladesh. IP-10 was found to be an accurate global marker of asymptomatic subjects with positive cellular/humoral tests, while MIG was found to be a better marker of contact with L. donovani than IL-2 but not for those with L. infantum. Determining IP-10, MIG, and IFN-γ levels proved useful in monitoring the cellular immune response following treatment for active disease caused by L. infantum.
IFN-γ, IL-2, IP-10, and MIG as Biomarkers of Exposure to Leishmania spp., and of Cure in Human Visceral Leishmaniasis

PubMed Central

Ibarra-Meneses, Ana V.; Ghosh, Prakash; Hossain, Faria; Chowdhury, Rajashree; Mondal, Dinesh; Alvar, Jorge; Moreno, Javier; Carrillo, Eugenia

2017-01-01

New biomarkers are needed for monitoring the effectiveness of treatment for visceral leishmaniasis (VL). They might also improve the detection of the asymptomatic population in Leishmania-endemic areas. This paper examines the IL-2, IFN-γ, IFN-γ-induced protein 10 (IP-10), and monokine-induced-by-IFN-γ (MIG) levels in whole blood (stimulated in vitro with soluble Leishmania antigen, SLA) taken from asymptomatic individuals and patients treated for VL living in a post-outbreak (Leishmania infantum) area in Spain, and in an endemic (Leishmania donovani) area of Bangladesh. IP-10 was found to be an accurate global marker of asymptomatic subjects with positive cellular/humoral tests, while MIG was found to be a better marker than IL-2 of contact with L. donovani, but not of contact with L. infantum. Determining IP-10, MIG, and IFN-γ levels proved useful in monitoring the cellular immune response following treatment for active disease caused by L. infantum. PMID:28620584
Moderate volume of high relative training intensity produces greater strength gains compared with low and high volumes in competitive weightlifters.

PubMed

González-Badillo, Juan José; Izquierdo, Mikel; Gorostiaga, Esteban M

2006-02-01

The purpose of this study was to examine the effect of 3 volumes of heavy resistance, average relative training intensity (expressed as a percentage of 1 repetition maximum that represented the absolute kilograms lifted divided by the number of repetitions performed) programs on maximal strength (1RM) in Snatch (Sn), Clean & Jerk (C&J), and Squat (Sq). Twenty-nine experienced (>3 years), trained junior weightlifters were randomly assigned into 1 of 3 groups: low-intensity group (LIG; n = 12), moderate-intensity group (MIG; n = 9), and high-intensity group (HIG; n = 8). All subjects trained for 10 weeks, 4-5 days a week, in a periodized routine using the same exercises and training volume (expressed as total number of repetitions performed at intensities equal to or greater than 60% of 1RM), but different programmed total repetitions at intensities of >90-100% of 1RM for the entire 10-week period: LIG (46 repetitions), MIG (93 repetitions), and HIG (184 repetitions). During the training period, MIG and LIG showed a significant increase (p < 0.01-0.05) for C&J (10.5% and 3% for MIG and LIG, respectively) and Sq (9.5% and 5.3% for MIG and LIG, respectively), whereas in HIG the increase took place only in Sq (6.9%, p < 0.05). A calculation of effect sizes revealed greater strength gains in the MIG than in HIG or LIG. There were no significant differences between LIG and HIG training volume-induced strength gains. All the subjects in HIG were unable to fully accomplish the repetitions programmed at relative intensities greater than 90% of 1RM. The present results indicate that short-term resistance training using moderate volumes of high relative intensity tended to produce higher enhancements in weightlifting performance compared with low and high volumes of high relative training intensities of equal total volume in experienced, trained young weightlifters. 
Therefore, for the present population of weightlifters, it may be beneficial to use the MIG training protocol to improve the weightlifting program at least in a short-term (10 weeks) cycle of training.
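The definition of average relative training intensity quoted above (kilograms lifted divided by repetitions performed, expressed as a percentage of 1RM) can be sketched in a few lines. This is only an illustrative reading of the abstract's wording; the function names and the example session are hypothetical and do not come from the study.

```python
def average_relative_intensity(sets, one_rm_kg):
    """Average load per repetition as a percentage of 1RM.

    sets: iterable of (load_kg, repetitions) pairs for one exercise.
    """
    total_kg = sum(load * reps for load, reps in sets)
    total_reps = sum(reps for _, reps in sets)
    return 100.0 * (total_kg / total_reps) / one_rm_kg


def reps_above(sets, one_rm_kg, threshold_pct):
    """Count repetitions performed above threshold_pct of 1RM."""
    return sum(reps for load, reps in sets
               if 100.0 * load / one_rm_kg > threshold_pct)


# Hypothetical session for a lifter with a 100 kg 1RM (illustrative numbers only).
session = [(60.0, 5), (80.0, 3), (95.0, 2)]
print(average_relative_intensity(session, 100.0))  # average intensity in % of 1RM
print(reps_above(session, 100.0, 90.0))            # repetitions above 90% of 1RM
```

Counting repetitions above 90% of 1RM separately mirrors how the three groups in the abstract differ: the total volume is the same, but the number of very heavy repetitions is not.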
Numerical Simulation of Single-anode and Double-anode Magnetron Injection Guns for 127.5 GHz 1 MW Gyrotron

NASA Astrophysics Data System (ADS)

Singh, Udaybir; Kumar, Nitin; Kumar, Anil; Purohit, Laxmi Prasad; Sinha, Ashok Kumar

2011-07-01

This paper presents the design of two types of magnetron injection guns (MIGs) for a 1 MW, 127.5 GHz gyrotron. The TE24,8 mode has been chosen as the operating mode. The in-house developed code MIGSYN has been used to estimate the initial gun parameters. The electron trajectory tracing program EGUN and the in-house developed code MIGANS have been used to optimize the single-anode and the double-anode designs for an 80 kV, 40 A MIG. A parametric analysis of the MIG is also presented. The advantages and disadvantages of each configuration are critically examined.
Virtual reality welder training

NASA Astrophysics Data System (ADS)

White, Steven A.; Reiners, Dirk; Prachyabrued, Mores; Borst, Christoph W.; Chambers, Terrence L.

2010-01-01

This document describes the Virtual Reality Simulated MIG Lab (sMIG), a system for Virtual Reality welder training. It is designed to reproduce the experience of metal inert gas (MIG) welding faithfully enough to be used as a teaching tool for beginning welding students. To make the experience as realistic as possible it employs physically accurate and tracked input devices, a real-time welding simulation, real-time sound generation and a 3D display for output. Thanks to being a fully digital system it can go beyond providing just a realistic welding experience by giving interactive and immediate feedback to the student to avoid learning wrong movements from day 1.
Effect of welding process on the microstructure and properties of dissimilar weld joints between low alloy steel and duplex stainless steel

NASA Astrophysics Data System (ADS)

Wang, Jing; Lu, Min-xu; Zhang, Lei; Chang, Wei; Xu, Li-ning; Hu, Li-hua

2012-06-01

To obtain high-quality dissimilar weld joints, the processes of metal inert gas (MIG) welding and tungsten inert gas (TIG) welding for duplex stainless steel (DSS) and low alloy steel were compared in this paper. The microstructure and corrosion morphology of dissimilar weld joints were observed by scanning electron microscopy (SEM); the chemical compositions in different zones were detected by energy-dispersive spectroscopy (EDS); the mechanical properties were measured by microhardness test, tensile test, and impact test; the corrosion behavior was evaluated by polarization curves. Obvious concentration gradients of Ni and Cr exist between the fusion boundary and the type II boundary, where the hardness is much higher. The impact toughness of weld metal by MIG welding is higher than that by TIG welding. The corrosion current density of TIG weld metal is higher than that of MIG weld metal in a 3.5wt% NaCl solution. Galvanic corrosion happens between low alloy steel and weld metal, revealing the weakness of low alloy steel in industrial service. The quality of joints produced by MIG welding is better than that by TIG welding in mechanical performance and corrosion resistance. MIG welding with the filler metal ER2009 is the suitable welding process for dissimilar metals jointing between UNS S31803 duplex stainless steel and low alloy steel in practical application.
Glucose de-repression by yeast AMP-activated protein kinase SNF1 is controlled via at least two independent steps.

PubMed

García-Salcedo, Raúl; Lubitz, Timo; Beltran, Gemma; Elbing, Karin; Tian, Ye; Frey, Simone; Wolkenhauer, Olaf; Krantz, Marcus; Klipp, Edda; Hohmann, Stefan

2014-04-01

The AMP-activated protein kinase, AMPK, controls energy homeostasis in eukaryotic cells but little is known about the mechanisms governing the dynamics of its activation/deactivation. The yeast AMPK, SNF1, is activated in response to glucose depletion and mediates glucose de-repression by inactivating the transcriptional repressor Mig1. Here we show that overexpression of the Snf1-activating kinase Sak1 results, in the presence of glucose, in constitutive Snf1 activation without alleviating glucose repression. Co-overexpression of the regulatory subunit Reg1 of the Glc-Reg1 phosphatase complex partly restores glucose regulation of Snf1. We generated a set of 24 kinetic mathematical models based on dynamic data of Snf1 pathway activation and deactivation. The models that reproduced our experimental observations best featured (a) glucose regulation of both Snf1 phosphorylation and dephosphorylation, (b) determination of the Mig1 phosphorylation status in the absence of glucose by Snf1 activity only and (c) a regulatory step directing active Snf1 to Mig1 under glucose limitation. Hence it appears that glucose de-repression via Snf1-Mig1 is regulated by glucose via at least two independent steps: the control of activation of the Snf1 kinase and directing active Snf1 to inactivating its target Mig1. © 2014 FEBS.
The porosity formation mechanism in the laser-MIG hybrid welded joint of Invar alloy

NASA Astrophysics Data System (ADS)

Zhan, Xiaohong; Gao, Qiyu; Gu, Cheng; Sun, Weihua; Chen, Jicheng; Wei, Yanhong

2017-10-01

The porosity formation mechanism in the laser-metal inert gas (MIG) multi-layer hybrid welded (HW) joint of 19.05 mm thick Invar alloy is investigated. The microstructure characteristics and energy dispersive spectroscopy (EDS) are analyzed. Phase identification was conducted with an X-ray diffractometer (XRD). Experimental results show that the generation of porosity is caused by the relatively low laser power in the root pass and low current in the cover pass. It is also indicated that the microstructures of the welded joints are mainly columnar crystal and equiaxial crystal, which are closely related to the porosity formation. The EDS results show that oxygen content is significantly high in the inner wall of the porosity. The XRD results indicate that the BM and the WB of laser-MIG HW are both composed of Fe0.64Ni0.36 and γ-(Fe,Ni). When the weld pool is cooled quickly, [NiO], [FeO] and [MnO] are formed and react with C to generate CO/CO2 gases. The porosity in laser-MIG HW of Invar alloy is of the oxygen-pore type. The root source of metallurgical porosity formation is that the dissolved gases cannot escape sufficiently and thus remain in the weld pool. Furthermore, 99.99% pure argon is recommended as the protective gas in the laser-MIG HW of Invar alloy.
The 630 nm MIG and the vertical neutral wind in the low latitude nighttime thermosphere

NASA Technical Reports Server (NTRS)

Herrero, F. A.; Meriwether, J. W., Jr.

1994-01-01

It is shown that large negative divergences (gradients) in the horizontal neutral wind in the equatorial thermosphere can support downward neutral winds in excess of 20 m/s. With attention to the meridional and vertical winds only, the pressure tendency equation is used to derive the expression $U_{z0} \approx (\partial U_y/\partial y)\,H$ for the vertical wind $U_{z0}$ at the reference altitude for the pressure tendency equation; $H$ is the atmospheric density scale height, and $\partial U_y/\partial y$ is the meridional wind gradient. The velocity gradient associated with the Meridional Intensity Gradient (MIG) of the O($^1$D) emission (630 nm) at low latitudes is used to estimate the vertical neutral wind in the MIG region. Velocity gradients derived from MIG data are about 0.5 (m/s)/km or more, indicating that the MIG region may contain downward neutral winds in excess of 20 m/s. Though direct measurements of the vertical wind are scarce, Fabry-Perot interferometer data of the equatorial F-region above Natal, Brazil, showed downward winds of 30 m/s occurring during a strong meridional wind convergence in 1982. In-situ measurements with the WATS instrument on the DE-2 satellite also show large vertical neutral winds in the equatorial region.
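As a rough order-of-magnitude check of the estimate in this abstract, the vertical wind is the meridional wind gradient multiplied by the scale height. The gradient of 0.5 (m/s)/km is the value quoted in the abstract; the scale height of 40 km is an assumed, illustrative value that the abstract does not state, and only magnitudes are shown.

```latex
% U_{z0}: vertical wind at the reference altitude
% H:      atmospheric density scale height (40 km is an assumed value)
|U_{z0}| \approx \left|\frac{\partial U_y}{\partial y}\right| H
        \approx 0.5\ \frac{\mathrm{m/s}}{\mathrm{km}} \times 40\ \mathrm{km}
        = 20\ \mathrm{m/s}
```

Under this assumption, larger gradients or larger scale heights would give vertical winds above 20 m/s, in line with the abstract's "in excess of 20 m/s".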
Functional Coordination of WAVE and WASP in C. elegans Neuroblast Migration.

PubMed

Zhu, Zhiwen; Chai, Yongping; Jiang, Yuxiang; Li, Wenjing; Hu, Huifang; Li, Wei; Wu, Jia-Wei; Wang, Zhi-Xin; Huang, Shanjin; Ou, Guangshuo

2016-10-24

Directional cell migration is critical for metazoan development. We define two molecular pathways that activate the Arp2/3 complex during neuroblast migration in Caenorhabditis elegans. The transmembrane protein MIG-13/Lrp12 is linked to the Arp2/3 nucleation-promoting factors WAVE or WASP through direct interactions with ABL-1 or SEM-5/Grb2, respectively. WAVE mutations partially impaired F-actin organization and decelerated cell migration, and WASP mutations did not inhibit cell migration but enhanced migration defects in WAVE-deficient cells. Purified SEM-5 and MIG-2 synergistically stimulated the F-actin branching activity of WASP-Arp2/3 in vitro. In GFP knockin animals, WAVE and WASP were largely organized into separate clusters at the leading edge, and the amount of WASP was less than WAVE but could be elevated by WAVE mutations. Our results indicate that the MIG-13-WAVE pathway provides the major force for directional cell motility, whereas MIG-13-WASP partially compensates for its loss, underscoring their coordinated activities in facilitating robust cell migration. Copyright © 2016 Elsevier Inc. All rights reserved.
A stone extraction facilitation device to achieve an improved technique for performing LCBDE.

PubMed

Wenner, D E; Whitwam, P; Rosser, J; Hashmi, S; Wenner, D E

2005-01-01

Laparoscopic common bile duct exploration (LCBDE) has proved to be a safe, cost-effective way to treat common bile duct (CBD) stones. Despite this, LCBDE has not gained widespread adoption by surgeons. The technique has proved difficult to master, and damage to the fragile choledochoscope by grasping forceps and passage through the port valves has been problematic. Cases involving large, impacted, or multiple stones have required conversion to open exploration of CBD. The Multichannel Instrument Guide (MIG) is introduced as a solution for these problems. The MIG is a J-shaped plastic extrusion with three lumens. It is flexible and can be straightened for insertion through a 10-mm port. The MIG facilitates insertion of a flexible 2.8- to 3.2-mm choledochoscope into the CBD. At the same time, additional tools such as balloon or irrigation catheters and lithotripters can be introduced into the CBD. These can be manipulated under video guidance via the choledochoscope. This procedural multitasking allows for a more efficient LCBDE. The authors describe their initial experience using the MIG for 23 patients. Of the 23 patients who underwent LCBDE procedures, 20 had stones in the CBD. Multiple stones were present in 48% of the patients; impacted stones were present in 26% of the patients; and stones larger than 1 cm were present in 26% of the patients. A 95% stone clearance rate was achieved. Difficult cases with large, impacted or multiple stones were resolved using the MIG. Two choledochoscopes were damaged; one during surgery and one during cleaning and storage. The MIG has demonstrated significant advantages over previously described techniques. The device secures biliary tract access and allows procedural multitasking while protecting the delicate and expensive equipment. Subsequently, a simplified technique algorithm can be followed that may encourage more surgeons to adopt the routine performance of LCBDE.
The Fat-like Cadherin CDH-4 Acts Cell-Non-Autonomously in Anterior-Posterior Neuroblast Migration

PubMed Central

Sundararajan, Lakshmi; Norris, Megan L.; Schöneich, Sebastian; Ackley, Brian D.; Lundquist, Erik A.

2014-01-01

Directed migration of neurons is critical in the normal and pathological development of the brain and central nervous system. In C. elegans, the bilateral Q neuroblasts, QR on the right and QL on the left, migrate anteriorly and posteriorly, respectively. Initial protrusion and migration of the Q neuroblasts is autonomously controlled by the transmembrane proteins UNC-40/DCC, PTP-3/LAR, and MIG-21. As QL migrates posteriorly, it encounters an EGL-20/Wnt signal that induces MAB-5/Hox expression that drives QL descendant posterior migration. QR migrates anteriorly away from EGL-20/Wnt and does not activate MAB-5/Hox, resulting in anterior QR descendant migration. A forward genetic screen for new mutations affecting initial Q migrations identified alleles of cdh-4, which caused defects in both QL and QR directional migration similar to unc-40, ptp-3, and mig-21. Previous studies showed that in QL, PTP-3/LAR and MIG-21 act in a pathway in parallel to UNC-40/DCC to drive posterior QL migration. Here we show genetic evidence that CDH-4 acts in the PTP-3/MIG-21 pathway in parallel to UNC-40/DCC to direct posterior QL migration. In QR, the PTP-3/MIG-21 and UNC-40/DCC pathways mutually inhibit each other, allowing anterior QR migration. We report here that CDH-4 acts in both the PTP-3/MIG-21 and UNC-40/DCC pathways in mutual inhibition in QR, and that CDH-4 acts cell-non-autonomously. Interaction of CDH-4 with UNC-40/DCC in QR but not QL represents an inherent left-right asymmetry in the Q cells, the nature of which is not understood. We conclude that CDH-4 might act as a permissive signal for each Q neuroblast to respond differently to anterior-posterior guidance information based upon inherent left-right asymmetries in the Q neuroblasts. PMID:24954154
Efficacy of telephone and mail intervention in patient compliance with antihypertensive drugs in hypertension. ETECUM-HTA study.

PubMed

Márquez Contreras, Emilio; Vegazo García, Onofre; Martel Claros, Nieves; Gil Guillén, Vicente; de la Figuera von Wichmann, Mariano; Casado Martínez, José Joaquín; Fernández, Raúl

2005-01-01

The objective was to study the efficacy of telephone and mail intervention in therapeutic compliance among patients with mild to moderate hypertension. This was a prospective controlled multicenter clinical trial lasting 6 months, conducted in 85 primary care centers in Spain. A total of 636 patients with newly diagnosed or uncontrolled hypertension were included. Interventions: the patients were randomized and distributed among the following groups: (i) control (CG), under routine clinical management; (ii) mail intervention (MIG), who received a mailed message reinforcing compliance and reminding them of the visits (at 15 days, 2 and 4 months); (iii) telephone intervention (TIG), who received a telephone call at 15 days, then at 7 and 15 weeks. Five visits were scheduled, with the measurement of blood pressure and counting of tablets. Compliers were defined as subjects showing 80-110% drug consumption. Calculations were made of mean percentage compliance (MPC) and compliers, mean blood pressure and percentage of controlled subjects. Five hundred and thirty-eight patients completed the study (261 males); 85.5% were compliers (CI = 82.5-88.5; n = 460). The MPC was 95.1+/-19.6% (CI = 93.28-96.92). The CG consisted of 182 individuals, the MIG of 172 and the TIG of 184. Compliers represented 69.2% of the CG (CI = 62.5-75.9%), 91.3% of the MIG (CI = 87.1-95.5%; p = 0.0001) and 96.2% of the TIG (CI = 93.5-98.9%); the final MPC was 89.6+/-15% in CG, 96.6+/-12% in MIG and 99.1+/-26.8% in TIG (p = 0.0001). The percentage of controlled subjects was 47.2% in CG (CI = 40-54.4%), 61.3% in MIG (CI = 54.1-68.5%) and 63.3% in TIG (CI = 56.4-70.2%) (p<0.05). TIG and MIG are effective measures for improving patient compliance in hypertension.
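The complier definition and the overall complier proportion reported above can be reproduced with a short calculation. The abstract does not say how its confidence intervals were computed; the sketch below assumes a 95% normal-approximation (Wald) interval, and the function names are hypothetical.

```python
import math


def is_complier(tablets_taken, tablets_prescribed):
    """Complier per the abstract's definition: 80-110% drug consumption."""
    pct = 100.0 * tablets_taken / tablets_prescribed
    return 80.0 <= pct <= 110.0


def wald_ci(successes, n, z=1.96):
    """Proportion with a normal-approximation (Wald) confidence interval.

    The choice of a Wald interval is an assumption, not stated in the abstract.
    """
    p = successes / n
    half_width = z * math.sqrt(p * (1.0 - p) / n)
    return p, p - half_width, p + half_width


# 460 compliers among the 538 patients who completed the study (from the abstract).
p, lo, hi = wald_ci(460, 538)
print(f"{100 * p:.1f}% (CI {100 * lo:.1f}-{100 * hi:.1f})")  # 85.5% (CI 82.5-88.5)
```

Under this assumption the result matches the reported 85.5% (CI = 82.5-88.5), which suggests, but does not prove, that a similar interval was used in the study.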
Adapted motivational interviewing to improve the uptake of treatment for glaucoma in Nigeria: study protocol for a randomized controlled trial.

PubMed

Abdull, Mohammed M; Gilbert, Clare; McCambridge, Jim; Evans, Jennifer

2014-04-29

Glaucoma is a chronic eye disease associated with irreversible visual loss. In Africa, glaucoma patients often present late, with very advanced disease. One-off procedures, such as laser or surgery, are recommended in Africa because of lack of or poor adherence to medical treatment. However, acceptance of surgery is usually extremely low. To prevent blindness, adherence to treatment needs to improve, using acceptable, replicable and cost-effective interventions. After reviewing the literature and interviewing patients in Bauchi (Nigeria), motivational interviewing (MI) was selected as the intervention for this trial, with adaptation for glaucoma (MIG). MI is designed to strengthen personal motivation for, and commitment to, a specific goal by eliciting and exploring a person's reasons for change within an atmosphere of acceptance and compassion. The aim of this study is to assess whether MIG increases the uptake of laser or surgery amongst glaucoma patients where this is the recommended treatment. The hypothesis is that MIG increases the uptake of treatment. This will be the first trial of MI in Africa. This is a hospital-based, single-centre, randomized controlled trial of MIG plus an information sheet on glaucoma and its treatment (the latter being "standard care") compared with standard care alone for glaucoma patients where the treatment recommended is surgery or laser. Those eligible for the trial are adults aged 17 years and above who live within 200 km of Bauchi with advanced glaucoma where the examining ophthalmologist recommends surgery or laser. After obtaining written informed consent, participants will be randomly allocated to MIG plus standard care, or standard care alone. Motivational interviewing will be delivered in Hausa or English by one of two MIG-trained personnel. One hundred and fifty participants will be recruited to each arm. 
The primary outcome is the proportion of participants undergoing laser or surgery within two months of the date given to re-attend for the procedure. MIG quality will be assessed using the validated MI treatment integrity scale. Motivational interviewing may be an important tool to increase the acceptance of treatment for glaucoma. The approach is potentially scalable and may be useful for other chronic conditions in Africa. ISRCTN79330571 (Controlled-Trials.com).
Adapted motivational interviewing to improve the uptake of treatment for glaucoma in Nigeria: study protocol for a randomized controlled trial

PubMed Central

2014-01-01

Background: Glaucoma is a chronic eye disease associated with irreversible visual loss. In Africa, glaucoma patients often present late, with very advanced disease. One-off procedures, such as laser or surgery, are recommended in Africa because of lack of or poor adherence to medical treatment. However, acceptance of surgery is usually extremely low. To prevent blindness, adherence to treatment needs to improve, using acceptable, replicable and cost-effective interventions. After reviewing the literature and interviewing patients in Bauchi (Nigeria), motivational interviewing (MI) was selected as the intervention for this trial, with adaptation for glaucoma (MIG). MI is designed to strengthen personal motivation for, and commitment to, a specific goal by eliciting and exploring a person’s reasons for change within an atmosphere of acceptance and compassion. The aim of this study is to assess whether MIG increases the uptake of laser or surgery amongst glaucoma patients where this is the recommended treatment. The hypothesis is that MIG increases the uptake of treatment. This will be the first trial of MI in Africa. Methods: This is a hospital-based, single-centre, randomized controlled trial of MIG plus an information sheet on glaucoma and its treatment (the latter being “standard care”) compared with standard care alone for glaucoma patients where the treatment recommended is surgery or laser. Those eligible for the trial are adults aged 17 years and above who live within 200 km of Bauchi with advanced glaucoma where the examining ophthalmologist recommends surgery or laser. After obtaining written informed consent, participants will be randomly allocated to MIG plus standard care, or standard care alone. Motivational interviewing will be delivered in Hausa or English by one of two MIG-trained personnel. One hundred and fifty participants will be recruited to each arm. 
The primary outcome is the proportion of participants undergoing laser or surgery within two months of the date given to re-attend for the procedure. MIG quality will be assessed using the validated MI treatment integrity scale. Discussion: Motivational interviewing may be an important tool to increase the acceptance of treatment for glaucoma. The approach is potentially scalable and may be useful for other chronic conditions in Africa. Trial registration: ISRCTN79330571 (Controlled-Trials.com). PMID:24773760
Sulforaphane inhibits the interferon-γ-induced expression of MIG, IP-10 and I-TAC in INS‑1 pancreatic β-cells through the downregulation of IRF-1, STAT-1 and PKB.

PubMed

Park, Yu-Kyoung; Ramalingam, Mahesh; Kim, Shin; Jang, Byeong-Churl; Park, Jong Wook

2017-09-01

Sulforaphane (SFN) is a dietary isothiocyanate abundantly available in cruciferous vegetables and has been shown to possess anti-inflammatory and immunomodulatory activities. Chemokines are important mediators of inflammation and immune responses due to their ability to recruit and activate macrophages and leukocytes. To date, little is known about the SFN-mediated regulation of chemokine expression in pancreatic β-cells. In this study, we investigated the inhibitory effects and mechanisms of SFN on the interferon-γ (IFN-γ)-induced expression of a subset of chemokines, including monokine induced by IFN-γ (MIG), IFN-inducible protein of 10 kDa (IP-10) and IFN-inducible T‑cell alpha chemoattractant (I-TAC), in INS‑1 cells, a rat pancreatic β-cell line. Notably, IFN-γ treatment led to an increase in the mRNA expression levels of MIG, IP-10 and I-TAC in the INS‑1 cells. However, SFN strongly blocked the mRNA expression of MIG, IP-10 and I-TAC induced by IFN-γ in INS‑1 cells. On the mechanistic level, SFN significantly decreased not only the mRNA expression levels of interferon regulatory factor-1 (IRF-1), but also the phosphorylation levels of signal transducer and activator of transcription-1 (STAT-1) and protein kinase B (PKB) which were induced by IFN-γ in the INS‑1 cells. Pharmacological inhibition experiments further revealed that treatment with JAK inhibitor I weakly inhibited the IFN-γ-induced expression of IP-10, whereas it strongly suppressed the IFN-γ-induced expression of MIG and I-TAC in the INS‑1 cells. Moreover, treatment with LY294002, a PI3K/PKB inhibitor, was able to slightly repress IFN‑γ‑induced expression of MIG and I-TAC, but not IP-10, in INS‑1 cells. 
Importantly, the IFN-γ-induced increase in the expression levels of MIG, IP-10 and I-TAC in the INS-1 cells was strongly inhibited by SFN, but not by other natural substances, such as curcumin, sanguinarine, resveratrol, triptolide and epigallocatechin gallate (EGCG), suggesting the specificity of SFN in downregulating the levels of these chemokines. To the best of our knowledge, these results collectively demonstrate for the first time that SFN strongly inhibits the IFN-γ-induced expression of MIG, IP-10 and I-TAC in INS‑1 cells and this inhibition is, at least in part, mediated through the reduced expression and phosphorylation levels of IRF-1, STAT-1 and PKB.
Challenging the New World Order: The Arms Transfer Policies of the Russian Republic

DTIC Science & Technology

1993-10-01

SU-22 fighter, SU-24 and SU-25 ground attack planes, MiG-29, MiG-31 fighters, Il-76 transports, "secondhand" AN-24 and Yak-40 passenger aircraft, i.e...notably in the Third World, and is the author of a forthcoming study of the Soviet Commissariat of Nationalities and editor of books on Soviet
Design of a double-anode magnetron-injection gun for the W-band gyrotron

NASA Astrophysics Data System (ADS)

Jang, Kwang Ho; Choi, Jin Joo; So, Joon Ho

2015-07-01

A double-anode magnetron-injection gun (MIG) was designed for a W-band 10-kW gyrotron. Analytic equations based on adiabatic theory and angular momentum conservation were used to examine the initial design parameters, such as the cathode angle and the radius of the beam-emitting surface. The MIG's performance was predicted using the electron trajectory code EGUN. The axial velocity spread, Δvz/vz, obtained from EGUN was 1.34% at α = 1.3. The cathode edge emission and the thermal effect were modeled. The cathode edge emission was found to have a major effect on the velocity spread. The electron beam's quality was significantly improved by affixing non-emissive cylinders to the cathode.
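The abstract does not list its analytic equations. As a hedged illustration of the kind of relations usually meant by "adiabatic theory" in gun design, the sketch below uses the common textbook form, not equations taken from this paper. The subscripts c (cathode region) and 0 (interaction region) and the symbol $v_0$ for the total beam velocity are assumed notation.

```latex
% Adiabatic invariance of the transverse motion in a slowly varying field B
\frac{v_{\perp}^{2}}{B} \approx \mathrm{const}
\quad\Rightarrow\quad
v_{\perp 0} \approx v_{\perp c}\sqrt{\frac{B_{0}}{B_{c}}}

% Velocity ratio and axial velocity at fixed total velocity v_0
\alpha = \frac{v_{\perp 0}}{v_{z0}},
\qquad
v_{z0} = \frac{v_{0}}{\sqrt{1+\alpha^{2}}}
```

For the quoted α = 1.3, the second relation gives $v_{z0}/v_0 = 1/\sqrt{1+1.69} \approx 0.61$, so roughly 61% of the beam velocity is axial. The reported Δvz/vz of 1.34% is a spread around that axial component.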

Convergence of prevalence rates of diabetes and cardiometabolic risk factors in middle and low income groups in urban India: 10-year follow-up of the Chennai Urban Population Study.

PubMed

Deepa, Mohan; Anjana, Ranjit Mohan; Manjula, Datta; Narayan, K M Venkat; Mohan, Viswanathan

2011-07-01

The aim of this study was to look for temporal changes in the prevalence of diabetes and cardiometabolic risk factors in two residential colonies in Chennai. The Chennai Urban Population Study (CUPS) was carried out between 1996 and 1998 in Chennai in two residential colonies representing the middle income group (MIG) and lower income group (LIG), respectively. The MIG had twice the prevalence rate of diabetes as the LIG and higher prevalence rates of hypertension, obesity, and dyslipidemia. Residents of the MIG colony were motivated to increase their physical activity, which led to the building of a park. The LIG was given standard lifestyle advice. Follow-up surveys of both colonies were performed after a period of 10 years. In the MIG, the prevalence of diabetes increased from 12.4 to 15.4% (24% increase), while in the LIG, it increased from 6.5 to 15.3% (135% increase, p < .001). In the LIG, the prevalence rates of central obesity (baseline vs follow-up, male: 30.8 vs 50.9%, p < .001; female: 16.9 vs 49.8%, p < .001), hypertension (8.4 vs 20.1%, p < .001), hypercholesterolemia (14.2 vs 20.4%, p < .05), and hypertriglyceridemia (8.0 vs 23.5%, p < .001) significantly increased and became similar to those seen in the MIG. There is a rapid reversal of the socioeconomic gradient for diabetes and cardiometabolic risk factors in urban India, with a convergence of prevalence rates among people in the MIG and LIG. This could have a serious economic impact on poor people in developing countries such as India. © 2011 Diabetes Technology Society.
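The "24% increase" and "135% increase" figures are relative changes in prevalence, not percentage-point differences. A minimal sketch of that arithmetic, using only the prevalence values quoted in the abstract (the function name is hypothetical):

```python
def relative_increase(baseline_pct, follow_up_pct):
    """Relative change in prevalence, in percent of the baseline value."""
    return 100.0 * (follow_up_pct - baseline_pct) / baseline_pct


# Diabetes prevalence at baseline and at the 10-year follow-up (from the abstract).
mig = relative_increase(12.4, 15.4)
lig = relative_increase(6.5, 15.3)
print(round(mig), round(lig))  # 24 135
```

In percentage points the changes are 3.0 (MIG) and 8.8 (LIG), which is why the two groups end at nearly the same prevalence despite very different relative increases.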
Disturbed Cartilage and Joint Homeostasis Resulting From a Loss of Mitogen-Inducible Gene 6 in a Mouse Model of Joint Dysfunction

PubMed Central

Pest, Michael A.; Russell, Bailey A.; Zhang, Yu-Wen; Jeong, Jae-Wook; Beier, Frank

2017-01-01

Objective: Mitogen-inducible gene 6 (MIG-6) regulates epidermal growth factor receptor (EGFR) signaling in synovial joint tissues. Whole-body knockout of the Mig6 gene in mice has been shown to induce osteoarthritis and joint degeneration. To evaluate the role of chondrocytes in this process, Mig6 was conditionally deleted from Col2a1-expressing cell types in the cartilage of mice. Methods: Bone and cartilage in the synovial joints of cartilage-specific Mig6-deleted (knockout [KO]) mice and control littermates were compared. Histologic staining and immunohistochemical analyses were used to evaluate joint pathology as well as the expression of key extracellular matrix and regulatory proteins. Calcified tissue in synovial joints was assessed by micro–computed tomography (micro-CT) and whole-skeleton staining. Results: Formation of long bones was found to be normal in KO animals. Cartilage thickness and proteoglycan staining of articular cartilage in the knee joints of 12-week-old KO mice were increased as compared to controls, with higher cellularity throughout the tissue. Radiopaque chondro-osseous nodules appeared in the knees of KO animals by 12 weeks of age and progressed to calcified bone–like tissue by 36 weeks of age. Nodules were also observed in the spine of 36-week-old animals. Erosion of bone at ligament entheses was evident by 12 weeks of age, by both histologic and micro-CT assessment. Conclusion: MIG-6 expression in chondrocytes is important for the maintenance of cartilage and joint homeostasis. Dysregulation of EGFR signaling in chondrocytes results in anabolic activity in cartilage, but erosion of ligament entheses and the formation of ectopic chondro-osseous nodules severely disturb joint physiology. PMID:24966136
A novel radiation hard pixel design for space applications

NASA Astrophysics Data System (ADS)

Aurora, A. M.; Marochkin, V. V.; Tuuva, T.

2017-11-01

We have developed a novel radiation hard photon detector concept based on Modified Internal Gate Field Effect Transistor (MIGFET) wherein a buried Modified Internal Gate (MIG) is implanted underneath a channel of a FET. In between the MIG and the channel of the FET there is depleted semiconductor material forming a potential barrier between charges in the channel and similar type signal charges located in the MIG. The signal charges in the MIG have a measurable effect on the conductance of the channel. In this paper a radiation hard double MIGFET pixel is investigated comprising two MIGFETs. By transferring the signal charges between the two MIGs Non-Destructive Correlated Double Sampling Readout (NDCDSR) is enabled. The radiation hardness of the proposed double MIGFET structure stems from the fact that interface related issues can be considerably mitigated. The reason for this is, first of all, that interface generated dark noise can be completely avoided and secondly, that interface generated 1/f noise can be considerably reduced due to a deep buried channel readout configuration. Electrical parameters of the double MIGFET pixel have been evaluated by 3D TCAD simulation study. Simulation results show the absence of interface generated dark noise, significantly reduced interface generated 1/f noise, well performing NDCDSR operation, and blooming protection due to an inherent vertical anti-blooming structure. In addition, the backside illuminated thick fully depleted pixel design results in low crosstalk due to lack of diffusion and good quantum efficiency from visible to Near Infra-Red (NIR) light. These facts result in excellent Signal-to-Noise Ratio (SNR) and very low crosstalk enabling thus excellent image quality. The simulation demonstrates the charge to current conversion gain for source current read-out to be 1.4 nA/e.
A Comparison of Endothelial Cell Loss in Combined Cataract and MIGS (Hydrus) Procedure to Phacoemulsification Alone: 6-Month Results

PubMed Central

Fea, Antonio M.; Consolandi, Giulia; Pignata, Giulia; Cannizzo, Paola Maria Loredana; Lavia, Carlo; Billia, Filippo; Rolle, Teresa; Grignolo, Federico M.

2015-01-01

Purpose. To compare the corneal endothelial cell loss after phacoemulsification, alone or combined with microinvasive glaucoma surgery (MIGS), in nonglaucomatous versus primary open angle glaucoma (POAG) eyes affected by age-related cataract. Methods. 62 eyes of 62 patients were divided into group 1 (n = 25, affected by age-related cataract) and group 2 (n = 37, affected by age-related cataract and POAG). All patients underwent cataract surgery. Group 2 was divided into subgroups A (n = 19, cataract surgery alone) and B (n = 18, cataract surgery and MIGS). Prior to and 6 months after surgery the patients' endothelium was studied. Main outcomes were CD (cell density), SD (standard deviation), CV (coefficient of variation), and 6A (hexagonality coefficient) variations after surgeries. Results. There were no significant differences among the groups concerning preoperative endothelial parameters. The differences in CD before and after surgery were significant in all groups: 9.1% in group 1, 17.24% in group 2A, and 11.71% in group 2B. All endothelial parameters did not significantly change after surgery. Conclusions. Phacoemulsification determined a loss of endothelial cells in all groups. After surgery the change in endothelial parameters after MIGS was comparable to the ones of patients who underwent cataract surgery alone. PMID:26664740
Headache disorders in children and adolescents: their association with psychological, behavioral, and socio-environmental factors.

PubMed

Kröner-Herwig, Birgit; Gassmann, Jennifer

2012-10-01

This cross-sectional study on a randomly drawn population sample of children and adolescents (n = 3399; aged 9 to 15) aimed at the assessment of patterns of associations between psychosocial variables and primary headache disorders like migraine (MIG) or tension-type headache. A headache-free group served as a control. Data on headache and psychological trait variables (eg, internalizing symptoms), behavioral factors (eg, physical activities), and socio-environmental factors (eg, life events) were gathered by questionnaire. Logistic regression analyses were conducted with headache types (MIG, tension-type, and non-classifiable headache) as dependent variables. The pattern of correlations was largely congruent between the headache disorders. Associations were closest regarding maladaptive psychological traits (in particular internalizing symptoms with an odds ratio > 4 regarding MIG) compared with socio-environmental factors and particularly the behavioral factors. Unfavorable psychological traits and socio-environmental strains demonstrated distinctly stronger associations with MIG than tension-type headache and explained more variance in the occurrence of pediatric headache disorders than parental headache. Sex-specific analyses showed similarities as well as differences regarding the correlations, and in general, the associations were stronger in girls than boys. A common path model as posited by several researchers in the field may explain the parallelism in biopsychosocial vulnerability regarding the different headache disorders. © 2012 American Headache Society.
Design of a Double Anode Magnetron Injection Gun for Q-band Gyro-TWT Using Boundary Element Method

NASA Astrophysics Data System (ADS)

Li, Zhiliang; Feng, Jinjun; Liu, Bentian

2018-04-01

This paper presents a novel design code for double anode magnetron injection guns (MIGs) in gyro-devices based on the boundary element method (BEM). The physical and mathematical models were constructed, and then the BEM code for MIG calculation was developed. Using the code, a double anode MIG for a Q-band gyrotron traveling-wave tube (gyro-TWT) amplifier operating in the circular TE01 mode at the fundamental cyclotron harmonic was designed. To verify the reliability of this code, the velocity spread and guiding center radius of the MIG simulated by the BEM code were compared with those from the commonly used EGUN code, showing reasonable agreement. Then, a Q-band gyro-TWT was fabricated and tested. The test results show that the device achieved an average power of 5 kW and peak power ≥ 150 kW at a 3% duty cycle within a bandwidth of 2 GHz, and a maximum output peak power of 220 kW, with a corresponding saturated gain of 50.9 dB and efficiency of 39.8%. This paper demonstrates that the BEM code can be used as an effective approach for analysis of electron optics systems in gyro-devices.
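The power and gain figures reported above can be related with two standard conversions. This is only a rough sketch, assuming ideal rectangular pulses (so average power equals peak power times duty cycle) and the usual decibel definition of power gain; the helper names are illustrative and are not taken from the paper.

```python
def average_power_kw(peak_kw: float, duty_cycle: float) -> float:
    """Average power for ideal rectangular pulses (an assumption)."""
    return peak_kw * duty_cycle


def db_to_linear(gain_db: float) -> float:
    """Convert a power gain in decibels to a linear power ratio."""
    return 10.0 ** (gain_db / 10.0)


DUTY_CYCLE = 0.03  # 3% duty cycle, as reported

# Average power implied by the reported peak powers at this duty cycle.
avg_at_150 = average_power_kw(150.0, DUTY_CYCLE)
avg_at_220 = average_power_kw(220.0, DUTY_CYCLE)

# Linear power ratio corresponding to the reported 50.9 dB saturated gain.
linear_gain = db_to_linear(50.9)

print(f"average power range: {avg_at_150:.1f} to {avg_at_220:.1f} kW")
print(f"linear power gain: {linear_gain:.3g}")
```

Under these assumptions the reported 5 kW average power falls between the values implied by the 150 kW and 220 kW peak figures.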
Inactivation of the transcription factor mig1 (YGL035C) in Saccharomyces cerevisiae improves tolerance towards monocarboxylic weak acids: acetic, formic and levulinic acid.

PubMed

Balderas-Hernández, Victor E; Correia, Kevin; Mahadevan, Radhakrishnan

2018-06-06

Toxic concentrations of monocarboxylic weak acids present in lignocellulosic hydrolyzates affect cell integrity and fermentative performance of Saccharomyces cerevisiae. In this work, we report the deletion of the general catabolite repressor Mig1p as a strategy to improve the tolerance of S. cerevisiae towards inhibitory concentrations of acetic, formic or levulinic acid. In contrast with the wt yeast, in which growth and ethanol production ceased in the presence of 5 g/L acetic acid or 1.75 g/L formic acid (initial pH not adjusted), the m9 strain (Δmig1::kan) produced 4.06 ± 0.14 and 3.87 ± 0.06 g/L of ethanol, respectively. The m9 strain also tolerated a higher concentration of 12.5 g/L acetic acid (initial pH adjusted to 4.5) without any effect on its fermentative performance. Moreover, the m9 strain produced 33% less acetic acid and 50-70% less glycerol in the presence of weak acids, and consumed acetate and formate as carbon sources under aerobic conditions. Our results show that the deletion of Mig1p provides a single gene deletion target for significantly improving the acid tolerance of yeast strains.
Micro-invasive glaucoma surgery (MIGS): a review of surgical procedures using stents

PubMed Central

Pillunat, Lutz E; Erb, Carl; Jünemann, Anselm GM; Kimmich, Friedemann

2017-01-01

Over the last decade several novel surgical treatment options and devices for glaucoma have been developed. All these developments aim to cause as little trauma as possible to the eye, to safely, effectively, and sustainably reduce intraocular pressure (IOP), to produce reproducible results, and to be easy to adopt. The term “micro-invasive glaucoma surgery (MIGS)” was used for summarizing all these procedures. Currently MIGS is gaining more and more interest and popularity. The possible reduction of the number of glaucoma medications, the ab interno approach without damaging the conjunctival tissue, and the probably safer procedures compared to incisional surgical methods may explain the increased interest in MIGS. The use of glaucoma drainage implants for lowering IOP in difficult-to-treat patients has been established for a long time, however, a variety of new glaucoma micro-stents are being manufactured by using various materials and are available to increase aqueous outflow via different pathways. This review summarizes published results of randomized clinical studies and extensive case report series on these devices, including Schlemm’s canal stents (iStent®, iStent® inject, Hydrus), suprachoroidal stents (CyPass®, iStent® Supra), and subconjunctival stents (XEN). The article summarizes the findings of published material on efficacy and safety for each of these approaches. PMID:28919702
BAC sequencing using pooled methods.

PubMed

Saski, Christopher A; Feltus, F Alex; Parida, Laxmi; Haiminen, Niina

2015-01-01

Shotgun sequencing and assembly of a large, complex genome can be both expensive and challenging to accurately reconstruct the true genome sequence. Repetitive DNA arrays, paralogous sequences, polyploidy, and heterozygosity are main factors that plague de novo genome sequencing projects that typically result in highly fragmented assemblies and are difficult to extract biological meaning. Targeted, sub-genomic sequencing offers complexity reduction by removing distal segments of the genome and a systematic mechanism for exploring prioritized genomic content through BAC sequencing. If one isolates and sequences the genome fraction that encodes the relevant biological information, then it is possible to reduce overall sequencing costs and efforts that target a genomic segment. This chapter describes the sub-genome assembly protocol for an organism based upon a BAC tiling path derived from a genome-scale physical map or from fine mapping using BACs to target sub-genomic regions. Methods that are described include BAC isolation and mapping, DNA sequencing, and sequence assembly.
Three-dimensional simulation of triode-type MIG for 1 MW, 120 GHz gyrotron for ECRH applications

NASA Astrophysics Data System (ADS)

Singh, Udaybir; Kumar, Nitin; Kumar, Narendra; Kumar, Anil; Sinha, A. K.

2012-01-01

In this paper, the three-dimensional simulation of a triode-type magnetron injection gun (MIG) for a 120 GHz, 1 MW gyrotron is presented. The operating voltages of the modulating anode and the accelerating anode are 57 kV and 80 kV, respectively. The high-order TE22,6 mode is selected as the operating mode, and the electron beam is launched at the first radial maximum for fundamental beam-mode operation. The initial design is obtained by using the in-house developed code MIGSYN. The numerical simulation is performed by using the commercially available code CST-Particle Studio (PS). The simulated MIG results obtained by using CST-PS are validated against two other simulation codes, EGUN and TRAK. The design output parameters obtained by using these three codes are found to be in close agreement.
Numerical Design of Megawatt Gyrotron with 120 GHz Frequency and 50% Efficiency for Plasma Fusion Application

NASA Astrophysics Data System (ADS)

Kumar, Nitin; Singh, Udaybir; Kumar, Anil; Bhattacharya, Ranajoy; Singh, T. P.; Sinha, A. K.

2013-02-01

The design of a 120 GHz, 1 MW gyrotron for plasma fusion application is presented in this paper. The mode selection is carried out with the aim of minimum mode competition, minimum cavity wall heating, etc. On the basis of the selected operating mode, the interaction cavity design and beam-wave interaction computation are carried out by using the PIC code. The design of a triode-type Magnetron Injection Gun (MIG) is also presented. The trajectory code EGUN, the synthesis code MIGSYN, and the data analysis code MIGANS are used in the MIG design. Further, the design of the MIG is also validated by using another trajectory code, TRAK. The design results of the beam dumping system (collector) and RF window are also presented. A depressed collector is designed to enhance the overall tube efficiency. The design study confirms >1 MW output power with tube efficiency around 50% (with collector efficiency).
Safety and Efficacy of Microinvasive Glaucoma Surgery

PubMed Central

Chen, David Z.

2017-01-01

Microinvasive glaucoma surgery (MIGS) is emerging as a new therapeutic option for glaucoma patients who wish to reduce their medication burden and avoid the postoperative complications of conventional glaucoma filtration surgery. These devices differ in terms of their efficacy and safety profile. Schlemm's canal devices have the most favorable safety profile at the compromise of modest efficacy, while subconjunctival and suprachoroidal devices are potentially more effective at lowering the intraocular pressure at the expense of a higher rate of complications. This review consolidates the latest evidence on the efficacy and safety of the MIGS devices in clinical use and provides an overview on upcoming devices which would likely also become viable treatment options in the near future. These clinical data would assist a glaucoma surgeon in selecting the most appropriate MIGS device for each patient based on the glaucoma severity and patient expectations. PMID:28512578
Genome-wide analysis of signal transducers and regulators of mitochondrial dysfunction in Saccharomyces cerevisiae.

PubMed

Singh, Keshav K; Rasmussen, Anne Karin; Rasmussen, Lene Juel

2004-04-01

Mitochondrial dysfunction is a hallmark of cancer cells. However, genetic response to mitochondrial dysfunction during carcinogenesis is unknown. To elucidate genetic response to mitochondrial dysfunction we used Saccharomyces cerevisiae as a model system. We analyzed genome-wide expression of nuclear genes involved in signal transduction and transcriptional regulation in a wild-type yeast and a yeast strain lacking the mitochondrial genome (rho(0)). Our analysis revealed that the gene encoding cAMP-dependent protein kinase subunit 3 (PKA3) was upregulated. However, the gene encoding cAMP-dependent protein kinase subunit 2 (PKA2) and the VTC1, PTK2, TFS1, CMK1, and CMK2 genes, involved in signal transduction, were downregulated. Among the known transcriptional factors, OPI1, MIG2, INO2, and ROX1 belonged to the upregulated genes, whereas MSN4, MBR1, ZMS1, ZAP1, TFC3, GAT1, ADR1, CAT8, and YAP4 including RFA1 were downregulated. RFA1 regulates DNA repair genes at the transcriptional level. RFA is also involved directly in DNA recombination, DNA replication, and DNA base excision repair. Downregulation of RFA1 in rho(0) cells is consistent with our finding that mitochondrial dysfunction leads to instability of the nuclear genome. Together, our data suggest that gene(s) involved in mitochondria-to-nucleus communication play a role in mutagenesis and may be implicated in carcinogenesis.
Assessing the Methodological Quality of Glaucoma Clinical Practice Guidelines and Their Recommendations on Microinvasive Glaucoma Surgery: A Systematic Review.

PubMed

Michaelov, Evan; Armstrong, James J; Nguyen, Mary; Instrum, Bridget; Lam, Tracey; Denstedt, James; Hutnik, Cindy M L

2018-02-01

Clinical practice guidelines (CPG) are regarded by many as critical communications providing guidance within specific medical fields. Over a decade ago, the first microinvasive glaucoma surgical (MIGS) procedures were introduced. Since then, a number of these novel intraocular pressure controlling surgical options have been approved worldwide. Governing bodies and health care administration often utilize CPGs when considering funding for newer technologies. This highlights the importance of well-written, accurate, and up-to-date CPGs in the rapidly evolving field of MIGS. If CPGs are unable to fill this role, their use in treatment decision-making is doing a disservice to patients, who will be denied currently available and potentially superior care. To determine the overall value of a CPG, the methodological quality with which it was developed, in addition to the current relevance and appropriateness of its recommendations, should be evaluated. The objective of the present study was to assess the methodological quality of currently available international glaucoma CPGs, as well as their coverage of MIGS as a surrogate marker of relevance and appropriateness to policy-makers and ophthalmologists alike. To identify potentially relevant CPGs, a predefined search strategy was used to search the following databases: Medline, EMBASE, BIOSIS, and Web of Science. All CPGs related to adult glaucoma and published in English were included. CPG methodological quality was assessed by 3 individuals using the Appraisal of Guidelines for Research and Evaluation II (AGREE II) tool. Studies were then assessed for coverage of MIGS devices and procedures. Search strategy and subsequent screening identified 11 CPGs for analysis. Eight were of high quality according to the AGREE II criteria. Three included basic information on MIGS, but none provided specific recommendations regarding their indications or which patient populations would benefit most. 
Many international glaucoma CPGs are of high methodological quality. However, coverage of MIGS is sparse, nonspecific and in many instances, absent. This causes CPGs to be a suboptimal source in guiding physicians and health policy-makers in areas characterized by novel and/or rapidly evolving technologies. Mechanisms to incorporate updated evidence in CPGs would have to be considered before they can be used as a source of contemporary clinical decision-making.
Assessment of biological chromium among stainless steel and mild steel welders in relation to welding processes.

PubMed

Edmé, J L; Shirali, P; Mereau, M; Sobaszek, A; Boulenguez, C; Diebold, F; Haguenoer, J M

1997-01-01

Air and biological monitoring were used for assessing external and internal chromium exposure among 116 stainless steel welders (SS welders) using manual metal arc (MMA), metal inert gas (MIG) and tungsten inert gas (TIG) welding processes (MMA: n = 57; MIG: n = 37; TIG: n = 22) and 30 mild steel welders (MS welders) using MMA and MIG welding processes (MMA: n = 14; MIG: n = 16). The levels of atmospheric total chromium were evaluated after personal air monitoring. The mean values for the different groups of SS welders were 201 micrograms/m3 (MMA), 185 micrograms/m3 (MIG), and 52 micrograms/m3 (TIG), and for MS welders 8.1 micrograms/m3 (MMA) and 7.3 micrograms/m3 (MIG). The curve of cumulative frequency distribution from biological monitoring among SS welders showed chromium geometric mean concentrations in whole blood of 3.6 micrograms/l (95th percentile = 19.9), in plasma of 3.3 micrograms/l (95th percentile = 21.0) and in urine samples of 6.2 micrograms/l (95th percentile = 58.0). Among MS welders, mean values in whole blood and plasma were rather more scattered (1.8 micrograms/l, 95th percentile = 9.3 and 1.3 micrograms/l, 95th percentile = 8.4, respectively) and in urine the value was 2.4 micrograms/l (95th percentile = 13.3). The analysis of variance of chromium concentrations in plasma previously showed a metal effect (F = 29.7, P < 0.001), a process effect (F = 22.2, P < 0.0001) but no metal-process interaction (F = 1.3, P = 0.25). Concerning urinary chromium concentration, the analysis of variance also showed a metal effect (F = 30, P < 0.0001), a process effect (F = 72, P < 0.0001) as well as a metal-process interaction (F = 13.2, P = 0.0004). Throughout the study we did not note any significant differences between smokers and non-smokers among welders. Taking into account the relationships between chromium concentrations in whole blood, plasma or urine and the different welding processes, MMA-SS is definitely different from other processes because the biological values are clearly higher. These higher levels are due to the very significant concentrations of total soluble chromium, mainly hexavalent chromium, in welding fumes.
Whole-genome sequencing for comparative genomics and de novo genome assembly.

PubMed

Benjak, Andrej; Sala, Claudia; Hartkoorn, Ruben C

2015-01-01

Next-generation sequencing technologies for whole-genome sequencing of mycobacteria are rapidly becoming an attractive alternative to more traditional sequencing methods. In particular this technology is proving useful for genome-wide identification of mutations in mycobacteria (comparative genomics) as well as for de novo assembly of whole genomes. Next-generation sequencing however generates a vast quantity of data that can only be transformed into a usable and comprehensible form using bioinformatics. Here we describe the methodology one would use to prepare libraries for whole-genome sequencing, and the basic bioinformatics to identify mutations in a genome following Illumina HiSeq or MiSeq sequencing, as well as de novo genome assembly following sequencing using Pacific Biosciences (PacBio).
Metal-induced gap states in ferroelectric capacitors and its relationship with complex band structures

NASA Astrophysics Data System (ADS)

Junquera, Javier; Aguado-Puente, Pablo

2013-03-01

At metal-insulator interfaces, the metallic wave functions with an energy eigenvalue within the band gap decay exponentially inside the dielectric (metal-induced gap states, MIGS). These MIGS can actually be regarded as Bloch functions with an associated complex wave vector. Usually only real values of the wave vectors are discussed in textbooks, since infinite periodicity is assumed and, in that situation, wave functions growing exponentially in any direction would not be physically valid. However, localized wave functions with an exponential decay are indeed perfectly valid solutions of the Schrödinger equation in the presence of defects, surfaces or interfaces. For this reason, properties of MIGS have typically been discussed in terms of the complex band structure of bulk materials. The probable dependence on the interface particulars has rarely been taken into account explicitly because of the difficulty of including them in the model or simulations. We aim to characterize from first-principles simulations the MIGS in realistic ferroelectric capacitors and their connection with the complex band structure of the ferroelectric material. We emphasize the influence of the real interface beyond the complex band structure of bulk materials. Financial support provided by MICINN Grant FIS2009-12721-C04-02, and by the European Union Grant No. CP-FP 228989-2 "OxIDes". Computer resources provided by the RES.
Assessment of the Incorporation of Patient-Centric Outcomes in Studies of Minimally Invasive Glaucoma Surgical Devices

PubMed Central

Le, Jimmy T.; Viswanathan, Shilpa; Tarver, Michelle E.; Eydelman, Malvina; Li, Tianjing

2017-01-01

IMPORTANCE Minimally invasive glaucoma surgical (MIGS) devices are one option for lowering intraocular pressure in patients with glaucoma. OBJECTIVE To examine how often existing clinical studies of MIGS devices registered on ClinicalTrials.gov measure patient-centric outcomes that patients value directly. DESIGN, SETTING, AND PARTICIPANTS We searched ClinicalTrials.gov, a registry of publicly and privately supported clinical studies, on February 20, 2015, for records of MIGS device studies involving patients with glaucoma. Two investigators independently abstracted study design and outcome details from eligible records. We classified outcomes as patient-centric or not patient-centric using a prespecified definition. MAIN OUTCOMES AND MEASURES Proportion of patient-centric and nonpatient-centric outcomes registered on ClinicalTrials.gov. RESULTS We identified 51 eligible studies specifying 127 outcomes. Reduction in intraocular pressure was the most frequent outcome specified (78/127; 61%) and a primary outcome in 41 studies. Patient-centric outcomes—such as adverse events (n = 19; 15%), topical medication use (n = 16; 13%), visual acuity (n = 4; 3%), and health-related quality of life (n = 1; 1%)—were less frequently specified (n = 40; 32%) and a primary outcome in only 12 studies. CONCLUSION AND RELEVANCE Patient-centric outcomes that provide insight into the relative desirability and acceptability of the benefits and risks of MIGS devices are not well represented in current clinical studies. PMID:27389667
Smoking impact on grip strength and fatigue resistance: implications for exercise and hand therapy practice.

PubMed

Al-Obaidi, Saud; Al-Sayegh, Nowall; Nadar, Mohammed

2014-07-01

Grip strength assessment reflects the overall health of the musculoskeletal system and is a predictor of functional prognosis and mortality. The purpose of this study was to examine whether grip strength and fatigue resistance are impaired in smokers, and to determine whether smoking-related impairments (fatigue-index) can be predicted by demographic data, duration of smoking, packets smoked per day, and physical activity. Maximum isometric grip strength (MIGS) of male smokers (n = 111) and nonsmokers (n = 66) was measured before/after induced fatigue using a Jamar dynamometer at 5 handle positions. Fatigue index was calculated based on percentage change in MIGS initially and after induced fatigue. The number of repetitions of squeezing a soft rubber ball needed to induce fatigue was significantly lower in smokers compared with nonsmokers (t = 10.6, P < .001 dominant hand; t = 13.9, P < .001 nondominant), demonstrating a significantly higher fatigue-index for smokers than nonsmokers (t = -8.7, P < .001 dominant hand; t = -6.0, P < .001 nondominant). The effect of smoking status on MIGS scores was significantly different between smokers and nonsmokers after induced fatigue (β = -3.98, standard error = 0.59, P < .001) where smokers experienced on average a reduction of nearly 4 MIGS less than nonsmokers before fatigue. Smoking status was the strongest significant independent predictor of the fatigue-index. Smokers demonstrated reduced grip strength and fast fatigability in comparison with nonsmokers.
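The abstract above describes the fatigue index only as a percentage change in MIGS between the initial and post-fatigue measurements. One possible reading is sketched below, assuming the index is the percentage drop relative to the initial value; the sign convention, the function name `fatigue_index`, and the sample readings are all hypothetical and are not data from the study.

```python
def fatigue_index(initial_migs: float, post_fatigue_migs: float) -> float:
    """Percentage drop in grip strength after induced fatigue.

    Assumed convention: a larger positive value means more fatigue.
    """
    if initial_migs <= 0:
        raise ValueError("initial grip strength must be positive")
    return (initial_migs - post_fatigue_migs) / initial_migs * 100.0


# Hypothetical dynamometer readings (kg), for illustration only.
example_index = fatigue_index(initial_migs=40.0, post_fatigue_migs=34.0)
print(f"fatigue index: {example_index:.1f}%")
```

With this convention, a participant whose grip strength does not change scores 0, and larger losses give larger indices.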
SDN-1/Syndecan Acts in Parallel to the Transmembrane Molecule MIG-13 to Promote Anterior Neuroblast Migration.

PubMed

Sundararajan, Lakshmi; Norris, Megan L; Lundquist, Erik A

2015-05-28

The Q neuroblasts in Caenorhabditis elegans display left-right asymmetry in their migration, with QR and descendants on the right migrating anteriorly, and QL and descendants on the left migrating posteriorly. Initial QR and QL migration is controlled by the transmembrane receptors UNC-40/DCC, PTP-3/LAR, and the Fat-like cadherin CDH-4. After initial migration, QL responds to an EGL-20/Wnt signal that drives continued posterior migration by activating MAB-5/Hox activity in QL but not QR. QR expresses the transmembrane protein MIG-13, which is repressed by MAB-5 in QL and which drives anterior migration of QR descendants. A screen for new Q descendant AQR and PQR migration mutations identified mig-13 as well as hse-5, the gene encoding the glucuronyl C5-epimerase enzyme, which catalyzes epimerization of glucuronic acid to iduronic acid in the heparan sulfate side chains of heparan sulfate proteoglycans (HSPGs). Of five C. elegans HSPGs, we found that only SDN-1/Syndecan affected Q migrations. sdn-1 mutants showed QR descendant AQR anterior migration defects, and weaker QL descendant PQR migration defects. hse-5 affected initial Q migration, whereas sdn-1 did not. sdn-1 and hse-5 acted redundantly in AQR and PQR migration, but not initial Q migration, suggesting the involvement of other HSPGs in Q migration. Cell-specific expression studies indicated that SDN-1 can act in QR to promote anterior migration. Genetic interactions between sdn-1, mig-13, and mab-5 suggest that MIG-13 and SDN-1 act in parallel to promote anterior AQR migration and that SDN-1 also controls posterior migration. Together, our results indicate previously unappreciated complexity in the role of multiple signaling pathways and inherent left-right asymmetry in the control of Q neuroblast descendant migration. Copyright © 2015 Sundararajan et al.

Prevalence of CMV infection among staff in a metropolitan children's hospital - occupational health screening findings.

PubMed

Stranzinger, Johanna; Kindel, Jutta; Henning, Melanie; Wendeler, Dana; Nienhaus, Albert

2016-01-01

Background: Staff in children's hospitals may run an increased risk of cytomegalovirus (CMV) contact infection leading to a congenital CMV fetopathy during pregnancy. The main risk factor is close contact with inapparent carriers of CMV among infants (<3 years). We therefore examined CMV seroprevalence (SP) and possible risk factors for CMV infection among staff at a children's hospital. Method: In 2014, staff at a metropolitan children's hospital were offered a CMV antibody test in the context of occupational health screening. Besides anti-CMV immunoglobulin G (anti-CMV IgG), gender, age, profession, number of children and migration background were assessed and used as independent variables in multiple logistic regression. Women without a migration background (MIG) were considered as a separate group. Results: The study included 219 employees. Women showed a significantly higher risk than men of being CMV-positive (adjusted odds ratio [aOR] 3.0; 95% CI 1.1-7.8). The risk among age groups of 30 and over was double that of the under-30s (aOR 2.0; 95% CI 1.0-3.9); among those aged 40-plus it was aOR 2.3 (95% CI 1.1-4.7). Staff with an MIG tested positive more often than those without an MIG (95.5% versus 45.7%). CMV SP was 47.7% among women without an MIG. In this subgroup the probability of CMV infection increased with age (p=0.08) as well. Conclusion: In the staff group as a whole there was a significant correlation between CMV SP, country of origin and age. We found no significant differences between occupational groups; perhaps our random sample was too small. Given the low CMV SP, particularly in those without an MIG, women who want to have children in particular must be protected from CMV infection. Follow-up studies should be undertaken to test whether good workplace hygiene offers sufficient protection for pregnant women and could be an alternative to prohibiting certain activities.
SDN-1/Syndecan Acts in Parallel to the Transmembrane Molecule MIG-13 to Promote Anterior Neuroblast Migration

PubMed Central

Sundararajan, Lakshmi; Norris, Megan L.; Lundquist, Erik A.

2015-01-01

The Q neuroblasts in Caenorhabditis elegans display left-right asymmetry in their migration, with QR and descendants on the right migrating anteriorly, and QL and descendants on the left migrating posteriorly. Initial QR and QL migration is controlled by the transmembrane receptors UNC-40/DCC, PTP-3/LAR, and the Fat-like cadherin CDH-4. After initial migration, QL responds to an EGL-20/Wnt signal that drives continued posterior migration by activating MAB-5/Hox activity in QL but not QR. QR expresses the transmembrane protein MIG-13, which is repressed by MAB-5 in QL and which drives anterior migration of QR descendants. A screen for new Q descendant AQR and PQR migration mutations identified mig-13 as well as hse-5, the gene encoding the glucuronyl C5-epimerase enzyme, which catalyzes epimerization of glucuronic acid to iduronic acid in the heparan sulfate side chains of heparan sulfate proteoglycans (HSPGs). Of five C. elegans HSPGs, we found that only SDN-1/Syndecan affected Q migrations. sdn-1 mutants showed QR descendant AQR anterior migration defects, and weaker QL descendant PQR migration defects. hse-5 affected initial Q migration, whereas sdn-1 did not. sdn-1 and hse-5 acted redundantly in AQR and PQR migration, but not initial Q migration, suggesting the involvement of other HSPGs in Q migration. Cell-specific expression studies indicated that SDN-1 can act in QR to promote anterior migration. Genetic interactions between sdn-1, mig-13, and mab-5 suggest that MIG-13 and SDN-1 act in parallel to promote anterior AQR migration and that SDN-1 also controls posterior migration. Together, our results indicate previously unappreciated complexity in the role of multiple signaling pathways and inherent left-right asymmetry in the control of Q neuroblast descendant migration. PMID:26022293
Autocrine CCL2, CXCL4, CXCL9 and CXCL10 signal in retinal endothelial cells and are enhanced in diabetic retinopathy.

PubMed

Nawaz, M I; Van Raemdonck, K; Mohammad, G; Kangave, D; Van Damme, J; Abu El-Asrar, A M; Struyf, S

2013-04-01

This study aimed at examining the presence and role of chemokines (angiogenic CCL2/MCP-1 and angiostatic CXCL4/PF-4, CXCL9/Mig, CXCL10/IP-10) in proliferative diabetic retinopathy (PDR). Regulated chemokine production in human retinal microvascular cells (HRMEC) and chemokine levels in vitreous samples from 40 PDR and 29 non-diabetic patients were analyzed. MCP-1, PF-4, Mig, IP-10 and VEGF levels in vitreous fluid from PDR patients were significantly higher than in controls. Except for IP-10, cytokine levels were significantly higher in PDR with active neovascularization and PDR without traction retinal detachment (TRD) than those in inactive PDR, PDR with TRD and control subjects. Exploratory regression analysis identified associations between higher levels of IP-10 and inactive PDR and PDR with TRD. VEGF levels correlated positively with MCP-1 and IP-10. Significant positive correlations were observed between MCP-1 and IP-10 levels. In line with these clinical findings Western blot analysis revealed increased PF-4 expression in diabetic rat retinas. HRMEC produced MCP-1, Mig and IP-10 after stimulation with IFN-γ, IL-1β or lipopolysaccharide. IFN-γ synergistically enhanced Mig and IP-10 production in response to IL-1β or lipopolysaccharide. MCP-1 was produced by HRMEC in response to VEGF treatment and activated HRMEC via the ERK and Akt/PKB pathway. On the other hand, phosphorylation of ERK induced by VEGF and MCP-1 was inhibited by PF-4, Mig and IP-10. In accordance with inhibition of angiogenic signal transduction pathways, PF-4 inhibited in vitro migration of HRMEC. Thus, regulatory roles for chemokines in PDR were demonstrated. In particular, IP-10 might be associated with the resolution of active PDR and the development of TRD. Copyright © 2013 Elsevier Ltd. All rights reserved.
The Role of Minimally Invasive Glaucoma Surgery Devices in the Management of Glaucoma

PubMed Central

Fingeret, Murray; Dickerson, Jaime E.

2018-01-01

SIGNIFICANCE Noncompliance is a problem affecting glaucoma patients. Approaches to improve adherence include the use of drug-delivery systems and safer forms of surgery. Minimally invasive glaucoma surgery (MIGS) has reduced complications, particularly in combination with cataract surgery, and with its good intraocular pressure (IOP) reduction may reduce or eliminate glaucoma medications. Glaucoma is a progressive disease and a leading cause of irreversible blindness. Elevated IOP is the most important risk factor, but effective medical management is dependent on patient adherence. This review summarizes the adherence problem in glaucoma and the efforts, including MIGS, to provide effective IOP control that is not dependent on patient compliance. The current understanding of patient adherence to pharmacological treatment of glaucoma is discussed including the challenges facing glaucoma patients. Historical approaches to providing IOP control in a sustained and reliable way are presented culminating in a review of the burgeoning use of MIGS devices. It is estimated that, in the United States, 27% of prescriptions written, across all medications, are not filled or are filled but not taken. For ocular hypotensive medications, even when filled, a large percentage (which varies widely by study) are not instilled as prescribed. To address this problem, methods for sustained drug delivery have been and continue to be developed, as well as surgical and laser approaches. Most recently, MIGS devices have gained popularity because of the ease of implantation during cataract surgery, favorable safety profile, and the possibility for effective and long-lasting IOP lowering, as well as the reduction or elimination of need for IOP-lowering medication. Poor adherence to treatment is relatively common among glaucoma patients and is associated with progression of disease. 
Recommending MIGS implantation during cataract surgery may offer optometrists a valuable treatment option in managing glaucoma patients, particularly where good adherence is in doubt. PMID:29370021
An efficient approach to BAC based assembly of complex genomes.

PubMed

Visendi, Paul; Berkman, Paul J; Hayashi, Satomi; Golicz, Agnieszka A; Bayer, Philipp E; Ruperao, Pradeep; Hurgobin, Bhavna; Montenegro, Juan; Chan, Chon-Kit Kenneth; Staňková, Helena; Batley, Jacqueline; Šimková, Hana; Doležel, Jaroslav; Edwards, David

2016-01-01

There has been an exponential growth in the number of genome sequencing projects since the introduction of next generation DNA sequencing technologies. Genome projects have increasingly involved assembly of whole genome data which produces inferior assemblies compared to traditional Sanger sequencing of genomic fragments cloned into bacterial artificial chromosomes (BACs). While whole genome shotgun sequencing using next generation sequencing (NGS) is relatively fast and inexpensive, this method is extremely challenging for highly complex genomes, where polyploidy or high repeat content confounds accurate assembly, or where a highly accurate 'gold' reference is required. Several attempts have been made to improve genome sequencing approaches by incorporating NGS methods, to variable success. We present the application of a novel BAC sequencing approach which combines indexed pools of BACs, Illumina paired read sequencing, a sequence assembler specifically designed for complex BAC assembly, and a custom bioinformatics pipeline. We demonstrate this method by sequencing and assembling BAC cloned fragments from bread wheat and sugarcane genomes. We demonstrate that our assembly approach is accurate, robust, cost effective and scalable, with applications for complete genome sequencing in large and complex genomes.
Human genetics and genomics a decade after the release of the draft sequence of the human genome.

PubMed

Naidoo, Nasheen; Pawitan, Yudi; Soong, Richie; Cooper, David N; Ku, Chee-Seng

2011-10-01

Substantial progress has been made in human genetics and genomics research over the past ten years since the publication of the draft sequence of the human genome in 2001. Findings emanating directly from the Human Genome Project, together with those from follow-on studies, have had an enormous impact on our understanding of the architecture and function of the human genome. Major developments have been made in cataloguing genetic variation, the International HapMap Project, and with respect to advances in genotyping technologies. These developments are vital for the emergence of genome-wide association studies in the investigation of complex diseases and traits. In parallel, the advent of high-throughput sequencing technologies has ushered in the 'personal genome sequencing' era for both normal and cancer genomes, and made possible large-scale genome sequencing studies such as the 1000 Genomes Project and the International Cancer Genome Consortium. The high-throughput sequencing and sequence-capture technologies are also providing new opportunities to study Mendelian disorders through exome sequencing and whole-genome sequencing. This paper reviews these major developments in human genetics and genomics over the past decade.
Human genetics and genomics a decade after the release of the draft sequence of the human genome

PubMed Central

2011-01-01

Substantial progress has been made in human genetics and genomics research over the past ten years since the publication of the draft sequence of the human genome in 2001. Findings emanating directly from the Human Genome Project, together with those from follow-on studies, have had an enormous impact on our understanding of the architecture and function of the human genome. Major developments have been made in cataloguing genetic variation, the International HapMap Project, and with respect to advances in genotyping technologies. These developments are vital for the emergence of genome-wide association studies in the investigation of complex diseases and traits. In parallel, the advent of high-throughput sequencing technologies has ushered in the 'personal genome sequencing' era for both normal and cancer genomes, and made possible large-scale genome sequencing studies such as the 1000 Genomes Project and the International Cancer Genome Consortium. The high-throughput sequencing and sequence-capture technologies are also providing new opportunities to study Mendelian disorders through exome sequencing and whole-genome sequencing. This paper reviews these major developments in human genetics and genomics over the past decade. PMID:22155605
Genome sequence of Phytophthora ramorum: implications for management

Treesearch

Brett Tyler; Sucheta Tripathy; Nik Grunwald; Kurt Lamour; Kelly Ivors; Matteo Garbelotto; Daniel Rokhsar; Nik Putnam; Igor Grigoriev; Jeffrey Boore

2006-01-01

A draft genome sequence has been determined for Phytophthora ramorum, together with a draft sequence of the soybean pathogen Phytophthora sojae. The P. ramorum genome was sequenced to a depth of 7-fold coverage, while the P. sojae genome was sequenced to a depth of 9-fold coverage. The genome...
Genome Science: A Video Tour of the Washington University Genome Sequencing Center for High School and Undergraduate Students

ERIC Educational Resources Information Center

Flowers, Susan K.; Easter, Carla; Holmes, Andrea; Cohen, Brian; Bednarski, April E.; Mardis, Elaine R.; Wilson, Richard K.; Elgin, Sarah C. R.

2005-01-01

Sequencing of the human genome has ushered in a new era of biology. The technologies developed to facilitate the sequencing of the human genome are now being applied to the sequencing of other genomes. In 2004, a partnership was formed between Washington University School of Medicine Genome Sequencing Center's Outreach Program and Washington…
Graphene integrated circuits: new prospects towards receiver realisation.

PubMed

Saeed, Mohamed; Hamed, Ahmed; Wang, Zhenxing; Shaygan, Mehrdad; Neumaier, Daniel; Negra, Renato

2017-12-21

This work demonstrates a design approach which enables the fabrication of fully integrated radio frequency (RF) and millimetre-wave frequency direct-conversion graphene receivers by adapting the frontend architecture to exploit the state-of-the-art performance of the recently reported wafer-scale CVD metal-insulator-graphene (MIG) diodes. As a proof-of-concept, we built a fully integrated microwave receiver in the frequency range 2.1-2.7 GHz employing the strong nonlinearity and the high responsivity of MIG diodes to successfully receive and demodulate complex, digitally modulated communication signals at 2.45 GHz. In addition, the fabricated receiver uses zero-biased MIG diodes and consumes zero dc power. With the flexibility to be fabricated on different substrates, the prototype receiver frontend is fabricated on a low-cost, glass substrate utilising a custom-developed MMIC process backend which enables the high performance of passive components. The measured performance of the prototype makes it suitable for Internet-of-Things (IoT) and Radio Frequency Identification (RFID) systems for medical and communication applications.
Relationship between welding fume concentration and systemic inflammation after controlled exposure of human subjects with welding fumes from metal inert gas brazing of zinc-coated materials.

PubMed

Brand, Peter; Bauer, Marcus; Gube, Monika; Lenz, Klaus; Reisgen, Uwe; Spiegel-Ciobanu, Vilia Elena; Kraus, Thomas

2014-01-01

It has been shown that exposure of subjects to emissions from a metal inert gas (MIG) brazing process of zinc-coated material led to an increase of high-sensitivity C-reactive protein (hsCRP) in the blood. In this study, the no-observed-effect level (NOEL) for such emissions was assessed. Twelve healthy subjects were exposed for 6 hours to different concentrations of MIG brazing fumes under controlled conditions. High-sensitivity C-reactive protein was measured in the blood. For welding fumes containing 1.20 and 1.50 mg m⁻³ zinc, high-sensitivity C-reactive protein was increased the day after exposure. For 0.90 mg m⁻³ zinc, no increase was detected. These data indicate that the no-observed-effect level for emissions from a MIG brazing process of zinc-coated material in respect to systemic inflammation is found for welding fumes with zinc concentrations between 0.90 and 1.20 mg m⁻³.
TECHNIQUES AND OUTCOMES OF MINIMALLY-INVASIVE TRABECULAR ABLATION AND BYPASS SURGERY

PubMed Central

Kaplowitz, Kevin; Schuman, Joel S.; Loewen, Nils A.

2014-01-01

Minimally invasive glaucoma surgeries (MIGS) can improve the conventional, pressure dependent outflow by bypassing or ablating the trabecular meshwork or create alternative drainage routes into the suprachoroidal or subconjunctival space. They have a highly favorable risk profile compared to penetrating surgeries and lower intraocular pressure with variable efficacy that may depend on the extent of outflow segments accessed. Since they are highly standardized procedures that use clear corneal incisions, they can elegantly be combined with cataract and refractive procedures to improve vision in the same session. There is a growing need for surgeons to become proficient in MIGS to address the increasing prevalence of glaucoma and cataracts in a well-informed, aging population. Techniques of visualization and instrumentation in an anatomically highly confined space with semi-transparent tissues are fundamentally different from other anterior segment surgeries and present even experienced surgeons with a substantial learning curve. Here, we provide practical tips and review techniques and outcomes of TM bypass and ablation MIGS. PMID:24338085
Origin of the Y genome in Elymus and its relationship to other genomes in Triticeae based on evidence from elongation factor G (EF-G) gene sequences.

PubMed

Sun, Genlou; Komatsuda, Takao

2010-08-01

It is well known that Elymus arose through hybridization between representatives of different genera. Cytogenetic analyses show that all its members include the St genome in combination with one or more of four other genomes, the H, Y, P, and W genomes. The origins of the H, P, and W genomes are known, but that of the Y genome is not. We analyzed the single copy nuclear gene coding for elongation factor G (EF-G) from 28 accessions of polyploid Elymus species and 45 accessions of diploid Triticeae species in order to investigate the origin of the Y genome and its relationship to other genomes in the tribe Triticeae. Sequence comparisons among the St, H, Y, P, W, and E genomes detected genome-specific polymorphisms at 66 nucleotide positions. The St and Y genomes are relatively dissimilar. The phylogeny of the Y genome sequences was investigated for the first time. They were most similar to the W genome sequences. The Y genome sequences were placed in two different groups. These two groups were included in an unresolved clade that included the W and E sequences as well as sequences from many annual species. The H genome sequences were in a clade with the F, P, and Ns genome sequences as sister groups. These two clades were more closely related to each other and to the L and Xp genomes than they were to the St genome sequences. These data support the hypothesis that the Y genome evolved in a diploid species and has a different origin from the St genome. Copyright 2010 Elsevier Inc. All rights reserved.
Company profile: Complete Genomics Inc.

PubMed

Reid, Clifford

2011-02-01

Complete Genomics Inc. is a life sciences company that focuses on complete human genome sequencing. It is taking a completely different approach to DNA sequencing than other companies in the industry. Rather than building a general-purpose platform for sequencing all organisms and all applications, it has focused on a single application - complete human genome sequencing. The company's Complete Genomics Analysis Platform (CGA™ Platform) comprises an integrated package of biochemistry, instrumentation and software that sequences human genomes at the highest quality, lowest cost and largest scale available. Complete Genomics offers a turnkey service that enables customers to outsource their human genome sequencing to the company's genome sequencing center in Mountain View, CA, USA. Customers send in their DNA samples, the company does all the library preparation, DNA sequencing, assembly and variant analysis, and customers receive research-ready data that they can use for biological discovery.
Curated eutherian third party data gene data sets.

PubMed

Premzl, Marko

2016-03-01

The freely available eutherian genomic sequence data sets advanced the scientific field of genomics. Of note, future revisions of gene data sets were expected because of the incompleteness of public eutherian genomic sequence assemblies and potential genomic sequence errors. The eutherian comparative genomic analysis protocol was proposed as guidance for protecting against potential genomic sequence errors in public eutherian genomic sequences. The protocol was applicable in updates of 7 major eutherian gene data sets, including 812 complete coding sequences deposited in the European Nucleotide Archive as curated third party data gene data sets.
Approaches for in silico finishing of microbial genome sequences

PubMed Central

Kremer, Frederico Schmitt; McBride, Alan John Alexander; Pinto, Luciano da Silva

2017-01-01

The introduction of next-generation sequencing (NGS) had a significant effect on the availability of genomic information, leading to an increase in the number of sequenced genomes from a large spectrum of organisms. Unfortunately, due to the limitations implied by the short-read sequencing platforms, most of these newly sequenced genomes remained as “drafts”, incomplete representations of the whole genetic content. The previous genome sequencing studies indicated that finishing a genome sequenced by NGS, even bacteria, may require additional sequencing to fill the gaps, making the entire process very expensive. As such, several in silico approaches have been developed to optimize the genome assemblies and facilitate the finishing process. The present review aims to explore some free (open source, in many cases) tools that are available to facilitate genome finishing. PMID:28898352
Approaches for in silico finishing of microbial genome sequences.

PubMed

Kremer, Frederico Schmitt; McBride, Alan John Alexander; Pinto, Luciano da Silva

The introduction of next-generation sequencing (NGS) had a significant effect on the availability of genomic information, leading to an increase in the number of sequenced genomes from a large spectrum of organisms. Unfortunately, due to the limitations implied by the short-read sequencing platforms, most of these newly sequenced genomes remained as "drafts", incomplete representations of the whole genetic content. The previous genome sequencing studies indicated that finishing a genome sequenced by NGS, even bacteria, may require additional sequencing to fill the gaps, making the entire process very expensive. As such, several in silico approaches have been developed to optimize the genome assemblies and facilitate the finishing process. The present review aims to explore some free (open source, in many cases) tools that are available to facilitate genome finishing.
Annotation-based genome-wide SNP discovery in the large and complex Aegilops tauschii genome using next-generation sequencing without a reference genome sequence

PubMed Central

2011-01-01

Background Many plants have large and complex genomes with an abundance of repeated sequences. Many plants are also polyploid. Both of these attributes typify the genome architecture in the tribe Triticeae, whose members include economically important wheat, rye and barley. Large genome sizes, an abundance of repeated sequences, and polyploidy present challenges to genome-wide SNP discovery using next-generation sequencing (NGS) of total genomic DNA by making alignment and clustering of short reads generated by the NGS platforms difficult, particularly in the absence of a reference genome sequence. Results An annotation-based, genome-wide SNP discovery pipeline is reported using NGS data for large and complex genomes without a reference genome sequence. Roche 454 shotgun reads with low genome coverage of one genotype are annotated in order to distinguish single-copy sequences and repeat junctions from repetitive sequences and sequences shared by paralogous genes. Multiple genome equivalents of shotgun reads of another genotype generated with SOLiD or Solexa are then mapped to the annotated Roche 454 reads to identify putative SNPs. A pipeline program package, AGSNP, was developed and used for genome-wide SNP discovery in Aegilops tauschii-the diploid source of the wheat D genome, and with a genome size of 4.02 Gb, of which 90% is repetitive sequences. Genomic DNA of Ae. tauschii accession AL8/78 was sequenced with the Roche 454 NGS platform. Genomic DNA and cDNA of Ae. tauschii accession AS75 was sequenced primarily with SOLiD, although some Solexa and Roche 454 genomic sequences were also generated. A total of 195,631 putative SNPs were discovered in gene sequences, 155,580 putative SNPs were discovered in uncharacterized single-copy regions, and another 145,907 putative SNPs were discovered in repeat junctions. These SNPs were dispersed across the entire Ae. tauschii genome. 
To assess the false positive SNP discovery rate, DNA containing putative SNPs was amplified by PCR from AL8/78 and AS75 and resequenced with the ABI 3730 xl. In a sample of 302 randomly selected putative SNPs, 84.0% in gene regions, 88.0% in repeat junctions, and 81.3% in uncharacterized regions were validated. Conclusion An annotation-based genome-wide SNP discovery pipeline for NGS platforms was developed. The pipeline is suitable for SNP discovery in genomic libraries of complex genomes and does not require a reference genome sequence. The pipeline is applicable to all current NGS platforms, provided that at least one such platform generates relatively long reads. The pipeline package, AGSNP, and the discovered 497,118 Ae. tauschii SNPs can be accessed at (http://avena.pw.usda.gov/wheatD/agsnp.shtml). PMID:21266061
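The SNP counts reported in the abstract above can be cross-checked with a few lines of arithmetic. The following is a minimal sketch, not part of the AGSNP package: the variable names are illustrative, the three category counts and the total are copied from the abstract, and the per-category shares are derived here rather than stated by the authors.

```python
# Putative SNP counts by sequence category, copied from the abstract above.
snp_counts = {
    "gene sequences": 195_631,
    "uncharacterized single-copy regions": 155_580,
    "repeat junctions": 145_907,
}
reported_total = 497_118  # total quoted in the abstract's conclusion

# The three categories should add up to the reported total.
total = sum(snp_counts.values())
print(f"sum of categories: {total:,}")
print(f"matches reported total: {total == reported_total}")

# Share of each category in the total (derived here, not stated in the abstract).
shares = {name: count / total for name, count in snp_counts.items()}
for name, share in shares.items():
    print(f"{name}: {share:.1%}")
```

The three categories sum to 497,118, which agrees with the total quoted in the conclusion.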
Single-cell RNA sequencing reveals an altered gene expression pattern as a result of CRISPR/cas9-mediated deletion of Gene 33/Mig6 and chronic exposure to hexavalent chromium in human lung epithelial cells.

PubMed

Park, Soyoung; Zhang, Xiaowen; Li, Cen; Yin, Changhong; Li, Jiangwei; Fallon, John T; Huang, Weihua; Xu, Dazhong

2017-09-01

Gene 33 (Mig6, ERRFI1) is an adaptor protein with multiple cellular functions. We recently reported that depletion of this protein promotes lung epithelial cell transformation induced by hexavalent chromium [Cr(VI)]. However, the early molecular events that mediate this process are not clear. In the present study, we used single-cell RNA sequencing to compare gene expression profiles between BEAS-2B lung epithelial cells chronically exposed to a sublethal dose of Cr(VI) with or without CRISPR/cas9-mediated deletion of Gene 33. Our data reveal 83 differentially expressed genes. The most notable changes are genes associated with cell adhesion, oxidative stresses, protein ubiquitination, epithelial-mesenchymal transition/metastasis, and WNT signaling. Up-regulation of some neuro-specific genes is also evident, particularly ubiquitin carboxyl-terminal hydrolase L1 (UCHL1), a deubiquitinase and potential biomarker for lung cancer. Gene 33 deletion and/or Cr(VI) exposure did not cause discernable changes in cell morphology. However, Gene 33 deletion led to a modest but significant reduction of cells in the G2/M phase of the cell cycle regardless of Cr(VI) exposure. Gene 33 deletion also significantly reduced cell proliferation. Interestingly, Cr(VI) exposure eliminated the difference in cell proliferation between the two genotypes. Gene 33 deletion also significantly elevated cell migration. Our data indicate that combined Gene 33 deletion and chronic Cr(VI) exposure produces a gene expression pattern and a phenotype that resemble those of the transformed lung epithelial cells. Given the known association of UCHL1 with lung cancer, we propose that UCHL1 is an important player in the early stage of lung epithelial cell transformation and tumorigenesis. Copyright © 2017 Elsevier Inc. All rights reserved.
Whole Genome Sequencing of Greater Amberjack (Seriola dumerili) for SNP Identification on Aligned Scaffolds and Genome Structural Variation Analysis Using Parallel Resequencing

PubMed Central

Aokic, Jun-ya; Kawase, Junya; Hamada, Kazuhisa; Fujimoto, Hiroshi; Yamamoto, Ikki; Usuki, Hironori

2018-01-01

Greater amberjack (Seriola dumerili) is distributed in tropical and temperate waters worldwide and is an important aquaculture fish. We carried out de novo sequencing of the greater amberjack genome to construct a reference genome sequence to identify single nucleotide polymorphisms (SNPs) for breeding amberjack by marker-assisted or gene-assisted selection as well as to identify functional genes for biological traits. We obtained 200 times coverage and constructed a high-quality genome assembly using next generation sequencing technology. The assembled sequences were aligned onto a yellowtail (Seriola quinqueradiata) radiation hybrid (RH) physical map by sequence homology. A total of 215 of the longest amberjack sequences, with a total length of 622.8 Mbp (92% of the total length of the genome scaffolds), were lined up on the yellowtail RH map. We resequenced the whole genomes of 20 greater amberjacks and mapped the resulting sequences onto the reference genome sequence. About 186,000 nonredundant SNPs were successfully ordered on the reference genome. Further, we found differences in the genome structural variations between two greater amberjack populations using BreakDancer. We also analyzed the greater amberjack transcriptome and mapped the annotated sequences onto the reference genome sequence. PMID:29785397

Rapid and accurate pyrosequencing of angiosperm plastid genomes

PubMed Central

Moore, Michael J; Dhingra, Amit; Soltis, Pamela S; Shaw, Regina; Farmerie, William G; Folta, Kevin M; Soltis, Douglas E

2006-01-01

Background Plastid genome sequence information is vital to several disciplines in plant biology, including phylogenetics and molecular biology. The past five years have witnessed a dramatic increase in the number of completely sequenced plastid genomes, fuelled largely by advances in conventional Sanger sequencing technology. Here we report a further significant reduction in time and cost for plastid genome sequencing through the successful use of a newly available pyrosequencing platform, the Genome Sequencer 20 (GS 20) System (454 Life Sciences Corporation), to rapidly and accurately sequence the whole plastid genomes of the basal eudicot angiosperms Nandina domestica (Berberidaceae) and Platanus occidentalis (Platanaceae). Results More than 99.75% of each plastid genome was simultaneously obtained during two GS 20 sequence runs, to an average depth of coverage of 24.6× in Nandina and 17.3× in Platanus. The Nandina and Platanus plastid genomes shared essentially identical gene complements and possessed the typical angiosperm plastid structure and gene arrangement. To assess the accuracy of the GS 20 sequence, over 45 kilobases of sequence were generated for each genome using conventional sequencing. Overall error rates of 0.043% and 0.031% were observed in GS 20 sequence for Nandina and Platanus, respectively. More than 97% of all observed errors were associated with homopolymer runs, with ~60% of all errors associated with homopolymer runs of 5 or more nucleotides and ~50% of all errors associated with regions of extensive homopolymer runs. No substitution errors were present in either genome. Error rates were generally higher in the single-copy and noncoding regions of both plastid genomes relative to the inverted repeat and coding regions. Conclusion Highly accurate and essentially complete sequence information was obtained for the Nandina and Platanus plastid genomes using the GS 20 System. 
More importantly, the high accuracy observed in the GS 20 plastid genome sequence was generated for a significant reduction in time and cost over traditional shotgun-based genome sequencing techniques, although with approximately half the coverage of previously reported GS 20 de novo genome sequence. The GS 20 should be broadly applicable to angiosperm plastid genome sequencing, and therefore promises to expand the scale of plant genetic and phylogenetic research dramatically. PMID:16934154
Bipolar localization of the group II intron Ll.LtrB is maintained in Escherichia coli deficient in nucleoid condensation, chromosome partitioning and DNA replication.

PubMed

Beauregard, Arthur; Chalamcharla, Venkata R; Piazza, Carol Lyn; Belfort, Marlene; Coros, Colin J

2006-11-01

Group II introns are mobile genetic elements that invade their cognate intron-minus alleles via an RNA intermediate, in a process known as retrohoming. They can also retrotranspose to ectopic sites at low frequency. In Escherichia coli, retrotransposition of the lactococcal group II intron, Ll.LtrB, occurs preferentially within the Ori and Ter macrodomains of the E. coli chromosome. These macrodomains migrate towards the poles of the cell, where the intron-encoded protein, LtrA, localizes. Here we investigate whether alteration of nucleoid condensation, chromosome partitioning and replication affect retrotransposition frequencies, as well as bipolar localization of the Ll.LtrB intron integration and LtrA distribution in E. coli. We thus examined these properties in the absence of the nucleoid-associated proteins H-NS, StpA and MukB, in variants of partitioning functions including the centromere-like sequence migS and the actin homologue MreB, as well as in the replication mutants DeltaoriC, seqA, tus and topoIV (ts). Although there were some dramatic fluctuations in retrotransposition levels in these hosts, bipolar localization of integration events was maintained. LtrA was consistently found in nucleoid-free regions, with its localization to the cellular poles being largely preserved in these hosts. Together, these results suggest that bipolar localization of group II intron retrotransposition results from the residence of the intron-encoded protein at the poles of the cell.
Genome Sequencing of Steroid Producing Bacteria Using Ion Torrent Technology and a Reference Genome.

PubMed

Sola-Landa, Alberto; Rodríguez-García, Antonio; Barreiro, Carlos; Pérez-Redondo, Rosario

2017-01-01

Next-generation sequencing technology has greatly eased bacterial genome sequencing, and several tens of thousands of genomes have been sequenced during the last 10 years. Most genome projects are published as draft versions; however, for certain applications the complete genome sequence is required. In this chapter, we describe the strategy that allowed the complete genome sequencing of Mycobacterium neoaurum NRRL B-3805, an industrial strain exploited for steroid production, using Ion Torrent sequencing reads and the genome of a close strain as the reference. This protocol can be applied to analyze the genetic variations between closely related strains; for example, to elucidate the point mutations between a parental strain and a random mutagenesis-derived mutant.
BioNano genome mapping of individual chromosomes supports physical mapping and sequence assembly in complex plant genomes.

PubMed

Staňková, Helena; Hastie, Alex R; Chan, Saki; Vrána, Jan; Tulpová, Zuzana; Kubaláková, Marie; Visendi, Paul; Hayashi, Satomi; Luo, Mingcheng; Batley, Jacqueline; Edwards, David; Doležel, Jaroslav; Šimková, Hana

2016-07-01

The assembly of a reference genome sequence of bread wheat is challenging due to its specific features such as the genome size of 17 Gbp, polyploid nature and prevalence of repetitive sequences. BAC-by-BAC sequencing based on chromosomal physical maps, adopted by the International Wheat Genome Sequencing Consortium as the key strategy, reduces problems caused by the genome complexity and polyploidy, but the repeat content still hampers the sequence assembly. Availability of a high-resolution genomic map to guide sequence scaffolding and validate physical map and sequence assemblies would be highly beneficial to obtaining an accurate and complete genome sequence. Here, we chose the short arm of chromosome 7D (7DS) as a model to demonstrate for the first time that it is possible to couple chromosome flow sorting with genome mapping in nanochannel arrays and create a de novo genome map of a wheat chromosome. We constructed a high-resolution chromosome map composed of 371 contigs with an N50 of 1.3 Mb. Long DNA molecules achieved by our approach facilitated chromosome-scale analysis of repetitive sequences and revealed a ~800-kb array of tandem repeats intractable to current DNA sequencing technologies. Anchoring 7DS sequence assemblies obtained by clone-by-clone sequencing to the 7DS genome map provided a valuable tool to improve the BAC-contig physical map and validate sequence assembly on a chromosome-arm scale. Our results indicate that creating genome maps for the whole wheat genome in a chromosome-by-chromosome manner is feasible and that they will be an affordable tool to support the production of improved pseudomolecules. © 2016 The Authors. Plant Biotechnology Journal published by Society for Experimental Biology and The Association of Applied Biologists and John Wiley & Sons Ltd.
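The N50 quoted in the abstract above (1.3 Mb for 371 genome-map contigs) is a standard contiguity statistic: the length of the contig at which the running total of lengths, taken from longest to shortest, first reaches half of the summed length. A minimal Python sketch of that calculation follows; the function name `n50` and the example lengths are hypothetical illustrations, not data or code from the study.

```python
def n50(lengths):
    """Return the N50 of a collection of contig lengths.

    N50 is the length L such that contigs of length >= L
    together cover at least half of the total assembly length.
    """
    if not lengths:
        raise ValueError("need at least one contig length")
    ordered = sorted(lengths, reverse=True)  # longest first
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length


# Hypothetical contig lengths (arbitrary units), not values from the study.
example_lengths = [100, 200, 300, 400, 500]
print(n50(example_lengths))  # 500 + 400 = 900 >= 750, so this prints 400
```

Sorting once and accumulating keeps the calculation linear after the sort, which is adequate for the few hundred contigs typical of a single chromosome-arm map.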
Whole-genome random sequencing and assembly of Haemophilus influenzae Rd

DOE Office of Scientific and Technical Information (OSTI.GOV)

Fleischmann, R.D.; Adams, M.D.; White, O.

1995-07-28

An approach for genome analysis based on sequencing and assembly of unselected pieces of DNA from the whole chromosome has been applied to obtain the complete nucleotide sequence (1,830,137 base pairs) of the genome from the bacterium Haemophilus influenzae Rd. This approach eliminates the need for initial mapping efforts and is therefore applicable to the vast array of microbial species for which genome maps are unavailable. The H. influenzae Rd genome sequence (Genome Sequence DataBase accession number L42023) represents the only complete genome sequence from a free-living organism. 46 refs., 4 figs., 4 tabs.
Fungal genome sequencing: basic biology to biotechnology.

PubMed

Sharma, Krishna Kant

2016-08-01

The genome sequences provide a first glimpse into the genomic basis of the biological diversity of filamentous fungi and yeast. The genome sequence of the budding yeast, Saccharomyces cerevisiae, with a small genome size, unicellular growth, and rich history of genetic and molecular analyses was a milestone of early genomics in the 1990s. The subsequent completion of fission yeast, Schizosaccharomyces pombe and genetic model, Neurospora crassa initiated a revolution in the genomics of the fungal kingdom. In due course of time, a substantial number of fungal genomes have been sequenced and publicly released, representing the widest sampling of genomes from any eukaryotic kingdom. An ambitious genome-sequencing program provides a wealth of data on metabolic diversity within the fungal kingdom, thereby enhancing research into medical science, agriculture science, ecology, bioremediation, bioenergy, and the biotechnology industry. Fungal genomics have higher potential to positively affect human health, environmental health, and the planet's stored energy. With a significant increase in sequenced fungal genomes, the known diversity of genes encoding organic acids, antibiotics, enzymes, and their pathways has increased exponentially. Currently, over a hundred fungal genome sequences are publicly available; however, no inclusive review has been published. This review is an initiative to address the significance of the fungal genome-sequencing program and provides the road map for basic and applied research.
Genome Improvement at JGI-HAGSC

DOE Office of Scientific and Technical Information (OSTI.GOV)

Grimwood, Jane; Schmutz, Jeremy J.; Myers, Richard M.

Since the completion of the sequencing of the human genome, the Joint Genome Institute (JGI) has rapidly expanded its scientific goals in several DOE mission-relevant areas. At the JGI-HAGSC, we have kept pace with this rapid expansion of projects with our focus on assessing, assembling, improving and finishing eukaryotic whole genome shotgun (WGS) projects for which the shotgun sequence is generated at the Production Genomic Facility (JGI-PGF). We follow this by combining the draft WGS with genomic resources generated at JGI-HAGSC or in collaborator laboratories (including BAC end sequences, genetic maps and FLcDNA sequences) to produce an improved draft sequence. For eukaryotic genomes important to the DOE mission, we then add further information from directed experiments to produce reference genomic sequences that are publicly available for any scientific researcher. Also, we have continued our program for producing BAC-based finished sequence, both for adding information to JGI genome projects and for small BAC-based sequencing projects proposed through any of the JGI sequencing programs. We have now built our computational expertise in WGS assembly and analysis and have moved eukaryotic genome assembly from the JGI-PGF to JGI-HAGSC. We have concentrated our assembly development work on large plant genomes and complex fungal and algal genomes.
Using Partial Genomic Fosmid Libraries for Sequencing Complete Organellar Genomes

DOE Office of Scientific and Technical Information (OSTI.GOV)

McNeal, Joel R.; Leebens-Mack, James H.; Arumuganathan, K.

2005-08-26

Organellar genome sequences provide numerous phylogenetic markers and yield insight into organellar function and molecular evolution. These genomes are much smaller in size than their nuclear counterparts; thus, their complete sequencing is much less expensive than total nuclear genome sequencing, making broader phylogenetic sampling feasible. However, for some organisms it is challenging to isolate plastid DNA for sequencing using standard methods. To overcome these difficulties, we constructed partial genomic libraries from total DNA preparations of two heterotrophic and two autotrophic angiosperm species using fosmid vectors. We then used macroarray screening to isolate clones containing large fragments of plastid DNA. A minimum tiling path of clones comprising the entire genome sequence of each plastid was selected, and these clones were shotgun-sequenced and assembled into complete genomes. Although this method worked well for both heterotrophic and autotrophic plants, nuclear genome size had a dramatic effect on the proportion of screened clones containing plastid DNA and, consequently, the overall number of clones that must be screened to ensure full plastid genome coverage. This technique makes it possible to determine complete plastid genome sequences for organisms that defy other available organellar genome sequencing methods, especially those for which limited amounts of tissue are available.
Reference-quality genome sequence of Aegilops tauschii, the source of wheat D genome, shows that recombination shapes genome structure and evolution

USDA-ARS's Scientific Manuscript database

Aegilops tauschii is the diploid progenitor of the D genome of hexaploid wheat and an important genetic resource for wheat. A reference-quality sequence for the Ae. tauschii genome was produced with a combination of ordered-clone sequencing, whole-genome shotgun sequencing, and BioNano optical geno...
Identification of a precursor genomic segment that provided a sequence unique to glycophorin B and E genes

DOE Office of Scientific and Technical Information (OSTI.GOV)

Onda, M.; Kudo, S.; Fukuda, M.

Human glycophorin A, B, and E (GPA, GPB, and GPE) genes belong to a gene family located at the long arm of chromosome 4. These three genes are homologous from the 5'-flanking sequence to the Alu sequence, which is 1 kb downstream from the exon encoding the transmembrane domain. Analysis of the Alu sequence and flanking direct repeat sequences suggested that the GPA gene most closely resembles the ancestral gene, whereas the GPB and GPE genes arose by homologous recombination within the Alu sequence, acquiring 3' sequences from an unrelated precursor genomic segment. Here the authors describe the identification of this putative precursor genomic segment. A human genomic library was screened by using the sequence of the 3' region of the GPB gene as a probe. The genomic clones isolated were found to contain an Alu sequence that appeared to be involved in the recombination. Downstream from the Alu sequence, the nucleotide sequence of the precursor genomic segment is almost identical to that of the GPB or GPE gene. In contrast, the upstream sequence of the genomic segment differs entirely from that of the GPA, GPB, and GPE genes. Conservation of the direct repeats flanking the Alu sequence of the genomic segment strongly suggests that the sequence of this genomic segment has been maintained during evolution. This identified genomic segment was found to reside downstream from the GPA gene by both gene mapping and in situ chromosomal localization. The precursor genomic segment was also identified in the orangutan genome, which is known to lack GPB and GPE genes. These results indicate that one of the duplicated ancestral glycophorin genes acquired a unique 3' sequence by unequal crossing-over through its Alu sequence and the further downstream Alu sequence present in the duplicated gene. Further duplication and divergence of this gene yielded the GPB and GPE genes. 37 refs., 5 figs.
in silico Whole Genome Sequencer & Analyzer (iWGS): A Computational Pipeline to Guide the Design and Analysis of de novo Genome Sequencing Studies

DOE PAGES

Zhou, Xiaofan; Peris, David; Kominek, Jacek; ...

2016-09-16

The availability of genomes across the tree of life is highly biased toward vertebrates, pathogens, human disease models, and organisms with relatively small and simple genomes. Recent progress in genomics has enabled the de novo decoding of the genome of virtually any organism, greatly expanding its potential for understanding the biology and evolution of the full spectrum of biodiversity. The increasing diversity of sequencing technologies, assays, and de novo assembly algorithms have augmented the complexity of de novo genome sequencing projects in nonmodel organisms. To reduce the costs and challenges in de novo genome sequencing projects and streamline their experimental design and analysis, we developed iWGS (in silico Whole Genome Sequencer and Analyzer), an automated pipeline for guiding the choice of appropriate sequencing strategy and assembly protocols. iWGS seamlessly integrates the four key steps of a de novo genome sequencing project: data generation (through simulation), data quality control, de novo assembly, and assembly evaluation and validation. The last three steps can also be applied to the analysis of real data. iWGS is designed to enable the user to have great flexibility in testing the range of experimental designs available for genome sequencing projects, and supports all major sequencing technologies and popular assembly tools. Three case studies illustrate how iWGS can guide the design of de novo genome sequencing projects, and evaluate the performance of a wide variety of user-specified sequencing strategies and assembly protocols on genomes of differing architectures. iWGS, along with a detailed documentation, is freely available at https://github.com/zhouxiaofan1983/iWGS.
Visualization of genome signatures of eukaryote genomes by batch-learning self-organizing map with a special emphasis on Drosophila genomes.

PubMed

Abe, Takashi; Hamano, Yuta; Ikemura, Toshimichi

2014-01-01

A strategy of evolutionary studies that can compare vast numbers of genome sequences is becoming increasingly important with the remarkable progress of high-throughput DNA sequencing methods. We previously established a sequence alignment-free clustering method "BLSOM" for di-, tri-, and tetranucleotide compositions in genome sequences, which can characterize sequence characteristics (genome signatures) of a wide range of species. In the present study, we generated BLSOMs for tetra- and pentanucleotide compositions in approximately one million sequence fragments derived from 101 eukaryotes, for which almost complete genome sequences were available. BLSOM recognized phylotype-specific characteristics (e.g., key combinations of oligonucleotide frequencies) in the genome sequences, permitting phylotype-specific clustering of the sequences without any information regarding the species. In our detailed examination of 12 Drosophila species, the correlation between their phylogenetic classification and the classification on the BLSOMs was observed to visualize oligonucleotides diagnostic for species-specific clustering.
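The "genome signature" in the abstract above is built from oligonucleotide compositions of sequence fragments. As a rough illustration, the sketch below computes a tetranucleotide frequency vector for one fragment in Python. It covers only this composition step, not the self-organizing map itself, and the function name `kmer_frequencies` and the example fragment are assumptions made here for illustration, not part of BLSOM.

```python
from collections import Counter
from itertools import product


def kmer_frequencies(sequence, k=4):
    """Return relative frequencies of all 4**k A/C/G/T k-mers in a fragment.

    Windows containing other characters (for example N) are ignored.
    """
    seq = sequence.upper()
    counts = Counter(seq[i:i + k] for i in range(len(seq) - k + 1))
    kmers = ["".join(p) for p in product("ACGT", repeat=k)]
    total = sum(counts[kmer] for kmer in kmers)
    if total == 0:
        return {kmer: 0.0 for kmer in kmers}
    return {kmer: counts[kmer] / total for kmer in kmers}


# Hypothetical fragment, not taken from any genome in the study.
freqs = kmer_frequencies("ACGTACGT", k=4)
print(len(freqs))     # 256 possible tetranucleotides
print(freqs["ACGT"])  # 2 of the 5 windows are ACGT, so 0.4
```

Each fragment then becomes a fixed-length numeric vector (256 values for tetranucleotides), which is the kind of input an alignment-free clustering method can compare across species.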
The diploid genome sequence of an Asian individual

PubMed Central

Wang, Jun; Wang, Wei; Li, Ruiqiang; Li, Yingrui; Tian, Geng; Goodman, Laurie; Fan, Wei; Zhang, Junqing; Li, Jun; Zhang, Juanbin; Guo, Yiran; Feng, Binxiao; Li, Heng; Lu, Yao; Fang, Xiaodong; Liang, Huiqing; Du, Zhenglin; Li, Dong; Zhao, Yiqing; Hu, Yujie; Yang, Zhenzhen; Zheng, Hancheng; Hellmann, Ines; Inouye, Michael; Pool, John; Yi, Xin; Zhao, Jing; Duan, Jinjie; Zhou, Yan; Qin, Junjie; Ma, Lijia; Li, Guoqing; Yang, Zhentao; Zhang, Guojie; Yang, Bin; Yu, Chang; Liang, Fang; Li, Wenjie; Li, Shaochuan; Li, Dawei; Ni, Peixiang; Ruan, Jue; Li, Qibin; Zhu, Hongmei; Liu, Dongyuan; Lu, Zhike; Li, Ning; Guo, Guangwu; Zhang, Jianguo; Ye, Jia; Fang, Lin; Hao, Qin; Chen, Quan; Liang, Yu; Su, Yeyang; san, A.; Ping, Cuo; Yang, Shuang; Chen, Fang; Li, Li; Zhou, Ke; Zheng, Hongkun; Ren, Yuanyuan; Yang, Ling; Gao, Yang; Yang, Guohua; Li, Zhuo; Feng, Xiaoli; Kristiansen, Karsten; Wong, Gane Ka-Shu; Nielsen, Rasmus; Durbin, Richard; Bolund, Lars; Zhang, Xiuqing; Li, Songgang; Yang, Huanming; Wang, Jian

2009-01-01

Here we present the first diploid genome sequence of an Asian individual. The genome was sequenced to 36-fold average coverage using massively parallel sequencing technology. We aligned the short reads onto the NCBI human reference genome to 99.97% coverage, and guided by the reference genome, we used uniquely mapped reads to assemble a high-quality consensus sequence for 92% of the Asian individual's genome. We identified approximately 3 million single-nucleotide polymorphisms (SNPs) inside this region, of which 13.6% were not in the dbSNP database. Genotyping analysis showed that SNP identification had high accuracy and consistency, indicating the high sequence quality of this assembly. We also carried out heterozygote phasing and haplotype prediction against HapMap CHB and JPT haplotypes (Chinese and Japanese, respectively), sequence comparison with the two available individual genomes (J. D. Watson and J. C. Venter), and structural variation identification. These variations were considered for their potential biological impact. Our sequence data and analyses demonstrate the potential usefulness of next-generation sequencing technologies for personal genomics. PMID:18987735
Snake Genome Sequencing: Results and Future Prospects

PubMed Central

Kerkkamp, Harald M. I.; Kini, R. Manjunatha; Pospelov, Alexey S.; Vonk, Freek J.; Henkel, Christiaan V.; Richardson, Michael K.

2016-01-01

Snake genome sequencing is in its infancy—very much behind the progress made in sequencing the genomes of humans, model organisms and pathogens relevant to biomedical research, and agricultural species. We provide here an overview of some of the snake genome projects in progress, and discuss the biological findings, with special emphasis on toxinology, from the small number of draft snake genomes already published. We discuss the future of snake genomics, pointing out that new sequencing technologies will help overcome the problem of repetitive sequences in assembling snake genomes. Genome sequences are also likely to be valuable in examining the clustering of toxin genes on the chromosomes, in designing recombinant antivenoms and in studying the epigenetic regulation of toxin gene expression. PMID:27916957
Prevalence of CMV infection among staff in a metropolitan children’s hospital – occupational health screening findings

PubMed Central

Stranzinger, Johanna; Kindel, Jutta; Henning, Melanie; Wendeler, Dana; Nienhaus, Albert

2016-01-01

Background: Staff in children’s hospitals may run an increased risk of cytomegalovirus (CMV) contact infection leading to a congenital CMV fetopathy during pregnancy. The main risk factor is close contact with inapparent carriers of CMV among infants (<3 years). We therefore examined CMV seroprevalence (SP) and possible risk factors for CMV infection among staff at a children’s hospital. Method: In 2014, staff at a metropolitan children’s hospital were offered a CMV antibody test in the context of occupational health screening. In addition to anti-CMV immunoglobulin G (anti-CMV IgG), gender, age, profession, number of children and migration background were assessed and used as independent variables in multiple logistic regression. Women without a migration background (MIG) were considered as a separate group. Results: The study included 219 employees. Women showed a significantly higher risk than men of being CMV-positive (adjusted odds ratio [aOR] 3.0; 95% CI 1.1–7.8). The risk among age groups of 30 and over was double that of the under-30s (aOR 2.0; 95% CI 1.0–3.9); among those aged 40-plus it was aOR 2.3 (95% CI 1.1–4.7). Staff with an MIG tested positive more often than those without an MIG (95.5% versus 45.7%). CMV SP was 47.7% among women without an MIG. In this subgroup the probability of CMV infection increased with age (p=0.08) as well. Conclusion: In the staff group as a whole there was a significant correlation between CMV SP, country of origin and age. We found no significant differences between occupational groups; perhaps our random sample was too small. Given the low CMV SP particularly in those without MIG, women who want to have children in particular must be protected from CMV infection. Follow-up studies should be undertaken to test whether good workplace hygiene offers sufficient protection for pregnant women and could be an alternative to prohibiting certain activities. PMID:27730028
Tanacetum parthenium and Salix alba (Mig-RL) combination in migraine prophylaxis: a prospective, open-label study.

PubMed

Shrivastava, R; Pechadre, J C; John, G W

2006-01-01

Tanacetum parthenium (feverfew) has been used traditionally to treat migraine, and although its mechanism of action is not fully understood, serotonin 5-HT receptor blocking effects have been suggested. T. parthenium and Salix alba (white willow) either alone or in combination (Mig-RL) were recently shown to inhibit binding to 5-HT(2A/2C) receptors; T. parthenium failed to recognise 5-HT(1D) receptors, whereas S. alba or the combination did. It was hypothesised that S. alba in combination with T. parthenium may provide superior migraine prophylactic activity compared with T. parthenium alone. A prospective, open-label study was performed in 12 patients diagnosed with migraine without aura. Twelve weeks' treatment with T. parthenium 300 mg plus S. alba 300 mg (Mig-RL) twice daily was administered to determine the effects of therapy on migraine attack frequency (primary efficacy criterion), intensity and duration (secondary efficacy criteria), and quality of life, together with tolerability for patients. Attack frequency was reduced by 57.2% at 6 weeks (p < 0.029) and by 61.7% at 12 weeks (p < 0.025) in nine of ten patients, with 70% of patients having a reduction of at least 50%. Attack intensity was reduced by 38.7% at 6 weeks (p < 0.005) and by 62.6% at 12 weeks (p < 0.004) in ten of ten patients, with 70% of patients having a reduction of at least 50%. Attack duration decreased by 67.2% at 6 weeks (p < 0.001) and by 76.2% at 12 weeks (p < 0.001) in ten of ten patients. Two patients were excluded for reasons unrelated to treatment. Self-assessed general health, physical performance, memory and anxiety also improved by the end of the study. Mig-RL treatment was well tolerated and no adverse events occurred. The remarkable efficacy of Mig-RL in not only reducing the frequency of migraine attacks but also their pain intensity and duration in this trial warrants further investigation of this therapy in a double-blind, randomised, placebo-controlled investigation involving a larger patient population.
Sequencing and comparative genomic analysis of 1227 Felis catus cDNA sequences enriched for developmental, clinical and nutritional phenotypes

PubMed Central

2012-01-01

Background: The feline genome is valuable to the veterinary and model organism genomics communities because the cat is an obligate carnivore and a model for endangered felids. The initial public release of the Felis catus genome assembly provided a framework for investigating the genomic basis of feline biology. However, the entire set of protein coding genes has not been elucidated. Results: We identified and characterized 1227 protein coding feline sequences, of which 913 map to public sequences and 314 are novel. These sequences have been deposited into NCBI's GenBank database and complement public genomic resources by providing additional protein coding sequences that fill in some of the gaps in the feline genome assembly. Through functional and comparative genomic analyses, we gained an understanding of the role of these sequences in feline development, nutrition and health. Specifically, we identified 104 orthologs of human genes associated with Mendelian disorders. We detected negative selection within sequences with gene ontology annotations associated with intracellular trafficking, cytoskeleton and muscle functions. We detected relatively less negative selection on protein sequences encoding extracellular networks, apoptotic pathways and mitochondrial gene ontology annotations. Additionally, we characterized feline cDNA sequences that have mouse orthologs associated with clinical, nutritional and developmental phenotypes. Together, this analysis provides an overview of the value of our cDNA sequences and enhances our understanding of how the feline genome is similar to, and different from, other mammalian genomes. Conclusions: The cDNA sequences reported here expand existing feline genomic resources by providing high-quality sequences annotated with comparative genomic information providing functional, clinical, nutritional and orthologous gene information. PMID:22257742
Whole Genome Complete Resequencing of Bacillus subtilis Natto by Combining Long Reads with High-Quality Short Reads

PubMed Central

Kamada, Mayumi; Hase, Sumitaka; Sato, Kengo; Toyoda, Atsushi; Fujiyama, Asao; Sakakibara, Yasubumi

2014-01-01

De novo microbial genome sequencing reached a turning point with third-generation sequencing (TGS) platforms, and several microbial genomes have been improved by TGS long reads. Bacillus subtilis natto is closely related to the laboratory standard strain B. subtilis Marburg 168, and it has a function in the production of the traditional Japanese fermented food “natto.” The B. subtilis natto BEST195 genome was previously sequenced with short reads, but it included some incomplete regions. We resequenced the BEST195 genome using a PacBio RS sequencer, and we successfully obtained a complete genome sequence from one scaffold without any gaps, and we also applied Illumina MiSeq short reads to enhance quality. Compared with the previous BEST195 draft genome and Marburg 168 genome, we found that incomplete regions in the previous genome sequence were attributed to GC-bias and repetitive sequences, and we also identified some novel genes that are found only in the new genome. PMID:25329997

Sequencing intractable DNA to close microbial genomes.

PubMed

Hurt, Richard A; Brown, Steven D; Podar, Mircea; Palumbo, Anthony V; Elias, Dwayne A

2012-01-01

Advances in high-throughput DNA sequencing technologies have supported a rapid proliferation of microbial genome sequencing projects, providing the genetic blueprint for in-depth studies. Oftentimes, difficult-to-sequence regions in microbial genomes are ruled "intractable", resulting in a growing number of genomes with sequence gaps deposited in databases. A procedure was developed to sequence such problematic regions in the "non-contiguous finished" Desulfovibrio desulfuricans ND132 genome (6 intractable gaps) and the Desulfovibrio africanus genome (1 intractable gap). The polynucleotides surrounding each gap formed GC-rich secondary structures, making the regions refractory to amplification and sequencing. Strand-displacing DNA polymerases used in concert with a novel ramped PCR extension cycle supported amplification and closure of all gap regions in both genomes. The developed procedures support accurate gene annotation, and provide a step-wise method that reduces the effort required for genome finishing.
First complete genome sequence of infectious laryngotracheitis virus

PubMed Central

2011-01-01

Background: Infectious laryngotracheitis virus (ILTV) is an alphaherpesvirus that causes acute respiratory disease in chickens worldwide. To date, only one complete genomic sequence of ILTV has been reported. This sequence was generated by concatenating partial sequences from six different ILTV strains. Thus, the full genomic sequence of a single (individual) strain of ILTV has not been determined previously. This study aimed to use high throughput sequencing technology to determine the complete genomic sequence of a live attenuated vaccine strain of ILTV. Results: The complete genomic sequence of the Serva vaccine strain of ILTV was determined, annotated and compared to the concatenated ILTV reference sequence. The genome size of the Serva strain was 152,628 bp, with a G + C content of 48%. A total of 80 predicted open reading frames were identified. The Serva strain had 96.5% DNA sequence identity with the concatenated ILTV sequence. Notably, the concatenated ILTV sequence was found to lack four large regions of sequence, including 528 bp and 594 bp of sequence in the UL29 and UL36 genes, respectively, and two copies of a 1,563 bp sequence in the repeat regions. Considerable differences in the size of the predicted translation products of 4 other genes (UL54, UL30, UL37 and UL38) were also identified. More than 530 single-nucleotide polymorphisms (SNPs) were identified. Most SNPs were located within three genomic regions, corresponding to sequence from the SA-2 ILTV vaccine strain in the concatenated ILTV sequence. Conclusions: This is the first complete genomic sequence of an individual ILTV strain. This sequence will facilitate future comparative genomic studies of ILTV by providing an appropriate reference sequence for the sequence analysis of other ILTV strains. PMID:21501528
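The G + C content reported above (48% for the Serva strain) is the fraction of bases in a sequence that are guanine or cytosine. A minimal Python sketch of that calculation is given below; the function name `gc_content` and the short example sequence are hypothetical, and ambiguous bases such as N are excluded from the denominator as a simplifying assumption.

```python
def gc_content(sequence):
    """Return the G + C fraction of a DNA sequence.

    Only unambiguous bases (A, C, G, T) are counted in the denominator.
    """
    seq = sequence.upper()
    called = sum(seq.count(base) for base in "ACGT")
    if called == 0:
        raise ValueError("sequence contains no A/C/G/T bases")
    return (seq.count("G") + seq.count("C")) / called


# Hypothetical sequence, not part of the ILTV genome.
print(gc_content("ATGCGC"))  # 4 of 6 bases are G or C
```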
Insights from 20 years of bacterial genome sequencing

DOE PAGES

Land, Miriam L.; Hauser, Loren; Jun, Se-Ran; ...

2015-02-27

Since the first two complete bacterial genome sequences were published in 1995, the science of bacteria has dramatically changed. Using third-generation DNA sequencing, it is possible to completely sequence a bacterial genome in a few hours and identify some types of methylation sites along the genome as well. Sequencing of bacterial genome sequences is now a standard procedure, and the information from tens of thousands of bacterial genomes has had a major impact on our views of the bacterial world. In this review, we explore a series of questions to highlight some insights that comparative genomics has produced. To date, there are genome sequences available from 50 different bacterial phyla and 11 different archaeal phyla. However, the distribution is quite skewed towards a few phyla that contain model organisms. But the breadth is continuing to improve, with projects dedicated to filling in less characterized taxonomic groups. The clustered regularly interspaced short palindromic repeats (CRISPR)-Cas system provides bacteria with immunity against viruses, which outnumber bacteria by tenfold. How fast can we go? Second-generation sequencing has produced a large number of draft genomes (close to 90% of bacterial genomes in GenBank are currently not complete); third-generation sequencing can potentially produce a finished genome in a few hours, and at the same time provide methylation sites along the entire chromosome. The diversity of bacterial communities is extensive as is evident from the genome sequences available from 50 different bacterial phyla and 11 different archaeal phyla. Genome sequencing can help in classifying an organism, and in the case where multiple genomes of the same species are available, it is possible to calculate the pan- and core genomes; comparison of more than 2000 Escherichia coli genomes finds an E. coli core genome of about 3100 gene families and a total of about 89,000 different gene families.
Why do we care about bacterial genome sequencing? There are many practical applications, such as genome-scale metabolic modeling, biosurveillance, bioforensics, and infectious disease epidemiology. In the near future, high-throughput sequencing of patient metagenomic samples could revolutionize medicine in terms of speed and accuracy of finding pathogens and knowing how to treat them.
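The pan- and core genome calculation mentioned in this abstract can be illustrated with plain set operations: the pan-genome is the union of the gene families found in any genome, and the core genome is the intersection of the families found in every genome. The sketch below is a minimal illustration only; the strain names and gene family identifiers are hypothetical, and it assumes genes have already been clustered into families, which the abstract does not describe.

```python
def pan_and_core(genomes):
    """Return (pan, core) gene-family sets for a dict of genome -> families.

    `genomes` maps a genome name to an iterable of gene-family identifiers.
    Clustering genes into families is assumed to have been done upstream.
    """
    if not genomes:
        raise ValueError("need at least one genome")
    families = [set(f) for f in genomes.values()]
    pan = set().union(*families)          # families seen in any genome
    core = set.intersection(*families)    # families seen in every genome
    return pan, core


# Hypothetical toy input: three strains, five gene families in total.
toy_genomes = {
    "strain_A": ["fam1", "fam2", "fam3"],
    "strain_B": ["fam1", "fam2", "fam4"],
    "strain_C": ["fam1", "fam5"],
}

pan, core = pan_and_core(toy_genomes)
print(len(pan), sorted(core))  # 5 ['fam1']
```

With real data the same union and intersection would be taken over thousands of genomes, which is why the core shrinks and the pan-genome grows as more genomes are added.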
Insights from 20 years of bacterial genome sequencing

DOE Office of Scientific and Technical Information (OSTI.GOV)

Land, Miriam L.; Hauser, Loren; Jun, Se-Ran

Since the first two complete bacterial genome sequences were published in 1995, the science of bacteria has dramatically changed. Using third-generation DNA sequencing, it is possible to completely sequence a bacterial genome in a few hours and identify some types of methylation sites along the genome as well. Sequencing of bacterial genome sequences is now a standard procedure, and the information from tens of thousands of bacterial genomes has had a major impact on our views of the bacterial world. In this review, we explore a series of questions to highlight some insights that comparative genomics has produced. To date, there are genome sequences available from 50 different bacterial phyla and 11 different archaeal phyla. However, the distribution is quite skewed towards a few phyla that contain model organisms. But the breadth is continuing to improve, with projects dedicated to filling in less characterized taxonomic groups. The clustered regularly interspaced short palindromic repeats (CRISPR)-Cas system provides bacteria with immunity against viruses, which outnumber bacteria by tenfold. How fast can we go? Second-generation sequencing has produced a large number of draft genomes (close to 90% of bacterial genomes in GenBank are currently not complete); third-generation sequencing can potentially produce a finished genome in a few hours, and at the same time provide methylation sites along the entire chromosome. The diversity of bacterial communities is extensive as is evident from the genome sequences available from 50 different bacterial phyla and 11 different archaeal phyla. Genome sequencing can help in classifying an organism, and in the case where multiple genomes of the same species are available, it is possible to calculate the pan- and core genomes; comparison of more than 2000 Escherichia coli genomes finds an E. coli core genome of about 3100 gene families and a total of about 89,000 different gene families.
Why do we care about bacterial genome sequencing? There are many practical applications, such as genome-scale metabolic modeling, biosurveillance, bioforensics, and infectious disease epidemiology. In the near future, high-throughput sequencing of patient metagenomic samples could revolutionize medicine in terms of speed and accuracy of finding pathogens and knowing how to treat them.
RefSeq microbial genomes database: new representation and annotation strategy.

PubMed

Tatusova, Tatiana; Ciufo, Stacy; Fedorov, Boris; O'Neill, Kathleen; Tolstoy, Igor

2014-01-01

The source of the microbial genomic sequences in the RefSeq collection is the set of primary sequence records submitted to the International Nucleotide Sequence Database public archives. These can be accessed through the Entrez search and retrieval system at http://www.ncbi.nlm.nih.gov/genome. Next-generation sequencing has enabled researchers to perform genomic sequencing at rates that were unimaginable in the past. Microbial genomes can now be sequenced in a matter of hours, which has led to a significant increase in the number of assembled genomes deposited in the public archives. This huge increase in DNA sequence data presents new challenges for the annotation, analysis and visualization bioinformatics tools. New strategies have been developed for the annotation and representation of reference genomes and sequence variations derived from population studies and clinical outbreaks.
Gene calling and bacterial genome annotation with BG7.

PubMed

Tobes, Raquel; Pareja-Tobes, Pablo; Manrique, Marina; Pareja-Tobes, Eduardo; Kovach, Evdokim; Alekhin, Alexey; Pareja, Eduardo

2015-01-01

New massive sequencing technologies are providing many bacterial genome sequences from diverse taxa but a refined annotation of these genomes is crucial for obtaining scientific findings and new knowledge. Thus, bacterial genome annotation has emerged as a key point to investigate in bacteria. Any efficient tool designed specifically to annotate bacterial genomes sequenced with massively parallel technologies has to consider the specific features of bacterial genomes (absence of introns and scarcity of nonprotein-coding sequence) and of next-generation sequencing (NGS) technologies (presence of errors and not perfectly assembled genomes). These features make it convenient to focus on coding regions and, hence, on protein sequences that are the elements directly related with biological functions. In this chapter we describe how to annotate bacterial genomes with BG7, an open-source tool based on a protein-centered gene calling/annotation paradigm. BG7 is specifically designed for the annotation of bacterial genomes sequenced with NGS. This tool is tolerant of sequence errors and retains its capabilities when annotating highly fragmented genomes or mixed sequences coming from several genomes (such as those obtained from metagenomic samples). BG7 has been designed with scalability as a requirement, with a computing infrastructure completely based on cloud computing (Amazon Web Services).
PCR Amplification Strategies towards full-length HIV-1 Genome sequencing.

PubMed

Liu, Chao Chun; Ji, Hezhao

2018-06-26

The advent of next generation sequencing has enabled greater resolution of viral diversity and improved feasibility of full viral genome sequencing allowing routine HIV-1 full genome sequencing in both research and diagnostic settings. Regardless of the sequencing platform selected, successful PCR amplification of the HIV-1 genome is essential for sequencing template preparation. As such, full HIV-1 genome amplification is a crucial step in dictating the successful and reliable sequencing downstream. Here we reviewed existing PCR protocols leading to HIV-1 full genome sequencing. In addition to the discussion on basic considerations on relevant PCR design, the advantages as well as the pitfalls of published protocols were reviewed.
Whole genome sequence analysis of BT-474 using Complete Genomics' standard and long fragment read technologies.

PubMed

Ciotlos, Serban; Mao, Qing; Zhang, Rebecca Yu; Li, Zhenyu; Chin, Robert; Gulbahce, Natali; Liu, Sophie Jia; Drmanac, Radoje; Peters, Brock A

2016-01-01

The cell line BT-474 is a popular cell line for studying the biology of cancer and developing novel drugs. However, there is no complete, published genome sequence for this highly utilized scientific resource. In this study we sought to provide a comprehensive and useful data set for the scientific community by generating a whole genome sequence for BT-474. Five μg of genomic DNA, isolated from an early passage of the BT-474 cell line, was used to generate a whole genome sequence (114X coverage) using Complete Genomics' standard sequencing process. To provide additional variant phasing and structural variation data we also processed and analyzed two separate libraries of 5 and 6 individual cells to depths of 99X and 87X, respectively, using Complete Genomics' Long Fragment Read (LFR) technology. BT-474 is a highly aneuploid cell line with an extremely complex genome sequence. This ~300X total coverage genome sequence provides a more complete understanding of this highly utilized cell line at the genomic level.
Highly effective sequencing whole chloroplast genomes of angiosperms by nine novel universal primer pairs.

PubMed

Yang, Jun-Bo; Li, De-Zhu; Li, Hong-Tao

2014-09-01

Chloroplast genomes supply indispensable information that helps improve phylogenetic resolution and can even serve as organelle-scale barcodes. Next-generation sequencing technologies have helped promote sequencing of complete chloroplast genomes, but compared with the number of angiosperms, relatively few chloroplast genomes have been sequenced. There are two major reasons for the paucity of completely sequenced chloroplast genomes: (i) massive amounts of fresh leaves are needed for chloroplast sequencing and (ii) there are considerable gaps in the sequenced chloroplast genomes of many plants because of the difficulty of isolating high-quality chloroplast DNA, preventing complete chloroplast genomes from being assembled. To overcome these obstacles, all known angiosperm chloroplast genomes available to date were analysed, and then we designed nine universal primer pairs corresponding to the highly conserved regions. Using these primers, angiosperm whole chloroplast genomes can be amplified using long-range PCR and sequenced using next-generation sequencing methods. The primers showed high universality, which was tested using 24 species representing major clades of angiosperms. To validate the functionality of the primers, eight species representing major groups of angiosperms, that is, early-diverging angiosperms, magnoliids, monocots, Saxifragales, fabids, malvids and asterids, were sequenced and their complete chloroplast genomes assembled. In our trials, only 100 mg of fresh leaves was used. The results show that the universal primer set provided an easy, effective and feasible approach for sequencing whole chloroplast genomes in angiosperms. The designed universal primer pairs provide a possibility to accelerate genome-scale data acquisition and will therefore improve phylogenetic resolution and species identification in angiosperms.
Sequencing and assembly of the 22-gb loblolly pine genome.

PubMed

Zimin, Aleksey; Stevens, Kristian A; Crepeau, Marc W; Holtz-Morris, Ann; Koriabine, Maxim; Marçais, Guillaume; Puiu, Daniela; Roberts, Michael; Wegrzyn, Jill L; de Jong, Pieter J; Neale, David B; Salzberg, Steven L; Yorke, James A; Langley, Charles H

2014-03-01

Conifers are the predominant gymnosperm. The size and complexity of their genomes has presented formidable technical challenges for whole-genome shotgun sequencing and assembly. We employed novel strategies that allowed us to determine the loblolly pine (Pinus taeda) reference genome sequence, the largest genome assembled to date. Most of the sequence data were derived from whole-genome shotgun sequencing of a single megagametophyte, the haploid tissue of a single pine seed. Although that constrained the quantity of available DNA, the resulting haploid sequence data were well-suited for assembly. The haploid sequence was augmented with multiple linking long-fragment mate pair libraries from the parental diploid DNA. For the longest fragments, we used novel fosmid DiTag libraries. Sequences from the linking libraries that did not match the megagametophyte were identified and removed. Assembly of the sequence data was aided by condensing the enormous number of paired-end reads into a much smaller set of longer "super-reads," rendering subsequent assembly with an overlap-based assembly algorithm computationally feasible. To further improve the contiguity and biological utility of the genome sequence, additional scaffolding methods utilizing independent genome and transcriptome assemblies were implemented. The combination of these strategies resulted in a draft genome sequence of 20.15 billion bases, with an N50 scaffold size of 66.9 kbp.
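The N50 scaffold size quoted in this record is a standard contiguity statistic: the length L such that scaffolds of length L or longer together contain at least half of the assembled bases. A minimal sketch of the calculation follows; the scaffold lengths are made-up illustrative values, not data from the loblolly pine assembly.

```python
def n50(lengths):
    """Return the N50 of a collection of contig or scaffold lengths.

    Sort lengths from longest to shortest and walk down the list until the
    running total reaches half of the total assembled length; the length at
    which that happens is the N50.
    """
    if not lengths:
        raise ValueError("need at least one length")
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length


# Hypothetical scaffold lengths in kbp (total 300, so the half-way mark is 150).
example_lengths = [80, 70, 50, 40, 30, 20, 10]
print(n50(example_lengths))  # 70
```

Because N50 depends only on the length distribution, two assemblies of the same genome can be compared directly, but the statistic says nothing about whether the scaffolds are correctly joined.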
Single-molecule sequencing of the desiccation-tolerant grass Oropetium thomaeum.

PubMed

VanBuren, Robert; Bryant, Doug; Edger, Patrick P; Tang, Haibao; Burgess, Diane; Challabathula, Dinakar; Spittle, Kristi; Hall, Richard; Gu, Jenny; Lyons, Eric; Freeling, Michael; Bartels, Dorothea; Ten Hallers, Boudewijn; Hastie, Alex; Michael, Todd P; Mockler, Todd C

2015-11-26

Plant genomes, and eukaryotic genomes in general, are typically repetitive, polyploid and heterozygous, which complicates genome assembly. The short read lengths of early Sanger and current next-generation sequencing platforms hinder assembly through complex repeat regions, and many draft and reference genomes are fragmented, lacking skewed GC and repetitive intergenic sequences, which are gaining importance due to projects like the Encyclopedia of DNA Elements (ENCODE). Here we report the whole-genome sequencing and assembly of the desiccation-tolerant grass Oropetium thomaeum. Using only single-molecule real-time sequencing, which generates long (>16 kilobases) reads with random errors, we assembled 99% (244 megabases) of the Oropetium genome into 625 contigs with an N50 length of 2.4 megabases. Oropetium is an example of a 'near-complete' draft genome which includes gapless coverage over gene space as well as intergenic sequences such as centromeres, telomeres, transposable elements and rRNA clusters that are typically unassembled in draft genomes. Oropetium has 28,466 protein-coding genes and 43% repeat sequences, yet with 30% more compact euchromatic regions it is the smallest known grass genome. The Oropetium genome demonstrates the utility of single-molecule real-time sequencing for assembling high-quality plant and other eukaryotic genomes, and serves as a valuable resource for the plant comparative genomics community.
The first genome sequences of human bocaviruses from Vietnam

PubMed Central

Thanh, Tran Tan; Van, Hoang Minh Tu; Hong, Nguyen Thi Thu; Nhu, Le Nguyen Truc; Anh, Nguyen To; Tuan, Ha Manh; Hien, Ho Van; Tuong, Nguyen Manh; Kien, Trinh Trung; Khanh, Truong Huu; Nhan, Le Nguyen Thanh; Hung, Nguyen Thanh; Chau, Nguyen Van Vinh; Thwaites, Guy; van Doorn, H. Rogier; Tan, Le Van

2017-01-01

As part of an ongoing effort to generate complete genome sequences of hand, foot and mouth disease-causing enteroviruses directly from clinical specimens, two complete coding sequences and two partial genomic sequences of human bocavirus 1 (n=3) and 2 (n=1) were co-amplified and sequenced, representing the first genome sequences of human bocaviruses from Vietnam. The sequences may aid future study aiming at understanding the evolution of the virus. PMID:28090592
Advantages of genome sequencing by long-read sequencer using SMRT technology in medical area.

PubMed

Nakano, Kazuma; Shiroma, Akino; Shimoji, Makiko; Tamotsu, Hinako; Ashimine, Noriko; Ohki, Shun; Shinzato, Misuzu; Minami, Maiko; Nakanishi, Tetsuhiro; Teruya, Kuniko; Satou, Kazuhito; Hirano, Takashi

2017-07-01

PacBio RS II is the first commercialized third-generation DNA sequencer able to sequence a single molecule DNA in real-time without amplification. PacBio RS II's sequencing technology is novel and unique, enabling the direct observation of DNA synthesis by DNA polymerase. PacBio RS II confers four major advantages compared to other sequencing technologies: long read lengths, high consensus accuracy, a low degree of bias, and simultaneous capability of epigenetic characterization. These advantages surmount the obstacle of sequencing genomic regions such as high/low G+C, tandem repeat, and interspersed repeat regions. Moreover, PacBio RS II is ideal for whole genome sequencing, targeted sequencing, complex population analysis, RNA sequencing, and epigenetics characterization. With PacBio RS II, we have sequenced and analyzed the genomes of many species, from viruses to humans. Herein, we summarize and review some of our key genome sequencing projects, including full-length viral sequencing, complete bacterial genome and almost-complete plant genome assemblies, and long amplicon sequencing of a disease-associated gene region. We believe that PacBio RS II is not only an effective tool for use in the basic biological sciences but also in the medical/clinical setting.
The Complete Mitochondrial Genome of Gossypium hirsutum and Evolutionary Analysis of Higher Plant Mitochondrial Genomes

PubMed Central

Su, Aiguo; Geng, Jianing; Grover, Corrinne E.; Hu, Songnian; Hua, Jinping

2013-01-01

Background: Mitochondria are the main manufacturers of cellular ATP in eukaryotes. The plant mitochondrial genome contains a large amount of foreign DNA and repeated sequences that undergo frequent intramolecular recombination. Upland cotton (Gossypium hirsutum L.) is one of the main natural fiber crops and also an important oil-producing plant. Sequencing of the cotton mitochondrial (mt) genome could be helpful for research on the evolution of plant mt genomes. Methodology/Principal Findings: We combined 454 sequencing with screening of a fosmid library of the Gossypium hirsutum mt genome and sequencing of positive clones, and conducted a series of evolutionary analyses on the mt genomes of Cycas taitungensis and 24 angiosperms. After data assembly and contig joining, the complete mitochondrial genome sequence of G. hirsutum was obtained. The completed G. hirsutum mt genome is 621,884 bp in length and contains 68 genes, including 35 protein genes, four rRNA genes and 29 tRNA genes. Five gene clusters are found conserved in all plant mt genomes; one and four clusters are specifically conserved in monocots and dicots, respectively. Homologous sequences are distributed along the plant mt genomes, and closely related species share the most homologous sequences. For species that have both mt and chloroplast genome sequences available, we checked the location of cp-like migration and found several fragments closely linked with mitochondrial genes. Conclusion: The G. hirsutum mt genome possesses most of the common characters of higher plant mt genomes. The existence of syntenic gene clusters, as well as the conservation of some intergenic sequences and genic content among the plant mt genomes, suggests that evolution of mt genomes is consistent with plant taxonomy but independent among different species. PMID:23940520
The complete mitochondrial genome of Gossypium hirsutum and evolutionary analysis of higher plant mitochondrial genomes.

PubMed

Liu, Guozheng; Cao, Dandan; Li, Shuangshuang; Su, Aiguo; Geng, Jianing; Grover, Corrinne E; Hu, Songnian; Hua, Jinping

2013-01-01

Mitochondria are the main manufacturers of cellular ATP in eukaryotes. The plant mitochondrial genome contains a large amount of foreign DNA and repeated sequences that undergo frequent intramolecular recombination. Upland cotton (Gossypium hirsutum L.) is one of the main natural fiber crops and also an important oil-producing plant. Sequencing of the cotton mitochondrial (mt) genome could be helpful for research on the evolution of plant mt genomes. We combined 454 sequencing with screening of a fosmid library of the Gossypium hirsutum mt genome and sequencing of positive clones, and conducted a series of evolutionary analyses on the mt genomes of Cycas taitungensis and 24 angiosperms. After data assembly and contig joining, the complete mitochondrial genome sequence of G. hirsutum was obtained. The completed G. hirsutum mt genome is 621,884 bp in length and contains 68 genes, including 35 protein genes, four rRNA genes and 29 tRNA genes. Five gene clusters are found conserved in all plant mt genomes; one and four clusters are specifically conserved in monocots and dicots, respectively. Homologous sequences are distributed along the plant mt genomes, and closely related species share the most homologous sequences. For species that have both mt and chloroplast genome sequences available, we checked the location of cp-like migration and found several fragments closely linked with mitochondrial genes. The G. hirsutum mt genome possesses most of the common characters of higher plant mt genomes. The existence of syntenic gene clusters, as well as the conservation of some intergenic sequences and genic content among the plant mt genomes, suggests that evolution of mt genomes is consistent with plant taxonomy but independent among different species.
The Pinus taeda genome is characterized by diverse and highly diverged repetitive sequences

PubMed Central

2010-01-01

Background: In today's age of genomic discovery, no attempt has been made to comprehensively sequence a gymnosperm genome. The largest genus in the coniferous family Pinaceae is Pinus, whose 110-120 species have extremely large genomes (c. 20-40 Gb, 2N = 24). The size and complexity of these genomes have prompted much speculation as to the feasibility of completing a conifer genome sequence. Conifer genomes are reputed to be highly repetitive, but there is little information available on the nature and identity of repetitive units in gymnosperms. The pines have extensive genetic resources, with approximately 329000 ESTs from eleven species and genetic maps in eight species, including a dense genetic map of the twelve linkage groups in Pinus taeda. Results: We present here the Sanger sequence and annotation of ten P. taeda BAC clones and Genome Analyzer II whole genome shotgun (WGS) sequences representing 7.5% of the genome. Computational annotation of ten BACs predicts three putative protein-coding genes and at least fifteen likely pseudogenes in nearly one megabase of sequence. We found three conifer-specific LTR retroelements in the BACs, and tentatively identified at least 15 others based on evidence from the distantly related angiosperms. Alignment of WGS sequences to the BACs indicates that 80% of BAC sequences have similar copies (≥ 75% nucleotide identity) elsewhere in the genome, but only 23% have identical copies (99% identity). The three most common repetitive elements in the genome were identified and, when combined, represent less than 5% of the genome. Conclusions: This study indicates that the majority of repeats in the P. taeda genome are 'novel' and will therefore require additional BAC or genomic sequencing for accurate characterization. The pine genome contains a very large number of diverged and probably defunct repetitive elements. This study also provides new evidence that sequencing a pine genome using a WGS approach is a feasible goal. PMID:20609256
Use of low-coverage, large-insert, short-read data for rapid and accurate generation of enhanced-quality draft Pseudomonas genome sequences.

PubMed

O'Brien, Heath E; Gong, Yunchen; Fung, Pauline; Wang, Pauline W; Guttman, David S

2011-01-01

Next-generation genomic technology has both greatly accelerated the pace of genome research and increased our reliance on draft genome sequences. While groups such as the Genomic Standards Consortium have made strong efforts to promote genome standards, there is still a general lack of uniformity among published draft genomes, leading to challenges for downstream comparative analyses. This lack of uniformity is a particular problem when using standard draft genomes that frequently have large numbers of low-quality sequencing tracts. Here we present a proposal for an "enhanced-quality draft" genome that identifies at least 95% of the coding sequences, thereby effectively providing a full accounting of the genic component of the genome. Enhanced-quality draft genomes are easily attainable through a combination of small- and large-insert next-generation, paired-end sequencing. We illustrate the generation of an enhanced-quality draft genome by re-sequencing the plant pathogenic bacterium Pseudomonas syringae pv. phaseolicola 1448A (Pph 1448A), which has a published, closed genome sequence of 5.93 Mbp. We use a combination of Illumina paired-end and mate-pair sequencing, and surprisingly find that de novo assemblies with 100x paired-end coverage and mate-pair sequencing with as low as 2-5x coverage are substantially better than assemblies based on higher coverage. The rapid and low-cost generation of large numbers of enhanced-quality draft genome sequences will be of particular value for microbial diagnostics and biosecurity, which rely on precise discrimination of potentially dangerous clones from closely related benign strains.
Illuminating the Black Box of Genome Sequence Assembly: A Free Online Tool to Introduce Students to Bioinformatics

ERIC Educational Resources Information Center

Taylor, D. Leland; Campbell, A. Malcolm; Heyer, Laurie J.

2013-01-01

Next-generation sequencing technologies have greatly reduced the cost of sequencing genomes. With the current sequencing technology, a genome is broken into fragments and sequenced, producing millions of "reads." A computer algorithm pieces these reads together in the genome assembly process. PHAST is a set of online modules…
Exome-wide DNA capture and next generation sequencing in domestic and wild species.

PubMed

Cosart, Ted; Beja-Pereira, Albano; Chen, Shanyuan; Ng, Sarah B; Shendure, Jay; Luikart, Gordon

2011-07-05

Gene-targeted and genome-wide markers are crucial to advance evolutionary biology, agriculture, and biodiversity conservation by improving our understanding of genetic processes underlying adaptation and speciation. Unfortunately, for eukaryotic species with large genomes it remains costly to obtain genome sequences and to develop genome resources such as genome-wide SNPs. A method is needed to allow gene-targeted, next-generation sequencing that is flexible enough to include any gene or number of genes, unlike transcriptome sequencing. Such a method would allow sequencing of many individuals, avoiding ascertainment bias in subsequent population genetic analyses. We demonstrate the usefulness of a recent technology, exon capture, for genome-wide, gene-targeted marker discovery in species with no genome resources. We use coding gene sequences from the domestic cow genome sequence (Bos taurus) to capture (enrich for), and subsequently sequence, thousands of exons of B. taurus, B. indicus, and Bison bison (wild bison). Our capture array has probes for 16,131 exons in 2,570 genes, including 203 candidate genes with known function and of interest for their association with disease and other fitness traits. We successfully sequenced and mapped exon sequences from across the 29 autosomes and X chromosome in the B. taurus genome sequence. Exon capture and high-throughput sequencing identified thousands of putative SNPs spread evenly across all reference chromosomes, in all three individuals, including hundreds of SNPs in our targeted candidate genes. This study shows exon capture can be customized for SNP discovery in many individuals and for non-model species without genomic resources. Our captured exome subset was small enough for affordable next-generation sequencing, and successfully captured exons from a divergent wild species using the domestic cow genome as reference.
Mosaic Graphs and Comparative Genomics in Phage Communities

PubMed Central

Belcaid, Mahdi; Bergeron, Anne

2010-01-01

Comparing the genomes of two closely related viruses often produces mosaics where nearly identical sequences alternate with sequences that are unique to each genome. When several closely related genomes are compared, the unique sequences are likely to be shared with third genomes, leading to virus mosaic communities. Here we present comparative analysis of sets of Staphylococcus aureus phages that share large identical sequences with up to three other genomes, and with different partners along their genomes. We introduce mosaic graphs to represent these complex recombination events, and use them to illustrate the breadth and depth of sequence sharing: some genomes are almost completely made up of shared sequences, while genomes that share very large identical sequences can adopt alternate functional modules. Mosaic graphs also allow us to identify breakpoints that could eventually be used for the construction of recombination networks. These findings have several implications on phage metagenomics assembly, on the horizontal gene transfer paradigm, and more generally on the understanding of the composition and evolutionary dynamics of virus communities. PMID:20874413

Single-cell genome sequencing at ultra-high-throughput with microfluidic droplet barcoding.

PubMed

Lan, Freeman; Demaree, Benjamin; Ahmed, Noorsher; Abate, Adam R

2017-07-01

The application of single-cell genome sequencing to large cell populations has been hindered by technical challenges in isolating single cells during genome preparation. Here we present single-cell genomic sequencing (SiC-seq), which uses droplet microfluidics to isolate, fragment, and barcode the genomes of single cells, followed by Illumina sequencing of pooled DNA. We demonstrate ultra-high-throughput sequencing of >50,000 cells per run in a synthetic community of Gram-negative and Gram-positive bacteria and fungi. The sequenced genomes can be sorted in silico based on characteristic sequences. We use this approach to analyze the distributions of antibiotic-resistance genes, virulence factors, and phage sequences in microbial communities from an environmental sample. The ability to routinely sequence large populations of single cells will enable the de-convolution of genetic heterogeneity in diverse cell populations.
De novo Assembly of a 40 Mb Eukaryotic Genome from Short Sequence Reads: Sordaria macrospora, a Model Organism for Fungal Morphogenesis

PubMed Central

Nowrousian, Minou; Stajich, Jason E.; Chu, Meiling; Engh, Ines; Espagne, Eric; Halliday, Karen; Kamerewerd, Jens; Kempken, Frank; Knab, Birgit; Kuo, Hsiao-Che; Osiewacz, Heinz D.; Pöggeler, Stefanie; Read, Nick D.; Seiler, Stephan; Smith, Kristina M.; Zickler, Denise; Kück, Ulrich; Freitag, Michael

2010-01-01

Filamentous fungi are of great importance in ecology, agriculture, medicine, and biotechnology. Thus, it is not surprising that genomes for more than 100 filamentous fungi have been sequenced, most of them by Sanger sequencing. While next-generation sequencing techniques have revolutionized genome resequencing, e.g. for strain comparisons, genetic mapping, or transcriptome and ChIP analyses, de novo assembly of eukaryotic genomes still presents significant hurdles, because of their large size and stretches of repetitive sequences. Filamentous fungi contain few repetitive regions in their 30–90 Mb genomes and thus are suitable candidates to test de novo genome assembly from short sequence reads. Here, we present a high-quality draft sequence of the Sordaria macrospora genome that was obtained by a combination of Illumina/Solexa and Roche/454 sequencing. Paired-end Solexa sequencing of genomic DNA to 85-fold coverage and an additional 10-fold coverage by single-end 454 sequencing resulted in ∼4 Gb of DNA sequence. Reads were assembled to a 40 Mb draft version (N50 of 117 kb) with the Velvet assembler. Comparative analysis with Neurospora genomes increased the N50 to 498 kb. The S. macrospora genome contains even fewer repeat regions than its closest sequenced relative, Neurospora crassa. Comparison with genomes of other fungi showed that S. macrospora, a model organism for morphogenesis and meiosis, harbors duplications of several genes involved in self/nonself-recognition. Furthermore, S. macrospora contains more polyketide biosynthesis genes than N. crassa. Phylogenetic analyses suggest that some of these genes may have been acquired by horizontal gene transfer from a distantly related ascomycete group. 
Our study shows that, for typical filamentous fungi, de novo assembly of genomes from short sequence reads alone is feasible, that a mixture of Solexa and 454 sequencing substantially improves the assembly, and that the resulting data can be used for comparative studies to address basic questions of fungal biology. PMID:20386741
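The coverage figures in this abstract can be checked with simple arithmetic: fold coverage is the number of sequenced bases divided by the genome size, so 85-fold plus 10-fold coverage of a roughly 40 Mb genome corresponds to about 3.8 Gb of sequence, consistent with the reported ∼4 Gb. The sketch below shows that calculation; the function name is a hypothetical helper, and the 40 Mb genome size is the rounded assembly size from the abstract rather than an exact value.

```python
def sequenced_bases(genome_size_bp, fold_coverages):
    """Total bases implied by a genome size and a list of fold coverages."""
    return genome_size_bp * sum(fold_coverages)


genome_size = 40_000_000  # ~40 Mb draft assembly, rounded as in the abstract
total = sequenced_bases(genome_size, [85, 10])  # Solexa 85-fold + 454 10-fold
print(total / 1e9)  # 3.8 (Gb)
```

The same relation run in reverse (bases divided by genome size) gives the fold coverage that a planned sequencing run would deliver.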
De novo assembly of a 40 Mb eukaryotic genome from short sequence reads: Sordaria macrospora, a model organism for fungal morphogenesis.

PubMed

Nowrousian, Minou; Stajich, Jason E; Chu, Meiling; Engh, Ines; Espagne, Eric; Halliday, Karen; Kamerewerd, Jens; Kempken, Frank; Knab, Birgit; Kuo, Hsiao-Che; Osiewacz, Heinz D; Pöggeler, Stefanie; Read, Nick D; Seiler, Stephan; Smith, Kristina M; Zickler, Denise; Kück, Ulrich; Freitag, Michael

2010-04-08

Filamentous fungi are of great importance in ecology, agriculture, medicine, and biotechnology. Thus, it is not surprising that genomes for more than 100 filamentous fungi have been sequenced, most of them by Sanger sequencing. While next-generation sequencing techniques have revolutionized genome resequencing, e.g. for strain comparisons, genetic mapping, or transcriptome and ChIP analyses, de novo assembly of eukaryotic genomes still presents significant hurdles, because of their large size and stretches of repetitive sequences. Filamentous fungi contain few repetitive regions in their 30-90 Mb genomes and thus are suitable candidates to test de novo genome assembly from short sequence reads. Here, we present a high-quality draft sequence of the Sordaria macrospora genome that was obtained by a combination of Illumina/Solexa and Roche/454 sequencing. Paired-end Solexa sequencing of genomic DNA to 85-fold coverage and an additional 10-fold coverage by single-end 454 sequencing resulted in approximately 4 Gb of DNA sequence. Reads were assembled to a 40 Mb draft version (N50 of 117 kb) with the Velvet assembler. Comparative analysis with Neurospora genomes increased the N50 to 498 kb. The S. macrospora genome contains even fewer repeat regions than its closest sequenced relative, Neurospora crassa. Comparison with genomes of other fungi showed that S. macrospora, a model organism for morphogenesis and meiosis, harbors duplications of several genes involved in self/nonself-recognition. Furthermore, S. macrospora contains more polyketide biosynthesis genes than N. crassa. Phylogenetic analyses suggest that some of these genes may have been acquired by horizontal gene transfer from a distantly related ascomycete group. 
Our study shows that, for typical filamentous fungi, de novo assembly of genomes from short sequence reads alone is feasible, that a mixture of Solexa and 454 sequencing substantially improves the assembly, and that the resulting data can be used for comparative studies to address basic questions of fungal biology.
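The N50 values quoted in this abstract (117 kb, rising to 498 kb) summarize assembly contiguity: N50 is the contig length at which contigs of that length or longer contain at least half of the assembled bases. The following is a minimal sketch of that calculation, together with the coverage arithmetic behind the "approximately 4 Gb" figure. The function names and the toy contig lengths are illustrative assumptions and are not taken from the paper or from the Velvet assembler.

```python
def n50(contig_lengths):
    """Return the N50 of a collection of contig lengths.

    N50 is the length L such that contigs of length >= L together
    contain at least half of the total assembled bases.
    """
    lengths = sorted(contig_lengths, reverse=True)
    if not lengths:
        raise ValueError("need at least one contig")
    half_total = sum(lengths) / 2
    running = 0
    for length in lengths:
        running += length
        if running >= half_total:
            return length


def expected_bases(genome_size_bp, fold_coverage):
    """Approximate number of sequenced bases for a given fold coverage."""
    return genome_size_bp * fold_coverage


# Hypothetical contig lengths (bp), not from the S. macrospora assembly.
toy_contigs = [100, 80, 60, 40, 20]
print(n50(toy_contigs))  # 80

# 85-fold Solexa plus 10-fold 454 coverage of a ~40 Mb genome.
print(expected_bases(40_000_000, 85 + 10))  # 3800000000
```

With a 40 Mb genome and 95-fold combined coverage, the sketch gives 3.8 Gb, which is consistent with the roughly 4 Gb of sequence reported in the abstract.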
Newborn Sequencing in Genomic Medicine and Public Health

PubMed Central

Agrawal, Pankaj B.; Bailey, Donald B.; Beggs, Alan H.; Brenner, Steven E.; Brower, Amy M.; Cakici, Julie A.; Ceyhan-Birsoy, Ozge; Chan, Kee; Chen, Flavia; Currier, Robert J.; Dukhovny, Dmitry; Green, Robert C.; Harris-Wai, Julie; Holm, Ingrid A.; Iglesias, Brenda; Joseph, Galen; Kingsmore, Stephen F.; Koenig, Barbara A.; Kwok, Pui-Yan; Lantos, John; Leeder, Steven J.; Lewis, Megan A.; McGuire, Amy L.; Milko, Laura V.; Mooney, Sean D.; Parad, Richard B.; Pereira, Stacey; Petrikin, Joshua; Powell, Bradford C.; Powell, Cynthia M.; Puck, Jennifer M.; Rehm, Heidi L.; Risch, Neil; Roche, Myra; Shieh, Joseph T.; Veeraraghavan, Narayanan; Watson, Michael S.; Willig, Laurel; Yu, Timothy W.; Urv, Tiina; Wise, Anastasia L.

2017-01-01

The rapid development of genomic sequencing technologies has decreased the cost of genetic analysis to the extent that it seems plausible that genome-scale sequencing could have widespread availability in pediatric care. Genomic sequencing provides a powerful diagnostic modality for patients who manifest symptoms of monogenic disease and an opportunity to detect health conditions before their development. However, many technical, clinical, ethical, and societal challenges should be addressed before such technology is widely deployed in pediatric practice. This article provides an overview of the Newborn Sequencing in Genomic Medicine and Public Health Consortium, which is investigating the application of genome-scale sequencing in newborns for both diagnosis and screening. PMID:28096516
A novel bioinformatics method for efficient knowledge discovery by BLSOM from big genomic sequence data.

PubMed

Bai, Yu; Iwasaki, Yuki; Kanaya, Shigehiko; Zhao, Yue; Ikemura, Toshimichi

2014-01-01

With the remarkable increase in genomic sequence data from a wide range of species, novel tools are needed for comprehensive analyses of big sequence data. The Self-Organizing Map (SOM) is an effective tool for clustering and visualizing high-dimensional data such as oligonucleotide composition on one map. By modifying the conventional SOM, we have previously developed Batch-Learning SOM (BLSOM), which allows classification of sequence fragments according to species, solely depending on the oligonucleotide composition. In the present study, we introduce the oligonucleotide BLSOM used for characterization of vertebrate genome sequences. We first analyzed pentanucleotide compositions in 100 kb sequences derived from a wide range of vertebrate genomes and then the compositions in the human and mouse genomes in order to investigate an efficient method for detecting differences between closely related genomes. BLSOM can recognize the species-specific key combination of oligonucleotide frequencies in each genome, which is called a "genome signature," and the specific regions enriched in transcription-factor-binding sequences. Because its classification and visualization power is very high, BLSOM is an efficient and powerful tool for extracting a wide range of information from massive amounts of genomic sequences (i.e., big sequence data).
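The input to a map of this kind is an oligonucleotide composition vector for each sequence fragment. As a rough sketch of that preprocessing step only (the SOM training itself is not shown), the code below cuts a sequence into fixed-size windows and computes the relative frequency of every k-mer in each window. The function names are hypothetical, the sketch counts one strand only, and it is not claimed to match the authors' implementation.

```python
from collections import Counter
from itertools import product


def kmer_composition(sequence, k=5):
    """Return relative frequencies of all 4**k DNA k-mers in `sequence`.

    k-mers containing characters other than A, C, G, T are ignored.
    The vector is ordered lexicographically (AAAAA, AAAAC, ...).
    """
    sequence = sequence.upper()
    counts = Counter(
        sequence[i:i + k] for i in range(len(sequence) - k + 1)
    )
    kmers = ["".join(p) for p in product("ACGT", repeat=k)]
    total = sum(counts[kmer] for kmer in kmers)
    if total == 0:
        return [0.0] * len(kmers)
    return [counts[kmer] / total for kmer in kmers]


def windows(sequence, size=100_000):
    """Yield consecutive non-overlapping full windows of `size` bases."""
    for start in range(0, len(sequence) - size + 1, size):
        yield sequence[start:start + size]


# Toy usage with tiny windows and dinucleotides so the output stays small.
toy = "ACGTACGTAC" * 3
vectors = [kmer_composition(w, k=2) for w in windows(toy, size=10)]
print(len(vectors), len(vectors[0]))  # 3 16
```

For the setting described in the abstract, the defaults `k=5` and `size=100_000` would give one 1,024-dimensional vector per 100 kb fragment.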
Genome Science: A Video Tour of the Washington University Genome Sequencing Center for High School and Undergraduate Students

PubMed Central

2005-01-01

Sequencing of the human genome has ushered in a new era of biology. The technologies developed to facilitate the sequencing of the human genome are now being applied to the sequencing of other genomes. In 2004, a partnership was formed between Washington University School of Medicine Genome Sequencing Center's Outreach Program and Washington University Department of Biology Science Outreach to create a video tour depicting the processes involved in large-scale sequencing. “Sequencing a Genome: Inside the Washington University Genome Sequencing Center” is a tour of the laboratory that follows the steps in the sequencing pipeline, interspersed with animated explanations of the scientific procedures used at the facility. Accompanying interviews with the staff illustrate different entry levels for a career in genome science. This video project serves as an example of how research and academic institutions can provide teachers and students with access and exposure to innovative technologies at the forefront of biomedical research. Initial feedback on the video from undergraduate students, high school teachers, and high school students provides suggestions for use of this video in a classroom setting to supplement present curricula. PMID:16341256
From Conventional to Next Generation Sequencing of Epstein-Barr Virus Genomes.

PubMed

Kwok, Hin; Chiang, Alan Kwok Shing

2016-02-24

Genomic sequences of Epstein-Barr virus (EBV) have been of interest because the virus is associated with cancers, such as nasopharyngeal carcinoma, and conditions such as infectious mononucleosis. The progress of whole-genome EBV sequencing has been limited by the inefficiency and cost of first-generation sequencing technology. With the advancement of next-generation sequencing (NGS) and target enrichment strategies, an increasing number of EBV genomes have been published. These genomes were sequenced using different approaches, either with or without EBV DNA enrichment. This review provides an overview of the EBV genomes published to date, and a description of the sequencing technology and bioinformatic analyses employed in generating these sequences. We further explore ways through which the quality of sequencing data can be improved, such as using DNA oligos for capture hybridization, and longer insert size and read length in the sequencing runs. These advances will enable large-scale genomic sequencing of EBV, which will facilitate a better understanding of the genetic variations of EBV in different geographic regions and discovery of potentially pathogenic variants in specific diseases.
Initial sequencing and comparative analysis of the mouse genome

DOE Office of Scientific and Technical Information (OSTI.GOV)

Waterston, Robert H.; Lindblad-Toh, Kerstin; Birney, Ewan

2002-12-15

The sequence of the mouse genome is a key informational tool for understanding the contents of the human genome and a key experimental tool for biomedical research. Here, we report the results of an international collaboration to produce a high-quality draft sequence of the mouse genome. We also present an initial comparative analysis of the mouse and human genomes, describing some of the insights that can be gleaned from the two sequences. We discuss topics including the analysis of the evolutionary forces shaping the size, structure and sequence of the genomes; the conservation of large-scale synteny across most of the genomes; the much lower extent of sequence orthology covering less than half of the genomes; the proportions of the genomes under selection; the number of protein-coding genes; the expansion of gene families related to reproduction and immunity; the evolution of proteins; and the identification of intraspecies polymorphism.
Tapping the promise of genomics in species with complex, nonmodel genomes.

PubMed

Hirsch, Candice N; Buell, C Robin

2013-01-01

Genomics is enabling a renaissance in all disciplines of plant biology. However, many plant genomes are complex and remain recalcitrant to current genomic technologies. The complexities of these nonmodel plant genomes are attributable to gene and genome duplication, heterozygosity, ploidy, and/or repetitive sequences. Methods are available to simplify the genome and reduce these barriers, including inbreeding and genome reduction, making these species amenable to current sequencing and assembly methods. Some, but not all, of the complexities in nonmodel genomes can be bypassed by sequencing the transcriptome rather than the genome. Additionally, comparative genomics approaches, which leverage phylogenetic relatedness, can aid in the interpretation of complex genomes. Although there are limitations in accessing complex nonmodel plant genomes using current sequencing technologies, genome manipulation and resourceful analyses can allow access to even the most recalcitrant plant genomes.
Genome-wide characterization of centromeric satellites from multiple mammalian genomes.

PubMed

Alkan, Can; Cardone, Maria Francesca; Catacchio, Claudia Rita; Antonacci, Francesca; O'Brien, Stephen J; Ryder, Oliver A; Purgato, Stefania; Zoli, Monica; Della Valle, Giuliano; Eichler, Evan E; Ventura, Mario

2011-01-01

Despite its importance in cell biology and evolution, the centromere has remained the final frontier in genome assembly and annotation due to its complex repeat structure. However, isolation and characterization of the centromeric repeats from newly sequenced species are necessary for a complete understanding of genome evolution and function. In recent years, various genomes have been sequenced, but the characterization of the corresponding centromeric DNA has lagged behind. Here, we present a computational method (RepeatNet) to systematically identify higher-order repeat structures from unassembled whole-genome shotgun sequence and test whether these sequence elements correspond to functional centromeric sequences. We analyzed genome datasets from six species of mammals representing the diversity of the mammalian lineage, namely, horse, dog, elephant, armadillo, opossum, and platypus. We define candidate monomer satellite repeats and demonstrate centromeric localization for five of the six genomes. Our analysis revealed the greatest diversity of centromeric sequences in horse and dog in contrast to elephant and armadillo, which showed high-centromeric sequence homogeneity. We could not isolate centromeric sequences within the platypus genome, suggesting that centromeres in platypus are not enriched in satellite DNA. Our method can be applied to the characterization of thousands of other vertebrate genomes anticipated for sequencing in the near future, providing an important tool for annotation of centromeres.
Parents' interest in whole-genome sequencing of newborns.

PubMed

Goldenberg, Aaron J; Dodson, Daniel S; Davis, Matthew M; Tarini, Beth A

2014-01-01

The aim of this study was to assess parents' interest in whole-genome sequencing for newborns. We conducted a survey of a nationally representative sample of 1,539 parents about their interest in whole-genome sequencing of newborns. Participants were randomly presented with one of two scenarios that differed in the venue of testing: one offered whole-genome sequencing through a state newborn screening program, whereas the other offered whole-genome sequencing in a pediatrician's office. Overall interest in having future newborns undergo whole-genome sequencing was generally high among parents. If whole-genome sequencing were offered through a state's newborn-screening program, 74% of parents were either definitely or somewhat interested in utilizing this technology. If offered in a pediatrician's office, 70% of parents were either definitely or somewhat interested. Parents in both groups most frequently identified test accuracy and the ability to prevent a child from developing a disease as "very important" in making a decision to have a newborn's whole genome sequenced. These data may help health departments and children's health-care providers anticipate parents' level of interest in genomic screening for newborns. As whole-genome sequencing is integrated into clinical and public health services, these findings may inform the development of educational strategies and outreach messages for parents.
Following the Flag: An Air Force Officer Provides an Eyewitness View of Major Events and Policies during the Cold War

DTIC Science & Technology

2010-03-01

940. The First Pursuit Group of the Army Air Corps (AAC) did its gunnery training at Phelps-Collins Airport near my hometown of Alpena ...Apparently the MiGs were on a radar approach to Antung, their base just across the Yalu. What to do? “Dreams of glory— Alpena lieutenant gets four MiGs on...unintended result. Highest and Fastest The first time I flew was in an old Ford trimotor transport from Alpena to Detroit. I was about 14. An early memory
Coding Complete Genome for the Mogiana Tick Virus, a Jingmenvirus Isolated from Ticks in Brazil

DTIC Science & Technology

2017-05-04

sequences for all four genome segments. We downloaded the raw Illumina sequence reads from the NCBI Short Read Archive (GenBank...MGTV genome segments through sequence similarity (BLASTN) to the published genome of Jingmen tick virus (JMTV) isolate SY84 (GenBank: KJ001579-KJ001582...2014. Standards for sequencing viral genomes in the era of high-throughput sequencing. MBio 5:e01360–14. 8. Bankevich A, Nurk S, Antipov
A one-page summary report of genome sequencing for the healthy adult.

PubMed

Vassy, Jason L; McLaughlin, Heather M; McLaughlin, Heather L; MacRae, Calum A; Seidman, Christine E; Lautenbach, Denise; Krier, Joel B; Lane, William J; Kohane, Isaac S; Murray, Michael F; McGuire, Amy L; Rehm, Heidi L; Green, Robert C

2015-01-01

As genome sequencing technologies increasingly enter medical practice, genetics laboratories must communicate sequencing results effectively to nongeneticist physicians. We describe the design and delivery of a clinical genome sequencing report, including a one-page summary suitable for interpretation by primary care physicians. To illustrate our preliminary experience with this report, we summarize the genomic findings from 10 healthy participants in a study of genome sequencing in primary care. © 2015 S. Karger AG, Basel.
A One-Page Summary Report of Genome Sequencing for the Healthy Adult

PubMed Central

Vassy, Jason L.; McLaughlin, Heather M.; MacRae, Calum A.; Seidman, Christine E.; Lautenbach, Denise; Krier, Joel B.; Lane, William J.; Kohane, Isaac S.; Murray, Michael F.; McGuire, Amy L.; Rehm, Heidi L.; Green, Robert C.

2015-01-01

As genome sequencing technologies increasingly enter medical practice, genetics laboratories must communicate sequencing results effectively to non-geneticist physicians. We describe the design and delivery of a clinical genome sequencing report, including a one-page summary suitable for interpretation by primary care physicians. To illustrate our preliminary experience with this report, we summarize the genomic findings from ten healthy patient participants in a study of genome sequencing in primary care. PMID:25612602
Consequences of Normalizing Transcriptomic and Genomic Libraries of Plant Genomes Using a Duplex-Specific Nuclease and Tetramethylammonium Chloride

PubMed Central

Froenicke, Lutz; Lavelle, Dean; Martineau, Belinda; Perroud, Bertrand; Michelmore, Richard

2013-01-01

Several applications of high throughput genome and transcriptome sequencing would benefit from a reduction of the high-copy-number sequences in the libraries being sequenced and analyzed, particularly when applied to species with large genomes. We adapted and analyzed the consequences of a method that utilizes a thermostable duplex-specific nuclease for reducing the high-copy components in transcriptomic and genomic libraries prior to sequencing. This reduces the time, cost, and computational effort of obtaining informative transcriptomic and genomic sequence data for both fully sequenced and non-sequenced genomes. It also reduces contamination from organellar DNA in preparations of nuclear DNA. Hybridization in the presence of 3 M tetramethylammonium chloride (TMAC), which equalizes the rates of hybridization of GC and AT nucleotide pairs, reduced the bias against sequences with high GC content. Consequences of this method on the reduction of high-copy and enrichment of low-copy sequences are reported for Arabidopsis and lettuce. PMID:23409088
Consequences of normalizing transcriptomic and genomic libraries of plant genomes using a duplex-specific nuclease and tetramethylammonium chloride.

PubMed

Matvienko, Marta; Kozik, Alexander; Froenicke, Lutz; Lavelle, Dean; Martineau, Belinda; Perroud, Bertrand; Michelmore, Richard

2013-01-01

Several applications of high throughput genome and transcriptome sequencing would benefit from a reduction of the high-copy-number sequences in the libraries being sequenced and analyzed, particularly when applied to species with large genomes. We adapted and analyzed the consequences of a method that utilizes a thermostable duplex-specific nuclease for reducing the high-copy components in transcriptomic and genomic libraries prior to sequencing. This reduces the time, cost, and computational effort of obtaining informative transcriptomic and genomic sequence data for both fully sequenced and non-sequenced genomes. It also reduces contamination from organellar DNA in preparations of nuclear DNA. Hybridization in the presence of 3 M tetramethylammonium chloride (TMAC), which equalizes the rates of hybridization of GC and AT nucleotide pairs, reduced the bias against sequences with high GC content. Consequences of this method on the reduction of high-copy and enrichment of low-copy sequences are reported for Arabidopsis and lettuce.
Personal Genome Sequencing in Ostensibly Healthy Individuals and the PeopleSeq Consortium

PubMed Central

Linderman, Michael D.; Nielsen, Daiva E.; Green, Robert C.

2016-01-01

Thousands of ostensibly healthy individuals have had their exome or genome sequenced, but a much smaller number of these individuals have received any personal genomic results from that sequencing. We term those projects in which ostensibly healthy participants can receive sequencing-derived genetic findings and may also have access to their genomic data as participatory predispositional personal genome sequencing (PPGS). Here we are focused on genome sequencing applied in a pre-symptomatic context and so define PPGS to exclude diagnostic genome sequencing intended to identify the molecular cause of suspected or diagnosed genetic disease. In this report we describe the design of completed and underway PPGS projects, briefly summarize the results reported to date and introduce the PeopleSeq Consortium, a newly formed collaboration of PPGS projects designed to collect much-needed longitudinal outcome data. PMID:27023617
Research progress of plant population genomics based on high-throughput sequencing.

PubMed

Wang, Yun-sheng

2016-08-01

Population genomics, a new paradigm for population genetics, combines the concepts and techniques of genomics with the theoretical system of population genetics and improves our understanding of microevolution through identification of site-specific and genome-wide effects using genome-wide genotyping of polymorphic sites. With the appearance and improvement of next-generation high-throughput sequencing technology, the number of plant species with complete genome sequences has increased rapidly, and large-scale resequencing has also been carried out in recent years. Parallel sequencing has also been done in some plant species without complete genome sequences. These studies have greatly promoted the development of population genomics and deepened our understanding of the genetic diversity, level of linkage disequilibrium, selection effects, demographic history, and molecular mechanisms of complex traits of the relevant plant populations at a genomic level. In this review, I briefly introduce the concept and research methods of population genomics and summarize the research progress of plant population genomics based on high-throughput sequencing. I also discuss the prospects as well as existing problems of plant population genomics in order to provide references for related studies.
Genomic Diversity and Evolution of the Lyssaviruses

PubMed Central

Delmas, Olivier; Holmes, Edward C.; Talbi, Chiraz; Larrous, Florence; Dacheux, Laurent; Bouchier, Christiane; Bourhy, Hervé

2008-01-01

Lyssaviruses are RNA viruses with single-strand, negative-sense genomes responsible for rabies-like diseases in mammals. To date, genomic and evolutionary studies have most often utilized partial genome sequences, particularly of the nucleoprotein and glycoprotein genes, with little consideration of genome-scale evolution. Herein, we report the first genomic and evolutionary analysis using complete genome sequences of all recognised lyssavirus genotypes, including 14 new complete genomes of field isolates from 6 genotypes and one genotype that is completely sequenced for the first time. In doing so we significantly increase the extent of genome sequence data available for these important viruses. Our analysis of these genome sequence data reveals that all lyssaviruses have the same genomic organization. A phylogenetic analysis reveals strong geographical structuring, with the greatest genetic diversity in Africa, and an independent origin for the two known genotypes that infect European bats. We also suggest that multiple genotypes may exist within the diversity of viruses currently classified as ‘Lagos Bat’. In sum, we show that rigorous phylogenetic techniques based on full length genome sequence provide the best discriminatory power for genotype classification within the lyssaviruses. PMID:18446239

Fungal Genomics for Energy and Environment

DOE Office of Scientific and Technical Information (OSTI.GOV)

Grigoriev, Igor V.

2013-03-11

Genomes of fungi relevant to energy and environment are the focus of the Fungal Genomic Program at the US Department of Energy Joint Genome Institute (JGI). One of its projects, the Genomics Encyclopedia of Fungi, targets fungi related to plant health (symbionts, pathogens, and biocontrol agents) and biorefinery processes (cellulose degradation, sugar fermentation, industrial hosts) by means of genome sequencing and analysis. New chapters of the Encyclopedia can be opened with user proposals to the JGI Community Sequencing Program (CSP). Another JGI project, the 1000 fungal genomes, explores fungal diversity at the genome level at scale and is open for users to nominate new species for sequencing. Over 200 fungal genomes have been sequenced by JGI to date and released through MycoCosm (www.jgi.doe.gov/fungi), a fungal web portal that integrates sequence and functional data with genome analysis tools for the user community. Sequence analysis supported by functional genomics leads to developing a parts list for complex systems ranging from ecosystems of biofuel crops to biorefineries. Recent examples of such parts suggested by comparative genomics and functional analysis in these areas are presented here.
Serendipitous discovery of Wolbachia genomes in multiple Drosophila species.

PubMed

Salzberg, Steven L; Dunning Hotopp, Julie C; Delcher, Arthur L; Pop, Mihai; Smith, Douglas R; Eisen, Michael B; Nelson, William C

2005-01-01

The Trace Archive is a repository for the raw, unanalyzed data generated by large-scale genome sequencing projects. The existence of this data offers scientists the possibility of discovering additional genomic sequences beyond those originally sequenced. In particular, if the source DNA for a sequencing project came from a species that was colonized by another organism, then the project may yield substantial amounts of genomic DNA, including near-complete genomes, from the symbiotic or parasitic organism. By searching the publicly available repository of DNA sequencing trace data, we discovered three new species of the bacterial endosymbiont Wolbachia pipientis in three different species of fruit fly: Drosophila ananassae, D. simulans, and D. mojavensis. We extracted all sequences with partial matches to a previously sequenced Wolbachia strain and assembled those sequences using customized software. For one of the three new species, the data recovered were sufficient to produce an assembly that covers more than 95% of the genome; for a second species the data produce the equivalent of a 'light shotgun' sampling of the genome, covering an estimated 75-80% of the genome; and for the third species the data cover approximately 6-7% of the genome. The results of this study reveal an unexpected benefit of depositing raw data in a central genome sequence repository: new species can be discovered within this data. The differences between these three new Wolbachia genomes and the previously sequenced strain revealed numerous rearrangements and insertions within each lineage and hundreds of novel genes. The three new genomes, with annotation, have been deposited in GenBank.
Development and validation of an rDNA operon based primer walking strategy applicable to de novo bacterial genome finishing

PubMed Central

Eastman, Alexander W.; Yuan, Ze-Chun

2015-01-01

Advances in sequencing technology have drastically increased the depth and feasibility of bacterial genome sequencing. However, little information is available that details the specific techniques and procedures employed during genome sequencing despite the large numbers of published genomes. Shotgun approaches employed by second-generation sequencing platforms have necessitated the development of robust bioinformatics tools for in silico assembly, and complete assembly is limited by the presence of repetitive DNA sequences and multi-copy operons. Typically, re-sequencing with multiple platforms and laborious, targeted Sanger sequencing are employed to finish a draft bacterial genome. Here we describe a novel strategy based on the identification and targeted sequencing of repetitive rDNA operons to expedite bacterial genome assembly and finishing. Our strategy was validated by finishing the genome of Paenibacillus polymyxa strain CR1, a bacterium with potential in sustainable agriculture and bio-based processes. An analysis of the 38 contigs contained in the P. polymyxa strain CR1 draft genome revealed 12 repetitive rDNA operons with varied intragenic and flanking regions of variable length, unanimously located at contig boundaries and within contig gaps. These highly similar but not identical rDNA operons were experimentally verified and sequenced simultaneously with multiple, specially designed primer sets. This approach also identified and corrected significant sequence rearrangement generated during the initial in silico assembly of sequencing reads. Our approach reduces the required effort associated with blind primer walking for contig assembly, increasing both the speed and feasibility of genome finishing. Our study further reinforces the notion that repetitive DNA elements are major limiting factors for genome finishing. Moreover, we provide a step-by-step workflow for genome finishing, which may guide future bacterial genome finishing projects. PMID:25653642
Why Assembling Plant Genome Sequences Is So Challenging

PubMed Central

Claros, Manuel Gonzalo; Bautista, Rocío; Guerrero-Fernández, Darío; Benzerki, Hicham; Seoane, Pedro; Fernández-Pozo, Noé

2012-01-01

In spite of the biological and economic importance of plants, relatively few plant species have been sequenced. Only the genome sequence of plants with relatively small genomes, most of them angiosperms, in particular eudicots, has been determined. The arrival of next-generation sequencing technologies has allowed the rapid and efficient development of new genomic resources for non-model or orphan plant species. But the sequencing pace of plants is far from that of animals and microorganisms. This review focuses on the typical challenges of plant genomes that can explain why plant genomics is less developed than animal genomics. Explanations about the impact of some confounding factors emerging from the nature of plant genomes are given. As a result of these challenges and confounding factors, the correct assembly and annotation of plant genomes is hindered, genome drafts are produced, and advances in plant genomics are delayed. PMID:24832233
Insights from Human/Mouse genome comparisons

DOE Office of Scientific and Technical Information (OSTI.GOV)

Pennacchio, Len A.

2003-03-30

Large-scale public genomic sequencing efforts have provided a wealth of vertebrate sequence data poised to provide insights into mammalian biology. These include deep genomic sequence coverage of human, mouse, rat, zebrafish, and two pufferfish (Fugu rubripes and Tetraodon nigroviridis) (Aparicio et al. 2002; Lander et al. 2001; Venter et al. 2001; Waterston et al. 2002). In addition, a high priority has been placed on determining the genomic sequence of chimpanzee, dog, cow, frog, and chicken (Boguski 2002). While only recently available, whole genome sequence data have provided the unique opportunity to globally compare complete genome contents. Furthermore, the shared evolutionary ancestry of vertebrate species has allowed the development of comparative genomic approaches to identify ancient conserved sequences with functionality. Accordingly, this review focuses on the initial comparison of available mammalian genomes and describes various insights derived from such analysis.
Single-molecule sequencing of the desiccation-tolerant grass Oropetium thomaeum

DOE Office of Scientific and Technical Information (OSTI.GOV)

VanBuren, Robert; Bryant, Doug; Edger, Patrick P.

Plant genomes, and eukaryotic genomes in general, are typically repetitive, polyploid and heterozygous, which complicates genome assembly. The short read lengths of early Sanger and current next-generation sequencing platforms hinder assembly through complex repeat regions, and many draft and reference genomes are fragmented, lacking skewed GC and repetitive intergenic sequences, which are gaining importance due to projects like the Encyclopedia of DNA Elements (ENCODE). Here we report the whole-genome sequencing and assembly of the desiccation-tolerant grass Oropetium thomaeum. Using only single-molecule real-time sequencing, which generates long (>16 kilobases) reads with random errors, we assembled 99% (244 megabases) of the Oropetium genome into 625 contigs with an N50 length of 2.4 megabases. Oropetium is an example of a ‘near-complete’ draft genome which includes gapless coverage over gene space as well as intergenic sequences such as centromeres, telomeres, transposable elements and rRNA clusters that are typically unassembled in draft genomes. Oropetium has 28,466 protein-coding genes and 43% repeat sequences, yet with 30% more compact euchromatic regions it is the smallest known grass genome. As a result, the Oropetium genome demonstrates the utility of single-molecule real-time sequencing for assembling high-quality plant and other eukaryotic genomes, and serves as a valuable resource for the plant comparative genomics community.
Single-molecule sequencing of the desiccation-tolerant grass Oropetium thomaeum

DOE PAGES

VanBuren, Robert; Bryant, Doug; Edger, Patrick P.; ...

2015-11-11

Plant genomes, and eukaryotic genomes in general, are typically repetitive, polyploid and heterozygous, which complicates genome assembly. The short read lengths of early Sanger and current next-generation sequencing platforms hinder assembly through complex repeat regions, and many draft and reference genomes are fragmented, lacking skewed GC and repetitive intergenic sequences, which are gaining importance due to projects like the Encyclopedia of DNA Elements (ENCODE). Here we report the whole-genome sequencing and assembly of the desiccation-tolerant grass Oropetium thomaeum. Using only single-molecule real-time sequencing, which generates long (>16 kilobases) reads with random errors, we assembled 99% (244 megabases) of the Oropetium genome into 625 contigs with an N50 length of 2.4 megabases. Oropetium is an example of a ‘near-complete’ draft genome which includes gapless coverage over gene space as well as intergenic sequences such as centromeres, telomeres, transposable elements and rRNA clusters that are typically unassembled in draft genomes. Oropetium has 28,466 protein-coding genes and 43% repeat sequences, yet with 30% more compact euchromatic regions it is the smallest known grass genome. As a result, the Oropetium genome demonstrates the utility of single-molecule real-time sequencing for assembling high-quality plant and other eukaryotic genomes, and serves as a valuable resource for the plant comparative genomics community.
It’s More Than Stamp Collecting: How Genome Sequencing Can Unify Biological Research

PubMed Central

Richards, Stephen

2015-01-01

The availability of reference genome sequences, especially the human reference, has revolutionized the study of biology. However, whilst the genomes of some species have been fully sequenced, a wide range of biological problems still cannot be effectively studied for lack of genome sequence information. Here, I identify neglected areas of biology and describe how both targeted species sequencing and more broad taxonomic surveys of the tree of life can address important biological questions. I enumerate the significant benefits that would accrue from sequencing a broader range of taxa, as well as discuss the technical advances in sequencing and assembly methods that would allow for wide-ranging application of whole-genome analysis. Finally, I suggest that in addition to “Big Science” survey initiatives to sequence the tree of life, a modified infrastructure-funding paradigm would better support reference genome sequence generation for research communities most in need. PMID:26003218
It's more than stamp collecting: how genome sequencing can unify biological research.

PubMed

Richards, Stephen

2015-07-01

The availability of reference genome sequences, especially the human reference, has revolutionized the study of biology. However, while the genomes of some species have been fully sequenced, a wide range of biological problems still cannot be effectively studied for lack of genome sequence information. Here, I identify neglected areas of biology and describe how both targeted species sequencing and more broad taxonomic surveys of the tree of life can address important biological questions. I enumerate the significant benefits that would accrue from sequencing a broader range of taxa, as well as discuss the technical advances in sequencing and assembly methods that would allow for wide-ranging application of whole-genome analysis. Finally, I suggest that in addition to 'big science' survey initiatives to sequence the tree of life, a modified infrastructure-funding paradigm would better support reference genome sequence generation for research communities most in need. Copyright © 2015 Elsevier Ltd. All rights reserved.
Complete Genome Sequence of Pigmentation Negative Yersinia pestis strain Cadman

DTIC Science & Technology

2016-10-27

Institute of Infectious Diseases, Fort Detrick, Frederick, Maryland, USA. Running head: Complete Genome Sequence of Y. pestis strain Cadman... Complete Genome Sequence of Pigmentation Negative Yersinia pestis strain Cadman. Sean Lovett, Kitty Chase, Galina Koroleva, Gustavo... we report the genome sequence of Yersinia pestis strain Cadman, an attenuated strain lacking the pgm locus. Y. pestis is the causative agent of
MIPS: a database for genomes and protein sequences.

PubMed Central

Mewes, H W; Heumann, K; Kaps, A; Mayer, K; Pfeiffer, F; Stocker, S; Frishman, D

1999-01-01

The Munich Information Center for Protein Sequences (MIPS-GSF), Martinsried near Munich, Germany, develops and maintains genome oriented databases. It is commonplace that the amount of sequence data available increases rapidly, but not the capacity of qualified manual annotation at the sequence databases. Therefore, our strategy aims to cope with the data stream by the comprehensive application of analysis tools to sequences of complete genomes, the systematic classification of protein sequences and the active support of sequence analysis and functional genomics projects. This report describes the systematic and up-to-date analysis of genomes (PEDANT), a comprehensive database of the yeast genome (MYGD), a database reflecting the progress in sequencing the Arabidopsis thaliana genome (MATD), the database of assembled, annotated human EST clusters (MEST), and the collection of protein sequence data within the framework of the PIR-International Protein Sequence Database (described elsewhere in this volume). MIPS provides access through its WWW server (http://www.mips.biochem.mpg.de) to a spectrum of generic databases, including the above mentioned as well as a database of protein families (PROTFAM), the MITOP database, and the all-against-all FASTA database. PMID:9847138
The Genome Sequencer FLX System--longer reads, more applications, straight forward bioinformatics and more complete data sets.

PubMed

Droege, Marcus; Hill, Brendon

2008-08-31

The Genome Sequencer FLX System (GS FLX), powered by 454 Sequencing, is a next-generation DNA sequencing technology featuring a unique mix of long reads, exceptional accuracy, and ultra-high throughput. It has been proven to be the most versatile of all currently available next-generation sequencing technologies, supporting many high-profile studies in over seven applications categories. GS FLX users have pursued innovative research in de novo sequencing, re-sequencing of whole genomes and target DNA regions, metagenomics, and RNA analysis. 454 Sequencing is a powerful tool for human genetics research, having recently re-sequenced the genome of an individual human, currently re-sequencing the complete human exome and targeted genomic regions using the NimbleGen sequence capture process, and detected low-frequency somatic mutations linked to cancer.
Newborn Sequencing in Genomic Medicine and Public Health.

PubMed

Berg, Jonathan S; Agrawal, Pankaj B; Bailey, Donald B; Beggs, Alan H; Brenner, Steven E; Brower, Amy M; Cakici, Julie A; Ceyhan-Birsoy, Ozge; Chan, Kee; Chen, Flavia; Currier, Robert J; Dukhovny, Dmitry; Green, Robert C; Harris-Wai, Julie; Holm, Ingrid A; Iglesias, Brenda; Joseph, Galen; Kingsmore, Stephen F; Koenig, Barbara A; Kwok, Pui-Yan; Lantos, John; Leeder, Steven J; Lewis, Megan A; McGuire, Amy L; Milko, Laura V; Mooney, Sean D; Parad, Richard B; Pereira, Stacey; Petrikin, Joshua; Powell, Bradford C; Powell, Cynthia M; Puck, Jennifer M; Rehm, Heidi L; Risch, Neil; Roche, Myra; Shieh, Joseph T; Veeraraghavan, Narayanan; Watson, Michael S; Willig, Laurel; Yu, Timothy W; Urv, Tiina; Wise, Anastasia L

2017-02-01

The rapid development of genomic sequencing technologies has decreased the cost of genetic analysis to the extent that it seems plausible that genome-scale sequencing could have widespread availability in pediatric care. Genomic sequencing provides a powerful diagnostic modality for patients who manifest symptoms of monogenic disease and an opportunity to detect health conditions before their development. However, many technical, clinical, ethical, and societal challenges should be addressed before such technology is widely deployed in pediatric practice. This article provides an overview of the Newborn Sequencing in Genomic Medicine and Public Health Consortium, which is investigating the application of genome-scale sequencing in newborns for both diagnosis and screening. Copyright © 2017 by the American Academy of Pediatrics.
Comparative sequence analysis of Sordaria macrospora and Neurospora crassa as a means to improve genome annotation.

PubMed

Nowrousian, Minou; Würtz, Christian; Pöggeler, Stefanie; Kück, Ulrich

2004-03-01

One of the most challenging parts of large scale sequencing projects is the identification of functional elements encoded in a genome. Recently, studies of genomes of up to six different Saccharomyces species have demonstrated that a comparative analysis of genome sequences from closely related species is a powerful approach to identify open reading frames and other functional regions within genomes [Science 301 (2003) 71, Nature 423 (2003) 241]. Here, we present a comparison of selected sequences from Sordaria macrospora to their corresponding Neurospora crassa orthologous regions. Our analysis indicates that due to the high degree of sequence similarity and conservation of overall genomic organization, S. macrospora sequence information can be used to simplify the annotation of the N. crassa genome.
Multiplexed fragaria chloroplast genome sequencing

Treesearch

W. Njuguna; A. Liston; R. Cronn; N.V. Bassil

2010-01-01

A method to sequence multiple chloroplast genomes using ultra high throughput sequencing technologies was recently described. Complete chloroplast genome sequences can resolve phylogenetic relationships at low taxonomic levels and identify informative point mutations and indels. The objective of this research was to sequence multiple Fragaria...
Human Genome Sequencing in Health and Disease

PubMed Central

Gonzaga-Jauregui, Claudia; Lupski, James R.; Gibbs, Richard A.

2013-01-01

Following the “finished,” euchromatic, haploid human reference genome sequence, the rapid development of novel, faster, and cheaper sequencing technologies is making possible the era of personalized human genomics. Personal diploid human genome sequences have been generated, and each has contributed to our better understanding of variation in the human genome. We have consequently begun to appreciate the vastness of individual genetic variation from single nucleotide to structural variants. Translation of genome-scale variation into medically useful information is, however, in its infancy. This review summarizes the initial steps undertaken in clinical implementation of personal genome information, and describes the application of whole-genome and exome sequencing to identify the cause of genetic diseases and to suggest adjuvant therapies. Better analysis tools and a deeper understanding of the biology of our genome are necessary in order to decipher, interpret, and optimize clinical utility of what the variation in the human genome can teach us. Personal genome sequencing may eventually become an instrument of common medical practice, providing information that assists in the formulation of a differential diagnosis. We outline herein some of the remaining challenges. PMID:22248320
Targeted isolation, sequence assembly and characterization of two white spruce (Picea glauca) BAC clones for terpenoid synthase and cytochrome P450 genes involved in conifer defence reveal insights into a conifer genome

PubMed Central

2009-01-01

Background: Conifers are a large group of gymnosperm trees which are separated from the angiosperms by more than 300 million years of independent evolution. Conifer genomes are extremely large and contain considerable amounts of repetitive DNA. Currently, conifer sequence resources exist predominantly as expressed sequence tags (ESTs) and full-length (FL)cDNAs. There is no genome sequence available for a conifer or any other gymnosperm. Conifer defence-related genes often group into large families with closely related members. The goals of this study are to assess the feasibility of targeted isolation and sequence assembly of conifer BAC clones containing specific genes from two large gene families, and to characterize large segments of genomic DNA sequence for the first time from a conifer. Results: We used a PCR-based approach to identify BAC clones for two target genes, a terpene synthase (3-carene synthase; 3CAR) and a cytochrome P450 (CYP720B4) from a non-arrayed genomic BAC library of white spruce (Picea glauca). Shotgun genomic fragments isolated from the BAC clones were sequenced to a depth of 15.6- and 16.0-fold coverage, respectively. Assembly and manual curation yielded sequence scaffolds of 172 kbp (3CAR) and 94 kbp (CYP720B4) long. Inspection of the genomic sequences revealed the intron-exon structures, the putative promoter regions and putative cis-regulatory elements of these genes. Sequences related to transposable elements (TEs), high complexity repeats and simple repeats were prevalent and comprised approximately 40% of the sequenced genomic DNA. An in silico simulation of the effect of sequencing depth on the quality of the sequence assembly provides direction for future efforts of conifer genome sequencing. Conclusion: We report the first targeted cloning, sequencing, assembly, and annotation of large segments of genomic DNA from a conifer. We demonstrate that genomic BAC clones for individual members of multi-member gene families can be isolated in a gene-specific fashion. The results of the present work provide important new information about the structure and content of conifer genomic DNA that will guide future efforts to sequence and assemble conifer genomes. PMID:19656416
Targeted isolation, sequence assembly and characterization of two white spruce (Picea glauca) BAC clones for terpenoid synthase and cytochrome P450 genes involved in conifer defence reveal insights into a conifer genome.

PubMed

Hamberger, Björn; Hall, Dawn; Yuen, Mack; Oddy, Claire; Hamberger, Britta; Keeling, Christopher I; Ritland, Carol; Ritland, Kermit; Bohlmann, Jörg

2009-08-06

Conifers are a large group of gymnosperm trees which are separated from the angiosperms by more than 300 million years of independent evolution. Conifer genomes are extremely large and contain considerable amounts of repetitive DNA. Currently, conifer sequence resources exist predominantly as expressed sequence tags (ESTs) and full-length (FL)cDNAs. There is no genome sequence available for a conifer or any other gymnosperm. Conifer defence-related genes often group into large families with closely related members. The goals of this study are to assess the feasibility of targeted isolation and sequence assembly of conifer BAC clones containing specific genes from two large gene families, and to characterize large segments of genomic DNA sequence for the first time from a conifer. We used a PCR-based approach to identify BAC clones for two target genes, a terpene synthase (3-carene synthase; 3CAR) and a cytochrome P450 (CYP720B4) from a non-arrayed genomic BAC library of white spruce (Picea glauca). Shotgun genomic fragments isolated from the BAC clones were sequenced to a depth of 15.6- and 16.0-fold coverage, respectively. Assembly and manual curation yielded sequence scaffolds of 172 kbp (3CAR) and 94 kbp (CYP720B4) long. Inspection of the genomic sequences revealed the intron-exon structures, the putative promoter regions and putative cis-regulatory elements of these genes. Sequences related to transposable elements (TEs), high complexity repeats and simple repeats were prevalent and comprised approximately 40% of the sequenced genomic DNA. An in silico simulation of the effect of sequencing depth on the quality of the sequence assembly provides direction for future efforts of conifer genome sequencing. We report the first targeted cloning, sequencing, assembly, and annotation of large segments of genomic DNA from a conifer. We demonstrate that genomic BAC clones for individual members of multi-member gene families can be isolated in a gene-specific fashion. The results of the present work provide important new information about the structure and content of conifer genomic DNA that will guide future efforts to sequence and assemble conifer genomes.
Whole-genome sequencing in bacteriology: state of the art

PubMed Central

Dark, Michael J

2013-01-01

Over the last ten years, genome sequencing capabilities have expanded exponentially. There have been tremendous advances in sequencing technology, DNA sample preparation, genome assembly, and data analysis. This has led to advances in a number of facets of bacterial genomics, including metagenomics, clinical medicine, bacterial archaeology, and bacterial evolution. This review examines the strengths and weaknesses of techniques in bacterial genome sequencing, upcoming technologies, and assembly techniques, as well as highlighting recent studies that highlight new applications for bacterial genomics. PMID:24143115
A web-based genomic sequence database for the Streptomycetaceae: a tool for systematics and genome mining

USDA-ARS's Scientific Manuscript database

The ARS Microbial Genome Sequence Database (http://199.133.98.43), a web-based database server, was established utilizing the BIGSdb (Bacterial Isolate Genomics Sequence Database) software package, developed at Oxford University, as a tool to manage multi-locus sequence data for the family Streptomy...

Complete genome sequence of Clavibacter michiganensis subsp. insidiosus R1-1 using PacBio single-molecule real-time technology

USDA-ARS's Scientific Manuscript database

We report the complete genome sequence of Clavibacter michiganensis subsp. insidiosus R1-1 isolated in Minnesota, USA. The R1-1 genome, generated by de novo assembly of PacBio sequencing data, is the first complete genome sequence available for this subspecies....
Draft Genome Sequence of a Rare Smut Relative, Tilletiaria anomala UBC 951

DOE PAGES

Toome, Merje; Kuo, Alan; Henrissat, Bernard; ...

2014-06-12

We present the draft genome sequence of the smut fungus Tilletiaria anomala UBC 951 (Basidiomycota, Ustilaginomycotina). The sequenced genome size is 18.7 Mb, consisting of 289 scaffolds and a total of 6,810 predicted genes. This is the first genome sequence published for a fungus in the order Georgefisheriales (Exobasidiomycetes).
Reducing assembly complexity of microbial genomes with single-molecule sequencing.

PubMed

Koren, Sergey; Harhay, Gregory P; Smith, Timothy P L; Bono, James L; Harhay, Dayna M; Mcvey, Scott D; Radune, Diana; Bergman, Nicholas H; Phillippy, Adam M

2013-01-01

The short reads output by first- and second-generation DNA sequencing instruments cannot completely reconstruct microbial chromosomes. Therefore, most genomes have been left unfinished due to the significant resources required to manually close gaps in draft assemblies. Third-generation, single-molecule sequencing addresses this problem by greatly increasing sequencing read length, which simplifies the assembly problem. To measure the benefit of single-molecule sequencing on microbial genome assembly, we sequenced and assembled the genomes of six bacteria and analyzed the repeat complexity of 2,267 complete bacteria and archaea. Our results indicate that the majority of known bacterial and archaeal genomes can be assembled without gaps, at finished-grade quality, using a single PacBio RS sequencing library. These single-library assemblies are also more accurate than typical short-read assemblies and hybrid assemblies of short and long reads. Automated assembly of long, single-molecule sequencing data reduces the cost of microbial finishing to $1,000 for most genomes, and future advances in this technology are expected to drive the cost lower. This is expected to increase the number of completed genomes, improve the quality of microbial genome databases, and enable high-fidelity, population-scale studies of pan-genomes and chromosomal organization.
Insights into Conifer Giga-Genomes

PubMed Central

De La Torre, Amanda R.; Birol, Inanc; Bousquet, Jean; Ingvarsson, Pär K.; Jansson, Stefan; Jones, Steven J.M.; Keeling, Christopher I.; MacKay, John; Nilsson, Ove; Ritland, Kermit; Street, Nathaniel; Yanchuk, Alvin; Zerbe, Philipp; Bohlmann, Jörg

2014-01-01

Insights from sequenced genomes of major land plant lineages have advanced research in almost every aspect of plant biology. Until recently, however, assembled genome sequences of gymnosperms have been missing from this picture. Conifers of the pine family (Pinaceae) are a group of gymnosperms that dominate large parts of the world’s forests. Despite their ecological and economic importance, conifers seemed long out of reach for complete genome sequencing, due in part to their enormous genome size (20–30 Gb) and the highly repetitive nature of their genomes. Technological advances in genome sequencing and assembly enabled the recent publication of three conifer genomes: white spruce (Picea glauca), Norway spruce (Picea abies), and loblolly pine (Pinus taeda). These genome sequences revealed distinctive features compared with other plant genomes and may represent a window into the past of seed plant genomes. This Update highlights recent advances, remaining challenges, and opportunities in light of the publication of the first conifer and gymnosperm genomes. PMID:25349325
Insights into conifer giga-genomes.

PubMed

De La Torre, Amanda R; Birol, Inanc; Bousquet, Jean; Ingvarsson, Pär K; Jansson, Stefan; Jones, Steven J M; Keeling, Christopher I; MacKay, John; Nilsson, Ove; Ritland, Kermit; Street, Nathaniel; Yanchuk, Alvin; Zerbe, Philipp; Bohlmann, Jörg

2014-12-01

Insights from sequenced genomes of major land plant lineages have advanced research in almost every aspect of plant biology. Until recently, however, assembled genome sequences of gymnosperms have been missing from this picture. Conifers of the pine family (Pinaceae) are a group of gymnosperms that dominate large parts of the world's forests. Despite their ecological and economic importance, conifers seemed long out of reach for complete genome sequencing, due in part to their enormous genome size (20-30 Gb) and the highly repetitive nature of their genomes. Technological advances in genome sequencing and assembly enabled the recent publication of three conifer genomes: white spruce (Picea glauca), Norway spruce (Picea abies), and loblolly pine (Pinus taeda). These genome sequences revealed distinctive features compared with other plant genomes and may represent a window into the past of seed plant genomes. This Update highlights recent advances, remaining challenges, and opportunities in light of the publication of the first conifer and gymnosperm genomes. © 2014 American Society of Plant Biologists. All Rights Reserved.
Draft genome sequence of an aflatoxigenic Aspergillus species, A. bombycis

USDA-ARS's Scientific Manuscript database

The genome of the A. bombycis Type strain was sequenced using a Personal Genome Machine, followed by annotation of its predicted genes. The genome size for A. bombycis was found to be approximately 37 Mb and contained 12,266 genes. This announcement introduces a sequenced genome for an aflatoxigenic...
Resequencing of the common marmoset genome improves genome assemblies and gene-coding sequence analysis.

PubMed

Sato, Kengo; Kuroki, Yoko; Kumita, Wakako; Fujiyama, Asao; Toyoda, Atsushi; Kawai, Jun; Iriki, Atsushi; Sasaki, Erika; Okano, Hideyuki; Sakakibara, Yasubumi

2015-11-20

The first draft of the common marmoset (Callithrix jacchus) genome was published by the Marmoset Genome Sequencing and Analysis Consortium. The draft was based on whole-genome shotgun sequencing, and the current assembly version is Callithrix_jacches-3.2.1, but there still exist 187,214 undetermined gap regions and supercontigs and relatively short contigs that are unmapped to chromosomes in the draft genome. We performed resequencing and assembly of the genome of common marmoset by deep sequencing with high-throughput sequencing technology. Several different sequence runs using Illumina sequencing platforms were executed, and 181 Gbp of high-quality bases including mate-pairs with long insert lengths of 3, 8, 20, and 40 Kbp were obtained, that is, approximately 60× coverage. The resequencing significantly improved the MGSAC draft genome sequence. The N50 of the contigs, which is a statistical measure used to evaluate assembly quality, doubled. As a result, 51% of the contigs (total length: 299 Mbp) that were unmapped to chromosomes in the MGSAC draft were merged with chromosomal contigs, and the improved genome sequence helped to detect 5,288 new genes that are homologous to human cDNAs and the gaps in 5,187 transcripts of the Ensembl gene annotations were completely filled.
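The two assembly figures quoted in this abstract, contig N50 and fold coverage, can be illustrated with a minimal Python sketch. The function names and the toy contig lengths are hypothetical, and the genome size of about 3 Gbp is an assumption inferred from the stated "approximately 60×" rather than a value given in the record.

```python
def n50(lengths):
    """Return the N50: the length L such that contigs of length >= L
    together contain at least half of all assembled bases."""
    if not lengths:
        raise ValueError("need at least one contig length")
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length


def fold_coverage(total_bases, genome_size):
    """Average sequencing depth: sequenced bases divided by genome size."""
    return total_bases / genome_size


# Toy contig lengths (hypothetical, not from the marmoset assembly).
toy_contigs = [80, 70, 50, 40, 30, 20]
print(n50(toy_contigs))

# 181 Gbp of reads over an ASSUMED genome size of about 3 Gbp.
print(round(fold_coverage(181e9, 3.0e9)))
```

Under these assumptions the toy assembly has 290 bases in total, so the running sum first reaches half (145) at the second-longest contig. Merging contigs raises N50 because fewer, longer contigs are needed to reach half of the assembly.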
CXCR3 chemokine receptor-induced chemotaxis in human airway epithelial cells: role of p38 MAPK and PI3K signaling pathways.

PubMed

Shahabuddin, Syed; Ji, Rong; Wang, Ping; Brailoiu, Eugene; Dun, Na; Yang, Yi; Aksoy, Mark O; Kelsen, Steven G

2006-07-01

Human airway epithelial cells (HAEC) constitutively express the CXC chemokine receptor CXCR3, which regulates epithelial cell movement. In diseases such as chronic obstructive pulmonary disease and asthma, characterized by denudation of the epithelial lining, epithelial cell migration may contribute to airway repair and reconstitution. This study compared the potency and efficacy of three CXCR3 ligands, I-TAC/CXCL11, IP-10/CXCL10, and Mig/CXCL9, as inducers of chemotaxis in HAEC and examined the underlying signaling pathways involved. Studies were performed in cultured HAEC from normal subjects and the 16-HBE cell line. In normal HAEC, the efficacy of I-TAC-induced chemotaxis was 349 +/- 88% (mean +/- SE) of the medium control and approximately one-half the response to epidermal growth factor, a highly potent chemoattractant. In normal HAEC, Mig, IP-10, and I-TAC induced chemotaxis with similar potency and a rank order of efficacy of I-TAC = IP-10 > Mig. Preincubation with pertussis toxin completely blocked CXCR3-induced migration. Of interest, intracellular [Ca(2+)] did not rise in response to I-TAC, IP-10, or Mig. I-TAC induced a rapid phosphorylation (5-10 min) of two of the three MAPKs, i.e., p38 and ERK1/2. Pretreatment of HAEC with the p38 inhibitor SB 20358 or the PI3K inhibitor wortmannin dose-dependently inhibited the chemotactic response to I-TAC. In contrast, the ERK1/2 inhibitor U0126 had no effect on chemotaxis. These data indicate that in HAEC, CXCR3-mediated chemotaxis involves a G protein, which activates both the p38 MAPK and PI3K pathways in a calcium-independent fashion.
Lymphoid follicle cells in chronic obstructive pulmonary disease overexpress the chemokine receptor CXCR3.

PubMed

Kelsen, Steven G; Aksoy, Mark O; Georgy, Mary; Hershman, Richard; Ji, Rong; Li, Xiuxia; Hurford, Matthew; Solomides, Charalambos; Chatila, Wissam; Kim, Victor

2009-05-01

The mechanisms underlying formation of lung lymphoid follicles (LF) in chronic obstructive pulmonary disease (COPD) are unknown. The chemokine receptor CXCR3 regulates immune responses in secondary lymphoid structures elsewhere in the body and is highly expressed by Th1 lymphocytes in the airway in COPD. Because chemokine receptors control inflammatory cell homing to inflamed tissue, we reasoned that CXCR3 may contribute to LF formation in COPD. We assessed the expression of CXCR3 and its ligands (IP-10/CXCL10, Mig/CXCL9, and ITAC/CXCL11) by LF cells in never-smokers, smokers without COPD, and subjects with COPD. CXCR3, IP-10, Mig, and ITAC expression were assessed in lung sections from 46 subjects (never-smokers, smokers without COPD [S], and subjects with COPD in GOLD stages 1-4) by immunohistochemistry. CXCR3-expressing T cells (CD8+ or CD4+) and B cells (CD20+) were topographically distributed at the follicle periphery and center, respectively. The percentage of immunohistochemically identified CXCR3+ cells increased progressively while proceeding from S through GOLD 3-4 (P < 0.01 for GOLD 3-4 vs. S). Moreover, the number of CXCR3+ follicular cells correlated inversely with FEV(1) (r = 0.60). The CXCR3 ligands IP-10 and Mig were expressed by several cell types in and around the follicle, including CD68+ dendritic cells/ macrophages, airway epithelial cells, endothelial cells, and T and B cells. These results suggest that LF form in the COPD lung by recruitment and/or retention of CXCR3-expressing T and B lymphocytes, which are attracted to the region through production of CXCR3 ligands IP-10 and Mig by lung structural and follicular cells.
Lymphoid Follicle Cells in Chronic Obstructive Pulmonary Disease Overexpress the Chemokine Receptor CXCR3

PubMed Central

Kelsen, Steven G.; Aksoy, Mark O.; Georgy, Mary; Hershman, Richard; Ji, Rong; Li, XiuXia; Hurford, Matthew; Solomides, Charalambos; Chatila, Wissam; Kim, Victor

2009-01-01

Rationale: The mechanisms underlying formation of lung lymphoid follicles (LF) in chronic obstructive pulmonary disease (COPD) are unknown. The chemokine receptor CXCR3 regulates immune responses in secondary lymphoid structures elsewhere in the body and is highly expressed by Th1 lymphocytes in the airway in COPD. Because chemokine receptors control inflammatory cell homing to inflamed tissue, we reasoned that CXCR3 may contribute to LF formation in COPD. Objectives: We assessed the expression of CXCR3 and its ligands (IP-10/CXCL10, Mig/CXCL9, and ITAC/CXCL11) by LF cells in never-smokers, smokers without COPD, and subjects with COPD. Methods: CXCR3, IP-10, Mig, and ITAC expression were assessed in lung sections from 46 subjects (never-smokers, smokers without COPD [S], and subjects with COPD in GOLD stages 1–4) by immunohistochemistry. Measurements and Main Results: CXCR3-expressing T cells (CD8+ or CD4+) and B cells (CD20+) were topographically distributed at the follicle periphery and center, respectively. The percentage of immunohistochemically identified CXCR3+ cells increased progressively while proceeding from S through GOLD 3–4 (P < 0.01 for GOLD 3–4 vs. S). Moreover, the number of CXCR3+ follicular cells correlated inversely with FEV1 (r = 0.60). The CXCR3 ligands IP-10 and Mig were expressed by several cell types in and around the follicle, including CD68+ dendritic cells/ macrophages, airway epithelial cells, endothelial cells, and T and B cells. Conclusions: These results suggest that LF form in the COPD lung by recruitment and/or retention of CXCR3-expressing T and B lymphocytes, which are attracted to the region through production of CXCR3 ligands IP-10 and Mig by lung structural and follicular cells. PMID:19218194
Microinvasive Glaucoma Stent (MIGS) Surgery With Concomitant Phakoemulsification Cataract Extraction: Outcomes and the Learning Curve.

PubMed

Al-Mugheiry, Toby S; Cate, Heidi; Clark, Allan; Broadway, David C

2017-07-01

To evaluate learning effects with respect to outcomes of a microinvasive glaucoma stent (MIGS) inserted during cataract surgery in glaucoma patients. Single surgeon, observational cohort study of 25 consecutive Ivantis Hydrus microstent insertions, with a minimum follow-up of 12 months. A learning curve analysis was performed by assessing hypotensive effect, adverse effects, and surgical procedure duration, with respect to consecutive case number. Success was defined with respect to various intraocular pressure (IOP) targets (21, 18, 15 mm Hg) and reduction in required antiglaucoma medications. Complete success was defined as achieving target IOP without antiglaucoma therapy. No clinically significant adverse events or learning effects were identified, although surgical time reduced with consecutive case number. Mean follow-up was 16.8 months. At final follow-up the mean IOP for all eyes was reduced from 18.1 (±3.6) mm Hg [and a simulated untreated value of 25.9 (±5.2) mm Hg] to 15.3 (±2.2) mm Hg (P=0.007; <0.0001) and the mean number of topical antiglaucoma medications was reduced from 1.96 (±0.96) to 0.04 (±0.20) (P<0.0001). Complete success (IOP<21 mm Hg, no medications) was 96% at final follow-up. Complete success (IOP<18 mm Hg, no medications) was 80% at final follow-up, but only 32% with a target IOP of <15 mm Hg (no medications). No significant learning curve effects were observed for a trained surgeon with respect to MIGS microstent insertion performed at the time of cataract surgery. Adjunctive MIGS surgery was successful in lowering IOP to <18 mm Hg and reducing/abolishing the requirement for antiglaucoma medication in eyes with open-angle glaucoma, but less successful at achieving low IOP levels (<15 mm Hg).
[Arc spectrum diagnostic and heat coupling mechanism analysis of double wire pulsed MIG welding].

PubMed

Liu, Yong-qiang; Li, Huan; Yang, Li-jun; Zheng, Kai; Gao, Ying

2015-01-01

A double-wire pulsed MIG welding test system was built to analyze the heat-coupling mechanism of double-wire pulsed MIG welding and to study the arc temperature field. A spectroscopic technique was used for diagnostic analysis of the arc: plasma radiation was collected with the hollow probe method to obtain the arc plasma optical signal. The electron temperature of the double-wire pulsed MIG welding arc plasma was calculated with the Boltzmann diagram method, the electron temperature distribution was obtained, and a comprehensive analysis of the arc was conducted in combination with high-speed camera imaging and electrical signal acquisition. The innovation of this paper is the combination of high-speed camera images of the arc with the optical signal of the arc plasma to analyze the coupling mechanism of the dual arc, which allowed a more intuitive analysis of the arc temperature field. The test results showed that a push-pull output was achieved and that the droplet transfer mode was one drop per pulse during welding. The two arcs attracted each other under the action of the magnetic field and shifted toward the center of the arc during welding, so a new heat center formed at the geometric center of the double arc, and an upward-flow phenomenon occurred in the arc. The dual-arc electron temperature showed an overall inverted V-shaped distribution; at the geometric center of the double arc, the arc electron temperature at 3 mm off the workpiece surface was the highest, at 16,887.66 K, about 4,900 K higher than the lowest temperature of 11,963.63 K.
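The abstract names the Boltzmann diagram method but gives no details. As a rough illustration only, the textbook form of the method plots ln(I·λ / (g·A)) against the upper-level energy E for several emission lines; the slope of the fitted line is −1/(k_B·T_e). The sketch below assumes that form. The line parameters are hypothetical placeholders, not the spectral lines used by the authors, and the intensities are synthetic values generated from a chosen temperature.

```python
import math

K_B_EV = 8.617333262e-5  # Boltzmann constant in eV/K


def boltzmann_plot_temperature(lines):
    """Estimate electron temperature (K) from emission lines.

    Each line is (intensity, wavelength_nm, g_upper, A_ki, E_upper_eV).
    Fits y = ln(I * lambda / (g * A)) against x = E_upper by least squares;
    the slope equals -1 / (k_B * T_e).
    """
    lines = list(lines)
    if len(lines) < 2:
        raise ValueError("at least two lines are needed for a fit")
    xs = [e for (_, _, _, _, e) in lines]
    ys = [math.log(i * lam / (g * a)) for (i, lam, g, a, _) in lines]
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return -1.0 / (K_B_EV * slope)


def synthetic_lines(temperature_k, levels):
    """Generate synthetic intensities I = (g * A / lambda) * exp(-E / (k_B * T))."""
    return [
        (g * a / lam * math.exp(-e / (K_B_EV * temperature_k)), lam, g, a, e)
        for (lam, g, a, e) in levels
    ]


# Hypothetical (wavelength_nm, g_upper, A_ki, E_upper_eV) values for illustration.
LEVELS = [(400.0, 3, 1.0e7, 13.0), (500.0, 5, 2.0e7, 14.0), (600.0, 7, 5.0e6, 15.0)]

if __name__ == "__main__":
    estimate = boltzmann_plot_temperature(synthetic_lines(15000.0, LEVELS))
    print(f"estimated electron temperature: {estimate:.1f} K")
```

With real spectra, the intensities would come from calibrated measurements and the g, A, and E values from an atomic line database.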
Near-Infrared 1064 nm Laser Modulates Migratory Dendritic Cells To Augment the Immune Response to Intradermal Influenza Vaccine.

PubMed

Morse, Kaitlyn; Kimizuka, Yoshifumi; Chan, Megan P K; Shibata, Mai; Shimaoka, Yusuke; Takeuchi, Shu; Forbes, Benjamin; Nirschl, Christopher; Li, Binghao; Zeng, Yang; Bronson, Roderick T; Katagiri, Wataru; Shigeta, Ayako; Sîrbulescu, Ruxandra F; Chen, Huabiao; Tan, Rhea Y Y; Tsukada, Kosuke; Brauns, Timothy; Gelfand, Jeffrey; Sluder, Ann; Locascio, Joseph J; Poznansky, Mark C; Anandasabapathy, Niroshana; Kashiwagi, Satoshi

2017-08-15

Brief exposure of skin to near-infrared (NIR) laser light has been shown to augment the immune response to intradermal vaccination and thus act as an immunologic adjuvant. Although evidence indicates that the NIR laser adjuvant has the capacity to activate innate subsets including dendritic cells (DCs) in skin as conventional adjuvants do, the precise immunological mechanism by which the NIR laser adjuvant acts is largely unknown. In this study we sought to identify the cellular target of the NIR laser adjuvant by using an established mouse model of intradermal influenza vaccination and examining the alteration of responses resulting from genetic ablation of specific DC populations. We found that a continuous wave (CW) NIR laser adjuvant broadly modulates migratory DC (migDC) populations, specifically increasing and activating the Lang + and CD11b - Lang - subsets in skin, and that the Ab responses augmented by the CW NIR laser are dependent on DC subsets expressing CCR2 and Langerin. In comparison, a pulsed wave NIR laser adjuvant showed limited effects on the migDC subsets. Our vaccination study demonstrated that the efficacy of the CW NIR laser is significantly better than that of the pulsed wave laser, indicating that the CW NIR laser offers a desirable immunostimulatory microenvironment for migDCs. These results demonstrate the unique ability of the NIR laser adjuvant to selectively target specific migDC populations in skin depending on its parameters, and highlight the importance of optimization of laser parameters for desirable immune protection induced by an NIR laser-adjuvanted vaccine. Copyright © 2017 by The American Association of Immunologists, Inc.
Partial Shotgun Sequencing of the Boechera stricta Genome Reveals Extensive Microsynteny and Promoter Conservation with Arabidopsis

PubMed Central

Windsor, Aaron J.; Schranz, M. Eric; Formanová, Nataša; Gebauer-Jung, Steffi; Bishop, John G.; Schnabelrauch, Domenica; Kroymann, Juergen; Mitchell-Olds, Thomas

2006-01-01

Comparative genomics provides insight into the evolutionary dynamics that shape discrete sequences as well as whole genomes. To advance comparative genomics within the Brassicaceae, we have end sequenced 23,136 medium-sized insert clones from Boechera stricta, a wild relative of Arabidopsis (Arabidopsis thaliana). A significant proportion of these sequences, 18,797, are nonredundant and display highly significant similarity (BLASTn e-value ≤ 10^−30) to low copy number Arabidopsis genomic regions, including more than 9,000 annotated coding sequences. We have used this dataset to identify orthologous gene pairs in the two species and to perform a global comparison of DNA regions 5′ to annotated coding regions. On average, the 500 nucleotides upstream to coding sequences display 71.4% identity between the two species. In a similar analysis, 61.4% identity was observed between 5′ noncoding sequences of Brassica oleracea and Arabidopsis, indicating that regulatory regions are not as diverged among these lineages as previously anticipated. By mapping the B. stricta end sequences onto the Arabidopsis genome, we have identified nearly 2,000 conserved blocks of microsynteny (bracketing 26% of the Arabidopsis genome). A comparison of fully sequenced B. stricta inserts to their homologous Arabidopsis genomic regions indicates that indel polymorphisms >5 kb contribute substantially to the genome size difference observed between the two species. Further, we demonstrate that microsynteny inferred from end-sequence data can be applied to the rapid identification and cloning of genomic regions of interest from nonmodel species. These results suggest that among diploid relatives of Arabidopsis, small- to medium-scale shotgun sequencing approaches can provide rapid and cost-effective benefits to evolutionary and/or functional comparative genomic frameworks. PMID:16607030
CpGAVAS, an integrated web server for the annotation, visualization, analysis, and GenBank submission of completely sequenced chloroplast genome sequences

PubMed Central

2012-01-01

Background: The complete sequences of chloroplast genomes provide a wealth of information regarding the evolutionary history of species. With the advance of next-generation sequencing technology, the number of completely sequenced chloroplast genomes is expected to increase exponentially, so powerful computational tools for annotating these genome sequences are urgently needed. Results: We have developed a web server, CPGAVAS. The server accepts a complete chloroplast genome sequence as input. First, it predicts protein-coding and rRNA genes based on the identification and mapping of the most similar, full-length protein, cDNA and rRNA sequences by integrating results from the Blastx, Blastn, protein2genome and est2genome programs. Second, tRNA genes and inverted repeats (IR) are identified using tRNAscan, ARAGORN and vmatch, respectively. Third, it calculates the summary statistics for the annotated genome. Fourth, it generates a circular map ready for publication. Fifth, it can create a Sequin file for GenBank submission. Last, it allows the extraction of protein and mRNA sequences for a given list of genes and species. The annotation results in GFF3 format can be edited using any compatible annotation editing tool. The edited annotations can then be uploaded to CPGAVAS for repeated updating and re-analysis. Using known chloroplast genome sequences as a test set, we show that CPGAVAS performs comparably to another application, DOGMA, while having several superior functionalities. Conclusions: CPGAVAS allows the semi-automatic and complete annotation of a chloroplast genome sequence, and the visualization, editing and analysis of the annotation results. It will become an indispensable tool for researchers studying chloroplast genomes. The software is freely accessible from http://www.herbalgenomics.org/cpgavas. PMID:23256920
Identification of optimum sequencing depth especially for de novo genome assembly of small genomes using next generation sequencing data.

PubMed

Desai, Aarti; Marwah, Veer Singh; Yadav, Akshay; Jha, Vineet; Dhaygude, Kishor; Bangar, Ujwala; Kulkarni, Vivek; Jere, Abhay

2013-01-01

Next Generation Sequencing (NGS) is a disruptive technology that has found widespread acceptance in the life sciences research community. The high throughput and low cost of sequencing have encouraged researchers to undertake ambitious genomic projects, especially in de novo genome sequencing. Currently, NGS systems generate sequence data as short reads, and de novo genome assembly using these short reads is computationally very intensive. Due to lower cost of sequencing and higher throughput, NGS systems now provide the ability to sequence genomes at high depth. However, currently no report is available highlighting the impact of high sequence depth on genome assembly using real data sets and multiple assembly algorithms. Recently, some studies have evaluated the impact of sequence coverage, error rate and average read length on genome assembly using multiple assembly algorithms; however, these evaluations were performed using simulated datasets. One limitation of using simulated datasets is that variables such as error rates, read length and coverage, which are known to impact genome assembly, are carefully controlled. Hence, this study was undertaken to identify the minimum depth of sequencing required for de novo assembly for different sized genomes using graph based assembly algorithms and real datasets. Illumina reads for E. coli (4.6 Mb), S. kudriavzevii (11.18 Mb) and C. elegans (100 Mb) were assembled using SOAPdenovo, Velvet, ABySS, Meraculous and IDBA-UD. Our analysis shows that 50X is the optimum read depth for assembling these genomes using all assemblers except Meraculous, which requires 100X read depth. Moreover, our analysis shows that de novo assembly from 50X read data requires only 6-40 GB RAM depending on the genome size and assembly algorithm used. We believe that this information can be extremely valuable for researchers in designing experiments and multiplexing, which will enable optimum utilization of sequencing as well as analysis resources.
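To make the depth figures concrete, the usual relation is depth = (number of reads × read length) / genome size. The sketch below applies it to the three genome sizes quoted in the abstract at the reported 50X optimum. The 100 bp read length is an assumption chosen for illustration; the abstract does not state the read length used.

```python
import math


def sequencing_depth(n_reads: int, read_length_bp: int, genome_size_bp: int) -> float:
    """Average depth of coverage: total sequenced bases divided by genome size."""
    return n_reads * read_length_bp / genome_size_bp


def reads_needed(genome_size_bp: int, target_depth: float, read_length_bp: int) -> int:
    """Smallest number of reads that reaches the target average depth."""
    return math.ceil(genome_size_bp * target_depth / read_length_bp)


# Genome sizes as quoted in the abstract (in base pairs).
GENOMES = {
    "E. coli": 4_600_000,
    "S. kudriavzevii": 11_180_000,
    "C. elegans": 100_000_000,
}

ASSUMED_READ_LENGTH_BP = 100  # assumption for illustration, not from the abstract

if __name__ == "__main__":
    for name, size in GENOMES.items():
        n = reads_needed(size, 50, ASSUMED_READ_LENGTH_BP)
        print(f"{name}: {n:,} reads of {ASSUMED_READ_LENGTH_BP} bp for 50X")
```

The same function gives the read budget per sample when planning multiplexing: divide the expected reads per lane by `reads_needed(...)` for the genome in question.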
Large-scale contamination of microbial isolate genomes by Illumina PhiX control.

PubMed

Mukherjee, Supratim; Huntemann, Marcel; Ivanova, Natalia; Kyrpides, Nikos C; Pati, Amrita

2015-01-01

With the rapid growth and development of sequencing technologies, genomes have become the new go-to for exploring solutions to some of the world's biggest challenges such as searching for alternative energy sources and exploration of genomic dark matter. However, progress in sequencing has been accompanied by its share of errors that can occur during template or library preparation, sequencing, imaging or data analysis. In this study we screened over 18,000 publicly available microbial isolate genome sequences in the Integrated Microbial Genomes database and identified more than 1000 genomes that are contaminated with PhiX, a control frequently used during Illumina sequencing runs. Approximately 10% of these genomes have been published in literature and 129 contaminated genomes were sequenced under the Human Microbiome Project. Raw sequence reads are prone to contamination from various sources, and contaminants are usually eliminated during downstream quality control steps. Detection of PhiX contaminated genomes indicates a lapse in either the application or effectiveness of proper quality control measures. The presence of PhiX contamination in several publicly available isolate genomes can result in additional errors when such data are used in comparative genomics analyses. Such contamination of public databases has far-reaching consequences in the form of erroneous data interpretation and analyses, and necessitates better measures to proofread raw sequences before releasing them to the broader scientific community.
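The abstract does not say how the screening was done. Purely as an illustration of one simple way a control sequence could be flagged in assembled contigs, the sketch below indexes the k-mers of a reference on both strands and reports contigs whose k-mers mostly fall in that index. This is not the authors' method; the function names, the threshold, and the toy sequences are all hypothetical, and a real screen would load the actual PhiX174 reference from a FASTA file.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def revcomp(seq: str) -> str:
    """Reverse complement of an uppercase DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def kmers(seq: str, k: int) -> set:
    """All distinct substrings of length k."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def build_index(reference: str, k: int) -> set:
    """k-mers of the reference on both strands."""
    reference = reference.upper()
    return kmers(reference, k) | kmers(revcomp(reference), k)


def matched_fraction(contig: str, index: set, k: int) -> float:
    """Fraction of the contig's k-mer positions found in the index."""
    contig = contig.upper()
    total = len(contig) - k + 1
    if total <= 0:
        return 0.0
    hits = sum(1 for i in range(total) if contig[i:i + k] in index)
    return hits / total


def flag_contigs(contigs: dict, reference: str, k: int = 21, threshold: float = 0.5) -> list:
    """Names of contigs whose matched k-mer fraction reaches the threshold."""
    index = build_index(reference, k)
    return [name for name, seq in contigs.items()
            if matched_fraction(seq, index, k) >= threshold]


if __name__ == "__main__":
    # Toy stand-in for a control sequence; NOT the real PhiX174 genome.
    toy_reference = "ACGTTGCAAGGCTTAACCGGTTAGCATCGATCGGATTACA"
    toy_contigs = {
        "contig_fwd": toy_reference[5:35],
        "contig_rev": revcomp(toy_reference),
        "contig_clean": "T" * 30,
    }
    print(flag_contigs(toy_contigs, toy_reference, k=8))
```

Exact k-mer matching is deliberately strict; an alignment-based search would tolerate sequencing errors better, at higher cost.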
The Complete Chloroplast and Mitochondrial Genome Sequences of Boea hygrometrica: Insights into the Evolution of Plant Organellar Genomes

PubMed Central

Wang, Xumin; Deng, Xin; Zhang, Xiaowei; Hu, Songnian; Yu, Jun

2012-01-01

The complete nucleotide sequences of the chloroplast (cp) and mitochondrial (mt) genomes of the resurrection plant Boea hygrometrica (Bh, Gesneriaceae) have been determined, with lengths of 153,493 bp and 510,519 bp, respectively. The smaller chloroplast genome contains more genes (147) with a 72% coding sequence, and the larger mitochondrial genome has fewer genes (65) with a coding fraction of 12%. Similar to other seed plants, the Bh cp genome has a typical quadripartite organization with a conserved gene in each region. The Bh mt genome has three recombinant sequence repeats of 222 bp, 843 bp, and 1474 bp in length, which divide the genome into a single master circle (MC) and four isomeric molecules. Compared to other angiosperms, one remarkable feature of the Bh mt genome is the frequent transfer of genetic material from the cp genome during recent Bh evolution. We also analyzed organellar genome evolution in general regarding genome features as well as compositional dynamics of sequence and gene structure/organization, providing clues for the understanding of the evolution of organellar genomes in plants. The cp-derived sequences including tRNAs found in angiosperm mt genomes support the conclusion that frequent gene transfer events may have begun early in the land plant lineage. PMID:22291979
Comparative genomics approach to detecting split-coding regions in a low-coverage genome: lessons from the chimaera Callorhinchus milii (Holocephali, Chondrichthyes).

PubMed

Dessimoz, Christophe; Zoller, Stefan; Manousaki, Tereza; Qiu, Huan; Meyer, Axel; Kuraku, Shigehiro

2011-09-01

Recent development of deep sequencing technologies has facilitated de novo genome sequencing projects, now conducted even by individual laboratories. However, this will yield more and more genome sequences that are not well assembled, and will hinder thorough annotation when no closely related reference genome is available. One of the challenging issues is the identification of protein-coding sequences split into multiple unassembled genomic segments, which can confound orthology assignment and various laboratory experiments requiring the identification of individual genes. In this study, using the genome of a cartilaginous fish, Callorhinchus milii, as test case, we performed gene prediction using a model specifically trained for this genome. We implemented an algorithm, designated ESPRIT, to identify possible linkages between multiple protein-coding portions derived from a single genomic locus split into multiple unassembled genomic segments. We developed a validation framework based on an artificially fragmented human genome, improvements between early and recent mouse genome assemblies, comparison with experimentally validated sequences from GenBank, and phylogenetic analyses. Our strategy provided insights into practical solutions for efficient annotation of only partially sequenced (low-coverage) genomes. To our knowledge, our study is the first formulation of a method to link unassembled genomic segments based on proteomes of relatively distantly related species as references.
Comparative genomics approach to detecting split-coding regions in a low-coverage genome: lessons from the chimaera Callorhinchus milii (Holocephali, Chondrichthyes)

PubMed Central

Zoller, Stefan; Manousaki, Tereza; Qiu, Huan; Meyer, Axel; Kuraku, Shigehiro

2011-01-01

Recent development of deep sequencing technologies has facilitated de novo genome sequencing projects, now conducted even by individual laboratories. However, this will yield more and more genome sequences that are not well assembled, and will hinder thorough annotation when no closely related reference genome is available. One of the challenging issues is the identification of protein-coding sequences split into multiple unassembled genomic segments, which can confound orthology assignment and various laboratory experiments requiring the identification of individual genes. In this study, using the genome of a cartilaginous fish, Callorhinchus milii, as test case, we performed gene prediction using a model specifically trained for this genome. We implemented an algorithm, designated ESPRIT, to identify possible linkages between multiple protein-coding portions derived from a single genomic locus split into multiple unassembled genomic segments. We developed a validation framework based on an artificially fragmented human genome, improvements between early and recent mouse genome assemblies, comparison with experimentally validated sequences from GenBank, and phylogenetic analyses. Our strategy provided insights into practical solutions for efficient annotation of only partially sequenced (low-coverage) genomes. To our knowledge, our study is the first formulation of a method to link unassembled genomic segments based on proteomes of relatively distantly related species as references. PMID:21712341

Exploration of the Drosophila buzzatii transposable element content suggests underestimation of repeats in Drosophila genomes.

PubMed

Rius, Nuria; Guillén, Yolanda; Delprat, Alejandra; Kapusta, Aurélie; Feschotte, Cédric; Ruiz, Alfredo

2016-05-10

Many new Drosophila genomes have been sequenced in recent years using new-generation sequencing platforms and assembly methods. Transposable elements (TEs), being repetitive sequences, are often misassembled, especially in the genomes sequenced with short reads. Consequently, the mobile fraction of many of the new genomes has not been analyzed in detail or compared with that of other genomes sequenced with different methods, which could shed light on genome and TE evolution. Here we compare the TE content of three genomes: D. buzzatii st-1, j-19, and D. mojavensis. We have sequenced a new D. buzzatii genome (j-19) that complements the D. buzzatii reference genome (st-1) already published, and compared their TE contents with that of D. mojavensis. We found an underestimation of TE sequences in Drosophila genus NGS-genomes when compared to Sanger-genomes. To be able to compare genomes sequenced with different technologies, we developed a coverage-based method and applied it to the D. buzzatii st-1 and j-19 genomes. Between 10.85 and 11.16 % of the D. buzzatii st-1 genome is made up of TEs, between 7 and 7.5 % of the D. buzzatii j-19 genome, while TEs represent 15.35 % of the D. mojavensis genome. Helitrons are the most abundant order in the three genomes. TEs in D. buzzatii are less abundant than in D. mojavensis, as expected according to the genome size and TE content positive correlation. However, TEs alone do not explain the genome size difference. TEs accumulate in the dot chromosomes and proximal regions of D. buzzatii and D. mojavensis chromosomes. We also report a significantly higher TE density in D. buzzatii and D. mojavensis X chromosomes, which is not expected under the current models. Our easy-to-use correction method allowed us to identify recently active families in D. buzzatii st-1 belonging to the LTR-retrotransposon superfamily Gypsy.
Genomic treasure troves: complete genome sequencing of herbarium and insect museum specimens.

PubMed

Staats, Martijn; Erkens, Roy H J; van de Vossenberg, Bart; Wieringa, Jan J; Kraaijeveld, Ken; Stielow, Benjamin; Geml, József; Richardson, James E; Bakker, Freek T

2013-01-01

Unlocking the vast genomic diversity stored in natural history collections would create unprecedented opportunities for genome-scale evolutionary, phylogenetic, domestication and population genomic studies. Many researchers have been discouraged from using historical specimens in molecular studies because of both generally limited success of DNA extraction and the challenges associated with PCR-amplifying highly degraded DNA. In today's next-generation sequencing (NGS) world, opportunities and prospects for historical DNA have changed dramatically, as most NGS methods are actually designed for taking short fragmented DNA molecules as templates. Here we show that using a standard multiplex and paired-end Illumina sequencing approach, genome-scale sequence data can be generated reliably from dry-preserved plant, fungal and insect specimens collected up to 115 years ago, and with minimal destructive sampling. Using a reference-based assembly approach, we were able to produce the entire nuclear genome of a 43-year-old Arabidopsis thaliana (Brassicaceae) herbarium specimen with high and uniform sequence coverage. Nuclear genome sequences of three fungal specimens of 22-82 years of age (Agaricus bisporus, Laccaria bicolor, Pleurotus ostreatus) were generated with 81.4-97.9% exome coverage. Complete organellar genome sequences were assembled for all specimens. Using de novo assembly we retrieved between 16.2-71.0% of coding sequence regions, and hence remain somewhat cautious about prospects for de novo genome assembly from historical specimens. Non-target sequence contaminations were observed in 2 of our insect museum specimens. We anticipate that future museum genomics projects will perhaps not generate entire genome sequences in all cases (our specimens contained relatively small and low-complexity genomes), but at least generating vital comparative genomic data for testing (phylo)genetic, demographic and genetic hypotheses, that become increasingly more horizontal. Furthermore, NGS of historical DNA enables recovering crucial genetic information from old type specimens that to date have remained mostly unutilized and, thus, opens up a new frontier for taxonomic research as well.
Draft Sequences of the Radish (Raphanus sativus L.) Genome

PubMed Central

Kitashiba, Hiroyasu; Li, Feng; Hirakawa, Hideki; Kawanabe, Takahiro; Zou, Zhongwei; Hasegawa, Yoichi; Tonosaki, Kaoru; Shirasawa, Sachiko; Fukushima, Aki; Yokoi, Shuji; Takahata, Yoshihito; Kakizaki, Tomohiro; Ishida, Masahiko; Okamoto, Shunsuke; Sakamoto, Koji; Shirasawa, Kenta; Tabata, Satoshi; Nishio, Takeshi

2014-01-01

Radish (Raphanus sativus L., n = 9) is one of the major vegetables in Asia. Since the genomes of Brassica and related species including radish underwent genome rearrangement, it is quite difficult to perform functional analysis based on the reported genomic sequence of Brassica rapa. Therefore, we performed genome sequencing of radish. Short reads of genomic sequences of 191.1 Gb were obtained by next-generation sequencing (NGS) for a radish inbred line, and 76,592 scaffolds of ≥300 bp were constructed along with the bacterial artificial chromosome-end sequences. Finally, the whole draft genomic sequence of 402 Mb spanning 75.9% of the estimated genomic size and containing 61,572 predicted genes was obtained. Subsequently, 221 single nucleotide polymorphism markers and 768 PCR-RFLP markers were used together with the 746 markers produced in our previous study for the construction of a linkage map. The map was combined further with another radish linkage map constructed mainly with expressed sequence tag-simple sequence repeat markers into a high-density integrated map of 1,166 cM with 2,553 DNA markers. A total of 1,345 scaffolds were assigned to the linkage map, spanning 116.0 Mb. Bulked PCR products amplified by 2,880 primer pairs were sequenced by NGS, and SNPs in eight inbred lines were identified. PMID:24848699
Detection of a divergent variant of grapevine virus F by next-generation sequencing.

PubMed

Molenaar, Nicholas; Burger, Johan T; Maree, Hans J

2015-08-01

The complete genome sequence of a South African isolate of grapevine virus F (GVF) is presented. It was first detected by metagenomic next-generation sequencing of field samples and validated through direct Sanger sequencing. The genome sequence of GVF isolate V5 consists of 7539 nucleotides and contains a poly(A) tail. It has a typical vitivirus genome arrangement that comprises five open reading frames (ORFs), which share only 88.96 % nucleotide sequence identity with the existing complete GVF genome sequence (JX105428).
Dissection of the Octoploid Strawberry Genome by Deep Sequencing of the Genomes of Fragaria Species

PubMed Central

Hirakawa, Hideki; Shirasawa, Kenta; Kosugi, Shunichi; Tashiro, Kosuke; Nakayama, Shinobu; Yamada, Manabu; Kohara, Mistuyo; Watanabe, Akiko; Kishida, Yoshie; Fujishiro, Tsunakazu; Tsuruoka, Hisano; Minami, Chiharu; Sasamoto, Shigemi; Kato, Midori; Nanri, Keiko; Komaki, Akiko; Yanagi, Tomohiro; Guoxin, Qin; Maeda, Fumi; Ishikawa, Masami; Kuhara, Satoru; Sato, Shusei; Tabata, Satoshi; Isobe, Sachiko N.

2014-01-01

Cultivated strawberry (Fragaria x ananassa) is octoploid and shows allogamous behaviour. The present study aims at dissecting this octoploid genome through comparison with its wild relatives, F. iinumae, F. nipponica, F. nubicola, and F. orientalis, by de novo whole-genome sequencing on Illumina and Roche 454 platforms. The total length of the assembled Illumina genome sequences obtained was 698 Mb for F. x ananassa, and ∼200 Mb each for the four wild species. Subsequently, a virtual reference genome termed FANhybrid_r1.2 was constructed by integrating the sequences of the four homoeologous subgenomes of F. x ananassa, from which heterozygous regions in the Roche 454 and Illumina genome sequences were eliminated. The total length of FANhybrid_r1.2 thus created was 173.2 Mb with an N50 length of 5137 bp. The Illumina-assembled genome sequences of F. x ananassa and the four wild species were then mapped onto the reference genome, along with the previously published F. vesca genome sequence, to establish the subgenomic structure of F. x ananassa. The strategy adopted in this study has turned out to be successful in dissecting the genome of octoploid F. x ananassa and appears promising when applied to the analysis of other polyploid plant species. PMID:24282021
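For readers unfamiliar with the N50 statistic quoted above, a minimal sketch follows. It uses the common definition: sort sequence lengths from longest to shortest and report the length at which the running total first reaches half of the assembly size. The example lengths are made up for illustration and have nothing to do with FANhybrid_r1.2.

```python
def n50(lengths):
    """Length L such that sequences of length >= L cover at least half the total."""
    lengths = list(lengths)
    if not lengths:
        raise ValueError("n50 is undefined for an empty assembly")
    total = sum(lengths)
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        # Compare 2 * running with total to stay in integer arithmetic.
        if 2 * running >= total:
            return length


if __name__ == "__main__":
    example_lengths = [2, 3, 4, 5, 6, 7, 8, 9, 10]  # hypothetical contig lengths
    print(n50(example_lengths))
```

An N50 of 5137 bp therefore means that half of the 173.2 Mb is contained in sequences at least 5137 bp long.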
The first genome sequence of a metatherian herpesvirus: Macropodid herpesvirus 1.

PubMed

Vaz, Paola K; Mahony, Timothy J; Hartley, Carol A; Fowler, Elizabeth V; Ficorilli, Nino; Lee, Sang W; Gilkerson, James R; Browning, Glenn F; Devlin, Joanne M

2016-01-22

While many placental herpesvirus genomes have been fully sequenced, the complete genome of a marsupial herpesvirus has not been described. Here we present the first genome sequence of a metatherian herpesvirus, Macropodid herpesvirus 1 (MaHV-1). The MaHV-1 viral genome was sequenced using an Illumina MiSeq sequencer, de novo assembly was performed and the genome was annotated. The MaHV-1 genome was 140 kbp in length and clustered phylogenetically with the primate simplexviruses, sharing 67% nucleotide sequence identity with Human herpesviruses 1 and 2. The MaHV-1 genome contained 66 predicted open reading frames (ORFs) homologous to those in other herpesvirus genomes, but lacked homologues of UL3, UL4, UL56 and glycoprotein J. This is the first alphaherpesvirus genome that has been found to lack the UL3 and UL4 homologues. We identified six novel ORFs and confirmed their transcription by RT-PCR. This is the first genome sequence of a herpesvirus that infects metatherians, a taxonomically unique mammalian clade. Members of the Simplexvirus genus are remarkably conserved, so the absence of ORFs otherwise retained in eutherian and avian alphaherpesviruses contributes to our understanding of the Alphaherpesvirinae. Further study of metatherian herpesvirus genetics and pathogenesis provides a unique approach to understanding herpesvirus-mammalian interactions.
Complete Genome Sequence of Clavibacter michiganensis subsp. insidiosus R1-1 Using PacBio Single-Molecule Real-Time Technology

PubMed Central

Lu, You; Samac, Deborah A.; Glazebrook, Jane

2015-01-01

We report here the complete genome sequence of Clavibacter michiganensis subsp. insidiosus R1-1, isolated in Minnesota, USA. The R1-1 genome, generated by a de novo assembly of PacBio sequencing data, is the first complete genome sequence available for this subspecies. PMID:25953184
Complete Genome Sequences of Two Vesicular Stomatitis Virus Isolates Collected in Mexico.

PubMed

Velazquez-Salinas, Lauro; Isa, Pavel; Pauszek, Steven J; Rodriguez, Luis L

2017-09-14

We report two full-genome sequences of vesicular stomatitis New Jersey virus (VSNJV) obtained by Illumina next-generation sequencing of RNA isolated from epithelial suspensions of cattle naturally infected in Mexico. These genomes represent the first full-genome sequences of vesicular stomatitis New Jersey viruses circulating in Mexico deposited in the GenBank database.
Genome Sequences of Pseudomonas spp. Isolated from Cereal Crops

PubMed Central

Stiller, Jiri; Covarelli, Lorenzo; Lindeberg, Magdalen; Shivas, Roger G.; Manners, John M.

2013-01-01

Compared to those of dicot-infecting bacteria, the available genome sequences of bacteria that infect wheat and barley are limited. Herein, we report the draft genome sequences of four pseudomonads originally isolated from these cereals. These genome sequences provide a useful resource for comparative analyses within the genus and for cross-kingdom analyses of plant pathogenesis. PMID:23661484
Rhipicephalus microplus dataset of nonredundant raw sequence reads from 454 GS FLX sequencing of Cot-selected (Cot = 660) genomic DNA

USDA-ARS's Scientific Manuscript database

A reassociation kinetics-based approach was used to reduce the complexity of genomic DNA from the Deutsch laboratory strain of the cattle tick, Rhipicephalus microplus, to facilitate genome sequencing. Selected genomic DNA (Cot value = 660) was sequenced using 454 GS FLX technology, resulting in 356...
Complete genome sequence of the Antarctic Halorubrum lacusprofundi type strain ACAM 34

DOE PAGES

Anderson, Iain J.; DasSarma, Priya; Lucas, Susan; ...

2016-09-10

Halorubrum lacusprofundi is an extreme halophile within the archaeal phylum Euryarchaeota. The type strain ACAM 34 was isolated from Deep Lake, Antarctica. H. lacusprofundi is of phylogenetic interest because it is distantly related to the haloarchaea that have previously been sequenced. It is also of interest because of its psychrotolerance. We report here the complete genome sequence of H. lacusprofundi type strain ACAM 34 and its annotation. In conclusion, this genome is part of a 2006 Joint Genome Institute Community Sequencing Program project to sequence genomes of diverse Archaea.
Complete genome sequence of the Antarctic Halorubrum lacusprofundi type strain ACAM 34

DOE Office of Scientific and Technical Information (OSTI.GOV)

Anderson, Iain J.; DasSarma, Priya; Lucas, Susan

Halorubrum lacusprofundi is an extreme halophile within the archaeal phylum Euryarchaeota. The type strain ACAM 34 was isolated from Deep Lake, Antarctica. H. lacusprofundi is of phylogenetic interest because it is distantly related to the haloarchaea that have previously been sequenced. It is also of interest because of its psychrotolerance. We report here the complete genome sequence of H. lacusprofundi type strain ACAM 34 and its annotation. In conclusion, this genome is part of a 2006 Joint Genome Institute Community Sequencing Program project to sequence genomes of diverse Archaea.
Rhipicephalus (Boophilus) microplus strain Deutsch, whole genome shotgun sequencing project first submission of genome sequence

USDA-ARS's Scientific Manuscript database

The size and repetitive nature of the Rhipicephalus microplus genome makes obtaining a full genome sequence difficult. Cot filtration/selection techniques were used to reduce the repetitive fraction of the tick genome and enrich for the fraction of DNA with gene-containing regions. The Cot-selected ...
Genome-wide copy number variation in the bovine genome detected using low coverage sequence of popular beef breeds

USDA-ARS's Scientific Manuscript database

Genomic structural variations are an important source of genetic diversity. Copy number variations (CNVs), gains and losses of large regions of genomic sequence between individuals of a species, are known to be associated with both diseases and phenotypic traits. Deeply sequenced genomes are often u...
A new strategy for genome assembly using short sequence reads and reduced representation libraries.

PubMed

Young, Andrew L; Abaan, Hatice Ozel; Zerbino, Daniel; Mullikin, James C; Birney, Ewan; Margulies, Elliott H

2010-02-01

We have developed a novel approach for using massively parallel short-read sequencing to generate fast and inexpensive de novo genomic assemblies comparable to those generated by capillary-based methods. The ultrashort (<100 base) sequences generated by this technology pose specific biological and computational challenges for de novo assembly of large genomes. To account for this, we devised a method for experimentally partitioning the genome using reduced representation (RR) libraries prior to assembly. We use two restriction enzymes independently to create a series of overlapping fragment libraries, each containing a tractable subset of the genome. Together, these libraries allow us to reassemble the entire genome without the need for a reference sequence. As proof of concept, we applied this approach to sequence and assemble the majority of the 125-Mb Drosophila melanogaster genome. We subsequently demonstrate the accuracy of our assembly method with meaningful comparisons against the currently available D. melanogaster reference genome (dm3). The ease of assembly and accuracy for comparative genomics suggest that our approach will scale to future mammalian genome-sequencing efforts, saving both time and money without sacrificing quality.
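The partitioning step described in this abstract can be illustrated with a small in-silico digest. This is a minimal sketch under stated assumptions, not the authors' pipeline: the abstract does not name the enzymes or the size window, so the EcoRI site (GAATTC, cut after the first G), the toy sequence, and the 5 to 8 base window below are purely illustrative, and the function names are hypothetical.

```python
def digest(sequence, site, cut_offset):
    """Cut `sequence` at every occurrence of `site`.

    The cut falls `cut_offset` bases into the recognition site.
    Returns the fragments in their original order.
    """
    cut_positions = []
    start = sequence.find(site)
    while start != -1:
        cut_positions.append(start + cut_offset)
        start = sequence.find(site, start + 1)

    fragments = []
    previous = 0
    for cut in cut_positions:
        fragments.append(sequence[previous:cut])
        previous = cut
    fragments.append(sequence[previous:])
    return fragments


def size_select(fragments, min_len, max_len):
    """Keep only fragments inside the chosen size window (inclusive)."""
    return [f for f in fragments if min_len <= len(f) <= max_len]


# Toy example: EcoRI recognises GAATTC and cuts between G and AATTC.
toy_genome = "AAGAATTCTTTTGAATTCCC"
fragments = digest(toy_genome, "GAATTC", 1)
library = size_select(fragments, 5, 8)
print(fragments)
print(library)
```

Each size window from each enzyme yields a different subset of the genome; repeating the digest with a second enzyme gives fragments that overlap those of the first, which is what lets the separate assemblies be joined.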
Multi-platform next-generation sequencing of the domestic turkey (Meleagris gallopavo) genome assembly and analysis

USDA-ARS's Scientific Manuscript database

Next-generation sequencing technologies were used to rapidly and efficiently sequence the genome of the domestic turkey (Meleagris gallopavo). The current genome assembly (~1.1 Gb) includes 917 Mb of sequence assigned to chromosomes. Innate heterozygosity of the sequenced bird allowed discovery of...
Population-Sequencing as a Biomarker of Burkholderia mallei and Burkholderia pseudomallei Evolution through Microbial Forensic Analysis.

PubMed

Jakupciak, John P; Wells, Jeffrey M; Karalus, Richard J; Pawlowski, David R; Lin, Jeffrey S; Feldman, Andrew B

2013-01-01

Large-scale genomics projects are identifying biomarkers to detect human disease. B. pseudomallei and B. mallei are two closely related select agents that cause melioidosis and glanders. Accurate characterization of metagenomic samples is dependent on accurate measurements of genetic variation between isolates with resolution down to strain level. Often single biomarker sensitivity is augmented by use of multiple or panels of biomarkers. In parallel with single biomarker validation, advances in DNA sequencing enable analysis of entire genomes in a single run: population-sequencing. Potentially, direct sequencing could be used to analyze an entire genome to serve as the biomarker for genome identification. However, genome variation and population diversity complicate use of direct sequencing, as well as differences caused by sample preparation protocols including sequencing artifacts and mistakes. As part of a Department of Homeland Security program in bacterial forensics, we examined how to implement whole genome sequencing (WGS) analysis as a judicially defensible forensic method for attributing microbial sample relatedness; and also to determine the strengths and limitations of whole genome sequence analysis in a forensics context. Herein, we demonstrate use of sequencing to provide genetic characterization of populations: direct sequencing of populations.
Population-Sequencing as a Biomarker of Burkholderia mallei and Burkholderia pseudomallei Evolution through Microbial Forensic Analysis

PubMed Central

Jakupciak, John P.; Wells, Jeffrey M.; Karalus, Richard J.; Pawlowski, David R.; Lin, Jeffrey S.; Feldman, Andrew B.

2013-01-01

Large-scale genomics projects are identifying biomarkers to detect human disease. B. pseudomallei and B. mallei are two closely related select agents that cause melioidosis and glanders. Accurate characterization of metagenomic samples is dependent on accurate measurements of genetic variation between isolates with resolution down to strain level. Often single biomarker sensitivity is augmented by use of multiple or panels of biomarkers. In parallel with single biomarker validation, advances in DNA sequencing enable analysis of entire genomes in a single run: population-sequencing. Potentially, direct sequencing could be used to analyze an entire genome to serve as the biomarker for genome identification. However, genome variation and population diversity complicate use of direct sequencing, as well as differences caused by sample preparation protocols including sequencing artifacts and mistakes. As part of a Department of Homeland Security program in bacterial forensics, we examined how to implement whole genome sequencing (WGS) analysis as a judicially defensible forensic method for attributing microbial sample relatedness; and also to determine the strengths and limitations of whole genome sequence analysis in a forensics context. Herein, we demonstrate use of sequencing to provide genetic characterization of populations: direct sequencing of populations. PMID:24455204
Genomic sequencing of Pleistocene cave bears

DOE Office of Scientific and Technical Information (OSTI.GOV)

Noonan, James P.; Hofreiter, Michael; Smith, Doug

2005-04-01

Despite the information content of genomic DNA, ancient DNA studies to date have largely been limited to amplification of mitochondrial DNA due to technical hurdles such as contamination and degradation of ancient DNAs. In this study, we describe two metagenomic libraries constructed using unamplified DNA extracted from the bones of two 40,000-year-old extinct cave bears. Analysis of ~1 Mb of sequence from each library showed that, despite significant microbial contamination, 5.8 percent and 1.1 percent of clones in the libraries contain cave bear inserts, yielding 26,861 bp of cave bear genome sequence. Alignment of this sequence to the dog genome, the closest sequenced genome to cave bear in terms of evolutionary distance, revealed roughly the expected ratio of cave bear exons, repeats and conserved noncoding sequences. Only 0.04 percent of all clones sequenced were derived from contamination with modern human DNA. Comparison of cave bear with orthologous sequences from several modern bear species revealed the evolutionary relationship of these lineages. Using the metagenomic approach described here, we have recovered substantial quantities of mammalian genomic sequence more than twice as old as any previously reported, establishing the feasibility of ancient DNA genomic sequencing programs.
Genome-wide comparative analysis of four Indian Drosophila species.

PubMed

Mohanty, Sujata; Khanna, Radhika

2017-12-01

Comparative analysis of multiple genomes of closely or distantly related Drosophila species undoubtedly creates excitement among evolutionary biologists in exploring the genomic changes with an ecology and evolutionary perspective. We present herewith the de novo assembled whole genome sequences of four Drosophila species, D. bipectinata, D. takahashii, D. biarmipes and D. nasuta of Indian origin using Next Generation Sequencing technology on an Illumina platform along with their detailed assembly statistics. The comparative genomics analysis, e.g. gene predictions and annotations, functional and orthogroup analysis of coding sequences and genome wide SNP distribution were performed. The whole genome of Zaprionus indianus of Indian origin published earlier by us and the genome sequences of previously sequenced 12 Drosophila species available in the NCBI database were included in the analysis. The present work is a part of our ongoing genomics project of Indian Drosophila species.

Complete Genomic Sequence and Comparative Analysis of the Genome Segments of Sweet Potato Chlorotic Stunt Virus in China

PubMed Central

Qin, Yanhong; Wang, Li; Zhang, Zhenchen; Qiao, Qi; Zhang, Desheng; Tian, Yuting; Wang, Shuang; Wang, Yongjiang; Yan, Zhaoling

2014-01-01

Background: Sweet potato chlorotic stunt virus (family Closteroviridae, genus Crinivirus) features a large bipartite, single-stranded, positive-sense RNA genome. To date, only three complete genomic sequences of SPCSV can be accessed through GenBank. SPCSV was first detected in China in 2011, but only partial genomic sequences have been determined in the country. No report on the complete genomic sequence and genome structure of Chinese SPCSV isolates or the genetic relation between isolates from China and other countries is available. Methodology/Principal Findings: The complete genomic sequences of five isolates from different areas in China were characterized. This study is the first to report the complete genome sequences of SPCSV from whitefly vectors. Genome structure analysis showed that isolates of WA and EA strains from China have the same coding protein as isolates Can181-9 and m2-47, respectively. Twenty cp genes and four RNA1 partial segments were sequenced and analyzed, and the nucleotide identities of complete genomic, cp, and RNA1 partial sequences were determined. Results indicated high conservation among strains and significant differences between WA and EA strains. Genetic analysis demonstrated that, except for isolates from Guangdong Province, SPCSVs from other areas belong to the WA strain. Genome organization analysis showed that the isolates in this study lack the p22 gene. Conclusions/Significance: We presented the complete genome sequences of SPCSV in China. Comparison of nucleotide identities and genome structures between these isolates and previously reported isolates showed slight differences. The nucleotide identities of different SPCSV isolates showed high conservation among strains and significant differences between strains. All nine isolates in this study lacked the p22 gene. WA strains were more extensively distributed than EA strains in China. These data provide important insights into the molecular variation and genomic structure of SPCSV in China as well as genetic relationships among isolates from China and other countries. PMID:25170926
GFinisher: a new strategy to refine and finish bacterial genome assemblies

NASA Astrophysics Data System (ADS)

Guizelini, Dieval; Raittz, Roberto T.; Cruz, Leonardo M.; Souza, Emanuel M.; Steffens, Maria B. R.; Pedrosa, Fabio O.

2016-10-01

Despite developments in DNA sequencing technology that have improved the number and length of reads, the reconstruction of complete genome sequences, the so-called genome assembly, is still complex. Only 13% of the prokaryotic genome sequencing projects have been completed. Draft genome sequences deposited in public databases are fragmented in contigs and may lack the full gene complement. The aim of the present work is to identify assembly errors and improve the assembly process of bacterial genomes. The biological patterns observed in genomic sequences and the application of a priori information can allow the identification of misassembled regions, and the reorganization and improvement of the overall de novo genome assembly. GFinisher starts by generating fuzzy GC skew graphs for each contig in an assembly and then breaks the contigs at critical points in order to reassemble and close them using jFGap. This has been successfully applied to datasets from 96 genome assemblies, decreasing the number of contigs by up to 86%. GFinisher can easily optimize assemblies of prokaryotic draft genomes and can be used to improve the assembly programs based on nucleotide sequence patterns in the genome. The software and source code are available at http://gfinisher.sourceforge.net/.
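As background for the GC skew graphs mentioned in this abstract, the sketch below computes the ordinary windowed GC skew, (G - C) / (G + C), and its cumulative sum along a contig. It is an illustration of the plain statistic only; the fuzzy variant and the breakpoint logic used by GFinisher are not described in the abstract and are not reproduced here. The function names, window size, and toy contig are assumptions.

```python
def gc_skew(window):
    """Return (G - C) / (G + C) for one window, or 0.0 if it has no G or C."""
    g = window.count("G")
    c = window.count("C")
    if g + c == 0:
        return 0.0
    return (g - c) / (g + c)


def windowed_skew(sequence, window_size, step):
    """GC skew for successive windows along the sequence."""
    last_start = len(sequence) - window_size
    return [gc_skew(sequence[i:i + window_size])
            for i in range(0, last_start + 1, step)]


def cumulative_skew(values):
    """Running sum of per-window skew values."""
    total = 0.0
    running = []
    for value in values:
        total += value
        running.append(total)
    return running


# Toy contig: G-rich first half, C-rich second half.
contig = "GGGCGGGCCCCGCCCG"
skews = windowed_skew(contig, 4, 4)
print(skews)
print(cumulative_skew(skews))
```

In many bacterial chromosomes the sign of the skew changes near the replication origin and terminus, so an unexpected sign change inside a contig is one kind of pattern that could flag a candidate misassembly for closer inspection.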
The Comprehensive Phytopathogen Genomics Resource: a web-based resource for data-mining plant pathogen genomes.

PubMed

Hamilton, John P; Neeno-Eckwall, Eric C; Adhikari, Bishwo N; Perna, Nicole T; Tisserat, Ned; Leach, Jan E; Lévesque, C André; Buell, C Robin

2011-01-01

The Comprehensive Phytopathogen Genomics Resource (CPGR) provides a web-based portal for plant pathologists and diagnosticians to view the genome and transcriptome sequence status of 806 bacterial, fungal, oomycete, nematode, viral and viroid plant pathogens. Tools are available to search and analyze annotated genome sequences of 74 bacterial, fungal and oomycete pathogens. Oomycete and fungal genomes are obtained directly from GenBank, whereas bacterial genome sequences are downloaded from the A Systematic Annotation Package (ASAP) database that provides curation of genomes using comparative approaches. Curated lists of bacterial genes relevant to pathogenicity and avirulence are also provided. The Plant Pathogen Transcript Assemblies Database provides annotated assemblies of the transcribed regions of 82 eukaryotic genomes from publicly available single pass Expressed Sequence Tags. Data-mining tools are provided along with tools to create candidate diagnostic markers, an emerging use for genomic sequence data in plant pathology. The Plant Pathogen Ribosomal DNA (rDNA) database is a resource for pathogens that lack genome or transcriptome data sets and contains 131 755 rDNA sequences from GenBank for 17 613 species identified as plant pathogens and related genera. Database URL: http://cpgr.plantbiology.msu.edu.
GFinisher: a new strategy to refine and finish bacterial genome assemblies.

PubMed

Guizelini, Dieval; Raittz, Roberto T; Cruz, Leonardo M; Souza, Emanuel M; Steffens, Maria B R; Pedrosa, Fabio O

2016-10-10

Despite developments in DNA sequencing technology that have improved the number and length of reads, the reconstruction of complete genome sequences, the so-called genome assembly, is still complex. Only 13% of the prokaryotic genome sequencing projects have been completed. Draft genome sequences deposited in public databases are fragmented in contigs and may lack the full gene complement. The aim of the present work is to identify assembly errors and improve the assembly process of bacterial genomes. The biological patterns observed in genomic sequences and the application of a priori information can allow the identification of misassembled regions, and the reorganization and improvement of the overall de novo genome assembly. GFinisher starts by generating fuzzy GC skew graphs for each contig in an assembly and then breaks the contigs at critical points in order to reassemble and close them using jFGap. This has been successfully applied to datasets from 96 genome assemblies, decreasing the number of contigs by up to 86%. GFinisher can easily optimize assemblies of prokaryotic draft genomes and can be used to improve the assembly programs based on nucleotide sequence patterns in the genome. The software and source code are available at http://gfinisher.sourceforge.net/.
Genome Analysis of the Domestic Dog (Korean Jindo) by Massively Parallel Sequencing

PubMed Central

Kim, Ryong Nam; Kim, Dae-Soo; Choi, Sang-Haeng; Yoon, Byoung-Ha; Kang, Aram; Nam, Seong-Hyeuk; Kim, Dong-Wook; Kim, Jong-Joo; Ha, Ji-Hong; Toyoda, Atsushi; Fujiyama, Asao; Kim, Aeri; Kim, Min-Young; Park, Kun-Hyang; Lee, Kang Seon; Park, Hong-Seog

2012-01-01

Although pioneering sequencing projects have shed light on the boxer and poodle genomes, a number of challenges need to be met before the sequencing and annotation of the dog genome can be considered complete. Here, we present the DNA sequence of the Jindo dog genome, sequenced to 45-fold average coverage using Illumina massively parallel sequencing technology. A comparison of the sequence to the reference boxer genome led to the identification of 4 675 437 single nucleotide polymorphisms (SNPs, including 3 346 058 novel SNPs), 71 642 indels and 8131 structural variations. Of these, 339 non-synonymous SNPs and 3 indels are located within coding sequences (CDS). In particular, 3 non-synonymous SNPs and a 26-bp deletion occur in the TCOF1 locus, implying that the difference observed in craniofacial morphology between Jindo and boxer dogs might be influenced by those variations. Through the annotation of the Jindo olfactory receptor gene family, we found 2 unique olfactory receptor genes and 236 olfactory receptor genes harbouring non-synonymous homozygous SNPs that are likely to affect smelling capability. In addition, we determined the DNA sequence of the Jindo dog mitochondrial genome and identified Jindo dog-specific mtDNA genotypes. These Jindo genome data upgrade our understanding of dog genomic architecture and will be a very valuable resource for investigating not only dog genetics and genomics but also human and dog disease genetics and comparative genomics. PMID:22474061
Whole genome sequence analysis of unidentified genetically modified papaya for development of a specific detection method.

PubMed

Nakamura, Kosuke; Kondo, Kazunari; Akiyama, Hiroshi; Ishigaki, Takumi; Noguchi, Akio; Katsumata, Hiroshi; Takasaki, Kazuto; Futo, Satoshi; Sakata, Kozue; Fukuda, Nozomi; Mano, Junichi; Kitta, Kazumi; Tanaka, Hidenori; Akashi, Ryo; Nishimaki-Mogami, Tomoko

2016-08-15

Identification of transgenic sequences in an unknown genetically modified (GM) papaya (Carica papaya L.) by whole genome sequence analysis was demonstrated. Whole genome sequence data were generated for a GM-positive fresh papaya fruit commodity detected in monitoring using real-time polymerase chain reaction (PCR). The sequences obtained were mapped against an open database for papaya genome sequence. Transgenic construct- and event-specific sequences were identified as a GM papaya developed to resist infection from a Papaya ringspot virus. Based on the transgenic sequences, a specific real-time PCR detection method for GM papaya applicable to various food commodities was developed. Whole genome sequence analysis enabled identifying unknown transgenic construct- and event-specific sequences in GM papaya and development of a reliable method for detecting them in papaya food commodities. Copyright © 2016 Elsevier Ltd. All rights reserved.
Coverage Bias and Sensitivity of Variant Calling for Four Whole-genome Sequencing Technologies

PubMed Central

Lasitschka, Bärbel; Jones, David; Northcott, Paul; Hutter, Barbara; Jäger, Natalie; Kool, Marcel; Taylor, Michael; Lichter, Peter; Pfister, Stefan; Wolf, Stephan; Brors, Benedikt; Eils, Roland

2013-01-01

The emergence of high-throughput, next-generation sequencing technologies has dramatically altered the way we assess genomes in population genetics and in cancer genomics. Currently, there are four commonly used whole-genome sequencing platforms on the market: Illumina’s HiSeq2000, Life Technologies’ SOLiD 4 and its completely redesigned 5500xl SOLiD, and Complete Genomics’ technology. A number of earlier studies have compared a subset of those sequencing platforms or compared those platforms with Sanger sequencing, which is prohibitively expensive for whole genome studies. Here we present a detailed comparison of the performance of all currently available whole genome sequencing platforms, especially regarding their ability to call SNVs and to evenly cover the genome and specific genomic regions. Unlike earlier studies, we base our comparison on four different samples, allowing us to assess the between-sample variation of the platforms. We find a pronounced GC bias in GC-rich regions for Life Technologies’ platforms, with Complete Genomics performing best here, while we see the least bias in GC-poor regions for HiSeq2000 and 5500xl. HiSeq2000 gives the most uniform coverage and displays the least sample-to-sample variation. In contrast, Complete Genomics exhibits by far the smallest fraction of bases not covered, while the SOLiD platforms reveal remarkable shortcomings, especially in covering CpG islands. When comparing the performance of the four platforms for calling SNPs, HiSeq2000 and Complete Genomics achieve the highest sensitivity, while the SOLiD platforms show the lowest false positive rate. Finally, we find that integrating sequencing data from different platforms offers the potential to combine the strengths of different technologies. In summary, our results detail the strengths and weaknesses of all four whole-genome sequencing platforms. They indicate application areas that call for a specific sequencing platform and disallow other platforms. This helps to identify the proper sequencing platform for whole genome studies with different application scopes. PMID:23776689
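The sensitivity and false positive comparisons in this abstract rest on counting agreements between a platform's calls and a reference set. The sketch below shows one common way to do that; it is an assumption about the bookkeeping, not the authors' evaluation code. The variant positions are invented, the function name is hypothetical, and the second metric is defined here as the fraction of calls that are absent from the reference set, which may differ from the definition used in the study.

```python
def snv_call_metrics(called, truth):
    """Compare called SNV positions against a reference ("truth") set.

    Both arguments are sets of (chromosome, position) tuples.
    Returns (sensitivity, false_call_fraction).
    """
    true_positives = len(called & truth)
    false_positives = len(called - truth)
    false_negatives = len(truth - called)

    found_or_missed = true_positives + false_negatives
    total_calls = true_positives + false_positives

    sensitivity = true_positives / found_or_missed if found_or_missed else 0.0
    false_call_fraction = false_positives / total_calls if total_calls else 0.0
    return sensitivity, false_call_fraction


# Invented positions, for illustration only.
truth = {("chr1", 100), ("chr1", 250), ("chr2", 40), ("chr2", 90)}
called = {("chr1", 100), ("chr2", 40), ("chr2", 90), ("chr3", 7)}

sensitivity, false_call_fraction = snv_call_metrics(called, truth)
print(sensitivity, false_call_fraction)
```

A platform can score well on one metric and poorly on the other, which is why the abstract reports them separately.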
Next Generation Semiconductor Based Sequencing of the Donkey (Equus asinus) Genome Provided Comparative Sequence Data against the Horse Genome and a Few Millions of Single Nucleotide Polymorphisms

PubMed Central

Bertolini, Francesca; Scimone, Concetta; Geraci, Claudia; Schiavo, Giuseppina; Utzeri, Valerio Joe; Chiofalo, Vincenzo; Fontanesi, Luca

2015-01-01

Few studies have investigated the donkey (Equus asinus) at the whole genome level so far. Here, we sequenced the genome of two male donkeys using a next generation semiconductor based sequencing platform (the Ion Proton sequencer) and compared the sequence information obtained with the available donkey draft genome (and the Illumina reads from which it originated) and with the EquCab2.0 assembly of the horse genome. Moreover, the Ion Torrent Personal Genome Analyzer was used to sequence reduced representation libraries (RRL) obtained from a DNA pool including donkeys of different breeds (Grigio Siciliano, Ragusano and Martina Franca). The number of next generation sequencing reads aligned with the EquCab2.0 horse genome was larger than the number aligned with the draft donkey genome. This was due to the larger N50 for contigs and scaffolds of the horse genome. Nucleotide divergence between E. caballus and E. asinus was estimated to be ~ 0.52-0.57%. Regions with low nucleotide divergence were identified in several autosomal chromosomes and in the whole chromosome X. These regions might be evolutionarily important in equids. Comparing Y-chromosome regions we identified variants that could be useful to track donkey paternal lineages. Moreover, about 4.8 million single nucleotide polymorphisms (SNPs) in the donkey genome were identified and annotated combining sequencing data from Ion Proton (whole genome sequencing) and Ion Torrent (RRL) runs with Illumina reads. A higher density of SNPs was present in regions homologous to horse chromosome 12, in which several studies reported a high frequency of copy number variants. The SNPs we identified constitute a first resource useful to describe variability at the population genomic level in E. asinus and to establish monitoring systems for the conservation of donkey genetic resources. PMID:26151450
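The N50 statistic invoked in this abstract is the length of the contig (or scaffold) at which the running total of lengths, taken from longest to shortest, first reaches half of the assembly size. A minimal sketch follows; the function name and the toy lengths are illustrative and are not values from the donkey or horse assemblies.

```python
def n50(lengths):
    """Return the N50 of a collection of contig or scaffold lengths."""
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length
    raise ValueError("n50() needs at least one length")


# Toy assembly of eight contigs totalling 32 units.
toy_lengths = [2, 2, 2, 3, 3, 4, 8, 8]
print(n50(toy_lengths))
```

A larger N50 means that more of the assembly sits in long, contiguous pieces, so a read is less likely to fall across a contig boundary and fail to align.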
Next Generation Semiconductor Based Sequencing of the Donkey (Equus asinus) Genome Provided Comparative Sequence Data against the Horse Genome and a Few Millions of Single Nucleotide Polymorphisms.

PubMed

Bertolini, Francesca; Scimone, Concetta; Geraci, Claudia; Schiavo, Giuseppina; Utzeri, Valerio Joe; Chiofalo, Vincenzo; Fontanesi, Luca

2015-01-01

Few studies have investigated the donkey (Equus asinus) at the whole genome level so far. Here, we sequenced the genome of two male donkeys using a next generation semiconductor based sequencing platform (the Ion Proton sequencer) and compared the sequence information obtained with the available donkey draft genome (and the Illumina reads from which it originated) and with the EquCab2.0 assembly of the horse genome. Moreover, the Ion Torrent Personal Genome Analyzer was used to sequence reduced representation libraries (RRL) obtained from a DNA pool including donkeys of different breeds (Grigio Siciliano, Ragusano and Martina Franca). The number of next generation sequencing reads aligned with the EquCab2.0 horse genome was larger than the number aligned with the draft donkey genome. This was due to the larger N50 for contigs and scaffolds of the horse genome. Nucleotide divergence between E. caballus and E. asinus was estimated to be ~ 0.52-0.57%. Regions with low nucleotide divergence were identified in several autosomal chromosomes and in the whole chromosome X. These regions might be evolutionarily important in equids. Comparing Y-chromosome regions we identified variants that could be useful to track donkey paternal lineages. Moreover, about 4.8 million single nucleotide polymorphisms (SNPs) in the donkey genome were identified and annotated combining sequencing data from Ion Proton (whole genome sequencing) and Ion Torrent (RRL) runs with Illumina reads. A higher density of SNPs was present in regions homologous to horse chromosome 12, in which several studies reported a high frequency of copy number variants. The SNPs we identified constitute a first resource useful to describe variability at the population genomic level in E. asinus and to establish monitoring systems for the conservation of donkey genetic resources.
Analysis of BAC end sequences in oak, a keystone forest tree species, providing insight into the composition of its genome

PubMed Central

2011-01-01

Background: One of the key goals of oak genomics research is to identify genes of adaptive significance. This information may help to improve the conservation of adaptive genetic variation and the management of forests to increase their health and productivity. Deep-coverage large-insert genomic libraries are a crucial tool for attaining this objective. We report herein the construction of a BAC library for Quercus robur, its characterization and an analysis of BAC end sequences. Results: The EcoRI library generated consisted of 92,160 clones, 7% of which had no insert. Levels of chloroplast and mitochondrial contamination were below 3% and 1%, respectively. Mean clone insert size was estimated at 135 kb. The library represents 12 haploid genome equivalents, and the likelihood of finding a particular oak sequence of interest is greater than 99%. Genome coverage was confirmed by PCR screening of the library with 60 unique genetic loci sampled from the genetic linkage map. In total, about 20,000 high-quality BAC end sequences (BESs) were generated by sequencing 15,000 clones. Roughly 5.88% of the combined BAC end sequence length corresponded to known retroelements while ab initio repeat detection methods identified 41 additional repeats. Collectively, characterized and novel repeats account for roughly 8.94% of the genome. Further analysis of the BESs revealed 1,823 putative genes suggesting at least 29,340 genes in the oak genome. BESs were aligned with the genome sequences of Arabidopsis thaliana, Vitis vinifera and Populus trichocarpa. One putative collinear microsyntenic region encoding an alcohol acyl transferase protein was observed between oak and chromosome 2 of V. vinifera. Conclusions: This BAC library provides a new resource for genomic studies, including SSR marker development, physical mapping, comparative genomics and genome sequencing. BES analysis provided insight into the structure of the oak genome. These sequences will be used in the assembly of a future genome sequence for oak. PMID:21645357
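The link between "12 haploid genome equivalents" and a likelihood "greater than 99%" in this abstract can be checked with the standard library-coverage approximation, P = 1 - e^(-c), where c is the coverage in genome equivalents. The abstract does not state which formula the authors used, so the sketch below is an assumption that happens to be consistent with the reported figure; the function name is hypothetical.

```python
import math


def probability_of_recovery(genome_equivalents):
    """Chance that a given locus is present in at least one clone.

    Uses the Poisson approximation P = 1 - exp(-c), which assumes that
    clones sample the genome uniformly and independently.
    """
    return 1.0 - math.exp(-genome_equivalents)


p_oak_library = probability_of_recovery(12)
print(f"{p_oak_library:.6f}")
```

Under this approximation the probability already passes 99% at roughly 4.6 genome equivalents (since ln 100 is about 4.6), so a 12-fold library leaves a wide margin for cloning bias and for regions that are hard to clone.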
T-joints of Ti alloys with hybrid laser-MIG welding: macro-graphic and micro-hardness analyses

NASA Astrophysics Data System (ADS)

Spina, R.; Sorgente, D.; Palumbo, G.; Scintilla, L. D.; Brandizzi, M.; Satriano, A. A.; Tricarico, L.

2012-03-01

Titanium alloys are characterized by high mechanical properties and elevated corrosion resistance. The combination of laser welding with MIG/GMAW has proven to improve the beneficial effects of both processes (keyhole, gap-bridging ability) while limiting their drawbacks (high thermal gradient, low mechanical resistance). In this paper, the hybrid Laser-GMAW welding of Ti-6Al-4V 3-mm thick sheets is investigated using a specifically designed trailing shield. The joint geometry was the double fillet welded T-joint. Bead morphologies, microstructures and mechanical properties (micro-hardness) of welds were evaluated and compared to those achieved for the base metals.
XEN Gel Implant: a new surgical approach in glaucoma.

PubMed

Chaudhary, Ankita; Salinas, Lauriane; Guidotti, Jacopo; Mermoud, André; Mansouri, Kaweh

2018-01-01

Glaucoma is a leading cause of blindness worldwide. Intraocular pressure (IOP) lowering is the only effective treatment strategy. Traditional glaucoma surgeries are generally considered to be unpredictable and associated with a high rate of complications. This has led to the development of a novel XEN Gel Implant, a type of minimally invasive glaucoma surgery (MIGS), lowering the IOP without extensive surgical dissection. Areas covered: A literature search was undertaken on PubMed using the terms XEN glaucoma, gelatin microstent, and MIGS. All the articles and case reports on XEN Gel Implant and selected articles on MIGS were studied and reviewed. We have discussed the results of most studies on XEN Gel Implant related to its efficacy, safety and success. Expert commentary: The XEN Gel Implant effectively lowers IOP and medication use, with a favorable safety profile. Long-term data on its success and cost-effectiveness are lacking. The studies have shown it to be without any serious adverse events and to have good safety profile encouraging future research on this novel implant. There is a need to correctly identify selection criteria for patients, who would benefit the most from the XEN Gel Implant.
Neuronal cell migration in C. elegans: regulation of Hox gene expression and cell position.

PubMed

Harris, J; Honigberg, L; Robinson, N; Kenyon, C

1996-10-01

In C. elegans, the Hox gene mab-5, which specifies the fates of cells in the posterior body region, has been shown to direct the migrations of certain cells within its domain of function. mab-5 expression switches on in the neuroblast QL as it migrates into the posterior body region. mab-5 activity is then required for the descendants of QL to migrate to posterior rather than anterior positions. What information activates Hox gene expression during this cell migration? How are these cells subsequently guided to their final positions? We address these questions by describing four genes, egl-20, mig-14, mig-1 and lin-17, that are required to activate expression of mab-5 during migration of the QL neuroblast. We find that two of these genes, egl-20 and mig-14, also act in a mab-5-independent way to determine the final stopping points of the migrating Q descendants. The Q descendants do not migrate toward any obvious physical targets in wild-type or mutant animals. Therefore, these genes appear to be part of a system that positions the migrating Q descendants along the anteroposterior axis.
Thermo-Mechanical Modeling of Laser-MIG Hybrid Welding (LMHW)

NASA Astrophysics Data System (ADS)

Kounde, Ludovic; Engel, Thierry; Bergheau, Jean-Michel; Boisselier, Didier

2011-01-01

Hybrid welding is a combination of two different technologies such as laser (Nd:YAG, CO2…) and electric arc welding (MIG, MAG / TIG …) developed to assemble thick metal sheets (over 3 mm) in order to reduce the required laser power. As a matter of fact, hybrid welding is also used in the welding of thin materials to benefit from process, deep penetration and gap limit. However, the thermo-mechanical behaviour of thin parts assembled by LMHW technology for railway car production is far from being controlled; modeling and simulation contribute to the assessment of the causes and effects of the thermo-mechanical behaviour in the assembled parts. In order to reproduce the morphology of melted and heat-affected zones, two analytic functions were combined to model the heat source of LMHW. On one hand, we applied a so-called "diaboloïd" (DB) which is a modified hyperboloid, based on experimental parameters and the analysis of the macrographs of the welds. On the other hand, we used a so-called "double ellipsoïd" (DE) which takes the MIG-only contribution, including the bead, into account. The comparison between experimental and numerical results shows good agreement.
Exposure of welders and other metal workers to ELF magnetic fields.

PubMed

Skotte, J H; Hjøllund, H I

1997-01-01

This study assessed exposure to extremely low frequency (ELF) magnetic fields of welders and other metal workers and compared exposure from different welding processes. Exposure to ELF magnetic fields was measured for 50 workers selected from a nationwide cohort of metal workers and 15 nonrandomly selected full-time welders in a shipyard. The measurements were carried out with personal exposure meters during 3 days of work for the metal workers and 1 day of work for the shipyard welders. To record a large dynamic range of ELF magnetic field values, the measurements were carried out with "high/low" pairs of personal exposure meters. Additional measurements of static magnetic fields at fixed positions close to welding installations were done with a Hall-effect fluxmeter. The total time of measurement was 1273 hours. The metal workers reported welding activity for 5.8% of the time, and the median of the work-period mean exposure to ELF magnetic fields was 0.18 microT. DC metal inert or active gas welding (MIG/MAG) was used 80% of the time for welding, and AC manual metal arc welding (MMA) was used 10% of the time. The shipyard welders reported welding activity for 56% of the time, and the median and maximum of the workday mean exposure to ELF magnetic fields were 4.70 and 27.5 microT, respectively. For full-shift welders the average workday mean was 21.2 microT for MMA welders and 2.3 microT for MIG/MAG welders. The average exposure during the effective time of welding was estimated to be 65 microT for the MMA welding process and 7 microT for the MIG/MAG welding process. The time of exposure above 1 microT was found to be a useful measure of the effective time of welding. Large differences in exposure to ELF magnetic fields were found between different groups of welders, depending on the welding process and effective time of welding. MMA (AC) welding caused roughly 10 times higher exposure to ELF magnetic fields compared with MIG/MAG (DC) welding. The measurements of static fields suggest that the combined exposure to static and ELF fields of MIG/MAG (DC) welders and the exposure to ELF fields of MMA (AC) welders are roughly of the same level.
Evaluation of occupational exposure to toxic metals released in the process of aluminum welding.

PubMed

Matczak, Wanda; Gromiec, Jan

2002-04-01

The objective of this study was to evaluate occupational exposure to welding fumes and its elements on aluminum welders in Polish industry. The study included 52 MIG/Al fume samples and 18 TIG/Al samples in 3 plants. Air samples were collected in the breathing zone of welders (total and respirable dust). Dust concentration was determined gravimetrically, and the elements in the collected dust were determined by AAS. Mean time-weighted average (TWA) concentrations of the welding dusts/fumes and their components in the breathing zone obtained for different welding processes were, in mg/m3: MIG/Al fumes mean 6.0 (0.8-17.8), Al 2.1 (0.1-7.7), Mg 0.2 (< 0.1-0.9), Mn 0.014 (0.002-0.049), Cu 0.011 (0.002-0.092), Zn 0.016 (0.002-0.14), Pb 0.009 (0.005-0.025), Cr 0.003 (0.002-0.007), and TIG/Al fumes 0.7 (0.3-1.4), Al 0.17 (0.07-0.50). A correlation has been found between the concentration of the main components and the fume/dust concentrations in MIG/Al and TIG/Al fumes. Mean percentages of the individual components in MIG/Al fumes/dusts were Al: 30 (9-56) percent; Mg: 3 (1-5.6) percent; Mn: 0.2 (0.1-0.3) percent; Cu: 0.2 (< 0.1-1.8) percent; Zn: 0.2 (< 0.1-0.8) percent; Pb: 0.2 (< 0.1-1) percent; Cr: < 0.1 percent. The proportion of the respirable fraction in the fumes and their constituents varied between 10 percent and 100 percent. The results showed that MIG/Al fumes concentration was 1.2 times higher than the American Conference of Governmental Industrial Hygienists (ACGIH) threshold limit value (TLV), and the index of the combined exposure to the determined agents was 2.3 (0.4-8.0), mostly because of high Al2O3 contribution. The background concentrations of the components (ca. 5-10 times lower than those in the breathing zone of the welders) did not exceed the Polish MAC value. The elemental composition of total and respirable fume/dust may differ considerably depending on welding methods, the nature of welding-related operations, and work environment conditions.
The Reference Genome Sequence of Saccharomyces cerevisiae: Then and Now

PubMed Central

Engel, Stacia R.; Dietrich, Fred S.; Fisk, Dianna G.; Binkley, Gail; Balakrishnan, Rama; Costanzo, Maria C.; Dwight, Selina S.; Hitz, Benjamin C.; Karra, Kalpana; Nash, Robert S.; Weng, Shuai; Wong, Edith D.; Lloyd, Paul; Skrzypek, Marek S.; Miyasato, Stuart R.; Simison, Matt; Cherry, J. Michael

2014-01-01

The genome of the budding yeast Saccharomyces cerevisiae was the first completely sequenced from a eukaryote. It was released in 1996 as the work of a worldwide effort of hundreds of researchers. In the time since, the yeast genome has been intensively studied by geneticists, molecular biologists, and computational scientists all over the world. Maintenance and annotation of the genome sequence have long been provided by the Saccharomyces Genome Database, one of the original model organism databases. To deepen our understanding of the eukaryotic genome, the S. cerevisiae strain S288C reference genome sequence was updated recently in its first major update since 1996. The new version, called “S288C 2010,” was determined from a single yeast colony using modern sequencing technologies and serves as the anchor for further innovations in yeast genomic science. PMID:24374639
Gene discovery in the hamster: a comparative genomics approach for gene annotation by sequencing of hamster testis cDNAs

PubMed Central

Oduru, Sreedhar; Campbell, Janee L; Karri, SriTulasi; Hendry, William J; Khan, Shafiq A; Williams, Simon C

2003-01-01

Background Complete genome annotation will likely be achieved through a combination of computer-based analysis of available genome sequences combined with direct experimental characterization of expressed regions of individual genomes. We have utilized a comparative genomics approach involving the sequencing of randomly selected hamster testis cDNAs to begin to identify genes not previously annotated on the human, mouse, rat and Fugu (pufferfish) genomes. Results 735 distinct sequences were analyzed for their relatedness to known sequences in public databases. Eight of these sequences were derived from previously unidentified genes and expression of these genes in testis was confirmed by Northern blotting. The genomic locations of each sequence were mapped in human, mouse, rat and pufferfish, where applicable, and the structure of their cognate genes was derived using computer-based predictions, genomic comparisons and analysis of uncharacterized cDNA sequences from human and macaque. Conclusion The use of a comparative genomics approach resulted in the identification of eight cDNAs that correspond to previously uncharacterized genes in the human genome. The proteins encoded by these genes included a new member of the kinesin superfamily, a SET/MYND-domain protein, and six proteins for which no specific function could be predicted. Each gene was expressed primarily in testis, suggesting that they may play roles in the development and/or function of testicular cells. PMID:12783626
Real-time, portable genome sequencing for Ebola surveillance.

PubMed

Quick, Joshua; Loman, Nicholas J; Duraffour, Sophie; Simpson, Jared T; Severi, Ettore; Cowley, Lauren; Bore, Joseph Akoi; Koundouno, Raymond; Dudas, Gytis; Mikhail, Amy; Ouédraogo, Nobila; Afrough, Babak; Bah, Amadou; Baum, Jonathan Hj; Becker-Ziaja, Beate; Boettcher, Jan-Peter; Cabeza-Cabrerizo, Mar; Camino-Sanchez, Alvaro; Carter, Lisa L; Doerrbecker, Juiliane; Enkirch, Theresa; Dorival, Isabel Graciela García; Hetzelt, Nicole; Hinzmann, Julia; Holm, Tobias; Kafetzopoulou, Liana Eleni; Koropogui, Michel; Kosgey, Abigail; Kuisma, Eeva; Logue, Christopher H; Mazzarelli, Antonio; Meisel, Sarah; Mertens, Marc; Michel, Janine; Ngabo, Didier; Nitzsche, Katja; Pallash, Elisa; Patrono, Livia Victoria; Portmann, Jasmine; Repits, Johanna Gabriella; Rickett, Natasha Yasmin; Sachse, Andrea; Singethan, Katrin; Vitoriano, Inês; Yemanaberhan, Rahel L; Zekeng, Elsa G; Trina, Racine; Bello, Alexander; Sall, Amadou Alpha; Faye, Ousmane; Faye, Oumar; Magassouba, N'Faly; Williams, Cecelia V; Amburgey, Victoria; Winona, Linda; Davis, Emily; Gerlach, Jon; Washington, Franck; Monteil, Vanessa; Jourdain, Marine; Bererd, Marion; Camara, Alimou; Somlare, Hermann; Camara, Abdoulaye; Gerard, Marianne; Bado, Guillaume; Baillet, Bernard; Delaune, Déborah; Nebie, Koumpingnin Yacouba; Diarra, Abdoulaye; Savane, Yacouba; Pallawo, Raymond Bernard; Gutierrez, Giovanna Jaramillo; Milhano, Natacha; Roger, Isabelle; Williams, Christopher J; Yattara, Facinet; Lewandowski, Kuiama; Taylor, Jamie; Rachwal, Philip; Turner, Daniel; Pollakis, Georgios; Hiscox, Julian A; Matthews, David A; O'Shea, Matthew K; Johnston, Andrew McD; Wilson, Duncan; Hutley, Emma; Smit, Erasmus; Di Caro, Antonino; Woelfel, Roman; Stoecker, Kilian; Fleischmann, Erna; Gabriel, Martin; Weller, Simon A; Koivogui, Lamine; Diallo, Boubacar; Keita, Sakoba; Rambaut, Andrew; Formenty, Pierre; Gunther, Stephan; Carroll, Miles W

2016-02-11

The Ebola virus disease epidemic in West Africa is the largest on record, responsible for over 28,599 cases and more than 11,299 deaths. Genome sequencing in viral outbreaks is desirable to characterize the infectious agent and determine its evolutionary rate. Genome sequencing also allows the identification of signatures of host adaptation, identification and monitoring of diagnostic targets, and characterization of responses to vaccines and treatments. The Ebola virus (EBOV) genome substitution rate in the Makona strain has been estimated at between 0.87 × 10^-3 and 1.42 × 10^-3 mutations per site per year. This is equivalent to 16-27 mutations in each genome, meaning that sequences diverge rapidly enough to identify distinct sub-lineages during a prolonged epidemic. Genome sequencing provides a high-resolution view of pathogen evolution and is increasingly sought after for outbreak surveillance. Sequence data may be used to guide control measures, but only if the results are generated quickly enough to inform interventions. Genomic surveillance during the epidemic has been sporadic owing to a lack of local sequencing capacity coupled with practical difficulties transporting samples to remote sequencing facilities. To address this problem, here we devise a genomic surveillance system that utilizes a novel nanopore DNA sequencing instrument. In April 2015 this system was transported in standard airline luggage to Guinea and used for real-time genomic surveillance of the ongoing epidemic. We present sequence data and analysis of 142 EBOV samples collected during the period March to October 2015. We were able to generate results less than 24 h after receiving an Ebola-positive sample, with the sequencing process taking as little as 15-60 min. We show that real-time genomic surveillance is possible in resource-limited settings and can be established rapidly to monitor outbreaks.
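The conversion from a per-site substitution rate to "16-27 mutations in each genome" can be reproduced with a short calculation. The sketch below is only an illustration: the genome length of 18,959 nt is an assumption taken from the commonly cited Zaire ebolavirus reference length and is not stated in the abstract, and the function name is hypothetical.

```python
# Assumption (not in the abstract): EBOV genome length of about 18,959 nt.
GENOME_LENGTH_NT = 18_959


def substitutions_per_genome_per_year(rate_per_site_per_year,
                                      genome_length=GENOME_LENGTH_NT):
    """Expected substitutions accumulated across one genome in one year."""
    return rate_per_site_per_year * genome_length


# Lower and upper rate estimates quoted in the abstract.
low = substitutions_per_genome_per_year(0.87e-3)
high = substitutions_per_genome_per_year(1.42e-3)
print(f"{low:.1f} to {high:.1f} substitutions per genome per year")
```

Under that assumed length, the two rates round to the 16 and 27 substitutions per genome per year quoted in the abstract.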
Whole-genome sequencing and genetic variant analysis of a Quarter Horse mare.

PubMed

Doan, Ryan; Cohen, Noah D; Sawyer, Jason; Ghaffari, Noushin; Johnson, Charlie D; Dindot, Scott V

2012-02-17

The catalog of genetic variants in the horse genome originates from a few select animals, the majority originating from the Thoroughbred mare used for the equine genome sequencing project. The purpose of this study was to identify genetic variants, including single nucleotide polymorphisms (SNPs), insertion/deletion polymorphisms (INDELs), and copy number variants (CNVs) in the genome of an individual Quarter Horse mare sequenced by next-generation sequencing. Using massively parallel paired-end sequencing, we generated 59.6 Gb of DNA sequence from a Quarter Horse mare resulting in an average of 24.7X sequence coverage. Reads were mapped to approximately 97% of the reference Thoroughbred genome. Unmapped reads were de novo assembled resulting in 19.1 Mb of new genomic sequence in the horse. Using a stringent filtering method, we identified 3.1 million SNPs, 193 thousand INDELs, and 282 CNVs. Genetic variants were annotated to determine their impact on gene structure and function. Additionally, we genotyped this Quarter Horse for mutations of known diseases and for variants associated with particular traits. Functional clustering analysis of genetic variants revealed that most of the genetic variation in the horse's genome was enriched in sensory perception, signal transduction, and immunity and defense pathways. This is the first sequencing of a horse genome by next-generation sequencing and the first genomic sequence of an individual Quarter Horse mare. We have increased the catalog of genetic variants for use in equine genomics by the addition of novel SNPs, INDELs, and CNVs. The genetic variants described here will be a useful resource for future studies of genetic variation regulating performance traits and diseases in equids.
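The relationship between sequence yield and average coverage in this abstract can be checked with simple arithmetic. This is a minimal sketch, assuming coverage is defined as total sequenced bases divided by genome size; the authors may have computed coverage from mapped bases only, so the implied genome size is indicative rather than exact. The function name is hypothetical.

```python
def implied_genome_size_gb(total_bases_gb, mean_coverage):
    """Genome size (Gb) implied by coverage = total bases / genome size."""
    return total_bases_gb / mean_coverage


# 59.6 Gb of sequence at an average of 24.7X coverage, as reported above.
size_gb = implied_genome_size_gb(59.6, 24.7)
print(f"Implied genome size: {size_gb:.2f} Gb")
```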

The complete chloroplast genome sequence of the medicinal plant Salvia miltiorrhiza.

PubMed

Qian, Jun; Song, Jingyuan; Gao, Huanhuan; Zhu, Yingjie; Xu, Jiang; Pang, Xiaohui; Yao, Hui; Sun, Chao; Li, Xian'en; Li, Chuyuan; Liu, Juyan; Xu, Haibin; Chen, Shilin

2013-01-01

Salvia miltiorrhiza is an important medicinal plant with great economic and medicinal value. The complete chloroplast (cp) genome sequence of Salvia miltiorrhiza, the first sequenced member of the Lamiaceae family, is reported here. The genome is 151,328 bp in length and exhibits a typical quadripartite structure of the large (LSC, 82,695 bp) and small (SSC, 17,555 bp) single-copy regions, separated by a pair of inverted repeats (IRs, 25,539 bp). It contains 114 unique genes, including 80 protein-coding genes, 30 tRNAs and four rRNAs. The genome structure, gene order, GC content and codon usage are similar to the typical angiosperm cp genomes. Four forward, three inverted and seven tandem repeats were detected in the Salvia miltiorrhiza cp genome. Simple sequence repeat (SSR) analysis among the 30 asterid cp genomes revealed that most SSRs are AT-rich, which contribute to the overall AT richness of these cp genomes. Additionally, fewer SSRs are distributed in the protein-coding sequences compared to the non-coding regions, indicating an uneven distribution of SSRs within the cp genomes. Entire cp genome comparison of Salvia miltiorrhiza and three other Lamiales cp genomes showed a high degree of sequence similarity and a relatively high divergence of intergenic spacers. Sequence divergence analysis discovered the ten most divergent and ten most conserved genes as well as their length variation, which will be helpful for phylogenetic studies in asterids. Our analysis also supports that both regional and functional constraints affect gene sequence evolution. Further, phylogenetic analysis demonstrated a sister relationship between Salvia miltiorrhiza and Sesamum indicum. The complete cp genome sequence of Salvia miltiorrhiza reported in this paper will facilitate population, phylogenetic and cp genetic engineering studies of this medicinal plant.
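The quadripartite structure described above implies that the total genome length equals the two single-copy regions plus two copies of the inverted repeat. The following sketch checks that the reported figures are internally consistent; the variable names are illustrative only.

```python
# Region lengths (bp) as reported in the abstract.
LSC = 82_695      # large single-copy region
SSC = 17_555      # small single-copy region
IR = 25_539       # one inverted repeat; the genome carries two copies

total_length = LSC + SSC + 2 * IR
ir_fraction = 2 * IR / total_length

# Gene counts as reported: protein-coding, tRNA, rRNA.
unique_genes = 80 + 30 + 4

print(total_length)            # total chloroplast genome length in bp
print(f"{ir_fraction:.1%}")    # share of the genome occupied by the IR pair
print(unique_genes)
```

The sum reproduces the stated genome length of 151,328 bp, and the gene categories add up to the stated 114 unique genes.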
Comparative Genomic and Transcriptomic Characterization of the Toxigenic Marine Dinoflagellate Alexandrium ostenfeldii

PubMed Central

Jaeckisch, Nina; Yang, Ines; Wohlrab, Sylke; Glöckner, Gernot; Kroymann, Juergen; Vogel, Heiko; Cembella, Allan; John, Uwe

2011-01-01

Many dinoflagellate species are notorious for the toxins they produce and ecological and human health consequences associated with harmful algal blooms (HABs). Dinoflagellates are particularly refractory to genomic analysis due to the enormous genome size, lack of knowledge about their DNA composition and structure, and peculiarities of gene regulation, such as spliced leader (SL) trans-splicing and mRNA transposition mechanisms. Alexandrium ostenfeldii is known to produce macrocyclic imine toxins, described as spirolides. We characterized the genome of A. ostenfeldii using a combination of transcriptomic data and random genomic clones for comparison with other dinoflagellates, particularly Alexandrium species. Examination of SL sequences revealed similar features as in other dinoflagellates, including Alexandrium species. SL sequences in decay indicate frequent retro-transposition of mRNA species. This probably contributes to overall genome complexity by generating additional gene copies. Sequencing of several thousand fosmid and bacterial artificial chromosome (BAC) ends yielded a wealth of simple repeats and tandemly repeated longer sequence stretches which we estimated to comprise more than half of the whole genome. Surprisingly, the repeats comprise a very limited set of 79–97 bp sequences; in part the genome is thus a relatively uniform sequence space interrupted by coding sequences. Our genomic sequence survey (GSS) represents the largest genomic data set of a dinoflagellate to date. Alexandrium ostenfeldii is a typical dinoflagellate with respect to its transcriptome and mRNA transposition but demonstrates Alexandrium-like stop codon usage. The large portion of repetitive sequences and the organization within the genome is in agreement with several other studies on dinoflagellates using different approaches. 
It remains to be determined whether this unusual composition is directly correlated with the exceptional genome organization of dinoflagellates with a low amount of histones and histone-like proteins. PMID:22164224
Detection of somatic, subclonal and mosaic CNVs from sequencing | Division of Cancer Prevention

Cancer.gov

Progress in technology has made individual genome sequencing a clinical reality, with partial genome sequencing already in use in clinical care. In fact, it is expected that within a few years whole genome sequencing will be a standard procedure that will allow discovering personal genomic variants of all types and thus greatly facilitate individualized medicine. However, fast
The Douglas-fir genome sequence reveals specialization of the photosynthetic apparatus in Pinaceae

Treesearch

David B. Neale; Patrick E. McGuire; Nicholas C. Wheeler; Kristian A. Stevens; Marc W. Crepeau; Charis Cardeno; Aleksey V. Zimin; Daniela Puiu; Geo M. Pertea; U. Uzay Sezen; Claudio Casola; Tomasz E. Koralewski; Robin Paul; Daniel Gonzalez-Ibeas; Sumaira Zaman; Richard Cronn; Mark Yandell; Carson Holt; Charles H. Langley; James A. Yorke; Steven L. Salzberg; Jill L. Wegrzyn

2017-01-01

A reference genome sequence for Pseudotsuga menziesii var. menziesii (Mirb.) Franco (Coastal Douglas-fir) is reported, thus providing a reference sequence for a third genus of the family Pinaceae. The contiguity and quality of the genome assembly far exceeds that of other conifer reference genome sequences (contig N50 = 44,136 bp and scaffold N50...
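For readers unfamiliar with the N50 statistic quoted above, a minimal sketch of the usual definition follows: the length of the shortest contig in the smallest set of longest contigs that together cover at least half of the assembly. The contig lengths in the example are made up for illustration and have nothing to do with the Douglas-fir assembly.

```python
def n50(lengths):
    """Return the N50 of a collection of contig or scaffold lengths."""
    ordered = sorted(lengths, reverse=True)
    if not ordered:
        raise ValueError("n50() requires at least one length")
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length


# Hypothetical toy assembly: total 300, so half is 150.
toy_contigs = [80, 70, 50, 40, 30, 20, 10]
print(n50(toy_contigs))
```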
Draft Genome Sequence of Frankia sp. strain CN3, an atypical, non-infective (Nod-) ineffective (Fix-) isolate from Coriaria nepalensis

DOE Office of Scientific and Technical Information (OSTI.GOV)

Ghodhbane-Gtari, Faten; Beauchemin, Nicholas; Bruce, David

2013-01-01

We report here the genome sequence of Frankia sp. strain CN3, which was isolated from Coriaria nepalensis. This genome sequence is the first from the fourth lineage of Frankia, whose members are unable to re-infect actinorhizal plants. At 10 Mb, it represents the largest Frankia genome sequenced to date.
Complete Genome Sequence of Clavibacter michiganensis subsp. insidiosus R1-1 Using PacBio Single-Molecule Real-Time Technology.

PubMed

Lu, You; Samac, Deborah A; Glazebrook, Jane; Ishimaru, Carol A

2015-05-07

We report here the complete genome sequence of Clavibacter michiganensis subsp. insidiosus R1-1, isolated in Minnesota, USA. The R1-1 genome, generated by a de novo assembly of PacBio sequencing data, is the first complete genome sequence available for this subspecies. Copyright © 2015 Lu et al.
Complete Genome Sequences of Two Vesicular Stomatitis Virus Isolates Collected in Mexico

PubMed Central

Isa, Pavel; Pauszek, Steven J.; Rodriguez, Luis L.

2017-01-01

ABSTRACT We report two full-genome sequences of vesicular stomatitis New Jersey virus (VSNJV) obtained by Illumina next-generation sequencing of RNA isolated from epithelial suspensions of cattle naturally infected in Mexico. These genomes represent the first full-genome sequences of vesicular stomatitis New Jersey viruses circulating in Mexico deposited in the GenBank database. PMID:28912331
Microsatellite analysis in the genome of Acanthaceae: An in silico approach.

PubMed

Kaliswamy, Priyadharsini; Vellingiri, Srividhya; Nathan, Bharathi; Selvaraj, Saravanakumar

2015-01-01

Acanthaceae is one of the advanced and specialized families with conventionally used medicinal plants. Simple sequence repeats (SSRs) play a major role as molecular markers for genome analysis and plant breeding. The microsatellites present in complete genome sequences are thought to play a direct role in genome organization, recombination, gene regulation, quantitative genetic variation, and evolution of genes. The current study reports the frequency of microsatellites and appropriate markers for the Acanthaceae family genome sequences. The whole nucleotide sequences of Acanthaceae species were obtained from the National Center for Biotechnology Information database and screened for the presence of SSRs. The SSR Locator tool was used to predict the microsatellites, and its inbuilt Primer3 module was used for primer design. In total, 110 repeats from 108 sequences of Acanthaceae family plant genomes were identified, and dinucleotide repeats were found to be abundant in the genome sequences. The essential amino acid isoleucine was found to be abundant in all the sequences. We also designed SSR-based primers/markers for 59 sequences of this family that contain microsatellite repeats. The identified microsatellites and primers might be useful for future breeding and genetic studies of plants that belong to the Acanthaceae family.
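The kind of screening described above, scanning nucleotide sequences for perfect tandem repeats, can be sketched with a regular expression. This is not the algorithm used by SSR Locator; it is a simplified stand-in, and the minimum copy numbers, the function name and the example sequence are all hypothetical.

```python
import re

# Hypothetical thresholds: minimum tandem copies per motif length.
# Longer motifs are omitted because a motif such as ATAT would need an
# extra check to avoid re-reporting a dinucleotide repeat.
MIN_REPEATS = {2: 6, 3: 5}


def find_ssrs(sequence, min_repeats=MIN_REPEATS):
    """Return (start, motif, copies) for each perfect tandem repeat found."""
    sequence = sequence.upper()
    hits = []
    for motif_len, min_copies in sorted(min_repeats.items()):
        # One captured motif followed by at least (min_copies - 1) more copies.
        pattern = re.compile(
            r"([ACGT]{%d})\1{%d,}" % (motif_len, min_copies - 1)
        )
        for match in pattern.finditer(sequence):
            motif = match.group(1)
            if len(set(motif)) == 1:
                continue  # skip homopolymer runs such as AAAAAA
            copies = len(match.group(0)) // motif_len
            hits.append((match.start(), motif, copies))
    return hits


# Made-up sequence: an (AT)8 dinucleotide and a (GAA)5 trinucleotide repeat.
example = "GGC" + "AT" * 8 + "CCGT" + "GAA" * 5 + "TTC"
ssrs = find_ssrs(example)
print(ssrs)
```

Start positions are zero-based. A production tool would also handle imperfect repeats and ambiguous bases, which this sketch ignores.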
FA-SAT Is an Old Satellite DNA Frozen in Several Bilateria Genomes

PubMed Central

Chaves, Raquel; Ferreira, Daniela; Mendes-da-Silva, Ana; Meles, Susana; Adega, Filomena

2017-01-01

Abstract In recent years, a growing body of evidence has recognized the tandem repeat sequences, and specifically satellite DNA, as a functional class of sequences in the genomic “dark matter.” Using an original, complementary, and thus an eclectic experimental design, we show that the cat archetypal satellite DNA sequence, FA-SAT, is “frozen” conservatively in several Bilateria genomes. We found different genomic FA-SAT architectures, and the interspersion pattern was conserved. In Carnivora genomes, the FA-SAT-related sequences are also amplified, with the predominance of a specific FA-SAT variant, at the heterochromatic regions. We inspected the cat genome project to locate FA-SAT array flanking regions and revealed an intensive intermingling with transposable elements. Our results also show that FA-SAT-related sequences are transcribed and that the most abundant FA-SAT variant is not always the most transcribed. We thus conclude that the DNA sequences of FA-SAT and their transcripts are “frozen” in these genomes. Future work is needed to disclose any putative function that these sequences may play in these genomes. PMID:29608678
A genome-wide BAC-end sequence survey provides first insights into sweetpotato (Ipomoea batatas (L.) Lam.) genome composition.

PubMed

Si, Zengzhi; Du, Bing; Huo, Jinxi; He, Shaozhen; Liu, Qingchang; Zhai, Hong

2016-11-21

Sweetpotato, Ipomoea batatas (L.) Lam., is an important food crop widely grown in the world. However, little is known about the genome of this species because it is a highly heterozygous hexaploid. Gaining a more in-depth knowledge of the sweetpotato genome is therefore necessary. In this study, the first bacterial artificial chromosome (BAC) library of sweetpotato was constructed. Clones from the BAC library were end-sequenced and analyzed to provide genome-wide information about this species. The BAC library contained 240,384 clones with an average insert size of 101 kb and had a 7.93-10.82 × coverage of the genome, and the probability of isolating any single-copy DNA sequence from the library was more than 99%. Both ends of 8310 BAC clones randomly selected from the library were sequenced to generate 11,542 high-quality BAC-end sequences (BESs), with an accumulative length of 7,595,261 bp and an average length of 658 bp. Analysis of the BESs revealed that 12.17% of the sweetpotato genome was known repetitive DNA, including 7.37% long terminal repeat (LTR) retrotransposons, 1.15% non-LTR retrotransposons and 1.42% Class II DNA transposons; 18.31% of the genome was identified as sweetpotato-unique repetitive DNA and 10.00% of the genome was predicted to be coding regions. In total, 3,846 simple sequence repeats (SSRs) were identified, with a density of one SSR per 1.93 kb, from which 288 SSR primers were designed and tested for length polymorphism using 20 sweetpotato accessions; 173 (60.07%) of them produced polymorphic bands. Sweetpotato BESs had significant hits to the genome sequences of I. trifida and more matches to the whole-genome sequences of Solanum lycopersicum than those of Vitis vinifera, Theobroma cacao and Arabidopsis thaliana. The first BAC library for sweetpotato has been successfully constructed. The high-quality BESs provide first insights into sweetpotato genome composition, and have significant hits to the genome sequences of I. trifida and more matches to the whole-genome sequences of Solanum lycopersicum. These resources will serve as a robust platform for high-resolution mapping, gene cloning, assembly of genome sequences, comparative genomics and evolutionary studies of sweetpotato.
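The library statistics in this abstract can be cross-checked with a few lines of arithmetic. The sketch below assumes the standard Clarke-Carbon relation, P = 1 - (1 - L/G)^N, for the probability of recovering a given single-copy sequence; the abstract does not say which formula the authors used, and the genome size is back-calculated here from the lower coverage bound rather than taken from the source.

```python
N_CLONES = 240_384       # clones in the library (from the abstract)
INSERT_KB = 101          # average insert size in kb (from the abstract)


def genome_size_kb(coverage):
    """Genome size implied by coverage = clones * insert / genome size."""
    return N_CLONES * INSERT_KB / coverage


def prob_sequence_in_library(genome_kb):
    """Clarke-Carbon probability that a single-copy sequence is represented."""
    return 1.0 - (1.0 - INSERT_KB / genome_kb) ** N_CLONES


# Lower coverage bound gives the larger, more conservative genome size.
genome_kb = genome_size_kb(7.93)
p = prob_sequence_in_library(genome_kb)

# BAC-end sequence summary figures from the abstract.
mean_bes_length = 7_595_261 / 11_542
polymorphic_pct = 100 * 173 / 288

print(f"Implied genome size: {genome_kb / 1e6:.2f} Gb")
print(f"P(single-copy sequence in library) = {p:.4f}")
print(f"Mean BES length: {mean_bes_length:.0f} bp")
print(f"Polymorphic primers: {polymorphic_pct:.2f}%")
```

Even at the lower coverage bound the probability exceeds 99%, and the mean BES length and polymorphic-primer percentage match the reported 658 bp and 60.07%.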
Brassica ASTRA: an integrated database for Brassica genomic research.

PubMed

Love, Christopher G; Robinson, Andrew J; Lim, Geraldine A C; Hopkins, Clare J; Batley, Jacqueline; Barker, Gary; Spangenberg, German C; Edwards, David

2005-01-01

Brassica ASTRA is a public database for genomic information on Brassica species. The database incorporates expressed sequences with Swiss-Prot and GenBank comparative sequence annotation as well as secondary Gene Ontology (GO) annotation derived from the comparison with Arabidopsis TAIR GO annotations. Simple sequence repeat molecular markers are identified within resident sequences and mapped onto the closely related Arabidopsis genome sequence. Bacterial artificial chromosome (BAC) end sequences derived from the Multinational Brassica Genome Project are also mapped onto the Arabidopsis genome sequence enabling users to identify candidate Brassica BACs corresponding to syntenic regions of Arabidopsis. This information is maintained in a MySQL database with a web interface providing the primary means of interrogation. The database is accessible at http://hornbill.cspp.latrobe.edu.au.
GI-POP: a combinational annotation and genomic island prediction pipeline for ongoing microbial genome projects.

PubMed

Lee, Chi-Ching; Chen, Yi-Ping Phoebe; Yao, Tzu-Jung; Ma, Cheng-Yu; Lo, Wei-Cheng; Lyu, Ping-Chiang; Tang, Chuan Yi

2013-04-10

Sequencing of microbial genomes is important because microbes carry antibiotic and pathogenetic activities. However, even with the help of new assembling software, finishing a whole genome is a time-consuming task. In most bacteria, pathogenetic or antibiotic genes are carried in genomic islands. Therefore, a quick genomic island (GI) prediction method is useful for genomes whose sequencing is still ongoing. In this work, we built a Web server called GI-POP (http://gipop.life.nthu.edu.tw) which integrates a sequence assembling tool, a functional annotation pipeline, and a high-performance GI predicting module that uses a support vector machine (SVM)-based method called genomic island genomic profile scanning (GI-GPS). The draft genomes of ongoing genome projects, in contigs or scaffolds, can be submitted to our Web server, which provides functional annotation and highly probable GI predictions. GI-POP is a comprehensive annotation Web server designed for ongoing genome project analysis. Researchers can perform annotation and obtain pre-analytic information, including possible GIs, coding/non-coding sequences and functional analysis, from their draft genomes. This pre-analytic system can provide useful information for finishing a genome sequencing project. Copyright © 2012 Elsevier B.V. All rights reserved.
A Rapid Whole Genome Sequencing and Analysis System Supporting Genomic Epidemiology (7th Annual SFAF Meeting, 2012)

DOE Office of Scientific and Technical Information (OSTI.GOV)

FitzGerald, Michael

2012-06-01

Michael FitzGerald on "A rapid whole genome sequencing and analysis system supporting genomic epidemiology" at the 2012 Sequencing, Finishing, Analysis in the Future Meeting held June 5-7, 2012 in Santa Fe, New Mexico.
A Rapid Whole Genome Sequencing and Analysis System Supporting Genomic Epidemiology (7th Annual SFAF Meeting, 2012)

ScienceCinema

FitzGerald, Michael

2018-01-11

Michael FitzGerald on "A rapid whole genome sequencing and analysis system supporting genomic epidemiology" at the 2012 Sequencing, Finishing, Analysis in the Future Meeting held June 5-7, 2012 in Santa Fe, New Mexico.
Molecular characterization of faba bean necrotic yellows viruses in Tunisia.

PubMed

Kraberger, Simona; Kumari, Safaa G; Najar, Asma; Stainton, Daisy; Martin, Darren P; Varsani, Arvind

2018-03-01

Faba bean necrotic yellows virus (FBNYV) (genus Nanovirus; family Nanoviridae) has a genome comprising eight individually encapsidated circular single-stranded DNA components. It has frequently been found infecting faba bean (Vicia faba L.) and chickpea (Cicer arietinum L.) in association with satellite molecules (alphasatellites). Genome sequences of FBNYV from Azerbaijan, Egypt, Iran, Morocco, Spain and Syria have been determined previously and we now report the first five genome sequences of FBNYV and associated alphasatellites from faba bean sampled in Tunisia. In addition, we have determined the genome sequences of two additional FBNYV isolates from chickpea plants sampled in Syria and Iran. All individual FBNYV genome component sequences that were determined here share > 84% nucleotide sequence identity with FBNYV sequences available in public databases, with the DNA-M component displaying the highest degree of diversity. As with other studied nanoviruses, recombination and genome component reassortment occur frequently both between FBNYV genomes and between genomes of nanoviruses belonging to other species.
High quality de novo sequencing and assembly of the Saccharomyces arboricolus genome

PubMed Central

2013-01-01

Background Comparative genomics is a formidable tool to identify functional elements throughout a genome. In the past ten years, studies in the budding yeast Saccharomyces cerevisiae and a set of closely related species have been instrumental in showing the benefit of analyzing patterns of sequence conservation. Increasing the number of closely related genome sequences makes the comparative genomics approach more powerful and accurate. Results Here, we report the genome sequence and analysis of Saccharomyces arboricolus, a yeast species recently isolated in China, that is closely related to S. cerevisiae. We obtained high quality de novo sequence and assemblies using a combination of next generation sequencing technologies, established the phylogenetic position of this species and considered its phenotypic profile under multiple environmental conditions in the light of its gene content and phylogeny. Conclusions We suggest that the genome of S. arboricolus will be useful in future comparative genomics analysis of the Saccharomyces sensu stricto yeasts. PMID:23368932
The fast changing landscape of sequencing technologies and their impact on microbial genome assemblies and annotation.

PubMed

Mavromatis, Konstantinos; Land, Miriam L; Brettin, Thomas S; Quest, Daniel J; Copeland, Alex; Clum, Alicia; Goodwin, Lynne; Woyke, Tanja; Lapidus, Alla; Klenk, Hans Peter; Cottingham, Robert W; Kyrpides, Nikos C

2012-01-01

The emergence of next generation sequencing (NGS) has provided the means for rapid and high throughput sequencing and data generation at low cost, while concomitantly creating a new set of challenges. The number of available assembled microbial genomes continues to grow rapidly and their quality reflects the quality of the sequencing technology used, but also of the analysis software employed for assembly and annotation. In this work, we have explored the quality of the microbial draft genomes across various sequencing technologies. We have compared the draft and finished assemblies of 133 microbial genomes sequenced at the Department of Energy-Joint Genome Institute and finished at the Los Alamos National Laboratory using a variety of combinations of sequencing technologies, reflecting the transition of the institute from Sanger-based sequencing platforms to NGS platforms. The quality of the public assemblies and of the associated gene annotations was evaluated using various metrics. Results obtained with the different sequencing technologies, as well as their effects on downstream processes, were analyzed. Our results demonstrate that the Illumina HiSeq 2000 sequencing system, the primary sequencing technology currently used for de novo genome sequencing and assembly at JGI, has various advantages in terms of total sequence throughput and cost, but it also introduces challenges for the downstream analyses. In all cases, assembly results, although of high quality on average, need to be viewed critically, and sources of error in them should be considered prior to analysis. These data follow the evolution of microbial sequencing and downstream processing at the JGI from draft genome sequences with large gaps corresponding to missing genes of significant biological role to assemblies with multiple small gaps (Illumina) and finally to assemblies that generate almost complete genomes (Illumina+PacBio).
Detection of DNA Methylation by Whole-Genome Bisulfite Sequencing.

PubMed

Li, Qing; Hermanson, Peter J; Springer, Nathan M

2018-01-01

DNA methylation plays an important role in the regulation of the expression of transposons and genes. Various methods have been developed to assay DNA methylation levels. Bisulfite sequencing is considered to be the "gold standard" for single-base resolution measurement of DNA methylation levels. Coupled with next-generation sequencing, whole-genome bisulfite sequencing (WGBS) allows DNA methylation to be evaluated at a genome-wide scale. Here, we describe a protocol for WGBS in plant species with large genomes. This protocol has been successfully applied to assay genome-wide DNA methylation levels in maize and barley. It has also been successfully coupled with sequence capture technology to assay DNA methylation levels in a targeted set of genomic regions.
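As a rough illustration of how a per-site methylation level is usually derived from bisulfite reads, the sketch below relies on the standard chemistry: unmethylated cytosines are converted and read as T, while methylated cytosines are still read as C. The function name, site labels, and read counts are hypothetical and are not taken from the protocol described above.

```python
def methylation_level(c_reads: int, t_reads: int) -> float:
    """Fraction of reads supporting methylation at a single cytosine.

    c_reads: reads showing C (cytosine protected from conversion, i.e. methylated)
    t_reads: reads showing T (cytosine converted, i.e. unmethylated)
    """
    total = c_reads + t_reads
    if total == 0:
        raise ValueError("no coverage at this site")
    return c_reads / total


# Hypothetical (C reads, T reads) counts at three cytosine positions.
sites = {"chr1:100": (18, 2), "chr1:250": (0, 25), "chr1:400": (5, 5)}
levels = {pos: methylation_level(c, t) for pos, (c, t) in sites.items()}
```

Real pipelines typically also filter on minimum coverage and estimate the bisulfite conversion rate; both steps are omitted here.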
Exploiting long read sequencing technologies to establish high quality highly contiguous pig reference genome assemblies

USDA-ARS's Scientific Manuscript database

The current pig reference genome sequence (Sscrofa10.2) was established using Sanger sequencing and following the clone-by-clone hierarchical shotgun sequencing approach used in the public human genome project. However, as sequence coverage was low (4-6x) the resulting assembly was only of draft qua...
Next-Generation Genomics Facility at C-CAMP: Accelerating Genomic Research in India

PubMed Central

S, Chandana; Russiachand, Heikham; H, Pradeep; S, Shilpa; M, Ashwini; S, Sahana; B, Jayanth; Atla, Goutham; Jain, Smita; Arunkumar, Nandini; Gowda, Malali

2014-01-01

Next-Generation Sequencing (NGS; http://www.genome.gov/12513162) is a recent life-sciences technological revolution that allows scientists to decode genomes or transcriptomes at a much faster rate and at a lower cost. Genomics-based studies are proceeding at a relatively slow pace in India due to the non-availability of genomics experts, trained personnel and dedicated service providers. NGS offers great potential to study India's national diversity (of all kinds). We at the Centre for Cellular and Molecular Platforms (C-CAMP) have launched the Next Generation Genomics Facility (NGGF) to provide genomics services to scientists, to train researchers and also to work on national and international genomic projects. We have a HiSeq1000 from Illumina and a GS-FLX Plus from Roche454. The long reads from the GS FLX Plus and the high sequence depth from the HiSeq1000 together form an ideal hybrid approach for de novo sequencing and re-sequencing of genomes and transcriptomes. At our facility, we have sequenced around 70 different organisms comprising more than 388 genomes and 615 transcriptomes – prokaryotes and eukaryotes (fungi, plants and animals). In addition, we have optimized other unique applications such as small RNA (miRNA, siRNA etc), long Mate-pair sequencing (2 to 20 Kb), Coding sequences (Exome), Methylome (ChIP-Seq), Restriction Mapping (RAD-Seq), Human Leukocyte Antigen (HLA) typing, mixed genomes (metagenomes) and target amplicons, etc. Translating DNA sequence data from an NGS sequencer into meaningful information is an important exercise. Under NGGF, we have bioinformatics experts and high-end computing resources to dissect NGS data such as genome assembly and annotation, gene expression, target enrichment, variant calling (SSR or SNP), comparative analysis etc. Our services (sequencing and bioinformatics) have been utilized by more than 45 organizations (academia and industry) both within India and outside, resulting in several publications in peer-reviewed journals, and several genomic/transcriptomic datasets are available at NCBI.

Initial characterization of the large genome of the salamander Ambystoma mexicanum using shotgun and laser capture chromosome sequencing

PubMed Central

Keinath, Melissa C.; Timoshevskiy, Vladimir A.; Timoshevskaya, Nataliya Y.; Tsonis, Panagiotis A.; Voss, S. Randal; Smith, Jeramiah J.

2015-01-01

Vertebrates exhibit substantial diversity in genome size, and some of the largest genomes exist in species that uniquely inform diverse areas of basic and biomedical research. For example, the salamander Ambystoma mexicanum (the Mexican axolotl) is a model organism for studies of regeneration, development and genome evolution, yet its genome is ~10× larger than the human genome. As part of a hierarchical approach toward improving genome resources for the species, we generated 600 Gb of shotgun sequence data and developed methods for sequencing individual laser-captured chromosomes. Based on these data, we estimate that the A. mexicanum genome is ~32 Gb. Notably, as much as 19 Gb of the A. mexicanum genome can potentially be considered single copy, which presumably reflects the evolutionary diversification of mobile elements that accumulated during an ancient episode of genome expansion. Chromosome-targeted sequencing permitted the development of assemblies within the constraints of modern computational platforms, allowed us to place 2062 genes on the two smallest A. mexicanum chromosomes and resolves key events in the history of vertebrate genome evolution. Our analyses show that the capture and sequencing of individual chromosomes is likely to provide valuable information for the systematic sequencing, assembly and scaffolding of large genomes. PMID:26553646
Initial characterization of the large genome of the salamander Ambystoma mexicanum using shotgun and laser capture chromosome sequencing.

PubMed

Keinath, Melissa C; Timoshevskiy, Vladimir A; Timoshevskaya, Nataliya Y; Tsonis, Panagiotis A; Voss, S Randal; Smith, Jeramiah J

2015-11-10

Vertebrates exhibit substantial diversity in genome size, and some of the largest genomes exist in species that uniquely inform diverse areas of basic and biomedical research. For example, the salamander Ambystoma mexicanum (the Mexican axolotl) is a model organism for studies of regeneration, development and genome evolution, yet its genome is ~10× larger than the human genome. As part of a hierarchical approach toward improving genome resources for the species, we generated 600 Gb of shotgun sequence data and developed methods for sequencing individual laser-captured chromosomes. Based on these data, we estimate that the A. mexicanum genome is ~32 Gb. Notably, as much as 19 Gb of the A. mexicanum genome can potentially be considered single copy, which presumably reflects the evolutionary diversification of mobile elements that accumulated during an ancient episode of genome expansion. Chromosome-targeted sequencing permitted the development of assemblies within the constraints of modern computational platforms, allowed us to place 2062 genes on the two smallest A. mexicanum chromosomes and resolves key events in the history of vertebrate genome evolution. Our analyses show that the capture and sequencing of individual chromosomes is likely to provide valuable information for the systematic sequencing, assembly and scaffolding of large genomes.
Quasispecies Analyses of the HIV-1 Near-full-length Genome With Illumina MiSeq

PubMed Central

Ode, Hirotaka; Matsuda, Masakazu; Matsuoka, Kazuhiro; Hachiya, Atsuko; Hattori, Junko; Kito, Yumiko; Yokomaku, Yoshiyuki; Iwatani, Yasumasa; Sugiura, Wataru

2015-01-01

Human immunodeficiency virus type-1 (HIV-1) exhibits high between-host genetic diversity and within-host heterogeneity, recognized as quasispecies. Because HIV-1 quasispecies fluctuate in terms of multiple factors, such as antiretroviral exposure and host immunity, analyzing the HIV-1 genome is critical for selecting effective antiretroviral therapy and understanding within-host viral coevolution mechanisms. Here, to obtain HIV-1 genome sequence information that includes minority variants, we sought to develop a method for evaluating quasispecies throughout the HIV-1 near-full-length genome using the Illumina MiSeq benchtop deep sequencer. To ensure the reliability of minority mutation detection, we applied an analysis method of sequence read mapping onto a consensus sequence derived from de novo assembly followed by iterative mapping and subsequent unique error correction. Deep sequencing analyses of an HIV-1 clone showed that the analysis method reduced erroneous base prevalence below 1% in each sequence position and discarded only <1% of all collected nucleotides, maximizing the usage of the collected genome sequences. Further, we designed primer sets to amplify the HIV-1 near-full-length genome from clinical plasma samples. Deep sequencing of 92 samples in combination with the primer sets and our analysis method provided sufficient coverage to identify >1%-frequency sequences throughout the genome. When we evaluated sequences of pol genes from 18 treatment-naïve patients' samples, the deep sequencing results were in agreement with Sanger sequencing and identified numerous additional minority mutations. The results suggest that our deep sequencing method would be suitable for identifying within-host viral population dynamics throughout the genome. PMID:26617593
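The frequency-threshold idea behind minority-variant detection can be sketched as follows. This is only an illustration of counting non-consensus bases per position in already aligned reads and reporting those at or above 1%. It does not reproduce the authors' iterative mapping or error-correction steps, and the function name and toy reads are hypothetical.

```python
from collections import Counter


def minority_variants(aligned_reads, consensus, min_freq=0.01):
    """Return (position, base, frequency) for non-consensus bases at or above min_freq.

    aligned_reads: equal-length strings aligned to the consensus, with '-' for gaps.
    """
    variants = []
    for pos, ref_base in enumerate(consensus):
        # Count the bases observed in this alignment column, ignoring gaps.
        column = Counter(read[pos] for read in aligned_reads if read[pos] != "-")
        depth = sum(column.values())
        if depth == 0:
            continue
        for base, count in sorted(column.items()):
            freq = count / depth
            if base != ref_base and freq >= min_freq:
                variants.append((pos, base, freq))
    return variants


# Toy data: 100 reads, 3 of which carry an A instead of G at position 2.
consensus = "ACGT"
reads = ["ACGT"] * 97 + ["ACAT"] * 3
result = minority_variants(reads, consensus)
```

With these toy reads, the only reported variant is the A at position 2 (0-based), at a frequency of 3%.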
Illumina Synthetic Long Read Sequencing Allows Recovery of Missing Sequences even in the “Finished” C. elegans Genome

PubMed Central

Li, Runsheng; Hsieh, Chia-Ling; Young, Amanda; Zhang, Zhihong; Ren, Xiaoliang; Zhao, Zhongying

2015-01-01

Most next-generation sequencing platforms permit acquisition of high-throughput DNA sequences, but the relatively short read length limits their use in genome assembly or finishing. Illumina has recently released a technology called Synthetic Long-Read Sequencing that can produce reads of unusual length, i.e., predominantly around 10 Kb. However, a systematic assessment of their use in genome finishing and assembly is still lacking. We evaluate the promise and deficiency of the long reads in these aspects using an isogenic C. elegans genome with no gaps. First, the reads are highly accurate and capable of recovering most types of repetitive sequences. However, the presence of tandem repetitive sequences prevents pre-assembly of long reads in the relevant genomic region. Second, the reads are able to reliably detect missing but not extra sequences in the C. elegans genome. Third, the reads of smaller size are more capable of recovering repetitive sequences than those of bigger size. Fourth, at least 40 Kbp of missing genomic sequences are recovered in the C. elegans genome using the long reads. Finally, an N50 contig size of at least 86 Kbp can be achieved with 24× reads but with substantial mis-assembly errors, highlighting a need for a novel assembly algorithm for the long reads. PMID:26039588
The genome sequence of sweet cherry (Prunus avium) for use in genomics-assisted breeding.

PubMed

Shirasawa, Kenta; Isuzugawa, Kanji; Ikenaga, Mitsunobu; Saito, Yutaro; Yamamoto, Toshiya; Hirakawa, Hideki; Isobe, Sachiko

2017-10-01

We determined the genome sequence of sweet cherry (Prunus avium) using next-generation sequencing technology. The total length of the assembled sequences was 272.4 Mb, consisting of 10,148 scaffold sequences with an N50 length of 219.6 kb. The sequences covered 77.8% of the 352.9 Mb sweet cherry genome, as estimated by k-mer analysis, and included >96.0% of the core eukaryotic genes. We predicted 43,349 complete and partial protein-encoding genes. A high-density consensus map with 2,382 loci was constructed using double-digest restriction site-associated DNA sequencing. Comparing the genetic maps of sweet cherry and peach revealed high synteny between the two genomes; thus the scaffolds were integrated into pseudomolecules using map- and synteny-based strategies. Whole-genome resequencing of six modern cultivars found 1,016,866 SNPs and 162,402 insertions/deletions, out of which 0.7% were deleterious. The sequence variants, as well as simple sequence repeats, can be used as DNA markers. The genomic information helps us to identify agronomically important genes and will accelerate genetic studies and breeding programs for sweet cherries. Further information on the genomic sequences and DNA markers is available in DBcherry (http://cherry.kazusa.or.jp (8 May 2017, date last accessed)). © The Author 2017. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
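For readers unfamiliar with the N50 statistic quoted above, a minimal sketch of its usual definition follows: sort the sequence lengths from longest to shortest and report the length at which the running total first reaches half of the assembly size. The scaffold lengths used here are made-up values, not data from the sweet cherry assembly.

```python
def n50(lengths):
    """Length L such that sequences of length >= L contain at least half of all bases."""
    ordered = sorted(lengths, reverse=True)
    half = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half:
            return length
    raise ValueError("no sequence lengths given")


# Hypothetical scaffold lengths (arbitrary units), total = 300.
scaffolds = [100, 80, 60, 40, 20]
scaffold_n50 = n50(scaffolds)  # 100 + 80 = 180 >= 150, so the N50 is 80
```

Because N50 depends only on the length distribution, it says nothing about assembly correctness. The synthetic long-read record earlier in this listing makes that point, reporting a high N50 alongside substantial mis-assembly errors.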
Genome Sequencing and Assembly by Long Reads in Plants

PubMed Central

Li, Changsheng; Lin, Feng; An, Dong; Huang, Ruidong

2017-01-01

Plant genomes generated by Sanger and Next Generation Sequencing (NGS) have provided insight into species diversity and evolution. However, Sanger sequencing is limited in its applications due to high cost, labor intensity, and low throughput, while NGS reads are too short to resolve abundant repeats and polyploidy, leading to incomplete or ambiguous assemblies. The advent and improvement of long-read sequencing by Third Generation Sequencing (TGS) methods such as PacBio and Nanopore have shown promise in producing high-quality assemblies for complex genomes. Here, we review the development of sequencing, introducing the application as well as considerations of experimental design in TGS of plant genomes. We also introduce recent revolutionary scaffolding technologies including BioNano, Hi-C, and 10× Genomics. We expect that the informative guidance for genome sequencing and assembly by long reads will benefit the initiation of scientists’ projects. PMID:29283420
MIPS: a database for protein sequences and complete genomes.

PubMed Central

Mewes, H W; Hani, J; Pfeiffer, F; Frishman, D

1998-01-01

The MIPS group [Munich Information Center for Protein Sequences of the German National Center for Environment and Health (GSF)] at the Max-Planck-Institute for Biochemistry, Martinsried near Munich, Germany, is involved in a number of data collection activities, including a comprehensive database of the yeast genome, a database reflecting the progress in sequencing the Arabidopsis thaliana genome, the systematic analysis of other small genomes and the collection of protein sequence data within the framework of the PIR-International Protein Sequence Database (described elsewhere in this volume). Through its WWW server (http://www.mips.biochem.mpg.de ) MIPS provides access to a variety of generic databases, including a database of protein families as well as automatically generated data by the systematic application of sequence analysis algorithms. The yeast genome sequence and its related information was also compiled on CD-ROM to provide dynamic interactive access to the 16 chromosomes of the first eukaryotic genome unraveled. PMID:9399795
G-Anchor: a novel approach for whole-genome comparative mapping utilizing evolutionary conserved DNA sequences.

PubMed

Lenis, Vasileios Panagiotis E; Swain, Martin; Larkin, Denis M

2018-05-01

Cross-species whole-genome sequence alignment is a critical first step for genome comparative analyses, ranging from the detection of sequence variants to studies of chromosome evolution. Animal genomes are large and complex, and whole-genome alignment is a computationally intense process, requiring expensive high-performance computing systems due to the need to explore extensive local alignments. With hundreds of sequenced animal genomes available from multiple projects, there is an increasing demand for genome comparative analyses. Here, we introduce G-Anchor, a new, fast, and efficient pipeline that uses a strictly limited but highly effective set of local sequence alignments to anchor (or map) an animal genome to another species' reference genome. G-Anchor makes novel use of a databank of highly conserved DNA sequence elements. We demonstrate how these elements may be aligned to a pair of genomes, creating anchors. These anchors enable the rapid mapping of scaffolds from a de novo assembled genome to chromosome assemblies of a reference species. Our results demonstrate that G-Anchor can successfully anchor a vertebrate genome onto a phylogenetically related reference species genome using a desktop or laptop computer within a few hours and with comparable accuracy to that achieved by a highly accurate whole-genome alignment tool such as LASTZ. G-Anchor thus makes whole-genome comparisons accessible to researchers with limited computational resources. G-Anchor is a ready-to-use tool for anchoring a pair of vertebrate genomes. It may be used with large genomes that contain a significant fraction of evolutionarily conserved DNA sequences and that are not highly repetitive, polyploid, or excessively fragmented. G-Anchor is not a substitute for whole-genome aligning software but can be used for fast and accurate initial genome comparisons. G-Anchor is freely available and a ready-to-use tool for the pairwise comparison of two genomes.
Repetitive sequences in plant nuclear DNA: types, distribution, evolution and function.

PubMed

Mehrotra, Shweta; Goyal, Vinod

2014-08-01

Repetitive DNA sequences are a major component of eukaryotic genomes and may account for up to 90% of the genome size. They can be divided into minisatellite, microsatellite and satellite sequences. Satellite DNA sequences are considered to be a fast-evolving component of eukaryotic genomes, comprising tandemly-arrayed, highly-repetitive and highly-conserved monomer sequences. The monomer unit of satellite DNA is 150-400 base pairs (bp) in length. Repetitive sequences may be species- or genus-specific, and may be centromeric or subtelomeric in nature. They exhibit cohesive and concerted evolution caused by molecular drive, leading to high sequence homogeneity. Repetitive sequences accumulate variations in sequence and copy number during evolution, hence they are important tools for taxonomic and phylogenetic studies, and are known as "tuning knobs" in evolution. Therefore, knowledge of repetitive sequences assists our understanding of the organization, evolution and behavior of eukaryotic genomes. Repetitive sequences have cytoplasmic, cellular and developmental effects and play a role in chromosomal recombination. In the post-genomics era, with the introduction of next-generation sequencing technology, it is possible to evaluate complex genomes for analyzing repetitive sequences and deciphering the yet unknown functional potential of repetitive sequences. Copyright © 2014 The Authors. Production and hosting by Elsevier Ltd. All rights reserved.
Scanning the human genome at kilobase resolution.

PubMed

Chen, Jun; Kim, Yeong C; Jung, Yong-Chul; Xuan, Zhenyu; Dworkin, Geoff; Zhang, Yanming; Zhang, Michael Q; Wang, San Ming

2008-05-01

Normal genome variation and pathogenic genome alteration frequently affect small regions in the genome. Identifying those genomic changes remains a technical challenge. We report here the development of the DGS (Ditag Genome Scanning) technique for high-resolution analysis of genome structure. The basic features of DGS include (1) use of high-frequent restriction enzymes to fractionate the genome into small fragments; (2) collection of two tags from two ends of a given DNA fragment to form a ditag to represent the fragment; (3) application of the 454 sequencing system to reach a comprehensive ditag sequence collection; (4) determination of the genome origin of ditags by mapping to reference ditags from known genome sequences; (5) use of ditag sequences directly as the sense and antisense PCR primers to amplify the original DNA fragment. To study the relationship between ditags and genome structure, we performed a computational study by using the human genome reference sequences as a model, and analyzed the ditags experimentally collected from the well-characterized normal human DNA GM15510 and the leukemic human DNA of Kasumi-1 cells. Our studies show that DGS provides a kilobase resolution for studying genome structure with high specificity and high genome coverage. DGS can be applied to validate genome assembly, to compare genome similarity and variation in normal populations, and to identify genomic abnormality including insertion, inversion, deletion, translocation, and amplification in pathological genomes such as cancer genomes.
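Features (1) and (2) of DGS, fragmenting the genome with a restriction enzyme and pairing the two end tags of each fragment, can be mimicked in silico as a rough sketch. The recognition site `GATC`, the tag length of 3, and the toy sequence are all assumptions made for illustration; the abstract does not state which enzyme or tag length was used. As a further simplification, the recognition site itself is discarded when the sequence is cut.

```python
def ditags(genome: str, site: str, tag_len: int):
    """Cut `genome` at every occurrence of `site` and pair the end tags of each fragment.

    Fragments too short to yield two non-overlapping tags are skipped.
    Simplification: the recognition site is removed rather than split at the
    enzyme's actual cut position.
    """
    fragments = [f for f in genome.split(site) if len(f) >= 2 * tag_len]
    return [(f[:tag_len], f[-tag_len:]) for f in fragments]


# Toy sequence with two hypothetical GATC sites; the final 2-bp fragment is skipped.
toy_genome = "AAACCCTTT" + "GATC" + "GGGAAATTTCCC" + "GATC" + "AC"
toy_ditags = ditags(toy_genome, site="GATC", tag_len=3)
```

A reference ditag set built this way from a known genome sequence is what experimentally collected ditags would be mapped against in step (4).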
Whole-Genome Sequence Variation among Multiple Isolates of Pseudomonas aeruginosa

PubMed Central

Spencer, David H.; Kas, Arnold; Smith, Eric E.; Raymond, Christopher K.; Sims, Elizabeth H.; Hastings, Michele; Burns, Jane L.; Kaul, Rajinder; Olson, Maynard V.

2003-01-01

Whole-genome shotgun sequencing was used to study the sequence variation of three Pseudomonas aeruginosa isolates, two from clonal infections of cystic fibrosis patients and one from an aquatic environment, relative to the genomic sequence of reference strain PAO1. The majority of the PAO1 genome is represented in these strains; however, at least three prominent islands of PAO1-specific sequence are apparent. Conversely, ∼10% of the sequencing reads derived from each isolate fail to align with the PAO1 backbone. While average sequence variation among all strains is roughly 0.5%, regions of pronounced differences were evident in whole-genome scans of nucleotide diversity. We analyzed two such divergent loci, the pyoverdine and O-antigen biosynthesis regions, by complete resequencing. A thorough analysis of isolates collected over time from one of the cystic fibrosis patients revealed independent mutations resulting in the loss of O-antigen synthesis alternating with a mucoid phenotype. Overall, we conclude that most of the PAO1 genome represents a core P. aeruginosa backbone sequence while the strains addressed in this study possess additional genetic material that accounts for at least 10% of their genomes. Approximately half of these additional sequences are novel. PMID:12562802
Complete Coding Genome Sequence for Mogiana Tick Virus, a Jingmenvirus Isolated from Ticks in Brazil

DTIC Science & Technology

2017-05-04

and capable of infecting a wide range of animal hosts (1–5). Here, we report the complete coding genome sequence (i.e., only missing portions of...segmented nature of the genome was not understood. Therefore, only the two genome segments with detectable sequence homologies to flaviviruses were...originally reported (2). We revisited the data set of Maruyama et al. (2) and assembled the complete coding sequences for all four genome segments. We
Effects of informed consent for individual genome sequencing on relevant knowledge.

PubMed

Kaphingst, K A; Facio, F M; Cheng, M-R; Brooks, S; Eidem, H; Linn, A; Biesecker, B B; Biesecker, L G

2012-11-01

Increasing availability of individual genomic information suggests that patients will need knowledge about genome sequencing to make informed decisions, but prior research is limited. In this study, we examined genome sequencing knowledge before and after informed consent among 311 participants enrolled in the ClinSeq™ sequencing study. An exploratory factor analysis of knowledge items yielded two factors (sequencing limitations knowledge; sequencing benefits knowledge). In multivariable analysis, high pre-consent sequencing limitations knowledge scores were significantly related to education [odds ratio (OR): 8.7, 95% confidence interval (CI): 2.45-31.10 for post-graduate education, and OR: 3.9; 95% CI: 1.05, 14.61 for college degree compared with less than college degree] and race/ethnicity (OR: 2.4, 95% CI: 1.09, 5.38 for non-Hispanic Whites compared with other racial/ethnic groups). Mean values increased significantly between pre- and post-consent for the sequencing limitations knowledge subscale (6.9-7.7, p < 0.0001) and sequencing benefits knowledge subscale (7.0-7.5, p < 0.0001); increase in knowledge did not differ by sociodemographic characteristics. This study highlights gaps in genome sequencing knowledge and underscores the need to target educational efforts toward participants with less education or from minority racial/ethnic groups. The informed consent process improved genome sequencing knowledge. Future studies could examine how genome sequencing knowledge influences informed decision making. © 2012 John Wiley & Sons A/S.
First full-length genome sequence of the polerovirus luffa aphid-borne yellows virus (LABYV) reveals the presence of at least two consensus sequences in an isolate from Thailand.

PubMed

Knierim, Dennis; Maiss, Edgar; Kenyon, Lawrence; Winter, Stephan; Menzel, Wulf

2015-10-01

Luffa aphid-borne yellows virus (LABYV) was proposed as the name for a previously undescribed polerovirus based on partial genome sequences obtained from samples of cucurbit plants collected in Thailand between 2008 and 2013. In this study, we determined the first full-length genome sequence of LABYV. Based on phylogenetic analysis and genome properties, it is clear that this virus represents a distinct species in the genus Polerovirus. Analysis of sequences from sample TH24, which was collected in 2010 from a luffa plant in Thailand, reveals the presence of two different full-length genome consensus sequences.
The Impact of Post-Traumatic Stress Disorder on the Burden of Migraine: Results From the National Comorbidity Survey-Replication.

PubMed

Rao, Aruna S; Scher, Ann I; Vieira, Rebeca V A; Merikangas, Kathleen R; Metti, Andrea L; Peterlin, B Lee

2015-01-01

Post-traumatic stress disorder (PTSD) has been linked with migraine in prior studies. To evaluate the individual and joint burdens of migraine and PTSD in a population-based cohort. The National Comorbidity Survey-Replication (NCS-R) is a general population study conducted in the United States from February 2001-April 2003. PTSD and migraine were assessed, and four groups defined based on their migraine and PTSD status. The four groups included those with no migraine and no PTSD (controls, n=4535), those with migraine and without PTSD (migraine alone, n=236), those with PTSD and without migraine (PTSD alone, n=244), and those with both migraine and PTSD (mig+PTSD, n=68). Logistic and Poisson regression models were used to assess the association between dichotomous/multilevel outcome variables indicating financial, health, and interpersonal burdens and each migraine/PTSD group. Compared to controls, those with Mig+PTSD were more likely to be in the low poverty index (48% vs 41%, AOR 2.16; CI: 1.10, 4.24) and were less likely to be working for pay or profit in the past week (50% vs 68%, AOR 0.42; CI: 0.24, 0.74) but not those with migraine or PTSD alone. Additionally, the number of days where work quality was cut due to physical or mental health or substance abuse in the past month was greater in all groups compared to controls: (1) migraine alone: mean 2.57 (SEM 0.32) vs mean 1.09 (SEM 0.08) days, ARR=2.39; CI: 2.19, 2.62; (2) PTSD alone: mean 2.43 (SEM 0.33) vs mean 1.09 (SEM 0.08) days, ARR=2.09; CI: 1.91, 2.29; (3) mig+PTSD: mean 8.2 (SEM 0.79) vs 1.09 (SEM 0.08) days, ARR 6.79; CI 6.16, 7.49; and was over 2.5-fold greater in those mig+PTSD than migraine alone (mean 8.0 [SEM 0.79] vs 2.6 days [SEM 0.72], ARR 2.77; CI: 2.45, 3.14). 
The likelihood of having difficulty getting along or maintaining a social life was also increased in all groups relative to controls: (1) migraine alone: 21% vs 5.4%, AOR 4.20; CI: 2.62, 6.74; (2) PTSD alone: 18% vs 5.4%, AOR 3.40; CI: 2.40, 4.82; (3) Mig+PTSD: 39% vs 5.4%, AOR 9.95; CI: 5.72, 17.32, and was 2-fold greater in those with Mig+PTSD as compared to those with migraine alone (AOR 2.32; CI: 1.15, 4.69). These findings support the need for those who treat migraine patients to be aware of the comorbidity with PTSD, as these patients may be particularly prone to adverse financial, health, and interpersonal disease burdens. © 2015 American Headache Society.
Twenty-one genome sequences from Pseudomonas species and 19 genome sequences from diverse bacteria isolated from the rhizosphere and endosphere of Populus deltoides.

PubMed

Brown, Steven D; Utturkar, Sagar M; Klingeman, Dawn M; Johnson, Courtney M; Martin, Stanton L; Land, Miriam L; Lu, Tse-Yuan S; Schadt, Christopher W; Doktycz, Mitchel J; Pelletier, Dale A

2012-11-01

To aid in the investigation of the Populus deltoides microbiome, we generated draft genome sequences for 21 Pseudomonas strains and 19 other diverse bacteria isolated from Populus deltoides roots. Genome sequences for isolates similar to Acidovorax, Bradyrhizobium, Brevibacillus, Caulobacter, Chryseobacterium, Flavobacterium, Herbaspirillum, Novosphingobium, Pantoea, Phyllobacterium, Polaromonas, Rhizobium, Sphingobium, and Variovorax were generated.
First High-Quality Draft Genome Sequence of Pasteurella multocida Sequence Type 128 Isolated from Infected Bone.

PubMed

Kavousi, Niloofar; Eng, Wilhelm Wei Han; Lee, Yin Peng; Tan, Lian Huat; Thuraisingham, Ravindran; Yule, Catherine M; Gan, Han Ming

2016-03-03

We report here the first high-quality draft genome sequence of Pasteurella multocida sequence type 128, which was isolated from the infected finger bone of an adult female who was bitten by a domestic dog. The draft genome will be a valuable addition to the scarce genomic resources available for P. multocida. Copyright © 2016 Kavousi et al.
Full-Genome Sequence of Infectious Laryngotracheitis Virus (Gallid Alphaherpesvirus 1) Strain VFAR-043, Isolated in Peru

PubMed Central

Bendezu Eguis, Jorge; Montesinos, Ricardo; Fernández-Díaz, Manolo

2018-01-01

ABSTRACT We report here the first genome sequence of infectious laryngotracheitis virus isolated in Peru from tracheal tissues of layer chickens. The genome showed 99.98% identity to the J2 strain genome sequence. Single nucleotide polymorphisms were detected in five gene-coding sequences related to vaccine development, virus attachment, and viral immune evasion. PMID:29519822
All about the Human Genome Project (HGP)

MedlinePlus

... CSER), and Genome Sequencing Informatics Tools (GS-IT) Comparative Genomics Background information prepared for the media on ... other species to the human sequence. Background on Comparative Genomic Analysis New Process to Prioritize Animal Genomes ...
JGI Fungal Genomics Program

DOE Office of Scientific and Technical Information (OSTI.GOV)

Grigoriev, Igor V.

2011-03-14

Genomes of fungi relevant to energy and the environment are the focus of the Fungal Genomic Program at the US Department of Energy Joint Genome Institute (JGI). Its key project, the Genomics Encyclopedia of Fungi, targets fungi related to plant health (symbionts, pathogens, and biocontrol agents) and biorefinery processes (cellulose degradation, sugar fermentation, industrial hosts), and explores fungal diversity by means of genome sequencing and analysis. Over 50 fungal genomes have been sequenced by JGI to date and released through MycoCosm (www.jgi.doe.gov/fungi), a fungal web portal that integrates sequence and functional data with genome analysis tools for the user community. Sequence analysis supported by functional genomics leads to the development of parts lists for complex systems ranging from ecosystems of biofuel crops to biorefineries. Recent examples of such 'parts' suggested by comparative genomics and functional analysis in these areas are presented here.

Radiation hybrid maps of D-genome of Aegilops tauschii and their application in sequence assembly of large and complex plant genomes

USDA-ARS's Scientific Manuscript database

The large and complex genome of bread wheat (Triticum aestivum L., ~17 Gb) requires high-resolution genome maps saturated with ordered markers to assist in anchoring and orienting BAC contigs/ sequence scaffolds for whole genome sequence assembly. Radiation hybrid (RH) mapping has proven to be an e...
Organizational heterogeneity of vertebrate genomes.

PubMed

Frenkel, Svetlana; Kirzhner, Valery; Korol, Abraham

2012-01-01

Genomes of higher eukaryotes are mosaics of segments with various structural, functional, and evolutionary properties. The availability of whole-genome sequences allows the investigation of their structure as "texts" using different statistical and computational methods. One such method, referred to as Compositional Spectra (CS) analysis, is based on scoring the occurrences of fixed-length oligonucleotides (k-mers) in the target DNA sequence. CS analysis allows generating species- or region-specific characteristics of the genome, regardless of their length and the presence of coding DNA. In this study, we consider the heterogeneity of vertebrate genomes as a joint effect of regional variation in sequence organization superimposed on the differences in nucleotide composition. We estimated compositional and organizational heterogeneity of genome and chromosome sequences separately and found that both heterogeneity types vary widely among genomes as well as among chromosomes in all investigated taxonomic groups. The high correspondence of heterogeneity scores obtained on three genome fractions, coding, repetitive, and the remaining part of the noncoding DNA (the genome dark matter--GDM) allows the assumption that CS-heterogeneity may have functional relevance to genome regulation. Of special interest for such interpretation is the fact that natural GDM sequences display the highest deviation from the corresponding reshuffled sequences.
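The k-mer scoring that underlies Compositional Spectra analysis can be illustrated with a minimal sketch. This is not the authors' implementation: it counts exact k-mer matches only, and the function names and the distance measure (sum of absolute frequency differences) are assumptions chosen for illustration.

```python
from collections import Counter


def kmer_spectrum(sequence, k):
    """Count every overlapping k-mer made only of A, C, G, T (exact matches)."""
    seq = sequence.upper()
    counts = Counter()
    for i in range(len(seq) - k + 1):
        kmer = seq[i:i + k]
        # Skip windows containing ambiguous bases such as N.
        if set(kmer) <= set("ACGT"):
            counts[kmer] += 1
    return counts


def spectrum_distance(spec_a, spec_b):
    """Hypothetical distance: sum of absolute differences of k-mer frequencies."""
    total_a = sum(spec_a.values()) or 1
    total_b = sum(spec_b.values()) or 1
    kmers = set(spec_a) | set(spec_b)
    return sum(abs(spec_a[m] / total_a - spec_b[m] / total_b) for m in kmers)


if __name__ == "__main__":
    a = kmer_spectrum("ACGTACGTAC", 2)
    b = kmer_spectrum("AAAAAAAAAA", 2)
    print(a.most_common(3))
    print(spectrum_distance(a, b))
```

Comparing spectra computed on different chromosomes or genome fractions gives a rough sense of how a heterogeneity score could be built from such counts.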
Comparison of the genomic sequence of the microminipig, a novel breed of swine, with the genomic database for conventional pig.

PubMed

Miura, Naoki; Kucho, Ken-Ichi; Noguchi, Michiko; Miyoshi, Noriaki; Uchiumi, Toshiki; Kawaguchi, Hiroaki; Tanimoto, Akihide

2014-01-01

The microminipig, which weighs less than 10 kg at an early stage of maturity, has been reported as a potential experimental model animal. Its extremely small size and other distinct characteristics suggest the possibility of a number of differences between the genome of the microminipig and that of conventional pigs. In this study, we analyzed the genomes of two healthy microminipigs using a SOLiD™ next-generation sequencing system. We then compared the obtained genomic sequences with a genomic database for the domestic pig (Sus scrofa). The mapping coverage of sequence tags from the microminipig to conventional pig genomic sequences was greater than 96%, and we detected no clear, substantial genomic variance from these data. The results may indicate that the distinct characteristics of the microminipig derive from small-scale alterations in the genome, such as single nucleotide polymorphisms or translational modifications, rather than large-scale deletion or insertion polymorphisms. Further investigation of the entire genomic sequence of the microminipig with methods enabling deeper coverage is required to elucidate the genetic basis of its distinct phenotypic traits. Copyright © 2014 International Institute of Anticancer Research (Dr. John G. Delinassios), All rights reserved.
Genome survey sequencing of red swamp crayfish Procambarus clarkii.

PubMed

Shi, Linlin; Yi, Shaokui; Li, Yanhe

2018-06-21

The red swamp crayfish, Procambarus clarkii, is an important commercial aquatic species in China. It is an active focus of research, and its genetic improvement is an urgent need for crayfish aquaculture in China. However, knowledge of its genomic landscape is limited. In this study, the P. clarkii genome was surveyed using Illumina's Solexa sequencing platform, and its genome size was estimated using flow cytometry. Interestingly, the estimated genome size was about 8.50 Gb by flow cytometry but 1.86 Gb by genome survey sequencing. Based on the assembled genome sequences, a total of 136,962 genes and 152,268 exons were predicted, and the predicted genes ranged from 150 to 12,807 bp in length. The survey sequences could help accelerate gene discovery for studies of genetic diversity and evolutionary analysis, even though they could not be successfully applied to estimating the P. clarkii genome size.
Fueling the Future with Fungal Genomes

DOE Office of Scientific and Technical Information (OSTI.GOV)

Grigoriev, Igor V.

2014-10-27

Genomes of fungi relevant to energy and the environment are the focus of the JGI Fungal Genomic Program. One of its projects, the Genomics Encyclopedia of Fungi, targets fungi related to plant health (symbionts and pathogens) and biorefinery processes (cellulose degradation and sugar fermentation) by means of genome sequencing and analysis. New chapters of the Encyclopedia can be opened with user proposals to the JGI Community Science Program (CSP). Another JGI project, the 1000 fungal genomes, explores fungal diversity at the genome level at scale and is open for users to nominate new species for sequencing. Over 400 fungal genomes have been sequenced by JGI to date and released through MycoCosm (www.jgi.doe.gov/fungi), a fungal web portal that integrates sequence and functional data with genome analysis tools for the user community. Sequence analysis supported by functional genomics will lead to the development of parts lists for complex systems ranging from ecosystems of biofuel crops to biorefineries. Recent examples of such ‘parts’ suggested by comparative genomics and functional analysis in these areas are presented here.
The whole genome sequences and experimentally phased haplotypes of over 100 personal genomes.

PubMed

Mao, Qing; Ciotlos, Serban; Zhang, Rebecca Yu; Ball, Madeleine P; Chin, Robert; Carnevali, Paolo; Barua, Nina; Nguyen, Staci; Agarwal, Misha R; Clegg, Tom; Connelly, Abram; Vandewege, Ward; Zaranek, Alexander Wait; Estep, Preston W; Church, George M; Drmanac, Radoje; Peters, Brock A

2016-10-11

Since the completion of the Human Genome Project in 2003, it is estimated that more than 200,000 individual whole human genomes have been sequenced, a stunning accomplishment in such a short period of time. However, most of these were sequenced without experimental haplotype data and are therefore missing an important aspect of genome biology. In addition, much of the genomic data is not available to the public and lacks phenotypic information. As part of the Personal Genome Project, blood samples from 184 participants were collected and processed using Complete Genomics' Long Fragment Read technology. Here, we present the experimental whole genome haplotyping and sequencing of these samples to an average read coverage depth of 100X. This is approximately three-fold higher than the read coverage applied to most whole human genome assemblies and ensures the highest quality results. Currently, 114 genomes from this dataset are freely available in the GigaDB repository and are associated with rich phenotypic data; the remaining 70 should be added in the near future as they are approved through the PGP data release process. For reproducibility analyses, 20 genomes were sequenced at least twice using independent LFR barcoded libraries. Seven genomes were also sequenced using Complete Genomics' standard non-barcoded library process. In addition, we report 2.6 million high-quality, rare variants not previously identified in the Single Nucleotide Polymorphisms database or the 1000 Genomes Project Phase 3 data. These genomes represent a unique source of haplotype and phenotype data for the scientific community and should help to expand our understanding of human genome evolution and function.
Genomic prediction using imputed whole-genome sequence data in Holstein Friesian cattle.

PubMed

van Binsbergen, Rianne; Calus, Mario P L; Bink, Marco C A M; van Eeuwijk, Fred A; Schrooten, Chris; Veerkamp, Roel F

2015-09-17

In contrast to currently used single nucleotide polymorphism (SNP) panels, the use of whole-genome sequence data is expected to enable the direct estimation of the effects of causal mutations on a given trait. This could lead to higher reliabilities of genomic predictions compared to those based on SNP genotypes. Also, at each generation of selection, recombination events between a SNP and a mutation can cause decay in reliability of genomic predictions based on markers rather than on the causal variants. Our objective was to investigate the use of imputed whole-genome sequence genotypes versus high-density SNP genotypes on (the persistency of) the reliability of genomic predictions using real cattle data. Highly accurate phenotypes based on daughter performance and Illumina BovineHD Beadchip genotypes were available for 5503 Holstein Friesian bulls. The BovineHD genotypes (631,428 SNPs) of each bull were used to impute whole-genome sequence genotypes (12,590,056 SNPs) using the Beagle software. Imputation was done using a multi-breed reference panel of 429 sequenced individuals. Genomic estimated breeding values for three traits were predicted using a Bayesian stochastic search variable selection (BSSVS) model and a genome-enabled best linear unbiased prediction model (GBLUP). Reliabilities of predictions were based on 2087 validation bulls, while the other 3416 bulls were used for training. Prediction reliabilities ranged from 0.37 to 0.52. BSSVS performed better than GBLUP in all cases. Reliabilities of genomic predictions were slightly lower with imputed sequence data than with BovineHD chip data. Also, the reliabilities tended to be lower for both sequence data and BovineHD chip data when relationships between training animals were low. No increase in persistency of prediction reliability using imputed sequence data was observed. Compared to BovineHD genotype data, using imputed sequence data for genomic prediction produced no advantage. 
To investigate the putative advantage of genomic prediction using (imputed) sequence data, a training set with a larger number of individuals that are distantly related to each other and genomic prediction models that incorporate biological information on the SNPs or that apply stricter SNP pre-selection should be considered.
First genome report on novel sequence types of Neisseria meningitidis: ST12777 and ST12778.

PubMed

Veeraraghavan, Balaji; Lal, Binesh; Devanga Ragupathi, Naveen Kumar; Neeravi, Iyyan Raj; Jeyaraman, Ranjith; Varghese, Rosemol; Paul, Miracle Magdalene; Baskaran, Ashtawarthani; Ranjan, Ranjini

2018-03-01

Neisseria meningitidis is an important causative agent of meningitis and/or sepsis with high morbidity and mortality. Baseline genome data on N. meningitidis, especially from developing countries such as India, are lacking. This study aimed to investigate the whole genome sequences of N. meningitidis isolates from a tertiary care centre in India. Whole-genome sequencing was performed using an Ion Torrent™ Personal Genome Machine™ (PGM) with 400-bp chemistry. Data were assembled de novo using SPAdes Genome Assembler v.5.0.0.0. Sequence annotation was performed through PATRIC, RAST and the NCBI PGAAP server. Downstream analysis of the isolates was performed using the Center for Genomic Epidemiology databases for antimicrobial resistance genes and sequence types. Virulence factors and CRISPR were analysed using the PubMLST database and CRISPRFinder, respectively. This study reports the whole genome shotgun sequences of eight N. meningitidis isolates from bloodstream infections. The genome data revealed two novel sequence types (ST12777 and ST12778), along with ST11, ST437 and ST6928. The virulence profile of the isolates matched their sequence types. All isolates were negative for plasmid-mediated resistance genes. To the best of our knowledge, this is the first report of ST11 and ST437 N. meningitidis isolates in India along with two novel sequence types (ST12777 and ST12778). These results indicate that the sequence types circulating in India are diverse and require continuous monitoring. Further studies strengthening the genome data on N. meningitidis are required to understand the prevalence, spread, exact resistance and virulence mechanisms along with serotypes. Copyright © 2017 International Society for Chemotherapy of Infection and Cancer. Published by Elsevier Ltd. All rights reserved.
de novo assembly and population genomic survey of natural yeast isolates with the Oxford Nanopore MinION sequencer.

PubMed

Istace, Benjamin; Friedrich, Anne; d'Agata, Léo; Faye, Sébastien; Payen, Emilie; Beluche, Odette; Caradec, Claudia; Davidas, Sabrina; Cruaud, Corinne; Liti, Gianni; Lemainque, Arnaud; Engelen, Stefan; Wincker, Patrick; Schacherer, Joseph; Aury, Jean-Marc

2017-02-01

Oxford Nanopore Technologies Ltd (Oxford, UK) have recently commercialized MinION, a small single-molecule nanopore sequencer, that offers the possibility of sequencing long DNA fragments from small genomes in a matter of seconds. The Oxford Nanopore technology is truly disruptive; it has the potential to revolutionize genomic applications due to its portability, low cost, and ease of use compared with existing long reads sequencing technologies. The MinION sequencer enables the rapid sequencing of small eukaryotic genomes, such as the yeast genome. Combined with existing assembler algorithms, near complete genome assemblies can be generated and comprehensive population genomic analyses can be performed. Here, we resequenced the genome of the Saccharomyces cerevisiae S288C strain to evaluate the performance of nanopore-only assemblers. Then we de novo sequenced and assembled the genomes of 21 isolates representative of the S. cerevisiae genetic diversity using the MinION platform. The contiguity of our assemblies was 14 times higher than the Illumina-only assemblies and we obtained one or two long contigs for 65 % of the chromosomes. This high contiguity allowed us to accurately detect large structural variations across the 21 studied genomes. Because of the high completeness of the nanopore assemblies, we were able to produce a complete cartography of transposable elements insertions and inspect structural variants that are generally missed using a short-read sequencing strategy. Our analyses show that the Oxford Nanopore technology is already usable for de novo sequencing and assembly; however, non-random errors in homopolymers require polishing the consensus using an alternate sequencing technology. © The Author 2017. Published by Oxford University Press.
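The abstract above does not state how assembly contiguity was measured. A common summary statistic for contiguity is N50, and the sketch below shows how it is computed from a list of contig lengths. Treating N50 as the relevant metric is an assumption here, not something taken from the record, and the example lengths are invented.

```python
def n50(contig_lengths):
    """Return the N50: the length L such that contigs of length >= L
    together cover at least half of the total assembly length."""
    lengths = sorted(contig_lengths, reverse=True)
    half_total = sum(lengths) / 2
    running = 0
    for length in lengths:
        running += length
        if running >= half_total:
            return length
    return 0  # empty assembly


if __name__ == "__main__":
    # Invented contig lengths (bp) for two hypothetical assemblies.
    fragmented = [10_000, 20_000, 30_000, 40_000]
    contiguous = [900_000, 50_000, 50_000]
    print(n50(fragmented), n50(contiguous))
```

A ratio of two such N50 values is one way a statement like "14 times higher contiguity" could be expressed, although other metrics (for example, contig count) are also used.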
de novo assembly and population genomic survey of natural yeast isolates with the Oxford Nanopore MinION sequencer

PubMed Central

Istace, Benjamin; Friedrich, Anne; d'Agata, Léo; Faye, Sébastien; Payen, Emilie; Beluche, Odette; Caradec, Claudia; Davidas, Sabrina; Cruaud, Corinne; Liti, Gianni; Lemainque, Arnaud; Engelen, Stefan; Wincker, Patrick; Schacherer, Joseph

2017-01-01

Background: Oxford Nanopore Technologies Ltd (Oxford, UK) have recently commercialized MinION, a small single-molecule nanopore sequencer, that offers the possibility of sequencing long DNA fragments from small genomes in a matter of seconds. The Oxford Nanopore technology is truly disruptive; it has the potential to revolutionize genomic applications due to its portability, low cost, and ease of use compared with existing long reads sequencing technologies. The MinION sequencer enables the rapid sequencing of small eukaryotic genomes, such as the yeast genome. Combined with existing assembler algorithms, near complete genome assemblies can be generated and comprehensive population genomic analyses can be performed. Results: Here, we resequenced the genome of the Saccharomyces cerevisiae S288C strain to evaluate the performance of nanopore-only assemblers. Then we de novo sequenced and assembled the genomes of 21 isolates representative of the S. cerevisiae genetic diversity using the MinION platform. The contiguity of our assemblies was 14 times higher than the Illumina-only assemblies and we obtained one or two long contigs for 65 % of the chromosomes. This high contiguity allowed us to accurately detect large structural variations across the 21 studied genomes. Conclusion: Because of the high completeness of the nanopore assemblies, we were able to produce a complete cartography of transposable elements insertions and inspect structural variants that are generally missed using a short-read sequencing strategy. Our analyses show that the Oxford Nanopore technology is already usable for de novo sequencing and assembly; however, non-random errors in homopolymers require polishing the consensus using an alternate sequencing technology. PMID:28369459
Sequencing of a QTL-rich region of the Theobroma cacao genome using pooled BACs and the identification of trait specific candidate genes

PubMed Central

2011-01-01

Background: BAC-based physical maps provide for sequencing across an entire genome or a selected sub-genomic region of biological interest. Such a region can be approached with next-generation whole-genome sequencing and assembly as if it were an independent small genome. Using the minimum tiling path as a guide, specific BAC clones representing the prioritized genomic interval are selected, pooled, and used to prepare a sequencing library. Results: This pooled BAC approach was taken to sequence and assemble a QTL-rich region, of ~3 Mbp and represented by twenty-seven BACs, on linkage group 5 of the Theobroma cacao cv. Matina 1-6 genome. Using various mixtures of read coverages from paired-end and linear 454 libraries, multiple assemblies of varied quality were generated. Quality was assessed by comparing the assembly of 454 reads with a subset of ten BACs individually sequenced and assembled using Sanger reads. A mixture of reads optimal for assembly was identified. We found, furthermore, that a quality assembly suitable for serving as a reference genome template could be obtained even with a reduced depth of sequencing coverage. Annotation of the resulting assembly revealed several genes potentially responsible for three T. cacao traits: black pod disease resistance, bean shape index, and pod weight. Conclusions: Our results, as with other pooled BAC sequencing reports, suggest that pooling portions of a minimum tiling path derived from a BAC-based physical map is an effective method to target sub-genomic regions for sequencing. While we focused on a single QTL region, other QTL regions of importance could be similarly sequenced allowing for biological discovery to take place before a high quality whole-genome assembly is completed. PMID:21794110
Sequencing of a QTL-rich region of the Theobroma cacao genome using pooled BACs and the identification of trait specific candidate genes.

PubMed

Feltus, Frank A; Saski, Christopher A; Mockaitis, Keithanne; Haiminen, Niina; Parida, Laxmi; Smith, Zachary; Ford, James; Staton, Margaret E; Ficklin, Stephen P; Blackmon, Barbara P; Cheng, Chun-Huai; Schnell, Raymond J; Kuhn, David N; Motamayor, Juan-Carlos

2011-07-27

BAC-based physical maps provide for sequencing across an entire genome or a selected sub-genomic region of biological interest. Such a region can be approached with next-generation whole-genome sequencing and assembly as if it were an independent small genome. Using the minimum tiling path as a guide, specific BAC clones representing the prioritized genomic interval are selected, pooled, and used to prepare a sequencing library. This pooled BAC approach was taken to sequence and assemble a QTL-rich region, of ~3 Mbp and represented by twenty-seven BACs, on linkage group 5 of the Theobroma cacao cv. Matina 1-6 genome. Using various mixtures of read coverages from paired-end and linear 454 libraries, multiple assemblies of varied quality were generated. Quality was assessed by comparing the assembly of 454 reads with a subset of ten BACs individually sequenced and assembled using Sanger reads. A mixture of reads optimal for assembly was identified. We found, furthermore, that a quality assembly suitable for serving as a reference genome template could be obtained even with a reduced depth of sequencing coverage. Annotation of the resulting assembly revealed several genes potentially responsible for three T. cacao traits: black pod disease resistance, bean shape index, and pod weight. Our results, as with other pooled BAC sequencing reports, suggest that pooling portions of a minimum tiling path derived from a BAC-based physical map is an effective method to target sub-genomic regions for sequencing. While we focused on a single QTL region, other QTL regions of importance could be similarly sequenced allowing for biological discovery to take place before a high quality whole-genome assembly is completed.
Development of Mycoplasma synoviae (MS) core genome multilocus sequence typing (cgMLST) scheme.

PubMed

Ghanem, Mostafa; El-Gazzar, Mohamed

2018-05-01

Mycoplasma synoviae (MS) is a poultry pathogen with reported increased prevalence and virulence in recent years. MS strain identification is essential for prevention, control efforts and epidemiological outbreak investigations. Multiple multilocus sequence typing schemes have been developed for MS, yet the resolution of these schemes could be limited for outbreak investigation. The cost of whole genome sequencing has become close to that of sequencing the seven MLST targets; however, there is no standardized method for typing MS strains based on whole genome sequences. In this paper, we propose a core genome multilocus sequence typing (cgMLST) scheme as a standardized and reproducible method for typing MS based on whole genome sequences. A diverse set of 25 MS whole genome sequences was used to identify 302 core genome genes as cgMLST targets (35.5% of the MS genome), and 44 whole genome sequences of MS isolates from six countries on four continents were typed by applying this scheme. cgMLST-based phylogenetic trees displayed a high degree of agreement with core genome SNP-based analysis and available epidemiological information. cgMLST allowed evaluation of two conventional MLST schemes of MS. The high discriminatory power of cgMLST allowed differentiation between samples of the same conventional MLST type. cgMLST represents a standardized, accurate, highly discriminatory, and reproducible method for differentiation between MS isolates. Like conventional MLST, it provides stable and expandable nomenclature, allowing for comparing and sharing the typing results between different laboratories worldwide. Copyright © 2018 The Authors. Published by Elsevier B.V. All rights reserved.
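In a cgMLST scheme, each isolate is described by an allelic profile (one allele number per core-genome locus), and isolates are compared by counting the loci at which their alleles differ. The sketch below illustrates that comparison only. The locus names, allele numbers, and the convention of using None for a missing locus are hypothetical and are not taken from the MS scheme described above.

```python
def allelic_distance(profile_a, profile_b):
    """Compare two cgMLST allelic profiles (dict: locus -> allele number).

    Returns (differing_loci, compared_loci). Loci that are absent or
    marked None in either profile are ignored (assumed convention).
    """
    compared = [
        locus for locus in profile_a
        if profile_a[locus] is not None and profile_b.get(locus) is not None
    ]
    differing = sum(1 for locus in compared if profile_a[locus] != profile_b[locus])
    return differing, len(compared)


if __name__ == "__main__":
    # Hypothetical profiles over four made-up loci.
    isolate_1 = {"locus_001": 1, "locus_002": 4, "locus_003": 2, "locus_004": None}
    isolate_2 = {"locus_001": 1, "locus_002": 5, "locus_003": 2, "locus_004": 7}
    print(allelic_distance(isolate_1, isolate_2))
```

Because a cgMLST scheme covers hundreds of loci rather than seven, two isolates that share a conventional MLST type can still differ at many loci, which is the source of the higher discriminatory power the abstract reports.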
High-Throughput resequencing of maize landraces at genomic regions associated with flowering time

USDA-ARS's Scientific Manuscript database

Despite the reduction in the price of sequencing, it remains expensive to sequence and assemble whole, complex genomes of multiple samples for population studies, particularly for large genomes like those of many crop species. Enrichment of target genome regions coupled with next generation sequenci...
GSP: A web-based platform for designing genome-specific primers in polyploids

USDA-ARS's Scientific Manuscript database

The sequences among subgenomes in a polyploid species have high similarity. This makes it difficult to design genome-specific primers for sequence analysis. We present a web-based platform named GSP for designing genome-specific primers to distinguish subgenome sequences in the polyploid genome backgr...
Fluorescence in situ hybridization and optical mapping to correct scaffold arrangement in the tomato genome

USDA-ARS's Scientific Manuscript database

Modern biological analyses are often assisted by recent technologies making the sequencing of complex genomes both technically possible and feasible. We recently sequenced the tomato genome that, like many eukaryotic genomes, is large and complex. Current sequencing technologies allow the developmen...
The Contribution of Short Repeats of Low Sequence Complexity to Large Conifer Genomes

Treesearch

A. Schmidt; R.L. Doudrick; J.S. Heslop-Harrison; T. Schmidt

2000-01-01

Abstract: The abundance and genomic organization of six simple sequence repeats, consisting of di-, tri-, and tetranucleotide sequence motifs, and a minisatellite repeat have been analyzed in different gymnosperms by Southern hybridization. Within the gymnosperm genomes investigated, the abundance and genomic organization of micro- and...
Microsatellite DNA in genomic survey sequences and UniGenes of loblolly pine

Treesearch

Craig S Echt; Surya Saha; Dennis L Deemer; C Dana Nelson

2011-01-01

Genomic DNA sequence databases are a potential and growing resource for simple sequence repeat (SSR) marker development in loblolly pine (Pinus taeda L.). Loblolly pine also has many expressed sequence tags (ESTs) available for microsatellite (SSR) marker development. We compared loblolly pine SSR densities in genome survey sequences (GSSs) to those in non-redundant...
A computational genomics pipeline for prokaryotic sequencing projects.

PubMed

Kislyuk, Andrey O; Katz, Lee S; Agrawal, Sonia; Hagen, Matthew S; Conley, Andrew B; Jayaraman, Pushkala; Nelakuditi, Viswateja; Humphrey, Jay C; Sammons, Scott A; Govil, Dhwani; Mair, Raydel D; Tatti, Kathleen M; Tondella, Maria L; Harcourt, Brian H; Mayer, Leonard W; Jordan, I King

2010-08-01

New sequencing technologies have accelerated research on prokaryotic genomes and have made genome sequencing operations outside major genome sequencing centers routine. However, no off-the-shelf solution exists for the combined assembly, gene prediction, genome annotation and data presentation necessary to interpret sequencing data. The resulting requirement to invest significant resources into custom informatics support for genome sequencing projects remains a major impediment to the accessibility of high-throughput sequence data. We present a self-contained, automated high-throughput open source genome sequencing and computational genomics pipeline suitable for prokaryotic sequencing projects. The pipeline has been used at the Georgia Institute of Technology and the Centers for Disease Control and Prevention for the analysis of Neisseria meningitidis and Bordetella bronchiseptica genomes. The pipeline is capable of enhanced or manually assisted reference-based assembly using multiple assemblers and modes; gene predictor combining; and functional annotation of genes and gene products. Because every component of the pipeline is executed on a local machine with no need to access resources over the Internet, the pipeline is suitable for projects of a sensitive nature. Annotation of virulence-related features makes the pipeline particularly useful for projects working with pathogenic prokaryotes. The pipeline is licensed under the open-source GNU General Public License and available at the Georgia Tech Neisseria Base (http://nbase.biology.gatech.edu/). The pipeline is implemented with a combination of Perl, Bourne Shell and MySQL and is compatible with Linux and other Unix systems.
Low levels of LTR retrotransposon deletion by ectopic recombination in the gigantic genomes of salamanders.

PubMed

Frahry, Matthew Blake; Sun, Cheng; Chong, Rebecca A; Mueller, Rachel Lockridge

2015-02-01

Across the tree of life, species vary dramatically in nuclear genome size. Mutations that add or remove sequences from genomes (insertions or deletions, or indels) are the ultimate source of this variation. Differences in the tempo and mode of insertion and deletion across taxa have been proposed to contribute to evolutionary diversity in genome size. Among vertebrates, most of the largest genomes are found within the salamanders, an amphibian clade with genome sizes ranging from ~14 to ~120 Gb. Salamander genomes have been shown to experience slower rates of DNA loss through small (i.e., <30 bp) deletions than do other vertebrate genomes. However, no studies have addressed DNA loss from salamander genomes resulting from larger deletions. Here, we focus on one type of large deletion: ectopic-recombination-mediated removal of LTR retrotransposon sequences. In ectopic recombination, double-strand breaks are repaired using a "wrong" (i.e., ectopic, or non-allelic) template sequence, typically another locus of similar sequence. When breaks occur within the LTR portions of LTR retrotransposons, ectopic-recombination-mediated repair can produce deletions that remove the internal transposon sequence and the equivalent of one of the two LTR sequences. These deletions leave a signature in the genome: a solo LTR sequence. We compared levels of solo LTRs in the genomes of four salamander species with levels present in five vertebrates with smaller genomes. Our results demonstrate that salamanders have low levels of solo LTRs, suggesting that ectopic-recombination-mediated deletion of LTR retrotransposons occurs more slowly than in other vertebrates with smaller genomes.

Project 1: Microbial Genomes: A Genomic Approach to Understanding the Evolution of Virulence. Project 2: From Genomes to Life: Drosophilia Development in Space and Time

DOE Office of Scientific and Technical Information (OSTI.GOV)

Robert DeSalle

2004-09-10

This project seeks to use the genomes of two close relatives, A. actinomycetemcomitans and H. aphrophilus, to understand the evolutionary changes that take place in a genome to make it more or less virulent. Our primary specific aim of this project was to sequence, annotate, and analyze the genomes of Actinobacillus actinomycetemcomitans (CU1000, serotype f) and Haemophilus aphrophilus. With these genome sequences, we have then compared the whole genome sequences to each other and to the current Aa (HK1651 www.genome.ou.edu) genome project sequence, along with other fully sequenced Pasteurellaceae, to determine inter- and intra-species differences that may account for the differences and similarities in disease. We also propose to create and curate a comprehensive database where sequence information and analysis for the Pasteurellaceae (the family that includes the genera Actinobacillus and Haemophilus) are readily accessible. Finally, we have proposed to develop phylogenetic techniques that can be used to efficiently and accurately examine the evolution of genomes. Progress on these specific aims is reported below under two major headings: experimental approaches, and bioinformatics and systematic biology approaches.
Progress in Understanding and Sequencing the Genome of Brassica rapa

PubMed Central

Hong, Chang Pyo; Kwon, Soo-Jin; Kim, Jung Sun; Yang, Tae-Jin; Park, Beom-Seok; Lim, Yong Pyo

2008-01-01

Brassica rapa, which is closely related to Arabidopsis thaliana, is an important crop and a model plant for studying genome evolution via polyploidization. We report the current understanding of the genome structure of B. rapa and efforts for the whole-genome sequencing of the species. The tribe Brassiceae, which comprises ca. 240 species, descended from a common hexaploid ancestor with a basic genome similar to that of Arabidopsis. Chromosome rearrangements, including fusions and/or fissions, resulted in the present-day “diploid” Brassica species with variation in chromosome number and phenotype. Triplicated genomic segments of B. rapa are collinear to those of A. thaliana with InDels. The genome triplication has led to an approximately 1.7-fold increase in the B. rapa gene number compared to that of A. thaliana. Repetitive DNA of B. rapa has also been extensively amplified and has diverged from that of A. thaliana. For its whole-genome sequencing, the Brassica rapa Genome Sequencing Project (BrGSP) consortium has developed suitable genomic resources and constructed genetic and physical maps. Ten chromosomes of B. rapa are being allocated to BrGSP consortium participants, and each chromosome will be sequenced by a BAC-by-BAC approach. Genome sequencing of B. rapa will offer a new perspective for plant biology and evolution in the context of polyploidization. PMID:18288250
Genome-Wide Search Identifies 1.9 Mb from the Polar Bear Y Chromosome for Evolutionary Analyses

PubMed Central

Bidon, Tobias; Schreck, Nancy; Hailer, Frank; Nilsson, Maria A.; Janke, Axel

2015-01-01

The male-inherited Y chromosome is the major haploid fraction of the mammalian genome, rendering Y-linked sequences an indispensable resource for evolutionary research. However, despite recent large-scale genome sequencing approaches, only a handful of Y chromosome sequences have been characterized to date, mainly in model organisms. Using polar bear (Ursus maritimus) genomes, we compare two different in silico approaches to identify Y-linked sequences: 1) Similarity to known Y-linked genes and 2) difference in the average read depth of autosomal versus sex chromosomal scaffolds. Specifically, we mapped available genomic sequencing short reads from a male and a female polar bear against the reference genome and identify 112 Y-chromosomal scaffolds with a combined length of 1.9 Mb. We verified the in silico findings for the longer polar bear scaffolds by male-specific in vitro amplification, demonstrating the reliability of the average read depth approach. The obtained Y chromosome sequences contain protein-coding sequences, single nucleotide polymorphisms, microsatellites, and transposable elements that are useful for evolutionary studies. A high-resolution phylogeny of the polar bear patriline shows two highly divergent Y chromosome lineages, obtained from analysis of the identified Y scaffolds in 12 previously published male polar bear genomes. Moreover, we find evidence of gene conversion among ZFX and ZFY sequences in the giant panda lineage and in the ancestor of ursine and tremarctine bears. Thus, the identification of Y-linked scaffold sequences from unordered genome sequences yields valuable data to infer phylogenomic and population-genomic patterns in bears. PMID:26019166
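The read-depth approach described in this record can be illustrated with a small sketch. It assumes that per-scaffold mean depths for a male and a female sample are already available and normalized by each sample's autosomal mean. The `ScaffoldDepth` class, the `is_y_candidate` function, and both thresholds are hypothetical choices for illustration, not the parameters used by the authors.

```python
from dataclasses import dataclass


@dataclass
class ScaffoldDepth:
    name: str
    male_depth: float    # mean read depth in the male sample
    female_depth: float  # mean read depth in the female sample


def is_y_candidate(s, male_autosomal, female_autosomal,
                   male_window=(0.25, 0.75), max_female=0.1):
    """Return True if the normalized depths look Y-like.

    The thresholds are illustrative assumptions, not values from the study.
    """
    # A single-copy Y scaffold should sit near half the autosomal depth in males
    male_norm = s.male_depth / male_autosomal
    # and near zero depth in females, who carry no Y chromosome.
    female_norm = s.female_depth / female_autosomal
    low, high = male_window
    return low <= male_norm <= high and female_norm <= max_female


# Toy depths: an autosomal, an X-like, and a Y-like scaffold.
scaffolds = [
    ScaffoldDepth("scaf_auto", 30.0, 31.0),
    ScaffoldDepth("scaf_x", 15.0, 30.0),
    ScaffoldDepth("scaf_y", 15.0, 0.2),
]
y_like = [s.name for s in scaffolds if is_y_candidate(s, 30.0, 30.0)]
print(y_like)
```

The X-like scaffold is rejected because its female depth matches the autosomal level, which is why both samples are needed rather than the male sample alone.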
Novel Insights into Tree Biology and Genome Evolution as Revealed Through Genomics.

PubMed

Neale, David B; Martínez-García, Pedro J; De La Torre, Amanda R; Montanari, Sara; Wei, Xiao-Xin

2017-04-28

Reference genome sequences are the key to the discovery of genes and gene families that determine traits of interest. Recent progress in sequencing technologies has enabled a rapid increase in genome sequencing of tree species, allowing the dissection of complex characters of economic importance, such as fruit and wood quality and resistance to biotic and abiotic stresses. Although the number of reference genome sequences for trees lags behind those for other plant species, it is not too early to gain insight into the unique features that distinguish trees from nontree plants. Our review of the published data suggests that, although many gene families are conserved among herbaceous and tree species, some gene families, such as those involved in resistance to biotic and abiotic stresses and in the synthesis and transport of sugars, are often expanded in tree genomes. As the genomes of more tree species are sequenced, comparative genomics will further elucidate the complexity of tree genomes and how this relates to traits unique to trees.
Rapid sequencing of the bamboo mitochondrial genome using Illumina technology and parallel episodic evolution of organelle genomes in grasses.

PubMed

Ma, Peng-Fei; Guo, Zhen-Hua; Li, De-Zhu

2012-01-01

Compared to their counterparts in animals, the mitochondrial (mt) genomes of angiosperms exhibit a number of unique features. However, unravelling their evolution is hindered by the small number of completed genomes, essentially all of which were Sanger sequenced. While next-generation sequencing technologies have revolutionized chloroplast genome sequencing, they are just beginning to be applied to angiosperm mt genomes. Chloroplast genomes of grasses (Poaceae) have undergone episodic evolution and the evolutionary rate was suggested to be correlated between chloroplast and mt genomes in Poaceae. It is interesting to investigate whether correlated rate change also occurred in grass mt genomes as expected under lineage effects. A time-calibrated phylogenetic tree is needed to examine rate change. We determined a largely completed mt genome from a bamboo, Ferrocalamus rimosivaginus (Poaceae), through Illumina sequencing of total DNA. With a combination of de novo and reference-guided assembly, 39.5-fold coverage Illumina reads were finally assembled into scaffolds totalling 432,839 bp. The assembled genome contains nearly the same genes as the completed mt genomes in Poaceae. For examining evolutionary rate in grass mt genomes, we reconstructed a phylogenetic tree including 22 taxa based on 31 mt genes. The topology of the well-resolved tree was almost identical to that inferred from the chloroplast genome, with only a minor difference. The inconsistency was possibly derived from long branch attraction in the mtDNA tree. By calculating absolute substitution rates, we found significant rate change (∼4-fold) in the mt genome before and after the diversification of Poaceae both in synonymous and nonsynonymous terms. Furthermore, the rate change was correlated with that of chloroplast genomes in grasses. Our results demonstrate that Illumina sequencing technology is a rapid and efficient approach to obtaining angiosperm mt genome sequences. The parallel episodic evolution of mt and chloroplast genomes in grasses is consistent with lineage effects.
Rapid Sequencing of the Bamboo Mitochondrial Genome Using Illumina Technology and Parallel Episodic Evolution of Organelle Genomes in Grasses

PubMed Central

Ma, Peng-Fei; Guo, Zhen-Hua; Li, De-Zhu

2012-01-01

Background Compared to their counterparts in animals, the mitochondrial (mt) genomes of angiosperms exhibit a number of unique features. However, unravelling their evolution is hindered by the small number of completed genomes, essentially all of which were Sanger sequenced. While next-generation sequencing technologies have revolutionized chloroplast genome sequencing, they are just beginning to be applied to angiosperm mt genomes. Chloroplast genomes of grasses (Poaceae) have undergone episodic evolution and the evolutionary rate was suggested to be correlated between chloroplast and mt genomes in Poaceae. It is interesting to investigate whether correlated rate change also occurred in grass mt genomes as expected under lineage effects. A time-calibrated phylogenetic tree is needed to examine rate change. Methodology/Principal Findings We determined a largely completed mt genome from a bamboo, Ferrocalamus rimosivaginus (Poaceae), through Illumina sequencing of total DNA. With a combination of de novo and reference-guided assembly, 39.5-fold coverage Illumina reads were finally assembled into scaffolds totalling 432,839 bp. The assembled genome contains nearly the same genes as the completed mt genomes in Poaceae. For examining evolutionary rate in grass mt genomes, we reconstructed a phylogenetic tree including 22 taxa based on 31 mt genes. The topology of the well-resolved tree was almost identical to that inferred from the chloroplast genome, with only a minor difference. The inconsistency was possibly derived from long branch attraction in the mtDNA tree. By calculating absolute substitution rates, we found significant rate change (∼4-fold) in the mt genome before and after the diversification of Poaceae both in synonymous and nonsynonymous terms. Furthermore, the rate change was correlated with that of chloroplast genomes in grasses. Conclusions/Significance Our results demonstrate that Illumina sequencing technology is a rapid and efficient approach to obtaining angiosperm mt genome sequences. The parallel episodic evolution of mt and chloroplast genomes in grasses is consistent with lineage effects. PMID:22272330
Twenty-One Genome Sequences from Pseudomonas Species and 19 Genome Sequences from Diverse Bacteria Isolated from the Rhizosphere and Endosphere of Populus deltoides

PubMed Central

Utturkar, Sagar M.; Klingeman, Dawn M.; Johnson, Courtney M.; Martin, Stanton L.; Land, Miriam L.; Lu, Tse-Yuan S.; Schadt, Christopher W.; Doktycz, Mitchel J.

2012-01-01

To aid in the investigation of the Populus deltoides microbiome, we generated draft genome sequences for 21 Pseudomonas strains and 19 other diverse bacteria isolated from Populus deltoides roots. Genome sequences for isolates similar to Acidovorax, Bradyrhizobium, Brevibacillus, Caulobacter, Chryseobacterium, Flavobacterium, Herbaspirillum, Novosphingobium, Pantoea, Phyllobacterium, Polaromonas, Rhizobium, Sphingobium, and Variovorax were generated. PMID:23045501
Twenty-One Genome Sequences from Pseudomonas Species and 19 Genome Sequences from Diverse Bacteria Isolated from the Rhizosphere and Endosphere of Populus deltoides

DOE Office of Scientific and Technical Information (OSTI.GOV)

Brown, Steven D; Utturkar, Sagar M; Klingeman, Dawn Marie

To aid in the investigation of the Populus deltoides microbiome we generated draft genome sequences for twenty one Pseudomonas and twenty one other diverse bacteria isolated from Populus deltoides roots. Genome sequences for isolates similar to Acidovorax, Bradyrhizobium, Brevibacillus, Burkholderia, Caulobacter, Chryseobacterium, Flavobacterium, Herbaspirillum, Novosphingobium, Pantoea, Phyllobacterium, Polaromonas, Rhizobium, Sphingobium and Variovorax were generated.
Draft Genome Sequence, and a Sequence-Defined Genetic Linkage Map of the Legume Crop Species Lupinus angustifolius L

PubMed Central

Zheng, Zequn; Zhang, Qisen; Zhou, Gaofeng; Sweetingham, Mark W.; Howieson, John G.; Li, Chengdao

2013-01-01

Lupin (Lupinus angustifolius L.) is the most recently domesticated crop in major agricultural cultivation. Its seeds are high in protein and dietary fibre, but low in oil and starch. Medical and dietetic studies have shown that consuming lupin-enriched food has significant health benefits. We report the draft assembly from a whole genome shotgun sequencing dataset for this legume species with 26.9x coverage of the genome, which is predicted to contain 57,807 genes. Analysis of the annotated genes with metabolic pathways provided a partial understanding of some key features of lupin, such as the amino acid profile of storage proteins in seeds. Furthermore, we applied the NGS-based RAD-sequencing technology to obtain 8,244 sequence-defined markers for anchoring the genomic sequences. A total of 4,214 scaffolds from the genome sequence assembly were aligned into the genetic map. The combination of the draft assembly and a sequence-defined genetic map made it possible to locate and study functional genes of agronomic interest. The identification of co-segregating SNP markers, scaffold sequences and gene annotation facilitated the identification of a candidate R gene associated with resistance to the major lupin disease anthracnose. We demonstrated that the combination of medium-depth genome sequencing and a high-density genetic linkage map by application of NGS technology is a cost-effective approach to generating genome sequence data and a large number of molecular markers to study the genomics, genetics and functional genes of lupin, and to apply them to molecular plant breeding. This strategy does not require prior genome knowledge, which potentiates its application to a wide range of non-model species. PMID:23734219
Draft genome sequence, and a sequence-defined genetic linkage map of the legume crop species Lupinus angustifolius L.

PubMed

Yang, Huaan; Tao, Ye; Zheng, Zequn; Zhang, Qisen; Zhou, Gaofeng; Sweetingham, Mark W; Howieson, John G; Li, Chengdao

2013-01-01

Lupin (Lupinus angustifolius L.) is the most recently domesticated crop in major agricultural cultivation. Its seeds are high in protein and dietary fibre, but low in oil and starch. Medical and dietetic studies have shown that consuming lupin-enriched food has significant health benefits. We report the draft assembly from a whole genome shotgun sequencing dataset for this legume species with 26.9x coverage of the genome, which is predicted to contain 57,807 genes. Analysis of the annotated genes with metabolic pathways provided a partial understanding of some key features of lupin, such as the amino acid profile of storage proteins in seeds. Furthermore, we applied the NGS-based RAD-sequencing technology to obtain 8,244 sequence-defined markers for anchoring the genomic sequences. A total of 4,214 scaffolds from the genome sequence assembly were aligned into the genetic map. The combination of the draft assembly and a sequence-defined genetic map made it possible to locate and study functional genes of agronomic interest. The identification of co-segregating SNP markers, scaffold sequences and gene annotation facilitated the identification of a candidate R gene associated with resistance to the major lupin disease anthracnose. We demonstrated that the combination of medium-depth genome sequencing and a high-density genetic linkage map by application of NGS technology is a cost-effective approach to generating genome sequence data and a large number of molecular markers to study the genomics, genetics and functional genes of lupin, and to apply them to molecular plant breeding. This strategy does not require prior genome knowledge, which potentiates its application to a wide range of non-model species.
Sequence-specific epigenetic effects of the maternal somatic genome on developmental rearrangements of the zygotic genome in Paramecium primaurelia.

PubMed Central

Meyer, E; Butler, A; Dubrana, K; Duharcourt, S; Caron, F

1997-01-01

In ciliates, the germ line genome is extensively rearranged during the development of the somatic macronucleus from a mitotic product of the zygotic nucleus. Germ line chromosomes are fragmented in specific regions, and a large number of internal sequence elements are eliminated. It was previously shown that transformation of the vegetative macronucleus of Paramecium primaurelia with a plasmid containing a subtelomeric surface antigen gene can affect the processing of the homologous germ line genomic region during development of a new macronucleus in sexual progeny of transformed clones. The gene and telomere-proximal flanking sequences are deleted from the new macronuclear genome, although the germ line genome remains wild type. Here we show that plasmids containing nonoverlapping segments of the same genomic region are able to induce similar terminal deletions; the locations of deletion end points depend on the particular sequence used. Transformation of the maternal macronucleus with a sequence internal to a macronuclear chromosome also causes the occurrence of internal deletions between short direct repeats composed of alternating thymines and adenines. The epigenetic influence of maternal macronuclear sequences on developmental rearrangements of the zygotic genome thus appears to be both sequence specific and general, suggesting that this trans-nucleus effect is mediated by pairing of homologous sequences. PMID:9199294
Microsatellite analysis in the genome of Acanthaceae: An in silico approach

PubMed Central

Kaliswamy, Priyadharsini; Vellingiri, Srividhya; Nathan, Bharathi; Selvaraj, Saravanakumar

2015-01-01

Background: Acanthaceae is one of the advanced and specialized families with conventionally used medicinal plants. Simple sequence repeats (SSRs) play a major role as molecular markers for genome analysis and plant breeding. The microsatellites existing in the complete genome sequences would help to attain a direct role in the genome organization, recombination, gene regulation, quantitative genetic variation, and evolution of genes. Objective: The current study reports the frequency of microsatellites and appropriate markers for the Acanthaceae family genome sequences. Materials and Methods: The whole nucleotide sequences of Acanthaceae species were obtained from National Center for Biotechnology Information database and screened for the presence of SSRs. SSR Locator tool was used to predict the microsatellites and inbuilt Primer3 module was used for primer designing. Results: Totally 110 repeats from 108 sequences of Acanthaceae family plant genomes were identified, and the occurrence of dinucleotide repeats was found to be abundant in the genome sequences. The essential amino acid isoleucine was found rich in all the sequences. We also designed the SSR-based primers/markers for 59 sequences of this family that contains microsatellite repeats in their genome. Conclusion: The identified microsatellites and primers might be useful for breeding and genetic studies of plants that belong to Acanthaceae family in the future. PMID:25709226
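To make the SSR screening step in this record concrete, here is a minimal sketch of microsatellite detection with a regular expression. It is a stand-in for illustration only: the record states that SSR Locator was used but does not describe its algorithm, and the function name `find_ssrs` and the default thresholds are assumptions.

```python
import re


def find_ssrs(seq, motif_len=2, min_repeats=5):
    """Return (start, motif, repeat_count) for perfect tandem repeats.

    motif_len=2 targets dinucleotide repeats; min_repeats is an assumed cutoff.
    """
    # One motif of fixed length, followed by at least (min_repeats - 1) copies.
    pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (motif_len, min_repeats - 1))
    hits = []
    for m in pattern.finditer(seq.upper()):
        motif = m.group(1)
        if motif_len > 1 and len(set(motif)) == 1:
            continue  # skip homopolymer runs such as (AA)n
        hits.append((m.start(), motif, len(m.group(0)) // motif_len))
    return hits


# Toy sequence with one (AT)6 repeat starting at offset 3.
print(find_ssrs("GGC" + "AT" * 6 + "CCG"))  # [(3, 'AT', 6)]
```

Only perfect repeats are reported here; imperfect or compound microsatellites would need a more tolerant search.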
Bioinformatics and genomic analysis of transposable elements in eukaryotic genomes.

PubMed

Janicki, Mateusz; Rooke, Rebecca; Yang, Guojun

2011-08-01

A major portion of most eukaryotic genomes are transposable elements (TEs). During evolution, TEs have introduced profound changes to genome size, structure, and function. As integral parts of genomes, the dynamic presence of TEs will continue to be a major force in reshaping genomes. Early computational analyses of TEs in genome sequences focused on filtering out "junk" sequences to facilitate gene annotation. When the high abundance and diversity of TEs in eukaryotic genomes were recognized, these early efforts transformed into the systematic genome-wide categorization and classification of TEs. The availability of genomic sequence data reversed the classical genetic approaches to discovering new TE families and superfamilies. Curated TE databases and their accurate annotation of genome sequences in turn facilitated the studies on TEs in a number of frontiers including: (1) TE-mediated changes of genome size and structure, (2) the influence of TEs on genome and gene functions, (3) TE regulation by host, (4) the evolution of TEs and their population dynamics, and (5) genomic scale studies of TE activity. Bioinformatics and genomic approaches have become an integral part of large-scale studies on TEs to extract information with pure in silico analyses or to assist wet lab experimental studies. The current revolution in genome sequencing technology facilitates further progress in the existing frontiers of research and emergence of new initiatives. The rapid generation of large-sequence datasets at record low costs on a routine basis is challenging the computing industry on storage capacity and manipulation speed and the bioinformatics community for improvement in algorithms and their implementations.
Independent assessment and improvement of wheat genome sequence assemblies using Fosill jumping libraries.

PubMed

Lu, Fu-Hao; McKenzie, Neil; Kettleborough, George; Heavens, Darren; Clark, Matthew D; Bevan, Michael W

2018-05-01

The accurate sequencing and assembly of very large, often polyploid, genomes remains a challenging task, limiting long-range sequence information and phased sequence variation for applications such as plant breeding. The 15-Gb hexaploid bread wheat (Triticum aestivum) genome has been particularly challenging to sequence, and several different approaches have recently generated long-range assemblies. Mapping and understanding the types of assembly errors are important for optimising future sequencing and assembly approaches and for comparative genomics. Here we use a Fosill 38-kb jumping library to assess medium and longer-range order of different publicly available wheat genome assemblies. Modifications to the Fosill protocol generated longer Illumina sequences and enabled comprehensive genome coverage. Analyses of two independent Bacterial Artificial Chromosome (BAC)-based chromosome-scale assemblies, two independent Illumina whole genome shotgun assemblies, and a hybrid Single Molecule Real Time (SMRT-PacBio) and short read (Illumina) assembly were carried out. We revealed a surprising scale and variety of discrepancies using Fosill mate-pair mapping and validated several of each class. In addition, Fosill mate-pairs were used to scaffold a whole genome Illumina assembly, leading to a 3-fold increase in N50 values. Our analyses, using an independent means to validate different wheat genome assemblies, show that whole genome shotgun assemblies based solely on Illumina sequences are significantly more accurate by all measures compared to BAC-based chromosome-scale assemblies and hybrid SMRT-Illumina approaches. Although current whole genome assemblies are reasonably accurate and useful, additional improvements will be needed to generate complete assemblies of wheat genomes using open-source, computationally efficient, and cost-effective methods.
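Since this record reports assembly contiguity as N50, a short sketch of the usual definition may help: N50 is the length of the shortest sequence in the smallest set of longest sequences that together cover at least half of the total assembly length. The function name `n50` and the toy lengths are illustrative and do not come from the wheat assemblies.

```python
def n50(lengths):
    """Return the N50 of a list of contig or scaffold lengths."""
    if not lengths:
        raise ValueError("at least one length is required")
    total = sum(lengths)
    running = 0
    # Walk from longest to shortest until half of the assembly is covered.
    for length in sorted(lengths, reverse=True):
        running += length
        if running * 2 >= total:
            return length


print(n50([2, 3, 4, 5, 6, 7, 8, 9, 10]))  # 8

# Joining sequences by scaffolding raises N50 without adding bases.
print(n50([5, 5, 5, 5]), n50([10, 10]))  # 5 10
```

The second example shows why N50 alone says nothing about correctness: it rewards joins whether or not they are right, which is why independent validation such as mate-pair mapping matters.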
Nanopore DNA Sequencing and Genome Assembly on the International Space Station.

PubMed

Castro-Wallace, Sarah L; Chiu, Charles Y; John, Kristen K; Stahl, Sarah E; Rubins, Kathleen H; McIntyre, Alexa B R; Dworkin, Jason P; Lupisella, Mark L; Smith, David J; Botkin, Douglas J; Stephenson, Timothy A; Juul, Sissel; Turner, Daniel J; Izquierdo, Fernando; Federman, Scot; Stryke, Doug; Somasekar, Sneha; Alexander, Noah; Yu, Guixia; Mason, Christopher E; Burton, Aaron S

2017-12-21

We evaluated the performance of the MinION DNA sequencer in-flight on the International Space Station (ISS), and benchmarked its performance off-Earth against the MinION, Illumina MiSeq, and PacBio RS II sequencing platforms in terrestrial laboratories. Samples contained equimolar mixtures of genomic DNA from lambda bacteriophage, Escherichia coli (strain K12, MG1655) and Mus musculus (female BALB/c mouse). Nine sequencing runs were performed aboard the ISS over a 6-month period, yielding a total of 276,882 reads with no apparent decrease in performance over time. From sequence data collected aboard the ISS, we constructed directed assemblies of the ~4.6 Mb E. coli genome, ~48.5 kb lambda genome, and a representative M. musculus sequence (the ~16.3 kb mitochondrial genome), at 100%, 100%, and 96.7% consensus pairwise identity, respectively; de novo assembly of the E. coli genome from raw reads yielded a single contig comprising 99.9% of the genome at 98.6% consensus pairwise identity. Simulated real-time analyses of in-flight sequence data using an automated bioinformatic pipeline and laptop-based genomic assembly demonstrated the feasibility of sequencing analysis and microbial identification aboard the ISS. These findings illustrate the potential for sequencing applications including disease diagnosis, environmental monitoring, and elucidating the molecular basis for how organisms respond to spaceflight.
Ultraaccurate genome sequencing and haplotyping of single human cells.

PubMed

Chu, Wai Keung; Edge, Peter; Lee, Ho Suk; Bansal, Vikas; Bafna, Vineet; Huang, Xiaohua; Zhang, Kun

2017-11-21

Accurate detection of variants and long-range haplotypes in genomes of single human cells remains very challenging. Common approaches require extensive in vitro amplification of genomes of individual cells using DNA polymerases and high-throughput short-read DNA sequencing. These approaches have two notable drawbacks. First, polymerase replication errors could generate tens of thousands of false-positive calls per genome. Second, relatively short sequence reads contain little to no haplotype information. Here we report a method, which is dubbed SISSOR (single-stranded sequencing using microfluidic reactors), for accurate single-cell genome sequencing and haplotyping. A microfluidic processor is used to separate the Watson and Crick strands of the double-stranded chromosomal DNA in a single cell and to randomly partition megabase-size DNA strands into multiple nanoliter compartments for amplification and construction of barcoded libraries for sequencing. The separation and partitioning of large single-stranded DNA fragments of the homologous chromosome pairs allows for the independent sequencing of each of the complementary and homologous strands. This enables the assembly of long haplotypes and reduction of sequence errors by using the redundant sequence information and haplotype-based error removal. We demonstrated the ability to sequence single-cell genomes with error rates as low as 10⁻⁸ and average 500-kb-long DNA fragments that can be assembled into haplotype contigs with N50 greater than 7 Mb. The performance could be further improved with more uniform amplification and more accurate sequence alignment. The ability to obtain accurate genome sequences and haplotype information from single cells will enable applications of genome sequencing for diverse clinical needs. Copyright © 2017 the Author(s). Published by PNAS.
Full Genome Sequence of Egg Drop Syndrome Virus Strain FJ12025 Isolated from Muscovy Duckling.

PubMed

Fu, Guanghua; Chen, Hongmei; Huang, Yu; Cheng, Longfei; Fu, Qiuling; Shi, Shaohua; Wan, Chunhe; Chen, Cuiteng; Lin, Jiansheng

2013-08-22

Egg drop syndrome virus (EDSV) strain FJ12025 was isolated from a 9-day-old Muscovy duckling. The results of the sequence showed that the genome of strain FJ12025 is 33,213 bp in length, with a G+C content of 43.03%. When comparing the genome sequence of strain FJ12025 to that of laying duck original strain AV-127, we found 50 single-nucleotide polymorphisms (SNPs) between the two viral genome sequences. A genomic sequence comparison of FJ12025 and AV-127 will help to understand the phenotypic differences between the two viruses.
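The two summary statistics in this record, G+C content and SNP count between two genomes, can be sketched as follows. The sketch assumes the two sequences have already been aligned to equal length, with `-` marking gaps. The function names and toy sequences are hypothetical and are not the FJ12025 or AV-127 data.

```python
def gc_content(seq):
    """Percentage of G and C among unambiguous bases (A, C, G, T)."""
    seq = seq.upper()
    acgt = sum(seq.count(base) for base in "ACGT")
    if acgt == 0:
        raise ValueError("sequence contains no unambiguous bases")
    return 100.0 * (seq.count("G") + seq.count("C")) / acgt


def count_snps(aligned_a, aligned_b):
    """Count single-base mismatches between two pre-aligned sequences.

    Columns containing a gap or a non-ACGT symbol are skipped, so indels
    are not counted as SNPs.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    snps = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a in "ACGT" and b in "ACGT" and a != b:
            snps += 1
    return snps


print(gc_content("GGCCAATT"))            # 50.0
print(count_snps("ACGT-A", "ACCTTA"))    # 1
```

Applied to a full-length alignment of two viral genomes, `count_snps` would yield the kind of figure quoted in the record, provided the alignment itself is reliable.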
An Exploration into Fern Genome Space.

PubMed

Wolf, Paul G; Sessa, Emily B; Marchant, Daniel Blaine; Li, Fay-Wei; Rothfels, Carl J; Sigel, Erin M; Gitzendanner, Matthew A; Visger, Clayton J; Banks, Jo Ann; Soltis, Douglas E; Soltis, Pamela S; Pryer, Kathleen M; Der, Joshua P

2015-08-26

Ferns are one of the few remaining major clades of land plants for which a complete genome sequence is lacking. Knowledge of genome space in ferns will enable broad-scale comparative analyses of land plant genes and genomes, provide insights into genome evolution across green plants, and shed light on genetic and genomic features that characterize ferns, such as their high chromosome numbers and large genome sizes. As part of an initial exploration into fern genome space, we used a whole genome shotgun sequencing approach to obtain low-density coverage (∼0.4X to 2X) for six fern species from the Polypodiales (Ceratopteris, Pteridium, Polypodium, Cystopteris), Cyatheales (Plagiogyria), and Gleicheniales (Dipteris). We explore these data to characterize the proportion of the nuclear genome represented by repetitive sequences (including DNA transposons, retrotransposons, ribosomal DNA, and simple repeats) and protein-coding genes, and to extract chloroplast and mitochondrial genome sequences. Such initial sweeps of fern genomes can provide information useful for selecting a promising candidate fern species for whole genome sequencing. We also describe variation of genomic traits across our sample and highlight some differences and similarities in repeat structure between ferns and seed plants. © The Author(s) 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
The contribution of the DNA microarray technology to gene expression profiling in Leishmania spp.: a retrospective.

PubMed

Alonso, Ana; Larraga, Vicente; Alcolea, Pedro J

2018-05-07

The first genome project of any living organism excluding viruses, that of the gammaproteobacterium Haemophilus influenzae, was completed in 1995. Until the last decade, genome sequencing was very tedious because genome survey sequences (GSS) and/or expressed sequence tags (ESTs) belonging to plasmid, cosmid and artificial chromosome genome libraries had to be sequenced and assembled in silico. Even today, no genome is truly completely assembled, because gaps and unassembled contigs always remain. However, most assemblies represent the whole genome of the organism of origin from a practical point of view. The first genome sequencing projects of trypanosomatid parasites, those of Leishmania major, Trypanosoma cruzi and T. brucei, were completed in 2005 following those strategies. The functional genomics era rapidly developed on the basis of the microarray technology and has been evolving since. In the case of the genus Leishmania, substantial biological information about differentiation in the digenetic life cycle of the parasite has been obtained. Later on, next generation sequencing revolutionized genome sequencing and functional genomics, leading to more sensitive, accurate results while using far fewer resources. This new technology is more advantageous, but does not invalidate microarray results. In fact, promising vaccine candidates and drug targets have been found on the basis of microarray-based screening and preliminary proof-of-concept tests. Copyright © 2018. Published by Elsevier B.V.
Complete genome sequence of the plant pathogen Erwinia amylovora strain ATCC 49946

USDA-ARS's Scientific Manuscript database

Erwinia amylovora causes the economically important disease fire blight that affects rosaceous plants, especially pear and apple. Here we report the complete genome sequence and annotation of strain ATCC 49946. The analysis of the sequence and its comparison with sequenced genomes of closely related...

Discovery of common sequences absent in the human reference genome using pooled samples from next generation sequencing.

PubMed

Liu, Yu; Koyutürk, Mehmet; Maxwell, Sean; Xiang, Min; Veigl, Martina; Cooper, Richard S; Tayo, Bamidele O; Li, Li; LaFramboise, Thomas; Wang, Zhenghe; Zhu, Xiaofeng; Chance, Mark R

2014-08-16

Sequences up to several megabases in length have been found to be present in individual genomes but absent in the human reference genome. These sequences may be common in populations, and their absence in the reference genome may indicate rare variants in the genomes of individuals who served as donors for the human genome project. As the reference genome is used in probe design for microarray technology and mapping short reads in next generation sequencing (NGS), this missing sequence could be a source of bias in functional genomic studies and variant analysis. One End Anchor (OEA) and/or orphan reads from paired-end sequencing have been used to identify novel sequences that are absent in reference genome. However, there is no study to investigate the distribution, evolution and functionality of those sequences in human populations. To systematically identify and study the missing common sequences (micSeqs), we extended the previous method by pooling OEA reads from large number of individuals and applying strict filtering methods to remove false sequences. The pipeline was applied to data from phase 1 of the 1000 Genomes Project. We identified 309 micSeqs that are present in at least 1% of the human population, but absent in the reference genome. We confirmed 76% of these 309 micSeqs by comparison to other primate genomes, individual human genomes, and gene expression data. Furthermore, we randomly selected fifteen micSeqs and confirmed their presence using PCR validation in 38 additional individuals. Functional analysis using published RNA-seq and ChIP-seq data showed that eleven micSeqs are highly expressed in human brain and three micSeqs contain transcription factor (TF) binding regions, suggesting they are functional elements. 
In addition, the identified micSeqs are absent in non-primates and show dynamic acquisition during primate evolution culminating with most micSeqs being present in Africans, suggesting some micSeqs may be important sources of human diversity. 76% of micSeqs were confirmed by a comparative genomics approach. Fourteen micSeqs are expressed in human brain or contain TF binding regions. Some micSeqs are primate-specific, conserved and may play a role in the evolution of primates.
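The pooling step described in this abstract can be illustrated with a minimal sketch. It assumes reads are available as (name, SAM flag, sequence) tuples and uses the standard SAM flag bits for "read unmapped" (0x4) and "mate unmapped" (0x8). The function names, the category labels, and the choice to pool only the unmapped end of each OEA pair are assumptions made for illustration; the abstract does not describe the authors' pipeline at this level of detail.

```python
# SAM flag bits as defined in the SAM format specification.
PAIRED = 0x1
UNMAPPED = 0x4
MATE_UNMAPPED = 0x8


def classify_read(flag: int) -> str:
    """Classify a read by the mapping status of itself and its mate."""
    if not flag & PAIRED:
        return "unpaired"
    read_unmapped = bool(flag & UNMAPPED)
    mate_unmapped = bool(flag & MATE_UNMAPPED)
    if read_unmapped and mate_unmapped:
        return "orphan"            # neither end maps to the reference
    if read_unmapped:
        return "oea_unmapped_end"  # this end is missing from the reference
    if mate_unmapped:
        return "oea_anchor"        # this end anchors the pair on the reference
    return "both_mapped"


def pool_oea_reads(samples):
    """Pool unmapped ends of OEA pairs across individuals.

    samples: dict mapping sample name -> iterable of (read_name, flag, sequence).
    Returns a list of (sample, read_name, sequence) tuples.
    """
    pooled = []
    for sample, reads in samples.items():
        for name, flag, seq in reads:
            if classify_read(flag) == "oea_unmapped_end":
                pooled.append((sample, name, seq))
    return pooled
```

The pooled reads would then go to assembly and filtering, which this sketch does not attempt.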
Complete genome sequence and phylogenetic analyses of an aquabirnavirus isolated from a diseased marbled eel culture in Taiwan.

PubMed

Wen, Chiu-Ming

2017-08-01

An aquabirnavirus was isolated from diseased marbled eels (Anguilla marmorata; MEIPNV1310) with gill haemorrhages and associated mortality. Its genome segment sequences were obtained through next-generation sequencing and compared with published aquabirnavirus sequences. The results indicated that the genome sequence of MEIPNV1310 contains segment A (3099 nucleotides) and segment B (2789 nucleotides). Phylogenetic analysis showed that MEIPNV1310 is closely related to the infectious pancreatic necrosis Ab strain within genogroup II. This genome sequence is beneficial for studying the geographic distribution and evolution of aquabirnaviruses.
IMGD: an integrated platform supporting comparative genomics and phylogenetics of insect mitochondrial genomes

PubMed Central

Lee, Wonhoon; Park, Jongsun; Choi, Jaeyoung; Jung, Kyongyong; Park, Bongsoo; Kim, Donghan; Lee, Jaeyoung; Ahn, Kyohun; Song, Wonho; Kang, Seogchan; Lee, Yong-Hwan; Lee, Seunghwan

2009-01-01

Background Sequences and organization of the mitochondrial genome have been used as markers to investigate evolutionary history and relationships in many taxonomic groups. The rapidly increasing mitochondrial genome sequences from diverse insects provide ample opportunities to explore various global evolutionary questions in the superclass Hexapoda. To adequately support such questions, it is imperative to establish an informatics platform that facilitates the retrieval and utilization of available mitochondrial genome sequence data. Results The Insect Mitochondrial Genome Database (IMGD) is a new integrated platform that archives the mitochondrial genome sequences from 25,747 hexapod species, including 112 completely sequenced and 20 nearly completed genomes and 113,985 partially sequenced mitochondrial genomes. The Species-driven User Interface (SUI) of IMGD supports data retrieval and diverse analyses at multi-taxon levels. The Phyloviewer implemented in IMGD provides three methods for drawing phylogenetic trees and displays the resulting trees on the web. The SNP database incorporated to IMGD presents the distribution of SNPs and INDELs in the mitochondrial genomes of multiple isolates within eight species. A newly developed comparative SNU Genome Browser supports the graphical presentation and interactive interface for the identified SNPs/INDELs. Conclusion The IMGD provides a solid foundation for the comparative mitochondrial genomics and phylogenetics of insects. All data and functions described here are available at the web site . PMID:19351385
TARGETED CAPTURE IN EVOLUTIONARY AND ECOLOGICAL GENOMICS

PubMed Central

Jones, Matthew R.; Good, Jeffrey M.

2016-01-01

The rapid expansion of next-generation sequencing has yielded a powerful array of tools to address fundamental biological questions at a scale that was inconceivable just a few years ago. Various genome partitioning strategies to sequence select subsets of the genome have emerged as powerful alternatives to whole genome sequencing in ecological and evolutionary genomic studies. High throughput targeted capture is one such strategy that involves the parallel enrichment of pre-selected genomic regions of interest. The growing use of targeted capture demonstrates its potential power to address a range of research questions, yet these approaches have yet to expand broadly across labs focused on evolutionary and ecological genomics. In part, the use of targeted capture has been hindered by the logistics of capture design and implementation in species without established reference genomes. Here we aim to 1) increase the accessibility of targeted capture to researchers working in non-model taxa by discussing capture methods that circumvent the need of a reference genome, 2) highlight the evolutionary and ecological applications where this approach is emerging as a powerful sequencing strategy, and 3) discuss the future of targeted capture and other genome partitioning approaches in light of the increasing accessibility of whole genome sequencing. Given the practical advantages and increasing feasibility of high-throughput targeted capture, we anticipate an ongoing expansion of capture-based approaches in evolutionary and ecological research, synergistic with an expansion of whole genome sequencing. PMID:26137993
JANE: efficient mapping of prokaryotic ESTs and variable length sequence reads on related template genomes

PubMed Central

2009-01-01

Background ESTs or variable sequence reads can be available in prokaryotic studies well before a complete genome is known. Use cases include (i) transcriptome studies or (ii) single cell sequencing of bacteria. Without suitable software their further analysis and mapping would have to await finalization of the corresponding genome. Results The tool JANE rapidly maps ESTs or variable sequence reads in prokaryotic sequencing and transcriptome efforts to related template genomes. It provides an easy-to-use graphics interface for information retrieval and a toolkit for EST or nucleotide sequence function prediction. Furthermore, for rapid mapping we developed an enhanced sequence alignment algorithm which reassembles and evaluates high scoring pairs provided from the BLAST algorithm. Rapid assembly on and replacement of the template genome by sequence reads or mapped ESTs is achieved. This is illustrated (i) by data from Staphylococci as well as from a Blattabacteria sequencing effort, and (ii) by mapping single cell sequencing reads from poribacteria to the sister phylum representative Rhodopirellula baltica SH1. The algorithm has been implemented in a web-server accessible at http://jane.bioapps.biozentrum.uni-wuerzburg.de. Conclusion Rapid prokaryotic EST mapping or mapping of sequence reads is achieved applying JANE even without knowing the cognate genome sequence. PMID:19943962
Genomic Encyclopedia of Fungi

DOE Office of Scientific and Technical Information (OSTI.GOV)

Grigoriev, Igor

Genomes of fungi relevant to energy and environment are the focus of the Fungal Genomic Program at the US Department of Energy Joint Genome Institute (JGI). Its key project, the Genomics Encyclopedia of Fungi, targets fungi related to plant health (symbionts, pathogens, and biocontrol agents) and biorefinery processes (cellulose degradation, sugar fermentation, industrial hosts), and explores fungal diversity by means of genome sequencing and analysis. Over 150 fungal genomes have been sequenced by JGI to date and released through MycoCosm (www.jgi.doe.gov/fungi), a fungal web-portal, which integrates sequence and functional data with genome analysis tools for the user community. Sequence analysis supported by functional genomics leads to developing parts lists for complex systems ranging from ecosystems of biofuel crops to biorefineries. Recent examples of such parts suggested by comparative genomics and functional analysis in these areas are presented here.
Analysis of Illumina Microbial Assemblies

DOE Office of Scientific and Technical Information (OSTI.GOV)

Clum, Alicia; Foster, Brian; Froula, Jeff

2010-05-28

Since the emergence of second generation sequencing technologies, the evaluation of different sequencing approaches and their assembly strategies for different types of genomes has become an important undertaking. Next generation sequencing technologies dramatically increase sequence throughput while decreasing cost, making them an attractive tool for whole genome shotgun sequencing. To compare different approaches for de-novo whole genome assembly, appropriate tools and a solid understanding of both quantity and quality of the underlying sequence data are crucial. Here, we performed an in-depth analysis of short-read Illumina sequence assembly strategies for bacterial and archaeal genomes. Different types of Illumina libraries as well as different trim parameters and assemblers were evaluated. Results of the comparative analysis and sequencing platforms will be presented. The goal of this analysis is to develop a cost-effective approach for the increased throughput of the generation of high quality microbial genomes.
Construction of a map-based reference genome sequence for barley, Hordeum vulgare L.

PubMed Central

Beier, Sebastian; Himmelbach, Axel; Colmsee, Christian; Zhang, Xiao-Qi; Barrero, Roberto A.; Zhang, Qisen; Li, Lin; Bayer, Micha; Bolser, Daniel; Taudien, Stefan; Groth, Marco; Felder, Marius; Hastie, Alex; Šimková, Hana; Staňková, Helena; Vrána, Jan; Chan, Saki; Muñoz-Amatriaín, María; Ounit, Rachid; Wanamaker, Steve; Schmutzer, Thomas; Aliyeva-Schnorr, Lala; Grasso, Stefano; Tanskanen, Jaakko; Sampath, Dharanya; Heavens, Darren; Cao, Sujie; Chapman, Brett; Dai, Fei; Han, Yong; Li, Hua; Li, Xuan; Lin, Chongyun; McCooke, John K.; Tan, Cong; Wang, Songbo; Yin, Shuya; Zhou, Gaofeng; Poland, Jesse A.; Bellgard, Matthew I.; Houben, Andreas; Doležel, Jaroslav; Ayling, Sarah; Lonardi, Stefano; Langridge, Peter; Muehlbauer, Gary J.; Kersey, Paul; Clark, Matthew D.; Caccamo, Mario; Schulman, Alan H.; Platzer, Matthias; Close, Timothy J.; Hansson, Mats; Zhang, Guoping; Braumann, Ilka; Li, Chengdao; Waugh, Robbie; Scholz, Uwe; Stein, Nils; Mascher, Martin

2017-01-01

Barley (Hordeum vulgare L.) is a cereal grass mainly used as animal fodder and raw material for the malting industry. The map-based reference genome sequence of barley cv. ‘Morex’ was constructed by the International Barley Genome Sequencing Consortium (IBSC) using hierarchical shotgun sequencing. Here, we report the experimental and computational procedures to (i) sequence and assemble more than 80,000 bacterial artificial chromosome (BAC) clones along the minimum tiling path of a genome-wide physical map, (ii) find and validate overlaps between adjacent BACs, (iii) construct 4,265 non-redundant sequence scaffolds representing clusters of overlapping BACs, and (iv) order and orient these BAC clusters along the seven barley chromosomes using positional information provided by dense genetic maps, an optical map and chromosome conformation capture sequencing (Hi-C). Integrative access to these sequence and mapping resources is provided by the barley genome explorer (BARLEX). PMID:28448065
Elucidating the triplicated ancestral genome structure of radish based on chromosome-level comparison with the Brassica genomes.

PubMed

Jeong, Young-Min; Kim, Namshin; Ahn, Byung Ohg; Oh, Mijin; Chung, Won-Hyong; Chung, Hee; Jeong, Seongmun; Lim, Ki-Byung; Hwang, Yoon-Jung; Kim, Goon-Bo; Baek, Seunghoon; Choi, Sang-Bong; Hyung, Dae-Jin; Lee, Seung-Won; Sohn, Seong-Han; Kwon, Soo-Jin; Jin, Mina; Seol, Young-Joo; Chae, Won Byoung; Choi, Keun Jin; Park, Beom-Seok; Yu, Hee-Ju; Mun, Jeong-Hwan

2016-07-01

This study presents a chromosome-scale draft genome sequence of radish that is assembled into nine chromosomal pseudomolecules. A comprehensive comparative genome analysis with the Brassica genomes provides genomic evidence on the evolution of the mesohexaploid radish genome. Radish (Raphanus sativus L.) is an agronomically important root vegetable crop and its origin and phylogenetic position in the tribe Brassiceae is controversial. Here we present a comprehensive analysis of the radish genome based on the chromosome sequences of R. sativus cv. WK10039. The radish genome was sequenced and assembled into 426.2 Mb spanning >98 % of the gene space, of which 344.0 Mb were integrated into nine chromosome pseudomolecules. Approximately 36 % of the genome was repetitive sequences and 46,514 protein-coding genes were predicted and annotated. Comparative mapping of the tPCK-like ancestral genome revealed that the radish genome has intermediate characteristics between the Brassica A/C and B genomes in the triplicated segments, suggesting an internal origin from the genus Brassica. The evolutionary characteristics shared between radish and other Brassica species provided genomic evidence that the current form of nine chromosomes in radish was rearranged from the chromosomes of a hexaploid progenitor. Overall, this study provides a chromosome-scale draft genome sequence of radish as well as novel insight into evolution of the mesohexaploid genomes in the tribe Brassiceae.
Reduced representation approaches to interrogate genome diversity in large repetitive plant genomes.

PubMed

Hirsch, Cory D; Evans, Joseph; Buell, C Robin; Hirsch, Candice N

2014-07-01

Technology and software improvements in the last decade now provide methodologies to access the genome sequence of not only a single accession, but also multiple accessions of plant species. This provides a means to interrogate species diversity at the genome level. Ample diversity among accessions in a collection of species can be found, including single-nucleotide polymorphisms, insertions and deletions, copy number variation and presence/absence variation. For species with small, non-repetitive rich genomes, re-sequencing of query accessions is robust, highly informative, and economically feasible. However, for species with moderate to large sized repetitive-rich genomes, technical and economic barriers prevent en masse genome re-sequencing of accessions. Multiple approaches to access a focused subset of loci in species with larger genomes have been developed, including reduced representation sequencing, exome capture and transcriptome sequencing. Collectively, these approaches have enabled interrogation of diversity on a genome scale for large plant genomes, including crop species important to worldwide food security. © The Author 2014. Published by Oxford University Press. All rights reserved. For permissions, please email: journals.permissions@oup.com.
Fungal Genomics Program

DOE Office of Scientific and Technical Information (OSTI.GOV)

Grigoriev, Igor

The JGI Fungal Genomics Program aims to scale up sequencing and analysis of fungal genomes to explore the diversity of fungi important for energy and the environment, and to promote functional studies on a system level. Combining new sequencing technologies and comparative genomics tools, JGI is now leading the world in fungal genome sequencing and analysis. Over 120 sequenced fungal genomes with analytical tools are available via MycoCosm (www.jgi.doe.gov/fungi), a web-portal for fungal biologists. Our model of interacting with user communities, unique among other sequencing centers, helps organize these communities, improves genome annotation and analysis work, and facilitates new larger-scale genomic projects. This resulted in 20 high-profile papers published in 2011 alone and contributing to the Genomics Encyclopedia of Fungi, which targets fungi related to plant health (symbionts, pathogens, and biocontrol agents) and biorefinery processes (cellulose degradation, sugar fermentation, industrial hosts). Our next grand challenges include larger scale exploration of fungal diversity (1000 fungal genomes), developing molecular tools for DOE-relevant model organisms, and analysis of complex systems and metagenomes.
Microbial genomic taxonomy

PubMed Central

2013-01-01

A need for a genomic species definition is emerging from several independent studies worldwide. In this commentary paper, we discuss recent studies on the genomic taxonomy of diverse microbial groups and a unified species definition based on genomics. Accordingly, strains from the same microbial species share >95% Average Amino Acid Identity (AAI) and Average Nucleotide Identity (ANI), >95% identity based on multiple alignment genes, <10 in Karlin genomic signature, and > 70% in silico Genome-to-Genome Hybridization similarity (GGDH). Species of the same genus will form monophyletic groups on the basis of 16S rRNA gene sequences, Multilocus Sequence Analysis (MLSA) and supertree analysis. In addition to the established requirements for species descriptions, we propose that new taxa descriptions should also include at least a draft genome sequence of the type strain in order to obtain a clear outlook on the genomic landscape of the novel microbe. The application of the new genomic species definition put forward here will allow researchers to use genome sequences to define simultaneously coherent phenotypic and genomic groups. PMID:24365132
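The thresholds listed in this abstract can be written as a single pairwise check. The sketch below is illustrative only: the class and field names are hypothetical, the cutoffs are copied from the abstract, and it assumes each metric has already been computed by an external tool and is expressed on the scale used in the abstract (percent for AAI, ANI, gene identity and GGDH; the raw value for the Karlin signature).

```python
from dataclasses import dataclass


@dataclass
class PairwiseGenomeMetrics:
    """Pre-computed similarity metrics for one pair of genomes (hypothetical container)."""
    aai: float               # Average Amino Acid Identity, percent
    ani: float               # Average Nucleotide Identity, percent
    gene_identity: float     # identity over multiple-alignment genes, percent
    karlin_signature: float  # Karlin genomic signature dissimilarity
    ggdh: float              # in silico genome-to-genome hybridization, percent


def same_species(m: PairwiseGenomeMetrics) -> bool:
    """Apply the cutoffs quoted in the abstract; all must hold."""
    return (
        m.aai > 95
        and m.ani > 95
        and m.gene_identity > 95
        and m.karlin_signature < 10
        and m.ggdh > 70
    )
```

Requiring every criterion to pass is one reading of the proposal; a curated taxonomy would likely weigh borderline values case by case.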
Massively parallel whole genome amplification for single-cell sequencing using droplet microfluidics.

PubMed

Hosokawa, Masahito; Nishikawa, Yohei; Kogawa, Masato; Takeyama, Haruko

2017-07-12

Massively parallel single-cell genome sequencing is required to further understand genetic diversities in complex biological systems. Whole genome amplification (WGA) is the first step for single-cell sequencing, but its throughput and accuracy are insufficient in conventional reaction platforms. Here, we introduce single droplet multiple displacement amplification (sd-MDA), a method that enables massively parallel amplification of single cell genomes while maintaining sequence accuracy and specificity. Tens of thousands of single cells are compartmentalized in millions of picoliter droplets and then subjected to lysis and WGA by passive droplet fusion in microfluidic channels. Because single cells are isolated in compartments, their genomes are amplified to saturation without contamination. This enables the high-throughput acquisition of contamination-free and cell specific sequence reads from single cells (21,000 single-cells/h), resulting in enhancement of the sequence data quality compared to conventional methods. This method allowed WGA of both single bacterial cells and human cancer cells. The obtained sequencing coverage rivals those of conventional techniques with superior sequence quality. In addition, we also demonstrate de novo assembly of uncultured soil bacteria and obtain draft genomes from single cell sequencing. This sd-MDA is promising for flexible and scalable use in single-cell sequencing.
Complete Genome Sequence of Porcine Parvovirus 2 Recovered from Swine Sera

PubMed Central

Kluge, M.; Franco, A. C.; Giongo, A.; Valdez, F. P.; Saddi, T. M.; Brito, W. M. E. D.; Roehe, P. M.

2016-01-01

A complete genomic sequence of porcine parvovirus 2 (PPV-2) was detected by viral metagenome analysis on swine sera. A phylogenetic analysis of this genome reveals that it is highly similar to previously reported North American PPV-2 genomes. The complete PPV-2 sequence is 5,426 nucleotides long. PMID:26823583
Deep whole-genome sequencing of 90 Han Chinese genomes.

PubMed

Lan, Tianming; Lin, Haoxiang; Zhu, Wenjuan; Laurent, Tellier Christian Asker Melchior; Yang, Mengcheng; Liu, Xin; Wang, Jun; Wang, Jian; Yang, Huanming; Xu, Xun; Guo, Xiaosen

2017-09-01

Next-generation sequencing provides a high-resolution insight into human genetic information. However, the focus of previous studies has primarily been on low-coverage data due to the high cost of sequencing. Although the 1000 Genomes Project and the Haplotype Reference Consortium have both provided powerful reference panels for imputation, low-frequency and novel variants remain difficult to discover and call with accuracy on the basis of low-coverage data. Deep sequencing provides an optimal solution for the problem of these low-frequency and novel variants. Although whole-exome sequencing is also a viable choice for exome regions, it cannot account for noncoding regions, sometimes resulting in the absence of important, causal variants. For Han Chinese populations, the majority of variants have been discovered based upon low-coverage data from the 1000 Genomes Project. However, high-coverage, whole-genome sequencing data are limited for any population, and a large amount of low-frequency, population-specific variants remain uncharacterized. We have performed whole-genome sequencing at a high depth (∼×80) of 90 unrelated individuals of Chinese ancestry, collected from the 1000 Genomes Project samples, including 45 Northern Han Chinese and 45 Southern Han Chinese samples. Eighty-three of these 90 have been sequenced by the 1000 Genomes Project. We have identified 12 568 804 single nucleotide polymorphisms, 2 074 210 short InDels, and 26 142 structural variations from these 90 samples. Compared to the Han Chinese data from the 1000 Genomes Project, we have found 7 000 629 novel variants with low frequency (defined as minor allele frequency < 5%), including 5 813 503 single nucleotide polymorphisms, 1 169 199 InDels, and 17 927 structural variants. Using deep sequencing data, we have built a greatly expanded spectrum of genetic variation for the Han Chinese genome. 
Compared to the 1000 Genomes Project, these Han Chinese deep sequencing data enhance the characterization of a large number of low-frequency, novel variants. This will be a valuable resource for promoting Chinese genetics research and medical development. Additionally, it will provide a valuable supplement to the 1000 Genomes Project, as well as to other human genome projects. © The Authors 2017. Published by Oxford University Press.
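The abstract defines low-frequency variants as those with minor allele frequency below 5%. A minimal sketch of that classification follows, assuming diploid samples (so 90 individuals contribute 180 alleles) and biallelic sites. The function names are hypothetical, and excluding monomorphic sites from the low-frequency class is an assumption of this sketch rather than something stated in the abstract.

```python
def minor_allele_frequency(alt_count: int, total_alleles: int) -> float:
    """Return the frequency of the rarer allele at a biallelic site."""
    if total_alleles <= 0:
        raise ValueError("total_alleles must be positive")
    if not 0 <= alt_count <= total_alleles:
        raise ValueError("alt_count must lie between 0 and total_alleles")
    alt_freq = alt_count / total_alleles
    return min(alt_freq, 1.0 - alt_freq)


def is_low_frequency(alt_count: int, total_alleles: int, threshold: float = 0.05) -> bool:
    """True if the site is polymorphic and its MAF is below the threshold."""
    maf = minor_allele_frequency(alt_count, total_alleles)
    return 0.0 < maf < threshold


# Example: 8 alternate alleles among 90 diploid individuals (180 alleles).
print(is_low_frequency(8, 180))
```

With only 180 alleles, the smallest non-zero frequency that can be observed is 1/180, which is one reason deep sequencing of each individual matters for calling such variants accurately.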
ProDeGe: A computational protocol for fully automated decontamination of genomes

DOE PAGES

Tennessen, Kristin; Andersen, Evan; Clingenpeel, Scott; ...

2015-06-09

Single amplified genomes and genomes assembled from metagenomes have enabled the exploration of uncultured microorganisms at an unprecedented scale. However, both these types of products are plagued by contamination. Since these genomes are now being generated in a high-throughput manner and sequences from them are propagating into public databases to drive novel scientific discoveries, rigorous quality controls and decontamination protocols are urgently needed. Here, we present ProDeGe (Protocol for fully automated Decontamination of Genomes), the first computational protocol for fully automated decontamination of draft genomes. ProDeGe classifies sequences into two classes (clean and contaminant) using a combination of homology and feature-based methodologies. On average, 84% of sequence from the non-target organism is removed from the data set (specificity) and 84% of the sequence from the target organism is retained (sensitivity). Lastly, the procedure operates successfully at a rate of ~0.30 CPU core hours per megabase of sequence and can be applied to any type of genome sequence.
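The two performance figures quoted above follow directly from base-pair counts once the true origin of each sequence is known, as in a benchmark with simulated contamination. The sketch below uses the definitions given in the abstract (specificity as the fraction of non-target sequence removed, sensitivity as the fraction of target sequence retained). The function and parameter names are hypothetical and this is not ProDeGe's own code.

```python
def decontamination_metrics(
    target_kept_bp: int,
    target_removed_bp: int,
    contaminant_kept_bp: int,
    contaminant_removed_bp: int,
) -> dict:
    """Compute sensitivity and specificity of a decontamination run from bp counts."""
    target_total = target_kept_bp + target_removed_bp
    contaminant_total = contaminant_kept_bp + contaminant_removed_bp
    if target_total == 0 or contaminant_total == 0:
        raise ValueError("both target and contaminant sequence must be present")
    return {
        # fraction of target-organism sequence retained in the clean set
        "sensitivity": target_kept_bp / target_total,
        # fraction of non-target sequence removed from the data set
        "specificity": contaminant_removed_bp / contaminant_total,
    }
```

For example, a run that keeps 840 kb of a 1 Mb target and removes 84 kb of 100 kb of contaminant would score 0.84 on both measures.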
Harnessing Whole Genome Sequencing in Medical Mycology.

PubMed

Cuomo, Christina A

2017-01-01

Comparative genome sequencing studies of human fungal pathogens enable identification of genes and variants associated with virulence and drug resistance. This review describes current approaches, resources, and advances in applying whole genome sequencing to study clinically important fungal pathogens. Genomes for some important fungal pathogens were only recently assembled, revealing gene family expansions in many species and extreme gene loss in one obligate species. The scale and scope of species sequenced is rapidly expanding, leveraging technological advances to assemble and annotate genomes with higher precision. By using iteratively improved reference assemblies or those generated de novo for new species, recent studies have compared the sequence of isolates representing populations or clinical cohorts. Whole genome approaches provide the resolution necessary for comparison of closely related isolates, for example, in the analysis of outbreaks or sampled across time within a single host. Genomic analysis of fungal pathogens has enabled both basic research and diagnostic studies. The increased scale of sequencing can be applied across populations, and new metagenomic methods allow direct analysis of complex samples.
Development of a real-time PCR for detection of Staphylococcus pseudintermedius using a novel automated comparison of whole-genome sequences.

PubMed

Verstappen, Koen M; Huijbregts, Loes; Spaninks, Mirlin; Wagenaar, Jaap A; Fluit, Ad C; Duim, Birgitta

2017-01-01

Staphylococcus pseudintermedius is an opportunistic pathogen in dogs and cats and occasionally causes infections in humans. S. pseudintermedius is often resistant to multiple classes of antimicrobials. It requires reliable detection so that it is not misidentified as S. aureus. Phenotypic and currently-used molecular-based diagnostic assays lack specificity or are labour-intensive using multiplex PCR or nucleic acid sequencing. The aim of this study was to identify a specific target for real-time PCR by comparing whole genome sequences of S. pseudintermedius and non-pseudintermedius. Genome sequences were downloaded from public repositories and supplemented by isolates that were sequenced in this study. A Perl-script was written that analysed 300-nt fragments from a reference genome sequence of S. pseudintermedius and checked if this sequence was present in other S. pseudintermedius genomes (n = 74) and non-pseudintermedius genomes (n = 138). Six sequences specific for S. pseudintermedius were identified (sequence length between 300-500 nt). One sequence, which was located in the spsJ gene, was used to develop primers and a probe. The real-time PCR showed 100% specificity when testing for S. pseudintermedius isolates (n = 54), and eight other staphylococcal species (n = 43). In conclusion, a novel approach by comparing whole genome sequences identified a sequence that is specific for S. pseudintermedius and provided a real-time PCR target for rapid and reliable detection of S. pseudintermedius.
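The fragment comparison described in this abstract can be sketched as a window scan over the reference genome. The version below is in Python rather than Perl and is not the authors' script. It assumes genomes are held as plain uppercase strings, uses exact substring matching on both strands as a stand-in for whatever similarity criterion the original used, and all function and parameter names are hypothetical.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def contains(genome: str, fragment: str) -> bool:
    """True if the fragment occurs on either strand of the genome."""
    return fragment in genome or reverse_complement(fragment) in genome


def specific_fragments(reference, targets, non_targets, size=300, step=300):
    """Find reference windows present in every target genome and in no non-target genome.

    Returns a list of (start_position, fragment) tuples.
    """
    hits = []
    for start in range(0, len(reference) - size + 1, step):
        fragment = reference[start:start + size]
        in_all_targets = all(contains(g, fragment) for g in targets)
        in_any_non_target = any(contains(g, fragment) for g in non_targets)
        if in_all_targets and not in_any_non_target:
            hits.append((start, fragment))
    return hits
```

Exact matching is a strict criterion: a single mismatch in one target genome discards a window, so a practical screen would more likely allow a defined level of sequence identity.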
Comparative analysis of the complete sequence of the plastid genome of Parthenium argentatum and identification of DNA barcodes to differentiate Parthenium species and lines

PubMed Central

2009-01-01

Background Parthenium argentatum (guayule) is an industrial crop that produces latex, which was recently commercialized as a source of latex rubber safe for people with Type I latex allergy. The complete plastid genome of P. argentatum was sequenced. The sequence provides important information useful for genetic engineering strategies. Comparison to the sequences of plastid genomes from three other members of the Asteraceae, Lactuca sativa, Guizotia abyssinica and Helianthus annuus, revealed details of the evolution of the four genomes. Chloroplast-specific DNA barcodes were developed for identification of Parthenium species and lines. Results The complete plastid genome of P. argentatum is 152,803 bp. Based on the overall comparison of individual protein coding genes with those in L. sativa, G. abyssinica and H. annuus, we demonstrate that the P. argentatum chloroplast genome sequence is most closely related to that of H. annuus. Similar to chloroplast genomes in G. abyssinica, L. sativa and H. annuus, the plastid genome of P. argentatum has a large 23 kb inversion with a smaller 3.4 kb inversion within the large inversion. Using the matK and psbA-trnH spacer chloroplast DNA barcodes, three of the four Parthenium species tested, P. tomentosum, P. hysterophorus and P. schottii, can be differentiated from P. argentatum. In addition, we identified lines within P. argentatum. Conclusion The genome sequence of the P. argentatum chloroplast will enrich the sequence resources of plastid genomes in commercial crops. The availability of the complete plastid genome sequence may facilitate transformation efficiency by using the precise sequence of endogenous flanking sequences and regulatory elements in chloroplast transformation vectors. The DNA barcoding study forms the foundation for genetic identification of commercially significant lines of P. argentatum that are important for producing latex. PMID:19917140

Long-read whole genome sequencing and comparative analysis of six strains of the human pathogen Orientia tsutsugamushi.

PubMed

Batty, Elizabeth M; Chaemchuen, Suwittra; Blacksell, Stuart; Richards, Allen L; Paris, Daniel; Bowden, Rory; Chan, Caroline; Lachumanan, Ramkumar; Day, Nicholas; Donnelly, Peter; Chen, Swaine; Salje, Jeanne

2018-06-01

Orientia tsutsugamushi is a clinically important but neglected obligate intracellular bacterial pathogen of the Rickettsiaceae family that causes the potentially life-threatening human disease scrub typhus. In contrast to the genome reduction seen in many obligate intracellular bacteria, early genetic studies of Orientia have revealed one of the most repetitive bacterial genomes sequenced to date. The dramatic expansion of mobile elements has hampered efforts to generate complete genome sequences using short read sequencing methodologies, and consequently there have been few studies of the comparative genomics of this neglected species. We report new high-quality genomes of O. tsutsugamushi, generated using PacBio single molecule long read sequencing, for six strains: Karp, Kato, Gilliam, TA686, UT76 and UT176. In comparative genomics analyses of these strains together with existing reference genomes from Ikeda and Boryong strains, we identify a relatively small core genome of 657 genes, grouped into core gene islands and separated by repeat regions, and use the core genes to infer the first whole-genome phylogeny of Orientia. Complete assemblies of multiple Orientia genomes verify initial suggestions that these are remarkable organisms. They have larger genomes compared with most other Rickettsiaceae, with widespread amplification of repeat elements and massive chromosomal rearrangements between strains. At the gene level, Orientia has a relatively small set of universally conserved genes, similar to other obligate intracellular bacteria, and the relative expansion in genome size can be accounted for by gene duplication and repeat amplification. Our study demonstrates the utility of long read sequencing to investigate complex bacterial genomes and characterise genomic variation.
The 'dark matter' in the plant genomes: non-coding and unannotated DNA sequences associated with open chromatin.

PubMed

Jiang, Jiming

2015-04-01

Sequencing of complete plant genomes has become increasingly more routine since the advent of the next-generation sequencing technology. Identification and annotation of large amounts of noncoding but functional DNA sequences, including cis-regulatory DNA elements (CREs), have become a new frontier in plant genome research. Genomic regions containing active CREs bound to regulatory proteins are hypersensitive to DNase I digestion and are called DNase I hypersensitive sites (DHSs). Several recent DHS studies in plants illustrate that DHS datasets produced by DNase I digestion followed by next-generation sequencing (DNase-seq) are highly valuable for the identification and characterization of CREs associated with plant development and responses to environmental cues. DHS-based genomic profiling has opened a door to identify and annotate the 'dark matter' in sequenced plant genomes. Copyright © 2015 Elsevier Ltd. All rights reserved.
Selected Insights from Application of Whole Genome Sequencing for Outbreak Investigations

PubMed Central

Le, Vien Thi Minh; Diep, Binh An

2014-01-01

Purpose of review: The advent of high-throughput whole genome sequencing has the potential to revolutionize the conduct of outbreak investigation. Because of its ultimate pathogen strain resolution, whole genome sequencing could augment traditional epidemiologic investigations of infectious disease outbreaks. Recent findings: The combination of whole genome sequencing and intensive epidemiologic analysis provided new insights on the sources and transmission dynamics of large-scale epidemics caused by Escherichia coli and Vibrio cholerae, nosocomial outbreaks caused by methicillin-resistant Staphylococcus aureus, Klebsiella pneumoniae, and Mycobacterium abscessus, community-centered outbreaks caused by Mycobacterium tuberculosis, and a natural disaster-associated outbreak caused by environmentally acquired molds. Summary: When combined with traditional epidemiologic investigation, whole genome sequencing has proven useful for elucidating sources and transmission dynamics of disease outbreaks. Development of a fully automated bioinformatics pipeline for analysis of whole genome sequence data is much needed to make this powerful tool more widely accessible. PMID:23856896
Genomic Sequence around Butterfly Wing Development Genes: Annotation and Comparative Analysis

PubMed Central

Conceição, Inês C.; Long, Anthony D.; Gruber, Jonathan D.; Beldade, Patrícia

2011-01-01

Background: Analysis of genomic sequence allows characterization of genome content and organization, and access beyond gene-coding regions for identification of functional elements. BAC libraries, where relatively large genomic regions are made readily available, are especially useful for species without a fully sequenced genome and can increase genomic coverage of phylogenetic and biological diversity. For example, no butterfly genome is yet available despite the unique genetic and biological properties of this group, such as diversified wing color patterns. The evolution and development of these patterns is being studied in a few target species, including Bicyclus anynana, where a whole-genome BAC library allows targeted access to large genomic regions. Methodology/Principal Findings: We characterize ∼1.3 Mb of genomic sequence around 11 selected genes expressed in B. anynana developing wings. Extensive manual curation of in silico predictions, also making use of a large dataset of expressed genes for this species, identified repetitive elements and protein coding sequence, and highlighted an expansion of Alcohol dehydrogenase genes. Comparative analysis with orthologous regions of the lepidopteran reference genome allowed assessment of conservation of fine-scale synteny (with detection of new inversions and translocations) and of DNA sequence (with detection of high levels of conservation of non-coding regions around some, but not all, developmental genes). Conclusions: The general properties and organization of the available B. anynana genomic sequence are similar to the lepidopteran reference, despite the more than 140 MY divergence. Our results lay the groundwork for further studies of new interesting findings in relation to both coding and non-coding sequence: 1) the Alcohol dehydrogenase expansion with higher similarity between the five tandemly-repeated B. anynana paralogs than with the corresponding B. mori orthologs, and 2) the high conservation of non-coding sequence around the genes wingless and Ecdysone receptor, both involved in multiple developmental processes including wing pattern formation. PMID:21909358
The Large Mitochondrial Genome of Symbiodinium minutum Reveals Conserved Noncoding Sequences between Dinoflagellates and Apicomplexans

PubMed Central

Shoguchi, Eiichi; Shinzato, Chuya; Hisata, Kanako; Satoh, Nori; Mungpakdee, Sutada

2015-01-01

Even though mitochondrial genomes, which characterize eukaryotic cells, were first discovered more than 50 years ago, mitochondrial genomics remains an important topic in molecular biology and genome sciences. The Phylum Alveolata comprises three major groups (ciliates, apicomplexans, and dinoflagellates), the mitochondrial genomes of which have diverged widely. Even though the gene content of dinoflagellate mitochondrial genomes is reportedly comparable to that of apicomplexans, the highly fragmented and rearranged genome structures of dinoflagellates have frustrated whole genomic analysis. Consequently, noncoding sequences and gene arrangements of dinoflagellate mitochondrial genomes have not been well characterized. Here we report that the continuous assembled genome (∼326 kb) of the dinoflagellate, Symbiodinium minutum, is AT-rich (∼64.3%) and that it contains three protein-coding genes. Based upon in silico analysis, the remaining 99% of the genome comprises transcriptomic noncoding sequences. RNA edited sites and unique, possible start and stop codons clarify conserved regions among dinoflagellates. Our massive transcriptome analysis shows that almost all regions of the genome are transcribed, including 27 possible fragmented ribosomal RNA genes and 12 uncharacterized small RNAs that are similar to mitochondrial RNA genes of the malarial parasite, Plasmodium falciparum. Gene map comparisons show that gene order is only slightly conserved between S. minutum and P. falciparum. However, small RNAs and intergenic sequences share sequence similarities with P. falciparum, suggesting that the function of noncoding sequences has been preserved despite development of very different genome structures. PMID:26199191
Molecular variability analysis of five new complete cacao swollen shoot virus genomic sequences.

PubMed

Muller, E; Sackey, S

2005-01-01

Cacao swollen shoot virus (CSSV), a member of the family Caulimoviridae, genus Badnavirus, occurs in all the main cacao-growing areas of West Africa. We amplified, cloned and sequenced complete genomes of five new isolates, two originating from Togo and three originating from Ghana. The genomes of these five newly sequenced isolates all contain the five putative open reading frames I, II, III, X and Y described for the first sequenced CSSV isolate, Agou1, originating from Togo. Their genomes have been aligned with the genome of Agou1. The nucleotide and amino acid sequence identities between isolates have been calculated and a phylogenetic analysis has been made including other pararetroviruses. Maximum nucleotide sequence variability between complete genomes of CSSV isolates was 29.4%. Geographical differentiation between isolates appears more important than differentiation between mild and severe isolates. ORF X differs greatly in size and sequence between the Togolese isolates Nyongbo2 and Agou1 and the four other isolates; its functional role is therefore clearly questionable.
Analysis and Comparison of Aluminum Alloy Welded Joints Between Metal Inert Gas Welding and Tungsten Inert Gas Welding

NASA Astrophysics Data System (ADS)

Zhao, Lei; Guan, Yingchun; Wang, Qiang; Cong, Baoqiang; Qi, Bojin

2015-09-01

Surface contamination usually occurs during welding and strongly affects weld quality. However, the formation of such contaminants has seldom been studied. This work studies the contaminants produced by metal inert gas (MIG) and tungsten inert gas (TIG) welding of aluminum alloy. SEM, FTIR and XPS analyses were carried out to investigate the microstructure and surface chemistry. The contaminants were found to consist mainly of Al2O3, MgO, carbide and chromium complexes. The difference in contaminants between MIG and TIG welds was further examined. In addition, a method to minimize these contaminants was proposed.
Recapitulating phylogenies using k-mers: from trees to networks.

PubMed

Bernard, Guillaume; Ragan, Mark A; Chan, Cheong Xin

2016-01-01

Ernst Haeckel based his landmark Tree of Life on the supposed ontogenic recapitulation of phylogeny, i.e. that successive embryonic stages during the development of an organism re-trace the morphological forms of its ancestors over the course of evolution. Much of this idea has since been discredited. Today, phylogenies are often based on families of molecular sequences. The standard approach starts with a multiple sequence alignment, in which the sequences are arranged relative to each other in a way that maximises a measure of similarity position-by-position along their entire length. A tree (or sometimes a network) is then inferred. Rigorous multiple sequence alignment is computationally demanding, and evolutionary processes that shape the genomes of many microbes (bacteria, archaea and some morphologically simple eukaryotes) can add further complications. In particular, recombination, genome rearrangement and lateral genetic transfer undermine the assumptions that underlie multiple sequence alignment, and imply that a tree-like structure may be too simplistic. Here, using genome sequences of 143 bacterial and archaeal genomes, we construct a network of phylogenetic relatedness based on the number of shared k-mers (subsequences at fixed length k). Our findings suggest that the network captures not only key aspects of microbial genome evolution as inferred from a tree, but also features that are not treelike. The method is highly scalable, allowing for investigation of genome evolution across a large number of genomes. Instead of using specific regions or sequences from genome sequences, or indeed Haeckel's idea of ontogeny, we argue that genome phylogenies can be inferred using k-mers from whole-genome sequences. Representing these networks dynamically allows biological questions of interest to be formulated and addressed quickly and in a visually intuitive manner.
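The idea of relating genomes by their shared k-mers can be illustrated with a small sketch. This is not the authors' implementation: the function names, the toy sequences, and the choice to weight edges by the raw count of shared distinct k-mers are assumptions made here for illustration, and the abstract does not specify how counts are normalised or thresholded.

```python
from itertools import combinations

def kmers(seq, k):
    """Return the set of distinct length-k substrings of seq."""
    seq = seq.upper()
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}

def shared_kmer_edges(genomes, k, min_shared=1):
    """Return weighted edges (name_a, name_b, shared distinct k-mers).

    genomes: dict mapping a genome name to its sequence (hypothetical input).
    Pairs sharing fewer than min_shared k-mers get no edge.
    """
    profiles = {name: kmers(seq, k) for name, seq in genomes.items()}
    edges = []
    for a, b in combinations(sorted(profiles), 2):
        shared = len(profiles[a] & profiles[b])
        if shared >= min_shared:
            edges.append((a, b, shared))
    return edges

# Toy sequences, invented for this example only.
toy = {"g1": "ACGTACGTGA", "g2": "TTACGTGACC", "g3": "GGGGCCCCAA"}
edges = shared_kmer_edges(toy, k=4)
print(edges)  # [('g1', 'g2', 4)]
```

Because each genome is reduced to a set of k-mers, no alignment is needed, and genomes with no k-mers in common (g3 here) simply remain unconnected rather than being forced into a tree.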
GI-SVM: A sensitive method for predicting genomic islands based on unannotated sequence of a single genome.

PubMed

Lu, Bingxin; Leong, Hon Wai

2016-02-01

Genomic islands (GIs) are clusters of functionally related genes acquired by lateral genetic transfer (LGT), and they are present in many bacterial genomes. GIs are extremely important for bacterial research, because they not only promote genome evolution but also contain genes that enhance adaptation and enable antibiotic resistance. Many methods have been proposed to predict GIs, but most of them rely on either annotations or comparisons with other closely related genomes. Hence these methods cannot be easily applied to new genomes. As the number of newly sequenced bacterial genomes rapidly increases, there is a need for methods to detect GIs based solely on the sequence of a single genome. In this paper, we propose a novel method, GI-SVM, to predict GIs given only the unannotated genome sequence. GI-SVM is based on a one-class support vector machine (SVM), utilizing composition bias in terms of k-mer content. From our evaluations on three real genomes, GI-SVM can achieve higher recall compared with current methods, without much loss of precision. In addition, GI-SVM allows flexible parameter tuning to get optimal results for each genome. In short, GI-SVM provides a more sensitive method for researchers interested in a first-pass detection of GIs in newly sequenced genomes.
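The notion of composition bias in k-mer content can be sketched without any machine-learning library. The example below is deliberately not GI-SVM: it replaces the one-class SVM with a plain L1 distance between each window's k-mer frequencies and the genome-wide frequencies, and the window size, step, k, and toy sequence are all assumptions chosen to keep the example small.

```python
from collections import Counter

def kmer_freqs(seq, k):
    """Relative frequency of each k-mer in seq."""
    counts = Counter(seq[i:i + k] for i in range(len(seq) - k + 1))
    total = sum(counts.values())
    return {word: n / total for word, n in counts.items()}

def window_scores(genome, k=2, window=20, step=20):
    """Score each window by the L1 distance between its k-mer
    frequencies and the genome-wide background frequencies."""
    background = kmer_freqs(genome, k)
    scores = []
    for start in range(0, len(genome) - window + 1, step):
        local = kmer_freqs(genome[start:start + window], k)
        words = set(background) | set(local)
        dist = sum(abs(local.get(w, 0.0) - background.get(w, 0.0))
                   for w in words)
        scores.append((start, dist))
    return scores

# Toy "genome": an AT-rich background with a GC-rich insert at 100..119.
toy = "AT" * 50 + "GC" * 10 + "AT" * 50
scores = window_scores(toy)
best_start = max(scores, key=lambda s: s[1])[0]
print(best_start)  # 100
```

A one-class classifier plays a similar role to the distance threshold here, in that it learns what typical windows look like and flags atypical ones, but it can capture a richer notion of "typical" than a single background profile.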
A sequence-based survey of the complex structural organization of tumor genomes

DOE Office of Scientific and Technical Information (OSTI.GOV)

Collins, Colin; Raphael, Benjamin J.; Volik, Stanislav

2008-04-03

The genomes of many epithelial tumors exhibit extensive chromosomal rearrangements. All classes of genome rearrangements can be identified using End Sequencing Profiling (ESP), which relies on paired-end sequencing of cloned tumor genomes. In this study, brain, breast, ovary and prostate tumors along with three breast cancer cell lines were surveyed with ESP yielding the largest available collection of sequence-ready tumor genome breakpoints and providing evidence that some rearrangements may be recurrent. Sequencing and fluorescence in situ hybridization (FISH) confirmed translocations and complex tumor genome structures that include coamplification and packaging of disparate genomic loci with associated molecular heterogeneity. Comparison of the tumor genomes suggests recurrent rearrangements. Some are likely to be novel structural polymorphisms, whereas others may be bona fide somatic rearrangements. A recurrent fusion transcript in breast tumors and a constitutional fusion transcript resulting from a segmental duplication were identified. Analysis of end sequences for single nucleotide polymorphisms (SNPs) revealed candidate somatic mutations and an elevated rate of novel SNPs in an ovarian tumor. These results suggest that the genomes of many epithelial tumors may be far more dynamic and complex than previously appreciated and that genomic fusions including fusion transcripts and proteins may be common, possibly yielding tumor-specific biomarkers and therapeutic targets.
The mitochondrial genomes of the acoelomorph worms Paratomella rubra, Isodiametra pulchra and Archaphanostoma ylvae.

PubMed

Robertson, Helen E; Lapraz, François; Egger, Bernhard; Telford, Maximilian J; Schiffer, Philipp H

2017-05-12

Acoels are small, ubiquitous - but understudied - marine worms with a very simple body plan. Their internal phylogeny is still not fully resolved, and the position of their proposed phylum Xenacoelomorpha remains debated. Here we describe mitochondrial genome sequences from the acoels Paratomella rubra and Isodiametra pulchra, and the complete mitochondrial genome of the acoel Archaphanostoma ylvae. The P. rubra and A. ylvae sequences are typical for metazoans in size and gene content. The larger I. pulchra mitochondrial genome contains both ribosomal genes, 21 tRNAs, but only 11 protein-coding genes. We find evidence suggesting a duplicated sequence in the I. pulchra mitochondrial genome. The P. rubra, I. pulchra and A. ylvae mitochondria have a unique genome organisation in comparison to other metazoan mitochondrial genomes. We found a large degree of protein-coding gene and tRNA overlap with little non-coding sequence in the compact P. rubra genome. Conversely, the A. ylvae and I. pulchra genomes have many long non-coding sequences between genes, likely driving genome size expansion in the latter. Phylogenetic trees inferred from mitochondrial genes retrieve Xenacoelomorpha as an early branching taxon in the deuterostomes. Sequence divergence analysis between P. rubra sampled in England and Spain indicates cryptic diversity.
Using genic sequence capture in combination with a syntenic pseudo genome to map a deletion mutant in a wheat species.

PubMed

Gardiner, Laura-Jayne; Gawroński, Piotr; Olohan, Lisa; Schnurbusch, Thorsten; Hall, Neil; Hall, Anthony

2014-12-01

Mapping-by-sequencing analyses have largely required a complete reference sequence and employed whole genome re-sequencing. In species such as wheat, no finished genome reference sequence is available. Additionally, because of its large genome size (17 Gb), re-sequencing at sufficient depth of coverage is not practical. Here, we extend the utility of mapping by sequencing, developing a bespoke pipeline and algorithm to map an early-flowering locus in einkorn wheat (Triticum monococcum L.) that is closely related to the bread wheat genome A progenitor. We have developed a genomic enrichment approach using the gene-rich regions of hexaploid bread wheat to design a 110-Mbp NimbleGen SeqCap EZ in-solution capture probe set, representing the majority of genes in wheat. Here, we use the capture probe set to enrich and sequence an F2 mapping population of the mutant. The mutant locus was identified in T. monococcum, which lacks a complete genome reference sequence, by mapping the enriched data set onto pseudo-chromosomes derived from the capture probe target sequence, with a long-range order of genes based on synteny of wheat with Brachypodium distachyon. Using this approach we are able to map the region and identify a set of deleted genes within the interval. © 2014 The Authors. The Plant Journal published by Society for Experimental Biology and John Wiley & Sons Ltd.
Recurrence time statistics: versatile tools for genomic DNA sequence analysis.

PubMed

Cao, Yinhe; Tung, Wen-Wen; Gao, J B

2004-01-01

With the completion of the human and a few model organisms' genomes, and the genomes of many other organisms waiting to be sequenced, it has become increasingly important to develop faster computational tools which are capable of easily identifying the structures and extracting features from DNA sequences. Repeat-related structures are among the more important structures in a DNA sequence. Often they have to be masked before protein coding regions along a DNA sequence are to be identified or redundant expressed sequence tags (ESTs) are to be sequenced. Here we report a novel recurrence time based method for sequence analysis. The method can conveniently study all kinds of periodicity and exhaustively find all repeat-related features from a genomic DNA sequence. An efficient codon index is also derived from the recurrence time statistics, which has the salient features of being largely species-independent and working well on very short sequences. Efficient codon indices are key elements of successful gene finding algorithms, and are particularly useful for determining whether a suspected EST belongs to a coding or non-coding region. We illustrate the power of the method by studying the genomes of E. coli, the yeast S. cerevisiae, the nematode worm C. elegans, and the human, Homo sapiens. Computationally, our method is very efficient. It allows us to carry out analysis of genomes on the whole genomic scale on a PC.
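A recurrence time can be illustrated as the gap between successive occurrences of the same word in a sequence. The abstract does not give the authors' exact definition, so the sketch below assumes one simple version: for each length-k word, record the distance back to its previous occurrence and build a histogram of those distances. The function name and the toy sequence are hypothetical.

```python
from collections import Counter

def recurrence_times(seq, k):
    """Histogram of gaps between successive occurrences of each k-mer.

    Returns a Counter mapping gap length to how often that gap was seen.
    """
    last_seen = {}   # k-mer -> index of its most recent occurrence
    gaps = Counter()
    for i in range(len(seq) - k + 1):
        word = seq[i:i + k]
        if word in last_seen:
            gaps[i - last_seen[word]] += 1
        last_seen[word] = i
    return gaps

# A perfect tandem repeat of the unit "ATG" (toy input).
hist = recurrence_times("ATGATGATGATG", k=3)
print(dict(hist))  # {3: 7}
```

In this toy case every recurrence falls at distance 3, the length of the repeat unit, which is how a periodicity shows up as a peak in the recurrence time histogram. The scan is a single pass with a dictionary lookup per position, so it scales linearly with sequence length.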
Draft genome sequence of the coccolithovirus Emiliania huxleyi virus 202.

PubMed

Nissimov, Jozef I; Worthy, Charlotte A; Rooks, Paul; Napier, Johnathan A; Kimmance, Susan A; Henn, Matthew R; Ogata, Hiroyuki; Allen, Michael J

2012-02-01

Emiliania huxleyi virus 202 (EhV-202) is a member of the Coccolithoviridae, a group of viruses that infect the marine coccolithophorid Emiliania huxleyi. EhV-202 has a 160- to 180-nm-diameter icosahedral structure and a genome of approximately 407 kbp, consisting of 485 coding sequences (CDSs). Here we describe the genomic features of EhV-202, together with a draft genome sequence and its annotation, highlighting the homology and heterogeneity of this genome in comparison with the EhV-86 reference genome.
Draft genome sequence of the Coccolithovirus Emiliania huxleyi virus 203.

PubMed

Nissimov, Jozef I; Worthy, Charlotte A; Rooks, Paul; Napier, Johnathan A; Kimmance, Susan A; Henn, Matthew R; Ogata, Hiroyuki; Allen, Michael J

2011-12-01

The Coccolithoviridae are a recently discovered group of viruses that infect the marine coccolithophorid Emiliania huxleyi. Emiliania huxleyi virus 203 (EhV-203) has a 160- to 180-nm-diameter icosahedral structure and a genome of approximately 400 kbp, consisting of 464 coding sequences (CDSs). Here we describe the genomic features of EhV-203 together with a draft genome sequence and its annotation, highlighting the homology and heterogeneity of this genome in comparison with the EhV-86 reference genome.
Substantial genome synteny preservation among woody angiosperm species: comparative genomics of Chinese chestnut (Castanea mollissima) and plant reference genomes.

PubMed

Staton, Margaret; Zhebentyayeva, Tetyana; Olukolu, Bode; Fang, Guang Chen; Nelson, Dana; Carlson, John E; Abbott, Albert G

2015-10-05

Chinese chestnut (Castanea mollissima) has emerged as a model species for the Fagaceae family with extensive genomic resources including a physical map, a dense genetic map and quantitative trait loci (QTLs) for chestnut blight resistance. These resources enable comparative genomics analyses relative to model plants. We assessed the degree of conservation between the chestnut genome and other well annotated and assembled plant genomic sequences, focusing on the QTL regions of most interest to the chestnut breeding community. The integrated physical and genetic map of Chinese chestnut has been improved to now include 858 shared sequence-based markers. The utility of the integrated map has also been improved through the addition of 42,970 BAC (bacterial artificial chromosome) end sequences spanning over 26 million bases of the estimated 800 Mb chestnut genome. Synteny between chestnut and ten model plant species was assessed on a macro-syntenic scale using sequences from both individual probes and BAC end sequences across the chestnut physical map. Blocks of synteny with chestnut were found in all ten reference species, with the percent of the chestnut physical map that could be aligned ranging from 10 to 39%. The integrated genetic and physical map was utilized to identify BACs that spanned the three previously identified QTL regions conferring blight resistance. The clones were pooled and sequenced, yielding 396 sequence scaffolds covering 13.9 Mbp. Comparative genomic analysis on a microsyntenic scale, using the QTL-associated genomic sequence, identified synteny from chestnut to other plant genomes ranging from 5.4 to 12.9% of the genome sequences aligning. On both the macro- and micro-synteny levels, the peach, grape and poplar genomes were found to be the most structurally conserved with chestnut. Interestingly, these results did not strictly follow the expectation that decreased phylogenetic distance would correspond to increased levels of genome preservation, but rather suggest the additional influence of life-history traits on preservation of synteny. The regions of synteny that were detected provide an important tool for defining and cataloging genes in the QTL regions for advancing chestnut blight resistance research.
The Past, Present, and Future of Human Centromere Genomics

PubMed Central

Aldrup-MacDonald, Megan E.; Sullivan, Beth A.

2014-01-01

The centromere is the chromosomal locus essential for chromosome inheritance and genome stability. Human centromeres are located at repetitive alpha satellite DNA arrays that compose approximately 5% of the genome. Contiguous alpha satellite DNA sequence is absent from the assembled reference genome, limiting current understanding of centromere organization and function. Here, we review the progress in centromere genomics spanning the discovery of the sequence to its molecular characterization and the work done during the Human Genome Project era to elucidate alpha satellite structure and sequence variation. We discuss exciting recent advances in alpha satellite sequence assembly that have provided important insight into the abundance and complex organization of this sequence on human chromosomes. In light of these new findings, we offer perspectives for future studies of human centromere assembly and function. PMID:24683489
Saccharomyces cerevisiae: gene annotation and genome variability, state of the art through comparative genomics.

PubMed

Louis, Ed

2011-01-01

In the early days of the yeast genome sequencing project, gene annotation was in its infancy and suffered the problem of many false positive annotations as well as missed genes. The lack of other sequences for comparison also prevented the annotation of conserved, functional sequences that were not coding. We are now in an era of comparative genomics where many closely related as well as more distantly related genomes are available for direct sequence and synteny comparisons allowing for more probable predictions of genes and other functional sequences due to conservation. We also have a plethora of functional genomics data which helps inform gene annotation for previously uncharacterised open reading frames (ORFs)/genes. For Saccharomyces cerevisiae this has resulted in a continuous updating of the gene and functional sequence annotations in the reference genome helping it retain its position as the best characterized eukaryotic organism's genome. A single reference genome for a species does not accurately describe the species and this is quite clear in the case of S. cerevisiae where the reference strain is not ideal for brewing or baking due to missing genes. Recent surveys of numerous isolates, from a variety of sources, using a variety of technologies have revealed a great deal of variation amongst isolates with genome sequence surveys providing information on novel genes, undetectable by other means. We now have a better understanding of the extant variation in S. cerevisiae as a species as well as some idea of how much we are missing from this understanding. As with gene annotation, comparative genomics enhances the discovery and description of genome variation and is providing us with the tools for understanding genome evolution, adaptation and selection, and underlying genetics of complex traits.
Complete Chloroplast Genome Sequences of Important Oilseed Crop Sesamum indicum L

PubMed Central

Yi, Dong-Keun; Kim, Ki-Joong

2012-01-01

Sesamum indicum is an important crop plant species for yielding oil. The complete chloroplast (cp) genome of S. indicum (GenBank acc no. JN637766) is 153,324 bp in length, and has a pair of inverted repeat (IR) regions consisting of 25,141 bp each. The lengths of the large single copy (LSC) and the small single copy (SSC) regions are 85,170 bp and 17,872 bp, respectively. Comparative cp DNA sequence analyses of S. indicum with other cp genomes reveal that the genome structure, gene order, gene and intron contents, AT contents, codon usage, and transcription units are similar to the typical angiosperm cp genomes. Nucleotide diversity of the IR region between Sesamum and three other cp genomes is much lower than that of the LSC and SSC regions in both the coding region and noncoding region. In summary, the regional constraints strongly affect the sequence evolution of the cp genomes, while the functional constraints weakly affect the sequence evolution of cp genomes. Five short inversions associated with short palindromic sequences that form stem-loop structures were observed in the chloroplast genome of S. indicum. Twenty-eight different simple sequence repeat (SSR) loci have been detected in the chloroplast genome of S. indicum. Almost all of the SSR loci were composed of A or T, so this may also contribute to the A-T richness of the cp genome of S. indicum. Seven large repeated loci in the chloroplast genome of S. indicum were also identified and these loci are useful for developing S. indicum-specific cp genome vectors. The complete cp DNA sequences of S. indicum reported in this paper are a prerequisite for modifying this important oilseed crop by cp genetic engineering techniques. PMID:22606240
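The region lengths quoted above can be checked against the reported total, since a chloroplast genome with this quadripartite structure consists of one LSC, one SSC and two copies of the IR. The short calculation below uses only the numbers from the abstract; the variable names are chosen here for illustration.

```python
# Region lengths in bp, as reported in the abstract.
lsc = 85_170   # large single copy
ssc = 17_872   # small single copy
ir = 25_141    # one inverted repeat copy; the genome carries two

total = lsc + ssc + 2 * ir
ir_fraction = 2 * ir / total   # share of the genome inside the two IRs

print(total)                   # 153324
print(round(ir_fraction, 3))   # 0.328
```

The sum reproduces the stated genome length of 153,324 bp, and roughly a third of the genome lies within the two IR copies.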
Whole genome sequence and genome annotation of Colletotrichum acutatum, causal agent of anthracnose in pepper plants in South Korea.

PubMed

Han, Joon-Hee; Chon, Jae-Kyung; Ahn, Jong-Hwa; Choi, Ik-Young; Lee, Yong-Hwan; Kim, Kyoung Su

2016-06-01

Colletotrichum acutatum is a destructive fungal pathogen which causes anthracnose in a wide range of crops. Here we report the whole genome sequence and annotation of C. acutatum strain KC05, isolated from an infected pepper in Kangwon, South Korea. Genomic DNA from the KC05 strain was used for the whole genome sequencing using a PacBio sequencer and the MiSeq system. The KC05 genome was determined to be 52,190,760 bp in size with a G + C content of 51.73% in 27 scaffolds and to contain 13,559 genes with an average length of 1516 bp. Gene prediction and annotation were performed by incorporating RNA-Seq data. The genome sequence of the KC05 was deposited at DDBJ/ENA/GenBank under the accession number LUXP00000000.

Ensembl 2002: accommodating comparative genomics.

PubMed

Clamp, M; Andrews, D; Barker, D; Bevan, P; Cameron, G; Chen, Y; Clark, L; Cox, T; Cuff, J; Curwen, V; Down, T; Durbin, R; Eyras, E; Gilbert, J; Hammond, M; Hubbard, T; Kasprzyk, A; Keefe, D; Lehvaslaiho, H; Iyer, V; Melsopp, C; Mongin, E; Pettett, R; Potter, S; Rust, A; Schmidt, E; Searle, S; Slater, G; Smith, J; Spooner, W; Stabenau, A; Stalker, J; Stupka, E; Ureta-Vidal, A; Vastrik, I; Birney, E

2003-01-01

The Ensembl (http://www.ensembl.org/) database project provides a bioinformatics framework to organise biology around the sequences of large genomes. It is a comprehensive source of stable automatic annotation of human, mouse and other genome sequences, available as either an interactive web site or as flat files. Ensembl also integrates manually annotated gene structures from external sources where available. As well as being one of the leading sources of genome annotation, Ensembl is an open source software engineering project to develop a portable system able to handle very large genomes and associated requirements. These range from sequence analysis to data storage and visualisation, and installations exist around the world both in companies and at academic sites. With both human and mouse genome sequences available and more vertebrate sequences to follow, many of the recent developments in Ensembl have focused on developing automatic comparative genome analysis and visualisation.
Complete genome sequence of Coriobacterium glomerans type strain (PW2T) from the midgut of Pyrrhocoris apterus L. (red soldier bug)

DOE Office of Scientific and Technical Information (OSTI.GOV)

Stackebrandt, Erko; Zeytun, Ahmet; Lapidus, Alla L.

2013-01-01

Coriobacterium glomerans Haas and König 1988 is the only species of the genus Coriobacterium, family Coriobacteriaceae, order Coriobacteriales, phylum Actinobacteria. The bacterium thrives as an endosymbiont of pyrrhocorid bugs, i.e. the red fire bug Pyrrhocoris apterus L. The rationale for sequencing the genome of strain PW2T is its endosymbiotic life style, which is rare among members of Actinobacteria. Here we describe the features of this symbiont, together with the complete genome sequence and its annotation. This is the first complete genome sequence of a member of the genus Coriobacterium and the sixth member of the order Coriobacteriales for which complete genome sequences are now available. The 2,115,681 bp long single replicon genome with its 1,804 protein-coding and 54 RNA genes is part of the Genomic Encyclopedia of Bacteria and Archaea project.
Ensembl 2004.

PubMed

Birney, E; Andrews, D; Bevan, P; Caccamo, M; Cameron, G; Chen, Y; Clarke, L; Coates, G; Cox, T; Cuff, J; Curwen, V; Cutts, T; Down, T; Durbin, R; Eyras, E; Fernandez-Suarez, X M; Gane, P; Gibbins, B; Gilbert, J; Hammond, M; Hotz, H; Iyer, V; Kahari, A; Jekosch, K; Kasprzyk, A; Keefe, D; Keenan, S; Lehvaslaiho, H; McVicker, G; Melsopp, C; Meidl, P; Mongin, E; Pettett, R; Potter, S; Proctor, G; Rae, M; Searle, S; Slater, G; Smedley, D; Smith, J; Spooner, W; Stabenau, A; Stalker, J; Storey, R; Ureta-Vidal, A; Woodwark, C; Clamp, M; Hubbard, T

2004-01-01

The Ensembl (http://www.ensembl.org/) database project provides a bioinformatics framework to organize biology around the sequences of large genomes. It is a comprehensive and integrated source of annotation of large genome sequences, available via interactive website, web services or flat files. As well as being one of the leading sources of genome annotation, Ensembl is an open source software engineering project to develop a portable system able to handle very large genomes and associated requirements. The facilities of the system range from sequence analysis to data storage and visualization and installations exist around the world both in companies and at academic sites. With a total of nine genome sequences available from Ensembl and more genomes to follow, recent developments have focused mainly on closer integration between genomes and external data.
A computational genomics pipeline for prokaryotic sequencing projects

PubMed Central

Kislyuk, Andrey O.; Katz, Lee S.; Agrawal, Sonia; Hagen, Matthew S.; Conley, Andrew B.; Jayaraman, Pushkala; Nelakuditi, Viswateja; Humphrey, Jay C.; Sammons, Scott A.; Govil, Dhwani; Mair, Raydel D.; Tatti, Kathleen M.; Tondella, Maria L.; Harcourt, Brian H.; Mayer, Leonard W.; Jordan, I. King

2010-01-01

Motivation: New sequencing technologies have accelerated research on prokaryotic genomes and have made genome sequencing operations outside major genome sequencing centers routine. However, no off-the-shelf solution exists for the combined assembly, gene prediction, genome annotation and data presentation necessary to interpret sequencing data. The resulting requirement to invest significant resources into custom informatics support for genome sequencing projects remains a major impediment to the accessibility of high-throughput sequence data. Results: We present a self-contained, automated high-throughput open source genome sequencing and computational genomics pipeline suitable for prokaryotic sequencing projects. The pipeline has been used at the Georgia Institute of Technology and the Centers for Disease Control and Prevention for the analysis of Neisseria meningitidis and Bordetella bronchiseptica genomes. The pipeline is capable of enhanced or manually assisted reference-based assembly using multiple assemblers and modes; gene predictor combining; and functional annotation of genes and gene products. Because every component of the pipeline is executed on a local machine with no need to access resources over the Internet, the pipeline is suitable for projects of a sensitive nature. Annotation of virulence-related features makes the pipeline particularly useful for projects working with pathogenic prokaryotes. Availability and implementation: The pipeline is licensed under the open-source GNU General Public License and available at the Georgia Tech Neisseria Base (http://nbase.biology.gatech.edu/). The pipeline is implemented with a combination of Perl, Bourne Shell and MySQL and is compatible with Linux and other Unix systems. Contact: king.jordan@biology.gatech.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:20519285
A Case Study into Microbial Genome Assembly Gap Sequences and Finishing Strategies.

PubMed

Utturkar, Sagar M; Klingeman, Dawn M; Hurt, Richard A; Brown, Steven D

2017-01-01

This study characterized regions of DNA which remained unassembled by either PacBio or Illumina sequencing technologies for seven bacterial genomes. Two genomes were manually finished using bioinformatics and PCR/Sanger sequencing approaches, and regions not assembled by automated software were analyzed. Gaps present within Illumina assemblies mostly correspond to repetitive DNA regions such as multiple rRNA operon sequences. PacBio gap sequences were evaluated for several properties such as GC content, read coverage, gap length, ability to form strong secondary structures, and corresponding annotations. Our hypothesis that strong secondary DNA structures blocked DNA polymerases and contributed to gap sequences was not accepted. PacBio assemblies had few limitations overall, and gaps were explained as the cumulative effect of lower than average sequence coverage and repetitive sequences at contig termini. An important aspect of the present study is the compilation of biological features that interfered with assembly, which included active transposons, multiple plasmid sequences, phage DNA integration, and large sequence duplication. Our targeted genome finishing approach and systematic evaluation of the unassembled DNA will be useful for others looking to close, finish, and polish microbial genome sequences.
Genome-Wide Search Identifies 1.9 Mb from the Polar Bear Y Chromosome for Evolutionary Analyses.

PubMed

Bidon, Tobias; Schreck, Nancy; Hailer, Frank; Nilsson, Maria A; Janke, Axel

2015-05-27

The male-inherited Y chromosome is the major haploid fraction of the mammalian genome, rendering Y-linked sequences an indispensable resource for evolutionary research. However, despite recent large-scale genome sequencing approaches, only a handful of Y chromosome sequences have been characterized to date, mainly in model organisms. Using polar bear (Ursus maritimus) genomes, we compare two different in silico approaches to identify Y-linked sequences: 1) Similarity to known Y-linked genes and 2) difference in the average read depth of autosomal versus sex chromosomal scaffolds. Specifically, we mapped available genomic sequencing short reads from a male and a female polar bear against the reference genome and identify 112 Y-chromosomal scaffolds with a combined length of 1.9 Mb. We verified the in silico findings for the longer polar bear scaffolds by male-specific in vitro amplification, demonstrating the reliability of the average read depth approach. The obtained Y chromosome sequences contain protein-coding sequences, single nucleotide polymorphisms, microsatellites, and transposable elements that are useful for evolutionary studies. A high-resolution phylogeny of the polar bear patriline shows two highly divergent Y chromosome lineages, obtained from analysis of the identified Y scaffolds in 12 previously published male polar bear genomes. Moreover, we find evidence of gene conversion among ZFX and ZFY sequences in the giant panda lineage and in the ancestor of ursine and tremarctine bears. Thus, the identification of Y-linked scaffold sequences from unordered genome sequences yields valuable data to infer phylogenomic and population-genomic patterns in bears. © The Author(s) 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
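The average read depth approach described in this abstract can be illustrated with a minimal Python sketch. It is not the authors' pipeline: the function name, the input dictionaries of per-scaffold mean depth, and both ratio thresholds are hypothetical choices made only for illustration. The idea is that a Y-linked scaffold should attract almost no reads from a female, while an X-linked scaffold should show roughly twice the normalized depth in a female as in a male.

```python
from statistics import median

def classify_scaffolds(male_depth, female_depth, y_max_ratio=0.1, x_min_ratio=1.5):
    """Label scaffolds from the female/male ratio of normalized read depth.

    male_depth and female_depth map scaffold name -> mean read depth.
    The two thresholds are illustrative assumptions, not published values.
    """
    shared = sorted(set(male_depth) & set(female_depth))
    # Normalize by each sample's median depth so that differences in
    # sequencing effort between the two libraries cancel out.
    male_median = median(male_depth[s] for s in shared)
    female_median = median(female_depth[s] for s in shared)
    labels = {}
    for scaffold in shared:
        male_norm = male_depth[scaffold] / male_median
        female_norm = female_depth[scaffold] / female_median
        if male_norm == 0:
            labels[scaffold] = "uncovered"
            continue
        ratio = female_norm / male_norm
        if ratio <= y_max_ratio:
            labels[scaffold] = "Y-candidate"      # present in male only
        elif ratio >= x_min_ratio:
            labels[scaffold] = "X-candidate"      # two copies in female, one in male
        else:
            labels[scaffold] = "autosomal"
    return labels

# Toy depths: scf3 behaves like a Y scaffold, scf4 like an X scaffold.
male = {"scf1": 30, "scf2": 31, "scf3": 15, "scf4": 14, "scf5": 29}
female = {"scf1": 30, "scf2": 29, "scf3": 0.2, "scf4": 30, "scf5": 31}
print(classify_scaffolds(male, female))
```

Candidates found this way would still need confirmation, which the study obtained through male-specific in vitro amplification.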
Next Generation Sequencing at the University of Chicago Genomics Core

DOE Office of Scientific and Technical Information (OSTI.GOV)

Faber, Pieter

2013-04-24

The University of Chicago Genomics Core provides University of Chicago investigators (and external clients) access to State-of-the-Art genomics capabilities: next generation sequencing, Sanger sequencing / genotyping and micro-arrays (gene expression, genotyping, and methylation). The current presentation will highlight our capabilities in the area of ultra-high throughput sequencing analysis.
BAC-pool 454-sequencing: A rapid and efficient approach to sequence complex tetraploid cotton genomes

USDA-ARS's Scientific Manuscript database

New and emerging next generation sequencing technologies have been promising in reducing sequencing costs, but not significantly for complex polyploid plant genomes such as cotton. The large and highly repetitive genome of G. hirsutum (~2.5 Gb) is less amenable and cost-intensive with traditional BAC-by...
Multiplex sequencing of plant chloroplast genomes using Solexa sequencing-by-synthesis technology

Treesearch

Richard Cronn; Aaron Liston; Matthew Parks; David S. Gernandt; Rongkun Shen; Todd Mockler

2008-01-01

Organellar DNA sequences are widely used in evolutionary and population genetic studies; however, the conservative nature of chloroplast gene and genome evolution often limits phylogenetic resolution and statistical power. To gain maximal access to the historical record contained within chloroplast genomes, we have adapted multiplex sequencing-by-synthesis (MSBS) to...
Sequencing of 15,622 gene-bearing BACs clarifies the gene-dense regions of the barley genome

USDA-ARS's Scientific Manuscript database

Barley (Hordeum vulgare L.) possesses a large and highly repetitive genome of 5.1 Gb that has hindered the development of a complete sequence. In 2012, the International Barley Sequencing Consortium released a resource integrating whole-genome shotgun sequences with a physical and genetic framework....
The Release 6 reference sequence of the Drosophila melanogaster genome

DOE PAGES

Hoskins, Roger A.; Carlson, Joseph W.; Wan, Kenneth H.; ...

2015-01-14

Drosophila melanogaster plays an important role in molecular, genetic, and genomic studies of heredity, development, metabolism, behavior, and human disease. The initial reference genome sequence reported more than a decade ago had a profound impact on progress in Drosophila research, and improving the accuracy and completeness of this sequence continues to be important to further progress. We previously described improvement of the 117-Mb sequence in the euchromatic portion of the genome and 21 Mb in the heterochromatic portion, using a whole-genome shotgun assembly, BAC physical mapping, and clone-based finishing. Here, we report an improved reference sequence of the single-copy and middle-repetitive regions of the genome, produced using cytogenetic mapping to mitotic and polytene chromosomes, clone-based finishing and BAC fingerprint verification, ordering of scaffolds by alignment to cDNA sequences, incorporation of other map and sequence data, and validation by whole-genome optical restriction mapping. These data substantially improve the accuracy and completeness of the reference sequence and the order and orientation of sequence scaffolds into chromosome arm assemblies. Representation of the Y chromosome and other heterochromatic regions is particularly improved. The new 143.9-Mb reference sequence, designated Release 6, effectively exhausts clone-based technologies for mapping and sequencing. Highly repeat-rich regions, including large satellite blocks and functional elements such as the ribosomal RNA genes and the centromeres, are largely inaccessible to current sequencing and assembly methods and remain poorly represented. In conclusion, further significant improvements will require sequencing technologies that do not depend on molecular cloning and that produce very long reads.
Next Generation Sequencing Technologies: The Doorway to the Unexplored Genomics of Non-Model Plants

PubMed Central

Unamba, Chibuikem I. N.; Nag, Akshay; Sharma, Ram K.

2015-01-01

Non-model plants, i.e., species that have one or all of such characteristics as a long life cycle, difficulty of growth in the laboratory, or poor fecundity, were previously left out of sequencing projects because of the high running cost of Sanger sequencing. Consequently, information about their genomics and key biological processes is inadequate. However, the advent of fast and cost-effective next generation sequencing (NGS) platforms in the recent past has enabled the unearthing of certain characteristic gene structures unique to these species. It has also aided in gaining insight into the mechanisms underlying gene expression and secondary metabolism, and has facilitated the development of genomic resources for diversity characterization, evolutionary analysis, and marker-assisted breeding even without prior availability of genomic sequence information. In this review we explore how different NGS platforms, as well as recent advances in NGS-based high-throughput genotyping technologies, are rewarding efforts on de novo whole genome/transcriptome sequencing and on the development of genome-wide sequence-based marker resources for improvement of non-model crops that are less costly than phenotyping. PMID:26734016
The genome of flax (Linum usitatissimum) assembled de novo from short shotgun sequence reads.

PubMed

Wang, Zhiwen; Hobson, Neil; Galindo, Leonardo; Zhu, Shilin; Shi, Daihu; McDill, Joshua; Yang, Linfeng; Hawkins, Simon; Neutelings, Godfrey; Datla, Raju; Lambert, Georgina; Galbraith, David W; Grassa, Christopher J; Geraldes, Armando; Cronk, Quentin C; Cullis, Christopher; Dash, Prasanta K; Kumar, Polumetla A; Cloutier, Sylvie; Sharpe, Andrew G; Wong, Gane K-S; Wang, Jun; Deyholos, Michael K

2012-11-01

Flax (Linum usitatissimum) is an ancient crop that is widely cultivated as a source of fiber, oil and medicinally relevant compounds. To accelerate crop improvement, we performed whole-genome shotgun sequencing of the nuclear genome of flax. Seven paired-end libraries ranging in size from 300 bp to 10 kb were sequenced using an Illumina genome analyzer. A de novo assembly, comprised exclusively of deep-coverage (approximately 94× raw, approximately 69× filtered) short-sequence reads (44-100 bp), produced a set of scaffolds with N50 = 694 kb, including contigs with N50 = 20.1 kb. The contig assembly contained 302 Mb of non-redundant sequence representing an estimated 81% genome coverage. Up to 96% of published flax ESTs aligned to the whole-genome shotgun scaffolds. However, comparisons with independently sequenced BACs and fosmids showed some mis-assembly of regions at the genome scale. A total of 43,384 protein-coding genes were predicted in the whole-genome shotgun assembly, and up to 93% of published flax ESTs, and 86% of A. thaliana genes aligned to these predicted genes, indicating excellent coverage and accuracy at the gene level. Analysis of the synonymous substitution rates (Ks) observed within duplicate gene pairs was consistent with a recent (5-9 MYA) whole-genome duplication in flax. Within the predicted proteome, we observed enrichment of many conserved domains (Pfam-A) that may contribute to the unique properties of this crop, including agglutinin proteins. Together these results show that de novo assembly, based solely on whole-genome shotgun short-sequence reads, is an efficient means of obtaining nearly complete genome sequence information for some plant species. © 2012 The Authors. The Plant Journal © 2012 Blackwell Publishing Ltd.
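The scaffold and contig N50 values quoted in this abstract follow the usual definition: the length L such that sequences of length L or longer contain at least half of the total assembled bases. A minimal Python sketch of that calculation is given below; the function name and the toy lengths are illustrative and are not taken from the flax assembly.

```python
def n50(lengths):
    """Return the N50 of a collection of contig or scaffold lengths.

    Lengths are sorted from longest to shortest and accumulated until
    the running sum reaches half of the total assembly size.
    """
    total = sum(lengths)
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        if 2 * running >= total:
            return length
    raise ValueError("n50() needs at least one positive length")

# Toy assembly of five contigs totalling 20 kb.
print(n50([2_000, 3_000, 4_000, 5_000, 6_000]))
```

Because N50 depends only on the length distribution, it says nothing about correctness, which is why the abstract separately reports mis-assemblies detected against BACs and fosmids.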
Sequence Search and Comparative Genomic Analysis of SUMO-Activating Enzymes Using CoGe.

PubMed

Carretero-Paulet, Lorenzo; Albert, Victor A

2016-01-01

The growing number of genome sequences completed during the last few years has made necessary the development of bioinformatics tools for the easy access and retrieval of sequence data, as well as for downstream comparative genomic analyses. Some of these are implemented as online platforms that integrate genomic data produced by different genome sequencing initiatives with data mining tools as well as various comparative genomic and evolutionary analysis possibilities. Here, we use the online comparative genomics platform CoGe (http://www.genomevolution.org/coge/) (Lyons and Freeling. Plant J 53:661-673, 2008; Tang and Lyons. Front Plant Sci 3:172, 2012) (1) to retrieve the entire complement of orthologous and paralogous genes belonging to the SUMO-Activating Enzymes 1 (SAE1) gene family from a set of species representative of the Brassicaceae plant eudicot family with genomes fully sequenced, and (2) to investigate the history, timing, and molecular mechanisms of the gene duplications driving the evolutionary expansion and functional diversification of the SAE1 family in Brassicaceae.
APPLaUD: access for patients and participants to individual level uninterpreted genomic data.

PubMed

Thorogood, Adrian; Bobe, Jason; Prainsack, Barbara; Middleton, Anna; Scott, Erick; Nelson, Sarah; Corpas, Manuel; Bonhomme, Natasha; Rodriguez, Laura Lyman; Murtagh, Madeleine; Kleiderman, Erika

2018-02-17

There is a growing support for the stance that patients and research participants should have better and easier access to their raw (uninterpreted) genomic sequence data in both clinical and research contexts. We review legal frameworks and literature on the benefits, risks, and practical barriers of providing individuals access to their data. We also survey genomic sequencing initiatives that provide or plan to provide individual access. Many patients and research participants expect to be able to access their health and genomic data. Individuals have a legal right to access their genomic data in some countries and contexts. Moreover, increasing numbers of participatory research projects, direct-to-consumer genetic testing companies, and now major national sequencing initiatives grant individuals access to their genomic sequence data upon request. Drawing on current practice and regulatory analysis, we outline legal, ethical, and practical guidance for genomic sequencing initiatives seeking to offer interested patients and participants access to their raw genomic data.
Great expectations: patient perspectives and anticipated utility of non-diagnostic genomic-sequencing results.

PubMed

Hylind, Robyn; Smith, Maureen; Rasmussen-Torvik, Laura; Aufox, Sharon

2018-01-01

The management of secondary findings is a challenge to health-care providers relaying clinical genomic-sequencing results to patients. Understanding patients' expectations from non-diagnostic genomic sequencing could help guide this management. This study interviewed 14 individuals enrolled in the eMERGE (Electronic Medical Records and Genomics) study. Participants in eMERGE consent to undergo non-diagnostic genomic sequencing, receive results, and have results returned to their physicians. The interviews assessed expectations and intended use of results. The majority of interviewees were male (64%) and 43% identified as non-Caucasian. A unique theme identified was that many participants expressed uncertainty about the type of diseases they expected to receive results on, what results they wanted to learn about, and how they intended to use results. Participant uncertainty highlights the complex nature of deciding to undergo genomic testing and a deficiency in genomic knowledge. These results could help improve how genomic sequencing and secondary findings are discussed with patients.
Correlation between genome reduction and bacterial growth.

PubMed

Kurokawa, Masaomi; Seno, Shigeto; Matsuda, Hideo; Ying, Bei-Wen

2016-12-01

Genome reduction by removing dispensable genomic sequences in bacteria is commonly used in both fundamental and applied studies to determine the minimal genetic requirements for a living system or to develop highly efficient bioreactors. Nevertheless, whether and how the accumulative loss of dispensable genomic sequences disturbs bacterial growth remains unclear. To investigate the relationship between genome reduction and growth, a series of Escherichia coli strains carrying genomes reduced in a stepwise manner were used. Intensive growth analyses revealed that the accumulation of multiple genomic deletions caused decreases in the exponential growth rate and the saturated cell density in a deletion-length-dependent manner as well as gradual changes in the patterns of growth dynamics, regardless of the growth media. Accordingly, a perspective growth model linking genome evolution to genome engineering was proposed. This study provides the first demonstration of a quantitative connection between genomic sequence and bacterial growth, indicating that growth rate is potentially associated with dispensable genomic sequences. © The Author 2016. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
Complete Genome Sequence of Porcine Parvovirus 2 Recovered from Swine Sera.

PubMed

Campos, F S; Kluge, M; Franco, A C; Giongo, A; Valdez, F P; Saddi, T M; Brito, W M E D; Roehe, P M

2016-01-28

A complete genomic sequence of porcine parvovirus 2 (PPV-2) was detected by viral metagenome analysis on swine sera. A phylogenetic analysis of this genome reveals that it is highly similar to previously reported North American PPV-2 genomes. The complete PPV-2 sequence is 5,426 nucleotides long. Copyright © 2016 Campos et al.
Comparative genomic survey, exon-intron annotation and phylogenetic analysis of NAT-homologous sequences in archaea, protists, fungi, viruses, and invertebrates

USDA-ARS's Scientific Manuscript database

We have previously published extensive genomic surveys [1-3], reporting NAT-homologous sequences in hundreds of sequenced bacterial, fungal and vertebrate genomes. We present here the results of our latest search of 2445 genomes, representing 1532 (70 archaeal, 1210 bacterial, 43 protist, 97 fungal,...

Genome assembly reborn: recent computational challenges

PubMed Central

2009-01-01

Research into genome assembly algorithms has experienced a resurgence due to new challenges created by the development of next generation sequencing technologies. Several genome assemblers have been published in recent years specifically targeted at the new sequence data; however, the ever-changing technological landscape leads to the need for continued research. In addition, the low cost of next generation sequencing data has led to an increased use of sequencing in new settings. For example, the new field of metagenomics relies on large-scale sequencing of entire microbial communities instead of isolate genomes, leading to new computational challenges. In this article, we outline the major algorithmic approaches for genome assembly and describe recent developments in this domain. PMID:19482960
Whole genome sequence of Enterobacter ludwigii type strain EN-119T, isolated from clinical specimens.

PubMed

Li, Gengmi; Hu, Zonghai; Zeng, Ping; Zhu, Bing; Wu, Lijuan

2015-04-01

Enterobacter ludwigii strain EN-119(T) is the type strain of E. ludwigii, which belongs to the E. cloacae complex (Ecc). This strain was first reported and named in 2005 and has since been found in many hospitals. In this paper, whole genome sequencing of this strain was carried out. The total genome size of EN-119(T) is 4,952,770 bp with 4,578 coding sequences, 88 tRNAs and 10 rRNAs. The genome sequence of EN-119(T) is the first whole genome sequence of E. ludwigii, which will further our understanding of Ecc. © FEMS 2015. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Strain-specific and pooled genome sequences for populations of Drosophila melanogaster from three continents.

PubMed Central

Bergman, Casey M.; Haddrill, Penelope R.

2015-01-01

To contribute to our general understanding of the evolutionary forces that shape variation in genome sequences in nature, we have sequenced genomes from 50 isofemale lines and six pooled samples from populations of Drosophila melanogaster on three continents. Analysis of raw and reference-mapped reads indicates the quality of these genomic sequence data is very high. Comparison of the predicted and experimentally-determined Wolbachia infection status of these samples suggests that strain or sample swaps are unlikely to have occurred in the generation of these data. Genome sequences are freely available in the European Nucleotide Archive under accession ERP009059. Isofemale lines can be obtained from the Drosophila Species Stock Center. PMID:25717372
Tempo and mode of genomic mutations unveil human evolutionary history.

PubMed

Hara, Yuichiro

2015-01-01

Mutations that have occurred in human genomes provide insight into various aspects of evolutionary history such as speciation events and degrees of natural selection. Comparing genome sequences between human and great apes or among humans is a feasible approach for inferring human evolutionary history. Recent advances in high-throughput or so-called 'next-generation' DNA sequencing technologies have enabled the sequencing of thousands of individual human genomes, as well as a variety of reference genomes of hominids, many of which are publicly available. These sequence data can help to unveil the detailed demographic history of the lineage leading to humans as well as the explosion of modern human population size in the last several thousand years. In addition, high-throughput sequencing illustrates the tempo and mode of de novo mutations, which are producing human genetic variation at this moment. Pedigree-based human genome sequencing has shown that mutation rates vary significantly across the human genome. These studies have also provided an improved timescale of human evolution, because the mutation rate estimated from pedigree analysis is half that estimated from traditional analyses based on molecular phylogeny. Because of the dramatic reduction in sequencing cost, sequencing on-demand samples designed for specific studies is now also becoming popular. To produce data of sufficient quality to meet the requirements of the study, it is necessary to set an explicit sequencing plan that includes the choice of sample collection methods, sequencing platforms, and number of sequence reads.
Genome sequence resources for the wheat stripe rust pathogen (Puccinia striiformis f. sp. tritici) and the barley stripe rust pathogen (Puccinia striiformis f. sp. hordei).

PubMed

Xia, Chongjing; Wang, Meinan; Yin, Chuntao; Cornejo, Omar E; Hulbert, Scot; Chen, Xianming

2018-05-24

Puccinia striiformis f. sp. tritici (Pst) causes devastating stripe (yellow) rust on wheat and P. striiformis f. sp. hordei (Psh) causes stripe rust on barley. Several Pst genomes are available, but no Psh genome is available. More genomes of Pst and Psh are needed to understand the genome evolution and molecular mechanisms of their pathogenicity. We sequenced Pst isolate 93-210 and Psh isolate 93TX-2 using PacBio and Illumina technologies, and RNA sequencing. Their genomic sequences were assembled to contigs with high continuity and showed significant structural differences. The circular mitochondria genomes of both were complete. These genomes provide high-quality resources for deciphering the genomic basis of rapid evolution and host adaptation, identifying genes for avirulence and other important traits, and studying host-pathogen interaction.
KGCAK: a K-mer based database for genome-wide phylogeny and complexity evaluation.

PubMed

Wang, Dapeng; Xu, Jiayue; Yu, Jun

2015-09-16

The K-mer approach, treating genomic sequences as simple characters and counting the relative abundance of each string upon a fixed K, has been extensively applied to phylogeny inference for genome assembly, annotation, and comparison. To meet increasing demands for comparing large genome sequences and to promote the use of the K-mer approach, we develop a versatile database, KGCAK (http://kgcak.big.ac.cn/KGCAK/), containing ~8,000 genomes that include genome sequences of diverse life forms (viruses, prokaryotes, protists, animals, and plants) and cellular organelles of eukaryotic lineages. It builds phylogeny based on genomic elements in an alignment-free fashion and provides in-depth data processing enabling users to compare the complexity of genome sequences based on K-mer distribution. We hope that KGCAK becomes a powerful tool for exploring relationships within and among groups of species in a tree of life based on genomic data.
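The K-mer counting step that this abstract describes can be sketched in a few lines of Python. This is an illustration of the general alignment-free idea only, not KGCAK's implementation: the function names are hypothetical, and the Euclidean distance between frequency vectors is one assumed choice among many possible dissimilarity measures.

```python
from collections import Counter
from math import sqrt

def kmer_frequencies(sequence, k):
    """Relative abundance of every k-mer made only of A, C, G and T."""
    sequence = sequence.upper()
    counts = Counter(
        sequence[i:i + k]
        for i in range(len(sequence) - k + 1)
        if set(sequence[i:i + k]) <= set("ACGT")  # skip windows with N etc.
    )
    total = sum(counts.values())
    return {kmer: n / total for kmer, n in counts.items()} if total else {}

def kmer_distance(freqs_a, freqs_b):
    """Euclidean distance between two k-mer frequency profiles."""
    kmers = set(freqs_a) | set(freqs_b)
    return sqrt(sum((freqs_a.get(m, 0.0) - freqs_b.get(m, 0.0)) ** 2 for m in kmers))

# Two toy sequences compared without any alignment.
a = kmer_frequencies("ACGTACGTACGT", 3)
b = kmer_frequencies("ACGTTTTTACGT", 3)
print(kmer_distance(a, b))
```

A matrix of such pairwise distances could then be passed to a standard tree-building method, although which method KGCAK uses is not stated in the abstract.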
Rosetta stone method for detecting protein function and protein-protein interactions from genome sequences

DOEpatents

Eisenberg, David; Marcotte, Edward M.; Pellegrini, Matteo; Thompson, Michael J.; Yeates, Todd O.

2002-10-15

A computational method, system, and computer program are provided for inferring functional links from genome sequences. One method is based on the observation that some pairs of proteins A' and B' have homologs in another organism fused into a single protein chain AB. A trans-genome comparison of sequences can reveal these AB sequences, which are Rosetta Stone sequences because they decipher an interaction between A' and B'. Another method compares the genomic sequence of two or more organisms to create a phylogenetic profile for each protein indicating its presence or absence across all the genomes. The profile provides information regarding functional links between different families of proteins. In yet another method a combination of the above two methods is used to predict functional links.
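The phylogenetic profile method summarized above can be sketched as follows. This Python example is an assumption-laden illustration rather than the patented procedure: the function names, the input format (each genome given as a set of protein family identifiers) and the mismatch threshold are all hypothetical.

```python
def phylogenetic_profile(protein, genomes):
    """Presence (1) or absence (0) of a protein across genomes, in name order."""
    return tuple(int(protein in genomes[name]) for name in sorted(genomes))

def linked_pairs(proteins, genomes, max_mismatches=0):
    """Pairs of proteins whose profiles differ in at most max_mismatches genomes."""
    profiles = {p: phylogenetic_profile(p, genomes) for p in proteins}
    ordered = sorted(proteins)
    pairs = []
    for i, first in enumerate(ordered):
        for second in ordered[i + 1:]:
            mismatches = sum(x != y for x, y in zip(profiles[first], profiles[second]))
            if mismatches <= max_mismatches:
                pairs.append((first, second))
    return pairs

# Toy data: P1 and P2 are always gained and lost together.
genomes = {
    "genome1": {"P1", "P2", "P3"},
    "genome2": {"P3"},
    "genome3": {"P1", "P2"},
}
print(linked_pairs(["P1", "P2", "P3"], genomes))
```

In this sketch, proteins present in every genome share a trivially identical profile, so a practical analysis would probably filter out such uninformative profiles before pairing.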
Molecular Targeting of Prostate Cancer During Androgen Ablation: Inhibition of CHES1/FOXN3

DTIC Science & Technology

2013-05-01

the DNA sequences (~25^6 reads/sample) were mapped to the human genome reference sequence (hg19...tumor the AR has a genomic abnormality, placing the novel sequence 3’ of the transcriptional start site. However, it is unclear if a genomic alteration...exon/intron organization of the CHES1 gene was determined by BLAST analysis of the human genome using the 1,473-bp CHES1 cDNA sequence
A Deep-Coverage Tomato BAC Library and Prospects Toward Development of an STC Framework for Genome Sequencing

PubMed Central

Budiman, Muhammad A.; Mao, Long; Wood, Todd C.; Wing, Rod A.

2000-01-01

Recently a new strategy using BAC end sequences as sequence-tagged connectors (STCs) was proposed for whole-genome sequencing projects. In this study, we present the construction and detailed characterization of a 15.0 haploid genome equivalent BAC library for the cultivated tomato, Lycopersicon esculentum cv. Heinz 1706. The library contains 129,024 clones with an average insert size of 117.5 kb and a chloroplast content of 1.11%. BAC end sequences from 1490 ends were generated and analyzed as a preliminary evaluation for using this library to develop an STC framework to sequence the tomato genome. A total of 1205 BAC end sequences (80.9%) were obtained, with an average length of 360 high-quality bases, and were searched against the GenBank database. Using a cutoff expectation value of <10^-6, and combining the results from BLASTN, BLASTX, and TBLASTX searches, 24.3% of the BAC end sequences were similar to known sequences, of which almost half (48.7%) share sequence similarities to retrotransposons and 7% to known genes. Some of the transposable element sequences were the first reported in tomato, such as sequences similar to maize transposon Activator (Ac) ORF and tobacco pararetrovirus-like sequences. Interestingly, there were no BAC end sequences similar to the highly repeated TGRI and TGRII elements. However, the majority (70.3%) of STCs did not share significant sequence similarities to any sequences in GenBank at either the DNA or predicted protein levels, indicating that a large portion of the tomato genome is still unknown. Our data demonstrate that this BAC library is suitable for developing an STC database to sequence the tomato genome. The advantages of developing an STC framework for whole-genome sequencing of tomato are discussed. [The BAC end sequences described in this paper have been deposited in the GenBank data library under accession nos. AQ367111–AQ368361.] PMID:10645957
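The library statistics in this abstract can be cross-checked with simple arithmetic. The Python sketch below is illustrative only: the function names are hypothetical, the genome size is back-calculated from the stated 15.0 genome equivalents rather than taken from the abstract, and treating the 1.11% chloroplast content as non-nuclear insert is an assumption about how the figures relate.

```python
def total_insert_mb(n_clones, mean_insert_kb, organelle_fraction=0.0):
    """Total cloned DNA in Mb, optionally discounting organellar clones."""
    return n_clones * mean_insert_kb * (1.0 - organelle_fraction) / 1000.0

def implied_genome_size_mb(n_clones, mean_insert_kb, genome_equivalents,
                           organelle_fraction=0.0):
    """Haploid genome size implied by a stated library coverage."""
    return total_insert_mb(n_clones, mean_insert_kb, organelle_fraction) / genome_equivalents

# Values quoted in the abstract.
clones, insert_kb, coverage, chloroplast = 129_024, 117.5, 15.0, 0.0111

print(total_insert_mb(clones, insert_kb))                      # about 15,160 Mb of insert DNA
print(implied_genome_size_mb(clones, insert_kb, coverage))     # about 1,011 Mb
print(implied_genome_size_mb(clones, insert_kb, coverage, chloroplast))  # about 999 Mb
print(round(100 * 1205 / 1490, 1))                             # success rate of BAC end reads
```

Under these assumptions the stated coverage corresponds to a haploid genome of roughly 1,000 Mb, and the reported 80.9% success rate matches 1205 of 1490 attempted ends.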
Single molecule sequencing of the M13 virus genome without amplification

PubMed Central

Zhao, Luyang; Deng, Liwei; Li, Gailing; Jin, Huan; Cai, Jinsen; Shang, Huan; Li, Yan; Wu, Haomin; Xu, Weibin; Zeng, Lidong; Zhang, Renli; Zhao, Huan; Wu, Ping; Zhou, Zhiliang; Zheng, Jiao; Ezanno, Pierre; Yang, Andrew X.; Yan, Qin; Deem, Michael W.; He, Jiankui

2017-01-01

Next generation sequencing (NGS) has revolutionized life sciences research. However, GC bias and costly, time-intensive library preparation make NGS an ill fit for increasing sequencing demands in the clinic. A new class of third-generation sequencing platforms has arrived to meet this need, capable of directly measuring DNA and RNA sequences at the single-molecule level without amplification. Here, we use the new GenoCare single-molecule sequencing platform from Direct Genomics to sequence the genome of the M13 virus. Our platform detects single-molecule fluorescence by total internal reflection microscopy, with sequencing-by-synthesis chemistry. We sequenced the genome of M13 to a depth of 316x, with 100% coverage. We determined a consensus sequence accuracy of 100%. In contrast to GC bias inherent to NGS results, we demonstrated that our single-molecule sequencing method yields minimal GC bias. PMID:29253901
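The abstract reports two distinct quantities, sequencing depth (316x) and coverage (100%). As a minimal sketch of how they differ, assuming a hypothetical list of per-base read depths rather than any output of the GenoCare platform:

```python
def depth_and_breadth(per_base_depth):
    """Return (mean depth, fraction of positions covered by >= 1 read)."""
    n = len(per_base_depth)
    mean_depth = sum(per_base_depth) / n
    breadth = sum(1 for d in per_base_depth if d > 0) / n
    return mean_depth, breadth


# Toy depths for a 5-base reference; values are illustrative only.
toy_depths = [3, 4, 0, 5, 4]
mean_depth, breadth = depth_and_breadth(toy_depths)
print(mean_depth, breadth)
```

A high mean depth does not by itself imply full coverage: in the toy data one position has no reads even though the mean depth exceeds 3x.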
Reefgenomics.Org - a repository for marine genomics data.

PubMed

Liew, Yi Jin; Aranda, Manuel; Voolstra, Christian R

2016-01-01

Over the last decade, technological advancements have substantially decreased the cost and time of obtaining large amounts of sequencing data. Paired with the exponentially increased computing power, individual labs are now able to sequence genomes or transcriptomes to investigate biological questions of interest. This has led to a significant increase in available sequence data. Although the bulk of data published in articles are stored in public sequence databases, very often, only raw sequencing data are available; miscellaneous data such as assembled transcriptomes, genome annotations etc. are not easily obtainable through the same means. Here, we introduce our website (http://reefgenomics.org) that aims to centralize genomic and transcriptomic data from marine organisms. Besides providing convenient means to download sequences, we provide (where applicable) a genome browser to explore available genomic features, and a BLAST interface to search through the hosted sequences. Through the interface, multiple datasets can be queried simultaneously, allowing for the retrieval of matching sequences from organisms of interest. The minimalistic, no-frills interface reduces visual clutter, making it convenient for end-users to search and explore processed sequence data. DATABASE URL: http://reefgenomics.org. © The Author(s) 2016. Published by Oxford University Press.
Virtual Genome Walking across the 32 Gb Ambystoma mexicanum genome; assembling gene models and intronic sequence.

PubMed

Evans, Teri; Johnson, Andrew D; Loose, Matthew

2018-01-12

Large repeat rich genomes present challenges for assembly using short read technologies. The 32 Gb axolotl genome is estimated to contain ~19 Gb of repetitive DNA making an assembly from short reads alone effectively impossible. Indeed, this model species has been sequenced to 20× coverage but the reads could not be conventionally assembled. Using an alternative strategy, we have assembled subsets of these reads into scaffolds describing over 19,000 gene models. We call this method Virtual Genome Walking as it locally assembles whole genome reads based on a reference transcriptome, identifying exons and iteratively extending them into surrounding genomic sequence. These assemblies are then linked and refined to generate gene models including upstream and downstream genomic, and intronic, sequence. Our assemblies are validated by comparison with previously published axolotl bacterial artificial chromosome (BAC) sequences. Our analyses of axolotl intron length, intron-exon structure, repeat content and synteny provide novel insights into the genic structure of this model species. This resource will enable new experimental approaches in axolotl, such as ChIP-Seq and CRISPR and aid in future whole genome sequencing efforts. The assembled sequences and annotations presented here are freely available for download from https://tinyurl.com/y8gydc6n . The software pipeline is available from https://github.com/LooseLab/iterassemble .
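The idea of iteratively extending a seed into the surrounding sequence can be illustrated with a toy greedy extender. This is a simplified sketch under stated assumptions (exact-match overlaps, rightward extension only, hypothetical function and parameter names); it is not the iterassemble pipeline described in the abstract.

```python
def extend_seed(seed, reads, min_overlap=4, max_rounds=10):
    """Greedily extend `seed` to the right with reads overlapping its end."""
    contig = seed
    for _ in range(max_rounds):
        extended = False
        for read in reads:
            # Try the longest usable overlap first; the read must add
            # at least one new base, so the overlap is < len(read).
            longest = min(len(read) - 1, len(contig))
            for k in range(longest, min_overlap - 1, -1):
                if contig.endswith(read[:k]):
                    contig += read[k:]
                    extended = True
                    break
            if extended:
                break
        if not extended:
            break  # no read overlaps the current contig end
    return contig


# Toy data: a seed "exon" and three reads, one of them unrelated.
toy_reads = ["GCGTACCA", "ACCATTGA", "TTTTTTTT"]
walked = extend_seed("ATGGCGT", toy_reads)
print(walked)
```

The `max_rounds` cap is a deliberate guard: in repeat-rich sequence a greedy extender can loop, which is one reason real pipelines add linking and refinement steps.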
Large-scale analysis of full-length cDNAs from the tomato (Solanum lycopersicum) cultivar Micro-Tom, a reference system for the Solanaceae genomics.

PubMed

Aoki, Koh; Yano, Kentaro; Suzuki, Ayako; Kawamura, Shingo; Sakurai, Nozomu; Suda, Kunihiro; Kurabayashi, Atsushi; Suzuki, Tatsuya; Tsugane, Taneaki; Watanabe, Manabu; Ooga, Kazuhide; Torii, Maiko; Narita, Takanori; Shin-I, Tadasu; Kohara, Yuji; Yamamoto, Naoki; Takahashi, Hideki; Watanabe, Yuichiro; Egusa, Mayumi; Kodama, Motoichiro; Ichinose, Yuki; Kikuchi, Mari; Fukushima, Sumire; Okabe, Akiko; Arie, Tsutomu; Sato, Yuko; Yazawa, Katsumi; Satoh, Shinobu; Omura, Toshikazu; Ezura, Hiroshi; Shibata, Daisuke

2010-03-30

The Solanaceae family includes several economically important vegetable crops. The tomato (Solanum lycopersicum) is regarded as a model plant of the Solanaceae family. Recently, a number of tomato resources have been developed in parallel with the ongoing tomato genome sequencing project. In particular, a miniature cultivar, Micro-Tom, is regarded as a model system in tomato genomics, and a number of genomics resources in the Micro-Tom-background, such as ESTs and mutagenized lines, have been established by an international alliance. To accelerate the progress in tomato genomics, we developed a collection of 13,227 fully sequenced Micro-Tom full-length cDNAs. By checking redundant sequences, coding sequences, and chimeric sequences, a set of 11,502 non-redundant full-length cDNAs (nrFLcDNAs) was generated. Analysis of untranslated regions demonstrated that tomato has longer 5'- and 3'-untranslated regions than most other plants except rice. Classification of functions of proteins predicted from the coding sequences demonstrated that nrFLcDNAs covered a broad range of functions. A comparison of nrFLcDNAs with genes of sixteen plants facilitated the identification of tomato genes that are not found in other plants, most of which did not have known protein domains. Mapping of the nrFLcDNAs onto currently available tomato genome sequences facilitated prediction of exon-intron structure. Introns of tomato genes were longer than those of Arabidopsis and rice. According to a comparison of exon sequences between the nrFLcDNAs and the tomato genome sequences, the frequency of nucleotide mismatch in exons between Micro-Tom and the genome-sequencing cultivar (Heinz 1706) was estimated to be 0.061%. The collection of Micro-Tom nrFLcDNAs generated in this study will serve as a valuable genomic tool for plant biologists to bridge the gap between basic and applied studies. The nrFLcDNA sequences will help annotation of the tomato whole-genome sequence and aid in tomato functional genomics and molecular breeding. Full-length cDNA sequences and their annotations are provided in the database KaFTom http://www.pgb.kazusa.or.jp/kaftom/ via the website of the National Bioresource Project Tomato http://tomato.nbrp.jp.
Mutation Detection with Next-Generation Resequencing through a Mediator Genome

DOE Office of Scientific and Technical Information (OSTI.GOV)

Wurtzel, Omri; Dori-Bachash, Mally; Pietrokovski, Shmuel

2010-12-31

The affordability of next generation sequencing (NGS) is transforming the field of mutation analysis in bacteria. The genetic basis for phenotype alteration can be identified directly by sequencing the entire genome of the mutant and comparing it to the wild-type (WT) genome, thus identifying acquired mutations. A major limitation for this approach is the need for an a priori sequenced reference genome for the WT organism, as the short reads of most current NGS approaches usually prohibit de novo genome assembly. To overcome this limitation we propose a general framework that utilizes the genomes of related organisms as mediators for comparing WT and mutant bacteria. Under this framework, both mutant and WT genomes are sequenced with NGS, and the short sequencing reads are mapped to the mediator genome. Variations between the mutant and the mediator that recur in the WT are ignored, thus pinpointing the differences between the mutant and the WT. To validate this approach we sequenced the genome of Bdellovibrio bacteriovorus 109J, an obligatory bacterial predator, and its prey-independent mutant, and compared both to the mediator species Bdellovibrio bacteriovorus HD100. Although the mutant and the mediator sequences differed in more than 28,000 nucleotide positions, our approach enabled pinpointing the single causative mutation. Experimental validation in 53 additional mutants further established the implicated gene. Our approach extends the applicability of NGS-based mutant analyses beyond the domain of available reference genomes.
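The filtering step described here, ignoring mutant-versus-mediator variants that recur in the WT, amounts to a set difference. A minimal sketch, assuming variant calls are represented as hypothetical (position, reference base, alternate base) tuples against the mediator genome; the positions below are invented for illustration:

```python
def candidate_mutations(mutant_vs_mediator, wt_vs_mediator):
    """Variants seen in the mutant but not in the WT, both vs. the mediator."""
    return sorted(set(mutant_vs_mediator) - set(wt_vs_mediator))


# Toy calls: two strain-level differences shared by WT and mutant,
# plus one variant unique to the mutant.
wt_calls = [(1021, "A", "G"), (5530, "C", "T")]
mutant_calls = [(1021, "A", "G"), (5530, "C", "T"), (7789, "G", "A")]

candidates = candidate_mutations(mutant_calls, wt_calls)
print(candidates)
```

Shared differences reflect divergence between the strain and the mediator, so only the remaining call is a candidate for the acquired mutation.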
Protruding vulva mutants identify novel loci and Wnt signaling factors that function during Caenorhabditis elegans vulva development.

PubMed

Eisenmann, D M; Kim, S K

2000-11-01

The Caenorhabditis elegans vulva develops from the progeny of three vulval precursor cells (VPCs) induced to divide and differentiate by a signal from the somatic gonad. Evolutionarily conserved Ras and Notch extracellular signaling pathways are known to function during this process. To identify novel loci acting in vulval development, we carried out a genetic screen for mutants having a protruding-vulva (Pvl) mutant phenotype. Here we report the initial genetic characterization of several novel loci: bar-1, pvl-4, pvl-5, and pvl-6. In addition, on the basis of their Pvl phenotypes, we show that the previously identified genes lin-26, mom-3/mig-14, egl-18, and sem-4 also function during vulval development. Our characterization indicates that (1) pvl-4 and pvl-5 are required for generation/survival of the VPCs; (2) bar-1, mom-3/mig-14, egl-18, and sem-4 play a role in VPC fate specification; (3) lin-26 is required for proper VPC fate execution; and (4) pvl-6 acts during vulval morphogenesis. In addition, two of these genes, bar-1 and mom-3/mig-14, are known to function in processes regulated by Wnt signaling, suggesting that a Wnt signaling pathway is acting during vulval development.
Numerical Simulation and Experimental Validation of MIG Welding of T-Joints of Thin Aluminum Plates for Top Class Vehicles

NASA Astrophysics Data System (ADS)

Bonazzi, Enrico; Colombini, Elena; Panari, Davide; Vergnano, Alberto; Leali, Francesco; Veronesi, Paolo

2017-01-01

The integration of experiments with numerical simulations can efficiently support a quick evaluation of the welded joint. In this work, the MIG welding operation on aluminum T-joint thin plate has been studied by the integration of both simulation and experiments. The aim of the paper is to enlarge the global database, to promote the use of thin aluminum sheets in automotive body industries and to provide new data. Since the welding of aluminum thin plates is difficult to control due to the high speed of the heat source and high heat flows during heating and cooling, a simulation model could be considered an effective design tool to predict the real phenomena. This integrated approach enables new evaluation possibilities on MIG-welded thin aluminum T-joints, such as the correspondence between the extension of the microstructural zones and the simulation parameters, material hardness, transient 3D temperature distribution on the surface and inside the material, stresses, strains, and deformations. The results of the mechanical simulations are comparable with the experimental measurements along the welding path, especially considering the variability of the process. The results could well predict the welding-induced distortion, which together with local heating during welding must be anticipated and subsequently minimized and counterbalanced.
Complete Genomic Sequence of “Thermofilum adornatus” Strain 1910bT, a Hyperthermophilic Anaerobic Organotrophic Crenarchaeon

PubMed Central

Dominova, I. N.; Kublanov, I. V.; Podosokorskaya, O. A.; Derbikova, K. S.; Patrushev, M. V.

2013-01-01

The complete genomic sequence of a novel hyperthermophilic crenarchaeon, strain 1910bT, was determined. The genome comprises a 1,750,259-bp circular chromosome containing single copies of 3 rRNA genes, 43 tRNA genes, and 1,896 protein-coding sequences. In silico genome-genome hybridization suggests the proposal of a novel species, “Thermofilum adornatus” strain 1910bT. PMID:24029764
Defining Genome Project Standards in a New Era of Sequencing

ScienceCinema

Chain, Patrick

2018-01-16

Patrick Chain of the DOE Joint Genome Institute gives a talk on behalf of the International Genome Sequencing Standards Consortium on the need for intermediate genome classifications between "draft" and "finished".

Translational genomics for plant breeding with the genome sequence explosion.

PubMed

Kang, Yang Jae; Lee, Taeyoung; Lee, Jayern; Shim, Sangrea; Jeong, Haneul; Satyawan, Dani; Kim, Moon Young; Lee, Suk-Ha

2016-04-01

The use of next-generation sequencers and advanced genotyping technologies has propelled the field of plant genomics in model crops and plants and enhanced the discovery of hidden bridges between genotypes and phenotypes. The newly generated reference sequences of unstudied minor plants can be annotated by the knowledge of model plants via translational genomics approaches. Here, we reviewed the strategies of translational genomics and suggested perspectives on the current databases of genomic resources and the database structures of translated information on the new genome. As a draft picture of phenotypic annotation, translational genomics on newly sequenced plants will provide valuable assistance for breeders and researchers who are interested in genetic studies. © 2015 The Authors. Plant Biotechnology Journal published by Society for Experimental Biology and The Association of Applied Biologists and John Wiley & Sons Ltd.
The Complete Chloroplast Genome Sequences of Five Epimedium Species: Lights into Phylogenetic and Taxonomic Analyses

PubMed Central

Zhang, Yanjun; Du, Liuwen; Liu, Ao; Chen, Jianjun; Wu, Li; Hu, Weiming; Zhang, Wei; Kim, Kyunghee; Lee, Sang-Choon; Yang, Tae-Jin; Wang, Ying

2016-01-01

Epimedium L. is a phylogenetically and economically important genus in the family Berberidaceae. We here sequenced the complete chloroplast (cp) genomes of four Epimedium species using Illumina sequencing technology via a combination of de novo and reference-guided assembly. Combined with the previously reported cp genome sequence of E. koreanum, this is also the first comprehensive cp genome analysis of Epimedium. The five Epimedium cp genomes exhibited a typical quadripartite and circular structure that was rather conserved in genomic structure and the synteny of gene order. However, these cp genomes presented obvious variations at the boundaries of the four regions because of the expansion and contraction of the inverted repeat (IR) region and the single-copy (SC) boundary regions. The trnQ-UUG duplication occurred in the five Epimedium cp genomes, which was not found in the other basal eudicotyledons. The rapidly evolving cp genome regions were detected among the five cp genomes, and differences in simple sequence repeats (SSRs) and repeat sequences were identified. Phylogenetic relationships among the five Epimedium species based on their cp genomes were on the whole in accordance with the updated system of the genus, but indicated that the evolutionary relationships and the divisions of the genus need further investigation with more evidence. The availability of these cp genomes provides valuable genetic information for accurately identifying species and for the taxonomy, phylogenetic resolution and evolution of Epimedium, and will assist in the exploration and utilization of Epimedium plants. PMID:27014326
Genome sequencing of ovine isolates of Mycobacterium avium subspecies paratuberculosis offers insights into host association

PubMed Central

2012-01-01

Background: The genome of Mycobacterium avium subspecies paratuberculosis (MAP) is remarkably homogeneous among the genomes of bovine, human and wildlife isolates. However, previous work in our laboratories with the bovine K-10 strain has revealed substantial differences compared to sheep isolates. To systematically characterize all genomic differences that may be associated with the specific hosts, we sequenced the genomes of three U.S. sheep isolates and also obtained an optical map. Results: Our analysis of one of the isolates, MAP S397, revealed a genome 4.8 Mb in size with 4,700 open reading frames (ORFs). Comparative analysis of the MAP S397 isolate showed it acquired approximately 10 large sequence regions that are shared with the human M. avium subsp. hominissuis strain 104 and lost 2 large regions that are present in the bovine strain. In addition, optical mapping defined the presence of 7 large inversions between the bovine and ovine genomes (~ 2.36 Mb). Whole-genome sequencing of 2 additional sheep strains of MAP (JTC1074 and JTC7565) further confirmed genomic homogeneity of the sheep isolates despite the presence of polymorphisms on the nucleotide level. Conclusions: Comparative sequence analysis employed here provided a better understanding of the host association, evolution of members of the M. avium complex and could help in deciphering the phenotypic differences observed among sheep and cattle strains of MAP. A similar approach based on whole-genome sequencing combined with optical mapping could be employed to examine closely related pathogens. We propose an evolutionary scenario for M. avium complex strains based on these genome sequences. PMID:22409516
Decoding the massive genome of loblolly pine using haploid DNA and novel assembly strategies

PubMed Central

2014-01-01

Background: The size and complexity of conifer genomes has, until now, prevented full genome sequencing and assembly. The large research community and economic importance of loblolly pine, Pinus taeda L., made it an early candidate for reference sequence determination. Results: We develop a novel strategy to sequence the genome of loblolly pine that combines unique aspects of pine reproductive biology and genome assembly methodology. We use a whole genome shotgun approach relying primarily on next generation sequence generated from a single haploid seed megagametophyte from a loblolly pine tree, 20-1010, that has been used in industrial forest tree breeding. The resulting sequence and assembly was used to generate a draft genome spanning 23.2 Gbp and containing 20.1 Gbp with an N50 scaffold size of 66.9 kbp, making it a significant improvement over available conifer genomes. The long scaffold lengths allow the annotation of 50,172 gene models with intron lengths averaging over 2.7 kbp and sometimes exceeding 100 kbp in length. Analysis of orthologous gene sets identifies gene families that may be unique to conifers. We further characterize and expand the existing repeat library based on the de novo analysis of the repetitive content, estimated to encompass 82% of the genome. Conclusions: In addition to its value as a resource for researchers and breeders, the loblolly pine genome sequence and assembly reported here demonstrates a novel approach to sequencing the large and complex genomes of this important group of plants that can now be widely applied. PMID:24647006
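The N50 scaffold size quoted above is a standard assembly statistic: the length L such that scaffolds of length L or longer contain at least half of the assembled bases. A minimal sketch of the usual definition, with toy scaffold lengths that are illustrative only and unrelated to the pine assembly:

```python
def n50(lengths):
    """Length at which the cumulative sum of sorted lengths reaches 50%."""
    total = sum(lengths)
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        if running * 2 >= total:
            return length
    raise ValueError("n50 requires at least one scaffold")


# Toy scaffold lengths in kbp; total is 290, so half is 145.
toy_scaffolds = [80, 70, 50, 40, 30, 20]
print(n50(toy_scaffolds))
```

In the toy set the two longest scaffolds (80 + 70 = 150) are the first to pass the halfway mark, so the N50 is 70.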
Complete genome sequence of Streptosporangium roseum type strain (NI 9100T)

DOE Office of Scientific and Technical Information (OSTI.GOV)

Nolan, Matt; Sikorski, Johannes; Jando, Marlen

2010-01-01

Streptosporangium roseum Couch 1955 is the type strain of the species which is the type species of the genus Streptosporangium. The pinkish coiled Streptomyces-like organism with a spore case was isolated from vegetable garden soil in 1955. Here we describe the features of this organism, together with the complete genome sequence and annotation. This is the first completed genome sequence of a member of the family Streptosporangiaceae, and the second largest microbial genome sequence ever deciphered. The 10,369,518 bp long genome with its 9421 protein-coding and 80 RNA genes is a part of the Genomic Encyclopedia of Bacteria and Archaea project.
Draft genome sequence of four coccolithoviruses: Emiliania huxleyi virus EhV-88, EhV-201, EhV-207, and EhV-208.

PubMed

Nissimov, Jozef I; Worthy, Charlotte A; Rooks, Paul; Napier, Johnathan A; Kimmance, Susan A; Henn, Matthew R; Ogata, Hiroyuki; Allen, Michael J

2012-03-01

The Coccolithoviridae are a group of viruses which infect the marine coccolithophorid microalga Emiliania huxleyi. The Emiliania huxleyi viruses (known as EhVs) described herein have 160- to 180-nm diameter icosahedral structures, have genomes of approximately 400 kbp, and consist of more than 450 predicted coding sequences (CDSs). Here, we describe the genomic features of four newly sequenced coccolithoviruses (EhV-88, EhV-201, EhV-207, and EhV-208) together with their draft genome sequences and their annotations, highlighting the homology and heterogeneity of these genomes to the EhV-86 model reference genome.
“A draft Musa balbisiana genome sequence for molecular genetics in polyploid, inter- and intra-specific Musa hybrids”

PubMed Central

2013-01-01

Background: Modern banana cultivars are primarily interspecific triploid hybrids of two species, Musa acuminata and Musa balbisiana, which respectively contribute the A- and B-genomes. The M. balbisiana genome has been associated with improved vigour and tolerance to biotic and abiotic stresses and is thus a target for Musa breeding programs. However, while a reference M. acuminata genome has recently been released (Nature 488:213–217, 2012), little sequence data is available for the corresponding B-genome. To address these problems we carried out Next Generation gDNA sequencing of the wild diploid M. balbisiana variety ‘Pisang Klutuk Wulung’ (PKW). Our strategy was to align PKW gDNA reads against the published A-genome and to extract the mapped consensus sequences for subsequent rounds of evaluation and gene annotation. Results: The resulting B-genome is 79% the size of the A-genome, and contains 36,638 predicted functional gene sequences which is nearly identical to the 36,542 of the A-genome. There is substantial sequence divergence from the A-genome at a frequency of 1 homozygous SNP per 23.1 bp, and a high degree of heterozygosity corresponding to one heterozygous SNP per 55.9 bp. Using expressed small RNA data, a similar number of microRNA sequences were predicted in both A- and B-genomes, but additional novel miRNAs were detected, including some that are unique to each genome. The usefulness of this B-genome sequence was evaluated by mapping RNA-seq data from a set of triploid AAA and AAB hybrids simultaneously to both genomes. Results for the plantains demonstrated the expected 2:1 distribution of reads across the A- and B-genomes, but for the AAA genomes, results show they contain regions of significant homology to the B-genome supporting proposals that there has been a history of interspecific recombination between homeologous A and B chromosomes in Musa hybrids. Conclusions: We have generated and annotated a draft reference Musa B-genome and demonstrate that this can be used for molecular genetic mapping of gene transcripts and small RNA expression data from several allopolyploid banana cultivars. This draft therefore represents a valuable resource to support the study of metabolism in inter- and intraspecific triploid Musa hybrids and to help direct breeding programs. PMID:24094114
Genome Sequence of Candidatus Nitrososphaera evergladensis from Group I.1b Enriched from Everglades Soil Reveals Novel Genomic Features of the Ammonia-Oxidizing Archaea

PubMed Central

Zhalnina, Kateryna V.; Dias, Raquel; Leonard, Michael T.; Dorr de Quadros, Patricia; Camargo, Flavio A. O.; Drew, Jennifer C.; Farmerie, William G.; Daroub, Samira H.; Triplett, Eric W.

2014-01-01

The activity of ammonia-oxidizing archaea (AOA) leads to the loss of nitrogen from soil, pollution of water sources and elevated emissions of greenhouse gas. To date, eight AOA genomes are available in the public databases, seven are from the group I.1a of the Thaumarchaeota and only one is from the group I.1b, isolated from hot springs. Many soils are dominated by AOA from the group I.1b, but the genomes of soil representatives of this group have not been sequenced and functionally characterized. The lack of knowledge of metabolic pathways of soil AOA presents a critical gap in understanding their role in biogeochemical cycles. Here, we describe the first complete genome of soil archaeon Candidatus Nitrososphaera evergladensis, which has been reconstructed from metagenomic sequencing of a highly enriched culture obtained from an agricultural soil. The AOA enrichment was sequenced with the high throughput next generation sequencing platforms from Pacific Biosciences and Ion Torrent. The de novo assembly of sequences resulted in one 2.95 Mb contig. Annotation of the reconstructed genome revealed many similarities of the basic metabolism with the rest of sequenced AOA. Ca. N. evergladensis belongs to the group I.1b and shares only 40% of whole-genome homology with the closest sequenced relative Ca. N. gargensis. Detailed analysis of the genome revealed coding sequences that were completely absent from the group I.1a. These unique sequences code for proteins involved in control of DNA integrity, transporters, two-component systems and versatile CRISPR defense system. Notably, genomes from the group I.1b have more gene duplications compared to the genomes from the group I.1a. We suggest that the presence of these unique genes and gene duplications may be associated with the environmental versatility of this group. PMID:24999826
Social and behavioral research in genomic sequencing: approaches from the Clinical Sequencing Exploratory Research Consortium Outcomes and Measures Working Group.

PubMed

Gray, Stacy W; Martins, Yolanda; Feuerman, Lindsay Z; Bernhardt, Barbara A; Biesecker, Barbara B; Christensen, Kurt D; Joffe, Steven; Rini, Christine; Veenstra, David; McGuire, Amy L

2014-10-01

The routine use of genomic sequencing in clinical medicine has the potential to dramatically alter patient care and medical outcomes. To fully understand the psychosocial and behavioral impact of sequencing integration into clinical practice, it is imperative that we identify the factors that influence sequencing-related decision making and patient outcomes. In an effort to develop a collaborative and conceptually grounded approach to studying sequencing adoption, members of the National Human Genome Research Institute's Clinical Sequencing Exploratory Research Consortium formed the Outcomes and Measures Working Group. Here we highlight the priority areas of investigation and psychosocial and behavioral outcomes identified by the Working Group. We also review some of the anticipated challenges to measurement in social and behavioral research related to genomic sequencing; opportunities for instrument development; and the importance of qualitative, quantitative, and mixed-method approaches. This work represents the early, shared efforts of multiple research teams as we strive to understand individuals' experiences with genomic sequencing. The resulting body of knowledge will guide recommendations for the optimal use of sequencing in clinical practice.
Establishing gene models from the Pinus pinaster genome using gene capture and BAC sequencing.

PubMed

Seoane-Zonjic, Pedro; Cañas, Rafael A; Bautista, Rocío; Gómez-Maldonado, Josefa; Arrillaga, Isabel; Fernández-Pozo, Noé; Claros, M Gonzalo; Cánovas, Francisco M; Ávila, Concepción

2016-02-27

In the era of DNA throughput sequencing, assembling and understanding gymnosperm mega-genomes remains a challenge. Although drafts of three conifer genomes have recently been published, this number is too low to understand the full complexity of conifer genomes. Using techniques focused on specific genes, gene models can be established that can aid in the assembly of gene-rich regions, and this information can be used to compare genomes and understand functional evolution. In this study, gene capture technology combined with BAC isolation and sequencing was used as an experimental approach to establish de novo gene structures without a reference genome. Probes were designed for 866 maritime pine transcripts to sequence genes captured from genomic DNA. The gene models were constructed using GeneAssembler, a new bioinformatic pipeline, which reconstructed over 82% of the gene structures, and a high proportion (85%) of the captured gene models contained sequences from the promoter regulatory region. In a parallel experiment, the P. pinaster BAC library was screened to isolate clones containing genes whose cDNA sequence were already available. BAC clones containing the asparagine synthetase, sucrose synthase and xyloglucan endotransglycosylase gene sequences were isolated and used in this study. The gene models derived from the gene capture approach were compared with the genomic sequences derived from the BAC clones. This combined approach is a particularly efficient way to capture the genomic structures of gene families with a small number of members. The experimental approach used in this study is a valuable combined technique to study genomic gene structures in species for which a reference genome is unavailable. It can be used to establish exon/intron boundaries in unknown gene structures, to reconstruct incomplete genes and to obtain promoter sequences that can be used for transcriptional studies. 
A bioinformatics algorithm (GeneAssembler) is also provided as a Ruby gem for this class of analyses.
Complete Chloroplast Genome Sequences of Mongolia Medicine Artemisia frigida and Phylogenetic Relationships with Other Plants

PubMed Central

Liu, Yue; Huo, Naxin; Dong, Lingli; Wang, Yi; Zhang, Shuixian; Young, Hugh A.; Feng, Xiaoxiao; Gu, Yong Qiang

2013-01-01

Background Artemisia frigida Willd. is an important Mongolian traditional medicinal plant with pharmacological functions of stanching bleeding and reducing swelling (detumescence). However, there is little sequence and genomic information available for Artemisia frigida, which makes phylogenetic identification, evolutionary studies, and genetic improvement of its value very difficult. We report the complete chloroplast genome sequence of Artemisia frigida based on 454 pyrosequencing. Methodology/Principal Findings The complete chloroplast genome of Artemisia frigida is 151,076 bp including a large single copy (LSC) region of 82,740 bp, a small single copy (SSC) region of 18,394 bp and a pair of inverted repeats (IRs) of 24,971 bp. The genome contains 114 unique genes and 18 duplicated genes. The chloroplast genome of Artemisia frigida contains a small 3.4 kb inversion within a large 23 kb inversion in the LSC region, a unique feature in Asteraceae. The gene order in the SSC region of Artemisia frigida is inverted compared with the 6 other Asteraceae species with sequenced chloroplast genomes. This inversion is likely caused by an intramolecular recombination event that occurred only in Artemisia frigida. The existence of rich SSR loci in the Artemisia frigida chloroplast genome provides a rare opportunity to study population genetics of this Mongolian medicinal plant. Phylogenetic analysis demonstrates a sister relationship between Artemisia frigida and four other species in Asteraceae, including Ageratina adenophora, Helianthus annuus, Guizotia abyssinica and Lactuca sativa, based on 61 protein-coding sequences. Furthermore, Artemisia frigida was placed in the tribe Anthemideae in the subfamily Asteroideae (Asteraceae) based on ndhF and trnL-F sequence comparisons. Conclusion The chloroplast genome sequence of Artemisia frigida was assembled and analyzed in this study, representing the first plastid genome sequenced in the Anthemideae tribe. This complete chloroplast genome sequence will be useful for molecular ecology and molecular phylogeny studies within Artemisia species and also within the Asteraceae family. PMID:23460871
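The region lengths reported in the abstract above can be cross-checked with simple arithmetic. In the usual quadripartite chloroplast layout, the total length is LSC + SSC + 2 × IR, because the inverted repeat occurs twice. The following is a minimal sketch of that check. The function name `quadripartite_length` is hypothetical and chosen for illustration only; the numbers are taken directly from the abstract.

```python
# Region lengths in bp, copied from the Artemisia frigida abstract above.
LSC = 82_740             # large single copy region
SSC = 18_394             # small single copy region
IR = 24_971              # one copy of the inverted repeat
REPORTED_TOTAL = 151_076  # total genome length stated in the abstract


def quadripartite_length(lsc: int, ssc: int, ir: int) -> int:
    """Total length of a quadripartite plastome: LSC + SSC + two IR copies.

    Hypothetical helper for illustration; not part of the cited study.
    """
    return lsc + ssc + 2 * ir


total = quadripartite_length(LSC, SSC, IR)
print(f"computed total: {total} bp, matches reported: {total == REPORTED_TOTAL}")
```

The computed sum agrees with the reported 151,076 bp, which suggests the "24,971 bp" figure refers to each inverted repeat copy rather than to the pair combined.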
Exploring Pandora's Box: Potential and Pitfalls of Low Coverage Genome Surveys for Evolutionary Biology

PubMed Central

Leese, Florian; Mayer, Christoph; Agrawal, Shobhit; Dambach, Johannes; Dietz, Lars; Doemel, Jana S.; Goodall-Copestake, William P.; Held, Christoph; Jackson, Jennifer A.; Lampert, Kathrin P.; Linse, Katrin; Macher, Jan N.; Nolzen, Jennifer; Raupach, Michael J.; Rivera, Nicole T.; Schubart, Christoph D.; Striewski, Sebastian; Tollrian, Ralph; Sands, Chester J.

2012-01-01

High throughput sequencing technologies are revolutionizing genetic research. With this “rise of the machines”, genomic sequences can be obtained even for unknown genomes within a short time and for reasonable costs. This has enabled evolutionary biologists studying genetically unexplored species to identify molecular markers or genomic regions of interest (e.g. micro- and minisatellites, mitochondrial and nuclear genes) by sequencing only a fraction of the genome. However, when using such datasets from non-model species, it is possible that DNA from non-target contaminant species such as bacteria, viruses, fungi, or other eukaryotic organisms may complicate the interpretation of the results. In this study we analysed 14 genomic pyrosequencing libraries of aquatic non-model taxa from four major evolutionary lineages. We quantified the amount of suitable micro- and minisatellites, mitochondrial genomes, known nuclear genes and transposable elements and searched for contamination from various sources using bioinformatic approaches. Our results show that in all sequence libraries with estimated coverage of about 0.02–25%, many appropriate micro- and minisatellites, mitochondrial gene sequences and nuclear genes from different KEGG (Kyoto Encyclopedia of Genes and Genomes) pathways could be identified and characterized. These can serve as markers for phylogenetic and population genetic analyses. A central finding of our study is that several genomic libraries suffered from different biases owing to non-target DNA or mobile elements. In particular, viruses, bacteria or eukaryote endosymbionts contributed significantly (up to 10%) to some of the libraries analysed. If not identified as such, genetic markers developed from high-throughput sequencing data for non-model organisms may bias evolutionary studies or fail completely in experimental tests. 
In conclusion, our study demonstrates the enormous potential of low-coverage genome survey sequences and suggests bioinformatic analysis workflows. The results also advise a more sophisticated filtering for problematic sequences and non-target genome sequences prior to developing markers. PMID:23185309
Genome Sequence of Stachybotrys chartarum Strain 51-11

PubMed Central

Kim, Jean; Levy, Josh

2015-01-01

The Stachybotrys chartarum strain 51-11 genome was sequenced by shotgun sequencing utilizing Illumina HiSeq 2000 and PacBio technologies. Since S. chartarum has been implicated as having health impacts within water-damaged buildings, any information extracted from the genomic sequence data relating to toxins or the metabolism of the fungus might be useful. PMID:26430036
Universal Influenza B Virus Genomic Amplification Facilitates Sequencing, Diagnostics, and Reverse Genetics

PubMed Central

Zhou, Bin; Lin, Xudong; Wang, Wei; Halpin, Rebecca A.; Bera, Jayati; Stockwell, Timothy B.; Barr, Ian G.

2014-01-01

Although human influenza B virus (IBV) is a significant human pathogen, its great genetic diversity has limited our ability to universally amplify the entire genome for subsequent sequencing or vaccine production. The generation of sequence data via next-generation approaches and the rapid cloning of viral genes are critical for basic research, diagnostics, antiviral drugs, and vaccines to combat IBV. To overcome the difficulty of amplifying the diverse and ever-changing IBV genome, we developed and optimized techniques that amplify the complete segmented negative-sense RNA genome from any IBV strain in a single tube/well (IBV genomic amplification [IBV-GA]). Amplicons for >1,000 diverse IBV genomes from different sample types (e.g., clinical specimens) were generated and sequenced using this robust technology. These approaches are sensitive, robust, and sequence independent (i.e., universally amplify past, present, and future IBVs), which facilitates next-generation sequencing and advanced genomic diagnostics. Importantly, special terminal sequences engineered into the optimized IBV-GA2 products also enable ligation-free cloning to rapidly generate reverse-genetics plasmids, which can be used for the rescue of recombinant viruses and/or the creation of vaccine seed stock. PMID:24501036
Regions flanking ori sequences affect the replication efficiency of the mitochondrial genome of ori+ petite mutants from yeast.

PubMed

Rayko, E; Goursot, R; Cherif-Zahar, B; Melis, R; Bernardi, G

1988-03-31

The mitochondrial genomes of progenies from 26 crosses between 17 cytoplasmic, spontaneous, suppressive, ori+ petite mutants of Saccharomyces cerevisiae have been studied by electrophoresis of restriction fragments. Only parental genomes (or occasionally, genomes derived from them by secondary excisions) were found in the progenies of the almost 500 diploids investigated; no evidence for illegitimate, site-specific mitochondrial recombination was detected. One of the parental genomes was always found to predominate over the other one, although to different extents in different crosses. This predominance appears to be due to a higher replication efficiency, which is correlated with a greater density of ori sequences on the mitochondrial genome (and with a shorter repeat unit size of the latter). Exceptions to the 'repeat-unit-size rule' were found, however, even when the parental mitochondrial genomes carried the same ori sequence. This indicates that noncoding, intergenic sequences outside ori sequences also play a role in modulating replication efficiency. Since in different petites such sequences differ in primary structure, size, and position relative to ori sequences, this modulation is likely to take place through an indirect effect on DNA and nucleoid structure.
[Complete genome sequencing of polymalic acid-producing strain Aureobasidium pullulans CCTCC M2012223].

PubMed

Wang, Yongkang; Song, Xiaodan; Li, Xiaorong; Yang, Sang-tian; Zou, Xiang

2017-01-04

This study aimed to explore the genome sequence of Aureobasidium pullulans CCTCC M2012223, analyze the key genes related to the biosynthesis of important metabolites, and provide a genetic background for metabolic engineering. The complete genome of A. pullulans CCTCC M2012223 was sequenced on the Illumina HiSeq high-throughput sequencing platform. Fragment assembly, gene prediction, functional annotation, and GO/COG clustering were then performed, and the results were compared with those of five other A. pullulans varieties. The complete genome sequence of A. pullulans CCTCC M2012223 was 30,756,831 bp with an average GC content of 47.49%, and 9,452 genes were predicted. Genome-wide analysis showed that A. pullulans CCTCC M2012223 had the largest genome assembly size. Protein sequences involved in the pullulan and polymalic acid pathways were highly conserved in all six A. pullulans varieties. Although A. pullulans CCTCC M2012223 and A. pullulans var. melanogenum are closely related, some point mutations and insertions occurred in protein sequences involved in melanin biosynthesis. The genome of A. pullulans CCTCC M2012223 was annotated, and genes involved in the melanin, pullulan, and polymalic acid pathways were compared, which provides a theoretical basis for genetic modification of metabolic pathways in A. pullulans.
Construction of Pseudomolecule Sequences of the aus Rice Cultivar Kasalath for Comparative Genomics of Asian Cultivated Rice

PubMed Central

Sakai, Hiroaki; Kanamori, Hiroyuki; Arai-Kichise, Yuko; Shibata-Hatta, Mari; Ebana, Kaworu; Oono, Youko; Kurita, Kanako; Fujisawa, Hiroko; Katagiri, Satoshi; Mukai, Yoshiyuki; Hamada, Masao; Itoh, Takeshi; Matsumoto, Takashi; Katayose, Yuichi; Wakasa, Kyo; Yano, Masahiro; Wu, Jianzhong

2014-01-01

Having a deep genetic structure evolved during its domestication and adaptation, the Asian cultivated rice (Oryza sativa) displays considerable physiological and morphological variations. Here, we describe deep whole-genome sequencing of the aus rice cultivar Kasalath by using the advanced next-generation sequencing (NGS) technologies to gain a better understanding of the sequence and structural changes among highly differentiated cultivars. The de novo assembled Kasalath sequences represented 91.1% (330.55 Mb) of the genome and contained 35 139 expressed loci annotated by RNA-Seq analysis. We detected 2 787 250 single-nucleotide polymorphisms (SNPs) and 7393 large insertion/deletion (indel) sites (>100 bp) between Kasalath and Nipponbare, and 2 216 251 SNPs and 3780 large indels between Kasalath and 93-11. Extensive comparison of the gene contents among these cultivars revealed similar rates of gene gain and loss. We detected at least 7.39 Mb of inserted sequences and 40.75 Mb of unmapped sequences in the Kasalath genome in comparison with the Nipponbare reference genome. Mapping of the publicly available NGS short reads from 50 rice accessions proved the necessity and the value of using the Kasalath whole-genome sequence as an additional reference to capture the sequence polymorphisms that cannot be discovered by using the Nipponbare sequence alone. PMID:24578372
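The coverage and polymorphism counts in the Kasalath abstract above allow a rough back-of-the-envelope calculation. The sketch below derives the genome size implied by "91.1% (330.55 Mb)" and an approximate SNP density. Using the assembled length as the denominator for SNP density is an assumption made here for illustration; the study itself may have normalized by aligned length, so the densities are only indicative.

```python
# Figures copied from the Kasalath abstract above.
ASSEMBLED_MB = 330.55          # de novo assembled sequence, in Mb
ASSEMBLED_FRACTION = 0.911     # stated share of the genome represented
SNPS_VS_NIPPONBARE = 2_787_250
SNPS_VS_9311 = 2_216_251

# Genome size implied by the stated coverage fraction.
implied_genome_mb = ASSEMBLED_MB / ASSEMBLED_FRACTION


def snps_per_kb(snp_count: int, length_mb: float) -> float:
    """SNPs per kilobase over a sequence of length_mb megabases.

    Assumption: the assembled length is used as the denominator.
    """
    return snp_count / (length_mb * 1_000)


density_nipponbare = snps_per_kb(SNPS_VS_NIPPONBARE, ASSEMBLED_MB)
density_9311 = snps_per_kb(SNPS_VS_9311, ASSEMBLED_MB)

print(f"implied genome size: {implied_genome_mb:.1f} Mb")
print(f"SNPs/kb vs Nipponbare: {density_nipponbare:.2f}")
print(f"SNPs/kb vs 93-11: {density_9311:.2f}")
```

Under these assumptions the implied genome size is a little under 363 Mb, and Kasalath shows a higher SNP density against Nipponbare than against 93-11.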
Year-round effects of a four-week randomized controlled trial using different types of feedback on employees' physical activity.

PubMed

Van Hoye, Karen; Wijtzes, Anne I; Lefevre, Johan; De Baere, Stijn; Boen, Filip

2018-04-12

This follow-up study investigated the year-round effects of a four-week randomized controlled trial using different types of feedback on employees' physical activity, including a need-supportive coach intervention. Participants (n = 227) were randomly assigned to a Minimal Intervention Group (MIG; no feedback), a Pedometer Group (PG; feedback on daily steps only), a Display Group (DG; feedback on daily steps, on daily moderate-to-vigorous physical activity [MVPA] and on total energy expenditure [EE]), or a Coaching Group (CoachG; same as DG with need supportive coaching). Daily physical activity level (PAL; Metabolic Equivalent of Task [MET]), number of daily steps, daily minutes of moderate to vigorous physical activity (MVPA), active daily EE (EE > 3 METs) and total daily EE were measured at five time points: before the start of the 4-week intervention, one week after the intervention, and 3, 6, and 12 months after the intervention. For minutes of MVPA, MIG showed higher mean change scores compared with the DG. For steps and daily minutes of MVPA, significantly lower mean change scores emerged for MIG compared with the PG. Participants of the CoachG showed significantly higher change scores in PAL, steps, minutes of MVPA, active EE, total EE compared with the MIG. As hypothesized, participants of the CoachG had significantly higher mean change scores in PAL and total EE compared with groups that only received feedback. However, no significant differences were found for steps, minutes of MVPA and active EE between CoachG and PG. Receiving additional need-supportive coaching resulted in a higher PAL and active EE compared with measurement (display) feedback only. These findings suggest combining feedback on physical activity with personal coaching in order to facilitate long-term behavioral change. When it comes to increasing steps, minutes of MVPA or active EE, a pedometer constitutes a sufficient tool. ClinicalTrials.gov NCT01432327. Date registered: 12 September 2011.
Dysregulated expression of MIG/CXCL9, IP-10/CXCL10 and CXCL16 and their receptors in systemic sclerosis

PubMed Central

2011-01-01

Introduction Systemic sclerosis (SSc) is characterized by fibrosis and microvascular abnormalities including dysregulated angiogenesis. Chemokines, in addition to their chemoattractant properties, have the ability to modulate angiogenesis. Chemokines lacking the enzyme-linked receptor (ELR) motif, such as monokine induced by interferon-γ (IFN-γ) (MIG/CXCL9) and IFN-inducible protein 10 (IP-10/CXCL10), inhibit angiogenesis by binding CXCR3. In addition, CXCL16 promotes angiogenesis by binding its unique receptor CXCR6. In this study, we determined the expression of these chemokines and receptors in SSc skin and serum. Methods Immunohistology and enzyme-linked immunosorbent assays (ELISAs) were used to determine chemokine and chemokine receptor expression in the skin and serum, respectively, of SSc and normal patients. Endothelial cells (ECs) were isolated from SSc skin biopsies and chemokine and chemokine receptor expression was determined by quantitative PCR and immunofluorescence staining. Results Antiangiogenic IP-10/CXCL10 and MIG/CXCL9 were elevated in SSc serum and highly expressed in SSc skin. However, CXCR3, the receptor for these chemokines, was decreased on ECs in SSc vs. normal skin. CXCL16 was elevated in SSc serum and increased in SSc patients with early disease, pulmonary arterial hypertension, and those that died during the 36 months of the study. In addition, its receptor CXCR6 was overexpressed on ECs in SSc skin. At the mRNA and protein levels, CXCR3 was decreased while CXCR6 was increased on SSc ECs vs. human microvascular endothelial cells (HMVECs). Conclusions These results show that while the expression of MIG/CXCL9 and IP-10/CXCL10 are elevated in SSc serum, the expression of CXCR3 is downregulated on SSc dermal ECs. In contrast, CXCL16 and CXCR6 are elevated in SSc serum and on SSc dermal ECs, respectively. 
In all, these findings suggest angiogenic chemokine receptor expression is likely regulated in an effort to promote angiogenesis in SSc skin. PMID:21303517
Genomic insight into the common carp (Cyprinus carpio) genome by sequencing analysis of BAC-end sequences

PubMed Central

2011-01-01

Background Common carp is one of the most important aquaculture teleost fish in the world. Common carp and other closely related Cyprinidae species provide over 30% of aquaculture production in the world. However, common carp genomic resources are still relatively underdeveloped. BAC end sequences (BES) are important resources for genome research on BAC-anchored genetic marker development, linkage map and physical map integration, and whole genome sequence assembling and scaffolding. Result To develop such valuable resources in common carp (Cyprinus carpio), a total of 40,224 BAC clones were sequenced on both ends, generating 65,720 clean BES with an average read length of 647 bp after sequence processing, representing 42,522,168 bp or 2.5% of the common carp genome. The first survey of the common carp genome was conducted with various bioinformatics tools. The common carp genome contains over 17.3% of repetitive elements with GC content of 36.8% and 518 transposon ORFs. To identify and develop BAC-anchored microsatellite markers, a total of 13,581 microsatellites were detected from 10,355 BES. The coding regions of 7,127 genes were recognized from 9,443 BES on 7,453 BACs, with 1,990 BACs having genes on both ends. To evaluate the similarity to the genome of the closely related zebrafish, BES of common carp were aligned against the zebrafish genome. A total of 39,335 BES of common carp have conserved homologs on the zebrafish genome, which demonstrated the high similarity between zebrafish and common carp genomes, indicating the feasibility of comparative mapping between zebrafish and common carp once a physical map of common carp is available. Conclusion BAC end sequences are great resources for the first genome-wide survey of common carp. The repetitive DNA was estimated to be approximately 28% of the common carp genome, indicating the higher complexity of the genome. Comparative analysis mapped around 40,000 BES to the zebrafish genome and established over 3,100 microsyntenies, covering over 50% of the zebrafish genome. BES of common carp are tremendous tools for comparative mapping between the two closely related species, zebrafish and common carp, which should facilitate both structural and functional genome analysis in common carp. PMID:21492448
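The BES statistics in the common carp abstract above are internally related, and a short calculation makes the relationships explicit. The sketch below recomputes the mean read length from the total bases and read count, the share of end reads that survived processing (two reads per BAC clone), and the genome size implied by the "2.5%" figure. All inputs come from the abstract; the derived genome size is only as precise as the rounded 2.5% and should be read as an estimate.

```python
# Figures copied from the common carp BES abstract above.
BAC_CLONES = 40_224
CLEAN_BES = 65_720
TOTAL_BES_BP = 42_522_168
GENOME_FRACTION = 0.025   # "2.5% of common carp genome" (rounded in the source)

# Each clone is sequenced from both ends, so two reads are attempted per clone.
attempted_reads = 2 * BAC_CLONES

mean_read_length = TOTAL_BES_BP / CLEAN_BES
clean_read_share = CLEAN_BES / attempted_reads
implied_genome_bp = TOTAL_BES_BP / GENOME_FRACTION

print(f"mean clean BES length: {mean_read_length:.1f} bp")
print(f"share of end reads kept after processing: {clean_read_share:.1%}")
print(f"implied genome size: {implied_genome_bp / 1e9:.2f} Gb")
```

The recomputed mean rounds to the reported 647 bp, roughly four in five attempted end reads were retained as clean BES, and the implied genome size is about 1.7 Gb.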

Comparative Genomics of Bacillus species and its Relevance in Industrial Microbiology.

PubMed

Sharma, Archana; Satyanarayana, T

2013-01-01

With the advent of high-throughput sequencing platforms and relevant analytical tools, the rate of microbial genome sequencing has accelerated, which has in turn led to a better understanding of microbial molecular biology and genetics. The complete genome sequences of important industrial organisms provide opportunities for human health, industry, and the environment. Bacillus species are the dominant workhorses in industrial fermentations. Today, genome sequences of several Bacillus species are available, and comparative genomics of this genus helps in understanding their physiology, biochemistry, and genetics. The genomes of these bacterial species are the sources of many industrially important enzymes and antibiotics and, therefore, provide an opportunity to tailor enzymes with desired properties to suit a wide range of applications. A comparative account of the strengths and weaknesses of the different sequencing platforms is also given in the review.
Application of resequencing to rice genomics, functional genomics and evolutionary analysis

PubMed Central

2014-01-01

Rice is a model system used for crop genomics studies. The completion of the rice genome draft sequences in 2002 not only accelerated functional genome studies, but also initiated a new era of resequencing rice genomes. Based on the rice reference genome, next-generation sequencing (NGS) using high-throughput sequencing systems can efficiently accomplish whole genome resequencing of various genetic populations and diverse germplasm resources. Resequencing technology has been effectively utilized in evolutionary analysis, rice genomics and functional genomics studies. This technique is beneficial for both bridging the knowledge gap between genotype and phenotype and facilitating molecular breeding via gene design in rice. Here, we also discuss the limitations, applications and future prospects of rice resequencing. PMID:25006357
The draft genome of the transgenic tropical fruit tree papaya (Carica papaya Linnaeus)

PubMed Central

Ming, Ray; Hou, Shaobin; Feng, Yun; Yu, Qingyi; Dionne-Laporte, Alexandre; Saw, Jimmy H.; Senin, Pavel; Wang, Wei; Ly, Benjamin V.; Lewis, Kanako L. T.; Salzberg, Steven L.; Feng, Lu; Jones, Meghan R.; Skelton, Rachel L.; Murray, Jan E.; Chen, Cuixia; Qian, Wubin; Shen, Junguo; Du, Peng; Eustice, Moriah; Tong, Eric; Tang, Haibao; Lyons, Eric; Paull, Robert E.; Michael, Todd P.; Wall, Kerr; Rice, Danny W.; Albert, Henrik; Wang, Ming-Li; Zhu, Yun J.; Schatz, Michael; Nagarajan, Niranjan; Acob, Ricelle A.; Guan, Peizhu; Blas, Andrea; Wai, Ching Man; Ackerman, Christine M.; Ren, Yan; Liu, Chao; Wang, Jianmei; Wang, Jianping; Na, Jong-Kuk; Shakirov, Eugene V.; Haas, Brian; Thimmapuram, Jyothi; Nelson, David; Wang, Xiyin; Bowers, John E.; Gschwend, Andrea R.; Delcher, Arthur L.; Singh, Ratnesh; Suzuki, Jon Y.; Tripathi, Savarni; Neupane, Kabi; Wei, Hairong; Irikura, Beth; Paidi, Maya; Jiang, Ning; Zhang, Wenli; Presting, Gernot; Windsor, Aaron; Navajas-Pérez, Rafael; Torres, Manuel J.; Feltus, F. Alex; Porter, Brad; Li, Yingjun; Burroughs, A. Max; Luo, Ming-Cheng; Liu, Lei; Christopher, David A.; Mount, Stephen M.; Moore, Paul H.; Sugimura, Tak; Jiang, Jiming; Schuler, Mary A.; Friedman, Vikki; Mitchell-Olds, Thomas; Shippen, Dorothy E.; dePamphilis, Claude W.; Palmer, Jeffrey D.; Freeling, Michael; Paterson, Andrew H.; Gonsalves, Dennis; Wang, Lei; Alam, Maqsudul

2010-01-01

Papaya, a fruit crop cultivated in tropical and subtropical regions, is known for its nutritional benefits and medicinal applications. Here we report a 3× draft genome sequence of ‘SunUp’ papaya, the first commercial virus-resistant transgenic fruit tree to be sequenced. The papaya genome is three times the size of the Arabidopsis genome, but contains fewer genes, including significantly fewer disease-resistance gene analogues. Comparison of the five sequenced genomes suggests a minimal angiosperm gene set of 13,311. A lack of recent genome duplication, atypical of other angiosperm genomes sequenced so far, may account for the smaller papaya gene number in most functional groups. Nonetheless, striking amplifications in gene number within particular functional groups suggest roles in the evolution of tree-like habit, deposition and remobilization of starch reserves, attraction of seed dispersal agents, and adaptation to tropical daylengths. Transgenesis at three locations is closely associated with chloroplast insertions into the nuclear genome, and with topoisomerase I recognition sites. Papaya offers numerous advantages as a system for fruit-tree functional genomics, and this draft genome sequence provides the foundation for revealing the basis of Carica's distinguishing morpho-physiological, medicinal and nutritional properties. PMID:18432245
The draft genome of the transgenic tropical fruit tree papaya (Carica papaya Linnaeus).

PubMed

Ming, Ray; Hou, Shaobin; Feng, Yun; Yu, Qingyi; Dionne-Laporte, Alexandre; Saw, Jimmy H; Senin, Pavel; Wang, Wei; Ly, Benjamin V; Lewis, Kanako L T; Salzberg, Steven L; Feng, Lu; Jones, Meghan R; Skelton, Rachel L; Murray, Jan E; Chen, Cuixia; Qian, Wubin; Shen, Junguo; Du, Peng; Eustice, Moriah; Tong, Eric; Tang, Haibao; Lyons, Eric; Paull, Robert E; Michael, Todd P; Wall, Kerr; Rice, Danny W; Albert, Henrik; Wang, Ming-Li; Zhu, Yun J; Schatz, Michael; Nagarajan, Niranjan; Acob, Ricelle A; Guan, Peizhu; Blas, Andrea; Wai, Ching Man; Ackerman, Christine M; Ren, Yan; Liu, Chao; Wang, Jianmei; Wang, Jianping; Na, Jong-Kuk; Shakirov, Eugene V; Haas, Brian; Thimmapuram, Jyothi; Nelson, David; Wang, Xiyin; Bowers, John E; Gschwend, Andrea R; Delcher, Arthur L; Singh, Ratnesh; Suzuki, Jon Y; Tripathi, Savarni; Neupane, Kabi; Wei, Hairong; Irikura, Beth; Paidi, Maya; Jiang, Ning; Zhang, Wenli; Presting, Gernot; Windsor, Aaron; Navajas-Pérez, Rafael; Torres, Manuel J; Feltus, F Alex; Porter, Brad; Li, Yingjun; Burroughs, A Max; Luo, Ming-Cheng; Liu, Lei; Christopher, David A; Mount, Stephen M; Moore, Paul H; Sugimura, Tak; Jiang, Jiming; Schuler, Mary A; Friedman, Vikki; Mitchell-Olds, Thomas; Shippen, Dorothy E; dePamphilis, Claude W; Palmer, Jeffrey D; Freeling, Michael; Paterson, Andrew H; Gonsalves, Dennis; Wang, Lei; Alam, Maqsudul

2008-04-24

Papaya, a fruit crop cultivated in tropical and subtropical regions, is known for its nutritional benefits and medicinal applications. Here we report a 3x draft genome sequence of 'SunUp' papaya, the first commercial virus-resistant transgenic fruit tree to be sequenced. The papaya genome is three times the size of the Arabidopsis genome, but contains fewer genes, including significantly fewer disease-resistance gene analogues. Comparison of the five sequenced genomes suggests a minimal angiosperm gene set of 13,311. A lack of recent genome duplication, atypical of other angiosperm genomes sequenced so far, may account for the smaller papaya gene number in most functional groups. Nonetheless, striking amplifications in gene number within particular functional groups suggest roles in the evolution of tree-like habit, deposition and remobilization of starch reserves, attraction of seed dispersal agents, and adaptation to tropical daylengths. Transgenesis at three locations is closely associated with chloroplast insertions into the nuclear genome, and with topoisomerase I recognition sites. Papaya offers numerous advantages as a system for fruit-tree functional genomics, and this draft genome sequence provides the foundation for revealing the basis of Carica's distinguishing morpho-physiological, medicinal and nutritional properties.
LINE-1 retrotransposons: from 'parasite' sequences to functional elements.

PubMed

Paço, Ana; Adega, Filomena; Chaves, Raquel

2015-02-01

Long interspersed nuclear elements-1 (LINE-1) are the most abundant and active retrotransposons in the mammalian genomes. Traditionally, the occurrence of LINE-1 sequences in the genome of mammals has been explained by the selfish DNA hypothesis. Nevertheless, recently, it has also been argued that these sequences could play important roles in these genomes, as in the regulation of gene expression, genome modelling and X-chromosome inactivation. The non-random chromosomal distribution is a striking feature of these retroelements that somehow reflects its functionality. In the present study, we have isolated and analysed a fraction of the open reading frame 2 (ORF2) LINE-1 sequence from three rodent species, Cricetus cricetus, Peromyscus eremicus and Praomys tullbergi. Physical mapping of the isolated sequences revealed an interspersed longitudinal AT pattern of distribution along all the chromosomes of the complement in the three genomes. A detailed analysis shows that these sequences are preferentially located in the euchromatic regions, although some signals could be detected in the heterochromatin. In addition, a coincidence between the location of imprinted gene regions (as Xist and Tsix gene regions) and the LINE-1 retroelements was also observed. According to these results, we propose an involvement of LINE-1 sequences in different genomic events as gene imprinting, X-chromosome inactivation and evolution of repetitive sequences located at the heterochromatic regions (e.g. satellite DNA sequences) of the rodents' genomes analysed.
The Large Mitochondrial Genome of Symbiodinium minutum Reveals Conserved Noncoding Sequences between Dinoflagellates and Apicomplexans.

PubMed

Shoguchi, Eiichi; Shinzato, Chuya; Hisata, Kanako; Satoh, Nori; Mungpakdee, Sutada

2015-07-20

Even though mitochondrial genomes, which characterize eukaryotic cells, were first discovered more than 50 years ago, mitochondrial genomics remains an important topic in molecular biology and genome sciences. The Phylum Alveolata comprises three major groups (ciliates, apicomplexans, and dinoflagellates), the mitochondrial genomes of which have diverged widely. Even though the gene content of dinoflagellate mitochondrial genomes is reportedly comparable to that of apicomplexans, the highly fragmented and rearranged genome structures of dinoflagellates have frustrated whole genomic analysis. Consequently, noncoding sequences and gene arrangements of dinoflagellate mitochondrial genomes have not been well characterized. Here we report that the continuous assembled genome (∼326 kb) of the dinoflagellate, Symbiodinium minutum, is AT-rich (∼64.3%) and that it contains three protein-coding genes. Based upon in silico analysis, the remaining 99% of the genome comprises transcriptomic noncoding sequences. RNA edited sites and unique, possible start and stop codons clarify conserved regions among dinoflagellates. Our massive transcriptome analysis shows that almost all regions of the genome are transcribed, including 27 possible fragmented ribosomal RNA genes and 12 uncharacterized small RNAs that are similar to mitochondrial RNA genes of the malarial parasite, Plasmodium falciparum. Gene map comparisons show that gene order is only slightly conserved between S. minutum and P. falciparum. However, small RNAs and intergenic sequences share sequence similarities with P. falciparum, suggesting that the function of noncoding sequences has been preserved despite development of very different genome structures. © The Author(s) 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Harnessing the sorghum genome sequence: development of a genome-wide microsatellite (SSR) resource for swift genetic mapping and map based cloning in sorghum

USDA-ARS's Scientific Manuscript database

Sorghum is the second cereal crop to have a full genome completely sequenced (Nature (2009), 457:551). This achievement is widely recognized as a scientific milestone for grass genetics and genomics in general. However, the true worth of genetic information lies in translating the sequence informa...
Completed Genome Sequences of Strains from 36 Serotypes of Salmonella

PubMed Central

Robertson, James; Yoshida, Catherine; Gurnik, Simone; Rankin, Marisa

2018-01-01

We report here the completed closed genome sequences of strains representing 36 serotypes of Salmonella. These genome sequences will provide useful references for understanding the genetic variation between serotypes, particularly as references for mapping of raw reads or to create assemblies of higher quality, as well as to aid in studies of comparative genomics of Salmonella. PMID:29348347
First Complete Genome Sequence of Suakwa aphid-borne yellows virus from East Timor

PubMed Central

Maina, Solomon; Edwards, Owain R.; de Almeida, Luis; Ximenes, Abel

2016-01-01

We present here the first complete genomic RNA sequence of the polerovirus Suakwa aphid-borne yellows virus (SABYV), from East Timor. The isolate sequenced came from a virus-infected pumpkin plant. The East Timorese genome had a nucleotide identity of 86.5% with the only other SABYV genome available, which is from Taiwan. PMID:27469955
Comparison of Burrows-Wheeler transform-based mapping algorithms used in high-throughput whole-genome sequencing: application to Illumina data for livestock genomes

USDA-ARS's Scientific Manuscript database

Ongoing developments and cost decreases in next-generation sequencing (NGS) technologies have led to an increase in their application, which has greatly enhanced the fields of genetics and genomics. Mapping sequence reads onto a reference genome is a fundamental step in the analysis of NGS data. Eff...
Evolutionary dynamics of retrotransposons assessed by high-throughput sequencing in wild relatives of wheat.

PubMed

Senerchia, Natacha; Wicker, Thomas; Felber, François; Parisod, Christian

2013-01-01

Transposable elements (TEs) represent a major fraction of plant genomes and drive their evolution. An improved understanding of genome evolution requires the dynamics of a large number of TE families to be considered. We put forward an approach that bypasses the need for a complete reference genome and assesses the evolutionary trajectories of high copy number TE families from genome snapshots obtained with high-throughput sequencing. Low coverage sequencing of the complex genomes of Aegilops cylindrica and Ae. geniculata using 454 identified more than 70% of the sequences as known TEs, mainly long terminal repeat (LTR) retrotransposons. Comparing the abundance of reads as well as patterns of sequence diversity and divergence within and among genomes assessed the dynamics of 44 major LTR retrotransposon families of the 165 identified. In particular, molecular population genetics on individual TE copies distinguished recently active from quiescent families and highlighted different evolutionary trajectories of retrotransposons among related species. This work presents a suite of tools suitable for current sequencing data, making it possible to address the genome-wide evolutionary dynamics of TEs at the family level and advancing our understanding of the evolution of nonmodel genomes.
Discovery, genotyping and characterization of structural variation and novel sequence at single nucleotide resolution from de novo genome assemblies on a population scale.

PubMed

Liu, Siyang; Huang, Shujia; Rao, Junhua; Ye, Weijian; Krogh, Anders; Wang, Jun

2015-01-01

Comprehensive recognition of genomic variation in one individual is important for understanding disease and developing personalized medication and treatment. Many tools based on DNA re-sequencing exist for identification of single nucleotide polymorphisms, small insertions and deletions (indels) as well as large deletions. However, these approaches consistently display a substantial bias against the recovery of complex structural variants and novel sequence in individual genomes and do not provide interpretation information such as the annotation of ancestral state and formation mechanism. We present a novel approach implemented in a single software package, AsmVar, to discover, genotype and characterize different forms of structural variation and novel sequence from population-scale de novo genome assemblies up to nucleotide resolution. Application of AsmVar to several human de novo genome assemblies captures a wide spectrum of structural variants and novel sequences present in the human population with high sensitivity and specificity. Our method provides a direct solution for investigating structural variants and novel sequences from de novo genome assemblies, facilitating the construction of population-scale pan-genomes. Our study also highlights the usefulness of the de novo assembly strategy for definition of genome structure.
The mitochondrial genome of the legume Vigna radiata and the analysis of recombination across short mitochondrial repeats.

PubMed

Alverson, Andrew J; Zhuo, Shi; Rice, Danny W; Sloan, Daniel B; Palmer, Jeffrey D

2011-01-20

The mitochondrial genomes of seed plants are exceptionally fluid in size, structure, and sequence content, with the accumulation and activity of repetitive sequences underlying much of this variation. We report the first fully sequenced mitochondrial genome of a legume, Vigna radiata (mung bean), and show that despite its unexceptional size (401,262 nt), the genome is unusually depauperate in repetitive DNA and "promiscuous" sequences from the chloroplast and nuclear genomes. Although Vigna lacks the large, recombinationally active repeats typical of most other seed plants, a PCR survey of its modest repertoire of short (38-297 nt) repeats nevertheless revealed evidence for recombination across all of them. A set of novel control assays showed, however, that these results could instead reflect, in part or entirely, artifacts of PCR-mediated recombination. Consequently, we recommend that other methods, especially high-depth genome sequencing, be used instead of PCR to infer patterns of plant mitochondrial recombination. The average-sized but repeat- and feature-poor mitochondrial genome of Vigna makes it ever more difficult to generalize about the factors shaping the size and sequence content of plant mitochondrial genomes.
Targeted or whole genome sequencing of formalin fixed tissue samples: potential applications in cancer genomics.

PubMed

Munchel, Sarah; Hoang, Yen; Zhao, Yue; Cottrell, Joseph; Klotzle, Brandy; Godwin, Andrew K; Koestler, Devin; Beyerlein, Peter; Fan, Jian-Bing; Bibikova, Marina; Chien, Jeremy

2015-09-22

Current genomic studies are limited by the poor availability of fresh-frozen tissue samples. Although formalin-fixed diagnostic samples are in abundance, they are seldom used in current genomic studies because of concern about formalin-fixation artifacts. Better characterization of these artifacts will allow the use of archived clinical specimens in translational and clinical research studies. To provide a systematic analysis of formalin-fixation artifacts on Illumina sequencing, we generated 26 DNA sequencing data sets from 13 pairs of matched formalin-fixed paraffin-embedded (FFPE) and fresh-frozen (FF) tissue samples. The results indicate a high rate of concordant calls between matched FF/FFPE pairs at reference and variant positions in three commonly used sequencing approaches (whole genome, whole exome, and targeted exon sequencing). Global mismatch rates and C · G > T · A substitutions were comparable between matched FF/FFPE samples, and discordant rates were low (<0.26%) in all samples. Finally, low-pass whole genome sequencing produces a similar pattern of copy number alterations between FF/FFPE pairs. The results from our studies suggest the potential use of diagnostic FFPE samples for cancer genomic studies to characterize and catalog variations in cancer genomes.
Multi-Platform Next-Generation Sequencing of the Domestic Turkey (Meleagris gallopavo): Genome Assembly and Analysis

PubMed Central

Aslam, Luqman; Beal, Kathryn; Ann Blomberg, Le; Bouffard, Pascal; Burt, David W.; Crasta, Oswald; Crooijmans, Richard P. M. A.; Cooper, Kristal; Coulombe, Roger A.; De, Supriyo; Delany, Mary E.; Dodgson, Jerry B.; Dong, Jennifer J.; Evans, Clive; Frederickson, Karin M.; Flicek, Paul; Florea, Liliana; Folkerts, Otto; Groenen, Martien A. M.; Harkins, Tim T.; Herrero, Javier; Hoffmann, Steve; Megens, Hendrik-Jan; Jiang, Andrew; de Jong, Pieter; Kaiser, Pete; Kim, Heebal; Kim, Kyu-Won; Kim, Sungwon; Langenberger, David; Lee, Mi-Kyung; Lee, Taeheon; Mane, Shrinivasrao; Marcais, Guillaume; Marz, Manja; McElroy, Audrey P.; Modise, Thero; Nefedov, Mikhail; Notredame, Cédric; Paton, Ian R.; Payne, William S.; Pertea, Geo; Prickett, Dennis; Puiu, Daniela; Qioa, Dan; Raineri, Emanuele; Ruffier, Magali; Salzberg, Steven L.; Schatz, Michael C.; Scheuring, Chantel; Schmidt, Carl J.; Schroeder, Steven; Searle, Stephen M. J.; Smith, Edward J.; Smith, Jacqueline; Sonstegard, Tad S.; Stadler, Peter F.; Tafer, Hakim; Tu, Zhijian (Jake); Van Tassell, Curtis P.; Vilella, Albert J.; Williams, Kelly P.; Yorke, James A.; Zhang, Liqing; Zhang, Hong-Bin; Zhang, Xiaojun; Zhang, Yang; Reed, Kent M.

2010-01-01

A synergistic combination of two next-generation sequencing platforms with a detailed comparative BAC physical contig map provided a cost-effective assembly of the genome sequence of the domestic turkey (Meleagris gallopavo). Heterozygosity of the sequenced source genome allowed discovery of more than 600,000 high quality single nucleotide variants. Despite this heterozygosity, the current genome assembly (∼1.1 Gb) includes 917 Mb of sequence assigned to specific turkey chromosomes. Annotation identified nearly 16,000 genes, with 15,093 recognized as protein coding and 611 as non-coding RNA genes. Comparative analysis of the turkey, chicken, and zebra finch genomes, and comparing avian to mammalian species, supports the characteristic stability of avian genomes and identifies genes unique to the avian lineage. Clear differences are seen in number and variety of genes of the avian immune system where expansions and novel genes are less frequent than examples of gene loss. The turkey genome sequence provides resources to further understand the evolution of vertebrate genomes and genetic variation underlying economically important quantitative traits in poultry. This integrated approach may be a model for providing both gene and chromosome level assemblies of other species with agricultural, ecological, and evolutionary interest. PMID:20838655
The complete and fully assembled genome sequence of Aeromonas salmonicida subsp. pectinolytica and its comparative analysis with other Aeromonas species: investigation of the mobilome in environmental and pathogenic strains.

PubMed

Pfeiffer, Friedhelm; Zamora-Lagos, Maria-Antonia; Blettinger, Martin; Yeroslaviz, Assa; Dahl, Andreas; Gruber, Stephan; Habermann, Bianca H

2018-01-05

Due to the predominant usage of short-read sequencing to date, most bacterial genome sequences reported in the last years remain at the draft level. This precludes certain types of analyses, such as the in-depth analysis of genome plasticity. Here we report the finalized genome sequence of the environmental strain Aeromonas salmonicida subsp. pectinolytica 34mel, for which only a draft genome with 253 contigs is currently available. Successful completion of the transposon-rich genome critically depended on the PacBio long read sequencing technology. Using finalized genome sequences of A. salmonicida subsp. pectinolytica and other Aeromonads, we report the detailed analysis of the transposon composition of these bacterial species. Mobilome evolution is exemplified by a complex transposon, which has shifted from pathogenicity-related to environmental-related gene content in A. salmonicida subsp. pectinolytica 34mel. Obtaining the complete, circular genome of A. salmonicida subsp. pectinolytica allowed us to perform an in-depth analysis of its mobilome. We demonstrate the mobilome-dependent evolution of this strain's genetic profile from pathogenic to environmental.
The Saccharomyces Genome Database Variant Viewer

PubMed Central

Sheppard, Travis K.; Hitz, Benjamin C.; Engel, Stacia R.; Song, Giltae; Balakrishnan, Rama; Binkley, Gail; Costanzo, Maria C.; Dalusag, Kyla S.; Demeter, Janos; Hellerstedt, Sage T.; Karra, Kalpana; Nash, Robert S.; Paskov, Kelley M.; Skrzypek, Marek S.; Weng, Shuai; Wong, Edith D.; Cherry, J. Michael

2016-01-01

The Saccharomyces Genome Database (SGD; http://www.yeastgenome.org) is the authoritative community resource for the Saccharomyces cerevisiae reference genome sequence and its annotation. In recent years, we have moved toward increased representation of sequence variation and allelic differences within S. cerevisiae. The publication of numerous additional genomes has motivated the creation of new tools for their annotation and analysis. Here we present the Variant Viewer: a dynamic open-source web application for the visualization of genomic and proteomic differences. Multiple sequence alignments have been constructed across high quality genome sequences from 11 different S. cerevisiae strains and stored in the SGD. The alignments and summaries are encoded in JSON and used to create a two-tiered dynamic view of the budding yeast pan-genome, available at http://www.yeastgenome.org/variant-viewer. PMID:26578556
Complete genome sequence of an attenuated Sparfloxacin-resistant Streptococcus agalactiae strain 138spar

USDA-ARS's Scientific Manuscript database

The complete genome of a sparfloxacin-resistant Streptococcus agalactiae vaccine strain 138spar is 1,838,126 bp in size. The genome has 1892 coding sequences and 82 RNAs. The annotation of the genome is added by the NCBI Prokaryotic Genome Annotation Pipeline. The publishing of this genome will allo...
Genome analysis of multiple pathogenic isolates of Streptococcus agalactiae: Implications for the microbial “pan-genome”

PubMed Central

Tettelin, Hervé; Masignani, Vega; Cieslewicz, Michael J.; Donati, Claudio; Medini, Duccio; Ward, Naomi L.; Angiuoli, Samuel V.; Crabtree, Jonathan; Jones, Amanda L.; Durkin, A. Scott; DeBoy, Robert T.; Davidsen, Tanja M.; Mora, Marirosa; Scarselli, Maria; Margarit y Ros, Immaculada; Peterson, Jeremy D.; Hauser, Christopher R.; Sundaram, Jaideep P.; Nelson, William C.; Madupu, Ramana; Brinkac, Lauren M.; Dodson, Robert J.; Rosovitz, Mary J.; Sullivan, Steven A.; Daugherty, Sean C.; Haft, Daniel H.; Selengut, Jeremy; Gwinn, Michelle L.; Zhou, Liwei; Zafar, Nikhat; Khouri, Hoda; Radune, Diana; Dimitrov, George; Watkins, Kisha; O'Connor, Kevin J. B.; Smith, Shannon; Utterback, Teresa R.; White, Owen; Rubens, Craig E.; Grandi, Guido; Madoff, Lawrence C.; Kasper, Dennis L.; Telford, John L.; Wessels, Michael R.; Rappuoli, Rino; Fraser, Claire M.

2005-01-01

The development of efficient and inexpensive genome sequencing methods has revolutionized the study of human bacterial pathogens and improved vaccine design. Unfortunately, the sequence of a single genome does not reflect how genetic variability drives pathogenesis within a bacterial species and also limits genome-wide screens for vaccine candidates or for antimicrobial targets. We have generated the genomic sequence of six strains representing the five major disease-causing serotypes of Streptococcus agalactiae, the main cause of neonatal infection in humans. Analysis of these genomes and those available in databases showed that the S. agalactiae species can be described by a pan-genome consisting of a core genome shared by all isolates, accounting for ≈80% of any single genome, plus a dispensable genome consisting of partially shared and strain-specific genes. Mathematical extrapolation of the data suggests that the gene reservoir available for inclusion in the S. agalactiae pan-genome is vast and that unique genes will continue to be identified even after sequencing hundreds of genomes. PMID:16172379
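The core/dispensable split described in this abstract can be illustrated with a minimal Python sketch. The strain names and gene identifiers below are hypothetical toy data, not taken from the study, and the sketch assumes genes have already been clustered into families with shared identifiers; it does not reproduce the authors' pipeline or their mathematical extrapolation.

```python
def pan_genome(strain_genes):
    """Split gene families into core (in every strain) and dispensable (the rest)."""
    gene_sets = [set(genes) for genes in strain_genes.values()]
    if not gene_sets:
        return set(), set()
    pan = set().union(*gene_sets)            # every gene seen in any strain
    core = set.intersection(*gene_sets)      # genes shared by all strains
    return core, pan - core


# Hypothetical toy input: strain name -> set of gene family identifiers.
strains = {
    "strain_A": {"g1", "g2", "g3", "g4", "g5"},
    "strain_B": {"g1", "g2", "g3", "g4", "g6"},
    "strain_C": {"g1", "g2", "g3", "g4", "g7", "g8"},
}

core, dispensable = pan_genome(strains)

# Fraction of each single genome that belongs to the core genome.
core_fraction = {name: len(core) / len(genes) for name, genes in strains.items()}
```

In this toy data the core covers 80% of strain_A and strain_B, which mirrors the kind of per-genome core fraction the abstract reports, though the numbers here are purely illustrative.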
Mind the gap; seven reasons to close fragmented genome assemblies

USDA-ARS's Scientific Manuscript database

Like other domains of life, research into the biology of filamentous microbes has greatly benefited from the advent of whole-genome sequencing. Next-generation sequencing (NGS) technologies have revolutionized sequencing, making genomic sciences accessible to many academic laboratories including tho...

Salmonella Typhi genomics: envisaging the future of typhoid eradication.

PubMed

Yap, Kien-Pong; Thong, Kwai Lin

2017-08-01

Next-generation whole-genome sequencing has revolutionised the study of infectious diseases in recent years. The availability of genome sequences, and our understanding of them, has transformed the fields of molecular microbiology, epidemiology, infection treatment and vaccine development. We review the key findings from the publicly accessible genomes of Salmonella enterica serovar Typhi, from the first complete genome to the most recent release of thousands of Salmonella Typhi genomes, which have markedly shaped genomic research on S. Typhi and other pathogens. Important new insights acquired from the genome sequencing of S. Typhi, pertaining to genomic variations, evolution, population structure, antibiotic resistance, virulence, pathogenesis, disease surveillance/investigation and disease control are discussed. As the number of sequenced genomes is increasing at an unprecedented rate, fine variations in the gene pool of S. Typhi are captured at high resolution, allowing a deeper understanding of the pathogen's evolutionary trends and pathogenesis and bringing us closer to the eradication of typhoid through effective vaccine and treatment development. © 2017 John Wiley & Sons Ltd.
Between Two Fern Genomes

PubMed Central

2014-01-01

Ferns are the only major lineage of vascular plants not represented by a sequenced nuclear genome. This lack of genome sequence information significantly impedes our ability to understand and reconstruct genome evolution not only in ferns, but across all land plants. Azolla and Ceratopteris are ideal and complementary candidates to be the first ferns to have their nuclear genomes sequenced. They differ dramatically in genome size, life history, and habit, and thus represent the immense diversity of extant ferns. Together, this pair of genomes will facilitate myriad large-scale comparative analyses across ferns and all land plants. Here we review the unique biological characteristics of ferns and describe a number of outstanding questions in plant biology that will benefit from the addition of ferns to the set of taxa with sequenced nuclear genomes. We explain why the fern clade is pivotal for understanding genome evolution across land plants, and we provide a rationale for how knowledge of fern genomes will enable progress in research beyond the ferns themselves. PMID:25324969
MIPS: a database for genomes and protein sequences

PubMed Central

Mewes, H. W.; Frishman, D.; Güldener, U.; Mannhaupt, G.; Mayer, K.; Mokrejs, M.; Morgenstern, B.; Münsterkötter, M.; Rudd, S.; Weil, B.

2002-01-01

The Munich Information Center for Protein Sequences (MIPS-GSF, Neuherberg, Germany) continues to provide genome-related information in a systematic way. MIPS supports both national and European sequencing and functional analysis projects, develops and maintains automatically generated and manually annotated genome-specific databases, develops systematic classification schemes for the functional annotation of protein sequences, and provides tools for the comprehensive analysis of protein sequences. This report updates the information on the yeast genome (CYGD), the Neurospora crassa genome (MNCDB), the databases for the comprehensive set of genomes (PEDANT genomes), the database of annotated human EST clusters (HIB), the database of complete cDNAs from the DHGP (German Human Genome Project), as well as the project specific databases for the GABI (Genome Analysis in Plants) and HNB (Helmholtz–Netzwerk Bioinformatik) networks. The Arabidopsis thaliana database (MATDB), the database of mitochondrial proteins (MITOP) and our contribution to the PIR International Protein Sequence Database have been described elsewhere [Schoof et al. (2002) Nucleic Acids Res., 30, 91–93; Scharfe et al. (2000) Nucleic Acids Res., 28, 155–158; Barker et al. (2001) Nucleic Acids Res., 29, 29–32]. All databases described, the protein analysis tools provided and the detailed descriptions of our projects can be accessed through the MIPS World Wide Web server (http://mips.gsf.de). PMID:11752246
MIPS: a database for genomes and protein sequences.

PubMed

Mewes, H W; Frishman, D; Güldener, U; Mannhaupt, G; Mayer, K; Mokrejs, M; Morgenstern, B; Münsterkötter, M; Rudd, S; Weil, B

2002-01-01

The Munich Information Center for Protein Sequences (MIPS-GSF, Neuherberg, Germany) continues to provide genome-related information in a systematic way. MIPS supports both national and European sequencing and functional analysis projects, develops and maintains automatically generated and manually annotated genome-specific databases, develops systematic classification schemes for the functional annotation of protein sequences, and provides tools for the comprehensive analysis of protein sequences. This report updates the information on the yeast genome (CYGD), the Neurospora crassa genome (MNCDB), the databases for the comprehensive set of genomes (PEDANT genomes), the database of annotated human EST clusters (HIB), the database of complete cDNAs from the DHGP (German Human Genome Project), as well as the project specific databases for the GABI (Genome Analysis in Plants) and HNB (Helmholtz-Netzwerk Bioinformatik) networks. The Arabidopsis thaliana database (MATDB), the database of mitochondrial proteins (MITOP) and our contribution to the PIR International Protein Sequence Database have been described elsewhere [Schoof et al. (2002) Nucleic Acids Res., 30, 91-93; Scharfe et al. (2000) Nucleic Acids Res., 28, 155-158; Barker et al. (2001) Nucleic Acids Res., 29, 29-32]. All databases described, the protein analysis tools provided and the detailed descriptions of our projects can be accessed through the MIPS World Wide Web server (http://mips.gsf.de).
Whole-Genome Sequences of Thirteen Isolates of Borrelia burgdorferi

DOE Office of Scientific and Technical Information (OSTI.GOV)

Schutzer S. E.; Dunn J.; Fraser-Liggett, C. M.

2011-02-01

Borrelia burgdorferi is a causative agent of Lyme disease in North America and Eurasia. The first complete genome sequence of B. burgdorferi strain B31, available for more than a decade, has assisted research on the pathogenesis of Lyme disease. Because a single genome sequence is not sufficient to understand the relationship between genotypic and geographic variation and disease phenotype, we determined the whole-genome sequences of 13 additional B. burgdorferi isolates that span the range of natural variation. These sequences should allow improved understanding of pathogenesis and provide a foundation for novel detection, diagnosis, and prevention strategies.
Quantitative phenotyping via deep barcode sequencing.

PubMed

Smith, Andrew M; Heisler, Lawrence E; Mellor, Joseph; Kaper, Fiona; Thompson, Michael J; Chee, Mark; Roth, Frederick P; Giaever, Guri; Nislow, Corey

2009-10-01

Next-generation DNA sequencing technologies have revolutionized diverse genomics applications, including de novo genome sequencing, SNP detection, chromatin immunoprecipitation, and transcriptome analysis. Here we apply deep sequencing to genome-scale fitness profiling to evaluate yeast strain collections in parallel. This method, Barcode analysis by Sequencing, or "Bar-seq," outperforms the current benchmark barcode microarray assay in terms of both dynamic range and throughput. When applied to a complex chemogenomic assay, Bar-seq quantitatively identifies drug targets, with performance superior to the benchmark microarray assay. We also show that Bar-seq is well-suited for a multiplex format. We completely re-sequenced and re-annotated the yeast deletion collection using deep sequencing, found that approximately 20% of the barcodes and common priming sequences varied from expectation, and used this revised list of barcode sequences to improve data quality. Together, this new assay and analysis routine provide a deep-sequencing-based toolkit for identifying gene-environment interactions on a genome-wide scale.
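The counting step behind a barcode-sequencing fitness assay can be sketched as follows. This is a minimal illustration, not the authors' analysis routine: it assumes each read starts with the barcode at a fixed offset, uses exact matching only, and the barcode sequences, strain names (`yfg1`, `yfg2`), read sets, and pseudocount are all hypothetical.

```python
import math
from collections import Counter


def count_barcodes(reads, barcode_to_strain, start=0, length=4):
    """Count reads per strain by exact lookup of the barcode at a fixed offset."""
    counts = Counter()
    for read in reads:
        strain = barcode_to_strain.get(read[start:start + length])
        if strain is not None:          # reads with unknown barcodes are skipped
            counts[strain] += 1
    return counts


def log2_ratios(control, treatment, pseudocount=1):
    """log2(treatment / control) per strain; negative values suggest depletion."""
    strains = set(control) | set(treatment)
    return {
        s: math.log2((treatment[s] + pseudocount) / (control[s] + pseudocount))
        for s in strains
    }


# Hypothetical toy data: 4-nt barcodes followed by a common priming sequence.
barcode_to_strain = {"ACGT": "yfg1", "TTGA": "yfg2"}
control_reads = ["ACGTCC", "ACGTCC", "TTGACC", "TTGACC", "GGGGCC"]
treated_reads = ["ACGTCC", "ACGTCC", "ACGTCC"]

control = count_barcodes(control_reads, barcode_to_strain)
treated = count_barcodes(treated_reads, barcode_to_strain)
ratios = log2_ratios(control, treated)
```

A real analysis would also normalize for sequencing depth and tolerate sequencing errors in the barcode; both are omitted here to keep the sketch short.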
The genome sequence of ectromelia virus Naval and Cornell isolates from outbreaks in North America.

PubMed

Mavian, Carla; López-Bueno, Alberto; Bryant, Neil A; Seeger, Kathy; Quail, Michael A; Harris, David; Barrell, Bart; Alcami, Antonio

2014-08-01

Ectromelia virus (ECTV) is the causative agent of mousepox, a disease of laboratory mouse colonies and an excellent model for human smallpox. We report the genome sequence of two isolates from outbreaks in laboratory mouse colonies in the USA in 1995 and 1999: ECTV-Naval and ECTV-Cornell, respectively. The genome of ECTV-Naval and ECTV-Cornell was sequenced by the 454-Roche technology. The ECTV-Naval genome was also sequenced by the Sanger and Illumina technologies in order to evaluate these technologies for poxvirus genome sequencing. Genomic comparisons revealed that ECTV-Naval and ECTV-Cornell correspond to the same virus isolated from independent outbreaks. Both ECTV-Naval and ECTV-Cornell are extremely virulent in susceptible BALB/c mice, similar to ECTV-Moscow. This is consistent with the ECTV-Naval genome sharing 98.2% DNA sequence identity with that of ECTV-Moscow, and indicates that the genetic differences with ECTV-Moscow do not affect the virulence of ECTV-Naval in the mousepox model of footpad infection. Copyright © 2014 The Authors. Published by Elsevier Inc. All rights reserved.
Finding functional features in Saccharomyces genomes by phylogenetic footprinting.

PubMed

Cliften, Paul; Sudarsanam, Priya; Desikan, Ashwin; Fulton, Lucinda; Fulton, Bob; Majors, John; Waterston, Robert; Cohen, Barak A; Johnston, Mark

2003-07-04

The sifting and winnowing of DNA sequence that occur during evolution cause nonfunctional sequences to diverge, leaving phylogenetic footprints of functional sequence elements in comparisons of genome sequences. We searched for such footprints among the genome sequences of six Saccharomyces species and identified potentially functional sequences. Comparison of these sequences allowed us to revise the catalog of yeast genes and identify sequence motifs that may be targets of transcriptional regulatory proteins. Some of these conserved sequence motifs reside upstream of genes with similar functional annotations or similar expression patterns or those bound by the same transcription factor and are thus good candidates for functional regulatory sequences.
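The idea of a phylogenetic footprint can be illustrated with a simple scan for runs of alignment columns that are identical in every species. This is only a sketch under strong assumptions: the sequences are already aligned to equal length, conservation means perfect identity, and the three toy sequences and the `min_len` threshold are hypothetical rather than taken from the study.

```python
def conserved_blocks(aligned, min_len=6):
    """Return (start, end, sequence) for runs of columns identical in all rows."""
    length = len(aligned[0])
    if any(len(seq) != length for seq in aligned):
        raise ValueError("all aligned sequences must have the same length")
    blocks, start = [], None
    for i in range(length + 1):          # one extra step closes a trailing run
        conserved = (
            i < length
            and aligned[0][i] != "-"     # gap columns never count as conserved
            and all(seq[i] == aligned[0][i] for seq in aligned)
        )
        if conserved and start is None:
            start = i
        elif not conserved and start is not None:
            if i - start >= min_len:
                blocks.append((start, i, aligned[0][start:i]))
            start = None
    return blocks


# Hypothetical toy alignment of one intergenic region from three species.
alignment = [
    "ATGCGTACGTTTAGC",
    "TTGCGTACGTATAGG",
    "CTGCGTACGTCTAGA",
]
footprints = conserved_blocks(alignment, min_len=6)
```

Here the 9-column run is reported while the shorter 3-column run is filtered out by `min_len`, which stands in for the statistical criteria a real footprinting analysis would apply.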
Chromosome arm-specific BAC end sequences permit comparative analysis of homoeologous chromosomes and genomes of polyploid wheat

PubMed Central

2012-01-01

Background: Bread wheat, one of the world’s staple food crops, has the largest, highly repetitive and polyploid genome among the cereal crops. The wheat genome holds the key to crop genetic improvement against challenges such as climate change, environmental degradation, and water scarcity. To unravel the complex wheat genome, the International Wheat Genome Sequencing Consortium (IWGSC) is pursuing a chromosome- and chromosome arm-based approach to physical mapping and sequencing. Here we report on the use of a BAC library made from flow-sorted telosomic chromosome 3A short arm (t3AS) for marker development and analysis of sequence composition and comparative evolution of homoeologous genomes of hexaploid wheat. Results: The end-sequencing of 9,984 random BACs from a chromosome arm 3AS-specific library (TaaCsp3AShA) generated 11,014,359 bp of high quality sequence from 17,591 BAC-ends with an average length of 626 bp. The sequence represents 3.2% of t3AS with an average DNA sequence read every 19 kb. Overall, 79% of the sequence consisted of repetitive elements, 1.38% as coding regions (estimated 2,850 genes) and another 19% of unknown origin. Comparative sequence analysis suggested that 70-77% of the genes present in both 3A and 3B were syntenic with model species. Among the transposable elements, gypsy/sabrina (12.4%) was the most abundant repeat and was significantly more frequent in 3A compared to homoeologous chromosome 3B. Twenty novel repetitive sequences were also identified using de novo repeat identification. BESs were screened to identify simple sequence repeats (SSR) and transposable element junctions. A total of 1,057 SSRs were identified with a density of one per 10.4 kb, and 7,928 junctions between transposable elements (TE) and other sequences were identified with a density of one per 1.39 kb. With the objective of enhancing the marker density of chromosome 3AS, oligonucleotide primers were successfully designed from 758 SSRs and 695 Insertion Site Based Polymorphisms (ISBPs). Of the 96 ISBP primer pairs tested, 28 (29%) were 3A-specific, compared to 17 (18%) of 96 SSRs. Conclusion: This work reports on the use of wheat chromosome arm 3AS-specific BAC library for the targeted generation of sequence data from a particular region of the huge genome of wheat. A large quantity of sequences were generated from the A genome of hexaploid wheat for comparative genome analysis with homoeologous B and D genomes and other model grass genomes. Hundreds of molecular markers were developed from the 3AS arm-specific sequences; these and other sequences will be useful in gene discovery and physical mapping. PMID:22559868
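The summary statistics quoted in this abstract follow from simple ratios, which the short Python check below reproduces. The input figures are taken from the abstract; the variable names are illustrative only.

```python
# Figures as reported in the abstract.
total_bp = 11_014_359      # high quality BAC-end sequence
bac_ends = 17_591          # number of BAC-end reads
ssrs = 1_057               # simple sequence repeats found
te_junctions = 7_928       # TE junctions found

mean_read_len_bp = total_bp / bac_ends                 # about 626 bp per BAC end
ssr_spacing_kb = total_bp / ssrs / 1000                # about one SSR per 10.4 kb
junction_spacing_kb = total_bp / te_junctions / 1000   # about one junction per 1.39 kb

# Share of tested primer pairs that were 3A-specific (96 pairs tested per class).
isbp_specific_pct = 100 * 28 / 96                      # about 29%
ssr_specific_pct = 100 * 17 / 96                       # about 18%
```

Each value rounds to the figure stated in the abstract, so the reported densities are consistent with the reported totals.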
The genome of the sea urchin Strongylocentrotus purpuratus.

PubMed

Sodergren, Erica; Weinstock, George M; Davidson, Eric H; Cameron, R Andrew; Gibbs, Richard A; Angerer, Robert C; Angerer, Lynne M; Arnone, Maria Ina; Burgess, David R; Burke, Robert D; Coffman, James A; Dean, Michael; Elphick, Maurice R; Ettensohn, Charles A; Foltz, Kathy R; Hamdoun, Amro; Hynes, Richard O; Klein, William H; Marzluff, William; McClay, David R; Morris, Robert L; Mushegian, Arcady; Rast, Jonathan P; Smith, L Courtney; Thorndyke, Michael C; Vacquier, Victor D; Wessel, Gary M; Wray, Greg; Zhang, Lan; Elsik, Christine G; Ermolaeva, Olga; Hlavina, Wratko; Hofmann, Gretchen; Kitts, Paul; Landrum, Melissa J; Mackey, Aaron J; Maglott, Donna; Panopoulou, Georgia; Poustka, Albert J; Pruitt, Kim; Sapojnikov, Victor; Song, Xingzhi; Souvorov, Alexandre; Solovyev, Victor; Wei, Zheng; Whittaker, Charles A; Worley, Kim; Durbin, K James; Shen, Yufeng; Fedrigo, Olivier; Garfield, David; Haygood, Ralph; Primus, Alexander; Satija, Rahul; Severson, Tonya; Gonzalez-Garay, Manuel L; Jackson, Andrew R; Milosavljevic, Aleksandar; Tong, Mark; Killian, Christopher E; Livingston, Brian T; Wilt, Fred H; Adams, Nikki; Bellé, Robert; Carbonneau, Seth; Cheung, Rocky; Cormier, Patrick; Cosson, Bertrand; Croce, Jenifer; Fernandez-Guerra, Antonio; Genevière, Anne-Marie; Goel, Manisha; Kelkar, Hemant; Morales, Julia; Mulner-Lorillon, Odile; Robertson, Anthony J; Goldstone, Jared V; Cole, Bryan; Epel, David; Gold, Bert; Hahn, Mark E; Howard-Ashby, Meredith; Scally, Mark; Stegeman, John J; Allgood, Erin L; Cool, Jonah; Judkins, Kyle M; McCafferty, Shawn S; Musante, Ashlan M; Obar, Robert A; Rawson, Amanda P; Rossetti, Blair J; Gibbons, Ian R; Hoffman, Matthew P; Leone, Andrew; Istrail, Sorin; Materna, Stefan C; Samanta, Manoj P; Stolc, Viktor; Tongprasit, Waraporn; Tu, Qiang; Bergeron, Karl-Frederik; Brandhorst, Bruce P; Whittle, James; Berney, Kevin; Bottjer, David J; Calestani, Cristina; Peterson, Kevin; Chow, Elly; Yuan, Qiu Autumn; Elhaik, Eran; Graur, Dan; Reese, Justin T; Bosdet, 
Ian; Heesun, Shin; Marra, Marco A; Schein, Jacqueline; Anderson, Michele K; Brockton, Virginia; Buckley, Katherine M; Cohen, Avis H; Fugmann, Sebastian D; Hibino, Taku; Loza-Coll, Mariano; Majeske, Audrey J; Messier, Cynthia; Nair, Sham V; Pancer, Zeev; Terwilliger, David P; Agca, Cavit; Arboleda, Enrique; Chen, Nansheng; Churcher, Allison M; Hallböök, F; Humphrey, Glen W; Idris, Mohammed M; Kiyama, Takae; Liang, Shuguang; Mellott, Dan; Mu, Xiuqian; Murray, Greg; Olinski, Robert P; Raible, Florian; Rowe, Matthew; Taylor, John S; Tessmar-Raible, Kristin; Wang, D; Wilson, Karen H; Yaguchi, Shunsuke; Gaasterland, Terry; Galindo, Blanca E; Gunaratne, Herath J; Juliano, Celina; Kinukawa, Masashi; Moy, Gary W; Neill, Anna T; Nomura, Mamoru; Raisch, Michael; Reade, Anna; Roux, Michelle M; Song, Jia L; Su, Yi-Hsien; Townley, Ian K; Voronina, Ekaterina; Wong, Julian L; Amore, Gabriele; Branno, Margherita; Brown, Euan R; Cavalieri, Vincenzo; Duboc, Véronique; Duloquin, Louise; Flytzanis, Constantin; Gache, Christian; Lapraz, François; Lepage, Thierry; Locascio, Annamaria; Martinez, Pedro; Matassi, Giorgio; Matranga, Valeria; Range, Ryan; Rizzo, Francesca; Röttinger, Eric; Beane, Wendy; Bradham, Cynthia; Byrum, Christine; Glenn, Tom; Hussain, Sofia; Manning, Gerard; Miranda, Esther; Thomason, Rebecca; Walton, Katherine; Wikramanayke, Athula; Wu, Shu-Yu; Xu, Ronghui; Brown, C Titus; Chen, Lili; Gray, Rachel F; Lee, Pei Yun; Nam, Jongmin; Oliveri, Paola; Smith, Joel; Muzny, Donna; Bell, Stephanie; Chacko, Joseph; Cree, Andrew; Curry, Stacey; Davis, Clay; Dinh, Huyen; Dugan-Rocha, Shannon; Fowler, Jerry; Gill, Rachel; Hamilton, Cerrissa; Hernandez, Judith; Hines, Sandra; Hume, Jennifer; Jackson, Laronda; Jolivet, Angela; Kovar, Christie; Lee, Sandra; Lewis, Lora; Miner, George; Morgan, Margaret; Nazareth, Lynne V; Okwuonu, Geoffrey; Parker, David; Pu, Ling-Ling; Thorn, Rachel; Wright, Rita

2006-11-10

We report the sequence and analysis of the 814-megabase genome of the sea urchin Strongylocentrotus purpuratus, a model for developmental and systems biology. The sequencing strategy combined whole-genome shotgun and bacterial artificial chromosome (BAC) sequences. This use of BAC clones, aided by a pooling strategy, overcame difficulties associated with high heterozygosity of the genome. The genome encodes about 23,300 genes, including many previously thought to be vertebrate innovations or known only outside the deuterostomes. This echinoderm genome provides an evolutionary outgroup for the chordates and yields insights into the evolution of deuterostomes.
A draft physical map of a D-genome cotton species (Gossypium raimondii)

PubMed Central

2010-01-01

Background: Genetically anchored physical maps of large eukaryotic genomes have proven useful both for their intrinsic merit and as an adjunct to genome sequencing. Cultivated tetraploid cottons, Gossypium hirsutum and G. barbadense, share a common ancestor formed by a merger of the A and D genomes about 1-2 million years ago. Toward the long-term goal of characterizing the spectrum of diversity among cotton genomes, the worldwide cotton community has prioritized the D genome progenitor Gossypium raimondii for complete sequencing. Results: A whole genome physical map of G. raimondii, the putative D genome ancestral species of tetraploid cottons, was assembled, integrating genetically-anchored overgo hybridization probes, agarose based fingerprints and 'high information content fingerprinting' (HICF). A total of 13,662 BAC-end sequences and 2,828 DNA probes were used in genetically anchoring 1585 contigs to a cotton consensus genetic map, and 370 and 438 contigs, respectively, to Arabidopsis thaliana (AT) and Vitis vinifera (VV) whole genome sequences. Conclusion: Several lines of evidence suggest that the G. raimondii genome is comprised of two qualitatively different components. Much of the gene rich component is aligned to the Arabidopsis and Vitis vinifera genomes and shows promise for utilizing translational genomic approaches in understanding this important genome and its resident genes. The integrated genetic-physical map is of value both in assembling and validating a planned reference sequence. PMID:20569427
Multiplex PCR method for MinION and Illumina sequencing of Zika and other virus genomes directly from clinical samples.

PubMed

Quick, Joshua; Grubaugh, Nathan D; Pullan, Steven T; Claro, Ingra M; Smith, Andrew D; Gangavarapu, Karthik; Oliveira, Glenn; Robles-Sikisaka, Refugio; Rogers, Thomas F; Beutler, Nathan A; Burton, Dennis R; Lewis-Ximenez, Lia Laura; de Jesus, Jaqueline Goes; Giovanetti, Marta; Hill, Sarah C; Black, Allison; Bedford, Trevor; Carroll, Miles W; Nunes, Marcio; Alcantara, Luiz Carlos; Sabino, Ester C; Baylis, Sally A; Faria, Nuno R; Loose, Matthew; Simpson, Jared T; Pybus, Oliver G; Andersen, Kristian G; Loman, Nicholas J

2017-06-01

Genome sequencing has become a powerful tool for studying emerging infectious diseases; however, genome sequencing directly from clinical samples (i.e., without isolation and culture) remains challenging for viruses such as Zika, for which metagenomic sequencing methods may generate insufficient numbers of viral reads. Here we present a protocol for generating coding-sequence-complete genomes, comprising an online primer design tool, a novel multiplex PCR enrichment protocol, optimized library preparation methods for the portable MinION sequencer (Oxford Nanopore Technologies) and the Illumina range of instruments, and a bioinformatics pipeline for generating consensus sequences. The MinION protocol does not require an Internet connection for analysis, making it suitable for field applications with limited connectivity. Our method relies on multiplex PCR for targeted enrichment of viral genomes from samples containing as few as 50 genome copies per reaction. Viral consensus sequences can be achieved in 1-2 d by starting with clinical samples and following a simple laboratory workflow. This method has been successfully used by several groups studying Zika virus evolution and is facilitating an understanding of the spread of the virus in the Americas. The protocol can be used to sequence other viral genomes using the online Primal Scheme primer designer software. It is suitable for sequencing either RNA or DNA viruses in the field during outbreaks or as an inexpensive, convenient method for use in the lab.
EGenBio: A Data Management System for Evolutionary Genomics and Biodiversity

PubMed Central

Nahum, Laila A; Reynolds, Matthew T; Wang, Zhengyuan O; Faith, Jeremiah J; Jonna, Rahul; Jiang, Zhi J; Meyer, Thomas J; Pollock, David D

2006-01-01

Background Evolutionary genomics requires management and filtering of large numbers of diverse genomic sequences for accurate analysis and inference on evolutionary processes of genomic and functional change. We developed Evolutionary Genomics and Biodiversity (EGenBio) to begin to address this. Description EGenBio is a system for manipulation and filtering of large numbers of sequences, integrating curated sequence alignments and phylogenetic trees, managing evolutionary analyses, and visualizing their output. EGenBio is organized into three conceptual divisions, Evolution, Genomics, and Biodiversity. The Genomics division includes tools for selecting pre-aligned sequences from different genes and species, and for modifying and filtering these alignments for further analysis. Species searches are handled through queries that can be modified based on a tree-based navigation system and saved. The Biodiversity division contains tools for analyzing individual sequences or sequence alignments, whereas the Evolution division contains tools involving phylogenetic trees. Alignments are annotated with analytical results and modification history using our PRAED format. A miscellaneous Tools section and Help framework are also available. EGenBio was developed around our comparative genomic research and a prototype database of mtDNA genomes. It utilizes MySQL-relational databases and dynamic page generation, and calls numerous custom programs. Conclusion EGenBio was designed to serve as a platform for tools and resources to ease combined analysis in evolution, genomics, and biodiversity. PMID:17118150
The Yak genome database: an integrative database for studying yak biology and high-altitude adaption

PubMed Central

2012-01-01

Background The yak (Bos grunniens) is a long-haired bovine that lives at high altitudes and is an important source of milk, meat, fiber and fuel. The recent sequencing, assembly and annotation of its genome are expected to further our understanding of the means by which it has adapted to life at high altitudes and its ecologically important traits. Description The Yak Genome Database (YGD) is an internet-based resource that provides access to genomic sequence data and predicted functional information concerning the genes and proteins of Bos grunniens. The curated data stored in the YGD includes genome sequences, predicted genes and associated annotations, non-coding RNA sequences, transposable elements, single nucleotide variants, and three-way whole-genome alignments between human, cattle and yak. YGD offers useful searching and data mining tools, including the ability to search for genes by name or using function keywords as well as GBrowse genome browsers and/or BLAST servers, which can be used to visualize genome regions and identify similar sequences. Sequence data from the YGD can also be downloaded to perform local searches. Conclusions A new yak genome database (YGD) has been developed to facilitate studies on high-altitude adaption and bovine genomics. The database will be continuously updated to incorporate new information such as transcriptome data and population resequencing data. The YGD can be accessed at http://me.lzu.edu.cn/yak. PMID:23134687
Assemblathon 2: evaluating de novo methods of genome assembly in three vertebrate species

PubMed Central

2013-01-01

Background The process of generating raw genome sequence data continues to become cheaper, faster, and more accurate. However, assembly of such data into high-quality, finished genome sequences remains challenging. Many genome assembly tools are available, but they differ greatly in terms of their performance (speed, scalability, hardware requirements, acceptance of newer read technologies) and in their final output (composition of assembled sequence). More importantly, it remains largely unclear how to best assess the quality of assembled genome sequences. The Assemblathon competitions are intended to assess current state-of-the-art methods in genome assembly. Results In Assemblathon 2, we provided a variety of sequence data to be assembled for three vertebrate species (a bird, a fish, and a snake). This resulted in a total of 43 submitted assemblies from 21 participating teams. We evaluated these assemblies using a combination of optical map data, Fosmid sequences, and several statistical methods. From over 100 different metrics, we chose ten key measures by which to assess the overall quality of the assemblies. Conclusions Many current genome assemblers produced useful assemblies, containing a significant representation of their genes and overall genome structure. However, the high degree of variability between the entries suggests that there is still much room for improvement in the field of genome assembly and that approaches which work well in assembling the genome of one species may not necessarily work well for another. PMID:23870653
Discovery and Complete Genome Sequence of a Bacteriophage from an Obligate Intracellular Symbiont of a Cellulolytic Protist in the Termite Gut

PubMed Central

Pramono, Ajeng K.; Kuwahara, Hirokazu; Itoh, Takehiko; Toyoda, Atsushi; Yamada, Akinori; Hongoh, Yuichi

2017-01-01

Termites depend nutritionally on their gut microbes, and protistan, bacterial, and archaeal gut communities have been extensively studied. However, limited information is available on viruses in the termite gut. We herein report the complete genome sequence (99,517 bp) of a phage obtained during a genome analysis of “Candidatus Azobacteroides pseudotrichonymphae” phylotype ProJPt-1, which is an obligate intracellular symbiont of the cellulolytic protist Pseudotrichonympha sp. in the gut of the termite Prorhinotermes japonicus. The genome of the phage, designated ProJPt-Bp1, was circular or circularly permuted, and was not integrated into the two circular chromosomes or five circular plasmids composing the host ProJPt-1 genome. The phage was putatively affiliated with the order Caudovirales based on sequence similarities with several phage-related genes; however, most of the 52 protein-coding sequences had no significant homology to sequences in the databases. The phage genome contained a tRNA-Gln (CAG) gene, which showed the highest sequence similarity to the tRNA-Gln (CAA) gene of the host “Ca. A. pseudotrichonymphae” phylotype ProJPt-1. Since the host genome lacked a tRNA-Gln (CAG) gene, the phage tRNA gene may compensate for differences in codon usage bias between the phage and host genomes. The phage genome also contained a non-coding region with high nucleotide sequence similarity to a region in one of the host plasmids. No other phage-related sequences were found in the host ProJPt-1 genome. To the best of our knowledge, this is the first report of a phage from an obligate, mutualistic endosymbiont permanently associated with eukaryotic cells. PMID:28321010
High-throughput sequencing of three Lemnoideae (duckweeds) chloroplast genomes from total DNA.

PubMed

Wang, Wenqin; Messing, Joachim

2011-01-01

Chloroplast genomes provide a wealth of information for evolutionary and population genetic studies. Chloroplasts play a particularly important role in the adaptation of aquatic plants because these plants float on water and their major surface is exposed continuously to sunlight. The subfamily Lemnoideae represents such a collection of aquatic species, which, because of photosynthesis, are among the fastest growing plants on earth. We sequenced the chloroplast genomes from three different genera of Lemnoideae, Spirodela polyrhiza, Wolffiella lingulata and Wolffia australiana, by high-throughput DNA sequencing of genomic DNA using the SOLiD platform. Unfractionated total DNA contains high copy numbers of plastid DNA so that sequences from the nucleus and mitochondria can easily be filtered computationally. Remaining sequence reads were assembled into contiguous sequences (contigs) using SOLiD software tools. Contigs were mapped to a reference genome of Lemna minor and gaps, selected by PCR, were sequenced on the ABI3730xl platform. This combinatorial approach yielded whole genomic contiguous sequences in a cost-effective manner. More than 1,000-fold coverage of the chloroplast genome from total DNA was reached by the SOLiD platform in a single spot on a quadrant slide without purification. Comparative analysis indicated that the chloroplast genome was conserved in gene number and organization with respect to the reference genome of L. minor. However, higher nucleotide substitution, abundant deletions and insertions occurred in non-coding regions of these genomes, indicating greater genomic dynamics than expected from the comparison of other related species in the Pooideae. Notably, there was no transition bias over transversion in Lemnoideae. The data should have immediate applications in evolutionary biology and plant taxonomy with increased resolution and statistical power.
High-Throughput Sequencing of Three Lemnoideae (Duckweeds) Chloroplast Genomes from Total DNA

PubMed Central

Wang, Wenqin; Messing, Joachim

2011-01-01

Background Chloroplast genomes provide a wealth of information for evolutionary and population genetic studies. Chloroplasts play a particularly important role in the adaptation of aquatic plants because these plants float on water and their major surface is exposed continuously to sunlight. The subfamily Lemnoideae represents such a collection of aquatic species, which, because of photosynthesis, are among the fastest growing plants on earth. Methods We sequenced the chloroplast genomes from three different genera of Lemnoideae, Spirodela polyrhiza, Wolffiella lingulata and Wolffia australiana, by high-throughput DNA sequencing of genomic DNA using the SOLiD platform. Unfractionated total DNA contains high copy numbers of plastid DNA so that sequences from the nucleus and mitochondria can easily be filtered computationally. Remaining sequence reads were assembled into contiguous sequences (contigs) using SOLiD software tools. Contigs were mapped to a reference genome of Lemna minor and gaps, selected by PCR, were sequenced on the ABI3730xl platform. Conclusions This combinatorial approach yielded whole genomic contiguous sequences in a cost-effective manner. More than 1,000-fold coverage of the chloroplast genome from total DNA was reached by the SOLiD platform in a single spot on a quadrant slide without purification. Comparative analysis indicated that the chloroplast genome was conserved in gene number and organization with respect to the reference genome of L. minor. However, higher nucleotide substitution, abundant deletions and insertions occurred in non-coding regions of these genomes, indicating greater genomic dynamics than expected from the comparison of other related species in the Pooideae. Notably, there was no transition bias over transversion in Lemnoideae. The data should have immediate applications in evolutionary biology and plant taxonomy with increased resolution and statistical power. PMID:21931804
Sequencing and characterizing the genome of Estrella lausannensis as an undergraduate project: training students and biological insights.

PubMed

Bertelli, Claire; Aeby, Sébastien; Chassot, Bérénice; Clulow, James; Hilfiker, Olivier; Rappo, Samuel; Ritzmann, Sébastien; Schumacher, Paolo; Terrettaz, Céline; Benaglio, Paola; Falquet, Laurent; Farinelli, Laurent; Gharib, Walid H; Goesmann, Alexander; Harshman, Keith; Linke, Burkhard; Miyazaki, Ryo; Rivolta, Carlo; Robinson-Rechavi, Marc; van der Meer, Jan Roelof; Greub, Gilbert

2015-01-01

With the widespread availability of high-throughput sequencing technologies, sequencing projects have become pervasive in the molecular life sciences. The huge bulk of data generated daily must be analyzed further by biologists with skills in bioinformatics and by "embedded bioinformaticians," i.e., bioinformaticians integrated in wet lab research groups. Thus, students interested in molecular life sciences must be trained in the main steps of genomics: sequencing, assembly, annotation and analysis. To reach that goal, a practical course has been set up for master's students at the University of Lausanne: the "Sequence a genome" class. At the beginning of the academic year, a few bacterial species whose genomes are unknown are provided to the students, who sequence and assemble the genome(s) and perform manual annotation. Here, we report the progress of the first class from September 2010 to June 2011 and the results obtained by seven master's students who specifically assembled and annotated the genome of Estrella lausannensis, an obligate intracellular bacterium related to Chlamydia. The draft genome of Estrella is composed of 29 scaffolds encompassing 2,819,825 bp that encode 2233 putative proteins. Estrella also possesses a 9136 bp plasmid that carries 14 genes, among which we found an integrase and a toxin/antitoxin module. Like all other members of the Chlamydiales order, Estrella possesses a highly conserved type III secretion system, considered a key virulence factor. The annotation of the Estrella genome also allowed the characterization of the metabolic abilities of this strictly intracellular bacterium. Altogether, the students provided the scientific community with the Estrella genome sequence and a preliminary understanding of the biology of this recently-discovered bacterial genus, while learning to use cutting-edge technologies for sequencing and to perform bioinformatics analyses.
Genome sequence and analysis of a stress-tolerant, wild-derived strain of Saccharomyces cerevisiae used in biofuels research

DOE Office of Scientific and Technical Information (OSTI.GOV)

McIlwain, Sean J.; Peris, Davis; Sardi, Maria

The genome sequences of more than 100 strains of the yeast Saccharomyces cerevisiae have been published. Unfortunately, most of these genome assemblies contain dozens to hundreds of gaps at repetitive sequences, including transposable elements, tRNAs, and subtelomeric regions, which is where novel genes generally reside. Relatively few strains have been chosen for genome sequencing based on their biofuel production potential, leaving an additional knowledge gap. Here, we describe the nearly complete genome sequence of GLBRCY22-3 (Y22-3), a strain of S. cerevisiae derived from the stress-tolerant wild strain NRRL YB-210 and subsequently engineered for xylose metabolism. After benchmarking several genome assembly approaches, we developed a pipeline to integrate Pacific Biosciences (PacBio) and Illumina sequencing data and achieved one of the highest quality genome assemblies for any S. cerevisiae strain. Specifically, the contig N50 is 693 kbp, and the sequences of most chromosomes, the mitochondrial genome, and the 2-micron plasmid are complete. Our annotation predicts 92 genes that are not present in the reference genome of the laboratory strain S288c, over 70% of which were expressed. We predicted functions for 43 of these genes, 28 of which were previously uncharacterized and unnamed. Remarkably, many of these genes are predicted to be involved in stress tolerance and carbon metabolism and are shared with a Brazilian bioethanol production strain, even though the strains differ dramatically at most genetic loci. Lastly, the Y22-3 genome sequence provides an exceptionally high-quality resource for basic and applied research in bioenergy and genetics.
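
The contig N50 quoted in this abstract is a standard assembly contiguity statistic: the length of the shortest contig in the smallest set of longest contigs that together cover at least half of the assembled bases. As a minimal sketch of how such a value is usually computed (the function name `n50` and the example lengths are illustrative assumptions, not taken from the authors' pipeline):

```python
def n50(contig_lengths):
    """Return the N50 of a collection of contig lengths (in bases)."""
    # Sort from longest to shortest so the largest contigs are counted first.
    lengths = sorted(contig_lengths, reverse=True)
    if not lengths:
        raise ValueError("no contig lengths given")
    half_total = sum(lengths) / 2
    running = 0
    for length in lengths:
        running += length
        # The first contig that brings the running sum to half of the
        # assembly size defines the N50.
        if running >= half_total:
            return length


# Hypothetical toy assembly: seven contigs totalling 300 bases.
example_lengths = [80, 70, 50, 40, 30, 20, 10]
print(n50(example_lengths))
```

A higher N50 means that half of the assembled sequence sits in longer contigs, but the statistic says nothing about correctness, so it is normally read alongside other quality measures.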
