Sample records for true fisp sequence

  1. Comparison of MR imaging sequences for liver and head and neck interventions: is there a single optimal sequence for all purposes?

    PubMed

    Boll, Daniel T; Lewin, Jonathan S; Duerk, Jeffrey L; Aschoff, Andrik J; Merkle, Elmar M

    2004-05-01

    To compare the appropriate pulse sequences for interventional device guidance during magnetic resonance (MR) imaging at 0.2 T and to evaluate the dependence of sequence selection on the anatomic region of the procedure. Using a C-arm 0.2 T system, four interventional MR sequences were applied in 23 liver cases and during MR-guided neck interventions in 13 patients. The imaging protocol consisted of: multislice turbo spin echo (TSE) T2w, sequential-slice fast imaging with steady precession (FISP), a time-reversed version of FISP (PSIF), and FISP with balanced gradients in all spatial directions (True-FISP) sequences. Vessel conspicuity was rated and contrast-to-noise ratio (CNR) was calculated for each sequence and a differential receiver operating characteristic was performed. Liver findings were detected in 96% using the TSE sequence. PSIF, FISP, and True-FISP imaging showed lesions in 91%, 61%, and 65%, respectively. The TSE sequence offered the best CNR, followed by PSIF imaging. Differential receiver operating characteristic analysis also rated TSE and PSIF to be the superior sequences. Lesions in the head and neck were detected in all cases by TSE and FISP, in 92% using True-FISP, and in 84% using PSIF. True-FISP offered the best CNR, followed by TSE imaging. Vessels appeared bright on FISP and True-FISP imaging and dark on the other sequences. In interventional MR imaging, no single sequence fits all purposes. Image guidance for interventional MR during liver procedures is best achieved by PSIF or TSE, whereas biopsies in the head and neck are best performed using FISP or True-FISP sequences.

  2. Depicting the semicircular canals with inner-ear MRI: a comparison of the SPACE and TrueFISP sequences.

    PubMed

    Kojima, Shinya; Suzuki, Kazufumi; Hirata, Masami; Shinohara, Hiroyuki; Ueno, Eiko

    2013-03-01

    To assess the ability of magnetic resonance imaging (MRI) to depict the semicircular canals of the inner ear by comparing results from the sampling perfection with application-optimized contrasts by using different flip angle evolutions (SPACE) sequence with those from the true free induction with steady precession (TrueFISP) sequence. A 1.5-T MRI system was used to perform an in vivo study of 10 healthy volunteers and 17 patients. A three-point visual score was employed for assessing the depiction of the semicircular canals and facial and vestibulocochlear nerves and the contrast-to-noise ratio (CNR) was computed for the vestibule and pons on images with the SPACE and TrueFISP sequences. There were no susceptibility artifact-related filling defects with the SPACE sequence. However, the TrueFISP sequence showed filling defects for at least one semicircular canal on both sides in seven cases for healthy subjects and in 10 cases for patients. The CNR with the SPACE sequence was significantly higher than with the TrueFISP sequence (P < 0.05). There was no statistically significant difference in depicting the facial and the vestibulocochlear nerves (P = 0.32). For the depiction of the semicircular canal, the SPACE sequence is superior to the TrueFISP sequence. Copyright © 2012 Wiley Periodicals, Inc.
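
    The abstract above reports a contrast-to-noise ratio (CNR) between the vestibule and the pons but does not state the formula used. A minimal sketch, assuming one common definition (absolute difference of the two region means divided by the standard deviation of a background region); the function name and all pixel values are hypothetical and are not data from the study:

```python
from statistics import mean, stdev


def contrast_to_noise(roi_a, roi_b, background):
    """CNR as |mean(A) - mean(B)| divided by the sample SD of a background region."""
    noise_sd = stdev(background)
    if noise_sd == 0:
        raise ValueError("background region has zero variance")
    return abs(mean(roi_a) - mean(roi_b)) / noise_sd


# Hypothetical pixel intensities, for illustration only.
vestibule = [410, 395, 402, 398, 405]
pons = [150, 160, 155, 148, 157]
air = [10, 14, 12, 8, 16]

cnr = contrast_to_noise(vestibule, pons, air)
print(f"CNR = {cnr:.1f}")
```

    Other CNR definitions exist (for example, using the noise of one of the tissues instead of a background region), so values computed this way are only comparable within a single convention.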

  3. Quantification of glomerular filtration rate by measurement of gadobutrol clearance from the extracellular fluid volume: comparison of a TurboFLASH and a TrueFISP approach

    NASA Astrophysics Data System (ADS)

    Boss, Andreas; Martirosian, Petros; Artunc, Ferruh; Risler, Teut; Claussen, Claus D.; Schlemmer, Heinz-Peter; Schick, Fritz

    2007-03-01

    Purpose: As the MR contrast-medium gadobutrol is completely eliminated via glomerular filtration, the glomerular filtration rate (GFR) can be quantified after bolus-injection of gadobutrol and complete mixing in the extracellular fluid volume (ECFV) by measuring the signal decrease within the liver parenchyma. Two different navigator-gated single-shot saturation-recovery sequences have been tested for suitability of GFR quantification: a TurboFLASH and a TrueFISP readout technique. Materials and Methods: Ten healthy volunteers (mean age 26.1+/-3.6) were equally divided into two subgroups. After bolus-injection of 0.05 mmol/kg gadobutrol, coronal single-slice images of the liver were recorded every 4-5 seconds during free breathing using either the TurboFLASH or the TrueFISP technique. Time-intensity curves were determined from manually drawn regions-of-interest over the liver parenchyma. Both sequences were subsequently evaluated regarding signal to noise ratio (SNR) and the behaviour of signal intensity curves. The calculated GFR values were compared to an iopromide clearance gold standard. Results: The TrueFISP sequence exhibited a 3.4-fold higher SNR as compared to the TurboFLASH sequence and markedly lower variability of the recorded time-intensity curves. The calculated mean GFR values were 107.0+/-16.1 ml/min/1.73m2 (iopromide: 92.1+/-14.5 ml/min/1.73m2) for the TrueFISP technique and 125.6+/-24.1 ml/min/1.73m2 (iopromide: 97.7+/-6.3 ml/min/1.73m2) for the TurboFLASH approach. The mean paired difference with TrueFISP was lower (15.0 ml/min/1.73m2) than with the TurboFLASH method (27.9 ml/min/1.73m2). Conclusion: The global GFR can be quantified via measurement of gadobutrol clearance from the ECFV. A saturation-recovery TrueFISP sequence allows for more reliable GFR quantification than a saturation-recovery TurboFLASH technique.
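
    The paired differences quoted in the abstract above can be cross-checked against the reported group means. The sketch below uses only numbers from the abstract; the dictionary layout and function name are illustrative choices:

```python
# Mean GFR values as reported in the abstract (ml/min/1.73 m^2).
reported = {
    "TrueFISP": {"mr": 107.0, "iopromide": 92.1, "mean_paired_diff": 15.0},
    "TurboFLASH": {"mr": 125.6, "iopromide": 97.7, "mean_paired_diff": 27.9},
}


def difference_of_means(entry):
    """MR-derived mean GFR minus the iopromide reference mean, rounded to 0.1."""
    return round(entry["mr"] - entry["iopromide"], 1)


diffs = {name: difference_of_means(entry) for name, entry in reported.items()}

for name, entry in reported.items():
    print(name, "difference of means:", diffs[name],
          "| reported mean paired difference:", entry["mean_paired_diff"])
```

    For paired data the mean of the differences equals the difference of the means, so the two columns should agree. The 0.1 gap for TrueFISP is consistent with rounding of the published means, although the abstract does not say so explicitly.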

  4. Lung MRI at 1.5 and 3 Tesla: observer preference study and lesion contrast using five different pulse sequences.

    PubMed

    Fink, Christian; Puderbach, Michael; Biederer, Juergen; Fabel, Michael; Dietrich, Olaf; Kauczor, Hans-Ulrich; Reiser, Maximilian F; Schönberg, Stefan O

    2007-06-01

    To compare the image quality and lesion contrast of lung MRI using 5 different pulse sequences at 1.5 T and 3 T. Lung MRI was performed at 1.5 T and 3 T using 5 pulse sequences which have been previously proposed for lung MRI: 3D volumetric interpolated breath-hold examination (VIBE), true fast imaging with steady-state precession (TrueFISP), half-Fourier single-shot turbo spin-echo (HASTE), short tau inversion recovery (STIR), T2-weighted turbo spin-echo (TSE). In addition to 4 healthy volunteers, 5 porcine lungs were examined in a dedicated chest phantom. Lung pathology (nodules and infiltrates) was simulated in the phantom by intrapulmonary and intrabronchial injections of agarose. CT was performed in the phantom for correlation. Image quality of the sequences was ranked in a side-by-side comparison by 3 blinded radiologists regarding the delineation of pulmonary and mediastinal anatomy, conspicuity of pulmonary nodules and infiltrates, and presence of artifacts. The contrast of nodules and infiltrates (CNODULES and CINFILTRATES) defined by the ratio of the signal intensities of the lesion and adjacent normal lung parenchyma was determined. There were no relevant differences regarding the preference for the individual sequences between both field strengths. TSE was the preferred sequence for the visualization of the mediastinum at both field strengths. For the visualization of lung parenchyma the observers preferred TrueFISP in volunteers and TSE in the phantom studies. At both field strengths VIBE achieved the best rating for the depiction of nodules, whereas HASTE was rated best for the delineation of infiltrates. TrueFISP had the fewest artifacts in volunteers, whereas STIR showed the fewest artifacts in the phantom. For all but the TrueFISP sequence the lesion contrast increased from 1.5 T to 3 T. At both field strengths VIBE showed the highest CNODULES (6.6 and 7.1) and HASTE the highest CINFILTRATES (6.1 and 6.3). 
The imaging characteristics of different pulse sequences used for lung MRI do not substantially differ between 1.5 T and 3 T. A higher lesion contrast can be expected at 3 T.

  5. Comparison of 3D bone models of the knee joint derived from CT and 3T MR imaging.

    PubMed

    Neubert, Aleš; Wilson, Katharine J; Engstrom, Craig; Surowiec, Rachel K; Paproki, Anthony; Johnson, Nicholas; Crozier, Stuart; Fripp, Jurgen; Ho, Charles P

    2017-08-01

    To examine whether magnetic resonance (MR) imaging can offer a viable alternative to computed tomography (CT) based 3D bone modeling. CT and MR (SPACE, TrueFISP, VIBE) images were acquired from the left knee joint of a fresh-frozen cadaver. The distal femur, proximal tibia, proximal fibula and patella were manually segmented from the MR and CT examinations. The MR bone models obtained from manual segmentations of all three sequences were compared to CT models using a similarity measure based on absolute mesh differences. The average absolute distance between the CT and the various MR-based bone models were all below 1mm across all bones. The VIBE sequence provided the best agreement with the CT model, followed by the SPACE, then the TrueFISP data. The most notable difference was for the proximal tibia (VIBE 0.45mm, SPACE 0.82mm, TrueFISP 0.83mm). The study indicates that 3D MR bone models may offer a feasible alternative to traditional CT-based modeling. A single radiological examination using the MR imaging would allow simultaneous assessment of both bones and soft-tissues, providing anatomically comprehensive joint models for clinical evaluation, without the ionizing radiation of CT imaging. Copyright © 2017 Elsevier B.V. All rights reserved.

  6. Unenhanced respiratory-gated magnetic resonance angiography (MRA) of renal artery in hypertensive patients using true fast imaging with steady-state precession technique compared with contrast-enhanced MRA.

    PubMed

    Zhang, Weisheng; Lin, Jiang; Wang, Shaowu; Lv, Peng; Wang, Lili; Liu, Hao; Chen, Caizhong; Zeng, Mengsu

    2014-01-01

    This study aimed to evaluate the accuracy of "True Fast Imaging with Steady-State Precession" (TrueFISP) MR angiography (MRA) for diagnosis of renal arterial stenosis (RAS) in hypertensive patients. Twenty-two patients underwent both TrueFISP MRA and contrast-enhanced MRA (CE-MRA) on a 1.5-T MR imager. Volume of main renal arteries, length of maximal visible renal arteries, number of visualized branches, stenotic grade, and subjective quality were compared. Paired 2-tailed Student t test and Wilcoxon signed rank test were applied to evaluate the significance of these variables. Volume of main renal arteries, length of maximal visible renal arteries, and number of branches indicated no significant difference between the 2 techniques (P > 0.05). Stenotic degree of 10 RAS was greater on CE-MRA than on TrueFISP MRA. Qualitative scores from TrueFISP MRA were higher than those from CE-MRA (P < 0.05). TrueFISP MRA is a reliable and accurate method for evaluating RAS.

  7. Suitability of miniature inductively coupled RF coils as MR-visible markers for clinical purposes.

    PubMed

    Garnov, Nikita; Thormer, Gregor; Trampel, Robert; Grunder, Wilfried; Kahn, Thomas; Moche, Michael; Busse, Harald

    2011-11-01

    MR-visible markers have already been used for various purposes such as image registration, motion detection, and device tracking. Inductively coupled RF (ICRF) coils, in particular, provide a high contrast and do not require connecting wires to the scanner, which makes their application highly flexible and safe. This work aims to thoroughly characterize the MR signals of such ICRF markers under various conditions with a special emphasis on fully automatic detection. The small markers consisted of a solenoid coil that was wound around a glass tube containing the MR signal source and tuned to the resonance frequency of a 1.5 T MRI. Marker imaging was performed with a spoiled gradient echo sequence (FLASH) and a balanced steady-state free precession (SSFP) sequence (TrueFISP) in three standard projections. The signal intensities of the markers were recorded for both pulse sequences, three source materials (tap water, distilled water, and contrast agent solution), different flip angles and coil alignments with respect to the B(0) direction as well as for different marker positions in the entire imaging volume (field of view, FOV). Heating of the ICRF coils was measured during 10-min RF expositions to three conventional pulse sequences. Clinical utility of the markers was assessed from their performance in computer-aided detection and in defining double oblique scan planes. For almost the entire FOV (±215 mm) and an estimated 82% of all possible RF coil alignments with respect to B(0), the ICRF markers generated clearly visible MR signals and could be reliably localized over a large range of flip angles, in particular with the TrueFISP sequence (0.3°-4.0°). Generally, TrueFISP provided a higher marker contrast than FLASH. RF exposition caused a moderate heating (≤5 °C) of the ICRF coils only. 
Small ICRF coils, imaged at low flip angles with a balanced SSFP sequence showed an excellent performance under a variety of experimental conditions and therefore make for a reliable, compact, flexible, and relatively safe marker for clinical use.

  8. Determination of the rCBF in the Amygdala and Rhinal Cortex Using a FAIR-TrueFISP Sequence

    PubMed Central

    Martirosian, Petros; Klose, Uwe; Nägele, Thomas; Schick, Fritz; Ernemann, Ulrike

    2011-01-01

    Objective Brain perfusion can be assessed non-invasively by modern arterial spin labeling MRI. The FAIR (flow-sensitive alternating inversion recovery)-TrueFISP (true fast imaging in steady precession) technique was applied for regional assessment of cerebral blood flow in brain areas close to the skull base, since this approach provides low sensitivity to magnetic susceptibility effects. The investigation of the rhinal cortex and the amygdala is a potentially important feature for the diagnosis and research on dementia in its early stages. Materials and Methods Twenty-three subjects with no structural or psychological impairment were investigated. FAIR-True-FISP quantitative perfusion data were evaluated in the amygdala on both sides and in the pons. A preparation of the radiofrequency FOCI (frequency offset corrected inversion) pulse was used for slice selective inversion. After a time delay of 1.2 sec, data acquisition began. Imaging slice thickness was 5 mm and inversion slab thickness for slice selective inversion was 12.5 mm. Image matrix size for perfusion images was 64 × 64 with a field of view of 256 × 256 mm, resulting in a spatial resolution of 4 × 4 × 5 mm. Repetition time was 4.8 ms; echo time was 2.4 ms. Acquisition time for the 50 sets of FAIR images was 6:56 min. Data were compared with perfusion data from the literature. Results Perfusion values in the right amygdala, left amygdala and pons were 65.2 (± 18.2) mL/100 g/minute, 64.6 (± 21.0) mL/100 g/minute, and 74.4 (± 19.3) mL/100 g/minute, respectively. These values were higher than formerly published data using continuous arterial spin labeling but similar to 15O-PET (oxygen-15 positron emission tomography) data. Conclusion The FAIR-TrueFISP approach is feasible for the quantitative assessment of perfusion in the amygdala. Data are comparable with formerly published data from the literature. 
The applied technique provided excellent image quality, even for brain regions located at the skull base in the vicinity of marked susceptibility steps. PMID:21927556
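
    The spatial resolution and timing figures in the abstract above follow from simple arithmetic on the stated acquisition parameters. A short sketch using only the numbers given there (the function and variable names are illustrative):

```python
def voxel_size_mm(fov_mm, matrix, slice_thickness_mm):
    """In-plane voxel edge = field of view / matrix size, per axis."""
    return (fov_mm[0] / matrix[0], fov_mm[1] / matrix[1], slice_thickness_mm)


# 256 x 256 mm field of view, 64 x 64 matrix, 5 mm slice thickness.
voxel = voxel_size_mm(fov_mm=(256, 256), matrix=(64, 64), slice_thickness_mm=5)
voxel_volume_mm3 = voxel[0] * voxel[1] * voxel[2]

# 6:56 min total acquisition time for 50 sets of FAIR images.
total_seconds = 6 * 60 + 56
seconds_per_set = total_seconds / 50

print("voxel (mm):", voxel)
print("voxel volume (mm^3):", voxel_volume_mm3)
print("seconds per FAIR image set:", seconds_per_set)
```

    The result reproduces the 4 × 4 × 5 mm resolution quoted in the abstract; the per-set time is a plain average and says nothing about how that time is split between labeling, delay, and readout.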

  9. Perfusion in Rat Brain at 7 T with Arterial Spin Labeling Using FAIR-TrueFISP and QUIPSS

    PubMed Central

    Esparza-Coss, Emilio; Wosik, Jarek; Narayana, Ponnada A.

    2010-01-01

    Measurement of perfusion in longitudinal studies allows for the assessment of tissue integrity and the detection of subtle pathologies. In this work, the feasibility of measuring brain perfusion in rats with high spatial resolution using arterial spin labeling (ASL) is reported. A flow-sensitive alternating inversion recovery (FAIR) sequence, coupled with a balanced gradient fast imaging with steady state precession (TrueFISP) readout section was used to minimize ghosting and geometric distortions, while achieving high SNR. The quantitative imaging of perfusion using a single subtraction (QUIPSS) method was implemented to address the effects of variable transit delays between the labeling of spins and their arrival at the imaging slice. Studies in six rats at 7 T showed good perfusion contrast with minimal geometric distortion. The measured blood flow values of 152.5 ± 6.3 ml/100g/min in gray matter and 72.3 ± 14.0 ml/100g/min in white matter are in good agreement with previously reported values based on autoradiography, considered to be the gold standard. PMID:20299174

  10. Non-enhanced magnetic resonance imaging of the small bowel at 7 Tesla in comparison to 1.5 Tesla: First steps towards clinical application.

    PubMed

    Hahnemann, Maria L; Kraff, Oliver; Maderwald, Stefan; Johst, Soeren; Orzada, Stephan; Umutlu, Lale; Ladd, Mark E; Quick, Harald H; Lauenstein, Thomas C

    2016-06-01

    To perform non-enhanced (NE) magnetic resonance imaging (MRI) of the small bowel at 7 Tesla (7T) and to compare it with 1.5 Tesla (1.5T). Twelve healthy subjects were prospectively examined using a 1.5T and 7T MRI system. Coronal and axial true fast imaging with steady-state precession (TrueFISP) imaging and a coronal T2-weighted (T2w) half-Fourier acquisition single-shot turbo spin-echo (HASTE) sequence were acquired. Image analysis was performed by 1) visual evaluation of tissue contrast and detail detectability, 2) measurement and calculation of contrast ratios and 3) assessment of artifacts. NE MRI of the small bowel at 7T was technically feasible. In the vast majority of the cases, tissue contrast and image details were equivalent at both field strengths. At 7T, two cases revealed better detail detectability in the TrueFISP, and better contrast in the HASTE. Susceptibility artifacts and B1 inhomogeneities were significantly increased at 7T. This study provides first insights into NE ultra-high field MRI of the small bowel and may be considered an important step towards high quality T2w abdominal imaging at 7T MRI. Copyright © 2016 Elsevier Inc. All rights reserved.

  11. Arterial Spin Labeling - Fast Imaging with Steady-State Free Precession (ASL-FISP): A Rapid and Quantitative Perfusion Technique for High Field MRI

    PubMed Central

    Gao, Ying; Goodnough, Candida L.; Erokwu, Bernadette O.; Farr, George W.; Darrah, Rebecca; Lu, Lan; Dell, Katherine M.; Yu, Xin; Flask, Chris A.

    2014-01-01

    Arterial Spin Labeling (ASL) is a valuable non-contrast perfusion MRI technique with numerous clinical applications. Many previous ASL MRI studies have utilized either Echo-Planar Imaging (EPI) or True Fast Imaging with Steady-State Free Precession (True FISP) readouts that are prone to off-resonance artifacts on high field MRI scanners. We have developed a rapid ASL-FISP MRI acquisition for high field preclinical MRI scanners providing perfusion-weighted images with little or no artifacts in less than 2 seconds. In this initial implementation, a FAIR (Flow-Sensitive Alternating Inversion Recovery) ASL preparation was combined with a rapid, centrically-encoded FISP readout. Validation studies on healthy C57/BL6 mice provided consistent estimation of in vivo mouse brain perfusion at 7 T and 9.4 T (249±38 ml/min/100g and 241±17 ml/min/100g, respectively). The utility of this method was further demonstrated in detecting significant perfusion deficits in a C57/BL6 mouse model of ischemic stroke. Reasonable kidney perfusion estimates were also obtained for a healthy C57/BL6 mouse exhibiting differential perfusion in the renal cortex and medulla. Overall, the ASL-FISP technique provides a rapid and quantitative in vivo assessment of tissue perfusion for high field MRI scanners with minimal image artifacts. PMID:24891124

  12. Use of an advanced 3-T MRI movie to investigate articulation.

    PubMed

    Nunthayanon, Kulthida; Honda, Ei-ichi; Shimazaki, Kazuo; Ohmori, Hiroko; Inoue-Arai, Maristela Sayuri; Kurabayashi, Tohru; Ono, Takashi

    2015-06-01

    To develop a magnetic resonance imaging (MRI) movie to reveal the dynamic movement of articulators and teeth. Five healthy females with normal occlusion participated in this study. Various concentrations of MRI contrast media (ferric ammonium citrate [FAC]) were tested for visualization of teeth, according to facial markers and with the use of a gel. Custom-made circuitry was connected to synchronize pronunciation of fricative sounds (/asa/) with scans. Three gradient echo sequences (True fast imaging with steady state precession [true FISP], FISP, and fast low angle shot [FLASH]) with a segmented cine were tested with the use of repetition times (TRs) of 9 ms and 31.5 ms. The MRI movie images were superimposed over the boundaries of teeth. The images produced during pronunciation, using the two different TRs (9 ms and 31.5 ms), were compared to assess the position of the lips and the tongue. Images obtained using the FLASH sequence, with a TR of 9 ms or 31.5 ms, can be used for diagnostic purposes. A TR of 9 ms, with 161 continuous images acquired, produced the highest-quality images of teeth, with few artifacts present. Pronunciation of the consonant "s" was clearly discernible. Our 3-T MRI movie system, with a temporal resolution less than 9 ms, can provide detailed information pertaining to variations in speech or oropharyngeal function. Copyright © 2015 Elsevier Inc. All rights reserved.

  13. Magnetic Resonance Imaging of Pulmonary Embolism: Diagnostic Accuracy of Unenhanced MR and Influence in Mortality Rates.

    PubMed

    Pasin, Lilian; Zanon, Matheus; Moreira, Jose; Moreira, Ana Luiza; Watte, Guilherme; Marchiori, Edson; Hochhegger, Bruno

    2017-04-01

    We evaluated the diagnostic value for pulmonary embolism (PE) of the True fast imaging with steady-state precession (TrueFISP) MRI, a method that allows the visualization of pulmonary vasculature without breath holding or intravenous contrast. This is a prospective investigation including 93 patients with suspected PE. All patients underwent TrueFISP MRI after undergoing CT pulmonary angiography (CTPA). Two independent readers evaluated each MR study, and consensus was obtained. CTPA results were analysed by a third independent reviewer and these results served as the reference standard. A fourth radiologist was responsible for evaluating whether the lesions found on MRI in both analyses were the same and whether they corresponded to the lesions on the CTPA. Sensitivity, specificity, predictive values and accuracy were calculated. Evidence for death from PE within the 1-year follow-up was also assessed. Two patients could not undergo the real-time MRI and were excluded from the study. PE prevalence was 22%. During the 1-year follow-up period, eight patients died; PE was responsible for 12.5% of these deaths. Among patients who developed PE, only 5% died due to this condition. There were no differences between MR and CT embolism detection in these subjects. MR sequences had a sensitivity of 85%, specificity was 98.6% and accuracy was 95.6%. Agreement between readers was high (κ= 0.87). Compared with contrast-enhanced CT, unenhanced MR sequences demonstrate good accuracy and no differences in the mortality rates in 1 year were detected.
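
    The accuracy figures in the abstract above are standard confusion-matrix quantities. The abstract gives percentages only, so the counts below are an assumption: they are one reconstruction for the 91 analysed patients (93 enrolled minus 2 excluded) that happens to reproduce the reported prevalence, sensitivity, specificity, and accuracy. They are not data taken from the paper, and the function name is illustrative:

```python
def diagnostic_metrics(tp, fn, tn, fp):
    """Basic diagnostic accuracy measures from a 2x2 table against a reference standard."""
    total = tp + fn + tn + fp
    return {
        "prevalence": (tp + fn) / total,
        "sensitivity": tp / (tp + fn),
        "specificity": tn / (tn + fp),
        "accuracy": (tp + tn) / total,
        "ppv": tp / (tp + fp),
        "npv": tn / (tn + fn),
    }


# ASSUMED counts (not stated in the abstract): 20 CTPA-positive and
# 71 CTPA-negative patients among the 91 analysed.
metrics = diagnostic_metrics(tp=17, fn=3, tn=70, fp=1)

for name, value in metrics.items():
    print(f"{name}: {value * 100:.1f}%")
```

    With these assumed counts the sketch gives 22.0% prevalence, 85.0% sensitivity, 98.6% specificity, and 95.6% accuracy, matching the reported values; the predictive values it prints depend entirely on the assumed table and should not be read as study results.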

  14. SU-E-J-229: Magnetic Resonance Imaging of Small Fiducial Markers for Proton Beam Therapy

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Hu, Y; James, J; Panda, A

    2015-06-15

    Purpose: For proton beam therapy, small fiducial markers are preferred for patient alignment due to less interference with the proton beam. Visualizing small fiducial markers can be challenging in MRI. This study intends to investigate MRI imaging protocols for better visualization of small fiducial markers. Methods: Two carbon and two coil-shaped gold markers were placed into a gel phantom. Both carbon markers had a diameter of 1mm and a length of 3mm. Both gold markers had a length of 5mm. One gold marker had a diameter of 0.5mm and the other had a diameter of 0.75mm. T1 VIBE, T2 SPACE, TrueFISP and susceptibility weighted (SW) images were acquired. To improve marker contrast, high spatial resolution was used to reduce partial volume effect. Slice thickness was 1.5mm for all four sequences and in-plane resolution was 0.6mm for TrueFISP, 0.7mm for T1 VIBE, and 0.8mm for T2 SPACE and SW. For comparison purpose, a 3D T1 VIBE image set at 3mm slice thickness and 1.2mm in-plane resolution was also acquired. Results: All markers were visible in all high-resolution image sets. In each image set, marker-induced signal void was the smallest (in diameter) for carbon markers, followed by the 0.5mm gold marker and the largest for the 0.75mm gold marker. The SW images had the largest marker-induced signal void. However, those might be confused by susceptibility-gradient-induced signal voids. T1 VIBE had good visualization of markers with nicely defined edges. T2 SPACE had reasonable visualization of markers but edges were slightly blurred. TrueFISP had good visualization of markers only if they were not masked by banding artifacts. As a comparison, all markers were hardly visible in the standard resolution T1 VIBE images. Conclusion: 3D high-resolution T1 VIBE and SW have great potential in providing good visualization of small fiducial markers for proton beam therapy.

  15. Towards real-time cardiovascular magnetic resonance guided transarterial CoreValve implantation: in vivo evaluation in swine

    PubMed Central

    2012-01-01

    Background Real-time cardiovascular magnetic resonance (rtCMR) is considered attractive for guiding TAVI. Owing to an unlimited scan plane orientation and an unsurpassed soft-tissue contrast with simultaneous device visualization, rtCMR is presumed to allow safe device navigation and to offer optimal orientation for precise axial positioning. We sought to evaluate the preclinical feasibility of rtCMR-guided transarterial aortic valve implantation (TAVI) using the nitinol-based Medtronic CoreValve bioprosthesis. Methods rtCMR-guided transfemoral (n = 2) and transsubclavian (n = 6) TAVI was performed in 8 swine using the original CoreValve prosthesis and a modified, CMR-compatible delivery catheter without ferromagnetic components. Results rtCMR using TrueFISP sequences provided reliable imaging guidance during TAVI, which was successful in 6 swine. One transfemoral attempt failed due to unsuccessful aortic arch passage and one pericardial tamponade with subsequent death occurred as a result of ventricular perforation by the device tip due to an operating error, this complication being detected without delay by rtCMR. rtCMR allowed for a detailed, simultaneous visualization of the delivery system with the mounted stent-valve and the surrounding anatomy, resulting in improved visualization during navigation through the vasculature, passage of the aortic valve, and during placement and deployment of the stent-valve. Post-interventional success could be confirmed using ECG-triggered time-resolved cine-TrueFISP and flow-sensitive phase-contrast sequences. Intended valve position was confirmed by ex-vivo histology. Conclusions Our study shows that rtCMR-guided TAVI using the commercial CoreValve prosthesis in conjunction with a modified delivery system is feasible in swine, allowing improved procedural guidance including immediate detection of complications and direct functional assessment with reduction of radiation and omission of contrast media. PMID:22453050

  16. Towards real-time cardiovascular magnetic resonance guided transarterial CoreValve implantation: in vivo evaluation in swine.

    PubMed

    Kahlert, Philipp; Parohl, Nina; Albert, Juliane; Schäfer, Lena; Reinhardt, Renate; Kaiser, Gernot M; McDougall, Ian; Decker, Brad; Plicht, Björn; Erbel, Raimund; Eggebrecht, Holger; Ladd, Mark E; Quick, Harald H

    2012-03-27

    Real-time cardiovascular magnetic resonance (rtCMR) is considered attractive for guiding TAVI. Owing to an unlimited scan plane orientation and an unsurpassed soft-tissue contrast with simultaneous device visualization, rtCMR is presumed to allow safe device navigation and to offer optimal orientation for precise axial positioning. We sought to evaluate the preclinical feasibility of rtCMR-guided transarterial aortic valve implantation (TAVI) using the nitinol-based Medtronic CoreValve bioprosthesis. rtCMR-guided transfemoral (n = 2) and transsubclavian (n = 6) TAVI was performed in 8 swine using the original CoreValve prosthesis and a modified, CMR-compatible delivery catheter without ferromagnetic components. rtCMR using TrueFISP sequences provided reliable imaging guidance during TAVI, which was successful in 6 swine. One transfemoral attempt failed due to unsuccessful aortic arch passage and one pericardial tamponade with subsequent death occurred as a result of ventricular perforation by the device tip due to an operating error, this complication being detected without delay by rtCMR. rtCMR allowed for a detailed, simultaneous visualization of the delivery system with the mounted stent-valve and the surrounding anatomy, resulting in improved visualization during navigation through the vasculature, passage of the aortic valve, and during placement and deployment of the stent-valve. Post-interventional success could be confirmed using ECG-triggered time-resolved cine-TrueFISP and flow-sensitive phase-contrast sequences. Intended valve position was confirmed by ex-vivo histology. Our study shows that rtCMR-guided TAVI using the commercial CoreValve prosthesis in conjunction with a modified delivery system is feasible in swine, allowing improved procedural guidance including immediate detection of complications and direct functional assessment with reduction of radiation and omission of contrast media.

  17. MR fingerprinting using fast imaging with steady state precession (FISP) with spiral readout.

    PubMed

    Jiang, Yun; Ma, Dan; Seiberlich, Nicole; Gulani, Vikas; Griswold, Mark A

    2015-12-01

    This study explores the possibility of using gradient echo-based sequences other than balanced steady-state free precession (bSSFP) in the magnetic resonance fingerprinting (MRF) framework to quantify the relaxation parameters. An MRF method based on a fast imaging with steady-state precession (FISP) sequence structure is presented. A dictionary containing possible signal evolutions with physiological range of T1 and T2 was created using the extended phase graph formalism according to the acquisition parameters. The proposed method was evaluated in a phantom and a human brain. T1, T2, and proton density were quantified directly from the undersampled data by the pattern recognition algorithm. T1 and T2 values from the phantom demonstrate that the results of MRF FISP are in good agreement with the traditional gold-standard methods. T1 and T2 values in brain are within the range of previously reported values. MRF-FISP enables a fast and accurate quantification of the relaxation parameters. It is immune to the banding artifact of bSSFP due to B0 inhomogeneities, which could improve the ability to use MRF for applications beyond brain imaging. © 2014 Wiley Periodicals, Inc.

  18. MR Fingerprinting Using Fast Imaging with Steady State Precession (FISP) with Spiral Readout

    PubMed Central

    Jiang, Yun; Ma, Dan; Seiberlich, Nicole; Gulani, Vikas; Griswold, Mark A.

    2015-01-01

    Purpose This study explores the possibility of using gradient echo based sequences other than bSSFP in the magnetic resonance fingerprinting (MRF) framework to quantify the relaxation parameters. Methods An MRF method based on a fast imaging with steady state precession (FISP) sequence structure is presented. A dictionary containing possible signal evolutions with physiological range of T1 and T2 was created using the extended phase graph (EPG) formalism according to the acquisition parameters. The proposed method was evaluated in a phantom and a human brain. T1, T2 and proton density were quantified directly from the undersampled data by the pattern recognition algorithm. Results T1 and T2 values from the phantom demonstrate that the results of MRF FISP are in good agreement with the traditional gold-standard methods. T1 and T2 values in brain are within the range of previously reported values. Conclusion MRF FISP enables a fast and accurate quantification of the relaxation parameters, while being immune to the banding artifact of bSSFP due to B0 inhomogeneities, which could improve the ability to use MRF for applications beyond brain imaging. PMID:25491018
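
    The dictionary-matching step described in the abstract above can be illustrated with a small sketch. It is not the authors' implementation: the signal model below is a toy saturation-recovery-times-decay curve standing in for the extended phase graph simulation, and the function names, parameter grid, and time points are all hypothetical. Only the matching principle (pick the dictionary entry with the largest normalized inner product, then read off a scale factor) is the point:

```python
import math


def toy_signal(t1_ms, t2_ms, times_ms):
    """Toy signal evolution (NOT an EPG/FISP simulation): T1 recovery times T2 decay."""
    return [(1 - math.exp(-t / t1_ms)) * math.exp(-t / t2_ms) for t in times_ms]


def normalize(signal):
    """Scale a signal to unit Euclidean norm."""
    norm = math.sqrt(sum(x * x for x in signal))
    return [x / norm for x in signal]


def build_dictionary(t1_values, t2_values, times_ms):
    """Map each (T1, T2) pair with T2 < T1 to its unit-norm toy signal."""
    return {
        (t1, t2): normalize(toy_signal(t1, t2, times_ms))
        for t1 in t1_values
        for t2 in t2_values
        if t2 < t1
    }


def inner(a, b):
    return sum(x * y for x, y in zip(a, b))


def match(measured, dictionary):
    """Return the best-matching (T1, T2) key and the scale of the measured signal
    relative to that unit-norm entry (a stand-in for a proton-density estimate)."""
    unit = normalize(measured)
    best = max(dictionary, key=lambda key: inner(unit, dictionary[key]))
    scale = inner(measured, dictionary[best])
    return best, scale


# Hypothetical grid and sampling times, for illustration only.
times = list(range(20, 3001, 20))
dictionary = build_dictionary([500, 800, 1000, 1500], [50, 80, 100, 150], times)

# Noise-free "measurement": a scaled copy of one dictionary signal.
measured = [3.0 * x for x in toy_signal(1000, 80, times)]
best, scale = match(measured, dictionary)
print("matched (T1, T2) in ms:", best)
```

    In practice the measured signal is noisy and undersampled, and the dictionary is far denser, but the selection rule sketched here stays the same in spirit.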

  19. Evaluation of balanced steady-state free precession (TrueFISP) and K-space segmented gradient echo sequences for 3D coronary MR angiography with navigator gating at 3 Tesla.

    PubMed

    Kaul, M G; Stork, A; Bansmann, P M; Nolte-Ernsting, C; Lund, G K; Weber, C; Adam, G

    2004-11-01

    To test the feasibility of k-space segmented gradient-echo pulse sequences for free-breathing coronary magnetic resonance angiography (cMRA) on a clinical 3T system. T2-prepared, fat-suppressed turbo field echo (TFE, turboFLASH, SFPGR) as well as balanced TFE (b-TFE, trueFISP, FIESTA, segmented SSFP) sequences with navigator gating for prospective motion correction were applied on a 3T system equipped with a six-element phased-array cardiac coil. In 15 healthy volunteers, the right coronary artery (RCA) was examined with TFE and b-TFE sequences. Due to examination time limitations, the left coronary artery (LM/LAD) was examined exclusively with the TFE sequence in ten volunteers. Image quality was graded on a five-point scale (0 = not visualized to 4 = excellent). The length, diameter and sharpness of the vessels and the contrast-to-noise ratios (CNR) were measured. 98% of all major segments (proximal/middle/distal) of the RCA could be seen with the TFE sequence and 82% with the b-TFE sequence. The image quality for the three segments was graded higher for the TFE sequence (2.7/2.7/1.5) than for the b-TFE sequence (1.9/1.6/0.9) with P ≤ 0.001/≤ 0.004/≤ 0.056. The kappa of the interobserver variability was 0.75 for the TFE sequence and 0.8 for the b-TFE sequence. The measured vessel lengths were longer for the TFE sequence (95 ± 22 mm) than for the b-TFE sequence (80 ± 40 mm; P ≤ 0.115). No significant changes (P ≤ 0.074, P ≤ 0.145) in diameter and vessel sharpness of the RCAs were observed between the TFE (2.4 ± 0.3 mm, 60% ± 5) and b-TFE sequences (2.4 ± 0.3 mm, 62% ± 6). The CNR was higher for the TFE sequence (10.1 ± 3.4) than for the b-TFE sequence (6.6 ± 2.1; P ≤ 0.014). All ten main and proximal segments of the LM/LAD, which were examined exclusively with the TFE sequence, were visible with grade 2.5 and 2.1. The middle segment was visible in seven cases with grade 1.3. In three cases, the distal segment was visible with grade 0.5. The vessel length was 78 ± 27 mm and the CNR 11.9 ± 2.4. The conventional TFE technique has demonstrated good feasibility for cMRA at 3T. In its operational availability at 3T, the b-TFE sequence is inferior to the TFE sequence.

  20. Histological correlation of 7 T multi-parametric MRI performed in ex-vivo Achilles tendon.

    PubMed

    Juras, Vladimir; Apprich, Sebastian; Pressl, Christina; Zbyn, Stefan; Szomolanyi, Pavol; Domayer, Stephan; Hofstaetter, Jochen G; Trattnig, Siegfried

    2013-05-01

    The goal of this in vitro validation study was to investigate the feasibility of biochemical MRI techniques, such as sodium imaging, T₂ mapping, fast imaging with steady state precession (FISP), and reversed FISP (PSIF), as potential markers for collagen, glycosaminoglycan and water content in the Achilles tendon. Five fresh cadaver ankles acquired from a local anatomy department were used in the study. To acquire a sodium signal from the Achilles tendon, a 3D-gradient-echo sequence, optimized for sodium imaging, was used with TE=7.71 ms and TR=17 ms. The T₂ relaxation times were obtained using a multi-echo, spin-echo technique with a repetition time (TR) of 1200 ms and six echo times. A 3D, partially balanced, steady-state gradient echo pulse sequence was used to acquire FISP and PSIF images, with TR/TE=6.96/2.46 ms. MRI parameters were correlated with each other, as well as with histologically assessed glycosaminoglycan and water content in cadaver Achilles tendons. The highest relevant Pearson correlation coefficient was found between sodium SNR and glycosaminoglycan content (r=0.71, p=0.007). Relatively high correlation was found between the PSIF signal and T2 values (r=0.51, p=0.036), and between the FISP signal and T₂ values (r=0.56, p=0.047). Other correlations were found to be below the moderate level. This study demonstrated the feasibility of progressive biochemical MRI methods for the imaging of the AT. A GAG-specific, contrast-free method (sodium imaging), as well as collagen- and water-sensitive methods (T₂ mapping, FISP, PSIF), may be used in fast-relaxing tissues, such as tendons, in reasonable scan times. Copyright © 2011 Elsevier Ireland Ltd. All rights reserved.

  1. SU-G-JeP2-14: MRI-Based HDR Prostate Brachytherapy: A Phantom Study for Interstitial Catheter Reconstruction with 0.35T MRI Images

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Park, S; Kamrava, M; Yang, Y

    Purpose: To evaluate the accuracy of interstitial catheter reconstruction with 0.35T MRI images for MRI-based HDR prostate brachytherapy. Methods: Recently, a real-time MRI-guided radiotherapy system combining a 0.35T MRI system and three cobalt 60 heads (MRIdian System, ViewRay, Cleveland, OH, USA) was installed in our department. A TrueFISP sequence for MRI acquisition at lower field on ViewRay was chosen due to its fast speed and high signal-to-noise efficiency. Interstitial FlexiGuide needles were implanted into a tissue equivalent ultrasound prostate phantom (CIRS, Norfolk, Virginia, USA). After an initial 15 s pilot MRI to confirm the location of the phantom, planning MRI was acquired with a 172 s TrueFISP sequence. The pulse sequence parameters included: flip angle = 60 degrees, echo time (TE) = 1.45 ms, repetition time (TR) = 3.37 ms, slice thickness = 1.5 mm, field of view (FOV) = 500 × 450 mm. A CT scan was then acquired as a reference image. The CT and MR scans were then fused with the MIM Maestro (MIM software Inc., Cleveland, OH, USA) and sent to the Oncentra Brachy planning system (Elekta, Veenendaal, Netherlands). Automatic catheter reconstruction using CT and MR image intensities followed by manual reconstruction was used to digitize catheters. The accuracy of catheter reconstruction was evaluated from the catheter tip location. Results: The average difference between the catheter tip locations reconstructed from the CT and MR in the transverse, anteroposterior, and craniocaudal directions was −0.1 ± 0.1 mm (left), 0.2 ± 0.2 mm (anterior), and −2.3 ± 0.5 mm (cranio). The average distance in 3D was 2.3 mm ± 0.5 mm. Conclusion: This feasibility study proved that interstitial catheters can be reconstructed with 0.35T MRI images. For more accurate catheter reconstruction which can affect final dose distribution, a systematic shift should be applied to the MR based catheter reconstruction in HDR prostate brachytherapy.

  2. Quantitative 17O imaging towards oxygen consumption study in tumor bearing mice at 7 T.

    PubMed

    Narazaki, Michiko; Kanazawa, Yoko; Koike, Sachiko; Ando, Koichi; Ikehira, Hiroo

    2013-06-01

    (17)O magnetic resonance imaging (MRI) using a conventional pulse sequence was explored as a method of quantitative imaging towards regional oxygen consumption rate measurement for tumor evaluation in mice. At 7 T, fast imaging with steady state (FISP) was the best among gradient echo, fast spin echo and FISP for the purpose. The distribution of natural abundance H2(17)O in mice was visualized under spatial resolution of 2.5 × 2.5mm(2) by FISP in 10 min. The signal intensity by FISP showed a linear relationship with (17)O quantity both in phantom and mice. Following the injection of 5% (17)O enriched saline, (17)O re-distribution was monitored in temporal resolution down to 5 sec with an image quality sufficient to distinguish each organ. The image of labeled water produced from inhaled (17)O2 gas was also obtained. The present method provides quantitative (17)O images under sufficient temporal and spatial resolution for the evaluation of oxygen consumption rate in each organ. Experiments using various model compounds of R-OH type clarified that the signal contribution of body constituents other than water in the present in vivo(17)O FISP image was negligible. Copyright © 2013 Elsevier Inc. All rights reserved.

  3. SU-F-I-58: Image Quality Comparisons of Different Motion Magnitudes and TR Values in MR-PET

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Patrick, J; Thompson, R; Tavallaei, M

    2016-06-15

    Purpose: The aim of this work is to evaluate the accuracy and sensitivity of a respiratory-triggered MR-PET protocol in detecting four different sized lesions at two different magnitudes of motion, with two different TR values, using a novel PET-MR-CT compatible respiratory motion phantom. Methods: The eight-compartment torso phantom was set up adjacent to the motion stage, which moved four spherical compartments (28, 22, 17, 10 mm diameter) in two separate (1 and 2 cm) linear motion profiles, simulating a 3.5 second respiratory cycle. Scans were acquired on a 3T MR-PET system (Biograph mMR; Siemens Medical Solutions, Germany). MR measurements were taken with: 1) Respiratory-triggered T2-weighted turbo spin echo (BLADE) sequence in coronal orientation, and 2) Real-time balanced steady-state gradient echo sequence (TrueFISP) in coronal and sagittal planes. PET was acquired simultaneously with MR. Sphere geometries and motion profiles were measured and compared with ground truths for T2 BLADE-TSE acquisitions and real time TrueFISP images. PET quantification and geometry measurements were taken using standardized uptake values and voxel intensity plots, were compared with known values, and were examined alongside MR-based attenuation maps. Contrast and signal-to-noise ratios were also compared for each of the acquisitions as functions of motion range and TR. Results: Comparison of lesion diameters indicates the respiratory triggered T2 BLADE-TSE was able to maintain geometry within −2 mm for 1 cm motion for both TR values, and within −3.1 mm for TR = 2000 ms at 2 cm motion. Sphere measurements in respiratory triggered PET images were accurate within ±5 mm for both ranges of motion for 28, 22, and 17 mm diameter spheres. Conclusion: Hybrid MR-PET systems show promise in imaging lung cancer in non-compliant patients, with their ability to acquire both modalities simultaneously. However, MR-based attenuation maps are still susceptible to motion derived artifacts and pose the potential to affect PET accuracy.

  4. Automated 3D quantitative assessment and measurement of alpha angles from the femoral head-neck junction using MR imaging

    NASA Astrophysics Data System (ADS)

    Xia, Ying; Fripp, Jurgen; Chandra, Shekhar S.; Walker, Duncan; Crozier, Stuart; Engstrom, Craig

    2015-10-01

    To develop an automated approach for 3D quantitative assessment and measurement of alpha angles from the femoral head-neck (FHN) junction using bone models derived from magnetic resonance (MR) images of the hip joint. Bilateral MR images of the hip joints were acquired from 30 male volunteers (healthy active individuals and high-performance athletes, aged 18-49 years) using a water-excited 3D dual echo steady state (DESS) sequence. In a subset of these subjects (18 water-polo players), additional True Fast Imaging with Steady-state Precession (TrueFISP) images were acquired from the right hip joint. For both MR image sets, an active shape model based algorithm was used to generate automated 3D bone reconstructions of the proximal femur. Subsequently, a local coordinate system of the femur was constructed to compute a 2D shape map to project femoral head sphericity for calculation of alpha angles around the FHN junction. To evaluate automated alpha angle measures, manual analyses were performed on anterosuperior and anterior radial MR slices from the FHN junction that were automatically reformatted using the constructed coordinate system. High intra- and inter-rater reliability (intra-class correlation coefficients > 0.95) was found for manual alpha angle measurements from the auto-extracted anterosuperior and anterior radial slices. Strong correlations were observed between manual and automatic measures of alpha angles for anterosuperior (r = 0.84) and anterior (r = 0.92) FHN positions. For matched DESS and TrueFISP images, there were no significant differences between automated alpha angle measures obtained from the upper anterior quadrant of the FHN junction (two-way repeated measures ANOVA, F < 0.01, p = 0.98). Our automatic 3D method analysed MR images of the hip joints to generate alpha angle measures around the FHN junction circumference with very good reliability and reproducibility. This work has the potential to improve analyses of cam-type lesions of the FHN junction for large-scale morphometric and clinical MR investigations of the human hip region.

  5. WE-FG-202-08: Assessment of Treatment Response Via Longitudinal Diffusion MRI On A MRI-Guided System: Initial Experience of Quantitative Analysis

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Qi, X; Yang, Y; Yang, L

    Purpose: To report our initial experience of systematically monitoring treatment response using longitudinal diffusion MR images on a Co-60 MRI-guided radiotherapy system. Methods: Four patients, including 2 head-and-neck cases, 1 sarcoma and 1 GBM treated on a 0.35 Tesla MRI-guided treatment system, were analyzed. For each patient, 3D TrueFISP MRIs were acquired during CT simulation and before each treatment for treatment planning and patient setup purposes respectively. Additionally, 2D diffusion-weighted MR images (DWI) were acquired weekly throughout the treatment course. The gross target volume (GTV) and brainstem (as a reference structure) were delineated on weekly 3D TrueFISP MRIs to monitor anatomy changes; the contours were then transferred onto the corresponding DWI images after fusing with the weekly TrueFISP images. The patient-specific temporal and spatial variations during the entire treatment course, such as anatomic changes and target apparent diffusion coefficient (ADC) distribution, were evaluated in a longitudinal pattern. Results: Routine MRI revealed progressive soft-tissue GTV volume changes (up to 53%) for the H&N cases during the treatment course of 5–7 weeks. Within the GTV, the mean ADC values varied from −44% (ADC decrease) to +26% (ADC increase) in a week. The gradual increase of ADC value was inversely associated with target volume variation for one H&N case. The maximal changes of mean ADC values within the brainstem were 5.3% for the H&N cases. For the large size sarcoma and GBM tumors, spatial heterogeneity and temporal variations were observed through longitudinal ADC analysis. Conclusion: In addition to the superior soft-tissue visualization, the 0.35T MR system on ViewRay showed the potential to quantitatively measure the ADC values for both tumor and normal tissues. For normal tissue that is minimally affected by radiation, its ADC values are reproducible. Tumor ADC values show temporal and spatial fluctuation that can be exploited for personalized adaptive therapy.

  6. Non-contrast-enhanced perfusion and ventilation assessment of the human lung by means of fourier decomposition in proton MRI.

    PubMed

    Bauman, Grzegorz; Puderbach, Michael; Deimling, Michael; Jellus, Vladimir; Chefd'hotel, Christophe; Dinkel, Julien; Hintze, Christian; Kauczor, Hans-Ulrich; Schad, Lothar R

    2009-09-01

    Assessment of regional lung perfusion and ventilation has significant clinical value for the diagnosis and follow-up of pulmonary diseases. In this work a new method of non-contrast-enhanced functional lung MRI (not dependent on intravenous or inhalative contrast agents) is proposed. A two-dimensional (2D) true fast imaging with steady precession (TrueFISP) pulse sequence (TR/TE = 1.9 ms/0.8 ms, acquisition time [TA] = 112 ms/image) was implemented on a 1.5T whole-body MR scanner. The imaging protocol comprised sets of 198 lung images acquired with an imaging rate of 3.33 images/s in coronal and sagittal view. No electrocardiogram (ECG) or respiratory triggering was used. A nonrigid image registration algorithm was applied to compensate for respiratory motion. Rapid data acquisition allowed observing intensity changes in corresponding lung areas with respect to the cardiac and respiratory frequencies. After a Fourier analysis along the time domain, two spectral lines corresponding to both frequencies were used to calculate the perfusion- and ventilation-weighted images. The described method was applied in preliminary studies on volunteers and patients showing clinical relevance to obtain non-contrast-enhanced perfusion and ventilation data.
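
    The Fourier analysis step in this abstract can be sketched for a single pixel time course. The sketch below is an illustration under stated assumptions, not the authors' implementation: the function name band_peak_amplitude, the frequency bands, and the synthetic signal are all hypothetical, and taking the peak spectral amplitude within a band is one plausible reading of "two spectral lines". Only the frame count (198) and imaging rate (3.33 images/s) come from the abstract.

```python
import cmath
import math


def band_peak_amplitude(series, fs_hz, f_lo_hz, f_hi_hz):
    """Largest single-sided DFT amplitude of `series` within [f_lo_hz, f_hi_hz]."""
    n = len(series)
    mean = sum(series) / n
    centered = [x - mean for x in series]  # remove the static (DC) signal
    peak = 0.0
    for k in range(1, n // 2):
        freq = k * fs_hz / n  # frequency of DFT bin k
        if not (f_lo_hz <= freq <= f_hi_hz):
            continue
        coeff = sum(
            x * cmath.exp(-2j * math.pi * k * i / n)
            for i, x in enumerate(centered)
        )
        peak = max(peak, 2.0 * abs(coeff) / n)
    return peak


# Usage: synthetic pixel with assumed respiratory and cardiac modulation.
FS_HZ = 3.33     # imaging rate from the abstract
N_FRAMES = 198   # images per set from the abstract
f_resp = 15 * FS_HZ / N_FRAMES  # about 0.25 Hz, chosen to fall on a DFT bin
f_card = 65 * FS_HZ / N_FRAMES  # about 1.09 Hz, chosen to fall on a DFT bin
pixel = [
    100.0
    + 10.0 * math.sin(2 * math.pi * f_resp * i / FS_HZ)
    + 3.0 * math.sin(2 * math.pi * f_card * i / FS_HZ)
    for i in range(N_FRAMES)
]
ventilation_weight = band_peak_amplitude(pixel, FS_HZ, 0.1, 0.5)
perfusion_weight = band_peak_amplitude(pixel, FS_HZ, 0.8, 1.5)
print(round(ventilation_weight, 3), round(perfusion_weight, 3))
```

    Repeating this per pixel on registered images would yield one ventilation-weighted and one perfusion-weighted map; real physiological frequencies rarely fall exactly on a bin, so some spectral leakage should be expected in practice.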

  7. MO-G-18C-03: Evaluation of Deformable Image Registration for Lung Motion Estimation Using Hyperpolarized Gas Tagging MRI

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Huang, Q; Zhang, Y; Liu, Y

    2014-06-15

    Purpose: Hyperpolarized gas (HP) tagging MRI is a novel imaging technique for direct measurement of lung motion during breathing. This study aims to quantitatively evaluate the accuracy of deformable image registration (DIR) in lung motion estimation using HP tagging MRI as references. Methods: Three healthy subjects were imaged using the HP MR tagging, as well as a high-resolution 3D proton MR sequence (TrueFISP) at the end-of-inhalation (EOI) and the end-of-exhalation (EOE). Ground truth of lung motion and corresponding displacement vector field (tDVF) was derived from HP tagging MRI by manually tracking the displacement of tagging grids between EOI and EOE. Seven different DIR methods were applied to the high-resolution TrueFISP MR images (EOI and EOE) to generate the DIR-based DVFs (dDVF). The DIR methods include Velocity (VEL), MIM, Mirada, multi-grid B-spline from Elastix (MGB) and 3 other algorithms from DIRART toolbox (Double Force Demons (DFD), Improved Lucas-Kanade (ILK), and Iterative Optical Flow (IOF)). All registrations were performed by independent experts. Target registration error (TRE) was calculated as tDVF – dDVF. Analysis was performed for the entire lungs, and separately for the upper and lower lungs. Results: Significant differences between tDVF and dDVF were observed. Except for the DFD and IOF algorithms, the dDVFs showed similar deformation magnitude distributions that nonetheless deviated from the ground truth. The average TRE for the entire lung ranged from 2.5 to 23.7 mm (mean = 8.8 mm), depending on the DIR method and the subject's breathing amplitude. Larger TRE (13.3–23.7 mm) was found in the subject with the larger breathing amplitude of 45.6 mm. TRE was greater in the lower lung (2.5−33.9 mm, mean = 12.4 mm) than in the upper lung (2.5−11.9 mm, mean = 5.8 mm). Conclusion: Significant differences were observed in lung motion estimation between the HP gas tagging MRI method and the DIR methods, especially when lung motion is large. Large variation among different DIR methods was also observed.
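
    The TRE definition in this abstract (tDVF minus dDVF) can be made concrete with a short sketch. The abstract does not say how the vector difference is reduced to a scalar, so the Euclidean magnitude per tracked point is an assumption here; the function name and the sample displacement values are hypothetical and are not data from the study.

```python
import math


def target_registration_errors(tdvf, ddvf):
    """Per-point magnitude (mm) of the vector difference tDVF - dDVF."""
    if len(tdvf) != len(ddvf):
        raise ValueError("both displacement fields must cover the same points")
    # math.dist gives the Euclidean norm of the difference between two vectors.
    return [math.dist(t, d) for t, d in zip(tdvf, ddvf)]


# Usage: two hypothetical tracked grid points, displacements in mm (x, y, z).
tagging_dvf = [(0.0, 0.0, 10.0), (3.0, 4.0, 0.0)]  # stand-in for tDVF
dir_dvf = [(0.0, 0.0, 7.0), (0.0, 0.0, 0.0)]       # stand-in for dDVF
tre = target_registration_errors(tagging_dvf, dir_dvf)
mean_tre = sum(tre) / len(tre)
print(tre, mean_tre)
```

    Averaging over all tracked points, or over subsets for the upper and lower lung, would give summary values of the kind reported in the abstract.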

  8. Magnetic resonance imaging in the pre-operative evaluation of obstructive epiphora: true-FISP and VIBE vs gadolinium.

    PubMed

    Somma, Francesco; d'Agostino, Vincenzo; Tortora, Fabio; Serra, Nicola; Sorrentino, Gerardo; Piscitelli, Valeria; Somma, Andrea; Gamerra, Mario

    2017-02-01

    To assess unenhanced magnetic resonance imaging (MRI) in the preoperative evaluation of obstructive epiphora in patients undergoing dacryocystorhinostomy (DCR) and in particular, to evaluate the efficacy of this technique in the detection of the exact level of obstruction occurring in the naso-lachrymal duct (NLD). The correct identification and characterization of the NLD and its obstructions lead to a more effective surgery, preventing recurrent dacryocystitis after the surgical treatment. From January 2009 to December 2014, 127 obstructive epiphoras were diagnosed and treated in 127 patients (35 M, 92 F; mean age 60.7 ± 7.48 years, range 42-75 years) with endoscopic DCR, in an IRB-approved protocol. To precisely define the morphology of the NLD and the site of obstruction, some of these patients (67/127) underwent unenhanced 1.5-T MR with TrueFISP and VIBE sequences, while the remaining (60/127) underwent Gadolinium-enhanced 1.5-T MR. Afterwards, surgery checked the real site of obstruction in both groups of patients (enhanced and unenhanced MR), with surgical outcomes matched with previous MR reports. In all cases, unenhanced MRI was able to detect the exact site of obstruction along the NLD, allowing a correct planning of surgical endoscopic procedures. On the contrary, enhanced MRI wrongly diagnosed six patients with proximal stenosis (6/60, 10.0%) as intermediate NLD obstruction. Unenhanced MRI was found to be more accurate than enhanced MRI with a statistically significant difference (p value = 0.0256) and obviously cheaper and easier to perform. All imaging reports were verified with surgery. The correct identification of the level of obstruction allowed successful surgery in around 73% (93/127) of patients, who had no recurrence during 6-month follow-up. In patients with epiphora, unenhanced MR proved to be highly reliable and even more effective than enhanced MR in the preoperative characterization of NLD stenosis, with no need to perform complex, time-consuming and expensive procedures for the administration of topical contrast media.

  9. How to design 13C para-hydrogen-induced polarization experiments for MRI applications.

    PubMed

    Reineri, Francesca; Viale, Alessandra; Dastrù, Walter; Gobetto, Roberto; Aime, Silvio

    2011-01-01

    The application of hyperpolarization techniques for MRI purposes is gathering increasing attention, especially for nuclei such as (13)C or (129)Xe. Among the different proposed methods, ParaHydrogen Induced Polarization requires relatively cheap equipment. The setup of an MRI experiment by means of parahydrogen requires the application of skills and methodologies that derive from different fields of knowledge. The basic theory and a practical insight of this method are presented here. Parahydrogenation of alkynes, having a labelled (13)CO group adjacent to the triple bond, catalyzed by Rh(I) complexes containing a chelating phosphine, represents the best choice for producing and maintaining high heteronuclear polarization effect. In order to transform anti-phase into in-phase (net) (13)C polarization for MRI application it is necessary to set up the described magnetic field cycle procedure. In vitro and in vivo images have been acquired using fast imaging sequences (RARE and trueFISP). Copyright © 2010 John Wiley & Sons, Ltd.

  10. Method for automatic localization of MR-visible markers using morphological image processing and conventional pulse sequences: feasibility for image-guided procedures.

    PubMed

    Busse, Harald; Trampel, Robert; Gründer, Wilfried; Moche, Michael; Kahn, Thomas

    2007-10-01

    To evaluate the feasibility and accuracy of an automated method to determine the 3D position of MR-visible markers. Inductively coupled RF coils were imaged in a whole-body 1.5T scanner using the body coil and two conventional gradient echo sequences (FLASH and TrueFISP) and large imaging volumes up to (300 mm(3)). To minimize background signals, a flip angle of approximately 1 degrees was used. Morphological 2D image processing in orthogonal scan planes was used to determine the 3D positions of a configuration of three fiducial markers (FMC). The accuracies of the marker positions and of the orientation of the plane defined by the FMC were evaluated at various distances r(M) from the isocenter. Fiducial marker detection with conventional equipment (pulse sequences, imaging coils) was very reliable and highly reproducible over a wide range of experimental conditions. For r(M)

  11. 1.5 versus 3 versus 7 Tesla in abdominal MRI: A comparative study.

    PubMed

    Laader, Anja; Beiderwellen, Karsten; Kraff, Oliver; Maderwald, Stefan; Wrede, Karsten; Ladd, Mark E; Lauenstein, Thomas C; Forsting, Michael; Quick, Harald H; Nassenstein, Kai; Umutlu, Lale

    2017-01-01

    The aim of this study was to investigate and compare the feasibility as well as potential impact of altered magnetic field properties on image quality and potential artifacts of 1.5 Tesla, 3 Tesla and 7 Tesla non-enhanced abdominal MRI. Magnetic Resonance (MR) imaging of the upper abdomen was performed in 10 healthy volunteers on a 1.5 Tesla, a 3 Tesla and a 7 Tesla MR system. The study protocol comprised a (1) T1-weighted fat-saturated spoiled gradient-echo sequence (2D FLASH), (2) T1-weighted fat-saturated volumetric interpolated breath hold examination sequence (3D VIBE), (3) T1-weighted 2D in and opposed phase sequence, (4) True fast imaging with steady-state precession sequence (TrueFISP) and (5) T2-weighted turbo spin-echo (TSE) sequence. For comparison reasons field of view and acquisition times were kept comparable for each correlating sequence at all three field strengths, while trying to achieve the highest possible spatial resolution. Qualitative and quantitative analyses were tested for significant differences. While 1.5 and 3 Tesla MRI revealed comparable results in all assessed features and sequences, 7 Tesla MRI yielded considerable differences in T1 and T2 weighted imaging. Benefits of 7 Tesla MRI encompassed a higher spatial resolution and a non-enhanced hyperintense vessel signal at 7 Tesla, potentially offering a more accurate diagnosis of abdominal parenchymatous and vasculature disease. 7 Tesla MRI was also shown to be more impaired by artifacts, including residual B1 inhomogeneities, susceptibility and chemical shift artifacts, resulting in reduced overall image quality and overall image impairment ratings. While 1.5 and 3 Tesla T2w imaging showed equivalently high image quality, 7 Tesla revealed strong impairments in its diagnostic value. Our results demonstrate the feasibility and overall comparable imaging ability of T1-weighted 7 Tesla abdominal MRI towards 3 Tesla and 1.5 Tesla MRI, yielding a promising diagnostic potential for non-enhanced Magnetic Resonance Angiography (MRA). 1.5 Tesla and 3 Tesla offer comparably high-quality T2w imaging, showing superior diagnostic quality over 7 Tesla MRI.

  12. 1.5 versus 3 versus 7 Tesla in abdominal MRI: A comparative study

    PubMed Central

    Beiderwellen, Karsten; Kraff, Oliver; Maderwald, Stefan; Wrede, Karsten; Ladd, Mark E.; Lauenstein, Thomas C.; Forsting, Michael; Quick, Harald H.; Nassenstein, Kai; Umutlu, Lale

    2017-01-01

    Objectives: The aim of this study was to investigate and compare the feasibility as well as potential impact of altered magnetic field properties on image quality and potential artifacts of 1.5 Tesla, 3 Tesla and 7 Tesla non-enhanced abdominal MRI. Materials and methods: Magnetic Resonance (MR) imaging of the upper abdomen was performed in 10 healthy volunteers on a 1.5 Tesla, a 3 Tesla and a 7 Tesla MR system. The study protocol comprised a (1) T1-weighted fat-saturated spoiled gradient-echo sequence (2D FLASH), (2) T1-weighted fat-saturated volumetric interpolated breath hold examination sequence (3D VIBE), (3) T1-weighted 2D in and opposed phase sequence, (4) True fast imaging with steady-state precession sequence (TrueFISP) and (5) T2-weighted turbo spin-echo (TSE) sequence. For comparison reasons field of view and acquisition times were kept comparable for each correlating sequence at all three field strengths, while trying to achieve the highest possible spatial resolution. Qualitative and quantitative analyses were tested for significant differences. Results: While 1.5 and 3 Tesla MRI revealed comparable results in all assessed features and sequences, 7 Tesla MRI yielded considerable differences in T1 and T2 weighted imaging. Benefits of 7 Tesla MRI encompassed a higher spatial resolution and a non-enhanced hyperintense vessel signal at 7 Tesla, potentially offering a more accurate diagnosis of abdominal parenchymatous and vasculature disease. 7 Tesla MRI was also shown to be more impaired by artifacts, including residual B1 inhomogeneities, susceptibility and chemical shift artifacts, resulting in reduced overall image quality and overall image impairment ratings. While 1.5 and 3 Tesla T2w imaging showed equivalently high image quality, 7 Tesla revealed strong impairments in its diagnostic value. Conclusions: Our results demonstrate the feasibility and overall comparable imaging ability of T1-weighted 7 Tesla abdominal MRI towards 3 Tesla and 1.5 Tesla MRI, yielding a promising diagnostic potential for non-enhanced Magnetic Resonance Angiography (MRA). 1.5 Tesla and 3 Tesla offer comparably high-quality T2w imaging, showing superior diagnostic quality over 7 Tesla MRI. PMID:29125850

  13. [Comparison of Quantification of Myocardial Infarct Size by One Breath Hold Single Shot PSIR Sequence and Segmented FLASH-PSIR Sequence at 3.0 Tesla MR].

    PubMed

    Cheng, Wei; Cai, Shu; Sun, Jia-yu; Xia, Chun-chao; Li, Zhen-lin; Chen, Yu-cheng; Zhong, Yao-zu

    2015-05-01

    To compare two sequences, single shot true-FISP-PSIR (single shot-PSIR) and segmented-turbo-FLASH-PSIR (segmented-PSIR), for quantifying myocardial infarct size at 3.0 tesla MRI. Thirty-eight patients with clinically confirmed myocardial infarction underwent a comprehensive gadolinium-enhanced cardiac MRI examination on a 3.0 tesla MRI system (Trio, Siemens). Myocardial delayed enhancement (MDE) imaging was performed with the single shot-PSIR and segmented-PSIR sequences separately, 12-20 min after gadopentetate dimeglumine injection (0.15 mmol/kg). The quality of the MDE images was analysed by experienced physicians. Signal-to-noise ratio (SNR) and contrast-to-noise ratio (CNR) were compared between the two techniques. Myocardial infarct size was quantified automatically with dedicated software (Q-mass, Medis). All subjects were scanned successfully on the 3.0T MR system. No significant difference was found between the two sequences in SNR or CNR (P>0.05) or in total myocardial volume (P>0.05). Furthermore, there was no significant difference between the two sequences in infarct size [single shot-PSIR (30.87 ± 15.72) mL, segmented-PSIR (29.26 ± 14.07) mL] or infarct ratio [single shot-PSIR (22.94% ± 10.94%), segmented-PSIR (20.75% ± 8.78%)] (P>0.05). However, the average acquisition time of single shot-PSIR (21.4 s) was shorter than that of segmented-PSIR (380 s). Single shot-PSIR is equivalent to segmented-PSIR in determining myocardial infarct size while requiring less acquisition time, which makes it valuable for clinical application and further research.

  14. Simultaneous PET/MR imaging of the brain: feasibility of cerebral blood flow measurements with FAIR-TrueFISP arterial spin labeling MRI.

    PubMed

    Stegger, Lars; Martirosian, Petros; Schwenzer, Nina; Bisdas, Sotirios; Kolb, Armin; Pfannenberg, Christina; Claussen, Claus D; Pichler, Bernd; Schick, Fritz; Boss, Andreas

    2012-11-01

    Hybrid positron emission tomography/magnetic resonance imaging (PET/MRI) with simultaneous data acquisition promises a comprehensive evaluation of cerebral pathophysiology on a molecular, anatomical, and functional level. Considering the necessary changes to the MR scanner design the feasibility of arterial spin labeling (ASL) is unclear. To evaluate whether cerebral blood flow imaging with ASL is feasible using a prototype PET/MRI device. ASL imaging of the brain with Flow-sensitive Alternating Inversion Recovery (FAIR) spin preparation and true fast imaging in steady precession (TrueFISP) data readout was performed in eight healthy volunteers sequentially on a prototype PET/MRI and a stand-alone MR scanner with 128 × 128 and 192 × 192 matrix sizes. Cerebral blood flow values for gray matter, signal-to-noise and contrast-to-noise ratios, and relative signal change were compared. Additionally, the feasibility of ASL as part of a clinical hybrid PET/MRI protocol was demonstrated in five patients with intracerebral tumors. Blood flow maps showed good delineation of gray and white matter with no discernible artifacts. The mean blood flow values of the eight volunteers on the PET/MR system were 51 ± 9 and 51 ± 7 mL/100 g/min for the 128 × 128 and 192 × 192 matrices (stand-alone MR, 57 ± 2 and 55 ± 5, not significant). The value for signal-to-noise (SNR) was significantly higher for the PET/MRI system using the 192 × 192 matrix size (P < 0.01), the relative signal change (δS) was significantly lower for the 192 × 192 matrix size (P = 0.02). ASL imaging as part of a clinical hybrid PET/MRI protocol could successfully be accomplished in all patients in diagnostic image quality. ASL brain imaging is feasible with a prototype hybrid PET/MRI scanner, thus adding to the value of this novel imaging technique.

  15. MR fingerprinting Deep RecOnstruction NEtwork (DRONE).

    PubMed

    Cohen, Ouri; Zhu, Bo; Rosen, Matthew S

    2018-09-01

    Demonstrate a novel fast method for reconstruction of multi-dimensional MR fingerprinting (MRF) data using deep learning methods. A neural network (NN) is defined using the TensorFlow framework and trained on simulated MRF data computed with the extended phase graph formalism. The NN reconstruction accuracy for noiseless and noisy data is compared to conventional MRF template matching as a function of training data size and is quantified in simulated numerical brain phantom data and International Society for Magnetic Resonance in Medicine/National Institute of Standards and Technology phantom data measured on 1.5T and 3T scanners with an optimized MRF EPI and MRF fast imaging with steady state precession (FISP) sequences with spiral readout. The utility of the method is demonstrated in a healthy subject in vivo at 1.5T. Network training required 10 to 74 minutes; once trained, data reconstruction required approximately 10 ms for the MRF EPI and 76 ms for the MRF FISP sequence. Reconstruction of simulated, noiseless brain data using the NN resulted in a RMS error (RMSE) of 2.6 ms for T1 and 1.9 ms for T2. The reconstruction error in the presence of noise was less than 10% for both T1 and T2 for SNR greater than 25 dB. Phantom measurements yielded good agreement (R² = 0.99/0.99 for MRF EPI T1/T2 and 0.94/0.98 for MRF FISP T1/T2) between the T1 and T2 estimated by the NN and reference values from the International Society for Magnetic Resonance in Medicine/National Institute of Standards and Technology phantom. Reconstruction of MRF data with a NN is accurate, 300- to 5000-fold faster, and more robust to noise and dictionary undersampling than conventional MRF dictionary-matching. © 2018 International Society for Magnetic Resonance in Medicine.

  16. Free-breathing Sparse Sampling Cine MR Imaging with Iterative Reconstruction for the Assessment of Left Ventricular Function and Mass at 3.0 T.

    PubMed

    Sudarski, Sonja; Henzler, Thomas; Haubenreisser, Holger; Dösch, Christina; Zenge, Michael O; Schmidt, Michaela; Nadar, Mariappan S; Borggrefe, Martin; Schoenberg, Stefan O; Papavassiliu, Theano

    2017-01-01

    Purpose To prospectively evaluate the accuracy of left ventricle (LV) analysis with a two-dimensional real-time cine true fast imaging with steady-state precession (trueFISP) magnetic resonance (MR) imaging sequence featuring sparse data sampling with iterative reconstruction (SSIR) performed with and without breath-hold (BH) commands at 3.0 T. Materials and Methods Ten control subjects (mean age, 35 years; range, 25-56 years) and 60 patients scheduled to undergo a routine cardiac examination that included LV analysis (mean age, 58 years; range, 20-86 years) underwent a fully sampled segmented multiple BH cine sequence (standard of reference) and a prototype undersampled SSIR sequence performed during a single BH and during free breathing (non-BH imaging). Quantitative analysis of LV function and mass was performed. Linear regression, Bland-Altman analysis, and paired t testing were performed. Results Similar to the results in control subjects, analysis of the 60 patients showed excellent correlation with the standard of reference for single-BH SSIR (r = 0.93-0.99) and non-BH SSIR (r = 0.92-0.98) for LV ejection fraction (EF), volume, and mass (P < .0001 for all). Irrespective of breath holding, LV end-diastolic mass was overestimated with SSIR (standard of reference: 163.9 g ± 58.9, single-BH SSIR: 178.5 g ± 62.0 [P < .0001], non-BH SSIR: 175.3 g ± 63.7 [P < .0001]); the other parameters were not significantly different (EF: 49.3% ± 11.9 with standard of reference, 48.8% ± 11.8 with single-BH SSIR, 48.8% ± 11 with non-BH SSIR; P = .03 and P = .12, respectively). Bland-Altman analysis showed similar measurement errors for single-BH SSIR and non-BH SSIR when compared with standard of reference measurements for EF, volume, and mass. Conclusion Assessment of LV function with SSIR at 3.0 T is noninferior to the standard of reference irrespective of BH commands. LV mass, however, is overestimated with SSIR. 
© RSNA, 2016 Online supplemental material is available for this article.
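
    The Bland-Altman comparison named in this abstract can be illustrated with a minimal sketch. It assumes the common convention of reporting the mean difference (bias) with 95% limits of agreement at bias ± 1.96 standard deviations; the function name and the ejection-fraction values are hypothetical and are not taken from the study.

```python
from statistics import mean, stdev

def bland_altman(reference, test):
    """Return (bias, lower limit, upper limit) for paired measurements.

    bias is the mean of (test - reference); the limits of agreement are
    bias +/- 1.96 sample standard deviations of the differences.
    """
    diffs = [t - r for r, t in zip(reference, test)]
    bias = mean(diffs)
    sd = stdev(diffs)  # sample standard deviation (n - 1 denominator)
    return bias, bias - 1.96 * sd, bias + 1.96 * sd

# Hypothetical ejection fractions (%) from two acquisition methods
reference_ef = [50.0, 55.0, 60.0, 45.0]
test_ef = [51.0, 54.0, 62.0, 46.0]
bias, lower, upper = bland_altman(reference_ef, test_ef)
print(f"bias={bias:.2f}, limits of agreement=({lower:.2f}, {upper:.2f})")
```

    A narrow interval centered near zero would indicate that the two methods can be used interchangeably for that parameter.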

  17. Fast group matching for MR fingerprinting reconstruction.

    PubMed

    Cauley, Stephen F; Setsompop, Kawin; Ma, Dan; Jiang, Yun; Ye, Huihui; Adalsteinsson, Elfar; Griswold, Mark A; Wald, Lawrence L

    2015-08-01

    MR fingerprinting (MRF) is a technique for quantitative tissue mapping using pseudorandom measurements. To estimate tissue properties such as T1, T2, proton density, and B0, the rapidly acquired data are compared against a large dictionary of Bloch simulations. This matching process can be a very computationally demanding portion of MRF reconstruction. We introduce a fast group matching algorithm (GRM) that exploits inherent correlation within MRF dictionaries to create highly clustered groupings of the elements. During matching, a group specific signature is first used to remove poor matching possibilities. Group principal component analysis (PCA) is used to evaluate all remaining tissue types. In vivo 3 Tesla brain data were used to validate the accuracy of our approach. For a trueFISP sequence with over 196,000 dictionary elements, 1000 MRF samples, and image matrix of 128 × 128, GRM was able to map MR parameters within 2s using standard vendor computational resources. This is an order of magnitude faster than global PCA and nearly two orders of magnitude faster than direct matching, with comparable accuracy (1-2% relative error). The proposed GRM method is a highly efficient model reduction technique for MRF matching and should enable clinically relevant reconstruction accuracy and time on standard vendor computational resources. © 2014 Wiley Periodicals, Inc.
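
    The direct matching baseline mentioned in this abstract can be sketched as an exhaustive search for the dictionary entry with the largest normalized inner product. This is only an illustration under stated assumptions: it is not the GRM algorithm, it uses real-valued signals where MRF data are complex, and the toy dictionary, its (T1, T2) keys, and the function names are hypothetical.

```python
import math

def normalize(signal):
    """Scale a signal to unit Euclidean norm."""
    norm = math.sqrt(sum(x * x for x in signal))
    return [x / norm for x in signal]

def direct_match(measured, dictionary):
    """Exhaustively compare a measured signal with every dictionary entry.

    Returns the key of the entry whose normalized signal has the largest
    absolute inner product with the normalized measurement, plus that score.
    """
    m = normalize(measured)
    best_key, best_score = None, -1.0
    for key, entry in dictionary.items():
        e = normalize(entry)
        score = abs(sum(a * b for a, b in zip(m, e)))
        if score > best_score:
            best_key, best_score = key, score
    return best_key, best_score

# Hypothetical toy dictionary: (T1 ms, T2 ms) -> simulated signal evolution
toy_dictionary = {
    (800, 60): [1.0, 0.8, 0.5, 0.3],
    (1200, 80): [1.0, 0.9, 0.7, 0.6],
    (1500, 100): [1.0, 0.95, 0.85, 0.8],
}

# A measurement that is a scaled copy of the second entry
print(direct_match([2.0, 1.8, 1.4, 1.2], toy_dictionary))
```

    The cost of this search grows with the number of dictionary entries times the number of time points per voxel, which is the burden that grouping and PCA-based reductions aim to lower.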

  18. Multishot EPI-SSFP in the Heart

    PubMed Central

    Herzka, Daniel A.; Kellman, Peter; Aletras, Anthony H.; Guttman, Michael A.; McVeigh, Elliot R.

    2007-01-01

    Refocused steady-state free precession (SSFP), or fast imaging with steady precession (FISP or TrueFISP), has recently proven valuable for cardiac imaging because of its high signal-to-noise ratio (SNR) and excellent blood-myocardium contrast. In this study, various implementations of multiecho SSFP or EPI-SSFP for imaging in the heart are presented. EPI-SSFP has higher scan-time efficiency than single-echo SSFP, as two or more phase-encode lines are acquired per repetition time (TR) at the cost of a modest increase in TR. To minimize TR, a noninterleaved phase-encode order in conjunction with a phased-array ghost elimination (PAGE) technique was employed, removing the need for echo time shifting (ETS). The multishot implementation of EPI-SSFP was used to decrease the breath-hold duration for cine acquisitions or to increase the temporal or spatial resolution for a fixed breath-hold duration. The greatest gain in efficiency was obtained with the use of a three-echo acquisition. Image quality for cardiac cine applications using multishot EPI-SSFP was comparable to that of single-echo SSFP in terms of blood-myocardium contrast and contrast-to-noise ratio (CNR). The PAGE method considerably reduced flow artifacts due to both the inherent ghost suppression and the concomitant reduction in phase-encode blip size. The increased TR of multishot EPI-SSFP led to a reduced specific absorption rate (SAR) for a fixed RF flip angle, and allowed the use of a larger flip angle without increasing the SAR above the FDA-approved limits. PMID:11948726

  19. Heating of metallic implants and instruments induced by gradient switching in a 1.5-Tesla whole-body unit.

    PubMed

    Graf, Hansjörg; Steidle, Günter; Schick, Fritz

    2007-11-01

    To examine gradient switching-induced heating of metallic parts. Copper and titanium frames and sheets (approximately 50 × 50 mm², 1.5 mm thick, frame width = 3 mm) surrounded by air were positioned in the scanner perpendicular to the static field horizontally 20 cm off-center. During the execution of a sequence (three-dimensional [3D] true fast imaging with steady precession [True-FISP], TR = 6.4 msec) exploiting the gradient capabilities (maximum gradient = 40 mT/m, maximum slew rate = 200 T/m/second), heating was measured with an infrared camera. Radio frequency (RF) amplitude was set to zero volts. Heating of a copper frame with a narrowing to 1 mm over 20 mm at one side was examined in air and in addition surrounded by several liters of gelled saline using fiber-optic thermography. Further heating studies were performed using an artificial hip made of titanium, and an aluminum replica of the hip prosthesis with the same geometry. For the copper specimens, considerable heating (>10 °C) in air and in gelled saline (>1.2 °C) could be observed. Heating of the titanium specimens was markedly less (approximately 1 °C in air). For the titanium artificial hip no heating could be detected, while the rise in temperature for the aluminum replica was approximately 2.2 °C. Heating of more than 10 °C solely due to gradient switching without any RF irradiation was demonstrated in isolated copper wire frames. Under specific conditions (high gradient duty cycle, metallic loop of sufficient inductance and low resistance, power matching) gradient switching-induced heating of conductive specimens must be considered.

  20. Multishot EPI-SSFP in the heart.

    PubMed

    Herzka, Daniel A; Kellman, Peter; Aletras, Anthony H; Guttman, Michael A; McVeigh, Elliot R

    2002-04-01

    Refocused steady-state free precession (SSFP), or fast imaging with steady precession (FISP or TrueFISP), has recently proven valuable for cardiac imaging because of its high signal-to-noise ratio (SNR) and excellent blood-myocardium contrast. In this study, various implementations of multiecho SSFP or EPI-SSFP for imaging in the heart are presented. EPI-SSFP has higher scan-time efficiency than single-echo SSFP, as two or more phase-encode lines are acquired per repetition time (TR) at the cost of a modest increase in TR. To minimize TR, a noninterleaved phase-encode order in conjunction with a phased-array ghost elimination (PAGE) technique was employed, removing the need for echo time shifting (ETS). The multishot implementation of EPI-SSFP was used to decrease the breath-hold duration for cine acquisitions or to increase the temporal or spatial resolution for a fixed breath-hold duration. The greatest gain in efficiency was obtained with the use of a three-echo acquisition. Image quality for cardiac cine applications using multishot EPI-SSFP was comparable to that of single-echo SSFP in terms of blood-myocardium contrast and contrast-to-noise ratio (CNR). The PAGE method considerably reduced flow artifacts due to both the inherent ghost suppression and the concomitant reduction in phase-encode blip size. The increased TR of multishot EPI-SSFP led to a reduced specific absorption rate (SAR) for a fixed RF flip angle, and allowed the use of a larger flip angle without increasing the SAR above the FDA-approved limits. Copyright 2002 Wiley-Liss, Inc.

  1. Evaluation of Intake Efficiencies and Associated Sediment-Concentration Errors in US D-77 Bag-Type and US D-96-Type Depth-Integrating Suspended-Sediment Samplers

    NASA Astrophysics Data System (ADS)

    Sabol, T. A.; Topping, D. J.; Griffiths, R. E.

    2011-12-01

    Accurate measurements of suspended-sediment concentration require suspended-sediment samplers to operate isokinetically with an intake-efficiency of 1.0 ± 0.10. Results from 1940s Federal Interagency Sedimentation Project (FISP) laboratory experiments show that when the intake efficiency does not equal 1.0, suspended-sediment samplers either under- or oversample sediment relative to water, leading to biases in suspended-sediment concentration. The majority of recent FISP sampler development and testing has been conducted under uniform flow conditions using flume and slack-water tow tests, with little testing in actual turbulent rivers. Recent work has focused on the hydraulic characteristics and intake efficiencies of these samplers, without field investigations of the accuracy of the suspended-sediment data collected with these samplers. When depth-integrating suspended-sediment samplers are deployed under the non-uniform and turbulent conditions that exist in rivers, multiple factors may contribute to departures from isokinetic sampling. This introduces errors into the suspended-sediment data that may not be predictable on the basis of flume and tow tests alone. This study (1) evaluates the intake efficiencies of the older US D-77 bag-type and newer, FISP-approved US D-96 samplers at multiple river cross sections under a range of flow conditions; (2) examines if water temperature and sampling duration explain measured differences in intake efficiency between samplers and between laboratory and field tests; (3) models and predicts the directions and magnitudes of errors in measured suspended-sand concentration; and (4) determines if the relative differences in suspended-sediment concentration in a variety of size classes are consistent with the differences expected on the basis of the 1940s FISP-laboratory experiments. 
Results indicate that under river conditions, the intake efficiency of the US D-96 sampler is superior to that of the US D-77 bag-type sampler and the overall performance of the US D-96 sampler is closer to the FISP-acceptable range of isokinetic operation. These results are in contrast with FISP-conducted flume tests that show that both the US D-77 bag-type and US D-96 samplers operate isokinetically in the laboratory. Our results show that a major problem with both samplers is the large time-dependent decrease in intake efficiency that likely arises from an inability of the filling bag to displace water in the flooded sampler cavity at the rate required for isokinetic sampling. Predicted errors in suspended-sand concentration measurements made with the US D-96 sampler are much smaller than those made with the US D-77 bag-type sampler, especially when the effects of water temperature and sampling duration are taken into account. Biases in the concentrations in each size class measured using the US D-77 bag-type relative to the US D-96 samplers are as expected and consistent with the results from the 1940s FISP laboratory experiments.
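
    The isokinetic criterion in this abstract can be expressed as a short check. The sketch below takes intake efficiency to be the ratio of the water velocity through the sampler intake to the local ambient stream velocity, and treats 1.0 ± 0.10 as the acceptable range; the function names and the velocities are hypothetical.

```python
def intake_efficiency(intake_velocity, ambient_velocity):
    """Ratio of velocity through the sampler intake to ambient stream velocity."""
    if ambient_velocity <= 0:
        raise ValueError("ambient velocity must be positive")
    return intake_velocity / ambient_velocity

def is_isokinetic(efficiency, tolerance=0.10):
    """True when the efficiency lies within 1.0 +/- tolerance."""
    return abs(efficiency - 1.0) <= tolerance

# Hypothetical velocities in m/s
for intake, ambient in [(1.9, 2.0), (1.6, 2.0)]:
    eff = intake_efficiency(intake, ambient)
    print(f"efficiency={eff:.2f}, isokinetic={is_isokinetic(eff)}")
```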

  2. MR of the small bowel with a biphasic oral contrast agent (polyethylene glycol): technical aspects and findings in patients affected by Crohn's disease.

    PubMed

    Laghi, Andrea; Paolantonio, Pasquale; Iafrate, Franco; Borrelli, Osvaldo; Dito, Lucia; Tomei, Ernesto; Cucchiara, Salvatore; Passariello, Roberto

    2003-01-01

    To report our experience using MR of the small bowel with polyethylene glycol (PEG) solution as an oral contrast agent in a population of adults and children with known Crohn's disease. 40 patients (29 males; 11 females), 15 adults (age range 24-52 years) and 25 children (age range 5-17 years), with known Crohn's disease, underwent MR of the small bowel using a superconductive 1.5 T magnet, and polyethylene glycol solution as an oral contrast agent. The fixed amount of contrast agent was 750-1000 ml for adults and 10 ml/kg of body weight for children. The Crohn's Disease Activity Index (CDAI) was available in all patients. Our study protocol included the acquisition of T2-weighted half-Fourier single-shot turbo spin-echo (HASTE) sequences and true fast imaging in the steady-state precession (true-FISP) sequences, followed by the acquisition of "spoiled" 2D gradient echo T1-weighted sequences with fat suppression (FLASH, fast low-angle shot) or alternatively "spoiled" 3D (VIBE, volume interpolated breath-hold examination), acquired 70 seconds after intravenous administration of gadopentetate dimeglumine (Gd-DTPA) (0.1 mmol/kg). A specific MR score was created and calculated for each patient and was compared by means of the Spearman rank with CDAI. In all patients no significant side effects were observed and the MR examination was well tolerated even by paediatric patients. In all cases MR showed a small bowel wall thickening (> 4 mm) in the terminal ileum, with lumen stenosis in 26 patients. In 3 cases pathological segments proximal to the terminal ileum were observed and in another 3 cases caecal involvement was visible. The MR examination was able to show abnormalities of perivisceral fat tissue in 15 patients, mesenteric lymphadenopathy in 1 patient and abdominal abscess in 1 case. The Spearman rank showed a statistically significant correlation between CDAI and the MR score (r = 0.91, P = 0.0001).
MR using PEG as an oral contrast agent could be considered a test of great interest in the evaluation of the small bowel in patients suspected of having Crohn's disease in that it is easily reproducible, well tolerated even by paediatric patients and it provides useful information about the localisation, extension and activity of inflammatory disease without the use of ionising radiation.

  3. Needle position estimation from sub-sampled k-space data for MRI-guided interventions

    NASA Astrophysics Data System (ADS)

    Schmitt, Sebastian; Choli, Morwan; Overhoff, Heinrich M.

    2015-03-01

    MRI-guided interventions have gained much interest. They profit from intervention-synchronous data acquisition and image visualization. Due to long data acquisition durations, ergonomic limitations may occur. For a trueFISP MRI data acquisition sequence, a time-sparing sub-sampling strategy has been developed that is adapted to amagnetic needle detection. A symmetrical, contrast-rich susceptibility needle artifact, i.e. an approximately rectangular gray-scale profile, is assumed. The 1-D Fourier transform of a rectangular function is a sinc function. Its periodicity is exploited by sampling only along a few orthogonal trajectories in k-space. Because a needle moves during intervention, its tip region resembles a rectangle in a time-difference image that is reconstructed from such sub-sampled k-spaces acquired at different time stamps. In different phantom experiments, a needle was pushed forward along a reference trajectory, which was determined from a needle holder's geometric parameters. In addition, the trajectory of the needle tip was estimated by the method described above. Only ca. 4 to 5% of the entire k-space data was used for needle tip estimation. The misalignment of needle orientation and needle tip position, i.e. the differences between reference and estimated values, is small and even in its worst case less than 2 mm. The results show that the method is applicable under nearly real conditions. The next step is validation of the method on clinical data.
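
    The rectangle-to-sinc relationship this abstract relies on can be checked numerically. The sketch below is an illustration only, not the authors' sub-sampling method: it integrates the Fourier transform of a centered rectangle of a hypothetical width with the midpoint rule and compares the result with the analytic expression sin(π·w·k)/(π·k).

```python
import cmath
import math

def rect_ft_numeric(k, width, n=2000):
    """Midpoint-rule estimate of the Fourier transform of a unit-height
    rectangle of the given width, centered at zero, at spatial frequency k."""
    dx = width / n
    total = 0j
    for i in range(n):
        x = -width / 2 + (i + 0.5) * dx
        total += cmath.exp(-2j * math.pi * k * x) * dx
    return total

def rect_ft_analytic(k, width):
    """Analytic transform: width * sinc(width * k), with sinc(0) = 1."""
    if k == 0:
        return width
    return math.sin(math.pi * width * k) / (math.pi * k)

width = 2.0  # hypothetical profile width in arbitrary units
for k in (0.0, 0.1, 0.37, 0.5, 1.0):
    numeric = rect_ft_numeric(k, width)
    print(k, round(numeric.real, 5), round(rect_ft_analytic(k, width), 5))
```

    The zeros of the transform fall at integer multiples of 1/width, so the spacing of the zero crossings along a k-space line carries the width of the rectangular profile.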

  4. Optimising magnetic resonance image quality of the ear in healthy dogs.

    PubMed

    Wolf, Davina; Lüpke, Matthias; Wefstaedt, Patrick; Klopmann, Thilo; Nolte, Ingo; Seifert, Hermann

    2011-03-01

    The aim of this study was to develop an examination protocol for magnetic resonance imaging, in order to display diagnostically important information of the canine middle and inner ear. To ensure that this protocol could also be used as a basis for determining pathological changes, the anatomical structures of the ear were presented in detail. To minimise stress through anaesthesia in live animals, preliminary examinations were carried out on four dog cadavers. During these initial examinations, three-dimensional (3D) sequences proved to be superior to two-dimensional ones. Therefore, only 3D sequences were applied for the main examinations performed on six clinically healthy Beagles. The anonymised MR images were rated by three experienced reviewers using a five-point scale. The most valuable sequence was a T2-weighted CISS sequence (TR = 16.7 ms, TE = 8.08 ms). This sequence proved to be most suitable for illustrating the inner ear structures and enabled good tissue contrasts. The sequence ranked second best was also a T2-weighted DESS sequence (TR = 19 ms, TE = 6 ms), allowing the imaging of the tympanic cavity and enabling 3D reconstruction due to its isotropic voxels. Due to low contrast and strong noise, the other sequences (TSE, FISP, MP RAGE) were not suitable for anatomical illustration of the middle and inner ear.

  5. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Paliwal, B; Asprey, W; Yan, Y

    Purpose: In order to take advantage of the high resolution soft tissue imaging available in MR images, we investigated 3D images obtained with the low field 0.35 T MR in ViewRay to serve as an alternative to CT scans for radiotherapy treatment planning. In these images, normal and target structure delineation can be visualized. Assessment is based upon comparison with the CT images and the ability to produce comparable contours. Methods: Routine radiation oncology CT scans were acquired on five patients. Contours of brain, brainstem, esophagus, heart, lungs, spinal cord, and the external body were drawn. The same five patients were then scanned on the ViewRay TrueFISP-based imaging pulse sequence. The same organs were selected on the MR images and compared to those from the CT scan. Physical volume and the Dice Similarity Coefficient (DSC) were used to assess the contours from the two systems. Image quality stability was quantitatively ensured throughout the study following the recommendations of the ACR MR accreditation procedure. Results: The highest DSC of 0.985, 0.863, and 0.843 were observed for brain, lungs, and heart respectively. On the other hand, the brainstem, spinal cord, and esophagus had the lowest DSC. Volume agreement was most satisfied for the heart (within 5%) and the brain (within 2%). Contour volume for the brainstem and lung (a widely dynamic organ) varied the most (27% and 19%). Conclusion: The DSC and volume measurements suggest that the results obtained from ViewRay images are quantitatively consistent and comparable to those obtained from CT scans for the brain, heart, and lungs. MR images from ViewRay are well-suited for treatment planning and for adaptive MRI-guided radiotherapy. The physical data from 0.35 T MR imaging is consistent with our geometrical understanding of normal structures.
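
    The Dice Similarity Coefficient used in this abstract is defined as 2|A ∩ B| / (|A| + |B|) for two contours A and B. A minimal sketch follows, assuming each contour is represented as a set of voxel coordinates; the function name and the toy contours are hypothetical.

```python
def dice_coefficient(contour_a, contour_b):
    """Dice Similarity Coefficient of two contours given as voxel collections.

    Returns 1.0 for identical contours and 0.0 for disjoint ones.
    Two empty contours are treated as identical by convention here.
    """
    a, b = set(contour_a), set(contour_b)
    if not a and not b:
        return 1.0
    return 2 * len(a & b) / (len(a) + len(b))

# Hypothetical 2D contours as (row, column) voxel indices
ct_contour = {(0, 0), (0, 1), (1, 0), (1, 1)}
mr_contour = {(0, 1), (1, 1), (2, 1)}
print(dice_coefficient(ct_contour, mr_contour))
```

    Because the coefficient depends on overlap relative to total size, small structures lose proportionally more from a one-voxel boundary shift than large ones.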

  6. Biochemical evaluation of articular cartilage in patients with osteochondrosis dissecans by means of quantitative T2- and T2*-mapping at 3T MRI: a feasibility study.

    PubMed

    Marik, W; Apprich, S; Welsch, G H; Mamisch, T C; Trattnig, S

    2012-05-01

    To perform an in vivo evaluation comparing overlying articular cartilage in patients suffering from osteochondrosis dissecans (OCD) in the talocrural joint and healthy volunteers using quantitative T2 mapping at 3.0 T. Ten patients with OCD of Grade II or lower and 9 healthy age matched volunteers were examined at a 3.0 T whole body MR scanner using a flexible multi-element coil. In all investigated persons MRI included proton-density (PD)-FSE and 3D GRE (TrueFisp) sequences for morphological diagnosis and location of anatomical site and quantitative T2 and T2* maps. Region of interest (ROI) analysis was performed for the cartilage layer above the OCD and for a morphologically healthy graded cartilage layer. Mean T2 and T2* values were then statistically analysed. The cartilage layer of healthy volunteers showed mean T2 and T2* values of 29.4 ms (SD 4.9) and 11.8 ms (SD 2.7), respectively. In patients with OCD of grade I and II lesions mean T2 values were 40.9 ms (SD 6.6), 48.7 ms (SD 11.2) and mean T2* values were 16.1 ms (SD 3.2), 16.2 ms (SD 4.8). Therefore statistically significantly higher mean T2 and T2* values were found in patients suffering from OCD compared to healthy volunteers. T2 and T2* mapping can help assess the microstructural composition of cartilage overlying osteochondral lesions. Copyright © 2011 Elsevier Ireland Ltd. All rights reserved.

  7. Evaluation of ex-vivo 9.4T MRI in post-surgical specimens from temporal lobe epilepsy patients.

    PubMed

    Kwan, Benjamin Y M; Salehi, Fateme; Kope, Ryan; Lee, Donald H; Sharma, Manas; Hammond, Robert; Burneo, Jorge G; Steven, David; Peters, Terry; Khan, Ali R

    2017-10-01

    This study evaluates hippocampal pathology through usage of ultra-high field 9.4T ex-vivo imaging of resected surgical specimens in patients who have undergone temporal lobe epilepsy surgery. This is a retrospective interpretation of prospectively acquired data. MRI scanning of resected surgical specimens from patients who have undergone temporal lobe epilepsy surgery was performed on a 9.4T small bore Varian MR magnet. Structural images employed a balanced steady-state free precession sequence (TrueFISP). Six patients (3 females; 3 males) were included in this study with an average age at surgery of 40.7 years (range 20–60) (one was used as a control reference). Two neuroradiologists qualitatively reviewed the ex-vivo MRIs of resected specimens while blinded to the histopathology reports for the ability to identify abnormal features in hippocampal subfield structures. The hippocampal subfields were reliably identified on the 9.4T ex-vivo scans in the hippocampal head region and hippocampal body region by both neuroradiologists in all 6 patients. There was high concordance to pathology for abnormalities detected in the CA1, CA2, CA3 and CA4 subfields. Detection of abnormalities in the dentate gyrus was also high with detection in 4 of 5 cases. The Cohen's kappa between the two neuroradiologists was calculated at 0.734 SE=0.102. Ex-vivo 9.4T specimen imaging can detect abnormalities in CA1, CA2, CA3, CA4 and DG in both the hippocampal head and body. There was good concordance between qualitative findings and histopathological abnormalities for CA1, CA2, CA3, CA4 and DG. Copyright © 2017 Elsevier Masson SAS. All rights reserved.
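
    The inter-rater statistic reported in this abstract, Cohen's kappa, compares observed agreement p_o with the agreement p_e expected by chance: kappa = (p_o - p_e) / (1 - p_e). A minimal sketch follows; the function name and the ratings are hypothetical and do not reproduce the study data.

```python
from collections import Counter

def cohens_kappa(rater_a, rater_b):
    """Cohen's kappa for two raters labelling the same items.

    Undefined (division by zero) when chance agreement equals 1,
    i.e. when both raters use a single identical label throughout.
    """
    n = len(rater_a)
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    counts_a, counts_b = Counter(rater_a), Counter(rater_b)
    labels = set(counts_a) | set(counts_b)
    expected = sum(counts_a[label] * counts_b[label] for label in labels) / (n * n)
    return (observed - expected) / (1 - expected)

# Hypothetical ratings: 1 = abnormal subfield, 0 = normal subfield
reader_1 = [1, 1, 0, 0, 1, 0, 1, 1, 0, 0]
reader_2 = [1, 1, 0, 0, 1, 0, 1, 0, 0, 1]
print(cohens_kappa(reader_1, reader_2))
```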

  8. Use of pattern recognition for unaliasing simultaneously acquired slices in simultaneous multislice MR fingerprinting.

    PubMed

    Jiang, Yun; Ma, Dan; Bhat, Himanshu; Ye, Huihui; Cauley, Stephen F; Wald, Lawrence L; Setsompop, Kawin; Griswold, Mark A

    2017-11-01

    The purpose of this study is to accelerate an MR fingerprinting (MRF) acquisition by using a simultaneous multislice method. A multiband radiofrequency (RF) pulse was designed to excite two slices with different flip angles and phases. The signals of two slices were driven to be as orthogonal as possible. The mixed and undersampled MRF signal was matched to two dictionaries to retrieve T1 and T2 maps of each slice. Quantitative results from the proposed method were validated with the gold-standard spin echo methods in a phantom. T1 and T2 maps of in vivo human brain from two simultaneously acquired slices were also compared to the results of fast imaging with steady-state precession based MRF method (MRF-FISP) with a single-band RF excitation. The phantom results showed that the simultaneous multislice imaging MRF-FISP method quantified the relaxation properties accurately compared to the gold-standard spin echo methods. T1 and T2 values of in vivo brain from the proposed method also matched the results from the normal MRF-FISP acquisition. T1 and T2 values can be quantified at a multiband acceleration factor of two using our proposed acquisition even in a single-channel receive coil. Further acceleration could be achieved by combining this method with parallel imaging or iterative reconstruction. Magn Reson Med 78:1870-1876, 2017. © 2016 International Society for Magnetic Resonance in Medicine.

  9. [The optimization of chondromalacia patellae diagnosis by NMR tomography. The use of an apparatus for cartilage compression].

    PubMed

    König, H; Dinkelaker, F; Wolf, K J

    1991-08-01

    The aim of this study was to improve the MRI diagnosis of CMP, with special reference to the early stages and accurate staging. For this purpose, the retropatellar cartilage was examined by MRI while compression was carried out, using 21 patients and five normal controls. The compression was applied by means of a specially constructed device. Changes in cartilage thickness and signal intensity were evaluated quantitatively during FLASH and FISP sequences. In all patients the results of arthroscopies were available and in 12 patients, cartilage biopsies had been obtained. CMP stage I could be distinguished from normal cartilage by reduction in cartilage thickness and signal increase from the oedematous cartilage during compression. In CMP stages II/III, abnormal protein deposition of collagen type I could be demonstrated by its compressibility. In stages III and IV, the method does not add any significant additional information.

  10. Cardiac magnetic resonance findings predict increased resource utilization in elective coronary artery bypass grafting.

    PubMed

    Berry, Colin; Zimmerli, Lukas U; Steedman, Tracey; Foster, John E; Dargie, Henry J; Berg, Geoffrey A; Dominiczak, Anna F; Delles, Christian

    2008-03-01

    Morbidity following CABG (coronary artery bypass grafting) is difficult to predict and leads to increased healthcare costs. We hypothesized that pre-operative CMR (cardiac magnetic resonance) findings would predict resource utilization in elective CABG. Over a 12-month period, patients requiring elective CABG were invited to undergo CMR 1 day prior to CABG. Gadolinium-enhanced CMR was performed using a trueFISP inversion recovery sequence on a 1.5 tesla scanner (Sonata; Siemens). Clinical data were collected prospectively. Admission costs were quantified based on standardized actual cost/day. Admission cost greater than the median was defined as 'increased'. Of 458 elective CABG cases, 45 (10%) underwent pre-operative CMR. Pre-operative characteristics [mean (S.D.) age, 64 (9) years, mortality (1%) and median (interquartile range) admission duration, 7 (6-8) days] were similar in patients who did or did not undergo CMR. In the patients undergoing CMR, eight (18%) and 11 (24%) patients had reduced LV (left ventricular) systolic function by CMR [LVEF (LV ejection fraction) <55%] and echocardiography respectively. LE (late enhancement) with gadolinium was detected in 17 (38%) patients. The average cost/day was $2723. The median (interquartile range) admission cost was $19059 ($10891-157917). CMR LVEF {OR (odds ratio), 0.93 [95% CI (confidence interval), 0.87-0.99]; P=0.03} and SV (stroke volume) index [OR 1.07 (95% CI, 1.00-1.14); P=0.02] predicted increased admission cost. CMR LVEF (P=0.08) and EuroScore tended to predict actual admission cost (P=0.09), but SV by CMR (P=0.16) and LV function by echocardiography (P=0.95) did not. In conclusion, in this exploratory investigation, pre-operative CMR findings predicted admission duration and increased admission cost in elective CABG surgery. The cost-effectiveness of CMR in risk stratification in elective CABG surgery merits prospective assessment.

  11. Evaluation of intake efficiencies and associated sediment-concentration errors in US D-77 bag-type and US D-96-type depth-integrating suspended-sediment samplers

    USGS Publications Warehouse

    Sabol, Thomas A.; Topping, David J.

    2013-01-01

    Accurate measurements of suspended-sediment concentration require suspended-sediment samplers to operate isokinetically, within an intake-efficiency range of 1.0 ± 0.10, where intake efficiency is defined as the ratio of the velocity of the water through the sampler intake to the local ambient stream velocity. Local ambient stream velocity is defined as the velocity of the water in the river at the location of the nozzle, unaffected by the presence of the sampler. Results from Federal Interagency Sedimentation Project (FISP) laboratory experiments published in the early 1940s show that when the intake efficiency is less than 1.0, suspended-sediment samplers tend to oversample sediment relative to water, leading to potentially large positive biases in suspended-sediment concentration that are positively correlated with grain size. Conversely, these experiments show that, when the intake efficiency is greater than 1.0, suspended‑sediment samplers tend to undersample sediment relative to water, leading to smaller negative biases in suspended-sediment concentration that become slightly more negative as grain size increases. The majority of FISP sampler development and testing since the early 1990s has been conducted under highly uniform flow conditions via flume and slack-water tow tests, with relatively little work conducted under the greater levels of turbulence that exist in actual rivers. Additionally, all of this recent work has been focused on the hydraulic characteristics and intake efficiencies of these samplers, with no field investigations conducted on the accuracy of the suspended-sediment data collected with these samplers. 
When depth-integrating suspended-sediment samplers are deployed under the more nonuniform and turbulent conditions that exist in rivers, multiple factors may contribute to departures from isokinetic sampling, thus introducing errors into the suspended-sediment data collected by these samplers that may not be predictable on the basis of flume and tow tests alone. This study has three interrelated goals. First, the intake efficiencies of the older US D-77 bag-type and newer, FISP-approved US D-96-type depth-integrating suspended-sediment samplers are evaluated at multiple cross-sections under a range of actual-river conditions. The intake efficiencies measured in these actual-river tests are then compared to those previously measured in flume and tow tests. Second, other physical effects, mainly water temperature and the duration of sampling at a vertical, are examined to determine whether these effects can help explain observed differences in intake efficiency both between the two types of samplers and between the laboratory and field tests. Third, the signs and magnitudes of the likely errors in suspended-sand concentration in measurements made with both types of samplers are predicted based on the intake efficiencies of these two types of depth-integrating samplers. Using the relative difference in isokinetic sampling observed between the US D-77 bag-type and D-96-type samplers during river tests, measured differences in suspended-sediment concentration in a variety of size classes were evaluated between paired equal-discharge-increment (EDI) and equal-width-increment (EWI) measurements made with these two types of samplers to determine whether these differences in concentration are consistent with the differences in concentrations expected on the basis of the 1940s FISP laboratory experiments.
In addition, sequential single-vertical depth-integrated samples were collected (concurrent with velocity measurements) with the US D-96-type bag sampler and two different rigid-container samplers to evaluate whether the predicted errors in suspended-sand concentrations measured with the US D-96-type sampler are consistent with those expected on the basis of the 1940s FISP laboratory experiments. Results from our study indicate that the intake efficiency of the US D-96-type sampler is superior to that of the US D-77 bag-type sampler under actual-river conditions, with overall performance of the US D-96-type sampler being closer to, yet still typically below, the FISP-acceptable range of isokinetic operation. These results are in contrast to the results from FISP-conducted flume tests that showed that both the US D-77 bag-type and US D-96-type samplers sampled isokinetically in the laboratory. Results from our study indicate that the single largest problem with the behavior of both the US D-77 bag-type and the US D-96-type samplers under actual-river conditions is that both samplers are prone to large time-dependent decreases in intake efficiency as sampling duration increases. In the case of the US D-96-type sampler, this problem may be at least partially overcome by shortening the duration of sampling (or, instead, perhaps by a simple design improvement); in the case of the US D-77 bag-type sampler, although shortening the sampling duration improves the intake efficiency, it does not bring it into agreement with the FISP-accepted range of isokinetic operation. The predicted errors in suspended-sand concentration in EDI or EWI measurements made with the US D-96-type sampler are much smaller than those associated with EDI or EWI measurements made with the US D-77 bag-type sampler, especially when the results are corrected for the effects of water temperature and sampling duration.
The bias in the concentration in each size class measured using the US D-77 bag-type relative to the concentration measured using the US D-96-type sampler behaves in a manner consistent with that expected on the basis of the observed differences in intake efficiency between the two samplers in conjunction with the results from the 1940s FISP laboratory experiments. In addition, the bias in the concentration in each size class measured using the US D-96‑type sampler relative to the concentration measured using the truly isokinetic rigid-container samplers is in excellent agreement with that predicted on the basis of the 1940s FISP laboratory experiments. Because suspended-sediment samplers can respond differently between laboratory and field conditions, actual-river tests such as those in this study should be conducted when models of suspended-sediment samplers are changed from one type to another during the course of long-term monitoring programs. Otherwise, potential large differences in the suspended-sediment data collected by different types of samplers would lead to large step changes in sediment loads that may be misinterpreted as real, when, in fact, they are associated with the change in suspended‑sediment sampling equipment.
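
    The intake-efficiency definition and the direction of the resulting concentration bias described in this abstract can be illustrated with a minimal Python sketch. The function names and the example velocities are hypothetical and are not taken from the report; only the ratio definition, the 1.0 ± 0.10 isokinetic range, and the sign of the bias come from the abstract.

```python
def intake_efficiency(intake_velocity, ambient_velocity):
    """Ratio of water velocity through the sampler intake to the local
    ambient stream velocity (both in the same units)."""
    if ambient_velocity <= 0:
        raise ValueError("ambient velocity must be positive")
    return intake_velocity / ambient_velocity


def is_isokinetic(efficiency, tolerance=0.10):
    """True when the efficiency lies within 1.0 +/- tolerance."""
    return abs(efficiency - 1.0) <= tolerance


def expected_bias_sign(efficiency):
    """Direction of the concentration bias reported for the 1940s FISP
    laboratory experiments: oversampling of sediment below 1.0,
    undersampling above 1.0."""
    if efficiency < 1.0:
        return "positive"
    if efficiency > 1.0:
        return "negative"
    return "none"


# Hypothetical velocities in m/s, for illustration only.
e = intake_efficiency(0.8, 1.0)
print(e, is_isokinetic(e), expected_bias_sign(e))
```

    The sketch only encodes the qualitative rule; the magnitude of the bias, which the abstract says depends on grain size, is not modeled here.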

  12. Interventional magnetic resonance angiography with no strings attached: wireless active catheter visualization.

    PubMed

    Quick, Harald H; Zenge, Michael O; Kuehl, Hilmar; Kaiser, Gernot; Aker, Stephanie; Massing, Sandra; Bosk, Silke; Ladd, Mark E

    2005-02-01

    Active instrument visualization strategies for interventional MR angiography (MRA) require vascular instruments to be equipped with some type of radiofrequency (RF) coil or dipole RF antenna for MR signal detection. Such visualization strategies traditionally necessitate a connection to the scanner with either coaxial cable or laser fibers. In order to eliminate any wire connection, RF resonators that inductively couple their signal to MR surface coils were implemented into catheters to enable wireless active instrument visualization. Instrument background to contrast-to-noise ratio was systematically investigated as a function of the excitation flip angle. Signal coupling between the catheter RF coil and surface RF coils was evaluated qualitatively and quantitatively as a function of the catheter position and orientation with regard to the static magnetic field B0 and to the surface coils. In vivo evaluation of the instruments was performed in interventional MRA procedures on five pigs under MR guidance. Cartesian and projection reconstruction TrueFISP imaging enabled simultaneous visualization of the instruments and vascular morphology in real time. The implementation of RF resonators enabled robust visualization of the catheter curvature to the very tip. Additionally, the active visualization strategy does not require any wire connection to the scanner and thus does not hamper the interventionalist during the course of an intervention.

  13. MO-C-17A-08: Evaluation of Lung Deformation Using Three Dimensional Strain Maps

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Cui, T; Huang, Q; Miller, W

    2014-06-15

    Purpose: To develop a systematic approach to generate three dimensional (3D) strain maps of lung using the displacement vector field (DVF) during the respiratory deformation, and to demonstrate its application in evaluating deformable image registration (DIR). Methods: A DVF based strain tensor at each voxel of interest (VOI) was calculated from the relative displacements between the VOI and each of the six nearest neighbors. The maximum and minimum stretches of a VOI can be determined by the principal strains (E{sub 1}, E{sub 2} and E{sub 3}), which are the eigenvalues of the corresponding strain tensors. Two healthy volunteers were enrolled in this study under an IRB-approved protocol; each was scanned using 3D Hyperpolarized He-3 tagging-MRI and 3D proton-MRI with TrueFISP sequence at the end-of-inhalation (EOI) and the end-of-exhalation (EOE) phases. 3D DVFs of tagging- and proton-MRI were obtained by the direct measurements of the tagging grid trajectory and by the DIR method implemented in commercial software. Results: 3D strain maps were successfully generated for all DVFs. The principal strains E{sub 1} were calculated as 0.43±0.05 and 0.17±0.25 for tagging-MRI and proton-MRI, respectively. The large values of E{sub 1} indicate the predominant lung motion in the superior-inferior (SI) direction. Given that the DVFs from the tagging images are considered as the ground truth, the discrepancies in the DIR-based strain maps suggest the inaccuracy of the DIR algorithm. In the E{sub 1} maps of tagging-MRI for subject 1, the fissures were distinguishable by the larger values (0.49±0.02) from the adjacent tissues (0.41±0.03) due to the larger relative displacement between the lung lobes. Conclusion: We have successfully developed a methodology to generate DVF-based 3D strain maps of lung. It can potentially enable us to better understand the pulmonary biomechanics and to evaluate and improve the DIR algorithms for the lung deformation. We are currently studying more subjects to evaluate this tool.

  14. MR enterography in nonresponsive adult celiac disease: Correlation with endoscopic, pathologic, serologic, and genetic features.

    PubMed

    Radmard, Amir Reza; Hashemi Taheri, Amir Pejman; Salehian Nik, Elham; Kooraki, Soheil; Kolahdoozan, Shadi; Mirminachi, Babak; Sotoudeh, Masoud; Ekhlasi, Golnaz; Malekzadeh, Reza; Shahbazkhani, Bijan

    2017-10-01

    To assess small bowel abnormalities on magnetic resonance enterography (MRE) in adult patients with nonresponsive celiac disease (CD) and investigate their associations with endoscopic, histopathologic, serologic, and genetic features. This prospective study was carried out between September 2012 and August 2013. After approval by the Ethics Committee of our institution, informed consent was acquired from all participants. Forty consecutive patients with nonresponsive CD, aged 17-76 years, underwent MRE using a 1.5T unit. Sequences included T2-HASTE, True-FISP, pre- and postcontrast VIBE to assess the quantitative (number of ileal and jejunal folds) and qualitative (fold pattern abnormalities, mural thickening, increased enhancement, bowel dilatation, or intussusception) measures. Endoscopic manifestations were categorized as normal/mild vs. severe. Histopathological results were divided into mild and severe. Genotyping of HLA-DQ2 and DQ8 was performed. Serum levels of tissue-transglutaminase, endomysial, and gliadin antibodies were also determined. Logistic regression analysis and receiver operating characteristic (ROC) curve were used. Twenty-nine (72.5%) cases showed abnormal MRE. Reversed jejunoileal fold pattern had significant association with severe endoscopic (odds ratio [OR] = 8.38, 95% confidence interval [CI] 1.73-40.5) and pathologic features (OR = 7.36, 95% CI 1.33-40.54). An increased number of ileal folds/inch was significantly associated with severe MARSH score and positive HLA-DQ8 (P < 0.001 and P = 0.026, respectively). Ileal fold number had the highest areas under the curve for prediction of severe endoscopic (AUC: 0.75, P = 0.009) and pathologic (AUC: 0.84, P < 0.001) findings and positive anti-transglutaminase antibody (AUC: 0.85, P = 0.027). Fold pattern reversal on MRE is highly associated with endoscopic and pathologic features of refractory celiac disease (RCD).
Increased ileal folds showed higher correlation with endoscopic-pathologic features, HLA-DQ8, and anti-transglutaminase level. MRE might be more sensitive for detection of increased ileal folds in CD rather than reduction of duodenal and jejunal folds due to better distension of ileal loops. Level of Evidence: 2. Technical Efficacy: Stage 3. J. Magn. Reson. Imaging 2017;46:1096-1106. © 2017 International Society for Magnetic Resonance in Medicine.

  15. Accuracy for detection of simulated lesions: comparison of fluid-attenuated inversion-recovery, proton density--weighted, and T2-weighted synthetic brain MR imaging

    NASA Technical Reports Server (NTRS)

    Herskovits, E. H.; Itoh, R.; Melhem, E. R.

    2001-01-01

    OBJECTIVE: The objective of our study was to determine the effects of MR sequence (fluid-attenuated inversion-recovery [FLAIR], proton density--weighted, and T2-weighted) and of lesion location on sensitivity and specificity of lesion detection. MATERIALS AND METHODS: We generated FLAIR, proton density-weighted, and T2-weighted brain images with 3-mm lesions using published parameters for acute multiple sclerosis plaques. Each image contained from zero to five lesions that were distributed among cortical-subcortical, periventricular, and deep white matter regions; on either side; and anterior or posterior in position. We presented images of 540 lesions, distributed among 2592 image regions, to six neuroradiologists. We constructed a contingency table for image regions with lesions and another for image regions without lesions (normal). Each table included the following: the reviewer's number (1--6); the MR sequence; the side, position, and region of the lesion; and the reviewer's response (lesion present or absent [normal]). We performed chi-square and log-linear analyses. RESULTS: The FLAIR sequence yielded the highest true-positive rates (p < 0.001) and the highest true-negative rates (p < 0.001). Regions also differed in reviewers' true-positive rates (p < 0.001) and true-negative rates (p = 0.002). The true-positive rate model generated by log-linear analysis contained an additional sequence-location interaction. The true-negative rate model generated by log-linear analysis confirmed these associations, but no higher order interactions were added. CONCLUSION: We developed software with which we can generate brain images of a wide range of pulse sequences and that allows us to specify the location, size, shape, and intrinsic characteristics of simulated lesions. We found that the use of FLAIR sequences increases detection accuracy for cortical-subcortical and periventricular lesions over that associated with proton density- and T2-weighted sequences.

  16. Molecular analysis in true hermaphrodites with different karyotypes and similar phenotypes

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Torres, L.; Cervantes, A.; Kofman-Alfaro, S.

    1996-05-17

    True hermaphroditism is characterized by the development of ovarian and testicular tissue in the same individual. Muellerian and Wolffian structures are usually present, and external genitalia are often ambiguous. The most frequent karyotype in these patients is 46,XX or various forms of mosaicism, whereas 46,XY is very rarely found. The phenotype in all these subjects is similar. We studied 10 true hermaphrodites. Six of them had a 46,XX chromosomal complement: 3 had been reared as males and 3 as females. The other 4 patients were mosaics: 3 were 46,XX/46,XY and one had a 46,XX/47,XXY karyotype. One of the 46,XX/46,XY mosaics was reared as a female, whereas the other 3 mosaics were reared as males. The sex of assignment in the 10 patients depended only on labio-scrotal differentiation. Molecular studies in 46,XX subjects documented the absence of Y centromeric sequences in all cases, arguing against hidden mosaicism. One patient presented Yp sequences (ZFY+, SRY+), which contrast with South African black 46,XX true hermaphrodites in whom no Y sequences were found. Molecular analysis in the subjects with mosaicism demonstrated the presence of Y centromeric and Yp sequences confirming the presence of a Y chromosome. Gonadal development, endocrine function, and phenotype in the 10 patients did not correlate with the presence of a Y chromosome or Y-derived sequences in the genome, confirming that true hermaphroditism is a heterogeneous condition. Both Mexican and non-South African 46,XX true hermaphrodites may be SRY positive. 51 refs., 3 figs., 2 tabs.

  17. BlackOPs: increasing confidence in variant detection through mappability filtering.

    PubMed

    Cabanski, Christopher R; Wilkerson, Matthew D; Soloway, Matthew; Parker, Joel S; Liu, Jinze; Prins, Jan F; Marron, J S; Perou, Charles M; Hayes, D Neil

    2013-10-01

    Identifying variants using high-throughput sequencing data is currently a challenge because true biological variants can be indistinguishable from technical artifacts. One source of technical artifact results from incorrectly aligning experimentally observed sequences to their true genomic origin ('mismapping') and inferring differences in mismapped sequences to be true variants. We developed BlackOPs, an open-source tool that simulates experimental RNA-seq and DNA whole exome sequences derived from the reference genome, aligns these sequences by custom parameters, detects variants and outputs a blacklist of positions and alleles caused by mismapping. Blacklists contain thousands of artifact variants that are indistinguishable from true variants and, for a given sample, are expected to be almost completely false positives. We show that these blacklist positions are specific to the alignment algorithm and read length used, and BlackOPs allows users to generate a blacklist specific to their experimental setup. We queried the dbSNP and COSMIC variant databases and found numerous variants indistinguishable from mapping errors. We demonstrate how filtering against blacklist positions reduces the number of potential false variants using an RNA-seq glioblastoma cell line data set. In summary, accounting for mapping-caused variants tuned to experimental setups reduces false positives and, therefore, improves genome characterization by high-throughput sequencing.
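
    The filtering step described in this abstract, removing called variants whose position and allele appear on a mismapping blacklist, can be sketched in Python as follows. This is a minimal illustration, not the BlackOPs implementation: the tuple layout (chromosome, position, alternate allele), the function name, and the sample data are all assumptions made for the example.

```python
def filter_against_blacklist(variants, blacklist):
    """Split variant calls into (kept, removed) lists.

    variants:  iterable of (chrom, pos, alt_allele) tuples
    blacklist: set of (chrom, pos, alt_allele) tuples attributed to mismapping
    """
    kept, removed = [], []
    for variant in variants:
        # A call is dropped only when both position and allele match.
        (removed if variant in blacklist else kept).append(variant)
    return kept, removed


# Hypothetical calls and blacklist entries, for illustration only.
calls = [("chr1", 1000, "A"), ("chr1", 2000, "T"), ("chr2", 500, "G")]
blacklist = {("chr1", 2000, "T"), ("chr2", 500, "C")}

kept, removed = filter_against_blacklist(calls, blacklist)
print(kept)
print(removed)
```

    Matching on the allele as well as the position follows the abstract's description of a blacklist of "positions and alleles"; a call at a blacklisted position with a different allele is retained in this sketch.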

  18. [Depiction of the cranial nerves around the cavernous sinus by 3D reversed FISP with diffusion weighted imaging (3D PSIF-DWI)].

    PubMed

    Ishida, Go; Oishi, Makoto; Jinguji, Shinya; Yoneoka, Yuichiro; Sato, Mitsuya; Fujii, Yukihiko

    2011-10-01

    To evaluate the anatomy of cranial nerves running in and around the cavernous sinus, we employed three-dimensional reversed fast imaging with steady-state precession (FISP) with diffusion weighted imaging (3D PSIF-DWI) on 3-T magnetic resonance (MR) system. After determining the proper parameters to obtain sufficient resolution of 3D PSIF-DWI, we collected imaging data of 20-side cavernous regions in 10 normal subjects. 3D PSIF-DWI provided high contrast between the cranial nerves and other soft tissues, fluid, and blood in all subjects. We also created volume-rendered images of 3D PSIF-DWI and anatomically evaluated the reliability of visualizing optic, oculomotor, trochlear, trigeminal, and abducens nerves on 3D PSIF-DWI. All 20 sets of cranial nerves were visualized and 12 trochlear nerves and 6 abducens nerves were partially identified. We also presented preliminary clinical experiences in two cases with pituitary adenomas. The anatomical relationship between the tumor and cranial nerves running in and around the cavernous sinus could be three-dimensionally comprehended by 3D PSIF-DWI and the volume-rendered images. In conclusion, 3D PSIF-DWI has great potential to provide high resolution "cranial nerve imaging", which visualizes the whole length of the cranial nerves including the parts in the blood flow as in the cavernous sinus region.

  19. Pseudorandom number generation using chaotic true orbits of the Bernoulli map

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Saito, Asaki; Yamaguchi, Akihiro

    We devise a pseudorandom number generator that exactly computes chaotic true orbits of the Bernoulli map on quadratic algebraic integers. Moreover, we describe a way to select the initial points (seeds) for generating multiple pseudorandom binary sequences. This selection method distributes the initial points almost uniformly (equidistantly) in the unit interval, and latter parts of the generated sequences are guaranteed not to coincide. We also demonstrate through statistical testing that the generated sequences possess good randomness properties.
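
    The idea of computing a true (exact) orbit of the Bernoulli map x -> 2x mod 1 can be illustrated with integer arithmetic in Python. The sketch below is not the authors' generator and omits their seed-selection method; it assumes an initial point of the form x0 = a + b*sqrt(d) with integers a, b > 0, a non-square d > 0, and 0 <= x0 < 1, and the function name is hypothetical. Because every comparison is done on integers, no rounding error enters the orbit.

```python
from math import isqrt


def bernoulli_true_orbit_bits(a, b, d, n):
    """Return the first n output bits of the exact orbit of
    x0 = a + b*sqrt(d) under the Bernoulli map x -> 2x mod 1.

    Assumes integers a, b, d with b > 0, d > 0 not a perfect square,
    and 0 <= x0 < 1 (the last condition is not checked here).
    """
    if b <= 0 or d <= 0 or isqrt(d) ** 2 == d:
        raise ValueError("need b > 0 and a positive non-square d")
    bits = []
    for _ in range(n):
        a, b = 2 * a, 2 * b          # 2x = a + b*sqrt(d), exactly
        r = 1 - a                    # bit is 1 iff b*sqrt(d) >= r
        bit = 1 if (r <= 0 or b * b * d >= r * r) else 0
        a -= bit                     # subtract the integer part (mod 1)
        bits.append(bit)
    return bits


# x0 = sqrt(2) - 1, so the bits are the binary expansion of sqrt(2) - 1.
print(bernoulli_true_orbit_bits(-1, 1, 2, 16))
```

    The coefficients a and b grow by about one bit per iteration, so arbitrary-precision integers are required; Python's built-in integers provide this.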

  20. MRI of the small bowel: can sufficient bowel distension be achieved with small volumes of oral contrast?

    PubMed

    Kinner, Sonja; Kuehle, Christiane A; Herbig, Sebastian; Haag, Sebastian; Ladd, Susanne C; Barkhausen, Joerg; Lauenstein, Thomas C

    2008-11-01

    Sufficient luminal distension is mandatory for small bowel imaging. However, patients often are unable to ingest volumes of currently applied oral contrast compounds. The aim of this study was to evaluate if administration of low doses of an oral contrast agent with high-osmolarity leads to sufficient and diagnostic bowel distension. Six healthy volunteers ingested at different occasions 150, 300 and 450 ml of a commercially available oral contrast agent (Banana Smoothie Readi-Cat, E-Z-EM; 194 mOsmol/l). Two-dimensional TrueFISP data sets were acquired in 5-min intervals up to 45 min after contrast ingestion. Small bowel distension was quantified using a visual five-grade ranking (5 = very good distension, 1 = collapsed bowel). Results were statistically compared using a Wilcoxon-Rank test. Ingestion of 450 ml and 300 ml resulted in a significantly better distension than 150 ml. The all-over average distension value for 450 ml amounted to 3.4 (300 ml: 3.0, 150 ml: 2.3) and diagnostic bowel distension could be found throughout the small intestine. Even 45 min after ingestion of 450 ml the jejunum and ileum could be reliably analyzed. Small bowel imaging with low doses of contrast leads to diagnostic distension values in healthy subjects when a high-osmolarity substance is applied. These findings may help to further refine small bowel MRI techniques, but need to be confirmed in patients with small bowel disorders.

  1. Magnetic resonance colonography without bowel cleansing: a prospective cross sectional study in a screening population

    PubMed Central

    Kuehle, Christiane A; Langhorst, Jost; Ladd, Susanne C; Zoepf, Thomas; Nuefer, Michael; Grabellus, Florian; Barkhausen, Joerg; Gerken, Guido; Lauenstein, Thomas C

    2007-01-01

    Background and aim: To evaluate the diagnostic accuracy of magnetic resonance colonography (MRC) without bowel cleansing in a screening population and compare the results to colonoscopy as a standard of reference. Methods: 315 screening patients, older than 50 years with a normal risk profile for colorectal cancer, were included in this study. For MRC, a tagging agent (5.0% Gastrografin, 1.0% barium sulphate, 0.2% locust bean gum) was ingested with each main meal within 2 days prior to MRC. No bowel cleansing was applied. For the magnetic resonance examination, a rectal water enema was administered. Data collection was based on contrast enhanced T1 weighted images and TrueFISP images. Magnetic resonance data were analysed for image quality and the presence of colorectal lesions. Conventional colonoscopy and histopathological samples served as reference. Results: In 4% of all colonic segments, magnetic resonance image quality was insufficient because of untagged faecal material. Adenomatous polyps >5 mm were detected by means of MRC, with a sensitivity of 83.0%. Overall specificity was 90.2% (false positive findings in 19 patients). However, only 16 of 153 lesions <5 mm and 9 of 127 hyperplastic polyps could be visualised on magnetic resonance images. Conclusions: Faecal tagging MRC is applicable for screening purposes. It provides good accuracy for the detection of relevant (ie, adenomatous) colorectal lesions >5 mm in a screening population. However, refinements to optimise image quality of faecal tagging are needed. PMID:17341542

  2. WE-G-17A-01: Improving Tracking Image Spatial Resolution for Onboard MR Image Guided Radiation Therapy Using the WHISKEE Technique

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Hu, Y; Mutic, S; Du, D

    Purpose: To evaluate the feasibility of using the weighted hybrid iterative spiral k-space encoded estimation (WHISKEE) technique to improve spatial resolution of tracking images for onboard MR image guided radiation therapy (MR-IGRT). Methods: MR tracking images of abdomen and pelvis had been acquired from healthy volunteers using the ViewRay onboard MR-IGRT system (ViewRay Inc., Oakwood Village, OH) at a spatial resolution of 2.0 mm × 2.0 mm × 5.0 mm. The tracking MR images were acquired using the TrueFISP sequence. The temporal resolution had to be traded off to 2 frames per second (FPS) to achieve the 2.0 mm in-plane spatial resolution. All MR images were imported into the MATLAB software. K-space data were synthesized through the Fourier Transform of the MR images. A mask was created to select k-space points that corresponded to the under-sampled spiral k-space trajectory with an acceleration (or undersampling) factor of 3. The mask was applied to the fully sampled k-space data to synthesize the undersampled k-space data. The WHISKEE method was applied to the synthesized undersampled k-space data to reconstruct tracking MR images at 6 FPS. As a comparison, the undersampled k-space data were also reconstructed using the zero-padding technique. The reconstructed images were compared to the original image. The relative reconstruction error was evaluated using the percentage of the norm of the differential image over the norm of the original image. Results: Compared to the zero-padding technique, the WHISKEE method was able to reconstruct MR images with better image quality. It significantly reduced the relative reconstruction error from 39.5% to 3.1% for the pelvis image and from 41.5% to 4.6% for the abdomen image at an acceleration factor of 3. Conclusion: We demonstrated that it was possible to use the WHISKEE method to expedite MR image acquisition for onboard MR-IGRT systems to achieve good spatial and temporal resolutions simultaneously. Y. Hu and O. Green receive travel reimbursement from ViewRay. S. Mutic has consulting and research agreements with ViewRay. Q. Zeng, R. Nana, J.L. Patrick, S. Shvartsman and J.F. Dempsey are ViewRay employees.
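
    The error metric named in this abstract, the norm of the difference image as a percentage of the norm of the original image, can be written as a short Python sketch. The abstract does not say which norm was used; the Euclidean (Frobenius) norm over all pixels is assumed here, and the function name and the tiny test images are hypothetical.

```python
import math


def relative_reconstruction_error(original, reconstructed):
    """Percentage ratio ||original - reconstructed|| / ||original||.

    Both images are 2D lists of numbers with identical shape; the
    Euclidean (Frobenius) norm over all pixels is an assumption.
    """
    diff_sq = 0.0
    orig_sq = 0.0
    for row_o, row_r in zip(original, reconstructed):
        for o, r in zip(row_o, row_r):
            diff_sq += (o - r) ** 2
            orig_sq += o ** 2
    if orig_sq == 0:
        raise ValueError("original image has zero norm")
    return 100.0 * math.sqrt(diff_sq) / math.sqrt(orig_sq)


# Hypothetical 1x2 images: difference norm 4, original norm 5.
print(relative_reconstruction_error([[3.0, 4.0]], [[3.0, 0.0]]))
```

    Under this definition a value of 0% means the reconstruction matches the original pixel for pixel, which is how the reported drop from about 40% to under 5% should be read.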

  3. Next-generation sequencing can reveal in vitro-generated PCR crossover products: some artifactual sequences correspond to HLA alleles in the IMGT/HLA database.

    PubMed

    Holcomb, C L; Rastrou, M; Williams, T C; Goodridge, D; Lazaro, A M; Tilanus, M; Erlich, H A

    2014-01-01

    The high-resolution human leukocyte antigen (HLA) genotyping assay that we developed using 454 sequencing and Conexio software uses generic polymerase chain reaction (PCR) primers for DRB exon 2. Occasionally, we observed low abundance DRB amplicon sequences that resulted from in vitro PCR 'crossing over' between DRB1 and DRB3/4/5. These hybrid sequences, revealed by the clonal sequencing property of the 454 system, were generally observed at a read depth of 5%-10% of the true alleles. They usually contained at least one mismatch with the IMGT/HLA database, and consequently, were easily recognizable and did not cause a problem for HLA genotyping. Sometimes, however, these artifactual sequences matched a rare allele and the automatic genotype assignment was incorrect. These observations raised two issues: (1) could PCR conditions be modified to reduce such artifacts? and (2) could some of the rare alleles listed in the IMGT/HLA database be artifacts rather than true alleles? Because PCR crossing over occurs during late cycles of PCR, we compared DRB genotypes resulting from 28 and (our standard) 35 cycles of PCR. For all 21 cell line DNAs amplified for 35 cycles, crossover products were detected. In 33% of the cases, these hybrid sequences corresponded to named alleles. With amplification for only 28 cycles, these artifactual sequences were not detectable. To investigate whether some rare alleles in the IMGT/HLA database might be due to PCR artifacts, we analyzed four samples obtained from the investigators who submitted the sequences. In three cases, the sequences were generated from true alleles. In one case, our 454 sequencing revealed an error in the previously submitted sequence. © 2013 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.

  4. Steady-state MRA techniques with a blood pool contrast agent improve visualization of pulmonary venous anatomy and left atrial patency compared with time-resolved MRA pre- and postcatheter ablation in atrial fibrillation.

    PubMed

    Rustogi, Rahul; Galizia, Mauricio; Thakrar, Darshit; Merritt, Bryce; Bi, Xiaoming; Collins, Jeremy; Carr, James C

    2015-11-01

    To compare steady-state magnetic resonance angiography (SS-MRA), using a blood pool contrast agent, with the established technique of time-resolved MRA (TR-MRA), in pulmonary vein mapping and left atrial patency. Twenty-one patients (12 males, age 58.3 ± 8.4 years; 9 females; 57 ± 10 years) undergoing pulmonary vein mapping were evaluated with TR-MRA (TWIST) and SS-MRA. Orthogonal measurements and areas for four veins per patient per technique were assessed by Friedman's test. Overall intertechnique mean difference for any pulmonary vein orthogonal measurement and area was 0.02 ± 0.34 cm (P = 0.705), and 0.2 ± 0.08 cm² (P < 0.001). Interobserver correlation was strong for diameter and area measurements using the three methods with a range of 0.72-0.94, and 0.87-0.97, respectively. Left atrial appendage image quality score for TR-MRA was significantly lower than the other two methods (P < 0.001). Both observers detected more stenosis on inversion recovery (IR)-True FISP compared to TR-MRA and IR-FLASH. SS-MRA with a blood pool agent compared favorably to the established technique of TR-MRA for quantitative assessment of pulmonary venous anatomy. SS-MRA offers greater spatial resolution than TR-MRA with increased confidence for ruling out left atrial appendage filling defect. © 2015 Wiley Periodicals, Inc.

  5. Repeatability of magnetic resonance fingerprinting T1 and T2 estimates assessed using the ISMRM/NIST MRI system phantom.

    PubMed

    Jiang, Yun; Ma, Dan; Keenan, Kathryn E; Stupic, Karl F; Gulani, Vikas; Griswold, Mark A

    2017-10-01

    The purpose of this study was to evaluate accuracy and repeatability of T1 and T2 estimates of a MR fingerprinting (MRF) method using the ISMRM/NIST MRI system phantom. The ISMRM/NIST MRI system phantom contains multiple compartments with standardized T1, T2, and proton density values. Conventional inversion-recovery spin echo and spin echo methods were used to characterize the T1 and T2 values in the phantom. The phantom was scanned using the MRF-FISP method over 34 consecutive days. The mean T1 and T2 values were compared with the values from the spin echo methods. The repeatability was characterized as the coefficient of variation of the measurements over 34 days. T1 and T2 values from MRF-FISP over 34 days showed a strong linear correlation with the measurements from the spin echo methods (R² = 0.999 for T1; R² = 0.996 for T2). The MRF estimates over the wide ranges of T1 and T2 values have less than 5% variation, except for the shortest T2 relaxation times where the method still maintains less than 8% variation. MRF measurements of T1 and T2 are highly repeatable over time and across wide ranges of T1 and T2 values. Magn Reson Med 78:1452-1457, 2017. © 2016 International Society for Magnetic Resonance in Medicine.
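
    The repeatability measure used in this abstract, the coefficient of variation of repeated measurements, can be computed as in the Python sketch below. Whether the study used the sample or the population standard deviation is not stated; the sample standard deviation is assumed here, and the function name and the example values are hypothetical rather than data from the study.

```python
import statistics


def coefficient_of_variation(values):
    """Coefficient of variation in percent: sample standard deviation
    divided by the mean of the repeated measurements."""
    mean = statistics.mean(values)
    if mean == 0:
        raise ValueError("mean is zero; coefficient of variation undefined")
    return 100.0 * statistics.stdev(values) / mean


# Hypothetical repeated T1 estimates (ms) for one phantom compartment.
daily_t1 = [100.0, 102.0, 98.0]
print(coefficient_of_variation(daily_t1))
```

    A compartment would meet the "less than 5% variation" description in the abstract when this function returns a value below 5 for its series of daily estimates.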

  6. Phylogenetic analysis of the true water bugs (Insecta: Hemiptera: Heteroptera: Nepomorpha): evidence from mitochondrial genomes

    PubMed Central

    Hua, Jimeng; Li, Ming; Dong, Pengzhi; Cui, Ying; Xie, Qiang; Bu, Wenjun

    2009-01-01

    Background: The true water bugs are grouped in infraorder Nepomorpha (Insecta: Hemiptera: Heteroptera) and are of great economic importance. The phylogenetic relationships within Nepomorpha and the taxonomic hierarchies of Pleoidea and Aphelocheiroidea are uncertain. Most of the previous studies were based on morphological characters without algorithmic assessment. In the latest study, the molecular markers employed in phylogenetic analyses were partial sequences of 16S rDNA and 18S rDNA with a total length about 1 kb. Up to now, no mitochondrial genome of the true water bugs has been sequenced, which is one of the largest data sets that could be compared across animal taxa. In this study we analyzed the unresolved problems in Nepomorpha using evidence from mitochondrial genomes. Results: Nine mitochondrial genomes of Nepomorpha and five of other hemipterans were sequenced. These mitochondrial genomes contain the commonly found 37 genes without gene rearrangements. Based on the nucleotide sequences of mt-genomes, Pleoidea is not a member of the Nepomorpha and Aphelocheiroidea should be grouped back into Naucoroidea. Phylogenetic relationships among the superfamilies of Nepomorpha were resolved robustly. Conclusion: The mt-genome is an effective data source for resolving intraordinal phylogenetic problems at the superfamily level within Heteroptera. The mitochondrial genomes of the true water bugs are typical insect mt-genomes. Based on the nucleotide sequences of the mt-genomes, we propose the Pleoidea to be a separate heteropteran infraorder. The infraorder Nepomorpha consists of five superfamilies with the relationships (Corixoidea + ((Naucoroidea + Notonectoidea) + (Ochteroidea + Nepoidea))). PMID:19523246

  7. Limited copy number - high resolution melting (LCN-HRM) enables the detection and identification by sequencing of low level mutations in cancer biopsies

    PubMed Central

    Do, Hongdo; Dobrovic, Alexander

    2009-01-01

    Background Mutation detection in clinical tumour samples is challenging when the proportion of tumour cells, and thus mutant alleles, is low. The limited sensitivity of conventional sequencing necessitates the adoption of more sensitive approaches. High resolution melting (HRM) is more sensitive than sequencing but identification of the mutation is desirable, particularly when it is important to discriminate false positives due to PCR errors or template degradation from true mutations. We thus developed limited copy number - high resolution melting (LCN-HRM) which applies limiting dilution to HRM. Multiple replicate reactions with a limited number of target sequences per reaction allow low level mutations to be detected. The dilutions used (based on Ct values) are chosen such that mutations, if present, can be detected by the direct sequencing of amplicons with aberrant melting patterns. Results Using cell lines heterozygous for mutations, we found that the mutations were not readily detected when they comprised 10% of total alleles (20% tumour cells) by sequencing, whereas they were readily detectable at 5% total alleles by standard HRM. LCN-HRM allowed these mutations to be identified by direct sequencing of those positive reactions. LCN-HRM was then used to review formalin-fixed paraffin-embedded (FFPE) clinical samples showing discordant findings between sequencing and HRM for KRAS exon 2 and EGFR exons 19 and 21. Both true mutations present at low levels and sequence changes due to artefacts were detected by LCN-HRM. The use of high fidelity polymerases showed that the majority of the artefacts were derived from the damaged template rather than replication errors during amplification. Conclusion LCN-HRM bridges the sensitivity gap between HRM and sequencing and is effective in distinguishing between artefacts and true mutations. PMID:19811662

  8. Limited copy number-high resolution melting (LCN-HRM) enables the detection and identification by sequencing of low level mutations in cancer biopsies.

    PubMed

    Do, Hongdo; Dobrovic, Alexander

    2009-10-08

    Mutation detection in clinical tumour samples is challenging when the proportion of tumour cells, and thus mutant alleles, is low. The limited sensitivity of conventional sequencing necessitates the adoption of more sensitive approaches. High resolution melting (HRM) is more sensitive than sequencing but identification of the mutation is desirable, particularly when it is important to discriminate false positives due to PCR errors or template degradation from true mutations. We thus developed limited copy number - high resolution melting (LCN-HRM) which applies limiting dilution to HRM. Multiple replicate reactions with a limited number of target sequences per reaction allow low level mutations to be detected. The dilutions used (based on Ct values) are chosen such that mutations, if present, can be detected by the direct sequencing of amplicons with aberrant melting patterns. Using cell lines heterozygous for mutations, we found that the mutations were not readily detected when they comprised 10% of total alleles (20% tumour cells) by sequencing, whereas they were readily detectable at 5% total alleles by standard HRM. LCN-HRM allowed these mutations to be identified by direct sequencing of those positive reactions. LCN-HRM was then used to review formalin-fixed paraffin-embedded (FFPE) clinical samples showing discordant findings between sequencing and HRM for KRAS exon 2 and EGFR exons 19 and 21. Both true mutations present at low levels and sequence changes due to artefacts were detected by LCN-HRM. The use of high fidelity polymerases showed that the majority of the artefacts were derived from the damaged template rather than replication errors during amplification. LCN-HRM bridges the sensitivity gap between HRM and sequencing and is effective in distinguishing between artefacts and true mutations.

  9. Reproducibility of Illumina platform deep sequencing errors allows accurate determination of DNA barcodes in cells.

    PubMed

    Beltman, Joost B; Urbanus, Jos; Velds, Arno; van Rooij, Nienke; Rohr, Jan C; Naik, Shalin H; Schumacher, Ton N

    2016-04-02

    Next generation sequencing (NGS) of amplified DNA is a powerful tool to describe genetic heterogeneity within cell populations that can both be used to investigate the clonal structure of cell populations and to perform genetic lineage tracing. For applications in which both abundant and rare sequences are biologically relevant, the relatively high error rate of NGS techniques complicates data analysis, as it is difficult to distinguish rare true sequences from spurious sequences that are generated by PCR or sequencing errors. This issue, for instance, applies to cellular barcoding strategies that aim to follow the amount and type of offspring of single cells, by supplying these with unique heritable DNA tags. Here, we use genetic barcoding data from the Illumina HiSeq platform to show that straightforward read threshold-based filtering of data is typically insufficient to filter out spurious barcodes. Importantly, we demonstrate that specific sequencing errors occur at an approximately constant rate across different samples that are sequenced in parallel. We exploit this observation by developing a novel approach to filter out spurious sequences. Application of our new method demonstrates its value in the identification of true sequences amongst spurious sequences in biological data sets.

  10. Improving de novo sequence assembly using machine learning and comparative genomics for overlap correction.

    PubMed

    Palmer, Lance E; Dejori, Mathaeus; Bolanos, Randall; Fasulo, Daniel

    2010-01-15

    With the rapid expansion of DNA sequencing databases, it is now feasible to identify relevant information from prior sequencing projects and completed genomes and apply it to de novo sequencing of new organisms. As an example, this paper demonstrates how such extra information can be used to improve de novo assemblies by augmenting the overlapping step. Finding all pairs of overlapping reads is a key task in many genome assemblers, and to this end, highly efficient algorithms have been developed to find alignments in large collections of sequences. It is well known that due to repeated sequences, many aligned pairs of reads nevertheless do not overlap. But no overlapping algorithm to date takes a rigorous approach to separating aligned but non-overlapping read pairs from true overlaps. We present an approach that extends the Minimus assembler by a data driven step to classify overlaps as true or false prior to contig construction. We trained several different classification models within the Weka framework using various statistics derived from overlaps of reads available from prior sequencing projects. These statistics included percent mismatch and k-mer frequencies within the overlaps as well as a comparative genomics score derived from mapping reads to multiple reference genomes. We show that in real whole-genome sequencing data from the E. coli and S. aureus genomes, by providing a curated set of overlaps to the contigging phase of the assembler, we nearly doubled the median contig length (N50) without sacrificing coverage of the genome or increasing the number of mis-assemblies. Machine learning methods that use comparative and non-comparative features to classify overlaps as true or false can be used to improve the quality of a sequence assembly.
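
    The abstract reports its result as N50, which it glosses as a median contig length. N50 is more precisely a length-weighted median: the contig length at which the running total of lengths, taken from longest to shortest, first reaches half of the assembly size. A minimal sketch of that calculation follows. The function name and the example lengths are illustrative and are not taken from the paper or from Minimus.

```python
def n50(contig_lengths):
    """Return the N50 of a collection of contig lengths."""
    lengths = sorted(contig_lengths, reverse=True)
    if not lengths:
        raise ValueError("at least one contig length is required")
    half_total = sum(lengths) / 2
    running = 0
    for length in lengths:
        running += length
        # The first contig that pushes the running sum to half the
        # assembly size defines N50.
        if running >= half_total:
            return length


# Hypothetical contig lengths (in kb) for a toy assembly.
example_lengths = [2, 2, 2, 3, 3, 4, 8, 8]
print(n50(example_lengths))
```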

  11. Somatic Point Mutation Calling in Low Cellularity Tumors

    PubMed Central

    Kassahn, Karin S.; Holmes, Oliver; Nones, Katia; Patch, Ann-Marie; Miller, David K.; Christ, Angelika N.; Harliwong, Ivon; Bruxner, Timothy J.; Xu, Qinying; Anderson, Matthew; Wood, Scott; Leonard, Conrad; Taylor, Darrin; Newell, Felicity; Song, Sarah; Idrisoglu, Senel; Nourse, Craig; Nourbakhsh, Ehsan; Manning, Suzanne; Wani, Shivangi; Steptoe, Anita; Pajic, Marina; Cowley, Mark J.; Pinese, Mark; Chang, David K.; Gill, Anthony J.; Johns, Amber L.; Wu, Jianmin; Wilson, Peter J.; Fink, Lynn; Biankin, Andrew V.; Waddell, Nicola; Grimmond, Sean M.; Pearson, John V.

    2013-01-01

    Somatic mutation calling from next-generation sequencing data remains a challenge due to the difficulties of distinguishing true somatic events from artifacts arising from PCR, sequencing errors or mis-mapping. Tumor cellularity or purity, sub-clonality and copy number changes also confound the identification of true somatic events against a background of germline variants. We have developed a heuristic strategy and software (http://www.qcmg.org/bioinformatics/qsnp/) for somatic mutation calling in samples with low tumor content and we show the superior sensitivity and precision of our approach using a previously sequenced cell line, a series of tumor/normal admixtures, and 3,253 putative somatic SNVs verified on an orthogonal platform. PMID:24250782

  12. Correcting for sequencing error in maximum likelihood phylogeny inference.

    PubMed

    Kuhner, Mary K; McGill, James

    2014-11-04

    Accurate phylogenies are critical to taxonomy as well as studies of speciation processes and other evolutionary patterns. Accurate branch lengths in phylogenies are critical for dating and rate measurements. Such accuracy may be jeopardized by unacknowledged sequencing error. We use simulated data to test a correction for DNA sequencing error in maximum likelihood phylogeny inference. Over a wide range of data polymorphism and true error rate, we found that correcting for sequencing error improves recovery of the branch lengths, even if the assumed error rate is up to twice the true error rate. Low error rates have little effect on recovery of the topology. When error is high, correction improves topological inference; however, when error is extremely high, using an assumed error rate greater than the true error rate leads to poor recovery of both topology and branch lengths. The error correction approach tested here was proposed in 2004 but has not been widely used, perhaps because researchers do not want to commit to an estimate of the error rate. This study shows that correction with an approximate error rate is generally preferable to ignoring the issue. Copyright © 2014 Kuhner and McGill.
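
    One common way to fold an assumed sequencing error rate into a likelihood calculation is to soften the tip data: instead of assigning probability 1 to the observed base and 0 to the others, assign 1 - e to the observed base and e/3 to each alternative. The sketch below illustrates that idea only. Whether it matches the exact correction tested in this study is an assumption, and the function name is hypothetical.

```python
BASES = "ACGT"


def tip_likelihoods(observed_base, error_rate):
    """Return P(observed base | true base) for each possible true base.

    Assumes errors are equally likely to produce any of the three
    other bases (an illustrative simplification).
    """
    if observed_base not in BASES:
        raise ValueError("observed_base must be one of A, C, G, T")
    if not 0.0 <= error_rate <= 1.0:
        raise ValueError("error_rate must lie in [0, 1]")
    return {
        base: (1.0 - error_rate) if base == observed_base else error_rate / 3.0
        for base in BASES
    }


# With an assumed error rate of 1%, an observed "A" still leaves a
# small probability that the true base was C, G, or T.
print(tip_likelihoods("A", 0.01))
```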

  13. Preclinical Magnetic Resonance Fingerprinting (MRF) at 7 T: Effective Quantitative Imaging for Rodent Disease Models

    PubMed Central

    Gao, Ying; Chen, Yong; Ma, Dan; Jiang, Yun; Herrmann, Kelsey A.; Vincent, Jason A.; Dell, Katherine M.; Drumm, Mitchell L.; Brady-Kalnay, Susann M.; Griswold, Mark A.; Flask, Chris A.; Lu, Lan

    2015-01-01

    High field, preclinical magnetic resonance imaging (MRI) scanners are now commonly used to quantitatively assess disease status and efficacy of novel therapies in a wide variety of rodent models. Unfortunately, conventional MRI methods are highly susceptible to respiratory and cardiac motion artifacts resulting in potentially inaccurate and misleading data. We have developed an initial preclinical, 7.0 T MRI implementation of the highly novel Magnetic Resonance Fingerprinting (MRF) methodology that has been previously described for clinical imaging applications. The MRF technology combines a priori variation in the MRI acquisition parameters with dictionary-based matching of acquired signal evolution profiles to simultaneously generate quantitative maps of T1 and T2 relaxation times and proton density. This preclinical MRF acquisition was constructed from a Fast Imaging with Steady-state Free Precession (FISP) MRI pulse sequence to acquire 600 MRF images with both evolving T1 and T2 weighting in approximately 30 minutes. This initial high field preclinical MRF investigation demonstrated reproducible and differentiated estimates of in vitro phantoms with different relaxation times. In vivo preclinical MRF results in mouse kidneys and brain tumor models demonstrated an inherent resistance to respiratory motion artifacts as well as sensitivity to known pathology. These results suggest that MRF methodology may offer the opportunity for quantification of numerous MRI parameters for a wide variety of preclinical imaging applications. PMID:25639694
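
    The dictionary-based matching step described in this abstract can be sketched as follows: each dictionary entry is a simulated signal evolution for one (T1, T2) pair, and a measured signal is assigned the pair whose entry has the largest normalized inner product with it. The dictionary values and the measured signal below are toy numbers invented for illustration. A real MRF dictionary would be simulated from the pulse sequence and would be far larger.

```python
import math


def normalize(signal):
    """Scale a signal to unit Euclidean norm."""
    norm = math.sqrt(sum(x * x for x in signal))
    if norm == 0:
        raise ValueError("cannot normalize an all-zero signal")
    return [x / norm for x in signal]


def match_fingerprint(measured, dictionary):
    """Return the (T1, T2) key and score of the best-matching entry."""
    unit_measured = normalize(measured)
    best_key, best_score = None, -1.0
    for key, entry in dictionary.items():
        unit_entry = normalize(entry)
        # Normalized inner product: 1.0 means identical shape.
        score = abs(sum(a * b for a, b in zip(unit_measured, unit_entry)))
        if score > best_score:
            best_key, best_score = key, score
    return best_key, best_score


# Toy dictionary: keys are hypothetical (T1 ms, T2 ms) pairs.
toy_dictionary = {
    (800, 50): [1.0, 0.6, 0.3, 0.1],
    (1200, 80): [1.0, 0.8, 0.6, 0.4],
    (2000, 200): [1.0, 0.95, 0.9, 0.85],
}
# A scaled, slightly perturbed copy of the second entry.
measured_signal = [2.1, 1.6, 1.3, 0.8]
best_pair, best_score = match_fingerprint(measured_signal, toy_dictionary)
print(best_pair, round(best_score, 4))
```

    Because both signals are normalized before comparison, the match depends on the shape of the signal evolution rather than its overall amplitude.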

  14. Preclinical MR fingerprinting (MRF) at 7 T: effective quantitative imaging for rodent disease models.

    PubMed

    Gao, Ying; Chen, Yong; Ma, Dan; Jiang, Yun; Herrmann, Kelsey A; Vincent, Jason A; Dell, Katherine M; Drumm, Mitchell L; Brady-Kalnay, Susann M; Griswold, Mark A; Flask, Chris A; Lu, Lan

    2015-03-01

    High-field preclinical MRI scanners are now commonly used to quantitatively assess disease status and the efficacy of novel therapies in a wide variety of rodent models. Unfortunately, conventional MRI methods are highly susceptible to respiratory and cardiac motion artifacts resulting in potentially inaccurate and misleading data. We have developed an initial preclinical 7.0-T MRI implementation of the highly novel MR fingerprinting (MRF) methodology which has been described previously for clinical imaging applications. The MRF technology combines a priori variation in the MRI acquisition parameters with dictionary-based matching of acquired signal evolution profiles to simultaneously generate quantitative maps of T1 and T2 relaxation times and proton density. This preclinical MRF acquisition was constructed from a fast imaging with steady-state free precession (FISP) MRI pulse sequence to acquire 600 MRF images with both evolving T1 and T2 weighting in approximately 30 min. This initial high-field preclinical MRF investigation demonstrated reproducible and differentiated estimates of in vitro phantoms with different relaxation times. In vivo preclinical MRF results in mouse kidneys and brain tumor models demonstrated an inherent resistance to respiratory motion artifacts as well as sensitivity to known pathology. These results suggest that MRF methodology may offer the opportunity for the quantification of numerous MRI parameters for a wide variety of preclinical imaging applications. Copyright © 2015 John Wiley & Sons, Ltd.

  15. Automated Identification of Medically Important Bacteria by 16S rRNA Gene Sequencing Using a Novel Comprehensive Database, 16SpathDB

    PubMed Central

    Woo, Patrick C. Y.; Teng, Jade L. L.; Yeung, Juilian M. Y.; Tse, Herman; Lau, Susanna K. P.; Yuen, Kwok-Yung

    2011-01-01

    Despite the increasing use of 16S rRNA gene sequencing, interpretation of 16S rRNA gene sequence results is one of the most difficult problems faced by clinical microbiologists and technicians. To overcome the problems we encountered in the existing databases during 16S rRNA gene sequence interpretation, we built a comprehensive database, 16SpathDB (http://147.8.74.24/16SpathDB) based on the 16S rRNA gene sequences of all medically important bacteria listed in the Manual of Clinical Microbiology and evaluated its use for automated identification of these bacteria. Among 91 nonduplicated bacterial isolates collected in our clinical microbiology laboratory, 71 (78%) were reported by 16SpathDB as a single bacterial species having >98.0% nucleotide identity with the query sequence, 19 (20.9%) were reported as more than one bacterial species having >98.0% nucleotide identity with the query sequence, and 1 (1.1%) was reported as no match. For the 71 bacterial isolates reported as a single bacterial species, all results were identical to their true identities as determined by a polyphasic approach. For the 19 bacterial isolates reported as more than one bacterial species, all results contained their true identities as determined by a polyphasic approach and all of them had their true identities as the “best match in 16SpathDB.” For the isolate (Gordonibacter pamelaeae) reported as no match, the bacterium has never been reported to be associated with human disease and was not included in the Manual of Clinical Microbiology. 16SpathDB is an automated, user-friendly, efficient, accurate, and regularly updated database for 16S rRNA gene sequence interpretation in clinical microbiology laboratories. PMID:21389154

  16. A gradient-boosting approach for filtering de novo mutations in parent-offspring trios.

    PubMed

    Liu, Yongzhuang; Li, Bingshan; Tan, Renjie; Zhu, Xiaolin; Wang, Yadong

    2014-07-01

    Whole-genome and -exome sequencing on parent-offspring trios is a powerful approach to identifying disease-associated genes by detecting de novo mutations in patients. Accurate detection of de novo mutations from sequencing data is a critical step in trio-based genetic studies. Existing bioinformatic approaches usually yield high error rates due to sequencing artifacts and alignment issues, which may either miss true de novo mutations or call too many false ones, making downstream validation and analysis difficult. In particular, current approaches have much worse specificity than sensitivity, and developing effective filters to discriminate genuine from spurious de novo mutations remains an unsolved challenge. In this article, we curated 59 sequence features in whole genome and exome alignment context which are considered to be relevant to discriminating true de novo mutations from artifacts, and then employed a machine-learning approach to classify candidates as true or false de novo mutations. Specifically, we built a classifier, named De Novo Mutation Filter (DNMFilter), using gradient boosting as the classification algorithm. We built the training set using experimentally validated true and false de novo mutations as well as collected false de novo mutations from an in-house large-scale exome-sequencing project. We evaluated DNMFilter's theoretical performance and investigated relative importance of different sequence features on the classification accuracy. Finally, we applied DNMFilter on our in-house whole exome trios and one CEU trio from the 1000 Genomes Project and found that DNMFilter could be coupled with commonly used de novo mutation detection approaches as an effective filtering approach to significantly reduce false discovery rate without sacrificing sensitivity. The software DNMFilter implemented using a combination of Java and R is freely available from the website at http://humangenome.duke.edu/software. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  17. How well do ITS rDNA sequences differentiate species of true morels (Morchella)?

    USDA-ARS's Scientific Manuscript database

    Arguably more mycophiles hunt true morels (Morchella) during their brief fruiting season each spring in the Northern Hemisphere than any other wild edible fungus. Concerns about overharvesting by individual collectors and commercial enterprises make it essential that science-based management practic...

  18. [Usefulness of curved coronal MPR imaging for the diagnosis of cervical radiculopathy].

    PubMed

    Inukai, Chikage; Inukai, Takashi; Matsuo, Naoki; Shimizu, Ikuo; Goto, Hisaharu; Takagi, Teruhide; Takayasu, Masakazu

    2010-03-01

    In surgical treatment of cervical radiculopathy, localization of the responsible lesions by various imaging modalities is essential. Among them, MRI is non-invasive and plays a primary role in the assessment of spinal radicular symptoms. However, demonstration of nerve root compression is sometimes difficult by the conventional methods of MRI, such as T1 weighted (T1W) and T2 weighted (T2W) sagittal or axial images. We have applied a new technique of curved coronal multiplanar reconstruction (MPR) imaging for the diagnosis of cervical radiculopathy. Ten patients (4 male, 6 female) aged between 31 and 79 years, who had a clinical diagnosis of cervical radiculopathy, were included in this study. Seven patients underwent anterior key-hole foraminotomy to decompress the nerve root with successful results. All the patients had 3D MRI studies, such as true fast imaging with steady-state precession (FISP), 3D T2W sampling perfection with application optimized contrasts using different flip angle evolutions (SPACE), and 3D multi-echo data image combination (MEDIC) imaging, in addition to the routine MRI (1.5 T Avanto, Siemens, Germany) with a phased array coil. The curved coronal MPR images were produced from these MRI data using a workstation. The nerve root compression was diagnosed by curved coronal MPR images in all the patients. The compression sites were compatible with those of the operative findings in the 7 patients who underwent surgical treatment. The MEDIC images visualized the nerve root most clearly, followed by the 3D SPACE images. Curved coronal MPR imaging is useful for accurately localizing the compressing lesions in patients with cervical radiculopathy.

  19. Physical layer one-time-pad data encryption through synchronized semiconductor laser networks

    NASA Astrophysics Data System (ADS)

    Argyris, Apostolos; Pikasis, Evangelos; Syvridis, Dimitris

    2016-02-01

    Semiconductor lasers (SL) have been proven to be a key device in the generation of ultrafast true random bit streams. Their potential to emit chaotic signals with desirable statistics under suitable conditions establishes them as a low-cost solution for various needs, from large-volume key generation to real-time encrypted communications. Usually, only undemanding post-processing is needed to convert the acquired analog time series to digital sequences that pass all established tests of randomness. A novel architecture that can generate and exploit these true random sequences is a fiber network in which the nodes are semiconductor lasers that are coupled and synchronized to a central hub laser. In this work we show experimentally that laser nodes in such a star network topology can synchronize with each other through complex broadband signals that are the seed to true random bit sequences (TRBS) generated at several Gb/s. Because each node can access, through the fiber optic network, random bit streams that are generated in real time and synchronized with the rest of the nodes, it is possible to implement a one-time-pad encryption protocol that mixes the synchronized true random bit sequence with real data at Gb/s rates. Forward-error correction methods are used to reduce the errors in the TRBS and the final error rate at the data decoding level. An appropriate selection of the sampling methodology and properties, as well as of the physical properties of the chaotic seed signal through which the network locks into synchronization, allows error-free performance.
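
    The one-time-pad step described in this abstract mixes a shared random bit sequence with the data. In software terms this is a bitwise XOR of the message with a key that is at least as long as the message and is never reused. The sketch below illustrates that step only. It draws the key from Python's `secrets` module as a stand-in for the synchronized laser-generated bit stream, which is an assumption made purely for illustration.

```python
import secrets


def xor_bytes(data, key):
    """XOR each data byte with the corresponding key byte."""
    if len(key) < len(data):
        raise ValueError("a one-time pad key must be at least as long as the data")
    return bytes(d ^ k for d, k in zip(data, key))


message = b"real data at Gb/s rates"
# Stand-in for the random bit sequence shared by both nodes.
shared_key = secrets.token_bytes(len(message))

ciphertext = xor_bytes(message, shared_key)
recovered = xor_bytes(ciphertext, shared_key)
print(recovered == message)
```

    Applying the same key twice restores the message, which is why both nodes need identical, synchronized copies of the random sequence and why residual bit errors in it must be corrected.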

  20. Multigene molecular phylogenetics reveals true morels (Morchella) are especially species-rich in China

    USDA-ARS's Scientific Manuscript database

    The phylogenetic diversity of true morels (Morchella) in China was estimated by initially analyzing nuclear ribosomal internal transcribed spacer (ITS) rDNA sequences from 361 specimens collected in 21 provinces during the 2003-2011 growing seasons, together with six collections obtained on loan fro...

  1. A multigene molecular phylogenetic assessment of true morels (Morchella) in Turkey

    USDA-ARS's Scientific Manuscript database

    A collection of 247 true morels (Morchella spp.) primarily from the Mediterranean and Aegean Regions of Southern Turkey, were analyzed for species diversity using partial RNA polymerase I (RPB1) and nuclear ribosomal large subunit (LSU) rDNA gene sequences. Based on the result of this initial scree...

  2. Choice of Reference Sequence and Assembler for Alignment of Listeria monocytogenes Short-Read Sequence Data Greatly Influences Rates of Error in SNP Analyses

    PubMed Central

    Pightling, Arthur W.; Petronella, Nicholas; Pagotto, Franco

    2014-01-01

    The wide availability of whole-genome sequencing (WGS) and an abundance of open-source software have made detection of single-nucleotide polymorphisms (SNPs) in bacterial genomes an increasingly accessible and effective tool for comparative analyses. Thus, ensuring that real nucleotide differences between genomes (i.e., true SNPs) are detected at high rates and that the influences of errors (such as false positive SNPs, ambiguously called sites, and gaps) are mitigated is of utmost importance. The choices researchers make regarding the generation and analysis of WGS data can greatly influence the accuracy of short-read sequence alignments and, therefore, the efficacy of such experiments. We studied the effects of some of these choices, including: i) depth of sequencing coverage, ii) choice of reference-guided short-read sequence assembler, iii) choice of reference genome, and iv) whether to perform read-quality filtering and trimming, on our ability to detect true SNPs and on the frequencies of errors. We performed benchmarking experiments, during which we assembled simulated and real Listeria monocytogenes strain 08-5578 short-read sequence datasets of varying quality with four commonly used assemblers (BWA, MOSAIK, Novoalign, and SMALT), using reference genomes of varying genetic distances, and with or without read pre-processing (i.e., quality filtering and trimming). We found that assemblies of at least 50-fold coverage provided the most accurate results. In addition, MOSAIK yielded the fewest errors when reads were aligned to a nearly identical reference genome, while using SMALT to align reads against a reference sequence that is ∼0.82% distant from 08-5578 at the nucleotide level resulted in the detection of the greatest numbers of true SNPs and the fewest errors. Finally, we show that whether read pre-processing improves SNP detection depends upon the choice of reference sequence and assembler. In total, this study demonstrates that researchers should test a variety of conditions to achieve optimal results. PMID:25144537

  3. Key Ecological Roles for Zoosporic True Fungi in Aquatic Habitats.

    PubMed

    Gleason, Frank H; Scholz, Bettina; Jephcott, Thomas G; van Ogtrop, Floris F; Henderson, Linda; Lilje, Osu; Kittelmann, Sandra; Macarthur, Deborah J

    2017-03-01

    The diversity and abundance of zoosporic true fungi have been analyzed recently using fungal sequence libraries and advances in molecular methods, such as high-throughput sequencing. This review focuses on four evolutionarily primitive true fungal phyla: the Aphelidea, Chytridiomycota, Neocallimastigomycota, and Rosellida (Cryptomycota), most species of which are not polycentric or mycelial (filamentous); rather, they tend to be primarily monocentric (unicellular). Zoosporic fungi appear to be both abundant and diverse in many aquatic habitats around the world, with abundance often exceeding other fungal phyla in these habitats, and numerous novel genetic sequences identified. Zoosporic fungi are able to survive extreme conditions, such as high and extremely low pH; however, more work remains to be done. They appear to have important ecological roles as saprobes in decomposition of particulate organic substrates, pollen, plant litter, and dead animals; as parasites of zooplankton and algae; as parasites of vertebrate animals (such as frogs); and as symbionts in the digestive tracts of mammals. Some chytrids cause economically important diseases of plants and animals. They regulate sizes of phytoplankton populations. Further metagenomics surveys of aquatic ecosystems are expected to enlarge our knowledge of the diversity of true zoosporic fungi. Coupled with studies on their functional ecology, we are moving closer to unraveling the role of zoosporic fungi in carbon cycling and the impact of climate change on zoosporic fungal populations.

  4. A preliminary assessment of the true morels (Morchella) in Newfoundland and Labrador

    USDA-ARS's Scientific Manuscript database

    A preliminary assessment of true morels (Morchella) from Newfoundland and Labrador (NL) was obtained by using DNA sequence data from portions of three genes to identify 20 collections from Newfoundland and one from a remote location in Labrador. To place this work in a broader context, data on 25 co...

  5. A template-finding algorithm and a comprehensive benchmark for homology modeling of proteins

    PubMed Central

    Vallat, Brinda Kizhakke; Pillardy, Jaroslaw; Elber, Ron

    2010-01-01

    The first step in homology modeling is to identify a template protein for the target sequence. The template structure is used in later phases of the calculation to construct an atomically detailed model for the target. We have built from the Protein Data Bank a large-scale learning set that includes tens of millions of pair matches that can be either a true template or a false one. Discriminatory learning (learning from positive and negative examples) is employed to train a decision tree. Each branch of the tree is a mathematical programming model. The decision tree is tested on an independent set from PDB entries and on the sequences of CASP7. It provides significant enrichment of true templates (between 50-100 percent) when compared to PSI-BLAST. The model is further verified by building atomically detailed structures for each of the tentative true templates with modeller. The probability that a true match does not yield an acceptable structural model (within 6Å RMSD from the native structure), decays linearly as a function of the TM structural-alignment score. PMID:18300226

  6. Finding a (pine) needle in a haystack: chloroplast genome sequence divergence in rare and widespread pines

    Treesearch

    J.B. Whittall; J. Syring; M. Parks; J. Buenrostro; C. Dick; A. Liston; R. Cronn

    2010-01-01

    Critical to conservation efforts and other investigations at low taxonomic levels, DNA sequence data offer important insights into the distinctiveness, biogeographic partitioning, and evolutionary histories of species. The resolving power of DNA sequences is often limited by insufficient variability at the intraspecific level. This is particularly true of studies...

  7. Identification of true EST alignments for recognising transcribed regions.

    PubMed

    Ma, Chuang; Wang, Jia; Li, Lun; Duan, Mo-Jie; Zhou, Yan-Hong

    2011-01-01

    Transcribed regions can be determined by aligning Expressed Sequence Tags (ESTs) with genome sequences. The kernel of this strategy is to effectively distinguish true EST alignments from spurious ones. In this study, three measures including Direction Check, Identity Check and Terminal Check were introduced to more effectively eliminate spurious EST alignments. On the basis of these introduced measures and other widely used measures, a computational tool, named ESTCleanser, has been developed to identify true EST alignments for obtaining reliable transcribed regions. The performance of ESTCleanser has been evaluated on the well-annotated human ENCyclopedia of DNA Elements (ENCODE) regions using human ESTs in the dbEST database. The evaluation results show that the accuracy of ESTCleanser at exon and intron levels is more remarkably enhanced than that of UCSC-spliced EST alignments. This work would be helpful to EST-based research on finding new genes, complementing genome annotation, and recognising alternative splicing events and Single Nucleotide Polymorphisms (SNPs).

  8. Twisted trees and inconsistency of tree estimation when gaps are treated as missing data - The impact of model mis-specification in distance corrections.

    PubMed

    McTavish, Emily Jane; Steel, Mike; Holder, Mark T

    2015-12-01

    Statistically consistent estimation of phylogenetic trees or gene trees is possible if pairwise sequence dissimilarities can be converted to a set of distances that are proportional to the true evolutionary distances. Susko et al. (2004) reported some strikingly broad results about the forms of inconsistency in tree estimation that can arise if corrected distances are not proportional to the true distances. They showed that if the corrected distance is a concave function of the true distance, then inconsistency due to long branch attraction will occur. If these functions are convex, then two "long branch repulsion" trees will be preferred over the true tree - though these two incorrect trees are expected to be tied as the preferred tree. Here we extend their results, and demonstrate the existence of a tree shape (which we refer to as a "twisted Farris-zone" tree) for which a single incorrect tree topology will be guaranteed to be preferred if the corrected distance function is convex. We also report that the standard practice of treating gaps in sequence alignments as missing data is sufficient to produce non-linear corrected distance functions if the substitution process is not independent of the insertion/deletion process. Taken together, these results imply inconsistent tree inference under mild conditions. For example, if some positions in a sequence are constrained to be free of substitutions and insertion/deletion events while the remaining sites evolve with independent substitutions and insertion/deletion events, then the distances obtained by treating gaps as missing data can support an incorrect tree topology even given an unlimited amount of data. Copyright © 2015 Elsevier Inc. All rights reserved.
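
    To make the idea of a distance correction concrete, the sketch below uses the Jukes-Cantor model, which is not named in the abstract and is chosen here only as a familiar example. Under that model the expected proportion of differing sites p saturates as the true distance d grows, and the correction d = -(3/4) ln(1 - 4p/3) inverts that relationship. When the data really follow the assumed model, the corrected distance equals the true distance. The inconsistency results described above concern cases where the assumed model is wrong and the correction is no longer linear in the true distance.

```python
import math


def expected_p_distance(true_distance):
    """Expected proportion of differing sites under Jukes-Cantor."""
    return 0.75 * (1.0 - math.exp(-4.0 * true_distance / 3.0))


def jukes_cantor_distance(p_distance):
    """Convert an observed proportion of differences to a corrected distance."""
    if not 0.0 <= p_distance < 0.75:
        raise ValueError("p_distance must lie in [0, 0.75) for this correction")
    return -0.75 * math.log(1.0 - 4.0 * p_distance / 3.0)


# The raw dissimilarity understates long distances; the correction
# recovers them when the model assumptions hold.
for d in (0.1, 0.5, 1.0):
    p = expected_p_distance(d)
    print(d, round(p, 4), round(jukes_cantor_distance(p), 4))
```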

  9. CORNAS: coverage-dependent RNA-Seq analysis of gene expression data without biological replicates.

    PubMed

    Low, Joel Z B; Khang, Tsung Fei; Tammi, Martti T

    2017-12-28

    In current statistical methods for calling differentially expressed genes in RNA-Seq experiments, the assumption is that an adjusted observed gene count represents an unknown true gene count. This adjustment usually consists of a normalization step to account for heterogeneous sample library sizes, and then the resulting normalized gene counts are used as input for parametric or non-parametric differential gene expression tests. A distribution of true gene counts, each with a different probability, can result in the same observed gene count. Importantly, sequencing coverage information is currently not explicitly incorporated into any of the statistical models used for RNA-Seq analysis. We developed a fast Bayesian method which uses the sequencing coverage information determined from the concentration of an RNA sample to estimate the posterior distribution of a true gene count. Our method has better or comparable performance compared to NOISeq and GFOLD, according to the results from simulations and experiments with real unreplicated data. We incorporated a previously unused sequencing coverage parameter into a procedure for differential gene expression analysis with RNA-Seq data. Our results suggest that our method can be used to overcome analytical bottlenecks in experiments with limited number of replicates and low sequencing coverage. The method is implemented in CORNAS (Coverage-dependent RNA-Seq), and is available at https://github.com/joel-lzb/CORNAS .

  10. Harnessing the sorghum genome sequence: development of a genome-wide microsatellite (SSR) resource for swift genetic mapping and map based cloning in sorghum

    USDA-ARS's Scientific Manuscript database

    Sorghum is the second cereal crop to have a full genome completely sequenced (Nature (2009), 457:551). This achievement is widely recognized as a scientific milestone for grass genetics and genomics in general. However, the true worth of genetic information lies in translating the sequence informa...

  11. Fine mapping and identification of a candidate gene for the barley Un8 true loose smut resistance gene.

    PubMed

    Zang, Wen; Eckstein, Peter E; Colin, Mark; Voth, Doug; Himmelbach, Axel; Beier, Sebastian; Stein, Nils; Scoles, Graham J; Beattie, Aaron D

    2015-07-01

    The candidate gene for the barley Un8 true loose smut resistance gene encodes a deduced protein containing two tandem protein kinase domains. In North America, durable resistance against all known isolates of barley true loose smut, caused by the basidiomycete pathogen Ustilago nuda (Jens.) Rostr. (U. nuda), is under the control of the Un8 resistance gene. Previous genetic studies mapped Un8 to the long arm of chromosome 5 (1HL). Here, a population of 4625 lines segregating for Un8 was used to delimit the Un8 gene to a 0.108 cM interval on chromosome arm 1HL, and assign it to fingerprinted contig 546 of the barley physical map. The minimal tiling path was identified for the Un8 locus using two flanking markers and consisted of two overlapping bacterial artificial chromosomes. One gene located close to a marker co-segregating with Un8 showed high sequence identity to a disease resistance gene containing two kinase domains. Sequence of the candidate gene from the parents of the segregating population, and in an additional 19 barley lines representing a broader spectrum of diversity, showed there was no intron in alleles present in either resistant or susceptible lines, and fifteen amino acid variations unique to the deduced protein sequence in resistant lines differentiated it from the deduced protein sequences in susceptible lines. Some of these variations were present within putative functional domains which may cause a loss of function in the deduced protein sequences within susceptible lines.

  12. Efficient error correction for next-generation sequencing of viral amplicons

    PubMed Central

    2012-01-01

    Background Next-generation sequencing allows the analysis of an unprecedented number of viral sequence variants from infected patients, presenting a novel opportunity for understanding virus evolution, drug resistance and immune escape. However, sequencing in bulk is error prone. Thus, the generated data require error identification and correction. Most error-correction methods to date are not optimized for amplicon analysis and assume that the error rate is randomly distributed. Recent quality assessment of amplicon sequences obtained using 454-sequencing showed that the error rate is strongly linked to the presence and size of homopolymers, position in the sequence and length of the amplicon. All these parameters are strongly sequence specific and should be incorporated into the calibration of error-correction algorithms designed for amplicon sequencing. Results In this paper, we present two new efficient error correction algorithms optimized for viral amplicons: (i) k-mer-based error correction (KEC) and (ii) empirical frequency threshold (ET). Both were compared to a previously published clustering algorithm (SHORAH), in order to evaluate their relative performance on 24 experimental datasets obtained by 454-sequencing of amplicons with known sequences. All three algorithms show similar accuracy in finding true haplotypes. However, KEC and ET were significantly more efficient than SHORAH in removing false haplotypes and estimating the frequency of true ones. Conclusions Both algorithms, KEC and ET, are highly suitable for rapid recovery of error-free haplotypes obtained by 454-sequencing of amplicons from heterogeneous viruses. The implementations of the algorithms and data sets used for their testing are available at: http://alan.cs.gsu.edu/NGS/?q=content/pyrosequencing-error-correction-algorithm PMID:22759430

  13. Efficient error correction for next-generation sequencing of viral amplicons.

    PubMed

    Skums, Pavel; Dimitrova, Zoya; Campo, David S; Vaughan, Gilberto; Rossi, Livia; Forbi, Joseph C; Yokosawa, Jonny; Zelikovsky, Alex; Khudyakov, Yury

    2012-06-25

    Next-generation sequencing allows the analysis of an unprecedented number of viral sequence variants from infected patients, presenting a novel opportunity for understanding virus evolution, drug resistance and immune escape. However, sequencing in bulk is error prone. Thus, the generated data require error identification and correction. Most error-correction methods to date are not optimized for amplicon analysis and assume that the error rate is randomly distributed. Recent quality assessment of amplicon sequences obtained using 454-sequencing showed that the error rate is strongly linked to the presence and size of homopolymers, position in the sequence and length of the amplicon. All these parameters are strongly sequence specific and should be incorporated into the calibration of error-correction algorithms designed for amplicon sequencing. In this paper, we present two new efficient error correction algorithms optimized for viral amplicons: (i) k-mer-based error correction (KEC) and (ii) empirical frequency threshold (ET). Both were compared to a previously published clustering algorithm (SHORAH), in order to evaluate their relative performance on 24 experimental datasets obtained by 454-sequencing of amplicons with known sequences. All three algorithms show similar accuracy in finding true haplotypes. However, KEC and ET were significantly more efficient than SHORAH in removing false haplotypes and estimating the frequency of true ones. Both algorithms, KEC and ET, are highly suitable for rapid recovery of error-free haplotypes obtained by 454-sequencing of amplicons from heterogeneous viruses. The implementations of the algorithms and data sets used for their testing are available at: http://alan.cs.gsu.edu/NGS/?q=content/pyrosequencing-error-correction-algorithm.

  14. BayesMotif: de novo protein sorting motif discovery from impure datasets.

    PubMed

    Hu, Jianjun; Zhang, Fan

    2010-01-18

    Protein sorting is the process by which newly synthesized proteins are transported to their target locations within or outside of the cell. This process is precisely regulated by protein sorting signals in different forms. A major category of sorting signals are amino acid sub-sequences usually located at the N-terminals or C-terminals of protein sequences. Genome-wide experimental identification of protein sorting signals is extremely time-consuming and costly. Effective computational algorithms for de novo discovery of protein sorting signals are needed to improve the understanding of protein sorting mechanisms. We formulated the protein sorting motif discovery problem as a classification problem and proposed a Bayesian classifier based algorithm (BayesMotif) for de novo identification of a common type of protein sorting motifs in which a highly conserved anchor is present along with less conserved motif regions. A false positive removal procedure is developed to iteratively remove sequences that are unlikely to contain true motifs so that the algorithm can identify motifs from impure input sequences. Experiments on both implanted motif datasets and real-world datasets showed that the enhanced BayesMotif algorithm can identify anchored sorting motifs from pure or impure protein sequence datasets. They also show that the false positive removal procedure can help to identify true motifs even when only 20% of the input sequences contain true motif instances. We proposed BayesMotif, a novel Bayesian classification based algorithm for de novo discovery of a special category of anchored protein sorting motifs from impure datasets. Compared to conventional motif discovery algorithms such as MEME, our algorithm can find less-conserved motifs with short highly conserved anchors. Our algorithm also has the advantage of easy incorporation of additional meta-sequence features such as hydrophobicity or charge of the motifs, which may help to overcome the limitations of the PWM (position weight matrix) motif model.

  15. Two Sources of Evidence on the Non-Automaticity of True and False Belief Ascription

    ERIC Educational Resources Information Center

    Back, Elisa; Apperly, Ian A.

    2010-01-01

    A recent study by Apperly et al. (2006) found evidence that adults do not automatically infer false beliefs while watching videos that afford such inferences. This method was extended to examine true beliefs, which are sometimes thought to be ascribed by "default" (e.g., Leslie & Thaiss, 1992). Sequences of pictures were presented in which the…

  16. Students Use of the PSOE Model to Understand Weather and Climate

    ERIC Educational Resources Information Center

    Brown, Patrick L.; Concannon, James

    2016-01-01

    One tried-and-true way to hook students' attention and promote long-lasting understanding is to sequence science instruction in an explore-before-explain instructional sequence. In these lessons for the second through sixth grade band, elementary students investigate the interaction between "cold" and "hot" substances and…

  17. 40 CFR 92.124 - Test sequence; general requirements.

    Code of Federal Regulations, 2014 CFR

    2014-07-01

    .... (e) Pre-test engine measurements (e.g., idle and throttle notch speeds, fuel flows, etc.), pre-test engine performance checks (e.g., verification of engine power, etc.) and pre-test system calibrations (e... 40 Protection of Environment 20 2014-07-01 2013-07-01 true Test sequence; general requirements. 92...

  18. The complete genome sequence of a second distinct betabaculovirus from the true armyworm, Mythimna unipuncta

    USDA-ARS's Scientific Manuscript database

    The betabaculovirus Pseudaletia (Mythimna) sp. granulovirus #8 (MyspGV#8) was examined by electron microscopy, host barcoding PCR, and determination of the nucleotide sequence of its genome. Scanning and transmission electron microscopy revealed that the occlusion bodies of MyspGV#8 possessed the c...

  19. Discovery of a monophagous true predator, a specialist termite-eating spider (Araneae: Ammoxenidae)

    PubMed Central

    Petráková, Lenka; Líznarová, Eva; Pekár, Stano; Haddad, Charles R.; Sentenská, Lenka; Symondson, William O. C.

    2015-01-01

    True predators are characterised by capturing a number of prey items during their lifetime and by being generalists. Some true predators are facultative specialists, but very few species are stenophagous specialists that catch only a few closely related prey types. A monophagous true predator that would exploit a single prey species has not been discovered yet. Representatives of the spider family Ammoxenidae have been reported to have evolved to only catch termites. Here we tested the hypothesis that Ammoxenus amphalodes is a monophagous termite-eater capturing only Hodotermes mossambicus. We studied the trophic niche of A. amphalodes by means of molecular analysis of the gut contents using Next Generation Sequencing. We investigated their willingness to accept alternative prey and observed their specific predatory behaviour and prey capture efficiency. We found all of the 1.4 million sequences were H. mossambicus. In the laboratory A. amphalodes did not accept any other prey, including other termite species. The spiders attacked the lateral side of the thorax of termites and immobilised them within 1 min. The paralysis efficiency was independent of predator:prey size ratio. The results strongly indicate that A. amphalodes is a monophagous prey specialist, specifically adapted to feed on H. mossambicus. PMID:26359085

  20. High-Resolution Melting Analysis for Rapid Detection of Sequence Type 131 Escherichia coli.

    PubMed

    Harrison, Lucas B; Hanson, Nancy D

    2017-06-01

    Escherichia coli isolates belonging to the sequence type 131 (ST131) clonal complex have been associated with the global distribution of fluoroquinolone and β-lactam resistance. Whole-genome sequencing and multilocus sequence typing identify sequence type but are expensive when evaluating large numbers of samples. This study was designed to develop a cost-effective screening tool using high-resolution melting (HRM) analysis to differentiate ST131 from non-ST131 E. coli in large sample populations in the absence of sequence analysis. The method was optimized using DNA from 12 E. coli isolates. Singleplex PCR was performed using 10 ng of DNA, Type-it HRM buffer, and multilocus sequence typing primers and was followed by multiplex PCR. The amplicon sizes ranged from 630 to 737 bp. Melt temperature peaks were determined by performing HRM analysis at 0.1°C resolution from 50 to 95°C on a Rotor-Gene Q 5-plex HRM system. Derivative melt curves were compared between sequence types and analyzed by principal component analysis. A blinded study of 191 E. coli isolates of ST131 and unknown sequence types validated this methodology. This methodology returned 99.2% specificity (124 true negatives and 1 false positive) and 100% sensitivity (66 true positives and 0 false negatives). This HRM methodology distinguishes ST131 from non-ST131 E. coli without sequence analysis. The analysis can be accomplished in about 3 h in any laboratory with an HRM-capable instrument and principal component analysis software. Therefore, this assay is a fast and cost-effective alternative to sequencing-based ST131 identification. Copyright © 2017 Harrison and Hanson.
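
    The sensitivity and specificity quoted in this abstract follow directly from the four reported counts. As a minimal sketch of that arithmetic (the function and variable names are illustrative and do not come from the paper):

```python
def sensitivity(true_pos, false_neg):
    """Fraction of actual positives that the assay reports as positive."""
    return true_pos / (true_pos + false_neg)


def specificity(true_neg, false_pos):
    """Fraction of actual negatives that the assay reports as negative."""
    return true_neg / (true_neg + false_pos)


# Counts reported in the abstract for the blinded study.
tp, fn, tn, fp = 66, 0, 124, 1

print(f"isolates:    {tp + fn + tn + fp}")             # 191
print(f"sensitivity: {sensitivity(tp, fn):.1%}")       # 100.0%
print(f"specificity: {specificity(tn, fp):.1%}")       # 99.2%
```

    The four counts sum to the 191 isolates of the blinded study, and 124/125 gives the reported 99.2% specificity.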

  1. VarBin, a novel method for classifying true and false positive variants in NGS data

    PubMed Central

    2013-01-01

    Background Variant discovery for rare genetic diseases using Illumina genome or exome sequencing involves screening of up to millions of variants to find only the one or few causative variant(s). Sequencing or alignment errors create "false positive" variants, which are often retained in the variant screening process. Methods to remove false positive variants often retain many false positive variants. This report presents VarBin, a method to prioritize variants based on a false positive variant likelihood prediction. Methods VarBin uses the Genome Analysis Toolkit variant calling software to calculate the variant-to-wild type genotype likelihood ratio at each variant change and position divided by read depth. The resulting Phred-scaled, likelihood-ratio by depth (PLRD) was used to segregate variants into 4 Bins with Bin 1 variants most likely true and Bin 4 most likely false positive. PLRD values were calculated for a proband of interest and 41 additional Illumina HiSeq, exome and whole genome samples (proband's family or unrelated samples). At variant sites without apparent sequencing or alignment error, wild type/non-variant calls cluster near -3 PLRD and variant calls typically cluster above 10 PLRD. Sites with systematic variant calling problems (evident by variant quality scores and biases as well as displayed on the iGV viewer) tend to have higher and more variable wild type/non-variant PLRD values. Depending on the separation of a proband's variant PLRD value from the cluster of wild type/non-variant PLRD values for background samples at the same variant change and position, the VarBin method's classification is assigned to each proband variant (Bin 1 to Bin 4). Results To assess VarBin performance, Sanger sequencing was performed on 98 variants in the proband and background samples. True variants were confirmed in 97% of Bin 1 variants, 30% of Bin 2, and 0% of Bin 3/Bin 4. 
    Conclusions These data indicate that VarBin correctly classifies the majority of true variants as Bin 1 and Bin 3/4 contained only false positive variants. The "uncertain" Bin 2 contained both true and false positive variants. Future work will further differentiate the variants in Bin 2. PMID:24266885

  2. Genotyping-by-sequencing for estimating relatedness in nonmodel organisms: Avoiding the trap of precise bias.

    PubMed

    Attard, Catherine R M; Beheregaray, Luciano B; Möller, Luciana M

    2018-05-01

    There has been remarkably little attention to using the high resolution provided by genotyping-by-sequencing (i.e., RADseq and similar methods) for assessing relatedness in wildlife populations. A major hurdle is the genotyping error, especially allelic dropout, often found in this type of data that could lead to downward-biased, yet precise, estimates of relatedness. Here, we assess the applicability of genotyping-by-sequencing for relatedness inferences given its relatively high genotyping error rate. Individuals of known relatedness were simulated under genotyping error, allelic dropout and missing data scenarios based on an empirical ddRAD data set, and their true relatedness was compared to that estimated by seven relatedness estimators. We found that an estimator chosen through such analyses can circumvent the influence of genotyping error, with the estimator of Ritland (Genetics Research, 67, 175) shown to be unaffected by allelic dropout and to be the most accurate when there is genotyping error. We also found that the choice of estimator should not rely solely on the strength of correlation between estimated and true relatedness as a strong correlation does not necessarily mean estimates are close to true relatedness. We also demonstrated how even a large SNP data set with genotyping error (allelic dropout or otherwise) or missing data still performs better than a perfectly genotyped microsatellite data set of tens of markers. The simulation-based approach used here can be easily implemented by others on their own genotyping-by-sequencing data sets to confirm the most appropriate and powerful estimator for their data. © 2017 John Wiley & Sons Ltd.

  3. On the Existence of Simultaneous Edge Disjoint Realizations of Degree Sequences with ’Few’ Edges

    DTIC Science & Technology

    1975-08-01

    constructing graphs and digraphs with given valences and factors. Discrete Math. 6 (1973) 79-88. 3. M. Keren, Realization of a sum of sequences by a sum...appear. 5. S. Kundu, The k factor conjecture is true. Discrete Math. 6 (1973) 367-376. 6. S. Kundu, Disjoint representation of tree realizable

  4. Flow of wormlike micellar solutions around confined microfluidic cylinders.

    PubMed

    Zhao, Ya; Shen, Amy Q; Haward, Simon J

    2016-10-26

    Wormlike micellar (WLM) solutions are frequently used in enhanced oil and gas recovery applications in porous rock beds where complex microscopic geometries result in mixed flow kinematics with strong shear and extensional components. Experiments with WLM solutions through model microfluidic porous media have revealed a variety of complex flow phenomena, including the formation of stable gel-like structures known as a Flow-Induced Structured Phase (FISP), which undoubtedly play an important role in applications of WLM fluids, but are still poorly understood. A first step in understanding flows of WLM fluids through porous media can be made by examining the flow around a single micro-scale cylinder aligned on the flow axis. Here we study flow behavior of an aqueous WLM solution consisting of cationic surfactant cetyltrimethylammonium bromide (CTAB) and a stable hydrotropic salt 3-hydroxy naphthalene-2-carboxylate (SHNC) in microfluidic devices with three different cylinder blockage ratios, β. We observe a rich sequence of flow instabilities depending on β as the Weissenberg number (Wi) is increased to large values while the Reynolds number (Re) remains low. Instabilities upstream of the cylinder are associated with high stresses in fluid that accelerates into the narrow gap between the cylinder and the channel wall; vortex growth upstream is reminiscent of that seen in microfluidic contraction geometries. Instability downstream of the cylinder is associated with stresses generated at the trailing stagnation point and the resulting flow modification in the wake, coupled with the onset of time-dependent flow upstream and the asymmetric division of flow around the cylinder.

  5. The Role of Stress Proteins in Cell Stabilization: A Perspective from an Extremophile

    NASA Technical Reports Server (NTRS)

    Trent, Jonathan

    2001-01-01

    The existence of organisms that live at near boiling temperatures is living proof that all of the complex biochemical machinery of life can be adapted to function under these harsh conditions. The purpose of our research is to elucidate the role of a group of proteins known as heat shock proteins or HSP60s in this adaptation to high temperatures. HSP60s are found in all organisms and they are among the most highly conserved proteins known. We are investigating HSP60s in an organism growing at 80 C and pH 2.0 (Sulfolobus shibatae). This organism produces three closely-related HSP60 proteins, referred to as HSP60 alpha, beta, and gamma. Our DOE-funded research during the last two years has focused on clarifying the role of HSP60 alpha and beta. These are among the two most abundant proteins in S. shibatae grown at high temperatures and significantly increase in abundance when the cells are exposed to near-lethal temperatures. We have demonstrated that these proteins protect the cells from lethal temperatures by stabilizing their membranes. During this last year we have been studying gamma, which was discovered by genome sequence analysis but about whose function nothing was known. We have determined that gamma is only expressed at low temperatures, that it interacts with alpha and beta, and that it influences their ability to form higher-order structures critical to their function. We propose that gamma modulates HSP60 function at low temperatures.

  6. The complete genome sequence of a third distinct baculovirus isolated from the true armyworm, Mythimna unipuncta, contains two copies of the lef-7 gene

    USDA-ARS's Scientific Manuscript database

    A baculovirus isolate from a USDA Forest Service collection was examined by electron microscopy and analysis of its genome sequence. The isolate, formerly referred to as Pseudoletia (Mythimna) sp. nucleopolyhedrovirus #7 (MyspNPV#7), was determined by barcoding PCR to derive from the host species My...

  7. QQ-SNV: single nucleotide variant detection at low frequency by comparing the quality quantiles.

    PubMed

    Van der Borght, Koen; Thys, Kim; Wetzels, Yves; Clement, Lieven; Verbist, Bie; Reumers, Joke; van Vlijmen, Herman; Aerssens, Jeroen

    2015-11-10

    Next generation sequencing enables studying heterogeneous populations of viral infections. When the sequencing is done at high coverage depth ("deep sequencing"), low frequency variants can be detected. Here we present QQ-SNV (http://sourceforge.net/projects/qqsnv), a logistic regression classifier model developed for the Illumina sequencing platforms that uses the quantiles of the quality scores, to distinguish true single nucleotide variants from sequencing errors based on the estimated SNV probability. To train the model, we created a dataset of an in silico mixture of five HIV-1 plasmids. Testing of our method in comparison to the existing methods LoFreq, ShoRAH, and V-Phaser 2 was performed on two HIV and four HCV plasmid mixture datasets and one influenza H1N1 clinical dataset. For default application of QQ-SNV, variants were called using a SNV probability cutoff of 0.5 (QQ-SNV(D)). To improve the sensitivity we used a SNV probability cutoff of 0.0001 (QQ-SNV(HS)). To also increase specificity, SNVs called were overruled when their frequency was below the 80th percentile calculated on the distribution of error frequencies (QQ-SNV(HS-P80)). When comparing QQ-SNV versus the other methods on the plasmid mixture test sets, QQ-SNV(D) performed similarly to the existing approaches. QQ-SNV(HS) was more sensitive on all test sets but with more false positives. QQ-SNV(HS-P80) was found to be the most accurate method over all test sets by balancing sensitivity and specificity. When applied to a paired-end HCV sequencing study, with lowest spiked-in true frequency of 0.5%, QQ-SNV(HS-P80) revealed a sensitivity of 100% (vs. 40-60% for the existing methods) and a specificity of 100% (vs. 98.0-99.7% for the existing methods). In addition, QQ-SNV required the least overall computation time to process the test sets. Finally, when testing on a clinical sample, four putative true variants with frequency below 0.5% were consistently detected by QQ-SNV(HS-P80) from different generations of Illumina sequencers. We developed and successfully evaluated a novel method, called QQ-SNV, for highly efficient single nucleotide variant calling on Illumina deep sequencing virology data.
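
    The three calling modes described in this abstract can be read as a simple decision rule applied to the classifier's output. The sketch below is only an illustration of that reading and is not the QQ-SNV implementation: the function names, the use of "greater than or equal" at the cutoffs, and the nearest-rank percentile are all assumptions, and the SNV probability is taken as a given input instead of being computed by the logistic regression model.

```python
import math


def nearest_rank_percentile(values, pct):
    """Nearest-rank percentile; an assumed stand-in for the paper's percentile."""
    if not values:
        raise ValueError("values must not be empty")
    ordered = sorted(values)
    rank = max(1, math.ceil(pct * len(ordered) / 100))
    return ordered[rank - 1]


def call_snv(snv_probability, frequency, error_frequencies, mode="D"):
    """Return True if a candidate SNV is called under the given mode.

    mode "D":      probability cutoff 0.5
    mode "HS":     probability cutoff 0.0001
    mode "HS-P80": HS cutoff, then overruled if the variant frequency is
                   below the 80th percentile of the error frequencies
    """
    cutoffs = {"D": 0.5, "HS": 0.0001, "HS-P80": 0.0001}
    if mode not in cutoffs:
        raise ValueError(f"unknown mode: {mode}")
    called = snv_probability >= cutoffs[mode]  # ">=" is an assumption
    if called and mode == "HS-P80":
        called = frequency >= nearest_rank_percentile(error_frequencies, 80)
    return called


# Hypothetical error-frequency distribution, for illustration only.
errors = [0.001, 0.002, 0.003, 0.004, 0.005, 0.006, 0.007, 0.008, 0.009, 0.010]
print(call_snv(0.3, 0.005, errors, mode="D"))
print(call_snv(0.3, 0.005, errors, mode="HS"))
print(call_snv(0.3, 0.005, errors, mode="HS-P80"))
```

    With these made-up numbers, the candidate is rejected by the default cutoff, accepted by the high-sensitivity cutoff, and overruled again once the percentile filter is applied, which mirrors the sensitivity and specificity trade-off the abstract describes.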

  8. Comparison. US P-61 and Delft sediment samplers

    USGS Publications Warehouse

    Beverage, Joseph P.; Williams, David T.

    1990-01-01

    The Delft Bottle (DB) is a flow-through device designed by the Delft Hydraulic Laboratory (DHL), The Netherlands, to sample sand-sized sediment suspended in streams. The US P-61 sampler was designed by the Federal Interagency Sedimentation Project (FISP) at the St. Anthony Falls Hydraulic Laboratory, Minneapolis, Minnesota, to collect suspended sediment from deep, swift rivers. The results of two point-sampling tests in the United States, the Mississippi River near Vicksburg, Mississippi, in 1983 and the Colorado River near Blythe, California, in 1984, are provided in this report. These studies compare sand-transport rates, rather than total sediment-transport rates, because fine material washes through the DB sampler. In the United States, the commonly used limits for sand-sized material are 0.062 mm to 2.00 mm (Vanoni 1975).

  9. BAC sequencing using pooled methods.

    PubMed

    Saski, Christopher A; Feltus, F Alex; Parida, Laxmi; Haiminen, Niina

    2015-01-01

    Shotgun sequencing and assembly of a large, complex genome can be expensive, and accurately reconstructing the true genome sequence is challenging. Repetitive DNA arrays, paralogous sequences, polyploidy, and heterozygosity are the main factors that plague de novo genome sequencing projects, which typically yield highly fragmented assemblies from which it is difficult to extract biological meaning. Targeted, sub-genomic sequencing offers complexity reduction by removing distal segments of the genome and a systematic mechanism for exploring prioritized genomic content through BAC sequencing. If one isolates and sequences the genome fraction that encodes the relevant biological information, then it is possible to reduce overall sequencing costs and efforts that target a genomic segment. This chapter describes the sub-genome assembly protocol for an organism based upon a BAC tiling path derived from a genome-scale physical map or from fine mapping using BACs to target sub-genomic regions. Methods that are described include BAC isolation and mapping, DNA sequencing, and sequence assembly.

  10. Recall Latencies, Confidence, and Output Positions of True and False Memories: Implications for Recall and Metamemory Theories

    ERIC Educational Resources Information Center

    Jou, Jerwen

    2008-01-01

    Recall latency, recall accuracy rate, and recall confidence were examined in free recall as a function of recall output serial position using a modified Deese-Roediger-McDermott paradigm to test a strength-based theory against the dual-retrieval process theory of recall output sequence. The strength theory predicts the item output sequence to be…

  11. Inferring Phylogenetic Relationships of Indian Citron (Citrus medica L.) based on rbcL and matK Sequences of Chloroplast DNA.

    PubMed

    Uchoi, Ajit; Malik, Surendra Kumar; Choudhary, Ravish; Kumar, Susheel; Rohini, M R; Pal, Digvender; Ercisli, Sezai; Chaudhury, Rekha

    2016-06-01

    Phylogenetic relationships of Indian Citron (Citrus medica L.) with other important Citrus species have been inferred through sequence analyses of rbcL and matK gene region of chloroplast DNA. The study was based on 23 accessions of Citrus genotypes representing 15 taxa of Indian Citrus, collected from wild, semi-wild, and domesticated stocks. The phylogeny was inferred using the maximum parsimony (MP) and neighbor-joining (NJ) methods. Both MP and NJ trees separated all the 23 accessions of Citrus into five distinct clusters. The chloroplast DNA (cpDNA) analysis based on rbcL and matK sequence data carried out in Indian taxa of Citrus was useful in differentiating all the true species and species/varieties of probable hybrid origin in distinct clusters or groups. Sequence analysis based on rbcL and matK gene provided unambiguous identification and disposition of true species like C. maxima, C. medica, C. reticulata, and related hybrids/cultivars. The separation of C. maxima, C. medica, and C. reticulata in distinct clusters or sub-clusters supports their distinctiveness as the basic species of edible Citrus. However, the cpDNA sequence analysis of rbcL and matK gene could not find any clear cut differentiation between subgenera Citrus and Papeda as proposed in Swingle's system of classification.

  12. Summation of the product of certain functions and generalized Fibonacci numbers

    NASA Astrophysics Data System (ADS)

    Chong, Chin-Yoon; Ang, Siew-Ling; Ho, C. K.

    2014-12-01

    In this paper, we derived the summations $\sum_{i=0}^{n} f(i)U_i$ and $\sum_{i=0}^{\infty} f(i)U_i$ for certain functions $f(i)$, where $\{U_i\}$ is the generalized Fibonacci sequence defined by $U_{n+2} = pU_{n+1} + qU_n$ for all $p, q \in \mathbb{Z}^+$ and for all non-negative integers $n$, with the seed values $U_0 = 0$, $U_1 = 1$.
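
    The recurrence in this abstract (each term equals p times the previous term plus q times the one before it, starting from 0 and 1) and the finite weighted sum can be evaluated numerically. The following is a minimal sketch for checking such sums by brute force; the function names are illustrative, and it does not reproduce the closed forms derived in the paper.

```python
def generalized_fibonacci(p, q, n):
    """Return [U_0, ..., U_n] with U_0 = 0, U_1 = 1, U_{k+2} = p*U_{k+1} + q*U_k."""
    if n < 0:
        raise ValueError("n must be non-negative")
    terms = [0, 1]
    while len(terms) <= n:
        terms.append(p * terms[-1] + q * terms[-2])
    return terms[: n + 1]


def weighted_sum(f, p, q, n):
    """Evaluate the finite sum of f(i) * U_i for i = 0..n by direct summation."""
    return sum(f(i) * u for i, u in enumerate(generalized_fibonacci(p, q, n)))


# p = q = 1 recovers the ordinary Fibonacci numbers.
print(generalized_fibonacci(1, 1, 10))  # [0, 1, 1, 2, 3, 5, 8, 13, 21, 34, 55]
# With f(i) = 1 the sum is F_0 + ... + F_10 = F_12 - 1.
print(weighted_sum(lambda i: 1, 1, 1, 10))  # 143
```

    A direct evaluation like this is mainly useful as a sanity check against any closed-form expression for a particular choice of f, p and q.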

  13. Patients' experiences of dental implant treatment: A literature review of key qualitative studies.

    PubMed

    Kashbour, W A; Rousseau, N S; Ellis, J S; Thomason, J M

    2015-07-01

    To identify and summarise the findings of previous qualitative studies relating to patients' experience of dental implant treatment (DIT) at various stages of their implant treatment, by means of textual narrative synthesis. Original articles reporting patients' experience with dental implants were included. A two-stage search of the literature, electronic and by hand, identified relevant qualitative studies up to July 2014. An extensive electronic search was conducted of databases including PubMed, Embase, Scopus, Web of Knowledge, Cochrane Database and Google Scholar. Included primary studies (n=10) used qualitative research methods and qualitative analysis to investigate patients' experiences with dental implant treatment. While the growing interest in implant treatment for the replacement of missing dentition is evident, it is essential to investigate patients' perceptions of different aspects of implant treatment. This textual narrative synthesis was conducted to review qualitative studies which provided insight into patients' experience of two types of implant prostheses, namely ISOD (implant-supported overdenture) and FISP (fixed implant supported prostheses). Primary reviewed studies tended to include samples of older patients with more extensive tooth loss, and to focus on experiences prior to and post-treatment rather than on the treatment period itself. Findings across reviewed studies (n=10) suggested that patients with FISP thought of implant treatment as a process of 'normalisation' and believed that such implant restorations could be similar to natural teeth, whereas patients with ISOD focused more on the functional and social advantages of their implant treatment. The growing interest in qualitative research is evident in several branches of clinical dentistry and dental implantology is not an exception. Qualitative studies concerning the patients' account of their experience of dental implants are however limited. The aim of this review is to firstly identify recent work within this field and to subsequently categorise it more consistently by means of textual narrative synthesis, thus highlighting similarities and differences and enabling identification of gaps in research knowledge, thereby setting the direction of further research. Copyright © 2015 Elsevier Ltd. All rights reserved.

  14. Two sources of evidence on the non-automaticity of true and false belief ascription.

    PubMed

    Back, Elisa; Apperly, Ian A

    2010-04-01

    A recent study by Apperly et al. (2006) found evidence that adults do not automatically infer false beliefs while watching videos that afford such inferences. This method was extended to examine true beliefs, which are sometimes thought to be ascribed by "default" (e.g., Leslie & Thaiss, 1992). Sequences of pictures were presented in which the location of an object and a character's belief about the location of the object often changed. During the picture sequences participants responded to an unpredictable probe picture about where the character believed the object to be located or where the object was located in reality. In Experiment 1 participants were not directly instructed to track the character's beliefs about the object. There was a significant reaction time cost for belief probes compared with matched reality probes, whether the character's belief was true or false. In Experiment 2, when participants were asked to track where the character thought the object was located, responses to belief probes were faster than responses to reality probes, suggesting that the difference observed in Experiment 1 was not due to intrinsic differences between the probes, but was more likely to be due to participants inferring beliefs ad hoc in response to the probe. In both Experiments 1 and 2, responses to belief and reality probes were faster in the true belief condition than in the false belief condition. In Experiment 3 this difference was largely eliminated when participants had fewer reasons to make belief inferences spontaneously. These two lines of evidence are neatly explained by the proposition that neither true nor false beliefs are ascribed automatically, but that belief ascription may occur spontaneously in response to task demands. Copyright 2009 Elsevier B.V. All rights reserved.

  15. Semi-automatic volume measurement for orbital fat and total extraocular muscles based on Cube FSE-flex sequence in patients with thyroid-associated ophthalmopathy.

    PubMed

    Tang, X; Liu, H; Chen, L; Wang, Q; Luo, B; Xiang, N; He, Y; Zhu, W; Zhang, J

    2018-05-24

    To investigate the accuracy of two semi-automatic segmentation measurements based on magnetic resonance imaging (MRI) three-dimensional (3D) Cube fast spin echo (FSE)-flex sequence in phantoms, and to evaluate the feasibility of determining the volumetric alterations of orbital fat (OF) and total extraocular muscles (TEM) in patients with thyroid-associated ophthalmopathy (TAO) by semi-automatic segmentation. Forty-four fatty (n=22) and lean (n=22) phantoms were scanned by using Cube FSE-flex sequence with a 3 T MRI system. Their volumes were measured by manual segmentation (MS) and two semi-automatic segmentation algorithms (regional growing [RG], multi-dimensional threshold [MDT]). Pearson correlation and Bland-Altman analysis were used to evaluate the measuring accuracy of MS, RG, and MDT in phantoms as compared with the true volume. Then, OF and TEM volumes of 15 TAO patients and 15 normal controls were measured using MDT. Paired-sample t-tests were used to compare the volumes and volume ratios of different orbital tissues between TAO patients and controls. Each segmentation (MS, RG, MDT) has a significant correlation (p<0.01) with true volume. There was a minimal bias for MS, and a stronger agreement between MDT and the true volume than RG and the true volume both in fatty and lean phantoms. The reproducibility of Cube FSE-flex determined MDT was adequate. The volumetric ratios of OF/globe (p<0.01), TEM/globe (p<0.01), whole orbit/globe (p<0.01) and bone orbit/globe (p<0.01) were significantly greater in TAO patients than those in healthy controls. MRI Cube FSE-flex determined MDT is a relatively accurate semi-automatic segmentation that can be used to evaluate OF and TEM volumes in clinic. Copyright © 2018 The Royal College of Radiologists. Published by Elsevier Ltd. All rights reserved.
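
    The two agreement statistics named in this abstract, Pearson correlation and Bland-Altman analysis, can be illustrated with a minimal Python sketch. The volumes below are invented for illustration and are not data from the study. The 1.96 multiplier for the limits of agreement is the conventional choice and is an assumption here, since the abstract does not state it; the function names are hypothetical.

```python
from statistics import mean, stdev


def bland_altman(measured, reference):
    """Return (bias, lower limit, upper limit) of agreement for paired values."""
    if len(measured) != len(reference):
        raise ValueError("paired inputs must have the same length")
    diffs = [m - r for m, r in zip(measured, reference)]
    bias = mean(diffs)          # mean difference between the two methods
    sd = stdev(diffs)           # sample standard deviation of the differences
    return bias, bias - 1.96 * sd, bias + 1.96 * sd


def pearson_r(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5


# Hypothetical phantom volumes in mL (illustration only, not study data).
true_volume = [10.0, 20.0, 30.0, 40.0]
mdt_volume = [11.0, 19.0, 31.0, 41.0]

bias, lower, upper = bland_altman(mdt_volume, true_volume)
r = pearson_r(mdt_volume, true_volume)
print(f"bias={bias:.2f} mL, limits=({lower:.2f}, {upper:.2f}) mL, r={r:.4f}")
```

    A small bias with narrow limits indicates close agreement with the true volume; a high correlation alone does not, which is presumably why the study reports both.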

  16. Dynamics of domain coverage of the protein sequence universe.

    PubMed

    Rekapalli, Bhanu; Wuichet, Kristin; Peterson, Gregory D; Zhulin, Igor B

    2012-11-16

    The currently known protein sequence space consists of millions of sequences in public databases and is rapidly expanding. Assigning sequences to families leads to a better understanding of protein function and the nature of the protein universe. However, a large portion of the current protein space remains unassigned and is referred to as its "dark matter". Here we suggest that the true size of "dark matter" is much larger than stated by current definitions. We propose an approach to reducing the size of "dark matter" by identifying and subtracting regions in protein sequences that are not likely to contain any domain. Recent improvements in computational domain modeling result in a decrease, albeit slowly, in the relative size of "dark matter"; however, its absolute size increases substantially with the growth of sequence data.

  17. Mitochondrial phylogenomics of Hemiptera reveals adaptive innovations driving the diversification of true bugs

    PubMed Central

    Li, Hu; Leavengood, John M.; Chapman, Eric G.; Burkhardt, Daniel; Song, Fan; Jiang, Pei; Liu, Jinpeng; Cai, Wanzhi

    2017-01-01

    Hemiptera, the largest non-holometabolous order of insects, represents approximately 7% of metazoan diversity. With extraordinary life histories and highly specialized morphological adaptations, hemipterans have exploited diverse habitats and food sources through approximately 300 Myr of evolution. To elucidate the phylogeny and evolutionary history of Hemiptera, we carried out the most comprehensive mitogenomics analysis on the richest taxon sampling to date covering all the suborders and infraorders, including 34 newly sequenced and 94 published mitogenomes. With optimized branch length and sequence heterogeneity, Bayesian analyses using a site-heterogeneous mixture model resolved the higher-level hemipteran phylogeny as (Sternorrhyncha, (Auchenorrhyncha, (Coleorrhyncha, Heteroptera))). Ancestral character state reconstruction and divergence time estimation suggest that the success of true bugs (Heteroptera) is probably due to angiosperm coevolution, but key adaptive innovations (e.g. prognathous mouthpart, predatory behaviour, and haemelytron) facilitated multiple independent shifts among diverse feeding habits and multiple independent colonizations of aquatic habitats. PMID:28878063

  18. Efficient experimental design and analysis strategies for the detection of differential expression using RNA-Sequencing

    PubMed Central

    2012-01-01

    Background RNA sequencing (RNA-Seq) has emerged as a powerful approach for the detection of differential gene expression with both high-throughput and high resolution capabilities possible depending upon the experimental design chosen. Multiplex experimental designs are now readily available, these can be utilised to increase the numbers of samples or replicates profiled at the cost of decreased sequencing depth generated per sample. These strategies impact on the power of the approach to accurately identify differential expression. This study presents a detailed analysis of the power to detect differential expression in a range of scenarios including simulated null and differential expression distributions with varying numbers of biological or technical replicates, sequencing depths and analysis methods. Results Differential and non-differential expression datasets were simulated using a combination of negative binomial and exponential distributions derived from real RNA-Seq data. These datasets were used to evaluate the performance of three commonly used differential expression analysis algorithms and to quantify the changes in power with respect to true and false positive rates when simulating variations in sequencing depth, biological replication and multiplex experimental design choices. Conclusions This work quantitatively explores comparisons between contemporary analysis tools and experimental design choices for the detection of differential expression using RNA-Seq. We found that the DESeq algorithm performs more conservatively than edgeR and NBPSeq. With regard to testing of various experimental designs, this work strongly suggests that greater power is gained through the use of biological replicates relative to library (technical) replicates and sequencing depth. Strikingly, sequencing depth could be reduced as low as 15% without substantial impacts on false positive or true positive rates. PMID:22985019

  19. Efficient experimental design and analysis strategies for the detection of differential expression using RNA-Sequencing.

    PubMed

    Robles, José A; Qureshi, Sumaira E; Stephen, Stuart J; Wilson, Susan R; Burden, Conrad J; Taylor, Jennifer M

    2012-09-17

    RNA sequencing (RNA-Seq) has emerged as a powerful approach for the detection of differential gene expression with both high-throughput and high resolution capabilities possible depending upon the experimental design chosen. Multiplex experimental designs are now readily available, these can be utilised to increase the numbers of samples or replicates profiled at the cost of decreased sequencing depth generated per sample. These strategies impact on the power of the approach to accurately identify differential expression. This study presents a detailed analysis of the power to detect differential expression in a range of scenarios including simulated null and differential expression distributions with varying numbers of biological or technical replicates, sequencing depths and analysis methods. Differential and non-differential expression datasets were simulated using a combination of negative binomial and exponential distributions derived from real RNA-Seq data. These datasets were used to evaluate the performance of three commonly used differential expression analysis algorithms and to quantify the changes in power with respect to true and false positive rates when simulating variations in sequencing depth, biological replication and multiplex experimental design choices. This work quantitatively explores comparisons between contemporary analysis tools and experimental design choices for the detection of differential expression using RNA-Seq. We found that the DESeq algorithm performs more conservatively than edgeR and NBPSeq. With regard to testing of various experimental designs, this work strongly suggests that greater power is gained through the use of biological replicates relative to library (technical) replicates and sequencing depth. Strikingly, sequencing depth could be reduced as low as 15% without substantial impacts on false positive or true positive rates.

  20. Using information content and base frequencies to distinguish mutations from genetic polymorphisms in splice junction recognition sites.

    PubMed

    Rogan, P K; Schneider, T D

    1995-01-01

    Predicting the effects of nucleotide substitutions in human splice sites has been based on analysis of consensus sequences. We used a graphic representation of sequence conservation and base frequency, the sequence logo, to demonstrate that a change in a splice acceptor of hMSH2 (a gene associated with familial nonpolyposis colon cancer) probably does not reduce splicing efficiency. This confirms a population genetic study that suggested that this substitution is a genetic polymorphism. The information theory-based sequence logo is quantitative and more sensitive than the corresponding splice acceptor consensus sequence for detection of true mutations. Information analysis may potentially be used to distinguish polymorphisms from mutations in other types of transcriptional, translational, or protein-coding motifs.
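
    The quantity displayed by a sequence logo, the information content per position, can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions: it uses a uniform background of four bases (maximum 2 bits per position), omits the small-sample correction, and runs on hypothetical four-base sites rather than real splice acceptor data. The function name is illustrative only.

```python
from collections import Counter
from math import log2


def position_information(aligned_sites):
    """Information content in bits at each column of equal-length DNA sites."""
    length = len(aligned_sites[0])
    if any(len(site) != length for site in aligned_sites):
        raise ValueError("all sites must have the same length")
    info = []
    for col in range(length):
        counts = Counter(site[col] for site in aligned_sites)
        total = sum(counts.values())
        # Shannon entropy of the observed base frequencies in this column.
        entropy = -sum((n / total) * log2(n / total) for n in counts.values())
        # Information is the drop from the 2-bit maximum for four bases.
        info.append(2.0 - entropy)
    return info


# Hypothetical aligned sites (illustration only).
sites = ["CAGG", "TAGG", "CAGA", "TAGT"]
bits = position_information(sites)
print(bits)
```

    Fully conserved columns carry 2 bits and variable columns carry less, so a substitution at a low-information position is less likely to disrupt recognition than one at a conserved position.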

  1. [The principle and application of the single-molecule real-time sequencing technology].

    PubMed

    Yanhu, Liu; Lu, Wang; Li, Yu

    2015-03-01

    The last decade witnessed the explosive development of third-generation sequencing strategies, including single-molecule real-time sequencing (SMRT), true single-molecule sequencing (tSMS™) and single-molecule nanopore DNA sequencing. In this review, we summarize the principle, performance and application of the SMRT sequencing technology. Compared with the traditional Sanger method and the next-generation sequencing (NGS) technologies, the SMRT approach has several advantages, including long read length, high speed, PCR-free operation and the capability of direct detection of epigenetic modifications. However, its low accuracy, most of which results from insertions and deletions, is a notable disadvantage, so the raw sequence data need to be corrected before assembly. To date, SMRT is a good fit for de novo genomic sequencing and high-quality assemblies of small genomes. In the future, it is expected to play an important role in epigenetics, transcriptomic sequencing, and assemblies of large genomes.

  2. Data mining of enzymes using specific peptides

    PubMed Central

    2009-01-01

    Background Predicting the function of a protein from its sequence is a long-standing challenge of bioinformatic research, typically addressed using either sequence-similarity or sequence-motifs. We employ the novel motif method that consists of Specific Peptides (SPs) that are unique to specific branches of the Enzyme Commission (EC) functional classification. We devise the Data Mining of Enzymes (DME) methodology that allows for searching SPs on arbitrary proteins, determining from its sequence whether a protein is an enzyme and what the enzyme's EC classification is. Results We extract novel SP sets from Swiss-Prot enzyme data. Using a training set of July 2006, and test sets of July 2008, we find that the predictive power of SPs, both for true-positives (enzymes) and true-negatives (non-enzymes), depends on the coverage length of all SP matches (the number of amino-acids matched on the protein sequence). DME is quite different from BLAST. Comparing the two on an enzyme test set of July 2008, we find that DME has lower recall. On the other hand, DME can provide predictions for proteins regarded by BLAST as having low homologies with known enzymes, thus supplying complementary information. We test our method on a set of proteins belonging to 10 bacteria, dated July 2008, establishing the usefulness of the coverage-length cutoff to determine true-negatives. Moreover, sifting through our predictions we find that some of them have been substantiated by Swiss-Prot annotations by July 2009. Finally we extract, for production purposes, a novel SP set trained on all Swiss-Prot enzymes as of July 2009. This new set increases considerably the recall of DME. The new SP set is being applied to three metagenomes: Sargasso Sea with over 1,000,000 proteins, producing predictions of over 220,000 enzymes, and two human gut metagenomes. 
The outcome of these analyses can be characterized by the enzymatic profile of the metagenomes, describing the relative numbers of enzymes observed for different EC categories. Conclusions Employing SPs for predicting enzymatic activity of proteins works well once one utilizes coverage-length criteria. In our analysis, L ≥ 7 has led to highly accurate results. PMID:20034383
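
    The coverage-length criterion described in this abstract (the number of amino acids on a protein covered by Specific Peptide matches, with a cutoff of L ≥ 7) can be sketched as follows. This is a minimal illustration, not the DME implementation: exact substring matching is assumed, and the protein, the peptides and the function names are hypothetical.

```python
def coverage_length(protein, peptides):
    """Count residues of `protein` covered by at least one peptide match."""
    covered = [False] * len(protein)
    for peptide in peptides:
        if not peptide:
            continue  # ignore empty patterns
        start = protein.find(peptide)
        while start != -1:
            for i in range(start, start + len(peptide)):
                covered[i] = True
            # Continue searching one residue later to catch overlapping hits.
            start = protein.find(peptide, start + 1)
    return sum(covered)


def passes_coverage_cutoff(protein, peptides, min_coverage=7):
    """Apply the coverage-length cutoff (default 7, as quoted in the abstract)."""
    return coverage_length(protein, peptides) >= min_coverage


# Hypothetical protein and peptide set (illustration only).
protein = "MKTAYIAKQRQISFVK"
peptides = ["TAYIA", "IAKQR", "XXXXX"]

total = coverage_length(protein, peptides)
print(total, passes_coverage_cutoff(protein, peptides))
```

    Overlapping matches are counted once per residue, so two five-residue peptides that share two positions contribute eight covered residues rather than ten.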

  3. Stochastic control system parameter identifiability

    NASA Technical Reports Server (NTRS)

    Lee, C. H.; Herget, C. J.

    1975-01-01

    The parameter identification problem of general discrete time, nonlinear, multiple input/multiple output dynamic systems with Gaussian white distributed measurement errors is considered. The knowledge of the system parameterization was assumed to be known. Concepts of local parameter identifiability and local constrained maximum likelihood parameter identifiability were established. A set of sufficient conditions for the existence of a region of parameter identifiability was derived. A computation procedure employing interval arithmetic was provided for finding the regions of parameter identifiability. If the vector of the true parameters is locally constrained maximum likelihood (CML) identifiable, then with probability one, the vector of true parameters is a unique maximal point of the maximum likelihood function in the region of parameter identifiability and the constrained maximum likelihood estimation sequence will converge to the vector of true parameters.

  4. Dynamics of domain coverage of the protein sequence universe

    PubMed Central

    2012-01-01

    Background The currently known protein sequence space consists of millions of sequences in public databases and is rapidly expanding. Assigning sequences to families leads to a better understanding of protein function and the nature of the protein universe. However, a large portion of the current protein space remains unassigned and is referred to as its “dark matter”. Results Here we suggest that the true size of “dark matter” is much larger than stated by current definitions. We propose an approach to reducing the size of “dark matter” by identifying and subtracting regions in protein sequences that are not likely to contain any domain. Conclusions Recent improvements in computational domain modeling result in a decrease, albeit slowly, in the relative size of “dark matter”; however, its absolute size increases substantially with the growth of sequence data. PMID:23157439

  5. Testing the molecular clock using mechanistic models of fossil preservation and molecular evolution

    PubMed Central

    2017-01-01

    Molecular sequence data provide information about relative times only, and fossil-based age constraints are the ultimate source of information about absolute times in molecular clock dating analyses. Thus, fossil calibrations are critical to molecular clock dating, but competing methods are difficult to evaluate empirically because the true evolutionary time scale is never known. Here, we combine mechanistic models of fossil preservation and sequence evolution in simulations to evaluate different approaches to constructing fossil calibrations and their impact on Bayesian molecular clock dating, and the relative impact of fossil versus molecular sampling. We show that divergence time estimation is impacted by the model of fossil preservation, sampling intensity and tree shape. The addition of sequence data may improve molecular clock estimates, but accuracy and precision is dominated by the quality of the fossil calibrations. Posterior means and medians are poor representatives of true divergence times; posterior intervals provide a much more accurate estimate of divergence times, though they may be wide and often do not have high coverage probability. Our results highlight the importance of increased fossil sampling and improved statistical approaches to generating calibrations, which should incorporate the non-uniform nature of ecological and temporal fossil species distributions. PMID:28637852

  6. Sites Inferred by Metabolic Background Assertion Labeling (SIMBAL): adapting the Partial Phylogenetic Profiling algorithm to scan sequences for signatures that predict protein function

    PubMed Central

    2010-01-01

    Background Comparative genomics methods such as phylogenetic profiling can mine powerful inferences from inherently noisy biological data sets. We introduce Sites Inferred by Metabolic Background Assertion Labeling (SIMBAL), a method that applies the Partial Phylogenetic Profiling (PPP) approach locally within a protein sequence to discover short sequence signatures associated with functional sites. The approach is based on the basic scoring mechanism employed by PPP, namely the use of binomial distribution statistics to optimize sequence similarity cutoffs during searches of partitioned training sets. Results Here we illustrate and validate the ability of the SIMBAL method to find functionally relevant short sequence signatures by application to two well-characterized protein families. In the first example, we partitioned a family of ABC permeases using a metabolic background property (urea utilization). Thus, the TRUE set for this family comprised members whose genome of origin encoded a urea utilization system. By moving a sliding window across the sequence of a permease, and searching each subsequence in turn against the full set of partitioned proteins, the method found which local sequence signatures best correlated with the urea utilization trait. Mapping of SIMBAL "hot spots" onto crystal structures of homologous permeases reveals that the significant sites are gating determinants on the cytosolic face rather than, say, docking sites for the substrate-binding protein on the extracellular face. In the second example, we partitioned a protein methyltransferase family using gene proximity as a criterion. In this case, the TRUE set comprised those methyltransferases encoded near the gene for the substrate RF-1. SIMBAL identifies sequence regions that map onto the substrate-binding interface while ignoring regions involved in the methyltransferase reaction mechanism in general. 
Neither method for training set construction requires any prior experimental characterization. Conclusions SIMBAL shows that, in functionally divergent protein families, selected short sequences often significantly outperform their full-length parent sequence for making functional predictions by sequence similarity, suggesting avenues for improved functional classifiers. When combined with structural data, SIMBAL affords the ability to localize and model functional sites. PMID:20102603

  7. Discovering Deeply Divergent RNA Viruses in Existing Metatranscriptome Data with Machine Learning

    NASA Astrophysics Data System (ADS)

    Rivers, A. R.

    2016-02-01

    Most sampling of RNA viruses and phages has been directed toward a narrow range of hosts and environments. Several marine metagenomic studies have examined the RNA viral fraction in aquatic samples and found a number of picornaviruses and uncharacterized sequences. The lack of homology to known protein families has limited the discovery of new RNA viruses. We developed a computational method for identifying RNA viruses that relies on information in the codon transition probabilities of viral sequences to train a classifier. This approach does not rely on homology, but it has higher information content than other reference-free methods such as tetranucleotide frequency. Training and validation with RefSeq data gave true positive and true negative rates of 99.6% and 99.5% on the highly imbalanced validation sets (0.2% viruses) that, like the metatranscriptomes themselves, contain mostly non-viral sequences. To further test the method, a validation dataset of putative RNA virus genomes were identified in metatransciptomes by the presence of RNA dependent RNA polymerase, an essential gene for RNA viruses. The classifier successfully identified 99.4% of those contigs as viral. This approach is currently being extended to screen all metatranscriptome data sequenced at the DOE Joint Genome Institute, presently 4.5 Gb of assembled data from 504 public projects representing a wide range of marine, aquatic and terrestrial environments.
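
    The feature this abstract relies on, codon transition probabilities, can be illustrated with a minimal Python sketch. The abstract does not say how the probabilities are estimated, so this sketch assumes first-order transitions between consecutive in-frame codons, normalised per preceding codon. The input sequence and function name are hypothetical, and the classifier itself is not shown.

```python
from collections import defaultdict


def codon_transition_probabilities(sequence, frame=0):
    """Estimate P(next codon | current codon) from one reading frame."""
    sequence = sequence.upper()
    # Split into complete codons only; a trailing partial codon is dropped.
    codons = [sequence[i:i + 3] for i in range(frame, len(sequence) - 2, 3)]
    counts = defaultdict(lambda: defaultdict(int))
    for current, following in zip(codons, codons[1:]):
        counts[current][following] += 1
    probabilities = {}
    for current, row in counts.items():
        total = sum(row.values())
        probabilities[current] = {nxt: n / total for nxt, n in row.items()}
    return probabilities


# Hypothetical RNA fragment (illustration only).
probs = codon_transition_probabilities("AUGGCUAUGGCUAUGAAA")
print(probs)
```

    Flattening such a table into a fixed-length vector would give one possible input representation for a classifier, though the representation used in the study may differ.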

  8. Constraining the weak-wind problem: an XMM-HST campaign for the magnetic O9.7 V star HD 54879

    NASA Astrophysics Data System (ADS)

    Shenar, T.; Oskinova, L. M.; Järvinen, S. P.; Luckas, P.; Hainich, R.; Todt, H.; Hubrig, S.; Sander, A. A. C.; Ilyin, I.; Hamann, W.-R.

    2018-01-01

    Mass-loss rates of massive, late type main sequence stars are much weaker than currently predicted, but their true values are very difficult to measure. We suggest that confined stellar winds of magnetic stars can be exploited to constrain the true mass-loss rates Ṁ of massive main sequence stars. We acquired UV, X-ray, and optical amateur data of HD 54879 (O9.7 V), one of a few O-type stars with a detected atmospheric magnetic field (B_d ≳ 2 kG). We analyze these data with the Potsdam Wolf-Rayet (PoWR) and XSPEC codes. We can roughly estimate the mass-loss rate the star would have in the absence of a magnetic field as log Ṁ_(B=0) ≈ -9.0 M⊙ yr^-1. Since the wind is partially trapped within the Alfvén radius r_A ≳ 12 R*, the true mass-loss rate of HD 54879 is log Ṁ ≲ -10.2 M⊙ yr^-1. Moreover, we find that the microturbulent, macroturbulent, and projected rotational velocities are lower than previously suggested (< 4 km s^-1). An initial mass of 16 M⊙ and an age of 5 Myr are inferred. We derive a mean X-ray emitting temperature of log T_X = 6.7 K and an X-ray luminosity of log L_X = 32 erg s^-1. The latter implies a significant X-ray excess (log L_X/L_Bol ≈ -6.0), most likely stemming from collisions at the magnetic equator. A tentative period of P ≈ 5 yr is derived from variability of the Hα line. Our study confirms that strongly magnetized stars lose little or no mass, and supplies important constraints on the weak-wind problem of massive main sequence stars.

  9. High Order Non-Stationary Markov Models and Anomaly Propagation Analysis in Intrusion Detection System (IDS)

    DTIC Science & Technology

    2007-02-01

    almost identical system call sequences and triggering the same alarm at different hosts. The alarm propagation effect can be used to distinguish “true alarms” from “false positives”. At the host-level, a new anomaly...

  10. Maternal mosaicism is a significant contributor to discordant sex chromosomal aneuploidies associated with noninvasive prenatal testing.

    PubMed

    Wang, Yanlin; Chen, Yan; Tian, Feng; Zhang, Jianguang; Song, Zhuo; Wu, Yi; Han, Xu; Hu, Wenjing; Ma, Duan; Cram, David; Cheng, Weiwei

    2014-01-01

    In the human fetus, sex chromosome aneuploidies (SCAs) are as prevalent as the common autosomal trisomies 21, 18, and 13. Currently, most noninvasive prenatal tests (NIPTs) offer screening only for chromosomes 21, 18, and 13, because the sensitivity and specificity are markedly higher than for the sex chromosomes. Limited studies suggest that the reduced accuracy associated with detecting SCAs is due to confined placental, placental, or true fetal mosaicism. We hypothesized that an altered maternal karyotype may also be an important contributor to discordant SCA NIPT results. We developed a rapid karyotyping method that uses massively parallel sequencing to measure the degree of chromosome mosaicism. The method was validated with DNA models mimicking XXX and XO mosaicism and then applied to maternal white blood cell (WBC) DNA from patients with discordant SCA NIPT results. Sequencing karyotyping detected chromosome X (ChrX) mosaicism as low as 5%, allowing an accurate assignment of the maternal X karyotype. In a prospective NIPT study, we showed that 16 (8.6%) of 181 positive SCAs were due to an abnormal maternal ChrX karyotype that masked the true contribution of the fetal ChrX DNA fraction. The accuracy of NIPT for ChrX and ChrY can be improved substantially by integrating the results of maternal-plasma sequencing with those for maternal-WBC sequencing. The relatively high frequency of maternal mosaicism warrants mandatory WBC testing in both shotgun sequencing- and single-nucleotide polymorphism-based clinical NIPT after the finding of a potential fetal SCA.

  11. An automatic and efficient pipeline for disease gene identification through utilizing family-based sequencing data.

    PubMed

    Song, Dandan; Li, Ning; Liao, Lejian

    2015-01-01

    Because they generate enormous amounts of data at lower cost and in less time, whole-exome sequencing technologies provide dramatic opportunities for identifying disease genes implicated in Mendelian disorders. Since thousands of genomic variants can be sequenced in each exome, it is challenging to filter pathogenic variants in protein coding regions and reduce the number of missed true variants. Therefore, an automatic and efficient pipeline for finding disease variants in Mendelian disorders is designed by exploiting a combination of variant filtering steps to analyze family-based exome sequencing data. Recent studies on the Freeman-Sheldon disease are revisited and show that the proposed method outperforms other existing candidate gene identification methods.

  12. Simple chained guide trees give high-quality protein multiple sequence alignments

    PubMed Central

    Boyce, Kieran; Sievers, Fabian; Higgins, Desmond G.

    2014-01-01

    Guide trees are used to decide the order of sequence alignment in the progressive multiple sequence alignment heuristic. These guide trees are often the limiting factor in making large alignments, and considerable effort has been expended over the years in making these quickly or accurately. In this article we show that, at least for protein families with large numbers of sequences that can be benchmarked with known structures, simple chained guide trees give the most accurate alignments. These also happen to be the fastest and simplest guide trees to construct, computationally. Such guide trees have a striking effect on the accuracy of alignments produced by some of the most widely used alignment packages. There is a marked increase in accuracy and a marked decrease in computational time, once the number of sequences goes much above a few hundred. This is true, even if the order of sequences in the guide tree is random. PMID:25002495
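
    A chained guide tree, in which each sequence is joined in turn to the growing alignment, can be written down directly. The Python sketch below builds one in Newick notation. It is an illustration under stated assumptions rather than the authors' code: the function name and the optional seed for a random order are hypothetical, and how a given alignment package accepts a user-supplied guide tree is not shown.

```python
import random


def chained_guide_tree(names, shuffle_seed=None):
    """Return a fully chained (caterpillar) guide tree as a Newick string."""
    order = list(names)
    if len(order) < 2:
        raise ValueError("need at least two sequences")
    if shuffle_seed is not None:
        # Optional random order; the abstract reports that the benefit
        # holds even when the order of sequences is random.
        random.Random(shuffle_seed).shuffle(order)
    tree = f"({order[0]},{order[1]})"
    for name in order[2:]:
        # Each new sequence is attached to everything aligned so far.
        tree = f"({tree},{name})"
    return tree + ";"


# Hypothetical sequence labels (illustration only).
labels = ["s1", "s2", "s3", "s4"]
tree = chained_guide_tree(labels)
shuffled = chained_guide_tree(labels, shuffle_seed=1)
print(tree)
print(shuffled)
```

    Building the tree takes time linear in the number of sequences and needs no pairwise distance matrix, which is consistent with the abstract's remark that these are the fastest and simplest guide trees to construct.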

  13. Word-Synchronous Optical Sampling of Periodically Repeated OTDM Data Words for True Waveform Visualization

    NASA Astrophysics Data System (ADS)

    Benkler, Erik; Telle, Harald R.

    2007-06-01

    An improved phase-locked loop (PLL) for versatile synchronization of a sampling pulse train to an optical data stream is presented. It enables optical sampling of the true waveform of repetitive high bit-rate optical time division multiplexed (OTDM) data words such as pseudorandom bit sequences. Visualization of the true waveform can reveal details, which cause systematic bit errors. Such errors cannot be inferred from eye diagrams and require word-synchronous sampling. The programmable direct-digital-synthesis circuit used in our novel PLL approach allows flexible adaption of virtually any problem-specific synchronization scenario, including those required for waveform sampling, for jitter measurements by slope detection, and for classical eye-diagrams. Phase comparison of the PLL is performed at 10-GHz OTDM base clock rate, leading to a residual synchronization jitter of less than 70 fs.

  14. Aptaligner: automated software for aligning pseudorandom DNA X-aptamers from next-generation sequencing data.

    PubMed

    Lu, Emily; Elizondo-Riojas, Miguel-Angel; Chang, Jeffrey T; Volk, David E

    2014-06-10

    Next-generation sequencing results from bead-based aptamer libraries have demonstrated that traditional DNA/RNA alignment software is insufficient. This is particularly true for X-aptamers containing specialty bases (W, X, Y, Z, ...) that are identified by special encoding. Thus, we sought an automated program that uses the inherent design scheme of bead-based X-aptamers to create a hypothetical reference library and Markov modeling techniques to provide improved alignments. Aptaligner provides this feature as well as length error and noise level cutoff features, is parallelized to run on multiple central processing units (cores), and sorts sequences from a single chip into projects and subprojects.

  15. Aetiological diagnosis of male sex ambiguity: a collaborative study.

    PubMed

    Morel, Yves; Rey, Rodolfo; Teinturier, Cécile; Nicolino, Marc; Michel-Calemard, Laurence; Mowszowicz, Irène; Jaubert, Francis; Fellous, Marc; Chaussain, Jean-Louis; Chatelain, Pierre; David, Michel; Nihoul-Fékété, Claire; Forest, Maguelone G; Josso, Nathalie

    2002-01-01

    A collaborative study, supported by the Biomed2 Programme of the European Community, was initiated to optimise the aetiological diagnosis in genetic or gonadal males with intersex disorders, a total of 67 patients with external sexual ambiguity, testicular tissue and/or a XY karyotype. In patients with gonadal dysgenesis or true hermaphroditism, the incidence of vaginal development was 100%, a uterus was present in 60%; uni- or bilateral cryptorchidism was seen in nearly all cases of testicular dysgenesis (99%) but in only 57% of true hermaphrodites. Mean serum levels of anti-mullerian hormone and of serum testosterone response to chorionic gonadotropin stimulation were significantly decreased in both conditions, by comparison with patients with unexplained male pseudohermaphroditism or partial androgen insensitivity (PAIS). Mutations in the androgen receptor, 90% within exons 2-8, were detected in patients with PAIS. Clinically, a vaginal pouch was present in 90%, cryptorchidism in 36%. In 52% of cases, no diagnosis could be reached, despite an exhaustive clinical and laboratory work-up, including routine sequencing of exons 2-8 of the androgen receptor. By comparison with PAIS, unexplained male pseudohermaphroditism was characterised by a lower incidence of vaginal pouch (55%) and cryptorchidism (22%) but a high incidence of prematurity/intrauterine growth retardation (30%) or mild malformations (14%). Reaching an aetiological diagnosis in cases of male intersex is difficult because of the variability of individual cases. Hormonal tests may help to discriminate between partial androgen insensitivity and gonadal dysgenesis/true hermaphroditism but are of less use for differentiating from unexplained male pseudohermaphroditism. Sequencing of exons 2-8 of the androgen receptor after study of testosterone precursors following human chorionic gonadotrophin stimulation is recommended when gonadal dysgenesis and true hermaphroditism can be excluded.

  16. Accurate indel prediction using paired-end short reads

    PubMed Central

    2013-01-01

    Background: One of the major open challenges in next generation sequencing (NGS) is the accurate identification of structural variants such as insertions and deletions (indels). Current methods for indel calling assign scores to different types of evidence or counter-evidence for the presence of an indel, such as the number of split read alignments spanning the boundaries of a deletion candidate or reads that map within a putative deletion. Candidates with a score above a manually defined threshold are then predicted to be true indels. As a consequence, structural variants detected in this manner contain many false positives. Results: Here, we present a machine learning based method which is able to discover and distinguish true from false indel candidates in order to reduce the false positive rate. Our method identifies indel candidates using a discriminative classifier based on features of split read alignment profiles and trained on true and false indel candidates that were validated by Sanger sequencing. We demonstrate the usefulness of our method with paired-end Illumina reads from 80 genomes of the first phase of the 1001 Genomes Project (http://www.1001genomes.org) in Arabidopsis thaliana. Conclusion: In this work we show that indel classification is a necessary step to reduce the number of false positive candidates. We demonstrate that missing classification may lead to spurious biological interpretations. The software is available at: http://agkb.is.tuebingen.mpg.de/Forschung/SV-M/. PMID:23442375

  17. Mel-36 – preliminary description of a new morel species

    USDA-ARS's Scientific Manuscript database

    A pilot survey of true morels (Morchella) of Newfoundland and Labrador (NL), employing phylogenetic analyses of multilocus DNA sequence data, resulted in the discovery of a novel species that is currently only known from NL and New Brunswick. This unnamed species was informally designated Morchella ...

  18. Scene-based nonuniformity correction using local constant statistics.

    PubMed

    Zhang, Chao; Zhao, Wenyi

    2008-06-01

    In scene-based nonuniformity correction, the statistical approach assumes all possible values of the true-scene pixel are seen at each pixel location. This global-constant-statistics assumption does not distinguish fixed pattern noise from spatial variations in the average image. This often causes the "ghosting" artifacts in the corrected images since the existing spatial variations are treated as noises. We introduce a new statistical method to reduce the ghosting artifacts. Our method proposes a local-constant statistics that assumes that the temporal signal distribution is not constant at each pixel but is locally true. This considers statistically a constant distribution in a local region around each pixel but uneven distribution in a larger scale. Under the assumption that the fixed pattern noise concentrates in a higher spatial-frequency domain than the distribution variation, we apply a wavelet method to the gain and offset image of the noise and separate out the pattern noise from the spatial variations in the temporal distribution of the scene. We compare the results to the global-constant-statistics method using a clean sequence with large artificial pattern noises. We also apply the method to a challenging CCD video sequence and a LWIR sequence to show how effective it is in reducing noise and the ghosting artifacts.

  19. Testing the molecular clock using mechanistic models of fossil preservation and molecular evolution.

    PubMed

    Warnock, Rachel C M; Yang, Ziheng; Donoghue, Philip C J

    2017-06-28

    Molecular sequence data provide information about relative times only, and fossil-based age constraints are the ultimate source of information about absolute times in molecular clock dating analyses. Thus, fossil calibrations are critical to molecular clock dating, but competing methods are difficult to evaluate empirically because the true evolutionary time scale is never known. Here, we combine mechanistic models of fossil preservation and sequence evolution in simulations to evaluate different approaches to constructing fossil calibrations and their impact on Bayesian molecular clock dating, and the relative impact of fossil versus molecular sampling. We show that divergence time estimation is impacted by the model of fossil preservation, sampling intensity and tree shape. The addition of sequence data may improve molecular clock estimates, but accuracy and precision is dominated by the quality of the fossil calibrations. Posterior means and medians are poor representatives of true divergence times; posterior intervals provide a much more accurate estimate of divergence times, though they may be wide and often do not have high coverage probability. Our results highlight the importance of increased fossil sampling and improved statistical approaches to generating calibrations, which should incorporate the non-uniform nature of ecological and temporal fossil species distributions. © 2017 The Authors.

  20. Molecular phylogeny, population genetics, and evolution of heterocystous cyanobacteria using nifH gene sequences.

    PubMed

    Singh, Prashant; Singh, Satya Shila; Elster, Josef; Mishra, Arun Kumar

    2013-06-01

    In order to assess phylogeny, population genetics, and approximation of future course of cyanobacterial evolution based on nifH gene sequences, 41 heterocystous cyanobacterial strains collected from all over India have been used in the present study. NifH gene sequence analysis data confirm that the heterocystous cyanobacteria are monophyletic while the stigonematales show polyphyletic origin with grave intermixing. Further, analysis of nifH gene sequence data using intricate mathematical extrapolations revealed that the nucleotide diversity and recombination frequency is much greater in Nostocales than the Stigonematales. Similarly, DNA divergence studies showed significant values of divergence with greater gene conversion tracts in the unbranched (Nostocales) than the branched (Stigonematales) strains. Our data strongly support the origin of true branching cyanobacterial strains from the unbranched strains.

  1. 75 FR 62820 - Screening Framework Guidance for Providers of Synthetic Double-Stranded DNA

    Federal Register 2010, 2011, 2012, 2013, 2014

    2010-10-13

    ... I. Summary Synthetic biology, the developing interdisciplinary field that focuses on both the design and fabrication of novel biological components and systems as well as the re-design and fabrication of... develop, maintain, and document protocols to determine if a sequence ``hit'' qualifies as a true...

  2. Comparative analysis of tandem repeats from hundreds of species reveals unique insights into centromere evolution

    USDA-ARS's Scientific Manuscript database

    Centromeres are essential for chromosome segregation, yet their DNA sequences evolve rapidly. In most animals and plants that have been studied, centromeres comprise megabase-scale arrays of tandem repeats. The true prevalence of centromere tandem repeats, and whether they exhibit conserved seque...

  3. Toward Redefining the Humanistic Perspective.

    ERIC Educational Resources Information Center

    Shuman, R. Baird

    1980-01-01

    It is a pitifully narrow humanities curriculum which focuses on great Western art, as many college humanities sequences have done. Any true consideration of the humanities needs to view mankind in relation to the cosmos. In essence, all one's education needs to be humanistically oriented in the broadest possible sense. (Author/SJL)

  4. Quantum random number generator based on quantum nature of vacuum fluctuations

    NASA Astrophysics Data System (ADS)

    Ivanova, A. E.; Chivilikhin, S. A.; Gleim, A. V.

    2017-11-01

    A quantum random number generator (QRNG) produces true random bit sequences. In a QRNG based on the quantum nature of vacuum, an optical beam splitter with two inputs and two outputs is normally used. We compare mathematical descriptions of a spatial beam splitter and a fiber Y-splitter in the quantum model for a QRNG based on homodyne detection. The descriptions are identical, which allows fiber Y-splitters to be used in practical QRNG schemes, simplifying the setup. We also obtain relations between the input radiation and the resulting differential current in the homodyne detector. We experimentally demonstrate the possibility of true random bit generation using a QRNG based on homodyne detection with a Y-splitter.

  5. An effect of loudness of advisory speech on a choice response task

    NASA Astrophysics Data System (ADS)

    Utsuki, Narisuke; Takeuchi, Yoshinori; Nomiyama, Takenori

    1995-03-01

    Recent technologies have realized talking advisory/guidance systems in which machines give advice and guidance to operators in speech. However, nonverbal aspects of spoken messages may have significant effects on an operator's behavior. Twelve subjects participated in a TV game-like choice response task where they were asked to choose a 'true' target from three invader-like figures displayed on a CRT screen. Before each choice, the subjects received prerecorded advice designating the left, center, or right target as the one that would be true. The position of the 'true' targets and advice were preprogrammed in pseudorandom sequences. In other words, there was no way for the subjects to predict the 'true' target and there was no relationship between spoken advice and the true target position. The subjects tended to make more choices corresponding to the presented messages when the messages were presented in a louder voice than in a softer voice. Choice response time was significantly shorter when the response was the same as the advice indicated. The shortening of response time was slightly greater when advice was presented in a louder voice. This study demonstrates that spoken advice may result in faster and less deliberate responses in accordance with the presented messages which are given by talking guidance systems.

  6. 18S rRNA data indicate that Aschelminthes are polyphyletic in origin and consist of at least three distinct clades.

    PubMed

    Winnepenninckx, B; Backeljau, T; Mackey, L Y; Brooks, J M; De Wachter, R; Kumar, S; Garey, J R

    1995-11-01

    The Aschelminthes is a collection of at least eight animal phyla, historically grouped together because the absence of a true body cavity was perceived as a pseudocoelom. Analyses of 18S rRNA sequences from six Aschelminth phyla (including four previously unpublished sequences) support polyphyly for the Aschelminthes. At least three distinct groups of Aschelminthes were detected: the Priapulida among the protostomes, the Rotifera-Acanthocephala as a sister group to the protostomes, and the Nematoda as a basal group to the triploblastic Eumetazoa.

  7. Mutation detection using automated fluorescence-based sequencing.

    PubMed

    Montgomery, Kate T; Iartchouck, Oleg; Li, Li; Perera, Anoja; Yassin, Yosuf; Tamburino, Alex; Loomis, Stephanie; Kucherlapati, Raju

    2008-04-01

    The development of high-throughput DNA sequencing techniques has made direct DNA sequencing of PCR-amplified genomic DNA a rapid and economical approach to the identification of polymorphisms that may play a role in disease. Point mutations as well as small insertions or deletions are readily identified by DNA sequencing. The mutations may be heterozygous (occurring in one allele while the other allele retains the normal sequence) or homozygous (occurring in both alleles). Sequencing alone cannot discriminate between true homozygosity and apparent homozygosity due to the loss of one allele due to a large deletion. In this unit, strategies are presented for using PCR amplification and automated fluorescence-based sequencing to identify sequence variation. The size of the project and laboratory preference and experience will dictate how the data is managed and which software tools are used for analysis. A high-throughput protocol is given that has been used to search for mutations in over 200 different genes at the Harvard Medical School - Partners Center for Genetics and Genomics (HPCGG, http://www.hpcgg.org/). Copyright 2008 by John Wiley & Sons, Inc.

  8. An efficient and scalable analysis framework for variant extraction and refinement from population-scale DNA sequence data.

    PubMed

    Jun, Goo; Wing, Mary Kate; Abecasis, Gonçalo R; Kang, Hyun Min

    2015-06-01

    The analysis of next-generation sequencing data is computationally and statistically challenging because of the massive volume of data and imperfect data quality. We present GotCloud, a pipeline for efficiently detecting and genotyping high-quality variants from large-scale sequencing data. GotCloud automates sequence alignment, sample-level quality control, variant calling, filtering of likely artifacts using machine-learning techniques, and genotype refinement using haplotype information. The pipeline can process thousands of samples in parallel and requires less computational resources than current alternatives. Experiments with whole-genome and exome-targeted sequence data generated by the 1000 Genomes Project show that the pipeline provides effective filtering against false positive variants and high power to detect true variants. Our pipeline has already contributed to variant detection and genotyping in several large-scale sequencing projects, including the 1000 Genomes Project and the NHLBI Exome Sequencing Project. We hope it will now prove useful to many medical sequencing studies. © 2015 Jun et al.; Published by Cold Spring Harbor Laboratory Press.

  9. ATP hydrolysis provides functions that promote rejection of pairings between different copies of long repeated sequences

    PubMed Central

    Danilowicz, Claudia; Hermans, Laura; Coljee, Vincent; Prévost, Chantal

    2017-01-01

    Abstract During DNA recombination and repair, RecA family proteins must promote rapid joining of homologous DNA. Repeated sequences with >100 base pair lengths occupy more than 1% of bacterial genomes; however, commitment to strand exchange was believed to occur after testing ∼20–30 bp. If that were true, pairings between different copies of long repeated sequences would usually become irreversible. Our experiments reveal that in the presence of ATP hydrolysis even 75 bp sequence-matched strand exchange products remain quite reversible. Experiments also indicate that when ATP hydrolysis is present, flanking heterologous dsDNA regions increase the reversibility of sequence matched strand exchange products with lengths up to ∼75 bp. Results of molecular dynamics simulations provide insight into how ATP hydrolysis destabilizes strand exchange products. These results inspired a model that shows how pairings between long repeated sequences could be efficiently rejected even though most homologous pairings form irreversible products. PMID:28854739

  10. Fundamental Bounds for Sequence Reconstruction from Nanopore Sequencers.

    PubMed

    Magner, Abram; Duda, Jarosław; Szpankowski, Wojciech; Grama, Ananth

    2016-06-01

    Nanopore sequencers are emerging as promising new platforms for high-throughput sequencing. As with other technologies, sequencer errors pose a major challenge for their effective use. In this paper, we present a novel information theoretic analysis of the impact of insertion-deletion (indel) errors in nanopore sequencers. In particular, we consider the following problems: (i) for given indel error characteristics and rate, what is the probability of accurate reconstruction as a function of sequence length; (ii) using replicated extrusion (the process of passing a DNA strand through the nanopore), what is the number of replicas needed to accurately reconstruct the true sequence with high probability? Our results provide a number of important insights: (i) the probability of accurate reconstruction of a sequence from a single sample in the presence of indel errors tends quickly (i.e., exponentially) to zero as the length of the sequence increases; and (ii) replicated extrusion is an effective technique for accurate reconstruction. We show that for typical distributions of indel errors, the required number of replicas is a slow function (polylogarithmic) of sequence length - implying that through replicated extrusion, we can sequence large reads using nanopore sequencers. Moreover, we show that in certain cases, the required number of replicas can be related to information-theoretic parameters of the indel error distributions.

  11. Scene-based nonuniformity correction with video sequences and registration.

    PubMed

    Hardie, R C; Hayat, M M; Armstrong, E; Yasuda, B

    2000-03-10

    We describe a new, to our knowledge, scene-based nonuniformity correction algorithm for array detectors. The algorithm relies on the ability to register a sequence of observed frames in the presence of the fixed-pattern noise caused by pixel-to-pixel nonuniformity. In low-to-moderate levels of nonuniformity, sufficiently accurate registration may be possible with standard scene-based registration techniques. If the registration is accurate, and motion exists between the frames, then groups of independent detectors can be identified that observe the same irradiance (or true scene value). These detector outputs are averaged to generate estimates of the true scene values. With these scene estimates, and the corresponding observed values through a given detector, a curve-fitting procedure is used to estimate the individual detector response parameters. These can then be used to correct for detector nonuniformity. The strength of the algorithm lies in its simplicity and low computational complexity. Experimental results, to illustrate the performance of the algorithm, include the use of visible-range imagery with simulated nonuniformity and infrared imagery with real nonuniformity.
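
    The steps named in the abstract above (average the registered detector outputs that see the same scene value, then fit each detector's response against those scene estimates) can be sketched in a few lines. This is only an illustrative toy, not the authors' implementation: it assumes a 1-D detector array, a noise-free linear response (gain and offset per detector), and perfectly known integer circular shifts in place of a real registration step. All function and variable names are hypothetical.

```python
import numpy as np

def estimate_scene(frames, shifts, scene_len):
    """Average all detector outputs that observed the same scene sample."""
    n_det = frames.shape[1]
    total = np.zeros(scene_len)
    count = np.zeros(scene_len)
    for frame, shift in zip(frames, shifts):
        idx = (np.arange(n_det) + shift) % scene_len  # scene sample seen by each detector
        np.add.at(total, idx, frame)
        np.add.at(count, idx, 1)
    return total / count

def fit_detectors(frames, shifts, scene_est):
    """Fit observed = gain * scene_estimate + offset for each detector (straight-line fit)."""
    n_det = frames.shape[1]
    scene_len = scene_est.size
    shifts = np.asarray(shifts)
    gains = np.empty(n_det)
    offsets = np.empty(n_det)
    for i in range(n_det):
        x = scene_est[(i + shifts) % scene_len]  # scene estimates this detector looked at
        y = frames[:, i]                         # what this detector reported
        gains[i], offsets[i] = np.polyfit(x, y, 1)
    return gains, offsets

def correct(frame, gains, offsets):
    """Invert the fitted per-detector response."""
    return (frame - offsets) / gains

# Toy data (assumed, not from the paper): 16 detectors, circular 1-D scene,
# one frame per integer shift so every detector sees every scene sample once.
rng = np.random.default_rng(0)
n_det = 16
scene = rng.uniform(50.0, 200.0, n_det)
true_gain = 1.0 + 0.1 * rng.standard_normal(n_det)
true_offset = 5.0 * rng.standard_normal(n_det)
shifts = np.arange(n_det)
frames = np.stack([true_gain * np.roll(scene, -s) + true_offset for s in shifts])

scene_est = estimate_scene(frames, shifts, n_det)
gains, offsets = fit_detectors(frames, shifts, scene_est)

# A uniform (flat-field) input should look uniform after correction.
flat = true_gain * 100.0 + true_offset
corrected = correct(flat, gains, offsets)
print("raw spread:", np.std(flat), "corrected spread:", np.std(corrected))
```

    In this idealized setting the corrected output is uniform across detectors, but only up to a global gain and offset, because the averaged scene estimate carries the mean gain and mean offset of the array. With real data, registration error and temporal noise would limit how well this works.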

  12. DNA microarrays for identifying fishes.

    PubMed

    Kochzius, M; Nölte, M; Weber, H; Silkenbeumer, N; Hjörleifsdottir, S; Hreggvidsson, G O; Marteinsson, V; Kappel, K; Planes, S; Tinti, F; Magoulas, A; Garcia Vazquez, E; Turan, C; Hervet, C; Campo Falgueras, D; Antoniou, A; Landi, M; Blohm, D

    2008-01-01

    In many cases marine organisms and especially their diverse developmental stages are difficult to identify by morphological characters. DNA-based identification methods offer an analytically powerful addition or even an alternative. In this study, a DNA microarray has been developed to be able to investigate its potential as a tool for the identification of fish species from European seas based on mitochondrial 16S rDNA sequences. Eleven commercially important fish species were selected for a first prototype. Oligonucleotide probes were designed based on the 16S rDNA sequences obtained from 230 individuals of 27 fish species. In addition, more than 1200 sequences of 380 species served as sequence background against which the specificity of the probes was tested in silico. Single target hybridisations with Cy5-labelled, PCR-amplified 16S rDNA fragments from each of the 11 species on microarrays containing the complete set of probes confirmed their suitability. True-positive, fluorescence signals obtained were at least one order of magnitude stronger than false-positive cross-hybridisations. Single nontarget hybridisations resulted in cross-hybridisation signals at approximately 27% of the cases tested, but all of them were at least one order of magnitude lower than true-positive signals. This study demonstrates that the 16S rDNA gene is suitable for designing oligonucleotide probes, which can be used to differentiate 11 fish species. These data are a solid basis for the second step to create a "Fish Chip" for approximately 50 fish species relevant in marine environmental and fisheries research, as well as control of fisheries products.

  13. Extreme assay sensitivity in molecular diagnostics further unveils intratumour heterogeneity in metastatic colorectal cancer as well as artifactual low-frequency mutations in the KRAS gene.

    PubMed

    Mariani, Sara; Bertero, Luca; Osella-Abate, Simona; Di Bello, Cristiana; Francia di Celle, Paola; Coppola, Vittoria; Sapino, Anna; Cassoni, Paola; Marchiò, Caterina

    2017-07-25

    Gene mutations in the RAS family rule out metastatic colorectal carcinomas (mCRCs) from anti-EGFR therapies. We report a retrospective analysis by Sequenom Massarray and fast COLD-PCR followed by Sanger sequencing on 240 mCRCs. By Sequenom, KRAS and NRAS exons 2-3-4 were mutated in 52.9% (127/240) of tumours, while BRAF codon 600 mutations reached 5% (12/240). Fast COLD-PCR found extra mutations at KRAS exon 2 in 15/166 (9%) of samples, previously diagnosed by Sequenom as wild-type or mutated at RAS (exons 3-4) or BRAF genes. After UDG digestion results were reproduced in 2/12 analysable subclonally mutated samples leading to a frequency of true subclonal KRAS mutations of 1.2% (2.1% of the previous Sequenom wild-type subgroup). In 10 out of 12 samples, the subclonal KRAS mutations disappeared (9 out of 12) or turned to a different sequence variant (1 out of 12). mCRC can harbour coexisting multiple gene mutations. High sensitivity assays allow the detection of a small subset of patients harbouring true subclonal KRAS mutations. However, DNA changes with mutant allele frequencies <3% detected in formalin-fixed paraffin-embedded samples may be artifactual in a non-negligible fraction of cases. UDG pre-treatment of DNA is mandatory to identify true DNA changes in archival samples and avoid misinterpretation due to artifacts.

  14. Extreme assay sensitivity in molecular diagnostics further unveils intratumour heterogeneity in metastatic colorectal cancer as well as artifactual low-frequency mutations in the KRAS gene

    PubMed Central

    Mariani, Sara; Bertero, Luca; Osella-Abate, Simona; Di Bello, Cristiana; Francia di Celle, Paola; Coppola, Vittoria; Sapino, Anna; Cassoni, Paola; Marchiò, Caterina

    2017-01-01

    Background: Gene mutations in the RAS family rule out metastatic colorectal carcinomas (mCRCs) from anti-EGFR therapies. Methods: We report a retrospective analysis by Sequenom Massarray and fast COLD-PCR followed by Sanger sequencing on 240 mCRCs. Results: By Sequenom, KRAS and NRAS exons 2-3-4 were mutated in 52.9% (127/240) of tumours, while BRAF codon 600 mutations reached 5% (12/240). Fast COLD-PCR found extra mutations at KRAS exon 2 in 15/166 (9%) of samples, previously diagnosed by Sequenom as wild-type or mutated at RAS (exons 3-4) or BRAF genes. After UDG digestion results were reproduced in 2/12 analysable subclonally mutated samples leading to a frequency of true subclonal KRAS mutations of 1.2% (2.1% of the previous Sequenom wild-type subgroup). In 10 out of 12 samples, the subclonal KRAS mutations disappeared (9 out of 12) or turned to a different sequence variant (1 out of 12). Conclusions: mCRC can harbour coexisting multiple gene mutations. High sensitivity assays allow the detection of a small subset of patients harbouring true subclonal KRAS mutations. However, DNA changes with mutant allele frequencies <3% detected in formalin-fixed paraffin-embedded samples may be artifactual in a non-negligible fraction of cases. UDG pre-treatment of DNA is mandatory to identify true DNA changes in archival samples and avoid misinterpretation due to artifacts. PMID:28618430

  15. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Li, Heng, E-mail: hengli@mdanderson.org; Zhu, X. Ronald; Zhang, Xiaodong

    Purpose: To develop and validate a novel delivery strategy for reducing the respiratory motion–induced dose uncertainty of spot-scanning proton therapy. Methods and Materials: The spot delivery sequence was optimized to reduce dose uncertainty. The effectiveness of the delivery sequence optimization was evaluated using measurements and patient simulation. One hundred ninety-one 2-dimensional measurements using different delivery sequences of a single-layer uniform pattern were obtained with a detector array on a 1-dimensional moving platform. Intensity modulated proton therapy plans were generated for 10 lung cancer patients, and dose uncertainties for different delivery sequences were evaluated by simulation. Results: Without delivery sequence optimization, the maximum absolute dose error can be up to 97.2% in a single measurement, whereas the optimized delivery sequence results in a maximum absolute dose error of ≤11.8%. In patient simulation, the optimized delivery sequence reduces the mean of fractional maximum absolute dose error compared with the regular delivery sequence by 3.3% to 10.6% (32.5-68.0% relative reduction) for different patients. Conclusions: Optimizing the delivery sequence can reduce dose uncertainty due to respiratory motion in spot-scanning proton therapy, assuming the 4-dimensional CT is a true representation of the patients' breathing patterns.

  16. A Partial Least Squares Based Procedure for Upstream Sequence Classification in Prokaryotes.

    PubMed

    Mehmood, Tahir; Bohlin, Jon; Snipen, Lars

    2015-01-01

    The upstream region of coding genes is important for several reasons, for instance locating transcription factor binding sites and start site initiation in genomic DNA. Motivated by a recently conducted study, where a multivariate approach was successfully applied to coding sequence modeling, we have introduced a partial least squares (PLS) based procedure for the classification of true upstream prokaryotic sequence from background upstream sequence. The upstream sequences of conserved coding genes over genomes were considered in the analysis, where conserved coding genes were found by using the pan-genomics concept for each considered prokaryotic species. PLS uses a position specific scoring matrix (PSSM) to study the characteristics of the upstream region. Results obtained by the PLS based method were compared with Gini importance of random forest (RF) and support vector machine (SVM), which is a widely used method for sequence classification. The upstream sequence classification performance was evaluated by using cross validation, and the suggested approach identifies the prokaryotic upstream region significantly better than RF (p-value < 0.01) and SVM (p-value < 0.01). Further, the proposed method also produced results that concurred with known biological characteristics of the upstream region.

  17. Proteins of unknown function in the Protein Data Bank (PDB): an inventory of true uncharacterized proteins and computational tools for their analysis.

    PubMed

    Nadzirin, Nurul; Firdaus-Raih, Mohd

    2012-10-08

    Proteins of uncharacterized functions form a large part of many of the currently available biological databases and this situation exists even in the Protein Data Bank (PDB). Our analysis of recent PDB data revealed that only 42.53% of PDB entries (1084 coordinate files) that were categorized under "unknown function" are true examples of proteins of unknown function at this point in time. The remaining 1465 entries also annotated as such appear to be able to have their annotations re-assessed, based on the availability of direct functional characterization experiments for the protein itself, or for homologous sequences or structures, thus enabling computational function inference.

  18. A nuclear phylogenetic analysis: SNPs, indels and SSRs deliver new insights into the relationships in the ‘true citrus fruit trees’ group (Citrinae, Rutaceae) and the origin of cultivated species

    PubMed Central

    Garcia-Lor, Andres; Curk, Franck; Snoussi-Trifa, Hager; Morillon, Raphael; Ancillo, Gema; Luro, François; Navarro, Luis; Ollitrault, Patrick

    2013-01-01

    Background and Aims: Despite differences in morphology, the genera representing ‘true citrus fruit trees’ are sexually compatible, and their phylogenetic relationships remain unclear. Most of the important commercial ‘species’ of Citrus are believed to be of interspecific origin. By studying polymorphisms of 27 nuclear genes, the average molecular differentiation between species was estimated and some phylogenetic relationships between ‘true citrus fruit trees’ were clarified. Methods: Sanger sequencing of PCR-amplified fragments from 18 genes involved in metabolite biosynthesis pathways and nine putative genes for salt tolerance was performed for 45 genotypes of Citrus and relatives of Citrus to mine single nucleotide polymorphisms (SNPs) and indel polymorphisms. Fifty nuclear simple sequence repeats (SSRs) were also analysed. Key Results: A total of 16 238 kb of DNA was sequenced for each genotype, and 1097 single nucleotide polymorphisms (SNPs) and 50 indels were identified. These polymorphisms were more valuable than SSRs for inter-taxon differentiation. Nuclear phylogenetic analysis revealed that Citrus reticulata and Fortunella form a cluster that is differentiated from the clade that includes three other basic taxa of cultivated citrus (C. maxima, C. medica and C. micrantha). These results confirm the taxonomic subdivision between the subgenera Metacitrus and Archicitrus. A few genes displayed positive selection patterns within or between species, but most of them displayed neutral patterns. The phylogenetic inheritance patterns of the analysed genes were inferred for commercial Citrus spp. Conclusions: Numerous molecular polymorphisms (SNPs and indels), which are potentially useful for the analysis of interspecific genetic structures, have been identified. The nuclear phylogenetic network for Citrus and its sexually compatible relatives was consistent with the geographical origins of these genera. The positive selection observed for a few genes will help further works to analyse the molecular basis of the variability of the associated traits. This study presents new insights into the origin of C. sinensis. PMID:23104641

  19. A nuclear phylogenetic analysis: SNPs, indels and SSRs deliver new insights into the relationships in the 'true citrus fruit trees' group (Citrinae, Rutaceae) and the origin of cultivated species.

    PubMed

    Garcia-Lor, Andres; Curk, Franck; Snoussi-Trifa, Hager; Morillon, Raphael; Ancillo, Gema; Luro, François; Navarro, Luis; Ollitrault, Patrick

    2013-01-01

    Despite differences in morphology, the genera representing 'true citrus fruit trees' are sexually compatible, and their phylogenetic relationships remain unclear. Most of the important commercial 'species' of Citrus are believed to be of interspecific origin. By studying polymorphisms of 27 nuclear genes, the average molecular differentiation between species was estimated and some phylogenetic relationships between 'true citrus fruit trees' were clarified. Sanger sequencing of PCR-amplified fragments from 18 genes involved in metabolite biosynthesis pathways and nine putative genes for salt tolerance was performed for 45 genotypes of Citrus and relatives of Citrus to mine single nucleotide polymorphisms (SNPs) and indel polymorphisms. Fifty nuclear simple sequence repeats (SSRs) were also analysed. A total of 16 238 kb of DNA was sequenced for each genotype, and 1097 single nucleotide polymorphisms (SNPs) and 50 indels were identified. These polymorphisms were more valuable than SSRs for inter-taxon differentiation. Nuclear phylogenetic analysis revealed that Citrus reticulata and Fortunella form a cluster that is differentiated from the clade that includes three other basic taxa of cultivated citrus (C. maxima, C. medica and C. micrantha). These results confirm the taxonomic subdivision between the subgenera Metacitrus and Archicitrus. A few genes displayed positive selection patterns within or between species, but most of them displayed neutral patterns. The phylogenetic inheritance patterns of the analysed genes were inferred for commercial Citrus spp. Numerous molecular polymorphisms (SNPs and indels), which are potentially useful for the analysis of interspecific genetic structures, have been identified. The nuclear phylogenetic network for Citrus and its sexually compatible relatives was consistent with the geographical origins of these genera. The positive selection observed for a few genes will help further works to analyse the molecular basis of the variability of the associated traits. This study presents new insights into the origin of C. sinensis.

  20. Mass loss from pre-main-sequence accretion disks. I - The accelerating wind of FU Orionis

    NASA Technical Reports Server (NTRS)

    Calvet, Nuria; Hartmann, Lee; Kenyon, Scott J.

    1993-01-01

    We present evidence that the wind of the pre-main-sequence object FU Orionis arises from the surface of the luminous accretion disk. A disk wind model calculated assuming radiative equilibrium explains the differential behavior of the observed asymmetric absorption-line profiles. The model predicts that strong lines should be asymmetric and blueshifted, while weak lines should be symmetric and double-peaked due to disk rotation, in agreement with observations. We propose that many blueshifted 'shell' absorption features are not produced in a true shell of material, but rather form in a differentially expanding wind that is rapidly rotating. The inference of rapid rotation supports the proposal that pre-main-sequence disk winds are rotationally driven.

  1. Genomics-inspired discovery of natural products.

    PubMed

    Winter, Jaclyn M; Behnken, Swantje; Hertweck, Christian

    2011-02-01

    The massive surge in genome sequencing projects has opened our eyes to the overlooked biosynthetic potential and metabolic diversity of microorganisms. While traditional approaches have been successful at identifying many useful therapeutic agents from these organisms, new tactics are needed in order to exploit their true biosynthetic potential. Several genomics-inspired strategies have been successful in unveiling new metabolites that were overlooked under standard fermentation and detection conditions. In addition, genome sequences have given us valuable insight for genetically engineering biosynthesis gene clusters that remain silent or are poorly expressed in the absence of a specific trigger. As more genome sequences are becoming available, we are noticing the emergence of underexplored or neglected organisms as alternative resources for new therapeutic agents. Copyright © 2010 Elsevier Ltd. All rights reserved.

  2. Gender Identification in Date Palm Using Molecular Markers.

    PubMed

    Awan, Faisal Saeed; Maryam; Jaskani, Muhammad J; Sadia, Bushra

    2017-01-01

    Breeding of date palm is complicated because of its long life cycle and heterozygous nature. Sexual propagation of date palm does not produce true-to-type plants. Sex of date palms cannot be identified until the first flowering stage. Molecular markers such as random amplified polymorphic DNA (RAPD), sequence-characterized amplified regions (SCAR), and simple sequence repeats (SSR) have successfully been used to identify the sex-linked loci in the plant genome and to isolate the corresponding genes. This chapter highlights the use of three molecular markers including RAPD, SCAR, and SSR to identify the gender of date palm seedlings.

  3. Speciation and Neutral Molecular Evolution in One-Dimensional Closed Population

    NASA Astrophysics Data System (ADS)

    Semovski, Sergei V.; Bukin, Yuri S.; Sherbakov, Dmitry Yu.

    Models are presented suitable for a description of speciation processes arising due to reproductive isolation depending on genetic distance. The main attention is paid to the model of a one-dimensional closed population, which describes the evolution of littoral benthic organisms. In order to correspond the modeling results to the results obtained in the course of experimental phylogenetic studies, all individual-based models described here involve neutrally evolving and maternally inherited DNA sequence. Sub-samples of the resulting sequences were used for a posteriori phylogenetic inferences which then were compared to the "true" evolutionary histories.

  4. Protein Kinase Classification with 2866 Hidden Markov Models and One Support Vector Machine

    NASA Technical Reports Server (NTRS)

    Weber, Ryan; New, Michael H.; Fonda, Mark (Technical Monitor)

    2002-01-01

    The main application considered in this paper is predicting true kinases from randomly permuted kinases that share the same length and amino acid distributions as the true kinases. Numerous methods already exist for this classification task, such as HMMs, motif-matchers, and sequence comparison algorithms. We build on some of these efforts by creating a vector from the output of thousands of structurally based HMMs, created offline with Pfam-A seed alignments using SAM-T99, which then must be combined into an overall classification for the protein. Then we use a Support Vector Machine for classifying this large ensemble Pfam-Vector, with a polynomial and chi-squared kernel. In particular, the chi-squared kernel SVM performs better in some respects than the HMMs and the BLAST pairwise comparisons when predicting true from false kinases, but no one algorithm is best for all purposes or in all instances, so we consider the particular strengths and weaknesses of each.

  5. A fossil protein chimera; difficulties in discriminating dinosaur peptide sequences from modern cross-contamination.

    PubMed

    Buckley, Michael; Warwood, Stacey; van Dongen, Bart; Kitchener, Andrew C; Manning, Phillip L

    2017-05-31

    A decade ago, reports that organic-rich soft tissue survived from dinosaur fossils were apparently supported by proteomics-derived sequence information of exceptionally well-preserved bone. This initial claim to the sequencing of endogenous collagen peptides from an approximately 68 Myr Tyrannosaurus rex fossil was highly controversial, largely on the grounds of potential contamination from either bacterial biofilms or from laboratory practice. In a subsequent study, collagen peptide sequences from an approximately 78 Myr Brachylophosaurus canadensis fossil were reported that have remained largely unchallenged. However, the endogeneity of these sequences relies heavily on a single peptide sequence, apparently unique to both dinosaurs. Given the potential for cross-contamination from modern bone analysed by the same team, here we extract collagen from bone samples of three individuals of ostrich, Struthio camelus. The resulting LC-MS/MS data were found to match all of the proposed sequences for both the original Tyrannosaurus and Brachylophosaurus studies. Regardless of the true nature of the dinosaur peptides, our finding highlights the difficulty of differentiating such sequences with confidence. Our results not only imply that cross-contamination cannot be ruled out, but that appropriate measures to test for endogeneity should be further evaluated. © 2017 The Authors.

  6. A fossil protein chimera; difficulties in discriminating dinosaur peptide sequences from modern cross-contamination

    PubMed Central

    Warwood, Stacey; van Dongen, Bart; Kitchener, Andrew C.; Manning, Phillip L.

    2017-01-01

    A decade ago, reports that organic-rich soft tissue survived from dinosaur fossils were apparently supported by proteomics-derived sequence information of exceptionally well-preserved bone. This initial claim to the sequencing of endogenous collagen peptides from an approximately 68 Myr Tyrannosaurus rex fossil was highly controversial, largely on the grounds of potential contamination from either bacterial biofilms or from laboratory practice. In a subsequent study, collagen peptide sequences from an approximately 78 Myr Brachylophosaurus canadensis fossil were reported that have remained largely unchallenged. However, the endogeneity of these sequences relies heavily on a single peptide sequence, apparently unique to both dinosaurs. Given the potential for cross-contamination from modern bone analysed by the same team, here we extract collagen from bone samples of three individuals of ostrich, Struthio camelus. The resulting LC–MS/MS data were found to match all of the proposed sequences for both the original Tyrannosaurus and Brachylophosaurus studies. Regardless of the true nature of the dinosaur peptides, our finding highlights the difficulty of differentiating such sequences with confidence. Our results not only imply that cross-contamination cannot be ruled out, but that appropriate measures to test for endogeneity should be further evaluated. PMID:28566488

  7. College Bound in Middle School & High School? How Math Course Sequences Matter

    ERIC Educational Resources Information Center

    Finkelstein, Neal; Fong, Anthony; Tiffany-Morales, Juliet; Shields, Patrick; Huang, Min

    2012-01-01

    As California competes for jobs in an increasingly competitive global economy, the state faces a looming shortage of highly educated workers (PPIC, 2012). For a variety of reasons, the need for individuals with degrees in science, technology, engineering, and mathematics (STEM) is of particular concern. Nowhere is this more true than in the…

  8. Spreadsheet Simulation of the Law of Large Numbers

    ERIC Educational Resources Information Center

    Boger, George

    2005-01-01

    If larger and larger samples are successively drawn from a population and a running average calculated after each sample has been drawn, the sequence of averages will converge to the mean, [mu], of the population. This remarkable fact, known as the law of large numbers, holds true if samples are drawn from a population of discrete or continuous…
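    The convergence described above can be sketched in a few lines of Python rather than a spreadsheet. This is only an illustrative sketch under stated assumptions: the population is assumed to be a fair six-sided die (so mu = 3.5), each "sample" is a single roll, and the function names are hypothetical rather than taken from the article.

```python
import random


def running_averages(draws):
    """Return the running average after each successive draw."""
    averages = []
    total = 0.0
    for n, value in enumerate(draws, start=1):
        total += value
        averages.append(total / n)
    return averages


def simulate_die_rolls(num_rolls, seed=0):
    """Simulate rolls of a fair six-sided die (assumed population, mu = 3.5)."""
    rng = random.Random(seed)
    return [rng.randint(1, 6) for _ in range(num_rolls)]


if __name__ == "__main__":
    averages = running_averages(simulate_die_rolls(100_000))
    # The running average should drift toward 3.5 as n grows.
    for n in (10, 100, 1_000, 10_000, 100_000):
        print(n, round(averages[n - 1], 4))
```

    Plotting the running averages against n would reproduce the kind of chart a spreadsheet simulation typically shows.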

  9. A Learning Cycle Approach to Dealing with Pseudoscience Beliefs of Prospective Elementary Teachers.

    ERIC Educational Resources Information Center

    Rosenthal, Dorothy B.

    1993-01-01

    Describes a lesson on pseudoscience for a teaching methods course that promotes active student participation, is not a laboratory activity, and follows the sequence of the three phases associated with the learning cycle model. Contains a true-false science questionnaire to be administered to students as a bridge to discussion. (PR)

  10. Relationships in subtribe Diocleinae (Leguminosae; Papilionoideae) inferred from internal transcribed spacer sequences from nuclear ribosomal DNA.

    PubMed

    Varela, Eduardo S; Lima, João P M S; Galdino, Alexsandro S; Pinto, Luciano da S; Bezerra, Walderly M; Nunes, Edson P; Alves, Maria A O; Grangeiro, Thalles B

    2004-01-01

    The complete sequences of nuclear ribosomal DNA (nrDNA) internal transcribed spacer regions (ITS/5.8S) were determined for species belonging to six genera from the subtribe Diocleinae as well as for the anomalous genera Calopogonium and Pachyrhizus. Phylogenetic trees constructed by distance matrix, maximum parsimony and maximum likelihood methods showed that Calopogonium and Pachyrhizus were outside the clade Diocleinae (Canavalia, Camptosema, Cratylia, Dioclea, Cymbosema, and Galactia). This finding supports previous morphological, phytochemical, and molecular evidence that Calopogonium and Pachyrhizus do not belong to the subtribe Diocleinae. Within the true Diocleinae clade, the clustering of genera and species were congruent with morphology-based classifications, suggesting that ITS/5.8S sequences can provide enough informative sites to allow resolution below the genus level. This is the first evidence of the phylogeny of subtribe Diocleinae based on nuclear DNA sequences.

  11. Helicos BioSciences.

    PubMed

    Milos, Patrice

    2008-04-01

    Helicos BioSciences Corporation is a life sciences company developing revolutionary new single molecule sequencing technology to provide the path to the US$1000 genome. True Single Molecule Sequencing (tSMS) will drive advancements in pharmacogenomics that can enable a better understanding of an individual's susceptibility to disease, develop more effective disease diagnoses and differentiate response to disease therapies. During 2007, genome-wide disease-association studies, the Encyclopedia of DNA Elements (ENCODE) and the published genome sequence of two individuals have revealed human genome variation far more extensive than originally believed. These also demonstrated that common variations explain only a fraction of the genetic basis of disease. Therefore, the capability to understand an individual genome is critical in setting the foundation for the next great revolution in healthcare. Helicos is committed to this vision and will provide cost-effective genome sequencing and comprehensive analysis of the transcribed genome that can unlock the era of personalized healthcare.

  12. Low incidence of DNA sequence variation in human induced pluripotent stem cells generated by non-integrating plasmid expression

    PubMed Central

    Cheng, Linzhao; Hansen, Nancy F.; Zhao, Ling; Du, Yutao; Zou, Chunlin; Donovan, Frank X.; Chou, Bin-Kuan; Zhou, Guangyu; Li, Shijie; Dowey, Sarah N.; Ye, Zhaohui; Chandrasekharappa, Settara C.; Yang, Huanming; Mullikin, James C.; Liu, P. Paul

    2012-01-01

    Summary The utility of induced pluripotent stem cells (iPSCs) as models to study diseases and as sources for cell therapy depends on the integrity of their genomes. Despite recent publications of DNA sequence variations in the iPSCs, the true scope of such changes for the entire genome is not clear. Here we report the whole-genome sequencing of three human iPSC lines derived from two cell types of an adult donor by episomal vectors. The vector sequence was undetectable in the deeply sequenced iPSC lines. We identified 1058–1808 heterozygous single nucleotide variants (SNVs), but no copy number variants, in each iPSC line. Six to twelve of these SNVs were within coding regions in each iPSC line, but ~50% of them are synonymous changes and the remaining are not selectively enriched for known genes associated with cancers. Our data thus suggest that episome-mediated reprogramming is not inherently mutagenic during integration-free iPSC induction. PMID:22385660

  13. Cloud-based adaptive exon prediction for DNA analysis.

    PubMed

    Putluri, Srinivasareddy; Zia Ur Rahman, Md; Fathima, Shaik Yasmeen

    2018-02-01

    Cloud computing offers significant research and economic benefits to healthcare organisations. Cloud services provide a safe place for storing and managing large amounts of such sensitive data. Under the conventional flow of gene information, gene sequence laboratories send out raw and inferred information via the Internet to several sequence libraries. DNA sequencing storage costs will be minimised by use of a cloud service. In this study, the authors put forward a novel genomic informatics system using Amazon Cloud Services, where genomic sequence information is stored and accessed for processing. True identification of exon regions in a DNA sequence is a key task in bioinformatics, which helps in disease identification and drug design. The three-base periodicity property of exons forms the basis of all exon identification techniques. Adaptive signal processing techniques are found to be promising in comparison with several other methods. Several adaptive exon predictors (AEPs) are developed using variable normalised least mean square and its maximum normalised variants to reduce computational complexity. Finally, performance evaluation of the various AEPs is done based on measures such as sensitivity, specificity and precision using various standard genomic datasets taken from the National Center for Biotechnology Information genomic sequence database.
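    The three-base periodicity mentioned above can be illustrated with a classical Fourier measure. The sketch below is not the adaptive (normalised least mean square) predictor developed in the study; it is a simpler, swapped-in technique that sums the spectral power at frequency 1/3 over the four base indicator sequences. The function name and the normalisation by sequence length are assumptions made for illustration.

```python
import cmath


def period3_power(dna):
    """Spectral power at frequency 1/3, summed over the A, C, G, T indicator sequences.

    The result is divided by the sequence length (an illustrative normalisation).
    """
    dna = dna.upper()
    n = len(dna)
    if n == 0:
        return 0.0
    power = 0.0
    for base in "ACGT":
        # Fourier coefficient at frequency 1/3 of the binary indicator
        # sequence that is 1 where `base` occurs and 0 elsewhere.
        coeff = sum(
            cmath.exp(-2j * cmath.pi * i / 3)
            for i, b in enumerate(dna)
            if b == base
        )
        power += abs(coeff) ** 2
    return power / n


if __name__ == "__main__":
    print(period3_power("ATG" * 30))   # strongly codon-periodic toy sequence
    print(period3_power("ACGT" * 24))  # period-4 toy sequence, no period-3 signal
```

    In practice such a measure is usually evaluated in a sliding window along the sequence, with peaks taken as candidate exon regions.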

  14. Estimating mutation parameters, population history and genealogy simultaneously from temporally spaced sequence data.

    PubMed Central

    Drummond, Alexei J; Nicholls, Geoff K; Rodrigo, Allen G; Solomon, Wiremu

    2002-01-01

    Molecular sequences obtained at different sampling times from populations of rapidly evolving pathogens and from ancient subfossil and fossil sources are increasingly available with modern sequencing technology. Here, we present a Bayesian statistical inference approach to the joint estimation of mutation rate and population size that incorporates the uncertainty in the genealogy of such temporally spaced sequences by using Markov chain Monte Carlo (MCMC) integration. The Kingman coalescent model is used to describe the time structure of the ancestral tree. We recover information about the unknown true ancestral coalescent tree, population size, and the overall mutation rate from temporally spaced data, that is, from nucleotide sequences gathered at different times, from different individuals, in an evolving haploid population. We briefly discuss the methodological implications and show what can be inferred, in various practically relevant states of prior knowledge. We develop extensions for exponentially growing population size and joint estimation of substitution model parameters. We illustrate some of the important features of this approach on a genealogy of HIV-1 envelope (env) partial sequences. PMID:12136032

  15. Estimating mutation parameters, population history and genealogy simultaneously from temporally spaced sequence data.

    PubMed

    Drummond, Alexei J; Nicholls, Geoff K; Rodrigo, Allen G; Solomon, Wiremu

    2002-07-01

    Molecular sequences obtained at different sampling times from populations of rapidly evolving pathogens and from ancient subfossil and fossil sources are increasingly available with modern sequencing technology. Here, we present a Bayesian statistical inference approach to the joint estimation of mutation rate and population size that incorporates the uncertainty in the genealogy of such temporally spaced sequences by using Markov chain Monte Carlo (MCMC) integration. The Kingman coalescent model is used to describe the time structure of the ancestral tree. We recover information about the unknown true ancestral coalescent tree, population size, and the overall mutation rate from temporally spaced data, that is, from nucleotide sequences gathered at different times, from different individuals, in an evolving haploid population. We briefly discuss the methodological implications and show what can be inferred, in various practically relevant states of prior knowledge. We develop extensions for exponentially growing population size and joint estimation of substitution model parameters. We illustrate some of the important features of this approach on a genealogy of HIV-1 envelope (env) partial sequences.

  16. Benchmark Evaluation of True Single Molecular Sequencing to Determine Cystic Fibrosis Airway Microbiome Diversity.

    PubMed

    Hahn, Andrea; Bendall, Matthew L; Gibson, Keylie M; Chaney, Hollis; Sami, Iman; Perez, Geovanny F; Koumbourlis, Anastassios C; McCaffrey, Timothy A; Freishtat, Robert J; Crandall, Keith A

    2018-01-01

    Cystic fibrosis (CF) is an autosomal recessive disease associated with recurrent lung infections that can lead to morbidity and mortality. The impact of antibiotics for treatment of acute pulmonary exacerbations on the CF airway microbiome remains unclear with prior studies giving conflicting results and being limited by their use of 16S ribosomal RNA sequencing. Our primary objective was to validate the use of true single molecular sequencing (tSMS) and PathoScope in the analysis of the CF airway microbiome. Three control samples were created with differing amounts of Burkholderia cepacia, Pseudomonas aeruginosa, and Prevotella melaninogenica, three common bacteria found in cystic fibrosis lungs. Paired sputa were also obtained from three study participants with CF before and >6 days after initiation of antibiotics. Antibiotic resistant B. cepacia and P. aeruginosa were identified in concurrently obtained respiratory cultures. Direct sequencing was performed using tSMS, and filtered reads were aligned to reference genomes from NCBI using PathoScope and Kraken and unique clade-specific marker genes using MetaPhlAn. A total of 180-518 K of 6-12 million filtered reads were aligned for each sample. Detection of known pathogens in control samples was most successful using PathoScope. In the CF sputa, alpha diversity measures varied based on the alignment method used, but similar trends were found between pre- and post-antibiotic samples. PathoScope outperformed Kraken and MetaPhlAn in our validation study of artificial bacterial community controls and also has advantages over Kraken and MetaPhlAn of being able to determine bacterial strains and the presence of fungal organisms. PathoScope can be confidently used when evaluating metagenomic data to determine CF airway microbiome diversity.

  17. Structurally complex and highly active RNA ligases derived from random RNA sequences

    NASA Technical Reports Server (NTRS)

    Ekland, E. H.; Szostak, J. W.; Bartel, D. P.

    1995-01-01

    Seven families of RNA ligases, previously isolated from random RNA sequences, fall into three classes on the basis of secondary structure and regiospecificity of ligation. Two of the three classes of ribozymes have been engineered to act as true enzymes, catalyzing the multiple-turnover transformation of substrates into products. The most complex of these ribozymes has a minimal catalytic domain of 93 nucleotides. An optimized version of this ribozyme has a kcat exceeding one per second, a value far greater than that of most natural RNA catalysts and approaching that of comparable protein enzymes. The fact that such a large and complex ligase emerged from a very limited sampling of sequence space implies the existence of a large number of distinct RNA structures of equivalent complexity and activity.

  18. Existence of a True Phosphofructokinase in Bacillus sphaericus: Cloning and Sequencing of the pfk Gene

    PubMed Central

    Alice, Alejandro F.; Pérez-Martínez, Gaspar; Sánchez-Rivas, Carmen

    2002-01-01

    Some strains of Bacillus sphaericus are entomopathogenic to mosquito larvae, which transmit diseases, such as filariasis and malaria, affecting millions of people worldwide. This species is unable to use hexoses and pentoses as unique carbon sources, which was proposed to be due to the lack of glycolytic enzymes, such as 6-phosphofructokinase (PFK). In this study, PFK activity was detected and the pfk gene was cloned and sequenced. Furthermore, this gene was shown to be present in strains belonging to all the homology groups of this heterogeneous species, in which PFK activity was also detected. A careful sequence analysis revealed the conservation of different catalytic and regulatory residues, as well as the enzyme's phylogenetic affiliation with the family of allosteric ATP-PFK enzymes. PMID:12450869

  19. Detecting SNPs and estimating allele frequencies in clonal bacterial populations by sequencing pooled DNA.

    PubMed

    Holt, Kathryn E; Teo, Yik Y; Li, Heng; Nair, Satheesh; Dougan, Gordon; Wain, John; Parkhill, Julian

    2009-08-15

    Here, we present a method for estimating the frequencies of SNP alleles present within pooled samples of DNA using high-throughput short-read sequencing. The method was tested on real data from six strains of the highly monomorphic pathogen Salmonella Paratyphi A, sequenced individually and in a pool. A variety of read mapping and quality-weighting procedures were tested to determine the optimal parameters, which afforded ≥80% sensitivity of SNP detection and strong correlation with true SNP frequency at poolwide read depth of 40x, declining only slightly at read depths 20-40x. The method was implemented in Perl and relies on the open-source software Maq for read mapping and SNP calling. The Perl script is freely available from ftp://ftp.sanger.ac.uk/pub/pathogens/pools/.
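    As a rough illustration of quality-weighted allele frequency estimation from pooled reads, the Python sketch below weights each base call at a site by the probability that it is correct, derived from its Phred score. This is an assumed weighting scheme chosen for illustration; it is not the authors' Perl implementation and does not use Maq, and the function name and input layout are hypothetical.

```python
def weighted_allele_frequency(observations, alt_base):
    """Estimate the frequency of `alt_base` at one site of a pooled sample.

    `observations` is a list of (base, phred_quality) pairs, one per read
    covering the site. Each call is weighted by 1 - 10**(-Q/10), the
    probability that the base call is correct under the Phred definition.
    """
    weight_total = 0.0
    weight_alt = 0.0
    for base, phred in observations:
        weight = 1.0 - 10 ** (-phred / 10.0)
        weight_total += weight
        if base == alt_base:
            weight_alt += weight
    return weight_alt / weight_total if weight_total else 0.0


if __name__ == "__main__":
    # 40 reads at one site: 10 carry the alternative allele, all at Q30.
    site = [("G", 30)] * 10 + [("A", 30)] * 30
    print(weighted_allele_frequency(site, "G"))
```

    With uniform qualities the estimate reduces to the plain read fraction; low-quality calls contribute less than their raw count would suggest.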

  20. Evaluation of exome variants using the Ion Proton Platform to sequence error-prone regions.

    PubMed

    Seo, Heewon; Park, Yoomi; Min, Byung Joo; Seo, Myung Eui; Kim, Ju Han

    2017-01-01

    The Ion Proton sequencer from Thermo Fisher accurately determines sequence variants from target regions with a rapid turnaround time at a low cost. However, misleading variant-calling errors can occur. We performed a systematic evaluation and manual curation of read-level alignments for the 675 ultrarare variants that were reported by the Ion Proton sequencer from 27 whole-exome sequencing data sets but are not present in either the 1000 Genomes Project or the Exome Aggregation Consortium. We classified positive variant calls into 393 highly likely false positives, 126 likely false positives, and 156 likely true positives, which comprised 58.2%, 18.7%, and 23.1% of the variants, respectively. We identified four distinct error patterns of variant calling that may be bioinformatically corrected when using different strategies: simplicity region, SNV cluster, peripheral sequence read, and base inversion. Local de novo assembly successfully corrected 201 (38.7%) of the 519 highly likely or likely false positives. We also demonstrate that the two sequencing kits from Thermo Fisher (the Ion PI Sequencing 200 kit V3 and the Ion PI Hi-Q kit) exhibit different error profiles across different error types. A refined calling algorithm with better polymerase may improve the performance of the Ion Proton sequencing platform.
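    The percentages quoted above follow directly from the reported counts. A short Python check, using only numbers stated in the abstract (the variable and function names are illustrative), is:

```python
# Counts reported in the abstract for the 675 curated ultrarare variants.
counts = {
    "highly likely false positive": 393,
    "likely false positive": 126,
    "likely true positive": 156,
}


def percent(part, whole):
    """Percentage of `part` in `whole`, rounded to one decimal place."""
    return round(100.0 * part / whole, 1)


total = sum(counts.values())
shares = {label: percent(n, total) for label, n in counts.items()}

# False positives of either grade, and the subset fixed by local de novo assembly.
false_positives = counts["highly likely false positive"] + counts["likely false positive"]
corrected_share = percent(201, false_positives)

if __name__ == "__main__":
    print(total, shares)
    print(false_positives, corrected_share)
```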

  1. Denoising DNA deep sequencing data—high-throughput sequencing errors and their correction

    PubMed Central

    Laehnemann, David; Borkhardt, Arndt

    2016-01-01

    Characterizing the errors generated by common high-throughput sequencing platforms and telling true genetic variation from technical artefacts are two interdependent steps, essential to many analyses such as single nucleotide variant calling, haplotype inference, sequence assembly and evolutionary studies. Both random and systematic errors can show a specific occurrence profile for each of the six prominent sequencing platforms surveyed here: 454 pyrosequencing, Complete Genomics DNA nanoball sequencing, Illumina sequencing by synthesis, Ion Torrent semiconductor sequencing, Pacific Biosciences single-molecule real-time sequencing and Oxford Nanopore sequencing. There is a large variety of programs available for error removal in sequencing read data, which differ in the error models and statistical techniques they use, the features of the data they analyse, the parameters they determine from them and the data structures and algorithms they use. We highlight the assumptions they make and for which data types these hold, providing guidance which tools to consider for benchmarking with regard to the data properties. While no benchmarking results are included here, such specific benchmarks would greatly inform tool choices and future software development. The development of stand-alone error correctors, as well as single nucleotide variant and haplotype callers, could also benefit from using more of the knowledge about error profiles and from (re)combining ideas from the existing approaches presented here. PMID:26026159

  2. Do Recognition and Priming Index a Unitary Knowledge Base? Comment on Shanks et al. (2003)

    ERIC Educational Resources Information Center

    Runger, Dennis; Nagy, Gabriel; Frensch, Peter A.

    2009-01-01

    Whether sequence learning entails a single or multiple memory systems is a moot issue. Recently, D. R. Shanks, L. Wilkinson, and S. Channon advanced a single-system model that predicts a perfect correlation between true (i.e., error free) response time priming and recognition. The Shanks model is contrasted with a dual-process model that…

  3. Deviance sensitivity in the auditory cortex of freely moving rats

    PubMed Central

    2018-01-01

    Deviance sensitivity is the specific response to a surprising stimulus, one that violates expectations set by the past stimulation stream. In audition, deviance sensitivity is often conflated with stimulus-specific adaptation (SSA), the decrease in responses to a common stimulus that only partially generalizes to other, rare stimuli. SSA is usually measured using oddball sequences, where a common (standard) tone and a rare (deviant) tone are randomly intermixed. However, the larger responses to a tone when deviant do not necessarily represent deviance sensitivity. Deviance sensitivity is commonly tested using a control sequence in which many different tones serve as the standard, eliminating the expectations set by the standard ('deviant among many standards'). When the response to a tone when deviant (against a single standard) is larger than the responses to the same tone in the control sequence, it is concluded that true deviance sensitivity occurs. In primary auditory cortex of anesthetized rats, responses to deviants and to the same tones in the control condition are comparable in size. We recorded local field potentials and multiunit activity from the auditory cortex of awake, freely moving rats, implanted with 32-channel drivable microelectrode arrays and using telemetry. We observed highly significant SSA in the awake state. Moreover, the responses to a tone when deviant were significantly larger than the responses to the same tone in the control condition. These results establish the presence of true deviance sensitivity in primary auditory cortex in awake rats. PMID:29874246

  4. Application of hidden Markov models to biological data mining: a case study

    NASA Astrophysics Data System (ADS)

    Yin, Michael M.; Wang, Jason T.

    2000-04-01

    In this paper we present an example of biological data mining: the detection of splicing junction acceptors in eukaryotic genes. Identification or prediction of transcribed sequences from within genomic DNA has been a major rate-limiting step in the pursuit of genes. Programs currently available are far from being powerful enough to elucidate the gene structure completely. Here we develop a hidden Markov model (HMM) to represent the degeneracy features of splicing junction acceptor sites in eukaryotic genes. The HMM system is fully trained using an expectation maximization (EM) algorithm and the system performance is evaluated using the 10-way cross-validation method. Experimental results show that our HMM system can correctly classify more than 94% of the candidate sequences (including true and false acceptor sites) into right categories. About 90% of the true acceptor sites and 96% of the false acceptor sites in the test data are classified correctly. These results are very promising considering that only the local information in DNA is used. The proposed model will be a very important component of an effective and accurate gene structure detection system currently being developed in our lab.

  5. Micropropagation and assessment of genetic fidelity of Henckelia incana: an endemic and medicinal Gesneriad of South India.

    PubMed

    Prameela, J; Ramakrishnaiah, H; Krishna, V; Deepalakshmi, A P; Naveen Kumar, N; Radhika, R N

    2015-07-01

    Henckelia incana is an endemic medicinal plant used for the treatment of fever and skin allergy. In the present study shoot regeneration was evaluated on Murashige and Skoog's (MS) medium supplemented with auxins, Indole-3-acetic acid (IAA), Indole-3-butyric acid (IBA), 1-Naphthaleneacetic acid (NAA), 2,4-Dichlorophenoxyacetic acid (2,4-D) and cytokinins, 6-Benzylaminopurine (BAP) and Kinetin (Kn) at concentrations of 0.5, 1.0, 2.0, 3.0, 4.0 and 5.0 mg l(-1). MS medium with IBA (18.08), NAA (17.83) and IAA (17.58) at 0.5 mg l(-1) concentrations showed efficient regeneration. Regenerated shoots were rooted on half-strength MS medium with and without 0.5 mg l(-1) IBA or NAA. The plantlets were successfully hardened in rooting trays (peat, vermiculite and sand) and transferred to field milieu. The genetic fidelity of in vitro raised plants was assessed by using three different single primer amplification reaction (SPAR) markers namely random amplified polymorphic DNA (RAPD), inter-simple sequence repeat (ISSR) and direct amplification of mini-satellite DNA region (DAMD). The results consistently demonstrated true-to-true type propagation. This is the first report of in vitro propagation and establishment of true-to-true type genetic fidelity in H. incana.

  6. High-Throughput Identification of Loss-of-Function Mutations for Anti-Interferon Activity in the Influenza A Virus NS Segment

    PubMed Central

    Wu, Nicholas C.; Young, Arthur P.; Al-Mawsawi, Laith Q.; Olson, C. Anders; Feng, Jun; Qi, Hangfei; Luan, Harding H.; Li, Xinmin; Wu, Ting-Ting

    2014-01-01

    ABSTRACT Viral proteins often display several functions which require multiple assays to dissect their genetic basis. Here, we describe a systematic approach to screen for loss-of-function mutations that confer a fitness disadvantage under a specified growth condition. Our methodology was achieved by genetically monitoring a mutant library under two growth conditions, with and without interferon, by deep sequencing. We employed a molecular tagging technique to distinguish true mutations from sequencing error. This approach enabled us to identify mutations that were negatively selected against, in addition to those that were positively selected for. Using this technique, we identified loss-of-function mutations in the influenza A virus NS segment that were sensitive to type I interferon in a high-throughput fashion. Mechanistic characterization further showed that a single substitution, D92Y, resulted in the inability of NS to inhibit RIG-I ubiquitination. The approach described in this study can be applied under any specified condition for any virus that can be genetically manipulated. IMPORTANCE Traditional genetics focuses on a single genotype-phenotype relationship, whereas high-throughput genetics permits phenotypic characterization of numerous mutants in parallel. High-throughput genetics often involves monitoring of a mutant library with deep sequencing. However, deep sequencing suffers from a high error rate (∼0.1 to 1%), which is usually higher than the occurrence frequency for individual point mutations within a mutant library. Therefore, only mutations that confer a fitness advantage can be identified with confidence due to an enrichment in the occurrence frequency. In contrast, it is impossible to identify deleterious mutations using most next-generation sequencing techniques. In this study, we have applied a molecular tagging technique to distinguish true mutations from sequencing errors. It enabled us to identify mutations that underwent negative selection, in addition to mutations that experienced positive selection. This study provides a proof of concept by screening for loss-of-function mutations on the influenza A virus NS segment that are involved in its anti-interferon activity. PMID:24965464
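
    The abstract does not spell out how the molecular tags are used. One common way such tags can separate true mutations from sequencing errors is to group reads that share a tag and take a per-position consensus, since a true mutation should appear in nearly all reads from the same tagged molecule while a sequencing error should appear in only a few. The sketch below illustrates that general idea only; the function name, the `min_reads` and `min_fraction` thresholds, and the equal-length read assumption are all hypothetical and not taken from the study.

```python
from collections import Counter, defaultdict

def consensus_by_tag(tagged_reads, min_reads=3, min_fraction=0.9):
    """Build one consensus sequence per molecular tag.

    tagged_reads: iterable of (tag, sequence) pairs; reads sharing a tag
    are assumed to derive from the same template molecule.
    Tags with fewer than min_reads reads, or with reads of unequal
    length, are skipped. Positions without a clear majority become "N".
    """
    groups = defaultdict(list)
    for tag, seq in tagged_reads:
        groups[tag].append(seq)

    consensus = {}
    for tag, seqs in groups.items():
        if len(seqs) < min_reads:
            continue
        if any(len(s) != len(seqs[0]) for s in seqs):
            continue
        bases = []
        for column in zip(*seqs):
            base, count = Counter(column).most_common(1)[0]
            bases.append(base if count / len(seqs) >= min_fraction else "N")
        consensus[tag] = "".join(bases)
    return consensus
```

    Under these assumptions, a base change seen in every read of a tag group survives into the consensus, while a change seen in a single read is outvoted.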

  7. Truly random number generation: an example

    NASA Astrophysics Data System (ADS)

    Frauchiger, Daniela; Renner, Renato

    2013-10-01

    Randomness is crucial for a variety of applications, ranging from gambling to computer simulations, and from cryptography to statistics. However, many of the currently used methods for generating randomness do not meet the criteria that are necessary for these applications to work properly and safely. A common problem is that a sequence of numbers may look random but nevertheless not be truly random. In fact, the sequence may pass all standard statistical tests and yet be perfectly predictable. This renders it useless for many applications. For example, in cryptography, the predictability of a "randomly" chosen password is obviously undesirable. Here, we review a recently developed approach to generating true, and hence unpredictable, randomness.

  8. Schema vs. primitive perceptual grouping: the relative weighting of sequential vs. spatial cues during an auditory grouping task in frogs.

    PubMed

    Farris, Hamilton E; Ryan, Michael J

    2017-03-01

    Perceptually, grouping sounds based on their sources is critical for communication. This is especially true in túngara frog breeding aggregations, where multiple males produce overlapping calls that consist of an FM 'whine' followed by harmonic bursts called 'chucks'. Phonotactic females use at least two cues to group whines and chucks: whine-chuck spatial separation and sequence. Spatial separation is a primitive cue, whereas sequence is schema-based, as chuck production is morphologically constrained to follow whines, meaning that males cannot produce the components simultaneously. When one cue is available, females perceptually group whines and chucks using relative comparisons: components with the smallest spatial separation or those closest to the natural sequence are more likely grouped. By simultaneously varying the temporal sequence and spatial separation of a single whine and two chucks, this study measured between-cue perceptual weighting during a specific grouping task. Results show that whine-chuck spatial separation is a stronger grouping cue than temporal sequence, as grouping is more likely for stimuli with smaller spatial separation and non-natural sequence than those with larger spatial separation and natural sequence. Compared to the schema-based whine-chuck sequence, we propose that spatial cues have less variance, potentially explaining their preferred use when grouping during directional behavioral responses.

  9. Using nearly full-genome HIV sequence data improves phylogeny reconstruction in a simulated epidemic

    PubMed Central

    Yebra, Gonzalo; Hodcroft, Emma B.; Ragonnet-Cronin, Manon L.; Pillay, Deenan; Brown, Andrew J. Leigh; Fraser, Christophe; Kellam, Paul; de Oliveira, Tulio; Dennis, Ann; Hoppe, Anne; Kityo, Cissy; Frampton, Dan; Ssemwanga, Deogratius; Tanser, Frank; Keshani, Jagoda; Lingappa, Jairam; Herbeck, Joshua; Wawer, Maria; Essex, Max; Cohen, Myron S.; Paton, Nicholas; Ratmann, Oliver; Kaleebu, Pontiano; Hayes, Richard; Fidler, Sarah; Quinn, Thomas; Novitsky, Vladimir; Haywards, Andrew; Nastouli, Eleni; Morris, Steven; Clark, Duncan; Kozlakidis, Zisis

    2016-01-01

    HIV molecular epidemiology studies analyse viral pol gene sequences due to their availability, but whole genome sequencing allows the use of other genes. We aimed to determine what gene(s) provide(s) the best approximation to the real phylogeny by analysing a simulated epidemic (created as part of the PANGEA_HIV project) with a known transmission tree. We sub-sampled a simulated dataset of 4662 sequences into different combinations of genes (gag-pol-env, gag-pol, gag, pol, env and partial pol) and sampling depths (100%, 60%, 20% and 5%), generating 100 replicates for each case. We built maximum-likelihood trees for each combination using RAxML (GTR + Γ), and compared their topologies to the corresponding true tree’s using CompareTree. The accuracy of the trees was significantly proportional to the length of the sequences used, with the gag-pol-env datasets showing the best performance and gag and partial pol sequences showing the worst. The lowest sampling depths (20% and 5%) greatly reduced the accuracy of tree reconstruction and showed high variability among replicates, especially when using the shortest gene datasets. In conclusion, using longer sequences derived from nearly whole genomes will improve the reliability of phylogenetic reconstruction. With low sample coverage, results can be highly variable, particularly when based on short sequences. PMID:28008945

  10. Using nearly full-genome HIV sequence data improves phylogeny reconstruction in a simulated epidemic.

    PubMed

    Yebra, Gonzalo; Hodcroft, Emma B; Ragonnet-Cronin, Manon L; Pillay, Deenan; Brown, Andrew J Leigh

    2016-12-23

    HIV molecular epidemiology studies analyse viral pol gene sequences due to their availability, but whole genome sequencing allows the use of other genes. We aimed to determine what gene(s) provide(s) the best approximation to the real phylogeny by analysing a simulated epidemic (created as part of the PANGEA_HIV project) with a known transmission tree. We sub-sampled a simulated dataset of 4662 sequences into different combinations of genes (gag-pol-env, gag-pol, gag, pol, env and partial pol) and sampling depths (100%, 60%, 20% and 5%), generating 100 replicates for each case. We built maximum-likelihood trees for each combination using RAxML (GTR + Γ), and compared their topologies to the corresponding true tree's using CompareTree. The accuracy of the trees was significantly proportional to the length of the sequences used, with the gag-pol-env datasets showing the best performance and gag and partial pol sequences showing the worst. The lowest sampling depths (20% and 5%) greatly reduced the accuracy of tree reconstruction and showed high variability among replicates, especially when using the shortest gene datasets. In conclusion, using longer sequences derived from nearly whole genomes will improve the reliability of phylogenetic reconstruction. With low sample coverage, results can be highly variable, particularly when based on short sequences.

  11. Position-specific automated processing of V3 env ultra-deep pyrosequencing data for predicting HIV-1 tropism

    PubMed Central

    Jeanne, Nicolas; Saliou, Adrien; Carcenac, Romain; Lefebvre, Caroline; Dubois, Martine; Cazabat, Michelle; Nicot, Florence; Loiseau, Claire; Raymond, Stéphanie; Izopet, Jacques; Delobel, Pierre

    2015-01-01

    HIV-1 coreceptor usage must be accurately determined before starting CCR5 antagonist-based treatment as the presence of undetected minor CXCR4-using variants can cause subsequent virological failure. Ultra-deep pyrosequencing of HIV-1 V3 env allows the detection of low levels of CXCR4-using variants that current genotypic approaches miss. However, the computation of the mass of sequence data and the need to identify true minor variants while excluding artifactual sequences generated during amplification and ultra-deep pyrosequencing is rate-limiting. Arbitrary fixed cut-offs below which minor variants are discarded are currently used but the errors generated during ultra-deep pyrosequencing are sequence-dependent rather than random. We have developed an automated processing of HIV-1 V3 env ultra-deep pyrosequencing data that uses biological filters to discard artifactual or non-functional V3 sequences followed by statistical filters to determine position-specific sensitivity thresholds, rather than arbitrary fixed cut-offs. It allows authentic sequences with point mutations at V3 positions of interest to be retained and artifactual ones to be discarded with accurate sensitivity thresholds. PMID:26585833

  12. Position-specific automated processing of V3 env ultra-deep pyrosequencing data for predicting HIV-1 tropism.

    PubMed

    Jeanne, Nicolas; Saliou, Adrien; Carcenac, Romain; Lefebvre, Caroline; Dubois, Martine; Cazabat, Michelle; Nicot, Florence; Loiseau, Claire; Raymond, Stéphanie; Izopet, Jacques; Delobel, Pierre

    2015-11-20

    HIV-1 coreceptor usage must be accurately determined before starting CCR5 antagonist-based treatment as the presence of undetected minor CXCR4-using variants can cause subsequent virological failure. Ultra-deep pyrosequencing of HIV-1 V3 env allows the detection of low levels of CXCR4-using variants that current genotypic approaches miss. However, the computation of the mass of sequence data and the need to identify true minor variants while excluding artifactual sequences generated during amplification and ultra-deep pyrosequencing is rate-limiting. Arbitrary fixed cut-offs below which minor variants are discarded are currently used but the errors generated during ultra-deep pyrosequencing are sequence-dependent rather than random. We have developed an automated processing of HIV-1 V3 env ultra-deep pyrosequencing data that uses biological filters to discard artifactual or non-functional V3 sequences followed by statistical filters to determine position-specific sensitivity thresholds, rather than arbitrary fixed cut-offs. It allows authentic sequences with point mutations at V3 positions of interest to be retained and artifactual ones to be discarded with accurate sensitivity thresholds.

  13. Phylogenetic position of the North American isolate of Pasteuria that parasitizes the soybean cyst nematode, Heterodera glycines, as inferred from 16S rDNA sequence analysis.

    PubMed

    Atibalentja, N; Noel, G R; Domier, L L

    2000-03-01

    A 1341 bp sequence of the 16S rDNA of an undescribed species of Pasteuria that parasitizes the soybean cyst nematode, Heterodera glycines, was determined and then compared with a homologous sequence of Pasteuria ramosa, a parasite of cladoceran water fleas of the family Daphnidae. The two Pasteuria sequences, which diverged from each other by a dissimilarity index of 7%, also were compared with the 16S rDNA sequences of 30 other bacterial species to determine the phylogenetic position of the genus Pasteuria among the Gram-positive eubacteria. Phylogenetic analyses using maximum-likelihood, maximum-parsimony and neighbour-joining methods showed that the Heterodera glycines-infecting Pasteuria and its sister species, P. ramosa, form a distinct line of descent within the Alicyclobacillus group of the Bacillaceae. These results are consistent with the view that the genus Pasteuria is a deeply rooted member of the Clostridium-Bacillus-Streptococcus branch of the Gram-positive eubacteria, neither related to the actinomycetes nor closely related to true endospore-forming bacteria.

  14. Artificially designed pathogens - a diagnostic option for future military deployments.

    PubMed

    Zautner, Andreas E; Masanta, Wycliffe O; Hinz, Rebecca; Hagen, Ralf Matthias; Frickmann, Hagen

    2015-01-01

    Diagnostic microbial isolates of bio-safety levels 3 and 4 are difficult to handle in medical field camps under military deployment settings. International transport of such isolates is challenging due to restrictions by the International Air Transport Association. An alternative option might be inactivation and sequencing of the pathogen at the deployment site with subsequent sequence-based revitalization in well-equipped laboratories in the home country for further scientific assessment. A literature review was written based on a PubMed search. First described for poliovirus in 2002, de novo synthesis of pathogens based on their sequence information has become a well-established procedure in science. Successful syntheses have been demonstrated for both viruses and prokaryotes. However, the technology is not yet available for routine diagnostic purposes. Due to the potential utility of diagnostic sequencing and sequence-based de novo synthesis of pathogens, it seems worthwhile to establish the technology for diagnostic purposes over the intermediate term. This is particularly true for resource-restricted deployment settings, where safe handling of harmful pathogens cannot always be guaranteed.

  15. Relative Packing Groups in Template-Based Structure Prediction: Cooperative Effects of True Positive Constraints

    PubMed Central

    Day, Ryan; Qu, Xiaotao; Swanson, Rosemarie; Bohannan, Zach; Bliss, Robert

    2011-01-01

    Abstract Most current template-based structure prediction methods concentrate on finding the correct backbone conformation and then packing sidechains within that backbone. Our packing-based method derives distance constraints from conserved relative packing groups (RPGs). In our refinement approach, the RPGs provide a level of resolution that restrains global topology while allowing conformational sampling. In this study, we test our template-based structure prediction method using 51 prediction units from CASP7 experiments. RPG-based constraints are able to substantially improve approximately two-thirds of starting templates. Upon deeper investigation, we find that true positive spatial constraints, especially those non-local in sequence, derived from the RPGs were important to building nearer native models. Surprisingly, the fraction of incorrect or false positive constraints does not strongly influence the quality of the final candidate. This result indicates that our RPG-based true positive constraints sample the self-consistent, cooperative interactions of the native structure. The lack of such reinforcing cooperativity explains the weaker effect of false positive constraints. Generally, these findings are encouraging indications that RPGs will improve template-based structure prediction. PMID:21210729

  16. Journey During Acute Ischemic Stroke: A Physician’s Experience

    PubMed Central

    Hoong, Low Chen; Sharma, Vijay K.

    2010-01-01

    Acute ischemic stroke is a potentially devastating condition. What follows is a true narration of the experience of a doctor-patient during his treatment for acute ischemic stroke and how the experience changed him. Described is the temporal sequence of events, starting from home to infusion of tissue plasminogen activator, which, when coupled with a multimodal therapeutic approach, resulted in an excellent clinical recovery. PMID:20458112

  17. SU-E-J-155: Utilizing Varian TrueBeam Developer Mode for the Quantification of Mechanical Limits and the Simulation of 4D Respiratory Motion

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Moseley, D; Dave, M

    Purpose: Use Varian TrueBeam Developer mode to quantify the mechanical limits of the couch and to simulate 4D respiratory motion. Methods: An in-house MATLAB based GUI was created to make the BEAM XML files. The couch was moved in a triangular wave in the S/I direction with varying amplitudes (1mm, 5mm, 10mm, and 50mm) and periods (3s, 6s, and 9s). The periods were determined by specifying the speed. The theoretical positions were compared to the values recorded by the machine at 50 Hz. HD videos were taken for certain tests as external validation. 4D Respiratory motion was simulated by an A/P MV beam being delivered while the couch moved in an elliptical manner. The ellipse had a major axis of 2 cm (S/I) and a minor axis of 1 cm (A/P). Results: The path planned by the TrueBeam deviated from the theoretical triangular form as the speed increased. Deviations were noticed starting at a speed of 3.33 cm/s (50mm amplitude, 6s period). The greatest deviation occurred in the 50mm-3s sequence with a correlation value of −0.13 and a 27% time increase; the plan essentially became out of phase. Excluding these two, the plans had correlation values of 0.99. The elliptical sequence effectively simulated a respiratory pattern with a period of 6s. The period could be controlled by changing the speeds or the dose rate. Conclusion: The work first shows the quantification of the mechanical limits of the couch and the speeds at which the proposed plans begin to deviate. These limits must be kept in mind when programming other couch sequences. The methodology can be used to quantify the limits of other axes. Furthermore, the work shows the possibility of creating 4D respiratory simulations without using specialized phantoms or motion-platforms. This can be further developed to program patient-specific breathing patterns.
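
    The quoted speed of 3.33 cm/s for the 50mm, 6s case can be reproduced with a short calculation. The sketch below assumes that "amplitude" means the peak excursion from the centre position, so that the couch travels four times the amplitude in one period; the abstract does not state this explicitly, and the function names are made up for the illustration.

```python
def triangular_wave_speed_cm_per_s(amplitude_mm, period_s):
    """Constant couch speed needed for a triangular wave.

    Assumes amplitude is the zero-to-peak excursion, so one full period
    covers a path of 4 * amplitude.
    """
    path_mm = 4.0 * amplitude_mm
    return path_mm / period_s / 10.0  # convert mm/s to cm/s

def triangular_position_mm(t_s, amplitude_mm, period_s):
    """Theoretical position at time t_s, starting at 0 and moving up."""
    phase = (t_s / period_s) % 1.0
    if phase < 0.25:
        return 4.0 * amplitude_mm * phase          # rising from 0 to +A
    if phase < 0.75:
        return amplitude_mm * (2.0 - 4.0 * phase)  # falling from +A to -A
    return amplitude_mm * (4.0 * phase - 4.0)      # rising from -A to 0
```

    With these assumptions, `triangular_wave_speed_cm_per_s(50, 6)` matches the speed at which the abstract reports the first deviations, and the 50mm, 3s case requires twice that speed.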

  18. Ancient DNA studies: new perspectives on old samples

    PubMed Central

    2012-01-01

    In spite of past controversies, the field of ancient DNA is now a reliable research area due to recent methodological improvements. A series of recent large-scale studies have revealed the true potential of ancient DNA samples to study the processes of evolution and to test models and assumptions commonly used to reconstruct patterns of evolution and to analyze population genetics and palaeoecological changes. Recent advances in DNA technologies, such as next-generation sequencing, make it possible to recover DNA information from archaeological and paleontological remains, allowing us to go back in time and study the genetic relationships between extinct organisms and their contemporary relatives. With the next-generation sequencing methodologies, DNA sequences can be retrieved even from samples (for example human remains) for which the technical pitfalls of classical methodologies required stringent criteria to guarantee the reliability of the results. In this paper, we review the methodologies applied to ancient DNA analysis and the perspectives that next-generation sequencing applications provide in this field. PMID:22697611

  19. Simulating Next-Generation Sequencing Datasets from Empirical Mutation and Sequencing Models

    PubMed Central

    Stephens, Zachary D.; Hudson, Matthew E.; Mainzer, Liudmila S.; Taschuk, Morgan; Weber, Matthew R.; Iyer, Ravishankar K.

    2016-01-01

    An obstacle to validating and benchmarking methods for genome analysis is that there are few reference datasets available for which the “ground truth” about the mutational landscape of the sample genome is known and fully validated. Additionally, the free and public availability of real human genome datasets is incompatible with the preservation of donor privacy. In order to better analyze and understand genomic data, we need test datasets that model all variants, reflecting known biology as well as sequencing artifacts. Read simulators can fulfill this requirement, but are often criticized for limited resemblance to true data and overall inflexibility. We present NEAT (NExt-generation sequencing Analysis Toolkit), a set of tools that not only includes an easy-to-use read simulator, but also scripts to facilitate variant comparison and tool evaluation. NEAT has a wide variety of tunable parameters which can be set manually on the default model or parameterized using real datasets. The software is freely available at github.com/zstephens/neat-genreads. PMID:27893777

  20. The rRNA evolution and procaryotic phylogeny

    NASA Technical Reports Server (NTRS)

    Fox, G. E.

    1986-01-01

    Studies of ribosomal RNA primary structure allow reconstruction of phylogenetic trees for prokaryotic organisms. Such studies reveal a major dichotomy among the bacteria that separates them into eubacteria and archaebacteria. Both groupings are further segmented into several major divisions. The results obtained from 5S rRNA sequences are essentially the same as those obtained with the 16S rRNA data. In the case of Gram negative bacteria the ribosomal RNA sequencing results can also be directly compared with hybridization studies and cytochrome c sequencing studies. There is again excellent agreement among the several methods. It seems likely then that the overall picture of microbial phylogeny that is emerging from the RNA sequence studies is a good approximation of the true history of these organisms. The RNA data allow examination of the evolutionary process in a semi-quantitative way. The secondary structures of these RNAs are largely established. As a result it is possible to recognize examples of local structural evolution. Evolutionary pathways accounting for these events can be proposed and their probability can be assessed.

  1. Relation between native ensembles and experimental structures of proteins

    PubMed Central

    Best, Robert B.; Lindorff-Larsen, Kresten; DePristo, Mark A.; Vendruscolo, Michele

    2006-01-01

    Different experimental structures of the same protein or of proteins with high sequence similarity contain many small variations. Here we construct ensembles of “high-sequence similarity Protein Data Bank” (HSP) structures and consider the extent to which such ensembles represent the structural heterogeneity of the native state in solution. We find that different NMR measurements probing structure and dynamics of given proteins in solution, including order parameters, scalar couplings, and residual dipolar couplings, are remarkably well reproduced by their respective high-sequence similarity Protein Data Bank ensembles; moreover, we show that the effects of uncertainties in structure determination are insufficient to explain the results. These results highlight the importance of accounting for native-state protein dynamics in making comparisons with ensemble-averaged experimental data and suggest that even a modest number of structures of a protein determined under different conditions, or with small variations in sequence, capture a representative subset of the true native-state ensemble. PMID:16829580

  2. Cloud-based adaptive exon prediction for DNA analysis

    PubMed Central

    Putluri, Srinivasareddy; Fathima, Shaik Yasmeen

    2018-01-01

    Cloud computing offers significant research and economic benefits to healthcare organisations. Cloud services provide a safe place for storing and managing large amounts of such sensitive data. Under the conventional flow of gene information, gene sequence laboratories send out raw and inferred information via the Internet to several sequence libraries. DNA sequencing storage costs will be minimised by the use of cloud services. In this study, the authors put forward a novel genomic informatics system using Amazon Cloud Services, where genomic sequence information is stored and accessed for processing. True identification of exon regions in a DNA sequence is a key task in bioinformatics, which helps in disease identification and drug design. The three-base periodicity property of exons forms the basis of all exon identification techniques. Adaptive signal processing techniques are found to be promising in comparison with several other methods. Several adaptive exon predictors (AEPs) are developed using variable normalised least mean square and its maximum normalised variants to reduce computational complexity. Finally, performance evaluation of various AEPs is done based on measures such as sensitivity, specificity and precision using various standard genomic datasets taken from the National Center for Biotechnology Information genomic sequence database. PMID:29515813
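
    The three-base periodicity property can be illustrated with a plain Fourier measure. The sketch below is not the adaptive normalised least mean square predictor described in the abstract; it substitutes a simpler, widely used check that sums, over the four bases, the squared magnitude of the discrete Fourier component at period 3 of each base's binary indicator sequence. The function name and the normalisation by sequence length are assumptions made for the example.

```python
import cmath

def period3_power(seq):
    """Period-3 spectral power of a DNA sequence, normalised by length.

    For each base, positions where it occurs are treated as 1 and all
    other positions as 0; the Fourier component at frequency 1/3 is then
    accumulated over A, C, G and T.
    """
    seq = seq.upper()
    n = len(seq)
    if n == 0:
        return 0.0
    total = 0.0
    for base in "ACGT":
        coeff = sum(
            cmath.exp(-2j * cmath.pi * i / 3)
            for i, b in enumerate(seq)
            if b == base
        )
        total += abs(coeff) ** 2
    return total / n
```

    A sequence with strong codon-like structure, such as a repeated triplet, scores much higher than a sequence with no period-3 pattern; in practice the measure would be evaluated in a sliding window along the sequence.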

  3. SeqFIRE: a web application for automated extraction of indel regions and conserved blocks from protein multiple sequence alignments.

    PubMed

    Ajawatanawong, Pravech; Atkinson, Gemma C; Watson-Haigh, Nathan S; Mackenzie, Bryony; Baldauf, Sandra L

    2012-07-01

    Analyses of multiple sequence alignments generally focus on well-defined conserved sequence blocks, while the rest of the alignment is largely ignored or discarded. This is especially true in phylogenomics, where large multigene datasets are produced through automated pipelines. However, some of the most powerful phylogenetic markers have been found in the variable length regions of multiple alignments, particularly insertions/deletions (indels) in protein sequences. We have developed Sequence Feature and Indel Region Extractor (SeqFIRE) to enable the automated identification and extraction of indels from protein sequence alignments. The program can also extract conserved blocks and identify fast evolving sites using a combination of conservation and entropy. All major variables can be adjusted by the user, allowing them to identify the sets of variables most suited to a particular analysis or dataset. Thus, all major tasks in preparing an alignment for further analysis are combined in a single flexible and user-friendly program. The output includes a numbered list of indels, alignments in NEXUS format with indels annotated or removed and indel-only matrices. SeqFIRE is a user-friendly web application, freely available online at www.seqfire.org/.

  4. Gene Unprediction with Spurio: A tool to identify spurious protein sequences.

    PubMed

    Höps, Wolfram; Jeffryes, Matt; Bateman, Alex

    2018-01-01

    We now have access to the sequences of tens of millions of proteins. These protein sequences are essential for modern molecular biology and computational biology. The vast majority of protein sequences are derived from gene prediction tools and have no experimental supporting evidence for their translation. Despite the increasing accuracy of gene prediction tools there likely exists a large number of spurious protein predictions in the sequence databases. We have developed the Spurio tool to help identify spurious protein predictions in prokaryotes. Spurio searches the query protein sequence against a prokaryotic nucleotide database using tblastn and identifies homologous sequences. The tblastn matches are used to score the query sequence's likelihood of being a spurious protein prediction using a Gaussian process model. The most informative feature is the appearance of stop codons within the presumed translation of homologous DNA sequences. Benchmarking shows that the Spurio tool is able to distinguish spurious from true proteins. However, transposon proteins are prone to be predicted as spurious because of the frequency of degraded homologs found in the DNA sequence databases. Our initial experiments suggest that less than 1% of the proteins in the UniProtKB sequence database are likely to be spurious and that Spurio is able to identify over 60 times more spurious proteins than the AntiFam resource. The Spurio software and source code are available under an MIT license at the following URL: https://bitbucket.org/bateman-group/spurio.

  5. Determination of azoxystrobin residues in grapes, musts and wines with a multicommuted flow-through optosensor implemented with photochemically induced fluorescence.

    PubMed

    Flores, Javier López; Díaz, Antonio Molina; Fernández de Córdova, María L

    2007-02-28

    In this paper, the conversion of azoxystrobin into a strongly fluorescent degradation product by UV irradiation for quantitative purposes and its fluorimetric determination are reported for the first time. A multicommuted flow injection-solid phase spectroscopy (FI-SPS) system combined with photochemically-induced fluorescence (PIF) is developed for the determination of azoxystrobin in grapes, must and wine. Grape samples were homogenized and extracted with methanol and further cleaned-up by solid-phase extraction on C(18) silica gel. Wine samples were solid-phase extracted on C(18) sorbent using dichloromethane as eluent. Recoveries of azoxystrobin from spiked grapes (0.5-2.0 mg kg(-1)), must (0.5-2.0 microg mL(-1)) and wine (0.5-2.0 microg mL(-1)) were 84.0-87.6%, 95.5-105.9% and 88.5-111.2%, respectively. The quantification limit for grapes was 0.021 mg kg(-1), being within European Union regulations, and 18 microg L(-1) and 8 microg L(-1) for must and wine, respectively.

  6. Community and gene composition of a human dental plaque microbiota obtained by metagenomic sequencing

    PubMed Central

    Xie, G.; Chain, P.S.G.; Lo, C.; Liu, K-L.; Gans, J.; Merritt, J.; Qi, F.

    2010-01-01

    SUMMARY Human dental plaque is a complex microbial community containing an estimated 700 to 19,000 species/phylotypes. Despite numerous studies analysing species richness in healthy and diseased human subjects, the true genomic composition of the human dental plaque microbiota remains unknown. Here we report a metagenomic analysis of a healthy human plaque sample using a combination of second-generation sequencing platforms. A total of 860 million base pairs of non-human sequences were generated. Various analysis tools revealed the presence of 12 well-characterized phyla, members of the TM-7 and BRC1 clade, and sequences that could not be classified. Both pathogens and opportunistic pathogens were identified, supporting the ecological plaque hypothesis for oral diseases. Mapping the metagenomic reads to sequenced reference genomes demonstrated that 4% of the reads could be assigned to the sequenced species. Preliminary annotation identified genes belonging to all known functional categories. Interestingly, although 73% of the total assembled contig sequences were predicted to code for proteins, only 51% of them could be assigned a functional role. Furthermore, ~ 2.8% of the total predicted genes coded for proteins involved in resistance to antibiotics and toxic compounds, suggesting that the oral cavity is an important reservoir for antimicrobial resistance. PMID:21040513

  7. Community and gene composition of a human dental plaque microbiota obtained by metagenomic sequencing.

    PubMed

    Xie, G; Chain, P S G; Lo, C-C; Liu, K-L; Gans, J; Merritt, J; Qi, F

    2010-12-01

    Human dental plaque is a complex microbial community containing an estimated 700 to 19,000 species/phylotypes. Despite numerous studies analysing species richness in healthy and diseased human subjects, the true genomic composition of the human dental plaque microbiota remains unknown. Here we report a metagenomic analysis of a healthy human plaque sample using a combination of second-generation sequencing platforms. A total of 860 million base pairs of non-human sequences were generated. Various analysis tools revealed the presence of 12 well-characterized phyla, members of the TM-7 and BRC1 clade, and sequences that could not be classified. Both pathogens and opportunistic pathogens were identified, supporting the ecological plaque hypothesis for oral diseases. Mapping the metagenomic reads to sequenced reference genomes demonstrated that 4% of the reads could be assigned to the sequenced species. Preliminary annotation identified genes belonging to all known functional categories. Interestingly, although 73% of the total assembled contig sequences were predicted to code for proteins, only 51% of them could be assigned a functional role. Furthermore, ~2.8% of the total predicted genes coded for proteins involved in resistance to antibiotics and toxic compounds, suggesting that the oral cavity is an important reservoir for antimicrobial resistance. © 2010 John Wiley & Sons A/S.

  8. GFam: a platform for automatic annotation of gene families.

    PubMed

    Sasidharan, Rajkumar; Nepusz, Tamás; Swarbreck, David; Huala, Eva; Paccanaro, Alberto

    2012-10-01

    We have developed GFam, a platform for automatic annotation of gene/protein families. GFam provides a framework for genome initiatives and model organism resources to build domain-based families and derive meaningful functional labels, and it offers a seamless approach to propagating functional annotation across periodic genome updates. GFam is a hybrid approach that uses a greedy algorithm to chain component domains from InterPro annotation provided by its 12 member resources, followed by a sequence-based connected component analysis of un-annotated sequence regions, to derive a consensus domain architecture for each sequence and subsequently generate families based on common architectures. For the proteome of Arabidopsis, our integrated approach increases sequence coverage by 7.2 percentage points and residue coverage by 14.6 percentage points relative to the best single-constituent database within InterPro. The true power of GFam lies in maximizing annotation provided by the different InterPro data sources that offer resource-specific coverage for different regions of a sequence. GFam's capability to capture higher sequence and residue coverage can be useful for genome annotation, comparative genomics and functional studies. GFam is general-purpose software and can be used for any collection of protein sequences. The software is open source and can be obtained from http://www.paccanarolab.org/software/gfam/.

  9. False positives complicate ancient pathogen identifications using high-throughput shotgun sequencing

    PubMed Central

    2014-01-01

    Background Identification of historic pathogens is challenging since false positives and negatives are a serious risk. Environmental non-pathogenic contaminants are ubiquitous. Furthermore, public genetic databases contain limited information regarding these species. High-throughput sequencing may help reliably detect and identify historic pathogens. Results We shotgun-sequenced 8 16th-century Mixtec individuals from the site of Teposcolula Yucundaa (Oaxaca, Mexico) who are reported to have died from the huey cocoliztli (‘Great Pestilence’ in Nahuatl), an unknown disease that decimated native Mexican populations during the Spanish colonial period, in order to identify the pathogen. Comparison of these sequences with those deriving from the surrounding soil and from 4 precontact individuals from the site found a wide variety of contaminant organisms that confounded analyses. Without the comparative sequence data from the precontact individuals and soil, false positives for Yersinia pestis and rickettsiosis could have been reported. Conclusions False positives and negatives remain problematic in ancient DNA analyses despite the application of high-throughput sequencing. Our results suggest that several studies claiming the discovery of ancient pathogens may need further verification. Additionally, true single molecule sequencing’s short read lengths, inability to sequence through DNA lesions, and limited ancient-DNA-specific technical development hinder its application to palaeopathology. PMID:24568097

  10. Axillary lymph node metastases in patients with breast carcinomas: assessment with nonenhanced versus uspio-enhanced MR imaging.

    PubMed

    Memarsadeghi, Mazda; Riedl, Christopher C; Kaneider, Andreas; Galid, Arik; Rudas, Margaretha; Matzek, Wolfgang; Helbich, Thomas H

    2006-11-01

    To prospectively assess the accuracy of nonenhanced versus ultrasmall superparamagnetic iron oxide (USPIO)-enhanced magnetic resonance (MR) imaging for depiction of axillary lymph node metastases in patients with breast carcinoma, with histopathologic findings as reference standard. The study was approved by the university ethics committee; written informed consent was obtained. Twenty-two women (mean age, 60 years; range, 40-79 years) with breast carcinomas underwent nonenhanced and USPIO-enhanced (2.6 mg of iron per kilogram of body weight intravenously administered) transverse T1-weighted and transverse and sagittal T2-weighted and T2*-weighted MR imaging in adducted and elevated arm positions. Two experienced radiologists, blinded to the histopathologic findings, analyzed images of axillary lymph nodes with regard to size, morphologic features, and USPIO uptake. A third independent radiologist served as a tiebreaker if consensus between two readers could not be reached. Visual and quantitative analyses of MR images were performed. Sensitivity, specificity, and accuracy values were calculated. To assess the effect of USPIO after administration, signal-to-noise ratio (SNR) changes were statistically analyzed with repeated-measurements analysis of variance (mixed model) for MR sequences. At nonenhanced MR imaging, of 133 lymph nodes, six were rated as true-positive, 99 as true-negative, 23 as false-positive, and five as false-negative. At USPIO-enhanced MR imaging, 11 lymph nodes were rated as true-positive, 120 as true-negative, two as false-positive, and none as false-negative. In two metastatic lymph nodes in two patients with more than one metastatic lymph node, a consensus was not reached. USPIO-enhanced MR imaging revealed a node-by-node sensitivity, specificity, and accuracy of 100%, 98%, and 98%, respectively. At USPIO-enhanced MR imaging, no metastatic lymph nodes were missed on a patient-by-patient basis. 
Significant interactions indicating differences in the decrease of SNR values for metastatic and nonmetastatic lymph nodes were found for all sequences (P < .001 to P = .022). USPIO-enhanced MR imaging appears valuable for assessment of axillary lymph node metastases in patients with breast carcinomas and is superior to nonenhanced MR imaging.
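
    The node-by-node figures quoted above follow directly from the reported counts. As a minimal sketch (the function name `diagnostic_metrics` is illustrative and not from the study), the standard definitions of sensitivity, specificity, and accuracy can be applied to the true/false positive and negative counts given in the abstract:

```python
def diagnostic_metrics(tp, tn, fp, fn):
    """Return (sensitivity, specificity, accuracy) from confusion-matrix counts."""
    sensitivity = tp / (tp + fn)                 # fraction of metastatic nodes detected
    specificity = tn / (tn + fp)                 # fraction of benign nodes correctly cleared
    accuracy = (tp + tn) / (tp + tn + fp + fn)   # fraction of all nodes rated correctly
    return sensitivity, specificity, accuracy


# Counts as reported in the abstract (133 lymph nodes in each reading).
uspio = diagnostic_metrics(tp=11, tn=120, fp=2, fn=0)
nonenhanced = diagnostic_metrics(tp=6, tn=99, fp=23, fn=5)

for label, (sens, spec, acc) in (("USPIO-enhanced", uspio), ("nonenhanced", nonenhanced)):
    print(f"{label}: sensitivity={sens:.1%} specificity={spec:.1%} accuracy={acc:.1%}")
```

    With the USPIO-enhanced counts, these definitions round to the 100%, 98%, and 98% stated in the abstract; the nonenhanced values are computed from the reported counts only and are not quoted in the source.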

  11. Long-Branch Attraction Bias and Inconsistency in Bayesian Phylogenetics

    PubMed Central

    Kolaczkowski, Bryan; Thornton, Joseph W.

    2009-01-01

    Bayesian inference (BI) of phylogenetic relationships uses the same probabilistic models of evolution as its precursor maximum likelihood (ML), so BI has generally been assumed to share ML's desirable statistical properties, such as largely unbiased inference of topology given an accurate model and increasingly reliable inferences as the amount of data increases. Here we show that BI, unlike ML, is biased in favor of topologies that group long branches together, even when the true model and prior distributions of evolutionary parameters over a group of phylogenies are known. Using experimental simulation studies and numerical and mathematical analyses, we show that this bias becomes more severe as more data are analyzed, causing BI to infer an incorrect tree as the maximum a posteriori phylogeny with asymptotically high support as sequence length approaches infinity. BI's long branch attraction bias is relatively weak when the true model is simple but becomes pronounced when sequence sites evolve heterogeneously, even when this complexity is incorporated in the model. This bias—which is apparent under both controlled simulation conditions and in analyses of empirical sequence data—also makes BI less efficient and less robust to the use of an incorrect evolutionary model than ML. Surprisingly, BI's bias is caused by one of the method's stated advantages—that it incorporates uncertainty about branch lengths by integrating over a distribution of possible values instead of estimating them from the data, as ML does. Our findings suggest that trees inferred using BI should be interpreted with caution and that ML may be a more reliable framework for modern phylogenetic analysis. PMID:20011052

  12. Long-branch attraction bias and inconsistency in Bayesian phylogenetics.

    PubMed

    Kolaczkowski, Bryan; Thornton, Joseph W

    2009-12-09

    Bayesian inference (BI) of phylogenetic relationships uses the same probabilistic models of evolution as its precursor maximum likelihood (ML), so BI has generally been assumed to share ML's desirable statistical properties, such as largely unbiased inference of topology given an accurate model and increasingly reliable inferences as the amount of data increases. Here we show that BI, unlike ML, is biased in favor of topologies that group long branches together, even when the true model and prior distributions of evolutionary parameters over a group of phylogenies are known. Using experimental simulation studies and numerical and mathematical analyses, we show that this bias becomes more severe as more data are analyzed, causing BI to infer an incorrect tree as the maximum a posteriori phylogeny with asymptotically high support as sequence length approaches infinity. BI's long branch attraction bias is relatively weak when the true model is simple but becomes pronounced when sequence sites evolve heterogeneously, even when this complexity is incorporated in the model. This bias--which is apparent under both controlled simulation conditions and in analyses of empirical sequence data--also makes BI less efficient and less robust to the use of an incorrect evolutionary model than ML. Surprisingly, BI's bias is caused by one of the method's stated advantages--that it incorporates uncertainty about branch lengths by integrating over a distribution of possible values instead of estimating them from the data, as ML does. Our findings suggest that trees inferred using BI should be interpreted with caution and that ML may be a more reliable framework for modern phylogenetic analysis.

  13. Accelerating plant DNA barcode reference library construction using herbarium specimens: improved experimental techniques.

    PubMed

    Xu, Chao; Dong, Wenpan; Shi, Shuo; Cheng, Tao; Li, Changhao; Liu, Yanlei; Wu, Ping; Wu, Hongkun; Gao, Peng; Zhou, Shiliang

    2015-11-01

    A well-covered reference library is crucial for successful identification of species by DNA barcoding. The biggest difficulty in building such a reference library is the lack of materials of organisms. Herbarium collections are potentially an enormous resource of materials. In this study, we demonstrate that it is feasible to build such reference libraries using the reconstructed (self-primed PCR amplified) DNA from the herbarium specimens. We used 179 rosaceous specimens to test the effects of DNA reconstruction, 420 randomly sampled specimens to estimate the usable percentage and another 223 specimens of true cherries (Cerasus, Rosaceae) to test the coverage of usable specimens to the species. The barcodes rbcLb (the central four-sevenths of the rbcL gene) and matK were each amplified in two halves and sequenced on Roche GS 454 FLX+. DNA from the herbarium specimens was typically shorter than 300 bp. DNA reconstruction enabled amplification of fragments of 400-500 bp without introducing or inducing any sequence errors. About one-third of specimens in the national herbarium of China (PE) were proven usable after DNA reconstruction. The specimens in PE cover all Chinese true cherry species and 91.5% of vascular species listed in Flora of China. It is very possible to build well-covered reference libraries for DNA barcoding of vascular species in China. As exemplified in this study, DNA reconstruction and DNA-labelled next-generation sequencing can accelerate the construction of local reference libraries. By putting the local reference libraries together, a global library for DNA barcoding becomes closer to reality. © 2015 John Wiley & Sons Ltd.

  14. Estimating and comparing microbial diversity in the presence of sequencing errors

    PubMed Central

    Chiu, Chun-Huo

    2016-01-01

    Estimating and comparing microbial diversity are statistically challenging due to limited sampling and possible sequencing errors for low-frequency counts, producing spurious singletons. The inflated singleton count seriously affects statistical analysis and inferences about microbial diversity. Previous statistical approaches to tackle the sequencing errors generally require different parametric assumptions about the sampling model or about the functional form of frequency counts. Different parametric assumptions may lead to drastically different diversity estimates. We focus on nonparametric methods which are universally valid for all parametric assumptions and can be used to compare diversity across communities. We develop here a nonparametric estimator of the true singleton count to replace the spurious singleton count in all methods/approaches. Our estimator of the true singleton count is in terms of the frequency counts of doubletons, tripletons and quadrupletons, provided these three frequency counts are reliable. To quantify microbial alpha diversity for an individual community, we adopt the measure of Hill numbers (effective number of taxa) under a nonparametric framework. Hill numbers, parameterized by an order q that determines the measures’ emphasis on rare or common species, include taxa richness (q = 0), Shannon diversity (q = 1, the exponential of Shannon entropy), and Simpson diversity (q = 2, the inverse of Simpson index). A diversity profile which depicts the Hill number as a function of order q conveys all information contained in a taxa abundance distribution. Based on the estimated singleton count and the original non-singleton frequency counts, two statistical approaches (non-asymptotic and asymptotic) are developed to compare microbial diversity for multiple communities. (1) A non-asymptotic approach refers to the comparison of estimated diversities of standardized samples with a common finite sample size or sample completeness. 
This approach aims to compare diversity estimates for equally-large or equally-complete samples; it is based on the seamless rarefaction and extrapolation sampling curves of Hill numbers, specifically for q = 0, 1 and 2. (2) An asymptotic approach refers to the comparison of the estimated asymptotic diversity profiles. That is, this approach compares the estimated profiles for complete samples or samples whose size tends to be sufficiently large. It is based on statistical estimation of the true Hill number of any order q ≥ 0. In the two approaches, replacing the spurious singleton count by our estimated count, we can greatly remove the positive biases associated with diversity estimates due to spurious singletons and also make fair comparisons across microbial communities, as illustrated in our simulation results and in applying our method to analyze sequencing data from viral metagenomes. PMID:26855872
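
    The Hill numbers described above can be illustrated with a short sketch. This is only the plug-in (empirical) form computed from observed counts, assuming no sequencing errors; it is not the singleton-corrected estimator developed in the paper, and the function name `hill_number` is illustrative:

```python
import math


def hill_number(counts, q):
    """Empirical Hill number (effective number of taxa) of order q >= 0."""
    total = sum(counts)
    # Relative abundances of the taxa actually observed.
    p = [c / total for c in counts if c > 0]
    if q == 1:
        # The q -> 1 limit: exponential of Shannon entropy.
        return math.exp(-sum(pi * math.log(pi) for pi in p))
    # General case: (sum of p_i^q)^(1 / (1 - q)).
    return sum(pi ** q for pi in p) ** (1.0 / (1.0 - q))


# Hypothetical abundance vector, for illustration only.
counts = [50, 30, 15, 4, 1]
for q in (0, 1, 2):
    print(q, hill_number(counts, q))
```

    For q = 0 the result is the taxa richness, for q = 1 the Shannon diversity, and for q = 2 the inverse Simpson index, matching the three special cases named in the abstract. A perfectly even community of S taxa gives S for every order q.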

  15. Inferring the shallow phylogeny of true salamanders (Salamandra) by multiple phylogenomic approaches.

    PubMed

    Rodríguez, Ariel; Burgon, James D; Lyra, Mariana; Irisarri, Iker; Baurain, Denis; Blaustein, Leon; Göçmen, Bayram; Künzel, Sven; Mable, Barbara K; Nolte, Arne W; Veith, Michael; Steinfartz, Sebastian; Elmer, Kathryn R; Philippe, Hervé; Vences, Miguel

    2017-10-01

    The rise of high-throughput sequencing techniques provides the unprecedented opportunity to analyse controversial phylogenetic relationships in great depth, but also introduces a risk of being misinterpreted by high node support values influenced by unevenly distributed missing data or unrealistic model assumptions. Here, we use three largely independent phylogenomic data sets to reconstruct the controversial phylogeny of true salamanders of the genus Salamandra, a group of amphibians providing an intriguing model to study the evolution of aposematism and viviparity. For all six species of the genus Salamandra, and two outgroup species from its sister genus Lyciasalamandra, we used RNA sequencing (RNAseq) and restriction site associated DNA sequencing (RADseq) to obtain data for: (1) 3070 nuclear protein-coding genes from RNAseq; (2) 7440 loci obtained by RADseq; and (3) full mitochondrial genomes. The RNAseq and RADseq data sets retrieved fully congruent topologies when each of them was analyzed in a concatenation approach, with high support for: (1) S. infraimmaculata being sister group to all other Salamandra species; (2) S. algira being sister to S. salamandra; (3) these two species being the sister group to a clade containing S. atra, S. corsica and S. lanzai; and (4) the alpine species S. atra and S. lanzai being sister taxa. The phylogeny inferred from the mitochondrial genome sequences differed from these results, most notably by strongly supporting a clade containing S. atra and S. corsica as sister taxa. A different placement of S. corsica was also retrieved when analysing the RNAseq and RADseq data under species tree approaches. Closer examination of gene trees derived from RNAseq revealed that only a low number of them supported each of the alternative placements of S. atra. Furthermore, gene jackknife support for the S. atra - S. lanzai node stabilized only with very large concatenated data sets. 
The phylogeny of true salamanders thus provides a compelling example of how classical node support metrics such as bootstrap and Bayesian posterior probability can provide high confidence values in a phylogenomic topology even if the phylogenetic signal for some nodes is spurious, highlighting the importance of complementary approaches such as gene jackknifing. Yet, the general congruence among the topologies recovered from the RNAseq and RADseq data sets increases our confidence in the results, and validates the use of phylotranscriptomic approaches for reconstructing shallow relationships among closely related taxa. We hypothesize that the evolution of Salamandra has been characterized by episodes of introgressive hybridization, which would explain the difficulties of fully reconstructing their evolutionary relationships. Copyright © 2017. Published by Elsevier Inc.

  16. Evolution, language and analogy in functional genomics.

    PubMed

    Benner, S A; Gaucher, E A

    2001-07-01

    Almost a century ago, Wittgenstein pointed out that theory in science is intricately connected to language. This connection is not a frequent topic in the genomics literature. But a case can be made that functional genomics is today hindered by the paradoxes that Wittgenstein identified. If this is true, until these paradoxes are recognized and addressed, functional genomics will continue to be limited in its ability to extrapolate information from genomic sequences.

  17. Evolution, language and analogy in functional genomics

    NASA Technical Reports Server (NTRS)

    Benner, S. A.; Gaucher, E. A.

    2001-01-01

    Almost a century ago, Wittgenstein pointed out that theory in science is intricately connected to language. This connection is not a frequent topic in the genomics literature. But a case can be made that functional genomics is today hindered by the paradoxes that Wittgenstein identified. If this is true, until these paradoxes are recognized and addressed, functional genomics will continue to be limited in its ability to extrapolate information from genomic sequences.

  18. Pharmacological Studies on Clostridial Neurotoxins.

    DTIC Science & Technology

    1982-08-01

    1974). The entire (e.g., dichain) molecule is needed to poison intact cells, but only the light chain polypeptide is needed to inhibit protein...synthesis in broken cell preparations. The sequence of events that underlies the ability of diphtheria toxin to poison eukaryotic cells is not unique to...may belong to a novel class of internalized poisons. In at least one respect the latter possibility is true. Most toxins that are internalized act in

  19. Automatic Generation of Mechanical Assembly Sequences

    DTIC Science & Technology

    1988-12-01

    Planning Algorithm for General Robot Manipulators. In AAAI-86 Proceedings of the Fifth National Conference on Artificial Intelligence, pages 626-631...topic in artificial intelligence, and the AI approach has dominated much of the research in robot task planning using domain-independent methods. The...computed, using the data in the relational model: The GEOMETRIC-FEASIBILITY predicate which is true if there exists a collision-free path to bring the two

  20. Utility of whole-genome sequencing for detection of newborn screening disorders in a population cohort of 1,696 neonates.

    PubMed

    Bodian, Dale L; Klein, Elisabeth; Iyer, Ramaswamy K; Wong, Wendy S W; Kothiyal, Prachi; Stauffer, Daniel; Huddleston, Kathi C; Gaither, Amber D; Remsburg, Irina; Khromykh, Alina; Baker, Robin L; Maxwell, George L; Vockley, Joseph G; Niederhuber, John E; Solomon, Benjamin D

    2016-03-01

    To assess the potential of whole-genome sequencing (WGS) to replicate and augment results from conventional blood-based newborn screening (NBS). Research-generated WGS data from an ancestrally diverse cohort of 1,696 infants and both parents of each infant were analyzed for variants in 163 genes involved in disorders included or under discussion for inclusion in US NBS programs. WGS results were compared with results from state NBS and related follow-up testing. NBS genes are generally well covered by WGS. There is a median of one (range: 0-6) database-annotated pathogenic variant in the NBS genes per infant. Results of WGS and NBS in detecting 28 state-screened disorders and four hemoglobin traits were concordant for 88.6% of true positives (n = 35) and 98.9% of true negatives (n = 45,757). Of the five infants affected with a state-screened disorder, WGS identified two whereas NBS detected four. WGS yielded fewer false positives than NBS (0.037 vs. 0.17%) but more results of uncertain significance (0.90 vs. 0.013%). WGS may help rule in and rule out NBS disorders, pinpoint molecular diagnoses, and detect conditions not amenable to current NBS assays.

  1. Reconstructing metastatic seeding patterns of human cancers

    PubMed Central

    Reiter, Johannes G.; Makohon-Moore, Alvin P.; Gerold, Jeffrey M.; Bozic, Ivana; Chatterjee, Krishnendu; Iacobuzio-Donahue, Christine A.; Vogelstein, Bert; Nowak, Martin A.

    2017-01-01

    Reconstructing the evolutionary history of metastases is critical for understanding their basic biological principles and has profound clinical implications. Genome-wide sequencing data has enabled modern phylogenomic methods to accurately dissect subclones and their phylogenies from noisy and impure bulk tumour samples at unprecedented depth. However, existing methods are not designed to infer metastatic seeding patterns. Here we develop a tool, called Treeomics, to reconstruct the phylogeny of metastases and map subclones to their anatomic locations. Treeomics infers comprehensive seeding patterns for pancreatic, ovarian, and prostate cancers. Moreover, Treeomics correctly disambiguates true seeding patterns from sequencing artifacts; 7% of variants were misclassified by conventional statistical methods. These artifacts can skew phylogenies by creating illusory tumour heterogeneity among distinct samples. In silico benchmarking on simulated tumour phylogenies across a wide range of sample purities (15–95%) and sequencing depths (25–800×) demonstrates the accuracy of Treeomics compared with existing methods. PMID:28139641

  2. On the design of henon and logistic map-based random number generator

    NASA Astrophysics Data System (ADS)

    Magfirawaty; Suryadi, M. T.; Ramli, Kalamullah

    2017-10-01

    The key sequence is one of the main elements of a cryptosystem. A True Random Number Generator (TRNG) is one approach to generating the key sequence. TRNG randomness sources fall into three main groups: electrical-noise-based, jitter-based, and chaos-based. Chaos-based sources use a non-linear dynamic system (continuous time or discrete time) as an entropy source. In this study, a new TRNG design based on a discrete-time chaotic system is proposed and then simulated in LabVIEW. The design combines 2D and 1D chaotic systems. A mathematical model is implemented for numerical simulations. We used a comparator process as the harvesting method to obtain the series of random bits. Without any post-processing, the proposed design generated a random bit sequence with a high entropy value and passed all NIST 800-22 statistical tests.
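
    The abstract does not give the exact construction, so the following is only a hedged sketch of the general idea: a 2D Hénon map and a 1D logistic map are iterated side by side, and a comparator emits one bit per step. The parameter values (a = 1.4, b = 0.3, r = 3.99), the initial conditions, the rescaling of the Hénon state, and the function name `chaotic_bits` are all assumptions made for illustration, not the authors' design. A software simulation of this kind is deterministic for a given seed, so it demonstrates the bit-harvesting step rather than true randomness.

```python
def chaotic_bits(n, henon_xy=(0.0, 0.0), logistic_x=0.4, a=1.4, b=0.3, r=3.99):
    """Harvest n bits by comparing a Henon-map state with a logistic-map state."""
    x, y = henon_xy
    z = logistic_x
    bits = []
    for _ in range(n):
        # 2D Henon map: x' = 1 - a*x^2 + y, y' = b*x
        x, y = 1.0 - a * x * x + y, b * x
        # 1D logistic map: z' = r*z*(1 - z), which stays in (0, 1)
        z = r * z * (1.0 - z)
        # Rescale the Henon x-coordinate (roughly within [-1.5, 1.5]) to [0, 1].
        x_unit = (x + 1.5) / 3.0
        # Comparator (assumed rule): emit 1 when the Henon sample exceeds the logistic one.
        bits.append(1 if x_unit > z else 0)
    return bits


bits = chaotic_bits(1000)
print("fraction of ones:", sum(bits) / len(bits))
```

    A real evaluation would feed a much longer bit stream to a statistical test suite such as the one named in the abstract; the fraction of ones printed here is only a quick sanity check.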

  3. A Statistical Framework for the Functional Analysis of Metagenomes

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Sharon, Itai; Pati, Amrita; Markowitz, Victor

    2008-10-01

    Metagenomic studies consider the genetic makeup of microbial communities as a whole, rather than their individual member organisms. The functional and metabolic potential of microbial communities can be analyzed by comparing the relative abundance of gene families in their collective genomic sequences (metagenome) under different conditions. Such comparisons require accurate estimation of gene family frequencies. They present a statistical framework for assessing these frequencies based on the Lander-Waterman theory developed originally for Whole Genome Shotgun (WGS) sequencing projects. They also provide a novel method for assessing the reliability of the estimations which can be used for removing seemingly unreliable measurements. They tested their method on a wide range of datasets, including simulated genomes and real WGS data from sequencing projects of whole genomes. Results suggest that their framework corrects inherent biases in accepted methods and provides a good approximation to the true statistics of gene families in WGS projects.

  4. Robustness of composite pulse sequences to time-dependent noise

    NASA Astrophysics Data System (ADS)

    Kabytayev, Chingiz; Green, Todd J.; Khodjasteh, Kaveh; Viola, Lorenza; Biercuk, Michael J.; Brown, Kenneth R.

    2014-03-01

    Quantum control protocols can minimize the effect of noise sources that reduce the quality of quantum operations. Originally developed for NMR, composite pulse sequences correct for unknown static control errors. We study these compensating pulses in the general case of time-varying Gaussian control noise using a filter-function approach and detailed numerics. Three different noise models were considered in this work: amplitude noise, detuning noise and the simultaneous presence of both. Pulse sequences are shown to be robust to noise up to frequencies as high as ~10% of the Rabi frequency. Robustness of pulses designed for amplitude noise is explained using a geometric picture that follows naturally from the filter function. We also discuss future directions, including new pulses that correct for noise of a certain frequency. J. True Merrill and Kenneth R. Brown. arXiv:1203.6392v1. In press, Adv. Chem. Phys. (2013)

  5. Evolution stings: the origin and diversification of scorpion toxin peptide scaffolds.

    PubMed

    Sunagar, Kartik; Undheim, Eivind A B; Chan, Angelo H C; Koludarov, Ivan; Muñoz-Gómez, Sergio A; Antunes, Agostinho; Fry, Bryan G

    2013-12-13

    The episodic nature of natural selection and the accumulation of extreme sequence divergence in venom-encoding genes over long periods of evolutionary time can obscure the signature of positive Darwinian selection. Recognition of the true biocomplexity is further hampered by the limited taxon selection, with easy to obtain or medically important species typically being the subject of intense venom research, relative to the actual taxonomical diversity in nature. This holds true for scorpions, which are one of the most ancient terrestrial venomous animal lineages. The family Buthidae that includes all the medically significant species has been intensely investigated around the globe, while almost completely ignoring the remaining non-buthid families. Australian scorpion lineages, for instance, have been completely neglected, with only a single scorpion species (Urodacus yaschenkoi) having its venom transcriptome sequenced. Hence, the lack of venom composition and toxin sequence information from an entire continent's worth of scorpions has impeded our understanding of the molecular evolution of scorpion venom. The molecular origin, phylogenetic relationships and evolutionary histories of most scorpion toxin scaffolds remain enigmatic. In this study, we have sequenced venom gland transcriptomes of a wide taxonomical diversity of scorpions from Australia, including buthid and non-buthid representatives. Using state-of-art molecular evolutionary analyses, we show that a majority of CSα/β toxin scaffolds have experienced episodic influence of positive selection, while most non-CSα/β linear toxins evolve under the extreme influence of negative selection. 
For the first time, we have unraveled the molecular origin of the major scorpion toxin scaffolds, such as scorpion venom single von Willebrand factor C-domain peptides (SV-SVC), inhibitor cystine knot (ICK), disulphide-directed beta-hairpin (DDH), bradykinin potentiating peptides (BPP), linear non-disulphide bridged peptides and antimicrobial peptides (AMP). We have thus demonstrated that even neglected lineages of scorpions are a rich pool of novel biochemical components, which have evolved over millions of years to target specific ion channels in prey animals, and as a result, possess tremendous implications in therapeutics.

  6. Critical study of the distribution of rotational velocities of Be stars. I. Deconvolution methods, effects due to gravity darkening, macroturbulence, and binarity

    NASA Astrophysics Data System (ADS)

    Zorec, J.; Frémat, Y.; Domiciano de Souza, A.; Royer, F.; Cidale, L.; Hubert, A.-M.; Semaan, T.; Martayan, C.; Cochetti, Y. R.; Arias, M. L.; Aidelman, Y.; Stee, P.

    2016-11-01

    Context. Among intermediate-mass and massive stars, Be stars are the fastest rotators in the main sequence (MS) and, as such, these stars are a cornerstone to validate models of structure and evolution of rotating stars. Several phenomena, however, induce under- or overestimations either of their apparent Vsini, or true velocity V. Aims: In the present contribution we aim to obtain distributions of true rotational velocities corrected for systematic effects induced by the rapid rotation itself, macroturbulent velocities, and binarity. Methods: We study a set of 233 Be stars by assuming they have inclination angles distributed at random. We critically discuss the methods of Cranmer and Lucy-Richardson, which enable us to transform a distribution of projected velocities into another distribution of true rotational velocities, where the gravitational darkening effect on the Vsini parameter is considered in different ways. We conclude that the iterative Lucy-Richardson algorithm best serves the purposes of the present work, but it requires a thorough determination of the stellar fundamental parameters. Results: We conclude that once the mode of ratios of the true velocities of Be stars attains the value V/Vc ≃ 0.77 in the main-sequence (MS) evolutionary phase, it remains unchanged up to the end of the MS lifespan. The statistical corrections found on the distribution of ratios V/Vc for overestimations of Vsini, due to macroturbulent motions and binarity, produce a shift of this distribution toward lower values of V/Vc when Be stars in all MS evolutionary stages are considered together. The mode of the final distribution obtained is at V/Vc ≃ 0.65. This distribution is nearly symmetric and shows that the Be phenomenon is characterized by a wide range of true velocity ratios 0.3 ≲ V/Vc ≲ 0.95. It thus suggests that the probability that Be stars are critical rotators is extremely low.
Conclusions: The corrections attempted in the present work represent an initial step to infer indications about the nature of the Be-star surface rotation that will be studied in the second paper of this series. Full Tables 1 and 4 are only available at the CDS via anonymous ftp to http://cdsarc.u-strasbg.fr (http://130.79.128.5) or via http://cdsarc.u-strasbg.fr/viz-bin/qcat?J/A+A/595/A132
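
    The deconvolution step described in this record can be illustrated with a minimal sketch. It assumes only what the abstract states (randomly oriented rotation axes) and applies the generic Lucy-Richardson iteration to a binned histogram of projected velocities. The function names, the bin layout, and the toy input are hypothetical; gravity darkening, macroturbulence, and binarity are deliberately left out, so this is not the authors' implementation.

```python
import math


def projection_kernel(n_bins, width):
    """kernel[j][i]: probability that a star whose true velocity sits at the
    centre of bin i is observed with projected velocity V sin i in bin j,
    assuming random orientations (cos i uniform on [0, 1])."""
    def cdf(y, v):
        # P(V sin i <= y | V = v) for randomly oriented axes
        if y >= v:
            return 1.0
        return 1.0 - math.sqrt(1.0 - (y / v) ** 2)

    kernel = [[0.0] * n_bins for _ in range(n_bins)]
    for i in range(n_bins):
        v = (i + 0.5) * width
        for j in range(n_bins):
            kernel[j][i] = cdf((j + 1) * width, v) - cdf(j * width, v)
    return kernel


def lucy_richardson(observed, kernel, n_iter=200):
    """Generic Lucy-Richardson iteration on binned counts."""
    n = len(observed)
    estimate = [sum(observed) / n] * n  # flat starting guess
    for _ in range(n_iter):
        predicted = [sum(kernel[j][i] * estimate[i] for i in range(n))
                     for j in range(n)]
        ratio = [observed[j] / predicted[j] if predicted[j] > 0 else 0.0
                 for j in range(n)]
        estimate = [estimate[i] * sum(kernel[j][i] * ratio[j] for j in range(n))
                    for i in range(n)]
    return estimate


# Toy check (hypothetical data): 100 stars that all share one true velocity bin.
kernel = projection_kernel(5, 100.0)
observed = [100.0 * kernel[j][3] for j in range(5)]
recovered = lucy_richardson(observed, kernel)
```

    Because each kernel column sums to one, the iteration conserves the total number of stars, and for this noise-free toy input the recovered histogram peaks in the bin that generated the data.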

  7. Factors That Affect Large Subunit Ribosomal DNA Amplicon Sequencing Studies of Fungal Communities: Classification Method, Primer Choice, and Error

    PubMed Central

    Porter, Teresita M.; Golding, G. Brian

    2012-01-01

    Nuclear large subunit ribosomal DNA is widely used in fungal phylogenetics and, to an increasing extent, also in amplicon-based environmental sequencing. The relatively short reads produced by next-generation sequencing, however, make primer choice and sequence error important variables for obtaining accurate taxonomic classifications. In this simulation study we tested the performance of three classification methods: 1) a similarity-based method (BLAST + Metagenomic Analyzer, MEGAN); 2) a composition-based method (Ribosomal Database Project naïve Bayesian classifier, NBC); and, 3) a phylogeny-based method (Statistical Assignment Package, SAP). We also tested the effects of sequence length, primer choice, and sequence error on classification accuracy and perceived community composition. Using a leave-one-out cross validation approach, results for classifications to the genus rank were as follows: BLAST + MEGAN had the lowest error rate and was particularly robust to sequence error; SAP accuracy was highest when long LSU query sequences were classified; and, NBC runs significantly faster than the other tested methods. All methods performed poorly with the shortest 50–100 bp sequences. Increasing simulated sequence error reduced classification accuracy. Community shifts were detected due to sequence error and primer selection even though there was no change in the underlying community composition. Short read datasets from individual primers, as well as pooled datasets, appear to only approximate the true community composition. We hope this work informs investigators of some of the factors that affect the quality and interpretation of their environmental gene surveys. PMID:22558215

  8. Insight into biases and sequencing errors for amplicon sequencing with the Illumina MiSeq platform.

    PubMed

    Schirmer, Melanie; Ijaz, Umer Z; D'Amore, Rosalinda; Hall, Neil; Sloan, William T; Quince, Christopher

    2015-03-31

    With read lengths of currently up to 2 × 300 bp, high throughput and low sequencing costs Illumina's MiSeq is becoming one of the most utilized sequencing platforms worldwide. The platform is manageable and affordable even for smaller labs. This enables quick turnaround on a broad range of applications such as targeted gene sequencing, metagenomics, small genome sequencing and clinical molecular diagnostics. However, Illumina error profiles are still poorly understood and programs are therefore not designed for the idiosyncrasies of Illumina data. A better knowledge of the error patterns is essential for sequence analysis and vital if we are to draw valid conclusions. Studying true genetic variation in a population sample is fundamental for understanding diseases, evolution and origin. We conducted a large study on the error patterns for the MiSeq based on 16S rRNA amplicon sequencing data. We tested state-of-the-art library preparation methods for amplicon sequencing and showed that the library preparation method and the choice of primers are the most significant sources of bias and cause distinct error patterns. Furthermore we tested the efficiency of various error correction strategies and identified quality trimming (Sickle) combined with error correction (BayesHammer) followed by read overlapping (PANDAseq) as the most successful approach, reducing substitution error rates on average by 93%. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.

  9. Prediction of enhancer-promoter interactions via natural language processing.

    PubMed

    Zeng, Wanwen; Wu, Mengmeng; Jiang, Rui

    2018-05-09

    Precise identification of three-dimensional genome organization, especially enhancer-promoter interactions (EPIs), is important to deciphering gene regulation, cell differentiation and disease mechanisms. Currently, it is a challenging task to distinguish true interactions from other nearby non-interacting ones since the power of traditional experimental methods is limited due to low resolution or low throughput. We propose a novel computational framework EP2vec to assay three-dimensional genomic interactions. We first extract sequence embedding features, defined as fixed-length vector representations learned from variable-length sequences using an unsupervised deep learning method in natural language processing. Then, we train a classifier to predict EPIs using the learned representations in a supervised way. Experimental results demonstrate that EP2vec obtains F1 scores ranging from 0.841 to 0.933 on different datasets, which outperforms existing methods. We prove the robustness of sequence embedding features by carrying out sensitivity analysis. Besides, we identify motifs that represent cell line-specific information through analysis of the learned sequence embedding features by adopting an attention mechanism. Last, we show that even superior performance with F1 scores of 0.889 to 0.940 can be achieved by combining sequence embedding features and experimental features. EP2vec sheds light on feature extraction for DNA sequences of arbitrary lengths and provides a powerful approach for EPI identification.

  10. A massive parallel sequencing workflow for diagnostic genetic testing of mismatch repair genes

    PubMed Central

    Hansen, Maren F; Neckmann, Ulrike; Lavik, Liss A S; Vold, Trine; Gilde, Bodil; Toft, Ragnhild K; Sjursen, Wenche

    2014-01-01

    The purpose of this study was to develop a massive parallel sequencing (MPS) workflow for diagnostic analysis of mismatch repair (MMR) genes using the GS Junior system (Roche). A pathogenic variant in one of four MMR genes (MLH1, PMS2, MSH6, and MSH2) is the cause of Lynch Syndrome (LS), which mainly predisposes to colorectal cancer. We used an amplicon-based sequencing method allowing specific and preferential amplification of the MMR genes including PMS2, of which several pseudogenes exist. The amplicons were pooled at different ratios to obtain coverage uniformity and maximize the throughput of a single GS Junior run. In total, 60 previously identified and distinct variants (substitutions and indels) were sequenced by MPS and successfully detected. The heterozygote detection range was from 19% to 63% and dependent on sequence context and coverage. We were able to distinguish between false-positive and true-positive calls in homopolymeric regions by cross-sample comparison and evaluation of flow signal distributions. In addition, we filtered variants according to a predefined status, which facilitated variant annotation. Our study shows that implementation of MPS in routine diagnostics of LS can accelerate sample throughput and reduce costs without compromising sensitivity, compared to Sanger sequencing. PMID:24689082

  11. Response to comment on "Nuclear genomic sequences reveal that polar bears are an old and distinct bear lineage".

    PubMed

    Hailer, Frank; Kutschera, Verena E; Hallström, Björn M; Fain, Steven R; Leonard, Jennifer A; Arnason, Ulfur; Janke, Axel

    2013-03-29

    Nakagome et al. reanalyzed some of our data and assert that we cannot refute the mitochondrial DNA-based scenario for polar bear evolution. Their single-locus test statistic is strongly affected by introgression and incomplete lineage sorting, whereas our multilocus approaches are better suited to recover the true species relationships. Indeed, our sister-lineage model receives high support in a Bayesian model comparison.

  12. Human evolution: a tale from ancient genomes

    PubMed Central

    2017-01-01

    The field of human ancient DNA (aDNA) has moved from mitochondrial sequencing that suffered from contamination and provided limited biological insights, to become a fully genomic discipline that is changing our conception of human history. Recent successes include the sequencing of extinct hominins, and true population genomic studies of Bronze Age populations. Among the emerging areas of aDNA research, the analysis of past epigenomes is set to provide more new insights into human adaptation and disease susceptibility through time. Starting as a mere curiosity, ancient human genetics has become a major player in the understanding of our evolutionary history. This article is part of the themed issue ‘Evo-devo in the genomics era, and the origins of morphological diversity’. PMID:27994125

  13. Spectral analysis of time series of categorical variables in earth sciences

    NASA Astrophysics Data System (ADS)

    Pardo-Igúzquiza, Eulogio; Rodríguez-Tovar, Francisco J.; Dorador, Javier

    2016-10-01

    Time series of categorical variables often appear in Earth Science disciplines and there is considerable interest in studying their cyclic behavior. This is true, for example, when the type of facies, petrofabric features, ichnofabrics, fossil assemblages or mineral compositions are measured continuously over a core or throughout a stratigraphic succession. Here we deal with the problem of applying spectral analysis to such sequences. A full indicator approach is proposed to complement the spectral envelope often used in other disciplines. Additionally, a stand-alone computer program is provided for calculating the spectral envelope, in this case implementing the permutation test to assess the statistical significance of the spectral peaks. We studied simulated sequences as well as real data in order to illustrate the methodology.

  14. Screening and expression of selected taxonomically conserved and unique hypothetical proteins in Burkholderia pseudomallei K96243

    NASA Astrophysics Data System (ADS)

    Akhir, Nor Azurah Mat; Nadzirin, Nurul; Mohamed, Rahmah; Firdaus-Raih, Mohd

    2015-09-01

    Hypothetical proteins of bacterial pathogens represent a large number of novel biological mechanisms which could belong to essential pathways in the bacteria. They lack functional characterizations mainly due to the inability of sequence homology based methods to detect functional relationships in the absence of detectable sequence similarity. The dataset derived from this study showed 550 candidates that are conserved in genomes that have pathogenicity information and are present only in the Burkholderiales order. The dataset has been narrowed down to taxonomic clusters. Ten proteins were selected for ORF amplification; seven of them were successfully amplified, and only four proteins were successfully expressed. These proteins will be great candidates for determining the true function via structural biology.

  15. Things fall apart: biological species form unconnected parsimony networks.

    PubMed

    Hart, Michael W; Sunday, Jennifer

    2007-10-22

    The generality of operational species definitions is limited by problematic definitions of between-species divergence. A recent phylogenetic species concept based on a simple objective measure of statistically significant genetic differentiation uses between-species application of statistical parsimony networks that are typically used for population genetic analysis within species. Here we review recent phylogeographic studies and reanalyse several mtDNA barcoding studies using this method. We found that (i) alignments of DNA sequences typically fall apart into a separate subnetwork for each Linnean species (but with a higher rate of true positives for mtDNA data) and (ii) DNA sequences from single species typically stick together in a single haplotype network. Departures from these patterns are usually consistent with hybridization or cryptic species diversity.

  16. Complete genome sequence of Anabaena variabilis ATCC 29413

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Thiel, Teresa; Pratte, Brenda S.; Zhong, Jinshun

    2013-01-01

    Anabaena variabilis ATCC 29413 is a filamentous, heterocyst-forming cyanobacterium that has served as a model organism, with an extensive literature extending over 40 years. The strain has three distinct nitrogenases that function under different environmental conditions and is capable of photoautotrophic growth in the light and true heterotrophic growth in the dark using fructose as both carbon and energy source. While this strain was first isolated in 1964 in Mississippi and named Anabaena flos-aquae MSU A-37, it clusters phylogenetically with cyanobacteria of the genus Nostoc. The strain is a moderate thermophile, growing well at approximately 40 °C. Here we provide some additional characteristics of the strain, and an analysis of the complete genome sequence.

  17. Full Genome Sequencing Reveals New Southern African Territories Genotypes Bringing Us Closer to Understanding True Variability of Foot-and-Mouth Disease Virus in Africa

    PubMed Central

    Lasecka-Dykes, Lidia; Wright, Caroline F.; Di Nardo, Antonello; Logan, Grace; Mioulet, Valerie; Jackson, Terry; Tuthill, Tobias J.; Knowles, Nick J.; King, Donald P.

    2018-01-01

    Foot-and-mouth disease virus (FMDV) causes a highly contagious disease of cloven-hooved animals that poses a constant burden on farmers in endemic regions and threatens the livestock industries in disease-free countries. Despite the increased number of publicly available whole genome sequences, FMDV data are biased by the opportunistic nature of sampling. Since whole genomic sequences of Southern African Territories (SAT) are particularly underrepresented, this study sequenced 34 isolates from eastern and southern Africa. Phylogenetic analyses revealed two novel genotypes (that comprised 8/34 of these SAT isolates) which contained unusual 5′ untranslated and non-structural encoding regions. While recombination has occurred between these sequences, phylogeny violation analyses indicated that the high degree of sequence diversity for the novel SAT genotypes has not solely arisen from recombination events. Based on estimates of the timing of ancestral divergence, these data are interpreted as being representative of un-sampled FMDV isolates that have been subjected to geographical isolation within Africa by the effects of the Great African Rinderpest Pandemic (1887–1897), which caused a mass die-out of FMDV-susceptible hosts. These findings demonstrate that further sequencing of African FMDV isolates is likely to reveal more unusual genotypes and will allow for better understanding of natural variability and evolution of FMDV. PMID:29652800

  18. The Evolution of Mobile DNAs: When Will Transposons Create Phylogenies That Look As If There Is a Master Gene?

    PubMed Central

    Brookfield, John F. Y.; Johnson, Louise J.

    2006-01-01

    Some families of mammalian interspersed repetitive DNA, such as the Alu SINE sequence, appear to have evolved by the serial replacement of one active sequence with another, consistent with there being a single source of transposition: the “master gene.” Alternative models, in which multiple source sequences are simultaneously active, have been called “transposon models.” Transposon models differ in the proportion of elements that are active and in whether inactivation occurs at the moment of transposition or later. Here we examine the predictions of various types of transposon model regarding the patterns of sequence variation expected at an equilibrium between transposition, inactivation, and deletion. Under the master gene model, all bifurcations in the true tree of elements occur in a single lineage. We show that this property will also hold approximately for transposon models in which most elements are inactive and where at least some of the inactivation events occur after transposition. Such tree shapes are therefore not conclusive evidence for a single source of transposition. PMID:16790583

  19. Ongoing behavior predicts perceptual report of interval duration

    PubMed Central

    Gouvêa, Thiago S.; Monteiro, Tiago; Soares, Sofia; Atallah, Bassam V.; Paton, Joseph J.

    2014-01-01

    The ability to estimate the passage of time is essential for adaptive behavior in complex environments. Yet, it is not known how the brain encodes time over the durations necessary to explain animal behavior. Under temporally structured reinforcement schedules, animals tend to develop temporally structured behavior, and interval timing has been suggested to be accomplished by learning sequences of behavioral states. If this is true, trial-to-trial fluctuations in behavioral sequences should be predictive of fluctuations in time estimation. We trained rodents in a duration categorization task while continuously monitoring their behavior with a high-speed camera. Animals developed highly reproducible behavioral sequences during the interval being timed. Moreover, those sequences were often predictive of perceptual report from early in the trial, providing support to the idea that animals may use learned behavioral patterns to estimate the duration of time intervals. To better resolve the issue, we propose that continuous and simultaneous behavioral and neural monitoring will enable identification of neural activity related to time perception that is not explained by ongoing behavior. PMID:24672473

  20. Diagnosis of twin-to-twin transfusion syndrome, selective fetal growth restriction, twin anaemia-polycythaemia sequence, and twin reversed arterial perfusion sequence.

    PubMed

    Sueters, Marieke; Oepkes, Dick

    2014-02-01

    Monochorionic twin pregnancies are well known to be at risk for a variety of severe complications, a true challenge for the maternal-fetal medicine specialist. With current standards of care, monochorionicity should be established in the first trimester. Subsequently, frequent monitoring using the appropriate diagnostic tools, and in-depth knowledge about the pathophysiology of all possible clinical presentations of monochorionic twin abnormalities, should lead to timely recognition and appropriate management. Virtually all unique diseases found in monochorionic twins are directly related to placental angio-architecture. This, however, cannot be established reliably before birth. The clinician needs to be aware of the definitions and symptoms of twin-to-twin transfusion syndrome, selective fetal growth restriction, twin anaemia-polycythaemia sequence, and twin reversed arterial perfusion sequence, to be able to recognise each disease and take the required action. In this chapter, we address current standards on correct and timely diagnoses of severe complications of monochorionic twin pregnancies. Copyright © 2014 Elsevier Ltd. All rights reserved.

  1. Error and Error Mitigation in Low-Coverage Genome Assemblies

    PubMed Central

    Hubisz, Melissa J.; Lin, Michael F.; Kellis, Manolis; Siepel, Adam

    2011-01-01

    The recent release of twenty-two new genome sequences has dramatically increased the data available for mammalian comparative genomics, but twenty of these new sequences are currently limited to ∼2× coverage. Here we examine the extent of sequencing error in these 2× assemblies, and its potential impact in downstream analyses. By comparing 2× assemblies with high-quality sequences from the ENCODE regions, we estimate the rate of sequencing error to be 1–4 errors per kilobase. While this error rate is fairly modest, sequencing error can still have surprising effects. For example, an apparent lineage-specific insertion in a coding region is more likely to reflect sequencing error than a true biological event, and the length distribution of coding indels is strongly distorted by error. We find that most errors are contributed by a small fraction of bases with low quality scores, in particular, by the ends of reads in regions of single-read coverage in the assembly. We explore several approaches for automatic sequencing error mitigation (SEM), making use of the localized nature of sequencing error, the fact that it is well predicted by quality scores, and information about errors that comes from comparisons across species. Our automatic methods for error mitigation cannot replace the need for additional sequencing, but they do allow substantial fractions of errors to be masked or eliminated at the cost of modest amounts of over-correction, and they can reduce the impact of error in downstream phylogenomic analyses. Our error-mitigated alignments are available for download. PMID:21340033
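
    The observation that most errors come from a small fraction of low-quality bases suggests the simplest possible form of masking, sketched below. It relies only on the standard Phred definition (error probability 10^(-Q/10)); the function names and the threshold of 20 are illustrative assumptions, and the record's own mitigation methods also use cross-species comparisons that this sketch does not attempt.

```python
def mask_low_quality(sequence, qualities, min_quality=20):
    """Replace every base whose Phred score is below min_quality with 'N'.
    The threshold of 20 is an illustrative choice, not a value from the record."""
    if len(sequence) != len(qualities):
        raise ValueError("sequence and qualities must have the same length")
    return "".join(base if q >= min_quality else "N"
                   for base, q in zip(sequence, qualities))


def expected_errors(qualities):
    """Expected number of miscalled bases, using P(error) = 10 ** (-Q / 10)."""
    return sum(10 ** (-q / 10) for q in qualities)


masked = mask_low_quality("ACGT", [30, 10, 25, 5])  # "ANGN"
```

    Masking trades a little over-correction (some correct bases are hidden) for the removal of the positions most likely to be wrong, which matches the trade-off the abstract describes.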

  2. VaDiR: an integrated approach to Variant Detection in RNA.

    PubMed

    Neums, Lisa; Suenaga, Seiji; Beyerlein, Peter; Anders, Sara; Koestler, Devin; Mariani, Andrea; Chien, Jeremy

    2018-02-01

    Advances in next-generation DNA sequencing technologies are now enabling detailed characterization of sequence variations in cancer genomes. With whole-genome sequencing, variations in coding and non-coding sequences can be discovered, but its cost currently limits its general use in research. Whole-exome sequencing is used to characterize sequence variations in coding regions, but the cost associated with capture reagents and biases in capture rate limit its full use in research. Additional limitations include uncertainty in assigning the functional significance of the mutations when these mutations are observed in the non-coding region or in genes that are not expressed in cancer tissue. We investigated the feasibility of uncovering mutations from expressed genes using RNA sequencing datasets with a method called Variant Detection in RNA (VaDiR) that integrates 3 variant callers, namely: SNPiR, RVBoost, and MuTect2. The combination of all 3 methods, which we called Tier 1 variants, produced the highest precision with true positive mutations from RNA-seq that could be validated at the DNA level. We also found that the integration of Tier 1 variants with those called by MuTect2 and SNPiR produced the highest recall with acceptable precision. Finally, we observed a higher rate of mutation discovery in genes that are expressed at higher levels. Our method, VaDiR, provides a possibility of uncovering mutations from RNA sequencing datasets that could be useful in further functional analysis. In addition, our approach allows orthogonal validation of DNA-based mutation discovery by providing complementary sequence variation analysis from paired RNA/DNA sequencing datasets.
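
    The precision and recall trade-off between combining callers by intersection and by union can be made concrete with a toy sketch. It assumes that "Tier 1" means variants reported by all three callers, as the abstract suggests; the variant identifiers, the truth set, and the helper function are invented for illustration and do not come from the study or from the VaDiR software.

```python
def precision_recall(called, truth):
    """Precision and recall of a set of variant calls against a truth set."""
    true_positives = len(called & truth)
    precision = true_positives / len(called) if called else 0.0
    recall = true_positives / len(truth) if truth else 0.0
    return precision, recall


# Hypothetical call sets keyed by "chromosome:position and change".
snpir = {"chr1:100A>G", "chr1:200C>T", "chr2:50G>A"}
rvboost = {"chr1:100A>G", "chr2:50G>A", "chr3:10T>C"}
mutect2 = {"chr1:100A>G", "chr1:200C>T", "chr4:5A>T"}
truth = {"chr1:100A>G", "chr1:200C>T"}  # e.g. variants validated at the DNA level

tier1 = snpir & rvboost & mutect2       # called by all three callers
any_caller = snpir | rvboost | mutect2  # called by at least one caller

tier1_scores = precision_recall(tier1, truth)
union_scores = precision_recall(any_caller, truth)
```

    In this toy data the intersection is perfectly precise but misses one true variant, while the union recovers both true variants at the cost of three false positives.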

  3. Information-optimal genome assembly via sparse read-overlap graphs.

    PubMed

    Shomorony, Ilan; Kim, Samuel H; Courtade, Thomas A; Tse, David N C

    2016-09-01

    In the context of third-generation long-read sequencing technologies, read-overlap-based approaches are expected to play a central role in the assembly step. A fundamental challenge in assembling from a read-overlap graph is that the true sequence corresponds to a Hamiltonian path on the graph, and, under most formulations, the assembly problem becomes NP-hard, restricting practical approaches to heuristics. In this work, we avoid this seemingly fundamental barrier by first setting the computational complexity issue aside, and seeking an algorithm that targets information limits. In particular, we consider a basic feasibility question: when does the set of reads contain enough information to allow unambiguous reconstruction of the true sequence? Based on insights from this information feasibility question, we present an algorithm, the Not-So-Greedy algorithm, to construct a sparse read-overlap graph. Unlike most other assembly algorithms, Not-So-Greedy comes with a performance guarantee: whenever information feasibility conditions are satisfied, the algorithm reduces the assembly problem to an Eulerian path problem on the resulting graph, which can thus be solved in linear time. In practice, this theoretical guarantee translates into assemblies of higher quality. Evaluations on both simulated reads from real genomes and a PacBio Escherichia coli K12 dataset demonstrate that Not-So-Greedy compares favorably with standard string graph approaches in terms of accuracy of the resulting read-overlap graph and contig N50. Availability: github.com/samhykim/nsg. Contact: courtade@eecs.berkeley.edu or dntse@stanford.edu. Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
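
    The linear-time claim rests on the fact that an Eulerian path, unlike a Hamiltonian path, can be found by visiting each edge once. The sketch below uses Hierholzer's algorithm on a directed graph to illustrate that final step only. It is not the Not-So-Greedy algorithm, which concerns how the sparse graph is built; the function name and the toy edge list are hypothetical.

```python
from collections import defaultdict


def eulerian_path(edges):
    """Return the node sequence of an Eulerian path over directed edges,
    or None if no such path exists (Hierholzer's algorithm)."""
    graph = defaultdict(list)
    out_deg = defaultdict(int)
    in_deg = defaultdict(int)
    for u, v in edges:
        graph[u].append(v)
        out_deg[u] += 1
        in_deg[v] += 1

    nodes = set(out_deg) | set(in_deg)
    starts = [n for n in nodes if out_deg[n] - in_deg[n] == 1]
    ends = [n for n in nodes if in_deg[n] - out_deg[n] == 1]
    balanced = all(abs(out_deg[n] - in_deg[n]) <= 1 for n in nodes)
    if not balanced or len(starts) > 1 or len(starts) != len(ends):
        return None
    if not edges:
        return []

    start = starts[0] if starts else edges[0][0]
    stack, path = [start], []
    while stack:
        node = stack[-1]
        if graph[node]:
            stack.append(graph[node].pop())  # follow an unused edge
        else:
            path.append(stack.pop())         # dead end: emit the node
    path.reverse()
    # A shorter path means some edges were unreachable from the start node.
    return path if len(path) == len(edges) + 1 else None


# Toy overlap graph with exactly one Eulerian path.
toy_path = eulerian_path([("A", "B"), ("B", "C"), ("C", "A"), ("A", "D")])
```

    Each edge is pushed and popped once, so the running time is proportional to the number of edges, in contrast to the NP-hard Hamiltonian formulation mentioned in the abstract.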

  4. Mapping brain activity in gradient-echo functional MRI using principal component analysis

    NASA Astrophysics Data System (ADS)

    Khosla, Deepak; Singh, Manbir; Don, Manuel

    1997-05-01

    The detection of sites of brain activation in functional MRI has been a topic of immense research interest and many techniques have been proposed to this end. Recently, principal component analysis (PCA) has been applied to extract the activated regions and their time course of activation. This method is based on the assumption that the activation is orthogonal to other signal variations such as brain motion, physiological oscillations and other uncorrelated noises. A distinct advantage of this method is that it does not require any knowledge of the time course of the true stimulus paradigm. This technique is well suited to EPI image sequences where the sampling rate is high enough to capture the effects of physiological oscillations. In this work, we propose and apply two methods that are based on PCA to conventional gradient-echo images and investigate their usefulness as tools to extract reliable information on brain activation. The first method is a conventional technique where a single image sequence with alternating on and off stages is subject to a principal component analysis. The second method is a PCA-based approach called the common spatial factor analysis technique (CSF). As the name suggests, this method relies on common spatial factors between the above fMRI image sequence and a background fMRI. We have applied these methods to identify active brain areas during visual stimulation and motor tasks. The results from these methods are compared to those obtained by using the standard cross-correlation technique. We found good agreement in the areas identified as active across all three techniques. The results suggest that PCA and CSF methods have good potential in detecting the true stimulus correlated changes in the presence of other interfering signals.

  5. Comparative analysis of tandem repeats from hundreds of species reveals unique insights into centromere evolution.

    PubMed

    Melters, Daniël P; Bradnam, Keith R; Young, Hugh A; Telis, Natalie; May, Michael R; Ruby, J Graham; Sebra, Robert; Peluso, Paul; Eid, John; Rank, David; Garcia, José Fernando; DeRisi, Joseph L; Smith, Timothy; Tobias, Christian; Ross-Ibarra, Jeffrey; Korf, Ian; Chan, Simon W L

    2013-01-30

    Centromeres are essential for chromosome segregation, yet their DNA sequences evolve rapidly. In most animals and plants that have been studied, centromeres contain megabase-scale arrays of tandem repeats. Despite their importance, very little is known about the degree to which centromere tandem repeats share common properties between different species across different phyla. We used bioinformatic methods to identify high-copy tandem repeats from 282 species using publicly available genomic sequence and our own data. Our methods are compatible with all current sequencing technologies. Long Pacific Biosciences sequence reads allowed us to find tandem repeat monomers up to 1,419 bp. We assumed that the most abundant tandem repeat is the centromere DNA, which was true for most species whose centromeres have been previously characterized, suggesting this is a general property of genomes. High-copy centromere tandem repeats were found in almost all animal and plant genomes, but repeat monomers were highly variable in sequence composition and length. Furthermore, phylogenetic analysis of sequence homology showed little evidence of sequence conservation beyond approximately 50 million years of divergence. We find that despite an overall lack of sequence conservation, centromere tandem repeats from diverse species showed similar modes of evolution. While centromere position in most eukaryotes is epigenetically determined, our results indicate that tandem repeats are highly prevalent at centromeres of both animal and plant genomes. This suggests a functional role for such repeats, perhaps in promoting concerted evolution of centromere DNA across chromosomes.

  6. Comparative analysis of tandem repeats from hundreds of species reveals unique insights into centromere evolution

    PubMed Central

    2013-01-01

    Background: Centromeres are essential for chromosome segregation, yet their DNA sequences evolve rapidly. In most animals and plants that have been studied, centromeres contain megabase-scale arrays of tandem repeats. Despite their importance, very little is known about the degree to which centromere tandem repeats share common properties between different species across different phyla. We used bioinformatic methods to identify high-copy tandem repeats from 282 species using publicly available genomic sequence and our own data. Results: Our methods are compatible with all current sequencing technologies. Long Pacific Biosciences sequence reads allowed us to find tandem repeat monomers up to 1,419 bp. We assumed that the most abundant tandem repeat is the centromere DNA, which was true for most species whose centromeres have been previously characterized, suggesting this is a general property of genomes. High-copy centromere tandem repeats were found in almost all animal and plant genomes, but repeat monomers were highly variable in sequence composition and length. Furthermore, phylogenetic analysis of sequence homology showed little evidence of sequence conservation beyond approximately 50 million years of divergence. We find that despite an overall lack of sequence conservation, centromere tandem repeats from diverse species showed similar modes of evolution. Conclusions: While centromere position in most eukaryotes is epigenetically determined, our results indicate that tandem repeats are highly prevalent at centromeres of both animal and plant genomes. This suggests a functional role for such repeats, perhaps in promoting concerted evolution of centromere DNA across chromosomes. PMID:23363705

  7. New FeFe-hydrogenase genes identified in a metagenomic fosmid library from a municipal wastewater treatment plant as revealed by high-throughput sequencing.

    PubMed

    Tomazetto, Geizecler; Wibberg, Daniel; Schlüter, Andreas; Oliveira, Valéria M

    2015-01-01

    A fosmid metagenomic library was constructed with total community DNA obtained from a municipal wastewater treatment plant (MWWTP), with the aim of identifying new FeFe-hydrogenase genes encoding the enzymes most important for hydrogen metabolism. The dataset generated by pyrosequencing of a fosmid library was mined to identify environmental gene tags (EGTs) assigned to FeFe-hydrogenase. The majority of EGTs representing FeFe-hydrogenase genes were affiliated with the class Clostridia, suggesting that this group is the main hydrogen producer in the MWWTP analyzed. From the assembled sequences, three FeFe-hydrogenase genes were predicted by detection of the L2 motif (MPCxxKxxE) in the encoded gene product, confirming them as true FeFe-hydrogenase sequences. These sequences were used to design specific primers to detect fosmids encoding FeFe-hydrogenase genes predicted from the dataset. Three identified fosmids were completely sequenced. The cloned genomic fragments within these fosmids are closely related to members of the Spirochaetaceae, Bacteroidales and Firmicutes, and their FeFe-hydrogenase sequences are characterized by the structure type M3, which is common to clostridial enzymes. FeFe-hydrogenase sequences found in this study represent hitherto undetected sequences, indicating the high genetic diversity regarding these enzymes in MWWTP. Results suggest that MWWTP have to be considered as reservoirs for new FeFe-hydrogenase genes. Copyright © 2014 Institut Pasteur. Published by Elsevier Masson SAS. All rights reserved.
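
    The motif check mentioned in the abstract can be sketched with a regular expression. This is a minimal example that assumes each "x" in MPCxxKxxE stands for any single residue; the function name and the toy sequence are hypothetical, and the authors' actual screening procedure is not described in the abstract.

```python
import re

# L2 motif as written in the abstract (MPCxxKxxE); each "x" is read
# here as "any one residue", which is an assumption of this sketch.
L2_MOTIF = re.compile(r"MPC..K..E")


def find_l2_motif(protein_seq):
    """Return (start, matched_text) for every L2 motif hit in a protein sequence."""
    seq = protein_seq.upper()
    return [(m.start(), m.group()) for m in L2_MOTIF.finditer(seq)]


# Hypothetical toy sequence, not a real hydrogenase.
hits = find_l2_motif("AAGMPCTAKQFEGGH")
```

    A hit only shows that the motif is present; the abstract also relies on phylogenetic affiliation and structure type to characterize the genes.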

  8. Molecular phylogenetic trees - On the validity of the Goodman-Moore augmentation algorithm

    NASA Technical Reports Server (NTRS)

    Holmquist, R.

    1979-01-01

    A response is made to the reply of Nei and Tateno (1979) to the letter of Holmquist (1978) supporting the validity of the augmentation algorithm of Moore (1977) in reconstructions of nucleotide substitutions by means of the maximum parsimony principle. It is argued that the overestimation of the augmented numbers of nucleotide substitutions (augmented distances) found by Tateno and Nei (1978) is due to an unrepresentative data sample and that it is only necessary that evolution be stochastically uniform in different regions of the phylogenetic network for the augmentation method to be useful. The importance of the average value of the true distance over all links is explained, and the relative variances of the true and augmented distances are calculated to be almost identical. The effects of topological changes in the phylogenetic tree on the augmented distance and the question of the correctness of ancestral sequences inferred by the method of parsimony are also clarified.

  9. iMARS--mutation analysis reporting software: an analysis of spontaneous cII mutation spectra.

    PubMed

    Morgan, Claire; Lewis, Paul D

    2006-01-31

    The sensitivity of any mutational assay is determined by the level at which spontaneous mutations occur in the corresponding untreated controls. Establishing the type and frequency at which mutations occur naturally within a test system is essential if one is to draw scientifically sound conclusions regarding chemically induced mutations. Currently, mutation-spectra analysis is laborious and time-consuming. Thus, we have developed iMARS, a comprehensive mutation-spectrum analysis package that utilises routinely used methodologies and visualisation tools. To demonstrate the use and capabilities of iMARS, we have analysed the distribution, types and sequence context of spontaneous base substitutions derived from the cII gene mutation assay in transgenic animals. Analysis of spontaneous mutation spectra revealed variation both within and between the transgenic rodent test systems Big Blue Mouse, MutaMouse and Big Blue Rat. The most common spontaneous base substitutions were G:C-->A:T transitions and G:C-->T:A transversions. All Big Blue Mouse spectra were significantly different from each other by distribution and nearly all by mutation type, whereas the converse was true for the other test systems. Twenty-eight mutation hotspots were observed across all spectra generally occurring in CG, GA/TC, GG and GC dinucleotides. A mutation hotspot at nucleotide 212 occurred at a higher frequency in MutaMouse and Big Blue Rat. In addition, CG dinucleotides were the most mutable in all spectra except two Big Blue Mouse spectra. Thus, spontaneous base-substitution spectra showed more variation in distribution, type and sequence context in Big Blue Mouse relative to spectra derived from MutaMouse and Big Blue Rat. The results of our analysis provide a baseline reference for mutation studies utilising the cII gene in transgenic rodent models. The potential differences in spontaneous base-substitution spectra should be considered when making comparisons between these test systems. The ease with which iMARS has allowed us to carry out an exhaustive investigation to assess mutation distribution, mutation type, strand bias, target sequences and motifs, as well as predict mutation hotspots, provides us with a valuable tool in helping to distinguish true chemically induced hotspots from background mutations and gives a true reflection of mutation frequency.

  10. Pulsed Thrust Method for Hover Formation Flying

    NASA Technical Reports Server (NTRS)

    Hope, Alan; Trask, Aaron

    2003-01-01

    A non-continuous thrust method for hover type formation flying has been developed. This method differs from a true hover which requires constant range and bearing from a reference vehicle. The new method uses a pulsed loop, or pogo, maneuver sequence that keeps the follower spacecraft within a defined box in a near hover situation. Equations are developed for the hover maintenance maneuvers. The constraints on the hover location, pulse interval, and maximum/minimum ranges are discussed.

  11. [Study of beta-turns in globular proteins].

    PubMed

    Amirova, S R; Milchevskiĭ, Iu V; Filatov, I V; Esipova, N G; Tumanian, V G

    2005-01-01

    The formation of beta-turns in globular proteins has been studied by the method of molecular mechanics. The statistical method of discriminant analysis was applied to the calculated energy components and sequences of oligopeptide segments, and type I beta-turns were then predicted. The accuracy of true positive prediction is 65%. The components of conformational energy that considerably affect beta-turn formation were delineated. These are torsional energy, hydrogen-bond energy, and van der Waals energy.

  12. A Bayesian mixture model for chromatin interaction data.

    PubMed

    Niu, Liang; Lin, Shili

    2015-02-01

    Chromatin interactions mediated by a particular protein are of interest for studying gene regulation, especially the regulation of genes that are associated with, or known to be causative of, a disease. A recent molecular technique, Chromatin interaction analysis by paired-end tag sequencing (ChIA-PET), that uses chromatin immunoprecipitation (ChIP) and high throughput paired-end sequencing, is able to detect such chromatin interactions genomewide. However, ChIA-PET may generate noise (i.e., pairings of DNA fragments by random chance) in addition to true signal (i.e., pairings of DNA fragments by interactions). In this paper, we propose MC_DIST based on a mixture modeling framework to identify true chromatin interactions from ChIA-PET count data (counts of DNA fragment pairs). The model is cast into a Bayesian framework to take into account the dependency among the data and the available information on protein binding sites and gene promoters to reduce false positives. A simulation study showed that MC_DIST outperforms the previously proposed hypergeometric model in terms of both power and type I error rate. A real data study showed that MC_DIST may identify potential chromatin interactions between protein binding sites and gene promoters that may be missed by the hypergeometric model. An R package implementing the MC_DIST model is available at http://www.stat.osu.edu/~statgen/SOFTWARE/MDM.

  13. A strategy for detecting the conservation of folding-nucleus residues in protein superfamilies.

    PubMed

    Michnick, S W; Shakhnovich, E

    1998-01-01

    Nucleation-growth theory predicts that fast-folding peptide sequences fold to their native structure via structures in a transition-state ensemble that share a small number of native contacts (the folding nucleus). Experimental and theoretical studies of proteins suggest that residues participating in folding nuclei are conserved among homologs. We attempted to determine if this is true in proteins with highly diverged sequences but identical folds (superfamilies). We describe a strategy based on comparisons of residue conservation in natural superfamily sequences with simulated sequences (generated with a Monte-Carlo sequence design strategy) for the same proteins. The basic assumptions of the strategy were that natural sequences will conserve residues needed for folding and stability plus function, the simulated sequences contain no functional conservation, and nucleus residues make native contacts with each other. Based on these assumptions, we identified seven potential nucleus residues in ubiquitin superfamily members. Non-nucleus conserved residues were also identified; these are proposed to be involved in stabilizing native interactions. We found that all superfamily members conserved the same potential nucleus residue positions, except those for which the structural topology is significantly different. Our results suggest that the conservation of the nucleus of a specific fold can be predicted by comparing designed simulated sequences with natural highly diverged sequences that fold to the same structure. We suggest that such a strategy could be used to help plan protein folding and design experiments, to identify new superfamily members, and to subdivide superfamilies further into classes having a similar folding mechanism.

  14. Protein-Protein Interactions in a Crowded Environment: An Analysis via Cross-Docking Simulations and Evolutionary Information

    PubMed Central

    Lopes, Anne; Sacquin-Mora, Sophie; Dimitrova, Viktoriya; Laine, Elodie; Ponty, Yann; Carbone, Alessandra

    2013-01-01

    Large-scale analyses of protein-protein interactions based on coarse-grain molecular docking simulations and binding site predictions resulting from evolutionary sequence analysis are possible and realizable on hundreds of proteins with varied structures and interfaces. We demonstrated this on the 168 proteins of the Mintseris Benchmark 2.0. On the one hand, we evaluated the quality of the interaction signal and the contribution of docking information compared to evolutionary information, showing that the combination of the two improves partner identification. On the other hand, since protein interactions usually occur in crowded environments with several competing partners, we carried out a thorough analysis of the interactions of proteins with true partners but also with non-partners to evaluate whether proteins in the environment, competing with the true partner, affect its identification. We found three populations of proteins: strongly competing, never competing, and interacting with different levels of strength. Populations and levels of strength are numerically characterized and provide a signature for the behavior of a protein in the crowded environment. We showed that partner identification, to some extent, does not depend on the competing partners present in the environment, that certain biochemical classes of proteins are intrinsically easier to analyze than others, and that small proteins are not more promiscuous than large ones. Our approach brings to light that the knowledge of the binding site can be used to reduce the high computational cost of docking simulations with no consequence in the quality of the results, demonstrating the possibility to apply coarse-grain docking to datasets made of thousands of proteins. A comparison with all available large-scale analyses aimed at partner prediction is provided. We release the complete decoy set produced by coarse-grain docking simulations of both true and false interacting partners, and their evolutionary sequence analysis leading to binding site predictions. Download site: http://www.lgm.upmc.fr/CCDMintseris/ PMID:24339765

  15. The OGCleaner: filtering false-positive homology clusters.

    PubMed

    Fujimoto, M Stanley; Suvorov, Anton; Jensen, Nicholas O; Clement, Mark J; Snell, Quinn; Bybee, Seth M

    2017-01-01

    Detecting homologous sequences in organisms is an essential step in protein structure and function prediction, gene annotation and phylogenetic tree construction. Heuristic methods are often employed for quality control of putative homology clusters. These heuristics, however, usually only apply to pairwise sequence comparison and do not examine clusters as a whole. We present the Orthology Group Cleaner (the OGCleaner), a tool designed for filtering putative orthology groups as homology or non-homology clusters by considering all sequences in a cluster. The OGCleaner relies on high-quality orthologous groups identified in OrthoDB to train machine learning algorithms that are able to distinguish between true-positive and false-positive homology groups. This package aims to improve the quality of phylogenetic tree construction especially in instances of lower-quality transcriptome assemblies. https://github.com/byucsl/ogcleaner Contact: sfujimoto@gmail.com. Supplementary information: Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved.

  16. (GTG)5 microsatellite regions in citrinin-producing Penicillium.

    PubMed

    Di Conza, José Alejandro; Nepote, Andrea Fabiana; González, Ana María; Lurá, María Cristina

    2007-03-01

    Morphological and cultural characteristics, as well as biochemical properties, are the main criteria used in fungal taxonomy and in the standard description of fungal species. Sometimes, however, these criteria are difficult to apply due to fungal phenotypic variations. This is particularly true in the genus Penicillium. The aims of this work were to determine the (GTG)5 microsatellite sequence in potentially citrinin-producing Penicillium strains and to investigate whether this sequence could be useful to characterize such fungi. Penicillium citrinum Thom and Penicillium chrysogenum Thom were isolated from different foods. The identification of the isolates at species level was carried out according to classical taxonomy. The production of citrinin was determined by thin layer chromatography. This study proved that microsatellite regions exist as short repeated sequences in all tested strains. The patterns were very similar for all P. citrinum isolates, and it was possible to group them according to the quantity of citrinin produced. However, similar clusters were not obtained when P. chrysogenum isolates were analyzed.

  17. Towards a Logical Distinction Between Swarms and Aftershock Sequences

    NASA Astrophysics Data System (ADS)

    Gardine, M.; Burris, L.; McNutt, S.

    2007-12-01

    The distinction between swarms and aftershock sequences has, up to this point, been fairly arbitrary and non-uniform. Typically 0.5 to 1 order of magnitude difference between the mainshock and largest aftershock has been a traditional choice, but there are many exceptions. Seismologists have generally assumed that the mainshock carries most of the energy, but this is only true if it is sufficiently large compared to the size and numbers of aftershocks. Here we present a systematic division based on energy of the aftershock sequence compared to the energy of the largest event of the sequence. It is possible to calculate the amount of aftershock energy assumed to be in the sequence using the b-value of the frequency-magnitude relation with a fixed choice of magnitude separation (M-mainshock minus M-largest aftershock). Assuming that the energy of an aftershock sequence is less than the energy of the mainshock, the b-value at which the aftershock energy exceeds that of the mainshock energy determines the boundary between aftershock sequences and swarms. The amount of energy for various choices of b-value is also calculated using different values of magnitude separation. When the minimum b-value at which the sequence energy exceeds that of the largest event/mainshock is plotted against the magnitude separation, a linear trend emerges. Values plotting above this line represent swarms and values plotting below it represent aftershock sequences. This scheme has the advantage that it represents a physical quantity - energy - rather than only statistical features of earthquake distributions. As such it may be useful to help distinguish swarms from mainshock/aftershock sequences and to better determine the underlying causes of earthquake swarms.
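
    The energy comparison described above can be sketched numerically. The abstract gives no formulas, so everything specific here is an assumption: a Gutenberg-Richter frequency-magnitude relation normalised to one event at the largest-aftershock magnitude, an energy scaling of 10^(1.5 M) per event, and hypothetical choices for the magnitude range (4 units below the largest aftershock) and bin width (0.1). The function names are invented for this example.

```python
def aftershock_energy_ratio(b, delta_m, m_range=4.0, dm=0.1):
    """Total aftershock energy divided by mainshock energy.

    b        : b-value of the frequency-magnitude relation
    delta_m  : magnitude separation (mainshock minus largest aftershock)
    m_range  : assumed span of aftershock magnitudes below the largest one
    dm       : assumed magnitude bin width
    """
    n_bins = int(round(m_range / dm))
    total = 0.0
    for i in range(n_bins + 1):
        # Event count in bin i, normalised so the top bin holds one event.
        if i == 0:
            count = 1.0
        else:
            count = 10 ** (b * i * dm) - 10 ** (b * (i - 1) * dm)
        # Energy of one event in bin i relative to the mainshock,
        # assuming energy scales as 10**(1.5 * magnitude).
        rel_energy = 10 ** (-1.5 * (delta_m + i * dm))
        total += count * rel_energy
    return total


def threshold_b(delta_m, lo=0.1, hi=3.0, tol=1e-6):
    """Smallest b in [lo, hi] at which the energy ratio reaches 1 (bisection)."""
    if aftershock_energy_ratio(lo, delta_m) >= 1.0:
        return lo
    if aftershock_energy_ratio(hi, delta_m) < 1.0:
        return None
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if aftershock_energy_ratio(mid, delta_m) >= 1.0:
            hi = mid
        else:
            lo = mid
    return hi


# Threshold b-values for the two traditional magnitude separations.
b_half, b_one = threshold_b(0.5), threshold_b(1.0)
```

    Under these assumptions a larger magnitude separation requires a larger b-value before the aftershocks outweigh the mainshock. The specific threshold values depend on the assumed magnitude range and should not be read as the authors' results.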

  18. Characterization of PepB, a group B streptococcal oligopeptidase.

    PubMed Central

    Lin, B; Averett, W F; Novak, J; Chatham, W W; Hollingshead, S K; Coligan, J E; Egan, M L; Pritchard, D G

    1996-01-01

    Group B streptococci were recently reported to possess a cell-associated collagenase. Although the enzyme hydrolyzed the synthetic collagen-like substrate N-(3-[2-furyl]acryloyl)-Leu-Gly-Pro-Ala, we found that neither the highly purified enzyme nor crude group B streptococcal cell lysate solubilized a film of reconstituted rat tail collagen, an activity regarded as obligatory for a true collagenase. We cloned and sequenced the gene for the enzyme (pepB). The deduced amino acid sequence showed 66.4% identity to the PepF oligopeptidase from Lactococcus lactis, a member of the M3 or thimet family of zinc metallopeptidases. The group B streptococcal enzyme also showed oligopeptidase activity and degraded a variety of small bioactive peptides, including bradykinin, neurotensin, and peptide fragments of substance P and adrenocorticotropin. PMID:8757883

  19. Acid processing of pre-Tertiary radiolarian cherts and its impact on faunal content and biozonal correlation

    USGS Publications Warehouse

    Blome, C.D.; Reed, K.M.

    1993-01-01

    Destruction of radiolarians during both diagenesis and HF processing severely reduces faunal abundance and diversity and affects the taxonomic and biostratigraphic utility of chert residues. The robust forms that survive the processing represent only a small fraction of the death assemblage, and delicate skeletal structures used for species differentiation are either poorly preserved or dissolved in many coeval chert residues. First and last occurrences of taxa in chert sequences are likely to be coarse approximations of their true stratigraphic ranges. Precise correlation is difficult between biozonations based solely on index species from cherts and those constructed from limestone faunas. Careful selection of samples in sequence, use of weaker HF solutions, and study of both chert and limestone faunas should yield better biostratigraphic information. -from Authors

  20. Improved Prediction of Non-methylated Islands in Vertebrates Highlights Different Characteristic Sequence Patterns

    PubMed Central

    Vingron, Martin

    2016-01-01

    Non-methylated islands (NMIs) of DNA are genomic regions that are important for gene regulation and development. A recent study of genome-wide non-methylation data in vertebrates by Long et al. (eLife 2013;2:e00348) has shown that many experimentally identified non-methylated regions do not overlap with classically defined CpG islands which are computationally predicted using simple DNA sequence features. This is especially true in cold-blooded vertebrates such as Danio rerio (zebrafish). In order to investigate how predictive DNA sequence is of a region’s methylation status, we applied a supervised learning approach using a spectrum kernel support vector machine, to see if a more complex model and supervised learning can be used to improve non-methylated island prediction and to understand the sequence properties of these regions. We demonstrate that DNA sequence is highly predictive of methylation status, and that in contrast to existing CpG island prediction methods our method is able to provide more useful predictions of NMIs genome-wide in all vertebrate organisms that were studied. Our results also show that in cold-blooded vertebrates (Anolis carolinensis, Xenopus tropicalis and Danio rerio) where genome-wide classical CpG island predictions consist primarily of false positives, longer primarily AT-rich DNA sequence features are able to identify these regions much more accurately. PMID:27984582
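
    For readers unfamiliar with the spectrum kernel named in the abstract, a minimal sketch follows. It uses the standard definition (the inner product of k-mer count vectors of two sequences); the choice of k = 3 and the function names are assumptions for illustration, and the abstract does not state which k or which implementation the author used.

```python
from collections import Counter


def kmer_counts(seq, k):
    """Count every overlapping k-mer in a sequence."""
    seq = seq.upper()
    return Counter(seq[i:i + k] for i in range(len(seq) - k + 1))


def spectrum_kernel(seq_a, seq_b, k=3):
    """Spectrum kernel value: dot product of the two k-mer count vectors."""
    counts_a = kmer_counts(seq_a, k)
    counts_b = kmer_counts(seq_b, k)
    # Counter returns 0 for k-mers absent from seq_b.
    return sum(n * counts_b[kmer] for kmer, n in counts_a.items())


# Hypothetical toy sequences.
same = spectrum_kernel("ACGT", "ACGT", k=2)
none = spectrum_kernel("ACGT", "TTTT", k=2)
```

    A matrix of such values over all pairs of training regions could in principle be passed to a support vector machine that accepts precomputed kernels.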

  1. Mining SNPs from EST sequences using filters and ensemble classifiers.

    PubMed

    Wang, J; Zou, Q; Guo, M Z

    2010-05-04

    Abundant single nucleotide polymorphisms (SNPs) provide the most complete information for genome-wide association studies. However, due to the bottleneck of manual discovery of putative SNPs and the inaccessibility of the original sequencing reads, it is essential to develop a more efficient and accurate computational method for automated SNP detection. We propose a novel computational method to rapidly find true SNPs in publicly available EST (expressed sequence tag) databases; this method is implemented as SNPDigger. EST sequences are clustered and aligned. SNP candidates are then obtained according to a measure of redundant frequency. Several new informative biological features, such as the structural neighbor profiles and the physical position of the SNP, were extracted from EST sequences, and the effectiveness of these features was demonstrated. An ensemble classifier, which employs a carefully selected feature set, was included for the imbalanced training data. The sensitivity and specificity of our method both exceeded 80% for human genetic data in the cross validation. Our method enables detection of SNPs from the user's own EST dataset and can be used on species for which there is no genome data. Our tests showed that this method can effectively guide SNP discovery in ESTs and will help reduce the cost of biological analyses.
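
    The candidate-selection step can be illustrated with a simple column scan over aligned ESTs. This is not SNPDigger's algorithm: the abstract does not define its "measure of redundant frequency", so the rule used here (the second most common base must appear in at least a minimum number of reads) is an assumption, as are the function name and the toy alignment.

```python
from collections import Counter


def snp_candidates(aligned_reads, min_minor_count=2):
    """Return (position, major_base, minor_base, minor_count) per candidate column.

    aligned_reads   : equal-length strings from one alignment, "-" for gaps
    min_minor_count : assumed redundancy threshold for the minor allele
    """
    candidates = []
    for pos, column in enumerate(zip(*aligned_reads)):
        counts = Counter(base for base in column if base in "ACGT")
        if len(counts) < 2:
            continue  # column is monomorphic or all gaps
        (major, _), (minor, minor_n) = counts.most_common(2)
        if minor_n >= min_minor_count:
            candidates.append((pos, major, minor, minor_n))
    return candidates


# Hypothetical toy alignment: position 1 varies in two reads (kept),
# position 4 varies in only one read (treated as a likely error).
reads = ["ACGTA", "ACGTA", "ATGTA", "ATGTA", "ACGTC"]
found = snp_candidates(reads)
```

    In the method described above, candidates like these would then be scored by a classifier using additional features before being reported.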

  2. The (in)complete organelle genome: exploring the use and nonuse of available technologies for characterizing mitochondrial and plastid chromosomes.

    PubMed

    Sanitá Lima, Matheus; Woods, Laura C; Cartwright, Matthew W; Smith, David Roy

    2016-11-01

    Not long ago, scientists paid dearly in time, money and skill for every nucleotide that they sequenced. Today, DNA sequencing technologies epitomize the slogan 'faster, easier, cheaper and more', and in many ways, sequencing an entire genome has become routine, even for the smallest laboratory groups. This is especially true for mitochondrial and plastid genomes. Given their relatively small sizes and high copy numbers per cell, organelle DNAs are currently among the most highly sequenced kind of chromosome. But accurately characterizing an organelle genome and the information it encodes can require much more than DNA sequencing and bioinformatics analyses. Organelle genomes can be surprisingly complex and can exhibit convoluted and unconventional modes of gene expression. Unravelling this complexity can demand a wide assortment of experiments, from pulsed-field gel electrophoresis to Southern and Northern blots to RNA analyses. Here, we show that it is exactly these types of 'complementary' analyses that are often lacking from contemporary organelle genome papers, particularly short 'genome announcement' articles. Consequently, crucial and interesting features of organelle chromosomes are going undescribed, which could ultimately lead to a poor understanding and even a misrepresentation of these genomes and the genes they express. High-throughput sequencing and bioinformatics have made it easy to sequence and assemble entire chromosomes, but they should not be used as a substitute for or at the expense of other types of genomic characterization methods. © 2016 The Authors. Molecular Ecology Resources Published by John Wiley & Sons Ltd.

  3. Planar dGEMRIC Maps May Aid Imaging Assessment of Cartilage Damage in Femoroacetabular Impingement.

    PubMed

    Bulat, Evgeny; Bixby, Sarah D; Siversson, Carl; Kalish, Leslie A; Warfield, Simon K; Kim, Young-Jo

    2016-02-01

    Three-dimensional (3-D) delayed gadolinium-enhanced MRI of cartilage (dGEMRIC) helps quantify biochemical changes in articular cartilage that correlate with early-stage osteoarthritis. However, dGEMRIC analysis is performed slice by slice, limiting the potential of 3-D data to give an overall impression of cartilage biochemistry. We previously developed a computational algorithm to produce unfolded, or "planar," dGEMRIC maps of acetabular cartilage, but have neither assessed their application nor determined whether MRI-based grading of cartilage damage or dGEMRIC measurements predict intraoperative findings in hips with symptomatic femoroacetabular impingement (FAI). (1) Does imaging-based assessment of acetabular cartilage damage correlate with intraoperative findings in hips with symptomatic FAI? (2) Does the planar dGEMRIC map improve this correlation? (3) Does the planar map improve the correlation between the dGEMRIC index and MRI-based grading of cartilage damage in hips with symptomatic FAI? (4) Does the planar map improve imaging-based evaluation time for hips with symptomatic FAI? We retrospectively studied 47 hips of 45 patients with symptomatic FAI who underwent hip surgery between 2009 and 2013 and had a 1.5-T 3-D dGEMRIC scan within 6 months preoperatively. Our cohort included 25 males and 20 females with a mean ± SD age at surgery of 29 ± 11 years. Planar dGEMRIC maps were generated from isotropic, sagittal oblique TrueFISP and T1 sequences. A pediatric musculoskeletal radiologist with experience in hip MRI evaluated studies using radially reformatted sequences. For six acetabular subregions (anterior-peripheral [AP]; anterior-central [AC]; superior-peripheral [SP]; superior-central [SC]; posterior-peripheral [PP]; posterior-central [PC]), modified Outerbridge cartilage damage grades were recorded and region-of-interest T1 averages (the dGEMRIC index) were measured. Beck's intraoperative cartilage damage grades were compared with the Outerbridge grades and dGEMRIC indices. For a subset of 26 hips, 13 were reevaluated with the map and 13 without the map, and total evaluation times were recorded. There were no meaningful differences in the correlations obtained with versus without referencing the planar maps. Planar map-independent Outerbridge grades had a notable (p < 0.05) Spearman's rank correlation (ρ) with Beck's grades that was moderate in AP, SC, and PC (0.3 < ρ < 0.5) and strong in SP (ρ > 0.5). For map-dependent Outerbridge grades, ρ was moderate in AP, AC, and SC and strong in SP. Map-independent dGEMRIC indices had a ρ with Beck's grades that was moderate in AP and SC (-0.3 > ρ > -0.5) and strong in SP (ρ < -0.5). For map-dependent dGEMRIC indices, ρ was moderate in SC and strong in SP. Similarly, there were no meaningful, map-dependent differences in the correlations. When comparing Outerbridge grades and dGEMRIC indices, there were notable correlations across all subregions. Without the planar map, ρ was moderate in AC and PC and strong in AP, SP, SC, and PP. With the map, ρ was strong in all six subregions. In AC, there was a notable map-dependent improvement in this correlation (p < 0.001). Finally, referencing the planar dGEMRIC map during evaluation was associated with a decrease in mean evaluation time, from 207 ± 32 seconds to 152 ± 33 seconds (p = 0.001). Our work challenges the weak correlation between dGEMRIC and intraoperative findings of cartilage damage that was previously reported in hips with symptomatic FAI, suggesting that dGEMRIC has potential diagnostic use for this patient population. The planar dGEMRIC maps did not meaningfully alter the correlation of imaging-based evaluation of cartilage damage with intraoperative findings; however, they notably improved the correlation of dGEMRIC and MRI-based grading in AC, and their use incurred no additional time cost to imaging-based evaluation. Therefore, the planar maps may improve dGEMRIC's use as a continuous proxy for an otherwise discrete and simplified MRI-based grade of cartilage damage in hips with symptomatic FAI. Level III, diagnostic study.

  4. Preoperative Medical Evaluation: Part 1: General Principles and Cardiovascular Considerations

    PubMed Central

    Becker, Daniel E

    2009-01-01

    A thorough assessment of a patient's medical status is standard practice when dental care is provided. Although this is true for procedures performed under local anesthesia alone, the information gathered may be viewed somewhat differently if the dentist is planning to use sedation or general anesthesia as an adjunct to dental treatment. This article is the first of a 2-part sequence and will address general principles and cardiovascular considerations. A second article will address pulmonary, metabolic, and miscellaneous disorders. PMID:19769423

  5. Determination of the Structural Basis of Antibody Diversity Using NMR

    DTIC Science & Technology

    1989-06-15

    Tomasello, J., & Whitaker, the time - in terms of the true first-order off rate constants M. (1987) Biochemistry 26, 6058-6064. kso and kDO and the fractional...Levitt, M., spectra is unlikely to yield sequence-specific assignments for McConnell, H. M., Rule, G. S., Tomasello, J. & Whittaker. M. AN02. The...Leahy, D. J., Levitt, M., McConnell, H. M., Rule, G. S., Tomasello, J., & Whittaker, M. (1987) Biochemistry 26, 6058-6064. Leahy, D. J., Rule, G. S

  6. Rapid Creation and Quantitative Monitoring of High Coverage shRNA Libraries

    PubMed Central

    Bassik, Michael C.; Lebbink, Robert Jan; Churchman, L. Stirling; Ingolia, Nicholas T.; Patena, Weronika; LeProust, Emily M.; Schuldiner, Maya; Weissman, Jonathan S.; McManus, Michael T.

    2009-01-01

    Short hairpin RNA (shRNA) libraries are limited by the low efficacy of many shRNAs, giving false negatives, and off-target effects, giving false positives. Here we present a strategy for rapidly creating expanded shRNA pools (∼30 shRNAs/gene) that are analyzed by deep-sequencing (EXPAND). This approach enables identification of multiple effective target-specific shRNAs from a complex pool, allowing a rigorous statistical evaluation of whether a gene is a true hit. PMID:19448642

  7. Progression-free survival as surrogate and as true end point: insights from the breast and colorectal cancer literature.

    PubMed

    Saad, E D; Katz, A; Hoff, P M; Buyse, M

    2010-01-01

    Significant achievements in the systemic treatment of both advanced breast cancer and advanced colorectal cancer over the past 10 years have led to a growing number of drugs, combinations, and sequences to be tested. The choice of surrogate and true end points has become a critical issue and one that is currently the subject of much debate. Many recent randomized trials in solid tumor oncology have used progression-free survival (PFS) as the primary end point. PFS is an attractive end point because it is available earlier than overall survival (OS) and is not influenced by second-line treatments. PFS is now undergoing validation as a surrogate end point in various disease settings. The question of whether PFS can be considered an acceptable surrogate end point depends not only on formal validation studies but also on a standardized definition and unbiased ascertainment of disease progression in clinical trials. In advanced breast cancer, formal validation of PFS as a surrogate for OS has so far been unsuccessful. In advanced colorectal cancer, in contrast, current evidence indicates that PFS is a valid surrogate for OS after first-line treatment with chemotherapy. The other question is whether PFS sufficiently reflects clinical benefit to be considered a true end point in and of itself.

  8. Estimating true human and animal host source contribution in quantitative microbial source tracking using the Monte Carlo method.

    PubMed

    Wang, Dan; Silkie, Sarah S; Nelson, Kara L; Wuertz, Stefan

    2010-09-01

    Cultivation- and library-independent, quantitative PCR-based methods have become the method of choice in microbial source tracking. However, these qPCR assays are not 100% specific and sensitive for the target sequence in their respective hosts' genome. The factors that can lead to false positive and false negative information in qPCR results are well defined. It is highly desirable to have a way of removing such false information to estimate the true concentration of host-specific genetic markers and help guide the interpretation of environmental monitoring studies. Here we propose a statistical model based on the Law of Total Probability to predict the true concentration of these markers. The distributions of the probabilities of obtaining false information are estimated from representative fecal samples of known origin. Measurement error is derived from the sample precision error of replicated qPCR reactions. Then, the Monte Carlo method is applied to sample from these distributions of probabilities and measurement error. The set of equations given by the Law of Total Probability allows one to calculate the distribution of true concentrations, from which their expected value, confidence interval and other statistical characteristics can be easily evaluated. The output distributions of predicted true concentrations can then be used as input to watershed-wide total maximum daily load determinations, quantitative microbial risk assessment and other environmental models. This model was validated by both statistical simulations and real world samples. It was able to correct the intrinsic false information associated with qPCR assays and output the distribution of true concentrations of Bacteroidales for each animal host group. Model performance was strongly affected by the precision error. It could perform reliably and precisely when the standard deviation of the precision error was small (≤ 0.1). 
    Further improvement on the precision of sample processing and qPCR reaction would greatly improve the performance of the model. This methodology, built upon Bacteroidales assays, is readily transferable to any other microbial source indicator where a universal assay for fecal sources of that indicator exists. Copyright © 2010 Elsevier Ltd. All rights reserved.
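
    The Monte Carlo idea in this abstract can be sketched with a deliberately simplified two-host model. The mixing equations, the Beta parameters for assay sensitivity and false-positive rate, and the log10-scale error with standard deviation 0.1 are all illustrative assumptions, not the authors' published equations or fitted distributions.

```python
import random
import statistics


def solve_true_concentrations(obs_a, obs_b, sens_a, fp_a, sens_b, fp_b):
    """Solve the assumed 2x2 mixing model for the true concentrations:
    obs_a = sens_a * t_a + fp_a * t_b
    obs_b = fp_b * t_a + sens_b * t_b
    """
    det = sens_a * sens_b - fp_a * fp_b
    t_a = (obs_a * sens_b - fp_a * obs_b) / det
    t_b = (sens_a * obs_b - fp_b * obs_a) / det
    return t_a, t_b


def monte_carlo_true_concentration(obs_a, obs_b, n_draws=10000, seed=0):
    """Propagate assay and measurement uncertainty to host A's true concentration."""
    rng = random.Random(seed)
    draws = []
    for _ in range(n_draws):
        # Hypothetical distributions for sensitivity and false-positive rate.
        sens_a = rng.betavariate(90, 10)
        sens_b = rng.betavariate(90, 10)
        fp_a = rng.betavariate(5, 95)
        fp_b = rng.betavariate(5, 95)
        # Assumed measurement error: normal on the log10 scale, sd = 0.1.
        noisy_a = obs_a * 10 ** rng.gauss(0, 0.1)
        noisy_b = obs_b * 10 ** rng.gauss(0, 0.1)
        t_a, _ = solve_true_concentrations(noisy_a, noisy_b,
                                           sens_a, fp_a, sens_b, fp_b)
        draws.append(t_a)
    draws.sort()
    return {
        "mean": statistics.fmean(draws),
        "ci95": (draws[int(0.025 * n_draws)], draws[int(0.975 * n_draws)]),
    }


if __name__ == "__main__":
    print(monte_carlo_true_concentration(92.5, 50.0))
```

    Each draw samples one plausible set of assay error rates and one noisy measurement, then inverts the model, so the sorted draws approximate the distribution of true concentrations from which a mean and interval can be read.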

  9. Ecological Consistency of SSU rRNA-Based Operational Taxonomic Units at a Global Scale

    PubMed Central

    Schmidt, Thomas S. B.; Matias Rodrigues, João F.; von Mering, Christian

    2014-01-01

    Operational Taxonomic Units (OTUs), usually defined as clusters of similar 16S/18S rRNA sequences, are the most widely used basic diversity units in large-scale characterizations of microbial communities. However, it remains unclear how well the various proposed OTU clustering algorithms approximate ‘true’ microbial taxa. Here, we explore the ecological consistency of OTUs – based on the assumption that, like true microbial taxa, they should show measurable habitat preferences (niche conservatism). In a global and comprehensive survey of available microbial sequence data, we systematically parse sequence annotations to obtain broad ecological descriptions of sampling sites. Based on these, we observe that sequence-based microbial OTUs generally show high levels of ecological consistency. However, different OTU clustering methods result in marked differences in the strength of this signal. Assuming that ecological consistency can serve as an objective external benchmark for cluster quality, we conclude that hierarchical complete linkage clustering, which provided the most ecologically consistent partitions, should be the default choice for OTU clustering. To our knowledge, this is the first approach to assess cluster quality using an external, biologically meaningful parameter as a benchmark, on a global scale. PMID:24763141

  10. Using populations of human and microbial genomes for organism detection in metagenomes

    DOE PAGES

    Ames, Sasha K.; Gardner, Shea N.; Marti, Jose Manuel; ...

    2015-04-29

    Identifying causative disease agents in human patients from shotgun metagenomic sequencing (SMS) presents a powerful tool to apply when other targeted diagnostics fail. Numerous technical challenges remain, however, before SMS can move beyond the role of research tool. Accurately separating the known and unknown organism content remains difficult, particularly when SMS is applied as a last resort. The true amount of human DNA that remains in a sample after screening against the human reference genome and filtering nonbiological components left from library preparation has previously been underreported. In this study, we create the most comprehensive collection of microbial and reference-free human genetic variation available in a database optimized for efficient metagenomic search by extracting sequences from GenBank and the 1000 Genomes Project. The results reveal new human sequences found in individual Human Microbiome Project (HMP) samples. Individual samples contain up to 95% human sequence, and 4% of the individual HMP samples contain 10% or more human reads. In conclusion, left unidentified, human reads can complicate and slow down further analysis and lead to inaccurately labeled microbial taxa and ultimately lead to privacy concerns as more human genome data is collected.

  11. Rehearsal dynamics in elementary school children.

    PubMed

    Lehmann, Martin; Hasselhorn, Marcus

    2012-03-01

    Several studies on free recall suggest that processes responsible for recall are analogous to processes responsible for rehearsal. In children, the relationship between cumulative rehearsal and recall performance has been proven to be critical; however, the locus of the effect of rehearsal is not yet fully understood. To unfold the mechanisms that come into play in an overt rehearsal free recall task, we assessed rehearsal and recall sequences in children between 8 and 10 years of age. These sequences give information about the context in which items are repeated and rearranged throughout the list and subsequently recalled. Rehearsal sequences consisted mainly of items from neighboring list positions in their original temporal order. The same characteristics were true for recall sequences. Qualitatively, order effects during study and recall did not differ over age groups. However, in older children who were using cumulative rehearsal more intensively, successive rehearsal and recall of items in their original order was more pronounced. Therefore, we suggest that a main feature of item rehearsal with regard to facilitating recall is the strengthening of interitem associations based on the temporal order within a list and that this characteristic develops with age. Copyright © 2011 Elsevier Inc. All rights reserved.

  12. Using populations of human and microbial genomes for organism detection in metagenomes.

    PubMed

    Ames, Sasha K; Gardner, Shea N; Marti, Jose Manuel; Slezak, Tom R; Gokhale, Maya B; Allen, Jonathan E

    2015-07-01

    Identifying causative disease agents in human patients from shotgun metagenomic sequencing (SMS) presents a powerful tool to apply when other targeted diagnostics fail. Numerous technical challenges remain, however, before SMS can move beyond the role of research tool. Accurately separating the known and unknown organism content remains difficult, particularly when SMS is applied as a last resort. The true amount of human DNA that remains in a sample after screening against the human reference genome and filtering nonbiological components left from library preparation has previously been underreported. In this study, we create the most comprehensive collection of microbial and reference-free human genetic variation available in a database optimized for efficient metagenomic search by extracting sequences from GenBank and the 1000 Genomes Project. The results reveal new human sequences found in individual Human Microbiome Project (HMP) samples. Individual samples contain up to 95% human sequence, and 4% of the individual HMP samples contain 10% or more human reads. Left unidentified, human reads can complicate and slow down further analysis and lead to inaccurately labeled microbial taxa and ultimately lead to privacy concerns as more human genome data is collected. © 2015 Ames et al.; Published by Cold Spring Harbor Laboratory Press.

  13. Using populations of human and microbial genomes for organism detection in metagenomes

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Ames, Sasha K.; Gardner, Shea N.; Marti, Jose Manuel

    Identifying causative disease agents in human patients from shotgun metagenomic sequencing (SMS) presents a powerful tool to apply when other targeted diagnostics fail. Numerous technical challenges remain, however, before SMS can move beyond the role of research tool. Accurately separating the known and unknown organism content remains difficult, particularly when SMS is applied as a last resort. The true amount of human DNA that remains in a sample after screening against the human reference genome and filtering nonbiological components left from library preparation has previously been underreported. In this study, we create the most comprehensive collection of microbial and reference-free human genetic variation available in a database optimized for efficient metagenomic search by extracting sequences from GenBank and the 1000 Genomes Project. The results reveal new human sequences found in individual Human Microbiome Project (HMP) samples. Individual samples contain up to 95% human sequence, and 4% of the individual HMP samples contain 10% or more human reads. In conclusion, left unidentified, human reads can complicate and slow down further analysis and lead to inaccurately labeled microbial taxa and ultimately lead to privacy concerns as more human genome data is collected.

  14. Analysis and Visualization of ChIP-Seq and RNA-Seq Sequence Alignments Using ngs.plot.

    PubMed

    Loh, Yong-Hwee Eddie; Shen, Li

    2016-01-01

    The continual maturation and increasing applications of next-generation sequencing technology in scientific research have yielded ever-increasing amounts of data that need to be effectively and efficiently analyzed and innovatively mined for new biological insights. We have developed ngs.plot, a quick and easy-to-use bioinformatics tool that performs visualizations of the spatial relationships between sequencing alignment enrichment and specific genomic features or regions. More importantly, ngs.plot is customizable beyond the use of standard genomic feature databases to allow the analysis and visualization of user-specified regions of interest generated by the user's own hypotheses. In this protocol, we demonstrate and explain the use of ngs.plot using command line executions, as well as a web-based workflow on the Galaxy framework. We replicate the underlying commands used in the analysis of a true biological dataset that we had reported and published earlier and demonstrate how ngs.plot can easily generate publication-ready figures. With ngs.plot, users would be able to efficiently and innovatively mine their own datasets without having to be involved in the technical aspects of sequence coverage calculations and genomic databases.

  15. Haplogroup relationships between domestic and wild sheep resolved using a mitogenome panel.

    PubMed

    Meadows, J R S; Hiendleder, S; Kijas, J W

    2011-04-01

    Five haplogroups have been identified in domestic sheep through global surveys of mitochondrial (mt) sequence variation, however these group classifications are often based on small fragments of the complete mtDNA sequence; partial control region or the cytochrome B gene. This study presents the complete mitogenome from representatives of each haplogroup identified in domestic sheep, plus a sample of their wild relatives. Comparison of the sequence successfully resolved the relationships between each haplogroup and provided insight into the relationship with wild sheep. The five haplogroups were characterised as branching independently, a radiation that shared a common ancestor 920,000 ± 190,000 years ago based on protein coding sequence. The utility of various mtDNA components to inform the true relationship between sheep was also examined with Bayesian, maximum likelihood and partitioned Bremmer support analyses. The control region was found to be the mtDNA component, which contributed the highest amount of support to the tree generated using the complete data set. This study provides the nucleus of a mtDNA mitogenome panel, which can be used to assess additional mitogenomes and serve as a reference set to evaluate small fragments of the mtDNA.

  16. Haplogroup relationships between domestic and wild sheep resolved using a mitogenome panel

    PubMed Central

    Meadows, J R S; Hiendleder, S; Kijas, J W

    2011-01-01

    Five haplogroups have been identified in domestic sheep through global surveys of mitochondrial (mt) sequence variation, however these group classifications are often based on small fragments of the complete mtDNA sequence; partial control region or the cytochrome B gene. This study presents the complete mitogenome from representatives of each haplogroup identified in domestic sheep, plus a sample of their wild relatives. Comparison of the sequence successfully resolved the relationships between each haplogroup and provided insight into the relationship with wild sheep. The five haplogroups were characterised as branching independently, a radiation that shared a common ancestor 920 000±190 000 years ago based on protein coding sequence. The utility of various mtDNA components to inform the true relationship between sheep was also examined with Bayesian, maximum likelihood and partitioned Bremmer support analyses. The control region was found to be the mtDNA component, which contributed the highest amount of support to the tree generated using the complete data set. This study provides the nucleus of a mtDNA mitogenome panel, which can be used to assess additional mitogenomes and serve as a reference set to evaluate small fragments of the mtDNA. PMID:20940734

  17. The (in)famous GWAS P-value threshold revisited and updated for low-frequency variants.

    PubMed

    Fadista, João; Manning, Alisa K; Florez, Jose C; Groop, Leif

    2016-08-01

    Genome-wide association studies (GWAS) have long relied on proposed statistical significance thresholds to be able to differentiate true positives from false positives. Although the genome-wide significance P-value threshold of 5 × 10^-8 has become a standard for common-variant GWAS, it has not been updated to cope with the lower allele frequency spectrum used in many recent array-based GWAS studies and sequencing studies. Using a whole-genome- and -exome-sequencing data set of 2875 individuals of European ancestry from the Genetics of Type 2 Diabetes (GoT2D) project and a whole-exome-sequencing data set of 13 000 individuals from five ancestries from the GoT2D and T2D-GENES (Type 2 Diabetes Genetic Exploration by Next-generation sequencing in multi-Ethnic Samples) projects, we describe guidelines for genome- and exome-wide association P-value thresholds needed to correct for multiple testing, explaining the impact of linkage disequilibrium thresholds for distinguishing independent variants, minor allele frequency and ancestry characteristics. We emphasize the advantage of studying recent genetic isolate populations when performing rare and low-frequency genetic association analyses, as the multiple testing burden is diminished due to higher genetic homogeneity.
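
    The multiple-testing arithmetic behind such thresholds can be illustrated with a plain Bonferroni correction. The conventional genome-wide threshold is commonly explained as a 0.05 family-wise error rate spread over roughly one million independent common-variant tests. The five-million figure below is a hypothetical count chosen for illustration, not a value reported in the abstract.

```python
def bonferroni_threshold(alpha, n_independent_tests):
    """Per-test P-value threshold that keeps the family-wise error rate at alpha."""
    if n_independent_tests <= 0:
        raise ValueError("n_independent_tests must be positive")
    return alpha / n_independent_tests


if __name__ == "__main__":
    # About one million independent tests gives the familiar common-variant threshold.
    print(bonferroni_threshold(0.05, 1_000_000))
    # Hypothetical: more independent low-frequency variants means a stricter threshold.
    print(bonferroni_threshold(0.05, 5_000_000))
```

    The same function shows why a more homogeneous population helps: fewer independent variants means a smaller divisor and a less stringent per-test threshold.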

  18. Evolution Stings: The Origin and Diversification of Scorpion Toxin Peptide Scaffolds

    PubMed Central

    Sunagar, Kartik; Undheim, Eivind A. B.; Chan, Angelo H. C.; Koludarov, Ivan; Muñoz-Gómez, Sergio A.; Antunes, Agostinho; Fry, Bryan G.

    2013-01-01

    The episodic nature of natural selection and the accumulation of extreme sequence divergence in venom-encoding genes over long periods of evolutionary time can obscure the signature of positive Darwinian selection. Recognition of the true biocomplexity is further hampered by the limited taxon selection, with easy to obtain or medically important species typically being the subject of intense venom research, relative to the actual taxonomical diversity in nature. This holds true for scorpions, which are one of the most ancient terrestrial venomous animal lineages. The family Buthidae that includes all the medically significant species has been intensely investigated around the globe, while almost completely ignoring the remaining non-buthid families. Australian scorpion lineages, for instance, have been completely neglected, with only a single scorpion species (Urodacus yaschenkoi) having its venom transcriptome sequenced. Hence, the lack of venom composition and toxin sequence information from an entire continent’s worth of scorpions has impeded our understanding of the molecular evolution of scorpion venom. The molecular origin, phylogenetic relationships and evolutionary histories of most scorpion toxin scaffolds remain enigmatic. In this study, we have sequenced venom gland transcriptomes of a wide taxonomical diversity of scorpions from Australia, including buthid and non-buthid representatives. Using state-of-art molecular evolutionary analyses, we show that a majority of CSα/β toxin scaffolds have experienced episodic influence of positive selection, while most non-CSα/β linear toxins evolve under the extreme influence of negative selection. 
    For the first time, we have unraveled the molecular origin of the major scorpion toxin scaffolds, such as scorpion venom single von Willebrand factor C-domain peptides (SV-SVC), inhibitor cystine knot (ICK), disulphide-directed beta-hairpin (DDH), bradykinin potentiating peptides (BPP), linear non-disulphide bridged peptides and antimicrobial peptides (AMP). We have thus demonstrated that even neglected lineages of scorpions are a rich pool of novel biochemical components, which have evolved over millions of years to target specific ion channels in prey animals, and as a result, possess tremendous implications in therapeutics. PMID:24351712

  19. Improve homology search sensitivity of PacBio data by correcting frameshifts.

    PubMed

    Du, Nan; Sun, Yanni

    2016-09-01

    Single-molecule, real-time sequencing (SMRT) developed by Pacific BioSciences produces longer reads than secondary generation sequencing technologies such as Illumina. The long read length enables PacBio sequencing to close gaps in genome assembly, reveal structural variations, and identify gene isoforms with higher accuracy in transcriptomic sequencing. However, PacBio data has high sequencing error rate and most of the errors are insertion or deletion errors. During alignment-based homology search, insertion or deletion errors in genes will cause frameshifts and may only lead to marginal alignment scores and short alignments. As a result, it is hard to distinguish true alignments from random alignments and the ambiguity will incur errors in structural and functional annotation. Existing frameshift correction tools are designed for data with much lower error rate and are not optimized for PacBio data. As an increasing number of groups are using SMRT, there is an urgent need for dedicated homology search tools for PacBio data. In this work, we introduce Frame-Pro, a profile homology search tool for PacBio reads. Our tool corrects sequencing errors and also outputs the profile alignments of the corrected sequences against characterized protein families. We applied our tool to both simulated and real PacBio data. The results showed that our method enables more sensitive homology search, especially for PacBio data sets of low sequencing coverage. In addition, we can correct more errors when comparing with a popular error correction tool that does not rely on hybrid sequencing. The source code is freely available at https://sourceforge.net/projects/frame-pro/ yannisun@msu.edu. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  20. Passenger Flow Forecasting Research for Airport Terminal Based on SARIMA Time Series Model

    NASA Astrophysics Data System (ADS)

    Li, Ziyu; Bi, Jun; Li, Zhiyin

    2017-12-01

    Based on operating data from Kunming Changshui International Airport during 2016, this paper proposes a Seasonal Autoregressive Integrated Moving Average (SARIMA) model to predict the passenger flow. This article not only considers the non-stationarity and autocorrelation of the sequence, but also considers the daily periodicity of the sequence. The prediction results can accurately describe the change trend of airport passenger flow and provide scientific decision support for the optimal allocation of airport resources and optimization of the departure process. The result shows that this model is applicable to the short-term prediction of airport terminal departure passenger traffic and the average error ranges from 1% to 3%. The difference between the predicted and the true values of passenger traffic flow is quite small, which indicates that the model has fairly good passenger traffic flow prediction ability.

  1. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Liolios, Konstantinos; Abt, Birte; Scheuner, Carmen

    Spirochaeta africana Zhilina et al. 1996 is an anaerobic, aerotolerant, spiral-shaped bacterium that is motile via periplasmic flagella. The type strain of the species, Z-7692T, was isolated in 1993 or earlier from a bacterial bloom in the brine under the trona layer in a shallow lagoon of the alkaline equatorial Lake Magadi in Kenya. Here we describe the features of this organism, together with the complete genome sequence, and annotation. Considering the pending reclassification of S. caldaria to the genus Treponema, S. africana is only the second 'true' member of the genus Spirochaeta with a genome-sequenced type strain to be published. The 3,285,855 bp long genome of strain Z-7692T with its 2,817 protein-coding and 57 RNA genes is a part of the Genomic Encyclopedia of Bacteria and Archaea project.

  2. Combined Leydig cell and Sertoli cell dysfunction in 46,XX males lacking the sex determining region Y gene

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Turner, B.; Vordermark, J.S.; Fechner, P.Y.

    1995-07-03

    We have evaluated 3 individuals with a rare form of 46,XX sex reversal. All of them had ambiguous external genitalia and mixed wolffian and muellerian structures, indicating both Leydig cell and Sertoli cell dysfunction, similar to that of patients with true hermaphroditism. However, gonadal tissue was not ovotesticular but testicular with varying degrees of dysgenesis. SRY sequences were absent in genomic DNA from peripheral leukocytes in all 3 subjects. Y centromere sequences were also absent, indicating that testis development did not occur because of a low level mosaicism of Y-bearing cells. The subjects in this report demonstrate that there is a continuum in the extent of the testis determination in SRY-negative 46,XX sex reversal, ranging from nearly normal to minimal testicular development. 20 refs.

  3. The Molecular Revolution in Cutaneous Biology: Era of Next-Generation Sequencing.

    PubMed

    Sarig, Ofer; Sprecher, Eli

    2017-05-01

    Like any true conceptual revolution, next-generation sequencing (NGS) has not only radically changed research and clinical practice, it has also modified scientific culture. With the possibility to investigate DNA contents of any organism and in any context, including in somatic disorders or in tissues carrying complex microbial populations, it initially seemed as if the genetic underpinning of any biological phenomenon could now be deciphered in an almost streamlined fashion. However, over the past recent years, we have once again come to understand that there is no such a thing as great opportunities without great challenges. The steadily expanding use of NGS and related applications is now facing biologists and physicians with novel technological obstacles, analytical hurdles and increasingly pressing ethical questions. Copyright © 2017 The Authors. Published by Elsevier Inc. All rights reserved.

  4. Too good to be true: when overwhelming evidence fails to convince.

    PubMed

    Gunn, Lachlan J; Chapeau-Blondeau, François; McDonnell, Mark D; Davis, Bruce R; Allison, Andrew; Abbott, Derek

    2016-03-01

    Is it possible for a large sequence of measurements or observations, which support a hypothesis, to counterintuitively decrease our confidence? Can unanimous support be too good to be true? The assumption of independence is often made in good faith; however, rarely is consideration given to whether a systemic failure has occurred. Taking this into account can cause certainty in a hypothesis to decrease as the evidence for it becomes apparently stronger. We perform a probabilistic Bayesian analysis of this effect with examples based on (i) archaeological evidence, (ii) weighing of legal evidence and (iii) cryptographic primality testing. In this paper, we investigate the effects of small error rates in a set of measurements or observations. We find that even with very low systemic failure rates, high confidence is surprisingly difficult to achieve; in particular, we find that certain analyses of cryptographically important numerical tests are highly optimistic, underestimating their false-negative rate by as much as a factor of 2^80.
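
    The effect described here can be reproduced with a toy Bayesian model. In this sketch, a systemic failure (with small probability) makes every observation come out positive regardless of the truth. The parameter values and the model itself are illustrative assumptions rather than the authors' worked examples.

```python
def posterior_after_unanimous(n, prior=0.5, p_hit=0.9, p_false=0.1, p_failure=0.01):
    """Posterior probability of hypothesis H after n unanimous positive observations.

    Assumed toy model: with probability p_failure the measurement process is
    broken and always reports "positive". Otherwise each observation is
    independent, positive with probability p_hit if H is true and p_false if not.
    """
    like_h = p_failure + (1 - p_failure) * p_hit ** n
    like_not_h = p_failure + (1 - p_failure) * p_false ** n
    return prior * like_h / (prior * like_h + (1 - prior) * like_not_h)


if __name__ == "__main__":
    for n in (1, 3, 10, 50):
        print(n, round(posterior_after_unanimous(n), 4))
```

    Under these assumptions the posterior first rises with n and then falls back toward the prior, because a long unanimous run becomes better explained by a systemic failure than by many independent imperfect observations all agreeing.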

  5. [Expert reconstruction of the true circumstances of the shooting of the Emperor Nikolai II, members of his family, and those who accompanied them into exile at Ekaterinburg].

    PubMed

    Kovalev, A V; Kolkutin, V V

    2011-01-01

    Analysis of the materials of the criminal case and of historical, archive, and forensic medical documents made it possible for the first time to reconstruct the circumstances of the shooting of Emperor Nikolai II, members of the royal family, and those who chose to accompany them into exile in the house of the engineer N.N. Ipatiev, Ekaterinburg, on the night of 16-17th July (Gregorian calendar) 1918. The results of the study allowed the true picture of the assassination to be reconstructed, including the mutual positions of the victims and the executors in the semi-basement room No 2, the total number of participants, the number and direction of shots, the types of the weapons used, and the sequence of actions of each person involved in the event. It was shown that in the beginning most shots had been fired at the emperor and tsarevich Alexei.

  6. GOTHiC, a probabilistic model to resolve complex biases and to identify real interactions in Hi-C data.

    PubMed

    Mifsud, Borbala; Martincorena, Inigo; Darbo, Elodie; Sugar, Robert; Schoenfelder, Stefan; Fraser, Peter; Luscombe, Nicholas M

    2017-01-01

    Hi-C is one of the main methods for investigating spatial co-localisation of DNA in the nucleus. However, the raw sequencing data obtained from Hi-C experiments suffer from large biases and spurious contacts, making it difficult to identify true interactions. Existing methods use complex models to account for biases and do not provide a significance threshold for detecting interactions. Here we introduce a simple binomial probabilistic model that resolves complex biases and distinguishes between true and false interactions. The model corrects biases of known and unknown origin and yields a p-value for each interaction, providing a reliable threshold based on significance. We demonstrate this experimentally by testing the method against a random ligation dataset. Our method outperforms previous methods and provides a statistical framework for further data analysis, such as comparisons of Hi-C interactions between different conditions. GOTHiC is available as a BioConductor package (http://www.bioconductor.org/packages/release/bioc/html/GOTHiC.html).
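
    A binomial significance test of this kind can be sketched as follows. The chance probability of a read pair linking fragments i and j is assumed here to be twice the product of their relative coverages. That formula, the function names, and the example numbers are assumptions for illustration and are not taken from the GOTHiC package API.

```python
import math


def binom_log_pmf(x, n, p):
    """Log of the Binomial(n, p) probability mass at x, for 0 < p < 1."""
    return (math.lgamma(n + 1) - math.lgamma(x + 1) - math.lgamma(n - x + 1)
            + x * math.log(p) + (n - x) * math.log1p(-p))


def binom_sf(k, n, p):
    """Upper tail P(X >= k) for X ~ Binomial(n, p), summed directly."""
    if k <= 0:
        return 1.0
    if k > n:
        return 0.0
    return min(1.0, sum(math.exp(binom_log_pmf(x, n, p)) for x in range(k, n + 1)))


def interaction_p_value(observed, rel_cov_i, rel_cov_j, total_pairs):
    """P-value for seeing at least `observed` read pairs between two fragments."""
    # Assumed null model: random ligation proportional to fragment coverage.
    p_chance = 2.0 * rel_cov_i * rel_cov_j
    return binom_sf(observed, total_pairs, p_chance)


if __name__ == "__main__":
    # Hypothetical fragments, each holding 1% of read ends, in 10,000 read pairs.
    print(interaction_p_value(20, 0.01, 0.01, 10_000))
    print(interaction_p_value(2, 0.01, 0.01, 10_000))
```

    Summing the upper tail directly, rather than subtracting the lower tail from one, avoids losing very small p-values to floating-point cancellation.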

  7. Biological applications of near-field scanning optical microscopy

    NASA Astrophysics Data System (ADS)

    Moers, Marco H. P.; Ruiter, A. G. T.; Jalocha, Alain; van Hulst, Niko F.; Kalle, W. H. J.; Wiegant, J. C. A. G.; Raap, A. K.

    1995-09-01

    Near-field Scanning Optical Microscopy (NSOM) is a true optical microscopic technique allowing fluorescence, absorption, reflection and polarization contrast with the additional advantage of nanometer lateral resolution, unlimited by diffraction and operation at ambient conditions. NSOM based on metal coated adiabatically tapered fibers, combined with shear force feedback and operated in illumination mode, has proven to be the most powerful NSOM arrangement, because of its true localization of the optical interaction, its various optical contrast possibilities and its sensitivity down to the single molecular level. In this paper applications of `aperture' NSOM to Fluorescence In Situ Hybridization of human metaphase chromosomes are presented, where the localized fluorescence allows to identify specific DNA sequences. All images are accompanied by the simultaneously acquired force image, enabling direct comparison of the optical contrast with the sample topography on nanometer scale, far beyond the diffraction limit. Thus the unique combination of high resolution, specific optical contrast and ambient operation offers many new direction possibilities in biological studies.

  8. VecScreen_plus_taxonomy: imposing a tax(onomy) increase on vector contamination screening.

    PubMed

    Schäffer, Alejandro A; Nawrocki, Eric P; Choi, Yoon; Kitts, Paul A; Karsch-Mizrachi, Ilene; McVeigh, Richard

    2018-03-01

    Nucleic acid sequences in public databases should not contain vector contamination, but many sequences in GenBank do (or did) contain vectors. The National Center for Biotechnology Information uses the program VecScreen to screen submitted sequences for contamination. Additional tools are needed to distinguish true-positive (contamination) from false-positive (not contamination) VecScreen matches. A principal reason for false-positive VecScreen matches is that the sequence and the matching vector subsequence originate from closely related or identical organisms (for example, both originate in Escherichia coli). We collected information on the taxonomy of sources of vector segments in the UniVec database used by VecScreen. We used that information in two overlapping software pipelines for retrospective analysis of contamination in GenBank and for prospective analysis of contamination in new sequence submissions. Using the retrospective pipeline, we identified and corrected over 8000 contaminated sequences in the nonredundant nucleotide database. The prospective analysis pipeline has been in production use since April 2017 to evaluate some new GenBank submissions. Data on the sources of UniVec entries were included in release 10.0 (ftp://ftp.ncbi.nih.gov/pub/UniVec/). The main software is freely available at https://github.com/aaschaffer/vecscreen_plus_taxonomy. aschaffe@helix.nih.gov. Supplementary data are available at Bioinformatics online. Published by Oxford University Press 2017. This work is written by US Government employees and are in the public domain in the US.

  9. BayesPI-BAR: a new biophysical model for characterization of regulatory sequence variations

    PubMed Central

    Wang, Junbai; Batmanov, Kirill

    2015-01-01

    Sequence variations in regulatory DNA regions are known to cause functionally important consequences for gene expression. DNA sequence variations may have an essential role in determining phenotypes and may be linked to disease; however, their identification through analysis of massive genome-wide sequencing data is a great challenge. In this work, a new computational pipeline, a Bayesian method for protein–DNA interaction with binding affinity ranking (BayesPI-BAR), is proposed for quantifying the effect of sequence variations on protein binding. BayesPI-BAR uses biophysical modeling of protein–DNA interactions to predict single nucleotide polymorphisms (SNPs) that cause significant changes in the binding affinity of a regulatory region for transcription factors (TFs). The method includes two new parameters (TF chemical potentials or protein concentrations and direct TF binding targets) that are neglected by previous methods. The new method is verified on 67 known human regulatory SNPs, of which 47 (70%) have predicted true TFs ranked in the top 10. Importantly, the performance of BayesPI-BAR, which uses principal component analysis to integrate multiple predictions from various TF chemical potentials, is found to be better than that of existing programs, such as sTRAP and is-rSNP, when evaluated on the same SNPs. BayesPI-BAR is a publicly available tool and is able to carry out parallelized computation, which helps to investigate a large number of TFs or SNPs and to detect disease-associated regulatory sequence variations in the sea of genome-wide noncoding regions. PMID:26202972

  10. Detecting very low allele fraction variants using targeted DNA sequencing and a novel molecular barcode-aware variant caller.

    PubMed

    Xu, Chang; Nezami Ranjbar, Mohammad R; Wu, Zhong; DiCarlo, John; Wang, Yexun

    2017-01-03

    Detection of DNA mutations at very low allele fractions with high accuracy will significantly improve the effectiveness of precision medicine for cancer patients. To achieve this goal through next generation sequencing, researchers need a detection method that 1) captures rare mutation-containing DNA fragments efficiently in the mix of abundant wild-type DNA; 2) sequences the DNA library extensively to deep coverage; and 3) distinguishes low level true variants from amplification and sequencing errors with high accuracy. Targeted enrichment using PCR primers provides researchers with a convenient way to achieve deep sequencing for a small, yet most relevant region using benchtop sequencers. Molecular barcoding (or indexing) provides a unique solution for reducing sequencing artifacts analytically. Although different molecular barcoding schemes have been reported in recent literature, most variant calling has been done on limited targets, using simple custom scripts. The analytical performance of barcode-aware variant calling can be significantly improved by incorporating advanced statistical models. We present here a highly efficient, simple and scalable enrichment protocol that integrates molecular barcodes in multiplex PCR amplification. In addition, we developed smCounter, an open source, generic, barcode-aware variant caller based on a Bayesian probabilistic model. smCounter was optimized and benchmarked on two independent read sets with SNVs and indels at 5 and 1% allele fractions. Variants were called with very good sensitivity and specificity within coding regions. We demonstrated that we can accurately detect somatic mutations with allele fractions as low as 1% in coding regions using our enrichment protocol and variant caller.

  11. Determination of haplotypes at structurally complex regions using emulsion haplotype fusion PCR.

    PubMed

    Tyson, Jess; Armour, John A L

    2012-12-11

    Genotyping and massively-parallel sequencing projects result in a vast amount of diploid data that is only rarely resolved into its constituent haplotypes. It is nevertheless this phased information that is transmitted from one generation to the next and is most directly associated with biological function and the genetic causes of biological effects. Despite progress made in genome-wide sequencing and phasing algorithms and methods, problems assembling (and reconstructing linear haplotypes in) regions of repetitive DNA and structural variation remain. These dynamic and structurally complex regions are often poorly understood from a sequence point of view. Regions such as these that are highly similar in their sequence tend to be collapsed onto the genome assembly. This in turn means downstream determination of the true sequence haplotype in these regions poses a particular challenge. For structurally complex regions, a more focussed approach to assembling haplotypes may be required. In order to investigate reconstruction of spatial information at structurally complex regions, we have used an emulsion haplotype fusion PCR approach to reproducibly link sequences of up to 1kb in length to allow phasing of multiple variants from neighbouring loci, using allele-specific PCR and sequencing to detect the phase. By using emulsion systems linking flanking regions to amplicons within the CNV, this led to the reconstruction of a 59kb haplotype across the DEFA1A3 CNV in HapMap individuals. This study has demonstrated a novel use for emulsion haplotype fusion PCR in addressing the issue of reconstructing structural haplotypes at multiallelic copy variable regions, using the DEFA1A3 locus as an example.

  12. Computational identification of developmental enhancers: conservation and function of transcription factor binding-site clusters in Drosophila melanogaster and Drosophila pseudoobscura

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Berman, Benjamin P.; Pfeiffer, Barret D.; Laverty, Todd R.

    2004-08-06

    The identification of sequences that control transcription in metazoans is a major goal of genome analysis. In a previous study, we demonstrated that searching for clusters of predicted transcription factor binding sites could discover active regulatory sequences, and identified 37 regions of the Drosophila melanogaster genome with high densities of predicted binding sites for five transcription factors involved in anterior-posterior embryonic patterning. Nine of these clusters overlapped known enhancers. Here, we report the results of in vivo functional analysis of 27 remaining clusters. We generated transgenic flies carrying each cluster attached to a basal promoter and reporter gene, and assayed embryos for reporter gene expression. Six clusters are enhancers of adjacent genes: giant, fushi tarazu, odd-skipped, nubbin, squeeze and pdm2; three drive expression in patterns unrelated to those of neighboring genes; the remaining 18 do not appear to have enhancer activity. We used the Drosophila pseudoobscura genome to compare patterns of evolution in and around the 15 positive and 18 false-positive predictions. Although conservation of primary sequence cannot distinguish true from false positives, conservation of binding-site clustering accurately discriminates functional binding-site clusters from those with no function. We incorporated conservation of binding-site clustering into a new genome-wide enhancer screen, and predict several hundred new regulatory sequences, including 85 adjacent to genes with embryonic patterns. Measuring conservation of sequence features closely linked to function--such as binding-site clustering--makes better use of comparative sequence data than commonly used methods that examine only sequence identity.

  13. Diffusion-weighted imaging of the liver with multiple b values: effect of diffusion gradient polarity and breathing acquisition on image quality and intravoxel incoherent motion parameters--a pilot study.

    PubMed

    Dyvorne, Hadrien A; Galea, Nicola; Nevers, Thomas; Fiel, M Isabel; Carpenter, David; Wong, Edmund; Orton, Matthew; de Oliveira, Andre; Feiweier, Thorsten; Vachon, Marie-Louise; Babb, James S; Taouli, Bachir

    2013-03-01

    To optimize intravoxel incoherent motion (IVIM) diffusion-weighted (DW) imaging by estimating the effects of diffusion gradient polarity and breathing acquisition scheme on image quality, signal-to-noise ratio (SNR), IVIM parameters, and parameter reproducibility, as well as to investigate the potential of IVIM in the detection of hepatic fibrosis. In this institutional review board-approved prospective study, 20 subjects (seven healthy volunteers, 13 patients with hepatitis C virus infection; 14 men, six women; mean age, 46 years) underwent IVIM DW imaging with four sequences: (a) respiratory-triggered (RT) bipolar (BP) sequence, (b) RT monopolar (MP) sequence, (c) free-breathing (FB) BP sequence, and (d) FB MP sequence. Image quality scores were assessed for all sequences. A biexponential analysis with the Bayesian method yielded true diffusion coefficient (D), pseudodiffusion coefficient (D*), and perfusion fraction (PF) in liver parenchyma. Mixed-model analysis of variance was used to compare image quality, SNR, IVIM parameters, and interexamination variability between the four sequences, as well as the ability to differentiate areas of liver fibrosis from normal liver tissue. Image quality with RT sequences was superior to that with FB acquisitions (P = .02) and was not affected by gradient polarity. SNR did not vary significantly between sequences. IVIM parameter reproducibility was moderate to excellent for PF and D, while it was less reproducible for D*. PF and D were both significantly lower in patients with hepatitis C virus than in healthy volunteers with the RT BP sequence (PF = 13.5% ± 5.3 [standard deviation] vs 9.2% ± 2.5, P = .038; D = [1.16 ± 0.07] × 10(-3) mm(2)/sec vs [1.03 ± 0.1] × 10(-3) mm(2)/sec, P = .006). The RT BP DW imaging sequence had the best results in terms of image quality, reproducibility, and ability to discriminate between healthy and fibrotic liver with biexponential fitting.
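
    The biexponential analysis mentioned above can be illustrated with the signal model commonly used for IVIM, S(b) = S0 [PF exp(-b D*) + (1 - PF) exp(-b D)]. The abstract does not write the equation out, so this form is an assumption, as are the function name `ivim_signal`, the D* value, and the b values below. PF and D are the healthy-volunteer means reported in the abstract. This is a minimal sketch of the forward model only, not the Bayesian fitting procedure used in the study.

```python
import math

def ivim_signal(b, s0, pf, d_star, d):
    """Biexponential IVIM signal at diffusion weighting b (s/mm^2).

    pf     : perfusion fraction (0..1)
    d_star : pseudodiffusion coefficient D* (mm^2/s)
    d      : true diffusion coefficient D (mm^2/s)
    """
    perfusion_part = pf * math.exp(-b * d_star)
    diffusion_part = (1.0 - pf) * math.exp(-b * d)
    return s0 * (perfusion_part + diffusion_part)

# PF and D from the abstract (healthy volunteers, RT BP sequence);
# D* and the b values are hypothetical, chosen only for illustration.
PF, D, D_STAR = 0.135, 1.16e-3, 50e-3
for b in (0, 50, 200, 800):
    print(b, round(ivim_signal(b, 1.0, PF, D_STAR, D), 4))
```

    At low b values the fast D* term dominates the signal drop, while at high b values the curve is governed almost entirely by D, which is why multiple b values are needed to separate the two components.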

  14. Reducing false-positive incidental findings with ensemble genotyping and logistic regression based variant filtering methods.

    PubMed

    Hwang, Kyu-Baek; Lee, In-Hee; Park, Jin-Ho; Hambuch, Tina; Choe, Yongjoon; Kim, MinHyeok; Lee, Kyungjoon; Song, Taemin; Neu, Matthew B; Gupta, Neha; Kohane, Isaac S; Green, Robert C; Kong, Sek Won

    2014-08-01

    As whole genome sequencing (WGS) uncovers variants associated with rare and common diseases, an immediate challenge is to minimize false-positive findings due to sequencing and variant calling errors. False positives can be reduced by combining results from orthogonal sequencing methods, but this is costly. Here, we present variant filtering approaches using logistic regression (LR) and ensemble genotyping to minimize false positives without sacrificing sensitivity. We evaluated the methods using paired WGS datasets of an extended family prepared using two sequencing platforms and a validated set of variants in NA12878. Using LR or ensemble genotyping based filtering, false-negative rates were significantly reduced by 1.1- to 17.8-fold at the same levels of false discovery rates (5.4% for heterozygous and 4.5% for homozygous single nucleotide variants (SNVs); 30.0% for heterozygous and 18.7% for homozygous insertions; 25.2% for heterozygous and 16.6% for homozygous deletions) compared to the filtering based on genotype quality scores. Moreover, ensemble genotyping excluded > 98% (105,080 of 107,167) of false positives while retaining > 95% (897 of 937) of true positives in de novo mutation (DNM) discovery in NA12878, and performed better than a consensus method using two sequencing platforms. Our proposed methods were effective in prioritizing phenotype-associated variants, and an ensemble genotyping would be essential to minimize false-positive DNM candidates. © 2014 WILEY PERIODICALS, INC.
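
    The two headline percentages for de novo mutation discovery follow directly from the counts given in the abstract. The short check below is only arithmetic on those reported counts; the helper name `percent` is introduced here for illustration.

```python
def percent(part, whole):
    """Return part/whole expressed as a percentage."""
    return 100.0 * part / whole

# Counts as reported in the abstract for DNM discovery in NA12878.
fp_excluded = percent(105_080, 107_167)  # false positives removed by the filter
tp_retained = percent(897, 937)          # true positives kept by the filter

print(f"false positives excluded: {fp_excluded:.2f}%")
print(f"true positives retained:  {tp_retained:.2f}%")
```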

  15. Reducing false positive incidental findings with ensemble genotyping and logistic regression-based variant filtering methods

    PubMed Central

    Hwang, Kyu-Baek; Lee, In-Hee; Park, Jin-Ho; Hambuch, Tina; Choi, Yongjoon; Kim, MinHyeok; Lee, Kyungjoon; Song, Taemin; Neu, Matthew B.; Gupta, Neha; Kohane, Isaac S.; Green, Robert C.; Kong, Sek Won

    2014-01-01

    As whole genome sequencing (WGS) uncovers variants associated with rare and common diseases, an immediate challenge is to minimize false positive findings due to sequencing and variant calling errors. False positives can be reduced by combining results from orthogonal sequencing methods, but this is costly. Here we present variant filtering approaches using logistic regression (LR) and ensemble genotyping to minimize false positives without sacrificing sensitivity. We evaluated the methods using paired WGS datasets of an extended family prepared using two sequencing platforms and a validated set of variants in NA12878. Using LR or ensemble genotyping based filtering, false negative rates were significantly reduced by 1.1- to 17.8-fold at the same levels of false discovery rates (5.4% for heterozygous and 4.5% for homozygous SNVs; 30.0% for heterozygous and 18.7% for homozygous insertions; 25.2% for heterozygous and 16.6% for homozygous deletions) compared to the filtering based on genotype quality scores. Moreover, ensemble genotyping excluded > 98% (105,080 of 107,167) of false positives while retaining > 95% (897 of 937) of true positives in de novo mutation (DNM) discovery, and performed better than a consensus method using two sequencing platforms. Our proposed methods were effective in prioritizing phenotype-associated variants, and ensemble genotyping would be essential to minimize false positive DNM candidates. PMID:24829188

  16. Next generation sequencing and its applications in forensic genetics.

    PubMed

    Børsting, Claus; Morling, Niels

    2015-09-01

    It has been almost a decade since the first next generation sequencing (NGS) technologies emerged and quickly changed the way genetic research is conducted. Today, full genomes are mapped and published almost weekly and with ever increasing speed and decreasing costs. NGS methods and platforms have matured during the last 10 years, and the quality of the sequences has reached a level where NGS is used in clinical diagnostics of humans. Forensic genetic laboratories have also explored NGS technologies and especially in the last year, there has been a small explosion in the number of scientific articles and presentations at conferences with forensic aspects of NGS. These contributions have demonstrated that NGS offers new possibilities for forensic genetic case work. More information may be obtained from unique samples in a single experiment by analyzing combinations of markers (STRs, SNPs, insertion/deletions, mRNA) that cannot be analyzed simultaneously with the standard PCR-CE methods used today. The true variation in core forensic STR loci has been uncovered, and previously unknown STR alleles have been discovered. The detailed sequence information may aid mixture interpretation and will increase the statistical weight of the evidence. In this review, we will give an introduction to NGS and single-molecule sequencing, and we will discuss the possible applications of NGS in forensic genetics. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.

  17. Benchmarking Inverse Statistical Approaches for Protein Structure and Design with Exactly Solvable Models.

    PubMed

    Jacquin, Hugo; Gilson, Amy; Shakhnovich, Eugene; Cocco, Simona; Monasson, Rémi

    2016-05-01

    Inverse statistical approaches to determine protein structure and function from Multiple Sequence Alignments (MSA) are emerging as powerful tools in computational biology. However the underlying assumptions of the relationship between the inferred effective Potts Hamiltonian and real protein structure and energetics remain untested so far. Here we use lattice protein model (LP) to benchmark those inverse statistical approaches. We build MSA of highly stable sequences in target LP structures, and infer the effective pairwise Potts Hamiltonians from those MSA. We find that inferred Potts Hamiltonians reproduce many important aspects of 'true' LP structures and energetics. Careful analysis reveals that effective pairwise couplings in inferred Potts Hamiltonians depend not only on the energetics of the native structure but also on competing folds; in particular, the coupling values reflect both positive design (stabilization of native conformation) and negative design (destabilization of competing folds). In addition to providing detailed structural information, the inferred Potts models used as protein Hamiltonian for design of new sequences are able to generate with high probability completely new sequences with the desired folds, which is not possible using independent-site models. Those are remarkable results as the effective LP Hamiltonians used to generate MSA are not simple pairwise models due to the competition between the folds. Our findings elucidate the reasons for the success of inverse approaches to the modelling of proteins from sequence data, and their limitations.

  18. Association analysis using next-generation sequence data from publicly available control groups: the robust variance score statistic

    PubMed Central

    Derkach, Andriy; Chiang, Theodore; Gong, Jiafen; Addis, Laura; Dobbins, Sara; Tomlinson, Ian; Houlston, Richard; Pal, Deb K.; Strug, Lisa J.

    2014-01-01

    Motivation: Sufficiently powered case–control studies with next-generation sequence (NGS) data remain prohibitively expensive for many investigators. If feasible, a more efficient strategy would be to include publicly available sequenced controls. However, these studies can be confounded by differences in sequencing platform; alignment, single nucleotide polymorphism and variant calling algorithms; read depth; and selection thresholds. Assuming one can match cases and controls on the basis of ethnicity and other potential confounding factors, and one has access to the aligned reads in both groups, we investigate the effect of systematic differences in read depth and selection threshold when comparing allele frequencies between cases and controls. We propose a novel likelihood-based method, the robust variance score (RVS), that substitutes genotype calls by their expected values given observed sequence data. Results: We show theoretically that the RVS eliminates read depth bias in the estimation of minor allele frequency. We also demonstrate that, using simulated and real NGS data, the RVS method controls Type I error and has comparable power to the ‘gold standard’ analysis with the true underlying genotypes for both common and rare variants. Availability and implementation: An RVS R script and instructions can be found at strug.research.sickkids.ca, and at https://github.com/strug-lab/RVS. Contact: lisa.strug@utoronto.ca Supplementary information: Supplementary data are available at Bioinformatics online. PMID:24733292
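
    The central idea of replacing a hard genotype call with its expected value given the sequence data can be sketched as a posterior mean over the three genotypes. This is not the authors' RVS script: the function name `expected_genotype`, the Hardy-Weinberg prior, and the input format (one likelihood per genotype) are all assumptions made for this illustration.

```python
def expected_genotype(likelihoods, maf):
    """Posterior mean of the minor-allele count G in {0, 1, 2}.

    likelihoods : (P(D | G=0), P(D | G=1), P(D | G=2)) for the observed reads D
    maf         : assumed minor allele frequency, used for a Hardy-Weinberg prior
    """
    priors = ((1.0 - maf) ** 2, 2.0 * maf * (1.0 - maf), maf ** 2)
    weights = [lik * prior for lik, prior in zip(likelihoods, priors)]
    total = sum(weights)
    if total == 0.0:
        raise ValueError("all genotype weights are zero")
    # E[G | D] = sum over g of g * P(G=g | D)
    return sum(g * w for g, w in enumerate(weights)) / total

# Uninformative reads: the expectation falls back to the prior mean, 2 * maf.
print(expected_genotype((1.0, 1.0, 1.0), maf=0.25))  # 0.5
```

    With shallow read depth the likelihoods are flat and the expected value stays close to the prior mean, whereas a hard call would be forced to one of 0, 1, or 2. That contrast is one way to see why expected values are less sensitive to read-depth differences than called genotypes.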

  19. Identification and sequence analyses of novel lipase encoding novel thermophillic bacilli isolated from Armenian geothermal springs.

    PubMed

    Shahinyan, Grigor; Margaryan, Armine; Panosyan, Hovik; Trchounian, Armen

    2017-05-02

    Among the huge diversity of thermophilic bacteria mainly bacilli have been reported as active thermostable lipase producers. Geothermal springs serve as the main source for isolation of thermostable lipase producing bacilli. Thermostable lipolytic enzymes, functioning in the harsh conditions, have promising applications in processing of organic chemicals, detergent formulation, synthesis of biosurfactants, pharmaceutical processing etc. In order to study the distribution of lipase-producing thermophilic bacilli and their specific lipase protein primary structures, three lipase producers from different genera were isolated from mesothermal (27.5-70 °C) springs distributed on the territory of Armenia and Nagorno Karabakh. Based on phenotypic characteristics and 16S rRNA gene sequencing the isolates were identified as Geobacillus sp., Bacillus licheniformis and Anoxibacillus flavithermus strains. The lipase genes of isolates were sequenced by using initially designed primer sets. Multiple alignments generated from primary structures of the lipase proteins and annotated lipase protein sequences, conserved regions analysis and amino acid composition have illustrated the similarity (98-99%) of the lipases with true lipases (family I) and GDSL esterase family (family II). A conserved sequence block that determines the thermostability has been identified in the multiple alignments of the lipase proteins. The results shed light on the distribution of lipase-producing bacilli in geothermal springs in Armenia and Nagorno Karabakh. Newly isolated bacilli strains could be a prospective source for thermostable lipases and their genes.

  20. Small animal magnetic resonance imaging: an efficient tool to assess liver volume and intrahepatic vascular anatomy.

    PubMed

    Melloul, Emmanuel; Raptis, Dimitri A; Boss, Andreas; Pfammater, Thomas; Tschuor, Christoph; Tian, Yinghua; Graf, Rolf; Clavien, Pierre-Alain; Lesurtel, Mickael

    2014-04-01

    To develop a noninvasive technique to assess liver volumetry and intrahepatic portal vein anatomy in a mouse model of liver regeneration. Fifty-two C57BL/6 male mice underwent magnetic resonance imaging (MRI) of the liver using a 4.7 T small animal MRI system after no treatment, 70% partial hepatectomy (PH), or selective portal vein embolization. The protocol consisted of the following sequences: three-dimensional-encoded spoiled gradient-echo sequence (repetition time per echo time 15 per 2.7 ms, flip angle 20°) for volumetry, and two-dimensional-encoded time-of-flight angiography sequence (repetition time per echo time 18 per 6.4 ms, flip angle 80°) for vessel visualization. Liver volume and portal vein segmentation was performed using a dedicated postprocessing software. In animals with portal vein embolization, portography served as reference standard. True liver volume was measured after sacrificing the animals. Measurements were carried out by two independent observers with subsequent analysis by the Cohen κ-test for interobserver agreement. MRI liver volumetry highly correlated with the true liver volume measurement using a conventional method in both the untreated liver and the liver remnant after 70% PH with a high interobserver correlation coefficient of 0.94 (95% confidence interval, 0.80-0.98 for untreated liver [P < 0.001] and 0.90-0.97 after 70% PH [P < 0.001]). The diagnostic accuracy of magnetic resonance angiography for the occlusion of one branch of the portal vein was 0.95 (95% confidence interval, 0.84-1). The level of agreement between the two observers for the description of intrahepatic vascular anatomy was excellent (Cohen κ value = 0.925). This protocol may be used for noninvasive liver volumetry and visualization of portal vein anatomy in mice. It will serve the dynamic study of new strategies to enhance liver regeneration in vivo. Copyright © 2014 Elsevier Inc. All rights reserved.
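
    Interobserver agreement of the kind reported above is usually summarized with Cohen's kappa, which corrects the observed agreement for the agreement expected by chance. The sketch below uses the standard two-rater formula; the function name `cohen_kappa` and the example ratings are hypothetical and are not data from the study.

```python
from collections import Counter

def cohen_kappa(rater_a, rater_b):
    """Cohen's kappa for two raters labelling the same items."""
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("both raters must label the same non-empty set of items")
    n = len(rater_a)
    # Fraction of items on which the two raters agree.
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    # Agreement expected if each rater labelled independently at their own rates.
    count_a, count_b = Counter(rater_a), Counter(rater_b)
    expected = sum(count_a[label] * count_b[label] for label in count_a) / (n * n)
    if expected == 1.0:
        return 1.0  # both raters used a single identical label throughout
    return (observed - expected) / (1.0 - expected)

# Hypothetical portal-vein branch ratings from two observers.
obs_1 = ["occluded", "patent", "patent", "occluded"]
obs_2 = ["occluded", "patent", "occluded", "occluded"]
print(cohen_kappa(obs_1, obs_2))  # 0.5
```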

  1. Building-up of a DNA barcode library for true bugs (insecta: hemiptera: heteroptera) of Germany reveals taxonomic uncertainties and surprises.

    PubMed

    Raupach, Michael J; Hendrich, Lars; Küchler, Stefan M; Deister, Fabian; Morinière, Jérome; Gossner, Martin M

    2014-01-01

    During the last few years, DNA barcoding has become an efficient method for the identification of species. In the case of insects, most published DNA barcoding studies focus on species of the Ephemeroptera, Trichoptera, Hymenoptera and especially Lepidoptera. In this study we test the efficiency of DNA barcoding for true bugs (Hemiptera: Heteroptera), an ecologically and economically highly important as well as morphologically diverse insect taxon. As part of our study we analyzed DNA barcodes for 1742 specimens of 457 species, comprising 39 families of the Heteroptera. We found low nucleotide distances with a minimum pairwise K2P distance <2.2% within 21 species pairs (39 species). For ten of these species pairs (18 species), minimum pairwise distances were zero. In contrast to this, deep intraspecific sequence divergences with maximum pairwise distances >2.2% were detected for 16 traditionally recognized and valid species. With a successful identification rate of 91.5% (418 species) our study emphasizes the use of DNA barcodes for the identification of true bugs and represents an important step in building-up a comprehensive barcode library for true bugs in Germany and Central Europe as well. Our study also highlights the urgent necessity of taxonomic revisions for various taxa of the Heteroptera, with a special focus on various species of the Miridae. In this context we found evidence for on-going hybridization events within various taxonomically challenging genera (e.g. Nabis Latreille, 1802 (Nabidae), Lygus Hahn, 1833 (Miridae), Phytocoris Fallén, 1814 (Miridae)) as well as the putative existence of cryptic species (e.g. Aneurus avenius (Duffour, 1833) (Aradidae) or Orius niger (Wolff, 1811) (Anthocoridae)).
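
    The pairwise K2P (Kimura two-parameter) distance used for the 2.2% threshold can be computed from the proportions of transitions (P) and transversions (Q) between two aligned sequences as d = -(1/2) ln[(1 - 2P - Q) sqrt(1 - 2Q)]. The sketch below is a minimal implementation of that standard formula; the function name `k2p_distance`, the handling of gaps and ambiguous bases (skipped), and the toy sequences are assumptions for illustration and do not reproduce the study's pipeline.

```python
import math

PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}

def k2p_distance(seq1, seq2):
    """Kimura two-parameter distance between two aligned DNA sequences."""
    if len(seq1) != len(seq2):
        raise ValueError("sequences must be aligned to the same length")
    transitions = transversions = compared = 0
    for x, y in zip(seq1.upper(), seq2.upper()):
        if x not in "ACGT" or y not in "ACGT":
            continue  # skip gaps and ambiguity codes
        compared += 1
        if x == y:
            continue
        if {x, y} <= PURINES or {x, y} <= PYRIMIDINES:
            transitions += 1    # A<->G or C<->T
        else:
            transversions += 1  # purine <-> pyrimidine
    if compared == 0:
        raise ValueError("no comparable sites")
    p = transitions / compared
    q = transversions / compared
    # math.log raises ValueError if the sequences are too divergent
    # for the model (argument not positive).
    return -0.5 * math.log((1.0 - 2.0 * p - q) * math.sqrt(1.0 - 2.0 * q))

# Toy 10-bp alignment with one transition (A -> G at the last site).
print(k2p_distance("ACGTACGTAA", "ACGTACGTAG"))
```

    A pair of barcodes would fall under the 2.2% threshold when this function returns a value below 0.022.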

  2. Building-Up of a DNA Barcode Library for True Bugs (Insecta: Hemiptera: Heteroptera) of Germany Reveals Taxonomic Uncertainties and Surprises

    PubMed Central

    Raupach, Michael J.; Hendrich, Lars; Küchler, Stefan M.; Deister, Fabian; Morinière, Jérome; Gossner, Martin M.

    2014-01-01

    During the last few years, DNA barcoding has become an efficient method for the identification of species. In the case of insects, most published DNA barcoding studies focus on species of the Ephemeroptera, Trichoptera, Hymenoptera and especially Lepidoptera. In this study we test the efficiency of DNA barcoding for true bugs (Hemiptera: Heteroptera), an ecologically and economically highly important as well as morphologically diverse insect taxon. As part of our study we analyzed DNA barcodes for 1742 specimens of 457 species, comprising 39 families of the Heteroptera. We found low nucleotide distances with a minimum pairwise K2P distance <2.2% within 21 species pairs (39 species). For ten of these species pairs (18 species), minimum pairwise distances were zero. In contrast to this, deep intraspecific sequence divergences with maximum pairwise distances >2.2% were detected for 16 traditionally recognized and valid species. With a successful identification rate of 91.5% (418 species) our study emphasizes the use of DNA barcodes for the identification of true bugs and represents an important step in building-up a comprehensive barcode library for true bugs in Germany and Central Europe as well. Our study also highlights the urgent necessity of taxonomic revisions for various taxa of the Heteroptera, with a special focus on various species of the Miridae. In this context we found evidence for on-going hybridization events within various taxonomically challenging genera (e.g. Nabis Latreille, 1802 (Nabidae), Lygus Hahn, 1833 (Miridae), Phytocoris Fallén, 1814 (Miridae)) as well as the putative existence of cryptic species (e.g. Aneurus avenius (Duffour, 1833) (Aradidae) or Orius niger (Wolff, 1811) (Anthocoridae)). PMID:25203616

  3. Mechanism of transcription termination by RNA polymerase III utilizes a nontemplate-strand sequence-specific signal element

    PubMed Central

    Arimbasseri, Aneeshkumar G.; Maraia, Richard J.

    2015-01-01

    SUMMARY Understanding the mechanism of transcription termination by a eukaryotic RNA polymerase (RNAP) has been limited by lack of a characterizable intermediate that reflects transition from an elongation complex to a true termination event. While other multisubunit RNAPs require multipartite cis-signals and/or ancillary factors to mediate pausing and release of the nascent transcript from the clutches of these enzymes, RNAP III does so with precision and efficiency on a simple oligo(dT) tract, independent of other cis-elements or trans-factors. We report a RNAP III pre-termination complex that reveals termination mechanisms controlled by sequence-specific elements in the non-template strand. Furthermore, the TFIIF-like, RNAP III subunit, C37 is required for this function of the non-template strand signal. The results reveal the RNAP III terminator as an information-rich control element. While the template strand promotes destabilization via a weak oligo(rU:dA) hybrid, the non-template strand provides distinct sequence-specific destabilizing information through interactions with the C37 subunit. PMID:25959395

  4. The 2012 Ferrara seismic sequence: Regional crustal structure, earthquake sources, and seismic hazard

    NASA Astrophysics Data System (ADS)

    Malagnini, Luca; Herrmann, Robert B.; Munafò, Irene; Buttinelli, Mauro; Anselmi, Mario; Akinci, Aybige; Boschi, E.

    2012-10-01

    Inadequate seismic design codes can be dangerous, particularly when they underestimate the true hazard. In this study we use data from a sequence of moderate-sized earthquakes in northeast Italy to validate and test a regional wave propagation model which, in turn, is used to understand some weaknesses of the current design spectra. Our velocity model, while regionalized and somewhat ad hoc, is consistent with geophysical observations and the local geology. In the 0.02-0.1 Hz band, this model is validated by using it to calculate moment tensor solutions of 20 earthquakes (5.6 ≥ MW ≥ 3.2) in the 2012 Ferrara, Italy, seismic sequence. The seismic spectra observed for the relatively small main shock significantly exceeded the design spectra to be used in the area for critical structures. Observations and synthetics reveal that the ground motions are dominated by long-duration surface waves, which, apparently, the design codes do not adequately anticipate. In light of our results, the present seismic hazard assessment in the entire Pianura Padana, including the city of Milan, needs to be re-evaluated.

  5. CNNdel: Calling Structural Variations on Low Coverage Data Based on Convolutional Neural Networks

    PubMed Central

    2017-01-01

    Many structural variations (SVs) detection methods have been proposed due to the popularization of next-generation sequencing (NGS). These SV calling methods use different SV-property-dependent features; however, they all suffer from poor accuracy when running on low coverage sequences. The union of results from these tools achieves fairly high sensitivity but still produces low accuracy on low coverage sequence data. That is, these methods contain many false positives. In this paper, we present CNNdel, an approach for calling deletions from paired-end reads. CNNdel gathers SV candidates reported by multiple tools and then extracts features from aligned BAM files at the positions of candidates. With labeled feature-expressed candidates as a training set, CNNdel trains convolutional neural networks (CNNs) to distinguish true unlabeled candidates from false ones. Results show that CNNdel works well with NGS reads from 26 low coverage genomes of the 1000 Genomes Project. The paper demonstrates that convolutional neural networks can automatically assign the priority of SV features and reduce the false positives efficaciously. PMID:28630866

  6. Dioszegia antarctica sp. nov. and Dioszegia cryoxerica sp. nov., psychrophilic basidiomycetous yeasts from polar desert soils in Antarctica

    USGS Publications Warehouse

    Rodriguez, Russell J.; Connell, L.; Redman, R.; Barrett, A.; Iszard, M.; Fonseca, A.

    2010-01-01

    During a survey of the culturable soil fungal population in samples collected in Taylor Valley, South Victoria Land, Antarctica, 13 basidiomycetous yeast strains with orange-coloured colonies were isolated. Phylogenetic analyses of internal transcribed spacer (ITS) and partial LSU rRNA gene sequences showed that the strains belong to the Dioszegia clade of the Tremellales (Tremellomycetes, Agaricomycotina), but did not correspond to any of the hitherto recognized species. Two novel species, Dioszegia antarctica sp. nov. (type strain ANT-03-116T =CBS 10920T =PYCC 5970T) and Dioszegia cryoxerica sp. nov. (type strain ANT-03-071T =CBS 10919T =PYCC 5967T), are described to accommodate ten and three of these strains, respectively. Analysis of ITS sequences demonstrated intrastrain sequence heterogeneity in D. cryoxerica. The latter species is also notable for producing true hyphae with clamp connections and haustoria. However, no sexual structures were observed. The two novel species can be considered obligate psychrophiles, since they failed to grow above 20 °C and grew best between 10 and 15 °C.

  7. Plastid primers for angiosperm phylogenetics and phylogeography.

    PubMed

    Prince, Linda M

    2015-06-01

    PCR primers are available for virtually every region of the plastid genome. Selection of which primer pairs to use is second only to selection of the genic region. This is particularly true for research at the species/population interface. Primer pairs for 130 regions of the chloroplast genome were evaluated in 12 species distributed across the angiosperms. Likelihood of amplification success was inferred based upon number and location of mismatches to target sequence. Intraspecific sequence variability was evaluated under three different criteria in four species. Many published primer pairs should work across all taxa sampled, with the exception of failure due to genomic reorganization events. Universal barcoding primers were the least likely to work (65% success). The list of most variable regions for use within species has little in common with the lists identified in prior studies. Published primer sequences should amplify a diversity of flowering plant DNAs, even those designed for specific taxonomic groups. "Universal" primers may have extremely limited utility. There was little consistency in likelihood of amplification success for any given publication across lineages or within lineage across publications.

  8. Ensuring privacy in the study of pathogen genetics

    PubMed Central

    Mehta, Sanjay R.; Vinterbo, Staal A.; Little, Susan J.

    2014-01-01

    Rapid growth in the genetic sequencing of pathogens in recent years has led to the creation of large sequence databases. This aggregated sequence data can be very useful for tracking and predicting epidemics of infectious diseases. However, the balance between the potential public health benefit and the risk to personal privacy for individuals whose genetic data (personal or pathogen) are included in such work has been difficult to delineate, because neither the true benefit nor the actual risk to participants has been adequately defined. Existing approaches to minimise the risk of privacy loss to participants are based on de-identification of data by removal of a predefined set of identifiers. These approaches neither guarantee privacy nor protect the usefulness of the data. We propose a new approach to privacy protection that will quantify the risk to participants, while still maximising the usefulness of the data to researchers. This emerging standard in privacy protection and disclosure control, which is known as differential privacy, uses a process-driven rather than data-centred approach to protecting privacy. PMID:24721230
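
    The abstract names differential privacy but does not say which mechanism the authors have in mind. As a minimal sketch only, the Laplace mechanism (a standard textbook construction, assumed here rather than taken from the paper) can release a noisy count, for example the number of samples carrying a given pathogen variant. The function names and the example count are hypothetical.

```python
import random


def laplace_noise(scale, rng):
    """Draw one sample from Laplace(0, scale).

    The difference of two independent exponential variables with
    rate 1/scale follows a Laplace distribution with that scale.
    """
    rate = 1.0 / scale
    return rng.expovariate(rate) - rng.expovariate(rate)


def private_count(true_count, epsilon, rng):
    """Release a count under epsilon-differential privacy.

    A counting query changes by at most 1 when one participant is
    added or removed (sensitivity 1), so the noise scale is 1/epsilon.
    """
    return true_count + laplace_noise(1.0 / epsilon, rng)


if __name__ == "__main__":
    rng = random.Random(0)
    # Hypothetical query: 120 samples in the database carry the variant.
    print(private_count(120, epsilon=0.5, rng=rng))
```

    A smaller epsilon gives stronger privacy and noisier answers; the right trade-off depends on the study and is not specified in the abstract.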

  9. Ensuring privacy in the study of pathogen genetics.

    PubMed

    Mehta, Sanjay R; Vinterbo, Staal A; Little, Susan J

    2014-08-01

    Rapid growth in the genetic sequencing of pathogens in recent years has led to the creation of large sequence databases. This aggregated sequence data can be very useful for tracking and predicting epidemics of infectious diseases. However, the balance between the potential public health benefit and the risk to personal privacy for individuals whose genetic data (personal or pathogen) are included in such work has been difficult to delineate, because neither the true benefit nor the actual risk to participants has been adequately defined. Existing approaches to minimise the risk of privacy loss to participants are based on de-identification of data by removal of a predefined set of identifiers. These approaches neither guarantee privacy nor protect the usefulness of the data. We propose a new approach to privacy protection that will quantify the risk to participants, while still maximising the usefulness of the data to researchers. This emerging standard in privacy protection and disclosure control, which is known as differential privacy, uses a process-driven rather than data-centred approach to protecting privacy. Copyright © 2014 Elsevier Ltd. All rights reserved.

  10. Molecular phylogeny of choanoflagellates, the sister group to Metazoa

    PubMed Central

    Carr, M.; Leadbeater, B. S. C.; Hassan, R.; Nelson, M.; Baldauf, S. L.

    2008-01-01

    Choanoflagellates are single-celled aquatic flagellates with a unique morphology consisting of a cell with a single flagellum surrounded by a “collar” of microvilli. They have long interested evolutionary biologists because of their striking resemblance to the collared cells (choanocytes) of sponges. Molecular phylogeny has confirmed a close relationship between choanoflagellates and Metazoa, and the first choanoflagellate genome sequence has recently been published. However, molecular phylogenetic studies within choanoflagellates are still extremely limited. Thus, little is known about choanoflagellate evolution or the exact nature of the relationship between choanoflagellates and Metazoa. We have sequenced four genes from a broad sampling of the morphological diversity of choanoflagellates including most species currently available in culture. Phylogenetic analyses of these sequences, alone and in combination, reject much of the traditional taxonomy of the group. The molecular data also strongly support choanoflagellate monophyly rejecting proposals that Metazoa were derived from a true choanoflagellate ancestor. Mapping of a complementary matrix of morphological and ecological traits onto the phylogeny allows a reinterpretation of choanoflagellate character evolution and predicts the nature of their last common ancestor. PMID:18922774

  11. Autozygome Sequencing Expands the Horizon of Human Knockout Research and Provides Novel Insights into Human Phenotypic Variation

    PubMed Central

    Anazi, Shamsa; Alshamekh, Shomoukh; Alkuraya, Fowzan S.

    2013-01-01

    The use of autozygosity as a mapping tool in the search for autosomal recessive disease genes is well established. We hypothesized that autozygosity not only unmasks the recessiveness of disease causing variants, but can also reveal natural knockouts of genes with less obvious phenotypic consequences. To test this hypothesis, we exome sequenced 77 well phenotyped individuals born to first cousin parents in search of genes that are biallelically inactivated. Using a very conservative estimate, we show that each of these individuals carries biallelic inactivation of 22.8 genes on average. For many of the 169 genes that appear to be biallelically inactivated, available data support involvement in modulating metabolism, immunity, perception, external appearance and other phenotypic aspects, and appear therefore to contribute to human phenotypic variation. Other genes with biallelic inactivation may contribute in yet unknown mechanisms or may be on their way to conversion into pseudogenes due to true recent dispensability. We conclude that sequencing the autozygome is an efficient way to map the contribution of genes to human phenotypic variation that goes beyond the classical definition of disease. PMID:24367280

  12. Single-cell mRNA cytometry via sequence-specific nanoparticle clustering and trapping

    NASA Astrophysics Data System (ADS)

    Labib, Mahmoud; Mohamadi, Reza M.; Poudineh, Mahla; Ahmed, Sharif U.; Ivanov, Ivaylo; Huang, Ching-Lung; Moosavi, Maral; Sargent, Edward H.; Kelley, Shana O.

    2018-05-01

    Cell-to-cell variation in gene expression creates a need for techniques that can characterize expression at the level of individual cells. This is particularly true for rare circulating tumour cells, in which subtyping and drug resistance are of intense interest. Here we describe a method for cell analysis—single-cell mRNA cytometry—that enables the isolation of rare cells from whole blood as a function of target mRNA sequences. This approach uses two classes of magnetic particles that are labelled to selectively hybridize with different regions of the target mRNA. Hybridization leads to the formation of large magnetic clusters that remain localized within the cells of interest, thereby enabling the cells to be magnetically separated. Targeting specific intracellular mRNAs enables circulating tumour cells to be distinguished from normal haematopoietic cells. No polymerase chain reaction amplification is required to determine RNA expression levels and genotype at the single-cell level, and minimal cell manipulation is required. To demonstrate this approach we use single-cell mRNA cytometry to detect clinically important sequences in prostate cancer specimens.

  13. Magmatic Diversity of the Wehrlitic Intrusions in the Oceanic Lower Crust of the Northern Oman Ophiolite

    NASA Astrophysics Data System (ADS)

    Kaneko, R.; Adachi, Y.; Miyashita, S.

    2014-12-01

    The Oman ophiolite extends along the east coast of Oman, and is the world's largest and best-preserved slice of obducted oceanic lithosphere. The magmatic history of this ophiolite is complex and is generally regarded as having occurred in three stages (MOR magmatism, subduction magmatism and intraplate magmatism). Wehrlitic intrusions constitute an important element of oceanic lower crust of the ophiolite, and numerous intrusions cut gabbro units in the northern Salahi block of this ophiolite. In this study area, we identified two different types of wehrlitic intrusions. One type of the intrusions mainly consists of dunite, plagioclase (Pl) wehrlite and mela-olivine (Ol) gabbro, in which the crystallization sequence is Ol followed by the contemporaneous crystallization of Pl and clinopyroxene (Cpx). This type is called "ordinary" wehrlitic intrusions and has similar mineral compositions to host gabbros (Adachi and Miyashita 2003; Kaneko et al. 2014). Another type of the intrusions is a single intrusion that crops out in an area 250 m × 150 m along Wadi Salahi. This intrusion consists of Pl-free "true" wehrlite, in which the crystallization sequence is Ol and then Cpx. The forsterite contents (Fo%) of Ol from the "ordinary" wehrlitic intrusions and "true" wehrlitic intrusions have ranges of 90.8-87.0 (NiO = 0.36-0.13 wt%) and 84.7 (NiO = 0.31 wt%), respectively. Cr numbers (Cr#) of Cr-spinel from the "true" wehrlitic intrusions show higher Cr# value of 0.85 than those of the "ordinary" wehrlitic intrusions (0.48-0.64). But the former is characterized by very high Fe3+ values (YFe3+ = 0.49-0.68). Kaneko et al. (2014) showed that the "ordinary" ubiquitous type has similar features to MOR magmatism and the depleted type in the Fizh block (Adachi and Miyashita 2003) links to subduction magmatism. These types are distinguished by their mineral chemistries (TiO2 and Na2O contents of Cpx). 
The TiO2 and Na2O contents of Cpx from the "true" wehrlitic intrusions have 0.38 wt% and 0.26 wt%, respectively, and plot on the field of MOR magmatism. The most-evolved Ol (Fo% = 84.7) from the wehrlitic intrusions has high NiO (0.31 wt%) and plots on the olivine mantle array (Takahashi 1986). It is suggested that heterogeneity of source mantle influences the magmatic diversity of the wehrlitic intrusions.

  14. A Bayesian taxonomic classification method for 16S rRNA gene sequences with improved species-level accuracy.

    PubMed

    Gao, Xiang; Lin, Huaiying; Revanna, Kashi; Dong, Qunfeng

    2017-05-10

    Species-level classification for 16S rRNA gene sequences remains a serious challenge for microbiome researchers, because existing taxonomic classification tools for 16S rRNA gene sequences either do not provide species-level classification, or their classification results are unreliable. The unreliable results are due to the limitations in the existing methods which either lack solid probabilistic-based criteria to evaluate the confidence of their taxonomic assignments, or use nucleotide k-mer frequency as the proxy for sequence similarity measurement. We have developed a method that shows significantly improved species-level classification results over existing methods. Our method calculates true sequence similarity between query sequences and database hits using pairwise sequence alignment. Taxonomic classifications are assigned from the species to the phylum levels based on the lowest common ancestors of multiple database hits for each query sequence, and further classification reliabilities are evaluated by bootstrap confidence scores. The novelty of our method is that the contribution of each database hit to the taxonomic assignment of the query sequence is weighted by a Bayesian posterior probability based upon the degree of sequence similarity of the database hit to the query sequence. Our method does not need any training datasets specific for different taxonomic groups. Instead only a reference database is required for aligning to the query sequences, making our method easily applicable for different regions of the 16S rRNA gene or other phylogenetic marker genes. Reliable species-level classification for 16S rRNA or other phylogenetic marker genes is critical for microbiome research. Our software shows significantly higher classification accuracy than the existing tools and we provide probabilistic-based confidence scores to evaluate the reliability of our taxonomic classification assignments based on multiple database matches to query sequences. 
Despite its higher computational costs, our method is still suitable for analyzing large-scale microbiome datasets for practical purposes. Furthermore, our method can be applied for taxonomic classification of any phylogenetic marker gene sequences. Our software, called BLCA, is freely available at https://github.com/qunfengdong/BLCA .
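
    The idea of weighting each database hit when assigning taxonomy can be illustrated with a small sketch. This is not BLCA's implementation: the weights below are plain normalized similarity scores standing in for the Bayesian posterior probabilities described in the abstract, the bootstrap step is omitted, and the rank list, function name and example hits are all hypothetical.

```python
from collections import defaultdict

# Ranks to report, from broadest to most specific (illustrative subset).
RANKS = ["phylum", "genus", "species"]


def weighted_assignment(hits):
    """Assign a taxon at each rank from weighted database hits.

    hits: list of (similarity, lineage) pairs, where lineage maps each
    rank in RANKS to a taxon name.
    Returns {rank: (best_taxon, support)}, with support in [0, 1].
    """
    total = sum(similarity for similarity, _ in hits)
    result = {}
    for rank in RANKS:
        support = defaultdict(float)
        for similarity, lineage in hits:
            # Assumption: weight = normalized similarity, a stand-in
            # for the posterior probability used by the real method.
            support[lineage[rank]] += similarity / total
        best = max(support, key=support.get)
        result[rank] = (best, support[best])
    return result


if __name__ == "__main__":
    hits = [
        (0.99, {"phylum": "Proteobacteria", "genus": "Escherichia", "species": "E. coli"}),
        (0.98, {"phylum": "Proteobacteria", "genus": "Escherichia", "species": "E. fergusonii"}),
        (0.97, {"phylum": "Proteobacteria", "genus": "Escherichia", "species": "E. coli"}),
    ]
    for rank, (taxon, score) in weighted_assignment(hits).items():
        print(rank, taxon, round(score, 3))
```

    In this toy case the hits agree fully at the phylum and genus levels and only partly at the species level, which is the situation a per-rank confidence score is meant to expose.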

  15. Oligonucleotides as antivirals: dream or realistic perspective?

    PubMed

    Van Aerschot, Arthur

    2006-09-01

    Many reports have been published on antiviral activity of synthetic oligonucleotides, targeted to act either by a true antisense effect or via non-sequence specific interactions. This short review will try to evaluate the current status of the field by focusing on the effects as reported for inhibition of either HSV-1, HCMV or HIV-1. Following an introduction with a historical background and a brief discussion on the different types of constructs and mechanisms of action, the therapeutic potential of antisense oligonucleotides as antivirals, as well as possible pitfalls upon their evaluation will be discussed.

  16. Scaling effect of fraction of vegetation cover retrieved by algorithms based on linear mixture model

    NASA Astrophysics Data System (ADS)

    Obata, Kenta; Miura, Munenori; Yoshioka, Hiroki

    2010-08-01

    Differences in spatial resolution among sensors have been a source of error among satellite data products, known as a scaling effect. This study investigates the mechanism of the scaling effect on fraction of vegetation cover (FVC) retrieved by a linear mixture model which employs NDVI as one of the constraints. The scaling effect is induced by the differences in texture, and the differences between the true endmember spectra and the endmember spectra assumed during retrievals. A mechanism of the scaling effect was analyzed by focusing on the monotonic behavior of spatially averaged FVC as a function of spatial resolution. The number of endmembers is limited to two so that the investigation can proceed analytically. Although the spatially averaged NDVI varies monotonically with spatial resolution, the corresponding FVC values do not always vary monotonically. The conditions under which the averaged FVC varies monotonically for a certain sequence of spatial resolutions were derived analytically. The increasing and decreasing trend of monotonic behavior can be predicted from the true and assumed endmember spectra of vegetation and non-vegetation classes regardless of the distributions of the vegetation class within a fixed area. The results imply that the scaling effect on FVC is more complicated than that on NDVI, since, unlike NDVI, FVC becomes non-monotonic under a certain condition determined by the true and assumed endmember spectra.
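
    A small numerical sketch can show why retrieved vegetation cover depends on spatial resolution. It assumes a two-endmember linear mixture in reflectance space and a simple NDVI-scaling retrieval; this retrieval is chosen for illustration and is not necessarily the algorithm analysed in the study. The endmember reflectances and function names are hypothetical.

```python
def ndvi(red, nir):
    """Normalized difference vegetation index from two reflectances."""
    return (nir - red) / (nir + red)


def mix(fraction, veg, soil):
    """Linear mixture of (red, nir) endmember reflectances."""
    return tuple(fraction * v + (1.0 - fraction) * s for v, s in zip(veg, soil))


def fvc_from_ndvi(value, ndvi_veg, ndvi_soil):
    """Assumed retrieval: rescale NDVI between the two endmember NDVIs."""
    return (value - ndvi_soil) / (ndvi_veg - ndvi_soil)


def scaling_demo(fractions, veg, soil):
    """Return (mean of fine-resolution retrievals, coarse-resolution retrieval)."""
    ndvi_veg, ndvi_soil = ndvi(*veg), ndvi(*soil)
    pixels = [mix(f, veg, soil) for f in fractions]
    # Fine resolution: retrieve per pixel, then average the retrievals.
    fine = sum(fvc_from_ndvi(ndvi(*p), ndvi_veg, ndvi_soil) for p in pixels) / len(pixels)
    # Coarse resolution: average the reflectances first, then retrieve once.
    red = sum(p[0] for p in pixels) / len(pixels)
    nir = sum(p[1] for p in pixels) / len(pixels)
    coarse = fvc_from_ndvi(ndvi(red, nir), ndvi_veg, ndvi_soil)
    return fine, coarse


if __name__ == "__main__":
    # Hypothetical (red, nir) reflectances for vegetation and bare soil.
    fine, coarse = scaling_demo([0.0, 1.0], veg=(0.05, 0.5), soil=(0.2, 0.3))
    print(fine, coarse)
```

    The two numbers differ because NDVI is a ratio and therefore not linear in the mixed reflectances, so averaging before and after retrieval does not commute.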

  17. Pattern pluralism and the Tree of Life hypothesis

    PubMed Central

    Doolittle, W. Ford; Bapteste, Eric

    2007-01-01

    Darwin claimed that a unique inclusively hierarchical pattern of relationships between all organisms based on their similarities and differences [the Tree of Life (TOL)] was a fact of nature, for which evolution, and in particular a branching process of descent with modification, was the explanation. However, there is no independent evidence that the natural order is an inclusive hierarchy, and incorporation of prokaryotes into the TOL is especially problematic. The only data sets from which we might construct a universal hierarchy including prokaryotes, the sequences of genes, often disagree and can seldom be proven to agree. Hierarchical structure can always be imposed on or extracted from such data sets by algorithms designed to do so, but at its base the universal TOL rests on an unproven assumption about pattern that, given what we know about process, is unlikely to be broadly true. This is not to say that similarities and differences between organisms are not to be accounted for by evolutionary mechanisms, but descent with modification is only one of these mechanisms, and a single tree-like pattern is not the necessary (or expected) result of their collective operation. Pattern pluralism (the recognition that different evolutionary models and representations of relationships will be appropriate, and true, for different taxa or at different scales or for different purposes) is an attractive alternative to the quixotic pursuit of a single true TOL. PMID:17261804

  18. Protein 3D Structure Computed from Evolutionary Sequence Variation

    PubMed Central

    Sheridan, Robert; Hopf, Thomas A.; Pagnani, Andrea; Zecchina, Riccardo; Sander, Chris

    2011-01-01

    The evolutionary trajectory of a protein through sequence space is constrained by its function. Collections of sequence homologs record the outcomes of millions of evolutionary experiments in which the protein evolves according to these constraints. Deciphering the evolutionary record held in these sequences and exploiting it for predictive and engineering purposes presents a formidable challenge. The potential benefit of solving this challenge is amplified by the advent of inexpensive high-throughput genomic sequencing. In this paper we ask whether we can infer evolutionary constraints from a set of sequence homologs of a protein. The challenge is to distinguish true co-evolution couplings from the noisy set of observed correlations. We address this challenge using a maximum entropy model of the protein sequence, constrained by the statistics of the multiple sequence alignment, to infer residue pair couplings. Surprisingly, we find that the strength of these inferred couplings is an excellent predictor of residue-residue proximity in folded structures. Indeed, the top-scoring residue couplings are sufficiently accurate and well-distributed to define the 3D protein fold with remarkable accuracy. We quantify this observation by computing, from sequence alone, all-atom 3D structures of fifteen test proteins from different fold classes, ranging in size from 50 to 260 residues, including a G-protein coupled receptor. These blinded inferences are de novo, i.e., they do not use homology modeling or sequence-similar fragments from known structures. The co-evolution signals provide sufficient information to determine accurate 3D protein structure to 2.7–4.8 Å Cα-RMSD error relative to the observed structure, over at least two-thirds of the protein (method called EVfold, details at http://EVfold.org). 
This discovery provides insight into essential interactions constraining protein evolution and will facilitate a comprehensive survey of the universe of protein structures, new strategies in protein and drug design, and the identification of functional genetic variants in normal and disease genomes. PMID:22163331

  19. MG-Digger: An Automated Pipeline to Search for Giant Virus-Related Sequences in Metagenomes

    PubMed Central

    Verneau, Jonathan; Levasseur, Anthony; Raoult, Didier; La Scola, Bernard; Colson, Philippe

    2016-01-01

    The number of metagenomic studies conducted each year is growing dramatically. Storage and analysis of such big data is difficult and time-consuming. Interestingly, analysis shows that environmental and human metagenomes include a significant amount of non-annotated sequences, representing a ‘dark matter.’ We established a bioinformatics pipeline that automatically detects metagenome reads matching query sequences from a given set and applied this tool to the detection of sequences matching large and giant DNA viral members of the proposed order Megavirales or virophages. A total of 1,045 environmental and human metagenomes (≈ 1 Terabase) were collected, processed, and stored on our bioinformatics server. In addition, nucleotide and protein sequences from 93 Megavirales representatives, including 19 giant viruses of amoeba, and 5 virophages, were collected. The pipeline was generated by scripts written in Python language and entitled MG-Digger. Metagenomes previously found to contain megavirus-like sequences were tested as controls. MG-Digger was able to annotate 100s of metagenome sequences as best matching those of giant viruses. These sequences were most often found to be similar to phycodnavirus or mimivirus sequences, but included reads related to recently available pandoraviruses, Pithovirus sibericum, and faustoviruses. Compared to other tools, MG-Digger combined stand-alone use on Linux or Windows operating systems through a user-friendly interface, implementation of ready-to-use customized metagenome databases and query sequence databases, adjustable parameters for BLAST searches, and creation of output files containing selected reads with best match identification. Compared to Metavir 2, a reference tool in viral metagenome analysis, MG-Digger detected 8% more true positive Megavirales-related reads in a control metagenome. 
The present work shows that massive, automated and recurrent analyses of metagenomes are effective in improving knowledge about the presence and prevalence of giant viruses in the environment and the human body. PMID:27065984

  20. Design and characterization of a nanopore-coupled polymerase for single-molecule DNA sequencing by synthesis on an electrode array

    PubMed Central

    Stranges, P. Benjamin; Palla, Mirkó; Kalachikov, Sergey; Nivala, Jeff; Dorwart, Michael; Trans, Andrew; Kumar, Shiv; Porel, Mintu; Chien, Minchen; Tao, Chuanjuan; Morozova, Irina; Li, Zengmin; Shi, Shundi; Aberra, Aman; Arnold, Cleoma; Yang, Alexander; Aguirre, Anne; Harada, Eric T.; Korenblum, Daniel; Pollard, James; Bhat, Ashwini; Gremyachinskiy, Dmitriy; Bibillo, Arek; Chen, Roger; Davis, Randy; Russo, James J.; Fuller, Carl W.; Roever, Stefan; Ju, Jingyue; Church, George M.

    2016-01-01

    Scalable, high-throughput DNA sequencing is a prerequisite for precision medicine and biomedical research. Recently, we presented a nanopore-based sequencing-by-synthesis (Nanopore-SBS) approach, which used a set of nucleotides with polymer tags that allow discrimination of the nucleotides in a biological nanopore. Here, we designed and covalently coupled a DNA polymerase to an α-hemolysin (αHL) heptamer using the SpyCatcher/SpyTag conjugation approach. These porin–polymerase conjugates were inserted into lipid bilayers on a complementary metal oxide semiconductor (CMOS)-based electrode array for high-throughput electrical recording of DNA synthesis. The designed nanopore construct successfully detected the capture of tagged nucleotides complementary to a DNA base on a provided template. We measured over 200 tagged-nucleotide signals for each of the four bases and developed a classification method to uniquely distinguish them from each other and background signals. The probability of falsely identifying a background event as a true capture event was less than 1.2%. In the presence of all four tagged nucleotides, we observed sequential additions in real time during polymerase-catalyzed DNA synthesis. Single-polymerase coupling to a nanopore, in combination with the Nanopore-SBS approach, can provide the foundation for a low-cost, single-molecule, electronic DNA-sequencing platform. PMID:27729524

  1. Selection of optimal oligonucleotide probes for microarrays using multiple criteria, global alignment and parameter estimation.

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Li, Xingyuan; He, Zhili; Zhou, Jizhong

    2005-10-30

    The oligonucleotide specificity for microarray hybridization can be predicted by its sequence identity to non-targets, continuous stretch to non-targets, and/or binding free energy to non-targets. Most currently available programs only use one or two of these criteria, which may choose 'false' specific oligonucleotides or miss 'true' optimal probes in a considerable proportion. We have developed a software tool, called CommOligo, using new algorithms and all three criteria for selection of optimal oligonucleotide probes. A series of filters, including sequence identity, free energy, continuous stretch, GC content, self-annealing, distance to the 3'-untranslated region (3'-UTR) and melting temperature (Tm), are used to check each possible oligonucleotide. A sequence identity is calculated based on gapped global alignments. A traversal algorithm is used to generate alignments for free energy calculation. The optimal Tm interval is determined based on probe candidates that have passed all other filters. Final probes are picked using a combination of user-configurable piece-wise linear functions and an iterative process. The thresholds for identity, stretch and free energy filters are automatically determined from experimental data by an accessory software tool, CommOligo_PE (CommOligo Parameter Estimator). The program was used to design probes for both whole-genome and highly homologous sequence data. CommOligo and CommOligo_PE are freely available to academic users upon request.
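
    Two of the filters listed in the abstract, GC content and continuous stretch to non-targets, are simple enough to sketch. The code below is an illustration only and does not reproduce CommOligo: the function names, the thresholds and the example sequences are hypothetical, and the identity, free energy, self-annealing and Tm filters are left out.

```python
def gc_content(seq):
    """Fraction of G and C bases in a sequence."""
    seq = seq.upper()
    return (seq.count("G") + seq.count("C")) / len(seq)


def longest_shared_stretch(probe, nontarget):
    """Length of the longest contiguous run shared by two sequences.

    Standard longest-common-substring dynamic programme, kept to two
    rows so memory stays linear in the non-target length.
    """
    best = 0
    previous = [0] * (len(nontarget) + 1)
    for base_p in probe:
        current = [0] * (len(nontarget) + 1)
        for j, base_n in enumerate(nontarget, start=1):
            if base_p == base_n:
                current[j] = previous[j - 1] + 1
                best = max(best, current[j])
        previous = current
    return best


def passes_filters(probe, nontargets, gc_range=(0.4, 0.6), max_stretch=15):
    """Apply the two illustrative filters; thresholds are assumptions."""
    low, high = gc_range
    if not low <= gc_content(probe) <= high:
        return False
    return all(longest_shared_stretch(probe, n) <= max_stretch for n in nontargets)


if __name__ == "__main__":
    print(passes_filters("ATGCATGC", ["TTTTTTTT"]))
```

    In practice each filter would be tuned against experimental data, which is the role the abstract assigns to CommOligo_PE.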

  2. The missing indels: an estimate of indel variation in a human genome and analysis of factors that impede detection

    PubMed Central

    Jiang, Yue; Turinsky, Andrei L.; Brudno, Michael

    2015-01-01

    With the development of High-Throughput Sequencing (HTS) thousands of human genomes have now been sequenced. Whenever different studies analyze the same genome they usually agree on the amount of single-nucleotide polymorphisms, but differ dramatically on the number of insertion and deletion variants (indels). Furthermore, there is evidence that indels are often severely under-reported. In this manuscript we derive the total number of indel variants in a human genome by combining data from different sequencing technologies, while assessing the indel detection accuracy. Our estimate of approximately 1 million indels in a Yoruban genome is much higher than the results reported in several recent HTS studies. We identify two key sources of difficulties in indel detection: the insufficient coverage, read length or alignment quality; and the presence of repeats, including short interspersed elements and homopolymers/dimers. We quantify the effect of these factors on indel detection. The quality of sequencing data plays a major role in improving indel detection by HTS methods. However, many indels exist in long homopolymers and repeats, where their detection is severely impeded. The true number of indel events is likely even higher than our current estimates, and new techniques and technologies will be required to detect them. PMID:26130710

  3. Adaptation of the Haloarcula hispanica CRISPR-Cas system to a purified virus strictly requires a priming process

    PubMed Central

    Li, Ming; Wang, Rui; Zhao, Dahe; Xiang, Hua

    2014-01-01

    The clustered regularly interspaced short palindromic repeat (CRISPR)-Cas system mediates adaptive immunity against foreign nucleic acids in prokaryotes. However, efficient adaptation of a native CRISPR to purified viruses has only been observed for the type II-A system from a Streptococcus thermophilus industry strain, and rarely reported for laboratory strains. Here, we provide a second native system showing efficient adaptation. Infected by a newly isolated virus HHPV-2, Haloarcula hispanica type I-B CRISPR system acquired spacers discriminatively from viral sequences. Unexpectedly, in addition to Cas1, Cas2 and Cas4, this process also requires Cas3 and at least partial Cascade proteins, which are involved in interference and/or CRISPR RNA maturation. Intriguingly, a preexisting spacer partially matching a viral sequence is also required, and spacer acquisition from upstream and downstream sequences of its target sequence (i.e. priming protospacer) shows different strand bias. This evidence strongly indicates that adaptation in this system strictly requires a priming process. This requirement, if it also holds for other CRISPR systems as implied by our bioinformatic analysis, may help to explain failures to observe efficient adaptation to purified viruses in many laboratory strains, and the discrimination mechanism at the adaptation level that has confused scientists for years. PMID:24265226

  4. ADEPT, a dynamic next generation sequencing data error-detection program with trimming

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Feng, Shihai; Lo, Chien-Chi; Li, Po-E

    Illumina is the most widely used next generation sequencing technology and produces millions of short reads that contain errors. These sequencing errors constitute a major problem in applications such as de novo genome assembly, metagenomics analysis and single nucleotide polymorphism discovery. In this study, we present ADEPT, a dynamic error detection method, based on the quality scores of each nucleotide and its neighboring nucleotides, together with their positions within the read and compares this to the position-specific quality score distribution of all bases within the sequencing run. This method greatly improves upon other available methods in terms of the true positive rate of error discovery without affecting the false positive rate, particularly within the middle of reads. We conclude that ADEPT is the only tool to date that dynamically assesses errors within reads by comparing position-specific and neighboring base quality scores with the distribution of quality scores for the dataset being analyzed. The result is a method that is less prone to position-dependent under-prediction, which is one of the most prominent issues in error prediction. The outcome is that ADEPT improves upon prior efforts in identifying true errors, primarily within the middle of reads, while reducing the false positive rate.

  5. ADEPT, a dynamic next generation sequencing data error-detection program with trimming

    DOE PAGES

    Feng, Shihai; Lo, Chien-Chi; Li, Po-E; ...

    2016-02-29

    Illumina is the most widely used next generation sequencing technology and produces millions of short reads that contain errors. These sequencing errors constitute a major problem in applications such as de novo genome assembly, metagenomics analysis and single nucleotide polymorphism discovery. In this study, we present ADEPT, a dynamic error detection method, based on the quality scores of each nucleotide and its neighboring nucleotides, together with their positions within the read and compares this to the position-specific quality score distribution of all bases within the sequencing run. This method greatly improves upon other available methods in terms of the true positive rate of error discovery without affecting the false positive rate, particularly within the middle of reads. We conclude that ADEPT is the only tool to date that dynamically assesses errors within reads by comparing position-specific and neighboring base quality scores with the distribution of quality scores for the dataset being analyzed. The result is a method that is less prone to position-dependent under-prediction, which is one of the most prominent issues in error prediction. The outcome is that ADEPT improves upon prior efforts in identifying true errors, primarily within the middle of reads, while reducing the false positive rate.

  6. Discovery of parvovirus-related sequences in an unexpected broad range of animals.

    PubMed

    François, S; Filloux, D; Roumagnac, P; Bigot, D; Gayral, P; Martin, D P; Froissart, R; Ogliastro, M

    2016-09-07

    Our knowledge of the genetic diversity and host ranges of viruses is fragmentary. This is particularly true for the Parvoviridae family. Genetic diversity studies of single stranded DNA viruses within this family have been largely focused on arthropod- and vertebrate-infecting species that cause diseases of humans and our domesticated animals: a focus that has biased our perception of parvovirus diversity. While metagenomics approaches could help rectify this bias, so too could transcriptomics studies. Large amounts of transcriptomic data are available for a diverse array of animal species and whenever this data has inadvertently been gathered from virus-infected individuals, it could contain detectable viral transcripts. We therefore performed a systematic search for parvovirus-related sequences (PRSs) within publicly available transcript, genome and protein databases and eleven new transcriptome datasets. This revealed 463 PRSs in the transcript databases of 118 animals. At least 41 of these PRSs are likely integrated within animal genomes in that they were also found within genomic sequence databases. Besides illuminating the ubiquity of parvoviruses, the number of parvoviral sequences discovered within public databases revealed numerous previously unknown parvovirus-host combinations; particularly in invertebrates. Our findings suggest that the host-ranges of extant parvoviruses might span the entire animal kingdom.

  7. Event-specific qualitative and quantitative PCR detection of the GMO carnation (Dianthus caryophyllus) variety Moonlite based upon the 5'-transgene integration sequence.

    PubMed

    Li, P; Jia, J W; Jiang, L X; Zhu, H; Bai, L; Wang, J B; Tang, X M; Pan, A H

    2012-04-27

    To ensure the implementation of genetically modified organism (GMO)-labeling regulations, an event-specific detection method was developed based on the junction sequence of an exogenous integrant in the transgenic carnation variety Moonlite. The 5'-transgene integration sequence was isolated by thermal asymmetric interlaced PCR. Based upon the 5'-transgene integration sequence, the event-specific primers and TaqMan probe were designed to amplify the fragments, which spanned the exogenous DNA and carnation genomic DNA. Qualitative and quantitative PCR assays were developed employing the designed primers and probe. The detection limit of the qualitative PCR assay was 0.05% for Moonlite in 100 ng total carnation genomic DNA, corresponding to about 79 copies of the carnation haploid genome; the limit of detection and quantification of the quantitative PCR assay were estimated to be 38 and 190 copies of haploid carnation genomic DNA, respectively. Carnation samples with different contents of genetically modified components were quantified and the bias between the observed and true values of three samples were lower than the acceptance criterion (<25%) of the GMO detection method. These results indicated that these event-specific methods would be useful for the identification and quantification of the GMO carnation Moonlite.
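The copy-number figure in the abstract above can be checked with simple arithmetic. The sketch below is only illustrative: the function names are hypothetical, and the haploid genome mass is not stated in the abstract but is back-calculated here from its own numbers (0.05% of 100 ng corresponding to about 79 copies).

```python
def target_mass_pg(total_dna_ng: float, gm_fraction: float) -> float:
    """Mass of GM-derived DNA in picograms for a given total DNA mass and GM fraction."""
    return total_dna_ng * 1000.0 * gm_fraction  # 1 ng = 1000 pg


def implied_haploid_genome_mass_pg(total_dna_ng: float, gm_fraction: float, copies: float) -> float:
    """Haploid genome mass implied by a stated copy number (an inference, not a reported value)."""
    return target_mass_pg(total_dna_ng, gm_fraction) / copies


# Values taken from the abstract: 0.05% Moonlite in 100 ng, about 79 copies.
mass = target_mass_pg(100.0, 0.0005)
per_genome = implied_haploid_genome_mass_pg(100.0, 0.0005, 79)
print(f"GM DNA: {mass:.1f} pg; implied haploid genome mass: {per_genome:.2f} pg")
```

Under these figures the 0.05% limit corresponds to roughly 50 pg of Moonlite DNA, which implies a haploid genome mass of about 0.63 pg per copy.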

  8. Comment: Characterization of Two Historic Smallpox Specimens from a Czech Museum.

    PubMed

    Porter, Ashleigh F; Duggan, Ana T; Poinar, Hendrik N; Holmes, Edward C

    2017-09-28

    The complete genome sequences of two strains of variola virus (VARV) sampled from human smallpox specimens present in the Czech National Museum, Prague, were recently determined, with one of the sequences estimated to date to the mid-19th century. Using molecular clock methods, the authors of this study go on to infer that the currently available strains of VARV share an older common ancestor, at around 1350 AD, than some recent estimates based on other archival human samples. Herein, we show that the two Czech strains exhibit anomalous branch lengths given their proposed age, and by assuming a constant rate of evolutionary change across the rest of the VARV phylogeny estimate that their true age in fact lies between 1918 and 1937. We therefore suggest that the age of the common ancestor of currently available VARV genomes most likely dates to late 16th and early 17th centuries and not ~1350 AD.

  9. The Deinococcus-Thermus phylum and the effect of rRNA composition on phylogenetic tree construction

    NASA Technical Reports Server (NTRS)

    Weisburg, W. G.; Giovannoni, S. J.; Woese, C. R.

    1989-01-01

    Through comparative analysis of 16S ribosomal RNA sequences, it can be shown that two seemingly dissimilar types of eubacteria, Deinococcus and the ubiquitous hot spring organism Thermus, are distantly but specifically related to one another. This confirms an earlier report based upon 16S rRNA oligonucleotide cataloging studies (Hensel et al., 1986). Their two lineages form a distinctive grouping within the eubacteria that deserved the taxonomic status of a phylum. The (partial) sequence of T. aquaticus rRNA appears relatively close to those of other thermophilic eubacteria, e.g. Thermotoga maritima and Thermomicrobium roseum. However, this closeness does not reflect a true evolutionary closeness; rather it is due to a "thermophilic convergence", the result of unusually high G+C composition in the rRNAs of thermophilic bacteria. Unless such compositional biases are taken into account, the branching order and root of phylogenetic trees can be incorrectly inferred.

  10. Comment: Characterization of Two Historic Smallpox Specimens from a Czech Museum

    PubMed Central

    Porter, Ashleigh F.; Duggan, Ana T.

    2017-01-01

    The complete genome sequences of two strains of variola virus (VARV) sampled from human smallpox specimens present in the Czech National Museum, Prague, were recently determined, with one of the sequences estimated to date to the mid-19th century. Using molecular clock methods, the authors of this study go on to infer that the currently available strains of VARV share an older common ancestor, at around 1350 AD, than some recent estimates based on other archival human samples. Herein, we show that the two Czech strains exhibit anomalous branch lengths given their proposed age, and by assuming a constant rate of evolutionary change across the rest of the VARV phylogeny estimate that their true age in fact lies between 1918 and 1937. We therefore suggest that the age of the common ancestor of currently available VARV genomes most likely dates to late 16th and early 17th centuries and not ~1350 AD. PMID:28956829

  11. Analysis of Clinical Ostreid Herpesvirus 1 (Malacoherpesviridae) Specimens by Sequencing Amplified Fragments from Three Virus Genome Areas

    PubMed Central

    Moreau, Pierrick; Faury, Nicole; Pepin, Jean-François; Segarra, Amélie; Webb, Stephen

    2012-01-01

    Although there are a number of ostreid herpesvirus 1 (OsHV-1) variants, it is expected that the true diversity of this virus will be known only after the analysis of significantly more data. To this end, we analyzed 72 OsHV-1 “specimens” collected mainly in France over an 18-year period, from 1993 to 2010. Additional samples were also collected in Ireland, the United States, China, Japan, and New Zealand. Three virus genome regions (open reading frame 4 [ORF4], ORF35, -36, -37, and -38, and ORF42 and -43) were selected for PCR analysis and sequencing. Although ORF4 appeared to be the most polymorphic genome area, distinguishing several genogroups, ORF35, -36, -37, and -38 and ORF42 and -43 also showed variations useful in grouping subpopulations of this virus. PMID:22419803

  12. Empirical Bayes Estimation of Coalescence Times from Nucleotide Sequence Data.

    PubMed

    King, Leandra; Wakeley, John

    2016-09-01

    We demonstrate the advantages of using information at many unlinked loci to better calibrate estimates of the time to the most recent common ancestor (TMRCA) at a given locus. To this end, we apply a simple empirical Bayes method to estimate the TMRCA. This method is both asymptotically optimal, in the sense that the estimator converges to the true value when the number of unlinked loci for which we have information is large, and has the advantage of not making any assumptions about demographic history. The algorithm works as follows: we first split the sample at each locus into inferred left and right clades to obtain many estimates of the TMRCA, which we can average to obtain an initial estimate of the TMRCA. We then use nucleotide sequence data from other unlinked loci to form an empirical distribution that we can use to improve this initial estimate. Copyright © 2016 by the Genetics Society of America.

  13. A possible brown dwarf companion to Gliese 569

    NASA Technical Reports Server (NTRS)

    Forrest, W. J.; Shure, Mark; Skrutskie, M. F.

    1988-01-01

    A faint cool companion to Gliese 569, discovered during an IR imaging survey of nearby stars, may be the lowest-mass stellar object yet found. The companion is somewhat cooler in its 1.65-3.75-micron energy distribution than the coolest known main-sequence stars, indicating a low mass. Despite its lower temperature, it is more luminous than similar extremely low-mass stars, suggesting that it is either a young low-mass star evolving toward the main sequence or a cooling substellar brown dwarf. The primary star has emission lines and a low space velocity and exhibits flaring, all of which imply youth for this system. Observations of Gliese 569 and its companion over a period of 2 yr confirm the common proper motion expected of a true binary. The 5-arcsec apparent separation (50 AU) implies an orbital period of roughly 500 yr, which will permit an eventual direct determination of the mass of the companion.
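The quoted orbital period follows from Kepler's third law in solar units, P² = a³ / M (P in years, a in AU, M in solar masses). The sketch below rests on two assumptions that the abstract does not state: the 50 AU projected separation is treated as the semi-major axis, and the total system mass is taken to be about 0.5 solar masses. The function name is hypothetical.

```python
import math


def orbital_period_years(semi_major_axis_au: float, total_mass_msun: float) -> float:
    """Kepler's third law in solar units: P^2 = a^3 / M."""
    return math.sqrt(semi_major_axis_au ** 3 / total_mass_msun)


# Assumed inputs: a = 50 AU (projected separation), M = 0.5 solar masses (illustrative).
period = orbital_period_years(50.0, 0.5)
print(f"Estimated orbital period: {period:.0f} yr")
```

With these assumed inputs the estimate is 500 yr, consistent with the "roughly 500 yr" in the abstract. A different total mass or a true semi-major axis larger than the projected separation would change the result.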

  14. DIRECT N-BODY MODELING OF THE OLD OPEN CLUSTER NGC 188: A DETAILED COMPARISON OF THEORETICAL AND OBSERVED BINARY STAR AND BLUE STRAGGLER POPULATIONS

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Geller, Aaron M.; Hurley, Jarrod R.; Mathieu, Robert D., E-mail: a-geller@northwestern.edu, E-mail: mathieu@astro.wisc.edu, E-mail: jhurley@astro.swin.edu.au

    2013-01-01

    Following on from a recently completed radial-velocity survey of the old (7 Gyr) open cluster NGC 188 in which we studied in detail the solar-type hard binaries and blue stragglers of the cluster, here we investigate the dynamical evolution of NGC 188 through a sophisticated N-body model. Importantly, we employ the observed binary properties of the young (180 Myr) open cluster M35, where possible, to guide our choices for parameters of the initial binary population. We apply pre-main-sequence tidal circularization and a substantial increase to the main-sequence tidal circularization rate, both of which are necessary to match the observed tidal circularization periods in the literature, including that of NGC 188. At 7 Gyr the main-sequence solar-type hard-binary population in the model matches that of NGC 188 in both binary frequency and distributions of orbital parameters. This agreement between the model and observations is in a large part due to the similarities between the NGC 188 and M35 solar-type binaries. Indeed, among the 7 Gyr main-sequence binaries in the model, only those with P ≳ 1000 days begin to show potentially observable evidence for modifications by dynamical encounters, even after 7 Gyr of evolution within the star cluster. This emphasizes the importance of defining accurate initial conditions for star cluster models, which we propose is best accomplished through comparisons with observations of young open clusters like M35. Furthermore, this finding suggests that observations of the present-day binaries in even old open clusters can provide valuable information on their primordial binary populations. However, despite the model's success at matching the observed solar-type main-sequence population, the model underproduces blue stragglers and produces an overabundance of long-period circular main-sequence-white-dwarf binaries as compared with the true cluster.
We explore several potential solutions to the paucity of blue stragglers and conclude that the model dramatically underproduces blue stragglers through mass-transfer processes. We suggest that common-envelope evolution may have been incorrectly imposed on the progenitors of the spurious long-period circular main-sequence-white-dwarf binaries, which perhaps instead should have gone through stable mass transfer to create blue stragglers, thereby bringing both the number and binary frequency of the blue straggler population in the model into agreement with the true blue stragglers in NGC 188. Thus, improvements in the physics of mass transfer and common-envelope evolution employed in the model may in fact solve both discrepancies with the observations. This project highlights the unique accessibility of open clusters to both comprehensive observational surveys and full-scale N-body simulations, both of which have only recently matured sufficiently to enable such a project, and underscores the importance of open clusters to the study of star cluster dynamics.

  15. Evaluation of the reproducibility of amplicon sequencing with Illumina MiSeq platform

    PubMed Central

    Van Nostrand, Joy D.; Ning, Daliang; Sun, Bo; Xue, Kai; Liu, Feifei; Deng, Ye; Liang, Yuting; Zhou, Jizhong

    2017-01-01

    Illumina’s MiSeq has become the dominant platform for gene amplicon sequencing in microbial ecology studies; however, various technical concerns, such as reproducibility, still exist. To assess reproducibility, 16S rRNA gene amplicons from 18 soil samples of a reciprocal transplantation experiment were sequenced on an Illumina MiSeq. The V4 region of 16S rRNA gene from each sample was sequenced in triplicate with each replicate having a unique barcode. The average OTU overlap, without considering sequence abundance, at a rarefaction level of 10,323 sequences was 33.4±2.1% and 20.2±1.7% between two and among three technical replicates, respectively. When OTU sequence abundance was considered, the average sequence abundance weighted OTU overlap was 85.6±1.6% and 81.2±2.1% for two and three replicates, respectively. Removing singletons significantly increased the overlap for both (~1–3%, p<0.001). Increasing the sequencing depth to 160,000 reads by deep sequencing increased OTU overlap both when sequence abundance was considered (95%) and when not (44%). However, if singletons were not removed the overlap between two technical replicates (not considering sequence abundance) plateaus at 39% with 30,000 sequences. Diversity measures were not affected by the low overlap as α-diversities were similar among technical replicates while β-diversities (Bray-Curtis) were much smaller among technical replicates than among treatment replicates (e.g., 0.269 vs. 0.374). Higher diversity coverage, but lower OTU overlap, was observed when replicates were sequenced in separate runs. Detrended correspondence analysis indicated that while there was considerable variation among technical replicates, the reproducibility was sufficient for detecting treatment effects for the samples examined. 
These results suggest that although there is variation among technical replicates, amplicon sequencing on MiSeq is useful for analyzing microbial community structure if used appropriately and with caution. For example, including technical replicates, removing spurious sequences and unrepresentative OTUs, using a clustering method with a high stringency for OTU generation, estimating treatment effects at higher taxonomic levels, and adapting the unique molecular identifier (UMI) and other newly developed methods to lower PCR and sequencing error and to identify true low abundance rare species all can increase reproducibility. PMID:28453559
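The two overlap measures discussed in the abstract above can be illustrated with a small sketch. The definitions used here are assumptions, since the abstract does not give formulas: unweighted overlap is taken as shared OTUs divided by all OTUs seen in either replicate, and abundance-weighted overlap as the share of all sequences that belong to shared OTUs. The function names, the per-replicate singleton filter, and the toy counts are hypothetical.

```python
def otu_overlap(rep_a: dict, rep_b: dict) -> tuple:
    """Return (unweighted, abundance_weighted) OTU overlap between two replicates.

    rep_a, rep_b map OTU identifiers to sequence counts.
    """
    present_a = {otu for otu, n in rep_a.items() if n > 0}
    present_b = {otu for otu, n in rep_b.items() if n > 0}
    shared = present_a & present_b
    union = present_a | present_b
    # Unweighted: fraction of observed OTUs that occur in both replicates.
    unweighted = len(shared) / len(union) if union else 0.0
    # Weighted: fraction of all sequences that fall in shared OTUs.
    total = sum(rep_a.values()) + sum(rep_b.values())
    shared_seqs = sum(rep_a[otu] + rep_b[otu] for otu in shared)
    weighted = shared_seqs / total if total else 0.0
    return unweighted, weighted


def drop_singletons(rep: dict) -> dict:
    """Remove OTUs represented by a single sequence (assumed per-replicate definition)."""
    return {otu: n for otu, n in rep.items() if n > 1}


# Toy data, not taken from the study.
rep_a = {"otu1": 50, "otu2": 30, "otu3": 1}
rep_b = {"otu1": 45, "otu2": 35, "otu4": 1}
print(otu_overlap(rep_a, rep_b))
print(otu_overlap(drop_singletons(rep_a), drop_singletons(rep_b)))
```

In the toy data the rare OTUs are unique to each replicate, so the unweighted overlap is low while the weighted overlap is high, and removing singletons raises both. This mirrors the qualitative pattern reported in the abstract, not its numbers.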

  16. Evaluation of the reproducibility of amplicon sequencing with Illumina MiSeq platform

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Wen, Chongqing; Wu, Liyou; Qin, Yujia

    Illumina's MiSeq has become the dominant platform for gene amplicon sequencing in microbial ecology studies; however, various technical concerns, such as reproducibility, still exist. To assess reproducibility, 16S rRNA gene amplicons from 18 soil samples of a reciprocal transplantation experiment were sequenced on an Illumina MiSeq. The V4 region of 16S rRNA gene from each sample was sequenced in triplicate with each replicate having a unique barcode. The average OTU overlap, without considering sequence abundance, at a rarefaction level of 10,323 sequences was 33.4±2.1% and 20.2±1.7% between two and among three technical replicates, respectively. When OTU sequence abundance was considered, the average sequence abundance weighted OTU overlap was 85.6±1.6% and 81.2±2.1% for two and three replicates, respectively. Removing singletons significantly increased the overlap for both (~1-3%, p<0.001). Increasing the sequencing depth to 160,000 reads by deep sequencing increased OTU overlap both when sequence abundance was considered (95%) and when not (44%). However, if singletons were not removed the overlap between two technical replicates (not considering sequence abundance) plateaus at 39% with 30,000 sequences. Diversity measures were not affected by the low overlap as α-diversities were similar among technical replicates while β-diversities (Bray-Curtis) were much smaller among technical replicates than among treatment replicates (e.g., 0.269 vs. 0.374). Higher diversity coverage, but lower OTU overlap, was observed when replicates were sequenced in separate runs. Detrended correspondence analysis indicated that while there was considerable variation among technical replicates, the reproducibility was sufficient for detecting treatment effects for the samples examined.
These results suggest that although there is variation among technical replicates, amplicon sequencing on MiSeq is useful for analyzing microbial community structure if used appropriately and with caution. For example, including technical replicates, removing spurious sequences and unrepresentative OTUs, using a clustering method with a high stringency for OTU generation, estimating treatment effects at higher taxonomic levels, and adapting the unique molecular identifier (UMI) and other newly developed methods to lower PCR and sequencing error and to identify true low abundance rare species all can increase reproducibility.

  17. Evaluation of the reproducibility of amplicon sequencing with Illumina MiSeq platform

    DOE PAGES

    Wen, Chongqing; Wu, Liyou; Qin, Yujia; ...

    2017-04-28

    Illumina's MiSeq has become the dominant platform for gene amplicon sequencing in microbial ecology studies; however, various technical concerns, such as reproducibility, still exist. To assess reproducibility, 16S rRNA gene amplicons from 18 soil samples of a reciprocal transplantation experiment were sequenced on an Illumina MiSeq. The V4 region of 16S rRNA gene from each sample was sequenced in triplicate with each replicate having a unique barcode. The average OTU overlap, without considering sequence abundance, at a rarefaction level of 10,323 sequences was 33.4±2.1% and 20.2±1.7% between two and among three technical replicates, respectively. When OTU sequence abundance was considered, the average sequence abundance weighted OTU overlap was 85.6±1.6% and 81.2±2.1% for two and three replicates, respectively. Removing singletons significantly increased the overlap for both (~1-3%, p<0.001). Increasing the sequencing depth to 160,000 reads by deep sequencing increased OTU overlap both when sequence abundance was considered (95%) and when not (44%). However, if singletons were not removed the overlap between two technical replicates (not considering sequence abundance) plateaus at 39% with 30,000 sequences. Diversity measures were not affected by the low overlap as α-diversities were similar among technical replicates while β-diversities (Bray-Curtis) were much smaller among technical replicates than among treatment replicates (e.g., 0.269 vs. 0.374). Higher diversity coverage, but lower OTU overlap, was observed when replicates were sequenced in separate runs. Detrended correspondence analysis indicated that while there was considerable variation among technical replicates, the reproducibility was sufficient for detecting treatment effects for the samples examined.
These results suggest that although there is variation among technical replicates, amplicon sequencing on MiSeq is useful for analyzing microbial community structure if used appropriately and with caution. For example, including technical replicates, removing spurious sequences and unrepresentative OTUs, using a clustering method with a high stringency for OTU generation, estimating treatment effects at higher taxonomic levels, and adapting the unique molecular identifier (UMI) and other newly developed methods to lower PCR and sequencing error and to identify true low abundance rare species all can increase reproducibility.

  18. Evaluation of the reproducibility of amplicon sequencing with Illumina MiSeq platform.

    PubMed

    Wen, Chongqing; Wu, Liyou; Qin, Yujia; Van Nostrand, Joy D; Ning, Daliang; Sun, Bo; Xue, Kai; Liu, Feifei; Deng, Ye; Liang, Yuting; Zhou, Jizhong

    2017-01-01

    Illumina's MiSeq has become the dominant platform for gene amplicon sequencing in microbial ecology studies; however, various technical concerns, such as reproducibility, still exist. To assess reproducibility, 16S rRNA gene amplicons from 18 soil samples of a reciprocal transplantation experiment were sequenced on an Illumina MiSeq. The V4 region of 16S rRNA gene from each sample was sequenced in triplicate with each replicate having a unique barcode. The average OTU overlap, without considering sequence abundance, at a rarefaction level of 10,323 sequences was 33.4±2.1% and 20.2±1.7% between two and among three technical replicates, respectively. When OTU sequence abundance was considered, the average sequence abundance weighted OTU overlap was 85.6±1.6% and 81.2±2.1% for two and three replicates, respectively. Removing singletons significantly increased the overlap for both (~1-3%, p<0.001). Increasing the sequencing depth to 160,000 reads by deep sequencing increased OTU overlap both when sequence abundance was considered (95%) and when not (44%). However, if singletons were not removed the overlap between two technical replicates (not considering sequence abundance) plateaus at 39% with 30,000 sequences. Diversity measures were not affected by the low overlap as α-diversities were similar among technical replicates while β-diversities (Bray-Curtis) were much smaller among technical replicates than among treatment replicates (e.g., 0.269 vs. 0.374). Higher diversity coverage, but lower OTU overlap, was observed when replicates were sequenced in separate runs. Detrended correspondence analysis indicated that while there was considerable variation among technical replicates, the reproducibility was sufficient for detecting treatment effects for the samples examined. 
These results suggest that although there is variation among technical replicates, amplicon sequencing on MiSeq is useful for analyzing microbial community structure if used appropriately and with caution. For example, including technical replicates, removing spurious sequences and unrepresentative OTUs, using a clustering method with a high stringency for OTU generation, estimating treatment effects at higher taxonomic levels, and adapting the unique molecular identifier (UMI) and other newly developed methods to lower PCR and sequencing error and to identify true low abundance rare species all can increase reproducibility.

  19. Determination of haplotypes at structurally complex regions using emulsion haplotype fusion PCR

    PubMed Central

    2012-01-01

    Background Genotyping and massively-parallel sequencing projects result in a vast amount of diploid data that is only rarely resolved into its constituent haplotypes. It is nevertheless this phased information that is transmitted from one generation to the next and is most directly associated with biological function and the genetic causes of biological effects. Despite progress made in genome-wide sequencing and phasing algorithms and methods, problems assembling (and reconstructing linear haplotypes in) regions of repetitive DNA and structural variation remain. These dynamic and structurally complex regions are often poorly understood from a sequence point of view. Regions such as these that are highly similar in their sequence tend to be collapsed onto the genome assembly. This in turn means downstream determination of the true sequence haplotype in these regions poses a particular challenge. For structurally complex regions, a more focussed approach to assembling haplotypes may be required. Results In order to investigate reconstruction of spatial information at structurally complex regions, we have used an emulsion haplotype fusion PCR approach to reproducibly link sequences of up to 1kb in length to allow phasing of multiple variants from neighbouring loci, using allele-specific PCR and sequencing to detect the phase. By using emulsion systems linking flanking regions to amplicons within the CNV, this led to the reconstruction of a 59kb haplotype across the DEFA1A3 CNV in HapMap individuals. Conclusion This study has demonstrated a novel use for emulsion haplotype fusion PCR in addressing the issue of reconstructing structural haplotypes at multiallelic copy variable regions, using the DEFA1A3 locus as an example. PMID:23231411

  20. Computational identification of developmental enhancers: conservation and function of transcription factor binding-site clusters in Drosophila melanogaster and Drosophila pseudoobscura

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Berman, Benjamin P.; Pfeiffer, Barret D.; Laverty, Todd R.

    2004-08-06

    Background The identification of sequences that control transcription in metazoans is a major goal of genome analysis. In a previous study, we demonstrated that searching for clusters of predicted transcription factor binding sites could discover active regulatory sequences, and identified 37 regions of the Drosophila melanogaster genome with high densities of predicted binding sites for five transcription factors involved in anterior-posterior embryonic patterning. Nine of these clusters overlapped known enhancers. Here, we report the results of in vivo functional analysis of 27 remaining clusters. Results We generated transgenic flies carrying each cluster attached to a basal promoter and reporter gene, and assayed embryos for reporter gene expression. Six clusters are enhancers of adjacent genes: giant, fushi tarazu, odd-skipped, nubbin, squeeze and pdm2; three drive expression in patterns unrelated to those of neighboring genes; the remaining 18 do not appear to have enhancer activity. We used the Drosophila pseudoobscura genome to compare patterns of evolution in and around the 15 positive and 18 false-positive predictions. Although conservation of primary sequence cannot distinguish true from false positives, conservation of binding-site clustering accurately discriminates functional binding-site clusters from those with no function. We incorporated conservation of binding-site clustering into a new genome-wide enhancer screen, and predict several hundred new regulatory sequences, including 85 adjacent to genes with embryonic patterns. Conclusions Measuring conservation of sequence features closely linked to function - such as binding-site clustering - makes better use of comparative sequence data than commonly used methods that examine only sequence identity.

  1. Not all are free-living: high-throughput DNA metabarcoding reveals a diverse community of protists parasitizing soil metazoa.

    PubMed

    Geisen, S; Laros, I; Vizcaíno, A; Bonkowski, M; de Groot, G A

    2015-09-01

    Protists, the most diverse eukaryotes, are largely considered to be free-living bacterivores, but vast numbers of taxa are known to parasitize plants or animals. High-throughput sequencing (HTS) approaches now commonly replace cultivation-based approaches in studying soil protists, but insights into common biases associated with this method are limited to aquatic taxa and samples. We created a mock community of common free-living soil protists (amoebae, flagellates, ciliates), extracted DNA and amplified it in the presence of metazoan DNA using 454 HTS. We aimed at evaluating whether HTS quantitatively reveals true relative abundances of soil protists and at investigating whether the expected protist community structure is altered by the co-amplification of metazoan-associated protist taxa. Indeed, HTS revealed fundamentally different protist communities from those expected. Ciliate sequences were highly over-represented, while those of most amoebae and flagellates were under-represented or totally absent. These results underpin the biases introduced by HTS that prevent reliable quantitative estimations of free-living protist communities. Furthermore, we detected a wide range of nonadded protist taxa probably introduced along with metazoan DNA, which altered the protist community structure. Among those, 20 taxa most closely resembled parasitic, often pathogenic taxa. Therewith, we provide the first HTS data in support of classical observational studies that showed that potential protist parasites are hosted by soil metazoa. Taken together, profound differences in amplification success between protist taxa and an inevitable co-extraction of protist taxa parasitizing soil metazoa obscure the true diversity of free-living soil protist communities. © 2015 John Wiley & Sons Ltd.

  2. Intensity-based dual model method for generation of synthetic CT images from standard T2-weighted MR images - Generalized technique for four different MR scanners.

    PubMed

    Koivula, Lauri; Kapanen, Mika; Seppälä, Tiina; Collan, Juhani; Dowling, Jason A; Greer, Peter B; Gustafsson, Christian; Gunnlaugsson, Adalsteinn; Olsson, Lars E; Wee, Leonard; Korhonen, Juha

    2017-12-01

    Recent studies have shown that it is possible to conduct entire radiotherapy treatment planning (RTP) workflow using only MR images. This study aims to develop a generalized intensity-based method to generate synthetic CT (sCT) images from standard T2-weighted (T2w) MR images of the pelvis. This study developed a generalized dual model HU conversion method to convert standard T2w MR image intensity values to synthetic HU values, separately inside and outside of atlas-segmented bone volume contour. The method was developed and evaluated with 20 and 35 prostate cancer patients, respectively. MR images with scanning sequences in clinical use were acquired with four different MR scanners of three vendors. For the generated synthetic CT (sCT) images of the 35 prostate patients, the mean (and maximal) HU differences in soft and bony tissue volumes were 16 ± 6 HUs (34 HUs) and -46 ± 56 HUs (181 HUs), respectively, against the true CT images. The average of the PTV mean dose difference in sCTs compared to those in true CTs was -0.6 ± 0.4% (-1.3%). The study provides a generalized method for sCT creation from standard T2w images of the pelvis. The method produced clinically acceptable dose calculation results for all the included scanners and MR sequences. Copyright © 2017 Elsevier B.V. All rights reserved.

  3. Vampires in the oceans: predatory cercozoan amoebae in marine habitats.

    PubMed

    Berney, Cédric; Romac, Sarah; Mahé, Frédéric; Santini, Sébastien; Siano, Raffaele; Bass, David

    2013-12-01

    Vampire amoebae (vampyrellids) are predators of algae, fungi, protozoa and small metazoans known primarily from soils and in freshwater habitats. They are among the very few heterotrophic naked, filose and reticulose protists that have received some attention from a morphological and ecological point of view over the last few decades, because of the peculiar mode of feeding of known species. Yet, the true extent of their biodiversity remains largely unknown. Here we use a complementary approach of culturing and sequence database mining to address this issue, focusing our efforts on marine environments, where vampyrellids are very poorly known. We present 10 new vampyrellid isolates, 8 from marine or brackish sediments, and 2 from soil or freshwater sediment. Two of the former correspond to the genera Thalassomyxa Grell and Penardia Cash for which sequence data were previously unavailable. Small-subunit ribosomal DNA analysis confirms they are all related to previously sequenced vampyrellids. An exhaustive screening of the NCBI GenBank database and of 454 sequence data generated by the European BioMarKs consortium revealed hundreds of distinct environmental vampyrellid sequences. We show that vampyrellids are much more diverse than previously thought, especially in marine habitats. Our new isolates, which cover almost the full phylogenetic range of vampyrellid sequences revealed in this study, offer a rare opportunity to integrate data from environmental DNA surveys with phenotypic information. However, the very large genetic diversity we highlight within vampyrellids (especially in marine sediments and soils) contrasts with the paradoxically low morphological distinctiveness we observed across our isolates.

  4. Vampires in the oceans: predatory cercozoan amoebae in marine habitats

    PubMed Central

    Berney, Cédric; Romac, Sarah; Mahé, Frédéric; Santini, Sébastien; Siano, Raffaele; Bass, David

    2013-01-01

    Vampire amoebae (vampyrellids) are predators of algae, fungi, protozoa and small metazoans known primarily from soils and in freshwater habitats. They are among the very few heterotrophic naked, filose and reticulose protists that have received some attention from a morphological and ecological point of view over the last few decades, because of the peculiar mode of feeding of known species. Yet, the true extent of their biodiversity remains largely unknown. Here we use a complementary approach of culturing and sequence database mining to address this issue, focusing our efforts on marine environments, where vampyrellids are very poorly known. We present 10 new vampyrellid isolates, 8 from marine or brackish sediments, and 2 from soil or freshwater sediment. Two of the former correspond to the genera Thalassomyxa Grell and Penardia Cash for which sequence data were previously unavailable. Small-subunit ribosomal DNA analysis confirms they are all related to previously sequenced vampyrellids. An exhaustive screening of the NCBI GenBank database and of 454 sequence data generated by the European BioMarKs consortium revealed hundreds of distinct environmental vampyrellid sequences. We show that vampyrellids are much more diverse than previously thought, especially in marine habitats. Our new isolates, which cover almost the full phylogenetic range of vampyrellid sequences revealed in this study, offer a rare opportunity to integrate data from environmental DNA surveys with phenotypic information. However, the very large genetic diversity we highlight within vampyrellids (especially in marine sediments and soils) contrasts with the paradoxically low morphological distinctiveness we observed across our isolates. PMID:23864128

  5. Identification of a Divergent Environmental DNA Sequence Clade Using the Phylogeny of Gregarine Parasites (Apicomplexa) from Crustacean Hosts

    PubMed Central

    Rueckert, Sonja; Simdyanov, Timur G.; Aleoshin, Vladimir V.; Leander, Brian S.

    2011-01-01

    Background: Environmental SSU rDNA surveys have significantly improved our understanding of microeukaryotic diversity. Many of the sequences acquired using this approach are closely related to lineages previously characterized at both morphological and molecular levels, making interpretation of these data relatively straightforward. Some sequences, by contrast, appear to be phylogenetic orphans and are sometimes inferred to represent “novel lineages” of unknown cellular identity. Consequently, interpretation of environmental DNA surveys of cellular diversity relies on an adequately comprehensive database of DNA sequences derived from identified species. Several major taxa of microeukaryotes, however, are still very poorly represented in these databases, and this is especially true for diverse groups of single-celled parasites, such as gregarine apicomplexans. Methodology/Principal Findings: This study attempts to address this paucity of DNA sequence data by characterizing four different gregarine species, isolated from the intestines of crustaceans, at both morphological and molecular levels: Thiriotia pugettiae sp. n. from the graceful kelp crab (Pugettia gracilis), Cephaloidophora cf. communis from two different species of barnacles (Balanus glandula and B. balanus), Heliospora cf. longissima from two different species of freshwater amphipods (Eulimnogammarus verrucosus and E. vittatus), and Heliospora caprellae comb. n. from a skeleton shrimp (Caprella alaskana). SSU rDNA sequences were acquired from isolates of these gregarine species and added to a global apicomplexan alignment containing all major groups of gregarines characterized so far. Molecular phylogenetic analyses of these data demonstrated that all of the gregarines collected from crustacean hosts formed a very strongly supported clade with 48 previously unidentified environmental DNA sequences. 
Conclusions/Significance: This expanded molecular phylogenetic context enabled us to establish a major clade of intestinal gregarine parasites and infer the cellular identities of several previously unidentified environmental SSU rDNA sequences, including several sequences that have formerly been discussed broadly in the literature as a suspected “novel” lineage of eukaryotes. PMID:21483868

  6. Comparison of Free-Breathing With Navigator-Triggered Technique in Diffusion Weighted Imaging for Evaluation of Small Hepatocellular Carcinoma: Effect on Image Quality and Intravoxel Incoherent Motion Parameters.

    PubMed

    Shan, Yan; Zeng, Meng-su; Liu, Kai; Miao, Xi-Yin; Lin, Jiang; Fu, Cai xia; Xu, Peng-ju

    2015-01-01

    To evaluate the effect on image quality and intravoxel incoherent motion (IVIM) parameters of small hepatocellular carcinoma (HCC) from choice of either free-breathing (FB) or navigator-triggered (NT) diffusion-weighted (DW) imaging. Thirty patients with 37 small HCCs underwent IVIM DW imaging using 12 b values (0-800 s/mm²) with 2 sequences: NT, FB. A biexponential analysis with the Bayesian method yielded true diffusion coefficient (D), pseudodiffusion coefficient (D*), and perfusion fraction (f) in small HCCs and liver parenchyma. Apparent diffusion coefficient (ADC) was also calculated. The acquisition time and image quality scores were assessed for the 2 sequences. Independent sample t test was used to compare image quality, signal intensity ratio, IVIM parameters, and ADC values between the 2 sequences; reproducibility of IVIM parameters and ADC values between the 2 sequences was assessed with the Bland-Altman method (BA-LA). Image quality with NT sequence was superior to that with FB acquisition (P = 0.02). The mean acquisition time for FB scheme was shorter than that of NT sequence (6 minutes 14 seconds vs 10 minutes 21 seconds ± 10 seconds; P < 0.01). The signal intensity ratio of small HCCs did not vary significantly between the 2 sequences. The ADC and IVIM parameters from the 2 sequences show no significant difference. Reproducibility of D* and f parameters in small HCC was poor (BA-LA: 95% confidence interval, -180.8% to 189.2% for D* and -133.8% to 174.9% for f). A moderate reproducibility of D and ADC parameters was observed (BA-LA: 95% confidence interval, -83.5% to 76.8% for D and -74.4% to 88.2% for ADC) between the 2 sequences. The NT DW imaging technique offers no advantage in IVIM parameters measurements of small HCC except better image quality, whereas FB technique offers greater confidence in fitted diffusion parameters for matched acquisition periods.
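
    The Bland-Altman intervals quoted in this abstract can be illustrated with a minimal sketch. It assumes paired measurements of one parameter from the two sequences and expresses each difference as a percentage of the pair mean, which is one common convention and may not match the study's exact procedure. The function name and any sample values are hypothetical and are not taken from the study.

```python
import statistics


def bland_altman_percent(first, second):
    """Return (bias, lower limit, upper limit) in percent of the pair mean.

    Assumption: differences are normalised by the mean of each pair, and the
    limits of agreement are bias +/- 1.96 sample standard deviations.
    """
    diffs = [
        100.0 * (a - b) / ((a + b) / 2.0)
        for a, b in zip(first, second)
    ]
    bias = statistics.mean(diffs)
    spread = statistics.stdev(diffs)  # sample SD, needs at least two pairs
    return bias, bias - 1.96 * spread, bias + 1.96 * spread
```

    Wide limits, such as those reported for D* and f, indicate that the two acquisitions cannot be used interchangeably for that parameter.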

  7. Characterization of a marsupial sperm protamine gene and its transcripts from the North American opossum (Didelphis marsupialis).

    PubMed

    Winkfein, R J; Nishikawa, S; Connor, W; Dixon, G H

    1993-07-01

    A synthetic oligonucleotide primer, designed from marsupial protamine protein-sequence data [Balhorn, R., Corzett, M., Matrimas, J. A., Cummins, J. & Faden, B. (1989) Analysis of protamines isolated from two marsupials, the ring-tailed wallaby and gray short-tailed opossum, J. Cell. Biol. 107] was used to amplify, via the polymerase chain reaction, protamine sequences from a North American opossum (Didelphis marsupialis) cDNA. Using the amplified sequences as probes, several protamine cDNA clones were isolated. The protein sequence, predicted from the cDNA sequences, consisted of 57 amino acids, contained a large number of arginine residues and exhibited the sequence ARYR at its amino terminus, which is conserved in avian and most eutherian mammal protamines. Like the true protamines of trout and chicken, the opossum protamine lacked cysteine residues, distinguishing it from placental mammalian protamine 1 (P1 or stable) protamines. Examination of the protamine gene, isolated by polymerase-chain-reaction amplification of genomic DNA, revealed the presence of an intron dividing the protamine-coding region, a common characteristic of all mammalian P1 genes. In addition, extensive sequence identity in the 5' and 3' flanking regions between mouse and opossum sequences classify the marsupial protamine as being closely related to placental mammal P1. Protamine transcripts, in both birds and mammals, are present in two size classes, differing by the length of their poly(A) tails (either short or long). Examination of opossum protamine transcripts by Northern hybridization revealed four distinct mRNA species in the total RNA fraction, two of which were enriched in the poly(A)-rich fraction. Northern-blot analysis, using an intron-specific probe, revealed the presence of intron sequences in two of the four protamine transcripts. 
If expressed, the corresponding protein from intron-containing transcripts would differ from spliced transcripts by length (49 versus 57 amino acids) and would contain a cysteine residue.

  8. Use of Lambda Phage DNA as a Hybrid Internal Control in a PCR-Enzyme Immunoassay To Detect Chlamydia pneumoniae

    PubMed Central

    Pham, Dien G.; Madico, Guillermo E.; Quinn, Thomas C.; Enzler, Mark J.; Smith, Thomas F.; Gaydos, Charlotte A.

    1998-01-01

    An inherent problem in the diagnostic PCR assay is the presence of ill-defined inhibitors of amplification which may cause false-negative results. Addition of an amplifiable fragment of foreign DNA in the PCR to serve as a hybrid internal control (HIC) would allow for a simple way to identify specimens containing inhibitors. Two oligonucleotide hybrid primers were synthesized to contain nucleic acid sequences of the Chlamydia pneumoniae 16S rRNA primers in a position flanking two primers that target the sequences of a 650-bp lambda phage DNA segment. By using the hybrid primers, hybrid DNA comprising a large sequence of lambda phage DNA flanked by short pieces of chlamydia DNA was subsequently generated by PCR, cloned into a plasmid vector, and purified. Plasmids containing the hybrid DNA were diluted and used as a HIC by adding them to each C. pneumoniae PCR test. Consequently, C. pneumoniae primers were able to amplify both chlamydia DNA and the HIC DNA. The production of a 689-bp HIC DNA band on an acrylamide gel indicated that the specimen contained no inhibitors and that internal conditions were compatible with PCR. Subsequently, a biotinylated RNA probe for the HIC was transcribed from a nested sequence of the HIC and was used for its hybridization. Detection of the HIC DNA-RNA hybrid was achieved by enzyme immunoassay (EIA). This PCR-EIA system with a HIC was initially tested with 12 previously PCR-positive and 14 previously PCR-negative specimens. Of the 12 PCR-positive specimens, 11 were reconfirmed as positive; 1 had a negative HIC value, indicating inhibition. Of the 14 previously PCR-negative specimens, 13 were confirmed as true negative; 1 had a negative HIC value, indicating inhibition. The assay was then used with 237 nasopharyngeal specimens from patients with pneumonia. Twenty-one of 237 (8.9%) were positive for C. pneumoniae, and 42 (17.7%) were found to inhibit the PCR. 
Specimens showing inhibitory activity were diluted 1:10 and were retested. Ten specimens were still inhibitory to the PCR and required further DNA purification. No additional positive samples were detected and 3 nasopharyngeal specimens remained inhibitory to PCR. Coamplification of a HIC DNA can help confirm true-negative PCR results by ruling out the presence of inhibitors of DNA amplification. PMID:9650936

  9. EXors and the stellar birthline

    NASA Astrophysics Data System (ADS)

    Moody, Mackenzie S. L.; Stahler, Steven W.

    2017-04-01

    We assess the evolutionary status of EXors. These low-mass, pre-main-sequence stars repeatedly undergo sharp luminosity increases, each a year or so in duration. We place into the HR diagram all EXors that have documented quiescent luminosities and effective temperatures, and thus determine their masses and ages. Two alternate sets of pre-main-sequence tracks are used, and yield similar results. Roughly half of EXors are embedded objects, i.e., they appear observationally as Class I or flat-spectrum infrared sources. We find that these are relatively young and are located close to the stellar birthline in the HR diagram. Optically visible EXors, on the other hand, are situated well below the birthline. They have ages of several Myr, typical of classical T Tauri stars. Judging from the limited data at hand, we find no evidence that binary companions trigger EXor eruptions; this issue merits further investigation. We draw several general conclusions. First, repetitive luminosity outbursts do not occur in all pre-main-sequence stars, and are not in themselves a sign of extreme youth. They persist, along with other signs of activity, in a relatively small subset of these objects. Second, the very existence of embedded EXors demonstrates that at least some Class I infrared sources are not true protostars, but very young pre-main-sequence objects still enshrouded in dusty gas. Finally, we believe that the embedded pre-main-sequence phase is of observational and theoretical significance, and should be included in a more complete account of early stellar evolution.

  10. nuID: a universal naming scheme of oligonucleotides for Illumina, Affymetrix, and other microarrays

    PubMed Central

    Du, Pan; Kibbe, Warren A; Lin, Simon M

    2007-01-01

    Background Oligonucleotide probes that are sequence identical may have different identifiers between manufacturers and even between different versions of the same company's microarray; and sometimes the same identifier is reused and represents a completely different oligonucleotide, resulting in ambiguity and potentially mis-identification of the genes hybridizing to that probe. Results We have devised a unique, non-degenerate encoding scheme that can be used as a universal representation to identify an oligonucleotide across manufacturers. We have named the encoded representation 'nuID', for nucleotide universal identifier. Inspired by the fact that the raw sequence of the oligonucleotide is the true definition of identity for a probe, the encoding algorithm uniquely and non-degenerately transforms the sequence itself into a compact identifier (a lossless compression). In addition, we added a redundancy check (checksum) to validate the integrity of the identifier. These two steps, encoding plus checksum, result in an nuID, which is a unique, non-degenerate, permanent, robust and efficient representation of the probe sequence. For commercial applications that require the sequence identity to be confidential, we have an encryption schema for nuID. We demonstrate the utility of nuIDs for the annotation of Illumina microarrays, and we believe it has universal applicability as a source-independent naming convention for oligomers. Reviewers This article was reviewed by Itai Yanai, Rong Chen (nominated by Mark Gerstein), and Gregory Schuler (nominated by David Lipman). PMID:17540033
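
    The encoding-plus-checksum idea described in this abstract can be sketched as follows. This is an illustration under stated assumptions, not the published nuID algorithm: the 64-character alphabet, the padding rule, the modulus 21, and the function names are all hypothetical choices made so that the example is lossless and self-checking.

```python
# Hypothetical 64-symbol alphabet: each symbol carries 6 bits (three nucleotides).
ALPHABET = "ABCDEFGHIJKLMNOPQRSTUVWXYZabcdefghijklmnopqrstuvwxyz0123456789_."
NUCLEOTIDES = "ACGT"  # 2 bits per base: A=0, C=1, G=2, T=3


def encode(sequence):
    """Pack a DNA sequence into a compact identifier with a checksum prefix."""
    sequence = sequence.upper()
    pad = (-len(sequence)) % 3            # bases needed to reach a multiple of 3
    padded = sequence + "A" * pad
    values = []
    for i in range(0, len(padded), 3):
        a, b, c = (NUCLEOTIDES.index(ch) for ch in padded[i:i + 3])
        values.append(a * 16 + b * 4 + c)
    checksum = sum(values) % 21           # 21 * 3 = 63, so the prefix fits in 6 bits
    prefix = ALPHABET[checksum * 3 + pad]
    return prefix + "".join(ALPHABET[v] for v in values)


def decode(identifier):
    """Recover the sequence; raise ValueError if the checksum does not match."""
    checksum, pad = divmod(ALPHABET.index(identifier[0]), 3)
    values = [ALPHABET.index(ch) for ch in identifier[1:]]
    if sum(values) % 21 != checksum:
        raise ValueError("checksum mismatch: identifier is corrupted")
    bases = "".join(
        NUCLEOTIDES[v >> 4] + NUCLEOTIDES[(v >> 2) & 3] + NUCLEOTIDES[v & 3]
        for v in values
    )
    return bases[:len(bases) - pad] if pad else bases
```

    Because the identifier is computed from the sequence alone, two probes with the same sequence receive the same identifier regardless of manufacturer, which is the property the abstract emphasises.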

  11. Association analysis using next-generation sequence data from publicly available control groups: the robust variance score statistic.

    PubMed

    Derkach, Andriy; Chiang, Theodore; Gong, Jiafen; Addis, Laura; Dobbins, Sara; Tomlinson, Ian; Houlston, Richard; Pal, Deb K; Strug, Lisa J

    2014-08-01

    Sufficiently powered case-control studies with next-generation sequence (NGS) data remain prohibitively expensive for many investigators. If feasible, a more efficient strategy would be to include publicly available sequenced controls. However, these studies can be confounded by differences in sequencing platform; alignment, single nucleotide polymorphism and variant calling algorithms; read depth; and selection thresholds. Assuming one can match cases and controls on the basis of ethnicity and other potential confounding factors, and one has access to the aligned reads in both groups, we investigate the effect of systematic differences in read depth and selection threshold when comparing allele frequencies between cases and controls. We propose a novel likelihood-based method, the robust variance score (RVS), that substitutes genotype calls by their expected values given observed sequence data. We show theoretically that the RVS eliminates read depth bias in the estimation of minor allele frequency. We also demonstrate that, using simulated and real NGS data, the RVS method controls Type I error and has comparable power to the 'gold standard' analysis with the true underlying genotypes for both common and rare variants. An RVS R script and instructions can be found at strug.research.sickkids.ca, and at https://github.com/strug-lab/RVS. lisa.strug@utoronto.ca Supplementary data are available at Bioinformatics online. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
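
    The substitution at the heart of this abstract, replacing a hard genotype call with its expected value given the observed reads, can be sketched as follows. The sketch assumes that genotype likelihoods P(reads | G) for G = 0, 1, 2 copies of the minor allele are already available, and it uses a Hardy-Weinberg prior built from an assumed minor allele frequency. Both choices and the function name are illustrative assumptions; this is not the published RVS implementation, which is the R script the abstract points to.

```python
def expected_genotype(likelihoods, minor_allele_freq):
    """Return E[G | reads] for genotypes G = 0, 1, 2.

    likelihoods: P(reads | G) for G = 0, 1, 2 (any common scale).
    minor_allele_freq: assumed population frequency used for the prior.
    """
    p = minor_allele_freq
    # Hardy-Weinberg prior over 0, 1 and 2 copies of the minor allele (assumption).
    priors = [(1.0 - p) ** 2, 2.0 * p * (1.0 - p), p ** 2]
    weights = [lik * prior for lik, prior in zip(likelihoods, priors)]
    total = sum(weights)
    # Posterior mean: sum over g of g * P(G = g | reads).
    return sum(g * w for g, w in enumerate(weights)) / total
```

    With uninformative reads (equal likelihoods) the expected value falls back to the prior mean of twice the minor allele frequency, whereas confident reads pull it towards 0, 1, or 2.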

  12. Customisation of the exome data analysis pipeline using a combinatorial approach.

    PubMed

    Pattnaik, Swetansu; Vaidyanathan, Srividya; Pooja, Durgad G; Deepak, Sa; Panda, Binay

    2012-01-01

    The advent of next generation sequencing (NGS) technologies has revolutionised the way biologists produce, analyse and interpret data. Although NGS platforms provide a cost-effective way to discover genome-wide variants from a single experiment, variants discovered by NGS need follow up validation due to the high error rates associated with various sequencing chemistries. Recently, whole exome sequencing has been proposed as an affordable option compared to whole genome runs but it still requires follow up validation of all the novel exomic variants. Customarily, a consensus approach is used to overcome the systematic errors inherent to the sequencing technology, alignment and post alignment variant detection algorithms. However, the aforementioned approach warrants the use of multiple sequencing chemistries, multiple alignment tools, and multiple variant callers, which may not be viable in terms of time and money for individual investigators with limited informatics know-how. Biologists often lack the requisite training to deal with the huge amount of data produced by NGS runs and face difficulty in choosing from the list of freely available analytical tools for NGS data analysis. Hence, there is a need to customise the NGS data analysis pipeline to preferentially retain true variants by minimising the incidence of false positives and make the choice of the right analytical tools easier. To this end, we have sampled different freely available tools used at the alignment and post alignment stage suggesting the use of the most suitable combination determined by a simple framework of pre-existing metrics to create significant datasets.

  13. Annual Report Nuclear Energy Research and Development Program Nuclear Energy Research Initiative

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Hively, LM

    2003-02-13

    NERI Project No.2000-0109 began in August 2000 and has three tasks. The first project year addressed Task 1, namely development of nonlinear prognostication for critical equipment in nuclear power facilities. That work is described in the first year's annual report (ORNLTM-2001/195). The current (second) project year (FY02) addresses Task 2, while the third project year will address Tasks 2-3. This report describes the work for the second project year, spanning August 2001 through August 2002, including status of the tasks, issues and concerns, cost performance, and status summary of tasks. The objective of the second project year's work is a compelling demonstration of the nonlinear prognostication algorithm using much more data. The guidance from Dr. Madeline Feltus (DOE/NE-20) is that it would be preferable to show forewarning of failure for different kinds of nuclear-grade equipment, as opposed to many different failure modes from one piece of equipment. Long-term monitoring of operational utility equipment is possible in principle, but is not practically feasible for the following reason. Time and funding constraints for this project do not allow us to monitor the many machines (thousands) that will be necessary to obtain even a few failure sequences, due to low failure rates (<10⁻³/year) in the operational environment. Moreover, the ONLY way to guarantee a controlled failure sequence is to seed progressively larger faults in the equipment or to overload the equipment for accelerated tests. Both of these approaches are infeasible for operational utility machinery, but are straightforward in a test environment. Our subcontractor has provided such test sequences. Thus, we have revised Tasks 2.1-2.4 to analyze archival test data from such tests. The second phase of our work involves validation of the nonlinear prognostication over the second and third years of the proposed work. 
Recognizing the inherent limitations outlined in the previous paragraph, Dr. Feltus urged Oak Ridge National Laboratory (ORNL) to contact other researchers for additional data from other test equipment. Consequently, we have revised the work plan for Tasks 2.1-2.2, with corresponding changes to the work plan as shown in the Status Summary of NERI Tasks. The revised tasks are as follows: Task 2.1--ORNL will obtain test data from a subcontractor and other researchers for various test equipment. This task includes development of a test plan or a description of the historical testing, as appropriate: test facility, equipment to be tested, choice of failure mode(s), testing protocol, data acquisition equipment, and resulting data from the test sequence. ORNL will analyze this data for quality, and subsequently via the nonlinear paradigm for prognostication. Task 2.2--ORNL will evaluate the prognostication capability of the nonlinear paradigm. The comparison metrics for reliability of the predictions will include the true positives, true negatives, and the forewarning times. Task 2.3--ORNL will improve the nonlinear paradigm as appropriate, in accord with the results of Tasks 2.1-2.2, to maximize the rate of true positive and true negative indications of failure. Maximal forewarning time is also highly desirable. Task 2.4--ORNL will develop advanced algorithms for the phase-space distribution function (PS-DF) pattern change recognition, based on the results of Task 2.3. This implementation will provide a capability for automated prognostication, as part of the maintenance decision-making. Appendix A provides a detailed description of the analysis methods, which include conventional statistics, traditional nonlinear measures, and ORNL's patented nonlinear PSDM. The body of this report focuses on results of this analysis.

  14. Comparison of Sanger and next generation sequencing performance for genotyping Cryptosporidium isolates at the 18S rRNA and actin loci.

    PubMed

    Paparini, Andrea; Gofton, Alexander; Yang, Rongchang; White, Nicole; Bunce, Michael; Ryan, Una M

    2015-01-01

    Cryptosporidium is an important enteric pathogen that infects a wide range of humans and animals. Rapid and reliable detection and characterisation methods are essential for understanding the transmission dynamics of the parasite. Sanger sequencing, and high-throughput sequencing (HTS) on an Ion Torrent platform, were compared with each other for their sensitivity and accuracy in detecting and characterising 25 Cryptosporidium-positive human and animal faecal samples. Ion Torrent reads (n = 123,857) were obtained at both 18S rRNA and actin loci for 21 of the 25 samples. Of these, one isolate at the actin locus (Cattle 05) and three at the 18S rRNA locus (HTS 10, HTS 11 and HTS 12), suffered PCR drop-out (i.e. PCR failures) when using fusion-tagged PCR. Sanger sequences were obtained for both loci for 23 of the 25 samples and showed good agreement with Ion Torrent-based genotyping. Two samples both from pythons (SK 02 and SK 05) produced mixed 18S and actin chromatograms by Sanger sequencing but were clearly identified by Ion Torrent sequencing as C. muris. One isolate (SK 03) was typed as C. muris by Sanger sequencing but was identified as a mixed C. muris and C. tyzzeri infection by HTS. 18S rRNA Type B sequences were identified in 4/6 C. parvum isolates when deep sequenced but were undetected in Sanger sequencing. Sanger was cheaper than Ion Torrent when sequencing a small number of samples, but when larger numbers of samples are considered (n = 60), the costs were comparable. Fusion-tagged amplicon based approaches are a powerful way of approaching mixtures, the only draw-back being the loss of PCR efficiency on low-template samples when using primers coupled to MID tags and adaptors. Taken together these data show that HTS has excellent potential for revealing the "true" composition of species/types in a Cryptosporidium infection, but that HTS workflows need to be carefully developed to ensure sensitivity, accuracy and contamination are controlled. 
Copyright © 2015 Elsevier Inc. All rights reserved.

  15. Spatio-Temporal History of HIV-1 CRF35_AD in Afghanistan and Iran.

    PubMed

    Eybpoosh, Sana; Bahrampour, Abbas; Karamouzian, Mohammad; Azadmanesh, Kayhan; Jahanbakhsh, Fatemeh; Mostafavi, Ehsan; Zolala, Farzaneh; Haghdoost, Ali Akbar

    2016-01-01

    HIV-1 Circulating Recombinant Form 35_AD (CRF35_AD) has an important position in the epidemiological profile of Afghanistan and Iran. Despite the presence of this clade in Afghanistan and Iran for over a decade, our understanding of its origin and dissemination patterns is limited. In this study, we performed a Bayesian phylogeographic analysis to reconstruct the spatio-temporal dispersion pattern of this clade using eligible CRF35_AD gag and pol sequences available in the Los Alamos HIV database (432 sequences available from Iran, 16 sequences available from Afghanistan, and a single CRF35_AD-like pol sequence available from USA). Bayesian Markov Chain Monte Carlo algorithm was implemented in BEAST v1.8.1. Between-country dispersion rates were tested with Bayesian stochastic search variable selection method and were considered significant where Bayes factor values were greater than three. The findings suggested that CRF35_AD sequences were genetically similar to parental sequences from Kenya and Uganda, and to a set of subtype A1 sequences available from Afghan refugees living in Pakistan. Our results also showed that across all phylogenies, Afghan and Iranian CRF35_AD sequences formed a monophyletic cluster (posterior clade credibility > 0.7). The divergence date of this cluster was estimated to be between 1990 and 1992. Within this cluster, a bidirectional dispersion of the virus was observed across Afghanistan and Iran. We could not clearly identify whether Afghanistan or Iran first established or received this epidemic, as the root location of this cluster could not be robustly estimated. Three CRF35_AD sequences from Afghan refugees living in Pakistan nested among Afghan and Iranian CRF35_AD branches. However, the CRF35_AD-like sequence available from USA diverged independently from Kenyan subtype A1 sequences, suggesting it not to be a true CRF35_AD lineage. 
Potential factors contributing to viral exchange between Afghanistan and Iran could be injection drug networks and mass migration of Afghan refugees and labourers to Iran, which calls for extensive preventive efforts.

  16. Spatio-Temporal History of HIV-1 CRF35_AD in Afghanistan and Iran

    PubMed Central

    Eybpoosh, Sana; Bahrampour, Abbas; Karamouzian, Mohammad; Azadmanesh, Kayhan; Jahanbakhsh, Fatemeh; Mostafavi, Ehsan; Zolala, Farzaneh; Haghdoost, Ali Akbar

    2016-01-01

    HIV-1 Circulating Recombinant Form 35_AD (CRF35_AD) has an important position in the epidemiological profile of Afghanistan and Iran. Despite the presence of this clade in Afghanistan and Iran for over a decade, our understanding of its origin and dissemination patterns is limited. In this study, we performed a Bayesian phylogeographic analysis to reconstruct the spatio-temporal dispersion pattern of this clade using eligible CRF35_AD gag and pol sequences available in the Los Alamos HIV database (432 sequences available from Iran, 16 sequences available from Afghanistan, and a single CRF35_AD-like pol sequence available from USA). Bayesian Markov Chain Monte Carlo algorithm was implemented in BEAST v1.8.1. Between-country dispersion rates were tested with Bayesian stochastic search variable selection method and were considered significant where Bayes factor values were greater than three. The findings suggested that CRF35_AD sequences were genetically similar to parental sequences from Kenya and Uganda, and to a set of subtype A1 sequences available from Afghan refugees living in Pakistan. Our results also showed that across all phylogenies, Afghan and Iranian CRF35_AD sequences formed a monophyletic cluster (posterior clade credibility > 0.7). The divergence date of this cluster was estimated to be between 1990 and 1992. Within this cluster, a bidirectional dispersion of the virus was observed across Afghanistan and Iran. We could not clearly identify whether Afghanistan or Iran first established or received this epidemic, as the root location of this cluster could not be robustly estimated. Three CRF35_AD sequences from Afghan refugees living in Pakistan nested among Afghan and Iranian CRF35_AD branches. However, the CRF35_AD-like sequence available from USA diverged independently from Kenyan subtype A1 sequences, suggesting it not to be a true CRF35_AD lineage. 
Potential factors contributing to viral exchange between Afghanistan and Iran could be injection drug networks and mass migration of Afghan refugees and labourers to Iran, which calls for extensive preventive efforts. PMID:27280293

  17. Diffusion-weighted Imaging of the Liver with Multiple b Values: Effect of Diffusion Gradient Polarity and Breathing Acquisition on Image Quality and Intravoxel Incoherent Motion Parameters—A Pilot Study

    PubMed Central

    Dyvorne, Hadrien A.; Galea, Nicola; Nevers, Thomas; Fiel, M. Isabel; Carpenter, David; Wong, Edmund; Orton, Matthew; de Oliveira, Andre; Feiweier, Thorsten; Vachon, Marie-Louise; Babb, James S.

    2013-01-01

    Purpose: To optimize intravoxel incoherent motion (IVIM) diffusion-weighted (DW) imaging by estimating the effects of diffusion gradient polarity and breathing acquisition scheme on image quality, signal-to-noise ratio (SNR), IVIM parameters, and parameter reproducibility, as well as to investigate the potential of IVIM in the detection of hepatic fibrosis. Materials and Methods: In this institutional review board–approved prospective study, 20 subjects (seven healthy volunteers, 13 patients with hepatitis C virus infection; 14 men, six women; mean age, 46 years) underwent IVIM DW imaging with four sequences: (a) respiratory-triggered (RT) bipolar (BP) sequence, (b) RT monopolar (MP) sequence, (c) free-breathing (FB) BP sequence, and (d) FB MP sequence. Image quality scores were assessed for all sequences. A biexponential analysis with the Bayesian method yielded true diffusion coefficient (D), pseudodiffusion coefficient (D*), and perfusion fraction (PF) in liver parenchyma. Mixed-model analysis of variance was used to compare image quality, SNR, IVIM parameters, and interexamination variability between the four sequences, as well as the ability to differentiate areas of liver fibrosis from normal liver tissue. Results: Image quality with RT sequences was superior to that with FB acquisitions (P = .02) and was not affected by gradient polarity. SNR did not vary significantly between sequences. IVIM parameter reproducibility was moderate to excellent for PF and D, while it was less reproducible for D*. PF and D were both significantly lower in patients with hepatitis C virus than in healthy volunteers with the RT BP sequence (PF = 13.5% ± 5.3 [standard deviation] vs 9.2% ± 2.5, P = .038; D = [1.16 ± 0.07] × 10⁻³ mm²/sec vs [1.03 ± 0.1] × 10⁻³ mm²/sec, P = .006). Conclusion: The RT BP DW imaging sequence had the best results in terms of image quality, reproducibility, and ability to discriminate between healthy and fibrotic liver with biexponential fitting. 
© RSNA, 2012 PMID:23220895
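
    The biexponential analysis mentioned in the abstract is commonly written as S(b)/S0 = PF·exp(−b·D*) + (1 − PF)·exp(−b·D). The sketch below only evaluates that standard signal model; it is not the authors' Bayesian fitting procedure. The PF and D values are the first-listed figures from the abstract, while the D* value and the b-values are hypothetical placeholders, since the abstract does not report them.

```python
import math

def ivim_signal(b, s0, pf, d_star, d):
    """Standard biexponential IVIM signal at b-value b (s/mm^2).

    pf     : perfusion fraction (0..1)
    d_star : pseudodiffusion coefficient (mm^2/s)
    d      : true diffusion coefficient (mm^2/s)
    """
    perfusion = pf * math.exp(-b * d_star)
    diffusion = (1.0 - pf) * math.exp(-b * d)
    return s0 * (perfusion + diffusion)

# PF and D taken from the abstract (first-listed group);
# D* and the b-values are assumed for illustration only.
PF, D, D_STAR = 0.135, 1.16e-3, 50e-3
for b in (0, 50, 200, 800):
    print(b, round(ivim_signal(b, 1.0, PF, D_STAR, D), 4))
```

    At low b-values the fast pseudodiffusion term dominates the decay, which is one reason D* tends to be harder to estimate reproducibly than D or PF.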

  18. Mars - A planet with a complex surface evolution

    NASA Technical Reports Server (NTRS)

    Arvidson, R. E.; Coradini, M.

    1975-01-01

    The surface of Mars has evolved to its present form through a complex sequence of tectonism and associated volcanism, impact processes, water erosion, mass movements, and wind action. The diversity of geological processes active in past Martian history far exceeded most predictions. By the same token, predictions of processes modifying the satellites of the outer planets may fall far short of the true range of phenomena. A summary of present thought with regard to Martian surface evolution is presented to serve as a case in point of the value of imagery and topography data in making interpretations of geological histories.

  19. The whole earth telescope - A new astronomical instrument

    NASA Technical Reports Server (NTRS)

    Nather, R. E.; Winget, D. E.; Clemens, J. C.; Hansen, C. J.; Hine, B. P.

    1990-01-01

    A new multimirror ground-based telescope for time-series photometry of rapid variable stars, designed to minimize or eliminate gaps in the brightness record caused by the rotation of the earth, is described. A sequence of existing telescopes distributed in longitude, coordinated from a single control center, is used to measure designated target stars so long as they are in darkness. Data are returned by electronic mail to the control center, where they are analyzed in real time. This instrument is the first to provide data of continuity and quality that permit true high-resolution power spectroscopy of pulsating white dwarf stars.

  20. Pseudo progression identification of glioblastoma with dictionary learning.

    PubMed

    Zhang, Jian; Yu, Hengyong; Qian, Xiaohua; Liu, Keqin; Tan, Hua; Yang, Tielin; Wang, Maode; Li, King Chuen; Chan, Michael D; Debinski, Waldemar; Paulsson, Anna; Wang, Ge; Zhou, Xiaobo

    2016-06-01

    Although the use of temozolomide in chemoradiotherapy is effective, the challenging clinical problem of pseudo progression has been raised in brain tumor treatment. This study aims to distinguish pseudo progression from true progression. Between 2000 and 2012, a total of 161 patients with glioblastoma multiforme (GBM) were treated with chemoradiotherapy at our hospital. Among the patients, 79 had their diffusion tensor imaging (DTI) data acquired at the earliest diagnosed date of pseudo progression or true progression, and 23 had both DTI data and genomic data. Clinical records of all patients were kept in good condition. Volumetric fractional anisotropy (FA) images obtained from the DTI data were decomposed into a sequence of sparse representations. Then, a feature selection algorithm was applied to extract the critical features from the feature matrix to reduce the size of the feature matrix and to improve the classification accuracy. The proposed approach was validated using the 79 samples with clinical DTI data. Satisfactory results were obtained under different experimental conditions. The area under the receiver operating characteristic (ROC) curve (AUC) was 0.87 for a given dictionary with 1024 atoms. For the subgroup of 23 samples, genomics data analysis was also performed. Results implied further perspective on pseudo progression classification. The proposed method can determine pseudo progression and true progression with improved accuracy. Laboring segmentation is no longer necessary because this skillfully designed method is not sensitive to tumor location. Copyright © 2016 Elsevier Ltd. All rights reserved.

  1. Modified-hybrid optical neural network filter for multiple object recognition within cluttered scenes

    NASA Astrophysics Data System (ADS)

    Kypraios, Ioannis; Young, Rupert C. D.; Chatwin, Chris R.

    2009-08-01

    Motivated by the non-linear interpolation and generalization abilities of the hybrid optical neural network filter between the reference and non-reference images of the true-class object, we designed the modified-hybrid optical neural network filter. We applied an optical mask to the hybrid optical neural network's filter input. The mask was built with the constant weight connections of a randomly chosen image included in the training set. The resulting design of the modified-hybrid optical neural network filter is optimized for performing best in cluttered scenes of the true-class object. Due to the shift invariance properties inherited by its correlator unit, the filter can accommodate multiple objects of the same class to be detected within an input cluttered image. Additionally, the architecture of the neural network unit of the general hybrid optical neural network filter allows the recognition of multiple objects of different classes within the input cluttered image by modifying the output layer of the unit. We test the modified-hybrid optical neural network filter for multiple objects of the same and of different classes' recognition within cluttered input images and video sequences of cluttered scenes. The filter is shown to exhibit, with a single pass over the input data, simultaneous out-of-plane rotation invariance, shift invariance and good clutter tolerance. It is able to successfully detect and correctly classify the true-class objects within background clutter for which there has been no previous training.

  2. In-depth comparison of somatic point mutation callers based on different tumor next-generation sequencing depth data

    NASA Astrophysics Data System (ADS)

    Cai, Lei; Yuan, Wei; Zhang, Zhou; He, Lin; Chou, Kuo-Chen

    2016-11-01

    Four popular somatic single nucleotide variant (SNV) calling methods (Varscan, SomaticSniper, Strelka and MuTect2) were carefully evaluated on the real whole exome sequencing (WES, depth of ~50X) and ultra-deep targeted sequencing (UDT-Seq, depth of ~370X) data. The four tools returned poor consensus on candidates (only 20% of calls were with multiple hits by the callers). For both WES and UDT-Seq, MuTect2 and Strelka obtained the largest proportion of COSMIC entries as well as the lowest rate of dbSNP presence and high-alternative-alleles-in-control calls, demonstrating their superior sensitivity and accuracy. Combining different callers does increase reliability of candidates, but narrows the list down to very limited range of tumor read depth and variant allele frequency. Calling SNV on UDT-Seq data, which were of much higher read-depth, discovered additional true-positive variations, despite an even more tremendous growth in false positive predictions. Our findings not only provide valuable benchmark for state-of-the-art SNV calling methods, but also shed light on the access to more accurate SNV identification in the future.

  3. First case of Plasmodium knowlesi infection in a Japanese traveller returning from Malaysia.

    PubMed

    Tanizaki, Ryutaro; Ujiie, Mugen; Kato, Yasuyuki; Iwagami, Moritoshi; Hashimoto, Aki; Kutsuna, Satoshi; Takeshita, Nozomi; Hayakawa, Kyoko; Kanagawa, Shuzo; Kano, Shigeyuki; Ohmagari, Norio

    2013-04-15

    This is the first case of Plasmodium knowlesi infection in a Japanese traveller returning from Malaysia. In September 2012, a previously healthy 35-year-old Japanese man presented to National Center for Global Health and Medicine in Tokyo with a two-day history of daily fever, mild headaches and mild arthralgia. Malaria parasites were found in the Giemsa-stained thin blood smear, which showed band forms similar to Plasmodium malariae. Although a nested PCR showed the amplification of the primer of Plasmodium vivax and Plasmodium knowlesi, he was finally diagnosed with P. knowlesi mono-infection by DNA sequencing. He was treated with mefloquine, and recovered without any complications. DNA sequencing of the PCR products is indispensable to confirm P. knowlesi infection, however there is limited access to DNA sequencing procedures in endemic areas. The extent of P. knowlesi transmission in Asia has not been clearly defined. There is limited availability of diagnostic tests and routine surveillance system for reporting an accurate diagnosis in the Asian endemic regions. Thus, reporting accurately diagnosed cases of P. knowlesi infection in travellers would be important for assessing the true nature of this emerging human infection.

  4. Layered data association using graph-theoretic formulation with applications to tennis ball tracking in monocular sequences.

    PubMed

    Yan, Fei; Christmas, William; Kittler, Josef

    2008-10-01

    In this paper, we propose a multilayered data association scheme with graph-theoretic formulation for tracking multiple objects that undergo switching dynamics in clutter. The proposed scheme takes as input object candidates detected in each frame. At the object candidate level, "tracklets'' are "grown'' from sets of candidates that have high probabilities of containing only true positives. At the tracklet level, a directed and weighted graph is constructed, where each node is a tracklet, and the edge weight between two nodes is defined according to the "compatibility'' of the two tracklets. The association problem is then formulated as an all-pairs shortest path (APSP) problem in this graph. Finally, at the path level, by analyzing the APSPs, all object trajectories are identified, and track initiation and track termination are automatically dealt with. By exploiting a special topological property of the graph, we have also developed a more efficient APSP algorithm than the general-purpose ones. The proposed data association scheme is applied to tennis sequences to track tennis balls. Experiments show that it works well on sequences where other data association methods perform poorly or fail completely.
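
    To make the all-pairs shortest path (APSP) formulation concrete, here is a minimal sketch using the general-purpose Floyd-Warshall algorithm on a small directed, weighted graph. It is not the more efficient algorithm the authors describe; the node indices stand in for tracklets and the edge weights for hypothetical "compatibility" costs, all invented for illustration.

```python
import math

def all_pairs_shortest_paths(n, edges):
    """Floyd-Warshall on a directed weighted graph with n nodes.

    edges is an iterable of (u, v, weight) tuples; returns an n x n
    matrix of shortest-path costs (math.inf where no path exists).
    """
    dist = [[math.inf] * n for _ in range(n)]
    for i in range(n):
        dist[i][i] = 0.0
    for u, v, w in edges:
        dist[u][v] = min(dist[u][v], w)
    for k in range(n):          # allow node k as an intermediate stop
        for i in range(n):
            for j in range(n):
                via_k = dist[i][k] + dist[k][j]
                if via_k < dist[i][j]:
                    dist[i][j] = via_k
    return dist

# Hypothetical tracklet graph: four tracklets, edges point forward in time.
edges = [(0, 1, 1.0), (1, 2, 0.5), (0, 2, 2.0), (2, 3, 1.0)]
dist = all_pairs_shortest_paths(4, edges)
print(dist[0][3])  # cost of the cheapest chain of tracklets from 0 to 3
```

    Floyd-Warshall costs O(n^3) time regardless of graph structure, which is why an algorithm that exploits a special topological property of the graph, as the abstract mentions, can do better.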

  5. Phylogenetic analysis of feline immunodeficiency virus strains from naturally infected cats in Belgium and The Netherlands.

    PubMed

    Roukaerts, Inge D M; Theuns, Sebastiaan; Taffin, Elien R L; Daminet, Sylvie; Nauwynck, Hans J

    2015-01-22

    Feline immunodeficiency virus (FIV) is a major pathogen in feline populations worldwide, with seroprevalences up to 26%. Virus strains circulating in domestic cats are subdivided into different phylogenetic clades (A-E), based on the genetic diversity of the V3-V4 region of the env gene. In this report, a phylogenetic analysis of the V3-V4 env region, and a variable region in the gag gene was made for 36 FIV strains isolated in Belgium and The Netherlands. All newly generated gag sequences clustered together with previously known clade A FIV viruses, confirming the dominance of clade A viruses in Northern Europe. The same was true for the obtained env sequences, with only one sample of an unknown env subtype. Overall, the genetic diversity of FIV strains sequenced in this report was low. This indicates a relatively recent introduction of FIV in Belgium and The Netherlands. However, the sample with an unknown env subtype indicates that new introductions of FIV from unknown origin do occur and this will likely increase genetic variability in time. Copyright © 2014 Elsevier B.V. All rights reserved.

  6. Haplotype-Based Genotyping in Polyploids.

    PubMed

    Clevenger, Josh P; Korani, Walid; Ozias-Akins, Peggy; Jackson, Scott

    2018-01-01

    Accurate identification of polymorphisms from sequence data is crucial to unlocking the potential of high throughput sequencing for genomics. Single nucleotide polymorphisms (SNPs) are difficult to accurately identify in polyploid crops due to the duplicative nature of polyploid genomes leading to low confidence in the true alignment of short reads. Implementing a haplotype-based method in contrasting subgenome-specific sequences leads to higher accuracy of SNP identification in polyploids. To test this method, a large-scale 48K SNP array (Axiom Arachis2) was developed for Arachis hypogaea (peanut), an allotetraploid, in which 1,674 haplotype-based SNPs were included. Results of the array show that 74% of the haplotype-based SNP markers could be validated, which is considerably higher than previous methods used for peanut. The haplotype method has been implemented in a standalone program, HAPLOSWEEP, which takes as input bam files and a vcf file and identifies haplotype-based markers. Haplotype discovery can be made within single reads or span paired reads, and can leverage long read technology by targeting any length of haplotype. Haplotype-based genotyping is applicable in all allopolyploid genomes and provides confidence in marker identification and in silico-based genotyping for polyploid genomics.

  7. Using Poisson mixed-effects model to quantify transcript-level gene expression in RNA-Seq.

    PubMed

    Hu, Ming; Zhu, Yu; Taylor, Jeremy M G; Liu, Jun S; Qin, Zhaohui S

    2012-01-01

    RNA sequencing (RNA-Seq) is a powerful new technology for mapping and quantifying transcriptomes using ultra high-throughput next-generation sequencing technologies. Using deep sequencing, gene expression levels of all transcripts including novel ones can be quantified digitally. Although extremely promising, the massive amounts of data generated by RNA-Seq, substantial biases and uncertainty in short read alignment pose challenges for data analysis. In particular, large base-specific variation and between-base dependence make simple approaches, such as those that use averaging to normalize RNA-Seq data and quantify gene expressions, ineffective. In this study, we propose a Poisson mixed-effects (POME) model to characterize base-level read coverage within each transcript. The underlying expression level is included as a key parameter in this model. Since the proposed model is capable of incorporating base-specific variation as well as between-base dependence that affect read coverage profile throughout the transcript, it can lead to improved quantification of the true underlying expression level. POME can be freely downloaded at http://www.stat.purdue.edu/~yuzhu/pome.html. yuzhu@purdue.edu; zhaohui.qin@emory.edu Supplementary data are available at Bioinformatics online.

  8. High-throughput Methods Redefine the Rumen Microbiome and Its Relationship with Nutrition and Metabolism

    PubMed Central

    McCann, Joshua C.; Wickersham, Tryon A.; Loor, Juan J.

    2014-01-01

    Diversity in the forestomach microbiome is one of the key features of ruminant animals. The diverse microbial community adapts to a wide array of dietary feedstuffs and management strategies. Understanding rumen microbiome composition, adaptation, and function has global implications ranging from climatology to applied animal production. Classical knowledge of rumen microbiology was based on anaerobic, culture-dependent methods. Next-generation sequencing and other molecular techniques have uncovered novel features of the rumen microbiome. For instance, pyrosequencing of the 16S ribosomal RNA gene has revealed the taxonomic identity of bacteria and archaea to the genus level, and when complemented with barcoding adds multiple samples to a single run. Whole genome shotgun sequencing generates true metagenomic sequences to predict the functional capability of a microbiome, and can also be used to construct genomes of isolated organisms. Integration of high-throughput data describing the rumen microbiome with classic fermentation and animal performance parameters has produced meaningful advances and opened additional areas for study. In this review, we highlight recent studies of the rumen microbiome in the context of cattle production focusing on nutrition, rumen development, animal efficiency, and microbial function. PMID:24940050

  9. Base pair probability estimates improve the prediction accuracy of RNA non-canonical base pairs

    PubMed Central

    2017-01-01

    Prediction of RNA tertiary structure from sequence is an important problem, but generating accurate structure models for even short sequences remains difficult. Predictions of RNA tertiary structure tend to be least accurate in loop regions, where non-canonical pairs are important for determining the details of structure. Non-canonical pairs can be predicted using a knowledge-based model of structure that scores nucleotide cyclic motifs, or NCMs. In this work, a partition function algorithm is introduced that allows the estimation of base pairing probabilities for both canonical and non-canonical interactions. Pairs that are predicted to be probable are more likely to be found in the true structure than pairs of lower probability. Pair probability estimates can be further improved by predicting the structure conserved across multiple homologous sequences using the TurboFold algorithm. These pairing probabilities, used in concert with prior knowledge of the canonical secondary structure, allow accurate inference of non-canonical pairs, an important step towards accurate prediction of the full tertiary structure. Software to predict non-canonical base pairs and pairing probabilities is now provided as part of the RNAstructure software package. PMID:29107980

  10. FDSTools: A software package for analysis of massively parallel sequencing data with the ability to recognise and correct STR stutter and other PCR or sequencing noise.

    PubMed

    Hoogenboom, Jerry; van der Gaag, Kristiaan J; de Leeuw, Rick H; Sijen, Titia; de Knijff, Peter; Laros, Jeroen F J

    2017-03-01

    Massively parallel sequencing (MPS) is on the advent of a broad scale application in forensic research and casework. The improved capabilities to analyse evidentiary traces representing unbalanced mixtures is often mentioned as one of the major advantages of this technique. However, most of the available software packages that analyse forensic short tandem repeat (STR) sequencing data are not well suited for high throughput analysis of such mixed traces. The largest challenge is the presence of stutter artefacts in STR amplifications, which are not readily discerned from minor contributions. FDSTools is an open-source software solution developed for this purpose. The level of stutter formation is influenced by various aspects of the sequence, such as the length of the longest uninterrupted stretch occurring in an STR. When MPS is used, STRs are evaluated as sequence variants that each have particular stutter characteristics which can be precisely determined. FDSTools uses a database of reference samples to determine stutter and other systemic PCR or sequencing artefacts for each individual allele. In addition, stutter models are created for each repeating element in order to predict stutter artefacts for alleles that are not included in the reference set. This information is subsequently used to recognise and compensate for the noise in a sequence profile. The result is a better representation of the true composition of a sample. Using Promega Powerseq™ Auto System data from 450 reference samples and 31 two-person mixtures, we show that the FDSTools correction module decreases stutter ratios above 20% to below 3%. Consequently, much lower levels of contributions in the mixed traces are detected. FDSTools contains modules to visualise the data in an interactive format allowing users to filter data with their own preferred thresholds. Copyright © 2016 The Authors. Published by Elsevier B.V. All rights reserved.

  11. Novel sequence variants in the TMIE gene in families with autosomal recessive nonsyndromic hearing impairment

    PubMed Central

    Santos, Regie Lyn P.; El-Shanti, Hatem; Sikandar, Shaheen; Lee, Kwanghyuk; Bhatti, Attya; Yan, Kai; Chahrour, Maria H.; McArthur, Nathan; Pham, Thanh L.; Mahasneh, Amjad Abdullah; Ahmad, Wasim

    2010-01-01

    To date, 37 genes have been identified for nonsyndromic hearing impairment (NSHI). Identifying the functional sequence variants within these genes and knowing their population-specific frequencies is of public health value, in particular for genetic screening for NSHI. To determine putatively functional sequence variants in the transmembrane inner ear (TMIE) gene in Pakistani and Jordanian families with autosomal recessive (AR) NSHI, four Jordanian and 168 Pakistani families with ARNSHI that is not due to GJB2 (CX26) were submitted to a genome scan. Two-point and multipoint parametric linkage analyses were performed, and families with logarithmic odds (LOD) scores of 1.0 or greater within the TMIE region underwent further DNA sequencing. The evolutionary conservation and location in predicted protein domains of amino acid residues where sequence variants occurred were studied to elucidate the possible effects of these sequence variants on function. Of seven families that were screened for TMIE, putatively functional sequence variants were found to segregate with hearing impairment in four families but were not seen in at least 110 ethnically matched control chromosomes. The previously reported c.241C>T (p.R81C) variant was observed in two Pakistani families. Two novel variants, c.92A>G (p.E31G) and the splice site mutation c.212–2A>C, were identified in one Pakistani and one Jordanian family, respectively. The c.92A>G (p.E31G) variant occurred at a residue that is conserved in the mouse and is predicted to be extracellular. Conservation and potential functionality of previously published mutations were also examined. The prevalence of functional TMIE variants in Pakistani families is 1.7% [95% confidence interval (CI) 0.3–4.8]. Further studies on the spectrum, prevalence rates, and functional effect of sequence variants in the TMIE gene in other populations should demonstrate the true importance of this gene as a cause of hearing impairment. PMID:16389551

  12. A Simple Method for Amplifying RNA Targets (SMART)

    PubMed Central

    McCalla, Stephanie E.; Ong, Carmichael; Sarma, Aartik; Opal, Steven M.; Artenstein, Andrew W.; Tripathi, Anubhav

    2012-01-01

    We present a novel and simple method for amplifying RNA targets (named by its acronym, SMART), and for detection, using engineered amplification probes that overcome existing limitations of current RNA-based technologies. This system amplifies and detects optimal engineered ssDNA probes that hybridize to target RNA. The amplifiable probe-target RNA complex is captured on magnetic beads using a sequence-specific capture probe and is separated from unbound probe using a novel microfluidic technique. Hybridization sequences are not constrained as they are in conventional target-amplification reactions such as nucleic acid sequence amplification (NASBA). Our engineered ssDNA probe was amplified both off-chip and in a microchip reservoir at the end of the separation microchannel using isothermal NASBA. Optimal solution conditions for ssDNA amplification were investigated. Although KCl and MgCl2 are typically found in NASBA reactions, replacing 70 mmol/L of the 82 mmol/L total chloride ions with acetate resulted in optimal reaction conditions, particularly for low but clinically relevant probe concentrations (≤100 fmol/L). With the optimal probe design and solution conditions, we also successfully removed the initial heating step of NASBA, thus achieving a true isothermal reaction. The SMART assay using a synthetic model influenza DNA target sequence served as a fundamental demonstration of the efficacy of the capture and microfluidic separation system, thus bridging our system to a clinically relevant detection problem. PMID:22691910

  13. Is MMTV associated with human breast cancer? Maybe, but probably not.

    PubMed

    Perzova, Raisa; Abbott, Lynn; Benz, Patricia; Landas, Steve; Khan, Seema; Glaser, Jordan; Cunningham, Coleen K; Poiesz, Bernard

    2017-10-13

    Conflicting results regarding the association of MMTV with human breast cancer have been reported. Published sequence data have indicated unique MMTV strains in some human samples. However, concerns regarding contamination as a cause of false positive results have persisted. We performed PCR assays for MMTV on human breast cancer cell lines and fresh frozen and formalin fixed normal and malignant human breast epithelial samples. Assays were also performed on peripheral blood mononuclear cells from volunteer blood donors and subjects at risk for human retroviral infections. In addition, assays were performed on DNA samples from wild and laboratory mice. Sequencing of MMTV positive samples from both humans and mice were performed and phylogenetically compared. Using PCR under rigorous conditions to prevent and detect "carryover" contamination, we did detect MMTV DNA in human samples, including breast cancer. However, the results were not consistent and seemed to be an artifact. Further, experiments indicated that the probable source of false positives was murine DNA, containing endogenous MMTV, present in our building. However, comparison of published and, herein, newly described MMTV sequences with published data, indicates that there are some very unique human MMTV sequences in the literature. While we could not confirm the true presence of MMTV in our human breast cancer subjects, the data indicate that further, perhaps more traditional, retroviral studies are warranted to ascertain whether MMTV might rarely be the cause of human breast cancer.

  14. Isolation and characterization of full-length cDNA clones coding for cholinesterase from fetal human tissues

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Prody, C.A.; Zevin-Sonkin, D.; Gnatt, A.

    1987-06-01

    To study the primary structure and regulation of human cholinesterases, oligodeoxynucleotide probes were prepared according to a consensus peptide sequence present in the active site of both human serum pseudocholinesterase and Torpedo electric organ true acetylcholinesterase. Using these probes, the authors isolated several cDNA clones from λgt10 libraries of fetal brain and liver origins. These include 2.4-kilobase cDNA clones that code for a polypeptide containing a putative signal peptide and the N-terminal, active site, and C-terminal peptides of human BtChoEase, suggesting that they code either for BtChoEase itself or for a very similar but distinct fetal form of cholinesterase. In RNA blots of poly(A)+ RNA from the cholinesterase-producing fetal brain and liver, these cDNAs hybridized with a single 2.5-kilobase band. Blot hybridization to human genomic DNA revealed that these fetal BtChoEase cDNA clones hybridize with DNA fragments of the total length of 17.5 kilobases, and signal intensities indicated that these sequences are not present in many copies. Both the cDNA-encoded protein and its nucleotide sequence display striking homology to parallel sequences published for Torpedo AcChoEase. These findings demonstrate extensive homologies between the fetal BtChoEase encoded by these clones and other cholinesterases of various forms and species.

  15. Recombinations in Staphylococcal Cassette Chromosome mec Elements Compromise the Molecular Detection of Methicillin Resistance in Staphylococcus aureus

    PubMed Central

    Hill-Cawthorne, Grant A.; Hudson, Lyndsey O.; El Ghany, Moataz Fouad Abd; Piepenburg, Olaf; Nair, Mridul; Dodgson, Andrew; Forrest, Matthew S.

    2014-01-01

    Clinical laboratories are increasingly using molecular tests for methicillin-resistant Staphylococcus aureus (MRSA) screening. However, primers have to be targeted to a variable chromosomal region, the staphylococcal cassette chromosome mec (SCCmec). We initially screened 726 MRSA isolates from a single UK hospital trust by recombinase polymerase amplification (RPA), a novel, isothermal alternative to PCR. Undetected isolates were further characterised using multilocus sequence, spa typing and whole genome sequencing. 96% of our tested phenotypically MRSA isolates contained one of the six orfX-SCCmec junctions our RPA test and commercially available molecular tests target. However 30 isolates could not be detected. Sequencing of 24 of these isolates demonstrated recombinations within the SCCmec element with novel insertions that interfered with the RPA, preventing identification as MRSA. This result suggests that clinical laboratories cannot rely solely upon molecular assays to reliably detect all methicillin-resistance. The presence of significant recombinations in the SCCmec element, where the majority of assays target their primers, suggests that there will continue to be isolates that escape identification. We caution that dependence on amplification-based molecular assays will continue to result in failure to diagnose a small proportion (∼4%) of MRSA isolates, unless the true level of SCCmec natural diversity is determined by whole genome sequencing of a large collection of MRSA isolates. PMID:24972080

  16. Primer ID Validates Template Sampling Depth and Greatly Reduces the Error Rate of Next-Generation Sequencing of HIV-1 Genomic RNA Populations

    PubMed Central

    Zhou, Shuntai; Jones, Corbin; Mieczkowski, Piotr

    2015-01-01

    ABSTRACT Validating the sampling depth and reducing sequencing errors are critical for studies of viral populations using next-generation sequencing (NGS). We previously described the use of Primer ID to tag each viral RNA template with a block of degenerate nucleotides in the cDNA primer. We now show that low-abundance Primer IDs (offspring Primer IDs) are generated due to PCR/sequencing errors. These artifactual Primer IDs can be removed using a cutoff model for the number of reads required to make a template consensus sequence. We have modeled the fraction of sequences lost due to Primer ID resampling. For a typical sequencing run, less than 10% of the raw reads are lost to offspring Primer ID filtering and resampling. The remaining raw reads are used to correct for PCR resampling and sequencing errors. We also demonstrate that Primer ID reveals bias intrinsic to PCR, especially at low template input or utilization. cDNA synthesis and PCR convert ca. 20% of RNA templates into recoverable sequences, and 30-fold sequence coverage recovers most of these template sequences. We have directly measured the residual error rate to be around 1 in 10,000 nucleotides. We use this error rate and the Poisson distribution to define the cutoff to identify preexisting drug resistance mutations at low abundance in an HIV-infected subject. Collectively, these studies show that >90% of the raw sequence reads can be used to validate template sampling depth and to dramatically reduce the error rate in assessing a genetically diverse viral population using NGS. IMPORTANCE Although next-generation sequencing (NGS) has revolutionized sequencing strategies, it suffers from serious limitations in defining sequence heterogeneity in a genetically diverse population, such as HIV-1 due to PCR resampling and PCR/sequencing errors. The Primer ID approach reveals the true sampling depth and greatly reduces errors. 
Knowing the sampling depth allows the construction of a model of how to maximize the recovery of sequences from input templates and to reduce resampling of the Primer ID so that appropriate multiplexing can be included in the experimental design. With the defined sampling depth and measured error rate, we are able to assign cutoffs for the accurate detection of minority variants in viral populations. This approach allows the power of NGS to be realized without having to guess about sampling depth or to ignore the problem of PCR resampling, while also being able to correct most of the errors in the data set. PMID:26041299
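
    The abstract states that the measured residual error rate (about 1 in 10,000 nucleotides) and the Poisson distribution are used to define a cutoff for calling low-abundance mutations. One way such a cutoff could be computed is sketched below: at a single nucleotide position, find the smallest number of variant-bearing template consensus sequences that is unlikely to arise from residual errors alone. The significance level alpha and the template counts are assumptions for illustration, not values from the study.

```python
import math

def poisson_sf(k, lam):
    """P(X >= k) for X ~ Poisson(lam)."""
    cdf_below_k = sum(math.exp(-lam) * lam**i / math.factorial(i)
                      for i in range(k))
    return 1.0 - cdf_below_k

def min_count_cutoff(n_templates, error_rate=1e-4, alpha=0.05):
    """Smallest variant count at one position that residual error alone
    would reach with probability below alpha (alpha is an assumed level).
    """
    lam = n_templates * error_rate  # expected erroneous calls at the position
    k = 1
    while poisson_sf(k, lam) >= alpha:
        k += 1
    return k

for n in (1000, 10000):
    print(n, min_count_cutoff(n))
```

    With these assumed settings, deeper template sampling raises the expected number of erroneous calls per position, so the count required to call a true minority variant rises with it.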

  17. Insights into Deep-Sea Sediment Fungal Communities from the East Indian Ocean Using Targeted Environmental Sequencing Combined with Traditional Cultivation

    PubMed Central

    Zhang, Xiao-yong; Tang, Gui-ling; Xu, Xin-ya; Nong, Xu-hua; Qi, Shu-Hua

    2014-01-01

    The fungal diversity in deep-sea environments has recently gained an increasing amount of attention. Our knowledge and understanding of the true fungal diversity and the role it plays in deep-sea environments, however, is still limited. We investigated the fungal community structure in five sediments from a depth of ∼4000 m in the East Indian Ocean using a combination of targeted environmental sequencing and traditional cultivation. This approach resulted in the recovery of a total of 45 fungal operational taxonomic units (OTUs) and 20 culturable fungal phylotypes. This finding indicates that there is a great amount of fungal diversity in the deep-sea sediments collected in the East Indian Ocean. Three fungal OTUs and one culturable phylotype demonstrated high divergence (89%–97%) from the existing sequences in GenBank. Moreover, 44.4% of fungal OTUs and 30% of culturable fungal phylotypes are new reports for deep-sea sediments. These results suggest that the deep-sea sediments from the East Indian Ocean can serve as habitats for new fungal communities compared with other deep-sea environments. In addition, different fungal communities could be detected when using targeted environmental sequencing compared with traditional cultivation in this study, which suggests that a combination of targeted environmental sequencing and traditional cultivation will generate a more diverse fungal community in deep-sea environments than using either targeted environmental sequencing or traditional cultivation alone. This study is the first to report new insights into the fungal communities in deep-sea sediments from the East Indian Ocean, which increases our knowledge and understanding of the fungal diversity in deep-sea environments. PMID:25272044

  18. Analysis of selected genes associated with cardiomyopathy by next-generation sequencing.

    PubMed

    Szabadosova, Viktoria; Boronova, Iveta; Ferenc, Peter; Tothova, Iveta; Bernasovska, Jarmila; Zigova, Michaela; Kmec, Jan; Bernasovsky, Ivan

    2018-02-01

    As the leading cause of congestive heart failure, cardiomyopathy represents a heterogenous group of heart muscle disorders. Despite considerable progress being made in the genetic diagnosis of cardiomyopathy by detection of the mutations in the most prevalent cardiomyopathy genes, the cause remains unsolved in many patients. High-throughput mutation screening in the disease genes for cardiomyopathy is now possible because of target enrichment followed by next-generation sequencing. The aim of the study was to analyze a panel of genes associated with dilated or hypertrophic cardiomyopathy based on previously published results in order to identify the subjects at risk. Next-generation sequencing on the Illumina HiSeq 2500 platform was used to detect sequence variants in 16 individuals diagnosed with dilated or hypertrophic cardiomyopathy. Detected variants were filtered and the functional impact of amino acid changes was predicted by computational programs. DNA samples of the 16 patients were analyzed by whole exome sequencing. We identified six nonsynonymous variants that were predicted to be pathogenic by all prediction programs used: rs3744998 (EPG5), rs11551768 (MGME1), rs148374985 (MURC), rs78461695 (PLEC), rs17158558 (RET) and rs2295190 (SYNE1). Two of the analyzed sequence variants had minor allele frequency (MAF)<0.01: rs148374985 (MURC), rs34580776 (MYBPC3). Our data support the potential role of the detected variants in pathogenesis of dilated or hypertrophic cardiomyopathy; however, the possibility that these variants might not be true disease-causing variants but are susceptibility alleles that require additional mutations or injury to cause the clinical phenotype of disease must be considered. © 2017 Wiley Periodicals, Inc.

  19. Evolutionary relationships in the ilarviruses: nucleotide sequence of prunus necrotic ringspot virus RNA 3.

    PubMed

    Sánchez-Navarro, J A; Pallás, V

    1997-01-01

    The complete nucleotide sequence of an isolate of prunus necrotic ringspot virus (PNRSV) RNA 3 has been determined. Elucidation of the amino acid sequence of the proteins encoded by the two large open reading frames (ORFs) allowed us to carry out comparative and phylogenetic studies on the movement (MP) and coat (CP) proteins in the ilarvirus group. Amino acid sequence comparison of the MP revealed a highly conserved basic sequence motif with an amphipathic alpha-helical structure preceding the conserved motif of the '30K superfamily' proposed by Mushegian and Koonin [26] for MP's. Within this '30K' motif a strictly conserved transmembrane domain is present in all ilarviruses sequenced so far. At the amino-terminal end, prune dwarf virus (PDV) has an extension not present in other ilarviruses but which is observed in all bromo- and cucumoviruses, suggesting a common ancestor or a recombinational event in the Bromoviridae family. Examination of the N-terminus of the CP's of all ilarviruses revealed a highly basic region, part of which resembles the Arg-rich motif that has been characterized in the RNA-binding protein family. This motif has also been found in the other members of the Bromoviridae family, suggesting its involvement in a structural function. Furthermore this region is required for infectivity in ilarviruses. The similarities found in this Arg-rich motif are discussed in terms of this process known as genome activation. Finally, phylogenetic analysis of both the MP and CP proteins revealed a higher relationship of AlMV to PNRSV, apple mosaic virus (ApMV) and PDV than any other member of the ilarvirus group. In that sense, AlMV should be considered as a true ilarvirus instead of forming a distinct group of viruses.

  20. Development of internal COI primers to improve and extend barcoding of fruit flies (Diptera: Tephritidae: Dacini).

    PubMed

    Krosch, Matt N; Strutt, Francesca; Blacket, Mark J; Batovska, Jana; Starkie, Melissa; Clarke, Anthony R; Cameron, Stephen L; Schutze, Mark K

    2018-06-06

    Accurate species-level identifications underpin many aspects of basic and applied biology; however, identifications can be hampered by a lack of discriminating morphological characters, taxonomic expertise or time. Molecular approaches, such as DNA 'barcoding' of the cytochrome c oxidase (COI) gene, are argued to overcome these issues. However, nuclear encoding of mitochondrial genes (numts) and poor amplification success of suboptimally preserved specimens can lead to erroneous identifications. One insect group for which these molecular and morphological problems are significant are the dacine fruit flies (Diptera: Tephritidae: Dacini). We addressed these issues associated with COI barcoding in the dacines by first assessing several 'universal' COI primers against public mitochondrial genome and numt sequences for dacine taxa. We then modified a set of four primers that more closely matched true dacine COI sequence and amplified two overlapping portions of the COI barcode region. Our new primers were tested alongside universal primers on a selection of dacine species, including both fresh preserved and decades-old dry specimens. Additionally, Bactrocera tryoni mitochondrial and nuclear genomes were compared to identify putative numts. Four numt clades were identified, three of which were amplified using existing universal primers. In contrast, our new primers preferentially amplified the 'true' mitochondrial COI barcode in all dacine species tested. The new primers also successfully amplified partial barcodes from dry specimens for which full length barcodes were unobtainable. Thus we recommend these new primers be incorporated into the suites of primers used by diagnosticians and quarantine labs for the accurate identification of dacine species. This article is protected by copyright. All rights reserved.

  1. Effectiveness of phylogenomic data and coalescent species-tree methods for resolving difficult nodes in the phylogeny of advanced snakes (Serpentes: Caenophidia).

    PubMed

    Pyron, R Alexander; Hendry, Catriona R; Chou, Vincent M; Lemmon, Emily M; Lemmon, Alan R; Burbrink, Frank T

    2014-12-01

    Next-generation genomic sequencing promises to quickly and cheaply resolve remaining contentious nodes in the Tree of Life, and facilitates species-tree estimation while taking into account stochastic genealogical discordance among loci. Recent methods for estimating species trees bypass full likelihood-based estimates of the multi-species coalescent, and approximate the true species-tree using simpler summary metrics. These methods converge on the true species-tree with sufficient genomic sampling, even in the anomaly zone. However, no studies have yet evaluated their efficacy on a large-scale phylogenomic dataset, and compared them to previous concatenation strategies. Here, we generate such a dataset for Caenophidian snakes, a group with >2500 species that contains several rapid radiations that were poorly resolved with fewer loci. We generate sequence data for 333 single-copy nuclear loci with ∼100% coverage (∼0% missing data) for 31 major lineages. We estimate phylogenies using neighbor joining, maximum parsimony, maximum likelihood, and three summary species-tree approaches (NJst, STAR, and MP-EST). All methods yield similar resolution and support for most nodes. However, not all methods support monophyly of Caenophidia, with Acrochordidae placed as the sister taxon to Pythonidae in some analyses. Thus, phylogenomic species-tree estimation may occasionally disagree with well-supported relationships from concatenated analyses of small numbers of nuclear or mitochondrial genes, a consideration for future studies. In contrast for at least two diverse, rapid radiations (Lamprophiidae and Colubridae), phylogenomic data and species-tree inference do little to improve resolution and support. Thus, certain nodes may lack strong signal, and larger datasets and more sophisticated analyses may still fail to resolve them. Copyright © 2014 Elsevier Inc. All rights reserved.

  2. On the evaluation of the fidelity of supervised classifiers in the prediction of chimeric RNAs.

    PubMed

    Beaumeunier, Sacha; Audoux, Jérôme; Boureux, Anthony; Ruffle, Florence; Commes, Thérèse; Philippe, Nicolas; Alves, Ronnie

    2016-01-01

    High-throughput sequencing technology and bioinformatics have identified chimeric RNAs (chRNAs), raising the possibility that chRNAs expressed particularly in diseases can be used as potential biomarkers in both diagnosis and prognosis. The task of discriminating true chRNAs from the false ones poses an interesting Machine Learning (ML) challenge. First of all, the sequencing data may contain false reads due to technical artifacts and during the analysis process, bioinformatics tools may generate false positives due to methodological biases. Moreover, even if we succeed in obtaining a proper set of observations (enough sequencing data) about true chRNAs, chances are that the devised model may not be able to generalize beyond it. Like any other machine learning problem, the first big issue is finding good data to build models. As far as we are aware, there is no common benchmark data available for chRNA detection. The definition of a classification baseline is lacking in the related literature too. In this work we are moving towards benchmark data and an evaluation of the fidelity of supervised classifiers in the prediction of chRNAs. We proposed a modelization strategy that can be used to increase the tools' performance in the context of chRNA classification, based on a simulated data generator that permits continuous integration of new complex chimeric events. The pipeline incorporated a genome mutation process and simulated RNA-seq data. The reads within distinct depth were aligned and analysed by CRAC that integrates genomic location and local coverage, allowing biological predictions at the read scale. Additionally, these reads were functionally annotated and aggregated to form chRNA events, making it possible to evaluate the performance of ML methods (classifiers) at the level of both reads and events. Ensemble learning strategies proved to be more robust for this classification problem, providing an average AUC performance of 95 % (ACC=94 %, Kappa=0.87 %). The resulting classification models were also tested on real RNA-seq data from a set of twenty-seven patients with acute myeloid leukemia (AML).
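
    The accuracy (ACC) and Kappa figures quoted above are standard summaries of a binary confusion matrix. The following is a minimal sketch of how such values can be computed, assuming Kappa means Cohen's kappa; the function name and the counts are hypothetical and are not taken from the study.

```python
def accuracy_and_kappa(tp, fp, fn, tn):
    """Return (accuracy, Cohen's kappa) for a 2x2 confusion matrix.

    tp/fp/fn/tn are counts of true positives, false positives,
    false negatives and true negatives (hypothetical inputs).
    """
    n = tp + fp + fn + tn
    # Observed agreement between predictions and labels (= accuracy).
    p_observed = (tp + tn) / n
    # Agreement expected by chance, from the row and column marginals.
    p_expected = ((tp + fp) * (tp + fn) + (fn + tn) * (fp + tn)) / (n * n)
    kappa = (p_observed - p_expected) / (1 - p_expected)
    return p_observed, kappa


if __name__ == "__main__":
    # Illustrative counts only: 100 events, 90 classified correctly.
    acc, kappa = accuracy_and_kappa(tp=45, fp=5, fn=5, tn=45)
    print(f"ACC={acc:.2f} kappa={kappa:.2f}")
```

    Kappa corrects accuracy for the agreement expected by chance, which is why it is usually reported alongside ACC when the true and false classes may be unbalanced.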

  3. Towards a molecular taxonomic key of the Aurantioideae subfamily using chloroplastic SNP diagnostic markers of the main clades genotyped by competitive allele-specific PCR.

    PubMed

    Oueslati, Amel; Ollitrault, Frederique; Baraket, Ghada; Salhi-Hannachi, Amel; Navarro, Luis; Ollitrault, Patrick

    2016-08-18

    Chloroplast DNA is a primary source of molecular variations for phylogenetic analysis of photosynthetic eukaryotes. However, the sequencing and analysis of multiple chloroplastic regions is difficult to apply to large collections or large samples of natural populations. The objective of our work was to demonstrate that a molecular taxonomic key based on an easy, scalable and low-cost genotyping method should be developed from a set of Single Nucleotide Polymorphisms (SNPs) diagnostic of well-established clades. It was applied to the Aurantioideae subfamily, the largest group of the Rutaceae family that includes the cultivated citrus species. The publicly available nucleotide sequences of eight plastid genomic regions were compared for 79 accessions of the Aurantioideae subfamily to search for SNPs revealing taxonomic differentiation at the inter-tribe, inter-subtribe, inter-genus and interspecific levels. Diagnostic SNPs (DSNPs) were found for 46 of the 54 clade levels analysed. Forty DSNPs were selected to develop KASPar markers and their taxonomic value was tested by genotyping 108 accessions of the Aurantioideae subfamily. Twenty-seven markers diagnostic of 24 clades were validated and they displayed a very high rate of transferability in the Aurantioideae subfamily (only 1.2 % of missing data on average). The UPGMA from the validated markers produced a cladistic organisation that was highly coherent with the previous phylogenetic analysis based on the sequence data of the eight plastid regions. In particular, the monophyletic origin of the "true citrus" genera plus Oxanthera was validated. However, some clarification remains necessary regarding the organisation of the other wild species of the Citreae tribe. We validated the concept that with well-established clades, DSNPs can be selected and efficiently transformed into competitive allele-specific PCR markers (KASPar method) allowing cost-effective highly efficient cladistic analysis in large collections at subfamily level. The robustness of this genotyping method is an additional decisive advantage for network collaborative research. The availability of WGS data for the main "true citrus" species should soon make it possible to develop a set of DSNP markers allowing very fine resolution of this very important horticultural group.

  4. Multicentre experience with the BridgePoint devices to facilitate recanalisation of chronic total coronary occlusions through controlled subintimal re-entry.

    PubMed

    Werner, Gerald S; Schofer, Joachim; Sievert, Horst; Kugler, Chad; Reifart, Nicolaus J

    2011-06-01

    The major challenge for the interventional treatment of chronic total coronary occlusion (CTO) is a low primary success rate. A common problem is the passage of the recanalisation wire into a subintimal position. New devices, which were evaluated in the first multicentre study in CTOs resistant to a conventional wire approach, may help to facilitate a controlled re-entry into the true lumen. The aim of this study was to assess the safety and efficacy of this approach, with successful true lumen distal wire passage as the primary endpoint. Forty-two patients were enrolled in four centres with high expertise in PCI for CTOs. All CTOs were of at least three months duration, and were initially attempted with dedicated recanalisation wires. After failure to pass or creation of a subintimal dissection, the BridgePoint devices were applied, consisting of a ball-tipped catheter (CrossBoss) to pass the proximal occlusion cap, and a flat-shaped balloon catheter (Stingray catheter) to be inflated within the subintimal space to guide the re-entry into the true vessel lumen with a special wire (Stingray guidewire). The primary endpoint was met in 67% of all patients. A higher success rate seemed to be possible when all devices were used in sequence, beginning with the CrossBoss and, in the case of a subintimal passage, followed by the Stingray. True lumen re-entry failed because of the loss of distal contrast filling and thus loss of a target for re-entry, and because of a failure to advance the Stingray balloon far enough distal and parallel to the distal lumen. There were no severe device related complications. In patients with complex CTOs referred to dedicated centres with high experience in CTOs, these results demonstrate the potential of a guided re-entry from a subintimal wire position by use of the BridgePoint devices.

  5. PreTIS: A Tool to Predict Non-canonical 5’ UTR Translational Initiation Sites in Human and Mouse

    PubMed Central

    Reuter, Kerstin; Helms, Volkhard

    2016-01-01

    Translation of mRNA sequences into proteins typically starts at an AUG triplet. In rare cases, translation may also start at alternative non–AUG codons located in the annotated 5’ UTR which leads to an increased regulatory complexity. Since ribosome profiling detects translational start sites at the nucleotide level, the properties of these start sites can then be used for the statistical evaluation of functional open reading frames. We developed a linear regression approach to predict in–frame and out–of–frame translational start sites within the 5’ UTR from mRNA sequence information together with their translation initiation confidence. Predicted start codons comprise AUG as well as near–cognate codons. The underlying datasets are based on published translational start sites for human HEK293 and mouse embryonic stem cells that were derived by the original authors from ribosome profiling data. The average prediction accuracy of true vs. false start sites for HEK293 cells was 80%. When applied to mouse mRNA sequences, the same model predicted translation initiation sites observed in mouse ES cells with an accuracy of 76%. Moreover, we illustrate the effect of in silico mutations in the flanking sequence context of a start site on the predicted initiation confidence. Our new webservice PreTIS visualizes alternative start sites and their respective ORFs and predicts their ability to initiate translation. Solely, the mRNA sequence is required as input. PreTIS is accessible at http://service.bioinformatik.uni-saarland.de/pretis. PMID:27768687
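
    The candidate sites such a predictor would score can be enumerated directly from the sequence. The sketch below lists AUG and near-cognate codons inside a 5' UTR and labels each as in-frame or out-of-frame relative to the annotated start. It is not the PreTIS implementation: the function names are hypothetical, near-cognate is assumed to mean "differs from AUG at exactly one position", and the regression step that assigns an initiation confidence is not reproduced.

```python
from itertools import product


def near_cognate_codons():
    """Codons differing from AUG at exactly one position (assumed definition)."""
    codons = set()
    for triplet in product("ACGU", repeat=3):
        codon = "".join(triplet)
        mismatches = sum(a != b for a, b in zip(codon, "AUG"))
        if mismatches == 1:
            codons.add(codon)
    return codons


def candidate_start_sites(mrna, cds_start):
    """List (position, codon, in_frame) for start codons within the 5' UTR.

    mrna      -- transcript sequence (DNA or RNA alphabet)
    cds_start -- 0-based index of the annotated AUG (hypothetical input)
    """
    seq = mrna.upper().replace("T", "U")
    starts = near_cognate_codons() | {"AUG"}
    sites = []
    # Only consider codons that lie entirely upstream of the annotated start.
    for pos in range(max(cds_start - 2, 0)):
        codon = seq[pos:pos + 3]
        if codon in starts:
            in_frame = (cds_start - pos) % 3 == 0
            sites.append((pos, codon, in_frame))
    return sites


if __name__ == "__main__":
    # Toy transcript: annotated AUG begins at index 9.
    for site in candidate_start_sites("CUGACUGGAAUGGCC", cds_start=9):
        print(site)
```

    A site is in-frame when its distance to the annotated AUG is a multiple of three, so translation from it would extend the annotated protein at the N-terminus unless a stop codon intervenes.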

  6. Diversity of Bacteria at Healthy Human Conjunctiva

    PubMed Central

    Dong, Qunfeng; Brulc, Jennifer M.; Iovieno, Alfonso; Bates, Brandon; Garoutte, Aaron; Miller, Darlene; Revanna, Kashi V.; Gao, Xiang; Antonopoulos, Dionysios A.; Slepak, Vladlen Z.

    2011-01-01

    Purpose. Ocular surface (OS) microbiota contributes to infectious and autoimmune diseases of the eye. Comprehensive analysis of microbial diversity at the OS has been impossible because of the limitations of conventional cultivation techniques. This pilot study aimed to explore true diversity of human OS microbiota using DNA sequencing-based detection and identification of bacteria. Methods. Composition of the bacterial community was characterized using deep sequencing of the 16S rRNA gene amplicon libraries generated from total conjunctival swab DNA. The DNA sequences were classified and the diversity parameters measured using bioinformatics software ESPRIT and MOTHUR and tools available through the Ribosomal Database Project-II (RDP-II). Results. Deep sequencing of conjunctival rDNA from four subjects yielded a total of 115,003 quality DNA reads, corresponding to 221 species-level phylotypes per subject. The combined bacterial community classified into 5 phyla and 59 distinct genera. However, 31% of all DNA reads belonged to unclassified or novel bacteria. The intersubject variability of individual OS microbiomes was very significant. Regardless, 12 genera (Pseudomonas, Propionibacterium, Bradyrhizobium, Corynebacterium, Acinetobacter, Brevundimonas, Staphylococci, Aquabacterium, Sphingomonas, Streptococcus, Streptophyta, and Methylobacterium) were ubiquitous among the analyzed cohort and represented the putative “core” of conjunctival microbiota. The other 47 genera accounted for <4% of the classified portion of this microbiome. Unexpectedly, healthy conjunctiva contained many genera that are commonly identified as ocular surface pathogens. Conclusions. The first DNA sequencing-based survey of bacterial population at the conjunctiva has revealed an unexpectedly diverse microbial community. All analyzed samples contained ubiquitous (core) genera that included commensal, environmental, and opportunistic pathogenic bacteria. PMID:21571682

  7. Unique properties of multiple tandem copies of the M26 recombination hotspot in mitosis and meiosis in Schizosaccharomyces pombe.

    PubMed

    Steiner, Walter W; Recor, Chelsea L; Zakrzewski, Bethany M

    2016-11-15

    The M26 hotspot of the fission yeast Schizosaccharomyces pombe is one of the best-characterized eukaryotic hotspots of recombination. The hotspot requires a seven bp sequence, ATGACGT, that serves as a binding site for the Atf1-Pcr1 transcription factor, which is also required for activity. The M26 hotspot is active in meiosis but not mitosis and is active in some but not all chromosomal contexts and not on a plasmid. A longer palindromic version of M26, ATGACGTCAT, shows significantly greater activity than the seven bp sequence. Here, we tested whether the properties of the seven bp sequence were also true of the longer sequence by placing one, two, or three copies of the sequence into the ade6 gene, where M26 was originally discovered. These constructs were tested for activity when located on a plasmid or on a chromosome in mitosis and meiosis. We found that two copies of the 10bp M26 motif on a chromosome were significantly more active for meiotic recombination than one, but no further increase was observed with three copies. However, three copies of M26 on a chromosome created an Atf1-dependent mitotic recombination hotspot. When located on a plasmid, M26 also appears to behave as a mitotic recombination hotspot; however, this behavior most likely results from Atf1-dependent inter-allelic complementation between the plasmid and chromosomal ade6 alleles. Copyright © 2016 Elsevier B.V. All rights reserved.
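
    The relationship between the two motifs named above can be checked directly. This minimal sketch confirms that the 10 bp version equals its own reverse complement (the sense in which it is palindromic) while the seven bp sequence does not, and then locates motif copies in a sequence. The helper names and the tandem test sequence are hypothetical; the actual ade6 constructs are not reproduced here.

```python
# Complement table for the four DNA bases.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")

M26_CORE = "ATGACGT"            # seven bp hotspot sequence from the abstract
M26_PALINDROME = "ATGACGTCAT"   # longer 10 bp version from the abstract


def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def is_palindromic(seq):
    """A DNA palindrome reads the same on both strands (5'->3')."""
    return seq == reverse_complement(seq)


def find_motif(seq, motif):
    """Return all 0-based start positions of motif in seq, overlaps included."""
    positions = []
    start = seq.find(motif)
    while start != -1:
        positions.append(start)
        start = seq.find(motif, start + 1)
    return positions


if __name__ == "__main__":
    print(is_palindromic(M26_PALINDROME))   # 10 bp motif
    print(is_palindromic(M26_CORE))         # 7 bp motif
    # Hypothetical insert carrying three tandem copies of the 10 bp motif.
    tandem = M26_PALINDROME * 3
    print(find_motif(tandem, M26_PALINDROME))
```

    Because the 10 bp motif is its own reverse complement, it presents the seven bp sequence on both strands, once as ATGACGT and once as its reverse complement ACGTCAT.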

  8. Photometry and spectroscopy in the open cluster Alpha Persei, 2

    NASA Technical Reports Server (NTRS)

    Prosser, Charles F.

    1993-01-01

    Results from a combination of new spectroscopic and photometric observations in the lower main-sequence and pre-main sequence of the open cluster alpha Persei are presented. New echelle spectroscopy has provided radial and rotational velocity information for thirteen candidate members, three of which are nonmembers based on radial velocity, absence of a Li 6707A feature, and absence of H-alpha emission. A set of revised rotational velocity estimates for several slowly rotating candidates identified earlier is given, yielding rotational velocities as low as 7 km/s for two apparent cluster members. VRI photometry for several pre-main sequence members is given; the new (V,V-I(sub K)) photometry yields a more clearly defined pre-main sequence. A list of approximately 43 new faint candidate members based on the (V,V-I(sub K)) CCD photometry is presented in an effort to identify additional cluster members at very low masses. Low-dispersion spectra obtained for several of these candidates provide in some cases supporting evidence for cluster membership. The single brown dwarf candidate in this cluster is for the first time placed in a color-magnitude diagram with other cluster members, providing a better means for establishing its true status. Stars from among the list of new photometric candidates may provide the means for establishing a sequence of cluster members down to very faint magnitudes (V approximately 21) and consequently very low masses. New coordinate determinations for previous candidate members and finding charts for the new photometric candidates are provided in appendices.

  9. Methods for comparative metagenomics

    PubMed Central

    Huson, Daniel H; Richter, Daniel C; Mitra, Suparna; Auch, Alexander F; Schuster, Stephan C

    2009-01-01

    Background Metagenomics is a rapidly growing field of research that aims at studying uncultured organisms to understand the true diversity of microbes, their functions, cooperation and evolution, in environments such as soil, water, ancient remains of animals, or the digestive system of animals and humans. The recent development of ultra-high throughput sequencing technologies, which do not require cloning or PCR amplification, and can produce huge numbers of DNA reads at an affordable cost, has boosted the number and scope of metagenomic sequencing projects. Increasingly, there is a need for new ways of comparing multiple metagenomics datasets, and for fast and user-friendly implementations of such approaches. Results This paper introduces a number of new methods for interactively exploring, analyzing and comparing multiple metagenomic datasets, which will be made freely available in a new, comparative version 2.0 of the stand-alone metagenome analysis tool MEGAN. Conclusion There is a great need for powerful and user-friendly tools for comparative analysis of metagenomic data and MEGAN 2.0 will help to fill this gap. PMID:19208111

  10. Development and validation of a quantitative PCR for rapid and specific detection of California sea lion adenovirus 1 and prevalence in wild and managed populations.

    PubMed

    Cortés-Hinojosa, Galaxia; Gulland, Frances M D; Goldstein, Tracey; Venn-Watson, Stephanie; Rivera, Rebecca; Archer, Linda L; Waltzek, Thomas B; Gray, Gregory C; Wellehan, James F X

    2017-03-01

    California sea lion adenovirus 1 (CSLAdV-1) has been associated with hepatitis and enteritis in several wild and captive populations of diverse pinniped species. Currently available tests have been limited to pan-adenoviral polymerase chain reaction (PCR) followed by sequencing. We present the development of a quantitative probe-hybridization PCR (qPCR) assay for rapid, sensitive, and specific detection of this virus in California sea lions (Zalophus californianus) and other pinnipeds. This assay did not amplify other mammalian adenoviruses and is able to detect consistently down to 10 viral copies per well. Compared with the gold standard conventional pan-adenovirus PCR/sequencing assay, diagnostic sensitivity and specificity of 100% and 88.2% were found, respectively. The lower diagnostic specificity of this qPCR assay may be the result of the lower limit of detection of this assay compared with the gold standard rather than the result of detection of true false-positives.
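
    Diagnostic sensitivity and specificity are computed by tabulating the new assay against the gold standard. The sketch below shows the arithmetic only; the function name is hypothetical, and the counts are invented for illustration (chosen so the percentages resemble those quoted) and are not the study's data, which the abstract does not give.

```python
def sensitivity_specificity(tp, fn, tn, fp):
    """Return (sensitivity, specificity) of a test versus a gold standard.

    tp -- gold-standard positives that the new test also calls positive
    fn -- gold-standard positives that the new test misses
    tn -- gold-standard negatives that the new test also calls negative
    fp -- gold-standard negatives that the new test calls positive
    """
    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)
    return sensitivity, specificity


if __name__ == "__main__":
    # Hypothetical counts, NOT taken from the study.
    sens, spec = sensitivity_specificity(tp=20, fn=0, tn=15, fp=2)
    print(f"sensitivity={sens:.1%} specificity={spec:.1%}")
```

    If the new assay has a lower limit of detection than the gold standard, some samples counted here as fp may be genuine low-copy positives, which is the interpretation the abstract offers for the reduced specificity.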

  11. A synonymous mutation in TCOF1 causes Treacher Collins syndrome due to mis-splicing of a constitutive exon.

    PubMed

    Macaya, D; Katsanis, S H; Hefferon, T W; Audlin, S; Mendelsohn, N J; Roggenbuck, J; Cutting, G R

    2009-08-01

    Interpretation of the pathogenicity of sequence alterations in disease-associated genes is challenging. This is especially true for novel alterations that lack obvious functional consequences. We report here on a patient with Treacher Collins syndrome (TCS) found to carry a previously reported mutation, c.122C > T, which predicts p.A41V, and a novel synonymous mutation, c.3612A > C. Pedigree analysis showed that the c.122C > T mutation segregated with normal phenotypes in multiple family members while the c.3612A > C was de novo in the patient. Analysis of TCOF1 RNA in lymphocytes showed a transcript missing exon 22. These results show that TCS in the patient is due to haploinsufficiency of TCOF1 caused by the synonymous de novo c.3612A > C mutation. This study highlights the importance of clinical and pedigree evaluation in the interpretation of known and novel sequence alterations. 2009 Wiley-Liss, Inc.

  12. Natural Product Biosynthetic Diversity and Comparative Genomics of the Cyanobacteria.

    PubMed

    Dittmann, Elke; Gugger, Muriel; Sivonen, Kaarina; Fewer, David P

    2015-10-01

    Cyanobacteria are an ancient lineage of slow-growing photosynthetic bacteria and a prolific source of natural products with intricate chemical structures and potent biological activities. The bulk of these natural products are known from just a handful of genera. Recent efforts have elucidated the mechanisms underpinning the biosynthesis of a diverse array of natural products from cyanobacteria. Many of the biosynthetic mechanisms are unique to cyanobacteria or rarely described from other organisms. Advances in genome sequence technology have precipitated a deluge of genome sequences for cyanobacteria. This makes it possible to link known natural products to biosynthetic gene clusters but also accelerates the discovery of new natural products through genome mining. These studies demonstrate that cyanobacteria encode a huge variety of cryptic gene clusters for the production of natural products, and the known chemical diversity is likely to be just a fraction of the true biosynthetic capabilities of this fascinating and ancient group of organisms. Copyright © 2015. Published by Elsevier Ltd.

  13. Massive thymic hemorrhage and hemothorax occurring in utero.

    PubMed

    Gargano, Giancarlo; Paltrinieri, Anna Lucia; Gallo, Claudio; Di Pancrazio, Luciana; Roversi, Maria Federica; Ferrari, Fabrizio

    2015-11-14

    Thymic enlargement is a common and physiological finding in children and neonates' X-rays, but it is usually asymptomatic. Occasionally it can cause respiratory distress. In most cases the aetiology of this expansion remains unclear and it is diagnosed as a thymic hyperplasia. True thymic hyperplasia is defined as a gland expansion, both in size and weight, while maintaining normal microscopic architecture. Often it is a diagnosis of exclusion and prognosis is good. Thymic haemorrhage is an unusual condition related to high foetal and neonatal mortality. We report a case of spontaneous massive thymic haemorrhage in a newborn developing at birth acute respiratory distress associated with severe bilateral haemothorax. Thymic enlargement was evident after pleural evacuation and confirmed by radiographic, Computed Tomography (CT) images and Magnetic Resonance Imaging (MRI) sequences. The spontaneous resolution of this enlargement seen with CT scan and MRI sequences suggested a thymic haemorrhage; surgery was not necessary. Thymic haemorrhage should be considered in newborn infants with pleural effusion, mediastinal space enlargement and Respiratory Distress.

  14. Identification of Marteilia refringens infecting the razor clam Solen marginatus by PCR and in situ hybridization.

    PubMed

    López-Flores, Inmaculada; Garrido-Ramos, Manuel A; de la Herran, Roberto; Ruiz-Rejón, Carmelo; Ruiz-Rejón, Manuel; Navas, José I

    2008-06-01

    Marteilia refringens is a protozoan parasite recognized as a significant pathogen of the European flat oyster Ostrea edulis. It is believed to have a complex life-cycle involving several hosts. In this study, we applied molecular approaches to identify this parasite in samples of the razor clam Solen marginatus from the south west coast of Spain. We used a PCR assay to amplify a fragment of the IGS rDNA region. PCR products were sequenced and the phylogenetic affinity of the sequences was determined. In situ hybridization analysis showed tissue distribution and presence of different developmental stages of the parasite in the digestive diverticula epithelium, which suggested a true parasitism in these individuals. This is the first report of the occurrence of M. refringens in the razor clam S. marginatus in the south Atlantic. The methodology described herein may be useful for accurate identification of the parasite strain in different hosts and thus provide valuable information for marteiliosis control programmes.

  15. Evolutionary characterization of the West Nile Virus complete genome.

    PubMed

    Gray, R R; Veras, N M C; Santos, L A; Salemi, M

    2010-07-01

    The spatial dynamics of the West Nile Virus epidemic in North America are largely unknown. Previous studies that investigated the evolutionary history of the virus used sequence data from the structural genes (prM and E); however, these regions may lack phylogenetic information and obscure true evolutionary relationships. This study systematically evaluated the evolutionary patterns in the eleven genes of the WNV genome in order to determine which region(s) were most phylogenetically informative. We found that while the E region lacks resolution and can potentially result in misleading conclusions, the full NS3 or NS5 regions have strong phylogenetic signal. Furthermore, we show that geographic structure of WNV infection within the US is more pronounced than previously reported in studies that used the structural genes. We conclude that future evolutionary studies should focus on NS3 and NS5 in order to maximize the available sequences while retaining maximal interpretative power to infer temporal and geographic trends among WNV strains. Copyright 2010 Elsevier Inc. All rights reserved.

  16. The epigenomic interface between genome and environment in common complex diseases.

    PubMed

    Bell, Christopher G; Beck, Stephan

    2010-12-01

    The epigenome plays a pivotal role as the interface between genome and environment. True genome-wide assessments of epigenetic marks, such as DNA methylation (methylomes) or chromatin modifications (chromatinomes), are now possible, either through high-throughput arrays or increasingly by second-generation DNA sequencing methods. The ability to collect these data at this level of resolution enables us to begin to pose detailed questions, and to interrogate this information, with regard to changes that occur due to development, lineage and tissue-specificity, and, significantly, those caused by environmental influence, such as ageing, stress, diet, hormones or toxins. Common complex traits are under variable levels of genetic influence and additionally epigenetic effect. The detection of pathological epigenetic alterations will reveal additional insights into their aetiology and how possible environmental modulation of this mechanism may occur. Due to the reversibility of these marks, the potential for sequence-specific targeted therapeutics exists. This review surveys recent epigenomic advances and their current and prospective application to the study of common diseases.

  17. A Population Study of Wide-Separation Brown Dwarf Companions to Main Sequence Stars

    NASA Technical Reports Server (NTRS)

    Smith, Jeffrey J.

    2005-01-01

    Increased interest in infrared astronomy has opened the frontier to study cooler objects that shed significant light on the formation of planetary systems. Brown dwarf research provides a wealth of information useful for sorting through a myriad of proposed formation theories. Our study combines observational data from 2MASS with rigorous computer simulations to estimate the true population of long-range (greater than 1000 AU) brown dwarf companions in the solar neighborhood (less than 25 pc from Earth). Expanding on Gizis et al. (2001), we have found the margin of error in previous estimates to be significantly underestimated after we included orbit eccentricity, longitude of pericenter, angle of inclination, field star density, and primary and secondary luminosities as parameters influencing the companion systems in observational studies. We apply our simulation results to current L- and T-dwarf catalogs to provide updated estimates on the frequency of wide-separation brown dwarf companions to main sequence stars.

  18. Phylogenetic classification and the universal tree.

    PubMed

    Doolittle, W F

    1999-06-25

    From comparative analyses of the nucleotide sequences of genes encoding ribosomal RNAs and several proteins, molecular phylogeneticists have constructed a "universal tree of life," taking it as the basis for a "natural" hierarchical classification of all living things. Although confidence in some of the tree's early branches has recently been shaken, new approaches could still resolve many methodological uncertainties. More challenging is evidence that most archaeal and bacterial genomes (and the inferred ancestral eukaryotic nuclear genome) contain genes from multiple sources. If "chimerism" or "lateral gene transfer" cannot be dismissed as trivial in extent or limited to special categories of genes, then no hierarchical universal classification can be taken as natural. Molecular phylogeneticists will have failed to find the "true tree," not because their methods are inadequate or because they have chosen the wrong genes, but because the history of life cannot properly be represented as a tree. However, taxonomies based on molecular sequences will remain indispensable, and understanding of the evolutionary process will ultimately be enriched, not impoverished.

  19. DrImpute: imputing dropout events in single cell RNA sequencing data.

    PubMed

    Gong, Wuming; Kwak, Il-Youp; Pota, Pruthvi; Koyano-Nakagawa, Naoko; Garry, Daniel J

    2018-06-08

    The single cell RNA sequencing (scRNA-seq) technique has begun a new era by allowing the observation of gene expression at the single cell level. However, there is also a large amount of technical and biological noise. Because of the low number of RNA transcriptomes and the stochastic nature of the gene expression pattern, there is a high chance of missing nonzero entries as zero, which are called dropout events. We develop DrImpute to impute dropout events in scRNA-seq data. We show that DrImpute has significantly better performance on the separation of the dropout zeros from true zeros than existing imputation algorithms. We also demonstrate that DrImpute can significantly improve the performance of existing tools for clustering, visualization and lineage reconstruction of nine published scRNA-seq datasets. DrImpute can serve as a very useful addition to the currently existing statistical tools for single cell RNA-seq analysis. DrImpute is implemented in R and is available at https://github.com/gongx030/DrImpute .

  20. Pooled-DNA Sequencing for Elucidating New Genomic Risk Factors, Rare Variants Underlying Alzheimer's Disease.

    PubMed

    Jin, Sheng Chih; Benitez, Bruno A; Deming, Yuetiva; Cruchaga, Carlos

    2016-01-01

    Analyses of genome-wide association studies (GWAS) for complex disorders usually identify common variants with a relatively small effect size that only explain a small proportion of phenotypic heritability. Several studies have suggested that a significant fraction of heritability may be explained by low-frequency (minor allele frequency (MAF) of 1-5 %) and rare variants that are not contained in the commercial GWAS genotyping arrays (Schork et al., Curr Opin Genet Dev 19:212, 2009). Rare variants can also have relatively large effects on risk for developing human diseases or disease phenotype (Cruchaga et al., PLoS One 7:e31039, 2012). However, it is necessary to perform next-generation sequencing (NGS) studies in a large population (>4,000 samples) to detect a significant rare-variant association. Several NGS methods, such as custom capture sequencing and amplicon-based sequencing, are designed to screen a small proportion of the genome, but most of these methods are limited in the number of samples that can be multiplexed (i.e. most sequencing kits only provide 96 distinct indices). Additionally, the sequencing library preparation for 4,000 samples remains expensive and thus conducting NGS studies with the aforementioned methods is not feasible for most research laboratories. The need for low-cost large scale rare-variant detection makes pooled-DNA sequencing an ideally efficient and cost-effective technique to identify rare variants in target regions by sequencing hundreds to thousands of samples. Our recent work has demonstrated that pooled-DNA sequencing can accurately detect rare variants in targeted regions in multiple DNA samples with high sensitivity and specificity (Jin et al., Alzheimers Res Ther 4:34, 2012). In these studies we used a well-established pooled-DNA sequencing approach and a computational package, SPLINTER (short indel prediction by large deviation inference and nonlinear true frequency estimation by recursion) (Vallania et al., Genome Res 20:1711, 2010), for accurate identification of rare variants in large DNA pools. Given an average sequencing coverage of 30× per haploid genome, SPLINTER can detect rare variants and short indels up to 4 base pairs (bp) with high sensitivity and specificity (up to 1 haploid allele in a pool as large as 500 individuals). Step-by-step instructions on how to conduct pooled-DNA sequencing experiments and data analyses are described in this chapter.
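
    The detection limit quoted in this abstract (1 haploid allele in a pool of 500 individuals at 30× coverage per haploid genome) can be turned into a quick back-of-the-envelope calculation. The sketch below is only illustrative arithmetic, not part of SPLINTER; the function name and the assumption of diploid individuals are ours.

```python
def pooled_variant_expectation(n_individuals, coverage_per_haploid,
                               ploidy=2, variant_alleles=1):
    """Return (allele frequency, total pool depth, expected variant reads).

    Assumes every individual contributes `ploidy` haploid genomes to the
    pool in equal amounts (an idealisation of a real DNA pool).
    """
    n_alleles = n_individuals * ploidy                # haploid genomes in the pool
    allele_frequency = variant_alleles / n_alleles    # fraction carrying the variant
    total_depth = coverage_per_haploid * n_alleles    # reads covering one position
    expected_variant_reads = total_depth * allele_frequency
    return allele_frequency, total_depth, expected_variant_reads


if __name__ == "__main__":
    af, depth, reads = pooled_variant_expectation(500, 30)
    print(f"allele frequency: {af}")
    print(f"total depth at a position: {depth}")
    print(f"expected reads carrying the variant: {reads:.1f}")
```

    Under these assumptions a single variant allele in 500 diploid individuals sits at a pool frequency of 0.1%, so the variant signal has to be separated from sequencing error at that level.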

  1. Microfluidic flows of wormlike micellar solutions.

    PubMed

    Zhao, Ya; Cheung, Perry; Shen, Amy Q

    2014-09-01

    The widespread use of wormlike micellar solutions is commonly found in household items such as cosmetic products, industrial fluids used in enhanced oil recovery and as drag reducing agents, and in biological applications such as drug delivery and biosensors. Despite their extensive use, there are still many details about the microscopic micellar structure and the mechanisms by which wormlike micelles form under flow that are not clearly understood. Microfluidic devices provide a versatile platform to study wormlike micellar solutions under various flow conditions and confined geometries. A review of recent investigations using microfluidics to study the flow of wormlike micelles is presented here with an emphasis on three different flow types: shear, elongation, and complex flow fields. In particular, we focus on the use of shear flows to study shear banding, elastic instabilities of wormlike micellar solutions in extensional flow (including stagnation and contraction flow field), and the use of contraction geometries to measure the elongational viscosity of wormlike micellar solutions. Finally, we showcase the use of complex flow fields in microfluidics to generate a stable and nanoporous flow-induced structured phase (FISP) from wormlike micellar solutions. This review shows that the influence of spatial confinement and moderate hydrodynamic forces present in the microfluidic device can give rise to a host of possibilities of microstructural rearrangements and interesting flow phenomena. Copyright © 2014 Elsevier B.V. All rights reserved.

  2. Basic Skills Resource Center: Report on the Preliminary Research Findings

    DTIC Science & Technology

    1985-01-01

    indicates that the higher the level of processing, the greater the comprehension and recall. This is true of word lists (Craik & Lockhart, 1972) as well as... Levels of Processing Principle 9 Content-Driven Strategy/Skills Instruction Principle 10 Instruction, Content, and Prior Knowledge Principle 11 Sequencing... Principle 8 (Levels of Processing) The

  3. Monoparametric family of metrics derived from classical Jensen-Shannon divergence

    NASA Astrophysics Data System (ADS)

    Osán, Tristán M.; Bussandri, Diego G.; Lamberti, Pedro W.

    2018-04-01

    Jensen-Shannon divergence is a well known multi-purpose measure of dissimilarity between probability distributions. It has been proven that the square root of this quantity is a true metric in the sense that, in addition to the basic properties of a distance, it also satisfies the triangle inequality. In this work we extend this last result to prove that in fact it is possible to derive a monoparametric family of metrics from the classical Jensen-Shannon divergence. Motivated by our results, an application into the field of symbolic sequences segmentation is explored. Additionally, we analyze the possibility to extend this result into the quantum realm.
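
    The metric property mentioned in this abstract can be checked numerically for the classical case. The sketch below is a minimal illustration using only the Python standard library; it computes the Jensen-Shannon divergence in bits and its square root, and does not implement the monoparametric family proposed by the authors. The function names are ours.

```python
import math


def kl_divergence(p, q):
    """Kullback-Leibler divergence KL(p || q) in bits; terms with p_i = 0 contribute 0."""
    return sum(pi * math.log2(pi / qi) for pi, qi in zip(p, q) if pi > 0)


def js_divergence(p, q):
    """Classical Jensen-Shannon divergence between two discrete distributions."""
    m = [(pi + qi) / 2 for pi, qi in zip(p, q)]   # mixture distribution
    return 0.5 * kl_divergence(p, m) + 0.5 * kl_divergence(q, m)


def js_distance(p, q):
    """Square root of the JS divergence, which is known to be a true metric."""
    return math.sqrt(max(0.0, js_divergence(p, q)))  # guard against rounding below 0


if __name__ == "__main__":
    p = [0.7, 0.2, 0.1]
    q = [0.1, 0.3, 0.6]
    r = [0.3, 0.4, 0.3]
    d_pq, d_pr, d_rq = js_distance(p, q), js_distance(p, r), js_distance(r, q)
    print(f"d(p,q)={d_pq:.4f}  d(p,r)+d(r,q)={d_pr + d_rq:.4f}")
    # Triangle inequality check for this particular triple.
    print("triangle inequality holds:", d_pq <= d_pr + d_rq)
```

    A numerical check on a few triples is of course not a proof; it only illustrates what the triangle inequality requires of the square-root quantity.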

  4. Towards a complete map of the human long non-coding RNA transcriptome.

    PubMed

    Uszczynska-Ratajczak, Barbara; Lagarde, Julien; Frankish, Adam; Guigó, Roderic; Johnson, Rory

    2018-05-23

    Gene maps, or annotations, enable us to navigate the functional landscape of our genome. They are a resource upon which virtually all studies depend, from single-gene to genome-wide scales and from basic molecular biology to medical genetics. Yet present-day annotations suffer from trade-offs between quality and size, with serious but often unappreciated consequences for downstream studies. This is particularly true for long non-coding RNAs (lncRNAs), which are poorly characterized compared to protein-coding genes. Long-read sequencing technologies promise to improve current annotations, paving the way towards a complete annotation of lncRNAs expressed throughout a human lifetime.

  5. Analysis of expressed sequence tags from Prunus mume flower and fruit and development of simple sequence repeat markers

    PubMed Central

    2010-01-01

    Background Expressed Sequence Tag (EST) has been a cost-effective tool in molecular biology and represents an abundant valuable resource for genome annotation, gene expression, and comparative genomics in plants. Results In this study, we constructed a cDNA library of Prunus mume flower and fruit, sequenced 10,123 clones of the library, and obtained 8,656 expressed sequence tag (EST) sequences with high quality. The ESTs were assembled into 4,473 unigenes composed of 1,492 contigs and 2,981 singletons and that have been deposited in NCBI (accession IDs: GW868575 - GW873047), among which 1,294 unique ESTs were with known or putative functions. Furthermore, we found 1,233 putative simple sequence repeats (SSRs) in the P. mume unigene dataset. We randomly tested 42 pairs of PCR primers flanking potential SSRs, and 14 pairs were identified as true-to-type SSR loci and could amplify polymorphic bands from 20 individual plants of P. mume. We further used the 14 EST-SSR primer pairs to test the transferability on peach and plum. The result showed that nearly 89% of the primer pairs produced target PCR bands in the two species. A high level of marker polymorphism was observed in the plum species (65%) and low in the peach (46%), and the clustering analysis of the three species indicated that these SSR markers were useful in the evaluation of genetic relationships and diversity between and within the Prunus species. Conclusions We have constructed the first cDNA library of P. mume flower and fruit, and our data provide sets of molecular biology resources for P. mume and other Prunus species. These resources will be useful for further study such as genome annotation, new gene discovery, gene functional analysis, molecular breeding, evolution and comparative genomics between Prunus species. PMID:20626882

  6. Phylogenetic Tools for Generalized HIV-1 Epidemics: Findings from the PANGEA-HIV Methods Comparison

    PubMed Central

    Ratmann, Oliver; Hodcroft, Emma B.; Pickles, Michael; Cori, Anne; Hall, Matthew; Lycett, Samantha; Colijn, Caroline; Dearlove, Bethany; Didelot, Xavier; Frost, Simon; Hossain, A.S. Md Mukarram; Joy, Jeffrey B.; Kendall, Michelle; Kühnert, Denise; Leventhal, Gabriel E.; Liang, Richard; Plazzotta, Giacomo; Poon, Art F.Y.; Rasmussen, David A.; Stadler, Tanja; Volz, Erik; Weis, Caroline; Leigh Brown, Andrew J.; Fraser, Christophe

    2017-01-01

    Viral phylogenetic methods contribute to understanding how HIV spreads in populations, and thereby help guide the design of prevention interventions. So far, most analyses have been applied to well-sampled concentrated HIV-1 epidemics in wealthy countries. To direct the use of phylogenetic tools to where the impact of HIV-1 is greatest, the Phylogenetics And Networks for Generalized HIV Epidemics in Africa (PANGEA-HIV) consortium generates full-genome viral sequences from across sub-Saharan Africa. Analyzing these data presents new challenges, since epidemics are principally driven by heterosexual transmission and a smaller fraction of cases is sampled. Here, we show that viral phylogenetic tools can be adapted and used to estimate epidemiological quantities of central importance to HIV-1 prevention in sub-Saharan Africa. We used a community-wide methods comparison exercise on simulated data, where participants were blinded to the true dynamics they were inferring. Two distinct simulations captured generalized HIV-1 epidemics, before and after a large community-level intervention that reduced infection levels. Five research groups participated. Structured coalescent modeling approaches were most successful: phylogenetic estimates of HIV-1 incidence, incidence reductions, and the proportion of transmissions from individuals in their first 3 months of infection correlated with the true values (Pearson correlation > 90%), with small bias. However, on some simulations, true values were markedly outside reported confidence or credibility intervals. The blinded comparison revealed current limits and strengths in using HIV phylogenetics in challenging settings, provided benchmarks for future methods’ development, and supports using the latest generation of phylogenetic tools to advance HIV surveillance and prevention. PMID:28053012

  7. Acetylcholinesterase genes within the Diptera: takeover and loss in true flies

    PubMed Central

    Huchard, Elise; Martinez, Michel; Alout, Haoues; Douzery, Emmanuel J.P; Lutfalla, Georges; Berthomieu, Arnaud; Berticat, Claire; Raymond, Michel; Weill, Mylène

    2006-01-01

    It has recently been reported that the synaptic acetylcholinesterase (AChE) in mosquitoes is encoded by the ace-1 gene, distinct and divergent from the ace-2 gene, which performs this function in Drosophila. This is an unprecedented situation within the Diptera order because both ace genes derive from an old duplication and are present in most insects and arthropods. Nevertheless, Drosophila possesses only the ace-2 gene. Thus, a secondary loss occurred during the evolution of Diptera, implying a vital function switch from one gene (ace-1) to the other (ace-2). We sampled 78 species, representing 50 families (27% of the Dipteran families) spread over all major subdivisions of the Diptera, and looked for ace-1 and ace-2 by systematic PCR screening to determine which taxonomic groups within the Diptera have this gene change. We show that this loss probably extends to all true flies (or Cyclorrhapha), a large monophyletic group of the Diptera. We also show that ace-2 plays a non-detectable role in the synaptic AChE in a lower Diptera species, suggesting that it has non-synaptic functions. A relative molecular evolution rate test showed that the intensity of purifying selection on ace-2 sequences is constant across the Diptera, irrespective of the presence or absence of ace-1, confirming the evolutionary importance of non-synaptic functions for this gene. We discuss the evolutionary scenarios for the takeover of ace-2 and the loss of ace-1, taking into account our limited knowledge of non-synaptic functions of ace genes and some specific adaptations of true flies. PMID:17002944

  8. PolyaPeak: Detecting Transcription Factor Binding Sites from ChIP-seq Using Peak Shape Information

    PubMed Central

    Wu, Hao; Ji, Hongkai

    2014-01-01

    ChIP-seq is a powerful technology for detecting genomic regions where a protein of interest interacts with DNA. ChIP-seq data for mapping transcription factor binding sites (TFBSs) have a characteristic pattern: around each binding site, sequence reads aligned to the forward and reverse strands of the reference genome form two separate peaks shifted away from each other, and the true binding site is located in between these two peaks. While it has been shown previously that the accuracy and resolution of binding site detection can be improved by modeling the pattern, efficient methods are unavailable to fully utilize that information in TFBS detection procedure. We present PolyaPeak, a new method to improve TFBS detection by incorporating the peak shape information. PolyaPeak describes peak shapes using a flexible Pólya model. The shapes are automatically learnt from the data using Minorization-Maximization (MM) algorithm, then integrated with the read count information via a hierarchical model to distinguish true binding sites from background noises. Extensive real data analyses show that PolyaPeak is capable of robustly improving TFBS detection compared with existing methods. An R package is freely available. PMID:24608116

  9. Theory and implementation of a very high throughput true random number generator in field programmable gate array

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Wang, Yonggang, E-mail: wangyg@ustc.edu.cn; Hui, Cong; Liu, Chong

    The contribution of this paper is proposing a new entropy extraction mechanism based on sampling phase jitter in ring oscillators to make a high throughput true random number generator in a field programmable gate array (FPGA) practical. Starting from experimental observation and analysis of the entropy source in FPGA, a multi-phase sampling method is exploited to harvest the clock jitter with a maximum entropy and fast sampling speed. This parametrized design is implemented in a Xilinx Artix-7 FPGA, where the carry chains in the FPGA are explored to realize the precise phase shifting. The generator circuit is simple and resource-saving, so that multiple generation channels can run in parallel to scale the output throughput for specific applications. The prototype integrates 64 circuit units in the FPGA to provide a total output throughput of 7.68 Gbps, which meets the requirement of current high-speed quantum key distribution systems. The randomness evaluation, as well as its robustness to ambient temperature, confirms that the new method in a purely digital fashion can provide high-speed high-quality random bit sequences for a variety of embedded applications.

  10. Theory and implementation of a very high throughput true random number generator in field programmable gate array.

    PubMed

    Wang, Yonggang; Hui, Cong; Liu, Chong; Xu, Chao

    2016-04-01

    The contribution of this paper is proposing a new entropy extraction mechanism based on sampling phase jitter in ring oscillators to make a high throughput true random number generator in a field programmable gate array (FPGA) practical. Starting from experimental observation and analysis of the entropy source in FPGA, a multi-phase sampling method is exploited to harvest the clock jitter with a maximum entropy and fast sampling speed. This parametrized design is implemented in a Xilinx Artix-7 FPGA, where the carry chains in the FPGA are explored to realize the precise phase shifting. The generator circuit is simple and resource-saving, so that multiple generation channels can run in parallel to scale the output throughput for specific applications. The prototype integrates 64 circuit units in the FPGA to provide a total output throughput of 7.68 Gbps, which meets the requirement of current high-speed quantum key distribution systems. The randomness evaluation, as well as its robustness to ambient temperature, confirms that the new method in a purely digital fashion can provide high-speed high-quality random bit sequences for a variety of embedded applications.

  11. Identification of a novel PSR as the substrate of an SR protein kinase in the true slime mold.

    PubMed

    Zhang, Yong-Xia; Xing, Miao; Fei, Xuan; Zhang, Jian-Hua; Tian, Sheng-Li; Li, Ming-Hua; Liu, Shi-De

    2011-03-01

    Here, a novel cDNA encoding a serine/arginine (SR)-rich protein, designated PSR, was isolated from the true slime mold Physarum polycephalum and expressed in Escherichia coli. The deduced amino acid (aa) sequence reveals that PSR contains RS repeats at its C-terminus, similar to the conventional PSRPK substrate ASF/SF2. To study the novel protein, we generated a variety of mutant constructs by PCR and site-directed mutagenesis. Our analysis indicated that the purified recombinant PSR was phosphorylated by PSRPK in vitro and the SR-rich domain (amino acids 460-469) in the PSR protein was required for phosphorylation. In addition, removal of the docking motif (amino acids 424-450) from PSR significantly reduced the overall catalytic efficiency of the phosphorylation reaction. We also found that the conserved ATP-binding region (62)LGWGHFSTVWLAIDEKNGGREVALK(86) and the serine/threonine protein kinases active-site signature (184)IIHTDLKPENVLL(196) of PSRPK played a crucial role in substrate phosphorylation and Lys(86) and Asp(188) were crucial for PSRPK phosphorylation of PSR. These results suggest that PSR is a novel SR-related protein that is phosphorylated by PSRPK.

  12. Wireless Synchronization of a Multi-Pinhole Small Animal SPECT Collimation Device With a Clinical Scanner

    NASA Astrophysics Data System (ADS)

    DiFilippo, Frank P.; Patel, Sagar

    2009-06-01

    A multi-pinhole collimation device for small animal single photon emission computed tomography (SPECT) uses the gamma camera detectors of a standard clinical SPECT scanner. The collimator and animal bed move independently of the detectors, and therefore their motions must be synchronized. One approach is manual triggering of the SPECT acquisition simultaneously with a programmed motion sequence for the device. However, some data blurring and loss of image quality result, and true electronic synchronization is preferred. An off-the-shelf digital gyroscope with integrated Bluetooth interface provides a wireless solution to device synchronization. The sensor attaches to the SPECT gantry and reports its rotational speed to a notebook computer controlling the device. Software processes the rotation data in real-time, averaging the signal and issuing triggers while compensating for baseline drift. Motion commands are sent to the collimation device with minimal delay, within approximately 0.5 second of the start of SPECT gantry rotation. Test scans of a point source demonstrate an increase in true counts and a reduction in background counts compared to manual synchronization. The wireless rotation sensor provides robust synchronization of the collimation device with the clinical SPECT scanner and enhances image quality.

  13. Characterization and improvement of RNA-Seq precision in quantitative transcript expression profiling.

    PubMed

    Łabaj, Paweł P; Leparc, Germán G; Linggi, Bryan E; Markillie, Lye Meng; Wiley, H Steven; Kreil, David P

    2011-07-01

    Measurement precision determines the power of any analysis to reliably identify significant signals, such as in screens for differential expression, independent of whether the experimental design incorporates replicates or not. With the compilation of large-scale RNA-Seq datasets with technical replicate samples, however, we can now, for the first time, perform a systematic analysis of the precision of expression level estimates from massively parallel sequencing technology. This then allows considerations for its improvement by computational or experimental means. We report on a comprehensive study of target identification and measurement precision, including their dependence on transcript expression levels, read depth and other parameters. In particular, an impressive recall of 84% of the estimated true transcript population could be achieved with 331 million 50 bp reads, with diminishing returns from longer read lengths and even less gains from increased sequencing depths. Most of the measurement power (75%) is spent on only 7% of the known transcriptome, however, making less strongly expressed transcripts harder to measure. Consequently, <30% of all transcripts could be quantified reliably with a relative error<20%. Based on established tools, we then introduce a new approach for mapping and analysing sequencing reads that yields substantially improved performance in gene expression profiling, increasing the number of transcripts that can reliably be quantified to over 40%. Extrapolations to higher sequencing depths highlight the need for efficient complementary steps. In discussion we outline possible experimental and computational strategies for further improvements in quantification precision. rnaseq10@boku.ac.at

  14. Motion-compensated compressed sensing for dynamic imaging

    NASA Astrophysics Data System (ADS)

    Sundaresan, Rajagopalan; Kim, Yookyung; Nadar, Mariappan S.; Bilgin, Ali

    2010-08-01

    The recently introduced Compressed Sensing (CS) theory explains how sparse or compressible signals can be reconstructed from far fewer samples than what was previously believed possible. The CS theory has attracted significant attention for applications such as Magnetic Resonance Imaging (MRI) where long acquisition times have been problematic. This is especially true for dynamic MRI applications where high spatio-temporal resolution is needed. For example, in cardiac cine MRI, it is desirable to acquire the whole cardiac volume within a single breath-hold in order to avoid artifacts due to respiratory motion. Conventional MRI techniques do not allow reconstruction of high resolution image sequences from such limited amount of data. Vaswani et al. recently proposed an extension of the CS framework to problems with partially known support (i.e. sparsity pattern). In their work, the problem of recursive reconstruction of time sequences of sparse signals was considered. Under the assumption that the support of the signal changes slowly over time, they proposed using the support of the previous frame as the "known" part of the support for the current frame. While this approach works well for image sequences with little or no motion, motion causes significant change in support between adjacent frames. In this paper, we illustrate how motion estimation and compensation techniques can be used to reconstruct more accurate estimates of support for image sequences with substantial motion (such as cardiac MRI). Experimental results using phantoms as well as real MRI data sets illustrate the improved performance of the proposed technique.

  15. Isolation and characterization of full-length cDNA clones coding for cholinesterase from fetal human tissues.

    PubMed Central

    Prody, C A; Zevin-Sonkin, D; Gnatt, A; Goldberg, O; Soreq, H

    1987-01-01

    To study the primary structure and regulation of human cholinesterases, oligodeoxynucleotide probes were prepared according to a consensus peptide sequence present in the active site of both human serum pseudocholinesterase (BtChoEase; EC 3.1.1.8) and Torpedo electric organ "true" acetylcholinesterase (AcChoEase; EC 3.1.1.7). Using these probes, we isolated several cDNA clones from lambda gt10 libraries of fetal brain and liver origins. These include 2.4-kilobase cDNA clones that code for a polypeptide containing a putative signal peptide and the N-terminal, active site, and C-terminal peptides of human BtChoEase, suggesting that they code either for BtChoEase itself or for a very similar but distinct fetal form of cholinesterase. In RNA blots of poly(A)+ RNA from the cholinesterase-producing fetal brain and liver, these cDNAs hybridized with a single 2.5-kilobase band. Blot hybridization to human genomic DNA revealed that these fetal BtChoEase cDNA clones hybridize with DNA fragments of the total length of 17.5 kilobases, and signal intensities indicated that these sequences are not present in many copies. Both the cDNA-encoded protein and its nucleotide sequence display striking homology to parallel sequences published for Torpedo AcChoEase. These findings demonstrate extensive homologies between the fetal BtChoEase encoded by these clones and other cholinesterases of various forms and species. PMID:3035536

  16. Nucleic Acid Extraction from Synthetic Mars Analog Soils for in situ Life Detection

    NASA Astrophysics Data System (ADS)

    Mojarro, Angel; Ruvkun, Gary; Zuber, Maria T.; Carr, Christopher E.

    2017-08-01

    Biological informational polymers such as nucleic acids have the potential to provide unambiguous evidence of life beyond Earth. To this end, we are developing an automated in situ life-detection instrument that integrates nucleic acid extraction and nanopore sequencing: the Search for Extra-Terrestrial Genomes (SETG) instrument. Our goal is to isolate and determine the sequence of nucleic acids from extant or preserved life on Mars, if, for example, there is common ancestry to life on Mars and Earth. As is true of metagenomic analysis of terrestrial environmental samples, the SETG instrument must isolate nucleic acids from crude samples and then determine the DNA sequence of the unknown nucleic acids. Our initial DNA extraction experiments resulted in low to undetectable amounts of DNA due to soil chemistry-dependent soil-DNA interactions, namely adsorption to mineral surfaces, binding to divalent/trivalent cations, destruction by iron redox cycling, and acidic conditions. Subsequently, we developed soil-specific extraction protocols that increase DNA yields through a combination of desalting, utilization of competitive binders, and promotion of anaerobic conditions. Our results suggest that a combination of desalting and utilizing competitive binders may establish a "universal" nucleic acid extraction protocol suitable for analyzing samples from diverse soils on Mars.

  17. Identifying transposon insertions and their effects from RNA-sequencing data.

    PubMed

    de Ruiter, Julian R; Kas, Sjors M; Schut, Eva; Adams, David J; Koudijs, Marco J; Wessels, Lodewyk F A; Jonkers, Jos

    2017-07-07

    Insertional mutagenesis using engineered transposons is a potent forward genetic screening technique used to identify cancer genes in mouse model systems. In the analysis of these screens, transposon insertion sites are typically identified by targeted DNA-sequencing and subsequently assigned to predicted target genes using heuristics. As such, these approaches provide no direct evidence that insertions actually affect their predicted targets or how transcripts of these genes are affected. To address this, we developed IM-Fusion, an approach that identifies insertion sites from gene-transposon fusions in standard single- and paired-end RNA-sequencing data. We demonstrate IM-Fusion on two separate transposon screens of 123 mammary tumors and 20 B-cell acute lymphoblastic leukemias, respectively. We show that IM-Fusion accurately identifies transposon insertions and their true target genes. Furthermore, by combining the identified insertion sites with expression quantification, we show that we can determine the effect of a transposon insertion on its target gene(s) and prioritize insertions that have a significant effect on expression. We expect that IM-Fusion will significantly enhance the accuracy of cancer gene discovery in forward genetic screens and provide initial insight into the biological effects of insertions on candidate cancer genes. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

  18. Leveraging the Power of High Performance Computing for Next Generation Sequencing Data Analysis: Tricks and Twists from a High Throughput Exome Workflow

    PubMed Central

    Wonczak, Stephan; Thiele, Holger; Nieroda, Lech; Jabbari, Kamel; Borowski, Stefan; Sinha, Vishal; Gunia, Wilfried; Lang, Ulrich; Achter, Viktor; Nürnberg, Peter

    2015-01-01

    Next generation sequencing (NGS) has been a great success and is now a standard method of research in the life sciences. With this technology, dozens of whole genomes or hundreds of exomes can be sequenced in rather short time, producing huge amounts of data. Complex bioinformatics analyses are required to turn these data into scientific findings. In order to run these analyses fast, automated workflows implemented on high performance computers are state of the art. While providing sufficient compute power and storage to meet the NGS data challenge, high performance computing (HPC) systems require special care when utilized for high throughput processing. This is especially true if the HPC system is shared by different users. Here, stability, robustness and maintainability are as important for automated workflows as speed and throughput. To achieve all of these aims, dedicated solutions have to be developed. In this paper, we present the tricks and twists that we utilized in the implementation of our exome data processing workflow. It may serve as a guideline for other high throughput data analysis projects using a similar infrastructure. The code implementing our solutions is provided in the supporting information files. PMID:25942438

  19. REDO: RNA Editing Detection in Plant Organelles Based on Variant Calling Results.

    PubMed

    Wu, Shuangyang; Liu, Wanfei; Aljohi, Hasan Awad; Alromaih, Sarah A; Alanazi, Ibrahim O; Lin, Qiang; Yu, Jun; Hu, Songnian

    2018-05-01

    RNA editing is a post-transcriptional or cotranscriptional process that changes the sequence of the precursor transcript by substitutions, insertions, or deletions. Almost all of the land plants undergo RNA editing in organelles (plastids and mitochondria). Although several software tools have been developed to identify RNA editing events, it remains a great challenge to distinguish true RNA editing events from genome variation, sequencing errors, and other factors. Here we introduce REDO, a comprehensive application tool for identifying RNA editing events in plant organelles based on variant call format files from RNA-sequencing data. REDO is a suite of Perl scripts that illustrate a range of attributes of RNA editing events in figures and tables. REDO can also detect RNA editing events in multiple samples simultaneously and identify the significant differential proportion of RNA editing loci. Compared with similar tools such as REDItools, REDO runs faster, with higher accuracy and specificity at the cost of slightly lower sensitivity. Moreover, REDO annotates each RNA editing site in RNAs, whereas REDItools reports only possible RNA editing sites in the genome, which require additional steps to obtain RNA editing profiles for RNAs. Overall, REDO can identify potential RNA editing sites easily and provide several functions such as detailed annotations, statistics, figures, and significantly differential proportion of RNA editing sites among different samples.

  20. Role of serial order in the impact of talker variability on short-term memory: testing a perceptual organization-based account.

    PubMed

    Hughes, Robert W; Marsh, John E; Jones, Dylan M

    2011-11-01

    In two experiments, we examined the impact of the degree of match between sequential auditory perceptual organization processes and the demands of a short-term memory task (memory for order vs. item information). When a spoken sequence of digits was presented so as to promote its perceptual partitioning into two distinct streams by conveying it in alternating female (F) and male (M) voices (FMFMFMFM)--thereby disturbing the perception of true temporal order--recall of item order was greatly impaired (as compared to recall of item identity). Moreover, an order error type consistent with the formation of voice-based streams was committed more quickly in the alternating-voice condition (Exp. 1). In contrast, when the perceptual organization of the sequence mapped well onto an optimal two-group serial rehearsal strategy--by presenting the two voices in discrete clusters (FFFFMMMM)--order, but not item, recall was enhanced (Exp. 2). The results are consistent with the view that the degree of compatibility between perceptual and deliberate sequencing processes is a key determinant of serial short-term memory performance. Alternative accounts of talker variability effects in short-term memory, based on the concept of a dedicated phonological short-term store and a capacity-limited focus of attention, are also reviewed.

  1. Nucleic Acid Extraction from Synthetic Mars Analog Soils for in situ Life Detection.

    PubMed

    Mojarro, Angel; Ruvkun, Gary; Zuber, Maria T; Carr, Christopher E

    2017-08-01

    Biological informational polymers such as nucleic acids have the potential to provide unambiguous evidence of life beyond Earth. To this end, we are developing an automated in situ life-detection instrument that integrates nucleic acid extraction and nanopore sequencing: the Search for Extra-Terrestrial Genomes (SETG) instrument. Our goal is to isolate and determine the sequence of nucleic acids from extant or preserved life on Mars, if, for example, there is common ancestry to life on Mars and Earth. As is true of metagenomic analysis of terrestrial environmental samples, the SETG instrument must isolate nucleic acids from crude samples and then determine the DNA sequence of the unknown nucleic acids. Our initial DNA extraction experiments resulted in low to undetectable amounts of DNA due to soil chemistry-dependent soil-DNA interactions, namely adsorption to mineral surfaces, binding to divalent/trivalent cations, destruction by iron redox cycling, and acidic conditions. Subsequently, we developed soil-specific extraction protocols that increase DNA yields through a combination of desalting, utilization of competitive binders, and promotion of anaerobic conditions. Our results suggest that a combination of desalting and utilizing competitive binders may establish a "universal" nucleic acid extraction protocol suitable for analyzing samples from diverse soils on Mars. Key Words: Life-detection instruments-Nucleic acids-Mars-Panspermia. Astrobiology 17, 747-760.

  2. DNA barcoding reveal patterns of species diversity among northwestern Pacific molluscs

    PubMed Central

    Sun, Shao’e; Li, Qi; Kong, Lingfeng; Yu, Hong; Zheng, Xiaodong; Yu, Ruihai; Dai, Lina; Sun, Yan; Chen, Jun; Liu, Jun; Ni, Lehai; Feng, Yanwei; Yu, Zhenzhen; Zou, Shanmei; Lin, Jiping

    2016-01-01

    This study represents the first comprehensive molecular assessment of northwestern Pacific molluscs. In total, 2801 DNA barcodes belonging to 569 species from China, Japan and Korea were analyzed. An overlap between intra- and interspecific genetic distances was present in 71 species. We tested the efficacy of this library by simulating a sequence-based specimen identification scenario using Best Match (BM), Best Close Match (BCM) and All Species Barcode (ASB) criteria with three threshold values. The BM approach returned 89.15% true identifications (95.27% when excluding singletons). The highest success rate of congruent identifications was obtained with BCM at a threshold of 0.053. The analysis of our barcode library together with public data resulted in 582 Barcode Index Numbers (BINs), 72.2% of which were concordant with morphology-based identifications. The discrepancies were divided into two groups: sequences from different species clustered in a single BIN, and conspecific sequences divided into more than one BIN. In the Neighbour-Joining phenogram, 2,320 (83.0%) queries formed 355 (62.4%) species-specific barcode clusters allowing their successful identification. 33 species showed paraphyly and haplotype sharing. 62 cases are represented by deeply diverged lineages. This study suggests an increased species diversity in this region, highlighting the need for taxonomic revision and conservation strategies for the cryptic complexes. PMID:27640675
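
    As an illustration of the Best Match and Best Close Match criteria named in this abstract, the following is a minimal sketch, not the authors' pipeline. It assumes uncorrected p-distances on pre-aligned, equal-length sequences (the study may well have used a different distance model); the function names and the toy reference library are hypothetical. Only the 0.053 threshold is taken from the abstract.

```python
def p_distance(a, b):
    """Proportion of differing sites between two aligned, equal-length sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    return sum(x != y for x, y in zip(a, b)) / len(a)


def best_match(query, library):
    """Best Match: take the species of the closest reference barcode.

    Returns (label, distance). The label is 'ambiguous' when equally close
    barcodes belong to more than one species.
    """
    dists = [(p_distance(query, seq), species) for species, seq in library]
    best = min(d for d, _ in dists)
    species = {s for d, s in dists if d == best}
    label = species.pop() if len(species) == 1 else "ambiguous"
    return label, best


def best_close_match(query, library, threshold):
    """Best Close Match: as Best Match, but reject hits beyond the threshold."""
    label, best = best_match(query, library)
    return (label if best <= threshold else "no match"), best


# Hypothetical toy reference library of (species, aligned barcode) pairs.
library = [
    ("species_A", "ACGTACGTAC"),
    ("species_A", "ACGTACGTAT"),
    ("species_B", "TTGTACGGAC"),
]

print(best_match("ACGTACGTAA", library))
print(best_close_match("ACGTACGTAA", library, threshold=0.053))
```

    In this toy case the query is closest to species_A under Best Match, but its distance exceeds the threshold, so Best Close Match reports no match. That is the behaviour that distinguishes the two criteria.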

  3. COPS: A Sensitive and Accurate Tool for Detecting Somatic Copy Number Alterations Using Short-Read Sequence Data from Paired Samples

    PubMed Central

    Krishnan, Neeraja M.; Gaur, Prakhar; Chaudhary, Rakshit; Rao, Arjun A.; Panda, Binay

    2012-01-01

    Copy Number Alterations (CNAs), such as deletions and duplications, compose a larger percentage of genetic variations than single nucleotide polymorphisms or other structural variations in cancer genomes that undergo major chromosomal re-arrangements. It is, therefore, imperative to identify cancer-specific somatic copy number alterations (SCNAs), with respect to matched normal tissue, in order to understand their association with the disease. We have devised an accurate, sensitive, and easy-to-use tool, COPS, COpy number using Paired Samples, for detecting SCNAs. We rigorously tested the performance of COPS using short sequence simulated reads at various sizes and coverage of SCNAs, read depths, read lengths and also with real tumor:normal paired samples. We found COPS to perform better in comparison to other known SCNA detection tools for all evaluated parameters, namely, sensitivity (detection of true positives), specificity (exclusion of false positives) and size accuracy. COPS performed well for sequencing reads of all lengths when used with most upstream read alignment tools. Additionally, by incorporating a downstream boundary segmentation detection tool, the accuracy of SCNA boundaries was further improved. Here, we report an accurate, sensitive and easy-to-use tool for detecting cancer-specific SCNAs using short-read sequence data. In addition to cancer, COPS can be used for any disease as long as sequence reads from both disease and normal samples from the same individual are available. An added boundary segmentation detection module makes COPS-detected SCNA boundaries more specific for the samples studied. COPS is available at ftp://115.119.160.213 with username “cops” and password “cops”. PMID:23110103
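
    The sensitivity and specificity used to benchmark a caller on simulated data can be computed as in the sketch below. This is not the COPS evaluation code: it assumes the genome is divided into fixed bins and that both the simulated (true) and the called SCNAs are given as sets of bin indices. The function name and the numbers are made up for illustration.

```python
def evaluate_calls(truth, called, n_bins):
    """Compare called altered bins against simulated (true) altered bins.

    truth, called: sets of bin indices; n_bins: total number of bins.
    Assumes at least one altered and one unaltered bin in the truth set.
    """
    tp = len(truth & called)       # altered bins correctly called
    fp = len(called - truth)       # unaltered bins called as altered
    fn = len(truth - called)       # altered bins that were missed
    tn = n_bins - tp - fp - fn     # unaltered bins correctly left alone
    return {
        "sensitivity": tp / (tp + fn),
        "specificity": tn / (tn + fp),
    }


# Made-up example: 20 bins, 4 truly altered, 4 called (3 of them correct).
result = evaluate_calls(truth={2, 3, 4, 10}, called={3, 4, 10, 15}, n_bins=20)
print(result)
```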

  4. Resolution of a Protracted Serogroup B Meningococcal Outbreak with Whole-Genome Sequencing Shows Interspecies Genetic Transfer

    PubMed Central

    Brehony, Carina; O'Connor, Lois; Meyler, Kenneth; Jolley, Keith A.; Bray, James; Bennett, Desiree; Maiden, Martin C. J.; Cunney, Robert

    2016-01-01

    A carriage study was undertaken (n = 112) to ascertain the prevalence of Neisseria spp. following the eighth case of invasive meningococcal disease in young children (5 to 46 months) and members of a large extended indigenous ethnic minority Traveller family (n = 123), typically associated with high-occupancy living conditions. Nested multilocus sequence typing (MLST) was employed for case specimen extracts. Isolates were genome sequenced and then were assembled de novo and deposited into the Bacterial Isolate Genome Sequencing Database (BIGSdb). This facilitated an expanded MLST approach utilizing large numbers of loci for isolate characterization and discrimination. A rare sequence type, ST-6697, predominated in disease specimens and isolates that were carried (n = 8/14), persisting for at least 44 months, likely driven by the high population density of houses (n = 67/112) and trailers (n = 45/112). Carriage for Neisseria meningitidis (P < 0.05) and Neisseria lactamica (P < 0.002) (2-sided Fisher's exact test) was more likely in the smaller, more densely populated trailers. Meningococcal carriage was highest in 24- to 39-year-olds (45%, n = 9/20). Evidence of horizontal gene transfer (HGT) was observed in four individuals cocolonized by Neisseria lactamica and Neisseria meningitidis. One HGT event resulted in the acquisition of 26 consecutive N. lactamica alleles. This study demonstrates how housing density can drive meningococcal transmission and carriage, which likely facilitated the persistence of ST-6697 and prolonged the outbreak. Whole-genome MLST effectively distinguished between highly similar outbreak strain isolates, including those isolated from person-to-person transmission, and also highlighted how a few HGT events can distort the true phylogenetic relationship between highly similar clonal isolates. PMID:27629899

  5. XLID-Causing Mutations and Associated Genes Challenged in Light of Data From Large-Scale Human Exome Sequencing

    PubMed Central

    Piton, Amélie; Redin, Claire; Mandel, Jean-Louis

    2013-01-01

    Because of the unbalanced sex ratio (1.3–1.4 to 1) observed in intellectual disability (ID) and the identification of large ID-affected families showing X-linked segregation, much attention has been focused on the genetics of X-linked ID (XLID). Mutations causing monogenic XLID have now been reported in over 100 genes, most of which are commonly included in XLID diagnostic gene panels. Nonetheless, the boundary between true mutations and rare non-disease-causing variants often remains elusive. The sequencing of a large number of control X chromosomes, required for avoiding false-positive results, was not systematically possible in the past. Such information is now available thanks to large-scale sequencing projects such as the National Heart, Lung, and Blood (NHLBI) Exome Sequencing Project, which provides variation information on 10,563 X chromosomes from the general population. We used this NHLBI cohort to systematically reassess the implication of 106 genes proposed to be involved in monogenic forms of XLID. We particularly question the implication in XLID of ten of them (AGTR2, MAGT1, ZNF674, SRPX2, ATP6AP2, ARHGEF6, NXF5, ZCCHC12, ZNF41, and ZNF81), in which truncating variants or previously published mutations are observed at a relatively high frequency within this cohort. We also highlight 15 other genes (CCDC22, CLIC2, CNKSR2, FRMPD4, HCFC1, IGBP1, KIAA2022, KLF8, MAOA, NAA10, NLGN3, RPL10, SHROOM4, ZDHHC15, and ZNF261) for which replication studies are warranted. We propose that similar reassessment of reported mutations (and genes) with the use of data from large-scale human exome sequencing would be relevant for a wide range of other genetic diseases. PMID:23871722

  6. Automated DNA mutation detection using universal conditions direct sequencing: application to ten muscular dystrophy genes

    PubMed Central

    2009-01-01

    Background: One of the most common and efficient methods for detecting mutations in genes is PCR amplification followed by direct sequencing. Until recently, the process of designing PCR assays has been to focus on individual assay parameters rather than concentrating on matching conditions for a set of assays. Primers for each individual assay were selected based on location and sequence concerns. The two primer sequences were then iteratively adjusted to make the individual assays work properly. This generally resulted in groups of assays with different annealing temperatures that required the use of multiple thermal cyclers or multiple passes in a single thermal cycler making diagnostic testing time-consuming, laborious and expensive. These factors have severely hampered diagnostic testing services, leaving many families without an answer for the exact cause of a familial genetic disease. A search of GeneTests for sequencing analysis of the entire coding sequence for genes that are known to cause muscular dystrophies returns only a small list of laboratories that perform comprehensive gene panels. The hypothesis for the study was that a complete set of universal assays can be designed to amplify and sequence any gene or family of genes using computer aided design tools. If true, this would allow automation and optimization of the mutation detection process resulting in reduced cost and increased throughput. Results: An automated process has been developed for the detection of deletions, duplications/insertions and point mutations in any gene or family of genes and has been applied to ten genes known to bear mutations that cause muscular dystrophy: DMD; CAV3; CAPN3; FKRP; TRIM32; LMNA; SGCA; SGCB; SGCG; SGCD. Using this process, mutations have been found in five DMD patients and four LGMD patients (one in the FKRP gene, one in the CAV3 gene, and two likely causative heterozygous pairs of variations in the CAPN3 gene of two other patients). Methods and assay sequences are reported in this paper. Conclusion: This automated process allows laboratories to discover DNA variations in a short time and at low cost. PMID:19835634

  7. Automated DNA mutation detection using universal conditions direct sequencing: application to ten muscular dystrophy genes.

    PubMed

    Bennett, Richard R; Schneider, Hal E; Estrella, Elicia; Burgess, Stephanie; Cheng, Andrew S; Barrett, Caitlin; Lip, Va; Lai, Poh San; Shen, Yiping; Wu, Bai-Lin; Darras, Basil T; Beggs, Alan H; Kunkel, Louis M

    2009-10-18

    One of the most common and efficient methods for detecting mutations in genes is PCR amplification followed by direct sequencing. Until recently, the process of designing PCR assays has been to focus on individual assay parameters rather than concentrating on matching conditions for a set of assays. Primers for each individual assay were selected based on location and sequence concerns. The two primer sequences were then iteratively adjusted to make the individual assays work properly. This generally resulted in groups of assays with different annealing temperatures that required the use of multiple thermal cyclers or multiple passes in a single thermal cycler making diagnostic testing time-consuming, laborious and expensive. These factors have severely hampered diagnostic testing services, leaving many families without an answer for the exact cause of a familial genetic disease. A search of GeneTests for sequencing analysis of the entire coding sequence for genes that are known to cause muscular dystrophies returns only a small list of laboratories that perform comprehensive gene panels. The hypothesis for the study was that a complete set of universal assays can be designed to amplify and sequence any gene or family of genes using computer aided design tools. If true, this would allow automation and optimization of the mutation detection process resulting in reduced cost and increased throughput. An automated process has been developed for the detection of deletions, duplications/insertions and point mutations in any gene or family of genes and has been applied to ten genes known to bear mutations that cause muscular dystrophy: DMD; CAV3; CAPN3; FKRP; TRIM32; LMNA; SGCA; SGCB; SGCG; SGCD. Using this process, mutations have been found in five DMD patients and four LGMD patients (one in the FKRP gene, one in the CAV3 gene, and two likely causative heterozygous pairs of variations in the CAPN3 gene of two other patients). Methods and assay sequences are reported in this paper. This automated process allows laboratories to discover DNA variations in a short time and at low cost.

  8. The Evolution of Cataclysmic Variables as Revealed by Their Donor Stars

    NASA Astrophysics Data System (ADS)

    Knigge, Christian; Baraffe, Isabelle; Patterson, Joseph

    2011-06-01

    We present an attempt to reconstruct the complete evolutionary path followed by cataclysmic variables (CVs), based on the observed mass-radius relationship of their donor stars. Along the way, we update the semi-empirical CV donor sequence presented previously by one of us, present a comprehensive review of the connection between CV evolution and the secondary stars in these systems, and reexamine most of the commonly used magnetic braking (MB) recipes, finding that even conceptually similar ones can differ greatly in both magnitude and functional form. The great advantage of using donor radii to infer mass-transfer and angular-momentum-loss (AML) rates is that they sample the longest accessible timescales and are most likely to represent the true secular (evolutionary average) rates. We show explicitly that if CVs exhibit long-term mass-transfer-rate fluctuations, as is often assumed, the expected variability timescales are so long that other tracers of the mass-transfer rate, including white dwarf (WD) temperatures, become unreliable. We carefully explore how much of the radius difference between CV donors and models of isolated main-sequence stars may be due to mechanisms other than mass loss. The tidal and rotational deformation of Roche-lobe-filling stars produces ≃4.5% radius inflation below the period gap and ≃7.9% above. A comparison of stellar models to mass-radius data for non-interacting stars suggests a real offset of ≃1.5% for fully convective stars (i.e., donors below the gap) and ≃4.9% for partially radiative ones (donors above the gap). We also show that donor bloating due to irradiation is probably smaller than, and at most comparable to, these effects. After calibrating our models to account for these issues, we fit self-consistent evolution sequences to our compilation of donor masses and radii. In the standard model of CV evolution, AMLs below the period gap are assumed to be driven solely by gravitational radiation (GR), while AMLs above the gap are usually described by an MB law first suggested by Rappaport et al. We adopt simple scaled versions of these AML recipes and find that these are able to match the data quite well. The optimal scaling factors turn out to be f_GR = 2.47 ± 0.22 below the gap and f_MB = 0.66 ± 0.05 above (the errors here are purely statistical, and the standard model corresponds to f_GR = f_MB = 1). This revised model describes the mass-radius data significantly better than the standard model. Some of the most important implications and applications of our results are as follows. (1) The revised evolution sequence yields correct locations for the minimum period and the upper edge of the period gap; the standard sequence does not. (2) The observed spectral types of CV donors are compatible with both standard and revised models. (3) A direct comparison of predicted and observed WD temperatures suggests an even higher value for f_GR, but this comparison is sensitive to the assumed mean WD mass and the possible existence of mass-transfer-rate fluctuations. (4) The predicted absolute magnitudes of donor stars in the near-infrared form a lower envelope around the observed absolute magnitudes for systems with parallax distances. This is true for all of our sequences, so any of them can be used to set firm lower limits on (or obtain rough estimates of) the distances toward CVs based only on P_orb and single epoch near-IR measurements. (5) Both standard and revised sequences predict that short-period CVs should be susceptible to dwarf nova (DN) eruptions, consistent with observations. However, both sequences also predict that the fraction of DNe among long-period CVs should decline with P_orb above the period gap. Observations suggest the opposite behavior, and we discuss the possible explanations for this discrepancy. (6) Approximate orbital period distributions constructed from our evolution sequences suggest that the ratio of long-period CVs to short-period, pre-bounce CVs is about 3× higher for the revised sequence than the standard one. This may resolve a long-standing problem in CV evolution. Tables describing our donor and evolution sequences are provided in electronically readable form.

  9. Fungal endophytes in germinated seeds of the common bean, Phaseolus vulgaris

    PubMed Central

    Parsa, Soroush; García-Lemos, Adriana M.; Castillo, Katherine; Ortiz, Viviana; López-Lavalle, Luis Augusto Becerra; Braun, Jerome; Vega, Fernando E.

    2016-01-01

    We conducted a survey of fungal endophytes in 582 germinated seeds belonging to 11 Colombian cultivars of the common bean (Phaseolus vulgaris). The survey yielded 394 endophytic isolates belonging to 42 taxa, as identified by sequence analysis of the ribosomal DNA internal transcribed spacer (ITS) region. Aureobasidium pullulans was the dominant endophyte, isolated from 46.7 % of the samples. Also common were Fusarium oxysporum, Xylaria sp., and Cladosporium cladosporioides, but found in only 13.4 %, 11.7 %, and 7.6 % of seedlings, respectively. Endophytic colonization differed significantly among common bean cultivars and seedling parts, with the highest colonization occurring in the first true leaves of the seedlings. PMID:27109374

  10. Graph mining for next generation sequencing: leveraging the assembly graph for biological insights.

    PubMed

    Warnke-Sommer, Julia; Ali, Hesham

    2016-05-06

    The assembly of Next Generation Sequencing (NGS) reads remains a challenging task. This is especially true for the assembly of metagenomics data that originate from environmental samples potentially containing hundreds to thousands of unique species. The principal objective of current assembly tools is to assemble NGS reads into contiguous stretches of sequence called contigs while maximizing for both accuracy and contig length. The end goal of this process is to produce longer contigs with the major focus being on assembly only. Sequence read assembly is an aggregative process, during which read overlap relationship information is lost as reads are merged into longer sequences or contigs. The assembly graph is information rich and capable of capturing the genomic architecture of an input read data set. We have developed a novel hybrid graph in which nodes represent sequence regions at different levels of granularity. This model, utilized in the assembly and analysis pipeline Focus, presents a concise yet feature rich view of a given input data set, allowing for the extraction of biologically relevant graph structures for graph mining purposes. Focus was used to create hybrid graphs to model metagenomics data sets obtained from the gut microbiomes of five individuals with Crohn's disease and eight healthy individuals. Repetitive and mobile genetic elements are found to be associated with hybrid graph structure. Using graph mining techniques, a comparative study of the Crohn's disease and healthy data sets was conducted with focus on antibiotics resistance genes associated with transposase genes. Results demonstrated significant differences in the phylogenetic distribution of categories of antibiotics resistance genes in the healthy and diseased patients. Focus was also evaluated as a pure assembly tool and produced excellent results when compared against the Meta-velvet, Omega, and UD-IDBA assemblers. Mining the hybrid graph can reveal biological phenomena captured by its structure. We demonstrate the advantages of considering assembly graphs as data-mining support in addition to their role as frameworks for assembly.

  11. Rapid-Onset Obesity with Hypothalamic Dysfunction, Hypoventilation, and Autonomic Dysregulation (ROHHAD): exome sequencing of trios, monozygotic twins and tumours.

    PubMed

    Barclay, Sarah F; Rand, Casey M; Borch, Lauren A; Nguyen, Lisa; Gray, Paul A; Gibson, William T; Wilson, Richard J A; Gordon, Paul M K; Aung, Zaw; Berry-Kravis, Elizabeth M; Ize-Ludlow, Diego; Weese-Mayer, Debra E; Bech-Hansen, N Torben

    2015-08-25

    Rapid-onset Obesity with Hypothalamic Dysfunction, Hypoventilation, and Autonomic Dysregulation (ROHHAD) is thought to be a genetic disease caused by de novo mutations, though causative mutations have yet to be identified. We searched for de novo coding mutations among a carefully-diagnosed and clinically homogeneous cohort of 35 ROHHAD patients. We sequenced the exomes of seven ROHHAD trios, plus tumours from four of these patients and the unaffected monozygotic (MZ) twin of one (discovery cohort), to identify constitutional and somatic de novo sequence variants. We further analyzed this exome data to search for candidate genes under autosomal dominant and recessive models, and to identify structural variations. Candidate genes were tested by exome or Sanger sequencing in a replication cohort of 28 ROHHAD singletons. The analysis of the trio-based exomes found 13 de novo variants. However, no two patients had de novo variants in the same gene, and additional patient exomes and mutation analysis in the replication cohort did not provide strong genetic evidence to implicate any of these sequence variants in ROHHAD. Somatic comparisons revealed no coding differences between any blood and tumour samples, or between the two discordant MZ twins. Neither autosomal dominant nor recessive analysis yielded candidate genes for ROHHAD, and we did not identify any potentially causative structural variations. Clinical exome sequencing is highly unlikely to be a useful diagnostic test in patients with true ROHHAD. As ROHHAD has a high risk for fatality if not properly managed, it remains imperative to expand the search for non-exomic genetic risk factors, as well as to investigate other possible mechanisms of disease. In so doing, we will be able to confirm objectively the ROHHAD diagnosis and to contribute to our understanding of obesity, respiratory control, hypothalamic function, and autonomic regulation.

  12. Parkin dosage mutations have greater pathogenicity in familial PD than simple sequence mutations

    PubMed Central

    Pankratz, N; Kissell, D K.; Pauciulo, M W.; Halter, C A.; Rudolph, A; Pfeiffer, R F.; Marder, K S.; Foroud, T; Nichols, W C.

    2009-01-01

    Objective: Mutations in both alleles of parkin have been shown to result in Parkinson disease (PD). However, it is unclear whether haploinsufficiency (presence of a mutation in only 1 of the 2 parkin alleles) increases the risk for PD. Methods: We performed comprehensive dosage and sequence analysis of all 12 exons of parkin in a sample of 520 independent patients with familial PD and 263 controls. We evaluated whether presence of a single parkin mutation, either a sequence (point mutation or small insertion/deletion) or dosage (whole exon deletion or duplication) mutation, was found at increased frequency in cases as compared with controls. We then compared the clinical characteristics of cases with 0, 1, or 2 parkin mutations. Results: We identified 55 independent patients with PD with at least 1 parkin mutation and 9 controls with a single sequence mutation. Cases and controls had a similar frequency of single sequence mutations (3.1% vs 3.4%, p = 0.83); however, the cases had a significantly higher rate of dosage mutations (2.6% vs 0%, p = 0.009). Cases with a single dosage mutation were more likely to have an earlier age at onset (50% with onset at ≤45 years) compared with those with no parkin mutations (10%, p = 0.00002); this was not true for cases with only a single sequence mutation (25% with onset at ≤45 years, p = 0.06). Conclusions: Parkin haploinsufficiency, specifically for a dosage mutation rather than a point mutation or small insertion/deletion, is a risk factor for familial PD and may be associated with earlier age at onset. GLOSSARY ADL = Activities of Daily Living; GDS = Geriatric Depression Scale; MLPA = multiplex ligation-dependent probe amplification; MMSE = Mini-Mental State Examination; PD = Parkinson disease; UPDRS = Unified Parkinson’s Disease Rating Scale. PMID:19636047

  13. The 3of5 web application for complex and comprehensive pattern matching in protein sequences.

    PubMed

    Seiler, Markus; Mehrle, Alexander; Poustka, Annemarie; Wiemann, Stefan

    2006-03-16

    The identification of patterns in biological sequences is a key challenge in genome analysis and in proteomics. Frequently such patterns are complex and highly variable, especially in protein sequences. They are frequently described using terms of regular expressions (RegEx) because of the user-friendly terminology. Limitations arise for queries with the increasing complexity of patterns and are accompanied by requirements for enhanced capabilities. This is especially true for patterns containing ambiguous characters and positions and/or length ambiguities. We have implemented the 3of5 web application in order to enable complex pattern matching in protein sequences. 3of5 is named after a special use of its main feature, the novel n-of-m pattern type. This feature allows for an extensive specification of variable patterns where the individual elements may vary in their position, order, and content within a defined stretch of sequence. The number of distinct elements can be constrained by operators, and individual characters may be excluded. The n-of-m pattern type can be combined with common regular expression terms and thus also allows for a comprehensive description of complex patterns. 3of5 increases the fidelity of pattern matching and finds ALL possible solutions in protein sequences in cases of length-ambiguous patterns instead of simply reporting the longest or shortest hits. Grouping and combined search for patterns provides a hierarchical arrangement of larger pattern sets. The algorithm is implemented as an internet application and is freely accessible. The application is available at http://dkfz.de/mga2/3of5/3of5.html. The 3of5 application offers an extended vocabulary for the definition of search patterns and thus allows the user to comprehensively specify and identify peptide patterns with variable elements. The n-of-m pattern type offers an improved accuracy for pattern matching in combination with the ability to find all solutions, without compromising the user friendliness of regular expression terms.
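
    The n-of-m idea described above can be illustrated with a minimal sketch. It assumes one simplified reading of the pattern type: report every window of m residues in which at least n residues belong to an allowed set. The function name `n_of_m_hits`, its parameters, and the example sequence are hypothetical and are not taken from the 3of5 application or its query syntax.

```python
def n_of_m_hits(sequence, allowed, n, m):
    """Return (start, window) for every length-m window of `sequence`
    that contains at least n residues drawn from `allowed`.

    Hypothetical helper: a simplified reading of an n-of-m pattern,
    not the 3of5 implementation.
    """
    hits = []
    for start in range(len(sequence) - m + 1):
        window = sequence[start:start + m]
        # Count residues in this window that belong to the allowed set.
        count = sum(1 for residue in window if residue in allowed)
        if count >= n:
            hits.append((start, window))
    return hits


# Example: at least 3 basic residues (K or R) within any 5-residue stretch.
hits = n_of_m_hits("MKRAKLRSAAKK", allowed=set("KR"), n=3, m=5)
for start, window in hits:
    print(start, window)
```

    Every qualifying window is reported, including overlapping ones, which loosely mirrors the stated goal of returning all solutions and not only the longest or shortest hit.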

  14. Three-input majority logic gate and multiple input logic circuit based on DNA strand displacement.

    PubMed

    Li, Wei; Yang, Yang; Yan, Hao; Liu, Yan

    2013-06-12

    In biomolecular programming, the properties of biomolecules such as proteins and nucleic acids are harnessed for computational purposes. The field has gained considerable attention due to the possibility of exploiting the massive parallelism that is inherent in natural systems to solve computational problems. DNA has already been used to build complex molecular circuits, where the basic building blocks are logic gates that produce single outputs from one or more logical inputs. We designed and experimentally realized a three-input majority gate based on DNA strand displacement. One of the key features of a three-input majority gate is that the three inputs have equal priority, and the output is true if any two of the inputs are true. Our design consists of a central, circular DNA strand with three unique domains between which are identical joint sequences. Before inputs are introduced to the system, each domain and half of each joint is protected by one complementary ssDNA that displays a toehold for subsequent displacement by the corresponding input. With this design the relationship between any two domains is analogous to the relationship between inputs in a majority gate. Displacing two or more of the protection strands will expose at least one complete joint and return a true output; displacing none or only one of the protection strands will not expose a complete joint and will return a false output. Further, we designed and realized a complex five-input logic gate based on the majority gate described here. By controlling two of the five inputs the complex gate can realize every combination of OR and AND gates of the other three inputs.
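
    The gate logic in this abstract can be checked with a short sketch. It models only the Boolean behavior, under the assumption that joint i sits between domain i and domain (i + 1) mod 3 on the circular strand and is fully exposed only when both neighboring protection strands are displaced. The function names are hypothetical and nothing here models the chemistry.

```python
from itertools import product


def majority3(a, b, c):
    """Boolean three-input majority: true when at least two inputs are true."""
    return (a and b) or (a and c) or (b and c)


def joints_exposed(displaced):
    """Given three booleans (protection strand i displaced or not), return
    which of the three joints on the circular strand are fully exposed.

    Assumption: joint i lies between domain i and domain (i + 1) % 3, and
    each protection strand covers half of each adjacent joint.
    """
    return [displaced[i] and displaced[(i + 1) % 3] for i in range(3)]


# Exhaustively compare the joint-exposure model with the majority function.
agreement = all(
    any(joints_exposed(inputs)) == majority3(*inputs)
    for inputs in product([False, True], repeat=3)
)
print(agreement)
```

    Because the strand is circular, every pair of domains shares a joint, which is why exposing any complete joint is equivalent to having two or more true inputs in this model.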

  15. De novo inference of protein function from coarse-grained dynamics.

    PubMed

    Bhadra, Pratiti; Pal, Debnath

    2014-10-01

    Inference of molecular function of proteins is the fundamental task in the quest for understanding cellular processes. The task is getting increasingly difficult with thousands of new proteins discovered each day. The difficulty arises primarily due to the lack of a high-throughput experimental technique for assessing protein molecular function, a lacuna that computational approaches are trying hard to fill. The latter, too, face a major bottleneck in the absence of clear evidence based on evolutionary information. Here we propose a de novo approach to annotate protein molecular function through structural dynamics match for a pair of segments from two dissimilar proteins, which may share even <10% sequence identity. To screen these matches, corresponding 1 µs coarse-grained (CG) molecular dynamics trajectories were used to compute normalized root-mean-square-fluctuation graphs and select mobile segments, which were, thereafter, matched for all pairs using unweighted three-dimensional autocorrelation vectors. Our in-house custom-built forcefield (FF), extensively validated against dynamics information obtained from experimental nuclear magnetic resonance data, was specifically used to generate the CG dynamics trajectories. The test for correspondence of dynamics-signature of protein segments and function revealed 87% true positive rate and 93.5% true negative rate, on a dataset of 60 experimentally validated proteins, including moonlighting proteins and those with novel functional motifs. A random test against 315 unique fold/function proteins for a negative test gave >99% true recall. A blind prediction on a novel protein appears consistent with additional evidence retrieved therein. This is the first proof-of-principle of generalized use of structural dynamics for inferring protein molecular function leveraging our custom-made CG FF, useful to all. © 2014 Wiley Periodicals, Inc.

  16. Accuracy of ultrasound for the prediction of placenta accreta.

    PubMed

    Bowman, Zachary S; Eller, Alexandra G; Kennedy, Anne M; Richards, Douglas S; Winter, Thomas C; Woodward, Paula J; Silver, Robert M

    2014-08-01

    Ultrasound has been reported to be greater than 90% sensitive for the diagnosis of accreta. Prior studies may be subject to bias because of single expert observers, suspicion for accreta, and knowledge of risk factors. We aimed to assess the accuracy of ultrasound for the prediction of accreta. Patients with accreta at a single academic center were matched to patients with placenta previa, but no accreta, by year of delivery. Ultrasound studies with views of the placenta were collected, deidentified, blinded to clinical history, and placed in random sequence. Six investigators prospectively interpreted each study for the presence of accreta and findings reported to be associated with its diagnosis. Sensitivity, specificity, positive predictive, negative predictive value, and accuracy were calculated. Characteristics of accurate findings were compared using univariate and multivariate analyses. Six investigators examined 229 ultrasound studies from 55 patients with accreta and 56 controls for 1374 independent observations. 1205/1374 (87.7% overall, 90% controls, 84.9% cases) studies were given a diagnosis. There were 371 (27.0%) true positives; 81 (5.9%) false positives; 533 (38.8%) true negatives, 220 (16.0%) false negatives, and 169 (12.3%) with uncertain diagnosis. Sensitivity, specificity, positive predictive value, negative predictive value, and accuracy were 53.5%, 88.0%, 82.1%, 64.8%, and 64.8%, respectively. In multivariate analysis, true positives were more likely to have placental lacunae (odds ratio [OR], 1.5; 95% confidence interval [CI], 1.4-1.6), loss of retroplacental clear space (OR, 2.4; 95% CI, 1.1-4.9), or abnormalities on color Doppler (OR, 2.1; 95% CI, 1.8-2.4). Ultrasound for the prediction of placenta accreta may not be as sensitive as previously described. Copyright © 2014 Mosby, Inc. All rights reserved.

  17. Phylogenetic Tools for Generalized HIV-1 Epidemics: Findings from the PANGEA-HIV Methods Comparison.

    PubMed

    Ratmann, Oliver; Hodcroft, Emma B; Pickles, Michael; Cori, Anne; Hall, Matthew; Lycett, Samantha; Colijn, Caroline; Dearlove, Bethany; Didelot, Xavier; Frost, Simon; Hossain, A S Md Mukarram; Joy, Jeffrey B; Kendall, Michelle; Kühnert, Denise; Leventhal, Gabriel E; Liang, Richard; Plazzotta, Giacomo; Poon, Art F Y; Rasmussen, David A; Stadler, Tanja; Volz, Erik; Weis, Caroline; Leigh Brown, Andrew J; Fraser, Christophe

    2017-01-01

    Viral phylogenetic methods contribute to understanding how HIV spreads in populations, and thereby help guide the design of prevention interventions. So far, most analyses have been applied to well-sampled concentrated HIV-1 epidemics in wealthy countries. To direct the use of phylogenetic tools to where the impact of HIV-1 is greatest, the Phylogenetics And Networks for Generalized HIV Epidemics in Africa (PANGEA-HIV) consortium generates full-genome viral sequences from across sub-Saharan Africa. Analyzing these data presents new challenges, since epidemics are principally driven by heterosexual transmission and a smaller fraction of cases is sampled. Here, we show that viral phylogenetic tools can be adapted and used to estimate epidemiological quantities of central importance to HIV-1 prevention in sub-Saharan Africa. We used a community-wide methods comparison exercise on simulated data, where participants were blinded to the true dynamics they were inferring. Two distinct simulations captured generalized HIV-1 epidemics, before and after a large community-level intervention that reduced infection levels. Five research groups participated. Structured coalescent modeling approaches were most successful: phylogenetic estimates of HIV-1 incidence, incidence reductions, and the proportion of transmissions from individuals in their first 3 months of infection correlated with the true values (Pearson correlation > 90%), with small bias. However, on some simulations, true values were markedly outside reported confidence or credibility intervals. The blinded comparison revealed current limits and strengths in using HIV phylogenetics in challenging settings, provided benchmarks for future methods' development, and supports using the latest generation of phylogenetic tools to advance HIV surveillance and prevention. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  18. Magnetic Resonance–Based Automatic Air Segmentation for Generation of Synthetic Computed Tomography Scans in the Head Region

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Zheng, Weili; Kim, Joshua P.; Kadbi, Mo

    2015-11-01

    Purpose: To incorporate a novel imaging sequence for robust air and tissue segmentation using ultrashort echo time (UTE) phase images and to implement an innovative synthetic CT (synCT) solution as a first step toward MR-only radiation therapy treatment planning for brain cancer. Methods and Materials: Ten brain cancer patients were scanned with a UTE/Dixon sequence and other clinical sequences on a 1.0 T open magnet with simulation capabilities. Bone-enhanced images were generated from a weighted combination of water/fat maps derived from Dixon images and inverted UTE images. Automated air segmentation was performed using unwrapped UTE phase maps. Segmentation accuracy was assessed by calculating segmentation errors (true-positive rate, false-positive rate, and Dice similarity indices) using CT simulation (CT-SIM) as ground truth. The synCTs were generated using a voxel-based, weighted summation method incorporating T2, fluid attenuated inversion recovery (FLAIR), UTE1, and bone-enhanced images. Mean absolute error (MAE) characterized Hounsfield unit (HU) differences between synCT and CT-SIM. A dosimetry study was conducted, and differences were quantified using γ-analysis and dose-volume histogram analysis. Results: On average, true-positive rate and false-positive rate for the CT and MR-derived air masks were 80.8% ± 5.5% and 25.7% ± 6.9%, respectively. Dice similarity indices values were 0.78 ± 0.04 (range, 0.70-0.83). Full field of view MAE between synCT and CT-SIM was 147.5 ± 8.3 HU (range, 138.3-166.2 HU), with the largest errors occurring at bone–air interfaces (MAE 422.5 ± 33.4 HU for bone and 294.53 ± 90.56 HU for air). Gamma analysis revealed pass rates of 99.4% ± 0.04%, with acceptable treatment plan quality for the cohort. Conclusions: A hybrid MRI phase/magnitude UTE image processing technique was introduced that significantly improved bone and air contrast in MRI. Segmented air masks and bone-enhanced images were integrated into our synCT pipeline for brain, and results agreed well with clinical CTs, thereby supporting MR-only radiation therapy treatment planning in the brain.

  19. Magnetic Resonance-Based Automatic Air Segmentation for Generation of Synthetic Computed Tomography Scans in the Head Region.

    PubMed

    Zheng, Weili; Kim, Joshua P; Kadbi, Mo; Movsas, Benjamin; Chetty, Indrin J; Glide-Hurst, Carri K

    2015-11-01

    To incorporate a novel imaging sequence for robust air and tissue segmentation using ultrashort echo time (UTE) phase images and to implement an innovative synthetic CT (synCT) solution as a first step toward MR-only radiation therapy treatment planning for brain cancer. Ten brain cancer patients were scanned with a UTE/Dixon sequence and other clinical sequences on a 1.0 T open magnet with simulation capabilities. Bone-enhanced images were generated from a weighted combination of water/fat maps derived from Dixon images and inverted UTE images. Automated air segmentation was performed using unwrapped UTE phase maps. Segmentation accuracy was assessed by calculating segmentation errors (true-positive rate, false-positive rate, and Dice similarity indices) using CT simulation (CT-SIM) as ground truth. The synCTs were generated using a voxel-based, weighted summation method incorporating T2, fluid attenuated inversion recovery (FLAIR), UTE1, and bone-enhanced images. Mean absolute error (MAE) characterized Hounsfield unit (HU) differences between synCT and CT-SIM. A dosimetry study was conducted, and differences were quantified using γ-analysis and dose-volume histogram analysis. On average, true-positive rate and false-positive rate for the CT and MR-derived air masks were 80.8% ± 5.5% and 25.7% ± 6.9%, respectively. Dice similarity indices values were 0.78 ± 0.04 (range, 0.70-0.83). Full field of view MAE between synCT and CT-SIM was 147.5 ± 8.3 HU (range, 138.3-166.2 HU), with the largest errors occurring at bone-air interfaces (MAE 422.5 ± 33.4 HU for bone and 294.53 ± 90.56 HU for air). Gamma analysis revealed pass rates of 99.4% ± 0.04%, with acceptable treatment plan quality for the cohort. A hybrid MRI phase/magnitude UTE image processing technique was introduced that significantly improved bone and air contrast in MRI. Segmented air masks and bone-enhanced images were integrated into our synCT pipeline for brain, and results agreed well with clinical CTs, thereby supporting MR-only radiation therapy treatment planning in the brain. Copyright © 2015 Elsevier Inc. All rights reserved.
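
    The segmentation and intensity metrics named in this abstract can be sketched as follows. The sketch assumes the textbook definitions (Dice = 2TP / (2TP + FP + FN), true-positive rate = TP / (TP + FN), false-positive rate = FP / (FP + TN), MAE = mean of absolute differences); the abstract does not state which exact definitions the authors used, so their values may be computed differently. The function names and the toy voxel values are hypothetical.

```python
def mask_metrics(predicted, reference):
    """Compare two flattened binary masks voxel by voxel.

    Returns Dice, true-positive rate and false-positive rate using the
    standard definitions (an assumption, see the note above).
    """
    pairs = list(zip(predicted, reference))
    tp = sum(1 for p, r in pairs if p and r)
    fp = sum(1 for p, r in pairs if p and not r)
    fn = sum(1 for p, r in pairs if not p and r)
    tn = sum(1 for p, r in pairs if not p and not r)
    return {
        "dice": 2 * tp / (2 * tp + fp + fn),
        "tpr": tp / (tp + fn),
        "fpr": fp / (fp + tn),
    }


def mean_absolute_error(values_a, values_b):
    """Mean absolute difference between paired voxel intensities (e.g. HU)."""
    return sum(abs(a - b) for a, b in zip(values_a, values_b)) / len(values_a)


# Toy data: an MR-derived air mask versus a CT-derived reference mask.
metrics = mask_metrics([1, 1, 1, 0, 0, 0, 1, 0], [1, 1, 0, 0, 0, 0, 1, 1])
# Toy data: synthetic-CT versus CT intensities in Hounsfield units.
mae = mean_absolute_error([0, 40, 1000, -1000], [10, 30, 1200, -900])
print(metrics, mae)
```

    The masks are treated as flat sequences for brevity; a real pipeline would flatten 3D volumes first and guard against empty masks, which would make the denominators zero.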

  20. Machine learning methods can replace 3D profile method in classification of amyloidogenic hexapeptides.

    PubMed

    Stanislawski, Jerzy; Kotulska, Malgorzata; Unold, Olgierd

    2013-01-17

    Amyloids are proteins capable of forming fibrils. Many of them underlie serious diseases, like Alzheimer disease. The number of amyloid-associated diseases is constantly increasing. Recent studies indicate that amyloidogenic properties can be associated with short segments of amino acids, which transform the structure when exposed. A few hundred such peptides have been experimentally found. Experimental testing of all possible amino acid combinations is currently not feasible. Instead, they can be predicted by computational methods. 3D profile is a physicochemical-based method that has generated the most numerous dataset - ZipperDB. However, it is computationally very demanding. Here, we show that dataset generation can be accelerated. Two methods to increase the classification efficiency of amyloidogenic candidates are presented and tested: simplified 3D profile generation and machine learning methods. We generated a new dataset of hexapeptides, using a more economical 3D profile algorithm, which showed very good classification overlap with ZipperDB (93.5%). The new part of our dataset contains 1779 segments, with 204 classified as amyloidogenic. The dataset of 6-residue sequences with their binary classification, based on the energy of the segment, was applied for training machine learning methods. A separate set of sequences from ZipperDB was used as a test set. The most effective methods were Alternating Decision Tree and Multilayer Perceptron. Both methods obtained area under ROC curve of 0.96, accuracy 91%, true positive rate ca. 78%, and true negative rate 95%. A few other machine learning methods also achieved a good performance. The computational time was reduced from 18-20 CPU-hours (full 3D profile) to 0.5 CPU-hours (simplified 3D profile) to seconds (machine learning). We showed that the simplified profile generation method does not introduce an error with regard to the original method, while increasing the computational efficiency. Our new dataset proved representative enough to use simple statistical methods for testing the amyloidogenicity based only on six-letter sequences. Statistical machine learning methods such as Alternating Decision Tree and Multilayer Perceptron can replace the energy-based classifier, with the advantage of very significantly reduced computational time and simplicity to perform the analysis. Additionally, a decision tree provides a set of very easily interpretable rules.

  1. Identification and Characterization of a Novel Alpaca Respiratory Coronavirus Most Closely Related to the Human Coronavirus 229E

    PubMed Central

    Crossley, Beate M.; Mock, Richard E.; Callison, Scott A.; Hietala, Sharon K.

    2012-01-01

    In 2007, a novel coronavirus associated with an acute respiratory disease in alpacas (Alpaca Coronavirus, ACoV) was isolated. Full-length genomic sequencing of the ACoV demonstrated the genome to be consistent with other Alphacoronaviruses. A putative additional open-reading frame was identified between the nucleocapsid gene and 3'UTR. The ACoV was genetically most similar to the common human coronavirus (HCoV) 229E with 92.2% nucleotide identity over the entire genome. A comparison of spike gene sequences from ACoV and from HCoV-229E isolates recovered over a span of five decades showed the ACoV to be most similar to viruses isolated in the 1960’s to early 1980’s. The true origin of the ACoV is unknown, however a common ancestor between the ACoV and HCoV-229E appears to have existed prior to the 1960’s, suggesting virus transmission, either as a zoonosis or anthroponosis, has occurred between alpacas and humans. PMID:23235471

  2. Prediction of Nucleotide Binding Peptides Using Star Graph Topological Indices.

    PubMed

    Liu, Yong; Munteanu, Cristian R; Fernández Blanco, Enrique; Tan, Zhiliang; Santos Del Riego, Antonino; Pazos, Alejandro

    2015-11-01

    The nucleotide binding proteins are involved in many important cellular processes, such as transmission of genetic information or energy transfer and storage. Therefore, the screening of new peptides for this biological function is an important research topic. The current study proposes a mixed methodology to obtain the first classification model that is able to predict new nucleotide binding peptides, using only the amino acid sequence. Thus, the methodology uses a Star graph molecular descriptor of the peptide sequences and the Machine Learning technique for the best classifier. The best model represents a Random Forest classifier based on two features of the embedded and non-embedded graphs. The performance of the model is excellent, considering similar models in the field, with an Area Under the Receiver Operating Characteristic Curve (AUROC) value of 0.938 and true positive rate (TPR) of 0.886 (test subset). The prediction of new nucleotide binding peptides with this model could be useful for drug target studies in drug development. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  3. A NEW TECHNIQUE FOR THE PHOTOSPHERIC DRIVING OF NON-POTENTIAL SOLAR CORONAL MAGNETIC FIELD SIMULATIONS

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Weinzierl, Marion; Yeates, Anthony R.; Mackay, Duncan H.

    2016-05-20

    In this paper, we develop a new technique for driving global non-potential simulations of the Sun’s coronal magnetic field solely from sequences of radial magnetic maps of the solar photosphere. A primary challenge to driving such global simulations is that the required horizontal electric field cannot be uniquely determined from such maps. We show that an “inductive” electric field solution similar to that used by previous authors successfully reproduces specific features of the coronal field evolution in both single and multiple bipole simulations. For these cases, the true solution is known because the electric field was generated from a surface flux-transport model. The match for these cases is further improved by including the non-inductive electric field contribution from surface differential rotation. Then, using this reconstruction method for the electric field, we show that a coronal non-potential simulation can be successfully driven from a sequence of ADAPT maps of the photospheric radial field, without including additional physical observations which are not routinely available.

  4. No genome-wide protein sequence convergence for echolocation.

    PubMed

    Zou, Zhengting; Zhang, Jianzhi

    2015-05-01

    Toothed whales and two groups of bats independently acquired echolocation, the ability to locate and identify objects by reflected sound. Echolocation requires physiologically complex and coordinated vocal, auditory, and neural functions, but the molecular basis of the capacity for echolocation is not well understood. A recent study suggested that convergent amino acid substitutions widespread in the proteins of echolocators underlay the convergent origins of mammalian echolocation. Here, we show that genomic signatures of molecular convergence between echolocating lineages are generally no stronger than those between echolocating and comparable nonecholocating lineages. The same is true for the group of 29 hearing-related proteins claimed to be enriched with molecular convergence. Reexamining the previous selection test reveals several flaws and invalidates the asserted evidence for adaptive convergence. Together, these findings indicate that the reported genomic signatures of convergence largely reflect the background level of sequence convergence unrelated to the origins of echolocation. © The Author 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  5. BGD: a database of bat genomes.

    PubMed

    Fang, Jianfei; Wang, Xuan; Mu, Shuo; Zhang, Shuyi; Dong, Dong

    2015-01-01

    Bats account for ~20% of mammalian species, and are the only mammals with true powered flight. Because of their specialized phenotypic traits, much research has been devoted to examining the evolution of bats. Until now, some whole genome sequences of bats have been assembled and annotated; however, a uniform resource for the annotated bat genomes is still unavailable. To make the extensive data associated with the bat genomes accessible to the general biological communities, we established a Bat Genome Database (BGD). BGD is an open-access, web-available portal that integrates available data of bat genomes and genes. It hosts data from six bat species, including two megabats and four microbats. Users can query the gene annotations using an efficient search engine, and it offers browsable tracks of bat genomes. Furthermore, an easy-to-use phylogenetic analysis tool is also provided to facilitate online phylogeny study of genes. To the best of our knowledge, BGD is the first database of bat genomes. It will extend our understanding of bat evolution and be advantageous to bat sequence analysis. BGD is freely available at: http://donglab.ecnu.edu.cn/databases/BatGenome/.

  6. Fungi associated with black mould on baobab trees in southern Africa.

    PubMed

    Cruywagen, Elsie M; Crous, Pedro W; Roux, Jolanda; Slippers, Bernard; Wingfield, Michael J

    2015-07-01

    There have been numerous reports in the scientific and popular literature suggesting that African baobab (Adansonia digitata) trees are dying, with symptoms including a black mould on their bark. The aim of this study was to determine the identity of the fungi causing this black mould and to consider whether they might be affecting the health of trees. The fungi were identified by sequencing directly from mycelium on the infected tissue as well as from cultures on agar. Sequence data for the ITS region of the rDNA resulted in the identification of four fungi including Aureobasidium pullulans, Toxicocladosporium irritans and a new species of Rachicladosporium described here as Rachicladosporium africanum. A single isolate of an unknown Cladosporium sp. was also found. These fungi, referred to here as black mould, are not true sooty mould fungi and they were shown to penetrate below the bark of infected tissue, causing a distinct host reaction. Although infections can lead to dieback of small twigs on severely infected branches, the mould was not found to kill trees.

  7. A method for high-throughput production of sequence-verified DNA libraries and strain collections.

    PubMed

    Smith, Justin D; Schlecht, Ulrich; Xu, Weihong; Suresh, Sundari; Horecka, Joe; Proctor, Michael J; Aiyar, Raeka S; Bennett, Richard A O; Chu, Angela; Li, Yong Fuga; Roy, Kevin; Davis, Ronald W; Steinmetz, Lars M; Hyman, Richard W; Levy, Sasha F; St Onge, Robert P

    2017-02-13

    The low costs of array-synthesized oligonucleotide libraries are empowering rapid advances in quantitative and synthetic biology. However, high synthesis error rates, uneven representation, and lack of access to individual oligonucleotides limit the true potential of these libraries. We have developed a cost-effective method called Recombinase Directed Indexing (REDI), which involves integration of a complex library into yeast, site-specific recombination to index library DNA, and next-generation sequencing to identify desired clones. We used REDI to generate a library of ~3,300 DNA probes that exhibited > 96% purity and remarkable uniformity (> 95% of probes within twofold of the median abundance). Additionally, we created a collection of ~9,000 individually accessible CRISPR interference yeast strains for > 99% of genes required for either fermentative or respiratory growth, demonstrating the utility of REDI for rapid and cost-effective creation of strain collections from oligonucleotide pools. Our approach is adaptable to any complex DNA library, and fundamentally changes how these libraries can be parsed, maintained, propagated, and characterized. © 2017 The Authors. Published under the terms of the CC BY 4.0 license.
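
    The uniformity figure quoted above (share of probes within twofold of the median abundance) can be computed as in the following sketch. It assumes "within twofold" means an abundance between half the median and twice the median, inclusive; the abstract does not spell this out. The function name and the read counts are hypothetical illustration values, not data from the study.

```python
from statistics import median


def fraction_within_twofold(abundances):
    """Fraction of values lying between median / 2 and 2 * median, inclusive.

    Assumed interpretation of "within twofold of the median abundance".
    """
    mid = median(abundances)
    inside = sum(1 for value in abundances if mid / 2 <= value <= 2 * mid)
    return inside / len(abundances)


# Hypothetical per-probe read counts.
counts = [100, 120, 80, 45, 210, 95]
uniformity = fraction_within_twofold(counts)
print(f"{uniformity:.1%} of probes within twofold of the median")
```

    With these toy counts the median is 97.5, so the accepted range is 48.75 to 195 and four of the six probes fall inside it.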

  8. The development of a cisgenic apple plant.

    PubMed

    Vanblaere, Thalia; Szankowski, Iris; Schaart, Jan; Schouten, Henk; Flachowsky, Henryk; Broggini, Giovanni A L; Gessler, Cesare

    2011-07-20

    Cisgenesis represents a step toward a new generation of GM crops. The lack of selectable genes (e.g. antibiotic or herbicide resistance) in the final product and the fact that the inserted gene(s) derive from organisms sexually compatible with the target crop should raise fewer environmental concerns and increase consumer acceptance. Here we report the generation of a cisgenic apple plant by inserting the endogenous apple scab resistance gene HcrVf2 under the control of its own regulatory sequences into the scab susceptible apple cultivar Gala. A previously developed method based on Agrobacterium-mediated transformation combined with a positive and negative selection system and a chemically inducible recombination machinery allowed the generation of apple cv. Gala carrying the scab resistance gene HcrVf2 under its native regulatory sequences and no foreign genes. Three cisgenic lines were chosen for detailed investigation and were shown to carry a single T-DNA insertion and express the target gene HcrVf2. This is the first report of the generation of a true cisgenic plant. Copyright © 2011 Elsevier B.V. All rights reserved.

  9. Numerical study on the sequential Bayesian approach for radioactive materials detection

    NASA Astrophysics Data System (ADS)

    Qingpei, Xiang; Dongfeng, Tian; Jianyu, Zhu; Fanhua, Hao; Ge, Ding; Jun, Zeng

    2013-01-01

    A new detection method, based on the sequential Bayesian approach proposed by Candy et al., offers new horizons for the research of radioactive detection. Compared with the commonly adopted detection methods incorporated with statistical theory, the sequential Bayesian approach offers the advantage of shorter verification time during the analysis of spectra that contain low total counts, especially in complex radionuclide components. In this paper, a simulation experiment platform implementing the sequential Bayesian approach was developed. Event sequences of γ-rays associated with the true parameters of a LaBr3(Ce) detector were obtained from an event sequence generator using Monte Carlo sampling theory to study the performance of the sequential Bayesian approach. The numerical experimental results are in accordance with those of Candy. Moreover, the relationship between the detection model and the event generator, respectively represented by the expected detection rate (Am) and the tested detection rate (Gm) parameters, is investigated. To achieve an optimal performance for this processor, the interval of the tested detection rate as a function of the expected detection rate is also presented.

  10. Ends-in Vs. Ends-Out Recombination in Yeast

    PubMed Central

    Hastings, P. J.; McGill, C.; Shafer, B.; Strathern, J. N.

    1993-01-01

    Integration of linearized plasmids into yeast chromosomes has been used as a model system for the study of recombination initiated by double-strand breaks. The linearized plasmid DNA recombines efficiently into sequences homologous to the ends of the DNA. This efficient recombination occurs both for the configuration in which the break is in a contiguous region of homology (herein called the ends-in configuration) and for "omega" insertions in which plasmid sequences interrupt a linear region of homology (herein called the ends-out configuration). The requirements for integration of these two configurations are expected to be different. We compared these two processes in a yeast strain containing an ends-in target and an ends-out target for the same cut plasmid. Recovery of ends-in events exceeds ends-out events by two- to threefold. Possible causes for the origin of this small bias are discussed. The lack of an extreme difference in frequency implies that cooperativity between the two ends does not contribute to the efficiency with which cut circular plasmids are integrated. This may also be true for the repair of chromosomal double-strand breaks. PMID:8307337

  11. ORCAN-a web-based meta-server for real-time detection and functional annotation of orthologs.

    PubMed

    Zielezinski, Andrzej; Dziubek, Michal; Sliski, Jan; Karlowski, Wojciech M

    2017-04-15

    ORCAN (ORtholog sCANner) is a web-based meta-server for one-click evolutionary and functional annotation of protein sequences. The server combines information from the most popular orthology-prediction resources, including four tools and four online databases. Functional annotation utilizes five additional comparisons between the query and identified homologs, including: sequence similarity, protein domain architectures, functional motifs, Gene Ontology term assignments and a list of associated articles. Furthermore, the server uses a plurality-based rating system to evaluate the orthology relationships and to rank the reference proteins by their evolutionary and functional relevance to the query. Using a dataset of ∼1 million true yeast orthologs as a sample reference set, we show that combining multiple orthology-prediction tools in ORCAN increases the sensitivity and precision by 1-2 percentage points. The service is available for free at http://www.combio.pl/orcan/. Contact: wmk@amu.edu.pl. Supplementary data are available at Bioinformatics online. © The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com

  12. A 454 multiplex sequencing method for rapid and reliable genotyping of highly polymorphic genes in large-scale studies.

    PubMed

    Galan, Maxime; Guivier, Emmanuel; Caraux, Gilles; Charbonnel, Nathalie; Cosson, Jean-François

    2010-05-11

    High-throughput sequencing technologies offer new perspectives for biomedical, agronomical and evolutionary research. Promising progress now concerns the application of these technologies to large-scale studies of genetic variation. Such studies require the genotyping of high numbers of samples. This is theoretically possible using 454 pyrosequencing, which generates billions of base pairs of sequence data. However, several challenges arise: first, in the attribution of each read produced to its original sample, and second, in bioinformatic analyses to distinguish true from artifactual sequence variation. This pilot study proposes a new application for the 454 GS FLX platform, allowing the individual genotyping of thousands of samples in one run. A probabilistic model has been developed to demonstrate the reliability of this method. DNA amplicons from 1,710 rodent samples were individually barcoded using a combination of tags located in forward and reverse primers. Amplicons consisted of 222 bp fragments corresponding to DRB exon 2, a highly polymorphic gene in mammals. A total of 221,789 reads were obtained, of which 153,349 were finally assigned to original samples. Rules based on a probabilistic model and a four-step procedure were developed to validate sequences and provide a confidence level for each genotype. The method gave promising results, with the genotyping of DRB exon 2 sequences for 1,407 samples from 24 different rodent species and the sequencing of 392 variants in one half of a 454 run. Using replicates, we estimated that the reproducibility of genotyping reached 95%. This new approach is a promising alternative to classical methods involving electrophoresis-based techniques for variant separation and cloning-sequencing for sequence determination. The 454 system is less costly and time consuming and may enhance the reliability of genotypes obtained when high numbers of samples are studied. It opens up new perspectives for the study of evolutionary and functional genetics of highly polymorphic genes like major histocompatibility complex genes in vertebrates or loci regulating self-compatibility in plants. Important applications in biomedical research will include the detection of individual variation in disease susceptibility. Similarly, agronomy will benefit from this approach, through the study of genes implicated in productivity or disease susceptibility traits.
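
    The read-to-sample attribution step described above (a combination of tags in the forward and reverse primers) might be sketched as follows. The tag sequences, tag length, and sample names are hypothetical and not taken from the study, and exact tag matching is an assumption; the abstract does not state how mismatches were handled.

```python
# Hypothetical 6-nt tags and sample sheet, for illustration only.
TAG_LEN = 6
FORWARD_TAGS = {"ACGAGT": "F1", "TCTCTA": "F2"}
REVERSE_TAGS = {"CATGCA": "R1", "GTAGAC": "R2"}
SAMPLE_BY_TAG_PAIR = {
    ("F1", "R1"): "sample_A",
    ("F1", "R2"): "sample_B",
    ("F2", "R1"): "sample_C",
}

_COMPLEMENT = str.maketrans("ACGT", "TGCA")

def reverse_complement(seq):
    """Return the reverse complement of a DNA string (A, C, G, T only)."""
    return seq.translate(_COMPLEMENT)[::-1]

def assign_read(read):
    """Return the sample name for a read, or None if it cannot be assigned.

    The forward tag is read from the 5' end; the reverse tag is expected
    reverse-complemented at the 3' end of the same read.
    """
    if len(read) < 2 * TAG_LEN:
        return None
    fwd = FORWARD_TAGS.get(read[:TAG_LEN])
    rev = REVERSE_TAGS.get(reverse_complement(read[-TAG_LEN:]))
    if fwd is None or rev is None:
        return None
    return SAMPLE_BY_TAG_PAIR.get((fwd, rev))
```

    Using tag pairs rather than single tags means that m forward and n reverse tags can label up to m × n samples, which is what makes thousands of samples per run practical.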

  13. Dimorphic cycle in Candida citri sp. nov., a novel yeast species isolated from rotting fruit in Borneo.

    PubMed

    Sipiczki, Matthias

    2011-03-01

    Five dimorphic yeast strains were isolated from rotting lime fruits in Borneo. The sequences of the D1/D2 domains of the 26S rRNA genes, the internal transcribed spacer (ITS) chromosomal regions and the 18S rRNA genes were identical in the isolates and differed from the corresponding sequences of all known yeast species. Based on the sequence differences (12-15% in the D1/D2 domain) from the closest relatives and the different pattern of taxonomic traits, the new isolates are assigned the status of a new species, for which the name Candida citri sp. nov. is proposed. Its type strain is 11-469(T), which has been deposited in Centraalbureau voor Schimmelcultures (Utrecht, the Netherlands) as CBS 11858(T), Culture Collection of Yeasts (Bratislava, Slovakia) as CCY 29-181-1(T) and the National Collection of Agricultural and Industrial Microorganisms (Budapest, Hungary) as NCAIM Y.01978(T). MycoBank number: MB 519100. The GenBank accession numbers for nucleotide sequences of its D1/D2 domain, ITS and 18S regions are HM803241, HM803242 and HM803243, respectively. Candida citri produces invasive mycelium composed of true septate hyphae that grow towards nutrient-rich parts of the medium and develop large vacuoles at the nongrowing ends of their cells. The hyphae produce blastoconidia, which can establish satellite yeast colonies in the invaded solid substrate. © 2011 Federation of European Microbiological Societies. Published by Blackwell Publishing Ltd. All rights reserved.

  14. Robust high-performance nanoliter-volume single-cell multiple displacement amplification on planar substrates.

    PubMed

    Leung, Kaston; Klaus, Anders; Lin, Bill K; Laks, Emma; Biele, Justina; Lai, Daniel; Bashashati, Ali; Huang, Yi-Fei; Aniba, Radhouane; Moksa, Michelle; Steif, Adi; Mes-Masson, Anne-Marie; Hirst, Martin; Shah, Sohrab P; Aparicio, Samuel; Hansen, Carl L

    2016-07-26

    The genomes of large numbers of single cells must be sequenced to further understanding of the biological significance of genomic heterogeneity in complex systems. Whole genome amplification (WGA) of single cells is generally the first step in such studies, but is prone to nonuniformity that can compromise genomic measurement accuracy. Despite recent advances, robust performance in high-throughput single-cell WGA remains elusive. Here, we introduce droplet multiple displacement amplification (MDA), a method that uses commercially available liquid dispensing to perform high-throughput single-cell MDA in nanoliter volumes. The performance of droplet MDA is characterized using a large dataset of 129 normal diploid cells, and is shown to exceed previously reported single-cell WGA methods in amplification uniformity, genome coverage, and/or robustness. We achieve up to 80% coverage of a single-cell genome at 5× sequencing depth, and demonstrate excellent single-nucleotide variant (SNV) detection using targeted sequencing of droplet MDA product to achieve a median allelic dropout of 15%, and using whole genome sequencing to achieve false and true positive rates of 9.66 × 10(-6) and 68.8%, respectively, in a G1-phase cell. We further show that droplet MDA allows for the detection of copy number variants (CNVs) as small as 30 kb in single cells of an ovarian cancer cell line and as small as 9 Mb in two high-grade serous ovarian cancer samples using only 0.02× depth. Droplet MDA provides an accessible and scalable method for performing robust and accurate CNV and SNV measurements on large numbers of single cells.

  15. Current sequencing technology makes microhaplotypes a powerful new type of genetic marker for forensics.

    PubMed

    Kidd, Kenneth K; Pakstis, Andrew J; Speed, William C; Lagacé, Robert; Chang, Joseph; Wootton, Sharon; Haigh, Eva; Kidd, Judith R

    2014-09-01

    SNPs that are molecularly very close (<10 kb) will generally have extremely low recombination rates, much less than 10^-4. Multiple haplotypes will often exist because of the history of the origins of the variants at the different sites, rare recombinants, and the vagaries of random genetic drift and/or selection. Such multiallelic haplotype loci are potentially important in forensic work for individual identification, for defining ancestry, and for identifying familial relationships. The new DNA sequencing capabilities currently available make possible continuous runs of a few hundred base pairs so that we can now determine the allelic combination of multiple SNPs on each chromosome of an individual, i.e., the phase, for multiple SNPs within a small segment of DNA. Therefore, we have begun to identify regions, encompassing two to four SNPs with an extent of <200 bp, that define multiallelic haplotype loci. We have identified candidate regions and have collected pilot data on many candidate microhaplotype loci. Here we present 31 microhaplotype loci that have at least three alleles, have high heterozygosity, are globally informative, and are statistically independent at the population level. This study of microhaplotype loci (microhaps) provides proof of principle that such markers exist and validates their usefulness for ancestry inference, lineage-clan-family inference, and individual identification. The true value of microhaplotypes will come with sequencing methods that can establish alleles unambiguously, including disentangling of mixtures, because a single sequencing run on a single strand of DNA will encompass all of the SNPs. Copyright © 2014 The Authors. Published by Elsevier Ireland Ltd. All rights reserved.
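
    To make the "at least three alleles, high heterozygosity" criterion concrete, the sketch below counts phased haplotypes at one locus and computes the standard expected heterozygosity, 1 minus the sum of squared allele frequencies. The haplotype strings are invented for illustration, and whether the authors applied any small-sample correction is not stated in the abstract.

```python
from collections import Counter

def haplotype_frequencies(haplotypes):
    """Relative frequency of each phased haplotype (one string per chromosome)."""
    counts = Counter(haplotypes)
    total = len(haplotypes)
    return {hap: n / total for hap, n in counts.items()}

def expected_heterozygosity(frequencies):
    """Expected heterozygosity: 1 - sum(p_i ** 2) over allele frequencies."""
    return 1.0 - sum(p * p for p in frequencies)

# Invented example: three SNPs phased on 10 chromosomes, three haplotypes.
observed = ["ACG"] * 5 + ["ATG"] * 3 + ["GCG"] * 2
freqs = haplotype_frequencies(observed)
het = expected_heterozygosity(freqs.values())
```

    A single biallelic SNP can never exceed a heterozygosity of 0.5, whereas a multiallelic haplotype locus can, which is one reason such loci are more informative per marker.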

  16. An interleaved sequence for simultaneous magnetic resonance angiography (MRA), susceptibility weighted imaging (SWI) and quantitative susceptibility mapping (QSM).

    PubMed

    Chen, Yongsheng; Liu, Saifeng; Buch, Sagar; Hu, Jiani; Kang, Yan; Haacke, E Mark

    2018-04-01

    To image the entire vasculature of the brain with complete suppression of signal from background tissue using a single 3D excitation interleaved rephased/dephased multi-echo gradient echo sequence. This ensures no loss of signal from fast flow and provides co-registered susceptibility weighted images (SWI) and quantitative susceptibility maps (QSM) from the same scan. The suppression of background tissue was accomplished by subtracting the flow-dephased images from the flow-rephased images with the same echo time of 12.5ms to generate a magnetic resonance angiogram and venogram (MRAV). Further, a 2.5ms flow-compensated echo was added in the rephased portion to provide sufficient signal for major arteries with fast flow. The QSM data from the rephased 12.5ms echo was used to suppress veins on the MRAV to generate an artery-only MRA. The proposed approach was tested on five healthy volunteers at 3T. This three-echo interleaved GRE sequence provided complete background suppression of stationary tissues, while the short echo data gave high signal in the internal carotid and middle cerebral arteries (MCA). The contrast-to-noise ratio (CNR) of the arteries was significantly improved in the M3 territory of the MCA compared to the non-linear subtraction MRA and TOF-MRA. Veins were suppressed successfully utilizing the QSM data. The background tissue can be properly suppressed using the proposed interleaved MRAV sequence. One can obtain whole brain MRAV, MRA, SWI, true-SWI (or tSWI) and QSM data simultaneously from a single scan. Published by Elsevier Inc.

  17. Paleomagnetism of Holocene lava flows from the Reykjanes Peninsula and the Tungnaá lava sequence (Iceland): implications for flow correlation and ages

    NASA Astrophysics Data System (ADS)

    Pinton, Annamaria; Giordano, Guido; Speranza, Fabio; Þórðarson, Þorvaldur

    2018-01-01

    The impact of Holocene eruptive events from hot spots like Iceland may have had significant global implications; thus, dating past eruptions and establishing their chronology is important. However, at high-latitude volcanic islands, the paucity of soils severely limits 14C dating, while the poor K content of basalts strongly restricts the use of K/Ar and Ar/Ar methods. Even tephrochronology, based on 14C age determinations, refers to layers that rarely lie directly above the lava flows to be dated. We report on the paleomagnetic dating of 25 sites from the Reykjanes Peninsula and the Tungnaá lava sequence of Iceland. The gathered paleomagnetic directions were compared with the available reference paleosecular variation curves of the Earth's magnetic field to obtain the possible emplacement age intervals. To test the method's validity, we sampled the precisely dated Laki (1783-1784 AD) and Eldgjá (934-938 AD) lavas. The age windows obtained for these events encompass the true flow ages. For sites from the Reykjanes Peninsula and the Tungnaá lava sequence, we derived multiple possible eruption events and ages. In the Reykjanes Peninsula, we propose an older emplacement age (immediately following the 870 AD Iceland Settlement age) for the Ogmundarhraun and Kapelluhraun lava fields. For pre-historical (older than the settlement age) Tungnaá eruptions, the method has a dating precision of 300-400 years, which allows an increase in the detail of the chronostratigraphy and distribution of lavas in the Tungnaá sequence.

  18. Assessment of phylogenetic sensitivity for reconstructing HIV-1 epidemiological relationships.

    PubMed

    Beloukas, Apostolos; Magiorkinis, Emmanouil; Magiorkinis, Gkikas; Zavitsanou, Asimina; Karamitros, Timokratis; Hatzakis, Angelos; Paraskevis, Dimitrios

    2012-06-01

    Phylogenetic analysis has been extensively used as a tool for the reconstruction of epidemiological relations for research or for forensic purposes. It was our objective to assess the sensitivity of different phylogenetic methods and various phylogenetic programs to reconstruct epidemiological links among HIV-1-infected patients, that is, the probability of revealing a true transmission relationship. Multiple datasets (90) were prepared consisting of HIV-1 sequences in protease (PR) and partial reverse transcriptase (RT) sampled from patients with documented epidemiological relationships (target population), and from unrelated individuals (control population) belonging to the same HIV-1 subtype as the target population. Each dataset varied regarding the number, the geographic origin and the transmission risk groups of the sequences among the control population. Phylogenetic trees were inferred by neighbor-joining (NJ), maximum likelihood heuristics (hML) and Bayesian methods. All clusters of sequences belonging to the target population were correctly reconstructed by NJ and Bayesian methods, receiving high bootstrap and posterior probability (PP) support, respectively. On the other hand, TreePuzzle failed to reconstruct or provide significant support for several clusters; high puzzling step support was associated with the inclusion of control sequences from the same geographic area as the target population. In contrast, all clusters were correctly reconstructed by hML as implemented in PhyML 3.0, receiving high bootstrap support. We report that under the conditions of our study, hML using PhyML, NJ and Bayesian methods were the most sensitive for the reconstruction of epidemiological links, mostly from sexually infected individuals. Copyright © 2012 Elsevier B.V. All rights reserved.

  19. An algorithm for extraction of periodic signals from sparse, irregularly sampled data

    NASA Technical Reports Server (NTRS)

    Wilcox, J. Z.

    1994-01-01

    Temporal gaps in discrete sampling sequences produce spurious Fourier components at the intermodulation frequencies of an oscillatory signal and the temporal gaps, thus significantly complicating spectral analysis of such sparsely sampled data. A new fast Fourier transform (FFT)-based algorithm has been developed, suitable for spectral analysis of sparsely sampled data with a relatively small number of oscillatory components buried in background noise. The algorithm's principal idea has its origin in the so-called 'clean' algorithm used to sharpen images of scenes corrupted by atmospheric and sensor aperture effects. It identifies as the signal's 'true' frequency that oscillatory component which, when passed through the same sampling sequence as the original data, produces a Fourier image that is the best match to the original Fourier space. The algorithm has generally met with success in trials with simulated data with a low signal-to-noise ratio, including those of a type similar to hourly residuals for Earth orientation parameters extracted from VLBI data. For eight oscillatory components in the diurnal and semidiurnal bands, all components with an amplitude-noise ratio greater than 0.2 were successfully extracted for all sequences and duty cycles (greater than 0.1) tested; the amplitude-noise ratios of the extracted signals were as low as 0.05 for high duty cycles and long sampling sequences. When, in addition to these high frequencies, strong low-frequency components are present in the data, the low-frequency components are generally eliminated first, by employing a version of the algorithm that searches for non-integer multiples of the discrete FFT minimum frequency.

  20. Phylogeny and origin of 82 zygomycetes from all 54 genera of the Mucorales and Mortierellales based on combined analysis of actin and translation elongation factor EF-1alpha genes.

    PubMed

    Voigt, K; Wöstemeyer, J

    2001-05-30

    True fungi (Eumycota) are heterotrophic eukaryotic microorganisms encompassing ascomycetes, basidiomycetes, chytridiomycetes and zygomycetes. The natural systematics of the latter group, Zygomycota, are very poorly understood due to the lack of distinguishing morphological characters. We have determined sequences for the nuclear-encoded genes actin (act) from 82 zygomycetes representing all 54 currently recognized genera from the two zygomycetous orders Mucorales and Mortierellales. We also determined sequences for translation elongation factor EF-1alpha (tef) from 16 zygomycetes (total of 96,837 bp). Phylogenetic analysis in the context of available sequence data (total 2,062 nucleotide positions per species) revealed that current classification schemes for the mucoralean fungi are highly unnatural at the family and, to a large extent, at the genus level. The data clearly indicate a deep, ancient and distinct dichotomy of the orders Mucorales and Mortierellales, which are recognized only in some zygomycete systems. Yet at the same time the data show that two genera - Umbelopsis and Micromucor - previously placed within the Mortierellales on the basis of their weakly developed columella (a morphological structure of the sporangiophore well-developed within all Mucorales) are in fact members of the Mucorales. Phylogenetic analyses of the encoded amino acid sequences in the context of homologues from eukaryotes and archaebacterial outgroups indicate that the Eumycota studied here are a natural group but provide little or no support for the monophyly of either zygomycetes, ascomycetes or basidiomycetes. The data clearly indicate that a complete revision of zygomycete natural systematics is necessary.

  1. Phylogenetic relationships between some members of the genera Neisseria, Acinetobacter, Moraxella, and Kingella based on partial 16S ribosomal DNA sequence analysis.

    PubMed

    Enright, M C; Carter, P E; MacLean, I A; McKenzie, H

    1994-07-01

    We obtained 16S ribosomal DNA (rDNA) sequence data for strains belonging to 11 species of Proteobacteria, including the type strains of Kingella kingae, Neisseria lactamica, Neisseria meningitidis, Moraxella lacunata subsp. lacunata, [Neisseria] ovis, Moraxella catarrhalis, Moraxella osloensis, [Moraxella] phenylpyruvica, and Acinetobacter lwoffii, as well as strains of Neisseria subflava and Acinetobacter calcoaceticus. The data in a distance matrix constructed by comparing the sequences supported the proposal that the genera Acinetobacter and Moraxella and [N.] ovis should be excluded from the family Neisseriaceae. Our results are consistent with hybridization data which suggest that these excluded taxa should be part of a new family, the Moraxellaceae. The strains that we studied can be divided into the following five groups: (i) M. lacunata subsp. lacunata, [N.] ovis, and M. catarrhalis; (ii) M. osloensis; (iii) [M.] phenylpyruvica; (iv) A. calcoaceticus and A. lwoffii; and (v) N. meningitidis, N. subflava, N. lactamica, and K. kingae. We agree with the previous proposal that [N.] ovis should be renamed Moraxella ovis, as this organism is closely related to Moraxella species and not to Neisseria species. The generically misnamed taxon [M.] phenylpyruvica belongs to the proposed family Moraxellaceae, but it is sufficiently different to warrant exclusion from the genus Moraxella. Further work needs to be done to investigate genetically similar species, such as Psychrobacter immobilis, before the true generic position of this organism can be determined. Automated 16S rDNA sequencing with the PCR allows workers to accurately determine phylogenetic relationships between groups of organisms. (ABSTRACT TRUNCATED AT 250 WORDS)

  2. Isotopic complexities and the age of the Delfonte volcanic rocks, eastern Mescal Range, southeastern California: Stratigraphic and tectonic implications

    USGS Publications Warehouse

    Fleck, R.J.; Mattinson, J.M.; Busby, C.J.; Carr, M.D.; Davis, G.A.; Burchfiel, B.C.

    1994-01-01

    Combined U-Pb zircon, Rb-Sr, 40Ar/39Ar laser-fusion, and conventional K-Ar geochronology establish a late Early Cretaceous age for the Delfonte volcanic rocks. U-Pb zircon analyses define a lower intercept age of 100.5 ± 2 Ma that is interpreted as the crystallization age of the Delfonte sequence. Argon studies document both xenocrystic contamination and postemplacement Ar loss. Rb-Sr results from mafic lavas at the base of the sequence demonstrate compositionally correlated variations in initial 87Sr/86Sr ratios (Sri) from 0.706 for basalts to 0.716 for andesitic compositions. This covariation indicates substantial mixing of subcontinental lithosphere with Proterozoic upper crust. Correlations between Rb/Sr and Sri may result not only in pseudoisochrons approaching the age of the crustal component, but also in reasonable but incorrect apparent ages approaching the true age. Ages obtained in this study require that at least some of the thrust faulting in the Mescal Range-Clark Mountain portion of the foreland fold-and-thrust belt occurred later than ca. 100 Ma and was broadly contemporaneous with emplacement of the Keystone thrust plate in the Spring Mountains to the northeast. Comparison of the age and Rb-Sr systematics of ash-flow tuff boulders in the synorogenic Lavinia Wash sequence near Goodsprings, Nevada, with those of the Delfonte volcanic rocks supports a Delfonte source for the boulders. The 99 Ma age of the Lavinia Wash sequence is nearly identical to the Delfonte age, requiring rapid erosion, transport, and deposition following Delfonte volcanism.

  3. Phylogeny of culturable cyanobacteria from Brazilian mangroves.

    PubMed

    Silva, Caroline Souza Pamplona; Genuário, Diego Bonaldo; Vaz, Marcelo Gomes Marçal Vieira; Fiore, Marli Fátima

    2014-03-01

    The cyanobacterial community from Brazilian mangrove ecosystems was examined using a culture-dependent method. Fifty cyanobacterial strains were isolated from soil, water and periphytic samples collected from Cardoso Island and Bertioga mangroves using specific cyanobacterial culture media. Unicellular, homocytous and heterocytous morphotypes were recovered, representing five orders, seven families and eight genera (Synechococcus, Cyanobium, Cyanobacterium, Chlorogloea, Leptolyngbya, Phormidium, Nostoc and Microchaete). All of these novel mangrove strains had their 16S rRNA gene sequenced and BLAST analysis revealed sequence identities ranging from 92.5 to 99.7% when they were compared with other strains available in GenBank. The results showed a high variability of the 16S rRNA gene sequences among the genotypes that was not associated with the morphologies observed. Phylogenetic analyses showed several branches formed exclusively by some of these novel 16S rRNA gene sequences. BLAST and phylogeny analyses allowed for the identification of Nodosilinea and Oxynema strains, genera already known to exhibit poor morphological diacritic traits. In addition, several Nostoc and Leptolyngbya morphotypes of the mangrove strains may represent new generic entities, as they were distantly affiliated with true genera clades. Non-ribosomal peptide synthetase, polyketide synthase, microcystin and saxitoxin genes were detected in 20.5%, 100%, 37.5% and 33.3%, respectively, of the 44 tested isolates. A total of 134 organic extracts obtained from 44 strains were tested against microorganisms, and 26% of the extracts showed some antimicrobial activity. This is the first polyphasic study of cultured cyanobacteria from Brazilian mangrove ecosystems using morphological, genetic and biological approaches. Copyright © 2014 Elsevier GmbH. All rights reserved.

  4. Cloning and characterization of the gene encoding the endopolygalacturonase-inhibiting protein (PGIP) of Phaseolus vulgaris L.

    PubMed

    Toubart, P; Desiderio, A; Salvi, G; Cervone, F; Daroda, L; De Lorenzo, G

    1992-05-01

    Polygalacturonase-inhibiting protein (PGIP) is a cell wall protein purified from hypocotyls of true bean (Phaseolus vulgaris L.). PGIP inhibits fungal endopolygalacturonases and is considered to be an important factor for plant resistance to phytopathogenic fungi (Albersheim and Anderson, 1971; Cervone et al., 1987). The amino acid sequences of the N-terminus and one internal tryptic peptide of the PGIP purified from P. vulgaris cv. Pinto were used to design redundant oligonucleotides that were successfully utilized as primers in a polymerase chain reaction (PCR) with total DNA of P. vulgaris as a template. A DNA band of 758 bp (a specific PCR amplification product of part of the gene coding for PGIP) was isolated and cloned. By using the 758-bp DNA as a hybridization probe, a lambda clone containing the PGIP gene was isolated from a genomic library of P. vulgaris cv. Saxa. The coding and immediate flanking regions of the PGIP gene, contained on a subcloned 3.3 kb SalI-SalI DNA fragment, were sequenced. A single, continuous ORF of 1026 nt (342 amino acids) was present in the genomic clone. The nucleotide and deduced amino acid sequences of the PGIP gene showed no significant similarity with any known databank sequence. Northern blotting analyses of poly(A)+ RNAs, isolated from various tissues of bean seedlings or from suspension-cultured bean cells, were also performed using the cloned PCR-generated DNA as a probe. A 1.2 kb transcript was detected in suspension-cultured cells and, to a lesser extent, in leaves, hypocotyls, and flowers. (ABSTRACT TRUNCATED AT 250 WORDS)

  5. Normalization, bias correction, and peak calling for ChIP-seq

    PubMed Central

    Diaz, Aaron; Park, Kiyoub; Lim, Daniel A.; Song, Jun S.

    2012-01-01

    Next-generation sequencing is rapidly transforming our ability to profile the transcriptional, genetic, and epigenetic states of a cell. In particular, sequencing DNA from the immunoprecipitation of protein-DNA complexes (ChIP-seq) and methylated DNA (MeDIP-seq) can reveal the locations of protein binding sites and epigenetic modifications. These approaches contain numerous biases which may significantly influence the interpretation of the resulting data. Rigorous computational methods for detecting and removing such biases are still lacking. Also, multi-sample normalization still remains an important open problem. This theoretical paper systematically characterizes the biases and properties of ChIP-seq data by comparing 62 separate publicly available datasets, using rigorous statistical models and signal processing techniques. Statistical methods for separating ChIP-seq signal from background noise, as well as correcting enrichment test statistics for sequence-dependent and sonication biases, are presented. Our method effectively separates reads into signal and background components prior to normalization, improving the signal-to-noise ratio. Moreover, most peak callers currently use a generic null model which suffers from low specificity at the sensitivity level requisite for detecting subtle, but true, ChIP enrichment. The proposed method of determining a cell type-specific null model, which accounts for cell type-specific biases, is shown to be capable of achieving a lower false discovery rate at a given significance threshold than current methods. PMID:22499706

  6. Quantitative Characterization of the T Cell Receptor Repertoire of Naïve and Memory Subsets Using an Integrated Experimental and Computational Pipeline Which Is Robust, Economical, and Versatile

    PubMed Central

    Oakes, Theres; Heather, James M.; Best, Katharine; Byng-Maddick, Rachel; Husovsky, Connor; Ismail, Mazlina; Joshi, Kroopa; Maxwell, Gavin; Noursadeghi, Mahdad; Riddell, Natalie; Ruehl, Tabea; Turner, Carolin T.; Uddin, Imran; Chain, Benny

    2017-01-01

    The T cell receptor (TCR) repertoire can provide a personalized biomarker for infectious and non-infectious diseases. We describe a protocol for amplifying, sequencing, and analyzing TCRs which is robust, sensitive, and versatile. The key experimental step is ligation of a single-stranded oligonucleotide to the 3′ end of the TCR cDNA. This allows amplification of all possible rearrangements using a single set of primers per locus. It also introduces a unique molecular identifier to label each starting cDNA molecule. This molecular identifier is used to correct for sequence errors and for effects of differential PCR amplification efficiency, thus producing more accurate measures of the true TCR frequency within the sample. This integrated experimental and computational pipeline is applied to the analysis of human memory and naive subpopulations, and results in consistent measures of diversity and inequality. After error correction, the distribution of TCR sequence abundance in all subpopulations followed a power law over a wide range of values. The power law exponent differed between naïve and memory populations, but was consistent between individuals. The integrated experimental and analysis pipeline we describe is appropriate to studies of T cell responses in a broad range of physiological and pathological contexts. PMID:29075258
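
    The role of the unique molecular identifier (UMI) described above can be sketched as follows: reads sharing a UMI are treated as copies of one starting cDNA molecule, a majority vote gives that molecule's sequence, and abundance is counted in molecules rather than reads. This is a minimal illustration with invented UMIs and sequences; the published pipeline's actual error-correction rules (for example, how errors within the UMI itself are handled) are not given in the abstract and are not modeled here.

```python
from collections import Counter, defaultdict

def collapse_by_umi(reads):
    """Map each UMI to the most frequent sequence observed with it.

    reads: iterable of (umi, sequence) pairs.
    The majority vote suppresses sequencing and PCR errors within a UMI group.
    """
    groups = defaultdict(Counter)
    for umi, seq in reads:
        groups[umi][seq] += 1
    return {umi: counts.most_common(1)[0][0] for umi, counts in groups.items()}

def tcr_abundance(reads):
    """Count distinct UMIs (starting molecules) per consensus sequence,
    which removes the effect of unequal PCR amplification."""
    return Counter(collapse_by_umi(reads).values())

# Invented example: six reads, three UMIs, one read carrying an error.
example_reads = [
    ("U1", "CASSL"), ("U1", "CASSL"), ("U1", "CASSP"),  # CASSP: likely error
    ("U2", "CASSL"),
    ("U3", "CAWSV"), ("U3", "CAWSV"),
]
abundance = tcr_abundance(example_reads)
```

    In the example, the raw read counts would suggest three distinct sequences, while UMI collapsing reports two sequences backed by two molecules and one molecule, respectively.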

  7. Directing an artificial zinc finger protein to new targets by fusion to a non-DNA-binding domain.

    PubMed

    Lim, Wooi F; Burdach, Jon; Funnell, Alister P W; Pearson, Richard C M; Quinlan, Kate G R; Crossley, Merlin

    2016-04-20

    Transcription factors are often regarded as having two separable components: a DNA-binding domain (DBD) and a functional domain (FD), with the DBD thought to determine target gene recognition. While this holds true for DNA binding in vitro, it appears that in vivo FDs can also influence genomic targeting. We fused the FD from the well-characterized transcription factor Krüppel-like Factor 3 (KLF3) to an artificial zinc finger (AZF) protein originally designed to target the Vascular Endothelial Growth Factor-A (VEGF-A) gene promoter. We compared genome-wide occupancy of the KLF3FD-AZF fusion to that observed with AZF. AZF bound to the VEGF-A promoter as predicted, but was also found to occupy approximately 25,000 other sites, a large number of which contained the expected AZF recognition sequence, GCTGGGGGC. Interestingly, addition of the KLF3 FD re-distributes the fusion protein to new sites, with total DNA occupancy detected at around 50,000 sites. A portion of these sites correspond to known KLF3-bound regions, while others contained sequences similar but not identical to the expected AZF recognition sequence. These results show that FDs can influence and may be useful in directing AZF DNA-binding proteins to specific targets and provide insights into how natural transcription factors operate. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
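
    Checking whether an occupied site contains the expected AZF recognition sequence amounts to an exact-match scan on both strands. The sketch below shows one way to do this; only the motif GCTGGGGGC comes from the abstract, while the function names and the test sequence are illustrative, and the authors' actual motif analysis is not described.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")

def reverse_complement(seq):
    """Return the reverse complement of a DNA string (A, C, G, T only)."""
    return seq.translate(_COMPLEMENT)[::-1]

def find_motif_sites(sequence, motif="GCTGGGGGC"):
    """Return sorted (start, strand) pairs for exact motif matches.

    start is the 0-based position on the forward strand; "-" means the
    motif matches the reverse strand at that position. Overlapping matches
    are reported. Assumes the motif is not its own reverse complement.
    """
    seq = sequence.upper()
    hits = []
    for strand, pattern in (("+", motif), ("-", reverse_complement(motif))):
        start = seq.find(pattern)
        while start != -1:
            hits.append((start, strand))
            start = seq.find(pattern, start + 1)
    return sorted(hits)

# Illustrative sequence with one forward and one reverse-strand site.
demo = "AAA" + "GCTGGGGGC" + "TT" + reverse_complement("GCTGGGGGC") + "A"
sites = find_motif_sites(demo)
```

    Sites described in the abstract as "similar but not identical" to the recognition sequence would need a mismatch-tolerant or position-weight-matrix scan instead of exact matching.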

  8. Detection and Analysis of Circular RNAs by RT-PCR.

    PubMed

    Panda, Amaresh C; Gorospe, Myriam

    2018-03-20

    Gene expression in eukaryotic cells is tightly regulated at the transcriptional and posttranscriptional levels. Posttranscriptional processes, including pre-mRNA splicing, mRNA export, mRNA turnover, and mRNA translation, are controlled by RNA-binding proteins (RBPs) and noncoding (nc)RNAs. The vast family of ncRNAs comprises diverse regulatory RNAs, such as microRNAs and long noncoding (lnc)RNAs, but also the poorly explored class of circular (circ)RNAs. Although first discovered more than three decades ago by electron microscopy, only the advent of high-throughput RNA-sequencing (RNA-seq) and the development of innovative bioinformatic pipelines have begun to allow the systematic identification of circRNAs (Szabo and Salzman, 2016; Panda et al., 2017b; Panda et al., 2017c). However, the validation of true circRNAs identified by RNA sequencing requires other molecular biology techniques including reverse transcription (RT) followed by conventional or quantitative (q) polymerase chain reaction (PCR), and Northern blot analysis (Jeck and Sharpless, 2014). RT-qPCR analysis of circular RNAs using divergent primers has been widely used for the detection, validation, and sometimes quantification of circRNAs (Abdelmohsen et al., 2015 and 2017; Panda et al., 2017b). As detailed here, divergent primers designed to span the circRNA backsplice junction sequence can specifically amplify the circRNAs and not the counterpart linear RNA. In sum, RT-PCR analysis using divergent primers allows direct detection and quantification of circRNAs.

  9. Exome Sequencing Identifies Potentially Druggable Mutations in Nasopharyngeal Carcinoma.

    PubMed

    Chow, Yock Ping; Tan, Lu Ping; Chai, San Jiun; Abdul Aziz, Norazlin; Choo, Siew Woh; Lim, Paul Vey Hong; Pathmanathan, Rajadurai; Mohd Kornain, Noor Kaslina; Lum, Chee Lun; Pua, Kin Choo; Yap, Yoke Yeow; Tan, Tee Yong; Teo, Soo Hwang; Khoo, Alan Soo-Beng; Patel, Vyomesh

    2017-03-03

    In this study, we first performed whole exome sequencing of DNA from 10 untreated and clinically annotated fresh frozen nasopharyngeal carcinoma (NPC) biopsies and matched bloods to identify somatically mutated genes that may be amenable to targeted therapeutic strategies. We identified a total of 323 mutations which were either non-synonymous (n = 238) or synonymous (n = 85). Furthermore, our analysis revealed genes in key cancer pathways (DNA repair, cell cycle regulation, apoptosis, immune response, lipid signaling) were mutated, of which those in the lipid-signaling pathway were the most enriched. We next extended our analysis on a prioritized sub-set of 37 mutated genes plus top 5 mutated cancer genes listed in COSMIC using a custom designed HaloPlex target enrichment panel with an additional 88 NPC samples. Our analysis identified 160 additional non-synonymous mutations in 37/42 genes in 66/88 samples. Of these, 99/160 mutations within potentially druggable pathways were further selected for validation. Sanger sequencing revealed that 77/99 variants were true positives, giving an accuracy of 78%. Taken together, our study indicated that ~72% (n = 71/98) of NPC samples harbored mutations in one of the four cancer pathways (EGFR-PI3K-Akt-mTOR, NOTCH, NF-κB, DNA repair) which may be potentially useful as predictive biomarkers of response to matched targeted therapies.

  10. The Cucurbitaceae of India: Accepted names, synonyms, geographic distribution, and information on images and DNA sequences

    PubMed Central

    Renner, Susanne S.; Pandey, Arun K.

    2013-01-01

    Abstract The most recent critical checklists of the Cucurbitaceae of India are 30 years old. Since then, botanical exploration, online availability of specimen images and taxonomic literature, and molecular-phylogenetic studies have led to modified taxon boundaries and geographic ranges. We present a checklist of the Cucurbitaceae of India that treats 400 relevant names and provides information on the collecting locations and herbaria for all types. We accept 94 species (10 of them endemic) in 31 genera. For accepted species, we provide their geographic distribution inside and outside India, links to online images of herbarium or living specimens, and information on publicly available DNA sequences to highlight gaps in the current understanding of Indian cucurbit diversity. Of the 94 species, 79% have DNA sequences in GenBank, albeit rarely from Indian material. The most species-rich genera are Trichosanthes with 22 species, Cucumis with 11 (all but two wild), Momordica with 8, and Zehneria with 5. From an evolutionary point of view, India is of special interest because it harbors a wide range of lineages, many of them relatively old and phylogenetically isolated. Phytogeographically, the north eastern and peninsular regions are richest in species, while the Jammu Kashmir and Himachal regions have few Cucurbitaceae. Our checklist probably underestimates the true diversity of Indian Cucurbitaceae, but should help focus efforts towards the least known species and regions. PMID:23717193

  11. Magnetic resonance imaging of pulmonary infection in immunocompromised children: comparison with multidetector computed tomography.

    PubMed

    Ozcan, H Nursun; Gormez, Ayşegul; Ozsurekci, Yasemin; Karakaya, Jale; Oguz, Berna; Unal, Sule; Cetin, Mualla; Ceyhan, Mehmet; Haliloglu, Mithat

    2017-02-01

    Computed tomography (CT) is commonly used to detect pulmonary infection in immunocompromised children. To compare MRI and multidetector CT findings of pulmonary abnormalities in immunocompromised children. Seventeen neutropaenic children (6 girls; ages 2-18 years) were included. Non-contrast-enhanced CT was performed with a 64-detector CT scanner. Axial and coronal non-enhanced thoracic MRI was performed using a 1.5-T scanner within 24 h of the CT examination (true fast imaging with steady-state free precession, fat-saturated T2-weighted turbo spin echo with motion correction, T2-weighted half-Fourier single-shot turbo spin echo [HASTE], fat-saturated T1-weighted spoiled gradient echo). Pulmonary abnormalities (nodules, consolidations, ground glass opacities, atelectasis, pleural effusion and lymph nodes) were evaluated and compared among MRI sequences and between MRI and CT. The relationship between MRI sequences and nodule sizes was examined by chi-square test. Of 256 CT lesions, 207 (81%, 95% confidence interval [CI] 76-85%) were detected at MRI. Of 202 CT-detected nodules, 157 (78%, 95% CI 71-83%) were seen at motion-corrected MRI. Of the 1-5-mm nodules, 69% were detected by motion-corrected T2-weighted MRI and 38% by HASTE MRI. Sensitivity of MRI (both axial fat-saturated T2-weighted turbo spin echo with variable phase encoding directions (BLADE) images and HASTE sequences) to detect pulmonary abnormalities is promising.
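
    The detection rates above are proportions with 95% confidence intervals. The abstract does not say which interval method was used; the sketch below assumes the Wilson score interval, one common choice, and applies it to the 207 of 256 lesions reported. The function name is hypothetical.

```python
import math


def wilson_interval(successes, n, z=1.96):
    """Wilson score interval for a binomial proportion.

    z = 1.96 corresponds to an approximately 95% two-sided interval.
    """
    p = successes / n
    denom = 1 + z**2 / n
    center = (p + z**2 / (2 * n)) / denom
    half_width = z * math.sqrt(p * (1 - p) / n + z**2 / (4 * n**2)) / denom
    return center - half_width, center + half_width


# Lesions detected at MRI out of those seen at CT, as given in the abstract.
low, high = wilson_interval(207, 256)
```

    Under this assumption the interval for 207/256 is roughly 0.756 to 0.852, which rounds to the reported 76-85%; that agreement is suggestive but does not confirm the authors' method.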

  12. Nucleic Acid Extraction from Synthetic Mars Analog Soils for in situ Life Detection

    PubMed Central

    Mojarro, Angel; Ruvkun, Gary; Zuber, Maria T.

    2017-01-01

    Abstract Biological informational polymers such as nucleic acids have the potential to provide unambiguous evidence of life beyond Earth. To this end, we are developing an automated in situ life-detection instrument that integrates nucleic acid extraction and nanopore sequencing: the Search for Extra-Terrestrial Genomes (SETG) instrument. Our goal is to isolate and determine the sequence of nucleic acids from extant or preserved life on Mars, if, for example, there is common ancestry to life on Mars and Earth. As is true of metagenomic analysis of terrestrial environmental samples, the SETG instrument must isolate nucleic acids from crude samples and then determine the DNA sequence of the unknown nucleic acids. Our initial DNA extraction experiments resulted in low to undetectable amounts of DNA due to soil chemistry–dependent soil-DNA interactions, namely adsorption to mineral surfaces, binding to divalent/trivalent cations, destruction by iron redox cycling, and acidic conditions. Subsequently, we developed soil-specific extraction protocols that increase DNA yields through a combination of desalting, utilization of competitive binders, and promotion of anaerobic conditions. Our results suggest that a combination of desalting and utilizing competitive binders may establish a “universal” nucleic acid extraction protocol suitable for analyzing samples from diverse soils on Mars. Key Words: Life-detection instruments—Nucleic acids—Mars—Panspermia. Astrobiology 17, 747–760. PMID:28704064

  13. Induction log responses to layered, dipping, and anisotropic formations: Induction log shoulder-bed corrections to anisotropic formations and the effect of shale anisotropy in thinly laminated sand/shale sequences

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Hagiwara, Teruhiko

    1996-12-31

    Induction log responses to layered, dipping, and anisotropic formations are examined analytically. The analytical model is especially helpful in understanding induction log responses to thinly laminated binary formations, such as sand/shale sequences, that exhibit macroscopically anisotropic resistivity. Two applications of the analytical model are discussed. In one application we examine special induction log shoulder-bed corrections for use when thin anisotropic beds are encountered. It is known that thinly laminated sand/shale sequences act as macroscopically anisotropic formations. Hydrocarbon-bearing formations also act as macroscopically anisotropic formations when they consist of alternating layers of different grain-size distributions. When such formations are thick, induction logs accurately read the macroscopic conductivity, from which the hydrocarbon saturation in the formations can be computed. When the laminated formations are not thick, proper shoulder-bed corrections (or thin-bed corrections) should be applied to obtain the true macroscopic formation conductivity and to estimate the hydrocarbon saturation more accurately. The analytical model is used to calculate the thin-bed effect and to evaluate the shoulder-bed corrections. We will show that the formation resistivity and hence the hydrocarbon saturation are greatly overestimated when the anisotropy effect is not accounted for and conventional shoulder-bed corrections are applied to the log responses from such laminated formations.

  14. Integration of adeno-associated virus vectors in CD34+ human hematopoietic progenitor cells after transduction.

    PubMed

    Fisher-Adams, G; Wong, K K; Podsakoff, G; Forman, S J; Chatterjee, S

    1996-07-15

    Gene transfer vectors based on adeno-associated virus (AAV) appear promising because of their high transduction frequencies regardless of cell cycle status and ability to integrate into chromosomal DNA. We tested AAV-mediated gene transfer into a panel of human bone marrow or umbilical cord-derived CD34+ hematopoietic progenitor cells, using vectors encoding several transgenes under the control of viral and cellular promoters. Gene transfer was evaluated by (1) chromosomal integration of vector sequences and (2) analysis of transgene expression. Southern hybridization and fluorescence in situ hybridization analysis of transduced CD34 genomic DNA showed the presence of integrated vector sequences in chromosomal DNA in a portion of transduced cells and showed that integrated vector sequences were replicated along with cellular DNA during mitosis. Transgene expression in transduced CD34 cells in suspension cultures and in myeloid colonies differentiating in vitro from transduced CD34 cells approximated that predicted by the multiplicity of transduction. This was true in CD34 cells from different donors, regardless of the transgene or selective pressure. Comparisons of CD34 cell transduction either before or after cytokine stimulation showed similar gene transfer frequencies. Our findings suggest that AAV transduction of CD34+ hematopoietic progenitor cells is efficient, can lead to stable integration in a population of transduced cells, and may therefore provide the basis for safe and efficient ex vivo gene therapy of the hematopoietic system.

  15. Prenatal cranial ossification of the humpback whale (Megaptera novaeangliae).

    PubMed

    Hampe, Oliver; Franke, Helena; Hipsley, Christy A; Kardjilov, Nikolay; Müller, Johannes

    2015-05-01

    Being descendants of small terrestrial ungulate mammals, whales underwent enormous transformations during their evolutionary history, that is, extensive changes in anatomy, physiology, and behavior evolved during secondary adaptations to life in water. However, still only little is known about whale ontogenetic development, which would help to identify the timing and sequence of critical evolutionary events, such as modification of the cetacean ear. This is particularly true for baleen whales (Mysticeti), the group including the humpback whale Megaptera novaeangliae. We use high-resolution X-ray computed tomography to reinvestigate humpback whale fetuses from the Kükenthal collection at the Museum für Naturkunde, Berlin, thus extending historic descriptions of their skeletogenesis and providing for the first time sequences of cranial ossification for this species. Principally, the ossification sequence of prenatal Megaptera follows a typical mammalian pattern with the anterior dermal bones being the first ossifying elements in the skull, starting with the dentary. In contrast to other mammals, the ectotympanic bone ossifies at an early stage. Alveolar structure can be observed in both the maxillae and dentaries in these early prenatal specimens but evidence for teeth is lacking. Although the possibility of obtaining new embryological material is unlikely due to conservation issues, our study shows that reexamination of existing specimens employing new technologies still holds promise for filling gaps in our knowledge of whale evolution and ontogeny. © 2015 Wiley Periodicals, Inc.

  16. Sequence-Based Typing of Legionella pneumophila Serogroup 1 Offers the Potential for True Portability in Legionellosis Outbreak Investigation

    PubMed Central

    Gaia, Valeria; Fry, Norman K.; Harrison, Timothy G.; Peduzzi, Raffaele

    2003-01-01

    Seven gene loci of Legionella pneumophila serogroup 1 were analyzed as potential epidemiological typing markers to aid in the investigation of legionella outbreaks. The genes chosen included four likely to be selectively neutral (acn, groES, groEL, and recA) and three likely to be under selective pressure (flaA, mompS, and proA). Oligonucleotide primers were designed to amplify 279- to 763-bp fragments from each gene. Initial sequence analysis of the seven loci from 10 well-characterized isolates of L. pneumophila serogroup 1 gave excellent reproducibility (R) and epidemiological concordance (E) values (R = 1.00; E = 1.00). The three loci showing greatest discrimination and nucleotide variation, flaA, mompS, and proA, were chosen for further study. Indices of discrimination (D) were calculated using a panel of 79 unrelated isolates. Single loci gave D values ranging from 0.767 to 0.857, and a combination of all three loci resulted in a D value of 0.924. When all three loci were combined with monoclonal antibody subgrouping, the D value was 0.971. Sequence-based typing of L. pneumophila serogroup 1 using only three loci is epidemiologically concordant and highly discriminatory and has the potential to become the new “gold standard” for the epidemiological typing of L. pneumophila. PMID:12843023
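
    The index of discrimination (D) reported above can be computed from the type assigned to each unrelated isolate. The sketch below assumes the Hunter-Gaston formulation that is widely used for typing schemes, D = 1 - sum(n_j(n_j - 1)) / (N(N - 1)); the abstract does not state the formula, and the function name and example type labels are hypothetical.

```python
from collections import Counter


def discrimination_index(types):
    """Hunter-Gaston index of discrimination for a list of type labels.

    Interpretable as the probability that two isolates drawn at random
    without replacement are assigned to different types.
    """
    n = len(types)
    if n < 2:
        raise ValueError("at least two isolates are required")
    counts = Counter(types)
    same_type_pairs = sum(c * (c - 1) for c in counts.values())
    return 1 - same_type_pairs / (n * (n - 1))


# Hypothetical allele profiles for four isolates; two share a type.
d_value = discrimination_index(["1,4,3", "1,4,3", "2,6,1", "3,10,1"])
```

    Combining loci (or adding monoclonal antibody subgroups) into a single label per isolate, as in the hypothetical profiles here, splits types further and can only keep D the same or raise it, consistent with the trend in the abstract.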

  17. Population Genomics of Francisella tularensis subsp. holarctica and its Implication on the Eco-Epidemiology of Tularemia in Switzerland

    PubMed Central

    Wittwer, Matthias; Altpeter, Ekkehard; Pilo, Paola; Gygli, Sebastian M.; Beuret, Christian; Foucault, Frederic; Ackermann-Gäumann, Rahel; Karrer, Urs; Jacob, Daniela; Grunow, Roland; Schürch, Nadia

    2018-01-01

    Whole genome sequencing (WGS) methods provide new possibilities in the field of molecular epidemiology. This is particularly true for monomorphic organisms where the discriminatory power of traditional methods (e.g., restriction enzyme length polymorphism typing, multi locus sequence typing etc.) is inadequate to elucidate complex disease transmission patterns, as well as resolving the phylogeny at high resolution on a micro-geographic scale. In this study, we present insights into the population structure of Francisella tularensis subsp. holarctica, the causative agent of tularemia in Switzerland. A total of 59 Fth isolates were obtained from castor bean ticks (Ixodes ricinus), animals and humans and a high resolution phylogeny was inferred using WGS methods. The majority of the Fth population in Switzerland belongs to the west European B.11 clade and shows an extraordinary genetic diversity underlining the old evolutionary history of the pathogen in the alpine region. Moreover, a new B.11 subclade was identified which was not described so far. The combined analysis of the epidemiological data of human tularemia cases with the whole genome sequences of the 59 isolates provide evidence that ticks play a pivotal role in transmitting Fth to humans and other vertebrates in Switzerland. This is further underlined by the correlation of disease risk estimates with climatic and ecological factors influencing the survival of ticks. PMID:29623260

  18. Gene expression distribution deconvolution in single-cell RNA sequencing.

    PubMed

    Wang, Jingshu; Huang, Mo; Torre, Eduardo; Dueck, Hannah; Shaffer, Sydney; Murray, John; Raj, Arjun; Li, Mingyao; Zhang, Nancy R

    2018-06-26

    Single-cell RNA sequencing (scRNA-seq) enables the quantification of each gene's expression distribution across cells, thus allowing the assessment of the dispersion, nonzero fraction, and other aspects of its distribution beyond the mean. These statistical characterizations of the gene expression distribution are critical for understanding expression variation and for selecting marker genes for population heterogeneity. However, scRNA-seq data are noisy, with each cell typically sequenced at low coverage, thus making it difficult to infer properties of the gene expression distribution from raw counts. Based on a reexamination of nine public datasets, we propose a simple technical noise model for scRNA-seq data with unique molecular identifiers (UMI). We develop deconvolution of single-cell expression distribution (DESCEND), a method that deconvolves the true cross-cell gene expression distribution from observed scRNA-seq counts, leading to improved estimates of properties of the distribution such as dispersion and nonzero fraction. DESCEND can adjust for cell-level covariates such as cell size, cell cycle, and batch effects. DESCEND's noise model and estimation accuracy are further evaluated through comparisons to RNA FISH data, through data splitting and simulations and through its effectiveness in removing known batch effects. We demonstrate how DESCEND can clarify and improve downstream analyses such as finding differentially expressed genes, identifying cell types, and selecting differentiation markers. Copyright © 2018 the Author(s). Published by PNAS.

  19. Exome Sequencing Identifies Potentially Druggable Mutations in Nasopharyngeal Carcinoma

    PubMed Central

    Chow, Yock Ping; Tan, Lu Ping; Chai, San Jiun; Abdul Aziz, Norazlin; Choo, Siew Woh; Lim, Paul Vey Hong; Pathmanathan, Rajadurai; Mohd Kornain, Noor Kaslina; Lum, Chee Lun; Pua, Kin Choo; Yap, Yoke Yeow; Tan, Tee Yong; Teo, Soo Hwang; Khoo, Alan Soo-Beng; Patel, Vyomesh

    2017-01-01

    In this study, we first performed whole exome sequencing of DNA from 10 untreated and clinically annotated fresh frozen nasopharyngeal carcinoma (NPC) biopsies and matched bloods to identify somatically mutated genes that may be amenable to targeted therapeutic strategies. We identified a total of 323 mutations which were either non-synonymous (n = 238) or synonymous (n = 85). Furthermore, our analysis revealed genes in key cancer pathways (DNA repair, cell cycle regulation, apoptosis, immune response, lipid signaling) were mutated, of which those in the lipid-signaling pathway were the most enriched. We next extended our analysis on a prioritized sub-set of 37 mutated genes plus top 5 mutated cancer genes listed in COSMIC using a custom designed HaloPlex target enrichment panel with an additional 88 NPC samples. Our analysis identified 160 additional non-synonymous mutations in 37/42 genes in 66/88 samples. Of these, 99/160 mutations within potentially druggable pathways were further selected for validation. Sanger sequencing revealed that 77/99 variants were true positives, giving an accuracy of 78%. Taken together, our study indicated that ~72% (n = 71/98) of NPC samples harbored mutations in one of the four cancer pathways (EGFR-PI3K-Akt-mTOR, NOTCH, NF-κB, DNA repair) which may be potentially useful as predictive biomarkers of response to matched targeted therapies. PMID:28256603

  20. A Scalable Approach for Protein False Discovery Rate Estimation in Large Proteomic Data Sets.

    PubMed

    Savitski, Mikhail M; Wilhelm, Mathias; Hahne, Hannes; Kuster, Bernhard; Bantscheff, Marcus

    2015-09-01

    Calculating the number of confidently identified proteins and estimating false discovery rate (FDR) is a challenge when analyzing very large proteomic data sets such as entire human proteomes. Biological and technical heterogeneity in proteomic experiments further add to the challenge and there are strong differences in opinion regarding the conceptual validity of a protein FDR and no consensus regarding the methodology for protein FDR determination. There are also limitations inherent to the widely used classic target-decoy strategy that particularly show when analyzing very large data sets and that lead to a strong over-representation of decoy identifications. In this study, we investigated the merits of the classic, as well as a novel target-decoy-based protein FDR estimation approach, taking advantage of a heterogeneous data collection comprised of ∼19,000 LC-MS/MS runs deposited in ProteomicsDB (https://www.proteomicsdb.org). The "picked" protein FDR approach treats target and decoy sequences of the same protein as a pair rather than as individual entities and chooses either the target or the decoy sequence depending on which receives the highest score. We investigated the performance of this approach in combination with q-value based peptide scoring to normalize sample-, instrument-, and search engine-specific differences. The "picked" target-decoy strategy performed best when protein scoring was based on the best peptide q-value for each protein yielding a stable number of true positive protein identifications over a wide range of q-value thresholds. We show that this simple and unbiased strategy eliminates a conceptual issue in the commonly used "classic" protein FDR approach that causes overprediction of false-positive protein identification in large data sets. The approach scales from small to very large data sets without losing performance, consistently increases the number of true-positive protein identifications and is readily implemented in proteomics analysis software. © 2015 by The American Society for Biochemistry and Molecular Biology, Inc.
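
    The pairing step of the "picked" approach can be sketched as follows. The sketch assumes each protein already has one target score and one decoy score where higher is better (for instance a transformed best peptide q-value); the function name, the tie-breaking rule in favor of the target, the decoys/targets FDR estimate, and the toy scores are assumptions for illustration, not the published implementation.

```python
def picked_protein_fdr(pairs):
    """Estimate protein-level FDR with a picked target-decoy rule.

    pairs maps protein id -> (target_score, decoy_score). For each
    protein only the higher-scoring member of the pair is kept, so a
    target and its own decoy can never both be counted.
    Returns (protein, is_decoy, score, fdr) tuples by descending score.
    """
    picked = []
    for protein, (target_score, decoy_score) in pairs.items():
        if target_score >= decoy_score:  # ties go to the target (assumption)
            picked.append((target_score, False, protein))
        else:
            picked.append((decoy_score, True, protein))
    picked.sort(key=lambda item: item[0], reverse=True)

    results = []
    targets = decoys = 0
    for score, is_decoy, protein in picked:
        if is_decoy:
            decoys += 1
        else:
            targets += 1
        # Running estimate: decoys accepted per target accepted so far.
        fdr = decoys / max(targets, 1)
        results.append((protein, is_decoy, score, fdr))
    return results


# Hypothetical scores for four proteins.
fdr_table = picked_protein_fdr({
    "P1": (9.0, 1.0),
    "P2": (7.0, 2.0),
    "P3": (3.0, 5.0),
    "P4": (4.0, 0.5),
})
```

    In a classic strategy, low-scoring decoys of proteins that are truly present would still be counted individually; keeping only one member per pair is what removes that source of decoy over-representation.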

  1. Molecular Surveillance of True Nontypeable Haemophilus influenzae: An Evaluation of PCR Screening Assays

    PubMed Central

    Binks, Michael J.; Temple, Beth; Kirkham, Lea-Ann; Wiertsema, Selma P.; Dunne, Eileen M.; Richmond, Peter C.; Marsh, Robyn L.; Leach, Amanda J.; Smith-Vaughan, Heidi C.

    2012-01-01

    Background Unambiguous identification of nontypeable Haemophilus influenzae (NTHi) is not possible by conventional microbiology. Molecular characterisation of phenotypically defined NTHi isolates suggests that up to 40% are Haemophilus haemolyticus (Hh); however, the genetic similarity of NTHi and Hh limits the power of simple molecular techniques such as PCR for species discrimination. Methodology/Principal Findings Here we assess the ability of previously published and novel PCR-based assays to identify true NTHi. Sixty phenotypic NTHi isolates, classified by a dual 16S rRNA gene PCR algorithm as NTHi (n = 22), Hh (n = 27) or equivocal (n = 11), were further characterised by sequencing of the 16S rRNA and recA genes then interrogated by PCR-based assays targeting the omp P2, omp P6, lgtC, hpd, 16S rRNA, fucK and iga genes. The sequencing data and PCR results were used to define NTHi for this study. Two hpd real time PCR assays (hpd#1 and hpd#3) and the conventional iga PCR assay were equally efficient at differentiating study-defined NTHi from Hh, each with a receiver operator characteristic curve area of 0.90 [0.83; 0.98]. The hpd#1 and hpd#3 assays were completely specific against a panel of common respiratory bacteria, unlike the iga PCR, and the hpd#3 assay was able to detect below 10 copies per reaction. Conclusions/Significance Our data suggest an evolutionary continuum between NTHi and Hh and therefore no single gene target could completely differentiate NTHi from Hh. The hpd#3 real time PCR assay proved to be the superior method for discrimination of NTHi from closely related Haemophilus species with the added potential for quantification of H. influenzae directly from specimens. We suggest the hpd#3 assay would be suitable for routine NTHi surveillance and to assess the impact of antibiotics and vaccines, on H. influenzae carriage rates, carriage density, and disease. PMID:22470516

  2. Molecular surveillance of true nontypeable Haemophilus influenzae: an evaluation of PCR screening assays.

    PubMed

    Binks, Michael J; Temple, Beth; Kirkham, Lea-Ann; Wiertsema, Selma P; Dunne, Eileen M; Richmond, Peter C; Marsh, Robyn L; Leach, Amanda J; Smith-Vaughan, Heidi C

    2012-01-01

    Unambiguous identification of nontypeable Haemophilus influenzae (NTHi) is not possible by conventional microbiology. Molecular characterisation of phenotypically defined NTHi isolates suggests that up to 40% are Haemophilus haemolyticus (Hh); however, the genetic similarity of NTHi and Hh limits the power of simple molecular techniques such as PCR for species discrimination. Here we assess the ability of previously published and novel PCR-based assays to identify true NTHi. Sixty phenotypic NTHi isolates, classified by a dual 16S rRNA gene PCR algorithm as NTHi (n = 22), Hh (n = 27) or equivocal (n = 11), were further characterised by sequencing of the 16S rRNA and recA genes then interrogated by PCR-based assays targeting the omp P2, omp P6, lgtC, hpd, 16S rRNA, fucK and iga genes. The sequencing data and PCR results were used to define NTHi for this study. Two hpd real time PCR assays (hpd#1 and hpd#3) and the conventional iga PCR assay were equally efficient at differentiating study-defined NTHi from Hh, each with a receiver operator characteristic curve area of 0.90 [0.83; 0.98]. The hpd#1 and hpd#3 assays were completely specific against a panel of common respiratory bacteria, unlike the iga PCR, and the hpd#3 assay was able to detect below 10 copies per reaction. Our data suggest an evolutionary continuum between NTHi and Hh and therefore no single gene target could completely differentiate NTHi from Hh. The hpd#3 real time PCR assay proved to be the superior method for discrimination of NTHi from closely related Haemophilus species with the added potential for quantification of H. influenzae directly from specimens. We suggest the hpd#3 assay would be suitable for routine NTHi surveillance and to assess the impact of antibiotics and vaccines, on H. influenzae carriage rates, carriage density, and disease.

  3. A Scalable Approach for Protein False Discovery Rate Estimation in Large Proteomic Data Sets

    PubMed Central

    Savitski, Mikhail M.; Wilhelm, Mathias; Hahne, Hannes; Kuster, Bernhard; Bantscheff, Marcus

    2015-01-01

    Calculating the number of confidently identified proteins and estimating false discovery rate (FDR) is a challenge when analyzing very large proteomic data sets such as entire human proteomes. Biological and technical heterogeneity in proteomic experiments further add to the challenge and there are strong differences in opinion regarding the conceptual validity of a protein FDR and no consensus regarding the methodology for protein FDR determination. There are also limitations inherent to the widely used classic target–decoy strategy that particularly show when analyzing very large data sets and that lead to a strong over-representation of decoy identifications. In this study, we investigated the merits of the classic, as well as a novel target–decoy-based protein FDR estimation approach, taking advantage of a heterogeneous data collection comprised of ∼19,000 LC-MS/MS runs deposited in ProteomicsDB (https://www.proteomicsdb.org). The “picked” protein FDR approach treats target and decoy sequences of the same protein as a pair rather than as individual entities and chooses either the target or the decoy sequence depending on which receives the highest score. We investigated the performance of this approach in combination with q-value based peptide scoring to normalize sample-, instrument-, and search engine-specific differences. The “picked” target–decoy strategy performed best when protein scoring was based on the best peptide q-value for each protein yielding a stable number of true positive protein identifications over a wide range of q-value thresholds. We show that this simple and unbiased strategy eliminates a conceptual issue in the commonly used “classic” protein FDR approach that causes overprediction of false-positive protein identification in large data sets. The approach scales from small to very large data sets without losing performance, consistently increases the number of true-positive protein identifications and is readily implemented in proteomics analysis software. PMID:25987413

  4. Double jeopardy revisited: clinical decision making in unstable patients with, thoraco-abdominal stab wounds and, potential injuries in multiple body cavities.

    PubMed

    Clarke, Damian L; Gall, Tamara M H; Thomson, Sandie R

    2011-05-01

    In the setting of the hypovolaemic patient with a thoraco-abdominal stab wound and potential injuries in both the chest and abdomen, deciding which cavity to explore first may be difficult. Opening the incorrect body cavity can delay control of tamponade or haemorrhage and exacerbate hypothermia and fluid shifts. This situation has been described as one of double jeopardy. All stab victims from July 2007 to July 2009 requiring a thoracotomy and laparotomy at the same operation were identified from a database. Demographics, site and nature of injuries, admission observations and investigations as well as operative sequence were recorded. Correct sequencing was defined as first opening the cavity with most lethal injury. Incorrect sequencing was defined as opening a cavity and finding either no injury or an injury of less severity than a simultaneous injury in the unopened cavity. The primary outcome was survival or death. Sixteen stab victims underwent thoracotomy and laparotomy during the same operation. All were male with an age range of 18–40 (mean/median 27). Median systolic blood pressure on presentation was 90 mm Hg (quartile range 80–90 mm Hg). Median base excess was 6.5 (quartile range 12 to 2.2). All the deaths were the result of cardiac injuries. Incorrect sequencing occurred in four patients (25%). In this group there were four negative abdominal explorations prior to thoracotomy with two deaths. There was one death in the correct sequencing group. Incorrect sequencing in stab victims who require both thoracotomy and laparotomy at the same sitting is associated with a high mortality. This is especially true when the abdomen is incorrectly entered first whilst the life threatening pathology is in the chest. Clinical signs may be confusing, leading to incorrect sequencing of exploration. The common causes for confusion include failure to appreciate that cardiac tamponade does not present with bleeding and difficulty in assessing peritonism in an unstable patient with multiple stab wounds. In the setting of the unstable patient with stab wounds and suspected dual cavity injuries the chest should be opened first followed by the abdomen. © 2010 Elsevier Ltd. All rights reserved.

  5. Comparison of mapping algorithms used in high-throughput sequencing: application to Ion Torrent data

    PubMed Central

    2014-01-01

    Background: The rapid evolution in high-throughput sequencing (HTS) technologies has opened up new perspectives in several research fields and led to the production of large volumes of sequence data. A fundamental step in HTS data analysis is the mapping of reads onto reference sequences. Choosing a suitable mapper for a given technology and a given application is a subtle task because of the difficulty of evaluating mapping algorithms. Results: In this paper, we present a benchmark procedure to compare mapping algorithms used in HTS using both real and simulated datasets and considering four evaluation criteria: computational resource and time requirements, robustness of mapping, ability to report positions for reads in repetitive regions, and ability to retrieve true genetic variation positions. To measure robustness, we introduced a new definition for a correctly mapped read taking into account not only the expected start position of the read but also the end position and the number of indels and substitutions. We developed CuReSim, a new read simulator, that is able to generate customized benchmark data for any kind of HTS technology by adjusting parameters to the error types. CuReSim and CuReSimEval, a tool to evaluate the mapping quality of the CuReSim simulated reads, are freely available. We applied our benchmark procedure to evaluate 14 mappers in the context of whole genome sequencing of small genomes with Ion Torrent data for which such a comparison has not yet been established. Conclusions: A benchmark procedure to compare HTS data mappers is introduced with a new definition for the mapping correctness as well as tools to generate simulated reads and evaluate mapping quality. The application of this procedure to Ion Torrent data from the whole genome sequencing of small genomes has allowed us to validate our benchmark procedure and demonstrate that it is helpful for selecting a mapper based on the intended application, questions to be addressed, and the technology used. This benchmark procedure can be used to evaluate existing or in-development mappers as well as to optimize parameters of a chosen mapper for any application and any sequencing platform. PMID:24708189
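
    The stricter correctness definition described in this abstract can be illustrated with a minimal sketch. The `Alignment` record, the function name and the two tolerance parameters are hypothetical and are not taken from CuReSim or CuReSimEval; the sketch only shows the idea of checking the end position and the edit counts in addition to the start position.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Alignment:
    """Hypothetical record for one read placement (not CuReSimEval's format)."""
    start: int          # leftmost reference position
    end: int            # rightmost reference position
    indels: int         # inserted plus deleted bases
    substitutions: int  # mismatched bases


def is_correctly_mapped(truth: Alignment, reported: Alignment,
                        pos_tol: int = 0, edit_tol: int = 0) -> bool:
    """True when start, end and both edit counts agree within the tolerances."""
    return (abs(reported.start - truth.start) <= pos_tol
            and abs(reported.end - truth.end) <= pos_tol
            and abs(reported.indels - truth.indels) <= edit_tol
            and abs(reported.substitutions - truth.substitutions) <= edit_tol)


truth = Alignment(start=1000, end=1199, indels=2, substitutions=1)
same_start_only = Alignment(start=1000, end=1205, indels=8, substitutions=1)
print(is_correctly_mapped(truth, truth))            # identical placement
print(is_correctly_mapped(truth, same_start_only))  # start alone is not enough
```

    A start-only criterion would accept the second placement; the combined criterion rejects it unless the tolerances are loosened.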

  6. Quality control procedures for dynamic treatment delivery techniques involving couch motion.

    PubMed

    Yu, Victoria Y; Fahimian, Benjamin P; Xing, Lei; Hristov, Dimitre H

    2014-08-01

    In this study, the authors introduce and demonstrate quality control procedures for evaluating the geometric and dosimetric fidelity of dynamic treatment delivery techniques involving treatment couch motion synchronous with gantry and multileaf collimator (MLC). Tests were designed to evaluate positional accuracy, velocity constancy and accuracy for dynamic couch motion under a realistic weight load. A test evaluating the geometric accuracy of the system in delivering treatments over complex dynamic trajectories was also devised. Custom XML scripts that control the Varian TrueBeam™ STx (Serial #3) axes in Developer Mode were written to implement the delivery sequences for the tests. Delivered dose patterns were captured with radiographic film or the electronic portal imaging device. The couch translational accuracy in dynamic treatment mode was 0.01 cm. Rotational accuracy was within 0.3°, with 0.04 cm displacement of the rotational axis. Dose intensity profiles capturing the velocity constancy and accuracy for translations and rotation exhibited standard deviation and maximum deviations below 3%. For complex delivery involving MLC and couch motions, the overall translational accuracy for reproducing programmed patterns was within 0.06 cm. The authors conclude that in Developer Mode, TrueBeam™ is capable of delivering dynamic treatment delivery techniques involving couch motion with good geometric and dosimetric fidelity.

  7. The 1988 Jansson memorial lecture. The performance of the 'idiot-savant': implicit and explicit.

    PubMed

    O'Connor, N

    1989-04-01

    'Idiots-savants' are people of low intelligence who have one or two outstanding talents such as calendrical calculation, drawing or musical performance. Such people are mostly male and occur with high frequency among the autistic population. Do they perform their amazing feats because of an outstanding memory or do they draw on some faculty of reasoning to help them? Although they cannot easily make clear how they carry out their tasks by using speech, experiments reveal that they follow simple rules which they use to aid them in recalling correct dates and sequences in classical music. It has been said that they cannot abstract but this turns out not to be true: all can abstract to some degree and some are more at home with abstract than with concrete material. Whatever else is true of these handicapped but gifted people their gift becomes apparent at an early age and is apparently not improved by practice. Perhaps the most important conclusion from work with these groups is that their gifts force us to think again about the concept of general intelligence. How far is it possible to have low intelligence and yet be an outstanding musician or artist? Speculation on this idea may force us to revise our concepts of intelligence, neuropsychology and handicap.

  8. Research on signal processing of shock absorber test bench based on zero-phase filter

    NASA Astrophysics Data System (ADS)

    Wu, Yi; Ding, Guoqing

    2017-10-01

    The quality of the force-displacement diagram is important for evaluating the performance of shock absorbers. Sampled damping-force data are often corrupted by Gaussian white noise, 50 Hz power-line interference and its harmonics during testing, so data de-noising has become the core problem in drawing a true, accurate and real-time indicator diagram. The noise and interference can be filtered out with a generic IIR or FIR low-pass filter, but this introduces additional phase lag in the useful signal because of the inherent properties of IIR and FIR filters. The paper uses the FRR method to realize zero-phase digital filtering in software, based on mutual cancellation of the phase lag between the forward and reverse sequences after they pass through the filter. High-frequency interference above 40 Hz is filtered out completely and noise attenuation is more than -40 dB, with no additional phase lag. The method is able to restore the true signal as far as possible. Theoretical simulation and practical tests indicate that high-frequency noise is effectively suppressed in multiple typical speed cases, with the signal-to-noise ratio greatly improved; the curve in the indicator diagram has better smoothness and fidelity. The FRR algorithm has low computational complexity and fast running time, and can be easily ported to multiple platforms.
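
    The phase-cancellation idea can be sketched with a generic forward-backward pass. This is an assumption about how the forward and reverse sequences are combined, not the authors' FRR implementation; the first-order low-pass filter and the `alpha` value are placeholders chosen only to keep the example short.

```python
def lowpass(x, alpha=0.2):
    """Causal first-order IIR low-pass: y[n] = alpha*x[n] + (1-alpha)*y[n-1]."""
    y, prev = [], 0.0
    for sample in x:
        prev = alpha * sample + (1.0 - alpha) * prev
        y.append(prev)
    return y


def zero_phase(x, alpha=0.2):
    """Filter forward, reverse the result, filter again, reverse back."""
    forward = lowpass(x, alpha)
    backward = lowpass(forward[::-1], alpha)
    return backward[::-1]


impulse = [0.0] * 201
impulse[100] = 1.0
one_pass = lowpass(impulse)
two_pass = zero_phase(impulse)
print(one_pass[99], one_pass[101])   # single pass responds only after the impulse
print(two_pass[95], two_pass[105])   # two passes respond symmetrically around it
```

    The single causal pass smears the impulse towards later samples, which is the phase lag; running the same filter over the reversed output smears it back by the same amount, so the combined response stays centred on the input.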

  9. True-breeding targeted gene knock-out in barley using designer TALE-nuclease in haploid cells.

    PubMed

    Gurushidze, Maia; Hensel, Goetz; Hiekel, Stefan; Schedel, Sindy; Valkov, Vladimir; Kumlehn, Jochen

    2014-01-01

    Transcription activator-like effector nucleases (TALENs) are customizable fusion proteins able to cleave virtually any genomic DNA sequence of choice, and thereby to generate site-directed genetic modifications in a wide range of cells and organisms. In the present study, we expressed TALENs in pollen-derived, regenerable cells to establish the generation of instantly true-breeding mutant plants. A gfp-specific TALEN pair was expressed via Agrobacterium-mediated transformation in embryogenic pollen of transgenic barley harboring a functional copy of gfp. Thanks to the haploid nature of the target cells, knock-out mutations were readily detected, and homozygous primary mutant plants obtained following genome duplication. In all, 22% of the TALEN transgenics proved knocked out with respect to gfp, and the loss of function could be ascribed to the deletions of between four and 36 nucleotides in length. The altered gfp alleles were transmitted normally through meiosis, and the knock-out phenotype was consistently shown by the offspring of two independent mutants. Thus, here we describe the efficient production of TALEN-mediated gene knock-outs in barley that are instantaneously homozygous and non-chimeric in regard to the site-directed mutations induced. This TALEN approach has broad applicability for both elucidating gene function and tailoring the phenotype of barley and other crop species.

  10. Cardiac phase detection in intravascular ultrasound images

    NASA Astrophysics Data System (ADS)

    Matsumoto, Monica M. S.; Lemos, Pedro Alves; Yoneyama, Takashi; Furuie, Sergio Shiguemi

    2008-03-01

    Image gating applies to imaging modalities that involve quasi-periodically moving organs. During intravascular ultrasound (IVUS) examination, cardiac movement therefore interferes with the images. In this paper, we aim to obtain IVUS gated images based on the images themselves. This would allow the reconstruction of 3D coronaries with temporal accuracy for any cardiac phase, which is an advantage over ECG-gated acquisition, which shows a single one. It is also important for retrospective studies, as existing IVUS databases contain no additional reference signals (ECG). From the images, we calculated signals based on average intensity (AI) and, from consecutive frames, average intensity difference (AID), cross-correlation coefficient (CC) and mutual information (MI). The process includes a wavelet-based filter step and ascendant zero-cross detection in order to obtain the phase information. First, we tested 90 simulated sequences with 1025 frames each. Our method achieved a true-positive rate above 95.0% and a false-positive rate below 2.3% for all signals. We then tested on a real examination with 897 frames, using ECG as the gold standard. We achieved 97.4% true positives (CC and MI) and 2.5% false positives. In future work, the methodology should be tested on a wider range of IVUS examinations.
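
    The ascendant zero-cross detection step can be sketched as follows. This is an illustration only: the wavelet-based filter is omitted, the synthetic signal stands in for a filtered AI, AID, CC or MI trace, and the function name and the 25-frames-per-beat figure are assumptions.

```python
import math


def ascending_zero_crossings(signal):
    """Indices n where the signal goes from negative at n-1 to non-negative at n."""
    return [n for n in range(1, len(signal))
            if signal[n - 1] < 0.0 <= signal[n]]


# Synthetic quasi-periodic signal: 25 frames per simulated heartbeat.
frames_per_beat = 25
signal = [math.sin(2.0 * math.pi * (n + 0.5) / frames_per_beat - math.pi / 2)
          for n in range(100)]
gates = ascending_zero_crossings(signal)
print(gates)  # one gate frame per simulated beat
```

    Each detected index marks the same point of the cycle in successive beats, so picking the frames at those indices yields a gated subsequence.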

  11. Resolved stars in nearby galaxies: Ground-based photometry of M81

    NASA Technical Reports Server (NTRS)

    Madore, Barry F.; Freedman, Wendy L.; Lee, Myung G.

    1993-01-01

    Using the Canada-France-Hawaii Telescope (CFHT) we have obtained three closely spaced epochs of calibrated Blue Violet Red Infrared (BVRI) CCD imaging of two fields in M81, each known to contain a thirty-day Cepheid. Calibrated BVRI photometry of the brightest stars in these fields is presented. The slope of the luminosity function from the brightest 3-4 mag of the main-sequence blue plume is consistent with similar determinations of the apparent luminosity function in other resolved galaxies, thereby removing the one potential deviation from universality noted by Freedman in a photographic study of luminosity functions in nearby resolved galaxies. Under the assumption that the two Cepheids are representative, a reddening-law fit to the multiwavelength BVRI period-luminosity moduli gives a true distance modulus of (m-M)_0 = 27.79 mag for M81, corresponding to a linear distance of 3.6 Mpc. An error analysis shows that the derived true distance modulus has a random error of +/- 0.28 mag (due to the photometric uncertainties in the BVRI data), with a systematic uncertainty of +/- 0.10 mag (accounting for the combined effects of unknown phasing of the data points, and the unknown positioning of these particular stars within the Cepheid instability strip).
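
    The step from the true distance modulus to the linear distance follows the standard relation d = 10^((m-M)_0 / 5 + 1) parsecs. The short sketch below checks the quoted figures; the function name is illustrative, and propagating only the random error is a simplification.

```python
def distance_mpc(true_modulus: float) -> float:
    """Distance in megaparsecs from a true distance modulus (m - M)_0."""
    parsecs = 10.0 ** (true_modulus / 5.0 + 1.0)
    return parsecs / 1.0e6


mu0 = 27.79
print(round(distance_mpc(mu0), 2))          # central value, about 3.6 Mpc
print(round(distance_mpc(mu0 - 0.28), 2),   # range implied by the random error
      round(distance_mpc(mu0 + 0.28), 2))
```

    Because the relation is exponential, the symmetric error in magnitudes maps to an asymmetric range in distance.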

  12. Crescendo: A Protein Sequence Database Search Engine for Tandem Mass Spectra.

    PubMed

    Wang, Jianqi; Zhang, Yajie; Yu, Yonghao

    2015-07-01

    A search engine that reliably discovers more peptides is essential to progress in computational proteomics. We propose two new scoring functions (L- and P-scores), which aim to capture similar characteristics of a peptide-spectrum match (PSM) as Sequest and Comet do. Crescendo, introduced here, is a software program that implements these two scores for peptide identification. We applied Crescendo to test datasets and compared its performance with widely used search engines, including Mascot, Sequest, and Comet. The results indicate that Crescendo identifies a similar or larger number of peptides at various predefined false discovery rates (FDR). Importantly, it also provides a better separation between the true and decoy PSMs, warranting the future development of a companion post-processing filtering algorithm.
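
    Counting identifications at a predefined FDR is commonly done with a target-decoy estimate, sketched below. Whether Crescendo uses exactly this estimator is an assumption; the scores, the function names and the decoys-over-targets formula are illustrative and do not reproduce the L- or P-score.

```python
def fdr_at_threshold(psms, threshold):
    """Estimated FDR among PSMs scoring >= threshold: decoys / targets."""
    targets = sum(1 for score, is_decoy in psms if score >= threshold and not is_decoy)
    decoys = sum(1 for score, is_decoy in psms if score >= threshold and is_decoy)
    return decoys / targets if targets else 0.0


def accepted_targets(psms, max_fdr):
    """Most target PSMs kept by any score cut-off with estimated FDR <= max_fdr."""
    best = 0
    for threshold in sorted({score for score, _ in psms}):
        if fdr_at_threshold(psms, threshold) <= max_fdr:
            kept = sum(1 for s, d in psms if s >= threshold and not d)
            best = max(best, kept)
    return best


# Hypothetical (score, is_decoy) pairs; higher scores are better matches.
psms = [(9.1, False), (8.7, False), (8.2, False), (7.9, False), (7.5, True),
        (7.1, False), (6.8, False), (6.0, True), (5.5, False), (5.1, True)]
print(accepted_targets(psms, max_fdr=0.20))
```

    A scoring function that separates target and decoy PSMs more cleanly pushes decoys towards the bottom of the ranking, so more targets survive at the same FDR cut-off.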

  13. X-rays from accretion of red giant winds

    NASA Technical Reports Server (NTRS)

    Jura, M.; Helfand, D. J.

    1984-01-01

    X-ray observations of the late-type red giants Mira and R Aqr obtained with the Einstein Observatory are presented, and the general problem of white dwarf accretion from late-type giant winds is considered. The extremely low measured luminosities obtained for the two systems lead to the conclusion that the companions of Mira and R Aqr are most likely low-mass main sequence objects rather than white dwarfs as is usually assumed. The expected X-ray luminosities of true red giant/white dwarf systems are considered, and it is concluded that far too few have been detected if the canonical accretion scenario is adopted. A possible explanation of this situation in terms of grain-dominated Eddington-limited accretion is proposed.

  14. Realistic facial animation generation based on facial expression mapping

    NASA Astrophysics Data System (ADS)

    Yu, Hui; Garrod, Oliver; Jack, Rachael; Schyns, Philippe

    2014-01-01

    Facial expressions reflect the internal emotional states of a character or arise in response to social communication. Although much effort has gone into generating realistic facial expressions, this remains a challenging topic because humans are sensitive to subtle facial movements. In this paper, we present a method for facial animation generation that reflects true facial muscle movements with high fidelity. An intermediate model space is introduced to transfer captured static AU peak frames based on FACS to the conformed target face. Dynamic parameters derived using a psychophysics method are then integrated to generate facial animation, which is assumed to represent the natural correlation of multiple AUs. Finally, the animation sequence in the intermediate model space is mapped to the target face to produce the final animation.

  15. What is behind "centromere repositioning"?

    PubMed

    Schubert, Ingo

    2018-06-01

    An increasing number of observations suggest an evolutionary switch of centromere position on monocentric eukaryotic chromosomes which otherwise display a conserved sequence of genes and markers. Such observations are particularly frequent for primates and Equidae (for review see Heredity 108:59-67, 2012) but occur also in marsupials (J Hered 96:217-224, 2005) and in plants (Chromosome Res 25:299-311, 2017 and references therein). The actual mechanism(s) behind this switch remained unclear in many cases (Proc Natl Acad Sci USA 101:6542-6547, 2004; Trends Genet 30:66-74, 2014). The same is true for de novo centromere formation on chromosomes lacking an active centromere. This article focuses on recent reports on centromere repositioning and the possible mechanisms behind it, and addresses open questions.

  16. Genetic sequencing for surveillance of drug resistance in tuberculosis in highly endemic countries: a multi-country population-based surveillance study.

    PubMed

    Zignol, Matteo; Cabibbe, Andrea Maurizio; Dean, Anna S; Glaziou, Philippe; Alikhanova, Natavan; Ama, Cecilia; Andres, Sönke; Barbova, Anna; Borbe-Reyes, Angeli; Chin, Daniel P; Cirillo, Daniela Maria; Colvin, Charlotte; Dadu, Andrei; Dreyer, Andries; Driesen, Michèle; Gilpin, Christopher; Hasan, Rumina; Hasan, Zahra; Hoffner, Sven; Hussain, Alamdar; Ismail, Nazir; Kamal, S M Mostofa; Khanzada, Faisal Masood; Kimerling, Michael; Kohl, Thomas Andreas; Mansjö, Mikael; Miotto, Paolo; Mukadi, Ya Diul; Mvusi, Lindiwe; Niemann, Stefan; Omar, Shaheed V; Rigouts, Leen; Schito, Marco; Sela, Ivita; Seyfaddinova, Mehriban; Skenders, Girts; Skrahina, Alena; Tahseen, Sabira; Wells, William A; Zhurilo, Alexander; Weyer, Karin; Floyd, Katherine; Raviglione, Mario C

    2018-06-01

    In many countries, regular monitoring of the emergence of resistance to anti-tuberculosis drugs is hampered by the limitations of phenotypic testing for drug susceptibility. We therefore evaluated the use of genetic sequencing for surveillance of drug resistance in tuberculosis. Population-level surveys were done in hospitals and clinics in seven countries (Azerbaijan, Bangladesh, Belarus, Pakistan, Philippines, South Africa, and Ukraine) to evaluate the use of genetic sequencing to estimate the resistance of Mycobacterium tuberculosis isolates to rifampicin, isoniazid, ofloxacin, moxifloxacin, pyrazinamide, kanamycin, amikacin, and capreomycin. For each drug, we assessed the accuracy of genetic sequencing by a comparison of the adjusted prevalence of resistance, measured by genetic sequencing, with the true prevalence of resistance, determined by phenotypic testing. Isolates were taken from 7094 patients with tuberculosis who were enrolled in the study between November, 2009, and May, 2014. In all tuberculosis cases, the overall pooled sensitivity values for predicting resistance by genetic sequencing were 91% (95% CI 87-94) for rpoB (rifampicin resistance), 86% (74-93) for katG, inhA, and fabG promoter combined (isoniazid resistance), 54% (39-68) for pncA (pyrazinamide resistance), 85% (77-91) for gyrA and gyrB combined (ofloxacin resistance), and 88% (81-92) for gyrA and gyrB combined (moxifloxacin resistance). For nearly all drugs and in most settings, there was a large overlap in the estimated prevalence of drug resistance by genetic sequencing and the estimated prevalence by phenotypic testing. Genetic sequencing can be a valuable tool for surveillance of drug resistance, providing new opportunities to monitor drug resistance in tuberculosis in resource-poor countries. Before its widespread adoption for surveillance purposes, there is a need to standardise DNA extraction methods, recording and reporting nomenclature, and data interpretation. 
Bill & Melinda Gates Foundation, United States Agency for International Development, Global Alliance for Tuberculosis Drug Development. © 2018 World Health Organization; licensee Elsevier. This is an Open Access article published under the CC BY 3.0 IGO license, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. In any use of this article, there should be no suggestion that WHO endorses any specific organisation, products or services. The use of the WHO logo is not permitted. This notice should be preserved along with the article's original URL.
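
    The accuracy figures in this abstract are sensitivities with 95% confidence intervals, where phenotypic testing is the reference. The sketch below shows how such a figure is formed for a single drug; the counts are hypothetical, and the Wilson score interval is a stand-in, since the abstract does not say how the pooled intervals were computed.

```python
import math


def sensitivity(true_pos: int, false_neg: int) -> float:
    """Share of phenotypically resistant isolates that sequencing also flags."""
    return true_pos / (true_pos + false_neg)


def wilson_interval(successes: int, n: int, z: float = 1.96):
    """Wilson score interval for a binomial proportion (z = 1.96 for ~95%)."""
    p = successes / n
    denom = 1.0 + z * z / n
    centre = (p + z * z / (2.0 * n)) / denom
    half = z * math.sqrt(p * (1.0 - p) / n + z * z / (4.0 * n * n)) / denom
    return centre - half, centre + half


# Hypothetical counts, not taken from the study.
tp, fn = 182, 18
low, high = wilson_interval(tp, tp + fn)
print(f"sensitivity {sensitivity(tp, fn):.2f}, 95% CI {low:.2f}-{high:.2f}")
```

    Pooling across countries would normally call for a meta-analytic model rather than a single binomial interval; this sketch covers only the per-setting calculation.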

  17. History dependence in insect flight decisions during odor tracking.

    PubMed

    Pang, Rich; van Breugel, Floris; Dickinson, Michael; Riffell, Jeffrey A; Fairhall, Adrienne

    2018-02-01

    Natural decision-making often involves extended decision sequences in response to variable stimuli with complex structure. As an example, many animals follow odor plumes to locate food sources or mates, but turbulence breaks up the advected odor signal into intermittent filaments and puffs. This scenario provides an opportunity to ask how animals use sparse, instantaneous, and stochastic signal encounters to generate goal-oriented behavioral sequences. Here we examined the trajectories of flying fruit flies (Drosophila melanogaster) and mosquitoes (Aedes aegypti) navigating in controlled plumes of attractive odorants. While it is known that mean odor-triggered flight responses are dominated by upwind turns, individual responses are highly variable. We asked whether deviations from mean responses depended on specific features of odor encounters, and found that odor-triggered turns were slightly but significantly modulated by two features of odor encounters. First, encounters with higher concentrations triggered stronger upwind turns. Second, encounters occurring later in a sequence triggered weaker upwind turns. To contextualize the latter history dependence theoretically, we examined trajectories simulated from three normative tracking strategies. We found that neither a purely reactive strategy nor a strategy in which the tracker learned the plume centerline over time captured the observed history dependence. In contrast, "infotaxis", in which flight decisions maximized expected information gain about source location, exhibited a history dependence aligned in sign with the data, though much larger in magnitude. These findings suggest that while true plume tracking is dominated by a reactive odor response it might also involve a history-dependent modulation of responses consistent with the accumulation of information about a source over multi-encounter timescales. 
This suggests that short-term memory processes modulating decision sequences may play a role in natural plume tracking.
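
    The "infotaxis" strategy mentioned above chooses the action with the largest expected information gain about the source location. A minimal sketch of that quantity for a discrete belief and a binary hit/miss observation follows; the three candidate locations, the two moves and all probabilities are hypothetical, and this is not the authors' simulation code.

```python
import math


def entropy(p):
    """Shannon entropy in bits of a discrete distribution."""
    return -sum(q * math.log2(q) for q in p if q > 0.0)


def posterior(prior, likelihood):
    """Bayes update; returns (posterior, probability of the observation)."""
    joint = [pr * lk for pr, lk in zip(prior, likelihood)]
    total = sum(joint)
    if total == 0.0:
        return list(prior), 0.0
    return [j / total for j in joint], total


def expected_information_gain(prior, hit_prob):
    """Expected entropy drop from one hit/miss observation at a candidate position."""
    post_hit, p_hit = posterior(prior, hit_prob)
    post_miss, p_miss = posterior(prior, [1.0 - h for h in hit_prob])
    expected_after = p_hit * entropy(post_hit) + p_miss * entropy(post_miss)
    return entropy(prior) - expected_after


# Belief over three hypothetical source locations and two candidate moves.
belief = [0.5, 0.3, 0.2]
moves = {
    "upwind": [0.8, 0.2, 0.1],     # hit probability depends on where the source is
    "crosswind": [0.3, 0.3, 0.3],  # hit probability says nothing about the source
}
gains = {name: expected_information_gain(belief, h) for name, h in moves.items()}
print(max(gains, key=gains.get))
```

    As the belief sharpens over successive encounters, later observations can remove less entropy, which is one way such a strategy acquires a dependence on encounter history.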

  18. GABARAPL1 antibodies: target one protein, get one free!

    PubMed

    Le Grand, Jaclyn Nicole; Chakrama, Fatima Zahra; Seguin-Py, Stéphanie; Fraichard, Annick; Delage-Mourroux, Régis; Jouvenot, Michèle; Risold, Pierre-Yves; Boyer-Guittaut, Michaël

    2011-11-01

    Atg8 is a yeast protein involved in the autophagic process and in particular in the elongation of autophagosomes. In mammals, several orthologs have been identified and are classed into two subfamilies: the LC3 subfamily and the GABARAP subfamily, referred to simply as the LC3 or GABARAP families. GABARAPL1 (GABARAP-like protein 1), one of the proteins belonging to the GABARAP (GABA(A) receptor-associated protein) family, is highly expressed in the central nervous system and implicated in processes such as receptor and vesicle transport as well as autophagy. The proteins that make up the GABARAP family demonstrate conservation of their amino acid sequences and protein structures. In humans, GABARAPL1 shares 86% identity with GABARAP and 61% with GABARAPL2 (GATE-16). The identification of the individual proteins is thus very limited when working in vivo due to a lack of unique peptide sequences from which specific antibodies can be developed. Actually, and to our knowledge, there are no available antibodies on the market that are entirely specific to GABARAPL1 and the same may be true of the anti-GABARAP antibodies. In this study, we sought to examine the specificity of three antibodies targeted against different peptide sequences within GABARAPL1: CHEM-CENT (an antibody raised against a short peptide sequence within the center of the protein), PTG-NTER (an antibody raised against the N-terminus of the protein) and PTG-FL (an antibody raised against the full-length protein). The results described in this article demonstrate the importance of testing antibody specificity under the conditions for which it will be used experimentally, a caution that should be taken when studying the expression of the GABARAP family proteins.

  19. True hermaphroditism in a 46, XY individual, caused by a postzygotic somatic point mutation in the male gonadal sex-determining locus (SRY): Molecular genetics and histological findings in a sporadic case

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Braun, A.; Kammerer, S.; Cleve, H.

    1993-03-01

    Recently, the gene for the determination of maleness has been identified in the sex-determining region on the short arm of the Y chromosome (SRY) between the Y-chromosomal pseudoautosomal boundary (PABY) and the ZFY gene locus. Experiments with transgenic mice confirmed that SRY is a part of the testis-determining factor (TDF). The authors describe a sporadic case of a patient with intersexual genitalia and the histological finding of ovotestes in the gonad, which resembles the mixed type of gonadal tissue without primordial follicle structures. The karyotype of the patient was 46,XY. By PCR amplification, they tested for the presence of SRY by using DNA obtained from histological gonadal slices. The SRY products of both DNA preparations were further analyzed by direct sequencing. All three parts of the sex-determining region of the Y chromosome could be amplified from leukocytic DNA. The patient's and the father's SRY sequences were identical with the published sequence. In the SRY PCR product of gonadal DNA, the wild-type and two point mutations were present in the patient's sequence, simulating a heterozygous state of a Y-chromosomal gene: one of the mutations was silent, while the other encoded for a nonconservative amino acid substitution from leucine to histidine. Subcloning procedures showed that the two point mutations always occurred together. The origin of the patient's intersexuality is a postzygotic mutation of the SRY occurring in part of the gonadal tissue. This event caused the loss of the testis-determining function in affected cells. 37 refs., 6 figs.

  20. History dependence in insect flight decisions during odor tracking

    PubMed Central

    van Breugel, Floris; Dickinson, Michael; Riffell, Jeffrey A.; Fairhall, Adrienne

    2018-01-01

    Natural decision-making often involves extended decision sequences in response to variable stimuli with complex structure. As an example, many animals follow odor plumes to locate food sources or mates, but turbulence breaks up the advected odor signal into intermittent filaments and puffs. This scenario provides an opportunity to ask how animals use sparse, instantaneous, and stochastic signal encounters to generate goal-oriented behavioral sequences. Here we examined the trajectories of flying fruit flies (Drosophila melanogaster) and mosquitoes (Aedes aegypti) navigating in controlled plumes of attractive odorants. While it is known that mean odor-triggered flight responses are dominated by upwind turns, individual responses are highly variable. We asked whether deviations from mean responses depended on specific features of odor encounters, and found that odor-triggered turns were slightly but significantly modulated by two features of odor encounters. First, encounters with higher concentrations triggered stronger upwind turns. Second, encounters occurring later in a sequence triggered weaker upwind turns. To contextualize the latter history dependence theoretically, we examined trajectories simulated from three normative tracking strategies. We found that neither a purely reactive strategy nor a strategy in which the tracker learned the plume centerline over time captured the observed history dependence. In contrast, “infotaxis”, in which flight decisions maximized expected information gain about source location, exhibited a history dependence aligned in sign with the data, though much larger in magnitude. These findings suggest that while true plume tracking is dominated by a reactive odor response it might also involve a history-dependent modulation of responses consistent with the accumulation of information about a source over multi-encounter timescales. 
This suggests that short-term memory processes modulating decision sequences may play a role in natural plume tracking. PMID:29432454
