TreeSeq, a fast and intuitive tool for analysis of whole genome and metagenomic sequence data

B.B. Wintermans, B.W. Brandt, C.M.J.E. Vandenbroucke-Grauls, A.E. Budding

Research output: Contribution to journalArticleAcademicpeer-review

2 Citations (Scopus)

Abstract

Next-generation sequencing is not yet commonly used in clinical laboratories because of a lack of simple and intuitive tools. We developed a software tool (TreeSeq) with a quaternary tree search structure for the analysis of sequence data. This permits rapid searches for sequences of interest in large datasets. We used TreeSeq to screen a gut microbiota metagenomic dataset and a whole genome sequencing (WGS) dataset of a strain of Klebsiella pneumoniae for antibiotic resistance genes and compared the results with BLAST and phenotypic resistance determination. TreeSeq was more than thirty times faster than BLAST and accurately detected resistance gene sequences in complex metagenomic data and resistance genes corresponding with the phenotypic resistance pattern of the Klebsiella strain. Resistance genes found by TreeSeq were visualized as a gene coverage heat map, aiding in the interpretation of results. TreeSeq brings analysis of metagenomic and WGS data within reach of clinical diagnostics.
Original languageEnglish
Article numbere0133816
Number of pages11
JournalPLOS ONE
Volume10
Issue number5
DOIs
Publication statusPublished - 1 May 2015

Cite this