Metagenomics poses opportunities for clinical and public health virology applications by offering a way to assess complete taxonomic composition of a clinical sample in an unbiased way. However, the techniques required are complicated and analysis standards have yet to develop. This, together with the wealth of different tools and workflows that have been proposed, poses a barrier for new users. We evaluated 49 published computational classification workflows for virus metagenomics in a literature review. To this end, we described the methods of existing workflows by breaking them up into five general steps and assessed their ease-of-use and validation experiments. Performance scores of previous benchmarks were summarized and correlations between methods and performance were investigated. We indicate the potential suitability of the different workflows for (1) time-constrained diagnostics, (2) surveillance and outbreak source tracing, (3) detection of remote homologies (discovery), and (4) biodiversity studies. We provide two decision trees for virologists to help select a workflow for medical or biodiversity studies, as well as directions for future developments in clinical viral metagenomics.

Additional Metadata
Keywords Decision tree, Pipeline, Software, Standardization, Use case, Viral metagenomics
Persistent URL dx.doi.org/10.3389/fmicb.2018.00749, hdl.handle.net/1765/106085
Journal Frontiers in Microbiology
Citation
Nooij, S. (Sam), Schmitz, D. (Dennis), Vennema, H, Kroneman, A, & Koopmans, M.P.G, D.V.M. (2018). Overview of virus metagenomic classification methods and their biological applications. Frontiers in Microbiology (Vol. 9). doi:10.3389/fmicb.2018.00749