Chronic infection with Hepatitis B virus (HBV) is a major risk factor for the development of advanced liver disease including fibrosis, cirrhosis, and hepatocellular carcinoma (HCC). The relative contribution of virological factors to disease progression has not been fully defined and tools aiding the deconvolution of complex patient virus profiles is an unmet clinical need. Variable viral mutant signatures develop within individual patients due to the low-fidelity replication of the viral polymerase creating 'quasispecies' populations. Here we present the first comprehensive survey of the diversity of HBV quasispecies through ultra-deep sequencing of the complete HBV genome across two distinct European and Asian patient populations. Seroconversion to the HBV e antigen (HBeAg) represents a critical clinical waymark in infected individuals. Using a machine learning approach, a model was developed to determine the viral variants that accurately classify HBeAg status. Serial surveys of patient quasispecies populations and advanced analytics will facilitate clinical decision support for chronic HBV infection and direct therapeutic strategies through improved patient stratification.

Additional Metadata
Persistent URL dx.doi.org/10.1038/s41598-019-55445-8, hdl.handle.net/1765/122745
Journal Scientific Reports
Citation
Mueller-Breckenridge, A.J. (Alan J.), Garcia-Alcalde, F. (Fernando), Wildum, S. (Steffen), Smits, S.L, de Man, R.A, van Campenhout, M.J.H, … Haagmans, B.L. (2019). Machine-learning based patient classification using Hepatitis B virus full-length genome quasispecies from Asian and European cohorts. Scientific Reports, 9(1). doi:10.1038/s41598-019-55445-8