rPinecone : define sub-lineages of a clonal expansion via a phylogenetic tree

Wailan, Alexander M. and Coll, Francesc and Heinz, Eva and Tonkin-Hill, Gerry and Corander, Jukka and Feasey, Nicholas A. and Thomson, Nicholas R. (2019) rPinecone : define sub-lineages of a clonal expansion via a phylogenetic tree. Microbial Genomics, 5 (4). pp. 1-9. ISSN 2057-5858 (https://doi.org/10.1099/mgen.0.000264)

[thumbnail of Wailan-etal-MG-2019-rPinecone-define-sub-lineages-of-a-clonal-expansion-via-a-phylogenetic-tree]
Preview
Text. Filename: Wailan-etal-MG-2019-rPinecone-define-sub-lineages-of-a-clonal-expansion-via-a-phylogenetic-tree.pdf
Final Published Version
License: Creative Commons Attribution 4.0 logo

Download (3MB)| Preview

Abstract

The ability to distinguish different circulating pathogen clones from each other is a fundamental requirement to understand the epidemiology of infectious diseases. Phylogenetic analysis of genomic data can provide a powerful platform to identify lineages within bacterial populations, and thus inform outbreak investigation and transmission dynamics. However, resolving differences between pathogens associated with low-variant (LV) populations carrying low median pairwise single nucleotide variant (SNV) distances remains a major challenge. Here we present rPinecone, an R package designed to define sublineages within closely related LV populations. rPinecone uses a root-to-tip directional approach to define sub-lineages within a phylogenetic tree according to SNV distance from the ancestral node. The utility of this software was demonstrated using both simulated outbreaks and real genomic data of two LV populations: a hospital outbreak of methicillin-resistant Staphylococcus aureus and endemic Salmonella Typhi from rural Cambodia. rPinecone identified the transmission branches of the hospital outbreak and geographically confined lineages in Cambodia. Sub-lineages identified by rPinecone in both analyses were phylogenetically robust. It is anticipated that rPinecone can be used to discriminate between lineages of bacteria from LV populations where other methods fail, enabling a deeper understanding of infectious disease epidemiology for public health purposes.

ORCID iDs

Wailan, Alexander M., Coll, Francesc, Heinz, Eva ORCID logoORCID: https://orcid.org/0000-0003-4413-3756, Tonkin-Hill, Gerry, Corander, Jukka, Feasey, Nicholas A. and Thomson, Nicholas R.;