T2-01 Whole Genome Comparisons of Listeria monocytogenes Isolates:  A Two-Step Analysis Combining Whole Genome Multilocus Sequence Typing (wgMLST) and Whole Genome Single-Nucleotide Polymorphism (wgSNP)

Wednesday, 29 March 2017: 14:00
314-316 (The Square)
Katleen Vranckx, Applied Maths NV, Sint-Martens-Latem, Belgium
Katrien De Bruyne, Applied Maths NV, Sint-Martens-Latem, Belgium
Hannes Pouseele, Applied Maths NV, Sint-Martens-Latem, Belgium
Introduction:  Listeria monocytogenes (Lmo), although an uncommon cause of illness, is an important pathogen in pregnant patients, neonates, elderly individuals, and immunocompromised individuals. Following considerable costs reductions, Lmo genome sequencing dramatically increased the number of publically available genomes on NCBI/SRA. The key challenge is to rapidly compute and interpret the relevant information from this vast amount of data. Rapid and automated processing of WGS data ensures a reliable and easy to follow workflow in routine molecular surveillance, reducing the time to detect and contain an outbreak. WGS has the potential to provide information on traditional typing technologies such as MLST, virulence, and resistance determination.

Purpose: This study compared two subsequent pipelines for high resolution WGS-based molecular typing.

Methods:  First, whole genome multilocus sequence typing (wgMLST) was applied to WGS data from all 10,000+ isolates available on NCBI, with the purpose to detect clusters of highly related strains. Clusters, defined by wgMLST, were then characterized by whole genome single-nucleotide polymorphism (wgSNP) analysis. SNP variants detected by mapping the WGS reads to a reference chosen from within the cluster were used to maximize the resolution. As working examples, we identified clusters containing isolates originating from different food sources. Both analysis pipelines were run on the BioNumerics® Calculation Engine, which is fully integrated with the BioNumerics®7.6 software.

Results:  We demonstrated that wgMLST was suitable for the analysis of very large (growing) datasets; making it a suitable technique for outbreak surveillance. The added resolution of wgSNP against an internal reference sequence increased the confidence in the detected clusters. This supports epidemiologists in their source tracking efforts, opening many perspectives for cost efficient food safety and public health monitoring programs.

Significance:  BioNumerics® 7.6 software offers a powerful platform where both wgMLST and wgSNP analysis can be performed at high-throughput rates, and in combination with traditional typing data (MLST, PFGE, etc.) to rapidly provide a robust, portable, and high resolution picture of molecular typing data.