P2-72 Defining a Core Genome Multilocus Sequence Typing Scheme for the Global Epidemiology of Vibrio parahaemolyticus Strains

Tuesday, July 11, 2017
Exhibit Hall (Tampa Convention Center)
Narjol Gonzalez-Escalona , U.S. Food and Drug Administration , College Park , MD
Keith A. Jolley , Department of Zoology, University of Oxford , Oxford , United Kingdom
Elizabeth Reed , U.S. Food and Drug Administration , College Park , MD
Jaime Martinez-Urtaza , University of Bath , Bath , United Kingdom
Introduction: Vibrio parahaemolyticus is an important human foodborne pathogen whose transmission is associated with the consumption of contaminated seafood. A multilocus sequence typing (MLST) database for V. parahaemolyticus is available and has been used to identify a large number of clones that have caused severe outbreaks worldwide or recurrent outbreaks in certain regions, or have spread to other regions where they are nonendemic. The current MLST scheme is a powerful tool for inferring the population structure of this pathogen; but it has limited resolution, especially when compared to pulse field gel electrophoresis.

Purpose: Application of whole genome sequencing has become a routine for tracing infections or outbreaks. Core genome MLST (cgMLST) analysis is one of the most straightforward ways to explore complex genomic data in an epidemiological context. Therefore, there is a need to generate a new, portable, standardized, and more advanced system that provides higher resolution among V. parahaemolyticus strains using WGS data.

Methods: To establish this cgMLST scheme, we sequenced 92 V. parahaemolyticus genomes and used the genome of strain RIMD 2210633, as the reference (total 4,832 genes), to determine which genes were suitable for establishing the V. parahaemolyticus cgMLST scheme.

Results: The initial analysis resulted in the identification of 2,254 suitable core genes for use in the scheme. To evaluate the performance of this scheme, we performed a cgMLST analysis of the 92 newly sequenced genomes, plus an additional 142 strains with genomes available at NCBI. The cgMLST scheme distinguished related and unrelated strains, including those with the same sequencing type; clearly showing its enhanced resolution over conventional MLST analysis. This cgMLST also distinguished outbreak-related from unrelated strains within the same sequencing type.

Significance: Application of this cgMLST scheme to V. parahaemolyticus strains from different laboratories around the world will facilitate a global picture of the epidemiology, spread, and evolution of this pathogen.