Purpose: This pilot study demonstrates how desktop Whole Genome Sequencing (WGS) data can be used, in a combined analysis, for source tracking of foodborne pathogens. Additionally, we describe the components of the NGS pathogen network that integrates state public health laboratories (AK, AZ, FL, HI, MD, MN, NM, NY, NY_Ag, TX, VA, and WA) as well as federal laboratories.
Methods: Multiple data analysis pipelines (Examples) were tested to combine draft bacterial genomes for phylogenetic analyses to provide leads in outbreak investigations. Results of over 12,000 draft bacterial genomes plus the phylogenetic trees of these results are available from the GenomeTrakr bioprojects at the NCBI.
Results: The hardware and software implemented allowed us to compare and cluster complete genomes of thousands of taxa at a time, and the software outputs daily phylogenetic trees for source tracking of food, clinical and environmental isolates. We report enhanced molecular epidemiological insights gained by comparative analysis of Salmonella and Listeria genomes previously deemed indistinguishable by conventional subtyping methodologies. Details will be provided, for two outbreak case studies with roughly 50 isolates for each case one in nut butter and one in Latin style cheeses The nut butter case documents the ability to link a few clinical cases to an environmental swab from a food facility. The Listeria case was the first example of FDA using WGS for regulatory action.
Significance: These results demonstrate the role of WGS tools within a regulatory environment while highlighting the novel additional insights provided to epidemiological investigations through comparison to a reference database.