P3-55 FDA-Escherichia coli Identification (FDA-ECID) Microarray: A Pan-Genome Molecular Toolbox for Serotyping, Virulence Profiling, Molecular Epidemiology, and Phylogeny

Wednesday, August 3, 2016
America's Center - St. Louis
Jayanthi Gangiredla, U.S. Food and Drug Administration, Laurel, MD
Isha Patel, U.S. Food and Drug Administration, Laurel, MD
David Lacher, U.S. Food and Drug Administration, Laurel, MD
Mark Mammel, U.S. Food and Drug Administration, Laurel, MD
Scott Jackson, NIST, Gaithersburg, MD
Keith Lampel, U.S. Food and Drug Administration, Laurel, MD
Christopher Elkins, U.S. Food and Drug Administration, Laurel, MD
Introduction: Most Escherichia coli are non-pathogenic and some have a beneficial effect for humans. However, it is important to identify pathogenic strains for clinical diagnosis as well as food safety analysis. Current methods for the identification of pathogenic E. coli are either time-consuming and/or provide limited and sometimes incomplete information to make a definitive determination of the potential risk to human health.

Purpose: The purpose of this study was to describe a microarray-based assay to discriminate pathogenic and non-pathogenic E. coli

Methods: A high-density custom DNA microarray was designed with informative genetic features extracted from 368 whole genome sequences (WGS) for rapid and high-throughput pathogen identification. The FDA-ECID microarray contains three sets of molecularly informative features that function together to stratify strain identification and relatedness. This includes molecular serotyping, E. coli pan-genome content information, and recapitulating the phylogeny of E. coli, the latter based on 9984 SNPs providing the most discriminatory capability. We analyzed 103 diverse E. coli isolates with available WGS data, including those associated with past foodborne illnesses, to determine robustness and accuracy of the array.

Results: The array was able to accurately identify the molecular O and H serotypes of all 103 isolates tested. In addition, molecular risk assessment was possible with virulence maker identifications, as exemplified with the targeted stx and eae alleles.  Epidemiologically, each strain had a unique comparative genomic fingerprint that was extended to an additional 507 strains with strain-level resolution demonstrated for food and clinical examples. Finally, a 99% phylogenetic concordance was established between microarray analysis and WGS using SNP-level data for advance genome typing.

Significance:  The current study confirms the FDA-ECID microarray as a powerful tool for epidemiology and molecular risk assessment with the capacity to profile the global landscape and diversity of E. coli.