Single-molecule sequencing to track plasmid diversity of hospital-associated carbapenemase-producing Enterobacteriaceae

Science Translational Medicine  17 Sep 2014:
Vol. 6, Issue 254, pp. 254ra126
DOI: 10.1126/scitranslmed.3009845


Public health officials have raised concerns that plasmid transfer between Enterobacteriaceae species may spread resistance to carbapenems, an antibiotic class of last resort, thereby rendering common health care–associated infections nearly impossible to treat. To determine the diversity of carbapenemase-encoding plasmids and assess their mobility among bacterial species, we performed comprehensive surveillance and genomic sequencing of carbapenem-resistant Enterobacteriaceae in the National Institutes of Health (NIH) Clinical Center patient population and hospital environment. We isolated a repertoire of carbapenemase-encoding Enterobacteriaceae, including multiple strains of Klebsiella pneumoniae, Klebsiella oxytoca, Escherichia coli, Enterobacter cloacae, Citrobacter freundii, and Pantoea species. Long-read genome sequencing with full end-to-end assembly revealed that these organisms carry the carbapenem resistance genes on a wide array of plasmids. K. pneumoniae and E. cloacae isolated simultaneously from a single patient harbored two different carbapenemase-encoding plasmids, indicating that plasmid transfer between organisms was unlikely within this patient. We did, however, find evidence of horizontal transfer of carbapenemase-encoding plasmids between K. pneumoniae, E. cloacae, and C. freundii in the hospital environment. Our data, including full plasmid identification, challenge assumptions about horizontal gene transfer events within patients and identify possible connections between patients and the hospital environment. In addition, we identified a new carbapenemase-encoding plasmid of potentially high clinical impact carried by K. pneumoniae, E. coli, E. cloacae, and Pantoea species, in unrelated patients and in the hospital environment.


Carbapenem-resistant Enterobacteriaceae (CRE) are formidable Gram-negative bacterial pathogens that pose a serious triple threat to hospitalized patients around the globe. First, CRE are resistant to most, if not all, antibiotics, with investigations reporting mortality rates from infection as high as 40 to 80% (1–4). Second, the incidence of CRE in the United States has quadrupled over the past decade, with isolates reported from nearly every state and detected in 3.9% of hospitals and 17.8% of long-term acute care facilities (5). CRE strains are transmitted easily in health care settings from patients who have asymptomatic intestinal colonization (4). Third, CRE have the potential to spread antibiotic resistance through plasmid transfer to other bacterial species, including common human flora and potential pathogens such as Escherichia coli (1, 4).

In 1928, Griffith demonstrated that a nonencapsulated pneumococcal strain could be transformed into a virulent, encapsulated type by incubation with heat-killed encapsulated pneumococci (6), a phenomenon later confirmed to occur through DNA transfer. In the 86 years since this discovery, plasmids have been recognized as important mediators of horizontal gene transfer between bacteria and have been exploited extensively as tools for molecular biology. Whereas the mechanism of plasmid transfer in model organisms and laboratory strains has been elucidated, much less is known about clinically relevant plasmid “trafficking” in the hospital setting where the environment, patients, and health care personnel may each serve as reservoirs. Understanding horizontal gene transfer among Enterobacteriaceae is particularly important because most carbapenem resistance genes are found on plasmids (7). Carbapenemase-mediated resistance in the United States is most commonly acquired through the plasmid-encoded β-lactamase blaKPC (KPC) gene. Although evidence-based guidelines to prevent transmission of CRE exist (8), they do not explicitly take into account the mobility and potential spread of the blaKPC gene through plasmid transfer, which may complicate tracking nosocomial transmission.

Comparative genomics has been used to track the dissemination and transmission of a wide variety of antibiotic-resistant bacterial pathogens (9–13). Single nucleotide variant (SNV) analysis has been used to characterize the global spread of pathogens (14, 15) as well as localized outbreaks (4, 16–18). However, the vast majority of these studies were based on shotgun-assembled sequence data or read mapping alone and thus could not distinguish plasmid-encoded genes from chromosomally encoded genes.
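As a rough illustration of what an SNV comparison involves, the sketch below counts differing positions between pre-aligned sequences. It is a minimal example under stated assumptions, not the pipeline used in this study: the function names (snv_distance, pairwise_snv_matrix) and the toy sequences are hypothetical, and real analyses also filter low-confidence calls and recombinant regions.

```python
from itertools import combinations
from typing import Dict, Tuple


def snv_distance(seq_a: str, seq_b: str) -> int:
    """Count positions where two aligned sequences differ.

    Gaps ('-') and ambiguous bases ('N') are skipped so that missing
    data is not counted as a variant. Both inputs are assumed to be
    aligned to the same coordinates already.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    skip = {"-", "N"}
    return sum(
        1
        for a, b in zip(seq_a.upper(), seq_b.upper())
        if a != b and a not in skip and b not in skip
    )


def pairwise_snv_matrix(aligned: Dict[str, str]) -> Dict[Tuple[str, str], int]:
    """Return the SNV distance for every unordered pair of isolates."""
    return {
        (x, y): snv_distance(aligned[x], aligned[y])
        for x, y in combinations(sorted(aligned), 2)
    }


# Toy, made-up sequences (not data from the study).
toy_alignment = {
    "isolate_1": "ACGTACGTAC",
    "isolate_2": "ACGTACGTAC",
    "isolate_3": "ACGAACGTNC",
}
distances = pairwise_snv_matrix(toy_alignment)
for pair, count in distances.items():
    print(pair, count)
```

A count like this only says how many sites differ; it cannot say whether a variant site lies on a plasmid or on the chromosome unless the underlying assembly resolves the two, which is the limitation described above.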

In 2011, a patient colonized with KPC+ Klebsiella pneumoniae was transferred to the National Institutes of Health (NIH) Clinical Center, after which 17 additional patients became colonized, and 6 severely ill patients died from bloodstream infections caused by this multidrug-resistant organism. We traced this hospital outbreak back to three separate transmissions from the index patient (4). We deduced the most likely transmission of KPC+ K. pneumoniae among these 18 patients, integrating epidemiological data (patient location records) and genomic data (SNVs). Co-inheritance of SNVs provided the resolution to infer transmission because the evolutionary rate of these molecular markers is on the same time scale as nosocomial spread. SNVs in patient isolates were identified from high-quality genomic data that did not require fully assembled genomes. Although the outbreak ended in December 2011, we have continued to conduct active surveillance for CRE among patients and the hospital environment (7).

During 2012 to 2013, KPC+ Enterobacteriaceae belonging to six species were cultured from NIH Clinical Center patient and environmental surveillance cultures. Here, we present the genomic detective work required to identify and track plasmids within these isolates in a hospital setting, revealing a complex pattern of introduction and environmental spread of carbapenemase-encoding plasmids.


Overview of hospital surveillance of patients and surfaces

During 2012 to 2013, perirectal swabs and/or throat/groin swabs were collected twice weekly from all patients in the intensive care unit and a high-risk medical ward. Once each month, surveillance cultures were ordered on every inpatient except those housed in locked mental health wards. Beginning July 2012, all patients transferred from other health care facilities underwent surveillance cultures upon admission for two consecutive days and were placed on empiric contact isolation until cultures demonstrated no CRE growth. Beginning in September 2013, admission surveillance cultures were implemented for every patient admitted to non-mental health wards. Patients colonized with KPC+ organisms were co-housed in a single ward with cohorted care (19). Environmental cultures were performed when a KPC+ organism was identified in a patient in whom nosocomial acquisition was suspected. A total of 416 environmental samples and 14,204 surveillance cultures from 1086 patients were collected.

Throughout 2012 and 2013, surveillance cultures identified 10 patients colonized with KPC+ Enterobacteriaceae in our institution (Fig. 1 and Table 1), as well as persistent colonization among patients who had acquired the 2011 outbreak strain. Patients A to K are denoted with letters to symbolize independent introduction, whereas patient 19 is given a number to denote linkage with the 18 patients of our institution’s 2011 outbreak (evidence presented below). Colonization was detected in six patients before or shortly after admission and in four patients after weeks in the hospital, raising concern about possible nosocomial transmission. Two additional isolates were included in this study—these were obtained from patients D and E (Table 1) who had been admitted to our hospital, were discharged, and subsequently sought medical treatment elsewhere and were found at those facilities to be colonized with KPC+ Enterobacteriaceae. Environmental surveillance identified six additional KPC+ Enterobacteriaceae. In total, this study includes 20 isolates: 2 from the 2011 NIH outbreak, 10 from 2012 to 2013 NIH patients, 2 from NIH patients identified at outside institutions, and 6 from environmental surveillance.

Fig. 1. Timeline of KPC+ Enterobacteriaceae organisms identified and sequenced from 2011 to 2013.

Patient code, genus, species, and strain are denoted in each label; for example, “Pt1; KPNIH1” is patient 1, K. pneumoniae strain NIH1, and “Env; ECNIH5” is an environmental isolate, E. cloacae strain NIH5. Detailed data are presented in Table 1. The isolate source is denoted by shape, and the bacterial species is denoted by color. Isolates carrying a variant of the plasmid pKpQIL are marked with a “Q.” Isolates carrying the pKPC-47e backbone (IncN, this study) are marked with an “N.” Isolates connected by lines a and c share genetic similarity, without epidemiologic linkage. Isolates in box b show evidence of an inter-genus exchange of a pR55-like plasmid. The 2011 KPC+ K. pneumoniae outbreak is denoted with a hashed line.

Table 1. Specimen source data for KPC+ isolates.

Patients associated with the 2011 K. pneumoniae clonal strain are numbered (1 to 19), additional patients are indicated by letters (A to K), and environmental samples are indicated by “Env.” NA, not applicable.

Single-molecule, real-time sequencing of 20 KPC+ carbapenemase-encoding Enterobacteriaceae isolates

Whole-genome shotgun data, generated as “short-reads” of 100 to 500 base pairs (bp) with Illumina or 454/Roche platforms, have been used by our group to characterize a number of hospital-associated pathogens including K. pneumoniae and Acinetobacter baumannii (4, 20). However, preliminary analysis of short-read genome sequences quickly proved the difficulty of assembling plasmids, replete with repeats and mobile genetic elements (21). Recombination, driven by mobile genetic elements, made assisted assembly of plasmids based on published references more difficult. In addition, many plasmids are 50 to 200 kb, too large for simple small plasmid DNA purification methods. To characterize the diversity of blaKPC-encoding plasmids and investigate possible horizontal transfer, we explored the utility of single-molecule, real-time (SMRT) sequencing from Pacific Biosciences. SMRT sequencing long-read technology (22) produces highly accurate fully contiguous genomes (23), both critical requirements to infer phylogenetic relationships. SMRT sequencing generates kilobase-long DNA sequences as seeds to which shorter reads are aligned and used for error correction. The resulting highly accurate, long DNA sequences are further assembled to obtain fully contiguous genomes.

In total, we generated 20 finished genome sequences for KPC+ Enterobacteriaceae, including K. pneumoniae, Klebsiella oxytoca, Enterobacter cloacae, E. coli, Citrobacter freundii, and Pantoea sp. (Tables 1 and 2 and table S1). These 20 bacterial genomes varied from 3.9 to 6.2 Mb in chromosome size and carried between one and five plasmids ranging from 9.3 to 379 kb. Most organisms had a single copy of the blaKPC gene. However, KONIH1 (patient C) carried a plasmid with two copies of the blaKPC gene, and ECNIH2 (environmental) carried three different plasmids, each encoding the blaKPC gene. KPNIH33 (patient G) had three copies of the blaKPC gene, one carried on a plasmid and two chromosomal copies. Multiple copies of the blaKPC gene have been found on plasmids previously, and no gene dosage effect was observed (24). The genomic architecture of these isolates with multiple blaKPC genes is clinically relevant, but technically difficult to extract from fragmented genome data.

Table 2. Summary of genomic data for blaKPC carrying plasmids identified in this study.

Genome size corresponds to chromosome. ST indicates the sequence type if a database exists for this species. Sequence types marked with an asterisk are new. The number of different plasmids found in the isolate is indicated in the plasmid count column. The plasmid replicon was identified by the presence of typing genes. Plasmids that are simple variants (SNVs or small indels) of the published sequences (pKpQIL; NC_014016) are named accordingly. Two common plasmid backbones are indicated by letters in parentheses after replicon and correspond to Fig. 1. The last two columns indicate whether the KPC-2 or KPC-3 allele is present and its sequence context. Missing values are indicated by n.a. (not applicable) or n.d. (no data).

We independently validated assembly and nucleotide error rates for SMRT sequencing using data from the OpGen Argus and Illumina MiSeq platforms (see Supplementary Materials) for 10 of these finished genomes. Briefly, OpGen physical maps, produced from ordered maps of restriction enzyme sites, identified one chromosomal inversion in the PacBio genome assemblies caused by repetitive flanking sequence. Manual inspection identified reads supporting the correct orientation, and final chromosome sequences showed complete concordance. To evaluate accuracy at the base pair level, we aligned independently generated MiSeq fragment reads from each isolate to the respective PacBio sequence, followed by analysis with Most Probable Genotype software (25). The final accuracy for each genome exceeded 99.9999% (table S2).
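To put the accuracy figure in more familiar terms, the sketch below converts a count of discordant bases into a consensus accuracy and a Phred-scaled quality value (Q = -10 log10 of the error rate). The genome length and discordant-base count in the example are hypothetical round numbers chosen for illustration; they are not values from table S2, and the function names are assumptions.

```python
import math


def consensus_accuracy(discordant_bases: int, genome_length: int) -> float:
    """Fraction of consensus bases that agree with the validation data."""
    if genome_length <= 0:
        raise ValueError("genome_length must be positive")
    return 1.0 - discordant_bases / genome_length


def phred_quality(accuracy: float) -> float:
    """Phred-scaled quality: Q = -10 * log10(error rate)."""
    error_rate = 1.0 - accuracy
    if error_rate <= 0.0:
        return math.inf  # no observed errors
    return -10.0 * math.log10(error_rate)


# Hypothetical example: 5 discordant bases in a 5,000,000-bp genome.
accuracy = consensus_accuracy(discordant_bases=5, genome_length=5_000_000)
quality = phred_quality(accuracy)
print(f"accuracy = {accuracy:.6%}, Phred Q = {quality:.1f}")
```

Under this definition, an accuracy of 99.9999% corresponds to one discordant base per million and a quality value of about Q60, so exceeding that threshold means fewer than one error per million bases.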

Genome and plasmid sequencing to investigate possible nosocomial transmission

Our 2011 outbreak strain, a sequence type (ST) 258 K. pneumoniae, carried the KPC-3 allele of the blaKPC gene inside the Tn4401a transposon on a 113.6-kb plasmid nearly identical to the published sequence of pKpQIL from a 2006 Israeli isolate (NC_014016) (26). Aligning these two plasmid sequences, we detect only two high-confidence base pair changes, C40547A and T72669C. The C40547A change occurred in an intergenic region between convergently transcribed genes. The T72669C resulted in a silent mutation (TGT to TGC) in the 3′ end of a hypothetical protein. These two SNVs were stably inherited and found in every patient isolate obtained in the 2011 outbreak. Thus, whereas these genomic changes may not be evolutionarily selected, they do provide genetic “fingerprints” of our outbreak strain. Our dominant 2011 KPC+ K. pneumoniae strain contained two additional plasmids, the 15.1-kb pAAC154 plasmid (27) and a 243.8-kb plasmid similar to pKN-LS6 (28). The genomes from the index patient and patient 5 provided accurate and complete references for investigating links between the 2011 outbreak and CREs subsequently detected at our institution.
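The two plasmid SNVs can be treated as a simple fingerprint. The sketch below parses labels of the form C40547A (reference base, position relative to NC_014016, observed base), checks whether a plasmid carries every fingerprint allele, and confirms with the standard genetic code that TGT to TGC is synonymous. It is an illustration under assumptions: the helper names and the position-to-base dictionaries are hypothetical stand-ins for real variant calls.

```python
import re
from typing import Dict, Iterable, Tuple

# Standard genetic code (NCBI translation table 1), bases ordered T, C, A, G.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: _AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(_BASES)
    for j, b in enumerate(_BASES)
    for k, c in enumerate(_BASES)
}


def parse_snv(label: str) -> Tuple[str, int, str]:
    """Split a label such as 'C40547A' into (ref, position, alt)."""
    match = re.fullmatch(r"([ACGT])(\d+)([ACGT])", label)
    if match is None:
        raise ValueError(f"unrecognized SNV label: {label!r}")
    return match.group(1), int(match.group(2)), match.group(3)


def carries_fingerprint(base_at: Dict[int, str], labels: Iterable[str]) -> bool:
    """True if the plasmid has the alternate allele at every fingerprint site.

    base_at maps a reference position to the base called in the plasmid.
    """
    return all(base_at.get(pos) == alt for _, pos, alt in map(parse_snv, labels))


def is_synonymous(codon_ref: str, codon_alt: str) -> bool:
    """True if both codons encode the same amino acid."""
    return CODON_TABLE[codon_ref] == CODON_TABLE[codon_alt]


OUTBREAK_FINGERPRINT = ["C40547A", "T72669C"]

# Hypothetical call sets: one plasmid with both SNVs, one matching the reference.
outbreak_like = {40547: "A", 72669: "C"}
reference_like = {40547: "C", 72669: "T"}

print(carries_fingerprint(outbreak_like, OUTBREAK_FINGERPRINT))
print(carries_fingerprint(reference_like, OUTBREAK_FINGERPRINT))
print(is_synonymous("TGT", "TGC"))  # both codons encode cysteine
```

Requiring every fingerprint site to match is a deliberately strict choice; a plasmid that shares only one of the two SNVs would need closer inspection rather than an automatic call.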

In addition to possible transmission of KPC+ K. pneumoniae between patients within our hospital, we expanded our investigations to explore whether carbapenem resistance was acquired through nosocomial blaKPC plasmid transfer between bacteria of the same or different species. Through these sequencing efforts, we assessed the diversity and complexity of blaKPC carbapenem resistance encoding plasmids. To identify hospital practices associated with increased risk of transmission, we incorporated genomic sequence data with epidemiologic data such as time from admission, previous negative surveillance cultures, and ward location. First, we discuss five cases of patients colonized with KPC+ Klebsiella sp. in which genomic data confirmed or refuted nosocomial transmission.

Patient A was identified as colonized with KPC+ K. pneumoniae 70 days into his hospital stay after 19 negative perirectal surveillance cultures. Colonization was identified while the patient was in the intensive care unit after receiving prolonged broad-spectrum antibiotics, just 2 weeks after we detected the last patient of the 2011 outbreak. Surveillance data put this case into a category for high suspicion of nosocomial transmission. The isolate from patient A (KPNIH27) was ST 34, and hundreds of distributed SNVs distinguished this genome from our outbreak strain (ST 258). This isolate’s carbapenem resistance was encoded by a blaKPC-2 gene, carried on a 268.3-kb plasmid (pKEC-dc3) that bears no similarity to pKpQIL, and was not flanked by a Tn4401 transposon (Table 2). Genomic data did not support nosocomial transmission from an outbreak patient, and yet the origin of this patient’s blaKPC encoding plasmid remains unknown. This case may represent (i) low-level patient colonization on admission that eluded the many perirectal surveillance cultures until selected out by broad-spectrum antibiotics, (ii) transmission from the environment, or (iii) possible transmission from individuals interacting with the patient, such as visitors or health care personnel, who are not included in routine surveillance in our institution. This patient’s isolate could have been characterized with techniques less powerful than complete genome sequencing because multiple molecular diagnostics demonstrate KPNIH27’s dissimilarity to the dominant outbreak strain. However, each sequenced isolate expands the currently limited public database of plasmid sequences and enables the development of future, more accurate molecular diagnostics.

Patient B had intestinal colonization with KPC+ K. pneumoniae detected by a surveillance culture collected 3 days after transfer to our facility from another hospital. Because this patient was new to our institution, epidemiologic evidence pointed to acquisition of the isolate at another health care facility. However, because of concern for possible transmission of the 2011 outbreak strain, we investigated this strain with complete genome sequencing. The outbreak strain (KPNIH1) and this patient’s strain (KPNIH24) shared some similarities, such as classification as the predominant ST 258 (Fig. 2, A and B). Nonetheless, major differences existed at the genomic level: the bacterial chromosomes of patient 1 and patient B have distinct capsular biosynthetic locus sizes of 19,637 and 24,931 bp, respectively. In addition, there were 155 high-confidence SNVs distributed along the bacterial chromosomes of patient 1 and patient B, further supporting that the bacteria were independently introduced. Patient B’s isolate contained the blaKPC-2 gene encoded on an 85.5-kb plasmid (pKPC-484; Fig. 2B), arguing against plasmid transfer from the 2011 strain. However, molecular diagnostics targeted to the KPC gene region might have yielded conflicting data because patient B’s blaKPC gene is flanked by an ~10-kb Tn4401a transposon, embedded in a 30-kb region that is 99.9% identical to pKpQIL (Fig. 2, A and B). The case of patient B demonstrates the value of accurate whole-genome sequencing to exclude definitively nosocomial transmission of a bacterial isolate and blaKPC plasmid.

Fig. 2. Genome sequencing and plasmid composition of Klebsiella isolates (A to E) and isolates associated with an inter-genus plasmid transfer (F to H).

The color of title bar corresponds to the bacterial species from Fig. 1. Patient code or environmental isolate code and chromosome size are denoted in each title bar. Detailed patient code, genus, and species are described in Table 1. Plasmids are colored to highlight three common backbones across isolates (red, pKpQIL; blue, pEC-IMP-like; purple, pR55-like). The blaKPC genes are colored by allele type; KPC-2 is cyan and KPC-3 is magenta. Flanking sequences are colored to differentiate the bla gene context (red, Tn4401a; blue, Tn4401b; green, IS26-TnpA). Additional antibiotic resistance genes are marked in yellow. The positions of SNVs in pKpQIL plasmids, relative to the reference (NC_014016), are marked as black tick marks in (A), (C), and (D).

Patient E, who received treatment at our institution during the 2011 outbreak, had multiple negative surveillance cultures at the time. He was admitted subsequently to another facility and was identified as KPC+ K. pneumoniae colonized during that hospitalization. We requested that the outside facility send the isolate for genomic sequencing so that we could determine whether the organism might have originated in our institution. We envisioned two distinct possibilities: our surveillance culture missed low-level colonization with our outbreak strain, or the patient acquired a new organism after leaving our hospital. Similar to our outbreak strain (KPNIH1), this patient’s isolate (KPR0928) is K. pneumoniae ST 258 with the blaKPC gene encoded on the pKpQIL plasmid (Fig. 2C). Standard classification and typing strategies to differentiate bacterial strains, such as pulsed-field gel electrophoresis, multilocus sequence typing, and polymerase chain reaction (PCR) of plasmid targets, even combined with organism identification and susceptibility results, might not be able to conclude definitively whether these isolates were related. However, full genome analysis revealed that isolates from patient 1 (the index patient in the outbreak) and patient E differ at the capsule biosynthetic locus, with distinct locus sizes of 19,637 and 23,731 bp. In addition, we identified 173 high-confidence SNVs (recombinant regions excluded) between the chromosomes of KPNIH1 (patient 1) and KPR0928 (patient E). Moreover, the pKpQIL plasmid from patient E lacks the C40547A and T72669C SNVs (relative to NC_014016) specific to our institution’s outbreak, and contains an additional eight SNVs (including the KPC-2 allele) and unique insertions/deletions in the pAAC154 plasmid. On the basis of these differences, we conclude that the K. pneumoniae strain and pKpQIL plasmid from patient E are distinct from those of our institution’s 2011 outbreak.

An open question remains whether patient E’s strain is circulating at the other hospital. Supporting this possibility is the genomic similarity of isolates from patient E and patient I, who was transferred to our facility in 2013 from the same medical system as patient E. Patient I was colonized with a nearly identical strain of K. pneumoniae (KPNIH30; Fig. 2D) as patient E, which contained the identical plasmids pKpQIL-531 and pAAC154-a50 as well as a new plasmid encoding additional antibiotic resistance genes. The example of patients E and I highlights the value of detailed sequencing for strain tracking and the potential of this technology to draw connections across time and space. This case exemplifies how detailed sequencing is critical for excluding nosocomial acquisition in one hospital and supporting nosocomial, but not likely direct, transmission in another.

Patient C had seven negative surveillance swabs over 2 months before detection of KPC+ K. oxytoca from an eighth surveillance swab. This isolate (KONIH1) was detected on the 50th day of her second hospitalization. KPC+ K. oxytoca had not been isolated previously from any other patient or environmental source in our institution. KONIH1 contains a pair of blaKPC-2 genes in opposite orientations on a 205.6-kb plasmid (Fig. 2E), which share little sequence similarity or genomic architecture with other sequenced clinical or environmental CRE isolates. A genome assembly based on short-read Illumina MiSeq data for this isolate collapsed both copies of the blaKPC gene into a single chimeric 20.8-kb contig, demonstrating the greater resolution of SMRT sequencing. As in the case of patient A, the plasmid sequence ruled out horizontal gene transfer from any known colonized patients. No other patients have been identified with KPC+ K. oxytoca. In the setting of extensive patient and environmental surveillance, the KPC+ K. oxytoca was likely present on admission and undetected by serial surveillance cultures, possibly because of bacterial titer or relative insensitivity of screening technique or because of an undefined transmission as discussed earlier for patient A. This case highlights potential limitations of surveillance techniques and encourages careful consideration of how frequently surveillance cultures should be collected.

Patient 19 was a hospitalized stem cell transplant recipient who was identified as carrying KPC+ K. pneumoniae from a perirectal surveillance culture in July 2013, following 26 negative cultures over 9 months. The genomic sequence of this patient’s isolate was phylogenetically linked to our institute’s 2011 outbreak strain (KPNIH1). Specifically, patient 19’s KPC+ K. pneumoniae isolate contained the genetic signature of SNVs that had evolved during the long-term colonization of index patient 1 and during our institute’s subsequent outbreak. These genetic data clearly indicated nosocomial transmission, and follow-up sequencing identified the source as a chronically colonized outbreak patient who was concomitantly receiving treatment at our hospital. An identical isolate cultured from the handrail outside the patient’s room provided additional epidemiologic support for nosocomial transmission and suggested a role for contaminated hands as a possible route of transmission. Patient 19 later died of complications related to KPC+ K. pneumoniae infection. This case is the only example from the five suspected cases that resulted in a definitive conclusion that nosocomial transmission had occurred, again underscoring the power of genomic sequencing in the clinical setting (4).

These five cases illustrate the resolution achieved with whole-genome and plasmid sequencing and the ability to use these data in combination with clinical data to resolve individual health care epidemiology puzzles that have great implications for allocation of hospital infection control resources. In the single case of documented nosocomial transmission (patient 19), our institution launched an extensive investigation and heightened infection control measures aimed at containing spread from known colonized patients, such as expanding adherence monitoring and increasing the frequency of surveillance cultures. Other cases, in which sequencing demonstrated that a new isolate did not match a previously identified strain, prompted efforts to improve admission surveillance rather than tracking transmission. Given the limited resources in all health care facilities, sequence data can help focus and direct the use of allotted funds for infection control interventions, thus providing the best patient care.

Transmission of plasmids in the hospital environment

Whereas the KPC+ K. pneumoniae isolated from patient A was not linked to other patients, we did find evidence of plasmid transfer with bacteria in the hospital environment, specifically KPC+ isolates identified in two sinks in this patient’s room. Four days after this patient’s KPC+ K. pneumoniae isolate was identified, KPC+ C. freundii and KPC+ E. cloacae were cultured from a faucet aerator and sink drain, respectively, within the patient’s room. Full-genome assembly of long-read SMRT sequences clarified the chromosomal and plasmid diversity of these isolates.

The blaKPC-2 encoding pR55-like plasmids from patient A (KPNIH27; pKEC-dc3) and the two sinks (CFNIH1; pKEC-a3c and ECNIH2; pKEC-39c) differ only by a very small number of insertion/deletion events (Figs. 2, F to H, and 3), consistent with horizontal plasmid transfer between different genera of Enterobacteriaceae. Relative to pKEC-dc3, the C. freundii blaKPC-2 encoding plasmid pKEC-a3c has an 11-bp deletion upstream of the stbA gene and an expanded class 1 integron. The E. cloacae blaKPC-2 encoding plasmid pKEC-39c carries the same genetic alterations as the C. freundii pKEC-a3c and an additional 47.7-kb insertion encoding more than 80 genes, including an iron transport system. Acquisition of a similar set of genes, including an iron uptake system, has been observed in other plasmids (29) where it has been postulated to improve fitness by facilitating iron uptake in the human host or environment. No SNVs exist across the aligned portions of these plasmid genomes, precluding definitive conclusions about the transmission order between the patient and the environment. The simplest explanation would be that the KPNIH27 isolate was a transient member of the complex microbial community of the sink biofilms.

Fig. 3. Genetic tracking of plasmid transfer between Enterobacteriaceae genera.

(A) KPC+ K. pneumoniae plasmid pKEC-dc3 from patient A. (B) C. freundii plasmid pKEC-a3c from sink aerator. (C) E. cloacae plasmid pKEC-39c from sink drain. Transposase and resolvase genes are brown, tra genes are dark green, blaKPC-2 is cyan, other antibiotic resistance genes are yellow, mercury resistance genes are blue, and iron acquisition genes are orange. Shared regions of homology are indicated in gray shading with a duplication event indicated in pink. Scale bar, 10 kb.

Epidemiologic data would suggest that the patient’s isolate colonized the environment, and there is no evidence that these sink biofilm isolates ever spread further to the hospital environment or other humans. Extensive surveillance carried out in this room and surrounding areas after plumbing was removed and cleaned identified no additional KPC+ organisms.

One troubling observation is that the E. cloacae sink isolate ECNIH2 carries three different blaKPC-encoding plasmids (Fig. 2H and Table 2). In addition to the pKEC-39c plasmid described earlier, ECNIH2 carries the blaKPC-3 gene embedded within a Tn4401b transposon in a 282.4-kb IncHI2 plasmid related to pEC-IMP and a previously undescribed 47.3-kb plasmid. All three plasmids are predicted to encode proteins involved in the stable maintenance of plasmids including restriction/anti-restriction proteins, methylases, and components of plasmid addiction systems. These plasmid maintenance systems, along with other genes that improve bacterial fitness, are barriers to clearance of antibiotic resistance plasmids. As noted before, with KONIH1, resolution of multiple blaKPC genes was not possible with assembled short-read data where only a single complete blaKPC gene was found; however, SMRT sequencing was able to distinguish the three blaKPC genes on three distinct plasmids. The two additional blaKPC-encoding plasmids contained within the E. cloacae sink isolate ECNIH2 do not match any other patient or environmental isolates from our institution, and contribute genetic diversity to the public database of KPC-encoding plasmids.

Genomic evidence discordant with horizontal plasmid transfer

Patient 5 became colonized during the 2011 outbreak at our institution, with his first KPC+ K. pneumoniae isolate (KPNIH10) detected in August 2011. He had undergone no previous perirectal surveillance cultures. A follow-up culture 2 weeks later indicated continued colonization with KPC+ K. pneumoniae, and a culture 3 weeks later subsequently identified the simultaneous presence of KPC+ E. cloacae (ECNIH3). Given the patient’s persistent colonization with the KPC+ K. pneumoniae outbreak strain, we hypothesized that horizontal transfer of the blaKPC gene–encoding plasmid from the outbreak strain to the E. cloacae had occurred. However, genome analysis comparing the sequences of the blaKPC gene encoding plasmids in the K. pneumoniae and E. cloacae strains demonstrated that this was not the case (Fig. 4, A and B). The KPC+ E. cloacae harbored a 50.3-kb incompatibility group N (IncN) plasmid with a blaKPC-2 gene flanked by an IS26 and partial Tn4401 transposon. In contrast, the KPC+ K. pneumoniae isolate harbored pKpQIL, a 113.6-kb IncF plasmid carrying the blaKPC-3 gene within a Tn4401a transposon. These data rule out both transfer of a plasmid between bacteria and transposition of the Tn4401 element. The IncN plasmid (pKPC-47e) is ST 6, but rearranged compared to the closest reference “plasmid 12” (FJ223605) (Fig. 5). In addition, the KPC+ E. cloacae isolate belonged to a new sequence type (ST 97). Together, these data suggest that patient 5 may have been colonized with the KPC+ E. cloacae isolate when he was admitted to our hospital and then became co-colonized with the KPC+ K. pneumoniae outbreak strain via nosocomial transmission.
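The comparison for patient 5 can be laid out as a feature-by-feature check. The sketch below records the plasmid attributes stated in the text (size, incompatibility group, blaKPC allele, and flanking context) and lists which ones disagree. The KpcPlasmid class and discordant_features function are hypothetical names introduced for illustration; the actual conclusion rested on full sequence comparison, not on a four-field summary.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class KpcPlasmid:
    """Summary of a blaKPC-carrying plasmid (illustrative fields only)."""
    name: str
    size_kb: float
    replicon: str
    kpc_allele: str
    gene_context: str


def discordant_features(a: KpcPlasmid, b: KpcPlasmid) -> List[str]:
    """Return the names of the compared features that differ."""
    compared = ("size_kb", "replicon", "kpc_allele", "gene_context")
    return [name for name in compared if getattr(a, name) != getattr(b, name)]


# Values taken from the description of patient 5's two isolates.
kpnih10_plasmid = KpcPlasmid("pKpQIL", 113.6, "IncF", "KPC-3", "Tn4401a")
ecnih3_plasmid = KpcPlasmid(
    "pKPC-47e", 50.3, "IncN", "KPC-2", "IS26 and partial Tn4401"
)

differences = discordant_features(kpnih10_plasmid, ecnih3_plasmid)
print(differences)
```

A size difference alone would not be disqualifying, because related plasmids can gain large insertions, as with the 47.7-kb insertion in pKEC-39c. It is the combined mismatch in replicon type, allele, and transposon context that argues against both plasmid transfer and transposition here.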

Fig. 4. Genome sequencing of isolates with a broad–host range IncN plasmid.

(A to I) Color of title bar corresponds to the bacterial species from Fig. 1. Patient code or environmental isolate code and chromosome size are denoted in each title bar. Detailed patient code, genus, and species are described in Table 1. Plasmids are colored to highlight three common backbones across isolates (red, pKpQIL; blue, pEC-IMP-like; green, pKPC-47e). The blaKPC genes are colored by allele type; KPC-2 is cyan and KPC-3 is magenta. Flanking sequences are colored to differentiate the bla gene context (red, Tn4401a; blue, Tn4401b; green, IS26-TnpA). Additional antibiotic resistance genes are marked in yellow.

Fig. 5. Genetic similarity of IncN plasmids identified across KPC+ E. coli, K. pneumoniae, E. cloacae, and Pantoea sp.

Gray ribbons between panels mark regions of >99% sequence similarity. (A) Reference KPC+ K. pneumoniae plasmid 12. (B) KPC+ E. coli, ECONIH1, plasmid pKPC-629. (C) KPC+ K. pneumoniae, KPNIH29, plasmid pKPC-e4e. (D) KPC+ E. cloacae plasmid pKPC-47e. ECR091, ECNIH3, and ECNIH5 have identical sequences. Small insertions and deletions are marked with arrows for ECNIH4 (1), PSNIH1 (2), and PSNIH2 (3). SNVs are marked with an asterisk. Transposase and resolvase genes are colored brown, tra genes are dark green, blaKPC-2 is cyan, blaKPC-3 is magenta, other antibiotic resistance genes are yellow, mercury resistance genes are blue, and arsenic resistance genes are purple. The expanded iteron repeat is indicated in beige. Scale bar, 10 kb.

Circulation of newly emerging strains of CRE

Plasmid sequencing has also enabled us to identify epidemiological patterns that may otherwise escape notice. When analyzing surveillance data, the possibility that regional strains are circulating among hospitals should be considered as an alternative hypothesis to nosocomial transmission between patients with no discernible epidemiological connection. KPC+ K. pneumoniae ST 258 with blaKPC gene on pKpQIL is a dominant strain in U.S. hospitals, as demonstrated by the epidemiologically unrelated, genetically similar isolates obtained from patients 1 and E. Next, we explore the use of complete genomes to characterize and detect a newly emerging KPC+ E. cloacae strain.

Patient D was admitted to our institution in 2012 and was discharged shortly after a single negative perirectal surveillance culture. She was subsequently admitted to another hospital where KPC+ E. cloacae (ECR091) grew from a urine culture. Comparison of this isolate’s genome to our existing references showed a striking resemblance to ECNIH3 from patient 5. ECR091 and ECNIH3 are the same sequence type and differ by only 12 high-confidence SNVs across the 4.6-Mb chromosomes. These two KPC+ E. cloacae isolates differ in their total plasmid composition (Fig. 4, B and C), but they both carry the pKPC-47e plasmid, which encodes the blaKPC gene. In addition, they each carry an IncHI2 plasmid related to pEC-IMP, with insertions/deletions unique to each isolate. However, there was no discernible epidemiological link between patient 5 and patient D. The KPC+ E. cloacae isolate from patient D was identified over a year after that of patient 5, and 3 months after his most recent hospitalization. The two patients were from disparate geographic areas with no common exposure at another health care facility. In this case, the combination of the close genetic sequence similarity and the lack of a discernible epidemiologic link suggests that the KPC+ E. cloacae ST 97 is a newly recognized strain circulating in U.S. hospitals.

A broad–host range KPC+ IncN plasmid

After identifying the IncN plasmid, pKPC-47e, in patient 5 and patient D (above), we began to screen additional KPC+ isolates for the presence of an IncN, blaKPC plasmid with a multiplex PCR assay using IncN replicon typing primers (30). Between 2011 and 2013, we identified IncN, blaKPC plasmids (Figs. 1 and 4) carried by two additional patient isolates (ECONIH1 and KPNIH29) and four environmental isolates (ECNIH4, ECNIH5, PSNIH1, and PSNIH2). The host range of this plasmid includes E. cloacae, Pantoea sp., K. pneumoniae, and E. coli. Given the potential clinical impact of pKPC-47e, we characterized this plasmid further. pKPC-47e is 99.8% identical over 75% of its length to the published sequence of K. pneumoniae plasmid 12 (NC_011385), a Tn4401b-blaKPC-3 encoding plasmid (Fig. 5). The pKPC-47e plasmid has undergone substantial rearrangement compared with plasmid 12 and differs by an expanded iteron region (direct repeats that play a role in plasmid maintenance), insertion of a region encoding ccgCD, deletion of an arsenic resistance operon, and deletion of a large antimicrobial resistance region (Fig. 5). Deletion of the antimicrobial resistance region disrupts the Tn4401b transposon, replacing the region upstream of the blaKPC gene with an IS26 element. IS26 elements have been associated with mobilization of antibiotic resistance genes and have been shown to modify expression of flanking genes (31–33).

In addition to the dominant pKPC-47e plasmid, two major variants of this plasmid were also detected. The Tn4401b element is intact in both variants, suggesting that pKPC-47e may be a product of a deletion event from one of these plasmids or a third unobserved plasmid. The E. coli plasmid, pKPC-629, differed from pKPC-47e by a 30-kb region, upstream of the blaKPC gene, encoding a Tn1331 transposon and a mercury resistance operon (Fig. 5B). The presence of this KPC+ plasmid in the ST 648 E. coli from patient H was of concern because this sequence type is commonly associated with extraintestinal pathogenic E. coli infections. A second variant of pKPC-47e was identified in K. pneumoniae (Fig. 5C) that is similar to the E. coli plasmid but is missing the mercury resistance operon.

The pKPC-47e backbone was represented frequently in our pool of KPC+ organisms, occurring in 8 of the 20 isolates. Whereas the presence of a common backbone suggests a common source, no epidemiological links were identified among the patients or environmental sites in which these isolates were found. Finally, while determining the distribution of the IncN plasmid in isolates from 2013, three additional KPC+ K. pneumoniae isolates were sequenced, two of which carried new KPC+ plasmids (fig. S1). Notably, two of these isolates (KPNIH33 and KPNIH31) carry chromosomally integrated Tn4401-KPC cassettes.


The NIH Clinical Center is the nation’s largest clinical research hospital, with about 1500 clinical research studies currently in progress. The patient population includes many severely ill and immunocompromised individuals who are particularly vulnerable to health care–associated infections. In addition, patients referred to the NIH Clinical Center are often receiving treatment at multiple health care facilities, increasing their risk of exposure to nosocomial pathogens. The motivation for the current work is to improve the detection and tracking of carbapenemase-producing organisms, to enable clinicians to better assess the threats posed by emerging antibiotic-resistant organisms, and to target infection control efforts to relevant instances of transmission.

Sequencing can elucidate the landscape of bacterial transmission, allowing hospitals to target funds and personnel for infection control interventions that provide the best patient care. On the basis of our sequence data demonstrating that only 1 of the 10 cases represented nosocomial spread, we directed our resources toward surveillance at admission for carbapenemase-producing organisms. In the case of patient 19 in whom hospital transmission was confirmed, we targeted resources to improve adherence to barrier precautions and hand hygiene on a high-risk inpatient ward. The cost of whole-genome sequencing is dwarfed by the other costs associated with outbreaks and their investigations, including the human and financial toll (34) and the loss of patient confidence in the health care facility. Detailed microbial sequencing thus provides actionable data that can improve patient outcomes and utilization of hospital resources.

Our 2012 KPC+ K. pneumoniae study at NIH exploited the synergy between epidemiologic trace data and bacterial genetic analyses to produce an integrated transmission map. Here, we extended this concept of tracking antibiotic-resistant bacterial transmission by investigating the next layer of complexity, the plasmid transmission network. Whereas examples of shared plasmids were detected, the number of independent introductions observed was surprisingly high and indicated a complex network of plasmids with incredible diversity. CRE isolates carry an average of three plasmids for a total of 62 plasmids across 20 isolates. After clustering of similar plasmids, there are 48 plasmids that differ from available reference sequences, resulting in >1% growth in the size of the nonredundant plasmid database. These diverse blaKPC encoding plasmid sequences can inform the development of advanced molecular diagnostics, such as PCR-based assays, to differentiate strains and plasmids (35).

We observed only a few examples of horizontal plasmid transfer, which is somewhat surprising given the selective advantage of antibiotic resistance. Deterrents for plasmid transmission include host range specificity, plasmid incompatibility, clustered regularly interspaced short palindromic repeat (CRISPR) loci, and restriction modification systems. The 22 KPC+ plasmids (ECNIH2 had three KPC+ plasmids) sequenced as part of this study belonged to several classes of replicon incompatibility groups, including IncF, IncA/C, IncN, and IncHI2. Some of these replicons, such as IncN and IncA/C, have been reported to have a broad host range, which agrees with the trends we see across isolates. CRISPR systems, which act to prevent phage infection and plasmid transfer (36), were detected in seven of the sequenced isolates. DNA restriction and modification systems are another barrier to plasmid transfer and one that is uniquely addressable by SMRT sequencing technology (37). Here, pKpQIL plasmid presence is associated with N6-methyladenine present on both the chromosomal and plasmid DNA in the sequence context 5′-m6ACGNNNNNNCTG-3′ (with a second methylated A on the opposite DNA strand, paired with the T in CTG). In the end, the low frequency of transmission is likely the result of molecular barriers to transmission combined with infection control interventions specified by Centers for Disease Control and Prevention (CDC) guidelines.
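The methylation context reported above can be located in an assembled sequence with a simple pattern scan. The sketch below is illustrative only and is not the SMRT kinetic analysis used to detect the modification: it predicts candidate N6-methyladenine positions from sequence alone, assuming the motif 5′-ACGNNNNNNCTG-3′ may occur on either strand. The function name `predicted_m6a_positions` and the coordinate convention (0-based, top-strand coordinates) are our assumptions.

```python
import re

# Motif as written on the top strand, and its reverse complement
# (the motif as it appears on the top strand when it lies on the bottom strand).
# Lookaheads allow overlapping occurrences to be reported.
FWD = re.compile(r"(?=ACG[ACGT]{6}CTG)")
REV = re.compile(r"(?=CAG[ACGT]{6}CGT)")

def predicted_m6a_positions(seq: str):
    """Return sorted (position, strand) pairs of candidate methylated adenines.

    Positions are 0-based top-strand coordinates. A "-" entry marks the
    top-strand T whose bottom-strand partner A is the candidate site.
    """
    seq = seq.upper()
    sites = set()
    for m in FWD.finditer(seq):
        i = m.start()
        sites.add((i, "+"))        # A of ACG on the top strand
        sites.add((i + 10, "-"))   # bottom-strand A paired with the T of CTG
    for m in REV.finditer(seq):
        j = m.start()
        sites.add((j + 1, "+"))    # A of CAG on the top strand
        sites.add((j + 11, "-"))   # bottom-strand A paired with the T of CGT
    return sorted(sites)

example = "TT" + "ACG" + "TTTTTT" + "CTG" + "TT"
print(predicted_m6a_positions(example))
```

Comparing such predicted sites with the positions at which modification is actually observed would indicate how completely the motif is methylated in a given isolate.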

We observed three instances in which genomic data linked epidemiologically unrelated carbapenemase-encoding plasmids. The first example is a pair of KPC+ K. pneumoniae isolates carrying identical pKpQIL-531 plasmids, from patients E and I. These patients did not overlap in our institution, but both had previously received care at the same health care complex, pointing to a possible link across time and space. The second example of a newly emerging strain is a pair of KPC+ E. cloacae isolates from patients with no evident epidemiological connection and nearly identical genomes (12 SNVs), blaKPC-2 containing plasmids, and a second shared plasmid that differs by a single, large deletion. Our assessment of independent introduction is supported by the detection of this same blaKPC-containing plasmid in a broader range of Enterobacteriaceae, including KPC+ K. pneumoniae, E. coli, and Pantoea sp. We are actively screening CREs from patients and the environment for IncN plasmids to document the evolution of this plasmid.

The third example of shared carbapenemase-encoding plasmids was detected across three different bacterial genera (K. pneumoniae, E. cloacae, and C. freundii) with plausible epidemiologic links. The broad host range of a pR55-like plasmid and the possibility that it persists in environmental reservoirs like sink drain biofilms represent a challenge for hospital infection control. In this case, plumbing was removed, disinfected, and replaced, but the extent to which environmental bacteria represent a reservoir for transmission and the optimal method for remediation remain unknown. Several reports have implicated hospital sinks in the nosocomial transmission of multidrug-resistant Gram-negative bacteria, but none have been able to prove definitively whether the sink was the source or the recipient of the transmission (38–40).

One limitation of this study is that our highly detailed plasmid sequence data cannot explain how these common environmental commensals acquired the plasmids, but nonetheless raise concern that KPC plasmid dissemination may occur silently in the hospital environment in the absence of any known colonized patients. Identification of the pKPC-47e backbone in both patient and environmental samples is disquieting for infection control personnel because some host organisms (for example, two nonclonal strains of Pantoea sp.) are capable of establishing a foothold on environmental surfaces, whereas others are human commensals and/or potential pathogens. E. coli, for instance, are a common cause of community-acquired infections, and carbapenemase-producing strains could potentially affect increasing numbers of patients outside health care facilities (41).

A second limitation is the sensitivity of perirectal surveillance cultures. In several cases, KPC+ organisms were detected in patients who had produced multiple negative surveillance cultures prior to a positive culture. For patient 19, genomic and epidemiologic data pointed to nosocomial transmission of KPC+ K. pneumoniae. In all other cases, genomic and epidemiological data did not support nosocomial transmission. In addition, these KPC+ isolates were not observed again even with extensive patient and environmental surveillance performed in our institution, suggesting strong adherence to infection control measures. Ultimately, the genetic and epidemiologic evidence led us to consider that the patients who converted from negative to positive likely represent low-level gastrointestinal colonization on admission that was not detected by surveillance cultures. Perirectal surveillance swabs, which only sample flora of the perianal area, can produce false-negative results due to poor sampling technique or insufficiently sensitive culture methods. Even the most sensitive method cannot detect KPC+ isolates present at low levels in areas of the gastrointestinal tract not accessed by the surveillance swab. Administration of broad-spectrum antibiotics may then have selected for the antimicrobial-resistant KPC+ organisms, yielding a positive culture. Although undetected acquisition is not entirely ruled out, the absence of any matching isolates in a setting of extensive surveillance among a highly immunocompromised patient population points to likely acquisition predating hospital admission.

Over the past decade, there has been a steady and alarming increase in antibiotic-resistant bacteria (42), a trend that poses a serious threat to the U.S. medical system. At the same time, development of antimicrobial drugs has nearly ground to a halt, with only two new antibiotics approved since 2009 (43). In the face of a dwindling selection of drugs to fight health care–associated infections, prevention is critical. In addition to implementing recommended infection control measures such as surveillance, hand hygiene, and barrier precautions, we must find more sophisticated methods to detect, track, and eradicate multidrug-resistant bacteria. Collaboration among physicians who have expertise in health care epidemiology, microbiologists who have expertise in diagnostics, and scientists who have expertise in genomics is critical to take advantage of emerging technologies and translate them into improved patient care.


Environmental and patient CRE surveillance

Patients were screened on admission for risk factors of CRE carriage, including hospitalization in the United States in the past week or abroad in the past 6 months. These higher-risk patients underwent surveillance perirectal cultures on two consecutive days and were placed on empiric contact isolation until cultures demonstrated no growth of carbapenem-resistant bacteria. Patients lacking prior hospitalization risk factors had one surveillance culture upon admission. Patient surveillance swabs were collected as described (19).

Environmental surface swabs were collected using 5 cm × 5 cm squares of sterile gauze moistened with fastidious broth (Hardy Diagnostics) and then, with sterile gloves, applied systematically to surfaces of interest. The gauze was immersed in broth and incubated overnight before culturing. Sink drain cultures were collected by inserting a culturette swab through the drain sieve and rubbing once around the interior walls of the drain.

Specimens were cultured onto RambaCHROM KPC (Gibson Bioscience). Organisms were identified by matrix-assisted laser desorption/ionization–time-of-flight mass spectrometry, using the MicroFlex LT instrument (Bruker Daltonics Inc.) (44). Organism identifications were confirmed or refined by comparing marker gene sequences, specifically the 16S ribosomal RNA and rpoB genes, to curated databases and other fully sequenced genomes. Carbapenemase production was assessed by TaqMan real-time PCR, with KPC-targeted primers adapted from the CDC (45, 46). Antibiotic susceptibility testing was performed by broth microdilution (Phoenix instrument; Becton, Dickinson and Co.) and Kirby-Bauer disc diffusion.

Genomic DNA sequencing and analysis

Genomic DNA was sheared to 10 to 15 kb using Covaris g-tubes (Covaris) and converted into SMRTbell template libraries (47). The library was subsequently subjected to DNA size selection using the BluePippin (Sage Science) to select the longest DNA fragments (lower size cutoff ~5 kb). Sequencing was performed on the PacBio RSII using P4 polymerase binding and C2 sequencing kits with magnetic bead loading and 120-min acquisition. Genome assemblies were performed using HGAP and Quiver (23) as part of SMRTAnalysis version 2.0. SMRT-based assemblies were validated using optical maps and MiSeq read data. A full description of the independent sequence analysis is found in the Supplementary Materials. Briefly, Nextera libraries were generated from the same genomic DNA and sequenced using a paired-end 200-base dual index run on an Illumina MiSeq to generate 1 million to 2 million read pairs per library. Base calling accuracy was determined with the MPG software (25). MiSeq read alignments were visualized with IGV (48), consed (49), and Hawkeye (50) software. Genome annotation was done using the National Center for Biotechnology Information (NCBI) Prokaryotic Genomes Automatic Annotation Pipeline (PGAAP:


Multilocus sequence typing was used to classify isolates from their complete genomes. Alleles for each typing method were downloaded from their respective repositories: K. pneumoniae (Institut Pasteur), E. cloacae (PubMLST), E. coli (University of Warwick), and IncN replicon typing (PubMLST). Newly detected alleles or combinations of alleles were submitted to the appropriate databases for inclusion in future typing schemes. The blaKPC allele was identified by BLAST alignment to reference sequences from NC_011382 (KPC-2) and NC_014016 (KPC-3). The diversity of capsular polysaccharide regions was assessed by determining the size between conserved flanking genes galF and ugd (51), using in silico PCR with the following primers: cpsF, CGCTTCATAATGCCCTGAAT; cpsR, CAAAGGCAATTCCAAAGGAG.
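The in silico PCR step described above can be sketched as a search for the two primer binding sites in a complete genome sequence. This is a minimal illustration under stated assumptions, not the tool used in the study: it requires exact primer matches (real PCR and most in silico PCR programs tolerate mismatches), and the helper names `reverse_complement`, `amplicons_on_strand`, and `amplicon_sizes` are ours. Only the two primer sequences come from the text.

```python
CPS_F = "CGCTTCATAATGCCCTGAAT"  # cpsF primer, as given in the text
CPS_R = "CAAAGGCAATTCCAAAGGAG"  # cpsR primer, as given in the text

def reverse_complement(seq: str) -> str:
    """Reverse complement of an uppercase DNA string (non-ACGT symbols pass through)."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]

def amplicons_on_strand(template: str, fwd: str, rev: str) -> list:
    """Sizes of products where fwd binds upstream of rev on the given strand."""
    rev_site = reverse_complement(rev)  # how the reverse primer site reads on this strand
    sizes = []
    start = template.find(fwd)
    while start != -1:
        end = template.find(rev_site, start + len(fwd))
        if end != -1:
            sizes.append(end + len(rev_site) - start)  # primers included in the size
        start = template.find(fwd, start + 1)
    return sizes

def amplicon_sizes(template: str, fwd: str = CPS_F, rev: str = CPS_R) -> list:
    """Predicted product sizes, checking both orientations of the template."""
    template = template.upper()
    return sorted(
        amplicons_on_strand(template, fwd, rev)
        + amplicons_on_strand(reverse_complement(template), fwd, rev)
    )

# Toy template: 100 bases between the two primer sites
toy = "TTTT" + CPS_F + "A" * 100 + reverse_complement(CPS_R) + "GGGG"
print(amplicon_sizes(toy))
```

Applied to each assembled chromosome, the returned size approximates the length of the capsular polysaccharide region between the primer sites, so differing sizes across isolates indicate differing capsule loci.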

SNV characterization

High-confidence SNVs were identified essentially as described in (4). Briefly, SNVs were extracted from pairwise alignments (NUCmer, v 3.06) of chromosome or plasmid sequences. SNVs were filtered if they were in repetitive regions, flanked by homopolymer runs, within 2 bases of a neighboring SNV or 20 bases of the end of a sequence. We also filtered out single-base insertions and deletions. For SNVs detected between chromosomes, recombinant regions were removed with a sliding window to detect regions of high SNV density. These filtering rules were originally developed to address systematic errors inherent in 454 and other sequencing platforms. Whereas SMRT sequencing has not been shown to have these sorts of systematic errors, we elected for conservative filtering because we were using genomes generated from assembled Sanger, 454, or Illumina data as references.
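The positional filters described above can be sketched as follows. This is a simplified illustration, not the pipeline used in the study: it takes SNVs already extracted from a pairwise alignment, omits the repetitive-region and recombination filters, and assumes a homopolymer run is three or more identical bases (`min_run`), a threshold the text does not state. The `SNV` record and function names are hypothetical.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class SNV:
    pos: int  # 0-based position on the reference sequence
    ref: str  # reference base ("-" or empty for an insertion)
    alt: str  # query base ("-" or empty for a deletion)

def flanked_by_homopolymer(seq: str, pos: int, min_run: int) -> bool:
    """True if a run of min_run identical bases lies immediately left or right of pos."""
    left = seq[max(0, pos - min_run):pos]
    right = seq[pos + 1:pos + 1 + min_run]
    return any(len(s) == min_run and len(set(s)) == 1 for s in (left, right))

def filter_snvs(snvs, seq, min_spacing=2, end_margin=20, min_run=3):
    """Keep substitutions that pass the spacing, sequence-end, and homopolymer filters."""
    # Drop single-base insertions and deletions first (assumption: indels are
    # removed before SNV spacing is evaluated).
    subs = sorted(
        (s for s in snvs if len(s.ref) == 1 and len(s.alt) == 1 and "-" not in (s.ref, s.alt)),
        key=lambda s: s.pos,
    )
    kept = []
    for i, s in enumerate(subs):
        if s.pos < end_margin or s.pos >= len(seq) - end_margin:
            continue  # within 20 bases of a sequence end
        near_prev = i > 0 and s.pos - subs[i - 1].pos <= min_spacing
        near_next = i + 1 < len(subs) and subs[i + 1].pos - s.pos <= min_spacing
        if near_prev or near_next:
            continue  # within 2 bases of a neighboring SNV
        if flanked_by_homopolymer(seq, s.pos, min_run):
            continue
        kept.append(s)
    return kept
```

The conservative defaults mirror the thresholds quoted in the text (2 bases between SNVs, 20 bases from a sequence end); any SNV near another SNV is discarded together with its neighbor rather than keeping one of the pair.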


Materials and Methods

Fig. S1. Plasmid composition of three KPC+ K. pneumoniae from 2013.

Table S1. Assembly metrics.

Table S2. Sequence data summary.

Table S3. PacBio and MiSeq comparison.

Table S4. Plasmid copy number.


Acknowledgments: We thank the staff of the NIH Clinical Center Hospital Epidemiology and Microbiology Services for their many contributions to the infection control methods described in this report. We thank the Segre laboratory members for their thoughtful reading of the manuscript, and J. Fekecs for providing graphic design assistance with figures. Funding: Supported by the National Human Genome Research Institute and NIH Clinical Center Intramural Research Programs and by an NIH Director’s Challenge Award. E.S.S. is supported by a Pharmacology Research Associate Training Fellowship, National Institute of General Medical Sciences. Author contributions: S.C., J.C.M., R.W.B., J.K., D.K.H., K.M.F., T.N.P., and J.A.S. conceived the study. D.K.H. and T.N.P. oversaw hospital epidemiology. A.F.L., J.P.D., and K.M.F. oversaw clinical microbiology. P.J.T., M.P., T.A.C., K.L., Y.S., Y.-C.T., M.B., J.D., S.Y.B., B.S., A.C.Y., J.W.T., G.G.B., R.W.B., J.C.M., J.K., and NISC performed genome sequencing and analysis. C.D. performed molecular analyses. S.C. and E.S.S. performed data analysis. S.C., D.K.H., K.M.F., T.N.P., and J.A.S. wrote the manuscript. Sequencing was performed at Pacific Biosciences, NISC, and Leidos Biomedical Research, Frederick National Laboratory for Cancer Research. Competing interests: T.A.C., K.L., Y.S., Y.-C.T., M.B., and J.K. are employees of Pacific Biosciences, a company commercializing DNA sequencing technologies. Data and materials availability: All sequence data are available under NCBI BioProject PRJNA251756. Isolates can be obtained from K.M.F.; a material transfer agreement is necessary.