Benson, M. A., Ohneck, E. A., Ryan, C., Alonzo, F., Smith, H., Narechania, A., Kolokotronis,. 571. S. O., Satola, S. W., Uhlemann, A. C. & Sebra, R. (2014).
Microbial Genomics Pan-genomic perspective on the evolution of the Staphylococcus aureus USA300 epidemic --Manuscript Draft-Manuscript Number:
MGEN-D-16-00016R1
Full Title:
Pan-genomic perspective on the evolution of the Staphylococcus aureus USA300 epidemic
Article Type:
Research Paper
Section/Category:
Microbial evolution and epidemiology: Population Genomics
Corresponding Author:
Dorota Jamrozy, Ph.D Wellcome Trust Sanger Institute UNITED KINGDOM
First Author:
Dorota Jamrozy, Ph.D
Order of Authors:
Dorota Jamrozy, Ph.D Simon R Harris, PhD Naglaa Mohamed, PhD Sharon J Peacock, PhD, FRCP Charles Y Tan, PhD Julian Parkhill, PhD Annaliesa S Anderson, PhD Matthew T G Holden, PhD
Abstract:
Staphylococcus aureus USA300 represents the dominant community-associated methicillin-resistant S. aureus lineage in the United States (US), where it is a major cause of skin and soft tissue infections. Previous comparative genomic studies have described the population structure and evolution of USA300 based on geographically restricted isolate collections. Here, we investigated the USA300 population by sequencing genomes of a geographically distributed panel of 191 clinical S. aureus isolates belonging to clonal complex 8 (CC8), derived from the Tigecycline Evaluation and Surveillance Trial program. Isolates were collected at 12 healthcare centres across 9 US states in 2004, 2009 or 2010. Reconstruction of evolutionary relationships revealed that CC8 was dominated by USA300 isolates (154/191, 81%), which were heterogeneous and demonstrated limited phylogeographic clustering. Analysis of the USA300 core genomes revealed an increase in median pairwise SNP distance from 62 to 98 between 2004 and 2010, with a stable pattern of above average dN/dS ratios. The phylogeny of the USA300 population indicated that early diversification events led to the formation of nested clades, which arose through cumulative acquisition of predominantly non-synonymous SNPs in various coding sequences. The accessory genome of USA300 was largely homogenous and consisted of elements previously associated with this lineage. We observed an emergence of SCCmec negative and ACME negative USA300 isolates amongst more recent samples, and an increase in the prevalence of φSa5 phage. Together, the analysed S. aureus USA300 collection revealed an evolving pan-genome through increased core genome heterogeneity and temporal variation in the frequency of certain accessory elements
Downloaded from www.microbiologyresearch.org by 107.172.249.178 Powered by Editorial Manager® and IP: ProduXion Manager® from Aries Systems Corporation On: Sun, 15 May 2016 15:08:00
Manuscript Including References (Word document)
1 2
Click here to download Manuscript Including References (Word document) Manuscript_revision_050416.docx
Pan-genomic perspective on the evolution of the Staphylococcus aureus USA300 epidemic
3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26
ABSTRACT Staphylococcus aureus USA300 represents the dominant community-associated methicillinresistant S. aureus lineage in the United States (US), where it is a major cause of skin and soft tissue infections. Previous comparative genomic studies have described the population structure and evolution of USA300 based on geographically restricted isolate collections. Here, we investigated the USA300 population by sequencing genomes of a geographically distributed panel of 191 clinical S. aureus isolates belonging to clonal complex 8 (CC8), derived from the Tigecycline Evaluation and Surveillance Trial program. Isolates were collected at 12 healthcare centres across 9 US states in 2004, 2009 or 2010. Reconstruction of evolutionary relationships revealed that CC8 was dominated by USA300 isolates (154/191, 81%), which were heterogeneous and demonstrated limited phylogeographic clustering. Analysis of the USA300 core genomes revealed an increase in median pairwise SNP distance from 62 to 98 between 2004 and 2010, with a stable pattern of above average dN/dS ratios. The phylogeny of the USA300 population indicated that early diversification events led to the formation of nested clades, which arose through cumulative acquisition of predominantly non-synonymous SNPs in various coding sequences. The accessory genome of USA300 was largely homogenous and consisted of elements previously associated with this lineage. We observed an emergence of SCCmec negative and ACME negative USA300 isolates amongst more recent samples, and an increase in the prevalence of Sa5 prophage. Together, the analysed S. aureus USA300 collection revealed an evolving pan-genome through increased core genome heterogeneity and temporal variation in the frequency of certain accessory elements.
27 28 29
DATA SUMMARY
30 31 32 33
Whole-genome sequence reads and assemblies of Staphylococcus aureus isolates described in this study have been deposited in the European Nucleotide Archive. The accession numbers together with related metadata are provided in Table S1 of Supplementary Material.
34 35 36
I/We confirm all supporting data, code and protocols have been provided within the article or through supplementary data files. ☒
37
Downloaded from www.microbiologyresearch.org by IP: 107.172.249.178 On: Sun, 15 May 2016 15:08:00
38 39
IMPACT STATEMENT
40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55
The expansion of Staphylococcus aureus USA300 and its dominance as a communityassociated MRSA clone within the United States has prompted interest in the genomics of this pathogen. The resolving power of whole genome sequencing has already shed light on the nature of USA300 transmission within defined communities. However, a question of the broader population structure remains. Here, we investigated whole genome sequences of epidemiologically diverse CC8 isolates, inclusive of 154 USA300 samples. Based on this collection we investigated the wider population structure of USA300 and other CC8 isolates and assessed temporal variation in core and accessory genomes. We report that the USA300 population has become increasingly heterogeneous between 2004 and 2010 at the core genome level, accompanied by subtle changes in the accessory genome involving loss of ACME and SCCmec elements and an increase in the prevalence of Sa5 prophage. These findings demonstrate that successful lineages of MRSA are not genetically static and continue to evolve, which may involve loss of molecular markers previously considered to define the lineage. While further surveillance of USA300 genomes is required to confirm wider SCCmec and ACME loss, such genomic changes might impact on the future epidemiology of this important S. aureus clone.
56 57 58
INTRODUCTION
59 60 61 62 63 64 65 66
Staphylococcus aureus is a major cause of human disease worldwide. One of the most successful lineages of this bacterial species is clonal complex 8 (CC8), from which a number of major methicillin-resistant S. aureus (MRSA) clones have emerged (Deurenberg & Stobberingh, 2008). This includes several pandemic MRSA clones associated with causing healthcare-associated (HA) infections as well as community-associated (CA) infections (Monecke et al., 2011). HA-MRSA CC8 clones include historic Archaic and Iberian clones as well as the more contemporaneous USA500 clone, whereas the current USA300 epidemic clone has emerged as the dominant cause of CA infections in the United States (US).
67 68 69 70 71 72 73 74 75 76 77
The first reports of USA300 date back to the early 2000s, when it was identified in a number of localised outbreaks of CA skin and soft tissue infections (SSTI) (Control & Prevention, 2001, Control & Prevention, 2003, McDougal et al., 2003, Tenover et al., 2006). Additional reports followed which demonstrated the widespread dissemination of USA300 across the US (Begier et al., 2004, Kazakova et al., 2005, Miller et al., 2005). Other than SSTI, USA300 has also been reported as a causative agent of CA pneumonia, sepsis, infective endocarditis and osteomyelitis (Francis et al., 2005, Gonzalez et al., 2005, Haque et al., 2007, Seybold et al., 2007). In addition to being responsible for a rising incidence of CA infections, USA300 has also been increasingly reported in association with nasal colonisation (Freitas et al., 2010, Tenover et al., 2008) and HA infections (Gonzalez et al., 2006, Patel et al., 2007, Seybold et al., 2006). In the healthcare setting, USA300 was reported to emerge alongside established
Downloaded from www.microbiologyresearch.org by IP: 107.172.249.178 On: Sun, 15 May 2016 15:08:00
78 79
HA-MRSA clones (Chua et al., 2008, Limbago et al., 2009, Pasquale et al., 2013, Tattevin et al., 2009) or to replace them (Carey et al., 2010, Liu et al., 2008).
80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101
The striking epidemiology of USA300 within the US has prompted a number of genomic studies aimed at the identification of genetic features that might have promoted its transmission, host colonisation and disease pathogenesis. Early whole genome sequence analysis of a single USA300 strain revealed that the accessory genome included a Staphylococcal Cassette Chromosome mec (SCCmec) type IVa element, an arginine catabolic mobile element (ACME) type I, a S. aureus pathogenicity island 5 (SaPI5) and a PantonValentine leukocidin-carrying Sa2 prophage (Diep et al., 2006). These four mobile genetic elements (MGEs) represent common molecular markers of USA300, although analysis of a wider population demonstrated that their carriage is not ubiquitous (Uhlemann et al., 2014). Isolates belonging to USA300 also frequently carry two plasmids: a 27 kb plasmid carrying genes that confer resistance to penicillin (blaZ), erythromycin (msrA and mphBM) bacitracin (bcrA), kanamycin (aphA-3) and streptothricin (sat), and a 3.1 kb cryptic plasmid (Kennedy et al., 2010). The contribution of MGEs to the success of USA300 has been a subject of debate and generation of the hypothesis that the accessory genome has promoted the enhanced transmission of USA300 (Li et al., 2009). In particular, the ACME element has been suggested to promote the survival of USA300 on human skin and persistence within cutaneous abscesses (Thurlow et al., 2013). In contrast, the pathogenesis of USA300 may have been mediated by subtle genetic changes within the core genome (Highlander et al., 2007) together with modulated expression of core virulence determinants such as -toxin, phenolsoluble modulins (PSMs) and an accessory gene regulator (agr) (Li et al., 2009). Core genome SNPs might also result in a significantly different virulence phenotypes between USA300 isolates, associated with variable exoprotein expression (Kennedy et al., 2008).
102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121
Recently, the genomics of USA300 have been studied on a wider scale, through whole genome sequencing of larger collections derived from specific geographic regions (Alam et al., 2015, Uhlemann et al., 2014). This led to significant epidemiological findings such as the identification of households as long-term reservoirs of USA300. Comparative genomic analysis has also allowed the diversification of the USA300 lineage to be studied, including evolutionary events surrounding the emergence of the USA300 clade as well as microevolution within the population (Uhlemann et al., 2014). This exemplifies the power of whole genome sequencing for comprehensive analysis of bacterial pathogens, as it allows an in-depth interrogation of phylogenetic relationships, identification of genetic features associated with virulence and antimicrobial resistance and the reconstruction of significant evolutionary events (Harris et al., 2010, Holden et al., 2013). While previous studies describing the comparative genomics of USA300 focused on geographically restricted isolate collections, here we describe whole genome sequence analysis of S. aureus CC8 derived from 12 US healthcare centres located across 9 states. The isolates were collected in 2004, 2009 or 2010 and derived from the multi-centre Tigecycline Evaluation and Surveillance Trial (T.E.S.T) study and represent consecutive clinical isolates (methicillin-susceptible and methicillin-resistant) from both the community and hospital settings. As USA300 has become widespread across the US, an analysis of a collection from geographically distinct locations has allowed us to study a broader population of epidemiologically unrelated USA300 isolates as well as their microevolution over time. For this we interrogated the
Downloaded from www.microbiologyresearch.org by IP: 107.172.249.178 On: Sun, 15 May 2016 15:08:00
122 123 124
phylogeny of the CC8 lineage, with particular focus on the USA300 clade, followed by analysis of temporal variation within core and accessory genomes as well as investigation of more subtle polymorphisms that define the phylogenetic structure of USA300 population.
125 126 127
METHODS
128
Bacterial isolates
129 130 131 132 133 134 135 136
Isolates analysed here derived from the Tigecycline Evaluation and Surveillance Trial (T.E.S.T.), a global multi-centre surveillance study. The trial was initiated in 2004, and since then sites have been added and removed. For the purpose of conducting a longitudinal assessment of S. aureus isolates associated with disease in the US, we prospectively selected US sites that were participating between 2004 and 2010 (5 sites). Further 7 sites active at the time of study were included. A total of 516 S. aureus isolates derived from 2004, 2009 and 2010 were collected and analysed. All isolates belonging to CC8 (n=191), as determined by multi-locus sequence typing, were selected for this work.
137 138 139 140 141 142 143 144 145 146 147 148
In line with T.E.S.T requirements, the participating sites collected consecutive isolates from patients with a documented infection. Only one isolate per patient was permitted and all body sites were considered an acceptable clinical source. Epidemiological details of the 191 S. aureus CC8 isolates analysed here are presented in Table S1. Briefly, isolates derived from the 12 T.E.S.T. sites that were located across 9 US states (Fig. 1), with 26% (50/191) collected in 2004, 31% (58/191) in 2009 and 43% (83/191) in 2010. For the purpose of this work the collecting sites were called healthcare centre 1 – 12 (HC1 - HC12). The collection consisted of 46 MSSA and 145 MRSA isolates. Isolates were collected from diverse clinical sources, categorised in this work to represent four main infection types defined as wound, invasive, respiratory or other. Isolates were collected from patients aged 0 to 95 in various clinical settings, with samples broadly defined in this study as derived from either inpatient or outpatient.
149 150
Whole genome sequencing
151 152 153 154 155 156
Genomic DNA was isolated using the Qiagen QIAcube system according to manufacturer’s protocol. Tagged DNA libraries were created using a method adapted from a standard Illumina Indexing protocol, as described previously (Hsu et al., 2015). Whole genome sequencing was performed on the Illumina HiSeq 2000 platform with 100 bp paired-end reads. The Illumina sequence data has been submitted to the European Nucleotide Archive (ENA) and the accession numbers are provided in Table S1.
157
Whole genome sequence data analysis
158 159 160
Single nucleotide polymorphisms (SNPs) were detected by mapping the paired-end reads against the S. aureus USA300_FPR3757 reference genome (GenBank accession NC_007793), using SMALT version 0.5.8. (https://www.sanger.ac.uk/resources/software/smalt/). The
Downloaded from www.microbiologyresearch.org by IP: 107.172.249.178 On: Sun, 15 May 2016 15:08:00
161 162 163 164 165 166 167 168 169 170 171 172 173 174
generated whole genome sequence alignment was curated to exclude accessory regions (mobile genetic elements identified for USA300_FPR3757 reference genome) and variable sites associated with recombination, with the latter detected using Gubbins (Croucher et al., 2014). Based on the curated core genome SNP alignment a maximum likelihood phylogenetic tree was constructed using RAxML version 7.8.6 (Stamatakis, 2014) using generalised time reversible (GTR) model with GAMMA method of correction for among site rate variation and 100 bootstrap replications (Harris et al., 2010). De novo assembly of whole genome sequences was performed using Velvet (Zerbino & Birney, 2008) with Velvet Optimiser (https://github.com/Victorian-Bioinformatics-Consortium/VelvetOptimiser.git) and kmers ranging between 66% and 90% of the read length. Contigs of less than 300 bp were removed. The draft assemblies have been submitted to the ENA with accession numbers provided in Table S1. The assembled contigs were annotated using Prokka (Seemann, 2014). The accessory genome was defined as described previously (Harris et al., 2010).
175
Isolate genotyping
176 177 178 179 180 181 182 183
Multi-locus sequence typing was performed on assembled genomes using an in-house developed pipeline based on previously described typing scheme (Enright et al., 2000). Isolates were also spa genotyped by extracting the spa gene variable X region from assembled genomes using previously described primers (Kahl et al., 2005), and determining the spa type with the spaTyper tool (http://spatyper.fortinbras.us). MLST and spa genotypes of all isolates are presented in Table S1. In addition, in silico PCR was used to confirm carriage of prophages (Goerke et al., 2009) and to determine the SCCmec type amongst mecA positive samples (Milheiriço et al., 2007, Milheiriço et al., 2007).
184
Statistical analyses
185 186 187 188 189 190 191
To test the statistical significance of the temporal variation in the prevalence of USA300 isolates as well as selected mobile genetic elements, the likelihood ratio test was applied. The strength of phylogenetic signal of epidemiological trait was measured using phylo.d() function in R package caper based on 1000 permutations. The function computes a D value, which provides a measure of trait dispersion for categorical characters (Fritz & Purvis, 2010). A negative D statistic indicates that a trait is phylogenetically clustered whereas a positive D statistic indicates that a trait is over dispersed across the phylogenetic tree.
192 193 194
RESULTS AND DISCUSSION
195
CC8 population structure overview and temporal variation
196 197 198 199 200
Whole genome sequences of 191 clinical S. aureus isolates belonging to CC8 were analysed, the majority of which belonged to ST8 (179/191, 94%; Table S1). A total of ten single-locus variants (SLV) of ST8 were identified, which included six distinct novel STs. The majority of isolates belonged to spa type t008 (146/191, 76%; Table S1). Alignment of the core genomes (after excluding accessory and regions of predicted recombination, resulting in a core
Downloaded from www.microbiologyresearch.org by IP: 107.172.249.178 On: Sun, 15 May 2016 15:08:00
201 202 203 204 205 206 207 208 209 210 211 212 213 214 215 216 217 218
genome length of 2,592,236 bp) of all CC8 isolates revealed a total of 10651 SNP sites. To identify different sub-lineages, a phylogenetic tree of all CC8 samples was reconstructed with the addition of USA300, USA500 and COL reference genomes (Fig. 2). The tree revealed that the majority of isolates belonged to a single dominant clade (154/191, 81%), which contained the two USA300 reference strains (USA300_FPR3757 and USA300-ISMMS1) and thus was identified to represent the USA300 genotype. The population also contained four isolates that clustered with the USA500 reference strain (2395 USA500) and as such represented the USA500 clone. Six further isolates were closely related to the USA500 clade but descended from a distinct node and were labelled here USA500-like. The majority of these USA500 and USA500-like isolates (7/10) originated from a single healthcare centre (HC10, state of Georgia). The reconstructed phylogeny and detection of USA300 and USA500 genotypes in the context of a diverse CC8 population revealed that the two are only distantly related, in contrast to a previous suggestion that USA500 represents a progenitor of the USA300 clone (Li et al., 2009). A number of non-clonal MSSA CC8 isolates were more closely related to the USA300 clade than the USA500 sub-lineage suggesting that the two MRSA clones arose independently. The structure of CC8 phylogeny presented here is in line with a recently published overview of evolutionary relationships between various CC8 genomes (Boyle-Vavra et al., 2015).
219 220 221 222 223 224 225 226
Isolates belonging to the USA300 clade were detected amongst samples from all healthcare centres. Some genotypic heterogeneity was observed within the USA300 clade, which contained six SLVs of ST8. We also observed a level of spa type diversity amongst USA300 isolates. While the majority belonged to spa type t008 (132/154, 86%), 10 other spa types were identified, including t024 (8/154, 5%) and t681 (3/154, 2%). The detected genotype diversity of USA300 isolates based on multi- and single-locus genotyping underscores the benefit of applying core genome sequence phylogeny for comprehensive identification of clinically significant sub-lineages such as S. aureus USA300.
227 228 229 230 231 232 233 234 235
Focusing on the USA300 clade only, we investigated the strength of phylogenetic signal in the distribution of isolates based on the healthcare centre of origin, year of isolation, patient type and infection type. For this we calculated the D statistic, which provides a measure of trait dispersion. A random distribution of isolates based on geographic origin was observed for the majority of healthcare centres (D range 0.17-1.22), with exception of HC1 (D = -0.68). Phylogenetic clustering was also observed for isolates collected in 2004 (D = -0.08) with random distribution of isolates from 2009 and 2010 (D = 0.86 and D = 0.85, respectively). No phylogenetic signal could be observed based on patient type (outpatient, D = 0.73; inpatient, D = 0.91) or infection type (wound, D = 0.48; respiratory, D = 0.67; invasive, D = 1.1).
236 237 238 239 240 241 242 243
We next investigated whether the CC8 population structure and the distribution of USA300 genotype varied at different sampling time points. For this, isolates collected in 2004 together with 2009 and 2010 collections derived from matched healthcare centres were used (Fig. 1). We found that USA300 was the dominant clone in all years. No significant variation in the distribution of USA300 was observed between 2004 (46/50, 92%) and 2009 (23/24, 96%) collections (p = 0.52). However, the 2010 collection (29/40, 73%) revealed a significantly lower frequency of USA300 genotype in comparison with both 2004 (p < 0.05) and 2009 (p < 0.05). This suggests that the significant drop in the prevalence of USA300
Downloaded from www.microbiologyresearch.org by IP: 107.172.249.178 On: Sun, 15 May 2016 15:08:00
244 245 246 247
observed for 2010 was likely specific to that year rather than a time trend. Outside of the USA300 clade, the USA500-like clone was observed only amongst more recent samples with a single isolate identified in 2009 (1/24, 4%) and a comparatively higher prevalence in 2010 (4/40, 10%).
248
Accessory genome of CC8
249 250 251 252 253
The genomes of 191 S. aureus CC8 isolates were analysed for the distribution of variable regions, followed by annotation to define the accessory component. Overall, a diverse mobile genetic repertoire was identified across the lineage, although isolates belonging to USA300, USA500 and USA500-like clones revealed presence of clade-associated MGEs (Fig. 2).
254 255 256 257 258 259 260 261 262 263 264 265 266 267 268 269 270 271 272 273 274 275 276 277 278
In total, five different prophage types were identified. The most common was Sa3, found in 96% of isolates although variation in carriage of prophage-encoded virulence factors was observed (Fig. S1). Nearly all Sa3-positive USA300 isolates carried the immune evasion cluster (IEC) composed of scn, chp and sak genes, which was also detected amongst some non-USA300 isolates. The second most common prophage, due to the dominance of USA300 within this population, was Sa2. All but one USA300 isolate carried the Panton-Valentine leukocidin-positive Sa2USA300, which was also detected in three non-USA300 isolates. Three further prophages were identified within the analysed CC8 population: Sa1, Sa5 and Sa7. All three were detected at considerably lower frequency than Sa3 or Sa2. Prophage Sa1 was detected sporadically but across the entire CC8 population whereas Sa7 was prevalent mostly amongst non-USA300 MRSA isolates and found in all USA500 and majority of USA500-like (5/6) samples. In contrast, prophage Sa5 was identified predominantly within the USA300 clade (47/154, 30%), and was randomly distributed (D=1.66) across the USA300 phylogeny suggesting multiple acquisition events rather than prevalence due to clonal propagation of Sa5-positive isolates. No putative virulence genes were identified within sequence of any of the three prophages. We investigated whether acquisition of these prophages might affect the bacterial host thorough gene inactivation at the insertion site. Analysis of the chromosomal environment surrounding the site of prophage insertion revealed that Sa1 and Sa7 inserted within intergenic locations downstream of the ironsulphur cluster assembly protein gene sufB (between coding sequences corresponding to USA300_FPR3757 SAUSA300_0822 and SAUSA300_0823), and the heme transport protein gene isdB, (between coding sequences corresponding to USA300_FPR3757 SAUSA300_1027 and SAUSA300_1028) respectively. In contrast, prophage Sa5 were found inserted within a coding sequence responsible for a putative metal-binding protein with iron-sulphur oxidoreductase domain (SAUSA300_1858), which would likely interfere with its expression.
279 280 281 282
Other accessory virulence genes amongst analysed CC8 isolates were associated with various pathogenicity islands, with SaPI5 identified in majority of USA300 isolates (139/154, 90%). Nearly all USA500 (3/4) and USA500-like (5/6) isolates carried the SaPI3, which similarly to SaPI5, encodes the sek and seq enterotoxin genes.
283 284 285
The majority of USA300 isolates carried ACME (137/154, 89%) and SCCmec type IVa elements (130/154, 84%), which were not found in any other isolates. Where the elements were absent this was predicted to have resulted from a deletion event, with corresponding
Downloaded from www.microbiologyresearch.org by IP: 107.172.249.178 On: Sun, 15 May 2016 15:08:00
286 287 288 289 290 291 292 293 294 295 296 297 298 299
isolates dispersed across the phylogenetic tree. In line with a previous report (Uhlemann et al., 2014), an association between the absence of SCCmec and ACME was observed, with majority of ACME negative USA300 isolates also lacking the SCCmec (13/17, 76%). This is not surprising considering the proximal location of these elements in the USA300 chromosome. Nearly all SCCmec positive USA300 isolates carried a complete type IVa element although two isolates carried unusual recombinant variants of SCCmec IVa, which arose through putative recombination of SCCmec with ACME or another SCCmec element (Fig. S2). We also identified one isolate carrying a remnant SCC element lacking the mecA gene that exhibited similarity to SCCpbp4 from Staphylococcus epidermidis ATCC 12228. Finally, non-USA300 mecA-positive isolates carried either SCCmec IVd or SCCmec IVg. The carriage of SCCmec correlated with the methicillin susceptibility phenotype (Table S1) except for a single SCCmec positive isolate, which was identified as MSSA (PFESA2067). Further investigation revealed presence of a single base insertion in mecA gene resulting in a frame shift mutation and premature stop codon, which would likely lead to a loss of mecA expression.
300 301 302 303 304 305 306 307 308 309 310 311
The analysed CC8 population revealed a high prevalence of an accessory region recognised as integrative conjugative element 6013 (ICE6013) (Smyth & Robinson, 2009). The element was detected in all USA300 and USA500 isolates but was absent in all USA500-like samples. The ICE6013 does not appear to carry any specific virulence or resistance genes and lacks insertion specificity (Smyth & Robinson, 2009), integrating at a number of sites amongst the analysed CC8 isolates, including both inter- and intragenic positions (Fig. S3). In the USA300 isolates however, ICE6013 was only found inserted within a putative membrane protein gene (SAUSA300_0606), and thus an insertional inactivation of this particular coding sequence might provide selective advantage to USA300. Interestingly, this particular ICE6013 insertion site differs from the USA300_FPR3757 reference genome (Diep et al., 2006), which also contains this element, but is identical to ICE6013 location within the USA300-ISMMS1 genome (Altman et al., 2014).
312 313 314 315 316 317 318 319 320 321 322 323 324 325 326 327 328 329
There was only a sporadic carriage of resistance-associated transposon elements such as tetM-positive Tn6086, ermA- and spc-positive Tn554, and bla operon-carrying Tn552. All USA500-like isolates carried a 3.3 kb insertion element containing the trimethoprim resistance gene dfrG, not found elsewhere in this collection. Amongst USA300 isolates the antimicrobial resistance genes were carried predominantly on plasmids with the majority of samples (146/154, 95%) carrying a large plasmid resembling a previously reported 27 kb pUSA300HOUMR (Highlander et al., 2007). The plasmid contains genes that confer resistance to various antimicrobials including penicillin (blaZ), kanamycin (aphA-3), streptothricin (sat), bacitracin (bcrA), and erythromycin (msrA, mphBM), and homologous plasmids have been previously reported in other USA300 isolates (Kennedy et al., 2010, Uhlemann et al., 2014). However, amongst the 146 USA300 isolates carrying the pUSA300HOUMR-like plasmid 12 carried a variant that lacked some of the resistance genes described above, with at least five different variants detected (Fig. S1). The pUSA300HOUMR-like plasmid was not confined to the USA300 clade within this collection, and was detected in several other isolates. A few USA300 isolates (8/154, 5%) carried a circa 34 kb plasmid, which partly resembled the 37 kb pUSA03 that contains genes conferring resistance to erythromycin (ermC) and mupirocin (ileS-2) (Diep et al., 2006). At least three variants of this pUSA03-like plasmid were observed (Fig. S1). All lacked the ermC gene, and
Downloaded from www.microbiologyresearch.org by IP: 107.172.249.178 On: Sun, 15 May 2016 15:08:00
330 331 332 333 334
only some carried the ileS-2 resistance determinant. In addition some of the ileS-2 positive variants also carried the kanamycin resistance gene addD. Also detected amongst USA300 isolates (3/154, 2%) was a 2.5 kb ermC-positive pWBG751-like plasmid and a small tetracycline resistance (tetK) plasmid (8/154, 5%), resembling the 4.4 kb pUSA02 (Diep et al., 2006).
335
Temporal variation in the accessory genome
336 337 338 339 340 341 342 343 344 345 346 347 348 349 350 351 352 353 354 355
We further analysed whether the variable distribution of selected MGEs was associated with the year of sampling. For this, we compared the distribution of SCCmec, ACME and Sa5 amongst USA300 isolates collected in 2004 with the 2009 and 2010 collections from matched healthcare centres (Table 1). All USA300 isolates carried SCCmec in 2004, with a single negative isolate in 2009. The frequency of SCCmec carriage was lowest amongst 2010 isolates, which represented a significant drop in comparison with both 2004 (p < 0.05) and 2009 (p < 0.05) groups. Emergence of SCCmec negative USA300 isolates resulting from a sporadic loss of the element has been previously reported, however at lower frequency than observed here (Uhlemann et al., 2014). As such the reduced frequency of SCCmec amongst USA300 isolates from 2010 might be specific to that year. Nevertheless a reported shift in -lactams in favour of MRSA-active drugs when treating SSTI might have influenced the epidemiology of CA-MRSA infections (Talan et al., 2011). Lower frequency of beta-lactam usage could result in a lower selective pressure to maintain SCCmec leading to gradual loss of this element. While USA300 has been recognised predominantly as a MRSA lineage, infections caused by MSSA USA300 have also been reported suggesting that transmission of USA300 may not be entirely dependent on SCCmec carriage (McCaskill et al., 2007, Talan et al., 2011). In support of hypothesis that a shift in epidemiology of MRSA might have occurred around 2010, we also observed an increase in prevalence of non-USA300 SCCmec negative isolates at this sampling time point.
356 357 358 359 360 361
As described above, carriage of ACME was associated with the distribution of SCCmec amongst USA300 isolates. Therefore in line with SCCmec prevalence, loss of ACME was only observed in 2009 and 2010 samples, both groups showing a significantly lower number of ACME positive USA300 isolates when contrasted with the 2004 group (p < 0.05). Although the prevalence of ACME negative USA300 isolates was more notable in 2010, the difference between 2009 and 2010 was not significant (p = 0.39).
362 363 364 365 366 367 368
The sample size of our collection was relatively low and thus the observed here temporal variation in the distribution of SCCmec and ACME among USA300 isolates and the apparent loss of these elements in more recent samples would require further surveillance. However, the occurrence of SCCmec and ACME negative USA300 isolates was not limited to a particular healthcare centre (Table 1). Furthermore, the lack of phylogenetic association between USA300 isolates lacking either SCCmec or ACME suggests that forces driving the loss of these important MGEs are acting upon the wider USA300 population.
369 370 371
In addition to changes in prevalence of SCCmec and ACME, we observed a trend of increasing prophage Sa5 frequency, although only the difference between 2004 and 2010 isolates was statistically significant (p < 0.05). The overall frequency of Sa5 carriage by
Downloaded from www.microbiologyresearch.org by IP: 107.172.249.178 On: Sun, 15 May 2016 15:08:00
372 373 374 375 376
USA300 isolates collected in 2009 and 2010 was higher than the prevalence reported for New York City isolates (16%) (Uhlemann et al., 2014). However, the presence of Sa5 prophage within the latter collection was associated with a particular USA300 sub-clade, suggesting that carriage of this prophage might contribute to expansion of more successful sub-populations (Uhlemann et al., 2014).
377 378 379 380 381 382 383 384 385 386 387
The analysed here USA300 population demonstrated a stable temporal prevalence of antimicrobial resistance-associated MGEs. All isolates collected in 2004 carried the multiple drug resistance pUSA300HOUMR-like plasmid, which was also found in majority of 2009 (20/23, 87%) and 2010 isolates (28/29, 97%) from matched healthcare centres. A dip in pUSA300HOUMR-like plasmid prevalence amongst 2009 isolates was specific to a single centre and thus unlikely to represent a wider trend. It has been previously observed that carriage of aminoglycoside and bacitracin resistance genes associated with the pUSA300HOUMR-like plasmid might mediate resistance to certain topical antimicrobial agents available over-the-counter in the US (Suzuki et al., 2011). It is likely, therefore, that a continuous selective pressure has been maintaining this element within the USA300 population.
388 389 390 391 392 393 394 395 396 397 398 399
We also investigated whether the chromosomally-inserted MGEs carried by USA300 (Sa2, Sa3, SaPI5, ACME, SCCmec IVa and ICE6013) diversified over time leading to temporal variation in the distribution of pairwise SNP distances. To discount recombination-mediated variation, alignments of each MGE sequence were first queried for the presence of recombination regions and if detected, the corresponding isolate was removed from the analysis (up to two isolates for ICE6013, Sa3 and SCCmec IVa were removed). We found that the analysed MGEs were well conserved between the isolates, with a median pairwise SNP distance for all elements equal to 1. When comparing the median of pairwise SNP distances for individual MGEs between the 2004, 2009 and 2010 isolates, temporal variation was observed for all except ACME and ICE6013 elements (Fig. 3). For all an increase of median pairwise SNP distance by 1 was observed at either or both later sampling time points.
400 401 402 403
Outside the USA300 clade, it was noteworthy that the temporally distributed USA500-like sub-population differed markedly in its accessory genome from the closely related USA500 clone and could be distinguished based on SCCmec type, carriage of dfrG insertion fragment and absence of the ICE6013.
404
Core genome microevolution of USA300 clade
405 406 407 408 409 410 411 412 413
We investigated further the core genomes of USA300 isolates and found that the population formed a heterogeneous clade with a median pairwise SNP distance of 88 and a range of 0 165 SNPs. We then investigated if USA300 population has become more heterogeneous over time by comparing pairwise SNP distances of isolates collected in 2004 with the panels from 2009 and 2010 (isolates from matched healthcare centres only). We observed an increase in median pairwise SNP distance values between 2004 (62, MAD = 13) and 2009 (103, MAD = 13) as well as 2010 (98, MAD = 15) groups. As previously reported (Kennedy et al., 2008), the USA300 isolates demonstrate an elevated ratio of non-synonymous-to-synonymous SNPs (dN/dS), associated with the recent clonal expansion of this lineage and accumulation of
Downloaded from www.microbiologyresearch.org by IP: 107.172.249.178 On: Sun, 15 May 2016 15:08:00
414 415 416 417 418 419 420 421 422 423 424 425 426 427 428
SNPs that have not yet been eliminated by purifying selection (Rocha et al., 2006). We observed a dN/dS median of 0.71 (MAD = 0.19) across USA300 core genomes, in comparison to the median 0.22 (MAD = 0.01) dN/dS ratio for non-USA300 isolates. In order to determine the dN/dS ratio of mutations that were acquired independently by the USA300 isolates after each diverged from the last common ancestor, we determined the dN/dS ratios on terminal branches of the USA300 clade. For clonal or very closely related isolates (up to 2 pairwise SNPs distance) a single representative was selected, and isolates that accumulated less than a total of 10 SNPs since divergence from the last common ancestor, or showed either no synonymous or no non-synonymous mutations were excluded. A total of 139 USA300 isolates were analysed and revealed a median dN/dS ratio of 2.6 (MAD = 1.1), which corresponded to a previously reported dN/dS ratio for another USA300 collection (Kennedy et al., 2008). Analysis of temporal changes in dN/dS ratio revealed equally high median values for 2004 (2.33, MAD = 1.23), 2009 (2.5, MAD = 0.4) and 2010 (2.8, MAD = 1.5) altogether indicating that purifying selection has not yet acted upon the USA300 population (Rocha et al., 2006).
429 430 431 432 433 434 435 436 437 438 439 440 441 442 443 444 445 446 447 448 449 450 451 452 453 454 455 456
To interrogate more closely the phylogenetic structure of the USA300 clade, we repeated the phylogeny reconstruction for USA300 isolates only (Fig. 4). Genomic divergence within the USA300 clade, based on branch lengths, was most pronounced on terminal branches and thus was strain-specific. In line with previous studies describing the population structure of USA300 isolates, the generated phylogeny revealed low resolution of inter-strain evolutionary relationships due to short internal branches. We therefore investigated further the clustering patterns of the innermost branches descending immediately from the root of USA300 clade, by focusing on branches that after each evolutionary split gave rise to a majority of the descending population. This revealed a subtle and sequential diversification process involving the formation of nested clades (Fig. 4). Based on the collection analysed in this study, at least six fine-scale divergence events descending from the root of the clade could be identified, each involving acquisition of up to five SNPs (Table 2). Where branches emerged due to multiple SNPs, in each case the mutations were independent in relation to their genomic position and thus did not represent a recombination. Nearly all mutations occurring in protein coding sequence were non-synonymous (11/12) and involved genes of various functions. Four of the non-synonymous SNPs (NS-SNP) occurring across three distinct branches were identified in genes associated with antimicrobial resistance. This included a gene encoding a putative drug resistance transporter ErmB/QacA (SAUSA300_2126), a gene encoding an efflux pump (SAUSA300_2489), recently associated with resistance to linoleic and arachidonic acids that are present on the skin and contribute to the low pH of this environment (Alnaseri et al., 2015) and gyrA (SAUSA300_006) and parC/grlA (SAUSA300_1251) genes associated with fluoroquinolone resistance. The emergence of fluoroquinolone resistance in the USA300 population was previously described and estimated to have occurred around 1995 (Uhlemann et al., 2014). Amongst other genes affected by non-synonymous SNPs was a putative virulence determinant (SAUSA300_1322), which belongs to the cvfC operon and contributes to control of haemolysin production and detergent resistance through positive regulation of thyA gene (Ikuo et al., 2010).
Downloaded from www.microbiologyresearch.org by IP: 107.172.249.178 On: Sun, 15 May 2016 15:08:00
457 458 459 460 461 462 463
To further resolve the observed USA300 tree topology and formation of nested clades, we have reconstructed the phylogeny of the study USA300 isolates together with 329 genomes of previously sequenced USA300 isolates derived from New York city (Uhlemann et al., 2014). The tree topology of this expanded USA300 collection also revealed a nested clade population structure (Fig. S4). The clade-defining mutations corresponded to the polymorphisms described above (Table 2) demonstrating that the early diversification changes identified in this work can be detected across wider USA300 population.
464
Conclusions
465 466 467 468 469 470 471 472 473 474 475 476 477 478 479 480 481 482
This study provides an overview of the S. aureus CC8 population structure based on whole genome sequence analysis of clinical isolates derived from 9 US states, collected in 2004, 2009 and 2010. Unsurprisingly, it was dominated by the USA300 clone. The non-USA300 isolates represented a fifth of the CC8 population and included the USA500 clone as well as a closely related USA500-like genotype with a distinct accessory genome composition, which emerged at the later sampling time point. The USA500 clone emerged prior to the appearance and dissemination of USA300 genotype but has been diminishing in prevalence since (David et al., 2015, Nelson et al., 2015). This trend was also observed in our collection, with the majority of USA500 isolates identified amongst the 2004 samples. Furthermore, the distribution of both USA500 and USA500-like isolates was geographically restricted as most originated from a single site (HC10). This is in contrast to geographically dispersed USA300 isolates. There may also be additional regional differences associated with USA500 that we did not detect owing to the majority of USA500 isolates being detected from just one site. For example Benson et al. (2014) identified that most of USA500 isolates obtained from patients in New York area harboured the IS256 element located in close proximity to the rot gene. In our analysis we identified the presence of this IS element in the genomes of all USA500 isolates and a single USA500-like strain. However, insertion of IS256 next to rot was observed for only one of the USA500 strains.
483 484 485 486 487 488 489 490 491 492 493
The analysis of temporal variation in population structure of CC8 isolates indicated a significant drop in the prevalence of USA300 around 2010, in comparison with both 2004 and 2009. However, due to the limitations of our study such as narrow sampling time points, small sample size and lack of isolates representing the Western US we could not verify if this dip in frequency of USA300 clone represented a broader event or a time trend. A recent review reported that there has been an overall increase in spatio-temporal prevalence of USA300 across the US between 2000 and 2013 in the context of MRSA epidemiology (Carrel et al., 2015). However, regional differences as well as year-on-year fluctuations were also observed demonstrating a need for a national surveillance program that would allow more accurate monitoring of the epidemiology of clinically significant S. aureus clones such as USA300.
494 495 496 497 498 499
Based on this geographically diverse collection of CC8 isolates, we investigated the USA300 clade for evidence of microevolution. The observed limited phylogeographic clustering suggests a general lack of geographical expansion events resulting from emergence of locally adapted clones. This is in agreement with findings that at a local community level, multiple introductions of USA300 occur (Uhlemann et al., 2014). Analysis of core genomes revealed that USA300 isolates are becoming more heterogeneous as defined by a temporal increase
Downloaded from www.microbiologyresearch.org by IP: 107.172.249.178 On: Sun, 15 May 2016 15:08:00
500 501 502 503 504 505 506 507 508
in pairwise SNP distances, which is underlined by a stable pattern of elevated dN/dS ratios. However, an analysis of the wider phylogenetic relationships defined by branches descending immediately from the root of the clade revealed the presence of subtle polymorphisms shared across the population. The pattern of accumulative SNP acquisition as defined by the formation of nested clades suggests that the USA300 ancestral isolates continued to diversify shortly after the emergence of the clade but prior to its clonal expansion. Although determining the phenotypic significance of the acquired SNPs is beyond the scope of this work, we speculate that at least some might represent adaptive evolution such as development of antimicrobial resistance.
509 510 511 512 513 514 515 516 517 518 519 520 521 522 523 524 525 526 527 528 529 530 531 532 533
Our data also suggests temporal changes in the composition of USA300 accessory genome involving small-scale loss of SCCmec and ACME elements and increased frequency of Sa5 carriage. These findings are subject to the previously described limitations of this study and thus might not be indicative of a continuous temporal trend of USA300 accessory genome re-shuffle. Still, the analysed here collection represents an unbiased sample of S. aureus CC8 population derived from diverse clinical sources, providing unique insight into the broader epidemiology and evolution of USA300 in the context of its parental lineage. This allowed us to capture the apparent instability of SCCmec and ACME carriage amongst more recent USA300 isolates. This is significant considering some strategies for identification of USA300 isolates based on defined genetic markers such as carriage of SCCmec or ACME (Alam et al., 2015, David et al., 2013), which might hinder identification of emerging USA300 variants that have lost these MGEs leading to their under-reporting. Also, taking into consideration the putative role of SCCmec and ACME in the successful dissemination of S. aureus USA300 it will be important to understand what leads to the loss of these elements. We speculate that the loss of SCCmec may have been driven by changes in antimicrobial prescription practices in the US. It is more challenging to hypothesize what factors could have contributed to the drop in ACME prevalence. However, the observation that absence of SCCmec and ACME coincided in majority of isolates, suggests that the elements are frequently lost in a single deletion event. This scenario is supported by previous findings that the CcrAB recombinases of SCCmec can mobilise ACME as well as catalyse an excision of the entire ACME-SCCmec composite island (Diep et al., 2008). While further work is required to investigate how this might impact on the epidemiology of USA300, the analysis presented here of spatially diverse USA300 isolates gives a new insight into the wider evolutionary changes occurring within this highly successful sub-lineage, and provides ground for further surveillance of this clone.
534 535 536
ACKNOWLEDGEMENTS
537 538 539
We thank the core sequencing and informatics teams at the Sanger Institute for their assistance as well as the following employees of Pfizer for their helpful suggestions: Li Hao, Robert Donald and Elizabeth Bieger. SRH and JP were supported by Wellcome Trust grant
Downloaded from www.microbiologyresearch.org by IP: 107.172.249.178 On: Sun, 15 May 2016 15:08:00
540 541
098051. This work was funded by Pfizer Vaccine Research and Development. NM, CYT and ASA are employees of Pfizer and may own stock in the company.
542 543 544
ABBREVIATIONS
545 546 547 548 549 550 551 552
ACME: arginine catabolic mobile element; bp: base pair; CA-MRSA: community-associated methicillin-resistance Staphylococcus aureus; CC: clonal complex; HA-MRSA: hospitalassociated methicillin-resistant Staphylococcus aureus; HC: healthcare centre; ICE6013: integrative conjugative element 6013; MAD – median absolute deviation; MGE: mobile genetic element; MRSA: methicillin-resistance Staphylococcus aureus; MSSA: methicillinsusceptible Staphylococcus aureus; SaPI: staphylococcal pathogenicity island; SCCmec: Staphylococcal Cassette Chromosome mec; SNP: single nucleotide polymorphism; SLV: single locus variant; ST: sequence type; T.E.S.T.: Tigecycline Evaluation and Surveillance Trial
553 554 555
REFERENCES
556 557 558
Alam, M. T., Read, T. D., Petit, R. A., Boyle-Vavra, S., Miller, L. G., Eells, S. J., Daum, R. S. & David, M. Z. (2015). Transmission and microevolution of USA300 MRSA in US households: evidence from whole-genome sequencing. Mbio 6, e00054-00015.
559 560 561 562
Alnaseri, H., Arsic, B., Schneider, J. E., Kaiser, J. C., Scinocca, Z. C., Heinrichs, D. E. & McGavin, M. J. (2015). Inducible expression of a resistance-nodulation-division-type efflux pump in Staphylococcus aureus provides resistance to linoleic and arachidonic acids. J Bacteriol 197, 1893-1905.
563
Altman, D., Sebra, R., Hand, J., Attie, O., Deikus, G., Carpini, K., Patel, G., Rana, M.,
564
Arvelakis, A. & Grewal, P. (2014). Transmission of Methicillin‐Resistant Staphylococcus
565 566
aureus via Deceased Donor Liver Transplantation Confirmed by Whole Genome Sequencing. Am J Transplant 14, 2640-2644.
567 568 569 570
Begier, E. M., Frenette, K., Barrett, N. L., Mshar, P., Petit, S., Boxrud, D. J., Watkins-Colwell, K., Wheeler, S., Cebelinski, E. A. & Glennen, A. (2004). A high-morbidity outbreak of methicillin-resistant Staphylococcus aureus among players on a college football team, facilitated by cosmetic body shaving and turf burns. Clin Infect Dis 39, 1446-1453.
571 572 573
Benson, M. A., Ohneck, E. A., Ryan, C., Alonzo, F., Smith, H., Narechania, A., Kolokotronis, S. O., Satola, S. W., Uhlemann, A. C. & Sebra, R. (2014). Evolution of hypervirulence by a MRSA clone through acquisition of a transposable element. Mol Microbiol 93, 664-681.
574 575 576 577
Boyle-Vavra, S., Li, X., Alam, M. T., Read, T. D., Sieth, J., Cywes-Bentley, C., Dobbins, G., David, M. Z., Kumar, N. & Eells, S. J. (2015). USA300 and USA500 Clonal Lineages of Staphylococcus aureus Do Not Produce a Capsular Polysaccharide Due to Conserved Mutations in the cap5 Locus. mBio 6, e02585-02514.
Downloaded from www.microbiologyresearch.org by IP: 107.172.249.178 On: Sun, 15 May 2016 15:08:00
578 579 580
Carey, A. J., Della-Latta, P., Huard, R., Wu, F., Graham, P. L., Carp, D. & Saiman, L. (2010). Changes in the molecular epidemiological characteristics of methicillin-resistant Staphylococcus aureus in a neonatal intensive care unit. Infect Control 31, 613-619.
581 582
Carrel, M., Perencevich, E. N. & David, M. Z. (2015). USA300 Methicillin-Resistant Staphylococcus aureus, United States, 2000–2013. Emerg Infect Dis 21, 1973.
583 584 585
Chua, T., Moore, C. L., Perri, M. B., Donabedian, S. M., Masch, W., Vager, D., Davis, S. L., Lulek, K., Zimnicki, B. & Zervos, M. J. (2008). Molecular epidemiology of methicillin-resistant Staphylococcus aureus bloodstream isolates in urban Detroit. J Clin Microbiol 46, 2345-2352.
586 587 588
Control, C. f. D. & Prevention (2001). Methicillin-resistant Staphylococcus aureus skin or soft tissue infections in a state prison--Mississippi, 2000. MMWR Morbidity and mortality weekly report 50, 919.
589 590 591
Control, C. f. D. & Prevention (2003). Methicillin-resistant Staphylococcus aureus infections among competitive sports participants--Colorado, Indiana, Pennsylvania, and Los Angeles County, 2000-2003. MMWR Morbidity and mortality weekly report 52, 793.
592 593 594
Croucher, N. J., Page, A. J., Connor, T. R., Delaney, A. J., Keane, J. A., Bentley, S. D., Parkhill, J. & Harris, S. R. (2014). Rapid phylogenetic analysis of large samples of recombinant bacterial whole genome sequences using Gubbins. Nucleic Acids Res, gku1196.
595 596 597 598 599
David, M. Z., Taylor, A., Lynfield, R., Boxrud, D. J., Short, G., Zychowski, D., Boyle-Vavra, S. & Daum, R. S. (2013). Comparing pulsed-field gel electrophoresis with multilocus sequence typing, spa typing, staphylococcal cassette chromosome mec (SCCmec) typing, and PCR for Panton-Valentine leukocidin, arcA, and opp3 in methicillin-resistant Staphylococcus aureus isolates at a US medical center. J Clin Microbiol 51, 814-819.
600 601 602 603
David, M. Z., Acree, M. E., Sieth, J. J., Boxrud, D. J., Dobbins, G., Lynfield, R., Boyle-Vavra, S. & Daum, R. S. (2015). Pediatric Staphylococcus aureus Isolate Genotypes and Infections from the Dawn of the Community-Associated Methicillin-Resistant S. aureus Epidemic Era in Chicago, 1994 to 1997. J Clin Microbiol 53, 2486-2491.
604 605
Deurenberg, R. H. & Stobberingh, E. E. (2008). The evolution of Staphylococcus aureus. Infect Genet Evol 8, 747-763.
606 607 608 609
Diep, B. A., Gill, S. R., Chang, R. F., Phan, T. H., Chen, J. H., Davidson, M. G., Lin, F., Lin, J., Carleton, H. A. & Mongodin, E. F. (2006). Complete genome sequence of USA300, an epidemic clone of community-acquired meticillin-resistant Staphylococcus aureus. The Lancet 367, 731-739.
610 611 612 613 614
Diep, B. A., Stone, G. G., Basuino, L., Graber, C. J., Miller, A., des Etages, S.-A., Jones, A., Palazzolo-Ballance, A. M., Perdreau-Remington, F. & Sensabaugh, G. F. (2008). The arginine catabolic mobile element and staphylococcal chromosomal cassette mec linkage: convergence of virulence and resistance in the USA300 clone of methicillin-resistant Staphylococcus aureus. J Infect Dis 197, 1523-1530.
615 616 617
Enright, M. C., Day, N. P., Davies, C. E., Peacock, S. J. & Spratt, B. G. (2000). Multilocus sequence typing for characterization of methicillin-resistant and methicillin-susceptible clones ofStaphylococcus aureus. J Clin Microbiol 38, 1008-1015.
Downloaded from www.microbiologyresearch.org by IP: 107.172.249.178 On: Sun, 15 May 2016 15:08:00
618 619 620 621
Francis, J. S., Doherty, M. C., Lopatin, U., Johnston, C. P., Sinha, G., Ross, T., Cai, M., Hansel, N. N., Perl, T. & Ticehurst, J. R. (2005). Severe community-onset pneumonia in healthy adults caused by methicillin-resistant Staphylococcus aureus carrying the Panton-Valentine leukocidin genes. Clin Infect Dis 40, 100-107.
622 623 624
Freitas, E. A., Harris, R. M., Blake, R. K. & Salgado, C. D. (2010). Prevalence of USA300 strain type of methicillin-resistant Staphylococcus aureus among patients with nasal colonization identified with active surveillance. Infect Control 31, 469-475.
625 626
Fritz, S. A. & Purvis, A. (2010). Selectivity in mammalian extinction risk and threat types: a new measure of phylogenetic signal strength in binary traits. Conserv Biol 24, 1042-1051.
627 628 629
Goerke, C., Pantucek, R., Holtfreter, S., Schulte, B., Zink, M., Grumann, D., Bröker, B. M., Doskar, J. & Wolz, C. (2009). Diversity of prophages in dominant Staphylococcus aureus clonal lineages. J Bacteriol 191, 3462-3468.
630 631 632 633
Gonzalez, B. E., Martinez-Aguilar, G., Hulten, K. G., Hammerman, W. A., Coss-Bu, J., AvalosMishaan, A., Mason, E. O. & Kaplan, S. L. (2005). Severe staphylococcal sepsis in adolescents in the era of community-acquired methicillin-resistant Staphylococcus aureus. Pediatrics 115, 642-648.
634 635 636
Gonzalez, B. E., Rueda, A. M., Shelburne, S. A., Musher, D. M., Hamill, R. J. & Hultén, K. G. (2006). Community-associated strains of methicillin-resistant Staphylococccus aureus as the cause of healthcare-associated infection. Infect Control 27, 1051-1056.
637 638 639
Haque, N., Davis, S., Manierski, C., Vager, D., Donabedian, S., Perri, M., Sabbagh, R., Cheema, F. & Zervos, M. (2007). Infective endocarditis caused by USA300 methicillinresistant Staphylococcus aureus (MRSA). Int J Antimicrob Agents 30, 72-77.
640 641 642
Harris, S. R., Feil, E. J., Holden, M. T., Quail, M. A., Nickerson, E. K., Chantratita, N., Gardete, S., Tavares, A., Day, N. & Lindsay, J. A. (2010). Evolution of MRSA during hospital transmission and intercontinental spread. Science 327, 469-474.
643 644 645
Highlander, S. K., Hultén, K. G., Qin, X., Jiang, H., Yerrapragada, S., Mason, E. O., Shang, Y., Williams, T. M., Fortunov, R. M. & Liu, Y. (2007). Subtle genetic changes enhance virulence of methicillin resistant and sensitive Staphylococcus aureus. BMC Microbiol 7, 99.
646 647 648 649
Holden, M. T., Hsu, L.-Y., Kurt, K., Weinert, L. A., Mather, A. E., Harris, S. R., Strommenger, B., Layer, F., Witte, W. & de Lencastre, H. (2013). A genomic portrait of the emergence, evolution, and global spread of a methicillin-resistant Staphylococcus aureus pandemic. Genome Res 23, 653-664.
650 651 652
Hsu, L.-Y., Harris, S. R., Chlebowicz, M. A., Lindsay, J. A., Koh, T.-H., Krishnan, P., Tan, T.-Y., Hon, P.-Y., Grubb, W. B. & Bentley, S. D. (2015). Evolutionary dynamics of methicillinresistant Staphylococcus aureus within a healthcare system. Genome Biol 16, 81.
653 654
Ikuo, M., Kaito, C. & Sekimizu, K. (2010). The cvfC operon of Staphylococcus aureus contributes to virulence via expression of the thyA gene. Microb Pathog 49, 1-7.
655 656 657 658
Kahl, B. C., Mellmann, A., Deiwick, S., Peters, G. & Harmsen, D. (2005). Variation of the polymorphic region X of the protein A gene during persistent airway infection of cystic fibrosis patients reflects two independent mechanisms of genetic change in Staphylococcus aureus. J Clin Microbiol 43, 502-505.
Downloaded from www.microbiologyresearch.org by IP: 107.172.249.178 On: Sun, 15 May 2016 15:08:00
659 660 661
Kazakova, S. V., Hageman, J. C., Matava, M., Srinivasan, A., Phelan, L., Garfinkel, B., Boo, T., McAllister, S., Anderson, J. & Jensen, B. (2005). A clone of methicillin-resistant Staphylococcus aureus among professional football players. N Engl J Med 352, 468-475.
662 663 664 665
Kennedy, A. D., Otto, M., Braughton, K. R., Whitney, A. R., Chen, L., Mathema, B., Mediavilla, J. R., Byrne, K. A., Parkins, L. D. & Tenover, F. C. (2008). Epidemic communityassociated methicillin-resistant Staphylococcus aureus: recent clonal expansion and diversification. Proceedings of the National Academy of Sciences 105, 1327-1332.
666 667 668 669 670
Kennedy, A. D., Porcella, S. F., Martens, C., Whitney, A. R., Braughton, K. R., Chen, L., Craig, C. T., Tenover, F. C., Kreiswirth, B. N. & Musser, J. M. (2010). Complete nucleotide sequence analysis of plasmids in strains of Staphylococcus aureus clone USA300 reveals a high level of identity among isolates with closely related core genome sequences. J Clin Microbiol 48, 4504-4511.
671 672 673 674
Li, M., Diep, B. A., Villaruz, A. E., Braughton, K. R., Jiang, X., DeLeo, F. R., Chambers, H. F., Lu, Y. & Otto, M. (2009). Evolution of virulence in epidemic community-associated methicillin-resistant Staphylococcus aureus. Proceedings of the National Academy of Sciences 106, 5883-5888.
675 676 677 678
Limbago, B., Fosheim, G. E., Schoonover, V., Crane, C. E., Nadle, J., Petit, S., Heltzel, D., Ray, S. M., Harrison, L. H. & Lynfield, R. (2009). Characterization of methicillin-resistant Staphylococcus aureus isolates collected in 2005 and 2006 from patients with invasive disease: a population-based analysis. J Clin Microbiol 47, 1344-1351.
679 680 681 682
Liu, C., Graber, C. J., Karr, M., Diep, B. A., Basuino, L., Schwartz, B. S., Enright, M. C., O'Hanlon, S. J., Thomas, J. C. & Perdreau-Remington, F. (2008). A population-based study of the incidence and molecular epidemiology of methicillin-resistant Staphylococcus aureus disease in San Francisco, 2004–2005. Clin Infect Dis 46, 1637-1646.
683 684 685 686
McCaskill, M. L., Mason Jr, E. O., Kaplan, S. L., Hammerman, W., Lamberth, L. B. & Hultén, K. G. (2007). Increase of the USA300 clone among community-acquired methicillinsusceptible Staphylococcus aureus causing invasive infections. The Pediatric infectious disease journal 26, 1122-1127.
687 688 689 690
McDougal, L. K., Steward, C. D., Killgore, G. E., Chaitram, J. M., McAllister, S. K. & Tenover, F. C. (2003). Pulsed-field gel electrophoresis typing of oxacillin-resistant Staphylococcus aureus isolates from the United States: establishing a national database. J Clin Microbiol 41, 5113-5120.
691 692 693
Milheiriço, C., Oliveira, D. C. & de Lencastre, H. (2007). Multiplex PCR strategy for subtyping the staphylococcal cassette chromosome mec type IV in methicillin-resistant Staphylococcus aureus:‘SCCmec IV multiplex’. J Antimicrob Chemother 60, 42-48.
694 695 696
Milheiriço, C., Oliveira, D. C. & de Lencastre, H. (2007). Update to the multiplex PCR strategy for assignment of mec element types in Staphylococcus aureus. Antimicrob Agents Chemother 51, 3374-3377.
697 698 699 700
Miller, L. G., Perdreau-Remington, F., Rieg, G., Mehdi, S., Perlroth, J., Bayer, A. S., Tang, A. W., Phung, T. O. & Spellberg, B. (2005). Necrotizing fasciitis caused by communityassociated methicillin-resistant Staphylococcus aureus in Los Angeles. N Engl J Med 352, 1445-1453.
Downloaded from www.microbiologyresearch.org by IP: 107.172.249.178 On: Sun, 15 May 2016 15:08:00
701 702 703
Monecke, S., Coombs, G., Shore, A. C., Coleman, D. C., Akpaka, P., Borg, M., Chow, H., Ip, M., Jatzwauk, L. & Jonas, D. (2011). A field guide to pandemic, epidemic and sporadic clones of methicillin-resistant Staphylococcus aureus. PLoS One 6, e17936.
704 705 706 707
Nelson, M. U., Bizzarro, M. J., Baltimore, R. S., Dembry, L. M. & Gallagher, P. G. (2015). Clinical and Molecular Epidemiology of Methicillin-Resistant Staphylococcus aureus in a Neonatal Intensive Care Unit in the Decade Following Implementation of an Active Detection and Isolation Program. J Clin Microbiol, JCM. 00470-00415.
708 709 710 711
Pasquale, T. R., Jabrocki, B., Salstrom, S.-J., Wiemken, T. L., Peyrani, P., Haque, N. Z., Scerpella, E. G., Ford, K. D., Zervos, M. J. & Ramirez, J. A. (2013). Emergence of methicillinresistant Staphylococcus aureus USA300 genotype as a major cause of late-onset nosocomial pneumonia in intensive care patients in the USA. Int J Infect Dis 17, e398-e403.
712 713 714
Patel, M., Kumar, R. A., Stamm, A. M., Hoesley, C. J., Moser, S. A. & Waites, K. B. (2007). USA300 genotype community-associated methicillin-resistant Staphylococcus aureus as a cause of surgical site infections. J Clin Microbiol 45, 3431-3433.
715 716 717
Rocha, E. P., Smith, J. M., Hurst, L. D., Holden, M. T., Cooper, J. E., Smith, N. H. & Feil, E. J. (2006). Comparisons of dN/dS are time dependent for closely related bacterial genomes. J Theor Biol 239, 226-235.
718
Seemann, T. (2014). Prokka: rapid prokaryotic genome annotation. Bioinformatics, btu153.
719 720 721 722
Seybold, U., Kourbatova, E. V., Johnson, J. G., Halvosa, S. J., Wang, Y. F., King, M. D., Ray, S. M. & Blumberg, H. M. (2006). Emergence of community-associated methicillin-resistant Staphylococcus aureus USA300 genotype as a major cause of health care—associated blood stream infections. Clin Infect Dis 42, 647-656.
723 724 725
Seybold, U., Talati, N., Kizilbash, Q., Shah, M., Blumberg, H. & Franco-Paredes, C. (2007). Hematogenous osteomyelitis mimicking osteosarcoma due to Community Associated Methicillin-Resistant Staphylococcus aureus. Infection 35, 190-193.
726 727
Smyth, D. S. & Robinson, D. A. (2009). Integrative and sequence characteristics of a novel genetic element, ICE6013, in Staphylococcus aureus. J Bacteriol 191, 5964-5975.
728 729
Stamatakis, A. (2014). RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics 30, 1312-1313.
730 731 732
Suzuki, M., Yamada, K., Nagao, M., Aoki, E., Matsumoto, M., Hirayama, T., Yamamoto, H., Hiramatsu, R., Ichiyama, S. & Iinuma, Y. (2011). Antimicrobial ointments and methicillinresistant Staphylococcus aureus USA300. Emerg Infect Dis 17, 1917-1920.
733 734 735
Talan, D. A., Krishnadasan, A., Gorwitz, R. J., Fosheim, G. E., Limbago, B., Albrecht, V. & Moran, G. J. (2011). Comparison of Staphylococcus aureus from skin and soft-tissue infections in US emergency department patients, 2004 and 2008. Clin Infect Dis 53, 144-149.
736 737
Tattevin, P., Diep, B. A., Jula, M. & Perdreau-Remington, F. (2009). Methicillin-resistant Staphylococcus aureus USA300 clone in long-term care facility. Emerg Infect Dis 15, 953.
738 739 740 741
Tenover, F. C., McDougal, L. K., Goering, R. V., Killgore, G., Projan, S. J., Patel, J. B. & Dunman, P. M. (2006). Characterization of a strain of community-associated methicillinresistant Staphylococcus aureus widely disseminated in the United States. J Clin Microbiol 44, 108-118.
Downloaded from www.microbiologyresearch.org by IP: 107.172.249.178 On: Sun, 15 May 2016 15:08:00
742 743 744 745
Tenover, F. C., McAllister, S., Fosheim, G., McDougal, L. K., Carey, R. B., Limbago, B., Lonsway, D., Patel, J. B., Kuehnert, M. J. & Gorwitz, R. (2008). Characterization of Staphylococcus aureus isolates from nasal cultures collected from individuals in the United States in 2001 to 2004. J Clin Microbiol 46, 2837-2841.
746 747 748 749
Thurlow, L. R., Joshi, G. S., Clark, J. R., Spontak, J. S., Neely, C. J., Maile, R. & Richardson, A. R. (2013). Functional modularity of the arginine catabolic mobile element contributes to the success of USA300 methicillin-resistant Staphylococcus aureus. Cell host & microbe 13, 100107.
750 751 752 753
Uhlemann, A.-C., Dordel, J., Knox, J. R., Raven, K. E., Parkhill, J., Holden, M. T., Peacock, S. J. & Lowy, F. D. (2014). Molecular tracing of the emergence, diversification, and transmission of S. aureus sequence type 8 in a New York community. Proceedings of the National Academy of Sciences 111, 6738-6743.
754 755
Zerbino, D. R. & Birney, E. (2008). Velvet: algorithms for de novo short read assembly using de Bruijn graphs. Genome Res 18, 821-829.
756 757
Zhang, H., Gao, S., Lercher, M. J., Hu, S. & Chen, W.-H. (2012). EvolView, an online tool for visualizing, annotating and managing phylogenetic trees. Nucleic Acids Res 40, W569-W572.
758 759 760
DATA BIBLIOGRAPHY
761 762 763 764 765 766 767 768 769 770 771 772 773
1. Altman, D.R., Sebra, R., Hand, J., Attie, O., Deikus, G., Carpini, K.W.D., Patel, G., Rana, M., Arvelakis, A., Grewal, P. and Dutta, J. GenBank accession NZ_CP007176 (2014). 2. Benson, M.A., Ohneck, E.A., Ryan, C., Alonzo, F., Smith, H., Narechania, A., Kolokotronis, S.O., Satola, S.W., Uhlemann, A.C., Sebra, R. and Deikus, G. GenBank accession NZ_CP007499 (2014). 3. Diep, B.A., Gill, S.R., Chang, R.F., Phan, T.H., Chen, J.H., Davidson, M.G., Lin, F., Lin, J., Carleton, H.A., Mongodin, E.F. and Sensabaugh, G.F. GenBank accession NC_007793 (2006). 4. Gill, S.R., Fouts, D.E., Archer, G.L., Mongodin, E.F., DeBoy, R.T., Ravel, J., Paulsen, I.T., Kolonay, J.F., Brinkac, L., Beanan, M. and Dodson, R.J. GenBank accession NC_002951 (2005). 5. Uhlemann, A.C., Dordel, J., Knox, J.R., Raven, K.E., Parkhill, J., Holden, M.T., Peacock, S.J. and Lowy, F.D. ENA accession PRJEB2870 (2014).
Downloaded from www.microbiologyresearch.org by IP: 107.172.249.178 On: Sun, 15 May 2016 15:08:00
Research paper template
774 775
FIGURES AND TABLES
776 777 778
Table 1. Temporal variation in the distribution of selected mobile genetic elements among USA300 isolates. Based on isolates derived from healthcare centres selected for temporal analysis (HC1, HC6, HC7, HC10).
779 780
Table 2. Descriptive summary of SNPs identified on branches of early divergence nested clades across the phylogenetic tree of analysed USA300 isolates. The phylogeny is presented in Fig. 4
781 782 783 784
Figure 1. Geographic distribution of healthcare centres (HC) that collected isolates analysed in this study. The geographic location on the map represents state coordinates, where the corresponding HC was based. Plots are scaled to represent total number of isolates from corresponding site. HCs that provided isolates used in temporal analyses are marked with a star.
785 786 787 788 789 790 791 792 793 794 795 796 797
Figure 2. Midpoint-rooted phylogenetic tree of the 191 analysed S. aureus CC8 isolates. Four reference genomes were included (corresponding nodes marked with a star). Branches of USA300, USA500 and USA500-like clades have been highlighted. The tree has been annotated to show the association between phylogenetic distribution and healthcare centre (HC) of origin (excluding reference genomes), year of collection (excluding reference genomes), patient type (excluding reference genomes), infection type (excluding reference genomes) and carriage of mobile genetic elements (MGE). The latter shows the distribution of identified prophages, pathogenicity islands (where novel sequence was identified name has been assigned based on virulence gene content), transposons and other elements such as ICE6013 and dfrG insertion element, and plasmids (name assigned based on close similarity to previously identified plasmid). Further details on variable carriage of virulence genes within Sa2 and Sa3 prophages, as well as variable distribution of antimicrobial resistance genes within pUSA300HOUMR-like and pUSA03-like plasmids have been provided in Fig. S1. The tree was annotated using EvolView (Zhang et al., 2012)
798 799 800
Figure 3. Temporal variation in pairwise SNP distance distributions between chromosomallyinserted MGEs carried by USA300 isolates. Based on isolates derived from healthcare centres selected for temporal analysis (HC1, HC6, HC7, HC10).
801 802 803 804 805
Figure 4. Mid-point rooted phylogenetic tree of S. aureus USA300 isolates. Two most closely related non-USA300 isolates were included as an out-group (isolates PFESA2332 and PFESA1492, branches coloured in red). Also, two reference genomes were included and corresponding nodes are marked with a star. Branches were highlighted to show consecutive clades of the nested clade structure, and labelled from 1 to 6. SNPs that define each labelled clade are described in Table 2.
806
Downloaded from www.microbiologyresearch.org by IP: 107.172.249.178 On: Sun, 15 May 2016 15:08:00
Table_1
Click here to download Table Table_1.docx
*
HC1 HC6 HC7 HC10 Total p† p‡ p§
N 14 6 8 18 46
2004 SCCmec ACME 14 14 6 6 8 8 18 18 46 46
φSa5 2 1 3 2 8
*
N na 12 10 1 23
2009 SCCmec ACME na na 12 12 9 8 1 1 22 21 0.19 A
R228H
1915352
Intergenic
-
-
A -> G
-
3
386867
Intergenic
-
-
A -> T
-
4
906507
Intergenic
-
-
C -> A
-
1693739
Non-synonymous
SAUSA300_1543
oxygen-independent coproporphyrinogen III oxidase
C -> T
A145T
1908169
Non-synonymous
SAUSA300_1725
transaldolase
C -> T
V6I
2230768
Synonymous
SAUSA300_2068
hypothetical protein
T -> C
-
2690087
Non-synonymous
SAUSA300_2489
antibiotic transport-associated protein
G -> T
Q795K
7282
Non-synonymous
SAUSA300_0006
DNA gyrase GyrA
C -> T
S84L
1374405
Non-synonymous
SAUSA300_1251
DNA topoisomerase ParC/GrlA
C -> A
S80Y
837108
Non-synonymous
SAUSA300_0750
hypothetical protein
T -> G
D17E
1074815
Non-synonymous
SAUSA300_0980
putative membrane protein
A -> G
N71D
5 6
Downloaded from www.microbiologyresearch.org by IP: 107.172.249.178 On: Sun, 15 May 2016 15:08:00
Figure 1 50
Click here to download Figure Fig1.pdf
HC8 HC9
45
HC2
HC4 HC12 HC5
40 HC3 HC6
Temporal analysis HC
35 HC10 HC7
Isolate_count
30
10 HC1
HC11
20 30
25 -120
Downloaded from www.microbiologyresearch.org by IP: 107.172.249.178 On: Sun, 15 May 2016 15:08:00
-100
-80
0.01 Figure 2
Clades
Prophages
COL
USA300
USA500_2395
USA500
USA300_ISMMS1
USA500-like
USA300_FPR3757
Pathogenicity islands
Transposons Click here to download Figure Fig2_revised.pdf
ACME & SCCmec
& other elements
2004
HC1
2009
HC2
2010
0.01 m es HC3
HC4
Reference genom es
1 7
HC5 COL HC6 USA500_2395
HC7 USA300_ISMMS1 HC8 USA300_FPR3757
ST 8 3007 3008
MS/MR
HC9
MSSA
HC HC10
MRSA
HC1 HC11 HC2 HC12 HC3
YearHC4 2004 HC5 2009 HC6
HC
Year of isolat ion HC
Healt hcare cent re HC1 HC6 HC7 HC10
2010 HC7 HC8
Pat ient t ype HC9ient Inpat HC10 Out pat ient HC11
Infect ion t ype HC12 Wound
Respirat ory Year 2004 Invasive Ot2009 her 2010
MGE present Pat ient t ype Inpat ient Out pat ient
Infect ion t ype Wound Respirat ory Invasive Ot her
MGE present
0.01 substitutions/site
Plasmids
Ye ar Pa I In ent fe ty φS cIo pe a n φS 1 ty pe a φS 2 a3 φS a φS 5 a Sa 7 PI Sa 3 P Sa I5 PI Sa _se P K-‐ Sa I_ea seQ P r _t AC I_e st M ar SC E _se C_ Cm se SC ec L Cm IV SC ec a Cm IV SC e d C c Tn me IVg 5 c Tn 54 NT 5 Tn 52 6 df 086 rG IC _IE E p 601 US 3 pU A30 SA 0H pU 01 OU -‐l pUSA0 ike MR -‐li SA 2-‐l ke pW 03 ike -‐ l B i k pW G7 e B 5 SA G7 1-‐lik P0 57 e 48 -‐lik A-‐ e lik e
Reference genom es
0.1
Downloaded from www.microbiologyresearch.org by IP: 107.172.249.178 On: Sun, 15 May 2016 15:08:00
Figure 3
Click here to download Figure Fig3.pdf ɸSa3 2004 2009 2010
2004 2009 2010
ɸSa2USA300
1
2
3
4
5
0
1
2
3
4
Pairwise SNP distance
SaPI5
ACME
5
2004 2009 2010
Pairwise SNP distance
2004 2009 2010
0
1
2
3
4
0
2
4
6
8
10
Pairwise SNP distance
SCCmec IVa
ICE6013
12
2004 2009 2010
Pairwise SNP distance
2004 2009 2010
0
0
2
4
6
Pairwise SNP distance
8 10 0.0 0.5 1.0 Downloaded from www.microbiologyresearch.org by
IP: 107.172.249.178 On: Sun, 15 May 2016 15:08:00
1.5
2.0
Pairwise SNP distance
2.5
3.0
Figure 4
Click here to download Figure Fig4.pdf
0.01
0.01 Reference genom es
COL USA500_2395 Reference genom es
COL USA300_ISMMS1
1
USA500_2395 USA300_FPR3757
ST
USA300_ISMMS1
8USA300_FPR3757 0.003 substitutions/site
2
250
ST976 8
1150
250 2253 976 2319
3
3000 1150 3001 2253 3007
2319
4
3008
3000
3010
3001
3015
3007 Year3008 of isolat ion 2004
5
3010 2009 3015 2010
6
Healt cention re Year hcare of isolat HC1 2004 HC2
2009
HC3
2010
HC4 HC5
Healt hcare cent re
Downloaded from www.microbiologyresearch.org by IP: 107.172.249.178 On: Sun, 15 May 2016 15:08:00
Supplementary Material Files
Click here to download Supplementary Material Files Supplementary_Files_revised.pdf
Table S1. Description of 191 CC8 S. aureus isolates analysed in this study. Public name
ENA accession
Organism
ST
spa
SCCmec
Clone
Year
HC ID
State
Patient type
Infection type
PFESA1490
ERS410907
MSSA
8
t008
-
USA300
2009
HC8
Wisconsin
Outpatient
Other
PFESA1492
ERS410923
MSSA
8
t008
-
-
2009
HC8
Wisconsin
Inpatient
Other
PFESA1606
ERS410796
MSSA
8
t008
-
-
2009
HC4
New York
Inpatient
Invasive
PFESA1614
ERS410765
MSSA
8
t008
-
USA300
2009
HC4
New York
Outpatient
Invasive
PFESA1788
ERS410716
MSSA
8
t008
-
-
2009
HC5
Pennsylvania
Inpatient
Invasive
PFESA1885
ERS410629
MSSA
8
t008
-
USA300
2009
HC3
Missouri
Inpatient
Other
PFESA1975
ERS411176
MSSA
8
t008
-
USA300
2010
HC10
Georgia
Inpatient
Invasive
PFESA1976
ERS411184
MSSA
8
t008
-
USA300
2010
HC10
Georgia
Inpatient
Respiratory
PFESA2034
ERS432065
MSSA
8
t008
-
USA300
2010
HC9
Wisconsin
Outpatient
Other
PFESA2064
ERS365748
MSSA
8
t008
-
USA300
2010
HC7
Texas
Inpatient
Wound
PFESA2070
ERS365796
MSSA
8
t008
-
USA300
2009
HC7
Texas
Inpatient
Wound
PFESA2083
ERS365797
MSSA
8
t008
-
-
2010
HC1
Florida
Inpatient
Invasive
PFESA2084
ERS365805
MSSA
8
t008
-
USA300
2010
HC1
Florida
Inpatient
Other
PFESA2091
ERS365766
MSSA
8
t008
-
USA300
2010
HC1
Florida
Inpatient
Other
PFESA2161
ERS365815
MSSA
8
t008
-
USA300
2010
HC2
Michigan
Inpatient
Respiratory
PFESA2175
ERS410378
MSSA
8
t008
-
-
2010
HC2
Michigan
Outpatient
Respiratory
PFESA2178
ERS410402
MSSA
8
t008
-
USA300
2009
HC11
Florida
Inpatient
Other
PFESA2180
ERS410418
MSSA
8
t008
-
USA300
2009
HC11
Florida
Inpatient
Wound
PFESA2184
ERS410450
MSSA
8
t008
-
USA300
2009
HC11
Florida
Inpatient
Other
PFESA2196
ERS410451
MSSA
8
t008
-
-
2010
HC12
Pennsylvania
Inpatient
Wound
PFESA2274
ERS365784
MSSA
8
t008
-
USA300
2010
HC6
Tennessee
Inpatient
Invasive
PFESA2298
ERS365754
MSSA
8
t008
-
USA300
2010
HC2
Michigan
Inpatient
Wound
PFESA2303
ERS365794
MSSA
8
t008
-
-
2010
HC2
Michigan
Inpatient
Other
PFESA2332
ERS410342
MSSA
8
t008
-
-
2010
HC8
Wisconsin
Outpatient
Wound
PFESA2413
ERS365826
MSSA
8
t008
-
USA300
2010
HC7
Texas
Inpatient
Respiratory
Downloaded from www.microbiologyresearch.org by IP: 107.172.249.178 On: Sun, 15 May 2016 15:08:00
Table S1. Description of 191 CC8 S. aureus isolates analysed in this study. Public name
ENA accession
Organism
ST
spa
SCCmec
Clone
Year
HC ID
State
Patient type
Infection type
PFESA2417
ERS411049
MSSA
8
t008
-
USA300
2010
HC7
Texas
Inpatient
Respiratory
PFESA2432
ERS410215
MSSA
8
t008
-
-
2010
HC5
Pennsylvania
Inpatient
Invasive
PFESA2449
ERS432202
MSSA
8
t008
-
USA300
2010
HC3
Missouri
Inpatient
Wound
PFESA2453
ERS432139
MSSA
8
t008
-
USA300
2010
HC9
Wisconsin
Outpatient
Wound
PFESA2460
ERS432195
MSSA
8
t008
-
USA300
2010
HC9
Wisconsin
Inpatient
Other
PFESA1306
ERS411147
MRSA
8
t008
IVa
USA300
2004
HC7
Texas
Inpatient
Wound
PFESA1308
ERS411163
MRSA
8
t008
IVa
USA300
2004
HC7
Texas
Inpatient
Invasive
PFESA1309
ERS411171
MRSA
8
t008
IVa
USA300
2004
HC7
Texas
Inpatient
Wound
PFESA1311
ERS411179
MRSA
8
t008
IVa
USA300
2004
HC7
Texas
Inpatient
Wound
PFESA1312
ERS411187
MRSA
8
t008
IVa
USA300
2004
HC7
Texas
Inpatient
Wound
PFESA1317
ERS411132
MRSA
8
t008
IVa
USA300
2004
HC7
Texas
Inpatient
Invasive
PFESA1322
ERS411172
MRSA
8
t008
IVa
USA300
2004
HC7
Texas
Inpatient
Wound
PFESA1326
ERS411112
MRSA
8
t008
IVa
USA300
2004
HC1
Florida
Outpatient
Wound
PFESA1328
ERS411204
MRSA
8
t008
IVa
USA300
2004
HC1
Florida
Inpatient
Wound
PFESA1329
ERS410929
MRSA
8
t008
IVa
USA300
2004
HC1
Florida
Outpatient
Wound
PFESA1333
ERS411133
MRSA
8
t008
IVa
USA300
2004
HC1
Florida
Inpatient
Invasive
PFESA1335
ERS411213
MRSA
8
t008
IVa
USA300
2004
HC1
Florida
Inpatient
Respiratory
PFESA1339
ERS411205
MRSA
8
t008
IVa
USA300
2004
HC1
Florida
Inpatient
Wound
PFESA1341
ERS410977
MRSA
8
t008
IVa
USA300
2004
HC1
Florida
Outpatient
Other
PFESA1342
ERS410985
MRSA
8
t008
IVa
USA300
2004
HC1
Florida
Outpatient
Wound
PFESA1345
ERS411009
MRSA
8
t008
IVa
USA300
2004
HC1
Florida
Outpatient
Other
PFESA1347
ERS411017
MRSA
8
t008
IVa
USA300
2004
HC1
Florida
Outpatient
Wound
PFESA1350
ERS411173
MRSA
8
t008
IVa
USA300
2004
HC10
Georgia
Inpatient
Wound
PFESA1352
ERS410946
MRSA
8
t008
IVa
USA300
2004
HC10
Georgia
Outpatient
Wound
PFESA1353
ERS410954
MRSA
8
t008
IVa
USA300
2004
HC10
Georgia
Outpatient
Wound
Downloaded from www.microbiologyresearch.org by IP: 107.172.249.178 On: Sun, 15 May 2016 15:08:00
Table S1. Description of 191 CC8 S. aureus isolates analysed in this study. Public name
ENA accession
Organism
ST
spa
SCCmec
Clone
Year
HC ID
State
Patient type
Infection type
PFESA1354
ERS410962
MRSA
8
t008
IVa
USA300
2004
HC10
Georgia
Outpatient
Wound
PFESA1355
ERS410970
MRSA
8
t008
IVa
USA300
2004
HC10
Georgia
Outpatient
Other
PFESA1357
ERS410986
MRSA
8
t008
IVa
USA300
2004
HC10
Georgia
Outpatient
Other
PFESA1358
ERS410994
MRSA
8
t008
IVa
USA300
2004
HC10
Georgia
Outpatient
Other
PFESA1359
ERS411002
MRSA
8
t008
IVa
USA300
2004
HC10
Georgia
Outpatient
Other
PFESA1360
ERS411010
MRSA
8
t008
IVa
USA300
2004
HC10
Georgia
Outpatient
Other
PFESA1362
ERS411018
MRSA
8
t008
IVa
USA300
2004
HC10
Georgia
Outpatient
Other
PFESA1364
ERS410939
MRSA
8
t008
IVa
USA300
2004
HC10
Georgia
Outpatient
Other
PFESA1365
ERS411157
MRSA
8
t008
IVa
USA300
2004
HC10
Georgia
Inpatient
Other
PFESA1366
ERS411149
MRSA
8
t008
IVa
USA300
2004
HC10
Georgia
Inpatient
Other
PFESA1368
ERS411141
MRSA
8
t008
IVa
USA300
2004
HC10
Georgia
Inpatient
Wound
PFESA1369
ERS410955
MRSA
8
t008
IVa
USA300
2004
HC10
Georgia
Outpatient
Other
PFESA1371
ERS411126
MRSA
8
t008
IVa
USA300
2004
HC10
Georgia
Inpatient
Other
PFESA1391
ERS411150
MRSA
8
t008
IVa
USA300
2004
HC6
Tennessee
Inpatient
Other
PFESA1398
ERS411020
MRSA
8
t008
IVa
USA300
2004
HC6
Tennessee
Outpatient
Wound
PFESA1399
ERS410933
MRSA
8
t008
IVa
USA300
2004
HC6
Tennessee
Outpatient
Wound
PFESA1404
ERS410949
MRSA
8
t008
IVa
USA300
2004
HC6
Tennessee
Outpatient
Other
PFESA1410
ERS411167
MRSA
8
t008
IVa
USA300
2004
HC6
Tennessee
Inpatient
Other
PFESA1476
ERS410890
MRSA
8
t008
IVa
USA300
2010
HC8
Wisconsin
Inpatient
Wound
PFESA1477
ERS410898
MRSA
8
t008
IVa
USA300
2010
HC8
Wisconsin
Outpatient
Other
PFESA1485
ERS410867
MRSA
8
t008
IVa
USA300
2009
HC8
Wisconsin
Inpatient
Other
PFESA1489
ERS410899
MRSA
8
t008
IVa
USA300
2009
HC8
Wisconsin
Outpatient
Wound
PFESA1491
ERS410915
MRSA
8
t008
IVa
USA300
2009
HC8
Wisconsin
Outpatient
Wound
PFESA1497
ERS410868
MRSA
8
t008
IVa
USA300
2009
HC8
Wisconsin
Inpatient
Other
PFESA1498
ERS410876
MRSA
8
t008
IVa
USA300
2009
HC8
Wisconsin
Inpatient
Other
Downloaded from www.microbiologyresearch.org by IP: 107.172.249.178 On: Sun, 15 May 2016 15:08:00
Table S1. Description of 191 CC8 S. aureus isolates analysed in this study. Public name
ENA accession
Organism
ST
spa
SCCmec
Clone
Year
HC ID
State
Patient type
Infection type
PFESA1532
ERS410863
MRSA
8
t008
IVa
USA300
2009
HC12
Pennsylvania
Inpatient
Other
PFESA1537
ERS410903
MRSA
8
t008
IVa
USA300
2009
HC12
Pennsylvania
unknown
Wound
PFESA1538
ERS410911
MRSA
8
t008
IVa
USA300
2009
HC12
Pennsylvania
Inpatient
Respiratory
PFESA1541
ERS410840
MRSA
8
t008
IVa
USA300
2009
HC12
Pennsylvania
Inpatient
Other
PFESA1545
ERS410872
MRSA
8
t008
IVa
USA300
2009
HC12
Pennsylvania
Inpatient
Wound
PFESA1547
ERS410888
MRSA
8
t008
IVa
USA300
2009
HC12
Pennsylvania
Inpatient
Respiratory
PFESA1579
ERS410747
MRSA
8
t008
IVa
USA300
2009
HC6
Tennessee
Outpatient
Wound
PFESA1580
ERS410755
MRSA
8
t008
IVa
USA300
2009
HC6
Tennessee
Outpatient
Wound
PFESA1583
ERS410771
MRSA
8
t008
IVa
USA300
2009
HC6
Tennessee
Outpatient
Other
PFESA1584
ERS410779
MRSA
8
t008
IVa
USA300
2009
HC6
Tennessee
Outpatient
Wound
PFESA1585
ERS411199
MRSA
8
t008
IVa
USA300
2009
HC6
Tennessee
Inpatient
Wound
PFESA1586
ERS410787
MRSA
8
t008
IVa
USA300
2009
HC6
Tennessee
Outpatient
Other
PFESA1589
ERS410811
MRSA
8
t008
IVa
USA300
2009
HC6
Tennessee
Outpatient
Wound
PFESA1590
ERS411207
MRSA
8
t008
IVa
USA300
2009
HC6
Tennessee
Inpatient
Other
PFESA1591
ERS410819
MRSA
8
t008
IVa
USA300
2009
HC6
Tennessee
Outpatient
Other
PFESA1593
ERS410740
MRSA
8
t008
IVa
USA300
2009
HC6
Tennessee
Outpatient
Other
PFESA1594
ERS410748
MRSA
8
t008
IVa
USA300
2009
HC6
Tennessee
Outpatient
Other
PFESA1618
ERS410797
MRSA
8
t008
IVa
USA300
2009
HC4
New York
Outpatient
Wound
PFESA1785
ERS410692
MRSA
8
t008
IVa
USA300
2009
HC5
Pennsylvania
Inpatient
Respiratory
PFESA1787
ERS410708
MRSA
8
t008
IVa
USA300
2009
HC5
Pennsylvania
Inpatient
Respiratory
PFESA1791
ERS410645
MRSA
8
t008
IVa
USA300
2009
HC5
Pennsylvania
Inpatient
Respiratory
PFESA1880
ERS410589
MRSA
8
t008
IVa
USA300
2010
HC3
Missouri
Inpatient
Other
PFESA1882
ERS410605
MRSA
8
t008
IVa
USA300
2009
HC3
Missouri
Inpatient
Other
PFESA1883
ERS410613
MRSA
8
t008
IVa
USA300
2010
HC3
Missouri
Inpatient
Wound
PFESA1888
ERS410558
MRSA
8
t008
IVa
USA300
2010
HC3
Missouri
Inpatient
Other
Downloaded from www.microbiologyresearch.org by IP: 107.172.249.178 On: Sun, 15 May 2016 15:08:00
Table S1. Description of 191 CC8 S. aureus isolates analysed in this study. Public name
ENA accession
Organism
ST
spa
SCCmec
Clone
Year
HC ID
State
Patient type
Infection type
PFESA1889
ERS410566
MRSA
8
t008
IVa
USA300
2010
HC3
Missouri
Inpatient
Other
PFESA1977
ERS432053
MRSA
8
t008
IVa
USA300
2010
HC10
Georgia
Outpatient
Other
PFESA1978
ERS432061
MRSA
8
t008
IVa
USA300
2010
HC10
Georgia
Outpatient
Other
PFESA2055
ERS365779
MRSA
8
t008
IVa
USA300
2009
HC7
Texas
Inpatient
Invasive
PFESA2057
ERS365795
MRSA
8
t008
IVa
USA300
2009
HC7
Texas
Inpatient
Invasive
PFESA2059
ERS365811
MRSA
8
t008
IVa
USA300
2009
HC7
Texas
Inpatient
Other
PFESA2062
ERS365827
MRSA
8
t008
IVa
USA300
2010
HC7
Texas
Inpatient
Wound
PFESA2063
ERS365740
MRSA
8
t008
IVa
USA300
2009
HC7
Texas
Inpatient
Wound
PFESA2067
ERS365772
MSSA
8
t008
IVa
USA300
2009
HC7
Texas
Inpatient
Other
PFESA2074
ERS365828
MRSA
8
t008
IVa
USA300
2009
HC7
Texas
Inpatient
Wound
PFESA2075
ERS410365
MRSA
8
t008
IVa
USA300
2010
HC7
Texas
Outpatient
Invasive
PFESA2077
ERS365749
MRSA
8
t008
IVa
USA300
2010
HC1
Florida
Inpatient
Wound
PFESA2090
ERS365758
MRSA
8
t008
IVa
USA300
2010
HC1
Florida
Inpatient
Wound
PFESA2098
ERS365822
MRSA
8
t008
IVa
USA300
2010
HC1
Florida
Inpatient
Wound
PFESA2103
ERS365767
MRSA
8
t008
IVa
USA300
2010
HC1
Florida
Inpatient
Respiratory
PFESA2104
ERS365775
MRSA
8
t008
IVa
USA300
2010
HC1
Florida
Inpatient
Respiratory
PFESA2153
ERS410377
MRSA
8
t008
IVa
USA300
2010
HC2
Michigan
Outpatient
Other
PFESA2157
ERS365791
MRSA
8
t008
IVa
USA300
2010
HC2
Michigan
Inpatient
Wound
PFESA2158
ERS410401
MRSA
8
t008
IVa
USA300
2010
HC2
Michigan
Outpatient
Other
PFESA2163
ERS410409
MRSA
8
t008
IVa
USA300
2010
HC2
Michigan
Outpatient
Wound
PFESA2164
ERS365831
MRSA
8
t008
IVa
USA300
2010
HC2
Michigan
Inpatient
Wound
PFESA2166
ERS410417
MRSA
8
t008
IVa
USA300
2010
HC2
Michigan
Outpatient
Other
PFESA2169
ERS410441
MRSA
8
t008
IVa
USA300
2010
HC2
Michigan
Outpatient
Wound
PFESA2188
ERS410387
MRSA
8
t008
IVa
USA300
2009
HC11
Florida
Inpatient
Respiratory
PFESA2192
ERS410419
MRSA
8
t008
IVa
USA300
2009
HC12
Pennsylvania
Outpatient
Wound
Downloaded from www.microbiologyresearch.org by IP: 107.172.249.178 On: Sun, 15 May 2016 15:08:00
Table S1. Description of 191 CC8 S. aureus isolates analysed in this study. For spa and SCCmec columns NT refers to non-typeable. Public name
ENA accession
Organism
ST
spa
SCCmec
Clone
Year
HC ID
State
Patient type
Infection type
PFESA2193
ERS410427
MRSA
8
t008
IVa
USA300
2009
HC12
Pennsylvania
Inpatient
Wound
PFESA2272
ERS410276
MRSA
8
t008
IVa
USA300
2010
HC6
Tennessee
Outpatient
Wound
PFESA2273
ERS410284
MRSA
8
t008
IVa
USA300
2010
HC6
Tennessee
Outpatient
Other
PFESA2282
ERS365745
MRSA
8
t008
IVa
USA300
2010
HC6
Tennessee
Inpatient
Respiratory
PFESA2283
ERS410300
MRSA
8
t008
IVa
USA300
2010
HC6
Tennessee
Outpatient
Other
PFESA2285
ERS410308
MRSA
8
t008
IVa
USA300
2010
HC6
Tennessee
Outpatient
Other
PFESA2329
ERS410318
MRSA
8
t008
IVa
USA300
2010
HC8
Wisconsin
Outpatient
Other
PFESA2431
ERS410207
MRSA
8
t008
IVa
USA300
2010
HC5
Pennsylvania
Inpatient
Respiratory
PFESA2435
ERS410239
MRSA
8
t008
IVa
USA300
2010
HC5
Pennsylvania
Inpatient
Respiratory
PFESA2293
ERS365809
MRSA
8
t008
IVg
-
2010
HC2
Michigan
Inpatient
Invasive
PFESA2056
ERS365787
MRSA
8
t008
NT
USA300
2009
HC7
Texas
Inpatient
Respiratory
PFESA1884
ERS410621
MSSA
8
t024
-
-
2009
HC3
Missouri
Inpatient
Other
PFESA2176
ERS410386
MSSA
8
t024
-
USA300
2009
HC11
Florida
Inpatient
Wound
PFESA1304
ERS411131
MRSA
8
t024
IVa
USA300
2004
HC7
Texas
Inpatient
Invasive
PFESA1337
ERS410961
MRSA
8
t024
IVa
USA300
2004
HC1
Florida
Outpatient
Other
PFESA1346
ERS411189
MRSA
8
t024
IVa
USA300
2004
HC1
Florida
Inpatient
Wound
PFESA1388
ERS410996
MRSA
8
t024
IVa
USA300
2004
HC6
Tennessee
Outpatient
Invasive
PFESA1598
ERS410756
MRSA
8
t024
IVa
USA300
2009
HC6
Tennessee
Outpatient
Other
PFESA2156
ERS410393
MRSA
8
t024
IVa
USA300
2010
HC2
Michigan
Outpatient
Other
PFESA2469
ERS432172
MRSA
8
t024
IVa
USA300
2010
HC9
Wisconsin
Inpatient
Wound
PFESA1795
ERS410677
MSSA
8
t064
-
-
2009
HC5
Pennsylvania
Inpatient
Respiratory
PFESA2439
ERS432122
MSSA
8
t064
-
-
2010
HC5
Pennsylvania
Inpatient
Respiratory
PFESA1318
ERS411140
MRSA
8
t064
IVd
USA500
2004
HC7
Texas
Inpatient
Wound
PFESA1351
ERS410938
MRSA
8
t064
IVd
USA500
2004
HC10
Georgia
Outpatient
Wound
PFESA1361
ERS411165
MRSA
8
t064
IVd
USA500
2004
HC10
Georgia
Inpatient
Invasive
Downloaded from www.microbiologyresearch.org by IP: 107.172.249.178 On: Sun, 15 May 2016 15:08:00
Table S1. Description of 191 CC8 S. aureus isolates analysed in this study. Public name
ENA accession
Organism
ST
spa
SCCmec
Clone
Year
HC ID
State
Patient type
Infection type
PFESA1987
ERS365739
MRSA
8
t064
IVd
USA500
2010
HC10
Georgia
Inpatient
Invasive
PFESA1974
ERS411168
MRSA
8
t064
IVg
USA500-like
2009
HC10
Georgia
Inpatient
Respiratory
PFESA1980
ERS432077
MRSA
8
t064
IVg
USA500-like
2010
HC10
Georgia
Outpatient
Other
PFESA1984
ERS411200
MRSA
8
t064
IVg
USA500-like
2010
HC10
Georgia
Inpatient
Respiratory
PFESA1985
ERS411208
MRSA
8
t064
IVg
USA500-like
2010
HC10
Georgia
Inpatient
Invasive
PFESA2086
ERS365821
MRSA
8
t064
IVg
USA500-like
2010
HC1
Florida
Inpatient
Wound
PFESA2199
ERS410380
MRSA
8
t064
IVg
USA500-like
2010
HC12
Pennsylvania
Inpatient
Respiratory
PFESA1327
ERS411120
MRSA
8
t068
IVa
USA300
2004
HC1
Florida
Outpatient
Wound
PFESA1367
ERS410947
MRSA
8
t121
IVa
USA300
2004
HC10
Georgia
Outpatient
Other
PFESA2281
ERS234133
MRSA
8
t121
IVa
USA300
2010
HC6
Tennessee
Inpatient
Respiratory
PFESA2277
ERS365808
MSSA
8
t1348
-
-
2010
HC6
Tennessee
Inpatient
Wound
PFESA2087
ERS365829
MRSA
8
t1578
IVa
USA300
2010
HC1
Florida
Inpatient
Wound
PFESA1989
ERS365755
MSSA
8
t1705
-
-
2010
HC10
Georgia
Inpatient
Other
PFESA2094
ERS365790
MSSA
8
t1882
-
-
2010
HC1
Florida
Inpatient
Respiratory
PFESA2170
ERS410449
MRSA
8
t211
IVa
USA300
2010
HC2
Michigan
Outpatient
Other
PFESA2187
ERS410379
MSSA
8
t2179
-
USA300
2009
HC11
Florida
Inpatient
Wound
PFESA1390
ERS411142
MSSA
8
t2558
-
-
2004
HC6
Tennessee
Inpatient
Other
PFESA2331
ERS410334
MSSA
8
t3240
-
-
2010
HC8
Wisconsin
Outpatient
Wound
PFESA1478
ERS410906
MSSA
8
t334
-
-
2009
HC8
Wisconsin
Inpatient
Respiratory
PFESA2280
ERS365824
MSSA
8
t334
-
-
2010
HC6
Tennessee
Inpatient
Respiratory
PFESA1370
ERS410963
MRSA
8
t4166
IVa
USA300
2004
HC10
Georgia
Outpatient
Other
PFESA1890
ERS410574
MRSA
8
t622
IVa
USA300
2010
HC3
Missouri
Inpatient
Wound
PFESA1982
ERS432085
MRSA
8
t622
IVa
USA300
2010
HC10
Georgia
Outpatient
Other
PFESA1981
ERS411192
MRSA
8
t681
IVa
USA300
2010
HC10
Georgia
Inpatient
Respiratory
PFESA1990
ERS365763
MRSA
8
t681
IVa
USA300
2010
HC10
Georgia
Inpatient
Other
Downloaded from www.microbiologyresearch.org by IP: 107.172.249.178 On: Sun, 15 May 2016 15:08:00
Table S1. Description of 191 CC8 S. aureus isolates analysed in this study. For spa and SCCmec columns NT refers to non-typeable. Public name
ENA accession
Organism
ST
spa
SCCmec
Clone
Year
HC ID
State
Patient type
Infection type
PFESA2763
ERS409806
MRSA
8
t681
IVa
USA300
2010
HC2
Michigan
unknown
Other
PFESA2742
ERS409790
MSSA
8
t8286
-
-
2010
HC2
Michigan
unknown
Other
PFESA1344
ERS411001
MRSA
8
NT
IVa
USA300
2004
HC1
Florida
Outpatient
Wound
PFESA1988
ERS365747
MRSA
8
NT
IVa
USA300
2009
HC10
Georgia
Inpatient
Invasive
PFESA2330
ERS410326
MSSA
976
t008
-
-
2010
HC8
Wisconsin
Outpatient
Other
PFESA2279
ERS365816
MSSA
1150
NT
-
-
2010
HC6
Tennessee
Inpatient
Wound
PFESA1548
ERS410896
MRSA
2253
t008
IVg
-
2010
HC12
Pennsylvania
Inpatient
Invasive
PFESA2189
ERS410395
MRSA
2253
t008
IVg
-
2010
HC12
Pennsylvania
Inpatient
Wound
PFESA2203
ERS410412
MRSA
2253
t008
IVg
-
2010
HC12
Pennsylvania
Inpatient
Wound
PFESA2073
ERS365820
MRSA
2319
t008
IVa
USA300
2009
HC7
Texas
Inpatient
Wound
PFESA1617
ERS410789
MRSA
3000
t008
IVa
USA300
2010
HC4
New York
Inpatient
Wound
PFESA2473
ERS432204
MSSA
3001
t121
-
-
2010
HC9
Wisconsin
Inpatient
Wound
PFESA2278
ERS410292
MRSA
3007
t008
IVa
USA300
2010
HC6
Tennessee
Outpatient
Wound
PFESA1983
ERS432093
MRSA
3008
t008
IVa
USA300
2010
HC10
Georgia
Outpatient
Other
PFESA2054
ERS409812
MRSA
3010
t008
IVa
USA300
2009
HC7
Texas
Outpatient
Wound
PFESA1542
ERS410848
MRSA
3015
t008
IVa
USA300
2009
HC12
Pennsylvania
Inpatient
Other
Downloaded from www.microbiologyresearch.org by IP: 107.172.249.178 On: Sun, 15 May 2016 15:08:00
Figure S1. Midpoint-rooted phylogenetic tree of analysed S. aureus CC8 isolates with distribution of MGE-associated virulence and resistance genes. Four reference genomes were included (corresponding nodes marked with a star). Branches of USA300, USA500 and USA500-like clades have been highlighted. The tree has been annotated to show the variable distribution of virulence genes associated with prophages φSa2 (pvl) and φSa3 (sak, scn, chp, seK, seQ, seA) as well as the distribution
of
antimicrobial
resistance
genes
associated
with
plasmids
pUSA300HOUMR-like (blaZ, smrA, mphBM, aphA-3, bcrA, sat) and pUSA03-like
0.01
Reference genom es COL USA500_2395 USA300_ISMMS1 USA300_FPR3757
MGE present
VIR/AMR gene present 0.01
Clades USA300 USA500 USA500-like
0.01
MS/MR Reference genom es MSSA COL MRSA USA500_2395
ST USA300_ISMMS1 8USA300_FPR3757 976
MGE 1150
present 2253
VIR/AMR 2319 gene present 3000
3001 3007
0.01 3008substitutions/site 3010 3015
Year of isolat ion 2004 2009 2010
Healt hcare cent re HC1 HC2 HC3 HC4 HC5 HC6 HC7 HC8 HC9 HC10 HC11 HC12
Downloaded from www.microbiologyresearch.org by IP: 107.172.249.178 On: Sun, 15 May 2016 15:08:00
pU S ile A03 S2 ,li aa $ ke ! dD $
pU S bl A30 aZ 0 m $ HO UM sr m A$ R, ph lik e$ ap BM h $ bc A23 $ rA sa $ t$
φS a sa 3 k$ sc n ch p$ se K$ se Q se $ A$
φS a p v 2! l$
(ileS, aadD).
Figure S2. Comparative alignment of recombinant variants of SCCmec and ACME elements identified in USA300 isolates. (a) Alignment of SCCmec-ACME hybrid identified in isolate PFESA2056 to SCCmecIVa-ACME region of S. aureus USA300_FPR3757 strain. (b) Alignment of SCCmec hybrid identified in isolate PFESA2090 to sequence of SCCmec IVa from USA300_FPR3757 and SCCmec V (5&5C2) from S. aureus JCSC6944 strains. (c) Alignment of region containing remnant SCC and ACME element identified in isolate PFESA2064 to SCCmecIVaACME region of USA300_FPR3757 strain and SCCpbp4 from S. epidermidis ATCC 12228. Alignment figures were constructed using Easyfig software v 2.1
ACME
SCCmec IVa
(a) S. aureus USA300_FPR3757,, PFESA2056,
SCCmec IVa
(b) S. aureus USA300_FPR3757,, PFESA2090, S. aureus JCSC6944,
SCCmec V (5&5C2)
ACME
SCCmec IVa
(c) S.,aureus,USA300_FPR3757,, PFESA2064, S.,epidermidis,ATCC,12228,
SCCpbp4
Downloaded from www.microbiologyresearch.org by IP: 107.172.249.178 On: Sun, 15 May 2016 15:08:00
Figure S3. Midpoint-rooted phylogenetic tree of analysed S. aureus CC8 isolates showing distribution of the ICE6013 mobile genetic element. Four reference genomes were included (corresponding nodes marked with a star). Branches of USA300, USA500 and USA500-like clades have been highlighted. Tree has been annotated to show correlation between phylogenetic distribution and, starting from the innermost annotation circle, ICE6013 carriage, ICE6013 insertion type (intergenic or intragenic) and ICE6013 insertion site. For the latter, name of the coding sequence from USA300_FPR3757 reference genome was used, for intergenic location sites names of the flanking coding sequences are provided. 0.01 0.01
0.1
Reference genom Reference genom eses
Clades
COL COL
USA300
USA500_2395 USA500_2395
USA500
USA300_ISMMS1 USA300_ISMMS1
USA500-like
USA300_FPR3757 USA300_FPR3757
Year of isolat ion 2004
ICE6013 carriage ICE6013 carriage
2009
Present Present
2010
ICE6013 insert t ype ICE6013 insert ionion t ype Int ergenic Int ergenic
ST 8
Int ragenic Int ragenic
3007
ICE6013 inset ICE6013 inset ionion sit sit e e
3008
SAUSA300_0606 SAUSA300_0606
SAUSA300_2502/SAUSA300_2503 SAUSA300_2502/SAUSA300_2503 MS/MR MSSA SAUSA300_2368 SAUSA300_2368 MRSA SAUSA300_2440/SAUSA300_244 SAUSA300_2440/SAUSA300_244
SAUSA300_2431 SAUSA300_2431 SAUSA300_0110 SAUSA300_0110
Healt hcare cent re
SAUSA300_2145 SAUSA300_2145
HC1 HC6
SAUSA300_1213/SAUSA300_1214 SAUSA300_1213/SAUSA300_1214 HC7 SAUSA300_0771/SAUSA300_0772 SAUSA300_0771/SAUSA300_0772 HC10 SAUSA300_2145/SAUSA300_2146 SAUSA300_2145/SAUSA300_2146 SAUSA300_1207 SAUSA300_1207 SAUSA300_2438/SAUSA300_2439 SAUSA300_2438/SAUSA300_2439 SAUSA300_1476/SAUSA300_1490 SAUSA300_1476/SAUSA300_1490
0.01 substitutions/site
Downloaded from www.microbiologyresearch.org by IP: 107.172.249.178 On: Sun, 15 May 2016 15:08:00
Figure S4. Mid-point rooted phylogenetic tree of 483 S. aureus USA300 isolates. The phylogeny consists of 154 USA300 isolates analysed in this study together with 329 previously published USA300 genomes derived from New York City (Uhlemann, Dordel et al. 2014). Two most closely related non-USA300 isolates were included as an out-group (isolates PFESA2332 and PFESA1492, branches coloured in red). Branches were coloured to highlight the phylogeny’s nested clade structure. Clades were labelled from 1 to 6, to illustrate a parallel population structure between USA300 isolates analysed in this study and this expanded collection. SNPs that define each labelled clade correspond to previously described polymorphisms (Table 1), except for clade 5 where mutation at site corresponding to 1374405 position in USA300_FPR3757, was detected as C ! T transition (5a) followed by T ! A transversion (5b), in contrast to previously described single step C ! A transversion. Two reference genomes were included and corresponding nodes are marked with a star. 0.01
0.01 Reference genom es
COL USA500_2395 Reference genom es
COL USA300_ISMMS1
1
USA500_2395 USA300_FPR3757
ST
USA300_ISMMS1 8USA300_FPR3757
2
0.002 250 substitutions/site
ST976 8
1150
250 2253
3
976 2319 3000 1150 3001 2253 3007
2319
3008
3000
3010
3001
3015
3007 Year3008 of isolat ion 2004
3010 2009
4 5a
3015 2010
Healt cention re Year hcare of isolat HC1 2004
5b
HC2
2009
HC3
2010
HC4 HC5
Healt hcare cent re HC6 HC1 HC7
HC2
6
HC8
HC3
HC9
HC4
HC10
HC5 HC11 HC6 HC12 HC7 MGE HC8 present HC9 HC10 HC11 HC12
MGE
0.002
present
Downloaded from www.microbiologyresearch.org by IP: 107.172.249.178 On: Sun, 15 May 2016 15:08:00