1
Title:
2
Draft genome sequence of Bacillus anthracis strain Sterne isolate 09RA8929
3 4
Authors:
5
Anne Busch1*, Mandy Carolina Elschner1, Daniela Jacob2, Roland Grunow2, Herbert
6
Tomaso1
7
1
8
Germany,
9
2
Friedrich-Loeffler-Institut, Institute of Bacterial Infections and Zoonoses, Jena,
Robert Koch Institute, Centre for Biological Threats and Special Pathogens, Highly
10
Pathogenic Microorganisms (ZBS 2), Berlin, Germany
11
Corresponding
12
[email protected]
13
Abstract
14
An attenuated Bacillus (B.) anthracis vaccine strain (Sterne) used as an attenuated
15
laboratory comparative strain was sequenced and analyzed. A comparison to an
16
assembly of B. anthracis strains Sterne isolate (NZ_CP009541 and NZ_CP009540)
17
was performed. The lack of the pX02 plasmid and pX01 in approximately five copies
18
were confirmed.
author
and
the
corresponding
19
1
author's
email
address
20
Text
21
Bacillus (B.) anthracis, a Gram-positive, spore-forming bacillus, is the etiological agent
22
of anthrax (1, 2). Anthrax is a zoonotic disease, which can infect humans directly or
23
through products. To prevent the livestock of anthrax, vaccines comprising live
24
B. anthracis strains (pX01+; pX02−) for animals are available (3). We sequenced and
25
analyzed the B. anthracis strain Sterne with the strain collection number 09RA8929 at
26
the Friedrich-Loeffler-Institut (FLI), to use it as a laboratory comparative reference
27
strain.
28
The strain was provided from a repository located at the Robert Koch Institute within
29
the framework of the European Union (EU) funded project ‘‘EQADeBa’’ and
30
“EMERGE” (Grant Agreement n° 677066) after two passages on Columbia sheep
31
blood agar plates (Oxoid, Wesel, Germany). At the FLI the isolate was cultivated for
32
diagnostic purposes and to include the isolate in the repository. The strain was
33
cultivated twice on Columbia sheep blood agar plates (Oxoid, Wesel, Germany). The
34
culture for DNA extraction was initiated from a single colony. DNA was extracted from
35
bacterial cells that were grown in cell culture flasks for 24 h. DNA purification was
36
performed using QIAGEN Genomic-tip 100/G and QIAGEN Genomic DNA buffer set
37
Kit (Qiagen, Hilden, Germany) following the manufacturer’s instructions. Genome
38
sequencing was carried out by use of Illumina DNA sequencing at GATC Biotech
39
(Germany) with library protocol adapted and validated using a HiSeq sequencer 2500.
40
A paired end library was constructed with target insert size of 300 bas pairs resulting
41
in 6 million reads. A total of 12 million sequences in a length of 151 bases were
42
generated, with an average phred score > 38. The sequences were assembled using
43
SPAdes 3.9.1. in Bayes Hammer mode to minimizes mismatches and short indels (--
44
careful, (4)) and filtered for contamination with Kraken (5). Subsequently, annotation 2
45
was performed with Prokka (in standard settings) (6). The assembly resulted in 33
46
contigs with an N50 value of 1162008 and a total sequence length containing
47
5366275 bases. Annotation predicted 12 RNAs, 5639 CDS (Coding DNA Sequences),
48
79 tRNAs and 1 tmRNA. A phylogenetic analysis on basis of the 16S rRNA sequences
49
with Mole-Blast and PhyloPhlAn (7-9) was performed resulting in the consistent results.
50
With phylogeny with PhyloPhlAn a differentiation within the Bacillus species was
51
possible, although the coding sequences of the plasmids and highly variable
52
sequences are excluded with this method (9). This bioinformatics approach is
53
complementing the methods of SNP typing and MLVA.
54
After quality checking, trimming and merging with bbduk and removal of duplicates the
55
reads were mapped to NZ_CP009540, NZ_CP009541 and NZ_CP010794 with the
56
bowtie2 module included in geneious (http://www.geneious.com, (7)) and variant
57
calling files were generated. Variants are reported with a minimum variant frequency
58
of 0.9, with a minimum p value of 10e-60, minimum strand bias p value of 10e-5, and
59
supported by at least five reads. A 37-fold mean coverage of the chromosome was
60
reported with 5 million bases showing 99.99 % identity with the reference sequence
61
(5227150 of 5227496). For the chromosome, 88 variants were called. 36 variants were
62
predicted to affect known proteins. The plasmid pX01 yielded a 161-fold mean
63
coverage of 181803 bases with 99.8 % identity to the reference sequence (181233 of
64
181624). For the plasmid, 13 variants were called; seven of these were affecting known
65
proteins (Table 1). With this described method and this data in future Bacillus anthracis
66
strains can be monitored. The average coverage ratio chromosome/plasmid suggests
67
that the plasmid pX01 is represented with approximately unusual five copies per cell.
68
It is reported that sequencing coverage provides exact copy-number estimates for the
69
plasmid copy number (Becker et al., 2016). No mapping occurred on pX02 reference
70
sequence NZ_CP010794 and the existence of pX02 could be excluded. 3
71
Accession
72
of B. anthracis strain Sterne isolate 09RA8929 has been deposited in NCBI GenBank
73
under the Biosample number SAMN09635715 and the Biosample Number
74
PRJNA422985. Raw data are submitted in the Sequence Read Archive (SRA) SRA
75
accession: SRP159486.
76
Acknowledgements: For this work Anne Busch was supported by a grant of the
77
German Federal Ministry of Education and Research and the work was executed within
78
the framework of the project Ess-B.A.R. (FKZ 13N13983). We would especially like to
79
thank CHAFEA for funding and realizing EMERGE, but also all partners for their
80
cooperation and for the external support to the Joint Action by ECDC and DG SANTE.
81
References:
82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102
1. 2. 3.
4.
5. 6. 7.
number:
The
genome
sequence
Bell JH. 1880. On Anthrax and Anthracaemia in Wool-Sorters, Heifers, and Sheep. Br Med J 2:656-7. Moayeri M, Leppla SH, Vrentas C, Pomerantsev AP, Liu S. 2015. Anthrax Pathogenesis. Annu Rev Microbiol 69:185-208. Adone R, Sali M, Francia M, Iatarola M, Donatiello A, Fasanella A. 2016. Development of a Sterne-Based Complement Fixation Test to Monitor the Humoral Response Induced by Anthrax Vaccines. Front Microbiol 7:19. Bankevich A, Nurk S, Antipov D, Gurevich AA, Dvorkin M, Kulikov AS, Lesin VM, Nikolenko SI, Pham S, Prjibelski AD, Pyshkin AV, Sirotkin AV, Vyahhi N, Tesler G, Alekseyev MA, Pevzner PA. 2012. SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing. J Comput Biol 19:455-77. Wood DE, Salzberg SL. 2014. Kraken: ultrafast metagenomic sequence classification using exact alignments. Genome Biol 15:R46. Seemann T. 2014. Prokka: rapid prokaryotic genome annotation. Bioinformatics 30:2068-9. Kearse M, Moir R, Wilson A, Stones-Havas S, Cheung M, Sturrock S, Buxton S, Cooper A, Markowitz S, Duran C, Thierer T, Ashton B, Meintjes P, Drummond A. 2012. Geneious Basic: an integrated and extendable desktop software platform for the organization and analysis of sequence data. Bioinformatics 28:1647-9.
103
4