Draft genome sequence of Bacillus anthracis strain Sterne isolate

0 downloads 0 Views 168KB Size Report
Draft genome sequence of Bacillus anthracis strain Sterne isolate 09RA8929. 2. 3. Authors: 4. Anne Busch1*, Mandy Carolina Elschner1, Daniela Jacob2, ...
1

Title:

2

Draft genome sequence of Bacillus anthracis strain Sterne isolate 09RA8929

3 4

Authors:

5

Anne Busch1*, Mandy Carolina Elschner1, Daniela Jacob2, Roland Grunow2, Herbert

6

Tomaso1

7

1

8

Germany,

9

2

Friedrich-Loeffler-Institut, Institute of Bacterial Infections and Zoonoses, Jena,

Robert Koch Institute, Centre for Biological Threats and Special Pathogens, Highly

10

Pathogenic Microorganisms (ZBS 2), Berlin, Germany

11

Corresponding

12

[email protected]

13

Abstract

14

An attenuated Bacillus (B.) anthracis vaccine strain (Sterne) used as an attenuated

15

laboratory comparative strain was sequenced and analyzed. A comparison to an

16

assembly of B. anthracis strains Sterne isolate (NZ_CP009541 and NZ_CP009540)

17

was performed. The lack of the pX02 plasmid and pX01 in approximately five copies

18

were confirmed.

author

and

the

corresponding

19

1

author's

email

address

20

Text

21

Bacillus (B.) anthracis, a Gram-positive, spore-forming bacillus, is the etiological agent

22

of anthrax (1, 2). Anthrax is a zoonotic disease, which can infect humans directly or

23

through products. To prevent the livestock of anthrax, vaccines comprising live

24

B. anthracis strains (pX01+; pX02−) for animals are available (3). We sequenced and

25

analyzed the B. anthracis strain Sterne with the strain collection number 09RA8929 at

26

the Friedrich-Loeffler-Institut (FLI), to use it as a laboratory comparative reference

27

strain.

28

The strain was provided from a repository located at the Robert Koch Institute within

29

the framework of the European Union (EU) funded project ‘‘EQADeBa’’ and

30

“EMERGE” (Grant Agreement n° 677066) after two passages on Columbia sheep

31

blood agar plates (Oxoid, Wesel, Germany). At the FLI the isolate was cultivated for

32

diagnostic purposes and to include the isolate in the repository. The strain was

33

cultivated twice on Columbia sheep blood agar plates (Oxoid, Wesel, Germany). The

34

culture for DNA extraction was initiated from a single colony. DNA was extracted from

35

bacterial cells that were grown in cell culture flasks for 24 h. DNA purification was

36

performed using QIAGEN Genomic-tip 100/G and QIAGEN Genomic DNA buffer set

37

Kit (Qiagen, Hilden, Germany) following the manufacturer’s instructions. Genome

38

sequencing was carried out by use of Illumina DNA sequencing at GATC Biotech

39

(Germany) with library protocol adapted and validated using a HiSeq sequencer 2500.

40

A paired end library was constructed with target insert size of 300 bas pairs resulting

41

in 6 million reads. A total of 12 million sequences in a length of 151 bases were

42

generated, with an average phred score > 38. The sequences were assembled using

43

SPAdes 3.9.1. in Bayes Hammer mode to minimizes mismatches and short indels (--

44

careful, (4)) and filtered for contamination with Kraken (5). Subsequently, annotation 2

45

was performed with Prokka (in standard settings) (6). The assembly resulted in 33

46

contigs with an N50 value of 1162008 and a total sequence length containing

47

5366275 bases. Annotation predicted 12 RNAs, 5639 CDS (Coding DNA Sequences),

48

79 tRNAs and 1 tmRNA. A phylogenetic analysis on basis of the 16S rRNA sequences

49

with Mole-Blast and PhyloPhlAn (7-9) was performed resulting in the consistent results.

50

With phylogeny with PhyloPhlAn a differentiation within the Bacillus species was

51

possible, although the coding sequences of the plasmids and highly variable

52

sequences are excluded with this method (9). This bioinformatics approach is

53

complementing the methods of SNP typing and MLVA.

54

After quality checking, trimming and merging with bbduk and removal of duplicates the

55

reads were mapped to NZ_CP009540, NZ_CP009541 and NZ_CP010794 with the

56

bowtie2 module included in geneious (http://www.geneious.com, (7)) and variant

57

calling files were generated. Variants are reported with a minimum variant frequency

58

of 0.9, with a minimum p value of 10e-60, minimum strand bias p value of 10e-5, and

59

supported by at least five reads. A 37-fold mean coverage of the chromosome was

60

reported with 5 million bases showing 99.99 % identity with the reference sequence

61

(5227150 of 5227496). For the chromosome, 88 variants were called. 36 variants were

62

predicted to affect known proteins. The plasmid pX01 yielded a 161-fold mean

63

coverage of 181803 bases with 99.8 % identity to the reference sequence (181233 of

64

181624). For the plasmid, 13 variants were called; seven of these were affecting known

65

proteins (Table 1). With this described method and this data in future Bacillus anthracis

66

strains can be monitored. The average coverage ratio chromosome/plasmid suggests

67

that the plasmid pX01 is represented with approximately unusual five copies per cell.

68

It is reported that sequencing coverage provides exact copy-number estimates for the

69

plasmid copy number (Becker et al., 2016). No mapping occurred on pX02 reference

70

sequence NZ_CP010794 and the existence of pX02 could be excluded. 3

71

Accession

72

of B. anthracis strain Sterne isolate 09RA8929 has been deposited in NCBI GenBank

73

under the Biosample number SAMN09635715 and the Biosample Number

74

PRJNA422985. Raw data are submitted in the Sequence Read Archive (SRA) SRA

75

accession: SRP159486.

76

Acknowledgements: For this work Anne Busch was supported by a grant of the

77

German Federal Ministry of Education and Research and the work was executed within

78

the framework of the project Ess-B.A.R. (FKZ 13N13983). We would especially like to

79

thank CHAFEA for funding and realizing EMERGE, but also all partners for their

80

cooperation and for the external support to the Joint Action by ECDC and DG SANTE.

81

References:

82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102

1. 2. 3.

4.

5. 6. 7.

number:

The

genome

sequence

Bell JH. 1880. On Anthrax and Anthracaemia in Wool-Sorters, Heifers, and Sheep. Br Med J 2:656-7. Moayeri M, Leppla SH, Vrentas C, Pomerantsev AP, Liu S. 2015. Anthrax Pathogenesis. Annu Rev Microbiol 69:185-208. Adone R, Sali M, Francia M, Iatarola M, Donatiello A, Fasanella A. 2016. Development of a Sterne-Based Complement Fixation Test to Monitor the Humoral Response Induced by Anthrax Vaccines. Front Microbiol 7:19. Bankevich A, Nurk S, Antipov D, Gurevich AA, Dvorkin M, Kulikov AS, Lesin VM, Nikolenko SI, Pham S, Prjibelski AD, Pyshkin AV, Sirotkin AV, Vyahhi N, Tesler G, Alekseyev MA, Pevzner PA. 2012. SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing. J Comput Biol 19:455-77. Wood DE, Salzberg SL. 2014. Kraken: ultrafast metagenomic sequence classification using exact alignments. Genome Biol 15:R46. Seemann T. 2014. Prokka: rapid prokaryotic genome annotation. Bioinformatics 30:2068-9. Kearse M, Moir R, Wilson A, Stones-Havas S, Cheung M, Sturrock S, Buxton S, Cooper A, Markowitz S, Duran C, Thierer T, Ashton B, Meintjes P, Drummond A. 2012. Geneious Basic: an integrated and extendable desktop software platform for the organization and analysis of sequence data. Bioinformatics 28:1647-9.

103

4