Phylogeography of the second plague pandemic revealed through analysis of historical Yersinia pestis genomes

Spyrou, Maria A.; Keller, Marcel; Tukhbatova, Rezeda I.; Scheib, Christiana L.; Nelson, Elizabeth A.; Andrades Valtueña, Aida; Neumann, Gunnar U.; Walker, Don; Alterauge, Amelie; Carty, Niamh; Cessford, Craig; Fetz, Hermann; Gourvennec, Michaël; Hartle, Robert; Henderson, Michael; von Heyking, Kristin; Inskip, Sarah A.; Kacki, Sacha; Key, Felix M.; Knox, Elizabeth L.; Later, Christian; Maheshwari-Aplin, Prishita; Peters, Joris; Robb, John E.; Schreiber, Jürgen; Kivisild, Toomas; Castex, Dominique; Lösch, Sandra; Harbeck, Michaela; Herbig, Alexander; Bos, Kirsten I.; Krause, Johannes

doi:10.1038/s41467-019-12154-0

Download PDF

Article
Open access
Published: 02 October 2019

Phylogeography of the second plague pandemic revealed through analysis of historical Yersinia pestis genomes

Nature Communications volume 10, Article number: 4470 (2019) Cite this article

31k Accesses
96 Citations
572 Altmetric
Metrics details

Subjects

Abstract

The second plague pandemic, caused by Yersinia pestis, devastated Europe and the nearby regions between the 14^th and 18^th centuries AD. Here we analyse human remains from ten European archaeological sites spanning this period and reconstruct 34 ancient Y. pestis genomes. Our data support an initial entry of the bacterium through eastern Europe, the absence of genetic diversity during the Black Death, and low within-outbreak diversity thereafter. Analysis of post-Black Death genomes shows the diversification of a Y. pestis lineage into multiple genetically distinct clades that may have given rise to more than one disease reservoir in, or close to, Europe. In addition, we show the loss of a genomic region that includes virulence-related genes in strains associated with late stages of the pandemic. The deletion was also identified in genomes connected with the first plague pandemic (541–750 AD), suggesting a comparable evolutionary trajectory of Y. pestis during both events.

The source of the Black Death in fourteenth-century central Eurasia

Article Open access 15 June 2022

Maria A. Spyrou, Lyazzat Musralina, … Johannes Krause

Yersinia pestis strains from Latvia show depletion of the pla virulence gene at the end of the second plague pandemic

Article Open access 03 September 2020

Julian Susat, Joanna H. Bonczarowska, … Ben Krause-Kyora

Plagued by a cryptic clock: insight and issues from the global phylogeny of Yersinia pestis

Article Open access 19 January 2023

Katherine Eaton, Leo Featherstone, … Hendrik N. Poinar

Introduction

One of the most devastating pandemics of human history was the second plague pandemic, which began with the infamous Black Death (BD, 1346–1353 AD) and continued with recurrent outbreaks in Europe, the Near East and North Africa until the 18th century AD^1,2. Its causative agent, Yersinia pestis³, is a highly virulent bacterium that causes bubonic, pneumonic, and septicaemic plague and today is maintained among wild rodent populations in eastern Europe, Asia, Africa and the Americas^4,5,6.

The first historically documented outbreaks of the second pandemic seem to have occurred in 1346 in the Lower Volga and Black Sea regions^1,7. Subsequently, the bacterium dispersed through the rest of Europe over the next seven years, causing reductions in the human population estimated to be as high as 60%¹. Recent studies on ancient Y. pestis DNA from medieval plague victims have contributed insights into these initial stages of the pandemic. Specifically, mid-14th-century Y. pestis genomes reconstructed from Saint-Laurent-de-la-Cabrerisse (southern France)⁸, Barcelona (Spain)⁹, London (England)¹⁰ and Oslo (Norway)⁸ were shown to be identical, suggesting the rapid dispersal of a single strain across Europe during the BD. Recently, the analysis of an additional low-coverage genome from Siena, Italy (BSS31)⁸, revealed the purported existence of Y. pestis strain diversity during the BD, a possibility that should be further explored.

After the BD, plague was a common scourge in Europe as evidenced by the thousands of recorded epidemics it supposedly caused between 1353 and the late 18th century^2,11. Whether these were caused by multiple introductions of the disease from an Asian source or by its local persistence in Europe is currently a topic of debate^9,12,13,14. While data from climatic proxies are considered as supportive of the former hypothesis¹², genetic evidence is interpreted in both directions^8,9,13. To date, ancient Y. pestis genomes from epidemics closely succeeding the BD in Europe have been sequenced from late-14th-century individuals in Bergen op Zoom (Netherlands), London (England) and the Middle Volga region of Russia. They cluster on a phylogenetic lineage that is a precursor to strains associated with the 19th-century third plague pandemic^9,15,16, and thus provide a link between medieval and modern plague. Moreover, Y. pestis genomes recovered from Ellwangen, Germany (1485–1627 calAD), and the Great Plague of Marseille in France (L’Observance, 1720–1722 AD) cluster on an independent lineage, here termed the “post-BD” lineage, that is to date unidentified among modern Y. pestis diversity. Both lineages descended from the strain associated with the BD and, hence, likely represent plague’s legacy in or around Europe after 1353.

At present, the source of the second pandemic and the route that the bacterium followed during its course of entry into Europe remain hypothetical since genomic data from early outbreaks in western Russia have thus far been elusive. In addition, the limited number of published ancient Y. pestis genomes^9,10,14 challenges our ability to construct hypotheses regarding the number of lineages responsible for the numerous post-BD outbreaks in Europe^2,11 and whether they derived from a single or multiple disease reservoirs. Here, we take steps to overcome these limitations by expanding the number of available Y. pestis genomes from multiple time periods and locations in order to gain additional knowledge on the early stages of the second pandemic, and to study the genetic diversity of the bacterium present in Europe after the BD. Additionally, we present a reanalysis of recently published data from the same time period⁸. Our results support the entrance of Y. pestis into Europe through the east during the initial wave of the pandemic and consistently demonstrate an absence of genetic diversity in the bacterium during the BD. Moreover, our genomic analysis of post-BD outbreaks from central and western Europe suggests the local diversification of an extinct Y. pestis lineage between the late-14th and 18th centuries that may have resided in more than one disease reservoir.

Results

Sample screening for signatures of Y. pestis DNA

Two approaches were used for the assessment of Y. pestis DNA in tooth specimens (n = 206) from ten archaeological sites spanning the 14th–17th centuries AD in Europe (Fig. 1, Supplementary Figs. 1–10 and Supplementary Note 1). First, a qPCR screening approach was employed for detection of the Y. pestis-specific gene, pla, located on the pPCP1 plasmid¹⁷ in 180 specimens from the cities of London (n = 40) in England, Toulouse (n = 42) in France, Brandenburg an der Havel¹³ (n = 3), Landsberg am Lech (n = 10), Manching-Pichl¹³ (n = 28), Nabburg (n = 12) and Starnberg (n = 3) in Germany, Laishevo (n = 10) in the Volga region of Russia, and Stans (n = 32) in Switzerland. Extracts from 41 teeth across these sites tested positive for pla (Supplementary Table 1). All extraction negative controls were free of amplification products. Amplification products from putatively positive individuals were not sequenced, as the presence of Y. pestis was subsequently assessed through whole-genome capture and high-throughput Illumina sequencing.

In addition, shotgun Next Generation Sequencing (NGS) data from individuals unearthed at the New Museums site (Augustinian Friary) in Cambridge (n = 26) were screened for Y. pestis with the MEGAN alignment tool (MALT)¹⁸ (see Methods). The output was post-processed within the pathogen screening pipeline HOPS¹⁹. The assessment of shotgun NGS reads produced from non-uracil-DNA-glycosylase (non-UDG) libraries revealed the potential presence of Y. pestis DNA in four individuals (Supplementary Table 2, Supplementary Fig. 11).

Y. pestis in-solution capture and whole-genome reconstruction

We prepared UDG-treated libraries^20,21 from all putatively positive samples and used a Y. pestis whole-genome in-solution capture approach²² combined with high-throughput sequencing for the retrieval of 1,299,105–79,055,317 raw reads per sequenced library. All data were mapped against the Y. pestis CO92 reference genome (NC_003143.1)³. This resulted in 86,278–3,822,030 unique mapping reads yielding 1.1–80.1-fold coverage across 34 individuals that span the time transect between the 14th and 17th centuries in Europe (Supplementary Table 3). More specifically, we could retrieve two Y. pestis genomes from Cambridge (England), five from London (England), one from Toulouse (France), three from Nabburg, two from Manching-Pichl¹³, one from Starnberg, one from Landsberg am Lech, two from Brandenburg an der Havel¹³ (all from Germany), two from Laishevo (Russia) and 15 from Stans (Switzerland). Of those, 24 isolates showed at least 50% of the reference genome covered at 5-fold (Table 1), which allowed for their confident inclusion in phylogenetic analysis. In addition, we nearly tripled the genomic coverage of the published “549_O” isolate from Ellwangen, Germany (now reaching 14.1-fold), which was previously processed by array-based capture using a different probe design⁹ (Supplementary Table 3).

Table 1 Post-capture sequencing statistics of all new Yersinia pestis genomes that passed quality criteria for inclusion in phylogenetic analysis

Full size table

Y. pestis phylogenetic reconstruction

To infer genetic relationships between the new and previously published Y. pestis isolates, we constructed phylogenies using the maximum likelihood (ML) method, allowing for up to 3% missing data (97% partial deletion) to accommodate lower coverage genomes. As a reference dataset, we used 233 modern isolates^{23,24,25,26,27} (as listed in ref. ²⁸), which represent most of the published Y. pestis genetic diversity. In addition, we included previously published second pandemic isolates (n = 15)^8,9,10,14, a 6th-century AD isolate from Germany²⁹, a 2nd- to 3rd-century AD isolate from the Tian Shan mountains in Kyrgyzstan³⁰, as well as three Bronze Age isolates from the Altai and Volga regions^31,32 (Supplementary Fig. 12).

All newly reconstructed genomes appear on Branch 1 and are closely related to the previously published second pandemic isolates from Europe (Fig. 2), thus confirming their authenticity. In addition, they seem to represent a diverse group of strains that were present across Europe between the 14th and 18th century AD (Fig. 2, Supplementary Data 1). A number of genomes (NAB005, BRA003, STN011 and STN004) were excluded from further analyses as they showed evidence of excess heterozygosity, which is atypical of bacterial genomes (Supplementary Fig. 13). This phenomenon likely arises from enrichment of non-target DNA stemming from closely related organisms, an issue frequently encountered in ancient metagenomic datasets^18,29,33. Moreover, these genomes had notably longer branch lengths in comparison to other contemporaneous isolates from the same archaeological contexts (Supplementary Fig. 14). Their assessment using the recently developed SNPEvaluation tool²⁸ (see Methods) classified their private SNP calls as false-positive, suggesting that the observed branch lengths are erroneous (Supplementary Data 2). Similarly, the previously published SLC1006 and BSS31 genomes⁸ were also excluded from further analyses as they also showed high heterozygosity (Supplementary Fig. 15) and exceedingly longer branch lengths compared to other 14th-century Y. pestis genomes (Supplementary Figs. 14 and 16).

Our phylogenetic reconstruction shows that the LAI009 isolate from Laishevo is ancestral to the BD isolates from southern, central, western and northern Europe, as well as to the previously published late 14th-century isolates from London (6330)¹⁰ and Bolgar City⁹ (Fig. 2). This genome possesses only one derived SNP distinguishing it from the N07 polytomy that gave rise to Branches 1–4 (Fig. 2; Supplementary Data 1)²³. Since all other second pandemic genomes share an additional derived SNP on Branch 1, we interpret LAI009 as the most ancestral form of the strain that entered Europe during the initial wave of the second pandemic that has been identified to date. Regarding the central and western European genomes, NAB003 from Nabburg does not show differences compared to previously published BD genomes from London and Barcelona^9,10. In addition, NMS003 from Cambridge was genotyped based on inspection of its SNP profile, despite it not fulfilling the genomic coverage criteria for inclusion in our phylogenetic analysis (Supplementary Table 3), as its archaeological context makes it distinct from other Y. pestis-positive individuals from the same site (see Supplementary Note 1). As a result, SNP inspection classified it as potentially identical to other BD genomes (Supplementary Data 3). By contrast, certain isolates associated with the BD period are seemingly distinct. For example, TRP002 from Toulouse, which dates to 1347–1350 based on archaeological evidence, forms its own unique branch (Fig. 2; Supplementary Data 1). Qualitative assessment of eight unique SNPs in TRP002 with SNPEvaluation²⁸ classified them as potential false-positives (see Methods, Supplementary Data 2). In addition, after visual inspection, all such variants appear in regions of the genome where reads from diverse sources seem to be mapping (Supplementary Fig. 17) and, therefore, were considered to be of exogenous origin. Similarly, we assessed one unique SNP identified in our re-analysis of the recently published OSL-1 genome from Oslo, Norway⁸ (Fig. 2). Visual inspection revealed it as a low-quality C-to-T transition that could be confined by aDNA damage (Supplementary Fig. 18). Finally, despite exclusion of BSS31 (Siena, Italy) from phylogenetic analysis, two previously identified unique SNPs in this genome were manually inspected, since they were presented as evidence for Y. pestis genetic diversity in Europe during the BD⁸. Importantly, BLASTn analysis of reads overlapping those regions (Supplementary Fig. 18, Supplementary Data 4 and 5) showed a 100% identity to environmental or other enteric bacterial species, but not to Y. pestis. We, hence, conclude that apart from LAI009 all reconstructed genomes associated with the initial pandemic wave have identical genotypes. In addition, we note that structural rearrangements could provide alternative means of genetic diversity. Although architectural differences are vastly abundant among modern Y. pestis genomes³⁴, their assessment in ancient Y. pestis is limited by the short read aDNA data produced here.

We find a number of genomes grouping with the previously described “post-BD” lineage together with published strains from Ellwangen (ELW098/549_O), Germany (1486–1630)⁹, and Marseille, France (1720–1722)¹⁴, which are descended from the European BD isolates (Fig. 2; Supplementary Data 1). Here, we identify the earliest evidence of this lineage in a 14th-century isolate from Manching-Pichl (MAN)¹³ (see Supplementary Note 1), which is followed by the more derived 15th- to 17th-century isolates from Starnberg (STA), Landsberg (LBG), Stans (STN) and Cambridge (NMS), as well as the 17th-century Brandenburg an der Havel (BRA)¹³ and London (BED), all of which provide further evidence for plague’s continuous presence in Europe after the BD. Of note, we retrieved eight nearly identical genomes from Stans (STN, maximum one SNP difference in two of eight genomes; mean SNP distance d = 0), and together with the four identical genomes from 17th-century London (BED) (d = 0), the five previously published nearly identical genomes from Marseille (OBS, maximum one SNP difference in one of five genomes, d = 0), and the seven identical BD isolates from various regions in Europe (d = 0), our results demonstrate low genetic diversity of the bacterium within local outbreaks and/or major epidemics of the second pandemic. In addition, we find that this “post-BD lineage” gave rise to (at least) two distinct clades within Europe, with the Ellwangen isolate being positioned closest to an apparent population split (Fig. 2). From this divergence, one clade gave rise to the strains associated with outbreaks in Germany and Switzerland (15th–17th century AD), and the second encompassed strains from 17th-century London (BED) and 18th-century Marseille (OBS). Notably, these two clades show dissimilar rates of substitution accumulation. For example, the mean SNP distance between the Ellwangen genome (ELW098/549_O) and the London (BED) genomes (d = 45) is double that observed between Ellwangen and Brandenburg (BRA, d = 22), despite an assumption of them being contemporaneous (early 17th century AD) based on archaeological dating (Fig. 2; Supplementary Table 1; Supplementary Note 1).

Analysis of substitution rate variation in Y. pestis

We used the Bayesian framework BEAST v1.8 in order to make an assessment of substitution rate variations across the genealogy of Branch 1 (n = 80), retaining high-quality second pandemic Y. pestis genomes and using available calibration points in our modern and ancient datasets (Supplementary Data 6). Previous studies have demonstrated that overdispersion among Y. pestis branch lengths is unlikely a result of natural selection, and have rather suggested a link between rate acceleration and geographic expansion of certain lineages during epidemic spread^16,23. Our analysis based on the coalescent skyline model (Fig. 3, Supplementary Fig. 19) suggests an over 40-fold difference between the fastest and slowest substitution rates identified on Branch 1 (Fig. 3). In particular, we observe the fastest rates in three internal branches (Fig. 3). The first spans the genetic distance between the strains from Ellwangen (549_O) and London (BED), and supports the conflicting branch lengths of BED and BRA strains described earlier (Fig. 3 and Supplementary Data 7). The second is the branch leading to the 1.ANT strains isolated from Africa (Congo and Uganda) (Fig. 3 and Supplementary Data 8). The broad history of 1.ANT and the time period associated with its establishment in Africa are unknown, though an introduction from Eurasia has been hypothesised^9,35. The third, which displays the fastest rate within the entire Branch 1, is the branch leading to 1.ORI isolates (Fig. 3 and Supplementary Data 9), which is associated with the global spread of Y. pestis via maritime routes during the third plague pandemic (1894–1950s)^15,16. Our results, therefore, support the idea of faster substitution rates during epidemic spread, here particularly noticeable for lineages known to have expanded over wide geographic areas.

Analysis of virulence-associated genomic profiles

To investigate the genomic profiles of all newly reconstructed genomes, we analysed the presence or absence of potential virulence-associated and evolutionary determinant genes located on the Y. pestis chromosome (Fig. 4a) and plasmids (Supplementary Fig. 20)^36,37, in comparison to published representatives of ancient and modern strains. We find that the genetic profiles of some of the previously characterised historical strains are influenced by the capture design used for their retrieval. Specifically, the second pandemic genomes “Bolgar 2370” and “Barcelona 3031” (ref. ⁹) and the first pandemic genome “Altenerding 2148” (ref. ²⁹) seem to lack coverage in certain Y. pestis-specific regions, since Yersinia pseudotuberculosis was previously used as a probe-design template for their enrichment^9,29 (Fig. 4a). Regarding the newly reconstructed strains, we find that most possess all analysed genes with the exception of the New Churchyard (BED) and Marseille (OBS) strains that lack the magnesium transporter genes mgtB and mgtC, as well as the Cambridge (NMS002) strain that is lacking the inv gene (Fig. 4). While invasin is associated with epithelial colonisation of Y. pseudotuberculosis and Yersinia enterocolitica, it is known to have been inactivated in Y. pestis³⁸. By contrast, magnesium transporters are considered vital for Y. pestis intracellular survival under low Mg²⁺ conditions³⁹, such as those encountered within macrophage phagosomes. Specifically for Y. pestis, mgtB disruption has been associated with a decreased ability for macrophage invasion resulting in its attenuated virulence in mice⁴⁰. Both mgtB and mgtC are present in all 233 modern Y. pestis genomes used in our comparative dataset. We explored these gene deletions in greater detail using BWA-MEM and identified them as part of a 49-kb missing region within the BED and OBS genomes (1,879,467–1,928,869 on CO92) (Fig. 4b, Supplementary Fig. 21) flanked by an IS100 element immediately following its downstream end, which is consistent with previously characterised disruptions or losses of Y. pestis genomic regions via insertion elements⁴¹. Apart from mgtB and mgtC, this region encompasses a set of 34 additional genes that code for both characterised and hypothetical proteins, most of which seem to be associated with phenotypic characteristics that appear inactivated in Y. pestis such as motility and chemotaxis as well as few genes associated with metabolism, structure synthesis and environmental stress response (Supplementary Fig. 21, Supplementary Table 4). In addition, the clade encompassing this deletion is associated with some of the late outbreaks of the second plague pandemic, i.e. during the 17th century in London, England (BED) (see Supplementary Note 1), and during the 18th-century Plague of Marseille, in France (OBS 1720–1722 AD)¹⁴, which was one of the last major epidemics that occurred in continental Europe⁴². Intriguingly, a nearly identical genomic deletion (45 kb), also including the mgtB and mgtC virulence-associated genes, was recently identified in ancient isolates from France (LVC, LSD)²⁸ sequenced from victims of the first plague pandemic (6th–8th centuries AD)²⁸. These genomes are described elsewhere and date within a wide temporal interval (550–650 AD), though based on existing data they appear to be the youngest first pandemic isolates sequenced to date²⁸.

Discussion

A series of studies have sufficiently demonstrated the preservation of Y. pestis in ancient human remains from a wide temporal transect^{8,9,10,14,22,29,31,32,43}. This study presents an extensive sampling of multiple European epidemic burials from the period between the 14th and 17th centuries in order to gain a more complete picture of Y. pestis’ genetic history during the second plague pandemic. Here, we nearly triple the amount of genomic data available from that time period (Fig. 1, Table 1 and Supplementary Table 3) and integration with existing datasets reveals key aspects regarding the initiation and progression of the second plague pandemic in Europe.

Based on historical sources alone, it has been difficult to determine the time at which Y. pestis first reached different parts of western Russia⁷. A commonly accepted view dates its arrival in the southwest, particularly in cities of Astrakhan and Sarai, in 1346^1,44 with subsequent spread into southern Europe from the Crimean peninsula. On the other hand, the dispersal of plague into northwestern Russia (i.e. in the cities of Pskov and Novgorod^7,44) may have followed an alternative route via the Baltic Sea, occurring at the end of the BD between 1351 and 1353^1,7,44. Such a notion of plague’s expansion from northern Europe eastwards is also supported by published ancient genomic data from the late 14th-century Middle Volga region of Russia⁹, though other scenarios may come to light with incorporation of additional genomic and historical data. Importantly, through analysis of our new strain from Laishevo (LAI009), which is phylogenetically ancestral to all second pandemic strains sequenced to date (Fig. 2), we provide evidence for the bacterium’s presence in the same region, ~2000 km northeast of the Crimean peninsula, prior to reaching southern Europe in 1347–1348¹ (currently represented by strains from Siena, Saint-Laurent-de-la-Cabrerisse, Barcelona and Toulouse^8,9). These results suggest that the N07-derived SNP previously termed “p1”⁹ (Fig. 2, Supplementary Fig. 12), that is common to all other second pandemic strains, was likely acquired within Europe during the onset of the BD. In addition, given the proximity of the LAI009 genome to the N07 node often associated with the initiation of the BD (Fig. 2, Supplementary Fig. 12)²³, further data will be necessary to accurately re-evaluate the geographic origin of Branch 1. Previous analyses have proposed East Asia as the mostly likely candidate for the N07 polytomy^10,23 (Fig. 2). Such claims, however, cannot yet be verified given; (1) the apparent East Asian sampling bias of modern isolates^23,45, (2) the lack of molecular evidence from East Asia dating to the early 14th century and (3) the scarcity of historical documentary sources from this region describing precise disease symptoms⁴⁶. In addition, recently published modern Y. pestis genomes from Central Asia show a rich diversity in the local plague foci^26,27, and further sampling from these regions has the potential to inform hypotheses on plague movement and evolution.

The identification of low genomic diversity during the initial wave of the second pandemic becomes particularly informative when attempting to reconstruct the spread of plague after 1353. Previous research based on climatic proxies¹² as well as PCR⁴⁷ and genomic⁸ data have proposed multiple introductory waves of Y. pestis into Europe as the main source for the post-BD outbreaks recorded until the 18th century. Here, using previously published^8,9,10,14 and new whole-genome data from 20 archaeological sites, we identify that all genomes associated with post-BD outbreaks in Europe derived from a single ancestral strain that was present in southern, central, western and northern Europe during the BD. We, therefore, interpret the current data as supporting a single entry of Y. pestis during the BD, though additional interpretations may arise through the discovery of unsampled diversity in western Eurasia. Subsequent to its entry, we observe the formation of two sister lineages (Fig. 2). The first lineage is responsible for the bacterium’s possible eastward expansion after the BD. It contains strains from late-14th-century Bergen op Zoom, London (6330)¹⁰ and the city of Bolgar (2370)⁹, as well as extant strains from Africa (1.ANT)⁴⁸, and, most importantly, a worldwide set of isolates associated with the third pandemic (1.ORI, 19th–20th centuries)^15,16,23 (Fig. 2). The second, here termed the “post-BD lineage”, is characterised by a profound genomic diversity identified within Europe that seems to have been restricted to the second pandemic, as no modern descendants have been identified for this lineage to date. It is represented by historical genomes isolated from 14th- to 18th-century Germany (MAN, STA, ELW, LBG and BRA), Switzerland (STN), England (NMS, BED) and France (OBS) (Fig. 2), suggesting that it persisted in Europe or its vicinity and caused infections over a wide geographic range. The fact that this lineage has no identified modern descendants is likely related to the disappearance of plague from Europe in the 18th century, possibly due to extinction of local reservoirs, as previously suggested⁹.

We find that the “post-BD lineage” gave rise to (at least) two distinct clades that separate the strains identified in Central Europe during the 15th–17th centuries, and those identified in 17th- to 18th-century England and France. Their distinction is corroborated not only by their genetic and geographic separation (Fig. 2), but also by potential differences in their genomic profiles (Fig. 4) and substitution rates (Fig. 3). The clade that exhibits a slower substitution rate is mainly represented by temporally and genetically closely related isolates from Germany and Switzerland (Fig. 2), which could indicate endemic circulation of the bacterium in that region. Such an observation may be compatible with the hypothesis of an Alpine rodent reservoir facilitating the spread of plague in Central Europe after the BD⁴⁹, although a possible sampling bias should be noted since the majority of our data derive from this region. On the other hand, the clade that exhibits a faster substitution rate (Fig. 3) appears to have had a wider geographic distribution. Given that both Marseille and London were among the main maritime trade centres in Europe during that time, it is plausible that introduction of the disease in these areas occurred via ships⁵⁰, although sources favouring local epidemic eruptions also exist⁵¹. Previous studies have demonstrated that transmission of Y. pestis via steamships during the 19th century played a significant role in initial introduction of the bacterium to several regions worldwide, such as in Madagascar where it persists until today^15,16,52,53. As such, the possibility of maritime introductions of plague into London and Marseille during the second pandemic vastly expands the breadth of potential geographic source(s) for these strains. Nevertheless, the phylogenetic positioning of the BED and OBS genomes within the “post-BD lineage” and in relation to other second pandemic isolates suggests they arose within Europe or its vicinity.

We identified a 49-kb deletion within both BED and OBS genomes (Fig. 4b), which caused the loss of two virulence-associated genes, mgtB and mgtC (Fig. 4a). This deletion could not be identified in other second pandemic or modern strains in our dataset (Supplementary Fig. 21). The inferred virulence potential of mgtB and mgtC genes is associated with intracellular survival of Y. pestis within macrophages^40,54. Their co-expression has been shown to affect the virulence exerted by other pathogenic enterobacteria under laboratory conditions^55,56 and both genes have been proposed as potential drug targets^40,57. Moreover, the function of mgtB was shown to be temperature-dependent, being active at 37 °C but not at 20 °C⁵⁸, suggesting its loss affects the bacterium in warm-blooded hosts. Intriguingly, a 45-kb deletion in the same region was identified in genomes associated with late outbreaks of the first plague pandemic (6th–8th century AD)²⁸, which sets it as a candidate for convergent evolution and raises questions regarding its functional importance. Given that all genomes displaying this deletion were obtained from plague victims, including the Great Plague of Marseille (1720–1722 AD) that is known to have caused high mortality, its occurrence may not have reduced the pathogen’s virulence, particularly since genome decay is a well-established characteristic of Y. pestis evolution^59,60. Nevertheless, since both lineages that show this deletion are likely extinct, its functional characterisation will be of importance to evaluate potential effects on maintenance in mammalian and arthropod hosts, in Europe, during the first and second pandemics.

The second plague pandemic has arguably caused the highest levels of mortality of the three recorded plague pandemics^1,61. It serves as a classic historical example of rapid infectious disease emergence, long-term local persistence and eventual extinction for reasons that are currently not understood. We have shown that extensive sampling of ancient Y. pestis genomic data can provide direct molecular evidence on the genetic relationships of strains present in Europe during that time. In addition, we provide relevant information regarding the initiation and progression of the second pandemic and suggest that a single source reservoir may be insufficient to explain the breadth of epidemics and Y. pestis’ genetic diversity in Europe during the 400-year course of the pandemic. Although certain key regions in western Eurasia remain under-sampled for ancient Y. pestis DNA, namely the eastern Mediterranean, Scandinavia and the Baltics, vast amounts of high-quality genomic data are becoming increasingly available. Their integration into disease modelling efforts, which consider vector transmission dynamics^62,63, climatic^12,64,65 and epidemiological data⁶⁶, as well as a critical re-evaluation of historical records⁶⁷, will become increasingly important for better understanding the second plague pandemic.

Methods

Tooth sampling, DNA extraction and Y. pestis qPCR screening

Laboratory work was primarily performed in the dedicated aDNA facilities of the Max Planck Institute for the Science of Human History in Jena. Part of the sampling and DNA extractions were performed at aDNA facilities of the ArchaeoBioCenter of the Ludwig Maximilian University of Munich and aDNA facilities of the University of Cambridge, Department of Archaeology.

One-hundred and eighty teeth from nine sites located in England (BED), France (TRP), Germany (NAB, MAN, STA, LBG, BRA), Russia (LAI) and Switzerland (STN) spanning the 14th–17th centuries (see Supplementary Note 1) were sectioned in the cementoenamel junction, and 30–70 mg of powder was removed from the surface of the pulp chamber using a dentist drill. This powder was then used for DNA extraction, using a protocol optimised for the retrieval of short fragments that are characteristic of ancient DNA⁶⁸. Tooth powder was incubated in 1 ml of lysis buffer (0.45 M EDTA, pH 8.0, and 0.25 mg/ml proteinase K) overnight (12–16 h) at 37 °C. Then, DNA was bound to the silica membrane of spin columns using 10 ml of GuHCl-based binding buffer as described before⁶⁸, followed by a purification that was performed using either the MinElute purification kit (Qiagen) or the High Pure Viral Nucleic Acid Large Volume Kit (Roche). DNA was eluted in 100 μl of TET (10 mM Tris-HCl, 1 mM EDTA pH 8.0, 0.05% Tween 20). Extraction blanks and a positive extraction control (cave bear specimen) were taken along for every extraction batch. All extracts were then evaluated for PCR inhibition, by spiking 2 μl of each extract in a qPCR reaction containing a standard of known concentration¹⁷. None of the extracts showed signs of PCR inhibitions and, therefore, all were tested by qPCR for the presence of the plasminogen activator gene (pla), located on the Y. pestis-specific pPCP1 plasmid using a published protocol¹⁷. PCR products were not sequenced as all putatively positive samples were subsequently evaluated through whole-genome enrichment and next-generation sequencing. All extraction and PCR blanks were free of amplification products.

In addition, 26 specimens from the Augustinian Friary in Cambridge (NMS) were sampled and DNA was extracted at the University of Cambridge. Roots were sawed from teeth using a sterile dremel cutting wheel and a UV-irradiated toothbrush was then used to briefly brush the roots with 5% w/v NaOCl. Subsequently, roots were soaked in 6% w/v bleach for 5 min, then rinsed twice with ddH₂O, and finally soaked in 70% ethanol for 2 min. The roots were then transferred to a sterile paper towel and UV irradiated for 50 min on each side. After irradiation, teeth were weighed and subsequently transferred in 5-ml or 15-ml tubes for DNA extraction. DNA extraction was carried out as follows: 2 ml of EDTA (0.5 M, pH 8.0) and 50 μl of Proteinase K (10 mg/ml) were used for every 100 mg of sample. Extractions were then incubated at room temperature for 72 h. Extracted DNA was concentrated using the Amicon Ultra-15 concentrators with a 30-kDa filter, down to 250 μl. DNA was then purified using the MinElute PCR purification kit (Qiagen) according to manufacturer’s instruction. For the elution step, column-bound DNA was incubated with 100 μl of Elution buffer for 10 min at 37 °C.

Non-UDG library preparation and metagenomic screening with HOPS

The following protocol was carried out in the ancient DNA facility of the University of Cambridge, Department of Archaeology.

Non-UDG libraries were prepared for the NMS samples (Augustinian Friary, Cambridge; Supplementary Table 2) with the NEBNext® Library Preparation Kit for 454 (E6070S, New England Biolabs, Ipswich, MA) using a modified version of the manufacturer’s protocol⁶⁹. Adaptors were constructed as previously described²¹. Indexing PCR reactions were set up as follows: 50 µl of DNA library, 1× PCR buffer, 2.5 mM MgCl₂, 1 mg/ml BSA, 0.2 µM in PE 1.0, 0.2 mM dNTP each, 0.1 U/µl HGS Taq Diamond and 0.2 µM indexing primer, with the following cycling conditions: 5 min at 94 °C, followed by 18 cycles of 30 s each at 94 °C, 60 °C and 68 °C, with a final extension of 7 min at 72 °C. Amplified products were purified using the MinElute kit (Qiagen) and DNA was eluted in 35 μl EB. The indexed libraries were then quantified using the Quant-iT™ PicoGreen® dsDNA kit (P7589, Invitrogen™ Life Technologies) on the Synergy™ HT Multi-Mode Microplate Reader with Gen5™ software. Subsequent shotgun sequencing of these libraries was carried out on an Illumina NextSeq500 platform (using the High-Output kit 1 × 75 cycle chemistry) at the University of Cambridge Biochemistry DNA Sequencing Facility.

The program MALT (version 0.4.0)¹⁸, integrated in the pathogen screening pipeline HOPS¹⁹, was used to assess the presence of Y. pestis DNA in the NMS specimens. A custom NCBI RefSeq (November 2017) database was used for running MALT, including all bacterial and viral assemblies marked as complete, a selection of eukaryotic pathogen genomes, as well as the human reference sequence (GRCh38). Genomes with keywords such as “unknown” were removed. A total of 15,361 genomes were retained in the database. Pre-processed shotgun NGS reads (.fastq) were used as input and the parameters were set as follows: 85 for the minimum percentage identity (-minPercentIdentity), 1 for the minimum support (-minSupport), using a top percentage value of 1 (-topPercent), a semi-global alignment mode, and with all remaining parameters set to default. The resulting “.rma6” output files were automatically post-processed with MALTExtract (in HOPS) against a list of 100 target bacterial pathogen species, and the resulting profiles were qualitatively assessed within HOPS for the number of aligning reads, the read edit distance against different taxa and the presence of aDNA damage patterns¹⁹.

UDG library preparation and Y. pestis whole-genome capture

All putative Y. pestis-positive samples were subsequently converted into Illumina double-stranded DNA libraries as described before²¹, using a starting volume of 50–60 μl, with an initial USER (New England Biolabs) treatment step, where UDG was used in combination with endonuclease VIII to excise uracil nucleotides that result from post-mortem DNA damage^20,70. Subsequently, full UDG-treated and partially UDG-treated libraries were quantified on a qPCR using the IS7/IS8 primer combination. Following, a double-indexing step was performed where libraries were split into multiple PCR reactions based on their initial quantification⁷¹, in order to ensure maximal amplification efficiency. Every reaction was assigned a maximum input of 2 × 10¹⁰ DNA molecules. A unique index combination (index primer containing a unique 8-bp identifier) was assigned to every library, and a 10-cycle amplification reaction was used to attach index combinations to DNA library molecules using Pfu Turbo Cx Hotstart DNA Polymerase (Agilent). PCR products were purified using the MinElute DNA purification kit (Qiagen), and eluted in TET (10 mM Tris-HCl, 1 mM EDTA pH 8.0, 0.05% Tween 20). After indexing, all libraries were amplified using Herculase II Fusion DNA Polymerase (Agilent) to a concentration of 200–300 ng/μl, in order to achieve 1–2 μg of DNA in a total of 7 μl. Products were again purified using the MinElute DNA purification kit (Qiagen), and eluted in TET (10 mM Tris-HCl, 1 mM EDTA pH 8.0, 0.05% Tween 20). In-solution whole-genome Y. pestis capture was then performed as described previously²², where the following genomes were used as templates for probe design: CO92 chromosome (NC_003143.1), CO92 plasmid pMT1 (NC_003134.1), CO92 plasmid pCD1 (NC_003131.1), KIM10 chromosome (NC_004088.1), Pestoides F chromosome (NC_009381.1) and Y. pseudotuberculosis IP 32953 chromosome (NC_006155.1). DNA captures were carried out on 96-well plates. Each sample was either captured in its individual well, or pooled with maximum one more sample from the same site. Capture enrichment was carried out for two rounds, except for sample NMS002 that was captured for one round. Blanks with non-overlapping index combinations were captured together.

Sequencing and read processing

After capture, all products were sequenced on an Illumina NextSeq500 platform using (1 × 151 + 8 + 8 cycles or 1 × 76 + 8 + 8 cycles) or on a HiSeq4000 (using 1 × 76 + 8 + 8 cycles or 2 × 76 + 8 + 8 cycles). Preprocessing of de-multiplexed reads was performed on the automated pipeline EAGER (v1.92.55)⁷² and involved the removal of Illumina adaptors and read merging using AdapterRemoval v2 (ref. ⁷³), as well as filtering all reads for sequencing quality (minimum base quality of 20) and length (to retrieve only reads ≥30 bp). Subsequently, reads were mapped with BWA (version 0.7.12)⁷⁴, implemented in EAGER, against the CO92 reference genome (NC_003143.1)³ using stringent parameters (-n 0.1, -l 32) for genome reconstruction and mean coverage calculation and more lenient parameters (-n 0.01, -l 32) for inspection of regions surrounding potential variants. Reads with mapping quality lower than 37 (-q) were removed using SAMtools (http://samtools.sourceforge.net/), and PCR duplicates were removed using the MarkDuplicates tool (http://broadinstitute.github.io/picard/). Prior to SNP identification, raw pre-processed reads from partially-UDG-treated libraries were trimmed for 2-bp at both ends to remove sites that could be affected by aDNA damage and, subsequently, were re-filtered for length and re-mapped using stringent parameters.

SNP calling and phylogenetic analysis

SNP calling was performed using the UnifiedGenotyper of the Genome Analysis Toolkit (GATK v3.5)⁷⁵. Our newly reconstructed genomes were analysed alongside previously published Y. pestis genomes, which included a modern-day dataset of 233 genomes^{23,24,25,26,27,48} (as listed in ref. ²⁸), three Bronze Age strains³¹, a 2nd- to 3rd-century AD isolate from the Tian Shan mountains in Kyrgyzstan³⁰, one Justinianic strain (Altenerding 2148)²⁹, 15 previously published historical genomes from the second plague pandemic^8,9,10,14 and a Y. pseudotuberculosis strain (IP32953)⁶⁰ that was used as outgroup for rooting the phylogeny. A vcf file was produced for every genome using the “EMIT_ALL_SITES” option, which generated a call for every position present in the reference genome. Furthermore, we used the custom java tool MultiVCFAnalyzer v0.85 (ref. ³³) (https://github.com/alexherbig/MultiVCFAnalyzer) to produce a SNP table of variant positions across all genomes analysed, using the following parameters: for homozygous alleles, a SNP would be called when covered at least 3-fold with a minimum genotyping quality of 30, and for heterozygous alleles, a variant would be called when 90% of reads would support it. In cases where none of the parameters would be met, an “N” would be inserted in the respective genomic position. In addition, we omitted previously defined noncore regions, as well as annotated repetitive elements, homoplasies, tRNAs, rRNAs and tmRNAs from our SNP analysis^16,23. In the present dataset, a total of 7,510 variant positions were identified. Subsequently, the annotation as well as the effect of each SNP was determined through the program SnpEff v3.1i (ref. ⁷⁶).

We used a SNP alignment produced by MultiVCFAnalyzer v0.85 to construct phylogenetic trees using the ML and maximum parsimony (MP) methods. Up to 3% missing data were included in the analysis (97% partial deletion), resulting in a total number of 6,058 SNPs used for phylogenetic reconstruction. The MP phylogeny was produced in MEGA7 (ref. ⁷⁷) in order to make a first assessment of genome topologies. The ML phylogenies were constructed with the program RAxML (version 8.2.9)⁷⁸ using the Generalised Time Reversible (GTR)⁷⁹ model with four gamma rate categories and 1000 bootstrap replicates to assess tree topology support.

Reanalysis of recently published non-UDG Y. pestis genomes

A recent study described the phylogenetic positioning and SNP analysis of five 14th century Y. pestis genomes⁸. As these genomes were non-UDG treated, they were reanalysed here using different criteria compared to other second pandemic and modern genomes in our dataset. Read pre-processing and merging was done as described in the above section “Sequencing and read processing”. In addition, read mapping against the CO92 reference genome (NC_003143.1) was performed using more lenient parameters in BWA⁸⁰ (-n 0.01, -l 16) than the ones previously reported⁸, to account for ancient DNA deamination within mapping reads. In our view, the usage of strict BWA mapping parameters for non-UDG data (i.e. –n 0.1) could potentially introduce a reference bias to the analysis, which could in turn affect SNP discovery and phylogenetic inferences. PCR duplicates were removed from all five datasets using MarkDuplicates (http://broadinstitute.github.io/picard/) and reads were filtered for mapping quality (q 37) using SAMtools (http://samtools.sourceforge.net/). The obtained mean coverage for each of the five re-analysed genomes was: 3.4-fold for BSS31 (27.8% covered 5-fold), 6.7-fold for SLC1006 (59.1% covered 5-fold), 30.5-fold for OSL-1 (91.7% covered 5-fold), 38.1-fold for Ber37 (95.2% covered 5-fold) and 46.1-fold for Ber45 (94.1% covered 5-fold). In addition, the obtained average fragment lengths for the five re-analysed genomes were as follows: 52.2 bp for BSS31, 71.5 bp for SLC1006, 108 bp for OSL-1, 61.9 bp for Ber37 and 69.7 bp for Ber45. Before SNP calling, MapDamage2.0 (ref. ⁸¹) was used to rescale base qualities, primarily on the extremities of mapped reads, to account for sites that could potentially be affected by aDNA deamination. Subsequently, SNPs were called using GATK and the resulting vcf files were comparatively assessed in MultiVCFAnalyzer v0.85 (ref. ³³) to compile a SNP table including all genomes in the dataset as described in the above section “SNP calling and phylogenetic analysis”.

Qualitative SNP assessment in UDG-treated data using SNPEvaluation

A frequent challenge faced when using ancient metagenomic datasets to reconstruct bacterial genomes is a strong environmental signal that can interfere with SNP assignments, especially in low-coverage data²⁹. Such an effect can interfere with phylogenetic analyses by creating artificial branch lengths, which can in turn affect evolutionary inferences. As such, in order to avoid erroneous SNP assignments, we qualitatively evaluated all private SNP calls for the newly reconstructed genomes that were used for phylogenetic analysis in this study (minimum 50% of the genome covered 5-fold (Table 1)). We used the recently developed SNPEvaluation tool (https://github.com/andreasKroepelin/SNP_Evaluation) to compare the SNP profiles that arise for each dataset under both stringent (BWA parameters -n 0.1, -l 32) and more lenient (BWA parameters: -n 0.1, -l 16) mapping parameters. Subsequently, the region around each SNP was evaluated within a 50-bp window, and was accepted as true when fulfilling the following criteria: (i) the ratio of coverage between the lenient and stringent mapping was not higher than 1.00, (ii) there were no heterozygous positions within this window when considering a high stringency mapping and (iii) no missing regions/bases were observed within close proximity to the identified SNP (see Supplementary Data 2). Note that the above criteria in SNPEvaluation have been determined and evaluated in UDG-treated metagenomic data²⁸ and, therefore, would need to be re-adapted for non-UDG-treated data that are heavily affected by aDNA deamination.

Heterozygosity estimates

Heterozygous variant calls were investigated given the disparity of branch lengths observed in certain newly reconstructed and previously published genomes (see Supplementary Figs. 14 and 16). Our approach takes into account the “haploid” nature of prokaryotic genomes, suggesting that “heterozygous” SNPs could either arise as a result of mixed infections or from erroneous mapping of DNA reads that belong to closely related bacterial contaminants. We performed SNP calling with the UnifiedGenotyper in GATK⁷⁵, using the “EMIT_ALL_SITES” option that generated a call for all positions in the reference genome. We then used MultiVCFAnalyzer v0.85 (ref. ³³) to compile a SNP table of variant positions with allele frequencies 10–90% across our dataset, hence accounting for all ambiguous heterozygous positions. Histograms of allele frequencies for all SNPs with <100% read support were constructed with R v3.4.1 (ref. ⁸²) using representative genomes from all sites.

Estimates of substitution rate variation in Y. pestis

In order to calculate the substitution rate variation across Y. pestis isolates associated with the second pandemic, we first assessed the temporal signal across Branch 1 that includes all genomes from both the second and third plague pandemics. For this, we computed an ML phylogeny in RaxML⁷⁸ using all Branch 1 genomes^{3,8,9,10,14,16,23,48,83,84} (modern + ancient n = 79), with the exception of the BD genomes TRP002 and OSL-1 that showed possible environmental contamination to be affecting their SNP calls. In addition, we used the strain 2.MED KIM10 (Branch 2) as outgroup for rooting the tree. Variant positions across this set of genomes were used for the analysis, allowing for up to 3% missing data (550 SNPs). We used TempEst v1.5 (http://tree.bio.ed.ac.uk/software/tempest/) for calculating the root-to-tip regression in relation to specimen or sampling ages. The calculated correlation coefficient (R) and R² values were 0.57 and 0.33, respectively, which permitted the proceeding with molecular dating analysis.

The Bayesian framework BEASTv1.8 (ref. ⁸⁵) was used to assess the substitution rate variation across the Y. pestis Branch 1 using the 2.MED KIM10 as outgroup. Our BEAUti setup took into consideration all archaeological, radiocarbon and sampling dates of both ancient and modern genomes (Supplementary Data 6) that were used as calibration points for the Bayesian phylogeny. Divergence dates for each node in the tree were estimated as years before the present, where the year 2005 was set as the present since it represents the most recently isolated modern Y. pestis strain on Branch 1 (1.ORI MG05). Monophyletic clades were defined based on the ML phylogeny (Supplementary Fig. 12). The GTR⁷⁹ substitution model (4 gamma rate categories) and a lognormal relaxed clock (clock rate tested and strict clock rejected in MEGA7⁷⁷) were used to set up two separate analyses using the coalescent constant size⁸⁶ and coalescent Bayesian skyline⁸⁷ demographic models. For each analysis, three independent chains of 50,000,000 states each were carried out and then combined using LogCombiner to ensure run convergence, with 10% burn-in. In addition, we estimated marginal likelihoods to determine the best demographic model for our dataset using path sampling and stepping stone sampling (PS/SS) implemented in BEAST v1.8 (ref. ⁸⁵). For this, each of the described runs was carried out for an additional 50,000,000 states (500,000 states divided into 100 steps) using an alpha parameter of 0.3, which determined the coalescent Bayesian skyline model as better fit for the current dataset. The results produced by the run considering this demographic model were then viewed in Tracer v1.6 (http://tree.bio.ed.ac.uk/software/tracer/) to ensure all relevant effective sample sizes (ESS) were >200. We used TreeAnnotator⁸⁵, to produce a maximum clade credibility (MCC) phylogeny using the best-fit model with 10% burn-in, which resulted in the processing of 13,503 trees. The MCC tree was viewed and modified in FigTree v1.4 (http://tree.bio.ed.ac.uk/software/figtree/) where branch lengths were represented as a function of age and mean rates were used to colour individual branches. Finally, the skyline plot was produced and viewed using Tracer v1.6 (http://tree.bio.ed.ac.uk/software/tracer/) after resampling states at a lower frequency (every 100,000) using LogCombiner⁸⁵.

Gene presence/absence and deletion analysis

In order to investigate the virulence-associated gene profiles of the newly reconstructed second pandemic genomes, the highest quality (coverage) genome from every site (LAI009, NAB003, TRP002, MAN008, STA001, NMS002, LBG002, STN014, BRA001, BED030) was used for comparison with each other and with previously published representatives of ancient (London BD 8124/8291/11972, OSL-1 Ber45, London 6330, Bolgar 2370, Barcelona 3031, Ellwangen 549_O, OBS137, RISE509, RT5, Altenerding 2148) and modern (1.ORI-CO92, 0.PE2-PESTF, 0.PE4-Microtus 91001) Y. pestis isolates as well as a Y. pseudotuberculosis strain (IP32953). All listed genomes were re-mapped against the CO92 chromosomal reference genome (NC_003143.1) without the use of a mapping quality filter of (-q 0). The coverage across 80 chromosomal and 43 plasmid virulence-associated and evolutionary determinant genes that were previously defined³⁶ was calculated using bedtools⁸⁸. The results are plotted in the form of a heatmap using the ggplot2 (ref. ⁸⁹) package in R version 3.4.1 (ref. ⁸²) and can be viewed in Fig. 4. In addition, we used BWA-MEM⁸⁰ to explore the precise coordinates of observed gene or region losses in all affected genomes using default parameters. For the visualisation of an identified deletion across BED and OBS isolates, we computed the average coverage across 3,000-bp windows in representative Y. pestis genomes from all analysed periods of the second pandemic, and subsequently used the program Circos⁹⁰ to produce coverage plots of a 20-fold maximum coverage. The coverage plots were arranged in chronological order as follows: LAI009, London BD 8124/8291/11972, Ber45, Bolgar 2370, MAN008, STA001, NMS002, ELW098, LBG002, STN014, BRA001, BED030, OBS137 and the reference genome CO92.

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

Raw sequencing data of the deep-sequenced genomes are available on the European Nucleotide Archive under project accession number PRJEB29990 . Other data supporting the findings of the study are available in this article and its Supplementary Information files, or from the corresponding authors upon request.

References

Benedictow, O. J. The Black Death, 1346-1353: The Complete History (Boydell and Brewer, Woodbridge, UK, and Rochester, N.Y., 2004).
Biraben, J.-N. Les Hommes et la peste en France et dans les pays européens et méditerranéens. t. 2, les hommes face à la peste (Mouton, Paris, 1976).
Parkhill, J. et al. Genome sequence of Yersinia pestis, the causative agent of plague. Nature 413, 523–527 (2001).
Article CAS ADS PubMed Google Scholar
Gage, K. L. & Kosoy, M. Y. Natural history of plague: perspectives from more than a century of research. Annu. Rev. Entomol. 50, 505–528 (2005).
Article CAS PubMed Google Scholar
Prentice, M. B. & Rahalison, L. Plague. Lancet 369, 1196–1207 (2007).
Article PubMed Google Scholar
Tikhomirov, E. Epidemiology and Distribution of Plague. in Plague manual: epidemiology, distribution, surveillance and control. (eds. Dennis, D. T. et al.) 11-41 (World Health Organisation, Geneva, 1999).
Alexander, J. T. Bubonic Plague in Early Modern Russia: Public Health and Urban Disaster (Johns Hopkins University Press, Baltimore, Maryland, USA, 1980).
Google Scholar
Namouchi, A. et al. Integrative approach using Yersinia pestis genomes to revisit the historical landscape of plague during the Medieval Period. Proc. Natl Acad. Sci. USA 115, E11790–E11797 (2018).
Article CAS PubMed PubMed Central Google Scholar
Spyrou, M. A. et al. Historical Y. pestis genomes reveal the European Black Death as the source of ancient and modern plague pandemics. Cell Host Microbe 19, 874–881 (2016).
Article CAS PubMed Google Scholar
Bos, K. I. et al. A draft genome of Yersinia pestis from victims of the Black Death. Nature 478, 506–510 (2011).
Article CAS ADS PubMed PubMed Central Google Scholar
Büntgen, U., Ginzler, C., Esper, J., Tegel, W. & McMichael, A. J. Digitizing historical plague. Clin. Infect. Dis. 55, 1586–1588 (2012).
Article PubMed Google Scholar
Schmid, B. V. et al. Climate-driven introduction of the Black Death and successive plague reintroductions into Europe. Proc. Natl Acad. Sci. USA 112, 3020–3025 (2015).
Article CAS ADS PubMed PubMed Central Google Scholar
Seifert, L. et al. Genotyping Yersinia pestis in historical plague: evidence for long-term persistence of Y. pestis in Europe from the 14th to the 17th Century. PLoS ONE 11, e0145194 (2016).
Article PubMed PubMed Central Google Scholar
Bos, K. I. et al. Eighteenth century Yersinia pestis genomes reveal the long-term persistence of an historical plague focus. eLife 5, e12994 (2016).
Article PubMed PubMed Central Google Scholar
Pollitzer, R. The Plague No. 26 (World Health Organization, Geneva, 1954).
Morelli, G. et al. Yersinia pestis genome sequencing identifies patterns of global phylogenetic diversity. Nat. Genet. 42, 1140–1143 (2010).
Article CAS PubMed PubMed Central Google Scholar
Schuenemann, V. J. et al. Targeted enrichment of ancient pathogens yielding the pPCP1 plasmid of Yersinia pestis from victims of the Black Death. Proc. Natl Acad. Sci. USA 108, E746–E752 (2011).
Article CAS PubMed PubMed Central Google Scholar
Vågene, A. J. et al. Salmonella enterica genomes from victims of a major sixteenth-century epidemic in Mexico. Nat. Ecol. Evol. 2, 520–528 (2018).
Article PubMed Google Scholar
Huebler, R. et al. HOPS: automated detection and authentication of pathogen DNA in archaeological remains. Preprint at https://www.biorxiv.org/content/10.1101/534198v2 (2019).
Briggs, A. W. et al. Removal of deaminated cytosines and detection of in vivo methylation in ancient DNA. Nucleic Acids Res. 38, e87 (2010).
Article PubMed Google Scholar
Meyer, M. & Kircher, M. Illumina sequencing library preparation for highly multiplexed target capture and sequencing. Cold Spring Harb. Protoc. 2010, pdb.prot5448 (2010).
Article PubMed Google Scholar
Andrades Valtueña, A. et al. The Stone Age plague and its persistence in Eurasia. Curr. Biol. 27, 3683–3691 (2017). e3688.
Article PubMed Google Scholar
Cui, Y. et al. Historical variations in mutation rate in an epidemic pathogen, Yersinia pestis. Proc. Natl Acad. Sci. USA 110, 577–582 (2013).
Article CAS ADS PubMed Google Scholar
Kislichkina, A. A. et al. Nineteen whole-genome assemblies of Yersinia pestis subsp. microtus, including representatives of Biovars caucasica, talassica, hissarica, altaica, xilingolensis, and ulegeica. Genome Announc. 3, e01342–01315 (2015).
Article PubMed PubMed Central Google Scholar
Zhgenti, E. et al. Genome assemblies for 11 Yersinia pestis strains isolated in the Caucasus region. Genome Announc. 3, e01030–01015 (2015).
Article PubMed PubMed Central Google Scholar
Kutyrev, V. V. et al. Phylogeny and classification of Yersinia pestis through the lens of strains from the plague foci of Commonwealth of Independent States. Front. Microbiol. 9, 1106 (2018).
Article PubMed PubMed Central Google Scholar
Eroshenko, G. A. et al. Yersinia pestis strains of ancient phylogenetic branch 0. ANT are widely spread in the high-mountain plague foci of Kyrgyzstan. PLoS ONE 12, e0187230 (2017).
Article PubMed PubMed Central Google Scholar
Keller, M. et al. Ancient Yersinia pestis genomes from across Western Europe reveal early diversification during the First Pandemic (541-750 CE). Proc. Natl Acad. Sci. USA 116, 12363–12372 (2019).
Feldman, M. et al. A high-coverage Yersinia pestis genome from a sixth-century Justinianic plague victim. Mol. Biol. Evol. 33, 2911–2923 (2016).
Article CAS PubMed PubMed Central Google Scholar
de Barros Damgaard, P. et al. 137 ancient human genomes from across the Eurasian steppes. Nature 557, 369 (2018).
Article ADS Google Scholar
Rasmussen, S. et al. Early divergent strains of Yersinia pestis in Eurasia 5,000 years ago. Cell 163, 571–582 (2015).
Article CAS PubMed PubMed Central Google Scholar
Spyrou, M. A. et al. Analysis of 3800-year-old Yersinia pestis genomes suggests Bronze Age origin for bubonic plague. Nat. Commun. 9, 2234 (2018).
Article ADS PubMed PubMed Central Google Scholar
Bos, K. I. et al. Pre-Columbian mycobacterial genomes reveal seals as a source of New World human tuberculosis. Nature 514, 494–497 (2014).
Article CAS ADS PubMed PubMed Central Google Scholar
Darling, A. E., Miklós, I. & Ragan, M. A. Dynamics of genome rearrangement in bacterial populations. PLoS Genet. 4, e1000128 (2008).
Article PubMed PubMed Central Google Scholar
Green, M. H. Putting Africa on the Black Death map: Narratives from genetics and history. Afriques (2018). Available at: http://journals.openedition.org/afriques/2125 (Accessed: 3rd September 2019).
Zhou, D. & Yang, R. Molecular Darwinian evolution of virulence in Yersinia pestis. Infect. Immun. 77, 2242–2250 (2009).
Article CAS PubMed PubMed Central Google Scholar
Zhou, D. et al. Genetics of metabolic variations between Yersinia pestis biovars and the proposal of a new biovar, microtus. J. Bacteriol. 186, 5147–5152 (2004).
Article CAS PubMed PubMed Central Google Scholar
Simonet, M., Riot, B., Fortineau, N. & Berche, P. Invasin production by Yersinia pestis is abolished by insertion of an IS200-like element within the inv gene. Infect. Immun. 64, 375–379 (1996).
CAS PubMed PubMed Central Google Scholar
Groisman, E. A. et al. Bacterial Mg2+ homeostasis, transport, and virulence. Annu. Rev. Genet. 47, 625–646 (2013).
Article CAS PubMed PubMed Central Google Scholar
Ford, D. C., Joshua, G. W., Wren, B. W. & Oyston, P. C. The importance of the magnesium transporter MgtB for virulence of Yersinia pseudotuberculosis and Yersinia pestis. Microbiology 160, 2710–2717 (2014).
Article CAS PubMed Google Scholar
Fetherston, J. D. & Perry, R. D. The pigmentation locus of Yersinia pestis KIM6+ is flanked by an insertion sequence and includes the structural genes for pesticin sensitivity and HMWP2. Mol. Microbiol. 13, 697–708 (1994).
Article CAS PubMed Google Scholar
Signoli, M., Bello, S. & Dutour, O. [Epidemic recrudescence of the Great Plague in Marseille (May-July 1722): excavation of a mass grave]. Med. Trop. (Mars) 58, 7–13 (1998).
CAS Google Scholar
Wagner, D. M. et al. Yersinia pestis and the Plague of Justinian 541–543 AD: a genomic analysis. Lancet Infect. Dis. 14, 319–326 (2014).
Article PubMed Google Scholar
Benedictow, O. J. The Black Death and Later Plague Epidemics in the Scandinavian Countries: Perspectives and Controversies (Walter de Gruyter GmbH & Co KG, Warsaw/Berlin, 2016).
Spyrou, M. A., Bos, K. I., Herbig, A. & Krause, J. Ancient pathogen genomics as an emerging tool for infectious disease research. Nat. Rev. Genet. 20, 323–340 (2019).
Article CAS PubMed PubMed Central Google Scholar
Hymes, R. Epilogue: a hypothesis on the East Asian beginnings of the Yersinia pestis polytomy. in Pandemic Disease in the Medieval World: Rethinking the Black Death Vol. 1 (ed. Green, M. H.) 285–308 (Arc Medieval Press, Kalamazoo and Bradford, 2014).
Haensch, S. et al. Distinct clones of Yersinia pestis caused the black death. PLoS Pathog. 6, e1001134 (2010).
Article PubMed PubMed Central Google Scholar
Chain, P. S. et al. Complete genome sequence of Yersinia pestis strains Antiqua and Nepal516: evidence of gene reduction in an emerging pathogen. J. Bacteriol. 188, 4453–4463 (2006).
Article PubMed PubMed Central Google Scholar
Carmichael, A. G. Plague persistence in Western Europe: a hypothesis. in Pandemic Disease in the Medieval World: Rethinking the Black Death Vol. 1 (ed. Green, M. H.) 157–191 (Arc Medieval Press, Kalamazoo and Bradford, 2014).
Signoli, M., Séguy, I., Biraben, J.-N., Dutour, O. & Belle, P. Paleodemography and historical demography in the context of an epidemic. Population 57, 829–854 (2002).
Google Scholar
Cummins, N., Kelly, M. & Ó Gráda, C. Living standards and plague in London, 1560–1665. Econ. Hist. Rev. 69, 3–34 (2016).
Article Google Scholar
Brygoo, E.-R. Epidémiologie de la peste à Madagascar. Arch. Inst. Pasteur Madagascar 35, 9–147 (1966).
Vogler, A. J. et al. Temporal phylogeography of Yersinia pestis in Madagascar: insights into the long-term maintenance of plague. PLOS Negl. Trop. Dis. 11, e0005887 (2017).
Article PubMed PubMed Central Google Scholar
Grabenstein, J. P., Fukuto, H. S., Palmer, L. E. & Bliska, J. B. Characterization of phagosome trafficking and identification of PhoP-regulated genes important for survival of Yersinia pestis in macrophages. Infect. Immun. 74, 3727–3741 (2006).
Article CAS PubMed PubMed Central Google Scholar
Blanc‐Potard, A. B. & Groisman, E. A. The Salmonella selC locus contains a pathogenicity island mediating intramacrophage survival. EMBO J. 16, 5376–5385 (1997).
Article PubMed PubMed Central Google Scholar
Snavely, M., Miller, C. & Maguire, M. The mgtB Mg2+ transport locus of Salmonella typhimurium encodes a P-type ATPase. J. Biol. Chem. 266, 815–823 (1991).
CAS PubMed Google Scholar
Belon, C. et al. A macrophage subversion factor is shared by intracellular and extracellular pathogens. PLoS Pathog. 11, e1004969 (2015).
Article PubMed PubMed Central Google Scholar
Snavely, M., Florer, J., Miller, C. & Maguire, M. Magnesium transport in Salmonella typhimurium: 28Mg2+ transport by the CorA, MgtA, and MgtB systems. J. Bacteriol. 171, 4761–4766 (1989).
Article CAS PubMed PubMed Central Google Scholar
Achtman, M. et al. Yersinia pestis, the cause of plague, is a recently emerged clone of Yersinia pseudotuberculosis. Proc. Natl Acad. Sci. USA 96, 14043–14048 (1999).
Article CAS ADS PubMed PubMed Central Google Scholar
Chain, P. S. et al. Insights into the evolution of Yersinia pestis through whole-genome comparison with Yersinia pseudotuberculosis. Proc. Natl Acad. Sci. USA 101, 13826–13831 (2004).
Article CAS ADS PubMed PubMed Central Google Scholar
Perry, R. D. & Fetherston, J. D. Yersinia pestis-etiologic agent of plague. Clin. Microbiol. Rev. 10, 35–66 (1997).
Article CAS PubMed PubMed Central Google Scholar
Dean, K. R. et al. Human ectoparasites and the spread of plague in Europe during the Second Pandemic. Proc. Natl Acad. Sci. USA 115, 1304–1309 (2018).
Article CAS PubMed PubMed Central Google Scholar
Keeling, M. K. & Gilligan, C. A. Metapopulation dynamics of bubonic plague. Nature 407, 903–906 (2000).
Article CAS ADS PubMed Google Scholar
Xu, L. et al. The trophic responses of two different rodent–vector–plague systems to climate change. Proc. Biol. Sci. 282, 20141846 (2015).
Article PubMed PubMed Central Google Scholar
Xu, L. et al. Wet climate and transportation routes accelerate spread of human plague. Proc. Biol. Sci. 281, 20133159 (2014).
Article PubMed PubMed Central Google Scholar
Whittles, L. K. & Didelot, X. Epidemiological analysis of the Eyam plague outbreak of 1665–1666. Proc. Biol. Sci. 283, 20160618 (2016).
Article PubMed PubMed Central Google Scholar
Roosen, J. & Curtis, D. R. Dangers of noncritical use of historical plague data. Emerg. Infect. Dis. 24, 103 (2018).
Article PubMed Central Google Scholar
Dabney, J. et al. Complete mitochondrial genome sequence of a Middle Pleistocene cave bear reconstructed from ultrashort DNA fragments. Proc. Natl Acad. Sci. USA 110, 15758–15763 (2013).
Article CAS ADS PubMed PubMed Central Google Scholar
Rasmussen, M. et al. The genome of a Late Pleistocene human from a Clovis burial site in western Montana. Nature 506, 225 (2014).
Article CAS ADS PubMed PubMed Central Google Scholar
Rohland, N., Harney, E., Mallick, S., Nordenfelt, S. & Reich, D. Partial uracil-DNA-glycosylase treatment for screening of ancient DNA. Philos. Trans. R Soc. Lond. B Biol. Sci. 370, 20130624 (2015).
Article PubMed PubMed Central Google Scholar
Kircher, M., Sawyer, S. & Meyer, M. Double indexing overcomes inaccuracies in multiplex sequencing on the Illumina platform. Nucleic Acids Res. 40, e3 (2012).
Article CAS PubMed Google Scholar
Peltzer, A. et al. EAGER: efficient ancient genome reconstruction. Genome Biol. 17, 60 (2016).
Article PubMed PubMed Central Google Scholar
Schubert, M., Lindgreen, S. & Orlando, L. AdapterRemoval v2: rapid adapter trimming, identification, and read merging. BMC Res. Notes 9, 88 (2016).
Article PubMed PubMed Central Google Scholar
Li, H. & Durbin, R. Fast and accurate long-read alignment with Burrows–Wheeler transform. Bioinformatics 26, 589–595 (2010).
Article PubMed PubMed Central Google Scholar
DePristo, M. A. et al. A framework for variation discovery and genotyping using next-generation DNA sequencing data. Nat. Genet. 43, 491–498 (2011).
Article CAS PubMed PubMed Central Google Scholar
Cingolani, P. et al. A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118; iso-2; iso-3. Fly ( Austin ) 6, 80–92 (2012).
Article CAS PubMed PubMed Central Google Scholar
Kumar, S., Stecher, G. & Tamura, K. MEGA7: molecular evolutionary genetics analysis version 7.0 for bigger datasets. Mol. Biol. Evol. 33, 1870–1874 (2016).
Article CAS PubMed PubMed Central Google Scholar
Stamatakis, A. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics 30, 1312–1313 (2014).
Article CAS PubMed PubMed Central Google Scholar
Tavaré, S. Some probabilistic and statistical problems in the analysis of DNA sequences. in Lectures on Mathematics in the Life Sciences Vol. 17 (ed. Miura, R. M.) 57–86 (American Mathematical Society, Providence, Rhode Island, 1986).
Li, H. Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM. Preprint at https://arxiv.org/abs/1303.3997 (2013).
Jónsson, H., Ginolhac, A., Schubert, M., Johnson, P. L. & Orlando, L. mapDamage2. 0: fast approximate Bayesian estimates of ancient DNA damage parameters. Bioinformatics 29, 1682–1684 (2013).
Article PubMed PubMed Central Google Scholar
R Core Team. R: A Language and Environment for Statistical Computing. (R Foundation for Statistical Computing, Vienna, Austria, 2015).
Auerbach, R. K. et al. Yersinia pestis evolution on a small timescale: comparison of whole genome sequences from North America. PLoS ONE 2, e770 (2007).
Article ADS PubMed PubMed Central Google Scholar
Eppinger, M. et al. Draft genome sequences of Yersinia pestis isolates from natural foci of endemic plague in China. J. Bacteriol. 191, 7628–7629 (2009).
Article CAS PubMed PubMed Central Google Scholar
Drummond, A. J. & Rambaut, A. BEAST: Bayesian evolutionary analysis by sampling trees. BMC Evol. Biol. 7, 214 (2007).
Article PubMed PubMed Central Google Scholar
Kingman, J. F. C. The coalescent. Stoch. Process. Appl. 13, 235–248 (1982).
Article MathSciNet Google Scholar
Drummond, A. J., Rambaut, A., Shapiro, B. & Pybus, O. G. Bayesian coalescent inference of past population dynamics from molecular sequences. Mol. Biol. Evol. 22, 1185–1192 (2005).
Article CAS PubMed Google Scholar
Quinlan, A. R. & Hall, I. M. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics 26, 841–842 (2010).
Article CAS PubMed PubMed Central Google Scholar
Wickham, H. ggplot2: Elegant Graphics For Data Analysis (Springer International Publishing AG, Switzerland, 2016).
Krzywinski, M. et al. Circos: an information aesthetic for comparative genomics. Genome Res. 19, 1639–1645 (2009).
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

We thank Aditya K. Lankapalli and Stephen Clayton for computational analysis support. We thank Guido Brandt, Antje Wissgott, Cäcilia Freund and Marta Burri for laboratory support. We are grateful to Monica Green for critical comments on the manuscript. We thank Hans Sell and Michelle O’Reilly for graphics support. We thank Rafail’ M. Fattahov for facilitating excavations of the Laishevo III archaeological site, Ayrat Sitdikov for providing access to the Laishevo III skeletal assemblage and Elizaveta V. Volkova for assisting with sampling of skeletal material. In addition, we would like to thank Joke Somers for the anthropological analysis and sampling of the Stans individuals. We thank Bettina Jungklaus for providing the samples from Brandenburg an der Havel, Bernd Trautmann for morphological analyses, Jochen Haberstroh and Mathias Hensch for providing archaeological information, and the staff of the SAPM for support during sample collection. We also thank Benoît Kirschenbilder, for his initial involvement in this project in association with the Toulouse archaeological site (16 rue des Trente Six Ponts). The fieldwork at the New Churchyard was led by Alison Telfer, and radiocarbon dating was carried out by 14CHRONO Centre, The Queen’s University, Belfast, Northern Ireland. Analysis of radiocarbon dates from New Churchyard was performed by Derek Hamilton of the Scottish Universities Environmental Research Centre (SUERC), East Kilbride, Scotland, and Peter Marshall of Historic England. Radiocarbon dating for the Stans collection was performed at the LARA laboratory of the Department of Chemistry and Biochemistry at the University of Bern. Radiocarbon dating for all other material was performed in the Curt-Engelhorn-Zentrum Archäometrie gGmbH in Mannheim, Germany. The Cambridge work is supported by the Wellcome Trust (Award no. 2000368/Z/15/Z) and St. John’s College, Cambridge (J.E.R., T.K., C.C., C.L.S.); the European Union through the European Regional Development Fund (Project No. 2014-2020.4.01.16-0030) (C.L.S.); and the Estonian Research Council personal research grant (PRG243) (C.L.S). M.A.S., M.K., K.I.B. and J.K. were supported by the Max Planck Society and the ERC starting grant APGREID (to J.K.). R.T., A.H. and K.I.B. were supported by the Max Planck Society.

Author information

These authors contributed equally: Maria A. Spyrou, Marcel Keller.

Authors and Affiliations

Max Planck Institute for the Science of Human History, 07745, Jena, Germany
Maria A. Spyrou, Marcel Keller, Rezeda I. Tukhbatova, Elizabeth A. Nelson, Aida Andrades Valtueña, Gunnar U. Neumann, Alexander Herbig, Kirsten I. Bos & Johannes Krause
Institute for Archaeological Sciences, University of Tübingen, 72070, Tübingen, Germany
Maria A. Spyrou, Elizabeth A. Nelson & Johannes Krause
SNSB, State Collection for Anthropology and Palaeoanatomy Munich, 80333, Munich, Germany
Marcel Keller, Kristin von Heyking, Joris Peters & Michaela Harbeck
Laboratory of Structural Biology, Kazan Federal University, Kazan, Russian Federation, 420008
Rezeda I. Tukhbatova
Institute of Genomics, University of Tartu, Riia 23b, 51010, Tartu, Estonia
Christiana L. Scheib & Toomas Kivisild
MOLA (Museum of London Archaeology), London, N1 7ED, UK
Don Walker, Niamh Carty, Robert Hartle, Michael Henderson & Elizabeth L. Knox
Department of Physical Anthropology, Institute for Forensic Medicine, University of Bern, 3007, Bern, Switzerland
Amelie Alterauge & Sandra Lösch
Department of Archaeology, University of Cambridge, Downing St, Cambridge, CB2 3ER, UK
Craig Cessford, Prishita Maheshwari-Aplin & John E. Robb
Archaeological Service, State Archive Nidwalden, 6371, Nidwalden, Switzerland
Hermann Fetz
Archeodunum SAS, Agency Toulouse, 8 allée Michel de Montaigne, 31770, Colomiers, France
Michaël Gourvennec
McDonald Institute for Archaeological Research, University of Cambridge, Downing St, Cambridge, CB2 3ER, UK
Sarah A. Inskip
PACEA, CNRS Institute, Université de Bordeaux, 33615, Pessac, France
Sacha Kacki & Dominique Castex
Department of Archaeology, Durham University, South Rd, Durham, DH1 3LE, UK
Sacha Kacki
Institute for Medical Engineering and Sciences, Massachusetts Institute of Technology, Cambridge, MA, 02139, USA
Felix M. Key
Bavarian State Department of Monuments and Sites, 80539, Munich, Germany
Christian Later
ArchaeoBioCenter and Department of Veterinary Sciences, Institute of Palaeoanatomy, Domestication Research and the History of Veterinary Medicine, Ludwig Maximilian University Munich, Kaulbachstr. 37/III, 80539, Munich, Germany
Joris Peters
Dig it! Company GbR, 86971, Peiting, Germany
Jürgen Schreiber
Department of Human Genetics, Katholieke Universiteit Leuven, 3000, Leuven, Belgium
Toomas Kivisild

Authors

Maria A. Spyrou
View author publications
You can also search for this author in PubMed Google Scholar
Marcel Keller
View author publications
You can also search for this author in PubMed Google Scholar
Rezeda I. Tukhbatova
View author publications
You can also search for this author in PubMed Google Scholar
Christiana L. Scheib
View author publications
You can also search for this author in PubMed Google Scholar
Elizabeth A. Nelson
View author publications
You can also search for this author in PubMed Google Scholar
Aida Andrades Valtueña
View author publications
You can also search for this author in PubMed Google Scholar
Gunnar U. Neumann
View author publications
You can also search for this author in PubMed Google Scholar
Don Walker
View author publications
You can also search for this author in PubMed Google Scholar
Amelie Alterauge
View author publications
You can also search for this author in PubMed Google Scholar
Niamh Carty
View author publications
You can also search for this author in PubMed Google Scholar
Craig Cessford
View author publications
You can also search for this author in PubMed Google Scholar
Hermann Fetz
View author publications
You can also search for this author in PubMed Google Scholar
Michaël Gourvennec
View author publications
You can also search for this author in PubMed Google Scholar
Robert Hartle
View author publications
You can also search for this author in PubMed Google Scholar
Michael Henderson
View author publications
You can also search for this author in PubMed Google Scholar
Kristin von Heyking
View author publications
You can also search for this author in PubMed Google Scholar
Sarah A. Inskip
View author publications
You can also search for this author in PubMed Google Scholar
Sacha Kacki
View author publications
You can also search for this author in PubMed Google Scholar
Felix M. Key
View author publications
You can also search for this author in PubMed Google Scholar
Elizabeth L. Knox
View author publications
You can also search for this author in PubMed Google Scholar
Christian Later
View author publications
You can also search for this author in PubMed Google Scholar
Prishita Maheshwari-Aplin
View author publications
You can also search for this author in PubMed Google Scholar
Joris Peters
View author publications
You can also search for this author in PubMed Google Scholar
John E. Robb
View author publications
You can also search for this author in PubMed Google Scholar
Jürgen Schreiber
View author publications
You can also search for this author in PubMed Google Scholar
Toomas Kivisild
View author publications
You can also search for this author in PubMed Google Scholar
Dominique Castex
View author publications
You can also search for this author in PubMed Google Scholar
Sandra Lösch
View author publications
You can also search for this author in PubMed Google Scholar
Michaela Harbeck
View author publications
You can also search for this author in PubMed Google Scholar
Alexander Herbig
View author publications
You can also search for this author in PubMed Google Scholar
Kirsten I. Bos
View author publications
You can also search for this author in PubMed Google Scholar
Johannes Krause
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

M.A.S., M.K., R.I.T., M.Ha., K.I.B. and J.K. designed the study. M.A.S., M.K., R.T., E.A.N., C.L.S., G.U.N. and P.M.-A. performed laboratory work. M.A.S., M.K., A.A.V., F.M.K. and A.H. performed data analysis. D.W., A.A., N.C., H.F., M.G., R.H., M.He., K.v.H., S.A.I., S.K., E.L.K., J.P., J.E.R., D.C., S.L. and M.Ha. performed anthropological analysis, as well as identified and provided access to appropriate archaeological material. A.A., J.S., K.v.H., C.L. and C.C. facilitated excavations and provided access to unpublished archaeological information. T.K., M.Ha., A.H., K.I.B. and J.K. supervised different aspects of the study. M.S., M.K. and K.I.B. wrote the paper with contribution from all co-authors.

Corresponding authors

Correspondence to Maria A. Spyrou, Kirsten I. Bos or Johannes Krause.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Peer review information Nature Communications thanks Francois Balloux and Ludovic Orlando for their contribution to the peer review of this work. Peer reviewer reports are available.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Peer Review File

Description of Additional Supplementary Files

Supplementary Data 1

Supplementary Data 2

Supplementary Data 3

Supplementary Data 4

Supplementary Data 5

Supplementary Data 6

Supplementary Data 7

Supplementary Data 8

Supplementary Data 9

Reporting Summary

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Spyrou, M.A., Keller, M., Tukhbatova, R.I. et al. Phylogeography of the second plague pandemic revealed through analysis of historical Yersinia pestis genomes. Nat Commun 10, 4470 (2019). https://doi.org/10.1038/s41467-019-12154-0

Download citation

Received: 18 December 2018
Accepted: 15 August 2019
Published: 02 October 2019
DOI: https://doi.org/10.1038/s41467-019-12154-0

This article is cited by

Germs, genes and soil: tales of pathogens past
- Amber Dance
Nature (2023)
Plagued by a cryptic clock: insight and issues from the global phylogeny of Yersinia pestis
- Katherine Eaton
- Leo Featherstone
- Hendrik N. Poinar
Communications Biology (2023)
Genomic diversity of Yersinia pestis from Yunnan Province, China, implies a potential common ancestor as the source of two plague epidemics
- Jingliang Qin
- Yarong Wu
- Yujun Cui
Communications Biology (2023)
Parallel signatures of Mycobacterium tuberculosis and human Y-chromosome phylogeography support the Two Layer model of East Asian population history
- Matthew Silcocks
- Sarah J. Dunstan
Communications Biology (2023)
Death in the Time of Pandemic: A Tuscan Cholera Cemetery at Benabbio (1855)
- Antonio Fornaciari
Historical Archaeology (2023)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.