Typically, monitoring a community for pathogens has relied on individuals choosing (or having the opportunity) to test for particular pathogens. But for many pathogens, including SARS-CoV-2, infected individuals shed pathogen genetic material into wastewater, which can be sampled with relatively low effort. These samples can be analyzed not only for the presence or absence of pathogens, but also the abundance of that pathogen. They can also be used with genome sequencing technology to quantify the abundance of particular mutations. By comparing across sampling periods, this can indicate which mutations are on the rise and which are on the decline, and ultimately, forecast how the pathogen is evolving and whether that evolution should be of concern.
The plot below shows the results of analyses we run each week. These analyses use all SARS-CoV-2 sequences sampled from wastewater in NCBI’s Sequence Read Archive–see our Acknowledgements page for the contributors of these samples and statistics for on the data itself. The vertical and horizontal axes represent the abundance of mutations aggregated across a three-week period, with the more recent period on the vertical axis and the previous period on the horizontal axis.
Astute viewers may notice that many mutations relative to the standard SARS-CoV-2 reference are omitted. This is because virtually every SARS-CoV-2 lineage that’s currently circulating descends from the lineage JN.1. To remove noise from the plot, we omit JN.1-defining mutations so viewers can focus on mutations that are not yet fixed and may thus be informative for forecasting. This caveat aside, this visualization follows the convention of calling all mutations relative to the ancestral Wuhan-I strain of SARS-CoV-2. Note also that we filter mutations so that they are only displayed if they appear at or above a 2% frequency in either of the periods compared.
Astute viewers may notice that many mutations relative to the standard SARS-CoV-2 reference are omitted. This is because virtually every SARS-CoV-2 lineage that’s currently circulating descends from the lineage JN.1. To remove noise from the plot, we omit JN.1-defining mutations so viewers can focus on mutations that are not yet fixed and may thus be informative for forecasting. This caveat aside, this visualization follows the convention of calling all mutations relative to the ancestral Wuhan-I strain of SARS-CoV-2. Note also that we filter mutations so that they are only displayed if they appear at or above a 2% frequency in either of the periods compared.
Astute viewers may notice that many mutations relative to the standard SARS-CoV-2 reference are omitted. This is because virtually every SARS-CoV-2 lineage that’s currently circulating descends from the lineage JN.1. To remove noise from the plot, we omit JN.1-defining mutations so viewers can focus on mutations that are not yet fixed and may thus be informative for forecasting. This caveat aside, this visualization follows the convention of calling all mutations relative to the ancestral Wuhan-I strain of SARS-CoV-2. Note also that we filter mutations so that they are only displayed if they appear at or above a 2% frequency in either of the periods compared.
Astute viewers may notice that many mutations relative to the standard SARS-CoV-2 reference are omitted. This is because virtually every SARS-CoV-2 lineage that’s currently circulating descends from the lineage JN.1. To remove noise from the plot, we omit JN.1-defining mutations so viewers can focus on mutations that are not yet fixed and may thus be informative for forecasting. This caveat aside, this visualization follows the convention of calling all mutations relative to the ancestral Wuhan-I strain of SARS-CoV-2. Note also that we filter mutations so that they are only displayed if they appear at or above a 2% frequency in either of the periods compared.
Astute viewers may notice that many mutations relative to the standard SARS-CoV-2 reference are omitted. This is because virtually every SARS-CoV-2 lineage that’s currently circulating descends from the lineage JN.1. To remove noise from the plot, we omit JN.1-defining mutations so viewers can focus on mutations that are not yet fixed and may thus be informative for forecasting. This caveat aside, this visualization follows the convention of calling all mutations relative to the ancestral Wuhan-I strain of SARS-CoV-2. Note also that we filter mutations so that they are only displayed if they appear at or above a 2% frequency in either of the periods compared.
Astute viewers may notice that many mutations relative to the standard SARS-CoV-2 reference are omitted. This is because virtually every SARS-CoV-2 lineage that’s currently circulating descends from the lineage JN.1. To remove noise from the plot, we omit JN.1-defining mutations so viewers can focus on mutations that are not yet fixed and may thus be informative for forecasting. This caveat aside, this visualization follows the convention of calling all mutations relative to the ancestral Wuhan-I strain of SARS-CoV-2. Note also that we filter mutations so that they are only displayed if they appear at or above a 2% frequency in either of the periods compared.
Astute viewers may notice that many mutations relative to the standard SARS-CoV-2 reference are omitted. This is because virtually every SARS-CoV-2 lineage that’s currently circulating descends from the lineage JN.1. To remove noise from the plot, we omit JN.1-defining mutations so viewers can focus on mutations that are not yet fixed and may thus be informative for forecasting. This caveat aside, this visualization follows the convention of calling all mutations relative to the ancestral Wuhan-I strain of SARS-CoV-2. Note also that we filter mutations so that they are only displayed if they appear at or above a 2% frequency in either of the periods compared.
Astute viewers may notice that many mutations relative to the standard SARS-CoV-2 reference are omitted. This is because virtually every SARS-CoV-2 lineage that’s currently circulating descends from the lineage JN.1. To remove noise from the plot, we omit JN.1-defining mutations so viewers can focus on mutations that are not yet fixed and may thus be informative for forecasting. This caveat aside, this visualization follows the convention of calling all mutations relative to the ancestral Wuhan-I strain of SARS-CoV-2. Note also that we filter mutations so that they are only displayed if they appear at or above a 2% frequency in either of the periods compared.
Astute viewers may notice that many mutations relative to the standard SARS-CoV-2 reference are omitted. This is because virtually every SARS-CoV-2 lineage that’s currently circulating descends from the lineage JN.1. To remove noise from the plot, we omit JN.1-defining mutations so viewers can focus on mutations that are not yet fixed and may thus be informative for forecasting. This caveat aside, this visualization follows the convention of calling all mutations relative to the ancestral Wuhan-I strain of SARS-CoV-2. Note also that we filter mutations so that they are only displayed if they appear at or above a 2% frequency in either of the periods compared.
Astute viewers may notice that many mutations relative to the standard SARS-CoV-2 reference are omitted. This is because virtually every SARS-CoV-2 lineage that’s currently circulating descends from the lineage JN.1. To remove noise from the plot, we omit JN.1-defining mutations so viewers can focus on mutations that are not yet fixed and may thus be informative for forecasting. This caveat aside, this visualization follows the convention of calling all mutations relative to the ancestral Wuhan-I strain of SARS-CoV-2. Note also that we filter mutations so that they are only displayed if they appear at or above a 2% frequency in either of the periods compared.
Astute viewers may notice that many mutations relative to the standard SARS-CoV-2 reference are omitted. This is because virtually every SARS-CoV-2 lineage that’s currently circulating descends from the lineage JN.1. To remove noise from the plot, we omit JN.1-defining mutations so viewers can focus on mutations that are not yet fixed and may thus be informative for forecasting. This caveat aside, this visualization follows the convention of calling all mutations relative to the ancestral Wuhan-I strain of SARS-CoV-2. Note also that we filter mutations so that they are only displayed if they appear at or above a 2% frequency in either of the periods compared.
Astute viewers may notice that many mutations relative to the standard SARS-CoV-2 reference are omitted. This is because virtually every SARS-CoV-2 lineage that’s currently circulating descends from the lineage JN.1. To remove noise from the plot, we omit JN.1-defining mutations so viewers can focus on mutations that are not yet fixed and may thus be informative for forecasting. This caveat aside, this visualization follows the convention of calling all mutations relative to the ancestral Wuhan-I strain of SARS-CoV-2. Note also that we filter mutations so that they are only displayed if they appear at or above a 2% frequency in either of the periods compared.
In general, the first points we look at in these plots are points that are closest to the upper lefthand corner of the plot, as these points represent amino acids that have very rapidly increased in frequency between the time-periods compared in our analyses. These mutations may be helpful for forecasting particular SARS-CoV-2 lineages that are likely to increase in prevalence. Given the function of the mutated amino acid, it may also indicate how the pathogen is mutating to evade pre-existing immunity or gain other functionality that is advantageous in the battle between pathogen versus host.
Next, we look at points that are closest to the lower righthand corner of the plot, as these mutations are likely to have become less fit, evolutionarily speaking, at a rapid rate.
From there, we click on mutations toward the upper righthand corner to see their trajectories; these mutations have very gradually increased in frequency. The closer they are to the corner, the closer they are to being present in nearly every SARS-CoV-2 wastewater sample.
This searchable table displays all plotted mutations. It shows the nucleotide positions where changes occur, along with their corresponding gene region and standardized mutation naming conventions.
Position | NT Change | ORFs | AA Change | Associated Variants | Time Span Start | Time Span End | Count | Abundance | Total Coverage | Pos. Samples | 1k+ read samples | Samples Processed | Process end date |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Loading ITables v2.2.4 from the init_notebook_mode cell...
(need help?) |