This pipeline performs mutation analysis of SARS-CoV-2 and reports and quantifies the occurrence of lineages and single nucleotide (single-NT) mutations.
The visualizations below provide an overview of the evolution of VOCs found in the analyzed samples across given time points and locations. The abundance values for the variants are derived by deconvolution. For details please consult the documentation)
This pipeline is part of PiGx, a collection of highly reproducible genomics pipelines developed by the Bioinformatics & Omics Data Science platform at the Berlin Institute of Medical System Biology (BIMSB).
These plots provide an overview of the relative frequency dynamics of identified lineages at specific wastewater sampling locations over time.
The summary plot shows the results pooled by day and across locations by weighted average using the read number as weights.
Please use the tabs to access the not-pooled plots for each location.
This plot visualizes proportions of identified lineage abundances at the provided sampling locations.
Locations of wastewater processing plants have been generated arbitrarily and do not correspond to actual locations.
Use the slider to select a specific date or hit the Play button to display all snapshots successively. Click on a lineage in the legend to toggle its visibility in the map; double-click to view only the selected lineage.
The following plots provide an overview of detected single nucleotide mutations in different locations and how their relative frequency changes over time. Furthermore, mutations showing a significant frequency increase over time are highlighted.
Mutation notation
Mutations are noted in the pattern of
gene :: protein-sequence mutation : NT-sequence mutation
Please note that this translation was done for single mutations. Combinations of single-NT mutations that taken together may lead to a different amino acid are not yet taken into account.
To show the dynamic of significantly changing mutations over time a linear regression model was applied to the mutation results across all samples. The following table shows the showing the strongest increasing trend (p <= 0.05). The number of trending mutation is restricted to the top 20.
Mutations with significant increase in frequency over time
Download significant_mutations.csv
Download lm_res_all_mutations.csv (unfiltered)
These plots show the relative frequency of detected mutations with strong increasing trends in samples at specific wastewater sampling locations and how the frequencies change over time. Please use the tabs to access the not-pooled plots for each location.
This plot visualizes proportions of identified single mutation dynamics at the provided sampling locations.
Locations of wastewater processing plants have been generated arbitrarily and do not correspond to actual locations.
Use the slider to select a specific date or hit the Play button to display all snapshots successively. Click on a mutation in the legend to toggle its visibility in the map; double-click to view only the selected lineage.
Frequencies per lineage per sample, derived by deconvolution, pooled by weighted mean by read number”) Download variant_frequencies.csv