bioRxiv | 2021

Insertions in SARS-CoV-2 genome caused by template switch and duplications give rise to new variants of potential concern

 
 
 

Abstract


The appearance of multiple new SARS-CoV-2 variants during the winter of 2020-2021 is a matter of grave concern. Some of these new variants, such as B.1.351 and B.1.1.17, manifest higher infectivity and virulence than the earlier SARS-CoV-2 variants, with potential dramatic effects on the course of the COVID-19 pandemic. So far, analysis of new SARS-CoV-2 variants focused primarily on point nucleotide substitutions and short deletions that are readily identifiable by comparison to consensus genome sequences. In contrast, insertions have largely escaped the attention of researchers although the furin site insert in the spike protein is thought to be a determinant of SARS-CoV-2 virulence and other inserts might have contributed to coronavirus pathogenicity as well. Here, we investigate insertions in SARS-CoV-2 genomes and identify 141 unique inserts of different lengths. We present evidence that these inserts reflect actual virus variance rather than sequencing errors. Two principal mechanisms appear to account for the inserts in the SARS-CoV-2 genomes, polymerase slippage and template switch that might be associated with the synthesis of subgenomic RNAs. We show that inserts in the Spike glycoprotein can affect its antigenic properties and thus have to be monitored. At least, two inserts in the N-terminal domain of the Spike (ins246DSWG and ins15ATLRI) that were first detected in January 2021 are predicted to lead to escape from neutralizing antibodies whereas other inserts might result in escape from T-cell immunity.

Volume None
Pages None
DOI 10.21203/RS.3.RS-456574/V1
Language English
Journal bioRxiv

Full Text