bioRxiv | 2021

Polymorphism at four-fold degenerate site, but not at the intergenic regions, explains nucleotide compositional strand asymmetry in bacteria

 
 
 
 
 
 
 
 

Abstract


We investigated single nucleotide polymorphism in intergenic regions (IRs) and four-fold degenerate sites (FFS) in genomes of three γ-Proteobacteria and two Firmicutes to understand the mechanism of nucleotide compositional asymmetry between the leading and the lagging strands. Pattern of the polymorphism spectra were alike regarding transitions but variable regarding transversions in the IRs of these bacteria. Contrasting trends of complementary polymorphisms such as C→T vs G→A as well as A→G vs T→C in the IRs vindicated similar replication-associated strand asymmetry regarding cytosine and adenine deamination, respectively, across these bacteria. Surprisingly, the polymorphism pattern at FFS was different from that of the IRs and its frequency was always more than the IRs in these bacteria. Further, the polymorphism patterns within a bacterium were inconsistent across the five amino acids, which neither the replication nor the transcription-associated mutations could explain. However, the polymorphism at FFS coincided with amino acid specific codon usage bias in the five bacteria. Further, strand asymmetry in nucleotide composition could be explained by the polymorphism at FFS, not at the IRs. Therefore, polymorphisms at FFS might not be treated as nearly neutral unlike that in IRs in these bacteria.

Volume None
Pages None
DOI 10.1101/2021.10.07.463495
Language English
Journal bioRxiv

Full Text