In order to find traits or evolutionary relics of the primordial genome (the most primitive nucleic acid genome for earth's life) remained in modern genomes, we have studied the characteristics of dinucleotide frequencies across genomes. As the longer a sequence is, the more probable it would be modified during genome evolution. For that reason, short nucleotide sequences, especially dinucleotides, would have considerable chances to be intact during billions of years of evolution. Consequently, conservation of the genomic profiles of the frequencies of dinucleotides across modern genomes may exist and would be an evolutionary relic of the primordial genome.
Based on this assumption, we analyzed the frequency profiles of dinucleotides of the whole-genome sequences from 130 prokaryotic species (including archaea and bacteria). The statistical results show that the frequencies of the dinucleotides AC, AG, CA, CT, GA, GT, TC, and TG are well conserved across genomes, while the frequencies of other dinucleotides vary considerably among species. This conservation/variation seems to be linked to the distributions of dinucleotides throughout a genome and across genomes, and also to have relation to strand symmetry.
We argue and conclude that the phenomenon of frequency conservation would be evolutionary relics of the primordial genome, which may provide insights into the study of the origin and evolution of genomes.