Our genetic code is so well-built that it can actually read two completely different proteins from the exact same stretch of DNA.
April 2, 2026
Original Paper
The fitness landscape of overlapping genes
arXiv · 2604.00602
The Takeaway
Scientists found that the rules of biology are hard-coded to support 'overlapping genes,' where a single sequence stores two different sets of instructions in different reading frames. This suggests our genetic code may have evolved specifically for this extreme form of biological data compression.
From the abstract
Natural genomes sometimes encode two different proteins in staggered reading frames of the same DNA sequence. Despite the prevalence of these 'overlapping genes' across the tree of life, it remains unknown whether arbitrary protein pairs can overlap, to what extent such overlaps are feasible, or what design principles govern them. Here, we study compatibility, frustration, and connectivity in the fitness landscape of overlapping genes. We computationally design sequences de novo that satisfy the