Zipf's Law
Origin: George Kingsley Zipf
Biological Parallel
Zipf's Law states that in many datasets, the nth most frequent item appears with frequency proportional to 1/n. The most common word appears twice as often as the second most common, three times as often as the third, etc. Biology exhibits this pattern across every scale. Species abundance follows Zipf: a few species dominate, most are rare. In rainforests, the most abundant tree species is ~twice as abundant as the second, following the 1/n pattern. Gene expression obeys Zipf's Law: a small number of genes account for most mRNA molecules, while thousands of genes express at low levels. Neural firing patterns follow Zipf: a few neurons fire frequently, most fire rarely. This distribution emerges from optimization under constraints. Biological systems must perform many different functions, but resources are limited. Zipf distributions arise when you optimize for diversity subject to resource constraints—do the common things very efficiently, maintain capability for rare things at low cost. Metabolic pathway usage follows Zipf: glycolysis runs constantly; pentose phosphate pathway runs occasionally; exotic pathways run rarely. The distribution isn't arbitrary—it reflects usage frequency. Systems that must handle diverse inputs but can't maintain all capabilities at high readiness naturally produce Zipf distributions. The pattern is a fingerprint of constrained optimization: maintain some capabilities always-on, others on-demand, most barely-functional but present. Biology can't afford uniform readiness, so it follows Zipf.