Skip to content

The Compression Hypothesis

The central claim of this dissertation: meme fitness correlates with compressibility.

Let MM be a meme with information content I(M)I(M) bits. Define:

  • Compression ratio: ρ(M)=I(M)/L(M)\rho(M) = I(M) / L(M) where L(M)L(M) is the compressed representation length
  • Reconstruction fidelity: F(M,M^)F(M, \hat{M}) measuring similarity between original and reconstructed meme
  • Transmission probability: PT(M)P_T(M) — probability of successful human-to-human transmission
  • Fitness: W(M)=PT(M)F(M,M^)R(M)W(M) = P_T(M) \cdot F(M, \hat{M}) \cdot R(M) where RR is replication motivation

Claim: For memes in competition:

W(M)ρ(M)1W(M) \propto \rho(M)^{-1}

when controlling for content utility.

A meme that can be compressed to fewer bits while maintaining reconstruction fidelity has advantages:

  1. Lower cognitive load — Easier to remember
  2. Faster transmission — Takes less time to communicate
  3. Higher fidelity — Less information lost in compression/decompression
  4. More redundancy budget — Can afford error-correction bits
ProverbFull ExplanationCompression
”A stitch in time saves nine""Addressing problems early prevents them from becoming larger problems that require more effort to fix later”~5x
”Don’t count chickens before they hatch""Don’t make plans based on outcomes that haven’t yet occurred, as circumstances may change”~4x

The proverbs spread; the explanations don’t.

Many proverbs appear independently across cultures:

  • “Strike while the iron is hot” (English)
  • “Beat the iron while it’s hot” (German)
  • “Forge the iron while it’s hot” (French)

Same compressed representation, same meme. The compression is the message.

The hypothesis predicts:

  1. Short beats long — Controlling for content, shorter versions spread further
  2. Concrete beats abstract — Imagery compresses better than abstraction
  3. Rhythm aids memory — Phonetic patterns add error-correction
  4. Lossy beats lossless — Some detail loss is acceptable for transmission gain
  • What’s the optimal compression for different channel capacities?
  • How do we measure “reconstruction fidelity” for ideas?
  • Are there ideas that cannot be compressed below a transmission threshold?

See: Notes on Claims for working research