Trim model complexity
Prefer the simplest model of a situation that still fits all the evidence.
Why it works
Adding explanatory variables can always improve a model’s fit to past data, but each extra variable increases overfitting — the model starts explaining noise, not signal, and its predictions deteriorate. The parsimony principle in statistics (formalized as AIC or BIC) penalizes complexity to find the model that generalizes, not just memorizes.
How to do it
- When building any model or theory, ask: can I remove any variable without losing explanatory power?
- Test the simplified model against new data rather than the data it was built on.
- Use fit + simplicity as a combined criterion, not fit alone.
- Update the model only when a new variable demonstrably improves out-of-sample prediction.
Evidence
Overfitting is a well-documented failure mode in statistics and machine learning. Information-theoretic criteria (AIC, BIC) that penalize model complexity are standard tools for selecting models that generalize rather than memorize. (mechanistic)
Statistical parsimony criteria assume a specific loss function and data-generating model; they are guides, not oracles. Domain knowledge still matters.
Sources
- Akaike (1974), AIC criterion; Schwarz (1978), BIC criterion
Common mistake
Adding complexity until the explanation covers every observed quirk, then mistaking the coverage for understanding.
Practice this with IX Coach
IX Coach helps you identify when your theory about a recurring situation has more moving parts than the evidence warrants.
7 days free, then $40/month (~$1.30/day).