Over at 538, Nate Silver has an excellent discussion of the perils of “overfitting” statistical forecasting models. It’s good enough that I could see assigning it to my students in methods courses. Incidentally, I would argue that the opposite peril (“underfitting” if you will) is more common in standard, hypothesis-testing political science research. Because the goal is to establish a particular hypothesis, authors face a temptation to exclude relevant control variables and maximize degrees of freedom.
Silver on Model Overfitting
March 24, 2011 by Jason Sorens
Posted in methodology | Tagged regression analysis, statistical methodology | 5 Comments
5 Responses
Leave a Reply Cancel reply
About Pileus
Pileus is a group of scholars who examine public policy and philosophy in light of our respective disciplines. We differ in many ways but share a commitment to liberty and personal responsibility.
This blog is hosted by
The Fund for American Studies, an educational nonprofit (www.TFAS.org). TFAS is not responsible for the content of the blog and neither endorses nor condemns any of the content posted.What is a Pileus?
In ancient Rome, a pileus was a felt cap given to slaves upon receiving their freedom. Since then, this simple cap has been a powerful symbol of liberty.
Meet The Authors
Our Published Work
Archives
- June 2013
- May 2013
- April 2013
- March 2013
- February 2013
- January 2013
- December 2012
- November 2012
- October 2012
- September 2012
- August 2012
- July 2012
- June 2012
- May 2012
- April 2012
- March 2012
- February 2012
- January 2012
- December 2011
- November 2011
- October 2011
- September 2011
- August 2011
- July 2011
- June 2011
- May 2011
- April 2011
- March 2011
- February 2011
- January 2011
- December 2010
- November 2010
- October 2010
- September 2010
- August 2010
- July 2010
- June 2010
- May 2010
- April 2010
Tags
Adam Smith Ayn Rand capitalism Cheonan civil liberties decentralization economics elections Federalism financial reform freedom Free State Project government spending Growth of Government Hayek health care higher education Ideology immigration Libertarianism libertarians liberty migration Milton Friedman Moral Philosophy new hampshire North Korea Obamacare paternalism Paul Krugman President Obama Property Rights public debt Public Policy rand paul republicans Robert Nozick ron paul secession states Supreme Court Taxes Tea Party unions United KingdomBlogroll
- Beacon, The
- Brainstorm
- Cato at Liberty
- Coordination Problem
- Darwinian Conservatism
- Econlog
- FiveThirtyEight
- Foreign Policy
- Free State Blogs
- Front Porch Republic
- Greg Mankiw
- Guido Fawkes
- Hit & Run
- Imaginative Conservative
- Lawyers, Guns, and Money
- Let a Thousand Nations Bloom
- Liberty & Power
- Marginal Revolution
- Megan McArdle
- Offsetting Behaviour
- Phil Arena
- Pileus
- Political Class Dismissed
- Slate
- The Adam Smith Institute Blog
- The Corner
- The Monkey Cage
- The New Republic
- The Skeptics
- The Sociological Imagination
- The Volokh Conspiracy
- Tory Anarchist
- Unqualified Reservations
- Will Wilkinson
Post Categories
- 2010 Elections (15)
- 2012 election (64)
- Book Recommendations (13)
- British politics (7)
- Budget Deficit (67)
- bureaucracy (1)
- Canadian politics (2)
- capitalism (15)
- Chart of the Day (1)
- Civil Liberties (30)
- civil war (3)
- collective goods (1)
- Comparative culture (5)
- Congress (14)
- corporate welfare (9)
- courts (2)
- democratization (2)
- development (3)
- drugs (5)
- Economic geography (1)
- Economic recovery (26)
- Economics (71)
- Education (21)
- elections (2)
- electoral behavior (4)
- electoral systems (4)
- employment (5)
- Energy Policy (1)
- Enlightened Self-Interest (8)
- Entitlement Reform (12)
- Environment (2)
- Ethics (27)
- European Union (1)
- Eurozone (2)
- federalism (24)
- finance (9)
- fiscal policies (25)
- foreign policy (29)
- freedom (15)
- Greece (1)
- growth (11)
- Guest Post (9)
- Guns (2)
- Hayek (1)
- health care (22)
- Hero of the Week (1)
- History (18)
- History of Political Philosophy (1)
- ideologies (6)
- immigration (2)
- Immigration policy (6)
- inequality (10)
- institutions (22)
- international political economy (1)
- international relations (2)
- interstate conflict (1)
- IPE (3)
- judiciary (2)
- justice (3)
- labor (1)
- Law (31)
- left-libertarianism (4)
- libertarianism (13)
- libertarianism (6)
- Locke (2)
- Markets and Morality (5)
- Marriage (5)
- marriage law (4)
- methodology (4)
- National defense (7)
- Nozick (4)
- paternalism (1)
- political philosophy (22)
- Political Science (47)
- politics (56)
- psychology (8)
- Public Opinion (10)
- race and ethnicity (2)
- Rants and Raves (21)
- Rawls (4)
- redistribution (5)
- Regulation (38)
- religion (3)
- rent-seeking (8)
- representation (6)
- resource curse (2)
- secession (15)
- secession (11)
- social justice (2)
- Social psychology (2)
- socialism (5)
- Sociology and Anthropology (4)
- Spain (1)
- state politics (32)
- state politics (13)
- taxes (4)
- trade (4)
- Turkey (1)
- U.S. presidential elections (22)
- Uncategorized (1429)
- welfare policy (6)




That was a good post.
I have to respectfully disagree about the question of whether the modal political scientist is using too few control variables. I do think we ought to worry more than we do about omitted variable bias, but the standard practice of including everything that other authors have found to also correlate with your DV doesn’t guard against that, and it’s not very well appreciated that adding control variables not only reduces degrees of freedom, but can also increase bias on your estimate of the coefficient for the variables your most interested in.
If you’re interested, I’ve blogged about this here:
http://fparena.blogspot.com/2010/11/what-your-control-variables-dont-do-for.html
and here:
http://fparena.blogspot.com/2011/03/what-your-control-variables-dont-do-for.html
Interesting & important stuff. I’ll have to go through the Clarke piece. Certainly I’m as guilty as anyone for not thinking very much about how measurement error in control variables can introduce new bias. And the “collider variable” problem looks really disturbing! Economists are slightly ahead of political scientists in treating omitted variable bias with instrumental variables, but IV has its own pitfalls.
Yeah, it’s unfortunate the Clarke piece hasn’t gotten more attention.
I agree, economists tend to handle the issue better, but are a bit blind to the shortcomings of IVs. A lot of them tend to think instruments are magic.
Of course, it’s easy for me to criticize standard practice given that I hardly ever touch real data anymore…
This one wasn’t about control variables per se, but if you found the first two links of any interest, you might also want to check this one out:
http://fparena.blogspot.com/2010/11/were-all-modelers.html
The central problem here was Arthur Golderger referred to as “micronumerosity,” which was his mocking term for just not having enough data. The best name for a model with 16 observations is “creative description.”
Goldberger was mocking those who think that multicollinearity is a “problem.” As I like to say, multicollinearity is a problem only in the sense that it makes researchers very upset. It is also a problem because it makes them do very naughty things like dropping relevant variables, thus turning a non-problem (collinearity) into a serious problem (omitted variables).
A fairly good-sized chunk of political scientists would do better research if they just dropped the idea of confirming theories through statistical analysis and concentrated on telling better stories by combining quantitative stories with qualitative ones, since they just don’t have enough data.