Research

They might not be exactly identical, but shouldn't they at least taste the same?

A New Front in the Replication Wars: Economics

November 5, 2015 1962

They might not be exactly identical, but shouldn’t they at least taste the same?

A sense of crisis is developing in economics after two Federal Reserve economists came to the alarming conclusion that economics research is usually not replicable.

The economists took 67 empirical papers from 13 reputable academic journals. Without assistance from the original researchers they were only able to get the same result in a third of cases.

With the original researchers’ assistance, that percentage increased to about half, suggesting reporting practices and requirements are seriously deficient.

This article by Andreas Ortmann originally appeared at The Conversation, a Social Science Space partner site, under the title “The replication crisis has engulfed economics”

The replication crisis in psychology is well-documented. Science recently published a stunning report by the Open Science Collaboration. Almost 300 researchers were involved in trying to directly replicate the results of 100 papers published in 2008. This followed earlier exercises involving many labs (such as here, here and here.)

The researchers did not succeed in the clear majority of cases. On average they found the mean effect size to be only half of what was reported in the original studies. While the report has been questioned (here and here,) there is growing concern that a cornerstone of the scientific edifice is in serious need of renovation.

What’s the problem?
Researchers are too often granted inappropriate degrees of freedom, and some are just fraudulent. But that said, some of these distressing replication results are because good science is messy. It involves hard work and reasonable people can reasonably disagree on the various calls that have to be made.

A good illustration is this just-published study by Raphael Silberzahn and Eric Uhlmann. The researchers engaged in methodological debates with well-known data sleuth Uri Simohnson.

Simohnson questioned the results of an earlier study from the pair that suggested noble-sounding German names could boost careers. Re-running the analysis with a better analytical approach, Simonsohn did not confirm the effect. Silberzahn and Uhlmann eventually conceded the point in a joint paper with Simonsohn.

In their new study, the researchers provided a data set and asked more than two dozen teams of researchers to contribute. They sought to determine, based on the data set, whether skin color of soccer players from four major leagues (England, France, Germany, and Spain) influenced how often they were given a red card.

Somewhat shockingly, the answers were rather diverse. Of the 29 teams, 20 found a statistically significant correlation with the median, suggesting dark-skinned players were 1.3 times more likely than light-skinned players to be sent off.

But the researchers reported:

“Findings varied enormously, from a slight (and non-significant) tendency for referees to give more red cards to light-skinned players to a strong trend of giving more red cards to dark-skinned players.”

Interestingly, this diversity of results survived even after the researchers debated the methodological approach.

The upshot is that even under the best of circumstances – one data set, what seems like a straightforward question to answer, and an exchange of ideas on the best method – arriving at consensus can be extraordinarily difficult. And it surely becomes even more difficult with multiple data sets and many teams.

Further scrutiny
That, of course, is hardly news to most social scientists, who largely accept that any single study is worth only so much. This is why replication efforts and meta-analyses are as important as the recent focus on publication bias and underpowered studies. There is tantalising evidence that many experimental economics studies are severely under-powered (although the evidence so far has been established only for a very simple class of games).

It will be interesting to see the result of a current collaborative effort by economists to replicate 18 laboratory economics studies from 2011 to 2014.

It is not just the social sciences that are in the grip of replication crises. The extent and consequences of p-hacking, and publication biases (studies that report no effect not being published) in science, are well-documented and have been known for a while.

So, where to from here? With a number of journals (including the Journal of the Economic Science Association, Experimental Economics, Journal of Experimental Social Psychology, Journal of Personality and Social Psychology, Psychological Science, Perspectives on Psychological Science) opening their doors to replication in various guises, we can expect more results to seemingly discredit the social sciences.

Hopefully in the long run it will up the ante on what it takes for a study to be reliable. Replication studies can inflict considerable damage on individuals’ productivity and reputation. There’s a need for minimal reporting standards and acceptable replication etiquette to be clarified, such as whether original authors have to be invited or consulted. Journals should become more serious about their data set collection efforts, when not prevented by confidentiality.

Andreas Ortmann

Andreas Ortmann took up his current position of professor of experimental and behavioural economics in the School of Economics, UNSW Australia Business School in 2009. Prior to his appointment at the Business School, he was the (Boston Consulting Group) professor of economics at CERGE-EI, a joint workplace of Charles University and the Academy of Sciences, Prague, Czech Republic. Prior to that appointment, he taught at Bowdoin and Colby College, in the US state of Maine. He also was, for a year each, a visiting scholar of the Program on Non-Profit Organizations at Yale University, the Max-Planck Institute for Psychological Research in Munich, the Max-Planck Institute for Human Development in Berlin, and the Harvard Business School.

View all posts by Andreas Ortmann

Published

November 5, 2015

We Asked Where America’s Future Scientists Would Want to Live

By Christopher P. Scheitle, Katie Corcoran, and Taylor Remsburg

Read Now

From Regression to Reflection: A Mixed-Methods Journey

Research

April 28, 2025

From Regression to Reflection: A Mixed-Methods Journey

By Hema Thakur

Read Now

Resources

April 15, 2025

DORA to Launch Practical Guide to Responsible Research Assessment

By Social Science Space

Read Now

Nominations Open For 2025 John Maddox Prize for Promoting Evidence-Based Research

Recognition

February 21, 2025

Nominations Open For 2025 John Maddox Prize for Promoting Evidence-Based Research

By Emma Richards

Read Now

Survey Says … Most People Trust Scientists

Mathew Marques, Niels Mede, Viktoria Cologna, and Zoe Leviston 9477 Infrastructure, Insights, Research

Public trust in scientists is vital. It can help us with personal decisions on matters like health and provide evidence-based policymaking to […]

Read Now

Exploring the ‘Publish or Perish’ Mentality and its Impact on Research Paper Retractions

Nham Tran 38465 Research, Research Ethics

When scientists make important discoveries, both big and small, they typically publish their findings in scientific journals for others to read. This […]

Read Now