Higher Education Reform

DARPA Aims to Score Social and Behavioral Research

March 6, 2019 7191

The Pentagon’s innovation incubator has set itself an ambitious task – ranking the reliability of social science research that might apply to national security. The Defense Advanced Research Projects Agency’s Defense Sciences Office is currently asking for “innovative research proposals” to algorithmically assign a confidence score to social and behavioral research.

DARPA has named this program to develop an artificially intelligent quantitative metric Systematizing Confidence in Open Research and Evidence, or SCORE. As DARPA explains in its request for proposals:

These tools will assign explainable confidence scores with a reliability that is equal to, or better than, the best current human expert methods. If successful, SCORE will enable [Department of Defense] personnel to quickly calibrate the level of confidence they should have in the reproducibility and replicability of a given SBS result or claim, and thereby increase the effective use of SBS literature and research to address important human domain challenges, such as enhancing deterrence, enabling stability, and reducing extremism.

Outside observers have identified a wider collateral benefit to the academy from the proposal – a tool to address the so-called replication crisis in social science. An article by Adam Rogers at Wired, for example, is headlined “Darpa Wants to Solve Science’s Reproducibility Crisis With AI.”

DARPA implies that the replication crisis is itself a national security concern: “Taken in the context of growing numbers of journals, articles, and preprints, this current state of affairs could result in an SBS consumer mistakenly over-relying on weak SBS research or dismissing strong SBS research entirely.”

Last month, DARPA signed the Center for Open Science (COS) to a three-year agreement, worth $7.6 million, to create a database of 30,000 claims made in peer-reviewed and published papers. Alongside partners from the University of Pennsylvania and Syracuse University, COS will extract – automatically and manually – evidence about the claims, which will be merged with more traditional quality indicators like citations and whether the research was preregistered.

Three steps will follow once the database exists:

  • Experts will examine 10 percent of the claims, using surveys, panels and even prediction markets, for their likelihood of being replicated.
  • Other experts will create algorithms to examine the database’s contents and determine, artificially, their likelihood of being replicated.
  • Other researchers will attempt to replicate a sample of the database’s claims, allowing both the humans’ and the computers’ efforts to be measured and scored.

Appropriately, COS says its own work need to be reproducible. “We are committed to transparency of process and outcomes so that we are accountable to the research community to do the best job that we can,” said COS program manager Beatrix Arendt, “and so that all of our work can be scrutinized and reproduced for future research that will build on this work.”

“Whatever the outcome,” according to Brian Nosek, COS’ executive director, “we will learn a ton about the state of science and how we can improve.”

Rogers quote Microsoft sociologist Duncan Watts about the audacity of creating a scoring mechanism: “It’s such a DARPA thing to do, where they’re like, ‘We’re DARPA, we can just blaze in there and do this super-hard thing that nobody else has even thought about touching.’” Watts then adds, ““Good for them, man.” (Further demonstrating its chutzpah, DARPA has specifically excluded from SCORE proposals “research that primarily results in evolutionary improvements to the existing state of practice.”)

Ideally the scores and how they were determined would be understandable to a non-specialist. In addition, the scores could change based on new information.

As it tries to grade social and behavioral research, DARPA clearly acknowledges the need to fully embrace social science. “Given the accelerating sociotechnical complexity of today’s world—a world that is increasingly connected but often poorly understood—there are growing calls to more effectively leverage Social and Behavioral Sciences (SBS) to help address critical complex national security challenges in the Human Domain,” DARPA wrote in a 41-page document announcing the program in June 2018.

In addition to citing work that has obvious applications to security, such as reducing extremism, the documents cited other federal projects that have explicitly connected SBS and the Pentagon, such as the National Academies of Science’s Decadal Survey of Social and Behavioral Sciences for Applications to National Security and the Minerva Research Initiative (“Supporting social science for a safer world”).


Related Articles

Scientists Should Keep in Mind It’s Called the ‘Marketplace of Ideas’ for a Reason
Communication
December 29, 2025

Scientists Should Keep in Mind It’s Called the ‘Marketplace of Ideas’ for a Reason

Read Now
What Is a University For, After Gaza?
Higher Education Reform
December 23, 2025

What Is a University For, After Gaza?

Read Now
Survey Finds Social Scientists Feel Unsupported in Seeking Societal Impact
Impact
December 18, 2025

Survey Finds Social Scientists Feel Unsupported in Seeking Societal Impact

Read Now
Mutually Assured Distrust and the Gyrations of Trump’s Science Policy
Higher Education Reform
December 17, 2025

Mutually Assured Distrust and the Gyrations of Trump’s Science Policy

Read Now
Canada’s SSHRC Names 2025 Impact Winners

Canada’s SSHRC Names 2025 Impact Winners

One researcher studies how war affects children, another took a literal worm’s eye view to examine rural development, while two others scrutinized […]

Read Now
Why the United States’ ‘War on Woke’ is a Threat to Educational Futures Everywhere

Why the United States’ ‘War on Woke’ is a Threat to Educational Futures Everywhere

On November 4, 2024, the United States of America plunged into an era of unprecedented educational crisis. The ascendant presidency of Donald […]

Read Now
An AI Authorship Protocol Aims to Sharpen a Sometimes-Fuzzy Line

An AI Authorship Protocol Aims to Sharpen a Sometimes-Fuzzy Line

The latest generation of artificial intelligence models is sharper and smoother, producing polished text with fewer errors and hallucinations. As a philosophy […]

Read Now
0 0 votes
Article Rating
Subscribe
Notify of
guest

This site uses Akismet to reduce spam. Learn how your comment data is processed.

2 Comments
Newest
Oldest Most Voted
Inline Feedbacks
View all comments
Amy

All due respect “WOKE”, as well as other social engineering like DEI combined with other unrecognized programming is a highly concerning matter for mental health and technologies associated with programming may contribute to an increase in neurological disorders and poor social or behavioral disorders. Gender dysphoria for example is a serous matter of concern to the degree of passing laws against the psychological implementation of such thought processes in public AND private sectors.

Amy

please study the effects of AI analytics computation, Quantum computing and data collection COMBINED, including database storage as associated with foreign intelligence gathering and laws that govern cyberspace as implemented by foreign policy to ensure the safety and protections of the United States of America. Thank you much for your consideration.