Research Ethics

Academic Researchers Need Support and Incentives to Share Data

April 17, 2018 3329

To achieve real “open science” we need to open up all areas of research, including research data. Making data available for other researchers to find, use, reuse, and reproduce will make research more efficient and effective. Members of the newly formed UK Research and Innovation, an independent organisation that brings together the seven Research Councils, Innovate UK and Research England, the Wellcome Trust, and other UK funders have moved early to encourage and require data sharing. Yet researchers in the UK report lower percentages of data sharing than the global average. Policy must be coupled with greater support and education for researchers, and faster, easier routes to sharing data optimally. Incentives and credit for data sharing are also needed.

The post can originally be seen on the LSE Impact Blog under the title “We need more carrots: give academic researchers the support and incentives to share data” by Grace Baynes.

As a publisher I firmly believe research articles and scholarly books and monographs are important summaries and conclusions of years of work for researchers. However, the real building blocks of discovery are the data they produce.

Data sharing brings many benefits to society. According to the Open Data Institute, the value of public sector open data is between 0.4 percent and 1.5 percent of an economy’s GDP. An independent report found that the European Bioinformatics Institute returns £1 billion in annual efficiency savings to researchers worldwide. Data archiving can double the publication output of research projects, according to a study of 7,000 National Science Foundation and National Institutes of Health-funded research projects in the social sciences. Citation impact of research papers has also been shown to increase by as much as 50 percent when data are made available. It can help reduce duplication of effort and is a foundation for reproducibility research. Despite all these benefits, in 2017 only about half of research data were shared and a much smaller proportion were shared openly or in ways that maximize discoverability and reuse.

Last year, Springer Nature asked over 7,000 researchers about data sharing at the point of publishing a research article. We wanted to understand how much data sharing is actually happening, how and where researchers are sharing, the challenges they face, and where they need help. Our findings, Practical Challenges for Researchers in Data Sharing, are openly accessible in Figshare along with the survey data.

When submitting to a journal, 63 percent of respondents shared data files either as supplementary information, in a repository, or both. A slightly lower proportion share data in a repository (41 percent) than as supplementary information files (42 percent). Yet the willingness is there, with 80 percent of researchers surveyed in the State of Open Data 2017 report willing to share their data and the same proportion either already or amenable to using others’ data.

To their credit, UK and US funders have moved early to encourage and require data sharing through policies, pilots, and infrastructure; yet in our survey, researchers in the UK and US report lower percentages of data sharing than the global average of 63 percent:

Country	Percentage of respondents
Poland	76
Germany	75
Switzerland	69
Greece	69
Italy	68
Spain	66
France	65
Netherlands	64
Norway	64
Sweden	61
Denmark	60
Belgium	59
United Kingdom	58
Portugal	56
Australia	55
United States	55
Canada	50

Percentages of respondents sharing data through a repository, as supplementary information files, or both, in countries with >100 respondents. Source: Practical Challenges for Researchers in Data Sharing.

So while funder mandates continue to be essential, policy must be coupled with greater support and education for researchers, and faster, easier routes to sharing data optimally. The challenges facing researchers include a lack of time and expertise. In our survey, “Organising data in a presentable and useful way” was the most stated reason for not sharing data (46 percent of respondents). Other common challenges were: “Unsure about copyright and licensing” – 37 percent; “Not knowing which repository to use” – 33 percent; and “Lack of time to deposit data” – 26 percent.

From my conversations with scholarly communications officers working in UK research institutions, I think that time may be a more important issue than is reported in surveys such as ours. The issue for researchers may not be purely “lack of time” but “is it worth my time?” Published, citable datasets need to be viewed as research outputs on a par with a research article in terms of career advancement and assessment. We need to measure the usage and citations of datasets, and communicate the impact and benefits of data sharing. In the meantime, data publishing, and better data citation and linking, are part of the solution.

While sharing data as supplementary information is better than not sharing data at all, it is a sub-optimal solution. Data deposited in a repository is more findable and accessible. A number of publishers, including Springer Nature, are now depositing supplementary information into publicly accessible repositories. Through the Research Data Alliance, a group of publishers, funders, and research institutions are collaborating to agree a framework for journal data policies, to reduce complexity for authors and encourage good practice. Initiatives such as DataCite and the Joint Declaration of Data Citation Principles help make research data more citable and discoverable.

Scholarly communications offices and libraries have a key role to play in supporting researchers. In many research institutions, libraries and research data management teams are now offering expert advice, support, and infrastructure. Researchers in these institutions are fortunate to have such support. Governments, funders, institutions, libraries, and service providers like publishers all have a role to play to unlock the huge potential of research data. For example, at Springer Nature we offer a free Research Data Support Helpdesk and recommended repositories list, as well as an optional Research Data Support service to help researchers and institutions deposit their data in repositories and make it easier to find and use.

I don’t underestimate the size of the challenge. We are talking about shifting expected norms, skills, and behaviour so that data sharing and good practice becomes standard research practice. As well as policy, researchers need incentives, expert support, training, and infrastructure to make it seamless and easy to share data, and worth their while. That support needs to come when they need it, in ways that are accessible, easy to use, and that work with their research workflow. Achieving this is too complex, and the potential benefits too great, for a fragmented approach.

Grace Baynes

Grace Baynes is vice president, data & new product development for open research at Springer Nature. She is responsible for promoting open data and good research data practice; data publishing, including the journal Scientific Data; data services; and new product development across open science and open research.

View all posts by Grace Baynes

Published

April 17, 2018

Celebrating the National Survey of Health and Development: 1946-2026

By Robert Dingwall

Read Now

Infrastructure

October 1, 2025

Has Bad Science Become Big Busines

By Owen Brierley

Read Now

Resources

April 15, 2025

DORA to Launch Practical Guide to Responsible Research Assessment

By Social Science Space

Read Now

Research

October 10, 2024

Exploring the ‘Publish or Perish’ Mentality and its Impact on Research Paper Retractions

By Nham Tran

Read Now

Lee Miller: Ethics, photography and ethnography

Robert Dingwall 55589 Ethics, News, Research Ethics

Kate Winslet’s biopic of Lee Miller, the pioneering woman war photographer, raises some interesting questions about the ethics of fieldwork and their […]

Read Now

NSF Seeks Input on Research Ethics

Social Science Space 17689 Ethics, Research Ethics

In a ‘Dear Colleague’ letter released September 9, the NSF issued a ‘request for information,’ or RFI, from those interested in research ethics.

Read Now

Maintaining Anonymity In Double-Blind Peer Review During The Age of Artificial Intelligence

Leonard Bauersfeld, Angel Romero, Manasi Muglikar and Davide Scaramuzza 10776 Research, Research Ethics

The double-blind review process, adopted by many publishers and funding agencies, plays a vital role in maintaining fairness and unbiasedness by concealing the identities of authors and reviewers. However, in the era of artificial intelligence (AI) and big data, a pressing question arises: can an author’s identity be deduced even from an anonymized paper (in cases where the authors do not advertise their submitted article on social media)?

Read Now