Reporting Studies Conducted Using Open Access Data (ROAD) Guideline Statement

Saif Aldeen Alryalat

Lna Malkawi

Randa Farah

Introduction

In the era of information technology, the availability of open-access data has radically transformed the landscape of scientific research. This transformation is fueled by an increasing number of publicly available datasets across various disciplines, democratizing access to vast pools of information and accelerating scientific discovery.1 With this ready access to information, researchers can conduct studies without having to collect new data, leading to cost savings, increased efficiency, and the ability to investigate questions that would otherwise be impracticable.2

While the benefits of open-access data are profound, these developments have also raised new challenges concerning reporting studies conducted using these resources. The process of interpreting and drawing conclusions from open datasets requires an intricate understanding of the data’s context, collection methods, and limitations.3 Moreover, ensuring these studies’ transparency, reproducibility, and integrity necessitates comprehensive reporting standards that are suitably tailored to this unique research paradigm. However, the current reporting standards for studies based on open-access data exhibit limitations. There is a lack of uniformity in these standards across different disciplines, leading to inconsistencies in the quality and reliability of published studies. Moreover, crucial information such as the precise description of the dataset used, data processing steps, and methodological considerations often remain undisclosed, hampers these studies’ reproducibility and verifiability.4

Recognizing these gaps, we propose the Reporting of studies conducted using Open Access Data (ROAD) guidelines. The ROAD guideline aims to provide a comprehensive and standardized approach for reporting based on open-access data, thereby enhancing these studies’ clarity, transparency, and interpretability. This paper focuses on the formulation of these guidelines, their potential benefits, and the path toward their adoption.

Development of ROAD guideline

Our approach in developing the ROAD guideline was to build on a solid foundation of pre-existing guidelines for observational studies, tailoring and extending them to better fit the unique context of open data use. This involved the incorporation of specific recommendations and requirements that address the challenges and opportunities inherent in open-access data. In doing so, we expect the ROAD guidelines to significantly improve the clarity and quality of study reporting, ensuring that references to open-access data are precise and unambiguous. The methodology adopted for developing the Open Data research checklist encompassed a series of structured steps:

Preliminary Review and Requirement Gathering: Initially, the author’s team undertook a rigorous review of several research studies that used datasets from the Specimen and Data Repository Information Coordinating Center (BioLINCC) repository.5 This database was chosen for its richness of open data across diverse areas of bioinformatics.6

These studies were selected using a random selection tool, random.org, to eliminate selection bias and ensure a broad representation of approaches in open data use. Based on this review, a preliminary list of critical reporting requirements was drafted, setting the groundwork for best practice guidelines in reporting research using open data.

Consensus Meeting: After the review and requirement gathering phase, the author’s team met to discuss the initial draft checklist. The objective of this meeting was to foster a shared understanding, consolidate different perspectives, and reach a consensus on the components of the initial reporting checklist.

External Peer Review: We sent the consensus checklist, accompanied by a selection of Open Data publications that drew on BioLINCC repository data, to several active researchers with substantial experience and expertise in Open Data research. The aim was to review these materials, focusing on criticism, revision, and suggesting improvements to the checklist.

Final Author Meeting and Checklist Formulation: In the final phase of the process, the authors met again to discuss the feedback and suggestions received from the expert committee. Drawing on this input, the final version of the Open Data research checklist was formulated. This meeting synthesized all the information, finalized the guidelines, and established a concrete path forward for their adoption and dissemination.

The following are detailed ROAD guidelines. Moreover, the supplementary table provides a brief checklist for the ROAD guideline.

ROAD Guideline

  1. Title and Abstract

  2. Introduction

  3. Methods

  4. Results

  5. Discussion

  6. Other Information

Discussion

The ROAD guidelines serve as a comprehensive framework for researchers to accurately report their research findings using open-access data. The objective is to foster transparency, reproducibility, and credibility in research using such data while respecting the rights and stipulations of the original data providers.

The ROAD guidelines incorporate essential modifications to conventional research reporting protocols, tailoring them to effectively capture the unique challenges and opportunities of studies using open-access data. The guidelines offer researchers a structured approach to credibly present their research design, methodology, and findings while explicitly acknowledging the original data source.

In the ROAD checklist, we emphasize the significance of accurately acknowledging the source of the original open-access data, tailoring the study design to the unique context of open-access data, and outlining potential limitations that may arise from the original data. The checklist also guides researchers to maintain transparency in data handling, variable selection, bias mitigation, and statistical methods.

We strongly encourage researchers to adhere to these guidelines when using open access data. They should cite the original dataset, state their reasons for any differences in recruitment, exposure, follow-up, and data collection, and detail the criteria behind selecting their study sample.

In addition, authors should refrain from using the original data name or acronym in the study’s title unless requested by the original data providers. They should also discuss any limitations of their study, considering both the limitations of their research design and the original data’s potential shortcomings.

The ROAD guidelines recognize the importance of open-access data in promoting scientific discovery and standardized reporting to strengthen the research community’s trust in studies that make use of these resources. Adherence to these guidelines will ensure that the richness of open-access data is maximized, empowering researchers to produce robust, high-quality, and reliable research.

As open data continues to gain prominence in research, these guidelines offer a roadmap for researchers to navigate this landscape, promoting rigorous, transparent, and ethical reporting of studies using open-access data.

Limitation

While the ROAD guidelines provide a comprehensive framework for reporting research using open-access data, they have some limitations. The diversity and variability of open-access data sources, each with unique policies, quality standards, and potential biases, can pose challenges that the guidelines may not comprehensively address. The guidelines provide a general direction, but specific issues related to unique data sources may not be fully covered.

Furthermore, the dynamic nature of open access data, with constant updates and the release of new datasets, means that the guidelines, although based on current best practices, may need timely revisions to address emerging trends and issues in the open data landscape.

Ethical and legal considerations form another limitation; at the same time, the guidelines recommend appropriate citation and acknowledgment of original data providers, but they may not cover all ethical and legal considerations, such as those concerning privacy and data protection. Hence, researchers must ensure compliance with all relevant ethical and legal requirements when using open-access data. Finally, it must be emphasized that while the ROAD guidelines aim to enhance transparency and reproducibility in research using open-access data, they are not intended to replace rigorous study design, robust data analysis, and critical interpretation of results. Adherence to these fundamental principles is crucial in conducting high-quality research.

Conflict of interest

None.

179952
Supplementary table

Checklist for The Reporting of Studies Conducted using Open Access Data (ROAD) guidelines compared to standard guidelines for observational studies.