Skip to Main Content

Research Data Management: Publish Data

The Research Data Management Portal is designed to provide guidance, best practices, and resources on the steps within the research data lifecycle and its correlation to the requirements of established data management practices.

Steps to Publishing Your Data

Publishing Your Data

Overview

For information on NTL's repository ROSA P and how to utilize it visit the ROSA P LibGuide.

All submissions are required to be 508 Compliant. For more information on meeting this requirement and NTL's Accessibility policy, refer to the corresponding Accessibility LibGuide.

Review the Public Access Plan Requirements

  1. All research and its derivative products must be publicly accessible with no restrictions to access. This includes but is not limited to reports, datasets, geodatabases, code, statistical projects and visualizations, charts, images, websites, documentation, and any other product produced through the U.S. DOT. 
  2. In cases where the data contains personally identifiable information (PII), the dataset must be anonymized before publication.
  3. Final manuscripts accepted for publication should be submitted to the National Transportation Library (NTL), which has a MAP-21 (49 USC 6304) mandate to be the central repository for USDOT research and technical reports and a clearinghouse for Government transportation data. 
  4. When choosing a repository for archiving, all repositories must support the creation and maintenance of persistent identifiers (e.g., DOIs, handles, etc.) and must provide for maintenance of those identifiers throughout the preservation lifecycle of the data.
  5. Repositories also must support the creation and maintenance of DCAT-US metadata.

Common Issues with Fulfilling the Public Access Plan Requirements

Common Problem Solution
My data set has Personally Identifiable Information (PII). How do I publish my data? While the raw data that has been created cannot be published due to PII risk, anonymized data can be published in its place. ROSA P only accepts anonymized datasets for publication. Learn about how to protect the privacy of human research participants when sharing data with resources provided by the National Institute of Health.
I purchased my dataset/reused an existing free dataset. What do I submit to ROSA P? While you cannot submit a dataset you did not create, you must submit any analysis, calculations, and derivative products from this dataset. For example, if you use a open access dataset for your project, you must include analysis and charts made, visualizations made in statistical software, any code used to analyze or manipulate the data, and any new products or findings. Additionally, the name of the dataset, the variables you used, changes you made to the dataset, and any other information must be included in the data package submission in the README. 
We would like to control access by having a submission form so that the public and researchers can request to see our data. Is this allowed under the Public Access Plan? No. No restriction or pay-wall of any kind is allowed for U.S. DOT funded data. To be compliant with the public access plan, all data must be open and unrestricted to the public. This means that all research products must be accessible on a reputable repository. To view a list of DOT Approved repositories, please navigate to "Step 2: Choose a Repository."
My data is currently stored at my university on an Amazon Web Services (AWS) server. This is accessible to researchers at my university. Does this fulfil Public Access Requirements? No. While private or university servers are acceptable storage during the project's duration, the closed servers are not acceptable for long term preservation and archiving. 
I'm not sure what metadata schema to use for my dataset. Is metadata required for submission? Yes, DCAT-US metadata, formerly Project Open Metadata, is required for all DOT funded datasets. You must create a DCAT-US metadata file and include it with your submission. For more information on DCAT-US metadata and to access NTL's Template, please visit our Data Packages Libguide.
My data was created and stored in Microsoft Excel's XLSX format. Is this acceptable for submission?

No. According to the Public Access Plan Section 7.6.1: Publications, "The DOT will ensure the permanent preservation and long term accessibility of Publications resulting from DOT-funded research and programs by: Adopting sound, non-proprietary preservation standards and archival formats for Publications and associated content." While proprietary formats may be included as a part of your larger data package, open access formats must be included as well. For XLSX files, could be a .CSV file. 

 

Choose a Repository

When choosing a repository, it is important to consider the following elements:

  1. Does it support searchability?
  2. Does it distinguish between authors and depositors? 
  3. Does it use persistent identifiers such as DOIs, Handles, RORs, and ORCIDs?
  4. Does it have multiple forms of metadata storage and export, including the required DCAT-US metadata file for all DOT funded research?

These important considerations ensure your research will be preserved, traceable, and findable to your peers, your supervisors, your broader field, and the public.

U.S. DOT Approved Repositories

Repository Link Who Can Publish Here? Does the repository cost money to publish? Does the repository provide DOIs? CoreTrustSeal Certified?
Carnegie Mellon University KiltHub https://kilthub.cmu.edu/ Any Carnegie Mellon University Faculty, Staff, or Students Yes, free to CMU affiliates Yes
Dryad http://www.datadryad.org/ Anyone, even if not in a member institution Yes, if not affiliated with a Dryad Institution Yes
Figshare https://figshare.com/ Anyone, even if not a part of an institutional repository No unless more than 500 files or larger than 20GB Yes
Harvard Dataverse https://dataverse.harvard.edu/ Anyone No Yes
University of Michigan Inter-university Consortium for Political and Social Research https://www.icpsr.umich.edu/web/pages/ Anyone regardless of Member Institution Status No Yes
Iowa State University DataShare https://iastate.figshare.com/ Iowa State University Scholars and Researchers No Yes
Missouri University of Science and Technology Scholars' Mine https://scholarsmine.mst.edu/ Missouri University Science and Technology Faculty, Staff and Students Yes Yes
Odum Institute Data Archive https://dataverse.unc.edu/dataverse/odum University of North Carolina Faculty, Staff, and Students No Yes
National Transportation Library's ROSA P** https://rosap.ntl.bts.gov/ Those who work for or are funded by USDOT No Yes
San Jose State University ScholarWorks https://scholarworks.sjsu.edu/ SJSU Faculty, Staff, and Students No Yes
Syracuse University Qualitative Data Repository (QDR) https://qdr.syr.edu/ Anyone, but institutional members are free Yes, if not an institutional member Yes
Texas Digital Library Texas Data Repository https://dataverse.tdl.org/ Participating Texas Universities Faculty and Staff No Yes
University of Nebraska- Lincoln Libraries' Data Repository https://sandy.unl.edu/ University of Nebraska affiliates No Yes
Zenodo** https://zenodo.org/ Anyone without a designated repository No Yes

Last Updated January 17, 2025

**Preferred by the Data Services Team

 

CoreTrustSeal offers to any interested data repository a core level certification based on the DSA–WDS Core Trustworthy Data Repositories Requirements catalogue and procedures. This universal catalogue of requirements reflects the core characteristics of trustworthy data repositories. To find out more about CoreTrustSeal and their requirements, please visit their website

At the CoreTrustSeal site a user can search the list of repositories and view the list of repositories that have achieved the CoreTrustSeal Data Repository certification.

Prepare Your Submission Package (SIP)

While the hardest stages of a research project may be over, the creation of a submission package for a repository is one of the most important steps in publishing research. This step is crucial as it determines how your colleagues and the public will interact with and understand your research. Creating robust documentation, organizing your data into understandable folders and hierarchies, using clear naming techniques, and preserving your dataset and research products in open formats are all important steps to making your research more FAIR.

Using Open File Formats

For data preservation it is always best to preserve in non-proprietary, open formats. These preferred formats are as follows:
Text Dataset Images Multimedia Maps Metadata Collections
TXT CSV TIFF WARC TIFF JSON ZIP
PDF TSV PNG WMV PNG XML  
XML TAV JPG SWF Shapefiles    
WARC     WMA ArcGIS Project Files    
RTF     MPEG      
MD            

Creating Documentation for your Submission Package

When creating a submission package to send to your chosen repository, it is important to create documentation. This documentation makes your research easier to understand and reuse. Documentation can take many forms depending on the type of project and data you created. The following are types of documentation you might create for your project:

  • Dataset*
  • README file*
  • DCAT-US metadata file*
  • Data Management Plan*
  • Data Dictionary
  • Codebook

*Required for all submissions

When creating this documentation, it is important to make your dataset as accessible to the user as possible. Include authorship information, software requirements, a well-organized file structure, clear file naming, and descriptions of each file in the package. For more information on creating a robust data package, please visit our Data Packages LibGuide.

Components of a Data Package (Dataset, README, Data Dictionary, DCAT-US Metadata, Data Management Plan, Code Book)

Submit Your Dataset

Submitting to 

Reports and textual materials

Datasets

Submitting to a DOT Approved Repository

There are instances where the researcher or university uses a repository for their long-term data storage that is not ROSA P. In these instances, the depositor will submit the research to a DOT Approved Repository, or one that is conformant with the USDOT's Public Access Plan. Each of these repositories have different instructions for how to submit. However, by creating good file structure, clear names, robust documentation, required metadata, and by using open formats, most data submission packages should need minimal fixes for each repository. 


Despite using an external repository, U.S. DOT and our partners need to be informed of the publication of any and all resources that were funded by U.S. DOT and its programs. When submitting an item that is housed outside of ROSA P to NTL, please email the following:

In your email you must include the following:

  1. Final Report URL
  2. Final Report DOI/Handle
  3. Dataset URL
  4. Dataset DOI/Handle
  5. Funding Agreement Number/Contract Number (If Applicable)
  6. Research Hub Display ID
  7. Names and ORCIDs for all project investigators, contributors, and publication author(s)
  8. Who is submitting this publication/dataset
  9. Any other applicable information

After Submission

What Happens Next

Status of Content in ROSA P

Next Steps for Researchers

  • Ensure that your ORCID profile is updated with your research and publications
  • Add this research experience and your publications to other academic and professional profiles and CVs
  • Share your publication with your network
  • Start the research process again!