This page is adapted from the Iowa State University Library Data Management Plan (DMP) resource guide.
"A data management plan (DMP) is a written document that describes the data you expect to acquire or generate during the course of a research project, how you will manage, describe, analyze, and store those data, and what mechanisms you will use at the end of your project to share and preserve your data." —Stanford Libraries: Data management plans
The short answer is "no," but you are required to retain, share and make accessible data that validates your research findings. You should also consider preserving/sharing data that:
This guide uses the terms "data" and "research data" interchangeably. The definition of research data used by U.S. DOT is adapted from OMB Circular A-110:
Research data is defined as the recorded factual material commonly accepted in the scientific community as necessary to validate research findings, but not any of the following: preliminary analyses, drafts of scientific papers, plans for future research, peer reviews, or communications with colleagues. This "recorded" material excludes physical objects (e.g., laboratory samples). Research data also do not include:
(A) Trade secrets, commercial information, materials necessary to be held confidential by a researcher until they are published, or similar information which is protected under law; and
(B) Personnel and medical information and similar information the disclosure of which would constitute a clearly unwarranted invasion of personal privacy, such as information that could be used to identify a particular person in a research study.
Metadata, commonly called "data about data," is information that describes data. Good metadata enables others to understand and reuse data that they themselves did not create. A minimum set of metadata should be agreed upon and implemented before starting data collection. Data collection and documentation are easier if you know what you need to collect and how to record it. Agreeing on this in advance also helps maintain data consistency and quality.
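As a rough illustration of agreeing on a minimum set of metadata before collection starts, the sketch below checks a record against a list of required fields. The field names (`title`, `creator`, `date_collected`, `description`, `units`), the sample record, and the `missing_fields` helper are all hypothetical and are not drawn from any particular metadata standard; a real project would use the fields its discipline or repository requires.

```python
import json

# Hypothetical minimum metadata fields, agreed on before data collection.
# These names are illustrative only, not taken from a specific standard.
REQUIRED_FIELDS = ["title", "creator", "date_collected", "description", "units"]


def missing_fields(record, required=REQUIRED_FIELDS):
    """Return the required fields that are absent or empty in a record."""
    return [field for field in required if not record.get(field)]


# Hypothetical record describing one dataset; "units" was left blank.
record = {
    "title": "Example traffic counts",
    "creator": "Example Researcher",
    "date_collected": "2020-01-15",
    "description": "Hourly vehicle counts at one hypothetical intersection.",
    "units": "",
}

# Report gaps, then store the record as JSON so software can read it too.
print(missing_fields(record))
print(json.dumps(record, indent=2))
```

Running a check like this when data are first recorded tends to catch gaps while the person who collected the data can still fill them in.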
There are many different ways to record and share metadata. Some of the most common methods are:
Data repositories are devoted to keeping data accessible, safe and secure. They use special software, metadata, workflows and networks to meet these goals. Data repositories also help guarantee authenticity by providing control mechanisms and change logs. For these reasons, repositories are ideal for research data sharing, distribution and preservation.
Data repositories often have limits and restrictions governing which data they accept. Most have rules covering data formats and size limits, and require that data be documented. Some accept data from any research area, while others will only accept research from specific domains (such as biology or social sciences). The latter are known as disciplinary data repositories. Another type of specialized repository is the institutional data repository, which focuses on collecting the outputs of a specific organization, such as a university or federal agency.
See Submit to a Repository for more information and resources for locating data repositories.
Machine-readable data is data that can be read and processed by a computer. By comparison, human-readable data can only be read (and understood) by a human. It is important to understand that charts, graphs and most tables are not machine-readable, but the data they were generated from probably is.
Examples of human-readable data include books, PDF representations of data (charts, graphs, tables, etc.), and datasets which have not been structured to be read by computers.
Examples of machine-readable data include data that has been encoded with a markup language (HTML, XML, etc.), datasets that have been structured to be read by computers, and data that is encoded for machine processing and is not human-readable.
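To make the distinction concrete, here is a minimal sketch of what "structured to be read by computers" can look like in practice. It assumes a small, made-up dataset stored as comma-separated values (CSV); the column names, the values, and the `total_by_site` helper are hypothetical. Because the rows and columns are explicit, a program can process the data directly, which would not be possible with the same numbers shown only as a chart image in a PDF.

```python
import csv
import io

# Hypothetical dataset, structured as CSV so software can parse it.
CSV_TEXT = """site,year,count
A,2019,120
B,2019,95
A,2020,130
"""


def total_by_site(csv_text):
    """Sum the 'count' column for each value in the 'site' column."""
    totals = {}
    for row in csv.DictReader(io.StringIO(csv_text)):
        totals[row["site"]] = totals.get(row["site"], 0) + int(row["count"])
    return totals


print(total_by_site(CSV_TEXT))  # {'A': 250, 'B': 95}
```

The same few lines would work on thousands of rows, which is one reason machine-readable formats matter for reuse.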
Making data digitally accessible is part of making data machine-readable. There is no clear definition of this term, but it is generally understood that: