#s-lg-box-27659446.s-lib-box .s-lib-box-title { background-color: #ff0099; }
Skip to Main ContentUnless otherwise noted, content in this libguide is licenced under CC BY-NC 4.0
Welcome! This guide will help you make decisions about how to manage, store, and share your research data.
Funding agencies have policies regarding how data are archived and/or made available. But, what do they mean by “data”?
The United States Office of Management and Budget defines data in the following way:
"Research data is defined as the recorded factual material commonly accepted in the scientific community as necessary to validate research findings, but not any of the following: preliminary analyses, drafts of scientific papers, plans for future research, peer reviews, or communications with colleagues. This "recorded" material excludes physical objects (e.g., laboratory samples). Research data also do not include:
In a practical context...
Consider what set of files and information you would need to provide someone if they wanted to validate your published research findings. We should not and cannot save everything; prioritize the subset of your data that meets the definition above. As an example, raw data files may be critical in the initial processing phase of your research project, but they might become useless after the data have been converted to a more workable format (say, binary instrument output converted to ASCII). When facing the reality that you can’t save and manage every digital bit that you’ve ever collected, consider what would be useful to you or others if you had to reproduce your results.
Reproduced from the Research Data Services Guide from Oregon State University, created by Aaron Albertson, Beth Hillemann, & Ron Joslin.which is licensed under a Creative Commons Attribution NonCommercial 4.0 International License.
Data includes far more than just numbers. Any recorded material that validates research findings can be considered data. Some examples of research data:
Documents and spreadsheets Laboratory notebooks, field notebooks, diaries |
Contents of an application (input, output, logfiles for analysis software, simulation software, schemas) |
Questionnaires | Slides, specimens, or samples |
Transcripts | Models, algorithms, scripts, code, or software |
Codebooks | Spectra, spectroscope data Test responses |
Audio recordings | Methodologies and workflows |
Video recordings | Standard operating procedures and protocols |
Photographs, x-rays or negatives | Synthetic compounds |
Protein or genetic sequences | Organisms, cell lines, viruses, or cell products |