Publish: How to Manage and Share Data

Data Sharing

The Caltech Library is happy to provide guidance on where and how to share data in compliance with funder and journal mandates.

Choosing a Data Repository

Subject-Specific Repositories

We recommend that researchers share data in subject- or data-type-specific repositories where that is an expected practice in their research field. For example, the NIH Genomic Data Sharing policy requires that certain genetic data from NIH-funded projects be shared in an appropriate data repository.


Caltech Library offers a free data sharing service, CaltechDATA, that accepts any type of data associated with Caltech projects. CaltechDATA offers standard data preservation and DOI (permanent identifier) services. We also offer services, at an additional cost, for preserving large volumes of data (> 500 GB); please contact us to discuss options and costs. For more information, read the CaltechDATA FAQ or contact us.

Other Repository Recommendations

If you have questions about choosing an appropriate subject repository, using CaltechDATA, or finding another generalist data repository, please contact us and we will be happy to assist.


Data Management Resources

The Caltech Library has developed several resources to help researchers manage scientific research data.

Data Management Fundamentals

To learn about a range of data management topics, we encourage you to borrow the book "Data Management for Researchers: Organize, Maintain, and Share your Data for Research Success" (Call Number: Q180.55.E4 B75 2015), which was written by Caltech's Biology & Biological Engineering Librarian.

A shorter summary of 10 data management strategies is available in the open-access article "Foundational Practices of Research Data Management", which was co-written by Caltech's Biology & Biological Engineering Librarian.

Implementing Data Management

Caltech Library published The Research Data Management Workbook to help researchers integrate data management practices into their research workflows. The Workbook contains 15 structured exercises covering the following data management activities:

  • Evaluate a laboratory notebook
  • Write a project-level README.txt
  • Create a data dictionary
  • Set up a file organization system
  • Create a file naming convention
  • Pick storage and backup systems
  • Test your backup
  • Write a living data management plan (DMP)
  • Determine data stewardship
  • Pick a data repository
  • Share data
  • Prepare data for future use
  • Convert data file types
  • Create an Archive folder
  • Separate from the institution
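
As one illustration of the "Create a file naming convention" activity, the following is a minimal Python sketch of how a convention might be written down and checked automatically. The pattern used here (date, project, description, and two-digit version, separated by underscores) is a hypothetical example chosen for illustration and is not taken from the Workbook; the function names `build_name` and `parse_name` are likewise assumptions. Adapt the pattern to whatever convention your group agrees on.

```python
import re
from datetime import date
from typing import Optional

# Hypothetical convention (an assumption for this sketch, not the Workbook's):
#   YYYY-MM-DD_project_description_vNN.ext
# Lowercase letters, digits, and hyphens only; underscores separate the fields.
NAME_PATTERN = re.compile(
    r"^(?P<date>\d{4}-\d{2}-\d{2})_"
    r"(?P<project>[a-z0-9]+)_"
    r"(?P<description>[a-z0-9-]+)_"
    r"v(?P<version>\d{2})"
    r"\.(?P<ext>[a-z0-9]+)$"
)


def build_name(day: date, project: str, description: str,
               version: int, ext: str) -> str:
    """Assemble a file name that follows the convention above."""
    return f"{day.isoformat()}_{project}_{description}_v{version:02d}.{ext}"


def parse_name(filename: str) -> Optional[dict]:
    """Return the name's fields as a dict, or None if it breaks the convention."""
    match = NAME_PATTERN.match(filename)
    if match is None:
        return None
    try:
        # Reject strings that look like dates but are not real calendar dates.
        date.fromisoformat(match["date"])
    except ValueError:
        return None
    return match.groupdict()


if __name__ == "__main__":
    name = build_name(date(2024, 3, 5), "zebrafish", "confocal-stack", 2, "tif")
    print(name)
    print(parse_name(name))
    print(parse_name("final data (2).xlsx"))
```

A check like this could be run over a project folder to flag files that drift from the agreed convention before the data are deposited in a repository.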

We encourage you to check out the Workbook to help you establish good data practices in your research.

You can read The Research Data Management Workbook online or download it as a PDF or EPUB file. The Workbook is available under a Creative Commons Attribution-NonCommercial license, meaning you are free to download, adapt, and use it so long as you provide attribution and do not sell the content.