Difference between revisions of "Data Management"

From Digital Scholarship Group
Jump to navigation Jump to search
Line 67: Line 67:
 
'''Roles & Responsibilities'''
 
'''Roles & Responsibilities'''
  
Explain how the responsibilities regarding the management of your data will be delegated. This should include time allocations, project management of technical aspects, training requirements, and contributions of non-project staff - individuals should be named where possible. Remember that those responsible for long-term decisions about your data will likely be the custodians of the repository/archive you choose to store your data. While the costs associated with your research (and the results of your research) must be specified in the Budget Justification portion of the proposal, you may want to reiterate who will be responsible for funding the management of your data. Consider the following:
+
*Explain how the responsibilities regarding the management of your data will be delegated. This should include time allocations, project management of technical aspects, training requirements, and contributions of non-project staff - individuals should be named where possible. Remember that those responsible for long-term decisions about your data will likely be the custodians of the DRS you choose to store your data.  
  
Outline the staff/organizational roles and responsibilities for implementing this data management plan.
+
 
Who will be responsible for data management and for monitoring the data management plan?
+
Consider the following:
How will adherence to this data management plan be checked or demonstrated?
+
 
What process is in place for transferring responsibility for the data?
+
* Outline the staff/organizational roles and responsibilities for implementing this data management plan.
Who will have responsibility over time for decisions about the data once the original personnel are no longer available?
+
* Who will be responsible for data management and for monitoring the data management plan?
 +
* How will adherence to this data management plan be checked or demonstrated?
 +
* What process is in place for transferring responsibility for the data?
 +
* Who will have responsibility over time for decisions about the data once the original personnel are no longer available?
  
  

Revision as of 12:22, 5 March 2015

What is a Data Management Plan?

A data management plan is a written assessment of how project or research data will be collected, organized, shared, maintained, and preserved.

Why Manage Your Data?

  • Fulfill requirements
  • Improve project efficiency
  • Organize large sets of data
  • Preservation
  • Reuse
  • Promote research

How Do I Create a DMP?

  • Establish data management goals
  • Consult funding agency guidelines (NSF, NEH, IMLS)
  • Review checklists of recommended data management topics
  • Use a data management planning tool, like DMPTool or DMPonline (UK)

Managing Your Data

  • Analyze the data (what kind(s) of data? how much data? who needs your data? how will it be used in the future?)
  • Organize the data (decide on file naming conventions, directory structures, metadata standards, data formats)
  • Decide how the data can be accessed (where will it be stored? what will be shared? how will it be shared? when will it be shared?)
  • Who is responsible for your data?

Working with Projects and Data

Data Interviews

  1. Invite project representatives to answer data management questions using the DSG template in the DMPTool
  2. Ask them to keep track of questions that are difficult to answer
  3. Meet with project reps to discuss difficult questions and provide guidance for difficult data management areas

DMPTool

  • What questions do we want to ask?
  • How do we want to organize the questions?
  • What is the end result?

Possible DM Questions

Data and Project Materials
What kinds of data? (genres, file formats)
How much?
Who is the audience for your data?
How might your data be reused?
What will will be needed to reuse your data?
Organization and Standards
How are your files named?
How is your file directory structured?
Are you using a metadata standard?
What data formats?
How are you documenting your data? (wiki, codebook)
Data access, sharing, and re-use policies
What do you plan to share?
How will users access the shared data?
When will users have access?
Can data be redistributed?
Can other works be derived from your data?
Are there ethical or legal restrictions on access and use?
How will restrictions be handled?
How will you guarantee safe, untampered data?
Where is your data stored?
What is the life span of your stored data?
Roles and responsibilities?
Who is responsible for metadata and documentation?
Who secures the data?
Who ensures data is backed up and not corrupted?



From the ODH DMP Tool

Roles & Responsibilities

  • Explain how the responsibilities regarding the management of your data will be delegated. This should include time allocations, project management of technical aspects, training requirements, and contributions of non-project staff - individuals should be named where possible. Remember that those responsible for long-term decisions about your data will likely be the custodians of the DRS you choose to store your data.


Consider the following:

  • Outline the staff/organizational roles and responsibilities for implementing this data management plan.
  • Who will be responsible for data management and for monitoring the data management plan?
  • How will adherence to this data management plan be checked or demonstrated?
  • What process is in place for transferring responsibility for the data?
  • Who will have responsibility over time for decisions about the data once the original personnel are no longer available?


Expected Data

Give a short description of the data, including amount (estimated amount or known amount) and content. If the project will be collecting data of a sensitive nature, note here and reflect upon it in subsequent sections. Data types could include XML spreadsheets, interview transcripts, text files, historical documents, diaries, field notes, geospatial data, citations, software code, algorithms, etc. Identify your methods for collecting data. NEH is also concerned with how data is managed and maintained during the project’s lifetime until it is "shared". Describe in this section your short-term (project's lifetime) data management plan. Consider these questions:

What data will be generated in the research? What data types will you be creating or capturing? How will you capture or create the data? If you will be using existing data, state that fact and include where you got it. What is the relationship between the data you are collecting and the existing data? What data will be preserved and shared? Where (physically) and on what media will you store the data during the project’s lifetime? How will you back-up the data during the project's lifetime and how regularly will back-ups be made?


Period of Data Retention

Explain the policies that may restrict the distribution of your data, and describe how you will make sure that access to data is made available in a timely manner. Consider these questions:

How long will the original data collector/creator/principal investigator retain the right to use the data before opening it up to wider use? Explain details of any embargo periods for political, commercial, patent or publisher reasons.


Data Formats and Dissemination

Describe the format of your data. Ideally, data formats will be chosen that are openly and freely available, and/or non-proprietary in nature. Consider the following:

Which file formats will you use for your data, and why? What transformations (to more shareable formats) will be necessary to prepare data for preservation and data sharing? What form will the metadata take? How will you create or capture these details? Which metadata standards will you use and why have you chosen them? (e.g. accepted domain-local standards, widespread usage). What contextual details (metadata) are needed to make the data you capture or collect meaningful? Describe how you will ensure dissemination of your data. Consider the following: How and when will you make the data available? (Include the resources needed to make the data available: equipment, systems, expertise, etc.) What other types of information should be shared regarding the data, e.g. the way it was generated, analytical and procedural, information? What is the process for gaining access to the data? Will any permission restrictions need to be placed on the data? How will you manage data with sensitive information? Are there ethical and privacy issues? If so, how will these be resolved? What have you done to comply with your obligations in your IRB Protocol?


Data Storage and Preservation of Access

Describe your long-term strategy for storing, archiving and preserving the data you will generate or use. Consider the following:

What is the long-term strategy for maintaining, curating and archiving the data? Which archive/repository/database have you identified as a place to deposit data? What procedures does your intended long-term data storage facility have in place for preservation and backup? How long will/should data be kept beyond the life of the project? What data will be preserved for the long-term? On what basis will data be selected for long-term preservation? What metadata/documentation will be submitted alongside the data or created on deposit/transformation in order to make the data reusable? What related information will be deposited?

Resources