Rescuing unloved data

Posted on February 17, 2017 by bwestra@uoregon.edu

Message of the day

“Data that is mobile, visible and well-loved stands a better chance of surviving” ~ Kurt Bollacker

Things to consider

Legacy, heritage and at-risk data share one common theme: barrier to access. Data that has been recorded by hand (field notes, lab notebooks, handwritten transcripts, measurements or ledgers) or on outdated technology or using proprietary formats are at risk. Born-digital files can be at risk too, since they can be susceptible to poor management, bit rot, or even direct attempts at reducing access.

Securing legacy data takes time, resources and expertise but is well worth the effort as old data can enable new research and the loss of data could impede future research. So how to approach reviving legacy or at-risk data?

How do you eat an elephant? One bite at a time.

Recover and inventory the data
- Format, type
- Accompanying material–codebooks, notes, marginalia
Organize the data
- Depending on discipline/subject: date, variable, content/subject
Assess the data
- Are there any gaps or missing information
- Triage–consider nature of data along with ease of recovery
Describe the data
- Assign metadata at the collection/file level
Digitize/normalize the data:
- Digitization is not preservation. Choose a file format that will retain its functionality (and accessibility!) over time: “Which file formats should I use?”
Review
- Confirm there are no gaps or indicate where gaps exist
Deposit and disseminate
- Make the data open and available for re-use

Stories

Resources

CODATA Data at Risk Task Group
RDA Data Rescue Interest Group
International data rescue portal
Center for International Earth Science Information Network: Curation of Scientific Data at Risk of Loss: Data Rescue and Dissemination
Curating a 23-year oceanographic time-series
Unlocking GATE: Gaining Access to Analog Data in a Digital World

Activities

There are many opportunities to rescue at-risk or legacy data. Locally, as faculty retire, reach out to departments to assist in curating existing yet inaccessible data. Regionally and nationally, partner with other stakeholders to revitalize at-risk data. Think: Citizen Science.

Get involved with the #datarefuge project

Share a tweet about today’s message with #LYD17 , or use #WhyILYD17 and you’ll be entered in a raffle for a book from Facet.

The 2017 Love Your Data Week is February 13 – 17, 2017. Monday Tuesday Wednesday Thursday Friday
Adopted with permission from the international Love Your Data Week 2017 materials.

Posted in Data cleanup, Data rescue | Tagged Love Your Data Week, LYD17 | Leave a comment

Finding the right data

Posted on February 16, 2017 by bwestra@uoregon.edu

Message of the day

Need to find the right data? Have a clear question and know how to locate quality data sources.

Things to consider

In a 2004 Science Daily News article, the National Science Foundation used the phrase “here there be data” to highlight the exploratory nature of traversing the “untamed” scientific data landscape. The use of that phrase harkens to older maps of the world where unexplored territories or areas on maps bore the warning ‘here, there be [insert mythical/fantastical creatures]’ to alert explorers to the dangers of the unknown. While the research data landscape is (slightly) less foreboding, there’s still an adventurous quality to looking for research data.

Stories

And This is Why We Should Always Provide Our Data [PLOS ONE] http://blogs.plos.org/paleo/2013/01/25/and-this-is-why-we-should-always-provide-our-data/
The patience of the data hunter: https://www.dataone.org/data-stories/patience-data-hunter
Open data, authorship, and the early career scientist: http://ecologybits.com/index.php/2016/06/15/open-data-authorship-and-the-early-career-scientist/

Resources

1. Formulate a question

The data you find is only as good as the question you ask. Think of the age-old “who, what, where, when” criterion when putting together a question – specifying these elements helps to narrow the map of data available and can help direct where to look!

WHO (population)
WHAT (subject, discipline)
WHERE (location, place)
WHEN (longitudinal, snapshot)

This page from Michigan State University Libraries’ “How to find data & statistics” guide does a great job of further articulating these key elements to forming a question and putting together a data search strategy.

2. Locate data source(s)

After you’ve identified the question, then you can begin the scavenger hunt that is locating relevant source(s) of research data. One way to find data is to think about what organization, government, industry, discipline, etc., might gather and/or disseminate data relevant to your question.

Below are some good suggestions. You might also want to check out the UO Libraries guide to locating data.

If you’re looking for general, multidisciplinary data sets – check out sources like ICPSR (Inter-university Consortium for Political and Social Research) or Amazon Public Datasets. Lists of open data repositories, such as Open Access Data Repositories, can help point to more discipline specific data sets.

There are an increasing number of city or state-wide data portals – some examples: New York City, Hawaii, and Illinois – that provide access to regional data on everything from traffic patterns to restaurant inspection results.

At the federal level, several agencies and organizations provide access to nation-wide data sets like Data.gov, Census Bureau, Bureau of Labor Statistics, and Centers for Disease Control & Prevention.

For international data, look to sites like UNdata and World Health Organization, that cover a variety of countries and topics.

Science data tend to be distributed among a vast array of repositories, usually by specific discipline. See this page for some recommended repositories, or go to an Open Access Data Repositories list.

Check out this post from Nathan Yau, data viz whiz and creator of FlowingData — his post includes some of the sources listed above, but also highlights tips like scraping data from websites and using APIs to access data.

3. Cite accordingly

The ability to reuse data is only as good as its quality; the ability to find relevant data is only possible if it’s discoverable. As a producer of data, that means following many of the practices articulated in earlier posts. As a consumer of data, that means being a good citizen and citing your data sources.

In general, citing data follows the same template as any other citation — include pieces like author, title, year of publication, edition/version, persistent identifier (e.g., Digital Object Identifier, Uniform Resource Name). Check with your data source as well – they may provide guidance on how they want to be cited!

See DataONE and ICPSR pages on data citation for examples and more guidance.

Activities

BYODM — build your own (research) data map!Ask yourself:

What data sources are most relevant to my research?
Are there relevant data sets generated or held locally that I have access to?
What information do I need to retrace my steps back to these data (e.g., contact information, URLs, etc.)?

Share a tweet about today’s message with #LYD17, or use #WhyILYD17 and you’ll be entered in a raffle for a book from Facet.

The 2017 Love Your Data Week is February 13 – 17, 2017. Monday Tuesday Wednesday Thursday Friday
Adopted with permission from the international Love Your Data Week 2017 materials.

Image credits:

Unagar, Pravin. (n.d.) “Romantic Location.” The Noun Project. https://thenounproject.com/term/romantic-location/611259/

Sáenz, D. (n.d.) “Map and compass.” The Noun Project. https://thenounproject.com/term/map-and-compass/113305/

Posted in Data centers & repositories, Data quality, Sharing / publishing | Tagged Love Your Data Week, LYD17 | Leave a comment

Good data examples

Posted on February 15, 2017 by bwestra@uoregon.edu

Message of the day

Good data are FAIR – Findable, Accessible, Interoperable, Re-usable

Things to consider

What makes data good?

It has to be readable and documented well enough for others (and a future you) to understand.
Data has to be findable to keep it from being lost. Information scientists have started to call such data FAIR — Findable, Accessible, Interoperable, Re-usable. One of the most important things you can do to keep your data FAIR is to deposit it in a trusted digital repository. Do not use your personal website as your data archive.
Tidy data are good data. Messy data are hard to work with.
Data quality is a process, starting with planning, all the way through to curation of the data for deposit.

Remember! “Documentation is a love letter to your data” (more about documentation)

Stories

Example: This dataset is still around and usable more than 50 years after the data were collection and more than 40 years after it was last used in a publication.

Counterexample: This article: http://www.sciencedirect.com/science/article/pii/S1751157709000881 promises “Statistical scripts and the raw dataset are included as supplemental data and are also available at http://www.researchremix.org.”

Alas: researchremix_sitecontent
(Used by recommendation of a researcher who has long since become enlightened. The data have made it into a trusted repository too.)

There are a number of guides to tidy data, from this blog post about tabular data, to these more detailed instructions about preparing data for archiving and sharing, and Hadley Wickham’s writeup on tidy data.

Resources

Have questions, or want to learn more? The UO data librarians can assist you.

If you want to learn on your own, Project TIER teaches undergraduate students how to structure data for reproducible research: http://www.projecttier.org/tier-protocol/specifications/

UK Data has great instructions for how to document your data: http://www.data-archive.ac.uk/create-manage/document

If you want to go all in, look at the instructions for documenting data in ICPRS’s Guide to Social Science Data Preparation and Archiving

Example: Data can take many forms. This compilation of “Morale and Intelligence Reports” collected by the UK Government during and after the war is a great example of qualitative historical data: https://discover.ukdataservice.ac.uk/catalogue/?sn=7465

Activities

What is your favorite data set? How/why is it good for your project? Try out the FAIR Principles to describe and share examples of good data for your discipline. Tell us on Twitter (#loveyourdata) or in the comments section below!

Share a tweet about today’s message with #LYD17, or use #WhyILYD17 and you’ll be entered in a raffle for a book from Facet.

The 2017 Love Your Data Week is February 13 – 17, 2017. Monday Tuesday Wednesday Thursday Friday
Adopted with permission from the international Love Your Data Week 2017 materials.

.

Posted in Data cleanup, Documentation / metadata, Sharing / publishing | Tagged Love Your Data Week, LYD17 | Leave a comment

Documenting, describing, and defining data

Posted on February 14, 2017 by bwestra@uoregon.edu

Message of the day

Good documentation tells people they can trust your data by enabling validation, replication, and reuse.

Things to consider

Why does having good documentation matter?

It contributes to the quality and usefulness of your research and the data itself – for yourself, colleagues, students, and others.
It makes the analysis and write-up stages of your project easier and less stressful.
It helps your teammates, colleagues, and students understand and build on your work.
It helps to build trust in your research by allowing others to validate your data or methods.
It can help you answer questions about your work during pre-publication peer review and after publication.
It can make it easier for others to replicate or reuse your data. When they cite the data, you get credit! Include these citations in your CV, funding proposal, or promotion and tenure package.
It improves the integrity of the scholarly record by providing a more complete picture of how your research was conducted. This promotes public trust and support of research!
Some communities and fields have been talking about documentation for decades and have well-developed standards for documentation (e.g., geospatial data, clinical data, etc.), while others do not (e.g., psychology, education, engineering, etc.). No matter where your research community or field falls in this spectrum, you can start improving your documentation today!

Courtesy of @ajhmohr

Stories (learn from others’ mistakes and successes)

Error-laden database kills paper (Retraction Watch)
The value of a good inventory system
Metadata? I thought you were in charge of that
The case of the missing research protocol
The importance of documenting how your images and visualizations are created

Resources

Practical Tips by data type & format

Numeric/Spreadsheets
- Check out Christine Bahlai’s guide for using spreadsheets for scientific data
- Check out Kristin Briney’s video and blog post on data dictionaries
- Check out Colectica for Excel to document your spreadsheet
Lab notebooks
- Check out some tips from the experts
Observation
- Define all your codes clearly and operationally
- Document introductory & debriefing comments
- Make sure you’ve defined codes for non-verbal behavior
- Identify annotations separately from quotes or notes
Interview
- Documentation should include
  - your assumptions
  - rationale for choices in designing the interview
  - the interview questions or script (if applicable)
  - Relationship or map between the research questions and the interview questions
  - Codes or notations for non-verbal behavior
  - Syntax or codes to indicate annotations versus interview responses

General Resources

Georgia Tech’s documentation tips
Best Practices for Project Metadata
Readme files are a simple and low-tech way to start documenting your data better. Check out our basic guidance in this blog post, based on the sample readme.txt (filename = readme_template.txt) from IU or Cornell University’s data working group guide with tips for using readme files
Check out Kristin Briney’s post on taking better notes
Reining in your metadata – advice from an archivist
Cornell University data working group also has some tips for writing metadata

Activities

Option 1: Check out some of the documentation guidelines and standards for your own discipline and/or listed below. What can you borrow or learn from them to improve your own documentation?

USGS Data Management guidelines
CDISC has three foundational standards for clinical research data, including CDASH (Clinical Data Acquisitions Standards Harmonisation) & SDTM (Study Data Tabulation Model) & ADaM (Analysis Data Model)
Marine Metadata Interoperability
10 Simple Rules for a Computational Biologist’s Laboratory Notebook

Option 2: Have a colleague, labmate or teammate review your documentation for a specific project. Ask them to tell you what is missing or unclear. If it’s a long list, choose 2-3 things to focus on improving throughout the semester.

Bonus points: Set up a regular schedule for your lab or team to review and sign off on lab notebook pages, protocols, procedures manuals, data dictionaries, or whatever forms of documentation you use. Afterwards, reward yourselves with a beer, glass of wine, or treats!

Share a tweet about today’s message with #LYD17 , or use #WhyILYD17 and you’ll be entered in a raffle for a book from Facet.

The 2017 Love Your Data Week is February 13 – 17, 2017. Monday Tuesday Wednesday Thursday Friday
Adopted with permission from the international Love Your Data Week 2017 materials.

Posted in Data quality, Documentation / metadata | Tagged Love Your Data Week, LYD17 | Leave a comment

Know Your Data Quality

Posted on February 13, 2017 by bwestra@uoregon.edu

Message of the day:

Data quality is the degree to which data meets the purposes and requirements of its use. It may refer to completeness, accuracy, credibility, timeliness, accessibility, consistency, or other factors.

Things to consider

Data quality reflects on you as a researcher, but it can also have an impact beyond your individual project. By one estimate, Bad Data Costs the U.S. $3 Trillion Per Year: “The reason bad data costs so much is that decision makers, managers, knowledge workers, data scientists, and others must accommodate it in their everyday work. And doing so is both time-consuming and expensive.”

Data quality is the responsibility of both data providers and data curators:
- Data providers ensure quality of their datasets, by research design choices, and how they choose to review, document, manage, and share datasets.
- Data curators work with the research community to address consistency, coverage, and metadata.
How does your discipline define and address data quality?
What tools and methods can you use in support of data quality?
How can we distinguish between good and bad data?

“Care and Quality are internal and external aspects of the same thing. A person who sees Quality and feels it as he works is a person who cares. A person who cares about what he sees and does is a person who’s bound to have some characteristic of quality.”
― Robert M. Pirsig, Zen and the Art of Motorcycle Maintenance: An Inquiry Into Values

So how does your research discipline address data quality? Here are some examples that might serve as food for thought:

Responsible conduct in data management guidelines from the Office of Research Integrity discuss integrity and quality assurance.
Social science data preparation and archiving guide from ICPSR on quantitative and qualitative data.
Veracity as a data quality issue in Big Data
Digital humanities data curation practices and their impact on data quality.
Cell culture research and applying best practices to ensure scientific reproducibility.
Data quality assessment (provides a table of various quality dimensions and their definitions) from Communications of the ACM, 45(4), 211.
How Do We Define Clinical Trial Data Quality if No Guidelines Exist?
CDISC (Clinical Data Interchange Standards Consortium) Standards

Some examples of what NOT to do:

Bad data issues guide
Examples of how not to prepare or provide data

Getting started – activities

Show your most recent dataset (or part of it) to your colleague and ask their opinion of its quality (exchanging datasets with a colleague makes this activity more fun).
Use criteria for good data (e.g., completeness, accuracy, fitness for use, documentation) to assess where your data stands.
Discuss your approaches to data collection and measures you took / could take to ensure integrity and completeness of your data.
Discuss steps to address missing or incomplete data in the context of your research. Does it matter? How much missing data affects validity, reliability or trustworthiness of your conclusions?
Check out the Calling Bull**** Course Syllabus (e.g., Food Stamp Fraud or the Musician Mortality Case Study) What can we learn about data quality from these stories?

Share a tweet about today’s message with #LYD17 , or use #WhyILYD17 and you’ll be entered in a raffle for a book from Facet.

The 2017 Love Your Data Week is February 13 – 17, 2017. Monday Tuesday Wednesday Thursday Friday
Adopted with permission from the international Love Your Data Week 2017 materials.

Posted in Data cleanup, Data quality, Documentation / metadata | Tagged Love Your Data Week, LYD17 | Leave a comment

Why I Love Data Raffle

Posted on February 13, 2017 by bwestra@uoregon.edu

Love Your Data 2017 Raffle

#WhyILYD17 Raffle!

This year’s Love Your Data week is Feb. 13 – 17, 2017, and Facet Publishing is donating titles from their research data management series, which will be raffled off during the week.

To enter, please share a tweet about why you (or your institution) are participating in Love Your Data Week 2017 using #WhyILYD17

Follow @facetpublishing

Love Your Data Week – 2017 : Monday | Tuesday | Wednesday | Thursday | Friday

Posted in News | Tagged Love Your Data Week, LYD17 | Leave a comment

Readme File: Explaining Your Raw Data

Posted on February 9, 2017 by ahass

A readme file is an important addition to your raw data collections. Readme files allow others to understand and reuse your data after you have submitted it to repositories by explaining the nuances of your unique data collection.

Here are a couple of examples of data records that include readme files:

Paleosol data from Kenya deposited in UO Scholars’ Bank record
Clinical trial mandatory reporting study data deposited in Dryad

These guidelines may help you organize and complete a readme file for your completed dataset. You’re always welcome to contact the UO data librarians for assistance with creating readme files.

Getting Started
You can start preparing for your readme file when you start collecting your data. While collecting your data, make notes that will help you and others interpret and understand your data later. Although you might think you will remember everything about your data collection process, you might be surprised what you forget six months down the road.
Getting Organized
Embarking on the creation of a readme file may seem daunting, but starting with a well-organized outline can help ease the process. Cornell University’s Research Data Management Service Group offers a great, concise outline to get you started on organizing your document.
Completing a Readme File
Now that you have outlined what you are going to talk about in your readme file, it’s time to add the details! Fill out this form to the best of your abilities concentrating on the information in bold as this information will be the most helpful in the preservation and reuse of your data. Be thorough, yet succinct–only include information that will be helpful in understanding your data collection.
Submitting Your Dataset and Readme File
Once you have finished preparing your dataset and readme files, you can submit them to your chosen repository. Including a readme file with your data ensures that others will be able to understand and reuse your data (with respect to copyrights and permissions) for years to come.

Posted in Documentation / metadata, Readme files, Sharing / publishing | Leave a comment

11 Errors You Could Be Making With Your Tabular Data

Posted on February 6, 2017 by ahass

Many of us use spreadsheets to record, manage, and share data, but the ease of working with spreadsheets can mask some important considerations for your data. Below are some of the more common mistakes people make with their tabular data, and how to avoid them.

This material is based on “Formatting Problems” by Christie Bahlai and Aleksandra Pawlik via datacarpentry.org.

Multiple Tables:
Problem: Researchers commonly use one spreadsheet to organize multiple data tables. While this might be a convenient strategy during data collection, having multiple tables in the same spreadsheet confuses most data management systems. Having more than one table in a spreadsheet may imply false associations between datasets and could cause issues in data preservation in the future.Solution: Do not put more than one table in a single spreadsheet
Multiple Tabs:
Problem: Another logical solution to data organization is using several tabs in the same document to store data. Some researchers like to use separate tabs to distinguish between data characteristics (e.g. date of collection, region of collection, species type, etc.). This may seem appealing, but using this method to organize data puts you at risk for introducing inconsistencies into your collection or failing to see associations between your data.Solution: Instead of adding another tab to your document, consider adding another column to your data table. For example, instead of starting a new tab for every day of data collection, simply create a “Date of Sample” column and keep all your data on the same sheet.
Not Filling in Zeroes:
Problem: When collecting data some researchers like to leave data cells blank when they measure a value of zero. This may seem convenient, especially when you encounter multiple zero values, but many data systems recognize a value of zero as actual data while blank cells are interpreted as the absence of data. Leaving cells blank that should read “zero” can lead to issues with data calculations and preservation.Solution: Do not leave cells that should have a value of zero blank. Just enter the value of “0”.
Using Bad Null Values:
Problem: Using numerical values (9999, -9999, 0) to represent missing or null values can often confuse your data management system.Solution: Consult the image below from White et al, 2013, Nine simple ways to make it easier to (re)use your data. Ideas in Ecology and Evolution. to decide the best way to denote null values in your data.
Using Formatting to Convey Information:
Problem: Some researchers like to use highlighting, bolding, different fonts, or variation is font size to indicate important information about their data. Unfortunately, stylistic formatting is not usually recognized by data programs so any formatting information may get lost over time and reuse.Solution: Instead of adding special formatting features to convey information, create a new field or column and code the data appropriately. For example, if you wanted to indicate whether a sample was collected in March or April you could add a “Month” column and input March or April instead of highlighting the individual data to visually mark the categories.
Using Formatting for Visual Appeal:
Problem: Sure, merging two cells may look good, but merging cells could cause the computer to miss (or falsely identify) associations in the data.Solution: Do not merge cells. Consider re-arranging your data if needed, but do not merge cells.
Including Comments or Units in Cells:
Problem: While organizing their data, researchers may be tempted to include notes or unit measurements in data cells. This may seem practical, but putting extraneous information in your data cells can cause problems for data calculation or analysis software.Solution: Instead of making comments in your data spreadsheets, keep a readme file or some sort of meta data document. To avoid including units in cells, make sure that your headings indicate the unit of measurement for all data in that column. You should not be using multiple units of measurement in the same column.
More Than One Piece of Information in a Cell:
Problem: Sometimes including multiple pieces of information in a single cell makes sense for data organization purposes. For example, including both the city and state in one cell like “Eugene, OR” may seem logical, but including spaces or commas in data cells can cause problems with data analysis software as commas and spaces often indicate special formatting information.Solution: Do not put more than one piece of information in a single cell. Instead create multiple columns to accommodate additional information.
Field Name Problems:
Problem: Choosing heading names that are too long, complicated, have spaces between words, include uncommon abbreviations, or that might not make sense in 6 months may make future data use complicated and frustrating.Solution: Use succinct, clear heading names that explain the field without being too wordy, complex, or nuanced. Example: use a convention such as “Max_temp” over “Maximum Temperature in Degrees Celsius.”
Special Characters in Data:
Problem: Sometimes researchers use Excel as a word processor and include special characters such as symbols for temperature. Excel and other data systems cannot process special characters.Solution: Avoid using special characters when possible.
Inclusion of Metadata in Data Table:
Problem: Including metadata with your data is important, but metadata should not be included in the data spreadsheet. Including non-data information in your data spreadsheet can cause problems with electronic data systems.Solution: Keep a metadata or readme file that corresponds with your data, but keep it separate from your data spreadsheet.

Posted in Data cleanup, Data quality, Documentation / metadata, Sharing / publishing | Tagged Tabular Data | Leave a comment

Data Carpentry Workshop

Posted on March 2, 2016 by bwestra@uoregon.edu

Are you looking for:

better ways to organize spreadsheet data
tools to speed up cleaning up tabular (spreadsheet) data
an alternative to commercial statistics software (R)
how to create data visualizations in R
using relational databases to manage data

If any of these are of interest to you, then a Data Carpentry workshop may be what you are looking for. Edward Davis (Geology) and the UO Libraries are hosting a a remote broadcast of a session taking place at University of California Museum of Paleontology this week.

What: Data Carpentry workshop
When: this week March 3 – 4, from 8 to 4 pm each day
Where: Knight Library (limited seating; registration required). This will be a remote broadcast of a session taking place at University of California Museum of Paleontology. Assistants will be available to provide onsite support at the University of Oregon.

Registration: http://www.eventbrite.com/e/university-of-oregon-remote-data-carpentry-workshop-tickets-22175369126

Data Carpentry workshops are for any researcher who has data they want to analyze, and no prior computational experience is required. This hands-on workshop teaches basic concepts, skills and tools for working more effectively with data. We will cover data organization in spreadsheets, data cleaning, SQL, and R for data analysis and visualization. Participants should bring their laptops and plan to participate actively. By the end of the workshop learners should be able to more effectively manage and analyze data and be able to apply the tools and approaches directly to their ongoing research.

More about Data Carpentry

In many domains of research the rapid generation of large amounts of data is fundamentally changing how research is done. The deluge of data presents great opportunities, but also many challenges in managing, analyzing and sharing data.

Data Carpentry is designed to teach basic concepts, skills and tools for working more effectively with data. The workshop is aimed at researchers at all career stages and is designed for learners with little to no prior knowledge of programming, shell scripting, or command line tools.

More information on the workshop: http://www.datacarpentry.org/2016-03-03-ucmp/

Local contact: Prof. Edward Davis (edavis@uoregon.edu)

Posted in Analysis / statistics, Data cleanup, Data visualization, Workshops & Events | Leave a comment

Think Big-Transforming, Extending, Reusing Data

Posted on February 12, 2016 by jocain

This is Love Your Data week, and each day we’ll be sharing a post about one or more fundamental data management practices that you can use. Part 5 of 5 (parts 1, 2, 3, 4)

GOOD PRACTICE

While best practices for sharing your data are still evolving, there are some things to keep in mind when choosing to share your data:

When archiving your data choose an appropriate venue for your discipline. If you have any questions about choosing an appropriate data archive, contact your librarian.
Share ethically. Make certain that all sensitive information is redacted before submitting your data to an appropriate archive.
When sharing your data, include the metadata. Metadata, in part, documents your data. It tells others about your data: how it was created, who created it, and potentially, any stipulations for use of the data. For more information about metadata, consult UO Libraries page on Metadata & Data Documentation.
Before depositing your data be aware of any associated intellectual property rights. While copyright is not applicable to most research data in the U.S., licensing can apply. Want to learn more? Check out this guide from the University of Minnesota Libraries for a more thorough explanation of intellectual property, licenses, and research data.

Need more information? Make sure to consult the UO Libraries RDM page on Sharing Data.

TODAY’S ACTIVITY

What will future generations do with your data? How will it change the world? Think about ways in which your data can be used by scholars, change-makers, and everyday citizens to make a difference in the world.

TELL US

How do you share you data? How do you make it accessible and intelligible for future users? What are some of your concerns about sharing data? How can we make sharing data easier for data producers? And of course, what would make reusing data easier for all levels of consumers out there?

Twitter: #LYD16 Instagram: #LYD16 Facebook:#LYD16

RESOURCES

For additional information check out the resources board, the changing face of data on Pinterest, and consult the with the UO Libraries Research Data Management page on Sharing Data

Source:materials adapted from LYD website

Posted in News | Leave a comment

UO Research Data Management Blog

Rescuing unloved data

Message of the day

Things to consider

Stories

Resources

Activities

The 2017 Love Your Data Week is February 13 – 17, 2017. Monday Tuesday Wednesday Thursday Friday
Adopted with permission from the international Love Your Data Week 2017 materials.

Documenting, describing, and defining data

Message of the day

Things to consider

Stories (learn from others’ mistakes and successes)