Practical de-identification guide

From Responsible Data Wiki
Revision as of 17:15, 1 October 2014 by 84.0.213.221 (Talk)

Jump to: navigation, search

Subtitle: one sentence on what it does, who is it for, and what is its goal

Outputs

There is one output actual for this group and one wishlist output, that doesn't yet exist.

The actual output is the Basic De-identification Solution Matrix, an editable Google Spreadsheet that list common variable types from the fields of health, education, finance, environmental, political, a list of de-identification solutions for these types of data, and some suggestions about what forms of de-identification are most useful for each type of data.

This output intersects really well with the work done by the Responsible Data Risk Mapping group, because in order to make decisions about what data to de-identify you need to assess the possible harm of that data first and after harm analysis you need to mitigate that harm by... de-identifying!

The wishlist output of this group is a piece of software that would automate different de-identification functions, so an individual could select the variables they'd like to identify and how and de-identification would then be carried out automatically.

Connection to previous RDFs

unknown...

Intermediate Work Products

Variables to Watch out for When Anonymizing Data (Google Spreadsheet)<br / Types of Data Releases (Google Spreadsheet)<br / Common Variables by Releaser and Field (Google Spreadsheet)<br / Day 1 Report-Back (Google Spreadsheet)<br /

Audience

Personas, use cases, context

Next steps

Contributors

Food for thought

  • concepts, problems
  • questions to ask frequently
  • preventions: what do you actually do in concrete terms to prevent these things from happening
  • reactions: responsible responses for when things go wrong

Resources (we <3 links!)

Feel free to link any and all background material, additional info, useful resources, etc. The more the merrier!