Difference between revisions of "Practical de-identification guide"

From Responsible Data Wiki
Jump to: navigation, search
(Connection to previous RDFs)
(Notes)
Line 15: Line 15:
 
unknown...
 
unknown...
  
== Notes ==
+
== Intermediate Work Products ==
''Free text''
+
 
 +
[https://docs.google.com/spreadsheets/d/1rFgN61xBCUE5cidaCicZrx8DXtQYoHdOYXfWr1udk0w/edit#gid=0 Variables to Watch out for When Anonymizing Data] (Google Spreadsheet)
 +
[https://docs.google.com/spreadsheets/d/1rFgN61xBCUE5cidaCicZrx8DXtQYoHdOYXfWr1udk0w/edit#gid=2115878413 Types of Data Releases] (Google Spreadsheet)
 +
[https://docs.google.com/spreadsheets/d/1rFgN61xBCUE5cidaCicZrx8DXtQYoHdOYXfWr1udk0w/edit#gid=1362233057 Common Variables by Releaser and Field] (Google Spreadsheet)
 +
[https://docs.google.com/spreadsheets/d/1rFgN61xBCUE5cidaCicZrx8DXtQYoHdOYXfWr1udk0w/edit#gid=821726784 Day 1 Report-Back]  (Google Spreadsheet)
  
 
== Audience ==
 
== Audience ==

Revision as of 17:14, 1 October 2014

Subtitle: one sentence on what it does, who is it for, and what is its goal

Outputs

There is one output actual for this group and one wishlist output, that doesn't yet exist.

The actual output is the Basic De-identification Solution Matrix, an editable Google Spreadsheet that list common variable types from the fields of health, education, finance, environmental, political, a list of de-identification solutions for these types of data, and some suggestions about what forms of de-identification are most useful for each type of data.

This output intersects really well with the work done by the Responsible Data Risk Mapping group, because in order to make decisions about what data to de-identify you need to assess the possible harm of that data first and after harm analysis you need to mitigate that harm by... de-identifying!

The wishlist output of this group is a piece of software that would automate different de-identification functions, so an individual could select the variables they'd like to identify and how and de-identification would then be carried out automatically.

Connection to previous RDFs

unknown...

Intermediate Work Products

Variables to Watch out for When Anonymizing Data (Google Spreadsheet) Types of Data Releases (Google Spreadsheet) Common Variables by Releaser and Field (Google Spreadsheet) Day 1 Report-Back (Google Spreadsheet)

Audience

Personas, use cases, context

Next steps

Contributors

Food for thought

  • concepts, problems
  • questions to ask frequently
  • preventions: what do you actually do in concrete terms to prevent these things from happening
  • reactions: responsible responses for when things go wrong

Resources (we <3 links!)

Feel free to link any and all background material, additional info, useful resources, etc. The more the merrier!