Successfully reported this slideshow.
Your SlideShare is downloading. ×

DataTags: Sharing Privacy Sensitive Data by Latanya Sweeney

Ad
Ad
Ad
Ad
Ad
Ad
Ad
Ad
Ad
Ad
Ad
Upcoming SlideShare
Sla03tt
Sla03tt
Loading in …3
×

Check these out next

1 of 15 Ad

DataTags: Sharing Privacy Sensitive Data by Latanya Sweeney

Download to read offline

The DataTags framework makes it easy for data producers to deposit, data publishers to store and distribute, and data users to access and use datasets containing confidential information, in a standardized and responsible way. The talk will first introduce the concepts and tools behind DataTags, and then focus on the user-facing component of the system - Tagging Server (available today at datatags.org). We will conclude by describing how future versions of Dataverse will use DataTags to automatically handle sensitive datasets, that can only be shared under some restrictions.

The DataTags framework makes it easy for data producers to deposit, data publishers to store and distribute, and data users to access and use datasets containing confidential information, in a standardized and responsible way. The talk will first introduce the concepts and tools behind DataTags, and then focus on the user-facing component of the system - Tagging Server (available today at datatags.org). We will conclude by describing how future versions of Dataverse will use DataTags to automatically handle sensitive datasets, that can only be shared under some restrictions.

Advertisement
Advertisement

More Related Content

Similar to DataTags: Sharing Privacy Sensitive Data by Latanya Sweeney (20)

More from datascienceiqss (20)

Advertisement

DataTags: Sharing Privacy Sensitive Data by Latanya Sweeney

  1. 1.   share  sensi)ve  data  with  confidence   Latanya  Sweeney   latanya@fas.harvard.edu    latanyasweeney.org  
  2. 2. Gender, Race Ethnicity Micro-ethnicity (sub groups) Median Household income** Attended elite boarding school? Hometown Foreign Country? Hometown State* Primary academic major* Secondary academic major* Dorm Freshman Year* Dorm Neighborhood, 4 zones Network linkages of roommates On Facebook? Facebook: Political view Facebook: Interested In Facebook: number of friends (school) Facebook: total number of friends Facebook: number of Picture Friends Facebook: Favorite Movies* Facebook: Favorite Music* Facebook: Favorite Books* Facebook: linkages of friends Jason’s  Dataset  
  3. 3. Joe’s  Dataset   Description * Date of visit (month, day and year) Transaction# Unique patient identifier * Patient 5-digit ZIP code * Month, day and Year of Birth * Gender Unique Provider ID Provider 5-digit ZIP code * ICD9 diagnosis code 1 * ICD9 diagnosis code 2 * ICD9 diagnosis code 3 * ICD9 diagnosis code 4 * ICD9 diagnosis code 5 * ICD9 diagnosis code 6
  4. 4. Francesca’s Combined Data  
  5. 5. Chevron  Refinery   Liberty/  Atchison  Villages   Interstate   Levin-­‐Richmond  Terminal  Corp  (marine)   General  Chemical  Corp  Rail  yard   Julia’s  Interviews   N   l
  6. 6. All  these  people  need  to  store  data     in  a  manner  that  respects     legal  and  ethical  commitments.     #1     Readme   Files   #2     Uniform   storage  and   handling  
  7. 7. How     does  a   researcher   comply     with  more   than  2000   privacy   laws?  
  8. 8. Legal  Experts  Codify   Jurisprudence     into  the  six  levels.   Set  of  computer  rules   for  tagging  data  on   ingesUon   Data  with  its  dataTag   deposit  into  a  dataTags-­‐ compliant  repository   Wiki  approach   HarmonizaUon   Modeling  
  9. 9. Legal  Experts  Codify   Jurisprudence     into  the  six  levels.   Set  of  computer  rules   for  tagging  data  on   ingesUon   Data  with  its  dataTag   deposit  into  a  dataTags-­‐ compliant  repository   Expert  System:   decision  tree   Expert  System:   rule-­‐based  
  10. 10. Legal  Experts  Codify   Jurisprudence     into  the  six  levels.   Set  of  computer  rules   for  tagging  data  on   ingesUon   Data  with  its  dataTag   deposit  into  a  dataTags-­‐ compliant  repository   Interview  Server   Remote  API  Q&A   Binary,  Remote  Exec  
  11. 11. Legal  Experts  Codify   Jurisprudence     into  the  six  levels.   Set  of  computer  rules   for  tagging  data  on   ingesUon   Data  with  its  dataTag   deposit  into  a  dataTags-­‐ compliant  repository   Dataverse   iRods  
  12. 12. Legal  Experts  Codify   Jurisprudence     into  the  six  levels.   Set  of  computer  rules   for  tagging  data  on   ingesUon   Data  with  its  dataTag   deposit  into  a  dataTags-­‐ compliant  repository   Interview   datatags.org  

×