SlideShare a Scribd company logo
CHEN, TSAI-MIN(陳 在民) Ph.D. Candidate
Graduate Program of Data Science, National Taiwan University and
Academia Sinica
Supervisors: Dr. Tsao, Yu(曹 昱) &
Dr. Shen, Chun-Yen(沈 俊嚴)
Data De-identification
and Re-identification
(Attack and Defense Contest)
Re-identification
De-identification
Introduction Conclusion
CHEN, TSAI-MIN
Ph.D. Candidate
Jellyfish-like Talent
Entomology
Microbiology Artificial
Intelligence
人工智慧學校
專案處-顧問
中研院生物醫學所
研究助理
日本京大學
分子微生物學所
台灣大學
昆蟲學系
政治大學
政治學系
Information
Security
中研院&台灣大學
資料科學學位學程
Find a way out(O)
Cross-domain?(X)
https://www.linkedin.com/in/%E9%99%B3%E5%9C%A8%E6%B0%91-tsai-min-chen-4675449a/
Political
Science
Medicine
Re-identification
De-identification
Introduction Conclusion
CHEN, TSAI-MIN
Ph.D. Candidate
3
• De-identification is the process used
to prevent someone's personal
identity from being revealed.
• Biomedical data may be de-identified
in order to comply with USA HIPAA
regulations that define and stipulate
patient privacy laws.
What is de-identification for?
Rights (OCR), Office for Civil (2012-09-07). "Methods for De-identification of PHI". HHS.gov. Retrieved 2020-11-08.
Health Insurance Portability and
Accountability Act (HIPAA)
Re-identification
De-identification
Introduction Conclusion
CHEN, TSAI-MIN
Ph.D. Candidate
Attack and Defense Contest
(Data De-identification and Re-identification)
Records(3915):
Unique personal ID
17 Features Our decrypting
model
Encrypted Records
A.I.
17
Features
A.I.
Our encrypting
model
:
:
De-identification Re-identification
Re-identification
De-identification
Introduction Conclusion
CHEN, TSAI-MIN
Ph.D. Candidate
Evaluation Metrics for De-identification
Convergence in distribution
Perfect Conditions for metrics:
Feature dependency
Individual similarity
Convergence in distribution & Feature dependency
Convergence in distribution
Convergence in distribution
Convergence in distribution
Convergence in distribution & Feature dependency
Convergence in distribution & Feature dependency
Convergence in distribution & Feature dependency
Re-identification
De-identification
Introduction Conclusion
CHEN, TSAI-MIN
Ph.D. Candidate
Individual similarity?
Feature dependency
Convergence in distribution
Perfect Conditions for metrics:
Convergence in distribution & Feature dependency
Convergence in distribution
Convergence in distribution
Convergence in distribution
Convergence in distribution & Feature dependency
Convergence in distribution & Feature dependency
Convergence in distribution & Feature dependency
Row Swapping for De-identification
Genetic Algorithm
Re-identification
De-identification
Introduction Conclusion
CHEN, TSAI-MIN
Ph.D. Candidate
Encrypting Model by Genetic Algorithm
Best individual similarity
Re-identification
De-identification
Introduction Conclusion
CHEN, TSAI-MIN
Ph.D. Candidate
De-identification Leaderboard
1
Best individual similarity
Re-identification
De-identification
Introduction Conclusion
CHEN, TSAI-MIN
Ph.D. Candidate
Evaluation Metrics for Re-identification
• To be simple, just 2 points:
1. Preventing contestants
knowing real personal IDs
of your encrypted data.
2. Deciphering contestants’
real personal IDs of their
encrypted data.
Re-identification
De-identification
Introduction Conclusion
CHEN, TSAI-MIN
Ph.D. Candidate
Evaluation Metrics for Re-identification
• To be simple, just 2 points:
1. Preventing contestants
knowing real personal IDs
of your encrypted data.
2. Deciphering contestants’
real personal IDs of their
encrypted data.
Genetic Algorithm is with
randomness naturally.
Deep Learning Algorithm
is strong in decoding.
Re-identification
De-identification
Introduction Conclusion
CHEN, TSAI-MIN
Ph.D. Candidate
De-identification
Decrypting Model
Decrypting Model by Deep Learning
?
?
?
?
?
Re-identification
Re-identification
De-identification
Introduction Conclusion
CHEN, TSAI-MIN
Ph.D. Candidate
Re-identification Leaderboard
Ranking Team Name Defense Rate(%) My Hit Rate(%)
1 b99612040 100 0
1 Phil0125 100 0
1 ck99831 100 0
1 zd408 100 0
5 jokerL88 98 0
6 SkyLiNing 86 14
7 Chris7 60 21
8 xiew9222 39 25
9 ken14420 33 52
Possible to decode
Myself
Impossible to decode
Re-identification
De-identification
Introduction Conclusion
CHEN, TSAI-MIN
Ph.D. Candidate
Conclusion
I am the Champion!! • To be simple, just 2 points:
1. Randomness is
impossible to decode.
2. Deep Learning is
strong for those
possible to decode.
Re-identification
De-identification
Introduction Conclusion
CHEN, TSAI-MIN
Ph.D. Candidate
Question & Answer
Any questions are fine!
一、頂級期刊:
1.IEEE Trans. on Consumer Electronics (minor revision)
2.iScience doi:10.1016/j.isci.2020.100886.
3.Nature Communications doi:10.1038/s41467-021-23165-1.
二、程式競賽:
1.資料去識別化與重新識別攻防競賽 冠軍
2.三總 2022資料科學競賽 冠軍
3.Tomofun 狗音辨識 AI 百萬挑戰賽 2021 第五名
4.IEEE Big Data FEMH Voice Data Challenge 2019 亞軍
5.The China Physiological Signal Challenge 2018 冠軍
6&7.英科智能MolHack 線上黑客松2018 冠軍、2019 季軍
8.第16回數理工學Programming Contest 季軍
三、獎項榮耀:
1.臺大 優秀青年
2.AI智能雲端運算應用競賽 優選
3.NTU-IBM量子計算Qiskit Hackathon Taiwan 季軍
4.臺大 利他獎
5.臺大統計應用論文競賽 優等獎
6.臺大學生代表大會 學生代表
7.量子計算 Quantum Computing 社群發起人
8.AI in TAIWAN台灣人工智慧黑客松 優等獎
9.TMU–MIT醫療數據松 季軍
10.台灣人工智慧技術大賽 亞軍
11.臺大成績優良 兩次
陳在民、1991/03/22出生、男性
…更多請看
LinkedIn

More Related Content

Recently uploaded

Challenges of Nation Building-1.pptx with more important
Challenges of Nation Building-1.pptx with more importantChallenges of Nation Building-1.pptx with more important
Challenges of Nation Building-1.pptx with more important
Sm321
 
Learn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queriesLearn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queries
manishkhaire30
 
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
sameer shah
 
Natural Language Processing (NLP), RAG and its applications .pptx
Natural Language Processing (NLP), RAG and its applications .pptxNatural Language Processing (NLP), RAG and its applications .pptx
Natural Language Processing (NLP), RAG and its applications .pptx
fkyes25
 
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data LakeViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
Walaa Eldin Moustafa
 
University of New South Wales degree offer diploma Transcript
University of New South Wales degree offer diploma TranscriptUniversity of New South Wales degree offer diploma Transcript
University of New South Wales degree offer diploma Transcript
soxrziqu
 
Global Situational Awareness of A.I. and where its headed
Global Situational Awareness of A.I. and where its headedGlobal Situational Awareness of A.I. and where its headed
Global Situational Awareness of A.I. and where its headed
vikram sood
 
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
slg6lamcq
 
在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样
在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样
在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样
v7oacc3l
 
Analysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performanceAnalysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performance
roli9797
 
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
g4dpvqap0
 
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdfEnhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
GetInData
 
Predictably Improve Your B2B Tech Company's Performance by Leveraging Data
Predictably Improve Your B2B Tech Company's Performance by Leveraging DataPredictably Improve Your B2B Tech Company's Performance by Leveraging Data
Predictably Improve Your B2B Tech Company's Performance by Leveraging Data
Kiwi Creative
 
Intelligence supported media monitoring in veterinary medicine
Intelligence supported media monitoring in veterinary medicineIntelligence supported media monitoring in veterinary medicine
Intelligence supported media monitoring in veterinary medicine
AndrzejJarynowski
 
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
ahzuo
 
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
apvysm8
 
一比一原版(牛布毕业证书)牛津布鲁克斯大学毕业证如何办理
一比一原版(牛布毕业证书)牛津布鲁克斯大学毕业证如何办理一比一原版(牛布毕业证书)牛津布鲁克斯大学毕业证如何办理
一比一原版(牛布毕业证书)牛津布鲁克斯大学毕业证如何办理
74nqk8xf
 
The Ipsos - AI - Monitor 2024 Report.pdf
The  Ipsos - AI - Monitor 2024 Report.pdfThe  Ipsos - AI - Monitor 2024 Report.pdf
The Ipsos - AI - Monitor 2024 Report.pdf
Social Samosa
 
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
ahzuo
 
The Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series DatabaseThe Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series Database
javier ramirez
 

Recently uploaded (20)

Challenges of Nation Building-1.pptx with more important
Challenges of Nation Building-1.pptx with more importantChallenges of Nation Building-1.pptx with more important
Challenges of Nation Building-1.pptx with more important
 
Learn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queriesLearn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queries
 
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
 
Natural Language Processing (NLP), RAG and its applications .pptx
Natural Language Processing (NLP), RAG and its applications .pptxNatural Language Processing (NLP), RAG and its applications .pptx
Natural Language Processing (NLP), RAG and its applications .pptx
 
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data LakeViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
 
University of New South Wales degree offer diploma Transcript
University of New South Wales degree offer diploma TranscriptUniversity of New South Wales degree offer diploma Transcript
University of New South Wales degree offer diploma Transcript
 
Global Situational Awareness of A.I. and where its headed
Global Situational Awareness of A.I. and where its headedGlobal Situational Awareness of A.I. and where its headed
Global Situational Awareness of A.I. and where its headed
 
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
 
在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样
在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样
在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样
 
Analysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performanceAnalysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performance
 
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
 
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdfEnhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
 
Predictably Improve Your B2B Tech Company's Performance by Leveraging Data
Predictably Improve Your B2B Tech Company's Performance by Leveraging DataPredictably Improve Your B2B Tech Company's Performance by Leveraging Data
Predictably Improve Your B2B Tech Company's Performance by Leveraging Data
 
Intelligence supported media monitoring in veterinary medicine
Intelligence supported media monitoring in veterinary medicineIntelligence supported media monitoring in veterinary medicine
Intelligence supported media monitoring in veterinary medicine
 
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
 
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
 
一比一原版(牛布毕业证书)牛津布鲁克斯大学毕业证如何办理
一比一原版(牛布毕业证书)牛津布鲁克斯大学毕业证如何办理一比一原版(牛布毕业证书)牛津布鲁克斯大学毕业证如何办理
一比一原版(牛布毕业证书)牛津布鲁克斯大学毕业证如何办理
 
The Ipsos - AI - Monitor 2024 Report.pdf
The  Ipsos - AI - Monitor 2024 Report.pdfThe  Ipsos - AI - Monitor 2024 Report.pdf
The Ipsos - AI - Monitor 2024 Report.pdf
 
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
 
The Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series DatabaseThe Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series Database
 

Featured

Product Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage EngineeringsProduct Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage Engineerings
Pixeldarts
 
How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental Health
ThinkNow
 
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfAI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
marketingartwork
 
Skeleton Culture Code
Skeleton Culture CodeSkeleton Culture Code
Skeleton Culture Code
Skeleton Technologies
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024
Neil Kimberley
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)
contently
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024
Albert Qian
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
Kurio // The Social Media Age(ncy)
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024
Search Engine Journal
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary
SpeakerHub
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd
Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next
Tessa Mero
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Lily Ray
 
How to have difficult conversations
How to have difficult conversations How to have difficult conversations
How to have difficult conversations
Rajiv Jayarajah, MAppComm, ACC
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
Christy Abraham Joy
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best Practices
Vit Horky
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project management
MindGenius
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
RachelPearson36
 
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Applitools
 
12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at Work12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at Work
GetSmarter
 

Featured (20)

Product Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage EngineeringsProduct Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage Engineerings
 
How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental Health
 
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfAI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
 
Skeleton Culture Code
Skeleton Culture CodeSkeleton Culture Code
Skeleton Culture Code
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search Intent
 
How to have difficult conversations
How to have difficult conversations How to have difficult conversations
How to have difficult conversations
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best Practices
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project management
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
 
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
 
12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at Work12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at Work
 

Data De-identification and Re-identification

  • 1. CHEN, TSAI-MIN(陳 在民) Ph.D. Candidate Graduate Program of Data Science, National Taiwan University and Academia Sinica Supervisors: Dr. Tsao, Yu(曹 昱) & Dr. Shen, Chun-Yen(沈 俊嚴) Data De-identification and Re-identification (Attack and Defense Contest)
  • 2. Re-identification De-identification Introduction Conclusion CHEN, TSAI-MIN Ph.D. Candidate Jellyfish-like Talent Entomology Microbiology Artificial Intelligence 人工智慧學校 專案處-顧問 中研院生物醫學所 研究助理 日本京大學 分子微生物學所 台灣大學 昆蟲學系 政治大學 政治學系 Information Security 中研院&台灣大學 資料科學學位學程 Find a way out(O) Cross-domain?(X) https://www.linkedin.com/in/%E9%99%B3%E5%9C%A8%E6%B0%91-tsai-min-chen-4675449a/ Political Science Medicine
  • 3. Re-identification De-identification Introduction Conclusion CHEN, TSAI-MIN Ph.D. Candidate 3 • De-identification is the process used to prevent someone's personal identity from being revealed. • Biomedical data may be de-identified in order to comply with USA HIPAA regulations that define and stipulate patient privacy laws. What is de-identification for? Rights (OCR), Office for Civil (2012-09-07). "Methods for De-identification of PHI". HHS.gov. Retrieved 2020-11-08. Health Insurance Portability and Accountability Act (HIPAA)
  • 4. Re-identification De-identification Introduction Conclusion CHEN, TSAI-MIN Ph.D. Candidate Attack and Defense Contest (Data De-identification and Re-identification) Records(3915): Unique personal ID 17 Features Our decrypting model Encrypted Records A.I. 17 Features A.I. Our encrypting model : : De-identification Re-identification
  • 5. Re-identification De-identification Introduction Conclusion CHEN, TSAI-MIN Ph.D. Candidate Evaluation Metrics for De-identification Convergence in distribution Perfect Conditions for metrics: Feature dependency Individual similarity Convergence in distribution & Feature dependency Convergence in distribution Convergence in distribution Convergence in distribution Convergence in distribution & Feature dependency Convergence in distribution & Feature dependency Convergence in distribution & Feature dependency
  • 6. Re-identification De-identification Introduction Conclusion CHEN, TSAI-MIN Ph.D. Candidate Individual similarity? Feature dependency Convergence in distribution Perfect Conditions for metrics: Convergence in distribution & Feature dependency Convergence in distribution Convergence in distribution Convergence in distribution Convergence in distribution & Feature dependency Convergence in distribution & Feature dependency Convergence in distribution & Feature dependency Row Swapping for De-identification Genetic Algorithm
  • 7. Re-identification De-identification Introduction Conclusion CHEN, TSAI-MIN Ph.D. Candidate Encrypting Model by Genetic Algorithm Best individual similarity
  • 8. Re-identification De-identification Introduction Conclusion CHEN, TSAI-MIN Ph.D. Candidate De-identification Leaderboard 1 Best individual similarity
  • 9. Re-identification De-identification Introduction Conclusion CHEN, TSAI-MIN Ph.D. Candidate Evaluation Metrics for Re-identification • To be simple, just 2 points: 1. Preventing contestants knowing real personal IDs of your encrypted data. 2. Deciphering contestants’ real personal IDs of their encrypted data.
  • 10. Re-identification De-identification Introduction Conclusion CHEN, TSAI-MIN Ph.D. Candidate Evaluation Metrics for Re-identification • To be simple, just 2 points: 1. Preventing contestants knowing real personal IDs of your encrypted data. 2. Deciphering contestants’ real personal IDs of their encrypted data. Genetic Algorithm is with randomness naturally. Deep Learning Algorithm is strong in decoding.
  • 11. Re-identification De-identification Introduction Conclusion CHEN, TSAI-MIN Ph.D. Candidate De-identification Decrypting Model Decrypting Model by Deep Learning ? ? ? ? ? Re-identification
  • 12. Re-identification De-identification Introduction Conclusion CHEN, TSAI-MIN Ph.D. Candidate Re-identification Leaderboard Ranking Team Name Defense Rate(%) My Hit Rate(%) 1 b99612040 100 0 1 Phil0125 100 0 1 ck99831 100 0 1 zd408 100 0 5 jokerL88 98 0 6 SkyLiNing 86 14 7 Chris7 60 21 8 xiew9222 39 25 9 ken14420 33 52 Possible to decode Myself Impossible to decode
  • 13. Re-identification De-identification Introduction Conclusion CHEN, TSAI-MIN Ph.D. Candidate Conclusion I am the Champion!! • To be simple, just 2 points: 1. Randomness is impossible to decode. 2. Deep Learning is strong for those possible to decode.
  • 14. Re-identification De-identification Introduction Conclusion CHEN, TSAI-MIN Ph.D. Candidate Question & Answer Any questions are fine! 一、頂級期刊: 1.IEEE Trans. on Consumer Electronics (minor revision) 2.iScience doi:10.1016/j.isci.2020.100886. 3.Nature Communications doi:10.1038/s41467-021-23165-1. 二、程式競賽: 1.資料去識別化與重新識別攻防競賽 冠軍 2.三總 2022資料科學競賽 冠軍 3.Tomofun 狗音辨識 AI 百萬挑戰賽 2021 第五名 4.IEEE Big Data FEMH Voice Data Challenge 2019 亞軍 5.The China Physiological Signal Challenge 2018 冠軍 6&7.英科智能MolHack 線上黑客松2018 冠軍、2019 季軍 8.第16回數理工學Programming Contest 季軍 三、獎項榮耀: 1.臺大 優秀青年 2.AI智能雲端運算應用競賽 優選 3.NTU-IBM量子計算Qiskit Hackathon Taiwan 季軍 4.臺大 利他獎 5.臺大統計應用論文競賽 優等獎 6.臺大學生代表大會 學生代表 7.量子計算 Quantum Computing 社群發起人 8.AI in TAIWAN台灣人工智慧黑客松 優等獎 9.TMU–MIT醫療數據松 季軍 10.台灣人工智慧技術大賽 亞軍 11.臺大成績優良 兩次 陳在民、1991/03/22出生、男性 …更多請看 LinkedIn