SlideShare a Scribd company logo
1 of 20
An analysis of patient diversity in
oncology clinical trials
SAMIA ANSARI, UNIVERSITY OF GEORGIA
SARTOGRAPHY, LLC + OPENSOURCE CONNECTIONS, LLC
DATASTART WORKSHOP PRESENTATION 09.27.2016
How well are
women and
ethnic/racial
minorities
represented in
cancer clinical trials?
U. S. population
Cancer
incidence
Trial
Enrollment
Background: Clinical development
 Only a small percentage of cancer patients participate
(Murthy, 2004)
 Women and minorities underrepresented in biomedical
research (Oh, 2015)
 Need to focus precision medicine initiatives like Cancer
Moonshot
Federal Classifications
Category Classifications
Gender • Male
• Female
Ethnicity • Hispanic
• Non-Hispanic
Race • Black
• White
• Asian/Pacific Islander
(API)
• American Indian (AI)
 Used by NIH/OMB to
collect U.S. demographic
data
Why is
representation
important?
 Generalizability of study results
 Alleviate race-related outcome
disparities
 Stipulations on federal funding
 Healthcare consumer
demographics
3 datasets
Dataset Source Information
Surveillance, Epidemiology, and
End Results (SEER) Program
National Cancer Institute Cancer incidence rates
Aggregated Analysis of
ClinicalTrials.gov (AACT)
Clinical Trials Transformation
Initiative
Trial enrollment numbers
Bridged-Race Population
Estimates
U.S. Census Bureau and National
Center for Health Statistics
National demographic data
Parameters
 2002-2012 enrollment period
 5 cancer types
 Lung, colorectal, ovarian, prostatic, breast
 Omitted studies for which
 no demographic data
 categories that don’t map to NIH/OMB groups
 multiple cancer types studied in one trial
Methodology
 Estimate cancer diagnosis numbers for each
demographic group
 Extract trial enrollment data from AACT database
 Find any relationships between demographics and
enrollment status
Methodology, cont.
Tools
 Data extraction:
 SEER*Stat software
 SQL
 Ruby
 Analysis:
 R/RStudio
 STATA13
Results - Gender
0.0
20.0
40.0
60.0
80.0
100.0
Lung,
Male
Lung,
Female
CRC,
Male
CRC,
Female
Proportion(%)
Patient Type
Trial Enrollment vs. Incident Cases
Trial enrollees
Incident cases
Results - Race
0
1
2
White Black Asian-Pacific
Islander
American
Indian
Oddsratio
Racial group
Comparative odds of enrollment
Results – Race
0.0
20.0
Proportion(%)
Cancer
Black patient enrollment, 2002-2012
Proportion of trial enrollees Proportion of national incident cases
Proportion of population
Results – Race
0.0
5.0
10.0
15.0
All Lung CRC Breast Ovarian Prostatic
Proportion(%)
Cancer
API patient enrollment, 2002-2012
Proportion of trial enrollees Proportion of national incident cases Proportion of population
Results – Race
0.00
2.00
Proportion(%)
Cancer
AI patient enrollment, 2002-2012
Proportion of trial enrollees Proportion of national incident cases
Proportion of population
Lessons learned
 Ethnicity and race data not adequately captured by trial
sponsors/AACT
 Not enough ethnicity data
 Limit customization of race/ethnicity fields
 Trial sponsors should include yearly accrual numbers
Next steps
 Build on initial analysis
 Manuscript in progress
 Possible collaboration
 Applying for fellowships!
Additional information
Contact me!
 khatsar@gmail.com
 @sfansari
 linkedin.com/in/samiaansari
 angel.co/samia-ansari
Project blog
 Blog posts on
OpenSourceConnections site:
 Introduction to my project
 Deploying the AACT Oracle database
using Docker
 Learning the shape of my data
 Expanding frequency data to case
form in R and STATA
Acknowledgements
Mentorship
 Dan Funk (Sartography, LLC)
 Eric Pugh (OpenSource Connections, LLC)
 Dr. Joy Peterson (UGA BHSI)
Support
 SouthBDHub DataStart Program
 Computing Community Consortium
Sartography and OpenSource
Connections
 Search analytics
 Data visualization
 Technical personalization,
recommendation systems
 Clients include
 NLM
 NCBI
 USPTO
 Academic institutions
Bibliography
 Murthy VH, Krumholz HM, Gross CP. Participation in cancer clinical trials: race-, sex-,
and age-based disparities. Journal of the American Medical Association. Available
online: http://www.ncbi.nlm.nih.gov/pubmed/15187053/
 OH SS, Galanter J, Thakur N, Pino-Yanes M, Barcelo NE, White MJ, de Bruin DM,
Greenblatt RM, Bibbins-Domingo K, Wu AHB, Borrell LN, Gunter C, Powe NR,
Burchard EG. Diversity in Clinical and Biomedical Research: A Promise Yet to be
Fulfilled. PLoS Medicine. 2015; 12. Available online:
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4679830/
 U.S. National Library of Medicine. Data, Tools, and Statistics. NIH website. Last
updated: 11 April 2016. Available online:
https://www.nlm.nih.gov/hsrinfo/datasites.html

More Related Content

What's hot

Streamlining the Residency Application and Match Process: Innovations in Appl...
Streamlining the Residency Application and Match Process: Innovations in Appl...Streamlining the Residency Application and Match Process: Innovations in Appl...
Streamlining the Residency Application and Match Process: Innovations in Appl...Zach Jarou
 
EMRA Match v4.0: An Alternative to Doximity's Residency Navigator
EMRA Match v4.0: An Alternative to Doximity's Residency NavigatorEMRA Match v4.0: An Alternative to Doximity's Residency Navigator
EMRA Match v4.0: An Alternative to Doximity's Residency NavigatorZach Jarou
 
EMRA Hangouts: A Novel Emergency Medicine Advising Platform
EMRA Hangouts: A Novel Emergency Medicine Advising PlatformEMRA Hangouts: A Novel Emergency Medicine Advising Platform
EMRA Hangouts: A Novel Emergency Medicine Advising PlatformZach Jarou
 
Global Dementia Legacy Event: Dr Neil Buckholtz
Global Dementia Legacy Event: Dr Neil Buckholtz Global Dementia Legacy Event: Dr Neil Buckholtz
Global Dementia Legacy Event: Dr Neil Buckholtz Department of Health
 
Diagnostic criteria and clinical guidelines standardization to automate case ...
Diagnostic criteria and clinical guidelines standardization to automate case ...Diagnostic criteria and clinical guidelines standardization to automate case ...
Diagnostic criteria and clinical guidelines standardization to automate case ...Melanie Courtot
 
Developing Evidence-Based Guidelines for Patient-Accessible Medical Records
Developing Evidence-Based Guidelines for Patient-Accessible Medical RecordsDeveloping Evidence-Based Guidelines for Patient-Accessible Medical Records
Developing Evidence-Based Guidelines for Patient-Accessible Medical RecordsChristina Zarcadoolas
 
Mobilizing informational resources webinar
Mobilizing informational resources   webinarMobilizing informational resources   webinar
Mobilizing informational resources webinarAnn-Marie Roche
 
AWARENESS OF RIGHT TO INFORMATION ACT AMONG THE COLLEGE STUDENTS OF PERIYAR E...
AWARENESS OF RIGHT TO INFORMATION ACT AMONG THE COLLEGE STUDENTS OF PERIYAR E...AWARENESS OF RIGHT TO INFORMATION ACT AMONG THE COLLEGE STUDENTS OF PERIYAR E...
AWARENESS OF RIGHT TO INFORMATION ACT AMONG THE COLLEGE STUDENTS OF PERIYAR E...chelliah paramasivan
 
Appam2011 access de_leire
Appam2011 access de_leireAppam2011 access de_leire
Appam2011 access de_leiresoder145
 
Quality and safety of health information on the Internet: Who decides and ho...
Quality and safety of health information on the Internet: Who decides and ho...Quality and safety of health information on the Internet: Who decides and ho...
Quality and safety of health information on the Internet: Who decides and ho...Francisco J Grajales III
 

What's hot (11)

Streamlining the Residency Application and Match Process: Innovations in Appl...
Streamlining the Residency Application and Match Process: Innovations in Appl...Streamlining the Residency Application and Match Process: Innovations in Appl...
Streamlining the Residency Application and Match Process: Innovations in Appl...
 
EMRA Match v4.0: An Alternative to Doximity's Residency Navigator
EMRA Match v4.0: An Alternative to Doximity's Residency NavigatorEMRA Match v4.0: An Alternative to Doximity's Residency Navigator
EMRA Match v4.0: An Alternative to Doximity's Residency Navigator
 
EMRA Hangouts: A Novel Emergency Medicine Advising Platform
EMRA Hangouts: A Novel Emergency Medicine Advising PlatformEMRA Hangouts: A Novel Emergency Medicine Advising Platform
EMRA Hangouts: A Novel Emergency Medicine Advising Platform
 
Global Dementia Legacy Event: Dr Neil Buckholtz
Global Dementia Legacy Event: Dr Neil Buckholtz Global Dementia Legacy Event: Dr Neil Buckholtz
Global Dementia Legacy Event: Dr Neil Buckholtz
 
Diagnostic criteria and clinical guidelines standardization to automate case ...
Diagnostic criteria and clinical guidelines standardization to automate case ...Diagnostic criteria and clinical guidelines standardization to automate case ...
Diagnostic criteria and clinical guidelines standardization to automate case ...
 
Developing Evidence-Based Guidelines for Patient-Accessible Medical Records
Developing Evidence-Based Guidelines for Patient-Accessible Medical RecordsDeveloping Evidence-Based Guidelines for Patient-Accessible Medical Records
Developing Evidence-Based Guidelines for Patient-Accessible Medical Records
 
Designingforinnovationppayneucsf06 2105final-150610135139-lva1-app6892
Designingforinnovationppayneucsf06 2105final-150610135139-lva1-app6892Designingforinnovationppayneucsf06 2105final-150610135139-lva1-app6892
Designingforinnovationppayneucsf06 2105final-150610135139-lva1-app6892
 
Mobilizing informational resources webinar
Mobilizing informational resources   webinarMobilizing informational resources   webinar
Mobilizing informational resources webinar
 
AWARENESS OF RIGHT TO INFORMATION ACT AMONG THE COLLEGE STUDENTS OF PERIYAR E...
AWARENESS OF RIGHT TO INFORMATION ACT AMONG THE COLLEGE STUDENTS OF PERIYAR E...AWARENESS OF RIGHT TO INFORMATION ACT AMONG THE COLLEGE STUDENTS OF PERIYAR E...
AWARENESS OF RIGHT TO INFORMATION ACT AMONG THE COLLEGE STUDENTS OF PERIYAR E...
 
Appam2011 access de_leire
Appam2011 access de_leireAppam2011 access de_leire
Appam2011 access de_leire
 
Quality and safety of health information on the Internet: Who decides and ho...
Quality and safety of health information on the Internet: Who decides and ho...Quality and safety of health information on the Internet: Who decides and ho...
Quality and safety of health information on the Internet: Who decides and ho...
 

Similar to DataStart Final

Day 2 Big Data panel at the NIH BD2K All Hands 2016 meeting
Day 2 Big Data panel at the NIH BD2K All Hands 2016 meetingDay 2 Big Data panel at the NIH BD2K All Hands 2016 meeting
Day 2 Big Data panel at the NIH BD2K All Hands 2016 meetingWarren Kibbe
 
National Cancer Data Ecosystem and Data Sharing
National Cancer Data Ecosystem and Data SharingNational Cancer Data Ecosystem and Data Sharing
National Cancer Data Ecosystem and Data SharingWarren Kibbe
 
Patient profiling disaggregating the data
Patient profiling disaggregating the dataPatient profiling disaggregating the data
Patient profiling disaggregating the datanhsnwHELP
 
Participation of the population in decisions about their health and in the pr...
Participation of the population in decisions about their health and in the pr...Participation of the population in decisions about their health and in the pr...
Participation of the population in decisions about their health and in the pr...Pydesalud
 
Advancing The Prevention And Cure Of Cancer
Advancing The Prevention And Cure Of CancerAdvancing The Prevention And Cure Of Cancer
Advancing The Prevention And Cure Of Cancerfondas vakalis
 
Clinical Research Informatics (CRI) Year-in-Review 2014
Clinical Research Informatics (CRI) Year-in-Review 2014Clinical Research Informatics (CRI) Year-in-Review 2014
Clinical Research Informatics (CRI) Year-in-Review 2014Peter Embi
 
Khoury ashg2014
Khoury ashg2014Khoury ashg2014
Khoury ashg2014muink
 
ISCB ECCB BD2K keynote Kibbe 201707
ISCB ECCB BD2K keynote Kibbe 201707ISCB ECCB BD2K keynote Kibbe 201707
ISCB ECCB BD2K keynote Kibbe 201707Warren Kibbe
 
2016 CRI Year-in-Review
2016 CRI Year-in-Review2016 CRI Year-in-Review
2016 CRI Year-in-ReviewPeter Embi
 
African american women and colorectal cancer research: Why Your Participation...
African american women and colorectal cancer research: Why Your Participation...African american women and colorectal cancer research: Why Your Participation...
African american women and colorectal cancer research: Why Your Participation...Kelly Brittain
 
Connecting Patients and Clinical Trials Digitally
Connecting Patients and Clinical Trials DigitallyConnecting Patients and Clinical Trials Digitally
Connecting Patients and Clinical Trials DigitallyKatja Reuter, PhD
 
Vojtech huser-2009-amia-clinical-research-informatics-panel-eligibility-v011
Vojtech huser-2009-amia-clinical-research-informatics-panel-eligibility-v011Vojtech huser-2009-amia-clinical-research-informatics-panel-eligibility-v011
Vojtech huser-2009-amia-clinical-research-informatics-panel-eligibility-v011Vojtech Huser
 
Secondary Data Analysis
Secondary Data AnalysisSecondary Data Analysis
Secondary Data AnalysisREY DECASTRO
 
Kibbe One Voice Against Cancer 20170605
Kibbe One Voice Against Cancer 20170605Kibbe One Voice Against Cancer 20170605
Kibbe One Voice Against Cancer 20170605Warren Kibbe
 
Nci clinical genomics data sharing ncra sept 2016
Nci clinical genomics data sharing ncra sept 2016Nci clinical genomics data sharing ncra sept 2016
Nci clinical genomics data sharing ncra sept 2016Warren Kibbe
 
Keynote at NVIDIA GPU Technology Conference in D.C.
Keynote at NVIDIA GPU Technology Conference in D.C.Keynote at NVIDIA GPU Technology Conference in D.C.
Keynote at NVIDIA GPU Technology Conference in D.C.Jerry Lee
 
M&E Systems for Evaluation: Where M meets E
M&E Systems for Evaluation: Where M meets EM&E Systems for Evaluation: Where M meets E
M&E Systems for Evaluation: Where M meets EMEASURE Evaluation
 
Telemedicine - Moving Beyond the Video Visit
Telemedicine - Moving Beyond the Video VisitTelemedicine - Moving Beyond the Video Visit
Telemedicine - Moving Beyond the Video VisitHoward Reis
 

Similar to DataStart Final (20)

Day 2 Big Data panel at the NIH BD2K All Hands 2016 meeting
Day 2 Big Data panel at the NIH BD2K All Hands 2016 meetingDay 2 Big Data panel at the NIH BD2K All Hands 2016 meeting
Day 2 Big Data panel at the NIH BD2K All Hands 2016 meeting
 
National Cancer Data Ecosystem and Data Sharing
National Cancer Data Ecosystem and Data SharingNational Cancer Data Ecosystem and Data Sharing
National Cancer Data Ecosystem and Data Sharing
 
Big data sharing
Big data sharingBig data sharing
Big data sharing
 
Patient profiling disaggregating the data
Patient profiling disaggregating the dataPatient profiling disaggregating the data
Patient profiling disaggregating the data
 
Participation of the population in decisions about their health and in the pr...
Participation of the population in decisions about their health and in the pr...Participation of the population in decisions about their health and in the pr...
Participation of the population in decisions about their health and in the pr...
 
Advancing The Prevention And Cure Of Cancer
Advancing The Prevention And Cure Of CancerAdvancing The Prevention And Cure Of Cancer
Advancing The Prevention And Cure Of Cancer
 
Clinical Research Informatics (CRI) Year-in-Review 2014
Clinical Research Informatics (CRI) Year-in-Review 2014Clinical Research Informatics (CRI) Year-in-Review 2014
Clinical Research Informatics (CRI) Year-in-Review 2014
 
Khoury ashg2014
Khoury ashg2014Khoury ashg2014
Khoury ashg2014
 
ISCB ECCB BD2K keynote Kibbe 201707
ISCB ECCB BD2K keynote Kibbe 201707ISCB ECCB BD2K keynote Kibbe 201707
ISCB ECCB BD2K keynote Kibbe 201707
 
2016 CRI Year-in-Review
2016 CRI Year-in-Review2016 CRI Year-in-Review
2016 CRI Year-in-Review
 
African american women and colorectal cancer research: Why Your Participation...
African american women and colorectal cancer research: Why Your Participation...African american women and colorectal cancer research: Why Your Participation...
African american women and colorectal cancer research: Why Your Participation...
 
Connecting Patients and Clinical Trials Digitally
Connecting Patients and Clinical Trials DigitallyConnecting Patients and Clinical Trials Digitally
Connecting Patients and Clinical Trials Digitally
 
Vojtech huser-2009-amia-clinical-research-informatics-panel-eligibility-v011
Vojtech huser-2009-amia-clinical-research-informatics-panel-eligibility-v011Vojtech huser-2009-amia-clinical-research-informatics-panel-eligibility-v011
Vojtech huser-2009-amia-clinical-research-informatics-panel-eligibility-v011
 
ENAR 2020
ENAR 2020ENAR 2020
ENAR 2020
 
Secondary Data Analysis
Secondary Data AnalysisSecondary Data Analysis
Secondary Data Analysis
 
Kibbe One Voice Against Cancer 20170605
Kibbe One Voice Against Cancer 20170605Kibbe One Voice Against Cancer 20170605
Kibbe One Voice Against Cancer 20170605
 
Nci clinical genomics data sharing ncra sept 2016
Nci clinical genomics data sharing ncra sept 2016Nci clinical genomics data sharing ncra sept 2016
Nci clinical genomics data sharing ncra sept 2016
 
Keynote at NVIDIA GPU Technology Conference in D.C.
Keynote at NVIDIA GPU Technology Conference in D.C.Keynote at NVIDIA GPU Technology Conference in D.C.
Keynote at NVIDIA GPU Technology Conference in D.C.
 
M&E Systems for Evaluation: Where M meets E
M&E Systems for Evaluation: Where M meets EM&E Systems for Evaluation: Where M meets E
M&E Systems for Evaluation: Where M meets E
 
Telemedicine - Moving Beyond the Video Visit
Telemedicine - Moving Beyond the Video VisitTelemedicine - Moving Beyond the Video Visit
Telemedicine - Moving Beyond the Video Visit
 

DataStart Final

  • 1. An analysis of patient diversity in oncology clinical trials SAMIA ANSARI, UNIVERSITY OF GEORGIA SARTOGRAPHY, LLC + OPENSOURCE CONNECTIONS, LLC DATASTART WORKSHOP PRESENTATION 09.27.2016
  • 2. How well are women and ethnic/racial minorities represented in cancer clinical trials? U. S. population Cancer incidence Trial Enrollment
  • 3. Background: Clinical development  Only a small percentage of cancer patients participate (Murthy, 2004)  Women and minorities underrepresented in biomedical research (Oh, 2015)  Need to focus precision medicine initiatives like Cancer Moonshot
  • 4. Federal Classifications Category Classifications Gender • Male • Female Ethnicity • Hispanic • Non-Hispanic Race • Black • White • Asian/Pacific Islander (API) • American Indian (AI)  Used by NIH/OMB to collect U.S. demographic data
  • 5. Why is representation important?  Generalizability of study results  Alleviate race-related outcome disparities  Stipulations on federal funding  Healthcare consumer demographics
  • 6. 3 datasets Dataset Source Information Surveillance, Epidemiology, and End Results (SEER) Program National Cancer Institute Cancer incidence rates Aggregated Analysis of ClinicalTrials.gov (AACT) Clinical Trials Transformation Initiative Trial enrollment numbers Bridged-Race Population Estimates U.S. Census Bureau and National Center for Health Statistics National demographic data
  • 7. Parameters  2002-2012 enrollment period  5 cancer types  Lung, colorectal, ovarian, prostatic, breast  Omitted studies for which  no demographic data  categories that don’t map to NIH/OMB groups  multiple cancer types studied in one trial
  • 8. Methodology  Estimate cancer diagnosis numbers for each demographic group  Extract trial enrollment data from AACT database  Find any relationships between demographics and enrollment status
  • 9. Methodology, cont. Tools  Data extraction:  SEER*Stat software  SQL  Ruby  Analysis:  R/RStudio  STATA13
  • 11. Results - Race 0 1 2 White Black Asian-Pacific Islander American Indian Oddsratio Racial group Comparative odds of enrollment
  • 12. Results – Race 0.0 20.0 Proportion(%) Cancer Black patient enrollment, 2002-2012 Proportion of trial enrollees Proportion of national incident cases Proportion of population
  • 13. Results – Race 0.0 5.0 10.0 15.0 All Lung CRC Breast Ovarian Prostatic Proportion(%) Cancer API patient enrollment, 2002-2012 Proportion of trial enrollees Proportion of national incident cases Proportion of population
  • 14. Results – Race 0.00 2.00 Proportion(%) Cancer AI patient enrollment, 2002-2012 Proportion of trial enrollees Proportion of national incident cases Proportion of population
  • 15. Lessons learned  Ethnicity and race data not adequately captured by trial sponsors/AACT  Not enough ethnicity data  Limit customization of race/ethnicity fields  Trial sponsors should include yearly accrual numbers
  • 16. Next steps  Build on initial analysis  Manuscript in progress  Possible collaboration  Applying for fellowships!
  • 17. Additional information Contact me!  khatsar@gmail.com  @sfansari  linkedin.com/in/samiaansari  angel.co/samia-ansari Project blog  Blog posts on OpenSourceConnections site:  Introduction to my project  Deploying the AACT Oracle database using Docker  Learning the shape of my data  Expanding frequency data to case form in R and STATA
  • 18. Acknowledgements Mentorship  Dan Funk (Sartography, LLC)  Eric Pugh (OpenSource Connections, LLC)  Dr. Joy Peterson (UGA BHSI) Support  SouthBDHub DataStart Program  Computing Community Consortium
  • 19. Sartography and OpenSource Connections  Search analytics  Data visualization  Technical personalization, recommendation systems  Clients include  NLM  NCBI  USPTO  Academic institutions
  • 20. Bibliography  Murthy VH, Krumholz HM, Gross CP. Participation in cancer clinical trials: race-, sex-, and age-based disparities. Journal of the American Medical Association. Available online: http://www.ncbi.nlm.nih.gov/pubmed/15187053/  OH SS, Galanter J, Thakur N, Pino-Yanes M, Barcelo NE, White MJ, de Bruin DM, Greenblatt RM, Bibbins-Domingo K, Wu AHB, Borrell LN, Gunter C, Powe NR, Burchard EG. Diversity in Clinical and Biomedical Research: A Promise Yet to be Fulfilled. PLoS Medicine. 2015; 12. Available online: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4679830/  U.S. National Library of Medicine. Data, Tools, and Statistics. NIH website. Last updated: 11 April 2016. Available online: https://www.nlm.nih.gov/hsrinfo/datasites.html