SlideShare a Scribd company logo
The ACP
MS Repository
A Case Study

tranSMART Community Meeting, Nov. 2013

Stephen Wicks, Ph.D.
The ACP Repository: a Case Study

What is Multiple Sclerosis?









Chronic inflammatory/demyelination disorder affecting
the CNS. (about 0.1%)
Leading cause of neurological disability in young
adults.
Symptoms are variable and significant. They include
vision, cognition, locomotion, pain, disorientation,
dexterity, mood, bowel/bladder control, others.
Generally progressive, but progression is idiosyncratic.
(CISRRMSSPMS, vs. CISPPMS etc.)
Complex etiology
The ACP Repository: a Case Study

What is the cost of MS?
 Difficult

and costly to diagnose (MRI, symptom
variability leads to extensive differential diagnosis)
 Treatments can slow progression, but are expensive.
 Many different drugs exist, but patient stratification
for maximum efficacy and minimum side effects is
non-existent. “Role the dice”
 Often strikes early in life, and is a life-long disability.
 Average Diagnosis at about 30. 5% before 16.
Orion Bionetworks
 ACP

is a founding member of Orion.
 Orion seeks to cure MS by harnessing the power of
computational modeling of disease progression.
 ACP will provide its data to Orion in tranSMART to
facilitate this goal.
 Rancho BioSciences will curate and harmonize the
ACP data for Orion
The ACP Repository: a Case Study

ACP and the MS Repository
 Founded

in 2001 by an MIT entrepreneur with MS
 ACP MS Repository started in 2006. The goal was
to identify the cause of MS.
 ACP MS Repository enrollment shut down this
year. Approximately 3200 participants enrolled.
 Biosamples, demographics, medical history etc.
 Research data
 OPT-UP
The ACP Repository: a Case Study

Repository Enrollment Status (6/21/2013)
• 3,220 subjects enrolled; 467 longitudinal visits completed
Subject Breakdown

Treatment Status

Subjects: 3220

TX+

TX
Naïve

TX-

Time Since Diagnosis
Pending

< 5 yrs

5-10 yrs

> 10 yrs

Pending

RRMS:1436

146

229

116

675

388

291

82

SPMS:234

164

53

13

4

27

58

149

0

PPMS:117

46

38

31

2

48

37

32

0

CIS:88

19

2

62

5

74

5

5

4

TM:175

12

17

131

15

106

32

24

13

NMO:383

200

66

39

78

252

52

13

66

ON:16

1

1

12

2

12

1

0

3

ADEM:32

MS:
1787

945

1

6

20

5

24

5

1

2

Cases:
2481

Controls: 739

• DNA, RNA, Plasma, Serum, PBMCs + data from 52 page CRF
6
The ACP Repository: a Case Study

The ACP Engine
“Matchmaker”
Database Graphical
User Interface

MS Discovery Forum
Reviewing Developments in the MS Field
Communicating with MS Researchers

Allowing MS Researchers
Worldwide to Explore the
ACP Repository Database

ACP Repository
$13 million
Invested

3200+ participants
Biosamples & Datasets

77 sets of biosamples+data

(b,m)illions of datapoints,
From 36 studies, so far

MS Researchers
Worldwide
Academia & Industry

Insights and Results

Mechanisms

Diagnostics

Causes

Treatments
The ACP Repository: a Case Study

ACP MS Repository
Open-access collection of highly annotated bloodderived samples plus data from MS, related diseases, &
control subjects gathered from 2006-2013.
 Requirement for research data derived from samples to
be with them (ACP) allowed us to obtain IP protection).
“Workingdeposited (with a provision forcritical samples and confirm our
results for only $20,000. If I had to obtain these samples from scratch, it would
 Contributes to MS+ research in many ways:
have cost $1 million and added 5 years to the project.”
 - Enables studies that might not be conducted School of
Thomas M. Aune, PhD, Molecular Biology, Vanderbilt University
Medicine otherwise (academic & commercial)
(from Scientific American)
 Creates a common results database for studies from
multiple bio-analytical techniques on overlapping sets
of subjects.
 Approximately 3200 participants.

The ACP Repository: a Case Study

Case Report Form (CRF)
Curation challenges
The ACP Repository: a Case Study

ACP Case Report Form
48 Page (first visit) and 38 page (second visit)
complete clinical workup
 Form completed with the assistance of a
clinical research associate over a several hour
interview (with sample draw and lab workup)
 Broad data: 80 distinct tables in an SQL
database
 Deep data: in flat data files, more than 20
million cells

The ACP Repository: a Case Study

CRF Sample Fields

Study drugs Drug”measurelethaly so) drug meaningless.
103 distinct textual responses.“CS-0777”, orordinal with betaseron, BETASERON, etc.
Illustrates some Nothe problems associated order was units dataset.
Inappropriate (sometimesdrugfrequency.
of consistent etc…
of trail enrollment
“First Drug”, “Second “Betseron”, beta-seron, curating this
“BG00012 (FUMARATE) OR PLACEBO”
The ACP Repository: a Case Study

DMD Curation Solutions
 We

applied drug ontologies and mapping
vocabularies where needed.
 We repaired and consolidated dose,
frequency, etc. to a single measure with 3
values (high, standard, low)
 We re-formatted the data to eliminate the
ambiguous cardinal ordering of reporting
The ACP Repository: a Case Study

CRF Sample Fields

Multiple Drugs (Observations) were addressed with…
The ACP Repository: a Case Study

VISIT_NAME application
The ACP Repository: a Case Study

Controlled Vocabularies
(sports)
 ~5000

responses
 779 distinct sports reported
 When filtered by “ski”,29
 “gym”, 45
 “walk”, 30; “jog”, 17, “run”, 40
The ACP Repository: a Case Study

Controlled Vocabularies (sports)

All sports mapped to a 29 term vocabulary.
The ACP Repository: a Case Study

Controlled Vocabularies (pets)
 ~6500

pets reported
 600 distinct pets reported
 When filtered by “dog”, 112, however, this misses
mispellings (“diog”, “dot”, “pubs”, dog-like pets
“wolf”, “half-wolf”, “mutt”, and breeds (“poddle”,
“poodle”, “Afghan Hound”, etc.)
 59 additional dog-like entries
The ACP Repository: a Case Study

Controlled Vocabularies (pets)

All pets mapped to a 31 category controlled vocabulary
The ACP Repository: a Case Study

Medication Curation Challenges









Amitriptaline
>10,000 medications listed.
Amitriptylin
Amitriptyline
2703 distinct medications listed.
Amitriptyline HCL
Mapped these to 614 real medications (e.g.
Amitroptyline
Amitriptyline)
Amitryetyline
This was split into twoAmitrypatiline
tables:
Amitryptailine
 Continuing Medications (541 entities)
Amitryptaline
 Stopped Medications (317 entities)
Amitryptilin
VISIT_NAME was used Amitryptiline distinct observations
to represent
across the whole study
Amitryptilline
Amitryptylene
Truly longitudinal measures were reified in the tree
hierarchy in the data Amitryptylinefile.
mapping
The ACP Repository: a Case Study

ACP Repository
Tree
The ACP Repository: a Case Study

Date and Time Coding
 All
Dates in multiple formats:
15/12/2001
15/Dec/2001
Dec-2001
2001
Dec./2001
12/2001
--/--/-----/2001
------------/12/2001

dates converted to
periods (Months, Years,
or Days) prior to the
relevant blood draw
date.
 Dates were represented
by International
Standard ISO 8601. i.e.
YYYY-MM-DD (e.g. 200112-15)
The ACP Repository: a Case Study

Repository Usage




77 studies ongoing or completed
36 studies have returned data to ACP
Data types:








Low-D biomarker (antibodies, metabolites, serum
markers of inflammation, etc.)
Low-D genotype data
High-D SNP/GWAS data
Gene-expression studies
Whole-genome sequencing (2 distinct studies)

Study types:




Etiology
Diagnostics
Disease activity biomarkers
The ACP Repository: a Case Study

Repository Usage
The ACP Repository: a Case Study

Research Data Curation Challenges
 Few

guidelines provided to researchers for data
formatting or treatment
 Often little or no documentation describing how
the data was generated or handled (raw vs.
normalized, transformations e.g.)
 Load study meta-data (contact info, description,
etc. at the node level)
The ACP Repository: a Case Study

Sample Study Results
Biogen gene expression study:
Designed to identify gene-expression
profiles that discriminate progressive
forms of MS from relapsing-remitting
forms of the disease.
The ACP Repository: a Case Study

Future Directions








Rancho BioSciences is providing guidance to ACP
for data-collection practices going forward (e.g.
OPT-UP)
We loaded the clinical data and 6 sample study
datasets into an Oracle-based tranSMART instance
that we host in-house for QC purposes.
The full dataset is slated to be loaded into a 1.1
postgreSQL-based tranSMART instance (hosted by
Recombinant by Deloitte for Orion).
This and other data sources (Inst. For Neuroscience
at B&W) will be analyzed and modeled by Orion
The ACP Repository: a Case Study

Thanks for your time!
Questions?

More Related Content

Viewers also liked

tranSMART Community Meeting 5-7 Nov 13 - Session 5: Advancing tranSMART Analy...
tranSMART Community Meeting 5-7 Nov 13 - Session 5: Advancing tranSMART Analy...tranSMART Community Meeting 5-7 Nov 13 - Session 5: Advancing tranSMART Analy...
tranSMART Community Meeting 5-7 Nov 13 - Session 5: Advancing tranSMART Analy...
David Peyruc
 
tranSMART Community Meeting 5-7 Nov 13 - Session 3: tranSMART a Data Warehous...
tranSMART Community Meeting 5-7 Nov 13 - Session 3: tranSMART a Data Warehous...tranSMART Community Meeting 5-7 Nov 13 - Session 3: tranSMART a Data Warehous...
tranSMART Community Meeting 5-7 Nov 13 - Session 3: tranSMART a Data Warehous...
David Peyruc
 
tranSMART Community Meeting 5-7 Nov 13 - Session 3: transmart’s application t...
tranSMART Community Meeting 5-7 Nov 13 - Session 3: transmart’s application t...tranSMART Community Meeting 5-7 Nov 13 - Session 3: transmart’s application t...
tranSMART Community Meeting 5-7 Nov 13 - Session 3: transmart’s application t...
David Peyruc
 
tranSMART Community Meeting 5-7 Nov 13 - Session 3: Characterization of the c...
tranSMART Community Meeting 5-7 Nov 13 - Session 3: Characterization of the c...tranSMART Community Meeting 5-7 Nov 13 - Session 3: Characterization of the c...
tranSMART Community Meeting 5-7 Nov 13 - Session 3: Characterization of the c...
David Peyruc
 
tranSMART Community Meeting 5-7 Nov 13 - Session 5: eTRIKS - Science Driven D...
tranSMART Community Meeting 5-7 Nov 13 - Session 5: eTRIKS - Science Driven D...tranSMART Community Meeting 5-7 Nov 13 - Session 5: eTRIKS - Science Driven D...
tranSMART Community Meeting 5-7 Nov 13 - Session 5: eTRIKS - Science Driven D...
David Peyruc
 
tranSMART Community Meeting 5-7 Nov 13 - Session 3: Simulation in tranSMART
tranSMART Community Meeting 5-7 Nov 13 - Session 3: Simulation in tranSMARTtranSMART Community Meeting 5-7 Nov 13 - Session 3: Simulation in tranSMART
tranSMART Community Meeting 5-7 Nov 13 - Session 3: Simulation in tranSMART
David Peyruc
 
tranSMART Community Meeting 5-7 Nov 13 - Session 5: Recent tranSMART Lessons ...
tranSMART Community Meeting 5-7 Nov 13 - Session 5: Recent tranSMART Lessons ...tranSMART Community Meeting 5-7 Nov 13 - Session 5: Recent tranSMART Lessons ...
tranSMART Community Meeting 5-7 Nov 13 - Session 5: Recent tranSMART Lessons ...
David Peyruc
 
tranSMART Community Meeting 5-7 Nov 13 - Session 3: Pfizer’s Recent Use of tr...
tranSMART Community Meeting 5-7 Nov 13 - Session 3: Pfizer’s Recent Use of tr...tranSMART Community Meeting 5-7 Nov 13 - Session 3: Pfizer’s Recent Use of tr...
tranSMART Community Meeting 5-7 Nov 13 - Session 3: Pfizer’s Recent Use of tr...
David Peyruc
 

Viewers also liked (8)

tranSMART Community Meeting 5-7 Nov 13 - Session 5: Advancing tranSMART Analy...
tranSMART Community Meeting 5-7 Nov 13 - Session 5: Advancing tranSMART Analy...tranSMART Community Meeting 5-7 Nov 13 - Session 5: Advancing tranSMART Analy...
tranSMART Community Meeting 5-7 Nov 13 - Session 5: Advancing tranSMART Analy...
 
tranSMART Community Meeting 5-7 Nov 13 - Session 3: tranSMART a Data Warehous...
tranSMART Community Meeting 5-7 Nov 13 - Session 3: tranSMART a Data Warehous...tranSMART Community Meeting 5-7 Nov 13 - Session 3: tranSMART a Data Warehous...
tranSMART Community Meeting 5-7 Nov 13 - Session 3: tranSMART a Data Warehous...
 
tranSMART Community Meeting 5-7 Nov 13 - Session 3: transmart’s application t...
tranSMART Community Meeting 5-7 Nov 13 - Session 3: transmart’s application t...tranSMART Community Meeting 5-7 Nov 13 - Session 3: transmart’s application t...
tranSMART Community Meeting 5-7 Nov 13 - Session 3: transmart’s application t...
 
tranSMART Community Meeting 5-7 Nov 13 - Session 3: Characterization of the c...
tranSMART Community Meeting 5-7 Nov 13 - Session 3: Characterization of the c...tranSMART Community Meeting 5-7 Nov 13 - Session 3: Characterization of the c...
tranSMART Community Meeting 5-7 Nov 13 - Session 3: Characterization of the c...
 
tranSMART Community Meeting 5-7 Nov 13 - Session 5: eTRIKS - Science Driven D...
tranSMART Community Meeting 5-7 Nov 13 - Session 5: eTRIKS - Science Driven D...tranSMART Community Meeting 5-7 Nov 13 - Session 5: eTRIKS - Science Driven D...
tranSMART Community Meeting 5-7 Nov 13 - Session 5: eTRIKS - Science Driven D...
 
tranSMART Community Meeting 5-7 Nov 13 - Session 3: Simulation in tranSMART
tranSMART Community Meeting 5-7 Nov 13 - Session 3: Simulation in tranSMARTtranSMART Community Meeting 5-7 Nov 13 - Session 3: Simulation in tranSMART
tranSMART Community Meeting 5-7 Nov 13 - Session 3: Simulation in tranSMART
 
tranSMART Community Meeting 5-7 Nov 13 - Session 5: Recent tranSMART Lessons ...
tranSMART Community Meeting 5-7 Nov 13 - Session 5: Recent tranSMART Lessons ...tranSMART Community Meeting 5-7 Nov 13 - Session 5: Recent tranSMART Lessons ...
tranSMART Community Meeting 5-7 Nov 13 - Session 5: Recent tranSMART Lessons ...
 
tranSMART Community Meeting 5-7 Nov 13 - Session 3: Pfizer’s Recent Use of tr...
tranSMART Community Meeting 5-7 Nov 13 - Session 3: Pfizer’s Recent Use of tr...tranSMART Community Meeting 5-7 Nov 13 - Session 3: Pfizer’s Recent Use of tr...
tranSMART Community Meeting 5-7 Nov 13 - Session 3: Pfizer’s Recent Use of tr...
 

Similar to tranSMART Community Meeting 5-7 Nov 13 - Session 5: The Accelerated Cure Project MS Repository Dataset as a Case Study

Data analytics challenges in genomics
Data analytics challenges in genomicsData analytics challenges in genomics
Data analytics challenges in genomics
mikaelhuss
 
Evolution of Knowledge Discovery and Management
Evolution of Knowledge Discovery and Management Evolution of Knowledge Discovery and Management
Evolution of Knowledge Discovery and Management
inscit2006
 
Jax bio dataworldcongress.ngs.20181128finalwithoutbu
Jax bio dataworldcongress.ngs.20181128finalwithoutbuJax bio dataworldcongress.ngs.20181128finalwithoutbu
Jax bio dataworldcongress.ngs.20181128finalwithoutbu
Anne Deslattes Mays
 
20100509 bioinformatics kapushesky_lecture05_0
20100509 bioinformatics kapushesky_lecture05_020100509 bioinformatics kapushesky_lecture05_0
20100509 bioinformatics kapushesky_lecture05_0
Computer Science Club
 
Basics of Data Analysis in Bioinformatics
Basics of Data Analysis in BioinformaticsBasics of Data Analysis in Bioinformatics
Basics of Data Analysis in Bioinformatics
Elena Sügis
 
Quantitative Medicine Feb 2009
Quantitative Medicine Feb 2009Quantitative Medicine Feb 2009
Quantitative Medicine Feb 2009
Ian Foster
 
Challenges and opportunities for machine learning in biomedical research
Challenges and opportunities for machine learning in biomedical researchChallenges and opportunities for machine learning in biomedical research
Challenges and opportunities for machine learning in biomedical research
FranciscoJAzuajeG
 
Single-Cell Sequencing for Drug Discovery: Applications and Challenges
Single-Cell Sequencing for Drug Discovery: Applications and ChallengesSingle-Cell Sequencing for Drug Discovery: Applications and Challenges
Single-Cell Sequencing for Drug Discovery: Applications and Challenges
inside-BigData.com
 
Genome feature optimization and coronary artery disease prediction using cuck...
Genome feature optimization and coronary artery disease prediction using cuck...Genome feature optimization and coronary artery disease prediction using cuck...
Genome feature optimization and coronary artery disease prediction using cuck...
CSITiaesprime
 
COMPARISON AND EVALUATION DATA MINING TECHNIQUES IN THE DIAGNOSIS OF HEART DI...
COMPARISON AND EVALUATION DATA MINING TECHNIQUES IN THE DIAGNOSIS OF HEART DI...COMPARISON AND EVALUATION DATA MINING TECHNIQUES IN THE DIAGNOSIS OF HEART DI...
COMPARISON AND EVALUATION DATA MINING TECHNIQUES IN THE DIAGNOSIS OF HEART DI...
ijcsa
 
AAPM Foster July 2009
AAPM Foster July 2009AAPM Foster July 2009
AAPM Foster July 2009
Ian Foster
 
Heart disease prediction by using novel optimization algorithm_ A supervised ...
Heart disease prediction by using novel optimization algorithm_ A supervised ...Heart disease prediction by using novel optimization algorithm_ A supervised ...
Heart disease prediction by using novel optimization algorithm_ A supervised ...
BASMAJUMAASALEHALMOH
 
Heart Disease Prediction Using Data Mining Techniques
Heart Disease Prediction Using Data Mining TechniquesHeart Disease Prediction Using Data Mining Techniques
Heart Disease Prediction Using Data Mining Techniques
IJRES Journal
 
Survey on data mining techniques in heart disease prediction
Survey on data mining techniques in heart disease predictionSurvey on data mining techniques in heart disease prediction
Survey on data mining techniques in heart disease prediction
Sivagowry Shathesh
 
EUSFLAT 2019: explainable neuro fuzzy recurrent neural network to predict col...
EUSFLAT 2019: explainable neuro fuzzy recurrent neural network to predict col...EUSFLAT 2019: explainable neuro fuzzy recurrent neural network to predict col...
EUSFLAT 2019: explainable neuro fuzzy recurrent neural network to predict col...
Servio Fernando Lima Reina
 
AETIONOMY Overview AD/PD Conference 2015 Nice
AETIONOMY Overview AD/PD Conference 2015 NiceAETIONOMY Overview AD/PD Conference 2015 Nice
AETIONOMY Overview AD/PD Conference 2015 Nice
Martin Hofmann-Apitius
 
C0344023028
C0344023028C0344023028
C0344023028
inventionjournals
 
Acute coronary-syndrome-prediction-using-data-mining-techniques--an-application
Acute coronary-syndrome-prediction-using-data-mining-techniques--an-applicationAcute coronary-syndrome-prediction-using-data-mining-techniques--an-application
Acute coronary-syndrome-prediction-using-data-mining-techniques--an-application
Cemal Ardil
 
BioData World Basel 2018
BioData World Basel 2018BioData World Basel 2018
BioData World Basel 2018
Anne Deslattes Mays
 
Bioinformatics final
Bioinformatics finalBioinformatics final
Bioinformatics final
Rainu Rajeev
 

Similar to tranSMART Community Meeting 5-7 Nov 13 - Session 5: The Accelerated Cure Project MS Repository Dataset as a Case Study (20)

Data analytics challenges in genomics
Data analytics challenges in genomicsData analytics challenges in genomics
Data analytics challenges in genomics
 
Evolution of Knowledge Discovery and Management
Evolution of Knowledge Discovery and Management Evolution of Knowledge Discovery and Management
Evolution of Knowledge Discovery and Management
 
Jax bio dataworldcongress.ngs.20181128finalwithoutbu
Jax bio dataworldcongress.ngs.20181128finalwithoutbuJax bio dataworldcongress.ngs.20181128finalwithoutbu
Jax bio dataworldcongress.ngs.20181128finalwithoutbu
 
20100509 bioinformatics kapushesky_lecture05_0
20100509 bioinformatics kapushesky_lecture05_020100509 bioinformatics kapushesky_lecture05_0
20100509 bioinformatics kapushesky_lecture05_0
 
Basics of Data Analysis in Bioinformatics
Basics of Data Analysis in BioinformaticsBasics of Data Analysis in Bioinformatics
Basics of Data Analysis in Bioinformatics
 
Quantitative Medicine Feb 2009
Quantitative Medicine Feb 2009Quantitative Medicine Feb 2009
Quantitative Medicine Feb 2009
 
Challenges and opportunities for machine learning in biomedical research
Challenges and opportunities for machine learning in biomedical researchChallenges and opportunities for machine learning in biomedical research
Challenges and opportunities for machine learning in biomedical research
 
Single-Cell Sequencing for Drug Discovery: Applications and Challenges
Single-Cell Sequencing for Drug Discovery: Applications and ChallengesSingle-Cell Sequencing for Drug Discovery: Applications and Challenges
Single-Cell Sequencing for Drug Discovery: Applications and Challenges
 
Genome feature optimization and coronary artery disease prediction using cuck...
Genome feature optimization and coronary artery disease prediction using cuck...Genome feature optimization and coronary artery disease prediction using cuck...
Genome feature optimization and coronary artery disease prediction using cuck...
 
COMPARISON AND EVALUATION DATA MINING TECHNIQUES IN THE DIAGNOSIS OF HEART DI...
COMPARISON AND EVALUATION DATA MINING TECHNIQUES IN THE DIAGNOSIS OF HEART DI...COMPARISON AND EVALUATION DATA MINING TECHNIQUES IN THE DIAGNOSIS OF HEART DI...
COMPARISON AND EVALUATION DATA MINING TECHNIQUES IN THE DIAGNOSIS OF HEART DI...
 
AAPM Foster July 2009
AAPM Foster July 2009AAPM Foster July 2009
AAPM Foster July 2009
 
Heart disease prediction by using novel optimization algorithm_ A supervised ...
Heart disease prediction by using novel optimization algorithm_ A supervised ...Heart disease prediction by using novel optimization algorithm_ A supervised ...
Heart disease prediction by using novel optimization algorithm_ A supervised ...
 
Heart Disease Prediction Using Data Mining Techniques
Heart Disease Prediction Using Data Mining TechniquesHeart Disease Prediction Using Data Mining Techniques
Heart Disease Prediction Using Data Mining Techniques
 
Survey on data mining techniques in heart disease prediction
Survey on data mining techniques in heart disease predictionSurvey on data mining techniques in heart disease prediction
Survey on data mining techniques in heart disease prediction
 
EUSFLAT 2019: explainable neuro fuzzy recurrent neural network to predict col...
EUSFLAT 2019: explainable neuro fuzzy recurrent neural network to predict col...EUSFLAT 2019: explainable neuro fuzzy recurrent neural network to predict col...
EUSFLAT 2019: explainable neuro fuzzy recurrent neural network to predict col...
 
AETIONOMY Overview AD/PD Conference 2015 Nice
AETIONOMY Overview AD/PD Conference 2015 NiceAETIONOMY Overview AD/PD Conference 2015 Nice
AETIONOMY Overview AD/PD Conference 2015 Nice
 
C0344023028
C0344023028C0344023028
C0344023028
 
Acute coronary-syndrome-prediction-using-data-mining-techniques--an-application
Acute coronary-syndrome-prediction-using-data-mining-techniques--an-applicationAcute coronary-syndrome-prediction-using-data-mining-techniques--an-application
Acute coronary-syndrome-prediction-using-data-mining-techniques--an-application
 
BioData World Basel 2018
BioData World Basel 2018BioData World Basel 2018
BioData World Basel 2018
 
Bioinformatics final
Bioinformatics finalBioinformatics final
Bioinformatics final
 

More from David Peyruc

tranSMART Community Meeting 5-7 Nov 13 - Session 3: Modularization (Plug‐Ins,...
tranSMART Community Meeting 5-7 Nov 13 - Session 3: Modularization (Plug‐Ins,...tranSMART Community Meeting 5-7 Nov 13 - Session 3: Modularization (Plug‐Ins,...
tranSMART Community Meeting 5-7 Nov 13 - Session 3: Modularization (Plug‐Ins,...
David Peyruc
 
Community
CommunityCommunity
Community
David Peyruc
 
tranSMART Community Meeting 5-7 Nov 13 - Session 3: tranSMART and the One Min...
tranSMART Community Meeting 5-7 Nov 13 - Session 3: tranSMART and the One Min...tranSMART Community Meeting 5-7 Nov 13 - Session 3: tranSMART and the One Min...
tranSMART Community Meeting 5-7 Nov 13 - Session 3: tranSMART and the One Min...
David Peyruc
 
tranSMART Community Meeting 5-7 Nov 13 - Session 3: transmart-data
tranSMART Community Meeting 5-7 Nov 13 - Session 3: transmart-datatranSMART Community Meeting 5-7 Nov 13 - Session 3: transmart-data
tranSMART Community Meeting 5-7 Nov 13 - Session 3: transmart-data
David Peyruc
 
tranSMART Community Meeting 5-7 Nov 13 - Session 3: Clinical Biomarker Discovery
tranSMART Community Meeting 5-7 Nov 13 - Session 3: Clinical Biomarker DiscoverytranSMART Community Meeting 5-7 Nov 13 - Session 3: Clinical Biomarker Discovery
tranSMART Community Meeting 5-7 Nov 13 - Session 3: Clinical Biomarker Discovery
David Peyruc
 
tranSMART Community Meeting 5-7 Nov 13 - Session 2: Creating a Comprehensive ...
tranSMART Community Meeting 5-7 Nov 13 - Session 2: Creating a Comprehensive ...tranSMART Community Meeting 5-7 Nov 13 - Session 2: Creating a Comprehensive ...
tranSMART Community Meeting 5-7 Nov 13 - Session 2: Creating a Comprehensive ...
David Peyruc
 
tranSMART Community Meeting 5-7 Nov 13 - Session 1: Translational Drug Disco...
tranSMART Community Meeting 5-7 Nov 13 - Session 1:  Translational Drug Disco...tranSMART Community Meeting 5-7 Nov 13 - Session 1:  Translational Drug Disco...
tranSMART Community Meeting 5-7 Nov 13 - Session 1: Translational Drug Disco...
David Peyruc
 
tranSMART Community Meeting 5-7 Nov 13 - Session 1: Chilly-Mazarin Meeting Ob...
tranSMART Community Meeting 5-7 Nov 13 - Session 1: Chilly-Mazarin Meeting Ob...tranSMART Community Meeting 5-7 Nov 13 - Session 1: Chilly-Mazarin Meeting Ob...
tranSMART Community Meeting 5-7 Nov 13 - Session 1: Chilly-Mazarin Meeting Ob...
David Peyruc
 
tranSMART Community Meeting 5-7 Nov 13 - Session 1: Amsterdam Community Meeti...
tranSMART Community Meeting 5-7 Nov 13 - Session 1: Amsterdam Community Meeti...tranSMART Community Meeting 5-7 Nov 13 - Session 1: Amsterdam Community Meeti...
tranSMART Community Meeting 5-7 Nov 13 - Session 1: Amsterdam Community Meeti...
David Peyruc
 

More from David Peyruc (9)

tranSMART Community Meeting 5-7 Nov 13 - Session 3: Modularization (Plug‐Ins,...
tranSMART Community Meeting 5-7 Nov 13 - Session 3: Modularization (Plug‐Ins,...tranSMART Community Meeting 5-7 Nov 13 - Session 3: Modularization (Plug‐Ins,...
tranSMART Community Meeting 5-7 Nov 13 - Session 3: Modularization (Plug‐Ins,...
 
Community
CommunityCommunity
Community
 
tranSMART Community Meeting 5-7 Nov 13 - Session 3: tranSMART and the One Min...
tranSMART Community Meeting 5-7 Nov 13 - Session 3: tranSMART and the One Min...tranSMART Community Meeting 5-7 Nov 13 - Session 3: tranSMART and the One Min...
tranSMART Community Meeting 5-7 Nov 13 - Session 3: tranSMART and the One Min...
 
tranSMART Community Meeting 5-7 Nov 13 - Session 3: transmart-data
tranSMART Community Meeting 5-7 Nov 13 - Session 3: transmart-datatranSMART Community Meeting 5-7 Nov 13 - Session 3: transmart-data
tranSMART Community Meeting 5-7 Nov 13 - Session 3: transmart-data
 
tranSMART Community Meeting 5-7 Nov 13 - Session 3: Clinical Biomarker Discovery
tranSMART Community Meeting 5-7 Nov 13 - Session 3: Clinical Biomarker DiscoverytranSMART Community Meeting 5-7 Nov 13 - Session 3: Clinical Biomarker Discovery
tranSMART Community Meeting 5-7 Nov 13 - Session 3: Clinical Biomarker Discovery
 
tranSMART Community Meeting 5-7 Nov 13 - Session 2: Creating a Comprehensive ...
tranSMART Community Meeting 5-7 Nov 13 - Session 2: Creating a Comprehensive ...tranSMART Community Meeting 5-7 Nov 13 - Session 2: Creating a Comprehensive ...
tranSMART Community Meeting 5-7 Nov 13 - Session 2: Creating a Comprehensive ...
 
tranSMART Community Meeting 5-7 Nov 13 - Session 1: Translational Drug Disco...
tranSMART Community Meeting 5-7 Nov 13 - Session 1:  Translational Drug Disco...tranSMART Community Meeting 5-7 Nov 13 - Session 1:  Translational Drug Disco...
tranSMART Community Meeting 5-7 Nov 13 - Session 1: Translational Drug Disco...
 
tranSMART Community Meeting 5-7 Nov 13 - Session 1: Chilly-Mazarin Meeting Ob...
tranSMART Community Meeting 5-7 Nov 13 - Session 1: Chilly-Mazarin Meeting Ob...tranSMART Community Meeting 5-7 Nov 13 - Session 1: Chilly-Mazarin Meeting Ob...
tranSMART Community Meeting 5-7 Nov 13 - Session 1: Chilly-Mazarin Meeting Ob...
 
tranSMART Community Meeting 5-7 Nov 13 - Session 1: Amsterdam Community Meeti...
tranSMART Community Meeting 5-7 Nov 13 - Session 1: Amsterdam Community Meeti...tranSMART Community Meeting 5-7 Nov 13 - Session 1: Amsterdam Community Meeti...
tranSMART Community Meeting 5-7 Nov 13 - Session 1: Amsterdam Community Meeti...
 

Recently uploaded

Cardiac Assessment for B.sc Nursing Student.pdf
Cardiac Assessment for B.sc Nursing Student.pdfCardiac Assessment for B.sc Nursing Student.pdf
Cardiac Assessment for B.sc Nursing Student.pdf
shivalingatalekar1
 
Osteoporosis - Definition , Evaluation and Management .pdf
Osteoporosis - Definition , Evaluation and Management .pdfOsteoporosis - Definition , Evaluation and Management .pdf
Osteoporosis - Definition , Evaluation and Management .pdf
Jim Jacob Roy
 
Ear and its clinical correlations By Dr. Rabia Inam Gandapore.pptx
Ear and its clinical correlations By Dr. Rabia Inam Gandapore.pptxEar and its clinical correlations By Dr. Rabia Inam Gandapore.pptx
Ear and its clinical correlations By Dr. Rabia Inam Gandapore.pptx
Dr. Rabia Inam Gandapore
 
CHEMOTHERAPY_RDP_CHAPTER 2 _LEPROSY.pdf1
CHEMOTHERAPY_RDP_CHAPTER 2 _LEPROSY.pdf1CHEMOTHERAPY_RDP_CHAPTER 2 _LEPROSY.pdf1
CHEMOTHERAPY_RDP_CHAPTER 2 _LEPROSY.pdf1
rishi2789
 
Medical Quiz ( Online Quiz for API Meet 2024 ).pdf
Medical Quiz ( Online Quiz for API Meet 2024 ).pdfMedical Quiz ( Online Quiz for API Meet 2024 ).pdf
Medical Quiz ( Online Quiz for API Meet 2024 ).pdf
Jim Jacob Roy
 
Histololgy of Female Reproductive System.pptx
Histololgy of Female Reproductive System.pptxHistololgy of Female Reproductive System.pptx
Histololgy of Female Reproductive System.pptx
AyeshaZaid1
 
Clinic ^%[+27633867063*Abortion Pills For Sale In Tembisa Central
Clinic ^%[+27633867063*Abortion Pills For Sale In Tembisa CentralClinic ^%[+27633867063*Abortion Pills For Sale In Tembisa Central
Clinic ^%[+27633867063*Abortion Pills For Sale In Tembisa Central
19various
 
Identifying Major Symptoms of Slip Disc.
 Identifying Major Symptoms of Slip Disc. Identifying Major Symptoms of Slip Disc.
Identifying Major Symptoms of Slip Disc.
Gokuldas Hospital
 
Hiranandani Hospital Powai News [Read Now].pdf
Hiranandani Hospital Powai News [Read Now].pdfHiranandani Hospital Powai News [Read Now].pdf
Hiranandani Hospital Powai News [Read Now].pdf
Dr. Sujit Chatterjee CEO Hiranandani Hospital
 
Cell Therapy Expansion and Challenges in Autoimmune Disease
Cell Therapy Expansion and Challenges in Autoimmune DiseaseCell Therapy Expansion and Challenges in Autoimmune Disease
Cell Therapy Expansion and Challenges in Autoimmune Disease
Health Advances
 
Journal Article Review on Rasamanikya
Journal Article Review on RasamanikyaJournal Article Review on Rasamanikya
Journal Article Review on Rasamanikya
Dr. Jyothirmai Paindla
 
Vestibulocochlear Nerve by Dr. Rabia Inam Gandapore.pptx
Vestibulocochlear Nerve by Dr. Rabia Inam Gandapore.pptxVestibulocochlear Nerve by Dr. Rabia Inam Gandapore.pptx
Vestibulocochlear Nerve by Dr. Rabia Inam Gandapore.pptx
Dr. Rabia Inam Gandapore
 
Hemodialysis: Chapter 4, Dialysate Circuit - Dr.Gawad
Hemodialysis: Chapter 4, Dialysate Circuit - Dr.GawadHemodialysis: Chapter 4, Dialysate Circuit - Dr.Gawad
Hemodialysis: Chapter 4, Dialysate Circuit - Dr.Gawad
NephroTube - Dr.Gawad
 
Integrating Ayurveda into Parkinson’s Management: A Holistic Approach
Integrating Ayurveda into Parkinson’s Management: A Holistic ApproachIntegrating Ayurveda into Parkinson’s Management: A Holistic Approach
Integrating Ayurveda into Parkinson’s Management: A Holistic Approach
Ayurveda ForAll
 
Artificial Intelligence Symposium (THAIS)
Artificial Intelligence Symposium (THAIS)Artificial Intelligence Symposium (THAIS)
Artificial Intelligence Symposium (THAIS)
Josep Vidal-Alaball
 
The Nervous and Chemical Regulation of Respiration
The Nervous and Chemical Regulation of RespirationThe Nervous and Chemical Regulation of Respiration
The Nervous and Chemical Regulation of Respiration
MedicoseAcademics
 
share - Lions, tigers, AI and health misinformation, oh my!.pptx
share - Lions, tigers, AI and health misinformation, oh my!.pptxshare - Lions, tigers, AI and health misinformation, oh my!.pptx
share - Lions, tigers, AI and health misinformation, oh my!.pptx
Tina Purnat
 
OCT Training Course for clinical practice Part 1
OCT Training Course for clinical practice Part 1OCT Training Course for clinical practice Part 1
OCT Training Course for clinical practice Part 1
KafrELShiekh University
 
Physical demands in sports - WCSPT Oslo 2024
Physical demands in sports - WCSPT Oslo 2024Physical demands in sports - WCSPT Oslo 2024
Physical demands in sports - WCSPT Oslo 2024
Torstein Dalen-Lorentsen
 
K CỔ TỬ CUNG.pdf tự ghi chép, chữ hơi xấu
K CỔ TỬ CUNG.pdf tự ghi chép, chữ hơi xấuK CỔ TỬ CUNG.pdf tự ghi chép, chữ hơi xấu
K CỔ TỬ CUNG.pdf tự ghi chép, chữ hơi xấu
HongBiThi1
 

Recently uploaded (20)

Cardiac Assessment for B.sc Nursing Student.pdf
Cardiac Assessment for B.sc Nursing Student.pdfCardiac Assessment for B.sc Nursing Student.pdf
Cardiac Assessment for B.sc Nursing Student.pdf
 
Osteoporosis - Definition , Evaluation and Management .pdf
Osteoporosis - Definition , Evaluation and Management .pdfOsteoporosis - Definition , Evaluation and Management .pdf
Osteoporosis - Definition , Evaluation and Management .pdf
 
Ear and its clinical correlations By Dr. Rabia Inam Gandapore.pptx
Ear and its clinical correlations By Dr. Rabia Inam Gandapore.pptxEar and its clinical correlations By Dr. Rabia Inam Gandapore.pptx
Ear and its clinical correlations By Dr. Rabia Inam Gandapore.pptx
 
CHEMOTHERAPY_RDP_CHAPTER 2 _LEPROSY.pdf1
CHEMOTHERAPY_RDP_CHAPTER 2 _LEPROSY.pdf1CHEMOTHERAPY_RDP_CHAPTER 2 _LEPROSY.pdf1
CHEMOTHERAPY_RDP_CHAPTER 2 _LEPROSY.pdf1
 
Medical Quiz ( Online Quiz for API Meet 2024 ).pdf
Medical Quiz ( Online Quiz for API Meet 2024 ).pdfMedical Quiz ( Online Quiz for API Meet 2024 ).pdf
Medical Quiz ( Online Quiz for API Meet 2024 ).pdf
 
Histololgy of Female Reproductive System.pptx
Histololgy of Female Reproductive System.pptxHistololgy of Female Reproductive System.pptx
Histololgy of Female Reproductive System.pptx
 
Clinic ^%[+27633867063*Abortion Pills For Sale In Tembisa Central
Clinic ^%[+27633867063*Abortion Pills For Sale In Tembisa CentralClinic ^%[+27633867063*Abortion Pills For Sale In Tembisa Central
Clinic ^%[+27633867063*Abortion Pills For Sale In Tembisa Central
 
Identifying Major Symptoms of Slip Disc.
 Identifying Major Symptoms of Slip Disc. Identifying Major Symptoms of Slip Disc.
Identifying Major Symptoms of Slip Disc.
 
Hiranandani Hospital Powai News [Read Now].pdf
Hiranandani Hospital Powai News [Read Now].pdfHiranandani Hospital Powai News [Read Now].pdf
Hiranandani Hospital Powai News [Read Now].pdf
 
Cell Therapy Expansion and Challenges in Autoimmune Disease
Cell Therapy Expansion and Challenges in Autoimmune DiseaseCell Therapy Expansion and Challenges in Autoimmune Disease
Cell Therapy Expansion and Challenges in Autoimmune Disease
 
Journal Article Review on Rasamanikya
Journal Article Review on RasamanikyaJournal Article Review on Rasamanikya
Journal Article Review on Rasamanikya
 
Vestibulocochlear Nerve by Dr. Rabia Inam Gandapore.pptx
Vestibulocochlear Nerve by Dr. Rabia Inam Gandapore.pptxVestibulocochlear Nerve by Dr. Rabia Inam Gandapore.pptx
Vestibulocochlear Nerve by Dr. Rabia Inam Gandapore.pptx
 
Hemodialysis: Chapter 4, Dialysate Circuit - Dr.Gawad
Hemodialysis: Chapter 4, Dialysate Circuit - Dr.GawadHemodialysis: Chapter 4, Dialysate Circuit - Dr.Gawad
Hemodialysis: Chapter 4, Dialysate Circuit - Dr.Gawad
 
Integrating Ayurveda into Parkinson’s Management: A Holistic Approach
Integrating Ayurveda into Parkinson’s Management: A Holistic ApproachIntegrating Ayurveda into Parkinson’s Management: A Holistic Approach
Integrating Ayurveda into Parkinson’s Management: A Holistic Approach
 
Artificial Intelligence Symposium (THAIS)
Artificial Intelligence Symposium (THAIS)Artificial Intelligence Symposium (THAIS)
Artificial Intelligence Symposium (THAIS)
 
The Nervous and Chemical Regulation of Respiration
The Nervous and Chemical Regulation of RespirationThe Nervous and Chemical Regulation of Respiration
The Nervous and Chemical Regulation of Respiration
 
share - Lions, tigers, AI and health misinformation, oh my!.pptx
share - Lions, tigers, AI and health misinformation, oh my!.pptxshare - Lions, tigers, AI and health misinformation, oh my!.pptx
share - Lions, tigers, AI and health misinformation, oh my!.pptx
 
OCT Training Course for clinical practice Part 1
OCT Training Course for clinical practice Part 1OCT Training Course for clinical practice Part 1
OCT Training Course for clinical practice Part 1
 
Physical demands in sports - WCSPT Oslo 2024
Physical demands in sports - WCSPT Oslo 2024Physical demands in sports - WCSPT Oslo 2024
Physical demands in sports - WCSPT Oslo 2024
 
K CỔ TỬ CUNG.pdf tự ghi chép, chữ hơi xấu
K CỔ TỬ CUNG.pdf tự ghi chép, chữ hơi xấuK CỔ TỬ CUNG.pdf tự ghi chép, chữ hơi xấu
K CỔ TỬ CUNG.pdf tự ghi chép, chữ hơi xấu
 

tranSMART Community Meeting 5-7 Nov 13 - Session 5: The Accelerated Cure Project MS Repository Dataset as a Case Study

  • 1. The ACP MS Repository A Case Study tranSMART Community Meeting, Nov. 2013 Stephen Wicks, Ph.D.
  • 2. The ACP Repository: a Case Study What is Multiple Sclerosis?      Chronic inflammatory/demyelination disorder affecting the CNS. (about 0.1%) Leading cause of neurological disability in young adults. Symptoms are variable and significant. They include vision, cognition, locomotion, pain, disorientation, dexterity, mood, bowel/bladder control, others. Generally progressive, but progression is idiosyncratic. (CISRRMSSPMS, vs. CISPPMS etc.) Complex etiology
  • 3. The ACP Repository: a Case Study What is the cost of MS?  Difficult and costly to diagnose (MRI, symptom variability leads to extensive differential diagnosis)  Treatments can slow progression, but are expensive.  Many different drugs exist, but patient stratification for maximum efficacy and minimum side effects is non-existent. “Role the dice”  Often strikes early in life, and is a life-long disability.  Average Diagnosis at about 30. 5% before 16.
  • 4. Orion Bionetworks  ACP is a founding member of Orion.  Orion seeks to cure MS by harnessing the power of computational modeling of disease progression.  ACP will provide its data to Orion in tranSMART to facilitate this goal.  Rancho BioSciences will curate and harmonize the ACP data for Orion
  • 5. The ACP Repository: a Case Study ACP and the MS Repository  Founded in 2001 by an MIT entrepreneur with MS  ACP MS Repository started in 2006. The goal was to identify the cause of MS.  ACP MS Repository enrollment shut down this year. Approximately 3200 participants enrolled.  Biosamples, demographics, medical history etc.  Research data  OPT-UP
  • 6. The ACP Repository: a Case Study Repository Enrollment Status (6/21/2013) • 3,220 subjects enrolled; 467 longitudinal visits completed Subject Breakdown Treatment Status Subjects: 3220 TX+ TX Naïve TX- Time Since Diagnosis Pending < 5 yrs 5-10 yrs > 10 yrs Pending RRMS:1436 146 229 116 675 388 291 82 SPMS:234 164 53 13 4 27 58 149 0 PPMS:117 46 38 31 2 48 37 32 0 CIS:88 19 2 62 5 74 5 5 4 TM:175 12 17 131 15 106 32 24 13 NMO:383 200 66 39 78 252 52 13 66 ON:16 1 1 12 2 12 1 0 3 ADEM:32 MS: 1787 945 1 6 20 5 24 5 1 2 Cases: 2481 Controls: 739 • DNA, RNA, Plasma, Serum, PBMCs + data from 52 page CRF 6
  • 7. The ACP Repository: a Case Study The ACP Engine “Matchmaker” Database Graphical User Interface MS Discovery Forum Reviewing Developments in the MS Field Communicating with MS Researchers Allowing MS Researchers Worldwide to Explore the ACP Repository Database ACP Repository $13 million Invested 3200+ participants Biosamples & Datasets 77 sets of biosamples+data (b,m)illions of datapoints, From 36 studies, so far MS Researchers Worldwide Academia & Industry Insights and Results Mechanisms Diagnostics Causes Treatments
  • 8. The ACP Repository: a Case Study ACP MS Repository Open-access collection of highly annotated bloodderived samples plus data from MS, related diseases, & control subjects gathered from 2006-2013.  Requirement for research data derived from samples to be with them (ACP) allowed us to obtain IP protection). “Workingdeposited (with a provision forcritical samples and confirm our results for only $20,000. If I had to obtain these samples from scratch, it would  Contributes to MS+ research in many ways: have cost $1 million and added 5 years to the project.”  - Enables studies that might not be conducted School of Thomas M. Aune, PhD, Molecular Biology, Vanderbilt University Medicine otherwise (academic & commercial) (from Scientific American)  Creates a common results database for studies from multiple bio-analytical techniques on overlapping sets of subjects.  Approximately 3200 participants. 
  • 9. The ACP Repository: a Case Study Case Report Form (CRF) Curation challenges
  • 10. The ACP Repository: a Case Study ACP Case Report Form 48 Page (first visit) and 38 page (second visit) complete clinical workup  Form completed with the assistance of a clinical research associate over a several hour interview (with sample draw and lab workup)  Broad data: 80 distinct tables in an SQL database  Deep data: in flat data files, more than 20 million cells 
  • 11. The ACP Repository: a Case Study CRF Sample Fields Study drugs Drug”measurelethaly so) drug meaningless. 103 distinct textual responses.“CS-0777”, orordinal with betaseron, BETASERON, etc. Illustrates some Nothe problems associated order was units dataset. Inappropriate (sometimesdrugfrequency. of consistent etc… of trail enrollment “First Drug”, “Second “Betseron”, beta-seron, curating this “BG00012 (FUMARATE) OR PLACEBO”
  • 12. The ACP Repository: a Case Study DMD Curation Solutions  We applied drug ontologies and mapping vocabularies where needed.  We repaired and consolidated dose, frequency, etc. to a single measure with 3 values (high, standard, low)  We re-formatted the data to eliminate the ambiguous cardinal ordering of reporting
  • 13. The ACP Repository: a Case Study CRF Sample Fields Multiple Drugs (Observations) were addressed with…
  • 14. The ACP Repository: a Case Study VISIT_NAME application
  • 15. The ACP Repository: a Case Study Controlled Vocabularies (sports)  ~5000 responses  779 distinct sports reported  When filtered by “ski”,29  “gym”, 45  “walk”, 30; “jog”, 17, “run”, 40
  • 16. The ACP Repository: a Case Study Controlled Vocabularies (sports) All sports mapped to a 29 term vocabulary.
  • 17. The ACP Repository: a Case Study Controlled Vocabularies (pets)  ~6500 pets reported  600 distinct pets reported  When filtered by “dog”, 112, however, this misses mispellings (“diog”, “dot”, “pubs”, dog-like pets “wolf”, “half-wolf”, “mutt”, and breeds (“poddle”, “poodle”, “Afghan Hound”, etc.)  59 additional dog-like entries
  • 18. The ACP Repository: a Case Study Controlled Vocabularies (pets) All pets mapped to a 31 category controlled vocabulary
  • 19. The ACP Repository: a Case Study Medication Curation Challenges       Amitriptaline >10,000 medications listed. Amitriptylin Amitriptyline 2703 distinct medications listed. Amitriptyline HCL Mapped these to 614 real medications (e.g. Amitroptyline Amitriptyline) Amitryetyline This was split into twoAmitrypatiline tables: Amitryptailine  Continuing Medications (541 entities) Amitryptaline  Stopped Medications (317 entities) Amitryptilin VISIT_NAME was used Amitryptiline distinct observations to represent across the whole study Amitryptilline Amitryptylene Truly longitudinal measures were reified in the tree hierarchy in the data Amitryptylinefile. mapping
  • 20. The ACP Repository: a Case Study ACP Repository Tree
  • 21. The ACP Repository: a Case Study Date and Time Coding  All Dates in multiple formats: 15/12/2001 15/Dec/2001 Dec-2001 2001 Dec./2001 12/2001 --/--/-----/2001 ------------/12/2001 dates converted to periods (Months, Years, or Days) prior to the relevant blood draw date.  Dates were represented by International Standard ISO 8601. i.e. YYYY-MM-DD (e.g. 200112-15)
  • 22. The ACP Repository: a Case Study Repository Usage    77 studies ongoing or completed 36 studies have returned data to ACP Data types:       Low-D biomarker (antibodies, metabolites, serum markers of inflammation, etc.) Low-D genotype data High-D SNP/GWAS data Gene-expression studies Whole-genome sequencing (2 distinct studies) Study types:    Etiology Diagnostics Disease activity biomarkers
  • 23. The ACP Repository: a Case Study Repository Usage
  • 24. The ACP Repository: a Case Study Research Data Curation Challenges  Few guidelines provided to researchers for data formatting or treatment  Often little or no documentation describing how the data was generated or handled (raw vs. normalized, transformations e.g.)  Load study meta-data (contact info, description, etc. at the node level)
  • 25. The ACP Repository: a Case Study Sample Study Results Biogen gene expression study: Designed to identify gene-expression profiles that discriminate progressive forms of MS from relapsing-remitting forms of the disease.
  • 26. The ACP Repository: a Case Study Future Directions     Rancho BioSciences is providing guidance to ACP for data-collection practices going forward (e.g. OPT-UP) We loaded the clinical data and 6 sample study datasets into an Oracle-based tranSMART instance that we host in-house for QC purposes. The full dataset is slated to be loaded into a 1.1 postgreSQL-based tranSMART instance (hosted by Recombinant by Deloitte for Orion). This and other data sources (Inst. For Neuroscience at B&W) will be analyzed and modeled by Orion
  • 27. The ACP Repository: a Case Study Thanks for your time! Questions?

Editor's Notes

  1. Ethos of ACP = removing barriers to research. The first challenge was to create a new type of infrastructure for research to get conducted within. They say the journey of a thousand miles begins with a single step – for us the beginning of the journey was with a single study.