SlideShare a Scribd company logo
1 of 29
Download to read offline
© 2017 Riffyn Inc hello@riffyn.com
Never Miss a Discovery
PREP MEDIA
THAW CELLS INOCULATE
ADD
TREATMENT
INCUBATE
IMMUNO
ASSAY
MEASURE
CELL COUNT
0.1954
0.2122
0.2397
0.1823 10.589
10.897
2.6349
10.732
2.5732
2.7589
11.344
2.6349
© 2017 Riffyn Inc hello@riffyn.com
* Error rate = R&D results that fail to reproduce, leading to wasted
resources, failed tech transfers and missed discoveries.
Life Sciences
Chemicals/Materials
Energy
Food/AG
Mining and Metals
Pulp, Paper, Textiles
Water/Wastewater
Other
0 50 100 150 200 250 300
Sources:
2013 R&D Magazine Global Funding Forecast
NSF Science and Engineering Indicators 2012
CIA World Fact Book
Prinz, et al., Believe it or not: how much can we rely on published data on potential drug targets?, Nat. Rev. Drug Disc., 2011
Begley & Ellis, Raise Standards for Preclinical Cancer Research, Nature, 483, 2012
Ioannidis & Khoury, Improving Validation Practices in “Omics” Research, Science, 334, 2011
Halford, The Second Annual State of Translational Research, Sigma-Aldrich / AAAS, 2014
Freedman, et al., The Economics of Reproducibility in Preclinical Research, PLOS Biology, 2015
R&D Spend
$420B
R&D losses
$100B>25%
error rate*
Annual R&D Spend ($B)
The problem in science today: >$100B of lost R&D productivity each year
© 2017 Riffyn Inc hello@riffyn.com
Poor data utilization is accepted as a price for flexibility and innovation
A	lot	of	today’s	operations	are	achieved	through	trial	
and	error	and	“tribal	knowledge”
Scientist	at	a	global	ethanol	producer
© 2017 Riffyn Inc hello@riffyn.com
“We can’t determine which variables matter.”
“Data context is only in people’s head.”
“Process transfer is unreliable.”
This is the consequence of data fragmentation in the daily life of a scientist
© 2017 Riffyn Inc hello@riffyn.com
Communication in science today is a bit like this
“The problem in science
today isn’t recording data,
it’s integrating methods
and results so we can do
science.”
© 2017 Riffyn Inc hello@riffyn.com
Today, 80% of time is wasted wrangling scientific data, not analyzing it
DATA
STRUCTURE
QUALITY
CONTROL
ANALYZE
Machine LearningExperiment
80% of a data
scientist’s effort
20% of a data
scientist’s effort
DATA WRANGLING
© 2017 Riffyn Inc hello@riffyn.com
This is a wrangler
© 2017 Riffyn Inc hello@riffyn.com
For	one	paper,	securing	the	necessary	data	took	a	year.	And	the	authors	of	four	other	
papers	have	stopped	communicating	with	the	project	altogether.	Most	authors	were	
happy	to	collaborate,	but	it	has	taken	longer	than	expected	to	locate	the	relevant	data.
R.	Van	Noorden,	Sluggish	data	sharing	hampers	reproducibility	effort,	Nature	News,	3	June	2015
http://www.nature.com/news/sluggish-data-sharing-hampers-reproducibility-effort-1.17694
This is a data wrangler
© 2017 Riffyn Inc hello@riffyn.com
This where scientific data analytics needs to be
80% of your time analyzing data for scientific breakthroughs, not wrangling data
DATA ANALYZE
Machine-aided Design & Learning
80% of a data
scientist’s effort
STRUCTURE
QUALITY
BY	DESIGN
SCIENTIFIC
DESIGN
20% of a data
scientist’s effort
© 2017 Riffyn Inc hello@riffyn.com
Why are we buried in data and starving for information?
© 2017 Riffyn Inc hello@riffyn.com
This is how R&D is performed and communicated today
Artisanal know-how is stuck in 400-year-old concepts for scientific documentation and data
TECH	TRANSFER	/	SCALE-UP
Ad	hoc	knowledge	
capture/transfer	between	
R&D	to	Manufacturing
EARLY	R&D
Un-computable	&	
ambiguous	protocols
© 2017 Riffyn Inc hello@riffyn.com
We are starving for information because …
we cannot access computable data sets, and they’re not annotated with methodological context
From	Riffyn	2014	survey	of	life	science	and	chemical	companies
R&D	issue %	with	issue
Data	cannot	be	integrated,	is	hard	to	find,	or	is	not	annotated 66%
Cannot	assess	the	impact	of	process	changes 52%
Data	is	unstructured	or	incomplete 48%
© 2017 Riffyn Inc hello@riffyn.com
The poor access to data and context means that …
errors accumulate because we can’t identify their causes, & progress is slowed
1X R&D pace
2007 2008 2009 2010
2X R&D pace
6X error reduction
Year
Reducing error 6X in screening process doubles the rate of cell line yield improvement
Source: Gardner, TS (2013) Trends in Biotech. 31:3, 123-125.
BEFORE VARIANCE
REDUCTION
AFTER VARIANCE
REDUCTION
30% relative error
5% relative error
Productyieldofastrain
© 2017 Riffyn Inc hello@riffyn.com
Fragmented data and accumulation of errors means that …
discoveries get lost and buried in a sea of disjointed data
Parameter X
ParameterY
Data from a single experiment
Data from a 100s of experiments
Linking data across experiments identifies critical process parameters
and unexpected correlations
© 2017 Riffyn Inc hello@riffyn.com
How do we find information in our data?
© 2017 Riffyn Inc hello@riffyn.com
It starts by recognizing today’s scientific paradigm is missing ”blueprints”
Thus the most relevant methodological data is stuck in people’s heads
“We work on what we know, not what matters.”
© 2017 Riffyn Inc hello@riffyn.com
A metaphor: evolution of geographic information systems toward CAD
c. 1700s c. 1800s
© 2017 Riffyn Inc hello@riffyn.com
Geographic information systems
Digital, visual, living documents with integrated data -> a foundation for AI
c. 2000s (Maps) c. 2010s (AI)
© 2017 Riffyn Inc hello@riffyn.com
Never Miss a Discovery
The Riffyn Scientific Development Environment (SDE) is a cloud-
based application for reproducible science and faster discovery
Design, execute, and share scientific experiments as visual,
computable data sets for interactive analysis and machine learning
© 2017 Riffyn Inc hello@riffyn.com
Riffyn is transforming scientific “spreadsheet hell” …
© 2017 Riffyn Inc hello@riffyn.com
…to digital scientific process maps & linked data for statistical learning
© 2017 Riffyn Inc hello@riffyn.com
Riffyn Scientific Development Environment
Flexible process design, analysis and improvement across the scientific development lifecycle
Riffyn MapTM
Where you design your
processes, workflows &
experiments
Riffyn TrackTM
Where you track your
samples, equipment
and measurement data
across your workflow
Riffyn DiscoverTM
Where you pick hits,
eliminate variation, and
establish cause & effect
Riffyn ShareTM
Where you collaborate, review,
and transfer all your methods
and results
Riffyn BridgeTM
where you connect data sources including legacy
applications, enterprise systems, and devices
© 2017 Riffyn Inc hello@riffyn.com
Design
Riffyn MapTM
© 2017 Riffyn Inc hello@riffyn.com
Measure
Riffyn TrackTM
© 2017 Riffyn Inc hello@riffyn.com
Riffyn DiscoverTM
15mL
250 mL
250 mL
50 L
15mL
250 mL
250 mL
50 L
Analyze
© 2017 Riffyn Inc hello@riffyn.com
Share
Riffyn ShareTM
© 2017 Riffyn Inc hello@riffyn.com
DRAG-AND-DROP	EXPERIMENT	DESIGN	
AUTOMATICALLY	LINKS	METHODS	TO	
RESULTS
VISUAL	DESIGN
DATA	IS	PRESENTED	IN	A	STATISTICAL	DATA	
FRAME	READY	FOR	VISUALIZATION,	QUALITY	
ANALYSIS,	CORRELATION	&	MACHINE	LEARNING
REAL-TIME	DATA	INTEGRATION
INACCESSIBLE	METHODS DATA	FRAGMENTATION
Riffyn Scientific Development Environment
solves solves
Addresses 3 critical barriers to reproducible outcomes & faster product development
“The problem isn’t capturing data, it’s integrating results so we can do science”
ONLINE	AND	OFFLINE	DATA	CAPTURED	AND	
INSTANTLY	INTEGRATED	FOR	IMMEDIATE	
INTERROGATION
ANALYSIS	IN	SECONDS
EXPERIMENTAL	ERROR	&	
MISSED	DISCOVERIES
solves
© 2017 Riffyn Inc hello@riffyn.com
EXPERIMENT	DATA	TABLE
SINGLE-VERSION	DATA	TABLE
SINGLE-VERSION	DATA	TABLE
SINGLE-VERSION	DATA	
TABLE
EXPERIMENT	DATA	TABLE
SINGLE-VERSION	DATA	TABLE
SINGLE-VERSION	DATA	TABLE
SINGLE-VERSION	DATA	
TABLE
EXPERIMENT	DATA	TABLE
SINGLE-VERSION	DATA	TABLE
SINGLE-VERSION	DATA	TABLE
SINGLE-VERSION	DATA	
TABLE
Variable1
DATA FROM 2 PROPERTIES ON ALL RUNS IN ONE
EXPERIMENT
DATA ON 2 PROPERTIES FROM SELECTED
OBSERVATIONS IN MULTIPLE VERSIONS &
EXPERIMENTS
Variable 2
PROCESS DATA TABLE
FILTER TO SELECTED OBSERVATIONS CORRELATE 2 OR MORE VAROAB;ES
Immediate access to
• Material genealogy
• Error / Variance analysis
• Statistical process control
• Hit picking
• Root cause analysis
• Design of Experiments
• AI / statistical learning pipeline
Riffyn Scientific Development Environment …
delivers every data point structured, contextualized & ready for machine learning / AI
Variables
Observations
© 2017 Riffyn Inc hello@riffyn.com
Founded in 2014 Offices in Oakland & Boston
Cloud-based software
biopharm chemicals food tech transfer
Our mission is to deliver research we can trust and build on efficiently, like high-
quality parts in a global supply chain

More Related Content

Similar to Buried in data and starving for information

Challenges in Clinical Research: Aridhia's Disruptive Technology Approach to ...
Challenges in Clinical Research: Aridhia's Disruptive Technology Approach to ...Challenges in Clinical Research: Aridhia's Disruptive Technology Approach to ...
Challenges in Clinical Research: Aridhia's Disruptive Technology Approach to ...
Aridhia Informatics Ltd
 

Similar to Buried in data and starving for information (20)

Analytica 2014 - Biotech Forum - IDBS Bioprocess Execution System
Analytica 2014 - Biotech Forum - IDBS Bioprocess Execution SystemAnalytica 2014 - Biotech Forum - IDBS Bioprocess Execution System
Analytica 2014 - Biotech Forum - IDBS Bioprocess Execution System
 
Data Virtualization - Enabling Next Generation Analytics
Data Virtualization - Enabling Next Generation AnalyticsData Virtualization - Enabling Next Generation Analytics
Data Virtualization - Enabling Next Generation Analytics
 
LFS302_Real-World Evidence Platform to Enable Therapeutic Innovation
LFS302_Real-World Evidence Platform to Enable Therapeutic InnovationLFS302_Real-World Evidence Platform to Enable Therapeutic Innovation
LFS302_Real-World Evidence Platform to Enable Therapeutic Innovation
 
Future of RWE Digital Pharma West presentation
Future of RWE Digital Pharma West presentationFuture of RWE Digital Pharma West presentation
Future of RWE Digital Pharma West presentation
 
Future of RWE - Big Data and Analytics for Pharma 2017 presentation
Future of RWE - Big Data and Analytics for Pharma 2017 presentationFuture of RWE - Big Data and Analytics for Pharma 2017 presentation
Future of RWE - Big Data and Analytics for Pharma 2017 presentation
 
Formula For Case Intake Success
Formula For Case Intake SuccessFormula For Case Intake Success
Formula For Case Intake Success
 
Advice for Healthcare IT Startups NYC Executive Informational and Networking ...
Advice for Healthcare IT Startups NYC Executive Informational and Networking ...Advice for Healthcare IT Startups NYC Executive Informational and Networking ...
Advice for Healthcare IT Startups NYC Executive Informational and Networking ...
 
Webinar: Leveraging big data in life sciences & healthcare
Webinar: Leveraging big data in life sciences & healthcareWebinar: Leveraging big data in life sciences & healthcare
Webinar: Leveraging big data in life sciences & healthcare
 
FAIR for the future: embracing all things data
FAIR for the future: embracing all things dataFAIR for the future: embracing all things data
FAIR for the future: embracing all things data
 
AI-LAB
AI-LABAI-LAB
AI-LAB
 
Advancing Biomedical Knowledge Reuse with FAIR
Advancing Biomedical Knowledge Reuse with FAIRAdvancing Biomedical Knowledge Reuse with FAIR
Advancing Biomedical Knowledge Reuse with FAIR
 
How to Scale BI and Analytics with Hadoop-based Platforms
How to Scale BI and Analytics with Hadoop-based PlatformsHow to Scale BI and Analytics with Hadoop-based Platforms
How to Scale BI and Analytics with Hadoop-based Platforms
 
The Rensselaer IDEA: Data Exploration
The Rensselaer IDEA: Data Exploration The Rensselaer IDEA: Data Exploration
The Rensselaer IDEA: Data Exploration
 
Put Analytics And Automation At The Core Of Security – Joseph Blankenship – S...
Put Analytics And Automation At The Core Of Security – Joseph Blankenship – S...Put Analytics And Automation At The Core Of Security – Joseph Blankenship – S...
Put Analytics And Automation At The Core Of Security – Joseph Blankenship – S...
 
Understanding the Relationship Between Agile, Lean and DevOps
Understanding the Relationship Between Agile, Lean and DevOps Understanding the Relationship Between Agile, Lean and DevOps
Understanding the Relationship Between Agile, Lean and DevOps
 
Challenges in Clinical Research: Aridhia's Disruptive Technology Approach to ...
Challenges in Clinical Research: Aridhia's Disruptive Technology Approach to ...Challenges in Clinical Research: Aridhia's Disruptive Technology Approach to ...
Challenges in Clinical Research: Aridhia's Disruptive Technology Approach to ...
 
Challenges in Clinical Research: Aridhia Disrupts Technology Approach to Rese...
Challenges in Clinical Research: Aridhia Disrupts Technology Approach to Rese...Challenges in Clinical Research: Aridhia Disrupts Technology Approach to Rese...
Challenges in Clinical Research: Aridhia Disrupts Technology Approach to Rese...
 
IOT206_A Cloud-Based IoT Platform Purposefully Built For Healthcare
IOT206_A Cloud-Based IoT Platform Purposefully Built For HealthcareIOT206_A Cloud-Based IoT Platform Purposefully Built For Healthcare
IOT206_A Cloud-Based IoT Platform Purposefully Built For Healthcare
 
The universe of identifiers and how ANDS is using them
The universe of identifiers and how ANDS is using themThe universe of identifiers and how ANDS is using them
The universe of identifiers and how ANDS is using them
 
SciBite - Role Of Ontologies (Pistoia Alliance Webinar)
SciBite - Role Of Ontologies (Pistoia Alliance Webinar)SciBite - Role Of Ontologies (Pistoia Alliance Webinar)
SciBite - Role Of Ontologies (Pistoia Alliance Webinar)
 

Recently uploaded

Climate extremes likely to drive land mammal extinction during next supercont...
Climate extremes likely to drive land mammal extinction during next supercont...Climate extremes likely to drive land mammal extinction during next supercont...
Climate extremes likely to drive land mammal extinction during next supercont...
Sérgio Sacani
 
Jet reorientation in central galaxies of clusters and groups: insights from V...
Jet reorientation in central galaxies of clusters and groups: insights from V...Jet reorientation in central galaxies of clusters and groups: insights from V...
Jet reorientation in central galaxies of clusters and groups: insights from V...
Sérgio Sacani
 
Gliese 12 b: A Temperate Earth-sized Planet at 12 pc Ideal for Atmospheric Tr...
Gliese 12 b: A Temperate Earth-sized Planet at 12 pc Ideal for Atmospheric Tr...Gliese 12 b: A Temperate Earth-sized Planet at 12 pc Ideal for Atmospheric Tr...
Gliese 12 b: A Temperate Earth-sized Planet at 12 pc Ideal for Atmospheric Tr...
Sérgio Sacani
 
Tuberculosis (TB)-Notes.pdf microbiology notes
Tuberculosis (TB)-Notes.pdf microbiology notesTuberculosis (TB)-Notes.pdf microbiology notes
Tuberculosis (TB)-Notes.pdf microbiology notes
jyothisaisri
 
Continuum emission from within the plunging region of black hole discs
Continuum emission from within the plunging region of black hole discsContinuum emission from within the plunging region of black hole discs
Continuum emission from within the plunging region of black hole discs
Sérgio Sacani
 
Exomoons & Exorings with the Habitable Worlds Observatory I: On the Detection...
Exomoons & Exorings with the Habitable Worlds Observatory I: On the Detection...Exomoons & Exorings with the Habitable Worlds Observatory I: On the Detection...
Exomoons & Exorings with the Habitable Worlds Observatory I: On the Detection...
Sérgio Sacani
 
The solar dynamo begins near the surface
The solar dynamo begins near the surfaceThe solar dynamo begins near the surface
The solar dynamo begins near the surface
Sérgio Sacani
 
The importance of continents, oceans and plate tectonics for the evolution of...
The importance of continents, oceans and plate tectonics for the evolution of...The importance of continents, oceans and plate tectonics for the evolution of...
The importance of continents, oceans and plate tectonics for the evolution of...
Sérgio Sacani
 

Recently uploaded (20)

Emergent ribozyme behaviors in oxychlorine brines indicate a unique niche for...
Emergent ribozyme behaviors in oxychlorine brines indicate a unique niche for...Emergent ribozyme behaviors in oxychlorine brines indicate a unique niche for...
Emergent ribozyme behaviors in oxychlorine brines indicate a unique niche for...
 
Constraints on Neutrino Natal Kicks from Black-Hole Binary VFTS 243
Constraints on Neutrino Natal Kicks from Black-Hole Binary VFTS 243Constraints on Neutrino Natal Kicks from Black-Hole Binary VFTS 243
Constraints on Neutrino Natal Kicks from Black-Hole Binary VFTS 243
 
GBSN - Microbiology Lab (Compound Microscope)
GBSN - Microbiology Lab (Compound Microscope)GBSN - Microbiology Lab (Compound Microscope)
GBSN - Microbiology Lab (Compound Microscope)
 
RACEMIzATION AND ISOMERISATION completed.pptx
RACEMIzATION AND ISOMERISATION completed.pptxRACEMIzATION AND ISOMERISATION completed.pptx
RACEMIzATION AND ISOMERISATION completed.pptx
 
Climate extremes likely to drive land mammal extinction during next supercont...
Climate extremes likely to drive land mammal extinction during next supercont...Climate extremes likely to drive land mammal extinction during next supercont...
Climate extremes likely to drive land mammal extinction during next supercont...
 
Jet reorientation in central galaxies of clusters and groups: insights from V...
Jet reorientation in central galaxies of clusters and groups: insights from V...Jet reorientation in central galaxies of clusters and groups: insights from V...
Jet reorientation in central galaxies of clusters and groups: insights from V...
 
SCHISTOSOMA HEAMATOBIUM life cycle .pdf
SCHISTOSOMA HEAMATOBIUM life cycle  .pdfSCHISTOSOMA HEAMATOBIUM life cycle  .pdf
SCHISTOSOMA HEAMATOBIUM life cycle .pdf
 
Gliese 12 b: A Temperate Earth-sized Planet at 12 pc Ideal for Atmospheric Tr...
Gliese 12 b: A Temperate Earth-sized Planet at 12 pc Ideal for Atmospheric Tr...Gliese 12 b: A Temperate Earth-sized Planet at 12 pc Ideal for Atmospheric Tr...
Gliese 12 b: A Temperate Earth-sized Planet at 12 pc Ideal for Atmospheric Tr...
 
Tuberculosis (TB)-Notes.pdf microbiology notes
Tuberculosis (TB)-Notes.pdf microbiology notesTuberculosis (TB)-Notes.pdf microbiology notes
Tuberculosis (TB)-Notes.pdf microbiology notes
 
Continuum emission from within the plunging region of black hole discs
Continuum emission from within the plunging region of black hole discsContinuum emission from within the plunging region of black hole discs
Continuum emission from within the plunging region of black hole discs
 
family therapy psychotherapy types .pdf
family therapy psychotherapy types  .pdffamily therapy psychotherapy types  .pdf
family therapy psychotherapy types .pdf
 
Exomoons & Exorings with the Habitable Worlds Observatory I: On the Detection...
Exomoons & Exorings with the Habitable Worlds Observatory I: On the Detection...Exomoons & Exorings with the Habitable Worlds Observatory I: On the Detection...
Exomoons & Exorings with the Habitable Worlds Observatory I: On the Detection...
 
Mining Activity and Investment Opportunity in Myanmar.pptx
Mining Activity and Investment Opportunity in Myanmar.pptxMining Activity and Investment Opportunity in Myanmar.pptx
Mining Activity and Investment Opportunity in Myanmar.pptx
 
NuGOweek 2024 full programme - hosted by Ghent University
NuGOweek 2024 full programme - hosted by Ghent UniversityNuGOweek 2024 full programme - hosted by Ghent University
NuGOweek 2024 full programme - hosted by Ghent University
 
Ostiguy & Panizza & Moffitt (eds.) - Populism in Global Perspective. A Perfor...
Ostiguy & Panizza & Moffitt (eds.) - Populism in Global Perspective. A Perfor...Ostiguy & Panizza & Moffitt (eds.) - Populism in Global Perspective. A Perfor...
Ostiguy & Panizza & Moffitt (eds.) - Populism in Global Perspective. A Perfor...
 
The solar dynamo begins near the surface
The solar dynamo begins near the surfaceThe solar dynamo begins near the surface
The solar dynamo begins near the surface
 
The importance of continents, oceans and plate tectonics for the evolution of...
The importance of continents, oceans and plate tectonics for the evolution of...The importance of continents, oceans and plate tectonics for the evolution of...
The importance of continents, oceans and plate tectonics for the evolution of...
 
GBSN - Microbiology Lab (Microbiology Lab Safety Procedures)
GBSN -  Microbiology Lab (Microbiology Lab Safety Procedures)GBSN -  Microbiology Lab (Microbiology Lab Safety Procedures)
GBSN - Microbiology Lab (Microbiology Lab Safety Procedures)
 
Gliese 12 b, a temperate Earth-sized planet at 12 parsecs discovered with TES...
Gliese 12 b, a temperate Earth-sized planet at 12 parsecs discovered with TES...Gliese 12 b, a temperate Earth-sized planet at 12 parsecs discovered with TES...
Gliese 12 b, a temperate Earth-sized planet at 12 parsecs discovered with TES...
 
ERTHROPOIESIS: Dr. E. Muralinath & R. Gnana Lahari
ERTHROPOIESIS: Dr. E. Muralinath & R. Gnana LahariERTHROPOIESIS: Dr. E. Muralinath & R. Gnana Lahari
ERTHROPOIESIS: Dr. E. Muralinath & R. Gnana Lahari
 

Buried in data and starving for information

  • 1. © 2017 Riffyn Inc hello@riffyn.com Never Miss a Discovery PREP MEDIA THAW CELLS INOCULATE ADD TREATMENT INCUBATE IMMUNO ASSAY MEASURE CELL COUNT 0.1954 0.2122 0.2397 0.1823 10.589 10.897 2.6349 10.732 2.5732 2.7589 11.344 2.6349
  • 2. © 2017 Riffyn Inc hello@riffyn.com * Error rate = R&D results that fail to reproduce, leading to wasted resources, failed tech transfers and missed discoveries. Life Sciences Chemicals/Materials Energy Food/AG Mining and Metals Pulp, Paper, Textiles Water/Wastewater Other 0 50 100 150 200 250 300 Sources: 2013 R&D Magazine Global Funding Forecast NSF Science and Engineering Indicators 2012 CIA World Fact Book Prinz, et al., Believe it or not: how much can we rely on published data on potential drug targets?, Nat. Rev. Drug Disc., 2011 Begley & Ellis, Raise Standards for Preclinical Cancer Research, Nature, 483, 2012 Ioannidis & Khoury, Improving Validation Practices in “Omics” Research, Science, 334, 2011 Halford, The Second Annual State of Translational Research, Sigma-Aldrich / AAAS, 2014 Freedman, et al., The Economics of Reproducibility in Preclinical Research, PLOS Biology, 2015 R&D Spend $420B R&D losses $100B>25% error rate* Annual R&D Spend ($B) The problem in science today: >$100B of lost R&D productivity each year
  • 3. © 2017 Riffyn Inc hello@riffyn.com Poor data utilization is accepted as a price for flexibility and innovation A lot of today’s operations are achieved through trial and error and “tribal knowledge” Scientist at a global ethanol producer
  • 4. © 2017 Riffyn Inc hello@riffyn.com “We can’t determine which variables matter.” “Data context is only in people’s head.” “Process transfer is unreliable.” This is the consequence of data fragmentation in the daily life of a scientist
  • 5. © 2017 Riffyn Inc hello@riffyn.com Communication in science today is a bit like this “The problem in science today isn’t recording data, it’s integrating methods and results so we can do science.”
  • 6. © 2017 Riffyn Inc hello@riffyn.com Today, 80% of time is wasted wrangling scientific data, not analyzing it DATA STRUCTURE QUALITY CONTROL ANALYZE Machine LearningExperiment 80% of a data scientist’s effort 20% of a data scientist’s effort DATA WRANGLING
  • 7. © 2017 Riffyn Inc hello@riffyn.com This is a wrangler
  • 8. © 2017 Riffyn Inc hello@riffyn.com For one paper, securing the necessary data took a year. And the authors of four other papers have stopped communicating with the project altogether. Most authors were happy to collaborate, but it has taken longer than expected to locate the relevant data. R. Van Noorden, Sluggish data sharing hampers reproducibility effort, Nature News, 3 June 2015 http://www.nature.com/news/sluggish-data-sharing-hampers-reproducibility-effort-1.17694 This is a data wrangler
  • 9. © 2017 Riffyn Inc hello@riffyn.com This where scientific data analytics needs to be 80% of your time analyzing data for scientific breakthroughs, not wrangling data DATA ANALYZE Machine-aided Design & Learning 80% of a data scientist’s effort STRUCTURE QUALITY BY DESIGN SCIENTIFIC DESIGN 20% of a data scientist’s effort
  • 10. © 2017 Riffyn Inc hello@riffyn.com Why are we buried in data and starving for information?
  • 11. © 2017 Riffyn Inc hello@riffyn.com This is how R&D is performed and communicated today Artisanal know-how is stuck in 400-year-old concepts for scientific documentation and data TECH TRANSFER / SCALE-UP Ad hoc knowledge capture/transfer between R&D to Manufacturing EARLY R&D Un-computable & ambiguous protocols
  • 12. © 2017 Riffyn Inc hello@riffyn.com We are starving for information because … we cannot access computable data sets, and they’re not annotated with methodological context From Riffyn 2014 survey of life science and chemical companies R&D issue % with issue Data cannot be integrated, is hard to find, or is not annotated 66% Cannot assess the impact of process changes 52% Data is unstructured or incomplete 48%
  • 13. © 2017 Riffyn Inc hello@riffyn.com The poor access to data and context means that … errors accumulate because we can’t identify their causes, & progress is slowed 1X R&D pace 2007 2008 2009 2010 2X R&D pace 6X error reduction Year Reducing error 6X in screening process doubles the rate of cell line yield improvement Source: Gardner, TS (2013) Trends in Biotech. 31:3, 123-125. BEFORE VARIANCE REDUCTION AFTER VARIANCE REDUCTION 30% relative error 5% relative error Productyieldofastrain
  • 14. © 2017 Riffyn Inc hello@riffyn.com Fragmented data and accumulation of errors means that … discoveries get lost and buried in a sea of disjointed data Parameter X ParameterY Data from a single experiment Data from a 100s of experiments Linking data across experiments identifies critical process parameters and unexpected correlations
  • 15. © 2017 Riffyn Inc hello@riffyn.com How do we find information in our data?
  • 16. © 2017 Riffyn Inc hello@riffyn.com It starts by recognizing today’s scientific paradigm is missing ”blueprints” Thus the most relevant methodological data is stuck in people’s heads “We work on what we know, not what matters.”
  • 17. © 2017 Riffyn Inc hello@riffyn.com A metaphor: evolution of geographic information systems toward CAD c. 1700s c. 1800s
  • 18. © 2017 Riffyn Inc hello@riffyn.com Geographic information systems Digital, visual, living documents with integrated data -> a foundation for AI c. 2000s (Maps) c. 2010s (AI)
  • 19. © 2017 Riffyn Inc hello@riffyn.com Never Miss a Discovery The Riffyn Scientific Development Environment (SDE) is a cloud- based application for reproducible science and faster discovery Design, execute, and share scientific experiments as visual, computable data sets for interactive analysis and machine learning
  • 20. © 2017 Riffyn Inc hello@riffyn.com Riffyn is transforming scientific “spreadsheet hell” …
  • 21. © 2017 Riffyn Inc hello@riffyn.com …to digital scientific process maps & linked data for statistical learning
  • 22. © 2017 Riffyn Inc hello@riffyn.com Riffyn Scientific Development Environment Flexible process design, analysis and improvement across the scientific development lifecycle Riffyn MapTM Where you design your processes, workflows & experiments Riffyn TrackTM Where you track your samples, equipment and measurement data across your workflow Riffyn DiscoverTM Where you pick hits, eliminate variation, and establish cause & effect Riffyn ShareTM Where you collaborate, review, and transfer all your methods and results Riffyn BridgeTM where you connect data sources including legacy applications, enterprise systems, and devices
  • 23. © 2017 Riffyn Inc hello@riffyn.com Design Riffyn MapTM
  • 24. © 2017 Riffyn Inc hello@riffyn.com Measure Riffyn TrackTM
  • 25. © 2017 Riffyn Inc hello@riffyn.com Riffyn DiscoverTM 15mL 250 mL 250 mL 50 L 15mL 250 mL 250 mL 50 L Analyze
  • 26. © 2017 Riffyn Inc hello@riffyn.com Share Riffyn ShareTM
  • 27. © 2017 Riffyn Inc hello@riffyn.com DRAG-AND-DROP EXPERIMENT DESIGN AUTOMATICALLY LINKS METHODS TO RESULTS VISUAL DESIGN DATA IS PRESENTED IN A STATISTICAL DATA FRAME READY FOR VISUALIZATION, QUALITY ANALYSIS, CORRELATION & MACHINE LEARNING REAL-TIME DATA INTEGRATION INACCESSIBLE METHODS DATA FRAGMENTATION Riffyn Scientific Development Environment solves solves Addresses 3 critical barriers to reproducible outcomes & faster product development “The problem isn’t capturing data, it’s integrating results so we can do science” ONLINE AND OFFLINE DATA CAPTURED AND INSTANTLY INTEGRATED FOR IMMEDIATE INTERROGATION ANALYSIS IN SECONDS EXPERIMENTAL ERROR & MISSED DISCOVERIES solves
  • 28. © 2017 Riffyn Inc hello@riffyn.com EXPERIMENT DATA TABLE SINGLE-VERSION DATA TABLE SINGLE-VERSION DATA TABLE SINGLE-VERSION DATA TABLE EXPERIMENT DATA TABLE SINGLE-VERSION DATA TABLE SINGLE-VERSION DATA TABLE SINGLE-VERSION DATA TABLE EXPERIMENT DATA TABLE SINGLE-VERSION DATA TABLE SINGLE-VERSION DATA TABLE SINGLE-VERSION DATA TABLE Variable1 DATA FROM 2 PROPERTIES ON ALL RUNS IN ONE EXPERIMENT DATA ON 2 PROPERTIES FROM SELECTED OBSERVATIONS IN MULTIPLE VERSIONS & EXPERIMENTS Variable 2 PROCESS DATA TABLE FILTER TO SELECTED OBSERVATIONS CORRELATE 2 OR MORE VAROAB;ES Immediate access to • Material genealogy • Error / Variance analysis • Statistical process control • Hit picking • Root cause analysis • Design of Experiments • AI / statistical learning pipeline Riffyn Scientific Development Environment … delivers every data point structured, contextualized & ready for machine learning / AI Variables Observations
  • 29. © 2017 Riffyn Inc hello@riffyn.com Founded in 2014 Offices in Oakland & Boston Cloud-based software biopharm chemicals food tech transfer Our mission is to deliver research we can trust and build on efficiently, like high- quality parts in a global supply chain