© 2017 IBM Corporation
An Ecosystem Approach to Data Science
Dr. Alex Liu, Chief Data Scientist, Analytics Services, IBM
October 2018
© 2017 IBM Corporation2
Alex Liu Introduction
▪ Chief Data Scientist – Analytics Services at IBM
▪ A Data Scientist Thought Leader
▪ Chief Data Scientist for a few corporations before
joined IBM
▪ Taught advanced data analytics for the
University of South California and the University
of California at Irvine
▪ Consulted for the United Nations, Ingram Micro
…
▪ M.S. and Ph.D. from Stanford University
© 2017 IBM Corporation3
Data Science projects return very valuable results
but a lot failed
▪Netflix, for example, integrates data science
into each part of their business; they
estimate a billion dollars in incremental
value from their personalization and
recommendation alone.
▪Knight Capital Group, for instance, lost
$440 million in 45 minutes after a mistake in
updating a model (New York times).
▪Gartner estimated that 60% of big data
projects fail in 2016, and in 2017.
© 2017 IBM Corporation4
Solutions needed – an ecosystem approach with cognitive assistance
▪ Data science ecosystems
 To attack the issues of high data scientists turn over ratios.
 To create a platform of sharing data.
 To create ways of sharing expertise
▪ Cognitive Assistance for Data science
 To attack the issues of complexity of predictive modeling
 To take care of tedious tasks
 To optimize data science processes
 To produce fast analytics
 To produce instant analytics
▪ Los Angeles Meetup Community
▪ Local face to face community – more than 1100 members
▪ https://www.meetup.com/RMDS_LA/
▪ https://www.linkedin.com/groups/1895501 has 29K participants
© 2017 IBM Corporation
An ecosystem approach of data science
A data science ECOSYSTEM has three basic elements
1) Data portal, 2) Data Science platform, 3) Data Science community
© 2017 IBM Corporation6
The defining characteristics of an ecosystem - mutuality & orchestration
Markets comprise entities that operate
out of individual self-interest
Ecosystems comprise entities that operate
out of orchestrated, mutual shared-interest
A set of individuals or organizations who
exchange products or services within an
environment governed by the laws of supply
and demand
A set of individuals or organizations who
formally or informally operate together to
produce something of greater value for the
mutual benefit of the ecosystem as a whole
Ecosystems exists because operating in an orchestrated environment, participants
can deliver more value within the ecosystem acting together than acting alone
© 2017 IBM Corporation7
Ecosystems can yield substantial benefits
New capabilities Improved access Improved Agility
Increase Success Ratio of Data Science Projects
Embrace ecosystems’ strategic potential
Ecosystems enable organizations
to access critical capabilities that
they would otherwise have
difficulty obtaining
Ecosystems support greater
access to new or different
resources such as new talents,
new tools, new data sets
Ecosystems support quick
creation of new types of products,
with different combinations of
organizations and assets
© 2017 IBM Corporation8
101
010
101
Platform
~ IBM DSX
Events Data Social Media
Analytical
Insights for Smart
Cities
Connecting all
the data
scientists from
a DS
community
Applications
Optimizing Operations Solutions
IoT Data
EX1: City Open Data Serving Business and Citizens
City Open Data + WATSON Studio + RMDS Community
A data science ecosystem with city open data
EX2: Weather Data Serving Retails
Weather Data + WATSON Studio + RMDS Community
A data science ecosystem with weather data
101
010
101
Platform
~ IBM DSX
Weather Data Transaction
Analytical
Insights for Smart
Commerces
Connecting all
the data
scientists from
a DS
community
Applications
Optimizing Operations Solutions
IoT Data
© 2017 IBM Corporation10
Ecosystem Service Offerings
▪ Developing data science ecosystems for enterprises
▪ Monetizing data with ecosystem approaches
▪ Building user communities with ecosystem approaches
▪ Los Angeles Meetup Community
▪ Local face to face community – more than 1100 members
▪ https://www.meetup.com/RMDS_LA/
▪ https://www.linkedin.com/groups/1895501 has 29K participants
© 2017 IBM Corporation11
RMDS Data Science Ecosystem
Management Services
▪ RM4Es Based Workflow Management
▪ ResearchMap Based Asset Management
▪ Data Portal, DS Platforms and DS Communities integration and management
▪ Los Angeles Meetup Community
▪ Local face to face community – more than 1100 members
▪ https://www.meetup.com/RMDS_LA/
▪ https://www.linkedin.com/groups/1895501 has 29K participants
RM4Es TM
• Equation
• Estimation
• Evaluation
• Explanantion/Execution

An Ecosystem Approach to Data Science

  • 1.
    © 2017 IBMCorporation An Ecosystem Approach to Data Science Dr. Alex Liu, Chief Data Scientist, Analytics Services, IBM October 2018
  • 2.
    © 2017 IBMCorporation2 Alex Liu Introduction ▪ Chief Data Scientist – Analytics Services at IBM ▪ A Data Scientist Thought Leader ▪ Chief Data Scientist for a few corporations before joined IBM ▪ Taught advanced data analytics for the University of South California and the University of California at Irvine ▪ Consulted for the United Nations, Ingram Micro … ▪ M.S. and Ph.D. from Stanford University
  • 3.
    © 2017 IBMCorporation3 Data Science projects return very valuable results but a lot failed ▪Netflix, for example, integrates data science into each part of their business; they estimate a billion dollars in incremental value from their personalization and recommendation alone. ▪Knight Capital Group, for instance, lost $440 million in 45 minutes after a mistake in updating a model (New York times). ▪Gartner estimated that 60% of big data projects fail in 2016, and in 2017.
  • 4.
    © 2017 IBMCorporation4 Solutions needed – an ecosystem approach with cognitive assistance ▪ Data science ecosystems  To attack the issues of high data scientists turn over ratios.  To create a platform of sharing data.  To create ways of sharing expertise ▪ Cognitive Assistance for Data science  To attack the issues of complexity of predictive modeling  To take care of tedious tasks  To optimize data science processes  To produce fast analytics  To produce instant analytics ▪ Los Angeles Meetup Community ▪ Local face to face community – more than 1100 members ▪ https://www.meetup.com/RMDS_LA/ ▪ https://www.linkedin.com/groups/1895501 has 29K participants
  • 5.
    © 2017 IBMCorporation An ecosystem approach of data science A data science ECOSYSTEM has three basic elements 1) Data portal, 2) Data Science platform, 3) Data Science community
  • 6.
    © 2017 IBMCorporation6 The defining characteristics of an ecosystem - mutuality & orchestration Markets comprise entities that operate out of individual self-interest Ecosystems comprise entities that operate out of orchestrated, mutual shared-interest A set of individuals or organizations who exchange products or services within an environment governed by the laws of supply and demand A set of individuals or organizations who formally or informally operate together to produce something of greater value for the mutual benefit of the ecosystem as a whole Ecosystems exists because operating in an orchestrated environment, participants can deliver more value within the ecosystem acting together than acting alone
  • 7.
    © 2017 IBMCorporation7 Ecosystems can yield substantial benefits New capabilities Improved access Improved Agility Increase Success Ratio of Data Science Projects Embrace ecosystems’ strategic potential Ecosystems enable organizations to access critical capabilities that they would otherwise have difficulty obtaining Ecosystems support greater access to new or different resources such as new talents, new tools, new data sets Ecosystems support quick creation of new types of products, with different combinations of organizations and assets
  • 8.
    © 2017 IBMCorporation8 101 010 101 Platform ~ IBM DSX Events Data Social Media Analytical Insights for Smart Cities Connecting all the data scientists from a DS community Applications Optimizing Operations Solutions IoT Data EX1: City Open Data Serving Business and Citizens City Open Data + WATSON Studio + RMDS Community A data science ecosystem with city open data
  • 9.
    EX2: Weather DataServing Retails Weather Data + WATSON Studio + RMDS Community A data science ecosystem with weather data 101 010 101 Platform ~ IBM DSX Weather Data Transaction Analytical Insights for Smart Commerces Connecting all the data scientists from a DS community Applications Optimizing Operations Solutions IoT Data
  • 10.
    © 2017 IBMCorporation10 Ecosystem Service Offerings ▪ Developing data science ecosystems for enterprises ▪ Monetizing data with ecosystem approaches ▪ Building user communities with ecosystem approaches ▪ Los Angeles Meetup Community ▪ Local face to face community – more than 1100 members ▪ https://www.meetup.com/RMDS_LA/ ▪ https://www.linkedin.com/groups/1895501 has 29K participants
  • 11.
    © 2017 IBMCorporation11 RMDS Data Science Ecosystem Management Services ▪ RM4Es Based Workflow Management ▪ ResearchMap Based Asset Management ▪ Data Portal, DS Platforms and DS Communities integration and management ▪ Los Angeles Meetup Community ▪ Local face to face community – more than 1100 members ▪ https://www.meetup.com/RMDS_LA/ ▪ https://www.linkedin.com/groups/1895501 has 29K participants RM4Es TM • Equation • Estimation • Evaluation • Explanantion/Execution