SlideShare a Scribd company logo
1 of 24
FROM “BIG DATA” TO
DATAWARE
SIM Technology Leadership Summit
May 20, 2015
MADRONA OVERVIEW
• Madrona is a leading venture capital firm focused on sourcing and
growing early-stage technology companies in the Pacific Northwest
• About $1 billion under management across five funds
–Investors include the University of Washington, University of Virginia, Irvine
Foundation, University of North Carolina, and strategic individuals
• Investments made in over 100 companies the past 20 years with over 50
active portfolio companies and over 40 positive exits
• Madrona team
–7 Managing Directors
–Strategic Directors and Venture Partners include: Sujal Patel, Steve Singh,
John McAdam, Prof. Oren Etzioni, and Prof. Dan Weld
THE PNW TECH ECOSYSTEM IS STRONG AND
GROWING
Anchor Tenants
Large Tech Satellite Offices
Mid-Cap Tech with Seattle HQ
World-Class Research
OUR FUTURE
1995 TODAY (2015) 2035
COMMUNICATION Snail mail, fax, early email
SMS, Facebook, Skype,
Snapchat & Twitter
Virtual Reality Rooms
DEVICES Desktop PCs Smart Mobile Devices
Embedded on you &
everything else (IoT)
SOFTWARE/
DATAWARE
Packaged/Licensed SaaS subscription/Apps Intelligent apps
INTERNET/
CONNECTIVITY
Dial up modem 56k
“Ubiquitous” broadband 100
Mbps to mobile
“Always On” and IoT
COMPUTE/STORAGE
Pentium processor 100 MIPS
Single-core
~$1 million/TB
Intel Xeon E7 processor –
4000 MIPS
Multi-core
$59/TB
$5/Petabyte
INFRASTRUCTURE Internet & Dedicated servers Cloud Real-time hybrid marketplace
COMMERCE 1 book/10 days/$5 delivery
Anything 2 days free; 50,000
items in 2 hours free delivery
Drones or autonomous car
delivery & 3D printed
WHAT IS “DATAWARE”?
A framework for describing the combination of data, software, math
formulas and “predictive” analytics that help data savvy teams turn
information and insights into profitable actions.
5
Why Now?
• Cloud Enablement: “Cloud” abstracts hardware into software and enables
unprecedented elasticity, scale and speed
• Big Data: The volume, velocity and variety of data types and stores has expanded
rapidly while the value of retaining/leveraging data often exceeds the cost
• Legacy “Datastores”: Highly structured and constrained systems (databases, data
warehouses, BI tools) that are too rigid to unlock data’s full value yet too ubiquitous
and important to NOT leverage
• Emerging Solutions: A combination of point solutions, systematic approaches and
“vertical” services emerging to leverage these trends in an agile manner. These
solutions require a structured framework to prioritize market opportunities
INSERT BIG DATA LANDSCAPE SLIDE
6
MADRONA DATAWARE FRAMEWORK
7
INTELLIGENT APPS &
SERVICES
DATA
INTELLIGENCE
ENABLING
INFRASTRUCTURE
AgileDataStack
Marc Benihoff, Founder and CEO of Salesforce.com, when asked what he
thinks is the major tech trend of the next five years responded that we are
in an “AI Spring.” Fortune Term Sheet 1/6/15
WHAT MAKES THE DATA “BIG”?
Value More valuable to store than throw away
8
Variety Different sources & structures create opportunities…
& challenges
Volume Easy, plentiful & cheap data to collect & store
Velocity Speed of turning data into actionable insights – batch vs.
real-time!
DATA INPUTS
• Legacy Databases: Highly structured, transactional focused,
generally rigid
– Databases with SQL queries (OLTP)
– Historic “Extract, Transform, Load” tools (ETL)
– Data warehouses and data cubes
– Business Intelligence (BI) and “Online Analytics Processing (OLAP)”
• “Big Data” Sources: Structure variety, high volume/velocity, agile
– “Not Only SQL” (NoSQL) data repositories
– Allow for “Extract, Load, Transform” (ELT) flexibility
– Continuous, online (streamed) data flows
– Relationship focus vs. Relational focus
9
Places Things
Profiles
WHERE DOES DATA & METADATA COME FROM?
People
• Consumers
• Office Workers
• Field Workers
• Citizens
• Partners
• Customers
• Home
• Work
• Stores
• Destinations
• Routes
• Individuals
• Demographics
• Devices
• Locations
• Objects
• “Campaigns”
• Biology
• “Networks"
• Devices
• Vehicles
• Machines
• Medical
• Homes
• Content
WHY DOES IT MATTER?
From To
Structure Mostly structured
(relational)
Flexibly structured
(relationship)
Flexibility Rigid & slow
(R + cubes +BI)
Agile & rapid
(Python + graphs/ML + UI)
Availability Offline & batch Online & continuous
Key Drivers Code & “Rules”
(“hard coded”,
structured learning)
Data, Statistics, Discovery
(“machine learned”,
“inferred”, Bayesian)
Conceptually Certainty & consistency Iteration & “surprise”
11
TECHNOLOGY SECTOR IMPACT OF “DATAWARE”
YEARS: 0 – 2 2 – 5 5+
Relational Databases
(Oracle, MSFT) + ?? -
Traditional Infrastructure
(HP, IBM, Dell) + - --
Traditional Apps
(Oracle, SAP) + +/- -
Cloud Infrastructure ++ ++ +
SAAS ++ ++ +/-
12
BIG COMPANY “LEADING INDICATORS”
• Microsoft-AzureML, Revolution Analytics, much more
• HP reorganizes software business around “Big Data”
• Salesforce.com buys RelateIQ for $390M for “data
cloud”
• Oracle builds “data cloud” team including Blue Kai and
Datalogix
• SAP promotes HANA, buys Concur
• IBM advertises Watson, Blue Mix
• AWS – AmazonML, Lambda, Kinesis
13
KEY QUESTIONS
• How do big, especially software-driven, companies unlock their “data
silos”?
• How will traditional databases/warehouses, newer “big data” stores and
integrated big data “lakes” compliment or compete?
• What models will emerge to capture value in “data intelligence”?
• To what extent can intelligent apps and services disrupt legacy
apps/services?
14
MADRONA DATAWARE FRAMEWORK
15
INTELLIGENT APPS &
SERVICES
DATA
INTELLIGENCE
ENABLING
INFRASTRUCTURE
AgileDataStack
KEYS TO EMBRACING DATAWARE
1. Enabling infrastructure complex (Hadoop/Cloudera,
NoSQL/MongoDB, Spark, Legacy) & hard/expensive but getting
simplified and cheaper
2. Data Intelligence holds big promise but scarcity of “data
scientists” requires professional services (Dato, Context
Relevant, Atigeo, Palantir) and systematic, standardized
approaches from emerging companies
3. Early “App Intelligence” that is real-time and agile already exists
(ad serving, content recommendations, personalization, vertical
markets). Tremendous opportunity here to reinvent categories
4. Opportunities also exist in the data pipeline (Trifacta) and data
management, but tend to be deeper technical systems
16
APPLICATION INTELLIGENCE
1. What will an “application” look like in 5+ years?
2. What will make that application “intelligent”?
17
=
+
+
Apps
Algos
Data
App Intelligence
MADRONA DATAWARE INVESTMENTS
18
INTELLIGENT
APPS &
SERVICES
DATA
INTELLIGENCE
ENABLING
INFRASTRUCTURE
AGILEDATASTACK
YIELDEX
DATO
BOOMERANG
JOBALINE HIGHSPOTBIZIBLE
PLACED
M
A
X
P
O
I
N
T
A
P
P
T
I
O
S
E
E
Q
Q
U
M
U
L
O
C
O
N
T
E
X
T
R
E
L
E
V
A
N
T
ALGORITHMIA
IGNEOUS
I
C
E
B
R
G
E
X
T
R
A
H
O
P
Fund III Fund IV Fund V
Appendix
19
Dataware Case Study: Apptio
20
Category: “Full Stack”
Focus: Data-driven enterprise SAAS for CIO & team to run the
business of IT (TBM)
Revenue: $100M+
Lineage: Startups, HP, IBM/rational
Keys: • Combine legacy General Ledger & modern usage data to
“cost” services and share with users
• Define industry data & metadata standard – ATUM
• Deliver real-time enterprise SAAS solution
Investors: Madrona Venture Group, Greylock Partners, Shasta
Ventures, Andreessen Horowitz, T. Rowe Price
Dataware Case Study: Cloudera
21
Category: Enabling Infrastructure
Focus: Became the industry standard for extracting, storing and
managing a variety of data types so that they can enable
data intelligence and data-driven services to suceed
Revenue: $100M+
Lineage: Hadoop, Open Source, Google, UW
Keys: • Early player in being a diverse, indexed data store
• Helped define the “file system”, called HDFS, for
managing large-scale data stores
• Attempting to be the underlying platform for dataware
Investors: Accel Partners, Greylock Partners, Intel, T. Rowe Price
Dataware Case Study: Dato
22
Category: Data Intelligence
Focus: Leverage machine learning and various data types from
inspiration to insight and to build scalable, predictive and
recommendation systems
Revenue: < $10M
Lineage: UW, Carnegie Mellon
Keys: • Use S-frames to combine graph, table, text & image
data types
• Build an “end to end” data intelligence system from
prototype to production
• Deliver predictive and recommender systems as services
or stand alone applications for business customers
Investors: Madrona Venture Group, NEA, Vulcan
Dataware Case Study: Placed.com
23
Category: App Intelligence
Focus: Combine location database & active panel data to analyze
and optimize advertising and marketing programs
Revenue: < $10M
Lineage: Farecast, Quantcast, aQuantive
Keys: • Leverage data science to build highly accurate place
database
• Create statistically significant panels to measure
physical world impact of digital advertising
• Embed service into mobile add ecosystem to deliver
actionable insights
Investors: Madrona Venture Group, Two Sigma
Dataware Case Study: Trifacta
24
Category: Continuous Data Pipeline
Focus: Automate the process of cleaning, normalizing and
preparing data for “Data Intelligence” use cases
Revenue: Unknown
Lineage: Stanford (Jeff Herr), Cal (Joe Hellerstein)
Keys: • Focus on core “Data Wrangling” problem
• Use machine learning to recognize patterns & suggest
automated fixes
• Simple visualization/UI
Investors: Greylock Partners, Accel Partners, Ignition Partners

More Related Content

What's hot

What are the 6 elements of a project
What are the 6 elements of a projectWhat are the 6 elements of a project
What are the 6 elements of a projectRichardPierce28
 
Maximize the Value of Your Data: Neo4j Graph Data Platform
Maximize the Value of Your Data: Neo4j Graph Data PlatformMaximize the Value of Your Data: Neo4j Graph Data Platform
Maximize the Value of Your Data: Neo4j Graph Data PlatformNeo4j
 
Modernizing the Analytics and Data Science Lifecycle for the Scalable Enterpr...
Modernizing the Analytics and Data Science Lifecycle for the Scalable Enterpr...Modernizing the Analytics and Data Science Lifecycle for the Scalable Enterpr...
Modernizing the Analytics and Data Science Lifecycle for the Scalable Enterpr...Data Con LA
 
Why Everything You Know About bigdata Is A Lie
Why Everything You Know About bigdata Is A LieWhy Everything You Know About bigdata Is A Lie
Why Everything You Know About bigdata Is A LieSunil Ranka
 
Data-Centric Business Transformation Using Knowledge Graphs
Data-Centric Business Transformation Using Knowledge GraphsData-Centric Business Transformation Using Knowledge Graphs
Data-Centric Business Transformation Using Knowledge GraphsAlan Morrison
 
Big Data, NoSQL, NewSQL & The Future of Data Management
Big Data, NoSQL, NewSQL & The Future of Data ManagementBig Data, NoSQL, NewSQL & The Future of Data Management
Big Data, NoSQL, NewSQL & The Future of Data ManagementTony Bain
 
Down to Business: Taking Action Quickly with Linked Data Services
Down to Business: Taking Action Quickly with Linked Data ServicesDown to Business: Taking Action Quickly with Linked Data Services
Down to Business: Taking Action Quickly with Linked Data ServicesInside Analysis
 
Big Data : From HindSight to Insight to Foresight
Big Data : From HindSight to Insight to ForesightBig Data : From HindSight to Insight to Foresight
Big Data : From HindSight to Insight to ForesightSunil Ranka
 
Modern Manufacturing: 4 Ways Data is Transforming the Industry
Modern Manufacturing: 4 Ways Data is Transforming the IndustryModern Manufacturing: 4 Ways Data is Transforming the Industry
Modern Manufacturing: 4 Ways Data is Transforming the IndustryTableau Software
 
A Big Data Journey
A Big Data JourneyA Big Data Journey
A Big Data JourneyPaul Boal
 
Make AI & BI work at Scale
Make AI & BI work at ScaleMake AI & BI work at Scale
Make AI & BI work at ScaleSteve Nouri
 
How to Build Data Science Teams
How to Build Data Science TeamsHow to Build Data Science Teams
How to Build Data Science TeamsGanes Kesari
 
ATAAS2016 - Big data analytics – data visualization himanshu and santosh
ATAAS2016 - Big data analytics – data visualization   himanshu and santoshATAAS2016 - Big data analytics – data visualization   himanshu and santosh
ATAAS2016 - Big data analytics – data visualization himanshu and santoshAgile Testing Alliance
 
Data centric business and knowledge graph trends
Data centric business and knowledge graph trendsData centric business and knowledge graph trends
Data centric business and knowledge graph trendsAlan Morrison
 
A Pragmatic AI Maturity Model
A Pragmatic AI Maturity ModelA Pragmatic AI Maturity Model
A Pragmatic AI Maturity ModelDATAVERSITY
 
Simplifying Building Automation: Leveraging Semantic Tagging with a New Breed...
Simplifying Building Automation: Leveraging Semantic Tagging with a New Breed...Simplifying Building Automation: Leveraging Semantic Tagging with a New Breed...
Simplifying Building Automation: Leveraging Semantic Tagging with a New Breed...Memoori
 
Transport routing optimization
Transport routing optimizationTransport routing optimization
Transport routing optimizationMaarten Van Oost
 
Location decisions Center of Gravity
Location decisions Center of GravityLocation decisions Center of Gravity
Location decisions Center of GravityMaarten Van Oost
 
Rise of the Data Democracy
Rise of the Data DemocracyRise of the Data Democracy
Rise of the Data DemocracyBrendan Aldrich
 
Bg linkedin bigdata_martinschultz_symposium_yale_oct2012
Bg linkedin bigdata_martinschultz_symposium_yale_oct2012Bg linkedin bigdata_martinschultz_symposium_yale_oct2012
Bg linkedin bigdata_martinschultz_symposium_yale_oct2012Bhaskar Ghosh
 

What's hot (20)

What are the 6 elements of a project
What are the 6 elements of a projectWhat are the 6 elements of a project
What are the 6 elements of a project
 
Maximize the Value of Your Data: Neo4j Graph Data Platform
Maximize the Value of Your Data: Neo4j Graph Data PlatformMaximize the Value of Your Data: Neo4j Graph Data Platform
Maximize the Value of Your Data: Neo4j Graph Data Platform
 
Modernizing the Analytics and Data Science Lifecycle for the Scalable Enterpr...
Modernizing the Analytics and Data Science Lifecycle for the Scalable Enterpr...Modernizing the Analytics and Data Science Lifecycle for the Scalable Enterpr...
Modernizing the Analytics and Data Science Lifecycle for the Scalable Enterpr...
 
Why Everything You Know About bigdata Is A Lie
Why Everything You Know About bigdata Is A LieWhy Everything You Know About bigdata Is A Lie
Why Everything You Know About bigdata Is A Lie
 
Data-Centric Business Transformation Using Knowledge Graphs
Data-Centric Business Transformation Using Knowledge GraphsData-Centric Business Transformation Using Knowledge Graphs
Data-Centric Business Transformation Using Knowledge Graphs
 
Big Data, NoSQL, NewSQL & The Future of Data Management
Big Data, NoSQL, NewSQL & The Future of Data ManagementBig Data, NoSQL, NewSQL & The Future of Data Management
Big Data, NoSQL, NewSQL & The Future of Data Management
 
Down to Business: Taking Action Quickly with Linked Data Services
Down to Business: Taking Action Quickly with Linked Data ServicesDown to Business: Taking Action Quickly with Linked Data Services
Down to Business: Taking Action Quickly with Linked Data Services
 
Big Data : From HindSight to Insight to Foresight
Big Data : From HindSight to Insight to ForesightBig Data : From HindSight to Insight to Foresight
Big Data : From HindSight to Insight to Foresight
 
Modern Manufacturing: 4 Ways Data is Transforming the Industry
Modern Manufacturing: 4 Ways Data is Transforming the IndustryModern Manufacturing: 4 Ways Data is Transforming the Industry
Modern Manufacturing: 4 Ways Data is Transforming the Industry
 
A Big Data Journey
A Big Data JourneyA Big Data Journey
A Big Data Journey
 
Make AI & BI work at Scale
Make AI & BI work at ScaleMake AI & BI work at Scale
Make AI & BI work at Scale
 
How to Build Data Science Teams
How to Build Data Science TeamsHow to Build Data Science Teams
How to Build Data Science Teams
 
ATAAS2016 - Big data analytics – data visualization himanshu and santosh
ATAAS2016 - Big data analytics – data visualization   himanshu and santoshATAAS2016 - Big data analytics – data visualization   himanshu and santosh
ATAAS2016 - Big data analytics – data visualization himanshu and santosh
 
Data centric business and knowledge graph trends
Data centric business and knowledge graph trendsData centric business and knowledge graph trends
Data centric business and knowledge graph trends
 
A Pragmatic AI Maturity Model
A Pragmatic AI Maturity ModelA Pragmatic AI Maturity Model
A Pragmatic AI Maturity Model
 
Simplifying Building Automation: Leveraging Semantic Tagging with a New Breed...
Simplifying Building Automation: Leveraging Semantic Tagging with a New Breed...Simplifying Building Automation: Leveraging Semantic Tagging with a New Breed...
Simplifying Building Automation: Leveraging Semantic Tagging with a New Breed...
 
Transport routing optimization
Transport routing optimizationTransport routing optimization
Transport routing optimization
 
Location decisions Center of Gravity
Location decisions Center of GravityLocation decisions Center of Gravity
Location decisions Center of Gravity
 
Rise of the Data Democracy
Rise of the Data DemocracyRise of the Data Democracy
Rise of the Data Democracy
 
Bg linkedin bigdata_martinschultz_symposium_yale_oct2012
Bg linkedin bigdata_martinschultz_symposium_yale_oct2012Bg linkedin bigdata_martinschultz_symposium_yale_oct2012
Bg linkedin bigdata_martinschultz_symposium_yale_oct2012
 

Similar to Matt McIlwain opening keynote

Architecting for Big Data: Trends, Tips, and Deployment Options
Architecting for Big Data: Trends, Tips, and Deployment OptionsArchitecting for Big Data: Trends, Tips, and Deployment Options
Architecting for Big Data: Trends, Tips, and Deployment OptionsCaserta
 
BAR360 open data platform presentation at DAMA, Sydney
BAR360 open data platform presentation at DAMA, SydneyBAR360 open data platform presentation at DAMA, Sydney
BAR360 open data platform presentation at DAMA, SydneySai Paravastu
 
Data Virtualization. An Introduction (ASEAN)
Data Virtualization. An Introduction (ASEAN)Data Virtualization. An Introduction (ASEAN)
Data Virtualization. An Introduction (ASEAN)Denodo
 
Keyrus US Information
Keyrus US InformationKeyrus US Information
Keyrus US InformationJulian Tong
 
How Can Analytics Improve Business?
How Can Analytics Improve Business?How Can Analytics Improve Business?
How Can Analytics Improve Business?Inside Analysis
 
Data Virtualization: Introduction and Business Value (UK)
Data Virtualization: Introduction and Business Value (UK)Data Virtualization: Introduction and Business Value (UK)
Data Virtualization: Introduction and Business Value (UK)Denodo
 
Modern Data Management for Federal Modernization
Modern Data Management for Federal ModernizationModern Data Management for Federal Modernization
Modern Data Management for Federal ModernizationDenodo
 
How to Consume Your Data for AI
How to Consume Your Data for AIHow to Consume Your Data for AI
How to Consume Your Data for AIDATAVERSITY
 
Cisco event 6 05 2014v3 wwt only
Cisco event 6 05 2014v3 wwt onlyCisco event 6 05 2014v3 wwt only
Cisco event 6 05 2014v3 wwt onlyArthur_Hansen
 
Big Data's Impact on the Enterprise
Big Data's Impact on the EnterpriseBig Data's Impact on the Enterprise
Big Data's Impact on the EnterpriseCaserta
 
Data Virtualization, a Strategic IT Investment to Build Modern Enterprise Dat...
Data Virtualization, a Strategic IT Investment to Build Modern Enterprise Dat...Data Virtualization, a Strategic IT Investment to Build Modern Enterprise Dat...
Data Virtualization, a Strategic IT Investment to Build Modern Enterprise Dat...Denodo
 
Quicker Insights and Sustainable Business Agility Powered By Data Virtualizat...
Quicker Insights and Sustainable Business Agility Powered By Data Virtualizat...Quicker Insights and Sustainable Business Agility Powered By Data Virtualizat...
Quicker Insights and Sustainable Business Agility Powered By Data Virtualizat...Denodo
 
Analyst Webinar: Discover how a logical data fabric helps organizations avoid...
Analyst Webinar: Discover how a logical data fabric helps organizations avoid...Analyst Webinar: Discover how a logical data fabric helps organizations avoid...
Analyst Webinar: Discover how a logical data fabric helps organizations avoid...Denodo
 
Is your big data journey stalling? Take the Leap with Capgemini and Cloudera
Is your big data journey stalling? Take the Leap with Capgemini and ClouderaIs your big data journey stalling? Take the Leap with Capgemini and Cloudera
Is your big data journey stalling? Take the Leap with Capgemini and ClouderaCloudera, Inc.
 
Rethink Your 2021 Data Management Strategy with Data Virtualization (ASEAN)
Rethink Your 2021 Data Management Strategy with Data Virtualization (ASEAN)Rethink Your 2021 Data Management Strategy with Data Virtualization (ASEAN)
Rethink Your 2021 Data Management Strategy with Data Virtualization (ASEAN)Denodo
 
Data Mesh in Azure using Cloud Scale Analytics (WAF)
Data Mesh in Azure using Cloud Scale Analytics (WAF)Data Mesh in Azure using Cloud Scale Analytics (WAF)
Data Mesh in Azure using Cloud Scale Analytics (WAF)Nathan Bijnens
 
Active Governance Across the Delta Lake with Alation
Active Governance Across the Delta Lake with AlationActive Governance Across the Delta Lake with Alation
Active Governance Across the Delta Lake with AlationDatabricks
 
OpenSistemas Corporate Presentation
OpenSistemas Corporate PresentationOpenSistemas Corporate Presentation
OpenSistemas Corporate PresentationOpenSistemas
 

Similar to Matt McIlwain opening keynote (20)

Architecting for Big Data: Trends, Tips, and Deployment Options
Architecting for Big Data: Trends, Tips, and Deployment OptionsArchitecting for Big Data: Trends, Tips, and Deployment Options
Architecting for Big Data: Trends, Tips, and Deployment Options
 
BAR360 open data platform presentation at DAMA, Sydney
BAR360 open data platform presentation at DAMA, SydneyBAR360 open data platform presentation at DAMA, Sydney
BAR360 open data platform presentation at DAMA, Sydney
 
Data Virtualization. An Introduction (ASEAN)
Data Virtualization. An Introduction (ASEAN)Data Virtualization. An Introduction (ASEAN)
Data Virtualization. An Introduction (ASEAN)
 
Keyrus US Information
Keyrus US InformationKeyrus US Information
Keyrus US Information
 
Keyrus US Information
Keyrus US InformationKeyrus US Information
Keyrus US Information
 
How Can Analytics Improve Business?
How Can Analytics Improve Business?How Can Analytics Improve Business?
How Can Analytics Improve Business?
 
Data Virtualization: Introduction and Business Value (UK)
Data Virtualization: Introduction and Business Value (UK)Data Virtualization: Introduction and Business Value (UK)
Data Virtualization: Introduction and Business Value (UK)
 
Modern Data Management for Federal Modernization
Modern Data Management for Federal ModernizationModern Data Management for Federal Modernization
Modern Data Management for Federal Modernization
 
How to Consume Your Data for AI
How to Consume Your Data for AIHow to Consume Your Data for AI
How to Consume Your Data for AI
 
Cisco event 6 05 2014v3 wwt only
Cisco event 6 05 2014v3 wwt onlyCisco event 6 05 2014v3 wwt only
Cisco event 6 05 2014v3 wwt only
 
Big Data's Impact on the Enterprise
Big Data's Impact on the EnterpriseBig Data's Impact on the Enterprise
Big Data's Impact on the Enterprise
 
Data Virtualization, a Strategic IT Investment to Build Modern Enterprise Dat...
Data Virtualization, a Strategic IT Investment to Build Modern Enterprise Dat...Data Virtualization, a Strategic IT Investment to Build Modern Enterprise Dat...
Data Virtualization, a Strategic IT Investment to Build Modern Enterprise Dat...
 
Quicker Insights and Sustainable Business Agility Powered By Data Virtualizat...
Quicker Insights and Sustainable Business Agility Powered By Data Virtualizat...Quicker Insights and Sustainable Business Agility Powered By Data Virtualizat...
Quicker Insights and Sustainable Business Agility Powered By Data Virtualizat...
 
Just ask Watson Seminar
Just ask Watson SeminarJust ask Watson Seminar
Just ask Watson Seminar
 
Analyst Webinar: Discover how a logical data fabric helps organizations avoid...
Analyst Webinar: Discover how a logical data fabric helps organizations avoid...Analyst Webinar: Discover how a logical data fabric helps organizations avoid...
Analyst Webinar: Discover how a logical data fabric helps organizations avoid...
 
Is your big data journey stalling? Take the Leap with Capgemini and Cloudera
Is your big data journey stalling? Take the Leap with Capgemini and ClouderaIs your big data journey stalling? Take the Leap with Capgemini and Cloudera
Is your big data journey stalling? Take the Leap with Capgemini and Cloudera
 
Rethink Your 2021 Data Management Strategy with Data Virtualization (ASEAN)
Rethink Your 2021 Data Management Strategy with Data Virtualization (ASEAN)Rethink Your 2021 Data Management Strategy with Data Virtualization (ASEAN)
Rethink Your 2021 Data Management Strategy with Data Virtualization (ASEAN)
 
Data Mesh in Azure using Cloud Scale Analytics (WAF)
Data Mesh in Azure using Cloud Scale Analytics (WAF)Data Mesh in Azure using Cloud Scale Analytics (WAF)
Data Mesh in Azure using Cloud Scale Analytics (WAF)
 
Active Governance Across the Delta Lake with Alation
Active Governance Across the Delta Lake with AlationActive Governance Across the Delta Lake with Alation
Active Governance Across the Delta Lake with Alation
 
OpenSistemas Corporate Presentation
OpenSistemas Corporate PresentationOpenSistemas Corporate Presentation
OpenSistemas Corporate Presentation
 

Recently uploaded

08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsSnow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsHyundai Motor Group
 
Next-generation AAM aircraft unveiled by Supernal, S-A2
Next-generation AAM aircraft unveiled by Supernal, S-A2Next-generation AAM aircraft unveiled by Supernal, S-A2
Next-generation AAM aircraft unveiled by Supernal, S-A2Hyundai Motor Group
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksSoftradix Technologies
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...HostedbyConfluent
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your Budget
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your BudgetHyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your Budget
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your BudgetEnjoy Anytime
 

Recently uploaded (20)

08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsSnow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
 
Next-generation AAM aircraft unveiled by Supernal, S-A2
Next-generation AAM aircraft unveiled by Supernal, S-A2Next-generation AAM aircraft unveiled by Supernal, S-A2
Next-generation AAM aircraft unveiled by Supernal, S-A2
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other Frameworks
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food Manufacturing
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
 
Vulnerability_Management_GRC_by Sohang Sengupta.pptx
Vulnerability_Management_GRC_by Sohang Sengupta.pptxVulnerability_Management_GRC_by Sohang Sengupta.pptx
Vulnerability_Management_GRC_by Sohang Sengupta.pptx
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your Budget
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your BudgetHyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your Budget
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your Budget
 

Matt McIlwain opening keynote

  • 1. FROM “BIG DATA” TO DATAWARE SIM Technology Leadership Summit May 20, 2015
  • 2. MADRONA OVERVIEW • Madrona is a leading venture capital firm focused on sourcing and growing early-stage technology companies in the Pacific Northwest • About $1 billion under management across five funds –Investors include the University of Washington, University of Virginia, Irvine Foundation, University of North Carolina, and strategic individuals • Investments made in over 100 companies the past 20 years with over 50 active portfolio companies and over 40 positive exits • Madrona team –7 Managing Directors –Strategic Directors and Venture Partners include: Sujal Patel, Steve Singh, John McAdam, Prof. Oren Etzioni, and Prof. Dan Weld
  • 3. THE PNW TECH ECOSYSTEM IS STRONG AND GROWING Anchor Tenants Large Tech Satellite Offices Mid-Cap Tech with Seattle HQ World-Class Research
  • 4. OUR FUTURE 1995 TODAY (2015) 2035 COMMUNICATION Snail mail, fax, early email SMS, Facebook, Skype, Snapchat & Twitter Virtual Reality Rooms DEVICES Desktop PCs Smart Mobile Devices Embedded on you & everything else (IoT) SOFTWARE/ DATAWARE Packaged/Licensed SaaS subscription/Apps Intelligent apps INTERNET/ CONNECTIVITY Dial up modem 56k “Ubiquitous” broadband 100 Mbps to mobile “Always On” and IoT COMPUTE/STORAGE Pentium processor 100 MIPS Single-core ~$1 million/TB Intel Xeon E7 processor – 4000 MIPS Multi-core $59/TB $5/Petabyte INFRASTRUCTURE Internet & Dedicated servers Cloud Real-time hybrid marketplace COMMERCE 1 book/10 days/$5 delivery Anything 2 days free; 50,000 items in 2 hours free delivery Drones or autonomous car delivery & 3D printed
  • 5. WHAT IS “DATAWARE”? A framework for describing the combination of data, software, math formulas and “predictive” analytics that help data savvy teams turn information and insights into profitable actions. 5 Why Now? • Cloud Enablement: “Cloud” abstracts hardware into software and enables unprecedented elasticity, scale and speed • Big Data: The volume, velocity and variety of data types and stores has expanded rapidly while the value of retaining/leveraging data often exceeds the cost • Legacy “Datastores”: Highly structured and constrained systems (databases, data warehouses, BI tools) that are too rigid to unlock data’s full value yet too ubiquitous and important to NOT leverage • Emerging Solutions: A combination of point solutions, systematic approaches and “vertical” services emerging to leverage these trends in an agile manner. These solutions require a structured framework to prioritize market opportunities
  • 6. INSERT BIG DATA LANDSCAPE SLIDE 6
  • 7. MADRONA DATAWARE FRAMEWORK 7 INTELLIGENT APPS & SERVICES DATA INTELLIGENCE ENABLING INFRASTRUCTURE AgileDataStack Marc Benihoff, Founder and CEO of Salesforce.com, when asked what he thinks is the major tech trend of the next five years responded that we are in an “AI Spring.” Fortune Term Sheet 1/6/15
  • 8. WHAT MAKES THE DATA “BIG”? Value More valuable to store than throw away 8 Variety Different sources & structures create opportunities… & challenges Volume Easy, plentiful & cheap data to collect & store Velocity Speed of turning data into actionable insights – batch vs. real-time!
  • 9. DATA INPUTS • Legacy Databases: Highly structured, transactional focused, generally rigid – Databases with SQL queries (OLTP) – Historic “Extract, Transform, Load” tools (ETL) – Data warehouses and data cubes – Business Intelligence (BI) and “Online Analytics Processing (OLAP)” • “Big Data” Sources: Structure variety, high volume/velocity, agile – “Not Only SQL” (NoSQL) data repositories – Allow for “Extract, Load, Transform” (ELT) flexibility – Continuous, online (streamed) data flows – Relationship focus vs. Relational focus 9
  • 10. Places Things Profiles WHERE DOES DATA & METADATA COME FROM? People • Consumers • Office Workers • Field Workers • Citizens • Partners • Customers • Home • Work • Stores • Destinations • Routes • Individuals • Demographics • Devices • Locations • Objects • “Campaigns” • Biology • “Networks" • Devices • Vehicles • Machines • Medical • Homes • Content
  • 11. WHY DOES IT MATTER? From To Structure Mostly structured (relational) Flexibly structured (relationship) Flexibility Rigid & slow (R + cubes +BI) Agile & rapid (Python + graphs/ML + UI) Availability Offline & batch Online & continuous Key Drivers Code & “Rules” (“hard coded”, structured learning) Data, Statistics, Discovery (“machine learned”, “inferred”, Bayesian) Conceptually Certainty & consistency Iteration & “surprise” 11
  • 12. TECHNOLOGY SECTOR IMPACT OF “DATAWARE” YEARS: 0 – 2 2 – 5 5+ Relational Databases (Oracle, MSFT) + ?? - Traditional Infrastructure (HP, IBM, Dell) + - -- Traditional Apps (Oracle, SAP) + +/- - Cloud Infrastructure ++ ++ + SAAS ++ ++ +/- 12
  • 13. BIG COMPANY “LEADING INDICATORS” • Microsoft-AzureML, Revolution Analytics, much more • HP reorganizes software business around “Big Data” • Salesforce.com buys RelateIQ for $390M for “data cloud” • Oracle builds “data cloud” team including Blue Kai and Datalogix • SAP promotes HANA, buys Concur • IBM advertises Watson, Blue Mix • AWS – AmazonML, Lambda, Kinesis 13
  • 14. KEY QUESTIONS • How do big, especially software-driven, companies unlock their “data silos”? • How will traditional databases/warehouses, newer “big data” stores and integrated big data “lakes” compliment or compete? • What models will emerge to capture value in “data intelligence”? • To what extent can intelligent apps and services disrupt legacy apps/services? 14
  • 15. MADRONA DATAWARE FRAMEWORK 15 INTELLIGENT APPS & SERVICES DATA INTELLIGENCE ENABLING INFRASTRUCTURE AgileDataStack
  • 16. KEYS TO EMBRACING DATAWARE 1. Enabling infrastructure complex (Hadoop/Cloudera, NoSQL/MongoDB, Spark, Legacy) & hard/expensive but getting simplified and cheaper 2. Data Intelligence holds big promise but scarcity of “data scientists” requires professional services (Dato, Context Relevant, Atigeo, Palantir) and systematic, standardized approaches from emerging companies 3. Early “App Intelligence” that is real-time and agile already exists (ad serving, content recommendations, personalization, vertical markets). Tremendous opportunity here to reinvent categories 4. Opportunities also exist in the data pipeline (Trifacta) and data management, but tend to be deeper technical systems 16
  • 17. APPLICATION INTELLIGENCE 1. What will an “application” look like in 5+ years? 2. What will make that application “intelligent”? 17 = + + Apps Algos Data App Intelligence
  • 18. MADRONA DATAWARE INVESTMENTS 18 INTELLIGENT APPS & SERVICES DATA INTELLIGENCE ENABLING INFRASTRUCTURE AGILEDATASTACK YIELDEX DATO BOOMERANG JOBALINE HIGHSPOTBIZIBLE PLACED M A X P O I N T A P P T I O S E E Q Q U M U L O C O N T E X T R E L E V A N T ALGORITHMIA IGNEOUS I C E B R G E X T R A H O P Fund III Fund IV Fund V
  • 20. Dataware Case Study: Apptio 20 Category: “Full Stack” Focus: Data-driven enterprise SAAS for CIO & team to run the business of IT (TBM) Revenue: $100M+ Lineage: Startups, HP, IBM/rational Keys: • Combine legacy General Ledger & modern usage data to “cost” services and share with users • Define industry data & metadata standard – ATUM • Deliver real-time enterprise SAAS solution Investors: Madrona Venture Group, Greylock Partners, Shasta Ventures, Andreessen Horowitz, T. Rowe Price
  • 21. Dataware Case Study: Cloudera 21 Category: Enabling Infrastructure Focus: Became the industry standard for extracting, storing and managing a variety of data types so that they can enable data intelligence and data-driven services to suceed Revenue: $100M+ Lineage: Hadoop, Open Source, Google, UW Keys: • Early player in being a diverse, indexed data store • Helped define the “file system”, called HDFS, for managing large-scale data stores • Attempting to be the underlying platform for dataware Investors: Accel Partners, Greylock Partners, Intel, T. Rowe Price
  • 22. Dataware Case Study: Dato 22 Category: Data Intelligence Focus: Leverage machine learning and various data types from inspiration to insight and to build scalable, predictive and recommendation systems Revenue: < $10M Lineage: UW, Carnegie Mellon Keys: • Use S-frames to combine graph, table, text & image data types • Build an “end to end” data intelligence system from prototype to production • Deliver predictive and recommender systems as services or stand alone applications for business customers Investors: Madrona Venture Group, NEA, Vulcan
  • 23. Dataware Case Study: Placed.com 23 Category: App Intelligence Focus: Combine location database & active panel data to analyze and optimize advertising and marketing programs Revenue: < $10M Lineage: Farecast, Quantcast, aQuantive Keys: • Leverage data science to build highly accurate place database • Create statistically significant panels to measure physical world impact of digital advertising • Embed service into mobile add ecosystem to deliver actionable insights Investors: Madrona Venture Group, Two Sigma
  • 24. Dataware Case Study: Trifacta 24 Category: Continuous Data Pipeline Focus: Automate the process of cleaning, normalizing and preparing data for “Data Intelligence” use cases Revenue: Unknown Lineage: Stanford (Jeff Herr), Cal (Joe Hellerstein) Keys: • Focus on core “Data Wrangling” problem • Use machine learning to recognize patterns & suggest automated fixes • Simple visualization/UI Investors: Greylock Partners, Accel Partners, Ignition Partners