SlideShare a Scribd company logo
BUILDING A SUCCESSFUL 
DATA STARTUP 
Chris Neumann | 
@ckneumann 
© 2014 Datahero, Inc.
BUILDING 
DATA COMPANIES 
IS HARD 
© 2014 Datahero, Inc.
STARTUPS BEGIN LIKE 
THIS 
© 2014 Datahero, Inc.
DATA STARTUPS ARE 
ABOUT 
TIMING 
© 2014 Datahero, Inc.
IT’S ABOUT TIME 
Successful data startups recognize 
fundamental shifts in technology or business 
processes 
and create solutions that address new pain 
points 
© 2014 Datahero, Inc.
IT’S ABOUT TIMING 
• The Data Warehouse market resulted from a need for 
businesses to have a “single source of truth” 
• The Big Data market resulted from the dramatic increase 
in data volumes from machine-generated data 
• The Cloud BI market is emerging as a result of the 
decentralization of enterprise data as services move to 
the cloud 
© 2014 Datahero, Inc.
BIG DATA 
© 2014 Datahero, Inc.
BEFORE 
• Companies generally analyzed relatively 
small volumes of (transactional) data 
• Larger volumes of data could be stored in 
data warehouses, but calculations were 
relatively simple (aggregates, basic 
business metrics, etc. 
© 2014 Datahero, Inc.
THE SHIFT 
• Machine-generated data, such as from web server logs, 
provided far more granular information about customers 
• Companies wanted to perform complex analysis on these 
larger volumes of data in order to better understand 
customer behavior 
– There was also the potential for entirely new 
businesses built around data 
© 2014 Datahero, Inc.
THE CHALLENGE 
• The volume of structured data companies wanted to 
analyze was becoming larger than what could fit in a 
single server 
The rate of growth of data now exceeded the rate of 
increase of storage density 
• This shifted the performance bottleneck to the network 
© 2014 Datahero, Inc.
THE SOLUTIONS 
• First Generation: Fix it with hardware! 
– Make servers bigger (Teradata) 
– Make the network faster (Netezza) 
• Second Generation: Fix it with software! 
(Aster Data, Greenplum, Vertica) 
– Be smarter about where we store the data 
– Be smarter about when we move the data 
– Be smarter about how we move the data 
© 2014 Datahero, Inc.
THE SOLUTIONS 
Of the five original “Big Data” startups: 
4 were acquired in a span of less 
than a year for more than $2.5B total 
The fifth was one of the acquirers 
© 2014 Datahero, Inc.
CLOUD DATA 
© 2014 Datahero, Inc.
BEFORE 
• The data business users wanted to analyze was 
generated by on-premises software 
• Centralized data stores (EDWs) were used to aggregate 
data from a small number of strategic sources 
• BI teams would create reports for business users to 
access 
© 2014 Datahero, Inc.
THE SHIFT 
• Over the past five years, business software is being 
replaced with cloud services, the vast majority of which 
are departmental 
• Company data is no longer stored primarily in on-premises 
systems, but is increasingly found in the cloud 
• For the first time in more than 20 years, company data is 
becoming decentralized 
© 2014 Datahero, Inc.
THE CHALLENGE 
• Companies now have a large number of remote data 
sources each used by a small number of users 
For the first time ever, business users have direct 
access to their data 
• Users now have access to the data they want to work 
with, but don’t have the tools to take advantage of it 
© 2014 Datahero, Inc.
THE SOLUTIONS 
• First Generation: Pull the data back! 
– Custom integrations to pull cloud data down into on-premises data warehouses 
– Put the data warehouse in the cloud…and then pull the rest of the data in (GoodData, Birst, RJMetrics) 
• Second Generation: Leave the data where it is! 
(DataHero, SumAll) 
– Treat cloud business services as the systems of record 
– Take advantage of existing security and permission models 
– Eliminate the process bottleneck by empowering users to connect directly to the services they need 
© 2014 Datahero, Inc.
CUSTOM CONNECTORS 
• High-speed connectors built in collaboration with 
partners for optimal performance 
• Robust, extensible framework supports rapid 
development of new connectors 
• Secure integrations leverage partner security models 
for consistent data visibility 
DATA DECODER 
• Advanced classification algorithms identify and 
normalize data types across services and files 
• Semantic types such as URLs, Email Addresses and 
Lists extend traditional data types to provide added 
metadata 
• Confidence intervals drive an intuitive feedback 
interface with users 
EXTENSIBLE CONNECTION 
FRAMEWORK 
DATA DECODER 
C 
O 
N 
N 
E 
C 
T 
O 
R 
C 
O 
N 
N 
E 
C 
T 
O 
R 
C 
O 
N 
N 
E 
C 
T 
O 
R 
THE SOLUTIONS
THE SOLUTIONS 
INTUITIVE HTML5 INTERFACE 
• Intuitive drag-and-drop interface created 
through user-centric design process involving 
thousands of hours of user testing and 
hundreds of users 
“NO CODE” DATA COMBINATIONS 
• Intuitive interface enables business users to 
combine (join) data across services and 
spreadsheets without coding or SQL 
• Recommendation engine suggests common 
keys based on metadata derived by the Data 
Decoder
IDENTIFY THE SHIFT, 
THEN FIND THE PAIN 
POINT 
© 2014 Datahero, Inc.
BE THE PAINKILLER 
• Every major shift in technology and/or 
business process results in new 
opportunities and new pain points 
– Many of those pain points will have easy 
solutions 
– A few will require fundamentally new 
approaches 
© 2014 Datahero, Inc.
SO WHERE 
ARE WE? 
© 2014 Datahero, Inc.
© 2014 Datahero, Inc.
© 2014 Datahero, Inc.
© 2014 Datahero, Inc.
WE’RE HIRING! 
DataHero.com/jobs 
© 2014 Datahero, Inc.
chris@datahero.com 
@ckneumann 
© 2014 Datahero, Inc.

More Related Content

What's hot

Accelerate Digital Transformation with Data Virtualization in Banking, Financ...
Accelerate Digital Transformation with Data Virtualization in Banking, Financ...Accelerate Digital Transformation with Data Virtualization in Banking, Financ...
Accelerate Digital Transformation with Data Virtualization in Banking, Financ...
Denodo
 
Data Services and the Modern Data Ecosystem (Middle East)
Data Services and the Modern Data Ecosystem (Middle East)Data Services and the Modern Data Ecosystem (Middle East)
Data Services and the Modern Data Ecosystem (Middle East)
Denodo
 
Value proposition for big data isv partners 0714
Value proposition for big data isv partners 0714Value proposition for big data isv partners 0714
Value proposition for big data isv partners 0714
Niu Bai
 
A Successful Data Strategy for Insurers in Volatile Times (ASEAN)
A Successful Data Strategy for Insurers in Volatile Times (ASEAN)A Successful Data Strategy for Insurers in Volatile Times (ASEAN)
A Successful Data Strategy for Insurers in Volatile Times (ASEAN)
Denodo
 

What's hot (20)

Accelerate Digital Transformation with Data Virtualization in Banking, Financ...
Accelerate Digital Transformation with Data Virtualization in Banking, Financ...Accelerate Digital Transformation with Data Virtualization in Banking, Financ...
Accelerate Digital Transformation with Data Virtualization in Banking, Financ...
 
Reinvent Your Data Management Strategy for Successful Digital Transformation
Reinvent Your Data Management Strategy for Successful Digital TransformationReinvent Your Data Management Strategy for Successful Digital Transformation
Reinvent Your Data Management Strategy for Successful Digital Transformation
 
Best Practices for Building a Warehouse Quickly
Best Practices for Building a Warehouse QuicklyBest Practices for Building a Warehouse Quickly
Best Practices for Building a Warehouse Quickly
 
A Big Data Journey
A Big Data JourneyA Big Data Journey
A Big Data Journey
 
Reinventing and Simplifying Data Management for a Successful Hybrid and Multi...
Reinventing and Simplifying Data Management for a Successful Hybrid and Multi...Reinventing and Simplifying Data Management for a Successful Hybrid and Multi...
Reinventing and Simplifying Data Management for a Successful Hybrid and Multi...
 
Self Service Analytics enabled by Data Virtualization from Denodo
Self Service Analytics enabled by Data Virtualization from DenodoSelf Service Analytics enabled by Data Virtualization from Denodo
Self Service Analytics enabled by Data Virtualization from Denodo
 
Data Lakehouse, Data Mesh, and Data Fabric (r2)
Data Lakehouse, Data Mesh, and Data Fabric (r2)Data Lakehouse, Data Mesh, and Data Fabric (r2)
Data Lakehouse, Data Mesh, and Data Fabric (r2)
 
Role of Unified AI and ML in Cloud Technologies. Which Cloud Service Provider...
Role of Unified AI and ML in Cloud Technologies. Which Cloud Service Provider...Role of Unified AI and ML in Cloud Technologies. Which Cloud Service Provider...
Role of Unified AI and ML in Cloud Technologies. Which Cloud Service Provider...
 
Logical Data Fabric: Maturing Implementation from Small to Big (APAC)
Logical Data Fabric: Maturing Implementation from Small to Big (APAC)Logical Data Fabric: Maturing Implementation from Small to Big (APAC)
Logical Data Fabric: Maturing Implementation from Small to Big (APAC)
 
Migrating to the Cloud – Is Application Performance Monitoring still required?
Migrating to the Cloud – Is Application Performance Monitoring still required?Migrating to the Cloud – Is Application Performance Monitoring still required?
Migrating to the Cloud – Is Application Performance Monitoring still required?
 
Introduction to Modern Data Virtualization 2021 (APAC)
Introduction to Modern Data Virtualization 2021 (APAC)Introduction to Modern Data Virtualization 2021 (APAC)
Introduction to Modern Data Virtualization 2021 (APAC)
 
4 ways to cut your e discovery costs in half-webinar-exterro-druva
4 ways to cut your e discovery costs in half-webinar-exterro-druva4 ways to cut your e discovery costs in half-webinar-exterro-druva
4 ways to cut your e discovery costs in half-webinar-exterro-druva
 
Data Services and the Modern Data Ecosystem (Middle East)
Data Services and the Modern Data Ecosystem (Middle East)Data Services and the Modern Data Ecosystem (Middle East)
Data Services and the Modern Data Ecosystem (Middle East)
 
[XConf Brasil 2020] Data mesh
[XConf Brasil 2020] Data mesh[XConf Brasil 2020] Data mesh
[XConf Brasil 2020] Data mesh
 
Value proposition for big data isv partners 0714
Value proposition for big data isv partners 0714Value proposition for big data isv partners 0714
Value proposition for big data isv partners 0714
 
Foundational Strategies for Trusted Data: Getting Your Data to the Cloud
Foundational Strategies for Trusted Data: Getting Your Data to the CloudFoundational Strategies for Trusted Data: Getting Your Data to the Cloud
Foundational Strategies for Trusted Data: Getting Your Data to the Cloud
 
How to Crunch Petabytes with Hadoop and Big Data Using InfoSphere BigInsights...
How to Crunch Petabytes with Hadoop and Big Data Using InfoSphere BigInsights...How to Crunch Petabytes with Hadoop and Big Data Using InfoSphere BigInsights...
How to Crunch Petabytes with Hadoop and Big Data Using InfoSphere BigInsights...
 
Big Data Analytics on the Cloud
Big Data Analytics on the CloudBig Data Analytics on the Cloud
Big Data Analytics on the Cloud
 
A Successful Data Strategy for Insurers in Volatile Times (ASEAN)
A Successful Data Strategy for Insurers in Volatile Times (ASEAN)A Successful Data Strategy for Insurers in Volatile Times (ASEAN)
A Successful Data Strategy for Insurers in Volatile Times (ASEAN)
 
FOSS Sea 2014_DataWarehouse & BigData_Владимир Слободянюк ( Luxoft)
FOSS Sea 2014_DataWarehouse & BigData_Владимир Слободянюк ( Luxoft)FOSS Sea 2014_DataWarehouse & BigData_Владимир Слободянюк ( Luxoft)
FOSS Sea 2014_DataWarehouse & BigData_Владимир Слободянюк ( Luxoft)
 

Viewers also liked

Viewers also liked (19)

Sisense Business Intelligence v6.0.0 r2 kor
Sisense  Business Intelligence v6.0.0 r2 kor Sisense  Business Intelligence v6.0.0 r2 kor
Sisense Business Intelligence v6.0.0 r2 kor
 
Klipfolio stratigent webinar
Klipfolio stratigent webinarKlipfolio stratigent webinar
Klipfolio stratigent webinar
 
Presentation Netvibes
Presentation NetvibesPresentation Netvibes
Presentation Netvibes
 
Jeff Sauer @ All Things DATA 2016 - Automating Google Analytics
Jeff Sauer @ All Things DATA 2016 - Automating Google AnalyticsJeff Sauer @ All Things DATA 2016 - Automating Google Analytics
Jeff Sauer @ All Things DATA 2016 - Automating Google Analytics
 
Service Cloud Mobile Dashboard
Service Cloud Mobile DashboardService Cloud Mobile Dashboard
Service Cloud Mobile Dashboard
 
Case study on survey data analysis with Klipfolio
Case study on survey data analysis with KlipfolioCase study on survey data analysis with Klipfolio
Case study on survey data analysis with Klipfolio
 
DataHero / Eventbrite - API Best Practices
DataHero / Eventbrite - API Best PracticesDataHero / Eventbrite - API Best Practices
DataHero / Eventbrite - API Best Practices
 
GAUCbe 2015 - Dashboard Building - Involving clients to find the right metric...
GAUCbe 2015 - Dashboard Building - Involving clients to find the right metric...GAUCbe 2015 - Dashboard Building - Involving clients to find the right metric...
GAUCbe 2015 - Dashboard Building - Involving clients to find the right metric...
 
Grafana Review
Grafana ReviewGrafana Review
Grafana Review
 
Richslide for enterprise
Richslide for enterpriseRichslide for enterprise
Richslide for enterprise
 
Klipfolio The 8 must have metrics for your marketing dashboard
Klipfolio   The 8 must have metrics for your marketing dashboardKlipfolio   The 8 must have metrics for your marketing dashboard
Klipfolio The 8 must have metrics for your marketing dashboard
 
관제, IoT/Wearable에서 주목할 정보행동
관제, IoT/Wearable에서 주목할 정보행동관제, IoT/Wearable에서 주목할 정보행동
관제, IoT/Wearable에서 주목할 정보행동
 
Difference between business intelligence, business analytics, and business an...
Difference between business intelligence, business analytics, and business an...Difference between business intelligence, business analytics, and business an...
Difference between business intelligence, business analytics, and business an...
 
Pbi iot data viz
Pbi iot data vizPbi iot data viz
Pbi iot data viz
 
Groupby -Power bi dashboard in hour by vishal pawar-Presentation
Groupby -Power bi dashboard in hour by vishal pawar-Presentation Groupby -Power bi dashboard in hour by vishal pawar-Presentation
Groupby -Power bi dashboard in hour by vishal pawar-Presentation
 
BI Dashboard Formula Methodology: How to make your first big data visualizati...
BI Dashboard Formula Methodology: How to make your first big data visualizati...BI Dashboard Formula Methodology: How to make your first big data visualizati...
BI Dashboard Formula Methodology: How to make your first big data visualizati...
 
TWP Meetup
TWP MeetupTWP Meetup
TWP Meetup
 
Building a Real Time Dashboard with Amazon Kinesis, Amazon Lambda and Amazon ...
Building a Real Time Dashboard with Amazon Kinesis, Amazon Lambda and Amazon ...Building a Real Time Dashboard with Amazon Kinesis, Amazon Lambda and Amazon ...
Building a Real Time Dashboard with Amazon Kinesis, Amazon Lambda and Amazon ...
 
BI Dashboard Best Practices Webinar 2016 (Slides)
BI Dashboard Best Practices Webinar 2016 (Slides) BI Dashboard Best Practices Webinar 2016 (Slides)
BI Dashboard Best Practices Webinar 2016 (Slides)
 

Similar to Dataweek Presentation from Chris Neumann

Data Mesh in Azure using Cloud Scale Analytics (WAF)
Data Mesh in Azure using Cloud Scale Analytics (WAF)Data Mesh in Azure using Cloud Scale Analytics (WAF)
Data Mesh in Azure using Cloud Scale Analytics (WAF)
Nathan Bijnens
 

Similar to Dataweek Presentation from Chris Neumann (20)

The Future of Data Management: The Enterprise Data Hub
The Future of Data Management: The Enterprise Data HubThe Future of Data Management: The Enterprise Data Hub
The Future of Data Management: The Enterprise Data Hub
 
Accelerate Cloud Migrations and Architecture with Data Virtualization
Accelerate Cloud Migrations and Architecture with Data VirtualizationAccelerate Cloud Migrations and Architecture with Data Virtualization
Accelerate Cloud Migrations and Architecture with Data Virtualization
 
Self-Service Analytics with Guard Rails
Self-Service Analytics with Guard RailsSelf-Service Analytics with Guard Rails
Self-Service Analytics with Guard Rails
 
The Emerging Data Lake IT Strategy
The Emerging Data Lake IT StrategyThe Emerging Data Lake IT Strategy
The Emerging Data Lake IT Strategy
 
Big Data with Data Virtualization (session 3 from Packed Lunch Webinar Series)
Big Data with Data Virtualization (session 3 from Packed Lunch Webinar Series)Big Data with Data Virtualization (session 3 from Packed Lunch Webinar Series)
Big Data with Data Virtualization (session 3 from Packed Lunch Webinar Series)
 
SOA with Data Virtualization (session 4 from Packed Lunch Webinar Series)
SOA with Data Virtualization (session 4 from Packed Lunch Webinar Series)SOA with Data Virtualization (session 4 from Packed Lunch Webinar Series)
SOA with Data Virtualization (session 4 from Packed Lunch Webinar Series)
 
Getting Started with Big Data for Business Managers
Getting Started with Big Data for Business ManagersGetting Started with Big Data for Business Managers
Getting Started with Big Data for Business Managers
 
Foundational Strategies for Trust in Big Data Part 1: Getting Data to the Pla...
Foundational Strategies for Trust in Big Data Part 1: Getting Data to the Pla...Foundational Strategies for Trust in Big Data Part 1: Getting Data to the Pla...
Foundational Strategies for Trust in Big Data Part 1: Getting Data to the Pla...
 
CSC - Presentation at Hortonworks Booth - Strata 2014
CSC - Presentation at Hortonworks Booth - Strata 2014CSC - Presentation at Hortonworks Booth - Strata 2014
CSC - Presentation at Hortonworks Booth - Strata 2014
 
The Journey to Success with Big Data
The Journey to Success with Big DataThe Journey to Success with Big Data
The Journey to Success with Big Data
 
Making Hadoop based analytics simple for everyone to use
Making Hadoop based analytics simple for everyone to useMaking Hadoop based analytics simple for everyone to use
Making Hadoop based analytics simple for everyone to use
 
MongoDB IoT City Tour STUTTGART: Hadoop and future data management. By, Cloudera
MongoDB IoT City Tour STUTTGART: Hadoop and future data management. By, ClouderaMongoDB IoT City Tour STUTTGART: Hadoop and future data management. By, Cloudera
MongoDB IoT City Tour STUTTGART: Hadoop and future data management. By, Cloudera
 
Building a Modern Analytic Database with Cloudera 5.8
Building a Modern Analytic Database with Cloudera 5.8Building a Modern Analytic Database with Cloudera 5.8
Building a Modern Analytic Database with Cloudera 5.8
 
[DSC Europe 23] Ivan Dundovic - How To Treat Your Data As A Product.pptx
[DSC Europe 23] Ivan Dundovic - How To Treat Your Data As A Product.pptx[DSC Europe 23] Ivan Dundovic - How To Treat Your Data As A Product.pptx
[DSC Europe 23] Ivan Dundovic - How To Treat Your Data As A Product.pptx
 
Data Mesh in Azure using Cloud Scale Analytics (WAF)
Data Mesh in Azure using Cloud Scale Analytics (WAF)Data Mesh in Azure using Cloud Scale Analytics (WAF)
Data Mesh in Azure using Cloud Scale Analytics (WAF)
 
Using Big Data to Transform Your Customer’s Experience - Part 1

Using Big Data to Transform Your Customer’s Experience - Part 1
Using Big Data to Transform Your Customer’s Experience - Part 1

Using Big Data to Transform Your Customer’s Experience - Part 1

 
Big Data Everywhere Chicago: Platfora - Practices for Customer Analytics on H...
Big Data Everywhere Chicago: Platfora - Practices for Customer Analytics on H...Big Data Everywhere Chicago: Platfora - Practices for Customer Analytics on H...
Big Data Everywhere Chicago: Platfora - Practices for Customer Analytics on H...
 
Analyst Webinar: Best Practices In Enabling Data-Driven Decision Making
Analyst Webinar: Best Practices In Enabling Data-Driven Decision MakingAnalyst Webinar: Best Practices In Enabling Data-Driven Decision Making
Analyst Webinar: Best Practices In Enabling Data-Driven Decision Making
 
Make from your it department a competitive differentiator for your business
Make from your it department a competitive differentiator for your businessMake from your it department a competitive differentiator for your business
Make from your it department a competitive differentiator for your business
 
Big Data LDN 2018: REALISING THE PROMISE OF SELF-SERVICE ANALYTICS WITH DATA ...
Big Data LDN 2018: REALISING THE PROMISE OF SELF-SERVICE ANALYTICS WITH DATA ...Big Data LDN 2018: REALISING THE PROMISE OF SELF-SERVICE ANALYTICS WITH DATA ...
Big Data LDN 2018: REALISING THE PROMISE OF SELF-SERVICE ANALYTICS WITH DATA ...
 

Recently uploaded

Professional Data Engineer Certification Exam Guide  _  Learn  _  Google Clou...
Professional Data Engineer Certification Exam Guide  _  Learn  _  Google Clou...Professional Data Engineer Certification Exam Guide  _  Learn  _  Google Clou...
Professional Data Engineer Certification Exam Guide  _  Learn  _  Google Clou...
Domenico Conte
 
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
vcaxypu
 
Investigate & Recover / StarCompliance.io / Crypto_Crimes
Investigate & Recover / StarCompliance.io / Crypto_CrimesInvestigate & Recover / StarCompliance.io / Crypto_Crimes
Investigate & Recover / StarCompliance.io / Crypto_Crimes
StarCompliance.io
 
Empowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptxEmpowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptx
benishzehra469
 
standardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghhstandardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghh
ArpitMalhotra16
 
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
yhkoc
 
一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单
ewymefz
 
一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单
enxupq
 
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
ewymefz
 
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
ewymefz
 
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
ewymefz
 
Opendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptxOpendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptx
Opendatabay
 

Recently uploaded (20)

Professional Data Engineer Certification Exam Guide  _  Learn  _  Google Clou...
Professional Data Engineer Certification Exam Guide  _  Learn  _  Google Clou...Professional Data Engineer Certification Exam Guide  _  Learn  _  Google Clou...
Professional Data Engineer Certification Exam Guide  _  Learn  _  Google Clou...
 
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
 
Investigate & Recover / StarCompliance.io / Crypto_Crimes
Investigate & Recover / StarCompliance.io / Crypto_CrimesInvestigate & Recover / StarCompliance.io / Crypto_Crimes
Investigate & Recover / StarCompliance.io / Crypto_Crimes
 
Empowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptxEmpowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptx
 
Tabula.io Cheatsheet: automate your data workflows
Tabula.io Cheatsheet: automate your data workflowsTabula.io Cheatsheet: automate your data workflows
Tabula.io Cheatsheet: automate your data workflows
 
standardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghhstandardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghh
 
Webinar One View, Multiple Systems No-Code Integration of Salesforce and ERPs
Webinar One View, Multiple Systems No-Code Integration of Salesforce and ERPsWebinar One View, Multiple Systems No-Code Integration of Salesforce and ERPs
Webinar One View, Multiple Systems No-Code Integration of Salesforce and ERPs
 
Q1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year ReboundQ1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year Rebound
 
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
 
社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .
 
2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...
2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...
2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...
 
tapal brand analysis PPT slide for comptetive data
tapal brand analysis PPT slide for comptetive datatapal brand analysis PPT slide for comptetive data
tapal brand analysis PPT slide for comptetive data
 
一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单
 
一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单
 
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project PresentationPredicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
 
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
 
Jpolillo Amazon PPC - Bid Optimization Sample
Jpolillo Amazon PPC - Bid Optimization SampleJpolillo Amazon PPC - Bid Optimization Sample
Jpolillo Amazon PPC - Bid Optimization Sample
 
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
 
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
 
Opendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptxOpendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptx
 

Dataweek Presentation from Chris Neumann

  • 1. BUILDING A SUCCESSFUL DATA STARTUP Chris Neumann | @ckneumann © 2014 Datahero, Inc.
  • 2. BUILDING DATA COMPANIES IS HARD © 2014 Datahero, Inc.
  • 3. STARTUPS BEGIN LIKE THIS © 2014 Datahero, Inc.
  • 4. DATA STARTUPS ARE ABOUT TIMING © 2014 Datahero, Inc.
  • 5. IT’S ABOUT TIME Successful data startups recognize fundamental shifts in technology or business processes and create solutions that address new pain points © 2014 Datahero, Inc.
  • 6. IT’S ABOUT TIMING • The Data Warehouse market resulted from a need for businesses to have a “single source of truth” • The Big Data market resulted from the dramatic increase in data volumes from machine-generated data • The Cloud BI market is emerging as a result of the decentralization of enterprise data as services move to the cloud © 2014 Datahero, Inc.
  • 7. BIG DATA © 2014 Datahero, Inc.
  • 8. BEFORE • Companies generally analyzed relatively small volumes of (transactional) data • Larger volumes of data could be stored in data warehouses, but calculations were relatively simple (aggregates, basic business metrics, etc. © 2014 Datahero, Inc.
  • 9. THE SHIFT • Machine-generated data, such as from web server logs, provided far more granular information about customers • Companies wanted to perform complex analysis on these larger volumes of data in order to better understand customer behavior – There was also the potential for entirely new businesses built around data © 2014 Datahero, Inc.
  • 10. THE CHALLENGE • The volume of structured data companies wanted to analyze was becoming larger than what could fit in a single server The rate of growth of data now exceeded the rate of increase of storage density • This shifted the performance bottleneck to the network © 2014 Datahero, Inc.
  • 11. THE SOLUTIONS • First Generation: Fix it with hardware! – Make servers bigger (Teradata) – Make the network faster (Netezza) • Second Generation: Fix it with software! (Aster Data, Greenplum, Vertica) – Be smarter about where we store the data – Be smarter about when we move the data – Be smarter about how we move the data © 2014 Datahero, Inc.
  • 12. THE SOLUTIONS Of the five original “Big Data” startups: 4 were acquired in a span of less than a year for more than $2.5B total The fifth was one of the acquirers © 2014 Datahero, Inc.
  • 13. CLOUD DATA © 2014 Datahero, Inc.
  • 14. BEFORE • The data business users wanted to analyze was generated by on-premises software • Centralized data stores (EDWs) were used to aggregate data from a small number of strategic sources • BI teams would create reports for business users to access © 2014 Datahero, Inc.
  • 15. THE SHIFT • Over the past five years, business software is being replaced with cloud services, the vast majority of which are departmental • Company data is no longer stored primarily in on-premises systems, but is increasingly found in the cloud • For the first time in more than 20 years, company data is becoming decentralized © 2014 Datahero, Inc.
  • 16. THE CHALLENGE • Companies now have a large number of remote data sources each used by a small number of users For the first time ever, business users have direct access to their data • Users now have access to the data they want to work with, but don’t have the tools to take advantage of it © 2014 Datahero, Inc.
  • 17. THE SOLUTIONS • First Generation: Pull the data back! – Custom integrations to pull cloud data down into on-premises data warehouses – Put the data warehouse in the cloud…and then pull the rest of the data in (GoodData, Birst, RJMetrics) • Second Generation: Leave the data where it is! (DataHero, SumAll) – Treat cloud business services as the systems of record – Take advantage of existing security and permission models – Eliminate the process bottleneck by empowering users to connect directly to the services they need © 2014 Datahero, Inc.
  • 18. CUSTOM CONNECTORS • High-speed connectors built in collaboration with partners for optimal performance • Robust, extensible framework supports rapid development of new connectors • Secure integrations leverage partner security models for consistent data visibility DATA DECODER • Advanced classification algorithms identify and normalize data types across services and files • Semantic types such as URLs, Email Addresses and Lists extend traditional data types to provide added metadata • Confidence intervals drive an intuitive feedback interface with users EXTENSIBLE CONNECTION FRAMEWORK DATA DECODER C O N N E C T O R C O N N E C T O R C O N N E C T O R THE SOLUTIONS
  • 19. THE SOLUTIONS INTUITIVE HTML5 INTERFACE • Intuitive drag-and-drop interface created through user-centric design process involving thousands of hours of user testing and hundreds of users “NO CODE” DATA COMBINATIONS • Intuitive interface enables business users to combine (join) data across services and spreadsheets without coding or SQL • Recommendation engine suggests common keys based on metadata derived by the Data Decoder
  • 20. IDENTIFY THE SHIFT, THEN FIND THE PAIN POINT © 2014 Datahero, Inc.
  • 21. BE THE PAINKILLER • Every major shift in technology and/or business process results in new opportunities and new pain points – Many of those pain points will have easy solutions – A few will require fundamentally new approaches © 2014 Datahero, Inc.
  • 22. SO WHERE ARE WE? © 2014 Datahero, Inc.
  • 26. WE’RE HIRING! DataHero.com/jobs © 2014 Datahero, Inc.
  • 27. chris@datahero.com @ckneumann © 2014 Datahero, Inc.