SlideShare a Scribd company logo
Do You Hadoop?
A Survey of Big Data Practitioners
February 12, 2014
Bradley Graham
Big Data Research from Sand Hill Group
• Mindset over Data Set: A Big Data Prescription for
Setting the Market Pace
– Presents powerful learnings of some of the most
successful implementers of enterprise Big Data
– Provides prescriptive executive-level guidance for
adopting and using Big Data
– Use as a planning guide or benchmarking tool
– Purchase at http://bit.ly/SH_BD_14_S

• Do You Hadoop? A Survey of Big Data Practitioners
–
–
–
–
–

2

Clarifies Big Data (Hadoop-based) initiative status
Identifies pain points and barriers to adoption
Illuminates usage changes over the next 12-18 months
Use as a benchmarking tool
Download at http://bit.ly/SH_H2014
Broad Cross-Sectional View of the User Base
Company Size

Industries

(Number of Employees)

Large
47.4%

Small
33.3%

Other
13.3%
Telecommunications
2.2%

Technology
32.6%
Medium
19.3%

Participant Roles
Consumer Services
11.9%
Education
5.9%
Financial
Services
8.1%
Government
1.5%
Healthcare
2.2%

Industrials
22.2%

Other
8.9%
Academic
3.7%

Technology/Analytics
professional
50.4%

Consultant
14.1%

Business
sponsor/user
23.0%

• Startups and large established companies are leading the charge
• Technology industry use is strongly correlated to startup companies
• Companies serving large and/or diverse customer groups are a natural fit for Big Data
(e.g., retail, media and entertainment, and financial services)
• Business sponsor and user participation renders a more complete picture of Big
Data’s value and impact
3
Still Early Days
Hadoop Initiatives Status
(All Companies)
Percent of Total Population

50%

44.4%

40%
30%
20%

16.3%
11.1%

10%

8.1%

9.6%

10.4%

Piloting first
solution

First solution
deployed

Supporting
multiple analytics

0%
Exploring and
educating

Conducting POC Developing first
solution

• Solid progress is being made
• Majority of the companies have identified a business problem to address
• Use of multiple analytics suggest compelling value has been realized

4
Beware of Small and Agile Competitors
Hadoop Initiative Status
(by Company Size)

Percent of Category

100%
80%
60%

35.0%

36.4%
63.6%

45.0%

27.3%
18.2%
18.2%
Conducting POC

14.3%

13.3%
36.4%

26.7%

23.1%

35.7%

7.7%

0%
Exploring and
educating

50.0%
69.2%

20.0%

40%
20%

60.0%

Developing first
solution
Small

Medium

Piloting first solution

First solution
deployed

Supporting multiple
analytics

Large

• The median phase by company size is:
– Small: Exploring and educating
– Medium: Conducting POC
– Large: Conducting POC

• Data-centric startups enabled small companies to surpass medium-size companies in
the advanced stages
5
Satisfaction is Driving Continued Investment
• Higher overall satisfaction among
business sponsors and users relates to:
Hadoop/Big Data Initiative Satisfaction
(By Role)
100%
16.2%

9.7%

48.5%

61.3%

80%
60%

35.3%

29.0%

0%
Technology/Analytics
professional
Less Than

6

• Technology professionals’ higher than
expected satisfaction is likely attributed
to:
– Success with the technology
– Producing results that satisfied the business
stakeholders

40%
20%

– Profound insights
– More effective actions

Meets

Business sponsor/user

Better Than

• Challenges associated with Big Data are
nevertheless impacting satisfaction
– 3x more were less than satisfied (35.6%) vs.
more than satisfied (11.1%)
Mastering the Basics and Moving on to Advanced Applications
Most Commonly Reported #1 Uses of Hadoop
(Current vs. Future)
#1 Current Uses
(as of October 2013)

#1 Future Uses
(in 12 – 18 months)

Change
From
Current

Data Preparation (25.2%)

Advanced Analytics (24.4%)

—

Business Intelligence (17.8%)

Data Preparation (17.8%)

-7.4%

Basic Analytics (17.0%)

Business Intelligence (14.1%)
Archive More Data (14.1%)

-3.7%
—

• Leading current uses are:
– Foundational
– Support or augment the existing
solution portfolio (e.g., DW/BI and
small data analytics)
– Support Big Data experimentation

Top Uses of Hadoop
(Current vs. Future)
Top Current Uses
(as of October 2013)
Basic Analytics (58.5%)

Advanced Analytics (61.5%)

—

Business Intelligence (48.1%)

Business Intelligence (45.9%)

-2.2%

Data Preparation (45.9%)

7

Top Future Uses
(in 12 – 18 months)

Change
From
Current

Data Preparation (40.7%)

-5.2%
—

• Leading uses in 12-18 months
emphasize:
– New data types (streaming, geographic,
syndicated, etc. data)
– Advanced analytics (e.g., risk,
propensity and optimizations)
The Data Does Indeed Tell the Story
Data Types in the Hadoop Environment
Percent of Total Population

80%
60.7%
60%

52.6%
45.9%

40%

28.9%
22.2%

19.3%

16.3%

20%

14.1%

0%
Operational

Log

Online

Geographic

Partner

3rd party

Files
(Documents
and Media)

• Declining storage costs encourage a store everything approach
• Most prevalent data types parallel the focus of current usage
• Less frequently hosted data types hint at future Big Data applications

8

Streaming
Big Data and Hadoop are Complicated

Challenges Associated with Hadoop/Big Data
Most Commonly Reported
#1 Hadoop-related Challenges

Top Hadoop-related Challenges

Knowledge and experience (46.7%)

Knowledge and experience (65.2%)

Skills availability (20.7%)

Skills availability (52.6%)

Development effort (6.7%)

Development effort (40.7%)

• Resources remain the dominant issue for the foreseeable future
– Internal skills and experience gap
– Limited ability to repurpose existing resources
– Competition for “journey talent” (i.e., those successfully navigating the process at least once)

• Other frustrations are the technology challenges and level of effort related to:
– Implementing, maintaining and provisioning the environment
– Designing, building and maintaining solutions

• Performance, interoperability and other current second tier issues may prove to be
larger than expected issues down the road if left unaddressed
9
Success in Numbers
• Navigating the Hadoop/Big Data complexities requires a trusted
partner ecosystem
– It's far too complex at this point to go it alone

• Augment and, through a collaborative working model, edify internal
resources
• Gain access to value-added products and services that simplify:
– Infrastructure implementation and provisioning
– Solution development and use
– Data access and management

Effective partnering can address critical and second tier issues
while reducing time to value

10
Thank you!
Bradley Graham
Executive Director, Carpe Datum Rx
bgraham@Amberoon.com
CarpeDatumRx.com

11

More Related Content

What's hot

Introduction
IntroductionIntroduction
Introduction
Lee Schlenker
 
Healthcare Reform & Physician Loyalty: What Can CRM Do To Support ACOs?
Healthcare Reform & Physician Loyalty: What Can CRM Do To Support ACOs?Healthcare Reform & Physician Loyalty: What Can CRM Do To Support ACOs?
Healthcare Reform & Physician Loyalty: What Can CRM Do To Support ACOs?
Perficient, Inc.
 
Data governance - An Insight
Data governance - An InsightData governance - An Insight
Data governance - An Insight
Vivek Mohan
 
Microsoft HDInsight as a Big Data and Interoperability Platform to Drive Poin...
Microsoft HDInsight as a Big Data and Interoperability Platform to Drive Poin...Microsoft HDInsight as a Big Data and Interoperability Platform to Drive Poin...
Microsoft HDInsight as a Big Data and Interoperability Platform to Drive Poin...
DataWorks Summit
 
Data Quality Doesn’t Just Happen: And Here’s What Some of the Industry’s Most...
Data Quality Doesn’t Just Happen: And Here’s What Some of the Industry’s Most...Data Quality Doesn’t Just Happen: And Here’s What Some of the Industry’s Most...
Data Quality Doesn’t Just Happen: And Here’s What Some of the Industry’s Most...
InsightInnovation
 
99+ Siebel CTMS Best Practices You Should Follow
99+ Siebel CTMS Best Practices You Should Follow99+ Siebel CTMS Best Practices You Should Follow
99+ Siebel CTMS Best Practices You Should Follow
Perficient, Inc.
 
KSA Business Intelligence Qualifications
KSA Business Intelligence QualificationsKSA Business Intelligence Qualifications
KSA Business Intelligence Qualifications
JDOLIV
 
Leveraging Technology to Empower Patients and Reduce Healthcare Costs
Leveraging Technology to Empower Patients and Reduce Healthcare CostsLeveraging Technology to Empower Patients and Reduce Healthcare Costs
Leveraging Technology to Empower Patients and Reduce Healthcare Costs
Perficient, Inc.
 
Reinventing Marketing from the Bottom Up
Reinventing Marketing from the Bottom UpReinventing Marketing from the Bottom Up
Reinventing Marketing from the Bottom Up
Microsoft
 
For Life Sciences, the Future of Master Data Management is in the Cloud
For Life Sciences, the Future of Master Data Management is in the CloudFor Life Sciences, the Future of Master Data Management is in the Cloud
For Life Sciences, the Future of Master Data Management is in the Cloud
Cognizant
 
AIIA_DataAnalytics_Project_External_20160721
AIIA_DataAnalytics_Project_External_20160721AIIA_DataAnalytics_Project_External_20160721
AIIA_DataAnalytics_Project_External_20160721
Graeme Wood
 
Operational Risk Management Data Validation Architecture
Operational Risk Management Data Validation ArchitectureOperational Risk Management Data Validation Architecture
Operational Risk Management Data Validation Architecture
Alan McSweeney
 
From DQ to DG
From DQ to DGFrom DQ to DG
From DQ to DG
Jorge Garcia
 
Engage Patients, Reduce Manual Processes and Drive Key Insights with Interope...
Engage Patients, Reduce Manual Processes and Drive Key Insights with Interope...Engage Patients, Reduce Manual Processes and Drive Key Insights with Interope...
Engage Patients, Reduce Manual Processes and Drive Key Insights with Interope...
Perficient, Inc.
 
How to Convert Unknown Consumers into Patients Using Social Media
How to Convert Unknown Consumers into Patients Using Social MediaHow to Convert Unknown Consumers into Patients Using Social Media
How to Convert Unknown Consumers into Patients Using Social Media
Perficient, Inc.
 
How Northwestern Medicine is Leveraging Epic to Enable Value-Based Care
How Northwestern Medicine is Leveraging Epic to Enable Value-Based CareHow Northwestern Medicine is Leveraging Epic to Enable Value-Based Care
How Northwestern Medicine is Leveraging Epic to Enable Value-Based Care
Perficient, Inc.
 
Incorporating Digital Technology into Clinical Trials
Incorporating Digital Technology into Clinical TrialsIncorporating Digital Technology into Clinical Trials
Incorporating Digital Technology into Clinical Trials
Perficient, Inc.
 
IDG MarketPulse: Virtual Graphics Processing Unit (vGPU)
IDG MarketPulse: Virtual Graphics Processing Unit (vGPU)IDG MarketPulse: Virtual Graphics Processing Unit (vGPU)
IDG MarketPulse: Virtual Graphics Processing Unit (vGPU)
jmariani14
 
How to Load Data More Quickly and Accurately into Oracle's Life Sciences Data...
How to Load Data More Quickly and Accurately into Oracle's Life Sciences Data...How to Load Data More Quickly and Accurately into Oracle's Life Sciences Data...
How to Load Data More Quickly and Accurately into Oracle's Life Sciences Data...
Perficient, Inc.
 
YZU - Big Data Science - Course Information
YZU - Big Data Science - Course InformationYZU - Big Data Science - Course Information
YZU - Big Data Science - Course Information
Ren-Hao (PAN) Pan
 

What's hot (20)

Introduction
IntroductionIntroduction
Introduction
 
Healthcare Reform & Physician Loyalty: What Can CRM Do To Support ACOs?
Healthcare Reform & Physician Loyalty: What Can CRM Do To Support ACOs?Healthcare Reform & Physician Loyalty: What Can CRM Do To Support ACOs?
Healthcare Reform & Physician Loyalty: What Can CRM Do To Support ACOs?
 
Data governance - An Insight
Data governance - An InsightData governance - An Insight
Data governance - An Insight
 
Microsoft HDInsight as a Big Data and Interoperability Platform to Drive Poin...
Microsoft HDInsight as a Big Data and Interoperability Platform to Drive Poin...Microsoft HDInsight as a Big Data and Interoperability Platform to Drive Poin...
Microsoft HDInsight as a Big Data and Interoperability Platform to Drive Poin...
 
Data Quality Doesn’t Just Happen: And Here’s What Some of the Industry’s Most...
Data Quality Doesn’t Just Happen: And Here’s What Some of the Industry’s Most...Data Quality Doesn’t Just Happen: And Here’s What Some of the Industry’s Most...
Data Quality Doesn’t Just Happen: And Here’s What Some of the Industry’s Most...
 
99+ Siebel CTMS Best Practices You Should Follow
99+ Siebel CTMS Best Practices You Should Follow99+ Siebel CTMS Best Practices You Should Follow
99+ Siebel CTMS Best Practices You Should Follow
 
KSA Business Intelligence Qualifications
KSA Business Intelligence QualificationsKSA Business Intelligence Qualifications
KSA Business Intelligence Qualifications
 
Leveraging Technology to Empower Patients and Reduce Healthcare Costs
Leveraging Technology to Empower Patients and Reduce Healthcare CostsLeveraging Technology to Empower Patients and Reduce Healthcare Costs
Leveraging Technology to Empower Patients and Reduce Healthcare Costs
 
Reinventing Marketing from the Bottom Up
Reinventing Marketing from the Bottom UpReinventing Marketing from the Bottom Up
Reinventing Marketing from the Bottom Up
 
For Life Sciences, the Future of Master Data Management is in the Cloud
For Life Sciences, the Future of Master Data Management is in the CloudFor Life Sciences, the Future of Master Data Management is in the Cloud
For Life Sciences, the Future of Master Data Management is in the Cloud
 
AIIA_DataAnalytics_Project_External_20160721
AIIA_DataAnalytics_Project_External_20160721AIIA_DataAnalytics_Project_External_20160721
AIIA_DataAnalytics_Project_External_20160721
 
Operational Risk Management Data Validation Architecture
Operational Risk Management Data Validation ArchitectureOperational Risk Management Data Validation Architecture
Operational Risk Management Data Validation Architecture
 
From DQ to DG
From DQ to DGFrom DQ to DG
From DQ to DG
 
Engage Patients, Reduce Manual Processes and Drive Key Insights with Interope...
Engage Patients, Reduce Manual Processes and Drive Key Insights with Interope...Engage Patients, Reduce Manual Processes and Drive Key Insights with Interope...
Engage Patients, Reduce Manual Processes and Drive Key Insights with Interope...
 
How to Convert Unknown Consumers into Patients Using Social Media
How to Convert Unknown Consumers into Patients Using Social MediaHow to Convert Unknown Consumers into Patients Using Social Media
How to Convert Unknown Consumers into Patients Using Social Media
 
How Northwestern Medicine is Leveraging Epic to Enable Value-Based Care
How Northwestern Medicine is Leveraging Epic to Enable Value-Based CareHow Northwestern Medicine is Leveraging Epic to Enable Value-Based Care
How Northwestern Medicine is Leveraging Epic to Enable Value-Based Care
 
Incorporating Digital Technology into Clinical Trials
Incorporating Digital Technology into Clinical TrialsIncorporating Digital Technology into Clinical Trials
Incorporating Digital Technology into Clinical Trials
 
IDG MarketPulse: Virtual Graphics Processing Unit (vGPU)
IDG MarketPulse: Virtual Graphics Processing Unit (vGPU)IDG MarketPulse: Virtual Graphics Processing Unit (vGPU)
IDG MarketPulse: Virtual Graphics Processing Unit (vGPU)
 
How to Load Data More Quickly and Accurately into Oracle's Life Sciences Data...
How to Load Data More Quickly and Accurately into Oracle's Life Sciences Data...How to Load Data More Quickly and Accurately into Oracle's Life Sciences Data...
How to Load Data More Quickly and Accurately into Oracle's Life Sciences Data...
 
YZU - Big Data Science - Course Information
YZU - Big Data Science - Course InformationYZU - Big Data Science - Course Information
YZU - Big Data Science - Course Information
 

Similar to Sand Hill Hadoop-Big Data Study - 140212

Hadoop Does Not Equal Big Data
Hadoop Does Not Equal Big Data Hadoop Does Not Equal Big Data
Hadoop Does Not Equal Big Data
Enterprise Management Associates
 
A Snapshot into the Evolving Role of Audience Insight
A Snapshot into the Evolving Role of Audience InsightA Snapshot into the Evolving Role of Audience Insight
A Snapshot into the Evolving Role of Audience Insight
Filipp Paster
 
The Outlook for Data 2017: A Snapshot Into the Evolving Role of Audience Insight
The Outlook for Data 2017: A Snapshot Into the Evolving Role of Audience InsightThe Outlook for Data 2017: A Snapshot Into the Evolving Role of Audience Insight
The Outlook for Data 2017: A Snapshot Into the Evolving Role of Audience Insight
Filipp Paster
 
InvestmentNews 2015 Adviser Technology Study: Key Findings
InvestmentNews 2015 Adviser Technology Study: Key FindingsInvestmentNews 2015 Adviser Technology Study: Key Findings
InvestmentNews 2015 Adviser Technology Study: Key Findings
INResearch
 
Data Trends for 2019: Extracting Value from Data
Data Trends for 2019: Extracting Value from DataData Trends for 2019: Extracting Value from Data
Data Trends for 2019: Extracting Value from Data
Precisely
 
CSCMP 2014: Big Data Use in Retail Supply Chains
CSCMP 2014: Big Data Use in Retail Supply ChainsCSCMP 2014: Big Data Use in Retail Supply Chains
CSCMP 2014: Big Data Use in Retail Supply Chains
AnnibalSodero
 
Big Data Maturity Scorecard
Big Data Maturity ScorecardBig Data Maturity Scorecard
Big Data Maturity Scorecard
DataWorks Summit
 
Data & Analytics Sample Slides_NEW.pdf
Data & Analytics Sample Slides_NEW.pdfData & Analytics Sample Slides_NEW.pdf
Data & Analytics Sample Slides_NEW.pdf
IDG
 
Are You Prepared For The Future Of Data Technologies?
Are You Prepared For The Future Of Data Technologies?Are You Prepared For The Future Of Data Technologies?
Are You Prepared For The Future Of Data Technologies?
Dell World
 
02 a holistic approach to big data
02 a holistic approach to big data02 a holistic approach to big data
02 a holistic approach to big data
Raul Chong
 
2018 Big Data Trends: Liberate, Integrate, and Trust Your Data
2018 Big Data Trends: Liberate, Integrate, and Trust Your Data2018 Big Data Trends: Liberate, Integrate, and Trust Your Data
2018 Big Data Trends: Liberate, Integrate, and Trust Your Data
Precisely
 
Intel Big Data Analysis Peer Research Slideshare 2013
Intel Big Data Analysis Peer Research Slideshare 2013Intel Big Data Analysis Peer Research Slideshare 2013
Intel Big Data Analysis Peer Research Slideshare 2013
Intel IT Center
 
CompTIA Colloquium 2014: Big Data: Are You Ready for this Growing Market?
CompTIA Colloquium 2014: Big Data: Are You Ready for this Growing Market?CompTIA Colloquium 2014: Big Data: Are You Ready for this Growing Market?
CompTIA Colloquium 2014: Big Data: Are You Ready for this Growing Market?
CompTIA
 
Is Your Staff Big Data Ready? 5 Things to Know About What It Will Take to Suc...
Is Your Staff Big Data Ready? 5 Things to Know About What It Will Take to Suc...Is Your Staff Big Data Ready? 5 Things to Know About What It Will Take to Suc...
Is Your Staff Big Data Ready? 5 Things to Know About What It Will Take to Suc...
CompTIA
 
Tdwi solution spotlight presentation slides
Tdwi solution spotlight   presentation slidesTdwi solution spotlight   presentation slides
Tdwi solution spotlight presentation slides
William Lam
 
Exploring How to Use Hadoop in your Healthcare Big Data Strategy
Exploring How to Use Hadoop in your Healthcare Big Data StrategyExploring How to Use Hadoop in your Healthcare Big Data Strategy
Exploring How to Use Hadoop in your Healthcare Big Data Strategy
Health Catalyst
 
Webinar: Attaining Excellence in Big Data Integration
Webinar: Attaining Excellence in Big Data IntegrationWebinar: Attaining Excellence in Big Data Integration
Webinar: Attaining Excellence in Big Data Integration
SnapLogic
 
Accenture big-data
Accenture big-dataAccenture big-data
Accenture big-data
Planimedia
 
Capturing big value in big data
Capturing big value in big data Capturing big value in big data
Capturing big value in big data
BSP Media Group
 
Are you getting the most out of your data?
Are you getting the most out of your data?Are you getting the most out of your data?
Are you getting the most out of your data?
SAS Canada
 

Similar to Sand Hill Hadoop-Big Data Study - 140212 (20)

Hadoop Does Not Equal Big Data
Hadoop Does Not Equal Big Data Hadoop Does Not Equal Big Data
Hadoop Does Not Equal Big Data
 
A Snapshot into the Evolving Role of Audience Insight
A Snapshot into the Evolving Role of Audience InsightA Snapshot into the Evolving Role of Audience Insight
A Snapshot into the Evolving Role of Audience Insight
 
The Outlook for Data 2017: A Snapshot Into the Evolving Role of Audience Insight
The Outlook for Data 2017: A Snapshot Into the Evolving Role of Audience InsightThe Outlook for Data 2017: A Snapshot Into the Evolving Role of Audience Insight
The Outlook for Data 2017: A Snapshot Into the Evolving Role of Audience Insight
 
InvestmentNews 2015 Adviser Technology Study: Key Findings
InvestmentNews 2015 Adviser Technology Study: Key FindingsInvestmentNews 2015 Adviser Technology Study: Key Findings
InvestmentNews 2015 Adviser Technology Study: Key Findings
 
Data Trends for 2019: Extracting Value from Data
Data Trends for 2019: Extracting Value from DataData Trends for 2019: Extracting Value from Data
Data Trends for 2019: Extracting Value from Data
 
CSCMP 2014: Big Data Use in Retail Supply Chains
CSCMP 2014: Big Data Use in Retail Supply ChainsCSCMP 2014: Big Data Use in Retail Supply Chains
CSCMP 2014: Big Data Use in Retail Supply Chains
 
Big Data Maturity Scorecard
Big Data Maturity ScorecardBig Data Maturity Scorecard
Big Data Maturity Scorecard
 
Data & Analytics Sample Slides_NEW.pdf
Data & Analytics Sample Slides_NEW.pdfData & Analytics Sample Slides_NEW.pdf
Data & Analytics Sample Slides_NEW.pdf
 
Are You Prepared For The Future Of Data Technologies?
Are You Prepared For The Future Of Data Technologies?Are You Prepared For The Future Of Data Technologies?
Are You Prepared For The Future Of Data Technologies?
 
02 a holistic approach to big data
02 a holistic approach to big data02 a holistic approach to big data
02 a holistic approach to big data
 
2018 Big Data Trends: Liberate, Integrate, and Trust Your Data
2018 Big Data Trends: Liberate, Integrate, and Trust Your Data2018 Big Data Trends: Liberate, Integrate, and Trust Your Data
2018 Big Data Trends: Liberate, Integrate, and Trust Your Data
 
Intel Big Data Analysis Peer Research Slideshare 2013
Intel Big Data Analysis Peer Research Slideshare 2013Intel Big Data Analysis Peer Research Slideshare 2013
Intel Big Data Analysis Peer Research Slideshare 2013
 
CompTIA Colloquium 2014: Big Data: Are You Ready for this Growing Market?
CompTIA Colloquium 2014: Big Data: Are You Ready for this Growing Market?CompTIA Colloquium 2014: Big Data: Are You Ready for this Growing Market?
CompTIA Colloquium 2014: Big Data: Are You Ready for this Growing Market?
 
Is Your Staff Big Data Ready? 5 Things to Know About What It Will Take to Suc...
Is Your Staff Big Data Ready? 5 Things to Know About What It Will Take to Suc...Is Your Staff Big Data Ready? 5 Things to Know About What It Will Take to Suc...
Is Your Staff Big Data Ready? 5 Things to Know About What It Will Take to Suc...
 
Tdwi solution spotlight presentation slides
Tdwi solution spotlight   presentation slidesTdwi solution spotlight   presentation slides
Tdwi solution spotlight presentation slides
 
Exploring How to Use Hadoop in your Healthcare Big Data Strategy
Exploring How to Use Hadoop in your Healthcare Big Data StrategyExploring How to Use Hadoop in your Healthcare Big Data Strategy
Exploring How to Use Hadoop in your Healthcare Big Data Strategy
 
Webinar: Attaining Excellence in Big Data Integration
Webinar: Attaining Excellence in Big Data IntegrationWebinar: Attaining Excellence in Big Data Integration
Webinar: Attaining Excellence in Big Data Integration
 
Accenture big-data
Accenture big-dataAccenture big-data
Accenture big-data
 
Capturing big value in big data
Capturing big value in big data Capturing big value in big data
Capturing big value in big data
 
Are you getting the most out of your data?
Are you getting the most out of your data?Are you getting the most out of your data?
Are you getting the most out of your data?
 

Recently uploaded

HCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAUHCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAU
panagenda
 
Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)
Jakub Marek
 
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development ProvidersYour One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
akankshawande
 
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
saastr
 
Fueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte WebinarFueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte Webinar
Zilliz
 
Generating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and MilvusGenerating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and Milvus
Zilliz
 
Skybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoptionSkybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoption
Tatiana Kojar
 
leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...
leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...
leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...
alexjohnson7307
 
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with SlackLet's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
shyamraj55
 
Dandelion Hashtable: beyond billion requests per second on a commodity server
Dandelion Hashtable: beyond billion requests per second on a commodity serverDandelion Hashtable: beyond billion requests per second on a commodity server
Dandelion Hashtable: beyond billion requests per second on a commodity server
Antonios Katsarakis
 
Trusted Execution Environment for Decentralized Process Mining
Trusted Execution Environment for Decentralized Process MiningTrusted Execution Environment for Decentralized Process Mining
Trusted Execution Environment for Decentralized Process Mining
LucaBarbaro3
 
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAUHCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
panagenda
 
System Design Case Study: Building a Scalable E-Commerce Platform - Hiike
System Design Case Study: Building a Scalable E-Commerce Platform - HiikeSystem Design Case Study: Building a Scalable E-Commerce Platform - Hiike
System Design Case Study: Building a Scalable E-Commerce Platform - Hiike
Hiike
 
SAP S/4 HANA sourcing and procurement to Public cloud
SAP S/4 HANA sourcing and procurement to Public cloudSAP S/4 HANA sourcing and procurement to Public cloud
SAP S/4 HANA sourcing and procurement to Public cloud
maazsz111
 
Building Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and MilvusBuilding Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and Milvus
Zilliz
 
JavaLand 2024: Application Development Green Masterplan
JavaLand 2024: Application Development Green MasterplanJavaLand 2024: Application Development Green Masterplan
JavaLand 2024: Application Development Green Masterplan
Miro Wengner
 
Public CyberSecurity Awareness Presentation 2024.pptx
Public CyberSecurity Awareness Presentation 2024.pptxPublic CyberSecurity Awareness Presentation 2024.pptx
Public CyberSecurity Awareness Presentation 2024.pptx
marufrahmanstratejm
 
Nordic Marketo Engage User Group_June 13_ 2024.pptx
Nordic Marketo Engage User Group_June 13_ 2024.pptxNordic Marketo Engage User Group_June 13_ 2024.pptx
Nordic Marketo Engage User Group_June 13_ 2024.pptx
MichaelKnudsen27
 
GNSS spoofing via SDR (Criptored Talks 2024)
GNSS spoofing via SDR (Criptored Talks 2024)GNSS spoofing via SDR (Criptored Talks 2024)
GNSS spoofing via SDR (Criptored Talks 2024)
Javier Junquera
 
Programming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup SlidesProgramming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup Slides
Zilliz
 

Recently uploaded (20)

HCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAUHCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAU
 
Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)
 
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development ProvidersYour One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
 
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
 
Fueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte WebinarFueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte Webinar
 
Generating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and MilvusGenerating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and Milvus
 
Skybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoptionSkybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoption
 
leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...
leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...
leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...
 
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with SlackLet's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
 
Dandelion Hashtable: beyond billion requests per second on a commodity server
Dandelion Hashtable: beyond billion requests per second on a commodity serverDandelion Hashtable: beyond billion requests per second on a commodity server
Dandelion Hashtable: beyond billion requests per second on a commodity server
 
Trusted Execution Environment for Decentralized Process Mining
Trusted Execution Environment for Decentralized Process MiningTrusted Execution Environment for Decentralized Process Mining
Trusted Execution Environment for Decentralized Process Mining
 
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAUHCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
 
System Design Case Study: Building a Scalable E-Commerce Platform - Hiike
System Design Case Study: Building a Scalable E-Commerce Platform - HiikeSystem Design Case Study: Building a Scalable E-Commerce Platform - Hiike
System Design Case Study: Building a Scalable E-Commerce Platform - Hiike
 
SAP S/4 HANA sourcing and procurement to Public cloud
SAP S/4 HANA sourcing and procurement to Public cloudSAP S/4 HANA sourcing and procurement to Public cloud
SAP S/4 HANA sourcing and procurement to Public cloud
 
Building Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and MilvusBuilding Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and Milvus
 
JavaLand 2024: Application Development Green Masterplan
JavaLand 2024: Application Development Green MasterplanJavaLand 2024: Application Development Green Masterplan
JavaLand 2024: Application Development Green Masterplan
 
Public CyberSecurity Awareness Presentation 2024.pptx
Public CyberSecurity Awareness Presentation 2024.pptxPublic CyberSecurity Awareness Presentation 2024.pptx
Public CyberSecurity Awareness Presentation 2024.pptx
 
Nordic Marketo Engage User Group_June 13_ 2024.pptx
Nordic Marketo Engage User Group_June 13_ 2024.pptxNordic Marketo Engage User Group_June 13_ 2024.pptx
Nordic Marketo Engage User Group_June 13_ 2024.pptx
 
GNSS spoofing via SDR (Criptored Talks 2024)
GNSS spoofing via SDR (Criptored Talks 2024)GNSS spoofing via SDR (Criptored Talks 2024)
GNSS spoofing via SDR (Criptored Talks 2024)
 
Programming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup SlidesProgramming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup Slides
 

Sand Hill Hadoop-Big Data Study - 140212

  • 1. Do You Hadoop? A Survey of Big Data Practitioners February 12, 2014 Bradley Graham
  • 2. Big Data Research from Sand Hill Group • Mindset over Data Set: A Big Data Prescription for Setting the Market Pace – Presents powerful learnings of some of the most successful implementers of enterprise Big Data – Provides prescriptive executive-level guidance for adopting and using Big Data – Use as a planning guide or benchmarking tool – Purchase at http://bit.ly/SH_BD_14_S • Do You Hadoop? A Survey of Big Data Practitioners – – – – – 2 Clarifies Big Data (Hadoop-based) initiative status Identifies pain points and barriers to adoption Illuminates usage changes over the next 12-18 months Use as a benchmarking tool Download at http://bit.ly/SH_H2014
  • 3. Broad Cross-Sectional View of the User Base Company Size Industries (Number of Employees) Large 47.4% Small 33.3% Other 13.3% Telecommunications 2.2% Technology 32.6% Medium 19.3% Participant Roles Consumer Services 11.9% Education 5.9% Financial Services 8.1% Government 1.5% Healthcare 2.2% Industrials 22.2% Other 8.9% Academic 3.7% Technology/Analytics professional 50.4% Consultant 14.1% Business sponsor/user 23.0% • Startups and large established companies are leading the charge • Technology industry use is strongly correlated to startup companies • Companies serving large and/or diverse customer groups are a natural fit for Big Data (e.g., retail, media and entertainment, and financial services) • Business sponsor and user participation renders a more complete picture of Big Data’s value and impact 3
  • 4. Still Early Days Hadoop Initiatives Status (All Companies) Percent of Total Population 50% 44.4% 40% 30% 20% 16.3% 11.1% 10% 8.1% 9.6% 10.4% Piloting first solution First solution deployed Supporting multiple analytics 0% Exploring and educating Conducting POC Developing first solution • Solid progress is being made • Majority of the companies have identified a business problem to address • Use of multiple analytics suggest compelling value has been realized 4
  • 5. Beware of Small and Agile Competitors Hadoop Initiative Status (by Company Size) Percent of Category 100% 80% 60% 35.0% 36.4% 63.6% 45.0% 27.3% 18.2% 18.2% Conducting POC 14.3% 13.3% 36.4% 26.7% 23.1% 35.7% 7.7% 0% Exploring and educating 50.0% 69.2% 20.0% 40% 20% 60.0% Developing first solution Small Medium Piloting first solution First solution deployed Supporting multiple analytics Large • The median phase by company size is: – Small: Exploring and educating – Medium: Conducting POC – Large: Conducting POC • Data-centric startups enabled small companies to surpass medium-size companies in the advanced stages 5
  • 6. Satisfaction is Driving Continued Investment • Higher overall satisfaction among business sponsors and users relates to: Hadoop/Big Data Initiative Satisfaction (By Role) 100% 16.2% 9.7% 48.5% 61.3% 80% 60% 35.3% 29.0% 0% Technology/Analytics professional Less Than 6 • Technology professionals’ higher than expected satisfaction is likely attributed to: – Success with the technology – Producing results that satisfied the business stakeholders 40% 20% – Profound insights – More effective actions Meets Business sponsor/user Better Than • Challenges associated with Big Data are nevertheless impacting satisfaction – 3x more were less than satisfied (35.6%) vs. more than satisfied (11.1%)
  • 7. Mastering the Basics and Moving on to Advanced Applications Most Commonly Reported #1 Uses of Hadoop (Current vs. Future) #1 Current Uses (as of October 2013) #1 Future Uses (in 12 – 18 months) Change From Current Data Preparation (25.2%) Advanced Analytics (24.4%) — Business Intelligence (17.8%) Data Preparation (17.8%) -7.4% Basic Analytics (17.0%) Business Intelligence (14.1%) Archive More Data (14.1%) -3.7% — • Leading current uses are: – Foundational – Support or augment the existing solution portfolio (e.g., DW/BI and small data analytics) – Support Big Data experimentation Top Uses of Hadoop (Current vs. Future) Top Current Uses (as of October 2013) Basic Analytics (58.5%) Advanced Analytics (61.5%) — Business Intelligence (48.1%) Business Intelligence (45.9%) -2.2% Data Preparation (45.9%) 7 Top Future Uses (in 12 – 18 months) Change From Current Data Preparation (40.7%) -5.2% — • Leading uses in 12-18 months emphasize: – New data types (streaming, geographic, syndicated, etc. data) – Advanced analytics (e.g., risk, propensity and optimizations)
  • 8. The Data Does Indeed Tell the Story Data Types in the Hadoop Environment Percent of Total Population 80% 60.7% 60% 52.6% 45.9% 40% 28.9% 22.2% 19.3% 16.3% 20% 14.1% 0% Operational Log Online Geographic Partner 3rd party Files (Documents and Media) • Declining storage costs encourage a store everything approach • Most prevalent data types parallel the focus of current usage • Less frequently hosted data types hint at future Big Data applications 8 Streaming
  • 9. Big Data and Hadoop are Complicated Challenges Associated with Hadoop/Big Data Most Commonly Reported #1 Hadoop-related Challenges Top Hadoop-related Challenges Knowledge and experience (46.7%) Knowledge and experience (65.2%) Skills availability (20.7%) Skills availability (52.6%) Development effort (6.7%) Development effort (40.7%) • Resources remain the dominant issue for the foreseeable future – Internal skills and experience gap – Limited ability to repurpose existing resources – Competition for “journey talent” (i.e., those successfully navigating the process at least once) • Other frustrations are the technology challenges and level of effort related to: – Implementing, maintaining and provisioning the environment – Designing, building and maintaining solutions • Performance, interoperability and other current second tier issues may prove to be larger than expected issues down the road if left unaddressed 9
  • 10. Success in Numbers • Navigating the Hadoop/Big Data complexities requires a trusted partner ecosystem – It's far too complex at this point to go it alone • Augment and, through a collaborative working model, edify internal resources • Gain access to value-added products and services that simplify: – Infrastructure implementation and provisioning – Solution development and use – Data access and management Effective partnering can address critical and second tier issues while reducing time to value 10
  • 11. Thank you! Bradley Graham Executive Director, Carpe Datum Rx bgraham@Amberoon.com CarpeDatumRx.com 11