SlideShare a Scribd company logo
Reinventing the Information Pipeline
From Big Data Strategy to Big Value December 2016
Agenda
• Introduction
• Challenges in the Information Pipeline
• Paxata in the Converged Data Platform
Paxata’s mission (since 2012)
Deliver the only enterprise-grade data preparation platform
for everyone to transform raw, meaningless data into
valuable, contextual and complete information
4
Source: Gartner News Room: http://www.gartner.com/newsroom/id/2975018
83%Companies agree that data is
their most strategic asset
5
Source: Gartner News Room: http://www.gartner.com/newsroom/id/2975018
80%Time analysts will spend trying to
create data sets to draw insights
83%Companies agree that data is
their most strategic asset
6
Source: Gartner News Room: http://www.gartner.com/newsroom/id/2975018
12%Amount of data most companies
estimate they are analyzing
80%Time analysts will spend trying to
create data sets to draw insights
83%Companies agree that data is
their most strategic asset
7
The data chasm
Source: Gartner News Room: http://www.gartner.com/newsroom/id/2975018
12%Amount of data most companies
estimate they are analyzing
80%Time analysts will spend trying to
create data sets to draw insights
83%Companies agree that data is
their most strategic asset
Challenges in the Information Pipeline
Traditional data preparation
creates a bottleneck
Traditional data preparation creates a bottleneck
Business teams have complex data sources for analytics projects
Traditional data preparation creates a bottleneck
Business teams funnel their requirements to IT
IT-centric data preparation
Business
Information
Traditional data preparation creates a bottleneck
IT runs requirements through a linear ETL process
executed with manual scripting or coding
IT-Centric Data Preparation
Model Extract Transform Load Optimize
Business
Information
Traditional data preparation creates a bottleneck
IT reviews with business. Makes changes, fixes errors.
(Repeat)
IT-Centric Data Preparation
Model Extract Transform Load Optimize
Business
Information
Business teams make decisions before data is available
-or-
Ask for changes and restart the process.
IT-Centric Data Preparation
Model Extract Transform Load Optimize
Business
Information
Traditional data preparation creates a bottleneck
Designed for highly specialized technical people to prepare data for
business teams
IT-Centric Data Preparation
Model Extract Transform Load Optimize
Business
Information
Traditional data preparation creates a bottleneck
Designing for highly specialized technical
people to prepare data for business teams.
Expensive
Complicated
Error-prone
Time-consuming
Modern architecture balances
freedom with responsibility
Modern architecture: balancing freedom with responsibility
Built for business
•Freedom and
flexibility with
collaboration
Modern architecture: balancing freedom with responsibility
Collect and manage data
Time
Built for business
•Freedom and
flexibility with
collaboration
Enabled by IT
•Data governance,
scale, efficiency
Modern information pipeline is
Built for business
Freedom and flexibility with collaboration
Enabled by IT
Data governance, scale, efficiency
Data prep must address the
range of information workers
Data prep must address the range of information workers
Source: Forrester Research, Inc., “Info Workers Will Erase The Boundary Between
Enterprise and Consumer Technologies,” August 30, 2012
Deep Technical Skills Limited Technical Skills
Data Scientist
Data Developer
Data Analyst
Business Analyst
Information
Worker
Data prep must address the range of information workers
Source: Forrester Research, Inc., “Info Workers Will Erase The Boundary Between
Enterprise and Consumer Technologies,” August 30, 2012
Deep Technical Skills Limited Technical Skills
Data Scientist
(200K)
Data Developer
(600K)
Data Analyst
(100M)
Business Analyst
(275M)
Information
Worker
(460M)
Paxata accelerates the
data to information pipeline
Data Lake
Enterprise
Local
Paxata accelerates the data to information pipeline
Data Lake
Enterprise
Local
Paxata accelerates the data to information pipeline
Data Lake
Enterprise
Local
Paxata accelerates the data to information pipeline
BI/Visualization
Predictive
Data Lake
Enterprise
Local
Paxata accelerates the data to information pipeline
BI/Visualization
Predictive
Consumer experience for preparing data
Architecture of the Paxata Adaptive Information Platform
Architecture of the Paxata Adaptive Information Platform
Contact us
Paxata in the apps gallery
Register for Paxata Live:
www.paxata.com/events
info@paxata.com
www.youtube.com/PaxataTV
www.paxata.com
December 8, 2016© Paxata, Inc. 32
Thank You!

More Related Content

What's hot

The Rise of the CDO in Today's Enterprise
The Rise of the CDO in Today's EnterpriseThe Rise of the CDO in Today's Enterprise
The Rise of the CDO in Today's Enterprise
Caserta
 
A modern, flexible approach to Hadoop implementation incorporating innovation...
A modern, flexible approach to Hadoop implementation incorporating innovation...A modern, flexible approach to Hadoop implementation incorporating innovation...
A modern, flexible approach to Hadoop implementation incorporating innovation...
DataWorks Summit
 
Data catalog
Data catalogData catalog
Data catalog
iamtodor
 
Building New Data Ecosystem for Customer Analytics, Strata + Hadoop World, 2016
Building New Data Ecosystem for Customer Analytics, Strata + Hadoop World, 2016Building New Data Ecosystem for Customer Analytics, Strata + Hadoop World, 2016
Building New Data Ecosystem for Customer Analytics, Strata + Hadoop World, 2016
Caserta
 
Integrating the CDO Role Into Your Organization; Managing the Disruption (MIT...
Integrating the CDO Role Into Your Organization; Managing the Disruption (MIT...Integrating the CDO Role Into Your Organization; Managing the Disruption (MIT...
Integrating the CDO Role Into Your Organization; Managing the Disruption (MIT...
Caserta
 
Focus on Your Analysis, Not Your SQL Code
Focus on Your Analysis, Not Your SQL CodeFocus on Your Analysis, Not Your SQL Code
Focus on Your Analysis, Not Your SQL Code
DATAVERSITY
 
Big and fast data strategy 2017 jr
Big and fast data strategy 2017 jrBig and fast data strategy 2017 jr
Big and fast data strategy 2017 jr
Jonathan Raspaud
 
How to Consume Your Data for AI
How to Consume Your Data for AIHow to Consume Your Data for AI
How to Consume Your Data for AI
DATAVERSITY
 
Building a New Platform for Customer Analytics
Building a New Platform for Customer Analytics Building a New Platform for Customer Analytics
Building a New Platform for Customer Analytics
Caserta
 
Data Catalog as the Platform for Data Intelligence
Data Catalog as the Platform for Data IntelligenceData Catalog as the Platform for Data Intelligence
Data Catalog as the Platform for Data Intelligence
Alation
 
Using Machine Learning to Understand and Predict Marketing ROI
Using Machine Learning to Understand and Predict Marketing ROIUsing Machine Learning to Understand and Predict Marketing ROI
Using Machine Learning to Understand and Predict Marketing ROI
DATAVERSITY
 
You're the New CDO, Now What?
You're the New CDO, Now What?You're the New CDO, Now What?
You're the New CDO, Now What?
Caserta
 
Getting down to business on Big Data analytics
Getting down to business on Big Data analyticsGetting down to business on Big Data analytics
Getting down to business on Big Data analyticsThe Marketing Distillery
 
Big Data Analytics on the Cloud
Big Data Analytics on the CloudBig Data Analytics on the Cloud
Big Data Analytics on the Cloud
Caserta
 
Data lineage to drive compliance and as a business imperative
Data lineage to drive compliance and as a business imperativeData lineage to drive compliance and as a business imperative
Data lineage to drive compliance and as a business imperative
Leigh Hill
 
DataOps: Nine steps to transform your data science impact Strata London May 18
DataOps: Nine steps to transform your data science impact  Strata London May 18DataOps: Nine steps to transform your data science impact  Strata London May 18
DataOps: Nine steps to transform your data science impact Strata London May 18
Harvinder Atwal
 
Slides: Case Study — How J.B. Hunt is Driving Efficiency with AI and Real-Tim...
Slides: Case Study — How J.B. Hunt is Driving Efficiency with AI and Real-Tim...Slides: Case Study — How J.B. Hunt is Driving Efficiency with AI and Real-Tim...
Slides: Case Study — How J.B. Hunt is Driving Efficiency with AI and Real-Tim...
DATAVERSITY
 
Chief Data & Analytics Officer Fall Boston - Presentation
Chief Data & Analytics Officer Fall Boston - PresentationChief Data & Analytics Officer Fall Boston - Presentation
Chief Data & Analytics Officer Fall Boston - Presentation
Srinivasan Sankar
 
NTXISSACSC3 - Why Enterprise Information Management is the Key to GRC by Mika...
NTXISSACSC3 - Why Enterprise Information Management is the Key to GRC by Mika...NTXISSACSC3 - Why Enterprise Information Management is the Key to GRC by Mika...
NTXISSACSC3 - Why Enterprise Information Management is the Key to GRC by Mika...
North Texas Chapter of the ISSA
 
DI&A Slides: Data Lake vs. Data Warehouse
DI&A Slides: Data Lake vs. Data WarehouseDI&A Slides: Data Lake vs. Data Warehouse
DI&A Slides: Data Lake vs. Data Warehouse
DATAVERSITY
 

What's hot (20)

The Rise of the CDO in Today's Enterprise
The Rise of the CDO in Today's EnterpriseThe Rise of the CDO in Today's Enterprise
The Rise of the CDO in Today's Enterprise
 
A modern, flexible approach to Hadoop implementation incorporating innovation...
A modern, flexible approach to Hadoop implementation incorporating innovation...A modern, flexible approach to Hadoop implementation incorporating innovation...
A modern, flexible approach to Hadoop implementation incorporating innovation...
 
Data catalog
Data catalogData catalog
Data catalog
 
Building New Data Ecosystem for Customer Analytics, Strata + Hadoop World, 2016
Building New Data Ecosystem for Customer Analytics, Strata + Hadoop World, 2016Building New Data Ecosystem for Customer Analytics, Strata + Hadoop World, 2016
Building New Data Ecosystem for Customer Analytics, Strata + Hadoop World, 2016
 
Integrating the CDO Role Into Your Organization; Managing the Disruption (MIT...
Integrating the CDO Role Into Your Organization; Managing the Disruption (MIT...Integrating the CDO Role Into Your Organization; Managing the Disruption (MIT...
Integrating the CDO Role Into Your Organization; Managing the Disruption (MIT...
 
Focus on Your Analysis, Not Your SQL Code
Focus on Your Analysis, Not Your SQL CodeFocus on Your Analysis, Not Your SQL Code
Focus on Your Analysis, Not Your SQL Code
 
Big and fast data strategy 2017 jr
Big and fast data strategy 2017 jrBig and fast data strategy 2017 jr
Big and fast data strategy 2017 jr
 
How to Consume Your Data for AI
How to Consume Your Data for AIHow to Consume Your Data for AI
How to Consume Your Data for AI
 
Building a New Platform for Customer Analytics
Building a New Platform for Customer Analytics Building a New Platform for Customer Analytics
Building a New Platform for Customer Analytics
 
Data Catalog as the Platform for Data Intelligence
Data Catalog as the Platform for Data IntelligenceData Catalog as the Platform for Data Intelligence
Data Catalog as the Platform for Data Intelligence
 
Using Machine Learning to Understand and Predict Marketing ROI
Using Machine Learning to Understand and Predict Marketing ROIUsing Machine Learning to Understand and Predict Marketing ROI
Using Machine Learning to Understand and Predict Marketing ROI
 
You're the New CDO, Now What?
You're the New CDO, Now What?You're the New CDO, Now What?
You're the New CDO, Now What?
 
Getting down to business on Big Data analytics
Getting down to business on Big Data analyticsGetting down to business on Big Data analytics
Getting down to business on Big Data analytics
 
Big Data Analytics on the Cloud
Big Data Analytics on the CloudBig Data Analytics on the Cloud
Big Data Analytics on the Cloud
 
Data lineage to drive compliance and as a business imperative
Data lineage to drive compliance and as a business imperativeData lineage to drive compliance and as a business imperative
Data lineage to drive compliance and as a business imperative
 
DataOps: Nine steps to transform your data science impact Strata London May 18
DataOps: Nine steps to transform your data science impact  Strata London May 18DataOps: Nine steps to transform your data science impact  Strata London May 18
DataOps: Nine steps to transform your data science impact Strata London May 18
 
Slides: Case Study — How J.B. Hunt is Driving Efficiency with AI and Real-Tim...
Slides: Case Study — How J.B. Hunt is Driving Efficiency with AI and Real-Tim...Slides: Case Study — How J.B. Hunt is Driving Efficiency with AI and Real-Tim...
Slides: Case Study — How J.B. Hunt is Driving Efficiency with AI and Real-Tim...
 
Chief Data & Analytics Officer Fall Boston - Presentation
Chief Data & Analytics Officer Fall Boston - PresentationChief Data & Analytics Officer Fall Boston - Presentation
Chief Data & Analytics Officer Fall Boston - Presentation
 
NTXISSACSC3 - Why Enterprise Information Management is the Key to GRC by Mika...
NTXISSACSC3 - Why Enterprise Information Management is the Key to GRC by Mika...NTXISSACSC3 - Why Enterprise Information Management is the Key to GRC by Mika...
NTXISSACSC3 - Why Enterprise Information Management is the Key to GRC by Mika...
 
DI&A Slides: Data Lake vs. Data Warehouse
DI&A Slides: Data Lake vs. Data WarehouseDI&A Slides: Data Lake vs. Data Warehouse
DI&A Slides: Data Lake vs. Data Warehouse
 

Viewers also liked

Managing uncertainty in data - Presentation at Data Science Northeast Netherl...
Managing uncertainty in data - Presentation at Data Science Northeast Netherl...Managing uncertainty in data - Presentation at Data Science Northeast Netherl...
Managing uncertainty in data - Presentation at Data Science Northeast Netherl...
University of Twente
 
Data Culture Series - Keynote - 27th Jan, London
Data Culture Series -  Keynote - 27th Jan, LondonData Culture Series -  Keynote - 27th Jan, London
Data Culture Series - Keynote - 27th Jan, London
Jonathan Woodward
 
Making Hadoop based analytics simple for everyone to use
Making Hadoop based analytics simple for everyone to useMaking Hadoop based analytics simple for everyone to use
Making Hadoop based analytics simple for everyone to use
Swiss Big Data User Group
 
Supply chain and Big data : top 5 Trends
Supply chain and Big data : top 5 TrendsSupply chain and Big data : top 5 Trends
Supply chain and Big data : top 5 Trends
Retigence Technologies
 
Continuous Performance Testing
Continuous Performance TestingContinuous Performance Testing
Continuous Performance Testing
Grid Dynamics
 
광역화 집단에너지사업제안서
광역화 집단에너지사업제안서광역화 집단에너지사업제안서
광역화 집단에너지사업제안서
Seokho Shin
 
Exploring Data Preparation and Visualization Tools for Urban Forestry
Exploring Data Preparation and Visualization Tools for Urban ForestryExploring Data Preparation and Visualization Tools for Urban Forestry
Exploring Data Preparation and Visualization Tools for Urban Forestry
Azavea
 
Data Preparation for Data Science
Data Preparation for Data ScienceData Preparation for Data Science
Data Preparation for Data Science
DataWorks Summit/Hadoop Summit
 
Trace 3 interview questions and answers
Trace 3 interview questions and answersTrace 3 interview questions and answers
Trace 3 interview questions and answersselinasimpson205
 
Essential Data Engineering for Data Scientist
Essential Data Engineering for Data Scientist Essential Data Engineering for Data Scientist
Essential Data Engineering for Data Scientist
SoftServe
 
Jagger release 2.0
Jagger release 2.0Jagger release 2.0
Jagger release 2.0
Grid Dynamics
 
Driving Retail Success with Machine Data Intelligence
Driving Retail Success with Machine Data IntelligenceDriving Retail Success with Machine Data Intelligence
Driving Retail Success with Machine Data Intelligence
Sumo Logic
 
How Can You Calculate the Cost of Your Data?
How Can You Calculate the Cost of Your Data?How Can You Calculate the Cost of Your Data?
How Can You Calculate the Cost of Your Data?
DATAVERSITY
 
Database Camp 2016 @ United Nations, NYC - Javier de la Torre, CEO, CARTO
Database Camp 2016 @ United Nations, NYC - Javier de la Torre, CEO, CARTODatabase Camp 2016 @ United Nations, NYC - Javier de la Torre, CEO, CARTO
Database Camp 2016 @ United Nations, NYC - Javier de la Torre, CEO, CARTO
✔ Eric David Benari, PMP
 
Потоковая обработка больших данных
Потоковая обработка больших данныхПотоковая обработка больших данных
Потоковая обработка больших данных
CEE-SEC(R)
 
Cohodatawebinar
Cohodatawebinar Cohodatawebinar
Cohodatawebinar
Murugesan Arumugam
 
Engine Yard Cloud Architecture Enhancements
Engine Yard Cloud Architecture EnhancementsEngine Yard Cloud Architecture Enhancements
Engine Yard Cloud Architecture Enhancements
Engine Yard
 
6 tips for improving ruby performance
6 tips for improving ruby performance6 tips for improving ruby performance
6 tips for improving ruby performance
Engine Yard
 

Viewers also liked (20)

Managing uncertainty in data - Presentation at Data Science Northeast Netherl...
Managing uncertainty in data - Presentation at Data Science Northeast Netherl...Managing uncertainty in data - Presentation at Data Science Northeast Netherl...
Managing uncertainty in data - Presentation at Data Science Northeast Netherl...
 
MonitoringFrameWork
MonitoringFrameWorkMonitoringFrameWork
MonitoringFrameWork
 
Data Culture Series - Keynote - 27th Jan, London
Data Culture Series -  Keynote - 27th Jan, LondonData Culture Series -  Keynote - 27th Jan, London
Data Culture Series - Keynote - 27th Jan, London
 
Making Hadoop based analytics simple for everyone to use
Making Hadoop based analytics simple for everyone to useMaking Hadoop based analytics simple for everyone to use
Making Hadoop based analytics simple for everyone to use
 
Supply chain and Big data : top 5 Trends
Supply chain and Big data : top 5 TrendsSupply chain and Big data : top 5 Trends
Supply chain and Big data : top 5 Trends
 
Continuous Performance Testing
Continuous Performance TestingContinuous Performance Testing
Continuous Performance Testing
 
광역화 집단에너지사업제안서
광역화 집단에너지사업제안서광역화 집단에너지사업제안서
광역화 집단에너지사업제안서
 
Exploring Data Preparation and Visualization Tools for Urban Forestry
Exploring Data Preparation and Visualization Tools for Urban ForestryExploring Data Preparation and Visualization Tools for Urban Forestry
Exploring Data Preparation and Visualization Tools for Urban Forestry
 
Data Preparation for Data Science
Data Preparation for Data ScienceData Preparation for Data Science
Data Preparation for Data Science
 
Trace 3 interview questions and answers
Trace 3 interview questions and answersTrace 3 interview questions and answers
Trace 3 interview questions and answers
 
Essential Data Engineering for Data Scientist
Essential Data Engineering for Data Scientist Essential Data Engineering for Data Scientist
Essential Data Engineering for Data Scientist
 
Jagger release 2.0
Jagger release 2.0Jagger release 2.0
Jagger release 2.0
 
Driving Retail Success with Machine Data Intelligence
Driving Retail Success with Machine Data IntelligenceDriving Retail Success with Machine Data Intelligence
Driving Retail Success with Machine Data Intelligence
 
How Can You Calculate the Cost of Your Data?
How Can You Calculate the Cost of Your Data?How Can You Calculate the Cost of Your Data?
How Can You Calculate the Cost of Your Data?
 
Database Camp 2016 @ United Nations, NYC - Javier de la Torre, CEO, CARTO
Database Camp 2016 @ United Nations, NYC - Javier de la Torre, CEO, CARTODatabase Camp 2016 @ United Nations, NYC - Javier de la Torre, CEO, CARTO
Database Camp 2016 @ United Nations, NYC - Javier de la Torre, CEO, CARTO
 
Потоковая обработка больших данных
Потоковая обработка больших данныхПотоковая обработка больших данных
Потоковая обработка больших данных
 
Geemus
GeemusGeemus
Geemus
 
Cohodatawebinar
Cohodatawebinar Cohodatawebinar
Cohodatawebinar
 
Engine Yard Cloud Architecture Enhancements
Engine Yard Cloud Architecture EnhancementsEngine Yard Cloud Architecture Enhancements
Engine Yard Cloud Architecture Enhancements
 
6 tips for improving ruby performance
6 tips for improving ruby performance6 tips for improving ruby performance
6 tips for improving ruby performance
 

Similar to Reinventing the Modern Information Pipeline: Paxata and MapR

Modern Data Management
Modern Data ManagementModern Data Management
Modern Data Management
SAP Technology
 
Data Lake Architecture – Modern Strategies & Approaches
Data Lake Architecture – Modern Strategies & ApproachesData Lake Architecture – Modern Strategies & Approaches
Data Lake Architecture – Modern Strategies & Approaches
DATAVERSITY
 
The Importance of Metadata
The Importance of MetadataThe Importance of Metadata
The Importance of Metadata
DATAVERSITY
 
Decision Ready Data: Power Your Analytics with Great Data
Decision Ready Data: Power Your Analytics with Great DataDecision Ready Data: Power Your Analytics with Great Data
Decision Ready Data: Power Your Analytics with Great Data
DLT Solutions
 
Data Strategy Best Practices
Data Strategy Best PracticesData Strategy Best Practices
Data Strategy Best Practices
DATAVERSITY
 
How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...
How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...
How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...
Denodo
 
BDW Chicago 2016 - Ramu Kalvakuntla, Sr. Principal - Technical - Big Data Pra...
BDW Chicago 2016 - Ramu Kalvakuntla, Sr. Principal - Technical - Big Data Pra...BDW Chicago 2016 - Ramu Kalvakuntla, Sr. Principal - Technical - Big Data Pra...
BDW Chicago 2016 - Ramu Kalvakuntla, Sr. Principal - Technical - Big Data Pra...
Big Data Week
 
Why Your Data Science Architecture Should Include a Data Virtualization Tool ...
Why Your Data Science Architecture Should Include a Data Virtualization Tool ...Why Your Data Science Architecture Should Include a Data Virtualization Tool ...
Why Your Data Science Architecture Should Include a Data Virtualization Tool ...
Denodo
 
BAR360 open data platform presentation at DAMA, Sydney
BAR360 open data platform presentation at DAMA, SydneyBAR360 open data platform presentation at DAMA, Sydney
BAR360 open data platform presentation at DAMA, Sydney
Sai Paravastu
 
Is Your Staff Big Data Ready? 5 Things to Know About What It Will Take to Suc...
Is Your Staff Big Data Ready? 5 Things to Know About What It Will Take to Suc...Is Your Staff Big Data Ready? 5 Things to Know About What It Will Take to Suc...
Is Your Staff Big Data Ready? 5 Things to Know About What It Will Take to Suc...CompTIA
 
Leveraging Streaming Data through Automation
Leveraging Streaming Data through AutomationLeveraging Streaming Data through Automation
Leveraging Streaming Data through Automation
Enterprise Management Associates
 
Hadoop 2015: what we larned -Think Big, A Teradata Company
Hadoop 2015: what we larned -Think Big, A Teradata CompanyHadoop 2015: what we larned -Think Big, A Teradata Company
Hadoop 2015: what we larned -Think Big, A Teradata Company
DataWorks Summit
 
¿En qué se parece el Gobierno del Dato a un parque de atracciones?
¿En qué se parece el Gobierno del Dato a un parque de atracciones?¿En qué se parece el Gobierno del Dato a un parque de atracciones?
¿En qué se parece el Gobierno del Dato a un parque de atracciones?
Denodo
 
Oea big-data-guide-1522052
Oea big-data-guide-1522052Oea big-data-guide-1522052
Oea big-data-guide-1522052kavi172
 
Oea big-data-guide-1522052
Oea big-data-guide-1522052Oea big-data-guide-1522052
Oea big-data-guide-1522052Gilbert Rozario
 
Fried data summit big data for lob content
Fried data summit big data for lob contentFried data summit big data for lob content
Fried data summit big data for lob content
Jeff Fried
 
How to make your data scientists happy
How to make your data scientists happy How to make your data scientists happy
How to make your data scientists happy
Hussain Sultan
 
02 a holistic approach to big data
02 a holistic approach to big data02 a holistic approach to big data
02 a holistic approach to big dataRaul Chong
 
Implementar una estrategia eficiente de gobierno y seguridad del dato con la ...
Implementar una estrategia eficiente de gobierno y seguridad del dato con la ...Implementar una estrategia eficiente de gobierno y seguridad del dato con la ...
Implementar una estrategia eficiente de gobierno y seguridad del dato con la ...
Denodo
 
Data-Centric Analytics and Understanding the Full Data Supply Chain
Data-Centric Analytics and Understanding the Full Data Supply ChainData-Centric Analytics and Understanding the Full Data Supply Chain
Data-Centric Analytics and Understanding the Full Data Supply Chain
DATAVERSITY
 

Similar to Reinventing the Modern Information Pipeline: Paxata and MapR (20)

Modern Data Management
Modern Data ManagementModern Data Management
Modern Data Management
 
Data Lake Architecture – Modern Strategies & Approaches
Data Lake Architecture – Modern Strategies & ApproachesData Lake Architecture – Modern Strategies & Approaches
Data Lake Architecture – Modern Strategies & Approaches
 
The Importance of Metadata
The Importance of MetadataThe Importance of Metadata
The Importance of Metadata
 
Decision Ready Data: Power Your Analytics with Great Data
Decision Ready Data: Power Your Analytics with Great DataDecision Ready Data: Power Your Analytics with Great Data
Decision Ready Data: Power Your Analytics with Great Data
 
Data Strategy Best Practices
Data Strategy Best PracticesData Strategy Best Practices
Data Strategy Best Practices
 
How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...
How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...
How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...
 
BDW Chicago 2016 - Ramu Kalvakuntla, Sr. Principal - Technical - Big Data Pra...
BDW Chicago 2016 - Ramu Kalvakuntla, Sr. Principal - Technical - Big Data Pra...BDW Chicago 2016 - Ramu Kalvakuntla, Sr. Principal - Technical - Big Data Pra...
BDW Chicago 2016 - Ramu Kalvakuntla, Sr. Principal - Technical - Big Data Pra...
 
Why Your Data Science Architecture Should Include a Data Virtualization Tool ...
Why Your Data Science Architecture Should Include a Data Virtualization Tool ...Why Your Data Science Architecture Should Include a Data Virtualization Tool ...
Why Your Data Science Architecture Should Include a Data Virtualization Tool ...
 
BAR360 open data platform presentation at DAMA, Sydney
BAR360 open data platform presentation at DAMA, SydneyBAR360 open data platform presentation at DAMA, Sydney
BAR360 open data platform presentation at DAMA, Sydney
 
Is Your Staff Big Data Ready? 5 Things to Know About What It Will Take to Suc...
Is Your Staff Big Data Ready? 5 Things to Know About What It Will Take to Suc...Is Your Staff Big Data Ready? 5 Things to Know About What It Will Take to Suc...
Is Your Staff Big Data Ready? 5 Things to Know About What It Will Take to Suc...
 
Leveraging Streaming Data through Automation
Leveraging Streaming Data through AutomationLeveraging Streaming Data through Automation
Leveraging Streaming Data through Automation
 
Hadoop 2015: what we larned -Think Big, A Teradata Company
Hadoop 2015: what we larned -Think Big, A Teradata CompanyHadoop 2015: what we larned -Think Big, A Teradata Company
Hadoop 2015: what we larned -Think Big, A Teradata Company
 
¿En qué se parece el Gobierno del Dato a un parque de atracciones?
¿En qué se parece el Gobierno del Dato a un parque de atracciones?¿En qué se parece el Gobierno del Dato a un parque de atracciones?
¿En qué se parece el Gobierno del Dato a un parque de atracciones?
 
Oea big-data-guide-1522052
Oea big-data-guide-1522052Oea big-data-guide-1522052
Oea big-data-guide-1522052
 
Oea big-data-guide-1522052
Oea big-data-guide-1522052Oea big-data-guide-1522052
Oea big-data-guide-1522052
 
Fried data summit big data for lob content
Fried data summit big data for lob contentFried data summit big data for lob content
Fried data summit big data for lob content
 
How to make your data scientists happy
How to make your data scientists happy How to make your data scientists happy
How to make your data scientists happy
 
02 a holistic approach to big data
02 a holistic approach to big data02 a holistic approach to big data
02 a holistic approach to big data
 
Implementar una estrategia eficiente de gobierno y seguridad del dato con la ...
Implementar una estrategia eficiente de gobierno y seguridad del dato con la ...Implementar una estrategia eficiente de gobierno y seguridad del dato con la ...
Implementar una estrategia eficiente de gobierno y seguridad del dato con la ...
 
Data-Centric Analytics and Understanding the Full Data Supply Chain
Data-Centric Analytics and Understanding the Full Data Supply ChainData-Centric Analytics and Understanding the Full Data Supply Chain
Data-Centric Analytics and Understanding the Full Data Supply Chain
 

Recently uploaded

【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
NABLAS株式会社
 
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdfCh03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
haila53
 
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
nscud
 
一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单
ewymefz
 
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
ewymefz
 
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
mbawufebxi
 
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
slg6lamcq
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP
 
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
axoqas
 
一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单
enxupq
 
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
ewymefz
 
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdfSample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Linda486226
 
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
ahzuo
 
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
AbhimanyuSinha9
 
FP Growth Algorithm and its Applications
FP Growth Algorithm and its ApplicationsFP Growth Algorithm and its Applications
FP Growth Algorithm and its Applications
MaleehaSheikh2
 
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
pchutichetpong
 
Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)
TravisMalana
 
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
ukgaet
 
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
slg6lamcq
 
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
vcaxypu
 

Recently uploaded (20)

【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
 
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdfCh03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
 
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
 
一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单
 
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
 
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
 
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
 
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
 
一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单
 
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
 
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdfSample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
 
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
 
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
 
FP Growth Algorithm and its Applications
FP Growth Algorithm and its ApplicationsFP Growth Algorithm and its Applications
FP Growth Algorithm and its Applications
 
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
 
Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)
 
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
 
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
 
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
 

Reinventing the Modern Information Pipeline: Paxata and MapR

  • 1. Reinventing the Information Pipeline From Big Data Strategy to Big Value December 2016
  • 2. Agenda • Introduction • Challenges in the Information Pipeline • Paxata in the Converged Data Platform
  • 3. Paxata’s mission (since 2012) Deliver the only enterprise-grade data preparation platform for everyone to transform raw, meaningless data into valuable, contextual and complete information
  • 4. 4 Source: Gartner News Room: http://www.gartner.com/newsroom/id/2975018 83%Companies agree that data is their most strategic asset
  • 5. 5 Source: Gartner News Room: http://www.gartner.com/newsroom/id/2975018 80%Time analysts will spend trying to create data sets to draw insights 83%Companies agree that data is their most strategic asset
  • 6. 6 Source: Gartner News Room: http://www.gartner.com/newsroom/id/2975018 12%Amount of data most companies estimate they are analyzing 80%Time analysts will spend trying to create data sets to draw insights 83%Companies agree that data is their most strategic asset
  • 7. 7 The data chasm Source: Gartner News Room: http://www.gartner.com/newsroom/id/2975018 12%Amount of data most companies estimate they are analyzing 80%Time analysts will spend trying to create data sets to draw insights 83%Companies agree that data is their most strategic asset
  • 8. Challenges in the Information Pipeline
  • 10. Traditional data preparation creates a bottleneck Business teams have complex data sources for analytics projects
  • 11. Traditional data preparation creates a bottleneck Business teams funnel their requirements to IT IT-centric data preparation Business Information
  • 12. Traditional data preparation creates a bottleneck IT runs requirements through a linear ETL process executed with manual scripting or coding IT-Centric Data Preparation Model Extract Transform Load Optimize Business Information
  • 13. Traditional data preparation creates a bottleneck IT reviews with business. Makes changes, fixes errors. (Repeat) IT-Centric Data Preparation Model Extract Transform Load Optimize Business Information
  • 14. Business teams make decisions before data is available -or- Ask for changes and restart the process. IT-Centric Data Preparation Model Extract Transform Load Optimize Business Information Traditional data preparation creates a bottleneck
  • 15. Designed for highly specialized technical people to prepare data for business teams IT-Centric Data Preparation Model Extract Transform Load Optimize Business Information Traditional data preparation creates a bottleneck
  • 16. Designing for highly specialized technical people to prepare data for business teams. Expensive Complicated Error-prone Time-consuming
  • 18. Modern architecture: balancing freedom with responsibility Built for business •Freedom and flexibility with collaboration
  • 19. Modern architecture: balancing freedom with responsibility Collect and manage data Time Built for business •Freedom and flexibility with collaboration Enabled by IT •Data governance, scale, efficiency
  • 20. Modern information pipeline is Built for business Freedom and flexibility with collaboration Enabled by IT Data governance, scale, efficiency
  • 21. Data prep must address the range of information workers
  • 22. Data prep must address the range of information workers Source: Forrester Research, Inc., “Info Workers Will Erase The Boundary Between Enterprise and Consumer Technologies,” August 30, 2012 Deep Technical Skills Limited Technical Skills Data Scientist Data Developer Data Analyst Business Analyst Information Worker
  • 23. Data prep must address the range of information workers Source: Forrester Research, Inc., “Info Workers Will Erase The Boundary Between Enterprise and Consumer Technologies,” August 30, 2012 Deep Technical Skills Limited Technical Skills Data Scientist (200K) Data Developer (600K) Data Analyst (100M) Business Analyst (275M) Information Worker (460M)
  • 24. Paxata accelerates the data to information pipeline
  • 25. Data Lake Enterprise Local Paxata accelerates the data to information pipeline
  • 26. Data Lake Enterprise Local Paxata accelerates the data to information pipeline
  • 27. Data Lake Enterprise Local Paxata accelerates the data to information pipeline BI/Visualization Predictive
  • 28. Data Lake Enterprise Local Paxata accelerates the data to information pipeline BI/Visualization Predictive Consumer experience for preparing data
  • 29. Architecture of the Paxata Adaptive Information Platform
  • 30. Architecture of the Paxata Adaptive Information Platform
  • 31. Contact us Paxata in the apps gallery Register for Paxata Live: www.paxata.com/events info@paxata.com www.youtube.com/PaxataTV www.paxata.com
  • 32. December 8, 2016© Paxata, Inc. 32 Thank You!

Editor's Notes

  1. Deliver the only enterprise-grade data preparation platform that lets everyone transform raw, meaningless data into valuable, contextual and complete information
  2. To seize the opportunity you must cross this data chasm. Why…Because its hard Traditional, legacy technologies and processes that companies currently leverage were NOT designed for the variety and volume of data that companies are working with today. Companies need to be more nimble We have many customers that have 10’s of Millions invested annual in traditional ETL processes, and they were still spending too much time preparing data and not on the value added tasks of analytics. They selected Paxata to help complement these technologies and fill the gaps with a more exploratory, interactive experience.
  3. To seize the opportunity you must cross this data chasm. Why…Because its hard Traditional, legacy technologies and processes that companies currently leverage were NOT designed for the variety and volume of data that companies are working with today. Companies need to be more nimble We have many customers that have 10’s of Millions invested annual in traditional ETL processes, and they were still spending too much time preparing data and not on the value added tasks of analytics. They selected Paxata to help complement these technologies and fill the gaps with a more exploratory, interactive experience.
  4. To seize the opportunity you must cross this data chasm. Why…Because its hard Traditional, legacy technologies and processes that companies currently leverage were NOT designed for the variety and volume of data that companies are working with today. Companies need to be more nimble We have many customers that have 10’s of Millions invested annual in traditional ETL processes, and they were still spending too much time preparing data and not on the value added tasks of analytics. They selected Paxata to help complement these technologies and fill the gaps with a more exploratory, interactive experience.
  5. To seize the opportunity you must cross this data chasm. Why…Because its hard Traditional, legacy technologies and processes that companies currently leverage were NOT designed for the variety and volume of data that companies are working with today. Companies need to be more nimble We have many customers that have 10’s of Millions invested annual in traditional ETL processes, and they were still spending too much time preparing data and not on the value added tasks of analytics. They selected Paxata to help complement these technologies and fill the gaps with a more exploratory, interactive experience.
  6. Deliver the only enterprise-grade data preparation platform that lets everyone transform raw, meaningless data into valuable, contextual and complete information
  7. Visual Data Discovery Tools – people had a hunger to get at and dig into their data – traditional small spreadsheets or databases 1. Business teams funnel their data requirements to IT 2. IT runs requirements through linear ETL process, executed with manual scripting or coding 3. IT reviews with business, makes changes, fixes errors. Repeats this cycle. 4. By then, business teams make decisions long before data is available or they ask for changes and re-start the process Traditional Technologies Do Not Meet Today’s Needs Batch, Complicated, No Visibility, IT Only, Time Consuming, Error Prone, Expensive Legacy infrastructure for data preparation was never designed to scale to the orders of magnitude more data and the orders of magnitude more consumers of today’s information-driven world. A model in which a small set of highly skilled IT data scientists and data developers take business requirements and then execute a highly prescribed, lengthy, waterfall process for preparing data only to more often than not realize that they missed the mark as they lack the business context, is not a viable model.
  8. Visual Data Discovery Tools – people had a hunger to get at and dig into their data – traditional small spreadsheets or databases 1. Business teams funnel their data requirements to IT 2. IT runs requirements through linear ETL process, executed with manual scripting or coding 3. IT reviews with business, makes changes, fixes errors. Repeats this cycle. 4. By then, business teams make decisions long before data is available or they ask for changes and re-start the process Traditional Technologies Do Not Meet Today’s Needs Batch, Complicated, No Visibility, IT Only, Time Consuming, Error Prone, Expensive Legacy infrastructure for data preparation was never designed to scale to the orders of magnitude more data and the orders of magnitude more consumers of today’s information-driven world. A model in which a small set of highly skilled IT data scientists and data developers take business requirements and then execute a highly prescribed, lengthy, waterfall process for preparing data only to more often than not realize that they missed the mark as they lack the business context, is not a viable model.
  9. Visual Data Discovery Tools – people had a hunger to get at and dig into their data – traditional small spreadsheets or databases 1. Business teams funnel their data requirements to IT 2. IT runs requirements through linear ETL process, executed with manual scripting or coding 3. IT reviews with business, makes changes, fixes errors. Repeats this cycle. 4. By then, business teams make decisions long before data is available or they ask for changes and re-start the process Traditional Technologies Do Not Meet Today’s Needs Batch, Complicated, No Visibility, IT Only, Time Consuming, Error Prone, Expensive Legacy infrastructure for data preparation was never designed to scale to the orders of magnitude more data and the orders of magnitude more consumers of today’s information-driven world. A model in which a small set of highly skilled IT data scientists and data developers take business requirements and then execute a highly prescribed, lengthy, waterfall process for preparing data only to more often than not realize that they missed the mark as they lack the business context, is not a viable model.
  10. Visual Data Discovery Tools – people had a hunger to get at and dig into their data – traditional small spreadsheets or databases 1. Business teams funnel their data requirements to IT 2. IT runs requirements through linear ETL process, executed with manual scripting or coding 3. IT reviews with business, makes changes, fixes errors. Repeats this cycle. 4. By then, business teams make decisions long before data is available or they ask for changes and re-start the process Traditional Technologies Do Not Meet Today’s Needs Batch, Complicated, No Visibility, IT Only, Time Consuming, Error Prone, Expensive Legacy infrastructure for data preparation was never designed to scale to the orders of magnitude more data and the orders of magnitude more consumers of today’s information-driven world. A model in which a small set of highly skilled IT data scientists and data developers take business requirements and then execute a highly prescribed, lengthy, waterfall process for preparing data only to more often than not realize that they missed the mark as they lack the business context, is not a viable model.
  11. Visual Data Discovery Tools – people had a hunger to get at and dig into their data – traditional small spreadsheets or databases 1. Business teams funnel their data requirements to IT 2. IT runs requirements through linear ETL process, executed with manual scripting or coding 3. IT reviews with business, makes changes, fixes errors. Repeats this cycle. 4. By then, business teams make decisions long before data is available or they ask for changes and re-start the process Traditional Technologies Do Not Meet Today’s Needs Batch, Complicated, No Visibility, IT Only, Time Consuming, Error Prone, Expensive Legacy infrastructure for data preparation was never designed to scale to the orders of magnitude more data and the orders of magnitude more consumers of today’s information-driven world. A model in which a small set of highly skilled IT data scientists and data developers take business requirements and then execute a highly prescribed, lengthy, waterfall process for preparing data only to more often than not realize that they missed the mark as they lack the business context, is not a viable model.
  12. Visual Data Discovery Tools – people had a hunger to get at and dig into their data – traditional small spreadsheets or databases 1. Business teams funnel their data requirements to IT 2. IT runs requirements through linear ETL process, executed with manual scripting or coding 3. IT reviews with business, makes changes, fixes errors. Repeats this cycle. 4. By then, business teams make decisions long before data is available or they ask for changes and re-start the process Traditional Technologies Do Not Meet Today’s Needs Batch, Complicated, No Visibility, IT Only, Time Consuming, Error Prone, Expensive Legacy infrastructure for data preparation was never designed to scale to the orders of magnitude more data and the orders of magnitude more consumers of today’s information-driven world. A model in which a small set of highly skilled IT data scientists and data developers take business requirements and then execute a highly prescribed, lengthy, waterfall process for preparing data only to more often than not realize that they missed the mark as they lack the business context, is not a viable model.
  13. Slide use: problem of data (option 4) This is a five-part slide. Use this along with the 4 slides before it. Talking Points: Big Data and self-service analytics necessitate a fundamental transformation from an IT-centric data preparation process to a self-service data preparation model. In the self-service model, the steps that make of data preparation – data integration, quality, cleansing, enrichment and shaping don’t go away, they need to be re-imagined in a way that enables the business or data analyst to accomplish these tasks on their own which in turn empowers them to work with vertical slices of relevant data and get the results they want, when they need them. However, it’s important that the self-service model also provide the governance and traceability that IT requires to maintain trust in data and analytic results. In this new model, IT’s role changes to collection and centralization of access to raw data and to providing the right infrastructure to the business that drive self-service data preparation and analytics, while maintaining full governance.
  14. Slide use: problem of data (option 4) This is a five-part slide. Use this along with the 4 slides before it. Talking Points: Big Data and self-service analytics necessitate a fundamental transformation from an IT-centric data preparation process to a self-service data preparation model. In the self-service model, the steps that make of data preparation – data integration, quality, cleansing, enrichment and shaping don’t go away, they need to be re-imagined in a way that enables the business or data analyst to accomplish these tasks on their own which in turn empowers them to work with vertical slices of relevant data and get the results they want, when they need them. However, it’s important that the self-service model also provide the governance and traceability that IT requires to maintain trust in data and analytic results. In this new model, IT’s role changes to collection and centralization of access to raw data and to providing the right infrastructure to the business that drive self-service data preparation and analytics, while maintaining full governance.
  15. Slide use: Who are the data analysts Talking points: This pyramid describes the typical information work roles in today’s enterprises and highlights the dramatic scale that self-service data preparation can bring. Legacy and many Big Data tools target the Data Scientist and the Data Developer, but as you can see there are hugely more data analysts our there, and self-service data prep empowers them to drive their own data destiny, breaking the logjam of traditional IT-constrained ETL and data preparation. By Data Analysts, we are referring to Power Excel users or Tableau users who understand data and analytics, but don’t write code or scripts. For self-service data prep to truly transform an organization, it must empower the data analyst; however, self-service data prep simplifies many traditionally complex and time-consuming preparation operations and the work of data scientists and data developers can be dramatically accelerated by self-service data prep. Source: Prakash VC deck
  16. Slide use: Who are the data analysts Talking points: This pyramid describes the typical information work roles in today’s enterprises and highlights the dramatic scale that self-service data preparation can bring. Legacy and many Big Data tools target the Data Scientist and the Data Developer, but as you can see there are hugely more data analysts our there, and self-service data prep empowers them to drive their own data destiny, breaking the logjam of traditional IT-constrained ETL and data preparation. By Data Analysts, we are referring to Power Excel users or Tableau users who understand data and analytics, but don’t write code or scripts. For self-service data prep to truly transform an organization, it must empower the data analyst; however, self-service data prep simplifies many traditionally complex and time-consuming preparation operations and the work of data scientists and data developers can be dramatically accelerated by self-service data prep.
  17. Deliver the only enterprise-grade data preparation platform that lets everyone transform raw, meaningless data into valuable, contextual and complete information