SlideShare a Scribd company logo
Grab some
coffee and
enjoy the
pre-­show
banter
before the
top of the
hour!
The Briefing Room
Big Data Refinery: Distilling Value for User-Driven Analytics
Twitter Tag: #briefr The Briefing Room
Welcome
Host:
Eric Kavanagh
eric.kavanagh@bloorgroup.com
@eric_kavanagh
Twitter Tag: #briefr The Briefing Room
  Reveal the essential characteristics of enterprise
software, good and bad
  Provide a forum for detailed analysis of today s innovative
technologies
  Give vendors a chance to explain their product to savvy
analysts
  Allow audience members to pose serious questions... and
get answers!
Mission
Twitter Tag: #briefr The Briefing Room
Topics
April: BIG DATA
May: CLOUD
June: INNOVATORS
Twitter Tag: #briefr The Briefing Room
Refinery is the Perfect Term
Ø Data Quality is a byproduct
Ø Master Data Management is an enabler
Ø Data Integration is changing
Twitter Tag: #briefr The Briefing Room
Analyst: John Myers
John Myers is Managing
Research Director of
Business Intelligence at
Enterprise Management
Associates
Twitter Tag: #briefr The Briefing Room
Pentaho
Pentaho offers a variety of business intelligence and
analytics products
Pentaho’s platform includes the Streamlined Data Refinery,
which provides access to any data source, and includes data
integration, governance, discovery, analysis and
visualization
Pentaho’s solution is designed to be user-driven for ease of
access and self-service
Twitter Tag: #briefr The Briefing Room
Guest: Chuck Yarbrough
Chuck is the Director of Big Data Product
Marketing at Pentaho, a leading big data
analytics company that helps organizations
engineer big data connections, blend data
and report and visualize all of their data.
Much of Chuck's focus at Pentaho is in
educating organizations on how big data
can help win, serve and retain customers,
lower costs and grow revenue through the
proper use of big data. A life-long
participant in the data game, Chuck has
held leadership roles at Deloitte
Consulting, SAP Business Objects, Hyperion
and National Semiconductor.
Twitter Tag: #briefr The Briefing Room
Perceptions & Questions
Analyst:
John Myers
Third Alternative Between Artistry and
Commodity
Slide 11
Artistry vs Commodity
2Years
8Hours
Slide 12 © 2015 Enterprise Management Associates, Inc.
Customizable Applications:
Best of Both Worlds
Slide 13 © 2015 Enterprise Management Associates, Inc.
Design Patterns:
Connecting Big Data
Slide 14 © 2015 Enterprise Management Associates, Inc.
Data Refining with a Hybrid Data Ecosystem
Slide 15 © 2015 Enterprise Management Associates, Inc.
Questions
Slide 16
Discussion Questions
•  Why not just use the data integration tools
that exist within the Hadoop “stack”? For
example, sqoop and flume are both provided by
Hadoop
•  There are differences between sandbox
environments and “operationalizating” data
integration for on-going operations. How can
Pentaho’s blueprints make those tasks
easier?
© 2015 Enterprise Management Associates, Inc.Slide 17
Discussion Questions
•  “Data Refinery” brings up images of a one-way
process from a crude state to a finished product
– much a like crude oil being “cracked” into
various products like heating oil, motor oil,
gasoline and jet fuel. Does Pentaho view the
“refining” of data as a one way proposition?
Or a more bi/multi-directional approach?
Slide 18 © 2015 Enterprise Management Associates, Inc.
Discussion Questions
•  Data governance is a wide ranging practice in
the work of data management. How does
Pentaho position itself within the breadth of
the concept of data governance?
•  Some have described “data wrangling” via the
Stanford wrangle project as self-service data
integration. How does Pentaho compare/
contrast with wrangle’s approach?
Slide 19 © 2015 Enterprise Management Associates, Inc.
Twitter Tag: #briefr The Briefing Room
Twitter Tag: #briefr The Briefing Room
Upcoming Topics
www.insideanalysis.com
April: BIG DATA
May: CLOUD
June: INNOVATORS
Twitter Tag: #briefr The Briefing Room
THANK YOU
for your
ATTENTION!
Some images provided courtesy of Wikimedia Commons and "Anacortes Refinery
31911" by Walter Siegmund (talk) - Own work. Licensed under CC BY 2.5 via
Wikimedia Commons - http://commons.wikimedia.org/wiki/
File:Anacortes_Refinery_31911.JPG#/media/File:Anacortes_Refinery_31911.JPG

More Related Content

What's hot

Industrializing data science: a view from the trenches
Industrializing data science: a view from the trenchesIndustrializing data science: a view from the trenches
Industrializing data science: a view from the trenches
Alexander Backus
 
Dynniq & GoDataDriven - Shaping the future of traffic with IoT and AI
Dynniq & GoDataDriven - Shaping the future of traffic with IoT and AIDynniq & GoDataDriven - Shaping the future of traffic with IoT and AI
Dynniq & GoDataDriven - Shaping the future of traffic with IoT and AI
BigDataExpo
 
HiLighter R+D seminar
HiLighter R+D seminarHiLighter R+D seminar
HiLighter R+D seminar
Astrid Fackelmann
 
(EN) Success Story openpack - Klingele: Klingele relies on openpack and the D...
(EN) Success Story openpack - Klingele: Klingele relies on openpack and the D...(EN) Success Story openpack - Klingele: Klingele relies on openpack and the D...
(EN) Success Story openpack - Klingele: Klingele relies on openpack and the D...
CIPA GmbH
 
Cwin16 tls-datalab for scientists
Cwin16 tls-datalab for scientistsCwin16 tls-datalab for scientists
Cwin16 tls-datalab for scientists
Capgemini
 
Miplm Case Study Smart Production
Miplm Case Study Smart ProductionMiplm Case Study Smart Production
Miplm Case Study Smart Production
MIPLM
 
Tableau and PowerPivot jenstirrup
Tableau and PowerPivot jenstirrupTableau and PowerPivot jenstirrup
Tableau and PowerPivot jenstirrup
Jen Stirrup
 
NetApp at Gartner Symposium Show Guide
NetApp at Gartner Symposium Show GuideNetApp at Gartner Symposium Show Guide
NetApp at Gartner Symposium Show Guide
NetAppUK
 
Business Discovery PPT
Business Discovery PPTBusiness Discovery PPT
Business Discovery PPTpdalalau
 
[GE Innovation Forum 2015] The Industrial Internet by Bill Ruh
[GE Innovation Forum 2015] The Industrial Internet by Bill Ruh[GE Innovation Forum 2015] The Industrial Internet by Bill Ruh
[GE Innovation Forum 2015] The Industrial Internet by Bill Ruh
GE코리아
 
Cbt storage at scale use case deck
Cbt storage at scale use case deckCbt storage at scale use case deck
Cbt storage at scale use case deck
jaswantinxero
 
Zdeněk Pejcel, Salesforce / technology.future 2016
Zdeněk Pejcel, Salesforce / technology.future 2016Zdeněk Pejcel, Salesforce / technology.future 2016
Zdeněk Pejcel, Salesforce / technology.future 2016
technologyfuture
 
Visual Frog - Leap into the Cloud
Visual Frog - Leap into the CloudVisual Frog - Leap into the Cloud
Visual Frog - Leap into the Cloud
cmcdonne
 
Application Portfolio Management Webinar
Application Portfolio Management WebinarApplication Portfolio Management Webinar
Application Portfolio Management Webinar
LeanIX GmbH
 
Andrew Aitken / technology.future 2016
Andrew Aitken / technology.future 2016Andrew Aitken / technology.future 2016
Andrew Aitken / technology.future 2016
technologyfuture
 
Product Cost Management PowerPoint Presentation Slides
Product Cost Management PowerPoint Presentation Slides Product Cost Management PowerPoint Presentation Slides
Product Cost Management PowerPoint Presentation Slides
SlideTeam
 

What's hot (18)

Industrializing data science: a view from the trenches
Industrializing data science: a view from the trenchesIndustrializing data science: a view from the trenches
Industrializing data science: a view from the trenches
 
Dynniq & GoDataDriven - Shaping the future of traffic with IoT and AI
Dynniq & GoDataDriven - Shaping the future of traffic with IoT and AIDynniq & GoDataDriven - Shaping the future of traffic with IoT and AI
Dynniq & GoDataDriven - Shaping the future of traffic with IoT and AI
 
HiLighter R+D seminar
HiLighter R+D seminarHiLighter R+D seminar
HiLighter R+D seminar
 
(EN) Success Story openpack - Klingele: Klingele relies on openpack and the D...
(EN) Success Story openpack - Klingele: Klingele relies on openpack and the D...(EN) Success Story openpack - Klingele: Klingele relies on openpack and the D...
(EN) Success Story openpack - Klingele: Klingele relies on openpack and the D...
 
Cwin16 tls-datalab for scientists
Cwin16 tls-datalab for scientistsCwin16 tls-datalab for scientists
Cwin16 tls-datalab for scientists
 
Miplm Case Study Smart Production
Miplm Case Study Smart ProductionMiplm Case Study Smart Production
Miplm Case Study Smart Production
 
Tableau and PowerPivot jenstirrup
Tableau and PowerPivot jenstirrupTableau and PowerPivot jenstirrup
Tableau and PowerPivot jenstirrup
 
NetApp at Gartner Symposium Show Guide
NetApp at Gartner Symposium Show GuideNetApp at Gartner Symposium Show Guide
NetApp at Gartner Symposium Show Guide
 
Business Discovery
Business DiscoveryBusiness Discovery
Business Discovery
 
Business Discovery PPT
Business Discovery PPTBusiness Discovery PPT
Business Discovery PPT
 
[GE Innovation Forum 2015] The Industrial Internet by Bill Ruh
[GE Innovation Forum 2015] The Industrial Internet by Bill Ruh[GE Innovation Forum 2015] The Industrial Internet by Bill Ruh
[GE Innovation Forum 2015] The Industrial Internet by Bill Ruh
 
pivotal_overview
pivotal_overviewpivotal_overview
pivotal_overview
 
Cbt storage at scale use case deck
Cbt storage at scale use case deckCbt storage at scale use case deck
Cbt storage at scale use case deck
 
Zdeněk Pejcel, Salesforce / technology.future 2016
Zdeněk Pejcel, Salesforce / technology.future 2016Zdeněk Pejcel, Salesforce / technology.future 2016
Zdeněk Pejcel, Salesforce / technology.future 2016
 
Visual Frog - Leap into the Cloud
Visual Frog - Leap into the CloudVisual Frog - Leap into the Cloud
Visual Frog - Leap into the Cloud
 
Application Portfolio Management Webinar
Application Portfolio Management WebinarApplication Portfolio Management Webinar
Application Portfolio Management Webinar
 
Andrew Aitken / technology.future 2016
Andrew Aitken / technology.future 2016Andrew Aitken / technology.future 2016
Andrew Aitken / technology.future 2016
 
Product Cost Management PowerPoint Presentation Slides
Product Cost Management PowerPoint Presentation Slides Product Cost Management PowerPoint Presentation Slides
Product Cost Management PowerPoint Presentation Slides
 

Similar to Big Data Refinery: Distilling Value for User-Driven Analytics

AppSphere 15 - Shining a Light on Shadow IT: A New Way of Working for "Busine...
AppSphere 15 - Shining a Light on Shadow IT: A New Way of Working for "Busine...AppSphere 15 - Shining a Light on Shadow IT: A New Way of Working for "Busine...
AppSphere 15 - Shining a Light on Shadow IT: A New Way of Working for "Busine...
AppDynamics
 
Big Data Enabled: How YARN Changes the Game
Big Data Enabled: How YARN Changes the GameBig Data Enabled: How YARN Changes the Game
Big Data Enabled: How YARN Changes the Game
Inside Analysis
 
Where the Warehouse Ends: A New Age of Information Access
Where the Warehouse Ends: A New Age of Information AccessWhere the Warehouse Ends: A New Age of Information Access
Where the Warehouse Ends: A New Age of Information Access
Inside Analysis
 
The Analytic Platform: Empowering the Business Now
The Analytic Platform: Empowering the Business NowThe Analytic Platform: Empowering the Business Now
The Analytic Platform: Empowering the Business Now
Inside Analysis
 
Crawl, Walk, Run: How to Get Started with Hadoop
Crawl, Walk, Run: How to Get Started with HadoopCrawl, Walk, Run: How to Get Started with Hadoop
Crawl, Walk, Run: How to Get Started with Hadoop
Inside Analysis
 
Drive It Home: A Roadmap for Today's Data-Driven Culture
Drive It Home: A Roadmap for Today's Data-Driven CultureDrive It Home: A Roadmap for Today's Data-Driven Culture
Drive It Home: A Roadmap for Today's Data-Driven Culture
Inside Analysis
 
Washington DC DataOps Meetup -- Nov 2019
Washington DC DataOps Meetup   -- Nov 2019Washington DC DataOps Meetup   -- Nov 2019
Washington DC DataOps Meetup -- Nov 2019
DataKitchen
 
Maximizing Big Data ROI via Best of Breed Technology Patterns and Practices -...
Maximizing Big Data ROI via Best of Breed Technology Patterns and Practices -...Maximizing Big Data ROI via Best of Breed Technology Patterns and Practices -...
Maximizing Big Data ROI via Best of Breed Technology Patterns and Practices -...
DigiMarCon - Digital Marketing, Media and Advertising Conferences & Exhibitions
 
The Maturity Model: Taking the Growing Pains Out of Hadoop
The Maturity Model: Taking the Growing Pains Out of HadoopThe Maturity Model: Taking the Growing Pains Out of Hadoop
The Maturity Model: Taking the Growing Pains Out of Hadoop
Inside Analysis
 
Data and its Role in Your Digital Transformation
Data and its Role in Your Digital TransformationData and its Role in Your Digital Transformation
Data and its Role in Your Digital Transformation
VMware Tanzu
 
Fit For Purpose: Preventing a Big Data Letdown
Fit For Purpose: Preventing a Big Data LetdownFit For Purpose: Preventing a Big Data Letdown
Fit For Purpose: Preventing a Big Data Letdown
Inside Analysis
 
Pentaho-BI
Pentaho-BIPentaho-BI
Pentaho-BIEdureka!
 
Role of Data in Digital Transformation
Role of Data in Digital TransformationRole of Data in Digital Transformation
Role of Data in Digital Transformation
VMware Tanzu
 
2015 insight business_partner_summit_sessions
2015 insight business_partner_summit_sessions2015 insight business_partner_summit_sessions
2015 insight business_partner_summit_sessions
Beth Jarvis-Maude
 
Age of Exploration: How to Achieve Enterprise-Wide Discovery
Age of Exploration: How to Achieve Enterprise-Wide DiscoveryAge of Exploration: How to Achieve Enterprise-Wide Discovery
Age of Exploration: How to Achieve Enterprise-Wide Discovery
Inside Analysis
 
Repurpose, Reuse and Refresh Content
Repurpose, Reuse and Refresh ContentRepurpose, Reuse and Refresh Content
Repurpose, Reuse and Refresh Content
Pam Didner
 
Smarter Analytics: Supporting the Enterprise with Automation
Smarter Analytics: Supporting the Enterprise with AutomationSmarter Analytics: Supporting the Enterprise with Automation
Smarter Analytics: Supporting the Enterprise with Automation
Inside Analysis
 
17 Must-Do's to Create a Product-Centric IT Organization
17 Must-Do's to Create a Product-Centric IT Organization17 Must-Do's to Create a Product-Centric IT Organization
17 Must-Do's to Create a Product-Centric IT Organization
Cognizant
 
Modern Data Integration Expert Session Webinar
Modern Data Integration Expert Session Webinar Modern Data Integration Expert Session Webinar
Modern Data Integration Expert Session Webinar
ibi
 

Similar to Big Data Refinery: Distilling Value for User-Driven Analytics (20)

AppSphere 15 - Shining a Light on Shadow IT: A New Way of Working for "Busine...
AppSphere 15 - Shining a Light on Shadow IT: A New Way of Working for "Busine...AppSphere 15 - Shining a Light on Shadow IT: A New Way of Working for "Busine...
AppSphere 15 - Shining a Light on Shadow IT: A New Way of Working for "Busine...
 
Big Data Enabled: How YARN Changes the Game
Big Data Enabled: How YARN Changes the GameBig Data Enabled: How YARN Changes the Game
Big Data Enabled: How YARN Changes the Game
 
Where the Warehouse Ends: A New Age of Information Access
Where the Warehouse Ends: A New Age of Information AccessWhere the Warehouse Ends: A New Age of Information Access
Where the Warehouse Ends: A New Age of Information Access
 
The Analytic Platform: Empowering the Business Now
The Analytic Platform: Empowering the Business NowThe Analytic Platform: Empowering the Business Now
The Analytic Platform: Empowering the Business Now
 
Crawl, Walk, Run: How to Get Started with Hadoop
Crawl, Walk, Run: How to Get Started with HadoopCrawl, Walk, Run: How to Get Started with Hadoop
Crawl, Walk, Run: How to Get Started with Hadoop
 
Drive It Home: A Roadmap for Today's Data-Driven Culture
Drive It Home: A Roadmap for Today's Data-Driven CultureDrive It Home: A Roadmap for Today's Data-Driven Culture
Drive It Home: A Roadmap for Today's Data-Driven Culture
 
Washington DC DataOps Meetup -- Nov 2019
Washington DC DataOps Meetup   -- Nov 2019Washington DC DataOps Meetup   -- Nov 2019
Washington DC DataOps Meetup -- Nov 2019
 
Maximizing Big Data ROI via Best of Breed Technology Patterns and Practices -...
Maximizing Big Data ROI via Best of Breed Technology Patterns and Practices -...Maximizing Big Data ROI via Best of Breed Technology Patterns and Practices -...
Maximizing Big Data ROI via Best of Breed Technology Patterns and Practices -...
 
Repurpose, Reuse, Refresh
Repurpose, Reuse, RefreshRepurpose, Reuse, Refresh
Repurpose, Reuse, Refresh
 
The Maturity Model: Taking the Growing Pains Out of Hadoop
The Maturity Model: Taking the Growing Pains Out of HadoopThe Maturity Model: Taking the Growing Pains Out of Hadoop
The Maturity Model: Taking the Growing Pains Out of Hadoop
 
Data and its Role in Your Digital Transformation
Data and its Role in Your Digital TransformationData and its Role in Your Digital Transformation
Data and its Role in Your Digital Transformation
 
Fit For Purpose: Preventing a Big Data Letdown
Fit For Purpose: Preventing a Big Data LetdownFit For Purpose: Preventing a Big Data Letdown
Fit For Purpose: Preventing a Big Data Letdown
 
Pentaho-BI
Pentaho-BIPentaho-BI
Pentaho-BI
 
Role of Data in Digital Transformation
Role of Data in Digital TransformationRole of Data in Digital Transformation
Role of Data in Digital Transformation
 
2015 insight business_partner_summit_sessions
2015 insight business_partner_summit_sessions2015 insight business_partner_summit_sessions
2015 insight business_partner_summit_sessions
 
Age of Exploration: How to Achieve Enterprise-Wide Discovery
Age of Exploration: How to Achieve Enterprise-Wide DiscoveryAge of Exploration: How to Achieve Enterprise-Wide Discovery
Age of Exploration: How to Achieve Enterprise-Wide Discovery
 
Repurpose, Reuse and Refresh Content
Repurpose, Reuse and Refresh ContentRepurpose, Reuse and Refresh Content
Repurpose, Reuse and Refresh Content
 
Smarter Analytics: Supporting the Enterprise with Automation
Smarter Analytics: Supporting the Enterprise with AutomationSmarter Analytics: Supporting the Enterprise with Automation
Smarter Analytics: Supporting the Enterprise with Automation
 
17 Must-Do's to Create a Product-Centric IT Organization
17 Must-Do's to Create a Product-Centric IT Organization17 Must-Do's to Create a Product-Centric IT Organization
17 Must-Do's to Create a Product-Centric IT Organization
 
Modern Data Integration Expert Session Webinar
Modern Data Integration Expert Session Webinar Modern Data Integration Expert Session Webinar
Modern Data Integration Expert Session Webinar
 

More from Inside Analysis

An Ounce of Prevention: Forging Healthy BI
An Ounce of Prevention: Forging Healthy BIAn Ounce of Prevention: Forging Healthy BI
An Ounce of Prevention: Forging Healthy BI
Inside Analysis
 
Agile, Automated, Aware: How to Model for Success
Agile, Automated, Aware: How to Model for SuccessAgile, Automated, Aware: How to Model for Success
Agile, Automated, Aware: How to Model for Success
Inside Analysis
 
First in Class: Optimizing the Data Lake for Tighter Integration
First in Class: Optimizing the Data Lake for Tighter IntegrationFirst in Class: Optimizing the Data Lake for Tighter Integration
First in Class: Optimizing the Data Lake for Tighter Integration
Inside Analysis
 
To Serve and Protect: Making Sense of Hadoop Security
To Serve and Protect: Making Sense of Hadoop Security To Serve and Protect: Making Sense of Hadoop Security
To Serve and Protect: Making Sense of Hadoop Security
Inside Analysis
 
The Hadoop Guarantee: Keeping Analytics Running On Time
The Hadoop Guarantee: Keeping Analytics Running On TimeThe Hadoop Guarantee: Keeping Analytics Running On Time
The Hadoop Guarantee: Keeping Analytics Running On Time
Inside Analysis
 
Introducing: A Complete Algebra of Data
Introducing: A Complete Algebra of DataIntroducing: A Complete Algebra of Data
Introducing: A Complete Algebra of Data
Inside Analysis
 
The Role of Data Wrangling in Driving Hadoop Adoption
The Role of Data Wrangling in Driving Hadoop AdoptionThe Role of Data Wrangling in Driving Hadoop Adoption
The Role of Data Wrangling in Driving Hadoop Adoption
Inside Analysis
 
Ahead of the Stream: How to Future-Proof Real-Time Analytics
Ahead of the Stream: How to Future-Proof Real-Time AnalyticsAhead of the Stream: How to Future-Proof Real-Time Analytics
Ahead of the Stream: How to Future-Proof Real-Time Analytics
Inside Analysis
 
All Together Now: Connected Analytics for the Internet of Everything
All Together Now: Connected Analytics for the Internet of EverythingAll Together Now: Connected Analytics for the Internet of Everything
All Together Now: Connected Analytics for the Internet of Everything
Inside Analysis
 
Goodbye, Bottlenecks: How Scale-Out and In-Memory Solve ETL
Goodbye, Bottlenecks: How Scale-Out and In-Memory Solve ETLGoodbye, Bottlenecks: How Scale-Out and In-Memory Solve ETL
Goodbye, Bottlenecks: How Scale-Out and In-Memory Solve ETL
Inside Analysis
 
The Biggest Picture: Situational Awareness on a Global Level
The Biggest Picture: Situational Awareness on a Global LevelThe Biggest Picture: Situational Awareness on a Global Level
The Biggest Picture: Situational Awareness on a Global Level
Inside Analysis
 
Structurally Sound: How to Tame Your Architecture
Structurally Sound: How to Tame Your ArchitectureStructurally Sound: How to Tame Your Architecture
Structurally Sound: How to Tame Your Architecture
Inside Analysis
 
SQL In Hadoop: Big Data Innovation Without the Risk
SQL In Hadoop: Big Data Innovation Without the RiskSQL In Hadoop: Big Data Innovation Without the Risk
SQL In Hadoop: Big Data Innovation Without the Risk
Inside Analysis
 
The Perfect Fit: Scalable Graph for Big Data
The Perfect Fit: Scalable Graph for Big DataThe Perfect Fit: Scalable Graph for Big Data
The Perfect Fit: Scalable Graph for Big Data
Inside Analysis
 
A Revolutionary Approach to Modernizing the Data Warehouse
A Revolutionary Approach to Modernizing the Data WarehouseA Revolutionary Approach to Modernizing the Data Warehouse
A Revolutionary Approach to Modernizing the Data Warehouse
Inside Analysis
 
Rethinking Data Availability and Governance in a Mobile World
Rethinking Data Availability and Governance in a Mobile WorldRethinking Data Availability and Governance in a Mobile World
Rethinking Data Availability and Governance in a Mobile World
Inside Analysis
 
DisrupTech - Dave Duggal
DisrupTech - Dave DuggalDisrupTech - Dave Duggal
DisrupTech - Dave Duggal
Inside Analysis
 
Modus Operandi
Modus OperandiModus Operandi
Modus Operandi
Inside Analysis
 
Phasic Systems - Dr. Geoffrey Malafsky
Phasic Systems - Dr. Geoffrey MalafskyPhasic Systems - Dr. Geoffrey Malafsky
Phasic Systems - Dr. Geoffrey Malafsky
Inside Analysis
 
WebAction-Sami Abkay
WebAction-Sami AbkayWebAction-Sami Abkay
WebAction-Sami Abkay
Inside Analysis
 

More from Inside Analysis (20)

An Ounce of Prevention: Forging Healthy BI
An Ounce of Prevention: Forging Healthy BIAn Ounce of Prevention: Forging Healthy BI
An Ounce of Prevention: Forging Healthy BI
 
Agile, Automated, Aware: How to Model for Success
Agile, Automated, Aware: How to Model for SuccessAgile, Automated, Aware: How to Model for Success
Agile, Automated, Aware: How to Model for Success
 
First in Class: Optimizing the Data Lake for Tighter Integration
First in Class: Optimizing the Data Lake for Tighter IntegrationFirst in Class: Optimizing the Data Lake for Tighter Integration
First in Class: Optimizing the Data Lake for Tighter Integration
 
To Serve and Protect: Making Sense of Hadoop Security
To Serve and Protect: Making Sense of Hadoop Security To Serve and Protect: Making Sense of Hadoop Security
To Serve and Protect: Making Sense of Hadoop Security
 
The Hadoop Guarantee: Keeping Analytics Running On Time
The Hadoop Guarantee: Keeping Analytics Running On TimeThe Hadoop Guarantee: Keeping Analytics Running On Time
The Hadoop Guarantee: Keeping Analytics Running On Time
 
Introducing: A Complete Algebra of Data
Introducing: A Complete Algebra of DataIntroducing: A Complete Algebra of Data
Introducing: A Complete Algebra of Data
 
The Role of Data Wrangling in Driving Hadoop Adoption
The Role of Data Wrangling in Driving Hadoop AdoptionThe Role of Data Wrangling in Driving Hadoop Adoption
The Role of Data Wrangling in Driving Hadoop Adoption
 
Ahead of the Stream: How to Future-Proof Real-Time Analytics
Ahead of the Stream: How to Future-Proof Real-Time AnalyticsAhead of the Stream: How to Future-Proof Real-Time Analytics
Ahead of the Stream: How to Future-Proof Real-Time Analytics
 
All Together Now: Connected Analytics for the Internet of Everything
All Together Now: Connected Analytics for the Internet of EverythingAll Together Now: Connected Analytics for the Internet of Everything
All Together Now: Connected Analytics for the Internet of Everything
 
Goodbye, Bottlenecks: How Scale-Out and In-Memory Solve ETL
Goodbye, Bottlenecks: How Scale-Out and In-Memory Solve ETLGoodbye, Bottlenecks: How Scale-Out and In-Memory Solve ETL
Goodbye, Bottlenecks: How Scale-Out and In-Memory Solve ETL
 
The Biggest Picture: Situational Awareness on a Global Level
The Biggest Picture: Situational Awareness on a Global LevelThe Biggest Picture: Situational Awareness on a Global Level
The Biggest Picture: Situational Awareness on a Global Level
 
Structurally Sound: How to Tame Your Architecture
Structurally Sound: How to Tame Your ArchitectureStructurally Sound: How to Tame Your Architecture
Structurally Sound: How to Tame Your Architecture
 
SQL In Hadoop: Big Data Innovation Without the Risk
SQL In Hadoop: Big Data Innovation Without the RiskSQL In Hadoop: Big Data Innovation Without the Risk
SQL In Hadoop: Big Data Innovation Without the Risk
 
The Perfect Fit: Scalable Graph for Big Data
The Perfect Fit: Scalable Graph for Big DataThe Perfect Fit: Scalable Graph for Big Data
The Perfect Fit: Scalable Graph for Big Data
 
A Revolutionary Approach to Modernizing the Data Warehouse
A Revolutionary Approach to Modernizing the Data WarehouseA Revolutionary Approach to Modernizing the Data Warehouse
A Revolutionary Approach to Modernizing the Data Warehouse
 
Rethinking Data Availability and Governance in a Mobile World
Rethinking Data Availability and Governance in a Mobile WorldRethinking Data Availability and Governance in a Mobile World
Rethinking Data Availability and Governance in a Mobile World
 
DisrupTech - Dave Duggal
DisrupTech - Dave DuggalDisrupTech - Dave Duggal
DisrupTech - Dave Duggal
 
Modus Operandi
Modus OperandiModus Operandi
Modus Operandi
 
Phasic Systems - Dr. Geoffrey Malafsky
Phasic Systems - Dr. Geoffrey MalafskyPhasic Systems - Dr. Geoffrey Malafsky
Phasic Systems - Dr. Geoffrey Malafsky
 
WebAction-Sami Abkay
WebAction-Sami AbkayWebAction-Sami Abkay
WebAction-Sami Abkay
 

Recently uploaded

A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...
sonjaschweigert1
 
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
Neo4j
 
Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !
KatiaHIMEUR1
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
Safe Software
 
GraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge GraphGraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge Graph
Guy Korland
 
Elizabeth Buie - Older adults: Are we really designing for our future selves?
Elizabeth Buie - Older adults: Are we really designing for our future selves?Elizabeth Buie - Older adults: Are we really designing for our future selves?
Elizabeth Buie - Older adults: Are we really designing for our future selves?
Nexer Digital
 
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
Neo4j
 
By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024
Pierluigi Pugliese
 
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
名前 です男
 
Epistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI supportEpistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI support
Alan Dix
 
UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5
DianaGray10
 
20240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 202420240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 2024
Matthew Sinclair
 
DevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA ConnectDevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA Connect
Kari Kakkonen
 
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
Neo4j
 
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
Neo4j
 
Uni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdfUni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems S.M.S.A.
 
Climate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing DaysClimate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing Days
Kari Kakkonen
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
DanBrown980551
 
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
Neo4j
 
Free Complete Python - A step towards Data Science
Free Complete Python - A step towards Data ScienceFree Complete Python - A step towards Data Science
Free Complete Python - A step towards Data Science
RinaMondal9
 

Recently uploaded (20)

A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...
 
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
 
Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
 
GraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge GraphGraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge Graph
 
Elizabeth Buie - Older adults: Are we really designing for our future selves?
Elizabeth Buie - Older adults: Are we really designing for our future selves?Elizabeth Buie - Older adults: Are we really designing for our future selves?
Elizabeth Buie - Older adults: Are we really designing for our future selves?
 
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
 
By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024
 
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
 
Epistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI supportEpistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI support
 
UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5
 
20240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 202420240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 2024
 
DevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA ConnectDevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA Connect
 
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
 
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
 
Uni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdfUni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdf
 
Climate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing DaysClimate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing Days
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
 
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
 
Free Complete Python - A step towards Data Science
Free Complete Python - A step towards Data ScienceFree Complete Python - A step towards Data Science
Free Complete Python - A step towards Data Science
 

Big Data Refinery: Distilling Value for User-Driven Analytics

  • 1. Grab some coffee and enjoy the pre-­show banter before the top of the hour!
  • 2. The Briefing Room Big Data Refinery: Distilling Value for User-Driven Analytics
  • 3. Twitter Tag: #briefr The Briefing Room Welcome Host: Eric Kavanagh eric.kavanagh@bloorgroup.com @eric_kavanagh
  • 4. Twitter Tag: #briefr The Briefing Room   Reveal the essential characteristics of enterprise software, good and bad   Provide a forum for detailed analysis of today s innovative technologies   Give vendors a chance to explain their product to savvy analysts   Allow audience members to pose serious questions... and get answers! Mission
  • 5. Twitter Tag: #briefr The Briefing Room Topics April: BIG DATA May: CLOUD June: INNOVATORS
  • 6. Twitter Tag: #briefr The Briefing Room Refinery is the Perfect Term Ø Data Quality is a byproduct Ø Master Data Management is an enabler Ø Data Integration is changing
  • 7. Twitter Tag: #briefr The Briefing Room Analyst: John Myers John Myers is Managing Research Director of Business Intelligence at Enterprise Management Associates
  • 8. Twitter Tag: #briefr The Briefing Room Pentaho Pentaho offers a variety of business intelligence and analytics products Pentaho’s platform includes the Streamlined Data Refinery, which provides access to any data source, and includes data integration, governance, discovery, analysis and visualization Pentaho’s solution is designed to be user-driven for ease of access and self-service
  • 9. Twitter Tag: #briefr The Briefing Room Guest: Chuck Yarbrough Chuck is the Director of Big Data Product Marketing at Pentaho, a leading big data analytics company that helps organizations engineer big data connections, blend data and report and visualize all of their data. Much of Chuck's focus at Pentaho is in educating organizations on how big data can help win, serve and retain customers, lower costs and grow revenue through the proper use of big data. A life-long participant in the data game, Chuck has held leadership roles at Deloitte Consulting, SAP Business Objects, Hyperion and National Semiconductor.
  • 10. Twitter Tag: #briefr The Briefing Room Perceptions & Questions Analyst: John Myers
  • 11. Third Alternative Between Artistry and Commodity Slide 11
  • 12. Artistry vs Commodity 2Years 8Hours Slide 12 © 2015 Enterprise Management Associates, Inc.
  • 13. Customizable Applications: Best of Both Worlds Slide 13 © 2015 Enterprise Management Associates, Inc.
  • 14. Design Patterns: Connecting Big Data Slide 14 © 2015 Enterprise Management Associates, Inc.
  • 15. Data Refining with a Hybrid Data Ecosystem Slide 15 © 2015 Enterprise Management Associates, Inc.
  • 17. Discussion Questions •  Why not just use the data integration tools that exist within the Hadoop “stack”? For example, sqoop and flume are both provided by Hadoop •  There are differences between sandbox environments and “operationalizating” data integration for on-going operations. How can Pentaho’s blueprints make those tasks easier? © 2015 Enterprise Management Associates, Inc.Slide 17
  • 18. Discussion Questions •  “Data Refinery” brings up images of a one-way process from a crude state to a finished product – much a like crude oil being “cracked” into various products like heating oil, motor oil, gasoline and jet fuel. Does Pentaho view the “refining” of data as a one way proposition? Or a more bi/multi-directional approach? Slide 18 © 2015 Enterprise Management Associates, Inc.
  • 19. Discussion Questions •  Data governance is a wide ranging practice in the work of data management. How does Pentaho position itself within the breadth of the concept of data governance? •  Some have described “data wrangling” via the Stanford wrangle project as self-service data integration. How does Pentaho compare/ contrast with wrangle’s approach? Slide 19 © 2015 Enterprise Management Associates, Inc.
  • 20. Twitter Tag: #briefr The Briefing Room
  • 21. Twitter Tag: #briefr The Briefing Room Upcoming Topics www.insideanalysis.com April: BIG DATA May: CLOUD June: INNOVATORS
  • 22. Twitter Tag: #briefr The Briefing Room THANK YOU for your ATTENTION! Some images provided courtesy of Wikimedia Commons and "Anacortes Refinery 31911" by Walter Siegmund (talk) - Own work. Licensed under CC BY 2.5 via Wikimedia Commons - http://commons.wikimedia.org/wiki/ File:Anacortes_Refinery_31911.JPG#/media/File:Anacortes_Refinery_31911.JPG