Volume, Variety and Velocity all increasing rapidly
Source: IBM, Oct 2012
OMG!…The Opportunity Is Huge
{"frequentlyPurchasedWith": [], "color": "Black", "skutype": "parent", "productTemplate": "Computer_Accessory",
"salesRankMediumTerm": "3039",
"shortDescription": "Compatible with Windows 8 and RT and Android 3.0 tablets; Bluetooth technology; convertible stand/carrying case",
"includedItemList": [{"includedItem": "Logitech Tablet Keyboard for Windows 8 and RT and Android 3.0+ Tablets"}, {"includedItem": "4 AAA batteries"}, {"includedItem": "Owner's manual"}],
"subclassId": 2409, "sku": 6541967, "width": "12.3\"", "subclass": "BLUETOOTH KEYBOARDS", "source": "BoxStore", "modelNumber": "920-004569", "digital": false, "department": "COMPUTERS", "type": "HardGood", "productId": 1218752781558, "description": "None", "technologyCode": "None",
"longDescription": "This Logitech 920-004569 keyboard features a low-profile, 65-key design for easy, comfortable typing on your Windows 8 or RT or Android 3.0 tablet. The convertible stand allows comfortable viewing and provides on-the-go protection for the keyboard.",
"categoryPath": [{"name": "Box Store", "id": "cat00000"}, {"name": "Computers & Tablets", "id": "abcat0500000"}, {"name": "iPad, Tablets & E-Readers", "id": "pcmcat209000050006"}, {"name": "Tablet Accessories", "id": "pcmcat231800050009"}, {"name": "Tablet Docks, Keyboards & Stands", "id": "pcmcat242000050003"}],
"manufacturer": "Logitech", "classId": 492, "upc": "097855090973", "regularPrice": 69.99, "class_1": "TABLET ACCESSORIES",
"relatedProducts": [{"sku": 4974041}, {"sku": 1306578835}, {"sku": 4640745}, {"sku": 9610542}, {"sku": 8785729}, {"sku": 6640676}]}
{"frequentlyPurchasedWith": [], "color": "Gray", "skutype": "parent", "productTemplate": "Computer_Accessory",
"salesRankMediumTerm": "None",
"shortDescription": "Compatible with BlackBerry Playbook tablets; wool construction; TPU plastic cradle; elastic band; metallic clip; functions as a stand; play-through design",
"includedItemList": [{"includedItem": "DICOTA TabBook Case for BlackBerry Playbook Tablets"}],
"subclassId": 2404, "sku": 6738835, "width": "5.5\"", "subclass": "SO TABLET ACCY", "source": "BoxStore", "modelNumber": "D30203", "digital": false, "department": "COMPUTERS", "type": "HardGood", "productId": 1218789793935, "description": "None", "technologyCode": "None",
"longDescription": "This DICOTA TabBook D30203 case helps keep your BlackBerry Playbook tablet safe from hazards, with wool construction and a TPU plastic cradle for durability and an elastic band to keep your tablet snug and secure in the case.",
"categoryPath": [{"name": "Box Store", "id": "cat00000"}, {"name": "Computers & Tablets", "id": "abcat0500000"}, {"name": "iPad, Tablets & E-Readers", "id": "pcmcat209000050006"}, {"name": "Tablet Accessories", "id": "pcmcat231800050009"}, {"name": "Tablet Cases, Covers & Sleeves", "id": "pcmcat242000050002"}],
"manufacturer": "DICOTA", "classId": 492, "upc": "7332752000964", "regularPrice": 49.99, "class_1": "TABLET ACCESSORIES",
"relatedProducts": [{"sku": 6118882}, {"sku": 6118737}, {"sku": 1305180220}, {"sku": 4124528}, {"sku": 1304792301}, {"sku": 6744043}]}
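Product records like the two above are semi-structured: numbers arrive as strings, missing values as the literal text "None", and hierarchy as nested lists. A minimal sketch, assuming a trimmed copy of the first record (only a few of its fields are kept) and hypothetical helper names `clean` and `summarize`, shows how such a record might be flattened for analysis:

```python
import json

# Trimmed copy of the first record above; only a few fields are kept for brevity.
# The raw string keeps the backslash so the inch mark stays escaped for JSON.
RAW_RECORD = r'''
{"sku": 6541967, "manufacturer": "Logitech", "width": "12.3\"", "regularPrice": 69.99,
 "salesRankMediumTerm": "3039", "description": "None",
 "categoryPath": [{"name": "Box Store", "id": "cat00000"},
                  {"name": "Computers & Tablets", "id": "abcat0500000"},
                  {"name": "Tablet Accessories", "id": "pcmcat231800050009"}],
 "relatedProducts": [{"sku": 4974041}, {"sku": 1306578835}]}
'''


def clean(value):
    """Map the literal string "None" used in the feed to a real None."""
    return None if value == "None" else value


def summarize(raw):
    """Flatten one product record into analysis-friendly fields (hypothetical helper)."""
    record = json.loads(raw)
    rank = clean(record["salesRankMediumTerm"])
    return {
        "sku": record["sku"],
        # Join the nested category list into one readable path.
        "category": " > ".join(c["name"] for c in record["categoryPath"]),
        # The width is a string with a trailing inch mark; strip it before converting.
        "width_in": float(record["width"].rstrip('"')),
        "sales_rank": int(rank) if rank is not None else None,
        "description": clean(record["description"]),
        "related_skus": [p["sku"] for p in record["relatedProducts"]],
    }


if __name__ == "__main__":
    print(summarize(RAW_RECORD))
```

The output field names (`category`, `width_in`, and so on) are illustrative choices, not part of the original feed.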
InformationWeek 2013
There are certainly challenges
A Success Formula for a Hadoop Project
WHAT (your company & you) + HOW (best practices) = success
MORE BI?

Hadoop: What Can We Make Happen?
“Data Platform of Intent”
Big Data Full Fidelity Analysis
Unstructured, Behavioral, Open, Affordable
HQL
Powered by the Community

RDBMS & EDW: Report On What Happened?
Database of Transactions
BI Data Cube Analysis
Structured, Transactional, Closed, Expensive
SQL
Powered by Vendors
1. What auxiliary products should we recommend?
2. What new features should our product have?
3. How can we eliminate support issues?
Hadoop Innovation Use Cases … some examples
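The first use case, recommending auxiliary products, can be approached by counting which items appear together in the same purchase. The sketch below is one simple way to do that, not a method described in the slides; the basket data and the function names `co_purchase_counts` and `recommend` are hypothetical.

```python
from collections import Counter
from itertools import combinations


def co_purchase_counts(baskets):
    """Count how often each unordered pair of products shares a basket."""
    counts = Counter()
    for basket in baskets:
        # Deduplicate and sort so each pair has one canonical (a, b) key.
        for a, b in combinations(sorted(set(basket)), 2):
            counts[(a, b)] += 1
    return counts


def recommend(product, counts, top_n=3):
    """Return up to top_n products most often bought together with `product`."""
    scores = Counter()
    for (a, b), n in counts.items():
        if a == product:
            scores[b] += n
        elif b == product:
            scores[a] += n
    return [item for item, _ in scores.most_common(top_n)]


if __name__ == "__main__":
    # Hypothetical purchase history; each inner list is one order.
    baskets = [
        ["tablet", "keyboard", "case"],
        ["tablet", "keyboard"],
        ["tablet", "case"],
        ["keyboard", "batteries"],
    ]
    counts = co_purchase_counts(baskets)
    print(recommend("tablet", counts, top_n=2))
```

On a real cluster the same pair-counting would run as a distributed job over the full order history; this in-memory version only illustrates the logic.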
Another Idea
Score Your Predictive Models On Hadoop
Model Builder
Model Description
Hive UDFs
Standard Hive
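The slide above names the pieces (a model builder, a model description, Hive UDFs) without showing the scoring step itself. As a rough illustration only, the sketch below scores tab-separated rows against a model description held as plain data. The logistic-regression form, the `MODEL` dictionary, and the column names are all assumptions, and Python stands in here for whatever language the UDFs were actually written in.

```python
import math

# Hypothetical model description: a logistic regression exported as plain data.
MODEL = {"intercept": -1.0, "weights": {"visits": 0.5, "cart_adds": 1.0}}


def score(row, model=MODEL):
    """Return the predicted probability for one row (a dict of feature values)."""
    z = model["intercept"] + sum(
        weight * float(row.get(name, 0.0))
        for name, weight in model["weights"].items()
    )
    return 1.0 / (1.0 + math.exp(-z))


def score_lines(lines, columns, model=MODEL):
    """Append a score column to each tab-separated input line."""
    for line in lines:
        values = line.rstrip("\n").split("\t")
        row = dict(zip(columns, values))
        yield "\t".join(values + [f"{score(row, model):.4f}"])


if __name__ == "__main__":
    sample = ["2\t0\n", "4\t1\n"]
    for out in score_lines(sample, ["visits", "cart_adds"]):
        print(out)
```

Reading and writing tab-separated lines matches the row format Hive uses when it streams rows through an external script with its TRANSFORM clause, which is one way a function like this could be applied to every row of a table.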
Reinvent Your Career
Lead
Thrive
This is the time to be the river, not the rock
Big Data Projects on Hadoop
We’re on a journey that’s just starting
You are here… Time to get going!
But first!
• Education
• Training
• Certifications
#1. Form a Partnership With LOB
• Find a use case
• Identify some budget
• Form a project team
• Be willing to educate others
• Partner on a small POC, don’t boil the ocean
Hadoop Project Success
Best Practice #1
Teams are Highly Cross-Functional
• Product Manager (LOB)
• Power Analysts (IT or LOB)
• Business Analyst (LOB)
• IT Architect (IT)
• Project Manager (IT/LOB)
“By 2016 the CMO will have more budget than the CIO”
- Gartner Group
Marketing: The “Budget Richest” LOB?
#2. Use the Right Big Data Analytics tooling
• Supports the entire team
• Reuse and share for speed and efficiency
• Leverage pre-built analytics
Hadoop Project Success
Best Practice #2
PERSONALIZED ANALYTICS HUB
#3. Embrace Full Fidelity Big Data Analytics
• Not sampled, all the data – maintains richness
• Don’t replicate or move the data
• Keep complexity and TCO low
Hadoop Project Success
Best Practice #3
Data Warehouse
OLTP to OLAP Mapping
Analyst
In Summary
In BI, The Analyst Was at the End of the Process
Ordering App
Financial App
Master Data
Staging
OLAP
Reports
BI Using Data Cube Analysis
Structured, Sampled, Transactional, Closed, Expensive
RDBMS & EDW
SQL
Driven by Vendors
In Big Data Analytics on Hadoop
The Analyst is at the Center of the Process
Application
Analytics
Data
Unstructured, Behavioral, Open, Affordable
HADOOP
HQL
Analyst
Full Fidelity Analytics
Get distracted….
Or
Big Data on Hadoop Project
Thank You
We’re happy to help
Questions

Advanced Test Driven-Development @ php[tek] 2024
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 

The Formula for Hadoop Project Success

  • 1.
  • 2. OMG! … The Opportunity Is Huge: Volume, Variety and Velocity all increasing rapidly (Source: IBM, Oct 2012)
  • 3. {"frequentlyPurchasedWith": [], "color": "Black", "skutype": "parent", "productTemplate": "Computer_Accessory", "salesRankMediumTerm": "3039", "shortDescription": "Compatible with Windows 8 and RT and Android 3.0 tablets; Bluetooth technology; convertible stand/carrying case", "includedItemList": [{"includedItem": "Logitech Tablet Keyboard for Windows 8 and RT and Android 3.0+ Tablets"}, {"includedItem": "4 AAA batteries"}, {"includedItem": "Owner's manual"}], "subclassId": 2409, "sku": 6541967, "width": "12.3\"", "subclass": "BLUETOOTH KEYBOARDS", "source": "BoxStore", "modelNumber": "920-004569", "digital": false, "department": "COMPUTERS", "type": "HardGood", "productId": 1218752781558, "description": "None", "technologyCode": "None", "longDescription": "This Logitech 920-004569 keyboard features a low-profile, 65-key design for easy, comfortable typing on your Windows 8 or RT or Android 3.0 tablet. The convertible stand allows comfortable viewing and provides on-the-go protection for the keyboard.", "categoryPath": [{"name": "Box Store", "id": "cat00000"}, {"name": "Computers & Tablets", "id": "abcat0500000"}, {"name": "iPad, Tablets & E-Readers", "id": "pcmcat209000050006"}, {"name": "Tablet Accessories", "id": "pcmcat231800050009"}, {"name": "Tablet Docks, Keyboards & Stands", "id": "pcmcat242000050003"}], "manufacturer": "Logitech", "classId": 492, "upc": "097855090973", "regularPrice": 69.99, "class_1": "TABLET ACCESSORIES", "relatedProducts": [{"sku": 4974041}, {"sku": 1306578835}, {"sku": 4640745}, {"sku": 9610542}, {"sku": 8785729}, {"sku": 6640676}]} {"frequentlyPurchasedWith": [], "color": "Gray", "skutype": "parent", "productTemplate": "Computer_Accessory", "salesRankMediumTerm": "None", "shortDescription": "Compatible with BlackBerry Playbook tablets; wool construction; TPU plastic cradle; elastic band; metallic clip; functions as a stand; play-through design", "includedItemList": [{"includedItem": "DICOTA TabBook Case for BlackBerry Playbook Tablets"}], "subclassId": 2404, "sku": 6738835, "width": "5.5\"", "subclass": "SO TABLET ACCY", "source": "BoxStore", "modelNumber": "D30203", "digital": false, "department": "COMPUTERS", "type": "HardGood", "productId": 1218789793935, "description": "None", "technologyCode": "None", "longDescription": "This DICOTA TabBook D30203 case helps keep your BlackBerry Playbook tablet safe from hazards, with wool construction and a TPU plastic cradle for durability and an elastic band to keep your tablet snug and secure in the case.", "categoryPath": [{"name": "Box Store", "id": "cat00000"}, {"name": "Computers & Tablets", "id": "abcat0500000"}, {"name": "iPad, Tablets & E-Readers", "id": "pcmcat209000050006"}, {"name": "Tablet Accessories", "id": "pcmcat231800050009"}, {"name": "Tablet Cases, Covers & Sleeves", "id": "pcmcat242000050002"}], "manufacturer": "DICOTA", "classId": 492, "upc": "7332752000964", "regularPrice": 49.99, "class_1": "TABLET ACCESSORIES", "relatedProducts": [{"sku": 6118882}, {"sku": 6118737}, {"sku": 1305180220}, {"sku": 4124528}, {"sku": 1304792301}, {"sku": 6744043}]}
  • 5. A Success Formula for a Hadoop Project: WHAT (your company & you) + HOW (best practices) = a successful Hadoop project
  • 7.
  • 8. “Data Platform of Intent” (What Can We Make Happen?): Big Data Full Fidelity Analysis; Unstructured, Behavioral; Open, Affordable; Hadoop, HQL; Powered by the Community. “Database of Transactions” (Report On What Happened?): BI Data Cube Analysis; Structured, Transactional; Closed, Expensive; RDBMS & EDW, SQL; Powered by Vendors.
  • 9. Hadoop Innovation Use Cases … some examples: 1. What auxiliary products should we recommend? 2. What new features should our product have? 3. How can we eliminate support issues?
  • 10.
  • 11. Another Idea: Score Your Predictive Models on Hadoop. Diagram elements: Model Builder, Model Description, Hive UDFs, Standard Hive
  • 12.
  • 14. This is the time to be the river, not the rock
  • 15. Big Data Projects on Hadoop: We’re on a journey that’s just starting. You Are Here … Time to get going!
  • 16. But first! Education; Training; Certifications
  • 17. Hadoop Project Success Best Practice #1: Form a Partnership With LOB. Find a use case; identify some budget; form a project team; be willing to educate others; partner on a small POC, don’t boil the ocean
  • 18. Teams are Highly Cross-Functional: Product Manager (LOB); Power Analysts (IT or LOB); Business Analyst (LOB); Product Manager (LOB); IT Architect (IT); Project Manager (IT/LOB)
  • 19. Marketing: The “Budget Richest” LOB? “By 2016 the CMO will have more budget than the CIO” (Gartner Group)
  • 20.
  • 21. Hadoop Project Success Best Practice #2: Use the Right Big Data Analytics Tooling. Supports the entire team; reuse and share for speed and efficiency; leverage pre-built analytics
  • 23. Hadoop Project Success Best Practice #3: Embrace Full Fidelity Big Data Analytics. Not sampled, all the data, which maintains richness; don’t replicate or move the data; keep complexity and TCO low
  • 24. In Summary: In BI, the Analyst Was at the End of the Process. Diagram elements: Ordering App, Financial App, Master Data, Staging, Data Warehouse, OLTP to OLAP Mapping, OLAP, Reports, Analyst. BI Using Data Cube Analysis: Structured, Sampled, Transactional; Closed, Expensive; RDBMS & EDW, SQL; Driven by Vendors
  • 25. In Big Data Analytics on Hadoop, the Analyst Is at the Center of the Process. Diagram elements: Application, Analytics, Data, Analyst. Full Fidelity Analytics: Unstructured, Behavioral; Open, Affordable; Hadoop, HQL
  • 27. Or Big Data on Hadoop Project
  • 28. Thank You. We’re happy to help. Questions?
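
The idea on slide 11 (scoring predictive models on Hadoop) combined with best practice #3 (score all the data, not a sample) can be sketched roughly as follows. This is a minimal illustration in plain Python, not the model-to-Hive-UDF tooling the deck refers to. The model description is a hypothetical logistic-regression dictionary, and the feature names `regularPrice` and `relatedCount` are assumptions chosen to resemble the product records on slide 3.

```python
import math

# Hypothetical model description: a stand-in for what a model builder
# might export. Intercept and weights are made up for illustration.
MODEL = {
    "intercept": -1.0,
    "coefficients": {"regularPrice": 0.01, "relatedCount": 0.2},
}


def score(record, model=MODEL):
    """Return a logistic-regression probability for one record."""
    z = model["intercept"]
    for name, weight in model["coefficients"].items():
        # Missing features are treated as 0.0 (an assumption of this sketch).
        z += weight * float(record.get(name, 0.0))
    return 1.0 / (1.0 + math.exp(-z))


def score_all(records, model=MODEL):
    """Score every record, the way a UDF is applied row by row."""
    return [score(r, model) for r in records]


if __name__ == "__main__":
    # Values taken from the two sample records on slide 3;
    # relatedCount is the length of each relatedProducts list.
    records = [
        {"sku": 6541967, "regularPrice": 69.99, "relatedCount": 6},
        {"sku": 6738835, "regularPrice": 49.99, "relatedCount": 6},
    ]
    for rec, p in zip(records, score_all(records)):
        print(rec["sku"], round(p, 3))
```

On a real cluster the per-record function would be packaged as a Hive UDF and invoked from a query, so the scoring runs where the data lives instead of moving the data to the model; that packaging step is not shown here.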

Editor's Notes

  1. Transform PMML models into standard Hadoop UDFs. Leverage the power of Hadoop to score models across all data in Hadoop and increase their accuracy.
  2. Today we are previewing Karmasphere 3.0 capabilities. One of the key new features we announced is a personalized Analytics dashboard. The dashboard helps users organize their big data analytics and puts relevant analytics work products at their fingertips. It also provides activity status so you can complete projects on time and on budget.