SlideShare a Scribd company logo
1 of 27
Building Your Big Data
  Analytics Strategy: Block by
             Block



                         @impetuscalling




                     Recorded version available at                  1
http://www.impetus.com/webinar_registration?event=archived&eid=53
Outline

                      Building a Big Data Strategy
                         Big Data & 3V’s
                                    3V s
                         3 V’s model
                         Big Data Analytics Lifecycle
                      Strategy Selection
                      Technology Selection
                         Hadoop E
                         H d    Ecosystem
                                      t
                         Alternative
                      Putting it Together
                            g      g
                      Case Studies and Applications
                      Q&A’s

                                             Recorded version available at 
Impetus Proprietary     http://www.impetus.com/webinar_registration?event=archived&eid=53   2
Building a Big Data Strategy

                      Gather Requirements
                         What needs to be done?
                                                                     Requirements
                         Objectives
                      Choose Candidate Strategy Options                          Candidate
                                                                                  Strategy
                         Patterns & Best Practices                               Selection


                      Choose Tools and Technology                                              Tools &
                                                                                             Technology
                                                                                              Selection
                      Implementation
                      I l        i
                         Operational Readiness                                                        Implementation




                                             Recorded version available at 
Impetus Proprietary     http://www.impetus.com/webinar_registration?event=archived&eid=53                         3
Big Data & 3V’s Model

                      What is Big Data?
                         Define by size or volume
                         or by ‘breakdown’


                      3V’s model
                         Variety of Data
                         Volume of D t
                         V l     f Data
                         Velocity of Data




                                             Recorded version available at 
Impetus Proprietary     http://www.impetus.com/webinar_registration?event=archived&eid=53   4
Big Data Analytics Life Cycle




                                            Ingestion                                     Visualization




                      Creation                                      Analysis




                                           Recorded version available at 
Impetus Proprietary   http://www.impetus.com/webinar_registration?event=archived&eid=53                   5
BIG Data Analytics Life Cycle: Concerns




                                            Ingestion                                     Visualization
          •   Storage                                        • Tools &
          •   Elasticity                                       Technologies
                                     • Integrations          • Testing               • Channels
          •   Monitoring
                                     • Tools &               • Pre Built             • In Memory
          •   Compression
                   p                   Technologies                                    Support
                                                               Solutions
                                     • Standardization                               • Standardization

                      Creation                                      Analysis




                                           Recorded version available at 
Impetus Proprietary   http://www.impetus.com/webinar_registration?event=archived&eid=53                   6
BIG Data Analytics Life Cycle & 3V’s


                      Simple and potent tool to analyze strategy requirements
                         Answer simple questions of how much what type and at what rate
                                                        much,
                      Applicable to each phase
                      Using matrix to select suitable strategy
                          g                                 gy
                      Dictates the potential choice of solutions, tools & technologies




                                             Recorded version available at 
Impetus Proprietary     http://www.impetus.com/webinar_registration?event=archived&eid=53   7
BIG Data Analytics Life Cycle & 3V’s



                                                 How M h?
                                                 H   Much?           What T
                                                                     Wh t Type?
                                                                              ?           What R t ?
                                                                                          Wh t Rate?


                      Creation
                      •   Storage
                      •   Elasticity
                      •   Monitoring
                      •   Compression

                      Ingestion
                      Analysis
                      Visualization

                                               Recorded version available at 
Impetus Proprietary       http://www.impetus.com/webinar_registration?event=archived&eid=53            8
Strategy Selection




                                           Recorded version available at 
Impetus Proprietary   http://www.impetus.com/webinar_registration?event=archived&eid=53   9
Big Data Analytics Strategy


                      Creation             Ingestion               Analysis             Visualization




                                             Recorded version available at 
Impetus Proprietary     http://www.impetus.com/webinar_registration?event=archived&eid=53               10
Technology Selection




                                           Recorded version available at 
Impetus Proprietary   http://www.impetus.com/webinar_registration?event=archived&eid=53   11
The Hadoop Ecosystem




                                           Recorded version available at 
Impetus Proprietary   http://www.impetus.com/webinar_registration?event=archived&eid=53   12
Alternate/Emerging Options

                      Making stuff Faster
                         Pervasive Datarush Hstreaming
                                   Datarush,
                         Cloud Map Reduce
                         HPCC, Datastax Brisk, Platform Computing
                         MARS, GPMR
                         Major MPPs-in-database MR-Oracle, Aster etc
                         Hadapt
                      NOSQL
                         Cassandra, MongoDB, Hbase
                         Riak
                         Redis


                                             Recorded version available at 
Impetus Proprietary     http://www.impetus.com/webinar_registration?event=archived&eid=53   13
Alternate/Emerging Options

                      Graph Type DB’s
                         Neo4j
                         HyperGraphDB
                         InfiniteGraph
                         Pregel
                         Trinity
                      Faster SQL DB’s
                                 DB s
                         VoltDB, Clustrix
                      Hardware + Software Solutions
                         Exadata , Parstream
                         Virtualized Options of Hardware + Software Solutions such as
                          e ou d
                         Xeround

                                             Recorded version available at 
Impetus Proprietary     http://www.impetus.com/webinar_registration?event=archived&eid=53   14
Putting it Together




                                           Recorded version available at 
Impetus Proprietary   http://www.impetus.com/webinar_registration?event=archived&eid=53   15
Indirect Analytics over Hadoop




                                           Recorded version available at 
Impetus Proprietary   http://www.impetus.com/webinar_registration?event=archived&eid=53   16
Direct Analytics over Hadoop




                                           Recorded version available at 
Impetus Proprietary   http://www.impetus.com/webinar_registration?event=archived&eid=53   17
Analytics over Hadoop with MPP DW




                                           Recorded version available at 
Impetus Proprietary   http://www.impetus.com/webinar_registration?event=archived&eid=53   18
Case Studies




                                           Recorded version available at 
Impetus Proprietary   http://www.impetus.com/webinar_registration?event=archived&eid=53   19
Social Media Analytics

                      Problem Statement
                         Analytics on huge data sets populated from live streaming data
                         Simplifying services, cost reduction, proactive analysis on
                         customer’s feedback
                      Challenges
                         Live data streaming from social media websites
                         Clustering
                              Learn typical comments, demands, questions
                              Value: Helps identify response / behavior anomalies
                         Classification
                              Learn to identify known patterns automatically
                              Value: useful in filtering, pre-emptive addressing, gaining
                              customer confidence
                                             Recorded version available at 
Impetus Proprietary     http://www.impetus.com/webinar_registration?event=archived&eid=53   20
Social Media Analytics (cont..)

                      Approach
                         Prepare matrix to capture How Much? What Type? What Rate
                                                       Much?,     Type?,
                         against each phase
                         Use big data solution strategy covering all concerns of big data
                         analytics lifecycle
                      Solution
                         Architected a flexible and scalable solution with near real time
                         streaming of social media d t on d il /h l scheduled j b
                          t     i    f    i l    di data    daily/hourly h d l d jobs
                         Built a solution based on Hadoop, HBase, Hive and Mahout




                                             Recorded version available at 
Impetus Proprietary     http://www.impetus.com/webinar_registration?event=archived&eid=53   21
Solution Overview

                  Creation              Ingestion               Analysis             Visualization




                                           Recorded version available at 
Impetus Proprietary   http://www.impetus.com/webinar_registration?event=archived&eid=53              22
Summing Up


             Creating a matrix to build suitable strategy
                  Enables creation of a platform or a solution to manage 3Vs of data
            Solutions, tools & technologies
                  Hadoop based Big Data Analytics is a scalable and cost effective
                  option
            Strategy selection




                                         Recorded version available at 
Impetus Proprietary http://www.impetus.com/webinar_registration?event=archived&eid=53   23
About Us

Strategic partners for software product engineering and R&D
Thought
Tho ght leaders in cutting-edge technologies
                   c tting edge
Mature processes and practices that are methodical, yet flexible
Diverse domain expertise



          Our
          O services in Big Data and Analytics
                i    i Bi D t      d A l ti
              Expert consulting
              Proof-of-concept & Implementation
              Support services



                      Recorded version available at 
 http://www.impetus.com/webinar_registration?event=archived&eid=53
Questions




             Please send in your questions
                     using the chat panel




                     Recorded version available at 
http://www.impetus.com/webinar_registration?event=archived&eid=53   25
Thank you
                          y
                   For more information,
           write to us at inquiry@impetus.com




                         @impetuscalling




                     Recorded version available at 
http://www.impetus.com/webinar_registration?event=archived&eid=53
Building Your Big Data Analytics Strategy- Impetus Webinar

More Related Content

What's hot

Analytics Strategy and Roadmap Offering v2 (1)
Analytics Strategy and Roadmap Offering v2 (1)Analytics Strategy and Roadmap Offering v2 (1)
Analytics Strategy and Roadmap Offering v2 (1)
Joey Amanchukwu
 
Valuing the data asset
Valuing the data assetValuing the data asset
Valuing the data asset
Bala Iyer
 

What's hot (20)

Integrate Your Data Science & Omni-channel Strategy to Reduce Cost and Increa...
Integrate Your Data Science & Omni-channel Strategy to Reduce Cost and Increa...Integrate Your Data Science & Omni-channel Strategy to Reduce Cost and Increa...
Integrate Your Data Science & Omni-channel Strategy to Reduce Cost and Increa...
 
Predictive vs Prescriptive Analytics
Predictive vs Prescriptive AnalyticsPredictive vs Prescriptive Analytics
Predictive vs Prescriptive Analytics
 
Future and scope of big data analytics in Digital Finance and banking.
Future and scope of big data analytics in Digital Finance and banking.Future and scope of big data analytics in Digital Finance and banking.
Future and scope of big data analytics in Digital Finance and banking.
 
Enova presentation at the Chief Analytics Officer Forum East Coast USA (#CAOF...
Enova presentation at the Chief Analytics Officer Forum East Coast USA (#CAOF...Enova presentation at the Chief Analytics Officer Forum East Coast USA (#CAOF...
Enova presentation at the Chief Analytics Officer Forum East Coast USA (#CAOF...
 
Computer Vision: Coming to a Store Near You - Brent Biddulph
Computer Vision: Coming to a Store Near You - Brent BiddulphComputer Vision: Coming to a Store Near You - Brent Biddulph
Computer Vision: Coming to a Store Near You - Brent Biddulph
 
TIBCO presentation at the Chief Analytics Officer Forum East Coast 2016 (#CAO...
TIBCO presentation at the Chief Analytics Officer Forum East Coast 2016 (#CAO...TIBCO presentation at the Chief Analytics Officer Forum East Coast 2016 (#CAO...
TIBCO presentation at the Chief Analytics Officer Forum East Coast 2016 (#CAO...
 
What we do; predictive and prescriptive analytics
What we do; predictive and prescriptive analyticsWhat we do; predictive and prescriptive analytics
What we do; predictive and prescriptive analytics
 
State Farm presentation at the Chief Analytics Officer Forum East Coast USA (...
State Farm presentation at the Chief Analytics Officer Forum East Coast USA (...State Farm presentation at the Chief Analytics Officer Forum East Coast USA (...
State Farm presentation at the Chief Analytics Officer Forum East Coast USA (...
 
Dow Chemical presentation at the Chief Analytics Officer Forum East Coast USA...
Dow Chemical presentation at the Chief Analytics Officer Forum East Coast USA...Dow Chemical presentation at the Chief Analytics Officer Forum East Coast USA...
Dow Chemical presentation at the Chief Analytics Officer Forum East Coast USA...
 
Customer analytics. Turn big data into big value
Customer analytics. Turn big data into big valueCustomer analytics. Turn big data into big value
Customer analytics. Turn big data into big value
 
Big Data Strategies
Big Data StrategiesBig Data Strategies
Big Data Strategies
 
Seagate
SeagateSeagate
Seagate
 
Analytics Strategy and Roadmap Offering v2 (1)
Analytics Strategy and Roadmap Offering v2 (1)Analytics Strategy and Roadmap Offering v2 (1)
Analytics Strategy and Roadmap Offering v2 (1)
 
Big Data & Analytics Client Examples
Big Data & Analytics Client ExamplesBig Data & Analytics Client Examples
Big Data & Analytics Client Examples
 
How advanced analytics is impacting the banking sector
How advanced analytics is impacting the banking sectorHow advanced analytics is impacting the banking sector
How advanced analytics is impacting the banking sector
 
Valuing the data asset
Valuing the data assetValuing the data asset
Valuing the data asset
 
Company Evolution – Evolving Beyond the Traditional Scope Through Data Moneti...
Company Evolution – Evolving Beyond the Traditional Scope Through Data Moneti...Company Evolution – Evolving Beyond the Traditional Scope Through Data Moneti...
Company Evolution – Evolving Beyond the Traditional Scope Through Data Moneti...
 
TLabs - deutsche telekom
TLabs -  deutsche telekomTLabs -  deutsche telekom
TLabs - deutsche telekom
 
Driving Change in Relationship-Driven Businesses | How Citi Uses Data Science...
Driving Change in Relationship-Driven Businesses | How Citi Uses Data Science...Driving Change in Relationship-Driven Businesses | How Citi Uses Data Science...
Driving Change in Relationship-Driven Businesses | How Citi Uses Data Science...
 
Analytics with Descriptive, Predictive and Prescriptive Techniques
Analytics with Descriptive, Predictive and Prescriptive TechniquesAnalytics with Descriptive, Predictive and Prescriptive Techniques
Analytics with Descriptive, Predictive and Prescriptive Techniques
 

Viewers also liked

Big Data Analytics Strategy and Roadmap
Big Data Analytics Strategy and RoadmapBig Data Analytics Strategy and Roadmap
Big Data Analytics Strategy and Roadmap
Srinath Perera
 
#MITXData 2014 - Leveraging Self-Service Business Intelligence to Drive Marke...
#MITXData 2014 - Leveraging Self-Service Business Intelligence to Drive Marke...#MITXData 2014 - Leveraging Self-Service Business Intelligence to Drive Marke...
#MITXData 2014 - Leveraging Self-Service Business Intelligence to Drive Marke...
MITX
 
Data Analytics Strategy
Data Analytics StrategyData Analytics Strategy
Data Analytics Strategy
eHealthCareers
 
Big Data Analytics in light of Financial Industry
Big Data Analytics in light of Financial Industry Big Data Analytics in light of Financial Industry
Big Data Analytics in light of Financial Industry
Capgemini
 

Viewers also liked (16)

Big Data Analytics Strategy and Roadmap
Big Data Analytics Strategy and RoadmapBig Data Analytics Strategy and Roadmap
Big Data Analytics Strategy and Roadmap
 
#MITXData 2014 - Leveraging Self-Service Business Intelligence to Drive Marke...
#MITXData 2014 - Leveraging Self-Service Business Intelligence to Drive Marke...#MITXData 2014 - Leveraging Self-Service Business Intelligence to Drive Marke...
#MITXData 2014 - Leveraging Self-Service Business Intelligence to Drive Marke...
 
8 Steps to Creating a Data Strategy
8 Steps to Creating a Data Strategy8 Steps to Creating a Data Strategy
8 Steps to Creating a Data Strategy
 
Data Analytics Strategy
Data Analytics StrategyData Analytics Strategy
Data Analytics Strategy
 
Big Data Session Presentations
Big Data Session PresentationsBig Data Session Presentations
Big Data Session Presentations
 
AzureDay - Introduction Big Data Analytics.
AzureDay  - Introduction Big Data Analytics.AzureDay  - Introduction Big Data Analytics.
AzureDay - Introduction Big Data Analytics.
 
E xamplecg strategy analytics and operations excellence transformation programs
E xamplecg strategy analytics and operations excellence transformation programsE xamplecg strategy analytics and operations excellence transformation programs
E xamplecg strategy analytics and operations excellence transformation programs
 
Big Data Analytics in light of Financial Industry
Big Data Analytics in light of Financial Industry Big Data Analytics in light of Financial Industry
Big Data Analytics in light of Financial Industry
 
Fight Fraud with Big Data Analytics
Fight Fraud with Big Data AnalyticsFight Fraud with Big Data Analytics
Fight Fraud with Big Data Analytics
 
On Big Data Analytics - opportunities and challenges
On Big Data Analytics - opportunities and challengesOn Big Data Analytics - opportunities and challenges
On Big Data Analytics - opportunities and challenges
 
Enterprise Analytics: Serving Big Data Projects for Healthcare
Enterprise Analytics: Serving Big Data Projects for HealthcareEnterprise Analytics: Serving Big Data Projects for Healthcare
Enterprise Analytics: Serving Big Data Projects for Healthcare
 
Lean LaunchPad: Analytics Workshop
Lean LaunchPad: Analytics WorkshopLean LaunchPad: Analytics Workshop
Lean LaunchPad: Analytics Workshop
 
How to Build a Rock-Solid Analytics and Business Intelligence Strategy
How to Build a Rock-Solid Analytics and Business Intelligence StrategyHow to Build a Rock-Solid Analytics and Business Intelligence Strategy
How to Build a Rock-Solid Analytics and Business Intelligence Strategy
 
Unify Line of Business Data with SAP Digital Boardroom
Unify Line of Business Data with SAP Digital BoardroomUnify Line of Business Data with SAP Digital Boardroom
Unify Line of Business Data with SAP Digital Boardroom
 
Clinical Data Repository vs. A Data Warehouse - Which Do You Need?
Clinical Data Repository vs. A Data Warehouse - Which Do You Need?Clinical Data Repository vs. A Data Warehouse - Which Do You Need?
Clinical Data Repository vs. A Data Warehouse - Which Do You Need?
 
Building A Bi Strategy
Building A Bi StrategyBuilding A Bi Strategy
Building A Bi Strategy
 

Similar to Building Your Big Data Analytics Strategy- Impetus Webinar

DATA MINING TOOL- ORANGE
DATA MINING TOOL- ORANGEDATA MINING TOOL- ORANGE
DATA MINING TOOL- ORANGE
Neeraj Goswami
 
How Technology Intelligence Can Forecast Disruptive Innovations and Fuel Comp...
How Technology Intelligence Can Forecast Disruptive Innovations and Fuel Comp...How Technology Intelligence Can Forecast Disruptive Innovations and Fuel Comp...
How Technology Intelligence Can Forecast Disruptive Innovations and Fuel Comp...
IntelCollab.com
 

Similar to Building Your Big Data Analytics Strategy- Impetus Webinar (20)

Big Data Use Cases for Different Verticals and Adoption Patterns - Impetus We...
Big Data Use Cases for Different Verticals and Adoption Patterns - Impetus We...Big Data Use Cases for Different Verticals and Adoption Patterns - Impetus We...
Big Data Use Cases for Different Verticals and Adoption Patterns - Impetus We...
 
Keepit Course 5: Revision
Keepit Course 5: RevisionKeepit Course 5: Revision
Keepit Course 5: Revision
 
DataLyzer Brochure Gage
DataLyzer Brochure GageDataLyzer Brochure Gage
DataLyzer Brochure Gage
 
Gregs BI Presentation
Gregs BI PresentationGregs BI Presentation
Gregs BI Presentation
 
Collaborative Lifecycle Managmenent - an Introduction
Collaborative Lifecycle Managmenent - an IntroductionCollaborative Lifecycle Managmenent - an Introduction
Collaborative Lifecycle Managmenent - an Introduction
 
Genomics Deployments - How to Get Right with Software Defined Storage
 Genomics Deployments -  How to Get Right with Software Defined Storage Genomics Deployments -  How to Get Right with Software Defined Storage
Genomics Deployments - How to Get Right with Software Defined Storage
 
Creating Scalable Analytics Processes
Creating Scalable Analytics ProcessesCreating Scalable Analytics Processes
Creating Scalable Analytics Processes
 
Enterprise Grade Data Labeling - Design Your Ground Truth to Scale in Produ...
Enterprise Grade Data Labeling - Design Your Ground Truth to Scale in Produ...Enterprise Grade Data Labeling - Design Your Ground Truth to Scale in Produ...
Enterprise Grade Data Labeling - Design Your Ground Truth to Scale in Produ...
 
DATA MINING TOOL- ORANGE
DATA MINING TOOL- ORANGEDATA MINING TOOL- ORANGE
DATA MINING TOOL- ORANGE
 
How Technology Intelligence Can Forecast Disruptive Innovations and Fuel Comp...
How Technology Intelligence Can Forecast Disruptive Innovations and Fuel Comp...How Technology Intelligence Can Forecast Disruptive Innovations and Fuel Comp...
How Technology Intelligence Can Forecast Disruptive Innovations and Fuel Comp...
 
Data-Driven Design for User Experience
Data-Driven Design for User Experience Data-Driven Design for User Experience
Data-Driven Design for User Experience
 
Driving IT Transformation with Agile Analytics
Driving IT Transformation with Agile AnalyticsDriving IT Transformation with Agile Analytics
Driving IT Transformation with Agile Analytics
 
AI in the Enterprise at Scale
AI in the Enterprise at ScaleAI in the Enterprise at Scale
AI in the Enterprise at Scale
 
Valdas Maksimavičius - Reducing Technology Risks through Prototyping
Valdas Maksimavičius - Reducing Technology Risks through PrototypingValdas Maksimavičius - Reducing Technology Risks through Prototyping
Valdas Maksimavičius - Reducing Technology Risks through Prototyping
 
Webinar: Maximizing the ROI of IT by Simplifying Technology Complexity
Webinar: Maximizing the ROI of IT by Simplifying Technology ComplexityWebinar: Maximizing the ROI of IT by Simplifying Technology Complexity
Webinar: Maximizing the ROI of IT by Simplifying Technology Complexity
 
Roland Haeve (Atos): 'Using the Cloud for Big Data Analytics'
Roland Haeve (Atos): 'Using the Cloud for Big Data Analytics'Roland Haeve (Atos): 'Using the Cloud for Big Data Analytics'
Roland Haeve (Atos): 'Using the Cloud for Big Data Analytics'
 
Top 10 Tips for Selecting a Threat and Vulnerability Management Solution
Top 10 Tips for Selecting a Threat and Vulnerability Management SolutionTop 10 Tips for Selecting a Threat and Vulnerability Management Solution
Top 10 Tips for Selecting a Threat and Vulnerability Management Solution
 
ECR Europe Forum '05. Category Management in a limited data environment. Intr...
ECR Europe Forum '05. Category Management in a limited data environment. Intr...ECR Europe Forum '05. Category Management in a limited data environment. Intr...
ECR Europe Forum '05. Category Management in a limited data environment. Intr...
 
ANIn Chennai April 2024 |Beyond Big Bang: Technical Agility in Vintage Produc...
ANIn Chennai April 2024 |Beyond Big Bang: Technical Agility in Vintage Produc...ANIn Chennai April 2024 |Beyond Big Bang: Technical Agility in Vintage Produc...
ANIn Chennai April 2024 |Beyond Big Bang: Technical Agility in Vintage Produc...
 
OpenPOWER/POWER9 AI webinar
OpenPOWER/POWER9 AI webinar OpenPOWER/POWER9 AI webinar
OpenPOWER/POWER9 AI webinar
 

More from Impetus Technologies

Webinar maturity of mobile test automation- approaches and future trends
Webinar  maturity of mobile test automation- approaches and future trendsWebinar  maturity of mobile test automation- approaches and future trends
Webinar maturity of mobile test automation- approaches and future trends
Impetus Technologies
 

More from Impetus Technologies (20)

Data Warehouse Modernization Webinar Series- Critical Trends, Implementation ...
Data Warehouse Modernization Webinar Series- Critical Trends, Implementation ...Data Warehouse Modernization Webinar Series- Critical Trends, Implementation ...
Data Warehouse Modernization Webinar Series- Critical Trends, Implementation ...
 
Future-Proof Your Streaming Analytics Architecture- StreamAnalytix Webinar
Future-Proof Your Streaming Analytics Architecture- StreamAnalytix WebinarFuture-Proof Your Streaming Analytics Architecture- StreamAnalytix Webinar
Future-Proof Your Streaming Analytics Architecture- StreamAnalytix Webinar
 
Building Real-time Streaming Apps in Minutes- Impetus Webinar
Building Real-time Streaming Apps in Minutes- Impetus WebinarBuilding Real-time Streaming Apps in Minutes- Impetus Webinar
Building Real-time Streaming Apps in Minutes- Impetus Webinar
 
Smart Enterprise Big Data Bus for the Modern Responsive Enterprise- StreamAna...
Smart Enterprise Big Data Bus for the Modern Responsive Enterprise- StreamAna...Smart Enterprise Big Data Bus for the Modern Responsive Enterprise- StreamAna...
Smart Enterprise Big Data Bus for the Modern Responsive Enterprise- StreamAna...
 
Impetus White Paper- Handling Data Corruption in Elasticsearch
Impetus White Paper- Handling  Data Corruption  in ElasticsearchImpetus White Paper- Handling  Data Corruption  in Elasticsearch
Impetus White Paper- Handling Data Corruption in Elasticsearch
 
Real-world Applications of Streaming Analytics- StreamAnalytix Webinar
Real-world Applications of Streaming Analytics- StreamAnalytix WebinarReal-world Applications of Streaming Analytics- StreamAnalytix Webinar
Real-world Applications of Streaming Analytics- StreamAnalytix Webinar
 
Real-world Applications of Streaming Analytics- StreamAnalytix Webinar
Real-world Applications of Streaming Analytics- StreamAnalytix WebinarReal-world Applications of Streaming Analytics- StreamAnalytix Webinar
Real-world Applications of Streaming Analytics- StreamAnalytix Webinar
 
Real-time Streaming Analytics for Enterprises based on Apache Storm - Impetus...
Real-time Streaming Analytics for Enterprises based on Apache Storm - Impetus...Real-time Streaming Analytics for Enterprises based on Apache Storm - Impetus...
Real-time Streaming Analytics for Enterprises based on Apache Storm - Impetus...
 
Accelerating Hadoop Solution Lifecycle and Improving ROI- Impetus On-demand W...
Accelerating Hadoop Solution Lifecycle and Improving ROI- Impetus On-demand W...Accelerating Hadoop Solution Lifecycle and Improving ROI- Impetus On-demand W...
Accelerating Hadoop Solution Lifecycle and Improving ROI- Impetus On-demand W...
 
Deep Learning: Evolution of ML from Statistical to Brain-like Computing- Data...
Deep Learning: Evolution of ML from Statistical to Brain-like Computing- Data...Deep Learning: Evolution of ML from Statistical to Brain-like Computing- Data...
Deep Learning: Evolution of ML from Statistical to Brain-like Computing- Data...
 
SPARK USE CASE- Distributed Reinforcement Learning for Electricity Market Bi...
SPARK USE CASE-  Distributed Reinforcement Learning for Electricity Market Bi...SPARK USE CASE-  Distributed Reinforcement Learning for Electricity Market Bi...
SPARK USE CASE- Distributed Reinforcement Learning for Electricity Market Bi...
 
Enterprise Ready Android and Manageability- Impetus Webcast
Enterprise Ready Android and Manageability- Impetus WebcastEnterprise Ready Android and Manageability- Impetus Webcast
Enterprise Ready Android and Manageability- Impetus Webcast
 
Real-time Streaming Analytics: Business Value, Use Cases and Architectural Co...
Real-time Streaming Analytics: Business Value, Use Cases and Architectural Co...Real-time Streaming Analytics: Business Value, Use Cases and Architectural Co...
Real-time Streaming Analytics: Business Value, Use Cases and Architectural Co...
 
Leveraging NoSQL Database Technology to Implement Real-time Data Architecture...
Leveraging NoSQL Database Technology to Implement Real-time Data Architecture...Leveraging NoSQL Database Technology to Implement Real-time Data Architecture...
Leveraging NoSQL Database Technology to Implement Real-time Data Architecture...
 
Maturity of Mobile Test Automation: Approaches and Future Trends- Impetus Web...
Maturity of Mobile Test Automation: Approaches and Future Trends- Impetus Web...Maturity of Mobile Test Automation: Approaches and Future Trends- Impetus Web...
Maturity of Mobile Test Automation: Approaches and Future Trends- Impetus Web...
 
Big Data Analytics with Storm, Spark and GraphLab
Big Data Analytics with Storm, Spark and GraphLabBig Data Analytics with Storm, Spark and GraphLab
Big Data Analytics with Storm, Spark and GraphLab
 
Webinar maturity of mobile test automation- approaches and future trends
Webinar  maturity of mobile test automation- approaches and future trendsWebinar  maturity of mobile test automation- approaches and future trends
Webinar maturity of mobile test automation- approaches and future trends
 
Next generation analytics with yarn, spark and graph lab
Next generation analytics with yarn, spark and graph labNext generation analytics with yarn, spark and graph lab
Next generation analytics with yarn, spark and graph lab
 
The Shared Elephant - Hadoop as a Shared Service for Multiple Departments – I...
The Shared Elephant - Hadoop as a Shared Service for Multiple Departments – I...The Shared Elephant - Hadoop as a Shared Service for Multiple Departments – I...
The Shared Elephant - Hadoop as a Shared Service for Multiple Departments – I...
 
Performance Testing of Big Data Applications - Impetus Webcast
Performance Testing of Big Data Applications - Impetus WebcastPerformance Testing of Big Data Applications - Impetus Webcast
Performance Testing of Big Data Applications - Impetus Webcast
 

Recently uploaded

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Victor Rentea
 

Recently uploaded (20)

How to Check CNIC Information Online with Pakdata cf
How to Check CNIC Information Online with Pakdata cfHow to Check CNIC Information Online with Pakdata cf
How to Check CNIC Information Online with Pakdata cf
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)
 
Design and Development of a Provenance Capture Platform for Data Science
Design and Development of a Provenance Capture Platform for Data ScienceDesign and Development of a Provenance Capture Platform for Data Science
Design and Development of a Provenance Capture Platform for Data Science
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptx
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
 
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
 
Choreo: Empowering the Future of Enterprise Software Engineering
Choreo: Empowering the Future of Enterprise Software EngineeringChoreo: Empowering the Future of Enterprise Software Engineering
Choreo: Empowering the Future of Enterprise Software Engineering
 
Decarbonising Commercial Real Estate: The Role of Operational Performance
Decarbonising Commercial Real Estate: The Role of Operational PerformanceDecarbonising Commercial Real Estate: The Role of Operational Performance
Decarbonising Commercial Real Estate: The Role of Operational Performance
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
JavaScript Usage Statistics 2024 - The Ultimate Guide
JavaScript Usage Statistics 2024 - The Ultimate GuideJavaScript Usage Statistics 2024 - The Ultimate Guide
JavaScript Usage Statistics 2024 - The Ultimate Guide
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
Introduction to use of FHIR Documents in ABDM
Introduction to use of FHIR Documents in ABDMIntroduction to use of FHIR Documents in ABDM
Introduction to use of FHIR Documents in ABDM
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Less Is More: Utilizing Ballerina to Architect a Cloud Data Platform
Less Is More: Utilizing Ballerina to Architect a Cloud Data PlatformLess Is More: Utilizing Ballerina to Architect a Cloud Data Platform
Less Is More: Utilizing Ballerina to Architect a Cloud Data Platform
 
Modernizing Legacy Systems Using Ballerina
Modernizing Legacy Systems Using BallerinaModernizing Legacy Systems Using Ballerina
Modernizing Legacy Systems Using Ballerina
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
JohnPollard-hybrid-app-RailsConf2024.pptx
JohnPollard-hybrid-app-RailsConf2024.pptxJohnPollard-hybrid-app-RailsConf2024.pptx
JohnPollard-hybrid-app-RailsConf2024.pptx
 

Building Your Big Data Analytics Strategy- Impetus Webinar

  • 1. Building Your Big Data Analytics Strategy: Block by Block @impetuscalling Recorded version available at  1 http://www.impetus.com/webinar_registration?event=archived&eid=53
  • 2. Outline Building a Big Data Strategy Big Data & 3V’s 3V s 3 V’s model Big Data Analytics Lifecycle Strategy Selection Technology Selection Hadoop E H d Ecosystem t Alternative Putting it Together g g Case Studies and Applications Q&A’s Recorded version available at  Impetus Proprietary http://www.impetus.com/webinar_registration?event=archived&eid=53 2
  • 3. Building a Big Data Strategy Gather Requirements What needs to be done? Requirements Objectives Choose Candidate Strategy Options Candidate Strategy Patterns & Best Practices Selection Choose Tools and Technology Tools & Technology Selection Implementation I l i Operational Readiness Implementation Recorded version available at  Impetus Proprietary http://www.impetus.com/webinar_registration?event=archived&eid=53 3
  • 4. Big Data & 3V’s Model What is Big Data? Define by size or volume or by ‘breakdown’ 3V’s model Variety of Data Volume of D t V l f Data Velocity of Data Recorded version available at  Impetus Proprietary http://www.impetus.com/webinar_registration?event=archived&eid=53 4
  • 5. Big Data Analytics Life Cycle Ingestion Visualization Creation Analysis Recorded version available at  Impetus Proprietary http://www.impetus.com/webinar_registration?event=archived&eid=53 5
  • 6. BIG Data Analytics Life Cycle: Concerns Ingestion Visualization • Storage • Tools & • Elasticity Technologies • Integrations • Testing • Channels • Monitoring • Tools & • Pre Built • In Memory • Compression p Technologies Support Solutions • Standardization • Standardization Creation Analysis Recorded version available at  Impetus Proprietary http://www.impetus.com/webinar_registration?event=archived&eid=53 6
  • 7. BIG Data Analytics Life Cycle & 3V’s Simple and potent tool to analyze strategy requirements Answer simple questions of how much what type and at what rate much, Applicable to each phase Using matrix to select suitable strategy g gy Dictates the potential choice of solutions, tools & technologies Recorded version available at  Impetus Proprietary http://www.impetus.com/webinar_registration?event=archived&eid=53 7
  • 8. BIG Data Analytics Life Cycle & 3V’s How M h? H Much? What T Wh t Type? ? What R t ? Wh t Rate? Creation • Storage • Elasticity • Monitoring • Compression Ingestion Analysis Visualization Recorded version available at  Impetus Proprietary http://www.impetus.com/webinar_registration?event=archived&eid=53 8
  • 9. Strategy Selection Recorded version available at  Impetus Proprietary http://www.impetus.com/webinar_registration?event=archived&eid=53 9
  • 10. Big Data Analytics Strategy Creation Ingestion Analysis Visualization Recorded version available at  Impetus Proprietary http://www.impetus.com/webinar_registration?event=archived&eid=53 10
  • 11. Technology Selection Recorded version available at  Impetus Proprietary http://www.impetus.com/webinar_registration?event=archived&eid=53 11
  • 12. The Hadoop Ecosystem Recorded version available at  Impetus Proprietary http://www.impetus.com/webinar_registration?event=archived&eid=53 12
  • 13. Alternate/Emerging Options Making stuff Faster Pervasive Datarush Hstreaming Datarush, Cloud Map Reduce HPCC, Datastax Brisk, Platform Computing MARS, GPMR Major MPPs-in-database MR-Oracle, Aster etc Hadapt NOSQL Cassandra, MongoDB, Hbase Riak Redis Recorded version available at  Impetus Proprietary http://www.impetus.com/webinar_registration?event=archived&eid=53 13
  • 14. Alternate/Emerging Options Graph Type DB’s Neo4j HyperGraphDB InfiniteGraph Pregel Trinity Faster SQL DB’s DB s VoltDB, Clustrix Hardware + Software Solutions Exadata , Parstream Virtualized Options of Hardware + Software Solutions such as e ou d Xeround Recorded version available at  Impetus Proprietary http://www.impetus.com/webinar_registration?event=archived&eid=53 14
  • 15. Putting it Together Recorded version available at  Impetus Proprietary http://www.impetus.com/webinar_registration?event=archived&eid=53 15
  • 16. Indirect Analytics over Hadoop Recorded version available at  Impetus Proprietary http://www.impetus.com/webinar_registration?event=archived&eid=53 16
  • 17. Direct Analytics over Hadoop Recorded version available at  Impetus Proprietary http://www.impetus.com/webinar_registration?event=archived&eid=53 17
  • 18. Analytics over Hadoop with MPP DW Recorded version available at  Impetus Proprietary http://www.impetus.com/webinar_registration?event=archived&eid=53 18
  • 19. Case Studies Recorded version available at  Impetus Proprietary http://www.impetus.com/webinar_registration?event=archived&eid=53 19
  • 20. Social Media Analytics Problem Statement Analytics on huge data sets populated from live streaming data Simplifying services, cost reduction, proactive analysis on customer’s feedback Challenges Live data streaming from social media websites Clustering Learn typical comments, demands, questions Value: Helps identify response / behavior anomalies Classification Learn to identify known patterns automatically Value: useful in filtering, pre-emptive addressing, gaining customer confidence Recorded version available at  Impetus Proprietary http://www.impetus.com/webinar_registration?event=archived&eid=53 20
  • 21. Social Media Analytics (cont..) Approach Prepare matrix to capture How Much? What Type? What Rate Much?, Type?, against each phase Use big data solution strategy covering all concerns of big data analytics lifecycle Solution Architected a flexible and scalable solution with near real time streaming of social media d t on d il /h l scheduled j b t i f i l di data daily/hourly h d l d jobs Built a solution based on Hadoop, HBase, Hive and Mahout Recorded version available at  Impetus Proprietary http://www.impetus.com/webinar_registration?event=archived&eid=53 21
  • 22. Solution Overview Creation Ingestion Analysis Visualization Recorded version available at  Impetus Proprietary http://www.impetus.com/webinar_registration?event=archived&eid=53 22
  • 23. Summing Up Creating a matrix to build suitable strategy Enables creation of a platform or a solution to manage 3Vs of data Solutions, tools & technologies Hadoop based Big Data Analytics is a scalable and cost effective option Strategy selection Recorded version available at  Impetus Proprietary http://www.impetus.com/webinar_registration?event=archived&eid=53 23
  • 24. About Us Strategic partners for software product engineering and R&D Thought Tho ght leaders in cutting-edge technologies c tting edge Mature processes and practices that are methodical, yet flexible Diverse domain expertise Our O services in Big Data and Analytics i i Bi D t d A l ti Expert consulting Proof-of-concept & Implementation Support services Recorded version available at  http://www.impetus.com/webinar_registration?event=archived&eid=53
  • 25. Questions Please send in your questions using the chat panel Recorded version available at  http://www.impetus.com/webinar_registration?event=archived&eid=53 25
  • 26. Thank you y For more information, write to us at inquiry@impetus.com @impetuscalling Recorded version available at  http://www.impetus.com/webinar_registration?event=archived&eid=53