SlideShare a Scribd company logo
1 of 15
The entire contents of this document are subject to copyright with all rights reserved. All copyrightable text and graphics, the selection, arrangement and presentation
of all information and the overall design of the document are the sole and exclusive property of Virtusa.
Copyright © 2010 Virtusa Corporation. All rights reserved
Click to edit Master title style
2000 West Park Drive
Westborough MA 01581 USA
Phone: 508 389 7300 Fax: 508 366 9901
Managing Growing
Transaction Volumes Using
Hadoop
Arvind Purushothaman – Director, IM Practice
2 © Virtusa Corporation ● Confidential
Agenda
• Context Setting
• CIO’s mandate
•
• Coexistence of architectures
• Evaluation
• Summary
3 © Virtusa Corporation ● Confidential
During this
presentation...
In the Millennial World 15 minutes is a long time……..
1.8Mn tweets
will be generated
Apple will receive
about 700,000
App downloads
Brands & Organisations
will receive around
500,000 likes on
Facebook
Over 14Mn status
updates on FACEBOOK
54,000 photos
will be shared on
INSTAGRAM
Over 13Mn pieces of
new FACEBOOK content
will be created
Over 3Bn email
messages will be
sent
Google will receive over
30Mn Search Queries
Over 8,000 new
websites will be created
Sources:
Forrester Research, Hubspot
Centre for Social Media, The
Social Skinny, AlTwitter
4 © Virtusa Corporation ● Confidential
…Consumers will
spend over $5Mn
online shopping
During the course of this presentation……..
44% of companies who
tweet acquired new
customers
Almost 8 new people
come onto the internet
every second
57% of Companies
who blog acquired new
customers
61% of global internet
users research products
online
9/10 mobile
searches lead to
action…
…Over half
lead to purchaseSources:
Forrester Research, Hubspot
Centre for Social Media, The
Social Skinny, AlTwitter
5 © Virtusa Corporation ● Confidential
BIG DATA
BIG NOISE
BIG OPPORTUNITY
Technology enables you to make sense out of
ALL Available Data
6 © Virtusa Corporation ● Confidential
How the Industry defines Big Data ?
Gartner Defines
Big Data is high-volume, high-velocity and
high-variety information assets that
demand cost effective, innovative forms
of information processing for enhanced
insight and decision making
Forrester Defines
The frontier of a firm’s ability to store,
process and access (SPA) all the data it
needs to operate effectively, make
decisions, reduce risks, and serve
customers.
IBM: “….Big data is more than simply a
matter of size; it is an opportunity to find
insights in new and emerging types of
data and content, to make your business
more agile, and to answer questions that
were previously considered beyond your
reach..”
Oracle: “…. Big Data refers to datasets that
grow so large that it is difficult to
capture, store, manage, share, analyze
and visualize with the
typical database software tools…”
Website
Network Switches
Social Media
RFIDTransactional /
operational systems
7 © Virtusa Corporation ● Confidential
CIO’s manifesto
Support business
growth through
innovation
Lower costs
Both are not optional – you need to lower costs
and innovate at the same time
In the Information Management world, this means
exponentially more data volumes, different types of data
More investments in data storage, computing power, licenses
What is the way forward?
8 © Virtusa Corporation ● Confidential
Relational/Analytical
Relational/Analytical
Financial
Data
Marketing
Data
Data Warehouse
(Relational)
Data Mart
Data Mart
Sales Data
Data Warehouse
Access
Parametric & Ad
Hoc reporting
OLAP
Dashboards
Exploratory
Visualization
Direct Data AccessETL
Data Points Data stores Access to BI Platform Insight Generation
Hadoop As Data Transformation Platform
Transactions
Logs
Big Data Cluster
(Hadoop)
Parsed data
Analytic
data sets
Raw Data Master Data
Real Time Store
(No SQL)
Big Data Access
BusinessIntelligencePlatform
Statistical
Analysis
Machine
Learning
OpenSourceETL
StreamingETL
9 © Virtusa Corporation ● Confidential
Hybrid Architecture For A Telecom Client That Leverages HDFS,
HBase, and Oracle 11g
Integration &
Infrastructure
Platform
SDEDS (APP10765)
Bill & Payments
Platform
ONM (APP10487)
HDFS
HADOOP CLUSTER
Raw Call Data
CDR Store
MapReduce
 ICS
 OCS
 Answered
 Unanswered
 Diverted
 Others
REST
GATEWAY
UI Reports
UI Reports
UI Reports
ETL
Call Summary Data
Oracle DB
 Month
 Date
 Hour
Level
10 © Virtusa Corporation ● Confidential
Technology Components Of Hadoop
Core
• HDFS + MapReduce
Data Movement
• Relational Database – Sqoop
• Real-time – Flume
NoSQL
•HBase
Scheduling
• Oozie
Analytics
• Cloudera Impala, Tableau with Hive
Machine Learning
•Mahout
11 © Virtusa Corporation ● Confidential
3W’s – What, Where and When
Traditional DW data
Semi and Un-structured dataHistorical , Infrequently AccessedLegal & Regulatory
Insights
Post shelf life
Post processing – DW
85% tables and 50%
columns unused*
* Source: TDWI
12 © Virtusa Corporation ● Confidential
Decision Points
Source: Dr. Amr Awadallah and Dan Graham, “Hadoop
and the Data Warehouse: When to Use Which”, copublished
by Cloudera, Inc. and Teradata Corporation.
*HBase.
13 © Virtusa Corporation ● Confidential
Cost Considerations
ETL Hadoop
Hardware Expensive Low
Software Expensive Low
Development Medium Medium
Maintenance High Low
Investment High upfront Invest as needed
14 © Virtusa Corporation ● Confidential
How Can You Get Started
• Hadoop as an Enterprise Data Management platform is here to
stay
• Get started – either moving “unused data” or bringing in
additional sources and types of data
• In addition to “back-end” type functions, it provides Analytical
capabilities in its own right
• To start small, leverage Hadoop on the Cloud
• Co-Existence is going to be the key for successful adoption
Build a good use case before you start, build
a POC, Evangelize It
US - Boston, New York UK - Windsor, London India – Hyderabad, Chennai Sri Lanka - Colombo
www.virtusa.com
© 2010 All rights reserved. Virtusa and all other related logos are either registered trademarks or trademarks of Virtusa Corporation in the United States, the European Union, and/or India. All
other company and service names are the property of their respective holders and may be registered trademarks or trademarks in the United States and/or other countries.

More Related Content

What's hot

Enabling digital business with governed data lake
Enabling digital business with governed data lakeEnabling digital business with governed data lake
Enabling digital business with governed data lakeKaran Sachdeva
 
Data Science in Enterprise
Data Science in EnterpriseData Science in Enterprise
Data Science in EnterpriseJosh Yeh
 
Unlocking data science in the enterprise - with Oracle and Cloudera
Unlocking data science in the enterprise - with Oracle and ClouderaUnlocking data science in the enterprise - with Oracle and Cloudera
Unlocking data science in the enterprise - with Oracle and ClouderaCloudera, Inc.
 
Becoming Data-Driven Through Cultural Change
Becoming Data-Driven Through Cultural ChangeBecoming Data-Driven Through Cultural Change
Becoming Data-Driven Through Cultural ChangeCloudera, Inc.
 
Making Big Data Easy for Everyone
Making Big Data Easy for EveryoneMaking Big Data Easy for Everyone
Making Big Data Easy for EveryoneCaserta
 
Webinar - Big Data: Power to the User
Webinar - Big Data: Power to the User Webinar - Big Data: Power to the User
Webinar - Big Data: Power to the User Datameer
 
EMC World 2014 Breakout: Move to the Business Data Lake – Not as Hard as It S...
EMC World 2014 Breakout: Move to the Business Data Lake – Not as Hard as It S...EMC World 2014 Breakout: Move to the Business Data Lake – Not as Hard as It S...
EMC World 2014 Breakout: Move to the Business Data Lake – Not as Hard as It S...Capgemini
 
Modernizing Architecture for a Complete Data Strategy
Modernizing Architecture for a Complete Data StrategyModernizing Architecture for a Complete Data Strategy
Modernizing Architecture for a Complete Data StrategyCloudera, Inc.
 
Recipes for Unlocking Value from Big Data
Recipes for Unlocking Value from Big DataRecipes for Unlocking Value from Big Data
Recipes for Unlocking Value from Big DataFadi Yousuf
 
The Five Markers on Your Big Data Journey
The Five Markers on Your Big Data JourneyThe Five Markers on Your Big Data Journey
The Five Markers on Your Big Data JourneyCloudera, Inc.
 
Turning Petabytes of Data into Profit with Hadoop for the World’s Biggest Ret...
Turning Petabytes of Data into Profit with Hadoop for the World’s Biggest Ret...Turning Petabytes of Data into Profit with Hadoop for the World’s Biggest Ret...
Turning Petabytes of Data into Profit with Hadoop for the World’s Biggest Ret...Cloudera, Inc.
 
Enterprise Search: Addressing the First Problem of Big Data & Analytics - Sta...
Enterprise Search: Addressing the First Problem of Big Data & Analytics - Sta...Enterprise Search: Addressing the First Problem of Big Data & Analytics - Sta...
Enterprise Search: Addressing the First Problem of Big Data & Analytics - Sta...StampedeCon
 
Case study: Hadoop as ELT for Leading US Retailer - Happiest Minds
Case study: Hadoop as ELT for Leading US Retailer - Happiest MindsCase study: Hadoop as ELT for Leading US Retailer - Happiest Minds
Case study: Hadoop as ELT for Leading US Retailer - Happiest MindsHappiest Minds Technologies
 
AURIN Data Hubs Supporting Smarter Cities - Phil Delaney, Locate14
AURIN Data Hubs Supporting Smarter Cities - Phil Delaney, Locate14AURIN Data Hubs Supporting Smarter Cities - Phil Delaney, Locate14
AURIN Data Hubs Supporting Smarter Cities - Phil Delaney, Locate14Phillip Delaney
 
Usama Fayyad talk at IIT Madras on March 27, 2015: BigData, AllData, Old Dat...
Usama Fayyad talk at IIT Madras on March 27, 2015:  BigData, AllData, Old Dat...Usama Fayyad talk at IIT Madras on March 27, 2015:  BigData, AllData, Old Dat...
Usama Fayyad talk at IIT Madras on March 27, 2015: BigData, AllData, Old Dat...Usama Fayyad
 
From Insight to Action: Using Data Science to Transform Your Organization
From Insight to Action: Using Data Science to Transform Your OrganizationFrom Insight to Action: Using Data Science to Transform Your Organization
From Insight to Action: Using Data Science to Transform Your OrganizationCloudera, Inc.
 
Analyzing Unstructured Data in Hadoop Webinar
Analyzing Unstructured Data in Hadoop WebinarAnalyzing Unstructured Data in Hadoop Webinar
Analyzing Unstructured Data in Hadoop WebinarDatameer
 
Setting Up the Data Lake
Setting Up the Data LakeSetting Up the Data Lake
Setting Up the Data LakeCaserta
 
Traditional BI vs. Business Data Lake – A Comparison
Traditional BI vs. Business Data Lake – A ComparisonTraditional BI vs. Business Data Lake – A Comparison
Traditional BI vs. Business Data Lake – A ComparisonCapgemini
 
The Big Picture: Real-time Data is Defining Intelligent Offers
The Big Picture: Real-time Data is Defining Intelligent OffersThe Big Picture: Real-time Data is Defining Intelligent Offers
The Big Picture: Real-time Data is Defining Intelligent OffersCloudera, Inc.
 

What's hot (20)

Enabling digital business with governed data lake
Enabling digital business with governed data lakeEnabling digital business with governed data lake
Enabling digital business with governed data lake
 
Data Science in Enterprise
Data Science in EnterpriseData Science in Enterprise
Data Science in Enterprise
 
Unlocking data science in the enterprise - with Oracle and Cloudera
Unlocking data science in the enterprise - with Oracle and ClouderaUnlocking data science in the enterprise - with Oracle and Cloudera
Unlocking data science in the enterprise - with Oracle and Cloudera
 
Becoming Data-Driven Through Cultural Change
Becoming Data-Driven Through Cultural ChangeBecoming Data-Driven Through Cultural Change
Becoming Data-Driven Through Cultural Change
 
Making Big Data Easy for Everyone
Making Big Data Easy for EveryoneMaking Big Data Easy for Everyone
Making Big Data Easy for Everyone
 
Webinar - Big Data: Power to the User
Webinar - Big Data: Power to the User Webinar - Big Data: Power to the User
Webinar - Big Data: Power to the User
 
EMC World 2014 Breakout: Move to the Business Data Lake – Not as Hard as It S...
EMC World 2014 Breakout: Move to the Business Data Lake – Not as Hard as It S...EMC World 2014 Breakout: Move to the Business Data Lake – Not as Hard as It S...
EMC World 2014 Breakout: Move to the Business Data Lake – Not as Hard as It S...
 
Modernizing Architecture for a Complete Data Strategy
Modernizing Architecture for a Complete Data StrategyModernizing Architecture for a Complete Data Strategy
Modernizing Architecture for a Complete Data Strategy
 
Recipes for Unlocking Value from Big Data
Recipes for Unlocking Value from Big DataRecipes for Unlocking Value from Big Data
Recipes for Unlocking Value from Big Data
 
The Five Markers on Your Big Data Journey
The Five Markers on Your Big Data JourneyThe Five Markers on Your Big Data Journey
The Five Markers on Your Big Data Journey
 
Turning Petabytes of Data into Profit with Hadoop for the World’s Biggest Ret...
Turning Petabytes of Data into Profit with Hadoop for the World’s Biggest Ret...Turning Petabytes of Data into Profit with Hadoop for the World’s Biggest Ret...
Turning Petabytes of Data into Profit with Hadoop for the World’s Biggest Ret...
 
Enterprise Search: Addressing the First Problem of Big Data & Analytics - Sta...
Enterprise Search: Addressing the First Problem of Big Data & Analytics - Sta...Enterprise Search: Addressing the First Problem of Big Data & Analytics - Sta...
Enterprise Search: Addressing the First Problem of Big Data & Analytics - Sta...
 
Case study: Hadoop as ELT for Leading US Retailer - Happiest Minds
Case study: Hadoop as ELT for Leading US Retailer - Happiest MindsCase study: Hadoop as ELT for Leading US Retailer - Happiest Minds
Case study: Hadoop as ELT for Leading US Retailer - Happiest Minds
 
AURIN Data Hubs Supporting Smarter Cities - Phil Delaney, Locate14
AURIN Data Hubs Supporting Smarter Cities - Phil Delaney, Locate14AURIN Data Hubs Supporting Smarter Cities - Phil Delaney, Locate14
AURIN Data Hubs Supporting Smarter Cities - Phil Delaney, Locate14
 
Usama Fayyad talk at IIT Madras on March 27, 2015: BigData, AllData, Old Dat...
Usama Fayyad talk at IIT Madras on March 27, 2015:  BigData, AllData, Old Dat...Usama Fayyad talk at IIT Madras on March 27, 2015:  BigData, AllData, Old Dat...
Usama Fayyad talk at IIT Madras on March 27, 2015: BigData, AllData, Old Dat...
 
From Insight to Action: Using Data Science to Transform Your Organization
From Insight to Action: Using Data Science to Transform Your OrganizationFrom Insight to Action: Using Data Science to Transform Your Organization
From Insight to Action: Using Data Science to Transform Your Organization
 
Analyzing Unstructured Data in Hadoop Webinar
Analyzing Unstructured Data in Hadoop WebinarAnalyzing Unstructured Data in Hadoop Webinar
Analyzing Unstructured Data in Hadoop Webinar
 
Setting Up the Data Lake
Setting Up the Data LakeSetting Up the Data Lake
Setting Up the Data Lake
 
Traditional BI vs. Business Data Lake – A Comparison
Traditional BI vs. Business Data Lake – A ComparisonTraditional BI vs. Business Data Lake – A Comparison
Traditional BI vs. Business Data Lake – A Comparison
 
The Big Picture: Real-time Data is Defining Intelligent Offers
The Big Picture: Real-time Data is Defining Intelligent OffersThe Big Picture: Real-time Data is Defining Intelligent Offers
The Big Picture: Real-time Data is Defining Intelligent Offers
 

Viewers also liked

Enterprise Reporting Journey at Merial
Enterprise Reporting Journey at MerialEnterprise Reporting Journey at Merial
Enterprise Reporting Journey at MerialArvind Purushothaman
 
Uncover hidden treasures in your business data
Uncover hidden treasures in your business dataUncover hidden treasures in your business data
Uncover hidden treasures in your business dataesankara
 
Bi Portal Implementation Service Offering
Bi Portal Implementation Service OfferingBi Portal Implementation Service Offering
Bi Portal Implementation Service Offeringguesta853
 
Scott Gurian - Cross-platform Collaborations on Investigative and Enterprise ...
Scott Gurian - Cross-platform Collaborations on Investigative and Enterprise ...Scott Gurian - Cross-platform Collaborations on Investigative and Enterprise ...
Scott Gurian - Cross-platform Collaborations on Investigative and Enterprise ...Reynolds Journalism Institute (RJI)
 
IBM Cognos 10 Under the Hood
IBM Cognos 10 Under the HoodIBM Cognos 10 Under the Hood
IBM Cognos 10 Under the HoodSenturus
 
OBIA HR Analytics: Transform complex data into business decisions
OBIA HR Analytics: Transform complex data into business decisionsOBIA HR Analytics: Transform complex data into business decisions
OBIA HR Analytics: Transform complex data into business decisionsArvind Purushothaman
 
Agile Business Intelligence - course notes
Agile Business Intelligence - course notesAgile Business Intelligence - course notes
Agile Business Intelligence - course notesEvan Leybourn
 
Big Data in Hong Kong -- Dr. Toa Charm
Big Data in Hong Kong -- Dr. Toa CharmBig Data in Hong Kong -- Dr. Toa Charm
Big Data in Hong Kong -- Dr. Toa Charmorcsab
 
Agile Business Intelligence
Agile Business IntelligenceAgile Business Intelligence
Agile Business IntelligenceEvan Leybourn
 
Case Studies: Enterprise BI vs Self-Service Analytics Tools: Real Life Consid...
Case Studies: Enterprise BI vs Self-Service Analytics Tools: Real Life Consid...Case Studies: Enterprise BI vs Self-Service Analytics Tools: Real Life Consid...
Case Studies: Enterprise BI vs Self-Service Analytics Tools: Real Life Consid...Senturus
 

Viewers also liked (13)

Enterprise Reporting Journey at Merial
Enterprise Reporting Journey at MerialEnterprise Reporting Journey at Merial
Enterprise Reporting Journey at Merial
 
Who moved my BI?
Who moved my BI?Who moved my BI?
Who moved my BI?
 
Cognos Presentation Gartner BI
Cognos Presentation Gartner BICognos Presentation Gartner BI
Cognos Presentation Gartner BI
 
Uncover hidden treasures in your business data
Uncover hidden treasures in your business dataUncover hidden treasures in your business data
Uncover hidden treasures in your business data
 
Bi Portal Implementation Service Offering
Bi Portal Implementation Service OfferingBi Portal Implementation Service Offering
Bi Portal Implementation Service Offering
 
Scott Gurian - Cross-platform Collaborations on Investigative and Enterprise ...
Scott Gurian - Cross-platform Collaborations on Investigative and Enterprise ...Scott Gurian - Cross-platform Collaborations on Investigative and Enterprise ...
Scott Gurian - Cross-platform Collaborations on Investigative and Enterprise ...
 
Enterprise Reporting
Enterprise ReportingEnterprise Reporting
Enterprise Reporting
 
IBM Cognos 10 Under the Hood
IBM Cognos 10 Under the HoodIBM Cognos 10 Under the Hood
IBM Cognos 10 Under the Hood
 
OBIA HR Analytics: Transform complex data into business decisions
OBIA HR Analytics: Transform complex data into business decisionsOBIA HR Analytics: Transform complex data into business decisions
OBIA HR Analytics: Transform complex data into business decisions
 
Agile Business Intelligence - course notes
Agile Business Intelligence - course notesAgile Business Intelligence - course notes
Agile Business Intelligence - course notes
 
Big Data in Hong Kong -- Dr. Toa Charm
Big Data in Hong Kong -- Dr. Toa CharmBig Data in Hong Kong -- Dr. Toa Charm
Big Data in Hong Kong -- Dr. Toa Charm
 
Agile Business Intelligence
Agile Business IntelligenceAgile Business Intelligence
Agile Business Intelligence
 
Case Studies: Enterprise BI vs Self-Service Analytics Tools: Real Life Consid...
Case Studies: Enterprise BI vs Self-Service Analytics Tools: Real Life Consid...Case Studies: Enterprise BI vs Self-Service Analytics Tools: Real Life Consid...
Case Studies: Enterprise BI vs Self-Service Analytics Tools: Real Life Consid...
 

Similar to Managing Growing Transaction Volumes Using Hadoop

The Data Axioms lecture-overview-big data-usama-9-2015
The Data Axioms lecture-overview-big data-usama-9-2015The Data Axioms lecture-overview-big data-usama-9-2015
The Data Axioms lecture-overview-big data-usama-9-2015CMR WORLD TECH
 
BAR360 open data platform presentation at DAMA, Sydney
BAR360 open data platform presentation at DAMA, SydneyBAR360 open data platform presentation at DAMA, Sydney
BAR360 open data platform presentation at DAMA, SydneySai Paravastu
 
Come fare business con i big data in concreto
Come fare business con i big data in concretoCome fare business con i big data in concreto
Come fare business con i big data in concretoHP Enterprise Italia
 
Impala Unlocks Interactive BI on Hadoop
Impala Unlocks Interactive BI on HadoopImpala Unlocks Interactive BI on Hadoop
Impala Unlocks Interactive BI on HadoopCloudera, Inc.
 
Cloudera - Mike Olson - Hadoop World 2010
Cloudera - Mike Olson - Hadoop World 2010Cloudera - Mike Olson - Hadoop World 2010
Cloudera - Mike Olson - Hadoop World 2010Cloudera, Inc.
 
Keynote - Cloudera - Mike Olson - Hadoop World 2010
Keynote - Cloudera - Mike Olson - Hadoop World 2010Keynote - Cloudera - Mike Olson - Hadoop World 2010
Keynote - Cloudera - Mike Olson - Hadoop World 2010Cloudera, Inc.
 
Architecting for Big Data: Trends, Tips, and Deployment Options
Architecting for Big Data: Trends, Tips, and Deployment OptionsArchitecting for Big Data: Trends, Tips, and Deployment Options
Architecting for Big Data: Trends, Tips, and Deployment OptionsCaserta
 
The Maturity Model: Taking the Growing Pains Out of Hadoop
The Maturity Model: Taking the Growing Pains Out of HadoopThe Maturity Model: Taking the Growing Pains Out of Hadoop
The Maturity Model: Taking the Growing Pains Out of HadoopInside Analysis
 
Data Mesh in Azure using Cloud Scale Analytics (WAF)
Data Mesh in Azure using Cloud Scale Analytics (WAF)Data Mesh in Azure using Cloud Scale Analytics (WAF)
Data Mesh in Azure using Cloud Scale Analytics (WAF)Nathan Bijnens
 
MongoDB IoT City Tour LONDON: Hadoop and the future of data management. By, M...
MongoDB IoT City Tour LONDON: Hadoop and the future of data management. By, M...MongoDB IoT City Tour LONDON: Hadoop and the future of data management. By, M...
MongoDB IoT City Tour LONDON: Hadoop and the future of data management. By, M...MongoDB
 
Put Alternative Data to Use in Capital Markets

Put Alternative Data to Use in Capital Markets
Put Alternative Data to Use in Capital Markets

Put Alternative Data to Use in Capital Markets
Cloudera, Inc.
 
Is your big data journey stalling? Take the Leap with Capgemini and Cloudera
Is your big data journey stalling? Take the Leap with Capgemini and ClouderaIs your big data journey stalling? Take the Leap with Capgemini and Cloudera
Is your big data journey stalling? Take the Leap with Capgemini and ClouderaCloudera, Inc.
 
Hortonworks and HP Vertica Webinar
Hortonworks and HP Vertica WebinarHortonworks and HP Vertica Webinar
Hortonworks and HP Vertica WebinarHortonworks
 
Keyrus US Information
Keyrus US InformationKeyrus US Information
Keyrus US InformationJulian Tong
 
Take Action: The New Reality of Data-Driven Business
Take Action: The New Reality of Data-Driven BusinessTake Action: The New Reality of Data-Driven Business
Take Action: The New Reality of Data-Driven BusinessInside Analysis
 
How to Consume Your Data for AI
How to Consume Your Data for AIHow to Consume Your Data for AI
How to Consume Your Data for AIDATAVERSITY
 
How to Quickly and Easily Draw Value from Big Data Sources_Q3 symposia(Moa)
How to Quickly and Easily Draw Value  from Big Data Sources_Q3 symposia(Moa)How to Quickly and Easily Draw Value  from Big Data Sources_Q3 symposia(Moa)
How to Quickly and Easily Draw Value from Big Data Sources_Q3 symposia(Moa)Moacyr Passador
 

Similar to Managing Growing Transaction Volumes Using Hadoop (20)

The Data Axioms lecture-overview-big data-usama-9-2015
The Data Axioms lecture-overview-big data-usama-9-2015The Data Axioms lecture-overview-big data-usama-9-2015
The Data Axioms lecture-overview-big data-usama-9-2015
 
BAR360 open data platform presentation at DAMA, Sydney
BAR360 open data platform presentation at DAMA, SydneyBAR360 open data platform presentation at DAMA, Sydney
BAR360 open data platform presentation at DAMA, Sydney
 
Come fare business con i big data in concreto
Come fare business con i big data in concretoCome fare business con i big data in concreto
Come fare business con i big data in concreto
 
Impala Unlocks Interactive BI on Hadoop
Impala Unlocks Interactive BI on HadoopImpala Unlocks Interactive BI on Hadoop
Impala Unlocks Interactive BI on Hadoop
 
Cloudera - Mike Olson - Hadoop World 2010
Cloudera - Mike Olson - Hadoop World 2010Cloudera - Mike Olson - Hadoop World 2010
Cloudera - Mike Olson - Hadoop World 2010
 
Keynote - Cloudera - Mike Olson - Hadoop World 2010
Keynote - Cloudera - Mike Olson - Hadoop World 2010Keynote - Cloudera - Mike Olson - Hadoop World 2010
Keynote - Cloudera - Mike Olson - Hadoop World 2010
 
Architecting for Big Data: Trends, Tips, and Deployment Options
Architecting for Big Data: Trends, Tips, and Deployment OptionsArchitecting for Big Data: Trends, Tips, and Deployment Options
Architecting for Big Data: Trends, Tips, and Deployment Options
 
The Maturity Model: Taking the Growing Pains Out of Hadoop
The Maturity Model: Taking the Growing Pains Out of HadoopThe Maturity Model: Taking the Growing Pains Out of Hadoop
The Maturity Model: Taking the Growing Pains Out of Hadoop
 
Data Mesh in Azure using Cloud Scale Analytics (WAF)
Data Mesh in Azure using Cloud Scale Analytics (WAF)Data Mesh in Azure using Cloud Scale Analytics (WAF)
Data Mesh in Azure using Cloud Scale Analytics (WAF)
 
MongoDB IoT City Tour LONDON: Hadoop and the future of data management. By, M...
MongoDB IoT City Tour LONDON: Hadoop and the future of data management. By, M...MongoDB IoT City Tour LONDON: Hadoop and the future of data management. By, M...
MongoDB IoT City Tour LONDON: Hadoop and the future of data management. By, M...
 
Put Alternative Data to Use in Capital Markets

Put Alternative Data to Use in Capital Markets
Put Alternative Data to Use in Capital Markets

Put Alternative Data to Use in Capital Markets

 
Is your big data journey stalling? Take the Leap with Capgemini and Cloudera
Is your big data journey stalling? Take the Leap with Capgemini and ClouderaIs your big data journey stalling? Take the Leap with Capgemini and Cloudera
Is your big data journey stalling? Take the Leap with Capgemini and Cloudera
 
Hortonworks and HP Vertica Webinar
Hortonworks and HP Vertica WebinarHortonworks and HP Vertica Webinar
Hortonworks and HP Vertica Webinar
 
Keyrus US Information
Keyrus US InformationKeyrus US Information
Keyrus US Information
 
Keyrus US Information
Keyrus US InformationKeyrus US Information
Keyrus US Information
 
Ask bigger questions
Ask bigger questionsAsk bigger questions
Ask bigger questions
 
Take Action: The New Reality of Data-Driven Business
Take Action: The New Reality of Data-Driven BusinessTake Action: The New Reality of Data-Driven Business
Take Action: The New Reality of Data-Driven Business
 
How to Consume Your Data for AI
How to Consume Your Data for AIHow to Consume Your Data for AI
How to Consume Your Data for AI
 
Big Data in Azure
Big Data in AzureBig Data in Azure
Big Data in Azure
 
How to Quickly and Easily Draw Value from Big Data Sources_Q3 symposia(Moa)
How to Quickly and Easily Draw Value  from Big Data Sources_Q3 symposia(Moa)How to Quickly and Easily Draw Value  from Big Data Sources_Q3 symposia(Moa)
How to Quickly and Easily Draw Value from Big Data Sources_Q3 symposia(Moa)
 

Recently uploaded

Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfKantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfSocial Samosa
 
Data Science Jobs and Salaries Analysis.pptx
Data Science Jobs and Salaries Analysis.pptxData Science Jobs and Salaries Analysis.pptx
Data Science Jobs and Salaries Analysis.pptxFurkanTasci3
 
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...Suhani Kapoor
 
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...Suhani Kapoor
 
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...Pooja Nehwal
 
Call Girls In Mahipalpur O9654467111 Escorts Service
Call Girls In Mahipalpur O9654467111  Escorts ServiceCall Girls In Mahipalpur O9654467111  Escorts Service
Call Girls In Mahipalpur O9654467111 Escorts ServiceSapana Sha
 
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfLars Albertsson
 
Ukraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSUkraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSAishani27
 
B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxStephen266013
 
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiVIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiSuhani Kapoor
 
Customer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxCustomer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxEmmanuel Dauda
 
VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...
VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...
VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...Suhani Kapoor
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfRachmat Ramadhan H
 
Data Science Project: Advancements in Fetal Health Classification
Data Science Project: Advancements in Fetal Health ClassificationData Science Project: Advancements in Fetal Health Classification
Data Science Project: Advancements in Fetal Health ClassificationBoston Institute of Analytics
 
Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingNeil Barnes
 

Recently uploaded (20)

꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
 
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in  KishangarhDelhi 99530 vip 56974 Genuine Escort Service Call Girls in  Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
 
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfKantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
 
Data Science Jobs and Salaries Analysis.pptx
Data Science Jobs and Salaries Analysis.pptxData Science Jobs and Salaries Analysis.pptx
Data Science Jobs and Salaries Analysis.pptx
 
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
 
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
 
Russian Call Girls Dwarka Sector 15 💓 Delhi 9999965857 @Sabina Modi VVIP MODE...
Russian Call Girls Dwarka Sector 15 💓 Delhi 9999965857 @Sabina Modi VVIP MODE...Russian Call Girls Dwarka Sector 15 💓 Delhi 9999965857 @Sabina Modi VVIP MODE...
Russian Call Girls Dwarka Sector 15 💓 Delhi 9999965857 @Sabina Modi VVIP MODE...
 
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
 
E-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptxE-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptx
 
Call Girls In Mahipalpur O9654467111 Escorts Service
Call Girls In Mahipalpur O9654467111  Escorts ServiceCall Girls In Mahipalpur O9654467111  Escorts Service
Call Girls In Mahipalpur O9654467111 Escorts Service
 
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
 
Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdf
 
Ukraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSUkraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICS
 
B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docx
 
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiVIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
 
Customer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxCustomer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptx
 
VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...
VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...
VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
 
Data Science Project: Advancements in Fetal Health Classification
Data Science Project: Advancements in Fetal Health ClassificationData Science Project: Advancements in Fetal Health Classification
Data Science Project: Advancements in Fetal Health Classification
 
Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data Storytelling
 

Managing Growing Transaction Volumes Using Hadoop

  • 1. The entire contents of this document are subject to copyright with all rights reserved. All copyrightable text and graphics, the selection, arrangement and presentation of all information and the overall design of the document are the sole and exclusive property of Virtusa. Copyright © 2010 Virtusa Corporation. All rights reserved Click to edit Master title style 2000 West Park Drive Westborough MA 01581 USA Phone: 508 389 7300 Fax: 508 366 9901 Managing Growing Transaction Volumes Using Hadoop Arvind Purushothaman – Director, IM Practice
  • 2. 2 © Virtusa Corporation ● Confidential Agenda • Context Setting • CIO’s mandate • • Coexistence of architectures • Evaluation • Summary
  • 3. 3 © Virtusa Corporation ● Confidential During this presentation... In the Millennial World 15 minutes is a long time…….. 1.8Mn tweets will be generated Apple will receive about 700,000 App downloads Brands & Organisations will receive around 500,000 likes on Facebook Over 14Mn status updates on FACEBOOK 54,000 photos will be shared on INSTAGRAM Over 13Mn pieces of new FACEBOOK content will be created Over 3Bn email messages will be sent Google will receive over 30Mn Search Queries Over 8,000 new websites will be created Sources: Forrester Research, Hubspot Centre for Social Media, The Social Skinny, AlTwitter
  • 4. 4 © Virtusa Corporation ● Confidential …Consumers will spend over $5Mn online shopping During the course of this presentation…….. 44% of companies who tweet acquired new customers Almost 8 new people come onto the internet every second 57% of Companies who blog acquired new customers 61% of global internet users research products online 9/10 mobile searches lead to action… …Over half lead to purchaseSources: Forrester Research, Hubspot Centre for Social Media, The Social Skinny, AlTwitter
  • 5. 5 © Virtusa Corporation ● Confidential BIG DATA BIG NOISE BIG OPPORTUNITY Technology enables you to make sense out of ALL Available Data
  • 6. 6 © Virtusa Corporation ● Confidential How the Industry defines Big Data ? Gartner Defines Big Data is high-volume, high-velocity and high-variety information assets that demand cost effective, innovative forms of information processing for enhanced insight and decision making Forrester Defines The frontier of a firm’s ability to store, process and access (SPA) all the data it needs to operate effectively, make decisions, reduce risks, and serve customers. IBM: “….Big data is more than simply a matter of size; it is an opportunity to find insights in new and emerging types of data and content, to make your business more agile, and to answer questions that were previously considered beyond your reach..” Oracle: “…. Big Data refers to datasets that grow so large that it is difficult to capture, store, manage, share, analyze and visualize with the typical database software tools…” Website Network Switches Social Media RFIDTransactional / operational systems
  • 7. 7 © Virtusa Corporation ● Confidential CIO’s manifesto Support business growth through innovation Lower costs Both are not optional – you need to lower costs and innovate at the same time In the Information Management world, this means exponentially more data volumes, different types of data More investments in data storage, computing power, licenses What is the way forward?
  • 8. 8 © Virtusa Corporation ● Confidential Relational/Analytical Relational/Analytical Financial Data Marketing Data Data Warehouse (Relational) Data Mart Data Mart Sales Data Data Warehouse Access Parametric & Ad Hoc reporting OLAP Dashboards Exploratory Visualization Direct Data AccessETL Data Points Data stores Access to BI Platform Insight Generation Hadoop As Data Transformation Platform Transactions Logs Big Data Cluster (Hadoop) Parsed data Analytic data sets Raw Data Master Data Real Time Store (No SQL) Big Data Access BusinessIntelligencePlatform Statistical Analysis Machine Learning OpenSourceETL StreamingETL
  • 9. 9 © Virtusa Corporation ● Confidential Hybrid Architecture For A Telecom Client That Leverages HDFS, HBase, and Oracle 11g Integration & Infrastructure Platform SDEDS (APP10765) Bill & Payments Platform ONM (APP10487) HDFS HADOOP CLUSTER Raw Call Data CDR Store MapReduce  ICS  OCS  Answered  Unanswered  Diverted  Others REST GATEWAY UI Reports UI Reports UI Reports ETL Call Summary Data Oracle DB  Month  Date  Hour Level
  • 10. 10 © Virtusa Corporation ● Confidential Technology Components Of Hadoop Core • HDFS + MapReduce Data Movement • Relational Database – Sqoop • Real-time – Flume NoSQL •HBase Scheduling • Oozie Analytics • Cloudera Impala, Tableau with Hive Machine Learning •Mahout
  • 11. 11 © Virtusa Corporation ● Confidential 3W’s – What, Where and When Traditional DW data Semi and Un-structured dataHistorical , Infrequently AccessedLegal & Regulatory Insights Post shelf life Post processing – DW 85% tables and 50% columns unused* * Source: TDWI
  • 12. 12 © Virtusa Corporation ● Confidential Decision Points Source: Dr. Amr Awadallah and Dan Graham, “Hadoop and the Data Warehouse: When to Use Which”, copublished by Cloudera, Inc. and Teradata Corporation. *HBase.
  • 13. 13 © Virtusa Corporation ● Confidential Cost Considerations ETL Hadoop Hardware Expensive Low Software Expensive Low Development Medium Medium Maintenance High Low Investment High upfront Invest as needed
  • 14. 14 © Virtusa Corporation ● Confidential How Can You Get Started • Hadoop as an Enterprise Data Management platform is here to stay • Get started – either moving “unused data” or bringing in additional sources and types of data • In addition to “back-end” type functions, it provides Analytical capabilities in its own right • To start small, leverage Hadoop on the Cloud • Co-Existence is going to be the key for successful adoption Build a good use case before you start, build a POC, Evangelize It
  • 15. US - Boston, New York UK - Windsor, London India – Hyderabad, Chennai Sri Lanka - Colombo www.virtusa.com © 2010 All rights reserved. Virtusa and all other related logos are either registered trademarks or trademarks of Virtusa Corporation in the United States, the European Union, and/or India. All other company and service names are the property of their respective holders and may be registered trademarks or trademarks in the United States and/or other countries.