Building Information Platform - Integration of Hadoop with SAP HANA and HANA VORA
1. (C)2016 VUPICO LLC.
Building Information Analytics Platform: Integration of Hadoop with SAP® HANA and HANA VORA
VUPICO LLC
27 October 2016
2. (C)2016 VUPICO LLC.
VUPICO Profile
Business Value of Data
Building Information Analytics Platform
Integration Hadoop and SAP® HANA
SAP® HANA VORA
Predictive Analytics
How to generate value from Hadoop
4. (C)2016 VUPICO LLC.
Analytics is all we Do!
Offices in Japan, India and the USA
Focused on generating value to clients through Analytics
Hadoop
SAP® HANA
Predictive Analytics
Official Consulting Partner of Hortonworks Hadoop
http://hortonworks.com/partner/vupico/
VUPICO Profile
5. (C)2016 VUPICO LLC.
SAP® BI HANA Hybrid Model with
Hadoop Integration Project – First in Japan
Two implementations in Japan
• One successfully implemented
• One in progress
Significant value generated through insights
First SAP® HANA VORA project in Japan to
integrate Hadoop Spark and SAP® HANA
Take advantage of SAP® in-memory solution and computing
Successful Predictive Analytics POC in Japan for Vending Machines
VUPICO Profile
7. (C)2016 VUPICO LLC.
Business Value of Data
Integrated /
Predictive
Analytics & Modeling
Scenario Evaluation
and Risk Management
KPI Dashboards
Model Development
Model
Validation
Predictive Analytics
Financial Metrics and
Analytics
Profitability and
Cost Analytics
Operational Metrics and Analysis
Financial
Consolidation
Planning, Budgeting,
Estimation, Forecasting
Information Management
and Data Warehouse
Optimized business information
Leading Indicators (KPIs)
Lagging Indicators (KRIs)
Information
Process Model
Data &
Sources
Cost Savings and Continuous Improvement
by applying results of analysis to decision making
Financial and Transactional Systems Structured and Unstructured Data
GL
CCA / PCA
Sales Transportation
Production SalesForce
CRM / Campaigns
Detail Data
Summarized Data
Twitter
Facebook
LinkedIn
Market Research
Competitor
Customer Behavioral Insights
Predict Future based on
Trends / Correlations
Agility and Interactivity for
KPIs to Run the Business
Data Storage
Manage the Business
Run the Business
9. (C)2016 VUPICO LLC.
What can an Integrated Information Analytics Platform
do for the Customer?
System requirement depends on role
Strategic Leadership Business User IT Department
Dynamic Extensibility
OLAP + OLTP
Operational Data marts
Cost Benefits of
Storing data in Hadoop
Remove Silos of Data
Real-time access of huge data
Operational report out of
OLTP system
Improved business
functionality
Consistent UI across all
devices with dashboards
Reduction in TCO
Faster decision making
based on insights
From Analytics to
Business Value
Improved User
Experience for better
productivity
Ageing report: 10 hrs to 5 min
Reconciliation exceptions: 100k to 5k
Custom transaction: 30 min to 5 min
FI transaction: 5 min to 10 sec
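The before/after figures quoted above can be turned into improvement factors with simple arithmetic. The following is a minimal sketch for illustration only: the figures are the ones on the slide, while the dictionary layout and the function names are hypothetical and not part of any VUPICO or SAP tool.

```python
# Before/after figures quoted on the slide; durations are in seconds.
# The names and structure here are illustrative assumptions.
DURATIONS = {
    "Ageing report": (10 * 3600, 5 * 60),      # 10 hrs -> 5 min
    "Custom transaction": (30 * 60, 5 * 60),   # 30 min -> 5 min
    "FI transaction": (5 * 60, 10),            # 5 min -> 10 sec
}
RECONCILIATION_EXCEPTIONS = (100_000, 5_000)   # 100k -> 5k


def speedup(before: float, after: float) -> float:
    """Return how many times faster the 'after' run is than the 'before' run."""
    return before / after


def reduction_pct(before: float, after: float) -> float:
    """Return the percentage reduction from 'before' to 'after'."""
    return (before - after) / before * 100


if __name__ == "__main__":
    for name, (before, after) in DURATIONS.items():
        print(f"{name}: {speedup(before, after):.0f}x faster")
    pct = reduction_pct(*RECONCILIATION_EXCEPTIONS)
    print(f"Reconciliation exceptions: {pct:.0f}% fewer")
```

Expressing each result as a factor or percentage makes the four improvements comparable even though they are quoted in different units.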
Integrated HANA/Hadoop
Platform can Bring
Agility
Performance
Simplicity
Mobility
User experience
Flexibility
Cost efficiency
10. (C)2016 VUPICO LLC.
• Usability: Accommodate different types of users across all BI capabilities and on the full range of technology, including mobile devices.
• Common Business View: It is critical that a BI solution deliver a common view of the business. The single view must be based on all the data, and the quality of the data must be maintained to ensure user confidence.
• Agility: The solution must be flexible so that data models can be quickly modified to support an ever-changing business environment.
• Scalability: The BI architecture needs to be scalable to support growing data and analytical requirements. The BI platform must scale in a linear fashion to thousands of users across a global enterprise.
• Openness: BI must be open, both in terms of the data you can access and for integration with existing and new applications, portals, security systems and more.
• Deployability: Deploying the BI information platform to users in the required format must be a simple activity, as must making changes to the way information is deployed.
To Consider when Building an Analytics Platform
11. (C)2016 VUPICO LLC.
Scale Fast
Start
Small
THINK BIG
Think Big and Start Small
Prepare enterprise data integration strategy
Which data to bring to Hadoop first?
Identify how Hadoop can benefit your
organization
Implement data governance
What data will be stored where? Hadoop vs HANA
vs other systems
Have strong data access strategy
Visualization
Reporting
Data Science
How to Approach Building an Analytics Platform
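The governance question "What data will be stored where?" can be made explicit as a small set of placement rules. The sketch below is one possible way to write such rules down, assuming a simple split in which unstructured and historical data go to Hadoop and data needing real-time analysis goes to SAP HANA. The `Dataset` fields, the one-year cut-off and the function name are hypothetical and would need to be replaced by an organization's own governance policy.

```python
from dataclasses import dataclass


@dataclass
class Dataset:
    """Hypothetical description of a dataset for placement decisions."""
    name: str
    structured: bool
    needs_real_time: bool
    age_days: int


# Assumed cut-off: data older than this is treated as historical.
HISTORICAL_AFTER_DAYS = 365


def choose_store(ds: Dataset) -> str:
    """Pick a target store using the illustrative rules described above."""
    if not ds.structured:
        return "Hadoop"      # unstructured data: scalable, cost-efficient storage
    if ds.needs_real_time:
        return "SAP HANA"    # real-time analysis: in-memory platform
    if ds.age_days > HISTORICAL_AFTER_DAYS:
        return "Hadoop"      # historical detail data
    return "SAP HANA"        # recent structured data


if __name__ == "__main__":
    examples = [
        Dataset("tweets", structured=False, needs_real_time=False, age_days=3),
        Dataset("open_orders", structured=True, needs_real_time=True, age_days=1),
        Dataset("sales_2012", structured=True, needs_real_time=False, age_days=1500),
    ]
    for ds in examples:
        print(ds.name, "->", choose_store(ds))
```

Writing the rules as code is mainly useful as a shared, reviewable artifact; the actual criteria (cost, latency, retention, compliance) are decisions for the data governance process.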
13. (C)2016 VUPICO LLC.
Combine best of both worlds
Integrated enterprise data with common view
of business and customers
Hadoop
Scalable Storage
Distributed Computing - disk based
Structured & Unstructured Data
Cost Efficient Way of Storing Data
SAP® HANA
Analyze data in real-time
In-memory Platform
Easy integration with SAP
Native predictive, text and spatial
algorithms
Integrated Information Analytics Platform
- Hadoop & SAP® HANA -
14. (C)2016 VUPICO LLC.
Information Analytics Platform
Information Analytics Platform
IoT
Twitter®
External Data
SalesForce®
SAP ERP®
Mainframe
Source System
Batch
Real Time
SAP® HANA
Replication Visualization
HDFS
YARN
HANA
VORA
In Memory
Database
SAP
Template
Analytical
Tool
BI Tool
Dashboard
ETL
Historical Data
Other System
Mobile
Hadoop and SAP® HANA Integration
Integration Landscape
15. (C)2016 VUPICO LLC.
Info
Hadoop and SAP® HANA Integration
Information Analytics Platform
SAP® HANA
Hadoop
HDFS
YARN
HANA
VORA
In Memory
Database
SAP
Template
Growing
Commodity and
Opensource
Benefit
Unstructured Data
Mature Product
Real-time
Ultra-high speed
Advanced Analytics
For Data Scientist
Data archive
For IT Department
Use Case
KPI Dashboard
For Executive
Business Analysis
Report
For Management
Operational Report
For Business User
Integration Benefit and Use Case
16. (C)2016 VUPICO LLC.
Hadoop and SAP® HANA Integration
Simulation
Sales Forecast
Report
(Full Year)
Sales Data
(Budget)
Sales Data
(Historical)
Sales Data
(Budget)
Sales Data
(RE)
Integration scenario
- Reports and analytics access data in both SAP HANA® and Hadoop
Sales Data
(Actual)
Sales Data
(Actual)
Sales Data
(Historical)
Sales Data
(YTD)
Simulation
Detail Data
Summary Data
KPI Sales
(YTM)
Source System
SAP HANA
Hadoop
Visualization
19. (C)2016 VUPICO LLC.
Information Analytics Platform
Information Analytics Platform
IoT
Twitter®
External Data
SalesForce®
SAP ERP®
Mainframe
Source System
Batch
Real Time
SAP® HANA
Replication Visualization
HDFS
YARN
HANA
VORA
In Memory
Database
SAP
Template
Analytical
Tool
BI Tool
Dashboard
ETL
Historical Data
Other System
Mobile
What is SAP® HANA VORA?
Bridge between Hadoop and SAP HANA
20. (C)2016 VUPICO LLC.
SAP® HANA VORA
SAP HANA Platform
SDA –
Smart Data Access
SDI –
Smart Data Integration
vUDF –
Virtual Function
Map/
Reduce
Federate
(SQL)
Load
(JDBC
Access)
Execute
Job
HDFS
Access
YARN
HDFS Files
VORA
Files
Spark
Controller
Spark
Controller
Visualization / Reporting / Dashboards / Predictive Analytics
SAP® SalesForce®
Mainframe Oracle® Twitter® IOT
21. (C)2016 VUPICO LLC.
Hadoop and SAP® HANA Integration with HANA VORA
Simulation
Sales Forecast
Report
(Full Year)
Sales Data
(Budget)
Sales Data
(Historical)
Sales Data
(Budget)
Enhanced Integration scenario with HANA VORA
Sales Data
(Actual)
Sales Data
(Actual)
Sales Data
(Historical)
Sales Data
(YTD)
Simulation
Detail Data
Summary Data
KPI Sales
(YTM)
Source System
SAP HANA
Hadoop
Visualization
Sales Data
(RE)
HANA VORA
(In memory)
22. (C)2016 VUPICO LLC.
Key Benefits of integrating SAP® HANA with
HANA VORA/Hadoop
Benefits
• Hierarchies (parent / child)
• Unit conversion (km, m, cm, mm)
• Currency conversion
• In-memory analysis
• Query optimizer
• Fast query processing
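To make two of the listed benefits concrete, the sketch below shows in plain Python what a unit conversion and a parent/child hierarchy roll-up compute. It does not use SAP HANA VORA and does not reflect its API; the function names, the hierarchy and the sample values are hypothetical.

```python
# Size of each unit in millimetres, for the units shown on the slide.
TO_MM = {"mm": 1, "cm": 10, "m": 1000, "km": 1_000_000}


def convert(value: float, src: str, dst: str) -> float:
    """Convert a length from unit 'src' to unit 'dst'."""
    return value * TO_MM[src] / TO_MM[dst]


# Hypothetical parent/child hierarchy and leaf-level values.
PARENT_OF = {"Child A": "Parent", "Child B": "Parent"}
LEAF_VALUES = {"Child A": 40.0, "Child B": 60.0}


def rollup(parent_of: dict, leaf_values: dict) -> dict:
    """Aggregate leaf values up to every ancestor in the hierarchy."""
    totals = dict(leaf_values)
    for node, value in leaf_values.items():
        current = node
        while current in parent_of:          # walk up to the root
            current = parent_of[current]
            totals[current] = totals.get(current, 0.0) + value
    return totals


if __name__ == "__main__":
    print(convert(150, "cm", "m"))
    print(rollup(PARENT_OF, LEAF_VALUES))
```

In an integrated platform these operations would be pushed down to the engine rather than run in client code; the sketch only illustrates the semantics.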
23. (C)2016 VUPICO LLC.
Hadoop-VORA Architecture
SAP®
• SAP® HANA VORA 1.3
Hadoop ecosystem
• Hadoop 2.7.1 (HDFS distributed file system, YARN processing)
• Flume 1.5.2 (unstructured or semi-structured data)
• Sqoop 1.4.6 (structured data)
• HBase 1.1.2 (database)
• Spark SQL 1.5.2
• Hive 1.2.1
• Pig 0.15.0
• Tez 0.7.0
• Kafka 0.9.0
• Oozie 4.2.0 (workflow)
• ZooKeeper 3.4.6 (coordination)
• Ambari 2.1.2.1 (cluster provisioning tool)
27. (C)2016 VUPICO LLC.
in SAP HANA – Create virtual table
Data Integration SAP HANA and Hadoop
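The slide names the step "create virtual table" without showing the statement. As a rough sketch, the helper below composes a statement of the general shape used by SAP HANA Smart Data Access, where the virtual table points at a four-part remote name (remote source, database, schema, object). Every identifier in the example (schema, table and remote source names) is hypothetical, the helper only builds a string and does not connect to any system, and the exact remote name parts depend on the adapter, so the result should be checked against the SAP HANA documentation for the version in use.

```python
def quote_ident(name: str) -> str:
    """Quote an SQL identifier, doubling any embedded double quotes."""
    return '"' + name.replace('"', '""') + '"'


def create_virtual_table_sql(
    local_schema: str,
    virtual_table: str,
    remote_source: str,
    remote_database: str,
    remote_schema: str,
    remote_table: str,
) -> str:
    """Build a CREATE VIRTUAL TABLE statement for a remote (e.g. Hadoop) table."""
    local = f"{quote_ident(local_schema)}.{quote_ident(virtual_table)}"
    remote = ".".join(
        quote_ident(part)
        for part in (remote_source, remote_database, remote_schema, remote_table)
    )
    return f"CREATE VIRTUAL TABLE {local} AT {remote}"


if __name__ == "__main__":
    # All names below are placeholders for illustration.
    print(
        create_virtual_table_sql(
            "ANALYTICS", "VT_SALES_HISTORY",
            "HADOOP_SRC", "<NULL>", "default", "sales_history",
        )
    )
```

Once such a virtual table exists, reports in SAP HANA can query it like a local table while the data stays in Hadoop.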
29. (C)2016 VUPICO LLC.
A set of business intelligence technologies that uncovers relationships and patterns
within large volumes of data that can be used to predict behavior and events
Predictive Analytics is technology that learns from experience to predict the future
behavior of individuals in order to drive better business decisions
High
High
Low
Business Value
Complexity
REPORTING
What Happened?
ANALYSIS
Why did it happen?
MONITORING
What is happening now?
PREDICTION
What might happen?
Business Intelligence Technologies
Query, reporting and search
tools
Cubes and visualization
utilities
Dashboards and
Scoreboards
Predictive Analytics
Predictive Analytics on
Integrated Platform
30. (C)2016 VUPICO LLC.
• Integrate Predictive Analytics
• Enable real-time analytics with SAP HANA and/or
• Run predictive analytics as batch on Hadoop
• Combine structured and unstructured data for analytics
• R integration within Hadoop and/or SAP HANA with access
to 5000+ open source algorithms
• Predictive analytics extended into BI reports / dashboards
/ alerts or mobile devices
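As a very small illustration of predicting from a trend, the sketch below fits a straight line to a series of past values by least squares and extrapolates one step ahead. It is a stand-in written in plain Python, not the R or SAP HANA predictive libraries mentioned above and not the method used in any VUPICO project; the function names and the sample series are hypothetical.

```python
def fit_trend(values):
    """Fit y = intercept + slope * x by least squares, with x = 0, 1, 2, ..."""
    n = len(values)
    if n < 2:
        raise ValueError("need at least two observations")
    mean_x = (n - 1) / 2
    mean_y = sum(values) / n
    sxx = sum((x - mean_x) ** 2 for x in range(n))
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in enumerate(values))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return intercept, slope


def forecast_next(values):
    """Extrapolate the fitted line to the next period."""
    intercept, slope = fit_trend(values)
    return intercept + slope * len(values)


if __name__ == "__main__":
    weekly_sales = [10, 12, 14, 16]   # made-up sample data
    print(forecast_next(weekly_sales))
```

A batch job on Hadoop could fit such models over large histories, while the fitted parameters could be applied to fresh data in SAP HANA for real-time scoring.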
Business users know what happened,
they want to know why and what’s likely to happen next
Predictive Analytics on
Integrated Platform
31. (C)2016 VUPICO LLC.
Key Benefits of Predictive Analytics on
Integrated Platform
• Reduced development time
• Create models in minutes/hours rather than days
• Scale to terabytes or petabytes of data
• Automate data preparation, modelling and deployment tasks
• Integrated visualization and reporting
Fraud Detection
Modeling user behaviors
Predictive Maintenance
Sport Analytics
Vending Machines
33. (C)2016 VUPICO LLC.
Lessons Learnt
Be prepared for Hadoop Evolution
Hadoop ecosystem continues to grow and evolve
Keep up with Hadoop news and trends
Join Hadoop communities (meetups, forums, on-line communities etc.)
Prioritize data to set foundation for analytics
You don’t have to solve all problems at once!
Prepare data governance early and share it with all
Ensure compatibility amongst Hadoop / SAP HANA and Vora versions
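One way to act on the compatibility lesson is to compare the versions installed on a cluster against a known-good baseline before upgrading any component. The sketch below uses some of the versions listed on the Hadoop-VORA Architecture slide as that baseline, which is an assumption made here for illustration; authoritative compatibility information comes from the vendors' own support matrices. The function names and the `installed` dictionary are hypothetical.

```python
# Baseline taken from the versions listed on the architecture slide.
# Treating this list as the validated combination is an assumption.
BASELINE = {
    "Hadoop": "2.7.1",
    "Spark SQL": "1.5.2",
    "Hive": "1.2.1",
    "Ambari": "2.1.2.1",
    "SAP HANA VORA": "1.3",
}


def parse_version(text: str) -> tuple:
    """Turn '2.7.1' into (2, 7, 1) so versions compare numerically."""
    return tuple(int(part) for part in text.split("."))


def find_mismatches(installed: dict, baseline: dict = BASELINE) -> dict:
    """Return {component: (installed, expected)} for every deviation."""
    mismatches = {}
    for component, expected in baseline.items():
        actual = installed.get(component)
        if actual is None or parse_version(actual) != parse_version(expected):
            mismatches[component] = (actual, expected)
    return mismatches


if __name__ == "__main__":
    installed = dict(BASELINE)
    installed["Spark SQL"] = "1.6.0"   # simulate an unplanned upgrade
    print(find_mismatches(installed))
```

Running such a check as part of cluster provisioning flags version drift before it surfaces as an integration failure.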
34. (C)2016 VUPICO LLC.
Questions?
How to contact us:
Takuya Okamoto
Email: takuya.okamoto@vupico.com
Simon Vukojevic
Email: simon@vupico.com
www.vupico.com
35. (C)2016 VUPICO LLC.
Job Opportunity
1. Data Scientist Tokyo, Japan or Hyderabad, India
2. Hadoop Architect Tokyo, Japan or Hyderabad, India
3. SAP BW/4HANA Tokyo, Japan or Hyderabad, India
4. SAP HANA VORA Tokyo, Japan or Hyderabad, India
Aside: outsized returns come from analytic applications that push insights to the front-line. Existing BI users just having access to more data becomes table stakes.