8. How do we realize MDA in a Hadoop Centric World?
HDF
Hadoop
HDFS
HBase Hive SOLR
YARN
Storm
Service
Management /
Workflow
SIEM
Spark
Raw Network Stream
Network Metadata Stream
Data Stores
Syslog
Raw Application Logs
Other Streaming Telemetry
TALK TRACK
I’m about to go over the products, consulting and training that Hortonworks offers, and I want you to keep this image in mind.
Remember:
The Internet of Anything is doubling the amount of data in the world every 2 years.
Connected Data Platforms deliver an open-architected solution to manage data, both in motion and at rest, empowering your organization to gain Actionable Intelligence delivered to your end users through Modern Data Apps.
Hortonworks DataFlow (aka HDF) manages your data in motion—bringing it to where you need it for real-time analysis to capture perishable insights or into storage for historical analysis.
Hortonworks Data Platform (aka HDP) stores the data at rest and provides historical insights through deep, detailed analysis of everything that’s already happened.
Those historical insights from HDP help optimize your data ingest with HDF, which in turn optimizes your data at rest.
This is how HDF, HDP, and Modern Data Applications deliver actionable intelligence to your end users.
And Actionable Intelligence is the beating heart animating the Future of Data.
[NEXT SLIDE]
CapOne – Ingesting from everywhere
Email, Syslog, Applog, Netflow…
Moving to “Cloud Only model”….even looking to use “docker Containers” in Amazon…
The team puts together a detailed architecture of the proposed solution using HDP and HDF. The architecture considers sources data from the numerous sources including Server Logs, Application Logs, XML and Senso data. This data is easily accepted into the flexible schema of HDP using HDF and Sqoop. The data is processed using Pig and analyzed using Spark. Then the data is made available in a real-time dashboard as well as to visualization and reporting tools.
[NEXT SLIDE]