The Hortonworks Data Platform therefore addresses all of these capabilities – completely in Open Source.
YARN is the architectural center of HDP and Hadoop that not only enables multiple data access engines across batch, interactive and real-time to all work on a single set of data but also extends Hadoop to integrate with the existing systems and tools you already have in your data center.
HDP delivers on the key enterprise requirements of governance, security and operations.
And of course it is supported on the widest possible range of deployment options: from Linux, to Windows (the only hadoop offering on Windows), appliance (from Microsoft or Teradata) or Cloud (Microsoft, Rackspace and more).
HDP is a comprehensive data management platform with one goal in mind: to enable an enterprise architecture with Hadoop.
Finally, there is only ONE Apache Hadoop. Every other package of hadoop is a vendor derivation of the platform.
At Hortonworks, everything we package in HDP is from the very latest components at the apache software foundation. This ensures that our customers have access to the very latest innovation from the community, to which we then apply enterprise software rigor to the build, test and release process to create HDP.
HDP “IS” Apache Hadoop – it is not a vendor derivative that has been forked and modified, it IS Apache Hadoop, no additions, no hold-backs.
When comparing Hadoop offerings vendors it is critical to understand this picture as it makes it clear where vendors are diverging from the community approach and ultimately locking customers out of the community innovation.
The modern data architecture simply does not work unless it integrates with the systems and tools you already deploy. HDP enables your existing data platforms to expand the data you have under management through integration. The goal of HDO is to augment not replace these existing systems as we very clearly understand that you need to ruuse skills.
Further, through our work within the Hadoop community to deliver YARN, we have opened up Hadoop and unlocked innovation in the community of data center ISVs can extend their applications so that they can run natively IN Hadoop as just another workload operating on the single set of data lake. They can now function as a first class citizen alongside any other workload in Hadoop.
Cloudbreak is the infrastructure-agnostic and secure Hadoop as a Service API for multi-tenant clusters.
This technology first appeared as a beta in July 2014 and marked the first collaboration between Hortonworks and the SequenceIQ team.
We leveraged the extensibility of Apache Ambari via Blueprints to deliver this easy to use deployment technology.
Hadoop provisioning using Docker containers was also presented at Hadoop Summit 2014 in San Jose, and we observed an overwhelmingly positive reception from the open source community for these innovations.
The SequenceIQ team developed Periscope to bring policy-based autoscaling to Hadoop.
Periscope ensures that you can meet your service level agreements while running your applications.
Just like Cloudbreak, Periscope is built atop Apache Ambari and Apache Hadoop YARN, and it leverages the latest cutting edge features of these projects.