Big Data Analytics - Is Your Elephant Enterprise Ready?

1,412 views

Published on

Hadoop’s cost effective scalability and flexibility to analyze all data types is driving organizations everywhere to embrace big data analytics. From proof of concept to deployment across the enterprise, join Datameer and Hortonworks as we answer the ‘now what?’ when rolling out your Hadoop big data analytics project. This webinar will address critical project components such as data security, data privacy, high availability, user training and use case development.

Published in: Education
0 Comments
5 Likes
Statistics
Notes
  • Be the first to comment

No Downloads
Views
Total views
1,412
On SlideShare
0
From Embeds
0
Number of Embeds
31
Actions
Shares
0
Downloads
0
Comments
0
Likes
5
Embeds 0
No embeds

No notes for slide
  • In summary, by addressing these elements, we can provide an Enterprise Hadoop distribution which includes the:Core ServicesPlatform ServicesData ServicesOperational ServicesRequired by the Enterprise user.And all of this is done in 100% open source, and tested at scale by our team (together with our partner Yahoo) to bring Enterprise process to an open source approach. And finally this is the distribution that is endorsed by the ecosystem to ensure interoperability in your environment.
  • In summary, by addressing these elements, we can provide an Enterprise Hadoop distribution which includes the:Core ServicesPlatform ServicesData ServicesOperational ServicesRequired by the Enterprise user.And all of this is done in 100% open source, and tested at scale by our team (together with our partner Yahoo) to bring Enterprise process to an open source approach. And finally this is the distribution that is endorsed by the ecosystem to ensure interoperability in your environment.
  • Committed to building 100% open source Hadoop for the Enterprise
  • So how does this get brought together into our distribution? It is really pretty straightforward, but also very unique:We start with this group of open source projects that I described and that we are continually driving in the OSS community. [CLICK] We then package the appropriate versions of those open source projects, integrate and test them using a full suite, including all the IP for regression testing contributed by Yahoo, and [CLICK] contribute back all of the bug fixes to the open source tree. From there, we package and certify a distribution in the from of the Hortonworks Data Platform (HDP) that includes both Hadoop Core as well as the related projects required by the Enterprise user, and provide to our customers.Through this application of Enterprise Software development process to the open source projects, the result is a 100% open source distribution that has been packaged, tested and certified by Hortonworks. It is also 100% in sync with the open source trees.
  • In summary, by addressing these elements, we can provide an Enterprise Hadoop distribution which includes the:Core ServicesPlatform ServicesData ServicesOperational ServicesRequired by the Enterprise user.And all of this is done in 100% open source, and tested at scale by our team (together with our partner Yahoo) to bring Enterprise process to an open source approach. And finally this is the distribution that is endorsed by the ecosystem to ensure interoperability in your environment.
  • Big Data Analytics - Is Your Elephant Enterprise Ready?

    1. 1. Enterprise Class New World AnalyticsJanuary 29, 2013© Hortonworks Inc. 2013 Page 1
    2. 2. Your Presenters Matthew Schumpert Director of Solutions Engineering Datameer Jim Walker Director, Product Marketing Hortonworks Page 2 © Hortonworks Inc. 2013
    3. 3. Analytics that tame the elephant! © 2012 Datameer, Inc. All rights reserved.
    4. 4. Keys to success 1. Be forward thinking 2. Open your eyes to a new paradigm 3. Keep it real 4. Be “mission-critical” focused © 2012 Datameer, Inc. All rights reserved.
    5. 5. Mission Critical BI AAA • Authentication: • Unified Identity management (LDAP/AD) & SSO (SAML) • Authorization: Role-based access (w/ custom roles) • Accountability: Audit trail, ownership Table Management • Metadata catalog, partitioning Data Quality: Dealing with dirty data Pipelines • Dependencies, scheduling, retention policies Workload management • base on user/group/project • shared service model w/ chargebacks Page 5 © 2012 Datameer, Inc. All rights reserved.
    6. 6. Mission Critical Hadoop• Seamless cluster expansion – limitless and immediate scale• High availability of the data platform• Disaster recovery• IT Operations – clustered environment needs ops• Tested, hardened distribution• Open Page 6 © Hortonworks Inc. 2013
    7. 7. Demo: what to look for….• Authentication, Authorization, Accountability• Table Management: Metadata catalog, partitioning• Data Quality: Dealing with dirty data• Dependencies, scheduling, retention windows Page 7 © Hortonworks Inc. 2013
    8. 8. Datameer Capabilities:Seamless Data Integration Powerful Analytics Business Infographics• Structured, semi and unstructured • Interactive spreadsheet UI • WYSIWYG data mash ups• 40+ Connectors • Over 200 built-in analytic functions • Rich, personalized, branded• Connector plug-in API • Macros & function plug-in API • Delivery to any device Page 8
    9. 9. Datameer In the Data Center: © Hortonworks Inc. 2013 Page 9
    10. 10. Hortonworks Snapshot We develop, distribute and support the ONLY 100% open source Enterprise Hadoop distributionDevelop Distribute Support• We employ the core • We distribute the only 100% • We are uniquely positioned architects, builders and Open Source Enterprise to deliver the highest quality operators of Apache Hadoop Hadoop Distribution: of Hadoop support Hortonworks Data• We drive innovation within Platform • We enable the ecosystem to Apache Software work better with Hadoop Foundation projects • We engineer, test & certify HDP for enterprise usageEndorsed by Strategic Partners Page 10 © Hortonworks Inc. 2013
    11. 11. Hortonworks Process for Enterprise HadoopUpstream Community Projects Downstream Enterprise Product Virtuous cycle when development & fixed issues done upstream & stable project releases flow downstream Integrate & Test Fixed Issues Apache Design & Pig Test & Patch Develop Apache Release Package Hadoop & Certify Apache Stable Project Hortonworks Hive Releases Design & Develop Data Platform Apache Apache HCatalo HBase g Distribute Apache Other Ambari Apache Projects No Lock-in: Integrated, tested & certified distribution lowers risk by ensuring close alignment with Apache projects Page 11 © Hortonworks Inc. 2013
    12. 12. HDP: Enterprise Hadoop Distribution OPERATIONAL DATA Hortonworks SERVICES SERVICES Data Platform (HDP) Manage & Store, Operate at Process and Enterprise Hadoop Scale Access Data • The ONLY 100% open source HADOOP CORE Distributed and complete distribution Storage & Processing PLATFORM SERVICES Enterprise Readiness • Enterprise grade, proven and tested at scale HORTONWORKS DATA PLATFORM (HDP) • Ecosystem endorsed to ensure interoperability OS Cloud VM Appliance Page 12 © Hortonworks Inc. 2013
    13. 13. Datameer and Hortonworks: A best-of-breed solution and team • The leading purpose-built Hadoop BI platform from Datameer • Cutting-edge distribution and production support from Hortonworks • Expertise in managing the world’s largest clusters (Yahoo!) Datameer fully certified on HDP (1.0-1.2) Currently engaged with multiple joint clients Committed to the Apache Hadoop ecosystem • Contributors to Apache Nutch on Datameer staff • Continuing releases that track Apache releases (0.20.1  1.1.1) • Native support for HBase, Hive, Kerberos, and MapReduce 2.0! © 2012 Datameer, Inc. All rights reserved.

    ×