What the Enterprise Requires - Business Continuity and Visibility

Cloudera Enterprise BDR delivers centralized disaster recovery for data and metadata, enabling you to prepare for disaster by moving data to your secondary site automatically. Cloudera Navigator 1.0 provides data governance capabilities such as verifying access privileges and auditing access to all data stored in Hadoop, which are critical for customers that are in highly regulated industries and have stringent compliance requirements.

This presentation will teach you how to:

- Centrally configure and manage replication workflows for files (HDFS) and metadata (Hive)
- Consistently meet or exceed SLAs and RTOs through simplified management and process automation
- Track access permissions and actual accesses to all data objects in Hive, HBase, and HDFS
- Answer the questions:
  - Who has access to which data objects?
  - Which data objects did a given user access?
  - When was a data object accessed, and by whom?
  - Which data assets were accessed through a given service?
  - Which device was used to access the data?
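
The audit questions above amount to simple filters over a log of access events. The following is a minimal sketch of that idea, not the Navigator API or its schema: the `AuditEvent` record, its field names, the helper functions, and the sample events are all assumptions made up for illustration. The first question (who has access) concerns permissions rather than the audit log, so it is not covered here.

```python
from dataclasses import dataclass
from datetime import datetime


@dataclass(frozen=True)
class AuditEvent:
    # Hypothetical record shape; the real audit schema is not described in the source.
    timestamp: datetime
    user: str
    service: str       # e.g. "HDFS", "Hive", "HBase"
    data_object: str   # a path or table name
    client_host: str   # the device the request came from


def objects_accessed_by(events, user):
    """Which data objects did a given user access?"""
    return sorted({e.data_object for e in events if e.user == user})


def access_history(events, data_object):
    """When was a data object accessed, and by whom?"""
    return sorted((e.timestamp, e.user) for e in events if e.data_object == data_object)


def objects_accessed_via(events, service):
    """Which data assets were accessed through a given service?"""
    return sorted({e.data_object for e in events if e.service == service})


def hosts_used_for(events, data_object):
    """Which device was used to access the data?"""
    return sorted({e.client_host for e in events if e.data_object == data_object})


# Invented sample events, for illustration only.
EVENTS = [
    AuditEvent(datetime(2013, 3, 1, 9, 0), "alice", "HDFS", "/data/sales", "host-a"),
    AuditEvent(datetime(2013, 3, 1, 9, 5), "bob", "Hive", "default.orders", "host-b"),
    AuditEvent(datetime(2013, 3, 2, 14, 30), "alice", "Hive", "default.orders", "host-a"),
]

if __name__ == "__main__":
    print(objects_accessed_by(EVENTS, "alice"))
    print(access_history(EVENTS, "default.orders"))
```

Keeping each question as its own small function mirrors the list above; a real deployment would presumably query the audit store directly rather than filter an in-memory list.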

Published in: Technology

Notes
  • Lots of data is landing in Cloudera Enterprise: huge quantities, many different sources, many different structures, and varying levels of sensitivity. Different users are working with the same data. Administrators: how do I ensure the right data is accessible by the right users and applications? Compliance officers: how do I report on who has been accessing the data? Analysts: how do I find out what data is available, where it came from, and what it looks like? There is a need for an easy way to empower users and administrators to be effective.
  • Introducing…the first DM suite for Hadoop
  • Transcript

    • 1. Cloudera Enterprise BDR & Cloudera Navigator. Jai Ranganathan, Director Products, BDR; Tushar Shanbhag, Director Products, Cloudera Navigator
    • 2. Agenda
        1. Cloudera Enterprise BDR: Technology Overview, Demo
        2. Cloudera Navigator: Technology Overview, Demo
        3. Q&A
    • 3. Cloudera Enterprise BDR, presented by Jai Ranganathan
    • 4. Why You Need Cloudera Enterprise BDR
        1. Cloudera Enterprise is a Mission-Critical Part of the Data Management Infrastructure
           - Stores valuable data & runs important workloads
           - Business continuity is a MUST HAVE
        2. Managing Business Continuity for Hadoop is Complex
           - Different services that store data: HDFS, HBase, Hive
           - Backup & disaster recovery is configured separately for each
           - Processes are manual
    • 5. Cloudera Enterprise BDR: Simplified Management of Backup & DR Policies
        - Central Configuration: Define backup and disaster recovery policies and apply across services
        - Monitoring & Alerting: Track progress of replication jobs and get notified when data is out of sync
        - Performance & Reliability: High performance, CDH-optimized replication using MapReduce (via DistCp)
        - [Diagram: Site A and Site B, each with Hive, HDFS, and nodes]
    • 6. Cloudera Enterprise BDR Version 1.0
        - [Diagram: Cloudera Enterprise. Cloudera Manager disaster recovery module: Select, Configure, Synchronize, Monitor. CDH: HDFS distributed replication (high performance replication using MapReduce) and Hive metastore replication (the only disaster recovery solution for metadata)]
    • 7. Management Capabilities: Cloudera Enterprise BDR Version 1.0
        - SELECT: Select subset of data or tables to be replicated
        - CONFIGURE: Configure schedule and options for data replication
        - SYNCHRONIZE: Perform synchronization using appropriate tools
        - MONITOR: Report progress, track errors, generate alerts
    • 8. Platform Enhancements: CDH 4.2
        1. Distributed Copy
           - Hardened, production-ready DistCp across clusters
           - Kerberos integration
           - Cross-cluster HA and federation
           - Full API access through Cloudera Manager
           - Detailed error and progress reporting
        2. Metastore Replication
           - SQL import/export between two different metastores
           - Fix file paths and other cluster-specific information
        3. HBase
           - HBase snapshots v1 (not supported in Cloudera Enterprise BDR 1.0)
    • 9. Benefits of Cloudera Enterprise BDR
        - Reduce Complexity
           - Centrally manage backup & DR workflows
           - Simple setup via an intuitive user interface
        - Maximize Efficiency
           - Simplify processes to meet or exceed SLAs & Recovery Time Objectives (RTOs)
           - Optimize system performance & network impact through scheduling
        - Reduce Risk & Exposure
           - Eliminate error-prone manual processes
           - Get notified when issues occur
           - The only solution for metadata replication (Hive)
    • 10. Cloudera Enterprise BDR: Optional Add-On for Business Continuity
        - Backup & DR Management w/Cloudera Manager
        - 8x5 or 24x7 Support
        - Optional Upgrade from Enterprise Core
        - Available Now (Sold with Support)
        - [Diagram: Ingest, Store, Explore, Process, Analyze, Serve. Management software, data management & technical support (subscription): Cloudera Manager with Core and BDR. CDH: 100% open source. Open source projects]
    • 11. Cloudera Navigator, presented by Tushar Shanbhag
    • 12. Why You Need Cloudera Navigator
        1. Lots of Data Landing in Cloudera Enterprise
           - Huge quantities
           - Many different sources, structured & unstructured
           - Varying levels of sensitivity
        2. Many Users Working with the Data
           - Administrators & compliance officers
           - Analysts & data scientists
           - Business users
        3. Need to Effectively Control & Consume Data
           - Get visibility & control over the environment
           - Discover, explore and consume data
    • 13. Cloudera Navigator: Data Management Suite for Cloudera Enterprise
        - Audit & Access Management: Ensuring appropriate permissions & auditing on data access
        - Discovery & Exploration: Discover what data is available and what it looks like
        - Lineage: Tracing data back to its original source
        - Lifecycle Management: Migration of data based on policies
        - [Diagram: Cloudera Navigator (Audit & Access, Discovery & Exploration, Lineage, Lifecycle Mgmt.) on top of an Enterprise Metadata Repository (business metadata, lineage metadata, operational metadata) on top of CDH (HDFS, HBase, Hive)]
    • 14. Cloudera Navigator 1.0: Data Audit & Access Management
        - Verify Permissions: View which users and groups have access to files and directories
        - Audit Configuration: Configuration of audit tracking for HDFS, HBase and Hive
        - Audit Dashboard: Simple, queryable interface to view data access
        - Information Export: Export audit information for integration with SIEM tools
        - [Diagram: IAM / LDAP system; Cloudera Navigator 1.0 with an access service (view permissions) and an audit log service (audit log config, audit log collection); HDFS, HBase, Hive; 3rd party SIEM / GRC system]
    • 15. Benefits of Cloudera Navigator 1.0
        - Control
           - Store sensitive data
           - Maintain full audit history
           - The first & only centralized audit tool for Hadoop
        - Visibility
           - Verify access permissions to files & directories
           - Report on data access by user and type
        - Integration
           - View permissions for LDAP/IAM users
           - Export audit data for integration with 3rd party SIEM tools
    • 16. Cloudera Navigator 1.0: Data Management Suite for Cloudera Enterprise
        - Centralized Audit Management & Access Control
        - 8x5 or 24x7 Support
        - Add-On to Enterprise Core
        - Available Now
        - [Diagram: Ingest, Store, Explore, Process, Analyze, Serve. Management software, data management & technical support (subscription): Cloudera Navigator (Audit & Access) and Cloudera Manager (Core). CDH: 100% open source. Open source projects]
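
Slide 8 notes that metastore replication has to fix file paths and other cluster-specific information. One way to picture that step is rewriting the NameNode authority in each table location after metadata is imported on the secondary site. The sketch below is only an illustration under stated assumptions: it is not how Cloudera Enterprise BDR implements the fix, and the host names, port, and function names are hypothetical.

```python
from urllib.parse import urlsplit, urlunsplit

# Hypothetical NameNode addresses for the two sites; not taken from the slides.
SOURCE_AUTHORITY = "nn-site-a.example.com:8020"
TARGET_AUTHORITY = "nn-site-b.example.com:8020"


def rewrite_location(location, source=SOURCE_AUTHORITY, target=TARGET_AUTHORITY):
    """Point an hdfs:// URI at the target cluster; leave any other location untouched."""
    parts = urlsplit(location)
    if parts.scheme != "hdfs" or parts.netloc != source:
        return location
    return urlunsplit(parts._replace(netloc=target))


def rewrite_table_locations(tables, source=SOURCE_AUTHORITY, target=TARGET_AUTHORITY):
    """Apply rewrite_location to a mapping of table name -> storage location."""
    return {name: rewrite_location(loc, source, target) for name, loc in tables.items()}


if __name__ == "__main__":
    # Invented table locations, for illustration only.
    tables = {
        "default.orders": "hdfs://nn-site-a.example.com:8020/user/hive/warehouse/orders",
        "default.external_logs": "hdfs://other-cluster.example.com:8020/logs",
    }
    for name, loc in rewrite_table_locations(tables).items():
        print(name, loc)
```

Parsing the URI rather than doing a plain string replace keeps the rewrite limited to the authority component, so a path that happens to contain the source host name is left alone.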