1. HBase on MapR LohitVijayaRenu, MapR Technologies, Inc. HBasecontributor day at Yahoo, June 30 2011
2. Who am I? LohitVijayaRenu, Software Engineer at MapR Technologies (lohit@maprtech.com) MapR Combines the best of the Hadoop community contributions with significant internally financed infrastructure development to provide complete distribution for Apache Hadoop (www.mapr.com)
3. HBase on MapR Backups using Snapshots Performance on MapR Highly available MapR MapR Control System
16. REST API for creation and deletion of snapshotsREAD / WRITE /hbase /hbase/.snapshot/Snapshot20110630 /hbase/.snaphsot/Snapshot20110629 /hbase/.snaphsot/Snapshot3 MapR REDIRECT ON WRITE FOR SNAPSHOT Data Blocks A B C C’ D Snapshot 3 Snapshot 20110629 Snapshot 20110630
17. MapR Snapshots HBase table in DFS Take snapshot on running HBase Restore from snapshot
18. MapR Control System Snapshot information Snapshot Schedules All UI operations have REST APIs More info at www.mapr.com
24. REST API for setup, start and stop mirrorBackup Production Datacenter 2 Datacenter 1 WAN
25. HBase performance "...Initially, when the table was empty I was getting around 300 inserts per second with 50 writing threads. Then, when the region split and a second server was added the rate suddenly jumped to 3000 inserts/sec per server, so ~6000 for the two servers...“ EranKutner "...My scenario is similar, we need under 10k rows, 10-20 columns and which can have thousands of version with value not greater than 300 bytes...Can we get 40-50k records/sec insertion speed in HBase??...“ GauravVashishth Source: hbase-user group
34. HBase High Availability "...In HBase 0.90 I have seen that it has a fault tolerant behavior of triggering lease recovery and closing the file when the writer dies in the middle. Yet does hbase have any workaround/recovery when NameNode is restarted in the middle of the file write(possibly the HLog file , after some syncs)???..." Gokulakannan M source: hbase-user group
35. MapR High Availability No single point of failure Distributed NameNode Automatic and transparent failover Better performance Replicated and persisted to disk Fully distributed and highly scalable Real time HBase on MapR HBASE READ / WRITE MapR (No Single Point of Failure) Node Node Node NN NN NN Node Node Node NN NN NN