Successfully reported this slideshow.
We use your LinkedIn profile and activity data to personalize ads and to show you more relevant ads. You can change your ad preferences anytime.

Managing the Dewey Decimal System

OCLC has been using HBase since 2012 to enable single-search-box access to over a billion items from your library and the world’s library collection. This talk will provide an overview of how HBase is structured to provide this information and some of the challenges they have encountered to scale to support the world catalog and how they have overcome them.

  • Login to see the comments

  • Be the first to like this

Managing the Dewey Decimal System

  1. 1. Confidential – Restricted Cloudera’s Vision for HBase Krishna Maheshwari Director, Product Management
  2. 2. Confidential – Restricted 2 Where are we today With bulleted list • #17 DBMS by popularity1, #5 by revenue2 • Large ecosystem (Nifi, Kafka, Sqoop, Hive, Impala, SOLR, Ranger, Atlas, etc) • Supports NoSQL, SQL, Geospatual, Graph, TimeSeries, Key Value and other use cases • Sold by: Cloudera, IBM, Microsoft, Amazon, Teradata, Oracle and more 1. As per db-engines 2. Cloudera anlaysis
  3. 3. Confidential – Restricted 3 What has HBase enabled? • Operationalizing ML / AI to revolutionize healthcare, public utilities, etc • Serving webscale content • Empowering big data analytics for operational and offline uses • Acting as a resilient store of record
  4. 4. Confidential – Restricted 4 What’s changed since HBase began • Acceptable trade-offs – Agility vs ownership – Simplicity vs control • Infrastructure as code • Rise of “HTAP” systems • Everyone offers NoSQL Big data getting bigger
  5. 5. Confidential – Restricted 5 Next 10 years • Auto-resiliency, auto-scaling • Self-optimization through AI/ML • Multi-modal • Performance
  6. 6. Confidential – Restricted 6 User complaints can act as guideposts • Hard to setup • Complex to configure and tune • Not quite multi-tenant • Slow at analytics • Doesn’t scale-up
  7. 7. Confidential – Restricted 7 Where will Cloudera focus? • Operational use cases • Integration • Infrastructure as code • Performance
  8. 8. Confidential – Restricted THANK YOU