Home
Explore
Submit Search
Upload
Login
Signup
Advertisement
Check these out next
DataStax | DataStax Enterprise Advanced Replication (Brian Hess & Cliff Gilmo...
DataStax
Yahoo - Moving beyond running 100% of Apache Pig jobs on Apache Tez
DataWorks Summit
HBaseCon 2012 | HBase for the Worlds Libraries - OCLC
Cloudera, Inc.
Operationalizing YARN based Hadoop Clusters in the Cloud
DataWorks Summit/Hadoop Summit
HBaseCon 2015: Solving HBase Performance Problems with Apache HTrace
HBaseCon
Disaster Recovery and Cloud Migration for your Apache Hive Warehouse
DataWorks Summit
HBaseCon 2013: Using Coprocessors to Index Columns in an Elasticsearch Cluster
Cloudera, Inc.
Stsg17 speaker yousunjeong
Yousun Jeong
1
of
20
Top clipped slide
HBaseCon 2013: Being Smarter Than the Smart Meter
Jul. 9, 2013
•
0 likes
6 likes
×
Be the first to like this
Show More
•
4,266 views
views
×
Total views
0
On Slideshare
0
From embeds
0
Number of embeds
0
Report
Technology
Business
Presented by: Jay Talreja, Oracle
Cloudera, Inc.
Follow
Cloudera, Inc.
Advertisement
Advertisement
Advertisement
Recommended
HBaseCon 2015: Apache Kylin - Extreme OLAP Engine for Hadoop
HBaseCon
3.3K views
•
21 slides
HBaseCon 2012 | Overcoming Data Deluge with HBase to Help Save the Environmen...
Cloudera, Inc.
3.3K views
•
27 slides
Empower Data-Driven Organizations
DataWorks Summit/Hadoop Summit
1.5K views
•
34 slides
HBaseConAsia2018: Track2-5: JanusGraph-Distributed graph database with HBase
Michael Stack
1.1K views
•
38 slides
Tame that Beast
DataWorks Summit/Hadoop Summit
1.6K views
•
35 slides
HBaseCon 2015 General Session: State of HBase
HBaseCon
4.5K views
•
35 slides
More Related Content
Slideshows for you
(20)
DataStax | DataStax Enterprise Advanced Replication (Brian Hess & Cliff Gilmo...
DataStax
•
363 views
Yahoo - Moving beyond running 100% of Apache Pig jobs on Apache Tez
DataWorks Summit
•
420 views
HBaseCon 2012 | HBase for the Worlds Libraries - OCLC
Cloudera, Inc.
•
3.9K views
Operationalizing YARN based Hadoop Clusters in the Cloud
DataWorks Summit/Hadoop Summit
•
1.4K views
HBaseCon 2015: Solving HBase Performance Problems with Apache HTrace
HBaseCon
•
4.5K views
Disaster Recovery and Cloud Migration for your Apache Hive Warehouse
DataWorks Summit
•
504 views
HBaseCon 2013: Using Coprocessors to Index Columns in an Elasticsearch Cluster
Cloudera, Inc.
•
7.5K views
Stsg17 speaker yousunjeong
Yousun Jeong
•
2.4K views
Time-oriented event search. A new level of scale
DataWorks Summit/Hadoop Summit
•
546 views
Powering a Virtual Power Station with Big Data
DataWorks Summit/Hadoop Summit
•
769 views
Using Familiar BI Tools and Hadoop to Analyze Enterprise Networks
DataWorks Summit
•
742 views
Accelerating Apache Hadoop through High-Performance Networking and I/O Techno...
DataWorks Summit/Hadoop Summit
•
1.4K views
Never late again! Job-Level deadline SLOs in YARN
DataWorks Summit
•
320 views
Stinger Initiative - Deep Dive
Hortonworks
•
5.3K views
HBaseCon 2013:High-Throughput, Transactional Stream Processing on Apache HBase
Cloudera, Inc.
•
4.6K views
Dchug m7-30 apr2013
jdfiori
•
955 views
Cloudera Impala
Scott Leberknight
•
5.6K views
Dealing with Changed Data in Hadoop
DataWorks Summit
•
10.1K views
Keep your Hadoop Cluster at its Best
DataWorks Summit/Hadoop Summit
•
2K views
HBase Global Indexing to support large-scale data ingestion at Uber
DataWorks Summit
•
908 views
Viewers also liked
(20)
HBaseCon 2013: Project Valta - A Resource Management Layer over Apache HBase
Cloudera, Inc.
•
3.2K views
HBaseCon 2012 | Living Data: Applying Adaptable Schemas to HBase - Aaron Kimb...
Cloudera, Inc.
•
3.2K views
HBaseCon 2013: 1500 JIRAs in 20 Minutes
Cloudera, Inc.
•
4.1K views
HBaseCon 2012 | Relaxed Transactions for HBase - Francis Liu, Yahoo!
Cloudera, Inc.
•
3.1K views
Tales from the Cloudera Field
HBaseCon
•
4K views
HBaseCon 2013: Apache HBase, Meet Ops. Ops, Meet Apache HBase.
Cloudera, Inc.
•
7.1K views
HBaseCon 2013: Apache Hadoop and Apache HBase for Real-Time Video Analytics
Cloudera, Inc.
•
4.8K views
HBaseCon 2012 | Building Mobile Infrastructure with HBase
Cloudera, Inc.
•
2.6K views
HBaseCon 2015: DeathStar - Easy, Dynamic, Multi-tenant HBase via YARN
HBaseCon
•
2.9K views
HBaseCon 2012 | Content Addressable Storages for Fun and Profit - Berk Demir,...
Cloudera, Inc.
•
3.8K views
HBaseCon 2012 | Leveraging HBase for the World’s Largest Curated Genomic Data...
Cloudera, Inc.
•
3.5K views
HBase Read High Availability Using Timeline-Consistent Region Replicas
HBaseCon
•
4K views
Cross-Site BigTable using HBase
HBaseCon
•
3.5K views
HBaseCon 2015: Trafodion - Integrating Operational SQL into HBase
HBaseCon
•
3.3K views
HBaseCon 2013: Rebuilding for Scale on Apache HBase
Cloudera, Inc.
•
3.9K views
HBaseCon 2012 | Unique Sets on HBase and Hadoop - Elliot Clark, StumbleUpon
Cloudera, Inc.
•
3.4K views
HBaseCon 2013: Evolving a First-Generation Apache HBase Deployment to Second...
Cloudera, Inc.
•
4.1K views
HBaseCon 2013: Apache HBase on Flash
Cloudera, Inc.
•
4.2K views
HBaseCon 2012 | Scaling GIS In Three Acts
Cloudera, Inc.
•
3.6K views
HBaseCon 2013: ETL for Apache HBase
Cloudera, Inc.
•
6.9K views
Advertisement
Similar to HBaseCon 2013: Being Smarter Than the Smart Meter
(20)
2013 05 Oracle big_dataapplianceoverview
jdijcks
•
912 views
A Pipeline for Distributed Topic and Sentiment Analysis of Tweets on Pivotal ...
Srivatsan Ramanujam
•
8.3K views
Expand a Data warehouse with Hadoop and Big Data
jdijcks
•
3.7K views
Greenplum Architecture
Alexey Grishchenko
•
6.5K views
Simplifying Real-Time Architectures for IoT with Apache Kudu
Cloudera, Inc.
•
4.1K views
BDE SC3.3 Workshop - Options for Wind Farm performance assessment and Power f...
BigData_Europe
•
158 views
Big data oracle_introduccion
Fran Navarro
•
1.6K views
Amplitude wave architecture - Test
Kiran Naiga
•
266 views
6 enriching your data warehouse with big data and hadoop
Dr. Wilfred Lin (Ph.D.)
•
3.1K views
Bilbao oracle12c keynote
Aitor Ibañez
•
479 views
InfoSphere BigInsights
Wilfried Hoge
•
10.1K views
CS-Op Analytics
Cloudera, Inc.
•
1K views
Fast Range Aggregate Queries for Big Data Analysis
IRJET Journal
•
34 views
Aucfanlab Datalake - Big Data Management Platform -
Aucfan
•
601 views
Webinar future dataintegration-datamesh-and-goldengatekafka
Jeffrey T. Pollock
•
990 views
Anna Vergeles, Nataliia Manakova "Unsupervised Real-Time Stream-Based Novelty...
Fwdays
•
202 views
Oracle Big Data Appliance and Big Data SQL for advanced analytics
jdijcks
•
3K views
BIG DATA ANALYTICS MEANS “IN-DATABASE” ANALYTICS
TIBCO Spotfire
•
2.2K views
Melbourne: Certus Data 2.0 Vault Meetup with Snowflake - Data Vault In The Cl...
Certus Solutions
•
1.3K views
Solution Use Case Demo: The Power of Relationships in Your Big Data
InfiniteGraph
•
3.2K views
More from Cloudera, Inc.
(20)
Partner Briefing_January 25 (FINAL).pptx
Cloudera, Inc.
•
78 views
Cloudera Data Impact Awards 2021 - Finalists
Cloudera, Inc.
•
5.6K views
2020 Cloudera Data Impact Awards Finalists
Cloudera, Inc.
•
6.2K views
Edc event vienna presentation 1 oct 2019
Cloudera, Inc.
•
4.5K views
Machine Learning with Limited Labeled Data 4/3/19
Cloudera, Inc.
•
3.6K views
Data Driven With the Cloudera Modern Data Warehouse 3.19.19
Cloudera, Inc.
•
2.5K views
Introducing Cloudera DataFlow (CDF) 2.13.19
Cloudera, Inc.
•
4.8K views
Introducing Cloudera Data Science Workbench for HDP 2.12.19
Cloudera, Inc.
•
2.7K views
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Cloudera, Inc.
•
1.6K views
Leveraging the cloud for analytics and machine learning 1.29.19
Cloudera, Inc.
•
1.6K views
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
Cloudera, Inc.
•
2.5K views
Leveraging the Cloud for Big Data Analytics 12.11.18
Cloudera, Inc.
•
1.7K views
Modern Data Warehouse Fundamentals Part 3
Cloudera, Inc.
•
1.3K views
Modern Data Warehouse Fundamentals Part 2
Cloudera, Inc.
•
2.3K views
Modern Data Warehouse Fundamentals Part 1
Cloudera, Inc.
•
1.5K views
Extending Cloudera SDX beyond the Platform
Cloudera, Inc.
•
957 views
Federated Learning: ML with Privacy on the Edge 11.15.18
Cloudera, Inc.
•
2.2K views
Analyst Webinar: Doing a 180 on Customer 360
Cloudera, Inc.
•
1.4K views
Build a modern platform for anti-money laundering 9.19.18
Cloudera, Inc.
•
1K views
Introducing the data science sandbox as a service 8.30.18
Cloudera, Inc.
•
1.2K views
Advertisement
Recently uploaded
(20)
How to Build Real-Time Analytics Applications like Netflix, Confluent, and Re...
confluent
•
0 views
Stream Processing with Flink and Stream Sharing
confluent
•
0 views
Real-time fraud detection
confluent
•
0 views
Rive
Artmiker Studios
•
0 views
Smart Digital Receipts OnePass
confluent
•
0 views
Seize Success: Offshore Development Services in Mumbai | Unlock the Benefits ...
Sagar Salvi
•
0 views
MIE20232.pptx
Institute of Information Systems (HES-SO)
•
0 views
Data in Motion Tour ANZ Sydney 2023 Keynote.pdf
confluent
•
0 views
DS Fusion CE - External Transactions.pptx
VatsalaC1
•
0 views
Biking on the edge - Jerome Mies - SRD23
SURFevents
•
0 views
finalppt-150606051347-lva1-app6892.pptx
AJAYVISHALRP
•
0 views
Magento development company in Birmingham.pdf
IosAndWeb Technologies
•
0 views
What's new in web in 2023
RajeshKumar825078
•
0 views
What is the Structure and Working Principle of WDM Devices.pdf
HYC Co., Ltd
•
0 views
Monitoring Oceans - Chris Atherton - SRD23
SURFevents
•
0 views
The CAFE community: a local, inclusive programming community for researchers ...
SURFevents
•
0 views
Data Governance: From speed dating to lifelong partnership
Precisely
•
0 views
normal vs. cute.pptx
ShaliniSreedharan1
•
0 views
How Can ISO/IEC 27001 Help Organizations Align With the EU Cybersecurity Regu...
PECB
•
0 views
AIDRC Gitex Africa - Final edited_TL.pdf
Thierry Lestable
•
0 views
HBaseCon 2013: Being Smarter Than the Smart Meter
Copyright © 2013,
Oracle and/or its affiliates. All rights reserved.1 Being Smarter than the Smart Meter - Cloud Operational Grid Analytics Jay Talreja Senior Manager, Software Development UGBU
2 Copyright ©
2013 Oracle and/or its affiliates. All rights reserved. Agenda Introduction Smart Meters & Grid Analytics Data Model Filters, DataSets & Analytical Calc Engine Operations Conclusion Q & A
3 Copyright ©
2013 Oracle and/or its affiliates. All rights reserved. Introduction Startup that pioneered Analytics within the Smart Grid/Utilities Domain 2007 2010 2012
4 Copyright ©
2013 Oracle and/or its affiliates. All rights reserved. Introduction Early adopters of HBase Python with Thrift 2007 2010 2012
5 Copyright ©
2013 Oracle and/or its affiliates. All rights reserved. Introduction Acquired by Oracle in December 2012 2007 2010 2012
6 Copyright ©
2013 Oracle and/or its affiliates. All rights reserved. Smart Meter & Grid Analytics Smart Meter Characteristics: Sub daily energy reading (1 Min/5 Min/15 Min/30 Min/Hourly) Time Series - highly granular data Two Way Automated Communication Power Outage Notification Power Quality Monitoring Smart Meter Promises: Enable dynamic pricing Improved Outage Management Empower the end user with detailed energy usage
7 Copyright ©
2013 Oracle and/or its affiliates. All rights reserved. Smart Meter & Grid Analytics Smart Metering dawns a new age in Grid Analytics Real Time Accessibility Data explosion (~ 1000 fold increase in data collection) What makes the grid smart ? Not Big Data and the capacity to store it But the capability to identity patterns hidden within the terabytes of data What is needed then is a Smart Grid platform that can: Help identify theft Help predict system failures before they occur Help utilities better manage their operations
8 Copyright ©
2013 Oracle and/or its affiliates. All rights reserved. Smart Meter & Grid Analytics How can I accurately identify homes that are consuming more energy than their neighbors ? It’s going to be a hot summer this year !! Are my devices sized correctly ? No blackouts please !! Revenue loss (due to theft) is a growing concern. How can I identify theft patterns now that I have smart meters ? Distribution Planning Revenue Protection Energy Efficiency
9 Copyright ©
2013 Oracle and/or its affiliates. All rights reserved. Smart Meter & Grid Analytics The Smart Meter Analytics platform that we built is: Cloud based – results delivered via the Web Supports sub second (~ 100 ms) data retrieval that powers the User Interface and enables data visualization Supports batch based analytics We have developed our own distributed framework in Python All interaction with Hbase is via the Thrift API Highly Configurable Allows analysts and other non technical groups to implement generic algorithms Shared middle tier that serves both the User Interface and allows exploratory analytics
10 Copyright ©
2013 Oracle and/or its affiliates. All rights reserved. With HBase and it’s distributed storage the platform can easily scale to meet the analytics needs of the biggest utilities across the globe Smart Meter & Grid Analytics The biggest cluster (16 Nodes) (so far) has 2 years worth of history for 4 Million Smart meters 25 TB ~ 7 Billion Rows ~ 500 Billion Values Multiple clusters – shared as well as dedicated All new clusters to run on Oracle’s Big Data Appliance
11 Copyright ©
2013 Oracle and/or its affiliates. All rights reserved. Smart Meter Analytics Platform HBase (CDH4) HDFS (Oracle Hadoop -CDH4) E T L Dataset (Configurable Query API) Analytic Engine User Interface Smart Grid Data Weather 3rd Party Data Export back to Smart Grid Filter
12 Copyright ©
2013 Oracle and/or its affiliates. All rights reserved. Data Model Fact Point Time Value • Abstract data model • Generic • Extensible • Storage in HBase mirrors the Access Patterns Fact Point Time Value Hourly kWh Meter xyz June 13th 2013 13:00 0.0875 Hourly Temp. F KDCA June 13th 2013 13:00 75 Power Out Event Meter xyz June 13th 2013 13:00 NULL
13 Copyright ©
2013 Oracle and/or its affiliates. All rights reserved. Filters, DataSets & Analytical Calc Engine NO SQL Data Sets Dataset and Filters provide a configurable querying capability that provides fast access to the terabytes of time series data stored in HBase
14 Copyright ©
2013 Oracle and/or its affiliates. All rights reserved. Analytical CalcDB Routines Filters, DataSets & Analytical Calc Engine SELECT WHERE GROUP BY DATASET FIELDS FILTER COMPONENTS TIME WINDOWS /METRICS
15 Copyright ©
2013 Oracle and/or its affiliates. All rights reserved. Filters, DataSets & Analytical Calc Engine • Filters are akin to the WHERE clause of a SQL query • Data driven and configurable • In Batch analytics • To operate on a subset of the population that satisfy filtering criteria • In the User Interface • Visualize data for select points that satisfy the filtering criteria Find all meters where the hourly consumption for last week is between 0.5 and 1 kWh ? Also, only find meters that are in my zip code
16 Copyright ©
2013 Oracle and/or its affiliates. All rights reserved. Filters, DataSets & Analytical Calc Engine • Operate on the point population as determined by the Filter • Consist of fields (like columns in a SELECT query) • Each dataset field can look at data for a different time period (time windows) and aggregate it to a single value (Aggregate Functions) • Datasets support aggregate metrics out of the box • e.g. SUM/MIN/MAX/STD DEV./Nth Metrics etc For the meters that I selected, give me the average daily consumption over the past week, past year and last summer
17 Copyright ©
2013 Oracle and/or its affiliates. All rights reserved. Filters, DataSets & Analytical Calc Engine • The Analytic calc engine can be compared to a DB routine (function/stored procedure) • A calc follows a graphical execution model and lets developers implement custom logic • Allow complex analytics to be run and let data be saved back to Hbase Compare the average consumption and save only meters where last summer’s consumption was greater
18 Copyright ©
2013 Oracle and/or its affiliates. All rights reserved. Cluster Operations
19 Copyright ©
2013 Oracle and/or its affiliates. All rights reserved. Conclusion The generic data model in conjunction with HBase’s schema less storage has enabled us to build a Smart Meter Analytics platform that can scale up to meet the Big Data needs in the Smart Grid/Utilities Domain By mapping storage in HBase to the data access patterns we have successfully used the platform to serve both real time and batch analytics Filters ,DataSets offer a powerful, expressive, configuration based querying capability and attempt to bridge the NoSQL gap From our experience, Hbase has proven to be a robust, resilient distributed data storage with low latency random access easily manageable by a few developers
20 Copyright ©
2013 Oracle and/or its affiliates. All rights reserved. Questions ?
Advertisement