The Impact of Cloud Computing on Predictive Analytics (7-29-09, v5), by Robert Grossman
This is a talk I gave in San Diego on July 29, 2009 explaining some of the impact and some of the opportunities of cloud computing on predictive analytics.
Apache SystemML Optimizer and Runtime Techniques, by Matthias Boehm and Arvind Surve
This deck describes general framework techniques for large-scale machine learning systems. It explains Apache SystemML-specific optimizer and runtime techniques, describing data structures, DAG compilation, operator selection (including fused operators), dynamic recompilation, inter-procedural analysis, and some ongoing research projects.
ABSTRACT: In the field of computer science known as "machine learning," a computer makes predictions about the tasks it will perform next by examining the data given to it. The computer can access data by interacting with the environment or by using digitized training sets. In contrast to static programming algorithms, which require explicit human guidance, machine learning algorithms can learn from data and generate predictions on their own. Various supervised and unsupervised strategies, including rule-based, logic-based, instance-based, and stochastic techniques, have been proposed to solve problems. Our paper's main goal is to present a comprehensive comparison of various cutting-edge supervised machine learning techniques.
Oftentimes data scientists have specific modeling problems that call for highly customized solutions, which can lead to writing new optimization routines. In this talk we will discuss writing large-scale optimization algorithms in Python.
Starting from a quick review of the math behind convex optimization, we will implement some common algorithms with custom tweaks, first in NumPy and then at scale with Dask arrays. Leveraging the distributed Dask scheduler, we will also look at asynchronous variants of these algorithms. While looking at these implementations, we will discuss the challenges of properly testing optimization routines. The focus will be on applications to large-scale generalized linear models and will include a demo of the currently-in-development dask-glm project. We will end with some benchmarks comparing dask-glm with the SciPy stack (statsmodels, scikit-learn) as well as other popular big data tools such as H2O. This talk is written from the perspective of a data scientist, not a nuts-and-bolts computer scientist, and so is focused on customizing and extending the SciPy stack for large-scale data science problems.
This talk will be co-presented by Chris White (Capital One) and Hussain Sultan (AQN Strategies).
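As a flavor of the kind of routine the talk describes, here is a minimal NumPy sketch of batch gradient descent for logistic regression. The function name and toy data are illustrative, and this is not dask-glm's implementation; at scale, the NumPy arrays would be swapped for their dask.array equivalents.

```python
import numpy as np

def fit_logistic_gd(X, y, lr=0.1, n_iter=500):
    """Fit logistic regression by batch gradient descent.

    X : (n, p) feature matrix; y : (n,) labels in {0, 1}.
    Returns the weight vector found for the (unregularized) mean log loss.
    """
    n, p = X.shape
    w = np.zeros(p)
    for _ in range(n_iter):
        preds = 1.0 / (1.0 + np.exp(-(X @ w)))  # sigmoid of the linear score
        grad = X.T @ (preds - y) / n            # gradient of the mean log loss
        w -= lr * grad
    return w

# Toy usage: a linearly separable 1-D problem with an intercept column.
rng = np.random.default_rng(0)
x = rng.normal(size=100)
y = (x > 0).astype(float)
X = np.column_stack([np.ones_like(x), x])
w = fit_logistic_gd(X, y)
```

Testing such a routine is subtle, as the talk notes: rather than comparing raw coefficients across solvers, one usually checks that the achieved loss is close to a trusted reference and that the gradient at the solution is small.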
Comparative study of optimization algorithms on convolutional network for aut..., by IJECEIAES
The last 10 years have been the decade of autonomous vehicles. Advances in intelligent sensors and control schemes have shown the possibility of real applications. Deep learning, and in particular convolutional networks, have become a fundamental tool in the solution of problems related to environment identification, path planning, vehicle behavior, and motion control. In this paper, we perform a comparative study of the most used optimization strategies on the convolutional architecture residual neural network (ResNet) for an autonomous driving problem, as a previous step to the development of an intelligent sensor. This sensor, part of our research in reactive systems for autonomous vehicles, aims to become a system for direct mapping of sensory information to control actions from real-time images of the environment. The optimization techniques analyzed include stochastic gradient descent (SGD), adaptive gradient (Adagrad), adaptive learning rate (Adadelta), root mean square propagation (RMSProp), Adamax, adaptive moment estimation (Adam), Nesterov-accelerated adaptive moment estimation (Nadam), and follow the regularized leader (Ftrl). The training of the deep model is evaluated in terms of convergence, accuracy, recall, and F1-score metrics. Preliminary results show a better performance of the deep network when using the SGD function as an optimizer, while the Ftrl function presents the poorest performance.
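For concreteness, the update rules of two of the optimizers compared (SGD and Adam) can be written out directly. This is a standalone NumPy sketch on a toy quadratic, not the paper's ResNet setup; the hyperparameter values are the usual published defaults.

```python
import numpy as np

def sgd_step(w, grad, lr=0.1):
    """Plain stochastic gradient descent update."""
    return w - lr * grad

def adam_step(w, grad, state, lr=0.1, b1=0.9, b2=0.999, eps=1e-8):
    """Adam update: bias-corrected running moments of the gradient."""
    m, v, t = state
    t += 1
    m = b1 * m + (1 - b1) * grad         # first-moment (mean) estimate
    v = b2 * v + (1 - b2) * grad ** 2    # second-moment estimate
    m_hat = m / (1 - b1 ** t)            # bias corrections
    v_hat = v / (1 - b2 ** t)
    return w - lr * m_hat / (np.sqrt(v_hat) + eps), (m, v, t)

# Minimize f(w) = 0.5 * ||w||^2, whose gradient is simply w.
w_sgd = np.array([5.0, -3.0])
w_adam = w_sgd.copy()
state = (np.zeros(2), np.zeros(2), 0)
for _ in range(100):
    w_sgd = sgd_step(w_sgd, w_sgd)
    w_adam, state = adam_step(w_adam, w_adam, state)
```

On this trivial convex problem both optimizers drive the weights toward zero; the paper's finding that SGD wins on their ResNet task is specific to that setting and not something a toy example can reproduce.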
Identifying intersections among a set of d-dimensional rectangular regions (d-rectangles) is a common problem in many simulation and modeling applications. Since algorithms for computing intersections over a large number of regions can be computationally demanding, an obvious solution is to take advantage of the multiprocessing capabilities of modern multicore processors. Unfortunately, many solutions employed for the Data Distribution Management service of the High Level Architecture are either inefficient, or can only partially be parallelized. In this paper we propose the Interval Tree Matching (ITM) algorithm for computing intersections among d-rectangles. ITM is based on a simple Interval Tree data structure, and exhibits an embarrassingly parallel structure. We implement the ITM algorithm, and compare its sequential performance with two widely used solutions (brute force and sort-based matching). We also analyze the scalability of ITM on shared-memory multicore processors. The results show that the sequential implementation of ITM is competitive with sort-based matching; moreover, the parallel implementation provides good speedup on multicore processors.
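To make the problem concrete, here is a sketch of the brute-force baseline the paper compares against (not ITM itself): two d-rectangles intersect iff their extents overlap in every dimension, and the baseline simply tests all pairs.

```python
from itertools import combinations

def overlaps(a, b):
    """True if two d-rectangles intersect.

    Each rectangle is a sequence of (lo, hi) intervals, one per dimension;
    rectangles intersect iff their intervals overlap in every dimension.
    """
    return all(alo <= bhi and blo <= ahi
               for (alo, ahi), (blo, bhi) in zip(a, b))

def brute_force_matching(rects):
    """All intersecting pairs, by index: O(n^2) pairwise tests."""
    return [(i, j)
            for (i, ra), (j, rb) in combinations(enumerate(rects), 2)
            if overlaps(ra, rb)]

# Three 2-D rectangles: 0 and 1 overlap, 2 is disjoint from both.
rects = [[(0, 2), (0, 2)], [(1, 3), (1, 3)], [(5, 6), (5, 6)]]
pairs = brute_force_matching(rects)
```

The quadratic pair loop is exactly what ITM's interval-tree lookup avoids: each 1-D overlap query against the tree costs O(log n + k) for k reported matches, and queries for different rectangles are independent, which is what makes the algorithm embarrassingly parallel.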
The GREDOR project. Redesigning the decision chain for managing distribution ..., by Université de Liège (ULg)
This presentation proposes an integrated methodology for redesigning the decision chain in distribution networks for integrating renewable energy and demand side management.
The International Journal of Computational Engineering Research (IJCER) is an international online journal published monthly in English. The journal publishes original research work that contributes significantly to scientific knowledge in engineering and technology.
Presentation from the EPRI-Sandia Symposium on Secure and Resilient Microgrids: Microgrid Design Toolkit, presented by John Eddy, Sandia National Laboratories, Baltimore, MD, August 29-31, 2016.
This presentation explores a brief idea about the structural and functional attributes of nucleotides, the structure and function of genetic materials along with the impact of UV rays and pH upon them.
DERIVATION OF MODIFIED BERNOULLI EQUATION WITH VISCOUS EFFECTS AND TERMINAL V..., by Wasswaderrick3
In this book, we use conservation-of-energy techniques on a fluid element to derive the Modified Bernoulli equation of flow with viscous or friction effects. We derive the general equation of flow/velocity and then from this we derive the Poiseuille flow equation, the transition flow equation, and the turbulent flow equation. In situations where there are no viscous effects, the equation reduces to the Bernoulli equation. From experimental results, we are able to include other terms in the Bernoulli equation. We also look at cases where pressure gradients exist. We use the Modified Bernoulli equation to derive equations of flow rate for pipes of different cross-sectional areas connected together. We also extend our techniques of energy conservation to a sphere falling in a viscous medium under the effect of gravity. We demonstrate Stokes' equation of terminal velocity and the turbulent flow equation. We look at a way of calculating the time taken for a body to fall in a viscous medium, and at the general equation of terminal velocity.
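For reference, the standard textbook forms of the key results the abstract mentions are below; the notation here is generic and may well differ from the book's own derivation.

```latex
% Bernoulli equation between stations 1 and 2, extended with a loss term
% \Delta P_{loss} for viscous (friction) effects; \Delta P_{loss} = 0
% recovers the ordinary Bernoulli equation.
P_1 + \tfrac{1}{2}\rho v_1^2 + \rho g z_1
    = P_2 + \tfrac{1}{2}\rho v_2^2 + \rho g z_2 + \Delta P_{loss}

% For fully developed laminar pipe flow, the loss term is the
% Hagen-Poiseuille pressure drop (mean velocity \bar{v}, length L,
% diameter d, dynamic viscosity \mu):
\Delta P = \frac{32\,\mu\,L\,\bar{v}}{d^2}

% Stokes terminal velocity of a sphere of radius r and density \rho_s
% falling in a fluid of density \rho_f:
v_t = \frac{2\,r^2\,(\rho_s - \rho_f)\,g}{9\,\mu}
```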
Nutraceutical market, scope and growth: Herbal drug technology, by Lokesh Patil
As consumer awareness of health and wellness rises, the nutraceutical market, which includes goods like functional meals, drinks, and dietary supplements that provide health advantages beyond basic nutrition, is growing significantly. As healthcare expenses rise, the population ages, and people increasingly want natural and preventative health solutions, this industry is expanding quickly. Product formulation innovations and the use of cutting-edge technology for customized nutrition are further driving market expansion. With its worldwide reach, the nutraceutical industry is expected to keep growing and provide significant opportunities for research and investment in a number of categories, including vitamins, minerals, probiotics, and herbal supplements.
Phenomics assisted breeding in crop improvement, by IshaGoswami9
The global population is increasing and will reach about 9 billion by 2050; with climate change, it will be difficult to meet the food requirements of such a large population. Facing the challenges presented by resource shortages, climate change, and an increasing global population, crop yield and quality need to be improved in a sustainable way over the coming decades. Genetic improvement by breeding is the best way to increase crop productivity. With the rapid progression of functional genomics, an increasing number of crop genomes have been sequenced and dozens of genes influencing key agronomic traits have been identified. However, current genome sequence information has not been adequately exploited for understanding the complex characteristics of multiple genes, owing to a lack of crop phenotypic data. Efficient, automatic, and accurate technologies and platforms that can capture phenotypic data linkable to genomics information for crop improvement at all growth stages have become as important as genotyping. Thus, high-throughput phenotyping has become the major bottleneck restricting crop breeding. Plant phenomics has been defined as the high-throughput, accurate acquisition and analysis of multi-dimensional phenotypes during crop growing stages at the organism level, including the cell, tissue, organ, individual plant, plot, and field levels. With the rapid development of novel sensors, imaging technology, and analysis methods, numerous infrastructure platforms have been developed for phenotyping.
Seminar of U.V. Spectroscopy, by SAMIR PANDA
Spectroscopy is a branch of science dealing with the study of the interaction of electromagnetic radiation with matter.
Ultraviolet-visible spectroscopy refers to absorption spectroscopy or reflectance spectroscopy in the UV-VIS spectral region.
Ultraviolet-visible spectroscopy is an analytical method that can measure the amount of light absorbed by the analyte.
Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a..., by Ana Luísa Pinho
Functional Magnetic Resonance Imaging (fMRI) provides means to characterize brain activations in response to behavior. However, cognitive neuroscience has been limited to group-level effects referring to the performance of specific tasks. To obtain the functional profile of elementary cognitive mechanisms, the combination of brain responses to many tasks is required. Yet, to date, both structural atlases and parcellation-based activations do not fully account for cognitive function and still present several limitations. Further, they do not adapt overall to individual characteristics. In this talk, I will give an account of deep-behavioral phenotyping strategies, namely data-driven methods in large task-fMRI datasets, to optimize functional brain-data collection and improve inference of effects-of-interest related to mental processes. Key to this approach is the employment of fast multi-functional paradigms rich in features that can be well parametrized and, consequently, facilitate the creation of psycho-physiological constructs to be modelled with imaging data. Particular emphasis will be given to music stimuli when studying high-order cognitive mechanisms, due to their ecological nature and their capacity to enable complex behavior compounded of discrete entities. I will also discuss how deep-behavioral phenotyping and individualized models applied to neuroimaging data can better account for the subject-specific organization of domain-general cognitive systems in the human brain. Finally, the accumulation of functional brain signatures brings the possibility to clarify relationships among tasks and create a univocal link between brain systems and mental functions through: (1) the development of ontologies proposing an organization of cognitive processes; and (2) brain-network taxonomies describing functional specialization.
To this end, tools to improve commensurability in cognitive science are necessary, such as public repositories, ontology-based platforms and automated meta-analysis tools. I will thus discuss some brain-atlasing resources currently under development, and their applicability in cognitive as well as clinical neuroscience.
The ability to recreate computational results with minimal effort and actionable metrics provides a solid foundation for scientific research and software development. When people can replicate an analysis at the touch of a button using open-source software, open data, and methods to assess and compare proposals, it significantly eases verification of results, engagement with a diverse range of contributors, and progress. However, we have yet to fully achieve this; there are still many sociotechnical frictions.
Inspired by David Donoho's vision, this talk aims to revisit the three crucial pillars of frictionless reproducibility (data sharing, code sharing, and competitive challenges) with the perspective of deep software variability.
Our observation is that multiple layers — hardware, operating systems, third-party libraries, software versions, input data, compile-time options, and parameters — are subject to variability that exacerbates frictions but is also essential for achieving robust, generalizable results and fostering innovation. I will first review the literature, providing evidence of how the complex variability interactions across these layers affect qualitative and quantitative software properties, thereby complicating the reproduction and replication of scientific studies in various fields.
I will then present some software engineering and AI techniques that can support the strategic exploration of variability spaces. These include the use of abstractions and models (e.g., feature models), sampling strategies (e.g., uniform, random), cost-effective measurements (e.g., incremental build of software configurations), and dimensionality reduction methods (e.g., transfer learning, feature selection, software debloating).
I will finally argue that deep variability is both the problem and solution of frictionless reproducibility, calling the software science community to develop new methods and tools to manage variability and foster reproducibility in software systems.
Invited talk, Journées Nationales du GDR GPL 2024
4. How do commercial buildings work?
- Facility managers (FMs) oversee the day-to-day operations of a commercial building.
- This used to be the job of a whole team!
- Shrinking maintenance budgets and increasing complexity make this a challenging problem.
- Building operations are automated by Building Management Systems (BMSs).
5. What is a BMS?
- Commercial buildings contain Building Management Systems (BMSs) to improve indoor environment quality and reduce energy consumption.
- A BMS will control heating, cooling, ventilation and lighting systems.
- BMSs contain thousands of points for sensors (temperature, humidity), actuators (fans, motors, dampers) and software (schedules, trend logs, calculations).
- A BMS will monitor sensors and adjust actuators based on their readings.
- For example, if high temperatures are recorded in a room, dampers will open and air handlers will modulate to provide cooler air.
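The damper rule described above can be illustrated as a toy control step. The setpoint, deadband, and return values here are invented for illustration only; real BMS logic is vendor-specific and far richer.

```python
def control_step(zone_temp, setpoint=22.0, deadband=1.0):
    """One pass of a toy BMS rule: map a temperature reading to commands.

    Returns (damper_position, cooling_on). All thresholds are illustrative.
    """
    if zone_temp > setpoint + deadband:   # too warm: open damper, cool
        return 1.0, True
    if zone_temp < setpoint - deadband:   # too cool: close damper
        return 0.0, False
    return 0.5, False                     # within deadband: hold position

# A warm, a comfortable, and a cold reading.
commands = [control_step(t) for t in (25.1, 21.9, 19.4)]
```

A locked-open heating valve, as in the example two slides on, defeats exactly this kind of rule: the controller keeps commanding cooling to fight a fault it cannot see.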
7. How does this work in practice?
- A vendor sets up a BMS. The BMS will behave in a certain way based on predefined rules.
- BMS systems are costly to implement and to modify. Changing a BMS's behaviour can require a lot of coding.
- The bigger the BMS, the harder it is to find what matters. Locating problems is difficult and time-consuming.
- For example, a heating valve might be locked open. If this isn't detected, the BMS will cool the room to reach the required temperature.
8. So what can we do?
Buildings Alive's goal is to help facility managers identify if a BMS is operating optimally:
- Fault detection
- Diagnostics
To do this, we:
- Collect BMS data using our E2 device
- Analyse and transform data into useful information
- Help guide FMs to find out what's wrong
- Provide timely and actionable information
12. Feature generation
- We are dealing with thousands of unevenly spaced time-series.
- Uneven spacing in time-series presents difficulties.
- Rather than rounding or imputing data, we can generate features and work with them instead.
13. What features might be useful?
Feature generation for time-series clustering is discussed in Wang, Smith, and Hyndman (2006). Some useful features for our case might be:
- Mean
- Standard deviation
- Kurtosis
- Skewness
- Biggest change: $\max_i |y_{t_i} - y_{t_{i-1}}|$
- Smallest change: $\min_i |y_{t_i} - y_{t_{i-1}}|$
- Number of "mean crossings" per day

Normalise these features using their median $M$ and interquartile range $\mathrm{IQR}$: $y^* = (y - M)/\mathrm{IQR}$.
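Assuming each series arrives as a plain array of values (uneven spacing is fine, since none of these features resample), the feature list above can be computed directly in NumPy; the function names here are illustrative.

```python
import numpy as np

def series_features(y):
    """Summary features of a 1-D series, per the slide's feature list."""
    y = np.asarray(y, dtype=float)
    d = np.abs(np.diff(y))          # absolute successive changes
    centred = y - y.mean()
    std = y.std()
    return {
        "mean": y.mean(),
        "std": std,
        "skewness": (centred ** 3).mean() / std ** 3,
        "kurtosis": (centred ** 4).mean() / std ** 4,
        "biggest_change": d.max(),
        "smallest_change": d.min(),
        # sign changes of the mean-centred series
        "mean_crossings": int(np.sum(np.diff(np.sign(centred)) != 0)),
    }

def robust_scale(x):
    """Normalise one feature across series: y* = (y - M) / IQR."""
    x = np.asarray(x, dtype=float)
    q1, med, q3 = np.percentile(x, [25, 50, 75])
    return (x - med) / (q3 - q1)

feats = series_features([1.0, 3.0, 2.0, 5.0, 4.0])
scaled = robust_scale([1.0, 2.0, 3.0, 4.0])
```

The median/IQR scaling is chosen over mean/standard-deviation scaling because sensor data is full of outliers, and robust statistics keep one stuck sensor from dominating a feature's scale.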
16. Dimension reduction and clustering
- Too many sensors to visualise easily.
- Use dimensionality reduction.
- Identify clusters and singletons.
17. Which clustering algorithm?

Method | Advantages | Disadvantages
K-means | Easy to learn. | Outperformed by other algorithms.
Hierarchical clustering | Informative: produces a dendrogram. | Not suitable for large data sets: $O(n^2 \log n)$ time complexity.
Affinity propagation | Automatically determines the number of clusters. | Not suitable for large data sets: $O(n^2 t)$ time complexity.
Spectral clustering | Good performance. | See Nadler and Galun (2007). $O(n^3)$ time complexity.
19. Obligatory mathematics slide
Spectral clustering
We are given $n$ points $x_i \in \mathbb{R}^p$ and a similarity matrix $S$. Define the weight matrix $W$, degree matrix $D$ and graph Laplacian $L$ as
$$W = (w_{ij}) \in \mathbb{R}^{n \times n}, \qquad D = \mathrm{diag}(d_i), \qquad L = D - W,$$
where:
- $w_{ij}$ is the weight between nodes $i$ and $j$ based on $S$, and
- $d_i = \sum_{j=1}^{n} w_{ij}$ is the weighted degree of node $i$.
Once $L$ is determined, find the $m$ eigenvectors $Z_{n \times m}$ corresponding to the $m$ smallest eigenvalues of $L$. Finally, cluster the rows of $Z_{n \times m}$ using K-means.
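The recipe on this slide translates almost line for line into NumPy. Below is a minimal sketch, assuming a Gaussian similarity, the unnormalised Laplacian, and a tiny Lloyd's k-means with farthest-point initialisation; in practice one would reach for a library implementation such as scikit-learn's SpectralClustering.

```python
import numpy as np

def spectral_cluster(X, m, sigma=1.0, n_iter=50):
    """Cluster the rows of X into m groups via the unnormalised Laplacian."""
    n = len(X)
    # Similarity / weight matrix: w_ij = exp(-||x_i - x_j||^2 / (2 sigma^2)).
    sq = ((X[:, None, :] - X[None, :, :]) ** 2).sum(-1)
    W = np.exp(-sq / (2 * sigma ** 2))
    D = np.diag(W.sum(axis=1))        # degree matrix, d_i = sum_j w_ij
    L = D - W                         # graph Laplacian
    # Eigenvectors for the m smallest eigenvalues (eigh sorts ascending).
    _, vecs = np.linalg.eigh(L)
    Z = vecs[:, :m]                   # the n x m spectral embedding
    # Lloyd's k-means on the rows of Z, farthest-point initialisation.
    idx = [0]
    for _ in range(1, m):
        d = ((Z[:, None, :] - Z[idx]) ** 2).sum(-1).min(axis=1)
        idx.append(int(d.argmax()))
    centers = Z[idx]
    for _ in range(n_iter):
        labels = ((Z[:, None, :] - centers[None]) ** 2).sum(-1).argmin(axis=1)
        centers = np.array([Z[labels == k].mean(axis=0)
                            if np.any(labels == k) else centers[k]
                            for k in range(m)])
    return labels

# Two well-separated blobs should come back as two pure clusters.
X = np.vstack([np.zeros((5, 2)), 10 * np.ones((5, 2))])
labels = spectral_cluster(X, 2)
```

The dense eigendecomposition here is the source of the $O(n^3)$ complexity noted in the table; sparse similarity graphs and iterative eigensolvers are the usual way to push past that.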
21. Dash
- Recently released by Plotly.
- Easily build web applications for data analytics.
- Open sourced under the MIT license.
- Works nicely with the existing Plotly graphing libraries.
- Python equivalent of R's Shiny.
26. References
- "Comparing Different Clustering Algorithms on Toy Datasets." 2017. http://scikit-learn.org/stable/auto_examples/cluster/plot_cluster_comparison.html.
- Friedman, Jerome, Trevor Hastie, and Robert Tibshirani. 2001. The Elements of Statistical Learning. Vol. 1. Springer Series in Statistics. New York: Springer.
- Murphy, Kevin P. 2012. Machine Learning: A Probabilistic Perspective. MIT Press.
- Nadler, Boaz, and Meirav Galun. 2007. "Fundamental Limitations of Spectral Clustering." In Advances in Neural Information Processing Systems 19, edited by B. Schölkopf, J. C. Platt, and T. Hoffman, 1017–24. MIT Press.
- Von Luxburg, Ulrike. 2007. "A Tutorial on Spectral Clustering." Statistics and Computing.
- Wang, Xiaozhe, Kate Smith, and Rob Hyndman. 2006. "Characteristic-Based Clustering for Time Series Data." Data Mining and Knowledge Discovery 13 (3): 335–64.