Dr. Margriet Groenendijk | IBM
PyCon UK | 15 September 2018 | Cardiff
Navigating the Magical Data Visualisation Forest
http://ibm.biz/slides-margriet
@MargrietGr
Data Visualisation with Python in a Jupyter notebook
Jupyter notebooks
Local: Linux, Mac, Windows; Anaconda
SaaS: Google, Microsoft, Amazon, IBM and others
DOC ID / Month XX, 2018 / © 2018 IBM Corporation
matplotlib
Data from Barnola et al. Nature, 329, 408-414 (1987)
import pandas as pd
%matplotlib inline
co2_ice = pd.read_csv("data/co2.csv")
co2_ice.plot(x='ice Age', y='CO2 ppmv')
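A minimal, self-contained sketch of the same pandas plotting call, for trying it without data/co2.csv. The rows are placeholder values, not the Barnola et al. measurements; only the column names follow the slide. The Agg backend and the in-memory PNG buffer are assumptions that stand in for %matplotlib inline outside a notebook.

```python
import io

import matplotlib
matplotlib.use("Agg")  # render off-screen; stands in for %matplotlib inline
import pandas as pd

# Placeholder rows, NOT real measurements; only the column names match the slide.
csv_text = """ice Age,CO2 ppmv
1000,280.0
2000,275.0
3000,270.0
"""
co2_demo = pd.read_csv(io.StringIO(csv_text))

# DataFrame.plot returns the matplotlib Axes it drew on.
ax = co2_demo.plot(x="ice Age", y="CO2 ppmv")
ax.set_ylabel("CO2 (ppmv)")

# Write the figure to memory instead of showing it inline.
png = io.BytesIO()
ax.figure.savefig(png, format="png")
```

Because the call returns an Axes, the matplotlib methods used on the later slides (set_title, set_xlabel and so on) can be applied to a pandas plot as well.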
Data from: https://www.esrl.noaa.gov/gmd/ccgg/trends/data.html
import numpy as np
import pandas as pd
import matplotlib.pyplot as plt
import matplotlib as mpl

# Weekly Mauna Loa record; -999.99 marks missing values
co2_mlo = pd.read_csv("data/co2_weekly_mlo.csv")
co2_mlo = co2_mlo.replace(-999.99, np.nan)

fig, ax = plt.subplots()
ax.plot(co2_mlo['decimal'], co2_mlo['ppm'], '-', color='#01807D', linewidth=2)
ax.set_title('CO$_2$ at Mauna Loa', loc='right', fontsize=20)
ax.set_xlabel('year', fontsize=16)
ax.set_ylabel('CO$_2$ (ppmv)', fontsize=16)
plt.show()
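Why replace -999.99 with NaN before plotting: a short sketch, using placeholder values rather than real Mauna Loa data. Left in place, the sentinel is treated as a real reading; as NaN it is skipped by pandas statistics, and matplotlib leaves a gap in the line instead of drawing a spike.

```python
import numpy as np
import pandas as pd

# Placeholder values, not real measurements; -999.99 is the missing-data sentinel.
ppm = pd.Series([400.0, -999.99, 402.0])

# With the sentinel left in, it counts as a real value and drags the mean down.
raw_mean = ppm.mean()

# After replacement, NaN is skipped by default in pandas reductions.
clean = ppm.replace(-999.99, np.nan)
clean_mean = clean.mean()

print(raw_mean, clean_mean)
```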
seaborn
Data from Barnola et al. Nature, 329, 408-414 (1987)
import seaborn as sns
sns.distplot(co2_mlo['ppm'].dropna())
sns.distplot(co2_ice['CO2 ppmv'].dropna())
Fun! But the syntax is different for each package
matplotlib
Seaborn
Bokeh
Brunel
PdVega
Plotly
…
PixieDust
https://pixiedust.github.io/pixiedust/
PixieDust
Load data from anywhere into a Pandas or Spark DataFrame
Load data
import pixiedust
pixiedust.sampleData()
df = pixiedust.sampleData(6)
df = pixiedust.sampleData("https://openobjectstore.mybluemix.net/misc/milliondollarhomes.csv")
df = pixiedust.sampleData(6, forcePandas=True)
Display and filter data
import pixiedust
display(co2_ice)
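display() offers filtering through its interactive table. For comparison, a similar row filter can be written directly in pandas. This sketch uses placeholder rows (not real measurements) and a hypothetical threshold; it does not show PixieDust's own API.

```python
import pandas as pd

# Placeholder rows, NOT real measurements; column names follow the slides.
co2_demo = pd.DataFrame({
    "ice Age": [1000, 2000, 3000, 4000],
    "CO2 ppmv": [280.0, 240.0, 265.0, 230.0],
})

# Boolean mask: keep rows where CO2 is above a hypothetical threshold.
threshold = 250.0
filtered = co2_demo[co2_demo["CO2 ppmv"] > threshold]
print(filtered)
```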
Demo: http://ibm.biz/forest-notebook
Create interactive dashboards with PixieApps
Publish charts and PixieApps as a web app using the PixieGateway
Debug code in a notebook with the PixieDebugger
Book!
https://www.packtpub.com/big-data-and-business-intelligence/thoughtful-data-science
import pixiedust
display(df)
Thank you!
Find me on Twitter: @MargrietGr, @IBMCodeLondon
Slides: http://ibm.biz/slides-margriet
Notebook: http://ibm.biz/forest-notebook
Learn more: http://ibm.biz/pycon-uk
PixieDust: https://pixiedust.github.io/pixiedust/
Watson Studio: https://dataplatform.cloud.ibm.com/