SlideShare a Scribd company logo
Hands-on session: Python
Research Data Visualisation Workshop
Leighton Pritchard1,2,3
1
Information and Computational Sciences,
2
Centre for Human and Animal Pathogens in the Environment,
3
Dundee Effector Consortium,
The James Hutton Institute, Invergowrie, Dundee, Scotland, DD2 5DA
Acceptable Use Policy
Recording of this talk, taking photos, discussing the content using
email, Twitter, blogs, etc. is permitted (and encouraged),
providing distraction to others is minimised.
These slides will be made available at
http://www.slideshare.net/leightonp
Table of Contents
1 Introduction
Why listen to me?
What is visualisation?
Elementary perceptual tasks
2 Evidence-based representation
What representations work best?
Pie charts
Bars and lines
Scatterplots
Interactive plots
3 Hands-on session
Python libraries
Exercises
Let’s get started
What I do
Computational biologist (1996-present)
protein sequence-structure-function (1996-1999)
yeast metabolism (1999-2003)
plant pathology (2003-present)
Large datasets
sequence/genomic
metabolomics
statistics
geographical
Visualisation as communication to (wet) biologists
protein structures
metabolic flux
comparative genomics/evolution
statistical plots
Big data. . .
GenomeDiagram a b c
a
Pritchard et al. (2006) Bioinformatics doi:10.1093/bioinformatics/btk021
b
Toth et al. (2006) Ann. Rev. Phytopath. doi:10.1146/annurev.phyto.44.070505.143444
c
http://biopython.org
Functional adaptation in Pba a b
a
Toth et al. (2006) Ann. Rev. Phytopath. doi:10.1146/annurev.phyto.44.070505.143444
b
http://biopython.org
GenomeDiagram/SciArt a b c
a
Pritchard et al. (2006) Bioinformatics doi:10.1093/bioinformatics/btk021
b
Shemilt (2009) in“Digital Visual Culture: Theory and Practice” ISBN 978-1-84150-248-9
c
Shemilt (2010) in “Art Practice in a Digital Culture”, ISBN 978-0-7546-7623-2
Influence
Free open-source comparative
genomics visualisation library
Impact
Artwork (prints, audio-visual
installation) exhibited in UK and
internationally
Comparative metabolism a b c
a
Biopython KGML/KEGG visualisation module
b
https://github.com/widdowquinn/Notebooks-Bioinformatics
c
http://biopython.org
PyANI: prokaryote classification a b
a
http://widdowquinn.github.io/pyani/
b
Pritchard et al. (2016) Anal. Methods doi:10.1039/C5AY02550H
Pectobacterium_atrosepticum_SCRI1043_uid57957Pectobacterium_atrosepticum_NCPPB8549Pectobacterium_atrosepticum_NCPPB3404Pectobacterium_atrosepticum_21APectobacterium_atrosepticum_JG10-08Pectobacterium_carotovorum_PC1_uid59295Pectobacterium_carotovorum_subsp_carotovorum_NCPPB312Pectobacterium_carotovorum_subsp_oderiferum_NCPPB3841Pectobacterium_carotovorum_subsp_oderiferum_NCPPB3839Pectobacterium_carotovorum_subsp_carotovorum_NCPPB3395Pectobacterium_carotovorum_PCC21_uid174335Pectobacterium_carotovorum_subsp_brasiliensis_B5Pectobacterium_carotovorum_subsp_brasiliensis_B4Pectobacterium_betavasculorum_NCPPB2293Pectobacterium_betavasculorum_NCPPB2795Pectobacterium_wasabiae_NCPPB3702Pectobacterium_wasabiae_NCPPB3701Pectobacterium_wasabiae_WPP163_uid41297Pectobacterium_SCC3193_uid193707Dickeya_solani_AMYI01Dickeya_solani_AMWE01Dickeya_solani_GBBC2040Dickeya_solani_IPO2222Dickeya_solani_MK16Dickeya_solani_MK10Dickeya_dianthicola_NCPPB_3534Dickeya_dianthicola_GBBC2039Dickeya_dianthicola_NCPPB_453Dickeya_dianthicola_IPO980Dickeya_spp_NCPPB_3274Dickeya_spp_MK7Dickeya_dadantii_NCPPB_2976Dickeya_dadantii_NCPPB_898Dickeya_dadantii_NCPPB_3537Dickeya_dadantii_3937_uid52537Pantoea_ananatis_AJ13355_uid162073Pantoea_ananatis_LMG_20103_uid46807Pantoea_ananatis_PA13_uid162181Pantoea_ananatis_uid86861Erwinia_amylovora_CFBP1430_uid46839Erwinia_amylovora_ATCC_49946_uid46943Erwinia_Ejp617_uid159955Erwinia_pyrifoliae_Ep1_96_uid40659Erwinia_pyrifoliae_DSM_12163_uid159693Dickeya_dadantii_Ech703_uid59363Dickeya_paradisiaca_NCPPB_2511Dickeya_aquatica_DW_0440Dickeya_aquatica_CSL_RW240Erwinia_tasmaniensis_Et1_99_uid59029Pantoea_At_9b_uid55845Pantoea_vagans_C9_1_uid49871Erwinia_billingiae_Eb661_uid50547Dickeya_zeae_APMV01Dickeya_zeae_AJVN01Dickeya_zeae_CSL_RW192Dickeya_zeae_NCPPB_3531Dickeya_dadantii_Ech586_uid42519Dickeya_zeae_APWM01Dickeya_zeae_NCPPB_2538Dickeya_zeae_MK19Dickeya_zeae_NCPPB_3532Dickeya_spp_NCPPB_569Dickeya_chrysanthami_NCPPB_402Dickeya_chrysanthami_NCPPB_516Dickeya_zeae_Ech1591_uid59297Dickeya_chrysanthami_NCPPB_3533
Pectobacterium_atrosepticum_SCRI1043_uid57957Pectobacterium_atrosepticum_NCPPB8549Pectobacterium_atrosepticum_NCPPB3404Pectobacterium_atrosepticum_21APectobacterium_atrosepticum_JG10-08Pectobacterium_carotovorum_PC1_uid59295Pectobacterium_carotovorum_subsp_carotovorum_NCPPB312Pectobacterium_carotovorum_subsp_oderiferum_NCPPB3841Pectobacterium_carotovorum_subsp_oderiferum_NCPPB3839Pectobacterium_carotovorum_subsp_carotovorum_NCPPB3395Pectobacterium_carotovorum_PCC21_uid174335Pectobacterium_carotovorum_subsp_brasiliensis_B5Pectobacterium_carotovorum_subsp_brasiliensis_B4Pectobacterium_betavasculorum_NCPPB2293Pectobacterium_betavasculorum_NCPPB2795Pectobacterium_wasabiae_NCPPB3702Pectobacterium_wasabiae_NCPPB3701Pectobacterium_wasabiae_WPP163_uid41297Pectobacterium_SCC3193_uid193707Dickeya_solani_AMYI01Dickeya_solani_AMWE01Dickeya_solani_GBBC2040Dickeya_solani_IPO2222Dickeya_solani_MK16Dickeya_solani_MK10Dickeya_dianthicola_NCPPB_3534Dickeya_dianthicola_GBBC2039Dickeya_dianthicola_NCPPB_453Dickeya_dianthicola_IPO980Dickeya_spp_NCPPB_3274Dickeya_spp_MK7Dickeya_dadantii_NCPPB_2976Dickeya_dadantii_NCPPB_898Dickeya_dadantii_NCPPB_3537Dickeya_dadantii_3937_uid52537Pantoea_ananatis_AJ13355_uid162073Pantoea_ananatis_LMG_20103_uid46807Pantoea_ananatis_PA13_uid162181Pantoea_ananatis_uid86861Erwinia_amylovora_CFBP1430_uid46839Erwinia_amylovora_ATCC_49946_uid46943Erwinia_Ejp617_uid159955Erwinia_pyrifoliae_Ep1_96_uid40659Erwinia_pyrifoliae_DSM_12163_uid159693Dickeya_dadantii_Ech703_uid59363Dickeya_paradisiaca_NCPPB_2511Dickeya_aquatica_DW_0440Dickeya_aquatica_CSL_RW240Erwinia_tasmaniensis_Et1_99_uid59029Pantoea_At_9b_uid55845Pantoea_vagans_C9_1_uid49871Erwinia_billingiae_Eb661_uid50547Dickeya_zeae_APMV01Dickeya_zeae_AJVN01Dickeya_zeae_CSL_RW192Dickeya_zeae_NCPPB_3531Dickeya_dadantii_Ech586_uid42519Dickeya_zeae_APWM01Dickeya_zeae_NCPPB_2538Dickeya_zeae_MK19Dickeya_zeae_NCPPB_3532Dickeya_spp_NCPPB_569Dickeya_chrysanthami_NCPPB_402Dickeya_chrysanthami_NCPPB_516Dickeya_zeae_Ech1591_uid59297Dickeya_chrysanthami_NCPPB_3533
0.00
0.25
0.50
0.75
1.00
ANIm_percentage_identity
Table of Contents
1 Introduction
Why listen to me?
What is visualisation?
Elementary perceptual tasks
2 Evidence-based representation
What representations work best?
Pie charts
Bars and lines
Scatterplots
Interactive plots
3 Hands-on session
Python libraries
Exercises
Let’s get started
Data visualisation is art and science
. . .storytelling in pictorial or graphical format
Stories to yourself (sense-making)
Stories to others (communication)
Cautionary tales
Data visualisation is art and science
. . .storytelling in pictorial or graphical format
Stories to yourself (sense-making)
summarise big stories quickly
data exploration and mining
identify areas/items of importance
find relationships and patterns
Stories to others (communication)
Cautionary tales
Data visualisation is art and science
. . .storytelling in pictorial or graphical format
Stories to yourself (sense-making)
summarise big stories quickly
data exploration and mining
identify areas/items of importance
find relationships and patterns
Stories to others (communication)
present your interpretation of data
make a specific point
assert a relationship or pattern
demonstrate significance
Cautionary tales
Data visualisation is art and science
. . .storytelling in pictorial or graphical format
Stories to yourself (sense-making)
summarise big stories quickly
data exploration and mining
identify areas/items of importance
find relationships and patterns
Stories to others (communication)
present your interpretation of data
make a specific point
assert a relationship or pattern
demonstrate significance
Cautionary tales
avoid distortion
make the reader think about the data, not the presentation
avoid chartjunk (excessive decoration)
aim for high data:ink ratio
The point of data visualisation a
a
https://en.wikipedia.org/wiki/Data visualization
Where does visualisation belong?
Communicating effectively
Understand the data
Know (or be receptive to) the message
Know your audience
Communicating effectively
Understand the data
size
cardinality
meaning
relationships
Know (or be receptive to) the message
Know your audience
Communicating effectively
Understand the data
size
cardinality
meaning
relationships
Know (or be receptive to) the message
what does pictorial representation mean?
match graphical relationships to data relationships
Know your audience
Communicating effectively
Understand the data
size
cardinality
meaning
relationships
Know (or be receptive to) the message
what does pictorial representation mean?
match graphical relationships to data relationships
Know your audience
how do people process pictorial information
how does your audience process information
domain-specific representations
A model of communication a
a
Randy Olson (2009) Don’t Be Such a Scientist
Table of Contents
1 Introduction
Why listen to me?
What is visualisation?
Elementary perceptual tasks
2 Evidence-based representation
What representations work best?
Pie charts
Bars and lines
Scatterplots
Interactive plots
3 Hands-on session
Python libraries
Exercises
Let’s get started
Elementary Perceptual Tasks a
a
Cleveland & McGill (1984) J. Am. Stat. Ass.
The most basic visual tasks:
Implementations a
a
http://www.datavizcatalogue.com/
Position: common scale
Scatterplot
Bar Chart
Angle
Pie Chart
Do(ugh)nut Chart
Curvature
Arc Diagram
Chord Diagram
Gestalt principles a
a
https://emeeks.github.io/gestaltdataviz
proximity: close objects perceived as
groups
enclosure: bounded objects perceived
as groups
continuity: aligned objects perceived
as continuous
similarity: similar attributes perceived
as groups
closure: open objects perceived as
complete
connection: connected items
perceived as groups
Table of Contents
1 Introduction
Why listen to me?
What is visualisation?
Elementary perceptual tasks
2 Evidence-based representation
What representations work best?
Pie charts
Bars and lines
Scatterplots
Interactive plots
3 Hands-on session
Python libraries
Exercises
Let’s get started
What works best? Experiment a b
a
Cleveland & McGill (1984) J. Am. Stat. Ass.
b
Heer & Bostock (2010) CHI 2010
Empirical measurements of interpretation
Subjects shown graphs representing same data
(log2) error in subjects’ accuracy compared by graph type
Judgement types
1-3: Position on a common scale (bar chart, stacked bar
chart)
4-5: Length encoding (stacked bar chart)
6: Angle (pie chart)
7-9: Area (bubble chart, aligned rectangles, treemap)
What works best? Result a b
a
Cleveland & McGill (1984) J. Am. Stat. Ass.
b
Heer & Bostock (2010) CHI 2010
We have inherent biases
that can distort
information recovered
Position > Angle ≈
Length > Area
Accuracy plateaus as
charts increase in size
Gridlines improve
accuracy
Aspect ratios affect area
judgements (squares
worst)
Table of Contents
1 Introduction
Why listen to me?
What is visualisation?
Elementary perceptual tasks
2 Evidence-based representation
What representations work best?
Pie charts
Bars and lines
Scatterplots
Interactive plots
3 Hands-on session
Python libraries
Exercises
Let’s get started
People hate pie charts
http://www.storytellingwithdata.com/blog/2011/07/death-to-pie-charts
especially Edward Tufte
A table is nearly always better than a dumb pie chart; the only worse design than a pie
chart is several of them[...] pie charts should never be used. - ”The Visual Display of
Quantitative Information”
”E pur si muove. . .” a b
a
Eells (1926) J Am. Stat. Ass.
b
Simkin & Hastie (1987) J Am. Stat. Ass.
For proportions of a whole:
Pie charts read as accurately as bar charts
As number of components in the chart increases, bars are less efficient than pie
charts
Table of Contents
1 Introduction
Why listen to me?
What is visualisation?
Elementary perceptual tasks
2 Evidence-based representation
What representations work best?
Pie charts
Bars and lines
Scatterplots
Interactive plots
3 Hands-on session
Python libraries
Exercises
Let’s get started
Bar charts are bad. . .mmmkay?
There is an ongoing backlash against bar charts
(and I’m not picking on Nick, he just tweets a lot. . .)
But are they really that bad?
Interpretation of bars and lines a
a
Zacks & Tversky (1999) Mem. Cognit.
People interpret bars and lines differently
Experiment 1: In absence of context (arbitrary X, Y )
bars: discrete comparison (24:0)
lines: trend assessment (0:35)
Interpretation of bars and lines a
a
Zacks & Tversky (1999) Mem. Cognit.
People interpret bars and lines differently
Experiment 2: With context (discrete or continuous data)
Bars vs. lines
People naturally interpret bar charts as categorical data
People naturally interpret line graphs as trends
Using bars for trend data or lines for categorical data can
mislead the reader
Bar charts can mislead a
a
Weissgerber et al. (2015) PLoS Biol. doi:10.1371/journal.pbio.1002128
Do these bars differ in value?
Bar charts can mislead a
a
Weissgerber et al. (2015) PLoS Biol. doi:10.1371/journal.pbio.1002128
Do these bars differ in value?
Bar charts represent data as a
single point: lossy compression.
Could different datasets give the
same bar chart?
Bars are lossy compression a
a
Weissgerber et al. (2015) PLoS Biol. doi:10.1371/journal.pbio.1002128
Bars hide detail:
Number of data points
Variance of data points
Distribution of data points (outliers, etc.)
Bars may mislead on statistics a
a
Weissgerber et al. (2015) PLoS Biol. doi:10.1371/journal.pbio.1002128
Bars may imply incorrect test statistics:
Overlaps, outliers, covariates, sample sizes masked
Bars for paired data a
a
Weissgerber et al. (2015) PLoS Biol. doi:10.1371/journal.pbio.1002128
Bars imply independence of data:
Better than bar charts?
Bar chart with SE bars suggests group 2 is highest
Better than bar charts?
Bar chart with SD bars suggests there is overlap
Better than bar charts?
Univariate scatterplots show sample sizes, outliers, variance
Any chart can mislead
Any chart can mislead a
a
Spurious Correlations, tylervigen.com
Any chart can mislead a
a
https://xkcd.com/1138/
Table of Contents
1 Introduction
Why listen to me?
What is visualisation?
Elementary perceptual tasks
2 Evidence-based representation
What representations work best?
Pie charts
Bars and lines
Scatterplots
Interactive plots
3 Hands-on session
Python libraries
Exercises
Let’s get started
Scatterplots a
a
http://www.datavizcatalogue.com/
Scatterplots should be awesome:
Positions on common scale (lowest error representation)
Show all data: outliers, sample sizes, trends, etc.
Framing affects interpretation a
a
Cleveland et al. (1982) Science doi:10.1126/science.216.4550.1138
Point cloud size affects interpretation of correlation
(more diffuse interpreted as lower correlation coefficient)
Interpreting correlation is difficult a
a
Fisher et al. (2014) PeerJ doi:10.7717/peerj.589
People don’t judge significance well
47.4% of significant relationships correctly classified
74.6% of non-significant relationships correctly classified
Table of Contents
1 Introduction
Why listen to me?
What is visualisation?
Elementary perceptual tasks
2 Evidence-based representation
What representations work best?
Pie charts
Bars and lines
Scatterplots
Interactive plots
3 Hands-on session
Python libraries
Exercises
Let’s get started
Latency affects usage
Increasing latency to 0.5s:
decreases user activity
decreases dataset
coverage
reduces rate of hypothesis
generation
changes data exploration
strategy
reduces future interaction
with other graphics
Table of Contents
1 Introduction
Why listen to me?
What is visualisation?
Elementary perceptual tasks
2 Evidence-based representation
What representations work best?
Pie charts
Bars and lines
Scatterplots
Interactive plots
3 Hands-on session
Python libraries
Exercises
Let’s get started
Python libraries
Matplotlib http://matplotlib.org/
Seaborn https://stanford.edu/ mwaskom/software/seaborn/
ggplot for Python http://yhat.github.io/ggplot/
Bokeh http://bokeh.pydata.org/
Table of Contents
1 Introduction
Why listen to me?
What is visualisation?
Elementary perceptual tasks
2 Evidence-based representation
What representations work best?
Pie charts
Bars and lines
Scatterplots
Interactive plots
3 Hands-on session
Python libraries
Exercises
Let’s get started
Exercise choices a
a
https://github.com/widdowquinn/Teaching-Data-Visualisation
One-variable, continuous data
Grammar of Graphics
Interactive map with bokeh
Two-variable, continuous x, y data
Arrays, colormaps, surface plots
Making movies
Table of Contents
1 Introduction
Why listen to me?
What is visualisation?
Elementary perceptual tasks
2 Evidence-based representation
What representations work best?
Pie charts
Bars and lines
Scatterplots
Interactive plots
3 Hands-on session
Python libraries
Exercises
Let’s get started
Let’s get started a
a
https://xkcd.com/1654/
Licence: CC-BY-SA
By: Leighton Pritchard
This presentation is licensed under the Creative Commons
Attribution ShareAlike license
https://creativecommons.org/licenses/by-sa/4.0/

More Related Content

What's hot

Cross-Disciplinary Biomedical Research at Calit2
Cross-Disciplinary Biomedical Research at Calit2Cross-Disciplinary Biomedical Research at Calit2
Cross-Disciplinary Biomedical Research at Calit2
Larry Smarr
 
Machine Learning
Machine LearningMachine Learning
The Emerging Global Community of Microbial Metagenomics Researchers
The Emerging Global Community of Microbial Metagenomics ResearchersThe Emerging Global Community of Microbial Metagenomics Researchers
The Emerging Global Community of Microbial Metagenomics Researchers
Larry Smarr
 
Discovery and Annotation of Novel Proteins from Rumen Gut Metagenomic Sequenc...
Discovery and Annotation of Novel Proteins from Rumen Gut Metagenomic Sequenc...Discovery and Annotation of Novel Proteins from Rumen Gut Metagenomic Sequenc...
Discovery and Annotation of Novel Proteins from Rumen Gut Metagenomic Sequenc...
Mick Watson
 
Polyketide Synthase type III Isolated from Uncultured Deep-Sea Proteobacteriu...
Polyketide Synthase type III Isolated from Uncultured Deep-Sea Proteobacteriu...Polyketide Synthase type III Isolated from Uncultured Deep-Sea Proteobacteriu...
Polyketide Synthase type III Isolated from Uncultured Deep-Sea Proteobacteriu...
Hadeel El Bardisy
 
Building a Community Cyberinfrastructure to Support Marine Microbial Ecology ...
Building a Community Cyberinfrastructure to Support Marine Microbial Ecology ...Building a Community Cyberinfrastructure to Support Marine Microbial Ecology ...
Building a Community Cyberinfrastructure to Support Marine Microbial Ecology ...
Larry Smarr
 
How Can We Make Genomic Epidemiology a Widespread Reality? - William Hsiao
How Can We Make Genomic Epidemiology a Widespread Reality?  - William HsiaoHow Can We Make Genomic Epidemiology a Widespread Reality?  - William Hsiao
How Can We Make Genomic Epidemiology a Widespread Reality? - William Hsiao
William Hsiao
 
[2013.12.02] Mads Albertsen: Extracting Genomes from Metagenomes
[2013.12.02] Mads Albertsen: Extracting Genomes from Metagenomes[2013.12.02] Mads Albertsen: Extracting Genomes from Metagenomes
[2013.12.02] Mads Albertsen: Extracting Genomes from Metagenomes
Mads Albertsen
 
Bioinformatics workshop presentation
Bioinformatics   workshop presentationBioinformatics   workshop presentation
Bioinformatics workshop presentation
SKUAST-Kashmir
 
Rapid Impact Assessment of Climatic and Physio-graphic Changes on Flagship G...
Rapid Impact Assessment of Climatic and Physio-graphic Changes  on Flagship G...Rapid Impact Assessment of Climatic and Physio-graphic Changes  on Flagship G...
Rapid Impact Assessment of Climatic and Physio-graphic Changes on Flagship G...
Arvinder Singh
 
McNatt CV June 2016
McNatt CV June 2016McNatt CV June 2016
McNatt CV June 2016
Matthew McNatt
 
Bayesian Taxonomic Assignment for the Next-Generation Metagenomics
Bayesian Taxonomic Assignment for the Next-Generation MetagenomicsBayesian Taxonomic Assignment for the Next-Generation Metagenomics
Bayesian Taxonomic Assignment for the Next-Generation Metagenomics
Jonathan Eisen
 
updated resume
updated resumeupdated resume
updated resume
Rajalaxmi Muruggan
 
Mastering RNA-Seq (NGS Data Analysis) - A Critical Approach To Transcriptomic...
Mastering RNA-Seq (NGS Data Analysis) - A Critical Approach To Transcriptomic...Mastering RNA-Seq (NGS Data Analysis) - A Critical Approach To Transcriptomic...
Mastering RNA-Seq (NGS Data Analysis) - A Critical Approach To Transcriptomic...
Elia Brodsky
 
Building an Information Infrastructure to Support Microbial Metagenomic Sciences
Building an Information Infrastructure to Support Microbial Metagenomic SciencesBuilding an Information Infrastructure to Support Microbial Metagenomic Sciences
Building an Information Infrastructure to Support Microbial Metagenomic Sciences
Larry Smarr
 
Candidate 113701 (srg) senior biologist
Candidate 113701 (srg) senior biologistCandidate 113701 (srg) senior biologist
Candidate 113701 (srg) senior biologist
Jonathan Duckworth
 
Web applications for rapid microbial taxonomy identification
Web applications for rapid microbial taxonomy identification Web applications for rapid microbial taxonomy identification
Web applications for rapid microbial taxonomy identification
ExternalEvents
 
Paper-based synthetic gene networks
Paper-based synthetic gene networksPaper-based synthetic gene networks
Paper-based synthetic gene networks
GHMHI_MIT
 
Synthetic Biology & Global Health - Claire Marris
Synthetic Biology & Global Health - Claire MarrisSynthetic Biology & Global Health - Claire Marris
Synthetic Biology & Global Health - Claire Marris
STEPS Centre
 
Introduction to Bioinformatics Slides
Introduction to Bioinformatics SlidesIntroduction to Bioinformatics Slides
Introduction to Bioinformatics Slides
Saide OER Africa
 

What's hot (20)

Cross-Disciplinary Biomedical Research at Calit2
Cross-Disciplinary Biomedical Research at Calit2Cross-Disciplinary Biomedical Research at Calit2
Cross-Disciplinary Biomedical Research at Calit2
 
Machine Learning
Machine LearningMachine Learning
Machine Learning
 
The Emerging Global Community of Microbial Metagenomics Researchers
The Emerging Global Community of Microbial Metagenomics ResearchersThe Emerging Global Community of Microbial Metagenomics Researchers
The Emerging Global Community of Microbial Metagenomics Researchers
 
Discovery and Annotation of Novel Proteins from Rumen Gut Metagenomic Sequenc...
Discovery and Annotation of Novel Proteins from Rumen Gut Metagenomic Sequenc...Discovery and Annotation of Novel Proteins from Rumen Gut Metagenomic Sequenc...
Discovery and Annotation of Novel Proteins from Rumen Gut Metagenomic Sequenc...
 
Polyketide Synthase type III Isolated from Uncultured Deep-Sea Proteobacteriu...
Polyketide Synthase type III Isolated from Uncultured Deep-Sea Proteobacteriu...Polyketide Synthase type III Isolated from Uncultured Deep-Sea Proteobacteriu...
Polyketide Synthase type III Isolated from Uncultured Deep-Sea Proteobacteriu...
 
Building a Community Cyberinfrastructure to Support Marine Microbial Ecology ...
Building a Community Cyberinfrastructure to Support Marine Microbial Ecology ...Building a Community Cyberinfrastructure to Support Marine Microbial Ecology ...
Building a Community Cyberinfrastructure to Support Marine Microbial Ecology ...
 
How Can We Make Genomic Epidemiology a Widespread Reality? - William Hsiao
How Can We Make Genomic Epidemiology a Widespread Reality?  - William HsiaoHow Can We Make Genomic Epidemiology a Widespread Reality?  - William Hsiao
How Can We Make Genomic Epidemiology a Widespread Reality? - William Hsiao
 
[2013.12.02] Mads Albertsen: Extracting Genomes from Metagenomes
[2013.12.02] Mads Albertsen: Extracting Genomes from Metagenomes[2013.12.02] Mads Albertsen: Extracting Genomes from Metagenomes
[2013.12.02] Mads Albertsen: Extracting Genomes from Metagenomes
 
Bioinformatics workshop presentation
Bioinformatics   workshop presentationBioinformatics   workshop presentation
Bioinformatics workshop presentation
 
Rapid Impact Assessment of Climatic and Physio-graphic Changes on Flagship G...
Rapid Impact Assessment of Climatic and Physio-graphic Changes  on Flagship G...Rapid Impact Assessment of Climatic and Physio-graphic Changes  on Flagship G...
Rapid Impact Assessment of Climatic and Physio-graphic Changes on Flagship G...
 
McNatt CV June 2016
McNatt CV June 2016McNatt CV June 2016
McNatt CV June 2016
 
Bayesian Taxonomic Assignment for the Next-Generation Metagenomics
Bayesian Taxonomic Assignment for the Next-Generation MetagenomicsBayesian Taxonomic Assignment for the Next-Generation Metagenomics
Bayesian Taxonomic Assignment for the Next-Generation Metagenomics
 
updated resume
updated resumeupdated resume
updated resume
 
Mastering RNA-Seq (NGS Data Analysis) - A Critical Approach To Transcriptomic...
Mastering RNA-Seq (NGS Data Analysis) - A Critical Approach To Transcriptomic...Mastering RNA-Seq (NGS Data Analysis) - A Critical Approach To Transcriptomic...
Mastering RNA-Seq (NGS Data Analysis) - A Critical Approach To Transcriptomic...
 
Building an Information Infrastructure to Support Microbial Metagenomic Sciences
Building an Information Infrastructure to Support Microbial Metagenomic SciencesBuilding an Information Infrastructure to Support Microbial Metagenomic Sciences
Building an Information Infrastructure to Support Microbial Metagenomic Sciences
 
Candidate 113701 (srg) senior biologist
Candidate 113701 (srg) senior biologistCandidate 113701 (srg) senior biologist
Candidate 113701 (srg) senior biologist
 
Web applications for rapid microbial taxonomy identification
Web applications for rapid microbial taxonomy identification Web applications for rapid microbial taxonomy identification
Web applications for rapid microbial taxonomy identification
 
Paper-based synthetic gene networks
Paper-based synthetic gene networksPaper-based synthetic gene networks
Paper-based synthetic gene networks
 
Synthetic Biology & Global Health - Claire Marris
Synthetic Biology & Global Health - Claire MarrisSynthetic Biology & Global Health - Claire Marris
Synthetic Biology & Global Health - Claire Marris
 
Introduction to Bioinformatics Slides
Introduction to Bioinformatics SlidesIntroduction to Bioinformatics Slides
Introduction to Bioinformatics Slides
 

Viewers also liked

2015年度春学期 画像情報処理 第10回 画像の集合演算とオープニング
2015年度春学期 画像情報処理 第10回 画像の集合演算とオープニング2015年度春学期 画像情報処理 第10回 画像の集合演算とオープニング
2015年度春学期 画像情報処理 第10回 画像の集合演算とオープニング
Akira Asano
 
360 degree
360 degree360 degree
What is a scholarly article?
What is a scholarly article?What is a scholarly article?
What is a scholarly article?
jlsebai
 
Vacaciones
VacacionesVacaciones
Vacaciones
laurita10079
 
2015年度春学期 画像情報処理 第9回 離散フーリエ変換と離散コサイン変換 (2015. 6. 17)
2015年度春学期 画像情報処理 第9回 離散フーリエ変換と離散コサイン変換 (2015. 6. 17)2015年度春学期 画像情報処理 第9回 離散フーリエ変換と離散コサイン変換 (2015. 6. 17)
2015年度春学期 画像情報処理 第9回 離散フーリエ変換と離散コサイン変換 (2015. 6. 17)
Akira Asano
 
The Park Great Work Place Award
The Park Great Work Place AwardThe Park Great Work Place Award
The Park Great Work Place AwardPrince Raj
 
2016年度秋学期 応用数学(解析) 第12回 複素関数・正則関数 (2016. 12. 15)
2016年度秋学期 応用数学(解析) 第12回 複素関数・正則関数 (2016. 12. 15)2016年度秋学期 応用数学(解析) 第12回 複素関数・正則関数 (2016. 12. 15)
2016年度秋学期 応用数学(解析) 第12回 複素関数・正則関数 (2016. 12. 15)
Akira Asano
 
Convoca 1o. sabatino 2014
Convoca 1o.  sabatino 2014Convoca 1o.  sabatino 2014
Convoca 1o. sabatino 2014
agssports.com
 
2014SteveDevichReference
2014SteveDevichReference2014SteveDevichReference
2014SteveDevichReferenceaschaefmn
 
2015年度春学期 画像情報処理 第4回 離散フーリエ変換 (2015. 4.30)
2015年度春学期 画像情報処理 第4回 離散フーリエ変換 (2015. 4.30)2015年度春学期 画像情報処理 第4回 離散フーリエ変換 (2015. 4.30)
2015年度春学期 画像情報処理 第4回 離散フーリエ変換 (2015. 4.30)
Akira Asano
 
Bab viii
Bab viiiBab viii
Bab viii
rifky2009
 
Koster public vita
Koster public vitaKoster public vita
Koster public vita
Jean Koster
 
Mètodes anticonceptius.
Mètodes anticonceptius. Mètodes anticonceptius.
Mètodes anticonceptius.
Teresa Rodriguez Verdun
 
Re - Entry Drilling Project
Re - Entry Drilling ProjectRe - Entry Drilling Project
Re - Entry Drilling Project
Andika Mahardika
 
2016年度秋学期 統計学 第7回 データの関係を知る(2)-回帰分析 (2016. 11. 7)
2016年度秋学期 統計学 第7回 データの関係を知る(2)-回帰分析 (2016. 11. 7)2016年度秋学期 統計学 第7回 データの関係を知る(2)-回帰分析 (2016. 11. 7)
2016年度秋学期 統計学 第7回 データの関係を知る(2)-回帰分析 (2016. 11. 7)
Akira Asano
 

Viewers also liked (15)

2015年度春学期 画像情報処理 第10回 画像の集合演算とオープニング
2015年度春学期 画像情報処理 第10回 画像の集合演算とオープニング2015年度春学期 画像情報処理 第10回 画像の集合演算とオープニング
2015年度春学期 画像情報処理 第10回 画像の集合演算とオープニング
 
360 degree
360 degree360 degree
360 degree
 
What is a scholarly article?
What is a scholarly article?What is a scholarly article?
What is a scholarly article?
 
Vacaciones
VacacionesVacaciones
Vacaciones
 
2015年度春学期 画像情報処理 第9回 離散フーリエ変換と離散コサイン変換 (2015. 6. 17)
2015年度春学期 画像情報処理 第9回 離散フーリエ変換と離散コサイン変換 (2015. 6. 17)2015年度春学期 画像情報処理 第9回 離散フーリエ変換と離散コサイン変換 (2015. 6. 17)
2015年度春学期 画像情報処理 第9回 離散フーリエ変換と離散コサイン変換 (2015. 6. 17)
 
The Park Great Work Place Award
The Park Great Work Place AwardThe Park Great Work Place Award
The Park Great Work Place Award
 
2016年度秋学期 応用数学(解析) 第12回 複素関数・正則関数 (2016. 12. 15)
2016年度秋学期 応用数学(解析) 第12回 複素関数・正則関数 (2016. 12. 15)2016年度秋学期 応用数学(解析) 第12回 複素関数・正則関数 (2016. 12. 15)
2016年度秋学期 応用数学(解析) 第12回 複素関数・正則関数 (2016. 12. 15)
 
Convoca 1o. sabatino 2014
Convoca 1o.  sabatino 2014Convoca 1o.  sabatino 2014
Convoca 1o. sabatino 2014
 
2014SteveDevichReference
2014SteveDevichReference2014SteveDevichReference
2014SteveDevichReference
 
2015年度春学期 画像情報処理 第4回 離散フーリエ変換 (2015. 4.30)
2015年度春学期 画像情報処理 第4回 離散フーリエ変換 (2015. 4.30)2015年度春学期 画像情報処理 第4回 離散フーリエ変換 (2015. 4.30)
2015年度春学期 画像情報処理 第4回 離散フーリエ変換 (2015. 4.30)
 
Bab viii
Bab viiiBab viii
Bab viii
 
Koster public vita
Koster public vitaKoster public vita
Koster public vita
 
Mètodes anticonceptius.
Mètodes anticonceptius. Mètodes anticonceptius.
Mètodes anticonceptius.
 
Re - Entry Drilling Project
Re - Entry Drilling ProjectRe - Entry Drilling Project
Re - Entry Drilling Project
 
2016年度秋学期 統計学 第7回 データの関係を知る(2)-回帰分析 (2016. 11. 7)
2016年度秋学期 統計学 第7回 データの関係を知る(2)-回帰分析 (2016. 11. 7)2016年度秋学期 統計学 第7回 データの関係を知る(2)-回帰分析 (2016. 11. 7)
2016年度秋学期 統計学 第7回 データの関係を知る(2)-回帰分析 (2016. 11. 7)
 

Similar to RDVW Hands-on session: Python

Free webinar-introduction to bioinformatics - biologist-1
Free webinar-introduction to bioinformatics - biologist-1Free webinar-introduction to bioinformatics - biologist-1
Free webinar-introduction to bioinformatics - biologist-1
Elia Brodsky
 
Curate locally, think globally
Curate locally, think globallyCurate locally, think globally
Curate locally, think globally
Valerie Wood
 
Computational of Bioinformatics
Computational of BioinformaticsComputational of Bioinformatics
Computational of Bioinformatics
ijtsrd
 
Cartegena051811
Cartegena051811Cartegena051811
Cartegena051811
Philip Bourne
 
wolstencroft-ogf20-astro
wolstencroft-ogf20-astrowolstencroft-ogf20-astro
wolstencroft-ogf20-astro
webuploader
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
nadimissimple
 
A biologist in e-Science
A biologist in e-ScienceA biologist in e-Science
A biologist in e-Science
Leiden University Medical Center
 
Data Visualization And Annotation Workshop at Biocuration 2015
Data Visualization And Annotation Workshop at Biocuration 2015Data Visualization And Annotation Workshop at Biocuration 2015
Data Visualization And Annotation Workshop at Biocuration 2015
Monica Munoz-Torres
 
DisGeNET Tutorial SWAT4LS 2015-12-07
DisGeNET Tutorial SWAT4LS 2015-12-07DisGeNET Tutorial SWAT4LS 2015-12-07
DisGeNET Tutorial SWAT4LS 2015-12-07
Núria Queralt Rosinach
 
The beauty of workflows and models
The beauty of workflows and modelsThe beauty of workflows and models
The beauty of workflows and models
myGrid team
 
From Laboratory to e-Laboratory
From Laboratory to e-LaboratoryFrom Laboratory to e-Laboratory
From Laboratory to e-Laboratory
Leiden University Medical Center
 
HKU Data Curation MLIM7350 Class 8
HKU Data Curation MLIM7350 Class 8HKU Data Curation MLIM7350 Class 8
HKU Data Curation MLIM7350 Class 8
Scott Edmunds
 
2015 illinois-talk
2015 illinois-talk2015 illinois-talk
2015 illinois-talk
c.titus.brown
 
Minimal viable-datareuse-czi
Minimal viable-datareuse-cziMinimal viable-datareuse-czi
Minimal viable-datareuse-czi
Paul Groth
 
Acting as Advocate? Seven steps for libraries in the data decade
Acting as Advocate? Seven steps for libraries in the data decadeActing as Advocate? Seven steps for libraries in the data decade
Acting as Advocate? Seven steps for libraries in the data decade
LizLyon
 
Introduction to Bioinformatics.
 Introduction to Bioinformatics. Introduction to Bioinformatics.
Introduction to Bioinformatics.
Elena Sügis
 
2016 davis-biotech
2016 davis-biotech2016 davis-biotech
2016 davis-biotech
c.titus.brown
 
X team 2 - presentation
X team 2 - presentationX team 2 - presentation
X team 2 - presentation
Rayna Harris
 
Knowledge Science for AI-based biomedical and clinical applications
Knowledge Science for AI-based biomedical and clinical applicationsKnowledge Science for AI-based biomedical and clinical applications
Knowledge Science for AI-based biomedical and clinical applications
Catia Pesquita
 
The seven-deadly-sins-of-bioinformatics3960
The seven-deadly-sins-of-bioinformatics3960The seven-deadly-sins-of-bioinformatics3960
The seven-deadly-sins-of-bioinformatics3960
mare34
 

Similar to RDVW Hands-on session: Python (20)

Free webinar-introduction to bioinformatics - biologist-1
Free webinar-introduction to bioinformatics - biologist-1Free webinar-introduction to bioinformatics - biologist-1
Free webinar-introduction to bioinformatics - biologist-1
 
Curate locally, think globally
Curate locally, think globallyCurate locally, think globally
Curate locally, think globally
 
Computational of Bioinformatics
Computational of BioinformaticsComputational of Bioinformatics
Computational of Bioinformatics
 
Cartegena051811
Cartegena051811Cartegena051811
Cartegena051811
 
wolstencroft-ogf20-astro
wolstencroft-ogf20-astrowolstencroft-ogf20-astro
wolstencroft-ogf20-astro
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
A biologist in e-Science
A biologist in e-ScienceA biologist in e-Science
A biologist in e-Science
 
Data Visualization And Annotation Workshop at Biocuration 2015
Data Visualization And Annotation Workshop at Biocuration 2015Data Visualization And Annotation Workshop at Biocuration 2015
Data Visualization And Annotation Workshop at Biocuration 2015
 
DisGeNET Tutorial SWAT4LS 2015-12-07
DisGeNET Tutorial SWAT4LS 2015-12-07DisGeNET Tutorial SWAT4LS 2015-12-07
DisGeNET Tutorial SWAT4LS 2015-12-07
 
The beauty of workflows and models
The beauty of workflows and modelsThe beauty of workflows and models
The beauty of workflows and models
 
From Laboratory to e-Laboratory
From Laboratory to e-LaboratoryFrom Laboratory to e-Laboratory
From Laboratory to e-Laboratory
 
HKU Data Curation MLIM7350 Class 8
HKU Data Curation MLIM7350 Class 8HKU Data Curation MLIM7350 Class 8
HKU Data Curation MLIM7350 Class 8
 
2015 illinois-talk
2015 illinois-talk2015 illinois-talk
2015 illinois-talk
 
Minimal viable-datareuse-czi
Minimal viable-datareuse-cziMinimal viable-datareuse-czi
Minimal viable-datareuse-czi
 
Acting as Advocate? Seven steps for libraries in the data decade
Acting as Advocate? Seven steps for libraries in the data decadeActing as Advocate? Seven steps for libraries in the data decade
Acting as Advocate? Seven steps for libraries in the data decade
 
Introduction to Bioinformatics.
 Introduction to Bioinformatics. Introduction to Bioinformatics.
Introduction to Bioinformatics.
 
2016 davis-biotech
2016 davis-biotech2016 davis-biotech
2016 davis-biotech
 
X team 2 - presentation
X team 2 - presentationX team 2 - presentation
X team 2 - presentation
 
Knowledge Science for AI-based biomedical and clinical applications
Knowledge Science for AI-based biomedical and clinical applicationsKnowledge Science for AI-based biomedical and clinical applications
Knowledge Science for AI-based biomedical and clinical applications
 
The seven-deadly-sins-of-bioinformatics3960
The seven-deadly-sins-of-bioinformatics3960The seven-deadly-sins-of-bioinformatics3960
The seven-deadly-sins-of-bioinformatics3960
 

More from Leighton Pritchard

Reverse-and forward-engineering specificity of carbohydrate-processing enzymes
Reverse-and forward-engineering specificity of carbohydrate-processing enzymesReverse-and forward-engineering specificity of carbohydrate-processing enzymes
Reverse-and forward-engineering specificity of carbohydrate-processing enzymes
Leighton Pritchard
 
Comparative Genomics and Visualisation BS32010
Comparative Genomics and Visualisation BS32010Comparative Genomics and Visualisation BS32010
Comparative Genomics and Visualisation BS32010
Leighton Pritchard
 
Microbial Genomics and Bioinformatics: BM405 (2015)
Microbial Genomics and Bioinformatics: BM405 (2015)Microbial Genomics and Bioinformatics: BM405 (2015)
Microbial Genomics and Bioinformatics: BM405 (2015)
Leighton Pritchard
 
Microbial Agrogenomics 4/2/2015, UK-MX Workshop
Microbial Agrogenomics 4/2/2015, UK-MX WorkshopMicrobial Agrogenomics 4/2/2015, UK-MX Workshop
Microbial Agrogenomics 4/2/2015, UK-MX Workshop
Leighton Pritchard
 
Sequencing and Beyond?
Sequencing and Beyond?Sequencing and Beyond?
Sequencing and Beyond?
Leighton Pritchard
 
Highly Discriminatory Diagnostic Primer Design From Whole Genome Data
Highly Discriminatory Diagnostic Primer Design From Whole Genome DataHighly Discriminatory Diagnostic Primer Design From Whole Genome Data
Highly Discriminatory Diagnostic Primer Design From Whole Genome Data
Leighton Pritchard
 
ICSB 2013 - Visits Abroad Report
ICSB 2013 - Visits Abroad ReportICSB 2013 - Visits Abroad Report
ICSB 2013 - Visits Abroad Report
Leighton Pritchard
 
Adventures in Bioinformatics (2012)
Adventures in Bioinformatics (2012)Adventures in Bioinformatics (2012)
Adventures in Bioinformatics (2012)
Leighton Pritchard
 
Golden Rules of Bioinformatics
Golden Rules of BioinformaticsGolden Rules of Bioinformatics
Golden Rules of Bioinformatics
Leighton Pritchard
 
Plant Pathogen Genome Data: My Life In Sequences
Plant Pathogen Genome Data: My Life In SequencesPlant Pathogen Genome Data: My Life In Sequences
Plant Pathogen Genome Data: My Life In Sequences
Leighton Pritchard
 
Repeatable plant pathology bioinformatic analysis: Not everything is NGS data
Repeatable plant pathology bioinformatic analysis: Not everything is NGS dataRepeatable plant pathology bioinformatic analysis: Not everything is NGS data
Repeatable plant pathology bioinformatic analysis: Not everything is NGS data
Leighton Pritchard
 
What makes the enterobacterial plant pathogen Pectobacterium atrosepticum dif...
What makes the enterobacterial plant pathogen Pectobacterium atrosepticum dif...What makes the enterobacterial plant pathogen Pectobacterium atrosepticum dif...
What makes the enterobacterial plant pathogen Pectobacterium atrosepticum dif...
Leighton Pritchard
 
Rapid generation of E.coli O104:H4 PCR diagnostics
Rapid generation of E.coli O104:H4 PCR diagnosticsRapid generation of E.coli O104:H4 PCR diagnostics
Rapid generation of E.coli O104:H4 PCR diagnostics
Leighton Pritchard
 
Introduction to Bioinformatics
Introduction to BioinformaticsIntroduction to Bioinformatics
Introduction to Bioinformatics
Leighton Pritchard
 
Mining Plant Pathogen Genomes for Effectors
Mining Plant Pathogen Genomes for EffectorsMining Plant Pathogen Genomes for Effectors
Mining Plant Pathogen Genomes for Effectors
Leighton Pritchard
 
Comparative Genomics and Visualisation - Part 2
Comparative Genomics and Visualisation - Part 2Comparative Genomics and Visualisation - Part 2
Comparative Genomics and Visualisation - Part 2
Leighton Pritchard
 
Comparative Genomics and Visualisation - Part 1
Comparative Genomics and Visualisation - Part 1Comparative Genomics and Visualisation - Part 1
Comparative Genomics and Visualisation - Part 1
Leighton Pritchard
 
A Systems Biology Perspective on Plant-Pathogen Interactions 2012-05-08, Turin
A Systems Biology Perspective on Plant-Pathogen Interactions 2012-05-08, TurinA Systems Biology Perspective on Plant-Pathogen Interactions 2012-05-08, Turin
A Systems Biology Perspective on Plant-Pathogen Interactions 2012-05-08, Turin
Leighton Pritchard
 

More from Leighton Pritchard (18)

Reverse-and forward-engineering specificity of carbohydrate-processing enzymes
Reverse-and forward-engineering specificity of carbohydrate-processing enzymesReverse-and forward-engineering specificity of carbohydrate-processing enzymes
Reverse-and forward-engineering specificity of carbohydrate-processing enzymes
 
Comparative Genomics and Visualisation BS32010
Comparative Genomics and Visualisation BS32010Comparative Genomics and Visualisation BS32010
Comparative Genomics and Visualisation BS32010
 
Microbial Genomics and Bioinformatics: BM405 (2015)
Microbial Genomics and Bioinformatics: BM405 (2015)Microbial Genomics and Bioinformatics: BM405 (2015)
Microbial Genomics and Bioinformatics: BM405 (2015)
 
Microbial Agrogenomics 4/2/2015, UK-MX Workshop
Microbial Agrogenomics 4/2/2015, UK-MX WorkshopMicrobial Agrogenomics 4/2/2015, UK-MX Workshop
Microbial Agrogenomics 4/2/2015, UK-MX Workshop
 
Sequencing and Beyond?
Sequencing and Beyond?Sequencing and Beyond?
Sequencing and Beyond?
 
Highly Discriminatory Diagnostic Primer Design From Whole Genome Data
Highly Discriminatory Diagnostic Primer Design From Whole Genome DataHighly Discriminatory Diagnostic Primer Design From Whole Genome Data
Highly Discriminatory Diagnostic Primer Design From Whole Genome Data
 
ICSB 2013 - Visits Abroad Report
ICSB 2013 - Visits Abroad ReportICSB 2013 - Visits Abroad Report
ICSB 2013 - Visits Abroad Report
 
Adventures in Bioinformatics (2012)
Adventures in Bioinformatics (2012)Adventures in Bioinformatics (2012)
Adventures in Bioinformatics (2012)
 
Golden Rules of Bioinformatics
Golden Rules of BioinformaticsGolden Rules of Bioinformatics
Golden Rules of Bioinformatics
 
Plant Pathogen Genome Data: My Life In Sequences
Plant Pathogen Genome Data: My Life In SequencesPlant Pathogen Genome Data: My Life In Sequences
Plant Pathogen Genome Data: My Life In Sequences
 
Repeatable plant pathology bioinformatic analysis: Not everything is NGS data
Repeatable plant pathology bioinformatic analysis: Not everything is NGS dataRepeatable plant pathology bioinformatic analysis: Not everything is NGS data
Repeatable plant pathology bioinformatic analysis: Not everything is NGS data
 
What makes the enterobacterial plant pathogen Pectobacterium atrosepticum dif...
What makes the enterobacterial plant pathogen Pectobacterium atrosepticum dif...What makes the enterobacterial plant pathogen Pectobacterium atrosepticum dif...
What makes the enterobacterial plant pathogen Pectobacterium atrosepticum dif...
 
Rapid generation of E.coli O104:H4 PCR diagnostics
Rapid generation of E.coli O104:H4 PCR diagnosticsRapid generation of E.coli O104:H4 PCR diagnostics
Rapid generation of E.coli O104:H4 PCR diagnostics
 
Introduction to Bioinformatics
Introduction to BioinformaticsIntroduction to Bioinformatics
Introduction to Bioinformatics
 
Mining Plant Pathogen Genomes for Effectors
Mining Plant Pathogen Genomes for EffectorsMining Plant Pathogen Genomes for Effectors
Mining Plant Pathogen Genomes for Effectors
 
Comparative Genomics and Visualisation - Part 2
Comparative Genomics and Visualisation - Part 2Comparative Genomics and Visualisation - Part 2
Comparative Genomics and Visualisation - Part 2
 
Comparative Genomics and Visualisation - Part 1
Comparative Genomics and Visualisation - Part 1Comparative Genomics and Visualisation - Part 1
Comparative Genomics and Visualisation - Part 1
 
A Systems Biology Perspective on Plant-Pathogen Interactions 2012-05-08, Turin
A Systems Biology Perspective on Plant-Pathogen Interactions 2012-05-08, TurinA Systems Biology Perspective on Plant-Pathogen Interactions 2012-05-08, Turin
A Systems Biology Perspective on Plant-Pathogen Interactions 2012-05-08, Turin
 

Recently uploaded

Deep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless ReproducibilityDeep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless Reproducibility
University of Rennes, INSA Rennes, Inria/IRISA, CNRS
 
Applied Science: Thermodynamics, Laws & Methodology.pdf
Applied Science: Thermodynamics, Laws & Methodology.pdfApplied Science: Thermodynamics, Laws & Methodology.pdf
Applied Science: Thermodynamics, Laws & Methodology.pdf
University of Hertfordshire
 
What is greenhouse gasses and how many gasses are there to affect the Earth.
What is greenhouse gasses and how many gasses are there to affect the Earth.What is greenhouse gasses and how many gasses are there to affect the Earth.
What is greenhouse gasses and how many gasses are there to affect the Earth.
moosaasad1975
 
bordetella pertussis.................................ppt
bordetella pertussis.................................pptbordetella pertussis.................................ppt
bordetella pertussis.................................ppt
kejapriya1
 
THEMATIC APPERCEPTION TEST(TAT) cognitive abilities, creativity, and critic...
THEMATIC  APPERCEPTION  TEST(TAT) cognitive abilities, creativity, and critic...THEMATIC  APPERCEPTION  TEST(TAT) cognitive abilities, creativity, and critic...
THEMATIC APPERCEPTION TEST(TAT) cognitive abilities, creativity, and critic...
Abdul Wali Khan University Mardan,kP,Pakistan
 
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...
University of Maribor
 
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
Sérgio Sacani
 
Medical Orthopedic PowerPoint Templates.pptx
Medical Orthopedic PowerPoint Templates.pptxMedical Orthopedic PowerPoint Templates.pptx
Medical Orthopedic PowerPoint Templates.pptx
terusbelajar5
 
Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...
Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...
Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...
Ana Luísa Pinho
 
NuGOweek 2024 Ghent programme overview flyer
NuGOweek 2024 Ghent programme overview flyerNuGOweek 2024 Ghent programme overview flyer
NuGOweek 2024 Ghent programme overview flyer
pablovgd
 
Topic: SICKLE CELL DISEASE IN CHILDREN-3.pdf
Topic: SICKLE CELL DISEASE IN CHILDREN-3.pdfTopic: SICKLE CELL DISEASE IN CHILDREN-3.pdf
Topic: SICKLE CELL DISEASE IN CHILDREN-3.pdf
TinyAnderson
 
如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样
如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样
如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样
yqqaatn0
 
The binding of cosmological structures by massless topological defects
The binding of cosmological structures by massless topological defectsThe binding of cosmological structures by massless topological defects
The binding of cosmological structures by massless topological defects
Sérgio Sacani
 
8.Isolation of pure cultures and preservation of cultures.pdf
8.Isolation of pure cultures and preservation of cultures.pdf8.Isolation of pure cultures and preservation of cultures.pdf
8.Isolation of pure cultures and preservation of cultures.pdf
by6843629
 
Sharlene Leurig - Enabling Onsite Water Use with Net Zero Water
Sharlene Leurig - Enabling Onsite Water Use with Net Zero WaterSharlene Leurig - Enabling Onsite Water Use with Net Zero Water
Sharlene Leurig - Enabling Onsite Water Use with Net Zero Water
Texas Alliance of Groundwater Districts
 
The use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptx
The use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptxThe use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptx
The use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptx
MAGOTI ERNEST
 
3D Hybrid PIC simulation of the plasma expansion (ISSS-14)
3D Hybrid PIC simulation of the plasma expansion (ISSS-14)3D Hybrid PIC simulation of the plasma expansion (ISSS-14)
3D Hybrid PIC simulation of the plasma expansion (ISSS-14)
David Osipyan
 
Equivariant neural networks and representation theory
Equivariant neural networks and representation theoryEquivariant neural networks and representation theory
Equivariant neural networks and representation theory
Daniel Tubbenhauer
 
ESR spectroscopy in liquid food and beverages.pptx
ESR spectroscopy in liquid food and beverages.pptxESR spectroscopy in liquid food and beverages.pptx
ESR spectroscopy in liquid food and beverages.pptx
PRIYANKA PATEL
 
Micronuclei test.M.sc.zoology.fisheries.
Micronuclei test.M.sc.zoology.fisheries.Micronuclei test.M.sc.zoology.fisheries.
Micronuclei test.M.sc.zoology.fisheries.
Aditi Bajpai
 

Recently uploaded (20)

Deep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless ReproducibilityDeep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless Reproducibility
 
Applied Science: Thermodynamics, Laws & Methodology.pdf
Applied Science: Thermodynamics, Laws & Methodology.pdfApplied Science: Thermodynamics, Laws & Methodology.pdf
Applied Science: Thermodynamics, Laws & Methodology.pdf
 
What is greenhouse gasses and how many gasses are there to affect the Earth.
What is greenhouse gasses and how many gasses are there to affect the Earth.What is greenhouse gasses and how many gasses are there to affect the Earth.
What is greenhouse gasses and how many gasses are there to affect the Earth.
 
bordetella pertussis.................................ppt
bordetella pertussis.................................pptbordetella pertussis.................................ppt
bordetella pertussis.................................ppt
 
THEMATIC APPERCEPTION TEST(TAT) cognitive abilities, creativity, and critic...
THEMATIC  APPERCEPTION  TEST(TAT) cognitive abilities, creativity, and critic...THEMATIC  APPERCEPTION  TEST(TAT) cognitive abilities, creativity, and critic...
THEMATIC APPERCEPTION TEST(TAT) cognitive abilities, creativity, and critic...
 
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...
 
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
 
Medical Orthopedic PowerPoint Templates.pptx
Medical Orthopedic PowerPoint Templates.pptxMedical Orthopedic PowerPoint Templates.pptx
Medical Orthopedic PowerPoint Templates.pptx
 
Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...
Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...
Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...
 
NuGOweek 2024 Ghent programme overview flyer
NuGOweek 2024 Ghent programme overview flyerNuGOweek 2024 Ghent programme overview flyer
NuGOweek 2024 Ghent programme overview flyer
 
Topic: SICKLE CELL DISEASE IN CHILDREN-3.pdf
Topic: SICKLE CELL DISEASE IN CHILDREN-3.pdfTopic: SICKLE CELL DISEASE IN CHILDREN-3.pdf
Topic: SICKLE CELL DISEASE IN CHILDREN-3.pdf
 
如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样
如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样
如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样
 
The binding of cosmological structures by massless topological defects
The binding of cosmological structures by massless topological defectsThe binding of cosmological structures by massless topological defects
The binding of cosmological structures by massless topological defects
 
8.Isolation of pure cultures and preservation of cultures.pdf
8.Isolation of pure cultures and preservation of cultures.pdf8.Isolation of pure cultures and preservation of cultures.pdf
8.Isolation of pure cultures and preservation of cultures.pdf
 
Sharlene Leurig - Enabling Onsite Water Use with Net Zero Water
Sharlene Leurig - Enabling Onsite Water Use with Net Zero WaterSharlene Leurig - Enabling Onsite Water Use with Net Zero Water
Sharlene Leurig - Enabling Onsite Water Use with Net Zero Water
 
The use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptx
The use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptxThe use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptx
The use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptx
 
3D Hybrid PIC simulation of the plasma expansion (ISSS-14)
3D Hybrid PIC simulation of the plasma expansion (ISSS-14)3D Hybrid PIC simulation of the plasma expansion (ISSS-14)
3D Hybrid PIC simulation of the plasma expansion (ISSS-14)
 
Equivariant neural networks and representation theory
Equivariant neural networks and representation theoryEquivariant neural networks and representation theory
Equivariant neural networks and representation theory
 
ESR spectroscopy in liquid food and beverages.pptx
ESR spectroscopy in liquid food and beverages.pptxESR spectroscopy in liquid food and beverages.pptx
ESR spectroscopy in liquid food and beverages.pptx
 
Micronuclei test.M.sc.zoology.fisheries.
Micronuclei test.M.sc.zoology.fisheries.Micronuclei test.M.sc.zoology.fisheries.
Micronuclei test.M.sc.zoology.fisheries.
 

RDVW Hands-on session: Python

  • 1. Hands-on session: Python Research Data Visualisation Workshop Leighton Pritchard1,2,3 1 Information and Computational Sciences, 2 Centre for Human and Animal Pathogens in the Environment, 3 Dundee Effector Consortium, The James Hutton Institute, Invergowrie, Dundee, Scotland, DD2 5DA
  • 2. Acceptable Use Policy Recording of this talk, taking photos, discussing the content using email, Twitter, blogs, etc. is permitted (and encouraged), providing distraction to others is minimised. These slides will be made available at http://www.slideshare.net/leightonp
  • 3. Table of Contents 1 Introduction Why listen to me? What is visualisation? Elementary perceptual tasks 2 Evidence-based representation What representations work best? Pie charts Bars and lines Scatterplots Interactive plots 3 Hands-on session Python libraries Exercises Let’s get started
  • 4. What I do Computational biologist (1996-present) protein sequence-structure-function (1996-1999) yeast metabolism (1999-2003) plant pathology (2003-present) Large datasets sequence/genomic metabolomics statistics geographical Visualisation as communication to (wet) biologists protein structures metabolic flux comparative genomics/evolution statistical plots
  • 6. GenomeDiagram a b c a Pritchard et al. (2006) Bioinformatics doi:10.1093/bioinformatics/btk021 b Toth et al. (2006) Ann. Rev. Phytopath. doi:10.1146/annurev.phyto.44.070505.143444 c http://biopython.org
  • 7. Functional adaptation in Pba a b a Toth et al. (2006) Ann. Rev. Phytopath. doi:10.1146/annurev.phyto.44.070505.143444 b http://biopython.org
  • 8. GenomeDiagram/SciArt a b c a Pritchard et al. (2006) Bioinformatics doi:10.1093/bioinformatics/btk021 b Shemilt (2009) in“Digital Visual Culture: Theory and Practice” ISBN 978-1-84150-248-9 c Shemilt (2010) in “Art Practice in a Digital Culture”, ISBN 978-0-7546-7623-2 Influence Free open-source comparative genomics visualisation library Impact Artwork (prints, audio-visual installation) exhibited in UK and internationally
  • 9. Comparative metabolism a b c a Biopython KGML/KEGG visualisation module b https://github.com/widdowquinn/Notebooks-Bioinformatics c http://biopython.org
  • 10. PyANI: prokaryote classification a b a http://widdowquinn.github.io/pyani/ b Pritchard et al. (2016) Anal. Methods doi:10.1039/C5AY02550H Pectobacterium_atrosepticum_SCRI1043_uid57957Pectobacterium_atrosepticum_NCPPB8549Pectobacterium_atrosepticum_NCPPB3404Pectobacterium_atrosepticum_21APectobacterium_atrosepticum_JG10-08Pectobacterium_carotovorum_PC1_uid59295Pectobacterium_carotovorum_subsp_carotovorum_NCPPB312Pectobacterium_carotovorum_subsp_oderiferum_NCPPB3841Pectobacterium_carotovorum_subsp_oderiferum_NCPPB3839Pectobacterium_carotovorum_subsp_carotovorum_NCPPB3395Pectobacterium_carotovorum_PCC21_uid174335Pectobacterium_carotovorum_subsp_brasiliensis_B5Pectobacterium_carotovorum_subsp_brasiliensis_B4Pectobacterium_betavasculorum_NCPPB2293Pectobacterium_betavasculorum_NCPPB2795Pectobacterium_wasabiae_NCPPB3702Pectobacterium_wasabiae_NCPPB3701Pectobacterium_wasabiae_WPP163_uid41297Pectobacterium_SCC3193_uid193707Dickeya_solani_AMYI01Dickeya_solani_AMWE01Dickeya_solani_GBBC2040Dickeya_solani_IPO2222Dickeya_solani_MK16Dickeya_solani_MK10Dickeya_dianthicola_NCPPB_3534Dickeya_dianthicola_GBBC2039Dickeya_dianthicola_NCPPB_453Dickeya_dianthicola_IPO980Dickeya_spp_NCPPB_3274Dickeya_spp_MK7Dickeya_dadantii_NCPPB_2976Dickeya_dadantii_NCPPB_898Dickeya_dadantii_NCPPB_3537Dickeya_dadantii_3937_uid52537Pantoea_ananatis_AJ13355_uid162073Pantoea_ananatis_LMG_20103_uid46807Pantoea_ananatis_PA13_uid162181Pantoea_ananatis_uid86861Erwinia_amylovora_CFBP1430_uid46839Erwinia_amylovora_ATCC_49946_uid46943Erwinia_Ejp617_uid159955Erwinia_pyrifoliae_Ep1_96_uid40659Erwinia_pyrifoliae_DSM_12163_uid159693Dickeya_dadantii_Ech703_uid59363Dickeya_paradisiaca_NCPPB_2511Dickeya_aquatica_DW_0440Dickeya_aquatica_CSL_RW240Erwinia_tasmaniensis_Et1_99_uid59029Pantoea_At_9b_uid55845Pantoea_vagans_C9_1_uid49871Erwinia_billingiae_Eb661_uid50547Dickeya_zeae_APMV01Dickeya_zeae_AJVN01Dickeya_zeae_CSL_RW192Dickeya_zeae_NCPPB_3531Dickeya_dadantii_Ech586_uid42519Dickeya_zeae_APWM01Dickeya_zeae_NCPPB_2538Dickeya_zeae_MK19Dickeya_zeae_NCPPB_3532Dickeya_spp_NCPPB_569Dickeya_chrysanthami_NCPPB_402Dickeya_chrysanthami_NCPPB_516Dickeya_zeae_Ech1591_uid59297Dickeya_chrysanthami_NCPPB_3533 Pectobacterium_atrosepticum_SCRI1043_uid57957Pectobacterium_atrosepticum_NCPPB8549Pectobacterium_atrosepticum_NCPPB3404Pectobacterium_atrosepticum_21APectobacterium_atrosepticum_JG10-08Pectobacterium_carotovorum_PC1_uid59295Pectobacterium_carotovorum_subsp_carotovorum_NCPPB312Pectobacterium_carotovorum_subsp_oderiferum_NCPPB3841Pectobacterium_carotovorum_subsp_oderiferum_NCPPB3839Pectobacterium_carotovorum_subsp_carotovorum_NCPPB3395Pectobacterium_carotovorum_PCC21_uid174335Pectobacterium_carotovorum_subsp_brasiliensis_B5Pectobacterium_carotovorum_subsp_brasiliensis_B4Pectobacterium_betavasculorum_NCPPB2293Pectobacterium_betavasculorum_NCPPB2795Pectobacterium_wasabiae_NCPPB3702Pectobacterium_wasabiae_NCPPB3701Pectobacterium_wasabiae_WPP163_uid41297Pectobacterium_SCC3193_uid193707Dickeya_solani_AMYI01Dickeya_solani_AMWE01Dickeya_solani_GBBC2040Dickeya_solani_IPO2222Dickeya_solani_MK16Dickeya_solani_MK10Dickeya_dianthicola_NCPPB_3534Dickeya_dianthicola_GBBC2039Dickeya_dianthicola_NCPPB_453Dickeya_dianthicola_IPO980Dickeya_spp_NCPPB_3274Dickeya_spp_MK7Dickeya_dadantii_NCPPB_2976Dickeya_dadantii_NCPPB_898Dickeya_dadantii_NCPPB_3537Dickeya_dadantii_3937_uid52537Pantoea_ananatis_AJ13355_uid162073Pantoea_ananatis_LMG_20103_uid46807Pantoea_ananatis_PA13_uid162181Pantoea_ananatis_uid86861Erwinia_amylovora_CFBP1430_uid46839Erwinia_amylovora_ATCC_49946_uid46943Erwinia_Ejp617_uid159955Erwinia_pyrifoliae_Ep1_96_uid40659Erwinia_pyrifoliae_DSM_12163_uid159693Dickeya_dadantii_Ech703_uid59363Dickeya_paradisiaca_NCPPB_2511Dickeya_aquatica_DW_0440Dickeya_aquatica_CSL_RW240Erwinia_tasmaniensis_Et1_99_uid59029Pantoea_At_9b_uid55845Pantoea_vagans_C9_1_uid49871Erwinia_billingiae_Eb661_uid50547Dickeya_zeae_APMV01Dickeya_zeae_AJVN01Dickeya_zeae_CSL_RW192Dickeya_zeae_NCPPB_3531Dickeya_dadantii_Ech586_uid42519Dickeya_zeae_APWM01Dickeya_zeae_NCPPB_2538Dickeya_zeae_MK19Dickeya_zeae_NCPPB_3532Dickeya_spp_NCPPB_569Dickeya_chrysanthami_NCPPB_402Dickeya_chrysanthami_NCPPB_516Dickeya_zeae_Ech1591_uid59297Dickeya_chrysanthami_NCPPB_3533 0.00 0.25 0.50 0.75 1.00 ANIm_percentage_identity
  • 11. Table of Contents 1 Introduction Why listen to me? What is visualisation? Elementary perceptual tasks 2 Evidence-based representation What representations work best? Pie charts Bars and lines Scatterplots Interactive plots 3 Hands-on session Python libraries Exercises Let’s get started
  • 12. Data visualisation is art and science . . .storytelling in pictorial or graphical format Stories to yourself (sense-making) Stories to others (communication) Cautionary tales
  • 13. Data visualisation is art and science . . .storytelling in pictorial or graphical format Stories to yourself (sense-making) summarise big stories quickly data exploration and mining identify areas/items of importance find relationships and patterns Stories to others (communication) Cautionary tales
  • 14. Data visualisation is art and science . . .storytelling in pictorial or graphical format Stories to yourself (sense-making) summarise big stories quickly data exploration and mining identify areas/items of importance find relationships and patterns Stories to others (communication) present your interpretation of data make a specific point assert a relationship or pattern demonstrate significance Cautionary tales
  • 15. Data visualisation is art and science . . .storytelling in pictorial or graphical format Stories to yourself (sense-making) summarise big stories quickly data exploration and mining identify areas/items of importance find relationships and patterns Stories to others (communication) present your interpretation of data make a specific point assert a relationship or pattern demonstrate significance Cautionary tales avoid distortion make the reader think about the data, not the presentation avoid chartjunk (excessive decoration) aim for high data:ink ratio
  • 16. The point of data visualisation a a https://en.wikipedia.org/wiki/Data visualization Where does visualisation belong?
  • 17. Communicating effectively Understand the data Know (or be receptive to) the message Know your audience
  • 18. Communicating effectively Understand the data size cardinality meaning relationships Know (or be receptive to) the message Know your audience
  • 19. Communicating effectively Understand the data size cardinality meaning relationships Know (or be receptive to) the message what does pictorial representation mean? match graphical relationships to data relationships Know your audience
  • 20. Communicating effectively Understand the data size cardinality meaning relationships Know (or be receptive to) the message what does pictorial representation mean? match graphical relationships to data relationships Know your audience how do people process pictorial information how does your audience process information domain-specific representations
  • 21. A model of communication a a Randy Olson (2009) Don’t Be Such a Scientist
  • 22. Table of Contents 1 Introduction Why listen to me? What is visualisation? Elementary perceptual tasks 2 Evidence-based representation What representations work best? Pie charts Bars and lines Scatterplots Interactive plots 3 Hands-on session Python libraries Exercises Let’s get started
  • 23. Elementary Perceptual Tasks a a Cleveland & McGill (1984) J. Am. Stat. Ass. The most basic visual tasks:
  • 24. Implementations a a http://www.datavizcatalogue.com/ Position: common scale Scatterplot Bar Chart Angle Pie Chart Do(ugh)nut Chart Curvature Arc Diagram Chord Diagram
  • 25. Gestalt principles a a https://emeeks.github.io/gestaltdataviz proximity: close objects perceived as groups enclosure: bounded objects perceived as groups continuity: aligned objects perceived as continuous similarity: similar attributes perceived as groups closure: open objects perceived as complete connection: connected items perceived as groups
  • 26. Table of Contents 1 Introduction Why listen to me? What is visualisation? Elementary perceptual tasks 2 Evidence-based representation What representations work best? Pie charts Bars and lines Scatterplots Interactive plots 3 Hands-on session Python libraries Exercises Let’s get started
  • 27. What works best? Experiment a b a Cleveland & McGill (1984) J. Am. Stat. Ass. b Heer & Bostock (2010) CHI 2010 Empirical measurements of interpretation Subjects shown graphs representing same data (log2) error in subjects’ accuracy compared by graph type Judgement types 1-3: Position on a common scale (bar chart, stacked bar chart) 4-5: Length encoding (stacked bar chart) 6: Angle (pie chart) 7-9: Area (bubble chart, aligned rectangles, treemap)
  • 28. What works best? Result a b a Cleveland & McGill (1984) J. Am. Stat. Ass. b Heer & Bostock (2010) CHI 2010 We have inherent biases that can distort information recovered Position > Angle ≈ Length > Area Accuracy plateaus as charts increase in size Gridlines improve accuracy Aspect ratios affect area judgements (squares worst)
  • 29. Table of Contents 1 Introduction Why listen to me? What is visualisation? Elementary perceptual tasks 2 Evidence-based representation What representations work best? Pie charts Bars and lines Scatterplots Interactive plots 3 Hands-on session Python libraries Exercises Let’s get started
  • 30. People hate pie charts http://www.storytellingwithdata.com/blog/2011/07/death-to-pie-charts especially Edward Tufte A table is nearly always better than a dumb pie chart; the only worse design than a pie chart is several of them[...] pie charts should never be used. - ”The Visual Display of Quantitative Information”
  • 31. ”E pur si muove. . .” a b a Eells (1926) J Am. Stat. Ass. b Simkin & Hastie (1987) J Am. Stat. Ass. For proportions of a whole: Pie charts read as accurately as bar charts As number of components in the chart increases, bars are less efficient than pie charts
  • 32. Table of Contents 1 Introduction Why listen to me? What is visualisation? Elementary perceptual tasks 2 Evidence-based representation What representations work best? Pie charts Bars and lines Scatterplots Interactive plots 3 Hands-on session Python libraries Exercises Let’s get started
  • 33. Bar charts are bad. . .mmmkay? There is an ongoing backlash against bar charts (and I’m not picking on Nick, he just tweets a lot. . .) But are they really that bad?
  • 34. Interpretation of bars and lines a a Zacks & Tversky (1999) Mem. Cognit. People interpret bars and lines differently Experiment 1: In absence of context (arbitrary X, Y ) bars: discrete comparison (24:0) lines: trend assessment (0:35)
  • 35. Interpretation of bars and lines a a Zacks & Tversky (1999) Mem. Cognit. People interpret bars and lines differently Experiment 2: With context (discrete or continuous data)
  • 36. Bars vs. lines People naturally interpret bar charts as categorical data People naturally interpret line graphs as trends Using bars for trend data or lines for categorical data can mislead the reader
  • 37. Bar charts can mislead a a Weissgerber et al. (2015) PLoS Biol. doi:10.1371/journal.pbio.1002128 Do these bars differ in value?
  • 38. Bar charts can mislead a a Weissgerber et al. (2015) PLoS Biol. doi:10.1371/journal.pbio.1002128 Do these bars differ in value? Bar charts represent data as a single point: lossy compression. Could different datasets give the same bar chart?
  • 39. Bars are lossy compression a a Weissgerber et al. (2015) PLoS Biol. doi:10.1371/journal.pbio.1002128 Bars hide detail: Number of data points Variance of data points Distribution of data points (outliers, etc.)
  • 40. Bars may mislead on statistics a a Weissgerber et al. (2015) PLoS Biol. doi:10.1371/journal.pbio.1002128 Bars may imply incorrect test statistics: Overlaps, outliers, covariates, sample sizes masked
  • 41. Bars for paired data a a Weissgerber et al. (2015) PLoS Biol. doi:10.1371/journal.pbio.1002128 Bars imply independence of data:
  • 42. Better than bar charts? Bar chart with SE bars suggests group 2 is highest
  • 43. Better than bar charts? Bar chart with SD bars suggests there is overlap
  • 44. Better than bar charts? Univariate scatterplots show sample sizes, outliers, variance
  • 45. Any chart can mislead
  • 46. Any chart can mislead a a Spurious Correlations, tylervigen.com
  • 47. Any chart can mislead a a https://xkcd.com/1138/
  • 48. Table of Contents 1 Introduction Why listen to me? What is visualisation? Elementary perceptual tasks 2 Evidence-based representation What representations work best? Pie charts Bars and lines Scatterplots Interactive plots 3 Hands-on session Python libraries Exercises Let’s get started
  • 49. Scatterplots a a http://www.datavizcatalogue.com/ Scatterplots should be awesome: Positions on common scale (lowest error representation) Show all data: outliers, sample sizes, trends, etc.
  • 50. Framing affects interpretation a a Cleveland et al. (1982) Science doi:10.1126/science.216.4550.1138 Point cloud size affects interpretation of correlation (more diffuse interpreted as lower correlation coefficient)
  • 51. Interpreting correlation is difficult a a Fisher et al. (2014) PeerJ doi:10.7717/peerj.589 People don’t judge significance well 47.4% of significant relationships correctly classified 74.6% of non-significant relationships correctly classified
  • 52. Table of Contents 1 Introduction Why listen to me? What is visualisation? Elementary perceptual tasks 2 Evidence-based representation What representations work best? Pie charts Bars and lines Scatterplots Interactive plots 3 Hands-on session Python libraries Exercises Let’s get started
  • 53. Latency affects usage Increasing latency to 0.5s: decreases user activity decreases dataset coverage reduces rate of hypothesis generation changes data exploration strategy reduces future interaction with other graphics
  • 54. Table of Contents 1 Introduction Why listen to me? What is visualisation? Elementary perceptual tasks 2 Evidence-based representation What representations work best? Pie charts Bars and lines Scatterplots Interactive plots 3 Hands-on session Python libraries Exercises Let’s get started
  • 55. Python libraries Matplotlib http://matplotlib.org/ Seaborn https://stanford.edu/ mwaskom/software/seaborn/ ggplot for Python http://yhat.github.io/ggplot/ Bokeh http://bokeh.pydata.org/
  • 56. Table of Contents 1 Introduction Why listen to me? What is visualisation? Elementary perceptual tasks 2 Evidence-based representation What representations work best? Pie charts Bars and lines Scatterplots Interactive plots 3 Hands-on session Python libraries Exercises Let’s get started
  • 57. Exercise choices a a https://github.com/widdowquinn/Teaching-Data-Visualisation One-variable, continuous data Grammar of Graphics Interactive map with bokeh Two-variable, continuous x, y data Arrays, colormaps, surface plots Making movies
  • 58. Table of Contents 1 Introduction Why listen to me? What is visualisation? Elementary perceptual tasks 2 Evidence-based representation What representations work best? Pie charts Bars and lines Scatterplots Interactive plots 3 Hands-on session Python libraries Exercises Let’s get started
  • 59. Let’s get started a a https://xkcd.com/1654/
  • 60. Licence: CC-BY-SA By: Leighton Pritchard This presentation is licensed under the Creative Commons Attribution ShareAlike license https://creativecommons.org/licenses/by-sa/4.0/