SlideShare a Scribd company logo
1 of 21
Download to read offline
http://publicationslist.org/junio
Data Analysis
More Than Two Variables: Graphical Multivariate Analysis
Prof. Dr. Jose Fernando Rodrigues Junior
ICMC-USP
http://publicationslist.org/junio
What is it about?
More than two variables determine a tough analytical problem
Although, there are multivariate graphical techniques, they can
not deal too many variables (less than 15 – 25)
http://publicationslist.org/junio
What is it about?
More than two variables determine a tough analytical
problem
Although, there are multivariate graphical techniques, they
can not deal too many variables (less than 15 – 25)
http://publicationslist.org/junio
Three variables
For example, consider the data defined by function:
= , =	
2
+ −
2
+
4
that corresponds to the three-variable setting y, x, and a
One way to analyze this is by means of a surface plot:
http://publicationslist.org/junio
Three variables
However, the surface plot does not permit an intuitive
reading of quantitative information
Another way is to use a two-dimensional xy plot with
multiple curves, one for each value of interest of one of the
variables; in the example, variable a is considered for values 2,
1, 0, -1, -2
http://publicationslist.org/junio
Three variables
Surface plots and multiple-curve xy plots can be in
combination, one providing an aesthetically appealing
overview, the other providing fine details for values of
interest
It is interesting to note that surface plots go against the
commonsense that 3D plots should be more
informative than 2D plots
Yet another possibility is to project the function into the
base plane bellow the surface, using either:
 contour plots
 false-color plots
http://publicationslist.org/junio
Three variables
The false-color plot is achieved by mapping all values of the
dependent variable y following a palette of colors
The false-color plot for function f(x, a)
http://publicationslist.org/junio
Three variables
The false-color plot is achieved by mapping all values of the
dependent variable y following a palette of colors
The false-color plot for function f(x, a)
False-color plots are very effective for presenting
quantitative information
It is important to note, however, that its efficiency
depends heavily on the color mapping, which must be
intuitive according to the task at hand
For an overview of color-mapping guidelines, see the
textbook on page 104
http://publicationslist.org/junio
More
There are basically two ways to get more information on a
plot
 Put similar graphs next to each other and vary the
variables in a systematic fashion from one subgraph to the next 
multiplots
 Make the graph elements themselves richer with color, shape,
and interaction
Multiplots
 The most common forms of multiplots are the scatter-plot
matrix, and the co-plot,
http://publicationslist.org/junio
Scatter-plot matrix
The scatter-plot matrix is constructed considering all the
possible two-variable combinations achieved from the
set of variables
For each combination, a sub-region of the space is
reserved and all the combinations are put together according
to a straight layout
The more variables, the bigger must be the screen,
limits start to manifest around 10 variables, the same for the
number of data points, limited around 100
For example, consider a 250 wines data set consisting of
seven different properties: acidity, sugar, chlorides, sulfur
dioxide, density, alcohol, and quality
The data can be found in the “Wine Quality” data set, available at the UCI Machine
Learning repository - http://archive.ics.uci.edu/ml/.
http://publicationslist.org/junio
Scatter-plot matrix
http://publicationslist.org/junio
Scatter-plot matrix
The scatter plot shows:
• sugar content and density are positively correlated
• as the alcohol content goes up, density goes down, inverse
correlation
• wine quality seems to increase with increasing alcohol content:
apparently, more potent wines are considered to be better
http://publicationslist.org/junio
Co-plots
Co-plots work by partitioning the data according to one of
the variables and plotting each partition in a different plot
In this example, the upper
figure shows how one of
the variables was used to
partition the data
 It is possible to notice that
the intervals overlap and have
different sizes so that each
plot has the same number
of points
http://publicationslist.org/junio
Co-plots
Co-plots work by partitioning the data according to one of
the variables and plotting each partition in a different plot
In this example, the upper
figure shows how one of
the variables was used to
partition the data
 It is possible to notice that
the intervals overlap and have
different sizes so that each
plot has the same number
of points
The partitioning can be determined in many different
ways according to the goals of the analysis
http://publicationslist.org/junio
Composition
Another way to visualize more than two variables is to
compose multiple plots according to some of the variables
For example: imagine a company that makes five products
labeled A, B, C, D, and E, and two questions:
 how many items of each kind are produced overall
 how the item mix is changing over time
A simple solution, but not quite
effective is to plot (days x quantity)
the different curves all together
http://publicationslist.org/junio
Composition
Another solution is to use the same plot but stacking the
information, so as to have a notion of total
 In absolute numbers (left), or
 In relative contributions (percentage)
http://publicationslist.org/junio
Composition
Another solution is to use the same plot but stacking the
information, so as to have a notion of total
 In absolute numbers (left), or
 In relative contributions (percentage)
Surprisingly, the total number of
items does not change appreciably
over the long run
Here, one can see the contribution of
each product, an information that
was lost in the left plot
http://publicationslist.org/junio
Composition
Another solution is to use the same plot but stacking the
information, so as to have a notion of total
 In absolute numbers (left), or
 In relative contributions (percentage)
Surprisingly, the total number of
items does not change appreciably
over the long run
Here, one can see the contribution of
each product, an information that
was lost in the left plot
It is desirable to use multiple kinds of plots combined
in order to verify the many aspects of data
http://publicationslist.org/junio
Parallel Coordinates
In a parallel coordinate plot, the coordinate axes are parallel
to each other
For every data point, its value for each of the variables is
marked on the corresponding axis, and then all these points
are connected with lines
http://publicationslist.org/junio
Information Visualization
 Many other techniques are presented according the findings of the field
known as InformationVisualization:
 Glyphs
 Chernoff Faces
 Tree-maps
 Star coordinates
 Table Lens
 Multidimensional Projection
 And many others, all improved by means of interaction techniques:
 Querying and zooming
 Linking and Brushing
 Combined projections, and so forth
http://publicationslist.org/junio
References
 Philipp K. Janert, Data Analysis with Open Source Tools,
O’Reilly, 2010.
 Wikipedia, http://en.wikipedia.org
 Wolfram MathWorld, http://mathworld.wolfram.com/

More Related Content

What's hot

Statistics
StatisticsStatistics
Statisticsdiereck
 
Presenting information presentation.
Presenting information presentation.Presenting information presentation.
Presenting information presentation.danielle93
 
Different types of charts
Different types of chartsDifferent types of charts
Different types of chartsZakaria Salim
 
Choosing the Right Graphs
Choosing the Right GraphsChoosing the Right Graphs
Choosing the Right GraphsOdogray
 
Greater Cambridge Greater Peterborough Economic Assessment Data Atlas Intro
Greater Cambridge Greater Peterborough Economic Assessment Data Atlas IntroGreater Cambridge Greater Peterborough Economic Assessment Data Atlas Intro
Greater Cambridge Greater Peterborough Economic Assessment Data Atlas IntroCambridgeshireInsight
 
Types of graphs and charts and their uses with examples and pics
Types of graphs and charts and their uses  with examples and picsTypes of graphs and charts and their uses  with examples and pics
Types of graphs and charts and their uses with examples and picsMakati Science High School
 
Top 7 types of Statistics Graphs for Data Representation
Top 7 types of Statistics Graphs for Data RepresentationTop 7 types of Statistics Graphs for Data Representation
Top 7 types of Statistics Graphs for Data RepresentationStat Analytica
 
Understanding on graph in bpo
Understanding on graph in bpoUnderstanding on graph in bpo
Understanding on graph in bpoAjit Kumar
 
Electronic spreadsheet
Electronic spreadsheetElectronic spreadsheet
Electronic spreadsheetSomveerSingh20
 
How to visualize web analytics data and choose a graph?
How to visualize web analytics data and choose a graph?How to visualize web analytics data and choose a graph?
How to visualize web analytics data and choose a graph?Janvi Arora
 
Pivot table essential learning 1
Pivot table   essential learning 1Pivot table   essential learning 1
Pivot table essential learning 1Vijay Perepa
 

What's hot (20)

Graphical Displays of Data
Graphical Displays of DataGraphical Displays of Data
Graphical Displays of Data
 
Statistics
StatisticsStatistics
Statistics
 
Presenting information presentation.
Presenting information presentation.Presenting information presentation.
Presenting information presentation.
 
Ways of presenting information
Ways of presenting informationWays of presenting information
Ways of presenting information
 
Graphs
GraphsGraphs
Graphs
 
Different types of charts
Different types of chartsDifferent types of charts
Different types of charts
 
Types of Chart
Types of ChartTypes of Chart
Types of Chart
 
Types of Charts
Types of ChartsTypes of Charts
Types of Charts
 
Choosing the Right Graphs
Choosing the Right GraphsChoosing the Right Graphs
Choosing the Right Graphs
 
Greater Cambridge Greater Peterborough Economic Assessment Data Atlas Intro
Greater Cambridge Greater Peterborough Economic Assessment Data Atlas IntroGreater Cambridge Greater Peterborough Economic Assessment Data Atlas Intro
Greater Cambridge Greater Peterborough Economic Assessment Data Atlas Intro
 
Types of graphs and charts and their uses with examples and pics
Types of graphs and charts and their uses  with examples and picsTypes of graphs and charts and their uses  with examples and pics
Types of graphs and charts and their uses with examples and pics
 
Top 7 types of Statistics Graphs for Data Representation
Top 7 types of Statistics Graphs for Data RepresentationTop 7 types of Statistics Graphs for Data Representation
Top 7 types of Statistics Graphs for Data Representation
 
Charts And Graphs
Charts And GraphsCharts And Graphs
Charts And Graphs
 
TYPES ON CHARTS
TYPES ON CHARTS TYPES ON CHARTS
TYPES ON CHARTS
 
Understanding on graph in bpo
Understanding on graph in bpoUnderstanding on graph in bpo
Understanding on graph in bpo
 
Electronic spreadsheet
Electronic spreadsheetElectronic spreadsheet
Electronic spreadsheet
 
How to visualize web analytics data and choose a graph?
How to visualize web analytics data and choose a graph?How to visualize web analytics data and choose a graph?
How to visualize web analytics data and choose a graph?
 
R
RR
R
 
Julie S
Julie SJulie S
Julie S
 
Pivot table essential learning 1
Pivot table   essential learning 1Pivot table   essential learning 1
Pivot table essential learning 1
 

Viewers also liked

Muni chatarpal considerations for grc
Muni chatarpal   considerations for grcMuni chatarpal   considerations for grc
Muni chatarpal considerations for grcjpkush
 
Introduction to matlab lecture 4 of 4
Introduction to matlab lecture 4 of 4Introduction to matlab lecture 4 of 4
Introduction to matlab lecture 4 of 4Randa Elanwar
 
Matlab ploting
Matlab plotingMatlab ploting
Matlab plotingAmeen San
 
Use of GIS in house hunting in Roorkee
Use of GIS in house hunting in RoorkeeUse of GIS in house hunting in Roorkee
Use of GIS in house hunting in RoorkeeAkhil Prabhakar
 
Optimization techniques in pharmaceutical formulation and processing
Optimization techniques in pharmaceutical formulation and processingOptimization techniques in pharmaceutical formulation and processing
Optimization techniques in pharmaceutical formulation and processingReshma Fathima .K
 
Minitab simplified by chetan dhal
Minitab simplified by chetan dhalMinitab simplified by chetan dhal
Minitab simplified by chetan dhalChetan Dhal
 
Lecture 02 visualization and programming
Lecture 02   visualization and programmingLecture 02   visualization and programming
Lecture 02 visualization and programmingSmee Kaem Chann
 
Optimization techniques in pharmaceutical processing
Optimization techniques in pharmaceutical processingOptimization techniques in pharmaceutical processing
Optimization techniques in pharmaceutical processingNaval Garg
 
Meet minitab tutorial
Meet minitab tutorialMeet minitab tutorial
Meet minitab tutorialshanmu31
 
controlled drug delivery system classification
controlled drug delivery system classificationcontrolled drug delivery system classification
controlled drug delivery system classificationravipharmabwm
 
OPTIMIZATION IN PHARMACEUTICS,FORMULATION & PROCESSING
OPTIMIZATION IN PHARMACEUTICS,FORMULATION & PROCESSINGOPTIMIZATION IN PHARMACEUTICS,FORMULATION & PROCESSING
OPTIMIZATION IN PHARMACEUTICS,FORMULATION & PROCESSINGJamia Hamdard
 

Viewers also liked (17)

Graph Plots in Matlab
Graph Plots in MatlabGraph Plots in Matlab
Graph Plots in Matlab
 
Matlab plotting
Matlab plottingMatlab plotting
Matlab plotting
 
Muni chatarpal considerations for grc
Muni chatarpal   considerations for grcMuni chatarpal   considerations for grc
Muni chatarpal considerations for grc
 
Abir project
Abir projectAbir project
Abir project
 
Introduction to matlab lecture 4 of 4
Introduction to matlab lecture 4 of 4Introduction to matlab lecture 4 of 4
Introduction to matlab lecture 4 of 4
 
Matlab ploting
Matlab plotingMatlab ploting
Matlab ploting
 
Use of GIS in house hunting in Roorkee
Use of GIS in house hunting in RoorkeeUse of GIS in house hunting in Roorkee
Use of GIS in house hunting in Roorkee
 
Optimization techniques in pharmaceutical formulation and processing
Optimization techniques in pharmaceutical formulation and processingOptimization techniques in pharmaceutical formulation and processing
Optimization techniques in pharmaceutical formulation and processing
 
Minitab simplified by chetan dhal
Minitab simplified by chetan dhalMinitab simplified by chetan dhal
Minitab simplified by chetan dhal
 
Lecture 02 visualization and programming
Lecture 02   visualization and programmingLecture 02   visualization and programming
Lecture 02 visualization and programming
 
The Power of Surface Modelling
The Power of Surface ModellingThe Power of Surface Modelling
The Power of Surface Modelling
 
Optimization techniques in pharmaceutical processing
Optimization techniques in pharmaceutical processingOptimization techniques in pharmaceutical processing
Optimization techniques in pharmaceutical processing
 
Meet minitab tutorial
Meet minitab tutorialMeet minitab tutorial
Meet minitab tutorial
 
Vibration Mécanique
Vibration MécaniqueVibration Mécanique
Vibration Mécanique
 
Optimization techniques
Optimization techniques Optimization techniques
Optimization techniques
 
controlled drug delivery system classification
controlled drug delivery system classificationcontrolled drug delivery system classification
controlled drug delivery system classification
 
OPTIMIZATION IN PHARMACEUTICS,FORMULATION & PROCESSING
OPTIMIZATION IN PHARMACEUTICS,FORMULATION & PROCESSINGOPTIMIZATION IN PHARMACEUTICS,FORMULATION & PROCESSING
OPTIMIZATION IN PHARMACEUTICS,FORMULATION & PROCESSING
 

Similar to Data analysis04 morethantwovariables

105575916 maths-edit-new
105575916 maths-edit-new105575916 maths-edit-new
105575916 maths-edit-newhomeworkping7
 
Macromolecules
MacromoleculesMacromolecules
Macromoleculesganctil
 
A Critique Of Anscombe S Work On Statistical Analysis Using Graphs (2013 Home...
A Critique Of Anscombe S Work On Statistical Analysis Using Graphs (2013 Home...A Critique Of Anscombe S Work On Statistical Analysis Using Graphs (2013 Home...
A Critique Of Anscombe S Work On Statistical Analysis Using Graphs (2013 Home...Simar Neasy
 
Graphical Presentation of Data - Rangga Masyhuri Nuur LLU 27.pptx
Graphical Presentation of Data - Rangga Masyhuri Nuur LLU 27.pptxGraphical Presentation of Data - Rangga Masyhuri Nuur LLU 27.pptx
Graphical Presentation of Data - Rangga Masyhuri Nuur LLU 27.pptxRanggaMasyhuriNuur
 
Data Visualisation Design Workshop #UXbne
Data Visualisation Design Workshop #UXbneData Visualisation Design Workshop #UXbne
Data Visualisation Design Workshop #UXbneCam Taylor
 
SPSS statistics - get help using SPSS
SPSS statistics - get help using SPSSSPSS statistics - get help using SPSS
SPSS statistics - get help using SPSScsula its training
 
OLD SEVEN TOOLS OF QUALTIY MANAGEMENT
OLD SEVEN TOOLS OF QUALTIY MANAGEMENTOLD SEVEN TOOLS OF QUALTIY MANAGEMENT
OLD SEVEN TOOLS OF QUALTIY MANAGEMENTANNA UNIVERSITY
 
Design PatternsChristian Behrenshttpswww.behance.netgall.docx
Design PatternsChristian Behrenshttpswww.behance.netgall.docxDesign PatternsChristian Behrenshttpswww.behance.netgall.docx
Design PatternsChristian Behrenshttpswww.behance.netgall.docxcarolinef5
 
Displaying data using charts and graphs
Displaying data using charts and graphsDisplaying data using charts and graphs
Displaying data using charts and graphsCharles Flynt
 
The use of data visualization to tell effective
The use of data visualization to tell effectiveThe use of data visualization to tell effective
The use of data visualization to tell effectivegentlemoro
 
Data visualization data sources data types
Data visualization   data sources   data typesData visualization   data sources   data types
Data visualization data sources data typesManokamnaKochar1
 
Chapter 4 Problem 31. For problem three in chapter four, a teac.docx
Chapter 4 Problem 31. For problem three in chapter four,   a teac.docxChapter 4 Problem 31. For problem three in chapter four,   a teac.docx
Chapter 4 Problem 31. For problem three in chapter four, a teac.docxrobertad6
 

Similar to Data analysis04 morethantwovariables (20)

Data structure
Data   structureData   structure
Data structure
 
Data analysis02 twovariables
Data analysis02 twovariablesData analysis02 twovariables
Data analysis02 twovariables
 
105575916 maths-edit-new
105575916 maths-edit-new105575916 maths-edit-new
105575916 maths-edit-new
 
Bba 2001
Bba 2001Bba 2001
Bba 2001
 
Macromolecules
MacromoleculesMacromolecules
Macromolecules
 
Data analysis03 timeasa-variable
Data analysis03 timeasa-variableData analysis03 timeasa-variable
Data analysis03 timeasa-variable
 
A Critique Of Anscombe S Work On Statistical Analysis Using Graphs (2013 Home...
A Critique Of Anscombe S Work On Statistical Analysis Using Graphs (2013 Home...A Critique Of Anscombe S Work On Statistical Analysis Using Graphs (2013 Home...
A Critique Of Anscombe S Work On Statistical Analysis Using Graphs (2013 Home...
 
Graphical Presentation of Data - Rangga Masyhuri Nuur LLU 27.pptx
Graphical Presentation of Data - Rangga Masyhuri Nuur LLU 27.pptxGraphical Presentation of Data - Rangga Masyhuri Nuur LLU 27.pptx
Graphical Presentation of Data - Rangga Masyhuri Nuur LLU 27.pptx
 
A Tour through the Data Vizualization Zoo - Communications of the ACM
A Tour through the Data Vizualization Zoo - Communications of the ACMA Tour through the Data Vizualization Zoo - Communications of the ACM
A Tour through the Data Vizualization Zoo - Communications of the ACM
 
Data Visualisation Design Workshop #UXbne
Data Visualisation Design Workshop #UXbneData Visualisation Design Workshop #UXbne
Data Visualisation Design Workshop #UXbne
 
SPSS statistics - get help using SPSS
SPSS statistics - get help using SPSSSPSS statistics - get help using SPSS
SPSS statistics - get help using SPSS
 
Data analysis01 singlevariable
Data analysis01 singlevariableData analysis01 singlevariable
Data analysis01 singlevariable
 
Tqm old tools
Tqm old toolsTqm old tools
Tqm old tools
 
OLD SEVEN TOOLS OF QUALTIY MANAGEMENT
OLD SEVEN TOOLS OF QUALTIY MANAGEMENTOLD SEVEN TOOLS OF QUALTIY MANAGEMENT
OLD SEVEN TOOLS OF QUALTIY MANAGEMENT
 
Tqm old tools
Tqm old toolsTqm old tools
Tqm old tools
 
Design PatternsChristian Behrenshttpswww.behance.netgall.docx
Design PatternsChristian Behrenshttpswww.behance.netgall.docxDesign PatternsChristian Behrenshttpswww.behance.netgall.docx
Design PatternsChristian Behrenshttpswww.behance.netgall.docx
 
Displaying data using charts and graphs
Displaying data using charts and graphsDisplaying data using charts and graphs
Displaying data using charts and graphs
 
The use of data visualization to tell effective
The use of data visualization to tell effectiveThe use of data visualization to tell effective
The use of data visualization to tell effective
 
Data visualization data sources data types
Data visualization   data sources   data typesData visualization   data sources   data types
Data visualization data sources data types
 
Chapter 4 Problem 31. For problem three in chapter four, a teac.docx
Chapter 4 Problem 31. For problem three in chapter four,   a teac.docxChapter 4 Problem 31. For problem three in chapter four,   a teac.docx
Chapter 4 Problem 31. For problem three in chapter four, a teac.docx
 

More from Universidade de São Paulo

Introdução às ferramentas de Business Intelligence do ecossistema Hadoop
Introdução às ferramentas de Business Intelligence do ecossistema HadoopIntrodução às ferramentas de Business Intelligence do ecossistema Hadoop
Introdução às ferramentas de Business Intelligence do ecossistema HadoopUniversidade de São Paulo
 
On the Support of a Similarity-Enabled Relational Database Management System ...
On the Support of a Similarity-Enabled Relational Database Management System ...On the Support of a Similarity-Enabled Relational Database Management System ...
On the Support of a Similarity-Enabled Relational Database Management System ...Universidade de São Paulo
 
Effective and Unsupervised Fractal-based Feature Selection for Very Large Dat...
Effective and Unsupervised Fractal-based Feature Selection for Very Large Dat...Effective and Unsupervised Fractal-based Feature Selection for Very Large Dat...
Effective and Unsupervised Fractal-based Feature Selection for Very Large Dat...Universidade de São Paulo
 
Fire Detection on Unconstrained Videos Using Color-Aware Spatial Modeling and...
Fire Detection on Unconstrained Videos Using Color-Aware Spatial Modeling and...Fire Detection on Unconstrained Videos Using Color-Aware Spatial Modeling and...
Fire Detection on Unconstrained Videos Using Color-Aware Spatial Modeling and...Universidade de São Paulo
 
Unveiling smoke in social images with the SmokeBlock approach
Unveiling smoke in social images with the SmokeBlock approachUnveiling smoke in social images with the SmokeBlock approach
Unveiling smoke in social images with the SmokeBlock approachUniversidade de São Paulo
 
Vertex Centric Asynchronous Belief Propagation Algorithm for Large-Scale Graphs
Vertex Centric Asynchronous Belief Propagation Algorithm for Large-Scale GraphsVertex Centric Asynchronous Belief Propagation Algorithm for Large-Scale Graphs
Vertex Centric Asynchronous Belief Propagation Algorithm for Large-Scale GraphsUniversidade de São Paulo
 
Fast Billion-scale Graph Computation Using a Bimodal Block Processing Model
Fast Billion-scale Graph Computation Using a Bimodal Block Processing ModelFast Billion-scale Graph Computation Using a Bimodal Block Processing Model
Fast Billion-scale Graph Computation Using a Bimodal Block Processing ModelUniversidade de São Paulo
 
StructMatrix: large-scale visualization of graphs by means of structure detec...
StructMatrix: large-scale visualization of graphs by means of structure detec...StructMatrix: large-scale visualization of graphs by means of structure detec...
StructMatrix: large-scale visualization of graphs by means of structure detec...Universidade de São Paulo
 
Techniques for effective and efficient fire detection from social media images
Techniques for effective and efficient fire detection from social media imagesTechniques for effective and efficient fire detection from social media images
Techniques for effective and efficient fire detection from social media imagesUniversidade de São Paulo
 
Multimodal graph-based analysis over the DBLP repository: critical discoverie...
Multimodal graph-based analysis over the DBLP repository: critical discoverie...Multimodal graph-based analysis over the DBLP repository: critical discoverie...
Multimodal graph-based analysis over the DBLP repository: critical discoverie...Universidade de São Paulo
 
Supervised-Learning Link Recommendation in the DBLP co-authoring network
Supervised-Learning Link Recommendation in the DBLP co-authoring networkSupervised-Learning Link Recommendation in the DBLP co-authoring network
Supervised-Learning Link Recommendation in the DBLP co-authoring networkUniversidade de São Paulo
 
Reviewing Data Visualization: an Analytical Taxonomical Study
Reviewing Data Visualization: an Analytical Taxonomical StudyReviewing Data Visualization: an Analytical Taxonomical Study
Reviewing Data Visualization: an Analytical Taxonomical StudyUniversidade de São Paulo
 
Complexidade de Algoritmos, Notação assintótica, Algoritmos polinomiais e in...
Complexidade de Algoritmos, Notação assintótica, Algoritmos polinomiais e in...Complexidade de Algoritmos, Notação assintótica, Algoritmos polinomiais e in...
Complexidade de Algoritmos, Notação assintótica, Algoritmos polinomiais e in...Universidade de São Paulo
 
Visualization tree multiple linked analytical decisions
Visualization tree multiple linked analytical decisionsVisualization tree multiple linked analytical decisions
Visualization tree multiple linked analytical decisionsUniversidade de São Paulo
 

More from Universidade de São Paulo (20)

A gentle introduction to Deep Learning
A gentle introduction to Deep LearningA gentle introduction to Deep Learning
A gentle introduction to Deep Learning
 
Computação: carreira e mercado de trabalho
Computação: carreira e mercado de trabalhoComputação: carreira e mercado de trabalho
Computação: carreira e mercado de trabalho
 
Introdução às ferramentas de Business Intelligence do ecossistema Hadoop
Introdução às ferramentas de Business Intelligence do ecossistema HadoopIntrodução às ferramentas de Business Intelligence do ecossistema Hadoop
Introdução às ferramentas de Business Intelligence do ecossistema Hadoop
 
On the Support of a Similarity-Enabled Relational Database Management System ...
On the Support of a Similarity-Enabled Relational Database Management System ...On the Support of a Similarity-Enabled Relational Database Management System ...
On the Support of a Similarity-Enabled Relational Database Management System ...
 
Effective and Unsupervised Fractal-based Feature Selection for Very Large Dat...
Effective and Unsupervised Fractal-based Feature Selection for Very Large Dat...Effective and Unsupervised Fractal-based Feature Selection for Very Large Dat...
Effective and Unsupervised Fractal-based Feature Selection for Very Large Dat...
 
Fire Detection on Unconstrained Videos Using Color-Aware Spatial Modeling and...
Fire Detection on Unconstrained Videos Using Color-Aware Spatial Modeling and...Fire Detection on Unconstrained Videos Using Color-Aware Spatial Modeling and...
Fire Detection on Unconstrained Videos Using Color-Aware Spatial Modeling and...
 
Unveiling smoke in social images with the SmokeBlock approach
Unveiling smoke in social images with the SmokeBlock approachUnveiling smoke in social images with the SmokeBlock approach
Unveiling smoke in social images with the SmokeBlock approach
 
Vertex Centric Asynchronous Belief Propagation Algorithm for Large-Scale Graphs
Vertex Centric Asynchronous Belief Propagation Algorithm for Large-Scale GraphsVertex Centric Asynchronous Belief Propagation Algorithm for Large-Scale Graphs
Vertex Centric Asynchronous Belief Propagation Algorithm for Large-Scale Graphs
 
Fast Billion-scale Graph Computation Using a Bimodal Block Processing Model
Fast Billion-scale Graph Computation Using a Bimodal Block Processing ModelFast Billion-scale Graph Computation Using a Bimodal Block Processing Model
Fast Billion-scale Graph Computation Using a Bimodal Block Processing Model
 
An introduction to MongoDB
An introduction to MongoDBAn introduction to MongoDB
An introduction to MongoDB
 
StructMatrix: large-scale visualization of graphs by means of structure detec...
StructMatrix: large-scale visualization of graphs by means of structure detec...StructMatrix: large-scale visualization of graphs by means of structure detec...
StructMatrix: large-scale visualization of graphs by means of structure detec...
 
Apresentacao vldb
Apresentacao vldbApresentacao vldb
Apresentacao vldb
 
Techniques for effective and efficient fire detection from social media images
Techniques for effective and efficient fire detection from social media imagesTechniques for effective and efficient fire detection from social media images
Techniques for effective and efficient fire detection from social media images
 
Multimodal graph-based analysis over the DBLP repository: critical discoverie...
Multimodal graph-based analysis over the DBLP repository: critical discoverie...Multimodal graph-based analysis over the DBLP repository: critical discoverie...
Multimodal graph-based analysis over the DBLP repository: critical discoverie...
 
Supervised-Learning Link Recommendation in the DBLP co-authoring network
Supervised-Learning Link Recommendation in the DBLP co-authoring networkSupervised-Learning Link Recommendation in the DBLP co-authoring network
Supervised-Learning Link Recommendation in the DBLP co-authoring network
 
Graph-based Relational Data Visualization
Graph-based RelationalData VisualizationGraph-based RelationalData Visualization
Graph-based Relational Data Visualization
 
Reviewing Data Visualization: an Analytical Taxonomical Study
Reviewing Data Visualization: an Analytical Taxonomical StudyReviewing Data Visualization: an Analytical Taxonomical Study
Reviewing Data Visualization: an Analytical Taxonomical Study
 
Complexidade de Algoritmos, Notação assintótica, Algoritmos polinomiais e in...
Complexidade de Algoritmos, Notação assintótica, Algoritmos polinomiais e in...Complexidade de Algoritmos, Notação assintótica, Algoritmos polinomiais e in...
Complexidade de Algoritmos, Notação assintótica, Algoritmos polinomiais e in...
 
Dawarehouse e OLAP
Dawarehouse e OLAPDawarehouse e OLAP
Dawarehouse e OLAP
 
Visualization tree multiple linked analytical decisions
Visualization tree multiple linked analytical decisionsVisualization tree multiple linked analytical decisions
Visualization tree multiple linked analytical decisions
 

Recently uploaded

MARGINALIZATION (Different learners in Marginalized Group
MARGINALIZATION (Different learners in Marginalized GroupMARGINALIZATION (Different learners in Marginalized Group
MARGINALIZATION (Different learners in Marginalized GroupJonathanParaisoCruz
 
Crayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon ACrayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon AUnboundStockton
 
Meghan Sutherland In Media Res Media Component
Meghan Sutherland In Media Res Media ComponentMeghan Sutherland In Media Res Media Component
Meghan Sutherland In Media Res Media ComponentInMediaRes1
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptxVS Mahajan Coaching Centre
 
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...Marc Dusseiller Dusjagr
 
Types of Journalistic Writing Grade 8.pptx
Types of Journalistic Writing Grade 8.pptxTypes of Journalistic Writing Grade 8.pptx
Types of Journalistic Writing Grade 8.pptxEyham Joco
 
Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Jisc
 
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...JhezDiaz1
 
DATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginnersDATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginnersSabitha Banu
 
Blooming Together_ Growing a Community Garden Worksheet.docx
Blooming Together_ Growing a Community Garden Worksheet.docxBlooming Together_ Growing a Community Garden Worksheet.docx
Blooming Together_ Growing a Community Garden Worksheet.docxUnboundStockton
 
Gas measurement O2,Co2,& ph) 04/2024.pptx
Gas measurement O2,Co2,& ph) 04/2024.pptxGas measurement O2,Co2,& ph) 04/2024.pptx
Gas measurement O2,Co2,& ph) 04/2024.pptxDr.Ibrahim Hassaan
 
CELL CYCLE Division Science 8 quarter IV.pptx
CELL CYCLE Division Science 8 quarter IV.pptxCELL CYCLE Division Science 8 quarter IV.pptx
CELL CYCLE Division Science 8 quarter IV.pptxJiesonDelaCerna
 
Solving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxSolving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxOH TEIK BIN
 
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdf
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdfFraming an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdf
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdfUjwalaBharambe
 
Final demo Grade 9 for demo Plan dessert.pptx
Final demo Grade 9 for demo Plan dessert.pptxFinal demo Grade 9 for demo Plan dessert.pptx
Final demo Grade 9 for demo Plan dessert.pptxAvyJaneVismanos
 

Recently uploaded (20)

MARGINALIZATION (Different learners in Marginalized Group
MARGINALIZATION (Different learners in Marginalized GroupMARGINALIZATION (Different learners in Marginalized Group
MARGINALIZATION (Different learners in Marginalized Group
 
Crayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon ACrayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon A
 
Meghan Sutherland In Media Res Media Component
Meghan Sutherland In Media Res Media ComponentMeghan Sutherland In Media Res Media Component
Meghan Sutherland In Media Res Media Component
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
 
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
 
Types of Journalistic Writing Grade 8.pptx
Types of Journalistic Writing Grade 8.pptxTypes of Journalistic Writing Grade 8.pptx
Types of Journalistic Writing Grade 8.pptx
 
Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...
 
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
 
DATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginnersDATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginners
 
Blooming Together_ Growing a Community Garden Worksheet.docx
Blooming Together_ Growing a Community Garden Worksheet.docxBlooming Together_ Growing a Community Garden Worksheet.docx
Blooming Together_ Growing a Community Garden Worksheet.docx
 
Gas measurement O2,Co2,& ph) 04/2024.pptx
Gas measurement O2,Co2,& ph) 04/2024.pptxGas measurement O2,Co2,& ph) 04/2024.pptx
Gas measurement O2,Co2,& ph) 04/2024.pptx
 
Model Call Girl in Bikash Puri Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Bikash Puri  Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Bikash Puri  Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Bikash Puri Delhi reach out to us at 🔝9953056974🔝
 
CELL CYCLE Division Science 8 quarter IV.pptx
CELL CYCLE Division Science 8 quarter IV.pptxCELL CYCLE Division Science 8 quarter IV.pptx
CELL CYCLE Division Science 8 quarter IV.pptx
 
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdfTataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
 
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
 
Solving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxSolving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptx
 
OS-operating systems- ch04 (Threads) ...
OS-operating systems- ch04 (Threads) ...OS-operating systems- ch04 (Threads) ...
OS-operating systems- ch04 (Threads) ...
 
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdf
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdfFraming an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdf
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdf
 
9953330565 Low Rate Call Girls In Rohini Delhi NCR
9953330565 Low Rate Call Girls In Rohini  Delhi NCR9953330565 Low Rate Call Girls In Rohini  Delhi NCR
9953330565 Low Rate Call Girls In Rohini Delhi NCR
 
Final demo Grade 9 for demo Plan dessert.pptx
Final demo Grade 9 for demo Plan dessert.pptxFinal demo Grade 9 for demo Plan dessert.pptx
Final demo Grade 9 for demo Plan dessert.pptx
 

Data analysis04 morethantwovariables

  • 1. http://publicationslist.org/junio Data Analysis More Than Two Variables: Graphical Multivariate Analysis Prof. Dr. Jose Fernando Rodrigues Junior ICMC-USP
  • 2. http://publicationslist.org/junio What is it about? More than two variables determine a tough analytical problem Although, there are multivariate graphical techniques, they can not deal too many variables (less than 15 – 25)
  • 3. http://publicationslist.org/junio What is it about? More than two variables determine a tough analytical problem Although, there are multivariate graphical techniques, they can not deal too many variables (less than 15 – 25)
  • 4. http://publicationslist.org/junio Three variables For example, consider the data defined by function: = , = 2 + − 2 + 4 that corresponds to the three-variable setting y, x, and a One way to analyze this is by means of a surface plot:
  • 5. http://publicationslist.org/junio Three variables However, the surface plot does not permit an intuitive reading of quantitative information Another way is to use a two-dimensional xy plot with multiple curves, one for each value of interest of one of the variables; in the example, variable a is considered for values 2, 1, 0, -1, -2
  • 6. http://publicationslist.org/junio Three variables Surface plots and multiple-curve xy plots can be in combination, one providing an aesthetically appealing overview, the other providing fine details for values of interest It is interesting to note that surface plots go against the commonsense that 3D plots should be more informative than 2D plots Yet another possibility is to project the function into the base plane bellow the surface, using either:  contour plots  false-color plots
  • 7. http://publicationslist.org/junio Three variables The false-color plot is achieved by mapping all values of the dependent variable y following a palette of colors The false-color plot for function f(x, a)
  • 8. http://publicationslist.org/junio Three variables The false-color plot is achieved by mapping all values of the dependent variable y following a palette of colors The false-color plot for function f(x, a) False-color plots are very effective for presenting quantitative information It is important to note, however, that its efficiency depends heavily on the color mapping, which must be intuitive according to the task at hand For an overview of color-mapping guidelines, see the textbook on page 104
  • 9. http://publicationslist.org/junio More There are basically two ways to get more information on a plot  Put similar graphs next to each other and vary the variables in a systematic fashion from one subgraph to the next  multiplots  Make the graph elements themselves richer with color, shape, and interaction Multiplots  The most common forms of multiplots are the scatter-plot matrix, and the co-plot,
  • 10. http://publicationslist.org/junio Scatter-plot matrix The scatter-plot matrix is constructed considering all the possible two-variable combinations achieved from the set of variables For each combination, a sub-region of the space is reserved and all the combinations are put together according to a straight layout The more variables, the bigger must be the screen, limits start to manifest around 10 variables, the same for the number of data points, limited around 100 For example, consider a 250 wines data set consisting of seven different properties: acidity, sugar, chlorides, sulfur dioxide, density, alcohol, and quality The data can be found in the “Wine Quality” data set, available at the UCI Machine Learning repository - http://archive.ics.uci.edu/ml/.
  • 12. http://publicationslist.org/junio Scatter-plot matrix The scatter plot shows: • sugar content and density are positively correlated • as the alcohol content goes up, density goes down, inverse correlation • wine quality seems to increase with increasing alcohol content: apparently, more potent wines are considered to be better
  • 13. http://publicationslist.org/junio Co-plots Co-plots work by partitioning the data according to one of the variables and plotting each partition in a different plot In this example, the upper figure shows how one of the variables was used to partition the data  It is possible to notice that the intervals overlap and have different sizes so that each plot has the same number of points
  • 14. http://publicationslist.org/junio Co-plots Co-plots work by partitioning the data according to one of the variables and plotting each partition in a different plot In this example, the upper figure shows how one of the variables was used to partition the data  It is possible to notice that the intervals overlap and have different sizes so that each plot has the same number of points The partitioning can be determined in many different ways according to the goals of the analysis
  • 15. http://publicationslist.org/junio Composition Another way to visualize more than two variables is to compose multiple plots according to some of the variables For example: imagine a company that makes five products labeled A, B, C, D, and E, and two questions:  how many items of each kind are produced overall  how the item mix is changing over time A simple solution, but not quite effective is to plot (days x quantity) the different curves all together
  • 16. http://publicationslist.org/junio Composition Another solution is to use the same plot but stacking the information, so as to have a notion of total  In absolute numbers (left), or  In relative contributions (percentage)
  • 17. http://publicationslist.org/junio Composition Another solution is to use the same plot but stacking the information, so as to have a notion of total  In absolute numbers (left), or  In relative contributions (percentage) Surprisingly, the total number of items does not change appreciably over the long run Here, one can see the contribution of each product, an information that was lost in the left plot
  • 18. http://publicationslist.org/junio Composition Another solution is to use the same plot but stacking the information, so as to have a notion of total  In absolute numbers (left), or  In relative contributions (percentage) Surprisingly, the total number of items does not change appreciably over the long run Here, one can see the contribution of each product, an information that was lost in the left plot It is desirable to use multiple kinds of plots combined in order to verify the many aspects of data
  • 19. http://publicationslist.org/junio Parallel Coordinates In a parallel coordinate plot, the coordinate axes are parallel to each other For every data point, its value for each of the variables is marked on the corresponding axis, and then all these points are connected with lines
  • 20. http://publicationslist.org/junio Information Visualization  Many other techniques are presented according the findings of the field known as InformationVisualization:  Glyphs  Chernoff Faces  Tree-maps  Star coordinates  Table Lens  Multidimensional Projection  And many others, all improved by means of interaction techniques:  Querying and zooming  Linking and Brushing  Combined projections, and so forth
  • 21. http://publicationslist.org/junio References  Philipp K. Janert, Data Analysis with Open Source Tools, O’Reilly, 2010.  Wikipedia, http://en.wikipedia.org  Wolfram MathWorld, http://mathworld.wolfram.com/