Visualization and Data Mapping

Daekwon Kim
propellerheaven@gmail.com
King Sejong
William Playfair
Score Data
Class

Mathematics

Korean

English

Science

Score

80

90

60

50
Source Code of Bar Plot
barplot(
c(80,90,60,70),
names.arg=c(Mathematics,Korean,English,Science),
col=rainbow(4),
ylim=c(0,100)
)
Graph
Data mapping
데이터(변수)를눈에보이는요소에연결시키는일
위표에서우리가가진변수
과목(Class):Nominal
점수(Score):Ratio
Variables = Aesthetics
Class=barorbar.name
Score=bar.height
Graph Again
Where is the color?
Graphliteracy
Variables = Aesthetics
Data.Class=bar.name
Data.Class=bar.color(Duplication!)
Data.Score=bar.height
barplot(
#Date.Score=bar.height
c(80,90,60,70),
#Data.Class=bar.name
names.arg=c(Mathematics,Korean,English,Science),
#Data.Class=bar.color
col=rainbow(4),
#Y-axisrange
ylim=c(0,100)
)
Barplots
Useless Colors
Remove color
barplot(
#Date.Score=bar.height
c(80,90,60,70),
#Data.Class=bar.name
names.arg=c(Mathematics,Korean,English,Science),
#Y-axisrange
ylim=c(0,100)
)
Graph
Libreoffice Calc
Another Data
Class

Korean

English

Physics

Biology

Chemisry

Category

Language

Language

Science

Science

Science

Score

90

85

70

75

60
Data Mapping Again
Data.Class=bar.name
Data.Category=bar.color
Data.Score=bar.height
barplot(
#Date.Score=bar.height
c(90,85,70,75,60),
#Data.Class=bar.name
names.arg=c(Korean,English,Physics,Biology,Chemistry),
#Data.Category=Color
col=c(gold,gold,azure,azure,azure)
#Y-axisrange
ylim=c(0,100)
)
Graph
ggplot2
HadleyWickham
그래프를그리기위한R패키지
R에서가장많이쓰이는패키지
#ggplot2패키지설치
install.packages('ggplot2')
#ggplot2패키지로드
library(ggplot2)
The grammar of graphics


LelandWilkinson
ggplot2는이책의이론에기반
ggplot2gg는Thegrammarofgraphics의줄임말
Data
data_class=
c(Korean,English,Physics,Biology,Chemistry)
data_score=
c(90,85,70,75,60)
data_category=
c(language,language,science,science,science)
score_data=data.frame(
class=data_class,
category=data_category,
score=data_score
)
#평가
classcategoryscore
1Koreanlanguage90
2Englishlanguage85
3Physicsscience70
4Biologyscience75
5Chemistryscience60
Source Code
#그래프시작,데이터지정
bargraph=ggplot(score_data)+
#그래프종류지정
geom_bar()+
#데이터와시각적요소의맵핑1
#Data.Class=bar.name
aes(x=class)+
#데이터와시각적요소의맵핑2
#Data.Score=bar.height
aes(y=score)+
#Y축범위지정
ylim(0,100)
Graph
Mapping Category Data To Color
#그래프시작,데이터지정
bargraph=ggplot(score_data)+
#그래프종류지정
geom_bar()+
#데이터와시각적요소의맵핑1
#Data.Class=bar.name
aes(x=factor(class))+
#데이터와시각적요소의맵핑2
#Data.Score=bar.height
aes(y=score)+
#데이터와시각적요소의맵핑3
#Data.Category=bar.color
aes(fill=category)+
#Y축범위지정
ylim(0,100)
Graph
Coordination
#그래프시작,데이터지정
bargraph=ggplot(score_data)+
#그래프종류지정
geom_bar()+
#데이터와시각적요소의맵핑1
#Data.Class=bar.name
aes(x=class)+
#데이터와시각적요소의맵핑2
#Data.Score=bar.height
aes(y=score)+
#데이터와시각적요소의맵핑3
#Data.Category=bar.color
aes(fill=category)+
#좌표계지정
coord_flip()+
#Y축범위지정
ylim(0,100)
Graph
Creating Graph Process
Creating Graph Process
Variables
Algebra
Scales
Statistics
Geometry
Coordinates
Aesthetics
Creating Graph Process
Process
Variables
Algebra
Scales
Statistics
Geometry
Coordinates
Aesthetics
Who
Science
Programmer
Designer
Mathematical Graph
Twovariables
3 Dimension
Threevariables
Gapminder
Prof.HansRosling
What Dimension?
XYspaceisnot2dementioninvisualization
X
Y
Color
Size
Interaction
Year
Nation
…
Charles Joseph Minard
WhatDimension?
Lapisan
MangakaMap
2008
Putne
2013
Moplo
2013
Thank you very much!

Visualization and data mapping