SlideShare a Scribd company logo
The use of Data visualization to
tell effective stories
By:
Moro Seidu (2018)
Email: gentlemoro@iwatchafrica.org
Outline of Presentation
 PART ONE: THEORY
 Data Presentation & Visualization
 Data
 Forms of Data Presentation & Visualization
 Text form;
 tabular form;
 graphical form.
 Summary
 PART TWO: PRACTICAL
 Text Mining
 Sentimental Analysis
 Wordcloud Analysis
Data
 Data usually represents unprocessed numbers,
text, pictures or statements
 Data are usually collected in a raw format and
thus the inherent information is difficult to
understand
 Therefore, raw data need to be summarized,
processed, and analysed.
 Data set needs to be presented in a certain way
depending on what it is used for.
Data…
 Planning how the data will be presented is essential
before appropriately processing raw data.
 However, no matter how well the information derived
from the raw data is manipulated, it should be presented
in an effective format, otherwise, it would be a great loss
for both authors and readers.
 These days, data are often summarized, organized, and
analysed with statistical packages or graphics software.
 Data must be prepared in such a way that they are
properly recognized by the program being used.
Data…
 This presentation does not discuss data preparation
process, which involves creating a data frame,
creating/changing rows and columns, changing the level of
a factor, categorical variable, coding, dummy variables,
variable transformation, data transformation, missing value,
outlier treatment, and noise removal.
Data Presentation/Visualisation
 Methods of presentation must be determined according
to the
 data format,
 the method of analysis to be used, and
 the information to be emphasized.
 Even when the same information is being conveyed,
different methods of presentation must be employed
depending on what specific information is going to be
emphasized.
Data Presentation/Visualisation…
 A method of presentation must be chosen after carefully
weighing the advantages and disadvantages of different
methods of presentation.
 This conference shall focus on three different methods
of presentation:
Text form
tabular form
graphical form
 One thing to always bear in mind regardless of what
method is used, however, is the simplicity of
presentation.
Text presentation
 Text is the main method of conveying information as it
is used to explain results and trends, and provide
contextual information.
 Data are fundamentally presented in paragraphs or
sentences.
 Text can be used to provide interpretation or emphasize
certain data.
 If quantitative information to be conveyed consists of
one or two numbers, it is more appropriate to use text
(written language) than tables or graphs.
Text presentation…
 For instance, information about the occurrence of
accident in 2016–2017 can be presented with the use of
a few numbers:
 “The occurrence of accident was 11% in 2016 and 15%
in 2017; no significant difference of occurrence was
found between the two years.”
 If this information were to be presented in a graph or a
table, it would occupy an unnecessarily large space on
the page, without enhancing the readers'
understanding of the data.
Text presentation…
 If more data are to be presented, or other
information such as that regarding data trends
are to be conveyed, a table or a graph would be
more appropriate.
 By nature, data take longer to read when
presented as texts and when the main text
includes a long list of information, readers and
reviewers may have difficulties in understanding
the information.
Table presentation
 Tables are the most appropriate for presenting
individual information, and can present both
quantitative and qualitative information.
Advantages of using tables
 Tables can accurately present information that cannot
be presented with a graph. A number such as
“132.145852” can be accurately expressed in a table.
 Information with different units can be presented
together on a table. For instance, atmospheric
temperature, speed of a vehicle, number of drivers,
travelling distance and time of travel can be presented
together in one table.
 tables are useful for summarizing and comparing
quantitative information of different variables.
Table presentation…
Disadvantages of using tables
 the interpretation of information takes longer in tables than in
graphs,
 tables are not appropriate for studying data trends.
 since all data are of equal importance in a table, it is not easy to
identify and selectively choose the information required.
Table presentation…
 Heat maps help to improve visualization of information on tables.
 Heat maps help to further visualize the information presented in a table
by applying colours to the background of cells.
 By adjusting the colours or colour saturation, information is conveyed in
a more visible manner, and readers can quickly identify the information of
interest.
 Software such as Excel (in Microsoft Office) have features that enable
easy creation of heat maps through the options available on the
“conditional formatting” menu.
Graph presentation
 Whereas tables can be used for presenting all the
information, graphs simplify complex information by
using images and emphasizing data patterns or trends,
and are useful for summarizing, explaining, or exploring
quantitative data.
 While graphs are effective for presenting large amounts
of data, they can be used in place of tables to present
small sets of data.
 A graph format that best presents information must be
chosen so that readers and reviewers can easily
understand the information.
Graph presentation…
Frequently used graph formats:
Scatter plot
Bar graph and histogram
Pie chart
Line plot
Box and whisker chart
Scatter plot
 Scatter plots present data on the x- and y-axes and are
used to investigate an association (correlation)
between two variables.
 A point represents each individual or object, and an
association between two variables can be studied by
analysing patterns across multiple points.
 A regression line is added to a graph to determine
whether the association between two variables can be
explained or not.
 If multiple points exist at an identical location, the
correlation level may not be clear.
 In this case, a correlation coefficient or regression line
can be added to further elucidate the correlation.
Bar graph and histogram
 A bar graph is used to indicate and compare values
in a discrete category or group, and the frequency or
other measurement parameters (i.e. mean).
 Depending on the number of categories, and the
size or complexity of each category, bars may be
created vertically or horizontally.
 The height (or length) of a bar represents the
amount of information in a category.
 Bar graphs are flexible, and can be used in a
grouped or subdivided bar format in cases of two or
more data sets in each category.
Bar graph and histogram…
 By comparing the endpoints of bars, one
can identify the largest and the smallest
categories, and understand gradual
differences between each category.
 One form of vertical bar graph is the
stacked vertical bar graph. A stack vertical
bar graph is used to compare the sum of
each category, and analyse parts of a
category.
Pie chart
 A pie chart, which is used to represent nominal data
(in other words, data classified in different
categories), visually represents a distribution of
categories.
 It is generally the most appropriate format for
representing information grouped into a small
number of categories.
 It is also used for data that have no other way of
being represented aside from a table (i.e. frequency
table).
Line plot
 A line plot is useful for representing time-series data such
as monthly electricity bills and yearly unemployment rates;
in other words, it is used to study variables that are
observed over time.
 Line graphs are especially useful for studying patterns and
trends across data that include climatic influence, large
changes or turning points, and
 are also appropriate for representing not only time-series
data, but also data measured over the progression of a
continuous variable such as distance.
 It is also useful to represent multiple data sets on a single
line graph to compare and analyse patterns across
different data sets.
Box and whisker chart
 A box and whisker chart does not make any assumptions
about the underlying statistical distribution, and
represents variations in samples of a population;
therefore, it is appropriate for representing
nonparametric data.
 A box and whisker chart consists of boxes that represent
interquartile range (one to three), the median and the
mean of the data, and whiskers presented as lines
outside of the boxes.
 Whiskers can be used to present the largest and smallest
values in a set of data or only a part of the data.
 The spacing at both ends of the box indicates dispersion
in the data. The relative location of the median
demonstrated within the box indicates skewness
Summary
 Text, tables, and graphs are effective communication media that present and convey data
and information.
 They aid readers in understanding the content of research, sustain their interest, and
effectively present large quantities of complex information.
 As readers will scan through these presentations before reading the entire text, their
importance cannot be disregarded.
 For this reason, authors or publishers must pay as close attention to selecting appropriate
methods of data presentation as when they were collecting data of good quality and
analysing them.
 Understanding of different methods of data presentation and their appropriate use will
enable one to develop the ability to recognize and interpret inappropriately presented
data or data presented in such a way that it deceives readers' eyes
PART II ( PRACTICALS)
Materials:
 R, RStudio,
 Library (tm, tidytext, janeaustenr, wordcloud, dplyr, stringr, gplot, SnowballC),
Exercise
 Text mining
 Sentimental Analysis
 Wordcloud analysis

More Related Content

What's hot

Data Visualization.pptx
Data Visualization.pptxData Visualization.pptx
Data Visualization.pptx
Ultimate Multimedia Consult
 
Data Visualization in Data Science
Data Visualization in Data ScienceData Visualization in Data Science
Data Visualization in Data Science
Maloy Manna, PMP®
 
Bias and variance trade off
Bias and variance trade offBias and variance trade off
Bias and variance trade off
VARUN KUMAR
 
CART – Classification & Regression Trees
CART – Classification & Regression TreesCART – Classification & Regression Trees
CART – Classification & Regression Trees
Hemant Chetwani
 
Introduction to Data Analytics
Introduction to Data AnalyticsIntroduction to Data Analytics
Introduction to Data Analytics
Dr. C.V. Suresh Babu
 
Data preparation
Data preparationData preparation
Data preparation
Tony Nguyen
 
Exploring Data
Exploring DataExploring Data
Exploring Data
Datamining Tools
 
Exploratory data analysis
Exploratory data analysis Exploratory data analysis
Exploratory data analysis
Peter Reimann
 
Data Visualization
Data VisualizationData Visualization
Data Visualization
Marco Torchiano
 
Cluster analysis
Cluster analysisCluster analysis
Cluster analysis
saba khan
 
Data Visualisation.pdf
Data Visualisation.pdfData Visualisation.pdf
Data Visualisation.pdf
Thiyagu K
 
Data and its Types
Data and its TypesData and its Types
Data and its Types
RajaKrishnan M
 
3 Data Mining Tasks
3  Data Mining Tasks3  Data Mining Tasks
3 Data Mining Tasks
Mahmoud Alfarra
 
Brief introduction to data visualization
Brief introduction to data visualizationBrief introduction to data visualization
Brief introduction to data visualization
Zach Gemignani
 
Data science applications and usecases
Data science applications and usecasesData science applications and usecases
Data science applications and usecases
Sreenatha Reddy K R
 
Exploratory data analysis
Exploratory data analysisExploratory data analysis
Exploratory data analysis
Vishwas N
 
Data visualization introduction
Data visualization introductionData visualization introduction
Data visualization introduction
ManokamnaKochar1
 
Data Mining: Data Preprocessing
Data Mining: Data PreprocessingData Mining: Data Preprocessing
Data Mining: Data Preprocessing
Lakshmi Sarvani Videla
 
Data Visualization in Exploratory Data Analysis
Data Visualization in Exploratory Data AnalysisData Visualization in Exploratory Data Analysis
Data Visualization in Exploratory Data Analysis
Eva Durall
 

What's hot (20)

Data Visualization.pptx
Data Visualization.pptxData Visualization.pptx
Data Visualization.pptx
 
Data Visualization in Data Science
Data Visualization in Data ScienceData Visualization in Data Science
Data Visualization in Data Science
 
Bias and variance trade off
Bias and variance trade offBias and variance trade off
Bias and variance trade off
 
CART – Classification & Regression Trees
CART – Classification & Regression TreesCART – Classification & Regression Trees
CART – Classification & Regression Trees
 
Introduction to Data Analytics
Introduction to Data AnalyticsIntroduction to Data Analytics
Introduction to Data Analytics
 
Data preparation
Data preparationData preparation
Data preparation
 
Exploring Data
Exploring DataExploring Data
Exploring Data
 
Exploratory data analysis
Exploratory data analysis Exploratory data analysis
Exploratory data analysis
 
Data Visualization
Data VisualizationData Visualization
Data Visualization
 
Cluster analysis
Cluster analysisCluster analysis
Cluster analysis
 
Data Visualisation.pdf
Data Visualisation.pdfData Visualisation.pdf
Data Visualisation.pdf
 
Data and its Types
Data and its TypesData and its Types
Data and its Types
 
3 Data Mining Tasks
3  Data Mining Tasks3  Data Mining Tasks
3 Data Mining Tasks
 
Brief introduction to data visualization
Brief introduction to data visualizationBrief introduction to data visualization
Brief introduction to data visualization
 
Data science applications and usecases
Data science applications and usecasesData science applications and usecases
Data science applications and usecases
 
3. mining frequent patterns
3. mining frequent patterns3. mining frequent patterns
3. mining frequent patterns
 
Exploratory data analysis
Exploratory data analysisExploratory data analysis
Exploratory data analysis
 
Data visualization introduction
Data visualization introductionData visualization introduction
Data visualization introduction
 
Data Mining: Data Preprocessing
Data Mining: Data PreprocessingData Mining: Data Preprocessing
Data Mining: Data Preprocessing
 
Data Visualization in Exploratory Data Analysis
Data Visualization in Exploratory Data AnalysisData Visualization in Exploratory Data Analysis
Data Visualization in Exploratory Data Analysis
 

Similar to The use of data visualization to tell effective

Diagramatic and graphical representation of data Notes on Statistics.ppt
Diagramatic and graphical representation of data Notes on Statistics.pptDiagramatic and graphical representation of data Notes on Statistics.ppt
Diagramatic and graphical representation of data Notes on Statistics.ppt
aigil2
 
Use-of-ICT-in-presentation.pdf
Use-of-ICT-in-presentation.pdfUse-of-ICT-in-presentation.pdf
Use-of-ICT-in-presentation.pdf
Pratibha Sagar
 
Diowane2003
Diowane2003Diowane2003
Diowane2003SFYC
 
Graphicalrepresntationofdatausingstatisticaltools2019_210902_105156.pdf
Graphicalrepresntationofdatausingstatisticaltools2019_210902_105156.pdfGraphicalrepresntationofdatausingstatisticaltools2019_210902_105156.pdf
Graphicalrepresntationofdatausingstatisticaltools2019_210902_105156.pdf
Himakshi7
 
Data-Presentation-and-Interpretation-in-Tabular-Graphical.pptx
Data-Presentation-and-Interpretation-in-Tabular-Graphical.pptxData-Presentation-and-Interpretation-in-Tabular-Graphical.pptx
Data-Presentation-and-Interpretation-in-Tabular-Graphical.pptx
shyrahipolitoe
 
Unit 2_ Descriptive Analytics for MBA .pptx
Unit 2_ Descriptive Analytics for MBA .pptxUnit 2_ Descriptive Analytics for MBA .pptx
Unit 2_ Descriptive Analytics for MBA .pptx
JANNU VINAY
 
Artificial Intelligence - Data Analysis, Creative & Critical Thinking and AI...
Artificial Intelligence - Data Analysis, Creative & Critical Thinking and  AI...Artificial Intelligence - Data Analysis, Creative & Critical Thinking and  AI...
Artificial Intelligence - Data Analysis, Creative & Critical Thinking and AI...
deboshreechatterjee2
 
Bigdata
BigdataBigdata
Uses of graph
Uses of graphUses of graph
Uses of graph
NimraKhan91
 
The uses of Tables & graphs
The uses of Tables & graphsThe uses of Tables & graphs
The uses of Tables & graphs
Franco Jesús
 
DATA PRESENTATION METHODS - 1.pptx
DATA PRESENTATION METHODS - 1.pptxDATA PRESENTATION METHODS - 1.pptx
DATA PRESENTATION METHODS - 1.pptx
vineetarun1
 
Top 8 Different Types Of Charts In Statistics And Their Uses
Top 8 Different Types Of Charts In Statistics And Their UsesTop 8 Different Types Of Charts In Statistics And Their Uses
Top 8 Different Types Of Charts In Statistics And Their Uses
Stat Analytica
 
Use of Maps and Charts in Research
Use of Maps and Charts in ResearchUse of Maps and Charts in Research
Use of Maps and Charts in Research
Shashikant Kumar, Ph.D
 
Numerical & graphical presentation of data
Numerical & graphical presentation of dataNumerical & graphical presentation of data
Numerical & graphical presentation of data
Sarfraz Ahmad
 
Business Analytics 1 Module 3.pdf
Business Analytics 1 Module 3.pdfBusiness Analytics 1 Module 3.pdf
Business Analytics 1 Module 3.pdf
Jayanti Pande
 
Statistics Class 10 CBSE
Statistics Class 10 CBSE Statistics Class 10 CBSE
Statistics Class 10 CBSE
Smitha Sumod
 
Bba 2001
Bba 2001Bba 2001
Bba 2001
Monie Joey
 
DATA-PROCESSING.pptx
DATA-PROCESSING.pptxDATA-PROCESSING.pptx
DATA-PROCESSING.pptx
MeetHooda
 
Extension presentation
Extension presentationExtension presentation
Extension presentation
alkapant123
 

Similar to The use of data visualization to tell effective (20)

Diagramatic and graphical representation of data Notes on Statistics.ppt
Diagramatic and graphical representation of data Notes on Statistics.pptDiagramatic and graphical representation of data Notes on Statistics.ppt
Diagramatic and graphical representation of data Notes on Statistics.ppt
 
Use-of-ICT-in-presentation.pdf
Use-of-ICT-in-presentation.pdfUse-of-ICT-in-presentation.pdf
Use-of-ICT-in-presentation.pdf
 
Graphs
Graphs  Graphs
Graphs
 
Diowane2003
Diowane2003Diowane2003
Diowane2003
 
Graphicalrepresntationofdatausingstatisticaltools2019_210902_105156.pdf
Graphicalrepresntationofdatausingstatisticaltools2019_210902_105156.pdfGraphicalrepresntationofdatausingstatisticaltools2019_210902_105156.pdf
Graphicalrepresntationofdatausingstatisticaltools2019_210902_105156.pdf
 
Data-Presentation-and-Interpretation-in-Tabular-Graphical.pptx
Data-Presentation-and-Interpretation-in-Tabular-Graphical.pptxData-Presentation-and-Interpretation-in-Tabular-Graphical.pptx
Data-Presentation-and-Interpretation-in-Tabular-Graphical.pptx
 
Unit 2_ Descriptive Analytics for MBA .pptx
Unit 2_ Descriptive Analytics for MBA .pptxUnit 2_ Descriptive Analytics for MBA .pptx
Unit 2_ Descriptive Analytics for MBA .pptx
 
Artificial Intelligence - Data Analysis, Creative & Critical Thinking and AI...
Artificial Intelligence - Data Analysis, Creative & Critical Thinking and  AI...Artificial Intelligence - Data Analysis, Creative & Critical Thinking and  AI...
Artificial Intelligence - Data Analysis, Creative & Critical Thinking and AI...
 
Bigdata
BigdataBigdata
Bigdata
 
Uses of graph
Uses of graphUses of graph
Uses of graph
 
The uses of Tables & graphs
The uses of Tables & graphsThe uses of Tables & graphs
The uses of Tables & graphs
 
DATA PRESENTATION METHODS - 1.pptx
DATA PRESENTATION METHODS - 1.pptxDATA PRESENTATION METHODS - 1.pptx
DATA PRESENTATION METHODS - 1.pptx
 
Top 8 Different Types Of Charts In Statistics And Their Uses
Top 8 Different Types Of Charts In Statistics And Their UsesTop 8 Different Types Of Charts In Statistics And Their Uses
Top 8 Different Types Of Charts In Statistics And Their Uses
 
Use of Maps and Charts in Research
Use of Maps and Charts in ResearchUse of Maps and Charts in Research
Use of Maps and Charts in Research
 
Numerical & graphical presentation of data
Numerical & graphical presentation of dataNumerical & graphical presentation of data
Numerical & graphical presentation of data
 
Business Analytics 1 Module 3.pdf
Business Analytics 1 Module 3.pdfBusiness Analytics 1 Module 3.pdf
Business Analytics 1 Module 3.pdf
 
Statistics Class 10 CBSE
Statistics Class 10 CBSE Statistics Class 10 CBSE
Statistics Class 10 CBSE
 
Bba 2001
Bba 2001Bba 2001
Bba 2001
 
DATA-PROCESSING.pptx
DATA-PROCESSING.pptxDATA-PROCESSING.pptx
DATA-PROCESSING.pptx
 
Extension presentation
Extension presentationExtension presentation
Extension presentation
 

More from gentlemoro

The role of fdi in india agriculture slide share
The role of fdi in india agriculture   slide shareThe role of fdi in india agriculture   slide share
The role of fdi in india agriculture slide share
gentlemoro
 
Questionnaire and schedules design
Questionnaire and schedules designQuestionnaire and schedules design
Questionnaire and schedules design
gentlemoro
 
National income
National incomeNational income
National income
gentlemoro
 
FISCAL AND MONETARY POLICIES (GHANA AND INDIA )
FISCAL AND MONETARY POLICIES (GHANA AND INDIA )FISCAL AND MONETARY POLICIES (GHANA AND INDIA )
FISCAL AND MONETARY POLICIES (GHANA AND INDIA )
gentlemoro
 
Important marketing Functions
Important marketing FunctionsImportant marketing Functions
Important marketing Functions
gentlemoro
 
Coordination
CoordinationCoordination
Coordination
gentlemoro
 
Gender issues in entrepneur development
Gender issues in entrepneur developmentGender issues in entrepneur development
Gender issues in entrepneur development
gentlemoro
 
Batsau peace conference: Elections and Violence
Batsau peace conference: Elections and ViolenceBatsau peace conference: Elections and Violence
Batsau peace conference: Elections and Violence
gentlemoro
 

More from gentlemoro (8)

The role of fdi in india agriculture slide share
The role of fdi in india agriculture   slide shareThe role of fdi in india agriculture   slide share
The role of fdi in india agriculture slide share
 
Questionnaire and schedules design
Questionnaire and schedules designQuestionnaire and schedules design
Questionnaire and schedules design
 
National income
National incomeNational income
National income
 
FISCAL AND MONETARY POLICIES (GHANA AND INDIA )
FISCAL AND MONETARY POLICIES (GHANA AND INDIA )FISCAL AND MONETARY POLICIES (GHANA AND INDIA )
FISCAL AND MONETARY POLICIES (GHANA AND INDIA )
 
Important marketing Functions
Important marketing FunctionsImportant marketing Functions
Important marketing Functions
 
Coordination
CoordinationCoordination
Coordination
 
Gender issues in entrepneur development
Gender issues in entrepneur developmentGender issues in entrepneur development
Gender issues in entrepneur development
 
Batsau peace conference: Elections and Violence
Batsau peace conference: Elections and ViolenceBatsau peace conference: Elections and Violence
Batsau peace conference: Elections and Violence
 

Recently uploaded

Adjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTESAdjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTES
Subhajit Sahu
 
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
yhkoc
 
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
ewymefz
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP
 
Q1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year ReboundQ1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year Rebound
Oppotus
 
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
ahzuo
 
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdfCh03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
haila53
 
SOCRadar Germany 2024 Threat Landscape Report
SOCRadar Germany 2024 Threat Landscape ReportSOCRadar Germany 2024 Threat Landscape Report
SOCRadar Germany 2024 Threat Landscape Report
SOCRadar
 
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
ahzuo
 
一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单
enxupq
 
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
ewymefz
 
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
mbawufebxi
 
Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)
TravisMalana
 
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
NABLAS株式会社
 
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
Tiktokethiodaily
 
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
ewymefz
 
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project PresentationPredicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Boston Institute of Analytics
 
Opendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptxOpendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptx
Opendatabay
 
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
vcaxypu
 
一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单
ewymefz
 

Recently uploaded (20)

Adjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTESAdjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTES
 
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
 
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
 
Q1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year ReboundQ1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year Rebound
 
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
 
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdfCh03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
 
SOCRadar Germany 2024 Threat Landscape Report
SOCRadar Germany 2024 Threat Landscape ReportSOCRadar Germany 2024 Threat Landscape Report
SOCRadar Germany 2024 Threat Landscape Report
 
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
 
一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单
 
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
 
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
 
Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)
 
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
 
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
 
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
 
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project PresentationPredicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
 
Opendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptxOpendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptx
 
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
 
一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单
 

The use of data visualization to tell effective

  • 1. The use of Data visualization to tell effective stories By: Moro Seidu (2018) Email: gentlemoro@iwatchafrica.org
  • 2.
  • 3. Outline of Presentation  PART ONE: THEORY  Data Presentation & Visualization  Data  Forms of Data Presentation & Visualization  Text form;  tabular form;  graphical form.  Summary  PART TWO: PRACTICAL  Text Mining  Sentimental Analysis  Wordcloud Analysis
  • 4. Data  Data usually represents unprocessed numbers, text, pictures or statements  Data are usually collected in a raw format and thus the inherent information is difficult to understand  Therefore, raw data need to be summarized, processed, and analysed.  Data set needs to be presented in a certain way depending on what it is used for.
  • 5. Data…  Planning how the data will be presented is essential before appropriately processing raw data.  However, no matter how well the information derived from the raw data is manipulated, it should be presented in an effective format, otherwise, it would be a great loss for both authors and readers.  These days, data are often summarized, organized, and analysed with statistical packages or graphics software.  Data must be prepared in such a way that they are properly recognized by the program being used.
  • 6. Data…  This presentation does not discuss data preparation process, which involves creating a data frame, creating/changing rows and columns, changing the level of a factor, categorical variable, coding, dummy variables, variable transformation, data transformation, missing value, outlier treatment, and noise removal.
  • 7. Data Presentation/Visualisation  Methods of presentation must be determined according to the  data format,  the method of analysis to be used, and  the information to be emphasized.  Even when the same information is being conveyed, different methods of presentation must be employed depending on what specific information is going to be emphasized.
  • 8. Data Presentation/Visualisation…  A method of presentation must be chosen after carefully weighing the advantages and disadvantages of different methods of presentation.  This conference shall focus on three different methods of presentation: Text form tabular form graphical form  One thing to always bear in mind regardless of what method is used, however, is the simplicity of presentation.
  • 9. Text presentation  Text is the main method of conveying information as it is used to explain results and trends, and provide contextual information.  Data are fundamentally presented in paragraphs or sentences.  Text can be used to provide interpretation or emphasize certain data.  If quantitative information to be conveyed consists of one or two numbers, it is more appropriate to use text (written language) than tables or graphs.
  • 10. Text presentation…  For instance, information about the occurrence of accident in 2016–2017 can be presented with the use of a few numbers:  “The occurrence of accident was 11% in 2016 and 15% in 2017; no significant difference of occurrence was found between the two years.”  If this information were to be presented in a graph or a table, it would occupy an unnecessarily large space on the page, without enhancing the readers' understanding of the data.
  • 11. Text presentation…  If more data are to be presented, or other information such as that regarding data trends are to be conveyed, a table or a graph would be more appropriate.  By nature, data take longer to read when presented as texts and when the main text includes a long list of information, readers and reviewers may have difficulties in understanding the information.
  • 12. Table presentation  Tables are the most appropriate for presenting individual information, and can present both quantitative and qualitative information. Advantages of using tables  Tables can accurately present information that cannot be presented with a graph. A number such as “132.145852” can be accurately expressed in a table.  Information with different units can be presented together on a table. For instance, atmospheric temperature, speed of a vehicle, number of drivers, travelling distance and time of travel can be presented together in one table.  tables are useful for summarizing and comparing quantitative information of different variables.
  • 13. Table presentation… Disadvantages of using tables  the interpretation of information takes longer in tables than in graphs,  tables are not appropriate for studying data trends.  since all data are of equal importance in a table, it is not easy to identify and selectively choose the information required.
  • 14. Table presentation…  Heat maps help to improve visualization of information on tables.  Heat maps help to further visualize the information presented in a table by applying colours to the background of cells.  By adjusting the colours or colour saturation, information is conveyed in a more visible manner, and readers can quickly identify the information of interest.  Software such as Excel (in Microsoft Office) have features that enable easy creation of heat maps through the options available on the “conditional formatting” menu.
  • 15. Graph presentation  Whereas tables can be used for presenting all the information, graphs simplify complex information by using images and emphasizing data patterns or trends, and are useful for summarizing, explaining, or exploring quantitative data.  While graphs are effective for presenting large amounts of data, they can be used in place of tables to present small sets of data.  A graph format that best presents information must be chosen so that readers and reviewers can easily understand the information.
  • 16. Graph presentation… Frequently used graph formats: Scatter plot Bar graph and histogram Pie chart Line plot Box and whisker chart
  • 17. Scatter plot  Scatter plots present data on the x- and y-axes and are used to investigate an association (correlation) between two variables.  A point represents each individual or object, and an association between two variables can be studied by analysing patterns across multiple points.  A regression line is added to a graph to determine whether the association between two variables can be explained or not.  If multiple points exist at an identical location, the correlation level may not be clear.  In this case, a correlation coefficient or regression line can be added to further elucidate the correlation.
  • 18. Bar graph and histogram  A bar graph is used to indicate and compare values in a discrete category or group, and the frequency or other measurement parameters (i.e. mean).  Depending on the number of categories, and the size or complexity of each category, bars may be created vertically or horizontally.  The height (or length) of a bar represents the amount of information in a category.  Bar graphs are flexible, and can be used in a grouped or subdivided bar format in cases of two or more data sets in each category.
  • 19. Bar graph and histogram…  By comparing the endpoints of bars, one can identify the largest and the smallest categories, and understand gradual differences between each category.  One form of vertical bar graph is the stacked vertical bar graph. A stack vertical bar graph is used to compare the sum of each category, and analyse parts of a category.
  • 20. Pie chart  A pie chart, which is used to represent nominal data (in other words, data classified in different categories), visually represents a distribution of categories.  It is generally the most appropriate format for representing information grouped into a small number of categories.  It is also used for data that have no other way of being represented aside from a table (i.e. frequency table).
  • 21. Line plot  A line plot is useful for representing time-series data such as monthly electricity bills and yearly unemployment rates; in other words, it is used to study variables that are observed over time.  Line graphs are especially useful for studying patterns and trends across data that include climatic influence, large changes or turning points, and  are also appropriate for representing not only time-series data, but also data measured over the progression of a continuous variable such as distance.  It is also useful to represent multiple data sets on a single line graph to compare and analyse patterns across different data sets.
  • 22. Box and whisker chart  A box and whisker chart does not make any assumptions about the underlying statistical distribution, and represents variations in samples of a population; therefore, it is appropriate for representing nonparametric data.  A box and whisker chart consists of boxes that represent interquartile range (one to three), the median and the mean of the data, and whiskers presented as lines outside of the boxes.  Whiskers can be used to present the largest and smallest values in a set of data or only a part of the data.  The spacing at both ends of the box indicates dispersion in the data. The relative location of the median demonstrated within the box indicates skewness
  • 23. Summary  Text, tables, and graphs are effective communication media that present and convey data and information.  They aid readers in understanding the content of research, sustain their interest, and effectively present large quantities of complex information.  As readers will scan through these presentations before reading the entire text, their importance cannot be disregarded.  For this reason, authors or publishers must pay as close attention to selecting appropriate methods of data presentation as when they were collecting data of good quality and analysing them.  Understanding of different methods of data presentation and their appropriate use will enable one to develop the ability to recognize and interpret inappropriately presented data or data presented in such a way that it deceives readers' eyes
  • 24. PART II ( PRACTICALS) Materials:  R, RStudio,  Library (tm, tidytext, janeaustenr, wordcloud, dplyr, stringr, gplot, SnowballC), Exercise  Text mining  Sentimental Analysis  Wordcloud analysis