Visual AnaVisual Analyticslytics
inin BigBig DataData
Saurabh ShanbhagSaurabh Shanbhag
TE IT 3134085TE IT 3134085
T120388690T120388690
What isWhat is big databig data??
• Big data is a term for data sets that are so large or
complex that traditional data processing applications
are inadequate.
• Nowadays, huge databases are required by big
companies and proper analysis of this big data is a
necessity.
Need forbig dataNeed forbig data analyticsanalytics
• Big data analytics examines large
amounts of data to uncover
hidden patterns, correlations and
other insights.
• Analytics proves helpful for:
(1)Cost Reduction.
(2)Faster, better decision making.
(3)New, smarter products and
services.
• Predictive analysis is facilitated
majorly by data visualization.
Big DataBig Data VisualizationVisualization
• Data visualization is the
presentation of data in a
pictorial or graphical
format.
• It enables decision makers
to see analytics presented
visually, so they can grasp
difficult concepts or identify
new patterns.
• With interactive visualization, you can use
technology to drill down into charts and graphs for
more detail, interactively changing what data you
see and how it’s processed.
How It WorksHow It Works
• First, We integrate all the data, ordered and unordered
and store it together.
• Then analytics algorithms are used for effective analysis
of this huge data.
• The results are then displayed using techniques that help
the user to understand the data better.
Big Data Visualization as anBig Data Visualization as an
InterfaceInterface
• Visualization is the interface
between the data storage
and the user.
• The long, core process of
detailed analysis of data
takes place in the
background and the sorted,
analysed data is displayed in
the best form to the user. No
details are exposed to the
users.
Importance ofImportance of Visual AnalyticsVisual Analytics forfor
Big DataBig Data
• Using charts or graphs to
visualize large complex
data is a lot easier than
poring over spreadsheets
or reports.
• Data visualization is a
quick, easy way to convey
concepts in a universal
manner – and you can
experiment with different
scenarios by making slight
adjustments
HumanHuman PerceptionPerception & Better& Better
UnderstandingUnderstanding
• A human eye can distinguish differences in line length, shape
orientation, and color (hue) readily without significant processing
effort. These are referred to as pre-attentive attributes.
• Cognition refers to processes in human beings like perception,
attention, learning, memory, thought, concept formation, reading,
and problem solving. 
• The basis of data visualization evolved because as a picture is
worth a thousand words, data displayed graphically allows for an
easier comprehension of the information. 
CharacteristicsCharacteristics of Goodof Good
VisualizationVisualization
• avoid distorting what the data has to
say.
• present many numbers in a small space.
• make large data sets coherent.
• encourage the eye to compare different
pieces of data.
• reveal the data at several levels of detail
• serve a reasonably clear purpose:
description, exploration, tabulation or
decoration
• be closely integrated with the statistical
and verbal descriptions of a data set.
3V’s3V’s andand 3C’s3C’s of Big Dataof Big Data
VisualizationVisualization
• 3 V’s of Big Data:
(1)Variety
(2)Volume
(3)Velocity
• 3 C’s Of Visualization:
(1)Coherence
(2)Context
(3)Cognition
Data-Aesthetic-InteractionData-Aesthetic-Interaction
TriangleTriangle
Types ofTypes of BasicBasic DiagramsDiagrams used forused for
Simple DataSimple Data VisualizationVisualization
• 1. Bar Charts
A bar chart or bar graph is
a chart that presents grouped data
with rectangular bars with lengths
proportional to the value that they
represent.
• 2. Histogram
A histogram is a graphical
representation of the distribution of
numerical data. It is an estimate of
the probability distribution of a
continuous variable (quantitative
variable)
• 3. Pie Chart
A type of graph in which a
circle is divided into sectors
that each represent a
proportion of the whole.
• 4. 3D pie charts
Another dimension is added to
the above shown pie chart to
add more functionality.
• 5. Line Graphs:
A line chart or line graph is
a type of chart which
displays information as a
series of data points called
'markers' connected by
straightline segments
• 6. Simple Tables:
The simple table consists of
rows and columns and the
data is accordingly placed in
corresponding cells.
DiagramsDiagrams Used forUsed for BigBig DataData
VisualizationVisualization
• 1. Scatter Plot
A scatter plot is a plot of the values of Y
versus the corresponding values of X:
Vertical axis: variable Y--usually the
response variable. Horizontal axis: variable
X--usually some variable we suspect may
ber related to the response.
• 2. 3D Scatter Plot
A 3D scatter plot allows the
visualization of multivariate data. A
dimension is added for additional
functionality.
• 3. Network Graph:
Study of graphs, which are
mathematical structures used
to model pairwise relations
between objects. A graph in
this context is made up of
vertices, nodes, or points
which are connected by edges,
arcs, or lines.
• 4. Tree Maps:
A treemap is a visual method
for displaying hierarchical data
that uses nested rectangles to
represent the branches of a
tree diagram. Each rectangles
has an area proportional to the
amount of data it represents.
• 5. Stream Graph:
A stream graph, is a type of
stacked area graph which is
displaced around a central
axis, resulting in a flowing,
organic shape.
• 6. Heat Map:
A heat map is a graphical
representation of data where
the individual values contained
in a matrix are represented as
colors. Fractal maps and
tree maps both often use a
similar system of color-coding
to represent the values taken
by a variable in a hierarchy.
• 7.GANTT Chart:
It is a chart in which a series of
horizontal lines shows the
amount of work done or
production completed in
certain periods of time in
relation to the amount planned
for those periods.
• 8. 3D Graphs:
A three-dimensional graph of a
relationship g (x, y, z ) among
three variables. A three-
dimensional graph is typically
drawn on a two-dimensional
page or screen using
perspective methods. Also
known as terrains.
DecidingDeciding Which Visual is BestWhich Visual is Best
• One of the biggest challenges for business users is deciding which
visual should be used to best represent the information.
• When you’re first exploring a new data set, autocharts are especially
useful because they provide a quick view of large amounts of data.
This data exploration capability is helpful even to experienced
statisticians as they seek to speed up the analytics lifecycle process
because it eliminates the need for repeated sampling to determine
which data is appropriate for each model.
How is it being used?How is it being used?
• Regardless of industry or size, all types of businesses are
using data visualization to help make sense of their data.
Here’s how:
• Comprehend information
quickly
• Identify relationships and
patterns
• Pinpoint emerging trends
• Communicate the story to
others
• Manipulate and interact
directly with data
Big Data VisualizationBig Data Visualization ToolsTools
• 1. Polymaps
3. Flot
• 2. Nodebox
Thinking fortheThinking forthe FutureFuture
• Data visualization is entering a new era. Emerging sources of
intelligence & theoretical developments in multidimensional
imaging are reshaping the potential value that analytics and
insights can provide, with visualization playing a key role. The
principles of effective data visualization won’t change. However,
next-gen technologies and evolving cognitive frameworks are
opening new horizons, moving data visualization from art to
science.
ReferencesReferences
• [1] S. Y. Kung , “Visualization of Big Data”,
Cognitive Informatics & Cognitive Computing (ICCI*CC)
, pages. 447-449, 6-8 July 2015.
DOI: 10.1109/ICCI-CC.2015.7259428
• [2] I. Herman , G. Melancon ; M. S. Marshall,
“Graph Visualization and navigation” IEEE
Transactions on Visualization and Computer
Graphics (Volume:6 , Issue:1), Pages 24-43,
August 2002.
DOI: 10.1109/2945.841119
• [3] Deepa Gupta, Sameera Siddiqui; “Big data
Implementation and Visualization”, Advances in
Engineering and Technology Research (ICAETR),
2014 International Conference ,
pages-1-10, issue no. 2347-9337, July 2014.
DOI: 10.1109/ICAETR.2014.7012883
Visual Analytics in Big Data

Visual Analytics in Big Data

  • 1.
    Visual AnaVisual Analyticslytics ininBigBig DataData Saurabh ShanbhagSaurabh Shanbhag TE IT 3134085TE IT 3134085 T120388690T120388690
  • 2.
    What isWhat isbig databig data?? • Big data is a term for data sets that are so large or complex that traditional data processing applications are inadequate. • Nowadays, huge databases are required by big companies and proper analysis of this big data is a necessity.
  • 3.
    Need forbig dataNeedforbig data analyticsanalytics • Big data analytics examines large amounts of data to uncover hidden patterns, correlations and other insights. • Analytics proves helpful for: (1)Cost Reduction. (2)Faster, better decision making. (3)New, smarter products and services. • Predictive analysis is facilitated majorly by data visualization.
  • 4.
    Big DataBig DataVisualizationVisualization • Data visualization is the presentation of data in a pictorial or graphical format. • It enables decision makers to see analytics presented visually, so they can grasp difficult concepts or identify new patterns. • With interactive visualization, you can use technology to drill down into charts and graphs for more detail, interactively changing what data you see and how it’s processed.
  • 5.
    How It WorksHowIt Works • First, We integrate all the data, ordered and unordered and store it together. • Then analytics algorithms are used for effective analysis of this huge data. • The results are then displayed using techniques that help the user to understand the data better.
  • 6.
    Big Data Visualizationas anBig Data Visualization as an InterfaceInterface • Visualization is the interface between the data storage and the user. • The long, core process of detailed analysis of data takes place in the background and the sorted, analysed data is displayed in the best form to the user. No details are exposed to the users.
  • 7.
    Importance ofImportance ofVisual AnalyticsVisual Analytics forfor Big DataBig Data • Using charts or graphs to visualize large complex data is a lot easier than poring over spreadsheets or reports. • Data visualization is a quick, easy way to convey concepts in a universal manner – and you can experiment with different scenarios by making slight adjustments
  • 8.
    HumanHuman PerceptionPerception &Better& Better UnderstandingUnderstanding • A human eye can distinguish differences in line length, shape orientation, and color (hue) readily without significant processing effort. These are referred to as pre-attentive attributes. • Cognition refers to processes in human beings like perception, attention, learning, memory, thought, concept formation, reading, and problem solving.  • The basis of data visualization evolved because as a picture is worth a thousand words, data displayed graphically allows for an easier comprehension of the information. 
  • 9.
    CharacteristicsCharacteristics of GoodofGood VisualizationVisualization • avoid distorting what the data has to say. • present many numbers in a small space. • make large data sets coherent. • encourage the eye to compare different pieces of data. • reveal the data at several levels of detail • serve a reasonably clear purpose: description, exploration, tabulation or decoration • be closely integrated with the statistical and verbal descriptions of a data set.
  • 10.
    3V’s3V’s andand 3C’s3C’sof Big Dataof Big Data VisualizationVisualization • 3 V’s of Big Data: (1)Variety (2)Volume (3)Velocity • 3 C’s Of Visualization: (1)Coherence (2)Context (3)Cognition
  • 11.
  • 12.
    Types ofTypes ofBasicBasic DiagramsDiagrams used forused for Simple DataSimple Data VisualizationVisualization • 1. Bar Charts A bar chart or bar graph is a chart that presents grouped data with rectangular bars with lengths proportional to the value that they represent. • 2. Histogram A histogram is a graphical representation of the distribution of numerical data. It is an estimate of the probability distribution of a continuous variable (quantitative variable)
  • 13.
    • 3. PieChart A type of graph in which a circle is divided into sectors that each represent a proportion of the whole. • 4. 3D pie charts Another dimension is added to the above shown pie chart to add more functionality.
  • 14.
    • 5. LineGraphs: A line chart or line graph is a type of chart which displays information as a series of data points called 'markers' connected by straightline segments • 6. Simple Tables: The simple table consists of rows and columns and the data is accordingly placed in corresponding cells.
  • 15.
    DiagramsDiagrams Used forUsedfor BigBig DataData VisualizationVisualization • 1. Scatter Plot A scatter plot is a plot of the values of Y versus the corresponding values of X: Vertical axis: variable Y--usually the response variable. Horizontal axis: variable X--usually some variable we suspect may ber related to the response. • 2. 3D Scatter Plot A 3D scatter plot allows the visualization of multivariate data. A dimension is added for additional functionality.
  • 16.
    • 3. NetworkGraph: Study of graphs, which are mathematical structures used to model pairwise relations between objects. A graph in this context is made up of vertices, nodes, or points which are connected by edges, arcs, or lines. • 4. Tree Maps: A treemap is a visual method for displaying hierarchical data that uses nested rectangles to represent the branches of a tree diagram. Each rectangles has an area proportional to the amount of data it represents.
  • 17.
    • 5. StreamGraph: A stream graph, is a type of stacked area graph which is displaced around a central axis, resulting in a flowing, organic shape. • 6. Heat Map: A heat map is a graphical representation of data where the individual values contained in a matrix are represented as colors. Fractal maps and tree maps both often use a similar system of color-coding to represent the values taken by a variable in a hierarchy.
  • 18.
    • 7.GANTT Chart: Itis a chart in which a series of horizontal lines shows the amount of work done or production completed in certain periods of time in relation to the amount planned for those periods. • 8. 3D Graphs: A three-dimensional graph of a relationship g (x, y, z ) among three variables. A three- dimensional graph is typically drawn on a two-dimensional page or screen using perspective methods. Also known as terrains.
  • 19.
    DecidingDeciding Which Visualis BestWhich Visual is Best • One of the biggest challenges for business users is deciding which visual should be used to best represent the information. • When you’re first exploring a new data set, autocharts are especially useful because they provide a quick view of large amounts of data. This data exploration capability is helpful even to experienced statisticians as they seek to speed up the analytics lifecycle process because it eliminates the need for repeated sampling to determine which data is appropriate for each model.
  • 20.
    How is itbeing used?How is it being used? • Regardless of industry or size, all types of businesses are using data visualization to help make sense of their data. Here’s how: • Comprehend information quickly • Identify relationships and patterns • Pinpoint emerging trends • Communicate the story to others • Manipulate and interact directly with data
  • 21.
    Big Data VisualizationBigData Visualization ToolsTools • 1. Polymaps 3. Flot • 2. Nodebox
  • 22.
    Thinking fortheThinking fortheFutureFuture • Data visualization is entering a new era. Emerging sources of intelligence & theoretical developments in multidimensional imaging are reshaping the potential value that analytics and insights can provide, with visualization playing a key role. The principles of effective data visualization won’t change. However, next-gen technologies and evolving cognitive frameworks are opening new horizons, moving data visualization from art to science.
  • 23.
    ReferencesReferences • [1] S.Y. Kung , “Visualization of Big Data”, Cognitive Informatics & Cognitive Computing (ICCI*CC) , pages. 447-449, 6-8 July 2015. DOI: 10.1109/ICCI-CC.2015.7259428 • [2] I. Herman , G. Melancon ; M. S. Marshall, “Graph Visualization and navigation” IEEE Transactions on Visualization and Computer Graphics (Volume:6 , Issue:1), Pages 24-43, August 2002. DOI: 10.1109/2945.841119 • [3] Deepa Gupta, Sameera Siddiqui; “Big data Implementation and Visualization”, Advances in Engineering and Technology Research (ICAETR), 2014 International Conference , pages-1-10, issue no. 2347-9337, July 2014. DOI: 10.1109/ICAETR.2014.7012883