SlideShare a Scribd company logo
1 of 29
Challenges in Analytics for BIGData
Dr. Prasant Misra
W: https://sites.google.com/site/prasantmisra
Disclaimer:
The opinions expressed in this presentation and on the following slides are solely those
of the presenter and not necessarily those of the organization that he works for.
A simple narrative to BIG DATA
8/26/2016 2
DATA whose characteristics exceeds the capabilities of conventional
algorithms, systems and techniques to derive useful value is considered BIG
datascience.berkeley.edu
The term if very fuzzy and means different things to different groups of people ….
How did we arrive at this stage ?
8/26/2016 3
1960 - 70
1980 - 90
2000 -10 and
beyond
Year
Size
8/26/2016 4
History of Computing
Accessibility to cyber end points have increased drastically …
8/26/2016 5
Device Proliferation
http://www.onethatmatters.com/wp-content/uploads/2015/12/Internet-of-Things-why.png
8/26/2016 6
DATA Proliferation
Web & Social
Media
Enter-
prises
Gov.
Example of a Killer APP: Navigation
https://www.microsoft.com/en-us/research/event/tutorial-mobile-location-sensing
8/26/2016 7
8/26/2016 8
Local Search - I
Context Service Example
Current
Location
Local business
8/26/2016 9
Local Search - II
Context Service Example
Current
Location
Local business and
directions
+
Time Tracks
Businesses in
driving direction
8/26/2016 10
Local Search - III
Context Service Example
Current
Location
Local business and
directions
+
Time Tracks
Businesses in
driving direction
+
History
Personalized
directions
 Take 520 East
8/26/2016 11
Local Search - IV
Context Service Example
Current
Location
Local business and
directions
+
Time Tracks
Businesses in
driving direction
+
History
Personalized
directions
+
Community
Tourist
recommendation
35% people pick
the scenic route
8/26/2016 12
Local Search - V
Alert: Bad
Traffic
Consider
Alternate
route
Context Service Example
Current
Location
Local business and
directions
Tracks
Businesses in
driving direction
+
History
Personalized
directions
+
Community
Tourist
recommendation
+
Push
alerts, triggers,
reminders
BIG Data for Location Analytics …
8/26/2016 13
Analytics: Span across Verticals & Horizontals
Depending on the type and quality of analytics, system could manifest themselves into:
 User-centric Systems — Systems That Know/Aware
 Adaptive Systems — Systems That Learn
 Cognitive Systems — Systems That Reason
E
N
E
R
G
Y
W
A
T
E
R
R
E
T
A
I
L
T
E
L
C
O
M
H
E
A
L
T
H
Time, Location Management
Sensor, Device Management
Network Management
Cloud Infra Management
Customer Management
The flavor of Data that is BIG
8/26/2016 14
The 4 Dimensions of BIG Data
8/26/2016 15
Analytics
8/26/2016 16
Value
8/26/2016 17
Hindsight and Insight/
Insights into the PAST
Foresight/
Insights into the FUTURE
Skill
Descriptive
“WHAT has
happened ? ”
Diagnostic
“WHY did this
happen ?”
Prescriptive
“WHAT should
we do ?”
Predictive
“WHAT could
happen ? ”
Information Optimization
Analytics : Category
DASHBOARD
FORECAST ACTIONS,
RULES,
RECOMMs
Example: Energy Analytics for a PV Microgrid
8/26/2016 18
Descriptive: What is the total energy, instantaneous energy and power, etc., …?
Diagnostic: Why is the panel temperature decreasing when the solar irradiance is high and the wind
speed is very low ?
Predictive: Can I forecast the plant output for tomorrow, or can I generate 4kWh net energy ?
Predictive : What actions should be undertaken for the plant to reach 4kW energy generation capacity
from its current 2 kW ?
8/26/2016 19
Analytics : Methodology
 Reason and Plan with Uncertain Knowledge
 Quantify uncertainty & Probabilistic reasoning: Bayesian networks, Conditional distributions
 Probabilistic reasoning over time:
Hidden Markov models, Kalman filters, Dynamic Bayesian networks
 Simple decisions: Utility theory, Decision networks
 Complex decisions: Partial observable Markov Decision Process (MDP), Game theoretic
models
 Planning graphs
 Learning and Data Mining:
 [Supervised | Semi-supervised | Unsupervised | Reinforcement] learning – Classification,
Clustering
 Different type of ANN | Deep Learning Networks | Support Vector Machines
Challenges
8/26/2016 20
Data to Knowledge Pipeline
8/26/2016 21
Cyber & Physical Space Entities
Edge
Global Infra
Data Ingestion
Data Analysis
Applications
Data source
“Big” data Infra
“Little” data Infra
Decision making
with Knowledge
DATA @ REST (VOLUME)
Archival/Static data (TBs) in Data stores
DATA @ MOTION (VELOCITY)
Streaming data
DATA @ MANY FORMS (VARIETY)
Structured/Unstructured, Text, Multimedia, Audio, Video
DATA @ DOUBT (VERACITY)
Data with uncertainty that may be due to
incompleteness, missing points, etc.,
NATURE of INGESTED DATA
COGNITIVE
Learn dynamically ?
PRESCRIPTIVE
What are the best outcomes ?
PREDICTIVE
What could happen ?
DESCRIPTIVE
What has happened ?
DISCOVERY
What do we have ?
NATURE of ANALYSIS
A first list of challenges derived from the V’s
8/26/2016 22
 Volume:
 How much data is really relevant to the problem solution & what is the cost of processing ?
 Can you really afford to store and process all that data ?
 Velocity
 A lot of data is coming in at high speed
 Need for streaming versus block approach to data analysis
 How to analyze data in-flight and combine with data at-rest
 Variety:
 A small fraction is in structured formats (e.g., relational, XML, etc.)
 A fair amount is semi-structured (e.g., web logs, etc.)
 The rest of the data is unstructured (e.g., text, photographs, etc.)
 No single data model can currently handle the diversity
 Veracity:
 Cover term for: Accuracy, Precision, Reliability, Integrity
 What is it that you don’t know about the data ?
Top Challenges
8/26/2016 23
 Data acquisition
 Is raw data of interest in totality ?
 Challenge:
 design efficient filters and compression techniques in a manner that does not discard useful
information; automatically generate the right meta data to describe it
 Data reduction
 Will traditional data reduction approaches (via compression) become overwhelming ?
 Challenge: introduction of new data collection practices and models as per analytical needs;
compact (space, time) representations/dictionary/basis; parsimonious model (low-
dimensionality, compressed sensing and sparse data capture models)
 “Big-Little” Data
 Device cloud vs. Conventional cloud; Distributed data and Peer-to-Peer Federation
 Challenge: how to combine Big and Little data for meaningful analytics (often in real time)
 Analytics from the Edge to the Cloud
 Will the current model of pushing all data to a central cloud for analytics scale, be inefficient, and
alleviate privacy concerns ?
 Challenge: how to automate distributed analytics and decision making on subsets of “Little” and
“Big” data; within the constraints of device capability, privacy needs, energy and network costs,
and application QoS
Top Challenges
8/26/2016 24
 Handling inconsistent/incomplete/missing data and outliers
 Is this critical ?
 Challenge: design robust imputation algorithms
 Heterogeneous Data Fusion
 Is there a need to analyze the relationship between heterogeneous data objects/streams
 Challenge: Extract right amount of semantics, sequential data fusion via transform spaces
 Scalability with multi-level hierarchy
 Will traditional methods of data navigational and search in deep hierarchy be scalable ?
 Challenge: design newer alternatives
 Data summarization for interactive Query
 Will examination of datasets (all at once) become difficult ?
 Data summarization let users request data with particular characteristics
 Data summarization: organize data based on the presence/type of feature
 Scientific data features: geometrical, topological, statistical
 Non-scientific data features: related to semantic/syntactic components of the data
 Challenge:
 extraction of meaningful features, both from high and low dimension data
 data storage and indexing in an I/O efficient format for rapid runtime retrieval
Top Challenges
8/26/2016 25
 Analytics of temporally/spatially evolving features
 Do data features occur at different spatial and temporal scales ?
 Challenge: effective visual techniques that are computationally practical and that can take
advantage of humans unique cognitive ability to track those feature changes
 Representation of evidence and uncertainty
 Interpretation of evidence is subject to person performing this task, and depends on his prior
knowledge, subjective settings and viewpoint
 Uncertainty quantification models the consequence based on the presented evidence and then
predicts the qualities of the corresponding outcome
 Challenge: how to represent evidence and uncertainty clearly and without bias through
visualization
 Sense making to users/decision makers
 Involves examining all the assumptions made and retracing the analysis
 There can be many sources of error: computer systems can have bugs, models almost always have
assumptions, and results can be based on erroneous data. For all of these reasons, users will try
to understand, and verify, the results produced by the computer.
 Challenge: what should the man-machine interface for this look like ?
Platforms and Tools
8/26/2016 26
8/26/2016 27
8/26/2016 28
Scale and Size DOES matter !!!
8/26/2016 29
References
 Stephen H. Kaisler et. Al ,“Big data and analytics: challenges and issues”
 Pak Chung Wong, Han-Wei Shen, Chaomei Chen, “Top Ten Interaction Challenges
in Extreme-Scale Visual Analytics”
http://link.springer.com/chapter/10.1007/978-1-4471-2804-5_12#page-1
 Other info graphics from the web !!!

More Related Content

What's hot

Data science presentation
Data science presentationData science presentation
Data science presentationMSDEVMTL
 
hariri2019.pdf
hariri2019.pdfhariri2019.pdf
hariri2019.pdfAkuhuruf
 
Big Data & Machine Learning - TDC2013 Sao Paulo
Big Data & Machine Learning - TDC2013 Sao PauloBig Data & Machine Learning - TDC2013 Sao Paulo
Big Data & Machine Learning - TDC2013 Sao PauloOCTO Technology
 
Introduction to data science
Introduction to data scienceIntroduction to data science
Introduction to data scienceSampath Kumar
 
Data science presentation 2nd CI day
Data science presentation 2nd CI dayData science presentation 2nd CI day
Data science presentation 2nd CI dayMohammed Barakat
 
Machine Learning in Big Data
Machine Learning in Big DataMachine Learning in Big Data
Machine Learning in Big DataDataWorks Summit
 
Introduction to Data Science and Analytics
Introduction to Data Science and AnalyticsIntroduction to Data Science and Analytics
Introduction to Data Science and AnalyticsDhruv Saxena
 
Data Science Lecture: Overview and Information Collateral
Data Science Lecture: Overview and Information CollateralData Science Lecture: Overview and Information Collateral
Data Science Lecture: Overview and Information CollateralFrank Kienle
 
Introduction to Data Science (Data Science Thailand Meetup #1)
Introduction to Data Science (Data Science Thailand Meetup #1)Introduction to Data Science (Data Science Thailand Meetup #1)
Introduction to Data Science (Data Science Thailand Meetup #1)Data Science Thailand
 
Smart IoT for Connected Manufacturing
Smart IoT for Connected ManufacturingSmart IoT for Connected Manufacturing
Smart IoT for Connected ManufacturingAmit Sheth
 
Data Science: Not Just For Big Data
Data Science: Not Just For Big DataData Science: Not Just For Big Data
Data Science: Not Just For Big DataRevolution Analytics
 
Big Data [sorry] & Data Science: What Does a Data Scientist Do?
Big Data [sorry] & Data Science: What Does a Data Scientist Do?Big Data [sorry] & Data Science: What Does a Data Scientist Do?
Big Data [sorry] & Data Science: What Does a Data Scientist Do?Data Science London
 
Data_Science_Applications_&_Use_Cases.pdf
Data_Science_Applications_&_Use_Cases.pdfData_Science_Applications_&_Use_Cases.pdf
Data_Science_Applications_&_Use_Cases.pdfvishal choudhary
 
Data Processing and Semantics for Advanced Internet of Things (IoT) Applicati...
Data Processing and Semantics for Advanced Internet of Things (IoT) Applicati...Data Processing and Semantics for Advanced Internet of Things (IoT) Applicati...
Data Processing and Semantics for Advanced Internet of Things (IoT) Applicati...Artificial Intelligence Institute at UofSC
 

What's hot (20)

Data science presentation
Data science presentationData science presentation
Data science presentation
 
hariri2019.pdf
hariri2019.pdfhariri2019.pdf
hariri2019.pdf
 
Big Data & Machine Learning - TDC2013 Sao Paulo
Big Data & Machine Learning - TDC2013 Sao PauloBig Data & Machine Learning - TDC2013 Sao Paulo
Big Data & Machine Learning - TDC2013 Sao Paulo
 
Introduction to data science
Introduction to data scienceIntroduction to data science
Introduction to data science
 
Intro to Data Science by DatalentTeam at Data Science Clinic#11
Intro to Data Science by DatalentTeam at Data Science Clinic#11Intro to Data Science by DatalentTeam at Data Science Clinic#11
Intro to Data Science by DatalentTeam at Data Science Clinic#11
 
Data science presentation 2nd CI day
Data science presentation 2nd CI dayData science presentation 2nd CI day
Data science presentation 2nd CI day
 
Machine Learning in Big Data
Machine Learning in Big DataMachine Learning in Big Data
Machine Learning in Big Data
 
Big Data for Ag (2019)
Big Data for Ag (2019)Big Data for Ag (2019)
Big Data for Ag (2019)
 
Big data road map
Big data road mapBig data road map
Big data road map
 
Lecture #01
Lecture #01Lecture #01
Lecture #01
 
Introduction to Data Science and Analytics
Introduction to Data Science and AnalyticsIntroduction to Data Science and Analytics
Introduction to Data Science and Analytics
 
Data Science Lecture: Overview and Information Collateral
Data Science Lecture: Overview and Information CollateralData Science Lecture: Overview and Information Collateral
Data Science Lecture: Overview and Information Collateral
 
Introduction to Data Science (Data Science Thailand Meetup #1)
Introduction to Data Science (Data Science Thailand Meetup #1)Introduction to Data Science (Data Science Thailand Meetup #1)
Introduction to Data Science (Data Science Thailand Meetup #1)
 
Smart IoT for Connected Manufacturing
Smart IoT for Connected ManufacturingSmart IoT for Connected Manufacturing
Smart IoT for Connected Manufacturing
 
Data Science: Not Just For Big Data
Data Science: Not Just For Big DataData Science: Not Just For Big Data
Data Science: Not Just For Big Data
 
Data Scientist Enablement roadmap 1.0
Data Scientist Enablement roadmap 1.0Data Scientist Enablement roadmap 1.0
Data Scientist Enablement roadmap 1.0
 
Big Data [sorry] & Data Science: What Does a Data Scientist Do?
Big Data [sorry] & Data Science: What Does a Data Scientist Do?Big Data [sorry] & Data Science: What Does a Data Scientist Do?
Big Data [sorry] & Data Science: What Does a Data Scientist Do?
 
Data_Science_Applications_&_Use_Cases.pdf
Data_Science_Applications_&_Use_Cases.pdfData_Science_Applications_&_Use_Cases.pdf
Data_Science_Applications_&_Use_Cases.pdf
 
Data Processing and Semantics for Advanced Internet of Things (IoT) Applicati...
Data Processing and Semantics for Advanced Internet of Things (IoT) Applicati...Data Processing and Semantics for Advanced Internet of Things (IoT) Applicati...
Data Processing and Semantics for Advanced Internet of Things (IoT) Applicati...
 
Why Data Science is a Science
Why Data Science is a ScienceWhy Data Science is a Science
Why Data Science is a Science
 

Viewers also liked

Emerging Big Data & Analytics Trends with Hadoop
Emerging Big Data & Analytics Trends with Hadoop Emerging Big Data & Analytics Trends with Hadoop
Emerging Big Data & Analytics Trends with Hadoop InnoTech
 
Top Big data Analytics tools: Emerging trends and Best practices
Top Big data Analytics tools: Emerging trends and Best practicesTop Big data Analytics tools: Emerging trends and Best practices
Top Big data Analytics tools: Emerging trends and Best practicesSpringPeople
 
Modern Big Data Analytics Tools: An Overview
Modern Big Data Analytics Tools: An OverviewModern Big Data Analytics Tools: An Overview
Modern Big Data Analytics Tools: An OverviewGreat Wide Open
 
Big Data: tools and techniques for working with large data sets
Big Data: tools and techniques for working with large data setsBig Data: tools and techniques for working with large data sets
Big Data: tools and techniques for working with large data setsBoston Consulting Group
 
Link Layer Protocols for WSN-based IoT
Link Layer Protocols for WSN-based IoTLink Layer Protocols for WSN-based IoT
Link Layer Protocols for WSN-based IoTPrasant Misra
 
Hadoop HDFS Detailed Introduction
Hadoop HDFS Detailed IntroductionHadoop HDFS Detailed Introduction
Hadoop HDFS Detailed IntroductionHanborq Inc.
 
Big Data
Big DataBig Data
Big DataNGDATA
 
Seminar Presentation Hadoop
Seminar Presentation HadoopSeminar Presentation Hadoop
Seminar Presentation HadoopVarun Narang
 
Hadoop introduction , Why and What is Hadoop ?
Hadoop introduction , Why and What is  Hadoop ?Hadoop introduction , Why and What is  Hadoop ?
Hadoop introduction , Why and What is Hadoop ?sudhakara st
 

Viewers also liked (10)

Jsm big-data
Jsm big-dataJsm big-data
Jsm big-data
 
Emerging Big Data & Analytics Trends with Hadoop
Emerging Big Data & Analytics Trends with Hadoop Emerging Big Data & Analytics Trends with Hadoop
Emerging Big Data & Analytics Trends with Hadoop
 
Top Big data Analytics tools: Emerging trends and Best practices
Top Big data Analytics tools: Emerging trends and Best practicesTop Big data Analytics tools: Emerging trends and Best practices
Top Big data Analytics tools: Emerging trends and Best practices
 
Modern Big Data Analytics Tools: An Overview
Modern Big Data Analytics Tools: An OverviewModern Big Data Analytics Tools: An Overview
Modern Big Data Analytics Tools: An Overview
 
Big Data: tools and techniques for working with large data sets
Big Data: tools and techniques for working with large data setsBig Data: tools and techniques for working with large data sets
Big Data: tools and techniques for working with large data sets
 
Link Layer Protocols for WSN-based IoT
Link Layer Protocols for WSN-based IoTLink Layer Protocols for WSN-based IoT
Link Layer Protocols for WSN-based IoT
 
Hadoop HDFS Detailed Introduction
Hadoop HDFS Detailed IntroductionHadoop HDFS Detailed Introduction
Hadoop HDFS Detailed Introduction
 
Big Data
Big DataBig Data
Big Data
 
Seminar Presentation Hadoop
Seminar Presentation HadoopSeminar Presentation Hadoop
Seminar Presentation Hadoop
 
Hadoop introduction , Why and What is Hadoop ?
Hadoop introduction , Why and What is  Hadoop ?Hadoop introduction , Why and What is  Hadoop ?
Hadoop introduction , Why and What is Hadoop ?
 

Similar to Challenges in Analytics for BIG Data

Streaming Hypothesis Reasoning - William Smith, Jan 2016
Streaming Hypothesis Reasoning - William Smith, Jan 2016Streaming Hypothesis Reasoning - William Smith, Jan 2016
Streaming Hypothesis Reasoning - William Smith, Jan 2016Seattle DAML meetup
 
BIAM 410 Final Paper - Beyond the Buzzwords: Big Data, Machine Learning, What...
BIAM 410 Final Paper - Beyond the Buzzwords: Big Data, Machine Learning, What...BIAM 410 Final Paper - Beyond the Buzzwords: Big Data, Machine Learning, What...
BIAM 410 Final Paper - Beyond the Buzzwords: Big Data, Machine Learning, What...Thomas Rones
 
Streaming HYpothesis REasoning
Streaming HYpothesis REasoningStreaming HYpothesis REasoning
Streaming HYpothesis REasoningWilliam Smith
 
SEAMLESS AUTOMATION AND INTEGRATION OF MACHINE LEARNING CAPABILITIES FOR BIG ...
SEAMLESS AUTOMATION AND INTEGRATION OF MACHINE LEARNING CAPABILITIES FOR BIG ...SEAMLESS AUTOMATION AND INTEGRATION OF MACHINE LEARNING CAPABILITIES FOR BIG ...
SEAMLESS AUTOMATION AND INTEGRATION OF MACHINE LEARNING CAPABILITIES FOR BIG ...ijdpsjournal
 
SEAMLESS AUTOMATION AND INTEGRATION OF MACHINE LEARNING CAPABILITIES FOR BIG ...
SEAMLESS AUTOMATION AND INTEGRATION OF MACHINE LEARNING CAPABILITIES FOR BIG ...SEAMLESS AUTOMATION AND INTEGRATION OF MACHINE LEARNING CAPABILITIES FOR BIG ...
SEAMLESS AUTOMATION AND INTEGRATION OF MACHINE LEARNING CAPABILITIES FOR BIG ...ijdpsjournal
 
Big data processing using - Hadoop Technology
Big data processing using - Hadoop TechnologyBig data processing using - Hadoop Technology
Big data processing using - Hadoop TechnologyShital Kat
 
A Comparative Study of Various Data Mining Techniques: Statistics, Decision T...
A Comparative Study of Various Data Mining Techniques: Statistics, Decision T...A Comparative Study of Various Data Mining Techniques: Statistics, Decision T...
A Comparative Study of Various Data Mining Techniques: Statistics, Decision T...Editor IJCATR
 
Smart Data for Smart Labs
Smart Data for Smart Labs Smart Data for Smart Labs
Smart Data for Smart Labs OSTHUS
 
Data minig with Big data analysis
Data minig with Big data analysisData minig with Big data analysis
Data minig with Big data analysisPoonam Kshirsagar
 
التنقيب في البيانات - Data Mining
التنقيب في البيانات -  Data Miningالتنقيب في البيانات -  Data Mining
التنقيب في البيانات - Data Miningnabil_alsharafi
 
Paradigm4 Research Report: Leaving Data on the table
Paradigm4 Research Report: Leaving Data on the tableParadigm4 Research Report: Leaving Data on the table
Paradigm4 Research Report: Leaving Data on the tableParadigm4
 
Nikita rajbhoj(a 50)
Nikita rajbhoj(a 50)Nikita rajbhoj(a 50)
Nikita rajbhoj(a 50)NikitaRajbhoj
 
Data Science - An emerging Stream of Science with its Spreading Reach & Impact
Data Science - An emerging Stream of Science with its Spreading Reach & ImpactData Science - An emerging Stream of Science with its Spreading Reach & Impact
Data Science - An emerging Stream of Science with its Spreading Reach & ImpactDr. Sunil Kr. Pandey
 
Big Data Session 1.pptx
Big Data Session 1.pptxBig Data Session 1.pptx
Big Data Session 1.pptxElsonPaul2
 
Challenges and outlook with Big Data
Challenges and outlook with Big Data Challenges and outlook with Big Data
Challenges and outlook with Big Data IJCERT JOURNAL
 
Sameer Kumar Das International Conference Paper 53
Sameer Kumar Das International Conference Paper 53Sameer Kumar Das International Conference Paper 53
Sameer Kumar Das International Conference Paper 53Mr.Sameer Kumar Das
 
Innovation med big data – chr. hansens erfaringer
Innovation med big data – chr. hansens erfaringerInnovation med big data – chr. hansens erfaringer
Innovation med big data – chr. hansens erfaringerMicrosoft
 
Data centric business and knowledge graph trends
Data centric business and knowledge graph trendsData centric business and knowledge graph trends
Data centric business and knowledge graph trendsAlan Morrison
 

Similar to Challenges in Analytics for BIG Data (20)

Streaming Hypothesis Reasoning - William Smith, Jan 2016
Streaming Hypothesis Reasoning - William Smith, Jan 2016Streaming Hypothesis Reasoning - William Smith, Jan 2016
Streaming Hypothesis Reasoning - William Smith, Jan 2016
 
BIAM 410 Final Paper - Beyond the Buzzwords: Big Data, Machine Learning, What...
BIAM 410 Final Paper - Beyond the Buzzwords: Big Data, Machine Learning, What...BIAM 410 Final Paper - Beyond the Buzzwords: Big Data, Machine Learning, What...
BIAM 410 Final Paper - Beyond the Buzzwords: Big Data, Machine Learning, What...
 
Streaming HYpothesis REasoning
Streaming HYpothesis REasoningStreaming HYpothesis REasoning
Streaming HYpothesis REasoning
 
SEAMLESS AUTOMATION AND INTEGRATION OF MACHINE LEARNING CAPABILITIES FOR BIG ...
SEAMLESS AUTOMATION AND INTEGRATION OF MACHINE LEARNING CAPABILITIES FOR BIG ...SEAMLESS AUTOMATION AND INTEGRATION OF MACHINE LEARNING CAPABILITIES FOR BIG ...
SEAMLESS AUTOMATION AND INTEGRATION OF MACHINE LEARNING CAPABILITIES FOR BIG ...
 
SEAMLESS AUTOMATION AND INTEGRATION OF MACHINE LEARNING CAPABILITIES FOR BIG ...
SEAMLESS AUTOMATION AND INTEGRATION OF MACHINE LEARNING CAPABILITIES FOR BIG ...SEAMLESS AUTOMATION AND INTEGRATION OF MACHINE LEARNING CAPABILITIES FOR BIG ...
SEAMLESS AUTOMATION AND INTEGRATION OF MACHINE LEARNING CAPABILITIES FOR BIG ...
 
Big data processing using - Hadoop Technology
Big data processing using - Hadoop TechnologyBig data processing using - Hadoop Technology
Big data processing using - Hadoop Technology
 
A Comparative Study of Various Data Mining Techniques: Statistics, Decision T...
A Comparative Study of Various Data Mining Techniques: Statistics, Decision T...A Comparative Study of Various Data Mining Techniques: Statistics, Decision T...
A Comparative Study of Various Data Mining Techniques: Statistics, Decision T...
 
Smart Data for Smart Labs
Smart Data for Smart Labs Smart Data for Smart Labs
Smart Data for Smart Labs
 
Data minig with Big data analysis
Data minig with Big data analysisData minig with Big data analysis
Data minig with Big data analysis
 
Big Data & DS Analytics for PAARL
Big Data & DS Analytics for PAARLBig Data & DS Analytics for PAARL
Big Data & DS Analytics for PAARL
 
التنقيب في البيانات - Data Mining
التنقيب في البيانات -  Data Miningالتنقيب في البيانات -  Data Mining
التنقيب في البيانات - Data Mining
 
Paradigm4 Research Report: Leaving Data on the table
Paradigm4 Research Report: Leaving Data on the tableParadigm4 Research Report: Leaving Data on the table
Paradigm4 Research Report: Leaving Data on the table
 
Nikita rajbhoj(a 50)
Nikita rajbhoj(a 50)Nikita rajbhoj(a 50)
Nikita rajbhoj(a 50)
 
Data Science - An emerging Stream of Science with its Spreading Reach & Impact
Data Science - An emerging Stream of Science with its Spreading Reach & ImpactData Science - An emerging Stream of Science with its Spreading Reach & Impact
Data Science - An emerging Stream of Science with its Spreading Reach & Impact
 
Big Data Session 1.pptx
Big Data Session 1.pptxBig Data Session 1.pptx
Big Data Session 1.pptx
 
Challenges and outlook with Big Data
Challenges and outlook with Big Data Challenges and outlook with Big Data
Challenges and outlook with Big Data
 
Sameer Kumar Das International Conference Paper 53
Sameer Kumar Das International Conference Paper 53Sameer Kumar Das International Conference Paper 53
Sameer Kumar Das International Conference Paper 53
 
Innovation med big data – chr. hansens erfaringer
Innovation med big data – chr. hansens erfaringerInnovation med big data – chr. hansens erfaringer
Innovation med big data – chr. hansens erfaringer
 
1 UNIT-DSP.pptx
1 UNIT-DSP.pptx1 UNIT-DSP.pptx
1 UNIT-DSP.pptx
 
Data centric business and knowledge graph trends
Data centric business and knowledge graph trendsData centric business and knowledge graph trends
Data centric business and knowledge graph trends
 

More from Prasant Misra

Hybrid Planner for Smart Charging of Electric Fleets
Hybrid Planner for Smart Charging of Electric FleetsHybrid Planner for Smart Charging of Electric Fleets
Hybrid Planner for Smart Charging of Electric FleetsPrasant Misra
 
Reinforcement Learning for EVRP with V2G
Reinforcement Learning for EVRP with V2GReinforcement Learning for EVRP with V2G
Reinforcement Learning for EVRP with V2GPrasant Misra
 
How can Mobility be used to solve Social Problems?
How can Mobility be used to solve Social Problems?How can Mobility be used to solve Social Problems?
How can Mobility be used to solve Social Problems?Prasant Misra
 
A Short Course on the Internet of Things
A Short Course on the Internet of ThingsA Short Course on the Internet of Things
A Short Course on the Internet of ThingsPrasant Misra
 
Emerging Networking Technologies for Industrial Applications
Emerging Networking Technologies for Industrial ApplicationsEmerging Networking Technologies for Industrial Applications
Emerging Networking Technologies for Industrial ApplicationsPrasant Misra
 
Energy Efficient GPS Acquisition with Sparse-GPS
Energy Efficient GPS Acquisition with Sparse-GPSEnergy Efficient GPS Acquisition with Sparse-GPS
Energy Efficient GPS Acquisition with Sparse-GPSPrasant Misra
 

More from Prasant Misra (6)

Hybrid Planner for Smart Charging of Electric Fleets
Hybrid Planner for Smart Charging of Electric FleetsHybrid Planner for Smart Charging of Electric Fleets
Hybrid Planner for Smart Charging of Electric Fleets
 
Reinforcement Learning for EVRP with V2G
Reinforcement Learning for EVRP with V2GReinforcement Learning for EVRP with V2G
Reinforcement Learning for EVRP with V2G
 
How can Mobility be used to solve Social Problems?
How can Mobility be used to solve Social Problems?How can Mobility be used to solve Social Problems?
How can Mobility be used to solve Social Problems?
 
A Short Course on the Internet of Things
A Short Course on the Internet of ThingsA Short Course on the Internet of Things
A Short Course on the Internet of Things
 
Emerging Networking Technologies for Industrial Applications
Emerging Networking Technologies for Industrial ApplicationsEmerging Networking Technologies for Industrial Applications
Emerging Networking Technologies for Industrial Applications
 
Energy Efficient GPS Acquisition with Sparse-GPS
Energy Efficient GPS Acquisition with Sparse-GPSEnergy Efficient GPS Acquisition with Sparse-GPS
Energy Efficient GPS Acquisition with Sparse-GPS
 

Recently uploaded

What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?Antenna Manufacturer Coco
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 

Recently uploaded (20)

What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 

Challenges in Analytics for BIG Data

  • 1. Challenges in Analytics for BIGData Dr. Prasant Misra W: https://sites.google.com/site/prasantmisra Disclaimer: The opinions expressed in this presentation and on the following slides are solely those of the presenter and not necessarily those of the organization that he works for.
  • 2. A simple narrative to BIG DATA 8/26/2016 2 DATA whose characteristics exceeds the capabilities of conventional algorithms, systems and techniques to derive useful value is considered BIG datascience.berkeley.edu The term if very fuzzy and means different things to different groups of people ….
  • 3. How did we arrive at this stage ? 8/26/2016 3
  • 4. 1960 - 70 1980 - 90 2000 -10 and beyond Year Size 8/26/2016 4 History of Computing Accessibility to cyber end points have increased drastically …
  • 6. 8/26/2016 6 DATA Proliferation Web & Social Media Enter- prises Gov.
  • 7. Example of a Killer APP: Navigation https://www.microsoft.com/en-us/research/event/tutorial-mobile-location-sensing 8/26/2016 7
  • 8. 8/26/2016 8 Local Search - I Context Service Example Current Location Local business
  • 9. 8/26/2016 9 Local Search - II Context Service Example Current Location Local business and directions + Time Tracks Businesses in driving direction
  • 10. 8/26/2016 10 Local Search - III Context Service Example Current Location Local business and directions + Time Tracks Businesses in driving direction + History Personalized directions  Take 520 East
  • 11. 8/26/2016 11 Local Search - IV Context Service Example Current Location Local business and directions + Time Tracks Businesses in driving direction + History Personalized directions + Community Tourist recommendation 35% people pick the scenic route
  • 12. 8/26/2016 12 Local Search - V Alert: Bad Traffic Consider Alternate route Context Service Example Current Location Local business and directions Tracks Businesses in driving direction + History Personalized directions + Community Tourist recommendation + Push alerts, triggers, reminders BIG Data for Location Analytics …
  • 13. 8/26/2016 13 Analytics: Span across Verticals & Horizontals Depending on the type and quality of analytics, system could manifest themselves into:  User-centric Systems — Systems That Know/Aware  Adaptive Systems — Systems That Learn  Cognitive Systems — Systems That Reason E N E R G Y W A T E R R E T A I L T E L C O M H E A L T H Time, Location Management Sensor, Device Management Network Management Cloud Infra Management Customer Management
  • 14. The flavor of Data that is BIG 8/26/2016 14
  • 15. The 4 Dimensions of BIG Data 8/26/2016 15
  • 17. Value 8/26/2016 17 Hindsight and Insight/ Insights into the PAST Foresight/ Insights into the FUTURE Skill Descriptive “WHAT has happened ? ” Diagnostic “WHY did this happen ?” Prescriptive “WHAT should we do ?” Predictive “WHAT could happen ? ” Information Optimization Analytics : Category DASHBOARD FORECAST ACTIONS, RULES, RECOMMs
  • 18. Example: Energy Analytics for a PV Microgrid 8/26/2016 18 Descriptive: What is the total energy, instantaneous energy and power, etc., …? Diagnostic: Why is the panel temperature decreasing when the solar irradiance is high and the wind speed is very low ? Predictive: Can I forecast the plant output for tomorrow, or can I generate 4kWh net energy ? Predictive : What actions should be undertaken for the plant to reach 4kW energy generation capacity from its current 2 kW ?
  • 19. 8/26/2016 19 Analytics : Methodology  Reason and Plan with Uncertain Knowledge  Quantify uncertainty & Probabilistic reasoning: Bayesian networks, Conditional distributions  Probabilistic reasoning over time: Hidden Markov models, Kalman filters, Dynamic Bayesian networks  Simple decisions: Utility theory, Decision networks  Complex decisions: Partial observable Markov Decision Process (MDP), Game theoretic models  Planning graphs  Learning and Data Mining:  [Supervised | Semi-supervised | Unsupervised | Reinforcement] learning – Classification, Clustering  Different type of ANN | Deep Learning Networks | Support Vector Machines
  • 21. Data to Knowledge Pipeline 8/26/2016 21 Cyber & Physical Space Entities Edge Global Infra Data Ingestion Data Analysis Applications Data source “Big” data Infra “Little” data Infra Decision making with Knowledge DATA @ REST (VOLUME) Archival/Static data (TBs) in Data stores DATA @ MOTION (VELOCITY) Streaming data DATA @ MANY FORMS (VARIETY) Structured/Unstructured, Text, Multimedia, Audio, Video DATA @ DOUBT (VERACITY) Data with uncertainty that may be due to incompleteness, missing points, etc., NATURE of INGESTED DATA COGNITIVE Learn dynamically ? PRESCRIPTIVE What are the best outcomes ? PREDICTIVE What could happen ? DESCRIPTIVE What has happened ? DISCOVERY What do we have ? NATURE of ANALYSIS
  • 22. A first list of challenges derived from the V’s 8/26/2016 22  Volume:  How much data is really relevant to the problem solution & what is the cost of processing ?  Can you really afford to store and process all that data ?  Velocity  A lot of data is coming in at high speed  Need for streaming versus block approach to data analysis  How to analyze data in-flight and combine with data at-rest  Variety:  A small fraction is in structured formats (e.g., relational, XML, etc.)  A fair amount is semi-structured (e.g., web logs, etc.)  The rest of the data is unstructured (e.g., text, photographs, etc.)  No single data model can currently handle the diversity  Veracity:  Cover term for: Accuracy, Precision, Reliability, Integrity  What is it that you don’t know about the data ?
  • 23. Top Challenges 8/26/2016 23  Data acquisition  Is raw data of interest in totality ?  Challenge:  design efficient filters and compression techniques in a manner that does not discard useful information; automatically generate the right meta data to describe it  Data reduction  Will traditional data reduction approaches (via compression) become overwhelming ?  Challenge: introduction of new data collection practices and models as per analytical needs; compact (space, time) representations/dictionary/basis; parsimonious model (low- dimensionality, compressed sensing and sparse data capture models)  “Big-Little” Data  Device cloud vs. Conventional cloud; Distributed data and Peer-to-Peer Federation  Challenge: how to combine Big and Little data for meaningful analytics (often in real time)  Analytics from the Edge to the Cloud  Will the current model of pushing all data to a central cloud for analytics scale, be inefficient, and alleviate privacy concerns ?  Challenge: how to automate distributed analytics and decision making on subsets of “Little” and “Big” data; within the constraints of device capability, privacy needs, energy and network costs, and application QoS
  • 24. Top Challenges 8/26/2016 24  Handling inconsistent/incomplete/missing data and outliers  Is this critical ?  Challenge: design robust imputation algorithms  Heterogeneous Data Fusion  Is there a need to analyze the relationship between heterogeneous data objects/streams  Challenge: Extract right amount of semantics, sequential data fusion via transform spaces  Scalability with multi-level hierarchy  Will traditional methods of data navigational and search in deep hierarchy be scalable ?  Challenge: design newer alternatives  Data summarization for interactive Query  Will examination of datasets (all at once) become difficult ?  Data summarization let users request data with particular characteristics  Data summarization: organize data based on the presence/type of feature  Scientific data features: geometrical, topological, statistical  Non-scientific data features: related to semantic/syntactic components of the data  Challenge:  extraction of meaningful features, both from high and low dimension data  data storage and indexing in an I/O efficient format for rapid runtime retrieval
  • 25. Top Challenges 8/26/2016 25  Analytics of temporally/spatially evolving features  Do data features occur at different spatial and temporal scales ?  Challenge: effective visual techniques that are computationally practical and that can take advantage of humans unique cognitive ability to track those feature changes  Representation of evidence and uncertainty  Interpretation of evidence is subject to person performing this task, and depends on his prior knowledge, subjective settings and viewpoint  Uncertainty quantification models the consequence based on the presented evidence and then predicts the qualities of the corresponding outcome  Challenge: how to represent evidence and uncertainty clearly and without bias through visualization  Sense making to users/decision makers  Involves examining all the assumptions made and retracing the analysis  There can be many sources of error: computer systems can have bugs, models almost always have assumptions, and results can be based on erroneous data. For all of these reasons, users will try to understand, and verify, the results produced by the computer.  Challenge: what should the man-machine interface for this look like ?
  • 28. 8/26/2016 28 Scale and Size DOES matter !!!
  • 29. 8/26/2016 29 References  Stephen H. Kaisler et. Al ,“Big data and analytics: challenges and issues”  Pak Chung Wong, Han-Wei Shen, Chaomei Chen, “Top Ten Interaction Challenges in Extreme-Scale Visual Analytics” http://link.springer.com/chapter/10.1007/978-1-4471-2804-5_12#page-1  Other info graphics from the web !!!