Correlation Analysis on Live Data Streams

Arun Kejariwal, Statistical Learning Principal at Machine Zone, Inc.
1
CORRELATION ANALYSIS
ON LIVE DATA STREAMS
ARUN KEJARIWAL
2
STREAMING
Serve data at rest
LIVE
Serve data as it is being generated
STREAMING vs. LIVE
MOBILE DATA TRAFFIC
17% of total IP traffic by 2021
VR/AR Traffic
CAGR of 82% from 2016-2021
ANNUAL GLOBAL IP TRAFFIC
3.3 ZB by 2021
BROADBAND SPEEDS
Reach 53 Mbps by 2021
INTERNET VIDEO SURVEILLANCE
3.4% of all Internet video traffic
by 2021
LIVE INTERNET VIDEO
13% of Internet video traffic
by 2021
THE NUMBERS
3
https://www.cisco.com/c/en/us/solutions/collateral/service-provider/visual-networking-index-vni/complete-white-paper-c11-481360.html
4
DATA FUSION
CHARACTERISTICS
DISTRIBUTED
HETEROGENEOUS
NON-LINEAR
INTERNET OF THINGS
Sensors and Actuators
SMART CITIES
SMART HEALTH CARE
POLLUTION MONITORING
INSIGHTS
PROBABILISTIC METHODS
THEORY OF EVIDENCE
MACHINE LEARNING
FUZZY LOGIC
CHALLENGES
DATA: INCONSISTENCY
MISSINGNESS, NOISE
LATENCY
BATTERY CONSUMPTION
INTELLIGENT DECISION MAKING
5
TRAFFIC SURVEILLANCE
CONGESTION DETECTION
ACCIDENT DETECTION
HEALTH EMERGENCY
RISK ALERT
RAIN WATER CLOGGING, HYDROPLANING
MUDSLIDE
AN EXAMPLE APPLICATION OF DATA FUSION
CORRELATION ANALYSIS
6
CORRELATION ANALYSIS
A LONG HISTORY!
7
EARLY FLAVOR
DATES BACK to 1885!
8
APPLICATION DOMAINS
9
BIOLOGY ANTHROPOLOGY AGRICULTURE PHILOSOPHY PSYCHOLOGY
FINANCE ECONOMETRICS STATISTICS NETWORKING OPERATIONS
Common representation learning
Example: Across audio and video modalities
in a single live stream
Multimodal
Linear/non-linear correlation
Example: Across univariate time series of
operations data
Unimodal
CORRELATION ANALYSIS
FLAVORS: AT A HIGH LEVEL
10
CROSS-MODAL ANALYSIS
Joint Representation of Text-Audio-Video
INTERPRETABILITY
Comparative analysis of deep
representations
AUTHENTICITY
Live streams vs. Recording
OBJECT IDENTIFICATION
Multiple cameras
MULTI-STREAM
SYNCHRONIZATION
Different vantage points
Unreliable timestamps
LOCALIZATION
Video ⟷ Audio
Audio ⟷ Text
CORRELATION ANALYSIS
WHY BOTHER?
11
Pearson
Spearman 𝜌
Kendall 𝜏
Goodman and Kruskal
COMMON TYPES OF CORRELATION
12
Ties are dropped
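The four coefficients listed above can be sketched in a few lines of plain Python. This is an illustrative implementation, not code from the talk: the helper names (`pearson`, `ranks`, `concordance`, and so on) are hypothetical, and Kendall's coefficient is shown in its tau-a form. The example data contain one tie in y to show what "ties are dropped" means for Goodman and Kruskal's gamma.

```python
from math import sqrt

def pearson(x, y):
    """Pearson product-moment correlation of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)

def ranks(v):
    """1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(v)), key=lambda i: v[i])
    r = [0.0] * len(v)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and v[order[j + 1]] == v[order[i]]:
            j += 1
        for k in range(i, j + 1):
            r[order[k]] = (i + j) / 2 + 1
        i = j + 1
    return r

def spearman(x, y):
    """Spearman's rho: Pearson correlation of the ranks."""
    return pearson(ranks(x), ranks(y))

def concordance(x, y):
    """Count concordant and discordant pairs; tied pairs count as neither."""
    c = d = 0
    for i in range(len(x)):
        for j in range(i + 1, len(x)):
            s = (x[i] - x[j]) * (y[i] - y[j])
            if s > 0:
                c += 1
            elif s < 0:
                d += 1
    return c, d

def kendall_tau_a(x, y):
    """Kendall's tau-a: (C - D) divided by the total number of pairs."""
    c, d = concordance(x, y)
    n = len(x)
    return (c - d) / (n * (n - 1) / 2)

def goodman_kruskal_gamma(x, y):
    """Gamma: (C - D) / (C + D), so tied pairs are dropped from the denominator."""
    c, d = concordance(x, y)
    return (c - d) / (c + d)

x = [1, 2, 3, 4, 5]
y = [1, 2, 2, 4, 5]          # one tie in y
tau = kendall_tau_a(x, y)    # the tied pair lowers tau-a below 1
gamma = goodman_kruskal_gamma(x, y)  # the tied pair is ignored
rho_s = spearman(x, y)
```

Because gamma ignores the tied pair, it reports a perfect association here while tau-a does not.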
CORRELATION ANALYSIS
A REAL LIFE EXAMPLE
13
Root cause analysis
Expose investment avenues
Surface optimization opportunities
Risk minimization
Medical diagnosis
Learning
MULTIPLE TIME SERIES
14
15
Symmetric
Lack of context
Spurious correlations
Non-actionable
Network topology
CORRELATION MATRIX
Scalability
Hundreds of millions of time series
Use cases
Multiple Regression
Discriminant Analysis
Mahalanobis Distance
CORRELATION MATRIX
16
* Figure borrowed from [Mueen et al. 2010]
Thresholded (= 0.5) Correlation Matrix
350x350 Correlation Matrix
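A thresholded correlation matrix of the kind shown in the figure can be sketched as follows. The three toy series and the helper names are made up for illustration; only the 0.5 cutoff comes from the slide. The pairwise computation is quadratic in the number of series, which is where the scalability concern for hundreds of millions of time series comes from.

```python
from math import sqrt

def pearson(x, y):
    """Pearson correlation of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)

def correlation_matrix(series):
    """Full (symmetric) matrix of pairwise Pearson correlations."""
    n = len(series)
    return [[pearson(series[i], series[j]) for j in range(n)] for i in range(n)]

def threshold(matrix, cutoff=0.5):
    """Keep only the strong links: 1 where |r| >= cutoff, else 0."""
    return [[1 if abs(v) >= cutoff else 0 for v in row] for row in matrix]

# Hypothetical toy data: a and b move together, c oscillates independently.
a = [1, 2, 3, 4, 5, 6]
b = [2, 4, 5, 8, 10, 13]
c = [1, -1, 1, -1, 1, -1]

corr = correlation_matrix([a, b, c])
links = threshold(corr, cutoff=0.5)
```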
MULTIPLE CORRELATION
1957
MULTIVARIATE CASE
17
Proportion of variance of xj that cannot be
explained by other independent variables
Depends on the choice of
the dependent variable
Correlation
Matrix
2-D case
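The formulas on this slide did not survive extraction. As a reminder of the standard textbook definitions (not necessarily the exact notation the slide used), the squared multiple correlation and the 2-D case can be written as below; the symbols c, R, and r_{12} are naming assumptions.

```latex
% R: correlation matrix of the predictors x_1, ..., x_p
% c: vector of correlations between each predictor and the dependent variable y
R^2_{y \cdot x_1 \ldots x_p} = c^{\top} R^{-1} c

% 2-D case (two predictors), with r_{12} = \mathrm{corr}(x_1, x_2):
R^2_{y \cdot x_1 x_2} =
  \frac{r_{y1}^2 + r_{y2}^2 - 2\, r_{y1}\, r_{y2}\, r_{12}}{1 - r_{12}^2}

% Proportion of variance of x_j not explained by the other predictors,
% where R_j^2 is the squared multiple correlation of x_j on the rest:
1 - R_j^2
```

Swapping which variable plays the role of y changes c and R, which is why the result depends on the choice of the dependent variable.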
18
Time
Varying
ROLLING CORRELATION
CONSTANTLY EVOLVING DATA
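A rolling correlation recomputes Pearson's r over a fixed-size window that slides along the two series. The sketch below is a naive version that recomputes each window from scratch; the function names and the toy data are illustrative assumptions.

```python
from math import sqrt

def pearson(x, y):
    """Pearson correlation of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)

def rolling_correlation(x, y, window):
    """One correlation value per full window of `window` consecutive points."""
    return [
        pearson(x[end - window:end], y[end - window:end])
        for end in range(window, len(x) + 1)
    ]

# y tracks x for the first half, then moves against it.
x = list(range(10))
y = [0, 1, 2, 3, 4, 4, 3, 2, 1, 0]
rolling = rolling_correlation(x, y, window=4)
```

The first window reports a correlation of about +1 and the last about -1; the windows in between show the estimate trailing the actual change, since each value summarizes the previous `window` points.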
19
STOCHASTICITY
20
Why bother? Multiple flavors
If X, Y are independent, then 𝜌(X, Y) = 0
However, the converse is not true, as only
the first two moments are considered
e.g., 𝜌(X, Y) = 0 even when Y = X^2 (for X symmetric about zero)
Not invariant under nonlinear strictly
increasing transformations
Pearson’s correlation only recognizes linear
dependence
var(X) and var(Y) have to be finite
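Two of the properties above are easy to check numerically. The sketch below uses made-up data: a symmetric X with Y = X^2 (fully dependent, yet uncorrelated), and a strictly increasing transformation exp(x) that lowers Pearson's r while leaving the rank-based Spearman coefficient at 1. The simple `spearman` helper assumes no ties.

```python
from math import exp, sqrt

def pearson(x, y):
    """Pearson correlation of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)

def spearman(x, y):
    """Spearman's rho for data without ties: Pearson correlation of ranks."""
    def rank(v):
        order = sorted(range(len(v)), key=lambda i: v[i])
        r = [0] * len(v)
        for position, index in enumerate(order, start=1):
            r[index] = position
        return r
    return pearson(rank(x), rank(y))

# Y is a deterministic function of X, yet the linear correlation vanishes.
x_sym = [-2, -1, 0, 1, 2]
r_square = pearson(x_sym, [v * v for v in x_sym])

# A strictly increasing, nonlinear transformation changes Pearson's r
# but not the rank correlation.
x_inc = [1, 2, 3, 4, 5]
y_inc = [exp(v) for v in x_inc]
r_exp = pearson(x_inc, y_inc)
rho_exp = spearman(x_inc, y_inc)
```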
Non-stationary random processes
Stochastic correlation process
Applications
Finance (Brownian motion), biology
Time-varying correlation
Rolling correlation - lagged indicator
Dynamic Conditional Correlation [Engle’02]
Local Correlation [Langnau’09]
Wishart autoregressive process [Gourieroux’09]
Transformations [van Emmerich’06]
arctan and
Modified Jacobi process [Ma’09]
tanh transformation [Teng’16]
SPURIOUS CORRELATIONS
SPURIOUS CORRELATION
* Figure borrowed from [Anscombe 1973]
WHY CONTEXT IS IMPORTANT?
22
Identical r=0.816
Clearly spurious
Linear correlation is perhaps not the right metric
Identical summary statistics
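The r = 0.816 figure can be reproduced directly from Anscombe's four data sets. The values below are the published quartet [Anscombe 1973]; the `pearson` helper and variable names are illustrative.

```python
from math import sqrt

def pearson(x, y):
    """Pearson correlation of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)

# Data sets I-III share the same x values; set IV has its own.
x123 = [10, 8, 13, 9, 11, 14, 6, 4, 12, 7, 5]
x4 = [8, 8, 8, 8, 8, 8, 8, 19, 8, 8, 8]

quartet = {
    "I": (x123, [8.04, 6.95, 7.58, 8.81, 8.33, 9.96, 7.24, 4.26, 10.84, 4.82, 5.68]),
    "II": (x123, [9.14, 8.14, 8.74, 8.77, 9.26, 8.10, 6.13, 3.10, 9.13, 7.26, 4.74]),
    "III": (x123, [7.46, 6.77, 12.74, 7.11, 7.81, 8.84, 6.08, 5.39, 8.15, 6.42, 5.73]),
    "IV": (x4, [6.58, 5.76, 7.71, 8.84, 8.47, 7.04, 5.25, 12.50, 5.56, 7.91, 6.89]),
}

r_values = {name: pearson(x, y) for name, (x, y) in quartet.items()}
for name, r in r_values.items():
    print(name, round(r, 3))
```

All four sets give essentially the same coefficient even though only the first looks like a noisy linear relationship when plotted.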
ROBUSTNESS
r =0.8 r =0.2
r is highly sensitive to slight change, as measured
by Kolmogorov distance, in one of the marginal
distributions
Bivariate Normal
Distribution
Contaminated Normal
Distribution
PEARSON CORRELATION
23
* Figure borrowed from “Introduction to Robust Estimation and Hypothesis Testing” by Rand R. Wilcox
Influence Function (IF) of Pearson’s Correlation
Unbounded
Pearson’s correlation does not have infinitesimal
robustness
Recall, first order approximation of sample IF of r is:
r(-i): correlation coefficient based on all but the i-th
observation
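The formula itself is missing from the extracted slide. One common way to use the leave-one-out coefficient is the sample influence (n - 1)(r - r(-i)); whether this is the exact expression the slide showed is an assumption. The sketch below computes it on made-up data with a single gross outlier.

```python
from math import sqrt

def pearson(x, y):
    """Pearson correlation of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)

def sample_influence(x, y):
    """(n - 1) * (r - r_without_i) for every observation i (assumed form)."""
    n = len(x)
    r_all = pearson(x, y)
    result = []
    for i in range(n):
        r_minus_i = pearson(x[:i] + x[i + 1:], y[:i] + y[i + 1:])
        result.append((n - 1) * (r_all - r_minus_i))
    return result

# Nearly linear data, except for the last point.
x = [1, 2, 3, 4, 5, 6, 7, 8, 9, 10]
y = [1.1, 1.9, 3.2, 3.8, 5.1, 5.9, 7.2, 7.8, 9.1, 0.0]

influence = sample_influence(x, y)
most_influential = max(range(len(x)), key=lambda i: abs(influence[i]))
```

A single contaminated observation dominates the estimate, which is the practical meaning of an unbounded influence function.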
24
ROBUSTNESS
* Figure borrowed from “Introduction to Robust Estimation and Hypothesis Testing” by Rand R. Wilcox
Sensitivity to anomalies
Linear relationship between x & y but r = -0.21
PEARSON CORRELATION
Quadrant (signum) correlation coefficient [Blomqvist, 1950]
Correlation median estimator [Pasman and Shevlyakov, 1987]
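The quadrant (signum) coefficient only looks at which side of the medians each point falls on, so a single wild value can change it by a bounded amount at most. A minimal sketch, with illustrative helper names and made-up data containing one outlier:

```python
from statistics import median

def sign(v):
    """Return -1, 0, or 1 according to the sign of v."""
    return (v > 0) - (v < 0)

def quadrant_correlation(x, y):
    """Mean of sign(x - median(x)) * sign(y - median(y))."""
    mx, my = median(x), median(y)
    return sum(sign(a - mx) * sign(b - my) for a, b in zip(x, y)) / len(x)

# Nearly linear data with one gross outlier in the last position.
x = [1, 2, 3, 4, 5, 6, 7, 8, 9, 10]
y = [1.1, 1.9, 3.2, 3.8, 5.1, 5.9, 7.2, 7.8, 9.1, 0.0]

q = quadrant_correlation(x, y)
```

On these data Pearson's r drops to roughly 0.45, while the quadrant coefficient stays at 0.6; how much the outlier is moved further away makes no difference to the latter.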
ROBUSTNESS
PEARSON CORRELATION
25
Based on Robust Principal Variables
Alternatives
ROBUSTNESS
PEARSON CORRELATION
26
Percentage Bend Correlation
27
SPEED-ACCURACY TRADE-OFF
Live Data
STREAMING CORRELATION
Incremental, One pass
CHARACTERISTICS
29
Other correlation measures are not amenable to incremental
computation
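Pearson's coefficient can be maintained in one pass because it depends only on running means and co-moments. The sketch below uses a Welford-style update; the class name `StreamingPearson` and the batch helper are illustrative assumptions, not an API from the talk.

```python
from math import sqrt

class StreamingPearson:
    """One-pass Pearson correlation using Welford-style running co-moments."""

    def __init__(self):
        self.n = 0
        self.mean_x = 0.0
        self.mean_y = 0.0
        self.m2_x = 0.0   # running sum of squared deviations of x
        self.m2_y = 0.0   # running sum of squared deviations of y
        self.c_xy = 0.0   # running sum of cross deviations

    def update(self, x, y):
        """Fold one (x, y) observation into the running statistics."""
        self.n += 1
        dx = x - self.mean_x          # deviation from the old mean
        dy = y - self.mean_y
        self.mean_x += dx / self.n
        self.mean_y += dy / self.n
        self.m2_x += dx * (x - self.mean_x)
        self.m2_y += dy * (y - self.mean_y)
        self.c_xy += dx * (y - self.mean_y)

    def correlation(self):
        """Current estimate; NaN until both variances are non-zero."""
        if self.n < 2 or self.m2_x == 0.0 or self.m2_y == 0.0:
            return float("nan")
        return self.c_xy / sqrt(self.m2_x * self.m2_y)

def batch_pearson(x, y):
    """Two-pass reference implementation for comparison."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)

xs = [1, 2, 3, 4, 5, 6, 7, 8, 9, 10]
ys = [1.1, 1.9, 3.2, 3.8, 5.1, 5.9, 7.2, 7.8, 9.1, 0.0]

stream = StreamingPearson()
for a, b in zip(xs, ys):
    stream.update(a, b)
```

Each update costs constant time and memory, so the estimate is available after every arriving observation without revisiting earlier data.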
Applications
Security
Correlation power analysis
30
APPROXIMATION ALGORITHMS
STREAMING CORRELATION
Wide Spectrum of Approaches
Sliding Windows, Damped Windows
Reduction
Smoothing
Down sampling
DFT [Agrawal et al. 1993, Zhu and Shasha 2002,
Qiu et al. 2018]
DWT [Chan and Fu 1999, Popivanov & Miller 2002]
PCA, SVD
PAA [Faloutsos and Yi 2000]
APCA [Chakrabarti et al. 2001]
Random projections [Grellmann et al. 2016]
LSH [Sundaram et al. 2013]
DATA SKETCHES
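Of the reduction techniques listed above, PAA (Piecewise Aggregate Approximation) is the simplest to illustrate: each series is replaced by the means of equal-width segments, and the correlation is then computed on the much shorter sketches. The code below is a minimal sketch with made-up data; the function names are hypothetical and the series length is assumed to be a multiple of the segment count.

```python
from math import sqrt

def pearson(x, y):
    """Pearson correlation of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)

def paa(series, segments):
    """Replace the series by the mean of each of `segments` equal-width chunks."""
    if len(series) % segments != 0:
        raise ValueError("series length must be a multiple of segments")
    width = len(series) // segments
    return [
        sum(series[i * width:(i + 1) * width]) / width
        for i in range(segments)
    ]

x = list(range(12))
y = [v * v for v in x]

x_sketch = paa(x, 4)            # 12 points reduced to 4
y_sketch = paa(y, 4)
exact = pearson(x, y)
approximate = pearson(x_sketch, y_sketch)
```

The approximate value is computed on a third of the data and stays close to the exact one for these smooth series; for noisy or bursty data the gap can be larger, which is the speed-accuracy trade-off in practice.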
31
Bursts
Page views, Clicks, Retweets
Correlated burstiness (e.g., data center operations)
Root-cause analysis
STREAMING CORRELATION
* Figure borrowed from [Sakurai et al. 2005]
Lag between time series
Lagged correlation/Cross-correlation
ACTIONABLE INSIGHTS
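Lagged (cross-) correlation can be sketched as a search over candidate shifts: correlate x[t] with y[t + lag] and keep the lag with the highest coefficient. The function names and the burst data below are illustrative assumptions; both series are assumed to have equal length and regular sampling.

```python
from math import sqrt

def pearson(x, y):
    """Pearson correlation of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)

def lagged_correlation(x, y, lag):
    """Correlation between x[t] and y[t + lag]; negative lags mean y leads x."""
    if lag < 0:
        return lagged_correlation(y, x, -lag)
    n = len(x) - lag
    return pearson(x[:n], y[lag:lag + n])

def best_lag(x, y, max_lag):
    """Lag in [-max_lag, max_lag] with the highest correlation."""
    return max(
        range(-max_lag, max_lag + 1),
        key=lambda k: lagged_correlation(x, y, k),
    )

# Two bursts in x that show up in y two steps later.
x = [0, 0, 1, 3, 1, 0, 0, 0, 2, 5, 2, 0, 0, 0, 0, 0]
y = [0, 0] + x[:-2]

lag = best_lag(x, y, max_lag=3)
```

A positive result means y follows x, which is the kind of ordering information root-cause analysis needs.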
32
[Zhu and Shasha 2002, Levine et al. 2016, Wu et al. 2017]
[Vlachos et al. 2008, Kotov et al. 2011, Shafer et al. 2012, Kusmierczyk and Norvag 2015]
33
Motion
Inertia
Motion
Occlusion
Velocity
Estimation
Motion-Outlier
Detection
Blur
Removal
VIDEO CORRELATION
TEMPORAL COHERENCE
PREDICTION MOTION
1952
“… prediction motion is the continuation of tracking motion
after a target disappears from view.”
1955
1962
Unlike duration of target presentation, target speed
exerts an influence on prediction accuracy.
EXPLORED >5 DECADES BACK!
34
PREDICTION MOTION
35
Human Motion/Robotics
[Bütepage et al. 2017, Martinez et al. 2017, Byravan & Fox]
Traffic Prediction
[Hermes et al. 2010, Walker et al. 2014, Yu et al. 2017]
360-Degree Video (AR/VR)
[Bao et al. 2017, Vishwanath et al., 2017]
Potpourri: Deep Learning Based Approaches
[Oh et al. 2015, Mathieu et al. 2016, Liang et al. 2017]
AUDIO-VIDEO CORRELATION
A LONG HISTORY: EXPERIMENTAL PSYCHOLOGY -> DEEP LEARNING
36
AUDIO-VIDEO CORRELATION
1952
People utilize visual and postural experiences in perceiving the position
of an object in the field, of the whole field.
1897
Localization of sounds varied, being different when the source of sound was in sight from what it
was when this was out of sight, and also in the latter case differing with different directions of
attention, or with different suggestions as to the direction from which the sound came.
1941
SOUND LOCALIZATION: AN APPLICATION
37
AUDIO-VIDEO CORRELATION
1977
1960
1976
LIP READING: AN APPLICATION
38
Demonstrated the influence of vision on speech perception
McGurk and MacDonald
Established the relationship of the visually perceived
symbols to the underlying linguistic system
Use visual information as an aid when white noise made speech difficult to hear
1954
There’s a great opportunity for the visual contribution
at low speech-to-noise ratios
AUDIO-VIDEO CORRELATION
2007
2004
Combined acoustic and visual feature vectors to distinguish live synchronous audio-video
recordings from replay attacks that use audio with a still photo.
LIVENESS AND SYNCHRONIZATION: AN APPLICATION
39
Extracted the correlated components of audio and lip features
based on Canonical Correlation Analysis.
2009
Showed that there exists a relationship between perception of video presented
in screen and accompanying audio signals, both stereo and spatial
40
AUDIO-VIDEO-TEXT
Speaker identification in multi-speaker scenarios
[Huang & Kingsbury 2013, Chung & Zisserman 2016, Torfi et al. 2017]
Cross-Modal Correlation Learning in Audio and Lyrics
[Yu et al. 2017, Tang et al. 2017]
Speech enhancement
[Xu et al. 2014, Hou et al. 2017, Kolbæk et al. 2017]
Action Recognition and Video Highlight Detection
[Wu et al. 2013, Sun et al. 2013, Takahashi et al. 2017]
DEEP LEARNING WAY
Emotion Recognition
[Tzirakis et al. 2017, Pini et al. 2017]
41
OPEN PROBLEMS
42
Online anomaly detection: speed-accuracy trade-off
On the Runtime-Efficacy Trade-off of Anomaly Detection Techniques for Real-Time Streaming Data*
by Choudhary et al. 2017
Breakouts/Changepoints
Skew in location of anomalies
SUSCEPTIBILITY TO ANOMALIES
* https://arxiv.org/pdf/1710.04735.pdf
43
HETEROSCEDASTICITY
Methods
(Adjusted) Percentile Bootstrap [Wilcox 1996]
Nested Bootstrap [DiCiccio et al. 1992]
Challenge: Detecting non-linear correlation
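The sketch below shows a plain percentile bootstrap confidence interval for Pearson's r. It is not the adjusted variant of [Wilcox 1996], which modifies the quantile positions; the function name, the number of resamples, and the synthetic data are all assumptions made for illustration.

```python
import random
from math import sqrt

def pearson(x, y):
    """Pearson correlation of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)

def percentile_bootstrap_ci(x, y, n_boot=1000, alpha=0.05, seed=0):
    """Resample (x, y) pairs with replacement and take empirical quantiles of r."""
    rng = random.Random(seed)
    n = len(x)
    stats = []
    for _ in range(n_boot):
        idx = [rng.randrange(n) for _ in range(n)]
        xs = [x[i] for i in idx]
        ys = [y[i] for i in idx]
        # Skip degenerate resamples where r is undefined.
        if len(set(xs)) < 2 or len(set(ys)) < 2:
            continue
        stats.append(pearson(xs, ys))
    stats.sort()
    m = len(stats)
    lower = stats[int((alpha / 2) * m)]
    upper = stats[min(m - 1, int((1 - alpha / 2) * m))]
    return lower, upper

# Synthetic data: a linear trend plus Gaussian noise.
data_rng = random.Random(1)
xs = [float(v) for v in range(30)]
ys = [2.0 * v + data_rng.gauss(0.0, 5.0) for v in xs]

lo, hi = percentile_bootstrap_ci(xs, ys)
```

Resampling whole (x, y) pairs keeps any dependence of the noise level on x intact, which is why bootstrap intervals are a common choice under heteroscedasticity.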
2001
44
Potential sources: Event based monitoring via sensors, Occlusion in a video
Techniques: Resampling
Lomb-Scargle Fourier Transform [Andersson 2007, Rehfeld et al. 2011]
Kernel - such as Laplacian, Gaussian - based methods
IRREGULARLY SPACED TIME SERIES
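A kernel-based estimator avoids resampling by weighting every pair of observations according to how close their timestamps are. The sketch below uses a Gaussian kernel in the spirit of the methods cited above; the function names, the bandwidth, and the synthetic sine data are assumptions, and the estimate is not guaranteed to lie exactly within [-1, 1].

```python
import math
import random

def standardize(values):
    """Subtract the mean and divide by the (population) standard deviation."""
    m = sum(values) / len(values)
    s = math.sqrt(sum((v - m) ** 2 for v in values) / len(values))
    return [(v - m) / s for v in values]

def gaussian_kernel_correlation(tx, x, ty, y, lag=0.0, bandwidth=0.25):
    """Weighted mean of products x_i * y_j, weighted by closeness of t_j - t_i to lag."""
    zx, zy = standardize(x), standardize(y)
    numerator = denominator = 0.0
    for ti, xi in zip(tx, zx):
        for tj, yj in zip(ty, zy):
            w = math.exp(-0.5 * ((tj - ti - lag) / bandwidth) ** 2)
            numerator += w * xi * yj
            denominator += w
    return numerator / denominator

def irregular_times(n, rng):
    """Timestamps with random gaps between 0.5 and 1.5 time units."""
    t, out = 0.0, []
    for _ in range(n):
        t += rng.uniform(0.5, 1.5)
        out.append(t)
    return out

def signal(t):
    return math.sin(2 * math.pi * t / 20.0)

rng = random.Random(0)
tx = irregular_times(200, rng)
ty = irregular_times(200, rng)       # different timestamps than tx
x = [signal(t) for t in tx]
y = [signal(t) for t in ty]

same_phase = gaussian_kernel_correlation(tx, x, ty, y)
opposite = gaussian_kernel_correlation(tx, x, ty, [-v for v in y])
```

The double loop is quadratic in the number of observations, so a streaming version would need to restrict the pairs to a time window around each new sample.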
TIME VARYING CORRELATION
45
* Figure borrowed from [Fu et al. 2013]
TVCC: Time Varying Correlation Coefficient
fMRI time-courses of 4 Regions of Interests (ROI)
✦ Time varying joint distribution
✦ Parameter non-constancy/Instability
F Test [Chow’60]
SupF Test [Quandt’60]
Lagrange Multiplier Test
[Nabeya and Tanaka’88, Nyblom’89]
✦ Co-integrated processes
I(1) [Hansen’90]
✦ Stochastic vs. Deterministic
✦ Dynamic Conditional Correlation (DCC)
[Engle’02]
Post stimulus period with significant difference
(p<0.05) w.r.t. pre-stimulus interval
STREAMING CORRELATION
Missing data
Packets being dropped owing to, say, unexpected
high traffic
Data collection: Every, say, 5 seconds
How to scale analysis to millisecond granularity?
Unequal length time series
Different sampling rates
Small samples
Bootstrapping
Low SNR (Signal to Noise Ratio)
46
NEXT FRONTIER
LEVERAGING MULTIMODAL & CONTINUOUSLY EVOLVING CORRELATION FOR SELF-LEARNING
47
SELF-LEARNING MACHINES
A LONG HISTORY!
49
SELF-LEARNING
EARLY WORK: GAMES
1950, 1914
constructed a device which played an end game of king
and rook against king. The machine played the side with king and rook and would
force checkmate in a few moves however its human opponent played.
Gerald Tesauro
1995, 1959, 2002
2002
2017
Amongst the first and most famous was the
chess-playing automaton constructed in 1769
by Baron Kempelen …
1953
Alan Turing
2012
1970
GAME PLAYING
POTPOURRI
50
1996, 2007
The game of checkers has roughly 500 billion billion
possible positions (5 × 10^20)
SELF-LEARNING
2016
REINFORCEMENT LEARNING
51
2016, 2017, 2017, 2018
AlphaGo
Mastering the game of Go with deep neural networks and tree search
by Silver et al.
AlphaGo Zero
Mastering the game of Go without human knowledge
by Silver et al.
AlphaZero
Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm
by Silver et al.
Libratus
Superhuman AI for heads-up no-limit poker: Libratus beats top professionals
by Brown and Sandholm
54
Rank Correlation Methods
by Kendall and Gibbons
READINGS
Correlation and Regression
by Bobko
Correlation
by Garson
Robust Correlation: Theory and Applications
by Shevlyakov and Oja
A Mathematical Theory of Evidence
by Shafer
BOOKS
55
Fast Approximate Correlation of Massive Time-series Data
by Mueen et al., 2010
READINGS
Local Correlation Detection with Linearity Enhancement in
Streaming Data
by Xie et al., 2013
Fast Distributed Correlation Discovery Over Streaming
Time-Series Data
by Guo et al., 2015
Random Projection for Fast and Efficient Multivariate
Correlation Analysis of High-Dimensional Data: A New Approach
by Grellmann et al., 2016
Detection of Highly Correlated Live Data Streams
by Alseghayer et al., 2017
STREAMING CORRELATION
56
Perception of body position and of the position of the visual
field
by Witkin, 1949
READINGS
Autocorrelation, a principle for the evaluation of sensory
information by the nervous system
by Reichardt, 1961
EARLY RESEARCH IN AUDIO-VISUAL CORRELATION
Binocular cross-correlation in time and space
by Tyler and Julesz, 1978
Neurontropy, an entropy-like measure of neural correlation
by Julesz and Tyler, 1976
Cross correlation of sensory stimuli and electroencephalogram
by Morgan, 1969
57
Patch to the future: Unsupervised visual prediction
by Walker et al., 2014
READINGS
Motion-Prediction-based Multicast for 360-Degree Video
Transmissions
by Bao et al., 2017
Dual Motion GAN for Future-Flow Embedded Video
Prediction
by Liang et al., 2017
Deep representation learning for human motion prediction
and classification
by Bütepage et al., 2017
On human motion prediction using recurrent neural
networks
by Martinez et al., 2017
RECENT WORKS IN PREDICTION MOTION
58
On Deep Multi-View Representation Learning
by Wang et al., 2015
READINGS
Correlational Neural Networks
by Chandar et al., 2017
Common Representation Learning Using Step-based
Correlation Multi-Modal CNN
by Bhatt et al., 2017
Objects that Sound
by Arandjelović and Zisserman, 2017
Deep Correlation Feature Learning for Face Verification in
the Wild
by Deng et al., 2017
COMMON REPRESENTATION LEARNING
59
READINGS
Audiovisual Synchronization and Fusion Using Canonical
Correlation Analysis
by Sargin et al., 2007
The predictive power of trajectory motion
by Watamaniuk, 2005
Seeing motion behind occluders
by Watamaniuk and McKee, 1995
Temporal Coherence Theory For The Detection And
Measurement Of Visual Motion
by Grzywacz et al., 1995
Probabilistic Motion Estimation Based On Temporal Coherence
by Burgi et al., 2000
AUDIO-VISUAL RESEARCH
60
A New Method of Audio-Visual Correlation Analysis
by Kunka and Kostek, 2009
READINGS
Uncertainty in ontologies: Dempster–Shafer theory for data
fusion applications
by Bellenger and Gatepaille, 2011
City Data Fusion: Sensor Data Fusion in the Internet of Things
by Wang et al., 2015
Data Fusion and IoT for Smart Ubiquitous Environments: A
Survey
by Alam et al., 2017
Correlation Analysis of Audio and Video Contents: A Metadata
based Approach
by Algur et al., 2015
POTPOURRI
61
READINGS
POTPOURRI
Correlation detection as a general mechanism for
multisensory integration
by Parise and Ernst, 2016
Origin of information-limiting noise correlations
by Kanitscheider et al., 2015
Temporal structure and complexity affect audio-visual
correspondence detection
by Denison et al., 2013
Correlation versus causation in multisensory perception
by Mitterer and Jesse, 2010
62
READINGS
MATHEMATICS
A Bayesian approach to problems in stochastic estimation
and control
by Ho and Lee, 1964
A Generalization of Bayesian Inference
by Dempster, 1968
Multidimensional Scaling
by Kruskal and Wish, 1978
63
RESOURCES
http://www.tylervigen.com/spurious-correlations
http://www.sumsar.net/blog/2013/08/robust-bayesian-estimation-of-correlation/
http://www.sumsar.net/blog/2013/08/bayesian-estimation-of-correlation/
JAGS: http://mcmc-jags.sourceforge.net/
BUGS: http://www.openbugs.net
https://blog.quantopian.com/bayesian-correlation-estimation/
https://jimgrange.wordpress.com/2017/11/19/bayesian-estimation-of-partial-correlations/
64
https://www.mouser.com/applications/sensor-fusion-iot/
RESOURCES
https://cacm.acm.org/magazines/2017/11/222180-heads-up-limit-holdem-poker-is-solved
"Node.js Development in 2024: trends and tools", Nikita Galkin "Node.js Development in 2024: trends and tools", Nikita Galkin
"Node.js Development in 2024: trends and tools", Nikita Galkin
Fwdays33 views
Import Export Virtual Machine for KVM Hypervisor - Ayush Pandey - University ... by ShapeBlue
Import Export Virtual Machine for KVM Hypervisor - Ayush Pandey - University ...Import Export Virtual Machine for KVM Hypervisor - Ayush Pandey - University ...
Import Export Virtual Machine for KVM Hypervisor - Ayush Pandey - University ...
ShapeBlue120 views
Hypervisor Agnostic DRS in CloudStack - Brief overview & demo - Vishesh Jinda... by ShapeBlue
Hypervisor Agnostic DRS in CloudStack - Brief overview & demo - Vishesh Jinda...Hypervisor Agnostic DRS in CloudStack - Brief overview & demo - Vishesh Jinda...
Hypervisor Agnostic DRS in CloudStack - Brief overview & demo - Vishesh Jinda...
ShapeBlue164 views
Business Analyst Series 2023 - Week 4 Session 8 by DianaGray10
Business Analyst Series 2023 -  Week 4 Session 8Business Analyst Series 2023 -  Week 4 Session 8
Business Analyst Series 2023 - Week 4 Session 8
DianaGray10145 views
Business Analyst Series 2023 - Week 4 Session 7 by DianaGray10
Business Analyst Series 2023 -  Week 4 Session 7Business Analyst Series 2023 -  Week 4 Session 7
Business Analyst Series 2023 - Week 4 Session 7
DianaGray10146 views
The Role of Patterns in the Era of Large Language Models by Yunyao Li
The Role of Patterns in the Era of Large Language ModelsThe Role of Patterns in the Era of Large Language Models
The Role of Patterns in the Era of Large Language Models
Yunyao Li91 views
Developments to CloudStack’s SDN ecosystem: Integration with VMWare NSX 4 - P... by ShapeBlue
Developments to CloudStack’s SDN ecosystem: Integration with VMWare NSX 4 - P...Developments to CloudStack’s SDN ecosystem: Integration with VMWare NSX 4 - P...
Developments to CloudStack’s SDN ecosystem: Integration with VMWare NSX 4 - P...
ShapeBlue196 views
The Power of Generative AI in Accelerating No Code Adoption.pdf by Saeed Al Dhaheri
The Power of Generative AI in Accelerating No Code Adoption.pdfThe Power of Generative AI in Accelerating No Code Adoption.pdf
The Power of Generative AI in Accelerating No Code Adoption.pdf
Saeed Al Dhaheri39 views
"Running students' code in isolation. The hard way", Yurii Holiuk by Fwdays
"Running students' code in isolation. The hard way", Yurii Holiuk "Running students' code in isolation. The hard way", Yurii Holiuk
"Running students' code in isolation. The hard way", Yurii Holiuk
Fwdays36 views
LLMs in Production: Tooling, Process, and Team Structure by Aggregage
LLMs in Production: Tooling, Process, and Team StructureLLMs in Production: Tooling, Process, and Team Structure
LLMs in Production: Tooling, Process, and Team Structure
Aggregage57 views
Elevating Privacy and Security in CloudStack - Boris Stoyanov - ShapeBlue by ShapeBlue
Elevating Privacy and Security in CloudStack - Boris Stoyanov - ShapeBlueElevating Privacy and Security in CloudStack - Boris Stoyanov - ShapeBlue
Elevating Privacy and Security in CloudStack - Boris Stoyanov - ShapeBlue
ShapeBlue224 views

Correlation Analysis on Live Data Streams

  • 1. 1 CORRELATION ANALYSIS ON LIVE DATA STREAMS ARUN KEJARIWAL
  • 2. 2 STREAMING Serve data at rest LIVE Serve data as it is being generated STREAMING vs. LIVE
  • 3. MOBILE DATA TRAFFIC 17% of total IP traffic by 2021 VR/AR Traffic CAGR of 82% from 2016-2021 ANNUAL GLOBAL IP TRAFFIC 3.3 ZB by 2021 BROADBAND SPEEDS Reach 53 Mbps by 2021 INTERNET VIDEO SURVEILLANCE 3.4% of all Internet video traffic by 2021 LIVE INTERNET VIDEO 13% of Internet video traffic by 2021 THE NUMBERS 3 https://www.cisco.com/c/en/us/solutions/collateral/service-provider/visual-networking-index-vni/complete-white-paper-c11-481360.html
  • 4. 4 DATA FUSION CHARACTERISTICS DISTRIBUTED HETEROGENEOUS NON-LINEAR INTERNET OF THINGS Sensors and Actuators SMART CITIES SMART HEALTH CARE POLLUTION MONITORING INSIGHTS PROBABILISTIC METHODS THEORY OF EVIDENCE MACHINE LEARNING FUZZY LOGIC CHALLENGES DATA: INCONSISTENCY MISSINGNESS, NOISE LATENCY BATTERY CONSUMPTION INTELLIGENT DECISION MAKING
  • 5. 5 TRAFFIC SURVEILLANCE CONGESTION DETECTION ACCIDENT DETECTION HEALTH EMERGENCY RISK ALERT RAIN WATER CLOGGING, HYDROPLANING MUDSLIDE AN EXAMPLE APPLICATION OF DATA FUSION
  • 9. APPLICATION DOMAINS 9 BIOLOGY ANTHROPOLOGY AGRICULTURE PHILOSOPHY PSYCHOLOGY FINANCE ECONOMETRICS STATISTICS NETWORKING OPERATIONS
  • 10. Common representation learning Example: Across audio and video modalities in a single live stream Multimodal Linear/non-linear correlation Example: Across univariate time series of operations data Unimodal CORRELATION ANALYSIS FLAVORS: AT A HIGH LEVEL 10
  • 11. CROSS-MODAL ANALYSIS Joint Representation of Text- Audio-Video INTERPRETABILITY Comparative analysis of deep representations AUTHENTICITY Live streams vs. Recording OBJECT IDENTIFICATION Multiple cameras MULTI-STREAM SYNCHRONIZATION Different vantage points Unreliable timestamps LOCALIZATION Video ⟷ Audio Audio ⟷ Text CORRELATION ANALYSIS WHY BOTHER? 11
  • 12. Pearson Spearman 𝜌 Kendall 𝜏 Goodman and Kruskal COMMON TYPES OF CORRELATION 12 Ties are dropped
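The four coefficients named on this slide can be compared side by side with a small sketch. This is a plain-Python illustration, not code from the talk: the helper names (`pearson`, `average_ranks`, `spearman`, `kendall_tau_a`, `goodman_kruskal_gamma`) are hypothetical, Kendall's 𝜏 is shown in its tau-a form, and Goodman and Kruskal's gamma is computed by dropping tied pairs from the denominator, as the slide notes.

```python
import math

def pearson(x, y):
    """Pearson product-moment correlation of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)

def average_ranks(values):
    """1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks

def spearman(x, y):
    """Spearman's rho: Pearson correlation of the rank vectors."""
    return pearson(average_ranks(x), average_ranks(y))

def concordance_counts(x, y):
    """Count concordant and discordant pairs; tied pairs count as neither."""
    conc = disc = 0
    for i in range(len(x)):
        for j in range(i + 1, len(x)):
            s = (x[i] - x[j]) * (y[i] - y[j])
            if s > 0:
                conc += 1
            elif s < 0:
                disc += 1
    return conc, disc

def kendall_tau_a(x, y):
    """Kendall's tau-a: (C - D) over all n(n-1)/2 pairs, ties included."""
    c, d = concordance_counts(x, y)
    n = len(x)
    return (c - d) / (n * (n - 1) / 2)

def goodman_kruskal_gamma(x, y):
    """Goodman and Kruskal's gamma: (C - D) / (C + D), ties dropped."""
    c, d = concordance_counts(x, y)
    return (c - d) / (c + d)

# Illustrative data with ties in both variables (made up for this sketch).
x = [1, 2, 2, 3]
y = [1, 2, 3, 3]
print(pearson(x, y), spearman(x, y), kendall_tau_a(x, y), goodman_kruskal_gamma(x, y))
```

On tied data, gamma and tau-a diverge because the tied pairs shrink gamma's denominator but not tau-a's.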
  • 13. CORRELATION ANALYSIS A REAL LIFE EXAMPLE 13 Root cause analysis Expose investment avenues Surface optimization opportunities Risk minimization Medical diagnosis Learning
  • 15. 15 Symmetric Lack of context Spurious correlations Non-actionable Network topology CORRELATION MATRIX Scalability Hundreds of millions of time series Use cases Multiple Regression Discriminant Analysis Mahalanobis Distance
  • 16. CORRELATION MATRIX 16 * Figure borrowed from [Mueen et al. 2010] * Thresholded (= 0.5) Correlation Matrix 350x350 Correlation Matrix
  • 17. MULTIPLE CORRELATION 1957 MULTIVARIATE CASE 17 Proportion of variance of xj that cannot be explained by other independent variables Depends on the choice of the dependent variable Correlation Matrix 2-D case
  • 20. STOCHASTICITY 20 Why bother? Multiple flavors If X, Y are independent, then 𝜌(X, Y) = 0 However, the converse is not true, as only the first two moments are considered e.g., 𝜌(X, Y) = 0 even when Y = X² Not invariant under nonlinear strictly increasing transformations Pearson’s correlation only recognizes linear dependence var(X) and var(Y) have to be finite Non-stationary random processes Stochastic correlation process Applications Finance (Brownian motion), biology Time-varying correlation Rolling correlation - lagged indicator Dynamic Conditional Correlation [Engle’02] Local Correlation [Langnau’09] Wishart autoregressive process [Gourieroux’09] Transformations [van Emmerich’06] arctan and Modified Jacobi process [Ma’09] tanh transformation [Teng’16]
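Two points from this slide lend themselves to a quick numerical check: a zero Pearson correlation despite exact dependence (Y = X² with X symmetric about zero), and rolling correlation as a simple time-varying estimate. The sketch below is illustrative only; `pearson` and `rolling_correlation` are hypothetical helper names, the data are made up, and the window length is an arbitrary choice.

```python
import math

def pearson(x, y):
    """Pearson correlation; returns NaN when either input has zero variance."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return float("nan")
    return sxy / math.sqrt(sxx * syy)

def rolling_correlation(x, y, window):
    """Pearson correlation over each trailing window of the given length."""
    return [
        pearson(x[t - window + 1:t + 1], y[t - window + 1:t + 1])
        for t in range(window - 1, len(x))
    ]

# Y is a deterministic function of X, yet the linear correlation vanishes.
xs = [-3, -2, -1, 0, 1, 2, 3]
r_quadratic = pearson(xs, [v * v for v in xs])

# A strictly increasing nonlinear transform lowers Pearson's r below 1.
r_exp = pearson(xs, [math.exp(v) for v in xs])

# The relationship flips sign halfway through; the rolling estimate tracks it.
a = [1, 2, 3, 4, 5, 6]
b = [1, 2, 3, 3, 2, 1]
rolling = rolling_correlation(a, b, window=3)
print(r_quadratic, r_exp, rolling)
```

Because each rolling value only reflects the trailing window, the estimate lags the true change, which is the "lagged indicator" caveat on the slide.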
  • 22. SPURIOUS CORRELATION * Figure borrowed from [Anscombe 1973] * WHY CONTEXT IS IMPORTANT? 22 Identical r=0.816 Clearly spurious Linear correlation is perhaps not the right metric Identical summary statistics
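The identical r = 0.816 on this slide can be reproduced from the four datasets published in [Anscombe 1973]. The values below are transcribed from that paper rather than from the slides, and the `pearson` helper is a hypothetical name used only for this sketch.

```python
import math

def pearson(x, y):
    """Pearson correlation of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)

# Anscombe's quartet; datasets I-III share the same x values.
x123 = [10, 8, 13, 9, 11, 14, 6, 4, 12, 7, 5]
x4 = [8, 8, 8, 8, 8, 8, 8, 19, 8, 8, 8]
quartet = {
    "I": (x123, [8.04, 6.95, 7.58, 8.81, 8.33, 9.96, 7.24, 4.26, 10.84, 4.82, 5.68]),
    "II": (x123, [9.14, 8.14, 8.74, 8.77, 9.26, 8.10, 6.13, 3.10, 9.13, 7.26, 4.74]),
    "III": (x123, [7.46, 6.77, 12.74, 7.11, 7.81, 8.84, 6.08, 5.39, 8.15, 6.42, 5.73]),
    "IV": (x4, [6.58, 5.76, 7.71, 8.84, 8.47, 7.04, 5.25, 12.50, 5.56, 7.91, 6.89]),
}

# The four r values agree to about three decimals despite very different shapes.
correlations = {name: pearson(x, y) for name, (x, y) in quartet.items()}
for name, r in correlations.items():
    print(name, round(r, 3))
```

Plotting the four panels is what exposes the curved, outlier-driven, and single-leverage-point structures that the shared coefficient hides.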
  • 23. * ROBUSTNESS r = 0.8 r = 0.2 r is highly sensitive to slight change, as measured by Kolmogorov distance, in one of the marginal distributions Bivariate Normal Distribution Contaminated Normal Distribution PEARSON CORRELATION 23 * Figure borrowed from “Introduction to Robust Estimation and Hypothesis Testing” by Rand R. Wilcox Influence Function (IF) of Pearson’s Correlation Unbounded Pearson’s correlation does not have infinitesimal robustness Recall, first order approximation of sample IF of r is: r_{-i}: correlation coefficient based on all but the ith observation
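The leave-one-out quantity r_{-i} mentioned above can be computed directly. The transcript does not capture the slide's formula, so the sketch below assumes one commonly used form of the sample influence, (n - 1)(r - r_{-i}); treat that scaling as an assumption rather than the slide's exact expression. The helper names and the data are illustrative.

```python
import math

def pearson(x, y):
    """Pearson correlation of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)

def leave_one_out_influence(x, y):
    """Assumed form of the sample influence: (n - 1) * (r - r_{-i}) for each i."""
    n = len(x)
    r_all = pearson(x, y)
    influences = []
    for i in range(n):
        # r_{-i}: correlation based on all but the ith observation
        r_minus_i = pearson(x[:i] + x[i + 1:], y[:i] + y[i + 1:])
        influences.append((n - 1) * (r_all - r_minus_i))
    return influences

# Five points on a line plus one gross outlier (made-up data).
x = [1, 2, 3, 4, 5, 6]
y = [1, 2, 3, 4, 5, -10]
influence = leave_one_out_influence(x, y)
most_influential = max(range(len(x)), key=lambda i: abs(influence[i]))
print(influence, most_influential)
```

A single observation dominating the influence values is a practical symptom of the unbounded influence function noted on the slide.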
  • 24. 24 ROBUSTNESS * * Figure borrowed from “Introduction to Robust Estimation and Hypothesis Testing” by Rand R. Wilcox Sensitivity to anomalies Linear relationship between x & y but r = -0.21 PEARSON CORRELATION Quadrant (signum) correlation coefficient# # [Blomqvist, 1950] ^[Pasman and Shevlyakov, 1987] Correlation median estimator^
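A minimal sketch of the quadrant (signum) correlation coefficient named on this slide, contrasted with Pearson's r on data with one anomaly. It uses the usual definition, the mean of sign(x_i - med x) · sign(y_i - med y); the function names and the data are illustrative and not taken from the talk.

```python
import math
import statistics

def pearson(x, y):
    """Pearson correlation of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)

def sign(v):
    """Return -1, 0, or 1 according to the sign of v."""
    return (v > 0) - (v < 0)

def quadrant_correlation(x, y):
    """Mean of sign products after centring each variable at its median."""
    mx, my = statistics.median(x), statistics.median(y)
    return sum(sign(a - mx) * sign(b - my) for a, b in zip(x, y)) / len(x)

# A perfectly linear relationship, except for one anomalous final observation.
x = [1, 2, 3, 4, 5, 6, 7]
y = [1, 2, 3, 4, 5, 6, -50]
r = pearson(x, y)
q = quadrant_correlation(x, y)
print(r, q)
```

The single anomaly drives Pearson's r negative, while the quadrant coefficient, which depends only on signs about the medians, stays positive.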
  • 25. ROBUSTNESS PEARSON CORRELATION 25 Based on Robust Principal Variables Alternatives
  • 29. STREAMING CORRELATION Incremental, One pass CHARACTERISTICS 29 Other correlation measures are not amenable to incremental computation Applications Security Correlation power analysis
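The "incremental, one pass" property can be illustrated with a running update of means and co-moments in the style of Welford's algorithm. This is a sketch under that assumption, not the speaker's implementation; the class name `StreamingPearson` and the sample values are hypothetical.

```python
import math

class StreamingPearson:
    """One-pass Pearson correlation using Welford-style running co-moments."""

    def __init__(self):
        self.n = 0
        self.mean_x = 0.0
        self.mean_y = 0.0
        self.m2_x = 0.0  # running sum of squared deviations of x
        self.m2_y = 0.0  # running sum of squared deviations of y
        self.c_xy = 0.0  # running sum of cross deviations

    def update(self, x, y):
        """Fold one (x, y) observation into the running statistics."""
        self.n += 1
        dx = x - self.mean_x  # deviation from the old mean of x
        dy = y - self.mean_y  # deviation from the old mean of y
        self.mean_x += dx / self.n
        self.mean_y += dy / self.n
        self.m2_x += dx * (x - self.mean_x)
        self.m2_y += dy * (y - self.mean_y)
        self.c_xy += dx * (y - self.mean_y)

    def correlation(self):
        """Current estimate; NaN until both variances are non-zero."""
        if self.m2_x == 0 or self.m2_y == 0:
            return float("nan")
        return self.c_xy / math.sqrt(self.m2_x * self.m2_y)

def batch_pearson(x, y):
    """Two-pass reference implementation used only to check the stream."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)

# Made-up samples arriving one at a time.
xs = [2.0, 4.5, 1.0, 7.5, 3.0, 9.0, 5.5]
ys = [1.5, 3.0, 2.0, 6.0, 2.5, 8.5, 4.0]
stream = StreamingPearson()
for xv, yv in zip(xs, ys):
    stream.update(xv, yv)
print(stream.correlation(), batch_pearson(xs, ys))
```

The state is a constant handful of numbers per pair of series, so memory does not grow with the length of the stream; a sliding or damped window would need additional bookkeeping that this sketch omits.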
  • 31. STREAMING CORRELATION Wide Spectrum of Approaches Sliding Windows, Damped Windows Reduction Smoothing Down sampling DFT [Agrawal et al. 1993, Zhu and Shasha 2002, Qiu et al. 2018] DWT [Chan and Fu 1999, Popivanov & Miller 2002] PCA, SVD PAA [Faloutsos and Yi 2000] APAC [Chakrabarti et al. 2001] Random projections [Grellmann et al. 2016] LSH [Sundaram et al. 2013] DATA SKETCHES 31
  • 32. Bursts Page views, Clicks, Retweets Correlated burstiness (e.g., data center operations) Root-cause analysis STREAMING CORRELATION * Figure borrowed from [Sakurai et al. 2005] * Lag between time series Lagged correlation/Cross-correlation ACTIONABLE INSIGHTS 32 [Zhu and Shasha 2002, Levine et al. 2016, Wu et al. 2017] [Vlachos et al. 2008, Kotov et al. 2011, Shafer et al. 2012, Kusmierczyk and Norvag 2015]
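Lagged correlation (cross-correlation) between two series can be sketched as a brute-force scan over candidate lags. The function names, the lag convention (y[t + lag] is compared with x[t]), and the data are assumptions made for this illustration; the cited papers use more scalable techniques.

```python
import math

def pearson(x, y):
    """Pearson correlation; returns NaN when either input has zero variance."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return float("nan")
    return sxy / math.sqrt(sxx * syy)

def lagged_correlations(x, y, max_lag):
    """Correlation of x[t] with y[t + lag] for lag = 0..max_lag."""
    n = len(x)
    return {lag: pearson(x[:n - lag], y[lag:]) for lag in range(max_lag + 1)}

def best_lag(x, y, max_lag):
    """Lag with the largest correlation, ignoring undefined (NaN) entries."""
    scores = lagged_correlations(x, y, max_lag)
    valid = {lag: r for lag, r in scores.items() if not math.isnan(r)}
    return max(valid, key=valid.get)

# y repeats x two steps later, as when one metric's burst trails another's.
x = [0, 1, 0, 2, 0, 3, 0, 1, 5, 2, 0, 1]
y = [0, 0] + x[:-2]
scores = lagged_correlations(x, y, max_lag=4)
lag = best_lag(x, y, max_lag=4)
print(scores, lag)
```

Each lag shortens the overlapping segment, so large lags on short series rest on few points and should be read with caution.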
  • 34. PREDICTION MOTION 1952 “… prediction motion is the continuation of tracking motion after a target disappears from view.” 1955 1962 Unlike duration of target presentation, target speed exerts an influence on prediction accuracy. EXPLORED >5 DECADES BACK! 34
  • 35. PREDICTION MOTION 35 Human Motion/Robotics [Bütepage et al. 2017, Martinez et al. 2017, Byravan & Fox] Traffic Prediction [Hermes et al. 2010, Walker et al. 2014, Yu et al. 2017] 360-Degree Video (AR/VR) [Bao et al. 2017, Vishwanath et al., 2017] Potpourri: Deep Learning Based Approaches [Oh et al. 2015, Mathieu et al. 2016, Liang et al. 2017]
  • 36. AUDIO-VIDEO CORRELATION A LONG HISTORY: EXPERIMENTAL PSYCHOLOGY -> DEEP LEARNING 36
  • 37. AUDIO-VIDEO CORRELATION 1952 People utilize visual and postural experiences in perceiving the position of an object in the field, of the whole field. 1897 Localization of sounds varied, being different when the source of sound was in sight from what it was when this was out of sight, and also in the latter case differing with different directions of attention, or with different suggestions as to the direction from which the sound came. 1941 SOUND LOCALIZATION: AN APPLICATION 37
  • 38. AUDIO-VIDEO CORRELATION 1977 1960 1976 LIP READING: AN APPLICATION 38 Demonstrated the influence of vision on speech perception McGurk and MacDonald Established the relationship of the visually perceived symbols to the underlying linguistic system Use visual information as an aid when white noise made speech difficult to hear 1954 There’s a great opportunity for the visual contribution at low speech-to-noise ratios
  • 39. AUDIO-VIDEO CORRELATION 2007 2004 Combined acoustic and visual feature vectors to distinguish live synchronous audio-video recordings from replay attacks that use audio with a still photo. LIVENESS AND SYNCHRONIZATION: AN APPLICATION 39 Extracted the correlated components of audio and lip features based on Canonical Correlation Analysis. 2009 Showed that there exists a relationship between perception of video presented in screen and accompanying audio signals, both stereo and spatial
  • 40. 40 AUDIO-VIDEO-TEXT Speaker identification in multi-speaker scenarios [Huang & Kingsbury 2013, Chung & Zisserman 2016, Torfi et al. 2017] Cross-Modal Correlation Learning in Audio and Lyrics [Yu et al. 2017, Tang et al. 2017] Speech enhancement [Xu et al. 2014, Hou et al. 2017, Kolbæk et al. 2017] Action Recognition and Video Highlight Detection [Wu et al. 2013, Sun et al. 2013, Takahashi et al. 2017] DEEP LEARNING WAY Emotion Recognition [Tzirakis et al. 2017, Pini et al. 2017]
  • 42. 42 Online anomaly detection: speed-accuracy trade-off On the Runtime-Efficacy Trade-off of Anomaly Detection Techniques for Real-Time Streaming Data* by Choudhary et al. 2017 Breakouts/Changepoints Skew in location of anomalies SUSCEPTIBILITY TO ANOMALIES * https://arxiv.org/pdf/1710.04735.pdf
  • 43. 43 HETEROSCEDASTICITY Methods (Adjusted) Percentile Bootstrap [Wilcox 1996] Nested Bootstrap [DiCiccio et al. 1992] Challenge: Detecting non-linear correlation 2001
  • 44. 44 Potential sources: Event based monitoring via sensors, Occlusion in a video Techniques: Resampling Lomb-Scargle Fourier Transform [Andersson 2007, Rehfeld et al. 2011] Kernel - such as Laplacian, Gaussian - based methods IRREGULARLY SPACED TIME SERIES
  • 45. TIME VARYING CORRELATION 45 * Figure borrowed from [Fu et al. 2013] TVCC: Time Varying Correlation Coefficient fMRI time-courses of 4 Regions of Interest (ROI) ✦ Time varying joint distribution ✦ Parameter non-constancy/Instability F Test [Chow’60] SupF Test [Quandt’60] Lagrange Multiplier Test [Nabeya and Tanaka’88, Nyblom’89] ✦ Co-integrated processes I(1) [Hansen’90] ✦ Stochastic vs. Deterministic ✦ Dynamic Conditional Correlation (DCC) [Engle’02] * Post stimulus period with significant difference (p<0.05) w.r.t. pre-stimulus interval
  • 46. STREAMING CORRELATION Missing data Packets being dropped owing to, say, unexpected high traffic Data collection: Every, say, 5 seconds How to scale analysis to milli-seconds granularity? Unequal length time series Different sampling rates Small samples Bootstrapping Low SNR (Signal to Noise Ratio) 46
  • 47. NEXT FRONTIER LEVERAGING MULTIMODAL & CONTINUOUSLY EVOLVING CORRELATION FOR SELF-LEARNING 47
  • 49. 49 SELF-LEARNING EARLY WORK: GAMES 1950 1914 constructed a device which played an end game of king and rook against king. The machine played the side with king and rook and would force checkmate in a few moves however its human opponent played. Gerald Tesauro 1995 1959 2002
  • 50. 2002 2017 Amongst the first and most famous was the chess-playing automaton constructed in 1769 by Baron Kempelen … 1953 Alan Turing 2012 1970 GAME PLAYING POTPOURRI 50 1996 2007 The game of checkers has roughly 500 billion billion possible positions (5 × 10^20)
  • 51. SELF-LEARNING 2016 REINFORCEMENT LEARNING 51 2016 2017 2017 2018 AlphaGo Mastering the game of Go with deep neural networks and tree search by Silver et al. AlphaGo Zero Mastering the game of Go without Human knowledge by Silver et al. Alpha Zero Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm by Silver et al. Libratus Superhuman AI for heads-up no-limit poker: Libratus beats top professionals by Brown and Sandholm
  • 54. 54 Rank Correlation Methods by Kendall and Gibbons READINGS Correlation and Regression by Bobko Correlation by Garson Robust Correlation: Theory and Applications by Shevlyakov and Oja A Mathematical Theory of Evidence by Shafer BOOKS
  • 55. 55 Fast Approximate Correlation for Massive Time-series Data by Mueen et al., 2010 READINGS Local Correlation Detection with Linearity Enhancement in Streaming Data by Xie et al., 2013 Fast Distributed Correlation Discovery Over Streaming Time-Series Data by Guo et al., 2015 Random Projection for Fast and Efficient Multivariate Correlation Analysis of High-Dimensional Data: A New Approach by Grellmann et al., 2016 Detection of Highly Correlated Live Data Streams by Alseghayer et al., 2017 STREAMING CORRELATION
  • 56. 56 Perception of body position and of the position of the visual field by Witkin, 1949 READINGS Autocorrelation, a principle for the evaluation of sensory information by the nervous system by Reichardt, 1961 EARLY RESEARCH IN AUDIO-VISUAL CORRELATION Binocular cross-correlation in time and space by Tyler and Julesz, 1978 Neurontropy, an entropy-like measure of neural correlation by Julesz and Tyler, 1976 Cross correlation of sensory stimuli and electroencephalogram by Morgan, 1969
  • 57. 57 Patch to the future: Unsupervised visual prediction by Walker et al., 2014 READINGS Motion-Prediction-based Multicast for 360-Degree Video Transmissions by Bao et al., 2017 Dual Motion GAN for Future-Flow Embedded Video Prediction by Liang et al., 2017 Deep representation learning for human motion prediction and classification by Bütepage et al., 2017 On human motion prediction using recurrent neural networks by Martinez et al., 2017 RECENT WORKS IN PREDICTION MOTION
  • 58. 58 On Deep Multi-View Representation Learning by Wang et al., 2015 READINGS Correlational Neural Networks by Chandar et al., 2017 Common Representation Learning Using Step-based Correlation Multi-Modal CNN by Bhatt et al., 2017 Objects that Sound by Arandjelović and Zisserman, 2017 Deep Correlation Feature Learning for Face Verification in the Wild by Deng et al., 2017 COMMON REPRESENTATION LEARNING
  • 59. 59 READINGS Audiovisual Synchronization and Fusion Using Canonical Correlation Analysis by Sargin et al., 2007 The predictive power of trajectory motion by Watamaniuk, 2005 Seeing motion behind occluders by Watamaniuk and McKee, 1995 Temporal Coherence Theory For The Detection And Measurement Of Visual Motion by Grzywacz et al., 1995 Probabilistic Motion Estimation Based On Temporal Coherence by Burgi et al., 2000 AUDIO-VISUAL RESEARCH
  • 60. 60 A New Method of Audio-Visual Correlation Analysis by Kunka and Kostek, 2009 READINGS Uncertainty in ontologies: Dempster–Shafer theory for data fusion applications by Bellenger and Gatepaille, 2011 City Data Fusion: Sensor Data Fusion in the Internet of Things by Wang et al., 2015 Data Fusion and IoT for Smart Ubiquitous Environments: A Survey by Alam et al., 2017 Correlation Analysis of Audio and Video Contents: A Metadata based Approach by Algur et al., 2015 POTPOURRI
  • 61. 61 READINGS POTPOURRI Correlation detection as a general mechanism for multisensory integration by Parise and Ernst, 2016 Origin of information-limiting noise correlations by Kanitscheider et al., 2015 Temporal structure and complexity affect audio-visual correspondence detection by Denison et al., 2013 Correlation versus causation in multisensory perception by Mitterer and Jesse, 2010
  • 62. 62 READINGS MATHEMATICS A Bayesian approach to problems in stochastic estimation and control by Ho and Lee, 1964 A Generalization of Bayesian Inference by Dempster, 1968 Multidimensional Scaling by Kruskal and Wish, 1978