lec18.pptx

Environmental Data Analysis with MatLab
Lecture 18:
Cross-correlation

SYLLABUS
Lecture 01 Using MatLab
Lecture 02 Looking At Data
Lecture 03 Probability and Measurement Error
Lecture 04 Multivariate Distributions
Lecture 05 Linear Models
Lecture 06 The Principle of Least Squares
Lecture 07 Prior Information
Lecture 08 Solving Generalized Least Squares Problems
Lecture 09 Fourier Series
Lecture 10 Complex Fourier Series
Lecture 11 Lessons Learned from the Fourier Transform
Lecture 12 Power Spectral Density
Lecture 13 Filter Theory
Lecture 14 Applications of Filters
Lecture 15 Factor Analysis
Lecture 16 Orthogonal Functions
Lecture 17 Covariance and Autocorrelation
Lecture 18 Cross-correlation
Lecture 19 Smoothing, Correlation and Spectra
Lecture 20 Coherence; Tapering and Spectral Analysis
Lecture 21 Interpolation
Lecture 22 Hypothesis Testing
Lecture 23 Hypothesis Testing continued; F-Tests
Lecture 24 Confidence Limits of Spectra, Bootstraps

purpose of the lecture
generalize the idea of autocorrelation
to multiple time series

Review of last lecture
autocorrelation
correlations between samples within a
time series

high degree of short-term correlation
whatever the river was doing yesterday, it's probably
doing today, too
because water takes time to drain away

Neuse River Hydrograph
[Figure: A) time series, d(t): discharge (cfs) versus time t (days). Second panel: PSD ((cfs)^2 per cycle/day) versus frequency (cycles per day).]

low degree of intermediate-term correlation
whatever the river was doing last month, today it could
be doing something completely different
because storms are so unpredictable

moderate degree of long-term correlation
whatever the river was doing this time last year, it's
probably doing today, too
because seasons repeat

[Figure: three plots of discharge versus discharge lagged by 1 day, 3 days, and 30 days.]

Autocorrelation Function
[Figure: autocorrelation versus lag (days), plotted for lags of -30 to 30 days and of -3000 to 3000 days, with the labels 1, 3 and 30.]

formula for autocorrelation
autocorrelation
at lag (k-1)Δt
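
A minimal sketch of this quantity in plain Python (an illustrative stand-in, not the lecture's MatLab code). It assumes the unnormalized sum a_k = sum over i of d_i d_(i+k-1), which matches the lag (k-1)Δt labeling above; the function name and the sample values are hypothetical.

```python
def autocorrelation(d):
    """Unnormalized autocorrelation at non-negative lags.

    Returns a list a in which a[k-1] = sum_i d[i] * d[i+k-1],
    i.e. the value at lag (k-1)*Dt for k = 1..N.
    """
    n = len(d)
    return [sum(d[i] * d[i + lag] for i in range(n - lag)) for lag in range(n)]


# Illustrative values only (not the Neuse River record)
d = [1.0, 2.0, 3.0, 2.0, 1.0]
a = autocorrelation(d)
print(a)  # the zero-lag term a[0] is the largest
```

The zero-lag term is the sum of squares of the series, so no other lag can exceed it.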

autocorrelation similar to convolution
note difference in sign
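
For reference, the two integrals side by side, assuming the standard continuous-time definitions with normalization omitted (the symbols g, h and τ are illustrative choices, not taken from the slide):

```latex
\begin{align*}
\text{convolution:} \quad & (g * h)(t) = \int g(\tau)\, h(t - \tau)\, \mathrm{d}\tau \\
\text{autocorrelation:} \quad & a(t) = \int d(\tau)\, d(t + \tau)\, \mathrm{d}\tau
\end{align*}
```

The only structural difference is the sign of τ in the argument of the second factor.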

Important Relation #1
autocorrelation is the convolution of a
time series with its time-reversed self
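
This relation can be checked numerically. The sketch below is plain Python (not the lecture's MatLab code) with hypothetical helper names and sample values; it compares a directly summed autocorrelation with the convolution of the series and its time-reversed copy.

```python
def convolve(g, h):
    """Full discrete convolution: c[k] = sum_j g[j] * h[k - j]."""
    c = [0.0] * (len(g) + len(h) - 1)
    for j, gj in enumerate(g):
        for m, hm in enumerate(h):
            c[j + m] += gj * hm
    return c


def autocorrelation_full(d):
    """Unnormalized autocorrelation at lags -(N-1) .. (N-1), in samples."""
    n = len(d)
    return [
        sum(d[i] * d[i + lag] for i in range(n) if 0 <= i + lag < n)
        for lag in range(-(n - 1), n)
    ]


# Illustrative, deliberately asymmetric series
d = [1.0, 2.0, 3.0, 2.0, 0.5]

direct = autocorrelation_full(d)
via_conv = convolve(d, d[::-1])  # d[::-1] is the time-reversed series

print(direct)
print(via_conv)
```

Both lists have 2N-1 entries, one per lag, and agree entry by entry.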

Important Relation #2
Fourier Transform of an autocorrelation
is proportional to the
Power Spectral Density of time series
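
A small numerical check of this relation, sketched in plain Python rather than MatLab. It assumes a discrete Fourier transform of the zero-padded series and takes the squared magnitude as the (unnormalized) power spectral density; the helper `dft` and the sample values are hypothetical.

```python
import cmath


def dft(x):
    """Discrete Fourier transform, X[m] = sum_k x[k] * exp(-2*pi*i*k*m/n)."""
    n = len(x)
    return [
        sum(x[k] * cmath.exp(-2j * cmath.pi * k * m / n) for k in range(n))
        for m in range(n)
    ]


d = [1.0, 2.0, 3.0, 2.0, 0.5]  # illustrative values
n = len(d)
m_pad = 2 * n - 1              # long enough to hold every lag without wrap-around

# Autocorrelation at lags -(n-1)..(n-1), stored circularly: lag L goes to index L % m_pad
a = [0.0] * m_pad
for lag in range(-(n - 1), n):
    a[lag % m_pad] = sum(d[i] * d[i + lag] for i in range(n) if 0 <= i + lag < n)

ft_of_a = dft(a)
power = [abs(z) ** 2 for z in dft(d + [0.0] * (m_pad - n))]

worst = max(abs(fa - p) for fa, p in zip(ft_of_a, power))
print(worst)  # discrepancy at rounding-error level
```

With this unnormalized convention the constant of proportionality is 1; other normalizations of the power spectral density change only that constant.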

Part 1
correlations between time-series

scenario
discharge correlated with rain
but discharge is delayed behind rain
because rain takes time to drain
from the land

[Figure: rain (mm/day) and discharge (m^3/s), each plotted against time (days).]
rain ahead of discharge
shape not exactly the same, either

treat two time series u and v probabilistically
p.d.f. $p(u_i, v_{i+k-1})$
with elements lagged by time $(k-1)\Delta t$
and compute its covariance

this defines the cross-correlation
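
A minimal sketch of the definition in plain Python (not the lecture's MatLab code), assuming the unnormalized sum c_k = sum over i of u_i v_(i+k-1), which follows the indexing of the p.d.f. above. With this sign convention a positive lag means v is delayed relative to u. The function name and the sample values are hypothetical.

```python
def cross_correlation(u, v):
    """Unnormalized cross-correlation of two equal-length series.

    Returns (lags, c) with c[j] = sum_i u[i] * v[i + lags[j]],
    for lags running from -(N-1) to N-1, in samples.
    """
    n = len(u)
    if len(v) != n:
        raise ValueError("u and v must have the same length")
    lags = list(range(-(n - 1), n))
    c = [
        sum(u[i] * v[i + lag] for i in range(n) if 0 <= i + lag < n)
        for lag in lags
    ]
    return lags, c


# Illustrative data: v is u delayed by 2 samples
u = [0.0, 1.0, 2.0, 1.0, 0.0, 0.0, 0.0]
v = [0.0, 0.0, 0.0, 1.0, 2.0, 1.0, 0.0]

lags, c = cross_correlation(u, v)
best = lags[c.index(max(c))]
print(best)  # lag, in samples, at which the cross-correlation peaks
```

Passing the same series twice reduces this to the autocorrelation, whose peak is at zero lag.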

just a generalization of the auto-correlation
auto-correlation: different times in the same time series
cross-correlation: different times in different time series

like autocorrelation, similar to convolution

As with auto-correlation
two important properties
#1: relationship to convolution
#2: relationship to Fourier Transform
cross-spectral density
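
Both properties can be checked numerically. The sketch below is plain Python (not the lecture's MatLab code) and rests on stated assumptions: the cross-correlation is the unnormalized sum of u_i v_(i+lag), the Fourier transform is a zero-padded discrete transform, and the cross-spectral density is taken as conj(U) times V without normalization. All helper names and values are hypothetical.

```python
import cmath


def dft(x):
    """Discrete Fourier transform, X[m] = sum_k x[k] * exp(-2*pi*i*k*m/n)."""
    n = len(x)
    return [
        sum(x[k] * cmath.exp(-2j * cmath.pi * k * m / n) for k in range(n))
        for m in range(n)
    ]


def convolve(g, h):
    """Full discrete convolution: c[k] = sum_j g[j] * h[k - j]."""
    c = [0.0] * (len(g) + len(h) - 1)
    for j, gj in enumerate(g):
        for m, hm in enumerate(h):
            c[j + m] += gj * hm
    return c


def cross_correlation(u, v):
    """Values of sum_i u[i] * v[i + lag] for lag = -(N-1) .. N-1."""
    n = len(u)
    return [
        sum(u[i] * v[i + lag] for i in range(n) if 0 <= i + lag < n)
        for lag in range(-(n - 1), n)
    ]


u = [0.0, 1.0, 2.0, 1.0, 0.0, 0.0]  # illustrative values
v = [0.0, 0.0, 0.5, 1.0, 3.0, 1.0]
n = len(u)
c = cross_correlation(u, v)

# Property 1: cross-correlation is the convolution of time-reversed u with v
c_conv = convolve(u[::-1], v)

# Property 2: the transform of the cross-correlation equals conj(U) * V
m_pad = 2 * n - 1
c_circ = [0.0] * m_pad
for idx, lag in enumerate(range(-(n - 1), n)):
    c_circ[lag % m_pad] = c[idx]  # store lag L at index L % m_pad
ft_of_c = dft(c_circ)
u_hat = dft(u + [0.0] * (m_pad - n))
v_hat = dft(v + [0.0] * (m_pad - n))
cross_spectrum = [a.conjugate() * b for a, b in zip(u_hat, v_hat)]

print(max(abs(x - y) for x, y in zip(c, c_conv)))
print(max(abs(x - y) for x, y in zip(ft_of_c, cross_spectrum)))
```

Unlike the power spectral density, the cross-spectrum is complex in general, because the cross-correlation is not symmetric in lag.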

Part 2
aligning time-series
a simple application of cross-correlation

central idea
two time series are best aligned
at the lag at which they are most correlated,
which is
the lag at which their cross-correlation is maximum

[Figure: u(t) and v(t) plotted against time.]
two similar time-series, with a time shift
(this is a simple “test” or “synthetic” dataset)

cross-correlate
[Figure: cross-correlation versus time.]

find maximum
[Figure: the same cross-correlation, with its maximum and the corresponding time lag marked.]

In MatLab
compute cross-correlation
find maximum
compute time lag
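
The same three steps, sketched in plain Python as an illustrative stand-in for the MatLab listing. The synthetic pulses, the sampling interval `dt`, and the helper `cross_correlation` are assumptions made for this example; the sign convention is that a positive lag means v is delayed relative to u.

```python
import math


def cross_correlation(u, v):
    """Return (lags, c) with c[j] = sum_i u[i] * v[i + lags[j]], lags in samples."""
    n = len(u)
    lags = list(range(-(n - 1), n))
    c = [
        sum(u[i] * v[i + lag] for i in range(n) if 0 <= i + lag < n)
        for lag in lags
    ]
    return lags, c


# Synthetic test data: the same Gaussian pulse, centered at t = 40 in u and t = 50 in v
dt = 1.0
n = 100
t = [dt * i for i in range(n)]
u = [math.exp(-0.5 * ((ti - 40.0) / 5.0) ** 2) for ti in t]
v = [math.exp(-0.5 * ((ti - 50.0) / 5.0) ** 2) for ti in t]

lags, c = cross_correlation(u, v)  # step 1: compute cross-correlation
i_max = c.index(max(c))            # step 2: find maximum
t_lag = lags[i_max] * dt           # step 3: compute time lag

print(t_lag)
```

The lag is resolved only to the nearest sample, so its precision is limited by the sampling interval.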

[Figure: u(t) and v(t+tlag) plotted against time.]
align time series with measured lag
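
A minimal sketch of the alignment step in plain Python (not the lecture's MatLab code). It assumes the lag has already been measured as a whole number of samples and that samples shifted in from outside the record are filled with zeros; `shift_series` and the data are hypothetical.

```python
def shift_series(v, lag_samples, fill=0.0):
    """Return w with w[i] = v[i + lag_samples].

    Positions that would read outside the record are set to `fill`.
    """
    n = len(v)
    return [
        v[i + lag_samples] if 0 <= i + lag_samples < n else fill
        for i in range(n)
    ]


# Illustrative data: v is u delayed by 3 samples
u = [0.0, 1.0, 3.0, 1.0, 0.0, 0.0, 0.0, 0.0]
v = [0.0, 0.0, 0.0, 0.0, 1.0, 3.0, 1.0, 0.0]

measured_lag = 3  # assumed to have been read off the cross-correlation peak
v_aligned = shift_series(v, measured_lag)

print(v_aligned)
```

Zero fill is only one choice for the ends; trimming both series to their overlapping part avoids introducing artificial samples.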

[Figure: A) solar radiation (W/m2) versus time (days); B) ozone (ppb) versus time (days).]
solar insolation and ground-level ozone
(this is a real dataset from West Point NY)
note time lag

[Figure: C) cross-correlation versus time (hours), with the maximum marked.]
maximum at a time lag of 3 hours

[Figure: A) solar radiation (W/m2) versus time (days); B) ozone (ppb) versus time (days), original and delagged, with a 3.00 hour lag.]
