SlideShare a Scribd company logo
010100101010111001
010010111010100001
00010010010101001010101010100001
Data Smashing
Zero-Knowledge Feature-free Anomaly Detection
Data Smashing
Zero-Knowledge Feature-free Anomaly Detection
Ishanu Chattopadhyay
Computation Institute
University of Chicago
a·nom·a·ly
noun
something that deviates from what is standard, normal, or expected.
“there are a number of anomalies in the present system”
synonyms:
oddity, peculiarity, abnormality, irregularity, inconsistency, incongruity,
aberration, quirk, rarity
Deterministic Behavior
Specifying “Normal” Might Be Easier
Stochastic Process Deterministic + Noise
1 · 102
2 · 102
3 · 102
4 · 102
5 · 102
6 · 102
7 · 102
8 · 102
9 · 102
1 · 103
−1 · 101
−5 · 100
0 · 100
5 · 100
1 · 101
time
Stochastic Process Deterministic + Noise
1 · 102
2 · 102
3 · 102
4 · 102
5 · 102
6 · 102
7 · 102
8 · 102
9 · 102
1 · 103
−1 · 101
−5 · 100
0 · 100
5 · 100
1 · 101
time
Stochastic Process Deterministic + Noise
1 · 102
2 · 102
3 · 102
4 · 102
5 · 102
6 · 102
7 · 102
8 · 102
9 · 102
1 · 103
−1 · 101
−5 · 100
0 · 100
5 · 100
1 · 101
Similar ?
time
Brainwaves: Identical Stimuli
5 · 101
1 · 102
1.5 · 102
2 · 102
2.5 · 102
−4 · 100
−2 · 100
0 · 100
2 · 100
5 · 101
1 · 102
1.5 · 102
2 · 102
2.5 · 102
−5 · 100
0 · 100
5 · 100
5 · 101
1 · 102
1.5 · 102
2 · 102
2.5 · 102
−5 · 100
0 · 100
5 · 100
5 · 101
1 · 102
1.5 · 102
2 · 102
2.5 · 102
−4 · 100
−2 · 100
0 · 100
2 · 100
4 · 100
time
Are they
from the
same individual?
Brainwaves: Identical Stimuli
5 · 101
1 · 102
1.5 · 102
2 · 102
2.5 · 102
−4 · 100
−2 · 100
0 · 100
2 · 100
5 · 101
1 · 102
1.5 · 102
2 · 102
2.5 · 102
−5 · 100
0 · 100
5 · 100
5 · 101
1 · 102
1.5 · 102
2 · 102
2.5 · 102
−5 · 100
0 · 100
5 · 100
5 · 101
1 · 102
1.5 · 102
2 · 102
2.5 · 102
−4 · 100
−2 · 100
0 · 100
2 · 100
4 · 100
time
Subject A
Subject B
State of Art
Requires Features
Feature 1
Feature 2
Feature 3
Similarity:
Distance between
feature vectors
Data Smashing
b
c
a
b
c
a
cababcbababcbbacbcbcbababacacabaccbcbccab· · ·
Quantize: bcbbbbacabbcbbbbabbbbba· · ·
Quantize: abbababcbbbabbbabcbbbab· · ·
Invert: cacacccaaabcacbcacabacaaacacac· · ·
Signal 1
Signal 2
Data Smashing
b
c
a
b
c
a
cababcbababcbbacbcbcbababacacabaccbcbccab· · ·
Quantize: bcbbbbacabbcbbbbabbbbba· · ·
Quantize: abbababcbbbabbbabcbbbab· · ·
Invert: cacacccaaabcacbcacabacaaacacac· · ·
Noise?Noise?
Signal 1
Signal 2
Data Smashing
I. Chattopadhyay and H. Lipson , “Data Smashing: Uncovering Lurking Order
In Data”, Royal Society Interface, 2014 vol. 11 no. 101 20140826
Data Smashing
Data Smashing
Data Smashing
Data Smashing
Data Smashing
Data Smashing
Data Smashing
Data Smashing
Data Smashing
Anti-streams
Intuitive Description
Stream s −→ Anti-stream s
Anti-streams
Intuitive Description
Stream s −→ Anti-stream s
0 1
0
0.2
0.4
0.6
0.8
1
0.7
0.3
0 1
0
0.2
0.4
0.6
0.8
1
0.3
0.7
Anti-streams
Intuitive Description
Stream s −→ Anti-stream s
0 1
0
0.2
0.4
0.6
0.8
1
0.7
0.3
0 1
0
0.2
0.4
0.6
0.8
1
0.3
0.7
00 01 10 11
0.2
0.4
0.48
0.24
0.16
0.12
00 01 10 11
0.2
0.4
0.1
0.2
0.3
0.4
Anti-streams
Intuitive Description
Stream s −→ Anti-stream s
0 1
0
0.2
0.4
0.6
0.8
1
0.7
0.3
0 1
0
0.2
0.4
0.6
0.8
1
0.3
0.7
00 01 10 11
0.2
0.4
0.48
0.24
0.16
0.12
00 01 10 11
0.2
0.4
0.1
0.2
0.3
0.4
000 001 010 011 100 101 110 111
0
0.2
0.4
000 001 010 011 100 101 110 111
0
0.1
0.2
Quantization
Mapping Continuous Data to Symbol Stream
Quantization Alphabet Σ = {0, 1, 2, 3}
The Black Box Approach
Dynamical System As A Symbol Generator
1
1
22
Pr(0|s) = 0.8
Pr(1|s) = 0
Pr(2|s) = 0.2
The Black Box Approach
Dynamical System As A Symbol Generator
0
0
10
2
22
0
11
2
21
1
22
Pr(0|s) = 0.4
Pr(1|s) = 0.3
Pr(2|s) = 0.3
Ergodic Stationary Quantized Process ⇐⇒ Probability Measure on Infinite Strings
System As A Symbol Generator
Probability Space (Σω
, B, µ)
Σω : Set of strictly infinite strings on alphabet Σ
B : smallest σ-algebra generated by the sets xΣω : x ∈ Σ
µ : Probability measure on infinite strings:
µ(xΣω
) → [0, 1]
x∈Σ
µ(xΣω
) = 1
System As A Symbol Generator
Probability Space (Σω
, B, µ)
Σω : Set of strictly infinite strings on alphabet Σ
B : smallest σ-algebra generated by the sets xΣω : x ∈ Σ
µ : Probability measure on infinite strings:
µ(xΣω
) → [0, 1]
x∈Σ
µ(xΣω
) = 1
Probability Measure on Infinite Strings =⇒ Equivalence Relation on Finite Strings
∀x1, x2 ∈ Σ , x1 ∼ x2
if ∀x ∈ Σ , µ(x1xΣω
) = µ(x2xΣω
)
Equivalence Classes are causal states
Probabilistic Finite State Automata
Models For Quantized Stationary Ergodic Stochastic Processes
q1 q2
σ1|0.7
σ0|0.3 σ1|0.9
σ0|0.1
q1 q2
q3q4
σ1|0.7
σ1|0.7
σ0|0.7
σ0|0.7
σ0|0.3
σ1|0.9
σ
1|0.1σ
0|0.1
q1
σ0|0.5
σ1|0.5
℘1 ℘2
G1
G2
G3
Space Of PFSA
Space Of Measures
Adding Probability Measures
Consider two measures:
℘1 : B → [0, 1]
℘2 : B → [0, 1]
Define a binary operation:
℘1 ⊕ ℘2 ℘3
where
℘3(xΣω
) = ℘1(xΣω
)℘2(xΣω
) " Constant
x∈Σ
℘3(xΣω
) = 1
Adding Probability Measures
Consider two measures:
℘1 : B → [0, 1]
℘2 : B → [0, 1]
Define a binary operation:
℘1 ⊕ ℘2 ℘3
where
℘3(xΣω
) = ℘1(xΣω
)℘2(xΣω
) " Constant
x∈Σ
℘3(xΣω
) = 1
Commutative
Closed
Unique Inverse
Unique Identity
Abelian Group
Lifting Group Structure To PFSAs
q1 q2
σ1|0.7
σ0|0.3 σ1|0.9
σ0|0.1
q1
q2
q3
σ1
|0.2
σ0 |0.8
σ1|0.3σ0|0.7
σ
1 |0.4
σ0|0.6
q1 q2 q3
q4 q5 q6
σ1|0.86
σ0|0.14
σ1|0.7
σ0|0.3
σ1|0.79
σ0|0.21
σ1|0.57
σ0|0.43
σ0|0.39
σ1|0.61
σ0|0.63
σ1|0.37
℘1 =℘2 ℘3
H
G1 G2
G3Space Of PFSA
Space Of Measures
Mathematical Structure Of Model Space
The Abelian Group
The Space of Models has the mathematical structure of an Abelian Group
Mathematical Structure Of Model Space
The Abelian Group
The Space of Models has the mathematical structure of an Abelian Group
1 + 2 = 3
2 − 2 = 0
3 + 0 = 3
Mathematical Structure Of Model Space
The Abelian Group
The Space of Models has the mathematical structure of an Abelian Group
1 + 2 = 3
2 − 2 = 0
3 + 0 = 3
We can “add and subtract” models:
G + H = J
G − H = K
Mathematical Structure Of Model Space
The Abelian Group
The Space of Models has the mathematical structure of an Abelian Group
1 + 2 = 3
2 − 2 = 0
3 + 0 = 3
We can “add and subtract” models:
G + H = J
G − H = K
G − G =?
The Zero Machine
q1 q2
σ1|0.3
σ0|0.7 σ1|0.1
σ0|0.9
q1 q2
σ1|0.7
σ0|0.3 σ1|0.9
σ0|0.1
q1
σ0|0.5
σ1|0.5
G −G
W
(Flat White Noise)+ =
The Zero Machine
q1 q2
σ1|0.3
σ0|0.7 σ1|0.1
σ0|0.9
q1 q2
σ1|0.7
σ0|0.3 σ1|0.9
σ0|0.1
q1
σ0|0.5
σ1|0.5
G −G
W
(Flat White Noise)+ =
q1
σ0|0.5
σ1|0.5
Zero PFSA
for binary alphabet
q1
σ0|1/3σ1|1/3
σ2|1/3
Zero PFSA
for trinary alphabet
The Zero Machine
Maximum entropy rate
History is useless for prediction
Encodes minimum information
q1
σ0|0.5
σ1|0.5
Zero PFSA
or binary alphabet
1
1
00
Pr(0|s) = 0.5
Pr(1|s) = 0.5
Stream Inversion
Direct Generation of Anti-streams
q1 q2
σ1|0.3
σ0|0.7 σ1|0.1
σ0|0.9
Group-theoretic
Inversion
Using PFSA model
Stream Inversion
via selective erasure of s1
q1 q2
σ1|0.7
σ0|0.3 σ1|0.9
σ0|0.1
sequence s1 sequence s
−1
G −G
Space Of PFSA
Annihilation Identity
q1 q2
σ1|0.3
σ0|0.7 σ1|0.1
σ0|0.9
q1 q2
σ1|0.7
σ0|0.3 σ1|0.9
σ0|0.1
q1
σ0|0.5
σ1|0.5
sequence s1
sequence s2
sequence s
G
−G
W
(Flat White
Noise Model)
Flat White Noise
=
Space Of PFSA
Example Of Stream, Anti-stream, & FWN
4 Letter Alphabet
Signal
Inverse Signal
Flat White Noise
EEG - Epileptic Pathology
5 · 102
1 · 103
1.5 · 103
2 · 103
2.5 · 103
3 · 103
3.5 · 103
4 · 103
−2 · 102
0 · 100
2 · 102
5 · 102
1 · 103
1.5 · 103
2 · 103
2.5 · 103
3 · 103
3.5 · 103
4 · 103
−2 · 102
0 · 100
2 · 102
5 · 102
1 · 103
1.5 · 103
2 · 103
2.5 · 103
3 · 103
3.5 · 103
4 · 103
0 · 100
5 · 102
5 · 102
1 · 103
1.5 · 103
2 · 103
2.5 · 103
3 · 103
3.5 · 103
4 · 103
−1 · 102
0 · 100
1 · 102
time
Eyes Open
Epileptic Pathology
EEG - Epileptic Pathology
Pathology
Eyes
Closed
(Normal)
Eyes
Open
(Normal)
100 200 300
100
200
300
Pairwise Distance Matrix
9.9 · 10
−4
1 · 10
−2
1 · 10
−1
0
0.2
0.4
0.6 0
0.2
0.4
0
0.2
x
yz
3D Euclidean Embedding
Anomaly
Normal (Eyes closed)
Normal (Eyes open)
PlaneA
Cardiac Pathology
Disambiguate Normal Rhythm from Murmur
1 · 102
2 · 102
3 · 102
4 · 102
5 · 102
6 · 102
−1 · 10−1
−5 · 10−2
0 · 100
5 · 10−2
1 · 102
2 · 102
3 · 102
4 · 102
5 · 102
6 · 102
−5 · 10−1
0 · 100
1 · 102
2 · 102
3 · 102
4 · 102
5 · 102
6 · 102
−4 · 10−1
−2 · 10−1
0 · 100
2 · 10−1
4 · 10−1
1 · 102
2 · 102
3 · 102
4 · 102
5 · 102
6 · 102
−5 · 10−1
0 · 100
5 · 10−1
time
Normal Rhythm
Murmur
Cardiac Pathology
20 40 60
20
40
60
Distance Matrix
0
9 · 10
−5
1 · 10
−2
0
0.05
0.1
0
2
4
6
·10−20
0.02
x
y
z
3D Euclidean Embeddin
Murmur
Healthy
EEG Based Biometric Authentication
122 Subjects, 100% Accuracy
T8
FT8
F8
AF8
FP2
FPZ
FP1
AF7
F7
FT7
T7
TP7
P7
PO7
O1
OZ
O2
PO8
P8
TP8
CZ
FCZ
FZ
AFZ
CPZ
PZ
POZ
F1
F3
F5
F2
F4
F6
FC1FC3FC5
FC2 FC4 FC6
C1C3C5 C2 C4 C6
CP1CP3CP5
CP2 CP4 CP6
P1
P3
P5
P2
P4
P6
PO1 PO2
AF1 AF2
0.5
0.6
0.7
0.2 0.4
0
0.2
x
y
z
3D Euclidean Embedding
(3 random subjects)
EEG Based Biometric Authentication
122 Subjects, 100% Accuracy
T8
FT8
F8
AF8
FP2
FPZ
FP1
AF7
F7
FT7
T7
TP7
P7
PO7
O1
OZ
O2
PO8
P8
TP8
CZ
FCZ
FZ
AFZ
CPZ
PZ
POZ
F1
F3
F5
F2
F4
F6
FC1FC3FC5
FC2 FC4 FC6
C1C3C5 C2 C4 C6
CP1CP3CP5
CP2 CP4 CP6
P1
P3
P5
P2
P4
P6
PO1 PO2
AF1 AF2
0.5
0.6
0.7
0.2 0.4
0
0.2
x
y
z
3D Euclidean Embedding
(3 random subjects)
Classification of Variable Stars
Optical Gravitational Lensing Experiment (OGLE) database
Cepheid
RR Lyrae
0 5,000 10,000
0
5,000
10,000
Distance Matrix
9.9 · 10
−4
1 · 10
−2
1 · 10
−1
0 0.2 0.4 0.6
0
0.2
0.4
0
0.2
0.4
x
yz
3D Euclidean Embedding
RRL
Cepheids
Self Annihilation Error
−1
Sequence s Flat White Noise
Self Annihilation Error
−1
Sequence s Flat White Noise
Only if |s| is large enough !
Self Annihilation Error
−1
Sequence s s
s = Θ(s , W)
Self Annihilation Error
−1
Sequence s s
s = Θ(s , W)
s → 0 exponentially fast with |s|
Information content of stream
Self Annihilation Error
−1
Sequence s s
s = Θ(s , W)
Auto-detect Data Sufficiency!
Cognitive Fingerprinting
User Authentication From Keypress Dynamics
100 200 300 400
−1000
0
1000
100 200 300 400
−1000
0
1000
2000
100 200 300 400
−2000
0
2000
100 200 300 400
−1000
0
1000
100 200 300 400
−1000
0
1000
100 200 300 400
−2000
0
2000
100 200 300 400
−1000
0
1000
2000
100 200 300 400
−1000
0
1000
100 200 300 400
−1000
0
1000
100 200 300 400
−1000
0
1000
JOHNCHERYL
2 4 6 8 10 12 14 16
2
4
6
8
10
12
14
16
CHERYL
JASON
JOHN
Time-series of
keypress delays
Random text
Cognitive Fingerprinting
User Authentication From Keypress Dynamics
10
1
10
2
10
−1
Threshold
ACCEPT
# of Key Press
ACCEPT
Data Sufficiency Test
Authentication Test
10
1
10
2
10
−1
Threshold
Data Sufficiency Test
Authentication Test
REJECT
# of Key Press
User Authorized
User Rejected
Cognitive Fingerprinting
User Authentication From Keypress Dynamics
10
1
10
2
10
−1
Threshold
# of Key Press
ACCEPT
REJECT
USER CHANGE
Authorization
revoked
on unexpected user
change detection
Physical Dexterity Quantification
Dynamic Regulation of Instabilities: Athletes vs Amateurs
Francisco Valero-Cuevas
Brain-body Dynamics Lab
Univ. of Southern California
−0.6
−0.4
−0.2
0
0.2
−0.5
0
0.5
1
−1
0
1
−0.4
−0.2
0
0.2
0.4
−0.4
−0.2
0
0.2
0.4
0 1,000 2,000 3,000 4,000 5,000
−2
0
2
Physical Dexterity Quantification
Dynamic Regulation of Instabilities: Athletes vs Amateurs
50 100 150 200
50
100
150
200
Pairwise Distance Matrix
0.02 · 10−14 · 10−1
0.0
5 · 10−2
1 · 10−1
0.0
5 · 10−2
x
y
z
3D Euclidean Embedding
Physical Dexterity Quantification
Dynamic Regulation of Instabilities: Athletes vs Amateurs
50 100 150
50
100
150
Pairwise Distance Matrix
180 200
180
200
0.02 · 10−14 · 10−1
0.0
5 · 10−2
1 · 10−1
0.0
5 · 10−2
x
y
z
3D Euclidean Embedding
Handwritten Digits
Recognition / Classification Problem
Handwritten Digits
Recognition / Classification Problem
Handwritten Digits
Recognition / Classification Problem
Handwritten Digits
Recognition / Classification Problem
State of Art Approaches
Find Clever Representations
State of Art Approaches
Find Clever Representations
State of Art Approaches
Find Clever Representations
State of Art Approaches
Find Clever Representations
Feature
Vector 1
Feature
Vector 2
Feature
Vector 16
Image To Symbol Stream
Random Walk
Handwritten Digits
Recognition / Classification Problem
4
9
100 200 300
100
200
300
Pairwise Distance Matrix
00.20.40.6
0
0.2
0.4
0.6
0
0.2
0.4
0.6
x
y
z
3D Euclidean Embedding
Memory-less Stream Manipulation
Generating First Independent Copy for String s1:
2
0
s1:
FWN:
Copy 1:
Memory-less Stream Manipulation
Generating First Independent Copy for String s1:
2
0
s1:
FWN:
Copy 1:
1
1
1
Memory-less Stream Manipulation
Generating First Independent Copy for String s1:
2
0
s1:
FWN:
Copy 1:
1 2
1 2
1 2
Memory-less Stream Manipulation
Generating First Independent Copy for String s1:
2 0
0 2
s1:
FWN:
Copy 1:
1 2
1 2
1 2
Memory-less Stream Manipulation
Generating First Independent Copy for String s1:
2 0 1
0 2 2
s1:
FWN:
Copy 1:
1 2
1 2
1 2
Memory-less Stream Manipulation
Generating First Independent Copy for String s1:
2 0 1 0 2 2 1 2 1 1 0 0 2 0
0 2 2 1 0 0 2 1 2 2 2 2 0 2
s1:
FWN:
Copy 1:
1 2 2 0 0 0 1 0 0 1 0 2
1 2 2 0 0 0 1 0 0 1 0 2
1 2 2 0 0 0 1 0 0 1 0 2 1 0 2 2 2 1 2 2 1 2 2 1 2 1
Generating Second Independent Copy for String s1:
2
0
s1:
FWN:
Copy 2:
Memory-less Stream Manipulation
Generating First Independent Copy for String s1:
2 0 1 0 2 2 1 2 1 1 0 0 2 0
0 2 2 1 0 0 2 1 2 2 2 2 0 2
s1:
FWN:
Copy 1:
1 2 2 0 0 0 1 0 0 1 0 2
1 2 2 0 0 0 1 0 0 1 0 2
1 2 2 0 0 0 1 0 0 1 0 2 1 0 2 2 2 1 2 2 1 2 2 1 2 1
Generating Second Independent Copy for String s1:
2 1 1 0 2 0 0 1 2 0 1 0 1 0 2 0 1 0 2
0 0 0 2 0 2 2 2 1 2 0 1 0 2 0 2 2 2 0
s1:
FWN:
Copy 2:
2 0 2 2 1 0 0
2 0 2 2 1 0 0
2 0 2 2 1 0 0 2 2 2 1 2 2 0 1 2 2 1 2 1 2
Memory-less Stream Manipulation
Generating Inverse of String s1:
1
Inverse:
Copy 2:
Copy 1:
2
0
0
Memory-less Stream Manipulation
Generating Inverse of String s1:
1 2
Inverse:
Copy 2:
Copy 1:
2 0
0 1
0 1
Memory-less Stream Manipulation
Generating Inverse of String s1:
2
2
1 2
Inverse:
Copy 2:
Copy 1:
2 0
0 1
0 1
Memory-less Stream Manipulation
Generating Inverse of String s1:
2
2
1 2 0
Inverse:
Copy 2:
Copy 1:
2 0 2
0 1 1
0 1 1
Memory-less Stream Manipulation
Generating Inverse of String s1:
2 0 2 0 2 2 1 2
2 0 2 0 2 2 1 2
1 2 0 0 1 0 0 1 0 1 2 2 1 2 2 1 2 1
Inverse:
Copy 2:
Copy 1:
2 0 2 1 0 2 2 2 1 2 1 1 2
0 1 1 2 2 1 1 0 2 0 0 0 0
0 1 1 2 2 1 1 0 2 0 0 0 0
Summing s2 with Inverse of s1
2 2 1 2 1 0 2 2 2 1 0 0 2 1 2 2
0 1 2 1 0 2 0 0 0
· · · · · ·
Inverse of String s1:
String s2:
Result:
1 2 1 0
1 2 1 0
1 2 1 0
Summing It Up!
Notion of Universal Similarity
Zero-knowledge Feature-free Anomaly Detection
Patents held by Cornell University
6259-01-US Stochastic Automata for Earthquake Prediction from Large
Scale Surveys
6259-02-PC Systems and Methods for Abductive Learning of Quantized
Stochastic Process
6024-03-PC System and Methods for Analysis of Data PCT/US13/62397
6998-01-US Causality Network Construction Algorithm (Application no.
62170063, EFS ID 2517508)

More Related Content

What's hot

solucionario de purcell 1
solucionario de purcell 1solucionario de purcell 1
solucionario de purcell 1
José Encalada
 
Modeling the Dynamics of SGD by Stochastic Differential Equation
Modeling the Dynamics of SGD by Stochastic Differential EquationModeling the Dynamics of SGD by Stochastic Differential Equation
Modeling the Dynamics of SGD by Stochastic Differential Equation
Mark Chang
 
Management of uncertainties in numerical aerodynamics
Management of uncertainties in numerical aerodynamicsManagement of uncertainties in numerical aerodynamics
Management of uncertainties in numerical aerodynamics
Alexander Litvinenko
 
Mx/G(a,b)/1 With Modified Vacation, Variant Arrival Rate With Restricted Admi...
Mx/G(a,b)/1 With Modified Vacation, Variant Arrival Rate With Restricted Admi...Mx/G(a,b)/1 With Modified Vacation, Variant Arrival Rate With Restricted Admi...
Mx/G(a,b)/1 With Modified Vacation, Variant Arrival Rate With Restricted Admi...
IJRES Journal
 
Probability and Statistics
Probability and StatisticsProbability and Statistics
Probability and Statistics
shekharpatil33
 
Iterative methods with special structures
Iterative methods with special structuresIterative methods with special structures
Iterative methods with special structures
David Gleich
 
Solutions manual for calculus an applied approach brief international metric ...
Solutions manual for calculus an applied approach brief international metric ...Solutions manual for calculus an applied approach brief international metric ...
Solutions manual for calculus an applied approach brief international metric ...
Larson612
 
51549 0131469657 ism-8
51549 0131469657 ism-851549 0131469657 ism-8
51549 0131469657 ism-8Carlos Fuentes
 
Lecture 4: Stochastic Hydrology (Site Characterization)
Lecture 4: Stochastic Hydrology (Site Characterization)Lecture 4: Stochastic Hydrology (Site Characterization)
Lecture 4: Stochastic Hydrology (Site Characterization)
Amro Elfeki
 
Thesis defense
Thesis defenseThesis defense
Thesis defense
Zheng Mengdi
 
Cuaderno+de+integrales
Cuaderno+de+integralesCuaderno+de+integrales
Cuaderno+de+integralesjoseluisroyo
 
Lecture 2: Stochastic Hydrology
Lecture 2: Stochastic Hydrology Lecture 2: Stochastic Hydrology
Lecture 2: Stochastic Hydrology
Amro Elfeki
 
Lecture 6: Stochastic Hydrology (Estimation Problem-Kriging-, Conditional Sim...
Lecture 6: Stochastic Hydrology (Estimation Problem-Kriging-, Conditional Sim...Lecture 6: Stochastic Hydrology (Estimation Problem-Kriging-, Conditional Sim...
Lecture 6: Stochastic Hydrology (Estimation Problem-Kriging-, Conditional Sim...
Amro Elfeki
 
第13回数学カフェ「素数!!」二次会 LT資料「乱数!!」
第13回数学カフェ「素数!!」二次会 LT資料「乱数!!」第13回数学カフェ「素数!!」二次会 LT資料「乱数!!」
第13回数学カフェ「素数!!」二次会 LT資料「乱数!!」
Ken'ichi Matsui
 
51554 0131469657 ism-13
51554 0131469657 ism-1351554 0131469657 ism-13
51554 0131469657 ism-13Carlos Fuentes
 
Lecture 3: Stochastic Hydrology
Lecture 3: Stochastic HydrologyLecture 3: Stochastic Hydrology
Lecture 3: Stochastic Hydrology
Amro Elfeki
 
Identification of the Mathematical Models of Complex Relaxation Processes in ...
Identification of the Mathematical Models of Complex Relaxation Processes in ...Identification of the Mathematical Models of Complex Relaxation Processes in ...
Identification of the Mathematical Models of Complex Relaxation Processes in ...
Vladimir Bakhrushin
 
Hideitsu Hino
Hideitsu HinoHideitsu Hino
Hideitsu Hino
Suurist
 
An Introduction into Anomaly Detection Using CUSUM
An Introduction into Anomaly Detection Using CUSUMAn Introduction into Anomaly Detection Using CUSUM
An Introduction into Anomaly Detection Using CUSUM
Dominik Dahlem
 

What's hot (20)

solucionario de purcell 1
solucionario de purcell 1solucionario de purcell 1
solucionario de purcell 1
 
Modeling the Dynamics of SGD by Stochastic Differential Equation
Modeling the Dynamics of SGD by Stochastic Differential EquationModeling the Dynamics of SGD by Stochastic Differential Equation
Modeling the Dynamics of SGD by Stochastic Differential Equation
 
Management of uncertainties in numerical aerodynamics
Management of uncertainties in numerical aerodynamicsManagement of uncertainties in numerical aerodynamics
Management of uncertainties in numerical aerodynamics
 
Mx/G(a,b)/1 With Modified Vacation, Variant Arrival Rate With Restricted Admi...
Mx/G(a,b)/1 With Modified Vacation, Variant Arrival Rate With Restricted Admi...Mx/G(a,b)/1 With Modified Vacation, Variant Arrival Rate With Restricted Admi...
Mx/G(a,b)/1 With Modified Vacation, Variant Arrival Rate With Restricted Admi...
 
Probability and Statistics
Probability and StatisticsProbability and Statistics
Probability and Statistics
 
Iterative methods with special structures
Iterative methods with special structuresIterative methods with special structures
Iterative methods with special structures
 
Solutions manual for calculus an applied approach brief international metric ...
Solutions manual for calculus an applied approach brief international metric ...Solutions manual for calculus an applied approach brief international metric ...
Solutions manual for calculus an applied approach brief international metric ...
 
51549 0131469657 ism-8
51549 0131469657 ism-851549 0131469657 ism-8
51549 0131469657 ism-8
 
Lecture 4: Stochastic Hydrology (Site Characterization)
Lecture 4: Stochastic Hydrology (Site Characterization)Lecture 4: Stochastic Hydrology (Site Characterization)
Lecture 4: Stochastic Hydrology (Site Characterization)
 
Thesis defense
Thesis defenseThesis defense
Thesis defense
 
Cuaderno+de+integrales
Cuaderno+de+integralesCuaderno+de+integrales
Cuaderno+de+integrales
 
Lecture 2: Stochastic Hydrology
Lecture 2: Stochastic Hydrology Lecture 2: Stochastic Hydrology
Lecture 2: Stochastic Hydrology
 
Lecture 6: Stochastic Hydrology (Estimation Problem-Kriging-, Conditional Sim...
Lecture 6: Stochastic Hydrology (Estimation Problem-Kriging-, Conditional Sim...Lecture 6: Stochastic Hydrology (Estimation Problem-Kriging-, Conditional Sim...
Lecture 6: Stochastic Hydrology (Estimation Problem-Kriging-, Conditional Sim...
 
第13回数学カフェ「素数!!」二次会 LT資料「乱数!!」
第13回数学カフェ「素数!!」二次会 LT資料「乱数!!」第13回数学カフェ「素数!!」二次会 LT資料「乱数!!」
第13回数学カフェ「素数!!」二次会 LT資料「乱数!!」
 
51554 0131469657 ism-13
51554 0131469657 ism-1351554 0131469657 ism-13
51554 0131469657 ism-13
 
Lecture 3: Stochastic Hydrology
Lecture 3: Stochastic HydrologyLecture 3: Stochastic Hydrology
Lecture 3: Stochastic Hydrology
 
Capitulo 5 Soluciones Purcell 9na Edicion
Capitulo 5 Soluciones Purcell 9na EdicionCapitulo 5 Soluciones Purcell 9na Edicion
Capitulo 5 Soluciones Purcell 9na Edicion
 
Identification of the Mathematical Models of Complex Relaxation Processes in ...
Identification of the Mathematical Models of Complex Relaxation Processes in ...Identification of the Mathematical Models of Complex Relaxation Processes in ...
Identification of the Mathematical Models of Complex Relaxation Processes in ...
 
Hideitsu Hino
Hideitsu HinoHideitsu Hino
Hideitsu Hino
 
An Introduction into Anomaly Detection Using CUSUM
An Introduction into Anomaly Detection Using CUSUMAn Introduction into Anomaly Detection Using CUSUM
An Introduction into Anomaly Detection Using CUSUM
 

Similar to Data Smashing

Geohydrology ii (3)
Geohydrology ii (3)Geohydrology ii (3)
Geohydrology ii (3)
Amro Elfeki
 
CLIM Fall 2017 Course: Statistics for Climate Research, Spatial Data: Models ...
CLIM Fall 2017 Course: Statistics for Climate Research, Spatial Data: Models ...CLIM Fall 2017 Course: Statistics for Climate Research, Spatial Data: Models ...
CLIM Fall 2017 Course: Statistics for Climate Research, Spatial Data: Models ...
The Statistical and Applied Mathematical Sciences Institute
 
Interface 2010
Interface 2010Interface 2010
Interface 2010
jtrevelator
 
Probability-1.pptx
Probability-1.pptxProbability-1.pptx
Probability-1.pptx
MuskanKhan320706
 
NTHU AI Reading Group: Improved Training of Wasserstein GANs
NTHU AI Reading Group: Improved Training of Wasserstein GANsNTHU AI Reading Group: Improved Training of Wasserstein GANs
NTHU AI Reading Group: Improved Training of Wasserstein GANs
Mark Chang
 
Families of Triangular Norm Based Kernel Function and Its Application to Kern...
Families of Triangular Norm Based Kernel Function and Its Application to Kern...Families of Triangular Norm Based Kernel Function and Its Application to Kern...
Families of Triangular Norm Based Kernel Function and Its Application to Kern...
Okamoto Laboratory, The University of Electro-Communications
 
MVPA with SpaceNet: sparse structured priors
MVPA with SpaceNet: sparse structured priorsMVPA with SpaceNet: sparse structured priors
MVPA with SpaceNet: sparse structured priors
Elvis DOHMATOB
 
Normal probability distribution
Normal probability distributionNormal probability distribution
Normal probability distribution
Nadeem Uddin
 
Input analysis
Input analysisInput analysis
Input analysis
Bhavik A Shah
 
1506 binomial-coefficients
1506 binomial-coefficients1506 binomial-coefficients
1506 binomial-coefficients
Dr Fereidoun Dejahang
 
Intelligent Process Control Using Neural Fuzzy Techniques ~陳奇中教授演講投影片
Intelligent Process Control Using Neural Fuzzy Techniques ~陳奇中教授演講投影片Intelligent Process Control Using Neural Fuzzy Techniques ~陳奇中教授演講投影片
Intelligent Process Control Using Neural Fuzzy Techniques ~陳奇中教授演講投影片Chyi-Tsong Chen
 
Ml presentation
Ml presentationMl presentation
Ml presentation
Mark Fetherolf
 
Slides: Jeffreys centroids for a set of weighted histograms
Slides: Jeffreys centroids for a set of weighted histogramsSlides: Jeffreys centroids for a set of weighted histograms
Slides: Jeffreys centroids for a set of weighted histograms
Frank Nielsen
 
QMC: Transition Workshop - Applying Quasi-Monte Carlo Methods to a Stochastic...
QMC: Transition Workshop - Applying Quasi-Monte Carlo Methods to a Stochastic...QMC: Transition Workshop - Applying Quasi-Monte Carlo Methods to a Stochastic...
QMC: Transition Workshop - Applying Quasi-Monte Carlo Methods to a Stochastic...
The Statistical and Applied Mathematical Sciences Institute
 
alt klausur
alt klausuralt klausur
alt klausur
zhongchengdai
 
微積分定理與公式
微積分定理與公式微積分定理與公式
微積分定理與公式
zoayzoay
 
Numerical Methods: curve fitting and interpolation
Numerical Methods: curve fitting and interpolationNumerical Methods: curve fitting and interpolation
Numerical Methods: curve fitting and interpolation
Nikolai Priezjev
 
Chapter 04 answers
Chapter 04 answersChapter 04 answers
Chapter 04 answers
Rajwinder Marock
 
Measurement of Rule-based LTLf Declarative Process Specifications
Measurement of Rule-based LTLf Declarative Process SpecificationsMeasurement of Rule-based LTLf Declarative Process Specifications
Measurement of Rule-based LTLf Declarative Process Specifications
Claudio Di Ciccio
 

Similar to Data Smashing (20)

Geohydrology ii (3)
Geohydrology ii (3)Geohydrology ii (3)
Geohydrology ii (3)
 
CLIM Fall 2017 Course: Statistics for Climate Research, Spatial Data: Models ...
CLIM Fall 2017 Course: Statistics for Climate Research, Spatial Data: Models ...CLIM Fall 2017 Course: Statistics for Climate Research, Spatial Data: Models ...
CLIM Fall 2017 Course: Statistics for Climate Research, Spatial Data: Models ...
 
Interface 2010
Interface 2010Interface 2010
Interface 2010
 
Probability-1.pptx
Probability-1.pptxProbability-1.pptx
Probability-1.pptx
 
NTHU AI Reading Group: Improved Training of Wasserstein GANs
NTHU AI Reading Group: Improved Training of Wasserstein GANsNTHU AI Reading Group: Improved Training of Wasserstein GANs
NTHU AI Reading Group: Improved Training of Wasserstein GANs
 
Families of Triangular Norm Based Kernel Function and Its Application to Kern...
Families of Triangular Norm Based Kernel Function and Its Application to Kern...Families of Triangular Norm Based Kernel Function and Its Application to Kern...
Families of Triangular Norm Based Kernel Function and Its Application to Kern...
 
MVPA with SpaceNet: sparse structured priors
MVPA with SpaceNet: sparse structured priorsMVPA with SpaceNet: sparse structured priors
MVPA with SpaceNet: sparse structured priors
 
Normal probability distribution
Normal probability distributionNormal probability distribution
Normal probability distribution
 
Input analysis
Input analysisInput analysis
Input analysis
 
AJMS_384_22.pdf
AJMS_384_22.pdfAJMS_384_22.pdf
AJMS_384_22.pdf
 
1506 binomial-coefficients
1506 binomial-coefficients1506 binomial-coefficients
1506 binomial-coefficients
 
Intelligent Process Control Using Neural Fuzzy Techniques ~陳奇中教授演講投影片
Intelligent Process Control Using Neural Fuzzy Techniques ~陳奇中教授演講投影片Intelligent Process Control Using Neural Fuzzy Techniques ~陳奇中教授演講投影片
Intelligent Process Control Using Neural Fuzzy Techniques ~陳奇中教授演講投影片
 
Ml presentation
Ml presentationMl presentation
Ml presentation
 
Slides: Jeffreys centroids for a set of weighted histograms
Slides: Jeffreys centroids for a set of weighted histogramsSlides: Jeffreys centroids for a set of weighted histograms
Slides: Jeffreys centroids for a set of weighted histograms
 
QMC: Transition Workshop - Applying Quasi-Monte Carlo Methods to a Stochastic...
QMC: Transition Workshop - Applying Quasi-Monte Carlo Methods to a Stochastic...QMC: Transition Workshop - Applying Quasi-Monte Carlo Methods to a Stochastic...
QMC: Transition Workshop - Applying Quasi-Monte Carlo Methods to a Stochastic...
 
alt klausur
alt klausuralt klausur
alt klausur
 
微積分定理與公式
微積分定理與公式微積分定理與公式
微積分定理與公式
 
Numerical Methods: curve fitting and interpolation
Numerical Methods: curve fitting and interpolationNumerical Methods: curve fitting and interpolation
Numerical Methods: curve fitting and interpolation
 
Chapter 04 answers
Chapter 04 answersChapter 04 answers
Chapter 04 answers
 
Measurement of Rule-based LTLf Declarative Process Specifications
Measurement of Rule-based LTLf Declarative Process SpecificationsMeasurement of Rule-based LTLf Declarative Process Specifications
Measurement of Rule-based LTLf Declarative Process Specifications
 

Recently uploaded

【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
NABLAS株式会社
 
Business update Q1 2024 Lar España Real Estate SOCIMI
Business update Q1 2024 Lar España Real Estate SOCIMIBusiness update Q1 2024 Lar España Real Estate SOCIMI
Business update Q1 2024 Lar España Real Estate SOCIMI
AlejandraGmez176757
 
Tabula.io Cheatsheet: automate your data workflows
Tabula.io Cheatsheet: automate your data workflowsTabula.io Cheatsheet: automate your data workflows
Tabula.io Cheatsheet: automate your data workflows
alex933524
 
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
yhkoc
 
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
nscud
 
一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单
enxupq
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP
 
tapal brand analysis PPT slide for comptetive data
tapal brand analysis PPT slide for comptetive datatapal brand analysis PPT slide for comptetive data
tapal brand analysis PPT slide for comptetive data
theahmadsaood
 
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdfSample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Linda486226
 
Empowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptxEmpowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptx
benishzehra469
 
社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .
NABLAS株式会社
 
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project PresentationPredicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Boston Institute of Analytics
 
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
ewymefz
 
一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单
ewymefz
 
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Subhajit Sahu
 
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
ukgaet
 
一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单
ewymefz
 
standardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghhstandardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghh
ArpitMalhotra16
 
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
correoyaya
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP
 

Recently uploaded (20)

【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
 
Business update Q1 2024 Lar España Real Estate SOCIMI
Business update Q1 2024 Lar España Real Estate SOCIMIBusiness update Q1 2024 Lar España Real Estate SOCIMI
Business update Q1 2024 Lar España Real Estate SOCIMI
 
Tabula.io Cheatsheet: automate your data workflows
Tabula.io Cheatsheet: automate your data workflowsTabula.io Cheatsheet: automate your data workflows
Tabula.io Cheatsheet: automate your data workflows
 
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
 
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
 
一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
 
tapal brand analysis PPT slide for comptetive data
tapal brand analysis PPT slide for comptetive datatapal brand analysis PPT slide for comptetive data
tapal brand analysis PPT slide for comptetive data
 
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdfSample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
 
Empowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptxEmpowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptx
 
社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .
 
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project PresentationPredicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
 
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
 
一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单
 
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
 
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
 
一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单
 
standardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghhstandardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghh
 
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
 

Data Smashing

  • 1. 010100101010111001 010010111010100001 00010010010101001010101010100001 Data Smashing Zero-Knowledge Feature-free Anomaly Detection Data Smashing Zero-Knowledge Feature-free Anomaly Detection Ishanu Chattopadhyay Computation Institute University of Chicago
  • 2. a·nom·a·ly noun something that deviates from what is standard, normal, or expected. “there are a number of anomalies in the present system” synonyms: oddity, peculiarity, abnormality, irregularity, inconsistency, incongruity, aberration, quirk, rarity
  • 4. Stochastic Process Deterministic + Noise 1 · 102 2 · 102 3 · 102 4 · 102 5 · 102 6 · 102 7 · 102 8 · 102 9 · 102 1 · 103 −1 · 101 −5 · 100 0 · 100 5 · 100 1 · 101 time
  • 5. Stochastic Process Deterministic + Noise 1 · 102 2 · 102 3 · 102 4 · 102 5 · 102 6 · 102 7 · 102 8 · 102 9 · 102 1 · 103 −1 · 101 −5 · 100 0 · 100 5 · 100 1 · 101 time
  • 6. Stochastic Process Deterministic + Noise 1 · 102 2 · 102 3 · 102 4 · 102 5 · 102 6 · 102 7 · 102 8 · 102 9 · 102 1 · 103 −1 · 101 −5 · 100 0 · 100 5 · 100 1 · 101 Similar ? time
  • 7. Brainwaves: Identical Stimuli 5 · 101 1 · 102 1.5 · 102 2 · 102 2.5 · 102 −4 · 100 −2 · 100 0 · 100 2 · 100 5 · 101 1 · 102 1.5 · 102 2 · 102 2.5 · 102 −5 · 100 0 · 100 5 · 100 5 · 101 1 · 102 1.5 · 102 2 · 102 2.5 · 102 −5 · 100 0 · 100 5 · 100 5 · 101 1 · 102 1.5 · 102 2 · 102 2.5 · 102 −4 · 100 −2 · 100 0 · 100 2 · 100 4 · 100 time Are they from the same individual?
  • 8. Brainwaves: Identical Stimuli 5 · 101 1 · 102 1.5 · 102 2 · 102 2.5 · 102 −4 · 100 −2 · 100 0 · 100 2 · 100 5 · 101 1 · 102 1.5 · 102 2 · 102 2.5 · 102 −5 · 100 0 · 100 5 · 100 5 · 101 1 · 102 1.5 · 102 2 · 102 2.5 · 102 −5 · 100 0 · 100 5 · 100 5 · 101 1 · 102 1.5 · 102 2 · 102 2.5 · 102 −4 · 100 −2 · 100 0 · 100 2 · 100 4 · 100 time Subject A Subject B
  • 9. State of Art Requires Features Feature 1 Feature 2 Feature 3 Similarity: Distance between feature vectors
  • 10. Data Smashing b c a b c a cababcbababcbbacbcbcbababacacabaccbcbccab· · · Quantize: bcbbbbacabbcbbbbabbbbba· · · Quantize: abbababcbbbabbbabcbbbab· · · Invert: cacacccaaabcacbcacabacaaacacac· · · Signal 1 Signal 2
  • 11. Data Smashing b c a b c a cababcbababcbbacbcbcbababacacabaccbcbccab· · · Quantize: bcbbbbacabbcbbbbabbbbba· · · Quantize: abbababcbbbabbbabcbbbab· · · Invert: cacacccaaabcacbcacabacaaacacac· · · Noise?Noise? Signal 1 Signal 2
  • 12. Data Smashing I. Chattopadhyay and H. Lipson , “Data Smashing: Uncovering Lurking Order In Data”, Royal Society Interface, 2014 vol. 11 no. 101 20140826
  • 23. Anti-streams Intuitive Description Stream s −→ Anti-stream s 0 1 0 0.2 0.4 0.6 0.8 1 0.7 0.3 0 1 0 0.2 0.4 0.6 0.8 1 0.3 0.7
  • 24. Anti-streams Intuitive Description Stream s −→ Anti-stream s 0 1 0 0.2 0.4 0.6 0.8 1 0.7 0.3 0 1 0 0.2 0.4 0.6 0.8 1 0.3 0.7 00 01 10 11 0.2 0.4 0.48 0.24 0.16 0.12 00 01 10 11 0.2 0.4 0.1 0.2 0.3 0.4
  • 25. Anti-streams Intuitive Description Stream s −→ Anti-stream s 0 1 0 0.2 0.4 0.6 0.8 1 0.7 0.3 0 1 0 0.2 0.4 0.6 0.8 1 0.3 0.7 00 01 10 11 0.2 0.4 0.48 0.24 0.16 0.12 00 01 10 11 0.2 0.4 0.1 0.2 0.3 0.4 000 001 010 011 100 101 110 111 0 0.2 0.4 000 001 010 011 100 101 110 111 0 0.1 0.2
  • 26. Quantization Mapping Continuous Data to Symbol Stream Quantization Alphabet Σ = {0, 1, 2, 3}
  • 27. The Black Box Approach Dynamical System As A Symbol Generator 1 1 22 Pr(0|s) = 0.8 Pr(1|s) = 0 Pr(2|s) = 0.2
  • 28. The Black Box Approach Dynamical System As A Symbol Generator 0 0 10 2 22 0 11 2 21 1 22 Pr(0|s) = 0.4 Pr(1|s) = 0.3 Pr(2|s) = 0.3 Ergodic Stationary Quantized Process ⇐⇒ Probability Measure on Infinite Strings
  • 29. System As A Symbol Generator Probability Space (Σω , B, µ) Σω : Set of strictly infinite strings on alphabet Σ B : smallest σ-algebra generated by the sets xΣω : x ∈ Σ µ : Probability measure on infinite strings: µ(xΣω ) → [0, 1] x∈Σ µ(xΣω ) = 1
  • 30. System As A Symbol Generator Probability Space (Σω , B, µ) Σω : Set of strictly infinite strings on alphabet Σ B : smallest σ-algebra generated by the sets xΣω : x ∈ Σ µ : Probability measure on infinite strings: µ(xΣω ) → [0, 1] x∈Σ µ(xΣω ) = 1 Probability Measure on Infinite Strings =⇒ Equivalence Relation on Finite Strings ∀x1, x2 ∈ Σ , x1 ∼ x2 if ∀x ∈ Σ , µ(x1xΣω ) = µ(x2xΣω ) Equivalence Classes are causal states
  • 31. Probabilistic Finite State Automata Models For Quantized Stationary Ergodic Stochastic Processes q1 q2 σ1|0.7 σ0|0.3 σ1|0.9 σ0|0.1 q1 q2 q3q4 σ1|0.7 σ1|0.7 σ0|0.7 σ0|0.7 σ0|0.3 σ1|0.9 σ 1|0.1σ 0|0.1 q1 σ0|0.5 σ1|0.5 ℘1 ℘2 G1 G2 G3 Space Of PFSA Space Of Measures
  • 32. Adding Probability Measures Consider two measures: ℘1 : B → [0, 1] ℘2 : B → [0, 1] Define a binary operation: ℘1 ⊕ ℘2 ℘3 where ℘3(xΣω ) = ℘1(xΣω )℘2(xΣω ) " Constant x∈Σ ℘3(xΣω ) = 1
  • 33. Adding Probability Measures Consider two measures: ℘1 : B → [0, 1] ℘2 : B → [0, 1] Define a binary operation: ℘1 ⊕ ℘2 ℘3 where ℘3(xΣω ) = ℘1(xΣω )℘2(xΣω ) " Constant x∈Σ ℘3(xΣω ) = 1 Commutative Closed Unique Inverse Unique Identity Abelian Group
  • 34. Lifting Group Structure To PFSAs q1 q2 σ1|0.7 σ0|0.3 σ1|0.9 σ0|0.1 q1 q2 q3 σ1 |0.2 σ0 |0.8 σ1|0.3σ0|0.7 σ 1 |0.4 σ0|0.6 q1 q2 q3 q4 q5 q6 σ1|0.86 σ0|0.14 σ1|0.7 σ0|0.3 σ1|0.79 σ0|0.21 σ1|0.57 σ0|0.43 σ0|0.39 σ1|0.61 σ0|0.63 σ1|0.37 ℘1 =℘2 ℘3 H G1 G2 G3Space Of PFSA Space Of Measures
  • 35. Mathematical Structure Of Model Space The Abelian Group The Space of Models has the mathematical structure of an Abelian Group
  • 36. Mathematical Structure Of Model Space The Abelian Group The Space of Models has the mathematical structure of an Abelian Group 1 + 2 = 3 2 − 2 = 0 3 + 0 = 3
  • 37. Mathematical Structure Of Model Space The Abelian Group The Space of Models has the mathematical structure of an Abelian Group 1 + 2 = 3 2 − 2 = 0 3 + 0 = 3 We can “add and subtract” models: G + H = J G − H = K
  • 38. Mathematical Structure Of Model Space The Abelian Group The Space of Models has the mathematical structure of an Abelian Group 1 + 2 = 3 2 − 2 = 0 3 + 0 = 3 We can “add and subtract” models: G + H = J G − H = K G − G =?
  • 39. The Zero Machine q1 q2 σ1|0.3 σ0|0.7 σ1|0.1 σ0|0.9 q1 q2 σ1|0.7 σ0|0.3 σ1|0.9 σ0|0.1 q1 σ0|0.5 σ1|0.5 G −G W (Flat White Noise)+ =
  • 40. The Zero Machine q1 q2 σ1|0.3 σ0|0.7 σ1|0.1 σ0|0.9 q1 q2 σ1|0.7 σ0|0.3 σ1|0.9 σ0|0.1 q1 σ0|0.5 σ1|0.5 G −G W (Flat White Noise)+ = q1 σ0|0.5 σ1|0.5 Zero PFSA for binary alphabet q1 σ0|1/3σ1|1/3 σ2|1/3 Zero PFSA for trinary alphabet
  • 41. The Zero Machine Maximum entropy rate History is useless for prediction Encodes minimum information q1 σ0|0.5 σ1|0.5 Zero PFSA or binary alphabet 1 1 00 Pr(0|s) = 0.5 Pr(1|s) = 0.5
  • 42. Stream Inversion Direct Generation of Anti-streams q1 q2 σ1|0.3 σ0|0.7 σ1|0.1 σ0|0.9 Group-theoretic Inversion Using PFSA model Stream Inversion via selective erasure of s1 q1 q2 σ1|0.7 σ0|0.3 σ1|0.9 σ0|0.1 sequence s1 sequence s −1 G −G Space Of PFSA
  • 43. Annihilation Identity q1 q2 σ1|0.3 σ0|0.7 σ1|0.1 σ0|0.9 q1 q2 σ1|0.7 σ0|0.3 σ1|0.9 σ0|0.1 q1 σ0|0.5 σ1|0.5 sequence s1 sequence s2 sequence s G −G W (Flat White Noise Model) Flat White Noise = Space Of PFSA
  • 44. Example Of Stream, Anti-stream, & FWN 4 Letter Alphabet Signal Inverse Signal Flat White Noise
  • 45. EEG - Epileptic Pathology 5 · 102 1 · 103 1.5 · 103 2 · 103 2.5 · 103 3 · 103 3.5 · 103 4 · 103 −2 · 102 0 · 100 2 · 102 5 · 102 1 · 103 1.5 · 103 2 · 103 2.5 · 103 3 · 103 3.5 · 103 4 · 103 −2 · 102 0 · 100 2 · 102 5 · 102 1 · 103 1.5 · 103 2 · 103 2.5 · 103 3 · 103 3.5 · 103 4 · 103 0 · 100 5 · 102 5 · 102 1 · 103 1.5 · 103 2 · 103 2.5 · 103 3 · 103 3.5 · 103 4 · 103 −1 · 102 0 · 100 1 · 102 time Eyes Open Epileptic Pathology
  • 46. EEG - Epileptic Pathology Pathology Eyes Closed (Normal) Eyes Open (Normal) 100 200 300 100 200 300 Pairwise Distance Matrix 9.9 · 10 −4 1 · 10 −2 1 · 10 −1 0 0.2 0.4 0.6 0 0.2 0.4 0 0.2 x yz 3D Euclidean Embedding Anomaly Normal (Eyes closed) Normal (Eyes open) PlaneA
  • 47. Cardiac Pathology Disambiguate Normal Rhythm from Murmur 1 · 102 2 · 102 3 · 102 4 · 102 5 · 102 6 · 102 −1 · 10−1 −5 · 10−2 0 · 100 5 · 10−2 1 · 102 2 · 102 3 · 102 4 · 102 5 · 102 6 · 102 −5 · 10−1 0 · 100 1 · 102 2 · 102 3 · 102 4 · 102 5 · 102 6 · 102 −4 · 10−1 −2 · 10−1 0 · 100 2 · 10−1 4 · 10−1 1 · 102 2 · 102 3 · 102 4 · 102 5 · 102 6 · 102 −5 · 10−1 0 · 100 5 · 10−1 time Normal Rhythm Murmur
  • 48. Cardiac Pathology 20 40 60 20 40 60 Distance Matrix 0 9 · 10 −5 1 · 10 −2 0 0.05 0.1 0 2 4 6 ·10−20 0.02 x y z 3D Euclidean Embeddin Murmur Healthy
  • 49. EEG Based Biometric Authentication 122 Subjects, 100% Accuracy T8 FT8 F8 AF8 FP2 FPZ FP1 AF7 F7 FT7 T7 TP7 P7 PO7 O1 OZ O2 PO8 P8 TP8 CZ FCZ FZ AFZ CPZ PZ POZ F1 F3 F5 F2 F4 F6 FC1FC3FC5 FC2 FC4 FC6 C1C3C5 C2 C4 C6 CP1CP3CP5 CP2 CP4 CP6 P1 P3 P5 P2 P4 P6 PO1 PO2 AF1 AF2 0.5 0.6 0.7 0.2 0.4 0 0.2 x y z 3D Euclidean Embedding (3 random subjects)
  • 50. EEG Based Biometric Authentication 122 Subjects, 100% Accuracy T8 FT8 F8 AF8 FP2 FPZ FP1 AF7 F7 FT7 T7 TP7 P7 PO7 O1 OZ O2 PO8 P8 TP8 CZ FCZ FZ AFZ CPZ PZ POZ F1 F3 F5 F2 F4 F6 FC1FC3FC5 FC2 FC4 FC6 C1C3C5 C2 C4 C6 CP1CP3CP5 CP2 CP4 CP6 P1 P3 P5 P2 P4 P6 PO1 PO2 AF1 AF2 0.5 0.6 0.7 0.2 0.4 0 0.2 x y z 3D Euclidean Embedding (3 random subjects)
  • 51. Classification of Variable Stars Optical Gravitational Lensing Experiment (OGLE) database Cepheid RR Lyrae 0 5,000 10,000 0 5,000 10,000 Distance Matrix 9.9 · 10 −4 1 · 10 −2 1 · 10 −1 0 0.2 0.4 0.6 0 0.2 0.4 0 0.2 0.4 x yz 3D Euclidean Embedding RRL Cepheids
  • 53. Self Annihilation Error −1 Sequence s Flat White Noise Only if |s| is large enough !
  • 55. Self Annihilation Error −1 Sequence s s s = Θ(s , W) s → 0 exponentially fast with |s| Information content of stream
  • 56. Self Annihilation Error −1 Sequence s s s = Θ(s , W) Auto-detect Data Sufficiency!
  • 57. Cognitive Fingerprinting User Authentication From Keypress Dynamics 100 200 300 400 −1000 0 1000 100 200 300 400 −1000 0 1000 2000 100 200 300 400 −2000 0 2000 100 200 300 400 −1000 0 1000 100 200 300 400 −1000 0 1000 100 200 300 400 −2000 0 2000 100 200 300 400 −1000 0 1000 2000 100 200 300 400 −1000 0 1000 100 200 300 400 −1000 0 1000 100 200 300 400 −1000 0 1000 JOHNCHERYL 2 4 6 8 10 12 14 16 2 4 6 8 10 12 14 16 CHERYL JASON JOHN Time-series of keypress delays Random text
  • 58. Cognitive Fingerprinting User Authentication From Keypress Dynamics 10 1 10 2 10 −1 Threshold ACCEPT # of Key Press ACCEPT Data Sufficiency Test Authentication Test 10 1 10 2 10 −1 Threshold Data Sufficiency Test Authentication Test REJECT # of Key Press User Authorized User Rejected
  • 59. Cognitive Fingerprinting User Authentication From Keypress Dynamics 10 1 10 2 10 −1 Threshold # of Key Press ACCEPT REJECT USER CHANGE Authorization revoked on unexpected user change detection
  • 60. Physical Dexterity Quantification Dynamic Regulation of Instabilities: Athletes vs Amateurs Francisco Valero-Cuevas Brain-body Dynamics Lab Univ. of Southern California −0.6 −0.4 −0.2 0 0.2 −0.5 0 0.5 1 −1 0 1 −0.4 −0.2 0 0.2 0.4 −0.4 −0.2 0 0.2 0.4 0 1,000 2,000 3,000 4,000 5,000 −2 0 2
  • 61. Physical Dexterity Quantification Dynamic Regulation of Instabilities: Athletes vs Amateurs 50 100 150 200 50 100 150 200 Pairwise Distance Matrix 0.02 · 10−14 · 10−1 0.0 5 · 10−2 1 · 10−1 0.0 5 · 10−2 x y z 3D Euclidean Embedding
  • 62. Physical Dexterity Quantification Dynamic Regulation of Instabilities: Athletes vs Amateurs 50 100 150 50 100 150 Pairwise Distance Matrix 180 200 180 200 0.02 · 10−14 · 10−1 0.0 5 · 10−2 1 · 10−1 0.0 5 · 10−2 x y z 3D Euclidean Embedding
  • 63. Handwritten Digits Recognition / Classification Problem
  • 64. Handwritten Digits Recognition / Classification Problem
  • 65. Handwritten Digits Recognition / Classification Problem
  • 66. Handwritten Digits Recognition / Classification Problem
  • 67. State of Art Approaches Find Clever Representations
  • 68. State of Art Approaches Find Clever Representations
  • 69. State of Art Approaches Find Clever Representations
  • 70. State of Art Approaches Find Clever Representations Feature Vector 1 Feature Vector 2 Feature Vector 16
  • 71. Image To Symbol Stream Random Walk
  • 72. Handwritten Digits Recognition / Classification Problem 4 9 100 200 300 100 200 300 Pairwise Distance Matrix 00.20.40.6 0 0.2 0.4 0.6 0 0.2 0.4 0.6 x y z 3D Euclidean Embedding
  • 73. Memory-less Stream Manipulation Generating First Independent Copy for String s1: 2 0 s1: FWN: Copy 1:
  • 74. Memory-less Stream Manipulation Generating First Independent Copy for String s1: 2 0 s1: FWN: Copy 1: 1 1 1
  • 75. Memory-less Stream Manipulation Generating First Independent Copy for String s1: 2 0 s1: FWN: Copy 1: 1 2 1 2 1 2
  • 76. Memory-less Stream Manipulation Generating First Independent Copy for String s1: 2 0 0 2 s1: FWN: Copy 1: 1 2 1 2 1 2
  • 77. Memory-less Stream Manipulation Generating First Independent Copy for String s1: 2 0 1 0 2 2 s1: FWN: Copy 1: 1 2 1 2 1 2
  • 78. Memory-less Stream Manipulation Generating First Independent Copy for String s1: 2 0 1 0 2 2 1 2 1 1 0 0 2 0 0 2 2 1 0 0 2 1 2 2 2 2 0 2 s1: FWN: Copy 1: 1 2 2 0 0 0 1 0 0 1 0 2 1 2 2 0 0 0 1 0 0 1 0 2 1 2 2 0 0 0 1 0 0 1 0 2 1 0 2 2 2 1 2 2 1 2 2 1 2 1 Generating Second Independent Copy for String s1: 2 0 s1: FWN: Copy 2:
  • 79. Memory-less Stream Manipulation Generating First Independent Copy for String s1: 2 0 1 0 2 2 1 2 1 1 0 0 2 0 0 2 2 1 0 0 2 1 2 2 2 2 0 2 s1: FWN: Copy 1: 1 2 2 0 0 0 1 0 0 1 0 2 1 2 2 0 0 0 1 0 0 1 0 2 1 2 2 0 0 0 1 0 0 1 0 2 1 0 2 2 2 1 2 2 1 2 2 1 2 1 Generating Second Independent Copy for String s1: 2 1 1 0 2 0 0 1 2 0 1 0 1 0 2 0 1 0 2 0 0 0 2 0 2 2 2 1 2 0 1 0 2 0 2 2 2 0 s1: FWN: Copy 2: 2 0 2 2 1 0 0 2 0 2 2 1 0 0 2 0 2 2 1 0 0 2 2 2 1 2 2 0 1 2 2 1 2 1 2
  • 80. Memory-less Stream Manipulation Generating Inverse of String s1: 1 Inverse: Copy 2: Copy 1: 2 0 0
  • 81. Memory-less Stream Manipulation Generating Inverse of String s1: 1 2 Inverse: Copy 2: Copy 1: 2 0 0 1 0 1
  • 82. Memory-less Stream Manipulation Generating Inverse of String s1: 2 2 1 2 Inverse: Copy 2: Copy 1: 2 0 0 1 0 1
  • 83. Memory-less Stream Manipulation Generating Inverse of String s1: 2 2 1 2 0 Inverse: Copy 2: Copy 1: 2 0 2 0 1 1 0 1 1
  • 84. Memory-less Stream Manipulation Generating Inverse of String s1: 2 0 2 0 2 2 1 2 2 0 2 0 2 2 1 2 1 2 0 0 1 0 0 1 0 1 2 2 1 2 2 1 2 1 Inverse: Copy 2: Copy 1: 2 0 2 1 0 2 2 2 1 2 1 1 2 0 1 1 2 2 1 1 0 2 0 0 0 0 0 1 1 2 2 1 1 0 2 0 0 0 0 Summing s2 with Inverse of s1 2 2 1 2 1 0 2 2 2 1 0 0 2 1 2 2 0 1 2 1 0 2 0 0 0 · · · · · · Inverse of String s1: String s2: Result: 1 2 1 0 1 2 1 0 1 2 1 0
  • 85. Summing It Up! Notion of Universal Similarity Zero-knowledge Feature-free Anomaly Detection
  • 86.
  • 87. Patents held by Cornell University 6259-01-US Stochastic Automata for Earthquake Prediction from Large Scale Surveys 6259-02-PC Systems and Methods for Abductive Learning of Quantized Stochastic Process 6024-03-PC System and Methods for Analysis of Data PCT/US13/62397 6998-01-US Causality Network Construction Algorithm (Application no. 62170063, EFS ID 2517508)