SlideShare a Scribd company logo
1 of 47
Compressed Learning for Time
Series Classification
Shueh-Han Shih
Department of Computer Science and Information
Engineering, National Taiwan University of Science and
Technology
Outline
• Introduction
• Compressed sensing
• Sparse representation - envelope
• Classification framework
• Experimental results
• Case study
• Conclusion
Outline
• Introduction
• Compressed sensing
• Sparse representation - envelope
• Classification framework
• Experimental results
• Case study
• Conclusion
Motivation
Image: http://www.aeris.com/
Motivation (cont’d)
• The key to handle time series data effectively is
choosing a suitable representation
• Transmission and storage issues are critical in IoT
scenario
• To provide interpretable result for human is
important
 Time series sparse representation - envelope
Time series data type
1. Symbolic sequence
2. Complex symbolic sequence
3. Simple time series
4. Multivariate time series
A brief survey on sequence classification Z Xing, J Pei, E Keogh - ACM SIGKDD , 2010
Classification of time series
• Assigning instances to one of the predefined classes.
0 20 40 60 80 100 120 140 160 180 200
-200
-150
-100
-50
0
50
100
150
200
Class 1
0 20 40 60 80 100 120 140 160 180 200
-200
-150
-100
-50
0
50
100
150
200
Class 2
0 20 40 60 80 100 120 140 160 180 200
-200
-150
-100
-50
0
50
100
150
200
Class 3
0 20 40 60 80 100 120 140 160 180 200
-200
-150
-100
-50
0
50
100
150
200
Class 4
Time series classification approaches
• Feature based
• Sequence distance based
• Model based
A brief survey on sequence classification Z Xing, J Pei, E Keogh - ACM SIGKDD, 2010
Outline
• Introduction
• Compressed sensing
• Sparse representation - envelope
• Classification framework
• Experimental results
• Case study
• Conclusion
Conventional approach
• Number of sample needed for compressed sensing is
much more lower than Nyquist frequency.
Image: http://www.ni.com/
Main idea
• Most real-world signals are sparse in some basis
A𝑥 = 𝑦, A ∈ ℝ 𝑝×𝑛 𝑎𝑛𝑑 𝑝 ≪ 𝑛
• Dramatically reduce the transmission loading
a measure
Requirements of compressed sensing
1. 𝑥 should be a 𝑘-sparse signal
 1 to 1 relation between data and compressed domain
2. A must satisfies the restricted isometry property
 (1 − δ 𝑝)ǁ𝑥ǁ2
2
≤ ǁA𝑥ǁ2
2
≤ (1 + δ 𝑝)ǁ𝑥ǁ2
2
A =
𝑟𝑎𝑛𝑑𝑛 𝑝,𝑛
𝑛
(mean=0, 𝜎 =
1
𝑛
)
for some constant 𝛿 𝑝 ∈ (0, 1
Image: Mostafa Mohsenvand Projects
Basic routine
𝑨 = 𝑸
𝑨′ = 𝑸𝑷
𝒚 = 𝑨𝒙(𝒐𝒓 𝑨′
𝒔)
transmission
• Postpone the computational cost to recovery stage
Learning in the compressed domain
• Perform task without recovery
• SVM can keep the learnability
in compressed domain
• Reduce model complexity
Image: Compressed learning: Universal sparse dimensionality reduction and learning in the
measurement domain. R Calderbank, S Jafarpour, R Schapire - preprint, 2009 - dsp.rice.edu
Outline
• Introduction
• Compressed sensing
• Sparse representation - envelope
• Classification framework
• Experimental results
• Case study
• Conclusion
The origin
• The ‘envelope’ in finance
Image: http://www.investopedia.com/
Preliminaries
• A time series
– T = 𝑡1, 𝑡2, … 𝑡𝑗 … 𝑡 𝑛 , 𝑡𝑗 ∈ ℝ
• A time series dataset
– D = T 𝑖 | T 𝑖 = 𝑡1
𝑖
, 𝑡2
𝑖
, … 𝑡 𝑛
𝑖 , 𝑖 = 1 𝑡𝑜 𝑚
• Well-synchronized with the same length
– A set of random sample from random variables 𝐓1, 𝐓2, … 𝐓𝑗, … 𝐓𝑛
Envelope creation
• Given 𝐷, envelope with size 𝑘
– E 𝑘 = 𝑍 𝑍 = 𝑧1, 𝑧2, … 𝑧 𝑛 , 𝑧𝑗 − 𝜇 𝑗 ≤ 𝑘 ∙ 𝑠𝑡𝑑𝑗 , ∀ 𝑧𝑗 ∈ ℝ}
• 𝜇 𝑗 = 𝑚𝑒𝑎𝑛(𝑻𝑗) , 𝑠𝑡𝑑𝑗 = 𝑠𝑡𝑑(𝑻𝑗)
– Profiling the time series dataset
Envelope encoding
• Encoding time series T as a sparse series S
• Sparsity indicates the similarity of a time series and 𝐷
𝑠𝑗 = 1, 𝑖𝑓 𝑡𝑗 > 𝜇 𝑗 + 𝑘 ∙ 𝑠𝑡𝑑𝑗
𝑠𝑗 = −1, 𝑖𝑓 𝑡𝑗 < 𝜇 𝑗 − 𝑘 ∙ 𝑠𝑡𝑑𝑗
𝑠𝑗 = 0, 𝑜𝑡ℎ𝑒𝑟𝑤𝑖𝑠𝑒
, 𝑓𝑜𝑟 𝑗 = 1 𝑡𝑜 𝑛
Guarantee of sparsity
• According to Chebyshev’s inequality,
– Pr(|X − 𝜇| ≤ 𝑘𝜎) ≥ 1 −
1
𝑘2
– No matter what kind of distribution for 𝑻𝑗
Outline
• Introduction
• Compressed sensing
• Sparse representation - envelope
• Classification framework
• Experimental results
• Case study
• Conclusion
Determination of 𝑘
• 𝑘 affects the effectiveness of envelope representation
Determination of 𝑘 (cont’d)
• Focus on time series multi-class classification
– Envelope representation should be discriminative
𝑘∗ = arg max
𝑘
(−𝑎 𝑘 + 𝜆 ∙ 𝑏 𝑘)
– 𝑘 : tradeoff between sparsity and distinguishability
Encoding result visualization
• Sparsity indicates similarity
Sparse property
⟹ transmission
efficiency
• Encoding results
are interpretable
ECGFiveDays from UCR
Envelope representation workflow
• From raw series to feature
Overall workflow
• Classification scheme
Outline
• Introduction
• Compressed sensing
• Sparse representation - envelope
• Classification framework
• Experimental results
• Case study
• Conclusion
1. Proposed method vs. state-of-art method on
classification task
2. Compressibility of envelope representation
with compressed sensing
3. Noise resistance of envelope representation
4. Time efficiency
5. Space efficiency
Classification performance
• Benchmark dataset from UCR (5/42)
Dataset/ Algorithm Number of
classes
Size of training
set
Size of testing
set
Time
series Length
CBF 3 30 900 128
Coffee 2 28 28 286
ECGFiveDays 2 23 861 136
ItalyPowerDemand 2 67 1029 24
Sony II 2 27 953 65
Classification performance (cont’d)
• Result on benchmark dataset (5/42)
– Win:9 / lose:18 / between:15 (close:12)
– Not the case with IoT scenario, never lack of data
Dataset/ Algorithm 1NN-Euclidean 1NN-DTW (best, noWin) Envelope+
linearSVM
CBF 85.2(0.9357) 99.6/99.7 90.66
Coffee 75(0.031608) 82.1/82.1 85.71
ECGFiveDays 79.7(0.8758) 79.7/76.8 88.38
ItalyPowerDemand 95.5(1.0661) 95.5/95 97.08
Sony II 69.5(0.9986) 69.5/72.5 82.79
Influence of compression ratio
• Compression ratio = 𝑝/𝑛
(Number of measurements) / (data dimension)
Influence of compression ratio (cont’d)
• Using nearly
1
3
datasets from UCR
– Some datasets have excellent compressibility
Influence of compression ratio (cont’d)
• Result on benchmark dataset (5/42)
Dataset/ Algorithm 1NN-
Euclidean
1NN-DTW
(best, noWin)
Compression
ratio=10%
Compression
ratio=20%
Compression
ratio=50%
CBF 85.2 99.6/99.7 80.44 88.22 88.44
Coffee 75 82.1/82.1 71.42 82.14 89.28
ECGFiveDays 79.7 79.7/76.8 78.86 81.3 81.64
ItalyPowerDemand 95.5 95.5/95 86.58 91.73 93.97
Sony II 69.5 69.5/72.5 76.91 78.38 80.06
Robustness to noise
• Noise level - SNR
Image: documentation.meraki.com
Robustness to noise (cont’d)
• Using ECG200 dataset as example
The original envelope The envelope with noise level SNR=10.
Robustness to noise (cont’d)
• Envelope representation is noise-resistant
– Can even ignore denoising stage
Envelope built/SVM trained with clean data Envelope built/SVM trained with noisy data
Time efficiency
1. Building envelope takes O(m*n)
2. Encoding each instance takes O(n)
3. Linear SVM, expects to be O(m2)
 Linear time in prediction
0 2 4 6 8 10 12 14 16
0
2
4
6
8
10
12
14
16
Execution time (testing)
envelope (Sec.)
KNN+ED(Sec.)
Space efficiency
1. 32 to (2 ∗ #𝑐𝑙𝑎𝑠𝑠) ratio of reduction
2. 32 to (32 ∗ #𝑐𝑙𝑎𝑠𝑠 ∗ 𝑐𝑜𝑚𝑝𝑟𝑒𝑠𝑠𝑖𝑜𝑛 𝑟𝑎𝑡𝑖𝑜) ratio of
reduction through compressed sensing
3. Run length encoding
Outline
• Introduction
• Compressed sensing
• Sparse representation - envelope
• Classification framework
• Experimental results
• Case study
• Conclusion
Smart home project
• Passive user identification
Image: www.bitronvideo.eu
Using sensor
• Data collection
– EcoBT Mini
– 33Hz
50 100 150 200 250
-0.1
-0.05
0
0.05
0.1
0.15
0.2
0.25
50 100 150 200 250
-0.4
-0.3
-0.2
-0.1
0
0.1
0.2
0.3
50 100 150 200 250
-0.4
-0.3
-0.2
-0.1
0
0.1
0.2
0.3
50 100 150 200 250
-80
-60
-40
-20
0
20
40
60
80
50 100 150 200 250
-200
-150
-100
-50
0
50
100
150
200
50 100 150 200 250
-200
-150
-100
-50
0
50
100
150
Door opening recognition
• User identification
Door opening recognition (cont’d)
• Recognition performance
– Left: axis 5 Right: axis 1&5
0 20 40 60 80 100 120 140 160 180 200
-200
-150
-100
-50
0
50
100
150
200
Class 1
0 20 40 60 80 100 120 140 160 180 200
-200
-150
-100
-50
0
50
100
150
200
Class 2
0 20 40 60 80 100 120 140 160 180 200
-200
-150
-100
-50
0
50
100
150
200
Class 3
0 20 40 60 80 100 120 140 160 180 200
-200
-150
-100
-50
0
50
100
150
200
Class 4
Gait recognition
• Via slipper
Image: www.footbionics.com
Gait recognition(cont’d)
• Recognition capability
– Left: axis 2 Right: axis 2&3&4
0 5 10 15 20 25 30 35 40
-1
-0.5
0
0.5
1
1.5
2
Class 1
0 5 10 15 20 25 30 35 40
-1
-0.5
0
0.5
1
1.5
2
Class 2
0 5 10 15 20 25 30 35 40
-1
-0.5
0
0.5
1
1.5
2
Class 3
0 5 10 15 20 25 30 35 40
-1
-0.5
0
0.5
1
1.5
2
Class 4
Demonstration
• Demo workflow
Demonstration(cont’d)
Conclusion
• Propose a sparse representation for time series
• Propose a heuristic to determine envelope size 𝑘
• Effectiveness, efficiency, robustness verification
• Real-world use case

More Related Content

What's hot

Una introducción a la minería de series temporales
Una introducción a la minería de series temporalesUna introducción a la minería de series temporales
Una introducción a la minería de series temporalesFacultad de Informática UCM
 
Sergei Vassilvitskii, Research Scientist, Google at MLconf NYC - 4/15/16
Sergei Vassilvitskii, Research Scientist, Google at MLconf NYC - 4/15/16Sergei Vassilvitskii, Research Scientist, Google at MLconf NYC - 4/15/16
Sergei Vassilvitskii, Research Scientist, Google at MLconf NYC - 4/15/16MLconf
 
Fast Single-pass K-means Clusterting at Oxford
Fast Single-pass K-means Clusterting at Oxford Fast Single-pass K-means Clusterting at Oxford
Fast Single-pass K-means Clusterting at Oxford MapR Technologies
 
NVIDIA 深度學習教育機構 (DLI): Image segmentation with tensorflow
NVIDIA 深度學習教育機構 (DLI): Image segmentation with tensorflowNVIDIA 深度學習教育機構 (DLI): Image segmentation with tensorflow
NVIDIA 深度學習教育機構 (DLI): Image segmentation with tensorflowNVIDIA Taiwan
 
Neural Networks: Support Vector machines
Neural Networks: Support Vector machinesNeural Networks: Support Vector machines
Neural Networks: Support Vector machinesMostafa G. M. Mostafa
 
Svm map reduce_slides
Svm map reduce_slidesSvm map reduce_slides
Svm map reduce_slidesSara Asher
 
Stochastic gradient descent and its tuning
Stochastic gradient descent and its tuningStochastic gradient descent and its tuning
Stochastic gradient descent and its tuningArsalan Qadri
 
Anil Thomas - Object recognition
Anil Thomas - Object recognitionAnil Thomas - Object recognition
Anil Thomas - Object recognitionIntel Nervana
 
Large scale logistic regression and linear support vector machines using spark
Large scale logistic regression and linear support vector machines using sparkLarge scale logistic regression and linear support vector machines using spark
Large scale logistic regression and linear support vector machines using sparkMila, Université de Montréal
 
Support Vector Machines Simply
Support Vector Machines SimplySupport Vector Machines Simply
Support Vector Machines SimplyEmad Nabil
 
Stochastic Alternating Direction Method of Multipliers
Stochastic Alternating Direction Method of MultipliersStochastic Alternating Direction Method of Multipliers
Stochastic Alternating Direction Method of MultipliersTaiji Suzuki
 
MLHEP Lectures - day 2, basic track
MLHEP Lectures - day 2, basic trackMLHEP Lectures - day 2, basic track
MLHEP Lectures - day 2, basic trackarogozhnikov
 
Paper Study: Melding the data decision pipeline
Paper Study: Melding the data decision pipelinePaper Study: Melding the data decision pipeline
Paper Study: Melding the data decision pipelineChenYiHuang5
 
DRAW: Deep Recurrent Attentive Writer
DRAW: Deep Recurrent Attentive WriterDRAW: Deep Recurrent Attentive Writer
DRAW: Deep Recurrent Attentive WriterMark Chang
 
Neural Networks: Model Building Through Linear Regression
Neural Networks: Model Building Through Linear RegressionNeural Networks: Model Building Through Linear Regression
Neural Networks: Model Building Through Linear RegressionMostafa G. M. Mostafa
 
MLHEP Lectures - day 1, basic track
MLHEP Lectures - day 1, basic trackMLHEP Lectures - day 1, basic track
MLHEP Lectures - day 1, basic trackarogozhnikov
 

What's hot (20)

Una introducción a la minería de series temporales
Una introducción a la minería de series temporalesUna introducción a la minería de series temporales
Una introducción a la minería de series temporales
 
Sergei Vassilvitskii, Research Scientist, Google at MLconf NYC - 4/15/16
Sergei Vassilvitskii, Research Scientist, Google at MLconf NYC - 4/15/16Sergei Vassilvitskii, Research Scientist, Google at MLconf NYC - 4/15/16
Sergei Vassilvitskii, Research Scientist, Google at MLconf NYC - 4/15/16
 
ACM 2013-02-25
ACM 2013-02-25ACM 2013-02-25
ACM 2013-02-25
 
12
1212
12
 
Fast Single-pass K-means Clusterting at Oxford
Fast Single-pass K-means Clusterting at Oxford Fast Single-pass K-means Clusterting at Oxford
Fast Single-pass K-means Clusterting at Oxford
 
NVIDIA 深度學習教育機構 (DLI): Image segmentation with tensorflow
NVIDIA 深度學習教育機構 (DLI): Image segmentation with tensorflowNVIDIA 深度學習教育機構 (DLI): Image segmentation with tensorflow
NVIDIA 深度學習教育機構 (DLI): Image segmentation with tensorflow
 
Neural Networks: Support Vector machines
Neural Networks: Support Vector machinesNeural Networks: Support Vector machines
Neural Networks: Support Vector machines
 
Svm map reduce_slides
Svm map reduce_slidesSvm map reduce_slides
Svm map reduce_slides
 
Stochastic gradient descent and its tuning
Stochastic gradient descent and its tuningStochastic gradient descent and its tuning
Stochastic gradient descent and its tuning
 
Anil Thomas - Object recognition
Anil Thomas - Object recognitionAnil Thomas - Object recognition
Anil Thomas - Object recognition
 
Large scale logistic regression and linear support vector machines using spark
Large scale logistic regression and linear support vector machines using sparkLarge scale logistic regression and linear support vector machines using spark
Large scale logistic regression and linear support vector machines using spark
 
Support Vector Machines Simply
Support Vector Machines SimplySupport Vector Machines Simply
Support Vector Machines Simply
 
Stochastic Alternating Direction Method of Multipliers
Stochastic Alternating Direction Method of MultipliersStochastic Alternating Direction Method of Multipliers
Stochastic Alternating Direction Method of Multipliers
 
MLHEP Lectures - day 2, basic track
MLHEP Lectures - day 2, basic trackMLHEP Lectures - day 2, basic track
MLHEP Lectures - day 2, basic track
 
presentation_btp
presentation_btppresentation_btp
presentation_btp
 
Paper Study: Melding the data decision pipeline
Paper Study: Melding the data decision pipelinePaper Study: Melding the data decision pipeline
Paper Study: Melding the data decision pipeline
 
DRAW: Deep Recurrent Attentive Writer
DRAW: Deep Recurrent Attentive WriterDRAW: Deep Recurrent Attentive Writer
DRAW: Deep Recurrent Attentive Writer
 
thesis
thesisthesis
thesis
 
Neural Networks: Model Building Through Linear Regression
Neural Networks: Model Building Through Linear RegressionNeural Networks: Model Building Through Linear Regression
Neural Networks: Model Building Through Linear Regression
 
MLHEP Lectures - day 1, basic track
MLHEP Lectures - day 1, basic trackMLHEP Lectures - day 1, basic track
MLHEP Lectures - day 1, basic track
 

Viewers also liked

期末最終報告 NTUST 大學專題
期末最終報告 NTUST 大學專題期末最終報告 NTUST 大學專題
期末最終報告 NTUST 大學專題學翰 施
 
人工智慧 & 機器學習 bacon number
人工智慧 & 機器學習 bacon number人工智慧 & 機器學習 bacon number
人工智慧 & 機器學習 bacon number學翰 施
 
Presentation for interview
Presentation for interviewPresentation for interview
Presentation for interview學翰 施
 
What Makes Great Infographics
What Makes Great InfographicsWhat Makes Great Infographics
What Makes Great InfographicsSlideShare
 
Masters of SlideShare
Masters of SlideShareMasters of SlideShare
Masters of SlideShareKapost
 
STOP! VIEW THIS! 10-Step Checklist When Uploading to Slideshare
STOP! VIEW THIS! 10-Step Checklist When Uploading to SlideshareSTOP! VIEW THIS! 10-Step Checklist When Uploading to Slideshare
STOP! VIEW THIS! 10-Step Checklist When Uploading to SlideshareEmpowered Presentations
 
10 Ways to Win at SlideShare SEO & Presentation Optimization
10 Ways to Win at SlideShare SEO & Presentation Optimization10 Ways to Win at SlideShare SEO & Presentation Optimization
10 Ways to Win at SlideShare SEO & Presentation OptimizationOneupweb
 
How To Get More From SlideShare - Super-Simple Tips For Content Marketing
How To Get More From SlideShare - Super-Simple Tips For Content MarketingHow To Get More From SlideShare - Super-Simple Tips For Content Marketing
How To Get More From SlideShare - Super-Simple Tips For Content MarketingContent Marketing Institute
 
How to Make Awesome SlideShares: Tips & Tricks
How to Make Awesome SlideShares: Tips & TricksHow to Make Awesome SlideShares: Tips & Tricks
How to Make Awesome SlideShares: Tips & TricksSlideShare
 

Viewers also liked (14)

期末最終報告 NTUST 大學專題
期末最終報告 NTUST 大學專題期末最終報告 NTUST 大學專題
期末最終報告 NTUST 大學專題
 
人工智慧 & 機器學習 bacon number
人工智慧 & 機器學習 bacon number人工智慧 & 機器學習 bacon number
人工智慧 & 機器學習 bacon number
 
Presentation for interview
Presentation for interviewPresentation for interview
Presentation for interview
 
Pmi servizio pca
Pmi servizio   pcaPmi servizio   pca
Pmi servizio pca
 
Pmi servizio mb
Pmi servizio   mbPmi servizio   mb
Pmi servizio mb
 
Las libertades del aire grupo 5
Las libertades del aire grupo 5Las libertades del aire grupo 5
Las libertades del aire grupo 5
 
Pmi servizio sbr
Pmi servizio   sbrPmi servizio   sbr
Pmi servizio sbr
 
What Makes Great Infographics
What Makes Great InfographicsWhat Makes Great Infographics
What Makes Great Infographics
 
Masters of SlideShare
Masters of SlideShareMasters of SlideShare
Masters of SlideShare
 
STOP! VIEW THIS! 10-Step Checklist When Uploading to Slideshare
STOP! VIEW THIS! 10-Step Checklist When Uploading to SlideshareSTOP! VIEW THIS! 10-Step Checklist When Uploading to Slideshare
STOP! VIEW THIS! 10-Step Checklist When Uploading to Slideshare
 
You Suck At PowerPoint!
You Suck At PowerPoint!You Suck At PowerPoint!
You Suck At PowerPoint!
 
10 Ways to Win at SlideShare SEO & Presentation Optimization
10 Ways to Win at SlideShare SEO & Presentation Optimization10 Ways to Win at SlideShare SEO & Presentation Optimization
10 Ways to Win at SlideShare SEO & Presentation Optimization
 
How To Get More From SlideShare - Super-Simple Tips For Content Marketing
How To Get More From SlideShare - Super-Simple Tips For Content MarketingHow To Get More From SlideShare - Super-Simple Tips For Content Marketing
How To Get More From SlideShare - Super-Simple Tips For Content Marketing
 
How to Make Awesome SlideShares: Tips & Tricks
How to Make Awesome SlideShares: Tips & TricksHow to Make Awesome SlideShares: Tips & Tricks
How to Make Awesome SlideShares: Tips & Tricks
 

Similar to Compressed learning for time series classification

Efficient Implementation of Self-Organizing Map for Sparse Input Data
Efficient Implementation of Self-Organizing Map for Sparse Input DataEfficient Implementation of Self-Organizing Map for Sparse Input Data
Efficient Implementation of Self-Organizing Map for Sparse Input Dataymelka
 
Efficient anomaly detection via matrix sketching
Efficient anomaly detection via matrix sketchingEfficient anomaly detection via matrix sketching
Efficient anomaly detection via matrix sketchingHsing-chuan Hsieh
 
Expert estimation from Multimodal Features
Expert estimation from Multimodal FeaturesExpert estimation from Multimodal Features
Expert estimation from Multimodal FeaturesXavier Ochoa
 
Hardware Acceleration for Machine Learning
Hardware Acceleration for Machine LearningHardware Acceleration for Machine Learning
Hardware Acceleration for Machine LearningCastLabKAIST
 
Accelerating Dynamic Time Warping Subsequence Search with GPU
Accelerating Dynamic Time Warping Subsequence Search with GPUAccelerating Dynamic Time Warping Subsequence Search with GPU
Accelerating Dynamic Time Warping Subsequence Search with GPUDavide Nardone
 
Deep Learning Approach in Characterizing Salt Body on Seismic Images - by Zhe...
Deep Learning Approach in Characterizing Salt Body on Seismic Images - by Zhe...Deep Learning Approach in Characterizing Salt Body on Seismic Images - by Zhe...
Deep Learning Approach in Characterizing Salt Body on Seismic Images - by Zhe...Yan Xu
 
Fuzzy c means clustering protocol for wireless sensor networks
Fuzzy c means clustering protocol for wireless sensor networksFuzzy c means clustering protocol for wireless sensor networks
Fuzzy c means clustering protocol for wireless sensor networksmourya chandra
 
Time series data mining techniques
Time series data mining techniquesTime series data mining techniques
Time series data mining techniquesShanmukha S. Potti
 
Introduction to Chainer Chemistry
Introduction to Chainer ChemistryIntroduction to Chainer Chemistry
Introduction to Chainer ChemistryPreferred Networks
 
Virus, Vaccines, Genes and Quantum - 2020-06-18
Virus, Vaccines, Genes and Quantum - 2020-06-18Virus, Vaccines, Genes and Quantum - 2020-06-18
Virus, Vaccines, Genes and Quantum - 2020-06-18Aritra Sarkar
 
Vlsiphysicaldesignautomationonpartitioning 120219012744-phpapp01
Vlsiphysicaldesignautomationonpartitioning 120219012744-phpapp01Vlsiphysicaldesignautomationonpartitioning 120219012744-phpapp01
Vlsiphysicaldesignautomationonpartitioning 120219012744-phpapp01Hemant Jha
 
Hanjun Dai, PhD Student, School of Computational Science and Engineering, Geo...
Hanjun Dai, PhD Student, School of Computational Science and Engineering, Geo...Hanjun Dai, PhD Student, School of Computational Science and Engineering, Geo...
Hanjun Dai, PhD Student, School of Computational Science and Engineering, Geo...MLconf
 
Cvpr 2018 papers review (efficient computing)
Cvpr 2018 papers review (efficient computing)Cvpr 2018 papers review (efficient computing)
Cvpr 2018 papers review (efficient computing)DonghyunKang12
 
Trackster Pruning at the CMS High-Granularity Calorimeter
Trackster Pruning at the CMS High-Granularity CalorimeterTrackster Pruning at the CMS High-Granularity Calorimeter
Trackster Pruning at the CMS High-Granularity CalorimeterYousef Fadila
 
Deep Learning Based Voice Activity Detection and Speech Enhancement
Deep Learning Based Voice Activity Detection and Speech EnhancementDeep Learning Based Voice Activity Detection and Speech Enhancement
Deep Learning Based Voice Activity Detection and Speech EnhancementNAVER Engineering
 
Paper study: Learning to solve circuit sat
Paper study: Learning to solve circuit satPaper study: Learning to solve circuit sat
Paper study: Learning to solve circuit satChenYiHuang5
 

Similar to Compressed learning for time series classification (20)

Efficient Implementation of Self-Organizing Map for Sparse Input Data
Efficient Implementation of Self-Organizing Map for Sparse Input DataEfficient Implementation of Self-Organizing Map for Sparse Input Data
Efficient Implementation of Self-Organizing Map for Sparse Input Data
 
rgDefense
rgDefensergDefense
rgDefense
 
Efficient anomaly detection via matrix sketching
Efficient anomaly detection via matrix sketchingEfficient anomaly detection via matrix sketching
Efficient anomaly detection via matrix sketching
 
Expert estimation from Multimodal Features
Expert estimation from Multimodal FeaturesExpert estimation from Multimodal Features
Expert estimation from Multimodal Features
 
Hardware Acceleration for Machine Learning
Hardware Acceleration for Machine LearningHardware Acceleration for Machine Learning
Hardware Acceleration for Machine Learning
 
Accelerating Dynamic Time Warping Subsequence Search with GPU
Accelerating Dynamic Time Warping Subsequence Search with GPUAccelerating Dynamic Time Warping Subsequence Search with GPU
Accelerating Dynamic Time Warping Subsequence Search with GPU
 
Deep Learning Approach in Characterizing Salt Body on Seismic Images - by Zhe...
Deep Learning Approach in Characterizing Salt Body on Seismic Images - by Zhe...Deep Learning Approach in Characterizing Salt Body on Seismic Images - by Zhe...
Deep Learning Approach in Characterizing Salt Body on Seismic Images - by Zhe...
 
Fuzzy c means clustering protocol for wireless sensor networks
Fuzzy c means clustering protocol for wireless sensor networksFuzzy c means clustering protocol for wireless sensor networks
Fuzzy c means clustering protocol for wireless sensor networks
 
Time series data mining techniques
Time series data mining techniquesTime series data mining techniques
Time series data mining techniques
 
Time series Forecasting using svm
Time series Forecasting using  svmTime series Forecasting using  svm
Time series Forecasting using svm
 
Introduction to Chainer Chemistry
Introduction to Chainer ChemistryIntroduction to Chainer Chemistry
Introduction to Chainer Chemistry
 
Virus, Vaccines, Genes and Quantum - 2020-06-18
Virus, Vaccines, Genes and Quantum - 2020-06-18Virus, Vaccines, Genes and Quantum - 2020-06-18
Virus, Vaccines, Genes and Quantum - 2020-06-18
 
Vlsiphysicaldesignautomationonpartitioning 120219012744-phpapp01
Vlsiphysicaldesignautomationonpartitioning 120219012744-phpapp01Vlsiphysicaldesignautomationonpartitioning 120219012744-phpapp01
Vlsiphysicaldesignautomationonpartitioning 120219012744-phpapp01
 
Hanjun Dai, PhD Student, School of Computational Science and Engineering, Geo...
Hanjun Dai, PhD Student, School of Computational Science and Engineering, Geo...Hanjun Dai, PhD Student, School of Computational Science and Engineering, Geo...
Hanjun Dai, PhD Student, School of Computational Science and Engineering, Geo...
 
Thesis presentation
Thesis presentationThesis presentation
Thesis presentation
 
Cvpr 2018 papers review (efficient computing)
Cvpr 2018 papers review (efficient computing)Cvpr 2018 papers review (efficient computing)
Cvpr 2018 papers review (efficient computing)
 
Trackster Pruning at the CMS High-Granularity Calorimeter
Trackster Pruning at the CMS High-Granularity CalorimeterTrackster Pruning at the CMS High-Granularity Calorimeter
Trackster Pruning at the CMS High-Granularity Calorimeter
 
Deep Learning Based Voice Activity Detection and Speech Enhancement
Deep Learning Based Voice Activity Detection and Speech EnhancementDeep Learning Based Voice Activity Detection and Speech Enhancement
Deep Learning Based Voice Activity Detection and Speech Enhancement
 
Modeling full scale-data(2)
Modeling full scale-data(2)Modeling full scale-data(2)
Modeling full scale-data(2)
 
Paper study: Learning to solve circuit sat
Paper study: Learning to solve circuit satPaper study: Learning to solve circuit sat
Paper study: Learning to solve circuit sat
 

Recently uploaded

EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptxthyngster
 
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiVIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiSuhani Kapoor
 
B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxStephen266013
 
Unveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data AnalystUnveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data AnalystSamantha Rae Coolbeth
 
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Jack DiGiovanna
 
100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptxAnupama Kate
 
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls DubaiDubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls Dubaihf8803863
 
Ukraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSUkraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSAishani27
 
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiLow Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiSuhani Kapoor
 
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
Data Warehouse , Data Cube Computation
Data Warehouse   , Data Cube ComputationData Warehouse   , Data Cube Computation
Data Warehouse , Data Cube Computationsit20ad004
 
VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...
VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...
VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...Suhani Kapoor
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPramod Kumar Srivastava
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationshipsccctableauusergroup
 
Spark3's new memory model/management
Spark3's new memory model/managementSpark3's new memory model/management
Spark3's new memory model/managementakshesh doshi
 
Data Science Project: Advancements in Fetal Health Classification
Data Science Project: Advancements in Fetal Health ClassificationData Science Project: Advancements in Fetal Health Classification
Data Science Project: Advancements in Fetal Health ClassificationBoston Institute of Analytics
 
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...Suhani Kapoor
 

Recently uploaded (20)

꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
 
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiVIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
 
E-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptxE-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptx
 
B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docx
 
Unveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data AnalystUnveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data Analyst
 
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
 
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
 
100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx
 
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls DubaiDubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
 
Ukraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSUkraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICS
 
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiLow Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
 
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
 
Data Warehouse , Data Cube Computation
Data Warehouse   , Data Cube ComputationData Warehouse   , Data Cube Computation
Data Warehouse , Data Cube Computation
 
VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...
VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...
VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships
 
Spark3's new memory model/management
Spark3's new memory model/managementSpark3's new memory model/management
Spark3's new memory model/management
 
Data Science Project: Advancements in Fetal Health Classification
Data Science Project: Advancements in Fetal Health ClassificationData Science Project: Advancements in Fetal Health Classification
Data Science Project: Advancements in Fetal Health Classification
 
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
 

Compressed learning for time series classification

  • 1. Compressed Learning for Time Series Classification Shueh-Han Shih Department of Computer Science and Information Engineering, National Taiwan University of Science and Technology
  • 2. Outline • Introduction • Compressed sensing • Sparse representation - envelope • Classification framework • Experimental results • Case study • Conclusion
  • 3. Outline • Introduction • Compressed sensing • Sparse representation - envelope • Classification framework • Experimental results • Case study • Conclusion
  • 5. Motivation (cont’d) • The key to handle time series data effectively is choosing a suitable representation • Transmission and storage issues are critical in IoT scenario • To provide interpretable result for human is important  Time series sparse representation - envelope
  • 6. Time series data type 1. Symbolic sequence 2. Complex symbolic sequence 3. Simple time series 4. Multivariate time series A brief survey on sequence classification Z Xing, J Pei, E Keogh - ACM SIGKDD , 2010
  • 7. Classification of time series • Assigning instances to one of the predefined classes. 0 20 40 60 80 100 120 140 160 180 200 -200 -150 -100 -50 0 50 100 150 200 Class 1 0 20 40 60 80 100 120 140 160 180 200 -200 -150 -100 -50 0 50 100 150 200 Class 2 0 20 40 60 80 100 120 140 160 180 200 -200 -150 -100 -50 0 50 100 150 200 Class 3 0 20 40 60 80 100 120 140 160 180 200 -200 -150 -100 -50 0 50 100 150 200 Class 4
  • 8. Time series classification approaches • Feature based • Sequence distance based • Model based A brief survey on sequence classification Z Xing, J Pei, E Keogh - ACM SIGKDD, 2010
  • 9. Outline • Introduction • Compressed sensing • Sparse representation - envelope • Classification framework • Experimental results • Case study • Conclusion
  • 10. Conventional approach • Number of sample needed for compressed sensing is much more lower than Nyquist frequency. Image: http://www.ni.com/
  • 11. Main idea • Most real-world signals are sparse in some basis A𝑥 = 𝑦, A ∈ ℝ 𝑝×𝑛 𝑎𝑛𝑑 𝑝 ≪ 𝑛 • Dramatically reduce the transmission loading a measure
  • 12. Requirements of compressed sensing 1. 𝑥 should be a 𝑘-sparse signal  1 to 1 relation between data and compressed domain 2. A must satisfies the restricted isometry property  (1 − δ 𝑝)ǁ𝑥ǁ2 2 ≤ ǁA𝑥ǁ2 2 ≤ (1 + δ 𝑝)ǁ𝑥ǁ2 2 A = 𝑟𝑎𝑛𝑑𝑛 𝑝,𝑛 𝑛 (mean=0, 𝜎 = 1 𝑛 ) for some constant 𝛿 𝑝 ∈ (0, 1 Image: Mostafa Mohsenvand Projects
  • 13. Basic routine 𝑨 = 𝑸 𝑨′ = 𝑸𝑷 𝒚 = 𝑨𝒙(𝒐𝒓 𝑨′ 𝒔) transmission • Postpone the computational cost to recovery stage
  • 14. Learning in the compressed domain • Perform task without recovery • SVM can keep the learnability in compressed domain • Reduce model complexity Image: Compressed learning: Universal sparse dimensionality reduction and learning in the measurement domain. R Calderbank, S Jafarpour, R Schapire - preprint, 2009 - dsp.rice.edu
  • 15. Outline • Introduction • Compressed sensing • Sparse representation - envelope • Classification framework • Experimental results • Case study • Conclusion
  • 16. The origin • The ‘envelope’ in finance Image: http://www.investopedia.com/
  • 17. Preliminaries • A time series – T = 𝑡1, 𝑡2, … 𝑡𝑗 … 𝑡 𝑛 , 𝑡𝑗 ∈ ℝ • A time series dataset – D = T 𝑖 | T 𝑖 = 𝑡1 𝑖 , 𝑡2 𝑖 , … 𝑡 𝑛 𝑖 , 𝑖 = 1 𝑡𝑜 𝑚 • Well-synchronized with the same length – A set of random sample from random variables 𝐓1, 𝐓2, … 𝐓𝑗, … 𝐓𝑛
  • 18. Envelope creation • Given 𝐷, envelope with size 𝑘 – E 𝑘 = 𝑍 𝑍 = 𝑧1, 𝑧2, … 𝑧 𝑛 , 𝑧𝑗 − 𝜇 𝑗 ≤ 𝑘 ∙ 𝑠𝑡𝑑𝑗 , ∀ 𝑧𝑗 ∈ ℝ} • 𝜇 𝑗 = 𝑚𝑒𝑎𝑛(𝑻𝑗) , 𝑠𝑡𝑑𝑗 = 𝑠𝑡𝑑(𝑻𝑗) – Profiling the time series dataset
  • 19. Envelope encoding • Encoding time series T as a sparse series S • Sparsity indicates the similarity of a time series and 𝐷 𝑠𝑗 = 1, 𝑖𝑓 𝑡𝑗 > 𝜇 𝑗 + 𝑘 ∙ 𝑠𝑡𝑑𝑗 𝑠𝑗 = −1, 𝑖𝑓 𝑡𝑗 < 𝜇 𝑗 − 𝑘 ∙ 𝑠𝑡𝑑𝑗 𝑠𝑗 = 0, 𝑜𝑡ℎ𝑒𝑟𝑤𝑖𝑠𝑒 , 𝑓𝑜𝑟 𝑗 = 1 𝑡𝑜 𝑛
  • 20. Guarantee of sparsity • According to Chebyshev’s inequality, – Pr(|X − 𝜇| ≤ 𝑘𝜎) ≥ 1 − 1 𝑘2 – No matter what kind of distribution for 𝑻𝑗
  • 21. Outline • Introduction • Compressed sensing • Sparse representation - envelope • Classification framework • Experimental results • Case study • Conclusion
  • 22. Determination of 𝑘 • 𝑘 affects the effectiveness of envelope representation
  • 23. Determination of 𝑘 (cont’d) • Focus on time series multi-class classification – Envelope representation should be discriminative 𝑘∗ = arg max 𝑘 (−𝑎 𝑘 + 𝜆 ∙ 𝑏 𝑘) – 𝑘 : tradeoff between sparsity and distinguishability
  • 24. Encoding result visualization • Sparsity indicates similarity Sparse property ⟹ transmission efficiency • Encoding results are interpretable ECGFiveDays from UCR
  • 25. Envelope representation workflow • From raw series to feature
  • 27. Outline • Introduction • Compressed sensing • Sparse representation - envelope • Classification framework • Experimental results • Case study • Conclusion 1. Proposed method vs. state-of-art method on classification task 2. Compressibility of envelope representation with compressed sensing 3. Noise resistance of envelope representation 4. Time efficiency 5. Space efficiency
  • 28. Classification performance • Benchmark dataset from UCR (5/42) Dataset/ Algorithm Number of classes Size of training set Size of testing set Time series Length CBF 3 30 900 128 Coffee 2 28 28 286 ECGFiveDays 2 23 861 136 ItalyPowerDemand 2 67 1029 24 Sony II 2 27 953 65
  • 29. Classification performance (cont’d) • Result on benchmark dataset (5/42) – Win:9 / lose:18 / between:15 (close:12) – Not the case with IoT scenario, never lack of data Dataset/ Algorithm 1NN-Euclidean 1NN-DTW (best, noWin) Envelope+ linearSVM CBF 85.2(0.9357) 99.6/99.7 90.66 Coffee 75(0.031608) 82.1/82.1 85.71 ECGFiveDays 79.7(0.8758) 79.7/76.8 88.38 ItalyPowerDemand 95.5(1.0661) 95.5/95 97.08 Sony II 69.5(0.9986) 69.5/72.5 82.79
  • 30. Influence of compression ratio • Compression ratio = 𝑝/𝑛 (Number of measurements) / (data dimension)
  • 31. Influence of compression ratio (cont’d) • Using nearly 1 3 datasets from UCR – Some datasets have excellent compressibility
  • 32. Influence of compression ratio (cont’d) • Result on benchmark dataset (5/42) Dataset/ Algorithm 1NN- Euclidean 1NN-DTW (best, noWin) Compression ratio=10% Compression ratio=20% Compression ratio=50% CBF 85.2 99.6/99.7 80.44 88.22 88.44 Coffee 75 82.1/82.1 71.42 82.14 89.28 ECGFiveDays 79.7 79.7/76.8 78.86 81.3 81.64 ItalyPowerDemand 95.5 95.5/95 86.58 91.73 93.97 Sony II 69.5 69.5/72.5 76.91 78.38 80.06
  • 33. Robustness to noise • Noise level - SNR Image: documentation.meraki.com
  • 34. Robustness to noise (cont’d) • Using ECG200 dataset as example The original envelope The envelope with noise level SNR=10.
  • 35. Robustness to noise (cont’d) • Envelope representation is noise-resistant – Can even ignore denoising stage Envelope built/SVM trained with clean data Envelope built/SVM trained with noisy data
  • 36. Time efficiency 1. Building envelope takes O(m*n) 2. Encoding each instance takes O(n) 3. Linear SVM, expects to be O(m2)  Linear time in prediction 0 2 4 6 8 10 12 14 16 0 2 4 6 8 10 12 14 16 Execution time (testing) envelope (Sec.) KNN+ED(Sec.)
  • 37. Space efficiency 1. 32 to (2 ∗ #𝑐𝑙𝑎𝑠𝑠) ratio of reduction 2. 32 to (32 ∗ #𝑐𝑙𝑎𝑠𝑠 ∗ 𝑐𝑜𝑚𝑝𝑟𝑒𝑠𝑠𝑖𝑜𝑛 𝑟𝑎𝑡𝑖𝑜) ratio of reduction through compressed sensing 3. Run length encoding
  • 38. Outline • Introduction • Compressed sensing • Sparse representation - envelope • Classification framework • Experimental results • Case study • Conclusion
  • 39. Smart home project • Passive user identification Image: www.bitronvideo.eu
  • 40. Using sensor • Data collection – EcoBT Mini – 33Hz 50 100 150 200 250 -0.1 -0.05 0 0.05 0.1 0.15 0.2 0.25 50 100 150 200 250 -0.4 -0.3 -0.2 -0.1 0 0.1 0.2 0.3 50 100 150 200 250 -0.4 -0.3 -0.2 -0.1 0 0.1 0.2 0.3 50 100 150 200 250 -80 -60 -40 -20 0 20 40 60 80 50 100 150 200 250 -200 -150 -100 -50 0 50 100 150 200 50 100 150 200 250 -200 -150 -100 -50 0 50 100 150
  • 41. Door opening recognition • User identification
  • 42. Door opening recognition (cont’d) • Recognition performance – Left: axis 5 Right: axis 1&5 0 20 40 60 80 100 120 140 160 180 200 -200 -150 -100 -50 0 50 100 150 200 Class 1 0 20 40 60 80 100 120 140 160 180 200 -200 -150 -100 -50 0 50 100 150 200 Class 2 0 20 40 60 80 100 120 140 160 180 200 -200 -150 -100 -50 0 50 100 150 200 Class 3 0 20 40 60 80 100 120 140 160 180 200 -200 -150 -100 -50 0 50 100 150 200 Class 4
  • 43. Gait recognition • Via slipper Image: www.footbionics.com
  • 44. Gait recognition(cont’d) • Recognition capability – Left: axis 2 Right: axis 2&3&4 0 5 10 15 20 25 30 35 40 -1 -0.5 0 0.5 1 1.5 2 Class 1 0 5 10 15 20 25 30 35 40 -1 -0.5 0 0.5 1 1.5 2 Class 2 0 5 10 15 20 25 30 35 40 -1 -0.5 0 0.5 1 1.5 2 Class 3 0 5 10 15 20 25 30 35 40 -1 -0.5 0 0.5 1 1.5 2 Class 4
  • 47. Conclusion • Propose a sparse representation for time series • Propose a heuristic to determine envelope size 𝑘 • Effectiveness, efficiency, robustness verification • Real-world use case

Editor's Notes

  1. Time series classification (dis)advantage of CS Time series representation method Then experiments Last,
  2. Data collected continuously, time correlated series; communication & storage issue Reduce dimension & recover with little loss Extract info. From TS for ML model www.aeris.com www.comp-engine.org www.ceremade.dauphine.fr
  3. In order to ~~~ we propose envelope…
  4. alphabet of symbols e.g. DNA complex symbolic sequence e.g. transaction simple time series e.g. electric meter multivariate time series e.g. EEG www.bios.net www.apps.rus.mto.gov.on.ca www.rowetel.com www.dianliwenmi.com
  5. Assign new instance to certain class based on given data Using the example from case study
  6. model each part as one state; the mean of the state is the mean estimated from that part For a gesture recognizer we build multiple of these models, one for each gesture. a training set to estimate the parameters of models. During recognition we simply pick the model describing the data best.
  7. CS
  8. Lower transmission loading & recover well 取樣率 (f s) > 2 *受測訊號的最高頻率部份, 否則高頻的內容會失真(Aliasing)
  9. The goal of compressed sensing is to provide measurement matrix A, with the number of measurements m as small as possible M is the #sample, which is << Nyquist
  10. Normal Random matrix A generated with specific parameters is usually good enough for most real world applications.
  11. Most efforts are cost in recovery stage (discard this)
  12. The erroe of SVM in the measurement domain is with high probability close to the error of the best linear classifier in the data domain
  13. The idea of ‘envelope’ has been applied in finance for a long time used by investors and traders to help identify extreme overbought and oversold conditions. (兩端分別是由開盤價和收盤價)
  14. a vector in temporal order D is well-synchronized with the same length regarded as random samples from a set of random variables
  15. A set of values covered mu+- k*std. Envelope is the profile of the dataset
  16. possibility of applying CS for further boost
  17. 𝑘 is a critical issue, which directly affect the distinguishability /sparsity
  18. Propose a heuristic to make envelope distinguishable Large/small k will affect distinguishability
  19. Options for concise format
  20. libSVM with 1 to 1 (shorter training time)
  21. few datasets from UCR comparing the performance of proposed method with state-of-art
  22. Envelope may have inferior performance due to the lack of training instances
  23. possible to reduce the size to about 20%~10% and still keep the classification performance, which is very promising.
  24. Still keep good performance after compression
  25. 為訊號功率(Power of Signal)。 為雜訊功率(Power of Noise)。 為訊號振幅(Amplitude of Signal)。 為雜訊振幅(Amplitude of Noise)。
  26. Robustness of proposed method to noise
  27. proposed method is noise-resisting.
  28. Faster than Knn+ED
  29. Space-saving
  30. Using multiple devices with weak models integrate them to get better performance
  31. Using BLE for transmission
  32. identify users from door opening trajectories.
  33. treat each step as a time series instance
  34. the proposed method is also suitable for distinct cases. Integrate results of multi-steps to get better performance
  35. Door and slippers make distinct predictions
  36. Demo
  37. Intro. of TS classification, Pros & cons of CS Supervised feature extraction technique Heuristic Benchmark, noise, compression Real-world cases