SlideShare a Scribd company logo
1 of 21
Download to read offline
Machine learning and advanced statistics: photometric classification of SNe
Machine learning and advanced statistics:
photometric classification of SNe
Marco De Pascale1,2,3
Baruffolo A.3, Cappellaro E.3, Patat F.2
1Department of Physics and Astronomy – University of Padova
2European Southern Observatory
2Astronomical Observatory of Padova – INAF
November, 28th 2014
Machine learning and advanced statistics: photometric classification of SNe
Outline
1 Surveys and data
2 Data-flood
3 Solution
Data smoothing
Dimensionality reduction
Classification
Machine learning and advanced statistics: photometric classification of SNe
Surveys and data
Supernovae . . .
• Among the brightest objects in the Universe
• Universe expansion
• Star formation rate evolution
Machine learning and advanced statistics: photometric classification of SNe
Surveys and data
. . . and photometric surveys
• DES and PanSTARSS, ongoing
SNe surveys
• LSST, planned survey
• Huge amount of data produced
Machine learning and advanced statistics: photometric classification of SNe
Data-flood
A problem for classification
• Can not be done “by hand”
• Can not use spectra
• Need for an automatic technique
Machine learning and advanced statistics: photometric classification of SNe
Solution
Information from data
• Large data set can describe itself
• Combine advanced statistics and
machine learning
• data driven classification
Machine learning and advanced statistics: photometric classification of SNe
Solution
Backwards outline
Machine learning and advanced statistics: photometric classification of SNe
Solution
Backwards outline
Machine learning and advanced statistics: photometric classification of SNe
Solution
Backwards outline
Machine learning and advanced statistics: photometric classification of SNe
Solution
Data set
• Simulated in four bands griz
• From Supernova Photometric Classification Challenge
(Kessler+ 2010)
• Biased towards SN Ia
• Real data from SUDARE survey
Machine learning and advanced statistics: photometric classification of SNe
Solution
Data smoothing
Smoothing with few assumption
• Data speak for themselves = no templates
• Non-parametric smoothing techniques
• Only constraint: main features of underlining function
Machine learning and advanced statistics: photometric classification of SNe
Solution
Data smoothing
Gaussian processes (or kriging)
• Measurements described
by infinite number of
functions
• Gaussian probability
distribution over functions
f(x) ∼ GP(µ(x), k(xi, xj))
Machine learning and advanced statistics: photometric classification of SNe
Solution
Data smoothing
Covariance function
• Covariance among measurements
k(xi, xj) = h2
exp −
|xi − xj|2
2 2
• Hyper parameters
• h: magnitude of the function
• : length scale
• Optimised through likelihood maximisation
Machine learning and advanced statistics: photometric classification of SNe
Solution
Data smoothing
Smoothing results
Machine learning and advanced statistics: photometric classification of SNe
Solution
Dimensionality reduction
Next step
• Describe light curves with few parameters
• Capture the important features
• Diffusion maps, Coifam & Lafon (2006)
Machine learning and advanced statistics: photometric classification of SNe
Solution
Dimensionality reduction
Measuring similarities
• Euclidean metric to
measure similarities
• Weighted by smoothing
errors
Machine learning and advanced statistics: photometric classification of SNe
Solution
Dimensionality reduction
Diffusion map
• Non-linear dimensionality
reduction technique
• Captures intrinsic
geometry of data set
• Similarity measure
→ diffusion distance
Machine learning and advanced statistics: photometric classification of SNe
Solution
Dimensionality reduction
Results
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
qq
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
qq
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q q
q
q
q
q
q
q
q
q
q
q q
q
q
q
q
q
q
q q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
qq
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
qq
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
qq
q
q
q
q
q
q
q
q
q
q
q
q
q
q
qq q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q q
q
q
q
q
q
q
q
q
q
q
q
−2.0 −1.5 −1.0 −0.5 0.0 0.5 1.0 1.5
−2−101
Diffusion Coordinate 6
DiffusionCoordinate3
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
qq
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q q
q
q
q
q
q
q
q
q
qq
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
qq
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
qq
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
q
Machine learning and advanced statistics: photometric classification of SNe
Solution
Classification
Identify clusters
• Diffusion coordinates = parameters
• Hopefully we have clusters in diffusion space
• Associate clusters to SN type
Machine learning and advanced statistics: photometric classification of SNe
Solution
Classification
Random forests. . .
• Set of classification trees
• Committee with majority
vote
• Successful
Machine learning and advanced statistics: photometric classification of SNe
Solution
Classification
. . . at work
• Trained on labelled observations
SNIa SNIbc SNII class.error
SNIa 536 5 17 0.03
SNIbc 74 50 20 0.65
SNII 18 4 378 0.05
• Tested on test observations
SNIa SNIbc SNII class.error
SNIa 3791 162 568 0.16
SNIbc 1467 680 506 0.74
SNII 4296 350 8382 0.35

More Related Content

Similar to de_pascale

A Multiple-Shooting Differential Dynamic Programming Algorithm
A Multiple-Shooting Differential Dynamic Programming AlgorithmA Multiple-Shooting Differential Dynamic Programming Algorithm
A Multiple-Shooting Differential Dynamic Programming AlgorithmEtienne Pellegrini
 
Pasolli_TU1_TO3.4.pptx
Pasolli_TU1_TO3.4.pptxPasolli_TU1_TO3.4.pptx
Pasolli_TU1_TO3.4.pptxgrssieee
 
Pasolli_TU1_TO3.4.pptx
Pasolli_TU1_TO3.4.pptxPasolli_TU1_TO3.4.pptx
Pasolli_TU1_TO3.4.pptxgrssieee
 
01 gabriel vasile_igarss_2018
01 gabriel vasile_igarss_201801 gabriel vasile_igarss_2018
01 gabriel vasile_igarss_2018Gabriel VASILE
 
Advances in Agricultural remote sensings
Advances in Agricultural remote sensingsAdvances in Agricultural remote sensings
Advances in Agricultural remote sensingsAyanDas644783
 
FV_IGARSS11.ppt
FV_IGARSS11.pptFV_IGARSS11.ppt
FV_IGARSS11.pptgrssieee
 
FV_IGARSS11.ppt
FV_IGARSS11.pptFV_IGARSS11.ppt
FV_IGARSS11.pptgrssieee
 
FV_IGARSS11.ppt
FV_IGARSS11.pptFV_IGARSS11.ppt
FV_IGARSS11.pptgrssieee
 
FV_IGARSS11.ppt
FV_IGARSS11.pptFV_IGARSS11.ppt
FV_IGARSS11.pptgrssieee
 
Processing of raw astronomical data of large volume by map reduce model
Processing of raw astronomical data of large volume by map reduce modelProcessing of raw astronomical data of large volume by map reduce model
Processing of raw astronomical data of large volume by map reduce modelSergey Gerasimov
 
Processing Real-Time Volcano Seismic Measurements Through Redis: David Chaves
Processing Real-Time Volcano Seismic Measurements Through Redis: David ChavesProcessing Real-Time Volcano Seismic Measurements Through Redis: David Chaves
Processing Real-Time Volcano Seismic Measurements Through Redis: David ChavesRedis Labs
 
DSD-NL 2018 Inverse Analysis for Workshop Anura3D MPM - Ghasemi Martinelli
DSD-NL 2018 Inverse Analysis for Workshop Anura3D MPM - Ghasemi MartinelliDSD-NL 2018 Inverse Analysis for Workshop Anura3D MPM - Ghasemi Martinelli
DSD-NL 2018 Inverse Analysis for Workshop Anura3D MPM - Ghasemi MartinelliDeltares
 
RemoteSensing_DeepLearning_v2.pptx
RemoteSensing_DeepLearning_v2.pptxRemoteSensing_DeepLearning_v2.pptx
RemoteSensing_DeepLearning_v2.pptxElise Colin
 
Computing Just What You Need: Online Data Analysis and Reduction at Extreme ...
Computing Just What You Need: Online Data Analysis and Reduction  at Extreme ...Computing Just What You Need: Online Data Analysis and Reduction  at Extreme ...
Computing Just What You Need: Online Data Analysis and Reduction at Extreme ...Ian Foster
 
My works slideshare hl
My works slideshare hlMy works slideshare hl
My works slideshare hlNima Dabidian
 
4-Navarro-SanchezIGARSS2011.ppt
4-Navarro-SanchezIGARSS2011.ppt4-Navarro-SanchezIGARSS2011.ppt
4-Navarro-SanchezIGARSS2011.pptgrssieee
 
(17 22) karthick sir
(17 22) karthick sir(17 22) karthick sir
(17 22) karthick sirIISRTJournals
 

Similar to de_pascale (20)

A Multiple-Shooting Differential Dynamic Programming Algorithm
A Multiple-Shooting Differential Dynamic Programming AlgorithmA Multiple-Shooting Differential Dynamic Programming Algorithm
A Multiple-Shooting Differential Dynamic Programming Algorithm
 
Pasolli_TU1_TO3.4.pptx
Pasolli_TU1_TO3.4.pptxPasolli_TU1_TO3.4.pptx
Pasolli_TU1_TO3.4.pptx
 
Pasolli_TU1_TO3.4.pptx
Pasolli_TU1_TO3.4.pptxPasolli_TU1_TO3.4.pptx
Pasolli_TU1_TO3.4.pptx
 
01 gabriel vasile_igarss_2018
01 gabriel vasile_igarss_201801 gabriel vasile_igarss_2018
01 gabriel vasile_igarss_2018
 
Advances in Agricultural remote sensings
Advances in Agricultural remote sensingsAdvances in Agricultural remote sensings
Advances in Agricultural remote sensings
 
FV_IGARSS11.ppt
FV_IGARSS11.pptFV_IGARSS11.ppt
FV_IGARSS11.ppt
 
FV_IGARSS11.ppt
FV_IGARSS11.pptFV_IGARSS11.ppt
FV_IGARSS11.ppt
 
FV_IGARSS11.ppt
FV_IGARSS11.pptFV_IGARSS11.ppt
FV_IGARSS11.ppt
 
FV_IGARSS11.ppt
FV_IGARSS11.pptFV_IGARSS11.ppt
FV_IGARSS11.ppt
 
Thesis presentation
Thesis presentationThesis presentation
Thesis presentation
 
Processing of raw astronomical data of large volume by map reduce model
Processing of raw astronomical data of large volume by map reduce modelProcessing of raw astronomical data of large volume by map reduce model
Processing of raw astronomical data of large volume by map reduce model
 
Processing Real-Time Volcano Seismic Measurements Through Redis: David Chaves
Processing Real-Time Volcano Seismic Measurements Through Redis: David ChavesProcessing Real-Time Volcano Seismic Measurements Through Redis: David Chaves
Processing Real-Time Volcano Seismic Measurements Through Redis: David Chaves
 
20210226 esa-science-coffee-v2.0
20210226 esa-science-coffee-v2.020210226 esa-science-coffee-v2.0
20210226 esa-science-coffee-v2.0
 
DSD-NL 2018 Inverse Analysis for Workshop Anura3D MPM - Ghasemi Martinelli
DSD-NL 2018 Inverse Analysis for Workshop Anura3D MPM - Ghasemi MartinelliDSD-NL 2018 Inverse Analysis for Workshop Anura3D MPM - Ghasemi Martinelli
DSD-NL 2018 Inverse Analysis for Workshop Anura3D MPM - Ghasemi Martinelli
 
Seminar nov2017
Seminar nov2017Seminar nov2017
Seminar nov2017
 
RemoteSensing_DeepLearning_v2.pptx
RemoteSensing_DeepLearning_v2.pptxRemoteSensing_DeepLearning_v2.pptx
RemoteSensing_DeepLearning_v2.pptx
 
Computing Just What You Need: Online Data Analysis and Reduction at Extreme ...
Computing Just What You Need: Online Data Analysis and Reduction  at Extreme ...Computing Just What You Need: Online Data Analysis and Reduction  at Extreme ...
Computing Just What You Need: Online Data Analysis and Reduction at Extreme ...
 
My works slideshare hl
My works slideshare hlMy works slideshare hl
My works slideshare hl
 
4-Navarro-SanchezIGARSS2011.ppt
4-Navarro-SanchezIGARSS2011.ppt4-Navarro-SanchezIGARSS2011.ppt
4-Navarro-SanchezIGARSS2011.ppt
 
(17 22) karthick sir
(17 22) karthick sir(17 22) karthick sir
(17 22) karthick sir
 

de_pascale

  • 1. Machine learning and advanced statistics: photometric classification of SNe Machine learning and advanced statistics: photometric classification of SNe Marco De Pascale1,2,3 Baruffolo A.3, Cappellaro E.3, Patat F.2 1Department of Physics and Astronomy – University of Padova 2European Southern Observatory 2Astronomical Observatory of Padova – INAF November, 28th 2014
  • 2. Machine learning and advanced statistics: photometric classification of SNe Outline 1 Surveys and data 2 Data-flood 3 Solution Data smoothing Dimensionality reduction Classification
  • 3. Machine learning and advanced statistics: photometric classification of SNe Surveys and data Supernovae . . . • Among the brightest objects in the Universe • Universe expansion • Star formation rate evolution
  • 4. Machine learning and advanced statistics: photometric classification of SNe Surveys and data . . . and photometric surveys • DES and PanSTARSS, ongoing SNe surveys • LSST, planned survey • Huge amount of data produced
  • 5. Machine learning and advanced statistics: photometric classification of SNe Data-flood A problem for classification • Can not be done “by hand” • Can not use spectra • Need for an automatic technique
  • 6. Machine learning and advanced statistics: photometric classification of SNe Solution Information from data • Large data set can describe itself • Combine advanced statistics and machine learning • data driven classification
  • 7. Machine learning and advanced statistics: photometric classification of SNe Solution Backwards outline
  • 8. Machine learning and advanced statistics: photometric classification of SNe Solution Backwards outline
  • 9. Machine learning and advanced statistics: photometric classification of SNe Solution Backwards outline
  • 10. Machine learning and advanced statistics: photometric classification of SNe Solution Data set • Simulated in four bands griz • From Supernova Photometric Classification Challenge (Kessler+ 2010) • Biased towards SN Ia • Real data from SUDARE survey
  • 11. Machine learning and advanced statistics: photometric classification of SNe Solution Data smoothing Smoothing with few assumption • Data speak for themselves = no templates • Non-parametric smoothing techniques • Only constraint: main features of underlining function
  • 12. Machine learning and advanced statistics: photometric classification of SNe Solution Data smoothing Gaussian processes (or kriging) • Measurements described by infinite number of functions • Gaussian probability distribution over functions f(x) ∼ GP(µ(x), k(xi, xj))
  • 13. Machine learning and advanced statistics: photometric classification of SNe Solution Data smoothing Covariance function • Covariance among measurements k(xi, xj) = h2 exp − |xi − xj|2 2 2 • Hyper parameters • h: magnitude of the function • : length scale • Optimised through likelihood maximisation
  • 14. Machine learning and advanced statistics: photometric classification of SNe Solution Data smoothing Smoothing results
  • 15. Machine learning and advanced statistics: photometric classification of SNe Solution Dimensionality reduction Next step • Describe light curves with few parameters • Capture the important features • Diffusion maps, Coifam & Lafon (2006)
  • 16. Machine learning and advanced statistics: photometric classification of SNe Solution Dimensionality reduction Measuring similarities • Euclidean metric to measure similarities • Weighted by smoothing errors
  • 17. Machine learning and advanced statistics: photometric classification of SNe Solution Dimensionality reduction Diffusion map • Non-linear dimensionality reduction technique • Captures intrinsic geometry of data set • Similarity measure → diffusion distance
  • 18. Machine learning and advanced statistics: photometric classification of SNe Solution Dimensionality reduction Results q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q qq q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q qq q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q qq q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q qq q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q qq q q q q q q q q q q q q q q qq q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q −2.0 −1.5 −1.0 −0.5 0.0 0.5 1.0 1.5 −2−101 Diffusion Coordinate 6 DiffusionCoordinate3 q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q qq q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q qq q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q qq q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q qq q q q q q q q q q q q q q q q q q
  • 19. Machine learning and advanced statistics: photometric classification of SNe Solution Classification Identify clusters • Diffusion coordinates = parameters • Hopefully we have clusters in diffusion space • Associate clusters to SN type
  • 20. Machine learning and advanced statistics: photometric classification of SNe Solution Classification Random forests. . . • Set of classification trees • Committee with majority vote • Successful
  • 21. Machine learning and advanced statistics: photometric classification of SNe Solution Classification . . . at work • Trained on labelled observations SNIa SNIbc SNII class.error SNIa 536 5 17 0.03 SNIbc 74 50 20 0.65 SNII 18 4 378 0.05 • Tested on test observations SNIa SNIbc SNII class.error SNIa 3791 162 568 0.16 SNIbc 1467 680 506 0.74 SNII 4296 350 8382 0.35