Interesting near galaxy sources

 • identified by TCP in the last 2 days
 • (last epoch observed 1 week ago)
 • Classificati...
PI: Josh Bloom
Overview

•   TCP Software & Data Architecture
•   Classifiers & Cutting out “Junk”

•   Continuing work...

      PTF spec...
Transients Classification Pipeline
Parallelized source correlation
                             and classification

                •   Difference objects are...
Parallelized source correlation
                             and classification
                •   Realtime TCP runs on 22...
Warehouse of light-curves

•   Need representative light-curves for all science

•   With these we can model each science ...
Confusion Matrix


    different ways of quantifying effeciencies
     - using original good training set, and train/evalu...
“Noisification”
                        (resampling light-curves)



•   For PTF, the Noisification code references:
     • ...
Constructing Light Curves
        from subtractions ain’t easy



                         true
mag
                      ...
Constructing Light Curves
        from subtractions ain’t easy



                         true
mag
                      ...
Constructing Light Curves
        from subtractions ain’t easy



                         true
mag
                      ...
Constructing Light Curves
              from subtractions ain’t easy

      5σ exclusion
          band
                  ...
Constructing Light Curves
              from subtractions ain’t easy

      5σ exclusion
          band
                  ...
for some source at                         Constructing Light Curves
RA,DEC & ti, determine                         from s...
Classifiers
•   General Classifier
     •   Filter out: poorly subtracted sources

     •   Filter out: minor planets / rock...
(Source)

                          General Classification
                         •      Three general classification grou...
(Source)

                        General Classification
                        •     Applied to ~80 spectroscopically con...
General Classifier: components & cuts
•   Crowd source modeled “RealBogus” metric
     • Cut on: average RealBogus, derivat...
Evaluating and Combining Classifiers
The “Netflix Prize” was won using a combination of ~1000 different classifiers.

 •    I...
Periodic variable classifiers
                   •     Currently, science classes are determined by combining
             ...
Continuing Work


•   Test, improve general classifier cuts

•   Push general classifications to Followup
    Marshal

•   P...
TCP Explorer
Caltech 20090903 Talk on T.C.P. for LSST/PTF workshop
Caltech 20090903 Talk on T.C.P. for LSST/PTF workshop
Caltech 20090903 Talk on T.C.P. for LSST/PTF workshop
Caltech 20090903 Talk on T.C.P. for LSST/PTF workshop
Caltech 20090903 Talk on T.C.P. for LSST/PTF workshop
Upcoming SlideShare
Loading in...5
×

Caltech 20090903 Talk on T.C.P. for LSST/PTF workshop

240

Published on

Caltech 20090903 Talk on T.C.P. for LSST/PTF workshop

Published in: Technology, Education
0 Comments
0 Likes
Statistics
Notes
  • Be the first to comment

  • Be the first to like this

No Downloads
Views
Total Views
240
On Slideshare
0
From Embeds
0
Number of Embeds
0
Actions
Shares
0
Downloads
5
Comments
0
Likes
0
Embeds 0
No embeds

No notes for slide

Caltech 20090903 Talk on T.C.P. for LSST/PTF workshop

  1. 1. Interesting near galaxy sources • identified by TCP in the last 2 days • (last epoch observed 1 week ago) • Classification triggered by latest epoch added to the source
  2. 2. PI: Josh Bloom
  3. 3. Overview • TCP Software & Data Architecture • Classifiers & Cutting out “Junk” • Continuing work... PTF spectroscopically confirmed SN, subsequently classified by TCP as SN
  4. 4. Transients Classification Pipeline
  5. 5. Parallelized source correlation and classification • Difference objects are retrieved from LBL • Each difference-object is passed to an IPython client • Each parallel IPython client performs: • Source creation or correlation with existing sources • “Feature” generation (or re-generation) for that source source • Classification of that source generation feature generation source classification
  6. 6. Parallelized source correlation and classification • Realtime TCP runs on 22 dedicated cores • LCOGT’s 96 core beowulf • non run-time tasks • Classifier generation • Additional resources • To be used for future timeseries classification work source generation • Yahoo’s 4000 core Hadoop academic cluster • Amazon EC2 cluster feature generation source classification
  7. 7. Warehouse of light-curves • Need representative light-curves for all science • With these we can model each science class • We’ve built a warehouse of example light-curves TCP-TUTOR DotAstro.org internal interface public interface
  8. 8. Confusion Matrix different ways of quantifying effeciencies - using original good training set, and train/evaluate efficencies via folding - using “noisified”, simulated sources matching sur vey shedule, cadences, limits • C
  9. 9. “Noisification” (resampling light-curves) • For PTF, the Noisification code references: • 1000s of PTF pointing and survey observing plans • This allows simulation of PTF cadenced light-curves • Occasionally PTF observes using a faster cadence: • 7.5 minutes between revisiting an RA, Dec • This requires a separate set of noisified light-curves and classifiers. • Other pointing and observing plans could be used. • This means we can easily generate noisified light-curves for any survey. • Thus we can generate science classifiers for any survey.
  10. 10. Constructing Light Curves from subtractions ain’t easy true mag reference [assumes template doesn’t update] time
  11. 11. Constructing Light Curves from subtractions ain’t easy true mag reference [assumes template doesn’t update] = 3 σ limiting mag time
  12. 12. Constructing Light Curves from subtractions ain’t easy true mag reference [assumes template doesn’t update] = 3 σ limiting mag detected in: pos_sub? neg_sub? time
  13. 13. Constructing Light Curves from subtractions ain’t easy 5σ exclusion band true mag reference [assumes template doesn’t update] = 3 σ limiting mag detected in: pos_sub? neg_sub? time
  14. 14. Constructing Light Curves from subtractions ain’t easy 5σ exclusion band true mag reference [assumes template doesn’t update] = 3 σ limiting mag detected in: pos_sub? neg_sub? time
  15. 15. for some source at Constructing Light Curves RA,DEC & ti, determine from subtractions ain’t easy best ref_mag at t=ti total mag = TM+ yes [detection] detection in positive sub? total mag = limit_mag no [upper limit] no limit_mag fainter than ref_mag? total mag = ref_mag [detection] yes no detection in the total mag = TM- negative sub? [detection] s yes ye mag in negative sub < total mag = limit_mag limit_mag - ref_mag? no [upper limit] TM+ = 2.5 log10( f_aper × 10-0.4(sub_zp-ref_zp) + flux_aper ) + ub1_ref_zp TM- = 2.5 log10( -f_aper × 10-0.4(sub_zp-ref_zp) + flux_aper ) + ub1_ref_zp
  16. 16. Classifiers • General Classifier • Filter out: poorly subtracted sources • Filter out: minor planets / rocks • Filter out: long-time sampled (periodic & nonperiodic) • Identify interesting sources near known galaxies • Identify periodic variable science class when confidence is high • Timeseries Classifier • Weighted combination of machine learning classifiers • Astronomer crafted classifiers for specific science types • Microlens, Super Nova
  17. 17. (Source) General Classification • Three general classification groups. • Periodic variables are contained within the “uninteresting” group, although more specific Interesting with sub-classifications are known. nearby galaxy context Poor subtraction JUNK class SN, AGN of Uninteresting various quality classes Rock class (general) Periodic variable class Interesting without context information Nicely subtracted, non-galaxy, non-periodic variable classes
  18. 18. (Source) General Classification • Applied to ~80 spectroscopically confirmed user classified (SN, AGN, galaxy) sources. • SN lightcurve classifier is needed when galaxy Interesting with context is not available, and to improve confidence nearby galaxy context in SN classification. SN, AGN, galaxy Uninteresting (58 SN) faint, poorly subtracted (11 SN) Interesting without context information
  19. 19. General Classifier: components & cuts • Crowd source modeled “RealBogus” metric • Cut on: average RealBogus, derivatives of RB components • Cut on: % epochs in source with good RealBogus • PSF statistics • Cuts on: PSF symmetry, eccentricity (averages) • Neighboring object comparisons • Cuts on significance of above metrics when compared to neighboring pixels • Minor Planet check PyEphem • Does an epoch intersect a Minor Planet? (PyMPChecker) PyMPChecker • Well sampled source • Cuts on: well sampled periodic & nonperiodic sources
  20. 20. Evaluating and Combining Classifiers The “Netflix Prize” was won using a combination of ~1000 different classifiers. • Issues when using multiple classifiers: • How to combine Classifiers using weights or tree-hierarchy • How to generate final classification “probabilities” when using: • Widely varying types of classifiers • Each classifier may contain sub-classifications with their own class probabilities. • Evaluate the final combination of classifiers • We classify PTF09xxx user classified sources • We display success / failure cases for each general class • Update classifier weights & cuts, try again. • OR: Iteratively & algorithmically find best weights.
  21. 21. Periodic variable classifiers • Currently, science classes are determined by combining the weighted probabilities generated by different classification models, for a source. ~0.4 day period ~0.14 day period RR Lyrae using • Each machine-learned classification model is trained using RR Lyrae using 10 epoch 20 epoch “noisified” lightcurves which were generated using different parameters. noisification noisification ...shows highest classification Clicking on a class for one probability sources for that of dozens of ML models... model::class Overplotting of period-fold plotting period-folded model probably failed here still needs work 0.1 - 0.17 day period RR Lyrae using 15 epoch noisification
  22. 22. Continuing Work • Test, improve general classifier cuts • Push general classifications to Followup Marshal • Push specific variable science class identified sources to Followup Marshal • Explore other timeseries classifiers for periodic variable classification.
  23. 23. TCP Explorer
  1. A particular slide catching your eye?

    Clipping is a handy way to collect important slides you want to go back to later.

×