SlideShare a Scribd company logo
Emerging Needs & Opportunities in
Data-Intensive Domains: Astronomy
Joshua Bloom (@profjsb)

Gordon & Betty Moore Foundation Data-Driven Investigator
UC Berkeley, Astronomy
Irvine, CA Mar 20, 2017
Presented to the National Academy of Sciences Roundtable on the
"Intersection of Domain Expertise and Data Science"
Data
Decreasing cost to obtain,
move, store
Compute
Decreasing cost,
increasing specialty
Technology/
Methodology
Algorithmic innovation,
software tooling
Democratizing Trends in Physical Sciences
open data,
more freely shared
more accessible
open source
Democratizing Trends in Physical Sciences
Competition for superior inferential capabilities
‣ Statistical / machine learning capabilities
‣ Computational prowess
‣ Ability to innovate methodologies
Astronomy's Data deluge
- who wins?
- examples of data science impact in astro
Notable data-driven success with domain + methodologies
Outline
particle physics & gravitational wave astronomy
Educational challenges given these trends
Symbiotic relationship between data domains
& methodological sciences
Discovery of the Higgs Boson in 2012
‣ 600M collisions/s; only ~100/s are interesting
‣ 600 GB/s raw → 25 GB/s processed
‣ 3.5 MW data centers, Pb/day processing
Data & Computation
https://home.cern/about/computing
Horvath 1310.6839
Inference
‣ "HEP phystatisticians"
‣ Statistics combining 

standard model (theoretical underpinning)

& observations
4.3 Compatibility of the observed state with the SM Higgs boson hypothesis
SM
σ/σBest fit
-4 -2 0 2 4
ZZ (2 jets)→H
ZZ (0/1 jet)→H
(VH tag)ττ→H
(VBF tag)ττ→H
(0/1 jet)ττ→H
WW (VH tag)→H
WW (VBF tag)→H
WW (0/1 jet)→H
(VH tag)γγ→H
(VBF tag)γγ→H
(untagged)γγ→H
bb (ttH tag)→H
bb (VH tag)→H
0.14±= 0.80µ
Combined
-1
19.6 fb≤= 8 TeV, Ls-1
5.1 fb≤= 7 TeV, Ls
CMS Preliminary
= 0.94
SM
p
= 125.7 GeVHm
bb→H
0.14±= 0.80µ
Combined
-1
19.6 fb≤= 8 TeV, Ls-1
5.1 fb≤= 7 TeV, Ls
CMS Preliminary
= 0.65
SM
p
= 125.7 GeVHm
0.14±= 0.80µ
Combined
= 7 TeV, Ls
CMS Preli
= 0.52
SM
p
CMS PAS HIG-13-005
https://apod.nasa.gov/apod/ap160211.html
Direct Detection of Gravitational Waves in 2016
https://apod.nasa.gov/apod/ap160211.html
‣ aLIGO Data rate: 81 MB/s, 2PB/yr
‣ 105
templates to continually match (32TB) at 5 TFLOPS
‣ 20k CPUs, 12 PB cluster Shoemaker M060056-01.doc
Data & Computation
Inference
‣novel long-tail statistics
‣Need rapid identification

for multi wavelength followup
e.g. Cannon et al. 2012
Lessons from Large-Scale Physics Experiments
‣ Strongly focused on limited number of high-impact
discoveries
‣ Residual inventions (e.g., WWW, grid computing)
‣ Diverse Teams: physicists collaborating with
methodological specialists
‣ Large collaborations producing science results
themselves
Astronomy's Data Deluge
Large Synoptic Survey Telescope (LSST) - 2020
Light curves for 800M sources every 3 days
106
supernovae/yr, 105
eclipsing binaries
3.2 gigapixel camera, 20 TB/night
LOFAR & SKA
150 Gps (27 Tflops) → 20 Pps (~100 Pflops)
Many other astronomical surveys are already
producing data:
SDSS, iPTF, CRTS, Pan-STARRS, Hipparcos, OGLE,
ASAS, Kepler, LINEAR, DES etc.,
•US Open Skies Policy
•Investments in Public
Data Dissemination
•Unaffiliated Scientists
are expected to reap
rewards
If anyone can get the Data, who wins?
She who asks the right questions
She who answer questions better & faster than others:
‣computational access
‣methodological inference (e.g., machine learning)
‣better story telling, dissemination of results
‣reproducibility (acceptance)
domain expertise
data science
+
Our	ML	framework	found	
the	Nearest	Supernova	in	3	
Decades	..
‣ Built	&	Deployed	robust,	real-Ime	
supervised	machine	learning	
framework,	discovering	>10,000	
events	in	>	10	TB	of	imaging	

								→	75+	journal	arIcles
‣ Built	probabilisIc	source	
classificaIon	catalogs	on	public	
archives	with	novel	acIve	
learning	approach
h"ps://www.nsf.gov/news/news_summ.jsp?cntn_id=122537
Some of my work…
Probabilistic Classification of
50k+ Variable Stars
Shivvers,JSB,Richards MNRAS,2014
106 “DEB” candidates
12 new
mass-radii
15 “RCB/DYP”

candidates
8 new discoveries
Triple # of
Galactic
DYPer Stars
Miller, Richards, JSB,..ApJ 2012
5400
Spectroscopic
Targets
Miller, JSB, Richards,..ApJ 2015
Turn synoptic
imagers into
~spectrographs
L112 K. Schawinski et al.
Figure 2. We show the results obtained for one example galaxy. From left to right: the original SDSS image, the degraded image with a worse PSF and higher
noise level (indicating the PSF and noise level used), the image as recovered by the GAN, and for comparison, the result of a deconvolution. This figure visually
illustrates the GAN’s ability to recover features that conventional deconvolutions cannot.
Some unsupervised work…
Calculus
Linear Algebra
Multivariate calculus
Diff Equations
Complex Analysis
Special relativity
Quantum mechanics
Classical mechanics
Thermo & Statistical Lab astrophysics
Intro astronomy
E&M
Particle physics
Cosmology
Stars
Galaxies
Astrophysical fluids
grad cosmology
grad galaxies
Intro Statistics
K-12
undergraduategraduate
General relativity
PhysicsMath/Stat Astro
PhD specialization
Challenge of Data-Driven Domain Education Stack
Calculus
Linear Algebra
Multivariate calculus
Diff Equations
Complex Analysis
Special relativity
Quantum mechanics
Classical mechanics
Thermo & Statistical Lab astrophysics
Intro astronomy
E&M
Particle physics
Cosmology
Stars
Galaxies
Astrophysical fluids
grad cosmology
grad galaxies
Intro Statistics
K-12
undergraduategraduate
General relativity
PhysicsMath/Stat Astro
databases
algorithms
convex optimization
machine learning advanced probability
deep learning
Python
Javascript
stochastic processes
C++
CSProgramming
PhD specialization
Challenge of Data-Driven Domain Education Stack
Python Computing for Data Science
Graduate course at UC Berkeley
64%
36%
female
male
8%
4%
8%
12%
4%
12%
8%
16%
16%
12%Psychology
Astronomy
Neuroscience
Biostatistics
Physics
Chemical Engineering
ISchool
Earth and Planetary Sciences
Industrial Engineering
Mechanical Engineering
visualization
machine learning
database interaction
user interface & web frameworks
timeseries & numerical computing
interfacing to other languages
Bayesian inference & MCMC
hardware control
parallelism
2013 statistics
http://github.com/profjsb/python-seminar
Time domain preprocessing
- Start with raw photometry!
- Gaussian process detrending!
- Calibration!
- Petigura & Marcy 2012!
!
Transit search
- Matched filter!
- Similar to BLS algorithm (Kovcas+ 2002)!
- Leverages Fast-Folding Algorithm
O(N^2) → O(N log N) (Staelin+ 1968)!
!
Data validation
- Significant peaks in periodogram, but
inconsistent with exoplanet transit
TERRA – optimized for small planets
Detrended/calibrated photometry
TERRA
RawFlux(ppt)CalibratedFlux
Erik Petigura
Berkeley Astro
Grad Student
Prevalence of Earth-size planets orbiting Sun-like stars
Erik A. Petiguraa,b,1
, Andrew W. Howardb
, and Geoffrey W. Marcya
a
Astronomy Department, University of California, Berkeley, CA 94720; and b
Institute for Astronomy, University of Hawaii at Manoa, Honolulu, HI 96822
Contributed by Geoffrey W. Marcy, October 22, 2013 (sent for review October 18, 2013)
Determining whether Earth-like planets are common or rare looms
as a touchstone in the question of life in the universe. We searched
for Earth-size planets that cross in front of their host stars by
examining the brightness measurements of 42,000 stars from
National Aeronautics and Space Administration’s Kepler mission.
We found 603 planets, including 10 that are Earth size (1 − 2 R⊕)
and receive comparable levels of stellar energy to that of Earth
(0:25 − 4 F⊕). We account for Kepler’s imperfect detectability of
such planets by injecting synthetic planet–caused dimmings into
the Kepler brightness measurements and recording the fraction
detected. We find that 11 ± 4% of Sun-like stars harbor an Earth-
size planet receiving between one and four times the stellar inten-
sity as Earth. We also find that the occurrence of Earth-size planets is
constant with increasing orbital period (P), within equal intervals of
logP up to ∼200 d. Extrapolating, one finds 5:7+1:7
−2:2 % of Sun-like stars
harbor an Earth-size planet with orbital periods of 200–400 d.
extrasolar planets | astrobiology
The National Aeronautics and Space Administration’s (NASA’s)
Kepler mission was launched in 2009 to search for planets
that transit (cross in front of) their host stars (1–4). The resulting
dimming of the host stars is detectable by measuring their bright-
ness, and Kepler monitored the brightness of 150,000 stars every
30 min for 4 y. To date, this exoplanet survey has detected more
We searched for transiting planets in Kepler brightness mea-
surements using our custom-built TERRA software package
described in previous works (6, 9) and in SI Appendix. In brief,
TERRA conditions Kepler photometry in the time domain, re-
moving outliers, long timescale variability (>10 d), and systematic
errors common to a large number of stars. TERRA then searches
for transit signals by evaluating the signal-to-noise ratio (SNR) of
prospective transits over a finely spaced 3D grid of orbital period,
P, time of transit, t0, and transit duration, ΔT. This grid-based
search extends over the orbital period range of 0.5–400 d.
TERRA produced a list of “threshold crossing events” (TCEs)
that meet the key criterion of a photometric dimming SNR ratio
SNR > 12. Unfortunately, an unwieldy 16,227 TCEs met this cri-
terion, many of which are inconsistent with the periodic dimming
profile from a true transiting planet. Further vetting was performed
by automatically assessing which light curves were consistent with
theoretical models of transiting planets (10). We also visually
inspected each TCE light curve, retaining only those exhibiting a
consistent, periodic, box-shaped dimming, and rejecting those
caused by single epoch outliers, correlated noise, and other data
anomalies. The vetting process was applied homogeneously to all
TCEs and is described in further detail in SI Appendix.
To assess our vetting accuracy, we evaluated the 235 Kepler
objects of interest (KOIs) among Best42k stars having P > 50 d,
which had been found by the Kepler Project and identified as planet
candidates in the official Exoplanet Archive (exoplanetarchive.
RONOMY
Bootcamp/Seminar Alum
Python
DOE/NERSC computation
PNAS [2014]
ỉπ vs.
21st Century Eduction Mission:
Produce Gamma-shaped people
deep domain skill/knowledge/training
deep methodological knowledge/skill
deep domain or methodological skill/knowledge/training
strong methodological or domain knowledge/skill
Goal: empower teams of gamma’s to excel
“I	love	working	with	astronomers,	
since	their	data	is	worthless.”	
"Fleming had constructed a spyglass, by means
of which visible objects, though very distant
from the eye of the observer, were distinctly
seen as if nearby."
- Galileo Galilei (1610) 	-	Jim	Gray,	MicrosoT
A Rich History of Symbiosis
Astronomers co-opt tools… …while astro serves as a testbed
for computational exploration
Established CS/Stats/Math in Service
of novelty in domain science
vs.
Novelty in domain science driving & informing novelty
in CS/Stats/Math
“novelty2
problem”
https://medium.com/tech-talk/dd88857f662
"Automated feature extraction for
irregularly-sampled time series
using deep neural networks"
Naul, JSB+, in prep
Network for RNN+GRU
Deep Learning for (Astronomical) Timeseries Inference
Stats PhD from Stanford
https://www.forbes.com/sites/jillianscudder/2017/01/21/astroquizzical-care-planet-nine
The Planet 9 Opportunity
hypothesized to exist using mostly
public data about comet orbits +
deep domain knowledge +
sophisticated computing
Batygin & Brown 1601.05438
My Prediction: the discovery data of Planet X have already been
obtained & reside in existing public data archives. It will be a group
of clever astronomers & statisticians with a lot of compute resources
that will make the retrospective discovery of the century…
Summary
•DemocraIzing	trends	in	the	physical	sciences	
•Winners	will	be	those	domain	experts	with	superior	inferenIal	capabiliIes,	
not	themselves,	but	in	teams	
•Data	science	educaIon	for	domains	should	be	focused	on	learning	enough	
to	work	with	those	methodological	domains	
•Symbiosis:	Physical	domain	science	data/quesIons	can	also	foster	novel	
methodological	approaches
Emerging Needs & Opportunities in
Data-Intensive Domains: Astronomy
Joshua Bloom
@profjsb
Irvine, CA Mar 20, 2017
Thanks!

More Related Content

What's hot

Cosmology Research in China
Cosmology Research in ChinaCosmology Research in China
Cosmology Research in China
EnergyCosmicLaborato
 
Bayesian X-ray Spectral Analysis of the Symbiotic Star RT Cru
Bayesian X-ray Spectral Analysis of the Symbiotic Star RT CruBayesian X-ray Spectral Analysis of the Symbiotic Star RT Cru
Bayesian X-ray Spectral Analysis of the Symbiotic Star RT Cru
Ashkbiz Danehkar
 
earth_like_planet_presentation
earth_like_planet_presentationearth_like_planet_presentation
earth_like_planet_presentationtunde akinsanmi
 
Final Year Project - Observation and Characterisation of Exoplanets
Final Year Project - Observation and Characterisation of ExoplanetsFinal Year Project - Observation and Characterisation of Exoplanets
Final Year Project - Observation and Characterisation of ExoplanetsLucy Stickland
 
GaiaCal2014: Creating and Calibrating LSST Data Product
GaiaCal2014: Creating and Calibrating LSST Data ProductGaiaCal2014: Creating and Calibrating LSST Data Product
GaiaCal2014: Creating and Calibrating LSST Data Product
Mario Juric
 
tw1979_current_topics_paper
tw1979_current_topics_papertw1979_current_topics_paper
tw1979_current_topics_paperThomas Wigg
 
Probing the innermost_regions_of_agn_jets_and_their_magnetic_fields_with_radi...
Probing the innermost_regions_of_agn_jets_and_their_magnetic_fields_with_radi...Probing the innermost_regions_of_agn_jets_and_their_magnetic_fields_with_radi...
Probing the innermost_regions_of_agn_jets_and_their_magnetic_fields_with_radi...
Sérgio Sacani
 
Characterizing k2 planet_discoveries_a_super_earth_transiting_the_bright_k_dw...
Characterizing k2 planet_discoveries_a_super_earth_transiting_the_bright_k_dw...Characterizing k2 planet_discoveries_a_super_earth_transiting_the_bright_k_dw...
Characterizing k2 planet_discoveries_a_super_earth_transiting_the_bright_k_dw...
Sérgio Sacani
 
ADASS XXV: LSST DM - Building the Data System for the Era of Petascale Optica...
ADASS XXV: LSST DM - Building the Data System for the Era of Petascale Optica...ADASS XXV: LSST DM - Building the Data System for the Era of Petascale Optica...
ADASS XXV: LSST DM - Building the Data System for the Era of Petascale Optica...
Mario Juric
 
Comaskey_William_Poster_SULI_FALL_2014
Comaskey_William_Poster_SULI_FALL_2014Comaskey_William_Poster_SULI_FALL_2014
Comaskey_William_Poster_SULI_FALL_2014William Comaskey
 
A nearby yoiung_m_dwarf_with_wide_possibly_planetary_m_ass_companion
A nearby yoiung_m_dwarf_with_wide_possibly_planetary_m_ass_companionA nearby yoiung_m_dwarf_with_wide_possibly_planetary_m_ass_companion
A nearby yoiung_m_dwarf_with_wide_possibly_planetary_m_ass_companion
Sérgio Sacani
 
20131107 damasso great
20131107 damasso great20131107 damasso great
20131107 damasso greatOAVdA_APACHE
 
Insights to the Morphology of Planetary Nebulae from 3D Spectroscopy
Insights to the Morphology of Planetary Nebulae from 3D SpectroscopyInsights to the Morphology of Planetary Nebulae from 3D Spectroscopy
Insights to the Morphology of Planetary Nebulae from 3D Spectroscopy
Ashkbiz Danehkar
 
The parkes hi zone of avoidance survey
The parkes hi zone of avoidance surveyThe parkes hi zone of avoidance survey
The parkes hi zone of avoidance survey
Sérgio Sacani
 
A candidate super-Earth planet orbiting near the snow line of Barnard’s star
A candidate super-Earth planet orbiting near the snow line of Barnard’s starA candidate super-Earth planet orbiting near the snow line of Barnard’s star
A candidate super-Earth planet orbiting near the snow line of Barnard’s star
Sérgio Sacani
 
PCCC20 筑波大学計算科学研究センター「学際計算科学による最新の研究成果」
PCCC20 筑波大学計算科学研究センター「学際計算科学による最新の研究成果」PCCC20 筑波大学計算科学研究センター「学際計算科学による最新の研究成果」
PCCC20 筑波大学計算科学研究センター「学際計算科学による最新の研究成果」
PC Cluster Consortium
 
Eccentricity from transit_photometry_small_planets_in_kepler_multi_planet_sys...
Eccentricity from transit_photometry_small_planets_in_kepler_multi_planet_sys...Eccentricity from transit_photometry_small_planets_in_kepler_multi_planet_sys...
Eccentricity from transit_photometry_small_planets_in_kepler_multi_planet_sys...
Sérgio Sacani
 
LSST/DM: Building a Next Generation Survey Data Processing System
LSST/DM: Building a Next Generation Survey Data Processing SystemLSST/DM: Building a Next Generation Survey Data Processing System
LSST/DM: Building a Next Generation Survey Data Processing System
Mario Juric
 
SKA Regional Sciences Centres - A Platform for Global Astronomy
SKA Regional Sciences Centres - A Platform for Global AstronomySKA Regional Sciences Centres - A Platform for Global Astronomy
SKA Regional Sciences Centres - A Platform for Global Astronomy
EUDAT
 

What's hot (20)

Cosmology Research in China
Cosmology Research in ChinaCosmology Research in China
Cosmology Research in China
 
Bayesian X-ray Spectral Analysis of the Symbiotic Star RT Cru
Bayesian X-ray Spectral Analysis of the Symbiotic Star RT CruBayesian X-ray Spectral Analysis of the Symbiotic Star RT Cru
Bayesian X-ray Spectral Analysis of the Symbiotic Star RT Cru
 
earth_like_planet_presentation
earth_like_planet_presentationearth_like_planet_presentation
earth_like_planet_presentation
 
Final Year Project - Observation and Characterisation of Exoplanets
Final Year Project - Observation and Characterisation of ExoplanetsFinal Year Project - Observation and Characterisation of Exoplanets
Final Year Project - Observation and Characterisation of Exoplanets
 
GaiaCal2014: Creating and Calibrating LSST Data Product
GaiaCal2014: Creating and Calibrating LSST Data ProductGaiaCal2014: Creating and Calibrating LSST Data Product
GaiaCal2014: Creating and Calibrating LSST Data Product
 
tw1979_current_topics_paper
tw1979_current_topics_papertw1979_current_topics_paper
tw1979_current_topics_paper
 
Probing the innermost_regions_of_agn_jets_and_their_magnetic_fields_with_radi...
Probing the innermost_regions_of_agn_jets_and_their_magnetic_fields_with_radi...Probing the innermost_regions_of_agn_jets_and_their_magnetic_fields_with_radi...
Probing the innermost_regions_of_agn_jets_and_their_magnetic_fields_with_radi...
 
Characterizing k2 planet_discoveries_a_super_earth_transiting_the_bright_k_dw...
Characterizing k2 planet_discoveries_a_super_earth_transiting_the_bright_k_dw...Characterizing k2 planet_discoveries_a_super_earth_transiting_the_bright_k_dw...
Characterizing k2 planet_discoveries_a_super_earth_transiting_the_bright_k_dw...
 
ADASS XXV: LSST DM - Building the Data System for the Era of Petascale Optica...
ADASS XXV: LSST DM - Building the Data System for the Era of Petascale Optica...ADASS XXV: LSST DM - Building the Data System for the Era of Petascale Optica...
ADASS XXV: LSST DM - Building the Data System for the Era of Petascale Optica...
 
Comaskey_William_Poster_SULI_FALL_2014
Comaskey_William_Poster_SULI_FALL_2014Comaskey_William_Poster_SULI_FALL_2014
Comaskey_William_Poster_SULI_FALL_2014
 
A nearby yoiung_m_dwarf_with_wide_possibly_planetary_m_ass_companion
A nearby yoiung_m_dwarf_with_wide_possibly_planetary_m_ass_companionA nearby yoiung_m_dwarf_with_wide_possibly_planetary_m_ass_companion
A nearby yoiung_m_dwarf_with_wide_possibly_planetary_m_ass_companion
 
20131107 damasso great
20131107 damasso great20131107 damasso great
20131107 damasso great
 
Insights to the Morphology of Planetary Nebulae from 3D Spectroscopy
Insights to the Morphology of Planetary Nebulae from 3D SpectroscopyInsights to the Morphology of Planetary Nebulae from 3D Spectroscopy
Insights to the Morphology of Planetary Nebulae from 3D Spectroscopy
 
ESTEC_Oct_2015
ESTEC_Oct_2015ESTEC_Oct_2015
ESTEC_Oct_2015
 
The parkes hi zone of avoidance survey
The parkes hi zone of avoidance surveyThe parkes hi zone of avoidance survey
The parkes hi zone of avoidance survey
 
A candidate super-Earth planet orbiting near the snow line of Barnard’s star
A candidate super-Earth planet orbiting near the snow line of Barnard’s starA candidate super-Earth planet orbiting near the snow line of Barnard’s star
A candidate super-Earth planet orbiting near the snow line of Barnard’s star
 
PCCC20 筑波大学計算科学研究センター「学際計算科学による最新の研究成果」
PCCC20 筑波大学計算科学研究センター「学際計算科学による最新の研究成果」PCCC20 筑波大学計算科学研究センター「学際計算科学による最新の研究成果」
PCCC20 筑波大学計算科学研究センター「学際計算科学による最新の研究成果」
 
Eccentricity from transit_photometry_small_planets_in_kepler_multi_planet_sys...
Eccentricity from transit_photometry_small_planets_in_kepler_multi_planet_sys...Eccentricity from transit_photometry_small_planets_in_kepler_multi_planet_sys...
Eccentricity from transit_photometry_small_planets_in_kepler_multi_planet_sys...
 
LSST/DM: Building a Next Generation Survey Data Processing System
LSST/DM: Building a Next Generation Survey Data Processing SystemLSST/DM: Building a Next Generation Survey Data Processing System
LSST/DM: Building a Next Generation Survey Data Processing System
 
SKA Regional Sciences Centres - A Platform for Global Astronomy
SKA Regional Sciences Centres - A Platform for Global AstronomySKA Regional Sciences Centres - A Platform for Global Astronomy
SKA Regional Sciences Centres - A Platform for Global Astronomy
 

Viewers also liked

PyData 2015 Keynote: "A Systems View of Machine Learning"
PyData 2015 Keynote: "A Systems View of Machine Learning" PyData 2015 Keynote: "A Systems View of Machine Learning"
PyData 2015 Keynote: "A Systems View of Machine Learning"
Joshua Bloom
 
Forumpa2017 impari
Forumpa2017 impariForumpa2017 impari
Forumpa2017 impari
impari-scuola
 
Desarrollo sindical
Desarrollo sindicalDesarrollo sindical
Desarrollo sindical
Derecho Colectivo
 
5 domande (e risposte) per scegliere crm giusto
5 domande (e risposte) per scegliere crm giusto5 domande (e risposte) per scegliere crm giusto
5 domande (e risposte) per scegliere crm giusto
Stefano Marchetti
 
Sesión de cooperacion
Sesión de cooperacionSesión de cooperacion
Sesión de cooperacion
Shanaiss
 
Generación distribuida
Generación distribuidaGeneración distribuida
Generación distribuida
SolarAcademy_CIETUAI
 
Física 2º ano ensino médio ondas sonoras e efeito doppler
Física 2º ano ensino médio   ondas sonoras e efeito dopplerFísica 2º ano ensino médio   ondas sonoras e efeito doppler
Física 2º ano ensino médio ondas sonoras e efeito doppler
Tiago Gomes da Silva
 
Baño del recien nacido
Baño del recien nacidoBaño del recien nacido
Baño del recien nacido
jheriv
 
Cartilha previdência-seu-direito-está-em-risco-cut
Cartilha previdência-seu-direito-está-em-risco-cutCartilha previdência-seu-direito-está-em-risco-cut
Cartilha previdência-seu-direito-está-em-risco-cut
SINTE Regional
 
Pptexamples
PptexamplesPptexamples
Pptexamples
Li
 
Física 2º ano ensino médio ondulatória comprimento, frequência, amplitude e...
Física 2º ano ensino médio   ondulatória comprimento, frequência, amplitude e...Física 2º ano ensino médio   ondulatória comprimento, frequência, amplitude e...
Física 2º ano ensino médio ondulatória comprimento, frequência, amplitude e...
Tiago Gomes da Silva
 
Física 2º ano ensino médio ondulatória movimento harmônico simples e cinemá...
Física 2º ano ensino médio   ondulatória movimento harmônico simples e cinemá...Física 2º ano ensino médio   ondulatória movimento harmônico simples e cinemá...
Física 2º ano ensino médio ondulatória movimento harmônico simples e cinemá...
Tiago Gomes da Silva
 
Física 2º ano ensino médio ondulatória classificação das ondas
Física 2º ano ensino médio   ondulatória classificação das ondasFísica 2º ano ensino médio   ondulatória classificação das ondas
Física 2º ano ensino médio ondulatória classificação das ondas
Tiago Gomes da Silva
 
Evidencias cambio climático zbc
Evidencias cambio climático zbcEvidencias cambio climático zbc
Evidencias cambio climático zbc
María Angélica Peña
 
Motor de corriente eléctrica
Motor de corriente eléctricaMotor de corriente eléctrica
Motor de corriente eléctrica
Ruben Guerra
 
Sesión de coeducacion
Sesión de coeducacionSesión de coeducacion
Sesión de coeducacion
Shanaiss
 
Sesión desinhibicion
Sesión desinhibicionSesión desinhibicion
Sesión desinhibicion
Shanaiss
 
Física 2º ano ensino médio ondulatória equação de onda e princípio de super...
Física 2º ano ensino médio   ondulatória equação de onda e princípio de super...Física 2º ano ensino médio   ondulatória equação de onda e princípio de super...
Física 2º ano ensino médio ondulatória equação de onda e princípio de super...
Tiago Gomes da Silva
 
Optimising nfv service chains on open stack using docker
Optimising nfv service chains on open stack using dockerOptimising nfv service chains on open stack using docker
Optimising nfv service chains on open stack using docker
Rahul Krishna Upadhyaya
 
Nanotechnology lecture5
Nanotechnology lecture5Nanotechnology lecture5
Nanotechnology lecture5
Ninad Mehendale
 

Viewers also liked (20)

PyData 2015 Keynote: "A Systems View of Machine Learning"
PyData 2015 Keynote: "A Systems View of Machine Learning" PyData 2015 Keynote: "A Systems View of Machine Learning"
PyData 2015 Keynote: "A Systems View of Machine Learning"
 
Forumpa2017 impari
Forumpa2017 impariForumpa2017 impari
Forumpa2017 impari
 
Desarrollo sindical
Desarrollo sindicalDesarrollo sindical
Desarrollo sindical
 
5 domande (e risposte) per scegliere crm giusto
5 domande (e risposte) per scegliere crm giusto5 domande (e risposte) per scegliere crm giusto
5 domande (e risposte) per scegliere crm giusto
 
Sesión de cooperacion
Sesión de cooperacionSesión de cooperacion
Sesión de cooperacion
 
Generación distribuida
Generación distribuidaGeneración distribuida
Generación distribuida
 
Física 2º ano ensino médio ondas sonoras e efeito doppler
Física 2º ano ensino médio   ondas sonoras e efeito dopplerFísica 2º ano ensino médio   ondas sonoras e efeito doppler
Física 2º ano ensino médio ondas sonoras e efeito doppler
 
Baño del recien nacido
Baño del recien nacidoBaño del recien nacido
Baño del recien nacido
 
Cartilha previdência-seu-direito-está-em-risco-cut
Cartilha previdência-seu-direito-está-em-risco-cutCartilha previdência-seu-direito-está-em-risco-cut
Cartilha previdência-seu-direito-está-em-risco-cut
 
Pptexamples
PptexamplesPptexamples
Pptexamples
 
Física 2º ano ensino médio ondulatória comprimento, frequência, amplitude e...
Física 2º ano ensino médio   ondulatória comprimento, frequência, amplitude e...Física 2º ano ensino médio   ondulatória comprimento, frequência, amplitude e...
Física 2º ano ensino médio ondulatória comprimento, frequência, amplitude e...
 
Física 2º ano ensino médio ondulatória movimento harmônico simples e cinemá...
Física 2º ano ensino médio   ondulatória movimento harmônico simples e cinemá...Física 2º ano ensino médio   ondulatória movimento harmônico simples e cinemá...
Física 2º ano ensino médio ondulatória movimento harmônico simples e cinemá...
 
Física 2º ano ensino médio ondulatória classificação das ondas
Física 2º ano ensino médio   ondulatória classificação das ondasFísica 2º ano ensino médio   ondulatória classificação das ondas
Física 2º ano ensino médio ondulatória classificação das ondas
 
Evidencias cambio climático zbc
Evidencias cambio climático zbcEvidencias cambio climático zbc
Evidencias cambio climático zbc
 
Motor de corriente eléctrica
Motor de corriente eléctricaMotor de corriente eléctrica
Motor de corriente eléctrica
 
Sesión de coeducacion
Sesión de coeducacionSesión de coeducacion
Sesión de coeducacion
 
Sesión desinhibicion
Sesión desinhibicionSesión desinhibicion
Sesión desinhibicion
 
Física 2º ano ensino médio ondulatória equação de onda e princípio de super...
Física 2º ano ensino médio   ondulatória equação de onda e princípio de super...Física 2º ano ensino médio   ondulatória equação de onda e princípio de super...
Física 2º ano ensino médio ondulatória equação de onda e princípio de super...
 
Optimising nfv service chains on open stack using docker
Optimising nfv service chains on open stack using dockerOptimising nfv service chains on open stack using docker
Optimising nfv service chains on open stack using docker
 
Nanotechnology lecture5
Nanotechnology lecture5Nanotechnology lecture5
Nanotechnology lecture5
 

Similar to Data Science Education: Needs & Opportunities in Astronomy

Computational Training for Domain Scientists & Data Literacy
Computational Training for Domain Scientists & Data LiteracyComputational Training for Domain Scientists & Data Literacy
Computational Training for Domain Scientists & Data Literacy
Joshua Bloom
 
ExoSGAN and ExoACGAN: Exoplanet Detection using Adversarial Training Algorithms
ExoSGAN and ExoACGAN: Exoplanet Detection using Adversarial Training AlgorithmsExoSGAN and ExoACGAN: Exoplanet Detection using Adversarial Training Algorithms
ExoSGAN and ExoACGAN: Exoplanet Detection using Adversarial Training Algorithms
IRJET Journal
 
Teletransportacion marcikic2003
Teletransportacion  marcikic2003Teletransportacion  marcikic2003
Teletransportacion marcikic2003
Asesor De Tesis Doctorados
 
Locating Hidden Exoplanets in ALMA Data Using Machine Learning
Locating Hidden Exoplanets in ALMA Data Using Machine LearningLocating Hidden Exoplanets in ALMA Data Using Machine Learning
Locating Hidden Exoplanets in ALMA Data Using Machine Learning
Sérgio Sacani
 
The extremely high albedo of LTT 9779 b revealed by CHEOPS
The extremely high albedo of LTT 9779 b revealed by CHEOPSThe extremely high albedo of LTT 9779 b revealed by CHEOPS
The extremely high albedo of LTT 9779 b revealed by CHEOPS
Sérgio Sacani
 
ASTRONOMICAL OBJECTS DETECTION IN CELESTIAL BODIES USING COMPUTER VISION ALGO...
ASTRONOMICAL OBJECTS DETECTION IN CELESTIAL BODIES USING COMPUTER VISION ALGO...ASTRONOMICAL OBJECTS DETECTION IN CELESTIAL BODIES USING COMPUTER VISION ALGO...
ASTRONOMICAL OBJECTS DETECTION IN CELESTIAL BODIES USING COMPUTER VISION ALGO...
csandit
 
A Search for Technosignatures Around 11,680 Stars with the Green Bank Telesco...
A Search for Technosignatures Around 11,680 Stars with the Green Bank Telesco...A Search for Technosignatures Around 11,680 Stars with the Green Bank Telesco...
A Search for Technosignatures Around 11,680 Stars with the Green Bank Telesco...
Sérgio Sacani
 
The completeness-corrected rate of stellar encounters with the Sun from the f...
The completeness-corrected rate of stellar encounters with the Sun from the f...The completeness-corrected rate of stellar encounters with the Sun from the f...
The completeness-corrected rate of stellar encounters with the Sun from the f...
Sérgio Sacani
 
Locating Hidden Exoplanets in ALMA Data Using Machine Learning
Locating Hidden Exoplanets in ALMA Data Using Machine LearningLocating Hidden Exoplanets in ALMA Data Using Machine Learning
Locating Hidden Exoplanets in ALMA Data Using Machine Learning
Sérgio Sacani
 
Science with small telescopes - exoplanets
Science with small telescopes - exoplanetsScience with small telescopes - exoplanets
Science with small telescopes - exoplanets
guest8aa6ebb
 
The Possible Tidal Demise of Kepler’s First Planetary System
The Possible Tidal Demise of Kepler’s First Planetary SystemThe Possible Tidal Demise of Kepler’s First Planetary System
The Possible Tidal Demise of Kepler’s First Planetary System
Sérgio Sacani
 
H. Stefancic: The Accelerated Expansion of the Universe and the Cosmological ...
H. Stefancic: The Accelerated Expansion of the Universe and the Cosmological ...H. Stefancic: The Accelerated Expansion of the Universe and the Cosmological ...
H. Stefancic: The Accelerated Expansion of the Universe and the Cosmological ...
SEENET-MTP
 
Gwendolyn Eadie: A New Method for Time Series Analysis in Astronomy with an A...
Gwendolyn Eadie: A New Method for Time Series Analysis in Astronomy with an A...Gwendolyn Eadie: A New Method for Time Series Analysis in Astronomy with an A...
Gwendolyn Eadie: A New Method for Time Series Analysis in Astronomy with an A...
JeremyHeyl
 
DISSERTATIONJOAKIM676951
DISSERTATIONJOAKIM676951DISSERTATIONJOAKIM676951
DISSERTATIONJOAKIM676951Joakim Carlsen
 
Evidence for a_spectroscopic_direct_detection_of_reflected_light_from_51_peg_b
Evidence for a_spectroscopic_direct_detection_of_reflected_light_from_51_peg_bEvidence for a_spectroscopic_direct_detection_of_reflected_light_from_51_peg_b
Evidence for a_spectroscopic_direct_detection_of_reflected_light_from_51_peg_b
Sérgio Sacani
 
Effects of differing statistical methodologies on inferences about Earth-like...
Effects of differing statistical methodologies on inferences about Earth-like...Effects of differing statistical methodologies on inferences about Earth-like...
Effects of differing statistical methodologies on inferences about Earth-like...John Owens
 
A nearby m_star_with_three_transiting_super-earths_discovered_by_k2
A nearby m_star_with_three_transiting_super-earths_discovered_by_k2A nearby m_star_with_three_transiting_super-earths_discovered_by_k2
A nearby m_star_with_three_transiting_super-earths_discovered_by_k2
Sérgio Sacani
 
Detection of an atmosphere around the super earth 55 cancri e
Detection of an atmosphere around the super earth 55 cancri eDetection of an atmosphere around the super earth 55 cancri e
Detection of an atmosphere around the super earth 55 cancri e
Sérgio Sacani
 
Data Science at Berkeley
Data Science at BerkeleyData Science at Berkeley
Data Science at Berkeley
Joshua Bloom
 
Joshua Bloom Data Science at Berkeley
Joshua Bloom Data Science at BerkeleyJoshua Bloom Data Science at Berkeley
Joshua Bloom Data Science at Berkeley
PyData
 

Similar to Data Science Education: Needs & Opportunities in Astronomy (20)

Computational Training for Domain Scientists & Data Literacy
Computational Training for Domain Scientists & Data LiteracyComputational Training for Domain Scientists & Data Literacy
Computational Training for Domain Scientists & Data Literacy
 
ExoSGAN and ExoACGAN: Exoplanet Detection using Adversarial Training Algorithms
ExoSGAN and ExoACGAN: Exoplanet Detection using Adversarial Training AlgorithmsExoSGAN and ExoACGAN: Exoplanet Detection using Adversarial Training Algorithms
ExoSGAN and ExoACGAN: Exoplanet Detection using Adversarial Training Algorithms
 
Teletransportacion marcikic2003
Teletransportacion  marcikic2003Teletransportacion  marcikic2003
Teletransportacion marcikic2003
 
Locating Hidden Exoplanets in ALMA Data Using Machine Learning
Locating Hidden Exoplanets in ALMA Data Using Machine LearningLocating Hidden Exoplanets in ALMA Data Using Machine Learning
Locating Hidden Exoplanets in ALMA Data Using Machine Learning
 
The extremely high albedo of LTT 9779 b revealed by CHEOPS
The extremely high albedo of LTT 9779 b revealed by CHEOPSThe extremely high albedo of LTT 9779 b revealed by CHEOPS
The extremely high albedo of LTT 9779 b revealed by CHEOPS
 
ASTRONOMICAL OBJECTS DETECTION IN CELESTIAL BODIES USING COMPUTER VISION ALGO...
ASTRONOMICAL OBJECTS DETECTION IN CELESTIAL BODIES USING COMPUTER VISION ALGO...ASTRONOMICAL OBJECTS DETECTION IN CELESTIAL BODIES USING COMPUTER VISION ALGO...
ASTRONOMICAL OBJECTS DETECTION IN CELESTIAL BODIES USING COMPUTER VISION ALGO...
 
A Search for Technosignatures Around 11,680 Stars with the Green Bank Telesco...
A Search for Technosignatures Around 11,680 Stars with the Green Bank Telesco...A Search for Technosignatures Around 11,680 Stars with the Green Bank Telesco...
A Search for Technosignatures Around 11,680 Stars with the Green Bank Telesco...
 
The completeness-corrected rate of stellar encounters with the Sun from the f...
The completeness-corrected rate of stellar encounters with the Sun from the f...The completeness-corrected rate of stellar encounters with the Sun from the f...
The completeness-corrected rate of stellar encounters with the Sun from the f...
 
Locating Hidden Exoplanets in ALMA Data Using Machine Learning
Locating Hidden Exoplanets in ALMA Data Using Machine LearningLocating Hidden Exoplanets in ALMA Data Using Machine Learning
Locating Hidden Exoplanets in ALMA Data Using Machine Learning
 
Science with small telescopes - exoplanets
Science with small telescopes - exoplanetsScience with small telescopes - exoplanets
Science with small telescopes - exoplanets
 
The Possible Tidal Demise of Kepler’s First Planetary System
The Possible Tidal Demise of Kepler’s First Planetary SystemThe Possible Tidal Demise of Kepler’s First Planetary System
The Possible Tidal Demise of Kepler’s First Planetary System
 
H. Stefancic: The Accelerated Expansion of the Universe and the Cosmological ...
H. Stefancic: The Accelerated Expansion of the Universe and the Cosmological ...H. Stefancic: The Accelerated Expansion of the Universe and the Cosmological ...
H. Stefancic: The Accelerated Expansion of the Universe and the Cosmological ...
 
Gwendolyn Eadie: A New Method for Time Series Analysis in Astronomy with an A...
Gwendolyn Eadie: A New Method for Time Series Analysis in Astronomy with an A...Gwendolyn Eadie: A New Method for Time Series Analysis in Astronomy with an A...
Gwendolyn Eadie: A New Method for Time Series Analysis in Astronomy with an A...
 
DISSERTATIONJOAKIM676951
DISSERTATIONJOAKIM676951DISSERTATIONJOAKIM676951
DISSERTATIONJOAKIM676951
 
Evidence for a_spectroscopic_direct_detection_of_reflected_light_from_51_peg_b
Evidence for a_spectroscopic_direct_detection_of_reflected_light_from_51_peg_bEvidence for a_spectroscopic_direct_detection_of_reflected_light_from_51_peg_b
Evidence for a_spectroscopic_direct_detection_of_reflected_light_from_51_peg_b
 
Effects of differing statistical methodologies on inferences about Earth-like...
Effects of differing statistical methodologies on inferences about Earth-like...Effects of differing statistical methodologies on inferences about Earth-like...
Effects of differing statistical methodologies on inferences about Earth-like...
 
A nearby m_star_with_three_transiting_super-earths_discovered_by_k2
A nearby m_star_with_three_transiting_super-earths_discovered_by_k2A nearby m_star_with_three_transiting_super-earths_discovered_by_k2
A nearby m_star_with_three_transiting_super-earths_discovered_by_k2
 
Detection of an atmosphere around the super earth 55 cancri e
Detection of an atmosphere around the super earth 55 cancri eDetection of an atmosphere around the super earth 55 cancri e
Detection of an atmosphere around the super earth 55 cancri e
 
Data Science at Berkeley
Data Science at BerkeleyData Science at Berkeley
Data Science at Berkeley
 
Joshua Bloom Data Science at Berkeley
Joshua Bloom Data Science at BerkeleyJoshua Bloom Data Science at Berkeley
Joshua Bloom Data Science at Berkeley
 

Recently uploaded

Penicillin...........................pptx
Penicillin...........................pptxPenicillin...........................pptx
Penicillin...........................pptx
Cherry
 
Citrus Greening Disease and its Management
Citrus Greening Disease and its ManagementCitrus Greening Disease and its Management
Citrus Greening Disease and its Management
subedisuryaofficial
 
ESR_factors_affect-clinic significance-Pathysiology.pptx
ESR_factors_affect-clinic significance-Pathysiology.pptxESR_factors_affect-clinic significance-Pathysiology.pptx
ESR_factors_affect-clinic significance-Pathysiology.pptx
muralinath2
 
extra-chromosomal-inheritance[1].pptx.pdfpdf
extra-chromosomal-inheritance[1].pptx.pdfpdfextra-chromosomal-inheritance[1].pptx.pdfpdf
extra-chromosomal-inheritance[1].pptx.pdfpdf
DiyaBiswas10
 
Lab report on liquid viscosity of glycerin
Lab report on liquid viscosity of glycerinLab report on liquid viscosity of glycerin
Lab report on liquid viscosity of glycerin
ossaicprecious19
 
EY - Supply Chain Services 2018_template.pptx
EY - Supply Chain Services 2018_template.pptxEY - Supply Chain Services 2018_template.pptx
EY - Supply Chain Services 2018_template.pptx
AlguinaldoKong
 
THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.
THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.
THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.
Sérgio Sacani
 
GBSN - Biochemistry (Unit 5) Chemistry of Lipids
GBSN - Biochemistry (Unit 5) Chemistry of LipidsGBSN - Biochemistry (Unit 5) Chemistry of Lipids
GBSN - Biochemistry (Unit 5) Chemistry of Lipids
Areesha Ahmad
 
platelets- lifespan -Clot retraction-disorders.pptx
platelets- lifespan -Clot retraction-disorders.pptxplatelets- lifespan -Clot retraction-disorders.pptx
platelets- lifespan -Clot retraction-disorders.pptx
muralinath2
 
Predicting property prices with machine learning algorithms.pdf
Predicting property prices with machine learning algorithms.pdfPredicting property prices with machine learning algorithms.pdf
Predicting property prices with machine learning algorithms.pdf
binhminhvu04
 
GBSN- Microbiology (Lab 3) Gram Staining
GBSN- Microbiology (Lab 3) Gram StainingGBSN- Microbiology (Lab 3) Gram Staining
GBSN- Microbiology (Lab 3) Gram Staining
Areesha Ahmad
 
Structural Classification Of Protein (SCOP)
Structural Classification Of Protein  (SCOP)Structural Classification Of Protein  (SCOP)
Structural Classification Of Protein (SCOP)
aishnasrivastava
 
Nutraceutical market, scope and growth: Herbal drug technology
Nutraceutical market, scope and growth: Herbal drug technologyNutraceutical market, scope and growth: Herbal drug technology
Nutraceutical market, scope and growth: Herbal drug technology
Lokesh Patil
 
Richard's aventures in two entangled wonderlands
Richard's aventures in two entangled wonderlandsRichard's aventures in two entangled wonderlands
Richard's aventures in two entangled wonderlands
Richard Gill
 
Seminar of U.V. Spectroscopy by SAMIR PANDA
 Seminar of U.V. Spectroscopy by SAMIR PANDA Seminar of U.V. Spectroscopy by SAMIR PANDA
Seminar of U.V. Spectroscopy by SAMIR PANDA
SAMIR PANDA
 
insect morphology and physiology of insect
insect morphology and physiology of insectinsect morphology and physiology of insect
insect morphology and physiology of insect
anitaento25
 
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Sérgio Sacani
 
Multi-source connectivity as the driver of solar wind variability in the heli...
Multi-source connectivity as the driver of solar wind variability in the heli...Multi-source connectivity as the driver of solar wind variability in the heli...
Multi-source connectivity as the driver of solar wind variability in the heli...
Sérgio Sacani
 
Mammalian Pineal Body Structure and Also Functions
Mammalian Pineal Body Structure and Also FunctionsMammalian Pineal Body Structure and Also Functions
Mammalian Pineal Body Structure and Also Functions
YOGESH DOGRA
 
SCHIZOPHRENIA Disorder/ Brain Disorder.pdf
SCHIZOPHRENIA Disorder/ Brain Disorder.pdfSCHIZOPHRENIA Disorder/ Brain Disorder.pdf
SCHIZOPHRENIA Disorder/ Brain Disorder.pdf
SELF-EXPLANATORY
 

Recently uploaded (20)

Penicillin...........................pptx
Penicillin...........................pptxPenicillin...........................pptx
Penicillin...........................pptx
 
Citrus Greening Disease and its Management
Citrus Greening Disease and its ManagementCitrus Greening Disease and its Management
Citrus Greening Disease and its Management
 
ESR_factors_affect-clinic significance-Pathysiology.pptx
ESR_factors_affect-clinic significance-Pathysiology.pptxESR_factors_affect-clinic significance-Pathysiology.pptx
ESR_factors_affect-clinic significance-Pathysiology.pptx
 
extra-chromosomal-inheritance[1].pptx.pdfpdf
extra-chromosomal-inheritance[1].pptx.pdfpdfextra-chromosomal-inheritance[1].pptx.pdfpdf
extra-chromosomal-inheritance[1].pptx.pdfpdf
 
Lab report on liquid viscosity of glycerin
Lab report on liquid viscosity of glycerinLab report on liquid viscosity of glycerin
Lab report on liquid viscosity of glycerin
 
EY - Supply Chain Services 2018_template.pptx
EY - Supply Chain Services 2018_template.pptxEY - Supply Chain Services 2018_template.pptx
EY - Supply Chain Services 2018_template.pptx
 
THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.
THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.
THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.
 
GBSN - Biochemistry (Unit 5) Chemistry of Lipids
GBSN - Biochemistry (Unit 5) Chemistry of LipidsGBSN - Biochemistry (Unit 5) Chemistry of Lipids
GBSN - Biochemistry (Unit 5) Chemistry of Lipids
 
platelets- lifespan -Clot retraction-disorders.pptx
platelets- lifespan -Clot retraction-disorders.pptxplatelets- lifespan -Clot retraction-disorders.pptx
platelets- lifespan -Clot retraction-disorders.pptx
 
Predicting property prices with machine learning algorithms.pdf
Predicting property prices with machine learning algorithms.pdfPredicting property prices with machine learning algorithms.pdf
Predicting property prices with machine learning algorithms.pdf
 
GBSN- Microbiology (Lab 3) Gram Staining
GBSN- Microbiology (Lab 3) Gram StainingGBSN- Microbiology (Lab 3) Gram Staining
GBSN- Microbiology (Lab 3) Gram Staining
 
Structural Classification Of Protein (SCOP)
Structural Classification Of Protein  (SCOP)Structural Classification Of Protein  (SCOP)
Structural Classification Of Protein (SCOP)
 
Nutraceutical market, scope and growth: Herbal drug technology
Nutraceutical market, scope and growth: Herbal drug technologyNutraceutical market, scope and growth: Herbal drug technology
Nutraceutical market, scope and growth: Herbal drug technology
 
Richard's aventures in two entangled wonderlands
Richard's aventures in two entangled wonderlandsRichard's aventures in two entangled wonderlands
Richard's aventures in two entangled wonderlands
 
Seminar of U.V. Spectroscopy by SAMIR PANDA
 Seminar of U.V. Spectroscopy by SAMIR PANDA Seminar of U.V. Spectroscopy by SAMIR PANDA
Seminar of U.V. Spectroscopy by SAMIR PANDA
 
insect morphology and physiology of insect
insect morphology and physiology of insectinsect morphology and physiology of insect
insect morphology and physiology of insect
 
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
 
Multi-source connectivity as the driver of solar wind variability in the heli...
Multi-source connectivity as the driver of solar wind variability in the heli...Multi-source connectivity as the driver of solar wind variability in the heli...
Multi-source connectivity as the driver of solar wind variability in the heli...
 
Mammalian Pineal Body Structure and Also Functions
Mammalian Pineal Body Structure and Also FunctionsMammalian Pineal Body Structure and Also Functions
Mammalian Pineal Body Structure and Also Functions
 
SCHIZOPHRENIA Disorder/ Brain Disorder.pdf
SCHIZOPHRENIA Disorder/ Brain Disorder.pdfSCHIZOPHRENIA Disorder/ Brain Disorder.pdf
SCHIZOPHRENIA Disorder/ Brain Disorder.pdf
 

Data Science Education: Needs & Opportunities in Astronomy

  • 1. Emerging Needs & Opportunities in Data-Intensive Domains: Astronomy Joshua Bloom (@profjsb)
 Gordon & Betty Moore Foundation Data-Driven Investigator UC Berkeley, Astronomy Irvine, CA Mar 20, 2017 Presented to the National Academy of Sciences Roundtable on the "Intersection of Domain Expertise and Data Science"
  • 2. Data Decreasing cost to obtain, move, store Compute Decreasing cost, increasing specialty Technology/ Methodology Algorithmic innovation, software tooling Democratizing Trends in Physical Sciences open data, more freely shared more accessible open source
  • 3. Democratizing Trends in Physical Sciences Competition for superior inferential capabilities ‣ Statistical / machine learning capabilities ‣ Computational prowess ‣ Ability to innovate methodologies
  • 4. Astronomy's Data deluge - who wins? - examples of data science impact in astro Notable data-driven success with domain + methodologies Outline particle physics & gravitational wave astronomy Educational challenges given these trends Symbiotic relationship between data domains & methodological sciences
  • 5. Discovery of the Higgs Boson in 2012
  • 6. ‣ 600M collisions/s; only ~100/s are interesting ‣ 600 GB/s raw → 25 GB/s processed ‣ 3.5 MW data centers, Pb/day processing Data & Computation https://home.cern/about/computing Horvath 1310.6839 Inference ‣ "HEP phystatisticians" ‣ Statistics combining 
 standard model (theoretical underpinning)
 & observations 4.3 Compatibility of the observed state with the SM Higgs boson hypothesis SM σ/σBest fit -4 -2 0 2 4 ZZ (2 jets)→H ZZ (0/1 jet)→H (VH tag)ττ→H (VBF tag)ττ→H (0/1 jet)ττ→H WW (VH tag)→H WW (VBF tag)→H WW (0/1 jet)→H (VH tag)γγ→H (VBF tag)γγ→H (untagged)γγ→H bb (ttH tag)→H bb (VH tag)→H 0.14±= 0.80µ Combined -1 19.6 fb≤= 8 TeV, Ls-1 5.1 fb≤= 7 TeV, Ls CMS Preliminary = 0.94 SM p = 125.7 GeVHm bb→H 0.14±= 0.80µ Combined -1 19.6 fb≤= 8 TeV, Ls-1 5.1 fb≤= 7 TeV, Ls CMS Preliminary = 0.65 SM p = 125.7 GeVHm 0.14±= 0.80µ Combined = 7 TeV, Ls CMS Preli = 0.52 SM p CMS PAS HIG-13-005
  • 8. https://apod.nasa.gov/apod/ap160211.html ‣ aLIGO Data rate: 81 MB/s, 2PB/yr ‣ 105 templates to continually match (32TB) at 5 TFLOPS ‣ 20k CPUs, 12 PB cluster Shoemaker M060056-01.doc Data & Computation Inference ‣novel long-tail statistics ‣Need rapid identification
 for multi wavelength followup e.g. Cannon et al. 2012
  • 9. Lessons from Large-Scale Physics Experiments ‣ Strongly focused on limited number of high-impact discoveries ‣ Residual inventions (e.g., WWW, grid computing) ‣ Diverse Teams: physicists collaborating with methodological specialists ‣ Large collaborations producing science results themselves
  • 10. Astronomy's Data Deluge Large Synoptic Survey Telescope (LSST) - 2020 Light curves for 800M sources every 3 days 106 supernovae/yr, 105 eclipsing binaries 3.2 gigapixel camera, 20 TB/night LOFAR & SKA 150 Gps (27 Tflops) → 20 Pps (~100 Pflops) Many other astronomical surveys are already producing data: SDSS, iPTF, CRTS, Pan-STARRS, Hipparcos, OGLE, ASAS, Kepler, LINEAR, DES etc., •US Open Skies Policy •Investments in Public Data Dissemination •Unaffiliated Scientists are expected to reap rewards
  • 11. If anyone can get the Data, who wins? She who asks the right questions She who answer questions better & faster than others: ‣computational access ‣methodological inference (e.g., machine learning) ‣better story telling, dissemination of results ‣reproducibility (acceptance) domain expertise data science +
  • 13. Probabilistic Classification of 50k+ Variable Stars Shivvers,JSB,Richards MNRAS,2014 106 “DEB” candidates 12 new mass-radii 15 “RCB/DYP”
 candidates 8 new discoveries Triple # of Galactic DYPer Stars Miller, Richards, JSB,..ApJ 2012 5400 Spectroscopic Targets Miller, JSB, Richards,..ApJ 2015 Turn synoptic imagers into ~spectrographs
  • 14. L112 K. Schawinski et al. Figure 2. We show the results obtained for one example galaxy. From left to right: the original SDSS image, the degraded image with a worse PSF and higher noise level (indicating the PSF and noise level used), the image as recovered by the GAN, and for comparison, the result of a deconvolution. This figure visually illustrates the GAN’s ability to recover features that conventional deconvolutions cannot. Some unsupervised work…
  • 15. Calculus Linear Algebra Multivariate calculus Diff Equations Complex Analysis Special relativity Quantum mechanics Classical mechanics Thermo & Statistical Lab astrophysics Intro astronomy E&M Particle physics Cosmology Stars Galaxies Astrophysical fluids grad cosmology grad galaxies Intro Statistics K-12 undergraduategraduate General relativity PhysicsMath/Stat Astro PhD specialization Challenge of Data-Driven Domain Education Stack
  • 16. Calculus Linear Algebra Multivariate calculus Diff Equations Complex Analysis Special relativity Quantum mechanics Classical mechanics Thermo & Statistical Lab astrophysics Intro astronomy E&M Particle physics Cosmology Stars Galaxies Astrophysical fluids grad cosmology grad galaxies Intro Statistics K-12 undergraduategraduate General relativity PhysicsMath/Stat Astro databases algorithms convex optimization machine learning advanced probability deep learning Python Javascript stochastic processes C++ CSProgramming PhD specialization Challenge of Data-Driven Domain Education Stack
  • 17. Python Computing for Data Science Graduate course at UC Berkeley 64% 36% female male 8% 4% 8% 12% 4% 12% 8% 16% 16% 12%Psychology Astronomy Neuroscience Biostatistics Physics Chemical Engineering ISchool Earth and Planetary Sciences Industrial Engineering Mechanical Engineering visualization machine learning database interaction user interface & web frameworks timeseries & numerical computing interfacing to other languages Bayesian inference & MCMC hardware control parallelism 2013 statistics http://github.com/profjsb/python-seminar
  • 18. Time domain preprocessing - Start with raw photometry! - Gaussian process detrending! - Calibration! - Petigura & Marcy 2012! ! Transit search - Matched filter! - Similar to BLS algorithm (Kovcas+ 2002)! - Leverages Fast-Folding Algorithm O(N^2) → O(N log N) (Staelin+ 1968)! ! Data validation - Significant peaks in periodogram, but inconsistent with exoplanet transit TERRA – optimized for small planets Detrended/calibrated photometry TERRA RawFlux(ppt)CalibratedFlux Erik Petigura Berkeley Astro Grad Student Prevalence of Earth-size planets orbiting Sun-like stars Erik A. Petiguraa,b,1 , Andrew W. Howardb , and Geoffrey W. Marcya a Astronomy Department, University of California, Berkeley, CA 94720; and b Institute for Astronomy, University of Hawaii at Manoa, Honolulu, HI 96822 Contributed by Geoffrey W. Marcy, October 22, 2013 (sent for review October 18, 2013) Determining whether Earth-like planets are common or rare looms as a touchstone in the question of life in the universe. We searched for Earth-size planets that cross in front of their host stars by examining the brightness measurements of 42,000 stars from National Aeronautics and Space Administration’s Kepler mission. We found 603 planets, including 10 that are Earth size (1 − 2 R⊕) and receive comparable levels of stellar energy to that of Earth (0:25 − 4 F⊕). We account for Kepler’s imperfect detectability of such planets by injecting synthetic planet–caused dimmings into the Kepler brightness measurements and recording the fraction detected. We find that 11 ± 4% of Sun-like stars harbor an Earth- size planet receiving between one and four times the stellar inten- sity as Earth. We also find that the occurrence of Earth-size planets is constant with increasing orbital period (P), within equal intervals of logP up to ∼200 d. Extrapolating, one finds 5:7+1:7 −2:2 % of Sun-like stars harbor an Earth-size planet with orbital periods of 200–400 d. extrasolar planets | astrobiology The National Aeronautics and Space Administration’s (NASA’s) Kepler mission was launched in 2009 to search for planets that transit (cross in front of) their host stars (1–4). The resulting dimming of the host stars is detectable by measuring their bright- ness, and Kepler monitored the brightness of 150,000 stars every 30 min for 4 y. To date, this exoplanet survey has detected more We searched for transiting planets in Kepler brightness mea- surements using our custom-built TERRA software package described in previous works (6, 9) and in SI Appendix. In brief, TERRA conditions Kepler photometry in the time domain, re- moving outliers, long timescale variability (>10 d), and systematic errors common to a large number of stars. TERRA then searches for transit signals by evaluating the signal-to-noise ratio (SNR) of prospective transits over a finely spaced 3D grid of orbital period, P, time of transit, t0, and transit duration, ΔT. This grid-based search extends over the orbital period range of 0.5–400 d. TERRA produced a list of “threshold crossing events” (TCEs) that meet the key criterion of a photometric dimming SNR ratio SNR > 12. Unfortunately, an unwieldy 16,227 TCEs met this cri- terion, many of which are inconsistent with the periodic dimming profile from a true transiting planet. Further vetting was performed by automatically assessing which light curves were consistent with theoretical models of transiting planets (10). We also visually inspected each TCE light curve, retaining only those exhibiting a consistent, periodic, box-shaped dimming, and rejecting those caused by single epoch outliers, correlated noise, and other data anomalies. The vetting process was applied homogeneously to all TCEs and is described in further detail in SI Appendix. To assess our vetting accuracy, we evaluated the 235 Kepler objects of interest (KOIs) among Best42k stars having P > 50 d, which had been found by the Kepler Project and identified as planet candidates in the official Exoplanet Archive (exoplanetarchive. RONOMY Bootcamp/Seminar Alum Python DOE/NERSC computation PNAS [2014]
  • 19. ỉπ vs. 21st Century Eduction Mission: Produce Gamma-shaped people deep domain skill/knowledge/training deep methodological knowledge/skill deep domain or methodological skill/knowledge/training strong methodological or domain knowledge/skill Goal: empower teams of gamma’s to excel
  • 20. “I love working with astronomers, since their data is worthless.” "Fleming had constructed a spyglass, by means of which visible objects, though very distant from the eye of the observer, were distinctly seen as if nearby." - Galileo Galilei (1610) - Jim Gray, MicrosoT A Rich History of Symbiosis Astronomers co-opt tools… …while astro serves as a testbed for computational exploration
  • 21. Established CS/Stats/Math in Service of novelty in domain science vs. Novelty in domain science driving & informing novelty in CS/Stats/Math “novelty2 problem” https://medium.com/tech-talk/dd88857f662
  • 22. "Automated feature extraction for irregularly-sampled time series using deep neural networks" Naul, JSB+, in prep Network for RNN+GRU Deep Learning for (Astronomical) Timeseries Inference Stats PhD from Stanford
  • 23. https://www.forbes.com/sites/jillianscudder/2017/01/21/astroquizzical-care-planet-nine The Planet 9 Opportunity hypothesized to exist using mostly public data about comet orbits + deep domain knowledge + sophisticated computing Batygin & Brown 1601.05438 My Prediction: the discovery data of Planet X have already been obtained & reside in existing public data archives. It will be a group of clever astronomers & statisticians with a lot of compute resources that will make the retrospective discovery of the century…
  • 25. Emerging Needs & Opportunities in Data-Intensive Domains: Astronomy Joshua Bloom @profjsb Irvine, CA Mar 20, 2017 Thanks!