SlideShare a Scribd company logo
1 of 37
DISTRIBUTED DATA PROCESSING
USING SPARK IN RADIO ASTRONOMY
Panos Labropoulos
Bright Computing, Inc.
Sarod Yatawatta
ASTRON
Outline
• Science drivers
• Basics of radio interferometry
• Challenges
• How can Spark help?
Who are we?
• Panos Labropoulos
– Works at Bright Computing
– HPC, accelerators, high-speed
interconnects
– PhD Radio Astronomy, University of
Groningen
• Sarod Yatawatta
• Works at ASTRON
• Signal and Image processing
• PhD Electrical engineering, Drexel
The visible sky: Thermal emission processes
Cygnus A
Cassiopeia A
Moon Sun
The radio sky
Why radio?
• Thermal Radiation
• Synchrotron Radiation
– Relativistic e-
in magnetic fields
• Bremstrahlung
– “Breaking Radiation” e-
/ion
collisions
• Maser
– Microwave Laser e-
oscillations in
molecular clouds
• Atomic Transitions (emission spectra)
– Hydrogen e-
spin flip
Tools required: antennas, receivers (e.g. voltage amplifiers, ADCs, computers)
Similar principles to optical astronomy – different wavelengths
Basic characteristics of a telescope
• Oldest telescope: human eye
– Human pupil: 0.5 cm diameter, 0.05 deg. Resolution at
λ=600 nm (yellow)
• Radio telescope: 100m diameter, 1.4 deg. resolution at λ=2m
– 2.8 km diameter required for same resolution as human eye
– 250 km dimeter required for same resolution as an optical
telescope
Angular resolution and sensitivity proportional to size
Constructing a 250km parabolic dish is simply not feasible
Use smaller antennas and add-up signals using a computer
How many antennas? How big?
• Intensity follows an inverse square law
• Source 3 times further away become 9 times fainter
• Size of telescope depends on distance at which we want
to look objects at.
10.000 sq.m. VLA
10x further -> 100x bigger
1 sq.km.
Measurement process
x
s s
b
cg /sb⋅=τ
)(cos2 tEV ω=])(cos[1 gtEV τω −=
])2(cos)([cos gg tP ωτωωτ −+
multiply
average
The path lengths
from sensors
to multiplier are
assumed equal!
Geometric
Time Delay
Rapidly varying,
with zero mean
Unchanging)(cos gC PR ωτ=
plane wavefront
90 deg
])2sin()[sin( gg tP ωτωωτ −+
)sin( gs PR ωτ=
Ω=−= ⋅−
∫∫ dsIiRRV eSC
sb
b λi2π
)()( νυ
2-D FT of the sky’ s
brightness distribution
Examples of 1-Dimensional Visibilities
• Simple pictures are easy to make illustrating 1-dimensional visibilities.
Brightness Distribution Visibility Function
• Unresolved
Doubles
• Uniform
• Central
Peaked
Earth rotation synthesis
• An interferometer with N antennas has N ( N -1 ) / 2 “baseline” pairs
• Real interferometers are built on the surface of the earth – a rotating platform. From the
observer’s perspective, sources move across the sky, but from the source’s perspective the
antenna array is rotating in such a way that the baseline pair points form tracks over time
• u-v tracks have gaps: incomplete sampling of the 2-D transform / missing information
Antenna layout
360 min
Baseline pairs layout
amplitude phase
Example: synthesis image
Example: synthesis image
Square Kilometer Array
• 100x more sensitive, 1,000,000x faster imaging
• Up to 5 sq.km collecting area and 3500km baselines
• Sub-arcsecond resolution, large field-of-view
• 3 types of antennas, 2 locations, a single observatory
• No mechanical steering for low- and mid-frequency antennas, multiple beams
• Will address key questions of astronomy, cosmology and physics
• Ambitious IT project. Probably will be the first production exascale system
• Will be generating several exabytes of processed data per year
Five Key Science Areas for the SKA
Topic Goals
Probing the Dark Ages
1. Map out structure formation using HI from the era of
reionization (6 < z < 13)
2. Probe early star formation using high-z CO
3. Detect the first light-emitting sources
Gravity: Pulsars & Black Holes
1. Precision timing of pulsars to test theories of gravity
approaching the strong-field limit (NS-NS,
NS-BH binaries, incl Sgr A*)
2. Millisecond pulsar timing array for detecting long-
wavelength gravitational waves
Cosmic Structure
1. Understand dark energy [e.g. eqn. of state; W(z)]
2. Understand structure formation and galaxy evolution
3. Map and understand dark matter
Cosmic Magnetism
Determine the structure and origins of cosmic magnetic
fields (in galaxies and in the intergalactic medium) vs.
redshift z
The Cradle of Life
1. Understand the formation of Earth-like planets
2. Understand the chemistry of organic molecules and
their roles in planet formation and
generation of life
3. Detect signals from ET
+ serendipitous
discoveries
Site selection: determined by man-made interference
The majority of the radio spectrum has
been allocated
Only available radio-quit sites: south Africa and western Australia
(or the back side of the Moon)
Signal corruptions
• Astronomical radio signals are very weak
• Calibration is the name of the game
• Several sources of corruption
– Ionospheric Faraday Rotation/phase
– Antenna/station beam patterns
– Receiver gain and phase errors
• If not tackled, they will limited the sensitivity and therefore the
science
SKA data rates
..
Sparse AA
Dense AA
..
Central Processing Facility - CPF
User interface
via Internet...
To 250 AA Stations
DSP
...
DSP
To 1200 Dishes
...15m Dishes
16 Tb/s
10 Gb/s
Data
Time
Control
70-450 MHz
Wide FoV
0.4-1.4 GHz
Wide FoV
1.2-10 GHz
WB-Single
Pixel feeds
Tile &
Station
Processing
Optical
Data
links
...
AA slice
...
AA slice
...
AA slice
...Dish&AA+DishCorrelation
ProcessorBuffer
ProcessorBuffer
ProcessorBuffer
ProcessorBuffer
ProcessorBuffer
ProcessorBuffer
ProcessorBuffer
ProcessorBuffer
ProcessorBuffer
ProcessorBuffer
ProcessorBuffer
ProcessorBuffer
ProcessorBuffer
ProcessorBuffer
ProcessorBuffer
ProcessorBuffer
ProcessorBuffer
ProcessorBuffer
ProcessorBuffer
ProcessorBuffer
ProcessorBuffer
ProcessorBuffer
Dataswitch
......
Data
Archive
Science
Processors
Tb/s Gb/s Gb/s
...
...
Time
Standard
ImagingProcessors
Control Processors & User interface
Pb/s
Correlator UV Processors Image formation Archive
Aperture Array
Station
16 Tb/s
4 Pb/s
24 Tb/s
20 Gb/s
20 Gb/s
1PB/s
Science data processor pipeline
10 Pflop
1 Eflop
100 Pflop10 Tb/s 200 Pflop
10 Eflop
…
Incoming
Data from
collectors
Switch
Bufferstore
Switch
Bufferstore
BulkStoreBulkStore
Correlator
Beamformer
UV
Processor
Imaging:
Non-Imaging:
Corner
Turning
Course
Delays
Fine F-step/
Correlation
Visibility
Steering
Observation
Buffer
Gridding
Visibilities Imaging
Image
Storage
Corner
Turning
Course
Delays
Beamforming/
De-dispersion
Beam
Steering
Observation
Buffer
Time-series
Searching
Search
analysis
Object/timing
Storage
HPCscienceHPCscience
processingprocessing
Image
Processor
1000Tb/s 1 Eflop10 EB/ySKA 2
SKA 1 1 EB/y50 PB
10/1 TB/s
30x more computationally intensive
(deconvolution, gridding, prediction, FFTs)
90% of power consumption
Current data reduction scheme
• Software developed in the 80s/90s
• Fortran libraries
• Minimal use of multi-cores/No use of accelerators
• Design for interactive processing of single dataset/Support for scripted batched processing
• ipython/custom REPL
• Will not work for SKA
Why Spark
• Power consumption will be the most important differentiator –minimize movement
• supercomputer in the middle of the desert
• data locality – automatic work placement
• In-memory processing
• Iterative algorithms / Pipelines that enable piggybacking
• New algorithms that have modest inter-process communication
• Not all features of MPI are required -> Simple collectives: broadcast, aggregation
• Can MPI handle the increased pararellism available on exascale systems?
• Fault tolerance
• Current MTBF: 2 out of 600 nodes per week -> 300 nodes per week for SKA
• MPI applications need to be designed for fault-tolerance/checkpointing
• Straggler mitigation
• Friendly APIs
RDD
Fault Tolerance
• Bright cluster manager
• HDFS/Lustre/Ceph support
• Early warning via e-mail, SMS etc and automate actions
• Single plane of glass: monitor nodes, switches, PDUs, multiple clusters
• Hadoop/Spark-specific metrics and healthchecks
• Simple Python/REST/C++ APIs
Types of processing:
•Signal detection/extraction
•Object classification
•AGN spectral classification
•Time series
•Clustering
•LSS
•Class discovery
• RFI mitigation
•Transient detection
•Deconvolution
autoScan algorithm
Goldstein et al 2015
Types of algorithms:
•ANNs (this talk)
•Decision trees
•SVM
•Nearest neighbor
•Expectation maximization
•Non-linear optimization
Chapman et al 2012
How is data processed
o Datasets contain multiple sources of information
• Ionosphere, lightning, cosmic rays, ET?, interference, compact and extended astronomical sources
• Different algorithms for each
o Multidimensional data: spatial, time, frequency, polarization
o Pipeline should be able to chain different algorithms in order to extract as much information as possible with
minimal data movement
Case study: Calibration
Case study: Calibration
Case study: Calibration
Data distributed over a network: all existing
calibration algorithms work independently
We want a unified solution, collecting information from all
data
Case study: Calibration
• Consensus optimization: exploit smoothness of systematic
errors to find a unified solution
• Use of a fusion center is not essential, but easier
• Communication overhead is little, more important is robustness
and fault tolerance (can throw away lost data).
• Large rho makes problem convex
1. INITIALIZE
• Read data as Breeze BlockMatrix
• P sub-problems, each with its own copy of Z,Y
2. REPEAT until convergence or N_max_iter
1. Each worker solves:
2. Globally calculate average:
3. Broadcast Z to P sub-problems
Zb = sc.broadcast(Z)
4. Locally calculate Lagrange multiplier
Case study: Calibration
traditional distributed difference
Case study: Imaging
1. INITIALIZE
• a residual image set to the Fourier Transform of
the visibilities
• var row: Int = _, var col: Int = _
• (row , col, flux) MAX= tuple(l: Int, m: int)
• a Clean Component list to empty
2. WHILE ( img.getNoise < THERMAL_NOISE )
1. (row , col, flux) = img.getMax
2. resimg = resimg – g * PSF.imshift(l,m,size)
3. CCList.++ = (row , col, flux)
• linear scaling
• fits in memory
• easy to implement
• gridding/FFT can be offloaded to GPU via JNI
• Does not work well with extended sources, still useful for catalogs
and calibration
Imaging
33
• The “dirty” image is the convolution of the true underlying image with the PSF, corrupted by Gaussian random noise
Convolution Theorem
This pseudo-inverse cannot
be computed in the presence
of corruptions
Tikhonov Regularization
High noise strong regularization→
Imaging
34
Astrophysical sources are complication, but most radio sources have simple shapes, and occasionally irregular:
Use high SNR images from the VLA FIRST survey supplemented with simulated sources (elliptical Gaussians,
points sources, narrow gaussians) to construct a training pair set of 10.000 64x64 images
uncorrupted convolved noise added recovered
Challenges
• Spark is feature already but some useful components are not yet available
• JNI
• Analytical implementations needed
• Hard to debug and argue about performance sometimes
• Application specific formats
• Accelerators such as GPUs are almost mandatory – JNI
Conclusions
• New generation interferometers will enable us to do new science
• Data processing is the major challenge
• Is Spark the answer? To early to answer but maybe
• Distributed algorithms not only address computational issues, but are
more robust and accurate as well
• Consensus calibration is the best way forward
• Spark offers many of the tools need to construct radio astronomical
pipelines

More Related Content

What's hot

Parametric Time Domain Method for separation of Cloud and Drizzle for ARM Clo...
Parametric Time Domain Method for separation of Cloud and Drizzle for ARM Clo...Parametric Time Domain Method for separation of Cloud and Drizzle for ARM Clo...
Parametric Time Domain Method for separation of Cloud and Drizzle for ARM Clo...Pratik Ramdasi
 
Next-Generation Observatory: Fluorescence detector Array of Single Pixel Tele...
Next-Generation Observatory: Fluorescence detector Array of Single Pixel Tele...Next-Generation Observatory: Fluorescence detector Array of Single Pixel Tele...
Next-Generation Observatory: Fluorescence detector Array of Single Pixel Tele...Toshihiro FUJII
 
FAST実験2:新型大気蛍光望遠鏡の性能評価
FAST実験2:新型大気蛍光望遠鏡の性能評価FAST実験2:新型大気蛍光望遠鏡の性能評価
FAST実験2:新型大気蛍光望遠鏡の性能評価Toshihiro FUJII
 
Development of a prototype for Fluorescence detector Array of Single-pixel Te...
Development of a prototype for Fluorescence detector Array of Single-pixel Te...Development of a prototype for Fluorescence detector Array of Single-pixel Te...
Development of a prototype for Fluorescence detector Array of Single-pixel Te...Toshihiro FUJII
 
First results from a prototype for the Fluorescence detector Array of Single-...
First results from a prototype for the Fluorescence detector Array of Single-...First results from a prototype for the Fluorescence detector Array of Single-...
First results from a prototype for the Fluorescence detector Array of Single-...Toshihiro FUJII
 
Ralph.basilio
Ralph.basilioRalph.basilio
Ralph.basilioNASAPMC
 
Integral field spectroscopy
Integral field spectroscopyIntegral field spectroscopy
Integral field spectroscopyFernando Reche
 
次世代極高エネルギー宇宙線のための新型大気蛍光望遠鏡の開発
次世代極高エネルギー宇宙線のための新型大気蛍光望遠鏡の開発次世代極高エネルギー宇宙線のための新型大気蛍光望遠鏡の開発
次世代極高エネルギー宇宙線のための新型大気蛍光望遠鏡の開発Toshihiro FUJII
 
Cosmological Results from Planck
Cosmological Results from PlanckCosmological Results from Planck
Cosmological Results from PlanckCosmoAIMS Bassett
 
Mace2010
Mace2010Mace2010
Mace2010mnico61
 
SBAS-DInSAR processing on the ESA Geohazards Exploitation Platform
SBAS-DInSAR processing on the ESA Geohazards Exploitation PlatformSBAS-DInSAR processing on the ESA Geohazards Exploitation Platform
SBAS-DInSAR processing on the ESA Geohazards Exploitation PlatformEmmanuel Mathot
 
GoogleSky Status at Google
GoogleSky Status at GoogleGoogleSky Status at Google
GoogleSky Status at GoogleAlberto Conti
 
RE.SUN Validation (March 2013)
RE.SUN Validation (March 2013)RE.SUN Validation (March 2013)
RE.SUN Validation (March 2013)Carlos Pinto
 
Solar System Processing with LSST: A Status Update
Solar System Processing with LSST: A Status UpdateSolar System Processing with LSST: A Status Update
Solar System Processing with LSST: A Status UpdateMario Juric
 

What's hot (20)

Final_Croteau
Final_CroteauFinal_Croteau
Final_Croteau
 
Parametric Time Domain Method for separation of Cloud and Drizzle for ARM Clo...
Parametric Time Domain Method for separation of Cloud and Drizzle for ARM Clo...Parametric Time Domain Method for separation of Cloud and Drizzle for ARM Clo...
Parametric Time Domain Method for separation of Cloud and Drizzle for ARM Clo...
 
Next-Generation Observatory: Fluorescence detector Array of Single Pixel Tele...
Next-Generation Observatory: Fluorescence detector Array of Single Pixel Tele...Next-Generation Observatory: Fluorescence detector Array of Single Pixel Tele...
Next-Generation Observatory: Fluorescence detector Array of Single Pixel Tele...
 
FAST実験2:新型大気蛍光望遠鏡の性能評価
FAST実験2:新型大気蛍光望遠鏡の性能評価FAST実験2:新型大気蛍光望遠鏡の性能評価
FAST実験2:新型大気蛍光望遠鏡の性能評価
 
Development of a prototype for Fluorescence detector Array of Single-pixel Te...
Development of a prototype for Fluorescence detector Array of Single-pixel Te...Development of a prototype for Fluorescence detector Array of Single-pixel Te...
Development of a prototype for Fluorescence detector Array of Single-pixel Te...
 
First results from a prototype for the Fluorescence detector Array of Single-...
First results from a prototype for the Fluorescence detector Array of Single-...First results from a prototype for the Fluorescence detector Array of Single-...
First results from a prototype for the Fluorescence detector Array of Single-...
 
Ralph.basilio
Ralph.basilioRalph.basilio
Ralph.basilio
 
PanicO
PanicOPanicO
PanicO
 
Integral field spectroscopy
Integral field spectroscopyIntegral field spectroscopy
Integral field spectroscopy
 
次世代極高エネルギー宇宙線のための新型大気蛍光望遠鏡の開発
次世代極高エネルギー宇宙線のための新型大気蛍光望遠鏡の開発次世代極高エネルギー宇宙線のための新型大気蛍光望遠鏡の開発
次世代極高エネルギー宇宙線のための新型大気蛍光望遠鏡の開発
 
CLIM: Transition Workshop - Optimization Methods in Remote Sensing - Jessica...
CLIM: Transition Workshop - Optimization Methods in Remote Sensing  - Jessica...CLIM: Transition Workshop - Optimization Methods in Remote Sensing  - Jessica...
CLIM: Transition Workshop - Optimization Methods in Remote Sensing - Jessica...
 
Cosmological Results from Planck
Cosmological Results from PlanckCosmological Results from Planck
Cosmological Results from Planck
 
Zubrin nov99
Zubrin nov99Zubrin nov99
Zubrin nov99
 
Jarp big data_sydney_v7
Jarp big data_sydney_v7Jarp big data_sydney_v7
Jarp big data_sydney_v7
 
Spectroscopy
SpectroscopySpectroscopy
Spectroscopy
 
Mace2010
Mace2010Mace2010
Mace2010
 
SBAS-DInSAR processing on the ESA Geohazards Exploitation Platform
SBAS-DInSAR processing on the ESA Geohazards Exploitation PlatformSBAS-DInSAR processing on the ESA Geohazards Exploitation Platform
SBAS-DInSAR processing on the ESA Geohazards Exploitation Platform
 
GoogleSky Status at Google
GoogleSky Status at GoogleGoogleSky Status at Google
GoogleSky Status at Google
 
RE.SUN Validation (March 2013)
RE.SUN Validation (March 2013)RE.SUN Validation (March 2013)
RE.SUN Validation (March 2013)
 
Solar System Processing with LSST: A Status Update
Solar System Processing with LSST: A Status UpdateSolar System Processing with LSST: A Status Update
Solar System Processing with LSST: A Status Update
 

Similar to SPARK FOR RADIO ASTRONOMY DATA PROCESSING

Computational Tools for Multimessenger Astronomy in the Gravitational Wave Era
Computational Tools for Multimessenger Astronomy in the Gravitational Wave EraComputational Tools for Multimessenger Astronomy in the Gravitational Wave Era
Computational Tools for Multimessenger Astronomy in the Gravitational Wave EraUniversity of Illinois at Urbana-Champaign
 
Science with small telescopes - exoplanets
Science with small telescopes - exoplanetsScience with small telescopes - exoplanets
Science with small telescopes - exoplanetsguest8aa6ebb
 
120_SEM_Special_Topics.ppt
120_SEM_Special_Topics.ppt120_SEM_Special_Topics.ppt
120_SEM_Special_Topics.pptzaki194502
 
A coupled Electromagnetic-Mechanical analysis of next generation Radio Telesc...
A coupled Electromagnetic-Mechanical analysis of next generation Radio Telesc...A coupled Electromagnetic-Mechanical analysis of next generation Radio Telesc...
A coupled Electromagnetic-Mechanical analysis of next generation Radio Telesc...Altair
 
Maria's Poster Presentation
Maria's Poster PresentationMaria's Poster Presentation
Maria's Poster Presentationdukeousf
 
Gwendolyn Eadie: A New Method for Time Series Analysis in Astronomy with an A...
Gwendolyn Eadie: A New Method for Time Series Analysis in Astronomy with an A...Gwendolyn Eadie: A New Method for Time Series Analysis in Astronomy with an A...
Gwendolyn Eadie: A New Method for Time Series Analysis in Astronomy with an A...JeremyHeyl
 
MIRAS: the instrument aboard SMOS
MIRAS: the instrument aboard SMOSMIRAS: the instrument aboard SMOS
MIRAS: the instrument aboard SMOSadrianocamps
 
Basics of remote sensing, pk mani
Basics of remote sensing, pk maniBasics of remote sensing, pk mani
Basics of remote sensing, pk maniP.K. Mani
 
Novel Techniques & Connections Between High-Pressure Mineral Physics, Microto...
Novel Techniques & Connections Between High-Pressure Mineral Physics, Microto...Novel Techniques & Connections Between High-Pressure Mineral Physics, Microto...
Novel Techniques & Connections Between High-Pressure Mineral Physics, Microto...EarthCube
 
Combining remote sensing earth observations and in situ networks: detection o...
Combining remote sensing earth observations and in situ networks: detection o...Combining remote sensing earth observations and in situ networks: detection o...
Combining remote sensing earth observations and in situ networks: detection o...Integrated Carbon Observation System (ICOS)
 
Improving Physical Parametrizations in Climate Models using Machine Learning
Improving Physical Parametrizations in Climate Models using Machine LearningImproving Physical Parametrizations in Climate Models using Machine Learning
Improving Physical Parametrizations in Climate Models using Machine LearningNoah Brenowitz
 
Ill-posedness formulation of the emission source localization in the radio- d...
Ill-posedness formulation of the emission source localization in the radio- d...Ill-posedness formulation of the emission source localization in the radio- d...
Ill-posedness formulation of the emission source localization in the radio- d...Ahmed Ammar Rebai PhD
 
Solar Power Satellites - Part 2
Solar Power Satellites - Part 2Solar Power Satellites - Part 2
Solar Power Satellites - Part 2Jayanth Rajakumar
 
Applications Of Computer Science in Astronomy
Applications Of Computer Science in AstronomyApplications Of Computer Science in Astronomy
Applications Of Computer Science in AstronomyAhmed Abuzuraiq
 
MIRAS: The SMOS Instrument
MIRAS: The SMOS InstrumentMIRAS: The SMOS Instrument
MIRAS: The SMOS Instrumentadrianocamps
 
636896main gilland presentation
636896main gilland presentation636896main gilland presentation
636896main gilland presentationClifford Stone
 
Importance of SSPS in SDG and ESG, and importance of antennas in SSPS
Importance of SSPS in SDG and ESG, and importance of antennas in SSPSImportance of SSPS in SDG and ESG, and importance of antennas in SSPS
Importance of SSPS in SDG and ESG, and importance of antennas in SSPSAdvanced-Concepts-Team
 
Team ESAT Preliminary Design Review
Team ESAT Preliminary Design ReviewTeam ESAT Preliminary Design Review
Team ESAT Preliminary Design Reviewmarc_cattrell
 

Similar to SPARK FOR RADIO ASTRONOMY DATA PROCESSING (20)

Computational Tools for Multimessenger Astronomy in the Gravitational Wave Era
Computational Tools for Multimessenger Astronomy in the Gravitational Wave EraComputational Tools for Multimessenger Astronomy in the Gravitational Wave Era
Computational Tools for Multimessenger Astronomy in the Gravitational Wave Era
 
Science with small telescopes - exoplanets
Science with small telescopes - exoplanetsScience with small telescopes - exoplanets
Science with small telescopes - exoplanets
 
120_SEM_Special_Topics.ppt
120_SEM_Special_Topics.ppt120_SEM_Special_Topics.ppt
120_SEM_Special_Topics.ppt
 
A coupled Electromagnetic-Mechanical analysis of next generation Radio Telesc...
A coupled Electromagnetic-Mechanical analysis of next generation Radio Telesc...A coupled Electromagnetic-Mechanical analysis of next generation Radio Telesc...
A coupled Electromagnetic-Mechanical analysis of next generation Radio Telesc...
 
Maria's Poster Presentation
Maria's Poster PresentationMaria's Poster Presentation
Maria's Poster Presentation
 
Gwendolyn Eadie: A New Method for Time Series Analysis in Astronomy with an A...
Gwendolyn Eadie: A New Method for Time Series Analysis in Astronomy with an A...Gwendolyn Eadie: A New Method for Time Series Analysis in Astronomy with an A...
Gwendolyn Eadie: A New Method for Time Series Analysis in Astronomy with an A...
 
MIRAS: the instrument aboard SMOS
MIRAS: the instrument aboard SMOSMIRAS: the instrument aboard SMOS
MIRAS: the instrument aboard SMOS
 
Basics of remote sensing, pk mani
Basics of remote sensing, pk maniBasics of remote sensing, pk mani
Basics of remote sensing, pk mani
 
Novel Techniques & Connections Between High-Pressure Mineral Physics, Microto...
Novel Techniques & Connections Between High-Pressure Mineral Physics, Microto...Novel Techniques & Connections Between High-Pressure Mineral Physics, Microto...
Novel Techniques & Connections Between High-Pressure Mineral Physics, Microto...
 
Combining remote sensing earth observations and in situ networks: detection o...
Combining remote sensing earth observations and in situ networks: detection o...Combining remote sensing earth observations and in situ networks: detection o...
Combining remote sensing earth observations and in situ networks: detection o...
 
Improving Physical Parametrizations in Climate Models using Machine Learning
Improving Physical Parametrizations in Climate Models using Machine LearningImproving Physical Parametrizations in Climate Models using Machine Learning
Improving Physical Parametrizations in Climate Models using Machine Learning
 
Ill-posedness formulation of the emission source localization in the radio- d...
Ill-posedness formulation of the emission source localization in the radio- d...Ill-posedness formulation of the emission source localization in the radio- d...
Ill-posedness formulation of the emission source localization in the radio- d...
 
Solar Power Satellites - Part 2
Solar Power Satellites - Part 2Solar Power Satellites - Part 2
Solar Power Satellites - Part 2
 
Applications Of Computer Science in Astronomy
Applications Of Computer Science in AstronomyApplications Of Computer Science in Astronomy
Applications Of Computer Science in Astronomy
 
MIRAS: The SMOS Instrument
MIRAS: The SMOS InstrumentMIRAS: The SMOS Instrument
MIRAS: The SMOS Instrument
 
636896main gilland presentation
636896main gilland presentation636896main gilland presentation
636896main gilland presentation
 
Importance of SSPS in SDG and ESG, and importance of antennas in SSPS
Importance of SSPS in SDG and ESG, and importance of antennas in SSPSImportance of SSPS in SDG and ESG, and importance of antennas in SSPS
Importance of SSPS in SDG and ESG, and importance of antennas in SSPS
 
Stellar hydro
Stellar hydroStellar hydro
Stellar hydro
 
Team ESAT Preliminary Design Review
Team ESAT Preliminary Design ReviewTeam ESAT Preliminary Design Review
Team ESAT Preliminary Design Review
 
Xray interferometry
Xray interferometryXray interferometry
Xray interferometry
 

More from Spark Summit

FPGA-Based Acceleration Architecture for Spark SQL Qi Xie and Quanfu Wang
FPGA-Based Acceleration Architecture for Spark SQL Qi Xie and Quanfu Wang FPGA-Based Acceleration Architecture for Spark SQL Qi Xie and Quanfu Wang
FPGA-Based Acceleration Architecture for Spark SQL Qi Xie and Quanfu Wang Spark Summit
 
VEGAS: The Missing Matplotlib for Scala/Apache Spark with DB Tsai and Roger M...
VEGAS: The Missing Matplotlib for Scala/Apache Spark with DB Tsai and Roger M...VEGAS: The Missing Matplotlib for Scala/Apache Spark with DB Tsai and Roger M...
VEGAS: The Missing Matplotlib for Scala/Apache Spark with DB Tsai and Roger M...Spark Summit
 
Apache Spark Structured Streaming Helps Smart Manufacturing with Xiaochang Wu
Apache Spark Structured Streaming Helps Smart Manufacturing with  Xiaochang WuApache Spark Structured Streaming Helps Smart Manufacturing with  Xiaochang Wu
Apache Spark Structured Streaming Helps Smart Manufacturing with Xiaochang WuSpark Summit
 
Improving Traffic Prediction Using Weather Data with Ramya Raghavendra
Improving Traffic Prediction Using Weather Data  with Ramya RaghavendraImproving Traffic Prediction Using Weather Data  with Ramya Raghavendra
Improving Traffic Prediction Using Weather Data with Ramya RaghavendraSpark Summit
 
A Tale of Two Graph Frameworks on Spark: GraphFrames and Tinkerpop OLAP Artem...
A Tale of Two Graph Frameworks on Spark: GraphFrames and Tinkerpop OLAP Artem...A Tale of Two Graph Frameworks on Spark: GraphFrames and Tinkerpop OLAP Artem...
A Tale of Two Graph Frameworks on Spark: GraphFrames and Tinkerpop OLAP Artem...Spark Summit
 
No More Cumbersomeness: Automatic Predictive Modeling on Apache Spark Marcin ...
No More Cumbersomeness: Automatic Predictive Modeling on Apache Spark Marcin ...No More Cumbersomeness: Automatic Predictive Modeling on Apache Spark Marcin ...
No More Cumbersomeness: Automatic Predictive Modeling on Apache Spark Marcin ...Spark Summit
 
Apache Spark and Tensorflow as a Service with Jim Dowling
Apache Spark and Tensorflow as a Service with Jim DowlingApache Spark and Tensorflow as a Service with Jim Dowling
Apache Spark and Tensorflow as a Service with Jim DowlingSpark Summit
 
Apache Spark and Tensorflow as a Service with Jim Dowling
Apache Spark and Tensorflow as a Service with Jim DowlingApache Spark and Tensorflow as a Service with Jim Dowling
Apache Spark and Tensorflow as a Service with Jim DowlingSpark Summit
 
MMLSpark: Lessons from Building a SparkML-Compatible Machine Learning Library...
MMLSpark: Lessons from Building a SparkML-Compatible Machine Learning Library...MMLSpark: Lessons from Building a SparkML-Compatible Machine Learning Library...
MMLSpark: Lessons from Building a SparkML-Compatible Machine Learning Library...Spark Summit
 
Next CERN Accelerator Logging Service with Jakub Wozniak
Next CERN Accelerator Logging Service with Jakub WozniakNext CERN Accelerator Logging Service with Jakub Wozniak
Next CERN Accelerator Logging Service with Jakub WozniakSpark Summit
 
Powering a Startup with Apache Spark with Kevin Kim
Powering a Startup with Apache Spark with Kevin KimPowering a Startup with Apache Spark with Kevin Kim
Powering a Startup with Apache Spark with Kevin KimSpark Summit
 
Improving Traffic Prediction Using Weather Datawith Ramya Raghavendra
Improving Traffic Prediction Using Weather Datawith Ramya RaghavendraImproving Traffic Prediction Using Weather Datawith Ramya Raghavendra
Improving Traffic Prediction Using Weather Datawith Ramya RaghavendraSpark Summit
 
Hiding Apache Spark Complexity for Fast Prototyping of Big Data Applications—...
Hiding Apache Spark Complexity for Fast Prototyping of Big Data Applications—...Hiding Apache Spark Complexity for Fast Prototyping of Big Data Applications—...
Hiding Apache Spark Complexity for Fast Prototyping of Big Data Applications—...Spark Summit
 
How Nielsen Utilized Databricks for Large-Scale Research and Development with...
How Nielsen Utilized Databricks for Large-Scale Research and Development with...How Nielsen Utilized Databricks for Large-Scale Research and Development with...
How Nielsen Utilized Databricks for Large-Scale Research and Development with...Spark Summit
 
Spline: Apache Spark Lineage not Only for the Banking Industry with Marek Nov...
Spline: Apache Spark Lineage not Only for the Banking Industry with Marek Nov...Spline: Apache Spark Lineage not Only for the Banking Industry with Marek Nov...
Spline: Apache Spark Lineage not Only for the Banking Industry with Marek Nov...Spark Summit
 
Goal Based Data Production with Sim Simeonov
Goal Based Data Production with Sim SimeonovGoal Based Data Production with Sim Simeonov
Goal Based Data Production with Sim SimeonovSpark Summit
 
Preventing Revenue Leakage and Monitoring Distributed Systems with Machine Le...
Preventing Revenue Leakage and Monitoring Distributed Systems with Machine Le...Preventing Revenue Leakage and Monitoring Distributed Systems with Machine Le...
Preventing Revenue Leakage and Monitoring Distributed Systems with Machine Le...Spark Summit
 
Getting Ready to Use Redis with Apache Spark with Dvir Volk
Getting Ready to Use Redis with Apache Spark with Dvir VolkGetting Ready to Use Redis with Apache Spark with Dvir Volk
Getting Ready to Use Redis with Apache Spark with Dvir VolkSpark Summit
 
Deduplication and Author-Disambiguation of Streaming Records via Supervised M...
Deduplication and Author-Disambiguation of Streaming Records via Supervised M...Deduplication and Author-Disambiguation of Streaming Records via Supervised M...
Deduplication and Author-Disambiguation of Streaming Records via Supervised M...Spark Summit
 
MatFast: In-Memory Distributed Matrix Computation Processing and Optimization...
MatFast: In-Memory Distributed Matrix Computation Processing and Optimization...MatFast: In-Memory Distributed Matrix Computation Processing and Optimization...
MatFast: In-Memory Distributed Matrix Computation Processing and Optimization...Spark Summit
 

More from Spark Summit (20)

FPGA-Based Acceleration Architecture for Spark SQL Qi Xie and Quanfu Wang
FPGA-Based Acceleration Architecture for Spark SQL Qi Xie and Quanfu Wang FPGA-Based Acceleration Architecture for Spark SQL Qi Xie and Quanfu Wang
FPGA-Based Acceleration Architecture for Spark SQL Qi Xie and Quanfu Wang
 
VEGAS: The Missing Matplotlib for Scala/Apache Spark with DB Tsai and Roger M...
VEGAS: The Missing Matplotlib for Scala/Apache Spark with DB Tsai and Roger M...VEGAS: The Missing Matplotlib for Scala/Apache Spark with DB Tsai and Roger M...
VEGAS: The Missing Matplotlib for Scala/Apache Spark with DB Tsai and Roger M...
 
Apache Spark Structured Streaming Helps Smart Manufacturing with Xiaochang Wu
Apache Spark Structured Streaming Helps Smart Manufacturing with  Xiaochang WuApache Spark Structured Streaming Helps Smart Manufacturing with  Xiaochang Wu
Apache Spark Structured Streaming Helps Smart Manufacturing with Xiaochang Wu
 
Improving Traffic Prediction Using Weather Data with Ramya Raghavendra
Improving Traffic Prediction Using Weather Data  with Ramya RaghavendraImproving Traffic Prediction Using Weather Data  with Ramya Raghavendra
Improving Traffic Prediction Using Weather Data with Ramya Raghavendra
 
A Tale of Two Graph Frameworks on Spark: GraphFrames and Tinkerpop OLAP Artem...
A Tale of Two Graph Frameworks on Spark: GraphFrames and Tinkerpop OLAP Artem...A Tale of Two Graph Frameworks on Spark: GraphFrames and Tinkerpop OLAP Artem...
A Tale of Two Graph Frameworks on Spark: GraphFrames and Tinkerpop OLAP Artem...
 
No More Cumbersomeness: Automatic Predictive Modeling on Apache Spark Marcin ...
No More Cumbersomeness: Automatic Predictive Modeling on Apache Spark Marcin ...No More Cumbersomeness: Automatic Predictive Modeling on Apache Spark Marcin ...
No More Cumbersomeness: Automatic Predictive Modeling on Apache Spark Marcin ...
 
Apache Spark and Tensorflow as a Service with Jim Dowling
Apache Spark and Tensorflow as a Service with Jim DowlingApache Spark and Tensorflow as a Service with Jim Dowling
Apache Spark and Tensorflow as a Service with Jim Dowling
 
Apache Spark and Tensorflow as a Service with Jim Dowling
Apache Spark and Tensorflow as a Service with Jim DowlingApache Spark and Tensorflow as a Service with Jim Dowling
Apache Spark and Tensorflow as a Service with Jim Dowling
 
MMLSpark: Lessons from Building a SparkML-Compatible Machine Learning Library...
MMLSpark: Lessons from Building a SparkML-Compatible Machine Learning Library...MMLSpark: Lessons from Building a SparkML-Compatible Machine Learning Library...
MMLSpark: Lessons from Building a SparkML-Compatible Machine Learning Library...
 
Next CERN Accelerator Logging Service with Jakub Wozniak
Next CERN Accelerator Logging Service with Jakub WozniakNext CERN Accelerator Logging Service with Jakub Wozniak
Next CERN Accelerator Logging Service with Jakub Wozniak
 
Powering a Startup with Apache Spark with Kevin Kim
Powering a Startup with Apache Spark with Kevin KimPowering a Startup with Apache Spark with Kevin Kim
Powering a Startup with Apache Spark with Kevin Kim
 
Improving Traffic Prediction Using Weather Datawith Ramya Raghavendra
Improving Traffic Prediction Using Weather Datawith Ramya RaghavendraImproving Traffic Prediction Using Weather Datawith Ramya Raghavendra
Improving Traffic Prediction Using Weather Datawith Ramya Raghavendra
 
Hiding Apache Spark Complexity for Fast Prototyping of Big Data Applications—...
Hiding Apache Spark Complexity for Fast Prototyping of Big Data Applications—...Hiding Apache Spark Complexity for Fast Prototyping of Big Data Applications—...
Hiding Apache Spark Complexity for Fast Prototyping of Big Data Applications—...
 
How Nielsen Utilized Databricks for Large-Scale Research and Development with...
How Nielsen Utilized Databricks for Large-Scale Research and Development with...How Nielsen Utilized Databricks for Large-Scale Research and Development with...
How Nielsen Utilized Databricks for Large-Scale Research and Development with...
 
Spline: Apache Spark Lineage not Only for the Banking Industry with Marek Nov...
Spline: Apache Spark Lineage not Only for the Banking Industry with Marek Nov...Spline: Apache Spark Lineage not Only for the Banking Industry with Marek Nov...
Spline: Apache Spark Lineage not Only for the Banking Industry with Marek Nov...
 
Goal Based Data Production with Sim Simeonov
Goal Based Data Production with Sim SimeonovGoal Based Data Production with Sim Simeonov
Goal Based Data Production with Sim Simeonov
 
Preventing Revenue Leakage and Monitoring Distributed Systems with Machine Le...
Preventing Revenue Leakage and Monitoring Distributed Systems with Machine Le...Preventing Revenue Leakage and Monitoring Distributed Systems with Machine Le...
Preventing Revenue Leakage and Monitoring Distributed Systems with Machine Le...
 
Getting Ready to Use Redis with Apache Spark with Dvir Volk
Getting Ready to Use Redis with Apache Spark with Dvir VolkGetting Ready to Use Redis with Apache Spark with Dvir Volk
Getting Ready to Use Redis with Apache Spark with Dvir Volk
 
Deduplication and Author-Disambiguation of Streaming Records via Supervised M...
Deduplication and Author-Disambiguation of Streaming Records via Supervised M...Deduplication and Author-Disambiguation of Streaming Records via Supervised M...
Deduplication and Author-Disambiguation of Streaming Records via Supervised M...
 
MatFast: In-Memory Distributed Matrix Computation Processing and Optimization...
MatFast: In-Memory Distributed Matrix Computation Processing and Optimization...MatFast: In-Memory Distributed Matrix Computation Processing and Optimization...
MatFast: In-Memory Distributed Matrix Computation Processing and Optimization...
 

Recently uploaded

GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]📊 Markus Baersch
 
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfPredicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfBoston Institute of Analytics
 
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degreeyuu sss
 
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptSonatrach
 
Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Colleen Farrelly
 
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一fhwihughh
 
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptxNLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptxBoston Institute of Analytics
 
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一F La
 
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...dajasot375
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptxthyngster
 
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一F sss
 
RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfgstagge
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPramod Kumar Srivastava
 
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDINTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDRafezzaman
 
9654467111 Call Girls In Munirka Hotel And Home Service
9654467111 Call Girls In Munirka Hotel And Home Service9654467111 Call Girls In Munirka Hotel And Home Service
9654467111 Call Girls In Munirka Hotel And Home ServiceSapana Sha
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024thyngster
 
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档208367051
 
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...Florian Roscheck
 
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...Boston Institute of Analytics
 

Recently uploaded (20)

Call Girls in Saket 99530🔝 56974 Escort Service
Call Girls in Saket 99530🔝 56974 Escort ServiceCall Girls in Saket 99530🔝 56974 Escort Service
Call Girls in Saket 99530🔝 56974 Escort Service
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]
 
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfPredicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
 
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
 
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
 
Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024
 
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
 
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptxNLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
 
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
 
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
 
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
 
RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdf
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
 
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDINTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
 
9654467111 Call Girls In Munirka Hotel And Home Service
9654467111 Call Girls In Munirka Hotel And Home Service9654467111 Call Girls In Munirka Hotel And Home Service
9654467111 Call Girls In Munirka Hotel And Home Service
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
 
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
 
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
 
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
 

SPARK FOR RADIO ASTRONOMY DATA PROCESSING

  • 1. DISTRIBUTED DATA PROCESSING USING SPARK IN RADIO ASTRONOMY Panos Labropoulos Bright Computing, Inc. Sarod Yatawatta ASTRON
  • 2. Outline • Science drivers • Basics of radio interferometry • Challenges • How can Spark help?
  • 3. Who are we? • Panos Labropoulos – Works at Bright Computing – HPC, accelerators, high-speed interconnects – PhD Radio Astronomy, University of Groningen • Sarod Yatawatta • Works at ASTRON • Signal and Image processing • PhD Electrical engineering, Drexel
  • 4. The visible sky: Thermal emission processes
  • 5. Cygnus A Cassiopeia A Moon Sun The radio sky
  • 6. Why radio? • Thermal Radiation • Synchrotron Radiation – Relativistic e- in magnetic fields • Bremstrahlung – “Breaking Radiation” e- /ion collisions • Maser – Microwave Laser e- oscillations in molecular clouds • Atomic Transitions (emission spectra) – Hydrogen e- spin flip Tools required: antennas, receivers (e.g. voltage amplifiers, ADCs, computers)
  • 7. Similar principles to optical astronomy – different wavelengths
  • 8. Basic characteristics of a telescope • Oldest telescope: human eye – Human pupil: 0.5 cm diameter, 0.05 deg. Resolution at λ=600 nm (yellow) • Radio telescope: 100m diameter, 1.4 deg. resolution at λ=2m – 2.8 km diameter required for same resolution as human eye – 250 km dimeter required for same resolution as an optical telescope Angular resolution and sensitivity proportional to size Constructing a 250km parabolic dish is simply not feasible Use smaller antennas and add-up signals using a computer
  • 9. How many antennas? How big? • Intensity follows an inverse square law • Source 3 times further away become 9 times fainter • Size of telescope depends on distance at which we want to look objects at. 10.000 sq.m. VLA 10x further -> 100x bigger 1 sq.km.
  • 10. Measurement process x s s b cg /sb⋅=τ )(cos2 tEV ω=])(cos[1 gtEV τω −= ])2(cos)([cos gg tP ωτωωτ −+ multiply average The path lengths from sensors to multiplier are assumed equal! Geometric Time Delay Rapidly varying, with zero mean Unchanging)(cos gC PR ωτ= plane wavefront 90 deg ])2sin()[sin( gg tP ωτωωτ −+ )sin( gs PR ωτ= Ω=−= ⋅− ∫∫ dsIiRRV eSC sb b λi2π )()( νυ 2-D FT of the sky’ s brightness distribution
  • 11. Examples of 1-Dimensional Visibilities • Simple pictures are easy to make illustrating 1-dimensional visibilities. Brightness Distribution Visibility Function • Unresolved Doubles • Uniform • Central Peaked
  • 12. Earth rotation synthesis • An interferometer with N antennas has N ( N -1 ) / 2 “baseline” pairs • Real interferometers are built on the surface of the earth – a rotating platform. From the observer’s perspective, sources move across the sky, but from the source’s perspective the antenna array is rotating in such a way that the baseline pair points form tracks over time • u-v tracks have gaps: incomplete sampling of the 2-D transform / missing information Antenna layout 360 min Baseline pairs layout amplitude phase
  • 15. Square Kilometer Array • 100x more sensitive, 1,000,000x faster imaging • Up to 5 sq.km collecting area and 3500km baselines • Sub-arcsecond resolution, large field-of-view • 3 types of antennas, 2 locations, a single observatory • No mechanical steering for low- and mid-frequency antennas, multiple beams • Will address key questions of astronomy, cosmology and physics • Ambitious IT project. Probably will be the first production exascale system • Will be generating several exabytes of processed data per year
  • 16.
  • 17. Five Key Science Areas for the SKA Topic Goals Probing the Dark Ages 1. Map out structure formation using HI from the era of reionization (6 < z < 13) 2. Probe early star formation using high-z CO 3. Detect the first light-emitting sources Gravity: Pulsars & Black Holes 1. Precision timing of pulsars to test theories of gravity approaching the strong-field limit (NS-NS, NS-BH binaries, incl Sgr A*) 2. Millisecond pulsar timing array for detecting long- wavelength gravitational waves Cosmic Structure 1. Understand dark energy [e.g. eqn. of state; W(z)] 2. Understand structure formation and galaxy evolution 3. Map and understand dark matter Cosmic Magnetism Determine the structure and origins of cosmic magnetic fields (in galaxies and in the intergalactic medium) vs. redshift z The Cradle of Life 1. Understand the formation of Earth-like planets 2. Understand the chemistry of organic molecules and their roles in planet formation and generation of life 3. Detect signals from ET + serendipitous discoveries
  • 18.
  • 19. Site selection: determined by man-made interference The majority of the radio spectrum has been allocated Only available radio-quit sites: south Africa and western Australia (or the back side of the Moon)
  • 20. Signal corruptions • Astronomical radio signals are very weak • Calibration is the name of the game • Several sources of corruption – Ionospheric Faraday Rotation/phase – Antenna/station beam patterns – Receiver gain and phase errors • If not tackled, they will limited the sensitivity and therefore the science
  • 21. SKA data rates .. Sparse AA Dense AA .. Central Processing Facility - CPF User interface via Internet... To 250 AA Stations DSP ... DSP To 1200 Dishes ...15m Dishes 16 Tb/s 10 Gb/s Data Time Control 70-450 MHz Wide FoV 0.4-1.4 GHz Wide FoV 1.2-10 GHz WB-Single Pixel feeds Tile & Station Processing Optical Data links ... AA slice ... AA slice ... AA slice ...Dish&AA+DishCorrelation ProcessorBuffer ProcessorBuffer ProcessorBuffer ProcessorBuffer ProcessorBuffer ProcessorBuffer ProcessorBuffer ProcessorBuffer ProcessorBuffer ProcessorBuffer ProcessorBuffer ProcessorBuffer ProcessorBuffer ProcessorBuffer ProcessorBuffer ProcessorBuffer ProcessorBuffer ProcessorBuffer ProcessorBuffer ProcessorBuffer ProcessorBuffer ProcessorBuffer Dataswitch ...... Data Archive Science Processors Tb/s Gb/s Gb/s ... ... Time Standard ImagingProcessors Control Processors & User interface Pb/s Correlator UV Processors Image formation Archive Aperture Array Station 16 Tb/s 4 Pb/s 24 Tb/s 20 Gb/s 20 Gb/s 1PB/s
  • 22. Science data processor pipeline 10 Pflop 1 Eflop 100 Pflop10 Tb/s 200 Pflop 10 Eflop … Incoming Data from collectors Switch Bufferstore Switch Bufferstore BulkStoreBulkStore Correlator Beamformer UV Processor Imaging: Non-Imaging: Corner Turning Course Delays Fine F-step/ Correlation Visibility Steering Observation Buffer Gridding Visibilities Imaging Image Storage Corner Turning Course Delays Beamforming/ De-dispersion Beam Steering Observation Buffer Time-series Searching Search analysis Object/timing Storage HPCscienceHPCscience processingprocessing Image Processor 1000Tb/s 1 Eflop10 EB/ySKA 2 SKA 1 1 EB/y50 PB 10/1 TB/s 30x more computationally intensive (deconvolution, gridding, prediction, FFTs) 90% of power consumption
  • 23. Current data reduction scheme • Software developed in the 80s/90s • Fortran libraries • Minimal use of multi-cores/No use of accelerators • Design for interactive processing of single dataset/Support for scripted batched processing • ipython/custom REPL • Will not work for SKA
  • 24. Why Spark • Power consumption will be the most important differentiator –minimize movement • supercomputer in the middle of the desert • data locality – automatic work placement • In-memory processing • Iterative algorithms / Pipelines that enable piggybacking • New algorithms that have modest inter-process communication • Not all features of MPI are required -> Simple collectives: broadcast, aggregation • Can MPI handle the increased pararellism available on exascale systems? • Fault tolerance • Current MTBF: 2 out of 600 nodes per week -> 300 nodes per week for SKA • MPI applications need to be designed for fault-tolerance/checkpointing • Straggler mitigation • Friendly APIs RDD
  • 25. Fault Tolerance • Bright cluster manager • HDFS/Lustre/Ceph support • Early warning via e-mail, SMS etc and automate actions • Single plane of glass: monitor nodes, switches, PDUs, multiple clusters • Hadoop/Spark-specific metrics and healthchecks • Simple Python/REST/C++ APIs
  • 26. Types of processing: •Signal detection/extraction •Object classification •AGN spectral classification •Time series •Clustering •LSS •Class discovery • RFI mitigation •Transient detection •Deconvolution autoScan algorithm Goldstein et al 2015 Types of algorithms: •ANNs (this talk) •Decision trees •SVM •Nearest neighbor •Expectation maximization •Non-linear optimization Chapman et al 2012 How is data processed o Datasets contain multiple sources of information • Ionosphere, lightning, cosmic rays, ET?, interference, compact and extended astronomical sources • Different algorithms for each o Multidimensional data: spatial, time, frequency, polarization o Pipeline should be able to chain different algorithms in order to extract as much information as possible with minimal data movement
  • 29. Case study: Calibration Data distributed over a network: all existing calibration algorithms work independently We want a unified solution, collecting information from all data
  • 30. Case study: Calibration • Consensus optimization: exploit smoothness of systematic errors to find a unified solution • Use of a fusion center is not essential, but easier • Communication overhead is little, more important is robustness and fault tolerance (can throw away lost data). • Large rho makes problem convex 1. INITIALIZE • Read data as Breeze BlockMatrix • P sub-problems, each with its own copy of Z,Y 2. REPEAT until convergence or N_max_iter 1. Each worker solves: 2. Globally calculate average: 3. Broadcast Z to P sub-problems Zb = sc.broadcast(Z) 4. Locally calculate Lagrange multiplier
  • 31. Case study: Calibration traditional distributed difference
  • 32. Case study: Imaging 1. INITIALIZE • a residual image set to the Fourier Transform of the visibilities • var row: Int = _, var col: Int = _ • (row , col, flux) MAX= tuple(l: Int, m: int) • a Clean Component list to empty 2. WHILE ( img.getNoise < THERMAL_NOISE ) 1. (row , col, flux) = img.getMax 2. resimg = resimg – g * PSF.imshift(l,m,size) 3. CCList.++ = (row , col, flux) • linear scaling • fits in memory • easy to implement • gridding/FFT can be offloaded to GPU via JNI • Does not work well with extended sources, still useful for catalogs and calibration
  • 33. Imaging 33 • The “dirty” image is the convolution of the true underlying image with the PSF, corrupted by Gaussian random noise Convolution Theorem This pseudo-inverse cannot be computed in the presence of corruptions Tikhonov Regularization High noise strong regularization→
  • 35. Astrophysical sources are complication, but most radio sources have simple shapes, and occasionally irregular: Use high SNR images from the VLA FIRST survey supplemented with simulated sources (elliptical Gaussians, points sources, narrow gaussians) to construct a training pair set of 10.000 64x64 images uncorrupted convolved noise added recovered
  • 36. Challenges • Spark is feature already but some useful components are not yet available • JNI • Analytical implementations needed • Hard to debug and argue about performance sometimes • Application specific formats • Accelerators such as GPUs are almost mandatory – JNI
  • 37. Conclusions • New generation interferometers will enable us to do new science • Data processing is the major challenge • Is Spark the answer? To early to answer but maybe • Distributed algorithms not only address computational issues, but are more robust and accurate as well • Consensus calibration is the best way forward • Spark offers many of the tools need to construct radio astronomical pipelines

Editor's Notes

  1. Key science areas as they appear in New Astronomy Reviews, with some highlights by myself.