The document discusses the challenges of big data in astronomy. It describes several upcoming and current astronomical surveys that will produce enormous amounts of data, such as Euclid which will collect 15 terabytes per day starting in 2022. Dealing with such large datasets requires new techniques for storage, processing, analyzing, classifying, compressing and visualizing the data. Machine learning and citizen science are helping to process the data more efficiently. Future surveys like the Square Kilometer Array telescope will pose even greater challenges by producing data at terabytes per second.
1. The
Dark Matter
Mystery
The
big data universe.
Literally
D r M a g g i e L i e u
@space_mog
The
big data universe.
Literally
Bruno Merin & Beatriz Martinez
46. Citizen science
‣ Outsource tasks to the general public
‣ Zooniverse platform: easy to build projects
‣ 100’s of Projects
‣ 250M classifications
‣ 2M Volunteers
50. Data & model compression
‣ Neural networks & emulators
Emulate the halo mass
function with mixture
density networks,
Lieu+in. prep
Emulate cosmology with
neural density estimators,
Alsing+2019
Scaling relations with
principle component
analyses PCA,
Bothwell+2016
65. Biggest challenges in Astronomy
‣ Collecting the data
‣ Retrieval
‣ Filtering good from bad
‣ Data Storage
‣ Distributing the data
‣ Upload/Download
‣ Combining data
‣ complementary observations and multi-wavelength observations
‣ Data analysis
‣ Compression
‣ Source detection
‣ Visualising the data
Let’s collaborate!