Big data from the LHC commissioning: practical lessons from big science - Simon Metson (Cloudant)

Big Data from the LHC
Commissioning

!

Practical Lessons from Big Science
Simon/@drsm79

Time at places I’ve worked
Bristol University

Cloudant

Python

Perl

Bash

C++

Java

Javascript

Fortran

100

75

50

25

0
2002

2003

2004

2005

2006

2007

2008

2009

2010

2011

2012

2013

The formula
Fixed

G* E

Fixed

Usually ﬁxed

The formula
Grant * Effectiveness

The life of LHC data
1. Detected by experiment

2. “Online” filtering (hardware and software)

3. Transferred to CERN main campus, archived & reconstructed

4. Transferred to T1 sites, archived, reconstructed & skimmed

5. Transferred to T2 sites, reconstructed, skimmed, filtered & analysed

6. Written into locally analysable files, put on laptops

7. Turned into a plot in a paper

Chain u p se rie s o f
“ato m smashe rs”

Pu t se nsitive cam eras in
aw kw ard places

Process data on
high end
machines
http://www.chilton-computing.org.uk

CMS online data ﬂow
We have a big digital camera

It takes photos of this

courtesy of James Jackson

which come out like this

courtesy of James Jackson


Which goes into lots of
computers (the HLT)


computers (the HLT)
disk (the Storage Manager)

CMS data ﬂow
Write to digital
We have a big HLT at camera
~200GB/s
Write to Storage
computers ~2GB/s
(the HLT)
Manager at
Write to T0 at ~2GB/s
disk (the Storage Manager)




5. Transferred to T2 sites, reconstructed, skimmed, ﬁltered &
analysed


To process all the data
taken in one year on
one computer would
take ~64,000 years

Analysis
• Each analysis is ~unique

• Query language is C++

• Runs on distributed system and local resources

• Series of “cut” selections to identify interesting
events

• Data in the ﬁnal plot may be substantially
reduced from the original dataset

Workﬂow ladder
Number of users
Large datasets (>100 TB)

Complex computation
Large datasets (>100 TB)

Simple computation
Shared datasets (>500 GB)

Complex computation
Shared datasets (10-500 GB)

Complex computation

Simple computation
Shared datasets (0.1-10 GB)

Simple computation
Private datasets (0.1-10 GB)

Simple computation

}
}
}

Use Grid compute and storage

exclusively

Work on departmental resources,

store resulting datasets to Grid storage

Work on laptop/desktop machine,


The life of LHC
simulated data
1. Simulated by experimentalists at T0/T1/T2 sites

2. Transferred to T1 sites, archived possibly reconstructed &
skimmed

3. Transferred to T2 sites, reconstructed, skimmed, ﬁltered &
analysed



!

“We are going to die, and that makes us the
lucky ones. Most people are never going to
die because they are never going to be born.”
!

- Richard Dawkins

Setup
• Maybe a bit different to other people

• Many sites (>100) with >100’s TB storage,
10000’s worker nodes

• Global system

• Why not at one site?

• politics, power budget, cost

We Have a “Big Data”
Problem

We Have a Big “Data
Problem”

Do what you do best,
out source the rest

What's interesting is
that big data isn't
interesting any more

Define and refine
workflows

Our situation

•

Expert users, who are not
interested in infrastructure

• Will work around things they
perceive as unnecessary
limitations

How to engage
disruptive users?

Our situation
• Limited resources for integration/
testbed style activities

• Strange organisation

Keep things as local as
possible

Deﬁning monitoring is
difﬁcult

Recognise, embrace and
communicate failures

People are harder than
computers

Consequences
• Automate all the things

• Learn to love a conﬁguration management
system

• Make sure everyone in the team knows
how to interact with it

• Simple human solutions go a long way

Workﬂow ladder
Number of users
Large datasets (100 TB)

Complex computation
Large datasets (100 TB)

Simple computation
Shared datasets (500 GB)

Complex computation

Complex computation

Simple computation
Shared datasets (0.1-10 GB)

Simple computation
Private datasets (0.1-10 GB)

Simple computation

}
}
}

Use Grid compute and storage

exclusively

Work on departmental resources,


Work on laptop/desktop machine,


Big data from the LHC commissioning: practical lessons from big science - Simon Metson (Cloudant)

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Viewers also liked

Viewers also liked (20)

Similar to Big data from the LHC commissioning: practical lessons from big science - Simon Metson (Cloudant)

Similar to Big data from the LHC commissioning: practical lessons from big science - Simon Metson (Cloudant) (20)

More from jaxLondonConference

More from jaxLondonConference (19)

Recently uploaded

Recently uploaded (20)

Big data from the LHC commissioning: practical lessons from big science - Simon Metson (Cloudant)