SlideShare a Scribd company logo
1 of 40
Download to read offline
The	
  Matsu	
  Project	
  
Robert	
  L.	
  Grossman	
  
University	
  of	
  Chicago	
  
Open	
  Cloud	
  ConsorAum	
  
June	
  18,	
  2013	
  
The	
  Matsu	
  Project	
  represents	
  work	
  by	
  Collin	
  
BenneL,	
  Robert	
  L.	
  Grossman,	
  	
  MaLhew	
  Handy,	
  
Vuong	
  Ly,	
  Dan	
  Mandl,	
  Ryan	
  Miller,	
  Jim	
  Pivarski,	
  
Ray	
  Powell	
  and	
  Steve	
  Vejcik.	
  
	
  
What	
  is	
  the	
  Matsu	
  Project?	
  
Matsu	
  is	
  an	
  open	
  source	
  
project	
  for	
  processing	
  satellite	
  
imagery	
  to	
  support	
  earth	
  
sciences	
  researchers	
  using	
  a	
  
community	
  science	
  cloud.	
  
Matsu	
  is	
  a	
  joint	
  project	
  
between	
  the	
  Open	
  Cloud	
  
ConsorAum	
  and	
  NASA’s	
  EO-­‐1	
  
Mission	
  (Dan	
  Mandl,	
  Lead)	
  
matsu.opensciencedatacloud.org	
  
EO-­‐1	
  mission	
  
•  Approved	
  in	
  March	
  
1996	
  and	
  launched	
  on	
  
November	
  21,	
  2000	
  
from	
  Vandenburg	
  Air	
  
Force	
  Base,	
  California	
  
on	
  a	
  Delta	
  7320	
  	
  
•  All	
  technologies	
  were	
  
flight-­‐validated	
  by	
  
December	
  2001	
  
•  EO-­‐1	
  is	
  now	
  in	
  an	
  
Extended	
  Mission	
  
EO-­‐1’s	
  ALI	
  and	
  Hyperion	
  Instruments	
  
Data	
  -­‐	
  Instruments	
  
	
  
•  Hyperion	
  Imaging	
  Spectrometer	
  
– Designed	
  to	
  gather	
  data	
  from	
  a	
  given	
  region	
  on	
  
the	
  Earth	
  by	
  viewing	
  the	
  surface	
  in	
  terms	
  of	
  242	
  
disAnct	
  'bands'	
  of	
  light.	
  
•  Advanced	
  Land	
  Imager	
  (ALI)	
  
– Used	
  to	
  validate	
  and	
  demonstrate	
  technology	
  for	
  
the	
  Landsat	
  Data	
  ConAnuity	
  Mission	
  (LDCM)	
  
All	
  available	
  L1G	
  images	
  (2010-­‐now)	
  
1.	
  Open	
  Science	
  Data	
  
Cloud	
  (OSDC)	
  stores	
  
Level	
  0	
  data	
  from	
  EO-­‐1	
  
and	
  uses	
  an	
  OpenStack-­‐
based	
  cloud	
  to	
  create	
  
Level	
  1	
  data.	
  
2.	
  OSDC	
  also	
  
provides	
  OpenStack	
  
resources	
  for	
  the	
  
Nambia	
  Flood	
  
Dashboard	
  
developed	
  by	
  Dan	
  
Mandl’s	
  team.	
  
3.	
  Project	
  Matsu	
  uses	
  
a	
  Hadoop/Accumulo	
  
system	
  to	
  run	
  
analyAcs	
  nightly	
  and	
  
to	
  create	
  Ales	
  with	
  
OGC-­‐compliant	
  
WMTS.	
  
NASA’s	
  Matsu	
  Mashup	
  
OSDC	
  Satellites	
  
•  EO-­‐1	
  (2012)	
  
•  Landsat7	
  –	
  GLS	
  2000	
  	
  (2013)	
  
•  MODIS	
  (2013)	
  	
  
•  TBD	
  (2014)	
  
•  TBD	
  (2015)	
  
Matsu	
  Web	
  Map	
  Tile	
  Service	
  
It	
  is	
  easy	
  to	
  layer	
  analyAcs	
  over	
  the	
  Web	
  Map	
  Tile	
  
Service	
  (WMTS).	
  	
  Here	
  is	
  one	
  idenAfying	
  CO2	
  
Matsu	
  Hadoop	
  Architecture	
  
Hadoop	
  HDFS	
  
Matsu	
  Web	
  Map	
  Tile	
  
Service	
  
Matsu	
  MR-­‐based	
  
Tiling	
  Service	
  
NoSQL	
  
Database(Accumulo)	
  
Images	
  at	
  different	
  zoom	
  layers	
  
suitable	
  for	
  OGC	
  Web	
  Mapping	
  Server	
  
Level	
  0,	
  Level	
  1	
  and	
  Level	
  2	
  
images	
  
MapReduce	
  used	
  to	
  process	
  Level	
  n	
  to	
  Level	
  n+1	
  
data	
  and	
  to	
  parAAon	
  images	
  for	
  different	
  zoom	
  
levels	
  
NoSQL-­‐based	
  
AnalyAc	
  Services	
  
Streaming	
  AnalyAc	
  
Services	
  
MR-­‐based	
  AnalyAc	
  
Services	
  
AnalyAc	
  Services	
   Storage	
  for	
  WMTS	
  Ales	
  and	
  
derived	
  data	
  products	
  
PresentaAon	
  Services	
  
Web	
  Coverage	
  
Processing	
  Service	
  
(WCPS)	
  
Workflow	
  Services	
  
Zoom	
  Levels	
  
Zoom	
  Level	
  1:	
  4	
  images	
   Zoom	
  Level	
  2:	
  16	
  images	
  
Zoom	
  Level	
  3:	
  64	
  images	
   Zoom	
  Level	
  4:	
  256	
  images	
  
Mapper	
  Input	
  Key:	
  Bounding	
  Box	
  
Mapper	
  Input	
  Value:	
  
Mapper	
  Output	
  Key:	
  Bounding	
  Box	
  
Mapper	
  Output	
  Value:	
  
Mapper	
  resizes	
  and/or	
  cuts	
  up	
  the	
  original	
  
image	
  into	
  pieces	
  to	
  output	
  Bounding	
  Boxes	
  
(minx	
  =	
  -­‐135.0	
  miny	
  =	
  45.0	
  maxx	
  =	
  -­‐112.5	
  maxy	
  =	
  67.5)	
  
Step	
  1:	
  Input	
  to	
  Mapper	
  
Step	
  2:	
  Processing	
  in	
  Mapper	
   Step	
  3:	
  Mapper	
  Output	
  
Mapper	
  Output	
  Key:	
  Bounding	
  Box	
  
Mapper	
  Output	
  Value:	
  
Mapper	
  Output	
  Key:	
  Bounding	
  Box	
  
Mapper	
  Output	
  Value:	
  
Mapper	
  Output	
  Key:	
  Bounding	
  Box	
  
Mapper	
  Output	
  Value:	
  
Mapper	
  Output	
  Key:	
  Bounding	
  Box	
  
Mapper	
  Output	
  Value:	
  
Mapper	
  Output	
  Key:	
  Bounding	
  Box	
  
Mapper	
  Output	
  Value:	
  
Mapper	
  Output	
  Key:	
  Bounding	
  Box	
  
Mapper	
  Output	
  Value:	
  
Mapper	
  Output	
  Key:	
  Bounding	
  Box	
  
Mapper	
  Output	
  Value:	
  
Build	
  Tile	
  Cache:	
  Map	
  
Reducer	
  Key	
  Input:	
  Bounding	
  Box	
  
(minx	
  =	
  -­‐45.0	
  miny	
  =	
  -­‐2.8125	
  maxx	
  =	
  -­‐43.59375	
  maxy	
  =	
  -­‐2.109375)	
  
Reducer	
  Value	
  Input:	
  
Step	
  1:	
  Input	
  to	
  Reducer	
  
…	
  
Step	
  2:	
  Reducer	
  Output	
  
Assemble	
  Images	
  based	
  on	
  bounding	
  box	
  
•  Reducer	
  
assembles	
  Ales	
  
at	
  each	
  zoom	
  
level	
  
•  Tiles	
  wriLen	
  to	
  
Accumulo	
  (a	
  
NoSQL	
  
database)	
  
Build	
  Tile	
  Cache:	
  Reduce	
  
Map	
  Phase	
  
•  Map	
  
– Read	
  in	
  images	
  by	
  Bands,	
  Date,	
  and	
  Region	
  
– Fix	
  a	
  zoom	
  level	
  for	
  sending	
  to	
  reducers	
  
•  Based	
  on	
  number	
  of	
  reducers	
  and	
  processing	
  power,	
  
not	
  on	
  the	
  zoom	
  you	
  want	
  for	
  display	
  
– Emit	
  as	
  <key>,	
  <value>	
  
•  Key	
  =	
  <Bounding	
  Box	
  at	
  Fixed	
  Zoom	
  Level>	
  
•  Value	
  =	
  <Bounding	
  Bounding	
  Box	
  at	
  Smallest	
  Zoom	
  	
  	
  
	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  Level,	
  Bands,	
  ProjecAon,	
  Timestamp,	
  
	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  Image	
  Bytes>	
  
Reduce	
  Phase	
  
•  All	
  bytes	
  for	
  
bands	
  and	
  
satellite	
  strips	
  in	
  
this	
  bounding	
  box	
  
are	
  mapped	
  to	
  
the	
  same	
  reducer	
  
	
  
•  The	
  key	
  can	
  be	
  
idenAfied	
  by	
  the	
  
Lat/Long	
  of	
  the	
  	
  
upper	
  right	
  
corner	
  of	
  the	
  box	
  
Level	
  1	
  Images	
  -­‐	
  Details	
  
•  Satellite	
  track	
  images	
  (L1R)	
  are	
  rotated	
  and	
  
geolocated	
  (L1G)	
  by	
  NASA	
  
•  We	
  overlay	
  L1G	
  images	
  into	
  Level-­‐2	
  dyadic	
  Ales	
  
in	
  Map-­‐Reduce	
  
locaAon	
  in	
  
Google	
  Maps	
  
L1R	
   L1G	
   Level-­‐2	
  Ales	
  made	
  in	
  Map-­‐Reduce,	
  prepared	
  for	
  WMS	
  
T06-­‐00097-­‐00092	
  
T10-­‐01561-­‐01486	
  
Some	
  example	
  images	
  
Gobi	
  Desert	
  
•  same	
  as	
  previous	
  
page	
  
•  contains	
  some	
  
strange	
  structures	
  
that	
  are	
  too	
  small	
  
to	
  spaAally	
  resolve	
  
with	
  Hyperion,	
  
but	
  they	
  might	
  
have	
  interesAng	
  
spectral	
  features	
  
Some	
  example	
  images	
  
Karijini,	
  Australia	
  
•  lots	
  of	
  colorful	
  
minerals	
  
•  should	
  have	
  a	
  very	
  
rich	
  spectrum	
  
Some	
  example	
  images	
  
Lake	
  Frome,	
  Australia	
  
•  salt	
  bed	
  is	
  a	
  standard	
  
calibraAon	
  target	
  
Atacama	
  Desert,	
  Chile	
  
•  salt	
  bed	
  in	
  the	
  driest	
  part	
  
of	
  the	
  world	
  
•  CO2	
  has	
  three	
  
absorbAon	
  lines	
  
within	
  Hyperion’s	
  
spectral	
  range	
  
•  Sideband	
  subtracAon	
  
technique	
  extracts	
  a	
  
pure	
  sample	
  of	
  data	
  in	
  
a	
  peak	
  by	
  fisng	
  
nearby	
  datapoints	
  to	
  a	
  
curve	
  and	
  subtracAng	
  
peak	
  values	
  from	
  the	
  
curve	
  
•  In	
  this	
  case,	
  we	
  invert	
  
the	
  subtracAon	
  
because	
  it’s	
  an	
  anA-­‐
peak	
  
External	
  
Reference	
  
Algebraic	
  combinaAon	
  of	
  spectral	
  
bands	
  to	
  make	
  a	
  more	
  sensiAve	
  image	
  
•  CO2	
  has	
  three	
  
absorbAon	
  lines	
  
within	
  Hyperion’s	
  
spectral	
  range	
  
•  Sideband	
  subtracAon	
  
technique	
  extracts	
  a	
  
pure	
  sample	
  of	
  data	
  in	
  
a	
  peak	
  by	
  fisng	
  
nearby	
  datapoints	
  to	
  a	
  
curve	
  and	
  subtracAng	
  
peak	
  values	
  from	
  the	
  
curve	
  
•  In	
  this	
  case,	
  we	
  invert	
  
the	
  subtracAon	
  
because	
  it’s	
  an	
  anA-­‐
peak	
  
Algebraic	
  combinaAon	
  of	
  spectral	
  
bands	
  to	
  make	
  a	
  more	
  sensiAve	
  image	
  
two	
  bands	
  in	
  
the	
  CO2	
  line	
  
Algebraic	
  combinaAon	
  of	
  spectral	
  
bands	
  to	
  make	
  a	
  more	
  sensiAve	
  image	
  
•  Icelandic	
  
volcano	
  in	
  
April	
  2010	
  
(Eyjatallajökull)	
  
•  Visible	
  frame	
  is	
  
full	
  of	
  ash	
  clouds	
  
•  CO2	
  distribuAon	
  is	
  
non-­‐uniform	
  
•  Some	
  CO2	
  
	
  acAvity	
  follows	
  
	
  	
  visible	
  cloud	
  
	
  	
  	
  formaAons,	
  
	
  	
  	
  	
  some	
  doesn’t	
  
Algebraic	
  combinaAon	
  of	
  spectral	
  
bands	
  to	
  make	
  a	
  more	
  sensiAve	
  image	
  
•  Some	
  CO2	
  
	
  acAvity	
  follows	
  
	
  	
  visible	
  cloud	
  
	
  	
  	
  formaAons,	
  
	
  	
  	
  	
  some	
  doesn’t	
  
Python	
  code	
  used	
  to	
  produce	
  this	
  image	
  (vectors	
  in	
  bold):	
  
	
  
sum1	
  =	
  4.	
  
sumx	
  =	
  183.	
  +	
  184.	
  +	
  188.	
  +	
  189.	
  
sumxx	
  =	
  183.**2	
  +	
  184.**2	
  +	
  188.**2	
  +	
  189.**2	
  
sumy	
  =	
  B183	
  +	
  B184	
  +	
  B188	
  +	
  B189	
  
sumxy	
  =	
  183.*B183	
  +	
  184.*B184	
  +	
  188.*B188	
  +	
  189.*B189	
  
	
  
delta	
  =	
  sum1*sumxx	
  -­‐	
  sumx**2	
  
constant	
  =	
  (sumxx*sumy	
  -­‐	
  sumx*sumxy)	
  /	
  delta	
  
linear	
  =	
  (sum1*sumxy	
  -­‐	
  sumx*sumy)	
  /	
  delta	
  
	
  
subtracted	
  =	
  (B185	
  -­‐	
  (constant	
  +	
  185.*linear))/2.	
  +	
  
	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  (B186	
  -­‐	
  (constant	
  +	
  186.*linear))/2.	
  
•  Icelandic	
  
volcano	
  in	
  
April	
  2010	
  
(Eyjatallajökull)	
  
•  Visible	
  frame	
  is	
  
full	
  of	
  ash	
  clouds	
  
•  CO2	
  distribuAon	
  is	
  
non-­‐uniform	
  
Algebraic	
  combinaAon	
  of	
  spectral	
  
bands	
  to	
  make	
  a	
  more	
  sensiAve	
  image	
  
•  Some	
  CO2	
  
	
  acAvity	
  follows	
  
	
  	
  visible	
  cloud	
  
	
  	
  	
  formaAons,	
  
	
  	
  	
  	
  some	
  doesn’t	
  
hLp://lvoc-­‐matsu.opensciencedatacloud.org/SimpleWMS/?
lat=63.7&lng=-­‐19.45&z=11&rgb=true&co2=true&flood=false&points=clust
ers	
  
•  Icelandic	
  
volcano	
  in	
  
April	
  2010	
  
(Eyjatallajökull)	
  
•  Visible	
  frame	
  is	
  
full	
  of	
  ash	
  clouds	
  
•  CO2	
  distribuAon	
  is	
  
non-­‐uniform	
  
QuesAons	
  
For	
  More	
  InformaAon	
  
•  Project	
  Matsu	
  is	
  managed	
  and	
  operated	
  by	
  the	
  Open	
  Cloud	
  ConsorAum	
  
(www.opencloudconsorAum.org).	
  
•  Project	
  Matsu	
  is	
  supported	
  in	
  part	
  by	
  grants	
  from	
  Gordon	
  and	
  BeLy	
  Moore	
  
FoundaAon	
  and	
  the	
  NaAonal	
  Science	
  FoundaAon	
  (Grants	
  OISE	
  -­‐	
  1129076	
  and	
  CISE	
  
1127316).	
  	
  
•  For	
  more	
  informaAon	
  about	
  Project	
  Matsu,	
  please	
  see	
  the	
  Project	
  Matsu	
  website:	
  
matsu.opensciencedatacloud.org	
  
•  The	
  Project	
  Director	
  is	
  Robert	
  Grossman,	
  who	
  can	
  be	
  reached	
  at	
  	
  
Here	
  is	
  some	
  detail	
  of	
  how	
  we	
  process	
  EO-­‐1	
  	
  satellite	
  
imagery	
  data	
  using	
  Hadoop	
  in	
  Project	
  Matsu…	
  
Step	
  1	
  –	
  Storage	
  &	
  Archiving	
  
From	
  Space	
  to	
  Goddard	
  to	
  the	
  OSDC	
  
1.  Transmit	
  data	
  from	
  NASA’s	
  EO-­‐1	
  Satellite	
  to	
  NASA	
  
ground	
  staAons	
  and	
  then	
  to	
  NASA	
  Goddard	
  
2.  At	
  Goddard,	
  align	
  data,	
  perform	
  radiometric	
  
correcAons	
  and	
  generate	
  Level	
  0	
  images	
  (16-­‐bit	
  
radiance	
  values)	
  
3.  Transmit	
  Level	
  0	
  data	
  from	
  NASA	
  Goddard	
  to	
  the	
  
OCC’s	
  Open	
  Science	
  Data	
  Cloud	
  (OSDC)	
  
4.  Store	
  images	
  in	
  a	
  distributed,	
  fault	
  tolerate,	
  file	
  
system	
  
Step	
  2	
  –	
  CreaAng	
  Level	
  1	
  Images	
  
Building	
  Level	
  1	
  Images	
  on	
  the	
  OSDC	
  
1.  Each	
  day,	
  the	
  new	
  Level	
  0	
  images	
  stored	
  on	
  the	
  
OSDC	
  are	
  processed	
  
2.  Within	
  the	
  OSDC,	
  NASA	
  launches	
  Virtual	
  
Machines	
  (VMs)	
  specifically	
  built	
  to	
  render	
  Level	
  
1	
  images	
  from	
  Level	
  0	
  data.	
  
–  Each	
  Level	
  1	
  band	
  is	
  saved	
  as	
  a	
  disAnct	
  image	
  
3.  Level	
  1	
  bands	
  are	
  wriLen	
  to	
  storage	
  facility	
  in	
  the	
  
OSCD	
  for	
  long-­‐term	
  public	
  access	
  
Step	
  3	
  –	
  Tiling	
  
Matsu	
  Processing	
  
1.  Build	
  Web	
  Mapping	
  Tile	
  Service	
  Tiles	
  from	
  Level	
  
1	
  images	
  using	
  MapReduce	
  
2.  Store	
  Ales	
  in	
  Accumulo	
  
•  Index	
  them	
  so	
  that	
  they	
  are	
  accessible	
  via	
  Web	
  
Mapping	
  Service	
  
3.  Run	
  AnalyAcs	
  on	
  Level	
  1	
  images	
  
•  Move	
  results	
  of	
  the	
  analyAcs	
  to	
  Accumulo	
  
Tiling	
  -­‐	
  Detail	
  
•  Use	
  MapReduce	
  to	
  build	
  Web	
  Tiles	
  
1.  Each	
  day,	
  the	
  Level	
  1	
  images	
  created	
  by	
  NASA	
  	
  
and	
  stored	
  on	
  the	
  OSDC	
  are	
  processed	
  
2.  The	
  Date	
  and	
  Bands	
  (to	
  create	
  a	
  visible	
  image)	
  
are	
  specified	
  
3.  Run	
  MapReduce	
  Job	
  
1.  Map	
  –	
  FILL-­‐IN	
  
2.  ParAAon	
  –	
  FILL-­‐IN	
  
3.  Reduce	
  –	
  FILL-­‐IN	
  
Tile	
  Details,	
  cont’d	
  
•  Images	
  are	
  handles	
  as	
  byte	
  streams	
  
•  Divide	
  (chunk)	
  the	
  Level	
  1	
  images	
  into	
  
manageable	
  sizes.	
  
•  Dyadic	
  decomposiAon	
  
–  Divide	
  each	
  image	
  into	
  4	
  equal	
  size	
  pieces	
  
–  For	
  each	
  addiAonal	
  zoom,	
  subdivide	
  each	
  piece	
  into	
  4	
  
equal	
  size	
  pieces	
  
•  Tag	
  each	
  chunked	
  images	
  with	
  the	
  bounding	
  box,	
  
date,	
  Ame,	
  dyadic	
  level	
  and	
  bands.	
  
•  Convert	
  the	
  bytes	
  into	
  PNG	
  files	
  
Processing	
  the	
  Data	
  
•  Reduce	
  
– Once	
  all	
  images	
  are	
  received	
  for	
  a	
  Bounding	
  Box,	
  
sort	
  by	
  the	
  most	
  granular	
  zoom	
  level	
  
– Process	
  that	
  Zoom	
  Level	
  
– Once	
  a	
  zoom	
  level	
  in	
  is	
  completed,	
  combine	
  
images	
  and	
  scale	
  the	
  build	
  the	
  next	
  zoom	
  level	
  
	
  
Z1	
  
Z1	
   Z1	
  
Z1	
  
Z2	
   Z2	
  
1.	
  Assemble	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  
	
  
	
  
	
  
	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  2.	
  Scale	
  	
  	
  	
  	
  
Accumulo	
  Storage	
  	
  
•  Images	
  are	
  stored	
  by	
  Bounding	
  Box	
  
– -­‐180.0_-­‐90.0_180.0_90.0	
  
•  Column	
  family	
  
– The	
  Ale	
  style,	
  zoom,	
  and	
  projecAon	
  
•  Column	
  qualifier	
  	
  
– Dimensions	
  (width	
  and	
  height,	
  512	
  x	
  256)	
  
•  Value	
  	
  
– the	
  corresponding	
  PNG	
  image	
  in	
  raw	
  bytes	
  
Serve	
  to	
  WMTS	
  
•  The	
  WMTS	
  query:	
  
–  Bounding	
  Box	
  
–  Date	
  
–  Layer	
  name	
  as	
  a	
  string	
  
•  HaiA	
  
–  Style	
  name	
  as	
  a	
  string	
  
•  The	
  bands	
  used	
  to	
  build	
  the	
  Level	
  1	
  image	
  or	
  an	
  alias:	
  
“B058:B023:B015”	
  or	
  “agricultural”	
  
•  Not	
  supported	
  
–  Map	
  Project	
  could	
  be	
  used,	
  but	
  for	
  now,	
  we	
  only	
  
support	
  a	
  single	
  projecAon	
  
Images:	
  stages	
  of	
  processing	
  
•  Satellite	
  track	
  images	
  (L1R)	
  are	
  rotated	
  and	
  
geolocated	
  (L1G)	
  by	
  NASA	
  
•  We	
  overlay	
  L1G	
  images	
  into	
  Level-­‐2	
  dyadic	
  Ales	
  
using	
  Map-­‐Reduce	
  
image	
  locaAons	
  
(viewed	
  in	
  
Google	
  Maps)	
  
L1R	
   L1G	
   Level-­‐2	
  Ales	
  made	
  in	
  Map-­‐Reduce,	
  prepared	
  for	
  WMS	
  
T06-­‐00097-­‐00092	
  
T10-­‐01561-­‐01486	
  

More Related Content

What's hot

Bioclouds CAMDA (Robert Grossman) 09-v9p
Bioclouds CAMDA (Robert Grossman) 09-v9pBioclouds CAMDA (Robert Grossman) 09-v9p
Bioclouds CAMDA (Robert Grossman) 09-v9pRobert Grossman
 
Processing Big Data (Chapter 3, SC 11 Tutorial)
Processing Big Data (Chapter 3, SC 11 Tutorial)Processing Big Data (Chapter 3, SC 11 Tutorial)
Processing Big Data (Chapter 3, SC 11 Tutorial)Robert Grossman
 
OCC Overview OMG Clouds Meeting 07-13-09 v3
OCC Overview OMG Clouds Meeting 07-13-09 v3OCC Overview OMG Clouds Meeting 07-13-09 v3
OCC Overview OMG Clouds Meeting 07-13-09 v3Robert Grossman
 
Project Matsu: Elastic Clouds for Disaster Relief
Project Matsu: Elastic Clouds for Disaster ReliefProject Matsu: Elastic Clouds for Disaster Relief
Project Matsu: Elastic Clouds for Disaster ReliefRobert Grossman
 
Large Scale On-Demand Image Processing For Disaster Relief
Large Scale On-Demand Image Processing For Disaster ReliefLarge Scale On-Demand Image Processing For Disaster Relief
Large Scale On-Demand Image Processing For Disaster ReliefRobert Grossman
 
Open Science Data Cloud - CCA 11
Open Science Data Cloud - CCA 11Open Science Data Cloud - CCA 11
Open Science Data Cloud - CCA 11Robert Grossman
 
Open Science Data Cloud (IEEE Cloud 2011)
Open Science Data Cloud (IEEE Cloud 2011)Open Science Data Cloud (IEEE Cloud 2011)
Open Science Data Cloud (IEEE Cloud 2011)Robert Grossman
 
Big Data, The Community and The Commons (May 12, 2014)
Big Data, The Community and The Commons (May 12, 2014)Big Data, The Community and The Commons (May 12, 2014)
Big Data, The Community and The Commons (May 12, 2014)Robert Grossman
 
My Other Computer is a Data Center: The Sector Perspective on Big Data
My Other Computer is a Data Center: The Sector Perspective on Big DataMy Other Computer is a Data Center: The Sector Perspective on Big Data
My Other Computer is a Data Center: The Sector Perspective on Big DataRobert Grossman
 
Accelerating the Experimental Feedback Loop: Data Streams and the Advanced Ph...
Accelerating the Experimental Feedback Loop: Data Streams and the Advanced Ph...Accelerating the Experimental Feedback Loop: Data Streams and the Advanced Ph...
Accelerating the Experimental Feedback Loop: Data Streams and the Advanced Ph...Ian Foster
 
Next Generation Grid: Integrating Parallel and Distributed Computing Runtimes...
Next Generation Grid: Integrating Parallel and Distributed Computing Runtimes...Next Generation Grid: Integrating Parallel and Distributed Computing Runtimes...
Next Generation Grid: Integrating Parallel and Distributed Computing Runtimes...Geoffrey Fox
 
An Overview of Bionimbus (March 2010)
An Overview of Bionimbus (March 2010)An Overview of Bionimbus (March 2010)
An Overview of Bionimbus (March 2010)Robert Grossman
 
Materials Data Facility: Streamlined and automated data sharing, discovery, ...
Materials Data Facility: Streamlined and automated data sharing,  discovery, ...Materials Data Facility: Streamlined and automated data sharing,  discovery, ...
Materials Data Facility: Streamlined and automated data sharing, discovery, ...Ian Foster
 
Open Science Data Cloud (June 21, 2010)
Open Science Data Cloud (June 21, 2010)Open Science Data Cloud (June 21, 2010)
Open Science Data Cloud (June 21, 2010)Robert Grossman
 
Evolving Storage and Cyber Infrastructure at the NASA Center for Climate Simu...
Evolving Storage and Cyber Infrastructure at the NASA Center for Climate Simu...Evolving Storage and Cyber Infrastructure at the NASA Center for Climate Simu...
Evolving Storage and Cyber Infrastructure at the NASA Center for Climate Simu...inside-BigData.com
 
STAC, ZARR, COG, K8S and Data Cubes: The brave new world of satellite EO anal...
STAC, ZARR, COG, K8S and Data Cubes: The brave new world of satellite EO anal...STAC, ZARR, COG, K8S and Data Cubes: The brave new world of satellite EO anal...
STAC, ZARR, COG, K8S and Data Cubes: The brave new world of satellite EO anal...GEO Analytics Canada
 
AWS Public Sector Symposium 2014 Canberra | Big Data in the Cloud: Accelerati...
AWS Public Sector Symposium 2014 Canberra | Big Data in the Cloud: Accelerati...AWS Public Sector Symposium 2014 Canberra | Big Data in the Cloud: Accelerati...
AWS Public Sector Symposium 2014 Canberra | Big Data in the Cloud: Accelerati...Amazon Web Services
 
Lessons Learned from a Year's Worth of Benchmarking Large Data Clouds (Robert...
Lessons Learned from a Year's Worth of Benchmarking Large Data Clouds (Robert...Lessons Learned from a Year's Worth of Benchmarking Large Data Clouds (Robert...
Lessons Learned from a Year's Worth of Benchmarking Large Data Clouds (Robert...Robert Grossman
 
Bionimbus Cambridge Workshop (3-28-11, v7)
Bionimbus Cambridge Workshop (3-28-11, v7)Bionimbus Cambridge Workshop (3-28-11, v7)
Bionimbus Cambridge Workshop (3-28-11, v7)Robert Grossman
 
Coding the Continuum
Coding the ContinuumCoding the Continuum
Coding the ContinuumIan Foster
 

What's hot (20)

Bioclouds CAMDA (Robert Grossman) 09-v9p
Bioclouds CAMDA (Robert Grossman) 09-v9pBioclouds CAMDA (Robert Grossman) 09-v9p
Bioclouds CAMDA (Robert Grossman) 09-v9p
 
Processing Big Data (Chapter 3, SC 11 Tutorial)
Processing Big Data (Chapter 3, SC 11 Tutorial)Processing Big Data (Chapter 3, SC 11 Tutorial)
Processing Big Data (Chapter 3, SC 11 Tutorial)
 
OCC Overview OMG Clouds Meeting 07-13-09 v3
OCC Overview OMG Clouds Meeting 07-13-09 v3OCC Overview OMG Clouds Meeting 07-13-09 v3
OCC Overview OMG Clouds Meeting 07-13-09 v3
 
Project Matsu: Elastic Clouds for Disaster Relief
Project Matsu: Elastic Clouds for Disaster ReliefProject Matsu: Elastic Clouds for Disaster Relief
Project Matsu: Elastic Clouds for Disaster Relief
 
Large Scale On-Demand Image Processing For Disaster Relief
Large Scale On-Demand Image Processing For Disaster ReliefLarge Scale On-Demand Image Processing For Disaster Relief
Large Scale On-Demand Image Processing For Disaster Relief
 
Open Science Data Cloud - CCA 11
Open Science Data Cloud - CCA 11Open Science Data Cloud - CCA 11
Open Science Data Cloud - CCA 11
 
Open Science Data Cloud (IEEE Cloud 2011)
Open Science Data Cloud (IEEE Cloud 2011)Open Science Data Cloud (IEEE Cloud 2011)
Open Science Data Cloud (IEEE Cloud 2011)
 
Big Data, The Community and The Commons (May 12, 2014)
Big Data, The Community and The Commons (May 12, 2014)Big Data, The Community and The Commons (May 12, 2014)
Big Data, The Community and The Commons (May 12, 2014)
 
My Other Computer is a Data Center: The Sector Perspective on Big Data
My Other Computer is a Data Center: The Sector Perspective on Big DataMy Other Computer is a Data Center: The Sector Perspective on Big Data
My Other Computer is a Data Center: The Sector Perspective on Big Data
 
Accelerating the Experimental Feedback Loop: Data Streams and the Advanced Ph...
Accelerating the Experimental Feedback Loop: Data Streams and the Advanced Ph...Accelerating the Experimental Feedback Loop: Data Streams and the Advanced Ph...
Accelerating the Experimental Feedback Loop: Data Streams and the Advanced Ph...
 
Next Generation Grid: Integrating Parallel and Distributed Computing Runtimes...
Next Generation Grid: Integrating Parallel and Distributed Computing Runtimes...Next Generation Grid: Integrating Parallel and Distributed Computing Runtimes...
Next Generation Grid: Integrating Parallel and Distributed Computing Runtimes...
 
An Overview of Bionimbus (March 2010)
An Overview of Bionimbus (March 2010)An Overview of Bionimbus (March 2010)
An Overview of Bionimbus (March 2010)
 
Materials Data Facility: Streamlined and automated data sharing, discovery, ...
Materials Data Facility: Streamlined and automated data sharing,  discovery, ...Materials Data Facility: Streamlined and automated data sharing,  discovery, ...
Materials Data Facility: Streamlined and automated data sharing, discovery, ...
 
Open Science Data Cloud (June 21, 2010)
Open Science Data Cloud (June 21, 2010)Open Science Data Cloud (June 21, 2010)
Open Science Data Cloud (June 21, 2010)
 
Evolving Storage and Cyber Infrastructure at the NASA Center for Climate Simu...
Evolving Storage and Cyber Infrastructure at the NASA Center for Climate Simu...Evolving Storage and Cyber Infrastructure at the NASA Center for Climate Simu...
Evolving Storage and Cyber Infrastructure at the NASA Center for Climate Simu...
 
STAC, ZARR, COG, K8S and Data Cubes: The brave new world of satellite EO anal...
STAC, ZARR, COG, K8S and Data Cubes: The brave new world of satellite EO anal...STAC, ZARR, COG, K8S and Data Cubes: The brave new world of satellite EO anal...
STAC, ZARR, COG, K8S and Data Cubes: The brave new world of satellite EO anal...
 
AWS Public Sector Symposium 2014 Canberra | Big Data in the Cloud: Accelerati...
AWS Public Sector Symposium 2014 Canberra | Big Data in the Cloud: Accelerati...AWS Public Sector Symposium 2014 Canberra | Big Data in the Cloud: Accelerati...
AWS Public Sector Symposium 2014 Canberra | Big Data in the Cloud: Accelerati...
 
Lessons Learned from a Year's Worth of Benchmarking Large Data Clouds (Robert...
Lessons Learned from a Year's Worth of Benchmarking Large Data Clouds (Robert...Lessons Learned from a Year's Worth of Benchmarking Large Data Clouds (Robert...
Lessons Learned from a Year's Worth of Benchmarking Large Data Clouds (Robert...
 
Bionimbus Cambridge Workshop (3-28-11, v7)
Bionimbus Cambridge Workshop (3-28-11, v7)Bionimbus Cambridge Workshop (3-28-11, v7)
Bionimbus Cambridge Workshop (3-28-11, v7)
 
Coding the Continuum
Coding the ContinuumCoding the Continuum
Coding the Continuum
 

Viewers also liked

Big Data - Lab A1 (SC 11 Tutorial)
Big Data - Lab A1 (SC 11 Tutorial)Big Data - Lab A1 (SC 11 Tutorial)
Big Data - Lab A1 (SC 11 Tutorial)Robert Grossman
 
Bionimbus: Towards One Million Genomes (XLDB 2012 Lecture)
Bionimbus: Towards One Million Genomes (XLDB 2012 Lecture)Bionimbus: Towards One Million Genomes (XLDB 2012 Lecture)
Bionimbus: Towards One Million Genomes (XLDB 2012 Lecture)Robert Grossman
 
Clouds and Commons for the Data Intensive Science Community (June 8, 2015)
Clouds and Commons for the Data Intensive Science Community (June 8, 2015)Clouds and Commons for the Data Intensive Science Community (June 8, 2015)
Clouds and Commons for the Data Intensive Science Community (June 8, 2015)Robert Grossman
 
What is a Data Commons and Why Should You Care?
What is a Data Commons and Why Should You Care? What is a Data Commons and Why Should You Care?
What is a Data Commons and Why Should You Care? Robert Grossman
 
Practical Methods for Identifying Anomalies That Matter in Large Datasets
Practical Methods for Identifying Anomalies That Matter in Large DatasetsPractical Methods for Identifying Anomalies That Matter in Large Datasets
Practical Methods for Identifying Anomalies That Matter in Large DatasetsRobert Grossman
 
AnalyticOps: Lessons Learned Moving Machine-Learning Algorithms to Production...
AnalyticOps: Lessons Learned Moving Machine-Learning Algorithms to Production...AnalyticOps: Lessons Learned Moving Machine-Learning Algorithms to Production...
AnalyticOps: Lessons Learned Moving Machine-Learning Algorithms to Production...Robert Grossman
 
Adversarial Analytics - 2013 Strata & Hadoop World Talk
Adversarial Analytics - 2013 Strata & Hadoop World TalkAdversarial Analytics - 2013 Strata & Hadoop World Talk
Adversarial Analytics - 2013 Strata & Hadoop World TalkRobert Grossman
 
AnalyticOps - Chicago PAW 2016
AnalyticOps - Chicago PAW 2016AnalyticOps - Chicago PAW 2016
AnalyticOps - Chicago PAW 2016Robert Grossman
 
Biomedical Clusters, Clouds and Commons - DePaul Colloquium Oct 24, 2014
Biomedical Clusters, Clouds and Commons - DePaul Colloquium Oct 24, 2014Biomedical Clusters, Clouds and Commons - DePaul Colloquium Oct 24, 2014
Biomedical Clusters, Clouds and Commons - DePaul Colloquium Oct 24, 2014Robert Grossman
 
How to Lower the Cost of Deploying Analytics: An Introduction to the Portable...
How to Lower the Cost of Deploying Analytics: An Introduction to the Portable...How to Lower the Cost of Deploying Analytics: An Introduction to the Portable...
How to Lower the Cost of Deploying Analytics: An Introduction to the Portable...Robert Grossman
 

Viewers also liked (10)

Big Data - Lab A1 (SC 11 Tutorial)
Big Data - Lab A1 (SC 11 Tutorial)Big Data - Lab A1 (SC 11 Tutorial)
Big Data - Lab A1 (SC 11 Tutorial)
 
Bionimbus: Towards One Million Genomes (XLDB 2012 Lecture)
Bionimbus: Towards One Million Genomes (XLDB 2012 Lecture)Bionimbus: Towards One Million Genomes (XLDB 2012 Lecture)
Bionimbus: Towards One Million Genomes (XLDB 2012 Lecture)
 
Clouds and Commons for the Data Intensive Science Community (June 8, 2015)
Clouds and Commons for the Data Intensive Science Community (June 8, 2015)Clouds and Commons for the Data Intensive Science Community (June 8, 2015)
Clouds and Commons for the Data Intensive Science Community (June 8, 2015)
 
What is a Data Commons and Why Should You Care?
What is a Data Commons and Why Should You Care? What is a Data Commons and Why Should You Care?
What is a Data Commons and Why Should You Care?
 
Practical Methods for Identifying Anomalies That Matter in Large Datasets
Practical Methods for Identifying Anomalies That Matter in Large DatasetsPractical Methods for Identifying Anomalies That Matter in Large Datasets
Practical Methods for Identifying Anomalies That Matter in Large Datasets
 
AnalyticOps: Lessons Learned Moving Machine-Learning Algorithms to Production...
AnalyticOps: Lessons Learned Moving Machine-Learning Algorithms to Production...AnalyticOps: Lessons Learned Moving Machine-Learning Algorithms to Production...
AnalyticOps: Lessons Learned Moving Machine-Learning Algorithms to Production...
 
Adversarial Analytics - 2013 Strata & Hadoop World Talk
Adversarial Analytics - 2013 Strata & Hadoop World TalkAdversarial Analytics - 2013 Strata & Hadoop World Talk
Adversarial Analytics - 2013 Strata & Hadoop World Talk
 
AnalyticOps - Chicago PAW 2016
AnalyticOps - Chicago PAW 2016AnalyticOps - Chicago PAW 2016
AnalyticOps - Chicago PAW 2016
 
Biomedical Clusters, Clouds and Commons - DePaul Colloquium Oct 24, 2014
Biomedical Clusters, Clouds and Commons - DePaul Colloquium Oct 24, 2014Biomedical Clusters, Clouds and Commons - DePaul Colloquium Oct 24, 2014
Biomedical Clusters, Clouds and Commons - DePaul Colloquium Oct 24, 2014
 
How to Lower the Cost of Deploying Analytics: An Introduction to the Portable...
How to Lower the Cost of Deploying Analytics: An Introduction to the Portable...How to Lower the Cost of Deploying Analytics: An Introduction to the Portable...
How to Lower the Cost of Deploying Analytics: An Introduction to the Portable...
 

Similar to The Matsu Project - Open Source Software for Processing Satellite Imagery Data

Watershed Delineation Using ArcMap
Watershed Delineation Using ArcMapWatershed Delineation Using ArcMap
Watershed Delineation Using ArcMapArthur Green
 
Watershed Delineation in ArcGIS
Watershed Delineation in ArcGISWatershed Delineation in ArcGIS
Watershed Delineation in ArcGISArthur Green
 
Data Processing Using THEOS Satellite Imagery for Disaster Monitoring (Case S...
Data Processing Using THEOS Satellite Imagery for Disaster Monitoring (Case S...Data Processing Using THEOS Satellite Imagery for Disaster Monitoring (Case S...
Data Processing Using THEOS Satellite Imagery for Disaster Monitoring (Case S...NopphawanTamkuan
 
Using Deep Learning to Derive 3D Cities from Satellite Imagery
Using Deep Learning to Derive 3D Cities from Satellite ImageryUsing Deep Learning to Derive 3D Cities from Satellite Imagery
Using Deep Learning to Derive 3D Cities from Satellite ImageryAstraea, Inc.
 
Frossie Economou & Angelo Fausti [Vera C. Rubin Observatory] | How InfluxDB H...
Frossie Economou & Angelo Fausti [Vera C. Rubin Observatory] | How InfluxDB H...Frossie Economou & Angelo Fausti [Vera C. Rubin Observatory] | How InfluxDB H...
Frossie Economou & Angelo Fausti [Vera C. Rubin Observatory] | How InfluxDB H...InfluxData
 
AI models for Ice Classification - ExtremeEarth Open Workshop
AI models for Ice Classification - ExtremeEarth Open WorkshopAI models for Ice Classification - ExtremeEarth Open Workshop
AI models for Ice Classification - ExtremeEarth Open WorkshopExtremeEarth
 
Unreal Summit 2016 Seoul Lighting the Planetary World of Project A1
Unreal Summit 2016 Seoul Lighting the Planetary World of Project A1Unreal Summit 2016 Seoul Lighting the Planetary World of Project A1
Unreal Summit 2016 Seoul Lighting the Planetary World of Project A1Ki Hyunwoo
 
From Report to Geographic Study
From Report to Geographic StudyFrom Report to Geographic Study
From Report to Geographic StudyCOGS Presentations
 
Data Processing Using DubaiSat Satellite Imagery for Disaster Monitoring (Cas...
Data Processing Using DubaiSat Satellite Imagery for Disaster Monitoring (Cas...Data Processing Using DubaiSat Satellite Imagery for Disaster Monitoring (Cas...
Data Processing Using DubaiSat Satellite Imagery for Disaster Monitoring (Cas...NopphawanTamkuan
 
Burst data retrieval after 50k GPU Cloud run
Burst data retrieval after 50k GPU Cloud runBurst data retrieval after 50k GPU Cloud run
Burst data retrieval after 50k GPU Cloud runIgor Sfiligoi
 
FV_IGARSS11.ppt
FV_IGARSS11.pptFV_IGARSS11.ppt
FV_IGARSS11.pptgrssieee
 
FV_IGARSS11.ppt
FV_IGARSS11.pptFV_IGARSS11.ppt
FV_IGARSS11.pptgrssieee
 
FV_IGARSS11.ppt
FV_IGARSS11.pptFV_IGARSS11.ppt
FV_IGARSS11.pptgrssieee
 
FV_IGARSS11.ppt
FV_IGARSS11.pptFV_IGARSS11.ppt
FV_IGARSS11.pptgrssieee
 
Enterprise Scale Topological Data Analysis Using Spark
Enterprise Scale Topological Data Analysis Using SparkEnterprise Scale Topological Data Analysis Using Spark
Enterprise Scale Topological Data Analysis Using SparkAlpine Data
 
Enterprise Scale Topological Data Analysis Using Spark
Enterprise Scale Topological Data Analysis Using SparkEnterprise Scale Topological Data Analysis Using Spark
Enterprise Scale Topological Data Analysis Using SparkSpark Summit
 
HP - Jerome Rolia - Hadoop World 2010
HP - Jerome Rolia - Hadoop World 2010HP - Jerome Rolia - Hadoop World 2010
HP - Jerome Rolia - Hadoop World 2010Cloudera, Inc.
 

Similar to The Matsu Project - Open Source Software for Processing Satellite Imagery Data (20)

Watershed Delineation Using ArcMap
Watershed Delineation Using ArcMapWatershed Delineation Using ArcMap
Watershed Delineation Using ArcMap
 
Watershed Delineation in ArcGIS
Watershed Delineation in ArcGISWatershed Delineation in ArcGIS
Watershed Delineation in ArcGIS
 
Data Processing Using THEOS Satellite Imagery for Disaster Monitoring (Case S...
Data Processing Using THEOS Satellite Imagery for Disaster Monitoring (Case S...Data Processing Using THEOS Satellite Imagery for Disaster Monitoring (Case S...
Data Processing Using THEOS Satellite Imagery for Disaster Monitoring (Case S...
 
Project Matsu
Project MatsuProject Matsu
Project Matsu
 
Using Deep Learning to Derive 3D Cities from Satellite Imagery
Using Deep Learning to Derive 3D Cities from Satellite ImageryUsing Deep Learning to Derive 3D Cities from Satellite Imagery
Using Deep Learning to Derive 3D Cities from Satellite Imagery
 
Frossie Economou & Angelo Fausti [Vera C. Rubin Observatory] | How InfluxDB H...
Frossie Economou & Angelo Fausti [Vera C. Rubin Observatory] | How InfluxDB H...Frossie Economou & Angelo Fausti [Vera C. Rubin Observatory] | How InfluxDB H...
Frossie Economou & Angelo Fausti [Vera C. Rubin Observatory] | How InfluxDB H...
 
AI models for Ice Classification - ExtremeEarth Open Workshop
AI models for Ice Classification - ExtremeEarth Open WorkshopAI models for Ice Classification - ExtremeEarth Open Workshop
AI models for Ice Classification - ExtremeEarth Open Workshop
 
Unreal Summit 2016 Seoul Lighting the Planetary World of Project A1
Unreal Summit 2016 Seoul Lighting the Planetary World of Project A1Unreal Summit 2016 Seoul Lighting the Planetary World of Project A1
Unreal Summit 2016 Seoul Lighting the Planetary World of Project A1
 
From Report to Geographic Study
From Report to Geographic StudyFrom Report to Geographic Study
From Report to Geographic Study
 
Data Processing Using DubaiSat Satellite Imagery for Disaster Monitoring (Cas...
Data Processing Using DubaiSat Satellite Imagery for Disaster Monitoring (Cas...Data Processing Using DubaiSat Satellite Imagery for Disaster Monitoring (Cas...
Data Processing Using DubaiSat Satellite Imagery for Disaster Monitoring (Cas...
 
Burst data retrieval after 50k GPU Cloud run
Burst data retrieval after 50k GPU Cloud runBurst data retrieval after 50k GPU Cloud run
Burst data retrieval after 50k GPU Cloud run
 
FV_IGARSS11.ppt
FV_IGARSS11.pptFV_IGARSS11.ppt
FV_IGARSS11.ppt
 
FV_IGARSS11.ppt
FV_IGARSS11.pptFV_IGARSS11.ppt
FV_IGARSS11.ppt
 
FV_IGARSS11.ppt
FV_IGARSS11.pptFV_IGARSS11.ppt
FV_IGARSS11.ppt
 
FV_IGARSS11.ppt
FV_IGARSS11.pptFV_IGARSS11.ppt
FV_IGARSS11.ppt
 
CLIM Program: Remote Sensing Workshop, Optimization Methods in Remote Sensing...
CLIM Program: Remote Sensing Workshop, Optimization Methods in Remote Sensing...CLIM Program: Remote Sensing Workshop, Optimization Methods in Remote Sensing...
CLIM Program: Remote Sensing Workshop, Optimization Methods in Remote Sensing...
 
CLIM: Transition Workshop - Optimization Methods in Remote Sensing - Jessica...
CLIM: Transition Workshop - Optimization Methods in Remote Sensing  - Jessica...CLIM: Transition Workshop - Optimization Methods in Remote Sensing  - Jessica...
CLIM: Transition Workshop - Optimization Methods in Remote Sensing - Jessica...
 
Enterprise Scale Topological Data Analysis Using Spark
Enterprise Scale Topological Data Analysis Using SparkEnterprise Scale Topological Data Analysis Using Spark
Enterprise Scale Topological Data Analysis Using Spark
 
Enterprise Scale Topological Data Analysis Using Spark
Enterprise Scale Topological Data Analysis Using SparkEnterprise Scale Topological Data Analysis Using Spark
Enterprise Scale Topological Data Analysis Using Spark
 
HP - Jerome Rolia - Hadoop World 2010
HP - Jerome Rolia - Hadoop World 2010HP - Jerome Rolia - Hadoop World 2010
HP - Jerome Rolia - Hadoop World 2010
 

More from Robert Grossman

Some Frameworks for Improving Analytic Operations at Your Company
Some Frameworks for Improving Analytic Operations at Your CompanySome Frameworks for Improving Analytic Operations at Your Company
Some Frameworks for Improving Analytic Operations at Your CompanyRobert Grossman
 
Some Proposed Principles for Interoperating Cloud Based Data Platforms
Some Proposed Principles for Interoperating Cloud Based Data PlatformsSome Proposed Principles for Interoperating Cloud Based Data Platforms
Some Proposed Principles for Interoperating Cloud Based Data PlatformsRobert Grossman
 
A Gen3 Perspective of Disparate Data
A Gen3 Perspective of Disparate DataA Gen3 Perspective of Disparate Data
A Gen3 Perspective of Disparate DataRobert Grossman
 
Crossing the Analytics Chasm and Getting the Models You Developed Deployed
Crossing the Analytics Chasm and Getting the Models You Developed DeployedCrossing the Analytics Chasm and Getting the Models You Developed Deployed
Crossing the Analytics Chasm and Getting the Models You Developed DeployedRobert Grossman
 
A Data Biosphere for Biomedical Research
A Data Biosphere for Biomedical ResearchA Data Biosphere for Biomedical Research
A Data Biosphere for Biomedical ResearchRobert Grossman
 
What is Data Commons and How Can Your Organization Build One?
What is Data Commons and How Can Your Organization Build One?What is Data Commons and How Can Your Organization Build One?
What is Data Commons and How Can Your Organization Build One?Robert Grossman
 
How Data Commons are Changing the Way that Large Datasets Are Analyzed and Sh...
How Data Commons are Changing the Way that Large Datasets Are Analyzed and Sh...How Data Commons are Changing the Way that Large Datasets Are Analyzed and Sh...
How Data Commons are Changing the Way that Large Datasets Are Analyzed and Sh...Robert Grossman
 
How Data Commons are Changing the Way that Large Datasets Are Analyzed and Sh...
How Data Commons are Changing the Way that Large Datasets Are Analyzed and Sh...How Data Commons are Changing the Way that Large Datasets Are Analyzed and Sh...
How Data Commons are Changing the Way that Large Datasets Are Analyzed and Sh...Robert Grossman
 
Bionimbus - Northwestern CGI Workshop 4-21-2011
Bionimbus - Northwestern CGI Workshop 4-21-2011Bionimbus - Northwestern CGI Workshop 4-21-2011
Bionimbus - Northwestern CGI Workshop 4-21-2011Robert Grossman
 

More from Robert Grossman (9)

Some Frameworks for Improving Analytic Operations at Your Company
Some Frameworks for Improving Analytic Operations at Your CompanySome Frameworks for Improving Analytic Operations at Your Company
Some Frameworks for Improving Analytic Operations at Your Company
 
Some Proposed Principles for Interoperating Cloud Based Data Platforms
Some Proposed Principles for Interoperating Cloud Based Data PlatformsSome Proposed Principles for Interoperating Cloud Based Data Platforms
Some Proposed Principles for Interoperating Cloud Based Data Platforms
 
A Gen3 Perspective of Disparate Data
A Gen3 Perspective of Disparate DataA Gen3 Perspective of Disparate Data
A Gen3 Perspective of Disparate Data
 
Crossing the Analytics Chasm and Getting the Models You Developed Deployed
Crossing the Analytics Chasm and Getting the Models You Developed DeployedCrossing the Analytics Chasm and Getting the Models You Developed Deployed
Crossing the Analytics Chasm and Getting the Models You Developed Deployed
 
A Data Biosphere for Biomedical Research
A Data Biosphere for Biomedical ResearchA Data Biosphere for Biomedical Research
A Data Biosphere for Biomedical Research
 
What is Data Commons and How Can Your Organization Build One?
What is Data Commons and How Can Your Organization Build One?What is Data Commons and How Can Your Organization Build One?
What is Data Commons and How Can Your Organization Build One?
 
How Data Commons are Changing the Way that Large Datasets Are Analyzed and Sh...
How Data Commons are Changing the Way that Large Datasets Are Analyzed and Sh...How Data Commons are Changing the Way that Large Datasets Are Analyzed and Sh...
How Data Commons are Changing the Way that Large Datasets Are Analyzed and Sh...
 
How Data Commons are Changing the Way that Large Datasets Are Analyzed and Sh...
How Data Commons are Changing the Way that Large Datasets Are Analyzed and Sh...How Data Commons are Changing the Way that Large Datasets Are Analyzed and Sh...
How Data Commons are Changing the Way that Large Datasets Are Analyzed and Sh...
 
Bionimbus - Northwestern CGI Workshop 4-21-2011
Bionimbus - Northwestern CGI Workshop 4-21-2011Bionimbus - Northwestern CGI Workshop 4-21-2011
Bionimbus - Northwestern CGI Workshop 4-21-2011
 

Recently uploaded

Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024BookNet Canada
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsMiki Katsuragi
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksSoftradix Technologies
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
APIForce Zurich 5 April Automation LPDG
APIForce Zurich 5 April  Automation LPDGAPIForce Zurich 5 April  Automation LPDG
APIForce Zurich 5 April Automation LPDGMarianaLemus7
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 

Recently uploaded (20)

Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort ServiceHot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering Tips
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other Frameworks
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
APIForce Zurich 5 April Automation LPDG
APIForce Zurich 5 April  Automation LPDGAPIForce Zurich 5 April  Automation LPDG
APIForce Zurich 5 April Automation LPDG
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 

The Matsu Project - Open Source Software for Processing Satellite Imagery Data

  • 1. The  Matsu  Project   Robert  L.  Grossman   University  of  Chicago   Open  Cloud  ConsorAum   June  18,  2013  
  • 2. The  Matsu  Project  represents  work  by  Collin   BenneL,  Robert  L.  Grossman,    MaLhew  Handy,   Vuong  Ly,  Dan  Mandl,  Ryan  Miller,  Jim  Pivarski,   Ray  Powell  and  Steve  Vejcik.    
  • 3. What  is  the  Matsu  Project?   Matsu  is  an  open  source   project  for  processing  satellite   imagery  to  support  earth   sciences  researchers  using  a   community  science  cloud.   Matsu  is  a  joint  project   between  the  Open  Cloud   ConsorAum  and  NASA’s  EO-­‐1   Mission  (Dan  Mandl,  Lead)  
  • 5. EO-­‐1  mission   •  Approved  in  March   1996  and  launched  on   November  21,  2000   from  Vandenburg  Air   Force  Base,  California   on  a  Delta  7320     •  All  technologies  were   flight-­‐validated  by   December  2001   •  EO-­‐1  is  now  in  an   Extended  Mission  
  • 6. EO-­‐1’s  ALI  and  Hyperion  Instruments  
  • 7. Data  -­‐  Instruments     •  Hyperion  Imaging  Spectrometer   – Designed  to  gather  data  from  a  given  region  on   the  Earth  by  viewing  the  surface  in  terms  of  242   disAnct  'bands'  of  light.   •  Advanced  Land  Imager  (ALI)   – Used  to  validate  and  demonstrate  technology  for   the  Landsat  Data  ConAnuity  Mission  (LDCM)  
  • 8. All  available  L1G  images  (2010-­‐now)  
  • 9. 1.  Open  Science  Data   Cloud  (OSDC)  stores   Level  0  data  from  EO-­‐1   and  uses  an  OpenStack-­‐ based  cloud  to  create   Level  1  data.   2.  OSDC  also   provides  OpenStack   resources  for  the   Nambia  Flood   Dashboard   developed  by  Dan   Mandl’s  team.   3.  Project  Matsu  uses   a  Hadoop/Accumulo   system  to  run   analyAcs  nightly  and   to  create  Ales  with   OGC-­‐compliant   WMTS.  
  • 11. OSDC  Satellites   •  EO-­‐1  (2012)   •  Landsat7  –  GLS  2000    (2013)   •  MODIS  (2013)     •  TBD  (2014)   •  TBD  (2015)  
  • 12. Matsu  Web  Map  Tile  Service  
  • 13. It  is  easy  to  layer  analyAcs  over  the  Web  Map  Tile   Service  (WMTS).    Here  is  one  idenAfying  CO2  
  • 14. Matsu  Hadoop  Architecture   Hadoop  HDFS   Matsu  Web  Map  Tile   Service   Matsu  MR-­‐based   Tiling  Service   NoSQL   Database(Accumulo)   Images  at  different  zoom  layers   suitable  for  OGC  Web  Mapping  Server   Level  0,  Level  1  and  Level  2   images   MapReduce  used  to  process  Level  n  to  Level  n+1   data  and  to  parAAon  images  for  different  zoom   levels   NoSQL-­‐based   AnalyAc  Services   Streaming  AnalyAc   Services   MR-­‐based  AnalyAc   Services   AnalyAc  Services   Storage  for  WMTS  Ales  and   derived  data  products   PresentaAon  Services   Web  Coverage   Processing  Service   (WCPS)   Workflow  Services  
  • 15. Zoom  Levels   Zoom  Level  1:  4  images   Zoom  Level  2:  16  images   Zoom  Level  3:  64  images   Zoom  Level  4:  256  images  
  • 16. Mapper  Input  Key:  Bounding  Box   Mapper  Input  Value:   Mapper  Output  Key:  Bounding  Box   Mapper  Output  Value:   Mapper  resizes  and/or  cuts  up  the  original   image  into  pieces  to  output  Bounding  Boxes   (minx  =  -­‐135.0  miny  =  45.0  maxx  =  -­‐112.5  maxy  =  67.5)   Step  1:  Input  to  Mapper   Step  2:  Processing  in  Mapper   Step  3:  Mapper  Output   Mapper  Output  Key:  Bounding  Box   Mapper  Output  Value:   Mapper  Output  Key:  Bounding  Box   Mapper  Output  Value:   Mapper  Output  Key:  Bounding  Box   Mapper  Output  Value:   Mapper  Output  Key:  Bounding  Box   Mapper  Output  Value:   Mapper  Output  Key:  Bounding  Box   Mapper  Output  Value:   Mapper  Output  Key:  Bounding  Box   Mapper  Output  Value:   Mapper  Output  Key:  Bounding  Box   Mapper  Output  Value:   Build  Tile  Cache:  Map  
  • 17. Reducer  Key  Input:  Bounding  Box   (minx  =  -­‐45.0  miny  =  -­‐2.8125  maxx  =  -­‐43.59375  maxy  =  -­‐2.109375)   Reducer  Value  Input:   Step  1:  Input  to  Reducer   …   Step  2:  Reducer  Output   Assemble  Images  based  on  bounding  box   •  Reducer   assembles  Ales   at  each  zoom   level   •  Tiles  wriLen  to   Accumulo  (a   NoSQL   database)   Build  Tile  Cache:  Reduce  
  • 18. Map  Phase   •  Map   – Read  in  images  by  Bands,  Date,  and  Region   – Fix  a  zoom  level  for  sending  to  reducers   •  Based  on  number  of  reducers  and  processing  power,   not  on  the  zoom  you  want  for  display   – Emit  as  <key>,  <value>   •  Key  =  <Bounding  Box  at  Fixed  Zoom  Level>   •  Value  =  <Bounding  Bounding  Box  at  Smallest  Zoom                                              Level,  Bands,  ProjecAon,  Timestamp,                                          Image  Bytes>  
  • 19. Reduce  Phase   •  All  bytes  for   bands  and   satellite  strips  in   this  bounding  box   are  mapped  to   the  same  reducer     •  The  key  can  be   idenAfied  by  the   Lat/Long  of  the     upper  right   corner  of  the  box  
  • 20. Level  1  Images  -­‐  Details   •  Satellite  track  images  (L1R)  are  rotated  and   geolocated  (L1G)  by  NASA   •  We  overlay  L1G  images  into  Level-­‐2  dyadic  Ales   in  Map-­‐Reduce   locaAon  in   Google  Maps   L1R   L1G   Level-­‐2  Ales  made  in  Map-­‐Reduce,  prepared  for  WMS   T06-­‐00097-­‐00092   T10-­‐01561-­‐01486  
  • 21. Some  example  images   Gobi  Desert   •  same  as  previous   page   •  contains  some   strange  structures   that  are  too  small   to  spaAally  resolve   with  Hyperion,   but  they  might   have  interesAng   spectral  features  
  • 22. Some  example  images   Karijini,  Australia   •  lots  of  colorful   minerals   •  should  have  a  very   rich  spectrum  
  • 23. Some  example  images   Lake  Frome,  Australia   •  salt  bed  is  a  standard   calibraAon  target   Atacama  Desert,  Chile   •  salt  bed  in  the  driest  part   of  the  world  
  • 24. •  CO2  has  three   absorbAon  lines   within  Hyperion’s   spectral  range   •  Sideband  subtracAon   technique  extracts  a   pure  sample  of  data  in   a  peak  by  fisng   nearby  datapoints  to  a   curve  and  subtracAng   peak  values  from  the   curve   •  In  this  case,  we  invert   the  subtracAon   because  it’s  an  anA-­‐ peak   External   Reference   Algebraic  combinaAon  of  spectral   bands  to  make  a  more  sensiAve  image  
  • 25. •  CO2  has  three   absorbAon  lines   within  Hyperion’s   spectral  range   •  Sideband  subtracAon   technique  extracts  a   pure  sample  of  data  in   a  peak  by  fisng   nearby  datapoints  to  a   curve  and  subtracAng   peak  values  from  the   curve   •  In  this  case,  we  invert   the  subtracAon   because  it’s  an  anA-­‐ peak   Algebraic  combinaAon  of  spectral   bands  to  make  a  more  sensiAve  image   two  bands  in   the  CO2  line  
  • 26. Algebraic  combinaAon  of  spectral   bands  to  make  a  more  sensiAve  image   •  Icelandic   volcano  in   April  2010   (Eyjatallajökull)   •  Visible  frame  is   full  of  ash  clouds   •  CO2  distribuAon  is   non-­‐uniform   •  Some  CO2    acAvity  follows      visible  cloud        formaAons,          some  doesn’t  
  • 27. Algebraic  combinaAon  of  spectral   bands  to  make  a  more  sensiAve  image   •  Some  CO2    acAvity  follows      visible  cloud        formaAons,          some  doesn’t   Python  code  used  to  produce  this  image  (vectors  in  bold):     sum1  =  4.   sumx  =  183.  +  184.  +  188.  +  189.   sumxx  =  183.**2  +  184.**2  +  188.**2  +  189.**2   sumy  =  B183  +  B184  +  B188  +  B189   sumxy  =  183.*B183  +  184.*B184  +  188.*B188  +  189.*B189     delta  =  sum1*sumxx  -­‐  sumx**2   constant  =  (sumxx*sumy  -­‐  sumx*sumxy)  /  delta   linear  =  (sum1*sumxy  -­‐  sumx*sumy)  /  delta     subtracted  =  (B185  -­‐  (constant  +  185.*linear))/2.  +                                                  (B186  -­‐  (constant  +  186.*linear))/2.   •  Icelandic   volcano  in   April  2010   (Eyjatallajökull)   •  Visible  frame  is   full  of  ash  clouds   •  CO2  distribuAon  is   non-­‐uniform  
  • 28. Algebraic  combinaAon  of  spectral   bands  to  make  a  more  sensiAve  image   •  Some  CO2    acAvity  follows      visible  cloud        formaAons,          some  doesn’t   hLp://lvoc-­‐matsu.opensciencedatacloud.org/SimpleWMS/? lat=63.7&lng=-­‐19.45&z=11&rgb=true&co2=true&flood=false&points=clust ers   •  Icelandic   volcano  in   April  2010   (Eyjatallajökull)   •  Visible  frame  is   full  of  ash  clouds   •  CO2  distribuAon  is   non-­‐uniform  
  • 30. For  More  InformaAon   •  Project  Matsu  is  managed  and  operated  by  the  Open  Cloud  ConsorAum   (www.opencloudconsorAum.org).   •  Project  Matsu  is  supported  in  part  by  grants  from  Gordon  and  BeLy  Moore   FoundaAon  and  the  NaAonal  Science  FoundaAon  (Grants  OISE  -­‐  1129076  and  CISE   1127316).     •  For  more  informaAon  about  Project  Matsu,  please  see  the  Project  Matsu  website:   matsu.opensciencedatacloud.org   •  The  Project  Director  is  Robert  Grossman,  who  can  be  reached  at    
  • 31. Here  is  some  detail  of  how  we  process  EO-­‐1    satellite   imagery  data  using  Hadoop  in  Project  Matsu…  
  • 32. Step  1  –  Storage  &  Archiving   From  Space  to  Goddard  to  the  OSDC   1.  Transmit  data  from  NASA’s  EO-­‐1  Satellite  to  NASA   ground  staAons  and  then  to  NASA  Goddard   2.  At  Goddard,  align  data,  perform  radiometric   correcAons  and  generate  Level  0  images  (16-­‐bit   radiance  values)   3.  Transmit  Level  0  data  from  NASA  Goddard  to  the   OCC’s  Open  Science  Data  Cloud  (OSDC)   4.  Store  images  in  a  distributed,  fault  tolerate,  file   system  
  • 33. Step  2  –  CreaAng  Level  1  Images   Building  Level  1  Images  on  the  OSDC   1.  Each  day,  the  new  Level  0  images  stored  on  the   OSDC  are  processed   2.  Within  the  OSDC,  NASA  launches  Virtual   Machines  (VMs)  specifically  built  to  render  Level   1  images  from  Level  0  data.   –  Each  Level  1  band  is  saved  as  a  disAnct  image   3.  Level  1  bands  are  wriLen  to  storage  facility  in  the   OSCD  for  long-­‐term  public  access  
  • 34. Step  3  –  Tiling   Matsu  Processing   1.  Build  Web  Mapping  Tile  Service  Tiles  from  Level   1  images  using  MapReduce   2.  Store  Ales  in  Accumulo   •  Index  them  so  that  they  are  accessible  via  Web   Mapping  Service   3.  Run  AnalyAcs  on  Level  1  images   •  Move  results  of  the  analyAcs  to  Accumulo  
  • 35. Tiling  -­‐  Detail   •  Use  MapReduce  to  build  Web  Tiles   1.  Each  day,  the  Level  1  images  created  by  NASA     and  stored  on  the  OSDC  are  processed   2.  The  Date  and  Bands  (to  create  a  visible  image)   are  specified   3.  Run  MapReduce  Job   1.  Map  –  FILL-­‐IN   2.  ParAAon  –  FILL-­‐IN   3.  Reduce  –  FILL-­‐IN  
  • 36. Tile  Details,  cont’d   •  Images  are  handles  as  byte  streams   •  Divide  (chunk)  the  Level  1  images  into   manageable  sizes.   •  Dyadic  decomposiAon   –  Divide  each  image  into  4  equal  size  pieces   –  For  each  addiAonal  zoom,  subdivide  each  piece  into  4   equal  size  pieces   •  Tag  each  chunked  images  with  the  bounding  box,   date,  Ame,  dyadic  level  and  bands.   •  Convert  the  bytes  into  PNG  files  
  • 37. Processing  the  Data   •  Reduce   – Once  all  images  are  received  for  a  Bounding  Box,   sort  by  the  most  granular  zoom  level   – Process  that  Zoom  Level   – Once  a  zoom  level  in  is  completed,  combine   images  and  scale  the  build  the  next  zoom  level     Z1   Z1   Z1   Z1   Z2   Z2   1.  Assemble                                                                                                                                                                                                2.  Scale          
  • 38. Accumulo  Storage     •  Images  are  stored  by  Bounding  Box   – -­‐180.0_-­‐90.0_180.0_90.0   •  Column  family   – The  Ale  style,  zoom,  and  projecAon   •  Column  qualifier     – Dimensions  (width  and  height,  512  x  256)   •  Value     – the  corresponding  PNG  image  in  raw  bytes  
  • 39. Serve  to  WMTS   •  The  WMTS  query:   –  Bounding  Box   –  Date   –  Layer  name  as  a  string   •  HaiA   –  Style  name  as  a  string   •  The  bands  used  to  build  the  Level  1  image  or  an  alias:   “B058:B023:B015”  or  “agricultural”   •  Not  supported   –  Map  Project  could  be  used,  but  for  now,  we  only   support  a  single  projecAon  
  • 40. Images:  stages  of  processing   •  Satellite  track  images  (L1R)  are  rotated  and   geolocated  (L1G)  by  NASA   •  We  overlay  L1G  images  into  Level-­‐2  dyadic  Ales   using  Map-­‐Reduce   image  locaAons   (viewed  in   Google  Maps)   L1R   L1G   Level-­‐2  Ales  made  in  Map-­‐Reduce,  prepared  for  WMS   T06-­‐00097-­‐00092   T10-­‐01561-­‐01486