Hadoop And Universities

2,415 views
2,308 views

Published on

Some thoughts about how to link up University research with the Hadoop community

Published in: Technology, Business
0 Comments
1 Like
Statistics
Notes
  • Be the first to comment

No Downloads
Views
Total views
2,415
On SlideShare
0
From Embeds
0
Number of Embeds
9
Actions
Shares
0
Downloads
37
Comments
0
Likes
1
Embeds 0
No embeds

No notes for slide

Hadoop And Universities

  1. 1. Hadoop and Universities © 2009 Hewlett-Packard Development Company, L.P. V 1.1 The information contained herein is subject to change without notice
  2. 2. Strategic Goals •  CS and other science graduates come out knowing how to code with MapReduce •  The UK & EU grids host Hadoop for PB of data and the computation •  Postgraduate research is done on and inside Hadoop. •  Engagement between the ASF/Hadoop team and the Academic community
  3. 3. Where is Hadoop being used? •  CS: MapReduce as an algorithm •  AI: datamining (Edinburgh) •  Other sciences: Hadoop for data storage/ analysis?
  4. 4. CS Teaching •  Is Hadoop over-complex? •  MapReduce with Haskell, Prolog, Erlang •  Cloudera VM + Eclipse •  Common datasets •  Re-use and adapt US coursework
  5. 5. EU and UK Grids •  How to to host Hadoop over GGF grids? •  Should we bother? •  Who will do the work?
  6. 6. What can we do •  Lecture at the local universities •  Help people set up clusters •  Offer cluster-time and datasets •  Anything else?
  7. 7. Postgraduate Research •  On Hadoop: new algorithms, layers on top •  On Hadoop: MR for science •  In Hadoop: scheduling, placement •  Present at ApacheCon, HUG •  Cluster time on OpenCirrus? Steer researchers away from trouble, mentor them ASF to host hadoop-research list, SVN
  8. 8. UK Hadoop-in-eScience event? Ross Gardler: OSS Watch are putting on an open source conference in Q2 2010...
  9. 9. 9 August 9, 09

×