Successfully reported this slideshow.
We use your LinkedIn profile and activity data to personalize ads and to show you more relevant ads. You can change your ad preferences anytime.

Hadoop And Universities


Published on

Some thoughts about how to link up University research with the Hadoop community

Published in: Technology, Business
  • Be the first to comment

Hadoop And Universities

  1. 1. Hadoop and Universities © 2009 Hewlett-Packard Development Company, L.P. V 1.1 The information contained herein is subject to change without notice
  2. 2. Strategic Goals •  CS and other science graduates come out knowing how to code with MapReduce •  The UK & EU grids host Hadoop for PB of data and the computation •  Postgraduate research is done on and inside Hadoop. •  Engagement between the ASF/Hadoop team and the Academic community
  3. 3. Where is Hadoop being used? •  CS: MapReduce as an algorithm •  AI: datamining (Edinburgh) •  Other sciences: Hadoop for data storage/ analysis?
  4. 4. CS Teaching •  Is Hadoop over-complex? •  MapReduce with Haskell, Prolog, Erlang •  Cloudera VM + Eclipse •  Common datasets •  Re-use and adapt US coursework
  5. 5. EU and UK Grids •  How to to host Hadoop over GGF grids? •  Should we bother? •  Who will do the work?
  6. 6. What can we do •  Lecture at the local universities •  Help people set up clusters •  Offer cluster-time and datasets •  Anything else?
  7. 7. Postgraduate Research •  On Hadoop: new algorithms, layers on top •  On Hadoop: MR for science •  In Hadoop: scheduling, placement •  Present at ApacheCon, HUG •  Cluster time on OpenCirrus? Steer researchers away from trouble, mentor them ASF to host hadoop-research list, SVN
  8. 8. UK Hadoop-in-eScience event? Ross Gardler: OSS Watch are putting on an open source conference in Q2 2010...
  9. 9. 9 August 9, 09