Successfully reported this slideshow.
We use your LinkedIn profile and activity data to personalize ads and to show you more relevant ads. You can change your ad preferences anytime.

Launching your career in Big Data

1,963 views

Published on

Some tips on launching a great career in the exciting field of Big Data

Published in: Career

Launching your career in Big Data

  1. 1. + LaunchingYour Career in Big Data Sujee Maniyam ElephantScale.com Sujee@elephantscale.com
  2. 2. + Who Invited This Guy? Hi, I am Sujee Maniyam J n  15 years+ software development experience n  Consulting & Training in Big Data n  Author n  “Hadoop illuminated” open source book n  “HBase Design Patterns” coming soon n  Open Source contributor (including Hadoop) http://github.com/sujee n  Founder / Organizer of ‘Big Data Guru’ meetup http://www.meetup.com/BigDataGurus/ n  http://sujee.net/
  3. 3. + Riding The Technology Wave Late 1990s : Web 2004-06 : Social (FB, Twitter) 2007 : iPhone 2008 : Hadoop 2014 : Big Data in Real Time 2015 : ?
  4. 4. + Big Data Fad Or Real? n  It is very real !
  5. 5. + Big Data Job Trend
  6. 6. + … and $$$
  7. 7. + Audience For This Talk… Developers
  8. 8. + This Doesn’t Work…. n  Quit Job on Friday n  Attend Big Data Bootcamp over the Weekend n  Start as a Big Data Developer on Monday n  J sorry !
  9. 9. + Road Map For LaunchingYour Big Data Career n  (1) Learn n  (2) Network n  (3) Be Known n  (4) get hired
  10. 10. + (c) Elephant Scale.com -- 2014 10
  11. 11. + Big Data / Hadoop Jobs
  12. 12. + Data Scientist (c) Elephant Scale.com -- 2014 12
  13. 13. + Step (1) Learning n  Preferred Option : learn on your current job J n  Take a training class n  Do a Project n  If That is not possible (dead end job / employer) n  Learn on your own
  14. 14. + Learning 1 : Learn n  Books n  So many, n  Start with ‘Hadoop’ by Tom White n  Also checkout our free, open source book :‘hadoop illuminated’ J n  Tutorials (Cloudera / HortonWorks) n  Blogs (Major vendors) n  Webinars n  Free, watch at your own time n  Signup at Cloudera / HortonWorks / DataStax n  Meetups n  Plenty to choose from n  My meetup : Big Data Gurus in San Jose J
  15. 15. + Learning 2 : Practice n  Hands-on a must !!!! n  Get a Hadoop VM / Sandbox from a vendor n  Easiest way to get Hadoop running n  Free ! n  Every thing pre-installed and configured… ready to go! n  Use Hadoop version 2.x n  Cloudera 5.x series n  HortonWorks 2.x series
  16. 16. + Learning 2 : Practice n  Most VMs come with some tutorials pre-installed.. Do them n  We maintain an open-source Hadoop labs on github n  https://github.com/elephantscale/HI-labs n  50+ labs on HDFS / MapReduce / Pig / Hive n  Where to get data? n  http://hadoopilluminated.com/hadoop_illuminated/ Public_Bigdata_Sets.html n  Amazon hosts some big data sets
  17. 17. + Learning 2 : Practice ++ How to stand-out n  Need more than ‘hello world’ n  Practice running Hadoop as a cluster n  Use cloud providers like Amazon, Rackspace … n  Cost ?
  18. 18. + Challenges In Self Learning n  Takes up a lot of personal time! n  Can loose motivation n  Pair-study with some-one n  Motivate / teach each other
  19. 19. + Very quickly….
  20. 20. + Insight Data Engineering Fellowship (I am mentor!) n  6 weeks, full time, professional fellowship n  Completely free for students ! (yes, really) n  Mentored by top industry experts (Nathan Marz – creator of Storm, Databricks– creators of Spark ..etc) n  Learn cool Data Engineering tools n  Build an awesome project n  Motif finder at large scale n  Inspect / visualize BitCoin transactions in real-time n  Demo to top companies (Netflix,Yelp, Facebook, Square) n  Get hired !
  21. 21. Insight  Data  Engineering   Fellows  Program   InsightDataEngineering.com  
  22. 22. + Big Data Skill Chasm
  23. 23. + Do I Need a Certification? n  Certifications are good n  if you have no practical experience n  Consultant n  Cloudera / Horton Works n  You don’t need to take their courses n  Just take the certification exams n  Reasonably easy with preparation n  Very affordable (few hundred $)
  24. 24. + Do I Need To Take A Pay Cut? n  Don’t hesitate to take a paycut for your first Big Data job n  You will gain valuable practical experience n  And the next job you will make lot more
  25. 25. + Step (2) Networking n First get your OWN business card J n Meetups n  tons of meetup in this area n  My meetup : http://www.meetup.com/BigDataGurus/ J n Conferences n  Can be expensive (Strata $3k) n  Beg some one for a ‘visitor pass’ J n  Cheap conferences (HbaseConf $400, Hadoop Summit : $500) n  Money well spent… great connections!
  26. 26. + Networking : How To Find Decision Makers?
  27. 27. + Networking : Uber Networking Tips n  Read this book è n  Soft Networking n  Become the connector, do intros n  People will REMEMBER you! n  Volunteering n  Help to run a meetup / event n  You will get to know people you otherwise wouldn’t meet (Board of directors …etc)
  28. 28. + Step (3) -- Be Known It is not what you know It is who you know It is WHO knowsYOU !
  29. 29. + How to Be Known (aka How To Become an Expert!) n Open Source n  Huge boost to your resume n  Write quality blogs, articles n  Lot of magazines wants contributors n  Write a Book n  We did it on our own – without a publisher n  ‘Hadoop Illuminated’ : http://hadoopilluminated.com/ n  Speak at meetups / conferences
  30. 30. + Old Resume
  31. 31. + Your New Resume
  32. 32. + Github activity log (employers check these !)
  33. 33. + Open Source Contributions n  Open Source involvement is a ‘hot skill’ n  Just saying ‘I used TomCat’ isn’t enough J n  Open source tells me two things about you: n  You are passionate about technology (not just b/c this gig pays well) n  You dedicate your personal time à initiative n  Best option : Get Paid to work on open source J cloudera, linkedin, netflix…. n  If not, you can still do meaningful contributions..
  34. 34. + How to Contribute To Open Source n  Step 1) Get a GitHub account (free) n  Step 2) Look for open source projects to contribute n  Hadoop, cassandra, Spark n  Start fixing bugs (‘low hanging fruits’) n  Step 3) Starting your own project n  Has to be unique (not another word count example using Spark!) n  Solve some thing you know about n  E.g. : Mark Kerzner – eDiscovery & Hadoop
  35. 35. + Personal Story : Contributing to HBase n  [code] Improve benchmarking tool n  Started as a hobby project n  Submitted patch n  Lots help from senior developers n  https://issues.apache.org/jira/browse/HBASE-4440 n  [documentation] improve patch submission process n  You don’t need to be a code-ninja to contribute !! n  Documentation is badly needed in open source projects n  https://issues.apache.org/jira/browse/HBASE-5577
  36. 36. + JIRA : HBASE-4440
  37. 37. + Tips On Submissions n  Make it easy for committers n  Don’t create extra work for them !
  38. 38. + Speaking at Meetups / Conferences n  Start with meetups n  Have a unique topic n  “solving XXX using Spark” ..etc n  Having a popular open source project will help J J n  Submit proposals to many conferences n  You may not get into Strata first J n  Pay attention to ‘local’ conferences n  SVCodeCamp, Dataweek in SF …etc n  Big Data conference calendar http://elephantscale.com/bigdata_conferences
  39. 39. + Acing The Interview n  Interviewer : So , have you used Hadoop at your work? What kind of practical experience you have? n  If no, then usually interview ends here n  You : Ahem, I haven’t had a chance to use Hadoop at work… But let me tell you about the open source project I am working on… * walk to whiteboard, start drawing, explain …* * gets hired ! *
  40. 40. + Step (4) Get Hired
  41. 41. + Thanks ! Sujee Maniyam sujee@elephantscale.com http://elephantscale.com Expert consulting & training in Big Data © Elephant Scale, 2014
  42. 42. + Credits

×