Your SlideShare is downloading. ×
0
Cloudera/Stanford EE203 (Entrepreneurial Engineer)
Cloudera/Stanford EE203 (Entrepreneurial Engineer)
Cloudera/Stanford EE203 (Entrepreneurial Engineer)
Cloudera/Stanford EE203 (Entrepreneurial Engineer)
Cloudera/Stanford EE203 (Entrepreneurial Engineer)
Cloudera/Stanford EE203 (Entrepreneurial Engineer)
Cloudera/Stanford EE203 (Entrepreneurial Engineer)
Cloudera/Stanford EE203 (Entrepreneurial Engineer)
Cloudera/Stanford EE203 (Entrepreneurial Engineer)
Cloudera/Stanford EE203 (Entrepreneurial Engineer)
Cloudera/Stanford EE203 (Entrepreneurial Engineer)
Cloudera/Stanford EE203 (Entrepreneurial Engineer)
Cloudera/Stanford EE203 (Entrepreneurial Engineer)
Cloudera/Stanford EE203 (Entrepreneurial Engineer)
Upcoming SlideShare
Loading in...5
×

Thanks for flagging this SlideShare!

Oops! An error has occurred.

×
Saving this for later? Get the SlideShare app to save on your phone or tablet. Read anywhere, anytime – even offline.
Text the download link to your phone
Standard text messaging rates apply

Cloudera/Stanford EE203 (Entrepreneurial Engineer)

3,094

Published on

This is a talk that I gave at Stanford's EE203 (Entrepreneurial Engineer) on Tuesday Feb 9th, 2010. It covers my experience at Stanford, VivaSmart, Yahoo, Accel Partners, and Cloudera.

This is a talk that I gave at Stanford's EE203 (Entrepreneurial Engineer) on Tuesday Feb 9th, 2010. It covers my experience at Stanford, VivaSmart, Yahoo, Accel Partners, and Cloudera.

Published in: Technology, Business
0 Comments
3 Likes
Statistics
Notes
  • Be the first to comment

No Downloads
Views
Total Views
3,094
On Slideshare
0
From Embeds
0
Number of Embeds
2
Actions
Shares
0
Downloads
60
Comments
0
Likes
3
Embeds 0
No embeds

Report content
Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
No notes for slide
  • http://developer.yahoo.net/blogs/hadoop/2009/05/hadoop_sorts_a_petabyte_in_162.html100s of deployments worldwide (http://wiki.apache.org/hadoop/PoweredBy)
  • Transcript

    • 1.
    • 2. Outline<br />About Me<br />The VivaSmart Story<br />The EIR Experience<br />The Cloudera Story .. so far<br />What is Hadoop?<br />Open Source Business Models<br />Lessons Learned & Advice<br />
    • 3. About Me<br />I got BS and MS from Cairo University in Egypt.<br />I came to US in 1995 to get my PhD from Stanford, with goal to go back to Egypt and teach.<br />I got infected by the Entrepreneurship bug, it is rampant at Stanford, hopefully you’ll get infected too<br />In 1999 I took a leave of absence from PhD to start VivaSmart, which I sold to Yahoo in 2000.<br />I stayed with Yahoo till mid-2008, also finished my PhD in mid-2007 with Mendel Rosenblum.<br />I started Cloudera in fall of 2008.<br />
    • 4. The VivaSmart Story<br />It started as Booksmart in Spring of 1999.<br />Initial prototype was built by Thai Tran.<br />We got funded by a great angel (Frank Marshall).<br />Couldn’t raise VC money, but we were able to raise more angel money, and got lighthouse customers.<br />Noticed that it is hard to drive traffic, decided to focus on catalog management technology (Aptivia).<br />Got initial acquisition termsheet from Excite@Home for $12M but they reneged at last minute (4/2000)<br />Yahoo Shopping acquired us for $9M in June 2000.<br />
    • 5. The EIR Experience<br />EIR = Entrepreneur in Residence.<br />Joined Accel Partners in June 2008 as an EIR.<br />Spent most of the summer researching possible ideas for my next venture, also helped with due diligence for a number of companies.<br />Experienced the fund raising process from the VC side, very useful to see how they think.<br />Met my Cloudera co-founders through Accel<br />Andrew Braccia (agb) and Ping Li (pli) from Accel Partners joined the Cloudera Board of Directors.<br />
    • 6. The Cloudera Story … so far<br />Oct 2008: Got $5M round A funding from Accel Partners and a number of strategic angel investors.<br />Four founders (too many?): <br />Mike Olson (Oracle)<br />Jeff Hammerbacher (Facebook)<br />Christophe Bisciglia (Google)<br />AmrAwadallah (Yahoo)<br />Announced the company in March of 2009.<br />May 2009: Got $6M in funding from Greylock Ventures (opportunistic B round)<br />AneelBhusri joined our board from Greylock<br />
    • 7. Cloudera’s Elevator Pitch<br />A single,consolidated repository to enable insights across complex and structured data.<br />Complex Data<br />Documents<br />Web feeds<br />System logs<br />Online forums<br />SharePoint<br />Sensor data<br />EMB archives<br />Photo/Video<br />Structured Data (“relational”) <br />CRM<br />Financials<br />Logistics<br />Inventory<br />Sales records<br />HR records<br />
    • 8. What is Hadoop?<br />The foundation of our system is built on top of Apache Hadoop, which is a scalable distributed data processing system.<br />The scalability of Hadoop comes from marriage of:<br />HDFS: Self-Healing High-Bandwidth Clustered Storage.<br />MapReduce: Fault-Tolerant Distributed Processing.<br />The software manages and heals it self.<br />Leverages the economies of scale of commodity hardware (multi-core chips, many disks per system)<br />Compute moves to data (not other way around).<br />
    • 9. Hadoop History<br />2002-2004: Doug Cutting and Mike Cafarella started working on Nutch<br />2003-2004: Google publishes GFS and MapReduce papers <br />2004: Cutting adds DFS & MapReduce support to Nutch<br />2006: Yahoo! hires Cutting, Hadoop spins out of Nutch<br />2007: NY Times converts 4TB of archives over 100 EC2s<br />2008: Web-scale deployments at Y!, Facebook, Last.fm<br />April 2008: Yahoo does fastest sort of a TB, 3.5mins over 910 nodes<br />May 2009:<br />Yahoo does fastest sort of a TB, 62secs over 1460 nodes<br />Yahoo sorts a PB in 16.25hours over 3658 nodes<br />June 2009, Oct 2009: Hadoop Summit (750), Hadoop World (500)<br />September 2009: Doug Cutting joins Cloudera<br />
    • 10. Open Source Software Business Models<br />Open Source is attractive since it gets you:<br />Free Distribution: People can download and try it out<br />Darwinian Effect: Lots of developers try to solve the problem, best solution wins.<br />Faster Innovation: Customers build the product with you!<br />OSS Business Models:<br />Support/Maintenance/Service agreements<br />Open Core: core is free, but there is value-add proprietary technology around it (“Community” vs “Enterprise” Edition)<br />Monetization through enablement of other services (e.g. Firefox makes money from Google Search).<br />
    • 11. Lessons Learned & Advice<br />Make sure your idea can actually make money!<br />Hire great people (corollary: Fire swiftly).<br />Make sure you are passionate about your idea.<br />Listen to customers, but look for the problems, it is your job to come up with solutions.<br />Be agile, iterate quickly, don’t spend a year planning, don’t be afraid to make mistakes.<br />Don’t be afraid to fail, but don’t persist in your failing ways, learn from failure quickly and evolve (Moore)<br />Have faith, but don’t let it blind you from reality<br />
    • 12. Books I Recommend<br />“Blue Ocean Strategy”, W. Chan Kim, Renée Mauborgne.<br />“The Innovator’s Dilemma”, Clayton Christensen<br />“The Innovator’s Solution”, Clayton Christensen, and Michael Raynor<br />“Good to Great”, Jim Collins<br />“The Seven Habits of Highly Effective People”, Stephen Covey<br />“Crossing the Chasm”,“Tornado”, Geoffrey Moore<br />“The Black Swan”, NassimTaleb.<br />
    • 13. Contact Information<br />We Are Hiring:<br />jobs+ee203@cloudera.com<br />AmrAwadallah<br />CTO, Cloudera Inc.<br />http://twitter.com/awadallah<br />Online Training Videos and Info:<br />http://cloudera.com/hadoop-training<br />http://cloudera.com/blog<br />http://twitter.com/cloudera<br />

    ×