Hadoop Summit 2010 Yahoo’S Commitment To Hadoop And Open Source

1,435 views
1,376 views

Published on

Published in: Technology
0 Comments
3 Likes
Statistics
Notes
  • Be the first to comment

No Downloads
Views
Total views
1,435
On SlideShare
0
From Embeds
0
Number of Embeds
3
Actions
Shares
0
Downloads
68
Comments
0
Likes
3
Embeds 0
No embeds

No notes for slide

Hadoop Summit 2010 Yahoo’S Commitment To Hadoop And Open Source

  1. 1. Accelerating Innovation with Cloud Computing Hari Vasudev India Hadoop Summit - Bangalore February 2010
  2. 2. I’m not selling anything
  3. 3. Cloud Computing is NOT about saving money
  4. 4. Yahoo! is Perfect for Cloud Computing PETABYTES HUNDREDS BILLIONS 600M 300M+ OFUNIQUE USERS DAILY PROPERTIES /STORED PETABYTES OF/ STORAGE OF OBJECTS PRODUCTS OF TRAFFIC MONTH YAHOO! MAIL USERS / MONTH
  5. 5. Yahoo! Cloud Strategy • Creating a private Cloud for Yahoo! • Optimizing for global Yahoo! properties • Data processing and serving environments • Multi-year effort • Open Source
  6. 6. Inside Yahoo!’s Cloud
  7. 7. Yahoo!’s Open Source for Cloud
  8. 8. Cloud Solving Industry-wide Problems • Mail abuse detection • Dependent on globally synchronized data • Cloud storage • Global data replication • Consistency • Fast and easy to use • Developers focus on task at hand
  9. 9. • Organizational commitment • Investment • Time
  10. 10. Cloud Computing is worth it!
  11. 11. Yahoo!’s Cloud Use Case Caching, Load Balancing Search Index Machine Learning Advertising (e.g. Spam filters) Content Optimization Optimization & Delivery Attachment Storage RSS Feeds Image/Video Storage & Delivery
  12. 12. Cloud improves dynamic content refresh rates and consumer access speed
  13. 13. Cloud abstracts away scale for processing enormous data sets
  14. 14. Cloud speeds advertising optimization by improving infrastructure utilization 15
  15. 15. Cloud Speeds Time To Market • YQL • SQL-like language • Query, filter, and join data across web services • YQL Open Data Tables built on Cloud storage • Simple and fast integration and deployment • Immediate access to global, replicated, fast, reliable data store 16

×