AWS Start-Up Tour 2009 / ShareThis

Loading...

Flash Player 9 (or above) is needed to view presentations.
We have detected that you do not have it on your computer. To install it, go here.

0 comments

Post a comment

    Post a comment
    Embed Video
    Edit your comment Cancel

    Favorites, Groups & Events

    AWS Start-Up Tour 2009 / ShareThis - Presentation Transcript

    1. ShareThis on AWS Paco Nathan, Data Insights ShareThis.com AWS Start-Up Tour 2009-06-16
    2. What Does ShareThis Do? • “Make it simple to share any online content” • Social content sharing platform • ESPN, FOX, CS Monitor, HuffPost, CBS Marketwatch, Wired, TechCrunch, ThinkGeek, etc. • When a news story goes viral on a major publisher, our sharing services must scale-out to keep pace AWS Start-Up Tour 2009-06-16
    3. AWS Start-Up Tour 2009-06-16
    4. Why Our Company Uses AWS • >10^6 publishers, >10^9 users, >10^10 urls • Early stage start-up, < 25 people, “wearing lots of hats”, ultra fast-paced R&D • Spikes in popular stories impose demands throughout the architecture: API services, loggers, DW, BI, etc. • How can this level of service be built 100% in the cloud? AWS Start-Up Tour 2009-06-16
    5. http://shar.es/1B7 AWS Start-Up Tour 2009-06-16
    6. System Architecture • Each service designed for cost-effective, horizontal scale-out • API served by cluster of LAMP stack + cluster of NginX • AsterData: nCluster infrastructure “hub-and-spoke” pattern • Cascading: abstraction layer for tying together components • Batch jobs on Elastic MapReduce, AsterData SQL/MR • SQS, EBS, SimpleDB, MTurk, plus other AWS services AWS Start-Up Tour 2009-06-16
    7. AWS Start-Up Tour 2009-06-16
    8. Key Learnings • Capability to scale-out horizontally without having to recode, rebuild, etc. — add new EC2 nodes to clusters • Authoritative data + backups in S3, great approach for DR • Wide range of use cases implemented: widget API, log clean-up, vertical search, business intelligence, etc. • Developers launch their own sandbox instances — makes dev/test/debug cycles more efficient • Staff enabled to “wear even more hats” with less risk AWS Start-Up Tour 2009-06-16
    9. Cascading + Elastic MapReduce AWS Start-Up Tour 2009-06-16
    10. Cascading + Elastic MapReduce • “Syntax is for humans, APIs are for software” • Defines apps as set operations applied to data flows • Engineers & data scientists don’t think in terms of MapReduce primitives, key/value pairs, etc. • Integrates Hadoop API + other APIs (S3, SQS, JDBC) • Expresses end-points as Java design patterns, compiled code — not just a scramble of scripts AWS Start-Up Tour 2009-06-16
    11. Cascading + Elastic MapReduce • Highly scalable, fault-tolerate framework for batch jobs • Dramatically reduced need for Ops overhead • Excellent command line tools make the dev/test/debug cycle very efficient with “Big Data” • Highly expert staff, very responsive and helpful in forums • Cascading example code in developer resources: “LogAnalyzer for CloudFront” and “Multitool” AWS Start-Up Tour 2009-06-16
    12. Hadoop Book / Case Study ShareThis case study, "Cascading" by Chris K Wensel, in… AWS Start-Up Tour 2009-06-16
    13. Contacts http://sharethis.com @pacoid on Twitter AWS Start-Up Tour 2009-06-16

    + Paco NathanPaco Nathan, 5 months ago

    custom

    323 views, 0 favs, 0 embeds more stats

    ShareThis, AWS Start-Up Tour 2009, Sunnyvale

    More info about this document

    © All Rights Reserved

    Go to text version

    • Total Views 323
      • 323 on SlideShare
      • 0 from embeds
    • Comments 0
    • Favorites 0
    • Downloads 5
    Most viewed embeds

    more

    All embeds

    less

    Flagged as inappropriate Flag as inappropriate
    Flag as inappropriate

    Select your reason for flagging this presentation as inappropriate. If needed, use the feedback form to let us know more details.

    Cancel
    File a copyright complaint
    Having problems? Go to our helpdesk?

    Categories

    Tags