The evolution of web and big data
 

Usage Rights

© All Rights Reserved


    Presentation Transcript

    • The Evolution of Web and Big Data (Edward J. Yoon)
    • Who Am I
      • Edward J. Yoon (@eddieyoon)
      • Founder of Apache Hama
      • PMC member of Apache Bigtop
      • Oracle employee
    • Early era of Web
      • Google
        – 2003: GFS
        – 2004: MapReduce
        – 2005: Sawzall
        – 2006: BigTable
      • OSS
        – 2005: Hadoop (HDFS, MapReduce)
        – 2006: Pig
        – 2007: Hive, HBase
    • Google?
      • World's best "full-text search engine"
      • In 2003:
        – 10,000+ servers
        – 4+ billion documents
        – 300+ million images
    • Hadoop 1.0
      • HDFS + MapReduce
        – And Pig, Hive, HBase, Mahout
    • The new era of Web
      • Google
        – 2010: Pregel, Dremel
        – 2012: Spanner
      • OSS
        – 2010: Hama, Twitter Storm
        – 2011: YARN, Giraph
        – 2012: Drill
    • MR vs. Alternatives
    • YARN?
      • Job scheduling and cluster resource management
    • Future of CDH4 and Hadoop
      • CDH4 will be based on 0.23.x or later
      • 0.23.0 doesn’t include Map/Reduce 1.0
        – Storm, Giraph, Hama, Spark, MPI, GraphLab
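
The slides name MapReduce as the processing half of Hadoop 1.0 without showing the model itself. As a rough illustration only, the map, shuffle, and reduce steps can be sketched in a single process as below. This is not Hadoop's API: the function names (`map_phase`, `reduce_phase`, `run_mapreduce`) and the sample input are hypothetical, and a real cluster would spread each phase across many machines reading from HDFS.

```python
from collections import defaultdict
from typing import Callable, Iterable, Iterator


def map_phase(document: str) -> Iterator[tuple[str, int]]:
    """Map step: emit a (word, 1) pair for every word in one input record."""
    for word in document.lower().split():
        yield word, 1


def reduce_phase(word: str, counts: list[int]) -> tuple[str, int]:
    """Reduce step: combine all values collected for one key."""
    return word, sum(counts)


def run_mapreduce(
    records: Iterable[str],
    mapper: Callable[[str], Iterable[tuple[str, int]]],
    reducer: Callable[[str, list[int]], tuple[str, int]],
) -> dict[str, int]:
    """Toy single-process driver (hypothetical, for illustration only)."""
    # Shuffle step: group every mapped value under its key.
    groups: dict[str, list[int]] = defaultdict(list)
    for record in records:
        for key, value in mapper(record):
            groups[key].append(value)
    # Reduce step: one reducer call per distinct key.
    return dict(reducer(key, values) for key, values in groups.items())


# Hypothetical sample input: two tiny "documents".
word_counts = run_mapreduce(["big data", "big web"], map_phase, reduce_phase)
print(word_counts)
```

The driver keeps every group in memory, which is only workable for toy inputs; the point is just to show how the three phases hand data to each other.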