• Share
  • Email
  • Embed
  • Like
  • Save
  • Private Content
Hadoop: the Big Answer to the Big Question of the Big Data
 

Hadoop: the Big Answer to the Big Question of the Big Data

on

  • 6,233 views

More info: http://www.elekslabs.com/2012/02/devtalks-1-presentations.html

More info: http://www.elekslabs.com/2012/02/devtalks-1-presentations.html
Video: http://www.youtube.com/watch?feature=player_embedded&v=GENRle60Elk

Statistics

Views

Total Views
6,233
Views on SlideShare
1,152
Embed Views
5,081

Actions

Likes
1
Downloads
12
Comments
0

21 Embeds 5,081

http://www.elekslabs.com 3227
http://www.rozrobka.com 1340
http://jug-lviv.blogspot.com 352
http://localhost 30
http://elekslabs.com 27
http://elekslabs.azurewebsites.net 22
http://eleks.com 17
http://elekscookiesv2.cloudapp.net 11
http://127.0.0.1 9
http://www.eleks.com 8
http://5625550541520304843_f4c2a411f7711d002736a819ba0f826780166342.blogspot.com 7
https://app.wellcentive.com 7
http://elekscookies.cloudapp.net 6
http://www.linkedin.com 5
http://cookies.demo.eleks.com 3
http://jug-lviv.blogspot.co.uk 3
http://translate.googleusercontent.com 3
http://webcache.googleusercontent.com 1
http://jug-lviv.blogspot.ru 1
http://jug-lviv.blogspot.it 1
http://jug-lviv.blogspot.dk 1
More...

Accessibility

Categories

Upload Details

Uploaded via as Microsoft PowerPoint

Usage Rights

© All Rights Reserved

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
    Processing…
Post Comment
Edit your comment

    Hadoop: the Big Answer to the Big Question of the Big Data Hadoop: the Big Answer to the Big Question of the Big Data Presentation Transcript

    • THE ANSWER TO THE QUESTION OF THE DATAeleks by Victor HaydinDevTalks #1
    • Gordon Moore
    • 1975 2012 Cost of 1 TB storage$208 000 000 $110 Cost of 1 GFLOPS/s computing facility$62 000 000 $1.50 Number of network hosts 57 > 1 000 000 000 World’s data amount~130 GB ~2.9 ZB
    • 1 ZB = 1 000 000 000 000 000 000 000 B (1021)
    • Commodity Hardware
    • Wikipedia: “Apache Hadoop is a softwareframework that supports data-intensivedistributed applications”
    • Main Contributors
    • HDFS: Hadoop Distributed File System Hardware Failure Streaming Data Access Large Data Sets Simple Coherency Mode (write-once) Portability
    • Moving Computation is cheaper then moving Data
    • MapReduce
    • Map(k1,v1) → list(k2,v2)void map(string key, string value): for each word w in value: yield return KeyValuePair(w, 1);Reduce(k2, list (v2)) → list(v3)void reduce(string key, int[] values): int sum = 0; for each pc in values: sum += pc; return KeyValuePair(key, sum);
    • Demo
    • EcosystemZooKeeper
    • 45K nodes, 180-200 PB3K+ nodes, 36+ PB
    • powered by
    • FutureCore:• HDFS: high-availability and scalability• MapReduce: modularity and alternative ways to perform queriesEcosystem development:• Apache BigTop: consolidation project• HBase, Hive, Pig, ZooKeeper, Avro, Sqoop: stabilizing,interoperability• Incubator: Flume, Ozzie, Whirr
    • Demo
    • Q&A