Upload
  • Email
  • Favorite
  • Download
  • Embed
  • Private Content

Loading…

Flash Player 9 (or above) is needed to view presentations.
We have detected that you do not have it on your computer. To install it, go here.

Hadoop, Pig, and Twitter (NoSQL East 2009)

by Kevin Weil on Oct 30, 2009

  • 75,445 views

A talk on the use of Hadoop and Pig inside Twitter, focusing on the flexibility and simplicity of Pig, and the benefits of that for solving real-world big data problems.

A talk on the use of Hadoop and Pig inside Twitter, focusing on the flexibility and simplicity of Pig, and the benefits of that for solving real-world big data problems.

Accessibility

Categories

Tags

hadoop twitter pig nosql analytics bigdata hadoop pig twitter pig hadoop hadoop pig twitter nosql data hadoop pig big data hadoop nosql twitter mapreduce hadoop bigdata nlp pig twitter coding hadoop pig twitter analysis hive nosql hadoop apache hadoop pig machine-learning mapreduce twitter inte databases pigtwitter haddop analysis dataproblem solutions hadoopadoop hadoop twitter pig analysis todo201108

More...

Upload Details

Uploaded via SlideShare as Apple Keynote

Usage Rights

© All Rights Reserved

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate. If needed, use the feedback form to let us know more details.

Cancel

Embed Views

http://www.blog.c 1074
http://www.slideshare.net 306
http://paper.li 33
http://www.adambreckler.com 22
http://it.gilbird.com 21
http://wiki.weinter.co.kr 14
http://juepner-bpm.com 11
http://marcelmaatkamp.blogspot.com 10
http://blog.escalabilidad.net 8
http://obl8.com:8000 8
http://blog.iband.kr 8
http://www.iweb34.com 7
http://jisi.dreamblog.jp 6
http://devpub.kr 6
http://dineshoi.blogspot.com 3
http://static.slidesharecdn.com 3
http://10.150.200.76 2
http://hadoopbrasil.com 2
http://irr.posterous.com 2
http://twitter.com 2
http://a0.twimg.com 2
https://myuni.adelaide.edu.au 1
http://webcache.googleusercontent.com 1

More...

Statistics

Favorites
191
Downloads
1,851
Comments
17
Embed Views
1,552
Views on SlideShare
73,893
Total Views
75,445

110 of 17 previous next Post a comment

  • nishantsonar nishantsonar Twitter on hadoop 2 months ago Reply
    Are you sure you want to Yes No
  • JinhwanLee Jinhwan Lee dd 2 months ago Reply
    Are you sure you want to Yes No
  • JohnsonRuham Johnson Ruham Excellent slides! Thanks for sharing.
    http://inetworkingskills.blogspot.com/
    8 months ago Reply
    Are you sure you want to Yes No
  • ShyamVelupula Shyam Velupula PDF version should have provided for downloading 1 year ago Reply
    Are you sure you want to Yes No
  • HellonoNo Hellono No learn 1 year ago Reply
    Are you sure you want to Yes No
  • tygarvin tygarvin Hey Kevin,

    I am doing a report of my work with Hadoop in a high performance computing environment for a national laboratory and was wondering if I could use some of your slides and photos in my report?

    If you could get back to me that would be incredible. Thanks in advance.
    Tyler Garvin
    1 year ago Reply
    Are you sure you want to Yes No
  • TeishaFoglia Teisha Foglia , Engineer at TM LtD I think it is better you put some picture in there. looks more interesting and inviting

    Regards
    Teisha
    http://winkhealth.com
    http://financewink.com
    http://www.fakhriramley.com
    1 year ago Reply
    Are you sure you want to Yes No
  • xds2000 xds2000 Cool idea 2 years ago Reply
    Are you sure you want to Yes No
  • satishgaude satishgaude That was great. Cleared some of my doubts 2 years ago Reply
    Are you sure you want to Yes No
  • kevinweil Kevin Weil , Analytics Lead at Twitter Hitesh,

    We don't read or write data directly from/to MySQL with Pig. The interfaces for load/store aren't quite right yet, although they're getting better (follow along in realtime at https://issues.apache.org/jira/browse/PIG-966). Rather, we export in batch from mysql into hdfs at regular intervals, which is good because then non-pig jobs can access this data too. And we have a little wrapper around pig that can optionally push the results of pig jobs back into databases etc after the job has run. This is something we plan on open sourcing once it hardens a bit.
    2 years ago Reply
    Are you sure you want to Yes No
Post Comment
Edit your comment Cancel

Hadoop, Pig, and Twitter (NoSQL East 2009) — Presentation Transcript