Successfully reported this slideshow.
We use your LinkedIn profile and activity data to personalize ads and to show you more relevant ads. You can change your ad preferences anytime.

Introduction to Apache Pig

1,629 views

Published on

This Introduction to Apache Pig covers:
1. Pig philosophy and architecture
2. Pig Latin and the Grunt shell
3. Loading data
4. Data types and schemas
5. Pig Latin details: structure, functions, expressions, relational operators
6. User Defined Functions
7. Resources

Introduction to Apache Pig

  1. 1. @avkashchauhanhttp://www.linkedin.com/in/avkashchauhan
  2. 2. http://pig.apache.org/philosophy.html
  3. 3. http://www.slideshare.net/mortardata/mongodb-pig-on-hadoophttp://www.slideshare.net/jeromatron/pig-with-cassandra-adventures-in-analytics
  4. 4. http://search.maven.org/#search%7Cga%7C1%7Cg%3A%22org.apache.pig%22http://pig.apache.org/docs/r0.11.0/basic.html
  5. 5. Pig Version #pig -version
  6. 6. {1,{1,2,3}}
  7. 7. http://pig.apache.org/docs/r0.11.0/basic.html#Relational+Operators
  8. 8. http://pig.apache.org/docs/r0.11.0/func.html
  9. 9. AVGCONCATCOUNTCOUNT_STARDIFFIsEmptyMAXMINSIZESUMTOKENIZE http://pig.apache.org/docs/r0.11.0/func.html#eval-functions
  10. 10. ABS FLOORACOS LOG LOG10ASIN RANDOMATAN ROUNDCBRT SINCEIL SINHCOS SQRTCOSH TAN TANHEXP http://pig.apache.org/docs/r0.11.0/func.html#math-functions
  11. 11. http://pig.apache.org/docs/r0.11.0/udf.html
  12. 12. UDF Category Function NameLoad UDF Functions  PigStorage  HBaseStorage  TextLoaderStore UDF Functions  PigStorage  HBaseStorageEvaluation Functions ABS ROUND EXP LOG SUM SIZE ACOS ASIN ATAN RANDOMFilter Functions  IsEmpty
  13. 13. http://pig.apache.org/docs/r0.7.0/udf.html#Schema
  14. 14. http://sivaanalytics.wordpress.com/2013/03/14/fundamentals-of-pig-exploring-more-on-schema-and-data-models/
  15. 15. http://developer.yahoo.com/hadoop/tutorial/pigtutorial.htmlhttp://stackoverflow.com/questions/4968843/how-do-i-store-gzipped-files-using-pigstorage-in-apache-pig

×