SlideShare a Scribd company logo
1 of 10
What Pig does ?
Extensible (Pig Latin)
UDFs (User Defined Functions)
Local mode
Map reduce mode (requires a
hadoop cluster) - Default
Example

File: /etc/passwd
PIG LATIN
Input / output

Debugging

•LOAD
•STORE

•DUMP
•DESCRIBLE
•EXPLAIN
•ILLUSTRATE

Working with Data
•FILTER
•FOREACH
•GROUP
•UNION
•SPLIT
slideshare.net/jefersonm

twitter.com/jefersonm

github.com/jefersonm
slideshare.net/jefersonm

twitter.com/jefersonm

github.com/jefersonm

More Related Content

Viewers also liked

Management 3.0, como evoluir pessoas em conjunto com sua organização.
Management 3.0, como evoluir pessoas em conjunto com sua organização.Management 3.0, como evoluir pessoas em conjunto com sua organização.
Management 3.0, como evoluir pessoas em conjunto com sua organização.
Jéferson Machado
 
Distributed Data Analysis with Hadoop and R - Strangeloop 2011
Distributed Data Analysis with Hadoop and R - Strangeloop 2011Distributed Data Analysis with Hadoop and R - Strangeloop 2011
Distributed Data Analysis with Hadoop and R - Strangeloop 2011
Jonathan Seidman
 
Agile Restropective
Agile RestropectiveAgile Restropective
Agile Restropective
Marcos Alves
 

Viewers also liked (20)

Business model generation
Business model generationBusiness model generation
Business model generation
 
Agile Coaching P2
Agile Coaching P2Agile Coaching P2
Agile Coaching P2
 
1 jeferson (grow)
1 jeferson (grow)1 jeferson (grow)
1 jeferson (grow)
 
GROW
GROWGROW
GROW
 
AngularJS
AngularJSAngularJS
AngularJS
 
Management 3.0
Management 3.0Management 3.0
Management 3.0
 
Python - basics
Python - basicsPython - basics
Python - basics
 
Management 3.0, como evoluir pessoas em conjunto com sua organização.
Management 3.0, como evoluir pessoas em conjunto com sua organização.Management 3.0, como evoluir pessoas em conjunto com sua organização.
Management 3.0, como evoluir pessoas em conjunto com sua organização.
 
Apache Spark
Apache SparkApache Spark
Apache Spark
 
Distributed Data Analysis with Hadoop and R - Strangeloop 2011
Distributed Data Analysis with Hadoop and R - Strangeloop 2011Distributed Data Analysis with Hadoop and R - Strangeloop 2011
Distributed Data Analysis with Hadoop and R - Strangeloop 2011
 
Agile Restropective
Agile RestropectiveAgile Restropective
Agile Restropective
 
Lean & T.O.C
Lean & T.O.CLean & T.O.C
Lean & T.O.C
 
Responsive Design
Responsive DesignResponsive Design
Responsive Design
 
Apache pig
Apache pigApache pig
Apache pig
 
Apache pig
Apache pigApache pig
Apache pig
 
Plano de carreira, isso funciona ? Me consegue uma bússola por favor. (Agile...
Plano de carreira, isso funciona ? Me consegue uma bússola por favor. (Agile...Plano de carreira, isso funciona ? Me consegue uma bússola por favor. (Agile...
Plano de carreira, isso funciona ? Me consegue uma bússola por favor. (Agile...
 
Metodologias Ageis
Metodologias AgeisMetodologias Ageis
Metodologias Ageis
 
Coaching e Autodesenvolvimento: Transformando a Cultura de Desenvolvimento de...
Coaching e Autodesenvolvimento: Transformando a Cultura de Desenvolvimento de...Coaching e Autodesenvolvimento: Transformando a Cultura de Desenvolvimento de...
Coaching e Autodesenvolvimento: Transformando a Cultura de Desenvolvimento de...
 
How to innovate ?
How to innovate ?How to innovate ?
How to innovate ?
 
Kanban metrics
Kanban metricsKanban metrics
Kanban metrics
 

More from Jéferson Machado

Haskell scala exercises presentation
Haskell scala exercises presentationHaskell scala exercises presentation
Haskell scala exercises presentation
Jéferson Machado
 

More from Jéferson Machado (18)

druid.io
druid.iodruid.io
druid.io
 
Apache Spark Internals - Part 2
Apache Spark Internals - Part 2Apache Spark Internals - Part 2
Apache Spark Internals - Part 2
 
Node.js, is it the solution for every problem?
Node.js, is it the solution for every problem?Node.js, is it the solution for every problem?
Node.js, is it the solution for every problem?
 
Management 3.0 (TDC 2015)
Management 3.0 (TDC 2015)Management 3.0 (TDC 2015)
Management 3.0 (TDC 2015)
 
Apache HBase
Apache HBaseApache HBase
Apache HBase
 
Scala
ScalaScala
Scala
 
Theory of constraints
Theory of constraintsTheory of constraints
Theory of constraints
 
Spring MVC
Spring MVCSpring MVC
Spring MVC
 
Continuous integration
Continuous integrationContinuous integration
Continuous integration
 
Webpy
WebpyWebpy
Webpy
 
Gradle
GradleGradle
Gradle
 
Padrino
PadrinoPadrino
Padrino
 
Haskell scala exercises presentation
Haskell scala exercises presentationHaskell scala exercises presentation
Haskell scala exercises presentation
 
RabbitMQ
RabbitMQRabbitMQ
RabbitMQ
 
Clojure presentation
Clojure presentationClojure presentation
Clojure presentation
 
Erjang
ErjangErjang
Erjang
 
Clojure noir
Clojure noirClojure noir
Clojure noir
 
Clojure LDC 5
Clojure LDC 5Clojure LDC 5
Clojure LDC 5
 

Recently uploaded

Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 

Recently uploaded (20)

Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024
 

Apache Pig