Azkaban and Pig at LinkedIn
 

Description of using Pig with the Azkaban workflow scheduler for Hadoop

    Presentation Transcript

    • Azkaban and Pig
      Richard Park, Russell Jurney
      LinkedIn Search, Network, Analytics
    • Installing & Running Azkaban
      wget http://github.com/downloads/azkaban/azkaban/azkaban-0.04.tar.gz
      tar -xvzf azkaban-0.04.tar.gz
      mkdir /some-dir/azkaban-jobs
      cd azkaban-0.04
      bin/azkaban-server.sh --job-dir /some-dir/azkaban-jobs
    • Azkaban @ localhost:8080
    • Pig Configuration
      myproject.properties – Global Configuration
      hadoop.job.ugi=rjurney,hadoop
      udf.import.list=org.apache.pig.builtin.,com.linkedin.pig.,com.linkedin…
      cc_0_compute_title_counts.job – Pig Job
      type=pig
      pig.script=cc_0_compute_title_counts.pig
      cc_1_reverse_engineer_durak_3000.job – Pig Job with Dependency
      type=pig
      pig.script=cc_1_reverse_engineer_durak_3000.pig
      dependencies=cc_0_compute_title_counts
    • Running Job
      > bin/run-jobs.sh --job-dir /some-dir/azkaban-jobs my-job
    • Scheduling Jobs
    • Viewing Jobs
    • Editing Jobs
    • Azkaban Pig Docs
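
The Pig Configuration slide chains two jobs through the `dependencies` key. The sketch below illustrates how such a key implies a run order. It is not Azkaban's own code: the helpers `parse_job` and `run_order` are hypothetical names, the job files are inlined as strings rather than read from the job directory, and treating `dependencies` as a comma-separated list is an assumption made here.

```python
from graphlib import TopologicalSorter  # standard library, Python 3.9+


def parse_job(text):
    """Parse the key=value lines of a .job file into a dict (hypothetical helper)."""
    props = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comments
        key, _, value = line.partition("=")
        props[key.strip()] = value.strip()
    return props


def run_order(jobs):
    """Return job names ordered so that every job follows its dependencies."""
    graph = {}
    for name, text in jobs.items():
        # Assumption: several dependencies would be separated by commas.
        deps = parse_job(text).get("dependencies", "")
        graph[name] = {d.strip() for d in deps.split(",") if d.strip()}
    # TopologicalSorter takes {node: predecessors} and yields predecessors first.
    return list(TopologicalSorter(graph).static_order())


# The two Pig jobs from the slide, keyed by job name (file name without .job).
jobs = {
    "cc_1_reverse_engineer_durak_3000": (
        "type=pig\n"
        "pig.script=cc_1_reverse_engineer_durak_3000.pig\n"
        "dependencies=cc_0_compute_title_counts\n"
    ),
    "cc_0_compute_title_counts": (
        "type=pig\n"
        "pig.script=cc_0_compute_title_counts.pig\n"
    ),
}

order = run_order(jobs)
print(order)
# ['cc_0_compute_title_counts', 'cc_1_reverse_engineer_durak_3000']
```

In this sketch, `cc_0_compute_title_counts` comes first because `cc_1_reverse_engineer_durak_3000` names it as a dependency. A cycle between jobs would make `static_order()` raise `graphlib.CycleError`.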