2. Plan
• Performance Optimization tips
• Naive Bays Classifier and Sentiment Analysis
• Pig Intro
• Pig Data Operations
• Pig user defined functions (UDFs)
• Pig ETL Features
• Exercise –
– MapReduce - Programming with Python using Streaming for
Sentiment Analysis of IMDB Movie Review using a Naive Bays
Classifier
– Pig – Apache Server Log analysis with Pig