HADOOP COURSE CONTENT 
1. THE MOTIVATION FOR HADOOP
• Problems with traditional large-scale systems
• Requirement for a new approach
• Introducing Hadoop
2. HADOOP BASIC CONCEPTS
• The Hadoop project and Hadoop components
• The Hadoop Distributed File System (HDFS)
• Hands-on exercise using HDFS
• How MapReduce works
• Hands-on exercise running a MapReduce job
• How a Hadoop cluster operates
• Other Hadoop ecosystem projects
3. WRITING A MAPREDUCE PROGRAM
• The MapReduce flow
• Basic MapReduce API concepts
• Writing MapReduce drivers, mappers and reducers in Java
• Writing mappers and reducers in other languages using the Streaming API
• Speeding up Hadoop development by using Eclipse
• Hands-on exercise writing a MapReduce program
• Differences between the old and new MapReduce APIs
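As an illustration of the Streaming API topic above, here is a minimal word-count sketch in Python. It assumes the usual streaming convention of tab-separated key/value records, with the reducer receiving its input sorted by key. The function names (`mapper`, `reducer`, `run_local`) are hypothetical, and `run_local` only imitates the sort between the two phases in a single process; it does not submit anything to a cluster.

```python
import itertools
from typing import Iterable, Iterator, List


def mapper(lines: Iterable[str]) -> Iterator[str]:
    """Emit one 'word<TAB>1' record per word, as a streaming mapper would."""
    for line in lines:
        for word in line.split():
            yield f"{word.lower()}\t1"


def reducer(sorted_records: Iterable[str]) -> Iterator[str]:
    """Sum the counts for each key. The input must already be sorted by key."""
    pairs = (record.split("\t", 1) for record in sorted_records)
    for word, group in itertools.groupby(pairs, key=lambda kv: kv[0]):
        total = sum(int(count) for _, count in group)
        yield f"{word}\t{total}"


def run_local(lines: Iterable[str]) -> List[str]:
    """Imitate map -> sort -> reduce in one process (assumption: local test only)."""
    return list(reducer(sorted(mapper(lines))))


if __name__ == "__main__":
    for record in run_local(["the quick brown fox", "the lazy dog"]):
        print(record)
```

In a real streaming job the mapper and reducer would typically be separate scripts reading standard input and writing standard output; keeping them as generator functions here makes them easy to test locally.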
4. UNIT TESTING MAPREDUCE PROGRAMS
• Unit testing
• The JUnit and MRUnit testing frameworks
• Writing unit tests with MRUnit
• Hands-on exercise writing unit tests with the MRUnit framework
5. DELVING DEEPER INTO THE HADOOP API
• Using the ToolRunner class
• Decreasing the amount of intermediate data with combiners
----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- 
INDIA Trainingicon USA 
Phone: +91-966-690-0051 Email: info@trainingicon.com | www.trainingicon.com Phone: +1-408-791-8864
• Hands-on exercise writing and implementing combiners
• Setting up and tearing down mappers and reducers by using the configure and close methods
• Writing custom partitioners for better load balancing
• Hands-on exercise on writing a partitioner
• Accessing HDFS programmatically
• Using the distributed cache
• Using the Hadoop API's library of mappers, reducers and partitioners
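The combiner and partitioner topics above can be sketched in plain Python. This is a single-process illustration, not Hadoop code: `combine` pre-sums counts on the map side to shrink the intermediate data, and `partition` assigns each key to a reducer. All names are hypothetical, and `zlib.crc32` is used only as a stand-in for a stable hash, since Python's built-in string hash varies between processes.

```python
import zlib
from collections import Counter, defaultdict
from typing import DefaultDict, Iterable, List, Tuple

Pair = Tuple[str, int]


def map_words(line: str) -> List[Pair]:
    """Map step: emit (word, 1) for every word in the line."""
    return [(word, 1) for word in line.split()]


def combine(pairs: Iterable[Pair]) -> List[Pair]:
    """Combiner: sum values per key before the shuffle.

    This is only safe because addition is associative and commutative.
    """
    totals: Counter = Counter()
    for key, value in pairs:
        totals[key] += value
    return sorted(totals.items())


def partition(key: str, num_reducers: int) -> int:
    """Partitioner: the same key always maps to the same reducer index."""
    return zlib.crc32(key.encode("utf-8")) % num_reducers


def shuffle(map_outputs: Iterable[Iterable[Pair]],
            num_reducers: int) -> List[DefaultDict[str, List[int]]]:
    """Route every pair to its reducer bucket and group values by key."""
    buckets: List[DefaultDict[str, List[int]]] = [
        defaultdict(list) for _ in range(num_reducers)
    ]
    for pairs in map_outputs:
        for key, value in pairs:
            buckets[partition(key, num_reducers)][key].append(value)
    return buckets


if __name__ == "__main__":
    combined = combine(map_words("a b a a b c"))
    print(combined)  # six raw pairs collapse to three
    print(shuffle([combined], 2))
```

A custom partitioner for load balancing would replace the hash in `partition` with logic that spreads known heavy keys across reducers.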
6. PRACTICAL DEVELOPMENT TIPS AND TECHNIQUES
• Strategies for debugging MapReduce code
• Testing MapReduce code locally by using LocalJobRunner
• Writing and viewing log files
• Retrieving job information with counters
• Determining the optimal number of reducers for a job
• Creating map-only MapReduce jobs
• Hands-on exercise using counters and a map-only job
7. DATA INPUT AND OUTPUT
• Creating custom Writable and WritableComparable implementations
• Saving binary data using SequenceFile and Avro data files
• Implementing custom InputFormats and OutputFormats
• Issues to consider when using file compression
• Hands-on exercises using SequenceFiles and file compression
8. COMMON MAPREDUCE ALGORITHMS
• Sorting and searching large data sets
• Performing a secondary sort
• Indexing data
• Hands-on exercise creating an inverted index
• Computing term frequency-inverse document frequency (TF-IDF)
• Calculating word co-occurrence
• Hands-on exercise calculating word co-occurrence
• Hands-on exercise implementing word co-occurrence with a custom WritableComparable
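To make the inverted index topic above concrete, here is a small single-process sketch in Python. The map step emits a (term, document ID) pair for each distinct term, and the reduce step collects the sorted posting list for each term. The function names and the sample document IDs are hypothetical, and the in-memory grouping only stands in for the shuffle that the framework would perform.

```python
from collections import defaultdict
from typing import Dict, Iterable, Iterator, List, Tuple


def map_document(doc_id: str, text: str) -> Iterator[Tuple[str, str]]:
    """Map step: emit (term, doc_id) once per distinct term in the document."""
    for term in sorted(set(text.lower().split())):
        yield term, doc_id


def reduce_postings(term: str, doc_ids: Iterable[str]) -> Tuple[str, List[str]]:
    """Reduce step: build the de-duplicated, sorted posting list for one term."""
    return term, sorted(set(doc_ids))


def build_inverted_index(docs: Dict[str, str]) -> Dict[str, List[str]]:
    """Run map, group by term (stand-in for the shuffle), then reduce."""
    grouped: Dict[str, List[str]] = defaultdict(list)
    for doc_id, text in docs.items():
        for term, emitted_id in map_document(doc_id, text):
            grouped[term].append(emitted_id)
    return dict(reduce_postings(term, ids) for term, ids in sorted(grouped.items()))


if __name__ == "__main__":
    sample = {"d1": "Hadoop stores data", "d2": "Hadoop processes data fast"}
    for term, postings in build_inverted_index(sample).items():
        print(term, postings)
```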
9. JOINING DATA SETS IN MAPREDUCE JOBS
• Writing a map-side join
• Writing a reduce-side join
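A reduce-side join, as named above, can be sketched as follows. Each mapper tags its records with the data set they came from and emits them under the join key; the reducer then pairs records from the two sources that share a key. The customer and order tables, the tags "C" and "O", and all function names are assumptions made for illustration. This is an inner join run in one process, not Hadoop code.

```python
from collections import defaultdict
from typing import Dict, Iterable, Iterator, List, Tuple

TaggedValue = Tuple[str, str]  # (source_tag, payload)


def map_customers(rows: Iterable[Tuple[str, str]]) -> Iterator[Tuple[str, TaggedValue]]:
    """Emit (customer_id, ("C", name)) for each customer row."""
    for customer_id, name in rows:
        yield customer_id, ("C", name)


def map_orders(rows: Iterable[Tuple[str, str]]) -> Iterator[Tuple[str, TaggedValue]]:
    """Emit (customer_id, ("O", order_id)) for each order row."""
    for order_id, customer_id in rows:
        yield customer_id, ("O", order_id)


def reduce_join(key: str, values: Iterable[TaggedValue]) -> Iterator[Tuple[str, str, str]]:
    """Pair every customer record with every order record for one key."""
    values = list(values)  # materialize, since the values are scanned twice
    names = [payload for tag, payload in values if tag == "C"]
    orders = [payload for tag, payload in values if tag == "O"]
    for name in names:
        for order_id in orders:
            yield key, name, order_id


def join(customers: Iterable[Tuple[str, str]],
         orders: Iterable[Tuple[str, str]]) -> List[Tuple[str, str, str]]:
    """Group tagged records by key (stand-in for the shuffle), then reduce."""
    grouped: Dict[str, List[TaggedValue]] = defaultdict(list)
    for key, tagged in list(map_customers(customers)) + list(map_orders(orders)):
        grouped[key].append(tagged)
    result: List[Tuple[str, str, str]] = []
    for key in sorted(grouped):
        result.extend(reduce_join(key, grouped[key]))
    return result


if __name__ == "__main__":
    print(join([("1", "Ada"), ("2", "Lin")], [("o1", "1"), ("o2", "1"), ("o3", "3")]))
```

Keys that appear in only one data set produce no output here; an outer join would need the reducer to emit unmatched records as well.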
10. INTEGRATING HADOOP INTO THE ENTERPRISE WORKFLOW
• Integrating Hadoop into an existing enterprise
• Loading data from an RDBMS into HDFS by using Sqoop
• Hands-on exercise importing data with Sqoop
• Managing real-time data using Flume
• Accessing HDFS from legacy systems with FuseDFS and HttpFS
11. MACHINE LEARNING AND MAHOUT
• Introduction to machine learning
• Using Mahout
• Hands-on exercise using a Mahout recommender
12. AN INTRODUCTION TO HIVE AND PIG
• The motivation for Hive and Pig
• Hive basics
• Hands-on exercise manipulating data with Hive
• Pig basics
• Hands-on exercise using Pig to retrieve movie names from our recommender
• Choosing between Hive and Pig
• Introduction to Oozie
• Creating Oozie workflows
• Hands-on exercise running an Oozie workflow
CONCLUSION 
APPENDIX: GRAPH PROCESSING IN MAPREDUCE; AN INTRODUCTION TO OOZIE
