HADOOP BIG DATA Training from ZaranTech
 

HADOOP Training Course: Click here, http://www.zarantech.com/course-list/...
Contact: 515-978-9788 (or) peter@zarantech.com

BASIC HADOOP
1. Introduction and Overview of Hadoop
2. Hadoop Distributed FileSystem (HDFS)
3. HBase -- The Hadoop Database
4. Map/Reduce 2.0/YARN
5. MapReduce Workflows
6. Pig
7. Hive
8. Putting it all together

ADVANCED HADOOP
1. Integrating Hadoop Into The Workflow
2. Delving Deeper Into The Hadoop API
3. Common Map Reduce Algorithms
4. Using Hive and Pig
5. Practical Development Tips and Techniques
6. More Advanced Map Reduce Programming
7. Joining Data Sets in Map Reduce
8. Graph Manipulation in Hadoop
9. Creating Workflows With Oozie
10. Hands-On Exercise

About the Trainer:

1. 14 years of experience in consulting, training, and mentoring participants on design, infrastructure, and integration aspects.
2. Has trained more than 5,000 participants in Java, J2EE, Android, and BPM, and always looks forward to sharing his knowledge of the IT domain.
3. Has travelled extensively and mentored participants at organizations in several countries, including RBC [Luxembourg], Motorola [Germany], PayPal [Dublin], GVT [Brazil], Virtusa [Sri Lanka], Damac [Dubai], Rogers Telecom [Canada], and D&B, HBO, Micron, EMC, e-Rewards, and Maximus [USA].
4. Has assisted and provided consulting to ADP, Diebold, Level 3 Communications, e-Rewards, Southwest Airlines, and other corporations on their process requirements in the area of BPM.
5. Has served on the code review panel for multiple organizations' product development efforts and has brainstormed multiple new ideas that have turned into reality.
6. Was part of the core initial team exploring HDInsight [Hadoop on Windows] for Microsoft India Development Center and has mentored multiple batches of developers, project managers, and development testers.
7. Has mentored participants at J P Morgan, TCS, HCL, and Accenture in Hadoop and its ecosystem components such as Hive, HBase, Pig, and Sqoop. Has also helped these organizations set up their initial Hadoop teams.

Check out what our past trainees are saying:
Recommendations on Facebook - Click here, http://www.facebook.com/ZaranTechLLC
Testimonials on our website -- Click here, http://www.zarantech.com/testimonials

Refer your friends to ZaranTech for their training and consulting needs and reward yourself with benefits, http://www.zarantech.com/be-a-friend-tell-a-friend/

Cannot attend LIVE sessions? We have another option for you: instructor-led VIDEO training. See this video for more info, http://www.youtube.com/watch?v=WmWqzGFPqck

  • Objective:
    Participants will learn how to install a Hadoop cluster, understand the basic and advanced concepts of Map Reduce, and learn the best practices for Apache Hadoop development as experienced by the developers and architects of core Apache Hadoop. They will also learn the following during the course:
    1. Hadoop Ecosystem
    2. Best programming practices for Map Reduce
    3. System administration issues with other Hadoop projects such as Hive, Pig, and Sqoop
    4. Configuring the Map Reduce environment with the Eclipse IDE
    5. Advanced Map Reduce algorithms and techniques
    6. Pig and Hive introduction
    Note: The course will be 40% theoretical discussion and 60% hands-on work.

    Take Away from this Course:
    1. Understanding of the what and why of Hadoop and its ecosystem components.
    2. Ability to write Map Reduce programs for a given scenario.
    3. Ability to correctly architect and implement best practices in Hadoop development.
    4. Ability to manage and monitor Hadoop.
    5. Ability to manage the different Hadoop components when they talk to each other.

    Audience: This course is designed for anyone who is
    1. Wanting to architect a project using Hadoop and its ecosystem components.
    2. Wanting to develop Map Reduce programs.
    3. A Business Analyst or Data Warehousing person looking at an alternative approach to data analysis and storage.

    Pre-Requisites:
    1. Participants should have at least basic knowledge of Java.
    2. Any experience with a Linux environment will be very helpful.

Document Transcript

  • HADOOP Professional Competency Development Program
    We focus on delivering Role-Specific training rather than Product based Training

    HADOOP Professional Role Based Training
    Course Duration: 30-35 hours + Live Case Studies
    Timings: Weekdays & Weekends (after work hrs)
    Mode of Training: Online

    How Are We Different?
      • We don't just teach HADOOP concepts; we share our real-time implementation experience to get the audience ready to face customers and implement solutions.
      • We don't just produce HADOOP developers; we make them complete, full-fledged HADOOP consultants by training them on development, administration, and application design with project-based real-time scenarios and several case studies for practice.
      • Our faculty are not just technical developers or trainers; they are industry experts and consultants for Fortune 500 companies who understand the business and know how technology can be closely connected with people and business.
      • Our Role-Specific training differs from that of any other training company in the world.

    Benefits:
      • Quality Course Material & E-books
      • 24 x 7 Online access to trainers for Doubts Clarification
      • Video Recordings of sessions provided
      • Resume Preparation Guidance
      • Mock Interviews from Professional Consultants
      • HADOOP Certification Guidance
      • Marketing one-on-one with a Recruiter
      • Real-time Project Documents
      • Onsite Job assistance for 1 month
      • Special Project training programs for trained F1 students on OPT or CPT

    Training Highlights:
      • Focus on Hands on training
      • 35 hrs. + 30 Assignments, 2 Live Case Studies
      • Project based training with hands on exp.
      • Demonstration of Concepts using tools like Eclipse, Tomcat Server and MySQL
      • One Problem Statement discussed across the Core Java, Servlets, JSP, EJB, Struts, Hibernate
      • Resume prep, Interview Questions provided
      • SOA Fundamentals and Products covered
      • Cloud Computing for JAVA developers
      • Introduction to HADOOP and BIG DATA

    Training Materials: All attendees would receive
      • Training presentation of each session
      • Source Code for examples covered

    Training Format: This course is delivered as a highly interactive session, with extensive live examples. This course is delivered online using Web and Audio Conferencing.

    What will you learn?
    The J2EE/JEE Training uses best practices and guidelines from Java Community Process (JCP®).

    ZaranTech LLC., http://www.zarantech.com, info@zarantech.com, (515) 309-7846
    5550 Wild Rose Lane, Suite 400, West Des Moines IA 50266
  • In this training, attendees learn:

    BASIC HADOOP
    1. Introduction and Overview of Hadoop
    2. Hadoop Distributed FileSystem (HDFS)
    3. HBase – The Hadoop Database
    4. Map/Reduce 2.0/YARN
    5. MapReduce Workflows
    6. Pig
    7. Hive
    8. Putting it all together

    ADVANCED HADOOP
    1. Integrating Hadoop Into The Workflow
    2. Delving Deeper Into The Hadoop API
    3. Common Map Reduce Algorithms
    4. Using Hive and Pig
    5. Practical Development Tips and Techniques
    6. More Advanced Map Reduce Programming
    7. Joining Data Sets in Map Reduce
    8. Graph Manipulation in Hadoop
    9. Creating Workflows With Oozie
    10. Hands-On Exercise

    Attendees also learn:
    1. Resume Preparation Guidelines and Tips
    2. Mock Interviews and Interview Preparation Tips

    Audience
    This course is designed for anyone who is
    1. Wanting to architect a project using Hadoop and its Eco System components.
    2. Wanting to develop Map Reduce programs
    3. A Business Analyst or Data Warehousing person looking at an alternative approach to data analysis and storage.
  • Pre-Requisites
    1. The participants should have at least basic knowledge of Java.
    2. Any experience of Linux environment will be very helpful.

    HADOOP ROLE BASED TRAINING PROGRAM ROADMAP
      • What is Big Data & Why Hadoop?
      • Hadoop Overview & its Ecosystem
      • Map Reduce Anatomy
      • Developing Map Reduce Programs
      • Advanced Tips & Techniques
      • Monitoring & Management of Hadoop
      • Sqoop
      • Deploying Hadoop on Cloud
      • HDFS – Hadoop Distributed File System
      • Advanced Map Reduce Algorithms
      • Using Hive & Pig
      • Hadoop Best Practices and Use Cases
      • HBase

    HADOOP Role Based Training Program Enhancements
      • 40 plus Assignments
      • Case Studies
      • Certification Guidance
      • Resumes / Interview Guidance
      • Mock Interviews (project & technical)
  • Topics covered:

    BASIC HADOOP

    Introduction and Overview of Hadoop
      • What is Hadoop?
      • History of Hadoop.
      • Building Blocks - Hadoop Eco-System.
      • Who is behind Hadoop?
      • What Hadoop is good for and what it is not?

    Hadoop Distributed File System (HDFS)
      • HDFS Overview and Architecture
      • HDFS Installation
      • HDFS Use Cases
      • Hadoop File System Shell
      • File System Java API
      • Hadoop Configuration

    HBase - The Hadoop Database
      • HBase Overview and Architecture
      • HBase Installation
      • HBase Shell
      • Java Client API
      • Java Administrative API
      • Filters
      • Scan Caching and Batching
      • Key Design
      • Table Design

    Map/Reduce 2.0/YARN
      • Decomposing Problems into MapReduce Workflow
      • Using JobControl
      • Oozie Introduction and Architecture
      • Oozie Installation
      • Developing, deploying, and Executing Oozie Workflows
  • Pig
      • Pig Overview
      • Installation
      • Pig Latin
      • Developing Pig Scripts
      • Processing Big Data with Pig
      • Joining data-sets with Pig

    Hive
      • Hive Overview
      • Installation
      • Hive QL

    Putting it all together
      • Distributed installations
      • Best Practices

    ADVANCED HADOOP

    Integrating Hadoop Into The Workflow
      • Relational Database Management Systems
      • Storage Systems
      • Importing Data from RDBMSs With Sqoop
      • Hands-on exercise
      • Importing Real-Time Data with Flume
      • Accessing HDFS Using FuseDFS and Hoop

    Delving Deeper Into The Hadoop API
      • More about ToolRunner
      • Testing with MRUnit
      • Reducing Intermediate Data With Combiners
      • The configure and close methods for Map/Reduce Setup and Teardown
      • Writing Partitioners for Better Load Balancing
      • Hands-On Exercise
      • Directly Accessing HDFS
      • Using the Distributed Cache
      • Hands-On Exercise
  • Common MapReduce Algorithms
      • Sorting and Searching
      • Indexing
      • Machine Learning With Mahout
      • Term Frequency – Inverse Document Frequency
      • Word Co-Occurrence
      • Hands-On Exercise

    Using Hive and Pig
      • Hive Basics
      • Pig Basics
      • Hands-on exercise

    Practical Development Tips and Techniques
      • Debugging MapReduce Code
      • Using LocalJobRunner Mode For Easier Debugging
      • Retrieving Job Information with Counters
      • Logging
      • Splittable File Formats
      • Determining the Optimal Number of Reducers
      • Map-Only MapReduce Jobs
      • Hands-On Exercise

    More Advanced MapReduce Programming
      • Custom Writables and WritableComparables
      • Saving Binary Data using SequenceFiles and Avro Files
      • Creating InputFormats and OutputFormats
      • Hands-On Exercise

    Joining Data Sets in MapReduce
      • Map-Side Joins
      • The Secondary Sort
      • Reduce-Side Joins

    Graph Manipulation in Hadoop
      • Introduction to graph techniques
      • Representing graphs in Hadoop
      • Implementing a sample algorithm: Single Source Shortest Path
  • Creating Workflows With Oozie
      • The Motivation for Oozie
      • Oozie's Workflow Definition Format
      • Hands-On Exercise

    Interview Preparation
    1. Mock Interviews
    2. Interview Preparation Tips
    3. Sample Interview Questions
    4. How to clear an Interview

    Take Away from the Course
    1. Understanding of the what and why of Hadoop and its ecosystem components.
    2. Ability to write Map Reduce programs for a given scenario.
    3. Ability to correctly architect and implement best practices in Hadoop development.
    4. Ability to manage and monitor Hadoop.
    5. Ability to manage the different Hadoop components when they talk to each other.

    About the trainer
    1. 14 years of experience in consulting, training, and mentoring participants on design, infrastructure, and integration aspects.
    2. Has trained more than 5,000 participants in Java, J2EE, Android, and BPM, and always looks forward to sharing his knowledge of the IT domain.
    3. Has travelled extensively and mentored participants at organizations in several countries, including RBC [Luxembourg], Motorola [Germany], PayPal [Dublin], GVT [Brazil], Virtusa [Sri Lanka], Damac [Dubai], Rogers Telecom [Canada], and D&B, HBO, Micron, EMC, e-Rewards, and Maximus [USA].
    4. Has assisted and provided consulting to ADP, Diebold, Level 3 Communications, e-Rewards, Southwest Airlines, and other corporations on their process requirements in the area of BPM.
    5. Has served on the code review panel for multiple organizations' product development efforts and has brainstormed multiple new ideas that have turned into reality.
    6. Was part of the core initial team exploring HDInsight [Hadoop on Windows] for Microsoft India Development Center and has mentored multiple batches of developers, project managers, and development testers.
    7. Has mentored participants at J P Morgan, TCS, HCL, and Accenture in Hadoop and its ecosystem components such as Hive, HBase, Pig, and Sqoop. Has also helped these organizations set up their initial Hadoop teams.
  • CASE STUDY # 1 – "Healthcare System"

    Healthcare System Application:
    As the Product Manager for Inner Expressions, you are asked to provide one of your largest clients with additional features in the EMR (Electronic Medical Records Management) System. The client has requested an integrated Referral Management System that tracks patients from Primary Care into the Specialist departments. Appointments are created either by the Primary Care Physicians themselves or by other clinical staff such as Nurse Practitioners or Clinical Assistants.

    Each appointment must go through the appropriate checks, including whether the patient has active insurance with the client, whether the insurance program covers the patient's condition, the patient's preference for location and timings, and the availability of the Specialist doctor. Some appointments may have to be reviewed by the Specialists themselves before they can be approved. The administrator of the facility (hospital) must be able to choose, by appointment type, whether appointments are directly bookable by the Primary Care Staff or require review by the Specialist.

    The system should also allow the Primary Care Staff and Specialist departments to exchange notes and comments about a particular appointment. If the Specialist department requests tests or reports as mandatory for the appointment, the system must ensure that the patient has these available on the date of the appointment.

    The hospital sets about 300 appointments per day, and the system must support about 50 users at the same time. The existing EMR system is based on J2EE/JEE and a MySQL database system.

    Tasks:
      • Identify Features - Login Page, Customer Information, Facility Information, Appointments etc.
      • Develop Prototype – HTML, JavaScript, CSS
      • Implement Functionality – Core Java, Servlets, JSP, Struts, Hibernate
      • Deploy and Test Application – Tomcat Server, Eclipse
  • CASE STUDY # 2 – "Asset Management System"

    Asset Management Application:
    The asset management system keeps track of a number of assets that can be borrowed, their ownership, their availability, their current location, the current borrower, and the asset history. Assets include books, software, computers, and peripherals. Assets are entered in the database when acquired and deleted from the database when disposed of. An asset's availability is updated whenever it is borrowed or returned. When a borrower fails to return an asset on time, the asset management system sends a reminder to the borrower and informs the asset owner.

    The administrator enters new assets in the database, deletes obsolete ones, and updates any information related to assets. Borrowers search for assets in the database to determine their availability, and borrow and return assets. The asset owner loans assets to borrowers. Each system has exactly one administrator, one or more asset owners, and one or more borrowers. When referring to any of the above actors, we use the term "user". All users are known to the system by their name and their email address. The system may keep track of other attributes such as the owner's telephone number, title, address, and position in the organization.

    The system should support at least 200 borrowers and 2000 assets. The system should be extensible to other types of assets. The system should checkpoint the state of the database every day so that it can be recovered in case of data loss. Owners and the administrator are authenticated using a user/password combination. Actors interact with the system via a web browser capable of rendering HTML and HTTP without support for JavaScript and Java.

    The persistent storage is realized using a MySQL database. The business logic is realized using the J2EE/JEE runtime system.

    Subsystems:
      • a storage subsystem managing persistent data
      • a session subsystem controlling authentication, check out, and check in
      • a notification subsystem sending past due notices
      • an administration subsystem realizing administration requests
      • a user interface subsystem translating HTTP requests into requests to other subsystems and translating results into HTML pages

    Tasks:
      • Identify Features - Login Page, Customer Information, Asset Information etc.
      • Develop Prototype – HTML, JavaScript, CSS
      • Implement Functionality – Core Java, Servlets, JSP, Struts, Hibernate
      • Deploy and Test Application – Tomcat Server, Eclipse

    OTHER CASE STUDIES: Cruise Management System, Collegiate Social Networking, Sporting system