SlideShare a Scribd company logo
1 of 22
Hadoop/Spark Webinar
Career Opportunities in Open Source Framework
Know your Trainer
www.edunextgen.com 2©2018 EduNextgen
Webinar Objective
What is Open Source Framework?
Open Source Framework Example
Difference from other types of Framework?
Importance of open source framework
What is Big Data?
Deep understanding of Hadoop and Spark framework
Job Analysis on Hadoop and Spark
www.edunextgen.com 3©2018 EduNextgen
What is Open Source Framework?
The term “Open Source" refers to something people can modify and share because its design is publicly accessible
Code Available publicly, so source code that anyone can inspect, modify, and enhance
www.edunextgen.com 4©2018 EduNextgen
Open Source Framework Example
www.edunextgen.com 5©2018 EduNextgen
Difference from other types of Framework
Open Source Framework:
Source code available publicly
Redistribute solutions
Can use in any way
Eliminates single point of failure
Democratic forum for action
No vendor lock-in
No guarantee for development will continue
Intellectual property (algorithms)
Support consistency
www.edunextgen.com 6
Proprietary Software:
Predictable releases
Entity to hold responsible for bugs, errors and updates
Consistent feature development
More stable framework
More consistent training options
Easier access to support
Single company releasing patches
Costs will be Higher for start-up
Vendor will have owns software
©2018 EduNextgen
Importance of open source framework
Control: People have more control. As per requirement can be change and modify as well. Experiment can be possible and
what ever suitable for the requirement can be implement.
Training: Open source code is publicly accessible, students can easily study it as they learn to make better software. It help to
develop the skill.
Security: More secure and stable, because anyone can view and modify, someone might spot and correct errors or omissions
that a program's original authors might have missed.
Stability: It can be possible to develop long-term projects. Because programmers publicly distribute the source code for open
source software, users relying on that software for critical tasks can be sure their tools won't disappear or fall into disrepair if
their original creators stop working on them.
www.edunextgen.com 7©2018 EduNextgen
What is Big Data?
Big Data is extremely large volume of data sets that may be trends or
associations, especially relating to human behavior and interactions
Big data is a term that describes the large volume (Terabytes or Petabytes) of
data – both structured and unstructured – that inundates a business on a day-
to-day basis
Big Data is a collection of huge amount of data set which is not possible to
handle in traditional way
Big Data include capturing data, data storage, data analysis, visualization,
querying etc.
www.edunextgen.com 8©2018 EduNextgen
Introduction to Hadoop
www.edunextgen.com 9
Hadoop is an Open-source Data Management Framework which
support, store and process big data
Hadoop is a part of apache project. It is being used by Google,
Yahoo, Facebook, Twitter, LinkedIn and many more
It allows the distributed processing of huge data across clusters
Hadoop was developed by Doug Cutting and Mike Cafarella in
year 2006
©2018 EduNextgen
Necessity of Hadoop
Make strategic, confident decisions based on solid data and advanced analytics
Gain valuable business insights that help you pinpoint weaknesses and discover
new opportunities
Earn higher profits by better understanding the business, processes and the
customers
Big Data flowing in at exponential rate
Increased number of Hadoop driven jobs
www.edunextgen.com 10©2018 EduNextgen
Hadoop Characteristics
www.edunextgen.com 11
Hadoop Is Easily Scalable
Hadoop Brings Flexibility In Data Processing
Hadoop Is Fault Tolerant
Hadoop Is Great At Faster Data Processing
Hadoop Ecosystem Is Robust:
Hadoop Is Very Cost Effective
©2018 EduNextgen
Introduction to Apache Spark
www.edunextgen.com 12
Apache Spark is an open-source cluster in-memory computing framework.
Apache Spark provides an interface for programming entire clusters with implicit data
parallelism and fault tolerance.
Apache Spark provides high-level APIs in Scala, Java, R and Python.
Spark is 100x faster then Hadoop MapReduce
Spark support streaming process for large dataset
Initial Release on May, 2014
Stable Version Release on July, 2017: v2.2.0 on
Spark is written in Scala, Java, Python
Operating System Support: Microsoft Windows, macOS, Linux
©2018 EduNextgen
Introduction to Apache Spark (Cont’d)
www.edunextgen.com 13©2018 EduNextgen
Why Spark?
Speed:
Spark is In-memory computations
It extends the MapReduce model and takes it to a whole other level
Spark is 100x times faster than Hadoop MapReduce
Generality:
Spark is able to handle wide range of workloads
Iterative algorithms
It provide interactive queries and streaming feature
Ease of use:
We have APIs as Scala, Python, Java in Spark
It contents libraries for ML, SQL, Streaming and Graph Processing
Spark runs on Hadoop clusters, Mesos Cassandra etc
www.edunextgen.com 14©2018 EduNextgen
Job Opportunities
www.edunextgen.com 15
After the U.S., India has the largest demand of analytics / big data / data science professionals. Amidst such
demand, people find themselves confused to select an appropriate job profile for the best future.
“A professional with working knowledge of data science and big data earns 8% more than with co-worker “
©2018 EduNextgen
Job Opportunities (Cont’d)
www.edunextgen.com 16
89% of hiring managers find it difficult to find talent
47% of employers are willing to pay for professional certifications, up from 33% in 2017
Positions they are looking for:
73% – Developers
60% – DevOps
53% – SysAdmins
Employers are seeking expertise in:
70% – Cloud
67% – Big Data
65% – Linux
©2018 EduNextgen
Job Opportunities (Cont’d)
www.edunextgen.com 17©2018 EduNextgen
Next Webinar: Execute your First Hive Project
What is Big Data?
Why do we need Big Data?
What is Hive?
Basic Hive Operations & Commands:
Create database, Show databases, Use, Create table, Show table, Describe, Data
Loading in Hive Table – From Local Filesystem, Inserting Data in Hive Table,
Select*
Retails domain project execution with the hive:
Use Case #1: Out of 20000 how many customers given product rating
Use Case #2: Find how many number of product available for below brand:
Puma
Regular
First Choice
Note: Show product details for "Puma“
www.edunextgen.com 18©2018 EduNextgen
Hadoop Kick-Starter Course
What is this course about?
Get insights into applications of Big Data and Hadoop along with learning about performing basic operation of HDFS,
MapReduce and Hive. The course is bundled with industry grade hands on assignments and project access provided
through VM environment to practice what you learn. A program to help you understand career path in Big Data and
available learning paths to advance your career options.
Duration: 6 Hrs.
Date: 20th & 21st January, 2018
Time: 07:30 PM to 10:30 PM
Price: ₹ 499
www.edunextgen.com 19©2018 EduNextgen
Participants will get access to
Course Content (LMS Access)
10+ Assignments
20+ Quizzes
Pre-Installed Hadoop environment (Plug and Play)
1 Project with 5 Use Cases
Hadoop Kick-Starter Course Curriculum
Day #1:
What is Big Data?
Why need More and More Data?
Big Data Characteristics:
Volume, Velocity, Variety, Veracity
Types of Data
Applications of Big Data
Industry who generate Big Data
Introduction to Hadoop
Why Hadoop?
Hadoop Ecosystem
YARN
www.edunextgen.com 20©2018 EduNextgen
Day #2:
Hive: Introduction
What is Hive and it’s Limitation?
Hive Architecture
Hive Components
Hive Data Types:
Primary Data Types
Complex Data Types
Various Hive Commands and Operations
Joins in Hive
Project Execution
Q&A
www.edunextgen.com 21©2018 EduNextgen
Career opportunities in open source framework

More Related Content

What's hot

Manipulating data with Talend. Learn how?
Manipulating data with Talend. Learn how?Manipulating data with Talend. Learn how?
Manipulating data with Talend. Learn how?Edureka!
 
Predicting Consumer Behaviour via Hadoop
Predicting Consumer Behaviour via HadoopPredicting Consumer Behaviour via Hadoop
Predicting Consumer Behaviour via HadoopSkillspeed
 
DataOps, DevOps and the Developer: Treating Database Code Just Like App Code
DataOps, DevOps and the Developer: Treating Database Code Just Like App CodeDataOps, DevOps and the Developer: Treating Database Code Just Like App Code
DataOps, DevOps and the Developer: Treating Database Code Just Like App CodeDevOps.com
 
Talk to me Goose: Going beyond your regular Chatbot
Talk to me Goose: Going beyond your regular ChatbotTalk to me Goose: Going beyond your regular Chatbot
Talk to me Goose: Going beyond your regular ChatbotLuc Bors
 
Counting Unique Users in Real-Time: Here's a Challenge for You!
Counting Unique Users in Real-Time: Here's a Challenge for You!Counting Unique Users in Real-Time: Here's a Challenge for You!
Counting Unique Users in Real-Time: Here's a Challenge for You!DataWorks Summit
 
Data Science: Driving Smarter Finance and Workforce Decsions for the Enterprise
Data Science: Driving Smarter Finance and Workforce Decsions for the EnterpriseData Science: Driving Smarter Finance and Workforce Decsions for the Enterprise
Data Science: Driving Smarter Finance and Workforce Decsions for the EnterpriseDataWorks Summit
 
CAG Corporate Dossier
CAG Corporate DossierCAG Corporate Dossier
CAG Corporate Dossiercagtorus
 
The Computer Architecture Group at the University of A Coruña, Spain
The Computer Architecture Group at the University of A Coruña, SpainThe Computer Architecture Group at the University of A Coruña, Spain
The Computer Architecture Group at the University of A Coruña, SpainKI sciencebiz
 
IoT: How Data Science Driven Software is Eating the Connected World
IoT: How Data Science Driven Software is Eating the Connected WorldIoT: How Data Science Driven Software is Eating the Connected World
IoT: How Data Science Driven Software is Eating the Connected WorldDataWorks Summit
 
Big Data and Enterprise Data - Oracle -1663869
Big Data and Enterprise Data - Oracle -1663869Big Data and Enterprise Data - Oracle -1663869
Big Data and Enterprise Data - Oracle -1663869Edgar Alejandro Villegas
 
Talend webinar
Talend webinarTalend webinar
Talend webinarEdureka!
 

What's hot (15)

Manipulating data with Talend. Learn how?
Manipulating data with Talend. Learn how?Manipulating data with Talend. Learn how?
Manipulating data with Talend. Learn how?
 
Predicting Consumer Behaviour via Hadoop
Predicting Consumer Behaviour via HadoopPredicting Consumer Behaviour via Hadoop
Predicting Consumer Behaviour via Hadoop
 
DataOps, DevOps and the Developer: Treating Database Code Just Like App Code
DataOps, DevOps and the Developer: Treating Database Code Just Like App CodeDataOps, DevOps and the Developer: Treating Database Code Just Like App Code
DataOps, DevOps and the Developer: Treating Database Code Just Like App Code
 
Talk to me Goose: Going beyond your regular Chatbot
Talk to me Goose: Going beyond your regular ChatbotTalk to me Goose: Going beyond your regular Chatbot
Talk to me Goose: Going beyond your regular Chatbot
 
Hareesh
HareeshHareesh
Hareesh
 
RESUME_N
RESUME_NRESUME_N
RESUME_N
 
Resume (1)
Resume (1)Resume (1)
Resume (1)
 
Counting Unique Users in Real-Time: Here's a Challenge for You!
Counting Unique Users in Real-Time: Here's a Challenge for You!Counting Unique Users in Real-Time: Here's a Challenge for You!
Counting Unique Users in Real-Time: Here's a Challenge for You!
 
Data Science: Driving Smarter Finance and Workforce Decsions for the Enterprise
Data Science: Driving Smarter Finance and Workforce Decsions for the EnterpriseData Science: Driving Smarter Finance and Workforce Decsions for the Enterprise
Data Science: Driving Smarter Finance and Workforce Decsions for the Enterprise
 
CAG Corporate Dossier
CAG Corporate DossierCAG Corporate Dossier
CAG Corporate Dossier
 
The Computer Architecture Group at the University of A Coruña, Spain
The Computer Architecture Group at the University of A Coruña, SpainThe Computer Architecture Group at the University of A Coruña, Spain
The Computer Architecture Group at the University of A Coruña, Spain
 
PGDay Brasilia 2017
PGDay Brasilia 2017PGDay Brasilia 2017
PGDay Brasilia 2017
 
IoT: How Data Science Driven Software is Eating the Connected World
IoT: How Data Science Driven Software is Eating the Connected WorldIoT: How Data Science Driven Software is Eating the Connected World
IoT: How Data Science Driven Software is Eating the Connected World
 
Big Data and Enterprise Data - Oracle -1663869
Big Data and Enterprise Data - Oracle -1663869Big Data and Enterprise Data - Oracle -1663869
Big Data and Enterprise Data - Oracle -1663869
 
Talend webinar
Talend webinarTalend webinar
Talend webinar
 

Similar to Career opportunities in open source framework

How to Become a Big Data Professional.pdf
How to Become a Big Data Professional.pdfHow to Become a Big Data Professional.pdf
How to Become a Big Data Professional.pdfCareervira
 
Top 10 renowned big data companies
Top 10 renowned big data companiesTop 10 renowned big data companies
Top 10 renowned big data companiesRobert Smith
 
Big Data with hadoop, Spark and BigQuery (Google cloud next Extended 2017 Kar...
Big Data with hadoop, Spark and BigQuery (Google cloud next Extended 2017 Kar...Big Data with hadoop, Spark and BigQuery (Google cloud next Extended 2017 Kar...
Big Data with hadoop, Spark and BigQuery (Google cloud next Extended 2017 Kar...Imam Raza
 
Big data courses
Big data coursesBig data courses
Big data coursespremav6
 
Transform Your Business with Big Data and Hortonworks
Transform Your Business with Big Data and Hortonworks Transform Your Business with Big Data and Hortonworks
Transform Your Business with Big Data and Hortonworks Pactera_US
 
All data accessible to all my organization - Presentation at OW2con'19, June...
 All data accessible to all my organization - Presentation at OW2con'19, June... All data accessible to all my organization - Presentation at OW2con'19, June...
All data accessible to all my organization - Presentation at OW2con'19, June...OW2
 
Execute your first hive project
Execute your first hive project Execute your first hive project
Execute your first hive project edunextgen
 
Transform You Business with Big Data and Hortonworks
Transform You Business with Big Data and HortonworksTransform You Business with Big Data and Hortonworks
Transform You Business with Big Data and HortonworksHortonworks
 
Big Data Companies and Apache Software
Big Data Companies and Apache SoftwareBig Data Companies and Apache Software
Big Data Companies and Apache SoftwareBob Marcus
 
Analysis-of-Major-Trends-in-big-data-analytics-slim-baltagi-hadoop-summit
Analysis-of-Major-Trends-in-big-data-analytics-slim-baltagi-hadoop-summitAnalysis-of-Major-Trends-in-big-data-analytics-slim-baltagi-hadoop-summit
Analysis-of-Major-Trends-in-big-data-analytics-slim-baltagi-hadoop-summitSlim Baltagi
 
Coding software and tools used for data science management - Phdassistance
Coding software and tools used for data science management - PhdassistanceCoding software and tools used for data science management - Phdassistance
Coding software and tools used for data science management - PhdassistancephdAssistance1
 
Scaling Data Science on Big Data
Scaling Data Science on Big DataScaling Data Science on Big Data
Scaling Data Science on Big DataDataWorks Summit
 

Similar to Career opportunities in open source framework (20)

sudipto_resume
sudipto_resumesudipto_resume
sudipto_resume
 
How to Become a Big Data Professional.pdf
How to Become a Big Data Professional.pdfHow to Become a Big Data Professional.pdf
How to Become a Big Data Professional.pdf
 
Top 10 renowned big data companies
Top 10 renowned big data companiesTop 10 renowned big data companies
Top 10 renowned big data companies
 
Big Data with hadoop, Spark and BigQuery (Google cloud next Extended 2017 Kar...
Big Data with hadoop, Spark and BigQuery (Google cloud next Extended 2017 Kar...Big Data with hadoop, Spark and BigQuery (Google cloud next Extended 2017 Kar...
Big Data with hadoop, Spark and BigQuery (Google cloud next Extended 2017 Kar...
 
Big data courses
Big data coursesBig data courses
Big data courses
 
Transform Your Business with Big Data and Hortonworks
Transform Your Business with Big Data and Hortonworks Transform Your Business with Big Data and Hortonworks
Transform Your Business with Big Data and Hortonworks
 
All data accessible to all my organization - Presentation at OW2con'19, June...
 All data accessible to all my organization - Presentation at OW2con'19, June... All data accessible to all my organization - Presentation at OW2con'19, June...
All data accessible to all my organization - Presentation at OW2con'19, June...
 
HimaBindu
HimaBinduHimaBindu
HimaBindu
 
Execute your first hive project
Execute your first hive project Execute your first hive project
Execute your first hive project
 
Transform You Business with Big Data and Hortonworks
Transform You Business with Big Data and HortonworksTransform You Business with Big Data and Hortonworks
Transform You Business with Big Data and Hortonworks
 
BigData_Krishna Kumar Sharma
BigData_Krishna Kumar SharmaBigData_Krishna Kumar Sharma
BigData_Krishna Kumar Sharma
 
CSB_community
CSB_communityCSB_community
CSB_community
 
Apresentação Hadoop
Apresentação HadoopApresentação Hadoop
Apresentação Hadoop
 
Big Data Companies and Apache Software
Big Data Companies and Apache SoftwareBig Data Companies and Apache Software
Big Data Companies and Apache Software
 
Analysis-of-Major-Trends-in-big-data-analytics-slim-baltagi-hadoop-summit
Analysis-of-Major-Trends-in-big-data-analytics-slim-baltagi-hadoop-summitAnalysis-of-Major-Trends-in-big-data-analytics-slim-baltagi-hadoop-summit
Analysis-of-Major-Trends-in-big-data-analytics-slim-baltagi-hadoop-summit
 
Analysis of Major Trends in Big Data Analytics
Analysis of Major Trends in Big Data AnalyticsAnalysis of Major Trends in Big Data Analytics
Analysis of Major Trends in Big Data Analytics
 
Analysis of Major Trends in Big Data Analytics
Analysis of Major Trends in Big Data AnalyticsAnalysis of Major Trends in Big Data Analytics
Analysis of Major Trends in Big Data Analytics
 
Coding software and tools used for data science management - Phdassistance
Coding software and tools used for data science management - PhdassistanceCoding software and tools used for data science management - Phdassistance
Coding software and tools used for data science management - Phdassistance
 
Scaling Data Science on Big Data
Scaling Data Science on Big DataScaling Data Science on Big Data
Scaling Data Science on Big Data
 
Resume_Karthick
Resume_KarthickResume_Karthick
Resume_Karthick
 

Recently uploaded

Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 
Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingNeil Barnes
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130Suhani Kapoor
 
Customer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxCustomer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxEmmanuel Dauda
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationshipsccctableauusergroup
 
Spark3's new memory model/management
Spark3's new memory model/managementSpark3's new memory model/management
Spark3's new memory model/managementakshesh doshi
 
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 
RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998YohFuh
 
Ukraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSUkraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSAishani27
 
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Jack DiGiovanna
 
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsappssapnasaifi408
 
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Sapana Sha
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPramod Kumar Srivastava
 
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...Suhani Kapoor
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfLars Albertsson
 
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiLow Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiSuhani Kapoor
 
Data Science Project: Advancements in Fetal Health Classification
Data Science Project: Advancements in Fetal Health ClassificationData Science Project: Advancements in Fetal Health Classification
Data Science Project: Advancements in Fetal Health ClassificationBoston Institute of Analytics
 
Data Science Jobs and Salaries Analysis.pptx
Data Science Jobs and Salaries Analysis.pptxData Science Jobs and Salaries Analysis.pptx
Data Science Jobs and Salaries Analysis.pptxFurkanTasci3
 

Recently uploaded (20)

Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data Storytelling
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
 
Customer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxCustomer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptx
 
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships
 
Spark3's new memory model/management
Spark3's new memory model/managementSpark3's new memory model/management
Spark3's new memory model/management
 
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998
 
Ukraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSUkraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICS
 
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
 
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
 
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
 
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdf
 
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiLow Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
 
Data Science Project: Advancements in Fetal Health Classification
Data Science Project: Advancements in Fetal Health ClassificationData Science Project: Advancements in Fetal Health Classification
Data Science Project: Advancements in Fetal Health Classification
 
Data Science Jobs and Salaries Analysis.pptx
Data Science Jobs and Salaries Analysis.pptxData Science Jobs and Salaries Analysis.pptx
Data Science Jobs and Salaries Analysis.pptx
 

Career opportunities in open source framework

  • 1. Hadoop/Spark Webinar Career Opportunities in Open Source Framework
  • 3. Webinar Objective What is Open Source Framework? Open Source Framework Example Difference from other types of Framework? Importance of open source framework What is Big Data? Deep understanding of Hadoop and Spark framework Job Analysis on Hadoop and Spark www.edunextgen.com 3©2018 EduNextgen
  • 4. What is Open Source Framework? The term “Open Source" refers to something people can modify and share because its design is publicly accessible Code Available publicly, so source code that anyone can inspect, modify, and enhance www.edunextgen.com 4©2018 EduNextgen
  • 5. Open Source Framework Example www.edunextgen.com 5©2018 EduNextgen
  • 6. Difference from other types of Framework Open Source Framework: Source code available publicly Redistribute solutions Can use in any way Eliminates single point of failure Democratic forum for action No vendor lock-in No guarantee for development will continue Intellectual property (algorithms) Support consistency www.edunextgen.com 6 Proprietary Software: Predictable releases Entity to hold responsible for bugs, errors and updates Consistent feature development More stable framework More consistent training options Easier access to support Single company releasing patches Costs will be Higher for start-up Vendor will have owns software ©2018 EduNextgen
  • 7. Importance of open source framework Control: People have more control. As per requirement can be change and modify as well. Experiment can be possible and what ever suitable for the requirement can be implement. Training: Open source code is publicly accessible, students can easily study it as they learn to make better software. It help to develop the skill. Security: More secure and stable, because anyone can view and modify, someone might spot and correct errors or omissions that a program's original authors might have missed. Stability: It can be possible to develop long-term projects. Because programmers publicly distribute the source code for open source software, users relying on that software for critical tasks can be sure their tools won't disappear or fall into disrepair if their original creators stop working on them. www.edunextgen.com 7©2018 EduNextgen
  • 8. What is Big Data? Big Data is extremely large volume of data sets that may be trends or associations, especially relating to human behavior and interactions Big data is a term that describes the large volume (Terabytes or Petabytes) of data – both structured and unstructured – that inundates a business on a day- to-day basis Big Data is a collection of huge amount of data set which is not possible to handle in traditional way Big Data include capturing data, data storage, data analysis, visualization, querying etc. www.edunextgen.com 8©2018 EduNextgen
  • 9. Introduction to Hadoop www.edunextgen.com 9 Hadoop is an Open-source Data Management Framework which support, store and process big data Hadoop is a part of apache project. It is being used by Google, Yahoo, Facebook, Twitter, LinkedIn and many more It allows the distributed processing of huge data across clusters Hadoop was developed by Doug Cutting and Mike Cafarella in year 2006 ©2018 EduNextgen
  • 10. Necessity of Hadoop Make strategic, confident decisions based on solid data and advanced analytics Gain valuable business insights that help you pinpoint weaknesses and discover new opportunities Earn higher profits by better understanding the business, processes and the customers Big Data flowing in at exponential rate Increased number of Hadoop driven jobs www.edunextgen.com 10©2018 EduNextgen
  • 11. Hadoop Characteristics www.edunextgen.com 11 Hadoop Is Easily Scalable Hadoop Brings Flexibility In Data Processing Hadoop Is Fault Tolerant Hadoop Is Great At Faster Data Processing Hadoop Ecosystem Is Robust: Hadoop Is Very Cost Effective ©2018 EduNextgen
  • 12. Introduction to Apache Spark www.edunextgen.com 12 Apache Spark is an open-source cluster in-memory computing framework. Apache Spark provides an interface for programming entire clusters with implicit data parallelism and fault tolerance. Apache Spark provides high-level APIs in Scala, Java, R and Python. Spark is 100x faster then Hadoop MapReduce Spark support streaming process for large dataset Initial Release on May, 2014 Stable Version Release on July, 2017: v2.2.0 on Spark is written in Scala, Java, Python Operating System Support: Microsoft Windows, macOS, Linux ©2018 EduNextgen
  • 13. Introduction to Apache Spark (Cont’d) www.edunextgen.com 13©2018 EduNextgen
  • 14. Why Spark? Speed: Spark is In-memory computations It extends the MapReduce model and takes it to a whole other level Spark is 100x times faster than Hadoop MapReduce Generality: Spark is able to handle wide range of workloads Iterative algorithms It provide interactive queries and streaming feature Ease of use: We have APIs as Scala, Python, Java in Spark It contents libraries for ML, SQL, Streaming and Graph Processing Spark runs on Hadoop clusters, Mesos Cassandra etc www.edunextgen.com 14©2018 EduNextgen
  • 15. Job Opportunities www.edunextgen.com 15 After the U.S., India has the largest demand of analytics / big data / data science professionals. Amidst such demand, people find themselves confused to select an appropriate job profile for the best future. “A professional with working knowledge of data science and big data earns 8% more than with co-worker “ ©2018 EduNextgen
  • 16. Job Opportunities (Cont’d) www.edunextgen.com 16 89% of hiring managers find it difficult to find talent 47% of employers are willing to pay for professional certifications, up from 33% in 2017 Positions they are looking for: 73% – Developers 60% – DevOps 53% – SysAdmins Employers are seeking expertise in: 70% – Cloud 67% – Big Data 65% – Linux ©2018 EduNextgen
  • 18. Next Webinar: Execute your First Hive Project What is Big Data? Why do we need Big Data? What is Hive? Basic Hive Operations & Commands: Create database, Show databases, Use, Create table, Show table, Describe, Data Loading in Hive Table – From Local Filesystem, Inserting Data in Hive Table, Select* Retails domain project execution with the hive: Use Case #1: Out of 20000 how many customers given product rating Use Case #2: Find how many number of product available for below brand: Puma Regular First Choice Note: Show product details for "Puma“ www.edunextgen.com 18©2018 EduNextgen
  • 19. Hadoop Kick-Starter Course What is this course about? Get insights into applications of Big Data and Hadoop along with learning about performing basic operation of HDFS, MapReduce and Hive. The course is bundled with industry grade hands on assignments and project access provided through VM environment to practice what you learn. A program to help you understand career path in Big Data and available learning paths to advance your career options. Duration: 6 Hrs. Date: 20th & 21st January, 2018 Time: 07:30 PM to 10:30 PM Price: ₹ 499 www.edunextgen.com 19©2018 EduNextgen Participants will get access to Course Content (LMS Access) 10+ Assignments 20+ Quizzes Pre-Installed Hadoop environment (Plug and Play) 1 Project with 5 Use Cases
  • 20. Hadoop Kick-Starter Course Curriculum Day #1: What is Big Data? Why need More and More Data? Big Data Characteristics: Volume, Velocity, Variety, Veracity Types of Data Applications of Big Data Industry who generate Big Data Introduction to Hadoop Why Hadoop? Hadoop Ecosystem YARN www.edunextgen.com 20©2018 EduNextgen Day #2: Hive: Introduction What is Hive and it’s Limitation? Hive Architecture Hive Components Hive Data Types: Primary Data Types Complex Data Types Various Hive Commands and Operations Joins in Hive Project Execution

Editor's Notes

  1. 1. Hadoop Brings Flexibility In Data Processing: One of the biggest challenges organizations have had in that past was the challenge of handling unstructured data. Hadoop manages data whether structured or unstructured, encoded or formatted, or any other type of data. Hadoop brings the value to the table where unstructured data can be useful in decision making process. 2. Hadoop Is Easily Scalable This is a huge feature of Hadoop. It is an open source platform and runs on industry-standard hardware. That makes Hadoop extremely scalable platform where new nodes can be easily added in the system as and data volume of processing needs grow without altering anything in the existing systems or programs. 3. Hadoop Is Fault Tolerant In Hadoop, the data is stored in HDFS where data automatically gets replicated at two other locations. So, even if one or two of the systems collapse, the file is still available on the third system at least. This brings a high level of fault tolerance. 4. Hadoop Is Great At Faster Data Processing Hadoop is extremely good at high-volume batch processing because of its ability to do parallel processing. Hadoop can perform batch processes 10 times faster than on a single thread server or on the mainframe. 5. Hadoop Ecosystem Is Robust: Hadoop has a very robust ecosystem that is well suited to meet the analytical needs of developers and small to large organizations. Hadoop Ecosystem comes with a suite of tools and technologies making i a very much suitable to deliver to a variety of data processing needs. 6. Hadoop Is Very Cost Effective Hadoop generates cost benefits by bringing massively parallel computing to commodity servers, resulting in a substantial reduction in the cost per terabyte of storage, which in turn makes it reasonable to model all your data.