SlideShare a Scribd company logo
1 of 10
How Do I Learn Big Data?
~For Big Data Beginners
Visit : udemy.com/big-data-and-hadoop-for-beginners/
Topics
About Big Data
Hadoop Ecosystem
Verticals
Job Roles
Big Data Salary Trends
Skills Good To Have
Getting Started
About Big data
There are many ways data is being generated like tweets generated by
millions of users on Twitter, Facebook posts by billions of users, YouTube
videos uploaded every minute, and sensors generating data, a Boeing
generating terabytes of data in a single flight. These data can be termed
as Big Data.
Big Data is complex to analyze and solve because of the nature of the
data. They are mostly in semi-structured or unstructured form which
makes it difficult to extract information and business insight (using
traditional systems). This is where Hadoop comes into the picture.
Hadoop Ecosystem
HDFS: a distributed file storage system in Hadoop
Map-Reduce: a data processing engine in Hadoop
Hive: a data warehouse infrastructure built on top of Hadoop
Pig: a data pipeline infrastructure built on top of Hadoop
Scoop: a tool to transfer data from RDBMS world to Hadoop and vice-versa.
And many more tools, built on top of Hadoop, to solve different business problems.
Verticals
Telecommunication
Retail
E-commerce
Aerospace
R&D
Automotive
Logistics
Job Roles
Big Data Analyst
Hadoop Administrator
Big Data Engineer
Big Data Scientist
Big Data Manager
Big Data Solutions Architect
Chief Data Officer
Big Data Salary Trends
Skills Good To Have
RDBMS: MySQL, Oracle, MS SQL Server etc.
SQL
ETL
Data Warehouse
Business Intelligence
Java/Python/Scala
Getting Started
Steps:
1. Learn Fundamentals of Hadoop
2. Learn HDFS, Map-Reduce, Hive, Pig
3. Start playing with small data sets (take one use case)
4. Build a sample Data Pipeline to see how Pig and Hive can be used together in
processing Big Data.
5. Go Big. Take real life problems, and try to solve them using Big Data Technologies
Thank you!
www.udemy.com/big-data-and-hadoop-for-beginners/

More Related Content

What's hot (17)

Gail Zhou on "Big Data Technology, Strategy, and Applications"
Gail Zhou on "Big Data Technology, Strategy, and Applications"Gail Zhou on "Big Data Technology, Strategy, and Applications"
Gail Zhou on "Big Data Technology, Strategy, and Applications"
 
A brief history of "big data"
A brief history of "big data"A brief history of "big data"
A brief history of "big data"
 
Big data
Big dataBig data
Big data
 
A Brief History Of Data
A Brief History Of DataA Brief History Of Data
A Brief History Of Data
 
Big data
Big dataBig data
Big data
 
Introduction of Big data and Hadoop
Introduction of Big data and Hadoop Introduction of Big data and Hadoop
Introduction of Big data and Hadoop
 
Bigdata " new level"
Bigdata " new level"Bigdata " new level"
Bigdata " new level"
 
Big data introduction
Big data introductionBig data introduction
Big data introduction
 
Big data abstract
Big data abstractBig data abstract
Big data abstract
 
The big story (BIG DATA)
The big story (BIG DATA)The big story (BIG DATA)
The big story (BIG DATA)
 
Easylearning Guru online Hadoop class
Easylearning Guru online Hadoop class Easylearning Guru online Hadoop class
Easylearning Guru online Hadoop class
 
Big data
Big dataBig data
Big data
 
Big data
Big dataBig data
Big data
 
Big data
Big dataBig data
Big data
 
Big Data And Hadoop
Big Data And HadoopBig Data And Hadoop
Big Data And Hadoop
 
Big data Ppt
Big data PptBig data Ppt
Big data Ppt
 
Big Stream Processing Systems, Big Graphs
Big Stream Processing Systems, Big GraphsBig Stream Processing Systems, Big Graphs
Big Stream Processing Systems, Big Graphs
 

Viewers also liked

Student Participation, Engagement, Interaction and Learning in the Synchronou...
Student Participation, Engagement, Interaction and Learning in the Synchronou...Student Participation, Engagement, Interaction and Learning in the Synchronou...
Student Participation, Engagement, Interaction and Learning in the Synchronou...Suzanne Stone
 
Puri diplomatic greens 3/4 bhk apartments in resale call 9871822103
Puri diplomatic greens 3/4 bhk apartments in resale call 9871822103Puri diplomatic greens 3/4 bhk apartments in resale call 9871822103
Puri diplomatic greens 3/4 bhk apartments in resale call 9871822103kamalkumarmalan2
 
NAR Certifications0001
NAR Certifications0001NAR Certifications0001
NAR Certifications0001Jerry Smektala
 
Visita al centro de mayores
Visita al centro de mayoresVisita al centro de mayores
Visita al centro de mayoresCEOPUERTO
 
Les temps du futur
Les temps du futurLes temps du futur
Les temps du futurL N
 
MongoDB 101
MongoDB 101MongoDB 101
MongoDB 101leafnode
 
Mango Database - Web Development
Mango Database - Web DevelopmentMango Database - Web Development
Mango Database - Web Developmentmssaman
 
Mango Language Database Presentation
Mango Language Database PresentationMango Language Database Presentation
Mango Language Database PresentationSTCC Library
 
Dev Jumpstart: Building Your First App
Dev Jumpstart: Building Your First AppDev Jumpstart: Building Your First App
Dev Jumpstart: Building Your First AppMongoDB
 
Ecophon - Akusto - FR
Ecophon - Akusto - FREcophon - Akusto - FR
Ecophon - Akusto - FRArchitectura
 
Normas COVENIN Edificaciones Sismo-Resistentes
Normas COVENIN Edificaciones Sismo-ResistentesNormas COVENIN Edificaciones Sismo-Resistentes
Normas COVENIN Edificaciones Sismo-ResistentesJosué A. Sanez C.
 

Viewers also liked (14)

Student Participation, Engagement, Interaction and Learning in the Synchronou...
Student Participation, Engagement, Interaction and Learning in the Synchronou...Student Participation, Engagement, Interaction and Learning in the Synchronou...
Student Participation, Engagement, Interaction and Learning in the Synchronou...
 
Arpan-Resume
Arpan-ResumeArpan-Resume
Arpan-Resume
 
Puri diplomatic greens 3/4 bhk apartments in resale call 9871822103
Puri diplomatic greens 3/4 bhk apartments in resale call 9871822103Puri diplomatic greens 3/4 bhk apartments in resale call 9871822103
Puri diplomatic greens 3/4 bhk apartments in resale call 9871822103
 
NAR Certifications0001
NAR Certifications0001NAR Certifications0001
NAR Certifications0001
 
Visita al centro de mayores
Visita al centro de mayoresVisita al centro de mayores
Visita al centro de mayores
 
Digital painting
Digital paintingDigital painting
Digital painting
 
Les temps du futur
Les temps du futurLes temps du futur
Les temps du futur
 
MongoDB 101
MongoDB 101MongoDB 101
MongoDB 101
 
Mango Database - Web Development
Mango Database - Web DevelopmentMango Database - Web Development
Mango Database - Web Development
 
Architecting for failure
Architecting for failureArchitecting for failure
Architecting for failure
 
Mango Language Database Presentation
Mango Language Database PresentationMango Language Database Presentation
Mango Language Database Presentation
 
Dev Jumpstart: Building Your First App
Dev Jumpstart: Building Your First AppDev Jumpstart: Building Your First App
Dev Jumpstart: Building Your First App
 
Ecophon - Akusto - FR
Ecophon - Akusto - FREcophon - Akusto - FR
Ecophon - Akusto - FR
 
Normas COVENIN Edificaciones Sismo-Resistentes
Normas COVENIN Edificaciones Sismo-ResistentesNormas COVENIN Edificaciones Sismo-Resistentes
Normas COVENIN Edificaciones Sismo-Resistentes
 

Similar to How Do I Learn Big Data

Big data and Hadoop overview
Big data and Hadoop overviewBig data and Hadoop overview
Big data and Hadoop overviewNitesh Ghosh
 
Big data-analytics-cpe8035
Big data-analytics-cpe8035Big data-analytics-cpe8035
Big data-analytics-cpe8035Neelam Rawat
 
IRJET- Youtube Data Sensitivity and Analysis using Hadoop Framework
IRJET-  	  Youtube Data Sensitivity and Analysis using Hadoop FrameworkIRJET-  	  Youtube Data Sensitivity and Analysis using Hadoop Framework
IRJET- Youtube Data Sensitivity and Analysis using Hadoop FrameworkIRJET Journal
 
What is big data
What is big dataWhat is big data
What is big dataShubShubi
 
Introduction to Big Data
Introduction to Big DataIntroduction to Big Data
Introduction to Big DataIMC Institute
 
Big data peresintaion
Big data peresintaion Big data peresintaion
Big data peresintaion ahmed alshikh
 
IRJET - Survey Paper on Map Reduce Processing using HADOOP
IRJET - Survey Paper on Map Reduce Processing using HADOOPIRJET - Survey Paper on Map Reduce Processing using HADOOP
IRJET - Survey Paper on Map Reduce Processing using HADOOPIRJET Journal
 
Lecture 5 - Big Data and Hadoop Intro.ppt
Lecture 5 - Big Data and Hadoop Intro.pptLecture 5 - Big Data and Hadoop Intro.ppt
Lecture 5 - Big Data and Hadoop Intro.pptalmaraniabwmalk
 
Big Data and Big Data Management (BDM) with current Technologies –Review
Big Data and Big Data Management (BDM) with current Technologies –ReviewBig Data and Big Data Management (BDM) with current Technologies –Review
Big Data and Big Data Management (BDM) with current Technologies –ReviewIJERA Editor
 

Similar to How Do I Learn Big Data (20)

Big Data
Big DataBig Data
Big Data
 
Big data Presentation
Big data PresentationBig data Presentation
Big data Presentation
 
Big data
Big dataBig data
Big data
 
Big Data
Big DataBig Data
Big Data
 
big-data-notes1.ppt
big-data-notes1.pptbig-data-notes1.ppt
big-data-notes1.ppt
 
Big Data
Big DataBig Data
Big Data
 
GADLJRIET850691
GADLJRIET850691GADLJRIET850691
GADLJRIET850691
 
Big data and Hadoop overview
Big data and Hadoop overviewBig data and Hadoop overview
Big data and Hadoop overview
 
No sql databases
No sql databasesNo sql databases
No sql databases
 
Big data technologies with Case Study Finance and Healthcare
Big data technologies with Case Study Finance and HealthcareBig data technologies with Case Study Finance and Healthcare
Big data technologies with Case Study Finance and Healthcare
 
Big data-analytics-cpe8035
Big data-analytics-cpe8035Big data-analytics-cpe8035
Big data-analytics-cpe8035
 
IRJET- Youtube Data Sensitivity and Analysis using Hadoop Framework
IRJET-  	  Youtube Data Sensitivity and Analysis using Hadoop FrameworkIRJET-  	  Youtube Data Sensitivity and Analysis using Hadoop Framework
IRJET- Youtube Data Sensitivity and Analysis using Hadoop Framework
 
What is big data
What is big dataWhat is big data
What is big data
 
Introduction to Big Data
Introduction to Big DataIntroduction to Big Data
Introduction to Big Data
 
00 hadoop welcome_transcript
00 hadoop welcome_transcript00 hadoop welcome_transcript
00 hadoop welcome_transcript
 
Big data peresintaion
Big data peresintaion Big data peresintaion
Big data peresintaion
 
IRJET - Survey Paper on Map Reduce Processing using HADOOP
IRJET - Survey Paper on Map Reduce Processing using HADOOPIRJET - Survey Paper on Map Reduce Processing using HADOOP
IRJET - Survey Paper on Map Reduce Processing using HADOOP
 
Lecture 5 - Big Data and Hadoop Intro.ppt
Lecture 5 - Big Data and Hadoop Intro.pptLecture 5 - Big Data and Hadoop Intro.ppt
Lecture 5 - Big Data and Hadoop Intro.ppt
 
Big data
Big dataBig data
Big data
 
Big Data and Big Data Management (BDM) with current Technologies –Review
Big Data and Big Data Management (BDM) with current Technologies –ReviewBig Data and Big Data Management (BDM) with current Technologies –Review
Big Data and Big Data Management (BDM) with current Technologies –Review
 

Recently uploaded

Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfPredicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfBoston Institute of Analytics
 
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档208367051
 
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort servicejennyeacort
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024thyngster
 
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...Boston Institute of Analytics
 
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptSonatrach
 
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理e4aez8ss
 
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptxAmazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptxAbdelrhman abooda
 
RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.natarajan8993
 
Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Colleen Farrelly
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]📊 Markus Baersch
 
Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfLars Albertsson
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPramod Kumar Srivastava
 
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degreeyuu sss
 
办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一
办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一
办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一F La
 
Customer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxCustomer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxEmmanuel Dauda
 
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)jennyeacort
 
B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxStephen266013
 

Recently uploaded (20)

Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfPredicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
 
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
 
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
 
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
 
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
 
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
 
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptxAmazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
 
RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.
 
E-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptxE-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptx
 
Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]
 
Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdf
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
 
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
 
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
 
办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一
办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一
办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一
 
Customer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxCustomer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptx
 
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
 
B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docx
 

How Do I Learn Big Data

  • 1. How Do I Learn Big Data? ~For Big Data Beginners Visit : udemy.com/big-data-and-hadoop-for-beginners/
  • 2. Topics About Big Data Hadoop Ecosystem Verticals Job Roles Big Data Salary Trends Skills Good To Have Getting Started
  • 3. About Big data There are many ways data is being generated like tweets generated by millions of users on Twitter, Facebook posts by billions of users, YouTube videos uploaded every minute, and sensors generating data, a Boeing generating terabytes of data in a single flight. These data can be termed as Big Data. Big Data is complex to analyze and solve because of the nature of the data. They are mostly in semi-structured or unstructured form which makes it difficult to extract information and business insight (using traditional systems). This is where Hadoop comes into the picture.
  • 4. Hadoop Ecosystem HDFS: a distributed file storage system in Hadoop Map-Reduce: a data processing engine in Hadoop Hive: a data warehouse infrastructure built on top of Hadoop Pig: a data pipeline infrastructure built on top of Hadoop Scoop: a tool to transfer data from RDBMS world to Hadoop and vice-versa. And many more tools, built on top of Hadoop, to solve different business problems.
  • 6. Job Roles Big Data Analyst Hadoop Administrator Big Data Engineer Big Data Scientist Big Data Manager Big Data Solutions Architect Chief Data Officer
  • 8. Skills Good To Have RDBMS: MySQL, Oracle, MS SQL Server etc. SQL ETL Data Warehouse Business Intelligence Java/Python/Scala
  • 9. Getting Started Steps: 1. Learn Fundamentals of Hadoop 2. Learn HDFS, Map-Reduce, Hive, Pig 3. Start playing with small data sets (take one use case) 4. Build a sample Data Pipeline to see how Pig and Hive can be used together in processing Big Data. 5. Go Big. Take real life problems, and try to solve them using Big Data Technologies

Editor's Notes

  1. We designed this template so that each member of the project team has a set of slides with its own theme where he/she can present their research. Members, here’s how you add a new slide to just your set: Mark where you want to add the slide: Select an existing one in the Thumbnails pane, click the New Slide button, then choose a layout. The new slide gets the same theme as the previous one you selected. Careful! Don’t annoy your fellow presenters by accidentally changing their themes. That can happen if you choose a theme Variant from the Design tab, which changes all of the slides in your presentation to that look.