SlideShare a Scribd company logo
1 of 28
Sizes of data
What is actually Big Data?
Big data-: So large data that it becomes difficult to
process it using the traditional system.
SO WHAT’S THE
PROBLEM?
Storage
fer
VisualizationAnalysis
Sharing
Example...
Do you ever tried opening 0.5GB of file on
your machine?
Its difficult to edit 10TB file in limited
time in traditional system
ClassifiCation of Big Data
1. StructuredData:
● It refers to data that has a defined
length and format for big data
● Ex.numbers, dates, and groups of
words and numbers called strings.
● It’s usually stored in a database.
2. Unstructured Data
●
● No fields
Massive data ex. Internet
data●
Music(Audio)Applications Movie(vedio)
X-Rays Pictures
3. Semi-Structured Data
The data which do not have a proper
formate attached to it. Ex.
–Data within an email
–Data in Doc File
Why do we need this?
●
●
●
● Better understand and target
coustmers
In Election exit poll
Improving healthcare
Improving and optimizing cities and
countries.
Application of big data are endless
Characteristics of Big Data
1)
Velocit
y
2)Volu
ExamplEs of VEloCity
●
●
●
Almost 3.5 billion queries on Google
are performed each day
80 million photos are shared on Instagram on
an average day.
Every minute we upload 300 hours
of video on Youtube.
every day over 205 billion emails are sent.
500 million tweets are sent per day.
●
● It refers to the vast amount of
data generated every second.
● Here we are talking about
Zettabyte or more.
● Data is generated by machines,
networks and human
interaction on systems like
social media.
● The volume of data to be
analyzed is massive.
Example of Volume.
Airb
us●
●
Airbus generates 10TB every
30 minutes About 640TB is
generated in one flight
Example of Volume...2
• Self-driving cars will
generate 2 Petabyte of data
every year.
• From now on, the amount of
data in the world will
double every two years.
• By 2020, we will have 50 times
the amount of data as that we
had in 2011.
●
●
●
Variety
●
●
●
Refers to the different types ofdata we can
now use.
In past the data was structured that fitted
in columns and rows.
– Stored in Database
– Spread sheets
But now the data is unstructured that are
difficult to storing, analysing,mining.
– Email, photo, audio
– monitoring devices, PDFs
●
●
●
Are the results meaningful for the
given problem space?
it’s about data quality and
understandability.
Especially in automated decision-
making, where no human is involved
anymore, you need to be sure that
both the data and the analyses are
correct.
Veracity
Data Generating Points
●
●
●
Smart Phones
5 billion camera phones are
there in the world Most of them
have location awareness(GPS)
By 2020 we will have 6.1 billion
smartphones users globally.
Internet
●
●
●
●
2 billion people using internet
By the end of 2020, cisco internet
traffic will be 4.8 ZB per year.
Emails:
205 billion email sent every day
Blogs:
There are 200 million entries on the
web
Social Media
Facebook:
● 34K likes every minute
● It deals with 3-4 PB of data each
day
● There are 1 billion active user
Twitter:
● It generates 12TB of data daily
● 300million user generates
●
●
Google:
It perform 2million search
every minute It deals with
20PB of data each day
Youtube:
●
2.9 billion vedio hours vedio
watched per month
Tools for handling big data
Traditional System
ex. RDBMS
Big Data Tools
ex. Hadoop
Created to handle
Big Data
Limitations of Traditional
Data Warehouse
●
Co
st
●
Fixed Schema of
RDBMS
●
Saving huge file and
accessing them
●
Perform
analysis
●
Time to do all
What is
Hadoop?
• The Apache Hadoop software library is a
framework that allows for the distributed
processing of large data sets across clusters
of computers using simple programming
models.
• It is made by apache software foundation in
2011.
• Written in JAVA.
BIG DATA ANALYTICS
BIG DATA ANALYTICS

More Related Content

What's hot

What's hot (6)

Big Data meetup R#1 slide
Big Data meetup R#1 slideBig Data meetup R#1 slide
Big Data meetup R#1 slide
 
Neil Fraser
Neil FraserNeil Fraser
Neil Fraser
 
On the Quest for Changing Knowledge. Capturing emerging entities from social ...
On the Quest for Changing Knowledge. Capturing emerging entities from social ...On the Quest for Changing Knowledge. Capturing emerging entities from social ...
On the Quest for Changing Knowledge. Capturing emerging entities from social ...
 
Big data
Big dataBig data
Big data
 
Big data
Big dataBig data
Big data
 
4 Things You Didn't Know About Big Data
4 Things You Didn't Know About Big Data4 Things You Didn't Know About Big Data
4 Things You Didn't Know About Big Data
 

Similar to BIG DATA ANALYTICS

Unit 1 (DSBDA) PD.pptx
Unit 1 (DSBDA)  PD.pptxUnit 1 (DSBDA)  PD.pptx
Unit 1 (DSBDA) PD.pptxSamiksha880257
 
Introduction to Big Data
Introduction to Big DataIntroduction to Big Data
Introduction to Big DataAkshata Humbe
 
Implementation of application for huge data file transfer
Implementation of application for huge data file transferImplementation of application for huge data file transfer
Implementation of application for huge data file transferijwmn
 
Big data PPT prepared by Hritika Raj (Shivalik college of engg.)
Big data PPT prepared by Hritika Raj (Shivalik college of engg.)Big data PPT prepared by Hritika Raj (Shivalik college of engg.)
Big data PPT prepared by Hritika Raj (Shivalik college of engg.)Hritika Raj
 
UNIT 1 -BIG DATA ANALYTICS Full.pdf
UNIT 1 -BIG DATA ANALYTICS Full.pdfUNIT 1 -BIG DATA ANALYTICS Full.pdf
UNIT 1 -BIG DATA ANALYTICS Full.pdfvvpadhu
 
Research issues in the big data and its Challenges
Research issues in the big data and its ChallengesResearch issues in the big data and its Challenges
Research issues in the big data and its ChallengesKathirvel Ayyaswamy
 
Introduction to big data – convergences.
Introduction to big data – convergences.Introduction to big data – convergences.
Introduction to big data – convergences.saranya270513
 
Big Data Handling Technologies ICCCS 2014_Love Arora _GNDU
Big Data Handling Technologies ICCCS 2014_Love Arora _GNDU Big Data Handling Technologies ICCCS 2014_Love Arora _GNDU
Big Data Handling Technologies ICCCS 2014_Love Arora _GNDU Love Arora
 

Similar to BIG DATA ANALYTICS (20)

Big Data Presentation
Big  Data PresentationBig  Data Presentation
Big Data Presentation
 
Lecture #03
Lecture #03Lecture #03
Lecture #03
 
Unit 1 (DSBDA) PD.pptx
Unit 1 (DSBDA)  PD.pptxUnit 1 (DSBDA)  PD.pptx
Unit 1 (DSBDA) PD.pptx
 
Introduction to Big Data
Introduction to Big DataIntroduction to Big Data
Introduction to Big Data
 
Bigdata " new level"
Bigdata " new level"Bigdata " new level"
Bigdata " new level"
 
Big Data Analysis
Big Data AnalysisBig Data Analysis
Big Data Analysis
 
Kartikey tripathi
Kartikey tripathiKartikey tripathi
Kartikey tripathi
 
Implementation of application for huge data file transfer
Implementation of application for huge data file transferImplementation of application for huge data file transfer
Implementation of application for huge data file transfer
 
Big data PPT prepared by Hritika Raj (Shivalik college of engg.)
Big data PPT prepared by Hritika Raj (Shivalik college of engg.)Big data PPT prepared by Hritika Raj (Shivalik college of engg.)
Big data PPT prepared by Hritika Raj (Shivalik college of engg.)
 
Ictam big data
Ictam big dataIctam big data
Ictam big data
 
UNIT 1 -BIG DATA ANALYTICS Full.pdf
UNIT 1 -BIG DATA ANALYTICS Full.pdfUNIT 1 -BIG DATA ANALYTICS Full.pdf
UNIT 1 -BIG DATA ANALYTICS Full.pdf
 
Big data
Big dataBig data
Big data
 
Big data
Big dataBig data
Big data
 
SKILLWISE-BIGDATA ANALYSIS
SKILLWISE-BIGDATA ANALYSISSKILLWISE-BIGDATA ANALYSIS
SKILLWISE-BIGDATA ANALYSIS
 
big data.pptx
big data.pptxbig data.pptx
big data.pptx
 
Research issues in the big data and its Challenges
Research issues in the big data and its ChallengesResearch issues in the big data and its Challenges
Research issues in the big data and its Challenges
 
Introduction to big data – convergences.
Introduction to big data – convergences.Introduction to big data – convergences.
Introduction to big data – convergences.
 
Big Data Handling Technologies ICCCS 2014_Love Arora _GNDU
Big Data Handling Technologies ICCCS 2014_Love Arora _GNDU Big Data Handling Technologies ICCCS 2014_Love Arora _GNDU
Big Data Handling Technologies ICCCS 2014_Love Arora _GNDU
 
Big data Analytics
Big data Analytics Big data Analytics
Big data Analytics
 
Big data
Big dataBig data
Big data
 

Recently uploaded

Data Science Project: Advancements in Fetal Health Classification
Data Science Project: Advancements in Fetal Health ClassificationData Science Project: Advancements in Fetal Health Classification
Data Science Project: Advancements in Fetal Health ClassificationBoston Institute of Analytics
 
100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptxAnupama Kate
 
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Sapana Sha
 
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...
VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...
VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...Suhani Kapoor
 
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls DubaiDubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls Dubaihf8803863
 
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfKantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfSocial Samosa
 
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改atducpo
 
Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfLars Albertsson
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationshipsccctableauusergroup
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfRachmat Ramadhan H
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 
Call Girls In Mahipalpur O9654467111 Escorts Service
Call Girls In Mahipalpur O9654467111  Escorts ServiceCall Girls In Mahipalpur O9654467111  Escorts Service
Call Girls In Mahipalpur O9654467111 Escorts ServiceSapana Sha
 
Customer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxCustomer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxEmmanuel Dauda
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptxthyngster
 
Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingNeil Barnes
 
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...Suhani Kapoor
 

Recently uploaded (20)

Data Science Project: Advancements in Fetal Health Classification
Data Science Project: Advancements in Fetal Health ClassificationData Science Project: Advancements in Fetal Health Classification
Data Science Project: Advancements in Fetal Health Classification
 
100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx
 
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
 
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
 
VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...
VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...
VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...
 
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
 
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls DubaiDubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
 
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfKantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
 
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
 
Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdf
 
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
 
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in  KishangarhDelhi 99530 vip 56974 Genuine Escort Service Call Girls in  Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
Call Girls In Mahipalpur O9654467111 Escorts Service
Call Girls In Mahipalpur O9654467111  Escorts ServiceCall Girls In Mahipalpur O9654467111  Escorts Service
Call Girls In Mahipalpur O9654467111 Escorts Service
 
Customer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxCustomer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptx
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
 
Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data Storytelling
 
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
 

BIG DATA ANALYTICS

  • 1.
  • 3. What is actually Big Data? Big data-: So large data that it becomes difficult to process it using the traditional system.
  • 4.
  • 6. Example... Do you ever tried opening 0.5GB of file on your machine?
  • 7. Its difficult to edit 10TB file in limited time in traditional system
  • 8. ClassifiCation of Big Data 1. StructuredData: ● It refers to data that has a defined length and format for big data ● Ex.numbers, dates, and groups of words and numbers called strings. ● It’s usually stored in a database.
  • 9. 2. Unstructured Data ● ● No fields Massive data ex. Internet data● Music(Audio)Applications Movie(vedio) X-Rays Pictures
  • 10. 3. Semi-Structured Data The data which do not have a proper formate attached to it. Ex. –Data within an email –Data in Doc File
  • 11. Why do we need this? ● ● ● ● Better understand and target coustmers In Election exit poll Improving healthcare Improving and optimizing cities and countries. Application of big data are endless
  • 12. Characteristics of Big Data 1) Velocit y 2)Volu
  • 13.
  • 14. ExamplEs of VEloCity ● ● ● Almost 3.5 billion queries on Google are performed each day 80 million photos are shared on Instagram on an average day. Every minute we upload 300 hours of video on Youtube. every day over 205 billion emails are sent. 500 million tweets are sent per day. ●
  • 15. ● It refers to the vast amount of data generated every second. ● Here we are talking about Zettabyte or more. ● Data is generated by machines, networks and human interaction on systems like social media. ● The volume of data to be analyzed is massive.
  • 16. Example of Volume. Airb us● ● Airbus generates 10TB every 30 minutes About 640TB is generated in one flight
  • 17. Example of Volume...2 • Self-driving cars will generate 2 Petabyte of data every year. • From now on, the amount of data in the world will double every two years. • By 2020, we will have 50 times the amount of data as that we had in 2011. ● ● ●
  • 18. Variety ● ● ● Refers to the different types ofdata we can now use. In past the data was structured that fitted in columns and rows. – Stored in Database – Spread sheets But now the data is unstructured that are difficult to storing, analysing,mining. – Email, photo, audio – monitoring devices, PDFs
  • 19. ● ● ● Are the results meaningful for the given problem space? it’s about data quality and understandability. Especially in automated decision- making, where no human is involved anymore, you need to be sure that both the data and the analyses are correct. Veracity
  • 20. Data Generating Points ● ● ● Smart Phones 5 billion camera phones are there in the world Most of them have location awareness(GPS) By 2020 we will have 6.1 billion smartphones users globally.
  • 21. Internet ● ● ● ● 2 billion people using internet By the end of 2020, cisco internet traffic will be 4.8 ZB per year. Emails: 205 billion email sent every day Blogs: There are 200 million entries on the web
  • 22. Social Media Facebook: ● 34K likes every minute ● It deals with 3-4 PB of data each day ● There are 1 billion active user Twitter: ● It generates 12TB of data daily ● 300million user generates
  • 23. ● ● Google: It perform 2million search every minute It deals with 20PB of data each day Youtube: ● 2.9 billion vedio hours vedio watched per month
  • 24. Tools for handling big data Traditional System ex. RDBMS Big Data Tools ex. Hadoop Created to handle Big Data
  • 25. Limitations of Traditional Data Warehouse ● Co st ● Fixed Schema of RDBMS ● Saving huge file and accessing them ● Perform analysis ● Time to do all
  • 26. What is Hadoop? • The Apache Hadoop software library is a framework that allows for the distributed processing of large data sets across clusters of computers using simple programming models. • It is made by apache software foundation in 2011. • Written in JAVA.