SlideShare a Scribd company logo
Big Data Analysis
MOHAMMAD ALFAWAREH
What is the Big data
Big data: are massive and complex amount of data
that generated from different sources, with different
format structure, unstructured and semistrucured .
Resources of Big data
Social Sites Data :- such as Instagram , my space, snapchat Facebook, whatsup
All of these S.M generate photos, text, videos and emotions'
Search Engine Data: millions of peoples make searches in seconds and search engines deal with
serval type of data bases.
Medical History Data: hospital and clinks produce millions of transactions to the paints
regards history or from researches on some diseases .
IOT devices: with this revolutions billions of devices will generate the data in difference format.
Online shopping:-
YouTube Statistics
Every day people
watch hundreds of
millions of hours
over a billion users
people across the
world are
uploading 1.2
million videos
generate billions of
views
over 100 hours per
minute and this
number is ever
increasing
Characterize of B.D
Varity
Vilocity
Volume
Why now Big data analysis
1. Low cost storage to store data that was discarded earlier
2. Powerful multi-core processors
3. Low latency possible by distributed computing: Compute
clusters and grids connected via high-speed networks
4. Virtualization Partition, Aggregate, isolate resources in any
size and dynamically change it Minimize latency for any
scale
5. Affordable storage and computing with minimal man power
via clouds
6 Possible because of advances in Networking
Why we need Big data analysis
Businesses
intelligence
Make Our
life Easy
Military
App
Decrease
Average of
Diseases
Old method to solve the data base issues
RDMS was used to store the data in single space
when the client send a job the device will process the data and send it back to the client
,unfortunately with rapid generation of the data in huge volume every day like exa,peta,zeta
and in different format this method become ineffective for couple of reasons
Total data production is increasing day by day in a very large amount
We need huge storage devices and management system to this data
We need to increase the processing unit scale in due to some specific app need real time
processing.
Increase Colling and power consumptions .
What is Hadoop
Hadoop: is open source frame work used to
process and storage big data ,used in big
companies like google , Facebook, twitter.
Hadoop components
The Hadoop has two core components
one to the storage called HDFS the ides come from GFS.
And the another to the processing called Map-reduce.
HDFS
Map-Reduce
Hadoop Advantage
It is compatible on all the platforms.
 It distributes the data and tasks across the nodes automatically, which allows the users to
write and execute the distributed systems quickly.
 The library has the API to detect and handle failures at the application layer which relieves the
framework to rely on the hardware for fault tolerance.
The framework continues to operate smoothly with the addition and removal of servers
dynamically.
Big Data

More Related Content

What's hot

What's hot (20)

Big data ppt
Big data pptBig data ppt
Big data ppt
 
Big data
Big dataBig data
Big data
 
Big Data
Big DataBig Data
Big Data
 
Big Data
Big DataBig Data
Big Data
 
Big Data
Big DataBig Data
Big Data
 
Big data introduction
Big data introductionBig data introduction
Big data introduction
 
Overview of Big data(ppt)
Overview of Big data(ppt)Overview of Big data(ppt)
Overview of Big data(ppt)
 
Big data management
Big data managementBig data management
Big data management
 
Electronic Document Preparation and Management
Electronic Document Preparation and ManagementElectronic Document Preparation and Management
Electronic Document Preparation and Management
 
Big data peresintaion
Big data peresintaion Big data peresintaion
Big data peresintaion
 
Big data
Big dataBig data
Big data
 
Hadoop
HadoopHadoop
Hadoop
 
Big data and its applications
Big data and its applicationsBig data and its applications
Big data and its applications
 
Ijdsa
IjdsaIjdsa
Ijdsa
 
Big data
Big dataBig data
Big data
 
Ijdsa
IjdsaIjdsa
Ijdsa
 
Big Data & Data Mining
Big Data & Data MiningBig Data & Data Mining
Big Data & Data Mining
 
Big Data, Big Deal: For Future Big Data Scientists
Big Data, Big Deal: For Future Big Data ScientistsBig Data, Big Deal: For Future Big Data Scientists
Big Data, Big Deal: For Future Big Data Scientists
 
Big Data
Big DataBig Data
Big Data
 
Introduction to Big Data & Hadoop
Introduction to Big Data & Hadoop Introduction to Big Data & Hadoop
Introduction to Big Data & Hadoop
 

Similar to Big Data

Similar to Big Data (20)

No sql databases
No sql databasesNo sql databases
No sql databases
 
Big data technologies with Case Study Finance and Healthcare
Big data technologies with Case Study Finance and HealthcareBig data technologies with Case Study Finance and Healthcare
Big data technologies with Case Study Finance and Healthcare
 
Big Data
Big DataBig Data
Big Data
 
Big Data
Big DataBig Data
Big Data
 
How Do I Learn Big Data
How Do I Learn Big DataHow Do I Learn Big Data
How Do I Learn Big Data
 
Big data
Big dataBig data
Big data
 
Big data and Hadoop overview
Big data and Hadoop overviewBig data and Hadoop overview
Big data and Hadoop overview
 
Big data Presentation
Big data PresentationBig data Presentation
Big data Presentation
 
Lecture 5 - Big Data and Hadoop Intro.ppt
Lecture 5 - Big Data and Hadoop Intro.pptLecture 5 - Big Data and Hadoop Intro.ppt
Lecture 5 - Big Data and Hadoop Intro.ppt
 
Big Data
Big DataBig Data
Big Data
 
hadoop seminar training report
hadoop seminar  training reporthadoop seminar  training report
hadoop seminar training report
 
Big data
Big dataBig data
Big data
 
GADLJRIET850691
GADLJRIET850691GADLJRIET850691
GADLJRIET850691
 
NoSQL Type, Bigdata, and Analytics
NoSQL Type, Bigdata, and AnalyticsNoSQL Type, Bigdata, and Analytics
NoSQL Type, Bigdata, and Analytics
 
International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)
 
big-data-notes1.ppt
big-data-notes1.pptbig-data-notes1.ppt
big-data-notes1.ppt
 
Big data
Big dataBig data
Big data
 
Big data
Big dataBig data
Big data
 
Big data
Big dataBig data
Big data
 
Big data ppt
Big data pptBig data ppt
Big data ppt
 

Recently uploaded

一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
vcaxypu
 
Opendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptxOpendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptx
Opendatabay
 
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
nscud
 
一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单
ewymefz
 
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
nscud
 
一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单
enxupq
 
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
vcaxypu
 
standardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghhstandardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghh
ArpitMalhotra16
 
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
ukgaet
 

Recently uploaded (20)

Business update Q1 2024 Lar España Real Estate SOCIMI
Business update Q1 2024 Lar España Real Estate SOCIMIBusiness update Q1 2024 Lar España Real Estate SOCIMI
Business update Q1 2024 Lar España Real Estate SOCIMI
 
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
 
Webinar One View, Multiple Systems No-Code Integration of Salesforce and ERPs
Webinar One View, Multiple Systems No-Code Integration of Salesforce and ERPsWebinar One View, Multiple Systems No-Code Integration of Salesforce and ERPs
Webinar One View, Multiple Systems No-Code Integration of Salesforce and ERPs
 
社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .
 
Slip-and-fall Injuries: Top Workers' Comp Claims
Slip-and-fall Injuries: Top Workers' Comp ClaimsSlip-and-fall Injuries: Top Workers' Comp Claims
Slip-and-fall Injuries: Top Workers' Comp Claims
 
tapal brand analysis PPT slide for comptetive data
tapal brand analysis PPT slide for comptetive datatapal brand analysis PPT slide for comptetive data
tapal brand analysis PPT slide for comptetive data
 
Opendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptxOpendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptx
 
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
 
一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单
 
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
 
2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...
2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...
2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...
 
How can I successfully sell my pi coins in Philippines?
How can I successfully sell my pi coins in Philippines?How can I successfully sell my pi coins in Philippines?
How can I successfully sell my pi coins in Philippines?
 
一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单
 
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
 
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
 
standardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghhstandardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghh
 
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
 
Jpolillo Amazon PPC - Bid Optimization Sample
Jpolillo Amazon PPC - Bid Optimization SampleJpolillo Amazon PPC - Bid Optimization Sample
Jpolillo Amazon PPC - Bid Optimization Sample
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
 
Tabula.io Cheatsheet: automate your data workflows
Tabula.io Cheatsheet: automate your data workflowsTabula.io Cheatsheet: automate your data workflows
Tabula.io Cheatsheet: automate your data workflows
 

Big Data

  • 2. What is the Big data Big data: are massive and complex amount of data that generated from different sources, with different format structure, unstructured and semistrucured .
  • 3. Resources of Big data Social Sites Data :- such as Instagram , my space, snapchat Facebook, whatsup All of these S.M generate photos, text, videos and emotions' Search Engine Data: millions of peoples make searches in seconds and search engines deal with serval type of data bases. Medical History Data: hospital and clinks produce millions of transactions to the paints regards history or from researches on some diseases . IOT devices: with this revolutions billions of devices will generate the data in difference format. Online shopping:-
  • 4. YouTube Statistics Every day people watch hundreds of millions of hours over a billion users people across the world are uploading 1.2 million videos generate billions of views over 100 hours per minute and this number is ever increasing
  • 6. Why now Big data analysis 1. Low cost storage to store data that was discarded earlier 2. Powerful multi-core processors 3. Low latency possible by distributed computing: Compute clusters and grids connected via high-speed networks 4. Virtualization Partition, Aggregate, isolate resources in any size and dynamically change it Minimize latency for any scale 5. Affordable storage and computing with minimal man power via clouds 6 Possible because of advances in Networking
  • 7. Why we need Big data analysis Businesses intelligence Make Our life Easy Military App Decrease Average of Diseases
  • 8. Old method to solve the data base issues RDMS was used to store the data in single space when the client send a job the device will process the data and send it back to the client ,unfortunately with rapid generation of the data in huge volume every day like exa,peta,zeta and in different format this method become ineffective for couple of reasons Total data production is increasing day by day in a very large amount We need huge storage devices and management system to this data We need to increase the processing unit scale in due to some specific app need real time processing. Increase Colling and power consumptions .
  • 9. What is Hadoop Hadoop: is open source frame work used to process and storage big data ,used in big companies like google , Facebook, twitter.
  • 10. Hadoop components The Hadoop has two core components one to the storage called HDFS the ides come from GFS. And the another to the processing called Map-reduce.
  • 11. HDFS
  • 13. Hadoop Advantage It is compatible on all the platforms.  It distributes the data and tasks across the nodes automatically, which allows the users to write and execute the distributed systems quickly.  The library has the API to detect and handle failures at the application layer which relieves the framework to rely on the hardware for fault tolerance. The framework continues to operate smoothly with the addition and removal of servers dynamically.