SlideShare a Scribd company logo
1 of 20
The rise of “big data”
on cloud computing
Presenter
Muhammad Maaz Irfan
Reg No: 201824100008
School of Information Science and Engineering
University of Jinan, Jinan, Shandong, China
1
Contents
I. Introduction
II. Definition & characteristics of big data
III. Cloud Computing
IV. Relationship between cloud & big data
V. Case studies
VI. Big data storage system
VII. Hadoop background
VIII. Research Challenges
I. Scalability
II. Availability
III. Data integrity
IV. Transformation
V. Data quality
VI. Heterogeneity
VII. Privacy
VIII. Legal issues
IX. Governance
IX. Open Research issues
X. Conclusion
2
Introduction
• The rise of social media, Internet of Things (IoT),
and multimedia, has produced an overwhelming
flow of data in either structured
• Big data are characterized by three aspects:
– Data are numerous
– Data cannot be categorized into regular relational
databases
– Data are generated, captured, and processed rapidly.
Moreover, big data is transforming healthcare,
science, engineering, finance, business, and
eventually, the society.
3
Definition & characteristics of big data
• Big data is a set of techniques and
technologies that require new forms of
integration to uncover large hidden values
from large datasets that are
– Diverse
– Complex
– & In massive scale
4
Four Vs of big data
5
Big
data
Volume
Variety
Velocity
Value
Advantages of big data
6
Cloud Computing 7
8
Cloud Computing
9
• Cloud computing is a fast-growing technology that has
established itself in the next generation of IT industry
and business.
• Cloud computing promises reliable software, hardware,
and IaaS(Infrastructure as a Service) delivered over the
Internet and remote data centers .
• Cloud services have become a powerful architecture
to perform complex large-scale computing tasks and
span a range of IT functions from storage and
computation to database and application services.
Relationship between cloud
& big data
10
Relationship between cloud
& big data
• Cloud computing and big data are conjoined.
• Big data provides users the ability to use commodity
computing to process distributed queries across
multiple datasets and return resultant sets in a timely
manner.
• Cloud computing provides the underlying through the
use of Hadoop, a class of distributed data-processing
platforms.
• Big data utilizes distributed storage technology based
on cloud computing rather than local storage attached
to a computer or electronic device.
11
Cloud computing usage in big data
12
Case studies
• Studies provided by different vendors who
integrate big data technologies into their
cloud environment.
13
Big data storage system
14
Big data storage system
• A storage architecture that can be accessed in a highly efficient
manner while achieving availabilityand reliability is required to store
and manage large datasets.
• Existing technologies can be classified as direct attached storage
(DAS), network attached storage (NAS), and storage area network
(SAN).
• The organizational systems of data storages (DAS, NAS, and SAN)
can be divided into three parts:
– (i) disc array, where the foundation of a storage system provides the
fundamental guarantee
– (ii) connection and network subsystems, which connect one or more
disc arrays and servers
– (iii) storage management software, which oversees data sharing,
storage management, and disaster recovery tasks for multiple servers.
15
background
• Hadoop is an open-source software framework
for storing data and running applications on
clusters of commodity hardware. It provides
massive storage for any kind of data, enormous
processing power and the ability to handle
virtually limitless concurrent tasks or jobs
• The software framework that supports HDFS,
MapReduce and other related entities is called
the project Hadoop or simply Hadoop.
• This is open source and distributed by Apache
16
Features of Hadoop
I. Cost Effective System
II. Large Cluster of Notes
III. Parallel Processing
IV. Distributive Data
V. Automatic failover management
VI. Data Locality optimization
VII.Heterogeneous Cluster
VIII.Scalability
17
Research Challenges
I. Scalability = Scalability is the ability of the storage to handle increasing
amounts of data in an appropriate manner.
II. Availability = Availability refers to the resources of the system accessible on
demand by an authorized individual
III. Data integrity = A key aspect of big data security is integrity. Integrity means
that data can be modified only by authorized parties or the data owner to
prevent misuse.
IV. Transformation = Transforming data into a form suitable for analysis is an
obstacle in the adoption of big data.
V. Data quality = In the past, data processing was typically performed on clean
datasets from well-known and limited sources.
VI. Heterogeneity = Variety, one of the major aspects of big data characterization,
is the result of the growth of virtually unlimited different sources of data.
VII.Privacy = Privacy concerns continue to hamper users who outsource their
private data into the cloud storage.
VIII.Legal issues = Specific laws and regulations must be established to preserve
the personal and sensitive information of users.
IX. Governance = Data governance embodies the exercise of control and
authority over data-related rules of law, transparency. 18
Conclusion
• Two IT initiatives are currently top of mind for
organizations across the globe i.e.
– Big Data Analytics
– Cloud Computing
As a delivery model for IT services , cloud computing has the
potential to enhance business agility and productivity while
enabling greater efficiencies and reducing costs.
• In the current scenario , Big Data is a big challenge for the
organizations . To store and process such large volume of
data , variety of data and velocity of data Hadoop came into
existence.
• Our presentation is all about Cloud Computing , Big Data &
Big Data Analytics.
19
Thanks for your time 
If you any Questions please ask.
20

More Related Content

What's hot

International Journal on Cloud Computing: Services and Architecture (IJCCSA)
International Journal on Cloud Computing: Services and Architecture (IJCCSA)International Journal on Cloud Computing: Services and Architecture (IJCCSA)
International Journal on Cloud Computing: Services and Architecture (IJCCSA)ijccsa
 
Two-Phase TDS Approach for Data Anonymization To Preserving Bigdata Privacy
Two-Phase TDS Approach for Data Anonymization To Preserving Bigdata PrivacyTwo-Phase TDS Approach for Data Anonymization To Preserving Bigdata Privacy
Two-Phase TDS Approach for Data Anonymization To Preserving Bigdata Privacydbpublications
 
11th International conference on Database Management Systems (DMS 2020)
11th International conference on Database Management Systems (DMS 2020)11th International conference on Database Management Systems (DMS 2020)
11th International conference on Database Management Systems (DMS 2020)dannyijwest
 
Digital preservation: an introduction
Digital preservation: an introductionDigital preservation: an introduction
Digital preservation: an introductionPublicLibraryServices
 
A Survey on Big Data Mining Challenges
A Survey on Big Data Mining ChallengesA Survey on Big Data Mining Challenges
A Survey on Big Data Mining ChallengesEditor IJMTER
 
Big data introduction
Big data introductionBig data introduction
Big data introductionChirag Ahuja
 
Sirris innovate2011 - Smart Products with smart data - introduction, Dr. Elen...
Sirris innovate2011 - Smart Products with smart data - introduction, Dr. Elen...Sirris innovate2011 - Smart Products with smart data - introduction, Dr. Elen...
Sirris innovate2011 - Smart Products with smart data - introduction, Dr. Elen...Sirris
 
Bigdata Analytics using Hadoop
Bigdata Analytics using HadoopBigdata Analytics using Hadoop
Bigdata Analytics using HadoopNagamani Gurram
 
Hypermedia database on the Web
Hypermedia database on the WebHypermedia database on the Web
Hypermedia database on the WebMelvin Balajadia
 
Introduction to Big Data & Hadoop
Introduction to Big Data & Hadoop Introduction to Big Data & Hadoop
Introduction to Big Data & Hadoop iACT Global
 
Big data analytics data structure
Big data analytics data structureBig data analytics data structure
Big data analytics data structureSubrahmanyam Gunda
 

What's hot (20)

Big data
Big dataBig data
Big data
 
An Introduction to Digital Preservation
An Introduction to Digital PreservationAn Introduction to Digital Preservation
An Introduction to Digital Preservation
 
International Journal on Cloud Computing: Services and Architecture (IJCCSA)
International Journal on Cloud Computing: Services and Architecture (IJCCSA)International Journal on Cloud Computing: Services and Architecture (IJCCSA)
International Journal on Cloud Computing: Services and Architecture (IJCCSA)
 
Big data tools
Big data toolsBig data tools
Big data tools
 
Dms 2020
Dms 2020Dms 2020
Dms 2020
 
Two-Phase TDS Approach for Data Anonymization To Preserving Bigdata Privacy
Two-Phase TDS Approach for Data Anonymization To Preserving Bigdata PrivacyTwo-Phase TDS Approach for Data Anonymization To Preserving Bigdata Privacy
Two-Phase TDS Approach for Data Anonymization To Preserving Bigdata Privacy
 
11th International conference on Database Management Systems (DMS 2020)
11th International conference on Database Management Systems (DMS 2020)11th International conference on Database Management Systems (DMS 2020)
11th International conference on Database Management Systems (DMS 2020)
 
Digital preservation: an introduction
Digital preservation: an introductionDigital preservation: an introduction
Digital preservation: an introduction
 
Data preservation
Data preservationData preservation
Data preservation
 
A Survey on Big Data Mining Challenges
A Survey on Big Data Mining ChallengesA Survey on Big Data Mining Challenges
A Survey on Big Data Mining Challenges
 
Digital Preservation
Digital PreservationDigital Preservation
Digital Preservation
 
Big data introduction
Big data introductionBig data introduction
Big data introduction
 
Sirris innovate2011 - Smart Products with smart data - introduction, Dr. Elen...
Sirris innovate2011 - Smart Products with smart data - introduction, Dr. Elen...Sirris innovate2011 - Smart Products with smart data - introduction, Dr. Elen...
Sirris innovate2011 - Smart Products with smart data - introduction, Dr. Elen...
 
Bigdata Analytics using Hadoop
Bigdata Analytics using HadoopBigdata Analytics using Hadoop
Bigdata Analytics using Hadoop
 
Big Data
Big DataBig Data
Big Data
 
Hypermedia database on the Web
Hypermedia database on the WebHypermedia database on the Web
Hypermedia database on the Web
 
Cassie findlay
Cassie findlayCassie findlay
Cassie findlay
 
Introduction to Big Data & Hadoop
Introduction to Big Data & Hadoop Introduction to Big Data & Hadoop
Introduction to Big Data & Hadoop
 
Big data analytics data structure
Big data analytics data structureBig data analytics data structure
Big data analytics data structure
 
Cloud computing
Cloud computingCloud computing
Cloud computing
 

Similar to The rise of big data on cloud computing

The rise of “Big Data” on cloud computing
The rise of “Big Data” on cloud computingThe rise of “Big Data” on cloud computing
The rise of “Big Data” on cloud computingMinhazul Arefin
 
UNIT 1 -BIG DATA ANALYTICS Full.pdf
UNIT 1 -BIG DATA ANALYTICS Full.pdfUNIT 1 -BIG DATA ANALYTICS Full.pdf
UNIT 1 -BIG DATA ANALYTICS Full.pdfvvpadhu
 
Data mining with big data
Data mining with big dataData mining with big data
Data mining with big datakk1718
 
MIS Big Data & Data Analytics.pptx
MIS Big Data & Data Analytics.pptxMIS Big Data & Data Analytics.pptx
MIS Big Data & Data Analytics.pptxDrSalehAlSaleh
 
06. 9534 14985-1-ed b edit dhyan
06. 9534 14985-1-ed b edit dhyan06. 9534 14985-1-ed b edit dhyan
06. 9534 14985-1-ed b edit dhyanIAESIJEECS
 
A Review Paper on Big Data and Hadoop for Data Science
A Review Paper on Big Data and Hadoop for Data ScienceA Review Paper on Big Data and Hadoop for Data Science
A Review Paper on Big Data and Hadoop for Data Scienceijtsrd
 
Protection of big data privacy
Protection of big data privacyProtection of big data privacy
Protection of big data privacyredpel dot com
 
The Internet of Things: What's next?
The Internet of Things: What's next? The Internet of Things: What's next?
The Internet of Things: What's next? PayamBarnaghi
 
A proposed Solution: Data Availability and Error Correction in Cloud Computing
A proposed Solution: Data Availability and Error Correction in Cloud ComputingA proposed Solution: Data Availability and Error Correction in Cloud Computing
A proposed Solution: Data Availability and Error Correction in Cloud ComputingCSCJournals
 
Big data security and privacy issues in the
Big data security and privacy issues in theBig data security and privacy issues in the
Big data security and privacy issues in theIJNSA Journal
 
BIG DATA SECURITY AND PRIVACY ISSUES IN THE CLOUD
BIG DATA SECURITY AND PRIVACY ISSUES IN THE CLOUD BIG DATA SECURITY AND PRIVACY ISSUES IN THE CLOUD
BIG DATA SECURITY AND PRIVACY ISSUES IN THE CLOUD IJNSA Journal
 
Delivering Faster Insights with a Logical Data Fabric
Delivering Faster Insights with a Logical Data FabricDelivering Faster Insights with a Logical Data Fabric
Delivering Faster Insights with a Logical Data FabricDenodo
 
BIG DATA IN CLOUD COMPUTING REVIEW AND OPPORTUNITIES
BIG DATA IN CLOUD COMPUTING REVIEW AND OPPORTUNITIESBIG DATA IN CLOUD COMPUTING REVIEW AND OPPORTUNITIES
BIG DATA IN CLOUD COMPUTING REVIEW AND OPPORTUNITIESijcsit
 
Big Data in Cloud Computing Review and Opportunities
Big Data in Cloud Computing Review and OpportunitiesBig Data in Cloud Computing Review and Opportunities
Big Data in Cloud Computing Review and OpportunitiesAIRCC Publishing Corporation
 
Net-Centric Data Strategy
Net-Centric Data StrategyNet-Centric Data Strategy
Net-Centric Data StrategyDaniel Risacher
 

Similar to The rise of big data on cloud computing (20)

The rise of “Big Data” on cloud computing
The rise of “Big Data” on cloud computingThe rise of “Big Data” on cloud computing
The rise of “Big Data” on cloud computing
 
UNIT 1 -BIG DATA ANALYTICS Full.pdf
UNIT 1 -BIG DATA ANALYTICS Full.pdfUNIT 1 -BIG DATA ANALYTICS Full.pdf
UNIT 1 -BIG DATA ANALYTICS Full.pdf
 
Data mining with big data
Data mining with big dataData mining with big data
Data mining with big data
 
MIS Big Data & Data Analytics.pptx
MIS Big Data & Data Analytics.pptxMIS Big Data & Data Analytics.pptx
MIS Big Data & Data Analytics.pptx
 
06. 9534 14985-1-ed b edit dhyan
06. 9534 14985-1-ed b edit dhyan06. 9534 14985-1-ed b edit dhyan
06. 9534 14985-1-ed b edit dhyan
 
Datamining with big data
 Datamining with big data  Datamining with big data
Datamining with big data
 
A Review Paper on Big Data and Hadoop for Data Science
A Review Paper on Big Data and Hadoop for Data ScienceA Review Paper on Big Data and Hadoop for Data Science
A Review Paper on Big Data and Hadoop for Data Science
 
big_data.ppt
big_data.pptbig_data.ppt
big_data.ppt
 
big_data.ppt
big_data.pptbig_data.ppt
big_data.ppt
 
big_data.ppt
big_data.pptbig_data.ppt
big_data.ppt
 
Protection of big data privacy
Protection of big data privacyProtection of big data privacy
Protection of big data privacy
 
The Internet of Things: What's next?
The Internet of Things: What's next? The Internet of Things: What's next?
The Internet of Things: What's next?
 
E018142329
E018142329E018142329
E018142329
 
A proposed Solution: Data Availability and Error Correction in Cloud Computing
A proposed Solution: Data Availability and Error Correction in Cloud ComputingA proposed Solution: Data Availability and Error Correction in Cloud Computing
A proposed Solution: Data Availability and Error Correction in Cloud Computing
 
Big data security and privacy issues in the
Big data security and privacy issues in theBig data security and privacy issues in the
Big data security and privacy issues in the
 
BIG DATA SECURITY AND PRIVACY ISSUES IN THE CLOUD
BIG DATA SECURITY AND PRIVACY ISSUES IN THE CLOUD BIG DATA SECURITY AND PRIVACY ISSUES IN THE CLOUD
BIG DATA SECURITY AND PRIVACY ISSUES IN THE CLOUD
 
Delivering Faster Insights with a Logical Data Fabric
Delivering Faster Insights with a Logical Data FabricDelivering Faster Insights with a Logical Data Fabric
Delivering Faster Insights with a Logical Data Fabric
 
BIG DATA IN CLOUD COMPUTING REVIEW AND OPPORTUNITIES
BIG DATA IN CLOUD COMPUTING REVIEW AND OPPORTUNITIESBIG DATA IN CLOUD COMPUTING REVIEW AND OPPORTUNITIES
BIG DATA IN CLOUD COMPUTING REVIEW AND OPPORTUNITIES
 
Big Data in Cloud Computing Review and Opportunities
Big Data in Cloud Computing Review and OpportunitiesBig Data in Cloud Computing Review and Opportunities
Big Data in Cloud Computing Review and Opportunities
 
Net-Centric Data Strategy
Net-Centric Data StrategyNet-Centric Data Strategy
Net-Centric Data Strategy
 

Recently uploaded

Computer Networks Basics of Network Devices
Computer Networks  Basics of Network DevicesComputer Networks  Basics of Network Devices
Computer Networks Basics of Network DevicesChandrakantDivate1
 
Thermal Engineering Unit - I & II . ppt
Thermal Engineering  Unit - I & II . pptThermal Engineering  Unit - I & II . ppt
Thermal Engineering Unit - I & II . pptDineshKumar4165
 
DeepFakes presentation : brief idea of DeepFakes
DeepFakes presentation : brief idea of DeepFakesDeepFakes presentation : brief idea of DeepFakes
DeepFakes presentation : brief idea of DeepFakesMayuraD1
 
Bhubaneswar🌹Call Girls Bhubaneswar ❤Komal 9777949614 💟 Full Trusted CALL GIRL...
Bhubaneswar🌹Call Girls Bhubaneswar ❤Komal 9777949614 💟 Full Trusted CALL GIRL...Bhubaneswar🌹Call Girls Bhubaneswar ❤Komal 9777949614 💟 Full Trusted CALL GIRL...
Bhubaneswar🌹Call Girls Bhubaneswar ❤Komal 9777949614 💟 Full Trusted CALL GIRL...Call Girls Mumbai
 
Rums floating Omkareshwar FSPV IM_16112021.pdf
Rums floating Omkareshwar FSPV IM_16112021.pdfRums floating Omkareshwar FSPV IM_16112021.pdf
Rums floating Omkareshwar FSPV IM_16112021.pdfsmsksolar
 
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXssuser89054b
 
"Lesotho Leaps Forward: A Chronicle of Transformative Developments"
"Lesotho Leaps Forward: A Chronicle of Transformative Developments""Lesotho Leaps Forward: A Chronicle of Transformative Developments"
"Lesotho Leaps Forward: A Chronicle of Transformative Developments"mphochane1998
 
Online food ordering system project report.pdf
Online food ordering system project report.pdfOnline food ordering system project report.pdf
Online food ordering system project report.pdfKamal Acharya
 
Minimum and Maximum Modes of microprocessor 8086
Minimum and Maximum Modes of microprocessor 8086Minimum and Maximum Modes of microprocessor 8086
Minimum and Maximum Modes of microprocessor 8086anil_gaur
 
Navigating Complexity: The Role of Trusted Partners and VIAS3D in Dassault Sy...
Navigating Complexity: The Role of Trusted Partners and VIAS3D in Dassault Sy...Navigating Complexity: The Role of Trusted Partners and VIAS3D in Dassault Sy...
Navigating Complexity: The Role of Trusted Partners and VIAS3D in Dassault Sy...Arindam Chakraborty, Ph.D., P.E. (CA, TX)
 
Employee leave management system project.
Employee leave management system project.Employee leave management system project.
Employee leave management system project.Kamal Acharya
 
Hostel management system project report..pdf
Hostel management system project report..pdfHostel management system project report..pdf
Hostel management system project report..pdfKamal Acharya
 
Thermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.pptThermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.pptDineshKumar4165
 
data_management_and _data_science_cheat_sheet.pdf
data_management_and _data_science_cheat_sheet.pdfdata_management_and _data_science_cheat_sheet.pdf
data_management_and _data_science_cheat_sheet.pdfJiananWang21
 
Online electricity billing project report..pdf
Online electricity billing project report..pdfOnline electricity billing project report..pdf
Online electricity billing project report..pdfKamal Acharya
 
scipt v1.pptxcxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx...
scipt v1.pptxcxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx...scipt v1.pptxcxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx...
scipt v1.pptxcxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx...HenryBriggs2
 
kiln thermal load.pptx kiln tgermal load
kiln thermal load.pptx kiln tgermal loadkiln thermal load.pptx kiln tgermal load
kiln thermal load.pptx kiln tgermal loadhamedmustafa094
 
A Study of Urban Area Plan for Pabna Municipality
A Study of Urban Area Plan for Pabna MunicipalityA Study of Urban Area Plan for Pabna Municipality
A Study of Urban Area Plan for Pabna MunicipalityMorshed Ahmed Rahath
 
Standard vs Custom Battery Packs - Decoding the Power Play
Standard vs Custom Battery Packs - Decoding the Power PlayStandard vs Custom Battery Packs - Decoding the Power Play
Standard vs Custom Battery Packs - Decoding the Power PlayEpec Engineered Technologies
 

Recently uploaded (20)

Computer Networks Basics of Network Devices
Computer Networks  Basics of Network DevicesComputer Networks  Basics of Network Devices
Computer Networks Basics of Network Devices
 
Thermal Engineering Unit - I & II . ppt
Thermal Engineering  Unit - I & II . pptThermal Engineering  Unit - I & II . ppt
Thermal Engineering Unit - I & II . ppt
 
DeepFakes presentation : brief idea of DeepFakes
DeepFakes presentation : brief idea of DeepFakesDeepFakes presentation : brief idea of DeepFakes
DeepFakes presentation : brief idea of DeepFakes
 
Bhubaneswar🌹Call Girls Bhubaneswar ❤Komal 9777949614 💟 Full Trusted CALL GIRL...
Bhubaneswar🌹Call Girls Bhubaneswar ❤Komal 9777949614 💟 Full Trusted CALL GIRL...Bhubaneswar🌹Call Girls Bhubaneswar ❤Komal 9777949614 💟 Full Trusted CALL GIRL...
Bhubaneswar🌹Call Girls Bhubaneswar ❤Komal 9777949614 💟 Full Trusted CALL GIRL...
 
Integrated Test Rig For HTFE-25 - Neometrix
Integrated Test Rig For HTFE-25 - NeometrixIntegrated Test Rig For HTFE-25 - Neometrix
Integrated Test Rig For HTFE-25 - Neometrix
 
Rums floating Omkareshwar FSPV IM_16112021.pdf
Rums floating Omkareshwar FSPV IM_16112021.pdfRums floating Omkareshwar FSPV IM_16112021.pdf
Rums floating Omkareshwar FSPV IM_16112021.pdf
 
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
 
"Lesotho Leaps Forward: A Chronicle of Transformative Developments"
"Lesotho Leaps Forward: A Chronicle of Transformative Developments""Lesotho Leaps Forward: A Chronicle of Transformative Developments"
"Lesotho Leaps Forward: A Chronicle of Transformative Developments"
 
Online food ordering system project report.pdf
Online food ordering system project report.pdfOnline food ordering system project report.pdf
Online food ordering system project report.pdf
 
Minimum and Maximum Modes of microprocessor 8086
Minimum and Maximum Modes of microprocessor 8086Minimum and Maximum Modes of microprocessor 8086
Minimum and Maximum Modes of microprocessor 8086
 
Navigating Complexity: The Role of Trusted Partners and VIAS3D in Dassault Sy...
Navigating Complexity: The Role of Trusted Partners and VIAS3D in Dassault Sy...Navigating Complexity: The Role of Trusted Partners and VIAS3D in Dassault Sy...
Navigating Complexity: The Role of Trusted Partners and VIAS3D in Dassault Sy...
 
Employee leave management system project.
Employee leave management system project.Employee leave management system project.
Employee leave management system project.
 
Hostel management system project report..pdf
Hostel management system project report..pdfHostel management system project report..pdf
Hostel management system project report..pdf
 
Thermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.pptThermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.ppt
 
data_management_and _data_science_cheat_sheet.pdf
data_management_and _data_science_cheat_sheet.pdfdata_management_and _data_science_cheat_sheet.pdf
data_management_and _data_science_cheat_sheet.pdf
 
Online electricity billing project report..pdf
Online electricity billing project report..pdfOnline electricity billing project report..pdf
Online electricity billing project report..pdf
 
scipt v1.pptxcxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx...
scipt v1.pptxcxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx...scipt v1.pptxcxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx...
scipt v1.pptxcxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx...
 
kiln thermal load.pptx kiln tgermal load
kiln thermal load.pptx kiln tgermal loadkiln thermal load.pptx kiln tgermal load
kiln thermal load.pptx kiln tgermal load
 
A Study of Urban Area Plan for Pabna Municipality
A Study of Urban Area Plan for Pabna MunicipalityA Study of Urban Area Plan for Pabna Municipality
A Study of Urban Area Plan for Pabna Municipality
 
Standard vs Custom Battery Packs - Decoding the Power Play
Standard vs Custom Battery Packs - Decoding the Power PlayStandard vs Custom Battery Packs - Decoding the Power Play
Standard vs Custom Battery Packs - Decoding the Power Play
 

The rise of big data on cloud computing

  • 1. The rise of “big data” on cloud computing Presenter Muhammad Maaz Irfan Reg No: 201824100008 School of Information Science and Engineering University of Jinan, Jinan, Shandong, China 1
  • 2. Contents I. Introduction II. Definition & characteristics of big data III. Cloud Computing IV. Relationship between cloud & big data V. Case studies VI. Big data storage system VII. Hadoop background VIII. Research Challenges I. Scalability II. Availability III. Data integrity IV. Transformation V. Data quality VI. Heterogeneity VII. Privacy VIII. Legal issues IX. Governance IX. Open Research issues X. Conclusion 2
  • 3. Introduction • The rise of social media, Internet of Things (IoT), and multimedia, has produced an overwhelming flow of data in either structured • Big data are characterized by three aspects: – Data are numerous – Data cannot be categorized into regular relational databases – Data are generated, captured, and processed rapidly. Moreover, big data is transforming healthcare, science, engineering, finance, business, and eventually, the society. 3
  • 4. Definition & characteristics of big data • Big data is a set of techniques and technologies that require new forms of integration to uncover large hidden values from large datasets that are – Diverse – Complex – & In massive scale 4
  • 5. Four Vs of big data 5 Big data Volume Variety Velocity Value
  • 8. 8
  • 9. Cloud Computing 9 • Cloud computing is a fast-growing technology that has established itself in the next generation of IT industry and business. • Cloud computing promises reliable software, hardware, and IaaS(Infrastructure as a Service) delivered over the Internet and remote data centers . • Cloud services have become a powerful architecture to perform complex large-scale computing tasks and span a range of IT functions from storage and computation to database and application services.
  • 11. Relationship between cloud & big data • Cloud computing and big data are conjoined. • Big data provides users the ability to use commodity computing to process distributed queries across multiple datasets and return resultant sets in a timely manner. • Cloud computing provides the underlying through the use of Hadoop, a class of distributed data-processing platforms. • Big data utilizes distributed storage technology based on cloud computing rather than local storage attached to a computer or electronic device. 11
  • 12. Cloud computing usage in big data 12
  • 13. Case studies • Studies provided by different vendors who integrate big data technologies into their cloud environment. 13
  • 14. Big data storage system 14
  • 15. Big data storage system • A storage architecture that can be accessed in a highly efficient manner while achieving availabilityand reliability is required to store and manage large datasets. • Existing technologies can be classified as direct attached storage (DAS), network attached storage (NAS), and storage area network (SAN). • The organizational systems of data storages (DAS, NAS, and SAN) can be divided into three parts: – (i) disc array, where the foundation of a storage system provides the fundamental guarantee – (ii) connection and network subsystems, which connect one or more disc arrays and servers – (iii) storage management software, which oversees data sharing, storage management, and disaster recovery tasks for multiple servers. 15
  • 16. background • Hadoop is an open-source software framework for storing data and running applications on clusters of commodity hardware. It provides massive storage for any kind of data, enormous processing power and the ability to handle virtually limitless concurrent tasks or jobs • The software framework that supports HDFS, MapReduce and other related entities is called the project Hadoop or simply Hadoop. • This is open source and distributed by Apache 16
  • 17. Features of Hadoop I. Cost Effective System II. Large Cluster of Notes III. Parallel Processing IV. Distributive Data V. Automatic failover management VI. Data Locality optimization VII.Heterogeneous Cluster VIII.Scalability 17
  • 18. Research Challenges I. Scalability = Scalability is the ability of the storage to handle increasing amounts of data in an appropriate manner. II. Availability = Availability refers to the resources of the system accessible on demand by an authorized individual III. Data integrity = A key aspect of big data security is integrity. Integrity means that data can be modified only by authorized parties or the data owner to prevent misuse. IV. Transformation = Transforming data into a form suitable for analysis is an obstacle in the adoption of big data. V. Data quality = In the past, data processing was typically performed on clean datasets from well-known and limited sources. VI. Heterogeneity = Variety, one of the major aspects of big data characterization, is the result of the growth of virtually unlimited different sources of data. VII.Privacy = Privacy concerns continue to hamper users who outsource their private data into the cloud storage. VIII.Legal issues = Specific laws and regulations must be established to preserve the personal and sensitive information of users. IX. Governance = Data governance embodies the exercise of control and authority over data-related rules of law, transparency. 18
  • 19. Conclusion • Two IT initiatives are currently top of mind for organizations across the globe i.e. – Big Data Analytics – Cloud Computing As a delivery model for IT services , cloud computing has the potential to enhance business agility and productivity while enabling greater efficiencies and reducing costs. • In the current scenario , Big Data is a big challenge for the organizations . To store and process such large volume of data , variety of data and velocity of data Hadoop came into existence. • Our presentation is all about Cloud Computing , Big Data & Big Data Analytics. 19
  • 20. Thanks for your time  If you any Questions please ask. 20