SlideShare a Scribd company logo
1 of 30
Download to read offline
National Engineering College,
K.R.Nagar, Kovilpatti
(An Autonomous Institution, Affiliated to Anna University, Chennai)
Department of Artificial Intelligence& Data Science
CLOUD & BIG DATA
Dr.V.Kalaivani, M.E.,Ph.D.,
Professor & HOD/AI & DS
 Introduction
 Why Cloud Computing
 Benefits of Cloud Computing
 Characteristics
 Advantages of Cloud Computing
 Disadvantages of Cloud
Computing
 How Cloud Computing Works
 Challenges of Cloud Computing
 Layers of Cloud Computing
 Components of Cloud Computing
 Big Data
 3 Vs of Big Data
 Importance of Big Data
 What Comes Under Big Data
 Hadoop
 Hadoop Architecture
 Hadoop With Big Data
 Map Reduce
 Why Data Analytics
 Types of Analysis
 Types of Data Analytics
 Big Data Analytics
 Conclusion
 References
 Thanking You
2
Cloud computing is an internet based computer
technology. It is the next stage technology that
uses the clouds to provide the services
whenever and wherever the user need it. It
provides a method to access several servers
world wide.
What is Cloud?
A cloud is a combination of networks,
hardware, services, storage, and interfaces
that helps in delivering computing as a
service.
What is Cloud Computing ?
3
Why Cloud Computing?
Without Cloud Computing With Cloud Computing
4
Benefits of Cloud Computing
 Cloud computing enables companies and
applications, which are system
infrastructure dependent, to be
infrastructure-less.
 By using the Cloud infrastructure on “pay
as used and on demand”, all of us can save
in capital and operational investment!
 Clients can:-
 Put their data on the platform instead of on their
own desktop PCs and/or on their own servers.
 They can put their applications on the cloud and
use the servers within the cloud to do processing
and data manipulations etc.
5
Agile
Highly Reliable
Independent of Device
and Location
Low Cost
Pay-Per-Use
Easy to Maintain
Highly Scalable
Multi-Shared
6
Advantages of Cloud Computing
 Lower cost computer users
 Lower IT infrastructure
 Fewer Maintenance cost
 Lower Software Cost
 Instant Software updates
 Increased Computing Powers
 Unlimited storage capacity
7
Disadvantages of Cloud Computing
 Requires a constant Internet
connection
 Stored data might not be secured
 Limited control and flexibility
 More risk on information leakage
 Users cannot be aware of the
network
 Dependencies on service suppliers for
implementing data management
8
 Use of cloud computing means dependence on
others and that could possibly limit flexibility
and innovation
 Security could prove to be a big issue:
 It is still unclear how safe out-sourced data is and when using these services
 Ownership of data is not always clear.
 Data Centre can become environmental
hazards: Green Cloud
 Cloud Interoperability is still an issue.
Layers of Cloud Computing
 Infrastructure as a service (IaaS):-It provides cloud infrastructure
in terms of hardware as like memory, processor, speed etc.
 Platform as a service (PaaS):It provides cloud application
platform for the developer.
 Software as a service (SaaS)::It provides the cloud applications
to users directly without installing anything on the system.
These applications remains on cloud.
Components Of Cloud Computing
Big Data
Big Data refers to a collection of data sets so large
and complex. It is impossible to process them with
the usual databases and tools because of its size and
associated numbers. Big data is hard to capture, store,
search, share, analyze and visualize.
3 Vs of Big Data
 The “BIG” in big data isn’t just about volume
 Volume
 Variety
 Velocity
Importance of Big Data
The importance of big data does not revolve around how much data you have ,
but what you do with it.
You can take data from any source and analyze it to find answer that enables,
 Cost reductions.
 Time reductions.
 New product development and optimized offerings .
 Smart decision making.
 Black Box Data
 Social Media Data
 Stock Exchange Data
 Power Grid Data
 Transport Data
 Search Engine Data
 Structured data
 Semi Structured data
 Unstructured data
What is Hadoop ?
 Hadoop is an open-source software framework for storing
data and running applications on clusters of commodity
hardware. It provides massive storage for any kind of
data, enormous processing power and the ability to handle
virtually limitless concurrent tasks or jobs.
 The software framework that supports HDFS,
MapReduce and other related entities is called the project
Hadoop or simply Hadoop.
 This is open source and distributed by Apache.
Hadoop Ecosystem
Apache Oozie (Workflow)
Pig Latin
Data Analysis
Mahout
Machine Learning
HDFS (Hadoop Distributed File System)
Map Reduce Framework
Flume Sqoop
Unstructured or
Semi-Structured data
Structured data
Pig Latin
Data Analysis
Mahout
Machine Learning
H Base
Hive
DW System
With Big Data
Hadoop is the core platform for
structuring Big Data, and solves the
problem of formatting it for
subsequent analytics
purposes. Hadoop uses a distributed
computing architecture consisting of
multiple servers using commodity
hardware, making it relatively
Cost Effective System
Large Cluster of Notes
Parallel Processing
Distributive Data
Automatic failover management
Data Locality optimization
Heterogeneous Cluster
Scalability
Map Reduce
MapReduce is a programming model that Google has used
successfully in processing its “big-data” sets (~ 20000 peta bytes
per day)
 A map function extracts some intelligence from
raw data.
 A reduce function aggregates according to some
guides the data output by the map.
 Users specify the computation in terms of a map
and a reduce function,
 Underlying runtime system automatically
parallelizes the computation across large-scale
clusters of machines, and
 Underlying system also handles machine failures,
efficient communications, and performance issues.
Broken into pieces
[ MAP ]
Computation
Computation
Computation
Computation
Computation
Computation
Shuffle and Sort
Why Data Analysis?
It is important to remember that the primary
value from big data does not come from the
data in its raw form but from the processing
and analysis of it and the insights, products
and services that emerge from analysis.
For unstructured data to be useful it must be analysed to extract and
expose the information it contains
Different types of analysis are possible, such as:-
 Entity analysis – people, organisations, objects and events, and the relationships
between them
 Topic analysis – topics or themes, and their relative importance
 Sentiment analysis – subjective view of a person to a particular topic
 Feature analysis – Inherent characteristics that are significant for a particular analytical
perspective (e.g. land coverage in satellite imagery)
Types Of Analysis
Types Of Data Analytics
Analytic Excellence leads to better decisions:-
 Descriptive Analytics : What is happening?
 Diagnostic Analytics : Why did it happen?
 Predictive Analytics : What is likely going to
happen?
 Prescriptive Analytics : What should we do about it?
Analytics
 Focus On :-
 Predictive Analysis
 Data Science
 Data Sets:-
 Large Scale Data Sets
 More type of Data
 Raw Data
 Complex Data Models
 Supports:-
 Correlations – new insight more accurate answer
 Two IT initiatives are currently top of mind for organizations across the globe i.e.
 Big Data Analytics
 Cloud Computing
 As a delivery model for IT services , cloud computing has the potential to enhance
business agility and productivity while enabling greater efficiencies and reducing
costs.
 In the current scenario , Big Data is a big challenge for the organizations .
To store and process such large volume of data , variety of data and velocity of data
Hadoop came into existence.
 Our presentation is all about Cloud Computing , Big Data & Big Data Analytics.
www.slideshare.com/cloud&bigdata
www.hadooptutorial.com
www.javatpoint.com/cloudcomputing
www.ibm.com/ibm/academy
Cloud and Bid data Dr.VK.pdf
Cloud and Bid data Dr.VK.pdf

More Related Content

Similar to Cloud and Bid data Dr.VK.pdf

using big-data methods analyse the Cross platform aviation
 using big-data methods analyse the Cross platform aviation using big-data methods analyse the Cross platform aviation
using big-data methods analyse the Cross platform aviation
ranjit banshpal
 

Similar to Cloud and Bid data Dr.VK.pdf (20)

Big data analysis concepts and references
Big data analysis concepts and referencesBig data analysis concepts and references
Big data analysis concepts and references
 
Platform for Big Data Analytics and Visual Analytics: CSIRO use cases. Februa...
Platform for Big Data Analytics and Visual Analytics: CSIRO use cases. Februa...Platform for Big Data Analytics and Visual Analytics: CSIRO use cases. Februa...
Platform for Big Data Analytics and Visual Analytics: CSIRO use cases. Februa...
 
Bar camp bigdata
Bar camp bigdataBar camp bigdata
Bar camp bigdata
 
Big Data Testing Using Hadoop Platform
Big Data Testing Using Hadoop PlatformBig Data Testing Using Hadoop Platform
Big Data Testing Using Hadoop Platform
 
Big Data & Hadoop
Big Data & HadoopBig Data & Hadoop
Big Data & Hadoop
 
Big data and Hadoop overview
Big data and Hadoop overviewBig data and Hadoop overview
Big data and Hadoop overview
 
hadoop seminar training report
hadoop seminar  training reporthadoop seminar  training report
hadoop seminar training report
 
using big-data methods analyse the Cross platform aviation
 using big-data methods analyse the Cross platform aviation using big-data methods analyse the Cross platform aviation
using big-data methods analyse the Cross platform aviation
 
Big Data Analytics
Big Data AnalyticsBig Data Analytics
Big Data Analytics
 
HIGH LEVEL VIEW OF CLOUD SECURITY: ISSUES AND SOLUTIONS
HIGH LEVEL VIEW OF CLOUD SECURITY: ISSUES AND SOLUTIONSHIGH LEVEL VIEW OF CLOUD SECURITY: ISSUES AND SOLUTIONS
HIGH LEVEL VIEW OF CLOUD SECURITY: ISSUES AND SOLUTIONS
 
High level view of cloud security
High level view of cloud securityHigh level view of cloud security
High level view of cloud security
 
Big data with hadoop
Big data with hadoopBig data with hadoop
Big data with hadoop
 
Lesson 1 introduction to_big_data_and_hadoop.pptx
Lesson 1 introduction to_big_data_and_hadoop.pptxLesson 1 introduction to_big_data_and_hadoop.pptx
Lesson 1 introduction to_big_data_and_hadoop.pptx
 
Lecture 5 - Big Data and Hadoop Intro.ppt
Lecture 5 - Big Data and Hadoop Intro.pptLecture 5 - Big Data and Hadoop Intro.ppt
Lecture 5 - Big Data and Hadoop Intro.ppt
 
Hadoop and Big Data Analytics | Sysfore
Hadoop and Big Data Analytics | SysforeHadoop and Big Data Analytics | Sysfore
Hadoop and Big Data Analytics | Sysfore
 
Big data: Descoberta de conhecimento em ambientes de big data e computação na...
Big data: Descoberta de conhecimento em ambientes de big data e computação na...Big data: Descoberta de conhecimento em ambientes de big data e computação na...
Big data: Descoberta de conhecimento em ambientes de big data e computação na...
 
Cloud computing
Cloud computingCloud computing
Cloud computing
 
Lecture4 big data technology foundations
Lecture4 big data technology foundationsLecture4 big data technology foundations
Lecture4 big data technology foundations
 
Big Data with Hadoop – For Data Management, Processing and Storing
Big Data with Hadoop – For Data Management, Processing and StoringBig Data with Hadoop – For Data Management, Processing and Storing
Big Data with Hadoop – For Data Management, Processing and Storing
 
The Six pillars for Building big data analytics ecosystems
The Six pillars for Building big data analytics ecosystemsThe Six pillars for Building big data analytics ecosystems
The Six pillars for Building big data analytics ecosystems
 

More from kalai75

More from kalai75 (13)

Python ppt.pdf
Python ppt.pdfPython ppt.pdf
Python ppt.pdf
 
ppt final.pptx
ppt final.pptxppt final.pptx
ppt final.pptx
 
R Programming.pptx
R Programming.pptxR Programming.pptx
R Programming.pptx
 
Q-Step_WS_06112019_Data_Analysis_and_visualisation_with_Python.pptx
Q-Step_WS_06112019_Data_Analysis_and_visualisation_with_Python.pptxQ-Step_WS_06112019_Data_Analysis_and_visualisation_with_Python.pptx
Q-Step_WS_06112019_Data_Analysis_and_visualisation_with_Python.pptx
 
Lec1cgu13updated.ppt
Lec1cgu13updated.pptLec1cgu13updated.ppt
Lec1cgu13updated.ppt
 
2 ppt.pptx
2 ppt.pptx2 ppt.pptx
2 ppt.pptx
 
COA Precourse.docx
COA Precourse.docxCOA Precourse.docx
COA Precourse.docx
 
print.pptx
print.pptxprint.pptx
print.pptx
 
Linear algebra
Linear algebraLinear algebra
Linear algebra
 
Ppt1
Ppt1Ppt1
Ppt1
 
Applications of data mining in bioinformatics
Applications of data mining in bioinformaticsApplications of data mining in bioinformatics
Applications of data mining in bioinformatics
 
Atal trivandrum academy
Atal trivandrum academyAtal trivandrum academy
Atal trivandrum academy
 
Atal trivandrum academy
Atal trivandrum academyAtal trivandrum academy
Atal trivandrum academy
 

Recently uploaded

Call Girls In Bangalore ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bangalore ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bangalore ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bangalore ☎ 7737669865 🥵 Book Your One night Stand
amitlee9823
 
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Christo Ananth
 
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Christo Ananth
 

Recently uploaded (20)

Java Programming :Event Handling(Types of Events)
Java Programming :Event Handling(Types of Events)Java Programming :Event Handling(Types of Events)
Java Programming :Event Handling(Types of Events)
 
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
 
Call Girls In Bangalore ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bangalore ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bangalore ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bangalore ☎ 7737669865 🥵 Book Your One night Stand
 
NFPA 5000 2024 standard .
NFPA 5000 2024 standard                                  .NFPA 5000 2024 standard                                  .
NFPA 5000 2024 standard .
 
UNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its PerformanceUNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its Performance
 
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
 
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
 
University management System project report..pdf
University management System project report..pdfUniversity management System project report..pdf
University management System project report..pdf
 
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
 
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...
 
Online banking management system project.pdf
Online banking management system project.pdfOnline banking management system project.pdf
Online banking management system project.pdf
 
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
 
(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7
(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7
(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7
 
BSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptx
BSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptxBSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptx
BSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptx
 
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete RecordCCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
 
(INDIRA) Call Girl Meerut Call Now 8617697112 Meerut Escorts 24x7
(INDIRA) Call Girl Meerut Call Now 8617697112 Meerut Escorts 24x7(INDIRA) Call Girl Meerut Call Now 8617697112 Meerut Escorts 24x7
(INDIRA) Call Girl Meerut Call Now 8617697112 Meerut Escorts 24x7
 
Double rodded leveling 1 pdf activity 01
Double rodded leveling 1 pdf activity 01Double rodded leveling 1 pdf activity 01
Double rodded leveling 1 pdf activity 01
 
Call Girls Wakad Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Wakad Call Me 7737669865 Budget Friendly No Advance BookingCall Girls Wakad Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Wakad Call Me 7737669865 Budget Friendly No Advance Booking
 
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...
 
Call Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance BookingCall Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance Booking
 

Cloud and Bid data Dr.VK.pdf

  • 1. National Engineering College, K.R.Nagar, Kovilpatti (An Autonomous Institution, Affiliated to Anna University, Chennai) Department of Artificial Intelligence& Data Science CLOUD & BIG DATA Dr.V.Kalaivani, M.E.,Ph.D., Professor & HOD/AI & DS
  • 2.  Introduction  Why Cloud Computing  Benefits of Cloud Computing  Characteristics  Advantages of Cloud Computing  Disadvantages of Cloud Computing  How Cloud Computing Works  Challenges of Cloud Computing  Layers of Cloud Computing  Components of Cloud Computing  Big Data  3 Vs of Big Data  Importance of Big Data  What Comes Under Big Data  Hadoop  Hadoop Architecture  Hadoop With Big Data  Map Reduce  Why Data Analytics  Types of Analysis  Types of Data Analytics  Big Data Analytics  Conclusion  References  Thanking You 2
  • 3. Cloud computing is an internet based computer technology. It is the next stage technology that uses the clouds to provide the services whenever and wherever the user need it. It provides a method to access several servers world wide. What is Cloud? A cloud is a combination of networks, hardware, services, storage, and interfaces that helps in delivering computing as a service. What is Cloud Computing ? 3
  • 4. Why Cloud Computing? Without Cloud Computing With Cloud Computing 4
  • 5. Benefits of Cloud Computing  Cloud computing enables companies and applications, which are system infrastructure dependent, to be infrastructure-less.  By using the Cloud infrastructure on “pay as used and on demand”, all of us can save in capital and operational investment!  Clients can:-  Put their data on the platform instead of on their own desktop PCs and/or on their own servers.  They can put their applications on the cloud and use the servers within the cloud to do processing and data manipulations etc. 5
  • 6. Agile Highly Reliable Independent of Device and Location Low Cost Pay-Per-Use Easy to Maintain Highly Scalable Multi-Shared 6
  • 7. Advantages of Cloud Computing  Lower cost computer users  Lower IT infrastructure  Fewer Maintenance cost  Lower Software Cost  Instant Software updates  Increased Computing Powers  Unlimited storage capacity 7
  • 8. Disadvantages of Cloud Computing  Requires a constant Internet connection  Stored data might not be secured  Limited control and flexibility  More risk on information leakage  Users cannot be aware of the network  Dependencies on service suppliers for implementing data management 8
  • 9.
  • 10.  Use of cloud computing means dependence on others and that could possibly limit flexibility and innovation  Security could prove to be a big issue:  It is still unclear how safe out-sourced data is and when using these services  Ownership of data is not always clear.  Data Centre can become environmental hazards: Green Cloud  Cloud Interoperability is still an issue.
  • 11. Layers of Cloud Computing  Infrastructure as a service (IaaS):-It provides cloud infrastructure in terms of hardware as like memory, processor, speed etc.  Platform as a service (PaaS):It provides cloud application platform for the developer.  Software as a service (SaaS)::It provides the cloud applications to users directly without installing anything on the system. These applications remains on cloud.
  • 12. Components Of Cloud Computing
  • 13. Big Data Big Data refers to a collection of data sets so large and complex. It is impossible to process them with the usual databases and tools because of its size and associated numbers. Big data is hard to capture, store, search, share, analyze and visualize.
  • 14. 3 Vs of Big Data  The “BIG” in big data isn’t just about volume  Volume  Variety  Velocity
  • 15. Importance of Big Data The importance of big data does not revolve around how much data you have , but what you do with it. You can take data from any source and analyze it to find answer that enables,  Cost reductions.  Time reductions.  New product development and optimized offerings .  Smart decision making.
  • 16.  Black Box Data  Social Media Data  Stock Exchange Data  Power Grid Data  Transport Data  Search Engine Data  Structured data  Semi Structured data  Unstructured data
  • 17. What is Hadoop ?  Hadoop is an open-source software framework for storing data and running applications on clusters of commodity hardware. It provides massive storage for any kind of data, enormous processing power and the ability to handle virtually limitless concurrent tasks or jobs.  The software framework that supports HDFS, MapReduce and other related entities is called the project Hadoop or simply Hadoop.  This is open source and distributed by Apache.
  • 18. Hadoop Ecosystem Apache Oozie (Workflow) Pig Latin Data Analysis Mahout Machine Learning HDFS (Hadoop Distributed File System) Map Reduce Framework Flume Sqoop Unstructured or Semi-Structured data Structured data Pig Latin Data Analysis Mahout Machine Learning H Base Hive DW System
  • 19. With Big Data Hadoop is the core platform for structuring Big Data, and solves the problem of formatting it for subsequent analytics purposes. Hadoop uses a distributed computing architecture consisting of multiple servers using commodity hardware, making it relatively
  • 20. Cost Effective System Large Cluster of Notes Parallel Processing Distributive Data Automatic failover management Data Locality optimization Heterogeneous Cluster Scalability
  • 21. Map Reduce MapReduce is a programming model that Google has used successfully in processing its “big-data” sets (~ 20000 peta bytes per day)  A map function extracts some intelligence from raw data.  A reduce function aggregates according to some guides the data output by the map.  Users specify the computation in terms of a map and a reduce function,  Underlying runtime system automatically parallelizes the computation across large-scale clusters of machines, and  Underlying system also handles machine failures, efficient communications, and performance issues.
  • 22. Broken into pieces [ MAP ] Computation Computation Computation Computation Computation Computation Shuffle and Sort
  • 23. Why Data Analysis? It is important to remember that the primary value from big data does not come from the data in its raw form but from the processing and analysis of it and the insights, products and services that emerge from analysis.
  • 24. For unstructured data to be useful it must be analysed to extract and expose the information it contains Different types of analysis are possible, such as:-  Entity analysis – people, organisations, objects and events, and the relationships between them  Topic analysis – topics or themes, and their relative importance  Sentiment analysis – subjective view of a person to a particular topic  Feature analysis – Inherent characteristics that are significant for a particular analytical perspective (e.g. land coverage in satellite imagery) Types Of Analysis
  • 25. Types Of Data Analytics Analytic Excellence leads to better decisions:-  Descriptive Analytics : What is happening?  Diagnostic Analytics : Why did it happen?  Predictive Analytics : What is likely going to happen?  Prescriptive Analytics : What should we do about it?
  • 26. Analytics  Focus On :-  Predictive Analysis  Data Science  Data Sets:-  Large Scale Data Sets  More type of Data  Raw Data  Complex Data Models  Supports:-  Correlations – new insight more accurate answer
  • 27.  Two IT initiatives are currently top of mind for organizations across the globe i.e.  Big Data Analytics  Cloud Computing  As a delivery model for IT services , cloud computing has the potential to enhance business agility and productivity while enabling greater efficiencies and reducing costs.  In the current scenario , Big Data is a big challenge for the organizations . To store and process such large volume of data , variety of data and velocity of data Hadoop came into existence.  Our presentation is all about Cloud Computing , Big Data & Big Data Analytics.