Mr. Kailash Shaw [ HOD ( CSE DEPT.) ]
Mrinal Kumar - 1301292599
Pranav Kumar - 1301292603
 Introduction
 Why Cloud Computing
 Benefits of Cloud Computing
 Characteristics
 Advantages of Cloud Computing
 Disadvantages of Cloud Computing
 How Cloud Computing Works
 Challenges of Cloud Computing
 Layers of Cloud Computing
 Components of Cloud Computing
 Big Data
 3 Vs of Big Data
 Importance of Big Data
 What Comes Under Big Data
 Hadoop
 Hadoop Architecture
 Hadoop With Big Data
 Map Reduce
 Why Data Analytics
 Types of Analysis
 Types of Data Analytics
 Big Data Analytics
 Conclusion
 References
 Thanking You
What is Cloud Computing?
Cloud computing is an Internet-based computing technology. It is a
next-stage technology that uses the cloud to provide services
whenever and wherever the user needs them. It
provides a way to access servers located around the world.
What is Cloud?
A cloud is a combination of networks,
hardware, services, storage, and interfaces
that helps in delivering computing as a
service.
Why Cloud Computing?
[Figure: computing without cloud computing compared with computing with cloud computing]
Benefits of Cloud Computing
 Cloud computing enables companies and
applications that depend on system
infrastructure to become
infrastructure-less.
 By using cloud infrastructure on a pay-as-used,
on-demand basis, organizations can save
on capital and operational investment.
 Clients can:
 Put their data on the platform instead of on their
own desktop PCs or servers.
 Put their applications on the cloud and
use the servers within the cloud for processing,
data manipulation, etc.
Characteristics of Cloud Computing
 Agile
 Highly reliable
 Independent of device and location
 Low cost
 Pay-per-use
 Easy to maintain
 Highly scalable
 Multi-shared
Advantages of Cloud Computing
 Lower-cost computers for users
 Lower IT infrastructure costs
 Lower maintenance costs
 Lower software costs
 Instant software updates
 Increased computing power
 Virtually unlimited storage capacity
Disadvantages of Cloud Computing
 Requires a constant Internet
connection
 Stored data might not be secure
 Limited control and flexibility
 Higher risk of information leakage
 Users have little visibility into the
underlying network
 Dependence on service providers for
implementing data management
Challenges of Cloud Computing
 Using cloud computing means depending on
others, which could limit flexibility
and innovation.
 Security could prove to be a big issue:
 It is still unclear how safe outsourced data is, and when using these services,
ownership of data is not always clear.
 Data centres can become environmental
hazards: Green Cloud.
 Cloud interoperability is still an issue.
Layers of Cloud Computing
 Infrastructure as a Service (IaaS): provides cloud infrastructure
in the form of hardware resources such as memory, processors, and processing speed.
 Platform as a Service (PaaS): provides a cloud application
platform for developers.
 Software as a Service (SaaS): provides cloud applications
directly to users without installing anything on their systems.
These applications remain in the cloud.
Components Of Cloud Computing
Big Data
Big Data refers to a collection of data sets so large
and complex that it is impossible to process them with
the usual databases and tools. Because of their size and
complexity, such data sets are hard to capture, store,
search, share, analyze and visualize.
3 Vs of Big Data
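 The “BIG” in big data isn’t just about volume:
 Volume – the amount of data generated and stored
 Variety – the range of data types and sources
 Velocity – the speed at which data is generated and processed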
Importance of Big Data
The importance of big data does not revolve around how much data you have,
but around what you do with it.
You can take data from any source and analyze it to find answers that enable:
 Cost reductions.
 Time reductions.
 New product development and optimized offerings.
 Smart decision making.
What Comes Under Big Data
 Black Box Data
 Social Media Data
 Stock Exchange Data
 Power Grid Data
 Transport Data
 Search Engine Data
 Structured data
 Semi-structured data
 Unstructured data
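The difference between structured, semi-structured and unstructured data can be illustrated with a minimal Python sketch. The records, field names and values below are hypothetical and chosen only for illustration; they are not taken from the presentation.

```python
import csv
import io
import json

# Structured: fixed columns, as in a relational table or a CSV file.
# (Hypothetical sample record.)
structured = io.StringIO("id,name,amount\n1,Asha,250\n")
row = next(csv.DictReader(structured))

# Semi-structured: self-describing fields that may vary per record (JSON).
semi_structured = json.loads(
    '{"id": 2, "name": "Ravi", "tags": ["new", "mobile"]}'
)

# Unstructured: free text with no predefined fields.
unstructured = "Ravi called about a delayed order and asked for a refund."

if __name__ == "__main__":
    print(row["amount"])               # field access by column name
    print(semi_structured["tags"])     # nested field that other records may lack
    print(len(unstructured.split()))   # only generic text operations apply directly
```

Structured and semi-structured records can be queried by field name, while unstructured text first has to be analysed before it yields comparable fields.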
What is Hadoop?
 Hadoop is an open-source software framework for storing
data and running applications on clusters of commodity
hardware. It provides massive storage for any kind of
data, enormous processing power and the ability to handle
virtually limitless concurrent tasks or jobs.
 The software framework that supports HDFS,
MapReduce and other related entities is called the
Hadoop project, or simply Hadoop.
 It is open source and distributed by the Apache Software Foundation.
Hadoop Ecosystem
 Apache Oozie (workflow)
 Pig Latin (data analysis)
 Mahout (machine learning)
 Hive (DW system)
 HBase
 MapReduce framework
 HDFS (Hadoop Distributed File System)
 Flume (unstructured or semi-structured data)
 Sqoop (structured data)
Hadoop With Big Data
Hadoop is the core platform for
structuring Big Data, and it solves the
problem of formatting that data for
subsequent analytics. Hadoop uses a distributed
computing architecture consisting of
multiple servers built on commodity
hardware, making it a relatively
cost-effective system. Its key features include:
 Large cluster of nodes
 Parallel processing
 Distributed data
 Automatic failover management
 Data locality optimization
 Heterogeneous cluster
 Scalability
Map Reduce
MapReduce is a programming model that Google has used
successfully to process its “big data” sets (about 20 petabytes,
i.e. roughly 20,000 terabytes, per day).
 A map function extracts some intelligence from
raw data.
 A reduce function aggregates the data output by the map
according to some grouping rule, such as a shared key.
 Users specify the computation in terms of a map
and a reduce function.
 The underlying runtime system automatically
parallelizes the computation across large-scale
clusters of machines.
 The underlying system also handles machine failures,
efficient communication, and performance issues.
[Figure: MapReduce data flow. The input is broken into pieces, the map stage runs a computation on each piece in parallel, and the results pass through shuffle and sort.]
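The map, shuffle-and-sort and reduce stages can be sketched in plain Python with a word count. This is a single-process illustration of the programming model only, not Hadoop's API: the function names and the sample input are assumptions made for this sketch, and a real framework would run the map and reduce calls in parallel across a cluster.

```python
from collections import defaultdict
from itertools import chain


def map_words(line):
    """Map step: emit a (word, 1) pair for every word in one input piece."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle_and_sort(pairs):
    """Shuffle and sort step: group all values by key, ordered by key."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return sorted(groups.items())


def reduce_counts(key, values):
    """Reduce step: aggregate all values that were emitted for one key."""
    return key, sum(values)


def word_count(lines):
    """Run map, shuffle/sort and reduce in sequence over the input pieces."""
    mapped = chain.from_iterable(map_words(line) for line in lines)
    grouped = shuffle_and_sort(mapped)
    return dict(reduce_counts(key, values) for key, values in grouped)


if __name__ == "__main__":
    # Hypothetical input, already broken into two pieces.
    pieces = ["big data in the cloud", "big data with Hadoop"]
    print(word_count(pieces))
```

Because each call to `map_words` depends only on its own piece, and each call to `reduce_counts` only on its own key, a runtime is free to distribute those calls across machines; only the shuffle-and-sort stage needs to move data between them.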
Why Data Analysis?
It is important to remember that the primary
value from big data does not come from the
data in its raw form but from the processing
and analysis of it and the insights, products
and services that emerge from analysis.
Types Of Analysis
For unstructured data to be useful, it must be analysed to extract and
expose the information it contains.
Different types of analysis are possible, such as:
 Entity analysis – people, organisations, objects and events, and the relationships
between them
 Topic analysis – topics or themes, and their relative importance
 Sentiment analysis – a person's subjective view of a particular topic
 Feature analysis – inherent characteristics that are significant for a particular analytical
perspective (e.g. land coverage in satellite imagery)
Types Of Data Analytics
Analytic excellence leads to better decisions:
 Descriptive Analytics: What is happening?
 Diagnostic Analytics: Why did it happen?
 Predictive Analytics: What is likely to
happen?
 Prescriptive Analytics: What should we do about it?
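The four questions can be made concrete with a toy Python sketch. All figures, variable names and the capacity threshold below are hypothetical, invented only to show which kind of computation answers which question; real analytics would use far richer data and models.

```python
from statistics import mean

sales = [100, 110, 120, 130]   # hypothetical units sold per month
ad_spend = [10, 11, 12, 13]    # hypothetical advertising spend per month
capacity = 135                 # hypothetical monthly production capacity

# Descriptive analytics: what is happening? Summarise the observed data.
average_sales = mean(sales)

# Diagnostic analytics: why did it happen?
# Here: did sales rise in exactly the months in which ad spend rose?
sales_follow_ads = all(
    (s2 > s1) == (a2 > a1)
    for s1, s2, a1, a2 in zip(sales, sales[1:], ad_spend, ad_spend[1:])
)

# Predictive analytics: what is likely to happen?
# Naive forecast: extend the average month-over-month change by one month.
average_change = mean([b - a for a, b in zip(sales, sales[1:])])
forecast = sales[-1] + average_change

# Prescriptive analytics: what should we do about it?
action = "increase capacity" if forecast > capacity else "keep capacity"

if __name__ == "__main__":
    print(average_sales, sales_follow_ads, forecast, action)
```

Each step builds on the previous one: the summary describes the data, the comparison suggests a cause, the trend projects it forward, and the rule turns the projection into a recommendation.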
Big Data Analytics
 Focus on:
 Predictive analysis
 Data science
 Data sets:
 Large-scale data sets
 More types of data
 Raw data
 Complex data models
 Supports:
 Correlations – new insights and more accurate answers
Conclusion
 Two IT initiatives are currently top of mind for organizations across the globe:
 Big Data Analytics
 Cloud Computing
 As a delivery model for IT services, cloud computing has the potential to enhance
business agility and productivity while enabling greater efficiencies and reducing
costs.
 Big Data is currently a big challenge for organizations.
Hadoop came into existence to store and process data of such large volume, variety and
velocity.
 Our presentation is all about Cloud Computing, Big Data and Big Data Analytics.
www.slideshare.com/cloud&bigdata
www.hadooptutorial.com
www.javatpoint.com/cloudcomputing
www.ibm.com/ibm/academy
Cloud Computing & Big Data
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered QualitySoftware Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
 
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
 
PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)
 
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
 
DevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA ConnectDevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA Connect
 
Essentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with ParametersEssentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with Parameters
 

Cloud Computing & Big Data

  • 1. Mr. Kailash Shaw [HOD (CSE Dept.)]; Mrinal Kumar - 1301292599; Pranav Kumar - 1301292603
  • 2. Introduction; Why Cloud Computing; Benefits of Cloud Computing; Characteristics; Advantages of Cloud Computing; Disadvantages of Cloud Computing; How Cloud Computing Works; Challenges of Cloud Computing; Layers of Cloud Computing; Components of Cloud Computing; Big Data; 3 Vs of Big Data; Importance of Big Data; What Comes Under Big Data; Hadoop; Hadoop Architecture; Hadoop With Big Data; Map Reduce; Why Data Analytics; Types of Analysis; Types of Data Analytics; Big Data Analytics; Conclusion; References; Thanking You
  • 3. What is Cloud Computing? Cloud computing is an Internet-based computing technology. It is a next-stage technology that uses the cloud to provide services whenever and wherever the user needs them, and it provides a way to access servers worldwide. What is a Cloud? A cloud is a combination of networks, hardware, services, storage, and interfaces that helps deliver computing as a service.
  • 4. Why Cloud Computing? [Figure: without cloud computing vs. with cloud computing]
  • 5. Benefits of Cloud Computing: Cloud computing lets companies and applications that depend on system infrastructure operate without owning that infrastructure. By using cloud infrastructure on a pay-as-used, on-demand basis, organizations can save on capital and operational investment. Clients can put their data on the platform instead of on their own desktop PCs or servers. They can also put their applications on the cloud and use the servers within the cloud for processing and data manipulation.
  • 6. Characteristics: Agile; Highly Reliable; Independent of Device and Location; Low Cost; Pay-Per-Use; Easy to Maintain; Highly Scalable; Multi-Shared
  • 7. Advantages of Cloud Computing: Lower-cost computers for users; Lower IT infrastructure costs; Lower maintenance costs; Lower software costs; Instant software updates; Increased computing power; Virtually unlimited storage capacity
  • 8. Disadvantages of Cloud Computing: Requires a constant Internet connection; Stored data might not be secure; Limited control and flexibility; Higher risk of information leakage; Users have little visibility into the underlying network; Dependence on service providers for data management
  • 9. How Cloud Computing Works
  • 10. Challenges of Cloud Computing: Using cloud computing means depending on others, which could limit flexibility and innovation. Security could prove to be a big issue: it is still unclear how safe outsourced data is when using these services. Ownership of data is not always clear. Data centres can become environmental hazards (Green Cloud). Cloud interoperability is still an issue.
  • 11. Layers of Cloud Computing: Infrastructure as a Service (IaaS) provides cloud infrastructure in the form of hardware resources such as memory and processing power. Platform as a Service (PaaS) provides a cloud application platform for developers. Software as a Service (SaaS) provides cloud applications to users directly, without installing anything on their systems; the applications remain on the cloud.
  • 12. Components Of Cloud Computing
  • 13. Big Data: Big Data refers to a collection of data sets so large and complex that they cannot be processed with the usual databases and tools. Big data is hard to capture, store, search, share, analyze, and visualize.
  • 14. 3 Vs of Big Data: The “BIG” in big data isn’t just about volume. Volume (how much data there is); Variety (how many kinds and formats of data); Velocity (how fast data arrives and must be processed)
  • 15. Importance of Big Data: The importance of big data does not revolve around how much data you have, but what you do with it. You can take data from any source and analyze it to find answers that enable: cost reductions; time reductions; new product development and optimized offerings; smart decision making.
  • 16. What Comes Under Big Data: Black Box Data; Social Media Data; Stock Exchange Data; Power Grid Data; Transport Data; Search Engine Data. This data may be structured, semi-structured, or unstructured.
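
The difference between structured, semi-structured, and unstructured data can be made concrete with a minimal Python sketch. The sample records, field names, and values below are invented for illustration and do not come from the slides; only standard-library modules are used.

```python
import csv
import io
import json
import re

# Structured: fixed columns, every row has the same fields (hypothetical sample).
structured = "id,city,temp_c\n1,Pune,31\n2,Delhi,35\n"
rows = list(csv.DictReader(io.StringIO(structured)))

# Semi-structured: self-describing keys, but nesting and fields can vary per record.
semi_structured = '{"user": "a1", "tags": ["cloud", "hadoop"], "geo": {"city": "Pune"}}'
record = json.loads(semi_structured)

# Unstructured: free text with no schema; information must be extracted,
# here with a simple regular expression that picks out hashtags.
unstructured = "Loved the new phone, but the battery drains fast. #disappointed"
hashtags = re.findall(r"#(\w+)", unstructured)

print(rows[1]["city"], record["geo"]["city"], hashtags)
```

Structured data can be queried by column straight away, semi-structured data needs its keys navigated, and unstructured data needs an extraction step before it can be analyzed at all.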
  • 17. What is Hadoop? Hadoop is an open-source software framework for storing data and running applications on clusters of commodity hardware. It provides massive storage for any kind of data, enormous processing power, and the ability to handle virtually limitless concurrent tasks or jobs. The software framework that supports HDFS, MapReduce, and other related entities is called the Hadoop project, or simply Hadoop. It is open source and distributed by Apache.
  • 18. Hadoop Ecosystem: Apache Oozie (workflow); Pig Latin (data analysis); Mahout (machine learning); Hive (DW system); HBase; MapReduce framework; HDFS (Hadoop Distributed File System); Flume (unstructured or semi-structured data); Sqoop (structured data)
  • 19. Hadoop With Big Data: Hadoop is the core platform for structuring Big Data, and solves the problem of formatting it for subsequent analytics purposes. Hadoop uses a distributed computing architecture consisting of multiple servers using commodity hardware, making it relatively
  • 20. Cost-Effective System; Large Cluster of Nodes; Parallel Processing; Distributed Data; Automatic Failover Management; Data Locality Optimization; Heterogeneous Cluster; Scalability
  • 21. Map Reduce: MapReduce is a programming model that Google has used successfully to process its “big-data” sets (about 20 petabytes per day, as Google reported in 2008). A map function extracts some intelligence from raw data. A reduce function aggregates the data output by the map according to a grouping key. Users specify the computation in terms of a map and a reduce function. The underlying runtime system automatically parallelizes the computation across large-scale clusters of machines, and also handles machine failures, efficient communications, and performance issues.
  • 22. [Figure: MapReduce data flow. The input is broken into pieces; Map runs a computation on each piece; the results then go through Shuffle and Sort.]
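
The map, shuffle-and-sort, and reduce steps can be sketched in a few lines of plain Python using the classic word-count example. This is a single-process illustration of the programming model only, not Hadoop's or Google's implementation; the function names and the two input strings are made up for the example.

```python
from collections import defaultdict

def map_phase(document):
    """Map: emit a (word, 1) pair for every word in one input piece."""
    return [(word.lower(), 1) for word in document.split()]

def shuffle_and_sort(pairs):
    """Shuffle and sort: group all values by key, ordered by key."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return sorted(groups.items())

def reduce_phase(key, values):
    """Reduce: aggregate the values collected for one key."""
    return key, sum(values)

def word_count(documents):
    mapped = []
    for doc in documents:  # a real runtime would run these map tasks in parallel
        mapped.extend(map_phase(doc))
    return dict(reduce_phase(k, v) for k, v in shuffle_and_sort(mapped))

# Hypothetical input, already "broken into pieces".
pieces = ["big data on the cloud", "the cloud stores big data"]
counts = word_count(pieces)
print(counts)
```

Because each call to `map_phase` depends only on its own piece, and each call to `reduce_phase` only on its own key, a runtime is free to spread both phases across many machines, which is the property the model relies on.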
  • 23. Why Data Analysis? The primary value of big data comes not from the data in its raw form but from processing and analyzing it, and from the insights, products, and services that emerge from that analysis.
  • 24. Types of Analysis: For unstructured data to be useful, it must be analysed to extract and expose the information it contains. Different types of analysis are possible, such as: Entity analysis (people, organisations, objects and events, and the relationships between them); Topic analysis (topics or themes, and their relative importance); Sentiment analysis (a person's subjective view of a particular topic); Feature analysis (inherent characteristics that are significant for a particular analytical perspective, e.g. land coverage in satellite imagery).
  • 25. Types of Data Analytics: Analytic excellence leads to better decisions. Descriptive Analytics: What is happening? Diagnostic Analytics: Why did it happen? Predictive Analytics: What is likely to happen? Prescriptive Analytics: What should we do about it?
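
As a rough illustration of how the four questions differ, the sketch below answers each one on a tiny data set. The numbers (ad spend and sales), the target, and the candidate budgets are hypothetical values chosen for the example, and the least-squares line is just one simple choice of predictive model.

```python
from statistics import mean

# Hypothetical monthly figures, invented for illustration.
ad_spend = [1, 2, 3, 4, 5]
sales = [12, 14, 16, 18, 20]

# Descriptive: what is happening? Summarize the observed data.
average_sales = mean(sales)

# Diagnostic: why did it happen? Measure how closely sales track ad spend
# using the Pearson correlation coefficient.
mx, my = mean(ad_spend), mean(sales)
sxy = sum((x - mx) * (y - my) for x, y in zip(ad_spend, sales))
sxx = sum((x - mx) ** 2 for x in ad_spend)
syy = sum((y - my) ** 2 for y in sales)
correlation = sxy / (sxx * syy) ** 0.5

# Predictive: what is likely to happen? Fit a least-squares line and extrapolate.
slope = sxy / sxx
intercept = my - slope * mx

def forecast(spend):
    return intercept + slope * spend

# Prescriptive: what should we do? Pick the cheapest candidate budget
# whose forecast meets a (hypothetical) sales target.
target = 25
options = [6, 7, 8]
recommended = min(s for s in options if forecast(s) >= target)

print(average_sales, correlation, forecast(6), recommended)
```

Each step builds on the previous one: the summary describes the data, the correlation suggests a driver, the fitted line projects forward, and the final rule turns the projection into a decision.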
  • 26. Big Data Analytics: Focus on: Predictive Analysis; Data Science. Data sets: Large-Scale Data Sets; More Types of Data; Raw Data; Complex Data Models. Supports: Correlations (new insight, more accurate answers)
  • 27. Conclusion: Two IT initiatives are currently top of mind for organizations across the globe: Big Data Analytics and Cloud Computing. As a delivery model for IT services, cloud computing has the potential to enhance business agility and productivity while enabling greater efficiencies and reducing costs. Big Data is currently a big challenge for organizations; Hadoop came into existence to store and process data of such large volume, variety, and velocity. Our presentation is all about Cloud Computing, Big Data, and Big Data Analytics.