SlideShare a Scribd company logo
1 of 38
Faculty Name: Namrata
Sharma/Arjun S. Parihar
Year/Branch:3rd/CSE
Subject Code:CS-503(A)
Subject Name:Data Analytics
In this session you will learn about:
• Hadoop Ecosystem
• Data discovery
• Open source technology for Big Data Analytics
• cloud and Big Data
Learning Objectives
 Apache Hadoop is the most powerful tool of Big Data.
 Hadoop ecosystem revolves around three main components-
• HDFS
• MapReduce
• YARN
Apart from these Hadoop Components, there are
some other Hadoop ecosystem components also, that play an important
role to boost Hadoop functionalities.
Hadoop
Hadoop Components
1.1 HDFS
Hadoop Distributed File system (HDFS) is the primary storage system
of Hadoop.
HDFS store very large files running on a cluster of commodity
hardware.
It follows the principle of storing less number of large files rather than
the huge number of small files.
HDFS stores data reliably even in the case of hardware failure.
 it provides high throughput access to the application by accessing in
parallel.
1.11 NameNode –
 It works as Master in Hadoop cluster.
 Namenode stores meta-data i.e. number of blocks, replicas and other
details.
 Meta-data is present in memory in the master.
 NameNode assigns tasks to the slave node.
 It should deploy on reliable hardware as it is the centerpiece of HDFS.
Components of HDFS
1.12 DataNode –
 It works as Slave in Hadoop cluster.
 In Hadoop HDFS, DataNode is responsible for storing actual data
in HDFS.
 DataNode also performs read and write operation as per request
for the clients.
 DataNodes can also deploy on commodity hardware.
1.2 MapReduce
 Hadoop MapReduce is the data processing layer of Hadoop.
 It processes large structured and unstructured data stored in HDFS.
 MapReduce also processes a huge amount of data in parallel.
 It does this by dividing the job (submitted job) into a set of independent
tasks (sub-job).
 In Hadoop, MapReduce works by breaking the processing into phases.
1.3 YARN
Hadoop YARN provides the resource management.
It is the operating system of Hadoop.
So, it is responsible for managing and monitoring workloads,
implementing security controls.
It is a central platform to deliver data governance tools across Hadoop
clusters.
YARN allows multiple data processing engines such as real-time
streaming, batch processing etc.
Resource Manager –
It is a cluster level component and runs on the Master machine.
It manages resources and schedule applications running on the top of
YARN.
It has two components: Scheduler & Application Manager.
Node Manager –
 It is a node level component.
It runs on each slave machine.
It continuously communicate with Resource Manager to remain up-to-date
Components of YARN
Data discovery is the collection and analysis of data from various sources
to gain insight from hidden patterns and trends.
It is the first step in fully harnessing an organization’s data to inform
critical business decisions.
Through the data discovery process, data is gathered, combined, and
analyzed in a sequence of steps.
The goal is to make messy and scattered data clean, understandable, and
user-friendly.
Data discovery
According to Gartner, “Big Data Discovery” is the next big trend in
analytics.
Hottest trends of the last few years in analytics:
Big Data
Data Discovery
Data Science
What are the Benefits of Data Discovery?
Gather Actionable Insights
Save Time
Scale Data Across Teams
Clean and Reuse Data
Data discovery provides a framework for firms to unlock and act upon the
insights contained within their data.
It transforms messy and unstructured data to facilitate and enhance its
analysis. Data discovery allows firms to:
Data Discovery Tools
We know we want collect, store, organize,
analyze and share it.
But we have limited resources.
What is Cloud Computing?
25
Cloud computing is a fast-
growing technology that has
established itself in the next
generation of IT industry and
business.
Cloud Service Model
26
Cloud service model typically consists of paas, saas, and laas.
Cloud Process
Case Study
 Application
• Call Center surveillance
 Background
• Previously – voice data
 Goal for a new system
• Monitor data & voice
• Multiple data sources
• Advanced correlations
Ever Growing Data
Deeper Correlation
Tight Performance
A Classic Case for..
Cost Business
Impact
Big Data
in the Cloud
 Auto start VMs
 Install and configure app components
 Monitor
 Repair
 (Auto) Scale
Managing Big
Data on the cloud
Big Data in the cloud
Reduce the
infrastructure cost
Choose the right
cloud for the job
Big Data in the cloud
• Consistent Management
• Automation Through the Entire Stack
Reducing the
operational
complexity
Big Data in the cloud
Predictive analytics is the practice of extracting insights from the existing
data set with the help data mining, statistical modeling and machine
learning techniques and using it to predict unobserved/unknown events.
Identifying cause-effect relationships across the variables from the
historical data.
Discovering hidden insights and patterns with the help of data mining
techniques.
Apply observed patterns to unknowns in the Past, Present or Future.
Predictive Analytics
Thanks!

More Related Content

Similar to data analytics lecture4.pptx

Big Data Analytics With Hadoop
Big Data Analytics With HadoopBig Data Analytics With Hadoop
Big Data Analytics With HadoopUmair Shafique
 
project report on hadoop
project report on hadoopproject report on hadoop
project report on hadoopManoj Jangalva
 
Big Data Analysis and Its Scheduling Policy – Hadoop
Big Data Analysis and Its Scheduling Policy – HadoopBig Data Analysis and Its Scheduling Policy – Hadoop
Big Data Analysis and Its Scheduling Policy – HadoopIOSR Journals
 
1.demystifying big data & hadoop
1.demystifying big data & hadoop1.demystifying big data & hadoop
1.demystifying big data & hadoopdatabloginfo
 
BigData & Hadoop Ecosystem.pptx
BigData & Hadoop Ecosystem.pptxBigData & Hadoop Ecosystem.pptx
BigData & Hadoop Ecosystem.pptxBibhasDeb1
 
A Glimpse of Bigdata - Introduction
A Glimpse of Bigdata - IntroductionA Glimpse of Bigdata - Introduction
A Glimpse of Bigdata - Introductionsaisreealekhya
 
BIGDATA MODULE 3.pdf
BIGDATA MODULE 3.pdfBIGDATA MODULE 3.pdf
BIGDATA MODULE 3.pdfDIVYA370851
 
Distributed Systems Hadoop.pptx
Distributed Systems Hadoop.pptxDistributed Systems Hadoop.pptx
Distributed Systems Hadoop.pptxUttara University
 
Bigdata and hadoop
Bigdata and hadoopBigdata and hadoop
Bigdata and hadoopAditi Yadav
 
Managing Big data with Hadoop
Managing Big data with HadoopManaging Big data with Hadoop
Managing Big data with HadoopNalini Mehta
 
Hadoop Ecosystem at a Glance
Hadoop Ecosystem at a GlanceHadoop Ecosystem at a Glance
Hadoop Ecosystem at a GlanceNeev Technologies
 
Fundamentals of big data analytics and Hadoop
Fundamentals of big data analytics and HadoopFundamentals of big data analytics and Hadoop
Fundamentals of big data analytics and HadoopArchana Gopinath
 
Big Data and Hadoop Basics
Big Data and Hadoop BasicsBig Data and Hadoop Basics
Big Data and Hadoop BasicsSonal Tiwari
 

Similar to data analytics lecture4.pptx (20)

Big Data Analytics With Hadoop
Big Data Analytics With HadoopBig Data Analytics With Hadoop
Big Data Analytics With Hadoop
 
project report on hadoop
project report on hadoopproject report on hadoop
project report on hadoop
 
Big Data Analysis and Its Scheduling Policy – Hadoop
Big Data Analysis and Its Scheduling Policy – HadoopBig Data Analysis and Its Scheduling Policy – Hadoop
Big Data Analysis and Its Scheduling Policy – Hadoop
 
G017143640
G017143640G017143640
G017143640
 
1.demystifying big data & hadoop
1.demystifying big data & hadoop1.demystifying big data & hadoop
1.demystifying big data & hadoop
 
BigData & Hadoop Ecosystem.pptx
BigData & Hadoop Ecosystem.pptxBigData & Hadoop Ecosystem.pptx
BigData & Hadoop Ecosystem.pptx
 
A Glimpse of Bigdata - Introduction
A Glimpse of Bigdata - IntroductionA Glimpse of Bigdata - Introduction
A Glimpse of Bigdata - Introduction
 
BIGDATA MODULE 3.pdf
BIGDATA MODULE 3.pdfBIGDATA MODULE 3.pdf
BIGDATA MODULE 3.pdf
 
Hadoop info
Hadoop infoHadoop info
Hadoop info
 
Hadoop
HadoopHadoop
Hadoop
 
Hadoop
HadoopHadoop
Hadoop
 
Distributed Systems Hadoop.pptx
Distributed Systems Hadoop.pptxDistributed Systems Hadoop.pptx
Distributed Systems Hadoop.pptx
 
Bigdata and hadoop
Bigdata and hadoopBigdata and hadoop
Bigdata and hadoop
 
Big data
Big dataBig data
Big data
 
Big data
Big dataBig data
Big data
 
Managing Big data with Hadoop
Managing Big data with HadoopManaging Big data with Hadoop
Managing Big data with Hadoop
 
Hadoop Ecosystem at a Glance
Hadoop Ecosystem at a GlanceHadoop Ecosystem at a Glance
Hadoop Ecosystem at a Glance
 
Big data
Big dataBig data
Big data
 
Fundamentals of big data analytics and Hadoop
Fundamentals of big data analytics and HadoopFundamentals of big data analytics and Hadoop
Fundamentals of big data analytics and Hadoop
 
Big Data and Hadoop Basics
Big Data and Hadoop BasicsBig Data and Hadoop Basics
Big Data and Hadoop Basics
 

Recently uploaded

Call Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile serviceCall Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile servicerehmti665
 
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...ranjana rawat
 
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Christo Ananth
 
Analog to Digital and Digital to Analog Converter
Analog to Digital and Digital to Analog ConverterAnalog to Digital and Digital to Analog Converter
Analog to Digital and Digital to Analog ConverterAbhinavSharma374939
 
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVHARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVRajaP95
 
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINEMANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINESIVASHANKAR N
 
the ladakh protest in leh ladakh 2024 sonam wangchuk.pptx
the ladakh protest in leh ladakh 2024 sonam wangchuk.pptxthe ladakh protest in leh ladakh 2024 sonam wangchuk.pptx
the ladakh protest in leh ladakh 2024 sonam wangchuk.pptxhumanexperienceaaa
 
SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )Tsuyoshi Horigome
 
Introduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptxIntroduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptxupamatechverse
 
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICSHARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICSRajkumarAkumalla
 
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat
 
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSAPPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSKurinjimalarL3
 
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat
 
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130Suhani Kapoor
 
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130Suhani Kapoor
 
Porous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writingPorous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writingrakeshbaidya232001
 
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝soniya singh
 

Recently uploaded (20)

Call Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile serviceCall Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile service
 
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
 
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
 
Exploring_Network_Security_with_JA3_by_Rakesh Seal.pptx
Exploring_Network_Security_with_JA3_by_Rakesh Seal.pptxExploring_Network_Security_with_JA3_by_Rakesh Seal.pptx
Exploring_Network_Security_with_JA3_by_Rakesh Seal.pptx
 
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
 
Analog to Digital and Digital to Analog Converter
Analog to Digital and Digital to Analog ConverterAnalog to Digital and Digital to Analog Converter
Analog to Digital and Digital to Analog Converter
 
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVHARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
 
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINEMANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
 
the ladakh protest in leh ladakh 2024 sonam wangchuk.pptx
the ladakh protest in leh ladakh 2024 sonam wangchuk.pptxthe ladakh protest in leh ladakh 2024 sonam wangchuk.pptx
the ladakh protest in leh ladakh 2024 sonam wangchuk.pptx
 
SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )
 
Introduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptxIntroduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptx
 
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICSHARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
 
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
 
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSAPPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
 
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
 
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
 
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCRCall Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
 
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
 
Porous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writingPorous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writing
 
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
 

data analytics lecture4.pptx

  • 1.
  • 2. Faculty Name: Namrata Sharma/Arjun S. Parihar Year/Branch:3rd/CSE Subject Code:CS-503(A) Subject Name:Data Analytics
  • 3. In this session you will learn about: • Hadoop Ecosystem • Data discovery • Open source technology for Big Data Analytics • cloud and Big Data Learning Objectives
  • 4.  Apache Hadoop is the most powerful tool of Big Data.  Hadoop ecosystem revolves around three main components- • HDFS • MapReduce • YARN Apart from these Hadoop Components, there are some other Hadoop ecosystem components also, that play an important role to boost Hadoop functionalities. Hadoop
  • 5.
  • 6. Hadoop Components 1.1 HDFS Hadoop Distributed File system (HDFS) is the primary storage system of Hadoop. HDFS store very large files running on a cluster of commodity hardware. It follows the principle of storing less number of large files rather than the huge number of small files. HDFS stores data reliably even in the case of hardware failure.  it provides high throughput access to the application by accessing in parallel.
  • 7.
  • 8. 1.11 NameNode –  It works as Master in Hadoop cluster.  Namenode stores meta-data i.e. number of blocks, replicas and other details.  Meta-data is present in memory in the master.  NameNode assigns tasks to the slave node.  It should deploy on reliable hardware as it is the centerpiece of HDFS. Components of HDFS
  • 9.
  • 10. 1.12 DataNode –  It works as Slave in Hadoop cluster.  In Hadoop HDFS, DataNode is responsible for storing actual data in HDFS.  DataNode also performs read and write operation as per request for the clients.  DataNodes can also deploy on commodity hardware.
  • 11.
  • 12. 1.2 MapReduce  Hadoop MapReduce is the data processing layer of Hadoop.  It processes large structured and unstructured data stored in HDFS.  MapReduce also processes a huge amount of data in parallel.  It does this by dividing the job (submitted job) into a set of independent tasks (sub-job).  In Hadoop, MapReduce works by breaking the processing into phases.
  • 13.
  • 14. 1.3 YARN Hadoop YARN provides the resource management. It is the operating system of Hadoop. So, it is responsible for managing and monitoring workloads, implementing security controls. It is a central platform to deliver data governance tools across Hadoop clusters. YARN allows multiple data processing engines such as real-time streaming, batch processing etc.
  • 15.
  • 16. Resource Manager – It is a cluster level component and runs on the Master machine. It manages resources and schedule applications running on the top of YARN. It has two components: Scheduler & Application Manager. Node Manager –  It is a node level component. It runs on each slave machine. It continuously communicate with Resource Manager to remain up-to-date Components of YARN
  • 17. Data discovery is the collection and analysis of data from various sources to gain insight from hidden patterns and trends. It is the first step in fully harnessing an organization’s data to inform critical business decisions. Through the data discovery process, data is gathered, combined, and analyzed in a sequence of steps. The goal is to make messy and scattered data clean, understandable, and user-friendly. Data discovery
  • 18.
  • 19. According to Gartner, “Big Data Discovery” is the next big trend in analytics. Hottest trends of the last few years in analytics: Big Data Data Discovery Data Science
  • 20.
  • 21. What are the Benefits of Data Discovery? Gather Actionable Insights Save Time Scale Data Across Teams Clean and Reuse Data Data discovery provides a framework for firms to unlock and act upon the insights contained within their data. It transforms messy and unstructured data to facilitate and enhance its analysis. Data discovery allows firms to:
  • 23.
  • 24. We know we want collect, store, organize, analyze and share it. But we have limited resources.
  • 25. What is Cloud Computing? 25 Cloud computing is a fast- growing technology that has established itself in the next generation of IT industry and business.
  • 26. Cloud Service Model 26 Cloud service model typically consists of paas, saas, and laas.
  • 28. Case Study  Application • Call Center surveillance  Background • Previously – voice data  Goal for a new system • Monitor data & voice • Multiple data sources • Advanced correlations
  • 29. Ever Growing Data Deeper Correlation Tight Performance
  • 30. A Classic Case for..
  • 33.  Auto start VMs  Install and configure app components  Monitor  Repair  (Auto) Scale Managing Big Data on the cloud Big Data in the cloud
  • 34. Reduce the infrastructure cost Choose the right cloud for the job Big Data in the cloud
  • 35. • Consistent Management • Automation Through the Entire Stack Reducing the operational complexity Big Data in the cloud
  • 36. Predictive analytics is the practice of extracting insights from the existing data set with the help data mining, statistical modeling and machine learning techniques and using it to predict unobserved/unknown events. Identifying cause-effect relationships across the variables from the historical data. Discovering hidden insights and patterns with the help of data mining techniques. Apply observed patterns to unknowns in the Past, Present or Future. Predictive Analytics
  • 37.