Big Data Mind Map
Big Data Landscape Mind Map
How Big Data Is Used in Everyday Life
Big Data is still a big problem for many companies.
● How do you collect, process and distribute it?
● How do you analyze it?
Hadoop promises an answer to these questions.
Hadoop
Apache Hadoop® is an open-source, Java-based framework for distributed storage and
processing of large data sets on commodity hardware. Hadoop enables businesses to quickly
gain insight from massive amounts of structured and unstructured data.
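As a rough illustration of the processing model Hadoop is best known for (MapReduce), here is a minimal single-machine sketch in plain Python. It does not use any Hadoop API; the function names `map_phase`, `shuffle`, `reduce_phase`, and `word_count` are hypothetical and chosen only for this example. On a real cluster, the map and reduce steps would run in parallel on different nodes.

```python
from collections import defaultdict


def map_phase(record):
    """Emit a (word, 1) pair for every word in one input record."""
    return [(word.lower(), 1) for word in record.split()]


def shuffle(pairs):
    """Group values by key, as a framework would do between map and reduce."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Combine all values for one key into a single count."""
    return key, sum(values)


def word_count(records):
    # Illustration only: everything runs in one process here, whereas a
    # cluster would map each block of records on a different machine.
    pairs = [pair for record in records for pair in map_phase(record)]
    return dict(reduce_phase(k, v) for k, v in shuffle(pairs).items())
```

Calling `word_count` on a list of text lines returns a dictionary mapping each lower-cased word to the number of times it appears.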
Hadoop vs SAP HANA
Hadoop vs DWH
Business intelligence (BI) is a technology-driven process for analyzing data and presenting actionable
information to help corporate executives, business managers and other end users make more informed
business decisions.
Common Big Data Deployment Architecture
Hadoop Batch Processing:
Hadoop Live stream Processing:
Hadoop
What is Spark?
● Spark is a newer technology that can run on top of the Hadoop Distributed File System (HDFS).
● It is characterized as “a fast and general engine for large-scale data processing.”
● Spark has three key features:
1. For iterative analysis such as logistic regression, Random Forests, or other advanced algorithms,
Spark has been reported to run up to 100x faster than Hadoop MapReduce, at scales of hundreds of millions of rows.
2. Spark has native APIs for Java, Scala, and Python.
3. Spark is general-purpose: it integrates with SQL engines (Shark, since succeeded by Spark SQL),
machine learning (MLlib), and streaming (Spark Streaming), and it can run on Hadoop’s YARN cluster
manager without requiring new software to be installed on the cluster.
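Iterative algorithms such as logistic regression read the same data set on every pass, which is why keeping that data in memory between passes matters so much for speed. The sketch below shows the shape of such an algorithm in plain Python on a single machine. It does not use the Spark or MLlib API; the names `train_logistic` and `predict`, the single-feature model, and the default iteration count and learning rate are all assumptions made for illustration.

```python
import math


def sigmoid(z):
    """Logistic function mapping any real number into (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))


def train_logistic(points, iterations=200, learning_rate=0.5):
    """Fit a one-feature logistic model by batch gradient descent.

    points is a list of (x, y) pairs with y equal to 0 or 1.
    """
    w, b = 0.0, 0.0
    n = len(points)
    for _ in range(iterations):
        grad_w = grad_b = 0.0
        # Every iteration scans the full data set again; this repeated scan
        # is the part that benefits from the data staying in memory.
        for x, y in points:
            error = sigmoid(w * x + b) - y
            grad_w += error * x
            grad_b += error
        w -= learning_rate * grad_w / n
        b -= learning_rate * grad_b / n
    return w, b


def predict(w, b, x):
    """Return the predicted class (0 or 1) for feature value x."""
    return 1 if sigmoid(w * x + b) >= 0.5 else 0
```

With 200 iterations the inner loop visits every data point 200 times, so the cost of re-loading the data on each pass would dominate if it were read from disk every time.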
Data Analysis Flow with Spark
Spark or Hadoop: Which Is the Best Big Data Framework?
● Hadoop was, for many years, the leading open-source Big Data framework.
● Since 2014, Spark has become the more popular of the two Apache Software Foundation projects.
● Spark does not include its own system for organizing files in a distributed way (a file system),
so it requires one provided by a third party. For this reason, many Big Data projects involve installing
Spark on top of Hadoop.
● Spark’s advanced analytics applications can make use of data stored using the Hadoop Distributed File
System (HDFS).
● Many of the big vendors (e.g., Cloudera) now offer Spark as well as Hadoop, so they are in a good position
to advise companies on which is more suitable, on a job-by-job basis.
Top 6 Hadoop Vendors providing Big Data Solutions in Open Data Platform
WHAT IS THE BIG DATA MARKET?
Big Data Is a Big Market and Big Business: a $50 Billion Market
by 2017
Big Data refers not only to the data itself but also to a set of
technologies that capture, store, manage, and analyze large
and variable collections of data to solve complex problems.
BIG DATA OPPORTUNITY
How companies are succeeding
by using Big Data analytics
Why Customers
Need Big Data
QA IN BIG
DATA
7 Ways Big Data Training Can
Change Your Organization
1. Information Technology: Improving productivity with Big Data training
2. Product Development: Rethinking innovation across all stages of R&D
3. Finance: Training employees on big data platforms to handle financial modelling
4. Human Resources: Redefining HR employee capabilities
5. Supply Chain & Logistics: Training delivery teams with big data platforms
6. Operations, Support & Customer Service: Employee training on big data at every customer interaction
7. Marketing: Training employees on a systematic marketing approach with big data
RESOURCES REQUIRED IN BIG DATA
Krisshhna
dkrishna.hadoop@gmail.com

View on big data technologies
