BIG Data
Desai Karan A
https://in.linkedin.com/in/karan28
SYNOPSIS:
1. Handy Hands-on
2. Introduction to big data
3. Big Data Niceties
4. Specifics of Big Data
5. Big Data Management Tools
6. Practical use-cases
7. Conclusions
8. References
1. Handy Hands-On
2. Introduction to big data
-2.1 What is big data?
-2.2 Etymology.
-2.3 Hype and Facts.
2.1 What is big data?
• “Big data” refers to datasets whose size is beyond the ability of typical database software tools to capture, store, manage, and analyze.
• Big data also means extremely large data sets that may be analyzed computationally to reveal patterns, trends, and associations, especially relating to human behavior and interactions.
• There is no single fixed threshold, but big data commonly means data volumes from about 1000 gigabytes (1 terabyte) upward, reaching into petabytes and beyond.
2.2 Etymology: Word Origination
“Big data” is the simplest, shortest phrase to convey that the boundaries of computing keep advancing, growing, diversifying and intensifying rapidly.
John R. Mashey, chief scientist at Silicon Graphics, is widely credited with coining the term “Big Data”.
2.3 Hype and Facts
GLOBALLY, EVERY 60 SECONDS…
• 204 million emails are sent.
• 300k logins to .
• 1.3 million views on YouTube.
• 2 million Google searches.
• 100k tweets.
• 62,000 hours of music downloads.
2.3 Hype and Facts (contd.)
• WE GENERATE 2.5 QUINTILLION BYTES EVERY DAY
• IN 2012, THE WORLD’S INFORMATION CROSSED 2 ZETTABYTES = 2 TRILLION GIGABYTES!!
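The unit arithmetic behind these two figures can be checked with a short sketch. It assumes decimal (SI) prefixes, which the slides do not state explicitly, and the constant names are illustrative only.

```python
# Assumption: decimal (SI) prefixes, i.e. 1 GB = 10**9 bytes.
GIGABYTE = 10**9
EXABYTE = 10**18
ZETTABYTE = 10**21

# "2 zettabytes = 2 trillion gigabytes"
world_2012_bytes = 2 * ZETTABYTE
world_2012_gb = world_2012_bytes // GIGABYTE  # number of gigabytes

# "2.5 quintillion bytes every day" (one quintillion = 10**18)
daily_bytes = 25 * 10**17
daily_exabytes = daily_bytes / EXABYTE

if __name__ == "__main__":
    print(f"2 ZB = {world_2012_gb:,} GB")
    print(f"Daily volume = {daily_exabytes} EB")
```

Under this assumption, 2.5 quintillion bytes is the same as 2.5 exabytes, and 2 zettabytes works out to 2 × 10^12 (2 trillion) gigabytes, matching the slide.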
3. Big Data Niceties.
-3.1 Evolution of Big Data
-3.2 Why traditional tools fail?
-3.3 Utilities of Big Data
3.1 Evolution Story:
• E-TSUNAMI and Heavy RAINS of DATA…
3.2 Why traditional tools fail?
• Present-day data is too big for traditional data management tools.
-They can work only with small samples of the data.
-That is like looking through a keyhole and trying to judge the size of the room…
3.2 Why traditional tools fail? (contd.)
• High turnaround time for meaningful results.
-That means deciding whether to cross the road based on a picture taken 5 minutes earlier!!
3.3 Big data utilities:
• Dealing with real-time data.
• A new level of insight and opportunity.
• More effective, fact-based decision making.
• A new source of business value.
• A competitive advantage.
4. Specifics of Big Data
-4.1 Characteristics
-4.2 Life cycle
4.1 Characteristics
Big data: Volume, Variety, Velocity, Veracity
4.2 Big Data Life Cycle
Manage, Enrich, Insight
• Manage and secure data of any size.
• Enrich by connecting the world’s data.
• Gain insights on any data, irrespective of location.
5. Big Data Management tools.
-5.1 Cow story
-5.2 Introduction to Hadoop
-5.3 Basic Working of Hadoop.
5.1 Cow story
Case 1: “It is easy for me to handle my resources (data).”
Data on storage device: MB/GB
Case 2: “I am strong… I can handle my resources.”
Data on storage device: TB
Case 3: “Oof… There are so many resources!!! I am not strong!”
Data on storage device: PB
Case 4: “I call my friends for help.”
Big Data Management tools
5.2 Introduction to Hadoop
Apache Hadoop is an open-source software framework for storage and large-scale processing of data sets on clusters of commodity hardware.
Introduction to Hadoop (contd.)
• Doug Cutting created Apache Hadoop.
• The Hadoop logo is a small yellow elephant.
5.3 Basic working of Hadoop
Read 1 TB of data:
1 machine
• 4 I/O channels
• Each channel: 100 MB/s
• ~45 minutes
10 machines
• 4 I/O channels per machine
• Each channel: 100 MB/s
• ~4.5 minutes
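The read-time comparison can be reproduced with a back-of-the-envelope calculation. This is a minimal sketch, assuming decimal units (1 TB = 10^12 bytes, 1 MB = 10^6 bytes), perfectly parallel reads and no coordination overhead; the function name `read_time_minutes` is hypothetical.

```python
def read_time_minutes(total_tb, machines=1, channels_per_machine=4,
                      mb_per_s_per_channel=100):
    """Idealized time to read `total_tb` terabytes with every channel in parallel.

    Assumes decimal units and zero overhead; real clusters are slower.
    """
    total_mb = total_tb * 10**6  # 1 TB = 10**6 MB in decimal units
    throughput_mb_per_s = machines * channels_per_machine * mb_per_s_per_channel
    seconds = total_mb / throughput_mb_per_s
    return seconds / 60

if __name__ == "__main__":
    print(f"1 machine:   {read_time_minutes(1, machines=1):.1f} min")
    print(f"10 machines: {read_time_minutes(1, machines=10):.1f} min")
```

Under these assumptions the ideal figures come out slightly below the rounded values on the slide (roughly 42 minutes versus ~45, and roughly 4.2 versus ~4.5), but the point is the same: spreading the read over ten machines cuts the time by a factor of ten.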
Present Hadoop basic architecture.
Schematic working.
Hadoop: data analytics
• Framework written in Java for big data processing
• Uses the “Map-Reduce” processing paradigm
• Optimized for distributed storage and computing of data
• Open source
• Very low cost for acquisition and storage
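To make the “Map-Reduce” paradigm concrete, here is a minimal single-process word-count sketch. It is not the Hadoop API: the function names are illustrative, and in a real cluster the map and reduce steps would run on many machines, with the framework handling the grouping step between them.

```python
from collections import defaultdict

def map_phase(line):
    """Map: emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]

def shuffle_phase(pairs):
    """Shuffle: group all emitted values by their key (the word)."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups

def reduce_phase(key, values):
    """Reduce: combine all values for one key into a single count."""
    return key, sum(values)

def word_count(lines):
    """Run map, shuffle and reduce in sequence over a list of text lines."""
    mapped = [pair for line in lines for pair in map_phase(line)]
    grouped = shuffle_phase(mapped)
    return dict(reduce_phase(key, values) for key, values in grouped.items())

if __name__ == "__main__":
    sample = ["big data is big", "data keeps growing"]
    print(word_count(sample))
```

Because each line is mapped independently and each word is reduced independently, both phases can be split across machines, which is what makes the paradigm suitable for distributed processing.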
Other big data management tools: Overview…
6. Practical Use-Cases
-6.1 Big apps of Big Data tools
-6.2 How big data affects small business
-6.3 Relevance of big data in market
6.1 Big apps of big data tools.
Who is using big data?
6.2 How does big data affect small businesses?
• Every organization has a tipping point, and most organizations, regardless of size, will eventually reach a point where the volume, variety and velocity of their data will be something that they have to address.
• This new big data world is not only about solving problems faster, but about solving problems that were not solvable before.
6.3 Relevance of big data in the market.
7. Conclusions
Conclusions: Through pics..
8. References:
• www.microsoft.com
• http://en.wikipedia.org/wiki/Hadoop
• http://en.wikipedia.org/wiki/Big_data
• www.google.com
• www.slideshare.net
• PDF: McKinsey Global Institute
• PDF: 101 Big data by Pradeep Vardan
• Workshop in college by ‘Ecsttasys’ on big data
Introduction to Big Data

Introduction to Big Data

  • 1. BIG Data Desai Karan A https://in.linkedin.com/in/karan28
  • 2. SYNOPSIS: 1. Handy Hands-on 2. Introduction to big data 3. Big Data Niceties 4. Specifics of Big Data 5. Big Data Management Tools 6. Practical use-cases 7. Conclusions 8. References
  • 4.
  • 5.
  • 6.
  • 7. 2. Introduction to big data -2.1 What is big data? -2.2 Etymology. -2.3 Hype and Facts.
  • 8. 2.1 What is big data? • “Big data” refers to datasets whose size is beyond the ability of typical database software tools to capture, store, manage, and analyze. • Big Data is the extremely large data sets that may be analyzed computationally to reveal patterns, trends, and associations, especially relating to human behavior and interactions. • Big data is the data of range more than 1000 gigabytes or 100 zettabytes.
  • 9. 2.2 Etymology: Word Origination Big data is the simplest, shortest phrase to convey that the boundaries of computing keep advancing, growing, diversifying and intensifying rapidly.. John R Mashey, chief scientist at Silicon Graphics coined the term “Big Data”.
  • 10. 2.3 Hype and Facts
  • 11. 2.3 Hype and Facts
  • 12.
  • 13. GLOBALLY, EVERY 60 SECONDS… • 204 Million emails are sent. • 300k logins to . • 1.3 Million views on YouTube. • 2 Million Google searches. • 100k tweets. • 62,000 hours of Music Downloads
  • 14. • WE GENERATE 2.5 QUINTILION BYTES EVERYDAY • IN 2012, WORLD’S INFORMATION CROSSED 2 ZETTA BYTES =2 TRILLION GIGABYTES!! 2.3 Hype and Facts (contd.)
  • 15. 3. Big Data Niceties. -3.1 Evolution of Big Data -3.2 Why traditional tools fail? -3.3 Utilities of Big Data
  • 17.
  • 18. • E-TSUNAMI and Heavy RAINS of DATA… 3.2 Why traditional tools fail? (contd.)
  • 19. 3.2 Why traditional tools fail? • The present data is highly BIG for the traditional data managers. -Can work only with small samples of data -It is same as looking through keyhole and finding size of room…
  • 20. • High Turnaround time for meaningful results – Means Deciding to cross road based on picture taken 5 minutes earlier!! 3.2 Why traditional tools fail? (contd.)
  • 21. 3.3 Big data utilities: • Dealing with real time data. • A new level of insight and opportunity. • More effective, fact based decision making. • A new source of business values. • A competitive advantage.
  • 22. 4. Specifics of Big Data -4.1 Characteristics -4.2 Life cycle
  • 29. 4.2 Big Data Life Cycle • Insight • Enrich • Manage
  • 30. 4.2 Big Data Life Cycle • Manage and secure data of any size. • Enrich by connecting the world’s data. • Gain insights on any data, irrespective of location.
  • 32. 5. Big Data Management tools. -5.1 Cow story -5.2 Introduction to Hadoop -5.3 Basic Working of Hadoop.
  • 33. 5.1 Cow story: Case 1 • “It is easy for me to handle my resources (data).” • Storage device: MB/GB
  • 34. Case 2 • “I am strong… I can handle my resources.” • Storage device: TB
  • 35. Case 3 • “Oof… There are so many resources! I am not strong!” • Storage device: PB
  • 36. Case 4 • “I call my friends for help.” • Big data management tools
  • 37. 5.2 Introduction to Hadoop Apache Hadoop is an open-source software framework for storage and large-scale processing of data-sets on clusters of commodity hardware.
  • 38. Introduction to Hadoop • Doug Cutting created Apache Hadoop. • The Hadoop logo is a small yellow elephant.
  • 39. 5.3 Basic working of Hadoop
  • 40. Read 1 TB of data • 1 machine: 4 I/O channels, each channel 100 MB/s, ~45 minutes • 10 machines: 4 I/O channels per machine, each channel 100 MB/s, ~4.5 minutes
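
The read-time comparison can be reproduced with a short calculation. This is a minimal sketch under idealized assumptions: the data is split evenly, every channel on every machine reads in parallel at full speed, and 1 TB is taken as 10^12 bytes. The function name `read_time_minutes` is hypothetical.

```python
def read_time_minutes(data_bytes, machines, channels_per_machine=4, channel_mb_per_s=100):
    """Idealized time to read data_bytes when all channels on all machines read in parallel."""
    total_bytes_per_s = machines * channels_per_machine * channel_mb_per_s * 10**6
    return data_bytes / total_bytes_per_s / 60

ONE_TB = 10**12  # decimal terabyte (assumption)

single = read_time_minutes(ONE_TB, machines=1)
cluster = read_time_minutes(ONE_TB, machines=10)

print(f"1 machine:   {single:.1f} min")    # about 41.7 min
print(f"10 machines: {cluster:.1f} min")   # about 4.2 min
```

The results are close to the slide's rounded figures of ~45 and ~4.5 minutes. The point is the tenfold speed-up from spreading the read across ten machines.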
  • 46. Hadoop • Application framework written in Java for big data processing • Uses the “MapReduce” processing paradigm • Optimized for distributed storage and computing of data • Open source • Very low cost for acquisition and storage
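
To give a feel for the MapReduce paradigm mentioned above, here is a minimal single-process sketch of a word count in plain Python. It does not use Hadoop's actual Java API; the function names (`map_phase`, `shuffle`, `reduce_phase`, `word_count`) are illustrative assumptions. In a real cluster the map and reduce steps would run on many machines.

```python
from collections import defaultdict

def map_phase(line):
    """Map: emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]

def shuffle(pairs):
    """Shuffle: group all emitted values by their key."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups

def reduce_phase(key, values):
    """Reduce: combine all values for one key into a single result."""
    return key, sum(values)

def word_count(lines):
    # Each line could be mapped on a different machine; here it is a simple loop.
    pairs = [pair for line in lines for pair in map_phase(line)]
    return dict(reduce_phase(key, values) for key, values in shuffle(pairs).items())

counts = word_count(["big data is big", "data is everywhere"])
print(counts)  # {'big': 2, 'data': 2, 'is': 2, 'everywhere': 1}
```

Because each map call looks at only one line and each reduce call at only one key, the work can be divided among machines without them needing to coordinate on every record.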
  • 47. Other big data management tools: Overview…
  • 49. 6. Practical Use-Cases -6.1 Big apps of Big Data tools -6.2 How big data affects small business -6.3 Relevance of big data in market
  • 50. 6.1 Big apps of big data tools.
  • 52. Who is using big data?
  • 54. 6.2 How big data affects small businesses? • Every organization has a tipping point, and most organizations – regardless of size – will eventually reach a point where the volume, variety and velocity of their data will be something that they have to address. • This new big data world is not only about running problems faster, but about solving problems that were not solvable before.
  • 55. 6.3 Relevance of big data in market.
  • 62. 8. References: • www.microsoft.com • http://en.wikipedia.org/wiki/Hadoop • http://en.wikipedia.org/wiki/Big_data • www.google.com • www.slideshare.net • PDF: McKinsey Global Institute • PDF: 101 Big data by Pradeep Vardan • Workshop in college by ‘Ecsttasys’ on big data

Editor's Notes

  1. ©Karan Desai (follow me on Twitter @karlmit or at https://in.linkedin.com/in/karan28). DISCLAIMER: The images, diagrams and content in this presentation are meant for educational purposes only. The author does not guarantee the originality of any media in the presentation. The author has only combined and summarized details on the topic from varied sources. The author does not intend any violation of copyright.
  2. SSAS: SQL Server Analysis Services (SSAS) is an online analytical processing (OLAP), data mining and reporting tool in Microsoft SQL Server.
     Essbase is a multidimensional database management system (MDBMS) that provides a multidimensional database platform upon which to build analytic applications.
     IBM Cognos TM1 (formerly Applix TM1) is enterprise planning software used to implement collaborative planning, budgeting and forecasting solutions, as well as analytical and reporting applications.
     Power Pivot is a free add-in to the 2010 version of the spreadsheet application Microsoft Excel. Power Pivot workbooks are self-contained web applications, merely requiring a 'Save as' to make them accessible in the browser as interactive solutions.
     K is a proprietary array processing language developed by Arthur Whitney and commercialized by Kx Systems. An open-source implementation known as Kona has also been developed. kdb is both a database (kdb) and a vector language (q), and is widely used by major financial institutions.
     Vertica Systems is an analytic database management software company.
     QlikView is a business intelligence platform for turning data into knowledge.
     TIBCO Spotfire® designs, develops and distributes in-memory analytics software for next-generation business intelligence.
     Tableau Software is an American computer software company headquartered in Seattle, Washington. It produces a family of interactive data visualization products focused on business intelligence.
     Omniscope is a single, in-memory, file-based application that enables agile, 'best practice' data sharing solutions.
     An in-memory database (IMDB; also main memory database system, MMDB, or memory-resident database) is a database management system that primarily relies on main memory for computer data storage. It is contrasted with database management systems that employ a disk storage mechanism.
Traditional relational databases are row-oriented: the data in each row of a table is stored together. In a columnar, or column-oriented, database, the data in each column is stored together instead.
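
The difference between the two layouts can be illustrated with a small in-memory sketch. The table, its column names and its values are made up for illustration; real databases store these layouts on disk or in memory in far more elaborate formats.

```python
# A tiny example table (hypothetical data).
rows = [
    {"id": 1, "city": "Pune", "sales": 120},
    {"id": 2, "city": "Surat", "sales": 80},
    {"id": 3, "city": "Pune", "sales": 200},
]

# Row-oriented layout: all values of one record are kept together.
row_store = [tuple(record.values()) for record in rows]

# Column-oriented layout: all values of one column are kept together.
column_store = {name: [record[name] for record in rows] for name in rows[0]}

# An aggregate over one column only needs to touch that column's values.
total_sales = sum(column_store["sales"])

print(row_store[0])            # (1, 'Pune', 120)
print(column_store["sales"])   # [120, 80, 200]
print(total_sales)             # 400
```

Fetching one complete record is natural in the row layout, while analytic queries that scan a single column across many records favor the column layout.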