BigData Analytics
AMSTECH Incorporation Pvt. Ltd.
Presented by: Mayank Kumar Sharma
Internet = Ocean of information
What is BigData?
What makes data, “Big” Data?
Why BigData?
What is BigData?
“Extremely large data sets that may be analyzed computationally to reveal patterns, trends, and associations, especially relating to human behavior and interactions, are known as BigData.”
OR
BigData is the term for a collection of data sets so large and complex that it becomes difficult to process using on-hand database management tools or traditional data processing applications.
What is BigData?
Gartner definition (2012): “BigData is high-volume, high-velocity, and/or high-variety information assets that require new forms of processing to enable enhanced decision making, insight discovery and process optimization.”
“No exact definition, only experience.”
What makes data, “Big” Data?
Every day, we create roughly 2.5 quintillion bytes of data (a widely cited IBM estimate), so much that 90% of the data in the world today has been created in the last two years alone.
An example of big data might be petabytes (1,024 terabytes) or exabytes (1,024 petabytes) of data consisting of billions to trillions of records of millions of people.
Storage capacity increases by 23% per year on average.
Data growth is exponential over the decade starting in 2010.
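The unit sizes and the 23% annual growth figure above can be checked with a little arithmetic. The following is a minimal sketch, not part of the original deck: the helper names `to_bytes` and `doubling_time_years` are hypothetical, and it assumes the binary prefixes (1,024 per step) used on the slide.

```python
import math

# Binary-prefix steps as used on the slide: 1 PB = 1,024 TB, 1 EB = 1,024 PB.
STEP = 1024
UNITS = ["B", "KB", "MB", "GB", "TB", "PB", "EB"]


def to_bytes(value, unit):
    """Convert a value in the given unit to bytes (hypothetical helper)."""
    return value * STEP ** UNITS.index(unit)


def doubling_time_years(annual_growth):
    """Years needed for a quantity to double at a fixed annual growth rate."""
    return math.log(2) / math.log(1 + annual_growth)


print(to_bytes(1, "PB") // to_bytes(1, "TB"))  # terabytes per petabyte
print(to_bytes(1, "EB") // to_bytes(1, "PB"))  # petabytes per exabyte
print(round(doubling_time_years(0.23), 1))     # years to double at 23% per year
```

At 23% per year, capacity doubles a little more often than every three and a half years.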
• Creates over 30 billion pieces of content per day.
• Stores 30 petabytes of data.
• 90 million tweets per day.
Why BigData?
To Manage Data Better.
[Abstraction has enabled numerous use cases that handle data in a wide variety of formats.]
Benefit From Speed, Capacity and Scalability of Cloud Storage.
[Cloud platforms provide both the storage and the computing power necessary to crunch substantially large data sets for a specific period.]
End Users Can Visualize Data.
[Data in easy-to-read charts, graphs and slideshows.]
Why BigData?
Find New Business Opportunities.
[Social media, Business Intelligence]
Data Analysis Methods, Capabilities Will Evolve.
[Utilizing substantially large data sets requires both the storage and the computing power necessary to crunch data for a specific period.]
Who uses BigData?
1. Banking
2. Education
3. Government
4. Health Care
5. Manufacturing
6. Retail
“It’s important to remember that the primary value from big data comes not from the data in its raw form, but from the processing and analysis of it and the insights, products, and services that emerge from analysis.”
BigData Challenges
Characteristics of Big Data:
Big data can be characterized by the 3Vs: Volume, Velocity and Variety.
Volume: BigData 3Vs
Data Volume
• 44x increase from 2009 to 2020
• From 0.8 zettabytes to 35 zettabytes
Data volume is increasing exponentially.
Exponential increase in collected/generated data.
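The two figures on the Volume slide (0.8 zettabytes in 2009, 35 zettabytes in 2020) imply a steady compound growth rate. The sketch below works it out; the function name is hypothetical, and the calculation assumes growth is spread evenly over the eleven years, which the slide does not state.

```python
def implied_annual_growth(start, end, years):
    """Compound annual growth rate that takes `start` to `end` in `years` years."""
    return (end / start) ** (1 / years) - 1


start_zb, end_zb = 0.8, 35.0   # zettabytes, figures from the slide
years = 2020 - 2009

factor = end_zb / start_zb     # overall growth factor (the slide rounds this to 44x)
rate = implied_annual_growth(start_zb, end_zb, years)

print(f"growth factor: {factor:.2f}x")
print(f"implied annual growth: {rate:.1%}")
```

Under that assumption the data volume grows by roughly 40% each year, which is what "exponential" means here: a constant percentage increase rather than a constant absolute one.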
Variety: BigData 3Vs
Various formats, types, and structures.
Text, numerical, images, audio, video, sequences, time series, social media data, multi-dimensional arrays, etc.
Static data vs. streaming data.
A single application can generate or collect many types of data.
To extract knowledge, all these types of data need to be linked together.
Velocity: BigData 3Vs
Data is being generated fast and needs to be processed fast.
Online data analytics.
Late decisions lead to missed opportunities.
Examples:
• E-promotions: based on your current location, your purchase history and what you like, send promotions right now for the store next to you.
• Healthcare monitoring: sensors monitor your activities and body; any abnormal measurement requires an immediate reaction.
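The healthcare monitoring example can be sketched as a check that runs on each reading as it arrives, instead of on a stored batch. This is an illustration only: the `monitor` function, the heart-rate values and the thresholds are all hypothetical and do not come from the slides.

```python
def monitor(readings, low, high):
    """Yield an alert for each reading outside [low, high] as soon as it is seen."""
    for timestamp, value in readings:
        if not (low <= value <= high):
            yield (timestamp, value)


# Hypothetical heart-rate stream: (seconds since start, beats per minute).
stream = [(0, 72), (1, 75), (2, 190), (3, 74), (4, 38)]

# The generator consumes readings one at a time, so an alert is available
# before later readings have even arrived.
alerts = list(monitor(iter(stream), low=50, high=120))
for timestamp, value in alerts:
    print(f"t={timestamp}s: abnormal reading {value}")
```

Because `monitor` is a generator, it never needs the whole data set in memory, which is the property that matters when data arrives continuously.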
Research Study
Shim, K. S., Lee, S. K. and Kim, M. S., “Application Traffic Classification in Hadoop Distributed Computing Environment,” Asia-Pacific Network Operations and Management Symposium (APNOMS), 2014.
1. This research work proposed an application traffic classification method in a Hadoop distributed computing environment.
2. The traffic patterns of current networks have changed, and conventional traffic analysis methods are no longer adequate.
3. In the proposed solution, the authors consider traffic from a campus network in packet units. Collected packets are converted into flow format by a flow generator. A flow is defined by its 5-tuple.
Conclusion
4. The proposed method performs well in terms of processing speed, as shown by a comparison between the Hadoop-based system and a single-server system.
5. On the other hand, it has certain drawbacks:
   1. Adoption of a classification technique rather than clustering.
   2. Low analysis rate.
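The flow generation step in the research study (packets converted into flows defined by a 5-tuple) can be illustrated with a small sketch. It assumes the conventional 5-tuple of source IP, destination IP, source port, destination port and protocol. The packet fields, addresses and the `generate_flows` function are hypothetical and are not taken from the paper, whose flow generator may differ, for example by treating both directions as one flow.

```python
from collections import defaultdict


def generate_flows(packets):
    """Group packets into unidirectional flows keyed by the 5-tuple."""
    flows = defaultdict(lambda: {"packets": 0, "bytes": 0})
    for pkt in packets:
        key = (pkt["src_ip"], pkt["dst_ip"],
               pkt["src_port"], pkt["dst_port"], pkt["proto"])
        flows[key]["packets"] += 1
        flows[key]["bytes"] += pkt["size"]
    return dict(flows)


# Hypothetical captured packets (illustrative values only).
packets = [
    {"src_ip": "10.0.0.1", "dst_ip": "10.0.0.9", "src_port": 5000,
     "dst_port": 80, "proto": "TCP", "size": 60},
    {"src_ip": "10.0.0.1", "dst_ip": "10.0.0.9", "src_port": 5000,
     "dst_port": 80, "proto": "TCP", "size": 1500},
    {"src_ip": "10.0.0.2", "dst_ip": "10.0.0.9", "src_port": 5353,
     "dst_port": 53, "proto": "UDP", "size": 80},
]

flows = generate_flows(packets)
for key, stats in flows.items():
    print(key, stats)
```

Each flow summarizes many packets in one record, which is why classification is usually done on flows rather than on raw packets.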
Existing Solution for Traffic Classification
BigData Analytics
BigData Analytics Use Cases
[Figure: Real Time Intelligence, Data Discovery, Business Intelligence; Data Scientist, Business User, Consumer]
BigData: Hadoop
1. Hadoop is a free, Java-based programming framework that supports the processing of large data sets in a distributed computing environment.
2. The Hadoop Distributed File System (HDFS) is designed to store very large data sets reliably, and to stream those data sets at high bandwidth to user applications.
3. By distributing storage and computation across many servers, the resource can grow with demand while remaining economical at every size.
4. An important characteristic of Hadoop is the partitioning of data and computation across many (thousands of) hosts, and the execution of application computations in parallel close to their data.
5. A Hadoop cluster scales computation capacity, storage capacity and I/O bandwidth by simply adding commodity servers.
6. In simple words, it is a scalable, fault-tolerant grid operating system for data storage and processing, with high bandwidth and clustered storage.
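The idea in item 4, partitioning data across hosts and computing on each partition where it lives, can be sketched in a few lines. This is a single-process illustration and not Hadoop's API: the function names are hypothetical, and the "hosts" are simulated as list partitions processed one after another. On a real cluster the per-partition step would run in parallel on separate machines, typically as a MapReduce job.

```python
from collections import Counter
from functools import reduce


def partition(records, n_hosts):
    """Split records into n_hosts partitions (round-robin)."""
    return [records[i::n_hosts] for i in range(n_hosts)]


def local_count(lines):
    """Work done by one host on its own partition: count words locally."""
    return Counter(word for line in lines for word in line.split())


def merge(a, b):
    """Combine two partial results into one."""
    return a + b


lines = ["big data", "data velocity", "big volume", "data variety"]

# Each partition is processed independently, close to "its" data.
partials = [local_count(p) for p in partition(lines, n_hosts=2)]

# Only the small partial results are combined, not the raw data.
total = reduce(merge, partials, Counter())
print(total.most_common(2))
```

The design point is that only the compact per-host summaries travel over the network, which is what lets capacity grow by adding servers.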
Figure 2: HADOOP Components
Figure 3: HDFS Processing
NameNode
1. NameNode is the centerpiece of HDFS.
2. NameNode is also known as the Master.
3. NameNode stores only the metadata of HDFS: the directory tree of all files in the file system. It also tracks the files across the cluster.
4. NameNode does not store the actual data or the dataset. The data itself is stored in the DataNodes.
5. NameNode knows the list of blocks and their locations for any given file in HDFS. With this information, NameNode knows how to construct the file from its blocks.
6. NameNode is so critical to HDFS that when it is down, the HDFS/Hadoop cluster is inaccessible and considered down.
7. NameNode is a single point of failure in a Hadoop cluster, unless high availability (a standby NameNode) is configured.
8. NameNode is usually configured with a lot of memory (RAM), because the block locations are held in main memory.
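The division of labor described above (NameNode keeps only metadata, DataNodes keep the bytes) can be modeled with a toy example. This is a sketch under stated assumptions, not HDFS code: the class and function names are hypothetical, placement is simple round-robin, there is no replication, and the block size is a few bytes so the output stays readable (real HDFS blocks are far larger, 128 MB by default in recent versions).

```python
BLOCK_SIZE = 4  # bytes; deliberately tiny for illustration


class ToyNameNode:
    """Holds metadata only: file name -> ordered list of (block_id, datanode_index)."""

    def __init__(self):
        self.block_map = {}


class ToyDataNode:
    """Holds the actual block contents."""

    def __init__(self):
        self.blocks = {}


def write_file(name, data, namenode, datanodes):
    """Split data into blocks, store them on DataNodes, record locations on the NameNode."""
    entries = []
    for offset in range(0, len(data), BLOCK_SIZE):
        index = offset // BLOCK_SIZE
        block_id = f"{name}#{index}"
        node = index % len(datanodes)  # round-robin placement (assumption)
        datanodes[node].blocks[block_id] = data[offset:offset + BLOCK_SIZE]
        entries.append((block_id, node))
    namenode.block_map[name] = entries


def read_file(name, namenode, datanodes):
    """Ask the NameNode for block locations, then fetch the blocks in order."""
    return b"".join(datanodes[node].blocks[block_id]
                    for block_id, node in namenode.block_map[name])


namenode = ToyNameNode()
datanodes = [ToyDataNode() for _ in range(3)]
write_file("log.txt", b"hello hadoop", namenode, datanodes)

print(namenode.block_map["log.txt"])               # metadata only
print(read_file("log.txt", namenode, datanodes))   # file rebuilt from its blocks
```

Losing `namenode.block_map` would leave the blocks intact on the DataNodes but with no way to reassemble them, which is why the NameNode is so critical.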
DataNode
1. DataNode is responsible for storing the actual data in HDFS.
2. DataNode is also known as the Slave.
3. NameNode and DataNode are in constant communication.
4. When a DataNode starts up, it announces itself to the NameNode along with the list of blocks it is responsible for.
5. When a DataNode is down, it does not affect the availability of data or the cluster. NameNode will arrange replication of the blocks managed by the unavailable DataNode.
6. DataNode is usually configured with a lot of hard disk space, because the actual data is stored in the DataNode.
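Item 5 above, re-replicating blocks when a DataNode goes down, can be sketched as a pure function over the block location map. This is a simplified illustration and not the real HDFS placement policy: the function name, block IDs and node names are hypothetical, and the target of three replicas is assumed here because it is the usual HDFS default (the slides do not state a replication factor).

```python
def rereplicate(block_locations, failed_node, live_nodes, replication=3):
    """Return a new block -> set-of-nodes map after `failed_node` goes down.

    For every block that lost a replica, pick live nodes that do not yet
    hold the block until the replication target is met again.
    """
    updated = {}
    for block, nodes in block_locations.items():
        remaining = set(nodes) - {failed_node}
        candidates = [n for n in live_nodes if n not in remaining]
        while len(remaining) < replication and candidates:
            remaining.add(candidates.pop(0))
        updated[block] = remaining
    return updated


# Hypothetical cluster state before the failure.
locations = {
    "blk_1": {"dn1", "dn2", "dn3"},
    "blk_2": {"dn2", "dn3", "dn4"},
}
live = ["dn1", "dn3", "dn4"]  # dn2 has failed

new_locations = rereplicate(locations, "dn2", live)
for block in sorted(new_locations):
    print(block, sorted(new_locations[block]))
```

Every block still has at least two healthy copies at the moment of failure, so reads continue while the missing replicas are recreated.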
Operation series when writing a file
Operation series when reading a file
Hadoop Configuration
Thanks A Lot

BigData Analytics

  • 1. BigData AnalyticsBigData Analytics Incorporation Pvt. Ltd. Presented By:-Presented By:- Mayank Kumar Sharma 1
  • 3. Internet = Ocean of informationInternet = Ocean of information 3AMSTECH Incorporation Pvt. Ltd.
  • 5. What is BigData? What makes data, “Big” Data? Why BigData? 5AMSTECH Incorporation Pvt. Ltd.
  • 6. “Extremely large data sets that may be analyzed computationally to reveal patterns, trends, and associations, especially relating to human behavior and interactions are known as BigDataBigData.” OR BigDataBigData is the term for a collection of data sets so large and complex that it becomes difficult to process using on-hand database management tools or traditional data processing applications. What is BigData? 6AMSTECH Incorporation Pvt. Ltd.
  • 7. “Gartner Definition(2012): "BigData is high volume, high velocity, and/or high variety information assets that require new forms of processing to enable enhanced decision making, insight discovery and process optimization.” “No exact Definition, Only Experience.” What is BigData? 7AMSTECH Incorporation Pvt. Ltd.
  • 8. Every day, we create 3.5 quintillion bytes of data — so much that 90% of the data in the world today has been created in the last two years alone. An example of big data might be petabytes (1,024 terabytes) or exabytes (1,024 petabytes) of data consisting of billions to trillions of records of millions of people. Storage capacity increases 23% on average annually. Exponential growth during a decade starts from 2010. What makes data, “Big” Data? 8AMSTECH Incorporation Pvt. Ltd.
  • 9. Creates over 30 billion pieces of content per day. Stores 30 petabytes of data. 90 million tweets per day.
  • 10. Why BigData? To Manage Data Better. [Abstraction has enabled numerous use cases where data in a wide variety of formats] Benefit From Speed, Capacity and Scalability of Cloud Storage. [Utilize substantially large data sets; provide both the storage and the computing power necessary to crunch data for a specific period.] End Users Can Visualize Data. [Data in easy-to-read charts, graphs and slideshows]
  • 11. Why BigData? Find New Business Opportunities. [Social media, Business Intelligence] Data Analysis Methods, Capabilities Will Evolve. [Utilize substantially large data sets; provide both the storage and the computing power necessary to crunch data for a specific period.]
  • 13. Who uses BigData? 1. Banking 2. Education 3. Government 4. Health Care 5. Manufacturing 6. Retail. “It’s important to remember that the primary value from big data comes not from the data in its raw form, but from the processing and analysis of it and the insights, products, and services that emerge from analysis.”
  • 15. Characteristics of Big Data: Big data can be characterized by the 3Vs: Volume, Velocity and Variety.
  • 16. Volume: BigData 3Vs. Data volume is increasing exponentially: a 44x increase from 2009 to 2020, from 0.8 zettabytes to 35 zettabytes. Exponential increase in collected/generated data.
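The growth figures on this slide can be checked with a little arithmetic. The sketch below assumes that the 0.8 zettabyte figure refers to 2009 and the 35 zettabyte figure to 2020 (an 11-year span), and that growth compounds at a constant yearly rate, which the slide does not state. The function name `growth_summary` is illustrative only.

```python
def growth_summary(start_zb, end_zb, years):
    """Return (overall growth factor, implied constant annual growth rate)."""
    factor = end_zb / start_zb
    # Constant-rate assumption: start * (1 + r) ** years == end
    annual_rate = factor ** (1.0 / years) - 1.0
    return factor, annual_rate


# Assumed endpoints: 0.8 ZB in 2009, 35 ZB in 2020
factor, annual_rate = growth_summary(0.8, 35.0, 2020 - 2009)
print(f"overall growth factor: {factor:.2f}x")
print(f"implied annual growth: {annual_rate:.1%}")
```

The overall factor is 35 / 0.8 = 43.75, which the slide rounds to 44x. Under the constant-rate assumption this corresponds to roughly 41% growth per year.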
  • 17. Variety: BigData 3Vs. Various formats, types, and structures: text, numerical, images, audio, video, sequences, time series, social media data, multi-dimensional arrays, etc. Static data vs. streaming data. A single application can be generating/collecting many types of data. To extract knowledge, all these types of data need to be linked together.
  • 19. Velocity: BigData 3Vs. Data is being generated fast and needs to be processed fast. Online data analytics. Late decisions lead to missed opportunities. Examples: E-promotions: based on your current location, your purchase history and what you like, send promotions right now for the store next to you. Healthcare monitoring: sensors monitor your activities and body; any abnormal measurements require immediate reaction.
  • 21. Research Study: Shim, K. S., Lee, S. K. and Kim, M. S., “Application Traffic Classification in Hadoop Distributed Computing Environment,” Asia-Pacific Network Operations and Management Symposium (APNOMS), 2014. 1. This research work proposed an application traffic classification method in a Hadoop distributed computing environment. 2. Traffic patterns in current networks have changed, and conventional traffic analysis methods are no longer adequate. 3. In the proposed solution, the authors consider packet-level traffic from a campus network. Collected packets are converted into flow format by the flow generator. A flow is defined by its 5-tuple.
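As a minimal sketch of the flow-generation step described on this slide, packets can be grouped by their 5-tuple. The slide does not list the five fields, so the conventional choice (source IP, destination IP, source port, destination port, protocol) is assumed here. `Packet`, `FlowKey` and `build_flows` are hypothetical names, not taken from the paper, and the real system runs on Hadoop rather than in a single process.

```python
from collections import defaultdict
from typing import Dict, Iterable, List, NamedTuple


class Packet(NamedTuple):
    src_ip: str
    dst_ip: str
    src_port: int
    dst_port: int
    protocol: str
    size: int  # payload size in bytes


class FlowKey(NamedTuple):
    # Assumed 5-tuple; the slide does not enumerate the fields.
    src_ip: str
    dst_ip: str
    src_port: int
    dst_port: int
    protocol: str


def build_flows(packets: Iterable[Packet]) -> Dict[FlowKey, List[Packet]]:
    """Group packets that share the same 5-tuple into one flow."""
    flows = defaultdict(list)
    for p in packets:
        key = FlowKey(p.src_ip, p.dst_ip, p.src_port, p.dst_port, p.protocol)
        flows[key].append(p)
    return dict(flows)


packets = [
    Packet("10.0.0.1", "10.0.0.2", 5000, 80, "TCP", 60),
    Packet("10.0.0.1", "10.0.0.2", 5000, 80, "TCP", 1500),
    Packet("10.0.0.3", "10.0.0.2", 5001, 53, "UDP", 80),
]
flows = build_flows(packets)
for key, pkts in flows.items():
    print(key, "packets:", len(pkts), "bytes:", sum(p.size for p in pkts))
```

This sketch treats flows as unidirectional: the reply direction of a connection has swapped source and destination fields and therefore forms a separate flow.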
  • 22. Conclusion 4. The proposed method performs well in terms of processing speed, as shown by a comparison between the Hadoop-based system and a single-server system. 5. On the other hand, it has certain drawbacks: 1. Adoption of a classification technique rather than clustering. 2. Low analysis rate.
  • 23. Existing Solution for Traffic Classification
  • 25. BigData Analytics Use Cases: Real Time Intelligence, Data Discovery, Business Intelligence; Data Scientist, Business User, Consumer.
  • 26. BigData: Hadoop 1. Hadoop is a free, Java-based programming framework that supports the processing of large data sets in a distributed computing environment. 2. The Hadoop Distributed File System (HDFS) is designed to store very large data sets reliably, and to stream those data sets at high bandwidth to user applications. 3. By distributing storage and computation across many servers, the resource can grow with demand while remaining economical at every size.
  • 27. 4. An important characteristic of Hadoop is the partitioning of data and computation across many (thousands of) hosts, and the execution of application computations in parallel close to their data. 5. A Hadoop cluster scales computation capacity, storage capacity and I/O bandwidth by simply adding commodity servers. 6. In simple words, it is a scalable, fault-tolerant grid operating system for data storage and processing, with high bandwidth and clustered storage.
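The idea of partitioning data and running the computation on each partition in parallel can be illustrated with a toy example. This is only a single-machine analogy under stated assumptions: it uses Python threads rather than Hadoop, each inner list stands in for the data held by one host, and `count_words` and `parallel_word_count` are hypothetical names.

```python
from collections import Counter
from concurrent.futures import ThreadPoolExecutor
from typing import List


def count_words(partition: List[str]) -> Counter:
    """Work done 'close to the data': count words in one partition."""
    counts = Counter()
    for line in partition:
        counts.update(line.split())
    return counts


def parallel_word_count(partitions: List[List[str]]) -> Counter:
    """Process every partition in parallel, then merge the partial results."""
    with ThreadPoolExecutor(max_workers=len(partitions)) as pool:
        partials = list(pool.map(count_words, partitions))
    total = Counter()
    for partial in partials:
        total.update(partial)
    return total


# Each inner list plays the role of the data stored on one host.
partitions = [["big data big"], ["data hadoop"], ["hadoop big"]]
totals = parallel_word_count(partitions)
print(totals.most_common())
```

Adding capacity in this picture means adding another partition and another worker. The per-partition function stays unchanged, which is the property the slide attributes to adding commodity servers.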
  • 28. Figure 2: HADOOP Components
  • 29. Figure 3: HDFS Processing
  • 31. Name Node 1. NameNode is the centerpiece of HDFS. 2. NameNode is also known as the Master. 3. NameNode only stores the metadata of HDFS: the directory tree of all files in the file system. It also tracks the files across the cluster. 4. NameNode does not store the actual data or the dataset. The data itself is stored in the DataNodes. 5. NameNode knows the list of blocks and their locations for any given file in HDFS. With this information, NameNode knows how to construct the file from its blocks.
  • 32. 6. NameNode is critical to HDFS: when the NameNode is down, the HDFS/Hadoop cluster is inaccessible and considered down. 7. NameNode is a single point of failure in a Hadoop cluster. 8. NameNode is usually configured with a lot of memory (RAM), because the block locations are held in main memory.
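The division of labour described in the NameNode points above can be sketched as a toy in-memory model: the NameNode holds only metadata (which blocks make up a file, and where each block lives), never the file contents. `ToyNameNode` and its methods are hypothetical and are not part of any Hadoop API.

```python
from typing import Dict, List, Tuple


class ToyNameNode:
    """Metadata only: file -> ordered block IDs, block -> DataNode names."""

    def __init__(self) -> None:
        self.file_blocks: Dict[str, List[str]] = {}
        self.block_locations: Dict[str, List[str]] = {}

    def add_file(self, path: str, blocks: Dict[str, List[str]]) -> None:
        """Register a file; `blocks` maps block ID -> DataNodes, in file order."""
        self.file_blocks[path] = list(blocks)
        for block_id, nodes in blocks.items():
            self.block_locations[block_id] = list(nodes)

    def locate(self, path: str) -> List[Tuple[str, List[str]]]:
        """Return the ordered (block ID, DataNodes) pairs needed to read a file."""
        return [(b, self.block_locations[b]) for b in self.file_blocks[path]]


nn = ToyNameNode()
nn.add_file("/logs/a.txt", {"blk_1": ["dn1", "dn2"], "blk_2": ["dn2", "dn3"]})
plan = nn.locate("/logs/a.txt")
print(plan)
```

Because both dictionaries live in memory, the metadata grows with the number of files and blocks, which is consistent with the slide's remark that the NameNode is usually configured with a lot of RAM.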
  • 33. DataNode 1. DataNode is responsible for storing the actual data in HDFS. 2. DataNode is also known as the Slave. 3. NameNode and DataNode are in constant communication. 4. When a DataNode starts up, it announces itself to the NameNode along with the list of blocks it is responsible for. 5. When a DataNode is down, it does not affect the availability of data or the cluster. NameNode will arrange replication for the blocks managed by the DataNode that is not available. 6. DataNode is usually configured with a lot of hard disk space, because the actual data is stored in the DataNode.
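The block-report and re-replication behaviour in points 4 and 5 can be sketched as follows. This is an illustrative model, not HDFS code: `blocks_needing_replication` is a hypothetical helper, and the target of 2 replicas in the example is an assumption chosen to keep it small (the HDFS default replication factor is 3).

```python
from typing import Dict, Set


def blocks_needing_replication(
    block_reports: Dict[str, Set[str]],
    failed_node: str,
    target_replicas: int,
) -> Dict[str, int]:
    """Return {block ID: extra copies needed} once `failed_node` is unavailable.

    `block_reports` maps each DataNode to the set of blocks it announced.
    """
    live_copies: Dict[str, int] = {}
    for node, blocks in block_reports.items():
        if node == failed_node:
            continue
        for block_id in blocks:
            live_copies[block_id] = live_copies.get(block_id, 0) + 1
    # Blocks reported only by the failed node have zero live copies.
    for block_id in block_reports.get(failed_node, set()):
        live_copies.setdefault(block_id, 0)
    return {
        block_id: target_replicas - copies
        for block_id, copies in sorted(live_copies.items())
        if copies < target_replicas
    }


reports = {
    "dn1": {"blk_1", "blk_2"},
    "dn2": {"blk_1", "blk_3"},
    "dn3": {"blk_2", "blk_3"},
}
todo = blocks_needing_replication(reports, failed_node="dn1", target_replicas=2)
print(todo)
```

In this example, losing `dn1` leaves `blk_1` and `blk_2` with one live copy each, so each needs one more replica, while `blk_3` is unaffected. A block with zero live copies would also appear in the result, but it could not actually be re-replicated, since no source copy remains.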
  • 34. Operation series when writing a file
  • 35. Operation series when reading a file
  • 37. Thanks A Lot. By: Mayank Kumar Sharma, AMSTECH Incorporation Pvt. Ltd.