Big Data Hadoop
For project and thesis
What is Big Data?
Introduction
Big Data refers to large volumes of data, structured or unstructured,
that require new technologies and techniques to handle. Organised data
is known as structured data, while unorganised data is known as
unstructured data. The data sets in big data are so large and complex
that traditional application software cannot handle them. Frameworks
such as Hadoop are designed for processing big data. These techniques
are also used to extract useful insights from data through predictive
analysis, user behavior analysis and other analytics.
3 Vs of Big Data
● Volume – It refers to the amount of data that is generated. The data can be low-density,
high-volume, structured or unstructured, or of unknown value. Technologies like Hadoop
convert this data of unknown value into useful information. The data can range from
terabytes to petabytes.
● Velocity – It refers to the rate at which the data is generated. The data is received at
an unprecedented speed and must be acted upon in a timely manner. Internet of Things (IoT)
applications in particular require real-time evaluation and action.
● Variety – It refers to the different formats of data. Data may be structured, unstructured or
semi-structured, for example audio, video, text or email. Unstructured and semi-structured
data require additional processing to derive meaning and to support the metadata.
Hadoop
Hadoop is an open-source framework
for storing and processing big
data. Hadoop makes use of simple
programming models to process big
data in a distributed environment
across clusters of computers. It
provides storage for large volumes of
data along with substantial processing
power, and it can handle many
concurrent tasks and jobs.
Hadoop Architecture
HDFS is the main storage component of the Hadoop architecture. It stands for Hadoop
Distributed File System. It stores large amounts of data across multiple
machines. MapReduce is another component of the Hadoop architecture. It processes
the data in a distributed manner across multiple machines. YARN manages the
resources used for data processing, such as CPU and memory. Resource Manager and
Node Manager are the elements of YARN. These two elements work as master and
slave. The Resource Manager is the master and assigns resources to the slaves,
i.e. the Node Managers. Each Node Manager sends periodic heartbeat signals to the
master to report that it is alive and available for work. Big Data Hadoop can be
a plus point for your thesis.
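The MapReduce model described above can be sketched in a few lines of plain Python. This is only an illustration of the map, shuffle and reduce steps running in a single process, not Hadoop's actual API; the function names and the sample input splits are assumptions made for the example.

```python
from collections import defaultdict
from itertools import chain


def map_phase(split):
    """Emit a (word, 1) pair for every word in one input split."""
    return [(word.lower(), 1) for word in split.split()]


def shuffle(pairs):
    """Group values by key, as the framework does between map and reduce."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Combine all values for one key into a single total."""
    return key, sum(values)


def word_count(splits):
    # In a real cluster each split would be mapped on a different machine;
    # here the splits are simply processed one after another.
    mapped = chain.from_iterable(map_phase(s) for s in splits)
    grouped = shuffle(mapped)
    return dict(reduce_phase(k, v) for k, v in grouped.items())


# Hypothetical input: two splits of one text file.
splits = ["big data needs hadoop", "hadoop stores big data"]
counts = word_count(splits)
print(counts)
```

Because each split is mapped independently and each key is reduced independently, both phases can be spread across many machines, which is the property Hadoop relies on.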
Importance of Hadoop in Big Data
Hadoop is important in Big Data due to:
● Processing of huge chunks of data – With Hadoop, we can store and process huge amounts of data, mainly
data from social media and IoT (Internet of Things) applications.
● Computation power – The computation power of Hadoop is high because it distributes processing across
many nodes, so it can process big data quickly.
● Fault tolerance – Hadoop protects data and processing against hardware failure. If a node in the
distributed model goes down, the other nodes continue to function.
● Flexibility – Hadoop can store as much data as you require, and the data does not need to be
preprocessed before it is stored.
● Low Cost – Hadoop is an open-source framework and free to use. It runs on commodity hardware to store
large quantities of data.
● Scalability – The system can be grown easily by adding nodes according to the requirements.
Minimal administration is required.
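The fault tolerance point above can be illustrated with a small sketch of block replication. It assumes a block size of 128 MB and a replication factor of 3, which are common HDFS defaults, and it uses a simple round-robin placement; real HDFS placement is rack-aware and more involved. The node names and helper functions are hypothetical.

```python
import math

BLOCK_SIZE_MB = 128  # assumed block size (a common HDFS default)
REPLICATION = 3      # assumed replication factor (a common HDFS default)


def place_blocks(file_size_mb, nodes,
                 block_size_mb=BLOCK_SIZE_MB, replication=REPLICATION):
    """Split a file into blocks and assign each block to `replication`
    distinct nodes, round-robin (requires replication <= len(nodes))."""
    n_blocks = math.ceil(file_size_mb / block_size_mb)
    return {
        block: [nodes[(block + r) % len(nodes)] for r in range(replication)]
        for block in range(n_blocks)
    }


def readable_blocks(placement, failed):
    """Return the blocks that still have at least one replica on a live node."""
    return [
        block for block, replicas in placement.items()
        if any(node not in failed for node in replicas)
    ]


nodes = ["node1", "node2", "node3", "node4"]
placement = place_blocks(1024, nodes)  # a 1 GB file becomes 8 blocks

# With one node down, every block is still readable from another replica.
alive = readable_blocks(placement, failed={"node2"})
print(len(placement), len(alive))
```

In this sketch a block becomes unreadable only when all three nodes holding its replicas fail at the same time, which is why losing a single node does not interrupt the cluster.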
Applications of Big Data
Government
Big Data is used in government services to improve cost efficiency, productivity and innovation. A
commonly cited example is the Indian general election of 2014, in which the BJP used big data analytics in its campaign.
Finance
Big Data is used in finance for market prediction. It is also used for compliance and regulatory reporting,
risk analysis, fraud detection, high-speed trading and analytics.
Healthcare
Big Data is used in healthcare services for clinical data analysis, disease pattern analysis, supply of
medical devices and medicines, drug discovery and other such analytics.
Media
Media uses Big Data for purposes such as ad targeting, forecasting,
clickstream analytics, campaign management and loyalty programs. It is mainly
focused on the following three points:
● Targeting consumers
● Capturing of data
● Data journalism
Information Technology
Big Data has helped employees in Information Technology work more
efficiently and has supported the widespread distribution of Information Technology.
Challenges of Big Data
The main challenges of Big Data are:
● Data Storage and quality of Data – The data is growing at a fast pace
as the number of companies and organizations grows. Proper storage of
this data has become a challenge.
● Lack of big data analysts – There is huge demand for data scientists
and analysts who can understand and analyze this data.
● Quality Analysis – The data should also be accurate, as inaccurate data
can lead to wrong decisions that will affect the company's business.
● Security and Privacy of Data – Security and privacy are among the
biggest risks in big data.
Thanks!
Techsparks, 2nd floor, D-185,
Phase 8B, Industrial Area,
Sahibzada Ajit Singh Nagar,
Mohali, Punjab 160055
+91-9465330425
http://www.techsparks.co.in/
techsparks2013@gmail.com
Contact Us

Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)
 
Railway Signalling Principles Edition 3.pdf
Railway Signalling Principles Edition 3.pdfRailway Signalling Principles Edition 3.pdf
Railway Signalling Principles Edition 3.pdf
 
AKS UNIVERSITY Satna Final Year Project By OM Hardaha.pdf
AKS UNIVERSITY Satna Final Year Project By OM Hardaha.pdfAKS UNIVERSITY Satna Final Year Project By OM Hardaha.pdf
AKS UNIVERSITY Satna Final Year Project By OM Hardaha.pdf
 
一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理
一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理
一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理
 

Big Data Hadoop

  • 1. Big Data Hadoop: For project and thesis
  • 2. What is Big Data?
  • 3. Introduction: Big Data refers to large volumes of data, structured or unstructured, that require new technologies and techniques to handle. Organised data is known as structured data, while unorganised data is known as unstructured data. The data sets in Big Data are so large and complex that traditional application software cannot handle them. Frameworks such as Hadoop are designed for processing Big Data. These techniques are also used to extract useful insights from data through predictive analysis, user behavior analysis and other analytics.
  • 4. 3 Vs of Big Data ● Volume – The amount of data that is generated. The data may be high-volume and low-density, structured or unstructured, and of unknown value; technologies like Hadoop turn such data into useful information. Volumes can range from terabytes to petabytes. ● Velocity – The rate at which data is generated. Data is received at unprecedented speed and must be acted upon in a timely manner. Internet of Things (IoT) applications, for example, require real-time evaluation and action. ● Variety – The different formats of data, which may be structured, unstructured or semi-structured, for example audio, video, text or email. Additional processing is required to derive the meaning of such data and to support its metadata.
  • 5. Hadoop: Hadoop is an open-source framework for storing and processing big data. It uses simple programming models to process data in a distributed environment across clusters of computers. Hadoop provides storage for large volumes of data along with substantial processing power, and it can handle multiple tasks and jobs at the same time.
  • 7. HDFS is the main component of the Hadoop architecture. It stands for Hadoop Distributed File System and stores large amounts of data across multiple machines. MapReduce is another component of the Hadoop architecture; it processes data in a distributed manner across multiple machines. The YARN component manages the resources used for data processing, such as CPU and memory. Its elements are the Resource Manager and the Node Manager, which work as master and slave: the Resource Manager is the master and assigns resources to the slave, i.e. the Node Manager, and the Node Manager signals the master when it is about to start work. Choosing Big Data Hadoop as a thesis topic will be a plus point for you.
  • 8. Importance of Hadoop in Big Data
  • 9. Hadoop is important in Big Data due to: ● Processing of huge chunks of data – With Hadoop, we can store and process huge amounts of data, mainly data from social media and Internet of Things (IoT) applications. ● Computation power – Hadoop can process big data quickly because it uses a distributed model for processing. ● Fault tolerance – Hadoop protects against hardware failure: if a node in the cluster goes down, the other nodes continue to function. ● Flexibility – As much data as you require can be stored using Hadoop, and the data does not need to be preprocessed before it is stored. ● Low cost – Hadoop is an open-source framework and free to use. It runs on commodity hardware, so large quantities of data can be stored inexpensively. ● Scalability – The system can be grown easily by adding nodes according to the requirements. Minimal administration is required.
  • 10. Applications of Big Data ● Government – Big Data is used within governmental services to improve cost efficiency, productivity and innovation. A commonly cited example is the Indian general election of 2014, in which the BJP used Big Data in its campaign. ● Finance – Big Data is used in finance for market prediction, compliance and regulatory reporting, risk analysis, fraud detection, high-speed trading and analytics. ● Healthcare – Big Data is used in healthcare services for clinical data analysis, disease pattern analysis, the supply of medical devices and medicines, drug discovery and other such analytics.
  • 11. ● Media – Media uses Big Data for ad targeting, forecasting, clickstream analytics, campaign management and loyalty programs. It focuses mainly on three points: targeting consumers, capturing data, and data journalism. ● Information Technology – Big Data has helped Information Technology employees work more efficiently and has supported the widespread distribution of Information Technology.
  • 12. Challenges of Big Data. The main challenges of Big Data are: ● Data storage and quality of data – Data is growing at a fast pace as the number of companies and organizations grows, and storing it properly has become a challenge. ● Lack of big data analysts – There is huge demand for data scientists and analysts who can understand and analyze this data. ● Quality analysis – The data must also be accurate, since inaccurate data can lead to wrong decisions that affect the company's business. ● Security and privacy of data – Security and privacy are the biggest risks in big data.
  • 13. Thanks! Contact Us: Techsparks, 2nd floor, D-185, Phase 8B, Industrial Area, Sahibzada Ajit Singh Nagar, Mohali, Punjab 160055. Phone: +91-9465330425. Website: http://www.techsparks.co.in/ Email: techsparks2013@gmail.com
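
Slide 7 says that MapReduce processes data in a distributed manner across multiple machines. The following is a minimal sketch of that idea in plain Python, not Hadoop's actual API: it counts words in three chunks of text on a single machine. The function names (`map_phase`, `shuffle`, `reduce_phase`, `word_count`) and the sample chunks are hypothetical and chosen only for illustration. On a real cluster, each chunk would typically be mapped on a different node.

```python
from collections import defaultdict


def map_phase(chunk):
    """Emit a (word, 1) pair for every word in one chunk of text."""
    return [(word.lower(), 1) for word in chunk.split()]


def shuffle(pairs):
    """Group the emitted values by key, as happens between map and reduce."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(grouped):
    """Sum the counts collected for each word."""
    return {word: sum(counts) for word, counts in grouped.items()}


def word_count(chunks):
    """Run map, shuffle and reduce over all chunks (sequentially in this sketch)."""
    pairs = []
    for chunk in chunks:
        pairs.extend(map_phase(chunk))
    return reduce_phase(shuffle(pairs))


# Hypothetical input: three chunks standing in for blocks stored on different machines.
chunks = ["big data hadoop", "hadoop stores big data", "hadoop processes data"]
counts = word_count(chunks)
print(counts)
```

Because each call to `map_phase` depends only on its own chunk, the map step can run on many machines at once, which is what makes this style of processing suit large data sets.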