SlideShare a Scribd company logo
1 of 3
Download to read offline
Discuss the advantages of Hadoop technology and distributed data file systems. How is an
Hadoop Distributed File System different from a Relational Database system? What
organizational issues are best solved using Hadoop technology? Give examples of the type of
data they will analyze. What companies currently use Hadoopo related technologies.
Solution
The advantages of Hadoop technology
1. Scalable
Hadoop is a highly scalable storage platform, because it can store and distribute very large data
sets across hundreds of inexpensive servers that operate in parallel. Unlike traditional relational
database systems (RDBMS) that can't scale to process large amounts of data, Hadoop enables
businesses to run applications on thousands of nodes involving thousands of terabytes of data.
2. Cost effective
Hadoop also offers a cost effective storage solution for businesses' exploding data sets. The
problem with traditional relational database management systems is that it is extremely cost
prohibitive to scale to such a degree in order to process such massive volumes of data. In an
effort to reduce costs, many companies in the past would have had to down-sample data and
classify it based on certain assumptions as to which data was the most valuable. The raw data
would be deleted, as it would be too cost-prohibitive to keep. While this approach may have
worked in the short term, this meant that when business priorities changed, the complete raw
data set was not available, as it was too expensive to store. Hadoop, on the other hand, is
designed as a scale-out architecture that can affordably store all of a company's data for later
use. The cost savings are staggering: instead of costing thousands to tens of thousands of pounds
per terabyte, Hadoop offers computing and storage capabilities for hundreds of pounds per
terabyte.
3. Flexible
Hadoop enables businesses to easily access new data sources and tap into different types of data
(both structured and unstructured) to generate value from that data. This means businesses can
use Hadoop to derive valuable business insights from data sources such as social media, email
conversations or clickstream data. In addition, Hadoop can be used for a wide variety of
purposes, such as log processing, recommendation systems, data warehousing, market campaign
analysis and fraud detection.
4. Fast
Hadoop's unique storage method is based on a distributed file system that basically 'maps' data
wherever it is located on a cluster. The tools for data processing are often on the same servers
where the data is located, resulting in much faster data processing. If you're dealing with large
volumes of unstructured data, Hadoop is able to efficiently process terabytes of data in just
minutes, and petabytes in hours.
5. Resilient to failure
A key advantage of using Hadoop is its fault tolerance. When data is sent to an individual node,
that data is also replicated to other nodes in the cluster, which means that in the event of failure,
there is another copy available for use.
The MapR distribution goes beyond that by eliminating the NameNode and replacing it with a
distributed No NameNode architecture that provides true high availability. Our architecture
provides protection from both single and multiple failures.
The advantages ofdistributed data file systems:
Sharing Data :
There is a provision in the environment where user at one site may be able to access the data
residing at other sites.
Autonomy :
Because of sharing data by means of data distribution each site is able to retain a degree of
control over data that are stored locally.
In distributed system there is a global database administrator responsible for the entire system. A
part of global data base administrator responsibilities is delegated to local data base administrator
for each site. Depending upon the design of distributed database
each local database administrator may have different degree of local autonomy.
Availability :
If one site fails in a distributed system, the remaining sites may be able to continue operating.
Thus a failure of a site doesn't necessarily imply the shutdown of the System.
How is an Hadoop Distributed File System different from a Relational Database system?
Hadoop framework works very well with structured and unstructured data. This also supports
variety of data formats in real time such as XML, JSON and text based flat file formats.
However, RDBMS only work with better when an entity relationship model (ER model) is
defined perfectly and therefore, the database schema or structure can grow and unmanaged
otherwise. i.e., An RDBMS works well with structured data. Hadoop will be a choice in
environments such as when there are needs for BIG data processing on which the data being
processed does not have consistent relationships. Where the data size is too BIG for complex
processing, or not easy to define the relationships between the data, then it becomes difficult to
save the extracted information in an RDBMS with a coherent relationship.
What organizational issues are best solved using Hadoop technology?
1. eCommerce Challenges.
2. big data problem.
3. banking
4. cases in retail.
examples of the type of data they will analyze:
1. Analyze life-threatening risks
2. Identify warning signs of security breaches
3. Prevent hardware failure
4. Understand what people think about your company
5. Understand when to sell certain products
6. Find your ideal prospects
7. Gain insight from your log files
companies currently use Hadoopo related technologies:

More Related Content

Similar to Discuss the advantages of Hadoop technology and distributed data fil.pdf

Introduction to Big Data and Hadoop using Local Standalone Mode
Introduction to Big Data and Hadoop using Local Standalone ModeIntroduction to Big Data and Hadoop using Local Standalone Mode
Introduction to Big Data and Hadoop using Local Standalone Mode
inventionjournals
 

Similar to Discuss the advantages of Hadoop technology and distributed data fil.pdf (20)

Hadoop(Term Paper)
Hadoop(Term Paper)Hadoop(Term Paper)
Hadoop(Term Paper)
 
Hadoop hdfs interview questions
Hadoop hdfs interview questionsHadoop hdfs interview questions
Hadoop hdfs interview questions
 
Big data overview
Big data overviewBig data overview
Big data overview
 
Big data and apache hadoop adoption
Big data and apache hadoop adoptionBig data and apache hadoop adoption
Big data and apache hadoop adoption
 
Big data Hadoop presentation
Big data  Hadoop  presentation Big data  Hadoop  presentation
Big data Hadoop presentation
 
Bigdata overview
Bigdata overviewBigdata overview
Bigdata overview
 
Big Data Analysis and Its Scheduling Policy – Hadoop
Big Data Analysis and Its Scheduling Policy – HadoopBig Data Analysis and Its Scheduling Policy – Hadoop
Big Data Analysis and Its Scheduling Policy – Hadoop
 
G017143640
G017143640G017143640
G017143640
 
Introduction to Big Data and Hadoop using Local Standalone Mode
Introduction to Big Data and Hadoop using Local Standalone ModeIntroduction to Big Data and Hadoop using Local Standalone Mode
Introduction to Big Data and Hadoop using Local Standalone Mode
 
Hadoop info
Hadoop infoHadoop info
Hadoop info
 
Introduction to Apache Hadoop Eco-System
Introduction to Apache Hadoop Eco-SystemIntroduction to Apache Hadoop Eco-System
Introduction to Apache Hadoop Eco-System
 
Big Data Hadoop Technology
Big Data Hadoop TechnologyBig Data Hadoop Technology
Big Data Hadoop Technology
 
What is hadoop
What is hadoopWhat is hadoop
What is hadoop
 
Hybrid Data Warehouse Hadoop Implementations
Hybrid Data Warehouse Hadoop ImplementationsHybrid Data Warehouse Hadoop Implementations
Hybrid Data Warehouse Hadoop Implementations
 
Big Data
Big DataBig Data
Big Data
 
BIGDATA MODULE 3.pdf
BIGDATA MODULE 3.pdfBIGDATA MODULE 3.pdf
BIGDATA MODULE 3.pdf
 
Big Data: RDBMS vs. Hadoop vs. Spark
Big Data: RDBMS vs. Hadoop vs. SparkBig Data: RDBMS vs. Hadoop vs. Spark
Big Data: RDBMS vs. Hadoop vs. Spark
 
A Glimpse of Bigdata - Introduction
A Glimpse of Bigdata - IntroductionA Glimpse of Bigdata - Introduction
A Glimpse of Bigdata - Introduction
 
finap ppt conference.pptx
finap ppt conference.pptxfinap ppt conference.pptx
finap ppt conference.pptx
 
Hadoop and its role in Facebook: An Overview
Hadoop and its role in Facebook: An OverviewHadoop and its role in Facebook: An Overview
Hadoop and its role in Facebook: An Overview
 

More from arhamgarmentsdelhi

In Drosophila melanogaster, the yellow body gene is X-linked. The yel.pdf
In Drosophila melanogaster, the yellow body gene is X-linked. The yel.pdfIn Drosophila melanogaster, the yellow body gene is X-linked. The yel.pdf
In Drosophila melanogaster, the yellow body gene is X-linked. The yel.pdf
arhamgarmentsdelhi
 
How can Virtual Communities, Geographical Information Systems, and G.pdf
How can Virtual Communities, Geographical Information Systems, and G.pdfHow can Virtual Communities, Geographical Information Systems, and G.pdf
How can Virtual Communities, Geographical Information Systems, and G.pdf
arhamgarmentsdelhi
 
Here is what a Dubai-Sharjah metro link could meanStudy shows .pdf
Here is what a Dubai-Sharjah metro link could meanStudy shows .pdfHere is what a Dubai-Sharjah metro link could meanStudy shows .pdf
Here is what a Dubai-Sharjah metro link could meanStudy shows .pdf
arhamgarmentsdelhi
 
For each mammalian organ system, list the organs and their function. .pdf
For each mammalian organ system, list the organs and their function. .pdfFor each mammalian organ system, list the organs and their function. .pdf
For each mammalian organ system, list the organs and their function. .pdf
arhamgarmentsdelhi
 
Describe how sexual orientation is a product of biology and soci.pdf
Describe how sexual orientation is a product of biology and soci.pdfDescribe how sexual orientation is a product of biology and soci.pdf
Describe how sexual orientation is a product of biology and soci.pdf
arhamgarmentsdelhi
 
Define social contract theory. What does it attempt to explainS.pdf
Define social contract theory. What does it attempt to explainS.pdfDefine social contract theory. What does it attempt to explainS.pdf
Define social contract theory. What does it attempt to explainS.pdf
arhamgarmentsdelhi
 
A) Explain five (5) challenges amphibians faced when colonizing land.pdf
A) Explain five (5) challenges amphibians faced when colonizing land.pdfA) Explain five (5) challenges amphibians faced when colonizing land.pdf
A) Explain five (5) challenges amphibians faced when colonizing land.pdf
arhamgarmentsdelhi
 
a. Explain how chronic bleeding leads to iron deficiency anemia. b. .pdf
a. Explain how chronic bleeding leads to iron deficiency anemia. b. .pdfa. Explain how chronic bleeding leads to iron deficiency anemia. b. .pdf
a. Explain how chronic bleeding leads to iron deficiency anemia. b. .pdf
arhamgarmentsdelhi
 
8. Identify the causes and symptoms of endocarditis, myocarditis, an.pdf
8. Identify the causes and symptoms of endocarditis, myocarditis, an.pdf8. Identify the causes and symptoms of endocarditis, myocarditis, an.pdf
8. Identify the causes and symptoms of endocarditis, myocarditis, an.pdf
arhamgarmentsdelhi
 
17 Testing Your Knowledge s. Why are biologists sointerested inchemis.pdf
17 Testing Your Knowledge s. Why are biologists sointerested inchemis.pdf17 Testing Your Knowledge s. Why are biologists sointerested inchemis.pdf
17 Testing Your Knowledge s. Why are biologists sointerested inchemis.pdf
arhamgarmentsdelhi
 

More from arhamgarmentsdelhi (20)

For z 2.85, in which of the following graphs does the shaded area .pdf
For z  2.85, in which of the following graphs does the shaded area .pdfFor z  2.85, in which of the following graphs does the shaded area .pdf
For z 2.85, in which of the following graphs does the shaded area .pdf
 
In Drosophila melanogaster, the yellow body gene is X-linked. The yel.pdf
In Drosophila melanogaster, the yellow body gene is X-linked. The yel.pdfIn Drosophila melanogaster, the yellow body gene is X-linked. The yel.pdf
In Drosophila melanogaster, the yellow body gene is X-linked. The yel.pdf
 
How can Virtual Communities, Geographical Information Systems, and G.pdf
How can Virtual Communities, Geographical Information Systems, and G.pdfHow can Virtual Communities, Geographical Information Systems, and G.pdf
How can Virtual Communities, Geographical Information Systems, and G.pdf
 
Here is what a Dubai-Sharjah metro link could meanStudy shows .pdf
Here is what a Dubai-Sharjah metro link could meanStudy shows .pdfHere is what a Dubai-Sharjah metro link could meanStudy shows .pdf
Here is what a Dubai-Sharjah metro link could meanStudy shows .pdf
 
For each mammalian organ system, list the organs and their function. .pdf
For each mammalian organ system, list the organs and their function. .pdfFor each mammalian organ system, list the organs and their function. .pdf
For each mammalian organ system, list the organs and their function. .pdf
 
Direct but short answer to the following problemBased on intervie.pdf
Direct but short answer to the following problemBased on intervie.pdfDirect but short answer to the following problemBased on intervie.pdf
Direct but short answer to the following problemBased on intervie.pdf
 
Describe how sexual orientation is a product of biology and soci.pdf
Describe how sexual orientation is a product of biology and soci.pdfDescribe how sexual orientation is a product of biology and soci.pdf
Describe how sexual orientation is a product of biology and soci.pdf
 
Define social contract theory. What does it attempt to explainS.pdf
Define social contract theory. What does it attempt to explainS.pdfDefine social contract theory. What does it attempt to explainS.pdf
Define social contract theory. What does it attempt to explainS.pdf
 
Assume that you have the following set of hexadecimal values $20, $.pdf
Assume that you have the following set of hexadecimal values $20, $.pdfAssume that you have the following set of hexadecimal values $20, $.pdf
Assume that you have the following set of hexadecimal values $20, $.pdf
 
Briefly explain why heating water close to boiling in a test tube wo.pdf
Briefly explain why heating water close to boiling in a test tube wo.pdfBriefly explain why heating water close to boiling in a test tube wo.pdf
Briefly explain why heating water close to boiling in a test tube wo.pdf
 
AP Biology II -1. What substrate does the enzyme Potassium Cyanide.pdf
AP Biology II -1. What substrate does the enzyme Potassium Cyanide.pdfAP Biology II -1. What substrate does the enzyme Potassium Cyanide.pdf
AP Biology II -1. What substrate does the enzyme Potassium Cyanide.pdf
 
allocation SolutionThe correct option is Depreciation.Depreci.pdf
allocation SolutionThe correct option is Depreciation.Depreci.pdfallocation SolutionThe correct option is Depreciation.Depreci.pdf
allocation SolutionThe correct option is Depreciation.Depreci.pdf
 
A) Explain five (5) challenges amphibians faced when colonizing land.pdf
A) Explain five (5) challenges amphibians faced when colonizing land.pdfA) Explain five (5) challenges amphibians faced when colonizing land.pdf
A) Explain five (5) challenges amphibians faced when colonizing land.pdf
 
Classful addressing describes the IP address scheme thata.) divid.pdf
Classful addressing describes the IP address scheme thata.) divid.pdfClassful addressing describes the IP address scheme thata.) divid.pdf
Classful addressing describes the IP address scheme thata.) divid.pdf
 
a. Explain how chronic bleeding leads to iron deficiency anemia. b. .pdf
a. Explain how chronic bleeding leads to iron deficiency anemia. b. .pdfa. Explain how chronic bleeding leads to iron deficiency anemia. b. .pdf
a. Explain how chronic bleeding leads to iron deficiency anemia. b. .pdf
 
8. Identify the causes and symptoms of endocarditis, myocarditis, an.pdf
8. Identify the causes and symptoms of endocarditis, myocarditis, an.pdf8. Identify the causes and symptoms of endocarditis, myocarditis, an.pdf
8. Identify the causes and symptoms of endocarditis, myocarditis, an.pdf
 
Can someone please help me with this question3. Describe the funct.pdf
Can someone please help me with this question3. Describe the funct.pdfCan someone please help me with this question3. Describe the funct.pdf
Can someone please help me with this question3. Describe the funct.pdf
 
2. The probability that a phone call will last more than t minutes i.pdf
2. The probability that a phone call will last more than t minutes i.pdf2. The probability that a phone call will last more than t minutes i.pdf
2. The probability that a phone call will last more than t minutes i.pdf
 
28. Which of the following is an example of temporal structure in a .pdf
28. Which of the following is an example of temporal structure in a .pdf28. Which of the following is an example of temporal structure in a .pdf
28. Which of the following is an example of temporal structure in a .pdf
 
17 Testing Your Knowledge s. Why are biologists sointerested inchemis.pdf
17 Testing Your Knowledge s. Why are biologists sointerested inchemis.pdf17 Testing Your Knowledge s. Why are biologists sointerested inchemis.pdf
17 Testing Your Knowledge s. Why are biologists sointerested inchemis.pdf
 

Recently uploaded

BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
SoniaTolstoy
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
QucHHunhnh
 

Recently uploaded (20)

IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SD
 
Advance Mobile Application Development class 07
Advance Mobile Application Development class 07Advance Mobile Application Development class 07
Advance Mobile Application Development class 07
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptx
 
fourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writingfourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writing
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdf
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communication
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
 
Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1
 
Disha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdfDisha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdf
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impact
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy Reform
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activity
 
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptxINDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdf
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
 

Discuss the advantages of Hadoop technology and distributed data fil.pdf

  • 1. Discuss the advantages of Hadoop technology and distributed data file systems. How is an Hadoop Distributed File System different from a Relational Database system? What organizational issues are best solved using Hadoop technology? Give examples of the type of data they will analyze. What companies currently use Hadoopo related technologies. Solution The advantages of Hadoop technology 1. Scalable Hadoop is a highly scalable storage platform, because it can store and distribute very large data sets across hundreds of inexpensive servers that operate in parallel. Unlike traditional relational database systems (RDBMS) that can't scale to process large amounts of data, Hadoop enables businesses to run applications on thousands of nodes involving thousands of terabytes of data. 2. Cost effective Hadoop also offers a cost effective storage solution for businesses' exploding data sets. The problem with traditional relational database management systems is that it is extremely cost prohibitive to scale to such a degree in order to process such massive volumes of data. In an effort to reduce costs, many companies in the past would have had to down-sample data and classify it based on certain assumptions as to which data was the most valuable. The raw data would be deleted, as it would be too cost-prohibitive to keep. While this approach may have worked in the short term, this meant that when business priorities changed, the complete raw data set was not available, as it was too expensive to store. Hadoop, on the other hand, is designed as a scale-out architecture that can affordably store all of a company's data for later use. The cost savings are staggering: instead of costing thousands to tens of thousands of pounds per terabyte, Hadoop offers computing and storage capabilities for hundreds of pounds per terabyte. 3. Flexible Hadoop enables businesses to easily access new data sources and tap into different types of data (both structured and unstructured) to generate value from that data. This means businesses can use Hadoop to derive valuable business insights from data sources such as social media, email conversations or clickstream data. In addition, Hadoop can be used for a wide variety of purposes, such as log processing, recommendation systems, data warehousing, market campaign analysis and fraud detection. 4. Fast Hadoop's unique storage method is based on a distributed file system that basically 'maps' data
  • 2. wherever it is located on a cluster. The tools for data processing are often on the same servers where the data is located, resulting in much faster data processing. If you're dealing with large volumes of unstructured data, Hadoop is able to efficiently process terabytes of data in just minutes, and petabytes in hours. 5. Resilient to failure A key advantage of using Hadoop is its fault tolerance. When data is sent to an individual node, that data is also replicated to other nodes in the cluster, which means that in the event of failure, there is another copy available for use. The MapR distribution goes beyond that by eliminating the NameNode and replacing it with a distributed No NameNode architecture that provides true high availability. Our architecture provides protection from both single and multiple failures. The advantages ofdistributed data file systems: Sharing Data : There is a provision in the environment where user at one site may be able to access the data residing at other sites. Autonomy : Because of sharing data by means of data distribution each site is able to retain a degree of control over data that are stored locally. In distributed system there is a global database administrator responsible for the entire system. A part of global data base administrator responsibilities is delegated to local data base administrator for each site. Depending upon the design of distributed database each local database administrator may have different degree of local autonomy. Availability : If one site fails in a distributed system, the remaining sites may be able to continue operating. Thus a failure of a site doesn't necessarily imply the shutdown of the System. How is an Hadoop Distributed File System different from a Relational Database system? Hadoop framework works very well with structured and unstructured data. This also supports variety of data formats in real time such as XML, JSON and text based flat file formats. However, RDBMS only work with better when an entity relationship model (ER model) is defined perfectly and therefore, the database schema or structure can grow and unmanaged otherwise. i.e., An RDBMS works well with structured data. Hadoop will be a choice in environments such as when there are needs for BIG data processing on which the data being processed does not have consistent relationships. Where the data size is too BIG for complex processing, or not easy to define the relationships between the data, then it becomes difficult to save the extracted information in an RDBMS with a coherent relationship. What organizational issues are best solved using Hadoop technology?
  • 3. 1. eCommerce Challenges. 2. big data problem. 3. banking 4. cases in retail. examples of the type of data they will analyze: 1. Analyze life-threatening risks 2. Identify warning signs of security breaches 3. Prevent hardware failure 4. Understand what people think about your company 5. Understand when to sell certain products 6. Find your ideal prospects 7. Gain insight from your log files companies currently use Hadoopo related technologies: