SlideShare a Scribd company logo
1 of 9
4 Layers Everyone
Must Know
BIG Data
There is still so much confusion
surrounding Big Data.
We thought it might help to clarify the 4
key layers of a big data system - i.e.
the different stages the data itself has
to pass through on its journey from raw
statistic or snippet of unstructured data
(for example, social media post) to
actionable insight.
The whole point of a big data strategy
is to develop a system which moves
data along this path – raw data to
actionable insights.
Here, we will attempt to define the basic
layers you will need to have in place, if
you are getting to grips with how big
data could help your business.
Although people have
come up with different
names for these layers,
as we’re charting a brave
new world where little is
set in stone, We think this
is the simplest and most
accurate breakdown:
1. Data Source Layer
3. Data Processing /
Analysis Layer
2. Data Storage Layer
4. Data Output Layer
Data sources layer
This is where the data arrives at your
organization. It includes everything from
your sales records, customer database,
feedback, social media channels, marketing
list, email archives and any data gleaned
from monitoring or measuring aspects of
your operations. One of the first steps in
setting up a data strategy is assessing what
you have here, and measuring it against
what you need to answer the critical
questions you want help with. You might
have everything you need already, or you
might need to establish new sources.
1. Data Source Layer
3. Data Processing /
Analysis Layer
2. Data Storage Layer
4. Data Output Layer
Data storage layer
This is where your Big Data lives, once it is
gathered from your sources. As the volume
of data generated and stored by companies
has started to explode, sophisticated but
accessible systems and tools have been
developed – such as Apache Hadoop DFS
(distributed file system), or Google File
System, to help with this task. As well as a
system for storing data that your computer
system will understand (the file system) you
will need a system for organizing and
categorizing it in a way that people will
understand – the database. Hadoop has its
own, known as HBase, but others including
Amazon’s DynamoDB, MongoDB and
Cassandra (used by Facebook), all based
on the NoSQL architecture, are popular too.
1. Data Source Layer
3. Data Processing /
Analysis Layer
2. Data Storage Layer
4. Data Output Layer
Data processing/ analysis
layer
When you want to use the data you have
stored to find out something useful, you
will need to process and analyze it. A
common method is by using a
MapReduce tool. Essentially, this is used
to select the elements of the data that you
want to analyze, and putting it into a
format from which insights can be
gleaned. If you are a large organization
which has invested in its own data
analytics team, they will form a part of this
layer, too. They will employ tools such as
Apache PIG or HIVE to query the data,
and might use automated pattern
recognition tools to determine trends, as
well as drawing their conclusions from
1. Data Source Layer
3. Data Processing /
Analysis Layer
2. Data Storage Layer
4. Data Output Layer
Data output layer
This is how the insights gleaned through
the analysis is passed on to the people
who can take action to benefit from
them. Clear and concise communication
(particularly if your decision-makers
don’t have a background in statistics) is
essential, and this output can take the
form of reports, charts, figures and key
recommendations. Ultimately, your Big
Data system’s main task is to show, at
this stage of the process, how
measurable improvement in at least one
KPI that can be achieved by taking
action based on the analysis you have
carried out.
1. Data Source Layer
3. Data Processing /
Analysis Layer
2. Data Storage Layer
4. Data Output Layer
Stay Tuned With Us for More
Information
https://www.linkedin.com/company/tyronesystems
https://twitter.com/tyronesystems
https://www.facebook.com/tyronesystems

More Related Content

More from Tyrone Systems

5 ways hpc can provides cost savings and flexibility to meet the technology i...
5 ways hpc can provides cost savings and flexibility to meet the technology i...5 ways hpc can provides cost savings and flexibility to meet the technology i...
5 ways hpc can provides cost savings and flexibility to meet the technology i...Tyrone Systems
 
How Emerging Technologies are Enabling The Banking Industry
How Emerging Technologies are Enabling The Banking IndustryHow Emerging Technologies are Enabling The Banking Industry
How Emerging Technologies are Enabling The Banking IndustryTyrone Systems
 
Five Exciting Ways HCI can accelerates digital transformation for Media and E...
Five Exciting Ways HCI can accelerates digital transformation for Media and E...Five Exciting Ways HCI can accelerates digital transformation for Media and E...
Five Exciting Ways HCI can accelerates digital transformation for Media and E...Tyrone Systems
 
Design and Optimize your code for high-performance with Intel® Advisor and I...
Design and Optimize your code for high-performance with Intel®  Advisor and I...Design and Optimize your code for high-performance with Intel®  Advisor and I...
Design and Optimize your code for high-performance with Intel® Advisor and I...Tyrone Systems
 
Fast-Track Your Digital Transformation with Intelligent Automation
Fast-Track Your Digital Transformation with Intelligent AutomationFast-Track Your Digital Transformation with Intelligent Automation
Fast-Track Your Digital Transformation with Intelligent AutomationTyrone Systems
 
Top Five benefits of Hyper-Converged Infrastructure
Top Five benefits of Hyper-Converged InfrastructureTop Five benefits of Hyper-Converged Infrastructure
Top Five benefits of Hyper-Converged InfrastructureTyrone Systems
 
An Effective Approach to Cloud Migration for Small and Medium Enterprises (SMEs)
An Effective Approach to Cloud Migration for Small and Medium Enterprises (SMEs)An Effective Approach to Cloud Migration for Small and Medium Enterprises (SMEs)
An Effective Approach to Cloud Migration for Small and Medium Enterprises (SMEs)Tyrone Systems
 
How can Artificial Intelligence improve software development process?
How can Artificial Intelligence improve software development process?How can Artificial Intelligence improve software development process?
How can Artificial Intelligence improve software development process?Tyrone Systems
 
3 Ways Machine Learning Facilitates Fraud Detection
3 Ways Machine Learning Facilitates Fraud Detection3 Ways Machine Learning Facilitates Fraud Detection
3 Ways Machine Learning Facilitates Fraud DetectionTyrone Systems
 
Four ways to digitally transform with HPC in the cloud
Four ways to digitally transform with HPC in the cloudFour ways to digitally transform with HPC in the cloud
Four ways to digitally transform with HPC in the cloudTyrone Systems
 
How to Secure Containerized Environments?
How to Secure Containerized Environments?How to Secure Containerized Environments?
How to Secure Containerized Environments?Tyrone Systems
 
OneAPI Series 2 Webinar - 9th, Dec-20
OneAPI Series 2 Webinar - 9th, Dec-20OneAPI Series 2 Webinar - 9th, Dec-20
OneAPI Series 2 Webinar - 9th, Dec-20Tyrone Systems
 
OneAPI dpc++ Virtual Workshop 9th Dec-20
OneAPI dpc++ Virtual Workshop 9th Dec-20OneAPI dpc++ Virtual Workshop 9th Dec-20
OneAPI dpc++ Virtual Workshop 9th Dec-20Tyrone Systems
 
oneAPI: Industry Initiative & Intel Product
oneAPI: Industry Initiative & Intel ProductoneAPI: Industry Initiative & Intel Product
oneAPI: Industry Initiative & Intel ProductTyrone Systems
 
Tyrone-Intel oneAPI Webinar: Optimized Tools for Performance-Driven, Cross-Ar...
Tyrone-Intel oneAPI Webinar: Optimized Tools for Performance-Driven, Cross-Ar...Tyrone-Intel oneAPI Webinar: Optimized Tools for Performance-Driven, Cross-Ar...
Tyrone-Intel oneAPI Webinar: Optimized Tools for Performance-Driven, Cross-Ar...Tyrone Systems
 
Top 5 Benefits of Hyper-Converged Infrastructure
Top 5 Benefits of Hyper-Converged InfrastructureTop 5 Benefits of Hyper-Converged Infrastructure
Top 5 Benefits of Hyper-Converged InfrastructureTyrone Systems
 
4 Ways Cloud Technology Can Enhance Your Retail Business
4 Ways Cloud Technology Can Enhance Your Retail Business4 Ways Cloud Technology Can Enhance Your Retail Business
4 Ways Cloud Technology Can Enhance Your Retail BusinessTyrone Systems
 
What are the Key Megatrends for Adoption of Containers in Enterprise IT?
What are the Key Megatrends for Adoption of Containers in Enterprise IT?What are the Key Megatrends for Adoption of Containers in Enterprise IT?
What are the Key Megatrends for Adoption of Containers in Enterprise IT?Tyrone Systems
 
How are Financial institutions are Leveraging Big Data to Transform Digitally
How are Financial institutions are Leveraging Big Data to Transform DigitallyHow are Financial institutions are Leveraging Big Data to Transform Digitally
How are Financial institutions are Leveraging Big Data to Transform DigitallyTyrone Systems
 
4 Ways IT Automation can help your Organization Survive the Coronavirus
4 Ways IT Automation can help your Organization Survive the Coronavirus4 Ways IT Automation can help your Organization Survive the Coronavirus
4 Ways IT Automation can help your Organization Survive the CoronavirusTyrone Systems
 

More from Tyrone Systems (20)

5 ways hpc can provides cost savings and flexibility to meet the technology i...
5 ways hpc can provides cost savings and flexibility to meet the technology i...5 ways hpc can provides cost savings and flexibility to meet the technology i...
5 ways hpc can provides cost savings and flexibility to meet the technology i...
 
How Emerging Technologies are Enabling The Banking Industry
How Emerging Technologies are Enabling The Banking IndustryHow Emerging Technologies are Enabling The Banking Industry
How Emerging Technologies are Enabling The Banking Industry
 
Five Exciting Ways HCI can accelerates digital transformation for Media and E...
Five Exciting Ways HCI can accelerates digital transformation for Media and E...Five Exciting Ways HCI can accelerates digital transformation for Media and E...
Five Exciting Ways HCI can accelerates digital transformation for Media and E...
 
Design and Optimize your code for high-performance with Intel® Advisor and I...
Design and Optimize your code for high-performance with Intel®  Advisor and I...Design and Optimize your code for high-performance with Intel®  Advisor and I...
Design and Optimize your code for high-performance with Intel® Advisor and I...
 
Fast-Track Your Digital Transformation with Intelligent Automation
Fast-Track Your Digital Transformation with Intelligent AutomationFast-Track Your Digital Transformation with Intelligent Automation
Fast-Track Your Digital Transformation with Intelligent Automation
 
Top Five benefits of Hyper-Converged Infrastructure
Top Five benefits of Hyper-Converged InfrastructureTop Five benefits of Hyper-Converged Infrastructure
Top Five benefits of Hyper-Converged Infrastructure
 
An Effective Approach to Cloud Migration for Small and Medium Enterprises (SMEs)
An Effective Approach to Cloud Migration for Small and Medium Enterprises (SMEs)An Effective Approach to Cloud Migration for Small and Medium Enterprises (SMEs)
An Effective Approach to Cloud Migration for Small and Medium Enterprises (SMEs)
 
How can Artificial Intelligence improve software development process?
How can Artificial Intelligence improve software development process?How can Artificial Intelligence improve software development process?
How can Artificial Intelligence improve software development process?
 
3 Ways Machine Learning Facilitates Fraud Detection
3 Ways Machine Learning Facilitates Fraud Detection3 Ways Machine Learning Facilitates Fraud Detection
3 Ways Machine Learning Facilitates Fraud Detection
 
Four ways to digitally transform with HPC in the cloud
Four ways to digitally transform with HPC in the cloudFour ways to digitally transform with HPC in the cloud
Four ways to digitally transform with HPC in the cloud
 
How to Secure Containerized Environments?
How to Secure Containerized Environments?How to Secure Containerized Environments?
How to Secure Containerized Environments?
 
OneAPI Series 2 Webinar - 9th, Dec-20
OneAPI Series 2 Webinar - 9th, Dec-20OneAPI Series 2 Webinar - 9th, Dec-20
OneAPI Series 2 Webinar - 9th, Dec-20
 
OneAPI dpc++ Virtual Workshop 9th Dec-20
OneAPI dpc++ Virtual Workshop 9th Dec-20OneAPI dpc++ Virtual Workshop 9th Dec-20
OneAPI dpc++ Virtual Workshop 9th Dec-20
 
oneAPI: Industry Initiative & Intel Product
oneAPI: Industry Initiative & Intel ProductoneAPI: Industry Initiative & Intel Product
oneAPI: Industry Initiative & Intel Product
 
Tyrone-Intel oneAPI Webinar: Optimized Tools for Performance-Driven, Cross-Ar...
Tyrone-Intel oneAPI Webinar: Optimized Tools for Performance-Driven, Cross-Ar...Tyrone-Intel oneAPI Webinar: Optimized Tools for Performance-Driven, Cross-Ar...
Tyrone-Intel oneAPI Webinar: Optimized Tools for Performance-Driven, Cross-Ar...
 
Top 5 Benefits of Hyper-Converged Infrastructure
Top 5 Benefits of Hyper-Converged InfrastructureTop 5 Benefits of Hyper-Converged Infrastructure
Top 5 Benefits of Hyper-Converged Infrastructure
 
4 Ways Cloud Technology Can Enhance Your Retail Business
4 Ways Cloud Technology Can Enhance Your Retail Business4 Ways Cloud Technology Can Enhance Your Retail Business
4 Ways Cloud Technology Can Enhance Your Retail Business
 
What are the Key Megatrends for Adoption of Containers in Enterprise IT?
What are the Key Megatrends for Adoption of Containers in Enterprise IT?What are the Key Megatrends for Adoption of Containers in Enterprise IT?
What are the Key Megatrends for Adoption of Containers in Enterprise IT?
 
How are Financial institutions are Leveraging Big Data to Transform Digitally
How are Financial institutions are Leveraging Big Data to Transform DigitallyHow are Financial institutions are Leveraging Big Data to Transform Digitally
How are Financial institutions are Leveraging Big Data to Transform Digitally
 
4 Ways IT Automation can help your Organization Survive the Coronavirus
4 Ways IT Automation can help your Organization Survive the Coronavirus4 Ways IT Automation can help your Organization Survive the Coronavirus
4 Ways IT Automation can help your Organization Survive the Coronavirus
 

Recently uploaded

Bluetooth Controlled Car with Arduino.pdf
Bluetooth Controlled Car with Arduino.pdfBluetooth Controlled Car with Arduino.pdf
Bluetooth Controlled Car with Arduino.pdfngoud9212
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024The Digital Insurer
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsAndrey Dotsenko
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphNeo4j
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024BookNet Canada
 
Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Neo4j
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksSoftradix Technologies
 
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsSnow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsHyundai Motor Group
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
costume and set research powerpoint presentation
costume and set research powerpoint presentationcostume and set research powerpoint presentation
costume and set research powerpoint presentationphoebematthew05
 
Unlocking the Potential of the Cloud for IBM Power Systems
Unlocking the Potential of the Cloud for IBM Power SystemsUnlocking the Potential of the Cloud for IBM Power Systems
Unlocking the Potential of the Cloud for IBM Power SystemsPrecisely
 

Recently uploaded (20)

Bluetooth Controlled Car with Arduino.pdf
Bluetooth Controlled Car with Arduino.pdfBluetooth Controlled Car with Arduino.pdf
Bluetooth Controlled Car with Arduino.pdf
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Vulnerability_Management_GRC_by Sohang Sengupta.pptx
Vulnerability_Management_GRC_by Sohang Sengupta.pptxVulnerability_Management_GRC_by Sohang Sengupta.pptx
Vulnerability_Management_GRC_by Sohang Sengupta.pptx
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
The transition to renewables in India.pdf
The transition to renewables in India.pdfThe transition to renewables in India.pdf
The transition to renewables in India.pdf
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
 
Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other Frameworks
 
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsSnow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
costume and set research powerpoint presentation
costume and set research powerpoint presentationcostume and set research powerpoint presentation
costume and set research powerpoint presentation
 
Unlocking the Potential of the Cloud for IBM Power Systems
Unlocking the Potential of the Cloud for IBM Power SystemsUnlocking the Potential of the Cloud for IBM Power Systems
Unlocking the Potential of the Cloud for IBM Power Systems
 

Big data 4 layers

  • 1. 4 Layers Everyone Must Know BIG Data
  • 2. There is still so much confusion surrounding Big Data. We thought it might help to clarify the 4 key layers of a big data system - i.e. the different stages the data itself has to pass through on its journey from raw statistic or snippet of unstructured data (for example, social media post) to actionable insight.
  • 3. The whole point of a big data strategy is to develop a system which moves data along this path – raw data to actionable insights. Here, we will attempt to define the basic layers you will need to have in place, if you are getting to grips with how big data could help your business.
  • 4. Although people have come up with different names for these layers, as we’re charting a brave new world where little is set in stone, We think this is the simplest and most accurate breakdown: 1. Data Source Layer 3. Data Processing / Analysis Layer 2. Data Storage Layer 4. Data Output Layer
  • 5. Data sources layer This is where the data arrives at your organization. It includes everything from your sales records, customer database, feedback, social media channels, marketing list, email archives and any data gleaned from monitoring or measuring aspects of your operations. One of the first steps in setting up a data strategy is assessing what you have here, and measuring it against what you need to answer the critical questions you want help with. You might have everything you need already, or you might need to establish new sources. 1. Data Source Layer 3. Data Processing / Analysis Layer 2. Data Storage Layer 4. Data Output Layer
  • 6. Data storage layer This is where your Big Data lives, once it is gathered from your sources. As the volume of data generated and stored by companies has started to explode, sophisticated but accessible systems and tools have been developed – such as Apache Hadoop DFS (distributed file system), or Google File System, to help with this task. As well as a system for storing data that your computer system will understand (the file system) you will need a system for organizing and categorizing it in a way that people will understand – the database. Hadoop has its own, known as HBase, but others including Amazon’s DynamoDB, MongoDB and Cassandra (used by Facebook), all based on the NoSQL architecture, are popular too. 1. Data Source Layer 3. Data Processing / Analysis Layer 2. Data Storage Layer 4. Data Output Layer
  • 7. Data processing/ analysis layer When you want to use the data you have stored to find out something useful, you will need to process and analyze it. A common method is by using a MapReduce tool. Essentially, this is used to select the elements of the data that you want to analyze, and putting it into a format from which insights can be gleaned. If you are a large organization which has invested in its own data analytics team, they will form a part of this layer, too. They will employ tools such as Apache PIG or HIVE to query the data, and might use automated pattern recognition tools to determine trends, as well as drawing their conclusions from 1. Data Source Layer 3. Data Processing / Analysis Layer 2. Data Storage Layer 4. Data Output Layer
  • 8. Data output layer This is how the insights gleaned through the analysis is passed on to the people who can take action to benefit from them. Clear and concise communication (particularly if your decision-makers don’t have a background in statistics) is essential, and this output can take the form of reports, charts, figures and key recommendations. Ultimately, your Big Data system’s main task is to show, at this stage of the process, how measurable improvement in at least one KPI that can be achieved by taking action based on the analysis you have carried out. 1. Data Source Layer 3. Data Processing / Analysis Layer 2. Data Storage Layer 4. Data Output Layer
  • 9. Stay Tuned With Us for More Information https://www.linkedin.com/company/tyronesystems https://twitter.com/tyronesystems https://www.facebook.com/tyronesystems