SlideShare a Scribd company logo
1 of 13
What
Is
Big Data Pipeline
www.technogeekscs.com
"Data pipelines" are a
collection of processes that
transmit data from one
location to another
location.
www.technogeekscs.com
The end-to-end process of
gathering data,
turning it into insights
and models,
disseminating insights,
and applying the model
whenever and wherever
the action is required to
achieve the business goal
is stitched together by a
data pipeline.
www.technogeekscs.com
Architects and
developers have
had to adjust to "big
data" because of
the significantly
increased
volume,
diversity,
and velocity
of data in recent years.
www.technogeekscs.com
Big data indicates
that there is a huge
amount of data to
manage.
This data can
present potential
for use cases
including, among
many others,
alerting, real-time
reporting, and
predictive analytics.
www.technogeekscs.com
Big data indicates
that there is a huge
amount of data to
manage.
This data can
present potential
for use cases
including, among
many others,
alerting, real-time
reporting, and
predictive analytics.
www.technogeekscs.com
Big data pipelines perform the same
job as smaller data pipelines.
However, you can extract,
transform, and load (ETL) massive
amounts of data using big data
pipelines. The distinction is
significant since analysts anticipate
that data output will skyrocket in
the future.
Transform Load
Extract
www.technogeekscs.com
Big data pipelines are created to
support one or more of the three
characteristics of big data.
Big data's rapid growth makes it
appealing to build streaming data
pipelines.
The ability to collect and interpret
data in real-time enables prompt
action.
www.technogeekscs.com
The volume of big data requires
data pipelines to be scalable, which
might fluctuate over time.
Scalable and efficient data
pipelines are as important for the
success of analytics, data science,
and machine learning as reliable
supply lines are for staying in
business.
www.technogeekscs.com
The big data pipeline must be
to process
substantial
volumes of data
concurrently
because, in
practice,
able to scale
multiple events are
likely to occur at
once or very close
together.
www.technogeekscs.com
Due to the diversity
of big data, large
data pipelines must
be able to recognise
and process data
in a wide variety
of formats,
including
structured,
unstructured,
and semi-
structured.
www.technogeekscs.com
Data pipelines have
five stages grouped
into three heads:
Analytics / Machine
Learning: computation
(~25% effort)
Data Engineering:
collection, ingestion,
preparation (~50%
effort)
Delivery: presentation
(~25% effort)
For more Content Like THIS
Can't decide which course to
take?
Call Us and get free career
councelling

More Related Content

Similar to What is Big Data Pipe?

Delivering on the Promise of Big Data and the Cloud
Delivering on the Promise of Big Data and the CloudDelivering on the Promise of Big Data and the Cloud
Delivering on the Promise of Big Data and the Cloud
Booz Allen Hamilton
 
Review of big data analytics (bda) architecture trends and analysis
Review of big data analytics (bda) architecture   trends and analysis Review of big data analytics (bda) architecture   trends and analysis
Review of big data analytics (bda) architecture trends and analysis
Conference Papers
 

Similar to What is Big Data Pipe? (20)

elgendy2014.pdf
elgendy2014.pdfelgendy2014.pdf
elgendy2014.pdf
 
201506 OSIsoft Garter Big Data.pdf
201506 OSIsoft Garter Big Data.pdf201506 OSIsoft Garter Big Data.pdf
201506 OSIsoft Garter Big Data.pdf
 
Big data
Big dataBig data
Big data
 
Delivering on the Promise of Big Data and the Cloud
Delivering on the Promise of Big Data and the CloudDelivering on the Promise of Big Data and the Cloud
Delivering on the Promise of Big Data and the Cloud
 
How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...
How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...
How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...
 
QuickView #3 - Big Data
QuickView #3 - Big DataQuickView #3 - Big Data
QuickView #3 - Big Data
 
Quick view Big Data, brought by Oomph!, courtesy of our partner Sonovate
Quick view Big Data, brought by Oomph!, courtesy of our partner Sonovate Quick view Big Data, brought by Oomph!, courtesy of our partner Sonovate
Quick view Big Data, brought by Oomph!, courtesy of our partner Sonovate
 
Ab cs of big data
Ab cs of big dataAb cs of big data
Ab cs of big data
 
Data Virtualization: An Introduction
Data Virtualization: An IntroductionData Virtualization: An Introduction
Data Virtualization: An Introduction
 
A Survey on Data Mining
A Survey on Data MiningA Survey on Data Mining
A Survey on Data Mining
 
Advanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data VirtualizationAdvanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data Virtualization
 
An Encyclopedic Overview Of Big Data Analytics
An Encyclopedic Overview Of Big Data AnalyticsAn Encyclopedic Overview Of Big Data Analytics
An Encyclopedic Overview Of Big Data Analytics
 
HIGH-IMPACT USE CASES POWERED BY NEXT-GENERATION NETWORK ANALYTICS
HIGH-IMPACT USE CASES POWERED BY NEXT-GENERATION NETWORK ANALYTICSHIGH-IMPACT USE CASES POWERED BY NEXT-GENERATION NETWORK ANALYTICS
HIGH-IMPACT USE CASES POWERED BY NEXT-GENERATION NETWORK ANALYTICS
 
A Review Paper on Big Data and Hadoop for Data Science
A Review Paper on Big Data and Hadoop for Data ScienceA Review Paper on Big Data and Hadoop for Data Science
A Review Paper on Big Data and Hadoop for Data Science
 
Sample
Sample Sample
Sample
 
Data Virtualization. An Introduction (ASEAN)
Data Virtualization. An Introduction (ASEAN)Data Virtualization. An Introduction (ASEAN)
Data Virtualization. An Introduction (ASEAN)
 
Review of big data analytics (bda) architecture trends and analysis
Review of big data analytics (bda) architecture   trends and analysis Review of big data analytics (bda) architecture   trends and analysis
Review of big data analytics (bda) architecture trends and analysis
 
Future of Data Strategy (ASEAN)
Future of Data Strategy (ASEAN)Future of Data Strategy (ASEAN)
Future of Data Strategy (ASEAN)
 
Big Data Driven Solutions to Combat Covid' 19
Big Data Driven Solutions to Combat Covid' 19Big Data Driven Solutions to Combat Covid' 19
Big Data Driven Solutions to Combat Covid' 19
 
big_data.ppt
big_data.pptbig_data.ppt
big_data.ppt
 

More from sunil173422

Comparison Between Excel Vs SQL
Comparison Between Excel Vs SQL Comparison Between Excel Vs SQL
Comparison Between Excel Vs SQL
sunil173422
 

More from sunil173422 (20)

Basic Linux Commands Used In AWS
Basic Linux Commands Used In AWSBasic Linux Commands Used In AWS
Basic Linux Commands Used In AWS
 
Most-Popular-Backend-Framework.pdf
Most-Popular-Backend-Framework.pdfMost-Popular-Backend-Framework.pdf
Most-Popular-Backend-Framework.pdf
 
What Is BootStrap?
What Is BootStrap?What Is BootStrap?
What Is BootStrap?
 
Comparison Between Excel Vs SQL
Comparison Between Excel Vs SQL Comparison Between Excel Vs SQL
Comparison Between Excel Vs SQL
 
How to visualize Number using Python Library
How to visualize Number using Python Library How to visualize Number using Python Library
How to visualize Number using Python Library
 
Comparison Between Java And Python
Comparison Between Java And PythonComparison Between Java And Python
Comparison Between Java And Python
 
Hybrid Cloud Vs Multiple Cloud
Hybrid Cloud Vs Multiple CloudHybrid Cloud Vs Multiple Cloud
Hybrid Cloud Vs Multiple Cloud
 
Comparison Between react js & react native
Comparison Between react js & react nativeComparison Between react js & react native
Comparison Between react js & react native
 
ETL Testing Vs Database Testing
ETL Testing Vs Database TestingETL Testing Vs Database Testing
ETL Testing Vs Database Testing
 
ETL Development Learning path
ETL Development Learning pathETL Development Learning path
ETL Development Learning path
 
Data Science Top Companies
Data Science Top CompaniesData Science Top Companies
Data Science Top Companies
 
Life Cycle Of Data Science Project
Life Cycle Of Data Science ProjectLife Cycle Of Data Science Project
Life Cycle Of Data Science Project
 
Components of CI/CD in DevOps
Components of CI/CD in DevOpsComponents of CI/CD in DevOps
Components of CI/CD in DevOps
 
DevOps Site Reliability Engineer Vs DevOps
DevOps Site Reliability Engineer Vs DevOpsDevOps Site Reliability Engineer Vs DevOps
DevOps Site Reliability Engineer Vs DevOps
 
Devops learning path
Devops learning pathDevops learning path
Devops learning path
 
Learn Data Science With Python
Learn Data Science With PythonLearn Data Science With Python
Learn Data Science With Python
 
Comparison ETL Vs ELT
Comparison ETL Vs ELTComparison ETL Vs ELT
Comparison ETL Vs ELT
 
Relationship Between DevOps and Cloud
Relationship Between DevOps and CloudRelationship Between DevOps and Cloud
Relationship Between DevOps and Cloud
 
Big Data Tools & Libraries
Big Data Tools & LibrariesBig Data Tools & Libraries
Big Data Tools & Libraries
 
What is Big Data?
What is Big Data?What is Big Data?
What is Big Data?
 

Recently uploaded

Seal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptxSeal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptx
negromaestrong
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
heathfieldcps1
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
ciinovamais
 
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
ZurliaSoop
 

Recently uploaded (20)

How to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POSHow to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POS
 
ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdf
 
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
 
Seal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptxSeal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptx
 
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfUGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptx
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdf
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 
Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)
 
Micro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfMicro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdf
 
ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
PROCESS RECORDING FORMAT.docx
PROCESS      RECORDING        FORMAT.docxPROCESS      RECORDING        FORMAT.docx
PROCESS RECORDING FORMAT.docx
 
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17  How to Extend Models Using Mixin ClassesMixin Classes in Odoo 17  How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
 
SOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning PresentationSOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning Presentation
 
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
 

What is Big Data Pipe?

  • 2. www.technogeekscs.com "Data pipelines" are a collection of processes that transmit data from one location to another location.
  • 3. www.technogeekscs.com The end-to-end process of gathering data, turning it into insights and models, disseminating insights, and applying the model whenever and wherever the action is required to achieve the business goal is stitched together by a data pipeline.
  • 4. www.technogeekscs.com Architects and developers have had to adjust to "big data" because of the significantly increased volume, diversity, and velocity of data in recent years.
  • 5. www.technogeekscs.com Big data indicates that there is a huge amount of data to manage. This data can present potential for use cases including, among many others, alerting, real-time reporting, and predictive analytics.
  • 6. www.technogeekscs.com Big data indicates that there is a huge amount of data to manage. This data can present potential for use cases including, among many others, alerting, real-time reporting, and predictive analytics.
  • 7. www.technogeekscs.com Big data pipelines perform the same job as smaller data pipelines. However, you can extract, transform, and load (ETL) massive amounts of data using big data pipelines. The distinction is significant since analysts anticipate that data output will skyrocket in the future. Transform Load Extract
  • 8. www.technogeekscs.com Big data pipelines are created to support one or more of the three characteristics of big data. Big data's rapid growth makes it appealing to build streaming data pipelines. The ability to collect and interpret data in real-time enables prompt action.
  • 9. www.technogeekscs.com The volume of big data requires data pipelines to be scalable, which might fluctuate over time. Scalable and efficient data pipelines are as important for the success of analytics, data science, and machine learning as reliable supply lines are for staying in business.
  • 10. www.technogeekscs.com The big data pipeline must be to process substantial volumes of data concurrently because, in practice, able to scale multiple events are likely to occur at once or very close together.
  • 11. www.technogeekscs.com Due to the diversity of big data, large data pipelines must be able to recognise and process data in a wide variety of formats, including structured, unstructured, and semi- structured.
  • 12. www.technogeekscs.com Data pipelines have five stages grouped into three heads: Analytics / Machine Learning: computation (~25% effort) Data Engineering: collection, ingestion, preparation (~50% effort) Delivery: presentation (~25% effort)
  • 13. For more Content Like THIS Can't decide which course to take? Call Us and get free career councelling