Topic Guide: Big Data as a Technology
History
The practice of accessing and using big data has been around for a long time, but the
concept gained prominence in the early 2000s, when industry analyst Doug Laney helped bring
big data into the mainstream.
Notable Moments in the history of big data are:
1881: One of the first recorded instances of data overload occurred, during tabulation of the 1880 U.S. census.
1928: A German engineer named Fritz Pfleumer developed a magnetic data storage method
on tape.
1948: Claude Shannon’s information theory was published.
1970: Edgar F. Codd presented the relational database model.
1976: Material requirements planning (MRP) systems were developed and entered commercial use.
1989: Creation of the World Wide Web.
2001: Doug Laney described the three V’s (volume, velocity, and variety) that became fundamentals of big data.
2005: Hadoop, an open-source software framework, was developed.
2007: The term big data was introduced to a wider audience.
2014: A shift occurred in the usage of ERP systems, the Internet of Things, etc.
2017: The rate of data creation increased markedly.
The term big data refers to a large, hard-to-manage volume of data that keeps growing rapidly
over time and that businesses draw on in their day-to-day growth. It includes large, complex
data sets, both structured and unstructured, that are analyzed for insights to support sound
decisions and strategic business moves. Big data is generated by, and transferred from, a
wide variety of sources.
Types of Big Data
There are three types of big data:
1. Structured: Structured big data is data whose format is known in advance. It can be
stored, accessed, and processed in a fixed format. For instance, employee data stored
in a table.
2. Unstructured: Unstructured big data is data that has no specific structure by which
to store and manage it. It comes from heterogeneous sources and includes files such
as text, images, and videos. For instance, the mixed results returned by a Google
search.
3. Semi-Structured: Semi-structured big data combines features of both structured and
unstructured data. It does not follow a fixed format, but it carries tags or markers
that organize its contents for analysis. For instance, data in an XML file.
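As a rough illustration of the three types, the sketch below reads one small sample in each form using only the Python standard library. The field names and sample values are invented for this example and do not come from any real data set.

```python
import csv
import io
import xml.etree.ElementTree as ET

# Structured: fixed columns known in advance (hypothetical employee table).
STRUCTURED = "id,name,department\n1,Asha,Finance\n2,Ben,Sales\n"

# Semi-structured: tags describe each value, but records may differ in shape.
SEMI_STRUCTURED = (
    "<employees>"
    "<employee id='1'><name>Asha</name><department>Finance</department></employee>"
    "<employee id='2'><name>Ben</name><skills><skill>SQL</skill></skills></employee>"
    "</employees>"
)

# Unstructured: free text with no predefined fields.
UNSTRUCTURED = "Asha from Finance emailed Ben in Sales about the quarterly report."


def read_structured(text):
    """Every row has the same columns, so a CSV reader maps them directly."""
    return list(csv.DictReader(io.StringIO(text)))


def read_semi_structured(text):
    """Walk the XML tags; fields that are absent are simply left out."""
    records = []
    for emp in ET.fromstring(text).findall("employee"):
        record = {"id": emp.get("id")}
        for child in emp:
            # Nested elements (e.g. skills) become a list of their children's text.
            record[child.tag] = [c.text for c in child] if len(child) else child.text
        records.append(record)
    return records


def read_unstructured(text):
    """No schema is available; a generic first step is to split into words."""
    return text.rstrip(".").split()


if __name__ == "__main__":
    print(read_structured(STRUCTURED))
    print(read_semi_structured(SEMI_STRUCTURED))
    print(read_unstructured(UNSTRUCTURED)[:5])
```

The structured reader can rely on every row having the same columns, the semi-structured reader has to tolerate records of different shapes, and the unstructured reader can only fall back on generic text handling.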
5 V’s of Big Data
The V’s are often regarded as the defining characteristics of big data. Earlier descriptions
listed three V’s; five are now commonly cited.
1. Volume: As the name suggests, big data is defined first by its size. Businesses collect
data from a variety of sources such as business transactions, email lists, images,
videos, and others, so the size of the data plays a major role in determining the
insights that can be drawn from it.
2. Velocity: Velocity in big data refers to the speed at which data is generated by
different sources such as business transactions, networks, logs, social media
connections, and others.
3. Value: However large an organization's operations, the data it derives and stores
does not count for much unless it is worth something. Data is only useful to a business
when it yields valuable insights. No data arrives with insights attached; it has to be
converted into something valuable.
4. Veracity: Because data comes in from many different sources, its quality varies.
Veracity in big data refers to the inconsistencies and uncertainties in the data
stored and managed by a business.
5. Variety: Variety in big data refers to the nature of the data stored and managed by a
business, such as structured, unstructured, and semi-structured. It involves
heterogeneous sources.
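To make these characteristics more concrete, the following minimal sketch computes one simple indicator each for volume, velocity, variety, and veracity over a small batch of events. The event records, the `profile` function, and the choice of indicators are all assumptions made for illustration; value is left out because it depends on the business question being asked rather than on the data alone.

```python
from datetime import datetime

# Hypothetical event batch: (timestamp, source, payload). None marks a missing payload.
EVENTS = [
    ("2024-01-01T10:00:00", "transactions", "order=17;total=40.00"),
    ("2024-01-01T10:00:20", "social", "Great service today!"),
    ("2024-01-01T10:00:40", "logs", None),
    ("2024-01-01T10:01:00", "transactions", "order=18;total=12.50"),
]


def profile(events):
    """Return illustrative indicators for four of the V's (expects a non-empty batch)."""
    times = [datetime.fromisoformat(ts) for ts, _, _ in events]
    span_minutes = (max(times) - min(times)).total_seconds() / 60
    payloads = [p for _, _, p in events if p is not None]
    # Fall back to the raw count when all events share a single timestamp.
    per_minute = len(events) / span_minutes if span_minutes else float(len(events))
    return {
        # Volume: total size of the payloads in bytes.
        "volume_bytes": sum(len(p.encode("utf-8")) for p in payloads),
        # Velocity: events arriving per minute over the observed window.
        "velocity_per_min": per_minute,
        # Variety: number of distinct sources feeding the batch.
        "variety_sources": len({src for _, src, _ in events}),
        # Veracity: share of events with a usable (non-missing) payload.
        "veracity_complete": len(payloads) / len(events),
    }


if __name__ == "__main__":
    for name, value in profile(EVENTS).items():
        print(f"{name}: {value}")
```

These indicators are deliberately crude. In practice each characteristic would be measured with metrics suited to the organization's own data and goals.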
