SlideShare a Scribd company logo
1 of 3
Download to read offline
www.datacademy.ai
Knowledge world
Characteristics of Big Data: Understanding
the Five V’s
Volume, Velocity, Variety, Veracity, Value
5V’s of Big Data
History
It started in the year 2001 with 3 V’s, namely Volume, Velocity and Variety.
Then Veracity got added, making it 4 V’s. Then Value got added, making it
5V’s. Later came 8Vs, 10Vs etc.
We will discuss on the important ones (5V’s) Volume, Velocity, Variety,
Veracity, and Value.
1) Volume
It refers to the size of Big Data. Data can be considered Big Data or not is based
on the volume. The rapidly increasing volume data is due to cloud-computing
traffic, IoT, mobile traffic etc.
www.datacademy.ai
Knowledge world
Data growth prediction
2) Velocity
It refers to the speed at which the data is getting accumulated. This is mainly
due to IoTs, mobile data, social media etc.
In the year 2000, Google was receiving 32.8 million searches per day. As for
2018, Google was receiving 5.6 billion searches per day!
Approximate monthly active users as of 2018:
Facebook: 2.41 billion
Instagram: 1 billion
Twitter: 320 million
LinkedIn: 575 million
Facebook monthly active users growth since 2008
3) Variety
It refers to Structured, Semi-structured and Unstructured data due to
different sources of data generated either by humans or by machines.
www.datacademy.ai
Knowledge world
Structured data: It’s the traditional data which is organized and conforms to
the formal structure of data. This data can be stored in a relational database.
Example: Bank statement containing date, time, amount etc.
Semi-structured data: It’s semi-organized data. It doesn’t conform to the
formal structure of data. Example: Log files, JSON files, Sensor data, csv files
etc.
Unstructured data: It’s not an organized data and doesn’t fit into rows and
columns structure of a relational database. Example: Text files, Emails,
images, videos, voicemails, audio files etc.
4) Veracity
It refers to the assurance of quality/integrity/credibility/accuracy of the
data. Since the data is collected from multiple sources, we need to check the
data for accuracy before using it for business insights.
5) Value
Just because we collected lots of Data, it’s of no value unless we garner some
insights out of it. Value refers to how useful the data is in decision making. We
need to extract the value of the Big Data using proper analytics.
What are the other V’s?
Viscosity (complexity or degree of correlation), Variability (inconsistency in
data flow), Volatility (durability or how long time data is valid and how long it
should be stored), Viability (capability to be live and
active), Validity (understandable to find the hidden relationships).

More Related Content

Similar to Characteristics of Big Data Understanding the Five V.pdf

Similar to Characteristics of Big Data Understanding the Five V.pdf (20)

Big data Paper
Big data PaperBig data Paper
Big data Paper
 
Final_Bigdata_pret
Final_Bigdata_pretFinal_Bigdata_pret
Final_Bigdata_pret
 
Data and Information.docx
Data and Information.docxData and Information.docx
Data and Information.docx
 
Big data by Ravi .pdf
Big data by Ravi .pdfBig data by Ravi .pdf
Big data by Ravi .pdf
 
Unit III.pdf
Unit III.pdfUnit III.pdf
Unit III.pdf
 
Introduction to big data
Introduction to big dataIntroduction to big data
Introduction to big data
 
Big data intro.pptx
Big data intro.pptxBig data intro.pptx
Big data intro.pptx
 
Your organization and Big Data: Managing access, privacy, and security
Your organization and Big Data: Managing access, privacy, and securityYour organization and Big Data: Managing access, privacy, and security
Your organization and Big Data: Managing access, privacy, and security
 
Big Data.pptx
Big Data.pptxBig Data.pptx
Big Data.pptx
 
The 5 key V's of Big Data
The 5 key V's of Big DataThe 5 key V's of Big Data
The 5 key V's of Big Data
 
Intro to big data and applications - day 1
Intro to big data and applications - day 1Intro to big data and applications - day 1
Intro to big data and applications - day 1
 
Big data
Big dataBig data
Big data
 
Data Structure and Types
Data Structure and TypesData Structure and Types
Data Structure and Types
 
23 ijcse-01238-1indhunisha
23 ijcse-01238-1indhunisha23 ijcse-01238-1indhunisha
23 ijcse-01238-1indhunisha
 
Big data analytics with Apache Hadoop
Big data analytics with Apache  HadoopBig data analytics with Apache  Hadoop
Big data analytics with Apache Hadoop
 
Metadata and Analytics
Metadata and AnalyticsMetadata and Analytics
Metadata and Analytics
 
introduction to data science
introduction to data scienceintroduction to data science
introduction to data science
 
Security issues in big data
Security issues in big data Security issues in big data
Security issues in big data
 
A beginner's guide to Big data
A beginner's guide to Big dataA beginner's guide to Big data
A beginner's guide to Big data
 
Big Data Mining - Classification, Techniques and Issues
Big Data Mining - Classification, Techniques and IssuesBig Data Mining - Classification, Techniques and Issues
Big Data Mining - Classification, Techniques and Issues
 

More from Datacademy.ai

More from Datacademy.ai (16)

Learn Polymorphism in Python with Examples.pdf
Learn Polymorphism in Python with Examples.pdfLearn Polymorphism in Python with Examples.pdf
Learn Polymorphism in Python with Examples.pdf
 
Why Monitoring and Logging are Important in DevOps.pdf
Why Monitoring and Logging are Important in DevOps.pdfWhy Monitoring and Logging are Important in DevOps.pdf
Why Monitoring and Logging are Important in DevOps.pdf
 
AWS data storage Amazon S3, Amazon RDS.pdf
AWS data storage Amazon S3, Amazon RDS.pdfAWS data storage Amazon S3, Amazon RDS.pdf
AWS data storage Amazon S3, Amazon RDS.pdf
 
Top 30+ Latest AWS Certification Interview Questions on AWS BI and data visua...
Top 30+ Latest AWS Certification Interview Questions on AWS BI and data visua...Top 30+ Latest AWS Certification Interview Questions on AWS BI and data visua...
Top 30+ Latest AWS Certification Interview Questions on AWS BI and data visua...
 
Top 50 Ansible Interview Questions And Answers in 2023.pdf
Top 50 Ansible Interview Questions And Answers in 2023.pdfTop 50 Ansible Interview Questions And Answers in 2023.pdf
Top 50 Ansible Interview Questions And Answers in 2023.pdf
 
Interview Questions on AWS Elastic Compute Cloud (EC2).pdf
Interview Questions on AWS Elastic Compute Cloud (EC2).pdfInterview Questions on AWS Elastic Compute Cloud (EC2).pdf
Interview Questions on AWS Elastic Compute Cloud (EC2).pdf
 
50 Extraordinary AWS CloudWatch Interview Questions & Answers.pdf
50 Extraordinary AWS CloudWatch Interview Questions & Answers.pdf50 Extraordinary AWS CloudWatch Interview Questions & Answers.pdf
50 Extraordinary AWS CloudWatch Interview Questions & Answers.pdf
 
Top 30+ Latest AWS Certification Interview Questions on AWS BI & Data Visuali...
Top 30+ Latest AWS Certification Interview Questions on AWS BI & Data Visuali...Top 30+ Latest AWS Certification Interview Questions on AWS BI & Data Visuali...
Top 30+ Latest AWS Certification Interview Questions on AWS BI & Data Visuali...
 
Top 60 Power BI Interview Questions and Answers for 2023.pdf
Top 60 Power BI Interview Questions and Answers for 2023.pdfTop 60 Power BI Interview Questions and Answers for 2023.pdf
Top 60 Power BI Interview Questions and Answers for 2023.pdf
 
Top 100+ Google Data Science Interview Questions.pdf
Top 100+ Google Data Science Interview Questions.pdfTop 100+ Google Data Science Interview Questions.pdf
Top 100+ Google Data Science Interview Questions.pdf
 
AWS DevOps: Introduction to DevOps on AWS
  AWS DevOps: Introduction to DevOps on AWS  AWS DevOps: Introduction to DevOps on AWS
AWS DevOps: Introduction to DevOps on AWS
 
Data Engineering.pdf
Data Engineering.pdfData Engineering.pdf
Data Engineering.pdf
 
Top 140+ Advanced SAS Interview Questions and Answers.pdf
Top 140+ Advanced SAS Interview Questions and Answers.pdfTop 140+ Advanced SAS Interview Questions and Answers.pdf
Top 140+ Advanced SAS Interview Questions and Answers.pdf
 
50 Extraordinary AWS CloudWatch Interview Questions & Answers.pdf
50 Extraordinary AWS CloudWatch Interview Questions & Answers.pdf50 Extraordinary AWS CloudWatch Interview Questions & Answers.pdf
50 Extraordinary AWS CloudWatch Interview Questions & Answers.pdf
 
Top 60+ Data Warehouse Interview Questions and Answers.pdf
Top 60+ Data Warehouse Interview Questions and Answers.pdfTop 60+ Data Warehouse Interview Questions and Answers.pdf
Top 60+ Data Warehouse Interview Questions and Answers.pdf
 
Top Most Python Interview Questions.pdf
Top Most Python Interview Questions.pdfTop Most Python Interview Questions.pdf
Top Most Python Interview Questions.pdf
 

Recently uploaded

1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
QucHHunhnh
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
kauryashika82
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
heathfieldcps1
 

Recently uploaded (20)

Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptx
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
ICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptx
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.ppt
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
Role Of Transgenic Animal In Target Validation-1.pptx
Role Of Transgenic Animal In Target Validation-1.pptxRole Of Transgenic Animal In Target Validation-1.pptx
Role Of Transgenic Animal In Target Validation-1.pptx
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
 
Micro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfMicro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdf
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SD
 
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
 
Sociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning ExhibitSociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning Exhibit
 
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17  How to Extend Models Using Mixin ClassesMixin Classes in Odoo 17  How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
 
Asian American Pacific Islander Month DDSD 2024.pptx
Asian American Pacific Islander Month DDSD 2024.pptxAsian American Pacific Islander Month DDSD 2024.pptx
Asian American Pacific Islander Month DDSD 2024.pptx
 
Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdf
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdf
 
Energy Resources. ( B. Pharmacy, 1st Year, Sem-II) Natural Resources
Energy Resources. ( B. Pharmacy, 1st Year, Sem-II) Natural ResourcesEnergy Resources. ( B. Pharmacy, 1st Year, Sem-II) Natural Resources
Energy Resources. ( B. Pharmacy, 1st Year, Sem-II) Natural Resources
 
Unit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxUnit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptx
 

Characteristics of Big Data Understanding the Five V.pdf

  • 1. www.datacademy.ai Knowledge world Characteristics of Big Data: Understanding the Five V’s Volume, Velocity, Variety, Veracity, Value 5V’s of Big Data History It started in the year 2001 with 3 V’s, namely Volume, Velocity and Variety. Then Veracity got added, making it 4 V’s. Then Value got added, making it 5V’s. Later came 8Vs, 10Vs etc. We will discuss on the important ones (5V’s) Volume, Velocity, Variety, Veracity, and Value. 1) Volume It refers to the size of Big Data. Data can be considered Big Data or not is based on the volume. The rapidly increasing volume data is due to cloud-computing traffic, IoT, mobile traffic etc.
  • 2. www.datacademy.ai Knowledge world Data growth prediction 2) Velocity It refers to the speed at which the data is getting accumulated. This is mainly due to IoTs, mobile data, social media etc. In the year 2000, Google was receiving 32.8 million searches per day. As for 2018, Google was receiving 5.6 billion searches per day! Approximate monthly active users as of 2018: Facebook: 2.41 billion Instagram: 1 billion Twitter: 320 million LinkedIn: 575 million Facebook monthly active users growth since 2008 3) Variety It refers to Structured, Semi-structured and Unstructured data due to different sources of data generated either by humans or by machines.
  • 3. www.datacademy.ai Knowledge world Structured data: It’s the traditional data which is organized and conforms to the formal structure of data. This data can be stored in a relational database. Example: Bank statement containing date, time, amount etc. Semi-structured data: It’s semi-organized data. It doesn’t conform to the formal structure of data. Example: Log files, JSON files, Sensor data, csv files etc. Unstructured data: It’s not an organized data and doesn’t fit into rows and columns structure of a relational database. Example: Text files, Emails, images, videos, voicemails, audio files etc. 4) Veracity It refers to the assurance of quality/integrity/credibility/accuracy of the data. Since the data is collected from multiple sources, we need to check the data for accuracy before using it for business insights. 5) Value Just because we collected lots of Data, it’s of no value unless we garner some insights out of it. Value refers to how useful the data is in decision making. We need to extract the value of the Big Data using proper analytics. What are the other V’s? Viscosity (complexity or degree of correlation), Variability (inconsistency in data flow), Volatility (durability or how long time data is valid and how long it should be stored), Viability (capability to be live and active), Validity (understandable to find the hidden relationships).