3. HISTORY
It started in the year 2001
with 3 V’s, namely Volume,
Velocity, and Variety. Then
Veracity got added, making
it 4 V’s. Then Value got
added, making it 5 V’s. Later
came 8Vs, 10Vs, etc.
4. 1) Volume
• It refers to the size of Big Data.
• Data can be considered Big Data or
not, is based on the volume.
• The rapidly increasing volume of
data is due to cloud-computing
traffic, IoT, mobile traffic, etc.
5.
6. 2) Velocity
• It refers to the speed at which
the data is getting
accumulated.
• This is mainly due to IoTs,
mobile data, social media etc.
7. 3) Variety
It refers to Structured, Semi-
structured and Unstructured data
due to different sources of data
generated either by humans or by
machines.
8. • Structured data: It’s the traditional data that is organized and conforms to the formal
structure of data. This data can be stored in a relational database.
Example: Bank statement containing date, time, amount, etc.
• Semi-structured data: It’s semi-organized data. It doesn’t conform to the formal
structure of data.
Examples: Log files, JSON files, Sensor data, CSV files, etc.
• Unstructured data: It’s not organized data and doesn’t fit into the rows and
columns structure of a relational database.
Examples: Text files, Emails, images, videos, voicemails, audio files, etc.
9. 4) Veracity
• It refers to the assurance of
quality/integrity/credibility/accurac
y of the data.
• Since the data is collected from
multiple sources, we need to check
the data for accuracy before using it
for business insights.
10. 5) Value
• Just because we collected lots
of Data, it’s of no value unless
we garner some insights out of
it.
• Value refers to how useful the
data is in decision-making.
• We need to extract the value
of Big Data using proper
analytics.
11. 5) Value
• Just because we collected lots
of Data, it’s of no value unless
we garner some insights out of
it.
• Value refers to how useful the
data is in decision-making.
• We need to extract the value
of Big Data using proper
analytics.