5. May Venn Diagram helps us!
Tabular/
Relational/
RDBMS
Data
Big Data
6. May Venn Diagram helps us!
Dark Data
Tabular/
Relational/
RDBMS
Data
Big Data
7. May Venn Diagram helps us!
Dark Data
Tabular/
Relational/
RDBMS
Data
(Structured/Unstructured)
(Almost Unstructured)
(Structured)
Big Data
8. May Venn Diagram helps us!
Dark Data
Tabular/
Relational/
RDBMS
Data
(Structured/Unstructured)
(Almost Unstructured)
(Structured)
Big Data
Almost can’t be
processed or analyzed
9. Gartner defines dark data as the information assets
organizations collect, process and store during
regular business activities, but generally fail to use
for other purposes (for example, analytics, business
relationships and direct monetizing).
Dark Data Definition by Gartner
10. Gartner defines dark data as the information assets
organizations collect, process and store during
regular business activities, but generally fail to use
for other purposes (for example, analytics, business
relationships and direct monetizing).
Similar to dark matter in physics, dark data often
comprises most organizations’ universe of
information assets.
Dark Data Definition by Gartner
11. Gartner defines dark data as the information assets
organizations collect, process and store during
regular business activities, but generally fail to use
for other purposes (for example, analytics, business
relationships and direct monetizing).
Similar to dark matter in physics, dark data often
comprises most organizations’ universe of
information assets.
Thus, organizations often retain dark data for
compliance purposes only. Storing and securing
data typically incurs more expense (and sometimes
greater risk) than value.
Dark Data Definition by Gartner
12. Gartner defines dark data as the information assets
organizations collect, process and store during
regular business activities, but generally fail to use
for other purposes (for example, analytics, business
relationships and direct monetizing).
Similar to dark matter in physics, dark data often
comprises most organizations’ universe of
information assets.
Thus, organizations often retain dark data for
compliance purposes only. Storing and securing
data typically incurs more expense (and sometimes
greater risk) than value.
Dark Data Definition by Gartner
14. Dark Data - A more Sensible Definition
Organizations Generate
and Gather Data
15. Dark Data - A more Sensible Definition
Organizations Generate
and Gather Data
A large portion of the
collected data are
never even analyzed!
16. Dark Data - A more Sensible Definition
Organizations Generate
and Gather Data
A large portion of the
collected data are
never even analyzed!
90% of the data are
never analyzed
17. Dark Data - A more Sensible Definition
Organizations Generate
and Gather Data
A large portion of the
collected data are
never even analyzed!
90% of the data are
never analysed.
• Customer Information
• Log Files
• Previous Employee Information
• Previous Webpages
• Sensor Data
• Email Correspondences
• Account Information
• Notes or Presentations
• Old Versions of Relevant
Documents
28. Why there is so much of dark data?
• Lack of insight about data
• Lack of ambitions to improve
• Disconnect among departments
• Lopsided priorities
• Lack of technologies to Capture and Store
• Lack of resources/infrastructures to make it available
• Lack of CPU and technics to analyze the data
29. The issues you face with Dark Data
• Legal and Regulatory Issues
• Loss of Reputation
• Intelligence Risk
• Operation Costs
• Opportunity Costs
30. Some essential questions
• What can we gather?
• What may we extract from it?
• How we may prune it?
• How long should we keep it?
• What are the storage options?
• What are the processing options?
• How much is the value of each block of data
(Approximately)
• Running limited boundary scenarios