SlideShare a Scribd company logo
What’s The Difference
Between Structured,
and Unstructured Data
Semi-Structured,
© 2019 Bernard Marr, Bernard Marr & Co. All rights reserved
Title
Text
IntroductionIntroduction
When a conversation turns to analytics or big data, the terms structured, semi-
structured and unstructured might get bandied about. These are classifications
of data that are now important to understand with the rapid increase of semi-
structured and unstructured data today as well as the development of tools
that make managing and analysing these classes of data possible. Here’s what
you need to know.
What’s The Difference Between Structured,
Semi-Structured And Unstructured Data?
© 2019 Bernard Marr, Bernard Marr & Co. All rights reserved
Structured Data
Data that is the easiest to search and organize, because it is usually contained in rows and
columns and its elements can be mapped into fixed pre-defined fields, is known as
structured data. Think about what data you might store in an Excel spreadsheet and you
have an example of structured data. Structured data can follow a data model a database
designer creates - think of sales records by region, by product or by customer. In
structured data, entities can be grouped together to form relations (‘customers’ that are
also ‘satisfied with the service). This makes structured data easy to store, analyse and
search and until recently was the only data easily usable for businesses. Today, most
estimate structured data accounts for less than 20 percent of all data.
Often structured data is managed using Structured Query Language (SQL)—a
programming software language developed by IBM in the 1970s for relational databases.
Structured data can be created by machines and humans. Examples of structured data
include financial data such as accounting transactions, address details, demographic
information, star ratings by customers, machines logs, location data from smart phones
and smart devices, etc.
© 2019 Bernard Marr, Bernard Marr & Co. All rights reserved
Unstructured Data
A much bigger percentage of all the data is our world is unstructured data. Unstructured
data is data that cannot be contained in a row-column database and doesn’t have an
associated data model. Think of the text of an email message. The lack of structure made
unstructured data more difficult to search, manage and analyse, which is why companies
have widely discarded unstructured data, until the recent proliferation of artificial
intelligence and machine learning algorithms made it easier to process.
Other examples of unstructured data include photos, video and audio files, text files, social
media content, satellite imagery, presentations, PDFs, open-ended survey responses,
websites and call centre transcripts/recordings.
Instead of spreadsheets or relational databases, unstructured data is usually stored in data
lakes, NoSQL databases, applications and data warehouses. The wealth of information in
unstructured data is now accessible and can be automatically processed with artificial
intelligence algorithms today. This technology has elevated unstructured data to an
extremely valuable resource for organizations.
© 2019 Bernard Marr, Bernard Marr & Co. All rights reserved
Semi-Structured Data
Beyond structured and unstructured data, there is a third category, which basically is a mix
between both of them. The type of data defined as semi-structured data has some
defining or consistent characteristics but doesn’t conform to a structure as rigid as is
expected with a relational database. Therefore, there are some organizational properties
such as semantic tags or metadata to make it easier to organize, but there’s still fluidity in
the data.
Email messages are a good example. While the actual content is unstructured, it does
contain structured data such as name and email address of sender and recipient, time
sent, etc. Another example is a digital photograph. The image itself is unstructured, but if
the photo was taken on a smart phone, for example, it would be date and time stamped,
geo tagged, and would have a device ID. Once stored, the photo could also be given tags
that would provide a structure, such as ‘dog’ or ‘pet.’
A lot of what people would usually classify as unstructured data is indeed semi-structured,
because it contains some classifying characteristics.
© 2019 Bernard Marr, Bernard Marr & Co. All rights reserved
The Difference Between Structured, Unstructured,
And Semi-Structured Data
To easily understand the differences between the classifications of data, let’s use this
analogy to illustrate. When interviewing for a job, let’s say there are three different
classifications of interviews: structured, semi-structured and unstructured.
In a structured interview, the interviewer follows a strict script that was defined by the
human resources department and is followed for every candidate. Another form of
interview is an unstructured interview. In an unstructured interview, it is entirely up to the
interviewer to determine the questions and the order they will be asked (or even if they
will be asked) for every candidate. A semi-structured interview takes elements from both
structured and unstructured interview classifications. It uses the consistency and
quantitative elements allowed with the structured interview but offers the freedom to
customize based on the circumstances that are more in line with an unstructured
interview.
So, for data, structured data is easily organizable and follows a rigid format; unstructured
is complex and often qualitative information that is impossible to reduce to or organize in
a relational database and semi-structured data has elements of both.
© 2017 Bernard Marr , Bernard Marr & Co. All rights reserved
© 2018 Bernard Marr, Bernard Marr & Co. All rights reserved
Bernard Marr is an internationally best-selling author, popular keynote speaker, futurist, and a
strategic business & technology advisor to governments and companies. He helps
organisations improve their business performance, use data more intelligently, and
understand the implications of new technologies such as artificial intelligence, big data,
blockchains, and the Internet of Things.
LinkedIn has ranked Bernard as one of the world’s top 5 business influencers. He is a frequent
contributor to the World Economic Forum and writes a regular column for Forbes. Every day
Bernard actively engages his 1.5 million social media followers and shares content that
reaches millions of readers.
Visit The
Website
© 2017 Bernard Marr , Bernard Marr & Co. All rights reserved
© 2019 Bernard Marr, Bernard Marr & Co. All rights reserved
Bernard Marr is an internationally best-selling author, popular keynote speaker, futurist, and a
strategic business & technology advisor to governments and companies. He helps
organisations improve their business performance, use data more intelligently, and
understand the implications of new technologies such as artificial intelligence, big data,
blockchains, and the Internet of Things.
LinkedIn has ranked Bernard as one of the world’s top 5 business influencers. He is a frequent
contributor to the World Economic Forum and writes a regular column for Forbes. Every day
Bernard actively engages his 1.5 million social media followers and shares content that
reaches millions of readers.
Visit The
Website
Title
Subtitle
Be the FIRST to receive news,
articles, insights and event
updates from Bernard Marr & Co
straight to your inbox.
Signing up is EASY! Simply fill out
the online form and we’ll be in
touch!
© 2018 Bernard Marr, Bernard Marr & Co. All rights reserved
BernardMarr
hello@bernardmarr.com
www.bernardmarr.com

More Related Content

What's hot

Data mining slides
Data mining slidesData mining slides
Data mining slides
smj
 

What's hot (20)

3. mining frequent patterns
3. mining frequent patterns3. mining frequent patterns
3. mining frequent patterns
 
introduction to data mining tutorial
introduction to data mining tutorial introduction to data mining tutorial
introduction to data mining tutorial
 
Data mining
Data miningData mining
Data mining
 
Data preparation
Data preparationData preparation
Data preparation
 
Data preprocessing in Data Mining
Data preprocessing in Data MiningData preprocessing in Data Mining
Data preprocessing in Data Mining
 
Data Cleaning Techniques
Data Cleaning TechniquesData Cleaning Techniques
Data Cleaning Techniques
 
Dynamic Itemset Counting
Dynamic Itemset CountingDynamic Itemset Counting
Dynamic Itemset Counting
 
OLAP
OLAPOLAP
OLAP
 
What is "data"?
What is "data"?What is "data"?
What is "data"?
 
5.2 mining time series data
5.2 mining time series data5.2 mining time series data
5.2 mining time series data
 
Data Modeling PPT
Data Modeling PPTData Modeling PPT
Data Modeling PPT
 
multi dimensional data model
multi dimensional data modelmulti dimensional data model
multi dimensional data model
 
Data mining slides
Data mining slidesData mining slides
Data mining slides
 
DATA WAREHOUSING
DATA WAREHOUSINGDATA WAREHOUSING
DATA WAREHOUSING
 
Metadata ppt
Metadata pptMetadata ppt
Metadata ppt
 
Normalization in DBMS
Normalization in DBMSNormalization in DBMS
Normalization in DBMS
 
Data mining techniques unit 1
Data mining techniques  unit 1Data mining techniques  unit 1
Data mining techniques unit 1
 
Data cubes
Data cubesData cubes
Data cubes
 
Data mining , Knowledge Discovery Process, Classification
Data mining , Knowledge Discovery Process, ClassificationData mining , Knowledge Discovery Process, Classification
Data mining , Knowledge Discovery Process, Classification
 
Oltp vs olap
Oltp vs olapOltp vs olap
Oltp vs olap
 

Similar to What’s The Difference Between Structured, Semi-Structured And Unstructured Data?

Types of Big Data.pptx
Types of Big Data.pptxTypes of Big Data.pptx
Types of Big Data.pptx
varun453331
 

Similar to What’s The Difference Between Structured, Semi-Structured And Unstructured Data? (20)

What Is Unstructured Data And Why Is It So Important To Businesses?
What Is Unstructured Data And Why Is It So Important To Businesses?What Is Unstructured Data And Why Is It So Important To Businesses?
What Is Unstructured Data And Why Is It So Important To Businesses?
 
Intro to big data and applications - day 1
Intro to big data and applications - day 1Intro to big data and applications - day 1
Intro to big data and applications - day 1
 
introduction to data science
introduction to data scienceintroduction to data science
introduction to data science
 
Chapter 1 big data
Chapter 1 big dataChapter 1 big data
Chapter 1 big data
 
Converting Big Data To Smart Data | The Step-By-Step Guide!
Converting Big Data To Smart Data | The Step-By-Step Guide!Converting Big Data To Smart Data | The Step-By-Step Guide!
Converting Big Data To Smart Data | The Step-By-Step Guide!
 
Types of Big Data.pptx
Types of Big Data.pptxTypes of Big Data.pptx
Types of Big Data.pptx
 
big data.pptx
big data.pptxbig data.pptx
big data.pptx
 
Unit No2 Introduction to big data.pdf
Unit No2 Introduction to big data.pdfUnit No2 Introduction to big data.pdf
Unit No2 Introduction to big data.pdf
 
Data set module 1
Data set   module 1Data set   module 1
Data set module 1
 
For the Love of Big Data
For the Love of Big DataFor the Love of Big Data
For the Love of Big Data
 
Bda assignment can also be used for BDA notes and concept understanding.
Bda assignment can also be used for BDA notes and concept understanding.Bda assignment can also be used for BDA notes and concept understanding.
Bda assignment can also be used for BDA notes and concept understanding.
 
Unit III.pdf
Unit III.pdfUnit III.pdf
Unit III.pdf
 
A Primer for a layman about Big Data, Business Analytics and Cloud
A Primer for a layman  about Big Data, Business Analytics and CloudA Primer for a layman  about Big Data, Business Analytics and Cloud
A Primer for a layman about Big Data, Business Analytics and Cloud
 
Data set Introduction to Big Data
Data set   Introduction to Big DataData set   Introduction to Big Data
Data set Introduction to Big Data
 
Data Literacy.docx
Data Literacy.docxData Literacy.docx
Data Literacy.docx
 
Semantic Web Mining of Un-structured Data: Challenges and Opportunities
Semantic Web Mining of Un-structured Data: Challenges and OpportunitiesSemantic Web Mining of Un-structured Data: Challenges and Opportunities
Semantic Web Mining of Un-structured Data: Challenges and Opportunities
 
Big data privacy and inconsistency issues
Big data privacy and inconsistency issuesBig data privacy and inconsistency issues
Big data privacy and inconsistency issues
 
Introduction to Big Data
Introduction to Big DataIntroduction to Big Data
Introduction to Big Data
 
INTRODUCTION TO BIG DATA AND HADOOP
INTRODUCTION TO BIG DATA AND HADOOPINTRODUCTION TO BIG DATA AND HADOOP
INTRODUCTION TO BIG DATA AND HADOOP
 
The future of big data analytics
The future of big data analyticsThe future of big data analytics
The future of big data analytics
 

More from Bernard Marr

More from Bernard Marr (20)

The Top 4 Telecom Trends In 2023
The Top 4 Telecom Trends In 2023The Top 4 Telecom Trends In 2023
The Top 4 Telecom Trends In 2023
 
How To Use Meta’s Horizon Workrooms For Business
How To Use Meta’s Horizon Workrooms For BusinessHow To Use Meta’s Horizon Workrooms For Business
How To Use Meta’s Horizon Workrooms For Business
 
The Top 5 Healthcare Trends In 2023
The Top 5 Healthcare Trends In 2023The Top 5 Healthcare Trends In 2023
The Top 5 Healthcare Trends In 2023
 
The Top 5 In-Demand Tech Skills For Jobs In 2023
The Top 5 In-Demand Tech Skills For Jobs In 2023The Top 5 In-Demand Tech Skills For Jobs In 2023
The Top 5 In-Demand Tech Skills For Jobs In 2023
 
Policing In The Metaverse: What’s Happening Now
Policing In The Metaverse: What’s Happening NowPolicing In The Metaverse: What’s Happening Now
Policing In The Metaverse: What’s Happening Now
 
Banking In The Metaverse – The Next Frontier For Financial Services
Banking In The Metaverse – The Next Frontier For Financial Services Banking In The Metaverse – The Next Frontier For Financial Services
Banking In The Metaverse – The Next Frontier For Financial Services
 
The 7 Biggest Business Challenges Every Company Is Facing In 2023
The 7 Biggest Business Challenges Every Company Is Facing In 2023The 7 Biggest Business Challenges Every Company Is Facing In 2023
The 7 Biggest Business Challenges Every Company Is Facing In 2023
 
Is This The Downfall Of Meta And Social Media As We Know It?
Is This The Downfall Of Meta And Social Media As We Know It?Is This The Downfall Of Meta And Social Media As We Know It?
Is This The Downfall Of Meta And Social Media As We Know It?
 
The Top Five Cybersecurity Trends In 2023
The Top Five Cybersecurity Trends In 2023The Top Five Cybersecurity Trends In 2023
The Top Five Cybersecurity Trends In 2023
 
The Top 5 Technology Challenges In 2023
The Top 5 Technology Challenges In 2023The Top 5 Technology Challenges In 2023
The Top 5 Technology Challenges In 2023
 
How To Build A Positive Hybrid And Remote Working Culture In 2023
How To Build A Positive Hybrid And Remote Working Culture In 2023How To Build A Positive Hybrid And Remote Working Culture In 2023
How To Build A Positive Hybrid And Remote Working Culture In 2023
 
Beyond Dashboards: The Future Of Analytics And Business Intelligence?
Beyond Dashboards: The Future Of Analytics And Business Intelligence? Beyond Dashboards: The Future Of Analytics And Business Intelligence?
Beyond Dashboards: The Future Of Analytics And Business Intelligence?
 
The Top 5 Data Science And Analytics Trends In 2023
The Top 5 Data Science And Analytics Trends In 2023The Top 5 Data Science And Analytics Trends In 2023
The Top 5 Data Science And Analytics Trends In 2023
 
The 5 Biggest Business Trends For 2023
The 5 Biggest Business Trends For 2023The 5 Biggest Business Trends For 2023
The 5 Biggest Business Trends For 2023
 
12 Practical Steps To Handle Change At Work
12 Practical Steps To Handle Change At Work 12 Practical Steps To Handle Change At Work
12 Practical Steps To Handle Change At Work
 
The Top 12 Virtual Networking Tips To Boost Your Career
The Top 12 Virtual Networking Tips To Boost Your CareerThe Top 12 Virtual Networking Tips To Boost Your Career
The Top 12 Virtual Networking Tips To Boost Your Career
 
How AI And Machine Learning Will Impact The Future Of Healthcare
How AI And Machine Learning Will Impact The Future Of HealthcareHow AI And Machine Learning Will Impact The Future Of Healthcare
How AI And Machine Learning Will Impact The Future Of Healthcare
 
Top 16 Essential Soft Skills For The Future of Work
Top 16 Essential Soft Skills For The Future of WorkTop 16 Essential Soft Skills For The Future of Work
Top 16 Essential Soft Skills For The Future of Work
 
Artificial Intelligence And The Future Of Marketing
Artificial Intelligence And The Future Of MarketingArtificial Intelligence And The Future Of Marketing
Artificial Intelligence And The Future Of Marketing
 
Is AI Really a Job Killer? These Experts Say No
Is AI Really a Job Killer? These Experts Say NoIs AI Really a Job Killer? These Experts Say No
Is AI Really a Job Killer? These Experts Say No
 

Recently uploaded

一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单
enxupq
 
一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单
enxupq
 
standardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghhstandardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghh
ArpitMalhotra16
 
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
ewymefz
 
Computer Presentation.pptx ecommerce advantage s
Computer Presentation.pptx ecommerce advantage sComputer Presentation.pptx ecommerce advantage s
Computer Presentation.pptx ecommerce advantage s
MAQIB18
 
一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单
ocavb
 
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
ewymefz
 
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
yhkoc
 
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
nscud
 
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
ewymefz
 

Recently uploaded (20)

一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单
 
tapal brand analysis PPT slide for comptetive data
tapal brand analysis PPT slide for comptetive datatapal brand analysis PPT slide for comptetive data
tapal brand analysis PPT slide for comptetive data
 
Webinar One View, Multiple Systems No-Code Integration of Salesforce and ERPs
Webinar One View, Multiple Systems No-Code Integration of Salesforce and ERPsWebinar One View, Multiple Systems No-Code Integration of Salesforce and ERPs
Webinar One View, Multiple Systems No-Code Integration of Salesforce and ERPs
 
一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单
 
Supply chain analytics to combat the effects of Ukraine-Russia-conflict
Supply chain analytics to combat the effects of Ukraine-Russia-conflictSupply chain analytics to combat the effects of Ukraine-Russia-conflict
Supply chain analytics to combat the effects of Ukraine-Russia-conflict
 
standardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghhstandardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghh
 
Slip-and-fall Injuries: Top Workers' Comp Claims
Slip-and-fall Injuries: Top Workers' Comp ClaimsSlip-and-fall Injuries: Top Workers' Comp Claims
Slip-and-fall Injuries: Top Workers' Comp Claims
 
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
 
Computer Presentation.pptx ecommerce advantage s
Computer Presentation.pptx ecommerce advantage sComputer Presentation.pptx ecommerce advantage s
Computer Presentation.pptx ecommerce advantage s
 
Q1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year ReboundQ1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year Rebound
 
2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...
2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...
2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...
 
一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单
 
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
 
Using PDB Relocation to Move a Single PDB to Another Existing CDB
Using PDB Relocation to Move a Single PDB to Another Existing CDBUsing PDB Relocation to Move a Single PDB to Another Existing CDB
Using PDB Relocation to Move a Single PDB to Another Existing CDB
 
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
 
Uber Ride Supply Demand Gap Analysis Report
Uber Ride Supply Demand Gap Analysis ReportUber Ride Supply Demand Gap Analysis Report
Uber Ride Supply Demand Gap Analysis Report
 
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
 
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
 

What’s The Difference Between Structured, Semi-Structured And Unstructured Data?

  • 1. What’s The Difference Between Structured, and Unstructured Data Semi-Structured,
  • 2. © 2019 Bernard Marr, Bernard Marr & Co. All rights reserved Title Text IntroductionIntroduction When a conversation turns to analytics or big data, the terms structured, semi- structured and unstructured might get bandied about. These are classifications of data that are now important to understand with the rapid increase of semi- structured and unstructured data today as well as the development of tools that make managing and analysing these classes of data possible. Here’s what you need to know. What’s The Difference Between Structured, Semi-Structured And Unstructured Data?
  • 3. © 2019 Bernard Marr, Bernard Marr & Co. All rights reserved Structured Data Data that is the easiest to search and organize, because it is usually contained in rows and columns and its elements can be mapped into fixed pre-defined fields, is known as structured data. Think about what data you might store in an Excel spreadsheet and you have an example of structured data. Structured data can follow a data model a database designer creates - think of sales records by region, by product or by customer. In structured data, entities can be grouped together to form relations (‘customers’ that are also ‘satisfied with the service). This makes structured data easy to store, analyse and search and until recently was the only data easily usable for businesses. Today, most estimate structured data accounts for less than 20 percent of all data. Often structured data is managed using Structured Query Language (SQL)—a programming software language developed by IBM in the 1970s for relational databases. Structured data can be created by machines and humans. Examples of structured data include financial data such as accounting transactions, address details, demographic information, star ratings by customers, machines logs, location data from smart phones and smart devices, etc.
  • 4. © 2019 Bernard Marr, Bernard Marr & Co. All rights reserved Unstructured Data A much bigger percentage of all the data is our world is unstructured data. Unstructured data is data that cannot be contained in a row-column database and doesn’t have an associated data model. Think of the text of an email message. The lack of structure made unstructured data more difficult to search, manage and analyse, which is why companies have widely discarded unstructured data, until the recent proliferation of artificial intelligence and machine learning algorithms made it easier to process. Other examples of unstructured data include photos, video and audio files, text files, social media content, satellite imagery, presentations, PDFs, open-ended survey responses, websites and call centre transcripts/recordings. Instead of spreadsheets or relational databases, unstructured data is usually stored in data lakes, NoSQL databases, applications and data warehouses. The wealth of information in unstructured data is now accessible and can be automatically processed with artificial intelligence algorithms today. This technology has elevated unstructured data to an extremely valuable resource for organizations.
  • 5. © 2019 Bernard Marr, Bernard Marr & Co. All rights reserved Semi-Structured Data Beyond structured and unstructured data, there is a third category, which basically is a mix between both of them. The type of data defined as semi-structured data has some defining or consistent characteristics but doesn’t conform to a structure as rigid as is expected with a relational database. Therefore, there are some organizational properties such as semantic tags or metadata to make it easier to organize, but there’s still fluidity in the data. Email messages are a good example. While the actual content is unstructured, it does contain structured data such as name and email address of sender and recipient, time sent, etc. Another example is a digital photograph. The image itself is unstructured, but if the photo was taken on a smart phone, for example, it would be date and time stamped, geo tagged, and would have a device ID. Once stored, the photo could also be given tags that would provide a structure, such as ‘dog’ or ‘pet.’ A lot of what people would usually classify as unstructured data is indeed semi-structured, because it contains some classifying characteristics.
  • 6. © 2019 Bernard Marr, Bernard Marr & Co. All rights reserved The Difference Between Structured, Unstructured, And Semi-Structured Data To easily understand the differences between the classifications of data, let’s use this analogy to illustrate. When interviewing for a job, let’s say there are three different classifications of interviews: structured, semi-structured and unstructured. In a structured interview, the interviewer follows a strict script that was defined by the human resources department and is followed for every candidate. Another form of interview is an unstructured interview. In an unstructured interview, it is entirely up to the interviewer to determine the questions and the order they will be asked (or even if they will be asked) for every candidate. A semi-structured interview takes elements from both structured and unstructured interview classifications. It uses the consistency and quantitative elements allowed with the structured interview but offers the freedom to customize based on the circumstances that are more in line with an unstructured interview. So, for data, structured data is easily organizable and follows a rigid format; unstructured is complex and often qualitative information that is impossible to reduce to or organize in a relational database and semi-structured data has elements of both.
  • 7. © 2017 Bernard Marr , Bernard Marr & Co. All rights reserved © 2018 Bernard Marr, Bernard Marr & Co. All rights reserved Bernard Marr is an internationally best-selling author, popular keynote speaker, futurist, and a strategic business & technology advisor to governments and companies. He helps organisations improve their business performance, use data more intelligently, and understand the implications of new technologies such as artificial intelligence, big data, blockchains, and the Internet of Things. LinkedIn has ranked Bernard as one of the world’s top 5 business influencers. He is a frequent contributor to the World Economic Forum and writes a regular column for Forbes. Every day Bernard actively engages his 1.5 million social media followers and shares content that reaches millions of readers. Visit The Website © 2017 Bernard Marr , Bernard Marr & Co. All rights reserved © 2019 Bernard Marr, Bernard Marr & Co. All rights reserved Bernard Marr is an internationally best-selling author, popular keynote speaker, futurist, and a strategic business & technology advisor to governments and companies. He helps organisations improve their business performance, use data more intelligently, and understand the implications of new technologies such as artificial intelligence, big data, blockchains, and the Internet of Things. LinkedIn has ranked Bernard as one of the world’s top 5 business influencers. He is a frequent contributor to the World Economic Forum and writes a regular column for Forbes. Every day Bernard actively engages his 1.5 million social media followers and shares content that reaches millions of readers. Visit The Website
  • 8. Title Subtitle Be the FIRST to receive news, articles, insights and event updates from Bernard Marr & Co straight to your inbox. Signing up is EASY! Simply fill out the online form and we’ll be in touch! © 2018 Bernard Marr, Bernard Marr & Co. All rights reserved