SlideShare a Scribd company logo
Data(base) taxonomy
Dejan Radić
02.04.2021.
Taxonomy !?
Taxonomy is the practice and science of classification of things or concepts, including
the principles that underlie such classification.
Terminology
Data are facts and statistics collected for reference or analysis.
Data model is an abstract or conceptual model that organizes elements of data and
standardizes how they relate to one another and to the properties of real-world
entities. It is used in both to define domain model, as well as its metamodel. It differs
from physical model which defines a way data is stored on storage media.
Database is an organized collection of data, generally stored and accessed
electronically from a computer system.
DBMS (Database Management System) is a software system that enables users to
define, create, maintain and control access to the database. DBMS that supports
multiple data models is called a multi-model DBMS.
High-level data types
Data types differ from data classes or categories
Something is in class or category, but of type
Data classifications
Data of all data models can be divided into the following classes:
By temporal value:
Real-time
Stale
By general source:
Machine-generated
Human
By level of abstraction:
Data
Metadata
By structure:
Unstructured
Semi-structured
Structured
By sensitivity:
Public
Internal-only
Confidential
Restricted
DBMS classifications
By accessibility:
Online
Local
By distribution:
Centralized
Distributed
Homogenous
Heterogenous
By read-write purpose:
OLAP (Online Analytical Processing)
OLTP (Online Transaction Processing)
By indexing:
Indexed
Unindexed
By query language:
SQL (standardized)
NoSQL
By storage medium:
In-memory (RAM)
On disc (HDD, SSD)
By schema existence:
Has schema
Schemaless
Data models
Tabular
Hierarchical
Relational
Associative
Textual
Dimensional
Time-series
Graph
Spatial
Multimedia
Hybrid
Tabular
Data presented as a plain-text single table
Considered to be structured
Usually unindexed
Used for data transfer to indexed DBMSes
Relational algebra enabled (SQL !?)
Usual format: CSV/DSV
Implementations: Flat-file, Excel
DBMS: Berkeley DB
Hierarchical
Organized as tree-like structure (parent -< child)
Child contains link to parent (usually a unique identifier)
Each child has only one parent
Created by IBM in 1960s
Considered as semi-structured data
Suitable for both machine and human generated data
Usually distributed DBMSes
NoSQL (XPath, XQuery, JSON)
Usual format: XML, JSON, YAML, BSON
Implementations: Document-oriented, XML data store
DBMS: MarkLogic, MongoDB
Hierarchical > Document-oriented
Considered to be associative (document identifier)
Difference from plain associative model – filtering/restriction
Aggregate data model (DDD)
Direct object mapping
Collections belong to a database
Documents belong to collections
Document contains multiple fields/documents
DB, Collection, Field names - metadata
Relational
Data presented as tuples grouped into relations/tables
Relations consists of heading and body
Foreign keys between relations/tables
Each relation has primary key
Most popular data model
Usually SQL query language supported
First described by Codd in 1969
DBMS: Oracle, SQL Server, MySQL
Associative
Associative array, dictionary, hash table
Collection of values, objects or records
Values are usually unstructured or raw data
Identifier is a unique key
Search (index) enabled only by key (equality, wildcard)
Keys can represent hierarchy: /folder/subfolder/file
NoSQL
Used for caching (In-memory)
Usually distributed
Implementations: Key-value store
DBMS: Redis, Riak, Memcached
Textual
Data can be both machine and human generated
Usually indexed - inverted index
Working like search engines - FTS
Unstructured data (including multimedia)
NoSQL
Centralized and distributed
Implementation: Search Engine, Content store
DBMS: Solr, Elasticsearch
Dimensional
Data presented with multiple dimensions - cube
(R)OLAP – Business Intelligence
Data warehouse
Fact and dimension table
Structured and indexed data
Usually centralized and in-memory
MDX queries (Not exactly SQL !?)
DBMS: MS Analysis Services
Time-series
Series of data points listed in chronological order
Presenting discrete data points
Append(current_timestamp, value)
High transaction volumes
Statistical queries (aggregation with time dimension)
Structured and indexed
Usually distributed
Mostly machine-generated data
DBMS: InfluxDB, Riak-TS, TimescaleDB
Graph
Graph structures with nodes and edges
Superset of hierarchical
Successor of early network model
Nodes and edges have fields
NoSQL (graph traversal)
Mostly indexed and centralized
Implementations: Triplestores/RDF store
DBMS: Neo4j
Spatial data
Data which represents objects defined in geometric space
Geospatial data - GIS
Vector and raster data
Point, Line, Polygon
Spatial query examples:
Distance
Intersection
Centralized and indexed
DBMS: Postgres + PostGIS
Multimedia
Sub-classes of multimedia data:
Graphic (vectors) – time independent
Image (pixels) – time independent
Audio (sound) – time dependent
Video (combination) – time dependent
Time dependent serving - streaming
Multimedia data is considered unstructured
Multimedia search
Different media formats (BMP, JPEG, GIF, PNG…)
Hybrid
Database with multiple models – multi-model DB
Polyglot persistence – maintaining consistency !?
Document + Graph
Relational + Hierarchical
Goes with association/identifier
XML and JSON columns
Object-relational
Relational + Textual - FTS
Associative
Spatial – Geo types
Spatiotabular and spatiotemporal
Column-family
Combining: associative, tabular, hierarchical
Column-oriented
Sparse table
Google Big Table, Cassandra
Conclusion
Data model level of abstraction
DBMS choice defines data models too
Structured data
Has schema – metadata
Has better query capabilities – SQL
Semi-structured data
Usually associated with NoSQL
Hierarchical – XML, JSON
Unstructured data
Multimedia and text
Better organization requires more energy
Multi-model vs. polyglot persistence
Thank you for your attention!
Questions?

More Related Content

What's hot

Types dbms
Types dbmsTypes dbms
Types dbms
Avnish Shaw
 
Stomata & Their Type
Stomata & Their TypeStomata & Their Type
Stomata & Their Type
Pankaj Kukreti
 
Dbms models
Dbms modelsDbms models
Dbms models
devgocool
 
DBMS architecture &; system structure
DBMS architecture &; system  structureDBMS architecture &; system  structure
DBMS architecture &; system structure
RUpaliLohar
 
11 Database Concepts
11 Database Concepts11 Database Concepts
11 Database Concepts
Praveen M Jigajinni
 
Molecular Analysis Of SAM & RAM
Molecular Analysis Of SAM & RAMMolecular Analysis Of SAM & RAM
Molecular Analysis Of SAM & RAM
Tilak I S
 
Flora, Revision and Monograph
Flora, Revision and  MonographFlora, Revision and  Monograph
Flora, Revision and Monograph
Avishek Bhattacharjee
 
Artificial system of classification
Artificial system of classificationArtificial system of classification
Artificial system of classification
K.V.N. Naik Arts, Commerce and Science College Nashik
 
Features of biological databases
Features of biological databasesFeatures of biological databases
Features of biological databases
Charu Sharma
 
Data Mining and Data Warehousing
Data Mining and Data WarehousingData Mining and Data Warehousing
Data Mining and Data Warehousing
Aswathy S Nair
 
DATABASE MANAGEMENT SYSTEM
DATABASE MANAGEMENT SYSTEMDATABASE MANAGEMENT SYSTEM
DATABASE MANAGEMENT SYSTEM
Mahmud Hasan Tanvir
 
Data base management system
Data base management systemData base management system
Data base management system
ashirafzal1
 
Metadata ppt
Metadata pptMetadata ppt
Metadata ppt
Shashikant Kumar
 
Data warehousing
Data warehousingData warehousing
Data warehousing
Anshika Nigam
 
Dbms classification according to data models
Dbms classification according to data modelsDbms classification according to data models
Dbms classification according to data models
ABDUL KHALIQ
 
Data Warehouse Concepts | Data Warehouse Tutorial | Data Warehousing | Edureka
Data Warehouse Concepts | Data Warehouse Tutorial | Data Warehousing | EdurekaData Warehouse Concepts | Data Warehouse Tutorial | Data Warehousing | Edureka
Data Warehouse Concepts | Data Warehouse Tutorial | Data Warehousing | Edureka
Edureka!
 
Data base management system
Data base management systemData base management system
Data base management system
Navneet Jingar
 
Big data by Mithlesh sadh
Big data by Mithlesh sadhBig data by Mithlesh sadh
Big data by Mithlesh sadh
Mithlesh Sadh
 
Nucleic acid and protein databanks
Nucleic acid and protein databanksNucleic acid and protein databanks
Nucleic acid and protein databanks
NithyaNandapal
 
DNA data bank of japan (DDBJ)
DNA data bank of japan (DDBJ)DNA data bank of japan (DDBJ)
DNA data bank of japan (DDBJ)
ZoufishanY
 

What's hot (20)

Types dbms
Types dbmsTypes dbms
Types dbms
 
Stomata & Their Type
Stomata & Their TypeStomata & Their Type
Stomata & Their Type
 
Dbms models
Dbms modelsDbms models
Dbms models
 
DBMS architecture &; system structure
DBMS architecture &; system  structureDBMS architecture &; system  structure
DBMS architecture &; system structure
 
11 Database Concepts
11 Database Concepts11 Database Concepts
11 Database Concepts
 
Molecular Analysis Of SAM & RAM
Molecular Analysis Of SAM & RAMMolecular Analysis Of SAM & RAM
Molecular Analysis Of SAM & RAM
 
Flora, Revision and Monograph
Flora, Revision and  MonographFlora, Revision and  Monograph
Flora, Revision and Monograph
 
Artificial system of classification
Artificial system of classificationArtificial system of classification
Artificial system of classification
 
Features of biological databases
Features of biological databasesFeatures of biological databases
Features of biological databases
 
Data Mining and Data Warehousing
Data Mining and Data WarehousingData Mining and Data Warehousing
Data Mining and Data Warehousing
 
DATABASE MANAGEMENT SYSTEM
DATABASE MANAGEMENT SYSTEMDATABASE MANAGEMENT SYSTEM
DATABASE MANAGEMENT SYSTEM
 
Data base management system
Data base management systemData base management system
Data base management system
 
Metadata ppt
Metadata pptMetadata ppt
Metadata ppt
 
Data warehousing
Data warehousingData warehousing
Data warehousing
 
Dbms classification according to data models
Dbms classification according to data modelsDbms classification according to data models
Dbms classification according to data models
 
Data Warehouse Concepts | Data Warehouse Tutorial | Data Warehousing | Edureka
Data Warehouse Concepts | Data Warehouse Tutorial | Data Warehousing | EdurekaData Warehouse Concepts | Data Warehouse Tutorial | Data Warehousing | Edureka
Data Warehouse Concepts | Data Warehouse Tutorial | Data Warehousing | Edureka
 
Data base management system
Data base management systemData base management system
Data base management system
 
Big data by Mithlesh sadh
Big data by Mithlesh sadhBig data by Mithlesh sadh
Big data by Mithlesh sadh
 
Nucleic acid and protein databanks
Nucleic acid and protein databanksNucleic acid and protein databanks
Nucleic acid and protein databanks
 
DNA data bank of japan (DDBJ)
DNA data bank of japan (DDBJ)DNA data bank of japan (DDBJ)
DNA data bank of japan (DDBJ)
 

Similar to Data(base) taxonomy

TAMUC LO 8
TAMUC LO 8TAMUC LO 8
Introduction of big data unit 1
Introduction of big data unit 1Introduction of big data unit 1
Introduction of big data unit 1
RojaT4
 
Dbms Lec Uog 02
Dbms Lec Uog 02Dbms Lec Uog 02
Dbms Lec Uog 02
smelltulip
 
Lecture01 257
Lecture01 257Lecture01 257
Lecture01 257
hansamurli
 
DATABASE Lecture 1 and 2.pptx
DATABASE Lecture 1 and 2.pptxDATABASE Lecture 1 and 2.pptx
DATABASE Lecture 1 and 2.pptx
RUBAB79
 
Database Basics Theory
Database Basics TheoryDatabase Basics Theory
Database Basics Theory
sunmitraeducation
 
DBMS an Example
DBMS an ExampleDBMS an Example
DBMS an Example
Dr. C.V. Suresh Babu
 
Cdocumentsandsettingsuser1desktop2 dbmsexamples-091012013049-phpapp01
Cdocumentsandsettingsuser1desktop2 dbmsexamples-091012013049-phpapp01Cdocumentsandsettingsuser1desktop2 dbmsexamples-091012013049-phpapp01
Cdocumentsandsettingsuser1desktop2 dbmsexamples-091012013049-phpapp01
Raza Baloch
 
Ch-1-Introduction-to-Database.pdf
Ch-1-Introduction-to-Database.pdfCh-1-Introduction-to-Database.pdf
Ch-1-Introduction-to-Database.pdf
MrjJoker1
 
Databases and its representation
Databases and its representationDatabases and its representation
Databases and its representation
Ruhull
 
Chapter 05 pertemuan 7- donpas - manajemen data
Chapter 05 pertemuan 7- donpas - manajemen dataChapter 05 pertemuan 7- donpas - manajemen data
Chapter 05 pertemuan 7- donpas - manajemen data
UNIVERSITAS TEKNOKRAT INDONESIA
 
SQL (Scratch to Advance).pptx
SQL (Scratch to Advance).pptxSQL (Scratch to Advance).pptx
SQL (Scratch to Advance).pptx
Hitesh670643
 
Database management system
Database management systemDatabase management system
Database management system
khagendrabasnet4
 
M.sc. engg (ict) admission guide database management system 4
M.sc. engg (ict) admission guide   database management system 4M.sc. engg (ict) admission guide   database management system 4
M.sc. engg (ict) admission guide database management system 4
Syed Ariful Islam Emon
 
Database systems Handbook by Muhammad Sharif.pdf
Database systems Handbook by Muhammad Sharif.pdfDatabase systems Handbook by Muhammad Sharif.pdf
Database systems Handbook by Muhammad Sharif.pdf
Bahria University Islamabad, Pakistan
 
Database systems Handbook by Muhammad Sharif.pdf
Database systems Handbook by Muhammad Sharif.pdfDatabase systems Handbook by Muhammad Sharif.pdf
Database systems Handbook by Muhammad Sharif.pdf
Bahria University Islamabad, Pakistan
 
Database systems Handbook by Muhammad Sharif.pdf
Database systems Handbook by Muhammad Sharif.pdfDatabase systems Handbook by Muhammad Sharif.pdf
Database systems Handbook by Muhammad Sharif.pdf
Bahria University Islamabad, Pakistan
 
Database systems Handbook by Muhammad Sharif.pdf
Database systems Handbook by Muhammad Sharif.pdfDatabase systems Handbook by Muhammad Sharif.pdf
Database systems Handbook by Muhammad Sharif.pdf
Bahria University Islamabad, Pakistan
 
Database systems Handbook by Muhammad Sharif.pdf
Database systems Handbook by Muhammad Sharif.pdfDatabase systems Handbook by Muhammad Sharif.pdf
Database systems Handbook by Muhammad Sharif.pdf
Bahria University Islamabad, Pakistan
 
Database systems Handbook.pdf
Database systems Handbook.pdfDatabase systems Handbook.pdf
Database systems Handbook.pdf
Bahria University Islamabad, Pakistan
 

Similar to Data(base) taxonomy (20)

TAMUC LO 8
TAMUC LO 8TAMUC LO 8
TAMUC LO 8
 
Introduction of big data unit 1
Introduction of big data unit 1Introduction of big data unit 1
Introduction of big data unit 1
 
Dbms Lec Uog 02
Dbms Lec Uog 02Dbms Lec Uog 02
Dbms Lec Uog 02
 
Lecture01 257
Lecture01 257Lecture01 257
Lecture01 257
 
DATABASE Lecture 1 and 2.pptx
DATABASE Lecture 1 and 2.pptxDATABASE Lecture 1 and 2.pptx
DATABASE Lecture 1 and 2.pptx
 
Database Basics Theory
Database Basics TheoryDatabase Basics Theory
Database Basics Theory
 
DBMS an Example
DBMS an ExampleDBMS an Example
DBMS an Example
 
Cdocumentsandsettingsuser1desktop2 dbmsexamples-091012013049-phpapp01
Cdocumentsandsettingsuser1desktop2 dbmsexamples-091012013049-phpapp01Cdocumentsandsettingsuser1desktop2 dbmsexamples-091012013049-phpapp01
Cdocumentsandsettingsuser1desktop2 dbmsexamples-091012013049-phpapp01
 
Ch-1-Introduction-to-Database.pdf
Ch-1-Introduction-to-Database.pdfCh-1-Introduction-to-Database.pdf
Ch-1-Introduction-to-Database.pdf
 
Databases and its representation
Databases and its representationDatabases and its representation
Databases and its representation
 
Chapter 05 pertemuan 7- donpas - manajemen data
Chapter 05 pertemuan 7- donpas - manajemen dataChapter 05 pertemuan 7- donpas - manajemen data
Chapter 05 pertemuan 7- donpas - manajemen data
 
SQL (Scratch to Advance).pptx
SQL (Scratch to Advance).pptxSQL (Scratch to Advance).pptx
SQL (Scratch to Advance).pptx
 
Database management system
Database management systemDatabase management system
Database management system
 
M.sc. engg (ict) admission guide database management system 4
M.sc. engg (ict) admission guide   database management system 4M.sc. engg (ict) admission guide   database management system 4
M.sc. engg (ict) admission guide database management system 4
 
Database systems Handbook by Muhammad Sharif.pdf
Database systems Handbook by Muhammad Sharif.pdfDatabase systems Handbook by Muhammad Sharif.pdf
Database systems Handbook by Muhammad Sharif.pdf
 
Database systems Handbook by Muhammad Sharif.pdf
Database systems Handbook by Muhammad Sharif.pdfDatabase systems Handbook by Muhammad Sharif.pdf
Database systems Handbook by Muhammad Sharif.pdf
 
Database systems Handbook by Muhammad Sharif.pdf
Database systems Handbook by Muhammad Sharif.pdfDatabase systems Handbook by Muhammad Sharif.pdf
Database systems Handbook by Muhammad Sharif.pdf
 
Database systems Handbook by Muhammad Sharif.pdf
Database systems Handbook by Muhammad Sharif.pdfDatabase systems Handbook by Muhammad Sharif.pdf
Database systems Handbook by Muhammad Sharif.pdf
 
Database systems Handbook by Muhammad Sharif.pdf
Database systems Handbook by Muhammad Sharif.pdfDatabase systems Handbook by Muhammad Sharif.pdf
Database systems Handbook by Muhammad Sharif.pdf
 
Database systems Handbook.pdf
Database systems Handbook.pdfDatabase systems Handbook.pdf
Database systems Handbook.pdf
 

More from Dejan Radic

A Tale of Two Worlds: Real World and On-chain World
A Tale of Two Worlds: Real World and On-chain WorldA Tale of Two Worlds: Real World and On-chain World
A Tale of Two Worlds: Real World and On-chain World
Dejan Radic
 
Technical challenges of RWA Tokenization
Technical challenges of RWA TokenizationTechnical challenges of RWA Tokenization
Technical challenges of RWA Tokenization
Dejan Radic
 
Sta su to Blockchain, Crypto i Web3?
Sta su to Blockchain, Crypto i Web3?Sta su to Blockchain, Crypto i Web3?
Sta su to Blockchain, Crypto i Web3?
Dejan Radic
 
Privacy-enhancing technologies and Blockchain
Privacy-enhancing technologies and BlockchainPrivacy-enhancing technologies and Blockchain
Privacy-enhancing technologies and Blockchain
Dejan Radic
 
Blockchain beyond DeFi
Blockchain beyond DeFiBlockchain beyond DeFi
Blockchain beyond DeFi
Dejan Radic
 
Paillier Cryptosystem
Paillier CryptosystemPaillier Cryptosystem
Paillier Cryptosystem
Dejan Radic
 
Da li su Vasi podaci sigurni u Cloud-u?
Da li su Vasi podaci sigurni u Cloud-u?Da li su Vasi podaci sigurni u Cloud-u?
Da li su Vasi podaci sigurni u Cloud-u?
Dejan Radic
 
Internal and external positioning in mobile and web applications
Internal and external positioning in mobile and web applicationsInternal and external positioning in mobile and web applications
Internal and external positioning in mobile and web applications
Dejan Radic
 
Abstract Factory pattern application on multi-contract on-chain deployments
Abstract Factory pattern application on multi-contract on-chain deploymentsAbstract Factory pattern application on multi-contract on-chain deployments
Abstract Factory pattern application on multi-contract on-chain deployments
Dejan Radic
 
Ethereum Intro
Ethereum IntroEthereum Intro
Ethereum Intro
Dejan Radic
 
Influence of schema-less approach on database authorization
Influence of schema-less approach on database authorizationInfluence of schema-less approach on database authorization
Influence of schema-less approach on database authorization
Dejan Radic
 
Initial sprint velocity problem
Initial sprint velocity problemInitial sprint velocity problem
Initial sprint velocity problem
Dejan Radic
 

More from Dejan Radic (12)

A Tale of Two Worlds: Real World and On-chain World
A Tale of Two Worlds: Real World and On-chain WorldA Tale of Two Worlds: Real World and On-chain World
A Tale of Two Worlds: Real World and On-chain World
 
Technical challenges of RWA Tokenization
Technical challenges of RWA TokenizationTechnical challenges of RWA Tokenization
Technical challenges of RWA Tokenization
 
Sta su to Blockchain, Crypto i Web3?
Sta su to Blockchain, Crypto i Web3?Sta su to Blockchain, Crypto i Web3?
Sta su to Blockchain, Crypto i Web3?
 
Privacy-enhancing technologies and Blockchain
Privacy-enhancing technologies and BlockchainPrivacy-enhancing technologies and Blockchain
Privacy-enhancing technologies and Blockchain
 
Blockchain beyond DeFi
Blockchain beyond DeFiBlockchain beyond DeFi
Blockchain beyond DeFi
 
Paillier Cryptosystem
Paillier CryptosystemPaillier Cryptosystem
Paillier Cryptosystem
 
Da li su Vasi podaci sigurni u Cloud-u?
Da li su Vasi podaci sigurni u Cloud-u?Da li su Vasi podaci sigurni u Cloud-u?
Da li su Vasi podaci sigurni u Cloud-u?
 
Internal and external positioning in mobile and web applications
Internal and external positioning in mobile and web applicationsInternal and external positioning in mobile and web applications
Internal and external positioning in mobile and web applications
 
Abstract Factory pattern application on multi-contract on-chain deployments
Abstract Factory pattern application on multi-contract on-chain deploymentsAbstract Factory pattern application on multi-contract on-chain deployments
Abstract Factory pattern application on multi-contract on-chain deployments
 
Ethereum Intro
Ethereum IntroEthereum Intro
Ethereum Intro
 
Influence of schema-less approach on database authorization
Influence of schema-less approach on database authorizationInfluence of schema-less approach on database authorization
Influence of schema-less approach on database authorization
 
Initial sprint velocity problem
Initial sprint velocity problemInitial sprint velocity problem
Initial sprint velocity problem
 

Recently uploaded

Artificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopmentArtificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopment
Octavian Nadolu
 
Serial Arm Control in Real Time Presentation
Serial Arm Control in Real Time PresentationSerial Arm Control in Real Time Presentation
Serial Arm Control in Real Time Presentation
tolgahangng
 
Azure API Management to expose backend services securely
Azure API Management to expose backend services securelyAzure API Management to expose backend services securely
Azure API Management to expose backend services securely
Dinusha Kumarasiri
 
Finale of the Year: Apply for Next One!
Finale of the Year: Apply for Next One!Finale of the Year: Apply for Next One!
Finale of the Year: Apply for Next One!
GDSC PJATK
 
AWS Cloud Cost Optimization Presentation.pptx
AWS Cloud Cost Optimization Presentation.pptxAWS Cloud Cost Optimization Presentation.pptx
AWS Cloud Cost Optimization Presentation.pptx
HarisZaheer8
 
HCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAUHCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAU
panagenda
 
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdfUnlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Malak Abu Hammad
 
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with SlackLet's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
shyamraj55
 
Presentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of GermanyPresentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of Germany
innovationoecd
 
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success StoryDriving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Safe Software
 
Energy Efficient Video Encoding for Cloud and Edge Computing Instances
Energy Efficient Video Encoding for Cloud and Edge Computing InstancesEnergy Efficient Video Encoding for Cloud and Edge Computing Instances
Energy Efficient Video Encoding for Cloud and Edge Computing Instances
Alpen-Adria-Universität
 
Nordic Marketo Engage User Group_June 13_ 2024.pptx
Nordic Marketo Engage User Group_June 13_ 2024.pptxNordic Marketo Engage User Group_June 13_ 2024.pptx
Nordic Marketo Engage User Group_June 13_ 2024.pptx
MichaelKnudsen27
 
5th LF Energy Power Grid Model Meet-up Slides
5th LF Energy Power Grid Model Meet-up Slides5th LF Energy Power Grid Model Meet-up Slides
5th LF Energy Power Grid Model Meet-up Slides
DanBrown980551
 
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAUHCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
panagenda
 
TrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy SurveyTrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy Survey
TrustArc
 
Letter and Document Automation for Bonterra Impact Management (fka Social Sol...
Letter and Document Automation for Bonterra Impact Management (fka Social Sol...Letter and Document Automation for Bonterra Impact Management (fka Social Sol...
Letter and Document Automation for Bonterra Impact Management (fka Social Sol...
Jeffrey Haguewood
 
Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...
Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...
Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...
Jeffrey Haguewood
 
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development ProvidersYour One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
akankshawande
 
Skybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoptionSkybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoption
Tatiana Kojar
 
Fueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte WebinarFueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte Webinar
Zilliz
 

Recently uploaded (20)

Artificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopmentArtificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopment
 
Serial Arm Control in Real Time Presentation
Serial Arm Control in Real Time PresentationSerial Arm Control in Real Time Presentation
Serial Arm Control in Real Time Presentation
 
Azure API Management to expose backend services securely
Azure API Management to expose backend services securelyAzure API Management to expose backend services securely
Azure API Management to expose backend services securely
 
Finale of the Year: Apply for Next One!
Finale of the Year: Apply for Next One!Finale of the Year: Apply for Next One!
Finale of the Year: Apply for Next One!
 
AWS Cloud Cost Optimization Presentation.pptx
AWS Cloud Cost Optimization Presentation.pptxAWS Cloud Cost Optimization Presentation.pptx
AWS Cloud Cost Optimization Presentation.pptx
 
HCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAUHCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAU
 
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdfUnlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
 
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with SlackLet's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
 
Presentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of GermanyPresentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of Germany
 
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success StoryDriving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success Story
 
Energy Efficient Video Encoding for Cloud and Edge Computing Instances
Energy Efficient Video Encoding for Cloud and Edge Computing InstancesEnergy Efficient Video Encoding for Cloud and Edge Computing Instances
Energy Efficient Video Encoding for Cloud and Edge Computing Instances
 
Nordic Marketo Engage User Group_June 13_ 2024.pptx
Nordic Marketo Engage User Group_June 13_ 2024.pptxNordic Marketo Engage User Group_June 13_ 2024.pptx
Nordic Marketo Engage User Group_June 13_ 2024.pptx
 
5th LF Energy Power Grid Model Meet-up Slides
5th LF Energy Power Grid Model Meet-up Slides5th LF Energy Power Grid Model Meet-up Slides
5th LF Energy Power Grid Model Meet-up Slides
 
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAUHCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
 
TrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy SurveyTrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy Survey
 
Letter and Document Automation for Bonterra Impact Management (fka Social Sol...
Letter and Document Automation for Bonterra Impact Management (fka Social Sol...Letter and Document Automation for Bonterra Impact Management (fka Social Sol...
Letter and Document Automation for Bonterra Impact Management (fka Social Sol...
 
Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...
Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...
Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...
 
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development ProvidersYour One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
 
Skybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoptionSkybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoption
 
Fueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte WebinarFueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte Webinar
 

Data(base) taxonomy

  • 2. Taxonomy !? Taxonomy is the practice and science of classification of things or concepts, including the principles that underlie such classification.
  • 3. Terminology Data are facts and statistics collected for reference or analysis. Data model is an abstract or conceptual model that organizes elements of data and standardizes how they relate to one another and to the properties of real-world entities. It is used in both to define domain model, as well as its metamodel. It differs from physical model which defines a way data is stored on storage media. Database is an organized collection of data, generally stored and accessed electronically from a computer system. DBMS (Database Management System) is a software system that enables users to define, create, maintain and control access to the database. DBMS that supports multiple data models is called a multi-model DBMS.
  • 4. High-level data types Data types differ from data classes or categories Something is in class or category, but of type
  • 5. Data classifications Data of all data models can be divided into the following classes: By temporal value: Real-time Stale By general source: Machine-generated Human By level of abstraction: Data Metadata By structure: Unstructured Semi-structured Structured By sensitivity: Public Internal-only Confidential Restricted
  • 6. DBMS classifications By accessibility: Online Local By distribution: Centralized Distributed Homogenous Heterogenous By read-write purpose: OLAP (Online Analytical Processing) OLTP (Online Transaction Processing) By indexing: Indexed Unindexed By query language: SQL (standardized) NoSQL By storage medium: In-memory (RAM) On disc (HDD, SSD) By schema existence: Has schema Schemaless
  • 8. Tabular Data presented as a plain-text single table Considered to be structured Usually unindexed Used for data transfer to indexed DBMSes Relational algebra enabled (SQL !?) Usual format: CSV/DSV Implementations: Flat-file, Excel DBMS: Berkeley DB
  • 9. Hierarchical Organized as tree-like structure (parent -< child) Child contains link to parent (usually a unique identifier) Each child has only one parent Created by IBM in 1960s Considered as semi-structured data Suitable for both machine and human generated data Usually distributed DBMSes NoSQL (XPath, XQuery, JSON) Usual format: XML, JSON, YAML, BSON Implementations: Document-oriented, XML data store DBMS: MarkLogic, MongoDB
  • 10. Hierarchical > Document-oriented Considered to be associative (document identifier) Difference from plain associative model – filtering/restriction Aggregate data model (DDD) Direct object mapping Collections belong to a database Documents belong to collections Document contains multiple fields/documents DB, Collection, Field names - metadata
  • 11. Relational Data presented as tuples grouped into relations/tables Relations consists of heading and body Foreign keys between relations/tables Each relation has primary key Most popular data model Usually SQL query language supported First described by Codd in 1969 DBMS: Oracle, SQL Server, MySQL
  • 12. Associative Associative array, dictionary, hash table Collection of values, objects or records Values are usually unstructured or raw data Identifier is a unique key Search (index) enabled only by key (equality, wildcard) Keys can represent hierarchy: /folder/subfolder/file NoSQL Used for caching (In-memory) Usually distributed Implementations: Key-value store DBMS: Redis, Riak, Memcached
  • 13. Textual Data can be both machine and human generated Usually indexed - inverted index Working like search engines - FTS Unstructured data (including multimedia) NoSQL Centralized and distributed Implementation: Search Engine, Content store DBMS: Solr, Elasticsearch
  • 14. Dimensional Data presented with multiple dimensions - cube (R)OLAP – Business Intelligence Data warehouse Fact and dimension table Structured and indexed data Usually centralized and in-memory MDX queries (Not exactly SQL !?) DBMS: MS Analysis Services
  • 15. Time-series Series of data points listed in chronological order Presenting discrete data points Append(current_timestamp, value) High transaction volumes Statistical queries (aggregation with time dimension) Structured and indexed Usually distributed Mostly machine-generated data DBMS: InfluxDB, Riak-TS, TimescaleDB
  • 16. Graph Graph structures with nodes and edges Superset of hierarchical Successor of early network model Nodes and edges have fields NoSQL (graph traversal) Mostly indexed and centralized Implementations: Triplestores/RDF store DBMS: Neo4j
  • 17. Spatial data Data which represents objects defined in geometric space Geospatial data - GIS Vector and raster data Point, Line, Polygon Spatial query examples: Distance Intersection Centralized and indexed DBMS: Postgres + PostGIS
  • 18. Multimedia Sub-classes of multimedia data: Graphic (vectors) – time independent Image (pixels) – time independent Audio (sound) – time dependent Video (combination) – time dependent Time dependent serving - streaming Multimedia data is considered unstructured Multimedia search Different media formats (BMP, JPEG, GIF, PNG…)
  • 19. Hybrid Database with multiple models – multi-model DB Polyglot persistence – maintaining consistency !? Document + Graph Relational + Hierarchical Goes with association/identifier XML and JSON columns Object-relational Relational + Textual - FTS Associative Spatial – Geo types Spatiotabular and spatiotemporal Column-family Combining: associative, tabular, hierarchical Column-oriented Sparse table Google Big Table, Cassandra
  • 20. Conclusion Data model level of abstraction DBMS choice defines data models too Structured data Has schema – metadata Has better query capabilities – SQL Semi-structured data Usually associated with NoSQL Hierarchical – XML, JSON Unstructured data Multimedia and text Better organization requires more energy Multi-model vs. polyglot persistence
  • 21. Thank you for your attention! Questions?

Editor's Notes

  1. -General source + business generated data
  2. -Cryptography !?