What are existing data types?
2
3
4
5
What are existing database types?
6
Database Types (Relational - Structured)
7
Database Types (Relational - Structured)
8
• OLTP Characteristics
• Designed for data entry and manipulations
• Day to day business operation
• Normalized data
• Concerned with current data (real time data)
• Data stored in columns and rows (highly detailed)
• Flexible schema
• Most common Type
Database Types (Relational - Structured)
9
• OLAP Characteristics
• AKA Data warehouse
• Designed for data collection & analysis
• Fast
• Easy access to complex queries
• Concerned with historical/ statistical information
• Data is summarized
• Rigid schema (how the data is organized)
• Not usually concerned with individual transactions
Why SQL is not enough?
10
Why SQL is not enough?
11
Case Study
12
G-Flix Example
13
G-Flix Example
14
Data lakes
15
• Designed to capture raw data (structured, semi-
structured, and unstructured)
• Made for large amounts of data
• Uses ML and AI for analytics and processing
• It is not super-useable
• To be useful it is cleansed and organized into databases or data
warehouses.
16
Data lakes
No one option is better for all…
• No one is better than the other
• Depends on what kind of data you have and what you
want to do with it
• A company can have one or all options
18
Course Plan
• Data-Warehousing concepts and data modeling
• NoSQL Database Models
• Key-Value
• Column
• Document
• Graph
• In-Memory Databases
19
PART 1
DATA WAREHOUSING
20
What is Data Warehouse?
Is a database, or collection of databases, that
centralizes a business's information from
multiple sources and applications, and makes it
available for analytics and use across the
organization.
21
Data Warehouse Definition
22
Data Warehouse Architecture
Why do we need a DW?
23

Lecture 0 - Advanced DB.pdf

  • 1.
    What are existingdata types? 2
  • 2.
  • 3.
  • 4.
  • 5.
    What are existingdatabase types? 6
  • 6.
  • 7.
    Database Types (Relational- Structured) 8 • OLTP Characteristics • Designed for data entry and manipulations • Day to day business operation • Normalized data • Concerned with current data (real time data) • Data stored in columns and rows (highly detailed) • Flexible schema • Most common Type
  • 8.
    Database Types (Relational- Structured) 9 • OLAP Characteristics • AKA Data warehouse • Designed for data collection & analysis • Fast • Easy access to complex queries • Concerned with historical/ statistical information • Data is summarized • Rigid schema (how the data is organized) • Not usually concerned with individual transactions
  • 9.
    Why SQL isnot enough? 10
  • 10.
    Why SQL isnot enough? 11
  • 11.
  • 12.
  • 13.
  • 14.
  • 15.
    • Designed tocapture raw data (structured, semi- structured, and unstructured) • Made for large amounts of data • Uses ML and AI for analytics and processing • It is not super-useable • To be useful it is cleansed and organized into databases or data warehouses. 16 Data lakes
  • 16.
    No one optionis better for all… • No one is better than the other • Depends on what kind of data you have and what you want to do with it • A company can have one or all options 18
  • 17.
    Course Plan • Data-Warehousingconcepts and data modeling • NoSQL Database Models • Key-Value • Column • Document • Graph • In-Memory Databases 19
  • 18.
  • 19.
    What is DataWarehouse? Is a database, or collection of databases, that centralizes a business's information from multiple sources and applications, and makes it available for analytics and use across the organization. 21
  • 20.
  • 21.