1. The document discusses key concepts related to database systems including the definition of a database, database management systems (DBMS), data models, database classification, data integrity, query optimization, structured query language (SQL), parallel databases, and object-relational mapping (ORM).
2. It provides details on common data models like hierarchical, network, and relational models. It also describes concepts like database architecture, data definition language, data manipulation language, and distributed databases.
3. Control questions are provided at the end to test understanding of database concepts like the difference between a database and data set, components of a database system, and main elements of a database.
2. Lecture content
1. Concept of DBMS
2. Database System
3. Database classification
4. Normalization
5. Data integrity
6. Optimizing Queries and their processing
7. Fundamentals of SQL
8. The parallel databases
9. Technology of programming of ORM
10. Control questions
3. Concept of DBMS
The database refers to a named set of structured data related to a certain
subject area.
The DB is stored and
processed on a computer
system.
The data in the database is
well structured, i.e. it
highlights the main elements
of the types and relationships
between the elements, as
well as the restrictions on the
permitted operations.
It provides data search and
data processing.
4. Database System
The Database System is a computerized system of structured data. Its main
purpose is to store information and provide it on demand.
Data Hardware
Software Users
Database
5. A database is a set of data that has a regular
structure and that is organized in such a way that
a computer can easily find the desired
information.
Well-known DBMSs include MySQL,
PostgreSQL, EnterpriseDB, MongoDB, MariaDB,
Microsoft SQL Server, Oracle, Sybase, SAP HANA,
MemSQL, SQLite and IBM DB2.
6. What Is Database Architecture?
Database architecture focuses on the design,
development, implementation and maintenance of
computer programs that store and organize information
for businesses, agencies and institutions. A database
architect develops and implements software to meet the
needs of users.
7. Data models
A data model is an abstract model that organizes
elements of data and standardizes how they relate to
one another and to properties of the real world entities.
8. Database System
The hierarchical data model.
Director
Head of Sales
Department
Head of PR
Department
Head of Service
Department
Employee Employee Employee Employee Employee
Benefits:
- simplicity;
- minimum memory
consumption.
Disadvantages:
- lack of flexibility ;
- an exceptional
navigation principle
of access to the data;
- access to the data
through the root
member only.
9. Database System
Network data model
Benefits:
- universality;
- ability to access
the data through
multiple values
relations.
-
Disadvantages:
- complexity;
- validity only for
the navigational
principle of
accessing the data.
Employee 1 Employee 2 Employee 3 Employee 4
Project 1 Project 2 Project 3
Customer 2 Customer 3 Customer 4
Customer 1
10. Database System
Relational data model Benefits:
- simplicity.
- theoretical
justification.
- data independence.
Disadvantages:
- low speed.
- large memory
consumption.
StudentID LastName FirstName MiddleName GroupID
1 Kazakov Petr Vladimirovich 1
2 Vasilyev Ivan Arkadievich 2
4 Shishkina Daria Sergeevna 1
GroupID Supervisor
1 Tsarev S.M.
2 Pestov D.N.
15. Data integrity is the maintenance of, and the assurance of the
accuracy and consistency of, data over its entire life-cycle, and is a critical
aspect to the design, implementation and usage of any system which stores,
processes, or retrieves data.
16. 4 Types of Data Integrity
• Entity integrity
• Referential integrity
• Domain integrity
• User-defined integrity
17. Optimizing Queries and their processing
To prepare queries using various DBMS, two main languages for query
description are most often used:
• The QBE (QueryByExample);
• SQL (StructuredQueryLanguage);
Data processing includes a set of various functions and operations that can be
divided into sodium groups:
• search, filter and sort data;
• queries to the database;
• mechanism for implementing events, rules (triggers), and procedures in the
database.
18. The Structured Query Language is one of the fundamental
building blocks of modern database architecture. SQL defines the
methods used to create and manipulate relational databases on all
major platforms.
Fundamentals of SQL
19. The Data Definition Language (DDL) contains the commands used to create and
destroy databases and database objects. After the database structure is defined
with DDL, database administrators and users can use the Data Manipulation
Language (DML) to insert, retrieve and modify the data contained within it.
DDL AND DML
20. CREATE. Installing a database management system on
a computer allows you to create and manage many
independent databases.
DATA DEFINITION LANGUAGE COMMANDS
21. USE. The USE command allows you to specify the database you want to work with
within your DBMS.
ALTER. Once you've created a table within a database, you may want to modify its
definition. The ALTER command allows you to make changes to the structure of a
table without deleting and recreating it.
22. DROP. DROP allows us to remove entire database objects from
our DBMS.
23. INSERT. The INSERT command in SQL is used to add records to an
existing table.
SELECT. It allows database users to retrieve the specific information they
desire from an operational database.
DATA MANIPULATION
LANGUAGE COMMANDS
24. UPDATE. The UPDATE command can be used to modify the
information contained within a table, either in bulk or individually.
25. DELETE. You'll find that the syntax of this command is similar
to that of the other DML commands.
26. A parallel DBMS can be defined as a DBMS implementation for a
multiprocessor computer.
In the database software, there can be three types of parallelism
inherent in data intensive applications:
• Inter-request parallelism;
• intra-query parallelism;
• intra-operation parallelism.
The parallel databases
27. Inter-request concurrency assumes simultaneous execution of multiple requests related to different
transactions
The parallel databases
SQL compilation
+
implementation
28. By intra-query parallelism is meant simultaneous execution of several
operations (for example, sampling operations) related to the same request
The parallel databases
SQL compilation
+
implementation
Operation 1
…
Operation n
29. The concept of intra-operative parallelism means the parallel execution of one operation as a set
of sub-operations, in addition to fragmentation of data, also fragmentation of functions
The parallel databases
Sub-operation 1
…
Sub-operation n
Operation k
SQL compilation
+
implementation
30. Design of databases — process of creation of the database
scheme and determination of necessary integrity constraints.
Development of database is a proven way to automate the
organization process in order to achieve maximum efficiency and
profitability of activities.
Design and development of databases
31. Object-relational mapping (ORM) in computer science is a
programming technique for converting data between incompatible type
systems using object-oriented programming languages.
Technology of programming of ORM
32. A distributed database (DDB) is a collection of a set of interrelated
databases distributed in a computer network.
The distributed database
33. Horizontal fragmentation is realized with the help of selection
operations, which directs each tuple of the relation to one of the
sections, guided by the fragmentation predicate.
The distributed database
Tuple 1
Tuple 2
…
Tuple n
Tuple k
…
34. With vertical fragmentation, the ratio is divided into sections using the
projection operation.
The distributed database
Tuple
35. A heterogeneous database
• A heterogeneous database is an automated (or semi-automated)
system for the integration of heterogeneous, disparate database
management systems to present a user with a single, unified query
interface.
• A parallel database system seeks to improve performance through
parallelization of various operations, such as loading data, building
indexes and evaluating queries.
36. Control Questions
What is meant by the database? What is the difference
between a database and a simple set of data?
What is the database system? What are its components?
What are the main elements of the database?
What is the data model? Name the existing data models.
What are the differences between them?
37. References
• Information and Communication Technologies Part 1: Textbook/
D.Shynybekov. - Almaty: IITU, 2017. - 588 p.
• Information and Communication Technologies Part 2: Textbook/
D.Shynybekov. - Almaty: IITU, 2017. - 624 p
• Sheth, Amit P.; James A. Larson (September 1990). "Federated Database
Systems for Managing Distributed, Heterogeneous, and Autonomous
Databases" (PDF). ACM Computing Surveys. 22 (3): 183–236.
• Lorenzo Cantoni (University of Lugano, Switzerland) James A. Danowski
(University of Illinois at Chicago, IL, USA) Communication and
Technology, 576 pages.
• Craig Van Slyke Information Communication Technologies: Concepts,
Methodologies, Tools, and Applications (6 Volumes). ISBN13:
9781599049496, 2008, Pages: 4288
• Utelbaeva A.K.,Utelbaeva A.K. Study guide for lectures on discipline
“Computer science”, Shimkent 2008, 84 pages.