SlideShare a Scribd company logo
1 of 41
GileadSciences
Database Concepts
Page 2
Agenda
Page 3
What is data, information, & Database?
 Data is a collection of facts, such as values or measurements.
 It can be numbers, words, measurements, observations or
even just descriptions of things.
 Data is information or facts that are collected to give a bigger
picture of things.
 Information has meaning in some context for its receiver
 A database is a collection of information that is organized
so that it can easily be accessed, managed, and updated
Page 4
What is DBMS?
 DBMS is the data base management system which is
responsible for the storage of the data. Through this the
storage of data had been quite easy and the data that had
been stored can be easily modified and extracted too. This
in fact is the collection of the storing, executing, processing
and modifying the data.
 DBMS contains information about a particular enterprise
 Collection of interrelated data
 Set of programs to access the data
 An environment that is both convenient and efficient to use
Page 5
DBMS Applications
 Database Applications:
 Banking: transactions
 Airlines: reservations, schedules
 Universities: registration, grades
 Sales: customers, products, purchases
 Online retailers: order tracking, customized
recommendations
 Manufacturing: production, inventory, orders, supply chain
 Human resources: employee records, salaries, tax
deductions
Page 6
Drawbacks of using file systems
 Data redundancy and inconsistency
 Multiple file formats, duplication of information in different files
 Difficulty in accessing data
 Need to write a new program to carry out each new task
 Data isolation — multiple files and formats
 Integrity problems
 Integrity constraints (e.g., account balance > 0) become
“buried” in program code rather than being stated explicitly
 Hard to add new constraints or change existing ones
 Atomicity of updates
 Failures may leave database in an inconsistent state with partial
updates carried out
 Example: Transfer of funds from one account to another should
either complete or not happen at all
Page 7
Drawbacks of using file systems
 Concurrent access by multiple users
 Concurrent access needed for performance
 Uncontrolled concurrent accesses can lead to inconsistencies
 Example: Two people reading a balance (say 100) and updating
it by withdrawing money (say 50 each) at the same time
 Security problems
 Hard to provide user access to some, but not all, data
Database systems (DBMS) offer solutions to all the above
problems
Page 8
What is RDBMS?
 RDBMS is a database management system based on
relational model defined by E. F. Codd. Data is stored in the
form of rows and columns. The relations among tables are
also stored in the form of the table.
 Features: Provides data to be stored in tables
• Persists data in the form of rows and columns
• Provides facility primary key, to uniquely identify the rows
• Creates indexes for quicker data retrieval
• Provides a virtual table creation in which sensitive data can
be stored and simplified query can be applied.(views)
• Sharing a common column in two or more tables(primary key
and foreign key)
• Provides multi user accessibility that can be controlled by
individual users
Page 9
What are E. F. Codd rules?
Also called Codd's Law, a set of 13 rules used to determine if a
DBMS can be considered a relational DBMS (RDBMS). In
1985, Dr. E. F. Codd first published this list of rules that
became a standard way of evaluating a relational system.
The rules have since been expanded by others, and after
publishing the original article Codd stated that there are no
systems that will satisfy every rule. Following are Codd's
original 13 rules:
0. Often referred to as rule 0, this rule states that all
subsequent rules are based on the notion that in order for a
database to be considered relational, it must use its
relational facilities exclusively to manage the database.
1. The Information rule: All information in an RDBMS is
represented logically in just one way - by values in tables.
Page 10
What are E. F. Code rules?
2. The Guaranteed Access rule: Each item of data in an RDBMS
is guaranteed to be logically accessible by resorting to a
combination of table name, primary key value, and column
name.
3. The Systematic Treatment of Null Values rule: Null values
(distinct from an empty character string or a string of blank
characters and distinct from zero or any other number) are
supported in a fully relational DBMS for representing missing
information and inapplicable information in a systematic way,
independent of the data type.
4. The Dynamic Online Catalog Based on the Relational Model
rule: The database description is represented at the logical
level in the same way as ordinary data, so that authorized
users can apply the same relational language to its
interrogation as they apply to the regular data.
Page 11
What are E. F. Code rules?
5. The Comprehensive Data Sublanguage rule: A relational
system may support several languages and various modes of
terminal use (for example, the fill-in-blanks mode). However,
there must be at least one language whose statements are
expressible, per some well-defined syntax, as character strings
and whose ability to support all of the following is
comprehensible: data definition, view definition, data
manipulation (interactive and by program), integrity
constraints, and transaction boundaries (begin, commit, and
rollback).
6. The View updating rule: All views of the data which are
theoretically updatable must be updatable in practice by the
DBMS.
7. The High-level Insert, Update, and Delete rule: The capability
of handling a base relation or a derived relation as a single
operand applies not only to the retrieval of data but also to the
insertion, update, and deletion of data.
Page 12
What are E. F. Code rules?
8. The Physical Data Independence rule: Application programs
and terminal activities remain logically unimpaired whenever
any changes are made in either storage representations or
access methods.
9. The Logical Data Independence rule: Application programs
and terminal activities remain logically unimpaired when
information preserving changes of any kind that theoretically
permit un-impairment are made to the base tables.
10. The Integrity Independence rule: Integrity constraints must
be definable in the RDBMS sub-language and stored in the
system catalogue and not within individual application
programs.
Page 13
What are E. F. Code rules?
11. The Distribution Independence rule: An RDBMS has
distribution independence. Distribution independence implies
that users should not have to be aware of whether a database
is distributed.
12. The Non-subversion rule: If the database has any means of
handling a single record at a time, that low-level language must
not be able to subvert or avoid the integrity rules which are
expressed in a higher-level language that handles multiple
records at a time
Page 14
SQL
 SQL: widely used non-procedural language
• Example: Find the name of the instructor with ID 22222
select name
from instructor
where instructor.ID = ‘22222’
 Application programs generally access databases through
one of
• Language extensions to allow embedded SQL
• Application program interface (e.g., ODBC/JDBC) which allow SQL
queries to be sent to a database
 Two classes of languages
• Procedural – user specifies what data is required and how to get
those data
• Declarative (nonprocedural) – user specifies what data is required
without specifying how to get those data
Page 15
Data Manipulation Language (DML)
 Language for accessing and manipulating the data
organized by the appropriate data model
• DML also known as query language
Page 16
Data Definition Language (DDL)
 Specification notation for defining the database schema
 DDL compiler generates a set of table templates stored in a
data dictionary
 Data dictionary contains metadata (i.e., data about data)
• Database schema
• Integrity constraints
– Primary key (ID uniquely identifies instructors)
– Referential integrity (references constraint in SQL)
• Authorization
 Difference between Schema & User?
• HR Schema, Finance Schema, MFG…
Page 17
Database Objects
Object Description
Table Basic unit of storage; composed of rows
and columns
View Logically represents subsets of data from
one or more tables
Sequence Numeric value generator
Index Improves the performance of some queries
Synonym Gives alternative names to objects
Data Types
Data Type Description
VARCHAR2(size) Variable length character data (32767
characters)
CHAR(size) Fixed Length Character data up to 2000
characters
NUMBER(p,s)
Variable length numeric data
DATE Date and time values
LONG Variable length character data up to 2GB
CLOB Character data up to 4GB
Page 19
Data Model
A database model is a theory or specification describing how a
database is structured and used.
A collection of tools for describing
• Data
• Data relationships
• Data semantics
• Data constraints
Entity-Relationship data model
Object-based data models
Semi-structured data model (XML)
Other older models:
• Network model
• Hierarchical model
Page 20
Database Design
The process of designing the general structure of the
database:
Logical Design – Deciding on the database schema. Database
design requires that we find a “good” collection of relation
schemas.
• Business decision – What attributes should we record in the
database?
• Computer Science decision – What relation schemas should we
have and how should the attributes be distributed among the various
relation schemas?
Physical Design – Deciding on the physical layout of the
database
Design Approach
 Normalization Theory: Database
normalization is usually performed when
designing a relational database
 Entity Relationship Model: ERD is the
commonly used system of diagramming,
for visual representation of the relational
model
Normalization
 Objective
 Eliminating redundant data (for example, storing the
same data in more than one table)
 Ensuring data dependencies make sense (only
storing related data in a table).
 Normalization typically involves decomposing an
un-normalized table into two or more tables that,
were they to be combined (joined), would convey
exactly the same information as the original
table.
 You'll often see 1NF, 2NF, and 3NF along with the
occasional 4NF.
Normalization Example
First Form: For a table to be in first normal form, all repeating groups
must be moved to a new table. Consider the example in Figure, in which
several office location columns have been added to the EMPLOYEE
table.
Normalization Example
Second Form: Second normal form is a special-case normal form that
has to do with tables that have composite primary keys. A composite
primary key includes two or more columns. In second normal form, all
nonkey columns must depend on the entire primary key.
The EMPLOYEE_OFFICE_LOCATION table in Figure is in violation of
second normal form. For this table to comply with second normal form,
another table must be created and the OFFICE_PHONE_NUMBER data
must be moved to the new table.
Normalization Example
Third Form: Third normal form expands on second normal form. It
dictates that every nonkey column must be a detail or a fact about the
primary key. Figure illustrates a third normal form violation.
The addition of the DEPT_NAME column in this EMPLOYEE table
violates third normal form in that DEPT_NAME is dependent on the
DEPT_ID value, not the EMPLOYEE_ID value. Complying with the rules
of third normal form necessitates creating another table and moving the
department name values into this new table, with a foreign key reference
from the EMPLOYEE table to the new table.
Page 26
The Entity-Relationship Model
 Models an enterprise as a collection of entities and
relationships
• Entity: a “thing” or “object” in the enterprise that is distinguishable
from other objects, described by a set of attributes
• Attributes: All information about Entity
• Relationship: an association among several entities
 Represented diagrammatically by an entity-relationship
diagram:
Degrees of a Relationship
HusbandHusband WifeWife
Father ChildChild
BrotherBrother SisterSister
One-to-one (1:1)
One-to-many (1:n)
Many-to-many (n:m)
Many to One Mapping
CUSTOMER
TAKES
LOAN HAS REPAYMENT
C_ID C_NAME
LOAN_ID
LN_AMT
DT_OF_LOAN EMI_AMT
CHEQUE_ID
CHEQUE_DATE
CHEQUE_AMT
1 M
M
M
Page 29
Query Processing
1. Parsing and translation
2. Optimization
3. Evaluation
Page 32
Storage Management
 Storage manager is a program module that provides the
interface between the low-level data stored in the database
and the application programs and queries submitted to the
system.
 The storage manager is responsible to the following tasks:
• Interaction with the file manager
• Efficient storing, retrieving and updating of data
Page 33
Database Users and Administrators
Database
Page 34
Database System Internals
Page 35
Client Server Architecture
Oracle Products
 Oracle Databases
 Oracle Application Server
 Oracle applications
 Oracle Collaboration Suite
 Oracle Developer Suite
Oracle Database 10g: “g” Stands for Grid
 Oracle’s grid infrastructure:
• Low cost
• High quality of service
• Easy to manage
Automatic
Storage
Management
Real
Application
Clusters
Oracle
Streams
Enterprise
Manager
Grid Control
Storage
grid
Database
grid
Application
grid
Grid
control
Oracle RDBMS Architecture
There are three main components:
1. Instance
- Instance memory/System Global Area (SGA)
- Back ground process
2. Database
- Data files
- Control Files
- Redo log files
3. Other Components
- Other process
- Other memory (PGA)
- Other files
Oracle Database Architecture
Oracle Internet Platform
System Development Life Cycle (SDLC)
GileadSciences
Avi Silberschatz
Henry F. Korth
S. Sudarshan
Page 43
QUIZ
1. RDBMS?
Relational Database Management System
2. SQL?
Structured Query Language
3. Relational model defined by?
E. F. Codd
4. ER Model?
Entity-Relationship Model
5. Design Approach to reduce Duplication?
Normalization

More Related Content

What's hot

Database management system
Database management systemDatabase management system
Database management systemashishkthakur94
 
DBMS FOR STUDENTS MUST DOWNLOAD AND READ
DBMS FOR STUDENTS MUST DOWNLOAD AND READDBMS FOR STUDENTS MUST DOWNLOAD AND READ
DBMS FOR STUDENTS MUST DOWNLOAD AND READamitp26
 
Chapter 2 database environment
Chapter 2 database environmentChapter 2 database environment
Chapter 2 database environment>. <
 
Architecture of-dbms-and-data-independence
Architecture of-dbms-and-data-independenceArchitecture of-dbms-and-data-independence
Architecture of-dbms-and-data-independenceAnuj Modi
 
Database management systems
Database management systemsDatabase management systems
Database management systemsJoel Briza
 
Database Systems
Database SystemsDatabase Systems
Database SystemsUsman Tariq
 
Database introduction
Database introductionDatabase introduction
Database introductionHarry Potter
 
Database systems - Chapter 1
Database systems - Chapter 1Database systems - Chapter 1
Database systems - Chapter 1shahab3
 
Modern database management system chapter 1
Modern database management system chapter 1Modern database management system chapter 1
Modern database management system chapter 1AyeshaShoukat5
 
Components and Advantages of DBMS
Components and Advantages of DBMSComponents and Advantages of DBMS
Components and Advantages of DBMSShubham Joon
 
L7 data model and dbms architecture
L7  data model and dbms architectureL7  data model and dbms architecture
L7 data model and dbms architectureRushdi Shams
 
Data base management system and Architecture ppt.
Data base management system and Architecture ppt.Data base management system and Architecture ppt.
Data base management system and Architecture ppt.AnkitAbhilashSwain
 
Fundamentals of Database ppt ch01
Fundamentals of Database ppt ch01Fundamentals of Database ppt ch01
Fundamentals of Database ppt ch01Jotham Gadot
 

What's hot (20)

Database management system
Database management systemDatabase management system
Database management system
 
DBMS FOR STUDENTS MUST DOWNLOAD AND READ
DBMS FOR STUDENTS MUST DOWNLOAD AND READDBMS FOR STUDENTS MUST DOWNLOAD AND READ
DBMS FOR STUDENTS MUST DOWNLOAD AND READ
 
Chapter 2 database environment
Chapter 2 database environmentChapter 2 database environment
Chapter 2 database environment
 
D B M S Animate
D B M S AnimateD B M S Animate
D B M S Animate
 
Data models
Data modelsData models
Data models
 
Assign 1
Assign 1Assign 1
Assign 1
 
Dbms notes
Dbms notesDbms notes
Dbms notes
 
Architecture of-dbms-and-data-independence
Architecture of-dbms-and-data-independenceArchitecture of-dbms-and-data-independence
Architecture of-dbms-and-data-independence
 
Database management systems
Database management systemsDatabase management systems
Database management systems
 
Database concepts
Database conceptsDatabase concepts
Database concepts
 
Database Systems
Database SystemsDatabase Systems
Database Systems
 
Dbms Useful PPT
Dbms Useful PPTDbms Useful PPT
Dbms Useful PPT
 
Database introduction
Database introductionDatabase introduction
Database introduction
 
Database systems - Chapter 1
Database systems - Chapter 1Database systems - Chapter 1
Database systems - Chapter 1
 
Modern database management system chapter 1
Modern database management system chapter 1Modern database management system chapter 1
Modern database management system chapter 1
 
Ch1
Ch1Ch1
Ch1
 
Components and Advantages of DBMS
Components and Advantages of DBMSComponents and Advantages of DBMS
Components and Advantages of DBMS
 
L7 data model and dbms architecture
L7  data model and dbms architectureL7  data model and dbms architecture
L7 data model and dbms architecture
 
Data base management system and Architecture ppt.
Data base management system and Architecture ppt.Data base management system and Architecture ppt.
Data base management system and Architecture ppt.
 
Fundamentals of Database ppt ch01
Fundamentals of Database ppt ch01Fundamentals of Database ppt ch01
Fundamentals of Database ppt ch01
 

Similar to Database Concepts 101

Similar to Database Concepts 101 (20)

Dbms unit01
Dbms unit01Dbms unit01
Dbms unit01
 
Database Management Systems
Database Management SystemsDatabase Management Systems
Database Management Systems
 
database introductoin optimization1-app6891.pdf
database introductoin optimization1-app6891.pdfdatabase introductoin optimization1-app6891.pdf
database introductoin optimization1-app6891.pdf
 
Introduction to Database
Introduction to DatabaseIntroduction to Database
Introduction to Database
 
Lecture 1 to 3intro to normalization in database
Lecture 1 to 3intro to  normalization in databaseLecture 1 to 3intro to  normalization in database
Lecture 1 to 3intro to normalization in database
 
Unit01 dbms
Unit01 dbmsUnit01 dbms
Unit01 dbms
 
Database Management Systems (Mcom Ecommerce)
Database Management Systems (Mcom Ecommerce)Database Management Systems (Mcom Ecommerce)
Database Management Systems (Mcom Ecommerce)
 
27 fcs157al2
27 fcs157al227 fcs157al2
27 fcs157al2
 
RDBMS to NoSQL. An overview.
RDBMS to NoSQL. An overview.RDBMS to NoSQL. An overview.
RDBMS to NoSQL. An overview.
 
Database Management Systems
Database Management SystemsDatabase Management Systems
Database Management Systems
 
Database Management system
Database Management systemDatabase Management system
Database Management system
 
Database Systems Concepts, 5th Ed
Database Systems Concepts, 5th EdDatabase Systems Concepts, 5th Ed
Database Systems Concepts, 5th Ed
 
Database Concepts
Database ConceptsDatabase Concepts
Database Concepts
 
Lecture#5
Lecture#5Lecture#5
Lecture#5
 
Database fundamentals
Database fundamentalsDatabase fundamentals
Database fundamentals
 
Mis chapter 7 database systems
Mis chapter 7 database systemsMis chapter 7 database systems
Mis chapter 7 database systems
 
Database and Database Management (DBM): Health Informatics
Database and Database Management (DBM): Health InformaticsDatabase and Database Management (DBM): Health Informatics
Database and Database Management (DBM): Health Informatics
 
INT 1010 07-5.pdf
INT 1010 07-5.pdfINT 1010 07-5.pdf
INT 1010 07-5.pdf
 
jose rizal
jose rizaljose rizal
jose rizal
 
DBMS - Introduction
DBMS - IntroductionDBMS - Introduction
DBMS - Introduction
 

Recently uploaded

Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxOnBoard
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphNeo4j
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
Artificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraArtificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraDeakin University
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxMaking_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxnull - The Open Security Community
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptxLBM Solutions
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsSnow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsHyundai Motor Group
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 

Recently uploaded (20)

Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptx
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
Artificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraArtificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning era
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 
The transition to renewables in India.pdf
The transition to renewables in India.pdfThe transition to renewables in India.pdf
The transition to renewables in India.pdf
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxMaking_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptx
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsSnow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 

Database Concepts 101

  • 3. Page 3 What is data, information, & Database?  Data is a collection of facts, such as values or measurements.  It can be numbers, words, measurements, observations or even just descriptions of things.  Data is information or facts that are collected to give a bigger picture of things.  Information has meaning in some context for its receiver  A database is a collection of information that is organized so that it can easily be accessed, managed, and updated
  • 4. Page 4 What is DBMS?  DBMS is the data base management system which is responsible for the storage of the data. Through this the storage of data had been quite easy and the data that had been stored can be easily modified and extracted too. This in fact is the collection of the storing, executing, processing and modifying the data.  DBMS contains information about a particular enterprise  Collection of interrelated data  Set of programs to access the data  An environment that is both convenient and efficient to use
  • 5. Page 5 DBMS Applications  Database Applications:  Banking: transactions  Airlines: reservations, schedules  Universities: registration, grades  Sales: customers, products, purchases  Online retailers: order tracking, customized recommendations  Manufacturing: production, inventory, orders, supply chain  Human resources: employee records, salaries, tax deductions
  • 6. Page 6 Drawbacks of using file systems  Data redundancy and inconsistency  Multiple file formats, duplication of information in different files  Difficulty in accessing data  Need to write a new program to carry out each new task  Data isolation — multiple files and formats  Integrity problems  Integrity constraints (e.g., account balance > 0) become “buried” in program code rather than being stated explicitly  Hard to add new constraints or change existing ones  Atomicity of updates  Failures may leave database in an inconsistent state with partial updates carried out  Example: Transfer of funds from one account to another should either complete or not happen at all
  • 7. Page 7 Drawbacks of using file systems  Concurrent access by multiple users  Concurrent access needed for performance  Uncontrolled concurrent accesses can lead to inconsistencies  Example: Two people reading a balance (say 100) and updating it by withdrawing money (say 50 each) at the same time  Security problems  Hard to provide user access to some, but not all, data Database systems (DBMS) offer solutions to all the above problems
  • 8. Page 8 What is RDBMS?  RDBMS is a database management system based on relational model defined by E. F. Codd. Data is stored in the form of rows and columns. The relations among tables are also stored in the form of the table.  Features: Provides data to be stored in tables • Persists data in the form of rows and columns • Provides facility primary key, to uniquely identify the rows • Creates indexes for quicker data retrieval • Provides a virtual table creation in which sensitive data can be stored and simplified query can be applied.(views) • Sharing a common column in two or more tables(primary key and foreign key) • Provides multi user accessibility that can be controlled by individual users
  • 9. Page 9 What are E. F. Codd rules? Also called Codd's Law, a set of 13 rules used to determine if a DBMS can be considered a relational DBMS (RDBMS). In 1985, Dr. E. F. Codd first published this list of rules that became a standard way of evaluating a relational system. The rules have since been expanded by others, and after publishing the original article Codd stated that there are no systems that will satisfy every rule. Following are Codd's original 13 rules: 0. Often referred to as rule 0, this rule states that all subsequent rules are based on the notion that in order for a database to be considered relational, it must use its relational facilities exclusively to manage the database. 1. The Information rule: All information in an RDBMS is represented logically in just one way - by values in tables.
  • 10. Page 10 What are E. F. Code rules? 2. The Guaranteed Access rule: Each item of data in an RDBMS is guaranteed to be logically accessible by resorting to a combination of table name, primary key value, and column name. 3. The Systematic Treatment of Null Values rule: Null values (distinct from an empty character string or a string of blank characters and distinct from zero or any other number) are supported in a fully relational DBMS for representing missing information and inapplicable information in a systematic way, independent of the data type. 4. The Dynamic Online Catalog Based on the Relational Model rule: The database description is represented at the logical level in the same way as ordinary data, so that authorized users can apply the same relational language to its interrogation as they apply to the regular data.
  • 11. Page 11 What are E. F. Code rules? 5. The Comprehensive Data Sublanguage rule: A relational system may support several languages and various modes of terminal use (for example, the fill-in-blanks mode). However, there must be at least one language whose statements are expressible, per some well-defined syntax, as character strings and whose ability to support all of the following is comprehensible: data definition, view definition, data manipulation (interactive and by program), integrity constraints, and transaction boundaries (begin, commit, and rollback). 6. The View updating rule: All views of the data which are theoretically updatable must be updatable in practice by the DBMS. 7. The High-level Insert, Update, and Delete rule: The capability of handling a base relation or a derived relation as a single operand applies not only to the retrieval of data but also to the insertion, update, and deletion of data.
  • 12. Page 12 What are E. F. Code rules? 8. The Physical Data Independence rule: Application programs and terminal activities remain logically unimpaired whenever any changes are made in either storage representations or access methods. 9. The Logical Data Independence rule: Application programs and terminal activities remain logically unimpaired when information preserving changes of any kind that theoretically permit un-impairment are made to the base tables. 10. The Integrity Independence rule: Integrity constraints must be definable in the RDBMS sub-language and stored in the system catalogue and not within individual application programs.
  • 13. Page 13 What are E. F. Code rules? 11. The Distribution Independence rule: An RDBMS has distribution independence. Distribution independence implies that users should not have to be aware of whether a database is distributed. 12. The Non-subversion rule: If the database has any means of handling a single record at a time, that low-level language must not be able to subvert or avoid the integrity rules which are expressed in a higher-level language that handles multiple records at a time
  • 14. Page 14 SQL  SQL: widely used non-procedural language • Example: Find the name of the instructor with ID 22222 select name from instructor where instructor.ID = ‘22222’  Application programs generally access databases through one of • Language extensions to allow embedded SQL • Application program interface (e.g., ODBC/JDBC) which allow SQL queries to be sent to a database  Two classes of languages • Procedural – user specifies what data is required and how to get those data • Declarative (nonprocedural) – user specifies what data is required without specifying how to get those data
  • 15. Page 15 Data Manipulation Language (DML)  Language for accessing and manipulating the data organized by the appropriate data model • DML also known as query language
  • 16. Page 16 Data Definition Language (DDL)  Specification notation for defining the database schema  DDL compiler generates a set of table templates stored in a data dictionary  Data dictionary contains metadata (i.e., data about data) • Database schema • Integrity constraints – Primary key (ID uniquely identifies instructors) – Referential integrity (references constraint in SQL) • Authorization  Difference between Schema & User? • HR Schema, Finance Schema, MFG…
  • 17. Page 17 Database Objects Object Description Table Basic unit of storage; composed of rows and columns View Logically represents subsets of data from one or more tables Sequence Numeric value generator Index Improves the performance of some queries Synonym Gives alternative names to objects
  • 18. Data Types Data Type Description VARCHAR2(size) Variable length character data (32767 characters) CHAR(size) Fixed Length Character data up to 2000 characters NUMBER(p,s) Variable length numeric data DATE Date and time values LONG Variable length character data up to 2GB CLOB Character data up to 4GB
  • 19. Page 19 Data Model A database model is a theory or specification describing how a database is structured and used. A collection of tools for describing • Data • Data relationships • Data semantics • Data constraints Entity-Relationship data model Object-based data models Semi-structured data model (XML) Other older models: • Network model • Hierarchical model
  • 20. Page 20 Database Design The process of designing the general structure of the database: Logical Design – Deciding on the database schema. Database design requires that we find a “good” collection of relation schemas. • Business decision – What attributes should we record in the database? • Computer Science decision – What relation schemas should we have and how should the attributes be distributed among the various relation schemas? Physical Design – Deciding on the physical layout of the database
  • 21. Design Approach  Normalization Theory: Database normalization is usually performed when designing a relational database  Entity Relationship Model: ERD is the commonly used system of diagramming, for visual representation of the relational model
  • 22. Normalization  Objective  Eliminating redundant data (for example, storing the same data in more than one table)  Ensuring data dependencies make sense (only storing related data in a table).  Normalization typically involves decomposing an un-normalized table into two or more tables that, were they to be combined (joined), would convey exactly the same information as the original table.  You'll often see 1NF, 2NF, and 3NF along with the occasional 4NF.
  • 23. Normalization Example First Form: For a table to be in first normal form, all repeating groups must be moved to a new table. Consider the example in Figure, in which several office location columns have been added to the EMPLOYEE table.
  • 24. Normalization Example Second Form: Second normal form is a special-case normal form that has to do with tables that have composite primary keys. A composite primary key includes two or more columns. In second normal form, all nonkey columns must depend on the entire primary key. The EMPLOYEE_OFFICE_LOCATION table in Figure is in violation of second normal form. For this table to comply with second normal form, another table must be created and the OFFICE_PHONE_NUMBER data must be moved to the new table.
  • 25. Normalization Example Third Form: Third normal form expands on second normal form. It dictates that every nonkey column must be a detail or a fact about the primary key. Figure illustrates a third normal form violation. The addition of the DEPT_NAME column in this EMPLOYEE table violates third normal form in that DEPT_NAME is dependent on the DEPT_ID value, not the EMPLOYEE_ID value. Complying with the rules of third normal form necessitates creating another table and moving the department name values into this new table, with a foreign key reference from the EMPLOYEE table to the new table.
  • 26. Page 26 The Entity-Relationship Model  Models an enterprise as a collection of entities and relationships • Entity: a “thing” or “object” in the enterprise that is distinguishable from other objects, described by a set of attributes • Attributes: All information about Entity • Relationship: an association among several entities  Represented diagrammatically by an entity-relationship diagram:
  • 27. Degrees of a Relationship HusbandHusband WifeWife Father ChildChild BrotherBrother SisterSister One-to-one (1:1) One-to-many (1:n) Many-to-many (n:m)
  • 28. Many to One Mapping CUSTOMER TAKES LOAN HAS REPAYMENT C_ID C_NAME LOAN_ID LN_AMT DT_OF_LOAN EMI_AMT CHEQUE_ID CHEQUE_DATE CHEQUE_AMT 1 M M M
  • 29. Page 29 Query Processing 1. Parsing and translation 2. Optimization 3. Evaluation
  • 30. Page 32 Storage Management  Storage manager is a program module that provides the interface between the low-level data stored in the database and the application programs and queries submitted to the system.  The storage manager is responsible to the following tasks: • Interaction with the file manager • Efficient storing, retrieving and updating of data
  • 31. Page 33 Database Users and Administrators Database
  • 33. Page 35 Client Server Architecture
  • 34. Oracle Products  Oracle Databases  Oracle Application Server  Oracle applications  Oracle Collaboration Suite  Oracle Developer Suite
  • 35. Oracle Database 10g: “g” Stands for Grid  Oracle’s grid infrastructure: • Low cost • High quality of service • Easy to manage Automatic Storage Management Real Application Clusters Oracle Streams Enterprise Manager Grid Control Storage grid Database grid Application grid Grid control
  • 36. Oracle RDBMS Architecture There are three main components: 1. Instance - Instance memory/System Global Area (SGA) - Back ground process 2. Database - Data files - Control Files - Redo log files 3. Other Components - Other process - Other memory (PGA) - Other files
  • 39. System Development Life Cycle (SDLC)
  • 41. Page 43 QUIZ 1. RDBMS? Relational Database Management System 2. SQL? Structured Query Language 3. Relational model defined by? E. F. Codd 4. ER Model? Entity-Relationship Model 5. Design Approach to reduce Duplication? Normalization

Editor's Notes

  1. Oracle Products Oracle databases: The Oracle database is the first database that is designed for enterprise grid computing (the most flexible and cost-effective way to manage information and applications). Oracle Application Server: Oracle’s Java 2 Platform, Enterprise Edition (J2EE)–certified server integrates everything that is needed to develop and deploy Web-based applications. The application server deploys e-business portals, Web services, and transactional applications, including PL/SQL, Oracle Forms, and J2EE-based applications. Oracle applications: Oracle E-Business Suite is a complete set of business applications for managing and automating processes across your organization. Oracle Collaboration Suite: Oracle Collaboration Suite is a single, integrated system for all your organization’s communications data: voice, e-mail, fax, wireless, calendar information, and files. Oracle Developer Suite: Oracle Developer Suite is a complete, integrated environment that combines application development and business intelligence tools.
  2. Oracle Database 10g: “g” Stands for Grid Global Grid Forum (GGF) is a standards body that develops standards for grid computing. It comprises a set of committees and working groups that focus on various aspects of grid computing. The committees and working groups are composed of participants from academia, the research community, and (increasingly) commercial companies. You can see the Web site of GGF at http://www.gridforum.org. Oracle has created the grid computing infrastructure software that balances all types of workloads across servers and enables all those servers to be managed as one complete system. Grid computing can achieve the same very high level of reliability as mainframe computing because all components are clustered. But unlike mainframes and large UNIX symmetric multiprocessing (SMP) servers, a grid can be built with open system technologies, such as Intel processors and the Linux operating system, at a very low cost. Oracle’s grid computing technology includes: Automatic Storage Management (ASM) Real Application Clusters (RAC) Oracle Streams Enterprise Manager Grid Control Oracle Database 10g: “g” Stands for Grid (continued) Automatic Storage Management spreads database data across all disks, creates and maintains a storage grid, and provides the highest input/output (I/O) throughput with minimal management costs. As disks are added or dropped, ASM redistributes the data automatically. (There is no need for a logical volume manager to manage the file system.) Data availability increases with optional mirroring, and you can add or drop disks online. For more information, see the lesson titled “Managing Database Storage Structures.” Oracle’s Real Application Clusters runs and scales all application workloads on a cluster of servers and offers the following features: Integrated clusterware: This includes functionality for cluster connectivity, messaging and locking, cluster control, and recovery. It is available on all platforms that are supported by Oracle Database 10g. Automatic workload management: Rules can be defined to automatically allocate processing resources to each service both during normal operations and in response to failures. These rules can be dynamically modified to meet the changing business needs. This dynamic resource allocation within a database grid is unique to Oracle RAC. Automatic event notification to the mid-tier: When a cluster configuration changes, the mid-tier can immediately adapt to instance failover or availability of a new instance. This enables end users to continue working in the event of instance failover without the delays typically caused by network timeouts. In the event of new instance availability, the mid-tier can immediately start load balancing connections to that instance. Oracle Database 10g Java Database Connectivity (JDBC) drivers have the “fast connection failover” functionality that can be automatically enabled to handle these events. Oracle Streams provides a unified framework for information sharing, combining message queuing, data replication, event notification, data warehouse loading, and publishing and subscribing functionality into a single technology. Oracle Streams can keep two or more data source copies synchronized when updates are applied at either site. It can automatically capture database changes, propagate the changes to subscribing nodes, apply changes, and detect and resolve data update conflicts. Oracle Streams can be used directly by applications as a message-queuing or workflow feature, enabling communications between applications in the grid. Enterprise Manager Grid Control manages gridwide operations that include managing the entire stack of software, provisioning users, cloning databases, and managing patches. It can monitor the performance of all applications from the point of view of your end users. Grid Control views the performance and availability of the grid infrastructure as a unified whole rather than as isolated storage units, databases, and application servers. You can group hardware nodes, databases, and application servers into single logical entities and manage a group of targets as one unit. Note: In this course, you use Enterprise Manager Database Console to manage one database at a time.
  3. Oracle Instance Management An Oracle database server consists of an Oracle database and an Oracle instance. An Oracle instance is made up of memory structures, known as the System Global Area (SGA), and background processes that handle much of the behind-the-scenes work involved in running an instance. The most common background processes are the following: System Monitor (SMON): Performs crash recovery when the instance is started following a failure Process Monitor (PMON): Performs process cleanup when a user process fails Database Writer (DBWn): Writes modified blocks from the database buffer cache to the data files on the disk Checkpoint (CKPT): Updates all the data files and control files of the database to indicate the most recent checkpoint LogWriter (LGWR): Writes redo log entries to the disk Archiver (ARCn): Copies redo log files to the archival storage when a log switch occurs
  4. Oracle Products Oracle databases: The Oracle database is the first database that is designed for enterprise grid computing (the most flexible and cost-effective way to manage information and applications). Oracle Application Server: Oracle’s Java 2 Platform, Enterprise Edition (J2EE)–certified server integrates everything that is needed to develop and deploy Web-based applications. The application server deploys e-business portals, Web services, and transactional applications, including PL/SQL, Oracle Forms, and J2EE-based applications. Oracle applications: Oracle E-Business Suite is a complete set of business applications for managing and automating processes across your organization. Oracle Collaboration Suite: Oracle Collaboration Suite is a single, integrated system for all your organization’s communications data: voice, e-mail, fax, wireless, calendar information, and files. Oracle Developer Suite: Oracle Developer Suite is a complete, integrated environment that combines application development and business intelligence tools.
  5. Oracle Products Oracle databases: The Oracle database is the first database that is designed for enterprise grid computing (the most flexible and cost-effective way to manage information and applications). Oracle Application Server: Oracle’s Java 2 Platform, Enterprise Edition (J2EE)–certified server integrates everything that is needed to develop and deploy Web-based applications. The application server deploys e-business portals, Web services, and transactional applications, including PL/SQL, Oracle Forms, and J2EE-based applications. Oracle applications: Oracle E-Business Suite is a complete set of business applications for managing and automating processes across your organization. Oracle Collaboration Suite: Oracle Collaboration Suite is a single, integrated system for all your organization’s communications data: voice, e-mail, fax, wireless, calendar information, and files. Oracle Developer Suite: Oracle Developer Suite is a complete, integrated environment that combines application development and business intelligence tools.