This document provides an overview of a student database management system project. It includes an abstract describing the need for a centralized student information system. It also includes sections on the background and objectives of the project, current issues with database systems, the role of database support, methods for evaluation, and future work related to cloud computing, distributed models, and data virtualization.
DATA WAREHOUSING SYSTEM
Achyut Sai Chillara
50215706
Prof. Kwang Lee
DATABASE
SYSTEM
15/04/2019
TABLE OF CONTENTS
ABSTRACT
LIST OF ABBREVIATIONS
INTRODUCTION TO DATA WAREHOUSE
BACKGROUND INFORMATION
· PROJECT OBJECTIVE
· PURPOSE OF PROJECT
CURRENT ISSUES OF DATABASE SYSTEM AND DATA WAREHOUSING
METHODS AND EVALUATION
· OPERATIONAL SYSTEM
· DATABASES
· DATA WAREHOUSE
FUTURE WORK
· DEPLOYMENT MODELS FOR CLOUD COMPUTING
· DISTRIBUTED COMPUTING MODELS
· DATA VIRTUALIZATION
REFERENCES
ABSTRACT
In any organization, organized and systematic record keeping is
essential. A university has several departments, and each
department holds information that needs to be stored and
retrieved efficiently. Most of the records concern student
activities and personal information. Generally, the information
covers student name, department, academic performance, address,
attendance, and any other information specifically required by
the department.
All of the modules in a school's administration are related, yet
they are maintained manually. They should therefore be
computerized and centralized, because information from one
module will be required by other modules. For example, when a
student needs a course completion certificate, many details
about the student must be checked, such as name, registration
number, year of study, and exams attended. The request therefore
has to consult every module involved: the office, the
department, and the examination and results records.
There are five stages in this model, and the first is the
planning stage. The planning stage determines the objectives of
the database and whether the project should be given the green
light to proceed. This is where the proposal submission comes
into the picture. After approval is obtained, the next stage is
analysis. Gathering and analyzing the system and user
requirements is essential before moving on to the design step.
LIST OF ABBREVIATIONS/ACRONYMS
IT .......... Information Technology
SDLC ........ System Development Life Cycle
DFD ......... Data Flow Diagram
ERD ......... Entity Relationship Diagram
DBMS ........ Database Management System
RDBMS ....... Relational Database Management System
SDBMS ....... Student Database Management System
RAD ......... Rapid Application Development
VBA ......... Visual Basic for Applications
INTRODUCTION TO DATA WAREHOUSE
A data warehouse functions as a central repository of
information coming from one or more data sources. Data flows
into a data warehouse from transactional systems and other
relational databases, and typically includes structured,
semi-structured, and unstructured data. This data is processed,
transformed, and ingested at a regular cadence. Users, including
data scientists, business analysts, and decision makers, access
the processed data in the data warehouse through business
intelligence tools, SQL clients, and spreadsheets.
Data and analytics have become essential for organizations that
want to stay competitive. Organizations use reports, dashboards,
and analytics tools to extract insights from their data, monitor
business performance, and support decision making. These
reports, dashboards, and analytics tools are powered by data
warehouses, which store data efficiently to minimize I/O and
deliver query results quickly to a large number of users at the
same time.
The Student Database Management System will store all student
information in the database. Generally, this includes student
name, address, department, subjects enrolled, performance,
average GPA, and graduation details. For example, the school
website provides all the information that should be given to
the student, while the grading details and the grading method
remain hidden.
By combining all of this information in one place, an
organization can analyze its customers in a more holistic way,
ensuring that it has considered all the information available.
Data warehousing also makes data mining possible, which is the
task of looking for patterns in the data that could lead to
higher sales and profits.
BACKGROUND INFORMATION
The organization in scope is CCP, which provides basic support
to vulnerable children within the social setting of their
extended family. The school has more than 1000 students taking
different courses. The success of CCP Vocational Training
Institute depends on its ability to acquire accurate and timely
data about its operations, to manage this data effectively, and
to use it to analyze and guide its internal daily activities.
The Student Database System handles all kinds of student details
by tracking each student from the first day to the end of his or
her course. These records can be used for all reporting
purposes: tracking attendance, progress in the course, completed
semesters and years, curriculum details for the coming semester
or year, exam details, project or other assignment details, and
final exam results. All of these are kept for future reference
when assessing the organization's performance.
PROJECT OBJECTIVE
The main objective is to develop a robust Student Database
Management System for the Christian Childcare Program (CCP).
Specific objectives:
· To implement the system
· To test and validate the system
PURPOSE OF PROJECT
The proposed system is intended to make record keeping easier.
The main purpose of the project is to build a student database
system that facilitates easy access to students' records. The
Student Database System will allow the registrar of CCP
Vocational Training Institute to log in and edit or update
students' records. It also supports efficient management of
student information and can process and print students'
transcripts and result slips.
CURRENT ISSUES OF DATA WAREHOUSING
• The amount of storage space and the layout of a warehouse are
serious issues for running a business.
• Accuracy, cost control, efficiency, cleanliness, and safety
and security are all required, yet the underlying processes are
complex and dynamic, presenting serious problems for warehouse
managers across industries.
• Disorganized warehouse spaces can cause unnecessary labor
costs, and the incorrect use of storage systems and racking
arrangements results in many companies finding their warehouse
shelves full, with no room to receive new stock.
• Slow picking processes and stock discrepancies.
• Fluctuations in demand pose serious challenges for warehouse
managers. The dip in sales due to the recent global financial
crisis caused significant cost problems for warehouses because
of increased stock levels.
ROLE OF DATABASE SYSTEM SUPPORT
A database management system is important because it manages
data efficiently and allows users to perform multiple tasks with
ease. A database management system stores, organizes, and
manages a large amount of information within a single software
application. Use of this system increases the efficiency of
business operations and reduces overall costs.
Database management systems are important to businesses and
organizations because they provide a highly efficient method for
handling many types of data. Some of the data easily managed
with this type of system include employee records, student
information, payroll, accounting, project management, inventory,
and library books. These systems are built to be extremely
versatile. Without database management, tasks have to be done
manually and take more time. Data can be categorized and
structured to suit the needs of the company or organization.
Data is entered into the system and accessed on a routine basis
by assigned users. Each user may have an assigned password to
gain access to their part of the system. Multiple users can use
the system at the same time in different ways.
Relational databases use multiple tables and define
relationships between them using a schema in addition to data
elements. Records and data elements from each table are
combined, based on the query, and displayed in a form. Routinely
used queries often become reports. A report uses the same query
but reports on changes in the data over time.
A DBMS follows the concepts of Atomicity, Consistency,
Isolation, and Durability (normally shortened to ACID). These
concepts are applied to transactions, which manipulate data in a
database. ACID properties help the database stay healthy in
multi-transactional environments and in case of failure.
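As a minimal sketch of atomicity, the example below uses Python's built-in sqlite3 module. The table names, the `register` helper, and the sample values are assumptions made for illustration and are not part of the project. It shows that when the second statement of a transaction fails, the first statement is rolled back as well.

```python
import sqlite3

# Hypothetical tables for illustration only.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE student (student_id TEXT PRIMARY KEY, name TEXT NOT NULL)")
conn.execute(
    "CREATE TABLE fee (student_id TEXT NOT NULL, "
    "amount INTEGER NOT NULL CHECK (amount >= 0))"
)

def register(conn, student_id, name, fee_amount):
    """Insert a student and the matching fee record as one atomic unit."""
    try:
        with conn:  # commits on success, rolls back if an exception escapes
            conn.execute("INSERT INTO student VALUES (?, ?)", (student_id, name))
            conn.execute("INSERT INTO fee VALUES (?, ?)", (student_id, fee_amount))
        return True
    except sqlite3.IntegrityError:
        return False

ok = register(conn, "S001", "Asha", 500)
failed = register(conn, "S002", "Ben", -10)  # negative fee violates the CHECK
student_count = conn.execute("SELECT COUNT(*) FROM student").fetchone()[0]
```

After both calls only the first student is stored: the row for the second student was inserted and then undone together with the rejected fee row.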
METHODS OF EVALUATION
OPERATIONAL SYSTEMS
Operational systems maintain records of daily business
transactions, whereas a data warehouse is a special database
that serves as the integrated repository of company data for
reporting and decision support purposes. In other words,
operational systems are where the data is put in, and the data
warehouse is where we get the data out.
DATABASES
A database is a systematically organized or structured
repository of recorded information that allows easy retrieval,
updating, analysis, and output of data. Usually stored in a
computer, this data could be in the form of graphics, reports,
scripts, tables, text, and so on, representing almost every kind
of information. Most computer applications, including antivirus
software, spreadsheets, and word processors, are databases at
their core.
DATA WAREHOUSE
A data warehouse is an organized collection of all the data that
an enterprise's various business systems collect. The repository
may be physical or logical. Data warehousing emphasizes the
capture of data from diverse sources for useful analysis and
access, but it does not generally start from the point of view
of the end user who may need access to specialized, sometimes
local, databases. It incorporates information and data obtained
from operational systems and external data sources.
Online transaction processing (OLTP) refers to information
systems that facilitate and manage transaction-oriented
applications, typically for data entry and retrieval transaction
processing, with the main advantage of serving their customers.
DIFFERENCE BETWEEN DATA WAREHOUSE AND OLTP
Data warehouse | OLTP
It is subject oriented. | It is application oriented.
It is used for analysis, problem solving, and decision support. | It is used to control and run fundamental business tasks.
Processing time depends on the amount of data. | Processing of data is very fast.
It uses complex queries involving aggregation. | It uses relatively standard and simple queries.
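The difference in query style can be illustrated with a small sketch using Python's sqlite3 module. The `grade` table and its rows are invented for this example. The first query is a simple OLTP-style lookup for one student, and the second is a warehouse-style aggregation over all rows.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Hypothetical table and sample data, used only to contrast the two query styles.
conn.execute(
    "CREATE TABLE grade (student_id TEXT, department TEXT, subject TEXT, score REAL)"
)
conn.executemany(
    "INSERT INTO grade VALUES (?, ?, ?, ?)",
    [
        ("S001", "IT", "Databases", 80.0),
        ("S001", "IT", "Networks", 70.0),
        ("S002", "IT", "Databases", 90.0),
        ("S003", "Business", "Accounting", 60.0),
    ],
)

# OLTP style: a simple query that touches only the rows of one student.
oltp_rows = conn.execute(
    "SELECT subject, score FROM grade WHERE student_id = ? ORDER BY subject",
    ("S001",),
).fetchall()

# Warehouse style: an aggregation that scans every row to summarize by department.
report = conn.execute(
    "SELECT department, COUNT(DISTINCT student_id), AVG(score) "
    "FROM grade GROUP BY department ORDER BY department"
).fetchall()
```

In a real deployment the two queries would normally run against separate systems; a single in-memory database is used here only to keep the sketch self-contained.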
CLOUD TECHNOLOGY
Cloud computing is a computing paradigm in which a large pool of
systems is connected in private or public networks to provide
dynamically scalable infrastructure for application, data, and
file storage. With the advent of this technology, the cost of
computation, application hosting, content storage, and delivery
is reduced significantly. It is based on a very fundamental
principle: the reusability of IT capabilities.
BENEFITS OF CLOUD COMPUTING:
· Reduced cost
· Increased storage
· Flexibility
FUTURE WORK
DEPLOYMENT MODELS FOR CLOUD COMPUTING
PUBLIC CLOUD
Public clouds are owned and operated by third parties. They
deliver better economies of scale to customers, because the
infrastructure costs are spread among a mix of users, giving
each individual client an attractive low-cost, "pay-as-you-go"
model.
PRIVATE CLOUD
Private clouds are built exclusively for a single enterprise.
They aim to address concerns about data security and offer
greater control, which is typically lacking in a public cloud.
HYBRID CLOUD
Hybrid clouds combine both public and private cloud models. With
a hybrid cloud, service providers can use third-party cloud
providers in a full or partial manner, thereby increasing the
flexibility of computing.
DISTRIBUTED COMPUTING MODELS
Software as a Service (SaaS): In this model, a complete
application is offered to the customer as a service on demand. A
single instance of the service runs on the cloud and serves
multiple end users.
Platform as a Service (PaaS): Here, a layer of software or a
development environment is encapsulated and offered as a
service, upon which other, higher levels of service can be
built. The customer has the freedom to build his own
applications, which run on the provider's infrastructure.
Infrastructure as a Service (IaaS): IaaS provides basic storage
and computing capabilities as standardized services over the
network. Servers, storage systems, networking equipment, data
center space, and so on are pooled and made available to handle
workloads.
DATA VIRTUALIZATION
Data virtualization is the process of aggregating data from
different sources of information to develop a single, logical,
and virtual view of information, so that it can be accessed by
front-end solutions such as applications, dashboards, and
portals without knowing the data's exact storage location.
The most recent implementation of the data virtualization
concept is in cloud computing technology. With data
virtualization, business users can get real-time and reliable
information quickly, which helps them make major business
decisions.
KEY IDEAS OF DATA VIRTUALIZATION
• Abstraction of the technical aspects of stored data, such as
the application programming interface, access language,
location, and storage structure.
• In case of disaster, data is sent to development for testing
and other purposes. If that happens, the virtual machines shut
down the machine and production starts again.
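The idea of a single logical view over several stores can be sketched on a very small scale with SQLite's ATTACH DATABASE statement. This is only an illustration under stated assumptions: the two in-memory databases, the table names, and the `student_overview` function are hypothetical, and a production data virtualization layer would sit in front of far more varied sources.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# A second, separate in-memory database stands in for another data source.
conn.execute("ATTACH DATABASE ':memory:' AS exams")
conn.execute("CREATE TABLE main.student (student_id TEXT, name TEXT)")
conn.execute("CREATE TABLE exams.result (student_id TEXT, score REAL)")
conn.execute("INSERT INTO main.student VALUES ('S001', 'Asha')")
conn.execute("INSERT INTO exams.result VALUES ('S001', 85.0)")

def student_overview(conn, student_id):
    """Return one combined record; the caller never names the underlying stores."""
    return conn.execute(
        "SELECT s.name, r.score "
        "FROM main.student AS s "
        "JOIN exams.result AS r ON r.student_id = s.student_id "
        "WHERE s.student_id = ?",
        (student_id,),
    ).fetchone()

overview = student_overview(conn, "S001")
```

A front-end such as a dashboard would call `student_overview` and receive the combined record without knowing that the name and the score live in different databases.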
NETWORK INFRASTRUCTURE AND SECURITY
Network infrastructure is the hardware and software resources of
an entire network that enable network connectivity,
communication, operations, and management of an enterprise
network. It provides the communication path and services between
users, processes, applications, services, and external networks
or the internet.
The network infrastructure will support organizational
operations; therefore, a pictorial view of workstations,
servers, routers, bridges, gateways, and access points should be
used. Since the organization will merge with a multinational
organization, virtualization and cloud technology should be
taken into consideration. In addition, access paths for Internet
access should be depicted. A narrative should be included to
explain all the nodes of the network and the rationale for the
design. Finally, using the Confidentiality, Integrity and
Availability (CIA) Triangle, define the organizational security
policy.
BASIC ISSUES AND DANGERS OF NETWORK INFRASTRUCTURE:
• Carry out a comprehensive network security audit to pinpoint
vulnerabilities.
• Ensure your organization adheres to regulatory compliance.
• Use proactive monitoring to reduce downtime and risks.
• Enhance customer confidence by showing that their data is
secured.
TECHNIQUES TO VERIFY INFORMATION IS SAFE:
• Keeping the data intact, complete, and accurate, and IT
systems operational.
• Setting an objective that data or systems are at the disposal
of authorized users when needed.
• Patching all the data properly.
• Using public key encryption, where only the organization can
decrypt the data.
REFERENCES
https://www.informatica.com/services-and-training/glossary-of-terms/data-warehousing-definition.html
https://searchdatamanagement.techtarget.com/definition/data-warehouse
https://www.guru99.com/data-warehousing.html
RELATIONAL DATABASE
The relational database model consists of a collection of
objects or relations, a set of operations to act on the
relations, and data integrity rules for accuracy and
consistency. A database contains one or more tables of
information. The rows in a table are called records, and the
columns in a table are called fields or attributes. A database
that contains only one table is called a flat database.
• A relational database is perceived by the user as a collection
of two-dimensional tables.
• It is manipulated a set at a time, rather than a record at a
time.
• SQL is used to manipulate relational databases.
A Relational Database Management System (RDBMS) is a database
system made up of files with data elements in a two-dimensional
array (rows and columns). This database management system has
the capability to recombine data elements to form different
relations, resulting in great flexibility of data usage.
An RDBMS is a software system that provides access to a
relational database. The software system is a collection of
software applications that can be used to create, maintain,
manage, and use the database. A "relational database" is a
database structured on the "relational" model. Data are stored
and presented in a tabular format, organized in rows and
columns, with one record per row.
ADVANTAGES OF RELATIONAL DATABASE
DYNAMIC VIEWS
Relational databases support an important concept of dynamic
views. In this database, a view is not a part of the physical
schema, it is dynamic. Hence changing the data in a table alters
the data depicted by the view. Views can subset data, join and
simplify multiple relations, dynamically hide the complexity in
the data, and reduce data storage requirements.
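A minimal sketch of a dynamic view, using Python's sqlite3 module, is shown here. The `student` table, the `student_names` view, and the sample row are assumptions made for illustration. The view stores no data of its own, so a change to the table is visible through the view immediately, and the `grade` column stays hidden from anyone who reads only the view.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Hypothetical table and view names for illustration.
conn.execute(
    "CREATE TABLE student (student_id TEXT PRIMARY KEY, name TEXT, grade TEXT)"
)
# The view exposes a subset of columns and hides the grade.
conn.execute("CREATE VIEW student_names AS SELECT student_id, name FROM student")

conn.execute("INSERT INTO student VALUES ('S001', 'Asha', 'A')")
before = conn.execute("SELECT * FROM student_names").fetchall()

# Changing the base table changes what the view returns; nothing is copied.
conn.execute("UPDATE student SET name = 'Asha K.' WHERE student_id = 'S001'")
after = conn.execute("SELECT * FROM student_names").fetchall()
```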
SECURITY
Relational databases provide excellent security. They support
access permissions which allow the database administrator to
implement need-based permissions to the access of data in
database tables. Relational databases support the concept of
users and user rights, thus meeting the security needs of
databases. Relations are associated with privileges like create
privilege, grant privilege, select, insert, and delete privileges,
which authorize different users for corresponding operations on
the database.
SIMPLE OPERATIONS AND DEFINED RELATIONSHIPS
Relational databases use SQL, which is an easy and human-
readable language. SQL instructions are in the form of plain
instructions, which can be put to the database for
implementation. Most of the database vendors support the SQL
standard. Relational databases provide the users with simple
operations to manipulate data in the databases and retrieve it.
The other important advantages of relational databases include
their performance, power, and support to new hardware
technologies, as also flexibility and a capacity to meet all types
of data needs. These databases are scalable and provide support
for the implementation of distributed systems.
DISADVANTAGES OF RELATIONAL DATABASE
· Expensive solutions that require thorough planning.
· It is easy to create badly designed and inefficient databases
if there is no proper data analysis prior to implementation.
DATA FLOW DIAGRAM (DFD)
Data flow diagrams (DFDs) reveal relationships among and between
the various components in a program or system. DFDs are an
important technique for modeling a system's high-level detail by
showing how input data is transformed into output results
through a sequence of functional transformations.
DFDs consist of four major components: entities, processes, data
stores, and data flows. The symbols used to depict how these
components interact in a system are simple and easy to
understand; however, there are several DFD models to work from,
each having its own symbology. DFD syntax remains constant by
using simple verb and noun constructs. Such a syntactical
relationship makes DFDs ideal for object-oriented analysis and
for parsing functional specifications into precise DFDs for the
systems analyst.
1. DFDs are easier to understand for technical and nontechnical
audiences.
2. DFDs can provide a high-level system overview, complete with
boundaries and connections to other systems.
3. DFDs can provide a detailed representation of system
components.
DATABASE DESIGN – ENTITY-RELATIONSHIP
An Entity Relationship Diagram (ERD) is a data model describing
how entities (or concepts or things) relate to one another. When
created by business analysts, ERDs can be used to understand the
business domain, clarify business terminology, and connect
business concepts to database structures.
Essentially, a conceptual or logical ERD will visually show how
the terms in your glossary relate to each other. ERDs are
especially helpful in clarifying information models for
relational databases and helping business users understand
database structures at a high level and without details.
An attribute of an entity is a particular property that
describes the entity. A relationship is the association that
describes the interaction between entities. Cardinality, in the
context of an ERD, is the number of instances of one entity that
can, or must, be associated with each instance of another
entity. In general, there may be one-to-one, one-to-many, or
many-to-many relationships.
MAPPING CARDINALITIES
Cardinality defines the number of entities in one entity set
that can be associated with entities of another set via a
relationship set.
• One-to-one: An entity in A is associated with at most one
entity in B, and an entity in B is associated with at most one
entity in A.
• One-to-many: One entity from entity set A can be associated
with more than one entity of entity set B; however, an entity
from entity set B can be associated with at most one entity of
entity set A.
• Many-to-one: More than one entity from entity set A can be
associated with at most one entity of entity set B; however, an
entity from entity set B can be associated with more than one
entity from entity set A.
• Many-to-many: One entity from A can be associated with more
than one entity from B, and vice versa.
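The cardinalities listed above map onto table designs in a fairly standard way, sketched below with Python's sqlite3 module. The table names, columns, and rows are hypothetical. A one-to-many relationship is expressed by a foreign key column on the "many" side, and a many-to-many relationship by a separate linking table with one row per pair.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE department (department_id TEXT PRIMARY KEY, name TEXT);

-- One-to-many: each student row points at one department,
-- while a department can be referenced by many students.
CREATE TABLE student (
    student_id    TEXT PRIMARY KEY,
    name          TEXT,
    department_id TEXT REFERENCES department(department_id)
);

CREATE TABLE subject (subject_id TEXT PRIMARY KEY, title TEXT);

-- Many-to-many: one row per (student, subject) pair.
CREATE TABLE enrollment (
    student_id TEXT REFERENCES student(student_id),
    subject_id TEXT REFERENCES subject(subject_id),
    PRIMARY KEY (student_id, subject_id)
);

INSERT INTO department VALUES ('D1', 'IT');
INSERT INTO student VALUES ('S001', 'Asha', 'D1'), ('S002', 'Ben', 'D1');
INSERT INTO subject VALUES ('DB', 'Databases'), ('NW', 'Networks');
INSERT INTO enrollment VALUES ('S001', 'DB'), ('S001', 'NW'), ('S002', 'DB');
""")

def count(sql, *params):
    """Run a COUNT(*) query and return the single number."""
    return conn.execute(sql, params).fetchone()[0]

students_in_d1 = count("SELECT COUNT(*) FROM student WHERE department_id = ?", "D1")
subjects_of_s001 = count("SELECT COUNT(*) FROM enrollment WHERE student_id = ?", "S001")
students_in_db = count("SELECT COUNT(*) FROM enrollment WHERE subject_id = ?", "DB")
```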
KEY ELEMENTS
Entities – An entity is a thing. In business domain terms, it is
a concept or glossary-level term. In relational database terms,
it is the table. It is represented by a box.
Relationships – The real insight from this type of diagram
comes when we see how entities relate to one another, or
relationships. Relationships can be thought of as verbs that link
two or more nouns. Relationships can be modeled numerically,
using the multiplicity syntax from a class diagram, or using
Crows Foot Notation.
Attributes – Inside each entity, there can be more than one
attribute. Attributes provide detailed information about the
concept. In a relational database, attributes are represented by
the fields where the information inside a record is held.
Steps for creating an Entity Relationship Diagram
Like any analysis model, creating an ERD is an iterative process
that involves elicitation, analysis, and review with
stakeholders. Here are a few steps you will go through as you
create an ERD.
• Create boxes for each entity or primary business concept
relevant to your model.
• Model the relationships between entities by drawing lines to
connect related entities. Label the relationships using verbs or
a numeric notation. Crow's Foot Notation is common for ERDs, but
you can also use the multiplicity notation from UML's Class
Diagrams.
• Identify relevant attributes inside each entity. For a
conceptual model, focus on the most important attributes. As
your model evolves, make your attribute lists more specific.
• Review your model with business and technical stakeholders.
• Repeat until your domain is well represented by your model.
DATA SCHEMA IMPLEMENTATIONS
A database schema is a collection of metadata that describes the
relations in a database. A schema is also described as the
layout or blueprint of a database that outlines the way data is
organized into tables.
A schema is normally described using Structured Query Language
(SQL) as a series of CREATE statements that may be used to
replicate the schema in a new database.
There are two kinds of database schema:
• The physical database schema gives the blueprint for how each
piece of data is stored in the database.
• The logical schema gives structure to the tables and
relationships inside the database. In general, the logical
schema is created before the physical schema. Typically,
database designers use data modeling to create a database schema
based on the software that will interact with the database.
Primary Key: A table can contain only one primary key. The
primary key column cannot accept null values and also requires
that the data values be unique for that table column.
Foreign Key: The purpose of a foreign key is to create a link
between two tables. A column in one table holds the primary key
values of another table, and that column is the foreign key that
references the other table.
DATABASE TABLES
1. Basic information table
Field name | Data type | Description
Student id | varchar | Primary key
Student name | varchar |
Phone number | varchar |
E-mail | varchar |
Address | varchar |
2. Department table
Field name | Data type | Description
Department id | varchar | Primary key
Department name | varchar |
Department address | varchar | Foreign key to Department table
3. Student table
Field name | Data type | Description
Student id | varchar | Primary key
Student name | varchar |
Phone number | varchar |
Subjects | varchar |
Grade | varchar |
Department id | varchar | Foreign key to Department table
Primary and Foreign keys
1. Basic information table:
Student id - Primary key
2. Department table:
Department id - Primary key
Customer id - Foreign key to customer table
3. Student table:
Student id - Primary key
Department id - Foreign key to Department table
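As a rough sketch, the Department and Student tables listed above could be written as CREATE statements and loaded with Python's sqlite3 module. The snake_case column names, the varchar lengths, and the NOT NULL markers are assumptions added for this example, and the student's Department id is assumed to reference the Department table. The two PRAGMA queries read the key definitions back from the database catalog.

```python
import sqlite3

# Column names and varchar lengths are illustrative assumptions.
SCHEMA = """
CREATE TABLE department (
    department_id      VARCHAR(10) NOT NULL PRIMARY KEY,
    department_name    VARCHAR(100),
    department_address VARCHAR(200)
);
CREATE TABLE student (
    student_id    VARCHAR(10) NOT NULL PRIMARY KEY,
    student_name  VARCHAR(100),
    phone_number  VARCHAR(20),
    subjects      VARCHAR(200),
    grade         VARCHAR(5),
    department_id VARCHAR(10) REFERENCES department(department_id)
);
"""

conn = sqlite3.connect(":memory:")
conn.executescript(SCHEMA)

# table_info rows are (cid, name, type, notnull, dflt_value, pk).
pk_columns = [row[1] for row in conn.execute("PRAGMA table_info(student)") if row[5]]

# foreign_key_list rows are (id, seq, table, from, to, ...).
fk_targets = [
    (row[3], row[2], row[4])
    for row in conn.execute("PRAGMA foreign_key_list(student)")
]
```

Because the schema is plain text, the same `SCHEMA` string can be run against a fresh database to replicate the structure.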
Normalization
Normalization is the process of organizing data into tables in
such a way that the results of using the database are always
unambiguous and as intended. Such normalization is intrinsic to
relational database theory. It may have the effect of
duplicating data within the database and often results in the
creation of additional tables.
First Normal Form (1NF)
First Normal Form is defined in the definition of relations
(tables) itself. This rule states that all the attributes in a
relation must have atomic domains. The values in an atomic
domain are indivisible units.
• Each table cell should contain a single value.
• Each record needs to be unique.
Second Normal Form (2NF)
A schema is said to be in second normal form (2NF) if all
attributes that are not part of the primary key are fully
functionally dependent on the primary key, and the schema is
already in first normal form.
• Rule 1 - Be in 1NF
• Rule 2 - No non-key attribute depends on only part of the
primary key (a single-column primary key satisfies this
automatically)
Third Normal Form (3NF)
Third normal form tells us that attributes must depend on
nothing but the key. Formally, for a schema to be in third
normal form (3NF), we must remove all transitive dependencies,
and the schema must already be in second normal form.
• Rule 1 - Be in 2NF
• Rule 2 - Has no transitive functional dependencies
To move our 2NF table into 3NF, we again need to divide our
table.
Normalization of Student table
The Student table is not normalized. A student having 2
addresses will cause a violation of 1NF; likewise, 2 different
students having the same address will cause a violation of 2NF.
Decomposing the Student table into 2 separate tables, Student
and Address, will bring the table into 2NF.
Student table
Student ID | Student Name | Address | Phone number | Email id | Type
This can be done by partitioning the table as per 1NF.
Student table
Student ID | Address | Phone number | Email id | Type
Student table
Student ID | Student name
The above decomposition still does not bring the student table
into 2NF, because 2 students can share a common student type, so
further decomposition of the student table will make it
completely normalized up to 3NF.
Student table
Student ID | Address | Phone number | Email id | Type
Student table
Student ID | Student type
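To make the removal of a transitive dependency concrete, here is a small pure-Python sketch. The sample rows, the column names, and the `decompose` helper are hypothetical and are not the project's data. In the sample, the student id determines the department id, and the department id in turn determines the department name, so the department name is moved into its own table.

```python
def decompose(rows, key, moved):
    """Split rows so that the `moved` columns are stored once per `key` value.

    Returns (remaining_rows, lookup). Raises ValueError if `key` does not
    actually determine the moved columns in the given data.
    """
    lookup = {}
    for row in rows:
        values = {col: row[col] for col in moved}
        if lookup.setdefault(row[key], values) != values:
            raise ValueError(f"{key}={row[key]!r} maps to conflicting values")
    remaining = [{c: v for c, v in row.items() if c not in moved} for row in rows]
    return remaining, lookup

# Hypothetical unnormalized rows: department_name repeats for every student.
rows = [
    {"student_id": "S001", "name": "Asha", "department_id": "D1", "department_name": "IT"},
    {"student_id": "S002", "name": "Ben", "department_id": "D1", "department_name": "IT"},
    {"student_id": "S003", "name": "Chen", "department_id": "D2", "department_name": "Business"},
]

students, departments = decompose(rows, key="department_id", moved=["department_name"])

# Joining the two tables again reproduces the original rows, so nothing was lost.
rejoined = [{**s, **departments[s["department_id"]]} for s in students]
```

After the split, each department name is stored once, and renaming a department means changing a single entry instead of every student row.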
Referential Integrity
Referential integrity is a database constraint that ensures that
references between data are valid and intact. Referential
integrity is a fundamental principle of database theory and
arises from the notion that a database should not only store
data, but should actively seek to ensure its quality.
Referential integrity in a relational database is consistency
between coupled tables.
Referential integrity is usually enforced by the combination of
a primary key and a foreign key. For referential integrity to
hold, any field in a table that is declared a foreign key can
contain only values from a parent table's primary key field.
It is a feature provided by relational database management
systems (RDBMSs) that prevents users or applications from
entering inconsistent data. Most RDBMSs have various referential
integrity rules that you can apply when you create a
relationship between two tables.
In other words, it is a database management safeguard that
ensures every foreign key matches a primary key. For example,
customer numbers in a customer file are the primary keys, and
customer numbers in the order file are the foreign keys. If a
customer record is deleted, the order records must also be
deleted; otherwise they are left without a primary reference. If
the DBMS does not test for this, it must be programmed into the
applications.
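The customer and order example can be sketched with Python's sqlite3 module. The table and column names below are assumptions chosen to mirror the example. SQLite only enforces foreign keys after `PRAGMA foreign_keys = ON` is issued, and `ON DELETE CASCADE` is one of several rules a DBMS may offer for deleting dependent rows.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("PRAGMA foreign_keys = ON")  # SQLite leaves enforcement off by default

# Hypothetical tables mirroring the customer/order example.
conn.execute("CREATE TABLE customer (customer_no INTEGER PRIMARY KEY, name TEXT)")
conn.execute("""
    CREATE TABLE orders (
        order_no    INTEGER PRIMARY KEY,
        customer_no INTEGER NOT NULL
            REFERENCES customer(customer_no) ON DELETE CASCADE
    )
""")
conn.execute("INSERT INTO customer VALUES (1, 'Asha')")
conn.execute("INSERT INTO orders VALUES (100, 1)")

# An order for a customer that does not exist is rejected by the DBMS.
try:
    conn.execute("INSERT INTO orders VALUES (101, 99)")
    orphan_rejected = False
except sqlite3.IntegrityError:
    orphan_rejected = True

# Deleting the customer also deletes the dependent order rows.
conn.execute("DELETE FROM customer WHERE customer_no = 1")
orders_left = conn.execute("SELECT COUNT(*) FROM orders").fetchone()[0]
```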
REFERENCES
https://www.tutorialspoint.com/dbms/database_normalization.htm
https://www.techopedia.com/definition/1221/normalization
https://beginnersbook.com/2015/04/e-r-model-in-dbms/
https://www.lucidchart.com/pages/er-diagrams