chapter 4-Functional Dependency and Normilization.pdfMisganawAbeje1
This chapter describe about the theory that has been developed with the goal of evaluating relational schemas for design quality , that is, to measure formally why one set of groupings of attributes into relation schemas is better than
another.
Normalization is a process that “improves” a database design by generating relations that are of higher normal forms.
The objective of normalization:
“to create relations where every dependency is on the key, the whole key, and nothing but the key”.
INTRODUCTION
3NF and BCNF
Decomposition requirements
Lossless join decomposition
Dependency preserving decomposition
Disk pack features
Records and Files
Ordered and Unordered files
2NF,NF,3NF,BCNF
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Subhajit Sahu
Abstract — Levelwise PageRank is an alternative method of PageRank computation which decomposes the input graph into a directed acyclic block-graph of strongly connected components, and processes them in topological order, one level at a time. This enables calculation for ranks in a distributed fashion without per-iteration communication, unlike the standard method where all vertices are processed in each iteration. It however comes with a precondition of the absence of dead ends in the input graph. Here, the native non-distributed performance of Levelwise PageRank was compared against Monolithic PageRank on a CPU as well as a GPU. To ensure a fair comparison, Monolithic PageRank was also performed on a graph where vertices were split by components. Results indicate that Levelwise PageRank is about as fast as Monolithic PageRank on the CPU, but quite a bit slower on the GPU. Slowdown on the GPU is likely caused by a large submission of small workloads, and expected to be non-issue when the computation is performed on massive graphs.
chapter 4-Functional Dependency and Normilization.pdfMisganawAbeje1
This chapter describe about the theory that has been developed with the goal of evaluating relational schemas for design quality , that is, to measure formally why one set of groupings of attributes into relation schemas is better than
another.
Normalization is a process that “improves” a database design by generating relations that are of higher normal forms.
The objective of normalization:
“to create relations where every dependency is on the key, the whole key, and nothing but the key”.
INTRODUCTION
3NF and BCNF
Decomposition requirements
Lossless join decomposition
Dependency preserving decomposition
Disk pack features
Records and Files
Ordered and Unordered files
2NF,NF,3NF,BCNF
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Subhajit Sahu
Abstract — Levelwise PageRank is an alternative method of PageRank computation which decomposes the input graph into a directed acyclic block-graph of strongly connected components, and processes them in topological order, one level at a time. This enables calculation for ranks in a distributed fashion without per-iteration communication, unlike the standard method where all vertices are processed in each iteration. It however comes with a precondition of the absence of dead ends in the input graph. Here, the native non-distributed performance of Levelwise PageRank was compared against Monolithic PageRank on a CPU as well as a GPU. To ensure a fair comparison, Monolithic PageRank was also performed on a graph where vertices were split by components. Results indicate that Levelwise PageRank is about as fast as Monolithic PageRank on the CPU, but quite a bit slower on the GPU. Slowdown on the GPU is likely caused by a large submission of small workloads, and expected to be non-issue when the computation is performed on massive graphs.
Explore our comprehensive data analysis project presentation on predicting product ad campaign performance. Learn how data-driven insights can optimize your marketing strategies and enhance campaign effectiveness. Perfect for professionals and students looking to understand the power of data analysis in advertising. for more details visit: https://bostoninstituteofanalytics.org/data-science-and-artificial-intelligence/
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...John Andrews
SlideShare Description for "Chatty Kathy - UNC Bootcamp Final Project Presentation"
Title: Chatty Kathy: Enhancing Physical Activity Among Older Adults
Description:
Discover how Chatty Kathy, an innovative project developed at the UNC Bootcamp, aims to tackle the challenge of low physical activity among older adults. Our AI-driven solution uses peer interaction to boost and sustain exercise levels, significantly improving health outcomes. This presentation covers our problem statement, the rationale behind Chatty Kathy, synthetic data and persona creation, model performance metrics, a visual demonstration of the project, and potential future developments. Join us for an insightful Q&A session to explore the potential of this groundbreaking project.
Project Team: Jay Requarth, Jana Avery, John Andrews, Dr. Dick Davis II, Nee Buntoum, Nam Yeongjin & Mat Nicholas
Opendatabay - Open Data Marketplace.pptxOpendatabay
Opendatabay.com unlocks the power of data for everyone. Open Data Marketplace fosters a collaborative hub for data enthusiasts to explore, share, and contribute to a vast collection of datasets.
First ever open hub for data enthusiasts to collaborate and innovate. A platform to explore, share, and contribute to a vast collection of datasets. Through robust quality control and innovative technologies like blockchain verification, opendatabay ensures the authenticity and reliability of datasets, empowering users to make data-driven decisions with confidence. Leverage cutting-edge AI technologies to enhance the data exploration, analysis, and discovery experience.
From intelligent search and recommendations to automated data productisation and quotation, Opendatabay AI-driven features streamline the data workflow. Finding the data you need shouldn't be a complex. Opendatabay simplifies the data acquisition process with an intuitive interface and robust search tools. Effortlessly explore, discover, and access the data you need, allowing you to focus on extracting valuable insights. Opendatabay breaks new ground with a dedicated, AI-generated, synthetic datasets.
Leverage these privacy-preserving datasets for training and testing AI models without compromising sensitive information. Opendatabay prioritizes transparency by providing detailed metadata, provenance information, and usage guidelines for each dataset, ensuring users have a comprehensive understanding of the data they're working with. By leveraging a powerful combination of distributed ledger technology and rigorous third-party audits Opendatabay ensures the authenticity and reliability of every dataset. Security is at the core of Opendatabay. Marketplace implements stringent security measures, including encryption, access controls, and regular vulnerability assessments, to safeguard your data and protect your privacy.
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
Normal forms & Normalization.pptx
1. Normal forms & Normalization
Proposed by Edgar F. Codd
Integral part of Relational Databases
Reduce redundancy & improve data
integrity
Top down refinement process
Relational design by analysis
3. Definition :
Normalization is the process of analyzing relation schemas
based on their Functional Dependencies and Primary
Keys to achieve the desirable properties of :
Minimizing redundancy
Minimizing insertion, deletion & updation
anomalies.
Normal Form of a relation refers to the degree to which it
has been normalized.
5. First Normal form
Only Atomic Values
Disallow Multi-valued & Composite
Attributes
DEPARTMENT
DNAME DNUMBER MANAGER_ID DLOCATIONS
DNAME DNUMBER MANAGER_ID DNUMBER DLOCATIONS
6. Second normal form
• Full Functional Dependency
• A relation schema is in 2 NF if it is in 1 NF and if every
non-prime attribute is fully functionally dependent on the
Primary key.
SSN PNUMBER HOURS ENAME PNAME PLOCATION
{ SSN, PNUMBER } HOURS ……….. Full Functional Dependency
{ ENAME, PNAME, PLOCATION } all are partially dependent on the key.
Hence the above relation is not in 2 NF
8. DEFINITION :
A relation schema is in 2 NF if it is in 1 NF and every non-
prime attribute is fully functionally dependent on the key.
A relation schema is in 2 NF if it is in 1 NF and every non-
prime attribute is not partially dependent on the key.
9. Third normal form
Transitive Dependency
In a relation schema R if there exist attributes X,Y,Z such
that :
X Y & Y Z, where Y is non-prime
Then, X Y is a transitive dependency through non-
prime attribute Y.
10. DEFINITION :
A relation schema is in 3 NF if
• it satisfies 2 NF and
• no non-prime attribute is transitively
dependent on the key.
12. Examples for practice
PROJ_ID PNAME PMGR_ID E_ID ENAME E_DEPT E_HRLY_RATE HOURS
PROJECT_EMPLOYEE
PROJ_ID PNAME PMGR_ID
PROJ_ID E_ID ENAME E_DEPT E_HRLY_RATE HOURS
PROJECT
PROJECT_EMPLOYEE
Now the relations are in 1NF.
{ ENAME, E_DEPT, E_HRLY_RATE } are partially dependent on the key { PROJ_ID, E_ID }
Therefore the relation is not in 2 NF
13. PROJ_ID PNAME PMGR_ID
PROJECT
E_ID ENAME E_DEPT E_HRLY_RATE
PROJ_ID E_ID HOURS
HOURS
Above relations are in 2 NF, but not in 3 NF.
E_ID E_DEPT, E_DEPT E_HRLY_RATE
So, E_ID E_HRLY_RATE is a transitive dependency.
E_ID ENAME E_DEPT E_DEPT E_HRLY_RATE
EMPLOYEE HOURLY_RATES
14. Q1) Consider the following relation :
CAR_SALE( CAR#, DATE_SOLD, SALESMAN#,
COMMISSION, DISCOUNT)
Additional functional dependencies are :
DATE_SOLD DISCOUNT
SALESMAN# COMMISSION
Based on the primary key, what is the highest normal form
of this relation. Normalize it to 3 NF.
15. Solution :
The relation is in 1NF.
It is not in 2 NF. As SALESMAN# COMMISSION
COMMISSION is partially dependent on the key { CAR#, SALESMAN#}
It is decomposed into two relations :
CAR_SALE(CAR#, DATE_SOLD, DISCOUNT, SALESMAN#)
SALES(SALESMAN#, COMMISSION)
These relations are in 2 NF but not in 3 NF.
CAR# DATE_SOLD, DATE_SOLD DISCOUNT
Further decomposition results :
CAR_SALE(CAR# , DATE_SOLD, SALESMAN#)
DATES(DATE_SOLD, DISCOUNT)
SALES(SALESMAN#, COMMISSION)
17. Q3) Schedule(stud_id, class_no, sname, course, class_time, room_no, instructor)
stud_id sname, course
Class_no class_time, room_no, instructor
Solution :
Relation is in 1 NF but not in 2 NF due to partial dependency on the key.
Therefore it is decomposed into :
STUDENT(STUD_ID, SNAME, COURSE)
CLASS(CLASS_NO, CLSS_TIME, ROOM_NO, INSTRUCTOR)
STUD_CLASS(STUD_ID, CLASS_NO)
18. Q4) Transaction(Trans_ID, item_no, price, qty_sold, seller, seller_discount)
Trans_id seller
Trans_id, item_no qty_sold
Item_no price
Seller seller_discount
Solution :
TRANSCATION(TRANS_ID, ITEM_NO, QTY_SOLD)
ITEMS(ITEM_NO, PRICE)
SELLER(TRANS_ID, SELLER, SELLER_DISCOUNT)
Above relations are in 2 NF but not in 3 NF. Further decomposition of SELLER leads to :
SELLER1(TRANS_ID, SELLER)
SELLER2(SELLER, SELLER_DISCOUNT)