SlideShare a Scribd company logo
QUESTION BANK
SUBJECT : DATA WAREHOUSING AND MINING
UNIT 1:
2MARKS
1. What are the key features of data warehouse?
2. Define data warehouse.
3. What are operational databases?
4. Define OLTP?
5. Define OLAP?
6. How a database design is represented in OLTP systems?
7. How a database design is represented in OLAP systems?
8. Differentiate the features of operational database and data warehouse.
9. Write short notes on multidimensional data model?
10.Define data cube?
11.What are facts?
12.What are dimensions?
13.Define dimension table?
14.Define fact table?
15.What are lattice of cuboids?
16.What is apex cuboid?
17.List out the components of star schema?
18.What is snowflake schema?
19.List out the components of fact constellation schema?
20.Point out the major difference between the star schema and the snowflake
schema?
21.Which is popular in the data warehouse design, star schema model (or)
snowflake schema model?
22.Define concept hierarchy?
23.Define total order?
24.Define partial order?
25.Define schema hierarchy?
26.List out the OLAP operations in multidimensional data model?
27.What is roll-up operation?
28.What is drill-down operation?
29.What is slice operation?
30.What is dice operation?
31.What is pivot operation?
32.List out the views in the design of a data warehouse?
33.What are the methods for developing large software systems?
34.How the operation is performed in waterfall method?
35.How the operation is performed in spiral method?
36..List out the steps of the data warehouse design process?
37.Define ROLAP
38.Define MOLAP?
39..Define HOLAP?
40.What is enterprise warehouse?
41.What is data mart?
42.What are dependent and independent data marts?
43.What is virtual warehouse?
44.Define indexing?
45.What are the types of indexing?
46.Differentiate partial and full materialization.
16 MARKS:
1. Discuss the components of data warehouse.
2. List out the differences between OLTP and OLAP.
3. Discuss the various schematic representations in multidimensional data model.
4. Explain the OLAP operations in the multidimensional data model.
5. Explain the design and construction of a data warehouse.
6. Explain the three-tier data warehouse architecture.
7. Explain the process of data warehouse implementation.
UNIT 2:
2MARKS
1. What is data pre-processing?
2. Define data cleaning.
3. List out the methods to fill the missing values.
4. What is data smoothing?
5. Define data integration.
6. Define data transformation.
7. List out the methods used for data normalization.
8. Differentiate min-max and z-score normalization.
9. What you mean by correlation analysis?
10.What are the methods used for data reduction?
11. Define DWT.
12. What is principle component analysis?
13. Differentiate linear and multiple regression.
14. What is histogram?
15. Define sampling.
16. Define clustering.
17. What is visualization?
18. Define DMOL.
19. How to specify task relevant data using DMQL.
20. List out the coupling schema used by data mining system.
21. Differentiate semi tight and tight coupling.
22.What is Concept description?
23.Differentiate descriptive and predictive data mining.
24.What is AOI?
25.When to perform attribute removal?
26.What is attribute generalization?
27.What are the different types of class comparison?
28.What is attribute relevance analysis?
29.Differentiate quartiles and outliers.
30. Define box plot.
16 MARKS:
1. Explain about various data cleaning and integration process.
2. Explain about data mining primitive tasks.
3. Explain dimensionality reduction and numerosity reduction techniques.
4. Explain about discretization and concept hierarchy generation.
5. Discuss about various data transformation techniques.
6. Explain about DMQL.
7. Write short notes on data mining system architecture?
8. Write short notes on concept description?
9. Explain about the statistical measures used in large databases.
10.Explain attribute oriented induction and its implementation.
11.Write short notes on attribute relevance analysis?
UNIT 3:
2MARKS
1. What is association rule mining?
2. Define support.
3. Define Confidence.
4. How is association rules mined from large databases?
5. What is the classification of association rules based on various criteria?
6. What is Apriori algorithm?
7. List the techniques for improving the efficiency of apriori algorithm.
8. What is portioning?
9. Define transaction reduction.
10.How sampling helps to improve efficiency of apriori algorithm.
11.List the drawbacks of apriori algorithm.
12.What is FP-Tree?
13.What is iceberg query and how it is used to improve market basket analysis?
14.List the different approaches of multilevel association rules.
15.What is controlled level cross filtering?
16.Define level passage threshold.
16 MARKS:
1) Explain about mining association rules in large databases.
2) Explain apriori algorithm with example.
3) Explain FP-growth algorithm with example.
4) Discuss about mining multi-level association rules and also about different
approaches used for it.
5) A DB has 4 transactions.let min_sup=60%, min_conf=80%
TID DATE ITEMS_BOUGHT
T100
T200
T300
T400
10/15/99
10/15/99
10/19/99
10/22/99
{K,A,D,B}
{D,A,C,E,B}
{C,A,B,E}
{B,A,D}
i) Find all frequent itemsets using apriori algorithm.
ii) Find all frequent itemsets using FP-growth algorithm.
6) A database has five transactions. Let min_sup=60% and min_conf=75%.
TID Items brought
T100
T200
T300
T400
T500
{B,C,E,J}
{B,C,J}
{B,M,Y}
{B,J,M}
{C,J,M}
i) Find all frequent itemsets using apriori algorithm and FP-growth.
ii) List all of the strong association rules.
UNIT 4:
2MARKS
1. Define classification and prediction.
2. Describe the two common approaches to tree pruning.
3. What are Bayesian Classifiers?
4. What is a “decision tree”?
5. Where is decision trees mainly used?
6. How will you solve a classification problem using decision trees?
7. What is ID3?
8. What is decision tree pruning?
9. List some of attribute selection measures used in decision tree induction.
10.What is Baye’s theorem?
11.What is Bayesian belief network?
12.What is k-Nearest neighbor classifier?
13.Where case-based reasoning is used?
14.Differentiate eager learner and lazy learner.
15.Define least square?
16.List some available prediction methods available.
17.What is classifier accuracy?
18.What is the purpose of using confusion matrix?
19.Define cluster analysis.
20.List out the types of data used in cluster analysis.
21.Differentiate data and dissimilarity matrix.
22.Define Manhattan distance.
23.List the categorization of clustering methods.
24.What is a density-based and Grid-based method?
25.Differentiate agglomerative and divisive approach.
26.What are the disadvantages of K-means algorithm?
27.List the initial inputs given to K-medoids algorithm.
28.Define outliers.
29.Differentiate statistical-based and distance based outlier detection.
30.What is the purpose of using smoothing factor?
16 MARKS:
1. Explain decision tree induction algorithm.
2. Explain about various attribute selection methods used in decision tree induction.
3. Explain about various methods used for prediction.
4. Explain cluster accuracy.
5. Explain about Bayesian classification.
6. Write short notes on outlier analysis?
7. Write short notes on cluster partitioning methods?
8. Explain K-Means algorithm.
9. Explain K-Mediods algorithm.
10.What is cluster analysis? Explain about types of data used in cluster analysis.
11.Explain various methods used for outlier detection.
12.Classify the given training samples using ID3 algorithm. Apply the same to
construct a decision tree for the data given below. (Note: Use information gain as
attribute selection measure.)
SIZE COLOR SHAPE CLASS
Small
Big
Big
Small
Small
Big
Big
Big
Small
Yellow
Yellow
Red
Red
Black
Black
Yellow
Black
Yellow
Round
Round
Round
Round
Round
Cube
Cube
Round
Cube
A
A
A
A
B
B
B
B
B
13.The following table shows the mid-term and final exam grades obtained for students
in a database course.
X
MIDTERM EXAM
Y
FINAL EXAM
72
50
81
84
63
71
74
94
86
59
83
65
33
88
81
78
90
75
49
79
77
52
74
90
Predict the final exam grade of a student who received 86 on the midterm
exam.
UNIT 5:
2MARKS
1. Define Spatial Databases
2. Define Transactional Databases.
3. What is Temporal Database?
4. Mention the types of dimensions in spatial data cube.
5. Name some of the data mining applications?
6. What are the contributions of data mining to DNA analysis?
7. Name some examples of data mining in retail industry?
8. Explain multimedia data mining
9. What does web mining mean
10.Define text mining
11.How to assess the quality of text retrieval?
12.List out the methods used for information retrieval.
13.What is web usage mining?
14.Differentiate time-series and sequence database.
15.List the kinds of association can be mined from multimedia data.
16 MARKS:
1. Describe the applications and trends in data mining in detail.
2. Explain how data mining is used in banking industry.
3. Explain how data mining is used in health care analysis.
4. Explain data mining applications for Telecommunication industry.
5. Explain data mining applications for retail industry.
6. Explain data mining applications fro financial data analysis.
7. Explain data mining applications for Biomedical and DNA data analysis.
8. Explain in detail about spatial data mining.
9. Explain in detail about text mining.
10.Explain in detail about mining multimedia databases.
11.Explain in detail about mining WWW.
12.Explain in detail about mining time-series and sequence data.

More Related Content

What's hot

Datastructures and algorithms prepared by M.V.Brehmanada Reddy
Datastructures and algorithms prepared by M.V.Brehmanada ReddyDatastructures and algorithms prepared by M.V.Brehmanada Reddy
Datastructures and algorithms prepared by M.V.Brehmanada ReddyMalikireddy Bramhananda Reddy
 
DATA STRUCTURE
DATA STRUCTUREDATA STRUCTURE
DATA STRUCTURE
Rohit Rai
 
Data structures
Data structuresData structures
data structure
data structuredata structure
data structure
hashim102
 
Introduction to data structure
Introduction to data structure Introduction to data structure
Introduction to data structure
NUPOORAWSARMOL
 
Data structures and Alogarithims
Data structures and AlogarithimsData structures and Alogarithims
Data structures and AlogarithimsVictor Palmar
 
Data Mining: Concepts and techniques: Chapter 13 trend
Data Mining: Concepts and techniques: Chapter 13 trendData Mining: Concepts and techniques: Chapter 13 trend
Data Mining: Concepts and techniques: Chapter 13 trend
Salah Amean
 

What's hot (10)

Datastructures and algorithms prepared by M.V.Brehmanada Reddy
Datastructures and algorithms prepared by M.V.Brehmanada ReddyDatastructures and algorithms prepared by M.V.Brehmanada Reddy
Datastructures and algorithms prepared by M.V.Brehmanada Reddy
 
M v bramhananda reddy dsa complete notes
M v bramhananda reddy dsa complete notesM v bramhananda reddy dsa complete notes
M v bramhananda reddy dsa complete notes
 
Ds 2
Ds 2Ds 2
Ds 2
 
DATA STRUCTURE
DATA STRUCTUREDATA STRUCTURE
DATA STRUCTURE
 
Data structures
Data structuresData structures
Data structures
 
data structure
data structuredata structure
data structure
 
Ds 1
Ds 1Ds 1
Ds 1
 
Introduction to data structure
Introduction to data structure Introduction to data structure
Introduction to data structure
 
Data structures and Alogarithims
Data structures and AlogarithimsData structures and Alogarithims
Data structures and Alogarithims
 
Data Mining: Concepts and techniques: Chapter 13 trend
Data Mining: Concepts and techniques: Chapter 13 trendData Mining: Concepts and techniques: Chapter 13 trend
Data Mining: Concepts and techniques: Chapter 13 trend
 

Viewers also liked

Windows 10 Error c1900101
Windows 10 Error c1900101Windows 10 Error c1900101
Windows 10 Error c1900101
Jhon Brown
 
Fundraising Document
Fundraising DocumentFundraising Document
Fundraising Documentpattybeeny
 
Data warehousing
Data warehousingData warehousing
Data warehousing
Shruti Dalela
 
Data Mining & Data Warehousing Lecture Notes
Data Mining & Data Warehousing Lecture NotesData Mining & Data Warehousing Lecture Notes
Data Mining & Data Warehousing Lecture Notes
FellowBuddy.com
 
Folha informativa 22-01-2017
Folha informativa 22-01-2017Folha informativa 22-01-2017
Folha informativa 22-01-2017
Paróquia Pardilhó
 
Data warehousing and Data mining
Data warehousing and Data mining Data warehousing and Data mining
Data warehousing and Data mining
Bahria University ,
 
Data Warehousing 2016
Data Warehousing 2016Data Warehousing 2016
Data Warehousing 2016
Kent Graziano
 
Data Warehousing and Data Mining
Data Warehousing and Data MiningData Warehousing and Data Mining
Data Warehousing and Data Mining
idnats
 
Data mining
Data miningData mining
Data mining
Akannsha Totewar
 
DATA WAREHOUSING AND DATA MINING
DATA WAREHOUSING AND DATA MININGDATA WAREHOUSING AND DATA MINING
DATA WAREHOUSING AND DATA MINING
Lovely Professional University
 

Viewers also liked (11)

Windows 10 Error c1900101
Windows 10 Error c1900101Windows 10 Error c1900101
Windows 10 Error c1900101
 
Certificate
CertificateCertificate
Certificate
 
Fundraising Document
Fundraising DocumentFundraising Document
Fundraising Document
 
Data warehousing
Data warehousingData warehousing
Data warehousing
 
Data Mining & Data Warehousing Lecture Notes
Data Mining & Data Warehousing Lecture NotesData Mining & Data Warehousing Lecture Notes
Data Mining & Data Warehousing Lecture Notes
 
Folha informativa 22-01-2017
Folha informativa 22-01-2017Folha informativa 22-01-2017
Folha informativa 22-01-2017
 
Data warehousing and Data mining
Data warehousing and Data mining Data warehousing and Data mining
Data warehousing and Data mining
 
Data Warehousing 2016
Data Warehousing 2016Data Warehousing 2016
Data Warehousing 2016
 
Data Warehousing and Data Mining
Data Warehousing and Data MiningData Warehousing and Data Mining
Data Warehousing and Data Mining
 
Data mining
Data miningData mining
Data mining
 
DATA WAREHOUSING AND DATA MINING
DATA WAREHOUSING AND DATA MININGDATA WAREHOUSING AND DATA MINING
DATA WAREHOUSING AND DATA MINING
 

Similar to Data mining and warehousing qb

Viva questions
Viva questionsViva questions
Viva questions
ARVIND SARDAR
 
DMS 22319 Viva questions
DMS 22319 Viva questions DMS 22319 Viva questions
DMS 22319 Viva questions
ARVIND SARDAR
 
Qstns_DataMining_Unit3_4_5 data mining a
Qstns_DataMining_Unit3_4_5 data mining aQstns_DataMining_Unit3_4_5 data mining a
Qstns_DataMining_Unit3_4_5 data mining a
MusicCity5
 
Cs 1004 -_data_warehousing_and_data_mining
Cs 1004 -_data_warehousing_and_data_miningCs 1004 -_data_warehousing_and_data_mining
Cs 1004 -_data_warehousing_and_data_mining
hari91
 
Daa tutorial
Daa tutorialDaa tutorial
Daa tutorial
poojagupta010
 
Maintenance for university course out line.docx
Maintenance for university  course out line.docxMaintenance for university  course out line.docx
Maintenance for university course out line.docx
amare lakew
 
Axapta interview questions
Axapta interview questionsAxapta interview questions
Axapta interview questions
KD420
 
Computer Science - Class7 term-1 (2019_20)
Computer Science - Class7 term-1 (2019_20)Computer Science - Class7 term-1 (2019_20)
Computer Science - Class7 term-1 (2019_20)
Andrew Raj
 
Table of Contents - Practical Business Analytics using SAS
Table of Contents - Practical Business Analytics using SAS Table of Contents - Practical Business Analytics using SAS
Table of Contents - Practical Business Analytics using SAS
Venkata Reddy Konasani
 
print mod 2.pdf
print mod 2.pdfprint mod 2.pdf
print mod 2.pdf
lathass5
 
CS8091-Big Data Analytics.pdf
CS8091-Big Data Analytics.pdfCS8091-Big Data Analytics.pdf
CS8091-Big Data Analytics.pdf
ssuserad38541
 
Dunham - Data Mining.pdf
Dunham - Data Mining.pdfDunham - Data Mining.pdf
Dunham - Data Mining.pdf
PRAJITBHADURI
 
Dunham - Data Mining.pdf
Dunham - Data Mining.pdfDunham - Data Mining.pdf
Dunham - Data Mining.pdf
ssuserf71896
 
M.tech cse 10july13 (1)
M.tech cse  10july13 (1)M.tech cse  10july13 (1)
M.tech cse 10july13 (1)
vijay707070
 
MDS_1_PacBear_2DWorld.pptx
MDS_1_PacBear_2DWorld.pptxMDS_1_PacBear_2DWorld.pptx
MDS_1_PacBear_2DWorld.pptx
IvovanderMarel1
 
Question bank data management
Question bank  data managementQuestion bank  data management
Question bank data management
SwarnaLatha177
 
Indows soft solutions experience 2 years in my sql
Indows soft solutions experience 2 years in my sqlIndows soft solutions experience 2 years in my sql
Indows soft solutions experience 2 years in my sql
Indows Soft Solutions
 
DBMS Question bank
DBMS Question bankDBMS Question bank
DBMS Question bank
Sara Sahu
 
Assignment0 Intro
Assignment0 IntroAssignment0 Intro
Assignment0 Intro
Arlene Smith
 
Data Warehouse question.docx
Data Warehouse question.docxData Warehouse question.docx
Data Warehouse question.docx
ssuser133420
 

Similar to Data mining and warehousing qb (20)

Viva questions
Viva questionsViva questions
Viva questions
 
DMS 22319 Viva questions
DMS 22319 Viva questions DMS 22319 Viva questions
DMS 22319 Viva questions
 
Qstns_DataMining_Unit3_4_5 data mining a
Qstns_DataMining_Unit3_4_5 data mining aQstns_DataMining_Unit3_4_5 data mining a
Qstns_DataMining_Unit3_4_5 data mining a
 
Cs 1004 -_data_warehousing_and_data_mining
Cs 1004 -_data_warehousing_and_data_miningCs 1004 -_data_warehousing_and_data_mining
Cs 1004 -_data_warehousing_and_data_mining
 
Daa tutorial
Daa tutorialDaa tutorial
Daa tutorial
 
Maintenance for university course out line.docx
Maintenance for university  course out line.docxMaintenance for university  course out line.docx
Maintenance for university course out line.docx
 
Axapta interview questions
Axapta interview questionsAxapta interview questions
Axapta interview questions
 
Computer Science - Class7 term-1 (2019_20)
Computer Science - Class7 term-1 (2019_20)Computer Science - Class7 term-1 (2019_20)
Computer Science - Class7 term-1 (2019_20)
 
Table of Contents - Practical Business Analytics using SAS
Table of Contents - Practical Business Analytics using SAS Table of Contents - Practical Business Analytics using SAS
Table of Contents - Practical Business Analytics using SAS
 
print mod 2.pdf
print mod 2.pdfprint mod 2.pdf
print mod 2.pdf
 
CS8091-Big Data Analytics.pdf
CS8091-Big Data Analytics.pdfCS8091-Big Data Analytics.pdf
CS8091-Big Data Analytics.pdf
 
Dunham - Data Mining.pdf
Dunham - Data Mining.pdfDunham - Data Mining.pdf
Dunham - Data Mining.pdf
 
Dunham - Data Mining.pdf
Dunham - Data Mining.pdfDunham - Data Mining.pdf
Dunham - Data Mining.pdf
 
M.tech cse 10july13 (1)
M.tech cse  10july13 (1)M.tech cse  10july13 (1)
M.tech cse 10july13 (1)
 
MDS_1_PacBear_2DWorld.pptx
MDS_1_PacBear_2DWorld.pptxMDS_1_PacBear_2DWorld.pptx
MDS_1_PacBear_2DWorld.pptx
 
Question bank data management
Question bank  data managementQuestion bank  data management
Question bank data management
 
Indows soft solutions experience 2 years in my sql
Indows soft solutions experience 2 years in my sqlIndows soft solutions experience 2 years in my sql
Indows soft solutions experience 2 years in my sql
 
DBMS Question bank
DBMS Question bankDBMS Question bank
DBMS Question bank
 
Assignment0 Intro
Assignment0 IntroAssignment0 Intro
Assignment0 Intro
 
Data Warehouse question.docx
Data Warehouse question.docxData Warehouse question.docx
Data Warehouse question.docx
 

Recently uploaded

Additional Benefits for Employee Website.pdf
Additional Benefits for Employee Website.pdfAdditional Benefits for Employee Website.pdf
Additional Benefits for Employee Website.pdf
joachimlavalley1
 
Guidance_and_Counselling.pdf B.Ed. 4th Semester
Guidance_and_Counselling.pdf B.Ed. 4th SemesterGuidance_and_Counselling.pdf B.Ed. 4th Semester
Guidance_and_Counselling.pdf B.Ed. 4th Semester
Atul Kumar Singh
 
Operation Blue Star - Saka Neela Tara
Operation Blue Star   -  Saka Neela TaraOperation Blue Star   -  Saka Neela Tara
Operation Blue Star - Saka Neela Tara
Balvir Singh
 
A Strategic Approach: GenAI in Education
A Strategic Approach: GenAI in EducationA Strategic Approach: GenAI in Education
A Strategic Approach: GenAI in Education
Peter Windle
 
special B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdfspecial B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdf
Special education needs
 
Home assignment II on Spectroscopy 2024 Answers.pdf
Home assignment II on Spectroscopy 2024 Answers.pdfHome assignment II on Spectroscopy 2024 Answers.pdf
Home assignment II on Spectroscopy 2024 Answers.pdf
Tamralipta Mahavidyalaya
 
Adversarial Attention Modeling for Multi-dimensional Emotion Regression.pdf
Adversarial Attention Modeling for Multi-dimensional Emotion Regression.pdfAdversarial Attention Modeling for Multi-dimensional Emotion Regression.pdf
Adversarial Attention Modeling for Multi-dimensional Emotion Regression.pdf
Po-Chuan Chen
 
The basics of sentences session 5pptx.pptx
The basics of sentences session 5pptx.pptxThe basics of sentences session 5pptx.pptx
The basics of sentences session 5pptx.pptx
heathfieldcps1
 
The geography of Taylor Swift - some ideas
The geography of Taylor Swift - some ideasThe geography of Taylor Swift - some ideas
The geography of Taylor Swift - some ideas
GeoBlogs
 
Language Across the Curriculm LAC B.Ed.
Language Across the  Curriculm LAC B.Ed.Language Across the  Curriculm LAC B.Ed.
Language Across the Curriculm LAC B.Ed.
Atul Kumar Singh
 
678020731-Sumas-y-Restas-Para-Colorear.pdf
678020731-Sumas-y-Restas-Para-Colorear.pdf678020731-Sumas-y-Restas-Para-Colorear.pdf
678020731-Sumas-y-Restas-Para-Colorear.pdf
CarlosHernanMontoyab2
 
Synthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptxSynthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptx
Pavel ( NSTU)
 
"Protectable subject matters, Protection in biotechnology, Protection of othe...
"Protectable subject matters, Protection in biotechnology, Protection of othe..."Protectable subject matters, Protection in biotechnology, Protection of othe...
"Protectable subject matters, Protection in biotechnology, Protection of othe...
SACHIN R KONDAGURI
 
Palestine last event orientationfvgnh .pptx
Palestine last event orientationfvgnh .pptxPalestine last event orientationfvgnh .pptx
Palestine last event orientationfvgnh .pptx
RaedMohamed3
 
Overview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with MechanismOverview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with Mechanism
DeeptiGupta154
 
Chapter 3 - Islamic Banking Products and Services.pptx
Chapter 3 - Islamic Banking Products and Services.pptxChapter 3 - Islamic Banking Products and Services.pptx
Chapter 3 - Islamic Banking Products and Services.pptx
Mohd Adib Abd Muin, Senior Lecturer at Universiti Utara Malaysia
 
CACJapan - GROUP Presentation 1- Wk 4.pdf
CACJapan - GROUP Presentation 1- Wk 4.pdfCACJapan - GROUP Presentation 1- Wk 4.pdf
CACJapan - GROUP Presentation 1- Wk 4.pdf
camakaiclarkmusic
 
Unit 8 - Information and Communication Technology (Paper I).pdf
Unit 8 - Information and Communication Technology (Paper I).pdfUnit 8 - Information and Communication Technology (Paper I).pdf
Unit 8 - Information and Communication Technology (Paper I).pdf
Thiyagu K
 
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCECLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
BhavyaRajput3
 
Honest Reviews of Tim Han LMA Course Program.pptx
Honest Reviews of Tim Han LMA Course Program.pptxHonest Reviews of Tim Han LMA Course Program.pptx
Honest Reviews of Tim Han LMA Course Program.pptx
timhan337
 

Recently uploaded (20)

Additional Benefits for Employee Website.pdf
Additional Benefits for Employee Website.pdfAdditional Benefits for Employee Website.pdf
Additional Benefits for Employee Website.pdf
 
Guidance_and_Counselling.pdf B.Ed. 4th Semester
Guidance_and_Counselling.pdf B.Ed. 4th SemesterGuidance_and_Counselling.pdf B.Ed. 4th Semester
Guidance_and_Counselling.pdf B.Ed. 4th Semester
 
Operation Blue Star - Saka Neela Tara
Operation Blue Star   -  Saka Neela TaraOperation Blue Star   -  Saka Neela Tara
Operation Blue Star - Saka Neela Tara
 
A Strategic Approach: GenAI in Education
A Strategic Approach: GenAI in EducationA Strategic Approach: GenAI in Education
A Strategic Approach: GenAI in Education
 
special B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdfspecial B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdf
 
Home assignment II on Spectroscopy 2024 Answers.pdf
Home assignment II on Spectroscopy 2024 Answers.pdfHome assignment II on Spectroscopy 2024 Answers.pdf
Home assignment II on Spectroscopy 2024 Answers.pdf
 
Adversarial Attention Modeling for Multi-dimensional Emotion Regression.pdf
Adversarial Attention Modeling for Multi-dimensional Emotion Regression.pdfAdversarial Attention Modeling for Multi-dimensional Emotion Regression.pdf
Adversarial Attention Modeling for Multi-dimensional Emotion Regression.pdf
 
The basics of sentences session 5pptx.pptx
The basics of sentences session 5pptx.pptxThe basics of sentences session 5pptx.pptx
The basics of sentences session 5pptx.pptx
 
The geography of Taylor Swift - some ideas
The geography of Taylor Swift - some ideasThe geography of Taylor Swift - some ideas
The geography of Taylor Swift - some ideas
 
Language Across the Curriculm LAC B.Ed.
Language Across the  Curriculm LAC B.Ed.Language Across the  Curriculm LAC B.Ed.
Language Across the Curriculm LAC B.Ed.
 
678020731-Sumas-y-Restas-Para-Colorear.pdf
678020731-Sumas-y-Restas-Para-Colorear.pdf678020731-Sumas-y-Restas-Para-Colorear.pdf
678020731-Sumas-y-Restas-Para-Colorear.pdf
 
Synthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptxSynthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptx
 
"Protectable subject matters, Protection in biotechnology, Protection of othe...
"Protectable subject matters, Protection in biotechnology, Protection of othe..."Protectable subject matters, Protection in biotechnology, Protection of othe...
"Protectable subject matters, Protection in biotechnology, Protection of othe...
 
Palestine last event orientationfvgnh .pptx
Palestine last event orientationfvgnh .pptxPalestine last event orientationfvgnh .pptx
Palestine last event orientationfvgnh .pptx
 
Overview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with MechanismOverview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with Mechanism
 
Chapter 3 - Islamic Banking Products and Services.pptx
Chapter 3 - Islamic Banking Products and Services.pptxChapter 3 - Islamic Banking Products and Services.pptx
Chapter 3 - Islamic Banking Products and Services.pptx
 
CACJapan - GROUP Presentation 1- Wk 4.pdf
CACJapan - GROUP Presentation 1- Wk 4.pdfCACJapan - GROUP Presentation 1- Wk 4.pdf
CACJapan - GROUP Presentation 1- Wk 4.pdf
 
Unit 8 - Information and Communication Technology (Paper I).pdf
Unit 8 - Information and Communication Technology (Paper I).pdfUnit 8 - Information and Communication Technology (Paper I).pdf
Unit 8 - Information and Communication Technology (Paper I).pdf
 
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCECLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
 
Honest Reviews of Tim Han LMA Course Program.pptx
Honest Reviews of Tim Han LMA Course Program.pptxHonest Reviews of Tim Han LMA Course Program.pptx
Honest Reviews of Tim Han LMA Course Program.pptx
 

Data mining and warehousing qb

  • 1. QUESTION BANK SUBJECT : DATA WAREHOUSING AND MINING UNIT 1: 2MARKS 1. What are the key features of data warehouse? 2. Define data warehouse. 3. What are operational databases? 4. Define OLTP? 5. Define OLAP? 6. How a database design is represented in OLTP systems? 7. How a database design is represented in OLAP systems? 8. Differentiate the features of operational database and data warehouse. 9. Write short notes on multidimensional data model? 10.Define data cube? 11.What are facts? 12.What are dimensions? 13.Define dimension table? 14.Define fact table? 15.What are lattice of cuboids? 16.What is apex cuboid? 17.List out the components of star schema? 18.What is snowflake schema? 19.List out the components of fact constellation schema? 20.Point out the major difference between the star schema and the snowflake schema?
  • 2. 21.Which is popular in the data warehouse design, star schema model (or) snowflake schema model? 22.Define concept hierarchy? 23.Define total order? 24.Define partial order? 25.Define schema hierarchy? 26.List out the OLAP operations in multidimensional data model? 27.What is roll-up operation? 28.What is drill-down operation? 29.What is slice operation? 30.What is dice operation? 31.What is pivot operation? 32.List out the views in the design of a data warehouse? 33.What are the methods for developing large software systems? 34.How the operation is performed in waterfall method? 35.How the operation is performed in spiral method? 36..List out the steps of the data warehouse design process? 37.Define ROLAP 38.Define MOLAP? 39..Define HOLAP? 40.What is enterprise warehouse? 41.What is data mart? 42.What are dependent and independent data marts? 43.What is virtual warehouse? 44.Define indexing?
  • 3. 45.What are the types of indexing? 46.Differentiate partial and full materialization. 16 MARKS: 1. Discuss the components of data warehouse. 2. List out the differences between OLTP and OLAP. 3. Discuss the various schematic representations in multidimensional data model. 4. Explain the OLAP operations in the multidimensional data model. 5. Explain the design and construction of a data warehouse. 6. Explain the three-tier data warehouse architecture. 7. Explain the process of data warehouse implementation. UNIT 2: 2MARKS 1. What is data pre-processing? 2. Define data cleaning. 3. List out the methods to fill the missing values. 4. What is data smoothing? 5. Define data integration. 6. Define data transformation. 7. List out the methods used for data normalization. 8. Differentiate min-max and z-score normalization. 9. What you mean by correlation analysis? 10.What are the methods used for data reduction?
  • 4. 11. Define DWT. 12. What is principle component analysis? 13. Differentiate linear and multiple regression. 14. What is histogram? 15. Define sampling. 16. Define clustering. 17. What is visualization? 18. Define DMOL. 19. How to specify task relevant data using DMQL. 20. List out the coupling schema used by data mining system. 21. Differentiate semi tight and tight coupling. 22.What is Concept description? 23.Differentiate descriptive and predictive data mining. 24.What is AOI? 25.When to perform attribute removal? 26.What is attribute generalization? 27.What are the different types of class comparison? 28.What is attribute relevance analysis? 29.Differentiate quartiles and outliers. 30. Define box plot. 16 MARKS: 1. Explain about various data cleaning and integration process.
  • 5. 2. Explain about data mining primitive tasks. 3. Explain dimensionality reduction and numerosity reduction techniques. 4. Explain about discretization and concept hierarchy generation. 5. Discuss about various data transformation techniques. 6. Explain about DMQL. 7. Write short notes on data mining system architecture? 8. Write short notes on concept description? 9. Explain about the statistical measures used in large databases. 10.Explain attribute oriented induction and its implementation. 11.Write short notes on attribute relevance analysis? UNIT 3: 2MARKS 1. What is association rule mining? 2. Define support. 3. Define Confidence. 4. How is association rules mined from large databases? 5. What is the classification of association rules based on various criteria? 6. What is Apriori algorithm? 7. List the techniques for improving the efficiency of apriori algorithm. 8. What is portioning? 9. Define transaction reduction. 10.How sampling helps to improve efficiency of apriori algorithm. 11.List the drawbacks of apriori algorithm. 12.What is FP-Tree? 13.What is iceberg query and how it is used to improve market basket analysis? 14.List the different approaches of multilevel association rules. 15.What is controlled level cross filtering? 16.Define level passage threshold. 16 MARKS:
  • 6. 1) Explain about mining association rules in large databases. 2) Explain apriori algorithm with example. 3) Explain FP-growth algorithm with example. 4) Discuss about mining multi-level association rules and also about different approaches used for it. 5) A DB has 4 transactions.let min_sup=60%, min_conf=80% TID DATE ITEMS_BOUGHT T100 T200 T300 T400 10/15/99 10/15/99 10/19/99 10/22/99 {K,A,D,B} {D,A,C,E,B} {C,A,B,E} {B,A,D} i) Find all frequent itemsets using apriori algorithm. ii) Find all frequent itemsets using FP-growth algorithm. 6) A database has five transactions. Let min_sup=60% and min_conf=75%. TID Items brought T100 T200 T300 T400 T500 {B,C,E,J} {B,C,J} {B,M,Y} {B,J,M} {C,J,M} i) Find all frequent itemsets using apriori algorithm and FP-growth.
  • 7. ii) List all of the strong association rules. UNIT 4: 2MARKS 1. Define classification and prediction. 2. Describe the two common approaches to tree pruning. 3. What are Bayesian Classifiers? 4. What is a “decision tree”? 5. Where is decision trees mainly used? 6. How will you solve a classification problem using decision trees? 7. What is ID3? 8. What is decision tree pruning? 9. List some of attribute selection measures used in decision tree induction. 10.What is Baye’s theorem? 11.What is Bayesian belief network? 12.What is k-Nearest neighbor classifier? 13.Where case-based reasoning is used? 14.Differentiate eager learner and lazy learner. 15.Define least square? 16.List some available prediction methods available. 17.What is classifier accuracy?
  • 8. 18.What is the purpose of using confusion matrix? 19.Define cluster analysis. 20.List out the types of data used in cluster analysis. 21.Differentiate data and dissimilarity matrix. 22.Define Manhattan distance. 23.List the categorization of clustering methods. 24.What is a density-based and Grid-based method? 25.Differentiate agglomerative and divisive approach. 26.What are the disadvantages of K-means algorithm? 27.List the initial inputs given to K-medoids algorithm. 28.Define outliers. 29.Differentiate statistical-based and distance based outlier detection. 30.What is the purpose of using smoothing factor? 16 MARKS: 1. Explain decision tree induction algorithm. 2. Explain about various attribute selection methods used in decision tree induction. 3. Explain about various methods used for prediction. 4. Explain cluster accuracy. 5. Explain about Bayesian classification. 6. Write short notes on outlier analysis? 7. Write short notes on cluster partitioning methods?
  • 9. 8. Explain K-Means algorithm. 9. Explain K-Mediods algorithm. 10.What is cluster analysis? Explain about types of data used in cluster analysis. 11.Explain various methods used for outlier detection. 12.Classify the given training samples using ID3 algorithm. Apply the same to construct a decision tree for the data given below. (Note: Use information gain as attribute selection measure.) SIZE COLOR SHAPE CLASS Small Big Big Small Small Big Big Big Small Yellow Yellow Red Red Black Black Yellow Black Yellow Round Round Round Round Round Cube Cube Round Cube A A A A B B B B B 13.The following table shows the mid-term and final exam grades obtained for students in a database course. X MIDTERM EXAM Y FINAL EXAM 72 50 81 84 63 71
  • 10. 74 94 86 59 83 65 33 88 81 78 90 75 49 79 77 52 74 90 Predict the final exam grade of a student who received 86 on the midterm exam. UNIT 5: 2MARKS 1. Define Spatial Databases 2. Define Transactional Databases. 3. What is Temporal Database? 4. Mention the types of dimensions in spatial data cube. 5. Name some of the data mining applications? 6. What are the contributions of data mining to DNA analysis? 7. Name some examples of data mining in retail industry? 8. Explain multimedia data mining 9. What does web mining mean 10.Define text mining 11.How to assess the quality of text retrieval? 12.List out the methods used for information retrieval. 13.What is web usage mining? 14.Differentiate time-series and sequence database. 15.List the kinds of association can be mined from multimedia data. 16 MARKS:
  • 11. 1. Describe the applications and trends in data mining in detail. 2. Explain how data mining is used in banking industry. 3. Explain how data mining is used in health care analysis. 4. Explain data mining applications for Telecommunication industry. 5. Explain data mining applications for retail industry. 6. Explain data mining applications fro financial data analysis. 7. Explain data mining applications for Biomedical and DNA data analysis. 8. Explain in detail about spatial data mining. 9. Explain in detail about text mining. 10.Explain in detail about mining multimedia databases. 11.Explain in detail about mining WWW. 12.Explain in detail about mining time-series and sequence data.