SlideShare a Scribd company logo
1 of 7
Download to read offline
Data Mining 2013-14
Unit 1
1 What is data mining and data warehouse?
2 Compare data base processing Vs. data mining processing.
3 Explain applications of data mining in detail.
4 Explain all data mining models and tasks.
5 What is KDD? Explain with diagram.
6 Write a short note on visualization.
7 Discuss the issues in Data mining.
8 What is fuzzy logic? Explain in brief with example.
9 Define following terms:
1. Information retrieval
2. Precision
3. Recall
4. Similarity
5. Granularities
6. Facts
7. Roll ups
8. Drill down
10 Define cube and explain with example.
11 Write a short note on star schema.
12 Explain characteristics of data warehouse.
13 Discuss the ways to improve the performance of data warehouse
applications.
14 Write a short note on OLAP operations.
15 Write a short note on point estimation.
16 Compute mean, variance and standard deviation for (1,3,4,6,5).
17 Define
1. Mean
2. Median
3. Mode
4. Variance
5. Standard deviation (Sample/Population)
6. Bias
7. MSE
8. RMS18 Compute mean, median and mode for (15, 10, 18, 20, 28, 32).
19 What is Jackknife estimate technique?
20 Find out Jackknife estimate for variance X={1, 5, 6}
Mean X= {5, 6, 6}.
21 Estimate P that maximizes the likelihood that the given sequence of
heads and tails would occur for {H, H, H, T, T} Note: Assume coin
with H and T equally likely.
Data Mining 2013-14
22 Estimate the missing data and continues until convergence using
Expectation Maximization {1, 5, 10, 4,*, *}. (Guess µ0=3)
23 Prove that X11 belongs to class h2 using Bayes theorem.
ID Income Credit Class xi
1 4 Excellent h1 x4
2 3 Good h1 x7
3 2 Excellent h1 x2
4 3 Good h1 x7
5 4 Good h1 x8
6 2 Excellent h1 x2
7 3 Bad h2 x11
8 2 Bad h2 x10
9 3 Bad h3 x11
10 1 Bad h4 x924 Write a short note on Hypothesis testing.
25 Find Chi square statistics for
Observed value = {51, 95, 67, 78, 88}
Expected value=76
26 Write a short note on linear regression.
27 Write a short note on non-linear regression.
28 Explain correlation in detail.
29 Find correlation betw een Ice cream sales Vs temperature
Temperature
0C
Ice Cream Sales (in
rupees)14.2 215
16.4 325
11.9 185
15.2 332
18.5 406
22.1 522
30 Write a short on similarity measures.
Unit 2
31 Explain the need of data pre-processing.
32 List and explain major tasks in data processing.
33 Explain terms Quartile and Inter-Quartile range.
34 What are Box plot and Quantile plot?
35 What is histogram and scatter plot?
36 Write a short note on data cleaning tasks.
37 Explain Binning with example.
38 Explain Data aggregation, generalization and smoothing.
39 Write a short note on data transformation.
40 Write a short note on data normalization.
41 Define
1. Association rule
Data Mining 2013-14
2. Support
3. Confidence
42 Explain apriori algorithm with example.
42a Write a short note on Association rule mining.
Unit 3
43 What is classification? Discuss the issues.
44 What is prediction? Discuss the issues.
45 Write a short note on decision tree.
46 Write a short note on Bayesian classifier.
47 Write a short note on Rule based classifier.
48 Write a short note on Neural netw ork classifier.
49 Write a short note on Support Vector Machine.
50 Define coverage and accuracy in rule based classifier.
51 Explain triggering and firing of rules.
52 Explain rule based and class based ordering.
53 Discuss “The accuracy on its own is not a reliable estimate
of rule
quality ”54 Consider a training set that contains 100 positive examples
and 400
negative examples for each of the follow ing
candidate rule. R1 : A + (covers 4 positive and
one negative examples) R2 : B + (covers 30
positive and 10 negative examples) R3 : C +
(covers 100 positive and 90 negative examples)
Determine which is the best and worst candidate rule
according to
Rule accuracy?
55 Consider a training set that contains 100 positive examples
and 400
negative examples for each of the follow ing
candidate rule. R1 : A + (covers 4 positive and
one negative examples) R2 : B + (covers 30
positive and 10 negative examples) R3 : C +
(covers 100 positive and 90 negative examples)
Determine which is the best and worst candidate rule
according to
FOIL’s information gain?
56 Consider a training set that contains 100 positive examples
and 400
negative examples for each of the follow ing
candidate rule. R1 : A + (covers 4 positive and
one negative examples) R2 : B + (covers 30
positive and 10 negative examples) R3 : C +
(covers 100 positive and 90 negative examples)
Determine which is the best and worst candidate rule
according to the
likelihood ratio statistic?
57 Write the difference betw een classification and clustering.
58 Explain supervised and unsupervised learning.
59 Explain term pruning and overfitting.
60 Find information gain for “income” in follow ing data
w)
Data Mining 2013-14
61 Write a short note on Gini index.
62 Classify the follow ing tuple using Naïve Bayesian classifier.
X=(age=youth, income=low , student=yes, credit_rating=fair) using
follow ing training data.
Data Mining 2013-14
63 Find out the population for the year 2013 using linear regression.
2005 2006 2007 2008 2009 2013
12 19 28 35 45 ?
64 Write a short note on confusion matrix.
65 Classify X1=4, X2=7 using K-nearest neighbour (assume k=3).
X Y Class
7 7 B
7 4 B
3 4 G
1 4 G
Unit 4
66 List all the requirements of clustering Data mining.
67 Write a short note on type of data in clustering analysis.
68 Compute Euclidean and Manhattan distance for X1 (1, 2) and X2 (3,6).
68 a Compute Euclidean and Manhattan distance for X1 (1, 2) and X2 (4,6).
69 Compute
1. Similarity between A and B
2. Similarity between C and B
3. Similarity between A and C and comment on the most similar
tuples.
Name Gender F C T1 T2 T3
A F Y N P P N
B M Y N Y N P
C F Y P P N N
Data Mining 2013-14
70 Write a short note on K-means clustering.
71 Write a short note on K-medoids clustering.
72 Write a short note on partitioning approach.
73 Write a short note on Hierarchical approach.
74 Write a short note on DBSCAN.
75 List and discuss major clustering approaches.
76 Write a short note on ROCK.
77 Explain agglomeration and divisive approach.
78 Apply hierarchical clustering using single linkage to follow ing data.
A (1,1), B(1.5,1.5), C(3,4), D(4,4), E(3,3.5)
79 What are outliers? How to find out? Write the applications.
Unit5
80 What is graph mining and social network?
81 What are multimedia and spatial databases?
82 Explain set and listed valued attribute with example.
83 Explain set and complex structure valued attribute.
84 What is spatial aggregation and approximation? Explain with
example.
85 Define plan, plan database and plan mining.
86 Explain the types of dimensions in spatial data cube.
87 Explain measures in spatial data cube.
88 Discuss approaches for similarity based retrieval in image database.
89 Write a short note on mining association in multimedia data.
90 Write a short note on text mining.
91 Define
1. Term frequency
2. Term frequency matrix
3. Relative term frequency
4. Inverse document frequency
92 Compute TF, IDF and TF-IDF for t2 in d2 for follow ing data.
Data Mining 2013-14
92 (a) Compute TF, IDF and TF-IDF for t3 in d4 for follow ing data.
93 Discuss “ Web poses grate challenges for effective resources and
knowledge discovery”

More Related Content

What's hot

11.polynomial regression model of making cost prediction in mixed cost analysis
11.polynomial regression model of making cost prediction in mixed cost analysis11.polynomial regression model of making cost prediction in mixed cost analysis
11.polynomial regression model of making cost prediction in mixed cost analysis
Alexander Decker
 
Deep Learning Opening Workshop - ProxSARAH Algorithms for Stochastic Composit...
Deep Learning Opening Workshop - ProxSARAH Algorithms for Stochastic Composit...Deep Learning Opening Workshop - ProxSARAH Algorithms for Stochastic Composit...
Deep Learning Opening Workshop - ProxSARAH Algorithms for Stochastic Composit...
The Statistical and Applied Mathematical Sciences Institute
 

What's hot (20)

Bit Vectors Siddhesh
Bit Vectors SiddheshBit Vectors Siddhesh
Bit Vectors Siddhesh
 
11.polynomial regression model of making cost prediction in mixed cost analysis
11.polynomial regression model of making cost prediction in mixed cost analysis11.polynomial regression model of making cost prediction in mixed cost analysis
11.polynomial regression model of making cost prediction in mixed cost analysis
 
Polynomial regression model of making cost prediction in mixed cost analysis
Polynomial regression model of making cost prediction in mixed cost analysisPolynomial regression model of making cost prediction in mixed cost analysis
Polynomial regression model of making cost prediction in mixed cost analysis
 
Dr azimifar pattern recognition lect4
Dr azimifar pattern recognition lect4Dr azimifar pattern recognition lect4
Dr azimifar pattern recognition lect4
 
Embeddings the geometry of relational algebra
Embeddings  the geometry of relational algebraEmbeddings  the geometry of relational algebra
Embeddings the geometry of relational algebra
 
Engineering Data Science Objectives for Social Network Analysis
Engineering Data Science Objectives for Social Network AnalysisEngineering Data Science Objectives for Social Network Analysis
Engineering Data Science Objectives for Social Network Analysis
 
Localized methods for diffusions in large graphs
Localized methods for diffusions in large graphsLocalized methods for diffusions in large graphs
Localized methods for diffusions in large graphs
 
Deep Learning Opening Workshop - Horseshoe Regularization for Machine Learnin...
Deep Learning Opening Workshop - Horseshoe Regularization for Machine Learnin...Deep Learning Opening Workshop - Horseshoe Regularization for Machine Learnin...
Deep Learning Opening Workshop - Horseshoe Regularization for Machine Learnin...
 
Correlation clustering and community detection in graphs and networks
Correlation clustering and community detection in graphs and networksCorrelation clustering and community detection in graphs and networks
Correlation clustering and community detection in graphs and networks
 
Deep Learning Opening Workshop - ProxSARAH Algorithms for Stochastic Composit...
Deep Learning Opening Workshop - ProxSARAH Algorithms for Stochastic Composit...Deep Learning Opening Workshop - ProxSARAH Algorithms for Stochastic Composit...
Deep Learning Opening Workshop - ProxSARAH Algorithms for Stochastic Composit...
 
Vi sem
Vi semVi sem
Vi sem
 
Parallel Guided Local Search and Some Preliminary Experimental Results for Co...
Parallel Guided Local Search and Some Preliminary Experimental Results for Co...Parallel Guided Local Search and Some Preliminary Experimental Results for Co...
Parallel Guided Local Search and Some Preliminary Experimental Results for Co...
 
Anti-differentiating Approximation Algorithms: PageRank and MinCut
Anti-differentiating Approximation Algorithms: PageRank and MinCutAnti-differentiating Approximation Algorithms: PageRank and MinCut
Anti-differentiating Approximation Algorithms: PageRank and MinCut
 
Localized methods in graph mining
Localized methods in graph miningLocalized methods in graph mining
Localized methods in graph mining
 
Fuzzy Inventory Model for Constantly Deteriorating Items with Power Demand an...
Fuzzy Inventory Model for Constantly Deteriorating Items with Power Demand an...Fuzzy Inventory Model for Constantly Deteriorating Items with Power Demand an...
Fuzzy Inventory Model for Constantly Deteriorating Items with Power Demand an...
 
Sequence-to-Sequence Modeling for Time Series
Sequence-to-Sequence Modeling for Time SeriesSequence-to-Sequence Modeling for Time Series
Sequence-to-Sequence Modeling for Time Series
 
Stock price prediction using k* nearest neighbors and indexing dynamic time w...
Stock price prediction using k* nearest neighbors and indexing dynamic time w...Stock price prediction using k* nearest neighbors and indexing dynamic time w...
Stock price prediction using k* nearest neighbors and indexing dynamic time w...
 
Application cases of MMG library in the Kratos Multiphysics (AKA Kratos) fram...
Application cases of MMG library in the Kratos Multiphysics (AKA Kratos) fram...Application cases of MMG library in the Kratos Multiphysics (AKA Kratos) fram...
Application cases of MMG library in the Kratos Multiphysics (AKA Kratos) fram...
 
How principal components analysis is different from factor
How principal components analysis is different from factorHow principal components analysis is different from factor
How principal components analysis is different from factor
 
Sequence-to-Sequence Modeling for Time Series
Sequence-to-Sequence Modeling for Time SeriesSequence-to-Sequence Modeling for Time Series
Sequence-to-Sequence Modeling for Time Series
 

Similar to Dmqb

eBook PDF textbook - Essentials of Econometrics, 5e Damodar Gujarati.pdf
eBook PDF textbook - Essentials of Econometrics, 5e Damodar Gujarati.pdfeBook PDF textbook - Essentials of Econometrics, 5e Damodar Gujarati.pdf
eBook PDF textbook - Essentials of Econometrics, 5e Damodar Gujarati.pdf
EdwinPolack1
 
Predictive Modelling
Predictive ModellingPredictive Modelling
Predictive Modelling
Rajiv Advani
 
InstructionsFor this assignment, collect data exhibiting a relat.docx
InstructionsFor this assignment, collect data exhibiting a relat.docxInstructionsFor this assignment, collect data exhibiting a relat.docx
InstructionsFor this assignment, collect data exhibiting a relat.docx
dirkrplav
 
A Validation of Object-Oriented Design Metrics as Quality Indicators
A Validation of Object-Oriented Design Metrics as Quality IndicatorsA Validation of Object-Oriented Design Metrics as Quality Indicators
A Validation of Object-Oriented Design Metrics as Quality Indicators
vie_dels
 
Convenience shoppingSTAT-S301Fall 2019Question Set 1.docx
Convenience shoppingSTAT-S301Fall 2019Question Set 1.docxConvenience shoppingSTAT-S301Fall 2019Question Set 1.docx
Convenience shoppingSTAT-S301Fall 2019Question Set 1.docx
bobbywlane695641
 
Data structure-questions
Data structure-questionsData structure-questions
Data structure-questions
Shekhar Chander
 
Thesis (presentation)
Thesis (presentation)Thesis (presentation)
Thesis (presentation)
nlt2390
 
print mod 2.pdf
print mod 2.pdfprint mod 2.pdf
print mod 2.pdf
lathass5
 

Similar to Dmqb (20)

Practive test 1
Practive test 1Practive test 1
Practive test 1
 
eBook PDF textbook - Essentials of Econometrics, 5e Damodar Gujarati.pdf
eBook PDF textbook - Essentials of Econometrics, 5e Damodar Gujarati.pdfeBook PDF textbook - Essentials of Econometrics, 5e Damodar Gujarati.pdf
eBook PDF textbook - Essentials of Econometrics, 5e Damodar Gujarati.pdf
 
Data structure-question-bank
Data structure-question-bankData structure-question-bank
Data structure-question-bank
 
Data mining and warehousing qb
Data mining and warehousing   qbData mining and warehousing   qb
Data mining and warehousing qb
 
Smu bsc it Spring 2014 solved assignments
Smu bsc it Spring 2014  solved assignmentsSmu bsc it Spring 2014  solved assignments
Smu bsc it Spring 2014 solved assignments
 
Cluster Analysis for Dummies
Cluster Analysis for DummiesCluster Analysis for Dummies
Cluster Analysis for Dummies
 
Predictive Modelling
Predictive ModellingPredictive Modelling
Predictive Modelling
 
Blinkdb
BlinkdbBlinkdb
Blinkdb
 
InstructionsFor this assignment, collect data exhibiting a relat.docx
InstructionsFor this assignment, collect data exhibiting a relat.docxInstructionsFor this assignment, collect data exhibiting a relat.docx
InstructionsFor this assignment, collect data exhibiting a relat.docx
 
ACQSurvey (Poster)
ACQSurvey (Poster)ACQSurvey (Poster)
ACQSurvey (Poster)
 
A Validation of Object-Oriented Design Metrics as Quality Indicators
A Validation of Object-Oriented Design Metrics as Quality IndicatorsA Validation of Object-Oriented Design Metrics as Quality Indicators
A Validation of Object-Oriented Design Metrics as Quality Indicators
 
Convenience shoppingSTAT-S301Fall 2019Question Set 1.docx
Convenience shoppingSTAT-S301Fall 2019Question Set 1.docxConvenience shoppingSTAT-S301Fall 2019Question Set 1.docx
Convenience shoppingSTAT-S301Fall 2019Question Set 1.docx
 
Data Mining.ppt
Data Mining.pptData Mining.ppt
Data Mining.ppt
 
Feature Engineering
Feature Engineering Feature Engineering
Feature Engineering
 
Data structure-questions
Data structure-questionsData structure-questions
Data structure-questions
 
MCA-ASS-Semester V.pdf
MCA-ASS-Semester V.pdfMCA-ASS-Semester V.pdf
MCA-ASS-Semester V.pdf
 
R_Proficiency.pptx
R_Proficiency.pptxR_Proficiency.pptx
R_Proficiency.pptx
 
Thesis (presentation)
Thesis (presentation)Thesis (presentation)
Thesis (presentation)
 
Machine learning for_finance
Machine learning for_financeMachine learning for_finance
Machine learning for_finance
 
print mod 2.pdf
print mod 2.pdfprint mod 2.pdf
print mod 2.pdf
 

Recently uploaded

AWS Community DAY Albertini-Ellan Cloud Security (1).pptx
AWS Community DAY Albertini-Ellan Cloud Security (1).pptxAWS Community DAY Albertini-Ellan Cloud Security (1).pptx
AWS Community DAY Albertini-Ellan Cloud Security (1).pptx
ellan12
 
Call Girls In Model Towh Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Model Towh Delhi 💯Call Us 🔝8264348440🔝Call Girls In Model Towh Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Model Towh Delhi 💯Call Us 🔝8264348440🔝
soniya singh
 
Call Girls In Pratap Nagar Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Pratap Nagar Delhi 💯Call Us 🔝8264348440🔝Call Girls In Pratap Nagar Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Pratap Nagar Delhi 💯Call Us 🔝8264348440🔝
soniya singh
 
Lucknow ❤CALL GIRL 88759*99948 ❤CALL GIRLS IN Lucknow ESCORT SERVICE❤CALL GIRL
Lucknow ❤CALL GIRL 88759*99948 ❤CALL GIRLS IN Lucknow ESCORT SERVICE❤CALL GIRLLucknow ❤CALL GIRL 88759*99948 ❤CALL GIRLS IN Lucknow ESCORT SERVICE❤CALL GIRL
Lucknow ❤CALL GIRL 88759*99948 ❤CALL GIRLS IN Lucknow ESCORT SERVICE❤CALL GIRL
imonikaupta
 
Hot Service (+9316020077 ) Goa Call Girls Real Photos and Genuine Service
Hot Service (+9316020077 ) Goa  Call Girls Real Photos and Genuine ServiceHot Service (+9316020077 ) Goa  Call Girls Real Photos and Genuine Service
Hot Service (+9316020077 ) Goa Call Girls Real Photos and Genuine Service
sexy call girls service in goa
 
Call Girls In Saket Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Saket Delhi 💯Call Us 🔝8264348440🔝Call Girls In Saket Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Saket Delhi 💯Call Us 🔝8264348440🔝
soniya singh
 
Call Girls Service Chandigarh Lucky ❤️ 7710465962 Independent Call Girls In C...
Call Girls Service Chandigarh Lucky ❤️ 7710465962 Independent Call Girls In C...Call Girls Service Chandigarh Lucky ❤️ 7710465962 Independent Call Girls In C...
Call Girls Service Chandigarh Lucky ❤️ 7710465962 Independent Call Girls In C...
Sheetaleventcompany
 
Low Rate Young Call Girls in Sector 63 Mamura Noida ✔️☆9289244007✔️☆ Female E...
Low Rate Young Call Girls in Sector 63 Mamura Noida ✔️☆9289244007✔️☆ Female E...Low Rate Young Call Girls in Sector 63 Mamura Noida ✔️☆9289244007✔️☆ Female E...
Low Rate Young Call Girls in Sector 63 Mamura Noida ✔️☆9289244007✔️☆ Female E...
SofiyaSharma5
 
Dwarka Sector 26 Call Girls | Delhi | 9999965857 🫦 Vanshika Verma More Our Se...
Dwarka Sector 26 Call Girls | Delhi | 9999965857 🫦 Vanshika Verma More Our Se...Dwarka Sector 26 Call Girls | Delhi | 9999965857 🫦 Vanshika Verma More Our Se...
Dwarka Sector 26 Call Girls | Delhi | 9999965857 🫦 Vanshika Verma More Our Se...
Call Girls In Delhi Whatsup 9873940964 Enjoy Unlimited Pleasure
 
Rohini Sector 22 Call Girls Delhi 9999965857 @Sabina Saikh No Advance
Rohini Sector 22 Call Girls Delhi 9999965857 @Sabina Saikh No AdvanceRohini Sector 22 Call Girls Delhi 9999965857 @Sabina Saikh No Advance
Rohini Sector 22 Call Girls Delhi 9999965857 @Sabina Saikh No Advance
Call Girls In Delhi Whatsup 9873940964 Enjoy Unlimited Pleasure
 

Recently uploaded (20)

AWS Community DAY Albertini-Ellan Cloud Security (1).pptx
AWS Community DAY Albertini-Ellan Cloud Security (1).pptxAWS Community DAY Albertini-Ellan Cloud Security (1).pptx
AWS Community DAY Albertini-Ellan Cloud Security (1).pptx
 
Call Girls In Model Towh Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Model Towh Delhi 💯Call Us 🔝8264348440🔝Call Girls In Model Towh Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Model Towh Delhi 💯Call Us 🔝8264348440🔝
 
Call Girls In Pratap Nagar Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Pratap Nagar Delhi 💯Call Us 🔝8264348440🔝Call Girls In Pratap Nagar Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Pratap Nagar Delhi 💯Call Us 🔝8264348440🔝
 
How is AI changing journalism? (v. April 2024)
How is AI changing journalism? (v. April 2024)How is AI changing journalism? (v. April 2024)
How is AI changing journalism? (v. April 2024)
 
Lucknow ❤CALL GIRL 88759*99948 ❤CALL GIRLS IN Lucknow ESCORT SERVICE❤CALL GIRL
Lucknow ❤CALL GIRL 88759*99948 ❤CALL GIRLS IN Lucknow ESCORT SERVICE❤CALL GIRLLucknow ❤CALL GIRL 88759*99948 ❤CALL GIRLS IN Lucknow ESCORT SERVICE❤CALL GIRL
Lucknow ❤CALL GIRL 88759*99948 ❤CALL GIRLS IN Lucknow ESCORT SERVICE❤CALL GIRL
 
DDoS In Oceania and the Pacific, presented by Dave Phelan at NZNOG 2024
DDoS In Oceania and the Pacific, presented by Dave Phelan at NZNOG 2024DDoS In Oceania and the Pacific, presented by Dave Phelan at NZNOG 2024
DDoS In Oceania and the Pacific, presented by Dave Phelan at NZNOG 2024
 
Hot Service (+9316020077 ) Goa Call Girls Real Photos and Genuine Service
Hot Service (+9316020077 ) Goa  Call Girls Real Photos and Genuine ServiceHot Service (+9316020077 ) Goa  Call Girls Real Photos and Genuine Service
Hot Service (+9316020077 ) Goa Call Girls Real Photos and Genuine Service
 
Call Girls In Saket Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Saket Delhi 💯Call Us 🔝8264348440🔝Call Girls In Saket Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Saket Delhi 💯Call Us 🔝8264348440🔝
 
Call Girls Service Chandigarh Lucky ❤️ 7710465962 Independent Call Girls In C...
Call Girls Service Chandigarh Lucky ❤️ 7710465962 Independent Call Girls In C...Call Girls Service Chandigarh Lucky ❤️ 7710465962 Independent Call Girls In C...
Call Girls Service Chandigarh Lucky ❤️ 7710465962 Independent Call Girls In C...
 
INDIVIDUAL ASSIGNMENT #3 CBG, PRESENTATION.
INDIVIDUAL ASSIGNMENT #3 CBG, PRESENTATION.INDIVIDUAL ASSIGNMENT #3 CBG, PRESENTATION.
INDIVIDUAL ASSIGNMENT #3 CBG, PRESENTATION.
 
Call Now ☎ 8264348440 !! Call Girls in Sarai Rohilla Escort Service Delhi N.C.R.
Call Now ☎ 8264348440 !! Call Girls in Sarai Rohilla Escort Service Delhi N.C.R.Call Now ☎ 8264348440 !! Call Girls in Sarai Rohilla Escort Service Delhi N.C.R.
Call Now ☎ 8264348440 !! Call Girls in Sarai Rohilla Escort Service Delhi N.C.R.
 
Top Rated Pune Call Girls Daund ⟟ 6297143586 ⟟ Call Me For Genuine Sex Servi...
Top Rated  Pune Call Girls Daund ⟟ 6297143586 ⟟ Call Me For Genuine Sex Servi...Top Rated  Pune Call Girls Daund ⟟ 6297143586 ⟟ Call Me For Genuine Sex Servi...
Top Rated Pune Call Girls Daund ⟟ 6297143586 ⟟ Call Me For Genuine Sex Servi...
 
Call Now ☎ 8264348440 !! Call Girls in Green Park Escort Service Delhi N.C.R.
Call Now ☎ 8264348440 !! Call Girls in Green Park Escort Service Delhi N.C.R.Call Now ☎ 8264348440 !! Call Girls in Green Park Escort Service Delhi N.C.R.
Call Now ☎ 8264348440 !! Call Girls in Green Park Escort Service Delhi N.C.R.
 
WhatsApp 📞 8448380779 ✅Call Girls In Mamura Sector 66 ( Noida)
WhatsApp 📞 8448380779 ✅Call Girls In Mamura Sector 66 ( Noida)WhatsApp 📞 8448380779 ✅Call Girls In Mamura Sector 66 ( Noida)
WhatsApp 📞 8448380779 ✅Call Girls In Mamura Sector 66 ( Noida)
 
Low Rate Young Call Girls in Sector 63 Mamura Noida ✔️☆9289244007✔️☆ Female E...
Low Rate Young Call Girls in Sector 63 Mamura Noida ✔️☆9289244007✔️☆ Female E...Low Rate Young Call Girls in Sector 63 Mamura Noida ✔️☆9289244007✔️☆ Female E...
Low Rate Young Call Girls in Sector 63 Mamura Noida ✔️☆9289244007✔️☆ Female E...
 
Dwarka Sector 26 Call Girls | Delhi | 9999965857 🫦 Vanshika Verma More Our Se...
Dwarka Sector 26 Call Girls | Delhi | 9999965857 🫦 Vanshika Verma More Our Se...Dwarka Sector 26 Call Girls | Delhi | 9999965857 🫦 Vanshika Verma More Our Se...
Dwarka Sector 26 Call Girls | Delhi | 9999965857 🫦 Vanshika Verma More Our Se...
 
(+971568250507 ))# Young Call Girls in Ajman By Pakistani Call Girls in ...
(+971568250507  ))#  Young Call Girls  in Ajman  By Pakistani Call Girls  in ...(+971568250507  ))#  Young Call Girls  in Ajman  By Pakistani Call Girls  in ...
(+971568250507 ))# Young Call Girls in Ajman By Pakistani Call Girls in ...
 
Nanded City ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready ...
Nanded City ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready ...Nanded City ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready ...
Nanded City ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready ...
 
Rohini Sector 22 Call Girls Delhi 9999965857 @Sabina Saikh No Advance
Rohini Sector 22 Call Girls Delhi 9999965857 @Sabina Saikh No AdvanceRohini Sector 22 Call Girls Delhi 9999965857 @Sabina Saikh No Advance
Rohini Sector 22 Call Girls Delhi 9999965857 @Sabina Saikh No Advance
 
Hot Call Girls |Delhi |Hauz Khas ☎ 9711199171 Book Your One night Stand
Hot Call Girls |Delhi |Hauz Khas ☎ 9711199171 Book Your One night StandHot Call Girls |Delhi |Hauz Khas ☎ 9711199171 Book Your One night Stand
Hot Call Girls |Delhi |Hauz Khas ☎ 9711199171 Book Your One night Stand
 

Dmqb

  • 1. Data Mining 2013-14 Unit 1 1 What is data mining and data warehouse? 2 Compare data base processing Vs. data mining processing. 3 Explain applications of data mining in detail. 4 Explain all data mining models and tasks. 5 What is KDD? Explain with diagram. 6 Write a short note on visualization. 7 Discuss the issues in Data mining. 8 What is fuzzy logic? Explain in brief with example. 9 Define following terms: 1. Information retrieval 2. Precision 3. Recall 4. Similarity 5. Granularities 6. Facts 7. Roll ups 8. Drill down 10 Define cube and explain with example. 11 Write a short note on star schema. 12 Explain characteristics of data warehouse. 13 Discuss the ways to improve the performance of data warehouse applications. 14 Write a short note on OLAP operations. 15 Write a short note on point estimation. 16 Compute mean, variance and standard deviation for (1,3,4,6,5). 17 Define 1. Mean 2. Median 3. Mode 4. Variance 5. Standard deviation (Sample/Population) 6. Bias 7. MSE 8. RMS18 Compute mean, median and mode for (15, 10, 18, 20, 28, 32). 19 What is Jackknife estimate technique? 20 Find out Jackknife estimate for variance X={1, 5, 6} Mean X= {5, 6, 6}. 21 Estimate P that maximizes the likelihood that the given sequence of heads and tails would occur for {H, H, H, T, T} Note: Assume coin with H and T equally likely.
  • 2. Data Mining 2013-14 22 Estimate the missing data and continues until convergence using Expectation Maximization {1, 5, 10, 4,*, *}. (Guess µ0=3) 23 Prove that X11 belongs to class h2 using Bayes theorem. ID Income Credit Class xi 1 4 Excellent h1 x4 2 3 Good h1 x7 3 2 Excellent h1 x2 4 3 Good h1 x7 5 4 Good h1 x8 6 2 Excellent h1 x2 7 3 Bad h2 x11 8 2 Bad h2 x10 9 3 Bad h3 x11 10 1 Bad h4 x924 Write a short note on Hypothesis testing. 25 Find Chi square statistics for Observed value = {51, 95, 67, 78, 88} Expected value=76 26 Write a short note on linear regression. 27 Write a short note on non-linear regression. 28 Explain correlation in detail. 29 Find correlation betw een Ice cream sales Vs temperature Temperature 0C Ice Cream Sales (in rupees)14.2 215 16.4 325 11.9 185 15.2 332 18.5 406 22.1 522 30 Write a short on similarity measures. Unit 2 31 Explain the need of data pre-processing. 32 List and explain major tasks in data processing. 33 Explain terms Quartile and Inter-Quartile range. 34 What are Box plot and Quantile plot? 35 What is histogram and scatter plot? 36 Write a short note on data cleaning tasks. 37 Explain Binning with example. 38 Explain Data aggregation, generalization and smoothing. 39 Write a short note on data transformation. 40 Write a short note on data normalization. 41 Define 1. Association rule
  • 3. Data Mining 2013-14 2. Support 3. Confidence 42 Explain apriori algorithm with example. 42a Write a short note on Association rule mining. Unit 3 43 What is classification? Discuss the issues. 44 What is prediction? Discuss the issues. 45 Write a short note on decision tree. 46 Write a short note on Bayesian classifier. 47 Write a short note on Rule based classifier. 48 Write a short note on Neural netw ork classifier. 49 Write a short note on Support Vector Machine. 50 Define coverage and accuracy in rule based classifier. 51 Explain triggering and firing of rules. 52 Explain rule based and class based ordering. 53 Discuss “The accuracy on its own is not a reliable estimate of rule quality ”54 Consider a training set that contains 100 positive examples and 400 negative examples for each of the follow ing candidate rule. R1 : A + (covers 4 positive and one negative examples) R2 : B + (covers 30 positive and 10 negative examples) R3 : C + (covers 100 positive and 90 negative examples) Determine which is the best and worst candidate rule according to Rule accuracy? 55 Consider a training set that contains 100 positive examples and 400 negative examples for each of the follow ing candidate rule. R1 : A + (covers 4 positive and one negative examples) R2 : B + (covers 30 positive and 10 negative examples) R3 : C + (covers 100 positive and 90 negative examples) Determine which is the best and worst candidate rule according to FOIL’s information gain? 56 Consider a training set that contains 100 positive examples and 400 negative examples for each of the follow ing candidate rule. R1 : A + (covers 4 positive and one negative examples) R2 : B + (covers 30 positive and 10 negative examples) R3 : C + (covers 100 positive and 90 negative examples) Determine which is the best and worst candidate rule according to the likelihood ratio statistic? 57 Write the difference betw een classification and clustering. 58 Explain supervised and unsupervised learning. 59 Explain term pruning and overfitting. 60 Find information gain for “income” in follow ing data w)
  • 4. Data Mining 2013-14 61 Write a short note on Gini index. 62 Classify the follow ing tuple using Naïve Bayesian classifier. X=(age=youth, income=low , student=yes, credit_rating=fair) using follow ing training data.
  • 5. Data Mining 2013-14 63 Find out the population for the year 2013 using linear regression. 2005 2006 2007 2008 2009 2013 12 19 28 35 45 ? 64 Write a short note on confusion matrix. 65 Classify X1=4, X2=7 using K-nearest neighbour (assume k=3). X Y Class 7 7 B 7 4 B 3 4 G 1 4 G Unit 4 66 List all the requirements of clustering Data mining. 67 Write a short note on type of data in clustering analysis. 68 Compute Euclidean and Manhattan distance for X1 (1, 2) and X2 (3,6). 68 a Compute Euclidean and Manhattan distance for X1 (1, 2) and X2 (4,6). 69 Compute 1. Similarity between A and B 2. Similarity between C and B 3. Similarity between A and C and comment on the most similar tuples. Name Gender F C T1 T2 T3 A F Y N P P N B M Y N Y N P C F Y P P N N
  • 6. Data Mining 2013-14 70 Write a short note on K-means clustering. 71 Write a short note on K-medoids clustering. 72 Write a short note on partitioning approach. 73 Write a short note on Hierarchical approach. 74 Write a short note on DBSCAN. 75 List and discuss major clustering approaches. 76 Write a short note on ROCK. 77 Explain agglomeration and divisive approach. 78 Apply hierarchical clustering using single linkage to follow ing data. A (1,1), B(1.5,1.5), C(3,4), D(4,4), E(3,3.5) 79 What are outliers? How to find out? Write the applications. Unit5 80 What is graph mining and social network? 81 What are multimedia and spatial databases? 82 Explain set and listed valued attribute with example. 83 Explain set and complex structure valued attribute. 84 What is spatial aggregation and approximation? Explain with example. 85 Define plan, plan database and plan mining. 86 Explain the types of dimensions in spatial data cube. 87 Explain measures in spatial data cube. 88 Discuss approaches for similarity based retrieval in image database. 89 Write a short note on mining association in multimedia data. 90 Write a short note on text mining. 91 Define 1. Term frequency 2. Term frequency matrix 3. Relative term frequency 4. Inverse document frequency 92 Compute TF, IDF and TF-IDF for t2 in d2 for follow ing data.
  • 7. Data Mining 2013-14 92 (a) Compute TF, IDF and TF-IDF for t3 in d4 for follow ing data. 93 Discuss “ Web poses grate challenges for effective resources and knowledge discovery”