SlideShare a Scribd company logo
I- Extended Databases
Key words:Key words:
Knowledge Discovery inKnowledge Discovery in
Databases (KDD).Databases (KDD).
Data Mining (DM).Data Mining (DM).
Data Warehousing (DW) .Data Warehousing (DW) .
Query Optimization (QO).Query Optimization (QO).
Assistant Professor,
Computer Science Department,
Faculty of Science,
Al-Tahadi University,
P.O. Box 727,
Sirt ,Libya,
Dr. Zakaria Suliman ZubiDr. Zakaria Suliman Zubi
ByBy
3
I- Extended DatabasesI- Extended Databases
 Abstract .
 Introduction of the Indicative Databases .
 I-Extended Databases (IE) motivation.
 I-Extended Databases (IE) and KDD
processes .
 Example .
 Conclusions and Remarks .
 Questions.
4
AbstractAbstract (1)
 How we can handle generalizations in a very large
database using Association Rules (AR), and
inclusion Functional Dependencies (FD)?
 The answer is Inductive database.
 I- Extended database has a similar property to
inductive databases.
 I- Extended database contain exceedingly defined
generalizations about the data .
5
AbstractAbstract (2)
 It can be used in the process of Data Mining.
 It was proposed in ODBC_KDD(2) Model.
 The query will uses normal database terminology.
 The main aim of I-Extended database is to interact
with a spatial Data Mining query called Knowledge
Discovery Query Language (KDQL) described in
[22].
 The KDQL was demonstrated and introduced as a
query in the ODBC_KDD (2) model in [22].
6
Introduction of the Indicative DatabasesIntroduction of the Indicative Databases
 KDD process, contains several steps: understanding the domain,
preparing the data set, discovering patterns (i.e., computing a
theory), post-processing of discovered patterns, and putting the
results into use.
 KDD, we need a query language that not only enables the user
to select subsets of the data, but also to specify DM tasks and
select patterns from the corresponding theories.
 Considering the KDQL rules operator which was described in [
21] as a possible querying language on mining association rules
for i-extended database.
 Query should be an object of a similar type than its arguments.
7
The model was introduced at the Institute of Mathematics andThe model was introduced at the Institute of Mathematics and
Informatics at Debrecen University, Debrecen, Hungary 2002.Informatics at Debrecen University, Debrecen, Hungary 2002.
I-Extended Databases Motivation
Gateway
8
 I-Extended database is a pair R = (R, (PR, e, V))
 Where :
–R is a database schema.
–PRis a collection of patterns.
–V is a set of result values .
– e is the evaluation function that defines pattern semantics.
 This function maps each pair (r, θi) to an element of V, where
r is a database over R and θi P∊ R is a pattern.
 An instance of the schema, i-extended database (r, s) over the
schema R consists of a database r over the schema R and a
subset s ⊆ PR.
I-Extended Databases MotivationI-Extended Databases Motivation
continuecontinue
9
 Example :
If the patterns are Boolean formulae about the database, V is
{true, false},
And the evaluation function e(r, θ) has value true
iff the formula θ is true about r.
In practice, a user might be interested in selecting from the
intentionally defined collection of all Boolean formulas, the
formulas which are true or the formulas which are false.
I-Extended Databases MotivationI-Extended Databases Motivation
continuecontinue
10
I-Extended Databases MotivationI-Extended Databases Motivation
continuecontinue
 I-Extended Database : Is a database that in
addition to data also contain exceedingly defined
generalizations about the data. First we illustrate
the Association Rules, and then we Generalize the
approach and point out key issues for query
evaluation in general.
 I-Extended database is a database that has similar
properties that are in inductive database that
shows how it can be used throughout the whole
process of DM due to the closure property of the
framework.
11
I-Extended Databases MotivationI-Extended Databases Motivation
continuecontinue
 The aim of I-Extended Database is as follow:The aim of I-Extended Database is as follow:
– I-extended database consists of a normal database
associated to a subset of patterns from a class of
patterns, and an evaluation function that tells how the
patterns occur in the data.
– I-extended database can be queried (in principle) just
by using normal relational algebra or SQL, with the
added property of being able to refer to the values of
the evaluation function on the patterns.
– Modeling KDD processes as a sequence of queries on
i-extended database gives rise to chances for
reasoning and optimizing these processes
12
I-Extended Databases (IE) and KDD
processes
 KDD consists of several steps one of these steps is Data Mining.
 In Data Mining process we are concerned with unique class of
patterns for a real life mining processes presented in a dynamic
nature of knowledge acquisition scenario.
 These interesting patterns will be presented in I-Extended
Databases based on there captured frequency, confidence and
support values.
 Knowledge gathered often affects the search process, giving
rise to new goals in addition to the original ones.
13
I-Extended Databases (IE) and KDD processI-Extended Databases (IE) and KDD process
continuecontinue
 KDD processes can be described by sequences of
operations, i.e., queries over relevant i-extended database.
 Sequences of queries are abstract and concise descriptions
of DM processes.
 These descriptions can even be annotated by statistical
information about the size of selected dataset, the size of
intermediate collection of patterns etc..
 Providing knowledge for further use of these relevant
sequences.
14
Example/
Patterns in three instances of I-Extended
Database
 Schema R = {A1,…..,An} of attributes with
domain {0, 1}.
 Relation r over R, an association rule about r is
an expression of the form X⇒B where X ⊆ R
and B ∊R  X.
 The intuitive meaning of the rule is that if a
row of the matrix r has a 1 in each column of
X, then the row tends to have a 1 also in
column B.
 This semantics is captured by frequency and
confidence values. Given W ⊆ R, support (W, r)
denotes the fraction of rows of r that have a 1
in each column of W.
 The frequency of X ⇒ B in r is defined to be
support(X ⋃{B}, r) while its confidence is
support(X ⋃ {B}, r)/ support(X , r). Typically,
we are interested in association rules for which
the frequency and the confidence are greater
15
Conclusions and RemarksConclusions and Remarks
 I-Extended Databases enables the definition of mining process
as a sequences of queries by using a closure property.
 I-Extended Databases is a mandatory step towards to a
general purpose query languages for KDD applications.
 I-Extended Databases supports pattern generation, pattern
filtering and pattern combining operations.
 I-Extended Databases can uses standard database
terminology to carry out any significant patterns without
introducing any additional concepts .
16
Importance ReferencesImportance References
 [20] T. Imielinski and H. Mannila. A database
perspective on knowledge discovery. Communications
of ACM, 39:58-64, 1996.
 [21] Zakaria S. Zubi, Knowledge Discovery in Remote
Access Database, Ch. 9 , PhD dissertation, Debrecen
University, Hungary, 2002.
 [22] Zakaria S. Zubi, Fazekas Gábor, On ODBC_KDD
models, paper,5th International Conference on Applied
Informatics, , 28 January -3 February 2001, Eger,
Hungary,2001.
17
Thank you!!!
18
19

More Related Content

What's hot

Ej36829834
Ej36829834Ej36829834
Ej36829834
IJERA Editor
 
Bt0066 dbms
Bt0066 dbmsBt0066 dbms
Bt0066 dbms
smumbahelp
 
Warehousing
WarehousingWarehousing
Data structures
Data structuresData structures
Data structures
Dr.MAYA NAYAK
 
Cs583 info-retrieval
Cs583 info-retrievalCs583 info-retrieval
Cs583 info-retrieval
Borseshweta
 
Mc0088 data mining
Mc0088  data miningMc0088  data mining
Mc0088 data mining
smumbahelp
 
Exploratory data analysis with Python
Exploratory data analysis with PythonExploratory data analysis with Python
Exploratory data analysis with Python
Davis David
 
Data Mining: Concepts and techniques: Chapter 13 trend
Data Mining: Concepts and techniques: Chapter 13 trendData Mining: Concepts and techniques: Chapter 13 trend
Data Mining: Concepts and techniques: Chapter 13 trend
Salah Amean
 
Scalable keyword search on large rdf data
Scalable keyword search on large rdf dataScalable keyword search on large rdf data
Scalable keyword search on large rdf data
LeMeniz Infotech
 
An improvised frequent pattern tree
An improvised frequent pattern treeAn improvised frequent pattern tree
An improvised frequent pattern tree
IJDKP
 
similarity measure
similarity measure similarity measure
similarity measure
ZHAO Sam
 
Machine learning introduction
Machine learning introductionMachine learning introduction
Machine learning introduction
Anas Jamil
 
Data types ,variables,array
Data types ,variables,arrayData types ,variables,array
Data types ,variables,array
Gujarat Technological University
 
Data Mining: Concepts and Techniques chapter 07 : Advanced Frequent Pattern M...
Data Mining: Concepts and Techniques chapter 07 : Advanced Frequent Pattern M...Data Mining: Concepts and Techniques chapter 07 : Advanced Frequent Pattern M...
Data Mining: Concepts and Techniques chapter 07 : Advanced Frequent Pattern M...
Salah Amean
 
A Framework to Automatically Extract Funding Information from Text
A Framework to Automatically Extract Funding Information from TextA Framework to Automatically Extract Funding Information from Text
A Framework to Automatically Extract Funding Information from Text
Deep Kayal
 
Data Mining: Classification and analysis
Data Mining: Classification and analysisData Mining: Classification and analysis
Data Mining: Classification and analysis
DataminingTools Inc
 
Automated building of taxonomies for search engines
Automated building of taxonomies for search enginesAutomated building of taxonomies for search engines
Automated building of taxonomies for search engines
Boris Galitsky
 
Data mining
Data miningData mining
Data mining
StudsPlanet.com
 
Machine learning module 2
Machine learning module 2Machine learning module 2
Machine learning module 2
Gokulks007
 
Data Mining and Knowledge
Data Mining and KnowledgeData Mining and Knowledge
Data Mining and Knowledge
Kartik Kalpande Patil
 

What's hot (20)

Ej36829834
Ej36829834Ej36829834
Ej36829834
 
Bt0066 dbms
Bt0066 dbmsBt0066 dbms
Bt0066 dbms
 
Warehousing
WarehousingWarehousing
Warehousing
 
Data structures
Data structuresData structures
Data structures
 
Cs583 info-retrieval
Cs583 info-retrievalCs583 info-retrieval
Cs583 info-retrieval
 
Mc0088 data mining
Mc0088  data miningMc0088  data mining
Mc0088 data mining
 
Exploratory data analysis with Python
Exploratory data analysis with PythonExploratory data analysis with Python
Exploratory data analysis with Python
 
Data Mining: Concepts and techniques: Chapter 13 trend
Data Mining: Concepts and techniques: Chapter 13 trendData Mining: Concepts and techniques: Chapter 13 trend
Data Mining: Concepts and techniques: Chapter 13 trend
 
Scalable keyword search on large rdf data
Scalable keyword search on large rdf dataScalable keyword search on large rdf data
Scalable keyword search on large rdf data
 
An improvised frequent pattern tree
An improvised frequent pattern treeAn improvised frequent pattern tree
An improvised frequent pattern tree
 
similarity measure
similarity measure similarity measure
similarity measure
 
Machine learning introduction
Machine learning introductionMachine learning introduction
Machine learning introduction
 
Data types ,variables,array
Data types ,variables,arrayData types ,variables,array
Data types ,variables,array
 
Data Mining: Concepts and Techniques chapter 07 : Advanced Frequent Pattern M...
Data Mining: Concepts and Techniques chapter 07 : Advanced Frequent Pattern M...Data Mining: Concepts and Techniques chapter 07 : Advanced Frequent Pattern M...
Data Mining: Concepts and Techniques chapter 07 : Advanced Frequent Pattern M...
 
A Framework to Automatically Extract Funding Information from Text
A Framework to Automatically Extract Funding Information from TextA Framework to Automatically Extract Funding Information from Text
A Framework to Automatically Extract Funding Information from Text
 
Data Mining: Classification and analysis
Data Mining: Classification and analysisData Mining: Classification and analysis
Data Mining: Classification and analysis
 
Automated building of taxonomies for search engines
Automated building of taxonomies for search enginesAutomated building of taxonomies for search engines
Automated building of taxonomies for search engines
 
Data mining
Data miningData mining
Data mining
 
Machine learning module 2
Machine learning module 2Machine learning module 2
Machine learning module 2
 
Data Mining and Knowledge
Data Mining and KnowledgeData Mining and Knowledge
Data Mining and Knowledge
 

Viewers also liked

Ismail&&ziko 2003
Ismail&&ziko 2003Ismail&&ziko 2003
Ismail&&ziko 2003
Zakaria Zubi
 
Knowledge Discovery in Remote Access Databases
Knowledge Discovery in Remote Access Databases Knowledge Discovery in Remote Access Databases
Knowledge Discovery in Remote Access Databases
Zakaria Zubi
 
Edi text
Edi textEdi text
Edi text
Zakaria Zubi
 
Knowledge Discovery Query Language (KDQL)
Knowledge Discovery Query Language (KDQL)Knowledge Discovery Query Language (KDQL)
Knowledge Discovery Query Language (KDQL)
Zakaria Zubi
 
A Comparative Study of Data Mining Methods to Analyzing Libyan National Crime...
A Comparative Study of Data Mining Methods to Analyzing Libyan National Crime...A Comparative Study of Data Mining Methods to Analyzing Libyan National Crime...
A Comparative Study of Data Mining Methods to Analyzing Libyan National Crime...
Zakaria Zubi
 
Arabic Text mining Classification
Arabic Text mining Classification Arabic Text mining Classification
Arabic Text mining Classification
Zakaria Zubi
 
COMPARISON OF ROUTING PROTOCOLS FOR AD HOC WIRELESS NETWORK WITH MEDICAL DATA
COMPARISON OF ROUTING PROTOCOLS FOR AD HOC WIRELESS NETWORK WITH MEDICAL DATA COMPARISON OF ROUTING PROTOCOLS FOR AD HOC WIRELESS NETWORK WITH MEDICAL DATA
COMPARISON OF ROUTING PROTOCOLS FOR AD HOC WIRELESS NETWORK WITH MEDICAL DATA
Zakaria Zubi
 
Using Data Mining Techniques to Analyze Crime Pattern
Using Data Mining Techniques to Analyze Crime PatternUsing Data Mining Techniques to Analyze Crime Pattern
Using Data Mining Techniques to Analyze Crime Pattern
Zakaria Zubi
 

Viewers also liked (8)

Ismail&&ziko 2003
Ismail&&ziko 2003Ismail&&ziko 2003
Ismail&&ziko 2003
 
Knowledge Discovery in Remote Access Databases
Knowledge Discovery in Remote Access Databases Knowledge Discovery in Remote Access Databases
Knowledge Discovery in Remote Access Databases
 
Edi text
Edi textEdi text
Edi text
 
Knowledge Discovery Query Language (KDQL)
Knowledge Discovery Query Language (KDQL)Knowledge Discovery Query Language (KDQL)
Knowledge Discovery Query Language (KDQL)
 
A Comparative Study of Data Mining Methods to Analyzing Libyan National Crime...
A Comparative Study of Data Mining Methods to Analyzing Libyan National Crime...A Comparative Study of Data Mining Methods to Analyzing Libyan National Crime...
A Comparative Study of Data Mining Methods to Analyzing Libyan National Crime...
 
Arabic Text mining Classification
Arabic Text mining Classification Arabic Text mining Classification
Arabic Text mining Classification
 
COMPARISON OF ROUTING PROTOCOLS FOR AD HOC WIRELESS NETWORK WITH MEDICAL DATA
COMPARISON OF ROUTING PROTOCOLS FOR AD HOC WIRELESS NETWORK WITH MEDICAL DATA COMPARISON OF ROUTING PROTOCOLS FOR AD HOC WIRELESS NETWORK WITH MEDICAL DATA
COMPARISON OF ROUTING PROTOCOLS FOR AD HOC WIRELESS NETWORK WITH MEDICAL DATA
 
Using Data Mining Techniques to Analyze Crime Pattern
Using Data Mining Techniques to Analyze Crime PatternUsing Data Mining Techniques to Analyze Crime Pattern
Using Data Mining Techniques to Analyze Crime Pattern
 

Similar to I- Extended Databases

Probablistic information retrieval
Probablistic information retrievalProbablistic information retrieval
Probablistic information retrieval
Nisha Arankandath
 
Document ranking using qprp with concept of multi dimensional subspace
Document ranking using qprp with concept of multi dimensional subspaceDocument ranking using qprp with concept of multi dimensional subspace
Document ranking using qprp with concept of multi dimensional subspace
Prakash Dubey
 
EFFICIENTLY PROCESSING OF TOP-K TYPICALITY QUERY FOR STRUCTURED DATA
EFFICIENTLY PROCESSING OF TOP-K TYPICALITY QUERY FOR STRUCTURED DATAEFFICIENTLY PROCESSING OF TOP-K TYPICALITY QUERY FOR STRUCTURED DATA
EFFICIENTLY PROCESSING OF TOP-K TYPICALITY QUERY FOR STRUCTURED DATA
csandit
 
HyperQA: A Framework for Complex Question-Answering
HyperQA: A Framework for Complex Question-AnsweringHyperQA: A Framework for Complex Question-Answering
HyperQA: A Framework for Complex Question-Answering
Jinho Choi
 
Information retrival system and PageRank algorithm
Information retrival system and PageRank algorithmInformation retrival system and PageRank algorithm
Information retrival system and PageRank algorithm
Rupali Bhatnagar
 
R tutorial
R tutorialR tutorial
R tutorial
Richard Vidgen
 
PowerPoint Template
PowerPoint TemplatePowerPoint Template
PowerPoint Template
butest
 
A Document Exploring System on LDA Topic Model for Wikipedia Articles
A Document Exploring System on LDA Topic Model for Wikipedia ArticlesA Document Exploring System on LDA Topic Model for Wikipedia Articles
A Document Exploring System on LDA Topic Model for Wikipedia Articles
ijma
 
IRS-Lecture-Notes irsirs IRS-Lecture-Notes irsirs IRS-Lecture-Notes irsi...
IRS-Lecture-Notes irsirs    IRS-Lecture-Notes irsirs   IRS-Lecture-Notes irsi...IRS-Lecture-Notes irsirs    IRS-Lecture-Notes irsirs   IRS-Lecture-Notes irsi...
IRS-Lecture-Notes irsirs IRS-Lecture-Notes irsirs IRS-Lecture-Notes irsi...
onlmcq
 
G04124041046
G04124041046G04124041046
G04124041046
IOSR-JEN
 
Presentation on Machine Learning and Data Mining
Presentation on Machine Learning and Data MiningPresentation on Machine Learning and Data Mining
Presentation on Machine Learning and Data Mining
butest
 
Interlinking educational data to Web of Data (Thesis presentation)
Interlinking educational data to Web of Data (Thesis presentation)Interlinking educational data to Web of Data (Thesis presentation)
Interlinking educational data to Web of Data (Thesis presentation)
Enayat Rajabi
 
Dataminingoneducationaldomain (1)
Dataminingoneducationaldomain (1)Dataminingoneducationaldomain (1)
Dataminingoneducationaldomain (1)
IJASCSE
 
4-IR Models_new.ppt
4-IR Models_new.ppt4-IR Models_new.ppt
4-IR Models_new.ppt
BereketAraya
 
4-IR Models_new.ppt
4-IR Models_new.ppt4-IR Models_new.ppt
4-IR Models_new.ppt
BereketAraya
 
Artificial Intelligence
Artificial IntelligenceArtificial Intelligence
Artificial Intelligence
vini89
 
Data Science Using Scikit-Learn
Data Science Using Scikit-LearnData Science Using Scikit-Learn
Data Science Using Scikit-Learn
Ducat India
 
Ir
IrIr
Ir
IrIr
Project Presentation
Project PresentationProject Presentation
Project Presentation
butest
 

Similar to I- Extended Databases (20)

Probablistic information retrieval
Probablistic information retrievalProbablistic information retrieval
Probablistic information retrieval
 
Document ranking using qprp with concept of multi dimensional subspace
Document ranking using qprp with concept of multi dimensional subspaceDocument ranking using qprp with concept of multi dimensional subspace
Document ranking using qprp with concept of multi dimensional subspace
 
EFFICIENTLY PROCESSING OF TOP-K TYPICALITY QUERY FOR STRUCTURED DATA
EFFICIENTLY PROCESSING OF TOP-K TYPICALITY QUERY FOR STRUCTURED DATAEFFICIENTLY PROCESSING OF TOP-K TYPICALITY QUERY FOR STRUCTURED DATA
EFFICIENTLY PROCESSING OF TOP-K TYPICALITY QUERY FOR STRUCTURED DATA
 
HyperQA: A Framework for Complex Question-Answering
HyperQA: A Framework for Complex Question-AnsweringHyperQA: A Framework for Complex Question-Answering
HyperQA: A Framework for Complex Question-Answering
 
Information retrival system and PageRank algorithm
Information retrival system and PageRank algorithmInformation retrival system and PageRank algorithm
Information retrival system and PageRank algorithm
 
R tutorial
R tutorialR tutorial
R tutorial
 
PowerPoint Template
PowerPoint TemplatePowerPoint Template
PowerPoint Template
 
A Document Exploring System on LDA Topic Model for Wikipedia Articles
A Document Exploring System on LDA Topic Model for Wikipedia ArticlesA Document Exploring System on LDA Topic Model for Wikipedia Articles
A Document Exploring System on LDA Topic Model for Wikipedia Articles
 
IRS-Lecture-Notes irsirs IRS-Lecture-Notes irsirs IRS-Lecture-Notes irsi...
IRS-Lecture-Notes irsirs    IRS-Lecture-Notes irsirs   IRS-Lecture-Notes irsi...IRS-Lecture-Notes irsirs    IRS-Lecture-Notes irsirs   IRS-Lecture-Notes irsi...
IRS-Lecture-Notes irsirs IRS-Lecture-Notes irsirs IRS-Lecture-Notes irsi...
 
G04124041046
G04124041046G04124041046
G04124041046
 
Presentation on Machine Learning and Data Mining
Presentation on Machine Learning and Data MiningPresentation on Machine Learning and Data Mining
Presentation on Machine Learning and Data Mining
 
Interlinking educational data to Web of Data (Thesis presentation)
Interlinking educational data to Web of Data (Thesis presentation)Interlinking educational data to Web of Data (Thesis presentation)
Interlinking educational data to Web of Data (Thesis presentation)
 
Dataminingoneducationaldomain (1)
Dataminingoneducationaldomain (1)Dataminingoneducationaldomain (1)
Dataminingoneducationaldomain (1)
 
4-IR Models_new.ppt
4-IR Models_new.ppt4-IR Models_new.ppt
4-IR Models_new.ppt
 
4-IR Models_new.ppt
4-IR Models_new.ppt4-IR Models_new.ppt
4-IR Models_new.ppt
 
Artificial Intelligence
Artificial IntelligenceArtificial Intelligence
Artificial Intelligence
 
Data Science Using Scikit-Learn
Data Science Using Scikit-LearnData Science Using Scikit-Learn
Data Science Using Scikit-Learn
 
Ir
IrIr
Ir
 
Ir
IrIr
Ir
 
Project Presentation
Project PresentationProject Presentation
Project Presentation
 

More from Zakaria Zubi

applyingwebminingapplicationforuserbehaviorunderstanding-131215105223-phpapp0...
applyingwebminingapplicationforuserbehaviorunderstanding-131215105223-phpapp0...applyingwebminingapplicationforuserbehaviorunderstanding-131215105223-phpapp0...
applyingwebminingapplicationforuserbehaviorunderstanding-131215105223-phpapp0...
Zakaria Zubi
 
Applying web mining application for user behavior understanding
Applying web mining application for user behavior understandingApplying web mining application for user behavior understanding
Applying web mining application for user behavior understanding
Zakaria Zubi
 
Model
ModelModel
Ibtc dwt hybrid coding of digital images
Ibtc dwt hybrid coding of digital imagesIbtc dwt hybrid coding of digital images
Ibtc dwt hybrid coding of digital images
Zakaria Zubi
 
Deep Web mining
Deep Web miningDeep Web mining
Deep Web mining
Zakaria Zubi
 
Information communication technology in libya for educational purposes
Information communication technology in libya for educational purposesInformation communication technology in libya for educational purposes
Information communication technology in libya for educational purposes
Zakaria Zubi
 

More from Zakaria Zubi (6)

applyingwebminingapplicationforuserbehaviorunderstanding-131215105223-phpapp0...
applyingwebminingapplicationforuserbehaviorunderstanding-131215105223-phpapp0...applyingwebminingapplicationforuserbehaviorunderstanding-131215105223-phpapp0...
applyingwebminingapplicationforuserbehaviorunderstanding-131215105223-phpapp0...
 
Applying web mining application for user behavior understanding
Applying web mining application for user behavior understandingApplying web mining application for user behavior understanding
Applying web mining application for user behavior understanding
 
Model
ModelModel
Model
 
Ibtc dwt hybrid coding of digital images
Ibtc dwt hybrid coding of digital imagesIbtc dwt hybrid coding of digital images
Ibtc dwt hybrid coding of digital images
 
Deep Web mining
Deep Web miningDeep Web mining
Deep Web mining
 
Information communication technology in libya for educational purposes
Information communication technology in libya for educational purposesInformation communication technology in libya for educational purposes
Information communication technology in libya for educational purposes
 

Recently uploaded

Skybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoptionSkybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoption
Tatiana Kojar
 
Taking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdfTaking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdf
ssuserfac0301
 
GNSS spoofing via SDR (Criptored Talks 2024)
GNSS spoofing via SDR (Criptored Talks 2024)GNSS spoofing via SDR (Criptored Talks 2024)
GNSS spoofing via SDR (Criptored Talks 2024)
Javier Junquera
 
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-EfficiencyFreshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
ScyllaDB
 
Northern Engraving | Nameplate Manufacturing Process - 2024
Northern Engraving | Nameplate Manufacturing Process - 2024Northern Engraving | Nameplate Manufacturing Process - 2024
Northern Engraving | Nameplate Manufacturing Process - 2024
Northern Engraving
 
"Frontline Battles with DDoS: Best practices and Lessons Learned", Igor Ivaniuk
"Frontline Battles with DDoS: Best practices and Lessons Learned",  Igor Ivaniuk"Frontline Battles with DDoS: Best practices and Lessons Learned",  Igor Ivaniuk
"Frontline Battles with DDoS: Best practices and Lessons Learned", Igor Ivaniuk
Fwdays
 
Mutation Testing for Task-Oriented Chatbots
Mutation Testing for Task-Oriented ChatbotsMutation Testing for Task-Oriented Chatbots
Mutation Testing for Task-Oriented Chatbots
Pablo Gómez Abajo
 
Connector Corner: Seamlessly power UiPath Apps, GenAI with prebuilt connectors
Connector Corner: Seamlessly power UiPath Apps, GenAI with prebuilt connectorsConnector Corner: Seamlessly power UiPath Apps, GenAI with prebuilt connectors
Connector Corner: Seamlessly power UiPath Apps, GenAI with prebuilt connectors
DianaGray10
 
What is an RPA CoE? Session 1 – CoE Vision
What is an RPA CoE?  Session 1 – CoE VisionWhat is an RPA CoE?  Session 1 – CoE Vision
What is an RPA CoE? Session 1 – CoE Vision
DianaGray10
 
Leveraging the Graph for Clinical Trials and Standards
Leveraging the Graph for Clinical Trials and StandardsLeveraging the Graph for Clinical Trials and Standards
Leveraging the Graph for Clinical Trials and Standards
Neo4j
 
Apps Break Data
Apps Break DataApps Break Data
Apps Break Data
Ivo Velitchkov
 
Programming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup SlidesProgramming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup Slides
Zilliz
 
"Choosing proper type of scaling", Olena Syrota
"Choosing proper type of scaling", Olena Syrota"Choosing proper type of scaling", Olena Syrota
"Choosing proper type of scaling", Olena Syrota
Fwdays
 
June Patch Tuesday
June Patch TuesdayJune Patch Tuesday
June Patch Tuesday
Ivanti
 
“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...
“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...
“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...
Edge AI and Vision Alliance
 
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
saastr
 
Y-Combinator seed pitch deck template PP
Y-Combinator seed pitch deck template PPY-Combinator seed pitch deck template PP
Y-Combinator seed pitch deck template PP
c5vrf27qcz
 
zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...
zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...
zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...
Alex Pruden
 
Principle of conventional tomography-Bibash Shahi ppt..pptx
Principle of conventional tomography-Bibash Shahi ppt..pptxPrinciple of conventional tomography-Bibash Shahi ppt..pptx
Principle of conventional tomography-Bibash Shahi ppt..pptx
BibashShahi
 
Monitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdfMonitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdf
Tosin Akinosho
 

Recently uploaded (20)

Skybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoptionSkybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoption
 
Taking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdfTaking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdf
 
GNSS spoofing via SDR (Criptored Talks 2024)
GNSS spoofing via SDR (Criptored Talks 2024)GNSS spoofing via SDR (Criptored Talks 2024)
GNSS spoofing via SDR (Criptored Talks 2024)
 
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-EfficiencyFreshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
 
Northern Engraving | Nameplate Manufacturing Process - 2024
Northern Engraving | Nameplate Manufacturing Process - 2024Northern Engraving | Nameplate Manufacturing Process - 2024
Northern Engraving | Nameplate Manufacturing Process - 2024
 
"Frontline Battles with DDoS: Best practices and Lessons Learned", Igor Ivaniuk
"Frontline Battles with DDoS: Best practices and Lessons Learned",  Igor Ivaniuk"Frontline Battles with DDoS: Best practices and Lessons Learned",  Igor Ivaniuk
"Frontline Battles with DDoS: Best practices and Lessons Learned", Igor Ivaniuk
 
Mutation Testing for Task-Oriented Chatbots
Mutation Testing for Task-Oriented ChatbotsMutation Testing for Task-Oriented Chatbots
Mutation Testing for Task-Oriented Chatbots
 
Connector Corner: Seamlessly power UiPath Apps, GenAI with prebuilt connectors
Connector Corner: Seamlessly power UiPath Apps, GenAI with prebuilt connectorsConnector Corner: Seamlessly power UiPath Apps, GenAI with prebuilt connectors
Connector Corner: Seamlessly power UiPath Apps, GenAI with prebuilt connectors
 
What is an RPA CoE? Session 1 – CoE Vision
What is an RPA CoE?  Session 1 – CoE VisionWhat is an RPA CoE?  Session 1 – CoE Vision
What is an RPA CoE? Session 1 – CoE Vision
 
Leveraging the Graph for Clinical Trials and Standards
Leveraging the Graph for Clinical Trials and StandardsLeveraging the Graph for Clinical Trials and Standards
Leveraging the Graph for Clinical Trials and Standards
 
Apps Break Data
Apps Break DataApps Break Data
Apps Break Data
 
Programming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup SlidesProgramming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup Slides
 
"Choosing proper type of scaling", Olena Syrota
"Choosing proper type of scaling", Olena Syrota"Choosing proper type of scaling", Olena Syrota
"Choosing proper type of scaling", Olena Syrota
 
June Patch Tuesday
June Patch TuesdayJune Patch Tuesday
June Patch Tuesday
 
“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...
“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...
“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...
 
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
 
Y-Combinator seed pitch deck template PP
Y-Combinator seed pitch deck template PPY-Combinator seed pitch deck template PP
Y-Combinator seed pitch deck template PP
 
zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...
zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...
zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...
 
Principle of conventional tomography-Bibash Shahi ppt..pptx
Principle of conventional tomography-Bibash Shahi ppt..pptxPrinciple of conventional tomography-Bibash Shahi ppt..pptx
Principle of conventional tomography-Bibash Shahi ppt..pptx
 
Monitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdfMonitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdf
 

I- Extended Databases

  • 1. I- Extended Databases Key words:Key words: Knowledge Discovery inKnowledge Discovery in Databases (KDD).Databases (KDD). Data Mining (DM).Data Mining (DM). Data Warehousing (DW) .Data Warehousing (DW) . Query Optimization (QO).Query Optimization (QO).
  • 2. Assistant Professor, Computer Science Department, Faculty of Science, Al-Tahadi University, P.O. Box 727, Sirt ,Libya, Dr. Zakaria Suliman ZubiDr. Zakaria Suliman Zubi ByBy
  • 3. 3 I- Extended DatabasesI- Extended Databases  Abstract .  Introduction of the Indicative Databases .  I-Extended Databases (IE) motivation.  I-Extended Databases (IE) and KDD processes .  Example .  Conclusions and Remarks .  Questions.
  • 4. 4 AbstractAbstract (1)  How we can handle generalizations in a very large database using Association Rules (AR), and inclusion Functional Dependencies (FD)?  The answer is Inductive database.  I- Extended database has a similar property to inductive databases.  I- Extended database contain exceedingly defined generalizations about the data .
  • 5. 5 AbstractAbstract (2)  It can be used in the process of Data Mining.  It was proposed in ODBC_KDD(2) Model.  The query will uses normal database terminology.  The main aim of I-Extended database is to interact with a spatial Data Mining query called Knowledge Discovery Query Language (KDQL) described in [22].  The KDQL was demonstrated and introduced as a query in the ODBC_KDD (2) model in [22].
  • 6. 6 Introduction of the Indicative DatabasesIntroduction of the Indicative Databases  KDD process, contains several steps: understanding the domain, preparing the data set, discovering patterns (i.e., computing a theory), post-processing of discovered patterns, and putting the results into use.  KDD, we need a query language that not only enables the user to select subsets of the data, but also to specify DM tasks and select patterns from the corresponding theories.  Considering the KDQL rules operator which was described in [ 21] as a possible querying language on mining association rules for i-extended database.  Query should be an object of a similar type than its arguments.
  • 7. 7 The model was introduced at the Institute of Mathematics andThe model was introduced at the Institute of Mathematics and Informatics at Debrecen University, Debrecen, Hungary 2002.Informatics at Debrecen University, Debrecen, Hungary 2002. I-Extended Databases Motivation Gateway
  • 8. 8  I-Extended database is a pair R = (R, (PR, e, V))  Where : –R is a database schema. –PRis a collection of patterns. –V is a set of result values . – e is the evaluation function that defines pattern semantics.  This function maps each pair (r, θi) to an element of V, where r is a database over R and θi P∊ R is a pattern.  An instance of the schema, i-extended database (r, s) over the schema R consists of a database r over the schema R and a subset s ⊆ PR. I-Extended Databases MotivationI-Extended Databases Motivation continuecontinue
  • 9. 9  Example : If the patterns are Boolean formulae about the database, V is {true, false}, And the evaluation function e(r, θ) has value true iff the formula θ is true about r. In practice, a user might be interested in selecting from the intentionally defined collection of all Boolean formulas, the formulas which are true or the formulas which are false. I-Extended Databases MotivationI-Extended Databases Motivation continuecontinue
  • 10. 10 I-Extended Databases MotivationI-Extended Databases Motivation continuecontinue  I-Extended Database : Is a database that in addition to data also contain exceedingly defined generalizations about the data. First we illustrate the Association Rules, and then we Generalize the approach and point out key issues for query evaluation in general.  I-Extended database is a database that has similar properties that are in inductive database that shows how it can be used throughout the whole process of DM due to the closure property of the framework.
  • 11. 11 I-Extended Databases MotivationI-Extended Databases Motivation continuecontinue  The aim of I-Extended Database is as follow:The aim of I-Extended Database is as follow: – I-extended database consists of a normal database associated to a subset of patterns from a class of patterns, and an evaluation function that tells how the patterns occur in the data. – I-extended database can be queried (in principle) just by using normal relational algebra or SQL, with the added property of being able to refer to the values of the evaluation function on the patterns. – Modeling KDD processes as a sequence of queries on i-extended database gives rise to chances for reasoning and optimizing these processes
  • 12. 12 I-Extended Databases (IE) and KDD processes  KDD consists of several steps one of these steps is Data Mining.  In Data Mining process we are concerned with unique class of patterns for a real life mining processes presented in a dynamic nature of knowledge acquisition scenario.  These interesting patterns will be presented in I-Extended Databases based on there captured frequency, confidence and support values.  Knowledge gathered often affects the search process, giving rise to new goals in addition to the original ones.
  • 13. 13 I-Extended Databases (IE) and KDD processI-Extended Databases (IE) and KDD process continuecontinue  KDD processes can be described by sequences of operations, i.e., queries over relevant i-extended database.  Sequences of queries are abstract and concise descriptions of DM processes.  These descriptions can even be annotated by statistical information about the size of selected dataset, the size of intermediate collection of patterns etc..  Providing knowledge for further use of these relevant sequences.
  • 14. 14 Example/ Patterns in three instances of I-Extended Database  Schema R = {A1,…..,An} of attributes with domain {0, 1}.  Relation r over R, an association rule about r is an expression of the form X⇒B where X ⊆ R and B ∊R X.  The intuitive meaning of the rule is that if a row of the matrix r has a 1 in each column of X, then the row tends to have a 1 also in column B.  This semantics is captured by frequency and confidence values. Given W ⊆ R, support (W, r) denotes the fraction of rows of r that have a 1 in each column of W.  The frequency of X ⇒ B in r is defined to be support(X ⋃{B}, r) while its confidence is support(X ⋃ {B}, r)/ support(X , r). Typically, we are interested in association rules for which the frequency and the confidence are greater
  • 15. 15 Conclusions and RemarksConclusions and Remarks  I-Extended Databases enables the definition of mining process as a sequences of queries by using a closure property.  I-Extended Databases is a mandatory step towards to a general purpose query languages for KDD applications.  I-Extended Databases supports pattern generation, pattern filtering and pattern combining operations.  I-Extended Databases can uses standard database terminology to carry out any significant patterns without introducing any additional concepts .
  • 16. 16 Importance ReferencesImportance References  [20] T. Imielinski and H. Mannila. A database perspective on knowledge discovery. Communications of ACM, 39:58-64, 1996.  [21] Zakaria S. Zubi, Knowledge Discovery in Remote Access Database, Ch. 9 , PhD dissertation, Debrecen University, Hungary, 2002.  [22] Zakaria S. Zubi, Fazekas Gábor, On ODBC_KDD models, paper,5th International Conference on Applied Informatics, , 28 January -3 February 2001, Eger, Hungary,2001.
  • 18. 18
  • 19. 19