SlideShare a Scribd company logo
Enterprise and Data Mining Ontology Integration to Extract
Actionable Knowledge
HamidReza Nazari1
, MohammadTaghi TaghaviFard1
and Iman Raeesi Vanani1
1
Department of Management and Accounting, Allameh Tabataba’i University, Tehran, Iran
Hamidnazary2002@yahoo.com, {Taghavifard, imanraeesi }@atu.ac.ir
1 RESEARCH PROBLEM
Today, it is hard to imagine an enterprise with no
manipulated processes to manage its data and
information. Organizations invest their money and
efforts in Information Technologies (IT) with the
aim of gaining competitive advantages.
Knowledge Discovery in Databases (KDD) is
one of the most attractive notions to make real-time
and appropriate decisions and to gain sustainable
competetive advantages. Knowledge discovery in
big data era is expected to change the global
economy by 2020, raising 325 billion dollars only in
U.S retailing and manufacturing sectors(McKinsey
& Company, 2013).
Despite this, knowledge discovered by data
mining (DM) algorithms are not actionable enough
to grow the economy (Cao, 2012; Way & Wang,
2008). Feritas(2006) in an investigation of
association rules reported that only 1% of the
outcomes is actionable, 220 rules among 29050
extracted rules. Existing mining techiques and
algorithms have reached considerable levels of
efficiency but their effectiveness on identifying
actionable knowledge is impaired (Antunes, 2014).
DM Researchers have focued more on providing
creative algorithms and investigated the outcomes
from accuracy and efficiency point of view. They
have simplified the problem, not identifying
business requirements and context completely (Cao,
2012; Hirsh, 2008). The knowledge discovered by
algorithms is mainly passive and is not quite active
and helpful for managers to make a decision. So,
more proccesing and analysis is needed to create
actionable knowledge.
This gap is rooted in neglecting business
understanding (BU) in formulating a DM problem.
There is a need to shift from data driven DM to the
domain driven one (Cao, 2010, 2012). Mariscal et al
(2010) in a deep review of DM methodologies and
models, mention these phases to develop CRISP-
DM, to address its weaknesses in BU and deploying
results: 1) Domain knowledge elicitation to
constrain the learning algorithms search space and to
reduce the number of patterns discovered, 2)
Problem specification to better understand the
problem and at the same time, it has to be considered
that objectives and goals are dynamic due to changes
in the requirements of the organization or business,
and they must be reviewed and updated regularly,
3) Interpretation to ensure achieving business
requirements, 4) Automation for supporting non-
expert data mining users to get new knowledge from
new data and existing models in an easy way, 5)
and, Maintenance.
Thus, the research problems are 1) How to
formalize BU in DM process? 2) How the
formalization would lead to actionable knowledge?
2 OUTLINE OF OBJECTIVES
The objectives of this research involve the
following:
1) Designing an artefact to formally apply BU
in DM.
2) Semi-automating BU phase of DM to help a
wide range of users to avoid common
analytical mistakes.
3) To facilitate assessing and deploying
knowledge discovery algorithms results in
business context.
3 STATE OF THE ART
Output of BU determines relevant data for data
understanding phase. It determines if target variable
should be discretized in data preparation phase. It
results in enumeration of applicable modelling
techniques in data modelling phase. And finally, it
determines the evaluation criteria and actionability
of the results (Sharma & Osei-Bryson, 2015).
Cespivova et el (2004) outlined the possible roles
of a domain ontology in all phases of data mining
process. Enterprise ontology models an enterprise in
the form of information system. The activities of BU
phase span across concepts and their relationships
are described in an enterprise ontology. Sharma &
Osei-Bryson (2008) assert that no other framework
has the ability to account for BU phase activities. Li
(2014) assert that to support decision in BU, case
based reasoning, ontology based approach and work
flow management reasoning can be useful. Antunes
and Silva (2014) also emphasize using domain
ontology to improve DM effectiveness (Fig 1).
Insert Figure 1 here
Many DM ontologies have developed in the last
decade, like DMOP, ONTO-DM, DMW3, to name a
few. DM ontologies have the ability to select best fit
algorithms among the pool of DM algorithms. Most
of them have neglected BU and focused on data
understanding, preparation and modelling. Table 1
summarises coverage of DM ontologies based on
CRISP-DM.
Insert table 1 here
In summary, 1) This research attempts to match
enterprise and DM ontologies to achieve the
aforementioned objectives. According to our
literature review, no similar research can be found in
the literature. 2) This research will cultivate the
theoretical foundations of enterprise engineering,
DM and ontology engineering by designing an
integrated artefact to formalize BU of DM process
and to facilitate extracting actionable knowledge. 3)
Theoretical contribution of the research will enhance
the interoperability of DM process. 4) The research
is a step forward to automating DM process more
wisely and to helps business users and DM experts
to convert their ad hoc reasoning to more systematic
and ontology aware one.
4 METHODOLOGY
Simon (1969) made a clear distinction between
natural sciences and sciences of artificial or sciences
of design. Artefacts are purposely designed artificial
things to satisfy human’s goals. An artefact is an
interface between the substance and organization of
itself and also external environment in which it
operates.
Information system discipline is rooted in the
creation of information technology artefacts (Hevner
et al, 2004). Thus, this research will apply design
science approach. To build an artefact, we will
review current enterprise and DM ontologies. It is
expected to find basic concepts and relationships
essential for integration in this step. The construct
will be evaluated by experts based on its
completeness, understandability and explicitness.
Thus, the integrated ontology will be built and
evaluated in this step.
Solving the identified problem is one the main
disciplines of design science research methodology.
Therefore, we will use an ontology integration
methodology to build a prototype in OWL or XML.
Alignment and correspondence of the prototype
have to be evaluated in this step.
Finally, an instance for an enterprise unit will be
built based on the prototype. Case study is the
method of evaluation here. We will not directly
compute actionability, instead alignment of the
extracted knowledge and the business goal will be
indicated the actionability of the outcomes.
5 EXPECTED OUTCOME
Achieving the research objectives requires a
developed enterprise ontology that can be
seamlessly integrated with DM ontology. So, the
first outcome of the research is developed enterprise
ontology. In term of design science research
methodology, this is the built construct of the
research. This ontology will formalize concepts
mentioned in BU phase of CRISP-DM like business
objectives, requirements and constraints. Also, it
will be useful in automating data understanding
phase of CRISP-DM.
Defining interactions between concepts of
enterprise and DM ontologies and developing
relations between them are the second expected
outcome of the research. This construct will be used
by ontology integration methodologies to build the
instance aspect of research artefact.
These interactions and relations will bridge the
semantic gap between business users and DM
experts. This way, the complexity of DM for novice
data miners will be reduced. This complexity is a
main barrier for many organizations applying or
utilizing DM processes and methodology.
According to the developed correspondences and
the built instance, it is expected to prune the
outcomes of DM algorithms more easily and to
extract more actionable knowledge based on
semantic similarity computing of them. As a result,
the research outcome facilitates the activities of
evaluation phase of CRISP-DM.
6 STAGES OF THE RESEARCH
The research is in its early stage. We have started to
investigate enterprise and DM ontologies to select or
propose suitable ones for the integration.
REFERENCES
Antunes, C., & Silva, A. 2014. New Trends in Knowledge
Driven Data Mining. Proceedings from ICEIS ’14.
International Conference on Enterprise Information
Systems, 346-351.
Cao, L. (2012). Actionable knowledge discovery and
delivery. Wiley Interdisciplinary Reviews: Data
Mining and Knowledge Discovery, 2(2), 149-163.
Cao, L. (2010). Domain-driven data mining: Challenges
and prospects. Knowledge and Data Engineering,
IEEE Transactions on, 22(6), 755-769
Cešpivová, H., Rauch, J., Svatek, V., Kejkula, M., &
Tomeckova, M. (2004). Roles of medical ontology in
association mining Crisp-DM cycle. In ECML/
PKDD04 Workshop on Knowledge Discovery and
Ontologies (KDO’04), Pisa (Vol. 220).
Freitas AA (2006) Are we really discovering “interesting”
knowledge from data? Expert Update, Vol. 9, No. 1,
41-47, autumn 2006.
Hevner, A., March, S., Park, J., and Ram, S. (2004)
Design Science in Information Systems Research. MIS
Quarterly, 28(1).
Hirsh, H. (2008). Data mining research: Current status and
future opportunities. Statistical Analysis and Data
Mining: The ASA Data Science Journal, 1(2), 104-107
Li, Y. (2014) New Artefacts for the Knowledge Discovery
via Data Analytics (KDDA) Process. PhD
Dissertation, Virginia Commonwealth University,
USA.
Mariscal, G., Marbán, Ó. & Fernández, C. (2010). A
survey of data mining and knowledge discovery
process models and methodologies. The Knowledge
Engineering Review, 25(02), 137-166
Mckinsey and Company. (2013). Game changers: Five
opportunities for US growth and renewal. McKinsey
Global Institute.
Sharma, S., and Osei-Bryson, K. (2015) A Method for
Formulating the Business Objectives of Data Mining
Projects. In Knowledge Discovery Process and
Methods to Enhance Organizational Performance.
Osei-Bryson, K, and Barclay, C. (editors), CRC Press
Simon, H. A. 1969. The Sciences of the Artificial, (The
MIT Press: Cambridge, MA.
Wang, H., & Wang, S. (2008). A knowledge management
approach to data mining process for business
intelligence. Industrial Management & Data Systems,
108(5), 622-634.
Figure 1. Research artefact
Table 1. DM ontology comparison
BU Data Understanding Data preparation Modelling Evaluation
Deployme
nt
DMW3
DMOP
ONTO-DM
KDDONTO
Expose
KD-Ontology
Intelligent
Discovery
Electronic
Assistant
(IDEA)
DAMON
Ontological
Learning
Assistant
(OLA)

More Related Content

What's hot

Selecting Experts Using Data Quality Concepts
Selecting Experts Using Data Quality ConceptsSelecting Experts Using Data Quality Concepts
Selecting Experts Using Data Quality Concepts
ijdms
 
Decision support-systems
Decision support-systemsDecision support-systems
Decision support-systems
Nguyen Phuc
 
Extended pso algorithm for improvement problems k means clustering algorithm
Extended pso algorithm for improvement problems k means clustering algorithmExtended pso algorithm for improvement problems k means clustering algorithm
Extended pso algorithm for improvement problems k means clustering algorithm
IJMIT JOURNAL
 
Impulsion of Mining Paradigm with Density Based Clustering of Multi Dimension...
Impulsion of Mining Paradigm with Density Based Clustering of Multi Dimension...Impulsion of Mining Paradigm with Density Based Clustering of Multi Dimension...
Impulsion of Mining Paradigm with Density Based Clustering of Multi Dimension...
IOSR Journals
 
Actionable Knowledge Discovery
 Actionable Knowledge Discovery  Actionable Knowledge Discovery
Actionable Knowledge Discovery
murthynani1
 
Recursos TIC e Educação
Recursos TIC e EducaçãoRecursos TIC e Educação
Recursos TIC e Educação
Luis Pedro
 
Data Mining: A prediction Technique for the workers in the PR Department of O...
Data Mining: A prediction Technique for the workers in the PR Department of O...Data Mining: A prediction Technique for the workers in the PR Department of O...
Data Mining: A prediction Technique for the workers in the PR Department of O...
ijcseit
 
ROLE OF ELECTRONIC HUMAN RESOURCES (E-HR) IN ORGANIZATION
ROLE OF ELECTRONIC HUMAN RESOURCES (E-HR) IN ORGANIZATIONROLE OF ELECTRONIC HUMAN RESOURCES (E-HR) IN ORGANIZATION
ROLE OF ELECTRONIC HUMAN RESOURCES (E-HR) IN ORGANIZATION
International Research Journal of Modernization in Engineering Technology and Science
 
On the benefit of logic-based machine learning to learn pairwise comparisons
On the benefit of logic-based machine learning to learn pairwise comparisonsOn the benefit of logic-based machine learning to learn pairwise comparisons
On the benefit of logic-based machine learning to learn pairwise comparisons
journalBEEI
 
Modelling turn away intention of information technology professionals in Bang...
Modelling turn away intention of information technology professionals in Bang...Modelling turn away intention of information technology professionals in Bang...
Modelling turn away intention of information technology professionals in Bang...
IJECEIAES
 
Extending canonical action research model to implement social media in microb...
Extending canonical action research model to implement social media in microb...Extending canonical action research model to implement social media in microb...
Extending canonical action research model to implement social media in microb...
Debashish Mandal
 
Competitive advantage from Data Mining: some lessons learnt ...
Competitive advantage from Data Mining: some lessons learnt ...Competitive advantage from Data Mining: some lessons learnt ...
Competitive advantage from Data Mining: some lessons learnt ...
butest
 
Business Analytics Dissertation Help: Defining And Identifying A Research Ag...
 Business Analytics Dissertation Help: Defining And Identifying A Research Ag... Business Analytics Dissertation Help: Defining And Identifying A Research Ag...
Business Analytics Dissertation Help: Defining And Identifying A Research Ag...
Tutors India
 
Prediction of Default Customer in Banking Sector using Artificial Neural Network
Prediction of Default Customer in Banking Sector using Artificial Neural NetworkPrediction of Default Customer in Banking Sector using Artificial Neural Network
Prediction of Default Customer in Banking Sector using Artificial Neural Network
rahulmonikasharma
 
Alcacer & zhao 2009: Local R&D Strategies and Multi-location Firms: The Role ...
Alcacer & zhao 2009: Local R&D Strategies and Multi-location Firms: The Role ...Alcacer & zhao 2009: Local R&D Strategies and Multi-location Firms: The Role ...
Alcacer & zhao 2009: Local R&D Strategies and Multi-location Firms: The Role ...
MOC2010
 
CIS 336 Str Life of the Mind/newtonhelp.com   
CIS 336 Str Life of the Mind/newtonhelp.com   CIS 336 Str Life of the Mind/newtonhelp.com   
CIS 336 Str Life of the Mind/newtonhelp.com   
bellflower3
 
Linking Behavioral Patterns to Personal Attributes through Data Re-Mining
Linking Behavioral Patterns to Personal Attributes through Data Re-MiningLinking Behavioral Patterns to Personal Attributes through Data Re-Mining
Linking Behavioral Patterns to Personal Attributes through Data Re-Mining
ertekg
 
CIS 336 STR Redefined Education--cis336.com
CIS 336 STR Redefined Education--cis336.comCIS 336 STR Redefined Education--cis336.com
CIS 336 STR Redefined Education--cis336.com
agathachristie209
 

What's hot (18)

Selecting Experts Using Data Quality Concepts
Selecting Experts Using Data Quality ConceptsSelecting Experts Using Data Quality Concepts
Selecting Experts Using Data Quality Concepts
 
Decision support-systems
Decision support-systemsDecision support-systems
Decision support-systems
 
Extended pso algorithm for improvement problems k means clustering algorithm
Extended pso algorithm for improvement problems k means clustering algorithmExtended pso algorithm for improvement problems k means clustering algorithm
Extended pso algorithm for improvement problems k means clustering algorithm
 
Impulsion of Mining Paradigm with Density Based Clustering of Multi Dimension...
Impulsion of Mining Paradigm with Density Based Clustering of Multi Dimension...Impulsion of Mining Paradigm with Density Based Clustering of Multi Dimension...
Impulsion of Mining Paradigm with Density Based Clustering of Multi Dimension...
 
Actionable Knowledge Discovery
 Actionable Knowledge Discovery  Actionable Knowledge Discovery
Actionable Knowledge Discovery
 
Recursos TIC e Educação
Recursos TIC e EducaçãoRecursos TIC e Educação
Recursos TIC e Educação
 
Data Mining: A prediction Technique for the workers in the PR Department of O...
Data Mining: A prediction Technique for the workers in the PR Department of O...Data Mining: A prediction Technique for the workers in the PR Department of O...
Data Mining: A prediction Technique for the workers in the PR Department of O...
 
ROLE OF ELECTRONIC HUMAN RESOURCES (E-HR) IN ORGANIZATION
ROLE OF ELECTRONIC HUMAN RESOURCES (E-HR) IN ORGANIZATIONROLE OF ELECTRONIC HUMAN RESOURCES (E-HR) IN ORGANIZATION
ROLE OF ELECTRONIC HUMAN RESOURCES (E-HR) IN ORGANIZATION
 
On the benefit of logic-based machine learning to learn pairwise comparisons
On the benefit of logic-based machine learning to learn pairwise comparisonsOn the benefit of logic-based machine learning to learn pairwise comparisons
On the benefit of logic-based machine learning to learn pairwise comparisons
 
Modelling turn away intention of information technology professionals in Bang...
Modelling turn away intention of information technology professionals in Bang...Modelling turn away intention of information technology professionals in Bang...
Modelling turn away intention of information technology professionals in Bang...
 
Extending canonical action research model to implement social media in microb...
Extending canonical action research model to implement social media in microb...Extending canonical action research model to implement social media in microb...
Extending canonical action research model to implement social media in microb...
 
Competitive advantage from Data Mining: some lessons learnt ...
Competitive advantage from Data Mining: some lessons learnt ...Competitive advantage from Data Mining: some lessons learnt ...
Competitive advantage from Data Mining: some lessons learnt ...
 
Business Analytics Dissertation Help: Defining And Identifying A Research Ag...
 Business Analytics Dissertation Help: Defining And Identifying A Research Ag... Business Analytics Dissertation Help: Defining And Identifying A Research Ag...
Business Analytics Dissertation Help: Defining And Identifying A Research Ag...
 
Prediction of Default Customer in Banking Sector using Artificial Neural Network
Prediction of Default Customer in Banking Sector using Artificial Neural NetworkPrediction of Default Customer in Banking Sector using Artificial Neural Network
Prediction of Default Customer in Banking Sector using Artificial Neural Network
 
Alcacer & zhao 2009: Local R&D Strategies and Multi-location Firms: The Role ...
Alcacer & zhao 2009: Local R&D Strategies and Multi-location Firms: The Role ...Alcacer & zhao 2009: Local R&D Strategies and Multi-location Firms: The Role ...
Alcacer & zhao 2009: Local R&D Strategies and Multi-location Firms: The Role ...
 
CIS 336 Str Life of the Mind/newtonhelp.com   
CIS 336 Str Life of the Mind/newtonhelp.com   CIS 336 Str Life of the Mind/newtonhelp.com   
CIS 336 Str Life of the Mind/newtonhelp.com   
 
Linking Behavioral Patterns to Personal Attributes through Data Re-Mining
Linking Behavioral Patterns to Personal Attributes through Data Re-MiningLinking Behavioral Patterns to Personal Attributes through Data Re-Mining
Linking Behavioral Patterns to Personal Attributes through Data Re-Mining
 
CIS 336 STR Redefined Education--cis336.com
CIS 336 STR Redefined Education--cis336.comCIS 336 STR Redefined Education--cis336.com
CIS 336 STR Redefined Education--cis336.com
 

Viewers also liked

Ontology integration - Heterogeneity, Techniques and more
Ontology integration - Heterogeneity, Techniques and moreOntology integration - Heterogeneity, Techniques and more
Ontology integration - Heterogeneity, Techniques and more
Adriel Café
 
8 ontology integration and interoperability (onto i op)
8 ontology integration and interoperability (onto i op)8 ontology integration and interoperability (onto i op)
8 ontology integration and interoperability (onto i op)
AEGIS-ACCESSIBLE Projects
 
Ontology-based Data Integration
Ontology-based Data IntegrationOntology-based Data Integration
Ontology-based Data Integration
Janna Hastings
 
X Som Graduation Presentation
X Som   Graduation PresentationX Som   Graduation Presentation
X Som Graduation Presentation
Giorgio Orsi
 
Ontology For Data Integration
Ontology For Data IntegrationOntology For Data Integration
Ontology For Data Integration
juanesteva
 
Local Search Hawaii Michael Dorausch PubCon SEO
Local Search Hawaii Michael Dorausch PubCon SEOLocal Search Hawaii Michael Dorausch PubCon SEO
Local Search Hawaii Michael Dorausch PubCon SEO
Michael Dorausch
 
[DSBW Spring 2010] Unit 10: XML and Web And beyond
[DSBW Spring 2010] Unit 10: XML and Web And beyond[DSBW Spring 2010] Unit 10: XML and Web And beyond
[DSBW Spring 2010] Unit 10: XML and Web And beyond
Carles Farré
 
A Data Fusion System for Spatial Data Mining, Analysis and Improvement Silvij...
A Data Fusion System for Spatial Data Mining, Analysis and Improvement Silvij...A Data Fusion System for Spatial Data Mining, Analysis and Improvement Silvij...
A Data Fusion System for Spatial Data Mining, Analysis and Improvement Silvij...
Beniamino Murgante
 
Horizontal Integration of Big Intelligence Data
Horizontal Integration of Big Intelligence DataHorizontal Integration of Big Intelligence Data
Horizontal Integration of Big Intelligence Data
DataTactics
 
[ABDO] Data Integration
[ABDO] Data Integration[ABDO] Data Integration
[ABDO] Data Integration
Carles Farré
 
Lecture 07: Localization and Mapping I
Lecture 07: Localization and Mapping ILecture 07: Localization and Mapping I
Lecture 07: Localization and Mapping I
University of Colorado at Boulder
 
Pal gov.tutorial2.session13 2.gav and lav integration
Pal gov.tutorial2.session13 2.gav and lav integrationPal gov.tutorial2.session13 2.gav and lav integration
Pal gov.tutorial2.session13 2.gav and lav integration
Mustafa Jarrar
 
DSBW Final Exam (Spring Sementer 2010)
DSBW Final Exam (Spring Sementer 2010)DSBW Final Exam (Spring Sementer 2010)
DSBW Final Exam (Spring Sementer 2010)
Carles Farré
 
Distributed databases and dbm ss
Distributed databases and dbm ssDistributed databases and dbm ss
Distributed databases and dbm ss
Mohd Arif
 
Lecture 09: Localization and Mapping III
Lecture 09: Localization and Mapping IIILecture 09: Localization and Mapping III
Lecture 09: Localization and Mapping III
University of Colorado at Boulder
 
Jarrar: Data Schema Integration
Jarrar: Data Schema Integration Jarrar: Data Schema Integration
Jarrar: Data Schema Integration
Mustafa Jarrar
 
Database , 17 Web
Database , 17 WebDatabase , 17 Web
Database , 17 Web
Ali Usman
 
1 ddbms jan 2011_u
1 ddbms jan 2011_u1 ddbms jan 2011_u
1 ddbms jan 2011_u
betheperformer
 
How to design a linear control system
How to design a linear control systemHow to design a linear control system
How to design a linear control system
Alireza Mirzaei
 
Database , 15 Object DBMS
Database , 15 Object DBMSDatabase , 15 Object DBMS
Database , 15 Object DBMS
Ali Usman
 

Viewers also liked (20)

Ontology integration - Heterogeneity, Techniques and more
Ontology integration - Heterogeneity, Techniques and moreOntology integration - Heterogeneity, Techniques and more
Ontology integration - Heterogeneity, Techniques and more
 
8 ontology integration and interoperability (onto i op)
8 ontology integration and interoperability (onto i op)8 ontology integration and interoperability (onto i op)
8 ontology integration and interoperability (onto i op)
 
Ontology-based Data Integration
Ontology-based Data IntegrationOntology-based Data Integration
Ontology-based Data Integration
 
X Som Graduation Presentation
X Som   Graduation PresentationX Som   Graduation Presentation
X Som Graduation Presentation
 
Ontology For Data Integration
Ontology For Data IntegrationOntology For Data Integration
Ontology For Data Integration
 
Local Search Hawaii Michael Dorausch PubCon SEO
Local Search Hawaii Michael Dorausch PubCon SEOLocal Search Hawaii Michael Dorausch PubCon SEO
Local Search Hawaii Michael Dorausch PubCon SEO
 
[DSBW Spring 2010] Unit 10: XML and Web And beyond
[DSBW Spring 2010] Unit 10: XML and Web And beyond[DSBW Spring 2010] Unit 10: XML and Web And beyond
[DSBW Spring 2010] Unit 10: XML and Web And beyond
 
A Data Fusion System for Spatial Data Mining, Analysis and Improvement Silvij...
A Data Fusion System for Spatial Data Mining, Analysis and Improvement Silvij...A Data Fusion System for Spatial Data Mining, Analysis and Improvement Silvij...
A Data Fusion System for Spatial Data Mining, Analysis and Improvement Silvij...
 
Horizontal Integration of Big Intelligence Data
Horizontal Integration of Big Intelligence DataHorizontal Integration of Big Intelligence Data
Horizontal Integration of Big Intelligence Data
 
[ABDO] Data Integration
[ABDO] Data Integration[ABDO] Data Integration
[ABDO] Data Integration
 
Lecture 07: Localization and Mapping I
Lecture 07: Localization and Mapping ILecture 07: Localization and Mapping I
Lecture 07: Localization and Mapping I
 
Pal gov.tutorial2.session13 2.gav and lav integration
Pal gov.tutorial2.session13 2.gav and lav integrationPal gov.tutorial2.session13 2.gav and lav integration
Pal gov.tutorial2.session13 2.gav and lav integration
 
DSBW Final Exam (Spring Sementer 2010)
DSBW Final Exam (Spring Sementer 2010)DSBW Final Exam (Spring Sementer 2010)
DSBW Final Exam (Spring Sementer 2010)
 
Distributed databases and dbm ss
Distributed databases and dbm ssDistributed databases and dbm ss
Distributed databases and dbm ss
 
Lecture 09: Localization and Mapping III
Lecture 09: Localization and Mapping IIILecture 09: Localization and Mapping III
Lecture 09: Localization and Mapping III
 
Jarrar: Data Schema Integration
Jarrar: Data Schema Integration Jarrar: Data Schema Integration
Jarrar: Data Schema Integration
 
Database , 17 Web
Database , 17 WebDatabase , 17 Web
Database , 17 Web
 
1 ddbms jan 2011_u
1 ddbms jan 2011_u1 ddbms jan 2011_u
1 ddbms jan 2011_u
 
How to design a linear control system
How to design a linear control systemHow to design a linear control system
How to design a linear control system
 
Database , 15 Object DBMS
Database , 15 Object DBMSDatabase , 15 Object DBMS
Database , 15 Object DBMS
 

Similar to Enterprise and Data Mining Ontology Integration to Extract Actionable Knowledge-Final

IMPACT BIG DATA ANALYTIC AND KNOWLEDGE MANAGEMENT AS STRATEGY SERVICE ADVANTA...
IMPACT BIG DATA ANALYTIC AND KNOWLEDGE MANAGEMENT AS STRATEGY SERVICE ADVANTA...IMPACT BIG DATA ANALYTIC AND KNOWLEDGE MANAGEMENT AS STRATEGY SERVICE ADVANTA...
IMPACT BIG DATA ANALYTIC AND KNOWLEDGE MANAGEMENT AS STRATEGY SERVICE ADVANTA...
Rochelle Schear
 
10[1].1.1.115.9508
10[1].1.1.115.950810[1].1.1.115.9508
10[1].1.1.115.9508
okeee
 
F035431037
F035431037F035431037
F035431037
ijceronline
 
CREATING AND SHARING KNOWLEDGE THROUGH A CORPORATE SOCIAL NETWORKING SITE: TH...
CREATING AND SHARING KNOWLEDGE THROUGH A CORPORATE SOCIAL NETWORKING SITE: TH...CREATING AND SHARING KNOWLEDGE THROUGH A CORPORATE SOCIAL NETWORKING SITE: TH...
CREATING AND SHARING KNOWLEDGE THROUGH A CORPORATE SOCIAL NETWORKING SITE: TH...
Julio Figueroa
 
CRESUS: A TOOL TO SUPPORT COLLABORATIVE REQUIREMENTS ELICITATION THROUGH ENHA...
CRESUS: A TOOL TO SUPPORT COLLABORATIVE REQUIREMENTS ELICITATION THROUGH ENHA...CRESUS: A TOOL TO SUPPORT COLLABORATIVE REQUIREMENTS ELICITATION THROUGH ENHA...
CRESUS: A TOOL TO SUPPORT COLLABORATIVE REQUIREMENTS ELICITATION THROUGH ENHA...
cscpconf
 
Synthesis of questions and analysis and create grid.pdf
Synthesis of questions and analysis and create grid.pdfSynthesis of questions and analysis and create grid.pdf
Synthesis of questions and analysis and create grid.pdf
sdfghj21
 
data mining, a prediction technique for workers.pdf
data mining, a prediction technique for workers.pdfdata mining, a prediction technique for workers.pdf
data mining, a prediction technique for workers.pdf
NavAhmed3
 
A Customer Relationship Management Case Study Critical Success Factors In Ac...
A Customer Relationship Management Case Study  Critical Success Factors In Ac...A Customer Relationship Management Case Study  Critical Success Factors In Ac...
A Customer Relationship Management Case Study Critical Success Factors In Ac...
Yolanda Ivey
 
Dr prasanna karhade
Dr prasanna karhadeDr prasanna karhade
Dr prasanna karhade
Prasannakarhade
 
Application of AI in customer relationship management
Application of AI in customer relationship managementApplication of AI in customer relationship management
Application of AI in customer relationship management
Shashwat Shankar
 
Impact of business intelligence on
Impact of business intelligence onImpact of business intelligence on
Impact of business intelligence on
ijejournal
 
A Research Proposal On Electrical Vehicles
A Research Proposal On Electrical VehiclesA Research Proposal On Electrical Vehicles
A Research Proposal On Electrical Vehicles
Who Will Write My Paper For Me Florissant
 
1st Meeting of Colloqium
1st Meeting of Colloqium1st Meeting of Colloqium
1st Meeting of Colloqium
Edhi Juwono
 
6711SafeAssign Originality Report69Total S.docx
6711SafeAssign Originality Report69Total S.docx6711SafeAssign Originality Report69Total S.docx
6711SafeAssign Originality Report69Total S.docx
priestmanmable
 
6711SafeAssign Originality Report69Total S.docx
6711SafeAssign Originality Report69Total S.docx6711SafeAssign Originality Report69Total S.docx
6711SafeAssign Originality Report69Total S.docx
sodhi3
 
Mustafa Degerli - 2010 - Annotated Bibliography - IS 720 Research Methods in ...
Mustafa Degerli - 2010 - Annotated Bibliography - IS 720 Research Methods in ...Mustafa Degerli - 2010 - Annotated Bibliography - IS 720 Research Methods in ...
Mustafa Degerli - 2010 - Annotated Bibliography - IS 720 Research Methods in ...
Dr. Mustafa Değerli
 
University of PlymouthPEARL httpspearl.plymouth.ac.uk.docx
University of PlymouthPEARL httpspearl.plymouth.ac.uk.docxUniversity of PlymouthPEARL httpspearl.plymouth.ac.uk.docx
University of PlymouthPEARL httpspearl.plymouth.ac.uk.docx
ouldparis
 
Towards a Theoretical Model for Human Resource Management Information Systems...
Towards a Theoretical Model for Human Resource Management Information Systems...Towards a Theoretical Model for Human Resource Management Information Systems...
Towards a Theoretical Model for Human Resource Management Information Systems...
IOSRJBM
 
A Practical Application Of Critical Systems Thinking To Improve A Business In...
A Practical Application Of Critical Systems Thinking To Improve A Business In...A Practical Application Of Critical Systems Thinking To Improve A Business In...
A Practical Application Of Critical Systems Thinking To Improve A Business In...
Lisa Garcia
 
Tam & toe
Tam & toeTam & toe
Tam & toe
Sammer Qader
 

Similar to Enterprise and Data Mining Ontology Integration to Extract Actionable Knowledge-Final (20)

IMPACT BIG DATA ANALYTIC AND KNOWLEDGE MANAGEMENT AS STRATEGY SERVICE ADVANTA...
IMPACT BIG DATA ANALYTIC AND KNOWLEDGE MANAGEMENT AS STRATEGY SERVICE ADVANTA...IMPACT BIG DATA ANALYTIC AND KNOWLEDGE MANAGEMENT AS STRATEGY SERVICE ADVANTA...
IMPACT BIG DATA ANALYTIC AND KNOWLEDGE MANAGEMENT AS STRATEGY SERVICE ADVANTA...
 
10[1].1.1.115.9508
10[1].1.1.115.950810[1].1.1.115.9508
10[1].1.1.115.9508
 
F035431037
F035431037F035431037
F035431037
 
CREATING AND SHARING KNOWLEDGE THROUGH A CORPORATE SOCIAL NETWORKING SITE: TH...
CREATING AND SHARING KNOWLEDGE THROUGH A CORPORATE SOCIAL NETWORKING SITE: TH...CREATING AND SHARING KNOWLEDGE THROUGH A CORPORATE SOCIAL NETWORKING SITE: TH...
CREATING AND SHARING KNOWLEDGE THROUGH A CORPORATE SOCIAL NETWORKING SITE: TH...
 
CRESUS: A TOOL TO SUPPORT COLLABORATIVE REQUIREMENTS ELICITATION THROUGH ENHA...
CRESUS: A TOOL TO SUPPORT COLLABORATIVE REQUIREMENTS ELICITATION THROUGH ENHA...CRESUS: A TOOL TO SUPPORT COLLABORATIVE REQUIREMENTS ELICITATION THROUGH ENHA...
CRESUS: A TOOL TO SUPPORT COLLABORATIVE REQUIREMENTS ELICITATION THROUGH ENHA...
 
Synthesis of questions and analysis and create grid.pdf
Synthesis of questions and analysis and create grid.pdfSynthesis of questions and analysis and create grid.pdf
Synthesis of questions and analysis and create grid.pdf
 
data mining, a prediction technique for workers.pdf
data mining, a prediction technique for workers.pdfdata mining, a prediction technique for workers.pdf
data mining, a prediction technique for workers.pdf
 
A Customer Relationship Management Case Study Critical Success Factors In Ac...
A Customer Relationship Management Case Study  Critical Success Factors In Ac...A Customer Relationship Management Case Study  Critical Success Factors In Ac...
A Customer Relationship Management Case Study Critical Success Factors In Ac...
 
Dr prasanna karhade
Dr prasanna karhadeDr prasanna karhade
Dr prasanna karhade
 
Application of AI in customer relationship management
Application of AI in customer relationship managementApplication of AI in customer relationship management
Application of AI in customer relationship management
 
Impact of business intelligence on
Impact of business intelligence onImpact of business intelligence on
Impact of business intelligence on
 
A Research Proposal On Electrical Vehicles
A Research Proposal On Electrical VehiclesA Research Proposal On Electrical Vehicles
A Research Proposal On Electrical Vehicles
 
1st Meeting of Colloqium
1st Meeting of Colloqium1st Meeting of Colloqium
1st Meeting of Colloqium
 
6711SafeAssign Originality Report69Total S.docx
6711SafeAssign Originality Report69Total S.docx6711SafeAssign Originality Report69Total S.docx
6711SafeAssign Originality Report69Total S.docx
 
6711SafeAssign Originality Report69Total S.docx
6711SafeAssign Originality Report69Total S.docx6711SafeAssign Originality Report69Total S.docx
6711SafeAssign Originality Report69Total S.docx
 
Mustafa Degerli - 2010 - Annotated Bibliography - IS 720 Research Methods in ...
Mustafa Degerli - 2010 - Annotated Bibliography - IS 720 Research Methods in ...Mustafa Degerli - 2010 - Annotated Bibliography - IS 720 Research Methods in ...
Mustafa Degerli - 2010 - Annotated Bibliography - IS 720 Research Methods in ...
 
University of PlymouthPEARL httpspearl.plymouth.ac.uk.docx
University of PlymouthPEARL httpspearl.plymouth.ac.uk.docxUniversity of PlymouthPEARL httpspearl.plymouth.ac.uk.docx
University of PlymouthPEARL httpspearl.plymouth.ac.uk.docx
 
Towards a Theoretical Model for Human Resource Management Information Systems...
Towards a Theoretical Model for Human Resource Management Information Systems...Towards a Theoretical Model for Human Resource Management Information Systems...
Towards a Theoretical Model for Human Resource Management Information Systems...
 
A Practical Application Of Critical Systems Thinking To Improve A Business In...
A Practical Application Of Critical Systems Thinking To Improve A Business In...A Practical Application Of Critical Systems Thinking To Improve A Business In...
A Practical Application Of Critical Systems Thinking To Improve A Business In...
 
Tam & toe
Tam & toeTam & toe
Tam & toe
 

Enterprise and Data Mining Ontology Integration to Extract Actionable Knowledge-Final

  • 1. Enterprise and Data Mining Ontology Integration to Extract Actionable Knowledge HamidReza Nazari1 , MohammadTaghi TaghaviFard1 and Iman Raeesi Vanani1 1 Department of Management and Accounting, Allameh Tabataba’i University, Tehran, Iran Hamidnazary2002@yahoo.com, {Taghavifard, imanraeesi }@atu.ac.ir 1 RESEARCH PROBLEM Today, it is hard to imagine an enterprise with no manipulated processes to manage its data and information. Organizations invest their money and efforts in Information Technologies (IT) with the aim of gaining competitive advantages. Knowledge Discovery in Databases (KDD) is one of the most attractive notions to make real-time and appropriate decisions and to gain sustainable competetive advantages. Knowledge discovery in big data era is expected to change the global economy by 2020, raising 325 billion dollars only in U.S retailing and manufacturing sectors(McKinsey & Company, 2013). Despite this, knowledge discovered by data mining (DM) algorithms are not actionable enough to grow the economy (Cao, 2012; Way & Wang, 2008). Feritas(2006) in an investigation of association rules reported that only 1% of the outcomes is actionable, 220 rules among 29050 extracted rules. Existing mining techiques and algorithms have reached considerable levels of efficiency but their effectiveness on identifying actionable knowledge is impaired (Antunes, 2014). DM Researchers have focued more on providing creative algorithms and investigated the outcomes from accuracy and efficiency point of view. They have simplified the problem, not identifying business requirements and context completely (Cao, 2012; Hirsh, 2008). The knowledge discovered by algorithms is mainly passive and is not quite active and helpful for managers to make a decision. So, more proccesing and analysis is needed to create actionable knowledge. This gap is rooted in neglecting business understanding (BU) in formulating a DM problem. There is a need to shift from data driven DM to the domain driven one (Cao, 2010, 2012). Mariscal et al (2010) in a deep review of DM methodologies and models, mention these phases to develop CRISP- DM, to address its weaknesses in BU and deploying results: 1) Domain knowledge elicitation to constrain the learning algorithms search space and to reduce the number of patterns discovered, 2) Problem specification to better understand the problem and at the same time, it has to be considered that objectives and goals are dynamic due to changes in the requirements of the organization or business, and they must be reviewed and updated regularly, 3) Interpretation to ensure achieving business requirements, 4) Automation for supporting non- expert data mining users to get new knowledge from new data and existing models in an easy way, 5) and, Maintenance. Thus, the research problems are 1) How to formalize BU in DM process? 2) How the formalization would lead to actionable knowledge? 2 OUTLINE OF OBJECTIVES The objectives of this research involve the following: 1) Designing an artefact to formally apply BU in DM. 2) Semi-automating BU phase of DM to help a wide range of users to avoid common analytical mistakes. 3) To facilitate assessing and deploying knowledge discovery algorithms results in business context. 3 STATE OF THE ART Output of BU determines relevant data for data understanding phase. It determines if target variable should be discretized in data preparation phase. It results in enumeration of applicable modelling techniques in data modelling phase. And finally, it determines the evaluation criteria and actionability of the results (Sharma & Osei-Bryson, 2015).
  • 2. Cespivova et el (2004) outlined the possible roles of a domain ontology in all phases of data mining process. Enterprise ontology models an enterprise in the form of information system. The activities of BU phase span across concepts and their relationships are described in an enterprise ontology. Sharma & Osei-Bryson (2008) assert that no other framework has the ability to account for BU phase activities. Li (2014) assert that to support decision in BU, case based reasoning, ontology based approach and work flow management reasoning can be useful. Antunes and Silva (2014) also emphasize using domain ontology to improve DM effectiveness (Fig 1). Insert Figure 1 here Many DM ontologies have developed in the last decade, like DMOP, ONTO-DM, DMW3, to name a few. DM ontologies have the ability to select best fit algorithms among the pool of DM algorithms. Most of them have neglected BU and focused on data understanding, preparation and modelling. Table 1 summarises coverage of DM ontologies based on CRISP-DM. Insert table 1 here In summary, 1) This research attempts to match enterprise and DM ontologies to achieve the aforementioned objectives. According to our literature review, no similar research can be found in the literature. 2) This research will cultivate the theoretical foundations of enterprise engineering, DM and ontology engineering by designing an integrated artefact to formalize BU of DM process and to facilitate extracting actionable knowledge. 3) Theoretical contribution of the research will enhance the interoperability of DM process. 4) The research is a step forward to automating DM process more wisely and to helps business users and DM experts to convert their ad hoc reasoning to more systematic and ontology aware one. 4 METHODOLOGY Simon (1969) made a clear distinction between natural sciences and sciences of artificial or sciences of design. Artefacts are purposely designed artificial things to satisfy human’s goals. An artefact is an interface between the substance and organization of itself and also external environment in which it operates. Information system discipline is rooted in the creation of information technology artefacts (Hevner et al, 2004). Thus, this research will apply design science approach. To build an artefact, we will review current enterprise and DM ontologies. It is expected to find basic concepts and relationships essential for integration in this step. The construct will be evaluated by experts based on its completeness, understandability and explicitness. Thus, the integrated ontology will be built and evaluated in this step. Solving the identified problem is one the main disciplines of design science research methodology. Therefore, we will use an ontology integration methodology to build a prototype in OWL or XML. Alignment and correspondence of the prototype have to be evaluated in this step. Finally, an instance for an enterprise unit will be built based on the prototype. Case study is the method of evaluation here. We will not directly compute actionability, instead alignment of the extracted knowledge and the business goal will be indicated the actionability of the outcomes. 5 EXPECTED OUTCOME Achieving the research objectives requires a developed enterprise ontology that can be seamlessly integrated with DM ontology. So, the first outcome of the research is developed enterprise ontology. In term of design science research methodology, this is the built construct of the research. This ontology will formalize concepts mentioned in BU phase of CRISP-DM like business objectives, requirements and constraints. Also, it will be useful in automating data understanding phase of CRISP-DM. Defining interactions between concepts of enterprise and DM ontologies and developing relations between them are the second expected outcome of the research. This construct will be used by ontology integration methodologies to build the instance aspect of research artefact. These interactions and relations will bridge the semantic gap between business users and DM experts. This way, the complexity of DM for novice data miners will be reduced. This complexity is a main barrier for many organizations applying or utilizing DM processes and methodology. According to the developed correspondences and the built instance, it is expected to prune the outcomes of DM algorithms more easily and to
  • 3. extract more actionable knowledge based on semantic similarity computing of them. As a result, the research outcome facilitates the activities of evaluation phase of CRISP-DM. 6 STAGES OF THE RESEARCH The research is in its early stage. We have started to investigate enterprise and DM ontologies to select or propose suitable ones for the integration. REFERENCES Antunes, C., & Silva, A. 2014. New Trends in Knowledge Driven Data Mining. Proceedings from ICEIS ’14. International Conference on Enterprise Information Systems, 346-351. Cao, L. (2012). Actionable knowledge discovery and delivery. Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery, 2(2), 149-163. Cao, L. (2010). Domain-driven data mining: Challenges and prospects. Knowledge and Data Engineering, IEEE Transactions on, 22(6), 755-769 Cešpivová, H., Rauch, J., Svatek, V., Kejkula, M., & Tomeckova, M. (2004). Roles of medical ontology in association mining Crisp-DM cycle. In ECML/ PKDD04 Workshop on Knowledge Discovery and Ontologies (KDO’04), Pisa (Vol. 220). Freitas AA (2006) Are we really discovering “interesting” knowledge from data? Expert Update, Vol. 9, No. 1, 41-47, autumn 2006. Hevner, A., March, S., Park, J., and Ram, S. (2004) Design Science in Information Systems Research. MIS Quarterly, 28(1). Hirsh, H. (2008). Data mining research: Current status and future opportunities. Statistical Analysis and Data Mining: The ASA Data Science Journal, 1(2), 104-107 Li, Y. (2014) New Artefacts for the Knowledge Discovery via Data Analytics (KDDA) Process. PhD Dissertation, Virginia Commonwealth University, USA. Mariscal, G., Marbán, Ó. & Fernández, C. (2010). A survey of data mining and knowledge discovery process models and methodologies. The Knowledge Engineering Review, 25(02), 137-166 Mckinsey and Company. (2013). Game changers: Five opportunities for US growth and renewal. McKinsey Global Institute. Sharma, S., and Osei-Bryson, K. (2015) A Method for Formulating the Business Objectives of Data Mining Projects. In Knowledge Discovery Process and Methods to Enhance Organizational Performance. Osei-Bryson, K, and Barclay, C. (editors), CRC Press Simon, H. A. 1969. The Sciences of the Artificial, (The MIT Press: Cambridge, MA. Wang, H., & Wang, S. (2008). A knowledge management approach to data mining process for business intelligence. Industrial Management & Data Systems, 108(5), 622-634.
  • 4. Figure 1. Research artefact Table 1. DM ontology comparison BU Data Understanding Data preparation Modelling Evaluation Deployme nt DMW3 DMOP ONTO-DM KDDONTO Expose KD-Ontology Intelligent Discovery Electronic Assistant (IDEA) DAMON Ontological Learning Assistant (OLA)