SlideShare a Scribd company logo
Introduction to
Knowledge Engineering
Dr. Albert Fornells Herrera
http://www.linkedin.com/in/afornells
Introduction to Knowledge Engineering - Albert Fornells
1. Definition
2. Process of Knowledge Engineering
3. References
2
Outline
Introduction to Knowledge Engineering - Albert Fornells
 What is Knowledge Engineering?
 The process of building intelligent knowledge-based systems
 Build a KBS is still more art than engineering
 Knowledge is incomplete, uncertain and heuristic
 Complex to estimate the nature and the amount of knowledge needed
 An incremental and prototyped approach is the way to incrementally evolve
the system from its conception until the final version
 Six iterative basic phases (Waterman, 1986; Durkin, 1994):
1. Problem assessment
2. Data and knowledge acquisition
3. Development of a prototype system
4. Development of a complete system
5. Evaluation and revision of the system
6. Integration and maintenance of the system
3
Knowledge Engineering
Introduction to Knowledge Engineering - Albert Fornells
4
The process of Knowledge Engineering
Introduction to Knowledge Engineering - Albert Fornells
Problem assessment
Data and knowledge
acquisition
Development of a
prototype system
Development of a
complete system
Evaluation and revision of
the system
Integration and
maintenance of the system
Determine the characteristics of the problem
Identify the main participants in the project
Specify the project’s objective
Determine the resources needed for building the system
Collect and analyze data and knowledge
Make key concepts of the system design more explicit
Choose a tool for building an Intelligent system
Transform data and represent the knowledge
Design and implement a prototype system
Test the prototype with test cases
Prepare a detailed design for a full-scale system
Collect additional data and knowledge
Develop the user interface
Implement the complete system
Evaluate the system against the performance criteria
Revise the system as necessary
Make arrangements for technology transfer
Establish an effective maintenance program
Requirements
Concepts
Structure
KBS ‘beta’
Reformulation
Redesign
Tuning
KBS tested
Maintenance
Assessment
Redesign
Iterations
 Goal: (1) Determine the characteristics of the problem; (2) Identify the participants
of the project; (3) Specify the objectives of the project and (4) Determine what
resources are needed for building the system.
 The problem characterization implies:
 Determine the problem type:
 Diagnosis. Inferring malfunctions of an object from its behavior and recommending solutions
 Selection. Recommending the best option from a list of possible alternatives
 Prediction. Predicting the future behavior of an object from its behavior in the past
 Classification. Assigning an object to one of the defined classes
 Clustering. Dividing an heterogeneous group of objects into homogeneous subgroups
 Optimization. Improving the quality of solutions until an optimal one is found
 Control Governing the behavior of an object to meet specified requirements in real-time
 Input and output variables and their interactions
 The form and content of the solution.
5
KE process: Problem assessment
Introduction to Knowledge Engineering - Albert Fornells
 Identify the participants of the project. There are two critical participants:
 The knowledge engineer (a person capable of designing, building and testing an
intelligent system)
 The domain expert (a knowledgeable person capable of solving problems in a
specific area or domain)
 Specify the project’s objectives
 Gaining a competitive edge, improving the quality of decisions, reducing labor costs,
and improving the quality of products and services, etc
 Determine what resources are needed for building the system
 Computer facilities
 Development software
 Knowledge and data sources (human experts, textbooks, manuals, web sites,
databases and examples)
 Money
6
KE process: Problem assessment
Introduction to Knowledge Engineering - Albert Fornells
 Toy problem: Found X.
 Do you think that a KBS is needed?
 X=sqrt(3^2+4^2)=5
 Maybe it is not necessary
 But maybe experts do not think the same because they misunderstand the
questions.
 They apply computer vision techniques for text recognition.
 Who is wrong?
7
KE process: Problem assessment
Introduction to Knowledge Engineering - Albert Fornells
4cm
3cm
x
4cm
3cm
x
Here is
 Goal: To obtain further understanding of the problem domain by (1) collecting and
analyzing both data and knowledge and (2) making key concepts of the system’s
design more explicit
 The choice of the system building tool depends on the acquired data. Data is
often collected from different sources and thus can be of different types.
 Continuous variables vs variables divided into several ranges vs normalized (i.e. 0 - 1)
 Symbolic (textual) data vs numerical data vs heterogeneous data
 Imprecise and noisy data vs well-defined and clean data
 As a result, the data must be transformed, or massaged, into the form useful for
a particular tool. There are three important issues that must be resolved before
massaging the data (Berry and Linoff, 1997)
 Incompatible data
 Inconsistent data
 Missing data
8
KE process: Data and knowledge acquisition
Introduction to Knowledge Engineering - Albert Fornells
 Knowledge acquisition is an inherently iterative process based on:
 Start with reviewing documents and reading books, papers and manuals related to
the problem domain in order to become familiar with the problem
 Collect further knowledge through interviewing the domain expert. The expert is
asked to identify four or five typical cases, describe how each case is solved and
explain, or ‘think out loud’, the reasoning behind each solution (Russell, 2002)
 Study and analyze the acquired knowledge, and repeat the entire process again
 However, extracting knowledge from a human expert is a difficult process – it is
often called the ‘knowledge acquisition bottleneck’.
 Quite often experts are unaware of the amount of knowledge they have and the
problem-solving strategy they use, or are unable to verbalize it.
 Experts may also provide us with irrelevant, incomplete or inconsistent information
 Example of cheese factory (Donald Michie, 1982)
 The data and knowledge acquired should enable us to describe the problem-
solving strategy at the most abstract level, but any detailed analysis has to be
done before evaluating the prototype
9
KE process: Data and knowledge acquisition
Introduction to Knowledge Engineering - Albert Fornells
 Goal: Create an intelligent system – or, rather, a small version of it – and test it
with a number of test cases.
 What is a prototype?
 A prototype system can be defined as a small version of the final system
 It is designed to test how well we understand the problem, or in other words, to make
sure that the problem-solving strategy, the tool selected for building a system, and
techniques for representing acquired data and knowledge are adequate to the task
 It also provides us with an opportunity to actively engage the domain expert in the
system’s development
 What is a test case?
 Once it is built, the prototype’s performance needs to be tested (usually together with
the domain expert)
 The domain expert takes an active part in testing the system and, as a result,
becomes more involved in the system’s development
 What should we do if we have made a bad choice of the system-building tool?
 We should throw the prototype away and start the prototyping phase over again
10
KE process: Development of a prototype system
Introduction to Knowledge Engineering - Albert Fornells
 Goal: Once prototype is assessed, we develop a plan, schedule and budget for
the complete system, and also clearly define the system’s performance.
 Main milestones:
 Adding data and knowledge to the system (i.e. collect core rules, additional
historical examples, etc.)
 Develop the user interface for delivering information to a user. Some system
focuses on explaining its reasoning process and justify its advice. Others represent
data in graphical forms.
 The development is like an evolutionary process where, as the project needs,
new data and knowledge are collected and added to the system.
11
KE process: Development of a complete system
Introduction to Knowledge Engineering - Albert Fornells
 Goal: To evaluate an intelligent system is, in fact, to assure that the system
performs the intended task to the user’s satisfaction
 Intelligent systems are designed to solve problems that quite often do not have
clearly defined ‘right’ and ‘wrong’ solutions
 A formal evaluation of the system is normally accomplished with the test cases
selected by the user
 The system’s performance is compared against the performance criteria that were
agreed at the end of the prototyping phase
 The evaluation often reveals the system’s limitations and weaknesses, so it is
revised and relevant development phases are repeated
 There are specific methodological procedures to assess the performance
(Ian Witten, 2005)
 X Stratified cross-validation
 Leave-one out
 Statistical tests are a common way for comparing the performance of the
approach with respect to experts and also with respect to others approaches
12
KE process: Evaluation and revision
Introduction to Knowledge Engineering - Albert Fornells
 Goal: It involves integrating the system into the environment where it will
operate and establishing an effective maintenance program
 What means integration?
 Integration means interfacing the new intelligent system with existing systems within
an organization and arranging for technology transfer
 What means establish a maintenance program?
 Intelligent systems are knowledge-based systems, and because knowledge evolves
over time, the system has to be able to be upgraded
 The organization that uses the system should have in-house expertise to maintain
and modify the system
 Is maintenance forever?
 No, a big change in the knowledge domain could imply to throw away the intelligent
system and start from the beginning.
13
KE process: Integration and maintenance
Introduction to Knowledge Engineering - Albert Fornells
14
Summary: Roadmap for tackling a problem
Introduction to Knowledge Engineering - Albert Fornells
Knowledge
Problem
Analysis
Data
Analysis
Data
Processing
Modeling
Evaluation
Production
Problem: I want to develop an intelligent system that can help me to fix
malfunctions of my computer.
Describe, from a global point of view, how will you tackle the process of Knowledge
Engineering?
1. Problem assessment
2. Data and knowledge acquisition
3. Development of a prototype system
4. Development of a complete system
5. Evaluation and revision of the system
6. Integration and maintenance of the system
15
Exercise: Will a KBS work for my problem?
Introduction to Knowledge Engineering - Albert Fornells
 A. J. González y D. D. Dankel, The Engineering of Knowledge Based Systems.
Theory and Practice. Prentice Hall, 2000.
16
References
Introduction to Knowledge Engineering - Albert Fornells

More Related Content

What's hot

Unit 11-systems-analysis-and-design
Unit 11-systems-analysis-and-designUnit 11-systems-analysis-and-design
Unit 11-systems-analysis-and-designZile Mafantiri
 
Ced unit 1 notes-new
Ced unit 1 notes-newCed unit 1 notes-new
Ced unit 1 notes-new
Dr.Yogesh Dewang
 
A survey of controlled experiments in software engineering
A survey of controlled experiments in software engineeringA survey of controlled experiments in software engineering
A survey of controlled experiments in software engineeringJULIO GONZALEZ SANZ
 
SYSTEM ANALYSIS AND DESIGN Assignment help
SYSTEM ANALYSIS AND DESIGN Assignment helpSYSTEM ANALYSIS AND DESIGN Assignment help
SYSTEM ANALYSIS AND DESIGN Assignment help
john mayer
 
System Analysis and Design (SAD)
System Analysis and Design (SAD)System Analysis and Design (SAD)
System Analysis and Design (SAD)
Sachith Perera
 
Promise 2011: "An Iterative Semi-supervised Approach to Software Fault Predic...
Promise 2011: "An Iterative Semi-supervised Approach to Software Fault Predic...Promise 2011: "An Iterative Semi-supervised Approach to Software Fault Predic...
Promise 2011: "An Iterative Semi-supervised Approach to Software Fault Predic...
CS, NcState
 
Systems Analysis
Systems AnalysisSystems Analysis
Systems Analysis
Bli Wilson
 
Artificial Intelligence: Case-based & Model-based Reasoning
Artificial Intelligence: Case-based & Model-based ReasoningArtificial Intelligence: Case-based & Model-based Reasoning
Artificial Intelligence: Case-based & Model-based Reasoning
The Integral Worm
 
System analysis
System analysisSystem analysis
System analysis
Self-employed
 
system analysis and design chapter 1 Kendall & Kendall
system analysis and design chapter 1 Kendall & Kendallsystem analysis and design chapter 1 Kendall & Kendall
system analysis and design chapter 1 Kendall & Kendall
Dana dia
 
Applying Systems Thinking to Solve Wicked Problems in Software Engineering
Applying Systems Thinking to Solve Wicked Problems in Software EngineeringApplying Systems Thinking to Solve Wicked Problems in Software Engineering
Applying Systems Thinking to Solve Wicked Problems in Software Engineering
Majed Ayyad
 
A survey of fault prediction using machine learning algorithms
A survey of fault prediction using machine learning algorithmsA survey of fault prediction using machine learning algorithms
A survey of fault prediction using machine learning algorithms
Ahmed Magdy Ezzeldin, MSc.
 
A&D - Fact Finding Techniques
A&D - Fact Finding TechniquesA&D - Fact Finding Techniques
A&D - Fact Finding Techniquesvinay arora
 
Big shadow test
Big shadow testBig shadow test
Big shadow test
munsif123
 
7. evalution of interactive system
7. evalution of interactive system7. evalution of interactive system
7. evalution of interactive system
Kh Ravy
 
Wcre13b.ppt
Wcre13b.pptWcre13b.ppt
Wcre13b.ppt
Ptidej Team
 
Test equating using irt. final
Test equating using irt. finalTest equating using irt. final
Test equating using irt. final
munsif123
 

What's hot (20)

Unit 11-systems-analysis-and-design
Unit 11-systems-analysis-and-designUnit 11-systems-analysis-and-design
Unit 11-systems-analysis-and-design
 
Ced unit 1 notes-new
Ced unit 1 notes-newCed unit 1 notes-new
Ced unit 1 notes-new
 
A survey of controlled experiments in software engineering
A survey of controlled experiments in software engineeringA survey of controlled experiments in software engineering
A survey of controlled experiments in software engineering
 
SYSTEM ANALYSIS AND DESIGN Assignment help
SYSTEM ANALYSIS AND DESIGN Assignment helpSYSTEM ANALYSIS AND DESIGN Assignment help
SYSTEM ANALYSIS AND DESIGN Assignment help
 
System Analysis and Design (SAD)
System Analysis and Design (SAD)System Analysis and Design (SAD)
System Analysis and Design (SAD)
 
Promise 2011: "An Iterative Semi-supervised Approach to Software Fault Predic...
Promise 2011: "An Iterative Semi-supervised Approach to Software Fault Predic...Promise 2011: "An Iterative Semi-supervised Approach to Software Fault Predic...
Promise 2011: "An Iterative Semi-supervised Approach to Software Fault Predic...
 
Systems Analysis
Systems AnalysisSystems Analysis
Systems Analysis
 
Artificial Intelligence: Case-based & Model-based Reasoning
Artificial Intelligence: Case-based & Model-based ReasoningArtificial Intelligence: Case-based & Model-based Reasoning
Artificial Intelligence: Case-based & Model-based Reasoning
 
System analysis
System analysisSystem analysis
System analysis
 
system analysis and design chapter 1 Kendall & Kendall
system analysis and design chapter 1 Kendall & Kendallsystem analysis and design chapter 1 Kendall & Kendall
system analysis and design chapter 1 Kendall & Kendall
 
Prvt file test
Prvt file testPrvt file test
Prvt file test
 
Applying Systems Thinking to Solve Wicked Problems in Software Engineering
Applying Systems Thinking to Solve Wicked Problems in Software EngineeringApplying Systems Thinking to Solve Wicked Problems in Software Engineering
Applying Systems Thinking to Solve Wicked Problems in Software Engineering
 
A survey of fault prediction using machine learning algorithms
A survey of fault prediction using machine learning algorithmsA survey of fault prediction using machine learning algorithms
A survey of fault prediction using machine learning algorithms
 
A&D - Fact Finding Techniques
A&D - Fact Finding TechniquesA&D - Fact Finding Techniques
A&D - Fact Finding Techniques
 
Big shadow test
Big shadow testBig shadow test
Big shadow test
 
7. evalution of interactive system
7. evalution of interactive system7. evalution of interactive system
7. evalution of interactive system
 
Wcre13b.ppt
Wcre13b.pptWcre13b.ppt
Wcre13b.ppt
 
Chap06
Chap06Chap06
Chap06
 
Check upload1
Check upload1Check upload1
Check upload1
 
Test equating using irt. final
Test equating using irt. finalTest equating using irt. final
Test equating using irt. final
 

Viewers also liked

Spunti - Biblioteca como 18 febbraio 2014
Spunti - Biblioteca como 18 febbraio 2014Spunti - Biblioteca como 18 febbraio 2014
Spunti - Biblioteca como 18 febbraio 2014
MIUR
 
CERT - NADIA - OFFICE ADMINISTRATION
CERT - NADIA - OFFICE ADMINISTRATIONCERT - NADIA - OFFICE ADMINISTRATION
CERT - NADIA - OFFICE ADMINISTRATIONMaria Raju
 
Y O G A A N D S T R E S S Dr
Y O G A  A N D  S T R E S S  DrY O G A  A N D  S T R E S S  Dr
Y O G A A N D S T R E S S Drdrsolapurkar
 
Balancing It All
Balancing It AllBalancing It All
Balancing It All
Bruce Kasanoff
 
Neurosensory disturvances following surgucal removal of mandibular third molar
Neurosensory disturvances following surgucal removal of mandibular third molarNeurosensory disturvances following surgucal removal of mandibular third molar
Neurosensory disturvances following surgucal removal of mandibular third molarIndian dental academy
 
Electricity
ElectricityElectricity
Electricity
ngibellini
 
Licopodios
LicopodiosLicopodios
Licopodios
Osmara Yanileisi
 
Cícadas
CícadasCícadas
Cálculos e recomendações para correias de transmissão de potência em v good...
Cálculos e recomendações para correias de transmissão de potência em v   good...Cálculos e recomendações para correias de transmissão de potência em v   good...
Cálculos e recomendações para correias de transmissão de potência em v good...
Diogo Bellé
 
Autonomous car
Autonomous carAutonomous car
Autonomous car
Jay Patel
 
PLANTAS VASCULARES SIN SEMILLA
PLANTAS VASCULARES SIN SEMILLAPLANTAS VASCULARES SIN SEMILLA
PLANTAS VASCULARES SIN SEMILLA
Alondra Cervantes
 
Autonomous car
Autonomous carAutonomous car
Autonomous car
Anil kale
 

Viewers also liked (17)

Spunti - Biblioteca como 18 febbraio 2014
Spunti - Biblioteca como 18 febbraio 2014Spunti - Biblioteca como 18 febbraio 2014
Spunti - Biblioteca como 18 febbraio 2014
 
JOSE CASIMIRO File
JOSE CASIMIRO FileJOSE CASIMIRO File
JOSE CASIMIRO File
 
CERT - NADIA - OFFICE ADMINISTRATION
CERT - NADIA - OFFICE ADMINISTRATIONCERT - NADIA - OFFICE ADMINISTRATION
CERT - NADIA - OFFICE ADMINISTRATION
 
Iss
IssIss
Iss
 
2009 11 16 Landerd Bedrijfspresentatie Op Internet
2009 11 16 Landerd Bedrijfspresentatie Op Internet2009 11 16 Landerd Bedrijfspresentatie Op Internet
2009 11 16 Landerd Bedrijfspresentatie Op Internet
 
geoext_dwins
geoext_dwinsgeoext_dwins
geoext_dwins
 
Y O G A A N D S T R E S S Dr
Y O G A  A N D  S T R E S S  DrY O G A  A N D  S T R E S S  Dr
Y O G A A N D S T R E S S Dr
 
Balancing It All
Balancing It AllBalancing It All
Balancing It All
 
Resume - Muhammad Shahzad
Resume - Muhammad  ShahzadResume - Muhammad  Shahzad
Resume - Muhammad Shahzad
 
Neurosensory disturvances following surgucal removal of mandibular third molar
Neurosensory disturvances following surgucal removal of mandibular third molarNeurosensory disturvances following surgucal removal of mandibular third molar
Neurosensory disturvances following surgucal removal of mandibular third molar
 
Electricity
ElectricityElectricity
Electricity
 
Licopodios
LicopodiosLicopodios
Licopodios
 
Cícadas
CícadasCícadas
Cícadas
 
Cálculos e recomendações para correias de transmissão de potência em v good...
Cálculos e recomendações para correias de transmissão de potência em v   good...Cálculos e recomendações para correias de transmissão de potência em v   good...
Cálculos e recomendações para correias de transmissão de potência em v good...
 
Autonomous car
Autonomous carAutonomous car
Autonomous car
 
PLANTAS VASCULARES SIN SEMILLA
PLANTAS VASCULARES SIN SEMILLAPLANTAS VASCULARES SIN SEMILLA
PLANTAS VASCULARES SIN SEMILLA
 
Autonomous car
Autonomous carAutonomous car
Autonomous car
 

Similar to Introduction to knowledge discovery

Systems Analysis And Design 2
Systems Analysis And Design 2Systems Analysis And Design 2
Systems Analysis And Design 2MISY
 
Mis
MisMis
MODULE 1_Introduction to Data analytics and life cycle..pptx
MODULE 1_Introduction to Data analytics and life cycle..pptxMODULE 1_Introduction to Data analytics and life cycle..pptx
MODULE 1_Introduction to Data analytics and life cycle..pptx
nikshaikh786
 
Artificial Intelligence Notes Unit 5
Artificial Intelligence Notes Unit 5Artificial Intelligence Notes Unit 5
Artificial Intelligence Notes Unit 5
DigiGurukul
 
Section b
Section bSection b
Section bPCTE
 
IS L02 - Development of Information Systems
IS L02 - Development of Information SystemsIS L02 - Development of Information Systems
IS L02 - Development of Information Systems
Jan Wong
 
4 IT Interview Question.pdf
4 IT Interview Question.pdf4 IT Interview Question.pdf
4 IT Interview Question.pdf
TendaiZulu
 
Expert system (unit 1 & 2)
Expert system (unit 1 & 2)Expert system (unit 1 & 2)
Expert system (unit 1 & 2)
Lakshya Gupta
 
System analysis and design
System analysis and design System analysis and design
System analysis and design Razan Al Ryalat
 
Concepts of system analysis
Concepts of system analysisConcepts of system analysis
Concepts of system analysisALFIYA ALSALAM
 
Different Approaches To Sys Bldg
Different Approaches To Sys BldgDifferent Approaches To Sys Bldg
Different Approaches To Sys Bldg
USeP
 
Senior ipt term4_casestudy
Senior ipt term4_casestudySenior ipt term4_casestudy
Senior ipt term4_casestudyhccit
 
06 styles and_greenfield_design
06 styles and_greenfield_design06 styles and_greenfield_design
06 styles and_greenfield_designMajong DevJfu
 
Hsc project management 2018pptx
Hsc project management 2018pptxHsc project management 2018pptx
Hsc project management 2018pptx
greg robertson
 
SAD_SDLC.pptx
SAD_SDLC.pptxSAD_SDLC.pptx
SAD_SDLC.pptx
SharmilaMore5
 
System_Analysis_and_Design_Assignment_New2.ppt
System_Analysis_and_Design_Assignment_New2.pptSystem_Analysis_and_Design_Assignment_New2.ppt
System_Analysis_and_Design_Assignment_New2.ppt
MarissaPedragosa
 
Unit 2 Classical Systems Development Methodology.pptx
Unit 2 Classical Systems Development Methodology.pptxUnit 2 Classical Systems Development Methodology.pptx
Unit 2 Classical Systems Development Methodology.pptx
VrundaPatadia
 
Online auction system srs riport
Online auction system srs  riportOnline auction system srs  riport
Online auction system srs riport
Dilip Prajapati
 

Similar to Introduction to knowledge discovery (20)

Systems Analysis And Design 2
Systems Analysis And Design 2Systems Analysis And Design 2
Systems Analysis And Design 2
 
Mis
MisMis
Mis
 
Sdlc1
Sdlc1Sdlc1
Sdlc1
 
MODULE 1_Introduction to Data analytics and life cycle..pptx
MODULE 1_Introduction to Data analytics and life cycle..pptxMODULE 1_Introduction to Data analytics and life cycle..pptx
MODULE 1_Introduction to Data analytics and life cycle..pptx
 
Artificial Intelligence Notes Unit 5
Artificial Intelligence Notes Unit 5Artificial Intelligence Notes Unit 5
Artificial Intelligence Notes Unit 5
 
Section b
Section bSection b
Section b
 
IS L02 - Development of Information Systems
IS L02 - Development of Information SystemsIS L02 - Development of Information Systems
IS L02 - Development of Information Systems
 
4 IT Interview Question.pdf
4 IT Interview Question.pdf4 IT Interview Question.pdf
4 IT Interview Question.pdf
 
Expert system (unit 1 & 2)
Expert system (unit 1 & 2)Expert system (unit 1 & 2)
Expert system (unit 1 & 2)
 
System analysis and design
System analysis and design System analysis and design
System analysis and design
 
Concepts of system analysis
Concepts of system analysisConcepts of system analysis
Concepts of system analysis
 
Different Approaches To Sys Bldg
Different Approaches To Sys BldgDifferent Approaches To Sys Bldg
Different Approaches To Sys Bldg
 
Senior ipt term4_casestudy
Senior ipt term4_casestudySenior ipt term4_casestudy
Senior ipt term4_casestudy
 
06 styles and_greenfield_design
06 styles and_greenfield_design06 styles and_greenfield_design
06 styles and_greenfield_design
 
Hsc project management 2018pptx
Hsc project management 2018pptxHsc project management 2018pptx
Hsc project management 2018pptx
 
SAD_SDLC.pptx
SAD_SDLC.pptxSAD_SDLC.pptx
SAD_SDLC.pptx
 
System_Analysis_and_Design_Assignment_New2.ppt
System_Analysis_and_Design_Assignment_New2.pptSystem_Analysis_and_Design_Assignment_New2.ppt
System_Analysis_and_Design_Assignment_New2.ppt
 
Unit 2 Classical Systems Development Methodology.pptx
Unit 2 Classical Systems Development Methodology.pptxUnit 2 Classical Systems Development Methodology.pptx
Unit 2 Classical Systems Development Methodology.pptx
 
Introduction
IntroductionIntroduction
Introduction
 
Online auction system srs riport
Online auction system srs  riportOnline auction system srs  riport
Online auction system srs riport
 

Recently uploaded

Innovating Inference - Remote Triggering of Large Language Models on HPC Clus...
Innovating Inference - Remote Triggering of Large Language Models on HPC Clus...Innovating Inference - Remote Triggering of Large Language Models on HPC Clus...
Innovating Inference - Remote Triggering of Large Language Models on HPC Clus...
Globus
 
Globus Connect Server Deep Dive - GlobusWorld 2024
Globus Connect Server Deep Dive - GlobusWorld 2024Globus Connect Server Deep Dive - GlobusWorld 2024
Globus Connect Server Deep Dive - GlobusWorld 2024
Globus
 
Globus Compute wth IRI Workflows - GlobusWorld 2024
Globus Compute wth IRI Workflows - GlobusWorld 2024Globus Compute wth IRI Workflows - GlobusWorld 2024
Globus Compute wth IRI Workflows - GlobusWorld 2024
Globus
 
In 2015, I used to write extensions for Joomla, WordPress, phpBB3, etc and I ...
In 2015, I used to write extensions for Joomla, WordPress, phpBB3, etc and I ...In 2015, I used to write extensions for Joomla, WordPress, phpBB3, etc and I ...
In 2015, I used to write extensions for Joomla, WordPress, phpBB3, etc and I ...
Juraj Vysvader
 
Into the Box 2024 - Keynote Day 2 Slides.pdf
Into the Box 2024 - Keynote Day 2 Slides.pdfInto the Box 2024 - Keynote Day 2 Slides.pdf
Into the Box 2024 - Keynote Day 2 Slides.pdf
Ortus Solutions, Corp
 
Providing Globus Services to Users of JASMIN for Environmental Data Analysis
Providing Globus Services to Users of JASMIN for Environmental Data AnalysisProviding Globus Services to Users of JASMIN for Environmental Data Analysis
Providing Globus Services to Users of JASMIN for Environmental Data Analysis
Globus
 
Accelerate Enterprise Software Engineering with Platformless
Accelerate Enterprise Software Engineering with PlatformlessAccelerate Enterprise Software Engineering with Platformless
Accelerate Enterprise Software Engineering with Platformless
WSO2
 
Paketo Buildpacks : la meilleure façon de construire des images OCI? DevopsDa...
Paketo Buildpacks : la meilleure façon de construire des images OCI? DevopsDa...Paketo Buildpacks : la meilleure façon de construire des images OCI? DevopsDa...
Paketo Buildpacks : la meilleure façon de construire des images OCI? DevopsDa...
Anthony Dahanne
 
Exploring Innovations in Data Repository Solutions - Insights from the U.S. G...
Exploring Innovations in Data Repository Solutions - Insights from the U.S. G...Exploring Innovations in Data Repository Solutions - Insights from the U.S. G...
Exploring Innovations in Data Repository Solutions - Insights from the U.S. G...
Globus
 
BoxLang: Review our Visionary Licenses of 2024
BoxLang: Review our Visionary Licenses of 2024BoxLang: Review our Visionary Licenses of 2024
BoxLang: Review our Visionary Licenses of 2024
Ortus Solutions, Corp
 
Cyaniclab : Software Development Agency Portfolio.pdf
Cyaniclab : Software Development Agency Portfolio.pdfCyaniclab : Software Development Agency Portfolio.pdf
Cyaniclab : Software Development Agency Portfolio.pdf
Cyanic lab
 
top nidhi software solution freedownload
top nidhi software solution freedownloadtop nidhi software solution freedownload
top nidhi software solution freedownload
vrstrong314
 
Enhancing Research Orchestration Capabilities at ORNL.pdf
Enhancing Research Orchestration Capabilities at ORNL.pdfEnhancing Research Orchestration Capabilities at ORNL.pdf
Enhancing Research Orchestration Capabilities at ORNL.pdf
Globus
 
Cracking the code review at SpringIO 2024
Cracking the code review at SpringIO 2024Cracking the code review at SpringIO 2024
Cracking the code review at SpringIO 2024
Paco van Beckhoven
 
Custom Healthcare Software for Managing Chronic Conditions and Remote Patient...
Custom Healthcare Software for Managing Chronic Conditions and Remote Patient...Custom Healthcare Software for Managing Chronic Conditions and Remote Patient...
Custom Healthcare Software for Managing Chronic Conditions and Remote Patient...
Mind IT Systems
 
Gamify Your Mind; The Secret Sauce to Delivering Success, Continuously Improv...
Gamify Your Mind; The Secret Sauce to Delivering Success, Continuously Improv...Gamify Your Mind; The Secret Sauce to Delivering Success, Continuously Improv...
Gamify Your Mind; The Secret Sauce to Delivering Success, Continuously Improv...
Shahin Sheidaei
 
Quarkus Hidden and Forbidden Extensions
Quarkus Hidden and Forbidden ExtensionsQuarkus Hidden and Forbidden Extensions
Quarkus Hidden and Forbidden Extensions
Max Andersen
 
Beyond Event Sourcing - Embracing CRUD for Wix Platform - Java.IL
Beyond Event Sourcing - Embracing CRUD for Wix Platform - Java.ILBeyond Event Sourcing - Embracing CRUD for Wix Platform - Java.IL
Beyond Event Sourcing - Embracing CRUD for Wix Platform - Java.IL
Natan Silnitsky
 
Navigating the Metaverse: A Journey into Virtual Evolution"
Navigating the Metaverse: A Journey into Virtual Evolution"Navigating the Metaverse: A Journey into Virtual Evolution"
Navigating the Metaverse: A Journey into Virtual Evolution"
Donna Lenk
 
Enterprise Resource Planning System in Telangana
Enterprise Resource Planning System in TelanganaEnterprise Resource Planning System in Telangana
Enterprise Resource Planning System in Telangana
NYGGS Automation Suite
 

Recently uploaded (20)

Innovating Inference - Remote Triggering of Large Language Models on HPC Clus...
Innovating Inference - Remote Triggering of Large Language Models on HPC Clus...Innovating Inference - Remote Triggering of Large Language Models on HPC Clus...
Innovating Inference - Remote Triggering of Large Language Models on HPC Clus...
 
Globus Connect Server Deep Dive - GlobusWorld 2024
Globus Connect Server Deep Dive - GlobusWorld 2024Globus Connect Server Deep Dive - GlobusWorld 2024
Globus Connect Server Deep Dive - GlobusWorld 2024
 
Globus Compute wth IRI Workflows - GlobusWorld 2024
Globus Compute wth IRI Workflows - GlobusWorld 2024Globus Compute wth IRI Workflows - GlobusWorld 2024
Globus Compute wth IRI Workflows - GlobusWorld 2024
 
In 2015, I used to write extensions for Joomla, WordPress, phpBB3, etc and I ...
In 2015, I used to write extensions for Joomla, WordPress, phpBB3, etc and I ...In 2015, I used to write extensions for Joomla, WordPress, phpBB3, etc and I ...
In 2015, I used to write extensions for Joomla, WordPress, phpBB3, etc and I ...
 
Into the Box 2024 - Keynote Day 2 Slides.pdf
Into the Box 2024 - Keynote Day 2 Slides.pdfInto the Box 2024 - Keynote Day 2 Slides.pdf
Into the Box 2024 - Keynote Day 2 Slides.pdf
 
Providing Globus Services to Users of JASMIN for Environmental Data Analysis
Providing Globus Services to Users of JASMIN for Environmental Data AnalysisProviding Globus Services to Users of JASMIN for Environmental Data Analysis
Providing Globus Services to Users of JASMIN for Environmental Data Analysis
 
Accelerate Enterprise Software Engineering with Platformless
Accelerate Enterprise Software Engineering with PlatformlessAccelerate Enterprise Software Engineering with Platformless
Accelerate Enterprise Software Engineering with Platformless
 
Paketo Buildpacks : la meilleure façon de construire des images OCI? DevopsDa...
Paketo Buildpacks : la meilleure façon de construire des images OCI? DevopsDa...Paketo Buildpacks : la meilleure façon de construire des images OCI? DevopsDa...
Paketo Buildpacks : la meilleure façon de construire des images OCI? DevopsDa...
 
Exploring Innovations in Data Repository Solutions - Insights from the U.S. G...
Exploring Innovations in Data Repository Solutions - Insights from the U.S. G...Exploring Innovations in Data Repository Solutions - Insights from the U.S. G...
Exploring Innovations in Data Repository Solutions - Insights from the U.S. G...
 
BoxLang: Review our Visionary Licenses of 2024
BoxLang: Review our Visionary Licenses of 2024BoxLang: Review our Visionary Licenses of 2024
BoxLang: Review our Visionary Licenses of 2024
 
Cyaniclab : Software Development Agency Portfolio.pdf
Cyaniclab : Software Development Agency Portfolio.pdfCyaniclab : Software Development Agency Portfolio.pdf
Cyaniclab : Software Development Agency Portfolio.pdf
 
top nidhi software solution freedownload
top nidhi software solution freedownloadtop nidhi software solution freedownload
top nidhi software solution freedownload
 
Enhancing Research Orchestration Capabilities at ORNL.pdf
Enhancing Research Orchestration Capabilities at ORNL.pdfEnhancing Research Orchestration Capabilities at ORNL.pdf
Enhancing Research Orchestration Capabilities at ORNL.pdf
 
Cracking the code review at SpringIO 2024
Cracking the code review at SpringIO 2024Cracking the code review at SpringIO 2024
Cracking the code review at SpringIO 2024
 
Custom Healthcare Software for Managing Chronic Conditions and Remote Patient...
Custom Healthcare Software for Managing Chronic Conditions and Remote Patient...Custom Healthcare Software for Managing Chronic Conditions and Remote Patient...
Custom Healthcare Software for Managing Chronic Conditions and Remote Patient...
 
Gamify Your Mind; The Secret Sauce to Delivering Success, Continuously Improv...
Gamify Your Mind; The Secret Sauce to Delivering Success, Continuously Improv...Gamify Your Mind; The Secret Sauce to Delivering Success, Continuously Improv...
Gamify Your Mind; The Secret Sauce to Delivering Success, Continuously Improv...
 
Quarkus Hidden and Forbidden Extensions
Quarkus Hidden and Forbidden ExtensionsQuarkus Hidden and Forbidden Extensions
Quarkus Hidden and Forbidden Extensions
 
Beyond Event Sourcing - Embracing CRUD for Wix Platform - Java.IL
Beyond Event Sourcing - Embracing CRUD for Wix Platform - Java.ILBeyond Event Sourcing - Embracing CRUD for Wix Platform - Java.IL
Beyond Event Sourcing - Embracing CRUD for Wix Platform - Java.IL
 
Navigating the Metaverse: A Journey into Virtual Evolution"
Navigating the Metaverse: A Journey into Virtual Evolution"Navigating the Metaverse: A Journey into Virtual Evolution"
Navigating the Metaverse: A Journey into Virtual Evolution"
 
Enterprise Resource Planning System in Telangana
Enterprise Resource Planning System in TelanganaEnterprise Resource Planning System in Telangana
Enterprise Resource Planning System in Telangana
 

Introduction to knowledge discovery

  • 1. Introduction to Knowledge Engineering Dr. Albert Fornells Herrera http://www.linkedin.com/in/afornells Introduction to Knowledge Engineering - Albert Fornells
  • 2. 1. Definition 2. Process of Knowledge Engineering 3. References 2 Outline Introduction to Knowledge Engineering - Albert Fornells
  • 3.  What is Knowledge Engineering?  The process of building intelligent knowledge-based systems  Build a KBS is still more art than engineering  Knowledge is incomplete, uncertain and heuristic  Complex to estimate the nature and the amount of knowledge needed  An incremental and prototyped approach is the way to incrementally evolve the system from its conception until the final version  Six iterative basic phases (Waterman, 1986; Durkin, 1994): 1. Problem assessment 2. Data and knowledge acquisition 3. Development of a prototype system 4. Development of a complete system 5. Evaluation and revision of the system 6. Integration and maintenance of the system 3 Knowledge Engineering Introduction to Knowledge Engineering - Albert Fornells
  • 4. 4 The process of Knowledge Engineering Introduction to Knowledge Engineering - Albert Fornells Problem assessment Data and knowledge acquisition Development of a prototype system Development of a complete system Evaluation and revision of the system Integration and maintenance of the system Determine the characteristics of the problem Identify the main participants in the project Specify the project’s objective Determine the resources needed for building the system Collect and analyze data and knowledge Make key concepts of the system design more explicit Choose a tool for building an Intelligent system Transform data and represent the knowledge Design and implement a prototype system Test the prototype with test cases Prepare a detailed design for a full-scale system Collect additional data and knowledge Develop the user interface Implement the complete system Evaluate the system against the performance criteria Revise the system as necessary Make arrangements for technology transfer Establish an effective maintenance program Requirements Concepts Structure KBS ‘beta’ Reformulation Redesign Tuning KBS tested Maintenance Assessment Redesign Iterations
  • 5.  Goal: (1) Determine the characteristics of the problem; (2) Identify the participants of the project; (3) Specify the objectives of the project and (4) Determine what resources are needed for building the system.  The problem characterization implies:  Determine the problem type:  Diagnosis. Inferring malfunctions of an object from its behavior and recommending solutions  Selection. Recommending the best option from a list of possible alternatives  Prediction. Predicting the future behavior of an object from its behavior in the past  Classification. Assigning an object to one of the defined classes  Clustering. Dividing an heterogeneous group of objects into homogeneous subgroups  Optimization. Improving the quality of solutions until an optimal one is found  Control Governing the behavior of an object to meet specified requirements in real-time  Input and output variables and their interactions  The form and content of the solution. 5 KE process: Problem assessment Introduction to Knowledge Engineering - Albert Fornells
  • 6.  Identify the participants of the project. There are two critical participants:  The knowledge engineer (a person capable of designing, building and testing an intelligent system)  The domain expert (a knowledgeable person capable of solving problems in a specific area or domain)  Specify the project’s objectives  Gaining a competitive edge, improving the quality of decisions, reducing labor costs, and improving the quality of products and services, etc  Determine what resources are needed for building the system  Computer facilities  Development software  Knowledge and data sources (human experts, textbooks, manuals, web sites, databases and examples)  Money 6 KE process: Problem assessment Introduction to Knowledge Engineering - Albert Fornells
  • 7.  Toy problem: Found X.  Do you think that a KBS is needed?  X=sqrt(3^2+4^2)=5  Maybe it is not necessary  But maybe experts do not think the same because they misunderstand the questions.  They apply computer vision techniques for text recognition.  Who is wrong? 7 KE process: Problem assessment Introduction to Knowledge Engineering - Albert Fornells 4cm 3cm x 4cm 3cm x Here is
  • 8.  Goal: To obtain further understanding of the problem domain by (1) collecting and analyzing both data and knowledge and (2) making key concepts of the system’s design more explicit  The choice of the system building tool depends on the acquired data. Data is often collected from different sources and thus can be of different types.  Continuous variables vs variables divided into several ranges vs normalized (i.e. 0 - 1)  Symbolic (textual) data vs numerical data vs heterogeneous data  Imprecise and noisy data vs well-defined and clean data  As a result, the data must be transformed, or massaged, into the form useful for a particular tool. There are three important issues that must be resolved before massaging the data (Berry and Linoff, 1997)  Incompatible data  Inconsistent data  Missing data 8 KE process: Data and knowledge acquisition Introduction to Knowledge Engineering - Albert Fornells
  • 9.  Knowledge acquisition is an inherently iterative process based on:  Start with reviewing documents and reading books, papers and manuals related to the problem domain in order to become familiar with the problem  Collect further knowledge through interviewing the domain expert. The expert is asked to identify four or five typical cases, describe how each case is solved and explain, or ‘think out loud’, the reasoning behind each solution (Russell, 2002)  Study and analyze the acquired knowledge, and repeat the entire process again  However, extracting knowledge from a human expert is a difficult process – it is often called the ‘knowledge acquisition bottleneck’.  Quite often experts are unaware of the amount of knowledge they have and the problem-solving strategy they use, or are unable to verbalize it.  Experts may also provide us with irrelevant, incomplete or inconsistent information  Example of cheese factory (Donald Michie, 1982)  The data and knowledge acquired should enable us to describe the problem- solving strategy at the most abstract level, but any detailed analysis has to be done before evaluating the prototype 9 KE process: Data and knowledge acquisition Introduction to Knowledge Engineering - Albert Fornells
  • 10.  Goal: Create an intelligent system – or, rather, a small version of it – and test it with a number of test cases.  What is a prototype?  A prototype system can be defined as a small version of the final system  It is designed to test how well we understand the problem, or in other words, to make sure that the problem-solving strategy, the tool selected for building a system, and techniques for representing acquired data and knowledge are adequate to the task  It also provides us with an opportunity to actively engage the domain expert in the system’s development  What is a test case?  Once it is built, the prototype’s performance needs to be tested (usually together with the domain expert)  The domain expert takes an active part in testing the system and, as a result, becomes more involved in the system’s development  What should we do if we have made a bad choice of the system-building tool?  We should throw the prototype away and start the prototyping phase over again 10 KE process: Development of a prototype system Introduction to Knowledge Engineering - Albert Fornells
  • 11.  Goal: Once prototype is assessed, we develop a plan, schedule and budget for the complete system, and also clearly define the system’s performance.  Main milestones:  Adding data and knowledge to the system (i.e. collect core rules, additional historical examples, etc.)  Develop the user interface for delivering information to a user. Some system focuses on explaining its reasoning process and justify its advice. Others represent data in graphical forms.  The development is like an evolutionary process where, as the project needs, new data and knowledge are collected and added to the system. 11 KE process: Development of a complete system Introduction to Knowledge Engineering - Albert Fornells
  • 12.  Goal: To evaluate an intelligent system is, in fact, to assure that the system performs the intended task to the user’s satisfaction  Intelligent systems are designed to solve problems that quite often do not have clearly defined ‘right’ and ‘wrong’ solutions  A formal evaluation of the system is normally accomplished with the test cases selected by the user  The system’s performance is compared against the performance criteria that were agreed at the end of the prototyping phase  The evaluation often reveals the system’s limitations and weaknesses, so it is revised and relevant development phases are repeated  There are specific methodological procedures to assess the performance (Ian Witten, 2005)  X Stratified cross-validation  Leave-one out  Statistical tests are a common way for comparing the performance of the approach with respect to experts and also with respect to others approaches 12 KE process: Evaluation and revision Introduction to Knowledge Engineering - Albert Fornells
  • 13.  Goal: It involves integrating the system into the environment where it will operate and establishing an effective maintenance program  What means integration?  Integration means interfacing the new intelligent system with existing systems within an organization and arranging for technology transfer  What means establish a maintenance program?  Intelligent systems are knowledge-based systems, and because knowledge evolves over time, the system has to be able to be upgraded  The organization that uses the system should have in-house expertise to maintain and modify the system  Is maintenance forever?  No, a big change in the knowledge domain could imply to throw away the intelligent system and start from the beginning. 13 KE process: Integration and maintenance Introduction to Knowledge Engineering - Albert Fornells
  • 14. 14 Summary: Roadmap for tackling a problem Introduction to Knowledge Engineering - Albert Fornells Knowledge Problem Analysis Data Analysis Data Processing Modeling Evaluation Production
  • 15. Problem: I want to develop an intelligent system that can help me to fix malfunctions of my computer. Describe, from a global point of view, how will you tackle the process of Knowledge Engineering? 1. Problem assessment 2. Data and knowledge acquisition 3. Development of a prototype system 4. Development of a complete system 5. Evaluation and revision of the system 6. Integration and maintenance of the system 15 Exercise: Will a KBS work for my problem? Introduction to Knowledge Engineering - Albert Fornells
  • 16.  A. J. González y D. D. Dankel, The Engineering of Knowledge Based Systems. Theory and Practice. Prentice Hall, 2000. 16 References Introduction to Knowledge Engineering - Albert Fornells