SlideShare a Scribd company logo
Lecture 01
Information Retrieval
About the Course
 Book:
 An Introduction to Information Retrieval, Christopher D.
Manning Prabhakar Raghavan Hinrich Schütze, Cambridge
University Press, 2009.
 Other materials may be considered depending on the subject.
 Principal objective of this course:
 To introduce students to Information Retrieval concepts,
paradigms and techniques, with an emphasis on String and
Semantics based IR techniques.
About the Course
 Grading & Assessment:
 First Exam …………………….. 20%
 Second Exam ………………….. 20%
 Final Exam …………………….. 35%
 Other Activities ………………. 10%
 Major Assignment ……………. 15%
“You are to build a prototype for a search engine that employs
both text-based and semantics-based techniques for retrieving the
most relevant results to users’ queries. The search space will be a
collection of documents, in addition to a collection of images
associated with some textual descriptions”.
Course Topics
 Part 01 – Introduction
 What is IR?
 Examples of IR Systems.
 Other topics related to IR.
 Models of IR
 Part 02 – Boolean Retrieval
 What is Boolean IR?
 Term-Document Incidence Matrices
 Terminology and Notations
Course Topics
 Part 03 – Indexing
 Building Indexes
 Semantic Networks
 Part 04 – Retrieval
 Scoring, Ranking
 Relevance Feedback
 Precision/Recall
Course Topics
 Part 05 – Exploiting Ontologies in IR
 Ontologies
 Traditional vs. Semantics-based IR techniques
Introduction
What is IR
 Information Retrieval:
“Information retrieval (IR) is finding material (usually documents) of an
unstructured nature (usually text) that satisfies an information need from
within large collections (usually stored on computers).”
 Unstructured Data:
“refers to data which does not have clear, semantically overt, easy-for-a-
computer structure.”
e.g.  Textual information in web pages.
 Semistructured Data:
“refers to data which have a partially clear, semantically overt, easy-for-a-
computer structure.”
e.g.  finding a document where the title contains Java and the body
contains threading.
Introduction
What is IR
 Structured Data:
“refers to data which have a clear, semantically overt, easy-
for-a-computer structure.”
e.g.  Relational Databases.
 A look back: 1990s
 Studies showed that most people preferred getting
information from other people rather than from information
retrieval systems.
 Online booking systems?
 Following to this period and after relentless optimization of
IR:
 The field of information retrieval has moved from being a
primarily academic discipline to being the basis underlying
most people’s preferred means of information access.
Introduction
What is IR
 Information retrieval did not begin with the Web.
 The field began with scientific publications and library
records, but soon spread to other forms of content, particularly
those of information professionals, such as journalists, lawyers,
and doctors
Introduction
What is IR
Introduction
Other Topics Related to IR
 Cross-language IR
 Multimedia IR
 Speech retrieval
 User interfaces for IR
 Ontology and Semantics-based IR
 Natural Language Processing (NLP) techniques
 Dynamic IR
 Online Advertising !?
Introduction
Other Topics Related to IR
 The field of information retrieval also covers supporting users in
browsing or filtering document collections or further processing
a set of retrieved documents.
 Given a set of documents, clustering is the task of coming up
with a good grouping of the documents based on their contents.
 Given a set of topics, standing information needs, or other
categories (such as suitability of texts for different age groups),
classification is the task of deciding which class(es), if any,
each of a set of documents belongs to. It is often approached by
first manually classifying some documents and then hoping to
be able to classify new documents automatically.
Introduction
Classification of IR systems
 Scale-based Classification of IR systems: Distinguishing
between Information retrieval systems according to the scale at
which they operate.
1. Web search: The search is conducted over billions of
documents stored on millions of computers.
 Issues to consider:
1. Needing to gather documents for indexing.
2. Being able to build systems that work efficiently at this
enormous scale.
3. Handling particular aspects of the web, such as the
exploitation of hypertext and page ranking given the
commercial importance of the web.
2. Personal Information Retrieval: Integrating information
retrieval into consumer operating systems.
 Issues to consider:
1. Handling the broad range of document types on a typical
personal computer.
2. Making the search system maintenance free and
sufficiently lightweight in terms of startup, processing, and
disk space usage that it can run on one machine without
annoying its owner.
Introduction
Classification of IR systems
3. Enterprise, Institutional, and Domain-specific Search:
A corporation’s documents will typically be stored on
centralized file systems and one or a handful of
dedicated machines will provide search over the
collection.
 Issues to consider:
1. Handling the broad range of document types on a
centralized computer.
2. Scale and Efficiency of the IR system.
3. Maintenance of the search system.
Introduction
Classification of IR systems
Introduction
Classification of IR systems
 Technique-based Classification of IR systems:
Distinguishing between Information retrieval systems
according to the search technique that they employ.
1. Keyword-based search: String matching algorithms are
employed to find documents relevant to the user’s query.
 Issues to consider:
1. Precision and Recall of the search algorithm.
2. Gap between the textual information contained in the
document collections and the user’s information need.
Introduction
Classification of IR systems
2. Semantics-based search: Semantic aspects of the
user’s query are derived in an attempt to find documents
relevant to the user’s query.
 Issues to consider:
1. Precision and Recall of the search algorithm.
2. Lack of Semantic Resources.
3. Incompleteness of Background Knowledge
represented in existing Semantic Resources.
4. Semantic Heterogeneity problem between existing
Semantic Resources.
5. Lack of Multi-lingual Semantic Resources.
Introduction
Classification of IR systems
2. Hybrid Approaches: Keyword-based search is enriched with
Semantics-based search to retrieve more relevant results to the
user’s information needs.
 Issues to consider:
1. Precision and Recall of the search algorithm.
2. Lack of Semantic Resources.
3. Priority of the employed techniques.
4. Incompleteness of Background Knowledge represented in
existing Semantic Resources.
5. Types of queries that the system can handle (Single-term vs.
Verbose queries).
6. Lack of Multi-lingual Semantic Resources.
 Research is very active in this area.
 Example: Dbpedia based search engine (June 2015)

More Related Content

What's hot

INFORMATION RETRIEVAL Anandraj.L
INFORMATION RETRIEVAL Anandraj.LINFORMATION RETRIEVAL Anandraj.L
INFORMATION RETRIEVAL Anandraj.Lanujessy
 
INFORMATION RETRIEVAL ‎AND DISSEMINATION
INFORMATION RETRIEVAL ‎AND DISSEMINATIONINFORMATION RETRIEVAL ‎AND DISSEMINATION
INFORMATION RETRIEVAL ‎AND DISSEMINATION
Libcorpio
 
Information retrieval
Information retrievalInformation retrieval
Information retrievalhplap
 
Introduction to Information Retrieval
Introduction to Information RetrievalIntroduction to Information Retrieval
Introduction to Information Retrieval
Roi Blanco
 
Aggregation for searching complex information spaces
Aggregation for searching complex information spacesAggregation for searching complex information spaces
Aggregation for searching complex information spaces
Mounia Lalmas-Roelleke
 
Konsep Dasar Information Retrieval - Edi faizal
Konsep Dasar Information Retrieval - Edi faizal Konsep Dasar Information Retrieval - Edi faizal
Konsep Dasar Information Retrieval - Edi faizal
EdiFaizal2
 
Functions of information retrival system(1)
Functions of information retrival system(1)Functions of information retrival system(1)
Functions of information retrival system(1)silambu111
 
Tutorial 1 (information retrieval basics)
Tutorial 1 (information retrieval basics)Tutorial 1 (information retrieval basics)
Tutorial 1 (information retrieval basics)
Kira
 
Information_Retrieval_Models_Nfaoui_El_Habib
Information_Retrieval_Models_Nfaoui_El_HabibInformation_Retrieval_Models_Nfaoui_El_Habib
Information_Retrieval_Models_Nfaoui_El_Habib
El Habib NFAOUI
 
information retrieval Techniques and normalization
information retrieval Techniques and normalizationinformation retrieval Techniques and normalization
information retrieval Techniques and normalization
Ameenababs
 
Information retrieval system
Information retrieval systemInformation retrieval system
Information retrieval system
Leslie Vargas
 
Text Indexing and Retrieval
Text Indexing and RetrievalText Indexing and Retrieval
Text Indexing and Retrieval
Rachmat Wahid Saleh Insani
 
Introduction to Information Retrieval
Introduction to Information RetrievalIntroduction to Information Retrieval
Introduction to Information Retrieval
Carsten Eickhoff
 
Information retrieval s
Information retrieval sInformation retrieval s
Information retrieval ssilambu111
 
Vector space model of information retrieval
Vector space model of information retrievalVector space model of information retrieval
Vector space model of information retrievalNanthini Dominique
 
Lectures 1,2,3
Lectures 1,2,3Lectures 1,2,3
Lectures 1,2,3alaa223
 
An Introduction to Information Retrieval and Applications
 An Introduction to Information Retrieval and Applications An Introduction to Information Retrieval and Applications
An Introduction to Information Retrieval and Applications
sathish sak
 
Tovek Presentation by Livio Costantini
Tovek Presentation by Livio CostantiniTovek Presentation by Livio Costantini
Tovek Presentation by Livio Costantinimaxfalc
 
Information Retrieval Fundamentals - An introduction
Information Retrieval Fundamentals - An introduction Information Retrieval Fundamentals - An introduction
Information Retrieval Fundamentals - An introduction
Grace Hui Yang
 

What's hot (20)

INFORMATION RETRIEVAL Anandraj.L
INFORMATION RETRIEVAL Anandraj.LINFORMATION RETRIEVAL Anandraj.L
INFORMATION RETRIEVAL Anandraj.L
 
INFORMATION RETRIEVAL ‎AND DISSEMINATION
INFORMATION RETRIEVAL ‎AND DISSEMINATIONINFORMATION RETRIEVAL ‎AND DISSEMINATION
INFORMATION RETRIEVAL ‎AND DISSEMINATION
 
Information retrieval
Information retrievalInformation retrieval
Information retrieval
 
Introduction to Information Retrieval
Introduction to Information RetrievalIntroduction to Information Retrieval
Introduction to Information Retrieval
 
Aggregation for searching complex information spaces
Aggregation for searching complex information spacesAggregation for searching complex information spaces
Aggregation for searching complex information spaces
 
Konsep Dasar Information Retrieval - Edi faizal
Konsep Dasar Information Retrieval - Edi faizal Konsep Dasar Information Retrieval - Edi faizal
Konsep Dasar Information Retrieval - Edi faizal
 
Functions of information retrival system(1)
Functions of information retrival system(1)Functions of information retrival system(1)
Functions of information retrival system(1)
 
Tutorial 1 (information retrieval basics)
Tutorial 1 (information retrieval basics)Tutorial 1 (information retrieval basics)
Tutorial 1 (information retrieval basics)
 
Information_Retrieval_Models_Nfaoui_El_Habib
Information_Retrieval_Models_Nfaoui_El_HabibInformation_Retrieval_Models_Nfaoui_El_Habib
Information_Retrieval_Models_Nfaoui_El_Habib
 
information retrieval Techniques and normalization
information retrieval Techniques and normalizationinformation retrieval Techniques and normalization
information retrieval Techniques and normalization
 
Information retrieval system
Information retrieval systemInformation retrieval system
Information retrieval system
 
Text Indexing and Retrieval
Text Indexing and RetrievalText Indexing and Retrieval
Text Indexing and Retrieval
 
Introduction to Information Retrieval
Introduction to Information RetrievalIntroduction to Information Retrieval
Introduction to Information Retrieval
 
Information retrieval s
Information retrieval sInformation retrieval s
Information retrieval s
 
Vector space model of information retrieval
Vector space model of information retrievalVector space model of information retrieval
Vector space model of information retrieval
 
Lectures 1,2,3
Lectures 1,2,3Lectures 1,2,3
Lectures 1,2,3
 
An Introduction to Information Retrieval and Applications
 An Introduction to Information Retrieval and Applications An Introduction to Information Retrieval and Applications
An Introduction to Information Retrieval and Applications
 
Lec1,2
Lec1,2Lec1,2
Lec1,2
 
Tovek Presentation by Livio Costantini
Tovek Presentation by Livio CostantiniTovek Presentation by Livio Costantini
Tovek Presentation by Livio Costantini
 
Information Retrieval Fundamentals - An introduction
Information Retrieval Fundamentals - An introduction Information Retrieval Fundamentals - An introduction
Information Retrieval Fundamentals - An introduction
 

Viewers also liked

Introduction to Information Retrieval & Models
Introduction to Information Retrieval & ModelsIntroduction to Information Retrieval & Models
Introduction to Information Retrieval & Models
Mounia Lalmas-Roelleke
 
Representation theory slideshare
Representation theory slideshareRepresentation theory slideshare
Representation theory slideshare
chloelogan1
 
Ir 02
Ir   02Ir   02
Ch8
Ch8Ch8
Ir 09
Ir   09Ir   09
Ir 03
Ir   03Ir   03
Information Retrieval Techniques of Google
Information Retrieval Techniques of Google Information Retrieval Techniques of Google
Information Retrieval Techniques of Google
Cyr Ish
 
Ir 08
Ir   08Ir   08
Ch2020
Ch2020Ch2020
Ch7
Ch7Ch7
Keystone Summer School 2015: Mauro Dragoni, Ontologies For Information Retrieval
Keystone Summer School 2015: Mauro Dragoni, Ontologies For Information RetrievalKeystone Summer School 2015: Mauro Dragoni, Ontologies For Information Retrieval
Keystone Summer School 2015: Mauro Dragoni, Ontologies For Information Retrieval
Mauro Dragoni
 
Ai 02 intelligent_agents(1)
Ai 02 intelligent_agents(1)Ai 02 intelligent_agents(1)
Ai 02 intelligent_agents(1)
Mohammed Romi
 
Ian Sommerville, Software Engineering, 9th EditionCh 8
Ian Sommerville,  Software Engineering, 9th EditionCh 8Ian Sommerville,  Software Engineering, 9th EditionCh 8
Ian Sommerville, Software Engineering, 9th EditionCh 8
Mohammed Romi
 
Artifical intelligance
Artifical intelliganceArtifical intelligance
Artifical intelligance
Gangasailakshmi Tellakula
 
SlideShare 101
SlideShare 101SlideShare 101
SlideShare 101
Amit Ranjan
 

Viewers also liked (17)

Introduction to Information Retrieval & Models
Introduction to Information Retrieval & ModelsIntroduction to Information Retrieval & Models
Introduction to Information Retrieval & Models
 
Representation theory slideshare
Representation theory slideshareRepresentation theory slideshare
Representation theory slideshare
 
Bab ii
Bab iiBab ii
Bab ii
 
Ir 02
Ir   02Ir   02
Ir 02
 
Ch8
Ch8Ch8
Ch8
 
Ir 09
Ir   09Ir   09
Ir 09
 
Ir 03
Ir   03Ir   03
Ir 03
 
Information Retrieval Techniques of Google
Information Retrieval Techniques of Google Information Retrieval Techniques of Google
Information Retrieval Techniques of Google
 
Ir 08
Ir   08Ir   08
Ir 08
 
Ch2020
Ch2020Ch2020
Ch2020
 
Ch7
Ch7Ch7
Ch7
 
Keystone Summer School 2015: Mauro Dragoni, Ontologies For Information Retrieval
Keystone Summer School 2015: Mauro Dragoni, Ontologies For Information RetrievalKeystone Summer School 2015: Mauro Dragoni, Ontologies For Information Retrieval
Keystone Summer School 2015: Mauro Dragoni, Ontologies For Information Retrieval
 
Ai 02 intelligent_agents(1)
Ai 02 intelligent_agents(1)Ai 02 intelligent_agents(1)
Ai 02 intelligent_agents(1)
 
IR
IRIR
IR
 
Ian Sommerville, Software Engineering, 9th EditionCh 8
Ian Sommerville,  Software Engineering, 9th EditionCh 8Ian Sommerville,  Software Engineering, 9th EditionCh 8
Ian Sommerville, Software Engineering, 9th EditionCh 8
 
Artifical intelligance
Artifical intelliganceArtifical intelligance
Artifical intelligance
 
SlideShare 101
SlideShare 101SlideShare 101
SlideShare 101
 

Similar to Ir 01

CS8080_IRT__UNIT_I_NOTES.pdf
CS8080_IRT__UNIT_I_NOTES.pdfCS8080_IRT__UNIT_I_NOTES.pdf
CS8080_IRT__UNIT_I_NOTES.pdf
AALIM MUHAMMED SALEGH COLLEGE OF ENGINEERING
 
CS8080 IRT UNIT I NOTES.pdf
CS8080 IRT UNIT I  NOTES.pdfCS8080 IRT UNIT I  NOTES.pdf
Chapter 1.pptx
Chapter 1.pptxChapter 1.pptx
Chapter 1.pptx
Habtamu100
 
Indexing Techniques: Their Usage in Search Engines for Information Retrieval
Indexing Techniques: Their Usage in Search Engines for Information RetrievalIndexing Techniques: Their Usage in Search Engines for Information Retrieval
Indexing Techniques: Their Usage in Search Engines for Information Retrieval
Vikas Bhushan
 
Hci
HciHci
IRT Unit_I.pptx
IRT Unit_I.pptxIRT Unit_I.pptx
IRT Unit_I.pptx
thenmozhip8
 
Chapter 1: Introduction to Information Storage and Retrieval
Chapter 1: Introduction to Information Storage and RetrievalChapter 1: Introduction to Information Storage and Retrieval
Chapter 1: Introduction to Information Storage and Retrieval
captainmactavish1996
 
Chapter 1 Introduction to Information Storage and Retrieval.pdf
Chapter 1 Introduction to Information Storage and Retrieval.pdfChapter 1 Introduction to Information Storage and Retrieval.pdf
Chapter 1 Introduction to Information Storage and Retrieval.pdf
Habtamu100
 
Hci encyclopedia irshortefords
Hci encyclopedia irshortefordsHci encyclopedia irshortefords
Hci encyclopedia irshortefordsapollobgslibrary
 
Hci encyclopedia irshortefords
Hci encyclopedia irshortefordsHci encyclopedia irshortefords
Hci encyclopedia irshortefordsapollobgslibrary
 
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...ijceronline
 
IJRET-V1I1P5 - A User Friendly Mobile Search Engine for fast Accessing the Da...
IJRET-V1I1P5 - A User Friendly Mobile Search Engine for fast Accessing the Da...IJRET-V1I1P5 - A User Friendly Mobile Search Engine for fast Accessing the Da...
IJRET-V1I1P5 - A User Friendly Mobile Search Engine for fast Accessing the Da...
ISAR Publications
 
Chap 1 general introduction of information retrieval
Chap 1  general introduction of information retrievalChap 1  general introduction of information retrieval
Chap 1 general introduction of information retrieval
Malobe Lottin Cyrille Marcel
 
1.1 library concepts, terms and systems edited
1.1 library concepts, terms and systems edited1.1 library concepts, terms and systems edited
1.1 library concepts, terms and systems edited
ChandraSekhar1115
 
Use of Wikipedia categories on information retrieval: a brief research
Use of Wikipedia categories on information retrieval: a brief researchUse of Wikipedia categories on information retrieval: a brief research
Use of Wikipedia categories on information retrieval: a brief research
Jesús Tramullas
 
Challenges and emerging practices for knowledge organization in the electron...
Challenges and emerging practices for knowledge  organization in the electron...Challenges and emerging practices for knowledge  organization in the electron...
Challenges and emerging practices for knowledge organization in the electron...Anil Mishra
 
M045067275
M045067275M045067275
M045067275
IJERA Editor
 
chapter 1-Overview of Information Retrieval.ppt
chapter 1-Overview of Information Retrieval.pptchapter 1-Overview of Information Retrieval.ppt
chapter 1-Overview of Information Retrieval.ppt
SamuelKetema1
 

Similar to Ir 01 (20)

CS8080_IRT__UNIT_I_NOTES.pdf
CS8080_IRT__UNIT_I_NOTES.pdfCS8080_IRT__UNIT_I_NOTES.pdf
CS8080_IRT__UNIT_I_NOTES.pdf
 
CS8080 IRT UNIT I NOTES.pdf
CS8080 IRT UNIT I  NOTES.pdfCS8080 IRT UNIT I  NOTES.pdf
CS8080 IRT UNIT I NOTES.pdf
 
Chapter 1.pptx
Chapter 1.pptxChapter 1.pptx
Chapter 1.pptx
 
Indexing Techniques: Their Usage in Search Engines for Information Retrieval
Indexing Techniques: Their Usage in Search Engines for Information RetrievalIndexing Techniques: Their Usage in Search Engines for Information Retrieval
Indexing Techniques: Their Usage in Search Engines for Information Retrieval
 
Hci
HciHci
Hci
 
IRT Unit_I.pptx
IRT Unit_I.pptxIRT Unit_I.pptx
IRT Unit_I.pptx
 
Chapter 1: Introduction to Information Storage and Retrieval
Chapter 1: Introduction to Information Storage and RetrievalChapter 1: Introduction to Information Storage and Retrieval
Chapter 1: Introduction to Information Storage and Retrieval
 
Chapter 1 Introduction to Information Storage and Retrieval.pdf
Chapter 1 Introduction to Information Storage and Retrieval.pdfChapter 1 Introduction to Information Storage and Retrieval.pdf
Chapter 1 Introduction to Information Storage and Retrieval.pdf
 
Mam assign
Mam assignMam assign
Mam assign
 
Hci encyclopedia irshortefords
Hci encyclopedia irshortefordsHci encyclopedia irshortefords
Hci encyclopedia irshortefords
 
Hci encyclopedia irshortefords
Hci encyclopedia irshortefordsHci encyclopedia irshortefords
Hci encyclopedia irshortefords
 
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
 
IJRET-V1I1P5 - A User Friendly Mobile Search Engine for fast Accessing the Da...
IJRET-V1I1P5 - A User Friendly Mobile Search Engine for fast Accessing the Da...IJRET-V1I1P5 - A User Friendly Mobile Search Engine for fast Accessing the Da...
IJRET-V1I1P5 - A User Friendly Mobile Search Engine for fast Accessing the Da...
 
Chap 1 general introduction of information retrieval
Chap 1  general introduction of information retrievalChap 1  general introduction of information retrieval
Chap 1 general introduction of information retrieval
 
1.1 library concepts, terms and systems edited
1.1 library concepts, terms and systems edited1.1 library concepts, terms and systems edited
1.1 library concepts, terms and systems edited
 
Use of Wikipedia categories on information retrieval: a brief research
Use of Wikipedia categories on information retrieval: a brief researchUse of Wikipedia categories on information retrieval: a brief research
Use of Wikipedia categories on information retrieval: a brief research
 
Challenges and emerging practices for knowledge organization in the electron...
Challenges and emerging practices for knowledge  organization in the electron...Challenges and emerging practices for knowledge  organization in the electron...
Challenges and emerging practices for knowledge organization in the electron...
 
M045067275
M045067275M045067275
M045067275
 
Infolitbis
InfolitbisInfolitbis
Infolitbis
 
chapter 1-Overview of Information Retrieval.ppt
chapter 1-Overview of Information Retrieval.pptchapter 1-Overview of Information Retrieval.ppt
chapter 1-Overview of Information Retrieval.ppt
 

More from Mohammed Romi

Ai 01 introduction
Ai 01 introductionAi 01 introduction
Ai 01 introduction
Mohammed Romi
 
Ai 03 solving_problems_by_searching
Ai 03 solving_problems_by_searchingAi 03 solving_problems_by_searching
Ai 03 solving_problems_by_searching
Mohammed Romi
 
Swiching
SwichingSwiching
Swiching
Mohammed Romi
 
Ch19 network layer-logical add
Ch19 network layer-logical addCh19 network layer-logical add
Ch19 network layer-logical add
Mohammed Romi
 
Ch12
Ch12Ch12
Angel6 e05
Angel6 e05Angel6 e05
Angel6 e05
Mohammed Romi
 
Chapter02 graphics-programming
Chapter02 graphics-programmingChapter02 graphics-programming
Chapter02 graphics-programming
Mohammed Romi
 
Swe notes
Swe notesSwe notes
Swe notes
Mohammed Romi
 
Ian Sommerville, Software Engineering, 9th Edition Ch 4
Ian Sommerville,  Software Engineering, 9th Edition Ch 4Ian Sommerville,  Software Engineering, 9th Edition Ch 4
Ian Sommerville, Software Engineering, 9th Edition Ch 4
Mohammed Romi
 
Ian Sommerville, Software Engineering, 9th Edition Ch2
Ian Sommerville,  Software Engineering, 9th Edition Ch2Ian Sommerville,  Software Engineering, 9th Edition Ch2
Ian Sommerville, Software Engineering, 9th Edition Ch2
Mohammed Romi
 
Ian Sommerville, Software Engineering, 9th Edition Ch1
Ian Sommerville,  Software Engineering, 9th Edition Ch1Ian Sommerville,  Software Engineering, 9th Edition Ch1
Ian Sommerville, Software Engineering, 9th Edition Ch1
Mohammed Romi
 
Ian Sommerville, Software Engineering, 9th Edition Ch 23
Ian Sommerville,  Software Engineering, 9th Edition Ch 23Ian Sommerville,  Software Engineering, 9th Edition Ch 23
Ian Sommerville, Software Engineering, 9th Edition Ch 23
Mohammed Romi
 
Ch 6
Ch 6Ch 6
Ch 4 software engineering
Ch 4 software engineeringCh 4 software engineering
Ch 4 software engineering
Mohammed Romi
 

More from Mohammed Romi (14)

Ai 01 introduction
Ai 01 introductionAi 01 introduction
Ai 01 introduction
 
Ai 03 solving_problems_by_searching
Ai 03 solving_problems_by_searchingAi 03 solving_problems_by_searching
Ai 03 solving_problems_by_searching
 
Swiching
SwichingSwiching
Swiching
 
Ch19 network layer-logical add
Ch19 network layer-logical addCh19 network layer-logical add
Ch19 network layer-logical add
 
Ch12
Ch12Ch12
Ch12
 
Angel6 e05
Angel6 e05Angel6 e05
Angel6 e05
 
Chapter02 graphics-programming
Chapter02 graphics-programmingChapter02 graphics-programming
Chapter02 graphics-programming
 
Swe notes
Swe notesSwe notes
Swe notes
 
Ian Sommerville, Software Engineering, 9th Edition Ch 4
Ian Sommerville,  Software Engineering, 9th Edition Ch 4Ian Sommerville,  Software Engineering, 9th Edition Ch 4
Ian Sommerville, Software Engineering, 9th Edition Ch 4
 
Ian Sommerville, Software Engineering, 9th Edition Ch2
Ian Sommerville,  Software Engineering, 9th Edition Ch2Ian Sommerville,  Software Engineering, 9th Edition Ch2
Ian Sommerville, Software Engineering, 9th Edition Ch2
 
Ian Sommerville, Software Engineering, 9th Edition Ch1
Ian Sommerville,  Software Engineering, 9th Edition Ch1Ian Sommerville,  Software Engineering, 9th Edition Ch1
Ian Sommerville, Software Engineering, 9th Edition Ch1
 
Ian Sommerville, Software Engineering, 9th Edition Ch 23
Ian Sommerville,  Software Engineering, 9th Edition Ch 23Ian Sommerville,  Software Engineering, 9th Edition Ch 23
Ian Sommerville, Software Engineering, 9th Edition Ch 23
 
Ch 6
Ch 6Ch 6
Ch 6
 
Ch 4 software engineering
Ch 4 software engineeringCh 4 software engineering
Ch 4 software engineering
 

Recently uploaded

Fish and Chips - have they had their chips
Fish and Chips - have they had their chipsFish and Chips - have they had their chips
Fish and Chips - have they had their chips
GeoBlogs
 
Instructions for Submissions thorugh G- Classroom.pptx
Instructions for Submissions thorugh G- Classroom.pptxInstructions for Submissions thorugh G- Classroom.pptx
Instructions for Submissions thorugh G- Classroom.pptx
Jheel Barad
 
Unit 8 - Information and Communication Technology (Paper I).pdf
Unit 8 - Information and Communication Technology (Paper I).pdfUnit 8 - Information and Communication Technology (Paper I).pdf
Unit 8 - Information and Communication Technology (Paper I).pdf
Thiyagu K
 
MARUTI SUZUKI- A Successful Joint Venture in India.pptx
MARUTI SUZUKI- A Successful Joint Venture in India.pptxMARUTI SUZUKI- A Successful Joint Venture in India.pptx
MARUTI SUZUKI- A Successful Joint Venture in India.pptx
bennyroshan06
 
special B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdfspecial B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdf
Special education needs
 
Polish students' mobility in the Czech Republic
Polish students' mobility in the Czech RepublicPolish students' mobility in the Czech Republic
Polish students' mobility in the Czech Republic
Anna Sz.
 
How to Split Bills in the Odoo 17 POS Module
How to Split Bills in the Odoo 17 POS ModuleHow to Split Bills in the Odoo 17 POS Module
How to Split Bills in the Odoo 17 POS Module
Celine George
 
1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx
JosvitaDsouza2
 
The Art Pastor's Guide to Sabbath | Steve Thomason
The Art Pastor's Guide to Sabbath | Steve ThomasonThe Art Pastor's Guide to Sabbath | Steve Thomason
The Art Pastor's Guide to Sabbath | Steve Thomason
Steve Thomason
 
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXXPhrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
MIRIAMSALINAS13
 
How to Create Map Views in the Odoo 17 ERP
How to Create Map Views in the Odoo 17 ERPHow to Create Map Views in the Odoo 17 ERP
How to Create Map Views in the Odoo 17 ERP
Celine George
 
ESC Beyond Borders _From EU to You_ InfoPack general.pdf
ESC Beyond Borders _From EU to You_ InfoPack general.pdfESC Beyond Borders _From EU to You_ InfoPack general.pdf
ESC Beyond Borders _From EU to You_ InfoPack general.pdf
Fundacja Rozwoju Społeczeństwa Przedsiębiorczego
 
Overview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with MechanismOverview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with Mechanism
DeeptiGupta154
 
The geography of Taylor Swift - some ideas
The geography of Taylor Swift - some ideasThe geography of Taylor Swift - some ideas
The geography of Taylor Swift - some ideas
GeoBlogs
 
The French Revolution Class 9 Study Material pdf free download
The French Revolution Class 9 Study Material pdf free downloadThe French Revolution Class 9 Study Material pdf free download
The French Revolution Class 9 Study Material pdf free download
Vivekanand Anglo Vedic Academy
 
Model Attribute Check Company Auto Property
Model Attribute  Check Company Auto PropertyModel Attribute  Check Company Auto Property
Model Attribute Check Company Auto Property
Celine George
 
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCECLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
BhavyaRajput3
 
Thesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.pptThesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.ppt
EverAndrsGuerraGuerr
 
PART A. Introduction to Costumer Service
PART A. Introduction to Costumer ServicePART A. Introduction to Costumer Service
PART A. Introduction to Costumer Service
PedroFerreira53928
 
The Roman Empire A Historical Colossus.pdf
The Roman Empire A Historical Colossus.pdfThe Roman Empire A Historical Colossus.pdf
The Roman Empire A Historical Colossus.pdf
kaushalkr1407
 

Recently uploaded (20)

Fish and Chips - have they had their chips
Fish and Chips - have they had their chipsFish and Chips - have they had their chips
Fish and Chips - have they had their chips
 
Instructions for Submissions thorugh G- Classroom.pptx
Instructions for Submissions thorugh G- Classroom.pptxInstructions for Submissions thorugh G- Classroom.pptx
Instructions for Submissions thorugh G- Classroom.pptx
 
Unit 8 - Information and Communication Technology (Paper I).pdf
Unit 8 - Information and Communication Technology (Paper I).pdfUnit 8 - Information and Communication Technology (Paper I).pdf
Unit 8 - Information and Communication Technology (Paper I).pdf
 
MARUTI SUZUKI- A Successful Joint Venture in India.pptx
MARUTI SUZUKI- A Successful Joint Venture in India.pptxMARUTI SUZUKI- A Successful Joint Venture in India.pptx
MARUTI SUZUKI- A Successful Joint Venture in India.pptx
 
special B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdfspecial B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdf
 
Polish students' mobility in the Czech Republic
Polish students' mobility in the Czech RepublicPolish students' mobility in the Czech Republic
Polish students' mobility in the Czech Republic
 
How to Split Bills in the Odoo 17 POS Module
How to Split Bills in the Odoo 17 POS ModuleHow to Split Bills in the Odoo 17 POS Module
How to Split Bills in the Odoo 17 POS Module
 
1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx
 
The Art Pastor's Guide to Sabbath | Steve Thomason
The Art Pastor's Guide to Sabbath | Steve ThomasonThe Art Pastor's Guide to Sabbath | Steve Thomason
The Art Pastor's Guide to Sabbath | Steve Thomason
 
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXXPhrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
 
How to Create Map Views in the Odoo 17 ERP
How to Create Map Views in the Odoo 17 ERPHow to Create Map Views in the Odoo 17 ERP
How to Create Map Views in the Odoo 17 ERP
 
ESC Beyond Borders _From EU to You_ InfoPack general.pdf
ESC Beyond Borders _From EU to You_ InfoPack general.pdfESC Beyond Borders _From EU to You_ InfoPack general.pdf
ESC Beyond Borders _From EU to You_ InfoPack general.pdf
 
Overview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with MechanismOverview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with Mechanism
 
The geography of Taylor Swift - some ideas
The geography of Taylor Swift - some ideasThe geography of Taylor Swift - some ideas
The geography of Taylor Swift - some ideas
 
The French Revolution Class 9 Study Material pdf free download
The French Revolution Class 9 Study Material pdf free downloadThe French Revolution Class 9 Study Material pdf free download
The French Revolution Class 9 Study Material pdf free download
 
Model Attribute Check Company Auto Property
Model Attribute  Check Company Auto PropertyModel Attribute  Check Company Auto Property
Model Attribute Check Company Auto Property
 
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCECLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
 
Thesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.pptThesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.ppt
 
PART A. Introduction to Costumer Service
PART A. Introduction to Costumer ServicePART A. Introduction to Costumer Service
PART A. Introduction to Costumer Service
 
The Roman Empire A Historical Colossus.pdf
The Roman Empire A Historical Colossus.pdfThe Roman Empire A Historical Colossus.pdf
The Roman Empire A Historical Colossus.pdf
 

Ir 01

  • 2. About the Course  Book:  An Introduction to Information Retrieval, Christopher D. Manning Prabhakar Raghavan Hinrich Schütze, Cambridge University Press, 2009.  Other materials may be considered depending on the subject.  Principal objective of this course:  To introduce students to Information Retrieval concepts, paradigms and techniques, with an emphasis on String and Semantics based IR techniques.
  • 3. About the Course  Grading & Assessment:  First Exam …………………….. 20%  Second Exam ………………….. 20%  Final Exam …………………….. 35%  Other Activities ………………. 10%  Major Assignment ……………. 15% “You are to build a prototype for a search engine that employs both text-based and semantics-based techniques for retrieving the most relevant results to users’ queries. The search space will be a collection of documents, in addition to a collection of images associated with some textual descriptions”.
  • 4. Course Topics  Part 01 – Introduction  What is IR?  Examples of IR Systems.  Other topics related to IR.  Models of IR  Part 02 – Boolean Retrieval  What is Boolean IR?  Term-Document Incidence Matrices  Terminology and Notations
  • 5. Course Topics  Part 03 – Indexing  Building Indexes  Semantic Networks  Part 04 – Retrieval  Scoring, Ranking  Relevance Feedback  Precision/Recall
  • 6. Course Topics  Part 05 – Exploiting Ontologies in IR  Ontologies  Traditional vs. Semantics-based IR techniques
  • 7. Introduction What is IR  Information Retrieval: “Information retrieval (IR) is finding material (usually documents) of an unstructured nature (usually text) that satisfies an information need from within large collections (usually stored on computers).”  Unstructured Data: “refers to data which does not have clear, semantically overt, easy-for-a- computer structure.” e.g.  Textual information in web pages.  Semistructured Data: “refers to data which have a partially clear, semantically overt, easy-for-a- computer structure.” e.g.  finding a document where the title contains Java and the body contains threading.
  • 8. Introduction What is IR  Structured Data: “refers to data which have a clear, semantically overt, easy- for-a-computer structure.” e.g.  Relational Databases.
  • 9.  A look back: 1990s  Studies showed that most people preferred getting information from other people rather than from information retrieval systems.  Online booking systems?  Following to this period and after relentless optimization of IR:  The field of information retrieval has moved from being a primarily academic discipline to being the basis underlying most people’s preferred means of information access. Introduction What is IR
  • 10.  Information retrieval did not begin with the Web.  The field began with scientific publications and library records, but soon spread to other forms of content, particularly those of information professionals, such as journalists, lawyers, and doctors Introduction What is IR
  • 11. Introduction Other Topics Related to IR  Cross-language IR  Multimedia IR  Speech retrieval  User interfaces for IR  Ontology and Semantics-based IR  Natural Language Processing (NLP) techniques  Dynamic IR  Online Advertising !?
  • 12. Introduction Other Topics Related to IR  The field of information retrieval also covers supporting users in browsing or filtering document collections or further processing a set of retrieved documents.  Given a set of documents, clustering is the task of coming up with a good grouping of the documents based on their contents.  Given a set of topics, standing information needs, or other categories (such as suitability of texts for different age groups), classification is the task of deciding which class(es), if any, each of a set of documents belongs to. It is often approached by first manually classifying some documents and then hoping to be able to classify new documents automatically.
  • 13. Introduction Classification of IR systems  Scale-based Classification of IR systems: Distinguishing between Information retrieval systems according to the scale at which they operate. 1. Web search: The search is conducted over billions of documents stored on millions of computers.  Issues to consider: 1. Needing to gather documents for indexing. 2. Being able to build systems that work efficiently at this enormous scale. 3. Handling particular aspects of the web, such as the exploitation of hypertext and page ranking given the commercial importance of the web.
  • 14. 2. Personal Information Retrieval: Integrating information retrieval into consumer operating systems.  Issues to consider: 1. Handling the broad range of document types on a typical personal computer. 2. Making the search system maintenance free and sufficiently lightweight in terms of startup, processing, and disk space usage that it can run on one machine without annoying its owner. Introduction Classification of IR systems
  • 15. 3. Enterprise, Institutional, and Domain-specific Search: A corporation’s documents will typically be stored on centralized file systems and one or a handful of dedicated machines will provide search over the collection.  Issues to consider: 1. Handling the broad range of document types on a centralized computer. 2. Scale and Efficiency of the IR system. 3. Maintenance of the search system. Introduction Classification of IR systems
  • 16. Introduction Classification of IR systems  Technique-based Classification of IR systems: Distinguishing between Information retrieval systems according to the search technique that they employ. 1. Keyword-based search: String matching algorithms are employed to find documents relevant to the user’s query.  Issues to consider: 1. Precision and Recall of the search algorithm. 2. Gap between the textual information contained in the document collections and the user’s information need.
  • 17. Introduction Classification of IR systems 2. Semantics-based search: Semantic aspects of the user’s query are derived in an attempt to find documents relevant to the user’s query.  Issues to consider: 1. Precision and Recall of the search algorithm. 2. Lack of Semantic Resources. 3. Incompleteness of Background Knowledge represented in existing Semantic Resources. 4. Semantic Heterogeneity problem between existing Semantic Resources. 5. Lack of Multi-lingual Semantic Resources.
  • 18. Introduction Classification of IR systems 2. Hybrid Approaches: Keyword-based search is enriched with Semantics-based search to retrieve more relevant results to the user’s information needs.  Issues to consider: 1. Precision and Recall of the search algorithm. 2. Lack of Semantic Resources. 3. Priority of the employed techniques. 4. Incompleteness of Background Knowledge represented in existing Semantic Resources. 5. Types of queries that the system can handle (Single-term vs. Verbose queries). 6. Lack of Multi-lingual Semantic Resources.  Research is very active in this area.  Example: Dbpedia based search engine (June 2015)