SlideShare a Scribd company logo
1 of 25
Download to read offline
INFORMATION RETIEVAL (IR)
Edi Faizal
21/483830/SPA/00795
MK “INFORMATION RETRIEVAL”; DOSEN PENGAMPU: AINA MUSDHOLIFAH, S.Kom., M.Kom., Ph.D.
PROGRAM STUDI S3 DOKTOR ILMU KOMPUTER 2021
Course Topic’s
A. Konsep Dasar IR (Information Retrieval)
1. IR vs Recommender System vs Search Engine
2. Jenis-jenis IR
B. Preparing IR
1. Crawling
2. Indexing
3. NLP pada IR
4. Representasi text pada IR
C. Metode klasifikasi pada IR
D. Metode clustering pada IR
E. Evaluation in IR
Part 1
KONSEP DASAR INFORMATION RETIEVAL
Edi Faizal
21/483830/SPA/00795
MK “INFORMATION RETRIEVAL”; DOSEN PENGAMPU: AINA MUSDHOLIFAH, S.Kom., M.Kom., Ph.D.
PROGRAM STUDI S3 DOKTOR ILMU KOMPUTER 2021
Outline
Konsep Dasar IR (Information Retrieval)
• IR vs RecSys vs Search Engine
• Jenis-Jenis IR
Information Retrieval (IR)
versus
Recommender System (RecSys/RSs)
versus
Search Engine (SE)
Information Retrieval (IR)
Information retrieval (IR) is finding material (usually documents) of an
unstructured nature (usually text) that satisfies an information need from
within large collections (usually stored on computers)
(Manning et al, 2009)
 An information need is the topic about which the user desires to know
more about.
 A document is relevant if the user perceives that it contains information
of value with respect to their personal information need.
What is a document? web pages, email, books, news stories, scholarly
papers, text messages, Powerpoint, PDF, forum postings, patents, IM
sessions, Tweets, question answer postings, image, audio, video etc.
 A query is what the user conveys to the computer in an attempt to
communicate the information need.
IR (cont…)
The system should be able to retrieve
the relevant docs eficiently
IR (cont…)
Recommender System (RecSys/RSs)
A recommender system, or a recommendation system (sometimes replacing
'system' with a synonym such as platform or engine), is a subclass
of information filtering system that seeks to predict the "rating" or
"preference" a user would give to an item
(Ricci et al, 211)
 Software tools and techniques providing suggestions for items to be of use to a user
 “Item” is the general term used to denote what the system recommends to users
 The suggestions relate to various decision-making processes, such as what items to
buy, what music to listen to, or what online news to read.
 Designing and developing RSs is a multi-disciplinary effort that has benefited from
results obtained in various computer science fields especially machine learning and
data mining, information retrieval, and human-computer interaction
RecSys (cont…)
Primary model of RecSys:
• Prediction version of problem: memprediksi nilai peringkat
untuk kombinasi user-item, dengan asumsi data pelatihan
tersedia, yang menunjukkan preferensi user to item.
• Ranking version of problem : Proses merekomendasikan top-k
item untuk pengguna tertentu, atau menentukan top-k pengguna
untuk menargetkan item tertentu.
Operational and technical goals RecSys:
• Relevance: merekomendasikan item yang relevan dengan user.
• Novelty: membantu ketika item yang direkomendasikan adalah
sesuatu (item) yang belum pernah dilihat user di masa lalu.
• Serendipity: item yang direkomendasikan tidak terduga
(kebetulan) dan tidak diketahui sebelumnya.
• Increasing recommendation diversity: memberikan keragaman
rekomendasi, biasanya menyarankan daftar top-k item kepada
user.
Relevance Novelty
Serendipity
Increasing
Recom.
diversity
Prediction
version of
problem
Ranking
version of
problem
Goal of RecSys
RecSys (cont…)
Model-model RSs (Khan et al., 2020) dimodifikasi
RecSys (cont…)
IR vs RecSys
Search Engine
“A program that searches for and identifies items in a database that
correspond to keywords or characters specified by the user, used especially
for finding particular sites on the World Wide Web.”
 Salah satu aplikasi umum dari IR adalah search engine atau mesin
pencarian yang terdapat pada jaringan internet.
 Pengguna dapat mencari halaman-halaman web yang dibutuhkannya
melalui search engine.
 Contoh lain dari IR adalah sistem informasi perpustakaan
Search Engine (cont…)
Google.com
http://portal.igpublish.com.ezproxy.ugm.ac.id/
IR & RecSys & Search Engine
IR & RecSys & Search Engine (cont…)
Jenis-jenis IR
Information retrieval models roughly fall into following paradigms:
 Set theoretic models
Boolean model
Extended Boolean model
 Algebraic models
 Vector space model
 Latent models
 Latent semantic indexing (LSI), Random indexing, Topic modelling for IR
 Probabilistic retrieval
Classic probabilistic retrieval: Binary independence model, BM11, BM25
Language models for IR, Semantic ad-hoc retrieval, Embedding models
Jenis-jenis IR (cont…)
Jenis-jenis IR (cont…)
An information retrieval comprises of the following four key elements:
• D − Document Representation.
• Q − Query Representation.
• F − A framework to match and establish a relationship between D and Q.
• R (q, di) − A ranking function that determines the similarity between the
query and the document to display relevant information.
There are three types of Information Retrieval (IR) models:
1. Classical IR Model
2. Non-Classical IR Model
3. Alternative IR Model
Jenis-jenis IR (cont…)
Classical IR Model
 It is designed upon basic mathematical concepts and is the most widely-
used of IR models. Classic Information Retrieval models can be
implemented with ease.
 Its examples include: Vector-space, Boolean and Probabilistic IR models.
 In this system, the retrieval of information depends on documents
ontaining the defined set of queries. There is no ranking or grading of any
kind.
 The different classical IR models take Document Representation, Query
representation, and Retrieval/Matching function into account in their
modelling.
Jenis-jenis IR (cont…)
Non-Classical IR Model
They differ from classic models in that they are built upon propositional
logic. Examples of non-classical IR models include:
 Information Logic,
 Situation Theory, and
 Interaction models.
Jenis-jenis IR (cont…)
Alternative IR Model
These take principles of classical IR model and enhance upon to create
more functional models like :
 Cluster model,
 Alternative Set-Theoretic Models
 Fuzzy Set model,
 Latent Semantic Indexing (LSI) model,
 Alternative Algebraic Models
 Generalized Vector Space Model, etc.
What next…?
Preparing IR
 Crawling
 Indexing
 NLP pada IR
 Representasi text pada IR
Konsep Dasar Information Retrieval - Edi faizal

More Related Content

What's hot

Information storage and retrieval
Information storage and retrievalInformation storage and retrieval
Information storage and retrievalSadaf Rafiq
 
INFORMATION RETRIEVAL Anandraj.L
INFORMATION RETRIEVAL Anandraj.LINFORMATION RETRIEVAL Anandraj.L
INFORMATION RETRIEVAL Anandraj.Lanujessy
 
Information retrieval
Information retrievalInformation retrieval
Information retrievalhplap
 
Tutorial 1 (information retrieval basics)
Tutorial 1 (information retrieval basics)Tutorial 1 (information retrieval basics)
Tutorial 1 (information retrieval basics)Kira
 
INFORMATION RETRIEVAL ‎AND DISSEMINATION
INFORMATION RETRIEVAL ‎AND DISSEMINATIONINFORMATION RETRIEVAL ‎AND DISSEMINATION
INFORMATION RETRIEVAL ‎AND DISSEMINATIONLibcorpio
 
Tdm information retrieval
Tdm information retrievalTdm information retrieval
Tdm information retrievalKU Leuven
 
Information Retrieval Fundamentals - An introduction
Information Retrieval Fundamentals - An introduction Information Retrieval Fundamentals - An introduction
Information Retrieval Fundamentals - An introduction Grace Hui Yang
 
Information retrieval system!
Information retrieval system!Information retrieval system!
Information retrieval system!Jane Garay
 
Information retrieval system
Information retrieval systemInformation retrieval system
Information retrieval systemLeslie Vargas
 
Review of search and retrieval strategies
Review of search and retrieval strategiesReview of search and retrieval strategies
Review of search and retrieval strategiesAbid Fakhre Alam
 
information retrieval Techniques and normalization
information retrieval Techniques and normalizationinformation retrieval Techniques and normalization
information retrieval Techniques and normalizationAmeenababs
 
Functions of information retrival system(1)
Functions of information retrival system(1)Functions of information retrival system(1)
Functions of information retrival system(1)silambu111
 
basis of infromation retrival part 1 retrival tools
basis of infromation retrival part 1 retrival toolsbasis of infromation retrival part 1 retrival tools
basis of infromation retrival part 1 retrival toolsSaroj Suwal
 
Tovek Presentation by Livio Costantini
Tovek Presentation by Livio CostantiniTovek Presentation by Livio Costantini
Tovek Presentation by Livio Costantinimaxfalc
 
Aggregation for searching complex information spaces
Aggregation for searching complex information spacesAggregation for searching complex information spaces
Aggregation for searching complex information spacesMounia Lalmas-Roelleke
 

What's hot (20)

Information storage and retrieval
Information storage and retrievalInformation storage and retrieval
Information storage and retrieval
 
INFORMATION RETRIEVAL Anandraj.L
INFORMATION RETRIEVAL Anandraj.LINFORMATION RETRIEVAL Anandraj.L
INFORMATION RETRIEVAL Anandraj.L
 
Information retrieval
Information retrievalInformation retrieval
Information retrieval
 
Tutorial 1 (information retrieval basics)
Tutorial 1 (information retrieval basics)Tutorial 1 (information retrieval basics)
Tutorial 1 (information retrieval basics)
 
INFORMATION RETRIEVAL ‎AND DISSEMINATION
INFORMATION RETRIEVAL ‎AND DISSEMINATIONINFORMATION RETRIEVAL ‎AND DISSEMINATION
INFORMATION RETRIEVAL ‎AND DISSEMINATION
 
Ir 01
Ir   01Ir   01
Ir 01
 
Tdm information retrieval
Tdm information retrievalTdm information retrieval
Tdm information retrieval
 
Information Retrieval Fundamentals - An introduction
Information Retrieval Fundamentals - An introduction Information Retrieval Fundamentals - An introduction
Information Retrieval Fundamentals - An introduction
 
Text Indexing and Retrieval
Text Indexing and RetrievalText Indexing and Retrieval
Text Indexing and Retrieval
 
Information retrieval system!
Information retrieval system!Information retrieval system!
Information retrieval system!
 
Automatic indexing
Automatic indexingAutomatic indexing
Automatic indexing
 
IR
IRIR
IR
 
Information retrieval system
Information retrieval systemInformation retrieval system
Information retrieval system
 
Review of search and retrieval strategies
Review of search and retrieval strategiesReview of search and retrieval strategies
Review of search and retrieval strategies
 
information retrieval Techniques and normalization
information retrieval Techniques and normalizationinformation retrieval Techniques and normalization
information retrieval Techniques and normalization
 
Functions of information retrival system(1)
Functions of information retrival system(1)Functions of information retrival system(1)
Functions of information retrival system(1)
 
basis of infromation retrival part 1 retrival tools
basis of infromation retrival part 1 retrival toolsbasis of infromation retrival part 1 retrival tools
basis of infromation retrival part 1 retrival tools
 
Tovek Presentation by Livio Costantini
Tovek Presentation by Livio CostantiniTovek Presentation by Livio Costantini
Tovek Presentation by Livio Costantini
 
Aggregation for searching complex information spaces
Aggregation for searching complex information spacesAggregation for searching complex information spaces
Aggregation for searching complex information spaces
 
Lec1,2
Lec1,2Lec1,2
Lec1,2
 

Similar to Konsep Dasar Information Retrieval - Edi faizal

Machine Learned Relevance at A Large Scale Search Engine
Machine Learned Relevance at A Large Scale Search EngineMachine Learned Relevance at A Large Scale Search Engine
Machine Learned Relevance at A Large Scale Search EngineSalford Systems
 
Algorithm for calculating relevance of documents in information retrieval sys...
Algorithm for calculating relevance of documents in information retrieval sys...Algorithm for calculating relevance of documents in information retrieval sys...
Algorithm for calculating relevance of documents in information retrieval sys...IRJET Journal
 
Hci encyclopedia irshortefords
Hci encyclopedia irshortefordsHci encyclopedia irshortefords
Hci encyclopedia irshortefordsapollobgslibrary
 
Hci encyclopedia irshortefords
Hci encyclopedia irshortefordsHci encyclopedia irshortefords
Hci encyclopedia irshortefordsapollobgslibrary
 
DBLP-SSE: A DBLP Search Support Engine
DBLP-SSE: A DBLP Search Support EngineDBLP-SSE: A DBLP Search Support Engine
DBLP-SSE: A DBLP Search Support EngineYi Zeng
 
Chapter 1.pptx
Chapter 1.pptxChapter 1.pptx
Chapter 1.pptxHabtamu100
 
Projection Multi Scale Hashing Keyword Search in Multidimensional Datasets
Projection Multi Scale Hashing Keyword Search in Multidimensional DatasetsProjection Multi Scale Hashing Keyword Search in Multidimensional Datasets
Projection Multi Scale Hashing Keyword Search in Multidimensional DatasetsIRJET Journal
 
CONTEXTUAL MODEL OF RECOMMENDING RESOURCES ON AN ACADEMIC NETWORKING PORTAL
CONTEXTUAL MODEL OF RECOMMENDING RESOURCES ON AN ACADEMIC NETWORKING PORTALCONTEXTUAL MODEL OF RECOMMENDING RESOURCES ON AN ACADEMIC NETWORKING PORTAL
CONTEXTUAL MODEL OF RECOMMENDING RESOURCES ON AN ACADEMIC NETWORKING PORTALcscpconf
 
Contextual model of recommending resources on an academic networking portal
Contextual model of recommending resources on an academic networking portalContextual model of recommending resources on an academic networking portal
Contextual model of recommending resources on an academic networking portalcsandit
 
Chapter 1 Introduction to Information Storage and Retrieval.pdf
Chapter 1 Introduction to Information Storage and Retrieval.pdfChapter 1 Introduction to Information Storage and Retrieval.pdf
Chapter 1 Introduction to Information Storage and Retrieval.pdfHabtamu100
 
Demonstrating a Framework for KOS-based Recommendations Systems
Demonstrating a Framework for KOS-based Recommendations SystemsDemonstrating a Framework for KOS-based Recommendations Systems
Demonstrating a Framework for KOS-based Recommendations SystemsGESIS
 
A Comprehensive Survey on Comparisons across Contextual Pre-Filtering, Contex...
A Comprehensive Survey on Comparisons across Contextual Pre-Filtering, Contex...A Comprehensive Survey on Comparisons across Contextual Pre-Filtering, Contex...
A Comprehensive Survey on Comparisons across Contextual Pre-Filtering, Contex...TELKOMNIKA JOURNAL
 
Paving the way to open and interoperable research data service workflows
Paving the way to open and interoperable research data service workflowsPaving the way to open and interoperable research data service workflows
Paving the way to open and interoperable research data service workflowsThe University of Edinburgh
 
Leveraging Computational Methods for Theorizing IS Phenomena
Leveraging Computational Methods for Theorizing IS PhenomenaLeveraging Computational Methods for Theorizing IS Phenomena
Leveraging Computational Methods for Theorizing IS PhenomenaMalmi Amadoru
 
A Survey on Recommendation System based on Knowledge Graph and Machine Learning
A Survey on Recommendation System based on Knowledge Graph and Machine LearningA Survey on Recommendation System based on Knowledge Graph and Machine Learning
A Survey on Recommendation System based on Knowledge Graph and Machine LearningIRJET Journal
 

Similar to Konsep Dasar Information Retrieval - Edi faizal (20)

Machine Learned Relevance at A Large Scale Search Engine
Machine Learned Relevance at A Large Scale Search EngineMachine Learned Relevance at A Large Scale Search Engine
Machine Learned Relevance at A Large Scale Search Engine
 
Algorithm for calculating relevance of documents in information retrieval sys...
Algorithm for calculating relevance of documents in information retrieval sys...Algorithm for calculating relevance of documents in information retrieval sys...
Algorithm for calculating relevance of documents in information retrieval sys...
 
Hci encyclopedia irshortefords
Hci encyclopedia irshortefordsHci encyclopedia irshortefords
Hci encyclopedia irshortefords
 
Hci encyclopedia irshortefords
Hci encyclopedia irshortefordsHci encyclopedia irshortefords
Hci encyclopedia irshortefords
 
DBLP-SSE: A DBLP Search Support Engine
DBLP-SSE: A DBLP Search Support EngineDBLP-SSE: A DBLP Search Support Engine
DBLP-SSE: A DBLP Search Support Engine
 
I1802055259
I1802055259I1802055259
I1802055259
 
Chapter 1.pptx
Chapter 1.pptxChapter 1.pptx
Chapter 1.pptx
 
Projection Multi Scale Hashing Keyword Search in Multidimensional Datasets
Projection Multi Scale Hashing Keyword Search in Multidimensional DatasetsProjection Multi Scale Hashing Keyword Search in Multidimensional Datasets
Projection Multi Scale Hashing Keyword Search in Multidimensional Datasets
 
M045067275
M045067275M045067275
M045067275
 
CONTEXTUAL MODEL OF RECOMMENDING RESOURCES ON AN ACADEMIC NETWORKING PORTAL
CONTEXTUAL MODEL OF RECOMMENDING RESOURCES ON AN ACADEMIC NETWORKING PORTALCONTEXTUAL MODEL OF RECOMMENDING RESOURCES ON AN ACADEMIC NETWORKING PORTAL
CONTEXTUAL MODEL OF RECOMMENDING RESOURCES ON AN ACADEMIC NETWORKING PORTAL
 
Contextual model of recommending resources on an academic networking portal
Contextual model of recommending resources on an academic networking portalContextual model of recommending resources on an academic networking portal
Contextual model of recommending resources on an academic networking portal
 
Chapter 1 Introduction to Information Storage and Retrieval.pdf
Chapter 1 Introduction to Information Storage and Retrieval.pdfChapter 1 Introduction to Information Storage and Retrieval.pdf
Chapter 1 Introduction to Information Storage and Retrieval.pdf
 
Demonstrating a Framework for KOS-based Recommendations Systems
Demonstrating a Framework for KOS-based Recommendations SystemsDemonstrating a Framework for KOS-based Recommendations Systems
Demonstrating a Framework for KOS-based Recommendations Systems
 
CS8080 IRT UNIT I NOTES.pdf
CS8080 IRT UNIT I  NOTES.pdfCS8080 IRT UNIT I  NOTES.pdf
CS8080 IRT UNIT I NOTES.pdf
 
CS8080_IRT__UNIT_I_NOTES.pdf
CS8080_IRT__UNIT_I_NOTES.pdfCS8080_IRT__UNIT_I_NOTES.pdf
CS8080_IRT__UNIT_I_NOTES.pdf
 
A Comprehensive Survey on Comparisons across Contextual Pre-Filtering, Contex...
A Comprehensive Survey on Comparisons across Contextual Pre-Filtering, Contex...A Comprehensive Survey on Comparisons across Contextual Pre-Filtering, Contex...
A Comprehensive Survey on Comparisons across Contextual Pre-Filtering, Contex...
 
Paving the way to open and interoperable research data service workflows
Paving the way to open and interoperable research data service workflowsPaving the way to open and interoperable research data service workflows
Paving the way to open and interoperable research data service workflows
 
Hci
HciHci
Hci
 
Leveraging Computational Methods for Theorizing IS Phenomena
Leveraging Computational Methods for Theorizing IS PhenomenaLeveraging Computational Methods for Theorizing IS Phenomena
Leveraging Computational Methods for Theorizing IS Phenomena
 
A Survey on Recommendation System based on Knowledge Graph and Machine Learning
A Survey on Recommendation System based on Knowledge Graph and Machine LearningA Survey on Recommendation System based on Knowledge Graph and Machine Learning
A Survey on Recommendation System based on Knowledge Graph and Machine Learning
 

Recently uploaded

Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...HostedbyConfluent
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsSnow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsHyundai Motor Group
 
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your Budget
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your BudgetHyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your Budget
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your BudgetEnjoy Anytime
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhisoniya singh
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions
 
Azure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAzure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAndikSusilo4
 
Artificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraArtificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraDeakin University
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 

Recently uploaded (20)

Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsSnow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
 
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your Budget
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your BudgetHyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your Budget
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your Budget
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping Elbows
 
Azure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAzure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & Application
 
Artificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraArtificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning era
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 

Konsep Dasar Information Retrieval - Edi faizal

  • 1. INFORMATION RETIEVAL (IR) Edi Faizal 21/483830/SPA/00795 MK “INFORMATION RETRIEVAL”; DOSEN PENGAMPU: AINA MUSDHOLIFAH, S.Kom., M.Kom., Ph.D. PROGRAM STUDI S3 DOKTOR ILMU KOMPUTER 2021
  • 2. Course Topic’s A. Konsep Dasar IR (Information Retrieval) 1. IR vs Recommender System vs Search Engine 2. Jenis-jenis IR B. Preparing IR 1. Crawling 2. Indexing 3. NLP pada IR 4. Representasi text pada IR C. Metode klasifikasi pada IR D. Metode clustering pada IR E. Evaluation in IR
  • 3. Part 1 KONSEP DASAR INFORMATION RETIEVAL Edi Faizal 21/483830/SPA/00795 MK “INFORMATION RETRIEVAL”; DOSEN PENGAMPU: AINA MUSDHOLIFAH, S.Kom., M.Kom., Ph.D. PROGRAM STUDI S3 DOKTOR ILMU KOMPUTER 2021
  • 4. Outline Konsep Dasar IR (Information Retrieval) • IR vs RecSys vs Search Engine • Jenis-Jenis IR
  • 5. Information Retrieval (IR) versus Recommender System (RecSys/RSs) versus Search Engine (SE)
  • 6. Information Retrieval (IR) Information retrieval (IR) is finding material (usually documents) of an unstructured nature (usually text) that satisfies an information need from within large collections (usually stored on computers) (Manning et al, 2009)  An information need is the topic about which the user desires to know more about.  A document is relevant if the user perceives that it contains information of value with respect to their personal information need. What is a document? web pages, email, books, news stories, scholarly papers, text messages, Powerpoint, PDF, forum postings, patents, IM sessions, Tweets, question answer postings, image, audio, video etc.  A query is what the user conveys to the computer in an attempt to communicate the information need.
  • 7. IR (cont…) The system should be able to retrieve the relevant docs eficiently
  • 9. Recommender System (RecSys/RSs) A recommender system, or a recommendation system (sometimes replacing 'system' with a synonym such as platform or engine), is a subclass of information filtering system that seeks to predict the "rating" or "preference" a user would give to an item (Ricci et al, 211)  Software tools and techniques providing suggestions for items to be of use to a user  “Item” is the general term used to denote what the system recommends to users  The suggestions relate to various decision-making processes, such as what items to buy, what music to listen to, or what online news to read.  Designing and developing RSs is a multi-disciplinary effort that has benefited from results obtained in various computer science fields especially machine learning and data mining, information retrieval, and human-computer interaction
  • 10. RecSys (cont…) Primary model of RecSys: • Prediction version of problem: memprediksi nilai peringkat untuk kombinasi user-item, dengan asumsi data pelatihan tersedia, yang menunjukkan preferensi user to item. • Ranking version of problem : Proses merekomendasikan top-k item untuk pengguna tertentu, atau menentukan top-k pengguna untuk menargetkan item tertentu. Operational and technical goals RecSys: • Relevance: merekomendasikan item yang relevan dengan user. • Novelty: membantu ketika item yang direkomendasikan adalah sesuatu (item) yang belum pernah dilihat user di masa lalu. • Serendipity: item yang direkomendasikan tidak terduga (kebetulan) dan tidak diketahui sebelumnya. • Increasing recommendation diversity: memberikan keragaman rekomendasi, biasanya menyarankan daftar top-k item kepada user. Relevance Novelty Serendipity Increasing Recom. diversity Prediction version of problem Ranking version of problem Goal of RecSys
  • 11. RecSys (cont…) Model-model RSs (Khan et al., 2020) dimodifikasi
  • 14. Search Engine “A program that searches for and identifies items in a database that correspond to keywords or characters specified by the user, used especially for finding particular sites on the World Wide Web.”  Salah satu aplikasi umum dari IR adalah search engine atau mesin pencarian yang terdapat pada jaringan internet.  Pengguna dapat mencari halaman-halaman web yang dibutuhkannya melalui search engine.  Contoh lain dari IR adalah sistem informasi perpustakaan
  • 16. IR & RecSys & Search Engine
  • 17. IR & RecSys & Search Engine (cont…)
  • 18. Jenis-jenis IR Information retrieval models roughly fall into following paradigms:  Set theoretic models Boolean model Extended Boolean model  Algebraic models  Vector space model  Latent models  Latent semantic indexing (LSI), Random indexing, Topic modelling for IR  Probabilistic retrieval Classic probabilistic retrieval: Binary independence model, BM11, BM25 Language models for IR, Semantic ad-hoc retrieval, Embedding models
  • 20. Jenis-jenis IR (cont…) An information retrieval comprises of the following four key elements: • D − Document Representation. • Q − Query Representation. • F − A framework to match and establish a relationship between D and Q. • R (q, di) − A ranking function that determines the similarity between the query and the document to display relevant information. There are three types of Information Retrieval (IR) models: 1. Classical IR Model 2. Non-Classical IR Model 3. Alternative IR Model
  • 21. Jenis-jenis IR (cont…) Classical IR Model  It is designed upon basic mathematical concepts and is the most widely- used of IR models. Classic Information Retrieval models can be implemented with ease.  Its examples include: Vector-space, Boolean and Probabilistic IR models.  In this system, the retrieval of information depends on documents ontaining the defined set of queries. There is no ranking or grading of any kind.  The different classical IR models take Document Representation, Query representation, and Retrieval/Matching function into account in their modelling.
  • 22. Jenis-jenis IR (cont…) Non-Classical IR Model They differ from classic models in that they are built upon propositional logic. Examples of non-classical IR models include:  Information Logic,  Situation Theory, and  Interaction models.
  • 23. Jenis-jenis IR (cont…) Alternative IR Model These take principles of classical IR model and enhance upon to create more functional models like :  Cluster model,  Alternative Set-Theoretic Models  Fuzzy Set model,  Latent Semantic Indexing (LSI) model,  Alternative Algebraic Models  Generalized Vector Space Model, etc.
  • 24. What next…? Preparing IR  Crawling  Indexing  NLP pada IR  Representasi text pada IR