SlideShare a Scribd company logo
HINDUSTHAN INSTITUTE OF
TECHNOLOGY
DEPARTMENT OF COMPUER
SCIENCE AND ENGINEERING
20CS512 – INFORMATION RETRIEVAL
Dr. M. Thangamani
Architecture of the IR
Components of the IR
• Document Collection: The document collection is the set of documents that the IRS indexes
and searches. Documents can be in various formats, including text, images, audio, video, or a
combination of these.
• Preprocessing: Preprocessing involves transforming the raw documents into a form that can
be easily indexed and searched. This may include tokenization, stemming, stop-word
removal, and other text processing techniques.
• Indexing: Indexing is the process of creating a data structure (such as an inverted index) that
allows for efficient searching and retrieval of documents. The index maps terms or keywords
to the documents in which they appear.
• Query Processing: Query processing involves interpreting and transforming the user's query
into a form that can be matched against the indexed documents. This may include parsing,
tokenization, stemming, and other query processing techniques.
• Search and Retrieval: Search and retrieval involves searching the index to find documents
that match the query. This may involve ranking the documents based on their relevance to
the query and returning the most relevant documents to the user.
• User Interface: The user interface is the front end of the IRS that allows users to enter
queries, view search results, and interact with the system. The user interface may include
features such as faceted search, filtering, sorting, and visualization.
• Relevance Feedback: Relevance feedback involves collecting feedback from users about the
relevance of the retrieved documents and using this feedback to improve the search results.
This may involve adjusting the search algorithm, updating the index, or re-ranking the
documents.
Process of querying, indexing and
retrieval system
Information Retrieval Model
• basic information retrieval model.
• Classical IR Model
• Boolean Model
• Vector Space Model
• Probability Distribution Model
Continue..
• Boolean Model: This model is based on Boolean logic,
where a query is formulated using logical operators such as
AND, OR, and NOT. The model returns documents that
satisfy the Boolean expression specified in the query.
• Vector Space Model (VSM): This model represents
documents and queries as vectors in a high-dimensional
space, where each dimension corresponds to a term. The
similarity between a document and a query is measured
using a similarity metric, such as the cosine similarity.
• Probabilistic Model: This model estimates the probability
of a document being relevant to a query based on the
probability of the query terms appearing in the document.
One of the most well-known probabilistic models is the
Binary Independence Model (BIM).
Visualization Interface
• Scatter Plots: Scatter plots are used to display the relationship between two variables, and
they can be used to show the distribution of search results based on two different dimensions,
such as relevance and publication date.
• Bar Charts: Bar charts are used to display the frequency or count of items in different
categories, and they can be used to show the distribution of search results based on different
facets, such as document type, author, or topic.
• Line Charts: Line charts are used to display trends over time, and they can be used to show
the distribution of search results based on a temporal dimension, such as publication date or
time of access.
• Heat Maps: Heat maps are used to display data in a matrix format, where the values are
represented by colors. They can be used to show the distribution of search results based on
two dimensions, such as relevance and publication date.
• Word Clouds: Word clouds are used to display the frequency of terms in a text, with more
frequent terms displayed in larger font sizes. They can be used to show the distribution of
terms in the search results or to highlight the most relevant terms in a document.
• Network Graphs: Network graphs are used to display relationships between entities, such as
documents, authors, or topics. They can be used to show the connections between documents
based on citations, references, or other relationships.
• Map Visualizations: Map visualizations are used to display geographical data, and they can be
used to show the distribution of search results based on geographical dimensions, such as
location or region.
Continue..
• Construct the term matrix for the following
document and query
Documemt :
• 1.Taj mahal is a beautiful monument.
• 2.Victoria Memorial is also a monument.
• 3.I like to visit agra.
• Construct the term matrix for the following
document and query
• Documemt :
• 1.Taj mahal is a beautiful monument.
• 2.Victoria Memorial is also a monument.
• 3.I like to visit agra.
Continue..
Term Document Matrix
D1 D2 D3
• Taj 1 0 0
• mahal 1 0 0
• is 1 1 0
• a 1 1 1
• beautiful 1 0 0
• monument 1 1 0
• Victoria 0 1 0
• Memorial 0 1 0
• also 0 1 0
• I 0 0 1
• like 0 0 1
• to 0 0 1
• visit 0 0 1
• agra 0 0 1
In this matrix, the rows represent
the terms, and the columns
represent the documents (D1, D2,
and D3). The entries in the matrix
represent the term frequency in
each document. For example, the
term "Taj" appears once in
document D1 and not in documents
D2 and D3, so its entry in the
matrix is (1, 0, 0).

More Related Content

Similar to Information Retrieval basic presentation

Data processing.pdf
Data processing.pdfData processing.pdf
Data processing.pdf
MuthuLakshmi124949
 
Introduction to ms access database
Introduction to ms access databaseIntroduction to ms access database
Introduction to ms access database
Obuasi Senior High Technical School
 
Leveraging Electronic Resources for Academic Research (Prof. Adebayo Felix A...
Leveraging Electronic Resources for Academic Research  (Prof. Adebayo Felix A...Leveraging Electronic Resources for Academic Research  (Prof. Adebayo Felix A...
Leveraging Electronic Resources for Academic Research (Prof. Adebayo Felix A...
miracleAtianashie1
 
Realizing Semantic Web - Light Weight semantics and beyond
Realizing Semantic Web - Light Weight semantics and beyondRealizing Semantic Web - Light Weight semantics and beyond
Realizing Semantic Web - Light Weight semantics and beyond
Artificial Intelligence Institute at UofSC
 
CORE Analytics Dashboard
CORE Analytics DashboardCORE Analytics Dashboard
CORE Analytics Dashboard
petrknoth
 
C N I20080404
C N I20080404C N I20080404
C N I20080404
Anita de Waard
 
Torsten Reimer
Torsten ReimerTorsten Reimer
Torsten Reimer
Anita de Waard
 
Unit 3 part i Data mining
Unit 3 part i Data miningUnit 3 part i Data mining
Unit 3 part i Data mining
Dhilsath Fathima
 
Info 2402 irt-chapter_2
Info 2402 irt-chapter_2Info 2402 irt-chapter_2
Info 2402 irt-chapter_2
Shahriar Rafee
 
introduction-to-dbms-unit-1.ppt
introduction-to-dbms-unit-1.pptintroduction-to-dbms-unit-1.ppt
introduction-to-dbms-unit-1.ppt
rekhasai2468
 
Ch_2.pdf
Ch_2.pdfCh_2.pdf
Ch_2.pdf
DawitBirhanu13
 
T-SQL
T-SQLT-SQL
unit 1.pptx
unit 1.pptxunit 1.pptx
unit 1.pptx
sirishaYerraboina1
 
Data, Text and Web Mining
Data, Text and Web Mining Data, Text and Web Mining
Data, Text and Web Mining
Jeremiah Fadugba
 
Data mining-primitives-languages-and-system-architectures2641
Data mining-primitives-languages-and-system-architectures2641Data mining-primitives-languages-and-system-architectures2641
Data mining-primitives-languages-and-system-architectures2641
Aiswaryadevi Jaganmohan
 
Data mining concept and methods for basic
Data mining concept and methods for basicData mining concept and methods for basic
Data mining concept and methods for basic
NivaTripathy2
 
Qualitative data analysis
Qualitative data analysisQualitative data analysis
Qualitative data analysis
Ramakanta Mohalik
 
12 ipt 0202 Organisation methods
12 ipt 0202   Organisation methods12 ipt 0202   Organisation methods
12 ipt 0202 Organisation methods
ctedds
 
Digital Library
Digital LibraryDigital Library
Making abstract data visible
Making abstract data visibleMaking abstract data visible
Making abstract data visible
Priyanshi Jain
 

Similar to Information Retrieval basic presentation (20)

Data processing.pdf
Data processing.pdfData processing.pdf
Data processing.pdf
 
Introduction to ms access database
Introduction to ms access databaseIntroduction to ms access database
Introduction to ms access database
 
Leveraging Electronic Resources for Academic Research (Prof. Adebayo Felix A...
Leveraging Electronic Resources for Academic Research  (Prof. Adebayo Felix A...Leveraging Electronic Resources for Academic Research  (Prof. Adebayo Felix A...
Leveraging Electronic Resources for Academic Research (Prof. Adebayo Felix A...
 
Realizing Semantic Web - Light Weight semantics and beyond
Realizing Semantic Web - Light Weight semantics and beyondRealizing Semantic Web - Light Weight semantics and beyond
Realizing Semantic Web - Light Weight semantics and beyond
 
CORE Analytics Dashboard
CORE Analytics DashboardCORE Analytics Dashboard
CORE Analytics Dashboard
 
C N I20080404
C N I20080404C N I20080404
C N I20080404
 
Torsten Reimer
Torsten ReimerTorsten Reimer
Torsten Reimer
 
Unit 3 part i Data mining
Unit 3 part i Data miningUnit 3 part i Data mining
Unit 3 part i Data mining
 
Info 2402 irt-chapter_2
Info 2402 irt-chapter_2Info 2402 irt-chapter_2
Info 2402 irt-chapter_2
 
introduction-to-dbms-unit-1.ppt
introduction-to-dbms-unit-1.pptintroduction-to-dbms-unit-1.ppt
introduction-to-dbms-unit-1.ppt
 
Ch_2.pdf
Ch_2.pdfCh_2.pdf
Ch_2.pdf
 
T-SQL
T-SQLT-SQL
T-SQL
 
unit 1.pptx
unit 1.pptxunit 1.pptx
unit 1.pptx
 
Data, Text and Web Mining
Data, Text and Web Mining Data, Text and Web Mining
Data, Text and Web Mining
 
Data mining-primitives-languages-and-system-architectures2641
Data mining-primitives-languages-and-system-architectures2641Data mining-primitives-languages-and-system-architectures2641
Data mining-primitives-languages-and-system-architectures2641
 
Data mining concept and methods for basic
Data mining concept and methods for basicData mining concept and methods for basic
Data mining concept and methods for basic
 
Qualitative data analysis
Qualitative data analysisQualitative data analysis
Qualitative data analysis
 
12 ipt 0202 Organisation methods
12 ipt 0202   Organisation methods12 ipt 0202   Organisation methods
12 ipt 0202 Organisation methods
 
Digital Library
Digital LibraryDigital Library
Digital Library
 
Making abstract data visible
Making abstract data visibleMaking abstract data visible
Making abstract data visible
 

Recently uploaded

Computational Engineering IITH Presentation
Computational Engineering IITH PresentationComputational Engineering IITH Presentation
Computational Engineering IITH Presentation
co23btech11018
 
TIME DIVISION MULTIPLEXING TECHNIQUE FOR COMMUNICATION SYSTEM
TIME DIVISION MULTIPLEXING TECHNIQUE FOR COMMUNICATION SYSTEMTIME DIVISION MULTIPLEXING TECHNIQUE FOR COMMUNICATION SYSTEM
TIME DIVISION MULTIPLEXING TECHNIQUE FOR COMMUNICATION SYSTEM
HODECEDSIET
 
22CYT12-Unit-V-E Waste and its Management.ppt
22CYT12-Unit-V-E Waste and its Management.ppt22CYT12-Unit-V-E Waste and its Management.ppt
22CYT12-Unit-V-E Waste and its Management.ppt
KrishnaveniKrishnara1
 
Iron and Steel Technology Roadmap - Towards more sustainable steelmaking.pdf
Iron and Steel Technology Roadmap - Towards more sustainable steelmaking.pdfIron and Steel Technology Roadmap - Towards more sustainable steelmaking.pdf
Iron and Steel Technology Roadmap - Towards more sustainable steelmaking.pdf
RadiNasr
 
Optimizing Gradle Builds - Gradle DPE Tour Berlin 2024
Optimizing Gradle Builds - Gradle DPE Tour Berlin 2024Optimizing Gradle Builds - Gradle DPE Tour Berlin 2024
Optimizing Gradle Builds - Gradle DPE Tour Berlin 2024
Sinan KOZAK
 
ML Based Model for NIDS MSc Updated Presentation.v2.pptx
ML Based Model for NIDS MSc Updated Presentation.v2.pptxML Based Model for NIDS MSc Updated Presentation.v2.pptx
ML Based Model for NIDS MSc Updated Presentation.v2.pptx
JamalHussainArman
 
Eric Nizeyimana's document 2006 from gicumbi to ttc nyamata handball play
Eric Nizeyimana's document 2006 from gicumbi to ttc nyamata handball playEric Nizeyimana's document 2006 from gicumbi to ttc nyamata handball play
Eric Nizeyimana's document 2006 from gicumbi to ttc nyamata handball play
enizeyimana36
 
Manufacturing Process of molasses based distillery ppt.pptx
Manufacturing Process of molasses based distillery ppt.pptxManufacturing Process of molasses based distillery ppt.pptx
Manufacturing Process of molasses based distillery ppt.pptx
Madan Karki
 
Question paper of renewable energy sources
Question paper of renewable energy sourcesQuestion paper of renewable energy sources
Question paper of renewable energy sources
mahammadsalmanmech
 
KuberTENes Birthday Bash Guadalajara - K8sGPT first impressions
KuberTENes Birthday Bash Guadalajara - K8sGPT first impressionsKuberTENes Birthday Bash Guadalajara - K8sGPT first impressions
KuberTENes Birthday Bash Guadalajara - K8sGPT first impressions
Victor Morales
 
Casting-Defect-inSlab continuous casting.pdf
Casting-Defect-inSlab continuous casting.pdfCasting-Defect-inSlab continuous casting.pdf
Casting-Defect-inSlab continuous casting.pdf
zubairahmad848137
 
New techniques for characterising damage in rock slopes.pdf
New techniques for characterising damage in rock slopes.pdfNew techniques for characterising damage in rock slopes.pdf
New techniques for characterising damage in rock slopes.pdf
wisnuprabawa3
 
5214-1693458878915-Unit 6 2023 to 2024 academic year assignment (AutoRecovere...
5214-1693458878915-Unit 6 2023 to 2024 academic year assignment (AutoRecovere...5214-1693458878915-Unit 6 2023 to 2024 academic year assignment (AutoRecovere...
5214-1693458878915-Unit 6 2023 to 2024 academic year assignment (AutoRecovere...
ihlasbinance2003
 
2008 BUILDING CONSTRUCTION Illustrated - Ching Chapter 02 The Building.pdf
2008 BUILDING CONSTRUCTION Illustrated - Ching Chapter 02 The Building.pdf2008 BUILDING CONSTRUCTION Illustrated - Ching Chapter 02 The Building.pdf
2008 BUILDING CONSTRUCTION Illustrated - Ching Chapter 02 The Building.pdf
Yasser Mahgoub
 
Comparative analysis between traditional aquaponics and reconstructed aquapon...
Comparative analysis between traditional aquaponics and reconstructed aquapon...Comparative analysis between traditional aquaponics and reconstructed aquapon...
Comparative analysis between traditional aquaponics and reconstructed aquapon...
bijceesjournal
 
The Python for beginners. This is an advance computer language.
The Python for beginners. This is an advance computer language.The Python for beginners. This is an advance computer language.
The Python for beginners. This is an advance computer language.
sachin chaurasia
 
IEEE Aerospace and Electronic Systems Society as a Graduate Student Member
IEEE Aerospace and Electronic Systems Society as a Graduate Student MemberIEEE Aerospace and Electronic Systems Society as a Graduate Student Member
IEEE Aerospace and Electronic Systems Society as a Graduate Student Member
VICTOR MAESTRE RAMIREZ
 
Unit-III-ELECTROCHEMICAL STORAGE DEVICES.ppt
Unit-III-ELECTROCHEMICAL STORAGE DEVICES.pptUnit-III-ELECTROCHEMICAL STORAGE DEVICES.ppt
Unit-III-ELECTROCHEMICAL STORAGE DEVICES.ppt
KrishnaveniKrishnara1
 
Textile Chemical Processing and Dyeing.pdf
Textile Chemical Processing and Dyeing.pdfTextile Chemical Processing and Dyeing.pdf
Textile Chemical Processing and Dyeing.pdf
NazakatAliKhoso2
 
Literature Review Basics and Understanding Reference Management.pptx
Literature Review Basics and Understanding Reference Management.pptxLiterature Review Basics and Understanding Reference Management.pptx
Literature Review Basics and Understanding Reference Management.pptx
Dr Ramhari Poudyal
 

Recently uploaded (20)

Computational Engineering IITH Presentation
Computational Engineering IITH PresentationComputational Engineering IITH Presentation
Computational Engineering IITH Presentation
 
TIME DIVISION MULTIPLEXING TECHNIQUE FOR COMMUNICATION SYSTEM
TIME DIVISION MULTIPLEXING TECHNIQUE FOR COMMUNICATION SYSTEMTIME DIVISION MULTIPLEXING TECHNIQUE FOR COMMUNICATION SYSTEM
TIME DIVISION MULTIPLEXING TECHNIQUE FOR COMMUNICATION SYSTEM
 
22CYT12-Unit-V-E Waste and its Management.ppt
22CYT12-Unit-V-E Waste and its Management.ppt22CYT12-Unit-V-E Waste and its Management.ppt
22CYT12-Unit-V-E Waste and its Management.ppt
 
Iron and Steel Technology Roadmap - Towards more sustainable steelmaking.pdf
Iron and Steel Technology Roadmap - Towards more sustainable steelmaking.pdfIron and Steel Technology Roadmap - Towards more sustainable steelmaking.pdf
Iron and Steel Technology Roadmap - Towards more sustainable steelmaking.pdf
 
Optimizing Gradle Builds - Gradle DPE Tour Berlin 2024
Optimizing Gradle Builds - Gradle DPE Tour Berlin 2024Optimizing Gradle Builds - Gradle DPE Tour Berlin 2024
Optimizing Gradle Builds - Gradle DPE Tour Berlin 2024
 
ML Based Model for NIDS MSc Updated Presentation.v2.pptx
ML Based Model for NIDS MSc Updated Presentation.v2.pptxML Based Model for NIDS MSc Updated Presentation.v2.pptx
ML Based Model for NIDS MSc Updated Presentation.v2.pptx
 
Eric Nizeyimana's document 2006 from gicumbi to ttc nyamata handball play
Eric Nizeyimana's document 2006 from gicumbi to ttc nyamata handball playEric Nizeyimana's document 2006 from gicumbi to ttc nyamata handball play
Eric Nizeyimana's document 2006 from gicumbi to ttc nyamata handball play
 
Manufacturing Process of molasses based distillery ppt.pptx
Manufacturing Process of molasses based distillery ppt.pptxManufacturing Process of molasses based distillery ppt.pptx
Manufacturing Process of molasses based distillery ppt.pptx
 
Question paper of renewable energy sources
Question paper of renewable energy sourcesQuestion paper of renewable energy sources
Question paper of renewable energy sources
 
KuberTENes Birthday Bash Guadalajara - K8sGPT first impressions
KuberTENes Birthday Bash Guadalajara - K8sGPT first impressionsKuberTENes Birthday Bash Guadalajara - K8sGPT first impressions
KuberTENes Birthday Bash Guadalajara - K8sGPT first impressions
 
Casting-Defect-inSlab continuous casting.pdf
Casting-Defect-inSlab continuous casting.pdfCasting-Defect-inSlab continuous casting.pdf
Casting-Defect-inSlab continuous casting.pdf
 
New techniques for characterising damage in rock slopes.pdf
New techniques for characterising damage in rock slopes.pdfNew techniques for characterising damage in rock slopes.pdf
New techniques for characterising damage in rock slopes.pdf
 
5214-1693458878915-Unit 6 2023 to 2024 academic year assignment (AutoRecovere...
5214-1693458878915-Unit 6 2023 to 2024 academic year assignment (AutoRecovere...5214-1693458878915-Unit 6 2023 to 2024 academic year assignment (AutoRecovere...
5214-1693458878915-Unit 6 2023 to 2024 academic year assignment (AutoRecovere...
 
2008 BUILDING CONSTRUCTION Illustrated - Ching Chapter 02 The Building.pdf
2008 BUILDING CONSTRUCTION Illustrated - Ching Chapter 02 The Building.pdf2008 BUILDING CONSTRUCTION Illustrated - Ching Chapter 02 The Building.pdf
2008 BUILDING CONSTRUCTION Illustrated - Ching Chapter 02 The Building.pdf
 
Comparative analysis between traditional aquaponics and reconstructed aquapon...
Comparative analysis between traditional aquaponics and reconstructed aquapon...Comparative analysis between traditional aquaponics and reconstructed aquapon...
Comparative analysis between traditional aquaponics and reconstructed aquapon...
 
The Python for beginners. This is an advance computer language.
The Python for beginners. This is an advance computer language.The Python for beginners. This is an advance computer language.
The Python for beginners. This is an advance computer language.
 
IEEE Aerospace and Electronic Systems Society as a Graduate Student Member
IEEE Aerospace and Electronic Systems Society as a Graduate Student MemberIEEE Aerospace and Electronic Systems Society as a Graduate Student Member
IEEE Aerospace and Electronic Systems Society as a Graduate Student Member
 
Unit-III-ELECTROCHEMICAL STORAGE DEVICES.ppt
Unit-III-ELECTROCHEMICAL STORAGE DEVICES.pptUnit-III-ELECTROCHEMICAL STORAGE DEVICES.ppt
Unit-III-ELECTROCHEMICAL STORAGE DEVICES.ppt
 
Textile Chemical Processing and Dyeing.pdf
Textile Chemical Processing and Dyeing.pdfTextile Chemical Processing and Dyeing.pdf
Textile Chemical Processing and Dyeing.pdf
 
Literature Review Basics and Understanding Reference Management.pptx
Literature Review Basics and Understanding Reference Management.pptxLiterature Review Basics and Understanding Reference Management.pptx
Literature Review Basics and Understanding Reference Management.pptx
 

Information Retrieval basic presentation

  • 1. HINDUSTHAN INSTITUTE OF TECHNOLOGY DEPARTMENT OF COMPUER SCIENCE AND ENGINEERING 20CS512 – INFORMATION RETRIEVAL Dr. M. Thangamani
  • 3. Components of the IR • Document Collection: The document collection is the set of documents that the IRS indexes and searches. Documents can be in various formats, including text, images, audio, video, or a combination of these. • Preprocessing: Preprocessing involves transforming the raw documents into a form that can be easily indexed and searched. This may include tokenization, stemming, stop-word removal, and other text processing techniques. • Indexing: Indexing is the process of creating a data structure (such as an inverted index) that allows for efficient searching and retrieval of documents. The index maps terms or keywords to the documents in which they appear. • Query Processing: Query processing involves interpreting and transforming the user's query into a form that can be matched against the indexed documents. This may include parsing, tokenization, stemming, and other query processing techniques. • Search and Retrieval: Search and retrieval involves searching the index to find documents that match the query. This may involve ranking the documents based on their relevance to the query and returning the most relevant documents to the user. • User Interface: The user interface is the front end of the IRS that allows users to enter queries, view search results, and interact with the system. The user interface may include features such as faceted search, filtering, sorting, and visualization. • Relevance Feedback: Relevance feedback involves collecting feedback from users about the relevance of the retrieved documents and using this feedback to improve the search results. This may involve adjusting the search algorithm, updating the index, or re-ranking the documents.
  • 4. Process of querying, indexing and retrieval system
  • 5. Information Retrieval Model • basic information retrieval model. • Classical IR Model • Boolean Model • Vector Space Model • Probability Distribution Model
  • 6. Continue.. • Boolean Model: This model is based on Boolean logic, where a query is formulated using logical operators such as AND, OR, and NOT. The model returns documents that satisfy the Boolean expression specified in the query. • Vector Space Model (VSM): This model represents documents and queries as vectors in a high-dimensional space, where each dimension corresponds to a term. The similarity between a document and a query is measured using a similarity metric, such as the cosine similarity. • Probabilistic Model: This model estimates the probability of a document being relevant to a query based on the probability of the query terms appearing in the document. One of the most well-known probabilistic models is the Binary Independence Model (BIM).
  • 7. Visualization Interface • Scatter Plots: Scatter plots are used to display the relationship between two variables, and they can be used to show the distribution of search results based on two different dimensions, such as relevance and publication date. • Bar Charts: Bar charts are used to display the frequency or count of items in different categories, and they can be used to show the distribution of search results based on different facets, such as document type, author, or topic. • Line Charts: Line charts are used to display trends over time, and they can be used to show the distribution of search results based on a temporal dimension, such as publication date or time of access. • Heat Maps: Heat maps are used to display data in a matrix format, where the values are represented by colors. They can be used to show the distribution of search results based on two dimensions, such as relevance and publication date. • Word Clouds: Word clouds are used to display the frequency of terms in a text, with more frequent terms displayed in larger font sizes. They can be used to show the distribution of terms in the search results or to highlight the most relevant terms in a document. • Network Graphs: Network graphs are used to display relationships between entities, such as documents, authors, or topics. They can be used to show the connections between documents based on citations, references, or other relationships. • Map Visualizations: Map visualizations are used to display geographical data, and they can be used to show the distribution of search results based on geographical dimensions, such as location or region.
  • 8. Continue.. • Construct the term matrix for the following document and query Documemt : • 1.Taj mahal is a beautiful monument. • 2.Victoria Memorial is also a monument. • 3.I like to visit agra.
  • 9. • Construct the term matrix for the following document and query • Documemt : • 1.Taj mahal is a beautiful monument. • 2.Victoria Memorial is also a monument. • 3.I like to visit agra. Continue..
  • 10. Term Document Matrix D1 D2 D3 • Taj 1 0 0 • mahal 1 0 0 • is 1 1 0 • a 1 1 1 • beautiful 1 0 0 • monument 1 1 0 • Victoria 0 1 0 • Memorial 0 1 0 • also 0 1 0 • I 0 0 1 • like 0 0 1 • to 0 0 1 • visit 0 0 1 • agra 0 0 1 In this matrix, the rows represent the terms, and the columns represent the documents (D1, D2, and D3). The entries in the matrix represent the term frequency in each document. For example, the term "Taj" appears once in document D1 and not in documents D2 and D3, so its entry in the matrix is (1, 0, 0).