SlideShare a Scribd company logo
Indexing Language: Concept,
types & characteristics
Dr. Utpal Das
Dibrugarh University,
Dibrugarh, Assam
utpalishaan@gmail.com
Introduction:
A subject is then any concept or combination of concepts
which is expressed in the document. The readers’ task is
to interpret the words and sentences in the document in
order to understand the concepts. Whether a reader
understands a document depends on how precisely the
author expresses the concepts he refers to and whether
the reader is aware of the concepts the author expresses.
The basic idea is that the concepts exist before the
author writes the document and the reader reads the
document.
• Similarly, the indexer’s task is to identify concepts in the
document and re-express these in indexing terms. This is
done first by establishing the subject content, or in other
words the content of concepts in the document.
Thereafter the principal concept presented in the
subject content is identified, and finally, the concepts
are expressed in the indexing language. The indexing is
successful when the document and the indexing term
express the same concepts.
What is indexing languages?
The term ‘indexing languages’ may be understood as same as the term
‘indexing’ in the broader sense, that is, in a general sense.
 Indexing language is a set of items (vocabulary) and devices for
handling the relationships between them in a system for providing
index descriptions. Indexing language is also referred to as retrieval
language.
 Indexing language is the process of creating set of vocabularies that
helps to provide access to objects of information, books,
documents, articles, etc. Like any other language, it will consist of
two parts: vocabulary and syntax.
this process of creating and providing access to objects of
information could either be manual or through computer
technology.
The above definitions will help us define indexing language
in the following different ways:
• As terms or vocabularies used to represent
document or content of document which are extracted from
document text or assigned from authority list adhering some
process or techniques
• Serving as access points for searching
• Possibly being extracted or derived from document text:
natural language
• Possibly being assigned from authority control list:
controlled vocabulary
So in a nutshell
A system for naming subjects using subject-terms or
vocabularies and also devices for handling the
relationships between them to provide a systematic
index descriptions is called an indexing language.
Like any other language, it will consist of two
parts: vocabulary and syntax.
Again, we need to understand that:
If we use terms or vocabulary as they appear in documents
without modification, we are using natural language.
However, using natural language always may lead to
problems. Because, as per as vocabulary is concerned,
different authors may use different terms to express the same
idea or they may use synonyms to express same idea. If that is
so, it will lead to a decrease in recall while searching with any
one term (idea) appear in documents which is against the
whole purpose of indexing and retrieval.
For example: the same idea may be expressed in more than
one way as per syntax is concerned, like : paediatric or child
disease; geriatric or health care of old people; child
psychology or psychology of children; adult education or
education of adults.
For these reasons, assigned indexing systems introduce a
measure of control over the terms used: we use a controlled
vocabulary.
We also formalize a flexible syntax of natural language by
permitting only certain constructions, as for example, instead of
heat treatment of aluminium, we use aluminium-heat
treatment; instead of using libraries for children, or children’s
libraries, we use libraries, children’s. This is what called using
a structured language or controlled vocabulary
A controlled vocabulary and formalized structure are features of
an artificial indexing language.
The extreme example of an artificial indexing is the notation of
a classification scheme; instead of natural language terms, heat
treatment of aluminium, or the more formalized aluminium-
heat treatment, we use 669.71.04.
Once the subject analysis of the document is
completed, the final step is to represent the selected
concepts in the language of indexing system (as index
entries). The indexer should be familiar with the
indexing tools, and their working rules and procedures
in order to ensure that concepts are organized in a
usable and accessible form. The process of subject
indexing involves basically three steps:
Familiarization => Analysis => Representation
Let us now look at how indexing languages are actually
conceptualized and created.
All indexing languages originate as natural language, or the
language found in documents. Natural language does
not refer to writing style, but to the fact that the
language is not under authority control.
Language under authority control is called controlled
vocabulary. There is nothing special about the words in
controlled vocabulary except the fact that they are
standardized for use in certain systems.
The following diagram illustrates the processes involved in
translating natural language (NL) terms into controlled
vocabulary (CV) terms for entry in database records.
The diagram helps explain why . . .
1. Natural indexing languages are also called derived-term
approaches
2. Controlled indexing languages are also called assigned-
term approaches
Abstracting and Indexing Process
Processes Involved in Translating Natural Language Terms
into Controlled Vocabulary
Full-Text
Document
Abstract NL Record Field
NL Record Field
CV Record FieldAuthority File
Natural Language
Controlled
Vocabulary
Enter in
Enter in
Enter inChose from
Write into
To review, subject analysis requires you to
1. become familiar with document content;
2. extract significant concepts and terms;
3. translate extracted terms into the language—
often controlled—of the system; and
4. formalize the terms (format them, etc.) according
to input rules.
Types of Indexing languages
As the above discussion suggest, there are three
types of Indexing language
Natural Language or Natural indexing language
Controlled Vocabulary or Controlled indexing language
Free indexing language
Natural indexing language:
• This is a slightly broader language in which the description of the
document can be done using any of the terms present in the
document. Any term that is used to define or describe the content
within a document is known as a ‘subject term’. That is why
indexing language some time is called ‘subject indexing’ of
‘subject indexing language’.
• In Natural indexing language, a subject term can be used to
describe/search for a specific document based primarily on its
content.
• A subject term may also be described as a compact synonym or
surrogate for a specific subject representation.
Controlled indexing language:
• Controlled indexing language refers to the indexing language in
which only approved terms are allowed to be used to describe the
document. These subject terms are controlled vocabulary under
subject authority file.
• For subject terms under authority control (or vocabulary control), a
subject authority file or list . . .
may be described as a list of terms that are permitted to be
used in describing or representing specific subjects
May be said to standardize one of two synonyms that are
used to assign or represent specific topics
May be used to determine the preferred term when multiple
terms are used to define or describe a single topic
May be used to provide cross references for terms that are
on par with, hierarchical or alternate in position or
relationships
• Cataloguing and indexing professionals have created
different subject authority control structures:
Subject headings lists are used by cataloguers in cases
where subject terms have been used as subject headings.
A thesaurus is used by indexers where subject terms are
known as descriptors.
Free indexing language:
As the name suggests, this type of indexing language brings
into use any term within or outside the document for its
description.
In today’s times, the searching mechanism and trends have
changed and there is a higher use of free text search. This
demands that the natural language with the highest
possible indexing ideally indexing every text be done. Of
course, whether free text search or expert-driven well-
chosen vocabularies is being done to check which is more
efficient is a matter of research.
Here's how the processes differs for natural language and
controlled vocabulary:
Natural language Controlled vocabulary
Terms are based on existing
vocabulary of documents (which
may be inconsistent)
Terms are based on standardized
vocabulary intended to describe
concepts consistently
Indexers / cataloguers extract
terms from documents and
enter them (or their own terms)
in various subject fields extract
terms from documents,
Indexers / cataloguers choose
appropriate authorized terms from
controlled vocabulary list, and
enter terms in designated
controlled vocabulary field
Searchers may enter any search
terms that are likely to occur in
natural language
Searchers must enter search terms
that are in controlled vocabulary
Basics of Subject Indexing
MEANING:
In the literature of LIS, the phrases subject cataloguing and
subject indexing are used more or less interchangeably. But
it should be understood that subject cataloguing is
intended to embrace only that cataloguing activity which
provides a verbal subject approach to library collections,
especially macro documents (i.e. books). It refers
determining and assigning of suitable entries for the
subject component of a document for use in a library’s
catalogue, i.e. subject catalogue is a representation of
documents. The primary purpose of the subject catalogue
is to show which books on a specific subject are possessed
by the library.
Subject indexing refers to that indexing activity
which provides a verbal subject approach to
micro documents (e.g., journal articles, research
reports, patent literature, etc.). Subject indexing
provides a subject entry for every topic
associated with the content of a micro
document, i.e. subject index is a representation
the knowledge expressed by documents
The representation of documents and the knowledge
expressed by them is one of the central and unique areas
of study within Library and Information Science (LIS) and
is commonly referred to as subject indexing. Subject
approach to information has been a long and extensive
concern of librarianship and is assumed to be the major
approach (access method) of users for a very long
period. Indexes facilitate retrieval of information in both
traditional manual systems and newer computerised
systems. Without proper indexing and indexes, search
and retrieval are virtually impossible.
A subject is then any concept or combination of concepts
which is expressed in the document. The readers’ task is
to interpret the words and sentences in the document in
order to understand the concepts. Whether a reader
understands a document depends on how precisely the
author expresses the concepts he refers to and whether
the reader is aware of the concepts the author expresses.
The basic idea is that the concepts exist before the
author writes the document and the reader reads the
document.
END

More Related Content

What's hot

Library and information policy at national and international 1
Library and information policy at national and international 1Library and information policy at national and international 1
Library and information policy at national and international 1
saurabh kaushik
 
Library automation software
Library automation softwareLibrary automation software
Library automation software
Jancypriya M
 
Evaluation of library automation software
Evaluation of library automation softwareEvaluation of library automation software
Evaluation of library automation software
Anil T
 
Canons of library classification
Canons of library classificationCanons of library classification
Canons of library classification
Govt. P.G. College Sendhwa, Barwani (M.P.)
 
Informetrics final
Informetrics finalInformetrics final
Informetrics finalAamir Abbas
 
Z39.50: Information Retrieval protocol ppt
Z39.50: Information Retrieval protocol pptZ39.50: Information Retrieval protocol ppt
Z39.50: Information Retrieval protocol ppt
SUNILKUMARSINGH
 
Information Analysis Consolidation and Repackaging (IACR): an overview
Information Analysis Consolidation and Repackaging (IACR): an overviewInformation Analysis Consolidation and Repackaging (IACR): an overview
Information Analysis Consolidation and Repackaging (IACR): an overview
Indian Institute of Management Ahmedabad
 
CANONS OF CATALOGUING ppt
CANONS OF CATALOGUING pptCANONS OF CATALOGUING ppt
CANONS OF CATALOGUING ppt
University of Delhi
 
NISCAIR.pptx
NISCAIR.pptxNISCAIR.pptx
NISCAIR.pptx
DrIrfanulHaqAkhoon
 
Chain indexing
Chain indexingChain indexing
Chain indexingsilambu111
 
Dds
Dds Dds
Dds drrst
 
Canons for verbal and notational plane
Canons for verbal and notational planeCanons for verbal and notational plane
Canons for verbal and notational plane
Dr Shalini Lihitkar
 
Staff manual,lib.survey,statistics,standards.
Staff manual,lib.survey,statistics,standards.Staff manual,lib.survey,statistics,standards.
Staff manual,lib.survey,statistics,standards.ghulamsamdani
 
Structure of subject lit ppt
Structure of subject lit pptStructure of subject lit ppt
Web scale discovery service
Web scale discovery serviceWeb scale discovery service
Web scale discovery service
Kankana Baishya
 
Digital library software
Digital library softwareDigital library software
Digital library software
avid
 
CAS & SDI service
CAS & SDI serviceCAS & SDI service
Marc 21
Marc 21Marc 21
FEATURES OF DDC AND UDC ppt
FEATURES OF DDC AND UDC pptFEATURES OF DDC AND UDC ppt
FEATURES OF DDC AND UDC ppt
University of Delhi
 

What's hot (20)

Library and information policy at national and international 1
Library and information policy at national and international 1Library and information policy at national and international 1
Library and information policy at national and international 1
 
Library automation software
Library automation softwareLibrary automation software
Library automation software
 
Evaluation of library automation software
Evaluation of library automation softwareEvaluation of library automation software
Evaluation of library automation software
 
Canons of library classification
Canons of library classificationCanons of library classification
Canons of library classification
 
Informetrics final
Informetrics finalInformetrics final
Informetrics final
 
Uniterm indexing
Uniterm indexing Uniterm indexing
Uniterm indexing
 
Z39.50: Information Retrieval protocol ppt
Z39.50: Information Retrieval protocol pptZ39.50: Information Retrieval protocol ppt
Z39.50: Information Retrieval protocol ppt
 
Information Analysis Consolidation and Repackaging (IACR): an overview
Information Analysis Consolidation and Repackaging (IACR): an overviewInformation Analysis Consolidation and Repackaging (IACR): an overview
Information Analysis Consolidation and Repackaging (IACR): an overview
 
CANONS OF CATALOGUING ppt
CANONS OF CATALOGUING pptCANONS OF CATALOGUING ppt
CANONS OF CATALOGUING ppt
 
NISCAIR.pptx
NISCAIR.pptxNISCAIR.pptx
NISCAIR.pptx
 
Chain indexing
Chain indexingChain indexing
Chain indexing
 
Dds
Dds Dds
Dds
 
Canons for verbal and notational plane
Canons for verbal and notational planeCanons for verbal and notational plane
Canons for verbal and notational plane
 
Staff manual,lib.survey,statistics,standards.
Staff manual,lib.survey,statistics,standards.Staff manual,lib.survey,statistics,standards.
Staff manual,lib.survey,statistics,standards.
 
Structure of subject lit ppt
Structure of subject lit pptStructure of subject lit ppt
Structure of subject lit ppt
 
Web scale discovery service
Web scale discovery serviceWeb scale discovery service
Web scale discovery service
 
Digital library software
Digital library softwareDigital library software
Digital library software
 
CAS & SDI service
CAS & SDI serviceCAS & SDI service
CAS & SDI service
 
Marc 21
Marc 21Marc 21
Marc 21
 
FEATURES OF DDC AND UDC ppt
FEATURES OF DDC AND UDC pptFEATURES OF DDC AND UDC ppt
FEATURES OF DDC AND UDC ppt
 

Similar to Indexing language concept types and characteristics

Indexing languages (2)
Indexing languages (2)Indexing languages (2)
Indexing languages (2)yhen06
 
A Corpus-based Analysis of the Terminology of the Social Sciences and Humanit...
A Corpus-based Analysis of the Terminology of the Social Sciences and Humanit...A Corpus-based Analysis of the Terminology of the Social Sciences and Humanit...
A Corpus-based Analysis of the Terminology of the Social Sciences and Humanit...
Sarah Morrow
 
Controlled Vocabulary.pptx
Controlled Vocabulary.pptxControlled Vocabulary.pptx
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language ProcessingMariana Soffer
 
Lecture 7 Translation techniques of scientific texts.pptx
Lecture 7 Translation techniques of scientific texts.pptxLecture 7 Translation techniques of scientific texts.pptx
Lecture 7 Translation techniques of scientific texts.pptx
sabinafarmonova02
 
Beyond the sentence
Beyond the sentenceBeyond the sentence
Beyond the sentence
Melisa Berto
 
Subject Indexing & Techniques
Subject Indexing  & TechniquesSubject Indexing  & Techniques
Subject Indexing & Techniques
Dr. Utpal Das
 
Corpus study design
Corpus study designCorpus study design
Corpus study design
bikashtaly
 
Discourse analysis new
Discourse analysis newDiscourse analysis new
Discourse analysis new
Harry Subagyo
 
Controlled Vocabulary.pptx
Controlled Vocabulary.pptxControlled Vocabulary.pptx
Controlled Vocabulary.pptx
IhsanSani4
 
4 how to_search_traditional_academic_databases
4 how to_search_traditional_academic_databases4 how to_search_traditional_academic_databases
4 how to_search_traditional_academic_databases
keithstanger
 
englishforacademicandprofessionalpurposesppt1-200831160448 (1).pdf
englishforacademicandprofessionalpurposesppt1-200831160448 (1).pdfenglishforacademicandprofessionalpurposesppt1-200831160448 (1).pdf
englishforacademicandprofessionalpurposesppt1-200831160448 (1).pdf
GinaTabling1
 
English for academic and professional purposes ppt#1
English for academic and professional purposes ppt#1English for academic and professional purposes ppt#1
English for academic and professional purposes ppt#1
RanelRabago
 
PPT FOR M1.pdf
PPT FOR M1.pdfPPT FOR M1.pdf
PPT FOR M1.pdf
CrisonMagadan2
 
LANGUAGE.pptx
LANGUAGE.pptxLANGUAGE.pptx
LANGUAGE.pptx
RubenAgacio
 
Cl35491494
Cl35491494Cl35491494
Cl35491494
IJERA Editor
 
Kieli analytics
Kieli analyticsKieli analytics
Kieli analytics
Chakir Mahjoubi
 

Similar to Indexing language concept types and characteristics (20)

Indexing
IndexingIndexing
Indexing
 
Indexing languages (2)
Indexing languages (2)Indexing languages (2)
Indexing languages (2)
 
A Corpus-based Analysis of the Terminology of the Social Sciences and Humanit...
A Corpus-based Analysis of the Terminology of the Social Sciences and Humanit...A Corpus-based Analysis of the Terminology of the Social Sciences and Humanit...
A Corpus-based Analysis of the Terminology of the Social Sciences and Humanit...
 
Controlled Vocabulary.pptx
Controlled Vocabulary.pptxControlled Vocabulary.pptx
Controlled Vocabulary.pptx
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language Processing
 
Lecture 7 Translation techniques of scientific texts.pptx
Lecture 7 Translation techniques of scientific texts.pptxLecture 7 Translation techniques of scientific texts.pptx
Lecture 7 Translation techniques of scientific texts.pptx
 
Beyond the sentence
Beyond the sentenceBeyond the sentence
Beyond the sentence
 
Index Language.pptx
Index Language.pptxIndex Language.pptx
Index Language.pptx
 
Subject Indexing & Techniques
Subject Indexing  & TechniquesSubject Indexing  & Techniques
Subject Indexing & Techniques
 
Corpus study design
Corpus study designCorpus study design
Corpus study design
 
Discourse analysis new
Discourse analysis newDiscourse analysis new
Discourse analysis new
 
Controlled Vocabulary.pptx
Controlled Vocabulary.pptxControlled Vocabulary.pptx
Controlled Vocabulary.pptx
 
4 how to_search_traditional_academic_databases
4 how to_search_traditional_academic_databases4 how to_search_traditional_academic_databases
4 how to_search_traditional_academic_databases
 
englishforacademicandprofessionalpurposesppt1-200831160448 (1).pdf
englishforacademicandprofessionalpurposesppt1-200831160448 (1).pdfenglishforacademicandprofessionalpurposesppt1-200831160448 (1).pdf
englishforacademicandprofessionalpurposesppt1-200831160448 (1).pdf
 
English for academic and professional purposes ppt#1
English for academic and professional purposes ppt#1English for academic and professional purposes ppt#1
English for academic and professional purposes ppt#1
 
PPT FOR M1.pdf
PPT FOR M1.pdfPPT FOR M1.pdf
PPT FOR M1.pdf
 
LANGUAGE.pptx
LANGUAGE.pptxLANGUAGE.pptx
LANGUAGE.pptx
 
Cl35491494
Cl35491494Cl35491494
Cl35491494
 
Kieli analytics
Kieli analyticsKieli analytics
Kieli analytics
 
NLP todo
NLP todoNLP todo
NLP todo
 

More from Dr. Utpal Das

Metrics h-Index, g-Index, Altmetrics.pptx
Metrics h-Index, g-Index, Altmetrics.pptxMetrics h-Index, g-Index, Altmetrics.pptx
Metrics h-Index, g-Index, Altmetrics.pptx
Dr. Utpal Das
 
Citation Database
Citation Database Citation Database
Citation Database
Dr. Utpal Das
 
Plagiarism and its relevance in academics.pptx
Plagiarism and its relevance in academics.pptxPlagiarism and its relevance in academics.pptx
Plagiarism and its relevance in academics.pptx
Dr. Utpal Das
 
Understanding IPR and Copyright Law Presentation Jorhat Kendriya Mahavidyalay...
Understanding IPR and Copyright Law Presentation Jorhat Kendriya Mahavidyalay...Understanding IPR and Copyright Law Presentation Jorhat Kendriya Mahavidyalay...
Understanding IPR and Copyright Law Presentation Jorhat Kendriya Mahavidyalay...
Dr. Utpal Das
 
How to avoid plagiarism while thesis writing.pptx
How to avoid plagiarism while thesis writing.pptxHow to avoid plagiarism while thesis writing.pptx
How to avoid plagiarism while thesis writing.pptx
Dr. Utpal Das
 
Role of College Libraries in meeting user’s information needs issues and chal...
Role of College Libraries in meeting user’s information needs issues and chal...Role of College Libraries in meeting user’s information needs issues and chal...
Role of College Libraries in meeting user’s information needs issues and chal...
Dr. Utpal Das
 
Avoiding plagiarism in this era of digital availability
Avoiding plagiarism in this era of digital availabilityAvoiding plagiarism in this era of digital availability
Avoiding plagiarism in this era of digital availability
Dr. Utpal Das
 
Plagiarism in HEI and how to avoid it
Plagiarism in HEI and how to avoid it Plagiarism in HEI and how to avoid it
Plagiarism in HEI and how to avoid it
Dr. Utpal Das
 
Confronting ethical issues in research for avoiding plagiarism
Confronting ethical issues in research for avoiding plagiarismConfronting ethical issues in research for avoiding plagiarism
Confronting ethical issues in research for avoiding plagiarism
Dr. Utpal Das
 
Confronting ethical issues in research for avoiding plagiarism
Confronting ethical issues in research for avoiding plagiarismConfronting ethical issues in research for avoiding plagiarism
Confronting ethical issues in research for avoiding plagiarism
Dr. Utpal Das
 
Truth, fact and ethics in academic research
Truth, fact and ethics in academic researchTruth, fact and ethics in academic research
Truth, fact and ethics in academic research
Dr. Utpal Das
 
Ethics in academic research: avoiding plagiarism
Ethics in academic research: avoiding plagiarismEthics in academic research: avoiding plagiarism
Ethics in academic research: avoiding plagiarism
Dr. Utpal Das
 
Success and growth of Dibrugarh University Library during new normal
Success and growth of Dibrugarh University Library during new normalSuccess and growth of Dibrugarh University Library during new normal
Success and growth of Dibrugarh University Library during new normal
Dr. Utpal Das
 
Information seeking and information use behaviour in libraries
Information seeking  and information use behaviour in librariesInformation seeking  and information use behaviour in libraries
Information seeking and information use behaviour in libraries
Dr. Utpal Das
 
Information literacy
Information literacyInformation literacy
Information literacy
Dr. Utpal Das
 
Chemical factors of deterioration of documents
Chemical factors of deterioration of documentsChemical factors of deterioration of documents
Chemical factors of deterioration of documents
Dr. Utpal Das
 
Remedies for biological deterioration of wood origin documentary heritage
Remedies for biological deterioration of wood origin documentary heritageRemedies for biological deterioration of wood origin documentary heritage
Remedies for biological deterioration of wood origin documentary heritage
Dr. Utpal Das
 
Definition, factors and actions of preservation of Manuscripts
Definition, factors and actions of preservation of ManuscriptsDefinition, factors and actions of preservation of Manuscripts
Definition, factors and actions of preservation of Manuscripts
Dr. Utpal Das
 
Manuscripts: Concept, Importance and History of manuscripts in Assam
Manuscripts: Concept, Importance and History of manuscripts in AssamManuscripts: Concept, Importance and History of manuscripts in Assam
Manuscripts: Concept, Importance and History of manuscripts in Assam
Dr. Utpal Das
 
Information storage and retrieval
Information storage and  retrievalInformation storage and  retrieval
Information storage and retrieval
Dr. Utpal Das
 

More from Dr. Utpal Das (20)

Metrics h-Index, g-Index, Altmetrics.pptx
Metrics h-Index, g-Index, Altmetrics.pptxMetrics h-Index, g-Index, Altmetrics.pptx
Metrics h-Index, g-Index, Altmetrics.pptx
 
Citation Database
Citation Database Citation Database
Citation Database
 
Plagiarism and its relevance in academics.pptx
Plagiarism and its relevance in academics.pptxPlagiarism and its relevance in academics.pptx
Plagiarism and its relevance in academics.pptx
 
Understanding IPR and Copyright Law Presentation Jorhat Kendriya Mahavidyalay...
Understanding IPR and Copyright Law Presentation Jorhat Kendriya Mahavidyalay...Understanding IPR and Copyright Law Presentation Jorhat Kendriya Mahavidyalay...
Understanding IPR and Copyright Law Presentation Jorhat Kendriya Mahavidyalay...
 
How to avoid plagiarism while thesis writing.pptx
How to avoid plagiarism while thesis writing.pptxHow to avoid plagiarism while thesis writing.pptx
How to avoid plagiarism while thesis writing.pptx
 
Role of College Libraries in meeting user’s information needs issues and chal...
Role of College Libraries in meeting user’s information needs issues and chal...Role of College Libraries in meeting user’s information needs issues and chal...
Role of College Libraries in meeting user’s information needs issues and chal...
 
Avoiding plagiarism in this era of digital availability
Avoiding plagiarism in this era of digital availabilityAvoiding plagiarism in this era of digital availability
Avoiding plagiarism in this era of digital availability
 
Plagiarism in HEI and how to avoid it
Plagiarism in HEI and how to avoid it Plagiarism in HEI and how to avoid it
Plagiarism in HEI and how to avoid it
 
Confronting ethical issues in research for avoiding plagiarism
Confronting ethical issues in research for avoiding plagiarismConfronting ethical issues in research for avoiding plagiarism
Confronting ethical issues in research for avoiding plagiarism
 
Confronting ethical issues in research for avoiding plagiarism
Confronting ethical issues in research for avoiding plagiarismConfronting ethical issues in research for avoiding plagiarism
Confronting ethical issues in research for avoiding plagiarism
 
Truth, fact and ethics in academic research
Truth, fact and ethics in academic researchTruth, fact and ethics in academic research
Truth, fact and ethics in academic research
 
Ethics in academic research: avoiding plagiarism
Ethics in academic research: avoiding plagiarismEthics in academic research: avoiding plagiarism
Ethics in academic research: avoiding plagiarism
 
Success and growth of Dibrugarh University Library during new normal
Success and growth of Dibrugarh University Library during new normalSuccess and growth of Dibrugarh University Library during new normal
Success and growth of Dibrugarh University Library during new normal
 
Information seeking and information use behaviour in libraries
Information seeking  and information use behaviour in librariesInformation seeking  and information use behaviour in libraries
Information seeking and information use behaviour in libraries
 
Information literacy
Information literacyInformation literacy
Information literacy
 
Chemical factors of deterioration of documents
Chemical factors of deterioration of documentsChemical factors of deterioration of documents
Chemical factors of deterioration of documents
 
Remedies for biological deterioration of wood origin documentary heritage
Remedies for biological deterioration of wood origin documentary heritageRemedies for biological deterioration of wood origin documentary heritage
Remedies for biological deterioration of wood origin documentary heritage
 
Definition, factors and actions of preservation of Manuscripts
Definition, factors and actions of preservation of ManuscriptsDefinition, factors and actions of preservation of Manuscripts
Definition, factors and actions of preservation of Manuscripts
 
Manuscripts: Concept, Importance and History of manuscripts in Assam
Manuscripts: Concept, Importance and History of manuscripts in AssamManuscripts: Concept, Importance and History of manuscripts in Assam
Manuscripts: Concept, Importance and History of manuscripts in Assam
 
Information storage and retrieval
Information storage and  retrievalInformation storage and  retrieval
Information storage and retrieval
 

Recently uploaded

Palestine last event orientationfvgnh .pptx
Palestine last event orientationfvgnh .pptxPalestine last event orientationfvgnh .pptx
Palestine last event orientationfvgnh .pptx
RaedMohamed3
 
Sectors of the Indian Economy - Class 10 Study Notes pdf
Sectors of the Indian Economy - Class 10 Study Notes pdfSectors of the Indian Economy - Class 10 Study Notes pdf
Sectors of the Indian Economy - Class 10 Study Notes pdf
Vivekanand Anglo Vedic Academy
 
Ethnobotany and Ethnopharmacology ......
Ethnobotany and Ethnopharmacology ......Ethnobotany and Ethnopharmacology ......
Ethnobotany and Ethnopharmacology ......
Ashokrao Mane college of Pharmacy Peth-Vadgaon
 
Digital Tools and AI for Teaching Learning and Research
Digital Tools and AI for Teaching Learning and ResearchDigital Tools and AI for Teaching Learning and Research
Digital Tools and AI for Teaching Learning and Research
Vikramjit Singh
 
PART A. Introduction to Costumer Service
PART A. Introduction to Costumer ServicePART A. Introduction to Costumer Service
PART A. Introduction to Costumer Service
PedroFerreira53928
 
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXXPhrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
MIRIAMSALINAS13
 
Sha'Carri Richardson Presentation 202345
Sha'Carri Richardson Presentation 202345Sha'Carri Richardson Presentation 202345
Sha'Carri Richardson Presentation 202345
beazzy04
 
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdf
Welcome to TechSoup   New Member Orientation and Q&A (May 2024).pdfWelcome to TechSoup   New Member Orientation and Q&A (May 2024).pdf
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdf
TechSoup
 
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
siemaillard
 
Introduction to Quality Improvement Essentials
Introduction to Quality Improvement EssentialsIntroduction to Quality Improvement Essentials
Introduction to Quality Improvement Essentials
Excellence Foundation for South Sudan
 
1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx
JosvitaDsouza2
 
The approach at University of Liverpool.pptx
The approach at University of Liverpool.pptxThe approach at University of Liverpool.pptx
The approach at University of Liverpool.pptx
Jisc
 
Students, digital devices and success - Andreas Schleicher - 27 May 2024..pptx
Students, digital devices and success - Andreas Schleicher - 27 May 2024..pptxStudents, digital devices and success - Andreas Schleicher - 27 May 2024..pptx
Students, digital devices and success - Andreas Schleicher - 27 May 2024..pptx
EduSkills OECD
 
TESDA TM1 REVIEWER FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
TESDA TM1 REVIEWER  FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...TESDA TM1 REVIEWER  FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
TESDA TM1 REVIEWER FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
EugeneSaldivar
 
The Roman Empire A Historical Colossus.pdf
The Roman Empire A Historical Colossus.pdfThe Roman Empire A Historical Colossus.pdf
The Roman Empire A Historical Colossus.pdf
kaushalkr1407
 
How to Break the cycle of negative Thoughts
How to Break the cycle of negative ThoughtsHow to Break the cycle of negative Thoughts
How to Break the cycle of negative Thoughts
Col Mukteshwar Prasad
 
The geography of Taylor Swift - some ideas
The geography of Taylor Swift - some ideasThe geography of Taylor Swift - some ideas
The geography of Taylor Swift - some ideas
GeoBlogs
 
special B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdfspecial B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdf
Special education needs
 
The Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official PublicationThe Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official Publication
Delapenabediema
 
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCECLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
BhavyaRajput3
 

Recently uploaded (20)

Palestine last event orientationfvgnh .pptx
Palestine last event orientationfvgnh .pptxPalestine last event orientationfvgnh .pptx
Palestine last event orientationfvgnh .pptx
 
Sectors of the Indian Economy - Class 10 Study Notes pdf
Sectors of the Indian Economy - Class 10 Study Notes pdfSectors of the Indian Economy - Class 10 Study Notes pdf
Sectors of the Indian Economy - Class 10 Study Notes pdf
 
Ethnobotany and Ethnopharmacology ......
Ethnobotany and Ethnopharmacology ......Ethnobotany and Ethnopharmacology ......
Ethnobotany and Ethnopharmacology ......
 
Digital Tools and AI for Teaching Learning and Research
Digital Tools and AI for Teaching Learning and ResearchDigital Tools and AI for Teaching Learning and Research
Digital Tools and AI for Teaching Learning and Research
 
PART A. Introduction to Costumer Service
PART A. Introduction to Costumer ServicePART A. Introduction to Costumer Service
PART A. Introduction to Costumer Service
 
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXXPhrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
 
Sha'Carri Richardson Presentation 202345
Sha'Carri Richardson Presentation 202345Sha'Carri Richardson Presentation 202345
Sha'Carri Richardson Presentation 202345
 
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdf
Welcome to TechSoup   New Member Orientation and Q&A (May 2024).pdfWelcome to TechSoup   New Member Orientation and Q&A (May 2024).pdf
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdf
 
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
 
Introduction to Quality Improvement Essentials
Introduction to Quality Improvement EssentialsIntroduction to Quality Improvement Essentials
Introduction to Quality Improvement Essentials
 
1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx
 
The approach at University of Liverpool.pptx
The approach at University of Liverpool.pptxThe approach at University of Liverpool.pptx
The approach at University of Liverpool.pptx
 
Students, digital devices and success - Andreas Schleicher - 27 May 2024..pptx
Students, digital devices and success - Andreas Schleicher - 27 May 2024..pptxStudents, digital devices and success - Andreas Schleicher - 27 May 2024..pptx
Students, digital devices and success - Andreas Schleicher - 27 May 2024..pptx
 
TESDA TM1 REVIEWER FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
TESDA TM1 REVIEWER  FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...TESDA TM1 REVIEWER  FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
TESDA TM1 REVIEWER FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
 
The Roman Empire A Historical Colossus.pdf
The Roman Empire A Historical Colossus.pdfThe Roman Empire A Historical Colossus.pdf
The Roman Empire A Historical Colossus.pdf
 
How to Break the cycle of negative Thoughts
How to Break the cycle of negative ThoughtsHow to Break the cycle of negative Thoughts
How to Break the cycle of negative Thoughts
 
The geography of Taylor Swift - some ideas
The geography of Taylor Swift - some ideasThe geography of Taylor Swift - some ideas
The geography of Taylor Swift - some ideas
 
special B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdfspecial B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdf
 
The Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official PublicationThe Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official Publication
 
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCECLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
 

Indexing language concept types and characteristics

  • 1. Indexing Language: Concept, types & characteristics Dr. Utpal Das Dibrugarh University, Dibrugarh, Assam utpalishaan@gmail.com
  • 2. Introduction: A subject is then any concept or combination of concepts which is expressed in the document. The readers’ task is to interpret the words and sentences in the document in order to understand the concepts. Whether a reader understands a document depends on how precisely the author expresses the concepts he refers to and whether the reader is aware of the concepts the author expresses. The basic idea is that the concepts exist before the author writes the document and the reader reads the document.
  • 3. • Similarly, the indexer’s task is to identify concepts in the document and re-express these in indexing terms. This is done first by establishing the subject content, or in other words the content of concepts in the document. Thereafter the principal concept presented in the subject content is identified, and finally, the concepts are expressed in the indexing language. The indexing is successful when the document and the indexing term express the same concepts.
  • 4. What is indexing languages? The term ‘indexing languages’ may be understood as same as the term ‘indexing’ in the broader sense, that is, in a general sense.  Indexing language is a set of items (vocabulary) and devices for handling the relationships between them in a system for providing index descriptions. Indexing language is also referred to as retrieval language.  Indexing language is the process of creating set of vocabularies that helps to provide access to objects of information, books, documents, articles, etc. Like any other language, it will consist of two parts: vocabulary and syntax. this process of creating and providing access to objects of information could either be manual or through computer technology.
  • 5. The above definitions will help us define indexing language in the following different ways: • As terms or vocabularies used to represent document or content of document which are extracted from document text or assigned from authority list adhering some process or techniques • Serving as access points for searching • Possibly being extracted or derived from document text: natural language • Possibly being assigned from authority control list: controlled vocabulary
  • 6. So in a nutshell A system for naming subjects using subject-terms or vocabularies and also devices for handling the relationships between them to provide a systematic index descriptions is called an indexing language. Like any other language, it will consist of two parts: vocabulary and syntax.
  • 7. Again, we need to understand that: If we use terms or vocabulary as they appear in documents without modification, we are using natural language. However, using natural language always may lead to problems. Because, as per as vocabulary is concerned, different authors may use different terms to express the same idea or they may use synonyms to express same idea. If that is so, it will lead to a decrease in recall while searching with any one term (idea) appear in documents which is against the whole purpose of indexing and retrieval. For example: the same idea may be expressed in more than one way as per syntax is concerned, like : paediatric or child disease; geriatric or health care of old people; child psychology or psychology of children; adult education or education of adults.
  • 8. For these reasons, assigned indexing systems introduce a measure of control over the terms used: we use a controlled vocabulary. We also formalize a flexible syntax of natural language by permitting only certain constructions, as for example, instead of heat treatment of aluminium, we use aluminium-heat treatment; instead of using libraries for children, or children’s libraries, we use libraries, children’s. This is what called using a structured language or controlled vocabulary A controlled vocabulary and formalized structure are features of an artificial indexing language. The extreme example of an artificial indexing is the notation of a classification scheme; instead of natural language terms, heat treatment of aluminium, or the more formalized aluminium- heat treatment, we use 669.71.04.
  • 9. Once the subject analysis of the document is completed, the final step is to represent the selected concepts in the language of indexing system (as index entries). The indexer should be familiar with the indexing tools, and their working rules and procedures in order to ensure that concepts are organized in a usable and accessible form. The process of subject indexing involves basically three steps: Familiarization => Analysis => Representation
  • 10. Let us now look at how indexing languages are actually conceptualized and created. All indexing languages originate as natural language, or the language found in documents. Natural language does not refer to writing style, but to the fact that the language is not under authority control. Language under authority control is called controlled vocabulary. There is nothing special about the words in controlled vocabulary except the fact that they are standardized for use in certain systems.
  • 11. The following diagram illustrates the processes involved in translating natural language (NL) terms into controlled vocabulary (CV) terms for entry in database records. The diagram helps explain why . . . 1. Natural indexing languages are also called derived-term approaches 2. Controlled indexing languages are also called assigned- term approaches
  • 12. Abstracting and Indexing Process Processes Involved in Translating Natural Language Terms into Controlled Vocabulary Full-Text Document Abstract NL Record Field NL Record Field CV Record FieldAuthority File Natural Language Controlled Vocabulary Enter in Enter in Enter inChose from Write into
  • 13. To review, subject analysis requires you to 1. become familiar with document content; 2. extract significant concepts and terms; 3. translate extracted terms into the language— often controlled—of the system; and 4. formalize the terms (format them, etc.) according to input rules.
  • 14. Types of Indexing languages As the above discussion suggest, there are three types of Indexing language Natural Language or Natural indexing language Controlled Vocabulary or Controlled indexing language Free indexing language
  • 15. Natural indexing language: • This is a slightly broader language in which the description of the document can be done using any of the terms present in the document. Any term that is used to define or describe the content within a document is known as a ‘subject term’. That is why indexing language some time is called ‘subject indexing’ of ‘subject indexing language’. • In Natural indexing language, a subject term can be used to describe/search for a specific document based primarily on its content. • A subject term may also be described as a compact synonym or surrogate for a specific subject representation.
  • 16. Controlled indexing language: • Controlled indexing language refers to the indexing language in which only approved terms are allowed to be used to describe the document. These subject terms are controlled vocabulary under subject authority file. • For subject terms under authority control (or vocabulary control), a subject authority file or list . . . may be described as a list of terms that are permitted to be used in describing or representing specific subjects May be said to standardize one of two synonyms that are used to assign or represent specific topics May be used to determine the preferred term when multiple terms are used to define or describe a single topic May be used to provide cross references for terms that are on par with, hierarchical or alternate in position or relationships
  • 17. • Cataloguing and indexing professionals have created different subject authority control structures: Subject headings lists are used by cataloguers in cases where subject terms have been used as subject headings. A thesaurus is used by indexers where subject terms are known as descriptors.
  • 18. Free indexing language: As the name suggests, this type of indexing language brings into use any term within or outside the document for its description. In today’s times, the searching mechanism and trends have changed and there is a higher use of free text search. This demands that the natural language with the highest possible indexing ideally indexing every text be done. Of course, whether free text search or expert-driven well- chosen vocabularies is being done to check which is more efficient is a matter of research.
  • 19. Here's how the processes differs for natural language and controlled vocabulary: Natural language Controlled vocabulary Terms are based on existing vocabulary of documents (which may be inconsistent) Terms are based on standardized vocabulary intended to describe concepts consistently Indexers / cataloguers extract terms from documents and enter them (or their own terms) in various subject fields extract terms from documents, Indexers / cataloguers choose appropriate authorized terms from controlled vocabulary list, and enter terms in designated controlled vocabulary field Searchers may enter any search terms that are likely to occur in natural language Searchers must enter search terms that are in controlled vocabulary
  • 20. Basics of Subject Indexing MEANING: In the literature of LIS, the phrases subject cataloguing and subject indexing are used more or less interchangeably. But it should be understood that subject cataloguing is intended to embrace only that cataloguing activity which provides a verbal subject approach to library collections, especially macro documents (i.e. books). It refers determining and assigning of suitable entries for the subject component of a document for use in a library’s catalogue, i.e. subject catalogue is a representation of documents. The primary purpose of the subject catalogue is to show which books on a specific subject are possessed by the library.
  • 21. Subject indexing refers to that indexing activity which provides a verbal subject approach to micro documents (e.g., journal articles, research reports, patent literature, etc.). Subject indexing provides a subject entry for every topic associated with the content of a micro document, i.e. subject index is a representation the knowledge expressed by documents
  • 22. The representation of documents and the knowledge expressed by them is one of the central and unique areas of study within Library and Information Science (LIS) and is commonly referred to as subject indexing. Subject approach to information has been a long and extensive concern of librarianship and is assumed to be the major approach (access method) of users for a very long period. Indexes facilitate retrieval of information in both traditional manual systems and newer computerised systems. Without proper indexing and indexes, search and retrieval are virtually impossible.
  • 23. A subject is then any concept or combination of concepts which is expressed in the document. The readers’ task is to interpret the words and sentences in the document in order to understand the concepts. Whether a reader understands a document depends on how precisely the author expresses the concepts he refers to and whether the reader is aware of the concepts the author expresses. The basic idea is that the concepts exist before the author writes the document and the reader reads the document.
  • 24. END