SlideShare a Scribd company logo
DATA MINING AND ITS APPLICATION
IN LIBRARYAND INFORMATION
SCIENCE
Dr. Santosh Kumar Tunga
[ORCID: 0000-0001-5534-4861]
Librarian, R B C Evening College, Naihati
North 24 Parganas, West Bengal, India
tungask@rediffmail.com
20-05-2021 1
Data Mining and Its Application in
LIS/Dr. Santosh Kumar Tunga
 Database Management Systems gave access to the
data stored but this was only a small part of what
could be gained from the data. Traditional on line
transaction processing system are good at putting
data into databases quickly, safely and efficiently
but are not good at delivering meaningful analysis
in return. Extraction of interesting information or
patterns from data in large database is known as
Data Mining.
20-05-2021 2
Data Mining and Its Application in
LIS/Dr. Santosh Kumar Tunga
 Data Mining, also called as data archeology, data
dredging data harvesting, is the process of
extracting hidden knowledge from large volumes
of raw data and using it to make crucial library
decisions. This is where Data Mining or
knowledge Discovery in Databases has obvious
benefit for any enterprise
20-05-2021 3
Data Mining and Its Application in
LIS/Dr. Santosh Kumar Tunga
 DM or Knowledge Discover in Database is
concerned with the analysis of data and the use of
software techniques for finding patterns and
regularities in sets of data.
 Extraction of interesting information or patterns
from data in large database is known as DM.
20-05-2021 4
Data Mining and Its Application in
LIS/Dr. Santosh Kumar Tunga
 DM is the process of analyzing large amount of
data in search of previously undiscovered
information pattern.
 DM refers to using a variety of techniques to
identify nuggets of information or decision making
knowledge in bodies of data and extracting these in
such a way that they can be put to use in the areas
such as decision support prediction, forecasting
and estimation
20-05-2021 5
Data Mining and Its Application in
LIS/Dr. Santosh Kumar Tunga
 Many Digital Libraries utilize Data Mining technology in
order to facilitate advanced future prospective services and
also DL users research.
 A couple of Digital Libraries use data mining technology to
analysis GIS data and a data services unit facilities user’s
analysis of social sciences computing data.
 Data Mining tools are use to perform text analysis and
clustering in large Open Archives Initiative Meta data
repositories.
20-05-2021 6
Data Mining and Its Application in
LIS/Dr. Santosh Kumar Tunga
 Many Digital Libraries (DLs) and Information centers
use data mining technology to facilitate their library's
own administrative or strategic purposes.
 To modernize own existing DLs Data Mining play an
important role.
 Data Mining is a step within the KDD process through
which an organization's data assets are processed and
analyzed to gain insights to assist decision making.
20-05-2021 7
Data Mining and Its Application in
LIS/Dr. Santosh Kumar Tunga
Fig-1 Multi- disciplinary approach in Data Mining
20-05-2021 8
Data Mining and Its Application in
LIS/Dr. Santosh Kumar Tunga
5. Interpretation Knowledge
4. Data Mining / Pattern Recognition
3. Transformation
2. Data Enrichment
1. Data selection and
cleaning
Fig 2. An Overview of the steps that compose the Data Mining Process
20-05-2021 9
Data Mining and Its Application in
LIS/Dr. Santosh Kumar Tunga
 Data Selection
There are two parts to selecting data for data mining.
The first part, locating data, tends to be more
mechanical in nature than the second part, identifying
data, which requires significant input by a domain
expert for the data.
 Data Cleaning
Data cleaning is the process of ensuring that for data
mining purposes, the data is uniform in terms of key
and attribute usage. Data cleaning attempts to correct
misused or incorrect attributes in existing data.
20-05-2021 10
Data Mining and Its Application in
LIS/Dr. Santosh Kumar Tunga
 Data Enrichment (DE)
DE is the process of adding new attributes such as
calculated fields or data from external sources to existing
data. It involves adding information to existing data. This
can include combining internal data with external data,
obtained from either different departmental libraries or
information centers.
 Data Transformation (DT)
DT is the process of changing the form or structure of
existing attributes. It is separate from data cleaning and
data enrichment for data mining purposes because it does
not correct existing attribute data or add new attributes but
instead grooms existing attributes for data mining
purposes.
20-05-2021 11
Data Mining and Its Application in
LIS/Dr. Santosh Kumar Tunga
 Data Mining (DM)
DM is not so much a single technique as the idea that there
is more knowledge hidden in the data than shows itself on
the surface. From this point of view DM is really an
anything goes affair.
 Visualization/ Interpretation/ Evaluation
Visualization techniques are a very useful method
discovering patterns in datasets. Advanced graphical
techniques virtual reality enable people to wander through
artificial spaces, white historic development of data sets can
be displayed as a kind of animated movies.
20-05-2021 12
Data Mining and Its Application in
LIS/Dr. Santosh Kumar Tunga
Fig-3 Architecture of Data Mining
20-05-2021 13
Data Mining and Its Application in
LIS/Dr. Santosh Kumar Tunga
 Database, Data Warehouse or other Information
Repository:
This is one or a set of databases, data warehouses,
spreadsheets or other kinds of information repositories.
Data cleaning and data integration techniques may be
performed on the data.
 Database or Data Warehouse Server:
The database or data warehouse server is responsible for
fetching the relevant data, based on the user's data mining
request.
20-05-2021 14
Data Mining and Its Application in
LIS/Dr. Santosh Kumar Tunga
 Knowledge Base
This is domain knowledge that is used to guide the search
or evaluate the interestingness of resulting patterns. Such
knowledge can include concept hierarchies, used to
organize attributes values into different levels of
abstraction.
 Data Mining Engine
This is essential to the data mining system and ideally
consists of a set of functional modules for tasks such as
characterization, association, classification, cluster
analysis and evaluation and deviation analysis.
20-05-2021 15
Data Mining and Its Application in
LIS/Dr. Santosh Kumar Tunga
 Pattern evaluation module
This component typically employs interestingness
measures and interacts with the data mining
modules so as to focus the search towards
interesting patterns.
 Graphical user Interface
This module allows the user to browse database
and data warehouse schemes or structures;
evaluate mined patterns and visualize the patterns
in different forms.
20-05-2021 16
Data Mining and Its Application in
LIS/Dr. Santosh Kumar Tunga
 Identify library customers who will be most receptive to
new service offers.
 Identify the most effective library customers.
 Find out the library customers who will opt for each type
of service in the following year.
 Predict library customer reaction to the change of library
services.
 Determine customer preference of the different modes of
library transaction.
20-05-2021 17
Data Mining and Its Application in
LIS/Dr. Santosh Kumar Tunga
 Analyzing the most frequently prescribed useful
documents.
 Demand analysis and forecasting helps the library
authority to determine the optimum levels of
library stocks.
 Management of library resources and network
traffic required different.
 Formulating statistical modeling of library
services.
20-05-2021 18
Data Mining and Its Application in
LIS/Dr. Santosh Kumar Tunga
 Application of statistical techniques in citation
studies, bibliometric coupling, correlation and other
areas of informetrics and scientometrics helps in data
mining.
 Methods for knowledge organization in the web are
based on theories, principles and practices that
librarians have long formulated, understood and
applied e.g. classification schemes, subject headings,
authority files and thesauri etc.
20-05-2021 19
Data Mining and Its Application in
LIS/Dr. Santosh Kumar Tunga
 Web mining is mining of data related to the world wide
web or data related to web pages / web activity
 Web mining is the application of DM techniques to
discover patterns from the web
20-05-2021 20
Data Mining and Its Application in
LIS/Dr. Santosh Kumar Tunga
Web data can be classifies into the following classes:-
 Content of actual web pages
 Intra-page structure is actual linkage structure between web.
 Inter-page structure is actual linkage structure between web.
 Usage data that describe how web pages are accessed by
visitors.
 User profiles include demographic and registration
information obtained about users.
20-05-2021 21
Data Mining and Its Application in
LIS/Dr. Santosh Kumar Tunga
The following are the classification of Web Mining:-
 Web content mining
 Web page content mining
 Search result mining
 Web structure mining
 Web usage mining
 General access pattern tracking
 Customized usage tracking
20-05-2021 22
Data Mining and Its Application in
LIS/Dr. Santosh Kumar Tunga
Data Mining is an effective set of analysis tools and
techniques used in the decision support process.
 Data Mining is not a ‘black box’ process in which the
data miner simply builds a data mining models and
watches as meaningful information appears.
 Data Mining does not guarantee the behavior of future
data through the analysis of historical data.
20-05-2021 23
Data Mining and Its Application in
LIS/Dr. Santosh Kumar Tunga
 Data Mining is a guidance tool, used to provide
insight into the trends inherent in historical
information.
 Data Mining is the process of analyzing data from
different perspective and summarizing it into useful
information so that it can be used to increase revenue,
cuts costs or both.
20-05-2021 24
Data Mining and Its Application in
LIS/Dr. Santosh Kumar Tunga
 Appletion, Elaine. (1995). The Right Server
for your Data Warehouse. Datamation,
41(5), 75-84.
 Brachman, R.J. [etc.al]. (1996). Mining
Business Database Communications of the
ACM. 39(11), 32.
 Dunhan, Margaret H. (2004). Data Mining
Introductory and Advanced Topics. New
Delhi: Pearson Education.
20-05-2021 25
Data Mining and Its Application in
LIS/Dr. Santosh Kumar Tunga
 Han, J. & Kamber, M. (2000). Data Mining :
Concepts and Techniques. New Delhi:
Morgan Kaufman. 115.
 Jean-Mare, Adamo. (2000). Data Mining for
Association Rules and Sequential Patterns,
New York: Springer-Verlag. 56.
20-05-2021 26
Data Mining and Its Application in
LIS/Dr. Santosh Kumar Tunga

More Related Content

What's hot

Chain indexing
Chain indexingChain indexing
Chain indexingsilambu111
 
Retroconversion of documents
Retroconversion of documentsRetroconversion of documents
Retroconversion of documentskmusthu1
 
Inis ppt
Inis pptInis ppt
Inis ppt
sayalijawle21
 
Z39.50: Information Retrieval protocol ppt
Z39.50: Information Retrieval protocol pptZ39.50: Information Retrieval protocol ppt
Z39.50: Information Retrieval protocol ppt
SUNILKUMARSINGH
 
Unisist ppt
Unisist pptUnisist ppt
Unisist ppt
Akshay Kumar
 
Digital Library Software
Digital Library SoftwareDigital Library Software
Informatio retrival evaluation
Informatio retrival evaluationInformatio retrival evaluation
Informatio retrival evaluation
NidhirBiswas
 
Sendoc
SendocSendoc
Sendoc
Akshay Kumar
 
Digital Reference Service in Library
Digital Reference Service in LibraryDigital Reference Service in Library
Digital Reference Service in Library
Pallavi Belkar
 
LIBRARY CONSORTIA
LIBRARY CONSORTIALIBRARY CONSORTIA
LIBRARY CONSORTIA
kavusadu
 
NISCAIR by Jaya Singh
NISCAIR by Jaya SinghNISCAIR by Jaya Singh
NISCAIR by Jaya Singh
AMAN KUMAR KUSHWAHA
 
CAS & SDI service
CAS & SDI serviceCAS & SDI service
Comparative and International Librarianship
Comparative and International LibrarianshipComparative and International Librarianship
Comparative and International Librarianship
Md. Harun Ar Rashid
 
Library Automation sofrwere
Library Automation sofrwereLibrary Automation sofrwere
Library Automation sofrwere
Deepak Malviya
 
Library Consortia
Library Consortia Library Consortia
Library Consortia
Dr. Sushma N Jogan
 
Information Consolidation
Information ConsolidationInformation Consolidation
Information Consolidation
Kishor Sakariya
 
Use and user study
Use and user study Use and user study
Use and user study
Shubhada Nagarkar
 

What's hot (20)

Chain indexing
Chain indexingChain indexing
Chain indexing
 
POPSI
POPSIPOPSI
POPSI
 
INSPEC
INSPECINSPEC
INSPEC
 
Retroconversion of documents
Retroconversion of documentsRetroconversion of documents
Retroconversion of documents
 
Inis ppt
Inis pptInis ppt
Inis ppt
 
Z39.50: Information Retrieval protocol ppt
Z39.50: Information Retrieval protocol pptZ39.50: Information Retrieval protocol ppt
Z39.50: Information Retrieval protocol ppt
 
Unisist ppt
Unisist pptUnisist ppt
Unisist ppt
 
Digital Library Software
Digital Library SoftwareDigital Library Software
Digital Library Software
 
Informatio retrival evaluation
Informatio retrival evaluationInformatio retrival evaluation
Informatio retrival evaluation
 
Sendoc
SendocSendoc
Sendoc
 
Digital Reference Service in Library
Digital Reference Service in LibraryDigital Reference Service in Library
Digital Reference Service in Library
 
LIBRARY CONSORTIA
LIBRARY CONSORTIALIBRARY CONSORTIA
LIBRARY CONSORTIA
 
UNISIST
UNISISTUNISIST
UNISIST
 
NISCAIR by Jaya Singh
NISCAIR by Jaya SinghNISCAIR by Jaya Singh
NISCAIR by Jaya Singh
 
CAS & SDI service
CAS & SDI serviceCAS & SDI service
CAS & SDI service
 
Comparative and International Librarianship
Comparative and International LibrarianshipComparative and International Librarianship
Comparative and International Librarianship
 
Library Automation sofrwere
Library Automation sofrwereLibrary Automation sofrwere
Library Automation sofrwere
 
Library Consortia
Library Consortia Library Consortia
Library Consortia
 
Information Consolidation
Information ConsolidationInformation Consolidation
Information Consolidation
 
Use and user study
Use and user study Use and user study
Use and user study
 

Similar to Data Mining and Its Application in Library and Information Science

5. data mining tools and techniques a review--31-39
5. data mining tools and techniques  a review--31-395. data mining tools and techniques  a review--31-39
5. data mining tools and techniques a review--31-39Alexander Decker
 
11.0005www.iiste.org call for paper. data mining tools and techniques- a revi...
11.0005www.iiste.org call for paper. data mining tools and techniques- a revi...11.0005www.iiste.org call for paper. data mining tools and techniques- a revi...
11.0005www.iiste.org call for paper. data mining tools and techniques- a revi...Alexander Decker
 
Business Intelligence and Analytics Unit-2 part-A .pptx
Business Intelligence and Analytics Unit-2 part-A .pptxBusiness Intelligence and Analytics Unit-2 part-A .pptx
Business Intelligence and Analytics Unit-2 part-A .pptx
RupaRani28
 
CHATBOT FOR COLLEGE RELATED QUERIES | J4RV4I1008
CHATBOT FOR COLLEGE RELATED QUERIES | J4RV4I1008CHATBOT FOR COLLEGE RELATED QUERIES | J4RV4I1008
CHATBOT FOR COLLEGE RELATED QUERIES | J4RV4I1008
Journal For Research
 
Data Mining – A Perspective Approach
Data Mining – A Perspective ApproachData Mining – A Perspective Approach
Data Mining – A Perspective Approach
IRJET Journal
 
9. Data Warehousing & Mining.pptx
9. Data Warehousing & Mining.pptx9. Data Warehousing & Mining.pptx
9. Data Warehousing & Mining.pptx
CallplanetsDeveloper
 
Business Analytics 1 Module 1.pdf
Business Analytics 1 Module 1.pdfBusiness Analytics 1 Module 1.pdf
Business Analytics 1 Module 1.pdf
Jayanti Pande
 
11.challenging issues of spatio temporal data mining
11.challenging issues of spatio temporal data mining11.challenging issues of spatio temporal data mining
11.challenging issues of spatio temporal data mining
Alexander Decker
 
A Study Web Data Mining Challenges And Application For Information Extraction
A Study  Web Data Mining Challenges And Application For Information ExtractionA Study  Web Data Mining Challenges And Application For Information Extraction
A Study Web Data Mining Challenges And Application For Information Extraction
Scott Bou
 
Data mining
Data miningData mining
Data mining
SATISH KUMAR
 
Association rule visualization technique
Association rule visualization techniqueAssociation rule visualization technique
Association rule visualization technique
mustafasmart
 
A Survey on Data Mining
A Survey on Data MiningA Survey on Data Mining
A Survey on Data Mining
IOSR Journals
 
The Big Data Importance – Tools and their Usage
The Big Data Importance – Tools and their UsageThe Big Data Importance – Tools and their Usage
The Big Data Importance – Tools and their Usage
IRJET Journal
 
Unlock Your Data for ML & AI using Data Virtualization
Unlock Your Data for ML & AI using Data VirtualizationUnlock Your Data for ML & AI using Data Virtualization
Unlock Your Data for ML & AI using Data Virtualization
Denodo
 
Big data and oracle
Big data and oracleBig data and oracle
Big data and oracle
Sourabh Saxena
 
GROUP PROJECT REPORT_FY6055_FX7378
GROUP PROJECT REPORT_FY6055_FX7378GROUP PROJECT REPORT_FY6055_FX7378
GROUP PROJECT REPORT_FY6055_FX7378Parag Kapile
 
What Is Data Mining How It Works, Benefits, Techniques.pdf
What Is Data Mining How It Works, Benefits, Techniques.pdfWhat Is Data Mining How It Works, Benefits, Techniques.pdf
What Is Data Mining How It Works, Benefits, Techniques.pdf
Agile dock
 
MC0088 Internal Assignment (SMU)
MC0088 Internal Assignment (SMU)MC0088 Internal Assignment (SMU)
MC0088 Internal Assignment (SMU)
Krishan Pareek
 
BI LECTURE 3- 2023.pptx
BI LECTURE 3- 2023.pptxBI LECTURE 3- 2023.pptx
BI LECTURE 3- 2023.pptx
AmanyaLaban
 
BVRM 402 IMS UNIT V
BVRM 402 IMS UNIT VBVRM 402 IMS UNIT V
BVRM 402 IMS UNIT V
DrNilimaThakur
 

Similar to Data Mining and Its Application in Library and Information Science (20)

5. data mining tools and techniques a review--31-39
5. data mining tools and techniques  a review--31-395. data mining tools and techniques  a review--31-39
5. data mining tools and techniques a review--31-39
 
11.0005www.iiste.org call for paper. data mining tools and techniques- a revi...
11.0005www.iiste.org call for paper. data mining tools and techniques- a revi...11.0005www.iiste.org call for paper. data mining tools and techniques- a revi...
11.0005www.iiste.org call for paper. data mining tools and techniques- a revi...
 
Business Intelligence and Analytics Unit-2 part-A .pptx
Business Intelligence and Analytics Unit-2 part-A .pptxBusiness Intelligence and Analytics Unit-2 part-A .pptx
Business Intelligence and Analytics Unit-2 part-A .pptx
 
CHATBOT FOR COLLEGE RELATED QUERIES | J4RV4I1008
CHATBOT FOR COLLEGE RELATED QUERIES | J4RV4I1008CHATBOT FOR COLLEGE RELATED QUERIES | J4RV4I1008
CHATBOT FOR COLLEGE RELATED QUERIES | J4RV4I1008
 
Data Mining – A Perspective Approach
Data Mining – A Perspective ApproachData Mining – A Perspective Approach
Data Mining – A Perspective Approach
 
9. Data Warehousing & Mining.pptx
9. Data Warehousing & Mining.pptx9. Data Warehousing & Mining.pptx
9. Data Warehousing & Mining.pptx
 
Business Analytics 1 Module 1.pdf
Business Analytics 1 Module 1.pdfBusiness Analytics 1 Module 1.pdf
Business Analytics 1 Module 1.pdf
 
11.challenging issues of spatio temporal data mining
11.challenging issues of spatio temporal data mining11.challenging issues of spatio temporal data mining
11.challenging issues of spatio temporal data mining
 
A Study Web Data Mining Challenges And Application For Information Extraction
A Study  Web Data Mining Challenges And Application For Information ExtractionA Study  Web Data Mining Challenges And Application For Information Extraction
A Study Web Data Mining Challenges And Application For Information Extraction
 
Data mining
Data miningData mining
Data mining
 
Association rule visualization technique
Association rule visualization techniqueAssociation rule visualization technique
Association rule visualization technique
 
A Survey on Data Mining
A Survey on Data MiningA Survey on Data Mining
A Survey on Data Mining
 
The Big Data Importance – Tools and their Usage
The Big Data Importance – Tools and their UsageThe Big Data Importance – Tools and their Usage
The Big Data Importance – Tools and their Usage
 
Unlock Your Data for ML & AI using Data Virtualization
Unlock Your Data for ML & AI using Data VirtualizationUnlock Your Data for ML & AI using Data Virtualization
Unlock Your Data for ML & AI using Data Virtualization
 
Big data and oracle
Big data and oracleBig data and oracle
Big data and oracle
 
GROUP PROJECT REPORT_FY6055_FX7378
GROUP PROJECT REPORT_FY6055_FX7378GROUP PROJECT REPORT_FY6055_FX7378
GROUP PROJECT REPORT_FY6055_FX7378
 
What Is Data Mining How It Works, Benefits, Techniques.pdf
What Is Data Mining How It Works, Benefits, Techniques.pdfWhat Is Data Mining How It Works, Benefits, Techniques.pdf
What Is Data Mining How It Works, Benefits, Techniques.pdf
 
MC0088 Internal Assignment (SMU)
MC0088 Internal Assignment (SMU)MC0088 Internal Assignment (SMU)
MC0088 Internal Assignment (SMU)
 
BI LECTURE 3- 2023.pptx
BI LECTURE 3- 2023.pptxBI LECTURE 3- 2023.pptx
BI LECTURE 3- 2023.pptx
 
BVRM 402 IMS UNIT V
BVRM 402 IMS UNIT VBVRM 402 IMS UNIT V
BVRM 402 IMS UNIT V
 

More from Rishi Bankim Chandra Evening College, Naihati, North 24 Parganas, West Bengal, India

Use of E-resources in Digital Environment
Use of E-resources in Digital EnvironmentUse of E-resources in Digital Environment
MDC/IDC on LIS: Teaching Role of the College Librarians
MDC/IDC on LIS: Teaching Role of the College LibrariansMDC/IDC on LIS: Teaching Role of the College Librarians
MDC/IDC on LIS: Teaching Role of the College Librarians
Rishi Bankim Chandra Evening College, Naihati, North 24 Parganas, West Bengal, India
 
NAAC Assessment and Accreditation Process in Affiliated Colleges
NAAC Assessment and Accreditation Process in Affiliated CollegesNAAC Assessment and Accreditation Process in Affiliated Colleges
NAAC Assessment and Accreditation Process in Affiliated Colleges
Rishi Bankim Chandra Evening College, Naihati, North 24 Parganas, West Bengal, India
 
NAAC Related Quality Enhancement Techniques for Academic Libraries
NAAC Related Quality Enhancement Techniques for Academic LibrariesNAAC Related Quality Enhancement Techniques for Academic Libraries
NAAC Related Quality Enhancement Techniques for Academic Libraries
Rishi Bankim Chandra Evening College, Naihati, North 24 Parganas, West Bengal, India
 
Tools & Techniques for Enhancing Academic and Research Profile
Tools & Techniques for Enhancing  Academic and Research ProfileTools & Techniques for Enhancing  Academic and Research Profile
Tools & Techniques for Enhancing Academic and Research Profile
Rishi Bankim Chandra Evening College, Naihati, North 24 Parganas, West Bengal, India
 
Strategies to Enhance Research and Academic Visibility
Strategies to Enhance Research and Academic VisibilityStrategies to Enhance Research and Academic Visibility
Strategies to Enhance Research and Academic Visibility
Rishi Bankim Chandra Evening College, Naihati, North 24 Parganas, West Bengal, India
 
Development of Academic & Research Identity
Development of Academic & Research IdentityDevelopment of Academic & Research Identity
How to Create Academic & Research Identity.ppt
How to Create Academic & Research Identity.pptHow to Create Academic & Research Identity.ppt
Statistical Methods in Library and Information Science: A Fundamental Approach
Statistical Methods in Library and Information Science: A Fundamental ApproachStatistical Methods in Library and Information Science: A Fundamental Approach
Statistical Methods in Library and Information Science: A Fundamental Approach
Rishi Bankim Chandra Evening College, Naihati, North 24 Parganas, West Bengal, India
 
Weeding as a part of Collection Development: A Guide for Libraries
Weeding as a part of Collection Development: A Guide for Libraries Weeding as a part of Collection Development: A Guide for Libraries
Weeding as a part of Collection Development: A Guide for Libraries
Rishi Bankim Chandra Evening College, Naihati, North 24 Parganas, West Bengal, India
 

More from Rishi Bankim Chandra Evening College, Naihati, North 24 Parganas, West Bengal, India (10)

Use of E-resources in Digital Environment
Use of E-resources in Digital EnvironmentUse of E-resources in Digital Environment
Use of E-resources in Digital Environment
 
MDC/IDC on LIS: Teaching Role of the College Librarians
MDC/IDC on LIS: Teaching Role of the College LibrariansMDC/IDC on LIS: Teaching Role of the College Librarians
MDC/IDC on LIS: Teaching Role of the College Librarians
 
NAAC Assessment and Accreditation Process in Affiliated Colleges
NAAC Assessment and Accreditation Process in Affiliated CollegesNAAC Assessment and Accreditation Process in Affiliated Colleges
NAAC Assessment and Accreditation Process in Affiliated Colleges
 
NAAC Related Quality Enhancement Techniques for Academic Libraries
NAAC Related Quality Enhancement Techniques for Academic LibrariesNAAC Related Quality Enhancement Techniques for Academic Libraries
NAAC Related Quality Enhancement Techniques for Academic Libraries
 
Tools & Techniques for Enhancing Academic and Research Profile
Tools & Techniques for Enhancing  Academic and Research ProfileTools & Techniques for Enhancing  Academic and Research Profile
Tools & Techniques for Enhancing Academic and Research Profile
 
Strategies to Enhance Research and Academic Visibility
Strategies to Enhance Research and Academic VisibilityStrategies to Enhance Research and Academic Visibility
Strategies to Enhance Research and Academic Visibility
 
Development of Academic & Research Identity
Development of Academic & Research IdentityDevelopment of Academic & Research Identity
Development of Academic & Research Identity
 
How to Create Academic & Research Identity.ppt
How to Create Academic & Research Identity.pptHow to Create Academic & Research Identity.ppt
How to Create Academic & Research Identity.ppt
 
Statistical Methods in Library and Information Science: A Fundamental Approach
Statistical Methods in Library and Information Science: A Fundamental ApproachStatistical Methods in Library and Information Science: A Fundamental Approach
Statistical Methods in Library and Information Science: A Fundamental Approach
 
Weeding as a part of Collection Development: A Guide for Libraries
Weeding as a part of Collection Development: A Guide for Libraries Weeding as a part of Collection Development: A Guide for Libraries
Weeding as a part of Collection Development: A Guide for Libraries
 

Recently uploaded

Supporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptxSupporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptx
Jisc
 
Thesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.pptThesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.ppt
EverAndrsGuerraGuerr
 
June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...
June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...
June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...
Levi Shapiro
 
The Accursed House by Émile Gaboriau.pptx
The Accursed House by Émile Gaboriau.pptxThe Accursed House by Émile Gaboriau.pptx
The Accursed House by Émile Gaboriau.pptx
DhatriParmar
 
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdf
Welcome to TechSoup   New Member Orientation and Q&A (May 2024).pdfWelcome to TechSoup   New Member Orientation and Q&A (May 2024).pdf
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdf
TechSoup
 
Embracing GenAI - A Strategic Imperative
Embracing GenAI - A Strategic ImperativeEmbracing GenAI - A Strategic Imperative
Embracing GenAI - A Strategic Imperative
Peter Windle
 
Instructions for Submissions thorugh G- Classroom.pptx
Instructions for Submissions thorugh G- Classroom.pptxInstructions for Submissions thorugh G- Classroom.pptx
Instructions for Submissions thorugh G- Classroom.pptx
Jheel Barad
 
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXXPhrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
MIRIAMSALINAS13
 
Sha'Carri Richardson Presentation 202345
Sha'Carri Richardson Presentation 202345Sha'Carri Richardson Presentation 202345
Sha'Carri Richardson Presentation 202345
beazzy04
 
CACJapan - GROUP Presentation 1- Wk 4.pdf
CACJapan - GROUP Presentation 1- Wk 4.pdfCACJapan - GROUP Presentation 1- Wk 4.pdf
CACJapan - GROUP Presentation 1- Wk 4.pdf
camakaiclarkmusic
 
1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx
JosvitaDsouza2
 
A Strategic Approach: GenAI in Education
A Strategic Approach: GenAI in EducationA Strategic Approach: GenAI in Education
A Strategic Approach: GenAI in Education
Peter Windle
 
Francesca Gottschalk - How can education support child empowerment.pptx
Francesca Gottschalk - How can education support child empowerment.pptxFrancesca Gottschalk - How can education support child empowerment.pptx
Francesca Gottschalk - How can education support child empowerment.pptx
EduSkills OECD
 
Language Across the Curriculm LAC B.Ed.
Language Across the  Curriculm LAC B.Ed.Language Across the  Curriculm LAC B.Ed.
Language Across the Curriculm LAC B.Ed.
Atul Kumar Singh
 
Overview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with MechanismOverview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with Mechanism
DeeptiGupta154
 
Introduction to AI for Nonprofits with Tapp Network
Introduction to AI for Nonprofits with Tapp NetworkIntroduction to AI for Nonprofits with Tapp Network
Introduction to AI for Nonprofits with Tapp Network
TechSoup
 
Honest Reviews of Tim Han LMA Course Program.pptx
Honest Reviews of Tim Han LMA Course Program.pptxHonest Reviews of Tim Han LMA Course Program.pptx
Honest Reviews of Tim Han LMA Course Program.pptx
timhan337
 
Polish students' mobility in the Czech Republic
Polish students' mobility in the Czech RepublicPolish students' mobility in the Czech Republic
Polish students' mobility in the Czech Republic
Anna Sz.
 
"Protectable subject matters, Protection in biotechnology, Protection of othe...
"Protectable subject matters, Protection in biotechnology, Protection of othe..."Protectable subject matters, Protection in biotechnology, Protection of othe...
"Protectable subject matters, Protection in biotechnology, Protection of othe...
SACHIN R KONDAGURI
 
Guidance_and_Counselling.pdf B.Ed. 4th Semester
Guidance_and_Counselling.pdf B.Ed. 4th SemesterGuidance_and_Counselling.pdf B.Ed. 4th Semester
Guidance_and_Counselling.pdf B.Ed. 4th Semester
Atul Kumar Singh
 

Recently uploaded (20)

Supporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptxSupporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptx
 
Thesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.pptThesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.ppt
 
June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...
June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...
June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...
 
The Accursed House by Émile Gaboriau.pptx
The Accursed House by Émile Gaboriau.pptxThe Accursed House by Émile Gaboriau.pptx
The Accursed House by Émile Gaboriau.pptx
 
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdf
Welcome to TechSoup   New Member Orientation and Q&A (May 2024).pdfWelcome to TechSoup   New Member Orientation and Q&A (May 2024).pdf
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdf
 
Embracing GenAI - A Strategic Imperative
Embracing GenAI - A Strategic ImperativeEmbracing GenAI - A Strategic Imperative
Embracing GenAI - A Strategic Imperative
 
Instructions for Submissions thorugh G- Classroom.pptx
Instructions for Submissions thorugh G- Classroom.pptxInstructions for Submissions thorugh G- Classroom.pptx
Instructions for Submissions thorugh G- Classroom.pptx
 
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXXPhrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
 
Sha'Carri Richardson Presentation 202345
Sha'Carri Richardson Presentation 202345Sha'Carri Richardson Presentation 202345
Sha'Carri Richardson Presentation 202345
 
CACJapan - GROUP Presentation 1- Wk 4.pdf
CACJapan - GROUP Presentation 1- Wk 4.pdfCACJapan - GROUP Presentation 1- Wk 4.pdf
CACJapan - GROUP Presentation 1- Wk 4.pdf
 
1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx
 
A Strategic Approach: GenAI in Education
A Strategic Approach: GenAI in EducationA Strategic Approach: GenAI in Education
A Strategic Approach: GenAI in Education
 
Francesca Gottschalk - How can education support child empowerment.pptx
Francesca Gottschalk - How can education support child empowerment.pptxFrancesca Gottschalk - How can education support child empowerment.pptx
Francesca Gottschalk - How can education support child empowerment.pptx
 
Language Across the Curriculm LAC B.Ed.
Language Across the  Curriculm LAC B.Ed.Language Across the  Curriculm LAC B.Ed.
Language Across the Curriculm LAC B.Ed.
 
Overview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with MechanismOverview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with Mechanism
 
Introduction to AI for Nonprofits with Tapp Network
Introduction to AI for Nonprofits with Tapp NetworkIntroduction to AI for Nonprofits with Tapp Network
Introduction to AI for Nonprofits with Tapp Network
 
Honest Reviews of Tim Han LMA Course Program.pptx
Honest Reviews of Tim Han LMA Course Program.pptxHonest Reviews of Tim Han LMA Course Program.pptx
Honest Reviews of Tim Han LMA Course Program.pptx
 
Polish students' mobility in the Czech Republic
Polish students' mobility in the Czech RepublicPolish students' mobility in the Czech Republic
Polish students' mobility in the Czech Republic
 
"Protectable subject matters, Protection in biotechnology, Protection of othe...
"Protectable subject matters, Protection in biotechnology, Protection of othe..."Protectable subject matters, Protection in biotechnology, Protection of othe...
"Protectable subject matters, Protection in biotechnology, Protection of othe...
 
Guidance_and_Counselling.pdf B.Ed. 4th Semester
Guidance_and_Counselling.pdf B.Ed. 4th SemesterGuidance_and_Counselling.pdf B.Ed. 4th Semester
Guidance_and_Counselling.pdf B.Ed. 4th Semester
 

Data Mining and Its Application in Library and Information Science

  • 1. DATA MINING AND ITS APPLICATION IN LIBRARYAND INFORMATION SCIENCE Dr. Santosh Kumar Tunga [ORCID: 0000-0001-5534-4861] Librarian, R B C Evening College, Naihati North 24 Parganas, West Bengal, India tungask@rediffmail.com 20-05-2021 1 Data Mining and Its Application in LIS/Dr. Santosh Kumar Tunga
  • 2.  Database Management Systems gave access to the data stored but this was only a small part of what could be gained from the data. Traditional on line transaction processing system are good at putting data into databases quickly, safely and efficiently but are not good at delivering meaningful analysis in return. Extraction of interesting information or patterns from data in large database is known as Data Mining. 20-05-2021 2 Data Mining and Its Application in LIS/Dr. Santosh Kumar Tunga
  • 3.  Data Mining, also called as data archeology, data dredging data harvesting, is the process of extracting hidden knowledge from large volumes of raw data and using it to make crucial library decisions. This is where Data Mining or knowledge Discovery in Databases has obvious benefit for any enterprise 20-05-2021 3 Data Mining and Its Application in LIS/Dr. Santosh Kumar Tunga
  • 4.  DM or Knowledge Discover in Database is concerned with the analysis of data and the use of software techniques for finding patterns and regularities in sets of data.  Extraction of interesting information or patterns from data in large database is known as DM. 20-05-2021 4 Data Mining and Its Application in LIS/Dr. Santosh Kumar Tunga
  • 5.  DM is the process of analyzing large amount of data in search of previously undiscovered information pattern.  DM refers to using a variety of techniques to identify nuggets of information or decision making knowledge in bodies of data and extracting these in such a way that they can be put to use in the areas such as decision support prediction, forecasting and estimation 20-05-2021 5 Data Mining and Its Application in LIS/Dr. Santosh Kumar Tunga
  • 6.  Many Digital Libraries utilize Data Mining technology in order to facilitate advanced future prospective services and also DL users research.  A couple of Digital Libraries use data mining technology to analysis GIS data and a data services unit facilities user’s analysis of social sciences computing data.  Data Mining tools are use to perform text analysis and clustering in large Open Archives Initiative Meta data repositories. 20-05-2021 6 Data Mining and Its Application in LIS/Dr. Santosh Kumar Tunga
  • 7.  Many Digital Libraries (DLs) and Information centers use data mining technology to facilitate their library's own administrative or strategic purposes.  To modernize own existing DLs Data Mining play an important role.  Data Mining is a step within the KDD process through which an organization's data assets are processed and analyzed to gain insights to assist decision making. 20-05-2021 7 Data Mining and Its Application in LIS/Dr. Santosh Kumar Tunga
  • 8. Fig-1 Multi- disciplinary approach in Data Mining 20-05-2021 8 Data Mining and Its Application in LIS/Dr. Santosh Kumar Tunga
  • 9. 5. Interpretation Knowledge 4. Data Mining / Pattern Recognition 3. Transformation 2. Data Enrichment 1. Data selection and cleaning Fig 2. An Overview of the steps that compose the Data Mining Process 20-05-2021 9 Data Mining and Its Application in LIS/Dr. Santosh Kumar Tunga
  • 10.  Data Selection There are two parts to selecting data for data mining. The first part, locating data, tends to be more mechanical in nature than the second part, identifying data, which requires significant input by a domain expert for the data.  Data Cleaning Data cleaning is the process of ensuring that for data mining purposes, the data is uniform in terms of key and attribute usage. Data cleaning attempts to correct misused or incorrect attributes in existing data. 20-05-2021 10 Data Mining and Its Application in LIS/Dr. Santosh Kumar Tunga
  • 11.  Data Enrichment (DE) DE is the process of adding new attributes such as calculated fields or data from external sources to existing data. It involves adding information to existing data. This can include combining internal data with external data, obtained from either different departmental libraries or information centers.  Data Transformation (DT) DT is the process of changing the form or structure of existing attributes. It is separate from data cleaning and data enrichment for data mining purposes because it does not correct existing attribute data or add new attributes but instead grooms existing attributes for data mining purposes. 20-05-2021 11 Data Mining and Its Application in LIS/Dr. Santosh Kumar Tunga
  • 12.  Data Mining (DM) DM is not so much a single technique as the idea that there is more knowledge hidden in the data than shows itself on the surface. From this point of view DM is really an anything goes affair.  Visualization/ Interpretation/ Evaluation Visualization techniques are a very useful method discovering patterns in datasets. Advanced graphical techniques virtual reality enable people to wander through artificial spaces, white historic development of data sets can be displayed as a kind of animated movies. 20-05-2021 12 Data Mining and Its Application in LIS/Dr. Santosh Kumar Tunga
  • 13. Fig-3 Architecture of Data Mining 20-05-2021 13 Data Mining and Its Application in LIS/Dr. Santosh Kumar Tunga
  • 14.  Database, Data Warehouse or other Information Repository: This is one or a set of databases, data warehouses, spreadsheets or other kinds of information repositories. Data cleaning and data integration techniques may be performed on the data.  Database or Data Warehouse Server: The database or data warehouse server is responsible for fetching the relevant data, based on the user's data mining request. 20-05-2021 14 Data Mining and Its Application in LIS/Dr. Santosh Kumar Tunga
  • 15.  Knowledge Base This is domain knowledge that is used to guide the search or evaluate the interestingness of resulting patterns. Such knowledge can include concept hierarchies, used to organize attributes values into different levels of abstraction.  Data Mining Engine This is essential to the data mining system and ideally consists of a set of functional modules for tasks such as characterization, association, classification, cluster analysis and evaluation and deviation analysis. 20-05-2021 15 Data Mining and Its Application in LIS/Dr. Santosh Kumar Tunga
  • 16.  Pattern evaluation module This component typically employs interestingness measures and interacts with the data mining modules so as to focus the search towards interesting patterns.  Graphical user Interface This module allows the user to browse database and data warehouse schemes or structures; evaluate mined patterns and visualize the patterns in different forms. 20-05-2021 16 Data Mining and Its Application in LIS/Dr. Santosh Kumar Tunga
  • 17.  Identify library customers who will be most receptive to new service offers.  Identify the most effective library customers.  Find out the library customers who will opt for each type of service in the following year.  Predict library customer reaction to the change of library services.  Determine customer preference of the different modes of library transaction. 20-05-2021 17 Data Mining and Its Application in LIS/Dr. Santosh Kumar Tunga
  • 18.  Analyzing the most frequently prescribed useful documents.  Demand analysis and forecasting helps the library authority to determine the optimum levels of library stocks.  Management of library resources and network traffic required different.  Formulating statistical modeling of library services. 20-05-2021 18 Data Mining and Its Application in LIS/Dr. Santosh Kumar Tunga
  • 19.  Application of statistical techniques in citation studies, bibliometric coupling, correlation and other areas of informetrics and scientometrics helps in data mining.  Methods for knowledge organization in the web are based on theories, principles and practices that librarians have long formulated, understood and applied e.g. classification schemes, subject headings, authority files and thesauri etc. 20-05-2021 19 Data Mining and Its Application in LIS/Dr. Santosh Kumar Tunga
  • 20.  Web mining is mining of data related to the world wide web or data related to web pages / web activity  Web mining is the application of DM techniques to discover patterns from the web 20-05-2021 20 Data Mining and Its Application in LIS/Dr. Santosh Kumar Tunga
  • 21. Web data can be classifies into the following classes:-  Content of actual web pages  Intra-page structure is actual linkage structure between web.  Inter-page structure is actual linkage structure between web.  Usage data that describe how web pages are accessed by visitors.  User profiles include demographic and registration information obtained about users. 20-05-2021 21 Data Mining and Its Application in LIS/Dr. Santosh Kumar Tunga
  • 22. The following are the classification of Web Mining:-  Web content mining  Web page content mining  Search result mining  Web structure mining  Web usage mining  General access pattern tracking  Customized usage tracking 20-05-2021 22 Data Mining and Its Application in LIS/Dr. Santosh Kumar Tunga
  • 23. Data Mining is an effective set of analysis tools and techniques used in the decision support process.  Data Mining is not a ‘black box’ process in which the data miner simply builds a data mining models and watches as meaningful information appears.  Data Mining does not guarantee the behavior of future data through the analysis of historical data. 20-05-2021 23 Data Mining and Its Application in LIS/Dr. Santosh Kumar Tunga
  • 24.  Data Mining is a guidance tool, used to provide insight into the trends inherent in historical information.  Data Mining is the process of analyzing data from different perspective and summarizing it into useful information so that it can be used to increase revenue, cuts costs or both. 20-05-2021 24 Data Mining and Its Application in LIS/Dr. Santosh Kumar Tunga
  • 25.  Appletion, Elaine. (1995). The Right Server for your Data Warehouse. Datamation, 41(5), 75-84.  Brachman, R.J. [etc.al]. (1996). Mining Business Database Communications of the ACM. 39(11), 32.  Dunhan, Margaret H. (2004). Data Mining Introductory and Advanced Topics. New Delhi: Pearson Education. 20-05-2021 25 Data Mining and Its Application in LIS/Dr. Santosh Kumar Tunga
  • 26.  Han, J. & Kamber, M. (2000). Data Mining : Concepts and Techniques. New Delhi: Morgan Kaufman. 115.  Jean-Mare, Adamo. (2000). Data Mining for Association Rules and Sequential Patterns, New York: Springer-Verlag. 56. 20-05-2021 26 Data Mining and Its Application in LIS/Dr. Santosh Kumar Tunga