Software for Digital Library
By Bhupendra Ratha, Lecturer
School of Library and Information Science
Devi Ahilya University, Indore
Email: bhu261@gmail.com
Software for Digital Library
Creating a digital library requires several kinds of software:
• Digital library software
• OCR
• DOI
• Image editing software
• Other software
– Operating systems
– Database management system software
– Programming/scripting languages
– Firewall and protection software
Digital library software
• DSpace is a digital library system that captures, stores, indexes, preserves
and redistributes the intellectual output of a university’s research
faculty in digital formats. DSpace was developed jointly by
MIT Libraries and Hewlett-Packard (HP) and is freely available
to research institutions worldwide as open source software.
• EPrints is generic archive software developed at the
University of Southampton for building highly
configurable web-based archives. Its primary goal is to serve
as an open archive for research papers, but with changes to its
configuration it can also hold images, research data, audio archives,
or anything else that can be stored digitally.
• Greenstone is a suite of software for building and distributing
digital library collections. It provides a way of organizing
information and publishing it on the Internet or on CD-ROM. It is
available for both Windows and Linux, and it requires Perl
to build collections.
OCR
(Optical Character Recognition)
• OCR is also expanded as Optical Character Reader.
• It scans printed, typewritten or handwritten text
(numerals, letters or symbols) and converts the scanned image
to a computer-processable format, such as plain
text, a Word document or an Excel spreadsheet, which can
be edited, used or reused in other documents.
• “A system that provides full alphanumeric recognition of
printed or handwritten characters at electronic speed by simply
scanning the documents is called OCR.”
Difference: OCR and OMR
OCR
• With OCR technologies, images can be scanned, indexed and written to optical media.
• OCR can recognize all types of information.
• It is very flexible.
OMR (Optical Mark Recognition)
• OMR is a data collection technology.
• OMR cannot recognize hand-printed or machine-printed characters.
• It is not flexible.
Features of OCR
• It is a program that can recognize characters.
• The technology provides a complete form-processing
and document-capture solution.
• OCR is used when recreating a document in
electronic form by hand would take more time.
• The converted text files take less space than the
original image files and can be indexed.
• It bridges the gap between paperless and
paper-based working.
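The point that converted text can be indexed can be illustrated with a small sketch. The Python below builds a simple inverted index (word to page numbers) over OCR output. The sample page texts, the function name build_index and the tokenization rule are illustrative assumptions, not part of any particular OCR product.

```python
import re
from collections import defaultdict


def build_index(pages):
    """Map each lowercase word to the sorted list of page ids that contain it."""
    index = defaultdict(set)
    for page_id, text in pages.items():
        # Assumed tokenization: runs of ASCII letters and digits.
        for word in re.findall(r"[a-z0-9]+", text.lower()):
            index[word].add(page_id)
    return {word: sorted(ids) for word, ids in index.items()}


# Hypothetical OCR output for two scanned pages.
pages = {
    1: "Digital library software stores research papers.",
    2: "OCR converts scanned papers to text.",
}
index = build_index(pages)
print(index["papers"])  # page ids on which the word "papers" occurs
```

A page image cannot be searched this way; only the recognized text can, which is why indexing is listed as a feature of OCR.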
Advantages of OCR
• Cost savings and efficiency gains from not having to keep the paper.
• Scanning and recognition allow efficient management
and planning for the rest of the processing workload.
• Reduced long-term storage requirements: questionnaires
can be destroyed after the initial scanning, recognition
and repair.
• Quick retrieval for editing and reprocessing.
• Fewer errors associated with physical handling of the
documents.
Disadvantages of OCR
• While OCR can be effective in converting
handwritten or typed characters, it is not as
accurate as OMR for reading data where users
mark forms.
• Scanning speed depends on the quality of the
scanner, the size of non-drop-out color areas, and the paper's
quality, cleanness and weight.
• The interpreted value has to be compared with the
real image of the document.
• Processing can be in geographic order or in random order.
Processing of OCR
• A scanner has four components:
– a detector, an illumination source, a scan lens and a
document transport.
• OCR hardware/software performs three operational
steps:
– document analysis, character recognition and contextual
processing.
• Output interface
– Allows character recognition results to be transferred electronically.
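The third operational step, contextual processing, can be sketched as follows. This is a minimal illustration that assumes a small known vocabulary and uses Python's standard difflib module to replace a misrecognized token with its closest vocabulary word. The function name contextual_correct, the vocabulary and the cutoff value are assumptions for the example; real OCR engines use richer language models.

```python
import difflib


def contextual_correct(tokens, vocabulary, cutoff=0.75):
    """Replace tokens missing from the vocabulary with their closest match, if any."""
    corrected = []
    for token in tokens:
        if token in vocabulary:
            corrected.append(token)
            continue
        # get_close_matches returns the best candidates whose similarity >= cutoff.
        matches = difflib.get_close_matches(token, vocabulary, n=1, cutoff=cutoff)
        corrected.append(matches[0] if matches else token)
    return corrected


# Hypothetical vocabulary and raw recognizer output ("1" read for "i", "4" for "a").
vocabulary = ["digital", "library", "software"]
raw_tokens = ["d1gital", "libr4ry", "software"]
print(contextual_correct(raw_tokens, vocabulary))
```

Tokens with no sufficiently close vocabulary entry are left unchanged, so the step does not overwrite names or numbers it does not know.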
Types of OCRs
• Two types of OCR readers
– Task-specific readers
– General-purpose readers
• Task-specific readers
– Read only specific documents, e.g. bank cheques.
• General-purpose page readers
– High-end OCR (usually for offices): speed and accuracy are important; format preservation; good proofreading solutions
– Low-end OCR (usually for home use): speed is not required; proofreading is done manually
Factors affecting OCR quality
• Scanner quality
• Scan resolution
• Type of printed document (laser printer output
or photocopy)
• Paper quality
• Fonts used in the text
• Linguistic complexities
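To make the scan-resolution factor concrete, the following sketch computes the pixel dimensions and uncompressed size of a scanned page at a given resolution. The A4 page size (210 × 297 mm), the 24-bit color depth and the function names are assumptions chosen for illustration.

```python
MM_PER_INCH = 25.4


def scan_pixels(width_mm, height_mm, dpi):
    """Pixel dimensions of a page scanned at the given dots per inch."""
    width_px = round(width_mm / MM_PER_INCH * dpi)
    height_px = round(height_mm / MM_PER_INCH * dpi)
    return width_px, height_px


def uncompressed_bytes(width_px, height_px, bits_per_pixel=24):
    """Raw image size before any compression (assumes 24-bit color by default)."""
    return width_px * height_px * bits_per_pixel // 8


# Assumed example: an A4 page at two common resolutions.
for dpi in (300, 600):
    w, h = scan_pixels(210, 297, dpi)
    size_mib = uncompressed_bytes(w, h) / 2**20
    print(f"{dpi} dpi: {w} x {h} pixels, about {size_mib:.1f} MiB uncompressed")
```

Doubling the resolution quadruples the pixel count, so a higher setting gives the recognizer more detail per character at the cost of storage and scanning time.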
Evaluating OCRs
• Neat interface
• Easy-to-use wizards
• Accurate recognition
• Scan resolution setting (600 dpi is advisable)
• Time taken from scanning to delivery of the final product
• Enhanced usability of the product
• Ability to modify the scan setting
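One way to put a number on "accurate recognition" when comparing OCR products is character accuracy: one minus the edit distance between the OCR output and a hand-checked reference, divided by the reference length. The sketch below is one common way to compute it, not a method prescribed by these slides; the function names and sample strings are assumptions.

```python
def edit_distance(a, b):
    """Levenshtein distance between strings a and b."""
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(previous[j] + 1,          # delete ch_a
                               current[j - 1] + 1,       # insert ch_b
                               previous[j - 1] + cost))  # substitute or match
        previous = current
    return previous[-1]


def character_accuracy(reference, ocr_output):
    """Share of reference characters recognized correctly, clamped to [0, 1]."""
    if not reference:
        return 1.0 if not ocr_output else 0.0
    errors = edit_distance(reference, ocr_output)
    return max(0.0, 1.0 - errors / len(reference))


# Hypothetical ground truth and OCR output for one line of text.
reference = "digital library"
ocr_output = "d1gital libr4ry"
print(f"{character_accuracy(reference, ocr_output):.3f}")
```

Running the same reference pages through each candidate product gives comparable figures for the evaluation.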
DOI
(Digital Object Identifier)
• An open standard for creating an alphanumeric
name that identifies digital content, mostly
scholarly content such as an e-book or journal
article.
• A DOI is a unique ID for a document; it
is paired with the object’s electronic address,
or URL (updatable), along with other metadata.
Basic aspects of DOI
“The DOI is like the Bar Code, but for objects on
the Internet.”
Two aspects:
1. Uniquely identifies the object.
2. Provides linking to the object itself (or to any related
objects, transactions or services). These links are:
– Permanent
– Dynamically maintainable
– Capable of one-to-many routing
– Capable of supporting new applications over time
Features of DOI
• Applies to any type or format of object
– text, music, film, video, photographs, software, database record.
• Applies at any level of specificity
– whole book/individual chapters, music collection/individual
tracks, software programs/individual routines,
products/components…
• Compatible with every other numbering scheme (UPC, ISBN,
ISSN)
• Permanent (Once assigned, never changes. “A DOI is Forever.”)
• It helps to protect copyright.
• A central directory provides a level of indirection between the
ID and its locations or services
DOI number format
• 10.1065/abc123defg = the whole DOI
• 10.1065 = publisher prefix
• abc123defg = suffix (Handle suffix)
– item identifier
– any format
– naming authority (publisher)
• In use, a DOI is an opaque string (a “dumb
number”, which is a good thing)
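The prefix/suffix structure above can be shown with a short sketch that splits a DOI at its first slash. The function name split_doi is an assumption for illustration. The check that the prefix begins with "10." reflects how DOI prefixes are issued, and the suffix is treated as an opaque string that may itself contain slashes.

```python
def split_doi(doi):
    """Split a DOI into (prefix, suffix) at the first slash."""
    prefix, sep, suffix = doi.partition("/")
    if not sep or not prefix.startswith("10.") or not suffix:
        raise ValueError(f"not a DOI of the form 10.<registrant>/<suffix>: {doi!r}")
    return prefix, suffix


prefix, suffix = split_doi("10.1065/abc123defg")
print(prefix)  # publisher prefix
print(suffix)  # item identifier chosen by the publisher
```

Software should only store and compare the suffix, never interpret it, which is what "opaque string" means in practice.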
IDF
• International DOI Foundation (IDF) established October 1997
• Offices in Washington & Geneva
• Non-profit: supported primarily by membership fees
• Develops policies and governance procedures (“policy
infrastructure”)
• Liaises with standards organizations internationally
• Manages the relationship with CNRI (as technology provider) via
service contract
Working Process of DOI
• Participants: user PC (browser), DOI directory server (DOI = where to go next), publisher/gateway (object information)
1. The browser on the user PC sends a DOI query to the DOI directory server.
2. The directory server forwards the query to the publisher.
3. The user PC receives the object information.
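The resolution flow can be sketched as a toy in-memory directory. This is only an illustration of the indirection the directory provides: the class name DoiDirectory and the example URLs are hypothetical, and the real DOI system resolves names over the network rather than from a local dictionary.

```python
class DoiDirectory:
    """Toy stand-in for the central DOI directory: DOI -> current URL."""

    def __init__(self):
        self._locations = {}

    def register(self, doi, url):
        """Create or update the location recorded for a DOI."""
        self._locations[doi] = url

    def resolve(self, doi):
        """Return where to go next for this DOI (KeyError if unknown)."""
        return self._locations[doi]


directory = DoiDirectory()
doi = "10.1065/abc123defg"

# The publisher registers the object's current address (hypothetical URL).
directory.register(doi, "https://publisher.example/articles/abc123defg")
first = directory.resolve(doi)

# The content moves; only the directory entry changes, the DOI stays the same.
directory.register(doi, "https://new-host.example/content/abc123defg")
second = directory.resolve(doi)

print(first)
print(second)
```

Because citations carry the DOI rather than the URL, links keep working after the move, which is the sense in which DOI links are permanent yet dynamically maintainable.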
Image Editing Software
• Photoshop
• CorelDRAW
• Paintbrush
Other Software for DL
• Operating systems:
– MS Word, Excel, PowerPoint, etc.
• Database management system software
– Oracle/SQL, MS Access, FoxPro, etc.
• Programming/scripting languages
– HTML, Java, .NET, VB, C++, etc.
• Firewall and protection software
– Firewall, anti-virus, bioinformatics programs, etc.
