SlideShare a Scribd company logo
1 of 20
Research support with optical
character recognition apps
Jim Hahn
Text-shot prototype
2
Introduction
• Uses for OCR in library settings
– The prototype Text-shot module uses OCR software
and a backend search system for subject and title
recommendations.
– The choice to recommend library content to users
from the app stems from the objective to connect
students with library resources, and to help students
integrate library resources into their work.
3
Optical Character Recognition Apps
• Wordlens app: can translate words from
different languages using a digital camera feed
• Google Goggles app: take a picture of a book
cover (or painting)to run a google search on
the topic
• Camscanner app: digitize print documents
with camera on app and store/share
documents with others
4
Literature Review
• Optical Character Recognition APIs
– Evernote API: dev.evernote.com/doc
– Google Drive API: support.google.com/drive
– VuForia SDK:
developer.vuforia.com/resources/sdk/android
5
Methodology
• Formative evaluation
– Small set of test participants to gather feedback
early in the design phase so that the software
development process can progress in a direction
that will support user requirements for the
software
6
Methodology
• Test Participants
– Students were recruited from the General Studies
101 course. They are in their first year of study at
the university and have not yet chosen a major.
– There were a total of five test participants in the
first round of study.
7
Methodology
• Study Process
– Students were given an Android phone with the
Text-shot app loaded. Investigators observed the
students as they used the OCR mobile software to
obtain suggested library resources. Investigators
collected two sources of data: observation of how
students interact with the software and a
debriefing interview.
8
Functionality Tests
• Researchers tested the two main functions for
the software.
– Recognizing a string of text by taking a picture of
the words in a student assignment sheet and;
– suggesting subjects and titles based on the
scanned text.
9
Results
• Themes related to the improvement of
suggestions:
– Show broad subjects first
• Then expand to details subjects
– Prominently display title suggestions
10
Results
• Feature Requests:
– Include articles as well as book titles in
recommendations
• Use article APIs
• LibGuides-like help guides
11
Text-shot prototype
12
Next steps in OCR
• Topic Space app: Scanning call numbers in the
library
– If you scan a call number on a book, you can get
recommendations of other, related books in the
library, and other related digital content in the
library.
13
Topic Space: Book Scan
14
Topic Space: Suggested Topic Spaces
15
Topic Space:
Related Books that are not available
16
Topic Space: View Map
17
Future directions
• Implementing OCR modules in the Minrva
app:
– http://minrvaproject.org/modules_topicspace.ph
p
• Open sourcing OCR technology for use in
library settings:
– http://minrvaproject.org/source.php
18
Sponsors
• Institute of Museum and Library Services
• University of Illinois Campus Research Board
19
Acknowledgements
• My thanks to Ben Ryckman for Topic Space module development and support.
• Many thanks to Chris Diaz, Residency Librarian, Scholarly Communications and
Collections, University of Iowa for help with participant recruitment, observation,
and interviewing support in the user studies
• Thanks to Mayur Sadavarte, Graduate Student in Computer Science at the
University of Illinois and Nate Ryckman, Graduate Student in Information Systems
Management at Carnegie Mellon University for Optical Character recognition
programming support.
• Yinan Zhang, PhD Candidate in Computer Science at the University of Illinois,
Sherry (Mengxue) Zheng, Graduate Student in Computer Science for help
developing the search and suggestion functionality of the Deneb near-semantic
index, Maria Lux, Graphic Designer for laying out the polished recommendations
and prototyping Text-shot integration as a Minrva module.
20

More Related Content

Viewers also liked

Bengali optical character recognition system
Bengali optical character recognition systemBengali optical character recognition system
Bengali optical character recognition system
Md. Mahedi Mahfuj
 
optical character recognition system
optical character recognition systemoptical character recognition system
optical character recognition system
Vijay Apurva
 
An Online Game to Correct Inaccurate Optical Character Recognition (OCR) in B...
An Online Game to Correct Inaccurate Optical Character Recognition (OCR) in B...An Online Game to Correct Inaccurate Optical Character Recognition (OCR) in B...
An Online Game to Correct Inaccurate Optical Character Recognition (OCR) in B...
MartySchlabach
 

Viewers also liked (13)

Optical character recognition (ocr) ppt
Optical character recognition (ocr) pptOptical character recognition (ocr) ppt
Optical character recognition (ocr) ppt
 
Bengali optical character recognition system
Bengali optical character recognition systemBengali optical character recognition system
Bengali optical character recognition system
 
OCR
OCROCR
OCR
 
Optical character recognition IEEE Paper Study
Optical character recognition IEEE Paper StudyOptical character recognition IEEE Paper Study
Optical character recognition IEEE Paper Study
 
Optical Character Recognition (OCR)
Optical Character Recognition (OCR)Optical Character Recognition (OCR)
Optical Character Recognition (OCR)
 
Basics of-optical-character-recognition
Basics of-optical-character-recognitionBasics of-optical-character-recognition
Basics of-optical-character-recognition
 
optical character recognition system
optical character recognition systemoptical character recognition system
optical character recognition system
 
Project report of OCR Recognition
Project report of OCR RecognitionProject report of OCR Recognition
Project report of OCR Recognition
 
Optical Character Recognition( OCR )
Optical Character Recognition( OCR )Optical Character Recognition( OCR )
Optical Character Recognition( OCR )
 
Number plate recognition system using matlab.
Number plate recognition system using matlab.Number plate recognition system using matlab.
Number plate recognition system using matlab.
 
An Online Game to Correct Inaccurate Optical Character Recognition (OCR) in B...
An Online Game to Correct Inaccurate Optical Character Recognition (OCR) in B...An Online Game to Correct Inaccurate Optical Character Recognition (OCR) in B...
An Online Game to Correct Inaccurate Optical Character Recognition (OCR) in B...
 
Text Detection and Recognition
Text Detection and RecognitionText Detection and Recognition
Text Detection and Recognition
 
Presentation_OCR
Presentation_OCRPresentation_OCR
Presentation_OCR
 

Similar to Research support with optical character recognition apps

In Search Of The Lost Book - Improving Library Usability
In Search Of The Lost Book - Improving Library UsabilityIn Search Of The Lost Book - Improving Library Usability
In Search Of The Lost Book - Improving Library Usability
Petteri Kivimäki
 
1-Lec - Introduction vhvv,vbvv,v (2).ppt
1-Lec - Introduction vhvv,vbvv,v (2).ppt1-Lec - Introduction vhvv,vbvv,v (2).ppt
1-Lec - Introduction vhvv,vbvv,v (2).ppt
AqeelAbbas94
 

Similar to Research support with optical character recognition apps (20)

The Library Technology Prototyping Service at Illinois
The Library Technology Prototyping Service at IllinoisThe Library Technology Prototyping Service at Illinois
The Library Technology Prototyping Service at Illinois
 
Customizing Discovery Interfaces: Understanding Users’ Behaviors and Providin...
Customizing Discovery Interfaces: Understanding Users’ Behaviors and Providin...Customizing Discovery Interfaces: Understanding Users’ Behaviors and Providin...
Customizing Discovery Interfaces: Understanding Users’ Behaviors and Providin...
 
Macon - about the project
Macon - about the projectMacon - about the project
Macon - about the project
 
Own the User Experience: Provide Discovery for Your Users
Own the User Experience: Provide Discovery for Your UsersOwn the User Experience: Provide Discovery for Your Users
Own the User Experience: Provide Discovery for Your Users
 
Presentation to 2014 University of Guelph Accessibility Conference Perspectiv...
Presentation to 2014 University of Guelph Accessibility Conference Perspectiv...Presentation to 2014 University of Guelph Accessibility Conference Perspectiv...
Presentation to 2014 University of Guelph Accessibility Conference Perspectiv...
 
Responsive hackfest: Code4Lib2014 Pre-conference
Responsive hackfest: Code4Lib2014 Pre-conferenceResponsive hackfest: Code4Lib2014 Pre-conference
Responsive hackfest: Code4Lib2014 Pre-conference
 
AAUP 2016: UPScope (S. Doerr)
AAUP 2016: UPScope (S. Doerr)AAUP 2016: UPScope (S. Doerr)
AAUP 2016: UPScope (S. Doerr)
 
Library website usability study 2012
Library website usability study 2012Library website usability study 2012
Library website usability study 2012
 
Electricity_Monitoring_Presentation.pptx
Electricity_Monitoring_Presentation.pptxElectricity_Monitoring_Presentation.pptx
Electricity_Monitoring_Presentation.pptx
 
1st meeting of PG PUSHPIN
1st meeting of PG PUSHPIN1st meeting of PG PUSHPIN
1st meeting of PG PUSHPIN
 
In Search Of The Lost Book - Improving Library Usability
In Search Of The Lost Book - Improving Library UsabilityIn Search Of The Lost Book - Improving Library Usability
In Search Of The Lost Book - Improving Library Usability
 
empirical-SLR.pptx
empirical-SLR.pptxempirical-SLR.pptx
empirical-SLR.pptx
 
Designing e-Learning Objects
Designing e-Learning ObjectsDesigning e-Learning Objects
Designing e-Learning Objects
 
Jones "Enabling Discovery in the Library"
Jones "Enabling Discovery in the Library"Jones "Enabling Discovery in the Library"
Jones "Enabling Discovery in the Library"
 
Research Data Services @ Edinburgh: MANTRA & Edinburgh DataShare
Research Data Services @ Edinburgh: MANTRA & Edinburgh DataShareResearch Data Services @ Edinburgh: MANTRA & Edinburgh DataShare
Research Data Services @ Edinburgh: MANTRA & Edinburgh DataShare
 
1-Lec - Introduction vhvv,vbvv,v (2).ppt
1-Lec - Introduction vhvv,vbvv,v (2).ppt1-Lec - Introduction vhvv,vbvv,v (2).ppt
1-Lec - Introduction vhvv,vbvv,v (2).ppt
 
Online Journal Management using Open Journal Systems (OJS)
Online Journal Management using Open Journal Systems (OJS)Online Journal Management using Open Journal Systems (OJS)
Online Journal Management using Open Journal Systems (OJS)
 
CrowdAsk- A Crowdsourcing Reference System (Internet Librarian 2014)
CrowdAsk- A Crowdsourcing Reference System (Internet Librarian 2014)CrowdAsk- A Crowdsourcing Reference System (Internet Librarian 2014)
CrowdAsk- A Crowdsourcing Reference System (Internet Librarian 2014)
 
Ocls 4th annual breakfast 2016
Ocls 4th annual breakfast 2016Ocls 4th annual breakfast 2016
Ocls 4th annual breakfast 2016
 
Avalon at Stanford University Libraries
Avalon at Stanford University LibrariesAvalon at Stanford University Libraries
Avalon at Stanford University Libraries
 

Recently uploaded

Salient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functionsSalient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functions
KarakKing
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
heathfieldcps1
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
ciinovamais
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
QucHHunhnh
 

Recently uploaded (20)

Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
 
Making communications land - Are they received and understood as intended? we...
Making communications land - Are they received and understood as intended? we...Making communications land - Are they received and understood as intended? we...
Making communications land - Are they received and understood as intended? we...
 
Salient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functionsSalient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functions
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
 
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17  How to Extend Models Using Mixin ClassesMixin Classes in Odoo 17  How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
 
How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17
 
Google Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptxGoogle Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptx
 
SOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning PresentationSOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning Presentation
 
Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docx
 
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfUGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
 
Towards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptxTowards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptx
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
 
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
 
How to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POSHow to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POS
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.
 
Micro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfMicro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdf
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 

Research support with optical character recognition apps

  • 1. Research support with optical character recognition apps Jim Hahn
  • 3. Introduction • Uses for OCR in library settings – The prototype Text-shot module uses OCR software and a backend search system for subject and title recommendations. – The choice to recommend library content to users from the app stems from the objective to connect students with library resources, and to help students integrate library resources into their work. 3
  • 4. Optical Character Recognition Apps • Wordlens app: can translate words from different languages using a digital camera feed • Google Goggles app: take a picture of a book cover (or painting)to run a google search on the topic • Camscanner app: digitize print documents with camera on app and store/share documents with others 4
  • 5. Literature Review • Optical Character Recognition APIs – Evernote API: dev.evernote.com/doc – Google Drive API: support.google.com/drive – VuForia SDK: developer.vuforia.com/resources/sdk/android 5
  • 6. Methodology • Formative evaluation – Small set of test participants to gather feedback early in the design phase so that the software development process can progress in a direction that will support user requirements for the software 6
  • 7. Methodology • Test Participants – Students were recruited from the General Studies 101 course. They are in their first year of study at the university and have not yet chosen a major. – There were a total of five test participants in the first round of study. 7
  • 8. Methodology • Study Process – Students were given an Android phone with the Text-shot app loaded. Investigators observed the students as they used the OCR mobile software to obtain suggested library resources. Investigators collected two sources of data: observation of how students interact with the software and a debriefing interview. 8
  • 9. Functionality Tests • Researchers tested the two main functions for the software. – Recognizing a string of text by taking a picture of the words in a student assignment sheet and; – suggesting subjects and titles based on the scanned text. 9
  • 10. Results • Themes related to the improvement of suggestions: – Show broad subjects first • Then expand to details subjects – Prominently display title suggestions 10
  • 11. Results • Feature Requests: – Include articles as well as book titles in recommendations • Use article APIs • LibGuides-like help guides 11
  • 13. Next steps in OCR • Topic Space app: Scanning call numbers in the library – If you scan a call number on a book, you can get recommendations of other, related books in the library, and other related digital content in the library. 13
  • 14. Topic Space: Book Scan 14
  • 15. Topic Space: Suggested Topic Spaces 15
  • 16. Topic Space: Related Books that are not available 16
  • 18. Future directions • Implementing OCR modules in the Minrva app: – http://minrvaproject.org/modules_topicspace.ph p • Open sourcing OCR technology for use in library settings: – http://minrvaproject.org/source.php 18
  • 19. Sponsors • Institute of Museum and Library Services • University of Illinois Campus Research Board 19
  • 20. Acknowledgements • My thanks to Ben Ryckman for Topic Space module development and support. • Many thanks to Chris Diaz, Residency Librarian, Scholarly Communications and Collections, University of Iowa for help with participant recruitment, observation, and interviewing support in the user studies • Thanks to Mayur Sadavarte, Graduate Student in Computer Science at the University of Illinois and Nate Ryckman, Graduate Student in Information Systems Management at Carnegie Mellon University for Optical Character recognition programming support. • Yinan Zhang, PhD Candidate in Computer Science at the University of Illinois, Sherry (Mengxue) Zheng, Graduate Student in Computer Science for help developing the search and suggestion functionality of the Deneb near-semantic index, Maria Lux, Graphic Designer for laying out the polished recommendations and prototyping Text-shot integration as a Minrva module. 20