Ebooks without Vendors: Using Open Source Software to Create and Share Meaningful Ebook Collections

When you start building your own ebook collections from items in your community, you stop looking at them as licensed products and start seeing them as tools. In this talk I present the open source tools used to create The Community Cookbook website I built at Westlake Porter Public Library:
http://cooking.westlakelibrary.org

Presented at the Indiana Online Users Group Spring Meeting, May 16, 2014 in Indianapolis, IN. Slides updated for Oct. 10, 2014 talk at Ohio Library Council's Convention & Expo.

UPDATE: I wrote about this project for the Code4Lib Journal. The article includes more technical details: http://journal.code4lib.org/articles/9911


    1. EBOOKS WITHOUT VENDORS: Using Open Source Tools to Create and Share Meaningful Ebook Collections
    2. Who am I? Matt Weaver, IT Manager, Westlake Porter Public Library
    3. Not an alternative to OverDrive, ebrary, 3M, etc.
    4. EBOOKS AS TOOLS. To be created by: • the library • the community. For collaboration. For connection.
    5. Ebooks as source material for new products
    6. DIY Ebooks: Library as publisher
    7. An Experiment: Library as publisher
    8. An Experiment: Library as publisher
    9. WHY DIY? Design for your community: • Responsive • Relevant • Hyper-local
    10. WHY DIY? Gain knowledge and skills that can be applied in other projects/partnerships
    11. WHY DIY? Content independence
    12. OPEN SOURCE: WHAT IS IT?
    13. OPEN SOURCE: WHAT IS IT? • Free to use • Free to develop • Uses free licenses (GNU GPL most common)
    14. Open Source: Four Freedoms. The freedom to: • Run the program for any purpose • Study how the program works and adapt it to your needs (requires source code). www.gnu.org/philosophy/free-sw.html
    15. Open Source: Four Freedoms. The freedom to: • Redistribute copies so you can help your neighbor • Improve the program and release your improvements to the public. www.gnu.org/philosophy/free-sw.html
    16. Why Open Source? - Collaboration & Community. Zero software costs, yet you get powerful software
    17. Why Open Source? - Control over Content. You control development: ultimate control over content
    18. Why Open Source? - Collaboration & Community. • Collaborators can be united with common tools
    19. Why Open Source? - Collaboration & Community. No restrictions on collaboration imposed by software publishers' technologies or license agreements
    20. An Open-Source Model for Community Publishing: • affordable for even small libraries • return on investment
    21. Digital Rights Management (DRM): “Think of DRM on an eBook as a lock, with your eReader having the key to open the lock and display the file.” - Jason Griffey
    22. DRM in libraries: • Impedes access by imposing “friction” = technological obstacles • Expensive • Counterproductive • For much content, isn’t necessary
    23. DRM in libraries: “Adobe isn’t just tracking what users are doing in [Digital Editions 4]; this app was also scanning my computer, gathering the metadata from all of the ebooks sitting on my hard disk, and uploading that data to Adobe’s servers.”
    24. SECURING ACCESS TO CONTENT
    25. DIY: Copyright. Disclaimer: I am not now, nor have I ever been a lawyer. I am not a copyright expert.
    26. DIY: Copyright. Because of digital distribution, and because the library does not own titles to be digitized… • no Fair Use case • no Section 108 protections
    27. DIY: Copyright. Determine if book has fallen into the public domain, or seek permission from rightsholder
    28. DIY: Copyright - Resources • http://cocatalog.loc.gov/
    29. DIY: Copyright - Resources • http://collections.stanford.edu/copyrightrenewals/bin/page?forward=home
    30. DIY: Copyright - Resources: Digital Copyright Slider http://librarycopyright.net/resources/digitalslider/
    31. DIY: Copyright - Resources: Copyright Genie http://librarycopyright.net/resources/genie/
    32. DIY: Copyright – Show your work. Document copyright research to justify your usage, and to show that you acted professionally in trying to locate rightsholders.
    33. PERMISSION TO DIGITIZE
    34. DIY: Copyright - Guidelines
    35. Securing permission: consent forms. Organizational leaders: • may think they have to sign over copyright • may be afraid to sign something • will likely seek broader approval
    36. Securing permission: consent forms. Consent agreement should be clear on copyright. Be clear how content will be used. If you already have a consent form, make sure it applies to new projects. For consent agreement questions, consult an attorney.
    37. EBOOKS DISSECTED & DIGITIZED
    38. ePub as zip file
    39. ePub as zip file
    40. ebook markup: HTML & CSS
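
Because an EPUB is an ordinary zip archive holding HTML and CSS files, its layout can be illustrated in a few lines of Python. The following is a minimal sketch, not something shown in the talk: the file names under `OEBPS/`, the sample recipe text, and the helper names are hypothetical. The archive deliberately omits the package document (`content.opf`) that a complete EPUB needs, so it would not pass a validator.

```python
import io
import zipfile

# Tells a reading system where the package document lives.
# The OEBPS/content.opf path is a common convention, assumed here.
CONTAINER_XML = """<?xml version="1.0" encoding="UTF-8"?>
<container version="1.0" xmlns="urn:oasis:names:tc:opendocument:xmlns:container">
  <rootfiles>
    <rootfile full-path="OEBPS/content.opf" media-type="application/oebps-package+xml"/>
  </rootfiles>
</container>
"""

# Ebook content is XHTML styled by CSS (hypothetical sample page).
CHAPTER_XHTML = """<?xml version="1.0" encoding="UTF-8"?>
<html xmlns="http://www.w3.org/1999/xhtml">
  <head>
    <title>Sample recipe</title>
    <link rel="stylesheet" type="text/css" href="style.css"/>
  </head>
  <body>
    <h1>Sample recipe</h1>
    <p class="ingredient">1/2 cup sugar</p>
  </body>
</html>
"""

STYLE_CSS = "h1 { font-size: 1.4em; }\n.ingredient { margin-left: 1em; }\n"


def build_epub_skeleton():
    """Return the bytes of an incomplete, illustrative EPUB archive."""
    buffer = io.BytesIO()
    with zipfile.ZipFile(buffer, "w") as epub:
        # The mimetype entry comes first and is stored uncompressed.
        epub.writestr("mimetype", "application/epub+zip",
                      compress_type=zipfile.ZIP_STORED)
        epub.writestr("META-INF/container.xml", CONTAINER_XML,
                      compress_type=zipfile.ZIP_DEFLATED)
        epub.writestr("OEBPS/chapter1.xhtml", CHAPTER_XHTML,
                      compress_type=zipfile.ZIP_DEFLATED)
        epub.writestr("OEBPS/style.css", STYLE_CSS,
                      compress_type=zipfile.ZIP_DEFLATED)
    return buffer.getvalue()


def list_entries(epub_bytes):
    """List the file names inside an EPUB, in archive order."""
    with zipfile.ZipFile(io.BytesIO(epub_bytes)) as epub:
        return epub.namelist()


if __name__ == "__main__":
    for name in list_entries(build_epub_skeleton()):
        print(name)
```

Renaming any `.epub` file to `.zip` and opening it with an archive tool shows the same kind of listing.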
    41. Everything has been digitized, right? Bad OCR: hours, fractions. Scanned ≠ Digitized. Corrected WPPL Epub page.
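
To make the "bad OCR" point concrete, here is a small sketch of how garbled fractions in recipe text might be flagged for a human proofreader. It is an illustration under assumptions, not the workflow used for the project: the function name, the sample text, and the specific error patterns (a letter such as `l` or `O` standing in for a digit, or a stray space around the slash) are hypothetical examples of common OCR confusions.

```python
import re

# A fraction-like token whose "digits" may be look-alike letters (l, I, O, o),
# optionally with stray spaces around the slash.
FRACTION_LIKE = re.compile(r"(?<!\w)([0-9lIOo]{1,2})\s*/\s*([0-9lIOo]{1,2})(?!\w)")

# What a cleanly recognized fraction looks like, e.g. "1/2" or "3/4".
CLEAN_FRACTION = re.compile(r"\d{1,2}/\d{1,2}")


def suspicious_fractions(text):
    """Return (line_number, token) pairs that look like mis-recognized fractions."""
    hits = []
    for line_number, line in enumerate(text.splitlines(), start=1):
        for match in FRACTION_LIKE.finditer(line):
            token = match.group(0)
            if CLEAN_FRACTION.fullmatch(token) is None:
                hits.append((line_number, token))
    return hits


if __name__ == "__main__":
    sample = "1/2 cup sugar\nl/2 tsp salt\n1 /4 cup milk\nBake 1 hour"
    for line_number, token in suspicious_fractions(sample):
        print(f"line {line_number}: check {token!r}")
```

A check like this only points at likely trouble spots; the correction itself still needs someone comparing the text against the scanned page.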
    42. Homer ebook project http://bookscanner.pbworks.com/w/page/40965440/FrontPage
    43. Homer. The following tools are installed as part of the Homer Project: • ImageMagick (for manipulating images) • Jpegtran (lossless jpeg transformation) • JBIG2 encoder (compression tool for bi-level images) • Tesseract-OCR (optical character recognition) • RubyInstaller (installs the Ruby programming language) • Hpricot (HTML parser) • RMagick (interface between the Ruby programming language and ImageMagick) • Pdfbeads (to create searchable PDF) • Cmdow.exe (command-line utility used in Homer) • ScanTailor (post-processing tool) • Homer (command-line bash script)
    44. Ebook Production Workflow
    45. Ebook or Production Workflow
    46. Homer: ScanTailor • Preprocess tiff-format images of book pages • Deskewing • De-speckling • Correcting warp • Right-to-left language support • Outputs images for Homer
    47. Homer: ScanTailor
    48. HOMER BASH SCRIPT: It looks like command-line…
    49. HOMER BASH SCRIPT: but it’s drag-and-drop!!!
    50. Homer: tesseract-ocr. Optical Character Recognition. Multilingual support - From Afrikaans to Vietnamese
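
For readers curious what the OCR step looks like outside the drag-and-drop script, the sketch below builds and optionally runs a Tesseract command for each page image. It is an assumption-laden illustration, not a copy of what Homer does internally: the function names and the `scans` folder are hypothetical, and it assumes the `tesseract` executable and the chosen language data are installed. The `hocr` argument asks Tesseract for hOCR output (HTML with word positions), the kind of file a tool like pdfbeads can pair with page images.

```python
import shutil
import subprocess
from pathlib import Path


def tesseract_command(image, language="eng"):
    """Build the command line that OCRs one page image into an hOCR file.

    `language` is a Tesseract language code, e.g. "eng", "afr" (Afrikaans)
    or "vie" (Vietnamese).
    """
    image = Path(image)
    output_base = image.with_suffix("")  # Tesseract appends the extension itself
    return ["tesseract", str(image), str(output_base), "-l", language, "hocr"]


def ocr_folder(folder, language="eng"):
    """OCR every .tif page in `folder`; returns the commands that were built."""
    commands = [tesseract_command(page, language)
                for page in sorted(Path(folder).glob("*.tif"))]
    # Only run the commands when Tesseract is actually installed.
    if shutil.which("tesseract") is not None:
        for command in commands:
            subprocess.run(command, check=True)
    return commands


if __name__ == "__main__":
    # "scans" is a hypothetical folder of page images output by ScanTailor.
    for command in ocr_folder("scans", language="eng"):
        print(" ".join(command))
```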
    51. Homer: pdfbeads. Outputs a searchable PDF
    52. Homer & pdfbeads. Outputs a searchable PDF
    53. Sigil https://code.google.com/p/sigil/
    54. Epub Validator http://validator.idpf.org/
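
A few of the container rules that an EPUB validator enforces can be checked locally before uploading a file. The sketch below is a rough pre-check under stated assumptions, not a substitute for the validator: the function name is hypothetical, and it covers only three rules of the EPUB container format (the `mimetype` entry comes first, is stored uncompressed, and contains `application/epub+zip`; `META-INF/container.xml` exists).

```python
import io
import zipfile


def precheck_epub(epub_bytes):
    """Return a list of container-level problems found in an EPUB archive."""
    problems = []
    with zipfile.ZipFile(io.BytesIO(epub_bytes)) as epub:
        entries = epub.infolist()
        if not entries or entries[0].filename != "mimetype":
            problems.append("mimetype must be the first entry in the archive")
        else:
            if entries[0].compress_type != zipfile.ZIP_STORED:
                problems.append("mimetype must be stored without compression")
            if epub.read("mimetype") != b"application/epub+zip":
                problems.append("mimetype must contain application/epub+zip")
        if "META-INF/container.xml" not in epub.namelist():
            problems.append("META-INF/container.xml is missing")
    return problems


if __name__ == "__main__":
    import sys

    # Usage: python precheck.py mybook.epub
    with open(sys.argv[1], "rb") as handle:
        found = precheck_epub(handle.read())
    print("\n".join(found) if found else "No container-level problems found")
```

A file that passes this check can still fail full validation, since the validator also inspects the package document, the table of contents, and the markup of every page.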
    55. Calibre http://calibre-ebook.com/
    56. Drupal • Open source content management system • Widely used in libraries • Drupal 7 • “Responsive” layout. drupal.org
    57. Drupal: Ability to create custom fields for metadata – can be hidden from users
    58. 3 content types: • recipe • ebook • organization • Drupal 7 • “Responsive” layout
    59. Drupal – Recipe module
    60. Drupal – ILS authentication module
    61. USAGE (since late Oct. 2013): More than 1,800 ebook downloads. More than 32,000 individual recipes downloaded or printed.
    62. Costs: Content: $0. Software licensing: $0. Staff time: 4-7 hours per ebook (estimated).
    63. The Community Cookbook – what’s next?
    64. The Community Cookbook – what’s next?
    65. The Community Cookbook – what’s next? Original content: We can help organizations produce their own cookbooks. Work with organizations to produce ebook versions…but
    66. The Community Cookbook – what’s next? …with one more open-source tool, we can even help them design print versions: We can do everything but the printing.
    67. It’s an exciting possibility… for the future of libraries that there is value to be mined from content already in our communities.
    68. Even more exciting is the thought that the most valuable content to libraries is content from our communities that hasn’t been created yet.
    69. Further Reading
    70. Further Reading • Jarret Buse - A Hands-on Guide to EPUB2 and EPUB3 • Excellent guide to the guts of ebooks • Features many of the open-source programs I have discussed
    71. Further Reading: Stanford University: Copyright & Fair Use – Charts and Tools http://fairuse.stanford.edu/charts-and-tools/
    72. mattrweaver
    73. Image credits: Open Source Sign, Timothy Appnel - https://www.flickr.com/photos/tappnel/5798812875/; “Librarian from Turn of the Century” - http://www.moyak.com/researcher/Clients/male_librarians/index.html?id=34; Ereaders, Michael Porter - https://www.flickr.com/photos/libraryman/5052936803/; Apples & oranges - http://mrg.bz/n1xLHg
    74. Image credits: Techno_background2.jpg (ones and zeroes) - http://www.morguefile.com/creative/Grafixar; Pile of books with lock, Librarian in Black - http://librarianinblack.net/librarianinblack/2011/12/overdrive.html; Ricoh Copier - http://www.itinstock.com/ekmps/shops/itinstock/images/ricoh-aficio-mp-4001-fast-photocopier-copier-printer-scan-fax-5598-p.jpg
