What’s past is…
still messing with our
workflows
Jacqueline Whyte Appleby
Scholars Portal
Ontario Council of University Libraries
What is the future
of ebooks management?
Context - Ontario
● 21 universities
● All are public, all have a research mandate.
● Range in size from 1,300 to 83,000 students
● Their libraries work together through OCUL,
the Ontario Council of University Libraries.
Context - Scholars Portal
● Scholars Portal builds & maintains digital services for Ontario university libraries.
● Mix of content & member services
● Locally loading journals since 2001, books since 2009
● Journals platform has been TDR certified since 2013, Books is next
Context - Books platform
● Hosts about 250,000 commercial texts, 400,000+ OA or public domain texts
● PDF & XML-based texts
● Platform released in 2009, software it’s built on sunsetted in 2011
● Platform redevelopment 2016-2018
~ ASSUMPTIONS ~
1. Files will be sent in standard packages
& formats
2. We’ll get MARCs for everything
The dream:
1. Get 1000 PDFs
2. Get 1000 MARCs
3. Match
4. Load everything
5. Celebrate
The reality:
1. Get some PDFs
2. Maybe get some MARCs?
3. Try to match
4. ¯_(ツ)_/¯
5. Load what we can
6. Cry
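A minimal sketch of the "try to match" step, offered as an illustration rather than the actual Scholars Portal loader. It assumes (hypothetically) that each PDF is named after its ISBN and that each MARC record has already been parsed into a dict with an `isbn` key; the function and field names are invented for this example.

```python
import re


def normalize_isbn(raw):
    """Drop hyphens, spaces and other punctuation so ISBN variants compare equal."""
    return re.sub(r"[^0-9Xx]", "", raw).upper()


def match_pdfs_to_marcs(pdf_names, marc_records):
    """Pair PDFs with MARC records that share an ISBN.

    pdf_names: filenames assumed to be '<isbn>.pdf' (an assumption, not a standard).
    marc_records: dicts with an 'isbn' key, standing in for parsed MARC records.
    Returns (matched pairs, PDFs with no record, records with no PDF).
    """
    records_by_isbn = {normalize_isbn(r["isbn"]): r for r in marc_records}
    matched = []
    pdfs_without_record = []
    for name in pdf_names:
        stem = name.rsplit(".", 1)[0]
        # pop() so that whatever is left afterwards is a record with no PDF
        record = records_by_isbn.pop(normalize_isbn(stem), None)
        if record is None:
            pdfs_without_record.append(name)
        else:
            matched.append((name, record))
    records_without_pdf = list(records_by_isbn.values())
    return matched, pdfs_without_record, records_without_pdf
```

The two leftover lists are the point: they are the books that cannot be loaded yet and the records that arrived without a file.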
3. Everyone will buy everything
4. DRM will be loose or non-existent
In sum
● Lack of standardization
● Third party miscommunication
● Licenses are a wild ride
Now what?
#BOOKSGOALS
Harmonized metadata
● BITS (Books Interchange Tag Suite) is the sister
XML tag suite to JATS
● Allows for book-level and chapter-level metadata
● All publisher metadata is crosswalked to BITS,
then ingested into MarkLogic
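A rough sketch of what a per-publisher crosswalk to BITS might look like. The two publisher field layouts are hypothetical, and the output uses only a handful of BITS element names (`book`, `book-meta`, `book-title-group`, `book-title`, `isbn`, `publisher`, `publisher-name`); it is not validated against the BITS DTD and says nothing about the MarkLogic ingest step.

```python
import xml.etree.ElementTree as ET

# Hypothetical source layouts: each publisher names the same facts differently.
FIELD_MAPS = {
    "publisher_a": {"title": "Title", "isbn": "ISBN13", "publisher": "Imprint"},
    "publisher_b": {"title": "book_title", "isbn": "isbn", "publisher": "pub_name"},
}


def crosswalk_to_bits(source, record):
    """Map one publisher-specific record (a dict) onto a small BITS-style tree."""
    fields = FIELD_MAPS[source]
    book = ET.Element("book")
    meta = ET.SubElement(book, "book-meta")
    title_group = ET.SubElement(meta, "book-title-group")
    ET.SubElement(title_group, "book-title").text = record[fields["title"]]
    ET.SubElement(meta, "isbn").text = record[fields["isbn"]]
    publisher = ET.SubElement(meta, "publisher")
    ET.SubElement(publisher, "publisher-name").text = record[fields["publisher"]]
    return book
```

Whatever the incoming layout, everything downstream sees one shape; the cost is one entry in the mapping table per publisher (and often per year of content).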
Accessible, accessible, accessible
● ACE: respond to Accessibility for Ontarians with Disabilities Act, reduce duplication, offer our
students more.
● With a token, students can access scanned copies of books from their local collections
● Now: access the whole of their ebooks entitlements, request alternate formats on the fly.
Long-term
preservation
Admin tool for all w/ hierarchical collections
● KBART on the fly
● MARC packages on the fly
● COUNTER 5 stats on the fly
...it’ll be pretty fly
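As an illustration of "KBART on the fly", here is a small sketch that builds a tab-delimited title list for one node of a hierarchical collection tree. The slash-separated `collection` path and the function name are assumptions made for this example, and only a subset of KBART column names is used; a real KBART file carries the full set of columns in the order the recommended practice specifies.

```python
import csv
import io

# A subset of KBART field names, used here for brevity.
KBART_COLUMNS = [
    "publication_title",
    "print_identifier",
    "online_identifier",
    "title_url",
    "first_author",
    "title_id",
    "publisher_name",
    "publication_type",
]


def kbart_for_collection(books, collection):
    """Return KBART-style TSV text for every book in a collection or its children.

    books: dicts keyed by KBART field names plus a hypothetical 'collection'
    path such as 'oxford/law'. Missing fields are written as empty strings.
    """
    buffer = io.StringIO()
    writer = csv.DictWriter(
        buffer,
        fieldnames=KBART_COLUMNS,
        delimiter="\t",
        lineterminator="\n",
        extrasaction="ignore",  # drop keys (like 'collection') that are not columns
    )
    writer.writeheader()
    for book in books:
        path = book["collection"]
        if path == collection or path.startswith(collection + "/"):
            writer.writerow({**book, "publication_type": "monograph"})
    return buffer.getvalue()
```

Asking for `"oxford"` would return everything under that node, while `"oxford/law"` would return only the sub-collection.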
DRM - the friendly version
● The friendly version is no DRM
● The second friendliest version replicates the experience
of non-DRM restricted content as closely as possible
Unsolved challenges
~ an interlude ~
Corrections
● There is no standard for how corrections to an already-loaded book are sent.
● If it’s the whole book - is it a duplicate?
● If it’s a page or chapter - how to integrate?
Chapter-level mark-up
vs
#BIGGERBOOKSGOALS
Adding local content
● ETD
● Other IR content
● Other local publications
● Workflows?
● Metadata?
● Entitlements?
OER
● The development of a provincial OER strategy is a hot topic
● First of all: preservation
● But what about copy-editing, remixing, peer-review within the system?
Web archiving
● Archive-It use is on the upswing
● How can we make a home for non-PDF content?
● How can institutions contribute their own collections?
What is the future
of managing ebooks?
Stewardship
Thanks for listening!
Questions?
jacqueline@scholarsportal.info
The Scholars Portal Books team is:
Bartek Kawula, Sadia Khwaja, Ivan Jankovic,
Sunil Manikonda, Ravit David, Annie Thomas
Selvarajan, Jacqueline Whyte Appleby
With support from: Kate Davis, Amaz Taufique,
Bikram Singh, Harpinder Singh, and Carlos
McGregor Muro.

Charleston Conference 2017 - What's Past is Still Messing With Our Workflows


Editor's Notes

  1. Good morning, my name is Jacqueline Whyte Appleby and I’m the Scholarly Resources Librarian with Scholars Portal, the service arm of the Ontario Council of University Libraries. Despite the title, I only plan to spend a little while talking about what’s messed up. The real question I want to address now isn’t why everything is so messy but
  2. How do we anticipate the future of ebooks? My organization is building an ebooks platform now, fully aware that the ebooks landscape will be vastly different in five years. How do we get ready for what’s ahead? That’s not just a technology question, it’s a licensing question, it’s a budget question, it’s a staff development question. For today I’m mostly going to treat it as a technology question.
  3. Scholars Portal is the service arm of OCUL. Content services: local hosting and discovery point for books, journals, microdata, geospatial data. Member services: ILL, chat reference. Somewhere in the middle: accessible texts repository, research data management support. Most of our members do not participate in Portico; we are the long-term preservation strategy of many of them. This means that almost every license that OCUL negotiates for journals and books has a local load clause that articulates the rights we need for long-term preservation.
  4. Publishers loaded include IEEE, Wiley, Taylor & Francis, Springer, Morgan & Claypool, Oxford, Cambridge, all Canadian university presses, and many other UPs. The platform was released in late 2009, and the back end was built using software that was shortly thereafter purchased by a major library vendor and not developed any further after 2011. So this is olllld. And kind of a black box in a lot of ways. We began a redevelopment of the platform in the summer of 2016. It’s available to all of our library staff in beta right now, and its public beta is scheduled for release in January. Full release in April. This is a really exciting opportunity for us because
  5. We made some assumptions way back in 2009. I do want to talk briefly about these assumptions, because I think a lot of us are still carrying around these assumptions, maybe not explicitly, but buried in our workflows.
  6. When we get journals, they come to us as issues, with a whole bunch of PDFs, one for each article. They always come like this. The formats and structures in which books are packaged are much, much more diverse. These are some pretty standard formats. The blue represents a folder, pink and red represent different file types.
  7. But there are also other ways that we might get the content. For stuff that crosses many years, different books might have different file types. So we might have really good XML markup, but only for books from 2016 onward. Or we might get some books with chapters, some books as single PDFs. Might get TEI. Might just get an Excel file. What this means is that it’s impossible to write loaders that can account for all of these configurations. A loader needs to know where to find front matter, and it needs to know whether it should be looking to concatenate chapters. Automation has been a real challenge too, because this will change from load to load: a new person will be packaging the files and will do it differently. That’s not to blame them; there’s no standard for how these should come.
  8. We assumed this one so hard, we purchased a platform that is dependent on it. We cannot load a book without an associated MARC record. That’s the only place the software is capable of grabbing metadata from. But the workflow for sending PDFs and MARCs together is a dream and not a reality. The reality of course is that there are always going to be some records missing, or a delay in sending them. But the reality is also that often third parties are hired to create the records, so there’s a communication disconnect. And in these cases what happens is we get emails from librarians and faculty going, “my book just came out where is it!!!!” and we can’t do anything until we have a record.
  9. Think back to the magical time that was 2009, when the big deal was still generally accepted as the way to get content. Our entitlements module was set up with the understanding that folks either bought the Oxford ebooks collection, or they didn’t. Fast forward to 2017 and we have schools buying incredibly granular packages, and schools dropping out of deals halfway through a year, which means breaking up those entitlements. We have mergers and acquisitions, which mean metadata no longer distinguishes between two publishers while the backlist is still sold separately. We have package configurations that change over time as publishers build or cease certain subject areas. And we have some schools buying through third-party vendors while other schools are buying directly through the publisher. Same books, but possibly sold in different configurations, and also possibly with different licensing terms.
  10. Which leads me to my last assumption: we went many, many years only signing deals without concurrent use restrictions. And then one day a concurrent use restricted license showed up, and our software was not designed to deal with that. In many cases users are not allowed to print or copy even a single page from high demand content. We wound up integrating Adobe Content Server into the system, which did not go over well. A major selling point of our platform is a consistent interface for many different ebook packages, and this broke that. It was a surprise and an annoyance to users who were used to always being able to access everything in the browser.
  11. So we made some assumptions, and we were wrong, because of these things. And I don’t think any of these issues have really gone away, so they need to be at the front of our minds as we go forward.
  12. So: we have an amazing opportunity to do things differently, to learn from past trials & tribulations. We got funding to hire two new programmers full time for two years. It has been fantastic to have new staff not only because it’s more people working on everything but because they are not bogged down in the history of the project. They are able to step back and say, “why?” Our new books platform is running on MarkLogic, which is the software we’ve been using for Journals for years. It’s also what Healthcare.gov runs on. It’s also what the NSA uses, I hear. It can handle a lot of data.
  13. This is what the reading experience will be like on the new platform. All PDFs are rendered as HTML, which makes for a much more fluid reading experience. The bar on the left can be toggled closed.
  14. In broad terms, we want a better user experience (like everyone). But we’re really trying to think broadly about who a user might be: it’s a user at an OCUL institution; it’s a user at an OCUL institution with a visual or perceptual disability; it’s a library staff member at an OCUL institution; it’s a publisher; it’s our own staff; it’s anyone in the world interested in the growing open access and public domain content we load.
  15. BITS is the sister XML tag suite to JATS, which we use for journals. The plan is that at a certain point books and journals content can be more integrated. They currently sit on two different platforms. Having related metadata standards is one of the most important steps we can take in that direction. We don’t get any metadata in BITS, so we need to write crosswalks for 100% of the content we receive. We didn’t used to need to do that, because a MARC is a MARC: just ingest what you get. We’re now able to make use of much richer metadata... but it’s always got a unique DTD. ONIX 2 is not the same from publisher to publisher. Long term we hope BITS will be more widely adopted. JATS is pretty prevalent now; it’s been deemed pretty useful.
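To give a feel for what a crosswalk involves, here is a minimal sketch in Python. It is not our production crosswalk: the two publisher field maps and their field names are invented for illustration, only a handful of BITS elements are emitted (`book-meta`, `book-id`, `book-title-group`, `book-title`, `isbn`, `publisher`, `publisher-name`), and the output is not validated against the BITS DTD.

```python
import xml.etree.ElementTree as ET

# Hypothetical per-publisher maps: common field name -> that publisher's field name.
# Real feeds differ in many more ways than a simple rename.
PUBLISHER_FIELD_MAPS = {
    "publisher_a": {"id": "ref", "title": "title_text", "isbn": "isbn13", "publisher": "imprint"},
    "publisher_b": {"id": "product_id", "title": "name", "isbn": "ean", "publisher": "publisher"},
}

def to_common(record, source):
    """Rename one publisher's fields to the common names used below."""
    field_map = PUBLISHER_FIELD_MAPS[source]
    return {common: record[theirs] for common, theirs in field_map.items()}

def crosswalk_to_bits(record, source):
    """Build a small BITS-style <book> fragment from one publisher record."""
    common = to_common(record, source)
    book = ET.Element("book")
    meta = ET.SubElement(book, "book-meta")
    book_id = ET.SubElement(meta, "book-id", {"book-id-type": "publisher-id"})
    book_id.text = common["id"]
    title_group = ET.SubElement(meta, "book-title-group")
    ET.SubElement(title_group, "book-title").text = common["title"]
    ET.SubElement(meta, "isbn").text = common["isbn"]
    publisher = ET.SubElement(meta, "publisher")
    ET.SubElement(publisher, "publisher-name").text = common["publisher"]
    return ET.tostring(book, encoding="unicode")

if __name__ == "__main__":
    print(crosswalk_to_bits(
        {"product_id": "b-42", "name": "Example Title", "ean": "9780000000001", "publisher": "Example Press"},
        "publisher_b",
    ))
```

The point of the two-step shape is that only the field map changes per publisher; everything downstream of the common record stays the same.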
  16. As you probably know, students with disabilities can request that a print book be scanned for them, but this takes time, and once they have the file, where is it kept? The idea with ACE was to centralize that process: scan once, then make the scan available to all schools that have a local copy. And host it on the books platform so that students with disabilities can search and discover other works they have access to. BUT it was its own separate portal. It’s now been fully integrated into the Books platform. Authenticated students with a registered disability will have access to the ACE collection, but they will also be able to search and download any book they’re entitled to in the system. That’s 250,000 titles from most commercial publishers. And they’ll be able to request an alternate format if a standard PDF doesn’t work for them. Internet Archive has a tool that generates alternate formats, and turnaround is a couple of days. Just to be clear: our schools do not sign licenses that do not give us the right to transform content for users with disabilities.
  17. Journals is a TDR, Books is not yet. So part of building the new platform has been figuring out the workflow for this. The workflow will be different from journals, because the landscape has changed since we began preserving that content. We will likely be using Archivematica, a tool for creating Archival Information Packages, and we’ll store the final product in the Ontario Library Research Cloud, our distributed cloud storage network which has nodes at five universities across the province. Preservation is as much about policy as it is about technology, so we’re really confirming our rights with each renewal. We need the right to locally load, the right to perpetual access, and the right to transform content over time. We cannot preserve a collection without those three grants, and our schools do not sign licenses that do not allow for long term preservation.
  18. As I said, eresource staff are users too, and we want to build a library of MarkLogic queries so that staff can at any time grab a current list of titles loaded, get MARCs if they use them (and a list of books that don’t have an associated MARC!), and also get COUNTER 5 stats. It does not make sense for us to work in COUNTER 4 at this point, so some crosswalking will be necessary.
  19. Collections are nested and hierarchical, so we can break them down as far as we need to. The admin tool is live, but the buttons are a mock-up.
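A nested collection model can be sketched in a few lines of Python. This is an illustration of the idea only, not the data model behind the admin tool: the `Collection` class, the helper functions and the collection names are all hypothetical.

```python
from dataclasses import dataclass, field

@dataclass
class Collection:
    """A collection holds its own titles plus any number of sub-collections."""
    name: str
    titles: list = field(default_factory=list)
    children: list = field(default_factory=list)

    def all_titles(self):
        """Titles in this collection and everything nested beneath it."""
        found = list(self.titles)
        for child in self.children:
            found.extend(child.all_titles())
        return found

def find(collection, name):
    """Depth-first search for a collection by name; None if absent."""
    if collection.name == name:
        return collection
    for child in collection.children:
        hit = find(child, name)
        if hit is not None:
            return hit
    return None

def entitled_titles(root, entitlements):
    """Titles a school can access, given the collection names it has bought."""
    titles = set()
    for name in entitlements:
        node = find(root, name)
        if node is not None:
            titles.update(node.all_titles())
    return sorted(titles)

if __name__ == "__main__":
    root = Collection("Publisher X", children=[
        Collection("2017 frontlist", children=[
            Collection("2017 history", titles=["Book A", "Book B"]),
            Collection("2017 law", titles=["Book C"]),
        ]),
        Collection("Backlist", titles=["Book D"]),
    ])
    # One school bought a single subject slice, another the whole frontlist.
    print(entitled_titles(root, ["2017 law"]))
    print(entitled_titles(root, ["2017 frontlist", "Backlist"]))
```

Because an entitlement can attach at any level of the tree, a granular purchase and a big-deal purchase are handled by the same lookup.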
  20. As I said, we weren’t set up to monitor concurrent use in the browser. We now feel fairly confident that we can… but we don’t have good data on user behaviour, because no one’s had an option to not use ADE. Are there people who do actually prefer that format?
  21. So we’re going to offer both. If you just want to browse a bit, or read a single chapter, you can do that in the browser. If you want to really engage with the book, you can download it using Adobe Digital Editions. And if a year from now we can see that no one is bothering to check out the books, we’ll probably cease using it.
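For the in-browser side, monitoring concurrent use comes down to counting open reading sessions per title and refusing new ones once the licensed limit is reached. The sketch below is one simple way to do that, under assumptions that are ours for the example: a hypothetical `SeatTracker` class, one tracker per title, and an idle timeout after which an abandoned session gives its seat back.

```python
class SeatTracker:
    """Tracks who currently holds a 'seat' on one concurrent-use title."""

    def __init__(self, max_seats, idle_timeout=900):
        self.max_seats = max_seats        # seats allowed by the license
        self.idle_timeout = idle_timeout  # seconds of inactivity before a seat frees up
        self._sessions = {}               # session id -> time of last activity

    def _expire(self, now):
        """Release seats held by sessions that have gone idle."""
        self._sessions = {
            sid: last for sid, last in self._sessions.items()
            if now - last < self.idle_timeout
        }

    def open(self, session_id, now):
        """Try to take (or refresh) a seat; returns False if the title is full."""
        self._expire(now)
        if session_id in self._sessions or len(self._sessions) < self.max_seats:
            self._sessions[session_id] = now
            return True
        return False

    def close(self, session_id):
        """Give the seat back when the reader closes the book."""
        self._sessions.pop(session_id, None)

if __name__ == "__main__":
    tracker = SeatTracker(max_seats=1, idle_timeout=10)
    print(tracker.open("reader-a", now=0))   # first reader gets the seat
    print(tracker.open("reader-b", now=5))   # title is full
    print(tracker.open("reader-b", now=11))  # reader-a has gone idle, seat is free
```

Timestamps are passed in rather than read from the clock so the behaviour is easy to test; a real service would also need shared storage across web servers.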
  22. Again, just a mock-up.
  23. Those are things we feel pretty confident about putting into place, these are challenges we’re
  24. This is an extremely unsexy issue that we have got to figure out how to deal with, but even better would be for some standards organization to deal with it and demand everyone fall in line. Since we can’t get initial deliveries in a standard format, I’m not holding out a lot of hope, but... corrections are an issue.
  25. Poor markup. It looks bad and is confusing, and is also an accessibility issue. To publishers’ credit, most are very receptive to feedback, and we expect 2018 and 2019 books to look better, but no one is going to go back and fix these. The work it would take to teach a computer to scan a book, recognize a chapter heading, and then replace what we’ve got is... a lot for a small issue, so it’s on the back burner.
  26. So that’s the stuff we’ve done in the last year, or we know we need to get done shortly. But we also know there are broader, bigger changes happening. We know that digital scholarship is pushing the boundaries of what a book is, and that the monograph read cover to cover or a chapter at a time is no longer the most useful unit of self-expression or study. So the bigger question is: how can we be adaptable to what comes next?
  27. Right now we’re getting one-offs, often local faculty publications or local conference proceedings. And we deal with all of that by email. Could this act as a more repository-like tool? We have the underlying preservation infrastructure in place. Who’s in charge of making sure metadata is good and entitlements are accurate?
  28. Long-term hosting, but also allowing for in-site remixing? Displaying reuse clauses? Notification of reuse? Versioning? Discovery? Consider a situation where an instructor at one school wants to use a book in a certain configuration, and an instructor at another school wants to use it in an alternate way. Can we host both versions in a way that is useful? Preservation at the module level? In-site peer review?? Can we integrate with tools like PressBooks to allow for copy editing, remixing, and peer review within the system?
  29. Archive-It is a tool for capturing and preserving web data. There’s a lot of concern about the preservation of local history and municipal documents. At the federal and provincial levels there are some mandates in place, but nothing at the more local level. For instances where PDFs are available, there’s interest in creating metadata and hosting the PDFs locally. It’s a lot of work, but we’re well set up to host PDFs. But what about web data? Can we be flexible enough to display archived versions of websites? We think this will be an important piece of the preservation puzzle, long term. But to be useful, they should be indexed and searchable.