BookNet Canada Project Manager, Tim Middleton finds out what happens if you ask ChatGPT to write your BiblioShare presentation. You'll also find out what happened with BiblioShare in 2022 and what's to come for 2023.
Link to presentation: https://youtu.be/cACWjPn_PXA
Presented by BookNet Canada on April 18, 2023, with support from the Department of Canadian Heritage.
8. Tech Forum 2023 #TechForum
How do I write a presentation on Bibliographic data?
9. Tech Forum 2023 #TechForum
Introduction: Begin by introducing the topic of bibliographic data and its significance in academic research. You
can explain what bibliographic data is and why it is crucial to cite sources accurately in academic writing.
Types of bibliographic data: Describe the various types of bibliographic data, such as author name, publication
date, title, publisher, and location. You can explain how these details differ for different types of sources, such
as books, journals, websites, and other publications.
Citation styles: Explain the different citation styles that are commonly used in academic writing, such as MLA,
APA, and Chicago. You can describe the formatting and citation rules of each style and provide examples of
how to cite different types of sources in each style.
Importance of accuracy: Emphasize the importance of accurately recording bibliographic data to avoid
plagiarism and to allow readers to locate the sources cited in your work. You can provide examples of how
inaccurate or incomplete bibliographic data can cause problems for researchers.
Tools for managing bibliographic data: Discuss the various tools available for managing bibliographic data, such
as citation management software, and provide an overview of their features and benefits.
Conclusion: Summarize the key points of your presentation and emphasize the importance of using accurate
and complete bibliographic data in academic writing.
Remember to use clear and concise language, provide relevant examples, and use visual aids to help illustrate
your points. With these steps, you should be able to create a comprehensive and effective presentation on
bibliographic data.
10. Tech Forum 2023 #TechForum
What about bibliographic data for the trade book industry?
11. Tech Forum 2023 #TechForum
Introduction: Begin by introducing the topic of bibliographic data and its importance in the trade book
industry. You can explain what bibliographic data is and why it is crucial for book professionals to have accurate
and complete data about the books they publish, sell, and manage.
Metadata elements: Describe the various metadata elements that make up bibliographic data in the trade
book industry, such as ISBN, title, author, publisher, publication date, format, genre, and audience. You can
explain how these elements help to identify and promote books to the appropriate readers.
Standards: Explain the various standards that are used in the trade book industry to ensure that bibliographic
data is consistent and easily shared among different systems and platforms. You can describe how standards
such as ONIX (ONline Information eXchange) and BISAC (Book Industry Standards and Communications) help
to streamline the distribution of bibliographic data.
Challenges: Discuss some of the challenges that arise when managing bibliographic data for the trade book
industry, such as keeping up with new formats and technologies, dealing with errors and inconsistencies in
data, and managing data across multiple platforms and systems.
Best practices: Provide some best practices for managing bibliographic data in the trade book industry, such as
following industry standards, using automated tools for data management, and regularly reviewing and
updating data.
Conclusion: Summarize the key points of your presentation and emphasize the importance of accurate and
complete bibliographic data for the success of the trade book industry.
.
12. Tech Forum 2023 #TechForum
Challenges: keeping up with new formats and technologies, dealing with
errors and inconsistencies in data, and managing data across multiple
platforms and systems.
13. Tech Forum 2023 #TechForum
How do you deal with errors and inconsistencies in data, and managing
data across multiple platforms and systems?
14. Tech Forum 2023 #TechForum
Use automated tools: One of the easiest ways to deal with errors and inconsistencies in bibliographic data is to
use automated tools such as data validation software, which can quickly identify and flag any errors in the
data. Automated tools can also help to streamline the process of managing data across multiple platforms and
systems.
Establish data management protocols: To ensure that data is consistent across different systems and platforms,
it is important to establish clear data management protocols. These protocols should include guidelines for
data entry, formatting, and maintenance, as well as procedures for resolving errors and inconsistencies.
Regularly review and update data: It is important to regularly review and update bibliographic data to ensure
that it remains accurate and up-to-date. This can be done by assigning a dedicated team or individual to
manage data and conducting regular audits to identify and resolve any errors or inconsistencies.
Follow industry standards: Following industry standards, such as ONIX and BISAC, can help to ensure that data
is consistent and easily shared across different platforms and systems. These standards provide guidelines for
data formatting and management, as well as procedures for resolving errors and inconsistencies.
Implement quality control measures: Quality control measures, such as peer review and data verification, can
help to identify and resolve errors and inconsistencies in bibliographic data. These measures can be
particularly effective when combined with automated tools and clear data management protocols.
By implementing these tips, you can help to ensure that your bibliographic data is accurate and consistent
across multiple platforms and systems, and that errors and inconsistencies are quickly identified and resolved.
21. Year over Year 2.1
Tech Forum 2023 #TechForum
2021
• Average New Records/day: 1,249
• Average Updated: 134,901
2020
• Average New Records/day: 1,586
• Average Updated: 143,974
2019
• Average New Records/day: 2,942
• Average Updated: 140,612
2018
• Average New Records/day: 3,014
• Average Updated: 103,232
2017
• Average New Records/day: 1,836
• Average Updated: 101,556
2022
• Average New Records/day: 2,047
• Average Updated: 145,977
22. Year over Year 3.0
Tech Forum 2023 #TechForum
2021
• Average New Records/day: 1,249
• Average Updated: 7,757
2020
• Average New Records/day : 1251
• Average Updated : 9532
2019
• Average New Records/day : 274
• Average Updated : 8000
2018
• Average New Records/day : 408
• Average Updated : 9290
2017
• Average New Records/day: 375
• Average Updated : 4278
2022
• Average New Records/day: 5,036
• Average Updated: 16,257
30. Who is using BiblioShare?
Tech Forum 2023 #TechForum
31. BiblioShare 2022
Tech Forum 2023 #TechForum
• 133,245,392 non-BookNet successful requests
came into our BiblioShare services (+22% yoy)
• From 158 Users
32. Web Service Usage
Tech Forum 2023 #TechForum
2022 2021
ONIX 78,755,660 146,748,033
ImageInfo 39,743,517 35,313,646
FULL IMAGE 12,981,249 10,738,105
BiblioSimple 11,131,199 6,991,242
interior 9,274,352 14,116,596
THUMB IMAGE 7,889,359 6,702,731
back 7,683,693 6,266,066
BshAssets 7,003,839 7,208,822
interior tn 6,581,928 502,043
Shopify 4,873,768 3,085,420
BiblioSimple3 97,824 2
back tn 24,114 3,304
author tn 2,148 570
SampleInfo 1,008 25
AUTHOR 36 570
MARCRecord 31
ReleaseCalendar 30 7,196
33. Top 40 of 158 users
Tech Forum 2023 #TechForum
Company Counts
Whitehots 67,835,453
Catalist btlf 54,791,327
BookNet CataList 39,692,260
ULS 30,903,488
49th Shelf 30,249,969
LIQ 17,186,002
All Lit Up 8,184,354
Livres Canada 6,504,009
LibraryBound 2,787,006
Libro World 1,769,025
Wilfrid Laurier 1,513,731
Playwrights 1,087,549
River Bookshop 955,438
Loan Stars 931,860
Rivendelle 851,810
Canadian University Presses 845,301
CoachHousePress 628,996
Once Upon a Bookshop 547,956
Bibli-O-Matic 547,246
Otaku Lounge 455,748
Lorimer 437,284
Raincoast 432,739
Type Books 408,978
Humpty Dumpty Books & Music
Ltd 397,086
Salesdata 392,265
APMA 351,065
Alberta Book Hub 273,691
Merriam Music 264,279
Arsenal Pulp 264,279
Bookworms and Co 262,889
Amanda Lee 252,697
The Bookshelf 213,156
Paragraphe Books 210,004
Sea to Sky 165,082
DE STIIL 144,385
Saga Bookshop 139,340
Book Lady of Fenelon Falls 136,210
Beggars Banquet Books 94,254
Theytus 69,254
Ripe Find 62,492
Megan Doodle 61,738
Hi and Welcome. Thanks for tuning in to this years BiblioShare talk at Tech Forum 2023. My name is Tim Middleton and I am the Product Coordinator for BNC BiblioShare at BookNet.
This year, naturally - I will be giving you an update on our BiblioShare numbers which I hope gives you a sense of the importance of this platform. I try to do that every year. And I will let you know about a couple of the projects that occupied the BiblioShare team. But I do that every year.
It shouldn't be a surprise, unless you haven't listened to a BiblioShare presentation before. If you haven't, not to worry - we have quite a few in the archives that you can listen to. But this year I wanted to do something a little different as well.
The reason I wanted to do this is two fold. One, I was having a hard time putting together the presentation. Maybe I've given a few too many of them? Maybe Covid has done more damage to my creativity then I realize? The second reason was because I like to keep up with the trends. No, I'm not going to announce that we are launching #bishtoc, although that would be incredibly trendy
No, I'm not going to announce that we are launching #bishtoc, although that would be incredibly trendy
And I’m not going to look at the trends that are driving sales – that is BNC SalesData land and Monique Mongeon has already done a great job of that. If you really want to know what is trending in books and gift I highly recommend you give her talk a listen. There you will hear about the categories that are popping in BNC SalesData and find out about everything trendy from mushrooms and upcycling to regencycore and goblincore. Of course in BiblioShare we could let you know how many titles have keywords that are leveraging these trends, but I didn’t do that.
Instead what I decided to do was invite some intelligence into this years discussion. I decided to invite chatgpt to the show! At first I just asked Chatgpt to make my BiblioShare presentation because I heard that's what all the kids are doing. But it didn't go that well. Chatgpt hadn't really heard of BiblioShare yet and was giving credit to all sorts of other platforms for BiblioShare - which when I followed it's trail showed me we had more education to do to get chatgpt up to speed. But we aren't prepared to train an ai model on the BiblioShare quite yet - we're still working on getting companies to transition to ONIX 3.0 10 years after the 2.1 has already been sunset.
So I took a step back and asked more general questions like how do I write a presentation on Bibliographic data? Chatgpt did much better with this and was genuinely helpful.
So I took a step back and asked more general questions like how do I write a presentation on Bibliographic data? Chatgpt did much better with this and was genuinely helpful and understanding by stating that: “Writing a presentation on bibliographic data can be a challenging task, but it is essential to understand the importance of bibliographic data in academic research and writing. Here are some steps that can help you create an effective presentation on bibliographic data” and then went on to give me some biblio 101 instructions for creating a presentation. That is what you are looking at in this slide. Needless to say I wanted to know more.
I wanted to zero in on the trade side of publishing and chatgpt obliged by stating:
Bibliographic data is important for the trade book industry as well. Publishers, booksellers, and librarians rely on bibliographic data to accurately describe and categorize books. Here are some steps to consider when writing a presentation on bibliographic data for the trade book industry:
Well I was getting hooked on chatgpt and started to worry that I would never get this presentation done because the questions I could ask were endless and so were the answers. But in chatgpts last answer I focused in on the challenges that chatgpt highlighted and asked it further:
And again chatgpt graciously recognized that Dealing with errors and inconsistencies in data, and managing data across multiple platforms and systems can be challenging
And then our interviewee, chatgpt offered some tips that any of us in the metadata game could only nod our heads at and fully endorse. In fact, this is what we do all the time at BookNet on the BiblioShare team.
The interview went on – because I wanted to dive into each of those topics in more detail but I will not make you endure each one of those questions and only want to say one thing really surprised me about this very smart, very helpful ai - it is very polite! At one point chatgpt told me something about the platform jstor that I questioned and when I did it very politely told me it was sorry, that it’s last answer may have misled me and that the correct response should have been the thing that it then wrote out.
Well that was my attempt at being trendy. If nothing else, it showed me there is quite a bit of potential in throwing your questions at this ai. I asked it questions I assumed it would have no idea about, but it gave convincing answers on a number of those. Now of course I’m not saying you’re going to replace your metadata team with chatgpt but I would think about using it to keep sharp in areas your team can’t possibly handle. Just be aware it won’t create your entire presentation for you. You are still needed! And with that I am now going to turn it over to my usual self and update you a bit on BiblioShare.
Lets start with the numbers:
As many of you may know CataList has been working with our French Language sister organization BTLF. And as you may also know CataList gets data from BiblioShare. So that means BiblioShare is receiving data for BTLF as well. We now have over 2 million covers for BTLF.
In 2022 the BiblioShare ONIX 2.1 dataset grew by about 225 thousand records. As many records as this is, this number is actually 70,000 lower then what we received in 2021.
But#
Our 3.0 dataset saw more than double the number of records yoy get added. I suppose this is a good place to remind you that BiblioShare treats 2.1 data and 3.0 data as completely
Discrete datasets. We don’t do anything crazy like trying to fit the 2.1 data into the 3.0 framework nor do we downgrade 3.0 data to the 2.1 framework. Although this means we are receiving what is basically duplicated data, 3.0 is not backwards compatible. At some point 2.1 data will be a thing of the past but until then keep sending both your 2.1 and your 3.0 data. We appreciate it!
In this slide we are looking at the daily average intake of data for both New records and updated records in BiblioShare. Because as chatgpt reminds us – bibliographic data is not static – at least not until the thing it is describing is no longer in available. Here we can see that the year 2022 was not a lot different then previous years although that average updated figure looks pretty good over last year. That means companies are paying attention to their data and keeping it fresh!
Looking at the 3.0 #
Dataset we see a big jump both in New records added and in the average for updated records. It is really starting to look like 3.0 data is becoming a thing and it’s only taken about 13 or so years to catch on…
All told we created 79 new or newish accounts in BiblioShare.
The breakdown of publisher#
The breakdown of publisher and supplier ONIX accounts show us we are still adding more 2.1 accounts then 3.0 but in the yoy view we added 40% fewer 2.1 accounts this year while the number of 3.0 accounts added was on par with last year. The story we are seeing here is the slowly ending of the life of 2.1 ONIX and the continuing growth of 3.0. This certainly should be sending signals to all of our data end users out there. If you haven’t worked out your 3.0 migration yet, better get on it!
What continues to amaze us on the BiblioShare at BookNet is the volume of consumption of our webservices and use of Shopify plugin. We had 21 new accounts established for either using our web services or in order to support their Shopify stores.
And we continue to build the holding in BiblioShare of supporting materials. Here we see that we added 7% to our cover images database so we now have close to 3.5 million covers on hand.
Interiors images, which are clearly one of the most obvious supports for everything from art books to children’s books when consumers are making buying decisions , grew by 16% in 2022 to reach almost 500K images.
Author images continues to trickle along and our author images grew by 4% in 2022.
And here is a look at the how the rest of the supporting materials database is growing. The explosive growth of the first couple of years for the samples and excerpts holdings has tapered off but hopefully this is because we are just in a regular workflow now to deliver those as part of your book records and so there is no great backlog to be uploaded. At least that is my take away from looking at the numbers.
So always interesting to see what kind of data and how much of it we are continuously processing but equally interesting is this question – Is the data being used and who is using all of this data?
Well once again our webservices were very busy. This 133million number excludes any of our in-house use of the apis, although we do use the exact same APIs as we make available to the public.
Here is a list of the active web services and what is being pulled. Obviously ONIX is still the biggie but we note that the number of requests to this service dropped dramatically this year. That could have to do with any number of reasons – including the possibility last year that someone had to draw down a lot of data to either prime their solution or repair something that got broke. This, I suppose is the limit of just giving a number without any context. But the other possibility is that solutions are getting smarter with these services. One signal I pick up from these numbers is the increase in usage of our bibliosimple webservice. BiblioSimple is a basically a stub record that we create from the full ONIX we receive. This stub can be a great service to use on preview pages or in blogs etc. where you don’t really need the full ONIX goodness to support your sales and marketing. A;so a nice bump in our Shopify support with almost 2 million extra calls being made.
And to answer that question I asked at the beginning of this segment – here is the list of our top 40 users of Biblioshare web services. The cool thing about this list is that it shows the flexibility of the services. These users in this list are made up of solution providers like Reader Bound, In-house solutions like Whitehots, Shopify users, pubisher sites, distributor sites, retaile sites and author sites. And of course you may have noticed I left in the BookNet products in this list just to give you a feeling for the spread of usage – BookNet is certainly not the only heavy user of the BiblioShare data.
And that was a quick tour through 2022 for BiblioShare data. Our focus remains on ONIX 3,0 data and migrating solutions and support to that dataset but we also had a couple of other projects that kept us busy.
You may know we built an App for the Shopify platform some years ago and it is now being used by close to 70 shops including retailers, publishers and distributors. We have two flavours of the App built for using our 2.1 data set and a third version that pulls from our 3.0 ONIX dataset.
Shopify is a dynamic platform and always making changes but this past year they made a change to their policy about unpublished Apps. Our App was an unpublished App – we were happy with it in that state, but the policy change would make our App obsolete since now users are not allowed to use an unpublished App. This meant we had to submit our App for Shopify review.
It was a bit of a hurdle for us to get past this review. Shopify is pretty strict on the way that Apps get installed and uninstalled, not to mention other frequent changes that they make to their APIs and functionality. As I said, we weren’t planning on publishing our App so when we received the policy change announcement in September we had to carve out some time to work on things. You can now find our App at the url:
find our App at the url: apps.shopify.com/app19778
You will need to request a user token from us which you can get by going to the booknetcanada website under the BiblioShare product and fill out the form for accessing data from biblioshare.
and even though our App is now published in the Shopify App store we still recommend using the Bibli-O-matic browser extension
To help you figure out if data is in our biblioshare database or not. As mentioned last year our Bibli-O-matic plugin now exposes data from both our 2.1 dataset or the 3.0 dataset, if the data exists. I have come to think of Bibli-O-matic as an indispensable tool for troubleshooting since you have at your fingertips a tool that will show you the metadata across all of the BNC platforms including the raw ONIX data that we have on hand, and links to the title in SalesData and CataList. Don’t leave work without it!
And since I am talking about troubleshooting your data I will just highlight once again this very valuable web service that we built called Bish Assets. This web service will show you whatever data we have on a title – that can include both versions of ONIX, a position file, cover images and any other supporting material that has been sent to BiblioShare.
And finally to round out this years presentation I want to draw your attention to a new file that we began generating almost a year ago – and that is a flat file that contains all of the isbns that are new to BiblioShare that day.
We generate a file each for both the 2.1 ONIX data set and the 3.0 data set. The data consists as you can see on this screen of the ean, the timestamp when it was added to the dataset, the main bisac code and the productform. One thing this feed accomplishes is to keep anyone using the BiblioShare data up to date. You can grab the list of isbns and start pulling the new data down to your own servers and applications. Access to these files is on our ftp server and is controlled by a login and password which you can request from us at any time. Like many of the offerings from BiblioShare this is yours for the asking!
Well that in a nutshell is the year in BiblioShare. Oh of course we worked on a number of custom projects for companies looking to pull metadata for their catalogues. We often have capacity to work with companies on custom applications but not always. If you are in the market for a custom solution then let us know. We may be swamped at the time of asking and we won’t be able to give you a hand but it doesn’t hurt to ask.