SlideShare a Scribd company logo
American Chemical Society
Lessons Learned From
Building a Taxonomy and
Indexing 140+ years of
Content
Michael Darr
Columbus, OH
DHUG 2021
February 10
© 2021 American Chemical Society
Who is the American Chemical
Society?
A non-profit scientific organization with
more than 140 years’ experience, we are a
champion for chemistry, its practitioners
and our global community of members.
ACS Family: ACS Publications, C&EN
news, CAS, AACT (American Association
of Chemistry Teachers)
ACS Publications is recognized as a
leading publisher of authoritative scientific
information. Our 60+ peer-reviewed
journals are ranked the “most-trusted,
most-cited and most-read”.
© 2021 American Chemical Society
ACS Publications Products
ACS publishes across the full spectrum of chemistry
and related sciences and in every print medium.
We’ve published more than
• 1.3 million research articles across more than 60
journals
• 100,000 news stories in award winning C&EN
magazine
• 35,000 book chapter across more than 1,600
books
• 1,000 references and standards in ACS Reagent
Chemicals
© 2021 American Chemical Society 5
Where were we starting from?
• In 2016 in partnership with CAS (a sister division of ACS) we
developed in initial Taxonomy for use with ACS Omega, our new a
multidisciplinary open access journal
• Content was indexed manually by CAS scientists during an article’s
production lifecycle
• Terms were available typically just in time for publication, for which
at the time was a relatively small set of content
• Assigned terms were uploaded to our delivery system where they
were displayed on the article page and used to provide a taxonomy-
driven navigation for the journal
© 2021 American Chemical Society 6
Where did we need to go?
• Needed a taxonomy that was more customized for ACS
Publication’s needs
• Classify all published content
• Be able to handle processing 60,000+ articles a year in a timely
fashion
• Integrate display into a newly redesigned website
• Lay the groundwork to allow for expanding opportunities for new
non-journal products
© 2021 American Chemical Society
SLIDE TITLES SHOULD NOT GO MORE THAN
TWO LINES IN LENGTH.
Lessons From
Building a Taxonomy
Infographic vector created by vectorjuice
© 2021 American Chemical Society 8
Lessons From Building a Taxonomy
• Gather information on best practices and others’ experiences
• Get agreement early on from all the business owners on the
requirements for building the taxonomy
– Content domain experts and UI/UX engineers may have differing views
of what the customer and product needs are; establish clear decision
making roles.
• Be aware of complications due to polyhierarchy
– Makes content discoverable under a subject area for which it may not
pertain
– For a publisher prospective authors may try to use it to justify why their
submitted article fits the scope of a journal
© 2021 American Chemical Society 9
Lessons From Building a Taxonomy
• Ensure enough time and budget to enable sufficient
collaboration between your taxonomy consultants and
your internal content subject matter experts
• Establish live documents for more interactive
collaboration
• Ensure random sampling of content still includes an
appropriate percentage of research content and high
value content
• Give more time to building content for customer
research
– Dependent on tools being used to facilitate customer
interaction
© 2021 American Chemical Society 10
Actions We Took
• Chose to have a “full taxonomy” and a “visible taxonomy”
– The full taxonomy was what was needed to accurately classify the
content
– The visible taxonomy is a subset of the full taxonomy, including only the
top levels and specific terms in those levels to display on our platforms
• Engaged in customer focus research testing two different visible
taxonomies
– Found in individual testing that testers didn’t have any real preference
on the structure (note final versions were not hugely dissimilar)
– Found in A|B Testing on our Platform that the data captured on user
interactions didn’t provide a unanimous customer preference
© 2021 American Chemical Society
Visible Taxonomy Display
© 2021 American Chemical Society
SLIDE TITLES SHOULD NOT GO MORE THAN
TWO LINES IN LENGTH.
Lessons From
Classifying Content
Infographic vector created by vectorjuice
© 2021 American Chemical Society 13
Lessons From Classifying Content
• If you have PDF content, evaluate as early as possible how accurate
automated classification of the content will be
– 120 years of PDF-only content caused issues on being able to
programmatically identify content consistently
– Common issue of skewed indexing results due to content from the
preceding and following articles as the content was generated from
scans of the original text
• Engage platform architects early to fully understand all existing
capabilities and limitations for applying and leveraging the terms
• Consider weighting the text of the article for more accurate results
– Example: Title (8), Abstract (8), Experimental Section (4)
© 2021 American Chemical Society 14
Actions We Took
• We developed an internal automated process to derive the visible
taxonomy from the full taxonomy by determining the top 5 terms
• Validation of indexing results at a granular and visible level
– Using internal Subject Matter Experts to ensure consistently
hitting 85% or better accuracy
– Using external customers to verify accuracy of terms displayed
with the article
• Created a process for making adjustments to the visible terms
applied to the content
© 2021 American Chemical Society
Thank You!
Michael Darr
IT Project Manager
Publications Production Operations
American Chemical Society
2540 Olentangy River Rd
Columbus, OH
mdarr@acs.org

More Related Content

What's hot

Publishing Scientific Research & How to Write High-Impact Research Papers
Publishing Scientific Research & How to Write High-Impact Research PapersPublishing Scientific Research & How to Write High-Impact Research Papers
Publishing Scientific Research & How to Write High-Impact Research Papers
jjuhlrich
 
Evaluating the Big Deal: Usage Statistics for Decision Making
Evaluating the Big Deal: Usage Statistics for Decision MakingEvaluating the Big Deal: Usage Statistics for Decision Making
Evaluating the Big Deal: Usage Statistics for Decision Making
Selena Killick
 
Scopus & SciVal Training for Researchers
Scopus & SciVal Training for ResearchersScopus & SciVal Training for Researchers
Scopus & SciVal Training for Researchers
Ciarán Quinn
 
Bringing Consistency to Digital Resource Evaluation
Bringing Consistency to Digital Resource EvaluationBringing Consistency to Digital Resource Evaluation
Bringing Consistency to Digital Resource Evaluation
Paula Weaver
 
Elsevier - Why Scopus
Elsevier - Why ScopusElsevier - Why Scopus
Elsevier - Why Scopus
b-on
 
Get a Grip on Your Chemical Inventory
Get a Grip on Your Chemical InventoryGet a Grip on Your Chemical Inventory
Get a Grip on Your Chemical Inventory
Triumvirate Environmental
 
The Benefits and Approach to Optimizing a Supply Chain
The Benefits and Approach to Optimizing a Supply Chain The Benefits and Approach to Optimizing a Supply Chain
The Benefits and Approach to Optimizing a Supply Chain
Jessica McCune
 
What does it mean to be an author?
What does it mean to be an author?What does it mean to be an author?
What does it mean to be an author?
SabahMoran
 
What Do Editors Do All Day? From Science to Publishing.
What Do Editors Do All Day? From Science to Publishing.What Do Editors Do All Day? From Science to Publishing.
What Do Editors Do All Day? From Science to Publishing.
jjuhlrich
 

What's hot (11)

Publishing Scientific Research & How to Write High-Impact Research Papers
Publishing Scientific Research & How to Write High-Impact Research PapersPublishing Scientific Research & How to Write High-Impact Research Papers
Publishing Scientific Research & How to Write High-Impact Research Papers
 
Evaluating the Big Deal: Usage Statistics for Decision Making
Evaluating the Big Deal: Usage Statistics for Decision MakingEvaluating the Big Deal: Usage Statistics for Decision Making
Evaluating the Big Deal: Usage Statistics for Decision Making
 
Mersman Resume
Mersman ResumeMersman Resume
Mersman Resume
 
Scopus & SciVal Training for Researchers
Scopus & SciVal Training for ResearchersScopus & SciVal Training for Researchers
Scopus & SciVal Training for Researchers
 
Bringing Consistency to Digital Resource Evaluation
Bringing Consistency to Digital Resource EvaluationBringing Consistency to Digital Resource Evaluation
Bringing Consistency to Digital Resource Evaluation
 
CCF SciVerse Update
CCF SciVerse UpdateCCF SciVerse Update
CCF SciVerse Update
 
Elsevier - Why Scopus
Elsevier - Why ScopusElsevier - Why Scopus
Elsevier - Why Scopus
 
Get a Grip on Your Chemical Inventory
Get a Grip on Your Chemical InventoryGet a Grip on Your Chemical Inventory
Get a Grip on Your Chemical Inventory
 
The Benefits and Approach to Optimizing a Supply Chain
The Benefits and Approach to Optimizing a Supply Chain The Benefits and Approach to Optimizing a Supply Chain
The Benefits and Approach to Optimizing a Supply Chain
 
What does it mean to be an author?
What does it mean to be an author?What does it mean to be an author?
What does it mean to be an author?
 
What Do Editors Do All Day? From Science to Publishing.
What Do Editors Do All Day? From Science to Publishing.What Do Editors Do All Day? From Science to Publishing.
What Do Editors Do All Day? From Science to Publishing.
 

Similar to Acs discoverability-dhug2021

A Practical Guide to Content Strategy in HE
A Practical Guide to Content Strategy in HEA Practical Guide to Content Strategy in HE
A Practical Guide to Content Strategy in HE
Clare Kennedy
 
UKSG 2018 Breakout - TERMS redefined: developing the combination of electroni...
UKSG 2018 Breakout - TERMS redefined: developing the combination of electroni...UKSG 2018 Breakout - TERMS redefined: developing the combination of electroni...
UKSG 2018 Breakout - TERMS redefined: developing the combination of electroni...
UKSG: connecting the knowledge community
 
UKSG webinar - TERMS revisited: developing the combination of electronic reso...
UKSG webinar - TERMS revisited: developing the combination of electronic reso...UKSG webinar - TERMS revisited: developing the combination of electronic reso...
UKSG webinar - TERMS revisited: developing the combination of electronic reso...
UKSG: connecting the knowledge community
 
Novinky u Elsevier: Citace, metriky, spolupráce
Novinky u Elsevier: Citace, metriky, spolupráceNovinky u Elsevier: Citace, metriky, spolupráce
Novinky u Elsevier: Citace, metriky, spolupráce
KnihovnaUTB
 
Henderson Balancing Rights and Reuse for Authors, Readers and Publishers
Henderson Balancing Rights and Reuse for Authors, Readers and PublishersHenderson Balancing Rights and Reuse for Authors, Readers and Publishers
Henderson Balancing Rights and Reuse for Authors, Readers and Publishers
National Information Standards Organization (NISO)
 
لتحليل الدراسات السابقة Nails محاضرة برنامج
  لتحليل الدراسات السابقة Nails محاضرة برنامج  لتحليل الدراسات السابقة Nails محاضرة برنامج
لتحليل الدراسات السابقة Nails محاضرة برنامج
مركز البحوث الأقسام العلمية
 
WEB240 Version 1 1 Course Syllabus College o.docx
 WEB240 Version 1 1 Course Syllabus College o.docx WEB240 Version 1 1 Course Syllabus College o.docx
WEB240 Version 1 1 Course Syllabus College o.docx
MARRY7
 
محاضرة برنامج Nails لتحليل الدراسات السابقة د.شروق المقرن
محاضرة برنامج Nails  لتحليل الدراسات السابقة د.شروق المقرنمحاضرة برنامج Nails  لتحليل الدراسات السابقة د.شروق المقرن
محاضرة برنامج Nails لتحليل الدراسات السابقة د.شروق المقرن
مركز البحوث الأقسام العلمية
 
Argumentative Research EssayAssignment DescriptionIn upper lev.docx
Argumentative Research EssayAssignment DescriptionIn upper lev.docxArgumentative Research EssayAssignment DescriptionIn upper lev.docx
Argumentative Research EssayAssignment DescriptionIn upper lev.docx
jewisonantone
 
11.m3 cms objectives
11.m3 cms objectives11.m3 cms objectives
11.m3 cms objectivestarensi
 
How to Write an Effective Technical Paper (1).pdf
How to Write an Effective Technical Paper (1).pdfHow to Write an Effective Technical Paper (1).pdf
How to Write an Effective Technical Paper (1).pdf
khalid khan
 
Evaluating Content Management Systems in Academic Libraries
Evaluating Content Management Systems in Academic LibrariesEvaluating Content Management Systems in Academic Libraries
Evaluating Content Management Systems in Academic Libraries
SIMAdmin
 
The Next Generation of Content Strategy: Omnichannel, Performance-Driven Cont...
The Next Generation of Content Strategy: Omnichannel, Performance-Driven Cont...The Next Generation of Content Strategy: Omnichannel, Performance-Driven Cont...
The Next Generation of Content Strategy: Omnichannel, Performance-Driven Cont...
Kevin Nichols
 
Content Management Case Study
Content Management Case StudyContent Management Case Study
Content Management Case Study
Jerald Burget
 
NISO Open Discovery Initiative January 2019
NISO Open Discovery Initiative January 2019NISO Open Discovery Initiative January 2019
NISO Open Discovery Initiative January 2019
National Information Standards Organization (NISO)
 
TM298 Operating systemsArab Open University Short.docx
TM298 Operating systemsArab Open University Short.docxTM298 Operating systemsArab Open University Short.docx
TM298 Operating systemsArab Open University Short.docx
juliennehar
 
Henderson The Central Role of Scholarly Societies in Preprints
Henderson The Central Role of Scholarly Societies in PreprintsHenderson The Central Role of Scholarly Societies in Preprints
Henderson The Central Role of Scholarly Societies in Preprints
National Information Standards Organization (NISO)
 
ASIDIC Spring 2010 Meeting Dwg
ASIDIC Spring 2010 Meeting   DwgASIDIC Spring 2010 Meeting   Dwg
ASIDIC Spring 2010 Meeting Dwg
Darrell W. Gunter
 
Discovery: Beyond Initial Implementation & Participation - and into Collabora...
Discovery: Beyond Initial Implementation & Participation - and into Collabora...Discovery: Beyond Initial Implementation & Participation - and into Collabora...
Discovery: Beyond Initial Implementation & Participation - and into Collabora...
Charleston Conference
 
This course requires use of the Microsoft Project 2010 (or later.docx
This course requires use of the Microsoft Project 2010 (or later.docxThis course requires use of the Microsoft Project 2010 (or later.docx
This course requires use of the Microsoft Project 2010 (or later.docx
christalgrieg
 

Similar to Acs discoverability-dhug2021 (20)

A Practical Guide to Content Strategy in HE
A Practical Guide to Content Strategy in HEA Practical Guide to Content Strategy in HE
A Practical Guide to Content Strategy in HE
 
UKSG 2018 Breakout - TERMS redefined: developing the combination of electroni...
UKSG 2018 Breakout - TERMS redefined: developing the combination of electroni...UKSG 2018 Breakout - TERMS redefined: developing the combination of electroni...
UKSG 2018 Breakout - TERMS redefined: developing the combination of electroni...
 
UKSG webinar - TERMS revisited: developing the combination of electronic reso...
UKSG webinar - TERMS revisited: developing the combination of electronic reso...UKSG webinar - TERMS revisited: developing the combination of electronic reso...
UKSG webinar - TERMS revisited: developing the combination of electronic reso...
 
Novinky u Elsevier: Citace, metriky, spolupráce
Novinky u Elsevier: Citace, metriky, spolupráceNovinky u Elsevier: Citace, metriky, spolupráce
Novinky u Elsevier: Citace, metriky, spolupráce
 
Henderson Balancing Rights and Reuse for Authors, Readers and Publishers
Henderson Balancing Rights and Reuse for Authors, Readers and PublishersHenderson Balancing Rights and Reuse for Authors, Readers and Publishers
Henderson Balancing Rights and Reuse for Authors, Readers and Publishers
 
لتحليل الدراسات السابقة Nails محاضرة برنامج
  لتحليل الدراسات السابقة Nails محاضرة برنامج  لتحليل الدراسات السابقة Nails محاضرة برنامج
لتحليل الدراسات السابقة Nails محاضرة برنامج
 
WEB240 Version 1 1 Course Syllabus College o.docx
 WEB240 Version 1 1 Course Syllabus College o.docx WEB240 Version 1 1 Course Syllabus College o.docx
WEB240 Version 1 1 Course Syllabus College o.docx
 
محاضرة برنامج Nails لتحليل الدراسات السابقة د.شروق المقرن
محاضرة برنامج Nails  لتحليل الدراسات السابقة د.شروق المقرنمحاضرة برنامج Nails  لتحليل الدراسات السابقة د.شروق المقرن
محاضرة برنامج Nails لتحليل الدراسات السابقة د.شروق المقرن
 
Argumentative Research EssayAssignment DescriptionIn upper lev.docx
Argumentative Research EssayAssignment DescriptionIn upper lev.docxArgumentative Research EssayAssignment DescriptionIn upper lev.docx
Argumentative Research EssayAssignment DescriptionIn upper lev.docx
 
11.m3 cms objectives
11.m3 cms objectives11.m3 cms objectives
11.m3 cms objectives
 
How to Write an Effective Technical Paper (1).pdf
How to Write an Effective Technical Paper (1).pdfHow to Write an Effective Technical Paper (1).pdf
How to Write an Effective Technical Paper (1).pdf
 
Evaluating Content Management Systems in Academic Libraries
Evaluating Content Management Systems in Academic LibrariesEvaluating Content Management Systems in Academic Libraries
Evaluating Content Management Systems in Academic Libraries
 
The Next Generation of Content Strategy: Omnichannel, Performance-Driven Cont...
The Next Generation of Content Strategy: Omnichannel, Performance-Driven Cont...The Next Generation of Content Strategy: Omnichannel, Performance-Driven Cont...
The Next Generation of Content Strategy: Omnichannel, Performance-Driven Cont...
 
Content Management Case Study
Content Management Case StudyContent Management Case Study
Content Management Case Study
 
NISO Open Discovery Initiative January 2019
NISO Open Discovery Initiative January 2019NISO Open Discovery Initiative January 2019
NISO Open Discovery Initiative January 2019
 
TM298 Operating systemsArab Open University Short.docx
TM298 Operating systemsArab Open University Short.docxTM298 Operating systemsArab Open University Short.docx
TM298 Operating systemsArab Open University Short.docx
 
Henderson The Central Role of Scholarly Societies in Preprints
Henderson The Central Role of Scholarly Societies in PreprintsHenderson The Central Role of Scholarly Societies in Preprints
Henderson The Central Role of Scholarly Societies in Preprints
 
ASIDIC Spring 2010 Meeting Dwg
ASIDIC Spring 2010 Meeting   DwgASIDIC Spring 2010 Meeting   Dwg
ASIDIC Spring 2010 Meeting Dwg
 
Discovery: Beyond Initial Implementation & Participation - and into Collabora...
Discovery: Beyond Initial Implementation & Participation - and into Collabora...Discovery: Beyond Initial Implementation & Participation - and into Collabora...
Discovery: Beyond Initial Implementation & Participation - and into Collabora...
 
This course requires use of the Microsoft Project 2010 (or later.docx
This course requires use of the Microsoft Project 2010 (or later.docxThis course requires use of the Microsoft Project 2010 (or later.docx
This course requires use of the Microsoft Project 2010 (or later.docx
 

More from Access Innovations, Inc.

Supercharge your AI - SSP Industry Breakout Session 2024-v2_1.pdf
Supercharge your AI - SSP Industry Breakout Session 2024-v2_1.pdfSupercharge your AI - SSP Industry Breakout Session 2024-v2_1.pdf
Supercharge your AI - SSP Industry Breakout Session 2024-v2_1.pdf
Access Innovations, Inc.
 
Eureka, I found it! - Special Libraries Association 2021 Presentation
Eureka, I found it! - Special Libraries Association 2021 PresentationEureka, I found it! - Special Libraries Association 2021 Presentation
Eureka, I found it! - Special Libraries Association 2021 Presentation
Access Innovations, Inc.
 
Making AI Behave: Using Knowledge Domains to Produce Useful, Trustworthy Results
Making AI Behave: Using Knowledge Domains to Produce Useful, Trustworthy ResultsMaking AI Behave: Using Knowledge Domains to Produce Useful, Trustworthy Results
Making AI Behave: Using Knowledge Domains to Produce Useful, Trustworthy Results
Access Innovations, Inc.
 
ISO 25964-1Working Group ISO/TC 46/SC 9/WG 8
ISO 25964-1Working Group ISO/TC 46/SC 9/WG 8ISO 25964-1Working Group ISO/TC 46/SC 9/WG 8
ISO 25964-1Working Group ISO/TC 46/SC 9/WG 8
Access Innovations, Inc.
 
Smart submit
Smart submitSmart submit
Plos taxonomy beyond search dhug 2021
Plos taxonomy beyond search   dhug 2021Plos taxonomy beyond search   dhug 2021
Plos taxonomy beyond search dhug 2021
Access Innovations, Inc.
 
Hindawi taxonomy and personalization 27.10 (1)
Hindawi taxonomy and personalization 27.10 (1)Hindawi taxonomy and personalization 27.10 (1)
Hindawi taxonomy and personalization 27.10 (1)
Access Innovations, Inc.
 
Data harmonycloudpowerpointclientfacing
Data harmonycloudpowerpointclientfacingData harmonycloudpowerpointclientfacing
Data harmonycloudpowerpointclientfacing
Access Innovations, Inc.
 
Data harmony update 2021
Data harmony update 2021 Data harmony update 2021
Data harmony update 2021
Access Innovations, Inc.
 
Atypon dhug2021
Atypon dhug2021Atypon dhug2021
Atypon dhug2021
Access Innovations, Inc.
 
Asce more than just topic taxonomies
Asce more than just topic taxonomiesAsce more than just topic taxonomies
Asce more than just topic taxonomies
Access Innovations, Inc.
 
Ai webinar 2 -what's in a name (consolidated pdf)
Ai webinar 2 -what's in a name (consolidated pdf)Ai webinar 2 -what's in a name (consolidated pdf)
Ai webinar 2 -what's in a name (consolidated pdf)
Access Innovations, Inc.
 
Tagging overview - Why Keywords Don't Cut It
Tagging overview  - Why Keywords Don't Cut ItTagging overview  - Why Keywords Don't Cut It
Tagging overview - Why Keywords Don't Cut It
Access Innovations, Inc.
 
Health Affairs - Why Keywords Don't Cut It
Health Affairs - Why Keywords Don't Cut ItHealth Affairs - Why Keywords Don't Cut It
Health Affairs - Why Keywords Don't Cut It
Access Innovations, Inc.
 
Why Keywords Don't Cut It
Why Keywords Don't Cut ItWhy Keywords Don't Cut It
Why Keywords Don't Cut It
Access Innovations, Inc.
 
Data Harmony update 2020 final
Data Harmony update 2020 finalData Harmony update 2020 final
Data Harmony update 2020 final
Access Innovations, Inc.
 
Data Harmony Update 2020 final
Data Harmony Update 2020 finalData Harmony Update 2020 final
Data Harmony Update 2020 final
Access Innovations, Inc.
 
DHUG 2018: Towards Web-Centric Repository Interoperability
DHUG 2018: Towards Web-Centric Repository InteroperabilityDHUG 2018: Towards Web-Centric Repository Interoperability
DHUG 2018: Towards Web-Centric Repository Interoperability
Access Innovations, Inc.
 
DHUG 2018 - Florida Thesis OCR
DHUG 2018 - Florida Thesis OCRDHUG 2018 - Florida Thesis OCR
DHUG 2018 - Florida Thesis OCR
Access Innovations, Inc.
 
DHUG 2017 - Understanding ROI Just Enough to Get Your Project Funded
DHUG 2017 - Understanding ROI Just Enough to Get Your Project FundedDHUG 2017 - Understanding ROI Just Enough to Get Your Project Funded
DHUG 2017 - Understanding ROI Just Enough to Get Your Project Funded
Access Innovations, Inc.
 

More from Access Innovations, Inc. (20)

Supercharge your AI - SSP Industry Breakout Session 2024-v2_1.pdf
Supercharge your AI - SSP Industry Breakout Session 2024-v2_1.pdfSupercharge your AI - SSP Industry Breakout Session 2024-v2_1.pdf
Supercharge your AI - SSP Industry Breakout Session 2024-v2_1.pdf
 
Eureka, I found it! - Special Libraries Association 2021 Presentation
Eureka, I found it! - Special Libraries Association 2021 PresentationEureka, I found it! - Special Libraries Association 2021 Presentation
Eureka, I found it! - Special Libraries Association 2021 Presentation
 
Making AI Behave: Using Knowledge Domains to Produce Useful, Trustworthy Results
Making AI Behave: Using Knowledge Domains to Produce Useful, Trustworthy ResultsMaking AI Behave: Using Knowledge Domains to Produce Useful, Trustworthy Results
Making AI Behave: Using Knowledge Domains to Produce Useful, Trustworthy Results
 
ISO 25964-1Working Group ISO/TC 46/SC 9/WG 8
ISO 25964-1Working Group ISO/TC 46/SC 9/WG 8ISO 25964-1Working Group ISO/TC 46/SC 9/WG 8
ISO 25964-1Working Group ISO/TC 46/SC 9/WG 8
 
Smart submit
Smart submitSmart submit
Smart submit
 
Plos taxonomy beyond search dhug 2021
Plos taxonomy beyond search   dhug 2021Plos taxonomy beyond search   dhug 2021
Plos taxonomy beyond search dhug 2021
 
Hindawi taxonomy and personalization 27.10 (1)
Hindawi taxonomy and personalization 27.10 (1)Hindawi taxonomy and personalization 27.10 (1)
Hindawi taxonomy and personalization 27.10 (1)
 
Data harmonycloudpowerpointclientfacing
Data harmonycloudpowerpointclientfacingData harmonycloudpowerpointclientfacing
Data harmonycloudpowerpointclientfacing
 
Data harmony update 2021
Data harmony update 2021 Data harmony update 2021
Data harmony update 2021
 
Atypon dhug2021
Atypon dhug2021Atypon dhug2021
Atypon dhug2021
 
Asce more than just topic taxonomies
Asce more than just topic taxonomiesAsce more than just topic taxonomies
Asce more than just topic taxonomies
 
Ai webinar 2 -what's in a name (consolidated pdf)
Ai webinar 2 -what's in a name (consolidated pdf)Ai webinar 2 -what's in a name (consolidated pdf)
Ai webinar 2 -what's in a name (consolidated pdf)
 
Tagging overview - Why Keywords Don't Cut It
Tagging overview  - Why Keywords Don't Cut ItTagging overview  - Why Keywords Don't Cut It
Tagging overview - Why Keywords Don't Cut It
 
Health Affairs - Why Keywords Don't Cut It
Health Affairs - Why Keywords Don't Cut ItHealth Affairs - Why Keywords Don't Cut It
Health Affairs - Why Keywords Don't Cut It
 
Why Keywords Don't Cut It
Why Keywords Don't Cut ItWhy Keywords Don't Cut It
Why Keywords Don't Cut It
 
Data Harmony update 2020 final
Data Harmony update 2020 finalData Harmony update 2020 final
Data Harmony update 2020 final
 
Data Harmony Update 2020 final
Data Harmony Update 2020 finalData Harmony Update 2020 final
Data Harmony Update 2020 final
 
DHUG 2018: Towards Web-Centric Repository Interoperability
DHUG 2018: Towards Web-Centric Repository InteroperabilityDHUG 2018: Towards Web-Centric Repository Interoperability
DHUG 2018: Towards Web-Centric Repository Interoperability
 
DHUG 2018 - Florida Thesis OCR
DHUG 2018 - Florida Thesis OCRDHUG 2018 - Florida Thesis OCR
DHUG 2018 - Florida Thesis OCR
 
DHUG 2017 - Understanding ROI Just Enough to Get Your Project Funded
DHUG 2017 - Understanding ROI Just Enough to Get Your Project FundedDHUG 2017 - Understanding ROI Just Enough to Get Your Project Funded
DHUG 2017 - Understanding ROI Just Enough to Get Your Project Funded
 

Recently uploaded

一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
ukgaet
 
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
slg6lamcq
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP
 
Opendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptxOpendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptx
Opendatabay
 
一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单
enxupq
 
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdfSample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Linda486226
 
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
axoqas
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP
 
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
mbawufebxi
 
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
ewymefz
 
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
yhkoc
 
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
John Andrews
 
Empowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptxEmpowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptx
benishzehra469
 
一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单
ewymefz
 
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
ewymefz
 
一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单
ocavb
 
Adjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTESAdjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTES
Subhajit Sahu
 
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
v3tuleee
 
Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)
TravisMalana
 
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
nscud
 

Recently uploaded (20)

一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
 
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
 
Opendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptxOpendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptx
 
一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单
 
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdfSample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
 
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
 
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
 
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
 
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
 
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
 
Empowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptxEmpowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptx
 
一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单
 
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
 
一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单
 
Adjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTESAdjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTES
 
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
 
Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)
 
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
 

Acs discoverability-dhug2021

  • 1.
  • 2. American Chemical Society Lessons Learned From Building a Taxonomy and Indexing 140+ years of Content Michael Darr Columbus, OH DHUG 2021 February 10
  • 3. © 2021 American Chemical Society Who is the American Chemical Society? A non-profit scientific organization with more than 140 years’ experience, we are a champion for chemistry, its practitioners and our global community of members. ACS Family: ACS Publications, C&EN news, CAS, AACT (American Association of Chemistry Teachers) ACS Publications is recognized as a leading publisher of authoritative scientific information. Our 60+ peer-reviewed journals are ranked the “most-trusted, most-cited and most-read”.
  • 4. © 2021 American Chemical Society ACS Publications Products ACS publishes across the full spectrum of chemistry and related sciences and in every print medium. We’ve published more than • 1.3 million research articles across more than 60 journals • 100,000 news stories in award winning C&EN magazine • 35,000 book chapter across more than 1,600 books • 1,000 references and standards in ACS Reagent Chemicals
  • 5. © 2021 American Chemical Society 5 Where were we starting from? • In 2016 in partnership with CAS (a sister division of ACS) we developed in initial Taxonomy for use with ACS Omega, our new a multidisciplinary open access journal • Content was indexed manually by CAS scientists during an article’s production lifecycle • Terms were available typically just in time for publication, for which at the time was a relatively small set of content • Assigned terms were uploaded to our delivery system where they were displayed on the article page and used to provide a taxonomy- driven navigation for the journal
  • 6. © 2021 American Chemical Society 6 Where did we need to go? • Needed a taxonomy that was more customized for ACS Publication’s needs • Classify all published content • Be able to handle processing 60,000+ articles a year in a timely fashion • Integrate display into a newly redesigned website • Lay the groundwork to allow for expanding opportunities for new non-journal products
  • 7. © 2021 American Chemical Society SLIDE TITLES SHOULD NOT GO MORE THAN TWO LINES IN LENGTH. Lessons From Building a Taxonomy Infographic vector created by vectorjuice
  • 8. © 2021 American Chemical Society 8 Lessons From Building a Taxonomy • Gather information on best practices and others’ experiences • Get agreement early on from all the business owners on the requirements for building the taxonomy – Content domain experts and UI/UX engineers may have differing views of what the customer and product needs are; establish clear decision making roles. • Be aware of complications due to polyhierarchy – Makes content discoverable under a subject area for which it may not pertain – For a publisher prospective authors may try to use it to justify why their submitted article fits the scope of a journal
  • 9. © 2021 American Chemical Society 9 Lessons From Building a Taxonomy • Ensure enough time and budget to enable sufficient collaboration between your taxonomy consultants and your internal content subject matter experts • Establish live documents for more interactive collaboration • Ensure random sampling of content still includes an appropriate percentage of research content and high value content • Give more time to building content for customer research – Dependent on tools being used to facilitate customer interaction
  • 10. © 2021 American Chemical Society 10 Actions We Took • Chose to have a “full taxonomy” and a “visible taxonomy” – The full taxonomy was what was needed to accurately classify the content – The visible taxonomy is a subset of the full taxonomy, including only the top levels and specific terms in those levels to display on our platforms • Engaged in customer focus research testing two different visible taxonomies – Found in individual testing that testers didn’t have any real preference on the structure (note final versions were not hugely dissimilar) – Found in A|B Testing on our Platform that the data captured on user interactions didn’t provide a unanimous customer preference
  • 11. © 2021 American Chemical Society Visible Taxonomy Display
  • 12. © 2021 American Chemical Society SLIDE TITLES SHOULD NOT GO MORE THAN TWO LINES IN LENGTH. Lessons From Classifying Content Infographic vector created by vectorjuice
  • 13. © 2021 American Chemical Society 13 Lessons From Classifying Content • If you have PDF content, evaluate as early as possible how accurate automated classification of the content will be – 120 years of PDF-only content caused issues on being able to programmatically identify content consistently – Common issue of skewed indexing results due to content from the preceding and following articles as the content was generated from scans of the original text • Engage platform architects early to fully understand all existing capabilities and limitations for applying and leveraging the terms • Consider weighting the text of the article for more accurate results – Example: Title (8), Abstract (8), Experimental Section (4)
  • 14. © 2021 American Chemical Society 14 Actions We Took • We developed an internal automated process to derive the visible taxonomy from the full taxonomy by determining the top 5 terms • Validation of indexing results at a granular and visible level – Using internal Subject Matter Experts to ensure consistently hitting 85% or better accuracy – Using external customers to verify accuracy of terms displayed with the article • Created a process for making adjustments to the visible terms applied to the content
  • 15. © 2021 American Chemical Society Thank You! Michael Darr IT Project Manager Publications Production Operations American Chemical Society 2540 Olentangy River Rd Columbus, OH mdarr@acs.org

Editor's Notes

  1. Note we still clash on whether Subject Areas should be organized alphabetically or by article count